The query sequence for this search has been filtered. Filtering
eliminates low complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the blast reports, leaving regions whose blast statistics
reflect the specificity of their pairwise alignment.
BLASTX 2.2.1 [Apr-13-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Fgr-S_9_F06_T7.seq Fgr-S_9_F06_T7 0 0 0 1 420
(420 letters)
Database: nr
722,277 sequences; 228,742,487 total letters
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAB03117.1| (AP002040) gene_id:MEC18.18~unknown protein... 54 3e-07
dbj|BAB37068.1| (AP002562) putative enzyme [Escherichia col... 31 2.1
gb|AAG57898.1|AE005506_6 (AE005506) putative enzyme [Escher... 31 2.1
pir||T00797 hypothetical protein F24L7.15 - Arabidopsis tha... 31 2.1
dbj|BAB26289.1| (AK009440) putative [Mus musculus] 30 2.7
pir||E72240 hypothetical protein TM1562 - Thermotoga mariti... 30 3.5
gb|AAF59429.2| (AC006723) Hypothetical protein Y19D10B.2 [C... 30 4.6
pir||T25120 hypothetical protein T22C8.7 - Caenorhabditis e... 29 6.0
emb|CAC30265.1| (AL583919) probable DNA methylase [Mycobact... 29 7.9
pir||T18422 hypothetical protein C0145c - malaria parasite ... 29 7.9
pir||T23183 hypothetical protein K01D12.5 - Caenorhabditis ... 29 7.9
>dbj|BAB03117.1| (AP002040) gene_id:MEC18.18~unknown protein [Arabidopsis thaliana]
gb|AAG51059.1|AC069473_21 (AC069473) unknown protein; 42843-40829 [Arabidopsis thaliana]
Length = 360
Score = 53.5 bits (127), Expect = 3e-07
Identities = 22/28 (78%), Positives = 27/28 (95%)
Frame = +2
Query: 38 NSTVVENTEKGWKELIFQRIRAVFGFGV 121
N+TVVENTE+GW++LIF RIRAVFGFG+
Sbjct: 333 NATVVENTERGWRDLIFHRIRAVFGFGI 360
>dbj|BAB37068.1| (AP002562) putative enzyme [Escherichia coli O157:H7]
Length = 433
Score = 30.8 bits (68), Expect = 2.1
Identities = 32/114 (28%), Positives = 60/114 (52%), Gaps = 10/114 (8%)
Frame = +3
Query: 12 AEVRKTGMGTQPWWRTPRKAGKSSSSRGYVQCSVLGSEKASLRSI-FNEVDPAFSWLSMF 188
+E+ +T G PW+ + + S R ++Q + ++K R++ + EV+P L +F
Sbjct: 236 SEILETVSGEMPWYDS-NGLRLTFSPRDFIQVNAGVNQKMVARALEWLEVEPEDRVLDLF 294
Query: 189 CCLSLFDADRHTKLSELPGIEKYVLPCIHHAGL------CFEVVT--HESLSL*LM*RP- 341
C + F T+ + + G+E +P + G C + VT HE+L + +P
Sbjct: 295 CGMGNFTLPLATQAASVVGVEG--VPALVEKGQQNARLNCLQNVTFYHENLEEDVTKQPW 352
Query: 342 AKDG 353
AK+G
Sbjct: 353 AKNG 356
>gb|AAG57898.1|AE005506_6 (AE005506) putative enzyme [Escherichia coli O157:H7 EDL933]
Length = 433
Score = 30.8 bits (68), Expect = 2.1
Identities = 32/114 (28%), Positives = 60/114 (52%), Gaps = 10/114 (8%)
Frame = +3
Query: 12 AEVRKTGMGTQPWWRTPRKAGKSSSSRGYVQCSVLGSEKASLRSI-FNEVDPAFSWLSMF 188
+E+ +T G PW+ + + S R ++Q + ++K R++ + EV+P L +F
Sbjct: 236 SEILETVSGEMPWYDS-NGLRLTFSPRDFIQVNAGVNQKMVARALEWLEVEPEDRVLDLF 294
Query: 189 CCLSLFDADRHTKLSELPGIEKYVLPCIHHAGL------CFEVVT--HESLSL*LM*RP- 341
C + F T+ + + G+E +P + G C + VT HE+L + +P
Sbjct: 295 CGMGNFTLPLATQAASVVGVEG--VPALVEKGQQNARLNCLQNVTFYHENLEEDVTKQPW 352
Query: 342 AKDG 353
AK+G
Sbjct: 353 AKNG 356
>pir||T00797 hypothetical protein F24L7.15 - Arabidopsis thaliana
gb|AAC04492.1| (AC003974) unknown protein [Arabidopsis thaliana]
Length = 327
Score = 30.8 bits (68), Expect = 2.1
Identities = 22/80 (27%), Positives = 37/80 (45%), Gaps = 1/80 (1%)
Frame = -3
Query: 301 TTSKHNPA**IQGNTYFSIPGSSLSF-VCLSASNKLRQQNMDSHEKAGSTSLKMDLKDAF 125
TT + P I+ + PGSS +C+S SN+ SH + +T ++ + F
Sbjct: 197 TTRESTPCSLIRRPEIMTTPGSSTKLNICVSESNQREDSLSRSHRRRPTTP---EMDEFF 253
Query: 124 SDPKTEHCTYPLEDELFPAFL 62
S + E +E +FP F+
Sbjct: 254 SGAEEEQQKQFIEKYVFPRFI 274
>dbj|BAB26289.1| (AK009440) putative [Mus musculus]
Length = 362
Score = 30.4 bits (67), Expect = 2.7
Identities = 14/38 (36%), Positives = 19/38 (49%)
Frame = +3
Query: 6 VAAEVRKTGMGTQPWWRTPRKAGKSSSSRGYVQCSVLG 119
V + +KT PW R+ G++SS V C VLG
Sbjct: 292 VKEDTQKTPTSASPWGEKDREKGRASSDPASVLCIVLG 329
>pir||E72240 hypothetical protein TM1562 - Thermotoga maritima (strain MSB8)
gb|AAD36628.1|AE001801_15 (AE001801) hypothetical protein [Thermotoga maritima]
Length = 266
Score = 30.0 bits (66), Expect = 3.5
Identities = 14/31 (45%), Positives = 19/31 (61%)
Frame = -1
Query: 129 LFQTPKPNTARILWKMSSFQPFSVFSTTVEF 37
LFQTP PN A +L+ S S + TT+E+
Sbjct: 67 LFQTPAPNIAGVLYLKRSTLTESSYRTTIEY 97
>gb|AAF59429.2| (AC006723) Hypothetical protein Y19D10B.2 [Caenorhabditis elegans]
Length = 534
Score = 29.6 bits (65), Expect = 4.6
Identities = 13/41 (31%), Positives = 24/41 (57%)
Frame = +1
Query: 193 VLVYSMLIDTQNLVNFRELKNTCYLVFITLGCALKSSLMKV 315
++ S++I+ NL NF L+N Y+ +T C L + ++ V
Sbjct: 492 IIFGSLIINGTNLTNFNFLENLKYVAQLTSSCVLHALILMV 532
>pir||T25120 hypothetical protein T22C8.7 - Caenorhabditis elegans
emb|CAA88879.1| (Z49071) N-terminal portion of this predicted gene show strong
similarity to C. elegans cuticulin [Caenorhabditis
elegans]
Length = 609
Score = 29.3 bits (64), Expect = 6.0
Identities = 12/27 (44%), Positives = 15/27 (55%)
Frame = -1
Query: 114 KPNTARILWKMSSFQPFSVFSTTVEFP 34
KP T+ LW+ +SF P S F FP
Sbjct: 390 KPRTSPFLWRNNSFSPSSKFRNGYNFP 416
>emb|CAC30265.1| (AL583919) probable DNA methylase [Mycobacterium leprae]
Length = 555
Score = 28.9 bits (63), Expect = 7.9
Identities = 13/35 (37%), Positives = 20/35 (57%)
Frame = +2
Query: 281 WVVL*SRHS*KFELVTHVEAGERWSLSDYYFCSLR 385
W++ H LVTH++AG+R + Y CS+R
Sbjct: 352 WLLDAPHHPTDSALVTHIDAGKRAGVDKGYKCSIR 386
>pir||T18422 hypothetical protein C0145c - malaria parasite (Plasmodium
falciparum)
emb|CAB10576.1| (Z97348) PFC0145c (MAL3P1.11), Hypothetical protein, len: 1441 aa,
predicted using hexExon [Plasmodium falciparum]
Length = 1441
Score = 28.9 bits (63), Expect = 7.9
Identities = 16/44 (36%), Positives = 25/44 (56%)
Frame = -1
Query: 399 TMSYRRKEQK**SLSDHLSPAST*VTSSNFHE*RLQSTTQRDEY 268
+MS++ +E+K LS S+ +SS+F+ L T QRD Y
Sbjct: 755 SMSFQNEEEKENEKKRRLSNVSSYTSSSSFNSYNLSGTHQRDGY 798
>pir||T23183 hypothetical protein K01D12.5 - Caenorhabditis elegans
emb|CAA99872.1| (Z75543) K01D12.5 [Caenorhabditis elegans]
Length = 285
Score = 28.9 bits (63), Expect = 7.9
Identities = 15/54 (27%), Positives = 28/54 (51%), Gaps = 5/54 (9%)
Frame = -2
Query: 386 DARNKNNSHSATIFRRPLHESQAQT-----FMSDDFKAQPSVMNTR*HVFFNSRKFTKF 225
D + + SH +T+ P+H S+ QT + + +PSV+ V SR + ++
Sbjct: 25 DGLSSHQSHQSTVTSAPIHVSEQQTSRNRPYNQPQLQQKPSVLTQSYQVLPESRNYYQY 83
Database: nr
Posted date: Jul 23, 2001 8:52 PM
Number of letters in database: 228,742,487
Number of sequences in database: 722,277
Lambda K H
0.318 0.135 0.00
Gapped
Lambda K H
0.267 0.0410 4.94e-324
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 181,361,986
Number of Sequences: 722277
Number of extensions: 3411167
Number of successful extensions: 8243
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 8118
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 8242
length of database: 228,742,487
effective HSP length: 115
effective length of database: 145,680,632
effective search space used: 3496335168
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)