The query sequence for this search has been filtered. Filtering
eliminates low complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the blast reports, leaving regions whose blast statistics
reflect the specificity of their pairwise alignment.
BLASTX 2.1.1 [Aug-8-2000]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Contig20.seq Contig20
(877 letters)
Database: nr
565,281 sequences; 177,575,912 total letters
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB91806.1| (AL356192) conserved hypothetical protein [... 67 2e-10
gb|AAF96859.1| (AE004423) hypothetical protein [Vibrio chol... 35 1.3
pir||T05197 hypothetical protein F4I10.100 - Arabidopsis th... 33 3.8
>emb|CAB91806.1| (AL356192) conserved hypothetical protein [Neurospora crassa]
Length = 498
Score = 67.1 bits (161), Expect = 2e-10
Identities = 35/73 (47%), Positives = 46/73 (62%)
Frame = -1
Query: 679 GYASSDGFPASGYATHYALPSLSQQQAHLYREKALFWGTVGSFAVSFLLILVAGTLISTG 500
GYASS+ FP SGY SL QQ+ +R++ LFWGT G + V+F+L+ +A LI TG
Sbjct: 368 GYASSEEFPESGYR------SLEQQRIARHRDETLFWGTWGCYLVTFVLMGIATMLIMTG 421
Query: 499 RHKLRAEVDAGVT 461
RHK+R E T
Sbjct: 422 RHKMRMEARVSST 434
>gb|AAF96859.1| (AE004423) hypothetical protein [Vibrio cholerae]
Length = 307
Score = 34.8 bits (78), Expect = 1.3
Identities = 32/100 (32%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Frame = -1
Query: 577 LFWGTVGSFAVSFL-LILVAGTLISTGRHKLRAEVDAGVTVGVVASLFTSCSALGMTLYR 401
L W TV S + F+ ++++AG +TG + R E+ AG++ G LF + S L +LYR
Sbjct: 118 LAWRTVLSVCIIFVGVVMLAG---NTGSIEWRPELLAGLSAG----LFNAGSQL--SLYR 168
Query: 400 HDQLSLPHLMAVWTTFIASCILNGMLLILVVGNSP*YPWAT 278
Q + + TF+ + +L LL+LV PW++
Sbjct: 169 AAQSDMRSIEIHGWTFLVAALLLSPLLLLV-------PWSS 202
>pir||T05197 hypothetical protein F4I10.100 - Arabidopsis thaliana
emb|CAB36791.1| (AL035525) putative protein [Arabidopsis thaliana]
emb|CAB80034.1| (AL161583) putative protein [Arabidopsis thaliana]
Length = 990
Score = 33.2 bits (74), Expect = 3.8
Identities = 19/71 (26%), Positives = 34/71 (47%)
Frame = -1
Query: 541 FLLILVAGTLISTGRHKLRAEVDAGVTVGVVASLFTSCSALGMTLYRHDQLSLPHLMAVW 362
FL + G + K ++D V+ G++ ++ C + + D + +P +A W
Sbjct: 529 FLFAINQGKQVHAYAIKSGYDLDLWVSSGIL-DMYVKCGDMSAAQFAFDSIPVPDDVA-W 586
Query: 361 TTFIASCILNG 329
TT I+ CI NG
Sbjct: 587 TTMISGCIENG 597
Database: nr
Posted date: Sep 29, 2000 9:53 PM
Number of letters in database: 177,575,912
Number of sequences in database: 565,281
Lambda K H
0.318 0.135 0.00
Gapped
Lambda K H
0.270 0.0470 4.94e-324
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 291675272
Number of Sequences: 565281
Number of extensions: 6224318
Number of successful extensions: 13954
Number of sequences better than 10.0: 6
Number of HSP's better than 10.0 without gapping: 1
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 13952
Number of HSP's gapped (non-prelim): 3
length of query: 292
length of database: 177,575,912
effective HSP length: 54
effective length of query: 237
effective length of database: 147,050,738
effective search space: 34851024906
effective search space used: 34851024906
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 71 (32.1 bits)