The query sequence for this search has been filtered. Filtering
eliminates low complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the blast reports, leaving regions whose blast statistics
reflect the specificity of their pairwise alignment.

BLASTX 2.1.1 [Aug-8-2000]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Contig20.seq Contig20
         (877 letters)

Database: nr
           565,281 sequences; 177,575,912 total letters


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAB91806.1|  (AL356192) conserved hypothetical protein [...    67  2e-10
gb|AAF96859.1|  (AE004423) hypothetical protein [Vibrio chol...    35  1.3
pir||T05197  hypothetical protein F4I10.100 - Arabidopsis th...    33  3.8

>emb|CAB91806.1| (AL356192) conserved hypothetical protein [Neurospora crassa]
          Length = 498

 Score = 67.1 bits (161), Expect = 2e-10
 Identities = 35/73 (47%), Positives = 46/73 (62%)
 Frame = -1

Query: 679 GYASSDGFPASGYATHYALPSLSQQQAHLYREKALFWGTVGSFAVSFLLILVAGTLISTG 500
           GYASS+ FP SGY       SL QQ+   +R++ LFWGT G + V+F+L+ +A  LI TG
Sbjct: 368 GYASSEEFPESGYR------SLEQQRIARHRDETLFWGTWGCYLVTFVLMGIATMLIMTG 421

Query: 499 RHKLRAEVDAGVT 461
           RHK+R E     T
Sbjct: 422 RHKMRMEARVSST 434
>gb|AAF96859.1| (AE004423) hypothetical protein [Vibrio cholerae]
          Length = 307

 Score = 34.8 bits (78), Expect = 1.3
 Identities = 32/100 (32%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
 Frame = -1

Query: 577 LFWGTVGSFAVSFL-LILVAGTLISTGRHKLRAEVDAGVTVGVVASLFTSCSALGMTLYR 401
           L W TV S  + F+ ++++AG   +TG  + R E+ AG++ G    LF + S L  +LYR
Sbjct: 118 LAWRTVLSVCIIFVGVVMLAG---NTGSIEWRPELLAGLSAG----LFNAGSQL--SLYR 168

Query: 400 HDQLSLPHLMAVWTTFIASCILNGMLLILVVGNSP*YPWAT 278
             Q  +  +     TF+ + +L   LL+LV       PW++
Sbjct: 169 AAQSDMRSIEIHGWTFLVAALLLSPLLLLV-------PWSS 202
>pir||T05197 hypothetical protein F4I10.100 - Arabidopsis thaliana
 emb|CAB36791.1| (AL035525) putative protein [Arabidopsis thaliana]
 emb|CAB80034.1| (AL161583) putative protein [Arabidopsis thaliana]
          Length = 990

 Score = 33.2 bits (74), Expect = 3.8
 Identities = 19/71 (26%), Positives = 34/71 (47%)
 Frame = -1

Query: 541 FLLILVAGTLISTGRHKLRAEVDAGVTVGVVASLFTSCSALGMTLYRHDQLSLPHLMAVW 362
           FL  +  G  +     K   ++D  V+ G++  ++  C  +    +  D + +P  +A W
Sbjct: 529 FLFAINQGKQVHAYAIKSGYDLDLWVSSGIL-DMYVKCGDMSAAQFAFDSIPVPDDVA-W 586

Query: 361 TTFIASCILNG 329
           TT I+ CI NG
Sbjct: 587 TTMISGCIENG 597
Database: nr Posted date: Sep 29, 2000 9:53 PM Number of letters in database: 177,575,912 Number of sequences in database: 565,281 Lambda K H 0.318 0.135 0.00 Gapped Lambda K H 0.270 0.0470 4.94e-324 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 291675272 Number of Sequences: 565281 Number of extensions: 6224318 Number of successful extensions: 13954 Number of sequences better than 10.0: 6 Number of HSP's better than 10.0 without gapping: 1 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 13952 Number of HSP's gapped (non-prelim): 3 length of query: 292 length of database: 177,575,912 effective HSP length: 54 effective length of query: 237 effective length of database: 147,050,738 effective search space: 34851024906 effective search space used: 34851024906 frameshift window, decay const: 50, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.7 bits) S2: 71 (32.1 bits)