The query sequence for this search has been filtered. Filtering
eliminates low-complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the BLAST reports, leaving regions whose BLAST statistics
reflect the specificity of their pairwise alignment.

BLASTX 2.1.1 [Aug-8-2000]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Contig629.seq Contig629
         (1437 letters)

Database: nr
           565,281 sequences; 177,575,912 total letters


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAA19972.1|  (AB003498) subtilisin-like protease [Pneumo...    37  0.46
dbj|BAA19973.1|  (AB003499) subtilisin-like protease 3 [Pneu...    37  0.46
dbj|BAA96299.1|  (AB035548) Orf5 [Streptomyces virginiae]          36  0.79
pir||A34615  profilaggrin - rat (fragment) >gi|204144|gb|AAA...    36  1.0
gb|AAF47658.1|  (AE003475) CG16973 gene product [Drosophila ...    36  1.0
pir||JC6316  probable protein kinase (EC 2.7.1.-) - fruit fl...    35  1.8
pir||S51306  G-box binding factor 1A - rape >gi|7488453|pir|...    34  5.2
sp|P50185|MTD5_DACSA  MODIFICATION METHYLASE DSAV (CYTOSINE-...    33  6.8
dbj|BAA85429.1|  (AP000616) hypothetical protein [Oryza sativa]    33  9.0
sp|O14640|DVL1_HUMAN  SEGMENT POLARITY PROTEIN DISHEVELLED H...    33  9.0
gb|AAB06574.1|  (L81125) monooxygenase subunit [Pseudomonas ...    33  9.0

>dbj|BAA19972.1| (AB003498) subtilisin-like protease [Pneumocystis carinii]
          Length = 258

 Score = 37.1 bits (84), Expect = 0.46
 Identities = 21/51 (41%), Positives = 28/51 (54%)
 Frame = +2

Query: 341 DNGLDYSNHARGGDHQAYGSHDVGSRDIQGPEKPPRPDGQSHGIAHANEEA 493
           DN LDY+N     ++ + GS+D  S+D     KP  PD +SHG   A E A
Sbjct: 160 DNALDYTNEDLAPNYNSEGSYDFDSKDFD--PKPDNPD-ESHGTKCAGEVA 207
>dbj|BAA19973.1| (AB003499) subtilisin-like protease 3 [Pneumocystis carinii]
          Length = 520

 Score = 37.1 bits (84), Expect = 0.46
 Identities = 21/51 (41%), Positives = 28/51 (54%)
 Frame = +2

Query: 341 DNGLDYSNHARGGDHQAYGSHDVGSRDIQGPEKPPRPDGQSHGIAHANEEA 493
           DN LDY+N     ++ + GS+D  S+D     KP  PD +SHG   A E A
Sbjct: 22  DNALDYTNEDLAPNYNSEGSYDFDSKDFD--PKPDNPD-ESHGTKCAGEVA 69
>dbj|BAA96299.1| (AB035548) Orf5 [Streptomyces virginiae]
          Length = 277

 Score = 36.4 bits (82), Expect = 0.79
 Identities = 41/114 (35%), Positives = 49/114 (42%), Gaps = 1/114 (0%)
 Frame = -2

Query: 674 PPLQARRLSRQTPQSQPSCVAYCRRCVGTTYRRALSSRRTAAGSVFVAEHQSRANPSPRR 495
           PP   RRL+RQ P++      + RR  GT  R A   RR          H+SR  PSP R
Sbjct: 151 PPGHLRRLARQPPRTG-RLRPHLRR-TGTPARPARGHRR----------HRSRWGPSPPR 198

Query: 494 LLLRSHAQFRGFDHRVVVAS-PALECRASRHRGTRMLDDRLHGHGYCNQGHCRPP 333
                 A  R  DHR +V   PAL              DRL G G+  +G  RPP
Sbjct: 199 RAPGPAAPPRHADHRPLVRQLPAL------------ARDRLPGRGHV-RGEPRPP 240
>pir||A34615 profilaggrin - rat (fragment)
 gb|AAA41161.1| (M21759) profilaggrin [Rattus norvegicus]
          Length = 625

 Score = 36.0 bits (81), Expect = 1.0
 Identities = 45/165 (27%), Positives = 62/165 (37%), Gaps = 9/165 (5%)
 Frame = +2

Query: 185 QRAA*MSPDHGSSRSRRGVWSHXXXXXXXXXXXXXXXXXXXX-------SQRKDTEEEDD 343
           QRAA    D  S+R R    +H                           + R  +E    
Sbjct: 370 QRAARHEQDSDSTRQRGSHQAHSSARTQEEIARGRSGATASEGPGPQREAARDSSEHAQS 429

Query: 344 NGLDYSNHARGGDHQAYGSHDVGSRDIQGPEKPPRPDGQSHGIA--HANEEATGGGWGSR 517
              + S+  R G H    +H+   R  Q  ++  R  G   G A  H+  EA+GG  G R
Sbjct: 430 RRTETSSRGRSG-HSTGRAHE--DRHQQATDRSAR-SGSRGGQAGSHSESEASGGQAGRR 485

Query: 518 VTGALRRTPSPQQFFDSTAPADTSSQHIGGSRRRKTVAIVVSAETGAVPEEEDP 679
            T A R T  P+Q  D+     T S     S +R   +   S  TG+    E P
Sbjct: 486 GTAATRHTSRPEQSPDTA--GRTGSSRGQQSAQRHGDSTPGSTRTGSRGRGESP 537
>gb|AAF47658.1| (AE003475) CG16973 gene product [Drosophila melanogaster]
          Length = 979

 Score = 36.0 bits (81), Expect = 1.0
 Identities = 20/49 (40%), Positives = 27/49 (54%)
 Frame = +2

Query: 410 GSRDIQGPEKPPRPDGQSHGIAHANEEATGGGWGSRVTGALRRTPSPQQ 556
           GS+    PE PPR + QS G++ +   A+GGG  S+   AL     PQQ
Sbjct: 493 GSQQQAQPEAPPRNNRQSSGLSSSGGSASGGGGSSKPAAAL-----PQQ 536
>pir||JC6316 probable protein kinase (EC 2.7.1.-) - fruit fly (Drosophila
           melanogaster)
          Length = 1102

 Score = 35.2 bits (79), Expect = 1.8
 Identities = 21/49 (42%), Positives = 28/49 (56%)
 Frame = +2

Query: 410 GSRDIQGPEKPPRPDGQSHGIAHANEEATGGGWGSRVTGALRRTPSPQQ 556
           GS+  Q PE PPR + QS G++ +   A+GGG  S+   AL     PQQ
Sbjct: 542 GSQQAQ-PEAPPRNNRQSSGLSSSGGSASGGGGSSKPAAAL-----PQQ 584
>pir||S51306 G-box binding factor 1A - rape
 pir||S66312 G-box binding factor 1A - rape
 emb|CAA58774.1| (X83922) G-box binding factor 1A [Brassica napus]
          Length = 313

 Score = 33.6 bits (75), Expect = 5.2
 Identities = 24/82 (29%), Positives = 42/82 (50%), Gaps = 8/82 (9%)
 Frame = +2

Query: 386 QAYGSHDVGSRDIQGPEKPPRPDGQSH-GIAHANEEATGG-------GWGSRVTGALRRT 541
           QA G    GS   +G      P G  + G++H++E  TGG           +  G++R+ 
Sbjct: 106 QAPGKKSKGSLKSKGEGGEKAPSGSGNDGVSHSDESVTGGSSDENDENANHQEHGSVRKP 165

Query: 542 PSPQQFFDSTAPADTSSQHIGGSRRRKTVA 631
              Q   D+++ ++T+ + I GS   K +A
Sbjct: 166 SFGQMLADASSQSNTTGEMIQGSVPMKPLA 195
>sp|P50185|MTD5_DACSA MODIFICATION METHYLASE DSAV (CYTOSINE-SPECIFIC METHYLTRANSFERASE
           DSAV) (M.DSAV)
 pir||S50098 site-specific DNA-methyltransferase (cytosine-specific) (EC
           2.1.1.73) DsaV - Dactylococcopsis salina sp. nov. (DSM
           4880)
 gb|AAA86046.1| (U10528) DsaV methyltransferase [Dactylococcopsis salina]
          Length = 351

 Score = 33.2 bits (74), Expect = 6.8
 Identities = 22/64 (34%), Positives = 34/64 (52%)
 Frame = -2

Query: 590 TTYRRALSSRRTAAGSVFVAEHQSRANPSPRRLLLRSHAQFRGFDHRVVVASPALECRAS 411
           ++Y R +S+R    GS  + E   +AN +PR L  R  A+ +GF    V+  P  +C+A 
Sbjct: 232 SSYTRTISARYYKDGSEVLVE---QANKNPRVLTPRECARLQGFPESFVI--PVSDCQAW 286

Query: 410 RHRG 399
           R  G
Sbjct: 287 RQFG 290
>dbj|BAA85429.1| (AP000616) hypothetical protein [Oryza sativa]
          Length = 720

 Score = 32.8 bits (73), Expect = 9.0
 Identities = 31/94 (32%), Positives = 43/94 (44%), Gaps = 4/94 (4%)
 Frame = +1

Query: 238 SLESLGAPRSYTYSRDCRCGCLGMEPEKGH--GRGGRQWP*LQ*PCPWRRSSSIRVPRCR 411
           S+   GA R+ T +R+      G + ++G   GRGGR+ P +    P   SS   V RC 
Sbjct: 364 SMRVRGASRATTRARE------GEDEQEGRTEGRGGRRVPGV----PRHASSHAPVSRCS 413

Query: 412 LARHS--RAGEATTTRWSKPRNCACERRSNRRGLGFAR 519
            A       G AT  +WS+       R +   G GF R
Sbjct: 414 AADGDGESRGRATMAQWSRSVKWEGLRWNGELGFGFYR 451
>sp|O14640|DVL1_HUMAN SEGMENT POLARITY PROTEIN DISHEVELLED HOMOLOG DVL-1 (DISHEVELLED-1)
           (DSH HOMOLOG 1)
 gb|AAB65242.1| (AF006011) dishevelled 1 [Homo sapiens]
          Length = 670

 Score = 32.8 bits (73), Expect = 9.0
 Identities = 27/79 (34%), Positives = 35/79 (44%), Gaps = 7/79 (8%)
 Frame = +2

Query: 341 DNGLDYSNHARGGDHQAYGSHDVGS----RDIQGPEKPPRPDGQSHGIAHANEEA---TG 499
           D G  Y + + G   Q+ GS   GS    R   G EK  R  G     + ++  A    G
Sbjct: 530 DPGFSYGSGSTGSQ-QSEGSKSSGSTRSSRRAPGREKERRAAGAGGSGSESDHTAPSGVG 588

Query: 500 GGWGSRVTGALRRTPSPQQFFDSTAP 577
             W  R  G L R  SP+    +TAP
Sbjct: 589 SSWRERPAGQLSRGSSPRSQASATAP 614
>gb|AAB06574.1| (L81125) monooxygenase subunit [Pseudomonas sp.]
          Length = 504

 Score = 32.8 bits (73), Expect = 9.0
 Identities = 29/113 (25%), Positives = 40/113 (34%), Gaps = 1/113 (0%)
 Frame = +1

Query: 262 RSYTYSRDCRCGC-LGMEPEKGHGRGGRQWP*LQ*PCPWRRSSSIRVPRCRLARHSRAGE 438
           RS+     CR  C +G      HG     WP      PWR +S      C    H     
Sbjct: 381 RSFRLLSSCRPTCSIGWPATAMHGCA--PWP----ATPWRSTSICGA--CAPGCHGSPAF 432

Query: 439 ATTTRWSKPRNCACERRSNRRGLGFARDWCSATNTEPAAVLRLDSARRYVVPTH 600
                  +P  C+       R   +   WC+A  + PAA+      RR+  PT+
Sbjct: 433 RHRLAPRRPVRCSARYTVPSRATSYPAPWCAAKASRPAAI------RRWTKPTN 480

  Database: nr
    Posted date:  Sep 29, 2000  9:53 PM
  Number of letters in database: 177,575,912
  Number of sequences in database:  565,281

Lambda     K      H
   0.318    0.135    0.00

Gapped
Lambda     K      H
   0.270   0.0470 4.94e-324

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 475581302
Number of Sequences: 565281
Number of extensions: 10393569
Number of successful extensions: 38844
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 38836
Number of HSP's gapped (non-prelim): 16
length of query: 479
length of database: 177,575,912
effective HSP length: 54
effective length of query: 424
effective length of database: 147,050,738
effective search space: 62349512912
effective search space used: 62349512912
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 73 (32.8 bits)