The query sequence for this search has been filtered. Filtering
eliminates low complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the blast reports, leaving regions whose blast statistics
reflect the specificity of their pairwise alignment.

BLASTX 2.1.1 [Aug-8-2000]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Contig17.seq Contig17
         (876 letters)

Database: nr
           565,281 sequences; 177,575,912 total letters


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAA18616.1|  (U09738) circumsporozoite protein [Plasmodiu...    38  0.11
gb|AAA18617.1|  (U09765) circumsporozoite protein [Plasmodiu...    38  0.15
pir||T41653  probable transcription or splicing factor - fis...    38  0.20
pir||S23809  collagen alpha 2(I) chain homolog - sea urchin ...    38  0.20
pir||T20450  hypothetical protein E04D5.1 - Caenorhabditis e...    37  0.34
pir||B42856  ubiquitin carrier protein E2 - human                  36  0.44
ref|NP_055316.1|  ubiquitin carrier protein >gi|2501433|sp|Q...    36  0.44
pir||S42731  collagen alpha 1 chain - sea urchin (Hemicentro...    36  0.58
sp|P17656|CC02_CAEEL  CUTICLE COLLAGEN 2 >gi|84426|pir||B312...    36  0.76
pir||T26004  hypothetical protein ZK863.2 - Caenorhabditis e...    36  0.76
pir||A36226  collagen alpha 1 chain - sea urchin (Paracentro...    35  1.3
pir||T20497  hypothetical protein F02D10.1 - Caenorhabditis ...    35  1.3
pir||T35861  probable large secreted protein - Streptomyces ...    34  1.7
pir||T18637  hypothetical protein B0024.1 - Caenorhabditis e...    34  1.7
gb|AAF36091.1|  (AF218624) flagelliform silk protein [Nephil...    34  1.7
sp|P46804|SPD2_NEPCL  SPIDROIN 2 (DRAGLINE SILK FIBROIN 2) >...    34  2.2
pir||T32734  myosin-IA - Acanthamoeba castellanii >gi|359947...    34  2.2
pir||I40333  tracheal colonization factor A precursor - Bord...    34  2.2
emb|CAA08832.1|  (AJ009785) tracheal colonization factor [Bo...    34  2.2
pir||T29731  hypothetical protein F41F3.4 - Caenorhabditis e...    34  2.2
pir||T15143  hypothetical protein T28F2.8 - Caenorhabditis e...    34  2.9
gb|AAF84591.1|AE004000_8  (AE004000) hypothetical protein [X...    34  2.9
gb|AAC67560.1|  (AF095461) fibrinogen A-alpha chain [Felis c...    34  2.9
gb|AAC33847.1|  (AF043944) nongradient byssal precursor [Myt...    33  3.8
dbj|BAA07815.1|  (D43758) fibrinogen A-alpha-chain [Macaca m...    33  3.8
pir||I50694  collagen alpha 1(III) chain - chicken (fragment...    33  3.8
gb|AAF82207.1|AC067971_15  (AC067971) EST gb|AI998275 comes ...    33  3.8
pir||T42310  hypothetical protein - phage SPP1 >gi|2764887|e...    33  3.8
gb|AAF36092.1|  (AF218624) flagelliform silk protein [Nephil...    33  3.8
gb|AAF52203.1|  (AE003608) vkg gene product [Drosophila mela...    33  5.0
pir||S74598  hypothetical protein sll1040 - Synechocystis sp...    33  5.0
gb|AAF47772.1|  (AE003478) sty gene product [Drosophila mela...    33  5.0
sp|P04258|CA13_BOVIN  COLLAGEN ALPHA 1(III) CHAIN >gi|71410|...    33  5.0
pir||T27525  hypothetical protein ZC373.7 - Caenorhabditis e...    33  5.0
dbj|BAA90381.1|  (AP001081) hypothetical protein [Oryza sativa]    33  5.0
pir||A41726  homeotic protein BarH2 - fruit fly (Drosophila ...    33  5.0
gb|AAC67561.1|  (AF095462) fibrinogen A-alpha chain [Equus c...    32  6.6
sp|P20631|CC13_CAEEL  CUTICLE COLLAGEN 13 PRECURSOR >gi|8443...    32  6.6
gb|AAF60775.1|  (AC024811) contains similarity to Pfam famil...    32  6.6
sp|P20630|CC12_CAEEL  CUTICLE COLLAGEN 12 PRECURSOR >gi|8442...    32  6.6
gb|AAF55419.1|  (AE003717) CG5866 gene product [Drosophila m...    32  6.6
pir||A27353  collagen alpha 1(III) chain precursor - mouse (...    32  8.6
gb|AAF76432.1|AF272661_1  (AF272661) alpha 4 type V collagen...    32  8.6
pir||T20720  hypothetical protein F10F2.9 - Caenorhabditis e...    32  8.6
pir||T00041  BH-protocadherin PCDH7 (clone BH-Pcdh-b) - huma...    32  8.6
pir||G02127  fus-like protein - human (fragment) >gi|1040970...    32  8.6
sp|P10569|MYSC_ACACA  MYOSIN IC HEAVY CHAIN >gi|71609|pir||M...    32  8.6
gb|AAC58530.1|  (U85043) gag polyprotein [feline syncytial v...    32  8.6
dbj|BAA33381.1|  (AB008374) alpha 3 type I collagen [Oncorhy...    32  8.6
pir||T35089  probable integral membrane transport protein - ...    32  8.6
pir||T08435  la costa protein - fruit fly (Drosophila melano...    32  8.6
pir||T29982  hypothetical protein F11G11.12 - Caenorhabditis...    32  8.6
ref|NP_037465.1|  EH domain-binding mitotic phosphoprotein >...    32  8.6
pir||T00042  BH-protocadherin PCDH7 (clone BH-Pcdh-c) - huma...    32  8.6
ref|NP_002580.1|  BH-protocadherin (brain-heart) >gi|7512301...    32  8.6
dbj|BAB01598.1|  (AB046016) unnamed protein product [Macaca ...    32  8.6
pir||T21070  hypothetical protein F17C8.2 - Caenorhabditis e...    32  8.6
gb|AAC98089.1|  (AF051353) myosin IC heavy chain [Acanthamoe...    32  8.6
dbj|BAA07813.1|  (D43756) fibrinogen A-alpha-chain [Canis fa...    32  8.6

>gb|AAA18616.1| (U09738) circumsporozoite protein [Plasmodium vivax-like sp.]
          Length = 393

 Score = 38.3 bits (87), Expect = 0.11
 Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q++G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 104 GGAAAPGANQEGGAAAPGANQEDGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 158

Query: 673 GG 668
           GG
Sbjct: 159 GG 160
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 181 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 235

Query: 673 GG 668
           GG
Sbjct: 236 GG 237
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 159 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 213

Query: 673 GG 668
           GG
Sbjct: 214 GG 215
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 170 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 224

Query: 673 GG 668
           GG
Sbjct: 225 GG 226
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 137 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 191

Query: 673 GG 668
           GG
Sbjct: 192 GG 193
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 148 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 202

Query: 673 GG 668
           GG
Sbjct: 203 GG 204
 Score = 37.1 bits (84), Expect = 0.26
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 192 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQG 246

Query: 673 GG 668
           GG
Sbjct: 247 GG 248
 Score = 35.6 bits (80), Expect = 0.76
 Identities = 28/67 (41%), Positives = 38/67 (55%), Gaps = 15/67 (22%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDS------NEGGSAA 692
           GG++ P A  E G   PGA Q  G+A P      G  AP  ++   +       EGG+AA
Sbjct: 225 GGAAAPGANQEGGAAAPGANQGGGAAAPGANQGGGAAAPGANQGGGAAAPGANQEGGAAA 284

Query: 691 PWARE-----RGGRNQDNAAGN 641
           P A +      GG+ Q+N   N
Sbjct: 285 PGANQGGAKPAGGQGQNNEGAN 306
 Score = 34.8 bits (78), Expect = 1.3
 Identities = 24/58 (41%), Positives = 32/58 (54%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++      GG+AAP A + 
Sbjct: 214 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQGGGAAAPGANQG-----GGAAAPGANQG 268

Query: 673 GG 668
           GG
Sbjct: 269 GG 270
>gb|AAA18617.1| (U09765) circumsporozoite protein [Plasmodium simiovale]
          Length = 393

 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 148 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 202

Query: 673 GG 668
           GG
Sbjct: 203 GG 204
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 126 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 180

Query: 673 GG 668
           GG
Sbjct: 181 GG 182
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 115 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 169

Query: 673 GG 668
           GG
Sbjct: 170 GG 171
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 104 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 158

Query: 673 GG 668
           GG
Sbjct: 159 GG 160
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 159 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 213

Query: 673 GG 668
           GG
Sbjct: 214 GG 215
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 170 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 224

Query: 673 GG 668
           GG
Sbjct: 225 GG 226
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 181 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 235

Query: 673 GG 668
           GG
Sbjct: 236 GG 237
 Score = 37.9 bits (86), Expect = 0.15
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 137 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 191

Query: 673 GG 668
           GG
Sbjct: 192 GG 193
 Score = 37.1 bits (84), Expect = 0.26
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++     EGG+AAP A + 
Sbjct: 192 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQG 246

Query: 673 GG 668
           GG
Sbjct: 247 GG 248
 Score = 35.6 bits (80), Expect = 0.76
 Identities = 28/67 (41%), Positives = 38/67 (55%), Gaps = 15/67 (22%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDS------NEGGSAA 692
           GG++ P A  E G   PGA Q  G+A P      G  AP  ++   +       EGG+AA
Sbjct: 225 GGAAAPGANQEGGAAAPGANQGGGAAAPGANQGGGAAAPGANQGGGAAAPGANQEGGAAA 284

Query: 691 PWARE-----RGGRNQDNAAGN 641
           P A +      GG+ Q+N   N
Sbjct: 285 PGANQGGAKSAGGQGQNNEGAN 306
 Score = 34.8 bits (78), Expect = 1.3
 Identities = 24/58 (41%), Positives = 32/58 (54%), Gaps = 4/58 (6%)
 Frame = -3

Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GG++ P A  E G   PGA Q+ G+A P      G  AP  ++      GG+AAP A + 
Sbjct: 214 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQGGGAAAPGANQG-----GGAAAPGANQG 268

Query: 673 GG 668
           GG
Sbjct: 269 GG 270
>pir||T41653 probable transcription or splicing factor - fission yeast
           (Schizosaccharomyces pombe)
 emb|CAA20438.1| (AL031323) putative transcription or splicing factor
           [Schizosaccharomyces pombe]
 gb|AAF02214.1|AF073779_1 (AF073779) putative splicing factor BBP/SF1 [Schizosaccharomyces
           pombe]
          Length = 587

 Score = 37.5 bits (85), Expect = 0.20
 Identities = 25/74 (33%), Positives = 36/74 (47%), Gaps = 3/74 (4%)
 Frame = -3

Query: 868 HEELMQELGGGSSGPPARIEAGPGAQ--DNGSAKPWERGPTGGPAPWRSRNQD-SNEGGS 698
           ++ LMQELGGGS+      E     +  ++G+A P      G P PW + +   S+   S
Sbjct: 367 YQSLMQELGGGSAISNGNGEPQKSIEFSESGAASP----QAGHPPPWAAASTSVSSSTSS 422

Query: 697 AAPWARERGGRNQDNAA 647
            APWA+        N A
Sbjct: 423 PAPWAKPASSAAPSNPA 439
>pir||S23809 collagen alpha 2(I) chain homolog - sea urchin (Strongylocentrotus
            purpuratus)
 gb|AAA30035.1| (M92040) alpha-1 collagen [Strongylocentrotus purpuratus]
          Length = 1414

 Score = 37.5 bits (85), Expect = 0.20
 Identities = 20/65 (30%), Positives = 25/65 (37%), Gaps = 1/65 (1%)
 Frame = -3

Query: 841  GGSSGPP-ARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
            GG +GPP A    GP  +      P   GP G P P        ++G   AP  +   G 
Sbjct: 938  GGPAGPPGAAGSRGPAGKSGDRGSPGAVGPAGNPGPAGENGMPGSDGNDGAPGPQGSRGE 997

Query: 664  NQDNAA 647
              D  A
Sbjct: 998  KGDTGA 1003
>pir||T20450 hypothetical protein E04D5.1 - Caenorhabditis elegans
 emb|CAA91279.1| (Z66496) cDNA EST yk84b6.3 comes from this gene; cDNA EST yk84b6.5
           comes from this gene~cDNA EST yk117a5.5 comes from this
           gene; cDNA EST yk132f3.3 comes from this gene~cDNA EST
           yk132f3.5 comes from this gene; cDNA EST yk165g1.3 comes
           from this gene~cD>
          Length = 618

 Score = 36.7 bits (83), Expect = 0.34
 Identities = 16/43 (37%), Positives = 23/43 (53%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQD 716
           GGGS+GPP+     PG Q+   A+P   G    P P+R +  +
Sbjct: 525 GGGSAGPPSAAAPTPGNQNQRPAQPRANGNGNAPQPFRPQQSE 567
>pir||B42856 ubiquitin carrier protein E2 - human
          Length = 247

 Score = 36.4 bits (82), Expect = 0.44
 Identities = 21/71 (29%), Positives = 28/71 (38%)
 Frame = -3

Query: 859 LMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWAR 680
           L+ E+ GG+ GP  R EAG        A   + G  GGP            GG+  P A+
Sbjct: 172 LLTEIHGGAGGPSGRAEAGRALASGTEASSTDPGAPGGP------------GGAEGPMAK 219

Query: 679 ERGGRNQDNAA 647
           +  G      A
Sbjct: 220 KHAGERDKKLA 230
>ref|NP_055316.1| ubiquitin carrier protein
 sp|Q16763|UBCE_HUMAN UBIQUITIN-CONJUGATING ENZYME E2-24 KD (UBIQUITIN-PROTEIN LIGASE)
           (UBIQUITIN CARRIER PROTEIN) (E2-EPF5)
 gb|AAA58446.1| (M91670) ubiquitin carrier protein [Homo sapiens]
          Length = 225

 Score = 36.4 bits (82), Expect = 0.44
 Identities = 21/71 (29%), Positives = 28/71 (38%)
 Frame = -3

Query: 859 LMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWAR 680
           L+ E+ GG+ GP  R EAG        A   + G  GGP            GG+  P A+
Sbjct: 150 LLTEIHGGAGGPSGRAEAGRALASGTEASSTDPGAPGGP------------GGAEGPMAK 197

Query: 679 ERGGRNQDNAA 647
           +  G      A
Sbjct: 198 KHAGERDKKLA 208
>pir||S42731 collagen alpha 1 chain - sea urchin (Hemicentrotus pulcherrimus)
           (fragment)
 gb|AAB30065.1| (S70718) fibrillar collagen alpha 120 and 140 chains [Hemicentrotus
           pulcherrimus=sea urchins, tests, Peptide, 632 aa]
          Length = 632

 Score = 36.0 bits (81), Expect = 0.58
 Identities = 19/65 (29%), Positives = 25/65 (38%), Gaps = 1/65 (1%)
 Frame = -3

Query: 841 GGSSGPPARIEA-GPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
           GG +GPP    + GP  +      P   GP G P P        ++G   AP  +   G 
Sbjct: 156 GGPAGPPGEAGSRGPPGKSGERGSPGAVGPPGNPGPAGENGMPGSDGNDGAPGPQGARGE 215

Query: 664 NQDNAA 647
             D  A
Sbjct: 216 KGDTGA 221
>sp|P17656|CC02_CAEEL CUTICLE COLLAGEN 2
 pir||B31219 collagen 2 - Caenorhabditis elegans
 emb|CAA23464.1| (V00148) unnamed protein product [Caenorhabditis elegans]
 gb|AAA27990.1| (J01048) collagen [Caenorhabditis elegans]
 emb|CAA92620.1| (Z68301) predicted using Genefinder~similar to cuticle collagen
           (CC02)~cDNA EST yk94a4.5 comes from this gene~cDNA EST
           yk94a4.3 comes from this gene~cDNA EST yk68d1.5 comes
           from this gene~cDNA EST yk68d1.3 comes from this gene
           [Caenorhabditis elegans]
          Length = 301

 Score = 35.6 bits (80), Expect = 0.76
 Identities = 23/65 (35%), Positives = 26/65 (39%), Gaps = 3/65 (4%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEG---GSAAPWARERGG 668
           G +GPP     GP       A P   GP G P P      D   G   G   P A E+GG
Sbjct: 159 GPAGPPG--PPGPDGNPGSPAGPSGPGPAGPPGPAGPAGNDGAPGAPGGPGEPGASEQGG 216

Query: 667 RNQDNAAG 644
             +   AG
Sbjct: 217 PGEPGPAG 224
>pir||T26004 hypothetical protein ZK863.2 - Caenorhabditis elegans
 emb|CAB09131.1| (Z95621) similar to collagen~cDNA EST yk93b5.5 comes from this
           gene~cDNA EST yk120a5.5 comes from this gene
           [Caenorhabditis elegans]
 emb|CAB01457.1| (Z78019) similar to collagen~cDNA EST yk93b5.5 comes from this
           gene~cDNA EST yk120a5.5 comes from this gene
           [Caenorhabditis elegans]
          Length = 330

 Score = 35.6 bits (80), Expect = 0.76
 Identities = 23/66 (34%), Positives = 28/66 (41%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRNQ 659
           G +GPP     GP     G + P   GP G P P      D N G + A    + GG  Q
Sbjct: 175 GPAGPPG--PPGPDGAPGGGSGPGPAGPAGPPGP------DGNPGSAGAD--GQPGGPGQ 224

Query: 658 DNAAGN 641
           D A G+
Sbjct: 225 DGAPGS 230
>pir||A36226 collagen alpha 1 chain - sea urchin (Paracentrotus lividus)
 gb|AAA29438.1| (M25282) alpha collagen type 1 precursor [Paracentrotus lividus]
          Length = 730

 Score = 34.8 bits (78), Expect = 1.3
 Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQ-------DNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWAR 680
           G+SGPP      PG++       D GS  P   GP G P P     Q  ++G   AP A+
Sbjct: 254 GASGPPGA-PGEPGSRGSHGKSGDRGS--PGAVGPPGNPGPAGENGQPGSDGNDGAPGAQ 310

Query: 679 ERGGRNQDNAA 647
              G   D  A
Sbjct: 311 GPRGEKGDTGA 321
>pir||T20497 hypothetical protein F02D10.1 - Caenorhabditis elegans
 emb|CAA91932.1| (Z67990) similar to cuticle collagen [Caenorhabditis elegans]
          Length = 316

 Score = 34.8 bits (78), Expect = 1.3
 Identities = 21/54 (38%), Positives = 24/54 (43%), Gaps = 2/54 (3%)
 Frame = -3

Query: 850 ELGGGSSGPPARIEAGPGAQD--NGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
           E G G +GPP    AGP   D  +GS      GP G P P      D N G +  P
Sbjct: 232 EAGPGPAGPPG--PAGPAGPDGQSGSGSAGGPGPKGPPGPAGQPGSDGNPGTAGPP 285
>pir||T35861 probable large secreted protein - Streptomyces coelicolor
 emb|CAB41562.1| (AL049727) putative large secreted protein [Streptomyces coelicolor
           A3(2)]
          Length = 877

 Score = 34.4 bits (77), Expect = 1.7
 Identities = 28/78 (35%), Positives = 35/78 (43%), Gaps = 8/78 (10%)
 Frame = -3

Query: 874 IRHEELMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPT-------GGPA-PWRSRNQ 719
           I H+ +++E  GG+ GP        GA  NG+  PW   PT        GPA    S   
Sbjct: 637 IPHDIVVREPAGGADGP--------GAAANGARAPWVAAPTARDSADPAGPAIDTGSVTG 688

Query: 718 DSNEGGSAAPWARERGGRNQDNAAGN 641
              E G AAP A          AAG+
Sbjct: 689 TGTETGQAAPGAAAGRSATASAAAGD 714
>pir||T18637 hypothetical protein B0024.1 - Caenorhabditis elegans
 emb|CAA94874.1| (Z71178) similar to collagen [Caenorhabditis elegans]
          Length = 297

 Score = 34.4 bits (77), Expect = 1.7
 Identities = 24/66 (36%), Positives = 27/66 (40%), Gaps = 2/66 (3%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQD--NGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERG 671
           G GS G P R    PG Q       +P ERGP+G   P     Q  N G    P A    
Sbjct: 216 GHGSPGAPGRA-GQPGRQGAPGNPGRPGERGPSGPCGPAGRSGQPGNRGSDGHPGAPGNP 274

Query: 670 GRNQDNAA 647
           G    +AA
Sbjct: 275 GLQGSDAA 282
>gb|AAF36091.1| (AF218624) flagelliform silk protein [Nephila madagascariensis]
          Length = 1884

 Score = 34.4 bits (77), Expect = 1.7
 Identities = 26/65 (40%), Positives = 28/65 (43%)
 Frame = -3

Query: 844  GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
            G G SGP     AGPG    G A P   GP GG  P  +    S  GG A P    RGG 
Sbjct: 1591 GPGGSGPGG---AGPGGAGPGGAGPGGAGP-GGVGPGGAGPGGSGPGG-AGPGGAGRGGA 1645

Query: 664  NQDNA 650
             +  A
Sbjct: 1646 GRGGA 1650
>sp|P46804|SPD2_NEPCL SPIDROIN 2 (DRAGLINE SILK FIBROIN 2)
 pir||A44112 spidroin 2, dragline silk fibroin - orb spider (Nephila clavipes)
           (fragment)
 gb|AAA29381.1| (M92913) dragline silk fibroin [Nephila clavipes]
          Length = 627

 Score = 34.0 bits (76), Expect = 2.2
 Identities = 24/67 (35%), Positives = 29/67 (42%), Gaps = 1/67 (1%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTG-GPAPWRSRNQDSNEGGSAAPWARERGG 668
           G GS+   A   +GPG Q  G   P ++GP G GP       Q  +  GSAA  A    G
Sbjct: 160 GPGSAAAAAAAASGPGQQGPGGYGPGQQGPGGYGPG-----QQGPSGPGSAAAAAAAASG 214

Query: 667 RNQDNAAG 644
             Q    G
Sbjct: 215 PGQQGPGG 222
>pir||T32734 myosin-IA - Acanthamoeba castellanii
 gb|AAC35357.1| (AF085185) Myosin-IA [Acanthamoeba castellanii]
          Length = 1215

 Score = 34.0 bits (76), Expect = 2.2
 Identities = 22/66 (33%), Positives = 23/66 (34%), Gaps = 2/66 (3%)
 Frame = -3

Query: 841  GGSSGPPARIEAGPGAQD--NGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGG 668
            GG  GPP     G GA     G+  P   GP G P   R        GG   P     GG
Sbjct: 1046 GGPGGPPGPAGPGRGAPGPGRGAPGPSRGGPGGPPPGGRGMPPPGGRGGPGGPGPAGPGG 1105

Query: 667  RNQDNAAG 644
            R      G
Sbjct: 1106 RGMPAPGG 1113
 Score = 32.1 bits (71), Expect = 8.6
 Identities = 22/67 (32%), Positives = 24/67 (34%), Gaps = 2/67 (2%)
 Frame = -3

Query: 844  GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPW--RSRNQDSNEGGSAAPWARERG 671
            G G+ GP      GP     G   P  RG  GGP P     R   +  GG   P      
Sbjct: 1065 GRGAPGPSRGGPGGPPPGGRGMPPPGGRGGPGGPGPAGPGGRGMPAPGGGRGGPGGPGPA 1124

Query: 670  GRNQDNAAG 644
            GR     AG
Sbjct: 1125 GRGGPGPAG 1133
>pir||I40333 tracheal colonization factor A precursor - Bordetella pertussis
 gb|AAC43453.1| (U16754) tracheal colonization factor [Bordetella pertussis]
          Length = 672

 Score = 34.0 bits (76), Expect = 2.2
 Identities = 24/59 (40%), Positives = 27/59 (45%), Gaps = 3/59 (5%)
 Frame = -3

Query: 844 GGGSSG---PPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GGG  G   PPA    G G   NG+A+  ERG   GP P         EGG   P   + 
Sbjct: 289 GGGDEGQRPPPAAGNGGNGG--NGNAQLPERGDDAGPKP------PEGEGGDEGPQPPQG 340

Query: 673 GG 668
           GG
Sbjct: 341 GG 342
>emb|CAA08832.1| (AJ009785) tracheal colonization factor [Bordetella pertussis]
          Length = 647

 Score = 34.0 bits (76), Expect = 2.2
 Identities = 24/59 (40%), Positives = 27/59 (45%), Gaps = 3/59 (5%)
 Frame = -3

Query: 844 GGGSSG---PPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
           GGG  G   PPA    G G   NG+A+  ERG   GP P         EGG   P   + 
Sbjct: 264 GGGDEGQRPPPAAGNGGNGG--NGNAQLPERGDDAGPKP------PEGEGGDEGPQPPQG 315

Query: 673 GG 668
           GG
Sbjct: 316 GG 317
>pir||T29731 hypothetical protein F41F3.4 - Caenorhabditis elegans
 gb|AAA97982.1| (U55366) Similar to cuticle collagen [Caenorhabditis elegans]
          Length = 310

 Score = 34.0 bits (76), Expect = 2.2
 Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 17/69 (24%)
 Frame = -3

Query: 847 LGGGSSGP----PARIEAGPGAQD------------NGSAKPWERGPTGGPAPWRSRNQD 716
           +GGGS  P    P     GPG               + S++P + GP G P P     Q 
Sbjct: 183 VGGGSGAPGAPGPKGAPGGPGQPGRDGQPGQAGQPGSSSSEPGQPGPNGQPGPRGPPGQA 242

Query: 715 SNEGGSAAPWA-RERGGRNQDNAAGN 641
            + GG+  P    + G R  D   GN
Sbjct: 243 GSPGGNGQPGGPGQPGQRGSDGQPGN 268
>pir||T15143 hypothetical protein T28F2.8 - Caenorhabditis elegans
 gb|AAB53059.1| (AF000198) Similar to cuticular collagen [Caenorhabditis elegans]
          Length = 435

 Score = 33.6 bits (75), Expect = 2.9
 Identities = 26/67 (38%), Positives = 29/67 (42%), Gaps = 2/67 (2%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQDN--GSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERG 671
           G G +GPP     GP  QD   G+A+P   GP G P P      D   GG   P     G
Sbjct: 250 GPGPAGPPG--PPGPPGQDGSGGAAQP---GPPGPPGP---PGNDGQPGGPGQP-----G 296

Query: 670 GRNQDNAAG 644
           G  QD   G
Sbjct: 297 GPGQDGGPG 305
>gb|AAF84591.1|AE004000_8 (AE004000) hypothetical protein [Xylella fastidiosa]
          Length = 302

 Score = 33.6 bits (75), Expect = 2.9
 Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 8/53 (15%)
 Frame = -3

Query: 835 SSGPPARIEAGPGAQDNGSAKPWERGPT--------GGPAPWRSRNQDSNEGGSAAPWAR 680
           SS PPA  E G   +   +A P   GPT          P   R  + D+NE G+A+P A 
Sbjct: 56  SSTPPALPEPGVIVRPPANASPPTTGPTPTAQPASPASPTSTRESDADANEAGTASPTAN 115

Query: 679 E 677
           +
Sbjct: 116 D 116
>gb|AAC67560.1| (AF095461) fibrinogen A-alpha chain [Felis catus]
          Length = 463

 Score = 33.6 bits (75), Expect = 2.9
 Identities = 20/56 (35%), Positives = 26/56 (45%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARE 677
           G GSSGP +     PG+   GS+  W  G + GP    + N  S+   SA  W  E
Sbjct: 188 GPGSSGPGSSGAWNPGSSGPGSSGAWNPG-SSGPGSGSTWNAGSSGVSSAGTWDTE 242
 Score = 32.8 bits (73), Expect = 5.0
 Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 4/51 (7%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTG----GPAPWRSRNQDSNEGGSAAPW 686
           GSSGP +     PG+   GS+  W  G +G    GP    + N  S+  GS+  W
Sbjct: 159 GSSGPGSSGAWNPGSSTPGSSGAWNPGSSGPGSSGPGSSGAWNPGSSGPGSSGAW 213
 Score = 32.1 bits (71), Expect = 8.6
 Identities = 18/51 (35%), Positives = 25/51 (48%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
           GSSGP +     PG+   GS+  W  G + GP    + N  S+  GS+  W
Sbjct: 133 GSSGPGSSGAWNPGSTGPGSSGAWNPG-SSGPGSSGAWNPGSSTPGSSGAW 182
>gb|AAC33847.1| (AF043944) nongradient byssal precursor [Mytilus edulis]
          Length = 904

 Score = 33.2 bits (74), Expect = 3.8
 Identities = 22/67 (32%), Positives = 25/67 (36%), Gaps = 4/67 (5%)
 Frame = -3

Query: 841 GGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER---- 674
           GG  GPP     GP         P E+G  G P       Q  N G    P A  +    
Sbjct: 241 GGPPGPPGHSPQGPQGSRGAPGAPGEQGANGSP------GQPGNAGAPGQPGAPGQAGAP 294

Query: 673 GGRNQDNAAGN 641
           G R    AAG+
Sbjct: 295 GARGPSGAAGH 305
 Score = 32.1 bits (71), Expect = 8.6
 Identities = 20/65 (30%), Positives = 26/65 (39%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRNQ 659
           G  G P   + GP   +  +  P  +GPTG      S  +D   GG   P A  +GG   
Sbjct: 415 GDKGAPG--DVGPEGPEGPAGGPGPKGPTGPQGAKGSPGEDGEPGGEGEPGA--KGGDGL 470

Query: 658 DNAAG 644
              AG
Sbjct: 471 PGQAG 475
>dbj|BAA07815.1| (D43758) fibrinogen A-alpha-chain [Macaca mulatta]
          Length = 455

 Score = 33.2 bits (74), Expect = 3.8
 Identities = 18/51 (35%), Positives = 24/51 (46%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
           GSSGP +     PG+   GS   W+ G + GP    + N  S+  GS   W
Sbjct: 120 GSSGPGSTGHQSPGSSGPGSTGTWKPG-SSGPGSTGTWNPGSSGTGSTGTW 169
 Score = 32.1 bits (71), Expect = 8.6
 Identities = 19/51 (37%), Positives = 22/51 (42%), Gaps = 2/51 (3%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGP--TGGPAPWRSRNQDSNEGGSAAPW 686
           GSSGP +     PG+   GS   W  G   TG    W   N  S+  GS   W
Sbjct: 146 GSSGPGSTGTWNPGSSGTGSTGTWNPGSSGTGSTGTW---NPGSSGSGSTGTW 195
>pir||I50694 collagen alpha 1(III) chain - chicken (fragment)
 gb|AAA83407.1| (U07973) alpha-1 collagen type III [Gallus gallus]
          Length = 886

 Score = 33.2 bits (74), Expect = 3.8
 Identities = 21/56 (37%), Positives = 26/56 (45%), Gaps = 2/56 (3%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNG-SAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWAR-ERG 671
           G+ GPP    A  G    G    P ERG +G P P   + +   +G    P AR ERG
Sbjct: 704 GAKGPPGPPGAPGGTGLPGLQGMPGERGASGSPGPKGDKGEPGGKGADGLPGARGERG 761
>gb|AAF82207.1|AC067971_15 (AC067971) EST gb|AI998275 comes from this gene. [Arabidopsis
           thaliana]
          Length = 155

 Score = 33.2 bits (74), Expect = 3.8
 Identities = 18/61 (29%), Positives = 27/61 (43%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
           GGG +    R   G G   +  ++ W+RG  GG  P  +   + + GG +A   R  G  
Sbjct: 74  GGGGARSGGRSRGGGGGSSSSRSRDWKRG--GGVVPIHTGGGNGSLGGGSAGSHRSSGSM 131

Query: 664 N 662
           N
Sbjct: 132 N 132
>pir||T42310 hypothetical protein - phage SPP1
 emb|CAA66517.1| (X97918) gene 24.1 [Bacteriophage SPP1]
          Length = 91

 Score = 33.2 bits (74), Expect = 3.8
 Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
 Frame = +1

Query: 508 QEQRH--MHRNQDRRHIQPGFQELVAAMEQRHLKHRHMLDRSSYHRHYQRHYPGCDHR 675
           QEQR   + R  DR   Q   Q +  ++ + H+KH   LD+++YHR Y  H   C  R
Sbjct: 18  QEQRISLLERTSDRHDQQ--IQAVTESLSKIHVKHH--LDKANYHRSYHFHALNCCSR 71
>gb|AAF36092.1| (AF218624) flagelliform silk protein [Nephila madagascariensis]
          Length = 626

 Score = 33.2 bits (74), Expect = 3.8
 Identities = 25/67 (37%), Positives = 28/67 (41%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
           G G +GP     AGPG    G A P   GP GG  P  +    +  GG A P    RGG 
Sbjct: 314 GPGGAGPGG---AGPGGVGPGGAGPGGAGP-GGVGPGGAGPGGAGPGG-AGPGGAGRGGA 368

Query: 664 NQDNAAG 644
               A G
Sbjct: 369 GPGGAGG 375
>gb|AAF52203.1| (AE003608) vkg gene product [Drosophila melanogaster]
          Length = 1940

 Score = 32.8 bits (73), Expect = 5.0
 Identities = 20/66 (30%), Positives = 23/66 (34%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRNQ 659
           G SG P     GP   D    +P  +GP+G P     R      G    P     GG N 
Sbjct: 108 GKSGEPGT--PGPRGIDGCDGRPGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGINS 165

Query: 658 DNAAGN 641
               GN
Sbjct: 166 KGTKGN 171
>pir||S74598 hypothetical protein sll1040 - Synechocystis sp. (strain PCC 6803)
 dbj|BAA16750.1| (D90900) hypothetical protein [Synechocystis sp.]
          Length = 765

 Score = 32.8 bits (73), Expect = 5.0
 Identities = 21/63 (33%), Positives = 30/63 (47%), Gaps = 1/63 (1%)
 Frame = +2

Query: 242 IVEGLLLSETSILKHPGLA*PHLSTGGWGRRRLITGRRWWSVARGRRWGRC-CTSILLDE 418
           I+ GLL    ++L  P  A   LS+GG    +  +   WW   + RR GR  C+ + L  
Sbjct: 14  ILLGLLAFTLAVLIPPATAQITLSSGGNNGSQNTSSTPWWDTNKARRCGRLWCSDVFLQG 73

Query: 419 SINV 430
           S  V
Sbjct: 74  SSQV 77
>gb|AAF47772.1| (AE003478) sty gene product [Drosophila melanogaster]
          Length = 589

 Score = 32.8 bits (73), Expect = 5.0
 Identities = 15/55 (27%), Positives = 26/55 (47%)
 Frame = +1

Query: 508 QEQRHMHRNQDRRHIQPGFQELVAAMEQRHLKHRHMLDRSSYHRHYQRHYPGCDH 672
           Q  +H+H  Q ++H+Q   Q+     +Q+HL+H+     +      Q    G DH
Sbjct: 234 QRNQHLHLQQHQQHLQQQQQQQQQQQQQQHLQHQQNQQHARLATTTQATSVGSDH 288
>sp|P04258|CA13_BOVIN COLLAGEN ALPHA 1(III) CHAIN
 pir||CGBO7S collagen alpha 1(III) chain - bovine
          Length = 1049

 Score = 32.8 bits (73), Expect = 5.0
 Identities = 18/54 (33%), Positives = 23/54 (42%)
 Frame = -3

Query: 850 ELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
           E G G++GPP     G          P ERG  GGP P   + +  + G   AP
Sbjct: 548 EGGKGAAGPPG--PPGSAGTPGLQGMPGERGGPGGPGPKGDKGEPGSSGVDGAP 599
 Score = 32.5 bits (72), Expect = 6.6
 Identities = 25/66 (37%), Positives = 31/66 (46%), Gaps = 7/66 (10%)
 Frame = -3

Query: 841 GGSSGPPARIEAG-PGAQ--DNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW----A 683
           GG  GP  + + G PG+   D    K   RGPTG   P     Q  ++G S AP     A
Sbjct: 576 GGPGGPGPKGDKGEPGSSGVDGAPGKDGPRGPTGPIGPPGPAGQPGDKGESGAPGVPGIA 635

Query: 682 RERGGRNQDNAAG 644
             RGG  +    G
Sbjct: 636 GPRGGPGERGEQG 648
 Score = 32.1 bits (71), Expect = 8.6
 Identities = 20/59 (33%), Positives = 23/59 (38%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRN 662
           G  GPP  I  GP  +D  S +P   GP G P P   +      G       R   GRN
Sbjct: 60  GPPGPPGAI--GPSGKDGESGRPGRPGPRGFPGPPGMKGPAGMPGFPGMKGHRGFDGRN 116
>pir||T27525 hypothetical protein ZC373.7 - Caenorhabditis elegans
 emb|CAA88979.1| (Z49131) similar to cuticular collagen~cDNA EST yk94d7.3 comes from
           this gene~cDNA EST yk94d7.5 comes from this gene~cDNA
           EST yk291h6.5 comes from this gene [Caenorhabditis
           elegans]
          Length = 297

 Score = 32.8 bits (73), Expect = 5.0
 Identities = 27/65 (41%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
 Frame = -3

Query: 844 GGGSSGPP-ARIEAGP-----GAQDNGSA-KPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
           G GS+G P A  +AGP     G  +NGSA  P   GP G P    ++  D + G   AP 
Sbjct: 215 GKGSAGAPGAPGKAGPAGPAGGPGNNGSAGTPGPAGPAGNPGTPGNKGSDGHPGTPGAP- 273

Query: 685 ARERGGRNQDNA 650
               GG   D A
Sbjct: 274 ----GGPGHDAA 281
>dbj|BAA90381.1| (AP001081) hypothetical protein [Oryza sativa]
          Length = 235

 Score = 32.8 bits (73), Expect = 5.0
 Identities = 21/61 (34%), Positives = 26/61 (42%), Gaps = 5/61 (8%)
 Frame = -3

Query: 841 GGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWA-----RE 677
           GG  GP     +G    D G   P +RG  GGP           +GG   P       R 
Sbjct: 157 GGDGGPGG---SGGRGGDGGQGGPGQRGGDGGPGGAGGPGGHGGDGGDGGPSGSPLRKRR 213

Query: 676 RGGRNQ 659
           RGGR++
Sbjct: 214 RGGRSR 219
>pir||A41726 homeotic protein BarH2 - fruit fly (Drosophila melanogaster)
 gb|AAB59218.1| (M82887) dual bar protein [Drosophila melanogaster]
          Length = 640

 Score = 32.8 bits (73), Expect = 5.0
 Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
 Frame = +1

Query: 508 QEQRHMHRNQDRRHIQPGFQELVAAMEQRHLKHRHMLDRSSYHRHY-QRHYP 660
           Q+Q+H H  Q ++H Q   Q+ +   +Q+ L+     +R     HY +RH P
Sbjct: 113 QQQQHHHHQQQQQHQQAALQQYI-VQQQQLLRFEREREREREREHYRERHSP 163
>gb|AAC67561.1| (AF095462) fibrinogen A-alpha chain [Equus caballus]
          Length = 481

 Score = 32.5 bits (72), Expect = 6.6
 Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPT--GGPAPWRSRNQDSNEGGSAAP 689
           GS GP +     PG+   GS+ PW  G +  G  + W   N  S+E GS  P
Sbjct: 130 GSYGPGSASTWNPGSSQPGSSGPWTSGSSGLGSASTW---NPGSSEPGSDGP 178
>sp|P20631|CC13_CAEEL CUTICLE COLLAGEN 13 PRECURSOR
 pir||S08170 collagen col-13 precursor - Caenorhabditis elegans
 emb|CAA35955.1| (X51623) collagen [Caenorhabditis elegans]
 emb|CAA98258.1| (Z73972) predicted using Genefinder~similar to collagen
           [Caenorhabditis elegans]
          Length = 316

 Score = 32.5 bits (72), Expect = 6.6
 Identities = 20/50 (40%), Positives = 23/50 (46%), Gaps = 4/50 (8%)
 Frame = -3

Query: 838 GSSGPPARI--EAGPGA--QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
           G SG P +      PGA  Q  G+A P   GP G P P      + N G   AP
Sbjct: 179 GPSGAPGQKGPSGAPGAPGQSGGAALPGPPGPAGPPGPAGQPGSNGNAGAPGAP 232
>gb|AAF60775.1| (AC024811) contains similarity to Pfam family PF01391 (Collagen
           triple helix repeat (20 copies)), score=73.8, E=3.5e-18,
           N=2 [Caenorhabditis elegans]
          Length = 285

 Score = 32.5 bits (72), Expect = 6.6
 Identities = 21/58 (36%), Positives = 24/58 (41%), Gaps = 2/58 (3%)
 Frame = -3

Query: 841 GGSSGPPAR--IEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGG 668
           GG  GPP +  I   PG      A P  +GP+G P P     Q    G    P A   GG
Sbjct: 148 GGPPGPPGQDGIPGNPGRNGEDGA-PGPQGPSGPPGPPGQPGQPGQRGPPGEPGALLPGG 206
>sp|P20630|CC12_CAEEL CUTICLE COLLAGEN 12 PRECURSOR
 pir||S08169 collagen col-12 precursor - Caenorhabditis elegans
 emb|CAA35954.1| (X51622) collagen [Caenorhabditis elegans]
 emb|CAA98257.1| (Z73972) predicted using Genefinder~similar to collagen~cDNA EST
           yk120e2.3 comes from this gene~cDNA EST yk72b10.3 comes
           from this gene~cDNA EST yk72b10.5 comes from this
           gene~cDNA EST yk120e2.5 comes from this gene~cDNA EST
           CEMSG34FB comes from this g>
          Length = 316

 Score = 32.5 bits (72), Expect = 6.6
 Identities = 20/50 (40%), Positives = 23/50 (46%), Gaps = 4/50 (8%)
 Frame = -3

Query: 838 GSSGPPARI--EAGPGA--QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
           G SG P +      PGA  Q  G+A P   GP G P P      + N G   AP
Sbjct: 179 GPSGAPGQKGPSGAPGAPGQSGGAALPGPPGPAGPPGPAGQPGSNGNAGAPGAP 232
>gb|AAF55419.1| (AE003717) CG5866 gene product [Drosophila melanogaster]
          Length = 206

 Score = 32.5 bits (72), Expect = 6.6
 Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 1/59 (1%)
 Frame = +1

Query: 514 QRHMHRNQDRRHIQPGFQELVAAMEQRHLKHRHMLDRSSYH-RHYQRHYPGCDHRVRGPR 690
           Q H H +   + ++  FQE+  +  Q  + H H+   + YH  H   H+  C   +  PR
Sbjct: 111 QHHYHGSHPNQQLEAPFQEVHGSHHQHPIHHNHL--EAPYHGTHGVHHHRNCHATIHCPR 168
>pir||A27353 collagen alpha 1(III) chain precursor - mouse (fragment)
 gb|AAA37338.1| (M18933) alpha-1 type-III collagen precursor [Mus musculus]
          Length = 488

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 15/34 (44%), Positives = 18/34 (52%), Gaps = 1/34 (2%)
 Frame = -3

Query: 838 GSSGPPARI-EAGPGAQDNGSAKPWERGPTGGPAP 737
           G  GPP  +  AGP  +D  S +P   GP G P P
Sbjct: 212 GPPGPPGALGPAGPAGKDGESGRPGRPGPRGLPGP 246
>gb|AAF76432.1|AF272661_1 (AF272661) alpha 4 type V collagen [Rattus norvegicus]
          Length = 1737

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 20/65 (30%), Positives = 24/65 (36%), Gaps = 1/65 (1%)
 Frame = -3

Query: 838 GSSGPPARI-EAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRN 662
           GS GPP      GP  +      P   GP G P P   +    N G        E+G R 
Sbjct: 674 GSEGPPGHPGHEGPTGEKGAQGPPGSAGPQGYPGPRGVKGTSGNRGLQG-----EKGERG 728

Query: 661 QDNAAG 644
           +D   G
Sbjct: 729 EDGFPG 734
>pir||T20720 hypothetical protein F10F2.9 - Caenorhabditis elegans
 emb|CAA84658.1| (Z35598) Asparagine, Serine and Glycine rich predicted protein
           [Caenorhabditis elegans]
 emb|CAA86461.1| (Z46343) Asparagine, Serine and Glycine rich predicted protein
           [Caenorhabditis elegans]
          Length = 549

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 26/74 (35%), Positives = 31/74 (41%), Gaps = 9/74 (12%)
 Frame = -3

Query: 862 ELMQELGGGSSGPPARIEAGPGAQDN--------GSAKPWERGPTGGPAPWRSRNQDSNE 707
           EL    GG + G  +   +G G  DN         S   W  G TGG       N   N+
Sbjct: 334 ELSNNWGGSNGGSGSNGGSGGGNTDNDNWGSNNGNSGGSWGNGGTGGSGGSGGGNWGDND 393

Query: 706 G-GSAAPWARERGGRNQDNAAGN 641
             GS+  W    G  NQDN   N
Sbjct: 394 NYGSSNKW---NGNGNQDNDNDN 413
>pir||T00041 BH-protocadherin PCDH7 (clone BH-Pcdh-b) - human
 dbj|BAA25195.1| (AB006756) PCDH7 (BH-Pcdh)b [Homo sapiens]
          Length = 1072

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 27/71 (38%), Positives = 35/71 (49%), Gaps = 7/71 (9%)
 Frame = -3

Query: 874 IRHEELMQELGGGSSGPPARIEAG-------PGAQDNGSAKPWERGPTGGPAPWRSRNQD 716
           I   EL+QE GGG SG  +R  AG       PG   NG+        +GG +    R  D
Sbjct: 177 IERYELLQEPGGGGSGGESR-RAGAADSAPYPGGGGNGA--------SGGGSGGSKRRLD 227

Query: 715 SNEGGSAAPWARERGGRN 662
           ++EGG         GGR+
Sbjct: 228 ASEGGGGT----NPGGRS 241
>pir||G02127 fus-like protein - human (fragment)
 gb|AAA79948.1| (U36561) fus-like protein [Homo sapiens]
          Length = 528

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 21/67 (31%), Positives = 30/67 (44%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
           GGGS G      +G G    G       G +GG   +   NQD + GG      ++RGGR
Sbjct: 166 GGGSYGQDQSSMSGSGGGGGGGGG----GGSGGGGGYG--NQDQSGGGGGGYGQQDRGGR 219

Query: 664 NQDNAAG 644
            +  ++G
Sbjct: 220 GRGRSSG 226
>sp|P10569|MYSC_ACACA MYOSIN IC HEAVY CHAIN
 pir||MWAXIC myosin heavy chain IC - Acanthamoeba castellanii
 gb|AAA27707.1| (J02974) myosin IB heavy chain [Acanthamoeba castellanii]
          Length = 1168

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 22/75 (29%), Positives = 32/75 (42%)
 Frame = -3

Query: 865  EELMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
            ++++   GGG  G   R   GP      S +P   G  GGP+P+  R   S    +A+  
Sbjct: 919  DQILGAKGGGGGGGRGR--GGPSPSGAVSPRPSPGGGGGGPSPFGGRPSPSGPPAAASAP 976

Query: 685  ARERGGRNQDNAAGN 641
              E+     D AA N
Sbjct: 977  GPEQARALYDFAAEN 991
>gb|AAC58530.1| (U85043) gag polyprotein [feline syncytial virus]
          Length = 489

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 20/61 (32%), Positives = 24/61 (38%), Gaps = 5/61 (8%)
 Frame = -3

Query: 871 RHEELMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRS-----RNQDSNE 707
           R+ +  Q  G G  GP      G G        P  RGP  GP P  +     R Q    
Sbjct: 420 RNPQQPQRYGQGPPGPNPYRRFGDGGNPQQQGPPPNRGPDQGPRPGGNPRGGGRGQGPRN 479

Query: 706 GGSAAP 689
           GG + P
Sbjct: 480 GGGSVP 485
>dbj|BAA33381.1| (AB008374) alpha 3 type I collagen [Oncorhynchus mykiss]
          Length = 678

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 21/57 (36%), Positives = 26/57 (44%), Gaps = 6/57 (10%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNG------SAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARE 677
           GS+GPP    AG   Q  G      + +P E G  G P P  +     N+G   AP    
Sbjct: 104 GSAGPPG--PAGKEGQKGGRGETGIAGRPGEAGAAGPPGPSGASGAKGNDGPMGAPGTPG 161

Query: 676 RGG 668
            GG
Sbjct: 162 PGG 164
>pir||T35089 probable integral membrane transport protein - Streptomyces
           coelicolor
 emb|CAB51452.1| (AL096884) putative integral membrane transport protein
           [Streptomyces coelicolor A3(2)]
          Length = 306

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 27/76 (35%), Positives = 36/76 (46%), Gaps = 6/76 (7%)
 Frame = -1

Query: 813 SRPALALKITAARSPGS------EVLPADLHHGAAVTKILTKEDRQPPGPANAVVATRIM 652
           S PA A   TA+  P         V+PAD   GAA       E R+  GPA      R+ 
Sbjct: 14  SAPARA---TASNDPSEGELRDVSVVPADALRGAAPAA----EGRRAAGPAALGPKARLW 66

Query: 651 PLVMAVIATTVKHMAVLQVPLL 586
           P ++AV    +    V ++PLL
Sbjct: 67  PSLVAVYRAQLSRARVARIPLL 88
>pir||T08435 la costa protein - fruit fly (Drosophila melanogaster)
 gb|AAC28405.1| (AF017777) la costa [Drosophila melanogaster]
 gb|AAF50816.1| (AE003568) lcs gene product [Drosophila melanogaster]
          Length = 145

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 19/52 (36%), Positives = 19/52 (36%)
 Frame = -3

Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
           G G  G P R   GPG    G   P  RGP GGP            GG   P
Sbjct: 28  GPGGPGGPGRGRGGPGRGPGGPGGPGGRGP-GGPGGPGGPGGPGGPGGPGGP 78
>pir||T29982 hypothetical protein F11G11.12 - Caenorhabditis elegans
 gb|AAB37843.1| (U80451) Similar to collagen [Caenorhabditis elegans]
          Length = 285

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 23/65 (35%), Positives = 27/65 (41%), Gaps = 1/65 (1%)
 Frame = -3

Query: 838 GSSGPPARIEA-GPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRN 662
           G+ G P    A GP  +D    +P   GP G P P     QD   G   AP   E G   
Sbjct: 211 GNPGQPGEPGAQGPPGEDG---RPGNSGPQGPPGPQGEPGQDGAPGNPGAP--GEAGEPG 265

Query: 661 QDNAAG 644
           +D A G
Sbjct: 266 KDGAKG 271
>ref|NP_037465.1| EH domain-binding mitotic phosphoprotein
 gb|AAD38326.1|AF073727_1 (AF073727) EH domain-binding mitotic phosphoprotein [Homo sapiens]
          Length = 551

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 20/58 (34%), Positives = 23/58 (39%), Gaps = 3/58 (5%)
 Frame = -3

Query: 841 GGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPA--PWRSRNQD-SNEGGSAAPWARERG 671
           GG   PPA    G  A    S  PW      GP+  PW       + EG +  PW    G
Sbjct: 272 GGPPVPPAADPWGGPAPTPASGDPWRPAAPAGPSVDPWGGTPAPAAGEGPTPDPWGSSDG 331

Query: 670 G 668
           G
Sbjct: 332 G 332
>pir||T00042 BH-protocadherin PCDH7 (clone BH-Pcdh-c) - human
 dbj|BAA25196.1| (AB006757) PCDH7 (BH-Pcdh)c [Homo sapiens]
          Length = 1200

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 27/71 (38%), Positives = 35/71 (49%), Gaps = 7/71 (9%)
 Frame = -3

Query: 874 IRHEELMQELGGGSSGPPARIEAG-------PGAQDNGSAKPWERGPTGGPAPWRSRNQD 716
           I   EL+QE GGG SG  +R  AG       PG   NG+        +GG +    R  D
Sbjct: 177 IERYELLQEPGGGGSGGESR-RAGAADSAPYPGGGGNGA--------SGGGSGGSKRRLD 227

Query: 715 SNEGGSAAPWARERGGRN 662
           ++EGG         GGR+
Sbjct: 228 ASEGGGGT----NPGGRS 241
>ref|NP_002580.1| BH-protocadherin (brain-heart)
 pir||T00040 BH-protocadherin PCDH7 - human
 dbj|BAA25194.1| (AB006755) PCDH7 (BH-Pcdh)a [Homo sapiens]
          Length = 1069

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 27/71 (38%), Positives = 35/71 (49%), Gaps = 7/71 (9%)
 Frame = -3

Query: 874 IRHEELMQELGGGSSGPPARIEAG-------PGAQDNGSAKPWERGPTGGPAPWRSRNQD 716
           I   EL+QE GGG SG  +R  AG       PG   NG+        +GG +    R  D
Sbjct: 177 IERYELLQEPGGGGSGGESR-RAGAADSAPYPGGGGNGA--------SGGGSGGSKRRLD 227

Query: 715 SNEGGSAAPWARERGGRN 662
           ++EGG         GGR+
Sbjct: 228 ASEGGGGT----NPGGRS 241
>dbj|BAB01598.1| (AB046016) unnamed protein product [Macaca fascicularis]
          Length = 344

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 12/19 (63%), Positives = 15/19 (78%), Gaps = 17/19 (89%)
 Frame = -3

Query: 841 GGSSG--PPARI----EAGPGAQD-----------NGSAKPWERGP--TGGPAPWRSRNQ 719
           GG+ G  PPAR     + GPGA+D            GSA P E  P   GGP   R+  Q
Sbjct: 168 GGARGRSPPARAAGGAQPGPGAEDVQPAGRPGAPTAGSAPPLEGRPQGAGGPGALRAEGQ 227

Query: 718 DS 713
           DS
Sbjct: 228 DS 229
>pir||T21070 hypothetical protein F17C8.2 - Caenorhabditis elegans
 emb|CAA84800.1| (Z35719) similar to cuticular collagen~cDNA EST yk170g12.5 comes
           from this gene~cDNA EST yk125f8.5 comes from this
           gene~cDNA EST yk125f8.3 comes from this gene~cDNA EST
           yk170g12.3 comes from this gene~cDNA EST yk191e5.3 comes
           from this gene~cDNA EST yk>
          Length = 296

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 23/66 (34%), Positives = 27/66 (40%), Gaps = 8/66 (12%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQ--------DNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWA 683
           G SGP    EAGP  +        + G  +P   GP G P P          G   AP  
Sbjct: 189 GDSGPNG--EAGPNGEPGAPGKDGEKGKGEPGPAGPPGPPGPGGPPGDAGGAGSDGAP-- 244

Query: 682 RERGGRNQDNAAGN 641
             +G   QD   GN
Sbjct: 245 GPQGPPGQDGTPGN 258
>gb|AAC98089.1| (AF051353) myosin IC heavy chain [Acanthamoeba castellanii]
          Length = 1186

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 22/75 (29%), Positives = 32/75 (42%)
 Frame = -3

Query: 865  EELMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
            ++++   GGG  G   R   GP      S +P   G  GGP+P+  R   S    +A+  
Sbjct: 937  DQILGAKGGGGGGGRGR--GGPSPSGAVSPRPSPGGGGGGPSPFGGRPSPSGPPAAASAP 994

Query: 685  ARERGGRNQDNAAGN 641
              E+     D AA N
Sbjct: 995  GPEQARALYDFAAEN 1009
>dbj|BAA07813.1| (D43756) fibrinogen A-alpha-chain [Canis familiaris]
          Length = 443

 Score = 32.1 bits (71), Expect = 8.6
 Identities = 21/52 (40%), Positives = 25/52 (47%), Gaps = 7/52 (13%)
 Frame = -3

Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPT-------GGPAPWRSRNQDSNEGGSAAPWA 683
           GSS P +     PG+   GSA PW  G T       G    W +R   S   GSA  W+
Sbjct: 112 GSSTPGSAGTWNPGSTGPGSAGPWSSGSTRPGSTGPGSAGTWSTR-PGSTGPGSAGTWS 169
Database: nr Posted date: Sep 29, 2000 9:53 PM Number of letters in database: 177,575,912 Number of sequences in database: 565,281 Lambda K H 0.318 0.135 0.00 Gapped Lambda K H 0.270 0.0470 4.94e-324 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 247189426 Number of Sequences: 565281 Number of extensions: 5128878 Number of successful extensions: 21429 Number of sequences better than 10.0: 118 Number of HSP's better than 10.0 without gapping: 6 Number of HSP's successfully gapped in prelim test: 56 Number of HSP's that attempted gapping in prelim test: 21094 Number of HSP's gapped (non-prelim): 323 length of query: 292 length of database: 177,575,912 effective HSP length: 54 effective length of query: 237 effective length of database: 147,050,738 effective search space: 34851024906 effective search space used: 34851024906 frameshift window, decay const: 50, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.7 bits) S2: 71 (32.1 bits)