Score E
Sequences producing significant alignments: (bits) Value
gb|AAA18616.1| (U09738) circumsporozoite protein [Plasmodiu... 38 0.11
gb|AAA18617.1| (U09765) circumsporozoite protein [Plasmodiu... 38 0.15
pir||T41653 probable transcription or splicing factor - fis... 38 0.20
pir||S23809 collagen alpha 2(I) chain homolog - sea urchin ... 38 0.20
pir||T20450 hypothetical protein E04D5.1 - Caenorhabditis e... 37 0.34
pir||B42856 ubiquitin carrier protein E2 - human 36 0.44
ref|NP_055316.1| ubiquitin carrier protein >gi|2501433|sp|Q... 36 0.44
pir||S42731 collagen alpha 1 chain - sea urchin (Hemicentro... 36 0.58
sp|P17656|CC02_CAEEL CUTICLE COLLAGEN 2 >gi|84426|pir||B312... 36 0.76
pir||T26004 hypothetical protein ZK863.2 - Caenorhabditis e... 36 0.76
pir||A36226 collagen alpha 1 chain - sea urchin (Paracentro... 35 1.3
pir||T20497 hypothetical protein F02D10.1 - Caenorhabditis ... 35 1.3
pir||T35861 probable large secreted protein - Streptomyces ... 34 1.7
pir||T18637 hypothetical protein B0024.1 - Caenorhabditis e... 34 1.7
gb|AAF36091.1| (AF218624) flagelliform silk protein [Nephil... 34 1.7
sp|P46804|SPD2_NEPCL SPIDROIN 2 (DRAGLINE SILK FIBROIN 2) >... 34 2.2
pir||T32734 myosin-IA - Acanthamoeba castellanii >gi|359947... 34 2.2
pir||I40333 tracheal colonization factor A precursor - Bord... 34 2.2
emb|CAA08832.1| (AJ009785) tracheal colonization factor [Bo... 34 2.2
pir||T29731 hypothetical protein F41F3.4 - Caenorhabditis e... 34 2.2
pir||T15143 hypothetical protein T28F2.8 - Caenorhabditis e... 34 2.9
gb|AAF84591.1|AE004000_8 (AE004000) hypothetical protein [X... 34 2.9
gb|AAC67560.1| (AF095461) fibrinogen A-alpha chain [Felis c... 34 2.9
gb|AAC33847.1| (AF043944) nongradient byssal precursor [Myt... 33 3.8
dbj|BAA07815.1| (D43758) fibrinogen A-alpha-chain [Macaca m... 33 3.8
pir||I50694 collagen alpha 1(III) chain - chicken (fragment... 33 3.8
gb|AAF82207.1|AC067971_15 (AC067971) EST gb|AI998275 comes ... 33 3.8
pir||T42310 hypothetical protein - phage SPP1 >gi|2764887|e... 33 3.8
gb|AAF36092.1| (AF218624) flagelliform silk protein [Nephil... 33 3.8
gb|AAF52203.1| (AE003608) vkg gene product [Drosophila mela... 33 5.0
pir||S74598 hypothetical protein sll1040 - Synechocystis sp... 33 5.0
gb|AAF47772.1| (AE003478) sty gene product [Drosophila mela... 33 5.0
sp|P04258|CA13_BOVIN COLLAGEN ALPHA 1(III) CHAIN >gi|71410|... 33 5.0
pir||T27525 hypothetical protein ZC373.7 - Caenorhabditis e... 33 5.0
dbj|BAA90381.1| (AP001081) hypothetical protein [Oryza sativa] 33 5.0
pir||A41726 homeotic protein BarH2 - fruit fly (Drosophila ... 33 5.0
gb|AAC67561.1| (AF095462) fibrinogen A-alpha chain [Equus c... 32 6.6
sp|P20631|CC13_CAEEL CUTICLE COLLAGEN 13 PRECURSOR >gi|8443... 32 6.6
gb|AAF60775.1| (AC024811) contains similarity to Pfam famil... 32 6.6
sp|P20630|CC12_CAEEL CUTICLE COLLAGEN 12 PRECURSOR >gi|8442... 32 6.6
gb|AAF55419.1| (AE003717) CG5866 gene product [Drosophila m... 32 6.6
pir||A27353 collagen alpha 1(III) chain precursor - mouse (... 32 8.6
gb|AAF76432.1|AF272661_1 (AF272661) alpha 4 type V collagen... 32 8.6
pir||T20720 hypothetical protein F10F2.9 - Caenorhabditis e... 32 8.6
pir||T00041 BH-protocadherin PCDH7 (clone BH-Pcdh-b) - huma... 32 8.6
pir||G02127 fus-like protein - human (fragment) >gi|1040970... 32 8.6
sp|P10569|MYSC_ACACA MYOSIN IC HEAVY CHAIN >gi|71609|pir||M... 32 8.6
gb|AAC58530.1| (U85043) gag polyprotein [feline syncytial v... 32 8.6
dbj|BAA33381.1| (AB008374) alpha 3 type I collagen [Oncorhy... 32 8.6
pir||T35089 probable integral membrane transport protein - ... 32 8.6
pir||T08435 la costa protein - fruit fly (Drosophila melano... 32 8.6
pir||T29982 hypothetical protein F11G11.12 - Caenorhabditis... 32 8.6
ref|NP_037465.1| EH domain-binding mitotic phosphoprotein >... 32 8.6
pir||T00042 BH-protocadherin PCDH7 (clone BH-Pcdh-c) - huma... 32 8.6
ref|NP_002580.1| BH-protocadherin (brain-heart) >gi|7512301... 32 8.6
dbj|BAB01598.1| (AB046016) unnamed protein product [Macaca ... 32 8.6
pir||T21070 hypothetical protein F17C8.2 - Caenorhabditis e... 32 8.6
gb|AAC98089.1| (AF051353) myosin IC heavy chain [Acanthamoe... 32 8.6
dbj|BAA07813.1| (D43756) fibrinogen A-alpha-chain [Canis fa... 32 8.6
>gb|AAA18616.1| (U09738) circumsporozoite protein [Plasmodium vivax-like sp.]
Length = 393
Score = 38.3 bits (87), Expect = 0.11
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q++G+A P G AP ++ EGG+AAP A +
Sbjct: 104 GGAAAPGANQEGGAAAPGANQEDGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 158
Query: 673 GG 668
GG
Sbjct: 159 GG 160
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 181 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 235
Query: 673 GG 668
GG
Sbjct: 236 GG 237
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 159 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 213
Query: 673 GG 668
GG
Sbjct: 214 GG 215
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 170 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 224
Query: 673 GG 668
GG
Sbjct: 225 GG 226
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 137 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 191
Query: 673 GG 668
GG
Sbjct: 192 GG 193
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 148 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 202
Query: 673 GG 668
GG
Sbjct: 203 GG 204
Score = 37.1 bits (84), Expect = 0.26
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 192 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQG 246
Query: 673 GG 668
GG
Sbjct: 247 GG 248
Score = 35.6 bits (80), Expect = 0.76
Identities = 28/67 (41%), Positives = 38/67 (55%), Gaps = 15/67 (22%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDS------NEGGSAA 692
GG++ P A E G PGA Q G+A P G AP ++ + EGG+AA
Sbjct: 225 GGAAAPGANQEGGAAAPGANQGGGAAAPGANQGGGAAAPGANQGGGAAAPGANQEGGAAA 284
Query: 691 PWARE-----RGGRNQDNAAGN 641
P A + GG+ Q+N N
Sbjct: 285 PGANQGGAKPAGGQGQNNEGAN 306
Score = 34.8 bits (78), Expect = 1.3
Identities = 24/58 (41%), Positives = 32/58 (54%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ GG+AAP A +
Sbjct: 214 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQGGGAAAPGANQG-----GGAAAPGANQG 268
Query: 673 GG 668
GG
Sbjct: 269 GG 270
>gb|AAA18617.1| (U09765) circumsporozoite protein [Plasmodium simiovale]
Length = 393
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 148 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 202
Query: 673 GG 668
GG
Sbjct: 203 GG 204
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 126 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 180
Query: 673 GG 668
GG
Sbjct: 181 GG 182
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 115 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 169
Query: 673 GG 668
GG
Sbjct: 170 GG 171
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 104 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 158
Query: 673 GG 668
GG
Sbjct: 159 GG 160
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 159 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 213
Query: 673 GG 668
GG
Sbjct: 214 GG 215
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 170 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 224
Query: 673 GG 668
GG
Sbjct: 225 GG 226
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 181 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 235
Query: 673 GG 668
GG
Sbjct: 236 GG 237
Score = 37.9 bits (86), Expect = 0.15
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 137 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQE 191
Query: 673 GG 668
GG
Sbjct: 192 GG 193
Score = 37.1 bits (84), Expect = 0.26
Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ EGG+AAP A +
Sbjct: 192 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQEGGAAAPGANQ-----EGGAAAPGANQG 246
Query: 673 GG 668
GG
Sbjct: 247 GG 248
Score = 35.6 bits (80), Expect = 0.76
Identities = 28/67 (41%), Positives = 38/67 (55%), Gaps = 15/67 (22%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDS------NEGGSAA 692
GG++ P A E G PGA Q G+A P G AP ++ + EGG+AA
Sbjct: 225 GGAAAPGANQEGGAAAPGANQGGGAAAPGANQGGGAAAPGANQGGGAAAPGANQEGGAAA 284
Query: 691 PWARE-----RGGRNQDNAAGN 641
P A + GG+ Q+N N
Sbjct: 285 PGANQGGAKSAGGQGQNNEGAN 306
Score = 34.8 bits (78), Expect = 1.3
Identities = 24/58 (41%), Positives = 32/58 (54%), Gaps = 4/58 (6%)
Frame = -3
Query: 841 GGSSGPPARIEAG---PGA-QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GG++ P A E G PGA Q+ G+A P G AP ++ GG+AAP A +
Sbjct: 214 GGAAAPGANQEGGAAAPGANQEGGAAAPGANQGGGAAAPGANQG-----GGAAAPGANQG 268
Query: 673 GG 668
GG
Sbjct: 269 GG 270
>pir||T41653 probable transcription or splicing factor - fission yeast
(Schizosaccharomyces pombe)
emb|CAA20438.1| (AL031323) putative transcription or splicing factor
[Schizosaccharomyces pombe]
gb|AAF02214.1|AF073779_1 (AF073779) putative splicing factor BBP/SF1 [Schizosaccharomyces
pombe]
Length = 587
Score = 37.5 bits (85), Expect = 0.20
Identities = 25/74 (33%), Positives = 36/74 (47%), Gaps = 3/74 (4%)
Frame = -3
Query: 868 HEELMQELGGGSSGPPARIEAGPGAQ--DNGSAKPWERGPTGGPAPWRSRNQD-SNEGGS 698
++ LMQELGGGS+ E + ++G+A P G P PW + + S+ S
Sbjct: 367 YQSLMQELGGGSAISNGNGEPQKSIEFSESGAASP----QAGHPPPWAAASTSVSSSTSS 422
Query: 697 AAPWARERGGRNQDNAA 647
APWA+ N A
Sbjct: 423 PAPWAKPASSAAPSNPA 439
>pir||S23809 collagen alpha 2(I) chain homolog - sea urchin (Strongylocentrotus
purpuratus)
gb|AAA30035.1| (M92040) alpha-1 collagen [Strongylocentrotus purpuratus]
Length = 1414
Score = 37.5 bits (85), Expect = 0.20
Identities = 20/65 (30%), Positives = 25/65 (37%), Gaps = 1/65 (1%)
Frame = -3
Query: 841 GGSSGPP-ARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
GG +GPP A GP + P GP G P P ++G AP + G
Sbjct: 938 GGPAGPPGAAGSRGPAGKSGDRGSPGAVGPAGNPGPAGENGMPGSDGNDGAPGPQGSRGE 997
Query: 664 NQDNAA 647
D A
Sbjct: 998 KGDTGA 1003
>pir||T20450 hypothetical protein E04D5.1 - Caenorhabditis elegans
emb|CAA91279.1| (Z66496) cDNA EST yk84b6.3 comes from this gene; cDNA EST yk84b6.5
comes from this gene~cDNA EST yk117a5.5 comes from this
gene; cDNA EST yk132f3.3 comes from this gene~cDNA EST
yk132f3.5 comes from this gene; cDNA EST yk165g1.3 comes
from this gene~cD>
Length = 618
Score = 36.7 bits (83), Expect = 0.34
Identities = 16/43 (37%), Positives = 23/43 (53%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQD 716
GGGS+GPP+ PG Q+ A+P G P P+R + +
Sbjct: 525 GGGSAGPPSAAAPTPGNQNQRPAQPRANGNGNAPQPFRPQQSE 567
>pir||B42856 ubiquitin carrier protein E2 - human
Length = 247
Score = 36.4 bits (82), Expect = 0.44
Identities = 21/71 (29%), Positives = 28/71 (38%)
Frame = -3
Query: 859 LMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWAR 680
L+ E+ GG+ GP R EAG A + G GGP GG+ P A+
Sbjct: 172 LLTEIHGGAGGPSGRAEAGRALASGTEASSTDPGAPGGP------------GGAEGPMAK 219
Query: 679 ERGGRNQDNAA 647
+ G A
Sbjct: 220 KHAGERDKKLA 230
>ref|NP_055316.1| ubiquitin carrier protein
sp|Q16763|UBCE_HUMAN UBIQUITIN-CONJUGATING ENZYME E2-24 KD (UBIQUITIN-PROTEIN LIGASE)
(UBIQUITIN CARRIER PROTEIN) (E2-EPF5)
gb|AAA58446.1| (M91670) ubiquitin carrier protein [Homo sapiens]
Length = 225
Score = 36.4 bits (82), Expect = 0.44
Identities = 21/71 (29%), Positives = 28/71 (38%)
Frame = -3
Query: 859 LMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWAR 680
L+ E+ GG+ GP R EAG A + G GGP GG+ P A+
Sbjct: 150 LLTEIHGGAGGPSGRAEAGRALASGTEASSTDPGAPGGP------------GGAEGPMAK 197
Query: 679 ERGGRNQDNAA 647
+ G A
Sbjct: 198 KHAGERDKKLA 208
>pir||S42731 collagen alpha 1 chain - sea urchin (Hemicentrotus pulcherrimus)
(fragment)
gb|AAB30065.1| (S70718) fibrillar collagen alpha 120 and 140 chains [Hemicentrotus
pulcherrimus=sea urchins, tests, Peptide, 632 aa]
Length = 632
Score = 36.0 bits (81), Expect = 0.58
Identities = 19/65 (29%), Positives = 25/65 (38%), Gaps = 1/65 (1%)
Frame = -3
Query: 841 GGSSGPPARIEA-GPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
GG +GPP + GP + P GP G P P ++G AP + G
Sbjct: 156 GGPAGPPGEAGSRGPPGKSGERGSPGAVGPPGNPGPAGENGMPGSDGNDGAPGPQGARGE 215
Query: 664 NQDNAA 647
D A
Sbjct: 216 KGDTGA 221
>sp|P17656|CC02_CAEEL CUTICLE COLLAGEN 2
pir||B31219 collagen 2 - Caenorhabditis elegans
emb|CAA23464.1| (V00148) unnamed protein product [Caenorhabditis elegans]
gb|AAA27990.1| (J01048) collagen [Caenorhabditis elegans]
emb|CAA92620.1| (Z68301) predicted using Genefinder~similar to cuticle collagen
(CC02)~cDNA EST yk94a4.5 comes from this gene~cDNA EST
yk94a4.3 comes from this gene~cDNA EST yk68d1.5 comes
from this gene~cDNA EST yk68d1.3 comes from this gene
[Caenorhabditis elegans]
Length = 301
Score = 35.6 bits (80), Expect = 0.76
Identities = 23/65 (35%), Positives = 26/65 (39%), Gaps = 3/65 (4%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEG---GSAAPWARERGG 668
G +GPP GP A P GP G P P D G G P A E+GG
Sbjct: 159 GPAGPPG--PPGPDGNPGSPAGPSGPGPAGPPGPAGPAGNDGAPGAPGGPGEPGASEQGG 216
Query: 667 RNQDNAAG 644
+ AG
Sbjct: 217 PGEPGPAG 224
>pir||T26004 hypothetical protein ZK863.2 - Caenorhabditis elegans
emb|CAB09131.1| (Z95621) similar to collagen~cDNA EST yk93b5.5 comes from this
gene~cDNA EST yk120a5.5 comes from this gene
[Caenorhabditis elegans]
emb|CAB01457.1| (Z78019) similar to collagen~cDNA EST yk93b5.5 comes from this
gene~cDNA EST yk120a5.5 comes from this gene
[Caenorhabditis elegans]
Length = 330
Score = 35.6 bits (80), Expect = 0.76
Identities = 23/66 (34%), Positives = 28/66 (41%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRNQ 659
G +GPP GP G + P GP G P P D N G + A + GG Q
Sbjct: 175 GPAGPPG--PPGPDGAPGGGSGPGPAGPAGPPGP------DGNPGSAGAD--GQPGGPGQ 224
Query: 658 DNAAGN 641
D A G+
Sbjct: 225 DGAPGS 230
>pir||A36226 collagen alpha 1 chain - sea urchin (Paracentrotus lividus)
gb|AAA29438.1| (M25282) alpha collagen type 1 precursor [Paracentrotus lividus]
Length = 730
Score = 34.8 bits (78), Expect = 1.3
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQ-------DNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWAR 680
G+SGPP PG++ D GS P GP G P P Q ++G AP A+
Sbjct: 254 GASGPPGA-PGEPGSRGSHGKSGDRGS--PGAVGPPGNPGPAGENGQPGSDGNDGAPGAQ 310
Query: 679 ERGGRNQDNAA 647
G D A
Sbjct: 311 GPRGEKGDTGA 321
>pir||T20497 hypothetical protein F02D10.1 - Caenorhabditis elegans
emb|CAA91932.1| (Z67990) similar to cuticle collagen [Caenorhabditis elegans]
Length = 316
Score = 34.8 bits (78), Expect = 1.3
Identities = 21/54 (38%), Positives = 24/54 (43%), Gaps = 2/54 (3%)
Frame = -3
Query: 850 ELGGGSSGPPARIEAGPGAQD--NGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
E G G +GPP AGP D +GS GP G P P D N G + P
Sbjct: 232 EAGPGPAGPPG--PAGPAGPDGQSGSGSAGGPGPKGPPGPAGQPGSDGNPGTAGPP 285
>pir||T35861 probable large secreted protein - Streptomyces coelicolor
emb|CAB41562.1| (AL049727) putative large secreted protein [Streptomyces coelicolor
A3(2)]
Length = 877
Score = 34.4 bits (77), Expect = 1.7
Identities = 28/78 (35%), Positives = 35/78 (43%), Gaps = 8/78 (10%)
Frame = -3
Query: 874 IRHEELMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPT-------GGPA-PWRSRNQ 719
I H+ +++E GG+ GP GA NG+ PW PT GPA S
Sbjct: 637 IPHDIVVREPAGGADGP--------GAAANGARAPWVAAPTARDSADPAGPAIDTGSVTG 688
Query: 718 DSNEGGSAAPWARERGGRNQDNAAGN 641
E G AAP A AAG+
Sbjct: 689 TGTETGQAAPGAAAGRSATASAAAGD 714
>pir||T18637 hypothetical protein B0024.1 - Caenorhabditis elegans
emb|CAA94874.1| (Z71178) similar to collagen [Caenorhabditis elegans]
Length = 297
Score = 34.4 bits (77), Expect = 1.7
Identities = 24/66 (36%), Positives = 27/66 (40%), Gaps = 2/66 (3%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQD--NGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERG 671
G GS G P R PG Q +P ERGP+G P Q N G P A
Sbjct: 216 GHGSPGAPGRA-GQPGRQGAPGNPGRPGERGPSGPCGPAGRSGQPGNRGSDGHPGAPGNP 274
Query: 670 GRNQDNAA 647
G +AA
Sbjct: 275 GLQGSDAA 282
>gb|AAF36091.1| (AF218624) flagelliform silk protein [Nephila madagascariensis]
Length = 1884
Score = 34.4 bits (77), Expect = 1.7
Identities = 26/65 (40%), Positives = 28/65 (43%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
G G SGP AGPG G A P GP GG P + S GG A P RGG
Sbjct: 1591 GPGGSGPGG---AGPGGAGPGGAGPGGAGP-GGVGPGGAGPGGSGPGG-AGPGGAGRGGA 1645
Query: 664 NQDNA 650
+ A
Sbjct: 1646 GRGGA 1650
>sp|P46804|SPD2_NEPCL SPIDROIN 2 (DRAGLINE SILK FIBROIN 2)
pir||A44112 spidroin 2, dragline silk fibroin - orb spider (Nephila clavipes)
(fragment)
gb|AAA29381.1| (M92913) dragline silk fibroin [Nephila clavipes]
Length = 627
Score = 34.0 bits (76), Expect = 2.2
Identities = 24/67 (35%), Positives = 29/67 (42%), Gaps = 1/67 (1%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTG-GPAPWRSRNQDSNEGGSAAPWARERGG 668
G GS+ A +GPG Q G P ++GP G GP Q + GSAA A G
Sbjct: 160 GPGSAAAAAAAASGPGQQGPGGYGPGQQGPGGYGPG-----QQGPSGPGSAAAAAAAASG 214
Query: 667 RNQDNAAG 644
Q G
Sbjct: 215 PGQQGPGG 222
>pir||T32734 myosin-IA - Acanthamoeba castellanii
gb|AAC35357.1| (AF085185) Myosin-IA [Acanthamoeba castellanii]
Length = 1215
Score = 34.0 bits (76), Expect = 2.2
Identities = 22/66 (33%), Positives = 23/66 (34%), Gaps = 2/66 (3%)
Frame = -3
Query: 841 GGSSGPPARIEAGPGAQD--NGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGG 668
GG GPP G GA G+ P GP G P R GG P GG
Sbjct: 1046 GGPGGPPGPAGPGRGAPGPGRGAPGPSRGGPGGPPPGGRGMPPPGGRGGPGGPGPAGPGG 1105
Query: 667 RNQDNAAG 644
R G
Sbjct: 1106 RGMPAPGG 1113
Score = 32.1 bits (71), Expect = 8.6
Identities = 22/67 (32%), Positives = 24/67 (34%), Gaps = 2/67 (2%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPW--RSRNQDSNEGGSAAPWARERG 671
G G+ GP GP G P RG GGP P R + GG P
Sbjct: 1065 GRGAPGPSRGGPGGPPPGGRGMPPPGGRGGPGGPGPAGPGGRGMPAPGGGRGGPGGPGPA 1124
Query: 670 GRNQDNAAG 644
GR AG
Sbjct: 1125 GRGGPGPAG 1133
>pir||I40333 tracheal colonization factor A precursor - Bordetella pertussis
gb|AAC43453.1| (U16754) tracheal colonization factor [Bordetella pertussis]
Length = 672
Score = 34.0 bits (76), Expect = 2.2
Identities = 24/59 (40%), Positives = 27/59 (45%), Gaps = 3/59 (5%)
Frame = -3
Query: 844 GGGSSG---PPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GGG G PPA G G NG+A+ ERG GP P EGG P +
Sbjct: 289 GGGDEGQRPPPAAGNGGNGG--NGNAQLPERGDDAGPKP------PEGEGGDEGPQPPQG 340
Query: 673 GG 668
GG
Sbjct: 341 GG 342
>emb|CAA08832.1| (AJ009785) tracheal colonization factor [Bordetella pertussis]
Length = 647
Score = 34.0 bits (76), Expect = 2.2
Identities = 24/59 (40%), Positives = 27/59 (45%), Gaps = 3/59 (5%)
Frame = -3
Query: 844 GGGSSG---PPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER 674
GGG G PPA G G NG+A+ ERG GP P EGG P +
Sbjct: 264 GGGDEGQRPPPAAGNGGNGG--NGNAQLPERGDDAGPKP------PEGEGGDEGPQPPQG 315
Query: 673 GG 668
GG
Sbjct: 316 GG 317
>pir||T29731 hypothetical protein F41F3.4 - Caenorhabditis elegans
gb|AAA97982.1| (U55366) Similar to cuticle collagen [Caenorhabditis elegans]
Length = 310
Score = 34.0 bits (76), Expect = 2.2
Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 17/69 (24%)
Frame = -3
Query: 847 LGGGSSGP----PARIEAGPGAQD------------NGSAKPWERGPTGGPAPWRSRNQD 716
+GGGS P P GPG + S++P + GP G P P Q
Sbjct: 183 VGGGSGAPGAPGPKGAPGGPGQPGRDGQPGQAGQPGSSSSEPGQPGPNGQPGPRGPPGQA 242
Query: 715 SNEGGSAAPWA-RERGGRNQDNAAGN 641
+ GG+ P + G R D GN
Sbjct: 243 GSPGGNGQPGGPGQPGQRGSDGQPGN 268
>pir||T15143 hypothetical protein T28F2.8 - Caenorhabditis elegans
gb|AAB53059.1| (AF000198) Similar to cuticular collagen [Caenorhabditis elegans]
Length = 435
Score = 33.6 bits (75), Expect = 2.9
Identities = 26/67 (38%), Positives = 29/67 (42%), Gaps = 2/67 (2%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDN--GSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERG 671
G G +GPP GP QD G+A+P GP G P P D GG P G
Sbjct: 250 GPGPAGPPG--PPGPPGQDGSGGAAQP---GPPGPPGP---PGNDGQPGGPGQP-----G 296
Query: 670 GRNQDNAAG 644
G QD G
Sbjct: 297 GPGQDGGPG 305
>gb|AAF84591.1|AE004000_8 (AE004000) hypothetical protein [Xylella fastidiosa]
Length = 302
Score = 33.6 bits (75), Expect = 2.9
Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 8/53 (15%)
Frame = -3
Query: 835 SSGPPARIEAGPGAQDNGSAKPWERGPT--------GGPAPWRSRNQDSNEGGSAAPWAR 680
SS PPA E G + +A P GPT P R + D+NE G+A+P A
Sbjct: 56 SSTPPALPEPGVIVRPPANASPPTTGPTPTAQPASPASPTSTRESDADANEAGTASPTAN 115
Query: 679 E 677
+
Sbjct: 116 D 116
>gb|AAC67560.1| (AF095461) fibrinogen A-alpha chain [Felis catus]
Length = 463
Score = 33.6 bits (75), Expect = 2.9
Identities = 20/56 (35%), Positives = 26/56 (45%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARE 677
G GSSGP + PG+ GS+ W G + GP + N S+ SA W E
Sbjct: 188 GPGSSGPGSSGAWNPGSSGPGSSGAWNPG-SSGPGSGSTWNAGSSGVSSAGTWDTE 242
Score = 32.8 bits (73), Expect = 5.0
Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 4/51 (7%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTG----GPAPWRSRNQDSNEGGSAAPW 686
GSSGP + PG+ GS+ W G +G GP + N S+ GS+ W
Sbjct: 159 GSSGPGSSGAWNPGSSTPGSSGAWNPGSSGPGSSGPGSSGAWNPGSSGPGSSGAW 213
Score = 32.1 bits (71), Expect = 8.6
Identities = 18/51 (35%), Positives = 25/51 (48%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
GSSGP + PG+ GS+ W G + GP + N S+ GS+ W
Sbjct: 133 GSSGPGSSGAWNPGSTGPGSSGAWNPG-SSGPGSSGAWNPGSSTPGSSGAW 182
>gb|AAC33847.1| (AF043944) nongradient byssal precursor [Mytilus edulis]
Length = 904
Score = 33.2 bits (74), Expect = 3.8
Identities = 22/67 (32%), Positives = 25/67 (36%), Gaps = 4/67 (5%)
Frame = -3
Query: 841 GGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARER---- 674
GG GPP GP P E+G G P Q N G P A +
Sbjct: 241 GGPPGPPGHSPQGPQGSRGAPGAPGEQGANGSP------GQPGNAGAPGQPGAPGQAGAP 294
Query: 673 GGRNQDNAAGN 641
G R AAG+
Sbjct: 295 GARGPSGAAGH 305
Score = 32.1 bits (71), Expect = 8.6
Identities = 20/65 (30%), Positives = 26/65 (39%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRNQ 659
G G P + GP + + P +GPTG S +D GG P A +GG
Sbjct: 415 GDKGAPG--DVGPEGPEGPAGGPGPKGPTGPQGAKGSPGEDGEPGGEGEPGA--KGGDGL 470
Query: 658 DNAAG 644
AG
Sbjct: 471 PGQAG 475
>dbj|BAA07815.1| (D43758) fibrinogen A-alpha-chain [Macaca mulatta]
Length = 455
Score = 33.2 bits (74), Expect = 3.8
Identities = 18/51 (35%), Positives = 24/51 (46%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
GSSGP + PG+ GS W+ G + GP + N S+ GS W
Sbjct: 120 GSSGPGSTGHQSPGSSGPGSTGTWKPG-SSGPGSTGTWNPGSSGTGSTGTW 169
Score = 32.1 bits (71), Expect = 8.6
Identities = 19/51 (37%), Positives = 22/51 (42%), Gaps = 2/51 (3%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGP--TGGPAPWRSRNQDSNEGGSAAPW 686
GSSGP + PG+ GS W G TG W N S+ GS W
Sbjct: 146 GSSGPGSTGTWNPGSSGTGSTGTWNPGSSGTGSTGTW---NPGSSGSGSTGTW 195
>pir||I50694 collagen alpha 1(III) chain - chicken (fragment)
gb|AAA83407.1| (U07973) alpha-1 collagen type III [Gallus gallus]
Length = 886
Score = 33.2 bits (74), Expect = 3.8
Identities = 21/56 (37%), Positives = 26/56 (45%), Gaps = 2/56 (3%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNG-SAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWAR-ERG 671
G+ GPP A G G P ERG +G P P + + +G P AR ERG
Sbjct: 704 GAKGPPGPPGAPGGTGLPGLQGMPGERGASGSPGPKGDKGEPGGKGADGLPGARGERG 761
>gb|AAF82207.1|AC067971_15 (AC067971) EST gb|AI998275 comes from this gene. [Arabidopsis
thaliana]
Length = 155
Score = 33.2 bits (74), Expect = 3.8
Identities = 18/61 (29%), Positives = 27/61 (43%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
GGG + R G G + ++ W+RG GG P + + + GG +A R G
Sbjct: 74 GGGGARSGGRSRGGGGGSSSSRSRDWKRG--GGVVPIHTGGGNGSLGGGSAGSHRSSGSM 131
Query: 664 N 662
N
Sbjct: 132 N 132
>pir||T42310 hypothetical protein - phage SPP1
emb|CAA66517.1| (X97918) gene 24.1 [Bacteriophage SPP1]
Length = 91
Score = 33.2 bits (74), Expect = 3.8
Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Frame = +1
Query: 508 QEQRH--MHRNQDRRHIQPGFQELVAAMEQRHLKHRHMLDRSSYHRHYQRHYPGCDHR 675
QEQR + R DR Q Q + ++ + H+KH LD+++YHR Y H C R
Sbjct: 18 QEQRISLLERTSDRHDQQ--IQAVTESLSKIHVKHH--LDKANYHRSYHFHALNCCSR 71
>gb|AAF36092.1| (AF218624) flagelliform silk protein [Nephila madagascariensis]
Length = 626
Score = 33.2 bits (74), Expect = 3.8
Identities = 25/67 (37%), Positives = 28/67 (41%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
G G +GP AGPG G A P GP GG P + + GG A P RGG
Sbjct: 314 GPGGAGPGG---AGPGGVGPGGAGPGGAGP-GGVGPGGAGPGGAGPGG-AGPGGAGRGGA 368
Query: 664 NQDNAAG 644
A G
Sbjct: 369 GPGGAGG 375
>gb|AAF52203.1| (AE003608) vkg gene product [Drosophila melanogaster]
Length = 1940
Score = 32.8 bits (73), Expect = 5.0
Identities = 20/66 (30%), Positives = 23/66 (34%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRNQ 659
G SG P GP D +P +GP+G P R G P GG N
Sbjct: 108 GKSGEPGT--PGPRGIDGCDGRPGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGINS 165
Query: 658 DNAAGN 641
GN
Sbjct: 166 KGTKGN 171
>pir||S74598 hypothetical protein sll1040 - Synechocystis sp. (strain PCC 6803)
dbj|BAA16750.1| (D90900) hypothetical protein [Synechocystis sp.]
Length = 765
Score = 32.8 bits (73), Expect = 5.0
Identities = 21/63 (33%), Positives = 30/63 (47%), Gaps = 1/63 (1%)
Frame = +2
Query: 242 IVEGLLLSETSILKHPGLA*PHLSTGGWGRRRLITGRRWWSVARGRRWGRC-CTSILLDE 418
I+ GLL ++L P A LS+GG + + WW + RR GR C+ + L
Sbjct: 14 ILLGLLAFTLAVLIPPATAQITLSSGGNNGSQNTSSTPWWDTNKARRCGRLWCSDVFLQG 73
Query: 419 SINV 430
S V
Sbjct: 74 SSQV 77
>gb|AAF47772.1| (AE003478) sty gene product [Drosophila melanogaster]
Length = 589
Score = 32.8 bits (73), Expect = 5.0
Identities = 15/55 (27%), Positives = 26/55 (47%)
Frame = +1
Query: 508 QEQRHMHRNQDRRHIQPGFQELVAAMEQRHLKHRHMLDRSSYHRHYQRHYPGCDH 672
Q +H+H Q ++H+Q Q+ +Q+HL+H+ + Q G DH
Sbjct: 234 QRNQHLHLQQHQQHLQQQQQQQQQQQQQQHLQHQQNQQHARLATTTQATSVGSDH 288
>sp|P04258|CA13_BOVIN COLLAGEN ALPHA 1(III) CHAIN
pir||CGBO7S collagen alpha 1(III) chain - bovine
Length = 1049
Score = 32.8 bits (73), Expect = 5.0
Identities = 18/54 (33%), Positives = 23/54 (42%)
Frame = -3
Query: 850 ELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
E G G++GPP G P ERG GGP P + + + G AP
Sbjct: 548 EGGKGAAGPPG--PPGSAGTPGLQGMPGERGGPGGPGPKGDKGEPGSSGVDGAP 599
Score = 32.5 bits (72), Expect = 6.6
Identities = 25/66 (37%), Positives = 31/66 (46%), Gaps = 7/66 (10%)
Frame = -3
Query: 841 GGSSGPPARIEAG-PGAQ--DNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW----A 683
GG GP + + G PG+ D K RGPTG P Q ++G S AP A
Sbjct: 576 GGPGGPGPKGDKGEPGSSGVDGAPGKDGPRGPTGPIGPPGPAGQPGDKGESGAPGVPGIA 635
Query: 682 RERGGRNQDNAAG 644
RGG + G
Sbjct: 636 GPRGGPGERGEQG 648
Score = 32.1 bits (71), Expect = 8.6
Identities = 20/59 (33%), Positives = 23/59 (38%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRN 662
G GPP I GP +D S +P GP G P P + G R GRN
Sbjct: 60 GPPGPPGAI--GPSGKDGESGRPGRPGPRGFPGPPGMKGPAGMPGFPGMKGHRGFDGRN 116
>pir||T27525 hypothetical protein ZC373.7 - Caenorhabditis elegans
emb|CAA88979.1| (Z49131) similar to cuticular collagen~cDNA EST yk94d7.3 comes from
this gene~cDNA EST yk94d7.5 comes from this gene~cDNA
EST yk291h6.5 comes from this gene [Caenorhabditis
elegans]
Length = 297
Score = 32.8 bits (73), Expect = 5.0
Identities = 27/65 (41%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
Frame = -3
Query: 844 GGGSSGPP-ARIEAGP-----GAQDNGSA-KPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
G GS+G P A +AGP G +NGSA P GP G P ++ D + G AP
Sbjct: 215 GKGSAGAPGAPGKAGPAGPAGGPGNNGSAGTPGPAGPAGNPGTPGNKGSDGHPGTPGAP- 273
Query: 685 ARERGGRNQDNA 650
GG D A
Sbjct: 274 ----GGPGHDAA 281
>dbj|BAA90381.1| (AP001081) hypothetical protein [Oryza sativa]
Length = 235
Score = 32.8 bits (73), Expect = 5.0
Identities = 21/61 (34%), Positives = 26/61 (42%), Gaps = 5/61 (8%)
Frame = -3
Query: 841 GGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWA-----RE 677
GG GP +G D G P +RG GGP +GG P R
Sbjct: 157 GGDGGPGG---SGGRGGDGGQGGPGQRGGDGGPGGAGGPGGHGGDGGDGGPSGSPLRKRR 213
Query: 676 RGGRNQ 659
RGGR++
Sbjct: 214 RGGRSR 219
>pir||A41726 homeotic protein BarH2 - fruit fly (Drosophila melanogaster)
gb|AAB59218.1| (M82887) dual bar protein [Drosophila melanogaster]
Length = 640
Score = 32.8 bits (73), Expect = 5.0
Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Frame = +1
Query: 508 QEQRHMHRNQDRRHIQPGFQELVAAMEQRHLKHRHMLDRSSYHRHY-QRHYP 660
Q+Q+H H Q ++H Q Q+ + +Q+ L+ +R HY +RH P
Sbjct: 113 QQQQHHHHQQQQQHQQAALQQYI-VQQQQLLRFEREREREREREHYRERHSP 163
>gb|AAC67561.1| (AF095462) fibrinogen A-alpha chain [Equus caballus]
Length = 481
Score = 32.5 bits (72), Expect = 6.6
Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPT--GGPAPWRSRNQDSNEGGSAAP 689
GS GP + PG+ GS+ PW G + G + W N S+E GS P
Sbjct: 130 GSYGPGSASTWNPGSSQPGSSGPWTSGSSGLGSASTW---NPGSSEPGSDGP 178
>sp|P20631|CC13_CAEEL CUTICLE COLLAGEN 13 PRECURSOR
pir||S08170 collagen col-13 precursor - Caenorhabditis elegans
emb|CAA35955.1| (X51623) collagen [Caenorhabditis elegans]
emb|CAA98258.1| (Z73972) predicted using Genefinder~similar to collagen
[Caenorhabditis elegans]
Length = 316
Score = 32.5 bits (72), Expect = 6.6
Identities = 20/50 (40%), Positives = 23/50 (46%), Gaps = 4/50 (8%)
Frame = -3
Query: 838 GSSGPPARI--EAGPGA--QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
G SG P + PGA Q G+A P GP G P P + N G AP
Sbjct: 179 GPSGAPGQKGPSGAPGAPGQSGGAALPGPPGPAGPPGPAGQPGSNGNAGAPGAP 232
>gb|AAF60775.1| (AC024811) contains similarity to Pfam family PF01391 (Collagen
triple helix repeat (20 copies)), score=73.8, E=3.5e-18,
N=2 [Caenorhabditis elegans]
Length = 285
Score = 32.5 bits (72), Expect = 6.6
Identities = 21/58 (36%), Positives = 24/58 (41%), Gaps = 2/58 (3%)
Frame = -3
Query: 841 GGSSGPPAR--IEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGG 668
GG GPP + I PG A P +GP+G P P Q G P A GG
Sbjct: 148 GGPPGPPGQDGIPGNPGRNGEDGA-PGPQGPSGPPGPPGQPGQPGQRGPPGEPGALLPGG 206
>sp|P20630|CC12_CAEEL CUTICLE COLLAGEN 12 PRECURSOR
pir||S08169 collagen col-12 precursor - Caenorhabditis elegans
emb|CAA35954.1| (X51622) collagen [Caenorhabditis elegans]
emb|CAA98257.1| (Z73972) predicted using Genefinder~similar to collagen~cDNA EST
yk120e2.3 comes from this gene~cDNA EST yk72b10.3 comes
from this gene~cDNA EST yk72b10.5 comes from this
gene~cDNA EST yk120e2.5 comes from this gene~cDNA EST
CEMSG34FB comes from this g>
Length = 316
Score = 32.5 bits (72), Expect = 6.6
Identities = 20/50 (40%), Positives = 23/50 (46%), Gaps = 4/50 (8%)
Frame = -3
Query: 838 GSSGPPARI--EAGPGA--QDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
G SG P + PGA Q G+A P GP G P P + N G AP
Sbjct: 179 GPSGAPGQKGPSGAPGAPGQSGGAALPGPPGPAGPPGPAGQPGSNGNAGAPGAP 232
>gb|AAF55419.1| (AE003717) CG5866 gene product [Drosophila melanogaster]
Length = 206
Score = 32.5 bits (72), Expect = 6.6
Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 1/59 (1%)
Frame = +1
Query: 514 QRHMHRNQDRRHIQPGFQELVAAMEQRHLKHRHMLDRSSYH-RHYQRHYPGCDHRVRGPR 690
Q H H + + ++ FQE+ + Q + H H+ + YH H H+ C + PR
Sbjct: 111 QHHYHGSHPNQQLEAPFQEVHGSHHQHPIHHNHL--EAPYHGTHGVHHHRNCHATIHCPR 168
>pir||A27353 collagen alpha 1(III) chain precursor - mouse (fragment)
gb|AAA37338.1| (M18933) alpha-1 type-III collagen precursor [Mus musculus]
Length = 488
Score = 32.1 bits (71), Expect = 8.6
Identities = 15/34 (44%), Positives = 18/34 (52%), Gaps = 1/34 (2%)
Frame = -3
Query: 838 GSSGPPARI-EAGPGAQDNGSAKPWERGPTGGPAP 737
G GPP + AGP +D S +P GP G P P
Sbjct: 212 GPPGPPGALGPAGPAGKDGESGRPGRPGPRGLPGP 246
>gb|AAF76432.1|AF272661_1 (AF272661) alpha 4 type V collagen [Rattus norvegicus]
Length = 1737
Score = 32.1 bits (71), Expect = 8.6
Identities = 20/65 (30%), Positives = 24/65 (36%), Gaps = 1/65 (1%)
Frame = -3
Query: 838 GSSGPPARI-EAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRN 662
GS GPP GP + P GP G P P + N G E+G R
Sbjct: 674 GSEGPPGHPGHEGPTGEKGAQGPPGSAGPQGYPGPRGVKGTSGNRGLQG-----EKGERG 728
Query: 661 QDNAAG 644
+D G
Sbjct: 729 EDGFPG 734
>pir||T20720 hypothetical protein F10F2.9 - Caenorhabditis elegans
emb|CAA84658.1| (Z35598) Asparagine, Serine and Glycine rich predicted protein
[Caenorhabditis elegans]
emb|CAA86461.1| (Z46343) Asparagine, Serine and Glycine rich predicted protein
[Caenorhabditis elegans]
Length = 549
Score = 32.1 bits (71), Expect = 8.6
Identities = 26/74 (35%), Positives = 31/74 (41%), Gaps = 9/74 (12%)
Frame = -3
Query: 862 ELMQELGGGSSGPPARIEAGPGAQDN--------GSAKPWERGPTGGPAPWRSRNQDSNE 707
EL GG + G + +G G DN S W G TGG N N+
Sbjct: 334 ELSNNWGGSNGGSGSNGGSGGGNTDNDNWGSNNGNSGGSWGNGGTGGSGGSGGGNWGDND 393
Query: 706 G-GSAAPWARERGGRNQDNAAGN 641
GS+ W G NQDN N
Sbjct: 394 NYGSSNKW---NGNGNQDNDNDN 413
>pir||T00041 BH-protocadherin PCDH7 (clone BH-Pcdh-b) - human
dbj|BAA25195.1| (AB006756) PCDH7 (BH-Pcdh)b [Homo sapiens]
Length = 1072
Score = 32.1 bits (71), Expect = 8.6
Identities = 27/71 (38%), Positives = 35/71 (49%), Gaps = 7/71 (9%)
Frame = -3
Query: 874 IRHEELMQELGGGSSGPPARIEAG-------PGAQDNGSAKPWERGPTGGPAPWRSRNQD 716
I EL+QE GGG SG +R AG PG NG+ +GG + R D
Sbjct: 177 IERYELLQEPGGGGSGGESR-RAGAADSAPYPGGGGNGA--------SGGGSGGSKRRLD 227
Query: 715 SNEGGSAAPWARERGGRN 662
++EGG GGR+
Sbjct: 228 ASEGGGGT----NPGGRS 241
>pir||G02127 fus-like protein - human (fragment)
gb|AAA79948.1| (U36561) fus-like protein [Homo sapiens]
Length = 528
Score = 32.1 bits (71), Expect = 8.6
Identities = 21/67 (31%), Positives = 30/67 (44%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGR 665
GGGS G +G G G G +GG + NQD + GG ++RGGR
Sbjct: 166 GGGSYGQDQSSMSGSGGGGGGGGG----GGSGGGGGYG--NQDQSGGGGGGYGQQDRGGR 219
Query: 664 NQDNAAG 644
+ ++G
Sbjct: 220 GRGRSSG 226
>sp|P10569|MYSC_ACACA MYOSIN IC HEAVY CHAIN
pir||MWAXIC myosin heavy chain IC - Acanthamoeba castellanii
gb|AAA27707.1| (J02974) myosin IB heavy chain [Acanthamoeba castellanii]
Length = 1168
Score = 32.1 bits (71), Expect = 8.6
Identities = 22/75 (29%), Positives = 32/75 (42%)
Frame = -3
Query: 865 EELMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
++++ GGG G R GP S +P G GGP+P+ R S +A+
Sbjct: 919 DQILGAKGGGGGGGRGR--GGPSPSGAVSPRPSPGGGGGGPSPFGGRPSPSGPPAAASAP 976
Query: 685 ARERGGRNQDNAAGN 641
E+ D AA N
Sbjct: 977 GPEQARALYDFAAEN 991
>gb|AAC58530.1| (U85043) gag polyprotein [feline syncytial virus]
Length = 489
Score = 32.1 bits (71), Expect = 8.6
Identities = 20/61 (32%), Positives = 24/61 (38%), Gaps = 5/61 (8%)
Frame = -3
Query: 871 RHEELMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRS-----RNQDSNE 707
R+ + Q G G GP G G P RGP GP P + R Q
Sbjct: 420 RNPQQPQRYGQGPPGPNPYRRFGDGGNPQQQGPPPNRGPDQGPRPGGNPRGGGRGQGPRN 479
Query: 706 GGSAAP 689
GG + P
Sbjct: 480 GGGSVP 485
>dbj|BAA33381.1| (AB008374) alpha 3 type I collagen [Oncorhynchus mykiss]
Length = 678
Score = 32.1 bits (71), Expect = 8.6
Identities = 21/57 (36%), Positives = 26/57 (44%), Gaps = 6/57 (10%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNG------SAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARE 677
GS+GPP AG Q G + +P E G G P P + N+G AP
Sbjct: 104 GSAGPPG--PAGKEGQKGGRGETGIAGRPGEAGAAGPPGPSGASGAKGNDGPMGAPGTPG 161
Query: 676 RGG 668
GG
Sbjct: 162 PGG 164
>pir||T35089 probable integral membrane transport protein - Streptomyces
coelicolor
emb|CAB51452.1| (AL096884) putative integral membrane transport protein
[Streptomyces coelicolor A3(2)]
Length = 306
Score = 32.1 bits (71), Expect = 8.6
Identities = 27/76 (35%), Positives = 36/76 (46%), Gaps = 6/76 (7%)
Frame = -1
Query: 813 SRPALALKITAARSPGS------EVLPADLHHGAAVTKILTKEDRQPPGPANAVVATRIM 652
S PA A TA+ P V+PAD GAA E R+ GPA R+
Sbjct: 14 SAPARA---TASNDPSEGELRDVSVVPADALRGAAPAA----EGRRAAGPAALGPKARLW 66
Query: 651 PLVMAVIATTVKHMAVLQVPLL 586
P ++AV + V ++PLL
Sbjct: 67 PSLVAVYRAQLSRARVARIPLL 88
>pir||T08435 la costa protein - fruit fly (Drosophila melanogaster)
gb|AAC28405.1| (AF017777) la costa [Drosophila melanogaster]
gb|AAF50816.1| (AE003568) lcs gene product [Drosophila melanogaster]
Length = 145
Score = 32.1 bits (71), Expect = 8.6
Identities = 19/52 (36%), Positives = 19/52 (36%)
Frame = -3
Query: 844 GGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAP 689
G G G P R GPG G P RGP GGP GG P
Sbjct: 28 GPGGPGGPGRGRGGPGRGPGGPGGPGGRGP-GGPGGPGGPGGPGGPGGPGGP 78
>pir||T29982 hypothetical protein F11G11.12 - Caenorhabditis elegans
gb|AAB37843.1| (U80451) Similar to collagen [Caenorhabditis elegans]
Length = 285
Score = 32.1 bits (71), Expect = 8.6
Identities = 23/65 (35%), Positives = 27/65 (41%), Gaps = 1/65 (1%)
Frame = -3
Query: 838 GSSGPPARIEA-GPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWARERGGRN 662
G+ G P A GP +D +P GP G P P QD G AP E G
Sbjct: 211 GNPGQPGEPGAQGPPGEDG---RPGNSGPQGPPGPQGEPGQDGAPGNPGAP--GEAGEPG 265
Query: 661 QDNAAG 644
+D A G
Sbjct: 266 KDGAKG 271
>ref|NP_037465.1| EH domain-binding mitotic phosphoprotein
gb|AAD38326.1|AF073727_1 (AF073727) EH domain-binding mitotic phosphoprotein [Homo sapiens]
Length = 551
Score = 32.1 bits (71), Expect = 8.6
Identities = 20/58 (34%), Positives = 23/58 (39%), Gaps = 3/58 (5%)
Frame = -3
Query: 841 GGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPA--PWRSRNQD-SNEGGSAAPWARERG 671
GG PPA G A S PW GP+ PW + EG + PW G
Sbjct: 272 GGPPVPPAADPWGGPAPTPASGDPWRPAAPAGPSVDPWGGTPAPAAGEGPTPDPWGSSDG 331
Query: 670 G 668
G
Sbjct: 332 G 332
>pir||T00042 BH-protocadherin PCDH7 (clone BH-Pcdh-c) - human
dbj|BAA25196.1| (AB006757) PCDH7 (BH-Pcdh)c [Homo sapiens]
Length = 1200
Score = 32.1 bits (71), Expect = 8.6
Identities = 27/71 (38%), Positives = 35/71 (49%), Gaps = 7/71 (9%)
Frame = -3
Query: 874 IRHEELMQELGGGSSGPPARIEAG-------PGAQDNGSAKPWERGPTGGPAPWRSRNQD 716
I EL+QE GGG SG +R AG PG NG+ +GG + R D
Sbjct: 177 IERYELLQEPGGGGSGGESR-RAGAADSAPYPGGGGNGA--------SGGGSGGSKRRLD 227
Query: 715 SNEGGSAAPWARERGGRN 662
++EGG GGR+
Sbjct: 228 ASEGGGGT----NPGGRS 241
>ref|NP_002580.1| BH-protocadherin (brain-heart)
pir||T00040 BH-protocadherin PCDH7 - human
dbj|BAA25194.1| (AB006755) PCDH7 (BH-Pcdh)a [Homo sapiens]
Length = 1069
Score = 32.1 bits (71), Expect = 8.6
Identities = 27/71 (38%), Positives = 35/71 (49%), Gaps = 7/71 (9%)
Frame = -3
Query: 874 IRHEELMQELGGGSSGPPARIEAG-------PGAQDNGSAKPWERGPTGGPAPWRSRNQD 716
I EL+QE GGG SG +R AG PG NG+ +GG + R D
Sbjct: 177 IERYELLQEPGGGGSGGESR-RAGAADSAPYPGGGGNGA--------SGGGSGGSKRRLD 227
Query: 715 SNEGGSAAPWARERGGRN 662
++EGG GGR+
Sbjct: 228 ASEGGGGT----NPGGRS 241
>dbj|BAB01598.1| (AB046016) unnamed protein product [Macaca fascicularis]
Length = 344
Score = 32.1 bits (71), Expect = 8.6
Identities = 12/19 (63%), Positives = 15/19 (78%), Gaps = 17/19 (89%)
Frame = -3
Query: 841 GGSSG--PPARI----EAGPGAQD-----------NGSAKPWERGP--TGGPAPWRSRNQ 719
GG+ G PPAR + GPGA+D GSA P E P GGP R+ Q
Sbjct: 168 GGARGRSPPARAAGGAQPGPGAEDVQPAGRPGAPTAGSAPPLEGRPQGAGGPGALRAEGQ 227
Query: 718 DS 713
DS
Sbjct: 228 DS 229
>pir||T21070 hypothetical protein F17C8.2 - Caenorhabditis elegans
emb|CAA84800.1| (Z35719) similar to cuticular collagen~cDNA EST yk170g12.5 comes
from this gene~cDNA EST yk125f8.5 comes from this
gene~cDNA EST yk125f8.3 comes from this gene~cDNA EST
yk170g12.3 comes from this gene~cDNA EST yk191e5.3 comes
from this gene~cDNA EST yk>
Length = 296
Score = 32.1 bits (71), Expect = 8.6
Identities = 23/66 (34%), Positives = 27/66 (40%), Gaps = 8/66 (12%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQ--------DNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPWA 683
G SGP EAGP + + G +P GP G P P G AP
Sbjct: 189 GDSGPNG--EAGPNGEPGAPGKDGEKGKGEPGPAGPPGPPGPGGPPGDAGGAGSDGAP-- 244
Query: 682 RERGGRNQDNAAGN 641
+G QD GN
Sbjct: 245 GPQGPPGQDGTPGN 258
>gb|AAC98089.1| (AF051353) myosin IC heavy chain [Acanthamoeba castellanii]
Length = 1186
Score = 32.1 bits (71), Expect = 8.6
Identities = 22/75 (29%), Positives = 32/75 (42%)
Frame = -3
Query: 865 EELMQELGGGSSGPPARIEAGPGAQDNGSAKPWERGPTGGPAPWRSRNQDSNEGGSAAPW 686
++++ GGG G R GP S +P G GGP+P+ R S +A+
Sbjct: 937 DQILGAKGGGGGGGRGR--GGPSPSGAVSPRPSPGGGGGGPSPFGGRPSPSGPPAAASAP 994
Query: 685 ARERGGRNQDNAAGN 641
E+ D AA N
Sbjct: 995 GPEQARALYDFAAEN 1009
>dbj|BAA07813.1| (D43756) fibrinogen A-alpha-chain [Canis familiaris]
Length = 443
Score = 32.1 bits (71), Expect = 8.6
Identities = 21/52 (40%), Positives = 25/52 (47%), Gaps = 7/52 (13%)
Frame = -3
Query: 838 GSSGPPARIEAGPGAQDNGSAKPWERGPT-------GGPAPWRSRNQDSNEGGSAAPWA 683
GSS P + PG+ GSA PW G T G W +R S GSA W+
Sbjct: 112 GSSTPGSAGTWNPGSTGPGSAGPWSSGSTRPGSTGPGSAGTWSTR-PGSTGPGSAGTWS 169
Database: nr
Posted date: Sep 29, 2000 9:53 PM
Number of letters in database: 177,575,912
Number of sequences in database: 565,281
Lambda K H
0.318 0.135 0.00
Gapped
Lambda K H
0.270 0.0470 4.94e-324
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 247189426
Number of Sequences: 565281
Number of extensions: 5128878
Number of successful extensions: 21429
Number of sequences better than 10.0: 118
Number of HSP's better than 10.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 56
Number of HSP's that attempted gapping in prelim test: 21094
Number of HSP's gapped (non-prelim): 323
length of query: 292
length of database: 177,575,912
effective HSP length: 54
effective length of query: 237
effective length of database: 147,050,738
effective search space: 34851024906
effective search space used: 34851024906
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 71 (32.1 bits)