Sequences producing significant alignments:  Score (bits)  E Value
gb|AAD28558.1|AF118227_1 (AF118227) infection structure spe... 90 2e-17
ref|NP_055753.1| KIAA0867 protein >gi|4240223|dbj|BAA74890.... 36 0.48
gb|AAD38191.1|AF154572_1 (AF154572) ERG2 protein [Rattus no... 35 0.82
emb|CAB59672.1| (AL132674) putative membrane protein [Strep... 35 1.1
gb|AAC60129.1| (U43200) antifreeze glycopeptide AFGP polypr... 33 3.2
gb|AAD49340.1|AF167708_1 (AF167708) excretory/secretory muc... 33 4.2
dbj|BAA90346.1| (AP001080) Similar to geranylgeranyl hydrog... 32 5.4
emb|CAB67640.1| (AL132966) putative protein [Arabidopsis th... 32 7.1
pir||H70589 hypothetical glycine-rich protein Rv2853 - Myco... 32 7.1
sp|Q60528|MUC1_MESAU MUCIN 1 PRECURSOR >gi|1353410|gb|AAB53... 32 7.1
pir||PC4395 mucin 3 - human (fragment) >gi|2454615|gb|AAB71... 32 7.1
pir||A70647 probable PPE protein - Mycobacterium tuberculos... 32 7.1
pir||E70820 hypothetical glycine-rich protein Rv0977 - Myco... 32 7.1
pir||T33819 hypothetical protein W05F2.7 - Caenorhabditis e... 32 9.3
ref|NP_012284.1| cell surface flocculin with structure simi... 32 9.3
gb|AAA75046.1| (U31232) core protein [Hepatitis C virus] 32 9.3
gb|AAA36334.1| (M22405) intestinal mucin [Homo sapiens] 32 9.3
gb|AAA59164.1| (M94132) MUC2 [Homo sapiens] 32 9.3
dbj|BAA90631.1| (AP001129) hypothetical protein [Oryza sativa] 32 9.3
emb|CAA07489.1| (AJ007394) mucin [Anopheles gambiae] 32 9.3
ref|NP_002448.1| mucin 2, intestinal/tracheal >gi|2506877|s... 32 9.3
pir||A43932 mucin 2 precursor, intestinal - human (fragments) 32 9.3
>gb|AAD28558.1|AF118227_1 (AF118227) infection structure specific protein [Pyricularia
grisea]
Length = 217
Score = 90.5 bits (221), Expect = 2e-17
Identities = 59/167 (35%), Positives = 88/167 (52%), Gaps = 20/167 (11%)
Frame = +2
Query: 2 HEAIMKRGAIEARQTGLPSLGDISEECQSAVLDIAQGVPTPAPEIVSDLLKNPQTDPCSF 181
H + + GA AR+T PSL I++ C ++++ I++ +PTP +I+S L N +T+ CS
Sbjct: 27 HAVVRREGA--ARETASPSLPSINDPCLASLMSISKTLPTPGADILSALASNTETNACSI 84
Query: 182 STPXXXXXXXXXXXXXXXXWYGQNQDDIMSAVKECPELASLASLVPVCE-------ASAT 340
+ WY N I S + +CP+L+S A +PVC AS+T
Sbjct: 85 TVAASVSDKFNAYTSTIRSWYSSNSAAINSVLSQCPKLSSYAGDLPVCTGKDSSNGASST 144
Query: 341 GAPAFTGLTATT-------TGVVPPVLSSADETGKTPAGTP---TCGGSSPVETN---GA 481
+G TT +G +SA ++ P G+ T GS P ETN GA
Sbjct: 145 ATKGSSGAATTTGRATDAGSGSATAATASAGDSSSNPTGSAASVTKSGSGPRETNFVAGA 204
Query: 482 VAREGGL 502
VA G L
Sbjct: 205 VAAIGFL 211
>ref|NP_055753.1| KIAA0867 protein
dbj|BAA74890.1| (AB020674) KIAA0867 protein [Homo sapiens]
Length = 526
Score = 36.0 bits (81), Expect = 0.48
Identities = 43/122 (35%), Positives = 52/122 (42%), Gaps = 17/122 (13%)
Frame = +2
Query: 113 VPTPAPEIVSDLLKNPQTDPCSFSTPXXXXXXXXXXXXXXXXWYGQNQDDIMSA--VKEC 286
VP P PE VS +LKN + P +FS GQ Q IM++ +K
Sbjct: 165 VPAPKPEPVSLVLKNARIAPAAFS--------------------GQPQAVIMTSGPLKRE 204
Query: 287 PELASLAS-----LVPVCEASATGAPAF------TGLTATTTGVVPPVLSSADETGKTPA 433
LAS S + P A A G P F T L T+ PV T + P
Sbjct: 205 GMLASTVSQSNVVIAPAAIARAPGVPEFHSSILVTDLGHGTSSPPAPVSRLFPSTAQDPL 264
Query: 434 G----TPTCGGSSPVETNG 478
G P GGS V G
Sbjct: 265 GKGEQVPLHGGSPQVTVTG 283
>gb|AAD38191.1|AF154572_1 (AF154572) ERG2 protein [Rattus norvegicus]
Length = 175
Score = 35.2 bits (79), Expect = 0.82
Identities = 32/88 (36%), Positives = 40/88 (45%), Gaps = 3/88 (3%)
Frame = +2
Query: 266 MSAVKECPELASL-ASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTP 442
+ A+K L+SL AS + VC A A G A T VPPVLS+ TG A +
Sbjct: 60 LGALKAGTVLSSLPASALAVCPIGVKTAVAMLG-GAVTVAAVPPVLSAMGFTGSGIATSS 118
Query: 443 TCGGSSPVE--TNGAVAREGGLLYVNRDAGA 529
V NG GGL+ + AGA
Sbjct: 119 LAAKLMSVSAIANGGGVATGGLVATLQSAGA 149
>emb|CAB59672.1| (AL132674) putative membrane protein [Streptomyces coelicolor
A3(2)]
Length = 583
Score = 34.8 bits (78), Expect = 1.1
Identities = 28/70 (40%), Positives = 31/70 (44%), Gaps = 5/70 (7%)
Frame = +2
Query: 335 ATGAPAFTGLTATT-----TGVVPPVLSSADETGKTPAGTPTCGGSSPVETNGAVAREGG 499
ATG P TG T TT G S+A TG T GT GG SP G V G
Sbjct: 492 ATGGPQTTGGTGTTGGSDTAGGTTGGTSTAGTTGSTTGGTT--GGQSPQGGRGGVLASTG 549
Query: 500 LLYVNRDAGARVVAA 544
+ AGA + A
Sbjct: 550 STVLLGSAGAALALA 564
>gb|AAC60129.1| (U43200) antifreeze glycopeptide AFGP polyprotein precursor
[Boreogadus saida]
Length = 507
Score = 33.2 bits (74), Expect = 3.2
Identities = 23/72 (31%), Positives = 29/72 (39%)
Frame = +2
Query: 272 AVKECPELASLASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCG 451
A P A+ A+ +AT A A T TA T + A TG TPA PT G
Sbjct: 414 ATPATPATAATAATAATAATAATAATAATAATAPTPARAARAATPA--TGATPATAPTAG 471
Query: 452 GSSPVETNGAVA 487
++ T A
Sbjct: 472 TAATAATAATAA 483
>gb|AAD49340.1|AF167708_1 (AF167708) excretory/secretory mucin MUC-3 [Toxocara canis]
Length = 269
Score = 32.8 bits (73), Expect = 4.2
Identities = 20/66 (30%), Positives = 28/66 (42%), Gaps = 4/66 (6%)
Frame = +2
Query: 287 PELASLASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETGK----TPAGTPTCGG 454
P + A+ A+ T AP T TTT P ++A T T PT
Sbjct: 122 PTTTTAAATTTTAAATTTAAPTTTTAAPTTTTATPTTTTAAPTTTTAAPTTTTAAPTTTT 181
Query: 455 SSPVETNGAV 484
++P T GA+
Sbjct: 182 AAPTTTTGAI 191
>dbj|BAA90346.1| (AP001080) Similar to geranylgeranyl hydrogenase. (AF069318) [Oryza
sativa]
dbj|BAA92518.1| (AP001383) Similar to geranylgeranyl hydrogenase. (AF069318) [Oryza
sativa]
Length = 452
Score = 32.5 bits (72), Expect = 5.4
Identities = 23/67 (34%), Positives = 31/67 (45%), Gaps = 10/67 (14%)
Frame = +2
Query: 266 MSAVKECPELASLASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETG-------- 421
M+A S A LV C +SAT AP L G P S+A+
Sbjct: 1 MAAAVAAAACHSPARLVVTCSSSATPAPPRRPLRVAVVGGGPAGASAAEALASAGAQAFL 60
Query: 422 --KTPAGTPTCGGSSPV 466
+ P+G CGG+ P+
Sbjct: 61 LERNPSGAKPCGGAIPL 77
>emb|CAB67640.1| (AL132966) putative protein [Arabidopsis thaliana]
Length = 310
Score = 32.1 bits (71), Expect = 7.1
Identities = 23/60 (38%), Positives = 29/60 (48%), Gaps = 12/60 (20%)
Frame = +1
Query: 10 HHEARCHRG------SPDRPPFSR*HQRGVPVRRS------RHRPGRPYSRPRDRL*SAQ 153
HHEAR G +P PP S+ H+R P+ S H P RP + P
Sbjct: 105 HHEARPMNGHDPLAITPSPPPPSKTHERSRPITPSPPPPSKTHEPSRPNTPP-------- 156
Query: 154 EPPDRPLQLQHP 189
PP P + P
Sbjct: 157 -PPPPPSKTHEP 167
>pir||H70589 hypothetical glycine-rich protein Rv2853 - Mycobacterium
tuberculosis (strain H37RV)
emb|CAB08453.1| (Z95207) PE_PGRS [Mycobacterium tuberculosis]
Length = 615
Score = 32.1 bits (71), Expect = 7.1
Identities = 19/68 (27%), Positives = 27/68 (38%)
Frame = +2
Query: 326 EASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCGGSSPVETNGAVAREGGLL 505
+A T A ++ + A + L + +T G P G + T G GGLL
Sbjct: 78 QALTTAAASYASVEAANASPLQVALDVINAPAQTLLGRPLIGNGADGSTPGQAGGPGGLL 137
Query: 506 YVNRDAGA 529
Y N GA
Sbjct: 138 YGNGGNGA 145
>sp|Q60528|MUC1_MESAU MUCIN 1 PRECURSOR
gb|AAB53965.1| (U36918) mucin [Mesocricetus auratus]
Length = 676
Score = 32.1 bits (71), Expect = 7.1
Identities = 19/68 (27%), Positives = 31/68 (44%)
Frame = +2
Query: 305 ASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCGGSSPVETNGAV 484
++LVP A+ +GA A T + + P ++ T K PA TP GS T+ +
Sbjct: 332 SALVPTTSAAHSGASAMTNSSESDLATTPIDSGTSISTTKAPATTPVHNGSLVPTTSSVL 391
Query: 485 AREGGLLY 508
L++
Sbjct: 392 GSATTLIH 399
>pir||PC4395 mucin 3 - human (fragment)
gb|AAB71685.1| (AF016692) small intestinal mucin MUC3 [Homo sapiens]
Length = 648
Score = 32.1 bits (71), Expect = 7.1
Identities = 20/67 (29%), Positives = 32/67 (46%), Gaps = 3/67 (4%)
Frame = -2
Query: 450 PQVGVPAGVLPVSSAEERTGGTTPV---VVAVRPVKAGAPVAEASQTGTRLASEASSGHS 280
P +P PV+S+E T TTPV +A + A T R+++ + S
Sbjct: 54 PLTNMPVSTTPVASSEASTLSTTPVDSNTFVTSSSQASSSPATLQVTTMRMSTPSEGSSS 113
Query: 279 LTADMISSWF 250
LT ++SS +
Sbjct: 114 LTTMLLSSTY 123
>pir||A70647 probable PPE protein - Mycobacterium tuberculosis (strain H37RV)
emb|CAB06293.1| (Z83867) PPE [Mycobacterium tuberculosis]
Length = 409
Score = 32.1 bits (71), Expect = 7.1
Identities = 23/64 (35%), Positives = 26/64 (39%)
Frame = +2
Query: 353 FTGLTATTTGVVPPVLSSADETGKTPAGTPTCGGSSPVETNGAVAREGGLLYVNRDAGAR 532
F GL A TG TG GT GG V T GA A GG+ YV +
Sbjct: 175 FPGLGAGATGATGGESVGTGATGGESVGT---GGGESVGTGGATASGGGVGYVGSGVASA 231
Query: 533 VVAA 544
+AA
Sbjct: 232 GLAA 235
>pir||E70820 hypothetical glycine-rich protein Rv0977 - Mycobacterium
tuberculosis (strain H37RV)
emb|CAA17576.1| (AL021999) PE_PGRS [Mycobacterium tuberculosis]
Length = 923
Score = 32.1 bits (71), Expect = 7.1
Identities = 18/51 (35%), Positives = 23/51 (44%)
Frame = +2
Query: 380 GVVPPVLSSADETGKTPAGTPTCGGSSPVETNGAVAREGGLLYVNRDAGAR 532
GV P ++ A T P G GGS + G GGLL+ N AG +
Sbjct: 173 GVGGPGIAGAAGTAGLPGGNGANGGSGGIGGAGGAGGNGGLLFGNGGAGGQ 223
>pir||T33819 hypothetical protein W05F2.7 - Caenorhabditis elegans
gb|AAC78217.1| (AF106582) weak similarity to collagen alpha [Caenorhabditis
elegans]
Length = 1367
Score = 31.7 bits (70), Expect = 9.3
Identities = 23/48 (47%), Positives = 24/48 (49%)
Frame = +2
Query: 335 ATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCGGSSPVETNG 478
ATGAP TGV PV SS G TP TP GG+ P ET G
Sbjct: 489 ATGAPG-----QPFTGVTMPVGSS----GMTPP-TPVTGGTGPTETTG 526
>ref|NP_012284.1| cell surface flocculin with structure similar to
serine/threonine-rich GPI-anchored cell wall proteins;
Muc1p
sp|P08640|AMYH_YEAST GLUCOAMYLASE S1/S2 PRECURSOR (GLUCAN 1,4-ALPHA-GLUCOSIDASE)
(1,4-ALPHA-D-GLUCAN GLUCOHYDROLASE)
pir||S48478 glucan 1,4-alpha-glucosidase (EC 3.2.1.3) - yeast (Saccharomyces
cerevisiae)
emb|CAA86176.1| (Z38061) mal5, sta1, len: 1367, CAI: 0.3, AMYH_YEAST P08640
GLUCOAMYLASE S1 (EC 3.2.1.3) [Saccharomyces cerevisiae]
gb|AAC49609.1| (U30626) glucoamylase [Saccharomyces cerevisiae var. diastaticus]
Length = 1367
Score = 31.7 bits (70), Expect = 9.3
Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
Frame = +2
Query: 299 SLASLVPVCEASAT---GAPAFTGLTATTTGVVPPVLSSADETGKTPAGTP----TCGGS 457
S ++ VP +S T APA T ++TT PV SS E+ P TP T S
Sbjct: 591 SSSAPVPTPSSSTTESSSAPAPTPSSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSS 650
Query: 458 SPVETNGAVARE 493
+PV T + E
Sbjct: 651 APVPTPSSSTTE 662
>gb|AAA75046.1| (U31232) core protein [Hepatitis C virus]
Length = 103
Score = 31.7 bits (70), Expect = 9.3
Identities = 17/39 (43%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Frame = -2
Query: 147 RSETISGAGVGTPWAM--SRTADWHSSLMSPREGRPVWRAS 31
RSE S A G PW + + + W L+SPR RP W S
Sbjct: 61 RSEGRSWAQPGYPWPLYGNESCGWAGWLLSPRGSRPSWGPS 101
>gb|AAA36334.1| (M22405) intestinal mucin [Homo sapiens]
Length = 278
Score = 31.7 bits (70), Expect = 9.3
Identities = 23/61 (37%), Positives = 30/61 (48%), Gaps = 3/61 (4%)
Frame = +2
Query: 302 LASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCG---GSSPVET 472
L++L P E ++T P+ TT+G LS T +P GTPT G GSS T
Sbjct: 164 LSTLPPAIEMTSTAPPSTPTAPTTTSG--GHTLSPPPSTTTSPPGTPTRGTTTGSSSAPT 221
Query: 473 NGAV 484
V
Sbjct: 222 PSTV 225
>gb|AAA59164.1| (M94132) MUC2 [Homo sapiens]
Length = 984
Score = 31.7 bits (70), Expect = 9.3
Identities = 23/61 (37%), Positives = 30/61 (48%), Gaps = 3/61 (4%)
Frame = +2
Query: 302 LASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCG---GSSPVET 472
L++L P E ++T P+ TT+G LS T +P GTPT G GSS T
Sbjct: 43 LSTLPPAIEMTSTAPPSTPTAPTTTSG--GHTLSPPPSTTTSPPGTPTRGTTTGSSSAPT 100
Query: 473 NGAV 484
V
Sbjct: 101 PSTV 104
>dbj|BAA90631.1| (AP001129) hypothetical protein [Oryza sativa]
Length = 739
Score = 31.7 bits (70), Expect = 9.3
Identities = 20/61 (32%), Positives = 27/61 (43%)
Frame = +1
Query: 7 GHHEARCHRGSPDRPPFSR*HQRGVPVRRSRHRPGRPYSRPRDRL*SAQEPPDRPLQLQH 186
G H+ R RG+ DRPP S R + RR P P + P L + PP ++
Sbjct: 101 GRHQRR-RRGTLDRPPSSLLPPRAIDSRRLHDHPSNPDNIPLLPLPTLPSPPHPGTLIRR 159
Query: 187 P 189
P
Sbjct: 160 P 160
>emb|CAA07489.1| (AJ007394) mucin [Anopheles gambiae]
Length = 112
Score = 31.7 bits (70), Expect = 9.3
Identities = 20/54 (37%), Positives = 26/54 (48%)
Frame = +2
Query: 320 VCEASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCGGSSPVETNGA 481
V A+ T AP T + TTT V P ++ G+T T T S PV T G+
Sbjct: 29 VAPATTTVAPTTTTVAPTTTTTVAPTTTTTVAPGQT---TTTTVASGPVTTTGS 79
>ref|NP_002448.1| mucin 2, intestinal/tracheal
sp|Q02817|MUC2_HUMAN MUCIN 2 PRECURSOR (INTESTINAL MUCIN 2)
gb|AAB95295.1| (L21998) mucin [Homo sapiens]
Length = 5179
Score = 31.7 bits (70), Expect = 9.3
Identities = 23/61 (37%), Positives = 30/61 (48%), Gaps = 3/61 (4%)
Frame = +2
Query: 302 LASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCG---GSSPVET 472
L++L P E ++T P+ TT+G LS T +P GTPT G GSS T
Sbjct: 4238 LSTLPPAIEMTSTAPPSTPTAPTTTSG--GHTLSPPPSTTTSPPGTPTRGTTTGSSSAPT 4295
Query: 473 NGAV 484
V
Sbjct: 4296 PSTV 4299
>pir||A43932 mucin 2 precursor, intestinal - human (fragments)
Length = 3020
Score = 31.7 bits (70), Expect = 9.3
Identities = 23/61 (37%), Positives = 30/61 (48%), Gaps = 3/61 (4%)
Frame = +2
Query: 302 LASLVPVCEASATGAPAFTGLTATTTGVVPPVLSSADETGKTPAGTPTCG---GSSPVET 472
L++L P E ++T P+ TT+G LS T +P GTPT G GSS T
Sbjct: 2079 LSTLPPAIEMTSTAPPSTPTAPTTTSG--GHTLSPPPSTTTSPPGTPTRGTTTGSSSAPT 2136
Query: 473 NGAV 484
V
Sbjct: 2137 PSTV 2140
Database: nr
Posted date: Jun 11, 2000 9:46 PM
Number of letters in database: 159,777,284
Number of sequences in database: 509,459
Lambda K H
0.318 0.135 0.00
Gapped
Lambda K H
0.270 0.0470 4.94e-324
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 162539671
Number of Sequences: 509459
Number of extensions: 3010644
Number of successful extensions: 17817
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 17775
Number of HSP's gapped (non-prelim): 80
length of query: 275
length of database: 159,777,284
effective HSP length: 55
effective length of query: 219
effective length of database: 131,757,039
effective search space: 28854791541
effective search space used: 28854791541
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 70 (31.7 bits)
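
The figures in the statistics footer above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = search_space * 2^-S') and assumes the effective database length is the letter count minus one effective HSP length per sequence, which is consistent with the numbers reported here. The effective query length (219) is taken as reported rather than derived. All function and constant names are illustrative.

```python
import math

# Values copied from the statistics footer of this report.
DB_LETTERS = 159_777_284       # "Number of letters in database"
DB_SEQS = 509_459              # "Number of sequences in database"
HSP_LEN = 55                   # "effective HSP length"
EFF_QUERY_LEN = 219            # "effective length of query" (as reported)
UNGAPPED = (0.318, 0.135)      # ungapped Lambda, K
GAPPED = (0.270, 0.0470)       # gapped Lambda, K


def effective_db_length(letters, seqs, hsp_len):
    """Assumed edge correction: remove one HSP length per database sequence."""
    return letters - seqs * hsp_len


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


eff_db = effective_db_length(DB_LETTERS, DB_SEQS, HSP_LEN)
space = EFF_QUERY_LEN * eff_db

print("effective length of database:", eff_db)
print("effective search space:", space)
print("top hit bits (raw 221):", round(bit_score(221, *GAPPED), 1))
print("S2 bits (raw 70):", round(bit_score(70, *GAPPED), 1))
print("S1 bits (raw 41):", round(bit_score(41, *UNGAPPED), 1))
print("top hit E-value: %.1e" % expect_value(90.5, space))
```

Because Lambda and K are printed to only three significant figures, recomputed values can differ slightly from the reported ones; for example, the E-value for the second hit recomputed from the rounded 36.0 bits comes out near 0.42 rather than the reported 0.48.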