The query sequence for this search has been filtered. Filtering
eliminates low-complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the BLAST reports, leaving regions whose BLAST statistics
reflect the specificity of their pairwise alignment.

BLASTX 2.1.1 [Aug-8-2000]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Contig77.seq Contig77
         (823 letters)

Database: nr
           565,281 sequences; 177,575,912 total letters


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAA12721.1|  (D85079) homeobox [Strongylocentrotus purpu...    39  0.047
gb|AAD20910.2|  (AC006234) putative LRR receptor protein kin...    39  0.081
pir||T07907  hydroxyproline-rich glycoprotein GAS28 precurso...    35  1.2
gb|AAG10390.1|  (AY007501) enabled-like protein [Hirudo medi...    34  1.6
pir||T41547  hypothetical protein SPCC70.01 - fission yeast ...    34  1.6
pir||T41011  hypothetical protein SPCC1494.10 - fission yeas...    34  1.6
pir||T04652  hypothetical protein F10N7.260 - Arabidopsis th...    34  1.6
pir||T06336  proline-rich protein precursor - soybean >gi|35...    34  1.6
pir||T38547  probable cell division protein kinase - fission...    34  1.6
pir||S21323  probable endoglucanase - Ruminococcus flavefaci...    34  2.1
gb|AAC36357.1|  (AF091342) neurofilament-M subunit [Bos taurus]    34  2.1
pir||T19765  hypothetical protein C36A4.5 - Caenorhabditis e...    34  2.7
pir||H72695  hypothetical protein APE0984 - Aeropyrum pernix...    33  3.5
gb|AAF22175.1|AF136381_1  (AF136381) c-Cbl-associated protei...    33  3.5
pir||S42674  adhesive protein - bifurcate mussel (fragments)       33  4.6
gb|AAG01391.1|AF199499_1  (AF199499) period 2 [Xenopus laevis]     33  4.6
dbj|BAA78424.1|  (AB021264) polyprotein [Arabidopsis thaliana]     32  6.1
dbj|BAA97097.1|  (AP002460) En/Spm transposon protein-like [...    32  6.1
pir||T30049  hypothetical protein K06C4.8 - Caenorhabditis e...    32  6.1
pir||S54265  glycoprotein gC - bovine herpesvirus 5 >gi|8049...    32  6.1
pir||T01397  LTR gag/pol polyprotein homolog T4I9.16 - Arabi...    32  6.1
pir||I58377  LTG19 - human >gi|436042|dbj|BAA03406.1| (D1453...    32  6.1
pir||T21338  hypothetical protein F25D7.4 - Caenorhabditis e...    32  6.1
ref|NP_010086.1|  Component (p150) of COPII coat of secretor...    32  7.9
gb|AAA50367.1|  (U15219) Web1p [Saccharomyces cerevisiae]          32  7.9
gb|AAB58418.1|  (U37500) RNA polymerase II largest subunit [...    32  7.9
gb|AAD41978.1|AC006438_10  (AC006438) unknown protein [Arabi...    32  7.9
sp|P11414|RPB1_CRIGR  DNA-DIRECTED RNA POLYMERASE II LARGEST...    32  7.9
ref|NP_056200.1|  DKFZP586P1422 protein >gi|7512944|pir||T17...    32  7.9
gb|AAC41377.1|  (AF036382) MLL [Takifugu rubripes]                 32  7.9
pir||I38186  RNA polymerase II largest subunit - human >gi|8...    32  7.9
sp|P08775|RPB1_MOUSE  DNA-DIRECTED RNA POLYMERASE II LARGEST...    32  7.9
dbj|BAA22376.1|  (D87293) RNA polymerase II largest subunit ...    32  7.9
ref|NP_000928.1|  polymerase (RNA) II (DNA directed) polypep...    32  7.9
ref|NP_033115.1|  RNA polymerase II 1 >gi|90464|pir||A28490 ...    32  7.9
gb|AAF04257.1|  (AF139114) subtilisin-like serine protease [...    32  7.9
pir||F75513  ABC transporter, ATP-binding protein, EF-3 fami...    32  7.9
sp|Q09729|YA4C_SCHPO  HYPOTHETICAL 65.9 KD PROTEIN C31A2.12 ...    32  7.9
ref|NP_056526.1|  glioma tumor suppressor candidate region g...    32  7.9

>dbj|BAA12721.1| (D85079) homeobox [Strongylocentrotus purpuratus]
          Length = 405

 Score = 39.5 bits (90), Expect = 0.047
 Identities = 26/76 (34%), Positives = 38/76 (49%), Gaps = 7/76 (9%)
 Frame = +3

Query: 279 RLMLLPHRECHLPIHHPLKSPARSALQAPTPTRLLTNDANHQTSRKPPSQPRQQ------ 440
           ++ L P R    P HHP + PA  A Q   P     +D +++T   PPSQPR +      
Sbjct: 23  KMRLSPDR----PSHHPARPPASLASQPLRPAHYQDDDDSYRTHTPPPSQPRTKGFSIES 78

Query: 441 -TTLFAKHRSLPKLARPTRALPS 506
             +   KHR  P+    + + PS
Sbjct: 79  ILSPTDKHRKSPRSPPMSPSSPS 101
>gb|AAD20910.2| (AC006234) putative LRR receptor protein kinase [Arabidopsis
           thaliana]
          Length = 743

 Score = 38.7 bits (88), Expect = 0.081
 Identities = 34/93 (36%), Positives = 41/93 (43%)
 Frame = +2

Query: 182 FNQPSPKFYLSIPNFYIYASRTIKQNITIMASPPYATSPSGMSPTYPSPAQIPSKKRSSG 361
           FN P P+  LSIPNF    +     N+TI  SP   T PS  SP  P     PS   S+G
Sbjct: 228 FNGPIPEKLLSIPNFIKGGNLF---NVTIAPSPSPETPPSPTSPKRPFFGP-PSPNASAG 283

Query: 362 ADANAPAHKRRKPSNLSQASVXXXXXNHPLRQT 460
              +  AH R  PS+           +HP R T
Sbjct: 284 ---HGQAHVRSPPSD-----------HHPSRPT 302
>pir||T07907 hydroxyproline-rich glycoprotein GAS28 precursor - Chlamydomonas
           reinhardtii
 gb|AAB69862.1| (AF015883) hydroxyproline-rich glycoprotein gas28p precursor
           [Chlamydomonas reinhardtii]
          Length = 446

 Score = 34.8 bits (78), Expect = 1.2
 Identities = 31/91 (34%), Positives = 37/91 (40%), Gaps = 7/91 (7%)
 Frame = +2

Query: 233 YASRTIKQN-----ITIM--ASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKR 391
           Y SR   Q+     ++I+  A PP+  SPS  SP  PSP   PS         + P    
Sbjct: 200 YTSRLFNQDRDCCPVSILGNAPPPHVPSPSPPSPPSPSPPPPPSPPPPPPPTPSPP---- 255

Query: 392 RKPSNLSQASVXXXXXNHPLRQTSFPPEARSPYPRSPF 505
             P  L  A         P      PP A  P PRS F
Sbjct: 256 --PPELPPAQPDAPARKRP------PPPASPPPPRSDF 285
>gb|AAG10390.1| (AY007501) enabled-like protein [Hirudo medicinalis]
          Length = 509

 Score = 34.4 bits (77), Expect = 1.6
 Identities = 30/71 (42%), Positives = 35/71 (49%), Gaps = 12/71 (16%)
 Frame = +2

Query: 272 ASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAP-------AHKRRKPS-----NLSQ 415
           ASPP A SPS M+P  P+P   P     S A A  P       A K  KPS     N + 
Sbjct: 255 ASPPVAPSPSAMAPPPPAPPPPP----PSPAPAAPPPPPLPDVAQKSGKPSTSPVGNNNV 310

Query: 416 ASVXXXXXNHPLRQTSFPPEARS 484
            S+     +  LR  S PP   S
Sbjct: 311 NSLAAALKSAQLRSVSKPPNQAS 333
>pir||T41547 hypothetical protein SPCC70.01 - fission yeast (Schizosaccharomyces
           pombe)
 emb|CAA19351.1| (AL023794) hypothetical protein [Schizosaccharomyces pombe]
          Length = 964

 Score = 34.4 bits (77), Expect = 1.6
 Identities = 25/104 (24%), Positives = 48/104 (46%), Gaps = 10/104 (9%)
 Frame = +2

Query: 176 LRFNQPSPKFYLSIPNFYIYASRTIKQNI----TIMASPPYATSPSGM------SPTYPS 325
           ++   PS +  ++ P   + AS+ +  +      ++ +P    +PS +      +PT P 
Sbjct: 183 IQLYDPSTQRQMARPMSNLQASQPVPSSTFSRSAVVPNPSLPLNPSVLQGQVMNNPTIPK 242

Query: 326 PAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPLRQTSFPPEARSP 487
               PS        +  P+H  + P N   AS      NHP++ ++F P   +P
Sbjct: 243 GT--PSTSIEGAKTSIPPSHAMQNPHNSFPASADRLQKNHPVQSSNFNPYTPAP 294
>pir||T41011 hypothetical protein SPCC1494.10 - fission yeast
           (Schizosaccharomyces pombe) (fragment)
 emb|CAA19308.1| (AL023776) hypothetical protein [Schizosaccharomyces pombe]
          Length = 513

 Score = 34.4 bits (77), Expect = 1.6
 Identities = 25/104 (24%), Positives = 48/104 (46%), Gaps = 10/104 (9%)
 Frame = +2

Query: 176 LRFNQPSPKFYLSIPNFYIYASRTIKQNI----TIMASPPYATSPSGM------SPTYPS 325
           ++   PS +  ++ P   + AS+ +  +      ++ +P    +PS +      +PT P 
Sbjct: 183 IQLYDPSTQRQMARPMSNLQASQPVPSSTFSRSAVVPNPSLPLNPSVLQGQVMNNPTIPK 242

Query: 326 PAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPLRQTSFPPEARSP 487
               PS        +  P+H  + P N   AS      NHP++ ++F P   +P
Sbjct: 243 GT--PSTSIEGAKTSIPPSHAMQNPHNSFPASADRLQKNHPVQSSNFNPYTPAP 294
>pir||T04652 hypothetical protein F10N7.260 - Arabidopsis thaliana
 emb|CAA16596.1| (AL021636) hypothetical protein [Arabidopsis thaliana]
 emb|CAB79911.1| (AL161580) hypothetical protein [Arabidopsis thaliana]
          Length = 191

 Score = 34.4 bits (77), Expect = 1.6
 Identities = 20/68 (29%), Positives = 33/68 (48%)
 Frame = +3

Query: 252 NKTSQSWLLRLMLLPHRECHLPIHHPLKSPARSALQAPTPTRLLTNDANHQTSRKPPSQP 431
           ++T  SW+    LLP+ +       P KSP RS +      R++ N+  +Q+   PP QP
Sbjct: 19  SRTLISWVRCKSLLPNHQSRDVTTSPAKSPFRSNI-----LRIIRNEIEYQSDYAPPHQP 73

Query: 432 RQQTTLFA 455
             +   F+
Sbjct: 74  ATEFKSFS 81
>pir||T06336 proline-rich protein precursor - soybean
 gb|AAC34889.1| (AF086759) proline-rich protein precursor [Glycine max]
          Length = 338

 Score = 34.4 bits (77), Expect = 1.6
 Identities = 25/106 (23%), Positives = 39/106 (36%)
 Frame = +2

Query: 185 NQPSPKFYLSIPNFYIYASRTIKQNITIMASPPYATSPSGMSPTYPSPAQIPSKKRSSGA 364
           NQPS     ++P     +     Q++T+   P    +P     + P  +Q+P+    S  
Sbjct: 75  NQPSVPVSSAVPALSHQSQPMAAQSLTMPQQPKGHLAPQVALASLPQSSQLPNIPSPS-- 132

Query: 365 DANAPAHKRRKPSNLSQASVXXXXXNHPLRQTSFPPEARSPYPRSP 502
                 H   +P + +Q S       HPL    FP     P  R P
Sbjct: 133 -----LHSLSQPLHPTQMSTASSHLQHPLLTPGFPHMPLPPQIRQP 173
>pir||T38547 probable cell division protein kinase - fission yeast
           (Schizosaccharomyces pombe)
 emb|CAB16269.1| (Z99165) putative cell division protein kinase [Schizosaccharomyces
           pombe]
          Length = 593

 Score = 34.4 bits (77), Expect = 1.6
 Identities = 25/89 (28%), Positives = 41/89 (45%), Gaps = 3/89 (3%)
 Frame = +2

Query: 218 PNFYIYASRTIKQNITI---MASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHK 388
           P+FY   S+   +N+T     A    ++S + +SP  PS  +  SK+++S    + P+  
Sbjct: 113 PSFYRSGSQKRARNLTTKDYFAKRSESSSSASVSPISPSANRNDSKRQASSFRRSPPSSV 172

Query: 389 RRKPSNLSQASVXXXXXNHPLRQTSFPPEARS 484
             KPS  +   V     + P    S P E  S
Sbjct: 173 HMKPSAFNGRKVSRRPSSSPPPIPSIPHETTS 204
>pir||S21323 probable endoglucanase - Ruminococcus flavefaciens
 emb|CAA39559.1| (X56082) endo-glucanase [Ruminococcus flavefaciens]
          Length = 680

 Score = 34.0 bits (76), Expect = 2.1
 Identities = 19/45 (42%), Positives = 25/45 (55%)
 Frame = +2

Query: 263 TIMASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRK 397
           T  +SPP + SPS  SPT PSP+   S   SS A   +P  + R+
Sbjct: 11  TPSSSPPTSPSPSPTSPTPPSPSTESSPTPSSPASPRSPTVRCRE 55
>gb|AAC36357.1| (AF091342) neurofilament-M subunit [Bos taurus]
          Length = 810

 Score = 34.0 bits (76), Expect = 2.1
 Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%)
 Frame = +2

Query: 272 ASPPYATSPSGMSPTYPSP-AQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHP 448
           A  P A SP+  SPT  SP A+ P  K  +     A +   + P+  S  +      +  
Sbjct: 502 AKSPVAKSPTTKSPTAKSPEAKSPEAKSPTAKSPTAKSPVAKSPTAKSPEAKSPEAKSPT 561

Query: 449 LRQ-TSFPPEARSPYPRSP 502
            +  T+  P A+SP P+SP
Sbjct: 562 AKSPTAKSPAAKSPAPKSP 580
>pir||T19765 hypothetical protein C36A4.5 - Caenorhabditis elegans
 emb|CAA91271.1| (Z66495) similar to claustrin like~cDNA EST yk541h12.5 comes from
           this gene [Caenorhabditis elegans]
          Length = 954

 Score = 33.6 bits (75), Expect = 2.7
 Identities = 26/81 (32%), Positives = 36/81 (44%), Gaps = 3/81 (3%)
 Frame = +2

Query: 275 SPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPLR 454
           S P AT+P+  S    +PA  P   R   +D          P + +Q S        P  
Sbjct: 438 SAPKATAPASTSS---APATAPEASRRDPSDVTIVLDDSLFPEDFNQGSAPMDIVVIP-- 492

Query: 455 QTSFPP---EARSPYPRSPFVDAQ 517
            T  PP   EAR+ +P+ P VDA+
Sbjct: 493 PTPEPPRLEEARATHPKEPIVDAE 516
>pir||H72695 hypothetical protein APE0984 - Aeropyrum pernix (strain K1)
 dbj|BAA79968.1| (AP000060) 191aa long hypothetical protein [Aeropyrum pernix]
          Length = 191

 Score = 33.2 bits (74), Expect = 3.5
 Identities = 16/36 (44%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
 Frame = +3

Query: 315 PIHH--PLKSPARSALQAPTPTRLLTNDANHQTSRKPP 422
           P HH  P  +P   A + P P R+L N   H T+R PP
Sbjct: 38  PRHHLQPRPNPTGQAQRLPRPDRVLHNPRRHNTARVPP 75
>gb|AAF22175.1|AF136381_1 (AF136381) c-Cbl-associated protein SH3P12 [Homo sapiens]
          Length = 1004

 Score = 33.2 bits (74), Expect = 3.5
 Identities = 25/77 (32%), Positives = 29/77 (37%)
 Frame = +2

Query: 272 ASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPL 451
           A PP    P G  PT P+ A   S       D   P H +R P +   AS        P+
Sbjct: 168 ARPPTPLGPLGCVPTIPATASAASPLTFPTLDDFIPPHLQRWPHHSQPASACGSFA--PI 225

Query: 452 RQTSFPPEARSPYPRSP 502
            QT  PP    P P  P
Sbjct: 226 SQT--PPSFSPPPPLVP 240
>pir||S42674 adhesive protein - bifurcate mussel (fragments)
          Length = 219

 Score = 32.8 bits (73), Expect = 4.6
 Identities = 17/58 (29%), Positives = 24/58 (41%)
 Frame = +2

Query: 233 YASRTIKQNITIMASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSN 406
           Y ++         A   Y+T PS     Y +PA+ P+K  S G     P     KPS+
Sbjct: 156 YTTKPSSYGTGYKAPTKYSTKPSSYGTGYKAPAKYPTKPSSYGTGYKYPTKYTTKPSS 213
>gb|AAG01391.1|AF199499_1 (AF199499) period 2 [Xenopus laevis]
          Length = 1427

 Score = 32.8 bits (73), Expect = 4.6
 Identities = 27/79 (34%), Positives = 37/79 (46%), Gaps = 12/79 (15%)
 Frame = +2

Query: 281  PYATSPSGMSPTYPSP--------AQIPSKKRSSGADANAPAHKRRK----PSNLSQASV 424
            PY   P+   P +PSP        AQIPS   S+     AP          P+ +  AS+
Sbjct: 1001 PYPVLPAYPLPVFPSPQIPVPNETAQIPSANASTSQPFPAPLLPPMVALVLPNYVYPASL 1060

Query: 425  XXXXXNHPLRQTSFPPEARSPYPRSPFVDAQ 517
                   P  Q +FP +  S  P+S F  +Q
Sbjct: 1061 PTSLYPGPAPQPAFPAQQTSYLPQSTFPASQ 1091
>dbj|BAA78424.1| (AB021264) polyprotein [Arabidopsis thaliana]
          Length = 1330

 Score = 32.5 bits (72), Expect = 6.1
 Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 8/76 (10%)
 Frame = +2

Query: 275 SPPYATSPSGMSPTYPSPAQIPSKKRS--SGADANAPAHK----RRKPSNLSQASVXXXX 436
           SP   +SPS +  T  S + +PS   S  S ++  AP+H       +P     ++     
Sbjct: 651 SPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPI 710

Query: 437 XNHPLRQTSFP--PEARSPYPRSP 502
            N+P   +  P  P   SP P+SP
Sbjct: 711 LNNPNPNSPSPNSPNQNSPLPQSP 734
>dbj|BAA97097.1| (AP002460) En/Spm transposon protein-like [Arabidopsis thaliana]
          Length = 504

 Score = 32.5 bits (72), Expect = 6.1
 Identities = 25/84 (29%), Positives = 39/84 (45%), Gaps = 4/84 (4%)
 Frame = +2

Query: 272 ASPP--YATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNH 445
           +SPP  YA +P+  +   PS   IP     +GA ++AP ++   P       +     N 
Sbjct: 32  SSPPQQYAFTPAATTVLVPS--SIPPLGAGAGASSSAPHYRNYPPPQ----QLFQHSTNQ 85

Query: 446 PLRQTSFPPE--ARSPYPRSPFVDAQAH 523
           P R    PP+  A+   P SP  +  +H
Sbjct: 86  PQRVDPLPPQETAQQDPPLSPDPETPSH 113
>pir||T30049 hypothetical protein K06C4.8 - Caenorhabditis elegans
 gb|AAF98228.1| (U64843) similar to family 1 of G-protein coupled receptors
           [Caenorhabditis elegans]
          Length = 355

 Score = 32.5 bits (72), Expect = 6.1
 Identities = 14/30 (46%), Positives = 16/30 (52%)
 Frame = +3

Query: 57  PRIGISKYVFFPSYRSASSVHHSFRAFSLL 146
           P  G  K+ FFP Y    SVH  F AF L+
Sbjct: 74  PSFGTYKFPFFPGYHEIGSVHTIFSAFCLI 103
>pir||S54265 glycoprotein gC - bovine herpesvirus 5
 emb|CAA89199.1| (Z49224) glycoprotein gC [Bovine herpesvirus 5]
          Length = 487

 Score = 32.5 bits (72), Expect = 6.1
 Identities = 24/91 (26%), Positives = 36/91 (39%)
 Frame = +2

Query: 230 IYASRTIKQNITIMASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNL 409
           + A R + +     ASPP       +SP+   P+  P++  + GA           PS+ 
Sbjct: 19  LLAGRGLAEEAEGEASPPPLPPRPSLSPSPSPPSPTPAETTNDGAGTLGAT--PTAPSHS 76

Query: 410 SQASVXXXXXNHPLRQTSFPPEARSPYPRSP 502
             A+      +     T  PP A S  PRSP
Sbjct: 77  PPATPKDSTTDETPLGTPAPPPANSTRPRSP 107
>pir||T01397 LTR gag/pol polyprotein homolog T4I9.16 - Arabidopsis thaliana
 gb|AAC79110.1| (AF069442) putative polyprotein of LTR transposon [Arabidopsis
            thaliana]
 emb|CAB77781.1| (AL161495) putative polyprotein of LTR transposon [Arabidopsis
            thaliana]
          Length = 1456

 Score = 32.5 bits (72), Expect = 6.1
 Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 8/76 (10%)
 Frame = +2

Query: 275  SPPYATSPSGMSPTYPSPAQIPSKKRS--SGADANAPAHK----RRKPSNLSQASVXXXX 436
            SP   +SPS +  T  S + +PS   S  S ++  AP+H       +P     ++     
Sbjct: 777  SPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPI 836

Query: 437  XNHPLRQTSFP--PEARSPYPRSP 502
             N+P   +  P  P   SP P+SP
Sbjct: 837  LNNPNPNSPSPNSPNQNSPLPQSP 860
>pir||I58377 LTG19 - human
 dbj|BAA03406.1| (D14539) LTG19 [Homo sapiens]
          Length = 559

 Score = 32.5 bits (72), Expect = 6.1
 Identities = 16/40 (40%), Positives = 23/40 (57%)
 Frame = +2

Query: 287 ATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSN 406
           +TSP G  P  P P    S KR + AD+  P+ K++K S+
Sbjct: 265 STSPKGGPPPPPPPPPRASSKRPATADSPKPSAKKQKKSS 304
>pir||T21338 hypothetical protein F25D7.4 - Caenorhabditis elegans
 emb|CAB01697.1| (Z78418) similar to claustrin like~cDNA EST yk385b10.3 comes from
           this gene~cDNA EST yk385b10.5 comes from this gene~cDNA
           EST CEMSH64F comes from this gene~cDNA EST CEMSH64R
           comes from this gene~cDNA EST yk576a11.3 comes from this
           gene~cDNA EST yk576a1>
          Length = 932

 Score = 32.5 bits (72), Expect = 6.1
 Identities = 26/81 (32%), Positives = 35/81 (43%), Gaps = 3/81 (3%)
 Frame = +2

Query: 275 SPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPLR 454
           S P AT+P+  S    +PA  P   R    D          P + +Q S        P  
Sbjct: 416 SAPKATAPASTSS---APATAPEASRRDPNDVTIVLDDSLFPEDFNQGSAPMDIVVIP-- 470

Query: 455 QTSFPP---EARSPYPRSPFVDAQ 517
            T  PP   EAR+ +P+ P VDA+
Sbjct: 471 PTPEPPRLEEARATHPKEPIVDAE 494
>ref|NP_010086.1| Component (p150) of COPII coat of secretory pathway vesicles; Sec31p
 sp|P38968|WEB1_YEAST WEB1 PROTEIN (PROTEIN TRANSPORT PROTEIN SEC31)
 pir||S58782 SEC31 protein - yeast (Saccharomyces cerevisiae)
 emb|CAA58252.1| (X83276) putative ORF [Saccharomyces cerevisiae]
 emb|CAA98772.1| (Z74243) ORF YDL195w [Saccharomyces cerevisiae]
          Length = 1273

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 30/94 (31%), Positives = 41/94 (42%), Gaps = 11/94 (11%)
 Frame = +2

Query: 233  YASRTIKQNITIMASP--PYATS-PSGMSPTY-----PSPAQIPSKKRSSGADANAPAHK 388
            Y +    +N+ ++ +P  P  TS PS  +P Y      S   +P K        +AP H 
Sbjct: 763  YTNAKTNKNVPVLPTPGMPSTTSIPSMQAPFYGMTPGASANALPPKPYVPATTTSAPVHT 822

Query: 389  RRK---PSNLSQASVXXXXXNHPLRQTSFPPEARSPYPRSPFVDA 514
              K   PS  S AS      N   R  SF P      P +P+  A
Sbjct: 823  EGKYAPPSQPSMASPFVNKTNSSTRLNSFAP------PPNPYATA 861
>gb|AAA50367.1| (U15219) Web1p [Saccharomyces cerevisiae]
          Length = 1273

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 30/94 (31%), Positives = 41/94 (42%), Gaps = 11/94 (11%)
 Frame = +2

Query: 233  YASRTIKQNITIMASP--PYATS-PSGMSPTY-----PSPAQIPSKKRSSGADANAPAHK 388
            Y +    +N+ ++ +P  P  TS PS  +P Y      S   +P K        +AP H 
Sbjct: 763  YTNAKTNKNVPVLPTPGMPSTTSIPSMQAPFYGMTPGASANALPPKPYVPATTTSAPVHT 822

Query: 389  RRK---PSNLSQASVXXXXXNHPLRQTSFPPEARSPYPRSPFVDA 514
              K   PS  S AS      N   R  SF P      P +P+  A
Sbjct: 823  EGKYAPPSQPSMASPFVNKTNSSTRLNSFAP------PPNPYATA 861
>gb|AAB58418.1| (U37500) RNA polymerase II largest subunit [Mus musculus]
          Length = 1966

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 22/76 (28%), Positives = 33/76 (42%), Gaps = 2/76 (2%)
 Frame = +2

Query: 275  SPPYATSPSG-MSPTYPSPAQIPSKKRSSGADANAPAHKRRKPS-NLSQASVXXXXXNHP 448
            S PY  SP G MSP+Y   +     +   G    +P++    PS + +  S      N+ 
Sbjct: 1578 SSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYS 1637

Query: 449  LRQTSFPPEARSPYPRSP 502
                S+ P + S  P SP
Sbjct: 1638 PTSPSYSPTSPSYSPTSP 1655
>gb|AAD41978.1|AC006438_10 (AC006438) unknown protein [Arabidopsis thaliana]
          Length = 727

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 21/79 (26%), Positives = 31/79 (38%)
 Frame = +2

Query: 266 IMASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNH 445
           ++ SPP A+SP    P + +P+               P HK + P    Q +        
Sbjct: 443 VVHSPPPASSPPTSPPVHSTPS---------------PVHKPQPPKESPQPNDPYDQSPV 487

Query: 446 PLRQTSFPPEARSPYPRSP 502
             R++  PP   SP P SP
Sbjct: 488 KFRRSPPPPPVHSPPPPSP 506
>sp|P11414|RPB1_CRIGR DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (RPB1)
 pir||A27677 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - Chinese
           hamster (fragment)
 gb|AAA37008.1| (M19538) RNA polymerase II largest subunit [Cricetulus griseus]
          Length = 467

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 22/76 (28%), Positives = 33/76 (42%), Gaps = 2/76 (2%)
 Frame = +2

Query: 275 SPPYATSPSG-MSPTYPSPAQIPSKKRSSGADANAPAHKRRKPS-NLSQASVXXXXXNHP 448
           S PY  SP G MSP+Y   +     +   G    +P++    PS + +  S      N+ 
Sbjct: 75  SSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYS 134

Query: 449 LRQTSFPPEARSPYPRSP 502
               S+ P + S  P SP
Sbjct: 135 PTSPSYSPTSPSYSPTSP 152
>ref|NP_056200.1| DKFZP586P1422 protein
 pir||T17257 hypothetical protein DKFZp586P1422.1 - human
 emb|CAB55947.1| (AL117472) hypothetical protein [Homo sapiens]
          Length = 816

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 24/77 (31%), Positives = 28/77 (36%)
 Frame = +2

Query: 272 ASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPL 451
           A PP    P G  PT P+ A   S       D   P H +R P +   A         P+
Sbjct: 145 ARPPTPLGPLGCVPTIPATASAASPLTFPTLDDFIPPHLQRWPHHSQPARASGSFA--PI 202

Query: 452 RQTSFPPEARSPYPRSP 502
            QT  PP    P P  P
Sbjct: 203 SQT--PPSFSPPPPLVP 217
>gb|AAC41377.1| (AF036382) MLL [Takifugu rubripes]
          Length = 4498

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 18/76 (23%), Positives = 34/76 (44%)
 Frame = +2

Query: 272  ASPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPL 451
            +S   ++S S  S +  SP+       S    ++AP+ ++++P N   +S        PL
Sbjct: 1494 SSSSSSSSSSSSSSSSSSPSSSAQNSPSQATQSHAPSQQQQRPVNSQASSKKDASPKIPL 1553

Query: 452  RQTSFPPEARSPYPRS 499
             +T   P+  S   +S
Sbjct: 1554 SETKRKPQQHSAQQQS 1569
>pir||I38186 RNA polymerase II largest subunit - human
 emb|CAA52862.1| (X74874) RNA polymerase II largest subunit [Homo sapiens]
          Length = 1970

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 22/76 (28%), Positives = 33/76 (42%), Gaps = 2/76 (2%)
 Frame = +2

Query: 275  SPPYATSPSG-MSPTYPSPAQIPSKKRSSGADANAPAHKRRKPS-NLSQASVXXXXXNHP 448
            S PY  SP G MSP+Y   +     +   G    +P++    PS + +  S      N+ 
Sbjct: 1578 SSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYS 1637

Query: 449  LRQTSFPPEARSPYPRSP 502
                S+ P + S  P SP
Sbjct: 1638 PTSPSYSPTSPSYSPTSP 1655
>sp|P08775|RPB1_MOUSE DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (RPB1)
          Length = 1970

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 22/76 (28%), Positives = 33/76 (42%), Gaps = 2/76 (2%)
 Frame = +2

Query: 275  SPPYATSPSG-MSPTYPSPAQIPSKKRSSGADANAPAHKRRKPS-NLSQASVXXXXXNHP 448
            S PY  SP G MSP+Y   +     +   G    +P++    PS + +  S      N+ 
Sbjct: 1578 SSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYS 1637

Query: 449  LRQTSFPPEARSPYPRSP 502
                S+ P + S  P SP
Sbjct: 1638 PTSPSYSPTSPSYSPTSP 1655
>dbj|BAA22376.1| (D87293) RNA polymerase II largest subunit [Cricetulus griseus]
          Length = 1970

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 22/76 (28%), Positives = 33/76 (42%), Gaps = 2/76 (2%)
 Frame = +2

Query: 275  SPPYATSPSG-MSPTYPSPAQIPSKKRSSGADANAPAHKRRKPS-NLSQASVXXXXXNHP 448
            S PY  SP G MSP+Y   +     +   G    +P++    PS + +  S      N+ 
Sbjct: 1578 SSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYS 1637

Query: 449  LRQTSFPPEARSPYPRSP 502
                S+ P + S  P SP
Sbjct: 1638 PTSPSYSPTSPSYSPTSP 1655
>ref|NP_000928.1| polymerase (RNA) II (DNA directed) polypeptide A (220kD)
 sp|P24928|RPB1_HUMAN DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (RPB1)
 pir||S21054 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - human
 emb|CAA45125.1| (X63564) RNA polymerase II largest subunit [Homo sapiens]
 dbj|BAA22377.1| (D87294) RNA polymerase II largest subunit [Cricetulus griseus]
          Length = 1970

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 22/76 (28%), Positives = 33/76 (42%), Gaps = 2/76 (2%)
 Frame = +2

Query: 275  SPPYATSPSG-MSPTYPSPAQIPSKKRSSGADANAPAHKRRKPS-NLSQASVXXXXXNHP 448
            S PY  SP G MSP+Y   +     +   G    +P++    PS + +  S      N+ 
Sbjct: 1578 SSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYS 1637

Query: 449  LRQTSFPPEARSPYPRSP 502
                S+ P + S  P SP
Sbjct: 1638 PTSPSYSPTSPSYSPTSP 1655
>ref|NP_033115.1| RNA polymerase II 1
 pir||A28490 DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain - mouse
 gb|AAA40071.1| (M12130) RNA polymerase II [Mus musculus]
          Length = 1932

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 22/76 (28%), Positives = 33/76 (42%), Gaps = 2/76 (2%)
 Frame = +2

Query: 275  SPPYATSPSG-MSPTYPSPAQIPSKKRSSGADANAPAHKRRKPS-NLSQASVXXXXXNHP 448
            S PY  SP G MSP+Y   +     +   G    +P++    PS + +  S      N+ 
Sbjct: 1540 SSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPTSPSYSPTSPNYS 1599

Query: 449  LRQTSFPPEARSPYPRSP 502
                S+ P + S  P SP
Sbjct: 1600 PTSPSYSPTSPSYSPTSP 1617
>gb|AAF04257.1| (AF139114) subtilisin-like serine protease [Neospora caninum]
          Length = 865

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 24/81 (29%), Positives = 33/81 (40%)
 Frame = +2

Query: 275 SPPYATSPSGMSPTYPSPAQIPSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPLR 454
           SP   + P G SP  PSP + PS+ R   A   +P     +PS             HP  
Sbjct: 749 SPSKPSPPEGSSPRVPSPHRHPSRSRLPSAVEPSPPPASPQPS------------PHPSP 796

Query: 455 QTSFPPEARSPYPRSPFVDAQ 517
             + P +  +P P SP  D +
Sbjct: 797 PDTSPTKPSTP-PPSPSQDPE 816
>pir||F75513 ABC transporter, ATP-binding protein, EF-3 family - Deinococcus
           radiodurans (strain R1)
 gb|AAF10056.1|AE001907_2 (AE001907) ABC transporter, ATP-binding protein, EF-3 family
           [Deinococcus radiodurans]
          Length = 649

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 20/51 (39%), Positives = 24/51 (46%), Gaps = 1/51 (1%)
 Frame = +2

Query: 338 PSKKRSSGADANAPAHKRRKPSNLSQASVXXXXXNHPLRQTSFP-PEARSPY 490
           P KKR  G  A AP  +RR   +  +        N P  +TSFP PE R  Y
Sbjct: 12  PGKKRRGGQAAKAPRARRRNCVDFLRLDFSLHKFNGPGVRTSFPLPEQRVGY 63
>sp|Q09729|YA4C_SCHPO HYPOTHETICAL 65.9 KD PROTEIN C31A2.12 IN CHROMOSOME I
 pir||S58106 hypothetical protein SPAC31A2.12 - fission yeast
           (Schizosaccharomyces pombe)
 pir||T38610 hypothetical 65.9 kd protein - fission yeast (Schizosaccharomyces
           pombe)
 emb|CAA90470.1| (Z50113) hypothetical 65.9 kd protein. [Schizosaccharomyces pombe]
          Length = 596

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 22/65 (33%), Positives = 36/65 (54%), Gaps = 5/65 (7%)
 Frame = +2

Query: 269 MASPPYATSPSG-MSPTYPSPAQIPSKKR----SSGADANAPAHKRRKPSNLSQASVXXX 433
           + +PP  TSPS  +SPT  +   + S++     +S +   +P+H     ++LSQAS    
Sbjct: 500 IVTPPQRTSPSFFVSPTESTRQSLDSRRSFEHSTSSSSGISPSHSSASLAHLSQASNPNG 559

Query: 434 XXNHPLRQTS 463
             + P R TS
Sbjct: 560 SSSAPHRPTS 569
>ref|NP_056526.1| glioma tumor suppressor candidate region gene 1
 gb|AAF62874.1| (AF182077) glioma tumor suppressor candidate region protein 1 [Homo
            sapiens]
          Length = 1509

 Score = 32.1 bits (71), Expect = 7.9
 Identities = 27/101 (26%), Positives = 44/101 (42%), Gaps = 10/101 (9%)
 Frame = +2

Query: 191  PSPKFYLSIPNFYIYASRTIKQNITIMASPPYATSPSGMSPTYPSPAQIPSKKRSSGADA 370
            P P+   ++P  ++  ++        +  PP A++P+   PT P P Q P + +S   + 
Sbjct: 772  PPPQAPPTLPGIFVIQNQ--------LGVPPPASNPA---PTAPGPPQPPLRPQSQPPEG 820

Query: 371  NAPAHKRRKPSNLSQASVXXXXXNHPLR-------QTSFPPEA---RSPYP 493
              P      PS+ S A       +  L        Q  FPP     +SP P
Sbjct: 821  PLPPAPHLPPSSTSSAVASSSETSSRLPAPTPSDFQLQFPPSQGPHKSPTP 871
  Database: nr
    Posted date:  Sep 29, 2000  9:53 PM
  Number of letters in database: 177,575,912
  Number of sequences in database:  565,281

Lambda     K      H
   0.318    0.135     0.00

Gapped
Lambda     K      H
   0.270   0.0470  4.94e-324


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 198067664
Number of Sequences: 565281
Number of extensions: 3903862
Number of successful extensions: 18110
Number of sequences better than 10.0: 78
Number of HSP's better than 10.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 47
Number of HSP's that attempted gapping in prelim test: 18024
Number of HSP's gapped (non-prelim): 134
length of query: 274
length of database: 177,575,912
effective HSP length: 54
effective length of query: 219
effective length of database: 147,050,738
effective search space: 32204111622
effective search space used: 32204111622
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 70 (31.7 bits)
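
The bit scores and search-space figures in this report can be re-derived from the statistical parameters listed at the end of the report. The sketch below is illustrative only and is not part of the BLAST output. It assumes the standard Karlin-Altschul conversion S' = (lambda * S - ln K) / ln 2, and assumes that the effective database length is the number of letters minus the effective HSP length once per sequence. The function names are hypothetical, and the effective query length (219) is taken as reported rather than recomputed.

```python
import math


def bit_score(raw_score: int, lam: float, k: float) -> float:
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(db_letters: int, db_seqs: int, hsp_len: int) -> int:
    """Assumed correction: subtract the effective HSP length once per sequence."""
    return db_letters - db_seqs * hsp_len


def effective_search_space(eff_query_len: int, eff_db_len: int) -> int:
    """Effective search space is the product of the two effective lengths."""
    return eff_query_len * eff_db_len


# Values copied from this report (gapped Lambda and K, database totals).
GAPPED_LAMBDA, GAPPED_K = 0.270, 0.0470

print(round(bit_score(90, GAPPED_LAMBDA, GAPPED_K), 1))  # first hit, raw score 90
print(round(bit_score(88, GAPPED_LAMBDA, GAPPED_K), 1))  # second hit, raw score 88

eff_db = effective_db_length(177_575_912, 565_281, 54)
print(eff_db)
print(effective_search_space(219, eff_db))
```

With the rounded Lambda and K printed in the report, the first two hits come out at 39.5 and 38.7 bits, and the product of the effective lengths equals the reported effective search space. Expect values are not recomputed here, because the report does not show enough to confirm how this BLAST version derives them for each subject sequence.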