The query sequence for this search has been filtered. Filtering
masks low-complexity regions, which commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Masking removes these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the BLAST reports, leaving regions whose BLAST statistics
reflect the specificity of their pairwise alignment.
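
The idea of masking compositionally biased stretches can be illustrated with a minimal sketch. This is not the filter BLAST itself applies (BLAST commonly uses SEG for protein sequences); it is a simplified sliding-window Shannon-entropy mask. The function names, window size, and threshold are hypothetical values chosen only for illustration.

```python
import math
from collections import Counter


def shannon_entropy(window: str) -> float:
    """Shannon entropy (bits per residue) of the residue composition of a window."""
    counts = Counter(window)
    n = len(window)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def mask_low_complexity(seq: str, window: int = 12,
                        threshold: float = 2.2, mask_char: str = "X") -> str:
    """Replace residues in low-entropy windows with mask_char.

    Assumption: `window` and `threshold` are illustrative defaults,
    not the parameters of any real BLAST filter.
    """
    seq = seq.upper()
    masked = list(seq)
    # Slide a fixed-size window along the sequence; any window whose
    # composition is too uniform is masked in full.
    for start in range(0, len(seq) - window + 1):
        if shannon_entropy(seq[start:start + window]) < threshold:
            for i in range(start, start + window):
                masked[i] = mask_char
    return "".join(masked)


if __name__ == "__main__":
    # A diverse stretch, a poly-Q run, then another diverse stretch.
    example = "ACDEFGHIKLMNPQRSTVWY" + "Q" * 15 + "ACDEFGHIKLMNPQRSTVWY"
    print(mask_low_complexity(example))
```

A repeat such as a poly-Q or proline-rich run has low entropy and is replaced by `X`, so it can no longer seed or extend an alignment, while diverse flanking sequence is left intact apart from a few residues adjacent to the run.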

BLASTX 2.1.1 [Aug-8-2000]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Contig23.seq Contig23
         (633 letters)

Database: nr
           565,281 sequences; 177,575,912 total letters


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T25507  hypothetical protein C04E6.6 - Caenorhabditis e...    35  0.87
pir||T02504  hypothetical protein T19C21.10 - Arabidopsis th...    35  0.87
ref|NP_009572.1|  Ybr016wp >gi|586474|sp|P38216|YBM6_YEAST H...    34  1.5
dbj|BAB08888.1|  (AB012243) gene_id:MIJ24.6~ref|NP_013897.1~...    33  2.6
gb|AAF89556.1|AF165312_1  (AF165312) pre-T-cell receptor alp...    33  3.4
emb|CAB66922.1|  (AL132965) putative protein [Arabidopsis th...    33  3.4
gb|AAB18373.1|  (U38996) pre TCR alpha [Homo sapiens]              33  3.4
gb|AAB06194.1|  (U36759) pre-T cell receptor alpha-type chai...    33  3.4
gb|AAC83346.1|  (AF084941) pre-T cell receptor alpha chain 1...    33  3.4
pir||T10265  arabinogalactan-protein AGP2 - Persian tobacco ...    33  3.4
pir||T29074  hypothetical protein SC1C2.25c - Streptomyces c...    32  4.4
emb|CAB95018.1|  (AJ252020) argininosuccinase and n-acetylgl...    32  7.6
emb|CAB95024.1|  (AJ252021) argininosuccinase and n-acetylgl...    32  7.6
ref|NP_033306.1|  synovial sarcoma, translocated to X chromo...    31  9.9
ref|NP_033377.1|  telomerase associated protein 1 >gi|751384...    31  9.9

>pir||T25507 hypothetical protein C04E6.6 - Caenorhabditis elegans
 gb|AAB52333.1| (U97012) contains similarity to a ground domain, also weakly
           similar to drosophila fork head domain transcription
           factor SLP1 (SP:P32030) [Caenorhabditis elegans]
          Length = 518

 Score = 34.8 bits (78), Expect = 0.87
 Identities = 25/79 (31%), Positives = 33/79 (41%)
 Frame = -3

Query: 526 LPEHAHPDQLDTYRQSHATDRTAVNNFPDDNNAYNKYENNPLGYQEPNHGRQDGYGQIHY 347
           LP H +  +++T   +H T R  V +         K+ NN   YQ P    Q GY Q  +
Sbjct: 94  LPIHVNKSKINT---NHKTKRQGVYSALAQPPPAAKWGNNRPSYQNPTPSYQQGYPQ-QF 149

Query: 346 PEPQYGVTQPPVRQAENPN 290
           P PQ      P  Q  N N
Sbjct: 150 PAPQRQQQYQPQPQTTNYN 168
>pir||T02504 hypothetical protein T19C21.10 - Arabidopsis thaliana
 gb|AAC28763.1| (AC004683) unknown protein [Arabidopsis thaliana]
          Length = 671

 Score = 34.8 bits (78), Expect = 0.87
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 5/54 (9%)
 Frame = -3

Query: 445 PDDNNAYNKYENNPL--GYQEPNHGRQD-GYGQIHYPEPQ--YGVTQPPVRQAENPNYR 284
           P     Y++++ +    GY +P H +Q  GY Q+  P+PQ  Y  +QP  +    P+ R
Sbjct: 491 PQAQQGYSQHQQHQQQQGYSQPQHSQQQQGYSQLQQPQPQQGYSQSQPQAQVQMQPSTR 549
>ref|NP_009572.1| Ybr016wp
 sp|P38216|YBM6_YEAST HYPOTHETICAL 14.6 KD PROTEIN IN TTP1-KAP104 INTERGENIC REGION
 pir||S45871 probable membrane protein YBR016w - yeast (Saccharomyces
           cerevisiae)
 emb|CAA84958.1| (Z35885) ORF YBR016w [Saccharomyces cerevisiae]
          Length = 128

 Score = 34.0 bits (76), Expect = 1.5
 Identities = 28/107 (26%), Positives = 41/107 (38%)
 Frame = -3

Query: 619 DDYTSGYGAKGGFFSRIFRRRKNTAAGNDDALPEHAHPDQLDTYRQSHATDRTAVNNFPD 440
           +DY  G   +   +SR      ++A  N      +  P Q   Y Q         N    
Sbjct: 4   NDYYGGTAGEKSQYSRPSNPPPSSAHQNKTQERGYP-PQQQQQYYQQQQQHPGYYNQQGY 62

Query: 439 DNNAYNKYENNPLGYQEPNHGRQDGYGQIHYPEPQYGVTQPPVRQAE 299
           +   YN+   N  GY +  + +Q GY Q  + +P Y   QPP R  E
Sbjct: 63  NQQGYNQQGYNQQGYNQQGYNQQ-GYNQQGHQQPVYVQQQPPQRGNE 108
>dbj|BAB08888.1| (AB012243) gene_id:MIJ24.6~ref|NP_013897.1~similar to unknown
           protein [Arabidopsis thaliana]
          Length = 381

 Score = 33.2 bits (74), Expect = 2.6
 Identities = 26/114 (22%), Positives = 45/114 (38%), Gaps = 6/114 (5%)
 Frame = -3

Query: 628 RHEDDYTSGYGAKGGFFSRIFRRRKNTAAGNDDALPEHAHPDQLDTYRQSHATDRTAVNN 449
           R E    SGYG +    S     RK +   +++    +  P    +  Q     + +   
Sbjct: 173 RPESGLGSGYGGR----SESEYERKPSYGRSEEQEEGYRKPSYGRSEEQEEGYRKPSYGR 228

Query: 448 FPDDNNAYNK------YENNPLGYQEPNHGRQDGYGQIHYPEPQYGVTQPPVRQAENPNY 287
             +    Y K       E    GY++P++GR +   +  Y +P YG +   V     P+Y
Sbjct: 229 SEEQEEGYRKPSYGRSEEEQEEGYRKPSYGRSEEQEEGSYRKPSYGRSDDQVESYIKPSY 288
>gb|AAF89556.1|AF165312_1 (AF165312) pre-T-cell receptor alpha chain [Homo sapiens]
          Length = 256

 Score = 32.8 bits (73), Expect = 3.4
 Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
 Frame = -2

Query: 386 KPRTPRRLWPDS----LPGAPVWRHSTTCSPG*EPKLPLR*WCLRSRLGA 249
           +P+   R W D+     PG+PVW   +  S    P  P + WC RSRL A
Sbjct: 187 RPQPRDRRWGDTPPGRKPGSPVWGEGSYLSS--YPTCPAQAWCSRSRLRA 234
>emb|CAB66922.1| (AL132965) putative protein [Arabidopsis thaliana]
          Length = 651

 Score = 32.8 bits (73), Expect = 3.4
 Identities = 15/38 (39%), Positives = 16/38 (41%)
 Frame = -3

Query: 406 PLGYQEPNHGRQDGYGQIHYPEPQYGVTQPPVRQAENP 293
           P GY  P  G   GY    YP PQY    PP    + P
Sbjct: 574 PAGYPPPQQGYGQGYPAQGYPPPQYPQGHPPQYPYQGP 611
>gb|AAB18373.1| (U38996) pre TCR alpha [Homo sapiens]
          Length = 269

 Score = 32.8 bits (73), Expect = 3.4
 Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
 Frame = -2

Query: 386 KPRTPRRLWPDS----LPGAPVWRHSTTCSPG*EPKLPLR*WCLRSRLGA 249
           +P+   R W D+     PG+PVW   +  S    P  P + WC RSRL A
Sbjct: 200 RPQPRDRRWGDTPPGRKPGSPVWGEGSYLSS--YPTCPAQAWCSRSRLRA 247
>gb|AAB06194.1| (U36759) pre-T cell receptor alpha-type chain precursor [Homo
           sapiens]
          Length = 281

 Score = 32.8 bits (73), Expect = 3.4
 Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
 Frame = -2

Query: 386 KPRTPRRLWPDS----LPGAPVWRHSTTCSPG*EPKLPLR*WCLRSRLGA 249
           +P+   R W D+     PG+PVW   +  S    P  P + WC RSRL A
Sbjct: 212 RPQPRDRRWGDTPPGRKPGSPVWGEGSYLSS--YPTCPAQAWCSRSRLRA 259
>gb|AAC83346.1| (AF084941) pre-T cell receptor alpha chain 1 precursor [Homo
           sapiens]
          Length = 281

 Score = 32.8 bits (73), Expect = 3.4
 Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
 Frame = -2

Query: 386 KPRTPRRLWPDS----LPGAPVWRHSTTCSPG*EPKLPLR*WCLRSRLGA 249
           +P+   R W D+     PG+PVW   +  S    P  P + WC RSRL A
Sbjct: 212 RPQPRDRRWGDTPPGRKPGSPVWGEGSYLSS--YPTCPAQAWCSRSRLRA 259
>pir||T10265 arabinogalactan-protein AGP2 - Persian tobacco
 gb|AAB35284.1| (S79359) arabinogalactan-protein, AGP [Nicotiana alata,
           cell-suspension culture filtrate, Peptide, 461 aa]
          Length = 461

 Score = 32.8 bits (73), Expect = 3.4
 Identities = 25/116 (21%), Positives = 45/116 (38%)
 Frame = -3

Query: 625 HEDDYTSGYGAKGGFFSRIFRRRKNTAAGNDDALPEHAHPDQLDTYRQSHATDRTAVNNF 446
           + + Y+  Y      FS  +    N    N+     + + +  + + +++       NN 
Sbjct: 282 NNNGYSQNYMNNNNGFSESYNNNNNNNNNNNVFSENYNNNNNNNVFSENY-------NNN 334

Query: 445 PDDNNAYNKYENNPLGYQEPNHGRQDGYGQIHYPEPQYGVTQPPVRQAENPNYRYD 278
            ++N  Y  Y NN  GY E N+ +   Y        + G++    R  EN  Y YD
Sbjct: 335 NNNNAFYENYNNNNNGYSE-NYNQASSYNNNDNTVERQGLSD--TRFLENGKYYYD 387
>pir||T29074 hypothetical protein SC1C2.25c - Streptomyces coelicolor
 emb|CAA19992.1| (AL031124) hypothetical protein SC1C2.25c [Streptomyces coelicolor
            A3(2)]
          Length = 1329

 Score = 32.5 bits (72), Expect = 4.4
 Identities = 26/84 (30%), Positives = 35/84 (40%), Gaps = 2/84 (2%)
 Frame = -3

Query: 529  ALPEHAHPDQLDTYRQSHATDRTAVNNFPDDN--NAYNKYENNPLGYQEPNHGRQDGYGQ 356
            A  E A PD      +   +  T     P  N  +A N+YE+    Y + +   QD YGQ
Sbjct: 973  ATAEFARPDFDAPAPRRDESQDTGQYAQPGQNQYDARNEYEDQ---YGQQSQYGQDQYGQ 1029

Query: 355  IHYPEPQYGVTQPPVRQAENPNYRYD 278
              Y   QYG   P   Q + P +  D
Sbjct: 1030 DQYAPGQYGQAGPGQDQYDRPRFGQD 1055
>emb|CAB95018.1| (AJ252020) argininosuccinase and n-acetylglutamate synthase
           [Moritella sp. 2674]
          Length = 470

 Score = 31.7 bits (70), Expect = 7.6
 Identities = 13/24 (54%), Positives = 19/24 (79%), Gaps = 1/24 (4%)
 Frame = -3

Query: 214 NQHRLIPGWM-LQKEQPVTYEHSCL 143
           +QH ++PG+  LQ+ QPVT+ H CL
Sbjct: 146 HQHTVLPGYTHLQRAQPVTFSHWCL 170
>emb|CAB95024.1| (AJ252021) argininosuccinase and n-acetylglutamate synthase
           [Moritella sp. 2693]
          Length = 629

 Score = 31.7 bits (70), Expect = 7.6
 Identities = 13/24 (54%), Positives = 19/24 (79%), Gaps = 1/24 (4%)
 Frame = -3

Query: 214 NQHRLIPGWM-LQKEQPVTYEHSCL 143
           +QH ++PG+  LQ+ QPVT+ H CL
Sbjct: 146 HQHTVLPGYTHLQRAQPVTFSHWCL 170
>ref|NP_033306.1| synovial sarcoma, translocated to X chromosome
 sp|Q62280|SSXT_MOUSE SSXT PROTEIN (SYT PROTEIN)
 emb|CAA63733.1| (X93357) homolog of human SYT [Mus musculus]
          Length = 418

 Score = 31.3 bits (69), Expect = 9.9
 Identities = 23/86 (26%), Positives = 32/86 (36%)
 Frame = -3

Query: 523 PEHAHPDQLDTYRQSHATDRTAVNNFPDDNNAYNKYENNPLGYQEPNHGRQDGYGQIHYP 344
           P+  +P Q   Y             +P    +Y   +  P G Q PN+ +  G     Y 
Sbjct: 341 PQQGYPPQQQQY--------PGQQGYPGQQQSYGPSQGGP-GPQYPNYPQGQGQQYGGYR 391

Query: 343 EPQYGVTQPPVRQAENPNYRYDDGVY 266
             Q G  QPP ++     Y YD G Y
Sbjct: 392 PTQPGPPQPPQQRP----YGYDQGQY 413
>ref|NP_033377.1| telomerase associated protein 1
 pir||T30987 telomerase-associated protein 1 - mouse
 gb|AAC53043.1| (U86137) telomerase protein-1 [Mus musculus]
          Length = 2629

 Score = 31.3 bits (69), Expect = 9.9
 Identities = 19/63 (30%), Positives = 28/63 (44%)
 Frame = -1

Query: 390 SQTTDAKTAMARFTTRSPSMASLNHLFARLRTQTTATMMVSTIAPRCKLQSWEVHKQKGT 211
           S TT  K  M R + ++   AS  HL    R Q  A M +  +  + K +   +HK +  
Sbjct: 595 SNTTLMKRIMIRNSKKNRRPASRKHLCTLTRRQLRAAMTIPVMYEQLKREKLRLHKARQW 654

Query: 210 NTD 202
           N D
Sbjct: 655 NCD 657

  Database: nr
    Posted date:  Sep 29, 2000  9:53 PM
  Number of letters in database: 177,575,912
  Number of sequences in database:  565,281

Lambda     K      H
   0.318    0.135     0.00

Gapped
Lambda     K      H
   0.270   0.0470 4.94e-324

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 187812843
Number of Sequences: 565281
Number of extensions: 4188074
Number of successful extensions: 12258
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 12248
Number of HSP's gapped (non-prelim): 17
length of query: 211
length of database: 177,575,912
effective HSP length: 52
effective length of query: 158
effective length of database: 148,181,300
effective search space: 23412645400
effective search space used: 23412645400
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 69 (31.3 bits)