The query sequence for this search has been filtered. Filtering
eliminates low-complexity regions, which commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Removing these potentially confounding
matches (e.g., hits against proline-rich regions or poly-A tails) from
the BLAST reports leaves regions whose BLAST statistics reflect the
specificity of their pairwise alignment.
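To illustrate what "low complexity" means in the notice above, here is a minimal sketch that masks windows whose residue composition has low Shannon entropy. It is an illustration only: the report does not say which filter program was applied, and the function names, window size, and threshold below are assumptions chosen for the example, not BLAST defaults.

```python
import math
from collections import Counter


def shannon_entropy(window):
    """Shannon entropy (bits per residue) of the residue composition."""
    counts = Counter(window)
    n = len(window)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def mask_low_complexity(seq, window=12, threshold=2.2, mask_char="X"):
    """Replace every residue covered by a low-entropy window with mask_char.

    window and threshold are illustrative values, not BLAST parameters.
    """
    masked = list(seq)
    for i in range(len(seq) - window + 1):
        if shannon_entropy(seq[i:i + window]) < threshold:
            for j in range(i, i + window):
                masked[j] = mask_char
    return "".join(masked)


if __name__ == "__main__":
    # A proline-only stretch has zero entropy and is masked completely.
    print(mask_low_complexity("PPPPPPPPPPPPPPPP"))
    # Twelve distinct residues have high entropy and pass through unchanged.
    print(mask_low_complexity("ACDEFGHIKLMN"))
```

A biased stretch such as a poly-proline run scores near zero entropy and is masked, while a compositionally diverse stretch is left alone; a production filter uses a more careful statistic than this sliding-window cutoff.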
BLASTX 2.1.1 [Aug-8-2000]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Contig23.seq Contig23
(633 letters)
Database: nr
565,281 sequences; 177,575,912 total letters
                                                                Score    E
Sequences producing significant alignments:                     (bits) Value
pir||T25507 hypothetical protein C04E6.6 - Caenorhabditis e...    35  0.87
pir||T02504 hypothetical protein T19C21.10 - Arabidopsis th...    35  0.87
ref|NP_009572.1| Ybr016wp >gi|586474|sp|P38216|YBM6_YEAST H...    34  1.5
dbj|BAB08888.1| (AB012243) gene_id:MIJ24.6~ref|NP_013897.1~...    33  2.6
gb|AAF89556.1|AF165312_1 (AF165312) pre-T-cell receptor alp...    33  3.4
emb|CAB66922.1| (AL132965) putative protein [Arabidopsis th...    33  3.4
gb|AAB18373.1| (U38996) pre TCR alpha [Homo sapiens]              33  3.4
gb|AAB06194.1| (U36759) pre-T cell receptor alpha-type chai...    33  3.4
gb|AAC83346.1| (AF084941) pre-T cell receptor alpha chain 1...    33  3.4
pir||T10265 arabinogalactan-protein AGP2 - Persian tobacco ...    33  3.4
pir||T29074 hypothetical protein SC1C2.25c - Streptomyces c...    32  4.4
emb|CAB95018.1| (AJ252020) argininosuccinase and n-acetylgl...    32  7.6
emb|CAB95024.1| (AJ252021) argininosuccinase and n-acetylgl...    32  7.6
ref|NP_033306.1| synovial sarcoma, translocated to X chromo...    31  9.9
ref|NP_033377.1| telomerase associated protein 1 >gi|751384...    31  9.9
>pir||T25507 hypothetical protein C04E6.6 - Caenorhabditis elegans
gb|AAB52333.1| (U97012) contains similarity to a ground domain, also weakly
similar to drosophila fork head domain transcription
factor SLP1 (SP:P32030) [Caenorhabditis elegans]
Length = 518
Score = 34.8 bits (78), Expect = 0.87
Identities = 25/79 (31%), Positives = 33/79 (41%)
Frame = -3
Query: 526 LPEHAHPDQLDTYRQSHATDRTAVNNFPDDNNAYNKYENNPLGYQEPNHGRQDGYGQIHY 347
LP H + +++T +H T R V + K+ NN YQ P Q GY Q +
Sbjct: 94 LPIHVNKSKINT---NHKTKRQGVYSALAQPPPAAKWGNNRPSYQNPTPSYQQGYPQ-QF 149
Query: 346 PEPQYGVTQPPVRQAENPN 290
P PQ P Q N N
Sbjct: 150 PAPQRQQQYQPQPQTTNYN 168
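The Frame value and the Query coordinates of an alignment like the one above are linked: in a translated search the Query numbers are nucleotide positions, and a negative frame means the numbers run downward along the reverse strand. The sketch below assumes the usual BLASTX convention that frame -1 starts at the last base of the query; the function names are hypothetical and exist only for this example.

```python
def blastx_frame(query_len, start, end):
    """Reading frame implied by 1-based nucleotide coordinates of an HSP.

    start is the coordinate printed first on the Query line, end the last.
    Assumes frame -1 begins at the final base of the query.
    """
    if start <= end:
        # Forward strand: frames +1, +2, +3 by offset from the first base.
        return (start - 1) % 3 + 1
    # Reverse strand: frames -1, -2, -3 by offset from the last base.
    return -((query_len - start) % 3 + 1)


def aligned_codons(start, end):
    """Number of query codons covered by the nucleotide span start..end."""
    span = abs(start - end) + 1
    if span % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return span // 3


if __name__ == "__main__":
    # First alignment in this report: 633-letter query, Query 526 down to 290.
    print(blastx_frame(633, 526, 290))   # -3
    print(aligned_codons(526, 290))      # 79
```

For this alignment the sketch gives frame -3 and 79 codons, matching the reported Frame line and the denominator on the Identities line.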
>pir||T02504 hypothetical protein T19C21.10 - Arabidopsis thaliana
gb|AAC28763.1| (AC004683) unknown protein [Arabidopsis thaliana]
Length = 671
Score = 34.8 bits (78), Expect = 0.87
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 5/54 (9%)
Frame = -3
Query: 445 PDDNNAYNKYENNPL--GYQEPNHGRQD-GYGQIHYPEPQ--YGVTQPPVRQAENPNYR 284
P Y++++ + GY +P H +Q GY Q+ P+PQ Y +QP + P+ R
Sbjct: 491 PQAQQGYSQHQQHQQQQGYSQPQHSQQQQGYSQLQQPQPQQGYSQSQPQAQVQMQPSTR 549
>ref|NP_009572.1| Ybr016wp
sp|P38216|YBM6_YEAST HYPOTHETICAL 14.6 KD PROTEIN IN TTP1-KAP104 INTERGENIC REGION
pir||S45871 probable membrane protein YBR016w - yeast (Saccharomyces
cerevisiae)
emb|CAA84958.1| (Z35885) ORF YBR016w [Saccharomyces cerevisiae]
Length = 128
Score = 34.0 bits (76), Expect = 1.5
Identities = 28/107 (26%), Positives = 41/107 (38%)
Frame = -3
Query: 619 DDYTSGYGAKGGFFSRIFRRRKNTAAGNDDALPEHAHPDQLDTYRQSHATDRTAVNNFPD 440
+DY G + +SR ++A N + P Q Y Q N
Sbjct: 4 NDYYGGTAGEKSQYSRPSNPPPSSAHQNKTQERGYP-PQQQQQYYQQQQQHPGYYNQQGY 62
Query: 439 DNNAYNKYENNPLGYQEPNHGRQDGYGQIHYPEPQYGVTQPPVRQAE 299
+ YN+ N GY + + +Q GY Q + +P Y QPP R E
Sbjct: 63 NQQGYNQQGYNQQGYNQQGYNQQ-GYNQQGHQQPVYVQQQPPQRGNE 108
>dbj|BAB08888.1| (AB012243) gene_id:MIJ24.6~ref|NP_013897.1~similar to unknown
protein [Arabidopsis thaliana]
Length = 381
Score = 33.2 bits (74), Expect = 2.6
Identities = 26/114 (22%), Positives = 45/114 (38%), Gaps = 6/114 (5%)
Frame = -3
Query: 628 RHEDDYTSGYGAKGGFFSRIFRRRKNTAAGNDDALPEHAHPDQLDTYRQSHATDRTAVNN 449
R E SGYG + S RK + +++ + P + Q + +
Sbjct: 173 RPESGLGSGYGGR----SESEYERKPSYGRSEEQEEGYRKPSYGRSEEQEEGYRKPSYGR 228
Query: 448 FPDDNNAYNK------YENNPLGYQEPNHGRQDGYGQIHYPEPQYGVTQPPVRQAENPNY 287
+ Y K E GY++P++GR + + Y +P YG + V P+Y
Sbjct: 229 SEEQEEGYRKPSYGRSEEEQEEGYRKPSYGRSEEQEEGSYRKPSYGRSDDQVESYIKPSY 288
>gb|AAF89556.1|AF165312_1 (AF165312) pre-T-cell receptor alpha chain [Homo sapiens]
Length = 256
Score = 32.8 bits (73), Expect = 3.4
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Frame = -2
Query: 386 KPRTPRRLWPDS----LPGAPVWRHSTTCSPG*EPKLPLR*WCLRSRLGA 249
+P+ R W D+ PG+PVW + S P P + WC RSRL A
Sbjct: 187 RPQPRDRRWGDTPPGRKPGSPVWGEGSYLSS--YPTCPAQAWCSRSRLRA 234
>emb|CAB66922.1| (AL132965) putative protein [Arabidopsis thaliana]
Length = 651
Score = 32.8 bits (73), Expect = 3.4
Identities = 15/38 (39%), Positives = 16/38 (41%)
Frame = -3
Query: 406 PLGYQEPNHGRQDGYGQIHYPEPQYGVTQPPVRQAENP 293
P GY P G GY YP PQY PP + P
Sbjct: 574 PAGYPPPQQGYGQGYPAQGYPPPQYPQGHPPQYPYQGP 611
>gb|AAB18373.1| (U38996) pre TCR alpha [Homo sapiens]
Length = 269
Score = 32.8 bits (73), Expect = 3.4
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Frame = -2
Query: 386 KPRTPRRLWPDS----LPGAPVWRHSTTCSPG*EPKLPLR*WCLRSRLGA 249
+P+ R W D+ PG+PVW + S P P + WC RSRL A
Sbjct: 200 RPQPRDRRWGDTPPGRKPGSPVWGEGSYLSS--YPTCPAQAWCSRSRLRA 247
>gb|AAB06194.1| (U36759) pre-T cell receptor alpha-type chain precursor [Homo
sapiens]
Length = 281
Score = 32.8 bits (73), Expect = 3.4
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Frame = -2
Query: 386 KPRTPRRLWPDS----LPGAPVWRHSTTCSPG*EPKLPLR*WCLRSRLGA 249
+P+ R W D+ PG+PVW + S P P + WC RSRL A
Sbjct: 212 RPQPRDRRWGDTPPGRKPGSPVWGEGSYLSS--YPTCPAQAWCSRSRLRA 259
>gb|AAC83346.1| (AF084941) pre-T cell receptor alpha chain 1 precursor [Homo
sapiens]
Length = 281
Score = 32.8 bits (73), Expect = 3.4
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Frame = -2
Query: 386 KPRTPRRLWPDS----LPGAPVWRHSTTCSPG*EPKLPLR*WCLRSRLGA 249
+P+ R W D+ PG+PVW + S P P + WC RSRL A
Sbjct: 212 RPQPRDRRWGDTPPGRKPGSPVWGEGSYLSS--YPTCPAQAWCSRSRLRA 259
>pir||T10265 arabinogalactan-protein AGP2 - Persian tobacco
gb|AAB35284.1| (S79359) arabinogalactan-protein, AGP [Nicotiana alata,
cell-suspension culture filtrate, Peptide, 461 aa]
Length = 461
Score = 32.8 bits (73), Expect = 3.4
Identities = 25/116 (21%), Positives = 45/116 (38%)
Frame = -3
Query: 625 HEDDYTSGYGAKGGFFSRIFRRRKNTAAGNDDALPEHAHPDQLDTYRQSHATDRTAVNNF 446
+ + Y+ Y FS + N N+ + + + + + +++ NN
Sbjct: 282 NNNGYSQNYMNNNNGFSESYNNNNNNNNNNNVFSENYNNNNNNNVFSENY-------NNN 334
Query: 445 PDDNNAYNKYENNPLGYQEPNHGRQDGYGQIHYPEPQYGVTQPPVRQAENPNYRYD 278
++N Y Y NN GY E N+ + Y + G++ R EN Y YD
Sbjct: 335 NNNNAFYENYNNNNNGYSE-NYNQASSYNNNDNTVERQGLSD--TRFLENGKYYYD 387
>pir||T29074 hypothetical protein SC1C2.25c - Streptomyces coelicolor
emb|CAA19992.1| (AL031124) hypothetical protein SC1C2.25c [Streptomyces coelicolor
A3(2)]
Length = 1329
Score = 32.5 bits (72), Expect = 4.4
Identities = 26/84 (30%), Positives = 35/84 (40%), Gaps = 2/84 (2%)
Frame = -3
Query: 529 ALPEHAHPDQLDTYRQSHATDRTAVNNFPDDN--NAYNKYENNPLGYQEPNHGRQDGYGQ 356
A E A PD + + T P N +A N+YE+ Y + + QD YGQ
Sbjct: 973 ATAEFARPDFDAPAPRRDESQDTGQYAQPGQNQYDARNEYEDQ---YGQQSQYGQDQYGQ 1029
Query: 355 IHYPEPQYGVTQPPVRQAENPNYRYD 278
Y QYG P Q + P + D
Sbjct: 1030 DQYAPGQYGQAGPGQDQYDRPRFGQD 1055
>emb|CAB95018.1| (AJ252020) argininosuccinase and n-acetylglutamate synthase
[Moritella sp. 2674]
Length = 470
Score = 31.7 bits (70), Expect = 7.6
Identities = 13/24 (54%), Positives = 19/24 (79%), Gaps = 1/24 (4%)
Frame = -3
Query: 214 NQHRLIPGWM-LQKEQPVTYEHSCL 143
+QH ++PG+ LQ+ QPVT+ H CL
Sbjct: 146 HQHTVLPGYTHLQRAQPVTFSHWCL 170
>emb|CAB95024.1| (AJ252021) argininosuccinase and n-acetylglutamate synthase
[Moritella sp. 2693]
Length = 629
Score = 31.7 bits (70), Expect = 7.6
Identities = 13/24 (54%), Positives = 19/24 (79%), Gaps = 1/24 (4%)
Frame = -3
Query: 214 NQHRLIPGWM-LQKEQPVTYEHSCL 143
+QH ++PG+ LQ+ QPVT+ H CL
Sbjct: 146 HQHTVLPGYTHLQRAQPVTFSHWCL 170
>ref|NP_033306.1| synovial sarcoma, translocated to X chromosome
sp|Q62280|SSXT_MOUSE SSXT PROTEIN (SYT PROTEIN)
emb|CAA63733.1| (X93357) homolog of human SYT [Mus musculus]
Length = 418
Score = 31.3 bits (69), Expect = 9.9
Identities = 23/86 (26%), Positives = 32/86 (36%)
Frame = -3
Query: 523 PEHAHPDQLDTYRQSHATDRTAVNNFPDDNNAYNKYENNPLGYQEPNHGRQDGYGQIHYP 344
P+ +P Q Y +P +Y + P G Q PN+ + G Y
Sbjct: 341 PQQGYPPQQQQY--------PGQQGYPGQQQSYGPSQGGP-GPQYPNYPQGQGQQYGGYR 391
Query: 343 EPQYGVTQPPVRQAENPNYRYDDGVY 266
Q G QPP ++ Y YD G Y
Sbjct: 392 PTQPGPPQPPQQRP----YGYDQGQY 413
>ref|NP_033377.1| telomerase associated protein 1
pir||T30987 telomerase-associated protein 1 - mouse
gb|AAC53043.1| (U86137) telomerase protein-1 [Mus musculus]
Length = 2629
Score = 31.3 bits (69), Expect = 9.9
Identities = 19/63 (30%), Positives = 28/63 (44%)
Frame = -1
Query: 390 SQTTDAKTAMARFTTRSPSMASLNHLFARLRTQTTATMMVSTIAPRCKLQSWEVHKQKGT 211
S TT K M R + ++ AS HL R Q A M + + + K + +HK +
Sbjct: 595 SNTTLMKRIMIRNSKKNRRPASRKHLCTLTRRQLRAAMTIPVMYEQLKREKLRLHKARQW 654
Query: 210 NTD 202
N D
Sbjct: 655 NCD 657
Database: nr
Posted date: Sep 29, 2000 9:53 PM
Number of letters in database: 177,575,912
Number of sequences in database: 565,281
Lambda K H
0.318 0.135 0.00
Gapped
Lambda K H
0.270 0.0470 4.94e-324
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 187812843
Number of Sequences: 565281
Number of extensions: 4188074
Number of successful extensions: 12258
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 12248
Number of HSP's gapped (non-prelim): 17
length of query: 211
length of database: 177,575,912
effective HSP length: 52
effective length of query: 158
effective length of database: 148,181,300
effective search space: 23412645400
effective search space used: 23412645400
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 69 (31.3 bits)
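The figures in the statistics block above can be cross-checked with a few lines of arithmetic. The sketch below uses the standard Karlin-Altschul conversion from raw score to bit score, S' = (lambda * S - ln K) / ln 2, and the usual length correction for the database. The function names are hypothetical, and the effective query length (158) is taken from the report as printed because the report does not show how it was derived.

```python
import math

# Parameters copied from the report footer.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.318, 0.135
LAMBDA_GAPPED, K_GAPPED = 0.270, 0.0470


def bit_score(raw_score, lam, k):
    """Convert a raw score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff value to bits (a score difference, so no K term)."""
    return lam * raw_dropoff / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length after subtracting the effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


if __name__ == "__main__":
    eff_db = effective_db_length(177_575_912, 565_281, 52)
    print(eff_db)          # effective length of database
    print(158 * eff_db)    # effective search space, using the reported 158

    print(round(bit_score(78, LAMBDA_GAPPED, K_GAPPED), 1))      # top hit, raw 78
    print(round(bit_score(69, LAMBDA_GAPPED, K_GAPPED), 1))      # S2
    print(round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))  # S1
    print(round(dropoff_bits(16, LAMBDA_UNGAPPED), 1))           # X1
    print(round(dropoff_bits(38, LAMBDA_GAPPED), 1))             # X2
    print(round(dropoff_bits(64, LAMBDA_GAPPED), 1))             # X3
```

Run as a script, this reproduces the footer values: 148,181,300 and 23,412,645,400 for the effective lengths, 34.8 bits for the top hit, and the bit equivalents printed beside S1, S2, X1, X2, and X3. Note that S1 and X1 use the ungapped parameters, while S2, X2, and X3 use the gapped ones.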