The query sequence for this search has been filtered. Filtering
eliminates low complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the blast reports, leaving regions whose blast statistics
reflect the specificity of their pairwise alignment.

BLASTX 2.1.1 [Aug-8-2000]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Contig75.seq Contig75
         (550 letters)

Database: nr
           565,281 sequences; 177,575,912 total letters


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAA86524.1|  (AB033036) KIAA1210 protein [Homo sapiens]        33  2.8
gb|AAF50191.1|  (AE003550) CG14165 gene product [Drosophila ...    32  3.6
ref|NP_057417.1|  RNA binding protein >gi|6649242|gb|AAF2143...    32  4.8
gb|AAD38646.1|AF145671_1  (AF145671) BcDNA.GH11973 [Drosophi...    32  6.2
sp|P45525|YCJF_ECOLI  HYPOTHETICAL 39.4 KDA PROTEIN IN PSPE-...    32  6.2
emb|CAB82345.1|  (AL162004) hypothetical protein [Homo sapiens]    32  6.2
gb|AAF45932.1|  (AE003430) CG12691 gene product [Drosophila ...    32  6.2
ref|NP_005145.1|  ubiquitin specific protease 8 >gi|731046|s...    32  6.2
gb|AAF51718.1|  (AE003594) BcDNA:GH11973 gene product [alt 1...    32  6.2
ref|NP_014639.1|  DNA binding protein involved in transcript...    31  8.2
emb|CAB61059.1|  (AL132948) predicted using Genefinder; prel...    31  8.2
sp|P34099|KAPC_DICDI  CAMP-DEPENDENT PROTEIN KINASE CATALYTI...    31  8.2
gb|AAB20716.1|  serine/threonine protein kinase [Dictyosteli...    31  8.2
gb|AAF49857.1|  (AE003539) CG11259 gene product [Drosophila ...    31  8.2
gb|AAA34839.1|  (M36822) SIN3 open reading frame [Saccharomy...    31  8.2
sp|Q03372|HMSH_DROME  MUSCLE SEGMENTATION HOMEOBOX >gi|10986...    31  8.2

>dbj|BAA86524.1| (AB033036) KIAA1210 protein [Homo sapiens]
          Length = 1115

 Score = 32.8 bits (73), Expect = 2.8
 Identities = 14/40 (35%), Positives = 23/40 (57%)
 Frame = +1

Query: 124 LHQNSVSNPRNLATKHRTSLSPTPTKSLNLPLQNPRFSQS 243
           + Q   S+P+++A +   S+ P P K L  PL NP+  Q+
Sbjct: 365 VEQEVSSSPKSMAVEESISMKPLPPKLLCQPLMNPKVQQN 404
>gb|AAF50191.1| (AE003550) CG14165 gene product [Drosophila melanogaster]
          Length = 660

 Score = 32.5 bits (72), Expect = 3.6
 Identities = 26/109 (23%), Positives = 48/109 (43%), Gaps = 3/109 (2%)
 Frame = +1

Query: 166 KHRTSLSPTPTKSLNLPLQNPRFSQSETHQEANP---QSHTSINPRDISPVARIP*LDIP 336
           ++  S+ P P + L L    PR  Q   +Q   P     H S+NP+      + P   IP
Sbjct: 43  RYAVSIQPQPQQQLQLQQPRPRQYQGP-YQLPLPLPAPQHRSVNPQQQQQQHQQPYQVIP 101

Query: 337 VNPRSQLAHGEQVSHVRHEKHKQHNPPDSQVWRRPQHERLIEREEEVHISEL 492
                ++   E  +   HE+ +Q      Q  ++ QH+   +++++ H  +L
Sbjct: 102 EEQFLKILEEELQARAYHEQLRQQQQHQQQQQQQQQHQ---QQQQQQHHKQL 150
>ref|NP_057417.1| RNA binding protein
 gb|AAF21439.1|AF201422_1 (AF201422) splicing coactivator subunit SRm300 [Homo sapiens]
          Length = 2296

 Score = 32.1 bits (71), Expect = 4.8
 Identities = 24/99 (24%), Positives = 37/99 (37%), Gaps = 7/99 (7%)
 Frame = +3

Query: 114  SPTLAPKFGIQPEKPRYQTP-------NFAIPNTDEXXXXXXXXXXXXXXRNPSRS*PSE 272
            SPT APK  ++  +P   TP       + +  ++                 + S S  S 
Sbjct: 2162 SPTPAPKEAVREGRPPEPTPAKRKRRSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 2221

Query: 273  SYQHKPKGYFPSRSHSMT*HPSKSPFPARSWRTSVPRPPREAQTAQ 410
            S        FP ++       +  P  A  WR  VP+PP   +  Q
Sbjct: 2222 SSSSSSSFPFPCKAWPSGLAQTCKPQEATPWRAEVPQPPEANRLPQ 2267
>gb|AAD38646.1|AF145671_1 (AF145671) BcDNA.GH11973 [Drosophila melanogaster]
 gb|AAF51717.1| (AE003594) BcDNA:GH11973 gene product [alt 2] [Drosophila
           melanogaster]
          Length = 800

 Score = 31.7 bits (70), Expect = 6.2
 Identities = 34/136 (25%), Positives = 56/136 (41%), Gaps = 14/136 (10%)
 Frame = +1

Query: 109 YNLQLLHQNSVSNPRNLATKHRTSLSPTPTKSLNLPLQNPRFS---------QSETHQEA 261
           Y+ +L  Q      R L  + R  L+    +     L+  R           Q   H E 
Sbjct: 323 YDRKLQRQRERERRRQLRRQERQKLAREQKRERQRRLKEERQRLQREEQQRRQRLHHDEP 382

Query: 262 NPQSHTSINPRDISPVARIP*LDIPVN---PRSQLAHGEQVSHVRHEKHKQHNPPDSQVW 432
            PQ +  + P+ I  + +    D+  N      Q    EQ   +R E+ KQ +  + +  
Sbjct: 383 KPQVNPQVKPQVIDELRQRSGEDLDKNQTDEHEQKLRNEQEKKLREEQQKQRDEQEQK-- 440

Query: 433 RRPQHERLIEREEE--VHISELRHGNRCSL 516
            R + +RL + EE+   H  EL+      L
Sbjct: 441 DREEQDRLKQEEEQARTHQKELKENQEQQL 470
>sp|P45525|YCJF_ECOLI HYPOTHETICAL 39.4 KDA PROTEIN IN PSPE-TYRR INTERGENIC REGION
 pir||E64881 membrane protein ycjF - Escherichia coli
 dbj|BAA14903.1| (D90770) ORF_ID:o260#4; similar to [SwissProt Accession Number
           P43931] [Escherichia coli]
 dbj|BAA14914.1| (D90771) ORF_ID:o260#4~similar to [SwissProt Accession Number
           P43931] [Escherichia coli]
 gb|AAC74404.1| (AE000230) orf, hypothetical protein [Escherichia coli]
          Length = 353

 Score = 31.7 bits (70), Expect = 6.2
 Identities = 14/46 (30%), Positives = 23/46 (49%)
 Frame = -2

Query: 372 LFSMSELGTGIYWDVKSWNASDWGNIPWVYAGMTLRVSFLMGFGLG 235
           LF  S +G G+ W + +W   DW  +    AG     + ++G G+G
Sbjct: 77  LFGASVVGQGVQWTMNAWQTQDWVALGGCAAG-----ALIIGAGVG 117
>emb|CAB82345.1| (AL162004) hypothetical protein [Homo sapiens]
          Length = 1299

 Score = 31.7 bits (70), Expect = 6.2
 Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 1/99 (1%)
 Frame = +1

Query: 136 SVSNPRNLATKHRTSLSPTPTKSLNLPLQNPRFSQSETHQEA-NPQSHTSINPRDISPVA 312
           + S P   A+    + +PTP  + N     P  +Q++TH+ A NP   TS + +   P  
Sbjct: 446 TASVPLAPASASAPAPAPTPVSAPNPAPPAPAQTQAQTHKPAQNPLQTTSQSSKQPPPSI 505

Query: 313 RIP*LDIPVNPRSQLAHGEQVSHVRHEKHKQHNPPDSQVW 432
           R+P    P N    +A G+ +     +  + H    +++W
Sbjct: 506 RLPSAQTP-NGTDYVASGKSI-----QTPQSHGTLTAELW 539
>gb|AAF45932.1| (AE003430) CG12691 gene product [Drosophila melanogaster]
          Length = 150

 Score = 31.7 bits (70), Expect = 6.2
 Identities = 12/30 (40%), Positives = 18/30 (60%)
 Frame = +1

Query: 376 SHVRHEKHKQHNPPDSQVWRRPQHERLIER 465
           S +  ++H+ HN P S +   P+H RL ER
Sbjct: 82  SRLDPDEHRDHNEPSSSITAAPEHSRLQER 111
>ref|NP_005145.1| ubiquitin specific protease 8
 sp|P40818|UBP8_HUMAN UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 8 (UBIQUITIN THIOLESTERASE 8)
           (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 8)
           (DEUBIQUITINATING ENZYME 8) (KIAA0055)
 dbj|BAA06225.1| (D29956) This gene is similar to tre oncogene(X63547). [Homo
           sapiens]
          Length = 1118

 Score = 31.7 bits (70), Expect = 6.2
 Identities = 27/103 (26%), Positives = 50/103 (48%)
 Frame = +1

Query: 166 KHRTSLSPTPTKSLNLPLQNPRFSQSETHQEANPQSHTSINPRDISPVARIP*LDIPVNP 345
           K+   +  T   ++ LP ++   S+S  H++ +PQS   I  R   PV   P L +    
Sbjct: 402 KNVPQIDRTKKPAVKLPEEHRIKSESTNHEQQSPQSGKVIPDRSTKPVVFSPTLMLTDEE 461

Query: 346 RSQLAHGEQVSHVRHEKHKQHNPPDSQVWRRPQHERLIEREEE 474
           ++++ H E  + +  EK+KQ      +  +  Q E+L + E+E
Sbjct: 462 KARI-HAE--TALLMEKNKQEKELRER-QQEEQKEKLRKEEQE 500
>gb|AAF51718.1| (AE003594) BcDNA:GH11973 gene product [alt 1] [Drosophila
           melanogaster]
          Length = 658

 Score = 31.7 bits (70), Expect = 6.2
 Identities = 34/136 (25%), Positives = 56/136 (41%), Gaps = 14/136 (10%)
 Frame = +1

Query: 109 YNLQLLHQNSVSNPRNLATKHRTSLSPTPTKSLNLPLQNPRFS---------QSETHQEA 261
           Y+ +L  Q      R L  + R  L+    +     L+  R           Q   H E 
Sbjct: 181 YDRKLQRQRERERRRQLRRQERQKLAREQKRERQRRLKEERQRLQREEQQRRQRLHHDEP 240

Query: 262 NPQSHTSINPRDISPVARIP*LDIPVN---PRSQLAHGEQVSHVRHEKHKQHNPPDSQVW 432
            PQ +  + P+ I  + +    D+  N      Q    EQ   +R E+ KQ +  + +  
Sbjct: 241 KPQVNPQVKPQVIDELRQRSGEDLDKNQTDEHEQKLRNEQEKKLREEQQKQRDEQEQK-- 298

Query: 433 RRPQHERLIEREEE--VHISELRHGNRCSL 516
            R + +RL + EE+   H  EL+      L
Sbjct: 299 DREEQDRLKQEEEQARTHQKELKENQEQQL 328
>ref|NP_014639.1| DNA binding protein involved in transcriptional regulation; Sin3p
 sp|P22579|SIN3_YEAST PAIRED AMPHIPATHIC HELIX PROTEIN
 pir||RGBYS3 regulatory protein SIN3 - yeast (Saccharomyces cerevisiae)
 emb|CAA99003.1| (Z74746) ORF YOL004w [Saccharomyces cerevisiae]
          Length = 1536

 Score = 31.3 bits (69), Expect = 8.2
 Identities = 19/50 (38%), Positives = 27/50 (54%)
 Frame = +1

Query: 208 NLPLQNPRFSQSETHQEANPQSHTSINPRDISPVARIP*LDIPVNPRSQL 357
           N PL + R S +E +  ++ Q H   +P+ ISP+A     DIPV P   L
Sbjct: 593 NPPLSDLRTSLTEQYAPSSIQ-HQQQHPQSISPIANTQYGDIPVRPEIDL 641
>emb|CAB61059.1| (AL132948) predicted using Genefinder; preliminary prediction
           [Caenorhabditis elegans]
          Length = 728

 Score = 31.3 bits (69), Expect = 8.2
 Identities = 16/41 (39%), Positives = 21/41 (51%)
 Frame = +1

Query: 355 LAHGEQVSHVRHEKHKQHNPPDSQVWRRPQHERLIEREEEV 477
           LA  EQ   V H  H  H   + +VW    HE++IE EE +
Sbjct: 684 LARMEQRFGVEHHHHIVHQEQEEEVW---DHEQIIEEEEVI 721
>sp|P34099|KAPC_DICDI CAMP-DEPENDENT PROTEIN KINASE CATALYTIC SUBUNIT
 pir||JQ1150 protein kinase (EC 2.7.1.37) cAMP-dependent, catalytic chain -
           slime mold (Dictyostelium discoideum)
          Length = 648

 Score = 31.3 bits (69), Expect = 8.2
 Identities = 23/87 (26%), Positives = 35/87 (39%)
 Frame = +1

Query: 115 LQLLHQNSVSNPRNLATKHRTSLSPTPTKSLNLPLQNPRFSQSETHQEANPQSHTSINPR 294
           L L H +S   P N+        S  PT+    P+  P   Q ++ Q+   Q    I P 
Sbjct: 257 LSLQHAHSSYTPSNVLHSPTHFQSSLPTRLDTNPITTPIRQQQQSQQQLQQQQLQQIPPP 316

Query: 295 DISPVARIP*LDIPVNPRSQLAHGEQV 375
            ++     P    PVN R +L   +Q+
Sbjct: 317 TVNSFFLPP----PVNARERLKEFKQI 339
>gb|AAB20716.1| serine/threonine protein kinase [Dictyostelium, Peptide, 648 aa]
          Length = 648

 Score = 31.3 bits (69), Expect = 8.2
 Identities = 23/87 (26%), Positives = 35/87 (39%)
 Frame = +1

Query: 115 LQLLHQNSVSNPRNLATKHRTSLSPTPTKSLNLPLQNPRFSQSETHQEANPQSHTSINPR 294
           L L H +S   P N+        S  PT+    P+  P   Q ++ Q+   Q    I P 
Sbjct: 257 LSLQHAHSSYTPSNVLHSPTHFQSSLPTRLDTNPITTPIRQQQQSQQQLQQQQLQQIPPP 316

Query: 295 DISPVARIP*LDIPVNPRSQLAHGEQV 375
            ++     P    PVN R +L   +Q+
Sbjct: 317 TVNSFFLPP----PVNARERLKEFKQI 339
>gb|AAF49857.1| (AE003539) CG11259 gene product [Drosophila melanogaster]
          Length = 996

 Score = 31.3 bits (69), Expect = 8.2
 Identities = 16/48 (33%), Positives = 24/48 (49%)
 Frame = +1

Query: 133 NSVSNPRNLATKHRTSLSPTPTKSLNLPLQNPRFSQSETHQEANPQSH 276
           +S+SNP       +   SPT +    +P+  PR S S+T   A P +H
Sbjct: 500 SSISNP-----SEKPHSSPTLSHGKKMPMPTPRISISKTQTPAKPMTH 542
>gb|AAA34839.1| (M36822) SIN3 open reading frame [Saccharomyces cerevisiae]
          Length = 1538

 Score = 31.3 bits (69), Expect = 8.2
 Identities = 19/50 (38%), Positives = 27/50 (54%)
 Frame = +1

Query: 208 NLPLQNPRFSQSETHQEANPQSHTSINPRDISPVARIP*LDIPVNPRSQL 357
           N PL + R S +E +  ++ Q H   +P+ ISP+A     DIPV P   L
Sbjct: 595 NPPLSDLRTSLTEQYAPSSIQ-HQQQHPQSISPIANTQYGDIPVRPEIDL 643
>sp|Q03372|HMSH_DROME MUSCLE SEGMENTATION HOMEOBOX
 gb|AAC47329.1| (U33319) MSH [Drosophila melanogaster]
 gb|AAB62975.1| (AF009038) muscle segment homeobox [Drosophila melanogaster]
          Length = 515

 Score = 31.3 bits (69), Expect = 8.2
 Identities = 26/85 (30%), Positives = 42/85 (48%), Gaps = 8/85 (9%)
 Frame = +1

Query: 172 RTSLSPTPTKSLNLPLQNPRFSQSETHQEANPQSH------TSINPRDISP--VARIP*L 327
           +T  SPT   S N P  N   + S ++  +N  S+      TS N  ++SP   A +  L
Sbjct: 16  QTMTSPTVPPSTNTPAGNLIITSSSSNSGSNSGSNMSSGNMTSSNLTNLSPSHPAGLNAL 75

Query: 328 DIPVNPRSQLAHGEQVSHVRHEKHKQHNPPDSQ 426
             P +P + L   +Q    +H++H+Q      Q
Sbjct: 76  ASPTSPSALLLAHQQHLLQQHQQHQQQQQQQQQ 108
Database: nr Posted date: Sep 29, 2000 9:53 PM Number of letters in database: 177,575,912 Number of sequences in database: 565,281 Lambda K H 0.318 0.135 0.00 Gapped Lambda K H 0.270 0.0470 4.94e-324 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 187937036 Number of Sequences: 565281 Number of extensions: 4085101 Number of successful extensions: 17232 Number of sequences better than 10.0: 32 Number of HSP's better than 10.0 without gapping: 3 Number of HSP's successfully gapped in prelim test: 16 Number of HSP's that attempted gapping in prelim test: 17214 Number of HSP's gapped (non-prelim): 32 length of query: 183 length of database: 177,575,912 effective HSP length: 52 effective length of query: 130 effective length of database: 148,181,300 effective search space: 19263569000 effective search space used: 19263569000 frameshift window, decay const: 50, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 41 (21.7 bits) S2: 68 (30.9 bits)