The query sequence for this search has been filtered. Filtering
eliminates low complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the blast reports, leaving regions whose blast statistics
reflect the specificity of their pairwise alignment.

BLASTX 2.2.1 [Apr-13-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= 1606_E09_I18ZS5.seq 1606_E09_I18ZS5 0 0 0 1 444
         (444 letters)

Database: nr
           705,144 sequences; 222,175,239 total letters


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAC27452.1|  (AJ297887) DigA protein [Aspergillus nidulans]   176  7e-44
dbj|BAA95999.1|  (AB040908) KIAA1475 protein [Homo sapiens]        97  5e-20
gb|AAG34679.1|AF308802_1  (AF308802) vacuolar protein sortin...    97  5e-20
gb|AAF79641.1|AC025416_15  (AC025416) F5O11.22 [Arabidopsis ...    74  4e-13
gb|AAF88074.1|AC025417_2  (AC025417) T12C24.2 [Arabidopsis t...    74  4e-13
ref|NP_013249.1|  vacuolar membrane protein; Pep3p [Saccharo...    68  3e-11
gb|AAF45652.1|  (AE003421) dor gene product [Drosophila mela...    65  2e-10
pir||S54252  deep orange protein - fruit fly (Drosophila mel...    65  2e-10
sp|Q24314|DOR_DROME  DEEP ORANGE PROTEIN >gi|2832850|emb|CAA...    64  5e-10
pir||T34481  hypothetical protein W06B4.3 - Caenorhabditis e...    60  5e-09
gb|AAC46819.2|  (U23522) similar to vacuolar membrane protei...    60  5e-09
pir||T41607  probable vacuolar membrane protein - fission ye...    57  3e-08
emb|CAB91628.1|  (AJ289080) Vps18 protein [Candida albicans]       45  2e-04
gb|AAK40566.1|  (AE006659) DNA-directed RNA polymerase, subu...    35  0.18
emb|CAB95611.1|  (AL359782) hypothetical protein, CHR1.386 [...    34  0.32
gb|AAF54693.1|  (AE003692) CG6923 gene product [Drosophila m...    33  0.54
dbj|BAB08428.1|  (AB017067) gb|AAF32311.1~gene_id:MJC20.5~si...    33  0.70
ref|NP_055463.1|  KIAA0675 gene product [Homo sapiens] >gi|7...    33  0.92
ref|NP_066681.1|  similar to ardC gene in pSa(IncW plasmid) ...    33  0.92
dbj|BAB22082.1|  (AK002414) putative [Mus musculus]                32  1.2
pir||E72520  hypothetical protein APE2138 - Aeropyrum pernix...    32  1.2
emb|CAA11444.1|  (AJ223577) intermediate filament protein C1...    32  1.6
sp|Q10322|DMA1_SCHPO  DMA1 PROTEIN >gi|7490447|pir||T37862 d...    32  2.0
ref|NP_111000.1|  Uncharacterized conserved protein [Thermop...    32  2.0
gb|AAF58107.2|  (AE003809) CG8242 gene product [Drosophila m...    31  2.7
ref|NP_106707.1|  antirestriction protein [Mesorhizobium lot...    31  2.7
gb|AAK38349.1|  (AY030283) PHD zinc finger transcription fac...    31  3.5
gb|AAH01657.1|AAH01657  (BC001657) Unknown (protein for IMAG...    31  3.5
gb|AAD22654.1|AC007138_18  (AC007138) putative CHP-rich zinc...    30  4.5
pir||T00600  hypothetical protein T8K22.6 - Arabidopsis thal...    30  5.9
pir||T00603  hypothetical protein T8K22.9 - Arabidopsis thal...    30  5.9
dbj|BAB22071.1|  (AK002399) putative [Mus musculus]                30  7.8
gb|AAF81249.1|AF267753_1  (AF267753) putative potassium chan...    30  7.8
pir||T03835  vacA protein - slime mold (Dictyostelium discoi...    30  7.8
gb|AAB71964.1|  (AC002292) Hypothetical protein [Arabidopsis...    30  7.8
dbj|BAB27445.1|  (AK011171) putative [Mus musculus]                30  7.8
pir||A33926  DNA-directed RNA polymerase (EC 2.7.7.6) chain ...    30  7.8
sp|P15350|RPA1_HALHA  DNA-DIRECTED RNA POLYMERASE SUBUNIT A'...    30  7.8

>emb|CAC27452.1| (AJ297887) DigA protein [Aspergillus nidulans]
          Length = 963

 Score =  176 bits (445), Expect = 7e-44
 Identities = 85/126 (67%), Positives = 102/126 (80%)
 Frame = +2

Query: 17   TATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAG 196
            TA  I+ +IAALD RYAIVEPGEKC+TC LP+LSRQFFVFPCQH+FHSDCLGR+VLE AG
Sbjct: 839  TARQIRSEIAALDTRYAIVEPGEKCWTCSLPVLSRQFFVFPCQHAFHSDCLGREVLEGAG 898

Query: 197  VGKSSRIRELQMQIQKGLVSGTQRETVVAELDALVASSCILCSDLAIKRIDEPFITHNDY 376
             GK   IR+LQ Q+ +G ++ +QRE VV ELD L+A +CILC D AIK+ID+PFIT  D 
Sbjct: 899  -GKKKYIRDLQSQLNEGALTSSQREEVVKELDGLIAEACILCGDHAIKQIDKPFITATDN 957

Query: 377  VNEWIL 394
            V+EW L
Sbjct: 958  VDEWCL 963
>dbj|BAA95999.1| (AB040908) KIAA1475 protein [Homo sapiens]
          Length = 986

 Score = 96.7 bits (239), Expect = 5e-20
 Identities = 57/133 (42%), Positives = 77/133 (57%), Gaps = 17/133 (12%)
 Frame = +2

Query: 17   TATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAG 196
            +A  I+ D+  L  RY  VEP +KC TC  PLL+R F++F C H FH+DCL + V     
Sbjct: 842  SAQRIRRDLQELRGRYGTVEPQDKCATCDFPLLNRPFYLFLCGHMFHADCLLQAVRPGLP 901

Query: 197  VGKSSRIRELQMQI------QKG-----------LVSGTQRETVVAELDALVASSCILCS 325
              K +R+ ELQ ++       KG             +G  RE + A+LD LVA+ C+ C 
Sbjct: 902  AYKQARLEELQRKLGAAPPPAKGSARAKEAEGGAATAGPSREQLKADLDELVAAECVYCG 961

Query: 326  DLAIKRIDEPFITHNDYVNEWIL*EEDRSW 415
            +L I+ ID PFI    Y       EE  SW
Sbjct: 962  ELMIRSIDRPFIDPQRYE------EEQLSW 985
>gb|AAG34679.1|AF308802_1 (AF308802) vacuolar protein sorting protein 18 [Homo sapiens]
          Length = 973

 Score = 96.7 bits (239), Expect = 5e-20
 Identities = 57/133 (42%), Positives = 77/133 (57%), Gaps = 17/133 (12%)
 Frame = +2

Query: 17   TATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAG 196
            +A  I+ D+  L  RY  VEP +KC TC  PLL+R F++F C H FH+DCL + V     
Sbjct: 829  SAQRIRRDLQELRGRYGTVEPQDKCATCDFPLLNRPFYLFLCGHMFHADCLLQAVRPGLP 888

Query: 197  VGKSSRIRELQMQI------QKG-----------LVSGTQRETVVAELDALVASSCILCS 325
              K +R+ ELQ ++       KG             +G  RE + A+LD LVA+ C+ C 
Sbjct: 889  AYKQARLEELQRKLGAAPPPAKGSARAKEAEGGAATAGPSREQLKADLDELVAAECVYCG 948

Query: 326  DLAIKRIDEPFITHNDYVNEWIL*EEDRSW 415
            +L I+ ID PFI    Y       EE  SW
Sbjct: 949  ELMIRSIDRPFIDPQRYE------EEQLSW 972
>gb|AAF79641.1|AC025416_15 (AC025416) F5O11.22 [Arabidopsis thaliana]
          Length = 1063

 Score = 73.9 bits (180), Expect = 4e-13
 Identities = 51/128 (39%), Positives = 71/128 (54%), Gaps = 35/128 (27%)
 Frame = +2

Query: 20   ATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQ-----------------FFVFPCQH 148
            A NI+ DI+AL  RYA+++  E+C  C   +L                    F+VFPC H
Sbjct: 888  ADNIRNDISALTQRYAVIDRDEECGVCKRKILMMSGDFRMAQGYSSAGPLAPFYVFPCGH 947

Query: 149  SFHSDCLGRKVLEQAGVGKSSRIRELQMQI----------------QKGLVSGTQRETVV 280
            SFH+ CL   V   A   ++  I +LQ Q+                 + + S T  + + 
Sbjct: 948  SFHAQCLITHVTSCAHEEQAEHILDLQKQLTLLGSETRRDINGNRSDEPITSTTTADKLR 1007

Query: 281  AELDALVASSCILCSDLAIKRIDEPFITHND--YVNEWIL*EE 403
            +ELD  +AS C  C +L I  I  PFI   D  Y   W L  E
Sbjct: 1008 SELDDAIASECPFCGELMINEITLPFIKPEDSQYSTSWDLRSE 1050
>gb|AAF88074.1|AC025417_2 (AC025417) T12C24.2 [Arabidopsis thaliana]
          Length = 1010

 Score = 73.9 bits (180), Expect = 4e-13
 Identities = 51/128 (39%), Positives = 71/128 (54%), Gaps = 35/128 (27%)
 Frame = +2

Query: 20   ATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQ-----------------FFVFPCQH 148
            A NI+ DI+AL  RYA+++  E+C  C   +L                    F+VFPC H
Sbjct: 835  ADNIRNDISALTQRYAVIDRDEECGVCKRKILMMSGDFRMAQGYSSAGPLAPFYVFPCGH 894

Query: 149  SFHSDCLGRKVLEQAGVGKSSRIRELQMQI----------------QKGLVSGTQRETVV 280
            SFH+ CL   V   A   ++  I +LQ Q+                 + + S T  + + 
Sbjct: 895  SFHAQCLITHVTSCAHEEQAEHILDLQKQLTLLGSETRRDINGNRSDEPITSTTTADKLR 954

Query: 281  AELDALVASSCILCSDLAIKRIDEPFITHND--YVNEWIL*EE 403
            +ELD  +AS C  C +L I  I  PFI   D  Y   W L  E
Sbjct: 955  SELDDAIASECPFCGELMINEITLPFIKPEDSQYSTSWDLRSE 997
>ref|NP_013249.1| vacuolar membrane protein; Pep3p [Saccharomyces cerevisiae]
 sp|P27801|PEP3_YEAST VACUOLAR MEMBRANE PROTEIN PEP3
 pir||A41943 vacuolar membrane protein PEP3 - yeast (Saccharomyces cerevisiae)
 gb|AAA34852.1| (M65144) PEP3 [Saccharomyces cerevisiae]
 gb|AAB82382.1| (U53879) Pep3p: Vacuolar membrane protein [Saccharomyces cerevisiae]
 emb|CAA97720.1| (Z73320) ORF YLR148w [Saccharomyces cerevisiae]
          Length = 918

 Score = 67.8 bits (164), Expect = 3e-11
 Identities = 36/120 (30%), Positives = 63/120 (52%)
 Frame = +2

Query: 29   IKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGVGKS 208
            I  +I+  +  Y I+EPG+ C  CG  L  ++F VFPC H FH +C+ R +L       +
Sbjct: 806  INTEISKFNEIYRILEPGKSCDECGKFLQIKKFIVFPCGHCFHWNCIIRVIL-------N 858

Query: 209  SRIRELQMQIQKGLVSGTQRETVVAELDALVASSCILCSDLAIKRIDEPFITHNDYVNEW 388
            S    L+ + +  L + ++    + +L+ ++   C LCSD+ I +ID+P       + +W
Sbjct: 859  SNDYNLRQKTENFLKAKSKHN--LNDLENIIVEKCGLCSDININKIDQPISIDETELAKW 916
>gb|AAF45652.1| (AE003421) dor gene product [Drosophila melanogaster]
          Length = 1002

 Score = 64.7 bits (156), Expect = 2e-10
 Identities = 42/127 (33%), Positives = 66/127 (51%), Gaps = 18/127 (14%)
 Frame = +2

Query: 8    TRPTATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLE 187
            T      +  ++  L      VE  + C  C + LL + FF+F C H FHSDCL + V+ 
Sbjct: 858  TTEQTDRVTAELQQLRQHSLTVESQDTCEICEMMLLVKPFFIFICGHKFHSDCLEKHVVP 917

Query: 188  QAGVGKSSRIRELQMQI----------QKGLVSGTQ-------RETVVAELDALVASSCI 316
                 +  R+  L+ Q+          Q G +S  Q       R  +  E++ ++A+ C+
Sbjct: 918  LLTKEQCRRLGTLKQQLEAEVQTQAQPQSGALSKQQAMELQRKRAALKTEIEDILAADCL 977

Query: 317  LCSDLAIKRIDEPFITHNDYVN-EW 388
             C  L I  ID+PF+   + VN EW
Sbjct: 978  FCG-LLISTIDQPFVDDWEQVNVEW 1001
>pir||S54252 deep orange protein - fruit fly (Drosophila melanogaster)
 emb|CAA60382.1| (X86683) deep orange (dor) [Drosophila melanogaster]
          Length = 1002

 Score = 64.7 bits (156), Expect = 2e-10
 Identities = 42/127 (33%), Positives = 66/127 (51%), Gaps = 18/127 (14%)
 Frame = +2

Query: 8    TRPTATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLE 187
            T      +  ++  L      VE  + C  C + LL + FF+F C H FHSDCL + V+ 
Sbjct: 858  TTEQTDRVTAELQQLRQHSLTVESQDTCEICEMMLLVKPFFIFICGHKFHSDCLEKHVVP 917

Query: 188  QAGVGKSSRIRELQMQI----------QKGLVSGTQ-------RETVVAELDALVASSCI 316
                 +  R+  L+ Q+          Q G +S  Q       R  +  E++ ++A+ C+
Sbjct: 918  LLTKEQCRRLGTLKQQLEAEVQTQAQPQSGALSKQQAMELQRKRAALKTEIEDILAADCL 977

Query: 317  LCSDLAIKRIDEPFITHNDYVN-EW 388
             C  L I  ID+PF+   + VN EW
Sbjct: 978  FCG-LLISTIDQPFVDDWEQVNVEW 1001
>sp|Q24314|DOR_DROME DEEP ORANGE PROTEIN
 emb|CAA16809.1| (AL021726) EG:171E4.1 [Drosophila melanogaster]
          Length = 1002

 Score = 63.5 bits (153), Expect = 5e-10
 Identities = 40/106 (37%), Positives = 61/106 (56%), Gaps = 18/106 (16%)
 Frame = +2

Query: 71   VEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGVGKSSRIRELQMQI---- 238
            VE  + C  C + LL + FF+F C H FHSDCL + V+      +  R+  L+ Q+    
Sbjct: 879  VESQDTCEICEMMLLVKPFFIFICGHKFHSDCLEKHVVPLLTKEQCRRLGTLKQQLEAEV 938

Query: 239  ------QKGLVSGTQ-------RETVVAELDALVASSCILCSDLAIKRIDEPFITHNDYV 379
                  Q G +S  Q       R  +  E++ ++A+ C+ C  L I  ID+PF+   + V
Sbjct: 939  QTQAQPQSGALSKQQAMELQRKRAALKTEIEDILAADCLFCG-LLISTIDQPFVDDWEQV 997

Query: 380  N-EW 388
            N EW
Sbjct: 998  NVEW 1001
>pir||T34481 hypothetical protein W06B4.3 - Caenorhabditis elegans
          Length = 543

 Score = 60.1 bits (144), Expect = 5e-09
 Identities = 37/125 (29%), Positives = 70/125 (55%), Gaps = 19/125 (15%)
 Frame = +2

Query: 20  ATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGV 199
           A+ I+     L +R  +V+P + C  C  P+  R F V  C+H FH +CL   ++     
Sbjct: 400 ASEIRDKQEKLKNRTTVVKPSDVCSHCARPISGRAFNVHSCRHFFHRECLEIAMISFLSQ 459

Query: 200 GKSSRIREL---------QMQI------QKGLVSGTQRE-TVVAELDALVASSCILCSDL 331
            +  +++ L         QM+       QKG +   ++   + A +  +V + C LC ++
Sbjct: 460 EEVEKMKTLIIDEERVLSQMKAEQLAGNQKGFIEKQEKYLKIAAFISNIVGAECPLCGNI 519

Query: 332 AIKRIDEPFITHNDY---VNEWIL 394
           AI +ID+ F++  ++   +N W+L
Sbjct: 520 AISQIDKQFLSDEEFAADLNTWLL 543
>gb|AAC46819.2| (U23522) similar to vacuolar membrane protein PEP3 (SP:PEP3_YEAST,
            P27801) [Caenorhabditis elegans]
          Length = 1010

 Score = 60.1 bits (144), Expect = 5e-09
 Identities = 37/125 (29%), Positives = 70/125 (55%), Gaps = 19/125 (15%)
 Frame = +2

Query: 20   ATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGV 199
            A+ I+     L +R  +V+P + C  C  P+  R F V  C+H FH +CL   ++     
Sbjct: 867  ASEIRDKQEKLKNRTTVVKPSDVCSHCARPISGRAFNVHSCRHFFHRECLEIAMISFLSQ 926

Query: 200  GKSSRIREL---------QMQI------QKGLVSGTQRE-TVVAELDALVASSCILCSDL 331
             +  +++ L         QM+       QKG +   ++   + A +  +V + C LC ++
Sbjct: 927  EEVEKMKTLIIDEERVLSQMKAEQLAGNQKGFIEKQEKYLKIAAFISNIVGAECPLCGNI 986

Query: 332  AIKRIDEPFITHNDY---VNEWIL 394
            AI +ID+ F++  ++   +N W+L
Sbjct: 987  AISQIDKQFLSDEEFAADLNTWLL 1010
>pir||T41607 probable vacuolar membrane protein - fission yeast
            (Schizosaccharomyces pombe)
 emb|CAA21292.1| (AL031855) putative vacuolar membrane protein [Schizosaccharomyces
            pombe]
          Length = 900

 Score = 57.4 bits (137), Expect = 3e-08
 Identities = 25/71 (35%), Positives = 41/71 (57%)
 Frame = +2

Query: 20   ATNIKVDIAALDHRYAIVEPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGV 199
            A  I+ +   + +RY ++EP E C+ C  PL S  F +FPCQH+FH  C+  K  + A  
Sbjct: 814  AHEIQTNAENMRNRYIVLEPNESCWHCNQPLFSEPFVLFPCQHAFHRSCMLEKTYKLA-- 871

Query: 200  GKSSRIRELQM 232
             + + ++E Q+
Sbjct: 872  SEKNILKECQL 882
>emb|CAB91628.1| (AJ289080) Vps18 protein [Candida albicans]
          Length = 810

 Score = 44.7 bits (104), Expect = 2e-04
 Identities = 18/35 (51%), Positives = 23/35 (65%), Gaps = 1/35 (2%)
 Frame = +2

Query: 65  AIVEPGEKCYTCGLPLLSRQFFVFP-CQHSFHSDCL 169
           AI+EPGE C  CG  L+   F  FP C H+FH +C+
Sbjct: 761 AIIEPGEPCRKCGKLLVQENFVYFPNCHHAFHKECM 796
>gb|AAK40566.1| (AE006659) DNA-directed RNA polymerase, subunit A' (rpoA1)
           [Sulfolobus solfataricus]
          Length = 880

 Score = 35.0 bits (79), Expect = 0.18
 Identities = 31/108 (28%), Positives = 57/108 (52%), Gaps = 18/108 (16%)
 Frame = +2

Query: 35  VDIAALDHRYAIVEPGEKCYTCG------------LPLLSRQFFVFPCQHSFH---SDC- 166
           ++ + +D R  ++EPG+KC TCG            + L+     V   +H +    + C 
Sbjct: 40  IEGSVMDPRLGVIEPGQKCPTCGNTLGNCPGHFGHIELVRPVIHVGLVKHIYEFLKATCR 99

Query: 167 -LGRKVLEQAGVGKSSRIRELQMQIQKGLVSGTQRETVVAELDALVASSCILCSDLAIK- 340
             GR  + +  + K SRI      I+K   S  +R T   +  A+ A  C  C++   K 
Sbjct: 100 RCGRVKISEDEIEKYSRIYN---AIKKRWPSAARRLTEYVKKTAMKAQVCPHCNEKQYKI 156

Query: 341 RIDEPF 358
           ++++P+
Sbjct: 157 KLEKPY 162
>emb|CAB95611.1| (AL359782) hypothetical protein, CHR1.386 [Trypanosoma brucei]
          Length = 731

 Score = 34.3 bits (77), Expect = 0.32
 Identities = 20/53 (37%), Positives = 29/53 (53%), Gaps = 7/53 (13%)
 Frame = +2

Query: 11  RPTATNIKVDIA----ALDHRYAIVEPGEKC-YTCGLPL--LSRQFFVFPCQHSFHSDCL 169
           RP A  I+VD+        HR  + E   +C ++CGL +  L       PC H+F  +CL
Sbjct: 578 RPVAQEIRVDVKNSLLTAMHRLIVAEEATECIFSCGLCMQPLKSPLTCVPCGHTFCEECL 637
>gb|AAF54693.1| (AE003692) CG6923 gene product [Drosophila melanogaster]
          Length = 1256

 Score = 33.5 bits (75), Expect = 0.54
 Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 10/55 (18%)
 Frame = +2

Query: 5    PTRPT--ATNIKVDIAALDHRYAIV-------EPGEKCYTC-GLPLLSRQFFVFPCQHSF 154
            P+RP   AT   ++   L H+Y  V       E  EKC  C  L  +  +    PC H F
Sbjct: 1150 PSRPNRGATLETIERNTLPHKYRRVRRPSETDEDAEKCAICLNLFEIENEVRRLPCMHLF 1209

Query: 155  HSDCL 169
            H+DC+
Sbjct: 1210 HTDCV 1214
>dbj|BAB08428.1| (AB017067) gb|AAF32311.1~gene_id:MJC20.5~similar to unknown protein
           [Arabidopsis thaliana]
          Length = 565

 Score = 33.1 bits (74), Expect = 0.70
 Identities = 20/63 (31%), Positives = 35/63 (54%)
 Frame = -3

Query: 205 LSDARLLQYLSPQAIRVERVLTGKNKELSTQ*GQSTSITLLTRLHDCVAMVECSNINLDV 26
           L+  RLLQ L P+   + R L+GK + +ST   + T    +T +   V++  C+++ L  
Sbjct: 458 LNHERLLQVLKPEPREMGRNLSGKAETMSTNVERKTVKVNITEI---VSVTPCADLTLPP 514

Query: 25  GSG 17
           G+G
Sbjct: 515 GAG 517
>ref|NP_055463.1| KIAA0675 gene product [Homo sapiens]
 pir||T00362 hypothetical protein KIAA0675 - human
 dbj|BAA31650.1| (AB014575) KIAA0675 protein [Homo sapiens]
          Length = 1208

 Score = 32.7 bits (73), Expect = 0.92
 Identities = 18/51 (35%), Positives = 22/51 (42%)
 Frame = +2

Query: 74   EPGEKCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGVGKSSRIREL 226
            E  E C  C   L      V PC H FH+ C+ R  L Q G   + R+  L
Sbjct: 1143 EEEEPCVICHENLSPENLSVLPCAHKFHAQCI-RPWLMQQGTCPTCRLHVL 1192
>ref|NP_066681.1| similar to ardC gene in pSa(IncW plasmid) [Rhizobium rhizogenes]
 dbj|BAB16219.1| (AP002086) similar to ardC gene in pSa(IncW plasmid) [Rhizobium
           rhizogenes]
          Length = 309

 Score = 32.7 bits (73), Expect = 0.92
 Identities = 17/42 (40%), Positives = 22/42 (51%)
 Frame = +2

Query: 266 RETVVAELDALVASSCILCSDLAIKRIDEPFITHNDYVNEWI 391
           RE ++AEL      SC LC+DL I    EP   H  Y+  W+
Sbjct: 234 REELIAEL-----GSCFLCADLGIAPELEPRPDHASYLQSWL 270
>dbj|BAB22082.1| (AK002414) putative [Mus musculus]
          Length = 379

 Score = 32.3 bits (72), Expect = 1.2
 Identities = 15/36 (41%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
 Frame = +2

Query: 62  YAIVEPG-EKCYTCGLPLLSRQFF-VFPCQHSFHSDCL 169
           +++ EPG E C  C     ++Q+  V PC+H FH DC+
Sbjct: 317 HSLPEPGTETCAVCLDYFCNKQWLRVLPCKHEFHRDCV 354
>pir||E72520 hypothetical protein APE2138 - Aeropyrum pernix (strain K1)
 dbj|BAA81149.1| (AP000063) 122aa long hypothetical protein [Aeropyrum pernix]
          Length = 122

 Score = 32.3 bits (72), Expect = 1.2
 Identities = 19/57 (33%), Positives = 28/57 (48%)
 Frame = +2

Query: 149 SFHSDCLGRKVLEQAGVGKSSRIRELQMQIQKGLVSGTQRETVVAELDALVASSCIL 319
           S  S C+G+KV E AGV K S   E  + + K L    + E +V  + A  +   +L
Sbjct: 40  SMLSACIGKKVAENAGVDKVSVSIEAYVNVDKLLEGKEEIEHIVVRIRAPASEEAVL 96
>emb|CAA11444.1| (AJ223577) intermediate filament protein C1 [Branchiostoma
           floridae]
          Length = 839

 Score = 32.0 bits (71), Expect = 1.6
 Identities = 19/53 (35%), Positives = 27/53 (50%)
 Frame = +2

Query: 146 HSFHSDCLGRKVLEQAGVGKSSRIRELQMQIQKGLVSGTQRETVVAELDALVA 304
           HS  S      V E+   G  SRI  LQ +I   L    +R+T++A+L A +A
Sbjct: 472 HSVQSSVQSSAVFEKQVSGLESRIGSLQGEIDSKLAIIRERDTMIADLRAQIA 524
>sp|Q10322|DMA1_SCHPO DMA1 PROTEIN
 pir||T37862 dma1 protein - fission yeast (Schizosaccharomyces pombe)
 emb|CAA93693.1| (Z69795) dma1 protein. [Schizosaccharomyces pombe]
 emb|CAA57466.1| (X81883) non-essential gene [Schizosaccharomyces pombe]
          Length = 267

 Score = 31.6 bits (70), Expect = 2.0
 Identities = 13/28 (46%), Positives = 18/28 (63%), Gaps = 1/28 (3%)
 Frame = +2

Query: 86  KCYTCGLPLLSRQ-FFVFPCQHSFHSDCL 169
           +C  C +P+L  Q  FV PC HS+H  C+
Sbjct: 191 ECCICLMPVLPCQALFVAPCSHSYHYKCI 219
>ref|NP_111000.1| Uncharacterized conserved protein [Thermoplasma volcanium]
 dbj|BAB59622.1| (AP000992) unknown product [Thermoplasma volcanium]
          Length = 100

 Score = 31.6 bits (70), Expect = 2.0
 Identities = 16/55 (29%), Positives = 24/55 (43%)
 Frame = +2

Query: 86  KCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGVGKSSRIRELQMQIQKGL 250
           KCY C  P+ + + F F  + S H DC       + G  K   +R L + +   L
Sbjct: 5   KCYVCEKPVKTGEKFTFTKKGSVHYDCFIADKRNKLGEEKLENLRVLSILLDSNL 59
>gb|AAF58107.2| (AE003809) CG8242 gene product [Drosophila melanogaster]
          Length = 422

 Score = 31.2 bits (69), Expect = 2.7
 Identities = 13/41 (31%), Positives = 21/41 (50%)
 Frame = +2

Query: 86  KCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGVGKS 208
           KC+ CG P+ +   +V    H++HS C      +Q   G+S
Sbjct: 367 KCFACGFPVEAGDRWVEALNHNYHSQCFNCTFCKQNLEGQS 407
>ref|NP_106707.1| antirestriction protein [Mesorhizobium loti]
 dbj|BAB52493.1| (AP003008) antirestriction protein [Mesorhizobium loti]
          Length = 320

 Score = 31.2 bits (69), Expect = 2.7
 Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
 Frame = +2

Query: 266 RETVVAELDALVASSCILCSDLAIKRIDEPFITHNDYVNEW--IL*EEDRSWWKRA 427
           RE +VAE+ A       +CS L I    EP + H DY+  W  +L E++R+ ++ A
Sbjct: 247 REELVAEITA-----AFVCSTLGI----EPTVRHADYIGTWLKVLREDNRAIFRAA 293
>gb|AAK38349.1| (AY030283) PHD zinc finger transcription factor [Homo sapiens]
          Length = 704

 Score = 30.8 bits (68), Expect = 3.5
 Identities = 19/56 (33%), Positives = 25/56 (43%), Gaps = 5/56 (8%)
 Frame = +2

Query: 2   RPTRPTATNIKVDIAALDHRYAIVEPGEKCYTCG-----LPLLSRQFFVFPCQHSFHSDC 166
           R    T  N+K     LDH   +  P + C+TC       PL+   +    C   FH DC
Sbjct: 245 RKEETTGKNVKKTQHELDHNGLVPLPVKVCFTCNRSCRVAPLIQCDY----CPLLFHMDC 300

Query: 167 L 169
           L
Sbjct: 301 L 301
>gb|AAH01657.1|AAH01657 (BC001657) Unknown (protein for IMAGE:3356959) [Homo sapiens]
          Length = 487

 Score = 30.8 bits (68), Expect = 3.5
 Identities = 19/56 (33%), Positives = 25/56 (43%), Gaps = 5/56 (8%)
 Frame = +2

Query: 2   RPTRPTATNIKVDIAALDHRYAIVEPGEKCYTCG-----LPLLSRQFFVFPCQHSFHSDC 166
           R    T  N+K     LDH   +  P + C+TC       PL+   +    C   FH DC
Sbjct: 160 RKEETTGKNVKKTQHELDHNGLVPLPVKVCFTCNRSCRVAPLIQCDY----CPLLFHMDC 215

Query: 167 L 169
           L
Sbjct: 216 L 216
>gb|AAD22654.1|AC007138_18 (AC007138) putative CHP-rich zinc finger protein [Arabidopsis
           thaliana]
 emb|CAB80685.1| (AL161493) putative CHP-rich zinc finger protein [Arabidopsis
           thaliana]
          Length = 658

 Score = 30.4 bits (67), Expect = 4.5
 Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 2/57 (3%)
 Frame = +2

Query: 2   RPTRPTATNIKVDIAALDHRYAIVEPGEK--CYTCGLPLLSRQFFVFPCQHSFHSDCLG 172
           RP + +  N+K    A DH+  ++   +   C  CGL      +  F C    H DCLG
Sbjct: 233 RPPQQSLLNLK----AHDHQLTLLPKLDSFTCNACGLKGDRSPYVCFQCGFMIHQDCLG 287
>pir||T00600 hypothetical protein T8K22.6 - Arabidopsis thaliana
 gb|AAC18923.1| (AC004136) hypothetical protein [Arabidopsis thaliana]
          Length = 627

 Score = 30.0 bits (66), Expect = 5.9
 Identities = 10/32 (31%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
 Frame = +2

Query: 74  EPGEKCYTCGLPLLSRQ-FFVFPCQHSFHSDCL 169
           E G++CY+C +  +    +F   C   FH +C+
Sbjct: 114 EEGDECYSCSIQTIGTDYYFCATCDKRFHKECV 146
>pir||T00603 hypothetical protein T8K22.9 - Arabidopsis thaliana
 gb|AAC18926.1| (AC004136) hypothetical protein [Arabidopsis thaliana]
          Length = 627

 Score = 30.0 bits (66), Expect = 5.9
 Identities = 10/32 (31%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
 Frame = +2

Query: 74  EPGEKCYTCGLPLLSRQ-FFVFPCQHSFHSDCL 169
           E G++CY+C +  +    +F   C   FH +C+
Sbjct: 114 EEGDECYSCSIQTIGTDYYFCATCDKRFHKECV 146
>dbj|BAB22071.1| (AK002399) putative [Mus musculus]
          Length = 456

 Score = 29.6 bits (65), Expect = 7.8
 Identities = 25/77 (32%), Positives = 38/77 (48%), Gaps = 7/77 (9%)
 Frame = +2

Query: 77  PGEKCYTCGLPLLSRQFFV-FPCQHSFHSDCLGRKV--LEQAGVGKSSRIRELQMQIQKG 247
           P  +C  C      ++ F   PC H FH  CL R +  +EQ  +    + +E Q  + K 
Sbjct: 130 PHGQCVICLYGFQEKEAFTKTPCYHYFHCHCLARYIQHMEQE-LTTQEQEQERQHVVTKQ 188

Query: 248 LVSGTQ----RETVVAELDALVAS 307
              G Q    RE +V +L +L A+
Sbjct: 189 KAVGVQCPVCREPLVYDLASLKAA 212
>gb|AAF81249.1|AF267753_1 (AF267753) putative potassium channel protein Mkt1p
           [Mesembryanthemum crystallinum]
          Length = 870

 Score = 29.6 bits (65), Expect = 7.8
 Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 6/52 (11%)
 Frame = -3

Query: 313 TRRRNESVQFC-----HNRLPLRTTDQALLYLHLKF-PDTR*LSDARLLQYLSPQAIR 158
           TRR  +++Q        NRLP+R  DQ L +L LK+  D+  L    +L  L P+AI+
Sbjct: 306 TRRFRDTIQAASSFGLRNRLPVRLQDQMLAHLCLKYRTDSEGLQQQEVLDSL-PKAIK 362
>pir||T03835 vacA protein - slime mold (Dictyostelium discoideum)  (fragment)
 gb|AAB69389.1| (AF015565) VacA [Dictyostelium discoideum]
          Length = 708

 Score = 29.6 bits (65), Expect = 7.8
 Identities = 9/15 (60%), Positives = 13/15 (86%)
 Frame = +2

Query: 131 VFPCQHSFHSDCLGR 175
           V+ C H+FHS+CLG+
Sbjct: 558 VYQCNHTFHSECLGK 572
>gb|AAB71964.1| (AC002292) Hypothetical protein [Arabidopsis thaliana]
          Length = 664

 Score = 29.6 bits (65), Expect = 7.8
 Identities = 15/41 (36%), Positives = 22/41 (53%)
 Frame = +2

Query: 86  KCYTCGLPLLSRQFFVFPCQHSFHSDCLGRKVLEQAGVGKS 208
           KC  C L + S   F F C+ +FH  C+  K ++  G+ KS
Sbjct: 507 KCIACKLDIKSYGHFCFICEVAFHIKCI--KDVDGLGIVKS 545
>dbj|BAB27445.1| (AK011171) putative [Mus musculus]
          Length = 121

 Score = 29.6 bits (65), Expect = 7.8
 Identities = 25/77 (32%), Positives = 38/77 (48%), Gaps = 7/77 (9%)
 Frame = +2

Query: 77  PGEKCYTCGLPLLSRQFFV-FPCQHSFHSDCLGRKV--LEQAGVGKSSRIRELQMQIQKG 247
           P  +C  C      ++ F   PC H FH  CL R +  +EQ  +    + +E Q  + K 
Sbjct: 19  PHGQCVICLYGFQEKEAFTKTPCYHYFHCHCLARYIQHMEQE-LTTQEQEQERQHVVTKQ 77

Query: 248 LVSGTQ----RETVVAELDALVAS 307
              G Q    RE +V +L +L A+
Sbjct: 78  KAVGVQCPVCREPLVYDLASLKAA 101
>pir||A33926 DNA-directed RNA polymerase (EC 2.7.7.6) chain A - Halobacterium
           salinarum
          Length = 972

 Score = 29.6 bits (65), Expect = 7.8
 Identities = 9/23 (39%), Positives = 16/23 (69%)
 Frame = +2

Query: 35  VDIAALDHRYAIVEPGEKCYTCG 103
           +D+  +D R  +++PG +C TCG
Sbjct: 44  IDMGLMDPRLGVIDPGLECKTCG 66
>sp|P15350|RPA1_HALHA DNA-DIRECTED RNA POLYMERASE SUBUNIT A'
 emb|CAA40426.1| (X57144) RNA polymerase subunit A [Halobacterium salinarum]
          Length = 971

 Score = 29.6 bits (65), Expect = 7.8
 Identities = 9/23 (39%), Positives = 16/23 (69%)
 Frame = +2

Query: 35  VDIAALDHRYAIVEPGEKCYTCG 103
           +D+  +D R  +++PG +C TCG
Sbjct: 44  IDMGLMDPRLGVIDPGLECKTCG 66
Database: nr Posted date: Jun 30, 2001 11:40 PM Number of letters in database: 222,175,239 Number of sequences in database: 705,144 Lambda K H 0.318 0.135 0.00 Gapped Lambda K H 0.267 0.0410 4.94e-324 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 181,233,900 Number of Sequences: 705144 Number of extensions: 3516882 Number of successful extensions: 10720 Number of sequences better than 10.0: 76 Number of HSP's better than 10.0 without gapping: 10341 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 10705 length of database: 222,175,239 effective HSP length: 107 effective length of database: 146,724,831 effective search space used: 5868993240 frameshift window, decay const: 50, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits)