The query sequence for this search has been filtered. Filtering
eliminates low complexity regions that commonly give spuriously high
scores that reflect compositional bias rather than significant
position-by-position alignment. Filtering can eliminate these potentially
confounding matches (e.g., hits against proline-rich regions or poly-A
tails) from the blast reports, leaving regions whose blast statistics
reflect the specificity of their pairwise alignment.
BLASTX 2.1.1 [Aug-8-2000]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Contig21.seq Contig21
(1011 letters)
Database: nr
565,281 sequences; 177,575,912 total letters
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB91755.1| (AL356173) condensin complex component cnd2... 126 4e-28
ref|NP_009455.1| involved in chromosome maintenance; simila... 87 3e-16
pir||T41281 probable condensin subunit - fission yeast (Sch... 84 3e-15
gb|AAC60203.1| (U90125) 13S condensin XCAP-H subunit [Xenop... 71 2e-11
dbj|BAA07556.1| (D38553) KIAA0074; The ha1438 gene product ... 66 6e-10
pir||T02558 hypothetical protein T26B15.15 - Arabidopsis th... 62 1e-08
gb|AAF53866.1| (AE003665) barr gene product [Drosophila mel... 43 0.005
gb|AAB40125.1| (U74488) Barren [Drosophila melanogaster] 43 0.005
pir||T33853 hypothetical protein D1037.2 - Caenorhabditis e... 35 1.5
gb|AAF50353.1| (AE003553) CG5741 gene product [Drosophila m... 32 7.8
>emb|CAB91755.1| (AL356173) condensin complex component cnd2 related protein
[Neurospora crassa]
Length = 832
Score = 126 bits (313), Expect = 4e-28
Identities = 78/131 (59%), Positives = 86/131 (65%), Gaps = 20/131 (15%)
Frame = -3
Query: 1000 EEEFWAKQKAPQN----TDDTAPPGGDYDANFFQXXX---XXXXXXXXXXXXXXXXXXXA 842
+E FWA+QKAP N + D P GDYDANFFQ A
Sbjct: 543 DEAFWAQQKAPLNQGPSSQDDHLPQGDYDANFFQDDGLPFANGGDDDDDEDMDEEVFADA 602
Query: 841 REHFSPGV-----------DGQAGLTEGGGFTALLNGETVTNTG--AFGTTLVTQTRRVR 701
R+HFSPG +G AGLT G T NG TVTN AFGT LVTQ+RRVR
Sbjct: 603 RDHFSPGPGGTQGDGAAGKEGTAGLTMDVGMTGAFNGLTVTNPADLAFGTMLVTQSRRVR 662
Query: 700 PEYVQYARVAKKVDVRRLKEELWKGMDNDIL 608
PEYVQYAR AKKVDVRRLKEELW+GM D+L
Sbjct: 663 PEYVQYARRAKKVDVRRLKEELWRGMGMDML 693
>ref|NP_009455.1| involved in chromosome maintenance; similar to Drosophila barren,
Xenopus XCAP-H, and human BRRN1; Brn1p
sp|P38170|YBJ7_YEAST HYPOTHETICAL 83.0 KD PROTEIN IN ATP1-ROX3 INTERGENIC REGION
pir||S45403 hypothetical protein YBL097w - yeast (Saccharomyces cerevisiae)
emb|CAA56003.1| (X79489) C-728 protein [Saccharomyces cerevisiae]
emb|CAA84919.1| (Z35858) ORF YBL097w [Saccharomyces cerevisiae]
Length = 728
Score = 87.0 bits (212), Expect = 3e-16
Identities = 45/103 (43%), Positives = 70/103 (67%), Gaps = 7/103 (6%)
Frame = -3
Query: 706 VRPEYVQYARVAKKVDVRRLKEELWKGMDNDIL---GKQPEPLASPDSDFKQD----QPL 548
+R V Y+RV+KKVDVRRLK+ +W+ ++N I ++ +S DS+ + + L
Sbjct: 605 IRENKVTYSRVSKKVDVRRLKKNVWRSINNLIQEHDSRKNREQSSNDSETHTEDESTKEL 664
Query: 547 KFTEVMNNLQSVYPKPVMDDISTSYCFICLLHLANEKGLVIENTPGLSEL 398
KF++++ + +Y + DISTS+CFICLLHLANE GL I +T ++L
Sbjct: 665 KFSDIIQGISKMYSDDTLKDISTSFCFICLLHLANEHGLQITHTENYNDL 714
>pir||T41281 probable condensin subunit - fission yeast (Schizosaccharomyces
pombe)
emb|CAB41651.1| (AL049728) putative condensin subunit [Schizosaccharomyces pombe]
dbj|BAA82625.1| (AB030213) subunit of condensin complex [Schizosaccharomyces pombe]
Length = 742
Score = 83.9 bits (204), Expect = 3e-15
Identities = 48/124 (38%), Positives = 73/124 (58%), Gaps = 5/124 (4%)
Frame = -3
Query: 760 TVTNTGAFGTTLVTQTRRVRPEYVQYARVAKKVDVRRLKEELWKGMDNDILGKQPEPLAS 581
T ++ FG L+ R +P+ + YA+ AKKVDVR LKE+LWK +D + K+ +
Sbjct: 599 TPPSSSGFGDNLLLTARLAKPDMLNYAKRAKKVDVRVLKEKLWKCLDLENTIKENSINSH 658
Query: 580 PDSDFKQDQ----PLK-FTEVMNNLQSVYPKPVMDDISTSYCFICLLHLANEKGLVIENT 416
+ + + P+K F +N L+ Y K + DISTS+ FIC+LHLANE L + +
Sbjct: 659 IEGSEMESEETNMPVKSFFSTVNQLEETYEKKELKDISTSFAFICVLHLANEHNLELTSN 718
Query: 415 PGLSELEIR 389
S++ IR
Sbjct: 719 EDFSDVFIR 727
>gb|AAC60203.1| (U90125) 13S condensin XCAP-H subunit [Xenopus laevis]
Length = 699
Score = 71.0 bits (171), Expect = 2e-11
Identities = 45/151 (29%), Positives = 79/151 (51%), Gaps = 8/151 (5%)
Frame = -3
Query: 835 HFSPGVDGQAGLTEGGGF------TALLNGETVTNTGAFGTTLVTQTRRVRPEYVQYARV 674
+F P + + G F +A + E N ++G + + ++V +QYA+
Sbjct: 541 NFCPALQAADSDDDDGAFLGPESNSAGFSAENQMNITSYGESNLVAGQKVNKIEIQYAKT 600
Query: 673 AKKVDVRRLKEELWKGMDNDILGKQPEPLASPDSDFK--QDQPLKFTEVMNNLQSVYPKP 500
AKK+D++RLK +W + N ++ P + + D D+ + F+ V + LQ P
Sbjct: 601 AKKMDMKRLKSSMWSLLANCPESQEEMPSSKEEIDAALITDEQV-FSSVTHGLQKRLPPV 659
Query: 499 VMDDISTSYCFICLLHLANEKGLVIENTPGLSELEIRRD 383
+ ++S F CLLHLANEK L ++ LS++ I +D
Sbjct: 660 MAQNLSVPLAFACLLHLANEKNLKLQGMDDLSDVMIMQD 698
>dbj|BAA07556.1| (D38553) KIAA0074; The ha1438 gene product is related to a C728
protein encoded in S.cerevisiae chromosome II.; human
homologue of XCAP-H, Acc#:U90125 [Homo sapiens]
Length = 747
Score = 66.0 bits (158), Expect = 6e-10
Identities = 44/132 (33%), Positives = 74/132 (55%), Gaps = 11/132 (8%)
Frame = -3
Query: 781 TALLNGETVTNTGAFGTT-----LVTQTRRVRPEYVQYARVAKKVDVRRLKEELWKGMDN 617
TA NG+T G TT LV + ++V + YA+ AKK+D+++LK+ +W +
Sbjct: 604 TAQQNGDTPEAQGLDITTYGESNLVAEPQKVNKIEIHYAKTAKKMDMKKLKQSMWSLL-T 662
Query: 616 DILGKQPEPLASPDSDFKQDQPLKFTE------VMNNLQSVYPKPVMDDISTSYCFICLL 455
+ GK+ + A+ K+ + + + +LQ P + ++S F CLL
Sbjct: 663 ALSGKEADAEANHREAGKEAALAEVADEKMLSGLTKDLQRSLPPVMAQNLSIPLAFACLL 722
Query: 454 HLANEKGLVIENTPGLSELEIRR 386
HLANEK L +E T LS++ +R+
Sbjct: 723 HLANEKNLKLEGTEDLSDVLVRQ 745
>pir||T02558 hypothetical protein T26B15.15 - Arabidopsis thaliana
gb|AAC25941.1| (AC004681) hypothetical protein [Arabidopsis thaliana]
Length = 704
Score = 61.7 bits (147), Expect = 1e-08
Identities = 43/113 (38%), Positives = 62/113 (54%), Gaps = 7/113 (6%)
Frame = -3
Query: 730 TLVTQTRRVRPEYVQYARVAKKVDVRRLKEELWKGMDNDILGKQP--EPLASPDSDFKQD 557
TL++Q R+V VQY + +K+VDV+ LKE LW+ + QP + L D + +Q+
Sbjct: 587 TLISQPRQVNKIDVQYDKASKQVDVQVLKETLWECLQES---HQPPIQNLIVQDEEHQQE 643
Query: 556 QPLKFTEVMNNLQSVYPKPVM-----DDISTSYCFICLLHLANEKGLVIENTPGLSELEI 392
P + L + +P DIS CFICLLHLANE L + + L +L I
Sbjct: 644 PPE--SRSFKVLLASFPDDCQAAERTQDISPHLCFICLLHLANEHNLSLIGSQNLDDLTI 701
>gb|AAF53866.1| (AE003665) barr gene product [Drosophila melanogaster]
Length = 735
Score = 43.0 bits (99), Expect = 0.005
Identities = 36/143 (25%), Positives = 59/143 (41%), Gaps = 4/143 (2%)
Frame = -3
Query: 814 GQAGLTEGGGFTALLNGETVTNTGAFGTTLVTQTRRVRPEYVQYARVAKKVDVRRLKEEL 635
G+ GLT+ +N GT +V V +A+ AK +D++ LK+
Sbjct: 593 GEIGLTQ-------MNATCNNTVFEIGTEFEGAPSQVAKVIVPFAKRAKVIDMKNLKKSC 645
Query: 634 WKGMDNDILGKQPEPLASPDSDFKQDQPLK----FTEVMNNLQSVYPKPVMDDISTSYCF 467
+ +L PE K + K F +V L + + D +S S F
Sbjct: 646 NSLIQKQLLNAVPEETIPSHPKKKGEHYSKGFASFQQVYQKLPDLLTTKMSDSLSPSVAF 705
Query: 466 ICLLHLANEKGLVIENTPGLSELEIRR 386
+LHLAN+ L + L + +IR+
Sbjct: 706 YAVLHLANDLKLRLIPQEDLEDFQIRQ 732
>gb|AAB40125.1| (U74488) Barren [Drosophila melanogaster]
Length = 736
Score = 43.0 bits (99), Expect = 0.005
Identities = 36/143 (25%), Positives = 59/143 (41%), Gaps = 4/143 (2%)
Frame = -3
Query: 814 GQAGLTEGGGFTALLNGETVTNTGAFGTTLVTQTRRVRPEYVQYARVAKKVDVRRLKEEL 635
G+ GLT+ +N GT +V V +A+ AK +D++ LK+
Sbjct: 594 GEIGLTQ-------MNATCNNTVFEIGTEFEGAPSQVAKVIVPFAKRAKVIDMKNLKKSC 646
Query: 634 WKGMDNDILGKQPEPLASPDSDFKQDQPLK----FTEVMNNLQSVYPKPVMDDISTSYCF 467
+ +L PE K + K F +V L + + D +S S F
Sbjct: 647 NSLIQKQLLNAVPEETIPSHPKKKGEHYSKGFASFQQVYQKLPDLLTTKMSDSLSPSVAF 706
Query: 466 ICLLHLANEKGLVIENTPGLSELEIRR 386
+LHLAN+ L + L + +IR+
Sbjct: 707 YAVLHLANDLKLRLIPQEDLEDFQIRQ 733
>pir||T33853 hypothetical protein D1037.2 - Caenorhabditis elegans
gb|AAC78492.1| (AF106592) contains similarity to human putative DNA binding
protein SON3 (GB:X63753) [Caenorhabditis elegans]
Length = 676
Score = 34.8 bits (78), Expect = 1.5
Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 1/95 (1%)
Frame = -3
Query: 658 VRRLKEELWKGMDNDILGKQPEPLAS-PDSDFKQDQPLKFTEVMNNLQSVYPKPVMDDIS 482
VRR ++WKG++ND++ + P L+S ++F +K + + + + M+
Sbjct: 203 VRRNVIKMWKGIENDVILRTPRGLSSFEQANFLSMHKVKKVRGVEEVLFFFSESPMETEG 262
Query: 481 TSYCFICLLHLANEKGLVIENTPGLSELEIRRDWTA 374
C LH G V E+ PG L ++W++
Sbjct: 263 ------CGLHKVARVGRVCEDDPG-GRLSYNKEWSS 291
>gb|AAF50353.1| (AE003553) CG5741 gene product [Drosophila melanogaster]
Length = 1060
Score = 32.5 bits (72), Expect = 7.8
Identities = 13/42 (30%), Positives = 23/42 (53%)
Frame = -3
Query: 736 GTTLVTQTRRVRPEYVQYARVAKKVDVRRLKEELWKGMDNDI 611
G + R P+YV++ V + +R ++ELWKG+D +
Sbjct: 705 GMIMSANVARNSPDYVRFMAVYQSRRAKRQEDELWKGIDQSL 746
Database: nr
Posted date: Sep 29, 2000 9:53 PM
Number of letters in database: 177,575,912
Number of sequences in database: 565,281
Lambda K H
0.318 0.135 0.00
Gapped
Lambda K H
0.270 0.0470 4.94e-324
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 324684909
Number of Sequences: 565281
Number of extensions: 6702915
Number of successful extensions: 18921
Number of sequences better than 10.0: 20
Number of HSP's better than 10.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 18901
Number of HSP's gapped (non-prelim): 14
length of query: 337
length of database: 177,575,912
effective HSP length: 54
effective length of query: 282
effective length of database: 147,050,738
effective search space: 41468308116
effective search space used: 41468308116
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 71 (32.1 bits)