Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC014476A_C01 KMC014476A_c01
(514 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAB10561.1| topoisomerase-like protein [Arabidopsis thaliana] 269 1e-71
ref|NP_568968.1| topoisomerase, putative; protein id: At5g63190.... 269 1e-71
gb|AAM63106.1| topoisomerase-like protein [Arabidopsis thaliana] 269 1e-71
pir||T06664 hypothetical protein F6I7.10 - Arabidopsis thaliana ... 264 5e-70
ref|NP_567708.1| topoisomerase, putative; protein id: At4g24800.... 264 5e-70
>dbj|BAB10561.1| topoisomerase-like protein [Arabidopsis thaliana]
Length = 729
Score = 269 bits (688), Expect = 1e-71
Identities = 133/171 (77%), Positives = 153/171 (88%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
ALYADVI P QIRDGF L+ S DDLAVDILDAV++LALF+ARA+VD+ILPP FL R++K
Sbjct: 211 ALYADVILPDQIRDGFIRLLRSVDDLAVDILDAVNVLALFIARAIVDEILPPVFLVRSKK 270
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
LPES KG QVI TAEKSYLSAPHHAELVE++WGG+TH TVEE KKKI+++LKEYV++GD
Sbjct: 271 ILPESCKGFQVIVTAEKSYLSAPHHAELVEKKWGGSTHTTVEETKKKISEILKEYVENGD 330
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
T EACRCIRELGVSFFHHEVVKRALVLAM++ +AE L+LKLLKE AEEGLI
Sbjct: 331 TYEACRCIRELGVSFFHHEVVKRALVLAMDSPTAESLVLKLLKETAEEGLI 381
Score = 154 bits (390), Expect = 4e-37
Identities = 86/171 (50%), Positives = 116/171 (67%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
AL+ ++ S +GF ML+ESA+D A+DI+DA + LALFLARAV+DD+L P L
Sbjct: 509 ALHMELFSTEDFINGFIMLLESAEDTALDIMDASNELALFLARAVIDDVLAPLNLEDIST 568
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
LP S G + +++A +S +SA H E + R WGG T VE+ K KI+ LL+EY G
Sbjct: 569 KLPPKSTGTETVRSA-RSLISARHAGERLLRSWGGGTGWIVEDAKDKISKLLEEYETGGV 627
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
T EAC+CIR+LG+ FF+HEVVK+ALV+AME ++ LL LL+E EGLI
Sbjct: 628 TSEACQCIRDLGMPFFNHEVVKKALVMAMEKQNDR--LLNLLEECFGEGLI 676
Score = 47.8 bits (112), Expect = 8e-05
Identities = 38/160 (23%), Positives = 71/160 (43%)
Frame = +2
Query: 17 VISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARKALPES 196
+IS +Q+ GFF + ES DDLA+DI A + + +A+ L +F + + +S
Sbjct: 380 LISSSQMVKGFFRVAESLDDLALDIPSAKKLFDSIVPKAISGGWLDDSFKITSDQDGEKS 439
Query: 197 SKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGDTLEAC 376
S+ ++ Q KK ++++EY S D E
Sbjct: 440 SQDGKLRQ------------------------------YKKDTVNIIQEYFLSDDIPELI 469
Query: 377 RCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEA 496
R +++LG ++ +KR + LA++ ++ E + +L A
Sbjct: 470 RSLQDLGAPEYNPVFLKRLITLALDRKNREKEMASVLLSA 509
Score = 40.8 bits (94), Expect = 0.009
Identities = 20/65 (30%), Positives = 33/65 (50%)
Frame = +2
Query: 302 VEEVKKKIADLLKEYVDSGDTLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLK 481
+ + KK + ++ EY +GD A +RELG S +H KR + +AM+ E +
Sbjct: 147 LNDYKKSVVSIIDEYFSTGDVKVAASDLRELGSSEYHPYFTKRLVSMAMDRHDKEKEMAS 206
Query: 482 LLKEA 496
+L A
Sbjct: 207 VLLSA 211
>ref|NP_568968.1| topoisomerase, putative; protein id: At5g63190.1, supported by
cDNA: 19433. [Arabidopsis thaliana]
gi|14532646|gb|AAK64051.1| putative topoisomerase
[Arabidopsis thaliana] gi|23296935|gb|AAN13205.1|
putative topoisomerase [Arabidopsis thaliana]
Length = 702
Score = 269 bits (688), Expect = 1e-71
Identities = 133/171 (77%), Positives = 153/171 (88%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
ALYADVI P QIRDGF L+ S DDLAVDILDAV++LALF+ARA+VD+ILPP FL R++K
Sbjct: 184 ALYADVILPDQIRDGFIRLLRSVDDLAVDILDAVNVLALFIARAIVDEILPPVFLVRSKK 243
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
LPES KG QVI TAEKSYLSAPHHAELVE++WGG+TH TVEE KKKI+++LKEYV++GD
Sbjct: 244 ILPESCKGFQVIVTAEKSYLSAPHHAELVEKKWGGSTHTTVEETKKKISEILKEYVENGD 303
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
T EACRCIRELGVSFFHHEVVKRALVLAM++ +AE L+LKLLKE AEEGLI
Sbjct: 304 TYEACRCIRELGVSFFHHEVVKRALVLAMDSPTAESLVLKLLKETAEEGLI 354
Score = 154 bits (390), Expect = 4e-37
Identities = 86/171 (50%), Positives = 116/171 (67%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
AL+ ++ S +GF ML+ESA+D A+DI+DA + LALFLARAV+DD+L P L
Sbjct: 482 ALHMELFSTEDFINGFIMLLESAEDTALDIMDASNELALFLARAVIDDVLAPLNLEDIST 541
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
LP S G + +++A +S +SA H E + R WGG T VE+ K KI+ LL+EY G
Sbjct: 542 KLPPKSTGTETVRSA-RSLISARHAGERLLRSWGGGTGWIVEDAKDKISKLLEEYETGGV 600
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
T EAC+CIR+LG+ FF+HEVVK+ALV+AME ++ LL LL+E EGLI
Sbjct: 601 TSEACQCIRDLGMPFFNHEVVKKALVMAMEKQNDR--LLNLLEECFGEGLI 649
Score = 47.8 bits (112), Expect = 8e-05
Identities = 38/160 (23%), Positives = 71/160 (43%)
Frame = +2
Query: 17 VISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARKALPES 196
+IS +Q+ GFF + ES DDLA+DI A + + +A+ L +F + + +S
Sbjct: 353 LISSSQMVKGFFRVAESLDDLALDIPSAKKLFDSIVPKAISGGWLDDSFKITSDQDGEKS 412
Query: 197 SKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGDTLEAC 376
S+ ++ Q KK ++++EY S D E
Sbjct: 413 SQDGKLRQ------------------------------YKKDTVNIIQEYFLSDDIPELI 442
Query: 377 RCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEA 496
R +++LG ++ +KR + LA++ ++ E + +L A
Sbjct: 443 RSLQDLGAPEYNPVFLKRLITLALDRKNREKEMASVLLSA 482
Score = 40.8 bits (94), Expect = 0.009
Identities = 20/65 (30%), Positives = 33/65 (50%)
Frame = +2
Query: 302 VEEVKKKIADLLKEYVDSGDTLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLK 481
+ + KK + ++ EY +GD A +RELG S +H KR + +AM+ E +
Sbjct: 120 LNDYKKSVVSIIDEYFSTGDVKVAASDLRELGSSEYHPYFTKRLVSMAMDRHDKEKEMAS 179
Query: 482 LLKEA 496
+L A
Sbjct: 180 VLLSA 184
>gb|AAM63106.1| topoisomerase-like protein [Arabidopsis thaliana]
Length = 702
Score = 269 bits (688), Expect = 1e-71
Identities = 133/171 (77%), Positives = 153/171 (88%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
ALYADVI P QIRDGF L+ S DDLAVDILDAV++LALF+ARA+VD+ILPP FL R++K
Sbjct: 184 ALYADVILPDQIRDGFIRLLRSVDDLAVDILDAVNVLALFIARAIVDEILPPVFLVRSKK 243
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
LPES KG QVI TAEKSYLSAPHHAELVE++WGG+TH TVEE KKKI+++LKEYV++GD
Sbjct: 244 ILPESCKGFQVIVTAEKSYLSAPHHAELVEKKWGGSTHTTVEETKKKISEILKEYVENGD 303
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
T EACRCIRELGVSFFHHEVVKRALVLAM++ +AE L+LKLLKE AEEGLI
Sbjct: 304 TYEACRCIRELGVSFFHHEVVKRALVLAMDSPTAESLVLKLLKETAEEGLI 354
Score = 154 bits (390), Expect = 4e-37
Identities = 86/171 (50%), Positives = 116/171 (67%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
AL+ ++ S +GF ML+ESA+D A+DI+DA + LALFLARAV+DD+L P L
Sbjct: 482 ALHMELFSTEDFINGFIMLLESAEDTALDIMDASNELALFLARAVIDDVLAPLNLEDIST 541
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
LP S G + +++A +S +SA H E + R WGG T VE+ K KI+ LL+EY G
Sbjct: 542 KLPPKSTGTETVRSA-RSLISARHAGERLLRSWGGGTGWIVEDAKDKISKLLEEYETGGV 600
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
T EAC+CIR+LG+ FF+HEVVK+ALV+AME ++ LL LL+E EGLI
Sbjct: 601 TSEACQCIRDLGMPFFNHEVVKKALVMAMEKQNDR--LLNLLEECFGEGLI 649
Score = 47.4 bits (111), Expect = 1e-04
Identities = 38/160 (23%), Positives = 70/160 (43%)
Frame = +2
Query: 17 VISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARKALPES 196
+IS +Q+ GFF + ES DDLA+DI A + + +A+ L +F + + +S
Sbjct: 353 LISSSQMVKGFFRVAESLDDLALDIPSAKKLFDSIVPKAISGGWLDDSFKITSDQDGEKS 412
Query: 197 SKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGDTLEAC 376
S+ ++ Q KK ++++EY S D E
Sbjct: 413 SQDGKLRQ------------------------------YKKDTVNIIQEYFLSDDIPELI 442
Query: 377 RCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEA 496
R +++LG ++ +KR + LA++ + E + +L A
Sbjct: 443 RSLQDLGAPEYNPVFLKRLITLALDRKXREKEMASVLLSA 482
Score = 40.8 bits (94), Expect = 0.009
Identities = 20/65 (30%), Positives = 33/65 (50%)
Frame = +2
Query: 302 VEEVKKKIADLLKEYVDSGDTLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLK 481
+ + KK + ++ EY +GD A +RELG S +H KR + +AM+ E +
Sbjct: 120 LNDYKKSVVSIIDEYFSTGDVKVAASDLRELGSSEYHPYFTKRLVSMAMDRHDKEKEMAS 179
Query: 482 LLKEA 496
+L A
Sbjct: 180 VLLSA 184
>pir||T06664 hypothetical protein F6I7.10 - Arabidopsis thaliana
gi|4678259|emb|CAB41120.1| putative protein [Arabidopsis
thaliana] gi|7269331|emb|CAB79390.1| putative protein
[Arabidopsis thaliana]
Length = 942
Score = 264 bits (674), Expect = 5e-70
Identities = 134/171 (78%), Positives = 149/171 (86%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
ALYADVI+P QIRDGF +L+ESADD VDI DAV++LALFLARAVVDDILPPAFL RA K
Sbjct: 178 ALYADVINPNQIRDGFVLLLESADDFVVDIPDAVNVLALFLARAVVDDILPPAFLPRAAK 237
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
ALP +SKG QV+QTAEKSYLSA HHAELVERRWGG T TVEEVKKKIAD+L EYV++G+
Sbjct: 238 ALPITSKGYQVVQTAEKSYLSAAHHAELVERRWGGQTRTTVEEVKKKIADILNEYVETGE 297
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
T EACRC+RELGVSFFHHEVVKRALV A+EN +AE +LKLL EAA E LI
Sbjct: 298 TYEACRCVRELGVSFFHHEVVKRALVTALENHAAEAPVLKLLNEAASENLI 348
Score = 150 bits (379), Expect = 8e-36
Identities = 81/171 (47%), Positives = 116/171 (67%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
+L+ ++ + + DGF ML+ESA+D A+DILDA + LALFLARAV+DD+L P L
Sbjct: 476 SLHIEMFTTEDVADGFVMLLESAEDTALDILDASNELALFLARAVIDDVLAPFNLEEISS 535
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
L +S G + ++ A +S + A H E + R WGG + VE+ K KI++LL+EY SG
Sbjct: 536 KLRPNSSGTETVKMA-RSLIFARHAGERLLRCWGGGSGWAVEDAKDKISNLLEEYESSGL 594
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
EAC+CI ELG+ FF+HEVVK+ALV+ ME + + ++L LL+E+ EGLI
Sbjct: 595 VSEACKCIHELGMPFFNHEVVKKALVMGMEKKK-DKMMLDLLQESFSEGLI 644
Score = 41.6 bits (96), Expect = 0.005
Identities = 28/94 (29%), Positives = 44/94 (46%)
Frame = +2
Query: 215 IQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGDTLEACRCIREL 394
I + +Y S ELV G T +++ KK A ++ EY +GD A + EL
Sbjct: 89 IDPNDPNYDSGEEPFELV----GATLSDPLDDYKKAAASIINEYFSTGDVDVAAADLIEL 144
Query: 395 GVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEA 496
G S +H +KR + +AM+ E + +L A
Sbjct: 145 GSSEYHPYFIKRLVSVAMDRHDKEKEMASVLLSA 178
>ref|NP_567708.1| topoisomerase, putative; protein id: At4g24800.1, supported by
cDNA: gi_17063162 [Arabidopsis thaliana]
gi|17063163|gb|AAL32978.1| AT4g24800/F6I7_10
[Arabidopsis thaliana] gi|21700927|gb|AAM70587.1|
AT4g24800/F6I7_10 [Arabidopsis thaliana]
Length = 702
Score = 264 bits (674), Expect = 5e-70
Identities = 134/171 (78%), Positives = 149/171 (86%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
ALYADVI+P QIRDGF +L+ESADD VDI DAV++LALFLARAVVDDILPPAFL RA K
Sbjct: 178 ALYADVINPNQIRDGFVLLLESADDFVVDIPDAVNVLALFLARAVVDDILPPAFLPRAAK 237
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
ALP +SKG QV+QTAEKSYLSA HHAELVERRWGG T TVEEVKKKIAD+L EYV++G+
Sbjct: 238 ALPITSKGYQVVQTAEKSYLSAAHHAELVERRWGGQTRTTVEEVKKKIADILNEYVETGE 297
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
T EACRC+RELGVSFFHHEVVKRALV A+EN +AE +LKLL EAA E LI
Sbjct: 298 TYEACRCVRELGVSFFHHEVVKRALVTALENHAAEAPVLKLLNEAASENLI 348
Score = 150 bits (379), Expect = 8e-36
Identities = 81/171 (47%), Positives = 116/171 (67%)
Frame = +2
Query: 2 ALYADVISPAQIRDGFFMLIESADDLAVDILDAVDILALFLARAVVDDILPPAFLARARK 181
+L+ ++ + + DGF ML+ESA+D A+DILDA + LALFLARAV+DD+L P L
Sbjct: 476 SLHIEMFTTEDVADGFVMLLESAEDTALDILDASNELALFLARAVIDDVLAPFNLEEISS 535
Query: 182 ALPESSKGAQVIQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGD 361
L +S G + ++ A +S + A H E + R WGG + VE+ K KI++LL+EY SG
Sbjct: 536 KLRPNSSGTETVKMA-RSLIFARHAGERLLRCWGGGSGWAVEDAKDKISNLLEEYESSGL 594
Query: 362 TLEACRCIRELGVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEAAEEGLI 514
EAC+CI ELG+ FF+HEVVK+ALV+ ME + + ++L LL+E+ EGLI
Sbjct: 595 VSEACKCIHELGMPFFNHEVVKKALVMGMEKKK-DKMMLDLLQESFSEGLI 644
Score = 41.6 bits (96), Expect = 0.005
Identities = 28/94 (29%), Positives = 44/94 (46%)
Frame = +2
Query: 215 IQTAEKSYLSAPHHAELVERRWGGTTHITVEEVKKKIADLLKEYVDSGDTLEACRCIREL 394
I + +Y S ELV G T +++ KK A ++ EY +GD A + EL
Sbjct: 89 IDPNDPNYDSGEEPFELV----GATLSDPLDDYKKAAASIINEYFSTGDVDVAAADLIEL 144
Query: 395 GVSFFHHEVVKRALVLAMENRSAEPLLLKLLKEA 496
G S +H +KR + +AM+ E + +L A
Sbjct: 145 GSSEYHPYFIKRLVSVAMDRHDKEKEMASVLLSA 178
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 459,695,016
Number of Sequences: 1393205
Number of extensions: 10353000
Number of successful extensions: 37388
Number of sequences better than 10.0: 95
Number of HSP's better than 10.0 without gapping: 35498
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 37291
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 16232377112
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)