Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC005260A_C01 KMC005260A_c01
(541 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|NP_188906.1| hypothetical protein; protein id: At3g22670.1 [...   113  2e-24
gb|AAK71569.2|AC087852_29 putative reverse transcriptase [Oryza ...   106  2e-22
gb|AAF26800.1|AC016829_24 hypothetical protein [Arabidopsis thal...    79  3e-14
ref|NP_566222.1| expressed protein; protein id: At3g04130.1, sup...    79  3e-14
ref|NP_176522.1| unknown protein; protein id: At1g63330.1 [Arabi...    59  9e-09
>ref|NP_188906.1| hypothetical protein; protein id: At3g22670.1 [Arabidopsis
thaliana] gi|9279685|dbj|BAB01242.1|
gb|AAF26800.1~gene_id:MWI23.4~similar to unknown protein
[Arabidopsis thaliana]
Length = 562
Score = 113 bits (282), Expect = 2e-24
Identities = 66/153 (43%), Positives = 90/153 (58%), Gaps = 5/153 (3%)
Frame = -3
Query: 539 EDMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEE---RSCTPDLETYHPLLKMC 369
EDM QG+ RDV+ YNTMIS A HSR+E ALRLLK ME+ SC+P++ETY PLLKMC
Sbjct: 402 EDMTNQGVRRDVLVYNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNVETYAPLLKMC 461
Query: 368 CKKKRMKVLKFLVGAHAQE*FEP*FGNFLTFGTWPCV--KVENLIMLAHSFEELISRGLT 195
C KK+MK+L L+ + ++ C+ KVE + FEE + +G+
Sbjct: 462 CHKKKMKLLGILLHHMVKNDVSIDVSTYILLIRGLCMSGKVEEACLF---FEEAVRKGMV 518
Query: 194 PRPGALKPLLKDLEAKSMLKEKEHIEKLMTPPT 96
PR K L+ +LE K+M + K I+ L+ T
Sbjct: 519 PRDSTCKMLVDELEKKNMAEAKLKIQSLVQSKT 551
Score = 36.2 bits (82), Expect = 0.26
Identities = 19/64 (29%), Positives = 33/64 (50%)
Frame = -3
Query: 539 EDMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKK 360
E+M + G +VVTY ++ + + AL + ++M+E C PD + Y L+ + K
Sbjct: 332 EEMRENGCNPNVVTYTIVMHSLGKSKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKT 391
Query: 359 KRMK 348
R K
Sbjct: 392 GRFK 395
Score = 35.8 bits (81), Expect = 0.34
Identities = 16/53 (30%), Positives = 26/53 (48%)
Frame = -3
Query: 509 DVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKKKRM 351
DVVTY + + C +L+EM E C P++ TY ++ K K++
Sbjct: 307 DVVTYTSFVEAYCKEGDFRRVNEMLEEMRENGCNPNVVTYTIVMHSLGKSKQV 359
>gb|AAK71569.2|AC087852_29 putative reverse transcriptase [Oryza sativa (japonica
cultivar-group)]
Length = 1833
Score = 106 bits (265), Expect = 2e-22
Identities = 55/148 (37%), Positives = 88/148 (59%)
Frame = -3
Query: 539 EDMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKK 360
E+M GI +V T+NT+IS AC HS+ E AL+LL +MEE+SC PD++TY PLLK+CCK+
Sbjct: 1678 EEMRTTGIAPNVTTFNTLISAACDHSQAENALKLLVKMEEQSCNPDIKTYTPLLKLCCKR 1737
Query: 359 KRMKVLKFLVGAHAQE*FEP*FGNFLTFGTWPCVKVENLIMLAHSFEELISRGLTPRPGA 180
+ +K+L FLV ++ P F + +W C + + EE++S+G P+
Sbjct: 1738 QWVKILLFLVCHMFRKDISPDFSTYTLLVSWLC-RNGKVAQSCLFLEEMVSKGFAPKQET 1796
Query: 179 LKPLLKDLEAKSMLKEKEHIEKLMTPPT 96
+++ LE +++ + I+ L T T
Sbjct: 1797 FDLVMEKLEKRNLQSVYKKIQVLRTQVT 1824
Score = 42.4 bits (98), Expect = 0.004
Identities = 21/64 (32%), Positives = 32/64 (49%)
Frame = -3
Query: 539 EDMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKK 360
E+M + G VVTY +++ C +T LL EM +R C P++ TY L+ K
Sbjct: 1573 EEMKQHGFSPSVVTYTSLVEAYCMEKDFQTVYALLDEMRKRRCPPNVVTYTILMHALGKA 1632
Query: 359 KRMK 348
R +
Sbjct: 1633 GRTR 1636
>gb|AAF26800.1|AC016829_24 hypothetical protein [Arabidopsis thaliana]
Length = 572
Score = 79.3 bits (194), Expect = 3e-14
Identities = 52/148 (35%), Positives = 79/148 (53%), Gaps = 5/148 (3%)
Frame = -3
Query: 536 DMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERS-CTPDLETYHPLLKMCCKK 360
+MP+ G+ + TYN+MI+ C H E+ A+ LLKEME + C PD+ TY PLL+ C K+
Sbjct: 419 EMPELGVSINTSTYNSMIAMYCHHDEEDKAIELLKEMESSNLCNPDVHTYQPLLRSCFKR 478
Query: 359 KRM----KVLKFLVGAHAQE*FEP*FGNFLTFGTWPCVKVENLIMLAHSFEELISRGLTP 192
+ K+LK +V H E + TF + FEE+IS+ +TP
Sbjct: 479 GDVVEVGKLLKEMVTKHHLSLDE----STYTFLIQRLCRANMCEWAYCLFEEMISQDITP 534
Query: 191 RPGALKPLLKDLEAKSMLKEKEHIEKLM 108
R LL++++ K+M + E IE +M
Sbjct: 535 RHRTCLLLLEEVKKKNMHESAERIEHIM 562
Score = 38.1 bits (87), Expect = 0.069
Identities = 34/171 (19%), Positives = 68/171 (38%)
Frame = -3
Query: 536 DMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKKK 357
+M G + +TY T++S+ A E ALR+ M+ C PD Y+ L+ +
Sbjct: 348 EMEANGSPPNSITYTTIMSSLNAQKEFEEALRVATRMKRSGCKPDSLFYNCLIHTLARAG 407
Query: 356 RMKVLKFLVGAHAQE*FEP*FGNFLTFGTWPCVKVENLIMLAHSFEELISRGLTPRPGAL 177
R++ + + E G + T+ + + M H EE
Sbjct: 408 RLEEAERVFRVEMPE-----LGVSINTSTYNSM----IAMYCHHDEE----------DKA 448
Query: 176 KPLLKDLEAKSMLKEKEHIEKLMTPPTYKMYFIVSISYLYLSLISSHAMKI 24
LLK++E+ ++ H + + +K +V + L +++ H + +
Sbjct: 449 IELLKEMESSNLCNPDVHTYQPLLRSCFKRGDVVEVGKLLKEMVTKHHLSL 499
>ref|NP_566222.1| expressed protein; protein id: At3g04130.1, supported by cDNA:
gi_15292876 [Arabidopsis thaliana]
Length = 508
Score = 79.3 bits (194), Expect = 3e-14
Identities = 52/148 (35%), Positives = 79/148 (53%), Gaps = 5/148 (3%)
Frame = -3
Query: 536 DMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERS-CTPDLETYHPLLKMCCKK 360
+MP+ G+ + TYN+MI+ C H E+ A+ LLKEME + C PD+ TY PLL+ C K+
Sbjct: 355 EMPELGVSINTSTYNSMIAMYCHHDEEDKAIELLKEMESSNLCNPDVHTYQPLLRSCFKR 414
Query: 359 KRM----KVLKFLVGAHAQE*FEP*FGNFLTFGTWPCVKVENLIMLAHSFEELISRGLTP 192
+ K+LK +V H E + TF + FEE+IS+ +TP
Sbjct: 415 GDVVEVGKLLKEMVTKHHLSLDE----STYTFLIQRLCRANMCEWAYCLFEEMISQDITP 470
Query: 191 RPGALKPLLKDLEAKSMLKEKEHIEKLM 108
R LL++++ K+M + E IE +M
Sbjct: 471 RHRTCLLLLEEVKKKNMHESAERIEHIM 498
Score = 38.1 bits (87), Expect = 0.069
Identities = 34/171 (19%), Positives = 68/171 (38%)
Frame = -3
Query: 536 DMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKKK 357
+M G + +TY T++S+ A E ALR+ M+ C PD Y+ L+ +
Sbjct: 284 EMEANGSPPNSITYTTIMSSLNAQKEFEEALRVATRMKRSGCKPDSLFYNCLIHTLARAG 343
Query: 356 RMKVLKFLVGAHAQE*FEP*FGNFLTFGTWPCVKVENLIMLAHSFEELISRGLTPRPGAL 177
R++ + + E G + T+ + + M H EE
Sbjct: 344 RLEEAERVFRVEMPE-----LGVSINTSTYNSM----IAMYCHHDEE----------DKA 384
Query: 176 KPLLKDLEAKSMLKEKEHIEKLMTPPTYKMYFIVSISYLYLSLISSHAMKI 24
LLK++E+ ++ H + + +K +V + L +++ H + +
Sbjct: 385 IELLKEMESSNLCNPDVHTYQPLLRSCFKRGDVVEVGKLLKEMVTKHHLSL 435
>ref|NP_176522.1| unknown protein; protein id: At1g63330.1 [Arabidopsis thaliana]
gi|25404421|pir||C96659 unknown protein, 19199-17308
[imported] - Arabidopsis thaliana
gi|12324362|gb|AAG52154.1|AC022355_15 unknown protein;
19199-17308 [Arabidopsis thaliana]
Length = 558
Score = 59.3 bits (142), Expect(2) = 9e-09
Identities = 26/64 (40%), Positives = 42/64 (65%)
Frame = -3
Query: 539 EDMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKK 360
+DM K+ I D+ TYN++I+ C H R + A ++ + M + C PDL+TY+ L+K CK
Sbjct: 278 DDMIKRSIDPDIFTYNSLINGFCMHDRLDKAKQMFEFMVSKDCFPDLDTYNTLIKGFCKS 337
Query: 359 KRMK 348
KR++
Sbjct: 338 KRVE 341
Score = 47.0 bits (110), Expect = 1e-04
Identities = 31/111 (27%), Positives = 49/111 (43%), Gaps = 2/111 (1%)
Frame = -3
Query: 518 IVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKKKRMKVLK 339
I DVV +NT+I + C + + AL L KEME + P++ TY L+ C R
Sbjct: 180 IEADVVIFNTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSYGRWSDAS 239
Query: 338 FLVGAHAQE*FEP*FGNFLTFGTW--PCVKVENLIMLAHSFEELISRGLTP 192
L+ ++ P N +TF VK + +++I R + P
Sbjct: 240 QLLSDMIEKKINP---NLVTFNALIDAFVKEGKFVEAEKLHDDMIKRSIDP 287
Score = 43.9 bits (102), Expect = 0.001
Identities = 22/50 (44%), Positives = 32/50 (64%)
Frame = -3
Query: 524 QGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLK 375
+G+ +VVTYNTMIS C+ + A LLK+M+E PD TY+ L++
Sbjct: 458 KGVKPNVVTYNTMISGLCSKRLLQEAYALLKKMKEDGPLPDSGTYNTLIR 507
Score = 43.9 bits (102), Expect(2) = 7e-05
Identities = 21/60 (35%), Positives = 38/60 (63%)
Frame = -3
Query: 539 EDMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKK 360
++M +GI +VVTY+++IS C++ R A +LL +M E+ P+L T++ L+ K+
Sbjct: 208 KEMETKGIRPNVVTYSSLISCLCSYGRWSDASQLLSDMIEKKINPNLVTFNALIDAFVKE 267
Score = 40.8 bits (94), Expect = 0.011
Identities = 18/60 (30%), Positives = 32/60 (53%)
Frame = -3
Query: 539 EDMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKK 360
+ M + G D +T+ T+I H++ A+ L+ M +R C P+L TY ++ CK+
Sbjct: 103 DQMVEMGYRPDTITFTTLIHGLFLHNKASEAVALVDRMVQRGCQPNLVTYGVVVNGLCKR 162
Score = 37.0 bits (84), Expect = 0.15
Identities = 19/63 (30%), Positives = 31/63 (49%)
Frame = -3
Query: 536 DMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKKK 357
+M +G+V D VTY T+I + A ++ K+M PD+ TY LL C
Sbjct: 349 EMSHRGLVGDTVTYTTLIQGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNG 408
Query: 356 RMK 348
+++
Sbjct: 409 KLE 411
Score = 35.4 bits (80), Expect = 0.44
Identities = 19/55 (34%), Positives = 25/55 (44%)
Frame = -3
Query: 539 EDMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLK 375
E M + D+ TYNT+I C R E L +EM R D TY L++
Sbjct: 313 EFMVSKDCFPDLDTYNTLIKGFCKSKRVEDGTELFREMSHRGLVGDTVTYTTLIQ 367
Score = 35.0 bits (79), Expect = 0.58
Identities = 20/66 (30%), Positives = 34/66 (51%)
Frame = -3
Query: 536 DMPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKKK 357
DM ++ I ++VT+N +I + A +L +M +RS PD+ TY+ L+ C
Sbjct: 244 DMIEKKINPNLVTFNALIDAFVKEGKFVEAEKLHDDMIKRSIDPDIFTYNSLINGFCMHD 303
Query: 356 RMKVLK 339
R+ K
Sbjct: 304 RLDKAK 309
Score = 32.0 bits (71), Expect = 4.9
Identities = 15/62 (24%), Positives = 28/62 (44%)
Frame = -3
Query: 533 MPKQGIVRDVVTYNTMISTACAHSREETALRLLKEMEERSCTPDLETYHPLLKMCCKKKR 354
M K I D+ Y TMI C + + L + + P++ TY+ ++ C K+
Sbjct: 420 MQKSEIKLDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTMISGLCSKRL 479
Query: 353 MK 348
++
Sbjct: 480 LQ 481
Score = 23.5 bits (49), Expect(2) = 7e-05
Identities = 8/23 (34%), Positives = 16/23 (68%)
Frame = -1
Query: 334 LLEHMLKNDLSPDLGTFSLLVHG 266
L + M+K + PD+ T++ L++G
Sbjct: 276 LHDDMIKRSIDPDIFTYNSLING 298
Score = 21.6 bits (44), Expect(2) = 9e-09
Identities = 7/23 (30%), Positives = 16/23 (69%)
Frame = -1
Query: 334 LLEHMLKNDLSPDLGTFSLLVHG 266
+ + M+ + + PD+ T+S+L+ G
Sbjct: 381 VFKQMVSDGVPPDIMTYSILLDG 403
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 461,269,672
Number of Sequences: 1393205
Number of extensions: 9364616
Number of successful extensions: 26545
Number of sequences better than 10.0: 485
Number of HSP's better than 10.0 without gapping: 23082
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26225
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18462123008
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
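
Note on the statistics above: the reported figures can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul conversion from raw score to bit score, S' = (lambda * S - ln K) / ln 2, and assumes that the effective database length is the database length minus one effective HSP length per sequence, which is consistent with the numbers printed in this report. The function names are hypothetical and chosen only for illustration.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Assumed correction: subtract one effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


# Parameters copied from the report footer.
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410
UNGAPPED_LAMBDA, UNGAPPED_K = 0.318, 0.135

# Top hit: raw score 282, reported as 113 bits (gapped parameters).
top_hit_bits = bit_score(282, GAPPED_LAMBDA, GAPPED_K)

# S1 cutoff: raw score 41, reported as 21.7 bits (ungapped parameters).
s1_bits = bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K)

# Database: 448,689,247 letters, 1,393,205 sequences, effective HSP length 115.
eff_db = effective_db_length(448_689_247, 1_393_205, 115)

# Effective query length implied by the reported effective search space.
eff_query = 18_462_123_008 // eff_db

print(round(top_hit_bits), round(s1_bits, 1), eff_db, eff_query)
```

Under these assumptions the top-hit bit score, the S1 cutoff, and the effective database length all agree with the values in the report, and the effective search space divides evenly into an effective query length of 64 residues.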