Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000740A_C01 KMC000740A_c01
(657 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P35100|CLPA_PEA ATP-dependent clp protease ATP-binding subuni...   319  2e-86
sp|P31542|CLAB_LYCES ATP-dependent clp protease ATP-binding subu...   272  3e-72
ref|NP_568746.1| ATP-dependent Clp protease ATP-binding subunit ...   271  4e-72
sp|P31541|CLAA_LYCES ATP-dependent clp protease ATP-binding subu...   266  2e-70
pir||T52292 endopeptidase Clp (EC 3.4.21.92) ATP-binding chain C...   259  2e-68
>sp|P35100|CLPA_PEA ATP-dependent clp protease ATP-binding subunit clpA homolog,
chloroplast precursor gi|419773|pir||S31164
endopeptidase Clp (EC 3.4.21.-) ATP-binding chain,
chloroplast [similarity] - garden pea
gi|169128|gb|AAA33680.1| nuclear encoded precursor to
chloroplast protein
Length = 922
Score = 319 bits (818), Expect = 2e-86
Identities = 164/192 (85%), Positives = 176/192 (91%)
Frame = +3
Query: 81 MARVLAQSINVPDLVARQRHGNHKGSGKSKRSAKMMCALQTSGFRLAGFSGLRTYNPLDT 260
MARVLAQS++VP LVA + HKGSGKSKRS K MCAL+TSG R++GFSGLRT+N L+T
Sbjct: 1 MARVLAQSLSVPGLVAGHKDSQHKGSGKSKRSVKTMCALRTSGLRMSGFSGLRTFNHLNT 60
Query: 261 MLRPGLDFHSKVSIATSSRRGKATRGVPKAMFERFTEKAIKVIMLAQEEARRLGHNFVGT 440
M+RPGLDFHSKVS A SSRR +A R +P+AMFERFTEKAIKVIMLAQEEARRLGHNFVGT
Sbjct: 61 MMRPGLDFHSKVSKAVSSRRARAKRFIPRAMFERFTEKAIKVIMLAQEEARRLGHNFVGT 120
Query: 441 EQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLEL 620
EQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLEL
Sbjct: 121 EQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLEL 180
Query: 621 SLEEARQLGHNY 656
S EEARQLGHNY
Sbjct: 181 SQEEARQLGHNY 192
Score = 73.2 bits (178), Expect = 3e-12
Identities = 32/74 (43%), Positives = 55/74 (74%)
Frame = +3
Query: 363 FTEKAIKVIMLAQEEARRLGHNFVGTEQILLGLIGEGTGIAAKVLKSMGINLKDARVEVE 542
FT +A +V+ L+QEEAR+LGHN++G+E +LLGL+ EG G+AA+VL+++G + + R +V
Sbjct: 170 FTPRAKRVLELSQEEARQLGHNYIGSEHLLLGLLREGEGVAARVLENLGADPTNIRTQVI 229
Query: 543 KIIGRGSGFVAVEI 584
+++G + V +
Sbjct: 230 RMVGESADSVTATV 243
>sp|P31542|CLAB_LYCES ATP-dependent clp protease ATP-binding subunit clpA homolog CD4B,
chloroplast precursor gi|100190|pir||B35905
endopeptidase Clp (EC 3.4.21.-) ATP-binding chain cd4B,
chloroplast [similarity] - tomato
gi|170435|gb|AAA34161.1| ATP-dependent protease (CD4B)
Length = 923
Score = 272 bits (695), Expect = 3e-72
Identities = 144/192 (75%), Positives = 157/192 (81%)
Frame = +3
Query: 81 MARVLAQSINVPDLVARQRHGNHKGSGKSKRSAKMMCALQTSGFRLAGFSGLRTYNPLDT 260
MAR L QS ++P VA +R GSGK+KR+ M+C Q+S L F+GLR N +DT
Sbjct: 1 MARALVQSTSIPSSVAGERTTKFNGSGKTKRAVTMLCNAQSSSLTLRDFTGLRGCNAIDT 60
Query: 261 MLRPGLDFHSKVSIATSSRRGKATRGVPKAMFERFTEKAIKVIMLAQEEARRLGHNFVGT 440
++R G SKV+ AT RR + R VPKAMFERFTEKAIKVIMLAQEEARRLGHNFVGT
Sbjct: 61 LVRSGETLQSKVAAATYVRRPRGCRFVPKAMFERFTEKAIKVIMLAQEEARRLGHNFVGT 120
Query: 441 EQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLEL 620
EQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLEL
Sbjct: 121 EQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLEL 180
Query: 621 SLEEARQLGHNY 656
SLEEARQLGHNY
Sbjct: 181 SLEEARQLGHNY 192
Score = 70.9 bits (172), Expect = 1e-11
Identities = 31/74 (41%), Positives = 54/74 (72%)
Frame = +3
Query: 363 FTEKAIKVIMLAQEEARRLGHNFVGTEQILLGLIGEGTGIAAKVLKSMGINLKDARVEVE 542
FT +A +V+ L+ EEAR+LGHN++G+E +LLGL+ EG G+AA+VL+++G + + R +V
Sbjct: 170 FTPRAKRVLELSLEEARQLGHNYIGSEHLLLGLLREGEGVAARVLENLGADPSNIRTQVI 229
Query: 543 KIIGRGSGFVAVEI 584
+++G + V +
Sbjct: 230 RMVGESNEAVGASV 243
>ref|NP_568746.1| ATP-dependent Clp protease ATP-binding subunit (ClpC1); protein id:
At5g50920.1, supported by cDNA: gi_20856955, supported
by cDNA: gi_2921157 [Arabidopsis thaliana]
gi|9758239|dbj|BAB08738.1| ATP-dependent Clp protease,
ATP-binding subunit [Arabidopsis thaliana]
gi|20856956|gb|AAM26692.1| AT5g50920/K3K7_7 [Arabidopsis
thaliana]
Length = 929
Score = 271 bits (694), Expect = 4e-72
Identities = 149/191 (78%), Positives = 160/191 (83%), Gaps = 1/191 (0%)
Frame = +3
Query: 87 RVLAQSINVPDLVARQRHGNHKGSGKSKRSAKMMCA-LQTSGFRLAGFSGLRTYNPLDTM 263
RVLAQS P L QR+ +GSG+S+RS KMMC+ LQ SG R+ GF GLR N LDT+
Sbjct: 6 RVLAQS-TPPSLACYQRNVPSRGSGRSRRSVKMMCSQLQVSGLRMQGFMGLRGNNALDTL 64
Query: 264 LRPGLDFHSKVSIATSSRRGKATRGVPKAMFERFTEKAIKVIMLAQEEARRLGHNFVGTE 443
+ DFHSKV A + +GKA+R KAMFERFTEKAIKVIMLAQEEARRLGHNFVGTE
Sbjct: 65 GKSRQDFHSKVRQAMNVPKGKASRFTVKAMFERFTEKAIKVIMLAQEEARRLGHNFVGTE 124
Query: 444 QILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLELS 623
QILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLELS
Sbjct: 125 QILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLELS 184
Query: 624 LEEARQLGHNY 656
LEEARQLGHNY
Sbjct: 185 LEEARQLGHNY 195
Score = 70.1 bits (170), Expect = 2e-11
Identities = 31/71 (43%), Positives = 53/71 (73%)
Frame = +3
Query: 363 FTEKAIKVIMLAQEEARRLGHNFVGTEQILLGLIGEGTGIAAKVLKSMGINLKDARVEVE 542
FT +A +V+ L+ EEAR+LGHN++G+E +LLGL+ EG G+AA+VL+++G + + R +V
Sbjct: 173 FTPRAKRVLELSLEEARQLGHNYIGSEHLLLGLLREGEGVAARVLENLGADPSNIRTQVI 232
Query: 543 KIIGRGSGFVA 575
+++G + A
Sbjct: 233 RMVGENNEVTA 243
>sp|P31541|CLAA_LYCES ATP-dependent clp protease ATP-binding subunit clpA homolog CD4A,
chloroplast precursor gi|100189|pir||A35905
endopeptidase Clp (EC 3.4.21.-) ATP-binding chain cd4A,
chloroplast [similarity] - tomato
gi|170433|gb|AAA34160.1| ATP-dependent protease (CD4A)
Length = 926
Score = 266 bits (680), Expect = 2e-70
Identities = 143/194 (73%), Positives = 158/194 (80%), Gaps = 1/194 (0%)
Frame = +3
Query: 78 IMARVLAQSINVPDLVARQRHGNHKGSGKSKRSAKMMCALQTSGFRLAGFSGLRTYNPLD 257
+MAR L QS N+ VA +R G GS K +R+ +M+C ++ RL F+GLR N LD
Sbjct: 1 MMARALVQSTNILPSVAGERAGQFNGSRKDQRTVRMLCNVKCCSSRLNNFAGLRGCNALD 60
Query: 258 TML-RPGLDFHSKVSIATSSRRGKATRGVPKAMFERFTEKAIKVIMLAQEEARRLGHNFV 434
T+L + G HSKV+ AT RR + R VPKAMFERFTEKAIKVIMLAQEEARRLGHNFV
Sbjct: 61 TLLVKSGETLHSKVAAATFVRRPRGCRFVPKAMFERFTEKAIKVIMLAQEEARRLGHNFV 120
Query: 435 GTEQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVL 614
GTEQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGF+AVEIPFTPRAKRVL
Sbjct: 121 GTEQILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFIAVEIPFTPRAKRVL 180
Query: 615 ELSLEEARQLGHNY 656
ELSLEEARQLGHNY
Sbjct: 181 ELSLEEARQLGHNY 194
Score = 71.6 bits (174), Expect = 8e-12
Identities = 32/74 (43%), Positives = 54/74 (72%)
Frame = +3
Query: 363 FTEKAIKVIMLAQEEARRLGHNFVGTEQILLGLIGEGTGIAAKVLKSMGINLKDARVEVE 542
FT +A +V+ L+ EEAR+LGHN++G+E +LLGL+ EG G+AA+VL+++G + + R +V
Sbjct: 172 FTPRAKRVLELSLEEARQLGHNYIGSEHLLLGLLREGEGVAARVLENLGADPTNIRTQVI 231
Query: 543 KIIGRGSGFVAVEI 584
+++G S V +
Sbjct: 232 RMVGESSEAVGASV 245
>pir||T52292 endopeptidase Clp (EC 3.4.21.92) ATP-binding chain C, chloroplast
[imported] - Arabidopsis thaliana
gi|2921158|gb|AAC04687.1| ClpC [Arabidopsis thaliana]
Length = 928
Score = 259 bits (663), Expect = 2e-68
Identities = 142/190 (74%), Positives = 153/190 (79%)
Frame = +3
Query: 87 RVLAQSINVPDLVARQRHGNHKGSGKSKRSAKMMCALQTSGFRLAGFSGLRTYNPLDTML 266
RVLAQS P L QR+ +GSG+S+RS KMMC + + GF GLR N LDT+
Sbjct: 6 RVLAQS-TPPSLACYQRNVPSRGSGRSRRSVKMMCIIFNVWLPMQGFMGLRGNNALDTLG 64
Query: 267 RPGLDFHSKVSIATSSRRGKATRGVPKAMFERFTEKAIKVIMLAQEEARRLGHNFVGTEQ 446
+ DFHSKV A + +GKA+R KAMFERFTEKAIKVIMLAQEEARRLGHNFVGTEQ
Sbjct: 65 KSRQDFHSKVRQAMNVPKGKASRFTVKAMFERFTEKAIKVIMLAQEEARRLGHNFVGTEQ 124
Query: 447 ILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLELSL 626
ILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLELSL
Sbjct: 125 ILLGLIGEGTGIAAKVLKSMGINLKDARVEVEKIIGRGSGFVAVEIPFTPRAKRVLELSL 184
Query: 627 EEARQLGHNY 656
E RQLGHNY
Sbjct: 185 EATRQLGHNY 194
Score = 66.2 bits (160), Expect = 4e-10
Identities = 29/71 (40%), Positives = 51/71 (70%)
Frame = +3
Query: 363 FTEKAIKVIMLAQEEARRLGHNFVGTEQILLGLIGEGTGIAAKVLKSMGINLKDARVEVE 542
FT +A +V+ L+ E R+LGHN++G+E +LLGL+ EG G+AA+VL+++G + + R +V
Sbjct: 172 FTPRAKRVLELSLEATRQLGHNYIGSEHLLLGLLREGEGVAARVLENLGADPSNIRTQVI 231
Query: 543 KIIGRGSGFVA 575
+++G + A
Sbjct: 232 RMVGENNEVTA 242
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda     K      H
   0.318    0.135    0.401

Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 584,327,882
Number of Sequences: 1393205
Number of extensions: 12927959
Number of successful extensions: 31846
Number of sequences better than 10.0: 157
Number of HSP's better than 10.0 without gapping: 30695
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31730
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 28006887348
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)