Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000228A_C01 KMC000228A_c01
(1014 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
pir||T07807 endopeptidase Clp (EC 3.4.21.-) ATP-binding chain SB... 404 e-112
gb|AAA67927.1| AtHSP101 386 e-106
ref|NP_565083.1| heat shock protein 101 (HSP101); protein id: At... 386 e-106
gb|AAL32674.1| heat shock protein 101 [Arabidopsis thaliana] 386 e-106
gb|AAC83688.2| 101 kDa heat shock protein; HSP101 [Nicotiana tab... 384 e-105
>pir||T07807 endopeptidase Clp (EC 3.4.21.-) ATP-binding chain SB100 [similarity]
- soybean gi|530207|gb|AAA66338.1| heat shock protein
Length = 911
Score = 404 bits (1039), Expect = e-112
Identities = 220/271 (81%), Positives = 236/271 (86%), Gaps = 4/271 (1%)
Frame = +1
Query: 214 MNPDKFTHKTNEALAGAHELAMSSGHAQMTPLHLASTLISDPNGIFFQAISNSSG-EESA 390
MNP+KFTHKTNEALA AHELAMSSGHAQ+TP+HLA LISDPNGIF AI+++ G EESA
Sbjct: 1 MNPEKFTHKTNEALASAHELAMSSGHAQLTPIHLAHALISDPNGIFVLAINSAGGGEESA 60
Query: 391 RAVERVLNQALKKLPSQSPPPDEIPASTTLIKAIRRAQAAQKSRGDTHLAVDQLILGILE 570
RAVERVLNQALKKLP QSPPPDE+PAST L++AIRRAQAAQKSRGDT LAVDQLILGILE
Sbjct: 61 RAVERVLNQALKKLPCQSPPPDEVPASTNLVRAIRRAQAAQKSRGDTRLAVDQLILGILE 120
Query: 571 DSQIGDLLKEAGVAAAKVKSELDKLRGKVGKKVESASGDTTFQALKTYGRDLVEQAGKLD 750
DSQIGDLLKEAGVA AKV+SE+DKLRGK GKKVESASGDT FQALKTYGRDLVEQAGKLD
Sbjct: 121 DSQIGDLLKEAGVAVAKVESEVDKLRGKEGKKVESASGDTNFQALKTYGRDLVEQAGKLD 180
Query: 751 PVIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSIFLMLGLL 930
PVIGRDEEIRRVVRILSRRTKNNPVL+GEPGVGKTAVVEGLAQRIVRGDVPS + L+
Sbjct: 181 PVIGRDEEIRRVVRILSRRTKNNPVLVGEPGVGKTAVVEGLAQRIVRGDVPSNLADVRLI 240
Query: 931 HWTWVLWLLVLSI---GENLRRGEAVLKEVE 1014
+ LV GE R +AVLKEVE
Sbjct: 241 --ALDMGALVAGAKYRGEFEERLKAVLKEVE 269
>gb|AAA67927.1| AtHSP101
Length = 911
Score = 386 bits (992), Expect = e-106
Identities = 205/270 (75%), Positives = 232/270 (85%), Gaps = 3/270 (1%)
Frame = +1
Query: 214 MNPDKFTHKTNEALAGAHELAMSSGHAQMTPLHLASTLISDPNGIFFQAISNSSGEESAR 393
MNP+KFTHKTNE +A AHELA+++GHAQ TPLHLA LISDP GIF QAIS++ GE +A+
Sbjct: 1 MNPEKFTHKTNETIATAHELAVNAGHAQFTPLHLAGALISDPTGIFPQAISSAGGENAAQ 60
Query: 394 AVERVLNQALKKLPSQSPPPDEIPASTTLIKAIRRAQAAQKSRGDTHLAVDQLILGILED 573
+ ERV+NQALKKLPSQSPPPD+IPAS++LIK IRRAQAAQKSRGDTHLAVDQLI+G+LED
Sbjct: 61 SAERVINQALKKLPSQSPPPDDIPASSSLIKVIRRAQAAQKSRGDTHLAVDQLIMGLLED 120
Query: 574 SQIGDLLKEAGVAAAKVKSELDKLRGKVGKKVESASGDTTFQALKTYGRDLVEQAGKLDP 753
SQI DLL E GVA A+VKSE++KLRGK GKKVESASGDT FQALKTYGRDLVEQAGKLDP
Sbjct: 121 SQIRDLLNEVGVATARVKSEVEKLRGKEGKKVESASGDTNFQALKTYGRDLVEQAGKLDP 180
Query: 754 VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSIFLMLGLLH 933
VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIV+GDVP+ + L+
Sbjct: 181 VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVKGDVPNSLTDVRLI- 239
Query: 934 WTWVLWLLVLSI---GENLRRGEAVLKEVE 1014
+ + LV GE R ++VLKEVE
Sbjct: 240 -SLDMGALVAGAKYRGEFEERLKSVLKEVE 268
>ref|NP_565083.1| heat shock protein 101 (HSP101); protein id: At1g74310.1, supported
by cDNA: gi_537445 [Arabidopsis thaliana]
gi|21264430|sp|P42730|H101_ARATH Heat shock protein 101
gi|25289887|pir||F96771 heat shock protein 101,
13093-16240 [imported] - Arabidopsis thaliana
gi|6715468|gb|AAF26423.1|AF218796_1 heat shock protein
101 [Arabidopsis thaliana]
gi|12324908|gb|AAG52410.1|AC020579_12 heat shock protein
101; 13093-16240 [Arabidopsis thaliana]
Length = 911
Score = 386 bits (992), Expect = e-106
Identities = 205/270 (75%), Positives = 232/270 (85%), Gaps = 3/270 (1%)
Frame = +1
Query: 214 MNPDKFTHKTNEALAGAHELAMSSGHAQMTPLHLASTLISDPNGIFFQAISNSSGEESAR 393
MNP+KFTHKTNE +A AHELA+++GHAQ TPLHLA LISDP GIF QAIS++ GE +A+
Sbjct: 1 MNPEKFTHKTNETIATAHELAVNAGHAQFTPLHLAGALISDPTGIFPQAISSAGGENAAQ 60
Query: 394 AVERVLNQALKKLPSQSPPPDEIPASTTLIKAIRRAQAAQKSRGDTHLAVDQLILGILED 573
+ ERV+NQALKKLPSQSPPPD+IPAS++LIK IRRAQAAQKSRGDTHLAVDQLI+G+LED
Sbjct: 61 SAERVINQALKKLPSQSPPPDDIPASSSLIKVIRRAQAAQKSRGDTHLAVDQLIMGLLED 120
Query: 574 SQIGDLLKEAGVAAAKVKSELDKLRGKVGKKVESASGDTTFQALKTYGRDLVEQAGKLDP 753
SQI DLL E GVA A+VKSE++KLRGK GKKVESASGDT FQALKTYGRDLVEQAGKLDP
Sbjct: 121 SQIRDLLNEVGVATARVKSEVEKLRGKEGKKVESASGDTNFQALKTYGRDLVEQAGKLDP 180
Query: 754 VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSIFLMLGLLH 933
VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIV+GDVP+ + L+
Sbjct: 181 VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVKGDVPNSLTDVRLI- 239
Query: 934 WTWVLWLLVLSI---GENLRRGEAVLKEVE 1014
+ + LV GE R ++VLKEVE
Sbjct: 240 -SLDMGALVAGAKYRGEFEERLKSVLKEVE 268
>gb|AAL32674.1| heat shock protein 101 [Arabidopsis thaliana]
Length = 460
Score = 386 bits (991), Expect = e-106
Identities = 205/270 (75%), Positives = 231/270 (84%), Gaps = 3/270 (1%)
Frame = +1
Query: 214 MNPDKFTHKTNEALAGAHELAMSSGHAQMTPLHLASTLISDPNGIFFQAISNSSGEESAR 393
MNP+KFTHKTNE +A AHELA+++GHAQ TPLHLA LISDP GIF QAIS++ GE +A+
Sbjct: 1 MNPEKFTHKTNETIATAHELAVNAGHAQFTPLHLAGALISDPTGIFPQAISSAGGENAAQ 60
Query: 394 AVERVLNQALKKLPSQSPPPDEIPASTTLIKAIRRAQAAQKSRGDTHLAVDQLILGILED 573
+ ERV+NQALKKLPSQSPPPD+IPAS++LIK IRRAQAAQKSRGDTHLAVDQLI+G+LED
Sbjct: 61 SAERVINQALKKLPSQSPPPDDIPASSSLIKVIRRAQAAQKSRGDTHLAVDQLIMGLLED 120
Query: 574 SQIGDLLKEAGVAAAKVKSELDKLRGKVGKKVESASGDTTFQALKTYGRDLVEQAGKLDP 753
SQI DLL E GVA A+VKSE +KLRGK GKKVESASGDT FQALKTYGRDLVEQAGKLDP
Sbjct: 121 SQIRDLLNEVGVATARVKSEFEKLRGKEGKKVESASGDTNFQALKTYGRDLVEQAGKLDP 180
Query: 754 VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSIFLMLGLLH 933
VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIV+GDVP+ + L+
Sbjct: 181 VIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVKGDVPNSLTDVRLI- 239
Query: 934 WTWVLWLLVLSI---GENLRRGEAVLKEVE 1014
+ + LV GE R ++VLKEVE
Sbjct: 240 -SLDMGALVAGAKYRGEFEERLKSVLKEVE 268
>gb|AAC83688.2| 101 kDa heat shock protein; HSP101 [Nicotiana tabacum]
Length = 909
Score = 384 bits (985), Expect = e-105
Identities = 209/271 (77%), Positives = 233/271 (85%), Gaps = 4/271 (1%)
Frame = +1
Query: 214 MNPDKFTHKTNEALAGAHELAMSSGHAQMTPLHLASTLISDPNGIFFQAISNSSG-EESA 390
MNP+KFTHKTNEALAGA ELA+S+GHAQ TPLH+A LISD NGIF QAI N+ G EE A
Sbjct: 1 MNPEKFTHKTNEALAGALELALSAGHAQFTPLHMAVALISDHNGIFRQAIVNAGGNEEVA 60
Query: 391 RAVERVLNQALKKLPSQSPPPDEIPASTTLIKAIRRAQAAQKSRGDTHLAVDQLILGILE 570
+VERVLNQA+KKLPSQ+P PDEIP ST+LIK +RRAQ++QKSRGD+HLAVDQLILG+LE
Sbjct: 61 NSVERVLNQAMKKLPSQTPAPDEIPPSTSLIKVLRRAQSSQKSRGDSHLAVDQLILGLLE 120
Query: 571 DSQIGDLLKEAGVAAAKVKSELDKLRGKVGKKVESASGDTTFQALKTYGRDLVEQAGKLD 750
DSQIGDLLKEAGV+A++VKSE++KLRGK G+KVESASGDTTFQAL TYGRDLVEQAGKLD
Sbjct: 121 DSQIGDLLKEAGVSASRVKSEVEKLRGKEGRKVESASGDTTFQALNTYGRDLVEQAGKLD 180
Query: 751 PVIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSIFLMLGLL 930
PVIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPS + L+
Sbjct: 181 PVIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLADVRLI 240
Query: 931 HWTWVLWLLVLSI---GENLRRGEAVLKEVE 1014
+ LV GE R +AVLKEVE
Sbjct: 241 --ALDMGALVAGAKYRGEFEERLKAVLKEVE 269
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 914,893,500
Number of Sequences: 1393205
Number of extensions: 21406765
Number of successful extensions: 76410
Number of sequences better than 10.0: 563
Number of HSP's better than 10.0 without gapping: 69869
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 75714
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 58773479151
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)