Miyakogusa Predicted Gene
- Lj5g3v1598380.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1598380.1 tr|I1JBF6|I1JBF6_SOYBN ATP-dependent Clp protease
proteolytic subunit OS=Glycine max GN=Gma.6743
PE=,97.11,0,CLP_protease,ClpP/TepA; CLP_PROTEASE_SER,ClpP, active
site; CLP_PROTEASE_HIS,ClpP, active site; ATP-,CUFF.55553.1
(173 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value
AT1G02560.1 | Symbols: CLPP5, NCLPP5, NCLPP1 | nuclear encoded C...     334  2e-92
AT1G66670.1 | Symbols: CLPP3, NCLPP3 | CLP protease proteolytic ...     193  5e-50
AT5G45390.1 | Symbols: CLPP4, NCLPP4 | CLP protease P4 | chr5:18...     183  6e-47
AT5G23140.1 | Symbols: CLPP2, NCLPP7 | nuclear-encoded CLP prote...     166  7e-42
AT1G11750.2 | Symbols: CLPP6 | CLP protease proteolytic subunit ...     151  2e-37
AT1G11750.1 | Symbols: CLPP6, NCLPP1, NCLPP6 | CLP protease prot...     151  2e-37
ATCG00670.1 | Symbols: CLPP1, PCLPP | plastid-encoded CLP P | ch...     136  7e-33
AT1G12410.1 | Symbols: CLPR2, NCLPP2, CLP2 | CLP protease proteo...     134  4e-32
AT4G17040.1 | Symbols: CLPR4 | CLP protease R subunit 4 | chr4:9...     125  1e-29
AT1G09130.3 | Symbols: | ATP-dependent caseinolytic (Clp) prote...      119  1e-27
AT1G09130.2 | Symbols: | ATP-dependent caseinolytic (Clp) prote...      119  1e-27
AT1G09130.1 | Symbols: | ATP-dependent caseinolytic (Clp) prote...      119  1e-27
AT1G49970.1 | Symbols: CLPR1, NCLPP5, SVR2 | CLP protease proteo...      93  1e-19
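The hit list above can be read programmatically. The following is a minimal Python sketch, not part of BLAST itself; the helper name `parse_hit_row` is hypothetical. It assumes each row ends with two whitespace-separated fields, the bit score and the E value, and that the subject identifier is the first token of the row. Some legacy BLAST versions print very small E values without a leading digit (for example `e-100`), so the sketch prepends `1` in that case.

```python
def parse_hit_row(line):
    """Return (subject_id, bit_score, e_value) for one summary row, or None.

    Hypothetical helper: assumes the last two whitespace-separated
    fields of the row are the bit score and the E value.
    """
    parts = line.rsplit(None, 2)  # split off the two trailing fields
    if len(parts) != 3:
        return None
    description, bits, evalue = parts
    if evalue.startswith("e"):
        # Legacy shorthand such as "e-100" means 1e-100.
        evalue = "1" + evalue
    try:
        return description.split()[0], float(bits), float(evalue)
    except ValueError:
        # Header lines ("(bits) Value") and other text are not hit rows.
        return None


rows = [
    "AT1G02560.1 | Symbols: CLPP5, NCLPP5, NCLPP1 | nuclear encoded C... 334 2e-92",
    "AT1G49970.1 | Symbols: CLPR1, NCLPP5, SVR2 | CLP protease proteo... 93 1e-19",
]
hits = [parse_hit_row(row) for row in rows]
for subject_id, bits, evalue in hits:
    print(subject_id, bits, evalue)
```

Splitting from the right keeps the parser independent of how the truncated description is spaced or padded.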
>AT1G02560.1 | Symbols: CLPP5, NCLPP5, NCLPP1 | nuclear encoded CLP
protease 5 | chr1:538000-539805 FORWARD LENGTH=298
Length = 298
Score = 334 bits (856), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 161/164 (98%), Positives = 162/164 (98%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASM 60
MANIIVAQLLYLDAVDP KDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASM
Sbjct: 135 MANIIVAQLLYLDAVDPTKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASM 194
Query: 61 GAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYLSYHTG 120
GAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYL+YHTG
Sbjct: 195 GAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYLAYHTG 254
Query: 121 QSLEKINQDTDRDFFMSAKEAKEYGLIDGVIMNPLKALQPLPAA 164
QSLEKINQDTDRDFFMSAKEAKEYGLIDGVIMNPLKALQPL AA
Sbjct: 255 QSLEKINQDTDRDFFMSAKEAKEYGLIDGVIMNPLKALQPLAAA 298
>AT1G66670.1 | Symbols: CLPP3, NCLPP3 | CLP protease proteolytic
subunit 3 | chr1:24863995-24865646 REVERSE LENGTH=309
Length = 309
Score = 193 bits (491), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 87/150 (58%), Positives = 116/150 (77%)
Query: 2 ANIIVAQLLYLDAVDPNKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASMG 61
A+++++QLL LDA D +DI +++NSPGGS+TAGM I+D M+ + DVSTVC+GLAASMG
Sbjct: 107 ADLVISQLLLLDAEDSERDITLFINSPGGSITAGMGIYDAMKQCKADVSTVCLGLAASMG 166
Query: 62 AFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYLSYHTGQ 121
AFLL++G+KGKRY +PNS++MIHQPLG A G T++ I+ EM++HK LN S TG+
Sbjct: 167 AFLLASGSKGKRYCMPNSKVMIHQPLGTAGGKATEMSIRIREMMYHKIKLNKIFSRITGK 226
Query: 122 SLEKINQDTDRDFFMSAKEAKEYGLIDGVI 151
+I DTDRD F++ EAKEYGLID VI
Sbjct: 227 PESEIESDTDRDNFLNPWEAKEYGLIDAVI 256
>AT5G45390.1 | Symbols: CLPP4, NCLPP4 | CLP protease P4 |
chr5:18396351-18397586 FORWARD LENGTH=292
Length = 292
Score = 183 bits (464), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 86/163 (52%), Positives = 125/163 (76%), Gaps = 1/163 (0%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASM 60
+A+ I++QLL LDA DP KDI +++NSPGGS++A MAI+D ++ +R DVST+ +G+AAS
Sbjct: 100 VADAIMSQLLLLDAKDPKKDIKLFINSPGGSLSATMAIYDVVQLVRADVSTIALGIAAST 159
Query: 61 GAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYLSYHTG 120
+ +L AGTKGKR+++PN+RIMIHQPLGGA G D++IQA E++H+K N+ ++ T
Sbjct: 160 ASIILGAGTKGKRFAMPNTRIMIHQPLGGASGQAIDVEIQAKEVMHNKNNVTSIIAGCTS 219
Query: 121 QSLEKINQDTDRDFFMSAKEAKEYGLIDGVIM-NPLKALQPLP 162
+S E++ +D DRD +MS EA EYGLIDGVI + + L+P+P
Sbjct: 220 RSFEQVLKDIDRDRYMSPIEAVEYGLIDGVIDGDSIIPLEPVP 262
>AT5G23140.1 | Symbols: CLPP2, NCLPP7 | nuclear-encoded CLP protease
P7 | chr5:7783811-7784826 FORWARD LENGTH=241
Length = 241
Score = 166 bits (420), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 83/171 (48%), Positives = 119/171 (69%), Gaps = 2/171 (1%)
Query: 2 ANIIVAQLLYLDAVDPNKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASMG 61
++++VAQLLYL++ +P+K I MY+NSPGG VTAG+AI+DTM++IR +ST+C+G AASM
Sbjct: 70 SHVVVAQLLYLESENPSKPIHMYLNSPGGHVTAGLAIYDTMQYIRSPISTICLGQAASMA 129
Query: 62 AFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYLSYHTGQ 121
+ LL+AG KG+R SLPN+ +MIHQP GG G DI I +++ LN HTGQ
Sbjct: 130 SLLLAAGAKGQRRSLPNATVMIHQPSGGYSGQAKDITIHTKQIVRVWDALNELYVKHTGQ 189
Query: 122 SLEKINQDTDRDFFMSAKEAKEYGLIDGVI-MNPLKALQPLPAAEEGKDRA 171
L+ + + DRD FM+ +EAK +G+ID VI PL+ ++ E KD++
Sbjct: 190 PLDVVANNMDRDHFMTPEEAKAFGIIDEVIDERPLELVKD-AVGNESKDKS 239
>AT1G11750.2 | Symbols: CLPP6 | CLP protease proteolytic subunit 6 |
chr1:3967609-3969535 FORWARD LENGTH=289
Length = 289
Score = 151 bits (382), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 71/151 (47%), Positives = 102/151 (67%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASM 60
+A +++QL+ L ++D DI+MY+N PGGS + +AI+D M I+P V TV G+AAS
Sbjct: 135 VAQRVISQLVTLASIDDKSDILMYLNCPGGSTYSVLAIYDCMSWIKPKVGTVAFGVAASQ 194
Query: 61 GAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYLSYHTG 120
GA LL+ G KG RY++PN+R+MIHQP G G D+ Q NE + + ++ + TG
Sbjct: 195 GALLLAGGEKGMRYAMPNTRVMIHQPQTGCGGHVEDVRRQVNEAIEARQKIDRMYAAFTG 254
Query: 121 QSLEKINQDTDRDFFMSAKEAKEYGLIDGVI 151
Q LEK+ Q T+RD F+SA EA E+GLIDG++
Sbjct: 255 QPLEKVQQYTERDRFLSASEALEFGLIDGLL 285
>AT1G11750.1 | Symbols: CLPP6, NCLPP1, NCLPP6 | CLP protease
proteolytic subunit 6 | chr1:3967609-3969535 FORWARD
LENGTH=271
Length = 271
Score = 151 bits (382), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 71/151 (47%), Positives = 102/151 (67%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASM 60
+A +++QL+ L ++D DI+MY+N PGGS + +AI+D M I+P V TV G+AAS
Sbjct: 117 VAQRVISQLVTLASIDDKSDILMYLNCPGGSTYSVLAIYDCMSWIKPKVGTVAFGVAASQ 176
Query: 61 GAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYLSYHTG 120
GA LL+ G KG RY++PN+R+MIHQP G G D+ Q NE + + ++ + TG
Sbjct: 177 GALLLAGGEKGMRYAMPNTRVMIHQPQTGCGGHVEDVRRQVNEAIEARQKIDRMYAAFTG 236
Query: 121 QSLEKINQDTDRDFFMSAKEAKEYGLIDGVI 151
Q LEK+ Q T+RD F+SA EA E+GLIDG++
Sbjct: 237 QPLEKVQQYTERDRFLSASEALEFGLIDGLL 267
>ATCG00670.1 | Symbols: CLPP1, PCLPP | plastid-encoded CLP P |
chrC:69910-71882 REVERSE LENGTH=196
Length = 196
Score = 136 bits (343), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 100/151 (66%), Gaps = 1/151 (0%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASM 60
++N +++ ++YL KD+ +++NSPGG V +GMAI+DTM+ +RPDV T+C+GLAAS+
Sbjct: 43 ISNQLISLMIYLSIEKDTKDLYLFINSPGGWVISGMAIYDTMQFVRPDVQTICMGLAASI 102
Query: 61 GAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQT-DIDIQANEMLHHKANLNGYLSYHT 119
+F+L G KR + P++R+MIHQP QT + ++A E+L + + T
Sbjct: 103 ASFILVGGAITKRIAFPHARVMIHQPASSFYEAQTGEFILEAEELLKLRETITRVYVQRT 162
Query: 120 GQSLEKINQDTDRDFFMSAKEAKEYGLIDGV 150
G+ + I++D +RD FMSA EA+ +G++D V
Sbjct: 163 GKPIWVISEDMERDVFMSATEAQAHGIVDLV 193
>AT1G12410.1 | Symbols: CLPR2, NCLPP2, CLP2 | CLP protease
proteolytic subunit 2 | chr1:4223099-4224954 FORWARD
LENGTH=279
Length = 279
Score = 134 bits (336), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 66/154 (42%), Positives = 99/154 (64%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPGGSVTAGMAIFDTMRHIRPDVSTVCVGLAASM 60
+N I+A +LYLD +D ++ I MY+N PGG +T +AI+DTM+ ++ V T CVGLA ++
Sbjct: 110 FSNQILATMLYLDTLDDSRRIYMYLNGPGGDLTPSLAIYDTMKSLKSPVGTHCVGLAYNL 169
Query: 61 GAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLNGYLSYHTG 120
FLL+AG KG R+++P SRI + P G A+G DI +A E+ + L L+ +TG
Sbjct: 170 AGFLLAAGEKGHRFAMPLSRIALQSPAGAARGQADDIQNEAKELSRIRDYLFNELAKNTG 229
Query: 121 QSLEKINQDTDRDFFMSAKEAKEYGLIDGVIMNP 154
Q E++ +D R +A+EA EYGLID ++ P
Sbjct: 230 QPAERVFKDLSRVKRFNAEEAIEYGLIDKIVRPP 263
>AT4G17040.1 | Symbols: CLPR4 | CLP protease R subunit 4 |
chr4:9586740-9589297 REVERSE LENGTH=305
Length = 305
Score = 125 bits (314), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/161 (40%), Positives = 91/161 (56%), Gaps = 8/161 (4%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPG--------GSVTAGMAIFDTMRHIRPDVSTV 52
+ +I+A+ LYL D K I +Y+NS G G T AI+D M +++P + T+
Sbjct: 126 VTELILAEFLYLQYEDEEKPIYLYINSTGTTKNGEKLGYDTEAFAIYDVMGYVKPPIFTL 185
Query: 53 CVGLAASMGAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQGGQTDIDIQANEMLHHKANLN 112
CVG A A LL+AG KG R +LP+S IMI QP+ QG TD++I E+ H K +
Sbjct: 186 CVGNAWGEAALLLTAGAKGNRSALPSSTIMIKQPIARFQGQATDVEIARKEIKHIKTEMV 245
Query: 113 GYLSYHTGQSLEKINQDTDRDFFMSAKEAKEYGLIDGVIMN 153
S H G+S E+I D R + S EA EYG+ID V+ N
Sbjct: 246 KLYSKHIGKSPEQIEADMKRPKYFSPTEAVEYGIIDKVVYN 286
>AT1G09130.3 | Symbols: | ATP-dependent caseinolytic (Clp)
protease/crotonase family protein | chr1:2939731-2942217
REVERSE LENGTH=370
Length = 370
Score = 119 bits (298), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 61/163 (37%), Positives = 99/163 (60%), Gaps = 11/163 (6%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPG---------GSVTAGMAIFDTMRHIRPDVST 51
+ ++VA+L+YL +DP + I +Y+NS G G + G AI+D++ ++ +V T
Sbjct: 142 VTELVVAELMYLQWLDPKEPIYIYINSTGTTRDDGETVGMESEGFAIYDSLMQLKNEVHT 201
Query: 52 VCVGLAASMGAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQG--GQTDIDIQANEMLHHKA 109
VCVG A LLSAGTKGKR+ +P+++ MI QP + G +D+ I+A E++ ++
Sbjct: 202 VCVGAAIGQACLLLSAGTKGKRFMMPHAKAMIQQPRVPSSGLMPASDVLIRAKEVITNRD 261
Query: 110 NLNGYLSYHTGQSLEKINQDTDRDFFMSAKEAKEYGLIDGVIM 152
L LS HTG S+E + R ++M A +AKE+G+ID ++
Sbjct: 262 ILVELLSKHTGNSVETVANVMRRPYYMDAPKAKEFGVIDRILW 304
>AT1G09130.2 | Symbols: | ATP-dependent caseinolytic (Clp)
protease/crotonase family protein | chr1:2940063-2942217
REVERSE LENGTH=330
Length = 330
Score = 119 bits (298), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 61/163 (37%), Positives = 99/163 (60%), Gaps = 11/163 (6%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPG---------GSVTAGMAIFDTMRHIRPDVST 51
+ ++VA+L+YL +DP + I +Y+NS G G + G AI+D++ ++ +V T
Sbjct: 142 VTELVVAELMYLQWLDPKEPIYIYINSTGTTRDDGETVGMESEGFAIYDSLMQLKNEVHT 201
Query: 52 VCVGLAASMGAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQG--GQTDIDIQANEMLHHKA 109
VCVG A LLSAGTKGKR+ +P+++ MI QP + G +D+ I+A E++ ++
Sbjct: 202 VCVGAAIGQACLLLSAGTKGKRFMMPHAKAMIQQPRVPSSGLMPASDVLIRAKEVITNRD 261
Query: 110 NLNGYLSYHTGQSLEKINQDTDRDFFMSAKEAKEYGLIDGVIM 152
L LS HTG S+E + R ++M A +AKE+G+ID ++
Sbjct: 262 ILVELLSKHTGNSVETVANVMRRPYYMDAPKAKEFGVIDRILW 304
>AT1G09130.1 | Symbols: | ATP-dependent caseinolytic (Clp)
protease/crotonase family protein | chr1:2940063-2942217
REVERSE LENGTH=330
Length = 330
Score = 119 bits (298), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 61/163 (37%), Positives = 99/163 (60%), Gaps = 11/163 (6%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPG---------GSVTAGMAIFDTMRHIRPDVST 51
+ ++VA+L+YL +DP + I +Y+NS G G + G AI+D++ ++ +V T
Sbjct: 142 VTELVVAELMYLQWLDPKEPIYIYINSTGTTRDDGETVGMESEGFAIYDSLMQLKNEVHT 201
Query: 52 VCVGLAASMGAFLLSAGTKGKRYSLPNSRIMIHQPLGGAQG--GQTDIDIQANEMLHHKA 109
VCVG A LLSAGTKGKR+ +P+++ MI QP + G +D+ I+A E++ ++
Sbjct: 202 VCVGAAIGQACLLLSAGTKGKRFMMPHAKAMIQQPRVPSSGLMPASDVLIRAKEVITNRD 261
Query: 110 NLNGYLSYHTGQSLEKINQDTDRDFFMSAKEAKEYGLIDGVIM 152
L LS HTG S+E + R ++M A +AKE+G+ID ++
Sbjct: 262 ILVELLSKHTGNSVETVANVMRRPYYMDAPKAKEFGVIDRILW 304
>AT1G49970.1 | Symbols: CLPR1, NCLPP5, SVR2 | CLP protease
proteolytic subunit 1 | chr1:18501936-18504462 REVERSE
LENGTH=387
Length = 387
Score = 92.8 bits (229), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 87/160 (54%), Gaps = 10/160 (6%)
Query: 1 MANIIVAQLLYLDAVDPNKDIVMYVNSPG---------GSVTAGMAIFDTMRHIRPDVST 51
+ ++VAQ ++LD +P K I +Y+NSPG GS T AI DT+ + + DV T
Sbjct: 192 VTELLVAQFMWLDYDNPTKPIYLYINSPGTQNEKMETVGSETEAYAIADTISYCKSDVYT 251
Query: 52 VCVGLAASMGAFLLSAGTKGKRYSLPNSRIMIHQP-LGGAQGGQTDIDIQANEMLHHKAN 110
+ G+A A LLS G KG R P+S ++ P + + G D+ I+A E+ +
Sbjct: 252 INCGMAFGQAAMLLSLGKKGYRAVQPHSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEY 311
Query: 111 LNGYLSYHTGQSLEKINQDTDRDFFMSAKEAKEYGLIDGV 150
L+ TG+S E+IN+D R ++ A+ A +YG+ D +
Sbjct: 312 YIELLAKGTGKSKEQINEDIKRPKYLQAQAAIDYGIADKI 351
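The "Identities", "Positives" and "Gaps" lines in the alignments above report each ratio with a whole-number percentage. In this report every percentage equals the integer part of the ratio rather than a rounded value; for example, 66/161 is listed as 40% although the ratio is about 40.99%. The Python sketch below shows one way to extract these figures and recompute the percentages. The names `STAT` and `parse_stats` are hypothetical, and the truncation rule is inferred from the numbers in this report, not taken from BLAST documentation.

```python
import re

# Matches fields such as "Identities = 86/163 (52%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Map each statistic name to (count, alignment_length, reported_percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT.findall(line)
    }


line = "Identities = 86/163 (52%), Positives = 125/163 (76%), Gaps = 1/163 (0%)"
stats = parse_stats(line)
for name, (count, length, reported) in stats.items():
    exact = 100.0 * count / length       # unrounded percentage
    truncated = 100 * count // length    # integer part, as printed in the report
    print(f"{name}: {count}/{length} = {exact:.2f}% (reported {reported}%, truncated {truncated}%)")
```

When comparing hits near a threshold, the exact ratio is the safer figure to use, since the printed percentage can understate it by almost one point.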