Miyakogusa Predicted Gene
- Lj4g3v2046230.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2046230.1 Non Characterized Hit- tr|I1MSU3|I1MSU3_SOYBN
Uncharacterized protein OS=Glycine max PE=3 SV=1,89.9,0,Transketolase,
pyrimidine binding domain,Transketolase-like, pyrimidine-binding
domain; TRANSKETOLAS,CUFF.50183.1
(416 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                      Score    E
Sequences producing significant alignments:                          (bits)  Value
AT4G15560.1 | Symbols: CLA1, DEF, CLA, DXS, DXPS2 | Deoxyxylulos...     616  e-177
AT3G21500.2 | Symbols: DXPS1 | 1-deoxy-D-xylulose 5-phosphate sy...     543  e-155
AT3G21500.1 | Symbols: DXPS1 | 1-deoxy-D-xylulose 5-phosphate sy...     537  e-153
AT5G11380.1 | Symbols: DXPS3 | 1-deoxy-D-xylulose 5-phosphate sy...     437  e-123
AT5G11380.2 | Symbols: DXPS3 | 1-deoxy-D-xylulose 5-phosphate sy...     285  3e-77
AT5G50850.1 | Symbols: MAB1 | Transketolase family protein | chr...      64  1e-10
AT1G55510.1 | Symbols: BCDH BETA1 | branched-chain alpha-keto ac...      59  5e-09
AT3G13450.1 | Symbols: DIN4 | Transketolase family protein | chr...      58  1e-08
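
The hit table above ends each row with a bit score and an E-value, and this legacy BLAST version prints E-values such as 1e-177 in the shortened form "e-177". The following is a minimal sketch of how such a row could be read into numbers. It assumes the last two whitespace-separated fields are always the bit score and the E-value; the function names `parse_evalue` and `parse_hit_line` are hypothetical helpers and not part of BLAST.

```python
def parse_evalue(text):
    """Convert an E-value string to float.

    Assumption: a leading "e" (as in "e-177") stands for "1e-177".
    """
    if text.startswith("e"):
        return float("1" + text)
    return float(text)


def parse_hit_line(line):
    """Split one hit-table row into (sequence id, bit score, E-value).

    Assumption: the row ends with "<bits> <evalue>" and the sequence id
    is the text before the first "|" separator.
    """
    description, bits, evalue = line.rsplit(None, 2)
    seq_id = description.split("|", 1)[0].strip()
    return seq_id, float(bits), parse_evalue(evalue)


row = "AT5G11380.2 | Symbols: DXPS3 | 1-deoxy-D-xylulose 5-phosphate sy... 285 3e-77"
print(parse_hit_line(row))
```

Splitting from the right keeps the sketch independent of the truncated description text, which may itself contain digits and spaces.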
>AT4G15560.1 | Symbols: CLA1, DEF, CLA, DXS, DXPS2 |
Deoxyxylulose-5-phosphate synthase |
chr4:8884218-8887254 FORWARD LENGTH=717
Length = 717
Score = 616 bits (1589), Expect = e-177, Method: Compositional matrix adjust.
Identities = 289/399 (72%), Positives = 331/399 (82%)
Query: 2 RGMVSGSGACFFEELGLFYIGPVDGHDMEDLVHILKDVKALPALGPVLIHVISEKGKGYR 61
RGM+SG+G+ FEELGL+YIGPVDGH+++DLV ILK+VK+ GPVLIHV++EKG+GY
Sbjct: 310 RGMISGTGSSLFEELGLYYIGPVDGHNIDDLVAILKEVKSTRTTGPVLIHVVTEKGRGYP 369
Query: 62 PAEVATDKMHGVVKFDPKSGKQLKPKTSTRSYTQYFAESLTAEADADERIVAIHAAMGGG 121
AE A DK HGVVKFDP +G+Q K T+SYT YFAE+L AEA+ D+ +VAIHAAMGGG
Sbjct: 370 YAERADDKYHGVVKFDPATGRQFKTTNKTQSYTTYFAEALVAEAEVDKDVVAIHAAMGGG 429
Query: 122 TGLNLFQKHFPERCFDVGIAEQHAVTFAAGLAAEGLKPFCAIYSSFLQRGYDQVAHDVDL 181
TGLNLFQ+ FP RCFDVGIAEQHAVTFAAGLA EGLKPFCAIYSSF+QR YDQV HDVDL
Sbjct: 430 TGLNLFQRRFPTRCFDVGIAEQHAVTFAAGLACEGLKPFCAIYSSFMQRAYDQVVHDVDL 489
Query: 182 QKLPVRFAIDRAGLVGADGPTHCGAFDTTFMACLPNMVVMAPSDETELMHMVATAAAIDD 241
QKLPVRFA+DRAGLVGADGPTHCGAFD TFMACLPNM+VMAPSDE +L +MVATA AIDD
Sbjct: 490 QKLPVRFAMDRAGLVGADGPTHCGAFDVTFMACLPNMIVMAPSDEADLFNMVATAVAIDD 549
Query: 242 RPSCFRYPRGNGIGSILPPNNKGTPLEVGKGRVLKEGSRVALVGYGTMVQSCMEAAKVLD 301
RPSCFRYPRGNGIG LPP NKG P+E+GKGR+LKEG RVAL+GYG+ VQSC+ AA +L+
Sbjct: 550 RPSCFRYPRGNGIGVALPPGNKGVPIEIGKGRILKEGERVALLGYGSAVQSCLGAAVMLE 609
Query: 302 AHGISTTVADARFCKPLDGNLMRQLAREHEILITVEEGSIGGFGSHVSQFXXXXXXXXXX 361
G++ TVADARFCKPLD L+R LA+ HE+LITVEEGSIGGFGSHV QF
Sbjct: 610 ERGLNVTVADARFCKPLDRALIRSLAKSHEVLITVEEGSIGGFGSHVVQFLALDGLLDGK 669
Query: 362 XKWRAMTLPDKYINHGTQRDQIEVAGLSSKHIAATALSL 400
KWR M LPD+YI+HG DQ+ AGL HIAATAL+L
Sbjct: 670 LKWRPMVLPDRYIDHGAPADQLAEAGLMPSHIAATALNL 708
>AT3G21500.2 | Symbols: DXPS1 | 1-deoxy-D-xylulose 5-phosphate
synthase 1 | chr3:7573907-7576594 REVERSE LENGTH=641
Length = 641
Score = 543 bits (1400), Expect = e-155, Method: Compositional matrix adjust.
Identities = 252/349 (72%), Positives = 293/349 (83%)
Query: 3 GMVSGSGACFFEELGLFYIGPVDGHDMEDLVHILKDVKALPALGPVLIHVISEKGKGYRP 62
GM+ + + FEELG Y+GPVDGH+++DLV IL+ +K+ +GPVLIHV++EKG+GY
Sbjct: 269 GMIRETSSTLFEELGFHYVGPVDGHNIDDLVSILETLKSTKTIGPVLIHVVTEKGRGYPY 328
Query: 63 AEVATDKMHGVVKFDPKSGKQLKPKTSTRSYTQYFAESLTAEADADERIVAIHAAMGGGT 122
AE A DK HGV+KFDP++GKQ K + T+SYT F E+L AEA+AD+ IVAIHAAMGGGT
Sbjct: 329 AERADDKYHGVLKFDPETGKQFKNISKTQSYTSCFVEALIAEAEADKDIVAIHAAMGGGT 388
Query: 123 GLNLFQKHFPERCFDVGIAEQHAVTFAAGLAAEGLKPFCAIYSSFLQRGYDQVAHDVDLQ 182
LNLF+ FP RCFDVGIAEQHAVTFAAGLA EGLKPFC IYSSF+QR YDQV HDVDLQ
Sbjct: 389 MLNLFESRFPTRCFDVGIAEQHAVTFAAGLACEGLKPFCTIYSSFMQRAYDQVVHDVDLQ 448
Query: 183 KLPVRFAIDRAGLVGADGPTHCGAFDTTFMACLPNMVVMAPSDETELMHMVATAAAIDDR 242
KLPVRFAIDRAGL+GADGPTHCGAFD TFMACLPNM+VMAPSDE EL +MVATAAAIDDR
Sbjct: 449 KLPVRFAIDRAGLMGADGPTHCGAFDVTFMACLPNMIVMAPSDEAELFNMVATAAAIDDR 508
Query: 243 PSCFRYPRGNGIGSILPPNNKGTPLEVGKGRVLKEGSRVALVGYGTMVQSCMEAAKVLDA 302
PSCFRY RGNGIG LPP NKG PL++G+GR+L++G RVAL+GYG+ VQ C+EAA +L
Sbjct: 509 PSCFRYHRGNGIGVSLPPGNKGVPLQIGRGRILRDGERVALLGYGSAVQRCLEAASMLSE 568
Query: 303 HGISTTVADARFCKPLDGNLMRQLAREHEILITVEEGSIGGFGSHVSQF 351
G+ TVADARFCKPLD L+R LA+ HE+LITVEEGSIGGFGSHV QF
Sbjct: 569 RGLKITVADARFCKPLDVALIRSLAKSHEVLITVEEGSIGGFGSHVVQF 617
>AT3G21500.1 | Symbols: DXPS1 | 1-deoxy-D-xylulose 5-phosphate
synthase 1 | chr3:7573907-7576594 REVERSE LENGTH=640
Length = 640
Score = 537 bits (1384), Expect = e-153, Method: Compositional matrix adjust.
Identities = 251/349 (71%), Positives = 292/349 (83%), Gaps = 1/349 (0%)
Query: 3 GMVSGSGACFFEELGLFYIGPVDGHDMEDLVHILKDVKALPALGPVLIHVISEKGKGYRP 62
GM+ + + FEELG Y+GPVDGH+++DLV IL+ +K+ +GPVLIHV++EKG+GY
Sbjct: 269 GMIRETSSTLFEELGFHYVGPVDGHNIDDLVSILETLKSTKTIGPVLIHVVTEKGRGYPY 328
Query: 63 AEVATDKMHGVVKFDPKSGKQLKPKTSTRSYTQYFAESLTAEADADERIVAIHAAMGGGT 122
AE A DK H V+KFDP++GKQ K + T+SYT F E+L AEA+AD+ IVAIHAAMGGGT
Sbjct: 329 AERADDKYH-VLKFDPETGKQFKNISKTQSYTSCFVEALIAEAEADKDIVAIHAAMGGGT 387
Query: 123 GLNLFQKHFPERCFDVGIAEQHAVTFAAGLAAEGLKPFCAIYSSFLQRGYDQVAHDVDLQ 182
LNLF+ FP RCFDVGIAEQHAVTFAAGLA EGLKPFC IYSSF+QR YDQV HDVDLQ
Sbjct: 388 MLNLFESRFPTRCFDVGIAEQHAVTFAAGLACEGLKPFCTIYSSFMQRAYDQVVHDVDLQ 447
Query: 183 KLPVRFAIDRAGLVGADGPTHCGAFDTTFMACLPNMVVMAPSDETELMHMVATAAAIDDR 242
KLPVRFAIDRAGL+GADGPTHCGAFD TFMACLPNM+VMAPSDE EL +MVATAAAIDDR
Sbjct: 448 KLPVRFAIDRAGLMGADGPTHCGAFDVTFMACLPNMIVMAPSDEAELFNMVATAAAIDDR 507
Query: 243 PSCFRYPRGNGIGSILPPNNKGTPLEVGKGRVLKEGSRVALVGYGTMVQSCMEAAKVLDA 302
PSCFRY RGNGIG LPP NKG PL++G+GR+L++G RVAL+GYG+ VQ C+EAA +L
Sbjct: 508 PSCFRYHRGNGIGVSLPPGNKGVPLQIGRGRILRDGERVALLGYGSAVQRCLEAASMLSE 567
Query: 303 HGISTTVADARFCKPLDGNLMRQLAREHEILITVEEGSIGGFGSHVSQF 351
G+ TVADARFCKPLD L+R LA+ HE+LITVEEGSIGGFGSHV QF
Sbjct: 568 RGLKITVADARFCKPLDVALIRSLAKSHEVLITVEEGSIGGFGSHVVQF 616
>AT5G11380.1 | Symbols: DXPS3 | 1-deoxy-D-xylulose 5-phosphate
synthase 3 | chr5:3630172-3633250 FORWARD LENGTH=700
Length = 700
Score = 437 bits (1123), Expect = e-123, Method: Compositional matrix adjust.
Identities = 219/400 (54%), Positives = 278/400 (69%), Gaps = 22/400 (5%)
Query: 2 RGMVSGSGACFFEELGLFYIGPVDGHDMEDLVHILKDVKALPALGPVLIHVISEKGKGYR 61
RGMV +G+ FEELGL+YIGPVDGH++EDLV +L++V +L ++GPVL+HVI+E G R
Sbjct: 310 RGMVGPTGSTLFEELGLYYIGPVDGHNIEDLVCVLREVSSLDSMGPVLVHVITE---GNR 366
Query: 62 PAEVATDKMHGVVKFDPKSGKQLKPKTSTRSYTQYFAESLTAEADADERIVAIHAAMGGG 121
AE + M VK R+Y+ F E+L EA+ D IV +HA M
Sbjct: 367 DAETVKNIM---VK-------------DRRTYSDCFVEALVMEAEKDRDIVVVHAGMEMD 410
Query: 122 TGLNLFQKHFPERCFDVGIAEQHAVTFAAGLAAEGLKPFCAIYSSFLQRGYDQVAHDVDL 181
L FQ+ FP+R F+VG+AEQHAVTF+AGL++ GLKPFC I S+FLQR YDQV HDVD
Sbjct: 411 PSLLTFQERFPDRFFNVGMAEQHAVTFSAGLSSGGLKPFCIIPSAFLQRAYDQVVHDVDR 470
Query: 182 QKLPVRFAIDRAGLVGADGPTHCGAFDTTFMACLPNMVVMAPSDETELMHMVATAAAIDD 241
Q+ VRF I AGLVG+DGP CGAFD FM+ LPNM+ MAP+DE EL++MVATAA + D
Sbjct: 471 QRKAVRFVITSAGLVGSDGPVQCGAFDIAFMSSLPNMIAMAPADEDELVNMVATAAYVTD 530
Query: 242 RPSCFRYPRGNGIG-SILPPNNKGTPLEVGKGRVLKEGSRVALVGYGTMVQSCMEAAKVL 300
RP CFR+PRG+ + + L P G P+E+G+GRVL EG VAL+GYG MVQ+C+ A +L
Sbjct: 531 RPVCFRFPRGSIVNMNYLVPT--GLPIEIGRGRVLVEGQDVALLGYGAMVQNCLHAHSLL 588
Query: 301 DAHGISTTVADARFCKPLDGNLMRQLAREHEILITVEEGSIGGFGSHVSQFXXXXXXXXX 360
G++ TVADARFCKPLD L+R L + H+ LITVEEG +GGFGSHV+QF
Sbjct: 589 SKLGLNVTVADARFCKPLDIKLVRDLCQNHKFLITVEEGCVGGFGSHVAQFIALDGQLDG 648
Query: 361 XXKWRAMTLPDKYINHGTQRDQIEVAGLSSKHIAATALSL 400
KWR + LPD YI + R+Q+ +AGL+ HIAATALSL
Sbjct: 649 NIKWRPIVLPDGYIEEASPREQLALAGLTGHHIAATALSL 688
>AT5G11380.2 | Symbols: DXPS3 | 1-deoxy-D-xylulose 5-phosphate
synthase 3 | chr5:3630172-3632762 FORWARD LENGTH=565
Length = 565
Score = 285 bits (730), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 140/251 (55%), Positives = 177/251 (70%), Gaps = 19/251 (7%)
Query: 2 RGMVSGSGACFFEELGLFYIGPVDGHDMEDLVHILKDVKALPALGPVLIHVISEKGKGYR 61
RGMV +G+ FEELGL+YIGPVDGH++EDLV +L++V +L ++GPVL+HVI+E G R
Sbjct: 310 RGMVGPTGSTLFEELGLYYIGPVDGHNIEDLVCVLREVSSLDSMGPVLVHVITE---GNR 366
Query: 62 PAEVATDKMHGVVKFDPKSGKQLKPKTSTRSYTQYFAESLTAEADADERIVAIHAAMGGG 121
AE + M VK R+Y+ F E+L EA+ D IV +HA M
Sbjct: 367 DAETVKNIM---VK-------------DRRTYSDCFVEALVMEAEKDRDIVVVHAGMEMD 410
Query: 122 TGLNLFQKHFPERCFDVGIAEQHAVTFAAGLAAEGLKPFCAIYSSFLQRGYDQVAHDVDL 181
L FQ+ FP+R F+VG+AEQHAVTF+AGL++ GLKPFC I S+FLQR YDQV HDVD
Sbjct: 411 PSLLTFQERFPDRFFNVGMAEQHAVTFSAGLSSGGLKPFCIIPSAFLQRAYDQVVHDVDR 470
Query: 182 QKLPVRFAIDRAGLVGADGPTHCGAFDTTFMACLPNMVVMAPSDETELMHMVATAAAIDD 241
Q+ VRF I AGLVG+DGP CGAFD FM+ LPNM+ MAP+DE EL++MVATAA + D
Sbjct: 471 QRKAVRFVITSAGLVGSDGPVQCGAFDIAFMSSLPNMIAMAPADEDELVNMVATAAYVTD 530
Query: 242 RPSCFRYPRGN 252
RP CFR+PRG+
Sbjct: 531 RPVCFRFPRGS 541
>AT5G50850.1 | Symbols: MAB1 | Transketolase family protein |
chr5:20689671-20692976 FORWARD LENGTH=363
Length = 363
Score = 64.3 bits (155), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 107/260 (41%), Gaps = 26/260 (10%)
Query: 100 SLTAEADADERIVAIHAAMGGGTGL-----NLFQKHFPERCFDVGIAEQHAVTFAAGLAA 154
++ E AD ++ + +G G L +K+ PER +D I E G A
Sbjct: 45 AIDEEMSADPKVFVMGEEVGQYQGAYKITKGLLEKYGPERVYDTPITEAGFTGIGVGAAY 104
Query: 155 EGLKPFCAIYS-SFLQRGYDQVAHDVDLQK--------LPVRFAIDRAGLVGADGPTHCG 205
GLKP + +F + D + + +P+ F G G H
Sbjct: 105 AGLKPVVEFMTFNFSMQAIDHIINSAAKSNYMSAGQINVPIVFRGPNGAAAGV-GAQHSQ 163
Query: 206 AFDTTFMACLPNMVVMAPSDETELMHMVATAAAIDDRPSCFRYPRGNGI--GSILPPNNK 263
+ + A +P + V+AP + ++ AA D P F N + G P + +
Sbjct: 164 CY-AAWYASVPGLKVLAPYSAEDARGLL-KAAIRDPDPVVFL---ENELLYGESFPISEE 218
Query: 264 GTP----LEVGKGRVLKEGSRVALVGYGTMVQSCMEAAKVLDAHGISTTVADARFCKPLD 319
L +GK ++ +EG V +V + MV ++AA+ L GIS V + R +PLD
Sbjct: 219 ALDSSFCLPIGKAKIEREGKDVTIVTFSKMVGFALKAAEKLAEEGISAEVINLRSIRPLD 278
Query: 320 GNLMRQLAREHEILITVEEG 339
+ R+ L+TVEEG
Sbjct: 279 RATINASVRKTSRLVTVEEG 298
>AT1G55510.1 | Symbols: BCDH BETA1 | branched-chain alpha-keto acid
decarboxylase E1 beta subunit | chr1:20723482-20725505
FORWARD LENGTH=352
Length = 352
Score = 58.9 bits (141), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/301 (23%), Positives = 126/301 (41%), Gaps = 39/301 (12%)
Query: 71 HGVVKFDPKSGKQLKPKTSTRSYTQYFAESLTAEADADERIVAIHAAMGGG------TGL 124
HG + ++GK L ++ ++L D D R +G G TGL
Sbjct: 19 HGARRVSTETGKPLNLYSAIN-------QALHIALDTDPRSYVFGEDVGFGGVFRCTTGL 71
Query: 125 NLFQKHFPERCFDVGIAEQHAVTFAAGLAAEGLKPFCAI-YSSFLQRGYDQVAHDVDLQK 183
++ R F+ + EQ V F GLAA G + I ++ ++ +DQ+ ++ +
Sbjct: 72 A--ERFGKNRVFNTPLCEQGIVGFGIGLAAMGNRAIVEIQFADYIYPAFDQIVNEAAKFR 129
Query: 184 LPVRFAIDRAGL--------VGADGPTHCGAFDTTFMACLPNMVVMAPSDETELMHMVAT 235
+ GL VG G H + F +P + V+ P E ++ +
Sbjct: 130 YRSGNQFNCGGLTIRAPYGAVGHGGHYHSQS-PEAFFCHVPGIKVVIPRSPREAKGLLLS 188
Query: 236 AAAIDDRPSCFRYPRGNGIGSI--LPPNNKGTPLEVGKGRVLKEGSRVALVGYG----TM 289
D P F P+ ++ +P ++ PL + V++EG+ + LVG+G M
Sbjct: 189 CIR-DPNPVVFFEPKWLYRQAVEEVPEHDYMIPLS--EAEVIREGNDITLVGWGAQLTVM 245
Query: 290 VQSCMEAAKVLDAHGISTTVADARFCKPLDGNLMR-QLAREHEILITVEEGSIGGFGSHV 348
Q+C++A K GIS + D + P D + + + +LI+ E GGFG+ +
Sbjct: 246 EQACLDAEK----EGISCELIDLKTLLPWDKETVEASVKKTGRLLISHEAPVTGGFGAEI 301
Query: 349 S 349
S
Sbjct: 302 S 302
>AT3G13450.1 | Symbols: DIN4 | Transketolase family protein |
chr3:4382340-4384295 REVERSE LENGTH=358
Length = 358
Score = 57.8 bits (138), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 130/313 (41%), Gaps = 43/313 (13%)
Query: 57 GKGYRPAEVATDKMHGVVKFDPKSGKQLKPKTSTRSYTQYFAESLTAEADADERIVAIHA 116
G GYR M V+ +SGK + ++ A + E D +
Sbjct: 19 GHGYR--------MLSTVENVSESGKSMNLYSAINQ-----ALHIALETDPRSYVFGEDV 65
Query: 117 AMGG----GTGLNLFQKHFPERCFDVGIAEQHAVTFAAGLAAEGLKPFCAI-YSSFLQRG 171
GG TGL ++ R F+ + EQ V F GLAA G + I ++ ++
Sbjct: 66 GFGGVFRCTTGLA--ERFGKSRVFNTPLCEQGIVGFGIGLAAMGNRVIAEIQFADYIFPA 123
Query: 172 YDQVAHDVDLQKLPVRFAIDRAGL--------VGADGPTHCGAFDTTFMACLPNMVVMAP 223
+DQ+ ++ + + GL VG G H + F +P + V+ P
Sbjct: 124 FDQIVNEAAKFRYRSGNQFNCGGLTIRAPYGAVGHGGHYHSQS-PEAFFCHVPGIKVVIP 182
Query: 224 SDETELMHMVATAAAIDDRPSCFRYPRGNGIGSI--LPPNNKGTPLEVGKGRVLKEGSRV 281
E ++ ++ D P F P+ ++ +P ++ PL + V++EGS +
Sbjct: 183 RSPREAKGLLLSSIR-DPNPVVFFEPKWLYRQAVEDVPEDDYMIPLS--EAEVMREGSDI 239
Query: 282 ALVGYGT----MVQSCMEAAKVLDAHGISTTVADARFCKPLDGNLMRQLAREH-EILITV 336
LVG+G M Q+C++A + GIS + D + P D ++ R+ +LI+
Sbjct: 240 TLVGWGAQLTIMEQACLDA----ENEGISCELIDLKTLIPWDKEIVETSVRKTGRLLISH 295
Query: 337 EEGSIGGFGSHVS 349
E GGFG+ ++
Sbjct: 296 EAPVTGGFGAEIA 308
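
Each alignment above carries a statistics line of the form "Identities = a/n (p%), Positives = b/n (q%)", with an optional "Gaps" field. The sketch below shows one way to extract those counts and recompute the percentages. The names `parse_alignment_stats` and `percent` are hypothetical. The percentages printed in this report are consistent with truncation rather than rounding (for example 331/399 is shown as 82%), so the sketch uses integer division; that is an inference from the values above, not a documented guarantee.

```python
import re

# Matches "Identities = 219/400", "Positives = 278/400", "Gaps = 22/400".
STAT_FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def parse_alignment_stats(line):
    """Return {"identities": (count, length), ...} for the fields present."""
    stats = {}
    for name, count, length in STAT_FIELD.findall(line):
        stats[name.lower()] = (int(count), int(length))
    return stats


def percent(count, length):
    """Whole-number percentage, truncated (assumed to mirror the report)."""
    return count * 100 // length


line = "Identities = 219/400 (54%), Positives = 278/400 (69%), Gaps = 22/400 (5%)"
stats = parse_alignment_stats(line)
for name, (count, length) in stats.items():
    print(name, count, length, percent(count, length))
```

Alignments without gaps omit the "Gaps" field entirely, so callers should treat a missing "gaps" key as zero gaps.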