Miyakogusa Predicted Gene
- Lj6g3v1946290.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1946290.1 Non Characterized Hit- tr|I1KRU1|I1KRU1_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,86.45,0,SUBFAMILY NOT
NAMED,NULL; DEHYDROGENASE RELATED,NULL; seg,NULL; Thiamin
diphosphate-binding fold (TH,CUFF.60241.1
(399 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value
AT2G34590.1 | Symbols: | Transketolase family protein | chr2:14...    619   e-177
AT1G30120.1 | Symbols: PDH-E1 BETA | pyruvate dehydrogenase E1 b...   612   e-175
AT5G50850.1 | Symbols: MAB1 | Transketolase family protein | chr...   240   1e-63
AT1G55510.1 | Symbols: BCDH BETA1 | branched-chain alpha-keto ac...   190   2e-48
AT3G13450.1 | Symbols: DIN4 | Transketolase family protein | chr...   188   5e-48
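Note on reading the hit list programmatically: each row above ends with a bit score and an E-value, and this legacy BLAST version abbreviates values such as 1e-177 to "e-177". The sketch below is one possible way to pull the three fields out of a row. The names SUMMARY_LINE, parse_evalue and parse_summary_line are hypothetical helpers invented for this illustration and are not part of BLAST or of any library.

```python
import re

# Hypothetical pattern: sequence id first, bit score and E-value as the last two fields.
SUMMARY_LINE = re.compile(
    r"^(?P<id>\S+)\s.*\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)

def parse_evalue(text):
    """Convert a BLAST E-value string to float, restoring a dropped leading '1'."""
    # Legacy BLAST prints 1e-177 as "e-177", which float() would reject.
    return float("1" + text if text.startswith("e") else text)

def parse_summary_line(line):
    """Return (sequence_id, bit_score, e_value), or None if the line does not match."""
    match = SUMMARY_LINE.match(line.strip())
    if match is None:
        return None
    return (
        match.group("id"),
        float(match.group("bits")),
        parse_evalue(match.group("evalue")),
    )

if __name__ == "__main__":
    row = "AT5G50850.1 | Symbols: MAB1 | Transketolase family protein | chr... 240 1e-63"
    print(parse_summary_line(row))
```

The pattern only relies on whitespace between fields, so it should tolerate rows whose column padding has been collapsed.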
>AT2G34590.1 | Symbols: | Transketolase family protein |
chr2:14568956-14570844 REVERSE LENGTH=406
Length = 406
Score = 619 bits (1596), Expect = e-177, Method: Compositional matrix adjust.
Identities = 315/407 (77%), Positives = 329/407 (80%), Gaps = 9/407 (2%)
Query: 1 MATLFQGVGAATAFSA-----SNKLHLPSRGSLSESKGSIFVVRSDAWMNNLLNLEARQP 55
M+ + QG GAATA S SNKL PSR SLS V SD+ +L AR+
Sbjct: 1 MSAILQGAGAATALSPFNSIDSNKLVAPSRSSLSVRSKRYIVAGSDSKSFGS-SLVARRS 59
Query: 56 QRLITSAVATKAD---SSASTKTGHXXXXXXXXXXXXXXXXXRDPRVCVMGEDVGDYGGS 112
+ LI +AV TKAD SS S+K GH RDP VCVMGEDVG YGGS
Sbjct: 60 EPLIPNAVTTKADTAASSTSSKPGHELLLFEALQEGLEEEMDRDPHVCVMGEDVGHYGGS 119
Query: 113 YKVTKGLAPKFGDLRVLDTPIAENAFTGMGIGAAMTGLRPIIEGMNMGFLLLAFNQISNN 172
YKVTKGLA KFGDLRVLDTPI ENAFTGMGIGAAMTGLRP+IEGMNMGFLLLAFNQISNN
Sbjct: 120 YKVTKGLADKFGDLRVLDTPICENAFTGMGIGAAMTGLRPVIEGMNMGFLLLAFNQISNN 179
Query: 173 CGMLHYTSGGQFKXXXXXXXXXXXXXQLGAEHSQRLESYFQSIPGIQMVACSTPYNAKGL 232
CGMLHYTSGGQF QLGAEHSQRLESYFQSIPGIQMVACSTPYNAKGL
Sbjct: 180 CGMLHYTSGGQFTIPVVIRGPGGVGRQLGAEHSQRLESYFQSIPGIQMVACSTPYNAKGL 239
Query: 233 MKAAIRSENPVILFEHVLLYNLKERIPDEEYVLSLEEAEMVRPGEHITILTYSRMRYHVM 292
MKAAIRSENPVILFEHVLLYNLKE IPDEEY+ +LEEAEMVRPGEHITILTYSRMRYHVM
Sbjct: 240 MKAAIRSENPVILFEHVLLYNLKESIPDEEYICNLEEAEMVRPGEHITILTYSRMRYHVM 299
Query: 293 QAAKTLVNKGYDPEVIDIRSLKPFDLHTIGNSVKKTHRVLIVEECMRTGGIGASLTAAIT 352
QAAKTLVNKGYDPEVIDIRSLKPFDL+TIGNSVKKTHRVLIVEECMRTGGIGASLTAAI
Sbjct: 300 QAAKTLVNKGYDPEVIDIRSLKPFDLYTIGNSVKKTHRVLIVEECMRTGGIGASLTAAIN 359
Query: 353 ENFNDYLDAPVVCLSSQDVPTPYTGPLEEWTVVQPAQIVTAVEQLCQ 399
ENF+DYLDAPV+CLSSQDVPTPY G LEEWTVVQPAQIVTAVEQLCQ
Sbjct: 360 ENFHDYLDAPVMCLSSQDVPTPYAGTLEEWTVVQPAQIVTAVEQLCQ 406
>AT1G30120.1 | Symbols: PDH-E1 BETA | pyruvate dehydrogenase E1 beta
| chr1:10584350-10586477 REVERSE LENGTH=406
Length = 406
Score = 612 bits (1577), Expect = e-175, Method: Compositional matrix adjust.
Identities = 314/408 (76%), Positives = 331/408 (81%), Gaps = 11/408 (2%)
Query: 1 MATLFQGVGAATA----FSA--SNKLHL-PSRGSLSESKGSIFVVRSDAWMNNL-LNLEA 52
M+++ G GAAT F++ S KL + PSR +LS V SDA + L
Sbjct: 1 MSSIIHGAGAATTTLSTFNSVDSKKLFVAPSRTNLSVRSQRYIVAGSDASKKSFGSGLRV 60
Query: 53 RQPQRLITSAVATK-ADSSASTKTGHXXXXXXXXXXXXXXXXXRDPRVCVMGEDVGDYGG 111
R Q+LI +AVATK AD+SAST GH RDP VCVMGEDVG YGG
Sbjct: 61 RHSQKLIPNAVATKEADTSAST--GHELLLFEALQEGLEEEMDRDPHVCVMGEDVGHYGG 118
Query: 112 SYKVTKGLAPKFGDLRVLDTPIAENAFTGMGIGAAMTGLRPIIEGMNMGFLLLAFNQISN 171
SYKVTKGLA KFGDLRVLDTPI ENAFTGMGIGAAMTGLRP+IEGMNMGFLLLAFNQISN
Sbjct: 119 SYKVTKGLADKFGDLRVLDTPICENAFTGMGIGAAMTGLRPVIEGMNMGFLLLAFNQISN 178
Query: 172 NCGMLHYTSGGQFKXXXXXXXXXXXXXQLGAEHSQRLESYFQSIPGIQMVACSTPYNAKG 231
NCGMLHYTSGGQF QLGAEHSQRLESYFQSIPGIQMVACSTPYNAKG
Sbjct: 179 NCGMLHYTSGGQFTIPVVIRGPGGVGRQLGAEHSQRLESYFQSIPGIQMVACSTPYNAKG 238
Query: 232 LMKAAIRSENPVILFEHVLLYNLKERIPDEEYVLSLEEAEMVRPGEHITILTYSRMRYHV 291
LMKAAIRSENPVILFEHVLLYNLKE+IPDE+YV +LEEAEMVRPGEHITILTYSRMRYHV
Sbjct: 239 LMKAAIRSENPVILFEHVLLYNLKEKIPDEDYVCNLEEAEMVRPGEHITILTYSRMRYHV 298
Query: 292 MQAAKTLVNKGYDPEVIDIRSLKPFDLHTIGNSVKKTHRVLIVEECMRTGGIGASLTAAI 351
MQAAKTLVNKGYDPEVIDIRSLKPFDLHTIGNSVKKTHRVLIVEECMRTGGIGASLTAAI
Sbjct: 299 MQAAKTLVNKGYDPEVIDIRSLKPFDLHTIGNSVKKTHRVLIVEECMRTGGIGASLTAAI 358
Query: 352 TENFNDYLDAPVVCLSSQDVPTPYTGPLEEWTVVQPAQIVTAVEQLCQ 399
ENF+DYLDAPV+CLSSQDVPTPY G LEEWTVVQPAQIVTAVEQLCQ
Sbjct: 359 NENFHDYLDAPVMCLSSQDVPTPYAGTLEEWTVVQPAQIVTAVEQLCQ 406
>AT5G50850.1 | Symbols: MAB1 | Transketolase family protein |
chr5:20689671-20692976 FORWARD LENGTH=363
Length = 363
Score = 240 bits (613), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 185/307 (60%), Gaps = 4/307 (1%)
Query: 96 DPRVCVMGEDVGDYGGSYKVTKGLAPKFGDLRVLDTPIAENAFTGMGIGAAMTGLRPIIE 155
DP+V VMGE+VG Y G+YK+TKGL K+G RV DTPI E FTG+G+GAA GL+P++E
Sbjct: 53 DPKVFVMGEEVGQYQGAYKITKGLLEKYGPERVYDTPITEAGFTGIGVGAAYAGLKPVVE 112
Query: 156 GMNMGFLLLAFNQISNNCGMLHYTSGGQFKXXXXXXXXXXXXXQLGAEHSQRLESYFQSI 215
M F + A + I N+ +Y S GQ +GA+HSQ +++ S+
Sbjct: 113 FMTFNFSMQAIDHIINSAAKSNYMSAGQINVPIVFRGPNGAAAGVGAQHSQCYAAWYASV 172
Query: 216 PGIQMVACSTPYNAKGLMKAAIRSENPVILFEHVLLYN----LKERIPDEEYVLSLEEAE 271
PG++++A + +A+GL+KAAIR +PV+ E+ LLY + E D + L + +A+
Sbjct: 173 PGLKVLAPYSAEDARGLLKAAIRDPDPVVFLENELLYGESFPISEEALDSSFCLPIGKAK 232
Query: 272 MVRPGEHITILTYSRMRYHVMQAAKTLVNKGYDPEVIDIRSLKPFDLHTIGNSVKKTHRV 331
+ R G+ +TI+T+S+M ++AA+ L +G EVI++RS++P D TI SV+KT R+
Sbjct: 233 IEREGKDVTIVTFSKMVGFALKAAEKLAEEGISAEVINLRSIRPLDRATINASVRKTSRL 292
Query: 332 LIVEECMRTGGIGASLTAAITENFNDYLDAPVVCLSSQDVPTPYTGPLEEWTVVQPAQIV 391
+ VEE G+ A + A++ E YLDAPV ++ DVP PY LE + Q IV
Sbjct: 293 VTVEEGFPQHGVCAEICASVVEESFSYLDAPVERIAGADVPMPYAANLERLALPQIEDIV 352
Query: 392 TAVEQLC 398
A ++ C
Sbjct: 353 RASKRAC 359
>AT1G55510.1 | Symbols: BCDH BETA1 | branched-chain alpha-keto acid
decarboxylase E1 beta subunit | chr1:20723482-20725505
FORWARD LENGTH=352
Length = 352
Score = 190 bits (482), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 114/305 (37%), Positives = 157/305 (51%), Gaps = 3/305 (0%)
Query: 72 STKTGHXXXXXXXXXXXXXXXXXRDPRVCVMGEDVGDYGGSYKVTKGLAPKFGDLRVLDT 131
ST+TG DPR V GEDVG +GG ++ T GLA +FG RV +T
Sbjct: 25 STETGKPLNLYSAINQALHIALDTDPRSYVFGEDVG-FGGVFRCTTGLAERFGKNRVFNT 83
Query: 132 PIAENAFTGMGIGAAMTGLRPIIEGMNMGFLLLAFNQISNNCGMLHYTSGGQFKXXXXXX 191
P+ E G GIG A G R I+E ++ AF+QI N Y SG QF
Sbjct: 84 PLCEQGIVGFGIGLAAMGNRAIVEIQFADYIYPAFDQIVNEAAKFRYRSGNQFNCGGLTI 143
Query: 192 XXXXXXXQLGAE-HSQRLESYFQSIPGIQMVACSTPYNAKGLMKAAIRSENPVILFEHVL 250
G HSQ E++F +PGI++V +P AKGL+ + IR NPV+ FE
Sbjct: 144 RAPYGAVGHGGHYHSQSPEAFFCHVPGIKVVIPRSPREAKGLLLSCIRDPNPVVFFEPKW 203
Query: 251 LYNLK-ERIPDEEYVLSLEEAEMVRPGEHITILTYSRMRYHVMQAAKTLVNKGYDPEVID 309
LY E +P+ +Y++ L EAE++R G IT++ + + QA +G E+ID
Sbjct: 204 LYRQAVEEVPEHDYMIPLSEAEVIREGNDITLVGWGAQLTVMEQACLDAEKEGISCELID 263
Query: 310 IRSLKPFDLHTIGNSVKKTHRVLIVEECMRTGGIGASLTAAITENFNDYLDAPVVCLSSQ 369
+++L P+D T+ SVKKT R+LI E TGG GA ++A I E L+APV +
Sbjct: 264 LKTLLPWDKETVEASVKKTGRLLISHEAPVTGGFGAEISATILERCFLKLEAPVSRVCGL 323
Query: 370 DVPTP 374
D P P
Sbjct: 324 DTPFP 328
>AT3G13450.1 | Symbols: DIN4 | Transketolase family protein |
chr3:4382340-4384295 REVERSE LENGTH=358
Length = 358
Score = 188 bits (478), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 152/281 (54%), Gaps = 3/281 (1%)
Query: 96 DPRVCVMGEDVGDYGGSYKVTKGLAPKFGDLRVLDTPIAENAFTGMGIGAAMTGLRPIIE 155
DPR V GEDVG +GG ++ T GLA +FG RV +TP+ E G GIG A G R I E
Sbjct: 55 DPRSYVFGEDVG-FGGVFRCTTGLAERFGKSRVFNTPLCEQGIVGFGIGLAAMGNRVIAE 113
Query: 156 GMNMGFLLLAFNQISNNCGMLHYTSGGQFKXXXXXXXXXXXXXQLGAE-HSQRLESYFQS 214
++ AF+QI N Y SG QF G HSQ E++F
Sbjct: 114 IQFADYIFPAFDQIVNEAAKFRYRSGNQFNCGGLTIRAPYGAVGHGGHYHSQSPEAFFCH 173
Query: 215 IPGIQMVACSTPYNAKGLMKAAIRSENPVILFEHVLLYNLK-ERIPDEEYVLSLEEAEMV 273
+PGI++V +P AKGL+ ++IR NPV+ FE LY E +P+++Y++ L EAE++
Sbjct: 174 VPGIKVVIPRSPREAKGLLLSSIRDPNPVVFFEPKWLYRQAVEDVPEDDYMIPLSEAEVM 233
Query: 274 RPGEHITILTYSRMRYHVMQAAKTLVNKGYDPEVIDIRSLKPFDLHTIGNSVKKTHRVLI 333
R G IT++ + + QA N+G E+ID+++L P+D + SV+KT R+LI
Sbjct: 234 REGSDITLVGWGAQLTIMEQACLDAENEGISCELIDLKTLIPWDKEIVETSVRKTGRLLI 293
Query: 334 VEECMRTGGIGASLTAAITENFNDYLDAPVVCLSSQDVPTP 374
E TGG GA + A I E L+APV + D P P
Sbjct: 294 SHEAPVTGGFGAEIAATIVERCFLRLEAPVSRVCGLDTPFP 334