Miyakogusa Predicted Gene
- Lj1g3v0912020.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0912020.1 Non Chatacterized Hit- tr|I1KAH2|I1KAH2_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,82.44,0,E1_dh,Dehydrogenase, E1 component; 2-OXOISOVALERATE
DEHYDROGENASE ALPHA SUBUNIT-RELATED,NULL; PYRUVA,CUFF.26509.1
(482 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G09300.1 | Symbols: | Thiamin diphosphate-binding fold (THDP... 642 0.0
AT5G09300.2 | Symbols: | Thiamin diphosphate-binding fold (THDP... 640 0.0
AT1G21400.1 | Symbols: | Thiamin diphosphate-binding fold (THDP... 638 0.0
AT5G34780.1 | Symbols: | Thiamin diphosphate-binding fold (THDP... 325 4e-89
AT1G59900.1 | Symbols: AT-E1 ALPHA, E1 ALPHA | pyruvate dehydrog... 142 5e-34
AT1G24180.1 | Symbols: IAR4 | Thiamin diphosphate-binding fold (... 136 4e-32
AT1G01090.1 | Symbols: PDH-E1 ALPHA | pyruvate dehydrogenase E1 ... 123 4e-28
>AT5G09300.1 | Symbols: | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr5:2884282-2886797 REVERSE LENGTH=472
Length = 472
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 290/398 (72%), Positives = 348/398 (87%)
Query: 85 DHDQVIDFPGGKVAFTSEMRFISESSGKRVPCYRVLDDNGEPMNHSNYVQVSKEMAVKMY 144
++ QV+DFPGGKVAFT E++FISES +RVPCYRVLDDNG+ + +S +VQVS+E+AVK+Y
Sbjct: 75 NNHQVMDFPGGKVAFTPEIQFISESDKERVPCYRVLDDNGQLITNSQFVQVSEEVAVKIY 134
Query: 145 SEMVTLQTMDSIFYEVQRQGRISFYLTSMGEEAVNIXXXXXXXXDDIVLPQYREPGVLLW 204
S+MVTLQ MD+IFYE QRQGR+SFY T++GEEA+NI D++ PQYREPGVLLW
Sbjct: 135 SDMVTLQIMDNIFYEAQRQGRLSFYATAIGEEAINIASAAALTPQDVIFPQYREPGVLLW 194
Query: 205 RGFTLQQFANQCFGNTSDLGKGRQMPIHYGSNELNYFTISSPIATQLPQAVGAAYSLKMD 264
RGFTLQ+FANQCFGN SD GKGRQMP+HYGSN+LNYFT+S+ IATQLP AVGAAYSLKMD
Sbjct: 195 RGFTLQEFANQCFGNKSDYGKGRQMPVHYGSNKLNYFTVSATIATQLPNAVGAAYSLKMD 254
Query: 265 GKSACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNNGWAISTPTEEQFRSDGIVV 324
K ACAVT+ GDGGTSEGDFHA +N AAVMEAPV+FICRNNGWAISTPT +QFRSDG+VV
Sbjct: 255 KKDACAVTYFGDGGTSEGDFHAALNIAAVMEAPVLFICRNNGWAISTPTSDQFRSDGVVV 314
Query: 325 KGQAYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIEALTYRVGHHSTSDDSTKYR 384
KG+AYGI SIRVDGNDALA+YSAVHTARE+AIREQRP+LIEALTYRVGHHSTSDDST+YR
Sbjct: 315 KGRAYGIRSIRVDGNDALAMYSAVHTAREMAIREQRPILIEALTYRVGHHSTSDDSTRYR 374
Query: 385 PVDEIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVRKQLMNAIQVAEKAQKPPLE 444
EIE+W ARNP++RF+ W+E NGWWSDK E +LRS ++K+++ A++VAEK +KP L+
Sbjct: 375 SAGEIEWWNKARNPLSRFRTWIESNGWWSDKTESDLRSRIKKEMLEALRVAEKTEKPNLQ 434
Query: 445 DLFHDVYDQVPSNLQEQENLLRETIKKNPKDYPSDVPL 482
++F DVYD PSNL+EQE L+R+TI +P+DYPSDVPL
Sbjct: 435 NMFSDVYDVPPSNLREQELLVRQTINSHPQDYPSDVPL 472
>AT5G09300.2 | Symbols: | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr5:2884282-2886291 REVERSE LENGTH=401
Length = 401
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 290/395 (73%), Positives = 346/395 (87%)
Query: 88 QVIDFPGGKVAFTSEMRFISESSGKRVPCYRVLDDNGEPMNHSNYVQVSKEMAVKMYSEM 147
QV+DFPGGKVAFT E++FISES +RVPCYRVLDDNG+ + +S +VQVS+E+AVK+YS+M
Sbjct: 7 QVMDFPGGKVAFTPEIQFISESDKERVPCYRVLDDNGQLITNSQFVQVSEEVAVKIYSDM 66
Query: 148 VTLQTMDSIFYEVQRQGRISFYLTSMGEEAVNIXXXXXXXXDDIVLPQYREPGVLLWRGF 207
VTLQ MD+IFYE QRQGR+SFY T++GEEA+NI D++ PQYREPGVLLWRGF
Sbjct: 67 VTLQIMDNIFYEAQRQGRLSFYATAIGEEAINIASAAALTPQDVIFPQYREPGVLLWRGF 126
Query: 208 TLQQFANQCFGNTSDLGKGRQMPIHYGSNELNYFTISSPIATQLPQAVGAAYSLKMDGKS 267
TLQ+FANQCFGN SD GKGRQMP+HYGSN+LNYFT+S+ IATQLP AVGAAYSLKMD K
Sbjct: 127 TLQEFANQCFGNKSDYGKGRQMPVHYGSNKLNYFTVSATIATQLPNAVGAAYSLKMDKKD 186
Query: 268 ACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNNGWAISTPTEEQFRSDGIVVKGQ 327
ACAVT+ GDGGTSEGDFHA +N AAVMEAPV+FICRNNGWAISTPT +QFRSDG+VVKG+
Sbjct: 187 ACAVTYFGDGGTSEGDFHAALNIAAVMEAPVLFICRNNGWAISTPTSDQFRSDGVVVKGR 246
Query: 328 AYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIEALTYRVGHHSTSDDSTKYRPVD 387
AYGI SIRVDGNDALA+YSAVHTARE+AIREQRP+LIEALTYRVGHHSTSDDST+YR
Sbjct: 247 AYGIRSIRVDGNDALAMYSAVHTAREMAIREQRPILIEALTYRVGHHSTSDDSTRYRSAG 306
Query: 388 EIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVRKQLMNAIQVAEKAQKPPLEDLF 447
EIE+W ARNP++RF+ W+E NGWWSDK E +LRS ++K+++ A++VAEK +KP L+++F
Sbjct: 307 EIEWWNKARNPLSRFRTWIESNGWWSDKTESDLRSRIKKEMLEALRVAEKTEKPNLQNMF 366
Query: 448 HDVYDQVPSNLQEQENLLRETIKKNPKDYPSDVPL 482
DVYD PSNL+EQE L+R+TI +P+DYPSDVPL
Sbjct: 367 SDVYDVPPSNLREQELLVRQTINSHPQDYPSDVPL 401
>AT1G21400.1 | Symbols: | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr1:7493492-7496240 FORWARD LENGTH=472
Length = 472
Score = 638 bits (1646), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/425 (71%), Positives = 353/425 (83%), Gaps = 6/425 (1%)
Query: 59 NNSATISYFS--HYESTKAEAQ---LELQEDDHD-QVIDFPGGKVAFTSEMRFISESSGK 112
+++A +S F +EST E Q L Q D+ D Q +DFPGGKV +TSEM+FI ESS +
Sbjct: 43 SSTAYLSPFGSLRHESTAVETQADHLVQQIDEVDAQELDFPGGKVGYTSEMKFIPESSSR 102
Query: 113 RVPCYRVLDDNGEPMNHSNYVQVSKEMAVKMYSEMVTLQTMDSIFYEVQRQGRISFYLTS 172
R+PCYRVLD++G + S+++ VS+++AV+MY +M TLQ MD IFYE QRQGRISFYLTS
Sbjct: 103 RIPCYRVLDEDGRIIPDSDFIPVSEKLAVRMYEQMATLQVMDHIFYEAQRQGRISFYLTS 162
Query: 173 MGEEAVNIXXXXXXXXDDIVLPQYREPGVLLWRGFTLQQFANQCFGNTSDLGKGRQMPIH 232
+GEEA+NI DD+VLPQYREPGVLLWRGFTL++FANQCFGN +D GKGRQMPIH
Sbjct: 163 VGEEAINIASAAALSPDDVVLPQYREPGVLLWRGFTLEEFANQCFGNKADYGKGRQMPIH 222
Query: 233 YGSNELNYFTISSPIATQLPQAVGAAYSLKMDGKSACAVTFCGDGGTSEGDFHAGMNFAA 292
YGSN LNYFTISSPIATQLPQA G YSLKMD K+AC VTF GDGGTSEGDFHAG+NFAA
Sbjct: 223 YGSNRLNYFTISSPIATQLPQAAGVGYSLKMDKKNACTVTFIGDGGTSEGDFHAGLNFAA 282
Query: 293 VMEAPVIFICRNNGWAISTPTEEQFRSDGIVVKGQAYGIWSIRVDGNDALAVYSAVHTAR 352
VMEAPV+FICRNNGWAIST EQFRSDGIVVKGQAYGI SIRVDGNDALAVYSAV +AR
Sbjct: 283 VMEAPVVFICRNNGWAISTHISEQFRSDGIVVKGQAYGIRSIRVDGNDALAVYSAVRSAR 342
Query: 353 EIAIREQRPVLIEALTYRVGHHSTSDDSTKYRPVDEIEYWKMARNPVNRFKRWVEMNGWW 412
E+A+ EQRPVLIE +TYRVGHHSTSDDSTKYR DEI+YWKM+RNPVNRF++WVE NGWW
Sbjct: 343 EMAVTEQRPVLIEMMTYRVGHHSTSDDSTKYRAADEIQYWKMSRNPVNRFRKWVEDNGWW 402
Query: 413 SDKDELELRSSVRKQLMNAIQVAEKAQKPPLEDLFHDVYDQVPSNLQEQENLLRETIKKN 472
S++DE +LRS+ RKQL+ AIQ AEK +K PL +LF+DVYD P NL+EQE L+E +KK
Sbjct: 403 SEEDESKLRSNARKQLLQAIQAAEKWEKQPLTELFNDVYDVKPKNLEEQELGLKELVKKQ 462
Query: 473 PKDYP 477
P+DYP
Sbjct: 463 PQDYP 467
>AT5G34780.1 | Symbols: | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr5:12961682-12963892 REVERSE LENGTH=365
Length = 365
Score = 325 bits (833), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 159/211 (75%), Positives = 183/211 (86%)
Query: 266 KSACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNNGWAISTPTEEQFRSDGIVVK 325
K+ACAVTF GDGGTSEGDFHAG+NFAAVMEAPV+FICRNNGWAIST EQFRSDGIVVK
Sbjct: 26 KNACAVTFIGDGGTSEGDFHAGLNFAAVMEAPVVFICRNNGWAISTHISEQFRSDGIVVK 85
Query: 326 GQAYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIEALTYRVGHHSTSDDSTKYRP 385
GQAYGI SIRVDGNDALAVYSAV +ARE+A+ EQRPVLIE + YRVGHHSTSDDSTKYR
Sbjct: 86 GQAYGIRSIRVDGNDALAVYSAVCSAREMAVTEQRPVLIEMMIYRVGHHSTSDDSTKYRA 145
Query: 386 VDEIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVRKQLMNAIQVAEKAQKPPLED 445
DEI+YWKM+RN VNRF++ VE NGWWS++DE +LRS+ RKQL+ AIQ AEK +K PL +
Sbjct: 146 ADEIQYWKMSRNSVNRFRKSVEDNGWWSEEDESKLRSNARKQLLQAIQAAEKWEKQPLTE 205
Query: 446 LFHDVYDQVPSNLQEQENLLRETIKKNPKDY 476
LF+DVYD P NL+E+E L+E I+K P+DY
Sbjct: 206 LFNDVYDVKPKNLEEEELGLKELIEKQPQDY 236
>AT1G59900.1 | Symbols: AT-E1 ALPHA, E1 ALPHA | pyruvate
dehydrogenase complex E1 alpha subunit |
chr1:22051368-22053660 FORWARD LENGTH=389
Length = 389
Score = 142 bits (358), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 87/326 (26%), Positives = 157/326 (48%), Gaps = 10/326 (3%)
Query: 130 SNYVQVSKEMAVKMYSEMVTLQTM----DSIFYEVQRQGRISFYLTSMGEEAVNIXXXXX 185
S V+ S + + + M ++ M DS++ +G Y G+EAV I
Sbjct: 49 SRSVESSSQELLDFFRTMALMRRMEIAADSLYKAKLIRGFCHLY---DGQEAVAIGMEAA 105
Query: 186 XXXDDIVLPQYREPGVLLWRGFTLQQFANQCFGNTSDLGKGRQMPIHYGSNELNYFTISS 245
D ++ YR+ + L RG +L + ++ G + KG+ +H+ E +++
Sbjct: 106 ITKKDAIITAYRDHCIFLGRGGSLHEVFSELMGRQAGCSKGKGGSMHFYKKESSFYGGHG 165
Query: 246 PIATQLPQAVGAAYSLKMDGKSACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNN 305
+ Q+P G A++ K + + A GDG ++G +N +A+ + P I +C NN
Sbjct: 166 IVGAQVPLGCGIAFAQKYNKEEAVTFALYGDGAANQGQLFEALNISALWDLPAILVCENN 225
Query: 306 GWAISTPTEEQFRSDGIVVKGQAYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIE 365
+ + T +S +G + ++VDG DA AV A A++ A+ E+ P+++E
Sbjct: 226 HYGMGTAEWRAAKSPSYYKRGDY--VPGLKVDGMDAFAVKQACKFAKQHAL-EKGPIILE 282
Query: 366 ALTYRVGHHSTSDDSTKYRPVDEIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVR 425
TYR HS SD + YR DEI + R+P+ R K+ V + ++K+ ++ +R
Sbjct: 283 MDTYRYHGHSMSDPGSTYRTRDEISGVRQERDPIERIKKLVLSHDLATEKELKDMEKEIR 342
Query: 426 KQLMNAIQVAEKAQKPPLEDLFHDVY 451
K++ +AI A+ P +LF +VY
Sbjct: 343 KEVDDAIAKAKDCPMPEPSELFTNVY 368
>AT1G24180.1 | Symbols: IAR4 | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr1:8560777-8563382 REVERSE LENGTH=393
Length = 393
Score = 136 bits (342), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 156/326 (47%), Gaps = 10/326 (3%)
Query: 130 SNYVQVSKEMAVKMYSEMVTLQTM----DSIFYEVQRQGRISFYLTSMGEEAVNIXXXXX 185
S V+ S E + + +M ++ M DS++ +G Y G+EA+ +
Sbjct: 53 SRSVETSSEEILAFFRDMARMRRMEIAADSLYKAKLIRGFCHLY---DGQEALAVGMEAA 109
Query: 186 XXXDDIVLPQYREPGVLLWRGFTLQQFANQCFGNTSDLGKGRQMPIHYGSNELNYFTISS 245
D ++ YR+ + RG L ++ G + G+ +H+ + +++
Sbjct: 110 ITKKDAIITSYRDHCTFIGRGGKLVDAFSELMGRKTGCSHGKGGSMHFYKKDASFYGGHG 169
Query: 246 PIATQLPQAVGAAYSLKMDGKSACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNN 305
+ Q+P G A++ K + A GDG ++G +N +A+ + P I +C NN
Sbjct: 170 IVGAQIPLGCGLAFAQKYNKDEAVTFALYGDGAANQGQLFEALNISALWDLPAILVCENN 229
Query: 306 GWAISTPTEEQFRSDGIVVKGQAYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIE 365
+ + T T +S +G + ++VDG DALAV A A+E A++ P+++E
Sbjct: 230 HYGMGTATWRSAKSPAYFKRGDY--VPGLKVDGMDALAVKQACKFAKEHALKNG-PIILE 286
Query: 366 ALTYRVGHHSTSDDSTKYRPVDEIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVR 425
TYR HS SD + YR DEI + R+P+ R ++ + + ++K+ ++ +R
Sbjct: 287 MDTYRYHGHSMSDPGSTYRTRDEISGVRQVRDPIERVRKLLLTHDIATEKELKDMEKEIR 346
Query: 426 KQLMNAIQVAEKAQKPPLEDLFHDVY 451
K++ +A+ A+++ P +LF ++Y
Sbjct: 347 KEVDDAVAQAKESPIPDASELFTNMY 372
>AT1G01090.1 | Symbols: PDH-E1 ALPHA | pyruvate dehydrogenase E1
alpha | chr1:47705-49166 REVERSE LENGTH=428
Length = 428
Score = 123 bits (308), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 163/357 (45%), Gaps = 12/357 (3%)
Query: 104 RFISESSGKRVPCYRVLDDNGEPMNHSNY-VQVSKEMAVKMYSEMVTLQTMDSIFYEVQR 162
R ++ +R P V + E + +N + ++KE +++Y +M+ ++ + + ++
Sbjct: 47 RLNHSNATRRSPVVSVQEVVKEKQSTNNTSLLITKEEGLELYEDMILGRSFEDMCAQMYY 106
Query: 163 QGRI-SFYLTSMGEEAVNIXXXXXXXXDDIVLPQYREPGVLLWRGFTLQQFANQCFGNTS 221
+G++ F G+EAV+ D V+ YR+ L +G + + ++ FG +
Sbjct: 107 RGKMFGFVHLYNGQEAVSTGFIKLLTKSDSVVSTYRDHVHALSKGVSARAVMSELFGKVT 166
Query: 222 DLGKGRQMPIHYGSNELNYFTISSPIATQLPQAVGAAYS-------LKMDGKSACAVTFC 274
+G+ +H S E N + I +P A GAA+S LK D V F
Sbjct: 167 GCCRGQGGSMHMFSKEHNMLGGFAFIGEGIPVATGAAFSSKYRREVLKQDCDDV-TVAFF 225
Query: 275 GDGGTSEGDFHAGMNFAAVMEAPVIFICRNNGWAISTPTEEQFRSDGIVVKGQAYGIWSI 334
GDG + G F +N AA+ + P+IF+ NN WAI I KG A+G+ +
Sbjct: 226 GDGTCNNGQFFECLNMAALYKLPIIFVVENNLWAIGMSHLRATSDPEIWKKGPAFGMPGV 285
Query: 335 RVDGNDALAVYSAVHTAREIAIREQRPVLIEALTYRVGHHSTSDDSTKYRPVDEIEYWKM 394
VDG D L V A A R + P L+E TYR HS +D ++ +Y
Sbjct: 286 HVDGMDVLKVREVAKEAVTRARRGEGPTLVECETYRFRGHSLADPDELRDAAEKAKY--A 343
Query: 395 ARNPVNRFKRWVEMNGWWSDKDELELRSSVRKQLMNAIQVAEKAQKPPLEDLFHDVY 451
AR+P+ K+++ N + + + + + + A++ A+ + +P L +V+
Sbjct: 344 ARDPIAALKKYLIENKLAKEAELKSIEKKIDELVEEAVEFADASPQPGRSQLLENVF 400