Miyakogusa Predicted Gene
- Lj3g3v2517580.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2517580.1 Non Characterized Hit- tr|I1MKC6|I1MKC6_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.56117
PE,90.57,0,PYRUVATE DEHYDROGENASE E1 COMPONENT, ALPHA SUBUNIT,NULL;
PYRUVATE DEHYDROGENASE E1 COMPONENT, ALPHA ,CUFF.44115.1
(433 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value
AT1G01090.1 | Symbols: PDH-E1 ALPHA | pyruvate dehydrogenase E1 ...   675  0.0
AT1G59900.1 | Symbols: AT-E1 ALPHA, E1 ALPHA | pyruvate dehydrog...   246  2e-65
AT1G24180.1 | Symbols: IAR4 | Thiamin diphosphate-binding fold (...   235  5e-62
AT5G09300.1 | Symbols: | Thiamin diphosphate-binding fold (THDP...    130  2e-30
AT5G09300.2 | Symbols: | Thiamin diphosphate-binding fold (THDP...    130  3e-30
AT1G21400.1 | Symbols: | Thiamin diphosphate-binding fold (THDP...    119  3e-27
AT5G34780.1 | Symbols: | Thiamin diphosphate-binding fold (THDP...     84  2e-16
>AT1G01090.1 | Symbols: PDH-E1 ALPHA | pyruvate dehydrogenase E1
alpha | chr1:47705-49166 REVERSE LENGTH=428
Length = 428
Score = 675 bits (1742), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/436 (77%), Positives = 361/436 (82%), Gaps = 16/436 (3%)
Query: 5 ATKFSHLPPPINSTIPRSNDNKPLSFDVSRANPSSSFLGSARKVLRFNAGPAKVLAQXXX 64
AT F+ P + +T+P ++ R P SSFLGS R + + L
Sbjct: 2 ATAFA--PTKLTATVPLHGSHENRLLLPIRLAPPSSFLGSTRSL------SLRRLNHSNA 53
Query: 65 XXXXPAAAV---LLERTS----NLLITKEEGLELYEDMILGRFFEDKCAEMYYRGKMFGF 117
P +V + E+ S +LLITKEEGLELYEDMILGR FED CA+MYYRGKMFGF
Sbjct: 54 TRRSPVVSVQEVVKEKQSTNNTSLLITKEEGLELYEDMILGRSFEDMCAQMYYRGKMFGF 113
Query: 118 VHLYNGQEAVSTGFIKLLKKEDSVVSTYRDHVHALSKGVPARAVMSELFGKATGVCRGQG 177
VHLYNGQEAVSTGFIKLL K DSVVSTYRDHVHALSKGV ARAVMSELFGK TG CRGQG
Sbjct: 114 VHLYNGQEAVSTGFIKLLTKSDSVVSTYRDHVHALSKGVSARAVMSELFGKVTGCCRGQG 173
Query: 178 GSMHMFSKEHNVLGGFAFIGEGIPVATGAAFSSKYRREVLNQADCDHVTLAFFGDGTCNN 237
GSMHMFSKEHN+LGGFAFIGEGIPVATGAAFSSKYRREVL Q DCD VT+AFFGDGTCNN
Sbjct: 174 GSMHMFSKEHNMLGGFAFIGEGIPVATGAAFSSKYRREVLKQ-DCDDVTVAFFGDGTCNN 232
Query: 238 GQFYECLNMAALWKLPIVFVVENNLWAIGMSHLRATSDPEIWKKGPAFGMPGVHVDGMDV 297
GQF+ECLNMAAL+KLPI+FVVENNLWAIGMSHLRATSDPEIWKKGPAFGMPGVHVDGMDV
Sbjct: 233 GQFFECLNMAALYKLPIIFVVENNLWAIGMSHLRATSDPEIWKKGPAFGMPGVHVDGMDV 292
Query: 298 LKVREVAKEAIGRARRGEGPTLVECETYRFRGHSLADPDELRDPAEKEHYAGRDPITALK 357
LKVREVAKEA+ RARRGEGPTLVECETYRFRGHSLADPDELRD AEK YA RDPI ALK
Sbjct: 293 LKVREVAKEAVTRARRGEGPTLVECETYRFRGHSLADPDELRDAAEKAKYAARDPIAALK 352
Query: 358 KYIFENNLASEQELKAIEKKIDEVLEEAVEFADESPLPPRSQLLENVFADPKGFGIGPDG 417
KY+ EN LA E ELK+IEKKIDE++EEAVEFAD SP P RSQLLENVFADPKGFGIGPDG
Sbjct: 353 KYLIENKLAKEAELKSIEKKIDELVEEAVEFADASPQPGRSQLLENVFADPKGFGIGPDG 412
Query: 418 KYRCEDPKFTEGTAHV 433
+YRCEDPKFTEGTA V
Sbjct: 413 RYRCEDPKFTEGTAQV 428
>AT1G59900.1 | Symbols: AT-E1 ALPHA, E1 ALPHA | pyruvate
dehydrogenase complex E1 alpha subunit |
chr1:22051368-22053660 FORWARD LENGTH=389
Length = 389
Score = 246 bits (628), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 132/345 (38%), Positives = 199/345 (57%), Gaps = 27/345 (7%)
Query: 84 TKEEGLELYEDMILGRFFEDKCAEMYYRGKMFGFVHLYNGQEAVSTGFIKLLKKEDSVVS 143
+ +E L+ + M L R E +Y + GF HLY+GQEAV+ G + K+D++++
Sbjct: 55 SSQELLDFFRTMALMRRMEIAADSLYKAKLIRGFCHLYDGQEAVAIGMEAAITKKDAIIT 114
Query: 144 TYRDHVHALSKGVPARAVMSELFGKATGVCRGQGGSMHMFSKEHNVLGGFAFIGEGIPVA 203
YRDH L +G V SEL G+ G +G+GGSMH + KE + GG +G +P+
Sbjct: 115 AYRDHCIFLGRGGSLHEVFSELMGRQAGCSKGKGGSMHFYKKESSFYGGHGIVGAQVPLG 174
Query: 204 TGAAFSSKYRREVLNQADCDHVTLAFFGDGTCNNGQFYECLNMAALWKLPIVFVVENNLW 263
G AF+ KY +E + VT A +GDG N GQ +E LN++ALW LP + V ENN +
Sbjct: 175 CGIAFAQKYNKE-------EAVTFALYGDGAANQGQLFEALNISALWDLPAILVCENNHY 227
Query: 264 AIGMSHLRATSDPEIWKKGPAFGMPGVHVDGMDVLKVREVAKEAIGRARRGEGPTLVECE 323
+G + RA P +K+G +PG+ VDGMD V++ K A A +GP ++E +
Sbjct: 228 GMGTAEWRAAKSPSYYKRGDY--VPGLKVDGMDAFAVKQACKFAKQHALE-KGPIILEMD 284
Query: 324 TYRFRGHSLADP-------DELRDPAEKEHYAGRDPITALKKYIFENNLASEQELKAIEK 376
TYR+ GHS++DP DE+ ++ RDPI +KK + ++LA+E+ELK +EK
Sbjct: 285 TYRYHGHSMSDPGSTYRTRDEISGVRQE-----RDPIERIKKLVLSHDLATEKELKDMEK 339
Query: 377 KIDEVLEEAVEFADESPLPPRSQLLENVFADPKGFG---IGPDGK 418
+I + +++A+ A + P+P S+L NV+ KGFG GPD K
Sbjct: 340 EIRKEVDDAIAKAKDCPMPEPSELFTNVYV--KGFGTESFGPDRK 382
>AT1G24180.1 | Symbols: IAR4 | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr1:8560777-8563382 REVERSE LENGTH=393
Length = 393
Score = 235 bits (599), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 126/339 (37%), Positives = 193/339 (56%), Gaps = 15/339 (4%)
Query: 84 TKEEGLELYEDMILGRFFEDKCAEMYYRGKMFGFVHLYNGQEAVSTGFIKLLKKEDSVVS 143
+ EE L + DM R E +Y + GF HLY+GQEA++ G + K+D++++
Sbjct: 59 SSEEILAFFRDMARMRRMEIAADSLYKAKLIRGFCHLYDGQEALAVGMEAAITKKDAIIT 118
Query: 144 TYRDHVHALSKGVPARAVMSELFGKATGVCRGQGGSMHMFSKEHNVLGGFAFIGEGIPVA 203
+YRDH + +G SEL G+ TG G+GGSMH + K+ + GG +G IP+
Sbjct: 119 SYRDHCTFIGRGGKLVDAFSELMGRKTGCSHGKGGSMHFYKKDASFYGGHGIVGAQIPLG 178
Query: 204 TGAAFSSKYRREVLNQADCDHVTLAFFGDGTCNNGQFYECLNMAALWKLPIVFVVENNLW 263
G AF+ KY ++ + VT A +GDG N GQ +E LN++ALW LP + V ENN +
Sbjct: 179 CGLAFAQKYNKD-------EAVTFALYGDGAANQGQLFEALNISALWDLPAILVCENNHY 231
Query: 264 AIGMSHLRATSDPEIWKKGPAFGMPGVHVDGMDVLKVREVAKEAIGRARRGEGPTLVECE 323
+G + R+ P +K+G +PG+ VDGMD L V++ K A A + GP ++E +
Sbjct: 232 GMGTATWRSAKSPAYFKRGDY--VPGLKVDGMDALAVKQACKFAKEHALK-NGPIILEMD 288
Query: 324 TYRFRGHSLADPD---ELRDPAEKEHYAGRDPITALKKYIFENNLASEQELKAIEKKIDE 380
TYR+ GHS++DP RD RDPI ++K + +++A+E+ELK +EK+I +
Sbjct: 289 TYRYHGHSMSDPGSTYRTRDEISGVRQV-RDPIERVRKLLLTHDIATEKELKDMEKEIRK 347
Query: 381 VLEEAVEFADESPLPPRSQLLENVFADPKGF-GIGPDGK 418
+++AV A ESP+P S+L N++ G G D K
Sbjct: 348 EVDDAVAQAKESPIPDASELFTNMYVKDCGVESFGADRK 386
>AT5G09300.1 | Symbols: | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr5:2884282-2886797 REVERSE LENGTH=472
Length = 472
Score = 130 bits (326), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 160/336 (47%), Gaps = 14/336 (4%)
Query: 74 LLERTSNLLITKEEGLELYEDMILGRFFEDKCAEMYYRGKMFGFVHLYNGQEAVSTGFIK 133
L+ + + +++E +++Y DM+ + ++ E +G++ F G+EA++
Sbjct: 116 LITNSQFVQVSEEVAVKIYSDMVTLQIMDNIFYEAQRQGRL-SFYATAIGEEAINIASAA 174
Query: 134 LLKKEDSVVSTYRDHVHALSKGVPARAVMSELFGKATGVCRGQGGSMHMFSKEHNVLGGF 193
L +D + YR+ L +G + ++ FG + +G+ +H S + N
Sbjct: 175 ALTPQDVIFPQYREPGVLLWRGFTLQEFANQCFGNKSDYGKGRQMPVHYGSNKLNYFTVS 234
Query: 194 AFIGEGIPVATGAAFSSKYRREVLNQADCDHVTLAFFGDGTCNNGQFYECLNMAALWKLP 253
A I +P A GAA+S K ++ D + +FGDG + G F+ LN+AA+ + P
Sbjct: 235 ATIATQLPNAVGAAYSLKMDKK-------DACAVTYFGDGGTSEGDFHAALNIAAVMEAP 287
Query: 254 IVFVVENNLWAIGMSHLRATSDPEIWKKGPAFGMPGVHVDGMDVLKVREVAKEAIGRARR 313
++F+ NN WAI + KG A+G+ + VDG D L + A A R
Sbjct: 288 VLFICRNNGWAISTPTSDQFRSDGVVVKGRAYGIRSIRVDGNDALAMYSAVHTAREMAIR 347
Query: 314 GEGPTLVECETYRFRGHSLADPD-ELRDPAEKEHY-AGRDPITALKKYIFENNLASEQEL 371
+ P L+E TYR HS +D R E E + R+P++ + +I N S++
Sbjct: 348 EQRPILIEALTYRVGHHSTSDDSTRYRSAGEIEWWNKARNPLSRFRTWIESNGWWSDKTE 407
Query: 372 KAIEKKIDEVLEEAVEFADESPLPPRSQLLENVFAD 407
+ +I + + EA+ A+++ P L+N+F+D
Sbjct: 408 SDLRSRIKKEMLEALRVAEKTEKPN----LQNMFSD 439
>AT5G09300.2 | Symbols: | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr5:2884282-2886291 REVERSE LENGTH=401
Length = 401
Score = 130 bits (326), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 160/336 (47%), Gaps = 14/336 (4%)
Query: 74 LLERTSNLLITKEEGLELYEDMILGRFFEDKCAEMYYRGKMFGFVHLYNGQEAVSTGFIK 133
L+ + + +++E +++Y DM+ + ++ E +G++ F G+EA++
Sbjct: 45 LITNSQFVQVSEEVAVKIYSDMVTLQIMDNIFYEAQRQGRL-SFYATAIGEEAINIASAA 103
Query: 134 LLKKEDSVVSTYRDHVHALSKGVPARAVMSELFGKATGVCRGQGGSMHMFSKEHNVLGGF 193
L +D + YR+ L +G + ++ FG + +G+ +H S + N
Sbjct: 104 ALTPQDVIFPQYREPGVLLWRGFTLQEFANQCFGNKSDYGKGRQMPVHYGSNKLNYFTVS 163
Query: 194 AFIGEGIPVATGAAFSSKYRREVLNQADCDHVTLAFFGDGTCNNGQFYECLNMAALWKLP 253
A I +P A GAA+S K ++ D + +FGDG + G F+ LN+AA+ + P
Sbjct: 164 ATIATQLPNAVGAAYSLKMDKK-------DACAVTYFGDGGTSEGDFHAALNIAAVMEAP 216
Query: 254 IVFVVENNLWAIGMSHLRATSDPEIWKKGPAFGMPGVHVDGMDVLKVREVAKEAIGRARR 313
++F+ NN WAI + KG A+G+ + VDG D L + A A R
Sbjct: 217 VLFICRNNGWAISTPTSDQFRSDGVVVKGRAYGIRSIRVDGNDALAMYSAVHTAREMAIR 276
Query: 314 GEGPTLVECETYRFRGHSLADPD-ELRDPAEKEHY-AGRDPITALKKYIFENNLASEQEL 371
+ P L+E TYR HS +D R E E + R+P++ + +I N S++
Sbjct: 277 EQRPILIEALTYRVGHHSTSDDSTRYRSAGEIEWWNKARNPLSRFRTWIESNGWWSDKTE 336
Query: 372 KAIEKKIDEVLEEAVEFADESPLPPRSQLLENVFAD 407
+ +I + + EA+ A+++ P L+N+F+D
Sbjct: 337 SDLRSRIKKEMLEALRVAEKTEKPN----LQNMFSD 368
>AT1G21400.1 | Symbols: | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr1:7493492-7496240 FORWARD LENGTH=472
Length = 472
Score = 119 bits (299), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 148/328 (45%), Gaps = 16/328 (4%)
Query: 83 ITKEEGLELYEDMILGRFFEDKCAEMYYRGKMFGFVHLY---NGQEAVSTGFIKLLKKED 139
++++ + +YE M + + ++Y + G + Y G+EA++ L +D
Sbjct: 125 VSEKLAVRMYEQMATLQVMD----HIFYEAQRQGRISFYLTSVGEEAINIASAAALSPDD 180
Query: 140 SVVSTYRDHVHALSKGVPARAVMSELFGKATGVCRGQGGSMHMFSKEHNVLGGFAFIGEG 199
V+ YR+ L +G ++ FG +G+ +H S N + I
Sbjct: 181 VVLPQYREPGVLLWRGFTLEEFANQCFGNKADYGKGRQMPIHYGSNRLNYFTISSPIATQ 240
Query: 200 IPVATGAAFSSKYRREVLNQADCDHVTLAFFGDGTCNNGQFYECLNMAALWKLPIVFVVE 259
+P A G +S K ++ + T+ F GDG + G F+ LN AA+ + P+VF+
Sbjct: 241 LPQAAGVGYSLKMDKK-------NACTVTFIGDGGTSEGDFHAGLNFAAVMEAPVVFICR 293
Query: 260 NNLWAIGMSHLRATSDPEIWKKGPAFGMPGVHVDGMDVLKVREVAKEAIGRARRGEGPTL 319
NN WAI I KG A+G+ + VDG D L V + A A + P L
Sbjct: 294 NNGWAISTHISEQFRSDGIVVKGQAYGIRSIRVDGNDALAVYSAVRSAREMAVTEQRPVL 353
Query: 320 VECETYRFRGHSLADPDELRDPAEKEHY--AGRDPITALKKYIFENNLASEQELKAIEKK 377
+E TYR HS +D A++ Y R+P+ +K++ +N SE++ +
Sbjct: 354 IEMMTYRVGHHSTSDDSTKYRAADEIQYWKMSRNPVNRFRKWVEDNGWWSEEDESKLRSN 413
Query: 378 IDEVLEEAVEFADESPLPPRSQLLENVF 405
+ L +A++ A++ P ++L +V+
Sbjct: 414 ARKQLLQAIQAAEKWEKQPLTELFNDVY 441
>AT5G34780.1 | Symbols: | Thiamin diphosphate-binding fold
(THDP-binding) superfamily protein |
chr5:12961682-12963892 REVERSE LENGTH=365
Length = 365
Score = 84.0 bits (206), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 88/190 (46%), Gaps = 5/190 (2%)
Query: 221 DC---DHVTLAFFGDGTCNNGQFYECLNMAALWKLPIVFVVENNLWAIGMSHLRATSDPE 277
DC + + F GDG + G F+ LN AA+ + P+VF+ NN WAI
Sbjct: 22 DCWEKNACAVTFIGDGGTSEGDFHAGLNFAAVMEAPVVFICRNNGWAISTHISEQFRSDG 81
Query: 278 IWKKGPAFGMPGVHVDGMDVLKVREVAKEAIGRARRGEGPTLVECETYRFRGHSLADPDE 337
I KG A+G+ + VDG D L V A A + P L+E YR HS +D
Sbjct: 82 IVVKGQAYGIRSIRVDGNDALAVYSAVCSAREMAVTEQRPVLIEMMIYRVGHHSTSDDST 141
Query: 338 LRDPAEKEHY--AGRDPITALKKYIFENNLASEQELKAIEKKIDEVLEEAVEFADESPLP 395
A++ Y R+ + +K + +N SE++ + + L +A++ A++
Sbjct: 142 KYRAADEIQYWKMSRNSVNRFRKSVEDNGWWSEEDESKLRSNARKQLLQAIQAAEKWEKQ 201
Query: 396 PRSQLLENVF 405
P ++L +V+
Sbjct: 202 PLTELFNDVY 211