Miyakogusa Predicted Gene

Lj1g3v0912020.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v0912020.1 Non Characterized Hit- tr|I1KAH2|I1KAH2_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,82.44,0,E1_dh,Dehydrogenase, E1 component; 2-OXOISOVALERATE
DEHYDROGENASE ALPHA SUBUNIT-RELATED,NULL; PYRUVA,CUFF.26509.1
         (482 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G09300.1 | Symbols:  | Thiamin diphosphate-binding fold (THDP...   642   0.0  
AT5G09300.2 | Symbols:  | Thiamin diphosphate-binding fold (THDP...   640   0.0  
AT1G21400.1 | Symbols:  | Thiamin diphosphate-binding fold (THDP...   638   0.0  
AT5G34780.1 | Symbols:  | Thiamin diphosphate-binding fold (THDP...   325   4e-89
AT1G59900.1 | Symbols: AT-E1 ALPHA, E1 ALPHA | pyruvate dehydrog...   142   5e-34
AT1G24180.1 | Symbols: IAR4 | Thiamin diphosphate-binding fold (...   136   4e-32
AT1G01090.1 | Symbols: PDH-E1 ALPHA | pyruvate dehydrogenase E1 ...   123   4e-28

>AT5G09300.1 | Symbols:  | Thiamin diphosphate-binding fold
           (THDP-binding) superfamily protein |
           chr5:2884282-2886797 REVERSE LENGTH=472
          Length = 472

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 290/398 (72%), Positives = 348/398 (87%)

Query: 85  DHDQVIDFPGGKVAFTSEMRFISESSGKRVPCYRVLDDNGEPMNHSNYVQVSKEMAVKMY 144
           ++ QV+DFPGGKVAFT E++FISES  +RVPCYRVLDDNG+ + +S +VQVS+E+AVK+Y
Sbjct: 75  NNHQVMDFPGGKVAFTPEIQFISESDKERVPCYRVLDDNGQLITNSQFVQVSEEVAVKIY 134

Query: 145 SEMVTLQTMDSIFYEVQRQGRISFYLTSMGEEAVNIXXXXXXXXDDIVLPQYREPGVLLW 204
           S+MVTLQ MD+IFYE QRQGR+SFY T++GEEA+NI         D++ PQYREPGVLLW
Sbjct: 135 SDMVTLQIMDNIFYEAQRQGRLSFYATAIGEEAINIASAAALTPQDVIFPQYREPGVLLW 194

Query: 205 RGFTLQQFANQCFGNTSDLGKGRQMPIHYGSNELNYFTISSPIATQLPQAVGAAYSLKMD 264
           RGFTLQ+FANQCFGN SD GKGRQMP+HYGSN+LNYFT+S+ IATQLP AVGAAYSLKMD
Sbjct: 195 RGFTLQEFANQCFGNKSDYGKGRQMPVHYGSNKLNYFTVSATIATQLPNAVGAAYSLKMD 254

Query: 265 GKSACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNNGWAISTPTEEQFRSDGIVV 324
            K ACAVT+ GDGGTSEGDFHA +N AAVMEAPV+FICRNNGWAISTPT +QFRSDG+VV
Sbjct: 255 KKDACAVTYFGDGGTSEGDFHAALNIAAVMEAPVLFICRNNGWAISTPTSDQFRSDGVVV 314

Query: 325 KGQAYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIEALTYRVGHHSTSDDSTKYR 384
           KG+AYGI SIRVDGNDALA+YSAVHTARE+AIREQRP+LIEALTYRVGHHSTSDDST+YR
Sbjct: 315 KGRAYGIRSIRVDGNDALAMYSAVHTAREMAIREQRPILIEALTYRVGHHSTSDDSTRYR 374

Query: 385 PVDEIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVRKQLMNAIQVAEKAQKPPLE 444
              EIE+W  ARNP++RF+ W+E NGWWSDK E +LRS ++K+++ A++VAEK +KP L+
Sbjct: 375 SAGEIEWWNKARNPLSRFRTWIESNGWWSDKTESDLRSRIKKEMLEALRVAEKTEKPNLQ 434

Query: 445 DLFHDVYDQVPSNLQEQENLLRETIKKNPKDYPSDVPL 482
           ++F DVYD  PSNL+EQE L+R+TI  +P+DYPSDVPL
Sbjct: 435 NMFSDVYDVPPSNLREQELLVRQTINSHPQDYPSDVPL 472


>AT5G09300.2 | Symbols:  | Thiamin diphosphate-binding fold
           (THDP-binding) superfamily protein |
           chr5:2884282-2886291 REVERSE LENGTH=401
          Length = 401

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 290/395 (73%), Positives = 346/395 (87%)

Query: 88  QVIDFPGGKVAFTSEMRFISESSGKRVPCYRVLDDNGEPMNHSNYVQVSKEMAVKMYSEM 147
           QV+DFPGGKVAFT E++FISES  +RVPCYRVLDDNG+ + +S +VQVS+E+AVK+YS+M
Sbjct: 7   QVMDFPGGKVAFTPEIQFISESDKERVPCYRVLDDNGQLITNSQFVQVSEEVAVKIYSDM 66

Query: 148 VTLQTMDSIFYEVQRQGRISFYLTSMGEEAVNIXXXXXXXXDDIVLPQYREPGVLLWRGF 207
           VTLQ MD+IFYE QRQGR+SFY T++GEEA+NI         D++ PQYREPGVLLWRGF
Sbjct: 67  VTLQIMDNIFYEAQRQGRLSFYATAIGEEAINIASAAALTPQDVIFPQYREPGVLLWRGF 126

Query: 208 TLQQFANQCFGNTSDLGKGRQMPIHYGSNELNYFTISSPIATQLPQAVGAAYSLKMDGKS 267
           TLQ+FANQCFGN SD GKGRQMP+HYGSN+LNYFT+S+ IATQLP AVGAAYSLKMD K 
Sbjct: 127 TLQEFANQCFGNKSDYGKGRQMPVHYGSNKLNYFTVSATIATQLPNAVGAAYSLKMDKKD 186

Query: 268 ACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNNGWAISTPTEEQFRSDGIVVKGQ 327
           ACAVT+ GDGGTSEGDFHA +N AAVMEAPV+FICRNNGWAISTPT +QFRSDG+VVKG+
Sbjct: 187 ACAVTYFGDGGTSEGDFHAALNIAAVMEAPVLFICRNNGWAISTPTSDQFRSDGVVVKGR 246

Query: 328 AYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIEALTYRVGHHSTSDDSTKYRPVD 387
           AYGI SIRVDGNDALA+YSAVHTARE+AIREQRP+LIEALTYRVGHHSTSDDST+YR   
Sbjct: 247 AYGIRSIRVDGNDALAMYSAVHTAREMAIREQRPILIEALTYRVGHHSTSDDSTRYRSAG 306

Query: 388 EIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVRKQLMNAIQVAEKAQKPPLEDLF 447
           EIE+W  ARNP++RF+ W+E NGWWSDK E +LRS ++K+++ A++VAEK +KP L+++F
Sbjct: 307 EIEWWNKARNPLSRFRTWIESNGWWSDKTESDLRSRIKKEMLEALRVAEKTEKPNLQNMF 366

Query: 448 HDVYDQVPSNLQEQENLLRETIKKNPKDYPSDVPL 482
            DVYD  PSNL+EQE L+R+TI  +P+DYPSDVPL
Sbjct: 367 SDVYDVPPSNLREQELLVRQTINSHPQDYPSDVPL 401


>AT1G21400.1 | Symbols:  | Thiamin diphosphate-binding fold
           (THDP-binding) superfamily protein |
           chr1:7493492-7496240 FORWARD LENGTH=472
          Length = 472

 Score =  638 bits (1646), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 302/425 (71%), Positives = 353/425 (83%), Gaps = 6/425 (1%)

Query: 59  NNSATISYFS--HYESTKAEAQ---LELQEDDHD-QVIDFPGGKVAFTSEMRFISESSGK 112
           +++A +S F    +EST  E Q   L  Q D+ D Q +DFPGGKV +TSEM+FI ESS +
Sbjct: 43  SSTAYLSPFGSLRHESTAVETQADHLVQQIDEVDAQELDFPGGKVGYTSEMKFIPESSSR 102

Query: 113 RVPCYRVLDDNGEPMNHSNYVQVSKEMAVKMYSEMVTLQTMDSIFYEVQRQGRISFYLTS 172
           R+PCYRVLD++G  +  S+++ VS+++AV+MY +M TLQ MD IFYE QRQGRISFYLTS
Sbjct: 103 RIPCYRVLDEDGRIIPDSDFIPVSEKLAVRMYEQMATLQVMDHIFYEAQRQGRISFYLTS 162

Query: 173 MGEEAVNIXXXXXXXXDDIVLPQYREPGVLLWRGFTLQQFANQCFGNTSDLGKGRQMPIH 232
           +GEEA+NI        DD+VLPQYREPGVLLWRGFTL++FANQCFGN +D GKGRQMPIH
Sbjct: 163 VGEEAINIASAAALSPDDVVLPQYREPGVLLWRGFTLEEFANQCFGNKADYGKGRQMPIH 222

Query: 233 YGSNELNYFTISSPIATQLPQAVGAAYSLKMDGKSACAVTFCGDGGTSEGDFHAGMNFAA 292
           YGSN LNYFTISSPIATQLPQA G  YSLKMD K+AC VTF GDGGTSEGDFHAG+NFAA
Sbjct: 223 YGSNRLNYFTISSPIATQLPQAAGVGYSLKMDKKNACTVTFIGDGGTSEGDFHAGLNFAA 282

Query: 293 VMEAPVIFICRNNGWAISTPTEEQFRSDGIVVKGQAYGIWSIRVDGNDALAVYSAVHTAR 352
           VMEAPV+FICRNNGWAIST   EQFRSDGIVVKGQAYGI SIRVDGNDALAVYSAV +AR
Sbjct: 283 VMEAPVVFICRNNGWAISTHISEQFRSDGIVVKGQAYGIRSIRVDGNDALAVYSAVRSAR 342

Query: 353 EIAIREQRPVLIEALTYRVGHHSTSDDSTKYRPVDEIEYWKMARNPVNRFKRWVEMNGWW 412
           E+A+ EQRPVLIE +TYRVGHHSTSDDSTKYR  DEI+YWKM+RNPVNRF++WVE NGWW
Sbjct: 343 EMAVTEQRPVLIEMMTYRVGHHSTSDDSTKYRAADEIQYWKMSRNPVNRFRKWVEDNGWW 402

Query: 413 SDKDELELRSSVRKQLMNAIQVAEKAQKPPLEDLFHDVYDQVPSNLQEQENLLRETIKKN 472
           S++DE +LRS+ RKQL+ AIQ AEK +K PL +LF+DVYD  P NL+EQE  L+E +KK 
Sbjct: 403 SEEDESKLRSNARKQLLQAIQAAEKWEKQPLTELFNDVYDVKPKNLEEQELGLKELVKKQ 462

Query: 473 PKDYP 477
           P+DYP
Sbjct: 463 PQDYP 467


>AT5G34780.1 | Symbols:  | Thiamin diphosphate-binding fold
           (THDP-binding) superfamily protein |
           chr5:12961682-12963892 REVERSE LENGTH=365
          Length = 365

 Score =  325 bits (833), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 159/211 (75%), Positives = 183/211 (86%)

Query: 266 KSACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNNGWAISTPTEEQFRSDGIVVK 325
           K+ACAVTF GDGGTSEGDFHAG+NFAAVMEAPV+FICRNNGWAIST   EQFRSDGIVVK
Sbjct: 26  KNACAVTFIGDGGTSEGDFHAGLNFAAVMEAPVVFICRNNGWAISTHISEQFRSDGIVVK 85

Query: 326 GQAYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIEALTYRVGHHSTSDDSTKYRP 385
           GQAYGI SIRVDGNDALAVYSAV +ARE+A+ EQRPVLIE + YRVGHHSTSDDSTKYR 
Sbjct: 86  GQAYGIRSIRVDGNDALAVYSAVCSAREMAVTEQRPVLIEMMIYRVGHHSTSDDSTKYRA 145

Query: 386 VDEIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVRKQLMNAIQVAEKAQKPPLED 445
            DEI+YWKM+RN VNRF++ VE NGWWS++DE +LRS+ RKQL+ AIQ AEK +K PL +
Sbjct: 146 ADEIQYWKMSRNSVNRFRKSVEDNGWWSEEDESKLRSNARKQLLQAIQAAEKWEKQPLTE 205

Query: 446 LFHDVYDQVPSNLQEQENLLRETIKKNPKDY 476
           LF+DVYD  P NL+E+E  L+E I+K P+DY
Sbjct: 206 LFNDVYDVKPKNLEEEELGLKELIEKQPQDY 236


>AT1G59900.1 | Symbols: AT-E1 ALPHA, E1 ALPHA | pyruvate
           dehydrogenase complex E1 alpha subunit |
           chr1:22051368-22053660 FORWARD LENGTH=389
          Length = 389

 Score =  142 bits (358), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 87/326 (26%), Positives = 157/326 (48%), Gaps = 10/326 (3%)

Query: 130 SNYVQVSKEMAVKMYSEMVTLQTM----DSIFYEVQRQGRISFYLTSMGEEAVNIXXXXX 185
           S  V+ S +  +  +  M  ++ M    DS++     +G    Y    G+EAV I     
Sbjct: 49  SRSVESSSQELLDFFRTMALMRRMEIAADSLYKAKLIRGFCHLY---DGQEAVAIGMEAA 105

Query: 186 XXXDDIVLPQYREPGVLLWRGFTLQQFANQCFGNTSDLGKGRQMPIHYGSNELNYFTISS 245
               D ++  YR+  + L RG +L +  ++  G  +   KG+   +H+   E +++    
Sbjct: 106 ITKKDAIITAYRDHCIFLGRGGSLHEVFSELMGRQAGCSKGKGGSMHFYKKESSFYGGHG 165

Query: 246 PIATQLPQAVGAAYSLKMDGKSACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNN 305
            +  Q+P   G A++ K + + A      GDG  ++G     +N +A+ + P I +C NN
Sbjct: 166 IVGAQVPLGCGIAFAQKYNKEEAVTFALYGDGAANQGQLFEALNISALWDLPAILVCENN 225

Query: 306 GWAISTPTEEQFRSDGIVVKGQAYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIE 365
            + + T      +S     +G    +  ++VDG DA AV  A   A++ A+ E+ P+++E
Sbjct: 226 HYGMGTAEWRAAKSPSYYKRGDY--VPGLKVDGMDAFAVKQACKFAKQHAL-EKGPIILE 282

Query: 366 ALTYRVGHHSTSDDSTKYRPVDEIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVR 425
             TYR   HS SD  + YR  DEI   +  R+P+ R K+ V  +   ++K+  ++   +R
Sbjct: 283 MDTYRYHGHSMSDPGSTYRTRDEISGVRQERDPIERIKKLVLSHDLATEKELKDMEKEIR 342

Query: 426 KQLMNAIQVAEKAQKPPLEDLFHDVY 451
           K++ +AI  A+    P   +LF +VY
Sbjct: 343 KEVDDAIAKAKDCPMPEPSELFTNVY 368


>AT1G24180.1 | Symbols: IAR4 | Thiamin diphosphate-binding fold
           (THDP-binding) superfamily protein |
           chr1:8560777-8563382 REVERSE LENGTH=393
          Length = 393

 Score =  136 bits (342), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 156/326 (47%), Gaps = 10/326 (3%)

Query: 130 SNYVQVSKEMAVKMYSEMVTLQTM----DSIFYEVQRQGRISFYLTSMGEEAVNIXXXXX 185
           S  V+ S E  +  + +M  ++ M    DS++     +G    Y    G+EA+ +     
Sbjct: 53  SRSVETSSEEILAFFRDMARMRRMEIAADSLYKAKLIRGFCHLY---DGQEALAVGMEAA 109

Query: 186 XXXDDIVLPQYREPGVLLWRGFTLQQFANQCFGNTSDLGKGRQMPIHYGSNELNYFTISS 245
               D ++  YR+    + RG  L    ++  G  +    G+   +H+   + +++    
Sbjct: 110 ITKKDAIITSYRDHCTFIGRGGKLVDAFSELMGRKTGCSHGKGGSMHFYKKDASFYGGHG 169

Query: 246 PIATQLPQAVGAAYSLKMDGKSACAVTFCGDGGTSEGDFHAGMNFAAVMEAPVIFICRNN 305
            +  Q+P   G A++ K +   A      GDG  ++G     +N +A+ + P I +C NN
Sbjct: 170 IVGAQIPLGCGLAFAQKYNKDEAVTFALYGDGAANQGQLFEALNISALWDLPAILVCENN 229

Query: 306 GWAISTPTEEQFRSDGIVVKGQAYGIWSIRVDGNDALAVYSAVHTAREIAIREQRPVLIE 365
            + + T T    +S     +G    +  ++VDG DALAV  A   A+E A++   P+++E
Sbjct: 230 HYGMGTATWRSAKSPAYFKRGDY--VPGLKVDGMDALAVKQACKFAKEHALKNG-PIILE 286

Query: 366 ALTYRVGHHSTSDDSTKYRPVDEIEYWKMARNPVNRFKRWVEMNGWWSDKDELELRSSVR 425
             TYR   HS SD  + YR  DEI   +  R+P+ R ++ +  +   ++K+  ++   +R
Sbjct: 287 MDTYRYHGHSMSDPGSTYRTRDEISGVRQVRDPIERVRKLLLTHDIATEKELKDMEKEIR 346

Query: 426 KQLMNAIQVAEKAQKPPLEDLFHDVY 451
           K++ +A+  A+++  P   +LF ++Y
Sbjct: 347 KEVDDAVAQAKESPIPDASELFTNMY 372


>AT1G01090.1 | Symbols: PDH-E1 ALPHA | pyruvate dehydrogenase E1
           alpha | chr1:47705-49166 REVERSE LENGTH=428
          Length = 428

 Score =  123 bits (308), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 91/357 (25%), Positives = 163/357 (45%), Gaps = 12/357 (3%)

Query: 104 RFISESSGKRVPCYRVLDDNGEPMNHSNY-VQVSKEMAVKMYSEMVTLQTMDSIFYEVQR 162
           R    ++ +R P   V +   E  + +N  + ++KE  +++Y +M+  ++ + +  ++  
Sbjct: 47  RLNHSNATRRSPVVSVQEVVKEKQSTNNTSLLITKEEGLELYEDMILGRSFEDMCAQMYY 106

Query: 163 QGRI-SFYLTSMGEEAVNIXXXXXXXXDDIVLPQYREPGVLLWRGFTLQQFANQCFGNTS 221
           +G++  F     G+EAV+          D V+  YR+    L +G + +   ++ FG  +
Sbjct: 107 RGKMFGFVHLYNGQEAVSTGFIKLLTKSDSVVSTYRDHVHALSKGVSARAVMSELFGKVT 166

Query: 222 DLGKGRQMPIHYGSNELNYFTISSPIATQLPQAVGAAYS-------LKMDGKSACAVTFC 274
              +G+   +H  S E N     + I   +P A GAA+S       LK D      V F 
Sbjct: 167 GCCRGQGGSMHMFSKEHNMLGGFAFIGEGIPVATGAAFSSKYRREVLKQDCDDV-TVAFF 225

Query: 275 GDGGTSEGDFHAGMNFAAVMEAPVIFICRNNGWAISTPTEEQFRSDGIVVKGQAYGIWSI 334
           GDG  + G F   +N AA+ + P+IF+  NN WAI            I  KG A+G+  +
Sbjct: 226 GDGTCNNGQFFECLNMAALYKLPIIFVVENNLWAIGMSHLRATSDPEIWKKGPAFGMPGV 285

Query: 335 RVDGNDALAVYSAVHTAREIAIREQRPVLIEALTYRVGHHSTSDDSTKYRPVDEIEYWKM 394
            VDG D L V      A   A R + P L+E  TYR   HS +D        ++ +Y   
Sbjct: 286 HVDGMDVLKVREVAKEAVTRARRGEGPTLVECETYRFRGHSLADPDELRDAAEKAKY--A 343

Query: 395 ARNPVNRFKRWVEMNGWWSDKDELELRSSVRKQLMNAIQVAEKAQKPPLEDLFHDVY 451
           AR+P+   K+++  N    + +   +   + + +  A++ A+ + +P    L  +V+
Sbjct: 344 ARDPIAALKKYLIENKLAKEAELKSIEKKIDELVEEAVEFADASPQPGRSQLLENVF 400