Miyakogusa Predicted Gene

Lj2g3v1968430.2

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v1968430.2 Non Characterized Hit- tr|D3AH31|D3AH31_9CLOT
Putative F5/8 type C domain protein OS=Clostridium
hat,32.54,0.00000000000002,seg,NULL; Glyco_hydro_43,Glycoside
hydrolase, family 43;
Arabinanase/levansucrase/invertase,Glycosyl,CUFF.38134.2
         (466 letters)

Database: Medicago_aa4.0v1 
           62,319 sequences; 21,947,249 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Medtr5g015460.3 | glycosyl hydrolase family 43 protein | HC | ch...   766   0.0  
Medtr5g015460.1 | glycosyl hydrolase family 43 protein | HC | ch...   754   0.0  
Medtr5g015460.2 | glycosyl hydrolase family 43 protein | HC | ch...   754   0.0  
Medtr4g097110.2 | glycosyl hydrolase family 43 protein | HC | ch...   607   e-174
Medtr4g097110.1 | glycosyl hydrolase family 43 protein | HC | ch...   607   e-174

>Medtr5g015460.3 | glycosyl hydrolase family 43 protein | HC |
           chr5:5348911-5354586 | 20130731
          Length = 469

 Score =  766 bits (1979), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/467 (79%), Positives = 403/467 (86%), Gaps = 1/467 (0%)

Query: 1   MSGSSVHSTVRVVTGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVTHHP 59
           MSGSSVHS V VV  GR S++  V+            Y+YV H +R    EP+L V++HP
Sbjct: 1   MSGSSVHSLVLVVPRGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVSNHP 60

Query: 60  QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHKRAI 119
           QFRELQEVEEES+ V                  TTTLI+EFLDENSQ+R VFFPG KRAI
Sbjct: 61  QFRELQEVEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRKRAI 120

Query: 120 DPMQAAGDDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHAHKK 179
           DP+ A  +DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHAHKK
Sbjct: 121 DPILAVENDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHAHKK 180

Query: 180 GAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTGKYV 239
           GAARVDIIGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT KYV
Sbjct: 181 GAARVDIIGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTEKYV 240

Query: 240 MWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIYSSE 299
           MWMHIDDANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++YSSE
Sbjct: 241 MWMHIDDANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVYSSE 300

Query: 300 DNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEALAH 359
           DNSELHIGPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEALAH
Sbjct: 301 DNSELHIGPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAH 360

Query: 360 AAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADLKDS 419
           AAESI+G WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL+DS
Sbjct: 361 AAESIMGPWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADLRDS 420

Query: 420 RYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 466
           RYVWLPLIVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 421 RYVWLPLIVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 467


>Medtr5g015460.1 | glycosyl hydrolase family 43 protein | HC |
           chr5:5348835-5354586 | 20130731
          Length = 472

 Score =  754 bits (1947), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/460 (78%), Positives = 397/460 (86%), Gaps = 1/460 (0%)

Query: 8   STVRVVTGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVTHHPQFRELQE 66
           +++R   GGR S++  V+            Y+YV H +R    EP+L V++HPQFRELQE
Sbjct: 11  TSLRCDAGGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVSNHPQFRELQE 70

Query: 67  VEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHKRAIDPMQAAG 126
           VEEES+ V                  TTTLI+EFLDENSQ+R VFFPG KRAIDP+ A  
Sbjct: 71  VEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRKRAIDPILAVE 130

Query: 127 DDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHAHKKGAARVDI 186
           +DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHAHKKGAARVDI
Sbjct: 131 NDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHAHKKGAARVDI 190

Query: 187 IGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTGKYVMWMHIDD 246
           IGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT KYVMWMHIDD
Sbjct: 191 IGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTEKYVMWMHIDD 250

Query: 247 ANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIYSSEDNSELHI 306
           ANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++YSSEDNSELHI
Sbjct: 251 ANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVYSSEDNSELHI 310

Query: 307 GPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEALAHAAESILG 366
           GPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEALAHAAESI+G
Sbjct: 311 GPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAESIMG 370

Query: 367 TWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADLKDSRYVWLPL 426
            WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL+DSRYVWLPL
Sbjct: 371 PWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADLRDSRYVWLPL 430

Query: 427 IVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 466
           IVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 431 IVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 470


>Medtr5g015460.2 | glycosyl hydrolase family 43 protein | HC |
           chr5:5349653-5354338 | 20130731
          Length = 472

 Score =  754 bits (1947), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/460 (78%), Positives = 397/460 (86%), Gaps = 1/460 (0%)

Query: 8   STVRVVTGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVTHHPQFRELQE 66
           +++R   GGR S++  V+            Y+YV H +R    EP+L V++HPQFRELQE
Sbjct: 11  TSLRCDAGGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVSNHPQFRELQE 70

Query: 67  VEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHKRAIDPMQAAG 126
           VEEES+ V                  TTTLI+EFLDENSQ+R VFFPG KRAIDP+ A  
Sbjct: 71  VEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRKRAIDPILAVE 130

Query: 127 DDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHAHKKGAARVDI 186
           +DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHAHKKGAARVDI
Sbjct: 131 NDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHAHKKGAARVDI 190

Query: 187 IGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTGKYVMWMHIDD 246
           IGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT KYVMWMHIDD
Sbjct: 191 IGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTEKYVMWMHIDD 250

Query: 247 ANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIYSSEDNSELHI 306
           ANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++YSSEDNSELHI
Sbjct: 251 ANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVYSSEDNSELHI 310

Query: 307 GPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEALAHAAESILG 366
           GPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEALAHAAESI+G
Sbjct: 311 GPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAESIMG 370

Query: 367 TWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADLKDSRYVWLPL 426
            WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL+DSRYVWLPL
Sbjct: 371 PWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADLRDSRYVWLPL 430

Query: 427 IVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 466
           IVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 431 IVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 470


>Medtr4g097110.2 | glycosyl hydrolase family 43 protein | HC |
           chr4:40009583-40006901 | 20130731
          Length = 465

 Score =  607 bits (1564), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 288/421 (68%), Positives = 337/421 (80%), Gaps = 15/421 (3%)

Query: 53  LRVTHHP----QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLR 108
           L + H P     F +LQ VE+E+ Q+                   T L++EFLD++S LR
Sbjct: 52  LNIIHPPPLPSHFHQLQHVEKENFQIPPPNKKRSPQSKS-----ITPLVDEFLDQDSSLR 106

Query: 109 QVFFPGHKRAIDPMQAAG---DDKY-YYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYW 164
            VFFP   + IDPM+  G   +D Y YYYPG++WLDTDGNPIQAHGG IL+D+ S TYYW
Sbjct: 107 HVFFP--HKTIDPMKTIGKGKNDSYNYYYPGKIWLDTDGNPIQAHGGCILYDENSSTYYW 164

Query: 165 YGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVL 224
           YGEYKDGPTY  + KG ARVDIIGVGCYSSKDLWTWK EG+ LAAE+T++THDLHKSNVL
Sbjct: 165 YGEYKDGPTYLHNNKGPARVDIIGVGCYSSKDLWTWKKEGIALAAEKTDKTHDLHKSNVL 224

Query: 225 ERPKVIYNEKTGKYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTV 284
           ERPKVIYNEKT KYVMWMHID+ANY KA VG+A SDTP GPF YLGSQRPH ++SRDMT+
Sbjct: 225 ERPKVIYNEKTRKYVMWMHIDNANYAKATVGIAFSDTPTGPFKYLGSQRPHRYQSRDMTL 284

Query: 285 FKDDDGAAYLIYSSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMI 344
           FKD+D  AYLIYSSE+N+ +HIGPLTEDYLNVT VMKRI VG  REAPA+FKH+GTYYM+
Sbjct: 285 FKDEDNVAYLIYSSEENNVMHIGPLTEDYLNVTSVMKRIFVGQRREAPAMFKHKGTYYMV 344

Query: 345 TSGCTGWAPNEALAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSF 404
           TSGCTGWAPNEAL H+AE+ILGTWET+GNPC+ GNKMFR++TF AQSTFVLP+  FPG F
Sbjct: 345 TSGCTGWAPNEALVHSAETILGTWETIGNPCVAGNKMFRVSTFLAQSTFVLPLTRFPGLF 404

Query: 405 IFMADRWNPADLKDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSS 464
           IFMADRWNP++L+DSRYVWLPLIV G  DQ  +Y F+  LW RVSIYWH+KW+LP G ++
Sbjct: 405 IFMADRWNPSELRDSRYVWLPLIVDGHEDQAFQYGFDNKLWPRVSIYWHKKWKLPLGWNT 464

Query: 465 F 465
           F
Sbjct: 465 F 465


>Medtr4g097110.1 | glycosyl hydrolase family 43 protein | HC |
           chr4:40009583-40006901 | 20130731
          Length = 465

 Score =  607 bits (1564), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 288/421 (68%), Positives = 337/421 (80%), Gaps = 15/421 (3%)

Query: 53  LRVTHHP----QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLR 108
           L + H P     F +LQ VE+E+ Q+                   T L++EFLD++S LR
Sbjct: 52  LNIIHPPPLPSHFHQLQHVEKENFQIPPPNKKRSPQSKS-----ITPLVDEFLDQDSSLR 106

Query: 109 QVFFPGHKRAIDPMQAAG---DDKY-YYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYW 164
            VFFP   + IDPM+  G   +D Y YYYPG++WLDTDGNPIQAHGG IL+D+ S TYYW
Sbjct: 107 HVFFP--HKTIDPMKTIGKGKNDSYNYYYPGKIWLDTDGNPIQAHGGCILYDENSSTYYW 164

Query: 165 YGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVL 224
           YGEYKDGPTY  + KG ARVDIIGVGCYSSKDLWTWK EG+ LAAE+T++THDLHKSNVL
Sbjct: 165 YGEYKDGPTYLHNNKGPARVDIIGVGCYSSKDLWTWKKEGIALAAEKTDKTHDLHKSNVL 224

Query: 225 ERPKVIYNEKTGKYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTV 284
           ERPKVIYNEKT KYVMWMHID+ANY KA VG+A SDTP GPF YLGSQRPH ++SRDMT+
Sbjct: 225 ERPKVIYNEKTRKYVMWMHIDNANYAKATVGIAFSDTPTGPFKYLGSQRPHRYQSRDMTL 284

Query: 285 FKDDDGAAYLIYSSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMI 344
           FKD+D  AYLIYSSE+N+ +HIGPLTEDYLNVT VMKRI VG  REAPA+FKH+GTYYM+
Sbjct: 285 FKDEDNVAYLIYSSEENNVMHIGPLTEDYLNVTSVMKRIFVGQRREAPAMFKHKGTYYMV 344

Query: 345 TSGCTGWAPNEALAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSF 404
           TSGCTGWAPNEAL H+AE+ILGTWET+GNPC+ GNKMFR++TF AQSTFVLP+  FPG F
Sbjct: 345 TSGCTGWAPNEALVHSAETILGTWETIGNPCVAGNKMFRVSTFLAQSTFVLPLTRFPGLF 404

Query: 405 IFMADRWNPADLKDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSS 464
           IFMADRWNP++L+DSRYVWLPLIV G  DQ  +Y F+  LW RVSIYWH+KW+LP G ++
Sbjct: 405 IFMADRWNPSELRDSRYVWLPLIVDGHEDQAFQYGFDNKLWPRVSIYWHKKWKLPLGWNT 464

Query: 465 F 465
           F
Sbjct: 465 F 465