Miyakogusa Predicted Gene

Lj2g3v1968430.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v1968430.1 Non Characterized Hit- tr|D3AH31|D3AH31_9CLOT
Putative F5/8 type C domain protein OS=Clostridium
hat,32.54,0.00000000000002,seg,NULL; Glyco_hydro_43,Glycoside
hydrolase, family 43;
Arabinanase/levansucrase/invertase,Glycosyl,CUFF.38134.1
         (469 letters)

Database: Medicago_aa4.0v1 
           62,319 sequences; 21,947,249 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Medtr5g015460.1 | glycosyl hydrolase family 43 protein | HC | ch...   776   0.0  
Medtr5g015460.2 | glycosyl hydrolase family 43 protein | HC | ch...   776   0.0  
Medtr5g015460.3 | glycosyl hydrolase family 43 protein | HC | ch...   748   0.0  
Medtr4g097110.2 | glycosyl hydrolase family 43 protein | HC | ch...   614   e-176
Medtr4g097110.1 | glycosyl hydrolase family 43 protein | HC | ch...   614   e-176

>Medtr5g015460.1 | glycosyl hydrolase family 43 protein | HC |
           chr5:5348835-5354586 | 20130731
          Length = 472

 Score =  776 bits (2004), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/470 (79%), Positives = 408/470 (86%), Gaps = 1/470 (0%)

Query: 1   MRIRNKYRKPTTLPCNAGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVT 59
           MRIRNK +KPT+L C+AGGR S++  V+            Y+YV H +R    EP+L V+
Sbjct: 1   MRIRNKCKKPTSLRCDAGGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVS 60

Query: 60  HHPQFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHK 119
           +HPQFRELQEVEEES+ V                  TTTLI+EFLDENSQ+R VFFPG K
Sbjct: 61  NHPQFRELQEVEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRK 120

Query: 120 RAIDPMQAAGDDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHA 179
           RAIDP+ A  +DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHA
Sbjct: 121 RAIDPILAVENDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHA 180

Query: 180 HKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTG 239
           HKKGAARVDIIGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT 
Sbjct: 181 HKKGAARVDIIGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTE 240

Query: 240 KYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIY 299
           KYVMWMHIDDANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++Y
Sbjct: 241 KYVMWMHIDDANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVY 300

Query: 300 SSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEA 359
           SSEDNSELHIGPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEA
Sbjct: 301 SSEDNSELHIGPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEA 360

Query: 360 LAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADL 419
           LAHAAESI+G WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL
Sbjct: 361 LAHAAESIMGPWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADL 420

Query: 420 KDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 469
           +DSRYVWLPLIVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 421 RDSRYVWLPLIVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 470


>Medtr5g015460.2 | glycosyl hydrolase family 43 protein | HC |
           chr5:5349653-5354338 | 20130731
          Length = 472

 Score =  776 bits (2004), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/470 (79%), Positives = 408/470 (86%), Gaps = 1/470 (0%)

Query: 1   MRIRNKYRKPTTLPCNAGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVT 59
           MRIRNK +KPT+L C+AGGR S++  V+            Y+YV H +R    EP+L V+
Sbjct: 1   MRIRNKCKKPTSLRCDAGGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVS 60

Query: 60  HHPQFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHK 119
           +HPQFRELQEVEEES+ V                  TTTLI+EFLDENSQ+R VFFPG K
Sbjct: 61  NHPQFRELQEVEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRK 120

Query: 120 RAIDPMQAAGDDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHA 179
           RAIDP+ A  +DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHA
Sbjct: 121 RAIDPILAVENDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHA 180

Query: 180 HKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTG 239
           HKKGAARVDIIGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT 
Sbjct: 181 HKKGAARVDIIGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTE 240

Query: 240 KYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIY 299
           KYVMWMHIDDANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++Y
Sbjct: 241 KYVMWMHIDDANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVY 300

Query: 300 SSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEA 359
           SSEDNSELHIGPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEA
Sbjct: 301 SSEDNSELHIGPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEA 360

Query: 360 LAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADL 419
           LAHAAESI+G WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL
Sbjct: 361 LAHAAESIMGPWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADL 420

Query: 420 KDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 469
           +DSRYVWLPLIVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 421 RDSRYVWLPLIVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 470


>Medtr5g015460.3 | glycosyl hydrolase family 43 protein | HC |
           chr5:5348911-5354586 | 20130731
          Length = 469

 Score =  748 bits (1931), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/452 (79%), Positives = 392/452 (86%), Gaps = 1/452 (0%)

Query: 19  GRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVTHHPQFRELQEVEEESIQV 77
           GR S++  V+            Y+YV H +R    EP+L V++HPQFRELQEVEEES+ V
Sbjct: 16  GRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVSNHPQFRELQEVEEESLHV 75

Query: 78  XXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHKRAIDPMQAAGDDKYYYYP 137
                             TTTLI+EFLDENSQ+R VFFPG KRAIDP+ A  +DKY+YYP
Sbjct: 76  PPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRKRAIDPILAVENDKYHYYP 135

Query: 138 GRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSS 197
           GR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHAHKKGAARVDIIGVGCYSS
Sbjct: 136 GRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHAHKKGAARVDIIGVGCYSS 195

Query: 198 KDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTGKYVMWMHIDDANYTKAAV 257
           KDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT KYVMWMHIDDANYTKA+V
Sbjct: 196 KDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTEKYVMWMHIDDANYTKASV 255

Query: 258 GVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIYSSEDNSELHIGPLTEDYL 317
           GVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++YSSEDNSELHIGPLT+DYL
Sbjct: 256 GVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVYSSEDNSELHIGPLTQDYL 315

Query: 318 NVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEALAHAAESILGTWETMGNP 377
           NVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEALAHAAESI+G WETMGNP
Sbjct: 316 NVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAESIMGPWETMGNP 375

Query: 378 CMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADLKDSRYVWLPLIVAGPADQ 437
           C+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL+DSRYVWLPLIVAGPAD+
Sbjct: 376 CLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADLRDSRYVWLPLIVAGPADE 435

Query: 438 PLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 469
           PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 436 PLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 467


>Medtr4g097110.2 | glycosyl hydrolase family 43 protein | HC |
           chr4:40009583-40006901 | 20130731
          Length = 465

 Score =  614 bits (1584), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 300/477 (62%), Positives = 357/477 (74%), Gaps = 21/477 (4%)

Query: 1   MRIRNKYRKP-TTLPCNAGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVVEPQLRVT 59
           MR++N Y+KP T L C++  R+ S   V+                 +         L + 
Sbjct: 1   MRMKNLYKKPITNLRCSSWSRYCSISLVILWTLLILGCILLLHLYSNNNTS-----LNII 55

Query: 60  HHP----QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFF 115
           H P     F +LQ VE+E+ Q+                   T L++EFLD++S LR VFF
Sbjct: 56  HPPPLPSHFHQLQHVEKENFQIPPPNKKRSPQSKS-----ITPLVDEFLDQDSSLRHVFF 110

Query: 116 PGHKRAIDPMQAAG---DDKY-YYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEY 171
           P   + IDPM+  G   +D Y YYYPG++WLDTDGNPIQAHGG IL+D+ S TYYWYGEY
Sbjct: 111 P--HKTIDPMKTIGKGKNDSYNYYYPGKIWLDTDGNPIQAHGGCILYDENSSTYYWYGEY 168

Query: 172 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPK 231
           KDGPTY  + KG ARVDIIGVGCYSSKDLWTWK EG+ LAAE+T++THDLHKSNVLERPK
Sbjct: 169 KDGPTYLHNNKGPARVDIIGVGCYSSKDLWTWKKEGIALAAEKTDKTHDLHKSNVLERPK 228

Query: 232 VIYNEKTGKYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDD 291
           VIYNEKT KYVMWMHID+ANY KA VG+A SDTP GPF YLGSQRPH ++SRDMT+FKD+
Sbjct: 229 VIYNEKTRKYVMWMHIDNANYAKATVGIAFSDTPTGPFKYLGSQRPHRYQSRDMTLFKDE 288

Query: 292 DGAAYLIYSSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGC 351
           D  AYLIYSSE+N+ +HIGPLTEDYLNVT VMKRI VG  REAPA+FKH+GTYYM+TSGC
Sbjct: 289 DNVAYLIYSSEENNVMHIGPLTEDYLNVTSVMKRIFVGQRREAPAMFKHKGTYYMVTSGC 348

Query: 352 TGWAPNEALAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMA 411
           TGWAPNEAL H+AE+ILGTWET+GNPC+ GNKMFR++TF AQSTFVLP+  FPG FIFMA
Sbjct: 349 TGWAPNEALVHSAETILGTWETIGNPCVAGNKMFRVSTFLAQSTFVLPLTRFPGLFIFMA 408

Query: 412 DRWNPADLKDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSF 468
           DRWNP++L+DSRYVWLPLIV G  DQ  +Y F+  LW RVSIYWH+KW+LP G ++F
Sbjct: 409 DRWNPSELRDSRYVWLPLIVDGHEDQAFQYGFDNKLWPRVSIYWHKKWKLPLGWNTF 465


>Medtr4g097110.1 | glycosyl hydrolase family 43 protein | HC |
           chr4:40009583-40006901 | 20130731
          Length = 465

 Score =  614 bits (1584), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 300/477 (62%), Positives = 357/477 (74%), Gaps = 21/477 (4%)

Query: 1   MRIRNKYRKP-TTLPCNAGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVVEPQLRVT 59
           MR++N Y+KP T L C++  R+ S   V+                 +         L + 
Sbjct: 1   MRMKNLYKKPITNLRCSSWSRYCSISLVILWTLLILGCILLLHLYSNNNTS-----LNII 55

Query: 60  HHP----QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFF 115
           H P     F +LQ VE+E+ Q+                   T L++EFLD++S LR VFF
Sbjct: 56  HPPPLPSHFHQLQHVEKENFQIPPPNKKRSPQSKS-----ITPLVDEFLDQDSSLRHVFF 110

Query: 116 PGHKRAIDPMQAAG---DDKY-YYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEY 171
           P   + IDPM+  G   +D Y YYYPG++WLDTDGNPIQAHGG IL+D+ S TYYWYGEY
Sbjct: 111 P--HKTIDPMKTIGKGKNDSYNYYYPGKIWLDTDGNPIQAHGGCILYDENSSTYYWYGEY 168

Query: 172 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPK 231
           KDGPTY  + KG ARVDIIGVGCYSSKDLWTWK EG+ LAAE+T++THDLHKSNVLERPK
Sbjct: 169 KDGPTYLHNNKGPARVDIIGVGCYSSKDLWTWKKEGIALAAEKTDKTHDLHKSNVLERPK 228

Query: 232 VIYNEKTGKYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDD 291
           VIYNEKT KYVMWMHID+ANY KA VG+A SDTP GPF YLGSQRPH ++SRDMT+FKD+
Sbjct: 229 VIYNEKTRKYVMWMHIDNANYAKATVGIAFSDTPTGPFKYLGSQRPHRYQSRDMTLFKDE 288

Query: 292 DGAAYLIYSSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGC 351
           D  AYLIYSSE+N+ +HIGPLTEDYLNVT VMKRI VG  REAPA+FKH+GTYYM+TSGC
Sbjct: 289 DNVAYLIYSSEENNVMHIGPLTEDYLNVTSVMKRIFVGQRREAPAMFKHKGTYYMVTSGC 348

Query: 352 TGWAPNEALAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMA 411
           TGWAPNEAL H+AE+ILGTWET+GNPC+ GNKMFR++TF AQSTFVLP+  FPG FIFMA
Sbjct: 349 TGWAPNEALVHSAETILGTWETIGNPCVAGNKMFRVSTFLAQSTFVLPLTRFPGLFIFMA 408

Query: 412 DRWNPADLKDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSF 468
           DRWNP++L+DSRYVWLPLIV G  DQ  +Y F+  LW RVSIYWH+KW+LP G ++F
Sbjct: 409 DRWNPSELRDSRYVWLPLIVDGHEDQAFQYGFDNKLWPRVSIYWHKKWKLPLGWNTF 465