Miyakogusa Predicted Gene
- Lj2g3v1968430.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1968430.1 Non Characterized Hit- tr|D3AH31|D3AH31_9CLOT
Putative F5/8 type C domain protein OS=Clostridium
hat,32.54,0.00000000000002,seg,NULL; Glyco_hydro_43,Glycoside
hydrolase, family 43;
Arabinanase/levansucrase/invertase,Glycosyl,CUFF.38134.1
(469 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr5g015460.1 | glycosyl hydrolase family 43 protein | HC | ch... 776 0.0
Medtr5g015460.2 | glycosyl hydrolase family 43 protein | HC | ch... 776 0.0
Medtr5g015460.3 | glycosyl hydrolase family 43 protein | HC | ch... 748 0.0
Medtr4g097110.2 | glycosyl hydrolase family 43 protein | HC | ch... 614 e-176
Medtr4g097110.1 | glycosyl hydrolase family 43 protein | HC | ch... 614 e-176
>Medtr5g015460.1 | glycosyl hydrolase family 43 protein | HC |
chr5:5348835-5354586 | 20130731
Length = 472
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/470 (79%), Positives = 408/470 (86%), Gaps = 1/470 (0%)
Query: 1 MRIRNKYRKPTTLPCNAGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVT 59
MRIRNK +KPT+L C+AGGR S++ V+ Y+YV H +R EP+L V+
Sbjct: 1 MRIRNKCKKPTSLRCDAGGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVS 60
Query: 60 HHPQFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHK 119
+HPQFRELQEVEEES+ V TTTLI+EFLDENSQ+R VFFPG K
Sbjct: 61 NHPQFRELQEVEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRK 120
Query: 120 RAIDPMQAAGDDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHA 179
RAIDP+ A +DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHA
Sbjct: 121 RAIDPILAVENDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHA 180
Query: 180 HKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTG 239
HKKGAARVDIIGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT
Sbjct: 181 HKKGAARVDIIGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTE 240
Query: 240 KYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIY 299
KYVMWMHIDDANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++Y
Sbjct: 241 KYVMWMHIDDANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVY 300
Query: 300 SSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEA 359
SSEDNSELHIGPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEA
Sbjct: 301 SSEDNSELHIGPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEA 360
Query: 360 LAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADL 419
LAHAAESI+G WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL
Sbjct: 361 LAHAAESIMGPWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADL 420
Query: 420 KDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 469
+DSRYVWLPLIVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 421 RDSRYVWLPLIVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 470
>Medtr5g015460.2 | glycosyl hydrolase family 43 protein | HC |
chr5:5349653-5354338 | 20130731
Length = 472
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/470 (79%), Positives = 408/470 (86%), Gaps = 1/470 (0%)
Query: 1 MRIRNKYRKPTTLPCNAGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVT 59
MRIRNK +KPT+L C+AGGR S++ V+ Y+YV H +R EP+L V+
Sbjct: 1 MRIRNKCKKPTSLRCDAGGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVS 60
Query: 60 HHPQFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHK 119
+HPQFRELQEVEEES+ V TTTLI+EFLDENSQ+R VFFPG K
Sbjct: 61 NHPQFRELQEVEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRK 120
Query: 120 RAIDPMQAAGDDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHA 179
RAIDP+ A +DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHA
Sbjct: 121 RAIDPILAVENDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHA 180
Query: 180 HKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTG 239
HKKGAARVDIIGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT
Sbjct: 181 HKKGAARVDIIGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTE 240
Query: 240 KYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIY 299
KYVMWMHIDDANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++Y
Sbjct: 241 KYVMWMHIDDANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVY 300
Query: 300 SSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEA 359
SSEDNSELHIGPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEA
Sbjct: 301 SSEDNSELHIGPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEA 360
Query: 360 LAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADL 419
LAHAAESI+G WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL
Sbjct: 361 LAHAAESIMGPWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADL 420
Query: 420 KDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 469
+DSRYVWLPLIVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 421 RDSRYVWLPLIVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 470
>Medtr5g015460.3 | glycosyl hydrolase family 43 protein | HC |
chr5:5348911-5354586 | 20130731
Length = 469
Score = 748 bits (1931), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/452 (79%), Positives = 392/452 (86%), Gaps = 1/452 (0%)
Query: 19 GRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVTHHPQFRELQEVEEESIQV 77
GR S++ V+ Y+YV H +R EP+L V++HPQFRELQEVEEES+ V
Sbjct: 16 GRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVSNHPQFRELQEVEEESLHV 75
Query: 78 XXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHKRAIDPMQAAGDDKYYYYP 137
TTTLI+EFLDENSQ+R VFFPG KRAIDP+ A +DKY+YYP
Sbjct: 76 PPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRKRAIDPILAVENDKYHYYP 135
Query: 138 GRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSS 197
GR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHAHKKGAARVDIIGVGCYSS
Sbjct: 136 GRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHAHKKGAARVDIIGVGCYSS 195
Query: 198 KDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTGKYVMWMHIDDANYTKAAV 257
KDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT KYVMWMHIDDANYTKA+V
Sbjct: 196 KDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTEKYVMWMHIDDANYTKASV 255
Query: 258 GVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIYSSEDNSELHIGPLTEDYL 317
GVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++YSSEDNSELHIGPLT+DYL
Sbjct: 256 GVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVYSSEDNSELHIGPLTQDYL 315
Query: 318 NVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEALAHAAESILGTWETMGNP 377
NVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEALAHAAESI+G WETMGNP
Sbjct: 316 NVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAESIMGPWETMGNP 375
Query: 378 CMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADLKDSRYVWLPLIVAGPADQ 437
C+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL+DSRYVWLPLIVAGPAD+
Sbjct: 376 CLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADLRDSRYVWLPLIVAGPADE 435
Query: 438 PLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 469
PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 436 PLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 467
>Medtr4g097110.2 | glycosyl hydrolase family 43 protein | HC |
chr4:40009583-40006901 | 20130731
Length = 465
Score = 614 bits (1584), Expect = e-176, Method: Compositional matrix adjust.
Identities = 300/477 (62%), Positives = 357/477 (74%), Gaps = 21/477 (4%)
Query: 1 MRIRNKYRKP-TTLPCNAGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVVEPQLRVT 59
MR++N Y+KP T L C++ R+ S V+ + L +
Sbjct: 1 MRMKNLYKKPITNLRCSSWSRYCSISLVILWTLLILGCILLLHLYSNNNTS-----LNII 55
Query: 60 HHP----QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFF 115
H P F +LQ VE+E+ Q+ T L++EFLD++S LR VFF
Sbjct: 56 HPPPLPSHFHQLQHVEKENFQIPPPNKKRSPQSKS-----ITPLVDEFLDQDSSLRHVFF 110
Query: 116 PGHKRAIDPMQAAG---DDKY-YYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEY 171
P + IDPM+ G +D Y YYYPG++WLDTDGNPIQAHGG IL+D+ S TYYWYGEY
Sbjct: 111 P--HKTIDPMKTIGKGKNDSYNYYYPGKIWLDTDGNPIQAHGGCILYDENSSTYYWYGEY 168
Query: 172 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPK 231
KDGPTY + KG ARVDIIGVGCYSSKDLWTWK EG+ LAAE+T++THDLHKSNVLERPK
Sbjct: 169 KDGPTYLHNNKGPARVDIIGVGCYSSKDLWTWKKEGIALAAEKTDKTHDLHKSNVLERPK 228
Query: 232 VIYNEKTGKYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDD 291
VIYNEKT KYVMWMHID+ANY KA VG+A SDTP GPF YLGSQRPH ++SRDMT+FKD+
Sbjct: 229 VIYNEKTRKYVMWMHIDNANYAKATVGIAFSDTPTGPFKYLGSQRPHRYQSRDMTLFKDE 288
Query: 292 DGAAYLIYSSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGC 351
D AYLIYSSE+N+ +HIGPLTEDYLNVT VMKRI VG REAPA+FKH+GTYYM+TSGC
Sbjct: 289 DNVAYLIYSSEENNVMHIGPLTEDYLNVTSVMKRIFVGQRREAPAMFKHKGTYYMVTSGC 348
Query: 352 TGWAPNEALAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMA 411
TGWAPNEAL H+AE+ILGTWET+GNPC+ GNKMFR++TF AQSTFVLP+ FPG FIFMA
Sbjct: 349 TGWAPNEALVHSAETILGTWETIGNPCVAGNKMFRVSTFLAQSTFVLPLTRFPGLFIFMA 408
Query: 412 DRWNPADLKDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSF 468
DRWNP++L+DSRYVWLPLIV G DQ +Y F+ LW RVSIYWH+KW+LP G ++F
Sbjct: 409 DRWNPSELRDSRYVWLPLIVDGHEDQAFQYGFDNKLWPRVSIYWHKKWKLPLGWNTF 465
>Medtr4g097110.1 | glycosyl hydrolase family 43 protein | HC |
chr4:40009583-40006901 | 20130731
Length = 465
Score = 614 bits (1584), Expect = e-176, Method: Compositional matrix adjust.
Identities = 300/477 (62%), Positives = 357/477 (74%), Gaps = 21/477 (4%)
Query: 1 MRIRNKYRKP-TTLPCNAGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVVEPQLRVT 59
MR++N Y+KP T L C++ R+ S V+ + L +
Sbjct: 1 MRMKNLYKKPITNLRCSSWSRYCSISLVILWTLLILGCILLLHLYSNNNTS-----LNII 55
Query: 60 HHP----QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFF 115
H P F +LQ VE+E+ Q+ T L++EFLD++S LR VFF
Sbjct: 56 HPPPLPSHFHQLQHVEKENFQIPPPNKKRSPQSKS-----ITPLVDEFLDQDSSLRHVFF 110
Query: 116 PGHKRAIDPMQAAG---DDKY-YYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEY 171
P + IDPM+ G +D Y YYYPG++WLDTDGNPIQAHGG IL+D+ S TYYWYGEY
Sbjct: 111 P--HKTIDPMKTIGKGKNDSYNYYYPGKIWLDTDGNPIQAHGGCILYDENSSTYYWYGEY 168
Query: 172 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPK 231
KDGPTY + KG ARVDIIGVGCYSSKDLWTWK EG+ LAAE+T++THDLHKSNVLERPK
Sbjct: 169 KDGPTYLHNNKGPARVDIIGVGCYSSKDLWTWKKEGIALAAEKTDKTHDLHKSNVLERPK 228
Query: 232 VIYNEKTGKYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDD 291
VIYNEKT KYVMWMHID+ANY KA VG+A SDTP GPF YLGSQRPH ++SRDMT+FKD+
Sbjct: 229 VIYNEKTRKYVMWMHIDNANYAKATVGIAFSDTPTGPFKYLGSQRPHRYQSRDMTLFKDE 288
Query: 292 DGAAYLIYSSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGC 351
D AYLIYSSE+N+ +HIGPLTEDYLNVT VMKRI VG REAPA+FKH+GTYYM+TSGC
Sbjct: 289 DNVAYLIYSSEENNVMHIGPLTEDYLNVTSVMKRIFVGQRREAPAMFKHKGTYYMVTSGC 348
Query: 352 TGWAPNEALAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMA 411
TGWAPNEAL H+AE+ILGTWET+GNPC+ GNKMFR++TF AQSTFVLP+ FPG FIFMA
Sbjct: 349 TGWAPNEALVHSAETILGTWETIGNPCVAGNKMFRVSTFLAQSTFVLPLTRFPGLFIFMA 408
Query: 412 DRWNPADLKDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSF 468
DRWNP++L+DSRYVWLPLIV G DQ +Y F+ LW RVSIYWH+KW+LP G ++F
Sbjct: 409 DRWNPSELRDSRYVWLPLIVDGHEDQAFQYGFDNKLWPRVSIYWHKKWKLPLGWNTF 465