Miyakogusa Predicted Gene
- Lj2g3v1968430.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1968430.2 Non Characterized Hit- tr|D3AH31|D3AH31_9CLOT
Putative F5/8 type C domain protein OS=Clostridium
hat,32.54,0.00000000000002,seg,NULL; Glyco_hydro_43,Glycoside
hydrolase, family 43;
Arabinanase/levansucrase/invertase,Glycosyl,CUFF.38134.2
(466 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                        (bits)  Value
Medtr5g015460.3 | glycosyl hydrolase family 43 protein | HC | ch...   766   0.0
Medtr5g015460.1 | glycosyl hydrolase family 43 protein | HC | ch...   754   0.0
Medtr5g015460.2 | glycosyl hydrolase family 43 protein | HC | ch...   754   0.0
Medtr4g097110.2 | glycosyl hydrolase family 43 protein | HC | ch...   607   e-174
Medtr4g097110.1 | glycosyl hydrolase family 43 protein | HC | ch...   607   e-174
>Medtr5g015460.3 | glycosyl hydrolase family 43 protein | HC |
chr5:5348911-5354586 | 20130731
Length = 469
Score = 766 bits (1979), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/467 (79%), Positives = 403/467 (86%), Gaps = 1/467 (0%)
Query: 1 MSGSSVHSTVRVVTGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVTHHP 59
MSGSSVHS V VV GR S++ V+ Y+YV H +R EP+L V++HP
Sbjct: 1 MSGSSVHSLVLVVPRGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVSNHP 60
Query: 60 QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHKRAI 119
QFRELQEVEEES+ V TTTLI+EFLDENSQ+R VFFPG KRAI
Sbjct: 61 QFRELQEVEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRKRAI 120
Query: 120 DPMQAAGDDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHAHKK 179
DP+ A +DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHAHKK
Sbjct: 121 DPILAVENDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHAHKK 180
Query: 180 GAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTGKYV 239
GAARVDIIGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT KYV
Sbjct: 181 GAARVDIIGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTEKYV 240
Query: 240 MWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIYSSE 299
MWMHIDDANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++YSSE
Sbjct: 241 MWMHIDDANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVYSSE 300
Query: 300 DNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEALAH 359
DNSELHIGPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEALAH
Sbjct: 301 DNSELHIGPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAH 360
Query: 360 AAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADLKDS 419
AAESI+G WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL+DS
Sbjct: 361 AAESIMGPWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADLRDS 420
Query: 420 RYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 466
RYVWLPLIVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 421 RYVWLPLIVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 467
>Medtr5g015460.1 | glycosyl hydrolase family 43 protein | HC |
chr5:5348835-5354586 | 20130731
Length = 472
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/460 (78%), Positives = 397/460 (86%), Gaps = 1/460 (0%)
Query: 8 STVRVVTGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVTHHPQFRELQE 66
+++R GGR S++ V+ Y+YV H +R EP+L V++HPQFRELQE
Sbjct: 11 TSLRCDAGGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVSNHPQFRELQE 70
Query: 67 VEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHKRAIDPMQAAG 126
VEEES+ V TTTLI+EFLDENSQ+R VFFPG KRAIDP+ A
Sbjct: 71 VEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRKRAIDPILAVE 130
Query: 127 DDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHAHKKGAARVDI 186
+DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHAHKKGAARVDI
Sbjct: 131 NDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHAHKKGAARVDI 190
Query: 187 IGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTGKYVMWMHIDD 246
IGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT KYVMWMHIDD
Sbjct: 191 IGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTEKYVMWMHIDD 250
Query: 247 ANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIYSSEDNSELHI 306
ANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++YSSEDNSELHI
Sbjct: 251 ANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVYSSEDNSELHI 310
Query: 307 GPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEALAHAAESILG 366
GPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEALAHAAESI+G
Sbjct: 311 GPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAESIMG 370
Query: 367 TWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADLKDSRYVWLPL 426
WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL+DSRYVWLPL
Sbjct: 371 PWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADLRDSRYVWLPL 430
Query: 427 IVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 466
IVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 431 IVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 470
>Medtr5g015460.2 | glycosyl hydrolase family 43 protein | HC |
chr5:5349653-5354338 | 20130731
Length = 472
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/460 (78%), Positives = 397/460 (86%), Gaps = 1/460 (0%)
Query: 8 STVRVVTGGRFSSTFAVVXXXXXXXXXXXXYTYVRHEERQVV-EPQLRVTHHPQFRELQE 66
+++R GGR S++ V+ Y+YV H +R EP+L V++HPQFRELQE
Sbjct: 11 TSLRCDAGGRCSTSVFVLSLLGCLLLFQLLYSYVHHVDRHGGGEPRLLVSNHPQFRELQE 70
Query: 67 VEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLRQVFFPGHKRAIDPMQAAG 126
VEEES+ V TTTLI+EFLDENSQ+R VFFPG KRAIDP+ A
Sbjct: 71 VEEESLHVPPPKGKRSPRAVKRRPKRTTTLIDEFLDENSQMRHVFFPGRKRAIDPILAVE 130
Query: 127 DDKYYYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYWYGEYKDGPTYHAHKKGAARVDI 186
+DKY+YYPGR+WLDTDG+PIQAHGGGIL+DK SRTYYWYGEYKDG TYHAHKKGAARVDI
Sbjct: 131 NDKYHYYPGRMWLDTDGHPIQAHGGGILYDKSSRTYYWYGEYKDGITYHAHKKGAARVDI 190
Query: 187 IGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVLERPKVIYNEKTGKYVMWMHIDD 246
IGVGCYSSKDLWTWKHEG+VLAAEET+ETHDLHKSNVLERPKVIYNEKT KYVMWMHIDD
Sbjct: 191 IGVGCYSSKDLWTWKHEGIVLAAEETDETHDLHKSNVLERPKVIYNEKTEKYVMWMHIDD 250
Query: 247 ANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTVFKDDDGAAYLIYSSEDNSELHI 306
ANYTKA+VGVAISD PDGPF+YLGS RPHGFESRDMTVFKDDDG AY++YSSEDNSELHI
Sbjct: 251 ANYTKASVGVAISDAPDGPFNYLGSHRPHGFESRDMTVFKDDDGVAYIVYSSEDNSELHI 310
Query: 307 GPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMITSGCTGWAPNEALAHAAESILG 366
GPLT+DYLNVT VM+RILVG HREAPA+FKHQGTYYMITSGCTGWAPNEALAHAAESI+G
Sbjct: 311 GPLTQDYLNVTSVMRRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAESIMG 370
Query: 367 TWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSFIFMADRWNPADLKDSRYVWLPL 426
WETMGNPC+GGNKMFRLTTFFAQSTFVLPI GFPG+FIFMADRWNPADL+DSRYVWLPL
Sbjct: 371 PWETMGNPCLGGNKMFRLTTFFAQSTFVLPISGFPGAFIFMADRWNPADLRDSRYVWLPL 430
Query: 427 IVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSSFK 466
IVAGPAD+PLEYSF FP WSRVSIYWHRKWRLPQG + F+
Sbjct: 431 IVAGPADEPLEYSFGFPWWSRVSIYWHRKWRLPQGWNPFQ 470
>Medtr4g097110.2 | glycosyl hydrolase family 43 protein | HC |
chr4:40009583-40006901 | 20130731
Length = 465
Score = 607 bits (1564), Expect = e-174, Method: Compositional matrix adjust.
Identities = 288/421 (68%), Positives = 337/421 (80%), Gaps = 15/421 (3%)
Query: 53 LRVTHHP----QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLR 108
L + H P F +LQ VE+E+ Q+ T L++EFLD++S LR
Sbjct: 52 LNIIHPPPLPSHFHQLQHVEKENFQIPPPNKKRSPQSKS-----ITPLVDEFLDQDSSLR 106
Query: 109 QVFFPGHKRAIDPMQAAG---DDKY-YYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYW 164
VFFP + IDPM+ G +D Y YYYPG++WLDTDGNPIQAHGG IL+D+ S TYYW
Sbjct: 107 HVFFP--HKTIDPMKTIGKGKNDSYNYYYPGKIWLDTDGNPIQAHGGCILYDENSSTYYW 164
Query: 165 YGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVL 224
YGEYKDGPTY + KG ARVDIIGVGCYSSKDLWTWK EG+ LAAE+T++THDLHKSNVL
Sbjct: 165 YGEYKDGPTYLHNNKGPARVDIIGVGCYSSKDLWTWKKEGIALAAEKTDKTHDLHKSNVL 224
Query: 225 ERPKVIYNEKTGKYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTV 284
ERPKVIYNEKT KYVMWMHID+ANY KA VG+A SDTP GPF YLGSQRPH ++SRDMT+
Sbjct: 225 ERPKVIYNEKTRKYVMWMHIDNANYAKATVGIAFSDTPTGPFKYLGSQRPHRYQSRDMTL 284
Query: 285 FKDDDGAAYLIYSSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMI 344
FKD+D AYLIYSSE+N+ +HIGPLTEDYLNVT VMKRI VG REAPA+FKH+GTYYM+
Sbjct: 285 FKDEDNVAYLIYSSEENNVMHIGPLTEDYLNVTSVMKRIFVGQRREAPAMFKHKGTYYMV 344
Query: 345 TSGCTGWAPNEALAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSF 404
TSGCTGWAPNEAL H+AE+ILGTWET+GNPC+ GNKMFR++TF AQSTFVLP+ FPG F
Sbjct: 345 TSGCTGWAPNEALVHSAETILGTWETIGNPCVAGNKMFRVSTFLAQSTFVLPLTRFPGLF 404
Query: 405 IFMADRWNPADLKDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSS 464
IFMADRWNP++L+DSRYVWLPLIV G DQ +Y F+ LW RVSIYWH+KW+LP G ++
Sbjct: 405 IFMADRWNPSELRDSRYVWLPLIVDGHEDQAFQYGFDNKLWPRVSIYWHKKWKLPLGWNT 464
Query: 465 F 465
F
Sbjct: 465 F 465
>Medtr4g097110.1 | glycosyl hydrolase family 43 protein | HC |
chr4:40009583-40006901 | 20130731
Length = 465
Score = 607 bits (1564), Expect = e-174, Method: Compositional matrix adjust.
Identities = 288/421 (68%), Positives = 337/421 (80%), Gaps = 15/421 (3%)
Query: 53 LRVTHHP----QFRELQEVEEESIQVXXXXXXXXXXXXXXXXXXTTTLIEEFLDENSQLR 108
L + H P F +LQ VE+E+ Q+ T L++EFLD++S LR
Sbjct: 52 LNIIHPPPLPSHFHQLQHVEKENFQIPPPNKKRSPQSKS-----ITPLVDEFLDQDSSLR 106
Query: 109 QVFFPGHKRAIDPMQAAG---DDKY-YYYPGRVWLDTDGNPIQAHGGGILFDKYSRTYYW 164
VFFP + IDPM+ G +D Y YYYPG++WLDTDGNPIQAHGG IL+D+ S TYYW
Sbjct: 107 HVFFP--HKTIDPMKTIGKGKNDSYNYYYPGKIWLDTDGNPIQAHGGCILYDENSSTYYW 164
Query: 165 YGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKHEGVVLAAEETNETHDLHKSNVL 224
YGEYKDGPTY + KG ARVDIIGVGCYSSKDLWTWK EG+ LAAE+T++THDLHKSNVL
Sbjct: 165 YGEYKDGPTYLHNNKGPARVDIIGVGCYSSKDLWTWKKEGIALAAEKTDKTHDLHKSNVL 224
Query: 225 ERPKVIYNEKTGKYVMWMHIDDANYTKAAVGVAISDTPDGPFDYLGSQRPHGFESRDMTV 284
ERPKVIYNEKT KYVMWMHID+ANY KA VG+A SDTP GPF YLGSQRPH ++SRDMT+
Sbjct: 225 ERPKVIYNEKTRKYVMWMHIDNANYAKATVGIAFSDTPTGPFKYLGSQRPHRYQSRDMTL 284
Query: 285 FKDDDGAAYLIYSSEDNSELHIGPLTEDYLNVTPVMKRILVGHHREAPAIFKHQGTYYMI 344
FKD+D AYLIYSSE+N+ +HIGPLTEDYLNVT VMKRI VG REAPA+FKH+GTYYM+
Sbjct: 285 FKDEDNVAYLIYSSEENNVMHIGPLTEDYLNVTSVMKRIFVGQRREAPAMFKHKGTYYMV 344
Query: 345 TSGCTGWAPNEALAHAAESILGTWETMGNPCMGGNKMFRLTTFFAQSTFVLPIPGFPGSF 404
TSGCTGWAPNEAL H+AE+ILGTWET+GNPC+ GNKMFR++TF AQSTFVLP+ FPG F
Sbjct: 345 TSGCTGWAPNEALVHSAETILGTWETIGNPCVAGNKMFRVSTFLAQSTFVLPLTRFPGLF 404
Query: 405 IFMADRWNPADLKDSRYVWLPLIVAGPADQPLEYSFEFPLWSRVSIYWHRKWRLPQGLSS 464
IFMADRWNP++L+DSRYVWLPLIV G DQ +Y F+ LW RVSIYWH+KW+LP G ++
Sbjct: 405 IFMADRWNPSELRDSRYVWLPLIVDGHEDQAFQYGFDNKLWPRVSIYWHKKWKLPLGWNT 464
Query: 465 F 465
F
Sbjct: 465 F 465