Miyakogusa Predicted Gene
- Lj0g3v0171059.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0171059.1 CUFF.10736.1
(759 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G54440.1 | Symbols: | glycoside hydrolase family 2 protein |... 1107 0.0
AT3G54440.3 | Symbols: | glycoside hydrolase family 2 protein |... 1106 0.0
AT3G54440.2 | Symbols: | glycoside hydrolase family 2 protein |... 1102 0.0
>AT3G54440.1 | Symbols: | glycoside hydrolase family 2 protein |
chr3:20148494-20157019 REVERSE LENGTH=1107
Length = 1107
Score = 1107 bits (2863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/759 (67%), Positives = 615/759 (81%), Gaps = 3/759 (0%)
Query: 1 MLVGKLQSPKLWSAEQPYLYTLVVTLKDQTGRVVDCESCPVGFRKVSKAHKQLLVNGHAV 60
+L GKL SP LWSAEQP +Y LV+TLKD +G+V+D ES VG R+VSKA KQLLVNGH V
Sbjct: 347 LLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSESSIVGIRQVSKAFKQLLVNGHPV 406
Query: 61 VIRGVNRHEHHPQVGKTNIESCMVKDLVLMKQNNINAVRNSHYPQHPRWYELCDLFGMYM 120
VI+GVNRHEHHP+VGKTNIE+CMVKDL++MK+ NINAVRNSHYPQHPRWYELCDLFGMYM
Sbjct: 407 VIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYM 466
Query: 121 IDEANIETHGFYDCKYFKHPTLEPSWATAMLDRVIGMVERDKNHTCIISWSLGNESEFGT 180
IDEANIETHGF + KHP EPSWA AMLDRV+GMVERDKNHTCIISWSLGNE+ +G
Sbjct: 467 IDEANIETHGFDLSGHLKHPAKEPSWAAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGP 526
Query: 181 NHFALAGWIRGRDSTRVLHYEGGGSRTPCTDIVCPMYMRVWDILKIADDPTETRPLILCE 240
NH A+AGWIR +D +R++HYEGGGSRT TDIVCPMYMRVWDI+KIA D E+RPLILCE
Sbjct: 527 NHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYMRVWDIIKIALDQNESRPLILCE 586
Query: 241 YSHAMGNSNGNLHIYWEAIDKTFGLQGGFIWDWVDQGLWKVGADGKKHWAYGGDFGDIPN 300
Y HAMGNSNGN+ YWEAID TFGLQGGFIWDWVDQGL K+G+DG K WAYGGDFGD PN
Sbjct: 587 YQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPN 646
Query: 301 DLNFCLNGLIWPDRTPHPVLHEVKHLYQPIKVTLNEGKLEIKNTHFFQTTEGLEFSWYVS 360
DLNFCLNGLIWPDRTPHP LHEVKH YQPIKV+L +G +++ NT+FF TTE LEFSW +
Sbjct: 647 DLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGMIKVANTYFFNTTEELEFSWTIH 706
Query: 361 AVGCNLGSGILSLAPIKPQSSYVVDWQSGPWYSLWAXXXXXXVFLSISAKLLNTTRWVEA 420
G LGSG LS+ IKPQ+S+ ++W+SGPW+S W +FL+I+AKLLN TR +EA
Sbjct: 707 GDGLELGSGTLSIPVIKPQNSFEMEWKSGPWFSFWNDSNAGELFLTINAKLLNLTRSLEA 766
Query: 421 GHIVSTAQIQLPSRREIVPHAINISGGTLLAQTLGDTIKVSQLDDWDLTLNTKTGLIESW 480
GH++S+ QI LP++ +I+P AI + ++ +T+GD IK+SQ D W+L +N + G IE W
Sbjct: 767 GHLLSSTQIPLPAKGQIIPQAIKKTDTSITCETVGDFIKISQKDSWELMVNVRKGTIEGW 826
Query: 481 KVKGIPVMNKGILPSFWRAPTDNDKGGEAASYLSRWKAAGMDTLHFITESCSVQNITQNM 540
K++G+ +MN+ ILP FWRAPTDNDKGG +SY SRWKAA +D + F+ ESCSV++IT
Sbjct: 827 KIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWKAAQLDNVEFLVESCSVKSITDKS 886
Query: 541 VKVRVVFL---GVTKDESKVLFTTDMTYTIYASGDVIMECNVKPNPDLPPLPRVGIEVKV 597
V++ ++L +S LF ++TY IY SGD+I V+PN DLPPLPRVGIE +
Sbjct: 887 VEIEFIYLGSSASGSSKSDALFKVNVTYLIYGSGDIITNWFVEPNSDLPPLPRVGIEFHI 946
Query: 598 EKPLDQVTWYGRGPFECYPDRKAAAEVAVYEKNVNDLHVPYIVPGECSGRADVRWATFRD 657
EK LD+V WYG+GPFECYPDRKAAA VA+YE NV D+HVPYIVPGE GR DVRW TFR+
Sbjct: 947 EKTLDRVEWYGKGPFECYPDRKAAAHVAIYEHNVGDMHVPYIVPGENGGRTDVRWVTFRN 1006
Query: 658 KSGFGIYASKYGSSPPMQMSASYYSTSELERATHNQDLIEGDSIEIHLDHKHMGLGGDDS 717
K G GIYAS YGSS MQM+ASYY+T EL RATH +DLI+G +IE+HLDHKHMGLGGDDS
Sbjct: 1007 KDGVGIYASTYGSSSLMQMNASYYTTGELHRATHEEDLIKGQNIEVHLDHKHMGLGGDDS 1066
Query: 718 WSPCVHEQYLIPPVPYSFSVRLCPVIPSTSGHDIYRSQL 756
W+PCVH+++LIPP YSFS+RLCP+ STSG +IY+ QL
Sbjct: 1067 WTPCVHDKFLIPPAQYSFSLRLCPITASTSGLNIYKDQL 1105
>AT3G54440.3 | Symbols: | glycoside hydrolase family 2 protein |
chr3:20148494-20157019 REVERSE LENGTH=1120
Length = 1120
Score = 1106 bits (2860), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/759 (67%), Positives = 615/759 (81%), Gaps = 3/759 (0%)
Query: 1 MLVGKLQSPKLWSAEQPYLYTLVVTLKDQTGRVVDCESCPVGFRKVSKAHKQLLVNGHAV 60
+L GKL SP LWSAEQP +Y LV+TLKD +G+V+D ES VG R+VSKA KQLLVNGH V
Sbjct: 360 LLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSESSIVGIRQVSKAFKQLLVNGHPV 419
Query: 61 VIRGVNRHEHHPQVGKTNIESCMVKDLVLMKQNNINAVRNSHYPQHPRWYELCDLFGMYM 120
VI+GVNRHEHHP+VGKTNIE+CMVKDL++MK+ NINAVRNSHYPQHPRWYELCDLFGMYM
Sbjct: 420 VIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYM 479
Query: 121 IDEANIETHGFYDCKYFKHPTLEPSWATAMLDRVIGMVERDKNHTCIISWSLGNESEFGT 180
IDEANIETHGF + KHP EPSWA AMLDRV+GMVERDKNHTCIISWSLGNE+ +G
Sbjct: 480 IDEANIETHGFDLSGHLKHPAKEPSWAAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGP 539
Query: 181 NHFALAGWIRGRDSTRVLHYEGGGSRTPCTDIVCPMYMRVWDILKIADDPTETRPLILCE 240
NH A+AGWIR +D +R++HYEGGGSRT TDIVCPMYMRVWDI+KIA D E+RPLILCE
Sbjct: 540 NHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYMRVWDIIKIALDQNESRPLILCE 599
Query: 241 YSHAMGNSNGNLHIYWEAIDKTFGLQGGFIWDWVDQGLWKVGADGKKHWAYGGDFGDIPN 300
Y HAMGNSNGN+ YWEAID TFGLQGGFIWDWVDQGL K+G+DG K WAYGGDFGD PN
Sbjct: 600 YQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPN 659
Query: 301 DLNFCLNGLIWPDRTPHPVLHEVKHLYQPIKVTLNEGKLEIKNTHFFQTTEGLEFSWYVS 360
DLNFCLNGLIWPDRTPHP LHEVKH YQPIKV+L +G +++ NT+FF TTE LEFSW +
Sbjct: 660 DLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGMIKVANTYFFNTTEELEFSWTIH 719
Query: 361 AVGCNLGSGILSLAPIKPQSSYVVDWQSGPWYSLWAXXXXXXVFLSISAKLLNTTRWVEA 420
G LGSG LS+ IKPQ+S+ ++W+SGPW+S W +FL+I+AKLLN TR +EA
Sbjct: 720 GDGLELGSGTLSIPVIKPQNSFEMEWKSGPWFSFWNDSNAGELFLTINAKLLNLTRSLEA 779
Query: 421 GHIVSTAQIQLPSRREIVPHAINISGGTLLAQTLGDTIKVSQLDDWDLTLNTKTGLIESW 480
GH++S+ QI LP++ +I+P AI + ++ +T+GD IK+SQ D W+L +N + G IE W
Sbjct: 780 GHLLSSTQIPLPAKGQIIPQAIKKTDTSITCETVGDFIKISQKDSWELMVNVRKGTIEGW 839
Query: 481 KVKGIPVMNKGILPSFWRAPTDNDKGGEAASYLSRWKAAGMDTLHFITESCSVQNITQNM 540
K++G+ +MN+ ILP FWRAPTDNDKGG +SY SRWKAA +D + F+ ESCSV++IT
Sbjct: 840 KIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWKAAQLDNVEFLVESCSVKSITDKS 899
Query: 541 VKVRVVFL---GVTKDESKVLFTTDMTYTIYASGDVIMECNVKPNPDLPPLPRVGIEVKV 597
V++ ++L +S LF ++TY IY SGD+I V+PN DLPPLPRVGIE +
Sbjct: 900 VEIEFIYLGSSASGSSKSDALFKVNVTYLIYGSGDIITNWFVEPNSDLPPLPRVGIEFHI 959
Query: 598 EKPLDQVTWYGRGPFECYPDRKAAAEVAVYEKNVNDLHVPYIVPGECSGRADVRWATFRD 657
EK LD+V WYG+GPFECYPDRKAAA VA+YE NV D+HVPYIVPGE GR DVRW TFR+
Sbjct: 960 EKTLDRVEWYGKGPFECYPDRKAAAHVAIYEHNVGDMHVPYIVPGENGGRTDVRWVTFRN 1019
Query: 658 KSGFGIYASKYGSSPPMQMSASYYSTSELERATHNQDLIEGDSIEIHLDHKHMGLGGDDS 717
K G GIYAS YGSS MQM+ASYY+T EL RATH +DLI+G +IE+HLDHKHMGLGGDDS
Sbjct: 1020 KDGVGIYASTYGSSSLMQMNASYYTTGELHRATHEEDLIKGQNIEVHLDHKHMGLGGDDS 1079
Query: 718 WSPCVHEQYLIPPVPYSFSVRLCPVIPSTSGHDIYRSQL 756
W+PCVH+++LIPP YSFS+RLCP+ STSG +IY+ QL
Sbjct: 1080 WTPCVHDKFLIPPAQYSFSLRLCPITASTSGLNIYKDQL 1118
>AT3G54440.2 | Symbols: | glycoside hydrolase family 2 protein |
chr3:20148494-20157019 REVERSE LENGTH=1108
Length = 1108
Score = 1102 bits (2851), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/760 (67%), Positives = 615/760 (80%), Gaps = 4/760 (0%)
Query: 1 MLVGKLQSPKLWSAEQPYLYTLVVTLKDQTGRVVDCESCPVGFRKVSKAHKQLLVNGHAV 60
+L GKL SP LWSAEQP +Y LV+TLKD +G+V+D ES VG R+VSKA KQLLVNGH V
Sbjct: 347 LLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSESSIVGIRQVSKAFKQLLVNGHPV 406
Query: 61 VIRGVNRHEHHPQVGKTNIESCMVKDLVLMKQNNINAVRNSHYPQHPRWYELCDLFGMYM 120
VI+GVNRHEHHP+VGKTNIE+CMVKDL++MK+ NINAVRNSHYPQHPRWYELCDLFGMYM
Sbjct: 407 VIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYM 466
Query: 121 IDEANIETHGFYDCKYFKHPTLEPSWATAMLDRVIGMVERDKNHTCIISWSLGNESEFGT 180
IDEANIETHGF + KHP EPSWA AMLDRV+GMVERDKNHTCIISWSLGNE+ +G
Sbjct: 467 IDEANIETHGFDLSGHLKHPAKEPSWAAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGP 526
Query: 181 NHFALAGWIRGRDSTRVLHYEGGGSRTPCTDIVCPMYMRVWDILKIADDPTETRPLILCE 240
NH A+AGWIR +D +R++HYEGGGSRT TDIVCPMYMRVWDI+KIA D E+RPLILCE
Sbjct: 527 NHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYMRVWDIIKIALDQNESRPLILCE 586
Query: 241 YSHAMGNSNGNLHIYWEAIDKTFGLQGGFIWDWVDQGLWKVGADGKKHWAYGGDFGDIPN 300
Y HAMGNSNGN+ YWEAID TFGLQGGFIWDWVDQGL K+G+DG K WAYGGDFGD PN
Sbjct: 587 YQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPN 646
Query: 301 DLNFCLNGLIWPDRTPHPVLHEVKHLYQPIKVTLNEGKLEIKNTHFFQTTEGLEFSWYVS 360
DLNFCLNGLIWPDRTPHP LHEVKH YQPIKV+L +G +++ NT+FF TTE LEFSW +
Sbjct: 647 DLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGMIKVANTYFFNTTEELEFSWTIH 706
Query: 361 AVGCNLGSGILSLAPIKPQSSYVVDWQSGPWYSLWAXXXXXXVFLSISAKLLNTTRWVEA 420
G LGSG LS+ IKPQ+S+ ++W+SGPW+S W +FL+I+AKLLN TR +EA
Sbjct: 707 GDGLELGSGTLSIPVIKPQNSFEMEWKSGPWFSFWNDSNAGELFLTINAKLLNLTRSLEA 766
Query: 421 GHIVSTAQIQLPSRREIVPHAINISGGTLLAQTLGDTIKVSQLDDWDLTLNTKTGLIESW 480
GH++S+ QI LP++ +I+P AI + ++ +T+GD IK+SQ D W+L +N + G IE W
Sbjct: 767 GHLLSSTQIPLPAKGQIIPQAIKKTDTSITCETVGDFIKISQKDSWELMVNVRKGTIEGW 826
Query: 481 KVKGIPVMNKGILPSFWRAPTDNDKGGEAASYLSRWKAAGMDTLHFITESCSVQNITQNM 540
K++G+ +MN+ ILP FWRAPTDNDKGG +SY SRWKAA +D + F+ ESCSV++IT
Sbjct: 827 KIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWKAAQLDNVEFLVESCSVKSITDKS 886
Query: 541 VKVRVVFL---GVTKDESKVLFTTDMTYTIYASGDVIMECNVKPNPDLPPLPRVGIEVKV 597
V++ ++L +S LF ++TY IY SGD+I V+PN DLPPLPRVGIE +
Sbjct: 887 VEIEFIYLGSSASGSSKSDALFKVNVTYLIYGSGDIITNWFVEPNSDLPPLPRVGIEFHI 946
Query: 598 EKPLDQVTWYGRGPFECYPDRKAAAEVAVYEKNVNDLHVPYIVPGECSGRADVRWATFRD 657
EK LD+V WYG+GPFECYPDRKAAA VA+YE NV D+HVPYIVPGE GR DVRW TFR+
Sbjct: 947 EKTLDRVEWYGKGPFECYPDRKAAAHVAIYEHNVGDMHVPYIVPGENGGRTDVRWVTFRN 1006
Query: 658 KSGFGIYASKYGSSPPMQMSASYYSTSELERATHNQDLIEGDSIE-IHLDHKHMGLGGDD 716
K G GIYAS YGSS MQM+ASYY+T EL RATH +DLI+G +IE +HLDHKHMGLGGDD
Sbjct: 1007 KDGVGIYASTYGSSSLMQMNASYYTTGELHRATHEEDLIKGQNIEVVHLDHKHMGLGGDD 1066
Query: 717 SWSPCVHEQYLIPPVPYSFSVRLCPVIPSTSGHDIYRSQL 756
SW+PCVH+++LIPP YSFS+RLCP+ STSG +IY+ QL
Sbjct: 1067 SWTPCVHDKFLIPPAQYSFSLRLCPITASTSGLNIYKDQL 1106