Miyakogusa Predicted Gene
- Lj2g3v1734780.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1734780.1 Non Characterized Hit- tr|I3SYV6|I3SYV6_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,98.4,0,Galactose
mutarotase-like,Glycoside hydrolase-type carbohydrate-binding; no
description,Glycoside hy,CUFF.37769.1
(312 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

AT5G66530.2 | Symbols: | Galactose mutarotase-like superfamily ...   470   e-133
AT5G66530.1 | Symbols: | Galactose mutarotase-like superfamily ...   470   e-133
AT5G57330.1 | Symbols: | Galactose mutarotase-like superfamily ...   147   1e-35
AT3G01590.1 | Symbols: | Galactose mutarotase-like superfamily ...   144   7e-35
AT3G01590.2 | Symbols: | Galactose mutarotase-like superfamily ...   144   7e-35
AT4G23730.1 | Symbols: | Galactose mutarotase-like superfamily ...   143   2e-34
AT5G14500.1 | Symbols: | aldose 1-epimerase family protein | ch...   140   1e-33
AT4G25900.1 | Symbols: | Galactose mutarotase-like superfamily ...   139   2e-33
AT3G61610.1 | Symbols: | Galactose mutarotase-like superfamily ...   122   2e-28
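
The hit summary above can be read programmatically. The following is a minimal sketch, not part of the BLAST output: the names `HIT_ROW`, `parse_evalue` and `parse_hit_row` are hypothetical, and the row layout (identifier, free text, bit score, E-value as the last two tokens) is an assumption drawn from the rows in this report only. Note that this legacy format prints E-values such as `e-133` without a leading digit, which Python's `float` does not accept as written.

```python
import re

# Assumed row layout: identifier, " | " free text, bit score, E-value (last two tokens).
HIT_ROW = re.compile(
    r"^(?P<id>\S+)\s+\|.*?\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert an E-value string to float; a bare 'e-133' is read as 1e-133."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit_row(line):
    """Return (identifier, bit_score, e_value), or None if the line is not a hit row."""
    m = HIT_ROW.match(line.strip())
    if m is None:
        return None
    return m.group("id"), float(m.group("bits")), parse_evalue(m.group("evalue"))


rows = [
    "AT5G66530.2 | Symbols: | Galactose mutarotase-like superfamily ... 470 e-133",
    "AT5G14500.1 | Symbols: | aldose 1-epimerase family protein | ch... 140 1e-33",
]
for row in rows:
    print(parse_hit_row(row))
```

The lazy `.*?` lets the free-text description contain digits (as in "1-epimerase") without being mistaken for the score column, because the score must be followed only by the final E-value token.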
>AT5G66530.2 | Symbols: | Galactose mutarotase-like superfamily
protein | chr5:26553821-26555575 REVERSE LENGTH=307
Length = 307
Score = 470 bits (1209), Expect = e-133, Method: Compositional matrix adjust.
Identities = 224/312 (71%), Positives = 253/312 (81%), Gaps = 5/312 (1%)
Query: 1 MATVSMAFCVNTVTFSNHGRHNRSRGMAFASLGKEATTLGVKLTEGEGSLPKLVLTSPAG 60
MA VS++ T N R +R R A AS +T GV++ EGEG+LPKLVLTSP
Sbjct: 1 MAIVSVSNSFLTFNSPNQLRFSRRRFSAMAS-----STTGVRVAEGEGNLPKLVLTSPQN 55
Query: 61 SEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISGGVPHCFPQFGPGPIQQHGFA 120
SEAEIYLFGGCITSWKV SG DLLFVRPDAVFNK KPISGG+PHCFPQFGPG IQQHGF
Sbjct: 56 SEAEIYLFGGCITSWKVASGKDLLFVRPDAVFNKIKPISGGIPHCFPQFGPGLIQQHGFG 115
Query: 121 RNMDWTVADSESVEGNPVVTLELKDAPYSRDIWDFSFHALFKVTLNAKSLSTELKVKNTD 180
RNMDW+V DS++ + N VTLELKD PYSR +WDF+F AL+KV + A SLSTELK+ NTD
Sbjct: 116 RNMDWSVVDSQNADDNAAVTLELKDGPYSRAMWDFAFQALYKVIVGADSLSTELKITNTD 175
Query: 181 NKAFSFNTALHTYFRASVSGTSVKGLKGCKTLNKHPDPNNPVEGTEERDVVTFPGFVDCI 240
+K FSF+TALHTYFRAS +G SV+GLKGCKTLNK PDP NP+EG E+RD VTFPGFVD +
Sbjct: 176 DKPFSFSTALHTYFRASSAGASVRGLKGCKTLNKDPDPKNPIEGKEDRDAVTFPGFVDTV 235
Query: 241 YLDAANELQLDNGLGDLISIKNTNWSDAVLWNPHLQMEACYKDFVCVENAKIGSVQLEPE 300
YLDA NELQ DNGLGD I IKNTNWSDAVLWNPH QMEACY+DFVCVENAK+G V+LEP
Sbjct: 236 YLDAPNELQFDNGLGDKIIIKNTNWSDAVLWNPHTQMEACYRDFVCVENAKLGDVKLEPG 295
Query: 301 QTWTAVQHLSIA 312
Q+WTA Q LSI+
Sbjct: 296 QSWTATQLLSIS 307
>AT5G66530.1 | Symbols: | Galactose mutarotase-like superfamily
protein | chr5:26553821-26555575 REVERSE LENGTH=307
Length = 307
Score = 470 bits (1209), Expect = e-133, Method: Compositional matrix adjust.
Identities = 224/312 (71%), Positives = 253/312 (81%), Gaps = 5/312 (1%)
Query: 1 MATVSMAFCVNTVTFSNHGRHNRSRGMAFASLGKEATTLGVKLTEGEGSLPKLVLTSPAG 60
MA VS++ T N R +R R A AS +T GV++ EGEG+LPKLVLTSP
Sbjct: 1 MAIVSVSNSFLTFNSPNQLRFSRRRFSAMAS-----STTGVRVAEGEGNLPKLVLTSPQN 55
Query: 61 SEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISGGVPHCFPQFGPGPIQQHGFA 120
SEAEIYLFGGCITSWKV SG DLLFVRPDAVFNK KPISGG+PHCFPQFGPG IQQHGF
Sbjct: 56 SEAEIYLFGGCITSWKVASGKDLLFVRPDAVFNKIKPISGGIPHCFPQFGPGLIQQHGFG 115
Query: 121 RNMDWTVADSESVEGNPVVTLELKDAPYSRDIWDFSFHALFKVTLNAKSLSTELKVKNTD 180
RNMDW+V DS++ + N VTLELKD PYSR +WDF+F AL+KV + A SLSTELK+ NTD
Sbjct: 116 RNMDWSVVDSQNADDNAAVTLELKDGPYSRAMWDFAFQALYKVIVGADSLSTELKITNTD 175
Query: 181 NKAFSFNTALHTYFRASVSGTSVKGLKGCKTLNKHPDPNNPVEGTEERDVVTFPGFVDCI 240
+K FSF+TALHTYFRAS +G SV+GLKGCKTLNK PDP NP+EG E+RD VTFPGFVD +
Sbjct: 176 DKPFSFSTALHTYFRASSAGASVRGLKGCKTLNKDPDPKNPIEGKEDRDAVTFPGFVDTV 235
Query: 241 YLDAANELQLDNGLGDLISIKNTNWSDAVLWNPHLQMEACYKDFVCVENAKIGSVQLEPE 300
YLDA NELQ DNGLGD I IKNTNWSDAVLWNPH QMEACY+DFVCVENAK+G V+LEP
Sbjct: 236 YLDAPNELQFDNGLGDKIIIKNTNWSDAVLWNPHTQMEACYRDFVCVENAKLGDVKLEPG 295
Query: 301 QTWTAVQHLSIA 312
Q+WTA Q LSI+
Sbjct: 296 QSWTATQLLSIS 307
>AT5G57330.1 | Symbols: | Galactose mutarotase-like superfamily
protein | chr5:23218392-23220664 FORWARD LENGTH=312
Length = 312
Score = 147 bits (371), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 100/294 (34%), Positives = 144/294 (48%), Gaps = 33/294 (11%)
Query: 42 KLTEGEGSLPKLVLTSPAGSEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISGG 101
+L +G L K+VL G AE+YL+G +TSWK +G +LL + A+F KPI GG
Sbjct: 9 ELAKGINGLDKIVLRESRGRSAEVYLYGSHVTSWKNENGEELLHLSSKAIFKPPKPIRGG 68
Query: 102 VPHCFPQFGP-GPIQQHGFARNMDWTVADSESVEGNP-----------VVTLELKDAPYS 149
+P CFPQF G ++ HGFARN W VE NP V L L+
Sbjct: 69 IPLCFPQFSNFGTLESHGFARNRIW------EVEANPPPLPLNSCSSAFVDLILRPTEDD 122
Query: 150 RDIWDFSFHALFKVTLNAK---SLSTELKVKNTDNKAFSFNTALHTYFRASVSGTSVKGL 206
IW +F ++ L + +L++ ++ N+D K F+F A HTYF SVS S +
Sbjct: 123 LKIWPNNFEFRLRIALGTEGELTLTSRIRNTNSDGKPFTFTFAYHTYF--SVSDISEVRV 180
Query: 207 KGCKTLNKHPDPNNPVEGTEERDVVTFPGFVDCIYLDAANELQ-LDNGLGDLISIKNTNW 265
+G +TL+ + + TE+ D +TF VD IYL ++ LD+ I+
Sbjct: 181 EGLETLDYLDNLKDRERFTEQGDAITFESEVDKIYLSTPTKIAILDHEKKRTFVIRKDGL 240
Query: 266 SDAVLWNPHLQMEAC--------YKDFVCVENAKIGS-VQLEPEQTWTAVQHLS 310
+DAV+WNP + YK +CVE A I + L+P + W LS
Sbjct: 241 ADAVVWNPWDKKSKTISDLGDEDYKHMLCVEAAAIERPITLKPGEEWKGRLELS 294
>AT3G01590.1 | Symbols: | Galactose mutarotase-like superfamily
protein | chr3:226647-228346 FORWARD LENGTH=306
Length = 306
Score = 144 bits (363), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 145/283 (51%), Gaps = 18/283 (6%)
Query: 45 EGEGSLPKLVLTSPAGSEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISGGVPH 104
+G+GS +++LT P GS AE+ LFGG + SWK +LL++ A + K I GG+P
Sbjct: 8 DGDGS-SRIILTEPRGSTAEVLLFGGQVISWKNERREELLYMSSKAQYKPPKAIRGGIPV 66
Query: 105 CFPQFGP-GPIQQHGFARNMDWTVADSES----VEGNPVVTLELKDAPYSRDIWDFSFHA 159
CFPQFG G +++HGFARN W+ + S V L LK W SF
Sbjct: 67 CFPQFGNFGGLERHGFARNKFWSHDEDPSPLPPANKQSSVDLILKSTEDDLKTWPHSFEL 126
Query: 160 LFKVTLNAKSLSTELKVKNTDNKAFSFNTALHTYFRASVSGTSVKGLKGCKTLNKHPDPN 219
+++++ L+ +V+N D+KAFSF AL Y VS S ++G +TL+ +
Sbjct: 127 RIRISISPGKLTLIPRVRNIDSKAFSFMFALRNYL--YVSDISEVRVEGLETLDYLDNLI 184
Query: 220 NPVEGTEERDVVTFPGFVDCIYLDAANELQ-LDNGLGDLISIKNTNWSDAVLWNPHLQM- 277
TE+ D +TF G VD +YL+ ++ +D+ I ++ +AV+WNP +
Sbjct: 185 GKERFTEQADAITFDGEVDRVYLNTPTKIAVIDHERKRTIELRKEGMPNAVVWNPWDKKA 244
Query: 278 -------EACYKDFVCVENAKIGS-VQLEPEQTWTAVQHLSIA 312
+ YK +CV++ I V L+P + W Q LSI
Sbjct: 245 KTIADMGDEDYKTMLCVDSGVIEPLVLLKPREEWKGRQELSIV 287
>AT3G01590.2 | Symbols: | Galactose mutarotase-like superfamily
protein | chr3:226647-228346 FORWARD LENGTH=306
Length = 306
Score = 144 bits (363), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 145/283 (51%), Gaps = 18/283 (6%)
Query: 45 EGEGSLPKLVLTSPAGSEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISGGVPH 104
+G+GS +++LT P GS AE+ LFGG + SWK +LL++ A + K I GG+P
Sbjct: 8 DGDGS-SRIILTEPRGSTAEVLLFGGQVISWKNERREELLYMSSKAQYKPPKAIRGGIPV 66
Query: 105 CFPQFGP-GPIQQHGFARNMDWTVADSES----VEGNPVVTLELKDAPYSRDIWDFSFHA 159
CFPQFG G +++HGFARN W+ + S V L LK W SF
Sbjct: 67 CFPQFGNFGGLERHGFARNKFWSHDEDPSPLPPANKQSSVDLILKSTEDDLKTWPHSFEL 126
Query: 160 LFKVTLNAKSLSTELKVKNTDNKAFSFNTALHTYFRASVSGTSVKGLKGCKTLNKHPDPN 219
+++++ L+ +V+N D+KAFSF AL Y VS S ++G +TL+ +
Sbjct: 127 RIRISISPGKLTLIPRVRNIDSKAFSFMFALRNYL--YVSDISEVRVEGLETLDYLDNLI 184
Query: 220 NPVEGTEERDVVTFPGFVDCIYLDAANELQ-LDNGLGDLISIKNTNWSDAVLWNPHLQM- 277
TE+ D +TF G VD +YL+ ++ +D+ I ++ +AV+WNP +
Sbjct: 185 GKERFTEQADAITFDGEVDRVYLNTPTKIAVIDHERKRTIELRKEGMPNAVVWNPWDKKA 244
Query: 278 -------EACYKDFVCVENAKIGS-VQLEPEQTWTAVQHLSIA 312
+ YK +CV++ I V L+P + W Q LSI
Sbjct: 245 KTIADMGDEDYKTMLCVDSGVIEPLVLLKPREEWKGRQELSIV 287
>AT4G23730.1 | Symbols: | Galactose mutarotase-like superfamily
protein | chr4:12362955-12364792 FORWARD LENGTH=306
Length = 306
Score = 143 bits (360), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 96/291 (32%), Positives = 149/291 (51%), Gaps = 21/291 (7%)
Query: 41 VKLTEGEGSLPKLVLTSPAGSEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISG 100
V L + +++L +P G+ +I L GG + SWK G++LLF A P+ G
Sbjct: 15 VDLVKDRNGTDQILLQNPRGASVKISLHGGQVLSWKTDKGDELLFNSTKANLKPPHPVRG 74
Query: 101 GVPHCFPQFGP-GPIQQHGFARNMDWTVAD------SESVEGNPVVTLELKDAPY-SRDI 152
G+P CFPQFG G ++QHGFARN W V + S G V L LK + + I
Sbjct: 75 GIPICFPQFGTRGSLEQHGFARNKMWLVENNPPALPSFDSTGKAYVDLVLKSSDEDTMRI 134
Query: 153 WDFSFHALFKVTLNAK-SLSTELKVKNTDNKAFSFNTALHTYFRASVSGTSVKGLKGCKT 211
W +SF +V+L +L+ +V+N ++K FSF+ A HTYF S+S S L+G +T
Sbjct: 135 WPYSFEFHLRVSLALDGNLTLISRVRNINSKPFSFSIAYHTYF--SISDISEVRLEGLET 192
Query: 212 LNKHPDPNNPVEGTEERDVVTFPGFVDCIYLDAANELQL-DNGLGDLISIKNTNWSDAVL 270
L+ + ++ TE+ D +TF +D +YL++ + + + D+ IK D V+
Sbjct: 193 LDYLDNMHDRERFTEQGDALTFESEIDRVYLNSKDVVAIFDHERKRTFLIKKEGLPDVVV 252
Query: 271 WNPHLQMEAC--------YKDFVCVENAKIGS-VQLEPEQTWTAVQHLSIA 312
WNP + Y+ +CV+ A I + L+P + WT HLS+
Sbjct: 253 WNPWEKKARALTDLGDDEYRHMLCVDGAAIEKPITLKPGEEWTGKLHLSLV 303
>AT5G14500.1 | Symbols: | aldose 1-epimerase family protein |
chr5:4674503-4676368 REVERSE LENGTH=306
Length = 306
Score = 140 bits (352), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 139/276 (50%), Gaps = 17/276 (6%)
Query: 52 KLVLTSPAGSEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISGGVPHCFPQFGP 111
+++LT PAGS AE+ L+GG + SWK LL++ A K I GG+P FPQFG
Sbjct: 14 RILLTDPAGSTAEVLLYGGQVVSWKNERREKLLYMSTKAQLKPPKAIRGGLPISFPQFGN 73
Query: 112 -GPIQQHGFARNMDWTVADSES----VEGNPVVTLELKDAPYSRDIWDFSFHALFKVTLN 166
G +++HGFARN W++ + S V L LK IW SF +++++
Sbjct: 74 FGALERHGFARNRFWSLDNDPSPLPPANQQSTVDLVLKSTEDDLKIWPHSFELRVRISIS 133
Query: 167 AKSLSTELKVKNTDNKAFSFNTALHTYFRASVSGTSVKGLKGCKTLNKHPDPNNPVEGTE 226
L+ +V+NTD KAFSF +L Y VS S ++G +TL+ + TE
Sbjct: 134 PGKLTIIPRVRNTDTKAFSFMFSLRNYL--YVSDISEVRVEGLETLDYLDNLMRRERFTE 191
Query: 227 ERDVVTFPGFVDCIYLDAANELQ-LDNGLGDLISIKNTNWSDAVLWNPHLQM-------- 277
+ D +TF G VD +YL+ ++ +D+ I ++ +A +WNP +
Sbjct: 192 QADAITFDGEVDKVYLNTPTKIAIIDHERKRTIELRKEGMPNAAVWNPWDKKAKSIADMG 251
Query: 278 EACYKDFVCVENAKIGS-VQLEPEQTWTAVQHLSIA 312
+ Y +CV++ I S + L+P + W Q LSI
Sbjct: 252 DEDYTTMLCVDSGAIESPIVLKPHEEWKGRQELSIV 287
>AT4G25900.1 | Symbols: | Galactose mutarotase-like superfamily
protein | chr4:13161487-13163397 FORWARD LENGTH=318
Length = 318
Score = 139 bits (351), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 44 TEGEGSLPKLVLTSPAGSEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISGGVP 103
T+G L K+++ G AE+YL+GG ++SWK +G +LL + A+F PI GG+P
Sbjct: 34 TKGVNGLDKIIIRDRRGRSAEVYLYGGQVSSWKNENGEELLVMSSKAIFQPPTPIRGGIP 93
Query: 104 HCFPQFG-PGPIQQHGFARNMDWTVADSESVEGNPV---------VTLELKDAPYSRDIW 153
FPQ+ GP+ HGF R W VE P V L ++ + IW
Sbjct: 94 VLFPQYSNTGPLPSHGFVRQRFW------EVETKPPPLPSLSTAHVDLIVRSSNEDLKIW 147
Query: 154 DFSFHALFKVTL-NAKSLSTELKVKNTDNKAFSFNTALHTYFRAS-VSGTSVKGLKGCKT 211
F +V L + L+ +VKNTD K F+F ALH YF S +S V+GL
Sbjct: 148 PHKFEYRLRVALGHDGDLTLTSRVKNTDTKPFNFTFALHPYFAVSNISEIHVEGLHNLDY 207
Query: 212 LNKHPDPNNPVEGTEERDVVTFPGFVDCIYLDAANELQL-DNGLGDLISIKNTNWSDAVL 270
L++ N T+ V+TF +D +YL ++L++ D+ I + DAV+
Sbjct: 208 LDQQ---KNRTRFTDHEKVITFNAQLDRLYLSTPDQLRIVDHKKKKTIVVHKEGQVDAVV 264
Query: 271 WNP------HLQMEACYKDFVCVENAKIGS-VQLEPEQTWTAVQHLSIA 312
WNP L +E YK FV VE+A + + + P + W + H+S+
Sbjct: 265 WNPWDKKVSDLGVED-YKRFVTVESAAVAKPITVNPGKEWKGILHVSVV 312
>AT3G61610.1 | Symbols: | Galactose mutarotase-like superfamily
protein | chr3:22799480-22801029 FORWARD LENGTH=317
Length = 317
Score = 122 bits (307), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 86/272 (31%), Positives = 131/272 (48%), Gaps = 20/272 (7%)
Query: 52 KLVLTSPAGSEAEIYLFGGCITSWKVPSGNDLLFVRPDAVFNKKKPISGGVPHCFPQFGP 111
+++L +P G+ A+I L GG + SW+ G +LLF A+F K + GG+ C+PQFG
Sbjct: 25 QVLLRNPHGASAKISLHGGQVISWRNELGEELLFTSNKAIFKPPKSMRGGIQICYPQFGD 84
Query: 112 -GPIQQHGFARNMDWTVAD------SESVEGNPVVTLELKDAPYSRDIWDFSFHALFKVT 164
G + QHGFARN W + + S G V L LK + W SF +V+
Sbjct: 85 CGSLDQHGFARNKIWVIDENPPPLNSNESLGKSFVDLLLKPSEDDLKQWPHSFEFRLRVS 144
Query: 165 LNAK-SLSTELKVKNTDNKAFSFNTALHTYFRASVSGTSVKGLKGCKTLNKHPDPNNPVE 223
L L+ +++N + K FSF+ A HTY SVS S ++G +TL+ + +
Sbjct: 145 LAVDGDLTLTSRIRNINGKPFSFSFAYHTYL--SVSDISEVRIEGLETLDYLDNLSQRQL 202
Query: 224 GTEERDVVTFPGFVDCIYLDAANELQ-LDNGLGDLISIKNTNWSDAVLWNPHLQMEAC-- 280
TE+ D +TF +D YL + + LD+ I D V+WNP +
Sbjct: 203 LTEQGDAITFESEMDRTYLRSPKVVAVLDHERKRTYVIGKEGLPDTVVWNPWEKKSKTMA 262
Query: 281 ------YKDFVCVENAKIGS-VQLEPEQTWTA 305
YK +CV+ A + + L+P + WT
Sbjct: 263 DFGDEEYKSMLCVDGAAVERPITLKPGEEWTG 294