Miyakogusa Predicted Gene
- Lj1g3v2534830.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2534830.1 Non Characterized Hit- tr|I1JK01|I1JK01_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.10144
PE,88.76,0,(Trans)glycosidases,Glycoside hydrolase, superfamily; no
description,Glycoside hydrolase, catalytic
,NODE_64210_length_1132_cov_116.350708.path1.1
(384 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value

AT5G01930.1 | Symbols: MAN6, AtMAN6 | Glycosyl hydrolase superfa...     560  e-160
AT5G66460.1 | Symbols: MAN7, AtMAN7 | Glycosyl hydrolase superfa...     375  e-104
AT4G28320.1 | Symbols: MAN5, AtMAN5 | Glycosyl hydrolase superfa...     371  e-103
AT2G20680.1 | Symbols: MAN2, AtMAN2 | Glycosyl hydrolase superfa...     362  e-100
AT3G10890.1 | Symbols: | Glycosyl hydrolase superfamily protein...      358  5e-99
AT3G10900.1 | Symbols: | Glycosyl hydrolase superfamily protein...      355  3e-98
AT1G02310.1 | Symbols: MAN1 | Glycosyl hydrolase superfamily pro...     330  9e-91
AT3G30540.1 | Symbols: | Glycosyl hydrolase superfamily protein...      282  2e-76
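The summary lines above can be read programmatically. The following is a minimal sketch, not part of the BLAST output: it assumes each hit line ends with the bit score and the E-value as its last two whitespace-separated fields, and that an abbreviated E-value such as "e-160" stands for 1e-160. The function name parse_hit_line is hypothetical.

```python
def parse_hit_line(line):
    """Split one summary line into (identifier, bit score, E-value).

    Assumption: the last two whitespace-separated fields are the
    bit score and the E-value, as in the hit list of this report.
    """
    fields = line.split()
    identifier = fields[0]
    score = float(fields[-2])
    evalue_text = fields[-1]
    # Legacy BLAST prints very small values without a mantissa ("e-160");
    # prepend "1" so that float() accepts the text.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return identifier, score, float(evalue_text)


hits = [
    "AT5G01930.1 | Symbols: MAN6, AtMAN6 | Glycosyl hydrolase superfa... 560 e-160",
    "AT3G10890.1 | Symbols: | Glycosyl hydrolase superfamily protein... 358 5e-99",
]
parsed = [parse_hit_line(h) for h in hits]
for identifier, score, evalue in parsed:
    print(identifier, score, evalue)
```

Splitting from the right keeps the sketch independent of how the description column is padded or truncated.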
>AT5G01930.1 | Symbols: MAN6, AtMAN6 | Glycosyl hydrolase
superfamily protein | chr5:361189-362867 REVERSE
LENGTH=448
Length = 448
Score = 560 bits (1442), Expect = e-160, Method: Compositional matrix adjust.
Identities = 266/384 (69%), Positives = 319/384 (83%), Gaps = 13/384 (3%)
Query: 1 MEKHKSFRLSIISLVLFLTLTKSLRSSAFHGSEYEDWESNHMENSILSYGSEMEADEWQM 60
M+ FR+ + S V F+ LT++ R+ A D +S E + S G E +W+M
Sbjct: 1 MKDQLGFRIVLCSAV-FIILTQN-RALA-------DLDSESHEVNSESVGEE----QWEM 47
Query: 61 VKTKGNQFVVNDQPFYVNGFNTYWLMVFAADNSTRGKVTEVFKHAASVGMTVCRTWAFND 120
V+ KG QF +N QPFYVNGFNTYW+M AADNSTRGKVTEVF+ A++VGMTV RTWAFND
Sbjct: 48 VQRKGMQFTLNGQPFYVNGFNTYWMMTLAADNSTRGKVTEVFQQASAVGMTVGRTWAFND 107
Query: 121 GEWRALQKSPSGYDEDVFQALDFVVSEAKKYKIRLILSLVNNWEAYGGKAQYVKWGTAAG 180
G+WRALQKSPS YDE+VF+ALDFV+SEA+KYKIRLILSLVNNW+AYGGKAQYVKWG A+G
Sbjct: 108 GQWRALQKSPSVYDEEVFKALDFVLSEARKYKIRLILSLVNNWDAYGGKAQYVKWGNASG 167
Query: 181 LNLTSDDDFFSHPTLRSYYKAHAKTVLNRVNTFTNITYKEDPTIFAWELMNEPRCTSDPS 240
LNLTSDDDFF++PTLR++Y++H +TVLNRVNTFTNITYK DPTIFAWELMNEPRC SDPS
Sbjct: 168 LNLTSDDDFFTNPTLRNFYQSHVRTVLNRVNTFTNITYKNDPTIFAWELMNEPRCPSDPS 227
Query: 241 GDKLQEWIKEMAFFVKSIDTKHLVEIGLEGFYGPSTPQRFQVNPNSFAQQVGTDFIRNHQ 300
GDKLQ WI+EMA FVKS+D KHLVEIGLEGFYGPS P R + NPN +A QVGTDFIRN+Q
Sbjct: 228 GDKLQSWIQEMAVFVKSLDAKHLVEIGLEGFYGPSAPARTRFNPNPYAAQVGTDFIRNNQ 287
Query: 301 VLGVDFASVHIYPDSWISQSVADNHLPFIMSWMEAHIEDAEEYLGMPVVFAEFGVSAKSP 360
VLG+DFASVH+YPDSWIS +V+++ L F SWM+AH+EDAE YLGMPV+F EFGVSA P
Sbjct: 288 VLGIDFASVHVYPDSWISPAVSNSFLEFTSSWMQAHVEDAEMYLGMPVLFTEFGVSAHDP 347
Query: 361 GYNSTYRNNLINTVYKTILNSTKK 384
G+N+++R+ ++NTVYK LNST+K
Sbjct: 348 GFNTSFRDMMLNTVYKMTLNSTRK 371
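The "Identities / Positives / Gaps" line of each alignment can be checked against its own fractions. This is a minimal sketch, not part of the BLAST output; the names parse_stats and floor_percent are hypothetical. It assumes the printed percentages are truncated rather than rounded, which is what the figures in this report suggest (for example, 226/324 is printed as 69%).

```python
import re

# Matches e.g. "Identities = 266/384 (69%)"
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, printed_percent)}."""
    result = {}
    for name, num, den, pct in STAT.findall(line):
        result[name] = (int(num), int(den), int(pct))
    return result


def floor_percent(num, den):
    """Integer percentage with truncation (assumed BLAST behaviour)."""
    return 100 * num // den


line = "Identities = 266/384 (69%), Positives = 319/384 (83%), Gaps = 13/384 (3%)"
stats = parse_stats(line)
for name, (num, den, printed) in stats.items():
    # Compare the recomputed percentage with the one printed in the report.
    print(name, num, den, printed, floor_percent(num, den) == printed)
```

The denominator is the alignment length including gap columns, so it can exceed the aligned query span in hits that contain gaps.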
>AT5G66460.1 | Symbols: MAN7, AtMAN7 | Glycosyl hydrolase
superfamily protein | chr5:26538911-26540837 REVERSE
LENGTH=431
Length = 431
Score = 375 bits (963), Expect = e-104, Method: Compositional matrix adjust.
Identities = 170/324 (52%), Positives = 226/324 (69%), Gaps = 3/324 (0%)
Query: 61 VKTKGNQFVVNDQPFYVNGFNTYWLMVFAADNSTRGKVTEVFKHAASVGMTVCRTWAFND 120
V+TKG QF +N P+Y NGFN YWLM A+D S R K++ F+ A+ G+TV RTWAF+D
Sbjct: 32 VRTKGVQFSLNGYPYYANGFNAYWLMYVASDPSQRSKISTAFQDASRHGLTVARTWAFSD 91
Query: 121 GEWRALQKSPSGYDEDVFQALDFVVSEAKKYKIRLILSLVNNWEAYGGKAQYVKWGTAAG 180
G +RALQ SP Y+ED+FQ LDF ++EA+++ I++ILS NN+E++GG+ QYV W + G
Sbjct: 92 GGYRALQYSPGSYNEDMFQGLDFALAEARRHGIKIILSFANNYESFGGRKQYVDWARSRG 151
Query: 181 LNLTSDDDFFSHPTLRSYYKAHAKTVLNRVNTFTNITYKEDPTIFAWELMNEPRCTSDPS 240
++S+DDFF+ ++ +YK H K VLNR NTFT + YK+DPTI AWELMNEPRC SDPS
Sbjct: 152 RPVSSEDDFFTDSLVKDFYKNHIKAVLNRFNTFTKVHYKDDPTIMAWELMNEPRCPSDPS 211
Query: 241 GDKLQEWIKEMAFFVKSIDTKHLVEIGLEGFYGPSTPQRFQVNPNSFAQQVGTDFIRNHQ 300
G +Q WI EMA VKS+D HL+E GLEGFYG S+PQ +NP Q GTDFI N++
Sbjct: 212 GRAIQAWITEMAAHVKSLDRNHLLEAGLEGFYGQSSPQSKTLNPPG---QFGTDFIANNR 268
Query: 301 VLGVDFASVHIYPDSWISQSVADNHLPFIMSWMEAHIEDAEEYLGMPVVFAEFGVSAKSP 360
+ G+DF +VH YPD W S + + F+ W++AHI+DA+ L P++ AEFG S K P
Sbjct: 269 IPGIDFVTVHSYPDEWFPDSSEQSQMDFLNKWLDAHIQDAQNVLHKPIILAEFGKSMKKP 328
Query: 361 GYNSTYRNNLINTVYKTILNSTKK 384
GY R+ + NTVY I S K+
Sbjct: 329 GYTPAQRDIVFNTVYSKIYGSAKR 352
>AT4G28320.1 | Symbols: MAN5, AtMAN5 | Glycosyl hydrolase
superfamily protein | chr4:14018293-14019972 REVERSE
LENGTH=431
Length = 431
Score = 371 bits (952), Expect = e-103, Method: Compositional matrix adjust.
Identities = 169/325 (52%), Positives = 228/325 (70%), Gaps = 1/325 (0%)
Query: 61 VKTKGNQFVVNDQPFYVNGFNTYWLMVFAADNSTRGKVTEVFKHAASVGMTVCRTWAFND 120
VK G QFVV+D+P YVNG+N+YW M A D +R V E+ + A +G+TVCRTWAFND
Sbjct: 42 VKRNGTQFVVDDKPLYVNGWNSYWFMDHAVDEHSRNLVGEMLEAGAKMGLTVCRTWAFND 101
Query: 121 GEWRALQKSPSGYDEDVFQALDFVVSEAKKYKIRLILSLVNNWEAYGGKAQYVKWGTAAG 180
G + ALQ SP +DE VFQALD V++EA+K+ +RL+LSLVNN +AYGGK QYVKW G
Sbjct: 102 GGYNALQISPGRFDERVFQALDHVIAEARKHDVRLLLSLVNNLQAYGGKTQYVKWAWQEG 161
Query: 181 LNLTSDDD-FFSHPTLRSYYKAHAKTVLNRVNTFTNITYKEDPTIFAWELMNEPRCTSDP 239
+ L+S +D FF P++R+Y+K + K +L R N+ T I Y+ DPTIFAWEL+NEPRCT+D
Sbjct: 162 VGLSSSNDSFFFDPSIRNYFKNYLKVLLTRKNSVTGIEYRNDPTIFAWELINEPRCTTDV 221
Query: 240 SGDKLQEWIKEMAFFVKSIDTKHLVEIGLEGFYGPSTPQRFQVNPNSFAQQVGTDFIRNH 299
SG LQ+WI EM F+KSID KHL+ +GLEGFYGP++P+ VNP +A Q+GTDF++N
Sbjct: 222 SGKTLQDWIDEMTGFIKSIDDKHLLTVGLEGFYGPNSPKGLTVNPEQWASQLGTDFVQNS 281
Query: 300 QVLGVDFASVHIYPDSWISQSVADNHLPFIMSWMEAHIEDAEEYLGMPVVFAEFGVSAKS 359
+DFASVHIYPD W + L F++ WM++HIED + L PV+F EFG+S ++
Sbjct: 282 NSSNIDFASVHIYPDHWFHNQTFEEKLKFVVKWMQSHIEDGLKELKKPVLFTEFGLSNQN 341
Query: 360 PGYNSTYRNNLINTVYKTILNSTKK 384
Y + R+ ++ + S K+
Sbjct: 342 KDYEPSQRDKFYRIIFDVVYKSAKR 366
>AT2G20680.1 | Symbols: MAN2, AtMAN2 | Glycosyl hydrolase
superfamily protein | chr2:8921024-8923066 FORWARD
LENGTH=433
Length = 433
Score = 362 bits (930), Expect = e-100, Method: Compositional matrix adjust.
Identities = 169/339 (49%), Positives = 232/339 (68%), Gaps = 2/339 (0%)
Query: 47 LSYGSEMEADEWQMVKTKGNQFVVNDQPFYVNGFNTYWLMVFAADNSTRGKVTEVFKHAA 106
L +G + E E VK G QFVV+ + YVNG+N+YW M A ++ +R +V+ + + A
Sbjct: 30 LWFGLKTEG-ELAFVKRNGTQFVVDGKALYVNGWNSYWFMDHAVNDHSRHRVSAMLEAGA 88
Query: 107 SVGMTVCRTWAFNDGEWRALQKSPSGYDEDVFQALDFVVSEAKKYKIRLILSLVNNWEAY 166
+G+TVCRTWAFNDG + ALQ SP +DE VF+ALD V++EAK + +RL+LSLVNN +AY
Sbjct: 89 KMGLTVCRTWAFNDGGYNALQISPGRFDERVFKALDHVIAEAKTHGVRLLLSLVNNLQAY 148
Query: 167 GGKAQYVKWGTAAGLNLTSDDD-FFSHPTLRSYYKAHAKTVLNRVNTFTNITYKEDPTIF 225
GGK QYV W G+ L+S +D FF P++R Y+K + +L R N+ T I Y+ DPTIF
Sbjct: 149 GGKTQYVNWAWQEGVGLSSSNDSFFFDPSIRRYFKNYLTVLLTRKNSLTGIEYRNDPTIF 208
Query: 226 AWELMNEPRCTSDPSGDKLQEWIKEMAFFVKSIDTKHLVEIGLEGFYGPSTPQRFQVNPN 285
AWEL+NEPRC SD SGD LQ+WI EM F+KSID KHL+ +GLEGFYGPS+P++ VNP
Sbjct: 209 AWELINEPRCMSDVSGDTLQDWINEMTAFIKSIDNKHLLTVGLEGFYGPSSPKKLTVNPE 268
Query: 286 SFAQQVGTDFIRNHQVLGVDFASVHIYPDSWISQSVADNHLPFIMSWMEAHIEDAEEYLG 345
+A ++G+DF+RN +DFASVHIYPD W + L F++ WM +HIED ++ L
Sbjct: 269 RWASELGSDFVRNSDSPNIDFASVHIYPDHWFHDQGFEEKLKFVVKWMLSHIEDGDKELK 328
Query: 346 MPVVFAEFGVSAKSPGYNSTYRNNLINTVYKTILNSTKK 384
PV+F EFG+S + Y+ + R+ T++ I S K+
Sbjct: 329 KPVLFTEFGLSNLNKDYDPSQRDRFYRTIFDVIYKSAKR 367
>AT3G10890.1 | Symbols: | Glycosyl hydrolase superfamily protein |
chr3:3407455-3409000 REVERSE LENGTH=414
Length = 414
Score = 358 bits (918), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 160/325 (49%), Positives = 221/325 (68%), Gaps = 2/325 (0%)
Query: 61 VKTKGNQFVVNDQPFYVNGFNTYWLMVFAADNSTRGKVTEVFKHAASVGMTVCRTWAFND 120
V KG QF++N +PFY NGFN YWL A D++TR K+T VF++A +T+ RTW F D
Sbjct: 32 VSRKGVQFILNGKPFYANGFNAYWLAYEATDSTTRFKITYVFQNATIHDLTIVRTWGFRD 91
Query: 121 GEWRALQKSPSGYDEDVFQALDFVVSEAKKYKIRLILSLVNNWEAYGGKAQYVKWGTAAG 180
G +RALQ +P YDE FQ LDF ++EAK+ I++I++ VNN+ +GG+ QYV W G
Sbjct: 92 GGYRALQIAPGVYDEKTFQGLDFAIAEAKRLGIKMIITFVNNYSDFGGRKQYVDWAKNTG 151
Query: 181 LNLTSDDDFFSHPTLRSYYKAHAKTVLNRVNTFTNITYKEDPTIFAWELMNEPRCTSDPS 240
N++SDDDF+++P ++ YYK H KT++NRVNTFT + YK++PTI WELMNEP+C +DPS
Sbjct: 152 QNVSSDDDFYTNPLVKQYYKNHVKTMVNRVNTFTKVEYKDEPTIMGWELMNEPQCRADPS 211
Query: 241 GDKLQEWIKEMAFFVKSIDTKHLVEIGLEGFYGPSTPQR-FQVNPNSFAQQVGTDFIRNH 299
G L W+ EMA +VKS+D+KHL+ GLEGFYG S+PQR +NP + A +GTDFI NH
Sbjct: 212 GKTLTAWMNEMALYVKSVDSKHLLSTGLEGFYGDSSPQRKTSLNPVA-ANVLGTDFIANH 270
Query: 300 QVLGVDFASVHIYPDSWISQSVADNHLPFIMSWMEAHIEDAEEYLGMPVVFAEFGVSAKS 359
++ +DFAS+H YPD W + L + W+E H+EDA+ L P++ EFG +
Sbjct: 271 KLDAIDFASIHSYPDLWFPNLDEKSRLNLLRKWLECHLEDAQNILKKPLILGEFGKPTNT 330
Query: 360 PGYNSTYRNNLINTVYKTILNSTKK 384
PGY R+ + N + TI S +K
Sbjct: 331 PGYTQAQRDAVFNATFDTIYESAEK 355
>AT3G10900.1 | Symbols: | Glycosyl hydrolase superfamily protein |
chr3:3410252-3412070 REVERSE LENGTH=408
Length = 408
Score = 355 bits (911), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 167/326 (51%), Positives = 219/326 (67%), Gaps = 3/326 (0%)
Query: 61 VKTKGNQFVVNDQPFYVNGFNTYWLMVFAADNSTRGKVTEVFKHAASVGMTVCRTWAFND 120
V G QF++N +PFY NGFN YWL A D +TR K+T VF++A S+G+T+ RTW F D
Sbjct: 30 VSRNGVQFILNGKPFYANGFNAYWLAYEATDPTTRFKITNVFQNATSLGLTIARTWGFRD 89
Query: 121 GE-WRALQKSPSGYDEDVFQALDFVVSEAKKYKIRLILSLVNNWEAYGGKAQYVKWGTAA 179
G +RALQ +P YDE FQ LDFV++EAK+ I+LI+ LVNNW+ YGGK QYV W +
Sbjct: 90 GAIYRALQTAPGSYDEQTFQGLDFVIAEAKRIGIKLIILLVNNWDDYGGKKQYVDWARSK 149
Query: 180 GLNLTSDDDFFSHPTLRSYYKAHAKTVLNRVNTFTNITYKEDPTIFAWELMNEPRCTSDP 239
G ++S+DDF+ +P ++ +YK H KTVLNRVNTFT + YK++P I AW+LMNEPRC D
Sbjct: 150 GEVVSSNDDFYRNPVIKDFYKNHVKTVLNRVNTFTKVAYKDEPAIMAWQLMNEPRCGVDK 209
Query: 240 SGDKLQEWIKEMAFFVKSIDTKHLVEIGLEGFYGPSTPQRFQ-VNPNSFAQQVGTDFIRN 298
SG L +WI EMA FVKS+D HL+ G EGFYG S+P+R +NP S A VG DFI N
Sbjct: 210 SGKTLMDWINEMAPFVKSVDPNHLLSTGHEGFYGDSSPERKNSLNPVS-ANTVGADFIAN 268
Query: 299 HQVLGVDFASVHIYPDSWISQSVADNHLPFIMSWMEAHIEDAEEYLGMPVVFAEFGVSAK 358
H + +DFAS+H D W + ++ L FI W+E HIEDA+ L PV+ AEFG+ +
Sbjct: 269 HNIDAIDFASMHCGSDLWFQRLDQNSRLAFIKRWLEGHIEDAQNILKKPVILAEFGLGSD 328
Query: 359 SPGYNSTYRNNLINTVYKTILNSTKK 384
+P Y R+ + T Y I S +K
Sbjct: 329 TPRYTLANRDGVFTTTYDIIYASAQK 354
>AT1G02310.1 | Symbols: MAN1 | Glycosyl hydrolase superfamily
protein | chr1:458243-460652 REVERSE LENGTH=411
Length = 411
Score = 330 bits (846), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 159/327 (48%), Positives = 218/327 (66%), Gaps = 10/327 (3%)
Query: 61 VKTKGNQFVVNDQPFYVNGFNTYWLMVFAADNSTRGK--VTEVFKHAASVGMTVCRTWAF 118
V G QFV+N + Y+NGFN YW+M AAD +++G+ VT + A++VGM V R W F
Sbjct: 30 VGRNGTQFVLNGEQVYLNGFNAYWMMTTAADTASKGRATVTTALRQASAVGMNVARIWGF 89
Query: 119 NDGEWRALQKSPSGYDEDVFQALDFVVSEAKKYKIRLILSLVNNWEAYGGKAQYVKWGTA 178
N+G++ LQ SP Y EDVF+ LDFVV EA ++ I+LI+SLVNN+E YGG+ +YV+W
Sbjct: 90 NEGDYIPLQISPGSYSEDVFKGLDFVVYEAGRFNIKLIISLVNNFEDYGGRKKYVEW--- 146
Query: 179 AGLNLTSDDDFFSHPTLRSYYKAHAKTVLNRVNTFTNITYKEDPTIFAWELMNEPRCTSD 238
AGL+ D+F+++ ++ +YK H KTVL R NT T YK+DPTIF+WEL+NEPRC
Sbjct: 147 AGLD--EPDEFYTNSAVKQFYKNHVKTVLTRKNTITGRMYKDDPTIFSWELINEPRCNDS 204
Query: 239 PSGDKLQEWIKEMAFFVKSIDTKHLVEIGLEGFYGPSTPQRFQVNPNSFAQQVGTDFIRN 298
+ + LQ+W+KEMA +VKSID+ HL+EIGLEGFYG S P+R NP GTDFI N
Sbjct: 205 TASNILQDWVKEMASYVKSIDSNHLLEIGLEGFYGESIPERTVYNPGGRV-LTGTDFITN 263
Query: 299 HQVLGVDFASVHIYPDSW--ISQSVADNHLPFIMSWMEAHIEDAEEYLGMPVVFAEFGVS 356
+Q+ +DFA++HIYPDSW + S F+ W+ AHIED + + P++ EFG S
Sbjct: 264 NQIPDIDFATIHIYPDSWLPLQSSRTGEQDTFVDRWIGAHIEDCDNIIKKPLLITEFGKS 323
Query: 357 AKSPGYNSTYRNNLINTVYKTILNSTK 383
+K PG++ RN VY I +S +
Sbjct: 324 SKYPGFSLEKRNKFFQRVYDVIYDSAR 350
>AT3G30540.1 | Symbols: | Glycosyl hydrolase superfamily protein |
chr3:12144792-12146475 REVERSE LENGTH=329
Length = 329
Score = 282 bits (722), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 188/319 (58%), Gaps = 39/319 (12%)
Query: 61 VKTKGNQFVVNDQPFYVNGFNTYWLMVFAADNSTRGKVTEVFKHAASVGMTVCRTWAFND 120
V G QF++N +PFY NGFN YWL A D +TR K+T VF++A S+
Sbjct: 30 VSRNGVQFILNGKPFYANGFNAYWLAYEATDPATRFKITNVFQNATSL------------ 77
Query: 121 GEWRALQKSPSGYDEDVFQALDFVVSEAKKYKIRLILSLVNNWEAYGGKAQYVKWGTAAG 180
+EAK+ I+LI+ LVNNW+ YGGK QYV W + G
Sbjct: 78 -------------------------AEAKRVGIKLIIPLVNNWDDYGGKKQYVDWARSKG 112
Query: 181 LNLTSDDDFFSHPTLRSYYKAHAKTVLNRVNTFTNITYKEDPTIFAWELMNEPRCTSDPS 240
++S+DDF+ +P ++ +YK H KT+LNRVNTFT + YK++P AW+LMNEPRC D S
Sbjct: 113 EMVSSNDDFYRNPVIKEFYKNHVKTMLNRVNTFTKVAYKDEPASMAWQLMNEPRCGVDRS 172
Query: 241 GDKLQEWIKEMAFFVKSIDTKHLVEIGLEGFYGPSTPQRFQ-VNPNSFAQQVGTDFIRNH 299
G L WI EMA FVKS+D HL+ G EGFYG S+P+R +NP S A VG DFI NH
Sbjct: 173 GKTLMAWINEMALFVKSVDPNHLLSTGHEGFYGDSSPERKNSLNPVS-ANTVGADFIANH 231
Query: 300 QVLGVDFASVHIYPDSWISQSVADNHLPFIMSWMEAHIEDAEEYLGMPVVFAEFGVSAKS 359
+ +DFAS+H D W + ++ L FI W+E HIEDA+ L PV+ AEFG+ + +
Sbjct: 232 NIDAIDFASMHCGSDLWFQRLDQNSRLAFIKRWLEGHIEDAQNNLKKPVILAEFGLGSDT 291
Query: 360 PGYNSTYRNNLINTVYKTI 378
P Y R+++ T Y I
Sbjct: 292 PRYTLANRDDVFTTTYDII 310