Miyakogusa Predicted Gene
- Lj0g3v0294419.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0294419.1 Non Chatacterized Hit- tr|I1KD37|I1KD37_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.45995
PE,90.5,0,Nucleotide-diphospho-sugar transferases,NULL; SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL; RGP,R,CUFF.19727.1
(360 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G02230.1 | Symbols: RGP1, ATRGP1 | reversibly glycosylated po... 634 0.0
AT5G15650.1 | Symbols: RGP2, ATRGP2 | reversibly glycosylated po... 632 0.0
AT3G08900.1 | Symbols: RGP3, RGP | reversibly glycosylated polyp... 626 e-180
AT5G50750.1 | Symbols: RGP4 | reversibly glycosylated polypeptid... 578 e-165
AT5G16510.2 | Symbols: | Alpha-1,4-glucan-protein synthase fami... 360 1e-99
AT5G16510.1 | Symbols: | Alpha-1,4-glucan-protein synthase fami... 360 1e-99
>AT3G02230.1 | Symbols: RGP1, ATRGP1 | reversibly glycosylated
polypeptide 1 | chr3:415463-417304 FORWARD LENGTH=357
Length = 357
Score = 634 bits (1635), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 296/338 (87%), Positives = 317/338 (93%)
Query: 9 PVLKDELDIVIPTIRNLDFLEMWRAFFEPYHLIIVQDGDPTKTIKVPQGFDYELYNRNDI 68
P+LKDELDIVIPTIRNLDFLEMWR F +PYHLIIVQDGDP+KTI VP+GFDYELYNRNDI
Sbjct: 16 PLLKDELDIVIPTIRNLDFLEMWRPFLQPYHLIIVQDGDPSKTIAVPEGFDYELYNRNDI 75
Query: 69 NRILGPKASCISFKDSACRCFGFLMSKKKYIFTIDDDCFVAKDPSGAEINALQQHIKNLL 128
NRILGPKASCISFKDSACRCFG+++SKKKYIFTIDDDCFVAKDPSG +NAL+QHIKNLL
Sbjct: 76 NRILGPKASCISFKDSACRCFGYMVSKKKYIFTIDDDCFVAKDPSGKAVNALEQHIKNLL 135
Query: 129 TPSTPFFFNTLYDPYREGTDFVRGYPFSLREGVPTAISHGLWLNIPDYDAPTQLVKPLER 188
PSTPFFFNTLYDPYREG DFVRGYPFSLREGV TA+SHGLWLNIPDYDAPTQLVKP ER
Sbjct: 136 CPSTPFFFNTLYDPYREGADFVRGYPFSLREGVSTAVSHGLWLNIPDYDAPTQLVKPKER 195
Query: 189 NTRYVDAVMTIPKGTLFPMCGMNLAFNRELIGPAMYFGLMGDGQPLGRYDDMWAGWCMKV 248
NTRYVDAVMTIPKGTLFPMCGMNLAF+RELIGPAMYFGLMGDGQP+GRYDDMWAGWC+KV
Sbjct: 196 NTRYVDAVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDMWAGWCIKV 255
Query: 249 ISDHLGLGVKTGLPYIWHSKASNPFVNLKKEYKGIYWQEELIPFFQSVSLPKDCTTVQAC 308
I DHLGLGVKTGLPYI+HSKASNPFVNLKKEYKGI+WQE++IPFFQS L K+ TVQ C
Sbjct: 256 ICDHLGLGVKTGLPYIYHSKASNPFVNLKKEYKGIFWQEDIIPFFQSAKLTKEAVTVQQC 315
Query: 309 YVELSKQVKAKLGGVDEYFNKLADAMVTWIEAWDELNP 346
Y+ELSK VK KL +D YF+KLADAMVTWIEAWDELNP
Sbjct: 316 YMELSKLVKEKLSPIDPYFDKLADAMVTWIEAWDELNP 353
>AT5G15650.1 | Symbols: RGP2, ATRGP2 | reversibly glycosylated
polypeptide 2 | chr5:5092203-5094093 FORWARD LENGTH=360
Length = 360
Score = 632 bits (1629), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 293/346 (84%), Positives = 317/346 (91%)
Query: 8 TPVLKDELDIVIPTIRNLDFLEMWRAFFEPYHLIIVQDGDPTKTIKVPQGFDYELYNRND 67
TP+LKDELDIVIPTIRNLDFLEMWR F +PYHLIIVQDGDP+K I VP+G+DYELYNRND
Sbjct: 15 TPLLKDELDIVIPTIRNLDFLEMWRPFLQPYHLIIVQDGDPSKKIHVPEGYDYELYNRND 74
Query: 68 INRILGPKASCISFKDSACRCFGFLMSKKKYIFTIDDDCFVAKDPSGAEINALQQHIKNL 127
INRILGPKASCISFKDSACRCFG+++SKKKYIFTIDDDCFVAKDPSG +NAL+QHIKNL
Sbjct: 75 INRILGPKASCISFKDSACRCFGYMVSKKKYIFTIDDDCFVAKDPSGKAVNALEQHIKNL 134
Query: 128 LTPSTPFFFNTLYDPYREGTDFVRGYPFSLREGVPTAISHGLWLNIPDYDAPTQLVKPLE 187
L PS+PFFFNTLYDPYREG DFVRGYPFSLREGV TA+SHGLWLNIPDYDAPTQLVKP E
Sbjct: 135 LCPSSPFFFNTLYDPYREGADFVRGYPFSLREGVSTAVSHGLWLNIPDYDAPTQLVKPKE 194
Query: 188 RNTRYVDAVMTIPKGTLFPMCGMNLAFNRELIGPAMYFGLMGDGQPLGRYDDMWAGWCMK 247
RNTRYVDAVMTIPKGTLFPMCGMNLAF+R+LIGPAMYFGLMGDGQP+GRYDDMWAGWC+K
Sbjct: 195 RNTRYVDAVMTIPKGTLFPMCGMNLAFDRDLIGPAMYFGLMGDGQPIGRYDDMWAGWCIK 254
Query: 248 VISDHLGLGVKTGLPYIWHSKASNPFVNLKKEYKGIYWQEELIPFFQSVSLPKDCTTVQA 307
VI DHL LGVKTGLPYI+HSKASNPFVNLKKEYKGI+WQEE+IPFFQ+ L K+ TVQ
Sbjct: 255 VICDHLSLGVKTGLPYIYHSKASNPFVNLKKEYKGIFWQEEIIPFFQNAKLSKEAVTVQQ 314
Query: 308 CYVELSKQVKAKLGGVDEYFNKLADAMVTWIEAWDELNPSGGKSDA 353
CY+ELSK VK KL +D YF+KLADAMVTWIEAWDELNP A
Sbjct: 315 CYIELSKMVKEKLSSLDPYFDKLADAMVTWIEAWDELNPPAASGKA 360
>AT3G08900.1 | Symbols: RGP3, RGP | reversibly glycosylated
polypeptide 3 | chr3:2708347-2709714 REVERSE LENGTH=362
Length = 362
Score = 626 bits (1614), Expect = e-180, Method: Compositional matrix adjust.
Identities = 289/342 (84%), Positives = 317/342 (92%)
Query: 8 TPVLKDELDIVIPTIRNLDFLEMWRAFFEPYHLIIVQDGDPTKTIKVPQGFDYELYNRND 67
TP+LKDELDIVIPTIRNLDFLEMWR FFE YHLIIVQDGDP+K I +P GFDYELYNRND
Sbjct: 11 TPMLKDELDIVIPTIRNLDFLEMWRPFFEQYHLIIVQDGDPSKVINIPVGFDYELYNRND 70
Query: 68 INRILGPKASCISFKDSACRCFGFLMSKKKYIFTIDDDCFVAKDPSGAEINALQQHIKNL 127
INRILGPKASCISFKDSACRCFG+++SKKKYI+TIDDDCFVAKDP+G EINAL+QHIKNL
Sbjct: 71 INRILGPKASCISFKDSACRCFGYMVSKKKYIYTIDDDCFVAKDPTGKEINALEQHIKNL 130
Query: 128 LTPSTPFFFNTLYDPYREGTDFVRGYPFSLREGVPTAISHGLWLNIPDYDAPTQLVKPLE 187
L+PSTP FFNTLYDPYR+G DFVRGYPFS+REG TA+SHGLWLNIPDYDAPTQLVKPLE
Sbjct: 131 LSPSTPHFFNTLYDPYRDGADFVRGYPFSMREGAITAVSHGLWLNIPDYDAPTQLVKPLE 190
Query: 188 RNTRYVDAVMTIPKGTLFPMCGMNLAFNRELIGPAMYFGLMGDGQPLGRYDDMWAGWCMK 247
+N+RYVDAVMTIPKGTLFPMCGMNLAF+RELIGPAMYFGLMGDGQP+GRYDDMWAGWC+K
Sbjct: 191 KNSRYVDAVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDMWAGWCVK 250
Query: 248 VISDHLGLGVKTGLPYIWHSKASNPFVNLKKEYKGIYWQEELIPFFQSVSLPKDCTTVQA 307
VI DH+G GVKTGLPYIWHSKASNPFVNLKKEY GI+WQEE IPFFQSV+LPK+CT+VQ
Sbjct: 251 VICDHMGWGVKTGLPYIWHSKASNPFVNLKKEYNGIFWQEEAIPFFQSVTLPKECTSVQQ 310
Query: 308 CYVELSKQVKAKLGGVDEYFNKLADAMVTWIEAWDELNPSGG 349
CY+EL+K V+ KLG VD YF LA MVTWIEAW+ELN + G
Sbjct: 311 CYLELAKLVREKLGKVDPYFITLATGMVTWIEAWEELNSAEG 352
>AT5G50750.1 | Symbols: RGP4 | reversibly glycosylated polypeptide 4
| chr5:20641066-20642470 FORWARD LENGTH=364
Length = 364
Score = 578 bits (1489), Expect = e-165, Method: Compositional matrix adjust.
Identities = 266/350 (76%), Positives = 302/350 (86%), Gaps = 1/350 (0%)
Query: 4 ASSATPVLKDELDIVIPTIRNLDFLEMWRAFFEPYHLIIVQDGDPTKTIKVPQGFDYELY 63
A A P LKD+LDIVIPTIR+LDFLE WR F YHLIIVQDGDP+ I+VP+G+DYELY
Sbjct: 8 AIEAAP-LKDDLDIVIPTIRSLDFLEQWRPFLHHYHLIIVQDGDPSIKIRVPEGYDYELY 66
Query: 64 NRNDINRILGPKASCISFKDSACRCFGFLMSKKKYIFTIDDDCFVAKDPSGAEINALQQH 123
NRNDINRILGP+A+CIS+KD CRCFGF++SKKKYI+TIDDDCFVAKDPSG +IN + QH
Sbjct: 67 NRNDINRILGPRANCISYKDGGCRCFGFMVSKKKYIYTIDDDCFVAKDPSGKDINVIAQH 126
Query: 124 IKNLLTPSTPFFFNTLYDPYREGTDFVRGYPFSLREGVPTAISHGLWLNIPDYDAPTQLV 183
IKNL TPSTP +FNTLYDP+R+GTDFVRGYPFSLREGV TAISHGLWLNIPDYDAPTQLV
Sbjct: 127 IKNLETPSTPHYFNTLYDPFRDGTDFVRGYPFSLREGVQTAISHGLWLNIPDYDAPTQLV 186
Query: 184 KPLERNTRYVDAVMTIPKGTLFPMCGMNLAFNRELIGPAMYFGLMGDGQPLGRYDDMWAG 243
KP ERNTRYVDAVMTIPK L+PMCGMNLAFNREL+GPAMYFGLMG+GQP+ RYDDMWAG
Sbjct: 187 KPRERNTRYVDAVMTIPKRVLYPMCGMNLAFNRELVGPAMYFGLMGEGQPISRYDDMWAG 246
Query: 244 WCMKVISDHLGLGVKTGLPYIWHSKASNPFVNLKKEYKGIYWQEELIPFFQSVSLPKDCT 303
W KV+ DHLG GVKTGLPY+WHSKASNPFVNLKKE+KG++WQE+++PFFQ++ L K+
Sbjct: 247 WAAKVVCDHLGFGVKTGLPYLWHSKASNPFVNLKKEHKGLHWQEDMVPFFQNLRLSKESD 306
Query: 304 TVQACYVELSKQVKAKLGGVDEYFNKLADAMVTWIEAWDELNPSGGKSDA 353
T CY+E+S K KL VD YF KLADAMV WIEAW+ELNP K +
Sbjct: 307 TAAKCYMEISNMTKEKLTKVDPYFEKLADAMVVWIEAWEELNPPVKKKQS 356
>AT5G16510.2 | Symbols: | Alpha-1,4-glucan-protein synthase family
protein | chr5:5393296-5394342 FORWARD LENGTH=348
Length = 348
Score = 360 bits (923), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 173/349 (49%), Positives = 241/349 (69%), Gaps = 8/349 (2%)
Query: 6 SATPVLKDELDIVIPTIRNLD---FLEMWRAFFEPYHLIIVQDGDPTKTIKVPQGFDYEL 62
S + K+E+DIVI + N D FL WR FF +HLI+V+D + + + +P+GFD ++
Sbjct: 2 SLAEINKNEVDIVIGAL-NADLTQFLTSWRPFFSGFHLIVVKDPELKEELNIPEGFDVDV 60
Query: 63 YNRNDINRILGPKASCISFKDSACRCFGFLMSKKKYIFTIDDDCFVAKDPSGAEINALQQ 122
Y++ D+ +++G S + F +CR FG+L+SKKKYI +IDDDC AKDP G ++A+ Q
Sbjct: 61 YSKTDMEKVVGASNSTM-FSGYSCRYFGYLVSKKKYIVSIDDDCVPAKDPKGFLVDAVTQ 119
Query: 123 HIKNLLTPSTPFFFNTLYDPYREGTDFVRGYPFSLREGVPTAISHGLWLNIPDYDAPTQL 182
H+ NL P+TP FFNTLYDPY EG DFVRGYPFSLR GVP A S GLWLN+ D DAPTQ
Sbjct: 120 HVINLENPATPLFFNTLYDPYCEGADFVRGYPFSLRSGVPCAASCGLWLNLADLDAPTQA 179
Query: 183 VKPLERNTRYVDAVMTIPKGTLFPMCGMNLAFNRELIGPAMYFGLMGDGQPLGRY---DD 239
+K +RNT YVDAVMT+P + P+ G+N+AFNREL+GPA+ L G+ R+ +D
Sbjct: 180 LKTEKRNTAYVDAVMTVPAKAMLPISGINIAFNRELVGPALVPALRLAGEGKVRWETLED 239
Query: 240 MWAGWCMKVISDHLGLGVKTGLPYIWHSKASNPFVNLKKEYKGIYWQEELIPFFQSVSLP 299
+W G C+K ISDHLG GVKTGLPY+W ++ + +L+K+++G+ E+ +PFF S+ LP
Sbjct: 240 VWCGMCLKHISDHLGYGVKTGLPYVWRNERGDAVESLRKKWEGMKLMEKSVPFFDSLKLP 299
Query: 300 KDCTTVQACYVELSKQVKAKLGGVDEYFNKLADAMVTWIEAWDELNPSG 348
+ V+ C +EL+K VK +LG D F + ADAMV W++ W+ +N S
Sbjct: 300 ETALKVEDCVIELAKAVKEQLGSDDPAFTQAADAMVKWVQLWNSVNSSA 348
>AT5G16510.1 | Symbols: | Alpha-1,4-glucan-protein synthase family
protein | chr5:5393296-5394342 FORWARD LENGTH=348
Length = 348
Score = 360 bits (923), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 173/349 (49%), Positives = 241/349 (69%), Gaps = 8/349 (2%)
Query: 6 SATPVLKDELDIVIPTIRNLD---FLEMWRAFFEPYHLIIVQDGDPTKTIKVPQGFDYEL 62
S + K+E+DIVI + N D FL WR FF +HLI+V+D + + + +P+GFD ++
Sbjct: 2 SLAEINKNEVDIVIGAL-NADLTQFLTSWRPFFSGFHLIVVKDPELKEELNIPEGFDVDV 60
Query: 63 YNRNDINRILGPKASCISFKDSACRCFGFLMSKKKYIFTIDDDCFVAKDPSGAEINALQQ 122
Y++ D+ +++G S + F +CR FG+L+SKKKYI +IDDDC AKDP G ++A+ Q
Sbjct: 61 YSKTDMEKVVGASNSTM-FSGYSCRYFGYLVSKKKYIVSIDDDCVPAKDPKGFLVDAVTQ 119
Query: 123 HIKNLLTPSTPFFFNTLYDPYREGTDFVRGYPFSLREGVPTAISHGLWLNIPDYDAPTQL 182
H+ NL P+TP FFNTLYDPY EG DFVRGYPFSLR GVP A S GLWLN+ D DAPTQ
Sbjct: 120 HVINLENPATPLFFNTLYDPYCEGADFVRGYPFSLRSGVPCAASCGLWLNLADLDAPTQA 179
Query: 183 VKPLERNTRYVDAVMTIPKGTLFPMCGMNLAFNRELIGPAMYFGLMGDGQPLGRY---DD 239
+K +RNT YVDAVMT+P + P+ G+N+AFNREL+GPA+ L G+ R+ +D
Sbjct: 180 LKTEKRNTAYVDAVMTVPAKAMLPISGINIAFNRELVGPALVPALRLAGEGKVRWETLED 239
Query: 240 MWAGWCMKVISDHLGLGVKTGLPYIWHSKASNPFVNLKKEYKGIYWQEELIPFFQSVSLP 299
+W G C+K ISDHLG GVKTGLPY+W ++ + +L+K+++G+ E+ +PFF S+ LP
Sbjct: 240 VWCGMCLKHISDHLGYGVKTGLPYVWRNERGDAVESLRKKWEGMKLMEKSVPFFDSLKLP 299
Query: 300 KDCTTVQACYVELSKQVKAKLGGVDEYFNKLADAMVTWIEAWDELNPSG 348
+ V+ C +EL+K VK +LG D F + ADAMV W++ W+ +N S
Sbjct: 300 ETALKVEDCVIELAKAVKEQLGSDDPAFTQAADAMVKWVQLWNSVNSSA 348