Miyakogusa Predicted Gene
- Lj0g3v0228359.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0228359.1 CUFF.14925.1
(355 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value
AT5G15650.1 | Symbols: RGP2, ATRGP2 | reversibly glycosylated po...   663   0.0
AT3G02230.1 | Symbols: RGP1, ATRGP1 | reversibly glycosylated po...   660   0.0
AT3G08900.1 | Symbols: RGP3, RGP | reversibly glycosylated polyp...   637   0.0
AT5G50750.1 | Symbols: RGP4 | reversibly glycosylated polypeptid...   582   e-166
AT5G16510.2 | Symbols: | Alpha-1,4-glucan-protein synthase fami...    356   1e-98
AT5G16510.1 | Symbols: | Alpha-1,4-glucan-protein synthase fami...    356   1e-98
>AT5G15650.1 | Symbols: RGP2, ATRGP2 | reversibly glycosylated
polypeptide 2 | chr5:5092203-5094093 FORWARD LENGTH=360
Length = 360
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/351 (88%), Positives = 332/351 (94%), Gaps = 2/351 (0%)
Query: 5 VSPTPLLKDELDIVIPTIRNLDFLEMWRPFFQPYHMIIVQDGDPSKTIHVPDGFDYELYN 64
V+PTPLLKDELDIVIPTIRNLDFLEMWRPF QPYH+IIVQDGDPSK IHVP+G+DYELYN
Sbjct: 12 VNPTPLLKDELDIVIPTIRNLDFLEMWRPFLQPYHLIIVQDGDPSKKIHVPEGYDYELYN 71
Query: 65 RNDINRILGPKANCISFKDSACRCFGYMVSKKKYIYTIDDDCFVANDPSGKKINALEQHI 124
RNDINRILGPKA+CISFKDSACRCFGYMVSKKKYI+TIDDDCFVA DPSGK +NALEQHI
Sbjct: 72 RNDINRILGPKASCISFKDSACRCFGYMVSKKKYIFTIDDDCFVAKDPSGKAVNALEQHI 131
Query: 125 KNLLCPSTPYFFNTLYEPYREGADFVRGYPFSLREGVPTAASHGLWLNIPDYDAPTQLVK 184
KNLLCPS+P+FFNTLY+PYREGADFVRGYPFSLREGV TA SHGLWLNIPDYDAPTQLVK
Sbjct: 132 KNLLCPSSPFFFNTLYDPYREGADFVRGYPFSLREGVSTAVSHGLWLNIPDYDAPTQLVK 191
Query: 185 PLERNTRYVDMVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDMWAGW 244
P ERNTRYVD VMTIPKGTLFPMCGMNLAFDR+LIGPAMYFGLMGDGQPIGRYDDMWAGW
Sbjct: 192 PKERNTRYVDAVMTIPKGTLFPMCGMNLAFDRDLIGPAMYFGLMGDGQPIGRYDDMWAGW 251
Query: 245 CCKVICDHLGLGIKTGLPYIYHSKASNPFVNLRKEYKGIFWQEDIIPFFQSAVLPKEATT 304
C KVICDHL LG+KTGLPYIYHSKASNPFVNL+KEYKGIFWQE+IIPFFQ+A L KEA T
Sbjct: 252 CIKVICDHLSLGVKTGLPYIYHSKASNPFVNLKKEYKGIFWQEEIIPFFQNAKLSKEAVT 311
Query: 305 VQKCYIELAKQVKEKLTKIDPYFDKLADAMVTWIEAWDELNPAGAGANGKA 355
VQ+CYIEL+K VKEKL+ +DPYFDKLADAMVTWIEAWDELNP A+GKA
Sbjct: 312 VQQCYIELSKMVKEKLSSLDPYFDKLADAMVTWIEAWDELNP--PAASGKA 360
>AT3G02230.1 | Symbols: RGP1, ATRGP1 | reversibly glycosylated
polypeptide 1 | chr3:415463-417304 FORWARD LENGTH=357
Length = 357
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/342 (91%), Positives = 326/342 (95%)
Query: 5 VSPTPLLKDELDIVIPTIRNLDFLEMWRPFFQPYHMIIVQDGDPSKTIHVPDGFDYELYN 64
V+ PLLKDELDIVIPTIRNLDFLEMWRPF QPYH+IIVQDGDPSKTI VP+GFDYELYN
Sbjct: 12 VNHIPLLKDELDIVIPTIRNLDFLEMWRPFLQPYHLIIVQDGDPSKTIAVPEGFDYELYN 71
Query: 65 RNDINRILGPKANCISFKDSACRCFGYMVSKKKYIYTIDDDCFVANDPSGKKINALEQHI 124
RNDINRILGPKA+CISFKDSACRCFGYMVSKKKYI+TIDDDCFVA DPSGK +NALEQHI
Sbjct: 72 RNDINRILGPKASCISFKDSACRCFGYMVSKKKYIFTIDDDCFVAKDPSGKAVNALEQHI 131
Query: 125 KNLLCPSTPYFFNTLYEPYREGADFVRGYPFSLREGVPTAASHGLWLNIPDYDAPTQLVK 184
KNLLCPSTP+FFNTLY+PYREGADFVRGYPFSLREGV TA SHGLWLNIPDYDAPTQLVK
Sbjct: 132 KNLLCPSTPFFFNTLYDPYREGADFVRGYPFSLREGVSTAVSHGLWLNIPDYDAPTQLVK 191
Query: 185 PLERNTRYVDMVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDMWAGW 244
P ERNTRYVD VMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDMWAGW
Sbjct: 192 PKERNTRYVDAVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDMWAGW 251
Query: 245 CCKVICDHLGLGIKTGLPYIYHSKASNPFVNLRKEYKGIFWQEDIIPFFQSAVLPKEATT 304
C KVICDHLGLG+KTGLPYIYHSKASNPFVNL+KEYKGIFWQEDIIPFFQSA L KEA T
Sbjct: 252 CIKVICDHLGLGVKTGLPYIYHSKASNPFVNLKKEYKGIFWQEDIIPFFQSAKLTKEAVT 311
Query: 305 VQKCYIELAKQVKEKLTKIDPYFDKLADAMVTWIEAWDELNP 346
VQ+CY+EL+K VKEKL+ IDPYFDKLADAMVTWIEAWDELNP
Sbjct: 312 VQQCYMELSKLVKEKLSPIDPYFDKLADAMVTWIEAWDELNP 353
>AT3G08900.1 | Symbols: RGP3, RGP | reversibly glycosylated
polypeptide 3 | chr3:2708347-2709714 REVERSE LENGTH=362
Length = 362
Score = 637 bits (1643), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 296/347 (85%), Positives = 322/347 (92%)
Query: 1 MASSVSPTPLLKDELDIVIPTIRNLDFLEMWRPFFQPYHMIIVQDGDPSKTIHVPDGFDY 60
+ SSV PTP+LKDELDIVIPTIRNLDFLEMWRPFF+ YH+IIVQDGDPSK I++P GFDY
Sbjct: 4 LYSSVKPTPMLKDELDIVIPTIRNLDFLEMWRPFFEQYHLIIVQDGDPSKVINIPVGFDY 63
Query: 61 ELYNRNDINRILGPKANCISFKDSACRCFGYMVSKKKYIYTIDDDCFVANDPSGKKINAL 120
ELYNRNDINRILGPKA+CISFKDSACRCFGYMVSKKKYIYTIDDDCFVA DP+GK+INAL
Sbjct: 64 ELYNRNDINRILGPKASCISFKDSACRCFGYMVSKKKYIYTIDDDCFVAKDPTGKEINAL 123
Query: 121 EQHIKNLLCPSTPYFFNTLYEPYREGADFVRGYPFSLREGVPTAASHGLWLNIPDYDAPT 180
EQHIKNLL PSTP+FFNTLY+PYR+GADFVRGYPFS+REG TA SHGLWLNIPDYDAPT
Sbjct: 124 EQHIKNLLSPSTPHFFNTLYDPYRDGADFVRGYPFSMREGAITAVSHGLWLNIPDYDAPT 183
Query: 181 QLVKPLERNTRYVDMVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDM 240
QLVKPLE+N+RYVD VMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDM
Sbjct: 184 QLVKPLEKNSRYVDAVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDM 243
Query: 241 WAGWCCKVICDHLGLGIKTGLPYIYHSKASNPFVNLRKEYKGIFWQEDIIPFFQSAVLPK 300
WAGWC KVICDH+G G+KTGLPYI+HSKASNPFVNL+KEY GIFWQE+ IPFFQS LPK
Sbjct: 244 WAGWCVKVICDHMGWGVKTGLPYIWHSKASNPFVNLKKEYNGIFWQEEAIPFFQSVTLPK 303
Query: 301 EATTVQKCYIELAKQVKEKLTKIDPYFDKLADAMVTWIEAWDELNPA 347
E T+VQ+CY+ELAK V+EKL K+DPYF LA MVTWIEAW+ELN A
Sbjct: 304 ECTSVQQCYLELAKLVREKLGKVDPYFITLATGMVTWIEAWEELNSA 350
>AT5G50750.1 | Symbols: RGP4 | reversibly glycosylated polypeptide 4
| chr5:20641066-20642470 FORWARD LENGTH=364
Length = 364
Score = 582 bits (1500), Expect = e-166, Method: Compositional matrix adjust.
Identities = 264/336 (78%), Positives = 299/336 (88%)
Query: 11 LKDELDIVIPTIRNLDFLEMWRPFFQPYHMIIVQDGDPSKTIHVPDGFDYELYNRNDINR 70
LKD+LDIVIPTIR+LDFLE WRPF YH+IIVQDGDPS I VP+G+DYELYNRNDINR
Sbjct: 14 LKDDLDIVIPTIRSLDFLEQWRPFLHHYHLIIVQDGDPSIKIRVPEGYDYELYNRNDINR 73
Query: 71 ILGPKANCISFKDSACRCFGYMVSKKKYIYTIDDDCFVANDPSGKKINALEQHIKNLLCP 130
ILGP+ANCIS+KD CRCFG+MVSKKKYIYTIDDDCFVA DPSGK IN + QHIKNL P
Sbjct: 74 ILGPRANCISYKDGGCRCFGFMVSKKKYIYTIDDDCFVAKDPSGKDINVIAQHIKNLETP 133
Query: 131 STPYFFNTLYEPYREGADFVRGYPFSLREGVPTAASHGLWLNIPDYDAPTQLVKPLERNT 190
STP++FNTLY+P+R+G DFVRGYPFSLREGV TA SHGLWLNIPDYDAPTQLVKP ERNT
Sbjct: 134 STPHYFNTLYDPFRDGTDFVRGYPFSLREGVQTAISHGLWLNIPDYDAPTQLVKPRERNT 193
Query: 191 RYVDMVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRYDDMWAGWCCKVIC 250
RYVD VMTIPK L+PMCGMNLAF+REL+GPAMYFGLMG+GQPI RYDDMWAGW KV+C
Sbjct: 194 RYVDAVMTIPKRVLYPMCGMNLAFNRELVGPAMYFGLMGEGQPISRYDDMWAGWAAKVVC 253
Query: 251 DHLGLGIKTGLPYIYHSKASNPFVNLRKEYKGIFWQEDIIPFFQSAVLPKEATTVQKCYI 310
DHLG G+KTGLPY++HSKASNPFVNL+KE+KG+ WQED++PFFQ+ L KE+ T KCY+
Sbjct: 254 DHLGFGVKTGLPYLWHSKASNPFVNLKKEHKGLHWQEDMVPFFQNLRLSKESDTAAKCYM 313
Query: 311 ELAKQVKEKLTKIDPYFDKLADAMVTWIEAWDELNP 346
E++ KEKLTK+DPYF+KLADAMV WIEAW+ELNP
Sbjct: 314 EISNMTKEKLTKVDPYFEKLADAMVVWIEAWEELNP 349
>AT5G16510.2 | Symbols: | Alpha-1,4-glucan-protein synthase family
protein | chr5:5393296-5394342 FORWARD LENGTH=348
Length = 348
Score = 356 bits (914), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 170/342 (49%), Positives = 237/342 (69%), Gaps = 6/342 (1%)
Query: 12 KDELDIVIPTIRN--LDFLEMWRPFFQPYHMIIVQDGDPSKTIHVPDGFDYELYNRNDIN 69
K+E+DIVI + FL WRPFF +H+I+V+D + + +++P+GFD ++Y++ D+
Sbjct: 8 KNEVDIVIGALNADLTQFLTSWRPFFSGFHLIVVKDPELKEELNIPEGFDVDVYSKTDME 67
Query: 70 RILGPKANCISFKDSACRCFGYMVSKKKYIYTIDDDCFVANDPSGKKINALEQHIKNLLC 129
+++G +N F +CR FGY+VSKKKYI +IDDDC A DP G ++A+ QH+ NL
Sbjct: 68 KVVGA-SNSTMFSGYSCRYFGYLVSKKKYIVSIDDDCVPAKDPKGFLVDAVTQHVINLEN 126
Query: 130 PSTPYFFNTLYEPYREGADFVRGYPFSLREGVPTAASHGLWLNIPDYDAPTQLVKPLERN 189
P+TP FFNTLY+PY EGADFVRGYPFSLR GVP AAS GLWLN+ D DAPTQ +K +RN
Sbjct: 127 PATPLFFNTLYDPYCEGADFVRGYPFSLRSGVPCAASCGLWLNLADLDAPTQALKTEKRN 186
Query: 190 TRYVDMVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRY---DDMWAGWCC 246
T YVD VMT+P + P+ G+N+AF+REL+GPA+ L G+ R+ +D+W G C
Sbjct: 187 TAYVDAVMTVPAKAMLPISGINIAFNRELVGPALVPALRLAGEGKVRWETLEDVWCGMCL 246
Query: 247 KVICDHLGLGIKTGLPYIYHSKASNPFVNLRKEYKGIFWQEDIIPFFQSAVLPKEATTVQ 306
K I DHLG G+KTGLPY++ ++ + +LRK+++G+ E +PFF S LP+ A V+
Sbjct: 247 KHISDHLGYGVKTGLPYVWRNERGDAVESLRKKWEGMKLMEKSVPFFDSLKLPETALKVE 306
Query: 307 KCYIELAKQVKEKLTKIDPYFDKLADAMVTWIEAWDELNPAG 348
C IELAK VKE+L DP F + ADAMV W++ W+ +N +
Sbjct: 307 DCVIELAKAVKEQLGSDDPAFTQAADAMVKWVQLWNSVNSSA 348
>AT5G16510.1 | Symbols: | Alpha-1,4-glucan-protein synthase family
protein | chr5:5393296-5394342 FORWARD LENGTH=348
Length = 348
Score = 356 bits (914), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 170/342 (49%), Positives = 237/342 (69%), Gaps = 6/342 (1%)
Query: 12 KDELDIVIPTIRN--LDFLEMWRPFFQPYHMIIVQDGDPSKTIHVPDGFDYELYNRNDIN 69
K+E+DIVI + FL WRPFF +H+I+V+D + + +++P+GFD ++Y++ D+
Sbjct: 8 KNEVDIVIGALNADLTQFLTSWRPFFSGFHLIVVKDPELKEELNIPEGFDVDVYSKTDME 67
Query: 70 RILGPKANCISFKDSACRCFGYMVSKKKYIYTIDDDCFVANDPSGKKINALEQHIKNLLC 129
+++G +N F +CR FGY+VSKKKYI +IDDDC A DP G ++A+ QH+ NL
Sbjct: 68 KVVGA-SNSTMFSGYSCRYFGYLVSKKKYIVSIDDDCVPAKDPKGFLVDAVTQHVINLEN 126
Query: 130 PSTPYFFNTLYEPYREGADFVRGYPFSLREGVPTAASHGLWLNIPDYDAPTQLVKPLERN 189
P+TP FFNTLY+PY EGADFVRGYPFSLR GVP AAS GLWLN+ D DAPTQ +K +RN
Sbjct: 127 PATPLFFNTLYDPYCEGADFVRGYPFSLRSGVPCAASCGLWLNLADLDAPTQALKTEKRN 186
Query: 190 TRYVDMVMTIPKGTLFPMCGMNLAFDRELIGPAMYFGLMGDGQPIGRY---DDMWAGWCC 246
T YVD VMT+P + P+ G+N+AF+REL+GPA+ L G+ R+ +D+W G C
Sbjct: 187 TAYVDAVMTVPAKAMLPISGINIAFNRELVGPALVPALRLAGEGKVRWETLEDVWCGMCL 246
Query: 247 KVICDHLGLGIKTGLPYIYHSKASNPFVNLRKEYKGIFWQEDIIPFFQSAVLPKEATTVQ 306
K I DHLG G+KTGLPY++ ++ + +LRK+++G+ E +PFF S LP+ A V+
Sbjct: 247 KHISDHLGYGVKTGLPYVWRNERGDAVESLRKKWEGMKLMEKSVPFFDSLKLPETALKVE 306
Query: 307 KCYIELAKQVKEKLTKIDPYFDKLADAMVTWIEAWDELNPAG 348
C IELAK VKE+L DP F + ADAMV W++ W+ +N +
Sbjct: 307 DCVIELAKAVKEQLGSDDPAFTQAADAMVKWVQLWNSVNSSA 348
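
The one-line hit summaries near the top of this report (identifier, truncated description, bit score, E-value) can be pulled into a structured form for sorting or filtering. The following is a minimal sketch, not part of the BLAST output itself: the function name `parse_hit_line` and the regular expression are illustrative assumptions, and the pattern is written only against the line shapes seen in this report, including the legacy shorthand `e-166`, which is read here as `1e-166`.

```python
import re

# Hypothetical helper: matches summary lines shaped like
#   "<id> | Symbols: ... | <description>...  <bits>  <evalue>"
# The pattern is an assumption based on the lines in this report only.
HIT_RE = re.compile(r"^(\S+)\s+\|.*\s(\d+(?:\.\d+)?)\s+(\d\S*|e-\d+)\s*$")


def parse_hit_line(line):
    """Return (identifier, bit_score, e_value), or None for non-hit lines."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    ident, bits, evalue = m.groups()
    # Legacy BLAST prints very small E-values without a mantissa ("e-166").
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return ident, float(bits), float(evalue)


if __name__ == "__main__":
    lines = [
        "Sequences producing significant alignments: (bits) Value",
        "AT5G15650.1 | Symbols: RGP2, ATRGP2 | reversibly glycosylated po... 663 0.0",
        "AT5G50750.1 | Symbols: RGP4 | reversibly glycosylated polypeptid... 582 e-166",
        "AT5G16510.2 | Symbols: | Alpha-1,4-glucan-protein synthase fami... 356 1e-98",
    ]
    for line in lines:
        hit = parse_hit_line(line)
        if hit is not None:
            print(hit)
```

Header lines contain no `|` separator, so the function returns `None` for them and they are skipped. For newer BLAST+ releases, a tabular or XML output format would usually be a more robust source than this text layout.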