Miyakogusa Predicted Gene
- Lj4g3v0668360.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0668360.1 tr|C1EJ26|C1EJ26_MICSR Predicted protein
OS=Micromonas sp. (strain RCC299 / NOUM17) GN=MICPUN_89045
,37.04,2e-18,GLUCOSIDASE II BETA SUBUNIT,Glucosidase 2 subunit beta;
N-LINKED OLIGOSACCHARIDE PROCESSING,NULL; Ma,CUFF.47856.1
(622 letters)
Database: Glyma1.pep
75,778 sequences; 25,431,882 total letters
Searching..................................................done
                                              Score     E
Sequences producing significant alignments:  (bits)   Value
Glyma07g35090.1                                 783   0.0
Glyma20g02950.2                                 425   e-119
Glyma20g02950.1                                 425   e-119
Glyma12g14110.1                                 147   4e-35
Glyma06g43790.1                                 143   5e-34
Glyma06g43790.2                                 131   2e-30
Glyma12g19470.1                                  92   2e-18
Glyma08g36100.1                                  86   1e-16
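Note: the one-line hit summaries above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output. The function names `parse_evalue` and `parse_hit_summary` are hypothetical. Legacy reports print very small E-values without their leading digit (for example `e-119`), which `float()` rejects; the sketch assumes a mantissa of 1 when restoring it, so such values should be treated as order-of-magnitude only.

```python
def parse_evalue(text: str) -> float:
    """Convert a legacy BLAST E-value string to a float.

    Strings such as 'e-119' lack a mantissa; prefixing '1' is an
    assumption made here so that float() can parse them.
    """
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_summary(lines):
    """Yield (subject_id, bit_score, e_value) for each one-line hit summary."""
    for line in lines:
        fields = line.split()
        if len(fields) != 3:
            continue  # header or blank line, not a hit row
        subject_id, bits, evalue = fields
        try:
            yield subject_id, float(bits), parse_evalue(evalue)
        except ValueError:
            continue  # three fields, but not numeric score and E-value


# A few rows copied from the summary above.
summary = """\
Sequences producing significant alignments: (bits) Value
Glyma07g35090.1 783 0.0
Glyma20g02950.2 425 e-119
Glyma12g19470.1 92 2e-18
""".splitlines()

hits = list(parse_hit_summary(summary))
for subject_id, bits, evalue in hits:
    print(subject_id, bits, evalue)
```

Splitting on whitespace keeps the sketch independent of how the columns are padded.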
>Glyma07g35090.1
Length = 629
Score = 783 bits (2021), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/607 (65%), Positives = 441/607 (72%), Gaps = 18/607 (2%)
Query: 29 IAPQDDKYYKSSDVIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGH 88
I+P+DDKYYK+SDVI+CKDGSGKF K LNDDFCDC DGTDEPGTSACP GKFYCRNAGH
Sbjct: 28 ISPEDDKYYKASDVIRCKDGSGKFTKAQLNDDFCDCADGTDEPGTSACPGGKFYCRNAGH 87
Query: 89 SPVYLFSSRVNDGICDCCDGTDEYDGKAKCPNTCWEAGKVARNKLKKKIATYQEGVKVRK 148
SPVYLFSSRVNDGICDCCDGTDEYDG+ KCPNTCWEAGKVAR++L+KKIATYQEGVK+RK
Sbjct: 88 SPVYLFSSRVNDGICDCCDGTDEYDGQVKCPNTCWEAGKVARDRLEKKIATYQEGVKLRK 147
Query: 149 QEVEQAKLAMEKDEAELSKLKNEESVLKGIVKQLXXXXXXXXXXXXXXXXXXXXXXXXXX 208
E+EQAK+AMEKDEAELSKLK EES+LKGIVKQL
Sbjct: 148 LEMEQAKVAMEKDEAELSKLKKEESILKGIVKQLKDHKEQIEKAEEKERLQKEKEEKQKK 207
Query: 209 XXXXXXXXXXXXADEDVEHNNEVEMPSHVEDNDVASNHDNIEIQEDSPTNQXXXXXXXXX 268
ADED H NE E S VEDN V +NHD IE E SP +Q
Sbjct: 208 ESEEKANEAKDKADEDTGHRNEAEKHSDVEDNSVENNHDKIENLEGSPADQDEAGDKLED 267
Query: 269 XXXNSDGATDSPGSEGSLHHXXXXXXXXXXXXXXXXXXTDLLTGKEDSSEEVINTGKDVS 328
N D A+DSPGSEGSLH+ TD+ G ++SS E+IN G D S
Sbjct: 268 VLDNDDEASDSPGSEGSLHNKVEENAKEAEEEPIVKSETDIKVGNKESSAEIINKGNDAS 327
Query: 329 ENTEGLSKEELGKLVASRWTGEDTGKKNAEADTSLDNEHQDDLP---NVEEYEGYASETX 385
ENTEGLS+EELG+LVASRWTGE+T K +AE DT+LDNE +DLP N EEYEGYASET
Sbjct: 328 ENTEGLSREELGRLVASRWTGENTDKSSAEPDTTLDNEDHEDLPKGTNNEEYEGYASETD 387
Query: 386 XXXXXXXXXXXXXX-------XXXXXXXXXXXLSSSY---TDTEPDLADDSTTDSPSWLE 435
L+SSY +D EPDL +D+PSWLE
Sbjct: 388 DDIDSNKYDDDSHKYDDEDEVDEEYREDEHDDLTSSYKSDSDNEPDL-----SDNPSWLE 442
Query: 436 KIQNSVRNIFQVVNIFQTPVNQSDAARIRKEYDESSTKLSKIQSRISSLKKKLKQDFGPA 495
KIQ +VRNIFQ VN+FQ PVNQSDAAR+RKEYDESS KLSKIQSRISSLK+KLK DFGPA
Sbjct: 443 KIQRTVRNIFQAVNLFQAPVNQSDAARVRKEYDESSAKLSKIQSRISSLKQKLKHDFGPA 502
Query: 496 KEFYSFYDRCFESKQNKYTYKVCPYKQASQEEGYSTTRLGRWDKFEDSYKVMVFSEGDKC 555
KEFYSFYD CFE K+NKYTYKVCPYKQASQEEGYS TRLG WDKFEDSY+VMVFS GDKC
Sbjct: 503 KEFYSFYDHCFEGKENKYTYKVCPYKQASQEEGYSNTRLGSWDKFEDSYRVMVFSNGDKC 562
Query: 556 WNGPDRSLKVRLKCGLTYEITDVDEPSRCEYVALLATPALCHEEKLKELQHKLDKLNSEI 615
WNGPDRSLKV+L+CGL EITDVDEPSRCEYVA+L+TPALC EE+LKELQHKLD LNSEI
Sbjct: 563 WNGPDRSLKVKLRCGLKNEITDVDEPSRCEYVAVLSTPALCQEERLKELQHKLDLLNSEI 622
Query: 616 PEKHDEL 622
P HDEL
Sbjct: 623 PSNHDEL 629
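Note: the percentages on the `Identities` / `Positives` / `Gaps` line are each count divided by the alignment length (607 columns for this hit, gaps included). The Python sketch below is illustrative only and not part of the BLAST output; `parse_hsp_stats` is a hypothetical helper. It recomputes the percentages and compares them with the printed ones. For the values in this report, the printed figures are consistent with truncation rather than rounding.

```python
import re

# Matches e.g. "Identities = 400/607 (65%)"
STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Return {name: (count, alignment_length, printed_percent)}."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in STATS_RE.findall(line)
    }


line = "Identities = 400/607 (65%), Positives = 441/607 (72%), Gaps = 18/607 (2%)"
stats = parse_hsp_stats(line)

for name, (count, total, printed) in stats.items():
    exact = 100 * count / total          # unrounded percentage
    truncated = (100 * count) // total   # integer part only
    print(f"{name}: exact {exact:.2f}%, truncated {truncated}%, printed {printed}%")
```

Lines without a `Gaps` field (ungapped alignments) simply yield a dictionary without that key.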
>Glyma20g02950.2
Length = 568
Score = 425 bits (1092), Expect = e-119, Method: Compositional matrix adjust.
Identities = 218/309 (70%), Positives = 240/309 (77%), Gaps = 15/309 (4%)
Query: 324 GKDVSENTEGLSKEELGKLVASRWTGEDTGKKNAEADTSLDNEHQDDLP--NVEEYEGYA 381
G D SENTEGLS+EELG+LVASRWTGE+T K +A DT+LDNE ++D N EEYEGYA
Sbjct: 265 GNDASENTEGLSREELGRLVASRWTGENTDKPSAVPDTTLDNEDREDPKGRNNEEYEGYA 324
Query: 382 SETXXXXXXXXXXXXXXXXXXXXXXXXXX-----LSSSY---TDTEPDLADDSTTDSPSW 433
SET LSSSY +D EPDL+D+ PSW
Sbjct: 325 SETDDDNNKYDDDSHKYDDEDEVDDEYREDEHDDLSSSYKSDSDNEPDLSDN-----PSW 379
Query: 434 LEKIQNSVRNIFQVVNIFQTPVNQSDAARIRKEYDESSTKLSKIQSRISSLKKKLKQDFG 493
LEKIQ +VRNIFQVVN+FQ PVNQ+DAAR+RKEYDESS KLSKIQSRISSLK+KLK DFG
Sbjct: 380 LEKIQRTVRNIFQVVNLFQAPVNQTDAARVRKEYDESSAKLSKIQSRISSLKQKLKHDFG 439
Query: 494 PAKEFYSFYDRCFESKQNKYTYKVCPYKQASQEEGYSTTRLGRWDKFEDSYKVMVFSEGD 553
PAKEFYSFYD CFE K+NKYTYKVCPYKQASQEEGYS TRLG WDKFEDSY+VMVFS GD
Sbjct: 440 PAKEFYSFYDHCFEGKENKYTYKVCPYKQASQEEGYSNTRLGSWDKFEDSYRVMVFSNGD 499
Query: 554 KCWNGPDRSLKVRLKCGLTYEITDVDEPSRCEYVALLATPALCHEEKLKELQHKLDKLNS 613
KCWNGPDRSLKV+L+CGL EITDVDEPSRCEYVA+L+TP LC EE+LKELQ KLD LNS
Sbjct: 500 KCWNGPDRSLKVKLRCGLKNEITDVDEPSRCEYVAVLSTPTLCQEERLKELQLKLDLLNS 559
Query: 614 EIPEKHDEL 622
EIP HDEL
Sbjct: 560 EIPANHDEL 568
Score = 321 bits (823), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 157/231 (67%), Positives = 170/231 (73%)
Query: 29 IAPQDDKYYKSSDVIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGH 88
+AP+DD YYKSSDVI CKDGSGKF K NDDFCDC DGTDEPGTSACP GKFYCRNAGH
Sbjct: 34 VAPEDDDYYKSSDVISCKDGSGKFTKAQFNDDFCDCADGTDEPGTSACPGGKFYCRNAGH 93
Query: 89 SPVYLFSSRVNDGICDCCDGTDEYDGKAKCPNTCWEAGKVARNKLKKKIATYQEGVKVRK 148
SPVYLFSSRVNDGICDCCDGTDEYDG+ KCPNTCWEAGKVAR++LKKKIATYQEGVK+RK
Sbjct: 94 SPVYLFSSRVNDGICDCCDGTDEYDGQVKCPNTCWEAGKVARDRLKKKIATYQEGVKLRK 153
Query: 149 QEVEQAKLAMEKDEAELSKLKNEESVLKGIVKQLXXXXXXXXXXXXXXXXXXXXXXXXXX 208
QE+EQAK+AMEKDEAELSKLK EES+LKGIVKQL
Sbjct: 154 QEIEQAKVAMEKDEAELSKLKKEESILKGIVKQLKDHKEQIDKAEEEERLQKEKEEKQKR 213
Query: 209 XXXXXXXXXXXXADEDVEHNNEVEMPSHVEDNDVASNHDNIEIQEDSPTNQ 259
ADED EH NE E S +EDN + +NHD IE E SP +Q
Sbjct: 214 ESEEKANEAKDKADEDTEHRNEAEKHSDIEDNTLENNHDKIENLEGSPADQ 264
>Glyma20g02950.1
Length = 568
Score = 425 bits (1092), Expect = e-119, Method: Compositional matrix adjust.
Identities = 218/309 (70%), Positives = 240/309 (77%), Gaps = 15/309 (4%)
Query: 324 GKDVSENTEGLSKEELGKLVASRWTGEDTGKKNAEADTSLDNEHQDDLP--NVEEYEGYA 381
G D SENTEGLS+EELG+LVASRWTGE+T K +A DT+LDNE ++D N EEYEGYA
Sbjct: 265 GNDASENTEGLSREELGRLVASRWTGENTDKPSAVPDTTLDNEDREDPKGRNNEEYEGYA 324
Query: 382 SETXXXXXXXXXXXXXXXXXXXXXXXXXX-----LSSSY---TDTEPDLADDSTTDSPSW 433
SET LSSSY +D EPDL+D+ PSW
Sbjct: 325 SETDDDNNKYDDDSHKYDDEDEVDDEYREDEHDDLSSSYKSDSDNEPDLSDN-----PSW 379
Query: 434 LEKIQNSVRNIFQVVNIFQTPVNQSDAARIRKEYDESSTKLSKIQSRISSLKKKLKQDFG 493
LEKIQ +VRNIFQVVN+FQ PVNQ+DAAR+RKEYDESS KLSKIQSRISSLK+KLK DFG
Sbjct: 380 LEKIQRTVRNIFQVVNLFQAPVNQTDAARVRKEYDESSAKLSKIQSRISSLKQKLKHDFG 439
Query: 494 PAKEFYSFYDRCFESKQNKYTYKVCPYKQASQEEGYSTTRLGRWDKFEDSYKVMVFSEGD 553
PAKEFYSFYD CFE K+NKYTYKVCPYKQASQEEGYS TRLG WDKFEDSY+VMVFS GD
Sbjct: 440 PAKEFYSFYDHCFEGKENKYTYKVCPYKQASQEEGYSNTRLGSWDKFEDSYRVMVFSNGD 499
Query: 554 KCWNGPDRSLKVRLKCGLTYEITDVDEPSRCEYVALLATPALCHEEKLKELQHKLDKLNS 613
KCWNGPDRSLKV+L+CGL EITDVDEPSRCEYVA+L+TP LC EE+LKELQ KLD LNS
Sbjct: 500 KCWNGPDRSLKVKLRCGLKNEITDVDEPSRCEYVAVLSTPTLCQEERLKELQLKLDLLNS 559
Query: 614 EIPEKHDEL 622
EIP HDEL
Sbjct: 560 EIPANHDEL 568
Score = 321 bits (823), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 157/231 (67%), Positives = 170/231 (73%)
Query: 29 IAPQDDKYYKSSDVIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGH 88
+AP+DD YYKSSDVI CKDGSGKF K NDDFCDC DGTDEPGTSACP GKFYCRNAGH
Sbjct: 34 VAPEDDDYYKSSDVISCKDGSGKFTKAQFNDDFCDCADGTDEPGTSACPGGKFYCRNAGH 93
Query: 89 SPVYLFSSRVNDGICDCCDGTDEYDGKAKCPNTCWEAGKVARNKLKKKIATYQEGVKVRK 148
SPVYLFSSRVNDGICDCCDGTDEYDG+ KCPNTCWEAGKVAR++LKKKIATYQEGVK+RK
Sbjct: 94 SPVYLFSSRVNDGICDCCDGTDEYDGQVKCPNTCWEAGKVARDRLKKKIATYQEGVKLRK 153
Query: 149 QEVEQAKLAMEKDEAELSKLKNEESVLKGIVKQLXXXXXXXXXXXXXXXXXXXXXXXXXX 208
QE+EQAK+AMEKDEAELSKLK EES+LKGIVKQL
Sbjct: 154 QEIEQAKVAMEKDEAELSKLKKEESILKGIVKQLKDHKEQIDKAEEEERLQKEKEEKQKR 213
Query: 209 XXXXXXXXXXXXADEDVEHNNEVEMPSHVEDNDVASNHDNIEIQEDSPTNQ 259
ADED EH NE E S +EDN + +NHD IE E SP +Q
Sbjct: 214 ESEEKANEAKDKADEDTEHRNEAEKHSDIEDNTLENNHDKIENLEGSPADQ 264
>Glyma12g14110.1
Length = 188
Score = 147 bits (371), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 85/130 (65%), Gaps = 1/130 (0%)
Query: 29 IAPQDDKYYKSSDVIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGH 88
I P D+KYY +S+VIKC+DGS F++D LND+FCDC DGTDEPGTSACP GKFYCRN G
Sbjct: 32 IHPLDEKYY-NSEVIKCRDGSKSFSRDRLNDNFCDCPDGTDEPGTSACPNGKFYCRNLGS 90
Query: 89 SPVYLFSSRVNDGICDCCDGTDEYDGKAKCPNTCWEAGKVARNKLKKKIATYQEGVKVRK 148
P ++ SS VND CDCCDG+DEYDG CPNTC G K Q GVK +
Sbjct: 91 KPQFIVSSHVNDHFCDCCDGSDEYDGIICCPNTCVMGGNAESTFSNCKSEASQNGVKSEE 150
Query: 149 QEVEQAKLAM 158
KL +
Sbjct: 151 SVHTGLKLVI 160
>Glyma06g43790.1
Length = 189
Score = 143 bits (361), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 72/130 (55%), Positives = 84/130 (64%), Gaps = 1/130 (0%)
Query: 29 IAPQDDKYYKSSDVIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGH 88
I P D+KYY SS++IKCKD S F++D LND+FCDC DGTDEPGTSACP GKFYCRN G
Sbjct: 33 IHPLDEKYY-SSEMIKCKDESKSFSRDRLNDNFCDCPDGTDEPGTSACPNGKFYCRNLGS 91
Query: 89 SPVYLFSSRVNDGICDCCDGTDEYDGKAKCPNTCWEAGKVARNKLKKKIATYQEGVKVRK 148
P ++ SS VND CDCCDG+DEYDG CPNTC G K + GVK +
Sbjct: 92 KPQFIVSSHVNDHFCDCCDGSDEYDGTICCPNTCVMGGNAESTFRNCKSKASKNGVKSEE 151
Query: 149 QEVEQAKLAM 158
KL +
Sbjct: 152 SVHTGLKLVI 161
>Glyma06g43790.2
Length = 145
Score = 131 bits (330), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 64/117 (54%), Positives = 74/117 (63%)
Query: 42 VIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGHSPVYLFSSRVNDG 101
+IKCKD S F++D LND+FCDC DGTDEPGTSACP GKFYCRN G P ++ SS VND
Sbjct: 1 MIKCKDESKSFSRDRLNDNFCDCPDGTDEPGTSACPNGKFYCRNLGSKPQFIVSSHVNDH 60
Query: 102 ICDCCDGTDEYDGKAKCPNTCWEAGKVARNKLKKKIATYQEGVKVRKQEVEQAKLAM 158
CDCCDG+DEYDG CPNTC G K + GVK + KL +
Sbjct: 61 FCDCCDGSDEYDGTICCPNTCVMGGNAESTFRNCKSKASKNGVKSEESVHTGLKLVI 117
>Glyma12g19470.1
Length = 57
Score = 92.0 bits (227), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/56 (73%), Positives = 46/56 (82%)
Query: 33 DDKYYKSSDVIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGH 88
+D YYKSS+VI+CKDGSGKF K LNDDFC+C DG DE GTSACP GKFYC+N G
Sbjct: 1 NDNYYKSSNVIRCKDGSGKFTKAQLNDDFCECADGIDELGTSACPGGKFYCQNVGR 56
>Glyma08g36100.1
Length = 102
Score = 85.9 bits (211), Expect = 1e-16, Method: Composition-based stats.
Identities = 39/57 (68%), Positives = 41/57 (71%), Gaps = 7/57 (12%)
Query: 45 CKDGSGKFNKDHLNDDFCDCLDGTDEPG-------TSACPRGKFYCRNAGHSPVYLF 94
CKDGSGKF K LNDD CDC+DGT+EPG T ACP KFYCRNAGHS VYL
Sbjct: 1 CKDGSGKFTKAQLNDDMCDCVDGTNEPGFYFNYEITMACPGEKFYCRNAGHSHVYLL 57
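Note: as a rough plausibility check on the E-values above, the Karlin-Altschul relation E ≈ m · n · 2^(−S′) links a bit score S′ to an expected number of chance hits, where m is the query length and n the database size. The Python sketch below is not part of the BLAST output; it plugs in the raw values from this report (622 query letters, 25,431,882 database letters). BLAST itself uses shorter effective lengths, and both scores and E-values are printed rounded, so only order-of-magnitude agreement should be expected. `approx_evalue` is a hypothetical helper name.

```python
QUERY_LETTERS = 622            # "(622 letters)" in the report header
DATABASE_LETTERS = 25_431_882  # "25,431,882 total letters"


def approx_evalue(bit_score: float,
                  m: int = QUERY_LETTERS,
                  n: int = DATABASE_LETTERS) -> float:
    """Approximate E-value from a bit score using raw, uncorrected lengths."""
    return m * n * 2.0 ** (-bit_score)


# (bit score, E-value as printed) for the five lowest-scoring hits above.
reported = {
    "Glyma12g14110.1": (147.0, 4e-35),
    "Glyma06g43790.1": (143.0, 5e-34),
    "Glyma06g43790.2": (131.0, 2e-30),
    "Glyma12g19470.1": (92.0, 2e-18),
    "Glyma08g36100.1": (85.9, 1e-16),
}

ratios = {}
for subject_id, (bits, printed) in reported.items():
    estimate = approx_evalue(bits)
    ratios[subject_id] = estimate / printed
    print(f"{subject_id}: estimate {estimate:.1e}, printed {printed:.0e}")
```

The estimates come out somewhat larger than the printed values, which is the direction expected when raw lengths are used in place of effective lengths.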