Miyakogusa Predicted Gene
- Lj4g3v0668360.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0668360.1 tr|C1EJ26|C1EJ26_MICSR Predicted protein
OS=Micromonas sp. (strain RCC299 / NOUM17) GN=MICPUN_89045
,37.04,2e-18,GLUCOSIDASE II BETA SUBUNIT,Glucosidase 2 subunit beta;
N-LINKED OLIGOSACCHARIDE PROCESSING,NULL; Ma,CUFF.47856.1
(622 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G56360.1 | Symbols: PSL4 | calmodulin-binding protein | chr5:... 553 e-157
AT2G42390.1 | Symbols: | protein kinase C substrate, heavy chai... 155 8e-38
>AT5G56360.1 | Symbols: PSL4 | calmodulin-binding protein |
chr5:22823586-22827950 REVERSE LENGTH=647
Length = 647
Score = 553 bits (1424), Expect = e-157, Method: Compositional matrix adjust.
Identities = 314/611 (51%), Positives = 386/611 (63%), Gaps = 20/611 (3%)
Query: 29 IAPQDDKYYKSSDVIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGH 88
I+PQD+KYYKSS IKCKDGS KF K LNDDFCDC DGTDEPGTSACP GKFYCRNAGH
Sbjct: 40 ISPQDEKYYKSSSEIKCKDGSKKFTKAQLNDDFCDCSDGTDEPGTSACPTGKFYCRNAGH 99
Query: 89 SPVYLFSSRVNDGICDCCDGTDEYDGKAKCPNTCWEAGKVARNKLKKKIATYQEGVKVRK 148
SPV LFSSRVNDGICDCCDG+DEYDG C NTCWEAGK AR LKKKI TY +G+ +R+
Sbjct: 100 SPVILFSSRVNDGICDCCDGSDEYDGHVSCQNTCWEAGKAARENLKKKIETYNQGLVIRR 159
Query: 149 QEVEQAKLAMEKDEAELSKLKNEESVLKGIVKQLXXXXXXXXXXXXXXXXXXXXXXXXXX 208
QE+EQAK+ +EKD AEL KLK+E+ +LKG+V QL
Sbjct: 160 QEIEQAKVGLEKDAAELKKLKSEQKILKGLVDQLKDRKEQIEKVEEKERLQKEKEEKEKK 219
Query: 209 XXXXXXXXXXXXADEDVEHNNEVEMPSH------VEDNDVASNHDNIEIQEDSPTNQXXX 262
A+E + + +VE SH V +D ++HD I +D P+++
Sbjct: 220 EAELAAQQGKGDAEEKTDDSEKVEESSHDEGTPAVSQHDETTHHDEIGNYKDYPSDEEPA 279
Query: 263 XXXXXXXXXNSDGATDSPGSEGSLHHXXXXXXXXXXXXXXXXXXTDLLTGKEDSSEEVIN 322
+ T+ P E + D KE+ S+EV
Sbjct: 280 AEGEPTSILDEATHTN-PADEHVVERKEESTSSEDSSSPTDESQNDGSAEKEE-SDEVKK 337
Query: 323 TGKDVSENTEGLSKEELGKLVASRWTGEDTGKKNAEAD---TSLDNEHQDDLP----NVE 375
V+E E LSKEELG+LVASRWTGE + K EAD + D E+ + P +
Sbjct: 338 VEDFVTEKKEELSKEELGRLVASRWTGEKSDKP-TEADDIPKADDQENHEHTPITAHEAD 396
Query: 376 EYEGYASETXXXXXXXXXXXXXXXXXXXXXXXXXXLSSSY--TDTEPDLADDSTTDSPSW 433
E +G+ S+ SSS +D + D+ TT +P+W
Sbjct: 397 EDDGFVSDGDEDTSDDGKYSDHEPEDDSYEEEYRHDSSSSYKSDADDDVDFSETTSNPTW 456
Query: 434 LEKIQNSVRNIFQVVNIFQ-TPVNQSDAARIRKEYDESSTKLSKIQSRISSLKKKLKQDF 492
LEKIQ +V+NI VN+FQ TPV++S+A R+RKEYDESS+KL+KIQSRISSL+KKLKQDF
Sbjct: 457 LEKIQKTVKNILLAVNLFQTTPVDKSEADRVRKEYDESSSKLNKIQSRISSLEKKLKQDF 516
Query: 493 GPAKEFYSFYDRCFESKQNKYTYKVCPYKQASQEEGYSTTRLGRWDKFEDSYKVMVFSEG 552
GP KEFYSF+ RCFESKQ KYTYKVC YK+A+QEEGYS TRLG WDKFE+SY+ M ++ G
Sbjct: 517 GPEKEFYSFHGRCFESKQGKYTYKVCAYKEATQEEGYSKTRLGEWDKFENSYQFMSYTNG 576
Query: 553 DKCWNGPDRSLKVRLKCGLTYEITDVDEPSRCEYVALLATPALCHEEKLKELQHKLDKL- 611
+KCWNGPDRSLKV+L+CGL E+ DVDEPSRCEY A+L+TPA C E+KLKELQ KL+KL
Sbjct: 577 EKCWNGPDRSLKVKLRCGLKNELMDVDEPSRCEYAAILSTPARCLEDKLKELQQKLEKLM 636
Query: 612 NSEIPEKHDEL 622
N + P+ HDEL
Sbjct: 637 NQDKPQNHDEL 647
>AT2G42390.1 | Symbols: | protein kinase C substrate, heavy
chain-related | chr2:17651381-17652509 REVERSE
LENGTH=212
Length = 212
Score = 155 bits (392), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 69/100 (69%), Positives = 79/100 (79%), Gaps = 1/100 (1%)
Query: 29 IAPQDDKYYKSSDVIKCKDGSGKFNKDHLNDDFCDCLDGTDEPGTSACPRGKFYCRNAGH 88
+ P D+KY+ S DVIKCKDGS F +D LND+FCDCLDGTDEPGTSACP GKFYCRN G
Sbjct: 28 VHPLDEKYFDS-DVIKCKDGSKSFTRDRLNDNFCDCLDGTDEPGTSACPNGKFYCRNIGS 86
Query: 89 SPVYLFSSRVNDGICDCCDGTDEYDGKAKCPNTCWEAGKV 128
SP +++SSRVND ICDCCDG+DEY+ CPNTC G V
Sbjct: 87 SPKFVYSSRVNDRICDCCDGSDEYESSIHCPNTCIMGGNV 126