Miyakogusa Predicted Gene
- Lj4g3v1327600.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1327600.1 tr|C1MZM1|C1MZM1_MICPC Predicted protein
OS=Micromonas pusilla (strain CCMP1545)
GN=MICPUCDRAFT_4810,39.47,3e-19,Staygreen,Staygreen protein; FAMILY
NOT NAMED,NULL; seg,NULL,CUFF.48843.1
(189 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value
AT4G22920.1 | Symbols: ATNYE1, NYE1 | non-yellowing 1 | chr4:120...   293   6e-80
AT4G11911.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    283   7e-77
AT4G11910.1 | Symbols: | INVOLVED IN: biological_process unknow...    278   1e-75
AT1G44000.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    157   6e-39
>AT4G22920.1 | Symbols: ATNYE1, NYE1 | non-yellowing 1 |
chr4:12016776-12017969 REVERSE LENGTH=268
Length = 268
Score = 293 bits (749), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 135/164 (82%), Positives = 151/164 (92%)
Query: 26  RRIRKKNQAVFPVARLFGPAMFEASKLKVLFLGVDENKHPGDLPRTYTLTHSDITSKITL 85
           RR +KKNQ++ PVARLFGPA+FE+SKLKVLFLGVDE KHP  LPRTYTLTHSDIT+K+TL
Sbjct: 36  RRSKKKNQSIVPVARLFGPAIFESSKLKVLFLGVDEKKHPSTLPRTYTLTHSDITAKLTL 95
Query: 86  AISHNINNSQLQGWYNRLQRDEVVAQWRKIKGNMSLHVHCHISGGHFLLDLCAKLRYFIF 145
           AIS +INNSQLQGW NRL RDEVVA+W+K+KG MSLHVHCHISGGHFLLDL AK RYFIF
Sbjct: 96  AISQSINNSQLQGWANRLYRDEVVAEWKKVKGKMSLHVHCHISGGHFLLDLFAKFRYFIF 155
Query: 146 CKELPVVLKAFIHGDENLFNNYPELEESLVWVYFHSNISEFNKV 189
           CKELPVVLKAF+HGD NL NNYPEL+E+LVWVYFHSN++EFNKV
Sbjct: 156 CKELPVVLKAFVHGDGNLLNNYPELQEALVWVYFHSNVNEFNKV 199
>AT4G11911.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT4G11910.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr4:7158150-7159440 FORWARD LENGTH=273
Length = 273
Score = 283 bits (723), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 131/168 (77%), Positives = 148/168 (88%)
Query: 22  PHRTRRIRKKNQAVFPVARLFGPAMFEASKLKVLFLGVDENKHPGDLPRTYTLTHSDITS 81
           P  TR  ++K Q++FPVARLFG A+FEASKL V FLGVDE KHP +LPRTYT THSDIT+
Sbjct: 28  PSTTRSSKRKKQSMFPVARLFGQAIFEASKLNVKFLGVDEKKHPPNLPRTYTFTHSDITA 87
Query: 82  KITLAISHNINNSQLQGWYNRLQRDEVVAQWRKIKGNMSLHVHCHISGGHFLLDLCAKLR 141
           K+TLAISH+INNSQLQGW NRL RDEVVA+WRK+K NMSLHVHCHISG HFLLDL A+LR
Sbjct: 88  KLTLAISHSINNSQLQGWANRLYRDEVVAEWRKVKSNMSLHVHCHISGDHFLLDLIAELR 147
Query: 142 YFIFCKELPVVLKAFIHGDENLFNNYPELEESLVWVYFHSNISEFNKV 189
           YFIFCKELP+VLKAF+HGDEN+ NNYPEL E+ VWVYFHSNI +FNKV
Sbjct: 148 YFIFCKELPMVLKAFVHGDENMLNNYPELHEAFVWVYFHSNIPKFNKV 195
>AT4G11910.1 | Symbols: | INVOLVED IN: biological_process unknown;
LOCATED IN: chloroplast; BEST Arabidopsis thaliana
protein match is: non-yellowing 1 (TAIR:AT4G22920.1);
Has 206 Blast hits to 202 proteins in 67 species: Archae
- 0; Bacteria - 86; Metazoa - 0; Fungi - 0; Plants -
118; Viruses - 0; Other Eukaryotes - 2 (source: NCBI
BLink). | chr4:7156435-7157839 FORWARD LENGTH=271
Length = 271
Score = 278 bits (711), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 129/165 (78%), Positives = 147/165 (89%)
Query: 25  TRRIRKKNQAVFPVARLFGPAMFEASKLKVLFLGVDENKHPGDLPRTYTLTHSDITSKIT 84
           TRR + KN+++ PVARLFGPA+FEASKLKVLFLGVDE KHP  LPRTYTLTHSDIT+K+T
Sbjct: 31  TRRSKMKNRSIVPVARLFGPAIFEASKLKVLFLGVDEKKHPAKLPRTYTLTHSDITAKLT 90
Query: 85  LAISHNINNSQLQGWYNRLQRDEVVAQWRKIKGNMSLHVHCHISGGHFLLDLCAKLRYFI 144
           LAIS +INNSQLQGW N+L RDEVV +W+K+KG MSLHVHCHISGGHF L+L AKLRY+I
Sbjct: 91  LAISQSINNSQLQGWANKLFRDEVVGEWKKVKGKMSLHVHCHISGGHFFLNLIAKLRYYI 150
Query: 145 FCKELPVVLKAFIHGDENLFNNYPELEESLVWVYFHSNISEFNKV 189
           FCKELPVVL+AF HGDE L NN+PEL+ES VWVYFHSNI E+NKV
Sbjct: 151 FCKELPVVLEAFAHGDEYLLNNHPELQESPVWVYFHSNIPEYNKV 195
>AT1G44000.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G11911.1); Has 216 Blast hits to 212 proteins
in 76 species: Archae - 0; Bacteria - 96; Metazoa - 0;
Fungi - 0; Plants - 118; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:16708201-16709521 REVERSE
LENGTH=260
Length = 260
Score = 157 bits (396), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 111/162 (68%), Gaps = 7/162 (4%)
Query: 32  NQAVFPVARLFGP-AMFEASKLKVLFLG-VDENKHPGDL--PRTYTLTHSDITSKITLAI 87
           N  V    RL  P A F++SKLKV FLG + ENK  G +  PRTY L+H D T+ +TL I
Sbjct: 57  NTLVSEAVRLLVPQANFDSSKLKVEFLGELLENKSNGGIITPRTYILSHCDFTANLTLTI 116
Query: 88  SHNINNSQLQGWYNRLQRDEVVAQWRKIKGNMSLHVHCHISGGHFLLDLCAKLRYFIFCK 147
           S+ IN  QL+GWY   ++D+VVA+W+K+   + LH+HC +SG   L D+ A+LRY IF K
Sbjct: 117 SNVINLDQLEGWY---KKDDVVAEWKKVNDELRLHIHCCVSGMSLLQDVAAELRYHIFSK 173
Query: 148 ELPVVLKAFIHGDENLFNNYPELEESLVWVYFHSNISEFNKV 189
           ELP+VLKA +HGD  +F   PEL ++ VWVYFHS+  ++N++
Sbjct: 174 ELPLVLKAVVHGDSVMFRENPELMDAYVWVYFHSSTPKYNRI 215