Miyakogusa Predicted Gene
- Lj5g3v2297780.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v2297780.1 tr|A4S2E7|A4S2E7_OSTLU Predicted protein
OS=Ostreococcus lucimarinus (strain CCE9901)
GN=OSTLU_33429,30.3,0.00000000000003,SUBFAMILY NOT NAMED,NULL;
SAP30,Histone deacetylase complex subunit SAP30/SAP30-like;
SAP30_Sin3_bdg,CUFF.57244.1
(230 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G19330.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 319 1e-87
AT1G19330.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 315 2e-86
AT1G19330.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 311 2e-85
AT1G75060.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 308 3e-84
AT1G75060.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 301 2e-82
>AT1G19330.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins
in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40;
Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr1:6680824-6683290 REVERSE
LENGTH=242
Length = 242
Score = 319 bits (817), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 177/230 (76%), Positives = 189/230 (82%), Gaps = 4/230 (1%)
Query: 2 LVEAAMESSVNGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 60
++EA S V GG F Q+ GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV
Sbjct: 1 MLEAVDSSGVVNGG-FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 59
Query: 61 KKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENVQWNGSDMASDDTQ 120
KKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NGSDM S+DT
Sbjct: 60 KKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMTSEDTL 119
Query: 121 KTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGSVTMTHGSMKVDLSKLEMAALWRYWRHF 180
K HKS+ R RS SSHKTMSRS S DSQSK S +MKVDLSKLEM AL YWRHF
Sbjct: 120 KPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPALLNYWRHF 179
Query: 181 NLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLKSTTVCK 230
NLVDAIPNPSKEQL+D VQRHFMSQQ+DELQVI+GFVQAAKR+K CK
Sbjct: 180 NLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKA--CK 227
>AT1G19330.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins
in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40;
Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr1:6680824-6683290 REVERSE
LENGTH=247
Length = 247
Score = 315 bits (806), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 178/235 (75%), Positives = 190/235 (80%), Gaps = 9/235 (3%)
Query: 2 LVEAAMESSVNGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 60
++EA S V GG F Q+ GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV
Sbjct: 1 MLEAVDSSGVVNGG-FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 59
Query: 61 KKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENVQWNGSDM-----A 115
KKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NGSDM A
Sbjct: 60 KKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSFPA 119
Query: 116 SDDTQKTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGSVTMTHGSMKVDLSKLEMAALWR 175
S+DT K HKS+ R RS SSHKTMSRS S DSQSK S +MKVDLSKLEM AL
Sbjct: 120 SEDTLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPALLN 179
Query: 176 YWRHFNLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLKSTTVCK 230
YWRHFNLVDAIPNPSKEQL+D VQRHFMSQQ+DELQVI+GFVQAAKR+K CK
Sbjct: 180 YWRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKA--CK 232
>AT1G19330.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G75060.1). | chr1:6680824-6683290 REVERSE
LENGTH=248
Length = 248
Score = 311 bits (798), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 178/236 (75%), Positives = 189/236 (80%), Gaps = 10/236 (4%)
Query: 2 LVEAAMESSVNGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 60
++EA S V GG F Q+ GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV
Sbjct: 1 MLEAVDSSGVVNGG-FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 59
Query: 61 KKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENVQWNGSDM-----A 115
KKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NGSDM A
Sbjct: 60 KKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSFPA 119
Query: 116 SDDTQKTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGS-VTMTHGSMKVDLSKLEMAALW 174
S+DT K HKS+ R RS SSHKTMSRS S DSQSK S T KVDLSKLEM AL
Sbjct: 120 SEDTLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMQKVDLSKLEMPALL 179
Query: 175 RYWRHFNLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLKSTTVCK 230
YWRHFNLVDAIPNPSKEQL+D VQRHFMSQQ+DELQVI+GFVQAAKR+K CK
Sbjct: 180 NYWRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKA--CK 233
>AT1G75060.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G19330.2); Has 104 Blast hits to 104 proteins
in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:28181481-28183285 REVERSE
LENGTH=242
Length = 242
Score = 308 bits (788), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 162/229 (70%), Positives = 183/229 (79%), Gaps = 5/229 (2%)
Query: 1 MLVEAAMESSV--NGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 57
ML +SS+ GGG FSQL GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ
Sbjct: 1 MLEAPVNKSSILNGGGGGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 60
Query: 58 GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFEN-VQWN-GSDMA 115
GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE ++ QWN SDM
Sbjct: 61 GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMT 120
Query: 116 SDDTQKTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGSVTMTHGSMKVDLSKLEMAALWR 175
++DT K HKS+ R +RS S K + R S DS SK S +MKVDL+KL+MAAL R
Sbjct: 121 TEDTLKPHKSKKRGHRSSRLSQKALYREVSCDSHSKISSITPRLNMKVDLTKLDMAALLR 180
Query: 176 YWRHFNLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLK 224
YWRHFNLVDA+PNP+KEQL+D +QRHFMSQQ+DELQVI+GFVQAA +K
Sbjct: 181 YWRHFNLVDALPNPTKEQLIDIIQRHFMSQQMDELQVIVGFVQAATGMK 229
>AT1G75060.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G19330.2);
Has 104 Blast hits to 104 proteins in 22 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:28181481-28183285 REVERSE LENGTH=241
Length = 241
Score = 301 bits (771), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 161/229 (70%), Positives = 182/229 (79%), Gaps = 6/229 (2%)
Query: 1 MLVEAAMESSV--NGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 57
ML +SS+ GGG FSQL GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ
Sbjct: 1 MLEAPVNKSSILNGGGGGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 60
Query: 58 GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFEN-VQWN-GSDMA 115
GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE ++ QWN SDM
Sbjct: 61 GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMT 120
Query: 116 SDDTQKTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGSVTMTHGSMKVDLSKLEMAALWR 175
++DT K HKS+ R +RS S K + R S DS SK S +M VDL+KL+MAAL R
Sbjct: 121 TEDTLKPHKSKKRGHRSSRLSQKALYREVSCDSHSKISSITPRLNM-VDLTKLDMAALLR 179
Query: 176 YWRHFNLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLK 224
YWRHFNLVDA+PNP+KEQL+D +QRHFMSQQ+DELQVI+GFVQAA +K
Sbjct: 180 YWRHFNLVDALPNPTKEQLIDIIQRHFMSQQMDELQVIVGFVQAATGMK 228