Miyakogusa Predicted Gene
- Lj0g3v0078789.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0078789.1 tr|A4S2E7|A4S2E7_OSTLU Predicted protein
OS=Ostreococcus lucimarinus (strain CCE9901)
GN=OSTLU_33429,31.5,0.000000000000003,SAP30_Sin3_bdg,Histone
deacetylase complex subunit SAP30, Sin3 binding domain; SUBFAMILY NOT
NAMED,N,CUFF.4029.1
(228 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT1G19330.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   330   4e-91
AT1G75060.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   328   2e-90
AT1G19330.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   327   5e-90
AT1G19330.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   327   5e-90
AT1G75060.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   324   3e-89
>AT1G19330.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins
in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40;
Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr1:6680824-6683290 REVERSE
LENGTH=242
Length = 242
Score = 330 bits (847), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 182/230 (79%), Positives = 193/230 (83%), Gaps = 5/230 (2%)
Query: 1 MLEATMESSSVNGGGDRFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG 58
MLEA S VNGG F Q+QS GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG
Sbjct: 1 MLEAVDSSGVVNGG---FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG 57
Query: 59 VVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENTQWNGSDMASDD 118
VVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FENTQ NGSDM S+D
Sbjct: 58 VVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMTSED 117
Query: 119 TQKSHKSRHRTHRSLGSSHKTTSRSFSADSQSKGSISMPHGWTKVDLSKLEMAALWRYWR 178
T K HKS+ R RS SSHKT SRS S+DSQSK S P KVDLSKLEM AL YWR
Sbjct: 118 TLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPALLNYWR 177
Query: 179 HFNLVDAVPNPSKEQLVDVVQRHFMSQQMDELQVIMGFVQAAKRLKTMCK 228
HFNLVDA+PNPSKEQL+D+VQRHFMSQQMDELQVI+GFVQAAKR+K CK
Sbjct: 178 HFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKACK 227
>AT1G75060.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G19330.2); Has 104 Blast hits to 104 proteins
in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:28181481-28183285 REVERSE
LENGTH=242
Length = 242
Score = 328 bits (842), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 172/233 (73%), Positives = 191/233 (81%), Gaps = 5/233 (2%)
Query: 1 MLEATMESSSV-NGGGDRFSQLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 57
MLEA + SS+ NGGG FSQLQSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ
Sbjct: 1 MLEAPVNKSSILNGGGGGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 60
Query: 58 GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFEN-TQWN-GSDMA 115
GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE ++ TQWN SDM
Sbjct: 61 GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMT 120
Query: 116 SDDTQKSHKSRHRTHRSLGSSHKTTSRSFSADSQSKGSISMPHGWTKVDLSKLEMAALWR 175
++DT K HKS+ R HRS S K R S DS SK S P KVDL+KL+MAAL R
Sbjct: 121 TEDTLKPHKSKKRGHRSSRLSQKALYREVSCDSHSKISSITPRLNMKVDLTKLDMAALLR 180
Query: 176 YWRHFNLVDAVPNPSKEQLVDVVQRHFMSQQMDELQVIMGFVQAAKRLKTMCK 228
YWRHFNLVDA+PNP+KEQL+D++QRHFMSQQMDELQVI+GFVQAA +K C+
Sbjct: 181 YWRHFNLVDALPNPTKEQLIDIIQRHFMSQQMDELQVIVGFVQAATGMKKACQ 233
>AT1G19330.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G75060.1). | chr1:6680824-6683290 REVERSE
LENGTH=248
Length = 248
Score = 327 bits (837), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 174/236 (73%), Positives = 186/236 (78%), Gaps = 11/236 (4%)
Query: 1 MLEATMESSSVNGGGDRFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG 58
MLEA S VNGG F Q+QS GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG
Sbjct: 1 MLEAVDSSGVVNGG---FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG 57
Query: 59 VVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENTQWNGSDM---- 114
VVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FENTQ NGSDM
Sbjct: 58 VVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSF 117
Query: 115 -ASDDTQKSHKSRHRTHRSLGSSHKT-TSRSFSADSQSKGSISMPHGWTKVDLSKLEMAA 172
AS+DT K HKS+ R RS SSHKT + S + P KVDLSKLEM A
Sbjct: 118 PASEDTLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMQKVDLSKLEMPA 177
Query: 173 LWRYWRHFNLVDAVPNPSKEQLVDVVQRHFMSQQMDELQVIMGFVQAAKRLKTMCK 228
L YWRHFNLVDA+PNPSKEQL+D+VQRHFMSQQMDELQVI+GFVQAAKR+K CK
Sbjct: 178 LLNYWRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKACK 233
>AT1G19330.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins
in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40;
Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr1:6680824-6683290 REVERSE
LENGTH=247
Length = 247
Score = 327 bits (837), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 183/235 (77%), Positives = 194/235 (82%), Gaps = 10/235 (4%)
Query: 1 MLEATMESSSVNGGGDRFSQLQS-CGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG 58
MLEA S VNGG F Q+QS GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG
Sbjct: 1 MLEAVDSSGVVNGG---FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQG 57
Query: 59 VVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENTQWNGSDM---- 114
VVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FENTQ NGSDM
Sbjct: 58 VVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSF 117
Query: 115 -ASDDTQKSHKSRHRTHRSLGSSHKTTSRSFSADSQSKGSISMPHGWTKVDLSKLEMAAL 173
AS+DT K HKS+ R RS SSHKT SRS S+DSQSK S P KVDLSKLEM AL
Sbjct: 118 PASEDTLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPAL 177
Query: 174 WRYWRHFNLVDAVPNPSKEQLVDVVQRHFMSQQMDELQVIMGFVQAAKRLKTMCK 228
YWRHFNLVDA+PNPSKEQL+D+VQRHFMSQQMDELQVI+GFVQAAKR+K CK
Sbjct: 178 LNYWRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKACK 232
>AT1G75060.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G19330.2);
Has 104 Blast hits to 104 proteins in 22 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:28181481-28183285 REVERSE LENGTH=241
Length = 241
Score = 324 bits (830), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 171/233 (73%), Positives = 190/233 (81%), Gaps = 6/233 (2%)
Query: 1 MLEATMESSSV-NGGGDRFSQLQSC-GD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 57
MLEA + SS+ NGGG FSQLQSC GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ
Sbjct: 1 MLEAPVNKSSILNGGGGGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 60
Query: 58 GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFEN-TQWN-GSDMA 115
GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE ++ TQWN SDM
Sbjct: 61 GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMT 120
Query: 116 SDDTQKSHKSRHRTHRSLGSSHKTTSRSFSADSQSKGSISMPHGWTKVDLSKLEMAALWR 175
++DT K HKS+ R HRS S K R S DS SK S P VDL+KL+MAAL R
Sbjct: 121 TEDTLKPHKSKKRGHRSSRLSQKALYREVSCDSHSKISSITPR-LNMVDLTKLDMAALLR 179
Query: 176 YWRHFNLVDAVPNPSKEQLVDVVQRHFMSQQMDELQVIMGFVQAAKRLKTMCK 228
YWRHFNLVDA+PNP+KEQL+D++QRHFMSQQMDELQVI+GFVQAA +K C+
Sbjct: 180 YWRHFNLVDALPNPTKEQLIDIIQRHFMSQQMDELQVIVGFVQAATGMKKACQ 232
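The percentages on the "Identities / Positives / Gaps" lines above can be re-derived from the raw counts. The sketch below is not part of the BLAST output: the function name `check_stats_line` and the regular expression are illustrative assumptions, and it assumes the percentages are truncated rather than rounded, which is what the figures in this report are consistent with (for example, 193/230 is about 83.9% and is printed as 83%).

```python
import re

# Matches a per-hit statistics line as printed in this report, e.g.
# "Identities = 182/230 (79%), Positives = 193/230 (83%), Gaps = 5/230 (2%)"
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def check_stats_line(line):
    """Recompute each percentage and compare it with the reported one.

    Hypothetical helper: returns a dict mapping 'identities', 'positives'
    and 'gaps' to a tuple (computed_percent, reported_percent, agrees).
    """
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("not an Identities/Positives/Gaps line")
    numbers = [int(value) for value in match.groups()]
    names = ("identities", "positives", "gaps")
    result = {}
    for index, name in enumerate(names):
        count, length, reported = numbers[3 * index: 3 * index + 3]
        # Assumption: truncate toward zero (integer division), do not round.
        computed = count * 100 // length
        result[name] = (computed, reported, computed == reported)
    return result


if __name__ == "__main__":
    line = ("Identities = 182/230 (79%), Positives = 193/230 (83%), "
            "Gaps = 5/230 (2%)")
    for name, (computed, reported, agrees) in check_stats_line(line).items():
        print(name, computed, reported, agrees)
```

Under the truncation assumption, all five statistics lines in this report agree with their raw counts.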