Miyakogusa Predicted Gene

Lj5g3v2297780.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v2297780.1 tr|A4S2E7|A4S2E7_OSTLU Predicted protein
OS=Ostreococcus lucimarinus (strain CCE9901)
GN=OSTLU_33429,30.3,0.00000000000003,SUBFAMILY NOT NAMED,NULL;
SAP30,Histone deacetylase complex subunit SAP30/SAP30-like;
SAP30_Sin3_bdg,CUFF.57244.1
         (230 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G19330.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   319   1e-87
AT1G19330.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   315   2e-86
AT1G19330.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   311   2e-85
AT1G75060.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   308   3e-84
AT1G75060.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   301   2e-82

>AT1G19330.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins
           in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40;
           Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr1:6680824-6683290 REVERSE
           LENGTH=242
          Length = 242

 Score =  319 bits (817), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 177/230 (76%), Positives = 189/230 (82%), Gaps = 4/230 (1%)

Query: 2   LVEAAMESSVNGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 60
           ++EA   S V  GG F Q+    GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV
Sbjct: 1   MLEAVDSSGVVNGG-FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 59

Query: 61  KKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENVQWNGSDMASDDTQ 120
           KKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NGSDM S+DT 
Sbjct: 60  KKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMTSEDTL 119

Query: 121 KTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGSVTMTHGSMKVDLSKLEMAALWRYWRHF 180
           K HKS+ R  RS  SSHKTMSRS S DSQSK S      +MKVDLSKLEM AL  YWRHF
Sbjct: 120 KPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPALLNYWRHF 179

Query: 181 NLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLKSTTVCK 230
           NLVDAIPNPSKEQL+D VQRHFMSQQ+DELQVI+GFVQAAKR+K    CK
Sbjct: 180 NLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKA--CK 227


>AT1G19330.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G75060.1); Has 145 Blast hits to 145 proteins
           in 43 species: Archae - 0; Bacteria - 0; Metazoa - 40;
           Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr1:6680824-6683290 REVERSE
           LENGTH=247
          Length = 247

 Score =  315 bits (806), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 178/235 (75%), Positives = 190/235 (80%), Gaps = 9/235 (3%)

Query: 2   LVEAAMESSVNGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 60
           ++EA   S V  GG F Q+    GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV
Sbjct: 1   MLEAVDSSGVVNGG-FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 59

Query: 61  KKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENVQWNGSDM-----A 115
           KKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NGSDM     A
Sbjct: 60  KKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSFPA 119

Query: 116 SDDTQKTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGSVTMTHGSMKVDLSKLEMAALWR 175
           S+DT K HKS+ R  RS  SSHKTMSRS S DSQSK S      +MKVDLSKLEM AL  
Sbjct: 120 SEDTLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMKVDLSKLEMPALLN 179

Query: 176 YWRHFNLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLKSTTVCK 230
           YWRHFNLVDAIPNPSKEQL+D VQRHFMSQQ+DELQVI+GFVQAAKR+K    CK
Sbjct: 180 YWRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKA--CK 232


>AT1G19330.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G75060.1). | chr1:6680824-6683290 REVERSE
           LENGTH=248
          Length = 248

 Score =  311 bits (798), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 178/236 (75%), Positives = 189/236 (80%), Gaps = 10/236 (4%)

Query: 2   LVEAAMESSVNGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 60
           ++EA   S V  GG F Q+    GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV
Sbjct: 1   MLEAVDSSGVVNGG-FPQIQSFYGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVV 59

Query: 61  KKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENVQWNGSDM-----A 115
           KKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEEDDDL+FEN Q NGSDM     A
Sbjct: 60  KKAVGLGGWHWLVLTNGIEVKLQRNALSVLEPPTGNEEDDDLDFENTQRNGSDMIVSFPA 119

Query: 116 SDDTQKTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGS-VTMTHGSMKVDLSKLEMAALW 174
           S+DT K HKS+ R  RS  SSHKTMSRS S DSQSK S  T      KVDLSKLEM AL 
Sbjct: 120 SEDTLKPHKSKLRGQRSSRSSHKTMSRSLSSDSQSKSSGFTPPENMQKVDLSKLEMPALL 179

Query: 175 RYWRHFNLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLKSTTVCK 230
            YWRHFNLVDAIPNPSKEQL+D VQRHFMSQQ+DELQVI+GFVQAAKR+K    CK
Sbjct: 180 NYWRHFNLVDAIPNPSKEQLIDIVQRHFMSQQMDELQVIVGFVQAAKRMKKA--CK 233


>AT1G75060.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G19330.2); Has 104 Blast hits to 104 proteins
           in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:28181481-28183285 REVERSE
           LENGTH=242
          Length = 242

 Score =  308 bits (788), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 162/229 (70%), Positives = 183/229 (79%), Gaps = 5/229 (2%)

Query: 1   MLVEAAMESSV--NGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 57
           ML     +SS+   GGG FSQL    GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ
Sbjct: 1   MLEAPVNKSSILNGGGGGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 60

Query: 58  GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFEN-VQWN-GSDMA 115
           GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE ++  QWN  SDM 
Sbjct: 61  GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMT 120

Query: 116 SDDTQKTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGSVTMTHGSMKVDLSKLEMAALWR 175
           ++DT K HKS+ R +RS   S K + R  S DS SK S      +MKVDL+KL+MAAL R
Sbjct: 121 TEDTLKPHKSKKRGHRSSRLSQKALYREVSCDSHSKISSITPRLNMKVDLTKLDMAALLR 180

Query: 176 YWRHFNLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLK 224
           YWRHFNLVDA+PNP+KEQL+D +QRHFMSQQ+DELQVI+GFVQAA  +K
Sbjct: 181 YWRHFNLVDALPNPTKEQLIDIIQRHFMSQQMDELQVIVGFVQAATGMK 229


>AT1G75060.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G19330.2);
           Has 104 Blast hits to 104 proteins in 22 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr1:28181481-28183285 REVERSE LENGTH=241
          Length = 241

 Score =  301 bits (771), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 161/229 (70%), Positives = 182/229 (79%), Gaps = 6/229 (2%)

Query: 1   MLVEAAMESSV--NGGGAFSQLHQSCGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 57
           ML     +SS+   GGG FSQL    GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ
Sbjct: 1   MLEAPVNKSSILNGGGGGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQ 60

Query: 58  GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFEN-VQWN-GSDMA 115
           GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSV+E PTGNEED+DLE ++  QWN  SDM 
Sbjct: 61  GVVKKAVGLGGWHWLVLTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMT 120

Query: 116 SDDTQKTHKSRHRMNRSLGSSHKTMSRSFSGDSQSKGSVTMTHGSMKVDLSKLEMAALWR 175
           ++DT K HKS+ R +RS   S K + R  S DS SK S      +M VDL+KL+MAAL R
Sbjct: 121 TEDTLKPHKSKKRGHRSSRLSQKALYREVSCDSHSKISSITPRLNM-VDLTKLDMAALLR 179

Query: 176 YWRHFNLVDAIPNPSKEQLVDGVQRHFMSQQIDELQVIMGFVQAAKRLK 224
           YWRHFNLVDA+PNP+KEQL+D +QRHFMSQQ+DELQVI+GFVQAA  +K
Sbjct: 180 YWRHFNLVDALPNPTKEQLIDIIQRHFMSQQMDELQVIVGFVQAATGMK 228
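

Note: the per-hit statistics lines in this report (Score, Expect, Identities) follow a regular layout, so they can be collected with a short script. The following is a minimal sketch, not part of the BLAST output itself. It assumes the legacy text layout shown above; the function name parse_hsps, the record keys, and the shortened deflines in the sample are illustrative choices, not anything defined by BLAST.

```python
import re

# Two statistics blocks copied from the report above (deflines shortened).
REPORT = """\
>AT1G19330.2 | Symbols:  | unknown protein
 Score =  319 bits (817), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 177/230 (76%), Positives = 189/230 (82%), Gaps = 4/230 (1%)

>AT1G75060.1 | Symbols:  | unknown protein
 Score =  308 bits (788), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 162/229 (70%), Positives = 183/229 (79%), Gaps = 5/229 (2%)
"""

SCORE_RE = re.compile(r"Score =\s*([\d.]+) bits \((\d+)\), Expect = (\S+?),")
IDENT_RE = re.compile(r"Identities = (\d+)/(\d+)")


def parse_hsps(text):
    """Return one dict per Score line, tagged with the preceding '>' subject ID."""
    records = []
    subject = None
    for line in text.splitlines():
        if line.startswith(">"):
            # Subject ID is the first whitespace-delimited token after '>'.
            subject = line[1:].split()[0]
            continue
        m = SCORE_RE.search(line)
        if m:
            evalue = m.group(3)
            # Assumption: some legacy reports print very small values as
            # 'e-100' with no leading digit; normalise so float() accepts it.
            if evalue.startswith("e"):
                evalue = "1" + evalue
            records.append({
                "subject": subject,
                "bits": float(m.group(1)),
                "raw_score": int(m.group(2)),
                "evalue": float(evalue),
            })
            continue
        m = IDENT_RE.search(line)
        if m and records:
            identical, length = int(m.group(1)), int(m.group(2))
            records[-1]["identities"] = identical
            records[-1]["align_len"] = length
            # Unrounded percentage, computed from the printed fraction.
            records[-1]["pct_identity"] = 100.0 * identical / length
    return records


if __name__ == "__main__":
    for rec in parse_hsps(REPORT):
        print(rec["subject"], rec["bits"], rec["evalue"],
              round(rec["pct_identity"], 2))
```

The percentage is recomputed from the fraction rather than read from the parenthesised value, because the printed figures in this report appear to be rounded down (177/230 is about 76.96% but is shown as 76%).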