Miyakogusa Predicted Gene

Lj0g3v0300909.3

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0300909.3 Non Characterized Hit- tr|A5C9J2|A5C9J2_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,41.07,3e-19,seg,NULL; DUF4308,Domain of unknown function
DUF4308,CUFF.20247.3
         (145 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G52220.1 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   168   1e-42
AT1G52220.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   161   1e-40
AT1G52220.3 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   100   5e-22
AT2G46820.2 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I...    98   2e-21
AT2G46820.1 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I...    98   2e-21
AT4G01150.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    71   3e-13
AT4G38100.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...    62   1e-10
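
The summary table above can be read programmatically. The following is a minimal Python sketch, assuming each summary line keeps the layout shown here (sequence ID, truncated description, bit score, E-value). The names SUMMARY, LINE_RE and parse_summary are illustrative and are not part of BLAST.

```python
import re

# Summary lines copied from the hit table above; BLAST truncates
# the descriptions with "...".
SUMMARY = """\
AT1G52220.1 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   168   1e-42
AT1G52220.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   161   1e-40
AT1G52220.3 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   100   5e-22
AT2G46820.2 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I...    98   2e-21
AT2G46820.1 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I...    98   2e-21
AT4G01150.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    71   3e-13
AT4G38100.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...    62   1e-10
"""

# Assumed line layout: ID, " | ", free-text description, bit score, E-value.
LINE_RE = re.compile(
    r"^(\S+)\s+\|.*?\s+(\d+(?:\.\d+)?)\s+(\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s*$"
)

def parse_summary(text):
    """Return a list of (sequence_id, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match:
            seq_id, bits, evalue = match.groups()
            hits.append((seq_id, float(bits), float(evalue)))
    return hits

hits = parse_summary(SUMMARY)

# Collapse splice variants (".1", ".2", ...) to their gene locus.
loci = sorted({seq_id.split(".")[0] for seq_id, _, _ in hits})

for seq_id, bits, evalue in hits:
    print(f"{seq_id}\t{bits:g}\t{evalue:g}")
print("distinct loci:", ", ".join(loci))
```

Collapsing the gene-model suffix shows that the seven hits come from four Arabidopsis loci.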

>AT1G52220.1 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           chloroplast thylakoid membrane, chloroplast; EXPRESSED
           IN: 23 plant structures; EXPRESSED DURING: 13 growth
           stages; BEST Arabidopsis thaliana protein match is:
           photosystem I P subunit (TAIR:AT2G46820.2); Has 291
           Blast hits to 291 proteins in 50 species: Archae - 0;
           Bacteria - 90; Metazoa - 0; Fungi - 0; Plants - 200;
           Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
           | chr1:19453770-19454605 REVERSE LENGTH=156
          Length = 156

 Score =  168 bits (425), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 89/156 (57%), Positives = 105/156 (67%), Gaps = 11/156 (7%)

Query: 1   MASIAASLQPPLLLHGRKSHTGNFPSFPVSLLPGRRNL-----------IALVVKASGXX 49
           MASI+A+L  PLLL  RKS+  +    P SL  G  +L           I+L+VKASG  
Sbjct: 1   MASISATLPSPLLLTQRKSNLTSIQKLPFSLTRGTNDLSPLSLTRNPSSISLMVKASGES 60

Query: 50  XXXXXXXXXXXXXXNVWDKPEDRLGLIGFGFAGIVALWASANLITAVDQLPVLPTVLELI 109
                         NVWDK EDRLGLIG GFAGIVALWAS NLITA+D+LPV+ +  EL+
Sbjct: 61  SDSSTDLDVVSTIQNVWDKSEDRLGLIGLGFAGIVALWASLNLITAIDKLPVISSGFELV 120

Query: 110 GILFSVWFTYRYLLFKPDREELFQILNKSVSDILGQ 145
           GILFS WFTYRYLLFKPDR+EL +I+ KSV+DILGQ
Sbjct: 121 GILFSTWFTYRYLLFKPDRQELSKIVKKSVADILGQ 156


>AT1G52220.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           chloroplast thylakoid membrane, chloroplast; EXPRESSED
           IN: 23 plant structures; EXPRESSED DURING: 13 growth
           stages; BEST Arabidopsis thaliana protein match is:
           photosystem I P subunit (TAIR:AT2G46820.2); Has 35333
           Blast hits to 34131 proteins in 2444 species: Archae -
           798; Bacteria - 22429; Metazoa - 974; Fungi - 991;
           Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr1:19453770-19454605 REVERSE
           LENGTH=155
          Length = 155

 Score =  161 bits (408), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 88/156 (56%), Positives = 104/156 (66%), Gaps = 12/156 (7%)

Query: 1   MASIAASLQPPLLLHGRKSHTGNFPSFPVSLLPGRRNL-----------IALVVKASGXX 49
           MASI+A+L  PLLL  RKS+  +    P SL  G  +L           I+L+VKASG  
Sbjct: 1   MASISATLPSPLLLTQRKSNLTSIQKLPFSLTRGTNDLSPLSLTRNPSSISLMVKASGES 60

Query: 50  XXXXXXXXXXXXXXNVWDKPEDRLGLIGFGFAGIVALWASANLITAVDQLPVLPTVLELI 109
                         N WDK EDRLGLIG GFAGIVALWAS NLITA+D+LPV+ +  EL+
Sbjct: 61  SDSSTDLDVVSTIQN-WDKSEDRLGLIGLGFAGIVALWASLNLITAIDKLPVISSGFELV 119

Query: 110 GILFSVWFTYRYLLFKPDREELFQILNKSVSDILGQ 145
           GILFS WFTYRYLLFKPDR+EL +I+ KSV+DILGQ
Sbjct: 120 GILFSTWFTYRYLLFKPDRQELSKIVKKSVADILGQ 155


>AT1G52220.3 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           chloroplast thylakoid membrane, chloroplast; EXPRESSED
           IN: 23 plant structures; EXPRESSED DURING: 13 growth
           stages; BEST Arabidopsis thaliana protein match is:
           photosystem I P subunit (TAIR:AT2G46820.2); Has 251
           Blast hits to 251 proteins in 43 species: Archae - 0;
           Bacteria - 66; Metazoa - 0; Fungi - 0; Plants - 184;
           Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
           | chr1:19453770-19454605 REVERSE LENGTH=127
          Length = 127

 Score = 99.8 bits (247), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 79/156 (50%), Gaps = 40/156 (25%)

Query: 1   MASIAASLQPPLLLHGRKSHTGNFPSFPVSLLPGRRNL-----------IALVVKASGXX 49
           MASI+A+L  PLLL  RKS+  +    P SL  G  +L           I+L+VKASG  
Sbjct: 1   MASISATLPSPLLLTQRKSNLTSIQKLPFSLTRGTNDLSPLSLTRNPSSISLMVKASGES 60

Query: 50  XXXXXXXXXXXXXXNVWDKPEDRLGLIGFGFAGIVALWASANLITAVDQLPVLPTVLELI 109
                         NV                             A+D+LPV+ +  EL+
Sbjct: 61  SDSSTDLDVVSTIQNV-----------------------------AIDKLPVISSGFELV 91

Query: 110 GILFSVWFTYRYLLFKPDREELFQILNKSVSDILGQ 145
           GILFS WFTYRYLLFKPDR+EL +I+ KSV+DILGQ
Sbjct: 92  GILFSTWFTYRYLLFKPDRQELSKIVKKSVADILGQ 127


>AT2G46820.2 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I P
           subunit | chr2:19243729-19244870 FORWARD LENGTH=174
          Length = 174

 Score = 98.2 bits (243), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 41/82 (50%), Positives = 60/82 (73%)

Query: 64  NVWDKPEDRLGLIGFGFAGIVALWASANLITAVDQLPVLPTVLELIGILFSVWFTYRYLL 123
             W+K +D+  +    FAG+VALW SA +I+A+D+LP++P VLEL+GI ++ WFTY+ L+
Sbjct: 92  EAWEKVDDKYAIGSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFTYKNLV 151

Query: 124 FKPDREELFQILNKSVSDILGQ 145
           FKPDRE LF+ +  +  DILG 
Sbjct: 152 FKPDREALFEKVKSTYKDILGS 173


>AT2G46820.1 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I P
           subunit | chr2:19243729-19244870 FORWARD LENGTH=174
          Length = 174

 Score = 98.2 bits (243), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 41/82 (50%), Positives = 60/82 (73%)

Query: 64  NVWDKPEDRLGLIGFGFAGIVALWASANLITAVDQLPVLPTVLELIGILFSVWFTYRYLL 123
             W+K +D+  +    FAG+VALW SA +I+A+D+LP++P VLEL+GI ++ WFTY+ L+
Sbjct: 92  EAWEKVDDKYAIGSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFTYKNLV 151

Query: 124 FKPDREELFQILNKSVSDILGQ 145
           FKPDRE LF+ +  +  DILG 
Sbjct: 152 FKPDREALFEKVKSTYKDILGS 173


>AT4G01150.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: thylakoid,
           chloroplast thylakoid membrane, chloroplast,
           plastoglobule, chloroplast envelope; EXPRESSED IN: 23
           plant structures; EXPRESSED DURING: 14 growth stages;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT4G38100.1); Has 323 Blast hits to 323
           proteins in 59 species: Archae - 0; Bacteria - 107;
           Metazoa - 0; Fungi - 0; Plants - 206; Viruses - 0; Other
           Eukaryotes - 10 (source: NCBI BLink). |
           chr4:493692-494668 FORWARD LENGTH=164
          Length = 164

 Score = 70.9 bits (172), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 52/80 (65%)

Query: 66  WDKPEDRLGLIGFGFAGIVALWASANLITAVDQLPVLPTVLELIGILFSVWFTYRYLLFK 125
           WD  E++  ++ +G   IVA+W S+ ++ A++ +P+LP V+EL+G+ ++ WF YRYLLFK
Sbjct: 84  WDGLENKSTVLIYGGGAIVAVWLSSIVVGAINSVPLLPKVMELVGLGYTGWFVYRYLLFK 143

Query: 126 PDREELFQILNKSVSDILGQ 145
             R+EL + +      I G 
Sbjct: 144 SSRKELAEDIESLKKKIAGS 163


>AT4G38100.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast
           thylakoid membrane; EXPRESSED IN: 23 plant structures;
           EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G01150.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr4:17887033-17888177 REVERSE LENGTH=193
          Length = 193

 Score = 62.0 bits (149), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 49/67 (73%)

Query: 78  FGFAGIVALWASANLITAVDQLPVLPTVLELIGILFSVWFTYRYLLFKPDREELFQILNK 137
           +G   IVAL+ ++ ++++++ +P+ P ++E++G+ +++WFT RYLLFK +REEL   +++
Sbjct: 123 YGSGAIVALYLTSAIVSSLEAIPLFPKLMEVVGLGYTLWFTTRYLLFKRNREELKTKVSE 182

Query: 138 SVSDILG 144
               +LG
Sbjct: 183 IKKQVLG 189