Miyakogusa Predicted Gene
- Lj1g3v5060720.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v5060720.1 tr|A9SNI1|A9SNI1_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_165894,30.95,4e-19,seg,NULL; DUF4308,Domain of unknown
function DUF4308,CUFF.33970.1
(170 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value
AT2G46820.2 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I...    148  2e-36
AT2G46820.1 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I...    148  2e-36
AT1G52220.1 | Symbols: | FUNCTIONS IN: molecular_function unkno...      90  8e-19
AT1G52220.2 | Symbols: | FUNCTIONS IN: molecular_function unkno...      86  2e-17
AT4G01150.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...      78  3e-15
AT4G38100.1 | Symbols: | unknown protein; LOCATED IN: chloropla...      67  5e-12
AT1G52220.3 | Symbols: | FUNCTIONS IN: molecular_function unkno...      51  4e-07
>AT2G46820.2 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I P
subunit | chr2:19243729-19244870 FORWARD LENGTH=174
Length = 174
Score = 148 bits (373), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 76/119 (63%), Positives = 85/119 (71%), Gaps = 1/119 (0%)
Query: 52  KATAFCRKVARNVMXXXXXXXXXXXXXXXXXLNGGELPEFVKTIQEAWDKVEDKYAVSSL 111
           KATA+CRK+ RNV+                        E VKT QEAW+KV+DKYA+ SL
Sbjct: 48  KATAYCRKIVRNVVTRATTEVGEAPATTTEAETTELP-EIVKTAQEAWEKVDDKYAIGSL 106
Query: 112 GVAGFVALWGSAGVISAIDRIPLVPGVLEVVGIGYTGWFAYKNLVFKPDREALFRKVKE 170
             AG VALWGSAG+ISAIDR+PLVPGVLE+VGIGYTGWF YKNLVFKPDREALF KVK
Sbjct: 107 AFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFTYKNLVFKPDREALFEKVKS 165
>AT2G46820.1 | Symbols: PTAC8, TMP14, PSAP, PSI-P | photosystem I P
subunit | chr2:19243729-19244870 FORWARD LENGTH=174
Length = 174
Score = 148 bits (373), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 76/119 (63%), Positives = 85/119 (71%), Gaps = 1/119 (0%)
Query: 52  KATAFCRKVARNVMXXXXXXXXXXXXXXXXXLNGGELPEFVKTIQEAWDKVEDKYAVSSL 111
           KATA+CRK+ RNV+                        E VKT QEAW+KV+DKYA+ SL
Sbjct: 48  KATAYCRKIVRNVVTRATTEVGEAPATTTEAETTELP-EIVKTAQEAWEKVDDKYAIGSL 106
Query: 112 GVAGFVALWGSAGVISAIDRIPLVPGVLEVVGIGYTGWFAYKNLVFKPDREALFRKVKE 170
             AG VALWGSAG+ISAIDR+PLVPGVLE+VGIGYTGWF YKNLVFKPDREALF KVK
Sbjct: 107 AFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFTYKNLVFKPDREALFEKVKS 165
>AT1G52220.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast thylakoid membrane, chloroplast; EXPRESSED
IN: 23 plant structures; EXPRESSED DURING: 13 growth
stages; BEST Arabidopsis thaliana protein match is:
photosystem I P subunit (TAIR:AT2G46820.2); Has 291
Blast hits to 291 proteins in 50 species: Archae - 0;
Bacteria - 90; Metazoa - 0; Fungi - 0; Plants - 200;
Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
| chr1:19453770-19454605 REVERSE LENGTH=156
Length = 156
Score = 89.7 bits (221), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 56/81 (69%)
Query: 90  EFVKTIQEAWDKVEDKYAVSSLGVAGFVALWGSAGVISAIDRIPLVPGVLEVVGIGYTGW 149
           + V TIQ  WDK ED+  +  LG AG VALW S  +I+AID++P++    E+VGI ++ W
Sbjct: 68  DVVSTIQNVWDKSEDRLGLIGLGFAGIVALWASLNLITAIDKLPVISSGFELVGILFSTW 127
Query: 150 FAYKNLVFKPDREALFRKVKE 170
           F Y+ L+FKPDR+ L + VK+
Sbjct: 128 FTYRYLLFKPDRQELSKIVKK 148
>AT1G52220.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast thylakoid membrane, chloroplast; EXPRESSED
IN: 23 plant structures; EXPRESSED DURING: 13 growth
stages; BEST Arabidopsis thaliana protein match is:
photosystem I P subunit (TAIR:AT2G46820.2); Has 35333
Blast hits to 34131 proteins in 2444 species: Archae -
798; Bacteria - 22429; Metazoa - 974; Fungi - 991;
Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr1:19453770-19454605 REVERSE
LENGTH=155
Length = 155
Score = 85.5 bits (210), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 56/81 (69%), Gaps = 1/81 (1%)
Query: 90  EFVKTIQEAWDKVEDKYAVSSLGVAGFVALWGSAGVISAIDRIPLVPGVLEVVGIGYTGW 149
           + V TIQ  WDK ED+  +  LG AG VALW S  +I+AID++P++    E+VGI ++ W
Sbjct: 68  DVVSTIQN-WDKSEDRLGLIGLGFAGIVALWASLNLITAIDKLPVISSGFELVGILFSTW 126
Query: 150 FAYKNLVFKPDREALFRKVKE 170
           F Y+ L+FKPDR+ L + VK+
Sbjct: 127 FTYRYLLFKPDRQELSKIVKK 147
>AT4G01150.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: thylakoid,
chloroplast thylakoid membrane, chloroplast,
plastoglobule, chloroplast envelope; EXPRESSED IN: 23
plant structures; EXPRESSED DURING: 14 growth stages;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT4G38100.1); Has 323 Blast hits to 323
proteins in 59 species: Archae - 0; Bacteria - 107;
Metazoa - 0; Fungi - 0; Plants - 206; Viruses - 0; Other
Eukaryotes - 10 (source: NCBI BLink). |
chr4:493692-494668 FORWARD LENGTH=164
Length = 164
Score = 77.8 bits (190), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 53/80 (66%)
Query: 90  EFVKTIQEAWDKVEDKYAVSSLGVAGFVALWGSAGVISAIDRIPLVPGVLEVVGIGYTGW 149
           E +  ++E WD +E+K  V   G    VA+W S+ V+ AI+ +PL+P V+E+VG+GYTGW
Sbjct: 75  ELITDLKEKWDGLENKSTVLIYGGGAIVAVWLSSIVVGAINSVPLLPKVMELVGLGYTGW 134
Query: 150 FAYKNLVFKPDREALFRKVK 169
           F Y+ L+FK  R+ L   ++
Sbjct: 135 FVYRYLLFKSSRKELAEDIE 154
>AT4G38100.1 | Symbols: | unknown protein; LOCATED IN: chloroplast
thylakoid membrane; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G01150.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr4:17887033-17888177 REVERSE LENGTH=193
Length = 193
Score = 67.4 bits (163), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Query: 90  EFVKTIQEAWDKVEDKYAVSSLGVAGFVALWGSAGVISAIDRIPLVPGVLEVVGIGYTGW 149
           EF+  I+   DK    Y++   G    VAL+ ++ ++S+++ IPL P ++EVVG+GYT W
Sbjct: 105 EFLNDIKLDSDKT---YSILLYGSGAIVALYLTSAIVSSLEAIPLFPKLMEVVGLGYTLW 161
Query: 150 FAYKNLVFKPDREALFRKVKE 170
           F  + L+FK +RE L  KV E
Sbjct: 162 FTTRYLLFKRNREELKTKVSE 182
>AT1G52220.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast thylakoid membrane, chloroplast; EXPRESSED
IN: 23 plant structures; EXPRESSED DURING: 13 growth
stages; BEST Arabidopsis thaliana protein match is:
photosystem I P subunit (TAIR:AT2G46820.2); Has 251
Blast hits to 251 proteins in 43 species: Archae - 0;
Bacteria - 66; Metazoa - 0; Fungi - 0; Plants - 184;
Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
| chr1:19453770-19454605 REVERSE LENGTH=127
Length = 127
Score = 51.2 bits (121), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 32/43 (74%)
Query: 128 AIDRIPLVPGVLEVVGIGYTGWFAYKNLVFKPDREALFRKVKE 170
           AID++P++    E+VGI ++ WF Y+ L+FKPDR+ L + VK+
Sbjct: 77  AIDKLPVISSGFELVGILFSTWFTYRYLLFKPDRQELSKIVKK 119