Miyakogusa Predicted Gene
- Lj3g3v2693120.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2693120.1 Non Chatacterized Hit- tr|C5XNT4|C5XNT4_SORBI
Putative uncharacterized protein Sb03g025940 OS=Sorghu,38.93,1e-17,
,CUFF.44434.1
(173 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G55160.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 141 2e-34
AT1G55160.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 138 2e-33
AT1G55160.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 128 2e-30
AT2G19530.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 80 6e-16
>AT1G55160.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion,
plastid; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G19530.1);
Has 63 Blast hits to 63 proteins in 14 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 63;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:20578476-20579803 FORWARD LENGTH=188
Length = 188
Score = 141 bits (355), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/196 (44%), Positives = 113/196 (57%), Gaps = 43/196 (21%)
Query: 9 APKLYNHKPRK----AQLKQSKGQHKFSSPP-----------------GMGTQTTAXXXX 47
PKL+ +KP+K AQLK + F++P MG +
Sbjct: 5 TPKLFTNKPKKKAIIAQLKHVEAN--FNNPTVPPSSKPSPAAAAAASYTMGGGSVPPPPP 62
Query: 48 XXXKESFARRYKYLWPMLLAVNLGVGAYLFVRTKKKDI--------GEEEQDASPVPVKE 99
KESFARRYKY+WP+LL VNL VG YLF RTKKKD+ + A+PV V++
Sbjct: 63 P--KESFARRYKYVWPLLLTVNLAVGGYLFFRTKKKDLDPVVEETAAKSSSVAAPVTVEK 120
Query: 100 TVAHVAETRVSPAPIASPVI--EREPIPVDQQRELFKWILEEKRKVKPKDAEEKRKIDEE 157
T++ +A PV+ REPIP QQRELFKW+LEEKRKV PK+AEEK++ DEE
Sbjct: 121 TLSSTV--------VAEPVVIKAREPIPEKQQRELFKWMLEEKRKVNPKNAEEKKRNDEE 172
Query: 158 KALLKNLIRSKSIPSI 173
KA+LK I SK+IP+
Sbjct: 173 KAILKQFIGSKTIPTF 188
>AT1G55160.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion,
plastid; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G19530.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr1:20578804-20579803
FORWARD LENGTH=137
Length = 137
Score = 138 bits (348), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 74/133 (55%), Positives = 92/133 (69%), Gaps = 18/133 (13%)
Query: 51 KESFARRYKYLWPMLLAVNLGVGAYLFVRTKKKDI--------GEEEQDASPVPVKETVA 102
KESFARRYKY+WP+LL VNL VG YLF RTKKKD+ + A+PV V++T++
Sbjct: 13 KESFARRYKYVWPLLLTVNLAVGGYLFFRTKKKDLDPVVEETAAKSSSVAAPVTVEKTLS 72
Query: 103 HVAETRVSPAPIASPVI--EREPIPVDQQRELFKWILEEKRKVKPKDAEEKRKIDEEKAL 160
+A PV+ REPIP QQRELFKW+LEEKRKV PK+AEEK++ DEEKA+
Sbjct: 73 STV--------VAEPVVIKAREPIPEKQQRELFKWMLEEKRKVNPKNAEEKKRNDEEKAI 124
Query: 161 LKNLIRSKSIPSI 173
LK I SK+IP+
Sbjct: 125 LKQFIGSKTIPTF 137
>AT1G55160.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion,
plastid; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G19530.1);
Has 63 Blast hits to 63 proteins in 14 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 63;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:20578476-20579803 FORWARD LENGTH=213
Length = 213
Score = 128 bits (322), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 114/221 (51%), Gaps = 68/221 (30%)
Query: 9 APKLYNHKPRK----AQLKQSKGQHKFSSPP-----------------GMGTQTTAXXXX 47
PKL+ +KP+K AQLK + F++P MG +
Sbjct: 5 TPKLFTNKPKKKAIIAQLKHVEAN--FNNPTVPPSSKPSPAAAAAASYTMGGGSVPPPPP 62
Query: 48 XXXKESFARRYKYLWPMLLAVNLGVG-------------------------AYLFVRTKK 82
KESFARRYKY+WP+LL VNL VG +YLF RTKK
Sbjct: 63 P--KESFARRYKYVWPLLLTVNLAVGGFCSSLDENRIVFSFIFMMLRVIYDSYLFFRTKK 120
Query: 83 KDI--------GEEEQDASPVPVKETVAHVAETRVSPAPIASPVI--EREPIPVDQQREL 132
KD+ + A+PV V++T+ S +A PV+ REPIP QQREL
Sbjct: 121 KDLDPVVEETAAKSSSVAAPVTVEKTL--------SSTVVAEPVVIKAREPIPEKQQREL 172
Query: 133 FKWILEEKRKVKPKDAEEKRKIDEEKALLKNLIRSKSIPSI 173
FKW+LEEKRKV PK+AEEK++ DEEKA+LK I SK+IP+
Sbjct: 173 FKWMLEEKRKVNPKNAEEKKRNDEEKAILKQFIGSKTIPTF 213
>AT2G19530.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G55160.2); Has 461 Blast hits to 346 proteins
in 80 species: Archae - 0; Bacteria - 16; Metazoa - 89;
Fungi - 28; Plants - 57; Viruses - 0; Other Eukaryotes -
271 (source: NCBI BLink). | chr2:8460219-8461486 FORWARD
LENGTH=202
Length = 202
Score = 80.5 bits (197), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 77/161 (47%), Gaps = 49/161 (30%)
Query: 62 WPMLLAVNLGVGAYLFVRTKKKDI-------------------------------GEEEQ 90
W + NLG AY+F ++KDI G EE
Sbjct: 23 WRATMIFNLGFAAYIFAIKREKDIDADEKKKVKKGSEARHKGVKKGAVNTEIEKKGAEET 82
Query: 91 D-----ASPVPVKETVAHVAE-------TRVSPAPIASPV------IEREPIPVDQQREL 132
D + +P KE + E T + + V + R+PIP D+Q+EL
Sbjct: 83 DKAKEAETAIPEKEETKLIPELDPLFEFTDATDQSMFQTVATEHVKVARKPIPEDEQKEL 142
Query: 133 FKWILEEKRKVKPKDAEEKRKIDEEKALLKNLIRSKSIPSI 173
FKWILEEKRK++PKD +EK++IDEEKA+LK IR++ IP +
Sbjct: 143 FKWILEEKRKIEPKDRKEKKQIDEEKAILKQFIRAERIPKL 183