Miyakogusa Predicted Gene
- Lj4g3v1880630.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1880630.1 tr|Q9XI35|Q9XI35_ARATH AT1G15350 protein
OS=Arabidopsis thaliana GN=F9L1.29 PE=4
SV=1,42.73,3e-19,DUF4050,Domain of unknown function DUF4050;
seg,NULL,CUFF.49811.1
(173 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 172 1e-43
AT5G25360.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 172 1e-43
AT4G32342.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 127 4e-30
AT3G15770.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 123 5e-29
AT3G15770.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 123 5e-29
AT1G15350.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 122 9e-29
AT1G15350.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 122 9e-29
AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 100 5e-22
AT3G54880.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 80 8e-16
AT5G03440.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 68 4e-12
AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 68 4e-12
>AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
LENGTH=169
Length = 169
Score = 172 bits (435), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 84/168 (50%), Positives = 121/168 (72%), Gaps = 5/168 (2%)
Query: 5 IRTAWITDLFAXXXXXXXXXXKPTPIIAVDEPAKGLRIQGQSVRKPTISDGFWSSSTCDL 64
I +WI LF KP I+AVDEP+KGLRIQG+ V+KP++S+ FWS+STC++
Sbjct: 6 IIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEM 65
Query: 65 DNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQ 124
DNST+QSQ+S+SS+S NN ++ ++S EFVN GL LWN++R QW+ +G S+K+
Sbjct: 66 DNSTLQSQRSMSSISFTNN----TSTSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKK 121
Query: 125 TQQKRDPKLNWNTSYESLIGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
+ R+P ++WN +YESL+G + F + +PL EMV+FLVDVWE+EG+Y
Sbjct: 122 A-KVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLY 168
>AT5G25360.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:8799934-8802333
REVERSE LENGTH=169
Length = 169
Score = 172 bits (435), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 84/168 (50%), Positives = 121/168 (72%), Gaps = 5/168 (2%)
Query: 5 IRTAWITDLFAXXXXXXXXXXKPTPIIAVDEPAKGLRIQGQSVRKPTISDGFWSSSTCDL 64
I +WI LF KP I+AVDEP+KGLRIQG+ V+KP++S+ FWS+STC++
Sbjct: 6 IIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEM 65
Query: 65 DNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQ 124
DNST+QSQ+S+SS+S NN ++ ++S EFVN GL LWN++R QW+ +G S+K+
Sbjct: 66 DNSTLQSQRSMSSISFTNN----TSTSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKK 121
Query: 125 TQQKRDPKLNWNTSYESLIGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
+ R+P ++WN +YESL+G + F + +PL EMV+FLVDVWE+EG+Y
Sbjct: 122 A-KVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLY 168
>AT4G32342.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25360.2);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr4:15615184-15616200
REVERSE LENGTH=161
Length = 161
Score = 127 bits (319), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 69/145 (47%), Positives = 95/145 (65%), Gaps = 14/145 (9%)
Query: 30 IIAVDEPAKGLRIQGQSVRKPTIS-DGFWSSSTCDLD-NSTIQSQKSISSVSTLNNILYH 87
++ VDEP+KGL+IQG+ V+K + S D FWS+STCD+D N TIQSQ S N +
Sbjct: 28 VVEVDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNITIQSQSS--------NPPFD 79
Query: 88 SNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESLIGTRQ 147
++S T EFVN GL+LWN +R QW +Q +P ++WN++Y+SL+ T +
Sbjct: 80 PQCSTSNST--EFVNHGLILWNHTRQQWRECL--TRQQCLVPEPAISWNSTYDSLLSTNK 135
Query: 148 PFPKSVPLSEMVEFLVDVWEREGMY 172
FP+ +PL EMV FLVDVWE EG+Y
Sbjct: 136 LFPQPIPLKEMVHFLVDVWEEEGLY 160
>AT3G15770.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
LENGTH=162
Length = 162
Score = 123 bits (309), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 60/129 (46%), Positives = 91/129 (70%), Gaps = 5/129 (3%)
Query: 47 VRKPTI--SDGFWSSSTCDLDNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEG 104
+RKP++ S+ FW+++T D++++ + S+SS+ST N + SS+ AEFVN G
Sbjct: 37 LRKPSVVASEDFWTNTTLDMESN---AHGSVSSISTTNLTIDSQGCGSSSNEPAEFVNHG 93
Query: 105 LLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESLIGTRQPFPKSVPLSEMVEFLVD 164
L+LWN++R QWVG RS+ + R+P LN N +YESL+G+ + FP+ +PL EMV+FLV+
Sbjct: 94 LVLWNQTRQQWVGDKRSESRKSVGREPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVE 153
Query: 165 VWEREGMYG 173
VWE EG+YG
Sbjct: 154 VWEEEGLYG 162
>AT3G15770.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr3:5340243-5341216 FORWARD LENGTH=161
Length = 161
Score = 123 bits (309), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 60/129 (46%), Positives = 91/129 (70%), Gaps = 5/129 (3%)
Query: 47 VRKPTI--SDGFWSSSTCDLDNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEG 104
+RKP++ S+ FW+++T D++++ + S+SS+ST N + SS+ AEFVN G
Sbjct: 36 LRKPSVVASEDFWTNTTLDMESN---AHGSVSSISTTNLTIDSQGCGSSSNEPAEFVNHG 92
Query: 105 LLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESLIGTRQPFPKSVPLSEMVEFLVD 164
L+LWN++R QWVG RS+ + R+P LN N +YESL+G+ + FP+ +PL EMV+FLV+
Sbjct: 93 LVLWNQTRQQWVGDKRSESRKSVGREPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVE 152
Query: 165 VWEREGMYG 173
VWE EG+YG
Sbjct: 153 VWEEEGLYG 161
>AT1G15350.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
LENGTH=154
Length = 154
Score = 122 bits (307), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 59/140 (42%), Positives = 91/140 (65%), Gaps = 7/140 (5%)
Query: 34 DEPAKGLRIQGQSVRKPTISDGFWSSSTCDLDNSTIQSQKSISSVSTLNNILYHSNGTSS 93
D P+ + + +KP++S+ FWS+ST D+DN T SQ S+SS N + +
Sbjct: 20 DPPSNSI---ARPCKKPSVSEDFWSTSTVDMDNITFPSQGSLSSS---NQTFDSQSAARN 73
Query: 94 AGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTS-YESLIGTRQPFPKS 152
+ E+VN+GLLLWN++R +WVG + + KLNWNT+ Y+SL+G+ + FP+
Sbjct: 74 SNAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQP 133
Query: 153 VPLSEMVEFLVDVWEREGMY 172
+PL+EMV+FLVD+WE+EG+Y
Sbjct: 134 IPLTEMVDFLVDIWEQEGLY 153
>AT1G15350.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
LENGTH=154
Length = 154
Score = 122 bits (307), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 59/140 (42%), Positives = 91/140 (65%), Gaps = 7/140 (5%)
Query: 34 DEPAKGLRIQGQSVRKPTISDGFWSSSTCDLDNSTIQSQKSISSVSTLNNILYHSNGTSS 93
D P+ + + +KP++S+ FWS+ST D+DN T SQ S+SS N + +
Sbjct: 20 DPPSNSI---ARPCKKPSVSEDFWSTSTVDMDNITFPSQGSLSSS---NQTFDSQSAARN 73
Query: 94 AGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTS-YESLIGTRQPFPKS 152
+ E+VN+GLLLWN++R +WVG + + KLNWNT+ Y+SL+G+ + FP+
Sbjct: 74 SNAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQP 133
Query: 153 VPLSEMVEFLVDVWEREGMY 172
+PL+EMV+FLVD+WE+EG+Y
Sbjct: 134 IPLTEMVDFLVDIWEQEGLY 153
>AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
hits to 145 proteins in 25 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
Other Eukaryotes - 7 (source: NCBI BLink). |
chr1:5278481-5279056 REVERSE LENGTH=108
Length = 108
Score = 100 bits (249), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 72/110 (65%), Gaps = 4/110 (3%)
Query: 64 LDNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKK 123
+DN T SQ S+SS N + ++ E+VN+GLLLWN++R +WVG +
Sbjct: 1 MDNITFPSQGSLSSS---NQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDKPNN 57
Query: 124 QTQQKRDPKLNWNTS-YESLIGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
+ KLNWNT+ Y+SL+G+ + FP+ +PL+EMV+FLVD+WE+EG+Y
Sbjct: 58 PVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLY 107
>AT3G54880.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
LENGTH=112
Length = 112
Score = 79.7 bits (195), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 56/81 (69%), Gaps = 1/81 (1%)
Query: 92 SSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESLIGTRQPFPK 151
+++ T VN G +W E+R +WVG +S+++ +D ++W+T+YE L+ T +PF +
Sbjct: 32 TNSENTLTLVNHGAKMWQENREKWVG-DQSRQRKNTAKDQIISWSTTYEDLLSTHEPFSE 90
Query: 152 SVPLSEMVEFLVDVWEREGMY 172
S+PL EMV+FLVD+W EG+Y
Sbjct: 91 SIPLPEMVDFLVDIWYDEGLY 111
>AT5G03440.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:857179-857898 REVERSE LENGTH=98
Length = 98
Score = 67.8 bits (164), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 83 NILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESL 142
N+ +S+ + FVN + W E R +WVG S + ++ +P + +N +YE L
Sbjct: 9 NVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGD-PSNRTSEMPDEPVIGFNATYEDL 67
Query: 143 IGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
+ + PF K +PL+EMV+FL D+W +G++
Sbjct: 68 LTSNTPFNKPIPLAEMVDFLFDIWHGDGLF 97
>AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:857179-857898 REVERSE LENGTH=98
Length = 98
Score = 67.8 bits (164), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 83 NILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESL 142
N+ +S+ + FVN + W E R +WVG S + ++ +P + +N +YE L
Sbjct: 9 NVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGD-PSNRTSEMPDEPVIGFNATYEDL 67
Query: 143 IGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
+ + PF K +PL+EMV+FL D+W +G++
Sbjct: 68 LTSNTPFNKPIPLAEMVDFLFDIWHGDGLF 97