Miyakogusa Predicted Gene
- Lj1g3v0013250.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0013250.1 Non Chatacterized Hit- tr|K3YW60|K3YW60_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si018492,49.57,3e-19,DUF4050,Domain of unknown function
DUF4050,NODE_83668_length_744_cov_26.380377.path1.1
(140 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 156 3e-39
AT5G25360.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 156 3e-39
AT4G32342.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 143 4e-35
AT3G15770.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 135 1e-32
AT3G15770.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 135 1e-32
AT1G15350.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 119 6e-28
AT1G15350.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 119 6e-28
AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 105 7e-24
AT5G03440.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 84 2e-17
AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 84 2e-17
AT3G54880.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 78 2e-15
>AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
LENGTH=169
Length = 169
Score = 156 bits (395), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 81/140 (57%), Positives = 101/140 (72%), Gaps = 4/140 (2%)
Query: 1 MDEASKGLRNQGQTVTKDDGSSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSAC 60
+DE SKGLR QG+ V K S DFWS++T EMD++ QS RS+SS + +N S++
Sbjct: 34 VDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSISFTNN----TSTSA 89
Query: 61 SQTAPPEFVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPI 120
S + P EFVNHGL W Q RQQW+ N TS+ + +VREP IS NATY++LLG NK F +PI
Sbjct: 90 STSNPTEFVNHGLNLWNQTRQQWLANGTSQKKAKVREPTISWNATYESLLGMNKRFSRPI 149
Query: 121 PLREMVYFLVDIWEQEGLYD 140
PL EMV FLVD+WEQEGLYD
Sbjct: 150 PLPEMVDFLVDVWEQEGLYD 169
>AT5G25360.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:8799934-8802333
REVERSE LENGTH=169
Length = 169
Score = 156 bits (395), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 81/140 (57%), Positives = 101/140 (72%), Gaps = 4/140 (2%)
Query: 1 MDEASKGLRNQGQTVTKDDGSSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSAC 60
+DE SKGLR QG+ V K S DFWS++T EMD++ QS RS+SS + +N S++
Sbjct: 34 VDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSISFTNN----TSTSA 89
Query: 61 SQTAPPEFVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPI 120
S + P EFVNHGL W Q RQQW+ N TS+ + +VREP IS NATY++LLG NK F +PI
Sbjct: 90 STSNPTEFVNHGLNLWNQTRQQWLANGTSQKKAKVREPTISWNATYESLLGMNKRFSRPI 149
Query: 121 PLREMVYFLVDIWEQEGLYD 140
PL EMV FLVD+WEQEGLYD
Sbjct: 150 PLPEMVDFLVDVWEQEGLYD 169
>AT4G32342.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25360.2);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr4:15615184-15616200
REVERSE LENGTH=161
Length = 161
Score = 143 bits (361), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 77/140 (55%), Positives = 94/140 (67%), Gaps = 11/140 (7%)
Query: 1 MDEASKGLRNQGQTVTKDDGSSD-FWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSA 59
+DE SKGL+ QG+ V KD SSD FWS++T +MDHN + +SN P DPQ
Sbjct: 31 VDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNIT------IQSQSSNPPFDPQ--- 81
Query: 60 CSQTAPPEFVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQP 119
CS + EFVNHGL+ W RQQW T + Q V EP IS N+TYD+LL NK FPQP
Sbjct: 82 CSTSNSTEFVNHGLILWNHTRQQWRECLTRQ-QCLVPEPAISWNSTYDSLLSTNKLFPQP 140
Query: 120 IPLREMVYFLVDIWEQEGLY 139
IPL+EMV+FLVD+WE+EGLY
Sbjct: 141 IPLKEMVHFLVDVWEEEGLY 160
>AT3G15770.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr3:5340243-5341216 FORWARD LENGTH=161
Length = 161
Score = 135 bits (340), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 67/121 (55%), Positives = 86/121 (71%), Gaps = 4/121 (3%)
Query: 20 GSSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQM 79
S DFW++ T++M+ NA S VSS +T+N D Q S P EFVNHGL+ W Q
Sbjct: 43 ASEDFWTNTTLDMESNAHGS---VSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQT 99
Query: 80 RQQWVGNKTSESQTEV-REPKISLNATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGL 138
RQQWVG+K SES+ V REP ++ N TY++LLG+NK FP+PIPL EMV FLV++WE+EGL
Sbjct: 100 RQQWVGDKRSESRKSVGREPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGL 159
Query: 139 Y 139
Y
Sbjct: 160 Y 160
>AT3G15770.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
LENGTH=162
Length = 162
Score = 135 bits (339), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 67/121 (55%), Positives = 86/121 (71%), Gaps = 4/121 (3%)
Query: 20 GSSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQM 79
S DFW++ T++M+ NA S VSS +T+N D Q S P EFVNHGL+ W Q
Sbjct: 44 ASEDFWTNTTLDMESNAHGS---VSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQT 100
Query: 80 RQQWVGNKTSESQTEV-REPKISLNATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGL 138
RQQWVG+K SES+ V REP ++ N TY++LLG+NK FP+PIPL EMV FLV++WE+EGL
Sbjct: 101 RQQWVGDKRSESRKSVGREPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGL 160
Query: 139 Y 139
Y
Sbjct: 161 Y 161
>AT1G15350.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
LENGTH=154
Length = 154
Score = 119 bits (298), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 68/122 (55%), Positives = 86/122 (70%), Gaps = 5/122 (4%)
Query: 21 SSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQMR 80
S DFWS++T++MD+ S S+SS SN D QS+A + APPE+VN GLL W Q R
Sbjct: 36 SEDFWSTSTVDMDNITFPSQGSLSS---SNQTFDSQSAARNSNAPPEYVNQGLLLWNQTR 92
Query: 81 QQWVG-NKTSESQTEVREPKISLN-ATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGL 138
++WVG +K + + K++ N ATYD+LLG+NK FPQPIPL EMV FLVDIWEQEGL
Sbjct: 93 ERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 152
Query: 139 YD 140
YD
Sbjct: 153 YD 154
>AT1G15350.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
LENGTH=154
Length = 154
Score = 119 bits (298), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 68/122 (55%), Positives = 86/122 (70%), Gaps = 5/122 (4%)
Query: 21 SSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQMR 80
S DFWS++T++MD+ S S+SS SN D QS+A + APPE+VN GLL W Q R
Sbjct: 36 SEDFWSTSTVDMDNITFPSQGSLSS---SNQTFDSQSAARNSNAPPEYVNQGLLLWNQTR 92
Query: 81 QQWVG-NKTSESQTEVREPKISLN-ATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGL 138
++WVG +K + + K++ N ATYD+LLG+NK FPQPIPL EMV FLVDIWEQEGL
Sbjct: 93 ERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 152
Query: 139 YD 140
YD
Sbjct: 153 YD 154
>AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
hits to 145 proteins in 25 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
Other Eukaryotes - 7 (source: NCBI BLink). |
chr1:5278481-5279056 REVERSE LENGTH=108
Length = 108
Score = 105 bits (263), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 58/101 (57%), Positives = 72/101 (71%), Gaps = 2/101 (1%)
Query: 42 SVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQMRQQWVG-NKTSESQTEVREPKI 100
S S ++SN D QS+A + APPE+VN GLL W Q R++WVG +K + + K+
Sbjct: 8 SQGSLSSSNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKL 67
Query: 101 SLN-ATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGLYD 140
+ N ATYD+LLG+NK FPQPIPL EMV FLVDIWEQEGLYD
Sbjct: 68 NWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 108
>AT5G03440.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:857179-857898 REVERSE LENGTH=98
Length = 98
Score = 84.3 bits (207), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 36/73 (49%), Positives = 51/73 (69%)
Query: 68 FVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPIPLREMVY 127
FVNH + W +MR++WVG+ ++ + EP I NATY++LL +N PF +PIPL EMV
Sbjct: 26 FVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYEDLLTSNTPFNKPIPLAEMVD 85
Query: 128 FLVDIWEQEGLYD 140
FL DIW +GL++
Sbjct: 86 FLFDIWHGDGLFE 98
>AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:857179-857898 REVERSE LENGTH=98
Length = 98
Score = 84.3 bits (207), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 36/73 (49%), Positives = 51/73 (69%)
Query: 68 FVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPIPLREMVY 127
FVNH + W +MR++WVG+ ++ + EP I NATY++LL +N PF +PIPL EMV
Sbjct: 26 FVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYEDLLTSNTPFNKPIPLAEMVD 85
Query: 128 FLVDIWEQEGLYD 140
FL DIW +GL++
Sbjct: 86 FLFDIWHGDGLFE 98
>AT3G54880.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
LENGTH=112
Length = 112
Score = 77.8 bits (190), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 50/73 (68%)
Query: 68 FVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPIPLREMVY 127
VNHG W + R++WVG+++ + + ++ IS + TY++LL ++PF + IPL EMV
Sbjct: 40 LVNHGAKMWQENREKWVGDQSRQRKNTAKDQIISWSTTYEDLLSTHEPFSESIPLPEMVD 99
Query: 128 FLVDIWEQEGLYD 140
FLVDIW EGLYD
Sbjct: 100 FLVDIWYDEGLYD 112