Miyakogusa Predicted Gene
- Lj1g3v1911640.5
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1911640.5 Non Characterized Hit- tr|K3YW60|K3YW60_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si018492,38.69,2e-19,DUF4050,Domain of unknown function
DUF4050,CUFF.28111.5
(170 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 155 1e-38
AT5G25360.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 155 1e-38
AT4G32342.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 120 7e-28
AT3G15770.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 119 7e-28
AT3G15770.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 115 2e-26
AT1G15350.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 114 4e-26
AT1G15350.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 114 4e-26
AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 95 2e-20
AT3G54880.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 87 4e-18
AT5G03440.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 78 3e-15
AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 78 3e-15
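The summary table above gives one hit per line, ending in the bit score and the E-value. A minimal Python sketch for pulling those three fields out of such lines follows. The names `parse_hit_line` and `SUMMARY` are hypothetical and not part of BLAST; the sketch assumes the last two whitespace-separated tokens are always the score and the E-value, which holds for the lines shown here but is not guaranteed for every report.

```python
# Sample lines copied from the summary table in this report.
SUMMARY = """\
AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 155 1e-38
AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 95 2e-20
AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 78 3e-15
"""


def parse_hit_line(line):
    """Return (identifier, bit_score, e_value) or None if the line is not a hit."""
    parts = line.rsplit(None, 2)  # split off the last two tokens
    if len(parts) != 3:
        return None
    head, bits, evalue = parts
    # Assumption: some legacy reports print tiny E-values without a
    # mantissa (e.g. "e-100"); prefix "1" so float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    try:
        return head.split()[0], float(bits), float(evalue)
    except ValueError:
        # Header lines such as "... (bits) Value" end up here.
        return None


hits = [h for h in map(parse_hit_line, SUMMARY.splitlines()) if h is not None]
for identifier, bits, evalue in hits:
    print(identifier, bits, evalue)
```

Splitting from the right avoids having to interpret the free-text description, which is truncated with "..." and may itself contain digits.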
>AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
LENGTH=169
Length = 169
Score = 155 bits (392), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 88/172 (51%), Positives = 115/172 (66%), Gaps = 8/172 (4%)
Query: 1 MHICRYVPPWLSQILACMGGCLGCFPKP---ISMDEASKGLITQGSAINNYDRSEDIWSS 57
MH+ +P W+ Q+ CMGGC GC KP +++DE SKGL QG + SED WS+
Sbjct: 1 MHLREIIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWST 60
Query: 58 SSFYMDHSAGYSQRSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGN 117
S+ MD+S SQRS SSI+ N+ S S++ +P EFVNHGL LW QTR+QW+ N
Sbjct: 61 STCEMDNSTLQSQRSMSSISFTNNTSTSASTS-----NPTEFVNHGLNLWNQTRQQWLAN 115
Query: 118 KKTERRMQVGESRISWNATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
++++ +V E ISWNATYESLLG + F +PIPL EMV+FLVD+WE EGL
Sbjct: 116 GTSQKKAKVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGL 167
>AT5G25360.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:8799934-8802333
REVERSE LENGTH=169
Length = 169
Score = 155 bits (392), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 88/172 (51%), Positives = 115/172 (66%), Gaps = 8/172 (4%)
Query: 1 MHICRYVPPWLSQILACMGGCLGCFPKP---ISMDEASKGLITQGSAINNYDRSEDIWSS 57
MH+ +P W+ Q+ CMGGC GC KP +++DE SKGL QG + SED WS+
Sbjct: 1 MHLREIIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWST 60
Query: 58 SSFYMDHSAGYSQRSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGN 117
S+ MD+S SQRS SSI+ N+ S S++ +P EFVNHGL LW QTR+QW+ N
Sbjct: 61 STCEMDNSTLQSQRSMSSISFTNNTSTSASTS-----NPTEFVNHGLNLWNQTRQQWLAN 115
Query: 118 KKTERRMQVGESRISWNATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
++++ +V E ISWNATYESLLG + F +PIPL EMV+FLVD+WE EGL
Sbjct: 116 GTSQKKAKVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGL 167
>AT4G32342.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25360.2);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr4:15615184-15616200
REVERSE LENGTH=161
Length = 161
Score = 120 bits (300), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 88/155 (56%), Gaps = 16/155 (10%)
Query: 20 GCLGCFPKP----ISMDEASKGLITQGSAIN-NYDRSEDIWSSSSFYMDHSAGYSQRSFS 74
C GC + + +DE SKGL QG + + S+D WS+S+ MDH+ +S
Sbjct: 16 NCFGCCNRERRLVVEVDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNITIQSQS-- 73
Query: 75 SITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVGESRISWN 134
N P DPQ S N EFVNHGL LW TR+QW T ++ V E ISWN
Sbjct: 74 ----SNPPFDPQCSTSNST----EFVNHGLILWNHTRQQW-RECLTRQQCLVPEPAISWN 124
Query: 135 ATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
+TY+SLL T + F QPIPL EMV FLVD+WE EGL
Sbjct: 125 STYDSLLSTNKLFPQPIPLKEMVHFLVDVWEEEGL 159
>AT3G15770.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr3:5340243-5341216 FORWARD LENGTH=161
Length = 161
Score = 119 bits (299), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 70/163 (42%), Positives = 93/163 (57%), Gaps = 15/163 (9%)
Query: 18 MGGCLGCFPKPISMDEASKGLITQGSAINNYD----------RSEDIWSSSSFYMDHSAG 67
MG CL CF K + L + D SED W++++ M+ +A
Sbjct: 1 MGSCLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA- 59
Query: 68 YSQRSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVG 127
S SSI+ N D Q G+ + P EFVNHGL LW QTR+QWVG+K++E R VG
Sbjct: 60 --HGSVSSISTTNLTIDSQG-CGSSSNEPAEFVNHGLVLWNQTRQQWVGDKRSESRKSVG 116
Query: 128 -ESRISWNATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
E ++ N TYESLLG+ + F +PIPL EMV+FLV++WE EGL
Sbjct: 117 REPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGL 159
>AT3G15770.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
LENGTH=162
Length = 162
Score = 115 bits (288), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 91/160 (56%), Gaps = 15/160 (9%)
Query: 21 CLGCFPKPISMDEASKGLITQGSAINNYD----------RSEDIWSSSSFYMDHSAGYSQ 70
CL CF K + L + D SED W++++ M+ +A
Sbjct: 5 CLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA---H 61
Query: 71 RSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVG-ES 129
S SSI+ N D Q G+ + P EFVNHGL LW QTR+QWVG+K++E R VG E
Sbjct: 62 GSVSSISTTNLTIDSQG-CGSSSNEPAEFVNHGLVLWNQTRQQWVGDKRSESRKSVGREP 120
Query: 130 RISWNATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
++ N TYESLLG+ + F +PIPL EMV+FLV++WE EGL
Sbjct: 121 ILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGL 160
>AT1G15350.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
LENGTH=154
Length = 154
Score = 114 bits (284), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 94/156 (60%), Gaps = 8/156 (5%)
Query: 18 MGGCLGCFPKPISMDEASKGLITQGSA--INNYDRSEDIWSSSSFYMDHSAGYSQRSFSS 75
MGGC+GC+ + S + K + A SED WS+S+ MD+ SQ S SS
Sbjct: 1 MGGCVGCYREHRSTAASLKDPPSNSIARPCKKPSVSEDFWSTSTVDMDNITFPSQGSLSS 60
Query: 76 ITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVGE-SRISWN 134
N D QS+A N + P E+VN GL LW QTR++WVG K + + ++++WN
Sbjct: 61 S---NQTFDSQSAARNS-NAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWN 116
Query: 135 -ATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
ATY+SLLG+ + F QPIPL EMV+FLVDIWE EGL
Sbjct: 117 TATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 152
>AT1G15350.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
LENGTH=154
Length = 154
Score = 114 bits (284), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 94/156 (60%), Gaps = 8/156 (5%)
Query: 18 MGGCLGCFPKPISMDEASKGLITQGSA--INNYDRSEDIWSSSSFYMDHSAGYSQRSFSS 75
MGGC+GC+ + S + K + A SED WS+S+ MD+ SQ S SS
Sbjct: 1 MGGCVGCYREHRSTAASLKDPPSNSIARPCKKPSVSEDFWSTSTVDMDNITFPSQGSLSS 60
Query: 76 ITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVGE-SRISWN 134
N D QS+A N + P E+VN GL LW QTR++WVG K + + ++++WN
Sbjct: 61 S---NQTFDSQSAARNS-NAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWN 116
Query: 135 -ATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
ATY+SLLG+ + F QPIPL EMV+FLVDIWE EGL
Sbjct: 117 TATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 152
>AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
hits to 145 proteins in 25 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
Other Eukaryotes - 7 (source: NCBI BLink). |
chr1:5278481-5279056 REVERSE LENGTH=108
Length = 108
Score = 95.1 bits (235), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/110 (50%), Positives = 72/110 (65%), Gaps = 6/110 (5%)
Query: 62 MDHSAGYSQRSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTE 121
MD+ SQ S SS N D QS+A N + P E+VN GL LW QTR++WVG K
Sbjct: 1 MDNITFPSQGSLSS---SNQTFDSQSAARNS-NAPPEYVNQGLLLWNQTRERWVGKDKPN 56
Query: 122 RRMQVGE-SRISWN-ATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
+ + ++++WN ATY+SLLG+ + F QPIPL EMV+FLVDIWE EGL
Sbjct: 57 NPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 106
>AT3G54880.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
LENGTH=112
Length = 112
Score = 87.4 bits (215), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 56/86 (65%)
Query: 84 DPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVGESRISWNATYESLLGT 143
D +SS ++ VNHG ++W++ R++WVG++ +R+ + ISW+ TYE LL T
Sbjct: 25 DEKSSVKTNSENTLTLVNHGAKMWQENREKWVGDQSRQRKNTAKDQIISWSTTYEDLLST 84
Query: 144 TRPFRQPIPLGEMVEFLVDIWELEGL 169
PF + IPL EMV+FLVDIW EGL
Sbjct: 85 HEPFSESIPLPEMVDFLVDIWYDEGL 110
>AT5G03440.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:857179-857898 REVERSE LENGTH=98
Length = 98
Score = 77.8 bits (190), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 52/91 (57%), Gaps = 2/91 (2%)
Query: 81 HPSDPQSSAGNQIDHPEE--FVNHGLRLWKQTRKQWVGNKKTERRMQVGESRISWNATYE 138
P + S++ N + E FVNH W++ RK+WVG+ E I +NATYE
Sbjct: 6 EPGNVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYE 65
Query: 139 SLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
LL + PF +PIPL EMV+FL DIW +GL
Sbjct: 66 DLLTSNTPFNKPIPLAEMVDFLFDIWHGDGL 96
>AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:857179-857898 REVERSE LENGTH=98
Length = 98
Score = 77.8 bits (190), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 52/91 (57%), Gaps = 2/91 (2%)
Query: 81 HPSDPQSSAGNQIDHPEE--FVNHGLRLWKQTRKQWVGNKKTERRMQVGESRISWNATYE 138
P + S++ N + E FVNH W++ RK+WVG+ E I +NATYE
Sbjct: 6 EPGNVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYE 65
Query: 139 SLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
LL + PF +PIPL EMV+FL DIW +GL
Sbjct: 66 DLLTSNTPFNKPIPLAEMVDFLFDIWHGDGL 96
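Each alignment above reports its Identities, Positives and Gaps as a fraction followed by a percentage. The Python sketch below re-derives the percentage from the fraction as a consistency check. The helper name `check_stats` is hypothetical. The sketch assumes the percentage is the integer part of 100 * numerator / denominator (truncated, not rounded), which matches the lines in this report, for example 115/172 printed as 66%; it is an observation about this output, not a documented guarantee.

```python
import re

# Matches fields such as "Identities = 88/172 (51%)".
FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stats(line):
    """Map each field name to (numerator, denominator, percent, consistent)."""
    results = {}
    for name, num, den, pct in FIELD.findall(line):
        num, den, pct = int(num), int(den), int(pct)
        # Assumption: the reported percentage is truncated toward zero.
        results[name] = (num, den, pct, (100 * num) // den == pct)
    return results


line = "Identities = 88/172 (51%), Positives = 115/172 (66%), Gaps = 8/172 (4%)"
for name, (num, den, pct, ok) in check_stats(line).items():
    print(f"{name}: {num}/{den} reported {pct}% consistent={ok}")
```

Alignments without gaps, such as the AT3G54880.1 hit, omit the Gaps field, so the returned dictionary simply has no "Gaps" key in that case.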