Miyakogusa Predicted Gene
- Lj3g3v0065550.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0065550.1 Non Characterized Hit- tr|C6T0Q9|C6T0Q9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.1641
PE=,81.53,0,DUF4050,Domain of unknown function DUF4050,CUFF.40268.1
(157 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   177   2e-45
AT5G25360.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   177   2e-45
AT1G15350.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   162   6e-41
AT1G15350.2 | Symbols: | unknown protein; BEST Arabidopsis thal...   162   6e-41
AT4G32342.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   152   8e-38
AT3G15770.2 | Symbols: | unknown protein; BEST Arabidopsis thal...   148   2e-36
AT3G15770.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   143   4e-35
AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   126   5e-30
AT3G54880.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    91   2e-19
AT5G03440.2 | Symbols: | unknown protein; BEST Arabidopsis thal...    86   1e-17
AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    86   1e-17
>AT5G25360.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
LENGTH=169
Length = 169
Score = 177 bits (449), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 91/146 (62%), Positives = 105/146 (71%), Gaps = 3/146 (2%)
Query: 12 KPTLIATADVPSKRLTQHNKAVKKASTSEDFWTTSTHDMDNSAVQSQGSISSTSVTNLPA 71
KP LI D PSK L + VKK S SEDFW+TST +MDNS +QSQ S+SS S TN
Sbjct: 27 KPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSISFTN-NT 85
Query: 72 DPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATYDSLLASNK 131
A +SNPTEFVNHGL LWNQTRQ+W+ N +K + REP +SWNATY+SLL NK
Sbjct: 86 STSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKK--AKVREPTISWNATYESLLGMNK 143
Query: 132 PFPQPIPLAEMVDFLVDIWEQEGLYD 157
F +PIPL EMVDFLVD+WEQEGLYD
Sbjct: 144 RFSRPIPLPEMVDFLVDVWEQEGLYD 169
>AT5G25360.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:8799934-8802333
REVERSE LENGTH=169
Length = 169
Score = 177 bits (449), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 91/146 (62%), Positives = 105/146 (71%), Gaps = 3/146 (2%)
Query: 12 KPTLIATADVPSKRLTQHNKAVKKASTSEDFWTTSTHDMDNSAVQSQGSISSTSVTNLPA 71
KP LI D PSK L + VKK S SEDFW+TST +MDNS +QSQ S+SS S TN
Sbjct: 27 KPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSISFTN-NT 85
Query: 72 DPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATYDSLLASNK 131
A +SNPTEFVNHGL LWNQTRQ+W+ N +K + REP +SWNATY+SLL NK
Sbjct: 86 STSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKK--AKVREPTISWNATYESLLGMNK 143
Query: 132 PFPQPIPLAEMVDFLVDIWEQEGLYD 157
F +PIPL EMVDFLVD+WEQEGLYD
Sbjct: 144 RFSRPIPLPEMVDFLVDVWEQEGLYD 169
>AT1G15350.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
LENGTH=154
Length = 154
Score = 162 bits (411), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 87/158 (55%), Positives = 107/158 (67%), Gaps = 7/158 (4%)
Query: 3 MGGCLGLHK--KPTLIATADVPSKRLTQHNKAVKKASTSEDFWTTSTHDMDNSAVQSQGS 60
MGGC+G ++ + T + D PS + + KK S SEDFW+TST DMDN SQGS
Sbjct: 1 MGGCVGCYREHRSTAASLKDPPSNSIA---RPCKKPSVSEDFWSTSTVDMDNITFPSQGS 57
Query: 61 ISSTSVTNLPADPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWN 120
+SS++ T S+ P E+VN GL LWNQTR+RW+G +K P + ++ KL+WN
Sbjct: 58 LSSSNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDK-PNNPVDHNQGAKLNWN 116
Query: 121 -ATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
ATYDSLL SNK FPQPIPL EMVDFLVDIWEQEGLYD
Sbjct: 117 TATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 154
>AT1G15350.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
LENGTH=154
Length = 154
Score = 162 bits (411), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 87/158 (55%), Positives = 107/158 (67%), Gaps = 7/158 (4%)
Query: 3 MGGCLGLHK--KPTLIATADVPSKRLTQHNKAVKKASTSEDFWTTSTHDMDNSAVQSQGS 60
MGGC+G ++ + T + D PS + + KK S SEDFW+TST DMDN SQGS
Sbjct: 1 MGGCVGCYREHRSTAASLKDPPSNSIA---RPCKKPSVSEDFWSTSTVDMDNITFPSQGS 57
Query: 61 ISSTSVTNLPADPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWN 120
+SS++ T S+ P E+VN GL LWNQTR+RW+G +K P + ++ KL+WN
Sbjct: 58 LSSSNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDK-PNNPVDHNQGAKLNWN 116
Query: 121 -ATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
ATYDSLL SNK FPQPIPL EMVDFLVDIWEQEGLYD
Sbjct: 117 TATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 154
>AT4G32342.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25360.2);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr4:15615184-15616200
REVERSE LENGTH=161
Length = 161
Score = 152 bits (384), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 82/158 (51%), Positives = 103/158 (65%), Gaps = 14/158 (8%)
Query: 1 MHMGGCLGLHKKPTLIATADVPSKRLTQHNKAVKK-ASTSEDFWTTSTHDMD-NSAVQSQ 58
++ GC ++ L+ D PSK L K VKK +++S+DFW+TST DMD N +QSQ
Sbjct: 15 INCFGCCNRERR--LVVEVDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNITIQSQ 72
Query: 59 GSISSTSVTNLPADPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLS 118
S N P DP +SN TEFVNHGL LWN TRQ+W + +Q EP +S
Sbjct: 73 SS-------NPPFDPQCSTSNSTEFVNHGLILWNHTRQQW---RECLTRQQCLVPEPAIS 122
Query: 119 WNATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLY 156
WN+TYDSLL++NK FPQPIPL EMV FLVD+WE+EGLY
Sbjct: 123 WNSTYDSLLSTNKLFPQPIPLKEMVHFLVDVWEEEGLY 160
>AT3G15770.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr3:5340243-5341216 FORWARD LENGTH=161
Length = 161
Score = 148 bits (373), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/164 (51%), Positives = 105/164 (64%), Gaps = 14/164 (8%)
Query: 3 MGGCLGLHKKPTLIATADVP----SKRLTQHN-KAVKKAST--SEDFWTTSTHDMDNSAV 55
MG CL K + DVP L + + ++K S SEDFWT +T DM+++A
Sbjct: 1 MGSCLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA- 59
Query: 56 QSQGSISSTSVTNLPADPH-AGSSN--PTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQS 112
GS+SS S TNL D GSS+ P EFVNHGL LWNQTRQ+W+G +KR E +
Sbjct: 60 --HGSVSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQTRQQWVG-DKRSESRKSVG 116
Query: 113 REPKLSWNATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLY 156
REP L+ N TY+SLL SNK FP+PIPL EMV FLV++WE+EGLY
Sbjct: 117 REPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGLY 160
>AT3G15770.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
LENGTH=162
Length = 162
Score = 143 bits (361), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 82/161 (50%), Positives = 103/161 (63%), Gaps = 14/161 (8%)
Query: 6 CLGLHKKPTLIATADVP----SKRLTQHN-KAVKKAST--SEDFWTTSTHDMDNSAVQSQ 58
CL K + DVP L + + ++K S SEDFWT +T DM+++A
Sbjct: 5 CLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA---H 61
Query: 59 GSISSTSVTNLPADPH-AGSSN--PTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREP 115
GS+SS S TNL D GSS+ P EFVNHGL LWNQTRQ+W+G +KR E + REP
Sbjct: 62 GSVSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQTRQQWVG-DKRSESRKSVGREP 120
Query: 116 KLSWNATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLY 156
L+ N TY+SLL SNK FP+PIPL EMV FLV++WE+EGLY
Sbjct: 121 ILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGLY 161
>AT1G15350.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
hits to 145 proteins in 25 species: Archae - 0; Bacteria
- 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
Other Eukaryotes - 7 (source: NCBI BLink). |
chr1:5278481-5279056 REVERSE LENGTH=108
Length = 108
Score = 126 bits (317), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 66/109 (60%), Positives = 78/109 (71%), Gaps = 2/109 (1%)
Query: 50 MDNSAVQSQGSISSTSVTNLPADPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQT 109
MDN SQGS+SS++ T S+ P E+VN GL LWNQTR+RW+G +K P
Sbjct: 1 MDNITFPSQGSLSSSNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDK-PNNPV 59
Query: 110 EQSREPKLSWN-ATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
+ ++ KL+WN ATYDSLL SNK FPQPIPL EMVDFLVDIWEQEGLYD
Sbjct: 60 DHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 108
>AT3G54880.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
LENGTH=112
Length = 112
Score = 91.3 bits (225), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 56/82 (68%), Gaps = 2/82 (2%)
Query: 76 GSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATYDSLLASNKPFPQ 135
S N VNHG +W + R++W+G+ R K T +++ +SW+ TY+ LL++++PF +
Sbjct: 33 NSENTLTLVNHGAKMWQENREKWVGDQSRQRKNT--AKDQIISWSTTYEDLLSTHEPFSE 90
Query: 136 PIPLAEMVDFLVDIWEQEGLYD 157
IPL EMVDFLVDIW EGLYD
Sbjct: 91 SIPLPEMVDFLVDIWYDEGLYD 112
>AT5G03440.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:857179-857898 REVERSE LENGTH=98
Length = 98
Score = 85.5 bits (210), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 58/94 (61%), Gaps = 8/94 (8%)
Query: 70 PADPHAGSSNPTE------FVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATY 123
P + + SSN E FVNH W + R++W+G+ + +E EP + +NATY
Sbjct: 7 PGNVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGDPSN--RTSEMPDEPVIGFNATY 64
Query: 124 DSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
+ LL SN PF +PIPLAEMVDFL DIW +GL++
Sbjct: 65 EDLLTSNTPFNKPIPLAEMVDFLFDIWHGDGLFE 98
>AT5G03440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:857179-857898 REVERSE LENGTH=98
Length = 98
Score = 85.5 bits (210), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 58/94 (61%), Gaps = 8/94 (8%)
Query: 70 PADPHAGSSNPTE------FVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATY 123
P + + SSN E FVNH W + R++W+G+ + +E EP + +NATY
Sbjct: 7 PGNVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGDPSN--RTSEMPDEPVIGFNATY 64
Query: 124 DSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
+ LL SN PF +PIPLAEMVDFL DIW +GL++
Sbjct: 65 EDLLTSNTPFNKPIPLAEMVDFLFDIWHGDGLFE 98