Miyakogusa Predicted Gene
- Lj4g3v1881580.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1881580.1 Non Characterized Hit- tr|I1LZQ5|I1LZQ5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.18586 PE,70.22,0,FAMILY
NOT NAMED,NULL; seg,NULL; UBA-like,UBA-like; coiled-coil,NULL;
CUE,Ubiquitin system
component,NODE_12628_length_1819_cov_81.542053.path1.1
(267 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
AT5G32440.3 | Symbols: | Ubiquitin system component Cue protein...    244   4e-65
AT5G32440.1 | Symbols: | Ubiquitin system component Cue protein...    244   6e-65
AT1G80040.1 | Symbols: | FUNCTIONS IN: molecular_function unkno...    182   2e-46
AT1G80040.3 | Symbols: | FUNCTIONS IN: molecular_function unkno...    177   8e-45
AT5G32440.2 | Symbols: | Ubiquitin system component Cue protein...    118   4e-27
AT5G02510.1 | Symbols: | BEST Arabidopsis thaliana protein matc...    110   1e-24
AT1G80040.2 | Symbols: | FUNCTIONS IN: molecular_function unkno...     83   2e-16
>AT5G32440.3 | Symbols: | Ubiquitin system component Cue protein |
chr5:12077014-12078396 FORWARD LENGTH=265
Length = 265
Score = 244 bits (624), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 152/279 (54%), Positives = 197/279 (70%), Gaps = 26/279 (9%)
Query: 1 MSAIIVCGKRSALFQ---PSSPPISTKRIRC-----XXXXXXXXXXXXXXLLHHLTTLFP 52
MSAI VCGKRS LF+ +SPP+S K++RC LL HL +FP
Sbjct: 1 MSAI-VCGKRS-LFEDLAAASPPVS-KKLRCFSSSSSSRFSPPIPPSSSLLLDHLAAIFP 57
Query: 53 FMDPQLLEKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQG 112
MD Q+LE+A+EECG+DLDSAIR LN+LRL S +++ DSA S V + N +PQ QG
Sbjct: 58 DMDKQILERAIEECGDDLDSAIRCLNQLRLE-SANKNSDSATNQSPVVIQEPNVEPQQQG 116
Query: 113 ETKCDTEDAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSIC 172
+ A E V L+G EWV+ FVREMM+AS+M DAKARA+R LEALEKSI
Sbjct: 117 RS------AKEEPNVLN----LDGTEWVELFVREMMNASDMKDAKARAARALEALEKSIN 166
Query: 173 ERSRSET----ERENMMLKEQVEALIQENIILKRALCIQHERQKEYENKNQELKDLKQLV 228
R+ ++ ++ENMMLK+Q+EA++QEN +LKRA+ Q +RQ+E E+++QEL+ L+QLV
Sbjct: 167 ARTGTDAMQNLQQENMMLKQQLEAIVQENSLLKRAVVTQQKRQRESEDQSQELQHLRQLV 226
Query: 229 SQYQEQLRTLEVNNYALTMHLKQAEQSSSIPGHFHPDVF 267
+QYQEQLRTLEVNNYALT+HLKQA+Q+SSIPG +HPDVF
Sbjct: 227 TQYQEQLRTLEVNNYALTLHLKQAQQNSSIPGRYHPDVF 265
>AT5G32440.1 | Symbols: | Ubiquitin system component Cue protein |
chr5:12077014-12078396 FORWARD LENGTH=264
Length = 264
Score = 244 bits (622), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 152/279 (54%), Positives = 196/279 (70%), Gaps = 27/279 (9%)
Query: 1 MSAIIVCGKRSALFQ---PSSPPISTKRIRC-----XXXXXXXXXXXXXXLLHHLTTLFP 52
MSAI VCGKRS LF+ +SPP+S K++RC LL HL +FP
Sbjct: 1 MSAI-VCGKRS-LFEDLAAASPPVS-KKLRCFSSSSSSRFSPPIPPSSSLLLDHLAAIFP 57
Query: 53 FMDPQLLEKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQG 112
MD Q+LE+A+EECG+DLDSAIR LN+LRL S +++ DSA S V + N +PQ QG
Sbjct: 58 DMDKQILERAIEECGDDLDSAIRCLNQLRLE-SANKNSDSATNQSPVVIQEPNVEPQQQG 116
Query: 113 ETKCDTEDAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSIC 172
K E V L+G EWV+ FVREMM+AS+M DAKARA+R LEALEKSI
Sbjct: 117 SAK-------EEPNVLN----LDGTEWVELFVREMMNASDMKDAKARAARALEALEKSIN 165
Query: 173 ERSRSET----ERENMMLKEQVEALIQENIILKRALCIQHERQKEYENKNQELKDLKQLV 228
R+ ++ ++ENMMLK+Q+EA++QEN +LKRA+ Q +RQ+E E+++QEL+ L+QLV
Sbjct: 166 ARTGTDAMQNLQQENMMLKQQLEAIVQENSLLKRAVVTQQKRQRESEDQSQELQHLRQLV 225
Query: 229 SQYQEQLRTLEVNNYALTMHLKQAEQSSSIPGHFHPDVF 267
+QYQEQLRTLEVNNYALT+HLKQA+Q+SSIPG +HPDVF
Sbjct: 226 TQYQEQLRTLEVNNYALTLHLKQAQQNSSIPGRYHPDVF 264
>AT1G80040.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast; EXPRESSED IN: 24 plant structures;
EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
DOMAIN/s: Ubiquitin system component Cue
(InterPro:IPR003892); BEST Arabidopsis thaliana protein
match is: Ubiquitin system component Cue protein
(TAIR:AT5G32440.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr1:30109447-30111051 REVERSE LENGTH=248
Length = 248
Score = 182 bits (462), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 157/272 (57%), Gaps = 29/272 (10%)
Query: 1 MSAIIVCGKRSALFQPSSPPISTKRIRC-XXXXXXXXXXXXXXLLHHLTTLFPFMDPQLL 59
MSA+ CG + + F +S P S+KR RC L L + FP ++ +L
Sbjct: 1 MSAVY-CGTKRSYFDDNSSPPSSKRFRCFSPSNSPIWSSPPSSSLDQLHSAFPHIELTVL 59
Query: 60 EKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQGETKCDTE 119
KALE+ G+D ++A++SL S A+ + A +L + ET
Sbjct: 60 VKALEDNGSDFNAAMKSLY-------------SFASSEEKKAEELAAGGAATQET----- 101
Query: 120 DAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSICERSRSET 179
D V G + P +G +WV+ VRE++ +S DDAK RA+RVLEALEK + R+R E
Sbjct: 102 -----DAVCGGNPPTSGDDWVELLVREVLQSSGTDDAKVRAARVLEALEKMLSARAREEA 156
Query: 180 ----ERENMMLKEQVEALIQENIILKRALCIQHERQKEYENKNQELKDLKQLVSQYQEQL 235
+ E + +++QVE L+++N +LKRA+ IQHERQK E+ N +L LKQLV QYQE+L
Sbjct: 157 GNKFQEEKVAVQQQVETLVKDNTVLKRAVAIQHERQKALEDANHQLGLLKQLVPQYQEKL 216
Query: 236 RTLEVNNYALTMHLKQAEQSSSIPGHFHPDVF 267
R LEVNNYAL M L+Q E +S+P F+PDVF
Sbjct: 217 RNLEVNNYALRMQLQQVEHGNSMPARFNPDVF 248
>AT1G80040.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast; EXPRESSED IN: 24 plant structures;
EXPRESSED DURING: 15 growth stages; BEST Arabidopsis
thaliana protein match is: Ubiquitin system component
Cue protein (TAIR:AT5G32440.1). | chr1:30109447-30111051
REVERSE LENGTH=259
Length = 259
Score = 177 bits (448), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 112/283 (39%), Positives = 158/283 (55%), Gaps = 40/283 (14%)
Query: 1 MSAIIVCGKRSALFQPSSPPISTKRIRC-XXXXXXXXXXXXXXLLHHLTTLFPFMD---- 55
MSA+ CG + + F +S P S+KR RC L L + FP ++
Sbjct: 1 MSAVY-CGTKRSYFDDNSSPPSSKRFRCFSPSNSPIWSSPPSSSLDQLHSAFPHIELTVA 59
Query: 56 -------PQLLEKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQP 108
Q+L KALE+ G+D ++A++SL S A+ + A +L
Sbjct: 60 SKIHVSVAQVLVKALEDNGSDFNAAMKSLY-------------SFASSEEKKAEELAAGG 106
Query: 109 QSQGETKCDTEDAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALE 168
+ ET D V G + P +G +WV+ VRE++ +S DDAK RA+RVLEALE
Sbjct: 107 AATQET----------DAVCGGNPPTSGDDWVELLVREVLQSSGTDDAKVRAARVLEALE 156
Query: 169 KSICERSRSET----ERENMMLKEQVEALIQENIILKRALCIQHERQKEYENKNQELKDL 224
K + R+R E + E + +++QVE L+++N +LKRA+ IQHERQK E+ N +L L
Sbjct: 157 KMLSARAREEAGNKFQEEKVAVQQQVETLVKDNTVLKRAVAIQHERQKALEDANHQLGLL 216
Query: 225 KQLVSQYQEQLRTLEVNNYALTMHLKQAEQSSSIPGHFHPDVF 267
KQLV QYQE+LR LEVNNYAL M L+Q E +S+P F+PDVF
Sbjct: 217 KQLVPQYQEKLRNLEVNNYALRMQLQQVEHGNSMPARFNPDVF 259
>AT5G32440.2 | Symbols: | Ubiquitin system component Cue protein |
chr5:12077014-12078091 FORWARD LENGTH=187
Length = 187
Score = 118 bits (296), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 91/187 (48%), Positives = 115/187 (61%), Gaps = 23/187 (12%)
Query: 1 MSAIIVCGKRSALFQ---PSSPPISTKRIRC-----XXXXXXXXXXXXXXLLHHLTTLFP 52
MSAI VCGKRS LF+ +SPP+S K++RC LL HL +FP
Sbjct: 1 MSAI-VCGKRS-LFEDLAAASPPVS-KKLRCFSSSSSSRFSPPIPPSSSLLLDHLAAIFP 57
Query: 53 FMDPQLLEKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQG 112
MD Q+LE+A+EECG+DLDSAIR LN+LRL S +++ DSA S V + N +PQ QG
Sbjct: 58 DMDKQILERAIEECGDDLDSAIRCLNQLRL-ESANKNSDSATNQSPVVIQEPNVEPQQQG 116
Query: 113 ETKCDTEDAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSIC 172
K E V L+G EWV+ FVREMM+AS+M DAKARA+R LEALEKSI
Sbjct: 117 SAK-------EEPNVLN----LDGTEWVELFVREMMNASDMKDAKARAARALEALEKSIN 165
Query: 173 ERSRSET 179
R+ ++
Sbjct: 166 ARTGTDA 172
>AT5G02510.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: Ubiquitin system component Cue protein
(TAIR:AT5G32440.1); Has 166 Blast hits to 166 proteins
in 31 species: Archae - 0; Bacteria - 4; Metazoa - 3;
Fungi - 0; Plants - 142; Viruses - 0; Other Eukaryotes -
17 (source: NCBI BLink). | chr5:560135-560860 FORWARD
LENGTH=179
Length = 179
Score = 110 bits (274), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 90/135 (66%), Gaps = 1/135 (0%)
Query: 134 LNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSICERSRSETERENMMLKEQVEAL 193
++GA+WVD V EM A N+DD + R + +LEALE I + + + + E +KE +++L
Sbjct: 45 IDGAKWVDRLVSEMTKAINIDDMRRRVAVILEALESIIKKNTNASKKLEYASMKESLQSL 104
Query: 194 IQENIILKRALCIQHERQKEYENKNQELKDLKQLVSQYQEQLRTLEVNNYALTMHLKQA- 252
I +N ILKR + QH+R E E K +++ L+ +V QYQEQ+ LE++NYA+ +HL+++
Sbjct: 105 INDNQILKRVIANQHQRSSENEEKAKQVLHLRGVVGQYQEQVHKLELSNYAMKLHLQRSQ 164
Query: 253 EQSSSIPGHFHPDVF 267
+Q +S G+ PD++
Sbjct: 165 QQQTSFSGNLPPDIY 179
>AT1G80040.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast; EXPRESSED IN: 24 plant structures;
EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
DOMAIN/s: Ubiquitin system component Cue
(InterPro:IPR003892); BEST Arabidopsis thaliana protein
match is: Ubiquitin system component Cue protein
(TAIR:AT5G32440.3); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr1:30110059-30111051 REVERSE LENGTH=180
Length = 180
Score = 82.8 bits (203), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 90/179 (50%), Gaps = 25/179 (13%)
Query: 1 MSAIIVCGKRSALFQPSSPPISTKRIRC-XXXXXXXXXXXXXXLLHHLTTLFPFMDPQLL 59
MSA+ CG + + F +S P S+KR RC L L + FP ++ +L
Sbjct: 1 MSAVY-CGTKRSYFDDNSSPPSSKRFRCFSPSNSPIWSSPPSSSLDQLHSAFPHIELTVL 59
Query: 60 EKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQGETKCDTE 119
KALE+ G+D ++A++SL S A+ + A +L + ET
Sbjct: 60 VKALEDNGSDFNAAMKSLY-------------SFASSEEKKAEELAAGGAATQET----- 101
Query: 120 DAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSICERSRSE 178
D V G + P +G +WV+ VRE++ +S DDAK RA+RVLEALEK + R+R E
Sbjct: 102 -----DAVCGGNPPTSGDDWVELLVREVLQSSGTDDAKVRAARVLEALEKMLSARAREE 155
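
The percentages on each "Identities = ..." line in this report can be rechecked from the raw counts. The following is a minimal Python sketch, not part of the BLAST output. The names `parse_stats` and `floor_percent` are hypothetical helpers introduced here for illustration. Rounding down is an assumption that happens to match every percentage printed in this report; it is not a statement about how BLAST computes them internally.

```python
import re

# Pattern for the per-alignment statistics line as it appears in this report.
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def parse_stats(line):
    """Return {name: (count, alignment_length, reported_percent)} or None."""
    match = STATS.search(line)
    if match is None:
        return None
    values = [int(group) for group in match.groups()]
    names = ("identities", "positives", "gaps")
    return {
        name: (values[3 * i], values[3 * i + 1], values[3 * i + 2])
        for i, name in enumerate(names)
    }


def floor_percent(count, length):
    """Percentage rounded down (assumed here; consistent with this report)."""
    return 100 * count // length


# Statistics line of the first alignment (AT5G32440.3), copied from the report.
line = "Identities = 152/279 (54%), Positives = 197/279 (70%), Gaps = 26/279 (9%)"
stats = parse_stats(line)
for name, (count, length, reported) in stats.items():
    print(name, count, length, reported, floor_percent(count, length))
```

Running `parse_stats` over the other statistics lines in the same way gives a quick consistency check between the counts and the reported percentages.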