Miyakogusa Predicted Gene

Lj6g3v0528430.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v0528430.1 Non Characterized Hit- tr|I1LZQ5|I1LZQ5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.18586
PE,70.22,0,CUE,Ubiquitin system component Cue; seg,NULL;
coiled-coil,NULL; UBA-like,UBA-like; FAMILY NOT
NAMED,,NODE_12628_length_1819_cov_81.542053.path2.1
         (267 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G32440.3 | Symbols:  | Ubiquitin system component Cue protein...   244   4e-65
AT5G32440.1 | Symbols:  | Ubiquitin system component Cue protein...   244   6e-65
AT1G80040.1 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   182   2e-46
AT1G80040.3 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   177   8e-45
AT5G32440.2 | Symbols:  | Ubiquitin system component Cue protein...   118   4e-27
AT5G02510.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   110   1e-24
AT1G80040.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...    83   2e-16
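
The seven rows above are splice variants of only three Arabidopsis loci. A minimal Python sketch for reading the summary rows and keeping the best-scoring model per locus follows. The helper name `parse_hit_line` is hypothetical, and the parsing assumes the one-line layout shown above (description, then bit score, then E-value as the last two whitespace-separated fields); it is not part of BLAST itself.

```python
def parse_hit_line(line):
    """Split one summary row into (subject_id, bit_score, e_value)."""
    # The last two whitespace-separated fields are the bit score and E-value.
    description, bits, evalue = line.rsplit(None, 2)
    subject_id = description.split(None, 1)[0]
    # Legacy BLAST may print very small E-values as "e-100" (no mantissa).
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return subject_id, float(bits), float(evalue)


rows = [
    "AT5G32440.3 | Symbols:  | Ubiquitin system component Cue protein...   244   4e-65",
    "AT5G32440.1 | Symbols:  | Ubiquitin system component Cue protein...   244   6e-65",
    "AT1G80040.1 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   182   2e-46",
    "AT1G80040.3 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   177   8e-45",
    "AT5G32440.2 | Symbols:  | Ubiquitin system component Cue protein...   118   4e-27",
    "AT5G02510.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   110   1e-24",
    "AT1G80040.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...    83   2e-16",
]

hits = [parse_hit_line(row) for row in rows]

# Keep the highest-scoring gene model for each locus (ID without the ".N" suffix).
best_by_locus = {}
for subject_id, bits, evalue in hits:
    locus = subject_id.split(".")[0]
    if locus not in best_by_locus or bits > best_by_locus[locus][1]:
        best_by_locus[locus] = (subject_id, bits, evalue)

for locus, (subject_id, bits, evalue) in best_by_locus.items():
    print(locus, subject_id, bits, evalue)
```

Because the rows are already sorted by score, the first model seen for each locus is retained when scores tie.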

>AT5G32440.3 | Symbols:  | Ubiquitin system component Cue protein |
           chr5:12077014-12078396 FORWARD LENGTH=265
          Length = 265

 Score =  244 bits (624), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 152/279 (54%), Positives = 197/279 (70%), Gaps = 26/279 (9%)

Query: 1   MSAIIVCGKRSALFQ---PSSPPISTKRIRC-----XXXXXXXXXXXXXXLLHHLTTLFP 52
           MSAI VCGKRS LF+    +SPP+S K++RC                   LL HL  +FP
Sbjct: 1   MSAI-VCGKRS-LFEDLAAASPPVS-KKLRCFSSSSSSRFSPPIPPSSSLLLDHLAAIFP 57

Query: 53  FMDPQLLEKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQG 112
            MD Q+LE+A+EECG+DLDSAIR LN+LRL  S +++ DSA   S  V  + N +PQ QG
Sbjct: 58  DMDKQILERAIEECGDDLDSAIRCLNQLRLE-SANKNSDSATNQSPVVIQEPNVEPQQQG 116

Query: 113 ETKCDTEDAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSIC 172
            +      A  E  V      L+G EWV+ FVREMM+AS+M DAKARA+R LEALEKSI 
Sbjct: 117 RS------AKEEPNVLN----LDGTEWVELFVREMMNASDMKDAKARAARALEALEKSIN 166

Query: 173 ERSRSET----ERENMMLKEQVEALIQENIILKRALCIQHERQKEYENKNQELKDLKQLV 228
            R+ ++     ++ENMMLK+Q+EA++QEN +LKRA+  Q +RQ+E E+++QEL+ L+QLV
Sbjct: 167 ARTGTDAMQNLQQENMMLKQQLEAIVQENSLLKRAVVTQQKRQRESEDQSQELQHLRQLV 226

Query: 229 SQYQEQLRTLEVNNYALTMHLKQAEQSSSIPGHFHPDVF 267
           +QYQEQLRTLEVNNYALT+HLKQA+Q+SSIPG +HPDVF
Sbjct: 227 TQYQEQLRTLEVNNYALTLHLKQAQQNSSIPGRYHPDVF 265


>AT5G32440.1 | Symbols:  | Ubiquitin system component Cue protein |
           chr5:12077014-12078396 FORWARD LENGTH=264
          Length = 264

 Score =  244 bits (622), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 152/279 (54%), Positives = 196/279 (70%), Gaps = 27/279 (9%)

Query: 1   MSAIIVCGKRSALFQ---PSSPPISTKRIRC-----XXXXXXXXXXXXXXLLHHLTTLFP 52
           MSAI VCGKRS LF+    +SPP+S K++RC                   LL HL  +FP
Sbjct: 1   MSAI-VCGKRS-LFEDLAAASPPVS-KKLRCFSSSSSSRFSPPIPPSSSLLLDHLAAIFP 57

Query: 53  FMDPQLLEKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQG 112
            MD Q+LE+A+EECG+DLDSAIR LN+LRL  S +++ DSA   S  V  + N +PQ QG
Sbjct: 58  DMDKQILERAIEECGDDLDSAIRCLNQLRLE-SANKNSDSATNQSPVVIQEPNVEPQQQG 116

Query: 113 ETKCDTEDAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSIC 172
             K        E  V      L+G EWV+ FVREMM+AS+M DAKARA+R LEALEKSI 
Sbjct: 117 SAK-------EEPNVLN----LDGTEWVELFVREMMNASDMKDAKARAARALEALEKSIN 165

Query: 173 ERSRSET----ERENMMLKEQVEALIQENIILKRALCIQHERQKEYENKNQELKDLKQLV 228
            R+ ++     ++ENMMLK+Q+EA++QEN +LKRA+  Q +RQ+E E+++QEL+ L+QLV
Sbjct: 166 ARTGTDAMQNLQQENMMLKQQLEAIVQENSLLKRAVVTQQKRQRESEDQSQELQHLRQLV 225

Query: 229 SQYQEQLRTLEVNNYALTMHLKQAEQSSSIPGHFHPDVF 267
           +QYQEQLRTLEVNNYALT+HLKQA+Q+SSIPG +HPDVF
Sbjct: 226 TQYQEQLRTLEVNNYALTLHLKQAQQNSSIPGRYHPDVF 264


>AT1G80040.1 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           chloroplast; EXPRESSED IN: 24 plant structures;
           EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
           DOMAIN/s: Ubiquitin system component Cue
           (InterPro:IPR003892); BEST Arabidopsis thaliana protein
           match is: Ubiquitin system component Cue protein
           (TAIR:AT5G32440.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr1:30109447-30111051 REVERSE LENGTH=248
          Length = 248

 Score =  182 bits (462), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 111/272 (40%), Positives = 157/272 (57%), Gaps = 29/272 (10%)

Query: 1   MSAIIVCGKRSALFQPSSPPISTKRIRC-XXXXXXXXXXXXXXLLHHLTTLFPFMDPQLL 59
           MSA+  CG + + F  +S P S+KR RC                L  L + FP ++  +L
Sbjct: 1   MSAVY-CGTKRSYFDDNSSPPSSKRFRCFSPSNSPIWSSPPSSSLDQLHSAFPHIELTVL 59

Query: 60  EKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQGETKCDTE 119
            KALE+ G+D ++A++SL              S A+  +  A +L     +  ET     
Sbjct: 60  VKALEDNGSDFNAAMKSLY-------------SFASSEEKKAEELAAGGAATQET----- 101

Query: 120 DAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSICERSRSET 179
                D V G + P +G +WV+  VRE++ +S  DDAK RA+RVLEALEK +  R+R E 
Sbjct: 102 -----DAVCGGNPPTSGDDWVELLVREVLQSSGTDDAKVRAARVLEALEKMLSARAREEA 156

Query: 180 ----ERENMMLKEQVEALIQENIILKRALCIQHERQKEYENKNQELKDLKQLVSQYQEQL 235
               + E + +++QVE L+++N +LKRA+ IQHERQK  E+ N +L  LKQLV QYQE+L
Sbjct: 157 GNKFQEEKVAVQQQVETLVKDNTVLKRAVAIQHERQKALEDANHQLGLLKQLVPQYQEKL 216

Query: 236 RTLEVNNYALTMHLKQAEQSSSIPGHFHPDVF 267
           R LEVNNYAL M L+Q E  +S+P  F+PDVF
Sbjct: 217 RNLEVNNYALRMQLQQVEHGNSMPARFNPDVF 248


>AT1G80040.3 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           chloroplast; EXPRESSED IN: 24 plant structures;
           EXPRESSED DURING: 15 growth stages; BEST Arabidopsis
           thaliana protein match is: Ubiquitin system component
           Cue protein (TAIR:AT5G32440.1). | chr1:30109447-30111051
           REVERSE LENGTH=259
          Length = 259

 Score =  177 bits (448), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 112/283 (39%), Positives = 158/283 (55%), Gaps = 40/283 (14%)

Query: 1   MSAIIVCGKRSALFQPSSPPISTKRIRC-XXXXXXXXXXXXXXLLHHLTTLFPFMD---- 55
           MSA+  CG + + F  +S P S+KR RC                L  L + FP ++    
Sbjct: 1   MSAVY-CGTKRSYFDDNSSPPSSKRFRCFSPSNSPIWSSPPSSSLDQLHSAFPHIELTVA 59

Query: 56  -------PQLLEKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQP 108
                   Q+L KALE+ G+D ++A++SL              S A+  +  A +L    
Sbjct: 60  SKIHVSVAQVLVKALEDNGSDFNAAMKSLY-------------SFASSEEKKAEELAAGG 106

Query: 109 QSQGETKCDTEDAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALE 168
            +  ET          D V G + P +G +WV+  VRE++ +S  DDAK RA+RVLEALE
Sbjct: 107 AATQET----------DAVCGGNPPTSGDDWVELLVREVLQSSGTDDAKVRAARVLEALE 156

Query: 169 KSICERSRSET----ERENMMLKEQVEALIQENIILKRALCIQHERQKEYENKNQELKDL 224
           K +  R+R E     + E + +++QVE L+++N +LKRA+ IQHERQK  E+ N +L  L
Sbjct: 157 KMLSARAREEAGNKFQEEKVAVQQQVETLVKDNTVLKRAVAIQHERQKALEDANHQLGLL 216

Query: 225 KQLVSQYQEQLRTLEVNNYALTMHLKQAEQSSSIPGHFHPDVF 267
           KQLV QYQE+LR LEVNNYAL M L+Q E  +S+P  F+PDVF
Sbjct: 217 KQLVPQYQEKLRNLEVNNYALRMQLQQVEHGNSMPARFNPDVF 259


>AT5G32440.2 | Symbols:  | Ubiquitin system component Cue protein |
           chr5:12077014-12078091 FORWARD LENGTH=187
          Length = 187

 Score =  118 bits (296), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 91/187 (48%), Positives = 115/187 (61%), Gaps = 23/187 (12%)

Query: 1   MSAIIVCGKRSALFQ---PSSPPISTKRIRC-----XXXXXXXXXXXXXXLLHHLTTLFP 52
           MSAI VCGKRS LF+    +SPP+S K++RC                   LL HL  +FP
Sbjct: 1   MSAI-VCGKRS-LFEDLAAASPPVS-KKLRCFSSSSSSRFSPPIPPSSSLLLDHLAAIFP 57

Query: 53  FMDPQLLEKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQG 112
            MD Q+LE+A+EECG+DLDSAIR LN+LRL  S +++ DSA   S  V  + N +PQ QG
Sbjct: 58  DMDKQILERAIEECGDDLDSAIRCLNQLRL-ESANKNSDSATNQSPVVIQEPNVEPQQQG 116

Query: 113 ETKCDTEDAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSIC 172
             K        E  V      L+G EWV+ FVREMM+AS+M DAKARA+R LEALEKSI 
Sbjct: 117 SAK-------EEPNVLN----LDGTEWVELFVREMMNASDMKDAKARAARALEALEKSIN 165

Query: 173 ERSRSET 179
            R+ ++ 
Sbjct: 166 ARTGTDA 172


>AT5G02510.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: Ubiquitin system component Cue protein
           (TAIR:AT5G32440.1); Has 166 Blast hits to 166 proteins
           in 31 species: Archae - 0; Bacteria - 4; Metazoa - 3;
           Fungi - 0; Plants - 142; Viruses - 0; Other Eukaryotes -
           17 (source: NCBI BLink). | chr5:560135-560860 FORWARD
           LENGTH=179
          Length = 179

 Score =  110 bits (274), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 54/135 (40%), Positives = 90/135 (66%), Gaps = 1/135 (0%)

Query: 134 LNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSICERSRSETERENMMLKEQVEAL 193
           ++GA+WVD  V EM  A N+DD + R + +LEALE  I + + +  + E   +KE +++L
Sbjct: 45  IDGAKWVDRLVSEMTKAINIDDMRRRVAVILEALESIIKKNTNASKKLEYASMKESLQSL 104

Query: 194 IQENIILKRALCIQHERQKEYENKNQELKDLKQLVSQYQEQLRTLEVNNYALTMHLKQA- 252
           I +N ILKR +  QH+R  E E K +++  L+ +V QYQEQ+  LE++NYA+ +HL+++ 
Sbjct: 105 INDNQILKRVIANQHQRSSENEEKAKQVLHLRGVVGQYQEQVHKLELSNYAMKLHLQRSQ 164

Query: 253 EQSSSIPGHFHPDVF 267
           +Q +S  G+  PD++
Sbjct: 165 QQQTSFSGNLPPDIY 179


>AT1G80040.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           chloroplast; EXPRESSED IN: 24 plant structures;
           EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
           DOMAIN/s: Ubiquitin system component Cue
           (InterPro:IPR003892); BEST Arabidopsis thaliana protein
           match is: Ubiquitin system component Cue protein
           (TAIR:AT5G32440.3); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr1:30110059-30111051 REVERSE LENGTH=180
          Length = 180

 Score = 82.8 bits (203), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 90/179 (50%), Gaps = 25/179 (13%)

Query: 1   MSAIIVCGKRSALFQPSSPPISTKRIRC-XXXXXXXXXXXXXXLLHHLTTLFPFMDPQLL 59
           MSA+  CG + + F  +S P S+KR RC                L  L + FP ++  +L
Sbjct: 1   MSAVY-CGTKRSYFDDNSSPPSSKRFRCFSPSNSPIWSSPPSSSLDQLHSAFPHIELTVL 59

Query: 60  EKALEECGNDLDSAIRSLNELRLGGSVHQSIDSAATGSDHVAVDLNDQPQSQGETKCDTE 119
            KALE+ G+D ++A++SL              S A+  +  A +L     +  ET     
Sbjct: 60  VKALEDNGSDFNAAMKSLY-------------SFASSEEKKAEELAAGGAATQET----- 101

Query: 120 DAASEDQVAGQSYPLNGAEWVDHFVREMMSASNMDDAKARASRVLEALEKSICERSRSE 178
                D V G + P +G +WV+  VRE++ +S  DDAK RA+RVLEALEK +  R+R E
Sbjct: 102 -----DAVCGGNPPTSGDDWVELLVREVLQSSGTDDAKVRAARVLEALEKMLSARAREE 155
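
The percentages on the Identities/Positives/Gaps lines in the alignments above appear to be truncated rather than rounded: for example, 197/279 is about 70.6% but is printed as 70%. The following Python sketch checks that reading against all seven statistics lines. The regular expression and the name `check_stat_line` are illustrative assumptions, not part of BLAST.

```python
import re

# Matches e.g. "Identities = 152/279 (54%)" on a statistics line.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stat_line(line):
    """Return {label: (count, length, reported_pct, floor_pct)} for one line."""
    result = {}
    for label, count, length, pct in STAT.findall(line):
        count, length, pct = int(count), int(length), int(pct)
        # Integer division gives the truncated (floor) percentage.
        result[label] = (count, length, pct, count * 100 // length)
    return result


stat_lines = [
    " Identities = 152/279 (54%), Positives = 197/279 (70%), Gaps = 26/279 (9%)",
    " Identities = 152/279 (54%), Positives = 196/279 (70%), Gaps = 27/279 (9%)",
    " Identities = 111/272 (40%), Positives = 157/272 (57%), Gaps = 29/272 (10%)",
    " Identities = 112/283 (39%), Positives = 158/283 (55%), Gaps = 40/283 (14%)",
    " Identities = 91/187 (48%), Positives = 115/187 (61%), Gaps = 23/187 (12%)",
    " Identities = 54/135 (40%), Positives = 90/135 (66%), Gaps = 1/135 (0%)",
    " Identities = 61/179 (34%), Positives = 90/179 (50%), Gaps = 25/179 (13%)",
]

parsed = [check_stat_line(line) for line in stat_lines]

# True when every reported percentage equals the truncated ratio.
all_truncated = all(
    reported == floored
    for stats in parsed
    for (_count, _length, reported, floored) in stats.values()
)
print(all_truncated)
```

When comparing hits, the raw counts (such as 152/279) are therefore more precise than the printed percentages.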