Miyakogusa Predicted Gene

Lj5g3v0539650.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0539650.1 Non Chatacterized Hit- tr|K3YW60|K3YW60_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si018492,49.57,3e-19,DUF4050,Domain of unknown function
DUF4050,CUFF.53278.1
         (140 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   156   3e-39
AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   156   3e-39
AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   143   4e-35
AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   135   1e-32
AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   135   1e-32
AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   119   6e-28
AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   119   6e-28
AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   105   7e-24
AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    84   2e-17
AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    84   2e-17
AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    78   2e-15

>AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
           LENGTH=169
          Length = 169

 Score =  156 bits (395), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 81/140 (57%), Positives = 101/140 (72%), Gaps = 4/140 (2%)

Query: 1   MDEASKGLRNQGQTVTKDDGSSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSAC 60
           +DE SKGLR QG+ V K   S DFWS++T EMD++  QS RS+SS + +N      S++ 
Sbjct: 34  VDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSISFTNN----TSTSA 89

Query: 61  SQTAPPEFVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPI 120
           S + P EFVNHGL  W Q RQQW+ N TS+ + +VREP IS NATY++LLG NK F +PI
Sbjct: 90  STSNPTEFVNHGLNLWNQTRQQWLANGTSQKKAKVREPTISWNATYESLLGMNKRFSRPI 149

Query: 121 PLREMVYFLVDIWEQEGLYD 140
           PL EMV FLVD+WEQEGLYD
Sbjct: 150 PLPEMVDFLVDVWEQEGLYD 169


>AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:8799934-8802333
           REVERSE LENGTH=169
          Length = 169

 Score =  156 bits (395), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 81/140 (57%), Positives = 101/140 (72%), Gaps = 4/140 (2%)

Query: 1   MDEASKGLRNQGQTVTKDDGSSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSAC 60
           +DE SKGLR QG+ V K   S DFWS++T EMD++  QS RS+SS + +N      S++ 
Sbjct: 34  VDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSISFTNN----TSTSA 89

Query: 61  SQTAPPEFVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPI 120
           S + P EFVNHGL  W Q RQQW+ N TS+ + +VREP IS NATY++LLG NK F +PI
Sbjct: 90  STSNPTEFVNHGLNLWNQTRQQWLANGTSQKKAKVREPTISWNATYESLLGMNKRFSRPI 149

Query: 121 PLREMVYFLVDIWEQEGLYD 140
           PL EMV FLVD+WEQEGLYD
Sbjct: 150 PLPEMVDFLVDVWEQEGLYD 169


>AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25360.2);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr4:15615184-15616200
           REVERSE LENGTH=161
          Length = 161

 Score =  143 bits (361), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 77/140 (55%), Positives = 94/140 (67%), Gaps = 11/140 (7%)

Query: 1   MDEASKGLRNQGQTVTKDDGSSD-FWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSA 59
           +DE SKGL+ QG+ V KD  SSD FWS++T +MDHN          + +SN P DPQ   
Sbjct: 31  VDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNIT------IQSQSSNPPFDPQ--- 81

Query: 60  CSQTAPPEFVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQP 119
           CS +   EFVNHGL+ W   RQQW    T + Q  V EP IS N+TYD+LL  NK FPQP
Sbjct: 82  CSTSNSTEFVNHGLILWNHTRQQWRECLTRQ-QCLVPEPAISWNSTYDSLLSTNKLFPQP 140

Query: 120 IPLREMVYFLVDIWEQEGLY 139
           IPL+EMV+FLVD+WE+EGLY
Sbjct: 141 IPLKEMVHFLVDVWEEEGLY 160


>AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:5340243-5341216 FORWARD LENGTH=161
          Length = 161

 Score =  135 bits (340), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 67/121 (55%), Positives = 86/121 (71%), Gaps = 4/121 (3%)

Query: 20  GSSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQM 79
            S DFW++ T++M+ NA  S   VSS +T+N   D Q    S   P EFVNHGL+ W Q 
Sbjct: 43  ASEDFWTNTTLDMESNAHGS---VSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQT 99

Query: 80  RQQWVGNKTSESQTEV-REPKISLNATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGL 138
           RQQWVG+K SES+  V REP ++ N TY++LLG+NK FP+PIPL EMV FLV++WE+EGL
Sbjct: 100 RQQWVGDKRSESRKSVGREPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGL 159

Query: 139 Y 139
           Y
Sbjct: 160 Y 160


>AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
           LENGTH=162
          Length = 162

 Score =  135 bits (339), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 67/121 (55%), Positives = 86/121 (71%), Gaps = 4/121 (3%)

Query: 20  GSSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQM 79
            S DFW++ T++M+ NA  S   VSS +T+N   D Q    S   P EFVNHGL+ W Q 
Sbjct: 44  ASEDFWTNTTLDMESNAHGS---VSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQT 100

Query: 80  RQQWVGNKTSESQTEV-REPKISLNATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGL 138
           RQQWVG+K SES+  V REP ++ N TY++LLG+NK FP+PIPL EMV FLV++WE+EGL
Sbjct: 101 RQQWVGDKRSESRKSVGREPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGL 160

Query: 139 Y 139
           Y
Sbjct: 161 Y 161


>AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score =  119 bits (298), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 68/122 (55%), Positives = 86/122 (70%), Gaps = 5/122 (4%)

Query: 21  SSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQMR 80
           S DFWS++T++MD+    S  S+SS   SN   D QS+A +  APPE+VN GLL W Q R
Sbjct: 36  SEDFWSTSTVDMDNITFPSQGSLSS---SNQTFDSQSAARNSNAPPEYVNQGLLLWNQTR 92

Query: 81  QQWVG-NKTSESQTEVREPKISLN-ATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGL 138
           ++WVG +K +      +  K++ N ATYD+LLG+NK FPQPIPL EMV FLVDIWEQEGL
Sbjct: 93  ERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 152

Query: 139 YD 140
           YD
Sbjct: 153 YD 154


>AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score =  119 bits (298), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 68/122 (55%), Positives = 86/122 (70%), Gaps = 5/122 (4%)

Query: 21  SSDFWSSNTIEMDHNAAQSLRSVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQMR 80
           S DFWS++T++MD+    S  S+SS   SN   D QS+A +  APPE+VN GLL W Q R
Sbjct: 36  SEDFWSTSTVDMDNITFPSQGSLSS---SNQTFDSQSAARNSNAPPEYVNQGLLLWNQTR 92

Query: 81  QQWVG-NKTSESQTEVREPKISLN-ATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGL 138
           ++WVG +K +      +  K++ N ATYD+LLG+NK FPQPIPL EMV FLVDIWEQEGL
Sbjct: 93  ERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 152

Query: 139 YD 140
           YD
Sbjct: 153 YD 154


>AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
           hits to 145 proteins in 25 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
           Other Eukaryotes - 7 (source: NCBI BLink). |
           chr1:5278481-5279056 REVERSE LENGTH=108
          Length = 108

 Score =  105 bits (263), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 58/101 (57%), Positives = 72/101 (71%), Gaps = 2/101 (1%)

Query: 42  SVSSTATSNYPSDPQSSACSQTAPPEFVNHGLLHWYQMRQQWVG-NKTSESQTEVREPKI 100
           S  S ++SN   D QS+A +  APPE+VN GLL W Q R++WVG +K +      +  K+
Sbjct: 8   SQGSLSSSNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKL 67

Query: 101 SLN-ATYDNLLGNNKPFPQPIPLREMVYFLVDIWEQEGLYD 140
           + N ATYD+LLG+NK FPQPIPL EMV FLVDIWEQEGLYD
Sbjct: 68  NWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 108


>AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 84.3 bits (207), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 36/73 (49%), Positives = 51/73 (69%)

Query: 68  FVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPIPLREMVY 127
           FVNH  + W +MR++WVG+ ++ +     EP I  NATY++LL +N PF +PIPL EMV 
Sbjct: 26  FVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYEDLLTSNTPFNKPIPLAEMVD 85

Query: 128 FLVDIWEQEGLYD 140
           FL DIW  +GL++
Sbjct: 86  FLFDIWHGDGLFE 98


>AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 84.3 bits (207), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 36/73 (49%), Positives = 51/73 (69%)

Query: 68  FVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPIPLREMVY 127
           FVNH  + W +MR++WVG+ ++ +     EP I  NATY++LL +N PF +PIPL EMV 
Sbjct: 26  FVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYEDLLTSNTPFNKPIPLAEMVD 85

Query: 128 FLVDIWEQEGLYD 140
           FL DIW  +GL++
Sbjct: 86  FLFDIWHGDGLFE 98


>AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
           LENGTH=112
          Length = 112

 Score = 77.8 bits (190), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 34/73 (46%), Positives = 50/73 (68%)

Query: 68  FVNHGLLHWYQMRQQWVGNKTSESQTEVREPKISLNATYDNLLGNNKPFPQPIPLREMVY 127
            VNHG   W + R++WVG+++ + +   ++  IS + TY++LL  ++PF + IPL EMV 
Sbjct: 40  LVNHGAKMWQENREKWVGDQSRQRKNTAKDQIISWSTTYEDLLSTHEPFSESIPLPEMVD 99

Query: 128 FLVDIWEQEGLYD 140
           FLVDIW  EGLYD
Sbjct: 100 FLVDIWYDEGLYD 112