Miyakogusa Predicted Gene

Lj1g3v1911640.5
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v1911640.5 Non Chatacterized Hit- tr|K3YW60|K3YW60_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si018492,38.69,2e-19,DUF4050,Domain of unknown function
DUF4050,CUFF.28111.5
         (170 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   155   1e-38
AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   155   1e-38
AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   120   7e-28
AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   119   7e-28
AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   115   2e-26
AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   114   4e-26
AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   114   4e-26
AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    95   2e-20
AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    87   4e-18
AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    78   3e-15
AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    78   3e-15

>AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
           LENGTH=169
          Length = 169

 Score =  155 bits (392), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 88/172 (51%), Positives = 115/172 (66%), Gaps = 8/172 (4%)

Query: 1   MHICRYVPPWLSQILACMGGCLGCFPKP---ISMDEASKGLITQGSAINNYDRSEDIWSS 57
           MH+   +P W+ Q+  CMGGC GC  KP   +++DE SKGL  QG  +     SED WS+
Sbjct: 1   MHLREIIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWST 60

Query: 58  SSFYMDHSAGYSQRSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGN 117
           S+  MD+S   SQRS SSI+  N+ S   S++     +P EFVNHGL LW QTR+QW+ N
Sbjct: 61  STCEMDNSTLQSQRSMSSISFTNNTSTSASTS-----NPTEFVNHGLNLWNQTRQQWLAN 115

Query: 118 KKTERRMQVGESRISWNATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
             ++++ +V E  ISWNATYESLLG  + F +PIPL EMV+FLVD+WE EGL
Sbjct: 116 GTSQKKAKVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGL 167


>AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:8799934-8802333
           REVERSE LENGTH=169
          Length = 169

 Score =  155 bits (392), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 88/172 (51%), Positives = 115/172 (66%), Gaps = 8/172 (4%)

Query: 1   MHICRYVPPWLSQILACMGGCLGCFPKP---ISMDEASKGLITQGSAINNYDRSEDIWSS 57
           MH+   +P W+ Q+  CMGGC GC  KP   +++DE SKGL  QG  +     SED WS+
Sbjct: 1   MHLREIIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWST 60

Query: 58  SSFYMDHSAGYSQRSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGN 117
           S+  MD+S   SQRS SSI+  N+ S   S++     +P EFVNHGL LW QTR+QW+ N
Sbjct: 61  STCEMDNSTLQSQRSMSSISFTNNTSTSASTS-----NPTEFVNHGLNLWNQTRQQWLAN 115

Query: 118 KKTERRMQVGESRISWNATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
             ++++ +V E  ISWNATYESLLG  + F +PIPL EMV+FLVD+WE EGL
Sbjct: 116 GTSQKKAKVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGL 167


>AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25360.2);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr4:15615184-15616200
           REVERSE LENGTH=161
          Length = 161

 Score =  120 bits (300), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 71/155 (45%), Positives = 88/155 (56%), Gaps = 16/155 (10%)

Query: 20  GCLGCFPKP----ISMDEASKGLITQGSAIN-NYDRSEDIWSSSSFYMDHSAGYSQRSFS 74
            C GC  +     + +DE SKGL  QG  +  +   S+D WS+S+  MDH+     +S  
Sbjct: 16  NCFGCCNRERRLVVEVDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNITIQSQS-- 73

Query: 75  SITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVGESRISWN 134
                N P DPQ S  N      EFVNHGL LW  TR+QW     T ++  V E  ISWN
Sbjct: 74  ----SNPPFDPQCSTSNST----EFVNHGLILWNHTRQQW-RECLTRQQCLVPEPAISWN 124

Query: 135 ATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
           +TY+SLL T + F QPIPL EMV FLVD+WE EGL
Sbjct: 125 STYDSLLSTNKLFPQPIPLKEMVHFLVDVWEEEGL 159


>AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:5340243-5341216 FORWARD LENGTH=161
          Length = 161

 Score =  119 bits (299), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 70/163 (42%), Positives = 93/163 (57%), Gaps = 15/163 (9%)

Query: 18  MGGCLGCFPKPISMDEASKGLITQGSAINNYD----------RSEDIWSSSSFYMDHSAG 67
           MG CL CF K  +       L      +   D           SED W++++  M+ +A 
Sbjct: 1   MGSCLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA- 59

Query: 68  YSQRSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVG 127
               S SSI+  N   D Q   G+  + P EFVNHGL LW QTR+QWVG+K++E R  VG
Sbjct: 60  --HGSVSSISTTNLTIDSQG-CGSSSNEPAEFVNHGLVLWNQTRQQWVGDKRSESRKSVG 116

Query: 128 -ESRISWNATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
            E  ++ N TYESLLG+ + F +PIPL EMV+FLV++WE EGL
Sbjct: 117 REPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGL 159


>AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
           LENGTH=162
          Length = 162

 Score =  115 bits (288), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 68/160 (42%), Positives = 91/160 (56%), Gaps = 15/160 (9%)

Query: 21  CLGCFPKPISMDEASKGLITQGSAINNYD----------RSEDIWSSSSFYMDHSAGYSQ 70
           CL CF K  +       L      +   D           SED W++++  M+ +A    
Sbjct: 5   CLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA---H 61

Query: 71  RSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVG-ES 129
            S SSI+  N   D Q   G+  + P EFVNHGL LW QTR+QWVG+K++E R  VG E 
Sbjct: 62  GSVSSISTTNLTIDSQG-CGSSSNEPAEFVNHGLVLWNQTRQQWVGDKRSESRKSVGREP 120

Query: 130 RISWNATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
            ++ N TYESLLG+ + F +PIPL EMV+FLV++WE EGL
Sbjct: 121 ILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGL 160


>AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score =  114 bits (284), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 94/156 (60%), Gaps = 8/156 (5%)

Query: 18  MGGCLGCFPKPISMDEASKGLITQGSA--INNYDRSEDIWSSSSFYMDHSAGYSQRSFSS 75
           MGGC+GC+ +  S   + K   +   A        SED WS+S+  MD+    SQ S SS
Sbjct: 1   MGGCVGCYREHRSTAASLKDPPSNSIARPCKKPSVSEDFWSTSTVDMDNITFPSQGSLSS 60

Query: 76  ITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVGE-SRISWN 134
               N   D QS+A N  + P E+VN GL LW QTR++WVG  K    +   + ++++WN
Sbjct: 61  S---NQTFDSQSAARNS-NAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWN 116

Query: 135 -ATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
            ATY+SLLG+ + F QPIPL EMV+FLVDIWE EGL
Sbjct: 117 TATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 152


>AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score =  114 bits (284), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 94/156 (60%), Gaps = 8/156 (5%)

Query: 18  MGGCLGCFPKPISMDEASKGLITQGSA--INNYDRSEDIWSSSSFYMDHSAGYSQRSFSS 75
           MGGC+GC+ +  S   + K   +   A        SED WS+S+  MD+    SQ S SS
Sbjct: 1   MGGCVGCYREHRSTAASLKDPPSNSIARPCKKPSVSEDFWSTSTVDMDNITFPSQGSLSS 60

Query: 76  ITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVGE-SRISWN 134
               N   D QS+A N  + P E+VN GL LW QTR++WVG  K    +   + ++++WN
Sbjct: 61  S---NQTFDSQSAARNS-NAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWN 116

Query: 135 -ATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
            ATY+SLLG+ + F QPIPL EMV+FLVDIWE EGL
Sbjct: 117 TATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 152


>AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
           hits to 145 proteins in 25 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
           Other Eukaryotes - 7 (source: NCBI BLink). |
           chr1:5278481-5279056 REVERSE LENGTH=108
          Length = 108

 Score = 95.1 bits (235), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 56/110 (50%), Positives = 72/110 (65%), Gaps = 6/110 (5%)

Query: 62  MDHSAGYSQRSFSSITIPNHPSDPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTE 121
           MD+    SQ S SS    N   D QS+A N  + P E+VN GL LW QTR++WVG  K  
Sbjct: 1   MDNITFPSQGSLSS---SNQTFDSQSAARNS-NAPPEYVNQGLLLWNQTRERWVGKDKPN 56

Query: 122 RRMQVGE-SRISWN-ATYESLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
             +   + ++++WN ATY+SLLG+ + F QPIPL EMV+FLVDIWE EGL
Sbjct: 57  NPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGL 106


>AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
           LENGTH=112
          Length = 112

 Score = 87.4 bits (215), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 39/86 (45%), Positives = 56/86 (65%)

Query: 84  DPQSSAGNQIDHPEEFVNHGLRLWKQTRKQWVGNKKTERRMQVGESRISWNATYESLLGT 143
           D +SS     ++    VNHG ++W++ R++WVG++  +R+    +  ISW+ TYE LL T
Sbjct: 25  DEKSSVKTNSENTLTLVNHGAKMWQENREKWVGDQSRQRKNTAKDQIISWSTTYEDLLST 84

Query: 144 TRPFRQPIPLGEMVEFLVDIWELEGL 169
             PF + IPL EMV+FLVDIW  EGL
Sbjct: 85  HEPFSESIPLPEMVDFLVDIWYDEGL 110


>AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 77.8 bits (190), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 52/91 (57%), Gaps = 2/91 (2%)

Query: 81  HPSDPQSSAGNQIDHPEE--FVNHGLRLWKQTRKQWVGNKKTERRMQVGESRISWNATYE 138
            P +  S++ N  +   E  FVNH    W++ RK+WVG+          E  I +NATYE
Sbjct: 6   EPGNVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYE 65

Query: 139 SLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
            LL +  PF +PIPL EMV+FL DIW  +GL
Sbjct: 66  DLLTSNTPFNKPIPLAEMVDFLFDIWHGDGL 96


>AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 77.8 bits (190), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 52/91 (57%), Gaps = 2/91 (2%)

Query: 81  HPSDPQSSAGNQIDHPEE--FVNHGLRLWKQTRKQWVGNKKTERRMQVGESRISWNATYE 138
            P +  S++ N  +   E  FVNH    W++ RK+WVG+          E  I +NATYE
Sbjct: 6   EPGNVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYE 65

Query: 139 SLLGTTRPFRQPIPLGEMVEFLVDIWELEGL 169
            LL +  PF +PIPL EMV+FL DIW  +GL
Sbjct: 66  DLLTSNTPFNKPIPLAEMVDFLFDIWHGDGL 96