Miyakogusa Predicted Gene

Lj5g3v1794690.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v1794690.1 Non Characterized Hit- tr|B9SUC3|B9SUC3_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,50.48,1e-18,DUF4050,Domain of unknown function DUF4050,CUFF.55944.1
         (119 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    90   3e-19
AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    69   6e-13
AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    69   6e-13
AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    61   2e-10
AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    61   2e-10
AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    56   5e-09
AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    56   5e-09
AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    53   4e-08
AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    49   8e-07
AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   9e-07
AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   9e-07

>AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
           LENGTH=112
          Length = 112

 Score = 90.1 bits (222), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 42/75 (56%), Positives = 52/75 (69%)

Query: 31  KAYRSSHSNGKQNLKQTSNFVNHAAIAWHENRKRWVGDKSRHPPREAKDPIISWSTSYEE 90
           K  +   S+ K N + T   VNH A  W ENR++WVGD+SR     AKD IISWST+YE+
Sbjct: 21  KLVKDEKSSVKTNSENTLTLVNHGAKMWQENREKWVGDQSRQRKNTAKDQIISWSTTYED 80

Query: 91  LLSTNEPFAEPIPLP 105
           LLST+EPF+E IPLP
Sbjct: 81  LLSTHEPFSESIPLP 95


>AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 68.9 bits (167), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 50  FVNHAAIAWHENRKRWVGDKSRHPPREAKDPIISWSTSYEELLSTNEPFAEPIPL 104
           FVNHA IAW E RK+WVGD S        +P+I ++ +YE+LL++N PF +PIPL
Sbjct: 26  FVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYEDLLTSNTPFNKPIPL 80


>AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 68.9 bits (167), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 50  FVNHAAIAWHENRKRWVGDKSRHPPREAKDPIISWSTSYEELLSTNEPFAEPIPL 104
           FVNHA IAW E RK+WVGD S        +P+I ++ +YE+LL++N PF +PIPL
Sbjct: 26  FVNHAEIAWQEMRKKWVGDPSNRTSEMPDEPVIGFNATYEDLLTSNTPFNKPIPL 80


>AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
           LENGTH=169
          Length = 169

 Score = 60.8 bits (146), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 50/91 (54%), Gaps = 7/91 (7%)

Query: 15  TLKHSKSADEVKRSNEKAYRSSHSNGKQNLKQTSNFVNHAAIAWHENRKRWVGDKSRHPP 74
           TL+  +S   +  +N  +  +S SN  +       FVNH    W++ R++W+ + +    
Sbjct: 69  TLQSQRSMSSISFTNNTSTSASTSNPTE-------FVNHGLNLWNQTRQQWLANGTSQKK 121

Query: 75  REAKDPIISWSTSYEELLSTNEPFAEPIPLP 105
            + ++P ISW+ +YE LL  N+ F+ PIPLP
Sbjct: 122 AKVREPTISWNATYESLLGMNKRFSRPIPLP 152


>AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:8799934-8802333
           REVERSE LENGTH=169
          Length = 169

 Score = 60.8 bits (146), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 50/91 (54%), Gaps = 7/91 (7%)

Query: 15  TLKHSKSADEVKRSNEKAYRSSHSNGKQNLKQTSNFVNHAAIAWHENRKRWVGDKSRHPP 74
           TL+  +S   +  +N  +  +S SN  +       FVNH    W++ R++W+ + +    
Sbjct: 69  TLQSQRSMSSISFTNNTSTSASTSNPTE-------FVNHGLNLWNQTRQQWLANGTSQKK 121

Query: 75  REAKDPIISWSTSYEELLSTNEPFAEPIPLP 105
            + ++P ISW+ +YE LL  N+ F+ PIPLP
Sbjct: 122 AKVREPTISWNATYESLLGMNKRFSRPIPLP 152


>AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:5340243-5341216 FORWARD LENGTH=161
          Length = 161

 Score = 56.2 bits (134), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 39/57 (68%), Gaps = 3/57 (5%)

Query: 50  FVNHAAIAWHENRKRWVGDKSRHPPRE--AKDPIISWSTSYEELLSTNEPFAEPIPL 104
           FVNH  + W++ R++WVGDK R   R+   ++PI++ + +YE LL +N+ F  PIPL
Sbjct: 88  FVNHGLVLWNQTRQQWVGDK-RSESRKSVGREPILNENVTYESLLGSNKRFPRPIPL 143


>AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
           LENGTH=162
          Length = 162

 Score = 56.2 bits (134), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 39/57 (68%), Gaps = 3/57 (5%)

Query: 50  FVNHAAIAWHENRKRWVGDKSRHPPRE--AKDPIISWSTSYEELLSTNEPFAEPIPL 104
           FVNH  + W++ R++WVGDK R   R+   ++PI++ + +YE LL +N+ F  PIPL
Sbjct: 89  FVNHGLVLWNQTRQQWVGDK-RSESRKSVGREPILNENVTYESLLGSNKRFPRPIPL 144


>AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25360.2);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr4:15615184-15616200
           REVERSE LENGTH=161
          Length = 161

 Score = 52.8 bits (125), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 37/58 (63%), Gaps = 1/58 (1%)

Query: 47  TSNFVNHAAIAWHENRKRWVGDKSRHPPREAKDPIISWSTSYEELLSTNEPFAEPIPL 104
           ++ FVNH  I W+  R++W    +R       +P ISW+++Y+ LLSTN+ F +PIPL
Sbjct: 87  STEFVNHGLILWNHTRQQWRECLTRQQCL-VPEPAISWNSTYDSLLSTNKLFPQPIPL 143


>AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
           hits to 145 proteins in 25 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
           Other Eukaryotes - 7 (source: NCBI BLink). |
           chr1:5278481-5279056 REVERSE LENGTH=108
          Length = 108

 Score = 48.5 bits (114), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 37/57 (64%), Gaps = 2/57 (3%)

Query: 50  FVNHAAIAWHENRKRWVG-DKSRHPPREAKDPIISWST-SYEELLSTNEPFAEPIPL 104
           +VN   + W++ R+RWVG DK  +P    +   ++W+T +Y+ LL +N+ F +PIPL
Sbjct: 34  YVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPL 90


>AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score = 48.5 bits (114), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 37/57 (64%), Gaps = 2/57 (3%)

Query: 50  FVNHAAIAWHENRKRWVG-DKSRHPPREAKDPIISWST-SYEELLSTNEPFAEPIPL 104
           +VN   + W++ R+RWVG DK  +P    +   ++W+T +Y+ LL +N+ F +PIPL
Sbjct: 80  YVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPL 136


>AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score = 48.5 bits (114), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 37/57 (64%), Gaps = 2/57 (3%)

Query: 50  FVNHAAIAWHENRKRWVG-DKSRHPPREAKDPIISWST-SYEELLSTNEPFAEPIPL 104
           +VN   + W++ R+RWVG DK  +P    +   ++W+T +Y+ LL +N+ F +PIPL
Sbjct: 80  YVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPL 136
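

The one-line hit summaries in the "Sequences producing significant alignments" table of this report can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: the function name parse_hit_line is hypothetical, and it assumes the subject identifier is the first whitespace-delimited token and that the bit score and E-value are the last two tokens on the line, as they are in the table above.

```python
def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value) from one summary-table line.

    Assumption: the line follows the layout used in this report, i.e.
    identifier first, bit score and E-value as the final two tokens.
    """
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("not a hit summary line: %r" % line)

    subject_id = fields[0]
    bit_score = float(fields[-2])

    # Legacy BLAST may print very small E-values without a leading digit
    # (for example "e-102"); prefix "1" so float() accepts the token.
    evalue_text = fields[-1]
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    e_value = float(evalue_text)

    return subject_id, bit_score, e_value


if __name__ == "__main__":
    example = ("AT3G54880.1 | Symbols:  | unknown protein; "
               "BEST Arabidopsis thal...    90   3e-19")
    print(parse_hit_line(example))
```

Because the description column is free text that may itself contain numbers, the sketch reads the score and E-value from the right-hand end of the line rather than by fixed column position.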