Miyakogusa Predicted Gene

Lj6g3v0024590.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v0024590.1 CUFF.57382.1
         (111 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G33430.1 | Symbols: DAL1, DAL | differentiation and greening-...   133   2e-32
AT2G35240.1 | Symbols:  | plastid developmental protein DAG, put...   133   3e-32
AT1G32580.1 | Symbols:  | plastid developmental protein DAG, put...   131   1e-31
AT1G11430.1 | Symbols:  | plastid developmental protein DAG, put...    83   3e-17
AT3G06790.2 | Symbols:  | plastid developmental protein DAG, put...    80   4e-16
AT4G20020.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    79   4e-16
AT4G20020.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    79   5e-16
AT3G06790.1 | Symbols:  | plastid developmental protein DAG, put...    79   8e-16
AT3G15000.1 | Symbols:  | cobalt ion binding | chr3:5050321-5052...    70   3e-13
AT1G72530.1 | Symbols:  | plastid developmental protein DAG, put...    60   2e-10
AT5G44780.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    60   3e-10
AT1G53260.2 | Symbols:  | LOCATED IN: endomembrane system; BEST ...    56   6e-09
AT1G72530.2 | Symbols:  | plastid developmental protein DAG, put...    55   1e-08
AT1G53260.1 | Symbols:  | LOCATED IN: endomembrane system; BEST ...    53   4e-08

>AT2G33430.1 | Symbols: DAL1, DAL | differentiation and
           greening-like 1 | chr2:14162732-14164729 FORWARD
           LENGTH=219
          Length = 219

 Score =  133 bits (335), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 65/97 (67%), Positives = 76/97 (78%), Gaps = 1/97 (1%)

Query: 12  CCLSHIFFLVFSEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDP 71
           C +  +  +V SEEEAKKRIY++SCE+YLGFGCEIDEETS KL+G  G++FVLPDSYVDP
Sbjct: 107 CYIQTLAKVVGSEEEAKKRIYNVSCERYLGFGCEIDEETSTKLEGLPGVLFVLPDSYVDP 166

Query: 72  EYKDYGGELFVNGAIVQRPPERQKRFGQKQTTRKTSR 108
           E KDYG ELFVNG IVQR PERQ+R  + Q  R   R
Sbjct: 167 ENKDYGAELFVNGEIVQRSPERQRRV-EPQPQRAQDR 202


>AT2G35240.1 | Symbols:  | plastid developmental protein DAG,
           putative | chr2:14845099-14846262 REVERSE LENGTH=232
          Length = 232

 Score =  133 bits (334), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 59/97 (60%), Positives = 77/97 (79%)

Query: 12  CCLSHIFFLVFSEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDP 71
           C +  +  +V SEEEA+K+IY++SCE+Y GFGCEIDEETS+KL+G  G++FVLPDSYVDP
Sbjct: 119 CYVQTLAKIVGSEEEARKKIYNVSCERYFGFGCEIDEETSNKLEGLPGVLFVLPDSYVDP 178

Query: 72  EYKDYGGELFVNGAIVQRPPERQKRFGQKQTTRKTSR 108
           E+KDYG ELFVNG +V RPPERQ+R  +    R + +
Sbjct: 179 EFKDYGAELFVNGEVVPRPPERQRRMVELTNQRGSDK 215


>AT1G32580.1 | Symbols:  | plastid developmental protein DAG,
           putative | chr1:11784108-11785430 FORWARD LENGTH=229
          Length = 229

 Score =  131 bits (329), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 59/97 (60%), Positives = 76/97 (78%)

Query: 12  CCLSHIFFLVFSEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDP 71
           C +  +  ++ SEEEAKK+IY++SCE+Y GFGCEIDEETS+KL+G  G++F+LPDSYVD 
Sbjct: 116 CYVQTLAKIIGSEEEAKKKIYNVSCERYFGFGCEIDEETSNKLEGLPGVLFILPDSYVDQ 175

Query: 72  EYKDYGGELFVNGAIVQRPPERQKRFGQKQTTRKTSR 108
           E KDYG ELFVNG IVQRPPERQ++  +  T R   +
Sbjct: 176 ENKDYGAELFVNGEIVQRPPERQRKIIELTTQRTNDK 212


>AT1G11430.1 | Symbols:  | plastid developmental protein DAG,
           putative | chr1:3847273-3848938 FORWARD LENGTH=232
          Length = 232

 Score = 83.2 bits (204), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 37/74 (50%), Positives = 53/74 (71%)

Query: 14  LSHIFFLVFSEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEY 73
           L+ +  ++ S EEAKK +Y+ S   Y GF C IDEETS+K  G  G+++VLPDSY+D + 
Sbjct: 112 LNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTIDEETSEKFKGLPGVLWVLPDSYIDVKN 171

Query: 74  KDYGGELFVNGAIV 87
           KDYGG+ ++NG I+
Sbjct: 172 KDYGGDKYINGEII 185


>AT3G06790.2 | Symbols:  | plastid developmental protein DAG,
           putative | chr3:2144564-2145743 REVERSE LENGTH=244
          Length = 244

 Score = 79.7 bits (195), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 52/77 (67%), Gaps = 3/77 (3%)

Query: 24  EEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGELFVN 83
           EEEAKK+IYS+    Y GFG  I EE S K+    G+++VLPDSY+D   KDYGG+L+V 
Sbjct: 128 EEEAKKKIYSVCTSTYTGFGALISEELSCKVKALPGVLWVLPDSYLDVPNKDYGGDLYVE 187

Query: 84  GAIVQRPPERQKRFGQK 100
           G ++ RP   Q RF ++
Sbjct: 188 GKVIPRP---QYRFTEQ 201


>AT4G20020.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G44780.1); Has 28928 Blast hits to 16023
           proteins in 1033 species: Archae - 4; Bacteria - 4155;
           Metazoa - 15463; Fungi - 2938; Plants - 3091; Viruses -
           205; Other Eukaryotes - 3072 (source: NCBI BLink). |
           chr4:10844433-10846085 REVERSE LENGTH=419
          Length = 419

 Score = 79.3 bits (194), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 35/75 (46%), Positives = 49/75 (65%)

Query: 20  LVFSEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGE 79
           L  S EEAK+R+Y+ S   Y GF   + E+ S+K     G++F+LPDSY+DP+ K+YGG+
Sbjct: 120 LGISVEEAKQRMYACSTTTYQGFQAIMTEQESEKFKDLPGVVFILPDSYIDPQNKEYGGD 179

Query: 80  LFVNGAIVQRPPERQ 94
            + NG I  RPP  Q
Sbjct: 180 KYENGVITHRPPPIQ 194


>AT4G20020.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G44780.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:10844360-10846085 REVERSE LENGTH=406
          Length = 406

 Score = 79.3 bits (194), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 35/75 (46%), Positives = 49/75 (65%)

Query: 20  LVFSEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGE 79
           L  S EEAK+R+Y+ S   Y GF   + E+ S+K     G++F+LPDSY+DP+ K+YGG+
Sbjct: 120 LGISVEEAKQRMYACSTTTYQGFQAIMTEQESEKFKDLPGVVFILPDSYIDPQNKEYGGD 179

Query: 80  LFVNGAIVQRPPERQ 94
            + NG I  RPP  Q
Sbjct: 180 KYENGVITHRPPPIQ 194


>AT3G06790.1 | Symbols:  | plastid developmental protein DAG,
           putative | chr3:2144564-2145743 REVERSE LENGTH=244
          Length = 244

 Score = 78.6 bits (192), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 38/77 (49%), Positives = 52/77 (67%), Gaps = 3/77 (3%)

Query: 24  EEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGELFVN 83
           +EEAKK+IYS+    Y GFG  I EE S K+    G+++VLPDSY+D   KDYGG+L+V 
Sbjct: 128 QEEAKKKIYSVCTSTYTGFGALISEELSCKVKALPGVLWVLPDSYLDVPNKDYGGDLYVE 187

Query: 84  GAIVQRPPERQKRFGQK 100
           G ++ RP   Q RF ++
Sbjct: 188 GKVIPRP---QYRFTEQ 201


>AT3G15000.1 | Symbols:  | cobalt ion binding | chr3:5050321-5052121
           FORWARD LENGTH=395
          Length = 395

 Score = 70.1 bits (170), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 49/78 (62%)

Query: 20  LVFSEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGE 79
           +V SE+EA+ +IYS+S   Y  FG  + E+ S KL   + + +VLPDSY+D   KDYGGE
Sbjct: 127 IVGSEDEARMKIYSVSTRCYYAFGALVSEDLSHKLKELSNVRWVLPDSYLDVRNKDYGGE 186

Query: 80  LFVNGAIVQRPPERQKRF 97
            F++G  V   P+  + +
Sbjct: 187 PFIDGKAVPYDPKYHEEW 204


>AT1G72530.1 | Symbols:  | plastid developmental protein DAG,
           putative | chr1:27312999-27313937 FORWARD LENGTH=188
          Length = 188

 Score = 60.5 bits (145), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 30/65 (46%), Positives = 41/65 (63%)

Query: 23  SEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGELFV 82
           SEEEAK+ IYS+S + Y  FGC I E  + K+     + +VLPDS++      YGGE FV
Sbjct: 87  SEEEAKRSIYSVSTKYYYAFGCRIHEPLTYKIRSLPDVKWVLPDSFIVDGDNRYGGEPFV 146

Query: 83  NGAIV 87
           +G +V
Sbjct: 147 DGEVV 151


>AT5G44780.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G20020.2); Has 9661 Blast hits to 6233 proteins
           in 635 species: Archae - 4; Bacteria - 1116; Metazoa -
           4251; Fungi - 1510; Plants - 1359; Viruses - 43; Other
           Eukaryotes - 1378 (source: NCBI BLink). |
           chr5:18068100-18070544 FORWARD LENGTH=723
          Length = 723

 Score = 60.1 bits (144), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 29/71 (40%), Positives = 40/71 (56%)

Query: 20  LVFSEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGE 79
           L  S EEAKK+IY+I    Y GF   +     +K     G+ +++PDSY+D E K YGG+
Sbjct: 114 LAISLEEAKKKIYAICTTSYQGFQATMTIGEVEKFRDLPGVQYIIPDSYIDVENKVYGGD 173

Query: 80  LFVNGAIVQRP 90
            + NG I   P
Sbjct: 174 KYENGVITPGP 184


>AT1G53260.2 | Symbols:  | LOCATED IN: endomembrane system; BEST
          Arabidopsis thaliana protein match is: cobalt ion
          binding (TAIR:AT3G15000.1); Has 246 Blast hits to 241
          proteins in 32 species: Archae - 0; Bacteria - 2;
          Metazoa - 7; Fungi - 16; Plants - 212; Viruses - 1;
          Other Eukaryotes - 8 (source: NCBI BLink). |
          chr1:19859406-19860421 REVERSE LENGTH=230
          Length = 230

 Score = 55.8 bits (133), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 42/68 (61%)

Query: 30 RIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGELFVNGAIVQR 89
          +IYS+S + Y  FG  + E+ S K+     + +VLPDSY+D + KDYGGE F++G  V  
Sbjct: 2  KIYSVSHKCYFAFGALVSEDLSHKIKELPKVKWVLPDSYLDGKNKDYGGEPFIDGKAVPY 61

Query: 90 PPERQKRF 97
           P+  + +
Sbjct: 62 DPKYHEEW 69


>AT1G72530.2 | Symbols:  | plastid developmental protein DAG,
           putative | chr1:27312999-27313937 FORWARD LENGTH=192
          Length = 192

 Score = 55.1 bits (131), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 41/69 (59%), Gaps = 4/69 (5%)

Query: 23  SEEEAKKRIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYG----G 78
           SEEEAK+ IYS+S + Y  FGC I E  + K+     + +VLPDS++      YG    G
Sbjct: 87  SEEEAKRSIYSVSTKYYYAFGCRIHEPLTYKIRSLPDVKWVLPDSFIVDGDNRYGVFFAG 146

Query: 79  ELFVNGAIV 87
           E FV+G +V
Sbjct: 147 EPFVDGEVV 155


>AT1G53260.1 | Symbols:  | LOCATED IN: endomembrane system; BEST
          Arabidopsis thaliana protein match is: cobalt ion
          binding (TAIR:AT3G15000.1); Has 32763 Blast hits to
          18534 proteins in 929 species: Archae - 22; Bacteria -
          2420; Metazoa - 15140; Fungi - 5401; Plants - 5313;
          Viruses - 485; Other Eukaryotes - 3982 (source: NCBI
          BLink). | chr1:19859393-19860421 REVERSE LENGTH=271
          Length = 271

 Score = 53.1 bits (126), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 38/58 (65%)

Query: 30 RIYSISCEKYLGFGCEIDEETSDKLDGKAGIMFVLPDSYVDPEYKDYGGELFVNGAIV 87
          +IYS+S + Y  FG  + E+ S K+     + +VLPDSY+D + KDYGGE F++G  V
Sbjct: 2  KIYSVSHKCYFAFGALVSEDLSHKIKELPKVKWVLPDSYLDGKNKDYGGEPFIDGKAV 59