Miyakogusa Predicted Gene

Lj2g3v1058260.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v1058260.1 tr|Q10CY9|Q10CY9_ORYSJ Expressed protein OS=Oryza
sativa subsp. japonica GN=LOC_Os03g53590 PE=4
SV=1,36,3e-18,Myb_DNA-bind_3,Myb/SANT-like domain;
seg,NULL,CUFF.36097.1
         (319 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    73   2e-13
AT4G02550.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    72   6e-13
AT4G02550.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...    72   6e-13
AT4G02550.4 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    71   1e-12
AT4G02550.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    71   1e-12
AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    70   2e-12
AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    63   2e-10
AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    63   2e-10
AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    54   2e-07
AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    54   2e-07
AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    53   3e-07

>AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10617263-10620034 FORWARD LENGTH=774
          Length = 774

 Score = 73.2 bits (178), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/145 (24%), Positives = 74/145 (51%), Gaps = 2/145 (1%)

Query: 6   QATTSSGNNMSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDK 65
           Q   S    + WT  MD  L++  + +   GN+V   F T A + + +  +  F  + +K
Sbjct: 316 QEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNK 375

Query: 66  SKIKNRWKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNT 125
             +KNR+K L++ ++D+  + +   +GF+W+    +  A+ ++W   IQ+ P+A + R  
Sbjct: 376 DVLKNRYKHLRRLYNDIKFLLEQ--NGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVK 433

Query: 126 SLPHYEAMVTLYGNDRATGEEAETA 150
           ++P Y  +  ++G + + G     A
Sbjct: 434 TIPSYPNLCFIFGKETSDGRYTRLA 458



 Score = 70.1 bits (170), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 60/122 (49%), Gaps = 2/122 (1%)

Query: 15  MSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNRWKT 74
           + WTR MD  L++  + + + GNK+   FT QA   +A   +  F ++ D   ++NR+  
Sbjct: 509 IEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYIL 568

Query: 75  LKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMV 134
           L K+  D+ +I    + GF W+       AE E WEA I+  P A   +  +L  Y  + 
Sbjct: 569 LMKERDDINNIL--NLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLC 626

Query: 135 TL 136
            L
Sbjct: 627 KL 628



 Score = 62.0 bits (149), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 9/141 (6%)

Query: 17  WTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNRWKTLK 76
           WT  MD   V   + +   GNK    F+ QA   +    +  F+ +  K  +++R+  L 
Sbjct: 172 WTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLL 231

Query: 77  KKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMVTL 136
           K + D+  I K    GF+W+ +  +  A+  VW++ I+  P A   R  SLP Y  + T+
Sbjct: 232 KYYKDMEAILKE--DGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTI 289

Query: 137 Y------GND-RATGEEAETA 150
           +      G D R  G  A+T+
Sbjct: 290 FACQAEQGTDHRDDGSAAQTS 310



 Score = 58.2 bits (139), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 30/148 (20%), Positives = 69/148 (46%), Gaps = 2/148 (1%)

Query: 6   QATTSSGNNMSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDK 65
           Q T +      WT +M+   ++  +     GN+    F  QA + + +  +  F  + DK
Sbjct: 4   QTTCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDK 63

Query: 66  SKIKNRWKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNT 125
             +K+R+  L K+++DV  +  +G  GF W+ +      +  +W   +++ P+A   +  
Sbjct: 64  DVLKSRYTNLWKQYNDVKCLLDHG--GFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTK 121

Query: 126 SLPHYEAMVTLYGNDRATGEEAETASEM 153
            + ++  +  +YG   A G  + ++ ++
Sbjct: 122 PVLNFSDLCLIYGYTVADGRYSMSSHDL 149


>AT4G02550.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
           - 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
           LENGTH=307
          Length = 307

 Score = 72.0 bits (175), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 70/152 (46%), Gaps = 3/152 (1%)

Query: 12  GNNMSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNR 71
           G N+ W+  MD  L+ A   +   GNKV+  F  +A+      ++  F + +   K  NR
Sbjct: 17  GRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINR 76

Query: 72  WKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEP-EVWEALIQSKPKAANCRNTSLPHY 130
            KT+KK++  + DI      GF WN ST + D E  E+W   I   P A   R   +  Y
Sbjct: 77  LKTIKKRYRVMRDILSR--DGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMY 134

Query: 131 EAMVTLYGNDRATGEEAETASEMRKRLNSTTE 162
           E + T+ G+ +  G+  +   E    LN   +
Sbjct: 135 EELRTVCGDYQTPGKYNKVKKESSHHLNDVKQ 166


>AT4G02550.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:1120622-1121674 REVERSE LENGTH=322
          Length = 322

 Score = 71.6 bits (174), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 70/152 (46%), Gaps = 3/152 (1%)

Query: 12  GNNMSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNR 71
           G N+ W+  MD  L+ A   +   GNKV+  F  +A+      ++  F + +   K  NR
Sbjct: 32  GRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINR 91

Query: 72  WKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEP-EVWEALIQSKPKAANCRNTSLPHY 130
            KT+KK++  + DI      GF WN ST + D E  E+W   I   P A   R   +  Y
Sbjct: 92  LKTIKKRYRVMRDILSR--DGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMY 149

Query: 131 EAMVTLYGNDRATGEEAETASEMRKRLNSTTE 162
           E + T+ G+ +  G+  +   E    LN   +
Sbjct: 150 EELRTVCGDYQTPGKYNKVKKESSHHLNDVKQ 181


>AT4G02550.4 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 18 plant
           structures; EXPRESSED DURING: 7 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
           LENGTH=278
          Length = 278

 Score = 70.9 bits (172), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 65/138 (47%), Gaps = 3/138 (2%)

Query: 12  GNNMSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNR 71
           G N+ W+  MD  L+ A   +   GNKV+  F  +A+      ++  F + +   K  NR
Sbjct: 17  GRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINR 76

Query: 72  WKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEP-EVWEALIQSKPKAANCRNTSLPHY 130
            KT+KK++  + DI      GF WN ST + D E  E+W   I   P A   R   +  Y
Sbjct: 77  LKTIKKRYRVMRDILSR--DGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMY 134

Query: 131 EAMVTLYGNDRATGEEAE 148
           E + T+ G+ +  G   E
Sbjct: 135 EELRTVCGDYQTPGSSEE 152


>AT4G02550.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
           - 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
           LENGTH=278
          Length = 278

 Score = 70.9 bits (172), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 65/138 (47%), Gaps = 3/138 (2%)

Query: 12  GNNMSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNR 71
           G N+ W+  MD  L+ A   +   GNKV+  F  +A+      ++  F + +   K  NR
Sbjct: 17  GRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINR 76

Query: 72  WKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEP-EVWEALIQSKPKAANCRNTSLPHY 130
            KT+KK++  + DI      GF WN ST + D E  E+W   I   P A   R   +  Y
Sbjct: 77  LKTIKKRYRVMRDILSR--DGFWWNSSTKMIDCESDELWRRYIAVNPDAKAFRGKQIEMY 134

Query: 131 EAMVTLYGNDRATGEEAE 148
           E + T+ G+ +  G   E
Sbjct: 135 EELRTVCGDYQTPGSSEE 152


>AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
           in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
           - 50 (source: NCBI BLink). | chr2:10617263-10620034
           FORWARD LENGTH=797
          Length = 797

 Score = 70.1 bits (170), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 60/122 (49%), Gaps = 2/122 (1%)

Query: 15  MSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNRWKT 74
           + WTR MD  L++  + + + GNK+   FT QA   +A   +  F ++ D   ++NR+  
Sbjct: 532 IEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYIL 591

Query: 75  LKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMV 134
           L K+  D+ +I    + GF W+       AE E WEA I+  P A   +  +L  Y  + 
Sbjct: 592 LMKERDDINNIL--NLDGFTWDVEKQTIVAEDEYWEAYIKEHPDATIYKGKTLDSYGNLC 649

Query: 135 TL 136
            L
Sbjct: 650 KL 651



 Score = 62.0 bits (149), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 9/141 (6%)

Query: 17  WTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNRWKTLK 76
           WT  MD   V   + +   GNK    F+ QA   +    +  F+ +  K  +++R+  L 
Sbjct: 172 WTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLL 231

Query: 77  KKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMVTL 136
           K + D+  I K    GF+W+ +  +  A+  VW++ I+  P A   R  SLP Y  + T+
Sbjct: 232 KYYKDMEAILKE--DGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSYNDLDTI 289

Query: 137 Y------GND-RATGEEAETA 150
           +      G D R  G  A+T+
Sbjct: 290 FACQAEQGTDHRDDGSAAQTS 310



 Score = 60.1 bits (144), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/168 (21%), Positives = 74/168 (44%), Gaps = 25/168 (14%)

Query: 6   QATTSSGNNMSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDK 65
           Q   S    + WT  MD  L++  + +   GN+V   F T A + + +  +  F  + +K
Sbjct: 316 QEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNK 375

Query: 66  SKIKNRWKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALI------------ 113
             +KNR+K L++ ++D+  + +   +GF+W+    +  A+ ++W   I            
Sbjct: 376 DVLKNRYKHLRRLYNDIKFLLEQ--NGFSWDARRDMVIADDDIWNTYIQACHILFLFKIS 433

Query: 114 -----------QSKPKAANCRNTSLPHYEAMVTLYGNDRATGEEAETA 150
                      Q+ P+A + R  ++P Y  +  ++G + + G     A
Sbjct: 434 VICLCLQMKHVQAHPEARSYRVKTIPSYPNLCFIFGKETSDGRYTRLA 481



 Score = 58.5 bits (140), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 30/148 (20%), Positives = 69/148 (46%), Gaps = 2/148 (1%)

Query: 6   QATTSSGNNMSWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDK 65
           Q T +      WT +M+   ++  +     GN+    F  QA + + +  +  F  + DK
Sbjct: 4   QTTCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDK 63

Query: 66  SKIKNRWKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNT 125
             +K+R+  L K+++DV  +  +G  GF W+ +      +  +W   +++ P+A   +  
Sbjct: 64  DVLKSRYTNLWKQYNDVKCLLDHG--GFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTK 121

Query: 126 SLPHYEAMVTLYGNDRATGEEAETASEM 153
            + ++  +  +YG   A G  + ++ ++
Sbjct: 122 PVLNFSDLCLIYGYTVADGRYSMSSHDL 149


>AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 63.2 bits (152), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 62/124 (50%), Gaps = 2/124 (1%)

Query: 16  SWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNRWKTL 75
           +W   MD   ++  + +   GN++ G F  QA   + +  +  F    D   +KNR+K+L
Sbjct: 185 TWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKSL 244

Query: 76  KKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMVT 135
           +++F+ +  I ++   GFAW+    +  A+  VW+  I++   A       +P+Y+ +  
Sbjct: 245 RRQFNAIKSILRS--DGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCV 302

Query: 136 LYGN 139
           L G+
Sbjct: 303 LCGD 306



 Score = 55.8 bits (133), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/137 (24%), Positives = 64/137 (46%), Gaps = 3/137 (2%)

Query: 17  WTRSMDDALVNAFMHEFTAGNKVNGQ-FTTQAHDRIASELSVLFAMKIDKSKIKNRWKTL 75
           WT  MD   +   + +   GN+     F+ +A   ++   +  F     K  +KNR KTL
Sbjct: 14  WTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLKNRHKTL 73

Query: 76  KKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMVT 135
           +  F  V ++      GF+W+ +  +  A+  VW+  ++  P + + R  S+P Y+ +  
Sbjct: 74  RNLFKSVNNLLIE--DGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYKDLCL 131

Query: 136 LYGNDRATGEEAETASE 152
           +Y +  +  +  E+ SE
Sbjct: 132 VYSDGMSEHKAEESISE 148


>AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
           - 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 63.2 bits (152), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 62/124 (50%), Gaps = 2/124 (1%)

Query: 16  SWTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNRWKTL 75
           +W   MD   ++  + +   GN++ G F  QA   + +  +  F    D   +KNR+K+L
Sbjct: 185 TWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKSL 244

Query: 76  KKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMVT 135
           +++F+ +  I ++   GFAW+    +  A+  VW+  I++   A       +P+Y+ +  
Sbjct: 245 RRQFNAIKSILRS--DGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCV 302

Query: 136 LYGN 139
           L G+
Sbjct: 303 LCGD 306



 Score = 55.8 bits (133), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/137 (24%), Positives = 64/137 (46%), Gaps = 3/137 (2%)

Query: 17  WTRSMDDALVNAFMHEFTAGNKVNGQ-FTTQAHDRIASELSVLFAMKIDKSKIKNRWKTL 75
           WT  MD   +   + +   GN+     F+ +A   ++   +  F     K  +KNR KTL
Sbjct: 14  WTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVLKNRHKTL 73

Query: 76  KKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMVT 135
           +  F  V ++      GF+W+ +  +  A+  VW+  ++  P + + R  S+P Y+ +  
Sbjct: 74  RNLFKSVNNLLIE--DGFSWDDTRQMVVADNCVWDEYLKIHPDSRSFRIKSIPCYKDLCL 131

Query: 136 LYGNDRATGEEAETASE 152
           +Y +  +  +  E+ SE
Sbjct: 132 VYSDGMSEHKAEESISE 148


>AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
           in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
           - 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
           LENGTH=449
          Length = 449

 Score = 53.9 bits (128), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 5/155 (3%)

Query: 6   QATTSSGNNMS---WTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMK 62
           Q+  SS N  +   W+ S     ++  + E   GN+ +  F  +    I   ++    + 
Sbjct: 156 QSMCSSSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLG 215

Query: 63  IDKSKIKNRWKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANC 122
             + ++KN W   +K +     +   G S   W+P +  + A  E W   I+  P+A   
Sbjct: 216 YTRPQLKNHWDCTRKAWKIWCQLV--GASSMKWDPESRSFGATEEEWRIYIRENPRAGQF 273

Query: 123 RNTSLPHYEAMVTLYGNDRATGEEAETASEMRKRL 157
           R+  +PH + +  ++      GE     S  RK+L
Sbjct: 274 RHKEVPHADQLAIIFNGVIEPGETYTPPSRSRKKL 308


>AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1743234-1744751
           REVERSE LENGTH=449
          Length = 449

 Score = 53.9 bits (128), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 5/155 (3%)

Query: 6   QATTSSGNNMS---WTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMK 62
           Q+  SS N  +   W+ S     ++  + E   GN+ +  F  +    I   ++    + 
Sbjct: 156 QSMCSSSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLG 215

Query: 63  IDKSKIKNRWKTLKKKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANC 122
             + ++KN W   +K +     +   G S   W+P +  + A  E W   I+  P+A   
Sbjct: 216 YTRPQLKNHWDCTRKAWKIWCQLV--GASSMKWDPESRSFGATEEEWRIYIRENPRAGQF 273

Query: 123 RNTSLPHYEAMVTLYGNDRATGEEAETASEMRKRL 157
           R+  +PH + +  ++      GE     S  RK+L
Sbjct: 274 RHKEVPHADQLAIIFNGVIEPGETYTPPSRSRKKL 308


>AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
           10 (source: NCBI BLink). | chr1:10598764-10599527
           FORWARD LENGTH=222
          Length = 222

 Score = 52.8 bits (125), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/131 (24%), Positives = 59/131 (45%), Gaps = 3/131 (2%)

Query: 17  WTRSMDDALVNAFMHEFTAGNKVNGQFTTQAHDRIASELSVLFAMKIDKSKIKNRWKTLK 76
           WT    D L+      +   + + G+ T ++  ++   L+       +     +R K LK
Sbjct: 17  WTPDETDVLIELIRQNWRDSSGIIGKLTVES--KLLPALNKRLGCNKNHKNYMSRLKFLK 74

Query: 77  KKFSDVYDIFKNGMSGFAWNPSTHLWDAEPEVWEALIQSKPKAANCRNTSLPHYEAMVTL 136
             +    D+ K   SGF W+P T  + A  EVW   +++ P   + +  S+ H+E +  +
Sbjct: 75  NLYQSYLDL-KRFSSGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDHFEDLQII 133

Query: 137 YGNDRATGEEA 147
           +G+  ATG  A
Sbjct: 134 FGDVVATGSFA 144