Miyakogusa Predicted Gene

Lj5g3v2298170.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v2298170.1 Non Characterized Hit- tr|K4B0A6|K4B0A6_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,40.27,5e-19,seg,NULL; DUF1685,Protein of unknown function
DUF1685; FAMILY NOT NAMED,NULL,CUFF.57335.1
         (129 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G05870.4 | Symbols:  | Protein of unknown function (DUF1685) ...   163   3e-41
AT1G05870.3 | Symbols:  | Protein of unknown function (DUF1685) ...   163   3e-41
AT1G05870.2 | Symbols:  | Protein of unknown function (DUF1685) ...   163   3e-41
AT1G05870.1 | Symbols:  | Protein of unknown function (DUF1685) ...   163   3e-41
AT2G31560.2 | Symbols:  | Protein of unknown function (DUF1685) ...   160   2e-40
AT2G31560.1 | Symbols:  | Protein of unknown function (DUF1685) ...   160   2e-40
AT2G43340.1 | Symbols:  | Protein of unknown function (DUF1685) ...   155   6e-39
AT3G22690.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Protein of...    92   1e-19
AT4G33985.1 | Symbols:  | Protein of unknown function (DUF1685) ...    80   4e-16
AT3G04700.1 | Symbols:  | Protein of unknown function (DUF1685) ...    76   7e-15
AT3G04710.3 | Symbols: TPR10 | ankyrin repeat family protein | c...    75   2e-14
AT1G08790.1 | Symbols:  | Protein of unknown function (DUF1685) ...    74   2e-14
AT5G28690.1 | Symbols:  | Protein of unknown function (DUF1685) ...    70   3e-13
AT3G50350.2 | Symbols:  | Protein of unknown function (DUF1685) ...    69   9e-13
AT2G15590.2 | Symbols:  | Protein of unknown function (DUF1685) ...    67   3e-12
AT3G62070.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    60   5e-10
AT2G15610.1 | Symbols:  | Protein of unknown function (DUF1685) ...    57   4e-09
AT2G15590.1 | Symbols:  | Protein of unknown function (DUF1685) ...    56   5e-09
AT2G46940.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    55   1e-08
AT3G50350.1 | Symbols:  | Protein of unknown function (DUF1685) ...    55   1e-08
AT4G01670.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    53   4e-08
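
The hit table above can be read programmatically. The following is a minimal sketch, assuming the legacy plain-text layout shown here (hit ID, truncated description, bit score, E-value on one line). The names `HIT_LINE` and `parse_hit_table` are hypothetical helpers for illustration and are not part of BLAST or any library.

```python
import re

# One summary line: "<ID> | <description> ...   <bits>   <E-value>".
# Assumption: hit IDs look like TAIR10 protein models (e.g. AT1G05870.4).
HIT_LINE = re.compile(
    r"^(?P<id>AT\w+\.\d+) \| (?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit_table(text):
    """Return one dict per summary line found in a plain-text BLAST report."""
    hits = []
    for line in text.splitlines():
        match = HIT_LINE.match(line)
        if match is None:
            continue  # header, alignment, or blank line
        raw_evalue = match.group("evalue")
        # Older BLAST versions may print very small E-values as "e-100".
        if raw_evalue.startswith("e"):
            raw_evalue = "1" + raw_evalue
        try:
            evalue = float(raw_evalue)
        except ValueError:
            continue  # last column is not numeric, so not a hit line
        hit_id = match.group("id")
        hits.append({
            "id": hit_id,
            "locus": hit_id.split(".")[0],  # gene locus without the model suffix
            "description": match.group("desc"),
            "bits": float(match.group("bits")),
            "evalue": evalue,
        })
    return hits


if __name__ == "__main__":
    sample = (
        "AT1G05870.4 | Symbols:  | Protein of unknown function (DUF1685) ...   163   3e-41\n"
        "AT3G04710.3 | Symbols: TPR10 | ankyrin repeat family protein | c...    75   2e-14\n"
    )
    for hit in parse_hit_table(sample):
        print(hit["locus"], hit["bits"], hit["evalue"])
```

Grouping by `locus` is useful here because several hits (for example the four AT1G05870 models) are splice variants of the same gene and carry identical scores.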

>AT1G05870.4 | Symbols:  | Protein of unknown function (DUF1685) |
           chr1:1772454-1773228 REVERSE LENGTH=189
          Length = 189

 Score =  163 bits (412), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 86/139 (61%), Positives = 95/139 (68%), Gaps = 22/139 (15%)

Query: 13  ENKNKKLLLEGYVEEA----------DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
           E K  ++LLEGYVE A          DL RSKSLTDDDLE+L+GC+DLGFGFSYDEIPEL
Sbjct: 51  ERKKSQVLLEGYVETASSSSVDDQKDDLTRSKSLTDDDLEDLRGCLDLGFGFSYDEIPEL 110

Query: 63  RNTLPALELCYSMSHKFXXXXXXXXXXXXX------------XXIANWKISSPGDHPEDV 110
            NTLPALELCYSMS KF                           IANWKISSPGD+P+DV
Sbjct: 111 CNTLPALELCYSMSQKFLDDKQNKSPETSSVEDCPSPPLVTATPIANWKISSPGDNPDDV 170

Query: 111 KARLKFWAQAVACTVKLCS 129
           KARLK+WAQAVACTV+LCS
Sbjct: 171 KARLKYWAQAVACTVQLCS 189


>AT1G05870.3 | Symbols:  | Protein of unknown function (DUF1685) |
           chr1:1772454-1773228 REVERSE LENGTH=189
          Length = 189

 Score =  163 bits (412), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 86/139 (61%), Positives = 95/139 (68%), Gaps = 22/139 (15%)

Query: 13  ENKNKKLLLEGYVEEA----------DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
           E K  ++LLEGYVE A          DL RSKSLTDDDLE+L+GC+DLGFGFSYDEIPEL
Sbjct: 51  ERKKSQVLLEGYVETASSSSVDDQKDDLTRSKSLTDDDLEDLRGCLDLGFGFSYDEIPEL 110

Query: 63  RNTLPALELCYSMSHKFXXXXXXXXXXXXX------------XXIANWKISSPGDHPEDV 110
            NTLPALELCYSMS KF                           IANWKISSPGD+P+DV
Sbjct: 111 CNTLPALELCYSMSQKFLDDKQNKSPETSSVEDCPSPPLVTATPIANWKISSPGDNPDDV 170

Query: 111 KARLKFWAQAVACTVKLCS 129
           KARLK+WAQAVACTV+LCS
Sbjct: 171 KARLKYWAQAVACTVQLCS 189


>AT1G05870.2 | Symbols:  | Protein of unknown function (DUF1685) |
           chr1:1772454-1773228 REVERSE LENGTH=189
          Length = 189

 Score =  163 bits (412), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 86/139 (61%), Positives = 95/139 (68%), Gaps = 22/139 (15%)

Query: 13  ENKNKKLLLEGYVEEA----------DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
           E K  ++LLEGYVE A          DL RSKSLTDDDLE+L+GC+DLGFGFSYDEIPEL
Sbjct: 51  ERKKSQVLLEGYVETASSSSVDDQKDDLTRSKSLTDDDLEDLRGCLDLGFGFSYDEIPEL 110

Query: 63  RNTLPALELCYSMSHKFXXXXXXXXXXXXX------------XXIANWKISSPGDHPEDV 110
            NTLPALELCYSMS KF                           IANWKISSPGD+P+DV
Sbjct: 111 CNTLPALELCYSMSQKFLDDKQNKSPETSSVEDCPSPPLVTATPIANWKISSPGDNPDDV 170

Query: 111 KARLKFWAQAVACTVKLCS 129
           KARLK+WAQAVACTV+LCS
Sbjct: 171 KARLKYWAQAVACTVQLCS 189


>AT1G05870.1 | Symbols:  | Protein of unknown function (DUF1685) |
           chr1:1772454-1773228 REVERSE LENGTH=189
          Length = 189

 Score =  163 bits (412), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 86/139 (61%), Positives = 95/139 (68%), Gaps = 22/139 (15%)

Query: 13  ENKNKKLLLEGYVEEA----------DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
           E K  ++LLEGYVE A          DL RSKSLTDDDLE+L+GC+DLGFGFSYDEIPEL
Sbjct: 51  ERKKSQVLLEGYVETASSSSVDDQKDDLTRSKSLTDDDLEDLRGCLDLGFGFSYDEIPEL 110

Query: 63  RNTLPALELCYSMSHKFXXXXXXXXXXXXX------------XXIANWKISSPGDHPEDV 110
            NTLPALELCYSMS KF                           IANWKISSPGD+P+DV
Sbjct: 111 CNTLPALELCYSMSQKFLDDKQNKSPETSSVEDCPSPPLVTATPIANWKISSPGDNPDDV 170

Query: 111 KARLKFWAQAVACTVKLCS 129
           KARLK+WAQAVACTV+LCS
Sbjct: 171 KARLKYWAQAVACTVQLCS 189


>AT2G31560.2 | Symbols:  | Protein of unknown function (DUF1685) |
           chr2:13436611-13437312 FORWARD LENGTH=202
          Length = 202

 Score =  160 bits (405), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 83/130 (63%), Positives = 92/130 (70%), Gaps = 13/130 (10%)

Query: 13  ENKNKKLLLEGYV--EEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALE 70
           E K  ++LLEGY   ++ DL R+KSLTDDDLEELKGC+DLGFGFSYDEIPEL NTLPALE
Sbjct: 73  EKKKSQVLLEGYALDDQDDLTRAKSLTDDDLEELKGCLDLGFGFSYDEIPELCNTLPALE 132

Query: 71  LCYSMSHKF-----------XXXXXXXXXXXXXXXIANWKISSPGDHPEDVKARLKFWAQ 119
           LCYSMS KF                          IANWKISSPGD P+DVKARLK+WAQ
Sbjct: 133 LCYSMSQKFLDDKQQNHHKSQEEDDSSPPPTTTAPIANWKISSPGDDPDDVKARLKYWAQ 192

Query: 120 AVACTVKLCS 129
            VACTV+LCS
Sbjct: 193 TVACTVRLCS 202


>AT2G31560.1 | Symbols:  | Protein of unknown function (DUF1685) |
           chr2:13436611-13437312 FORWARD LENGTH=202
          Length = 202

 Score =  160 bits (405), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 83/130 (63%), Positives = 92/130 (70%), Gaps = 13/130 (10%)

Query: 13  ENKNKKLLLEGYV--EEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALE 70
           E K  ++LLEGY   ++ DL R+KSLTDDDLEELKGC+DLGFGFSYDEIPEL NTLPALE
Sbjct: 73  EKKKSQVLLEGYALDDQDDLTRAKSLTDDDLEELKGCLDLGFGFSYDEIPELCNTLPALE 132

Query: 71  LCYSMSHKF-----------XXXXXXXXXXXXXXXIANWKISSPGDHPEDVKARLKFWAQ 119
           LCYSMS KF                          IANWKISSPGD P+DVKARLK+WAQ
Sbjct: 133 LCYSMSQKFLDDKQQNHHKSQEEDDSSPPPTTTAPIANWKISSPGDDPDDVKARLKYWAQ 192

Query: 120 AVACTVKLCS 129
            VACTV+LCS
Sbjct: 193 TVACTVRLCS 202


>AT2G43340.1 | Symbols:  | Protein of unknown function (DUF1685) |
           chr2:18007769-18008416 FORWARD LENGTH=189
          Length = 189

 Score =  155 bits (392), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 81/128 (63%), Positives = 92/128 (71%), Gaps = 15/128 (11%)

Query: 17  KKLLLEGYVEEA----DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALELC 72
             +LLEGYV ++    DL R+KSLTDDDLEELKGCVDLGFGF+Y+EIPEL NTLPALELC
Sbjct: 62  SNVLLEGYVVDSAVNDDLKRTKSLTDDDLEELKGCVDLGFGFNYEEIPELCNTLPALELC 121

Query: 73  YSMSHKFXXX-----------XXXXXXXXXXXXIANWKISSPGDHPEDVKARLKFWAQAV 121
           YSMS KF                          IA+WKISSPGD+P+DVKARLKFWAQAV
Sbjct: 122 YSMSQKFIDQDHHHHSSSSPEKKSSVLDSPVSPIASWKISSPGDNPDDVKARLKFWAQAV 181

Query: 122 ACTVKLCS 129
           ACTV+LC+
Sbjct: 182 ACTVRLCT 189


>AT3G22690.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Protein of
           unknown function DUF1685 (InterPro:IPR012881),
           Pentatricopeptide repeat (InterPro:IPR002885); BEST
           Arabidopsis thaliana protein match is: Tetratricopeptide
           repeat (TPR)-like superfamily protein
           (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716
           proteins in 280 species: Archae - 2; Bacteria - 10;
           Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0;
           Other Eukaryotes - 904 (source: NCBI BLink). |
           chr3:8021347-8024534 REVERSE LENGTH=938
          Length = 938

 Score = 91.7 bits (226), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 51/85 (60%), Positives = 56/85 (65%), Gaps = 4/85 (4%)

Query: 36  LTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALELCYSMSHKFXXXXXXXXXXXXXXXI 95
           LTDDDLE LKGC+DLGFGF+YDEIP L  TLPALELCYSMS K                I
Sbjct: 843 LTDDDLEVLKGCLDLGFGFNYDEIPALCKTLPALELCYSMSQK---NLDDKHTPSLQLPI 899

Query: 96  ANWKISSPGDHPEDVKARLKFWAQA 120
               + S  D+P+DVKARLK WAQA
Sbjct: 900 GRSLVPSC-DNPDDVKARLKCWAQA 923


>AT4G33985.1 | Symbols:  | Protein of unknown function (DUF1685) |
           chr4:16288301-16288857 REVERSE LENGTH=154
          Length = 154

 Score = 80.1 bits (196), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 69/136 (50%), Gaps = 22/136 (16%)

Query: 2   EAWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGF---SYDE 58
           EAW RK+G     +              L RSKS+TD+DLEELKGC++LGFGF   S D 
Sbjct: 29  EAWLRKKGKQSLGR--------------LGRSKSVTDEDLEELKGCIELGFGFEPDSPDL 74

Query: 59  IPELRNTLPALELCYSMSHKFXXXXXXXX-----XXXXXXXIANWKISSPGDHPEDVKAR 113
            P L  TLPAL L  +++ ++                     ++  I   GD PE +K R
Sbjct: 75  DPRLSETLPALGLYCAVNKQYSSRLSRTSSLSSIASEGENSNSSTTIVDQGDDPETMKLR 134

Query: 114 LKFWAQAVACTVKLCS 129
           LK WAQ VAC+VK  S
Sbjct: 135 LKQWAQVVACSVKQFS 150


>AT3G04700.1 | Symbols:  | Protein of unknown function (DUF1685) |
           chr3:1276948-1277607 FORWARD LENGTH=191
          Length = 191

 Score = 75.9 bits (185), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 48/139 (34%), Positives = 72/139 (51%), Gaps = 18/139 (12%)

Query: 3   AWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
           AW R+       + KKLL +G  +  +L    +LTD+DL ELKG ++LGFGF+ +   +L
Sbjct: 51  AWERRRRQMIMIQEKKLLHKGASD--NLCIQANLTDEDLNELKGSIELGFGFNEEAGQKL 108

Query: 63  RNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIA----------------NWKISSPGDH 106
            NTLPAL+L ++++ +                 +                + KI  PGD 
Sbjct: 109 CNTLPALDLYFAVNRQLSPLPSPSSSRSSSASASTFSYSIPCSPKKTDSDSVKILCPGDD 168

Query: 107 PEDVKARLKFWAQAVACTV 125
           P+ +K RL+ WAQAVAC+V
Sbjct: 169 PQQMKQRLRHWAQAVACSV 187


>AT3G04710.3 | Symbols: TPR10 | ankyrin repeat family protein |
           chr3:1276948-1280942 FORWARD LENGTH=680
          Length = 680

 Score = 74.7 bits (182), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/139 (33%), Positives = 72/139 (51%), Gaps = 18/139 (12%)

Query: 3   AWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
           AW R+       + KKLL +G  +  +L    +LTD+DL ELKG ++LGFGF+ +   +L
Sbjct: 51  AWERRRRQMIMIQEKKLLHKGASD--NLCIQANLTDEDLNELKGSIELGFGFNEEAGQKL 108

Query: 63  RNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIA----------------NWKISSPGDH 106
            NTLPAL+L ++++ +                 +                + KI  PGD 
Sbjct: 109 CNTLPALDLYFAVNRQLSPLPSPSSSRSSSASASTFSYSIPCSPKKTDSDSVKILCPGDD 168

Query: 107 PEDVKARLKFWAQAVACTV 125
           P+ +K RL+ WAQAVAC++
Sbjct: 169 PQQMKQRLRHWAQAVACSI 187


>AT1G08790.1 | Symbols:  | Protein of unknown function (DUF1685) |
           chr1:2811989-2812646 FORWARD LENGTH=190
          Length = 190

 Score = 73.9 bits (180), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 25/133 (18%)

Query: 13  ENKNKKLL--LEGYVEEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALE 70
           E + +++L  LE + E  D      LTD+DL ELKG ++LGFGF+ ++   L  TLPAL+
Sbjct: 58  ERRRRQMLHHLEKHNEGGD-----DLTDEDLSELKGSIELGFGFNEEQGQHLTTTLPALD 112

Query: 71  LCYSMSHKFXXXXXXXXXXXXXXXIA------------------NWKISSPGDHPEDVKA 112
           L ++++ +                                    + K+ SPGD P+ VK 
Sbjct: 113 LYFAVTRQISPVSTPGSGGSSSSSRPTSLGDRSSSFGSPISDSDSLKVMSPGDDPQQVKT 172

Query: 113 RLKFWAQAVACTV 125
           RL+ WAQAVAC+V
Sbjct: 173 RLRHWAQAVACSV 185


>AT5G28690.1 | Symbols:  | Protein of unknown function (DUF1685) |
           chr5:10723033-10723702 FORWARD LENGTH=192
          Length = 192

 Score = 70.5 bits (171), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/142 (35%), Positives = 69/142 (48%), Gaps = 25/142 (17%)

Query: 2   EAWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPE 61
           E  RR+    QE K K       V E D   S  LTD+DL ELKG ++LGFGFS +   +
Sbjct: 54  EKRRRQMLKIQEKKQK------SVSEND-NDSPDLTDEDLRELKGSIELGFGFSEEAGQK 106

Query: 62  LRNTLPALELCYSMSHKFXXXXXXXXXX------------------XXXXXIANWKISSP 103
           L NTLPAL+L ++++ +                                    + KI  P
Sbjct: 107 LCNTLPALDLYFAVNRQLSPLPSPSSSNGGDGSLSSTSVSSSSIPCSPKTDSDSLKILCP 166

Query: 104 GDHPEDVKARLKFWAQAVACTV 125
           GD+P+ VK RL+ WAQAVAC++
Sbjct: 167 GDNPQQVKQRLRHWAQAVACSL 188


>AT3G50350.2 | Symbols:  | Protein of unknown function (DUF1685) |
           chr3:18672164-18673665 FORWARD LENGTH=171
          Length = 171

 Score = 68.9 bits (167), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 53/103 (51%), Gaps = 7/103 (6%)

Query: 30  LARSKSLTDDDLEELKGCVDLGFGFSYDEI--PELRNTLPALELCYSMSHKFXXXXXXX- 86
           L R KSLTD+DL+ELK   +LGFGF   E   P L NTLPALEL +++   +        
Sbjct: 63  LRRGKSLTDEDLDELKASFELGFGFGSPENADPRLSNTLPALELYFAVQKSYNDAVSNKS 122

Query: 87  ----XXXXXXXXIANWKISSPGDHPEDVKARLKFWAQAVACTV 125
                         +  +    D P+ VK +LK WA+ VACTV
Sbjct: 123 TTSSSSLSDGDTSPHHTVYQTSDDPQTVKTKLKQWARVVACTV 165


>AT2G15590.2 | Symbols:  | Protein of unknown function (DUF1685) |
           chr2:6801950-6802506 FORWARD LENGTH=155
          Length = 155

 Score = 67.4 bits (163), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 59/107 (55%), Gaps = 7/107 (6%)

Query: 30  LARSKSLTDDDLEELKGCVDLGFGF---SYDEIPELRNTLPALELCYSMSHKFXXXXXXX 86
           L RSKS+T+DD+EELKGC +LGFGF   S D  P L +T+PAL+L  ++  ++       
Sbjct: 40  LPRSKSVTNDDIEELKGCFELGFGFETESPDLNPRLSHTIPALDLYCAVHRQYSNHLSRT 99

Query: 87  XXXXXXXXIAN----WKISSPGDHPEDVKARLKFWAQAVACTVKLCS 129
                   ++N      I   GD  + +K +LK WA+ V  +V+  S
Sbjct: 100 SSFASDHEVSNSNNITTIVDKGDDRKTMKQKLKQWAKVVGFSVRHSS 146


>AT3G62070.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G46940.1); Has 137 Blast hits to 135 proteins
           in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 137; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr3:22983710-22984482 REVERSE
           LENGTH=228
          Length = 228

 Score = 59.7 bits (143), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 41/82 (50%), Gaps = 12/82 (14%)

Query: 41  LEELKGCVDLGFGFSYDEIPELRNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIANWKI 100
           L+E+K C DLGF        EL   +P      +    F               IA W+I
Sbjct: 150 LQEVKACRDLGF--------ELEVPVPGRISVSTTGSNFDTQTSSGGDSP----IATWRI 197

Query: 101 SSPGDHPEDVKARLKFWAQAVA 122
           SSPGD P++VKARLK WAQAVA
Sbjct: 198 SSPGDDPKEVKARLKVWAQAVA 219


>AT2G15610.1 | Symbols:  | Protein of unknown function (DUF1685) |
          chr2:6806033-6806686 FORWARD LENGTH=153
          Length = 153

 Score = 57.0 bits (136), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 45/81 (55%), Gaps = 19/81 (23%)

Query: 2  EAWRRKEGDHQENKNKKLLLEGYVEEADLARSKS-LTDDDLEELKGCVDLGFGFSYDEI- 59
          EAW R +  H                A L RSKS +T+DD+EEL+GC DLGFGF  D + 
Sbjct: 27 EAWLRMKKKH--------------PSARLHRSKSCVTNDDIEELRGCFDLGFGFEPDSLD 72

Query: 60 --PELRNTLPALELCYSMSHK 78
            P L  T+PAL+L YS  H+
Sbjct: 73 FNPSLSKTIPALDL-YSAIHR 92


>AT2G15590.1 | Symbols:  | Protein of unknown function (DUF1685) |
          chr2:6801950-6802403 FORWARD LENGTH=125
          Length = 125

 Score = 56.2 bits (134), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/73 (49%), Positives = 45/73 (61%), Gaps = 17/73 (23%)

Query: 2  EAWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGF---SYDE 58
          EAW RK+      K + L L        L RSKS+T+DD+EELKGC +LGFGF   S D 
Sbjct: 26 EAWLRKK------KKRPLDL--------LPRSKSVTNDDIEELKGCFELGFGFETESPDL 71

Query: 59 IPELRNTLPALEL 71
           P L +T+PAL+L
Sbjct: 72 NPRLSHTIPALDL 84


>AT2G46940.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G62070.1); Has 143 Blast hits to 141 proteins
           in 14 species: Archae - 0; Bacteria - 0; Metazoa - 4;
           Fungi - 0; Plants - 139; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr2:19286658-19287505 REVERSE
           LENGTH=252
          Length = 252

 Score = 55.5 bits (132), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 42/82 (51%), Gaps = 20/82 (24%)

Query: 41  LEELKGCVDLGFGFSYDEIPELRNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIANWKI 100
            EE+K C DLGF     ++P             S S++                IANW+I
Sbjct: 185 FEEVKACRDLGFEL---DVPGR----------VSGSNR-------ETSSGGNSPIANWRI 224

Query: 101 SSPGDHPEDVKARLKFWAQAVA 122
           SSPGD P++VKARLK WAQAVA
Sbjct: 225 SSPGDDPKEVKARLKMWAQAVA 246


>AT3G50350.1 | Symbols:  | Protein of unknown function (DUF1685) |
           chr3:18672906-18673706 FORWARD LENGTH=181
          Length = 181

 Score = 55.1 bits (131), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 28/52 (53%), Positives = 35/52 (67%), Gaps = 2/52 (3%)

Query: 30  LARSKSLTDDDLEELKGCVDLGFGFSYDEI--PELRNTLPALELCYSMSHKF 79
           L R KSLTD+DL+ELK   +LGFGF   E   P L NTLPALEL +++   +
Sbjct: 67  LRRGKSLTDEDLDELKASFELGFGFGSPENADPRLSNTLPALELYFAVQKSY 118


>AT4G01670.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G62070.1); Has 141 Blast hits to 139 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 138; Viruses - 0; Other Eukaryotes -
           3 (source: NCBI BLink). | chr4:711464-712511 REVERSE
           LENGTH=249
          Length = 249

 Score = 53.1 bits (126), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 40/83 (48%), Gaps = 18/83 (21%)

Query: 40  DLEELKGCVDLGFGFSYDEIPELRNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIANWK 99
           DLEE+K C DLGF             L    + YS S                   +N +
Sbjct: 172 DLEEVKACKDLGF------------ELEPGRVSYSGS------TVDTSSGGNSPISSNHR 213

Query: 100 ISSPGDHPEDVKARLKFWAQAVA 122
           ISSPGD P+DVKARLK WAQAVA
Sbjct: 214 ISSPGDDPKDVKARLKAWAQAVA 236