Miyakogusa Predicted Gene

Lj6g3v2258860.3

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v2258860.3 tr|C1E8U4|C1E8U4_MICSR Predicted protein
OS=Micromonas sp. (strain RCC299 / NOUM17) GN=MICPUN_59495
,39.29,1e-18,seg,NULL; Acid proteases,Peptidase aspartic; CHLOROPLAST
NUCLEOID DNA-BINDING-RELATED,NULL; ASPARTYL,CUFF.60996.3
         (315 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G43100.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   344   5e-95
AT3G50050.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   306   2e-83
AT3G42550.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    81   1e-15
AT5G36260.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    77   2e-14
AT1G05840.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    76   3e-14
AT2G36670.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    71   1e-12
AT2G36670.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    71   1e-12
AT5G22850.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    69   5e-12
AT1G08210.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    68   1e-11
AT2G28010.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    65   7e-11
AT2G28030.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    64   9e-11
AT1G65240.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    62   4e-10
AT2G28040.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    60   2e-09
AT2G28220.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    57   1e-08
AT1G77480.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    56   4e-08
AT3G02740.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    56   4e-08
AT1G77480.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    55   5e-08
AT4G33490.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    55   6e-08
AT1G49050.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    54   1e-07
AT1G49050.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    54   2e-07
AT2G03200.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    54   2e-07
AT3G59080.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    50   2e-06
AT3G59080.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    50   3e-06
AT3G18490.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    50   3e-06
AT3G25700.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    49   3e-06
AT2G42980.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    49   3e-06
AT1G25510.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    49   5e-06
AT1G44130.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    49   5e-06
AT3G20015.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    48   7e-06
AT5G10770.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    48   8e-06

>AT5G43100.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:17299264-17302718 FORWARD LENGTH=631
          Length = 631

 Score =  344 bits (882), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 166/313 (53%), Positives = 223/313 (71%), Gaps = 3/313 (0%)

Query: 1   MHVAGKRLPLNPKVFDGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPN 60
           MHVAGK L LNPKVF+GKHGTVLDSGTTYAY P       K A+IKE+ SLK+I GPDPN
Sbjct: 267 MHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPN 326

Query: 61  YKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNGK 120
           Y D+CFSGAG DV+++   FP + M F NG KL LSPENYLF H+KVRGAYCLG+F + +
Sbjct: 327 YDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPD-R 385

Query: 121 DPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCSELWARLHVSDALPPVPPNSEGTNLAKA 180
           D TTLLGGIVVRNTLV YDRE+ K+GFLKTNCS++W RL   ++  P  P S+  N +  
Sbjct: 386 DSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDIWRRLAAPESPAPTSPISQ--NKSSN 443

Query: 181 FEPSVAPSASQFNIHQGELQIAQITIVISFNISYMDMKPHITELADLIAHELDVNTSQVH 240
             PS A S S  +   G  ++  IT  +S +++   +KP  +E+AD IAHELD+ ++QV 
Sbjct: 444 ISPSPATSESPTSHLPGVFRVGVITFEVSISVNNSSLKPKFSEIADFIAHELDIQSAQVR 503

Query: 241 LMNFSSLGNGSLSRWVITPRPSANFFSNTTAMSMISRISEHQLQLPDKFGSYNLVDWHAK 300
           L+NFSS GN    +W + P  S+ + SNTTA++++  + E++L+LP +FGSY L++W A+
Sbjct: 504 LLNFSSSGNEYRLKWGVFPPQSSEYISNTTALNIMLLLKENRLRLPGQFGSYKLLEWKAE 563

Query: 301 PPSKRTWWQQYFV 313
              K++WW+++ +
Sbjct: 564 QKKKQSWWEKHLL 576


>AT3G50050.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:18554138-18557115 REVERSE LENGTH=632
          Length = 632

 Score =  306 bits (783), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 159/307 (51%), Positives = 210/307 (68%), Gaps = 11/307 (3%)

Query: 1   MHVAGKRLPLNPKVFDGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPN 60
           + VAGK+L L+ +VFDG+HG VLDSGTTYAYLP       + A+++E+ +LKQI GPDPN
Sbjct: 284 IRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPN 343

Query: 61  YKDICFSGAGSD-VSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNG 119
           +KD CF  A S+ VS+LS+ FP V+MVF++G    LSPENY+F HSKV GAYCLGVF NG
Sbjct: 344 FKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNG 403

Query: 120 KDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCSELWARLHVSDALPPVPPNSEGTNLAK 179
           KD TTLLGGIVVRNTLV+YDRE++KVGF +TNCSEL  RLH+  A PP          A 
Sbjct: 404 KDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSDRLHIDGAPPP----------AT 453

Query: 180 AFEPSVAPSASQFNIHQGELQIAQITIVISFNISYMDMKPHITELADLIAHELDVNTSQV 239
                  PS +  +   G  Q+ QI + I   ++   +KP I +L+ + + ELDV +SQV
Sbjct: 454 LPSNDSNPSHNSSSNLSGVTQVGQINLDIQLTVNSSYLKPRIEDLSKIFSKELDVKSSQV 513

Query: 240 HLMNFSSLGNGSLSRWVITPRPSANFFSNTTAMSMISRISEHQLQLPDKFGSYNLVDWHA 299
            L N +S GN SL R V+ P   + +FSN TA +++SR + HQ++LP+ FG+Y LV++  
Sbjct: 514 SLSNLTSKGNESLVRMVVLPPEPSTWFSNVTATNIVSRFTNHQIKLPEIFGNYQLVNYKL 573

Query: 300 KPPSKRT 306
           +PP KRT
Sbjct: 574 EPPRKRT 580


>AT3G42550.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:14665728-14669135 REVERSE LENGTH=430
          Length = 430

 Score = 80.9 bits (198), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 75/153 (49%), Gaps = 9/153 (5%)

Query: 7   RLPLNPKVFD--GKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKDI 64
           RLP++P VF     +GT++DSGTT  + P         AI   L  + Q   P P     
Sbjct: 241 RLPIDPSVFSVAKGYGTIIDSGTTLVHFPGEAYDPLIQAI---LNVVSQYGRPIPYESFQ 297

Query: 65  CFSGAGSDVSQL--SRSFPVVDMVFENGHKLALSPENYLFPH--SKVRGAYCLGVFSNGK 120
           CF+      S L  +  FP V + F  G  + + PE YLF          +CLG +S+  
Sbjct: 298 CFNITSGISSHLVIADMFPEVHLGFAGGASMVIKPEAYLFQKFLDLTNAIWCLGFYSSTS 357

Query: 121 DPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCS 153
              T++G + +R+ + +YD +H ++G+ + NCS
Sbjct: 358 RRITIIGEVAIRDKMFVYDLDHQRIGWAEYNCS 390


>AT5G36260.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:14285068-14288179 REVERSE LENGTH=482
          Length = 482

 Score = 76.6 bits (187), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/160 (34%), Positives = 83/160 (51%), Gaps = 17/160 (10%)

Query: 1   MHVAGKRLPLNPKVF--DGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPD 58
           M V G  + L P +   +G  GT++DSGTT AYLP        +++I+++ + +Q+    
Sbjct: 285 MDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLY----NSLIEKITAKQQVKLHM 340

Query: 59  PNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSN 118
                 CFS      S   ++FPVV++ FE+  KL++ P +YLF  S     YC G  S 
Sbjct: 341 VQETFACFSFT----SNTDKAFPVVNLHFEDSLKLSVYPHDYLF--SLREDMYCFGWQSG 394

Query: 119 GKDP-----TTLLGGIVVRNTLVIYDREHTKVGFLKTNCS 153
           G          LLG +V+ N LV+YD E+  +G+   NCS
Sbjct: 395 GMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 434


>AT1G05840.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:1762843-1766150 REVERSE LENGTH=485
          Length = 485

 Score = 75.9 bits (185), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 15/141 (10%)

Query: 18  KHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKDICFSGAGSDVSQLS 77
           + G ++DSGTT AYLP          I  +  +LK +   D +YK  CF  +G    ++ 
Sbjct: 309 RKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALK-VHIVDKDYK--CFQYSG----RVD 361

Query: 78  RSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLG-----VFSNGKDPTTLLGGIVVR 132
             FP V   FEN   L + P +YLFPH    G +C+G     + S  +   TLLG +V+ 
Sbjct: 362 EGFPNVTFHFENSVFLRVYPHDYLFPH---EGMWCIGWQNSAMQSRDRRNMTLLGDLVLS 418

Query: 133 NTLVIYDREHTKVGFLKTNCS 153
           N LV+YD E+  +G+ + NCS
Sbjct: 419 NKLVLYDLENQLIGWTEYNCS 439


>AT2G36670.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:15364949-15368016 REVERSE LENGTH=507
          Length = 507

 Score = 70.9 bits (172), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 82/157 (52%), Gaps = 12/157 (7%)

Query: 1   MHVAGKRLPLNPKVFDGKH--GTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPD 58
           + V G+ LPL+  VF+  +  GT++D+GTT  YL         +AI     S+ Q+  P 
Sbjct: 310 IGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISN---SVSQLVTPI 366

Query: 59  PNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGA--YCLGVF 116
            +  + C+  + S    +S  FP V + F  G  + L P++YLF +    GA  +C+G F
Sbjct: 367 ISNGEQCYLVSTS----ISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG-F 421

Query: 117 SNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCS 153
               +  T+LG +V+++ + +YD    ++G+   +CS
Sbjct: 422 QKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 458


>AT2G36670.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:15364949-15368016 REVERSE LENGTH=512
          Length = 512

 Score = 70.9 bits (172), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 82/157 (52%), Gaps = 12/157 (7%)

Query: 1   MHVAGKRLPLNPKVFDGKH--GTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPD 58
           + V G+ LPL+  VF+  +  GT++D+GTT  YL         +AI     S+ Q+  P 
Sbjct: 315 IGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISN---SVSQLVTPI 371

Query: 59  PNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGA--YCLGVF 116
            +  + C+  + S    +S  FP V + F  G  + L P++YLF +    GA  +C+G F
Sbjct: 372 ISNGEQCYLVSTS----ISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG-F 426

Query: 117 SNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCS 153
               +  T+LG +V+++ + +YD    ++G+   +CS
Sbjct: 427 QKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 463


>AT5G22850.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:7633717-7636298 REVERSE LENGTH=493
          Length = 493

 Score = 68.9 bits (167), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 95/211 (45%), Gaps = 18/211 (8%)

Query: 1   MHVAGKRLPLNPKVFD--GKHGTVLDSGTTYAYLPXXXXXXXKHAIIKEL-QSLKQISGP 57
           + V G+ LP+NP VF      GT++D+GTT AYL          AI   + QS++    P
Sbjct: 292 ISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR----P 347

Query: 58  DPNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRG--AYCLGV 115
             +  + C+       + +   FP V + F  G  + L+P++YL   + V G   +C+G 
Sbjct: 348 VVSKGNQCYV----ITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGF 403

Query: 116 FSNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCSELWARLHVSDALPPVPPNSEGT 175
                   T+LG +V+++ + +YD    ++G+   +CS       V+ +       SE  
Sbjct: 404 QRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCST-----SVNVSATSSSGRSEYV 458

Query: 176 NLAKAFEPSVAPSASQFNIHQGELQIAQITI 206
           N  +  E + AP     +I    L +  + I
Sbjct: 459 NAGQFSENAAAPQKLSLDIVGNTLMLLLMVI 489


>AT1G08210.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:2577119-2580581 REVERSE LENGTH=492
          Length = 492

 Score = 67.8 bits (164), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 72/156 (46%), Gaps = 10/156 (6%)

Query: 1   MHVAGKRLPLNPKVFD--GKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPD 58
           + V G+ LP++P VF      GT++D+GTT AYLP         A+     ++ Q   P 
Sbjct: 293 IAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVAN---AVSQYGRPI 349

Query: 59  PNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYL-FPHSKVRGAYCLGVFS 117
                 CF     DV      FP V + F  G  + L P  YL    S     +C+G   
Sbjct: 350 TYESYQCFEITAGDVD----VFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQR 405

Query: 118 NGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCS 153
                 T+LG +V+++ +V+YD    ++G+ + +CS
Sbjct: 406 MSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441


>AT2G28010.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:11930579-11931769 REVERSE LENGTH=396
          Length = 396

 Score = 65.1 bits (157), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 64/156 (41%), Gaps = 12/156 (7%)

Query: 3   VAGKRLPLNPKVFDGKHGT-VLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNY 61
           V   R+      F    G  V+DSGTT  Y P       + A+   + +++     DP  
Sbjct: 250 VGNTRIETMGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVR---AADPTG 306

Query: 62  KD-ICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNGK 120
            D +C++    D+      FPV+ M F  G  L L   N ++  S   G +CL +  N  
Sbjct: 307 NDMLCYNSDTIDI------FPVITMHFSGGVDLVLDKYN-MYMESNNGGVFCLAIICNSP 359

Query: 121 DPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCSELW 156
               + G     N LV YD     V F  TNCS LW
Sbjct: 360 TQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCSALW 395


>AT2G28030.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:11934208-11935386 REVERSE LENGTH=392
          Length = 392

 Score = 64.3 bits (155), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 59/136 (43%), Gaps = 11/136 (8%)

Query: 22  VLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKD-ICFSGAGSDVSQLSRSF 80
           ++DSGTT  Y P       + A+   + +++     DP   D +C+     D+      F
Sbjct: 266 IIDSGTTLTYFPVSYCNLVREAVDHYVTAVRT---ADPTGNDMLCYYTDTIDI------F 316

Query: 81  PVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNGKDPTTLLGGIVVRNTLVIYDR 140
           PV+ M F  G  L L   N ++  +  RG +CL +  N      + G     N LV YD 
Sbjct: 317 PVITMHFSGGADLVLDKYN-MYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDS 375

Query: 141 EHTKVGFLKTNCSELW 156
               V F  TNCS LW
Sbjct: 376 SSLLVSFSPTNCSALW 391


>AT1G65240.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:24230963-24233349 REVERSE LENGTH=475
          Length = 475

 Score = 62.4 bits (150), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 16/158 (10%)

Query: 1   MHVAGKRLPLNPKVFDGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPN 60
           M V G  L L P+      GT++DSGTT AY P          I+   Q +K +   +  
Sbjct: 281 MDVDGTSLDL-PRSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILAR-QPVK-LHIVEET 337

Query: 61  YKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNG- 119
           ++  CFS +    + +  +FP V   FE+  KL + P +YLF   +    YC G  + G 
Sbjct: 338 FQ--CFSFS----TNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEE--ELYCFGWQAGGL 389

Query: 120 ----KDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCS 153
               +    LLG +V+ N LV+YD ++  +G+   NCS
Sbjct: 390 TTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427


>AT2G28040.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:11936203-11937390 REVERSE LENGTH=395
          Length = 395

 Score = 60.5 bits (145), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 58/139 (41%), Gaps = 12/139 (8%)

Query: 18  KHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKDICFSGAGSDVSQLS 77
           K   V+DSG+T  Y P       + A+ + + +++      P    +C+     D+    
Sbjct: 268 KGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRF-----PRSDILCYYSKTIDI---- 318

Query: 78  RSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNGKDPTTLLGGIVVRNTLVI 137
             FPV+ M F  G  L L   N ++  S   G +CL +  N      + G     N LV 
Sbjct: 319 --FPVITMHFSGGADLVLDKYN-MYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVG 375

Query: 138 YDREHTKVGFLKTNCSELW 156
           YD     V F  TNCS LW
Sbjct: 376 YDSSSLLVSFKPTNCSALW 394


>AT2G28220.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:12033953-12037527 FORWARD LENGTH=756
          Length = 756

 Score = 57.0 bits (136), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 63/145 (43%), Gaps = 12/145 (8%)

Query: 15  FDGKHGTV-LDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKDI-CFSGAGSD 72
           F  + G + +DSGTT  Y P       + A+ + + ++K    PD    ++ C+     D
Sbjct: 622 FHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKV---PDMGSDNLLCYYSDTID 678

Query: 73  VSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNGKDPTTLLGGIVVR 132
           +      FPV+ M F  G  L L   N ++  +   G +CL +  N      + G     
Sbjct: 679 I------FPVITMHFSGGADLVLDKYN-MYLETITGGIFCLAIGCNDPSMPAVFGNRAQN 731

Query: 133 NTLVIYDREHTKVGFLKTNCSELWA 157
           N LV YD     + F  TNCS LW+
Sbjct: 732 NFLVGYDPSSNVISFSPTNCSALWS 756



 Score = 49.7 bits (117), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 53/119 (44%), Gaps = 11/119 (9%)

Query: 22  VLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKD-ICFSGAGSDVSQLSRSF 80
           V+DSG+T  Y P       + A+ + + +++    PDP+  D +C+     D+      F
Sbjct: 291 VIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRV---PDPSGNDMLCYFSETIDI------F 341

Query: 81  PVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNGKDPTTLLGGIVVRNTLVIYD 139
           PV+ M F  G  L L   N ++  S   G +CL +  N      + G     N LV YD
Sbjct: 342 PVITMHFSGGADLVLDKYN-MYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYD 399


>AT1G77480.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:29114946-29117150 REVERSE LENGTH=432
          Length = 432

 Score = 55.8 bits (133), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/142 (25%), Positives = 64/142 (45%), Gaps = 10/142 (7%)

Query: 22  VLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKDICFSGAG--SDVSQLSRS 79
           V DSG++Y Y            I K+L         D     +C+ G      + ++ + 
Sbjct: 283 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKY 342

Query: 80  FPVVDMVF---ENGHKLALSPENYLFPHSKVRGAYCLGVFSN---GKDPTTLLGGIVVRN 133
           F  + + F   +NG    + PE+YL    K  G  CLG+ +    G +   ++G I  + 
Sbjct: 343 FKTITLRFGNQKNGQLFQVPPESYLIITEK--GRVCLGILNGTEIGLEGYNIIGDISFQG 400

Query: 134 TLVIYDREHTKVGFLKTNCSEL 155
            +VIYD E  ++G++ ++C +L
Sbjct: 401 IMVIYDNEKQRIGWISSDCDKL 422


>AT3G02740.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:590561-593089 FORWARD LENGTH=488
          Length = 488

 Score = 55.8 bits (133), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 71/161 (44%), Gaps = 19/161 (11%)

Query: 1   MHVAGKRLPLNPKVFDG--KHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPD 58
           + V    L L+   FD     G ++DSGTT  YLP        + I   L S  +++   
Sbjct: 291 IEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEI---LASHPELTLHT 347

Query: 59  PNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVR-GAYCLG--- 114
                 CF        +L R FP V   F+    LA+ P  YLF   +VR   +C G   
Sbjct: 348 VQESFTCFH----YTDKLDR-FPTVTFQFDKSVSLAVYPREYLF---QVREDTWCFGWQN 399

Query: 115 --VFSNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCS 153
             + + G    T+LG + + N LV+YD E+  +G+   NCS
Sbjct: 400 GGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440


>AT1G77480.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:29114705-29117150 REVERSE LENGTH=466
          Length = 466

 Score = 55.5 bits (132), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/142 (25%), Positives = 64/142 (45%), Gaps = 10/142 (7%)

Query: 22  VLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKDICFSGAG--SDVSQLSRS 79
           V DSG++Y Y            I K+L         D     +C+ G      + ++ + 
Sbjct: 283 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKY 342

Query: 80  FPVVDMVF---ENGHKLALSPENYLFPHSKVRGAYCLGVFSN---GKDPTTLLGGIVVRN 133
           F  + + F   +NG    + PE+YL    K  G  CLG+ +    G +   ++G I  + 
Sbjct: 343 FKTITLRFGNQKNGQLFQVPPESYLIITEK--GRVCLGILNGTEIGLEGYNIIGDISFQG 400

Query: 134 TLVIYDREHTKVGFLKTNCSEL 155
            +VIYD E  ++G++ ++C +L
Sbjct: 401 IMVIYDNEKQRIGWISSDCDKL 422


>AT4G33490.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr4:16108781-16110679 REVERSE LENGTH=425
          Length = 425

 Score = 55.1 bits (131), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 35/144 (24%), Positives = 66/144 (45%), Gaps = 11/144 (7%)

Query: 21  TVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKDICFSGAG--SDVSQLSR 78
           TV DSG++Y Y          + + +EL         D +   +C+ G      + ++ +
Sbjct: 273 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 332

Query: 79  SFPVVDMVFENGHK----LALSPENYLFPHSKVRGAYCLGVFSN---GKDPTTLLGGIVV 131
            F  + + F+ G +      + PE YL     ++G  CLG+ +    G     L+G I +
Sbjct: 333 YFKPLALSFKTGWRSKTLFEIPPEAYLI--ISMKGNVCLGILNGTEIGLQNLNLIGDISM 390

Query: 132 RNTLVIYDREHTKVGFLKTNCSEL 155
           ++ ++IYD E   +G++  +C EL
Sbjct: 391 QDQMIIYDNEKQSIGWMPVDCDEL 414


>AT1G49050.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:18150638-18153186 FORWARD LENGTH=583
          Length = 583

 Score = 54.3 bits (129), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 74/160 (46%), Gaps = 31/160 (19%)

Query: 15  FDGKHGTV----LDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKD------I 64
            DG++G V     D+G++Y Y P         A  + + SL+++SG +    D      I
Sbjct: 418 LDGENGRVGKVLFDTGSSYTYFP-------NQAYSQLVTSLQEVSGLELTRDDSDETLPI 470

Query: 65  CFSGAG----SDVSQLSRSFPVVDMVFEN-----GHKLALSPENYLFPHSKVRGAYCLGV 115
           C+        S +S + + F  + +   +       KL + PE+YL   +K  G  CLG+
Sbjct: 471 CWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNK--GNVCLGI 528

Query: 116 FSNG---KDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNC 152
                     T +LG I +R  L++YD    ++G++K++C
Sbjct: 529 LDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 568


>AT1G49050.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:18151161-18153186 FORWARD LENGTH=410
          Length = 410

 Score = 53.5 bits (127), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 74/160 (46%), Gaps = 31/160 (19%)

Query: 15  FDGKHGTV----LDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKD------I 64
            DG++G V     D+G++Y Y P         A  + + SL+++SG +    D      I
Sbjct: 245 LDGENGRVGKVLFDTGSSYTYFP-------NQAYSQLVTSLQEVSGLELTRDDSDETLPI 297

Query: 65  CFSGAG----SDVSQLSRSFPVVDMVFEN-----GHKLALSPENYLFPHSKVRGAYCLGV 115
           C+        S +S + + F  + +   +       KL + PE+YL   +K  G  CLG+
Sbjct: 298 CWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNK--GNVCLGI 355

Query: 116 FSNG---KDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNC 152
                     T +LG I +R  L++YD    ++G++K++C
Sbjct: 356 LDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 395


>AT2G03200.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:966506-967891 REVERSE LENGTH=461
          Length = 461

 Score = 53.5 bits (127), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 68/160 (42%), Gaps = 15/160 (9%)

Query: 1   MHVAGKRLPLNPKVF----DGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISG 56
           + V  KRL +    F    DG  G ++DSGTT  YL        K      +      SG
Sbjct: 312 ITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG 371

Query: 57  PDPNYKDICFSGAGSDVSQLSRSFPVVDMVFE-NGHKLALSPENYLFPHSKVRGAYCLGV 115
                 D+CF      +   +++  V  M+F   G  L L  ENY+   S   G  CL +
Sbjct: 372 --STGLDLCFK-----LPDAAKNIAVPKMIFHFKGADLELPGENYMVADSST-GVLCLAM 423

Query: 116 FSNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCSEL 155
            S+  +  ++ G +  +N  V++D E   V F+ T C +L
Sbjct: 424 GSS--NGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461


>AT3G59080.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:21836812-21838419 FORWARD LENGTH=499
          Length = 499

 Score = 49.7 bits (117), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/163 (24%), Positives = 77/163 (47%), Gaps = 23/163 (14%)

Query: 3   VAGKRLPLNPKVF----DGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPD 58
           VAG+ L +  + +    DG  GT++DSGTT +Y         K+ I ++ +      G  
Sbjct: 350 VAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAK------GKY 403

Query: 59  PNYKDI-----CFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPEN-YLFPHSKVRGAYC 112
           P Y+D      CF+ +G    QL    P + + F +G       EN +++ +  +    C
Sbjct: 404 PVYRDFPILDPCFNVSGIHNVQL----PELGIAFADGAVWNFPTENSFIWLNEDL---VC 456

Query: 113 LGVFSNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCSEL 155
           L +    K   +++G    +N  ++YD + +++G+  T C+++
Sbjct: 457 LAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCADI 499


>AT3G59080.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:21836812-21838419 FORWARD LENGTH=535
          Length = 535

 Score = 49.7 bits (117), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/163 (24%), Positives = 77/163 (47%), Gaps = 23/163 (14%)

Query: 3   VAGKRLPLNPKVF----DGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPD 58
           VAG+ L +  + +    DG  GT++DSGTT +Y         K+ I ++ +      G  
Sbjct: 386 VAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAK------GKY 439

Query: 59  PNYKDI-----CFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPEN-YLFPHSKVRGAYC 112
           P Y+D      CF+ +G    QL    P + + F +G       EN +++ +  +    C
Sbjct: 440 PVYRDFPILDPCFNVSGIHNVQL----PELGIAFADGAVWNFPTENSFIWLNEDL---VC 492

Query: 113 LGVFSNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCSEL 155
           L +    K   +++G    +N  ++YD + +++G+  T C+++
Sbjct: 493 LAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCADI 535


>AT3G18490.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:6349090-6350592 REVERSE LENGTH=500
          Length = 500

 Score = 49.7 bits (117), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/157 (26%), Positives = 63/157 (40%), Gaps = 13/157 (8%)

Query: 1   MHVAGKRLPLNPKVFD----GKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISG 56
             V G+++ L   +FD    G  G +LD GT    L        + A +K   +LK+ S 
Sbjct: 352 FSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSS 411

Query: 57  PDPNYKDICFSGAGSDVSQLSR-SFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGV 115
               + D C+     D S LS    P V   F  G  L L  +NYL P     G +C   
Sbjct: 412 SISLF-DTCY-----DFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDD-SGTFCFA- 463

Query: 116 FSNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNC 152
           F+      +++G +  + T + YD     +G     C
Sbjct: 464 FAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>AT3G25700.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:9358937-9360295 FORWARD LENGTH=452
          Length = 452

 Score = 49.3 bits (116), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/158 (24%), Positives = 76/158 (48%), Gaps = 11/158 (6%)

Query: 1   MHVAGKRLPLNPKVFD----GKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISG 56
           + V G +L ++P +++    G  GTV+DSGTT A+L          A+ + ++ L     
Sbjct: 299 VFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK-LPIADA 357

Query: 57  PDPNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVF 116
             P + D+C + +G  V++  +  P +   F  G      P NY     +     CL + 
Sbjct: 358 LTPGF-DLCVNVSG--VTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ--IQCLAIQ 412

Query: 117 S-NGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCS 153
           S + K   +++G ++ +  L  +DR+ +++GF +  C+
Sbjct: 413 SVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450


>AT2G42980.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:17875005-17876588 REVERSE LENGTH=527
          Length = 527

 Score = 49.3 bits (116), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/157 (24%), Positives = 72/157 (45%), Gaps = 9/157 (5%)

Query: 3   VAGKRLPLNPKVF----DGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPD 58
           V GK L +  + +    DG  GT++DSGTT +Y         K+   ++++    I    
Sbjct: 376 VGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDF 435

Query: 59  PNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSN 118
           P   D CF+ +G  + + +   P + + F +G       EN     S+     CL +   
Sbjct: 436 P-VLDPCFNVSG--IEENNIHLPELGIAFVDGTVWNFPAENSFIWLSE--DLVCLAILGT 490

Query: 119 GKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNCSEL 155
            K   +++G    +N  ++YD + +++GF  T C+++
Sbjct: 491 PKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCADI 527


>AT1G25510.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:8959372-8960823 REVERSE LENGTH=483
          Length = 483

 Score = 48.5 bits (114), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 67/156 (42%), Gaps = 12/156 (7%)

Query: 1   MHVAGKRLPLNPKVFD----GKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISG 56
           + V G+ L +    F+    G  G ++DSGT    L        + + +K    L++ +G
Sbjct: 336 ISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAG 395

Query: 57  PDPNYKDICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVF 116
                 D C++ +     ++    P V   F  G  LAL  +NY+ P   V G +CL  F
Sbjct: 396 V--AMFDTCYNLSAKTTVEV----PTVAFHFPGGKMLALPAKNYMIPVDSV-GTFCLA-F 447

Query: 117 SNGKDPTTLLGGIVVRNTLVIYDREHTKVGFLKTNC 152
           +       ++G +  + T V +D  ++ +GF    C
Sbjct: 448 APTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>AT1G44130.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:16787508-16789318 REVERSE LENGTH=405
          Length = 405

 Score = 48.5 bits (114), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 35/142 (24%), Positives = 67/142 (47%), Gaps = 10/142 (7%)

Query: 22  VLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYKDICFSGAG--SDVSQLSRS 79
           + D+G++Y Y          + I  +L+        +     IC+ GA     V ++   
Sbjct: 263 IFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNF 322

Query: 80  FPVVDMVFENGHK---LALSPENYLFPHSKVRGAYCLGVFSN---GKDPTTLLGGIVVRN 133
           F  + + F NG +   L L+PE YL       G  CLG+ +    G   + ++G I ++ 
Sbjct: 323 FKTITINFTNGRRNTQLYLAPELYLIVSK--TGNVCLGLLNGSEVGLQNSNVIGDISMQG 380

Query: 134 TLVIYDREHTKVGFLKTNCSEL 155
            ++IYD E  ++G++ ++C++L
Sbjct: 381 LMMIYDNEKQQLGWVSSDCNKL 402


>AT3G20015.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:6978746-6980158 REVERSE LENGTH=470
          Length = 470

 Score = 48.1 bits (113), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 61/152 (40%), Gaps = 16/152 (10%)

Query: 7   RLPLNPKVFD----GKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPNYK 62
           R+PL   VFD    G  G V+D+GT    LP       +     +  +L + SG   +  
Sbjct: 329 RIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASG--VSIF 386

Query: 63  DICFSGAGSDVSQLSRSFPVVDMVFENGHKLALSPENYLFPHSKVRGAYCLGVFSNGKDP 122
           D C+  +G     +S   P V   F  G  L L   N+L P     G YC   F+    P
Sbjct: 387 DTCYDLSGF----VSVRVPTVSFYFTEGPVLTLPARNFLMPVDD-SGTYC---FAFAASP 438

Query: 123 TTL--LGGIVVRNTLVIYDREHTKVGFLKTNC 152
           T L  +G I      V +D  +  VGF    C
Sbjct: 439 TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470


>AT5G10770.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:3403331-3405331 REVERSE LENGTH=474
          Length = 474

 Score = 48.1 bits (113), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 67/157 (42%), Gaps = 16/157 (10%)

Query: 1   MHVAGKRLPLNPKVFDGKHGTVLDSGTTYAYLPXXXXXXXKHAIIKELQSLKQISGPDPN 60
           + V G++LP+   VF    G ++DSGT    LP       + +   ++      SG   +
Sbjct: 330 ITVGGQKLPIPSTVFS-TPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV--S 386

Query: 61  YKDICFSGAGSDVSQL-SRSFPVVDMVFENGHKLALSPEN--YLFPHSKVRGAYCLGVFS 117
             D CF     D+S   + + P V   F  G  + L  +   Y+F  S+V    CL    
Sbjct: 387 ILDTCF-----DLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQV----CLAFAG 437

Query: 118 NGKDPTTLLGGIVVRNTL-VIYDREHTKVGFLKTNCS 153
           N  D    + G V + TL V+YD    +VGF    CS
Sbjct: 438 NSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474