Miyakogusa Predicted Gene

Lj0g3v0295819.4
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0295819.4 Non Characterized Hit- tr|I1LW14|I1LW14_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.43132
PE,67.45,0,Asp,Peptidase A1; seg,NULL; no description,Peptidase
aspartic, catalytic; ASP_PROTEASE,Peptidase asp,CUFF.19817.4
         (300 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G10080.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   273   1e-73
AT4G35880.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   161   5e-40
AT2G17760.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   150   1e-36
AT3G51330.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   149   2e-36
AT3G51350.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   137   6e-33
AT3G51360.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   109   2e-24
AT3G51340.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   108   4e-24
AT3G50050.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    76   3e-14
AT5G43100.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    76   3e-14
AT1G08210.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    73   3e-13
AT1G49050.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    70   2e-12
AT1G49050.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    70   2e-12
AT3G02740.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    68   6e-12
AT5G36260.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    68   6e-12
AT1G05840.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    67   1e-11
AT1G65240.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    67   2e-11
AT2G36670.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    66   3e-11
AT2G36670.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    66   3e-11
AT1G44130.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    64   1e-10
AT4G33490.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    62   5e-10
AT1G77480.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    61   8e-10
AT1G77480.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    61   8e-10
AT1G79720.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    60   1e-09
AT5G10770.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    56   4e-08
AT5G22850.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    55   7e-08
AT4G12920.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    53   2e-07
AT4G33490.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    52   4e-07
AT5G45120.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    52   6e-07
AT3G52500.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    50   2e-06
AT4G16563.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    49   6e-06

>AT5G10080.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:3150843-3153380 FORWARD LENGTH=528
          Length = 528

 Score =  273 bits (698), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 139/267 (52%), Positives = 185/267 (69%), Gaps = 8/267 (2%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQG 64
           CG+KQSG YLDG APDG++GLGP  ISVPS L++ GL+RNSFS+C ++++SGRI FGD G
Sbjct: 230 CGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMG 289

Query: 65  HVTQQSTQFLPVD-GEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKKV 123
              QQST FL +D  ++  YIVGVE  C+G+ CLK T F   IDSG SFTYLP +IY+KV
Sbjct: 290 PSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKV 349

Query: 124 VMEFDKQVNATRETFQQLPWEYCYDASSQEVVNTPPIKLTFSKNQTILIRNPLSTFSINE 183
            +E D+ +NAT + F+ + WEYCY++S++  V  P IKL FS N T +I  PL  F  ++
Sbjct: 350 ALEIDRHINATSKNFEGVSWEYCYESSAEPKV--PAIKLKFSHNNTFVIHKPLFVFQQSQ 407

Query: 184 EYTAMCLTVYNSKDDYV-TIGQNYLRGYRLVFDRENLRFGWSRSNC-EDSVGVRVNS--T 239
                CL +  S  + + +IGQNY+RGYR+VFDREN++ GWS S C ED +     S  +
Sbjct: 408 GLVQFCLPISPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQEDKIEPPQASPGS 467

Query: 240 SPSPSTLPANQQQSPPNTHSVPPAIAG 266
           + SP+ LP ++QQS    H+V PAIAG
Sbjct: 468 TSSPNPLPTDEQQS-RGGHAVSPAIAG 493


>AT4G35880.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr4:16993339-16995721 FORWARD LENGTH=524
          Length = 524

 Score =  161 bits (407), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 95/228 (41%), Positives = 128/228 (56%), Gaps = 6/228 (2%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQG 64
           CG+ QSG +LD AAP+G+ GLG   ISVPS+LA EGL+ +SFS+C   D  GRI FGD+G
Sbjct: 230 CGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKG 289

Query: 65  HVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKKVV 124
              Q+ T F  ++    +Y + V R  VG+  L    F AL D+G SFTYL   +Y  V 
Sbjct: 290 SSDQEETPF-NLNPSHPNYNITVTRVRVGT-TLIDDEFTALFDTGTSFTYLVDPMYTTVS 347

Query: 125 MEFDKQVNATRET-FQQLPWEYCYDASSQEVVN-TPPIKLTFSKNQTILIRNPLSTFSIN 182
             F  Q    R +   ++P+EYCYD S+    +  P + LT   N    I +P+   S  
Sbjct: 348 ESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVISTE 407

Query: 183 EEYTAMCLTVYNSKDDYVTIGQNYLRGYRLVFDRENLRFGWSRSNCED 230
            E    CL +  S +  + IGQNY+ GYR+VFDRE L   W + +C D
Sbjct: 408 GEL-VYCLAIVKSSELNI-IGQNYMTGYRVVFDREKLVLAWKKFDCYD 453


>AT2G17760.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:7713488-7716269 FORWARD LENGTH=513
          Length = 513

 Score =  150 bits (378), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 96/251 (38%), Positives = 136/251 (54%), Gaps = 21/251 (8%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQG 64
           CG+ Q+G + DGAAP+G+ GLG   ISVPS+LA+EG+  NSFS+C  +D +GRI FGD+G
Sbjct: 227 CGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKG 286

Query: 65  HVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLKGTG---FQALIDSGASFTYLPHDIYK 121
            V Q+ T  L +      Y + V +  VG      TG   F A+ DSG SFTYL    Y 
Sbjct: 287 SVDQRETP-LNIRQPHPTYNITVTKISVGG----NTGDLEFDAVFDSGTSFTYLTDAAYT 341

Query: 122 KVVMEFDKQVNATR--ETFQQLPWEYCYDAS-SQEVVNTPPIKLTFSKNQTILIRNPLST 178
            +   F+      R   T  +LP+EYCY  S +++    P + LT     +  + +PL  
Sbjct: 342 LISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVV 401

Query: 179 FSINEEYTAMCLTVYNSKDDYVTIGQNYLRGYRLVFDRENLRFGWSRSNCEDSVGVRVNS 238
             + ++    CL +   +D  + IGQN++ GYR+VFDRE L  GW  S+C         +
Sbjct: 402 IPM-KDTDVYCLAIMKIEDISI-IGQNFMTGYRVVFDREKLILGWKESDCY--------T 451

Query: 239 TSPSPSTLPAN 249
              S  TLP+N
Sbjct: 452 GETSARTLPSN 462


>AT3G51330.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:19053480-19056152 REVERSE LENGTH=529
          Length = 529

 Score =  149 bits (377), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 92/230 (40%), Positives = 127/230 (55%), Gaps = 9/230 (3%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLND--DESGRILFGD 62
           CG+ Q+G     AA +G+LGLG    SVPS+LA+  +  NSFS+C  +  D  GRI FGD
Sbjct: 227 CGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGD 286

Query: 63  QGHVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKK 122
           +G+  Q  T  LP +     Y V V    VG   + G    AL D+G SFT+L    Y  
Sbjct: 287 KGYTDQMETPLLPTEPS-PTYAVSVTEVSVGGDAV-GVQLLALFDTGTSFTHLLEPEYGL 344

Query: 123 VVMEFDKQVNATRETFQ-QLPWEYCYDAS-SQEVVNTPPIKLTFSKNQTILIRNPLSTFS 180
           +   FD  V   R     +LP+E+CYD S ++  +  P + +TF     + +RNPL    
Sbjct: 345 ITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIV- 403

Query: 181 INEEYTAM-CLTVYNSKDDYVTI-GQNYLRGYRLVFDRENLRFGWSRSNC 228
            NE+ +AM CL +  S D  + I GQN++ GYR+VFDRE +  GW RS+C
Sbjct: 404 WNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDC 453


>AT3G51350.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:19060485-19063248 REVERSE LENGTH=528
          Length = 528

 Score =  137 bits (346), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 99/285 (34%), Positives = 141/285 (49%), Gaps = 17/285 (5%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLND--DESGRILFGD 62
           CG+KQ+G +    + +GVLGLG    SVPSLLA+  +  NSFS+C        GRI FGD
Sbjct: 226 CGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRISFGD 285

Query: 63  QGHVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKK 122
           +G+  Q+ T F+ V      Y V +    V    +    F A  D+G+SFT+L    Y  
Sbjct: 286 RGYTDQEETPFISV-APSTAYGVNISGVSVAGDPVDIRLF-AKFDTGSSFTHLREPAYGV 343

Query: 123 VVMEFDKQVNATRETFQ-QLPWEYCYDAS-SQEVVNTPPIKLTFSKNQTILIRNPLSTFS 180
           +   FD+ V   R     +LP+E+CYD S +   +  P +++TF     I++ NP  T  
Sbjct: 344 LTKSFDELVEDRRRPVDPELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTAR 403

Query: 181 INEEYTAMCLTVYNSKDDYV-TIGQNYLRGYRLVFDRENLRFGWSRSNC------EDSVG 233
             E     CL V  S    +  IGQN++ GYR+VFDRE +  GW +S C      E +  
Sbjct: 404 TQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVFDRERMILGWKQSLCFEDESLESTTP 463

Query: 234 VRVNSTSPSPSTLPANQQQSPPNTHSVPPAIAGHTSPKPSTATPG 278
                 +P+PS      +  PP   + PP I    +P+ ST  PG
Sbjct: 464 PPPEVEAPAPSVSAPPPRSLPPTVSATPPPI----NPRNSTGNPG 504


>AT3G51360.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:19064294-19066560 REVERSE LENGTH=488
          Length = 488

 Score =  109 bits (272), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 77/277 (27%), Positives = 130/277 (46%), Gaps = 37/277 (13%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQG 64
           C   Q G + +  A +G++GL    I+VP++L + G+  +SFS+C   +  G I FGD+G
Sbjct: 211 CSESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKG 269

Query: 65  HVTQQSTQFLPVDGEF--IDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKK 122
              Q  T   P+ G    + Y V + +F VG   +  T F A  DSG + T+L    Y  
Sbjct: 270 SSDQLET---PLSGTISPMFYDVSITKFKVGKVTVD-TEFTATFDSGTAVTWLIEPYYTA 325

Query: 123 VVMEF-----DKQVNATRETFQQLPWEYCY-DASSQEVVNTPPIKLTFSKNQTILIRNPL 176
           +   F     D++++ + ++    P+E+CY   S+ +    P +           + +P+
Sbjct: 326 LTTNFHLSVPDRRLSKSVDS----PFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPI 381

Query: 177 STFSINE-EYTAMCLTVYNSKD-DYVTIGQNYLRGYRLVFDRENLRFGWSRSNCEDSVGV 234
             F  ++  +   CL V    + D+  IGQN++  YR+V DRE    GW +SNC D+ G 
Sbjct: 382 LVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTNYRIVHDRERRILGWKKSNCNDTNGF 441

Query: 235 RVNSTSPSPSTLPANQQQSPPNTHSVPPAIAGHTSPK 271
                             + P   + PP++A  +SP+
Sbjct: 442 ------------------TGPTALAKPPSMAPTSSPR 460


>AT3G51340.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:19057013-19059788 REVERSE LENGTH=530
          Length = 530

 Score =  108 bits (271), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 82/236 (34%), Positives = 117/236 (49%), Gaps = 15/236 (6%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDES--GRILFGD 62
           CG+ Q+G +    A +GVLGL     SVPSLLA+  +  NSFS+C     S  GRI FGD
Sbjct: 227 CGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGD 286

Query: 63  QGHVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKK 122
           +G+  Q+ T  + ++     Y V V    VG   +    F AL D+G+SFT L    Y  
Sbjct: 287 KGYTDQEETPLVSLETS-TAYGVNVTGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGV 344

Query: 123 VVMEFDKQVNATRETFQ-QLPWEYCYDASSQEVVNTPPIKLTFSK-------NQTILIRN 174
               FD  +   R       P+E+CYD   + + +    +   SK       +    I+N
Sbjct: 345 FTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQN 404

Query: 175 -PLSTFSINEEYTAM-CLTVYNSKDDYVTIGQNYLRGYRLVFDRENLRFGWSRSNC 228
               + S + E T M CL +  S +  + IGQN + G+R+VFDRE +  GW +SNC
Sbjct: 405 DSQESVSYSNEGTKMYCLGILKSINLNI-IGQNLMSGHRIVFDRERMILGWKQSNC 459


>AT3G50050.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:18554138-18557115 REVERSE LENGTH=632
          Length = 632

 Score = 76.3 bits (186), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 113/254 (44%), Gaps = 21/254 (8%)

Query: 12  GYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLN--DDESGRILFGDQGHVTQQ 69
           G L     DG++GLG G +S+   L ++GLI NSF +C    D   G ++ G  G     
Sbjct: 205 GDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILG--GFDYPS 262

Query: 70  STQFLPVDGEFIDY----IVGVERFCVGSFCLKGTGFQ----ALIDSGASFTYLPHDIYK 121
              F   D +   Y    + G+ R       L    F     A++DSG ++ YLP   + 
Sbjct: 263 DMVFTDSDPDRSPYYNIDLTGI-RVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFA 321

Query: 122 KVVMEFDKQVNATRETFQQLP--WEYCYDASSQEVVNT-----PPIKLTFSKNQTILIRN 174
                  ++V+  ++     P   + C+  ++   V+      P +++ F   Q+ L+  
Sbjct: 322 AFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSP 381

Query: 175 PLSTFSINEEYTAMCLTVY-NSKDDYVTIGQNYLRGYRLVFDRENLRFGWSRSNCEDSVG 233
               F  ++ + A CL V+ N KD    +G   +R   +V+DREN + G+ R+NC +   
Sbjct: 382 ENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSD 441

Query: 234 VRVNSTSPSPSTLP 247
                 +P P+TLP
Sbjct: 442 RLHIDGAPPPATLP 455


>AT5G43100.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:17299264-17302718 FORWARD LENGTH=631
          Length = 631

 Score = 75.9 bits (185), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 69/292 (23%), Positives = 128/292 (43%), Gaps = 31/292 (10%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDE--SGRILFGD 62
           C  +++G      A DG++GLG G +SV   L ++G+I + FS+C    E   G ++ G 
Sbjct: 182 CENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGK 240

Query: 63  ----QGHVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLK------GTGFQALIDSGASF 112
                G V   S  F         Y + +++  V    LK            ++DSG ++
Sbjct: 241 ISPPPGMVFSHSDPF-----RSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTY 295

Query: 113 TYLPHDIYKKVVMEFDKQVNATRETFQQLPW--EYCYDASSQEVVNT----PPIKLTFSK 166
            Y P + +  +     K++ + +      P   + C+  + ++V       P I + F  
Sbjct: 296 AYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGN 355

Query: 167 NQTILIRNPLSTFSINEEYTAMCLTVYNSKDDYVTIGQNYLRGYRLVFDRENLRFGWSRS 226
            Q +++      F   +   A CL ++  +D    +G   +R   + +DREN + G+ ++
Sbjct: 356 GQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKT 415

Query: 227 NCEDSVGVRVNSTSPSPSTLPANQQQSPPNTHSVPPAIAGHTSPKPSTATPG 278
           NC D         SP+P++ P +Q +S     ++ P+ A  TS  P++  PG
Sbjct: 416 NCSDIWRRLAAPESPAPTS-PISQNKS----SNISPSPA--TSESPTSHLPG 460


>AT1G08210.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:2577119-2580581 REVERSE LENGTH=492
          Length = 492

 Score = 72.8 bits (177), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 66/249 (26%), Positives = 108/249 (43%), Gaps = 24/249 (9%)

Query: 18  APDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQGHVTQQSTQFLPVD 77
           A DG+ GLG GS+SV S LA +GL    FS CL  D+SG  +    G + +  T + P+ 
Sbjct: 222 AVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLV 280

Query: 78  GEFIDYIVGVERFCVGSFCLK--------GTGFQALIDSGASFTYLPHDIYKKVVMEFDK 129
                Y V ++   V    L          TG   +ID+G +  YLP + Y   +     
Sbjct: 281 PSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVAN 340

Query: 130 QVN-----ATRETFQQLPWEYCYDASSQEVVNTPPIKLTFSKNQTILI--RNPLSTFSIN 182
            V+      T E++Q      C++ ++ +V   P + L+F+   ++++  R  L  FS +
Sbjct: 341 AVSQYGRPITYESYQ------CFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIFS-S 393

Query: 183 EEYTAMCLTVYNSKDDYVTI-GQNYLRGYRLVFDRENLRFGWSRSNCEDSVGVRVNSTSP 241
              +  C+         +TI G   L+   +V+D    R GW+  +C   V V  +    
Sbjct: 394 SGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCSLEVNVSASRGGR 453

Query: 242 SPSTLPANQ 250
           S   +   Q
Sbjct: 454 SKDVINTGQ 462


>AT1G49050.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:18150638-18153186 FORWARD LENGTH=583
          Length = 583

 Score = 70.1 bits (170), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/249 (26%), Positives = 106/249 (42%), Gaps = 25/249 (10%)

Query: 5   CGRKQSGGYLDGAAP-DGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGR-ILFGD 62
           CG  Q G  L+     DG+LGL    IS+PS LA  G+I N    CL  D +G   +F  
Sbjct: 320 CGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMG 379

Query: 63  QGHVTQQSTQFLPV--DGEFIDYIVGVERFCVGSFCL-----KGTGFQALIDSGASFTYL 115
              V      ++P+  D     Y + V +   G   L      G   + L D+G+S+TY 
Sbjct: 380 SDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYF 439

Query: 116 PHDIYKKVVMEFDKQ--VNATR-ETFQQLP--WE----YCYDASSQEVVNTPPIKLTFSK 166
           P+  Y ++V    +   +  TR ++ + LP  W     + + + S       PI L    
Sbjct: 440 PNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGS 499

Query: 167 NQTILIRNPL---STFSINEEYTAMCLTVYNSKDDY----VTIGQNYLRGYRLVFDRENL 219
              I+ R  L     + I      +CL + +    +    + +G   +RG+ +V+D    
Sbjct: 500 KWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKR 559

Query: 220 RFGWSRSNC 228
           R GW +S+C
Sbjct: 560 RIGWMKSDC 568


>AT1G49050.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:18151161-18153186 FORWARD LENGTH=410
          Length = 410

 Score = 69.7 bits (169), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/251 (26%), Positives = 107/251 (42%), Gaps = 29/251 (11%)

Query: 5   CGRKQSGGYLDGAAP-DGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGR-ILFGD 62
           CG  Q G  L+     DG+LGL    IS+PS LA  G+I N    CL  D +G   +F  
Sbjct: 147 CGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMG 206

Query: 63  QGHVTQQSTQFLPV--DGEFIDYIVGVERFCVGSFCL-----KGTGFQALIDSGASFTYL 115
              V      ++P+  D     Y + V +   G   L      G   + L D+G+S+TY 
Sbjct: 207 SDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYF 266

Query: 116 PHDIYKKVVMEFDKQ--VNATR-ETFQQLPWEYCYDASSQEVVNT--------PPIKLTF 164
           P+  Y ++V    +   +  TR ++ + LP   C+ A +    ++         PI L  
Sbjct: 267 PNQAYSQLVTSLQEVSGLELTRDDSDETLP--ICWRAKTNFPFSSLSDVKKFFRPITLQI 324

Query: 165 SKNQTILIRNPL---STFSINEEYTAMCLTVYNSKDDY----VTIGQNYLRGYRLVFDRE 217
                I+ R  L     + I      +CL + +    +    + +G   +RG+ +V+D  
Sbjct: 325 GSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIVYDNV 384

Query: 218 NLRFGWSRSNC 228
             R GW +S+C
Sbjct: 385 KRRIGWMKSDC 395


>AT3G02740.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:590561-593089 FORWARD LENGTH=488
          Length = 488

 Score = 68.2 bits (165), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 105/251 (41%), Gaps = 31/251 (12%)

Query: 5   CGRKQSGGYLDG-AAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQ 63
           CG KQSG   +  AA DG++G G  + S  S LA +G ++ SF+ CL+++  G I     
Sbjct: 207 CGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIF--AI 264

Query: 64  GHVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLK--------GTGFQALIDSGASFTYL 115
           G V     +  P+  +   Y V +    VG+  L+        G     +IDSG +  YL
Sbjct: 265 GEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYL 324

Query: 116 PHDIYKKVVMEF-----DKQVNATRETFQQLPWEYCYDASSQEVVNTPPIKLTFSKNQTI 170
           P  +Y  ++ E      +  ++  +E+F    +       + ++   P +   F K+ ++
Sbjct: 325 PDAVYNPLLNEILASHPELTLHTVQESFTCFHY-------TDKLDRFPTVTFQFDKSVSL 377

Query: 171 LIRNPLSTFSINEEYTAMCLTVYN------SKDDYVTIGQNYLRGYRLVFDRENLRFGWS 224
            +      F + E+    C    N             +G   L    +V+D EN   GW+
Sbjct: 378 AVYPREYLFQVRED--TWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWT 435

Query: 225 RSNCEDSVGVR 235
             NC   + V+
Sbjct: 436 NHNCSGGIQVK 446


>AT5G36260.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:14285068-14288179 REVERSE LENGTH=482
          Length = 482

 Score = 68.2 bits (165), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 67/256 (26%), Positives = 111/256 (43%), Gaps = 41/256 (16%)

Query: 5   CGRKQSG--GYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRIL-FG 61
           CG+ QSG  G  D A  DG++G G  + S+ S LA  G  +  FS CL++   G I   G
Sbjct: 201 CGKNQSGQLGQTDSAV-DGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVG 259

Query: 62  DQGHVTQQSTQFLP-------------VDGEFIDYIVGVERFCVGSFCLKGTGFQALIDS 108
           +      ++T  +P             VDG+ ID    +           G G   +IDS
Sbjct: 260 EVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLAS-------TNGDG-GTIIDS 311

Query: 109 GASFTYLPHDIY----KKVVMEFDKQVNATRETFQQLPWEYCYDASSQEVVNTPPIKLTF 164
           G +  YLP ++Y    +K+  +   +++  +ETF       C+  +S      P + L F
Sbjct: 312 GTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA------CFSFTSNTDKAFPVVNLHF 365

Query: 165 SKNQTILIRNPLSTFSINEE-----YTAMCLTVYNSKDDYVTIGQNYLRGYRLVFDRENL 219
             +  + +      FS+ E+     + +  +T  +   D + +G   L    +V+D EN 
Sbjct: 366 EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGA-DVILLGDLVLSNKLVVYDLENE 424

Query: 220 RFGWSRSNCEDSVGVR 235
             GW+  NC  S+ V+
Sbjct: 425 VIGWADHNCSSSIKVK 440


>AT1G05840.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:1762843-1766150 REVERSE LENGTH=485
          Length = 485

 Score = 67.4 bits (163), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/252 (26%), Positives = 105/252 (41%), Gaps = 24/252 (9%)

Query: 5   CGRKQSGGYLDGA---APDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFG 61
           CG +QSG  LD +   A DG+LG G  + S+ S LA  G ++  F+ CL+    G I   
Sbjct: 205 CGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIF-- 261

Query: 62  DQGHVTQQSTQFLPVDGEFIDYIVGVERFCVGS--FCLKGTGFQ------ALIDSGASFT 113
             G V Q      P+      Y V +    VG     +    FQ      A+IDSG +  
Sbjct: 262 AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLA 321

Query: 114 YLPHDIYKKVVMEFDKQVNATRETFQQLPWEYCYDASSQEVVNTPPIKLTFSKNQTILIR 173
           YLP  IY+ +V +   Q  A +       ++ C+  S +     P   +TF    ++ +R
Sbjct: 322 YLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFP--NVTFHFENSVFLR 378

Query: 174 NPLSTFSINEEYTAMCLTVYNS------KDDYVTIGQNYLRGYRLVFDRENLRFGWSRSN 227
                +    E    C+   NS      + +   +G   L    +++D EN   GW+  N
Sbjct: 379 VYPHDYLFPHE-GMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYN 437

Query: 228 CEDSVGVRVNST 239
           C  S+ V+   T
Sbjct: 438 CSSSIKVKDEGT 449


>AT1G65240.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:24230963-24233349 REVERSE LENGTH=475
          Length = 475

 Score = 66.6 bits (161), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 61/252 (24%), Positives = 108/252 (42%), Gaps = 36/252 (14%)

Query: 5   CGRKQSGGYLDG-AAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRIL-FGD 62
           CG  QSG   +G +A DGV+G G  + SV S LA  G  +  FS CL++ + G I   G 
Sbjct: 197 CGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGV 256

Query: 63  QGHVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLKGTGFQ----------ALIDSGASF 112
                 ++T  +P    +   ++G++        + GT              ++DSG + 
Sbjct: 257 VDSPKVKTTPMVPNQMHYNVMLMGMD--------VDGTSLDLPRSIVRNGGTIVDSGTTL 308

Query: 113 TYLPHDIYKKVVMEFDK----QVNATRETFQQLPWEYCYDASSQEVVNTPPIKLTFSKNQ 168
            Y P  +Y  ++         +++   ETFQ      C+  S+      PP+   F  + 
Sbjct: 309 AYFPKVLYDSLIETILARQPVKLHIVEETFQ------CFSFSTNVDEAFPPVSFEFEDSV 362

Query: 169 TILIRNPLSTFSINEE-----YTAMCLTVYNSKDDYVTIGQNYLRGYRLVFDRENLRFGW 223
            + +      F++ EE     + A  LT  + + + + +G   L    +V+D +N   GW
Sbjct: 363 KLTVYPHDYLFTLEEELYCFGWQAGGLTT-DERSEVILLGDLVLSNKLVVYDLDNEVIGW 421

Query: 224 SRSNCEDSVGVR 235
           +  NC  S+ ++
Sbjct: 422 ADHNCSSSIKIK 433


>AT2G36670.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:15364949-15368016 REVERSE LENGTH=507
          Length = 507

 Score = 65.9 bits (159), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 64/251 (25%), Positives = 103/251 (41%), Gaps = 25/251 (9%)

Query: 5   CGRKQSGGYLDG-AAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQ 63
           C   QSG       A DG+ G G G +SV S L+  G+    FS CL  D SG  +F   
Sbjct: 225 CSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVF-VL 283

Query: 64  GHVTQQSTQFLPVDGEFIDYIVGVERFCVGS--FCLKGTGFQA------LIDSGASFTYL 115
           G +      + P+      Y + +    V      L    F+A      ++D+G + TYL
Sbjct: 284 GEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYL 343

Query: 116 PHDIYKKVVMEFDKQVNATRETFQQLP------WEYCYDASSQEVVNTPPIKLTFSKNQT 169
             + Y       D  +NA   +  QL        E CY  S+      P + L F+   +
Sbjct: 344 VKEAY-------DLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGAS 396

Query: 170 ILIR--NPLSTFSINEEYTAMCLTVYNSKDDYVTIGQNYLRGYRLVFDRENLRFGWSRSN 227
           +++R  + L  + I +  +  C+    + ++   +G   L+    V+D    R GW+  +
Sbjct: 397 MMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYD 456

Query: 228 CEDSVGVRVNS 238
           C  SV V + S
Sbjct: 457 CSMSVNVSITS 467


>AT2G36670.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:15364949-15368016 REVERSE LENGTH=512
          Length = 512

 Score = 65.9 bits (159), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 64/251 (25%), Positives = 103/251 (41%), Gaps = 25/251 (9%)

Query: 5   CGRKQSGGYLDG-AAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQ 63
           C   QSG       A DG+ G G G +SV S L+  G+    FS CL  D SG  +F   
Sbjct: 230 CSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVF-VL 288

Query: 64  GHVTQQSTQFLPVDGEFIDYIVGVERFCVGS--FCLKGTGFQA------LIDSGASFTYL 115
           G +      + P+      Y + +    V      L    F+A      ++D+G + TYL
Sbjct: 289 GEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYL 348

Query: 116 PHDIYKKVVMEFDKQVNATRETFQQLP------WEYCYDASSQEVVNTPPIKLTFSKNQT 169
             + Y       D  +NA   +  QL        E CY  S+      P + L F+   +
Sbjct: 349 VKEAY-------DLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGAS 401

Query: 170 ILIR--NPLSTFSINEEYTAMCLTVYNSKDDYVTIGQNYLRGYRLVFDRENLRFGWSRSN 227
           +++R  + L  + I +  +  C+    + ++   +G   L+    V+D    R GW+  +
Sbjct: 402 MMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYD 461

Query: 228 CEDSVGVRVNS 238
           C  SV V + S
Sbjct: 462 CSMSVNVSITS 472


>AT1G44130.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:16787508-16789318 REVERSE LENGTH=405
          Length = 405

 Score = 63.9 bits (154), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 57/226 (25%), Positives = 92/226 (40%), Gaps = 16/226 (7%)

Query: 18  APDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQGHVTQQSTQFLPVD 77
           A  GVLGLG G I + + L   GL RN    CL+    G + FGD   V      + P+ 
Sbjct: 177 ATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGFLFFGDN-LVPSIGVAWTPLL 235

Query: 78  GEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKKVV--MEFDKQVNATR 135
            +   Y  G              G + + D+G+S+TY     Y+ ++  +  D +V+  +
Sbjct: 236 SQDNHYTTGPADLLFNGKPTGLKGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLK 295

Query: 136 ETFQQLPWEYCYDA-----SSQEVVN---TPPIKLTFSKNQTILIRNPLSTFSINEEYTA 187
              +      C+       S  EV N   T  I  T  +  T L   P   + I  +   
Sbjct: 296 VAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFTNGRRNTQLYLAP-ELYLIVSKTGN 354

Query: 188 MCLTVYNSKD----DYVTIGQNYLRGYRLVFDRENLRFGWSRSNCE 229
           +CL + N  +    +   IG   ++G  +++D E  + GW  S+C 
Sbjct: 355 VCLGLLNGSEVGLQNSNVIGDISMQGLMMIYDNEKQQLGWVSSDCN 400


>AT4G33490.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr4:16108781-16110679 REVERSE LENGTH=425
          Length = 425

 Score = 62.0 bits (149), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 60/251 (23%), Positives = 105/251 (41%), Gaps = 25/251 (9%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQG 64
           CG  Q  G       DGVLGLG G +S+ S L  +G ++N    CL+    G + FGD  
Sbjct: 173 CGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDL 232

Query: 65  HVTQQSTQFLPVDGEFIDY---IVGVERFCVGSFCLKGTGFQALI---DSGASFTYLPHD 118
           + + +   + P+  E+  +    +G E    G    + TG + L+   DSG+S+TY    
Sbjct: 233 YDSSR-VSWTPMSREYSKHYSPAMGGELLFGG----RTTGLKNLLTVFDSGSSYTYFNSK 287

Query: 119 IYKKVVMEFDKQVNAT--RETFQQLPWEYCYDA-----SSQEVVNT-PPIKLTFSK--NQ 168
            Y+ V     ++++    +E         C+       S +EV     P+ L+F      
Sbjct: 288 AYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRS 347

Query: 169 TILIRNPLSTFSINEEYTAMCLTVYNSKD----DYVTIGQNYLRGYRLVFDRENLRFGWS 224
             L   P   + I      +CL + N  +    +   IG   ++   +++D E    GW 
Sbjct: 348 KTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWM 407

Query: 225 RSNCEDSVGVR 235
             +C++   ++
Sbjct: 408 PVDCDELASLK 418


>AT1G77480.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:29114946-29117150 REVERSE LENGTH=432
          Length = 432

 Score = 61.2 bits (147), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 57/226 (25%), Positives = 93/226 (41%), Gaps = 20/226 (8%)

Query: 21  GVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQ----GHVTQQSTQFLPV 76
           G+LGLG G + + + L   G+ +N    CL+    G +  GD+      VT  S   L  
Sbjct: 198 GILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTS---LAT 254

Query: 77  DGEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKKVVMEFDKQVNATRE 136
           +    +Y+ G              G   + DSG+S+TY   + Y+ ++    K +N    
Sbjct: 255 NSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPL 314

Query: 137 T----FQQLP--WEYCYDASSQEVVNT--PPIKLTFSKNQT-ILIRNPLSTFSINEEYTA 187
           T     + LP  W+      S + V      I L F   +   L + P  ++ I  E   
Sbjct: 315 TDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGR 374

Query: 188 MCLTVYNSK----DDYVTIGQNYLRGYRLVFDRENLRFGWSRSNCE 229
           +CL + N      + Y  IG    +G  +++D E  R GW  S+C+
Sbjct: 375 VCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDCD 420


>AT1G77480.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:29114705-29117150 REVERSE LENGTH=466
          Length = 466

 Score = 61.2 bits (147), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 57/226 (25%), Positives = 93/226 (41%), Gaps = 20/226 (8%)

Query: 21  GVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQ----GHVTQQSTQFLPV 76
           G+LGLG G + + + L   G+ +N    CL+    G +  GD+      VT  S   L  
Sbjct: 198 GILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTS---LAT 254

Query: 77  DGEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIYKKVVMEFDKQVNATRE 136
           +    +Y+ G              G   + DSG+S+TY   + Y+ ++    K +N    
Sbjct: 255 NSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPL 314

Query: 137 T----FQQLP--WEYCYDASSQEVVNT--PPIKLTFSKNQT-ILIRNPLSTFSINEEYTA 187
           T     + LP  W+      S + V      I L F   +   L + P  ++ I  E   
Sbjct: 315 TDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGR 374

Query: 188 MCLTVYNSK----DDYVTIGQNYLRGYRLVFDRENLRFGWSRSNCE 229
           +CL + N      + Y  IG    +G  +++D E  R GW  S+C+
Sbjct: 375 VCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDCD 420


>AT1G79720.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:29997259-29998951 REVERSE LENGTH=484
          Length = 484

 Score = 60.5 bits (145), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 16/198 (8%)

Query: 46  FSICL---NDDESGRILFGDQGHVTQQSTQFL-------PVDGEFIDYIVGVERFCVGSF 95
           FS CL    D  SG + FG+   V   ST          P    F  YI+ +    +G  
Sbjct: 288 FSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSF--YILNLTGASIGGV 345

Query: 96  CLKGTGFQ--ALIDSGASFTYLPHDIYKKVVMEFDKQVNATRETFQQLPWEYCYDASSQE 153
            LK + F    LIDSG   T LP  IYK V +EF KQ +           + C++ +S E
Sbjct: 346 ELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYE 405

Query: 154 VVNTPPIKLTFSKNQTILIRNPLSTFSINEEYTAMCLTVYN-SKDDYVTIGQNY-LRGYR 211
            ++ P IK+ F  N  + +      + +  + + +CL + + S ++ V I  NY  +  R
Sbjct: 406 DISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQR 465

Query: 212 LVFDRENLRFGWSRSNCE 229
           +++D    R G    NC 
Sbjct: 466 VIYDTTQERLGIVGENCR 483


>AT5G10770.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:3403331-3405331 REVERSE LENGTH=474
          Length = 474

 Score = 55.8 bits (133), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 63/241 (26%), Positives = 95/241 (39%), Gaps = 29/241 (12%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDES--GRILFGD 62
           CG    G    G A  G+LGLG   +S PS  A        FS CL    S  G + FG 
Sbjct: 247 CGENNQG-LFTGVA--GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSASYTGHLTFGS 301

Query: 63  QGHVTQQSTQFLPV----DG------EFIDYIVGVERFCVGSFCLKGTGFQALIDSGASF 112
            G    +S +F P+    DG        +   VG ++  + S      G  ALIDSG   
Sbjct: 302 AG--ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPG--ALIDSGTVI 357

Query: 113 TYLPHDIYKKVVMEFDKQVNATRETFQQLPWEYCYDASSQEVVNTPPIKLTFSKNQTILI 172
           T LP   Y  +   F  +++    T      + C+D S  + V  P +  +FS    + +
Sbjct: 358 TRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVEL 417

Query: 173 --RNPLSTFSINEEYTAMCLTVYNSKDD--YVTIGQNYLRGYRLVFDRENLRFGWSRSNC 228
             +     F I++    +CL    + DD      G    +   +V+D    R G++ + C
Sbjct: 418 GSKGIFYVFKISQ----VCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473

Query: 229 E 229
            
Sbjct: 474 S 474


>AT5G22850.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:7633717-7636298 REVERSE LENGTH=493
          Length = 493

 Score = 54.7 bits (130), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 65/261 (24%), Positives = 104/261 (39%), Gaps = 20/261 (7%)

Query: 5   CGRKQSGGYLDG-AAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQ 63
           C   Q+G  +    A DG+ G G   +SV S LA +G+    FS CL  +  G  +    
Sbjct: 207 CSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGIL-VL 265

Query: 64  GHVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCL--------KGTGFQALIDSGASFTYL 115
           G + + +  F P+      Y V +    V    L           G   +ID+G +  YL
Sbjct: 266 GEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYL 325

Query: 116 PHDIYKKVVMEFDKQVNATRETFQQL--PWEYCYDASSQEVVNTPPIKLTFSKNQTILIR 173
               Y   V   +   NA  ++ + +      CY  ++      PP+ L F+   ++ + 
Sbjct: 326 SEAAYVPFV---EAITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFL- 381

Query: 174 NPLSTF--SINEEYTAM-CLTVYNSKDDYVTI-GQNYLRGYRLVFDRENLRFGWSRSNCE 229
           NP        N   TA+ C+     ++  +TI G   L+    V+D    R GW+  +C 
Sbjct: 382 NPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCS 441

Query: 230 DSVGVRVNSTSPSPSTLPANQ 250
            SV V   S+S     + A Q
Sbjct: 442 TSVNVSATSSSGRSEYVNAGQ 462


>AT4G12920.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr4:7568286-7569455 FORWARD LENGTH=389
          Length = 389

 Score = 53.1 bits (126), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 54/231 (23%), Positives = 95/231 (41%), Gaps = 22/231 (9%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLND----DESGRILF 60
           C     G Y  G    G+LGLG G  S+   + E G   + FS CL +      S  ++ 
Sbjct: 174 CNTLSDGSYFTGT---GILGLGVGKYSI---IGEFG---SKFSFCLGEISEPKASHNLIL 224

Query: 61  GDQGHVTQQSTQFLPVDGEFIDYIVGVERFCVGSFCLKGTGFQALIDSGASFTYLPHDIY 120
           GD  +V    T     +G     I  +E   VG         Q  +D+G++ ++L  ++Y
Sbjct: 225 GDGANVQGHPTVINITEGH---TIFQLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLY 281

Query: 121 KKVVMEFDKQVNATRETFQQLPWEYCYDASSQEVVNTPPIKLTFSKNQTILIRNPLSTFS 180
            K V  FD  + +   +++      CY A + E +    +   F     + + N  + F 
Sbjct: 282 YKFVDAFDDLIGSRPLSYEP---TLCYKADTIERLEKMDVGFKFDVGAELSV-NIHNIFI 337

Query: 181 INEEYTAMCLTVYNSKDDY--VTIGQNYLRGYRLVFDRENLRFGWSRSNCE 229
                   CL + N+K+ +  V IG   ++GY + +D        ++ +C+
Sbjct: 338 QQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDCD 388


>AT4G33490.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr4:16108928-16110670 REVERSE LENGTH=401
          Length = 401

 Score = 52.4 bits (124), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 53/209 (25%), Positives = 87/209 (41%), Gaps = 21/209 (10%)

Query: 5   CGRKQSGGYLDGAAPDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGDQG 64
           CG  Q  G       DGVLGLG G +S+ S L  +G ++N    CL+    G + FGD  
Sbjct: 170 CGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDL 229

Query: 65  HVTQQSTQFLPVDGEFIDY---IVGVERFCVGSFCLKGTGFQALI---DSGASFTYLPHD 118
           + + +   + P+  E+  +    +G E    G    + TG + L+   DSG+S+TY    
Sbjct: 230 YDSSR-VSWTPMSREYSKHYSPAMGGELLFGG----RTTGLKNLLTVFDSGSSYTYFNSK 284

Query: 119 IYKKVVMEFDKQVNAT--RETFQQLPWEYCYDA-----SSQEVVNT-PPIKLTFSK--NQ 168
            Y+ V     ++++    +E         C+       S +EV     P+ L+F      
Sbjct: 285 AYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRS 344

Query: 169 TILIRNPLSTFSINEEYTAMCLTVYNSKD 197
             L   P   + I      +CL + N  +
Sbjct: 345 KTLFEIPPEAYLIISMKGNVCLGILNGTE 373


>AT5G45120.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:18241003-18242478 FORWARD LENGTH=491
          Length = 491

 Score = 52.0 bits (123), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 62/254 (24%), Positives = 105/254 (41%), Gaps = 48/254 (18%)

Query: 19  PDGVLGLGPGSISVPSLLAEEGLIRNSFSICL-------NDDESGRILFGDQGHVTQ--Q 69
           P G+ G G G +S+PS L   G +   FS C        N + S  ++ G          
Sbjct: 229 PIGIAGFGRGLLSLPSQL---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTD 285

Query: 70  STQFLPVDGEFI---DYIVGVERFCVGS-------------FCLKGTGFQALIDSGASFT 113
           S QF P+    +    Y +G+E   +G+             F  +G G   L+DSG ++T
Sbjct: 286 SLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNG-GMLVDSGTTYT 344

Query: 114 YLPHDIYKKVVMEFDKQVNATR--ETFQQLPWEYCY-------DASSQE---VVNTPPIK 161
           +LP   Y +++      +   R  ET  +  ++ CY       + +S E   ++  P I 
Sbjct: 345 HLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSIT 404

Query: 162 LTFSKNQTILIRNPLSTFSI---NEEYTAMCLTVYNSKD-DY---VTIGQNYLRGYRLVF 214
             F  N T+L+    S +++   ++     CL   N +D DY      G    +  ++V+
Sbjct: 405 FHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVY 464

Query: 215 DRENLRFGWSRSNC 228
           D E  R G+   +C
Sbjct: 465 DLEKERIGFQAMDC 478


>AT3G52500.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:19465644-19467053 REVERSE LENGTH=469
          Length = 469

 Score = 50.1 bits (118), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 104/248 (41%), Gaps = 43/248 (17%)

Query: 19  PDGVLGLGPGSISVPSLLAEEGLIRNSFSICLNDDESGRILFGD--QGHVTQQSTQFLPV 76
           P G+ G G G +S+PS +  +       S   +D      L  D   GH +   T  L  
Sbjct: 225 PAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTY 284

Query: 77  ----------DGEFIDYI-VGVERFCVGSFCLK-----------GTGFQALIDSGASFTY 114
                     +  F++Y  + + R  VG   +K           G G  +++DSG++FT+
Sbjct: 285 TPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDG-GSIVDSGSTFTF 343

Query: 115 LPHDIYKKVVMEFDKQV-NATRETFQQLPWEY----CYDASSQEVVNTPPIKLTFSKNQT 169
           +   +++ V  EF  Q+ N TRE  + L  E     C++ S +  V  P +   F     
Sbjct: 344 MERPVFELVAEEFASQMSNYTRE--KDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAK 401

Query: 170 ILIRNPLST-FSINEEYTAMCLTVYNSKD--------DYVTIGQNYLRGYRLVFDRENLR 220
           + +  PLS  F+       +CLTV + K           + +G    + Y + +D EN R
Sbjct: 402 LEL--PLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDR 459

Query: 221 FGWSRSNC 228
           FG+++  C
Sbjct: 460 FGFAKKKC 467


>AT4G16563.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr4:9329933-9331432 REVERSE LENGTH=499
          Length = 499

 Score = 48.5 bits (114), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 67/272 (24%), Positives = 102/272 (37%), Gaps = 62/272 (22%)

Query: 17  AAPDGVLGLGPGSISVPSLLAEEG-LIRNSFSICL--NDDESGRIL---------FGDQG 64
           A P GV G G G +S+P+ LA     + NSFS CL  +  +S R+          F D+ 
Sbjct: 222 AEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKK 281

Query: 65  HVTQQSTQFLPVD-------GEFID------------YIVGVERFCVGSFCLKGTGFQAL 105
                +T              EF+             Y V ++   +G   +        
Sbjct: 282 EKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVSLQGISIGKRNIPAPAMLRR 341

Query: 106 ID----------SGASFTYLPHDIYKKVVMEFDKQVNATRETFQQLP----WEYCYDASS 151
           ID          SG +FT LP   Y  VV EFD +V    E   ++        CY  + 
Sbjct: 342 IDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN- 400

Query: 152 QEVVNTPPIKLTFSKNQ---TILIRNPLSTF-----SINEEYTAMCLTVYNSKDDY---- 199
            + V  P + L F+ N+   T+  RN    F        E+    CL + N  D+     
Sbjct: 401 -QTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRG 459

Query: 200 ---VTIGQNYLRGYRLVFDRENLRFGWSRSNC 228
                +G    +G+ +V+D  N R G+++  C
Sbjct: 460 GTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 491