Miyakogusa Predicted Gene

Lj5g3v1203340.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v1203340.1 Non Characterized Hit- tr|I3SQ74|I3SQ74_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,99.55,0,seg,NULL;
BASIC 7S GLOBULIN-RELATED,NULL; ASPARTYL PROTEASES,Peptidase A1; Acid
proteases,Peptidase ,CUFF.54970.1
         (440 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G03220.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   397   e-111
AT1G03230.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   374   e-104
AT5G19120.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   175   5e-44
AT5G19110.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   160   2e-39
AT5G19100.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   159   3e-39
AT5G48430.1 | Symbols:  | Eukaryotic aspartyl protease family pr...   152   3e-37
AT3G52500.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    80   2e-15
AT1G09750.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    70   3e-12
AT3G25700.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    70   3e-12
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty...    70   4e-12
AT2G39710.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    68   1e-11
AT2G03200.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    67   2e-11
AT2G42980.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    66   4e-11
AT1G01300.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    65   1e-10
AT5G10770.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    64   2e-10
AT1G79720.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    64   3e-10
AT3G54400.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    62   6e-10
AT3G12700.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    62   6e-10
AT2G35615.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    62   1e-09
AT3G02740.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    60   3e-09
AT1G25510.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    59   8e-09
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil...    57   2e-08
AT3G18490.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    57   3e-08
AT5G36260.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    56   6e-08
AT1G64830.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    55   9e-08
AT3G59080.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    54   2e-07
AT5G22850.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    53   5e-07
AT4G33490.2 | Symbols:  | Eukaryotic aspartyl protease family pr...    49   5e-06
AT4G33490.1 | Symbols:  | Eukaryotic aspartyl protease family pr...    49   8e-06

>AT1G03220.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:787143-788444 FORWARD LENGTH=433
          Length = 433

 Score =  397 bits (1021), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 220/431 (51%), Positives = 279/431 (64%), Gaps = 38/431 (8%)

Query: 27  AQTSFRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVS 86
           AQT FRPKAL+LP+TKD   S  QY T I QRTPLVP  +  DLGG  LWV+C+ + YVS
Sbjct: 22  AQTPFRPKALLLPVTKD--QSTLQYTTVINQRTPLVPASVVFDLGGRELWVDCD-KGYVS 78

Query: 87  STFKPARCGSSQCSLFGLTGCS----------GDKICGRSPSNTVTGVSSYGDIHSDVVS 136
           ST++  RC S+ CS  G T C            +  CG  P NTVTG ++ G+   DVVS
Sbjct: 79  STYQSPRCNSAVCSRAGSTSCGTCFSPPRPGCSNNTCGGIPDNTVTGTATSGEFALDVVS 138

Query: 137 VNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHR 196
           + ST+G+ P +VV +PN +F CG+  +  GLAKG  GMAG+GR  + LPSQF++AFSFHR
Sbjct: 139 IQSTNGSNPGRVVKIPNLIFDCGATFLLKGLAKGTVGMAGMGRHNIGLPSQFAAAFSFHR 198

Query: 197 KFAICLTANSGADGVMFFGDGPYNLNQDVS-KVLTYTPLITNPVSTAPSAFLGEPSVEYF 255
           KFA+CLT+     GV FFG+GPY     +    L  TPL+ NPVSTA +   GE S EYF
Sbjct: 199 KFAVCLTS---GKGVAFFGNGPYVFLPGIQISSLQTTPLLINPVSTASAFSQGEKSSEYF 255

Query: 256 IGVKSIKVSEKNVPLNTTLLSINKN-GVGGTKISTVNPYTVMETTIYKAVADAFVKSLGA 314
           IGV +I++ EK VP+N TLL IN + G+GGTKIS+VNPYTV+E++IY A    FVK   A
Sbjct: 256 IGVTAIQIVEKTVPINPTLLKINASTGIGGTKISSVNPYTVLESSIYNAFTSEFVKQAAA 315

Query: 315 PTVSPVA---PFGTCFATKDISFSRIGPGVPAIDLVLQN-GVEWPIIGANSMVQF-DDVI 369
            ++  VA   PFG CF+TK++  +R+G  VP I+LVL +  V W I GANSMV   DDVI
Sbjct: 316 RSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHSKDVVWRIFGANSMVSVSDDVI 375

Query: 370 CLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGFRSLFL-EH 428
           CLGFVD G N +              TS+ IG  QLE+NL++FDLA+++ GF S  L   
Sbjct: 376 CLGFVDGGVNAR--------------TSVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQ 421

Query: 429 DNCQNFRFTSS 439
            NC NF FTS+
Sbjct: 422 TNCANFNFTST 432


>AT1G03230.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:790110-791414 FORWARD LENGTH=434
          Length = 434

 Score =  374 bits (961), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 209/431 (48%), Positives = 272/431 (63%), Gaps = 38/431 (8%)

Query: 27  AQTSFRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVS 86
           AQ SFRPKAL+LP+TKD   S  QY T I QRTPLVP  +  DLGG   WV+C+ + YVS
Sbjct: 23  AQPSFRPKALLLPVTKD--PSTLQYTTVINQRTPLVPASVVFDLGGREFWVDCD-QGYVS 79

Query: 87  STFKPARCGSSQCSLFGLTGCS----------GDKICGRSPSNTVTGVSSYGDIHSDVVS 136
           +T++  RC S+ CS  G   C            +  CG  P N++TG ++ G+   DVVS
Sbjct: 80  TTYRSPRCNSAVCSRAGSIACGTCFSPPRPGCSNNTCGAFPDNSITGWATSGEFALDVVS 139

Query: 137 VNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHR 196
           + ST+G+ P + V +PN +F CGS  +  GLAKG  GMAG+GR  + LP QF++AFSF+R
Sbjct: 140 IQSTNGSNPGRFVKIPNLIFSCGSTSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNR 199

Query: 197 KFAICLTANSGADGVMFFGDGPYNLNQDVS-KVLTYTPLITNPVSTAPSAFLGEPSVEYF 255
           KFA+CLT+     GV FFG+GPY     +    L  TPL+ NP +T      GE S EYF
Sbjct: 200 KFAVCLTS---GRGVAFFGNGPYVFLPGIQISRLQKTPLLINPGTTVFEFSKGEKSPEYF 256

Query: 256 IGVKSIKVSEKNVPLNTTLLSINKN-GVGGTKISTVNPYTVMETTIYKAVADAFVKSLGA 314
           IGV +IK+ EK +P++ TLL IN + G+GGTKIS+VNPYTV+E++IYKA    F++   A
Sbjct: 257 IGVTAIKIVEKTLPIDPTLLKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQAAA 316

Query: 315 PTVSPVA---PFGTCFATKDISFSRIGPGVPAIDLVLQN-GVEWPIIGANSMVQF-DDVI 369
            ++  VA   PFG CF+TK++  +R+G  VP I LVL +  V W I GANSMV   DDVI
Sbjct: 317 RSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDVI 376

Query: 370 CLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGFRSLFL-EH 428
           CLGFVD G NP A              S+ IG  QLE+NL++FDLA+++ GF S  L   
Sbjct: 377 CLGFVDGGVNPGA--------------SVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQ 422

Query: 429 DNCQNFRFTSS 439
            NC NF FTS+
Sbjct: 423 TNCANFNFTST 433


>AT5G19120.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:6414585-6415745 FORWARD LENGTH=386
          Length = 386

 Score =  175 bits (444), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 138/406 (33%), Positives = 197/406 (48%), Gaps = 64/406 (15%)

Query: 27  AQTSFRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVS 86
           +Q S     +V P+ KD+ +   QY+ QI+      PVKL +DL G  LW +C +R   S
Sbjct: 23  SQISDSVNGVVFPVVKDLPTG--QYLAQIRLGDSPDPVKLVVDLAGSILWFDCSSRHVSS 80

Query: 87  ST---------FKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSV 137
           S             A+ G+ + S    +    +  C     N   G+++ G++ SDV+SV
Sbjct: 81  SRNLISGSSSGCLKAKVGNERVSSSSSSRKDQNADCELLVKNDAFGITARGELFSDVMSV 140

Query: 138 NSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRK 197
            S   T+P  V    + LF C    +  GLA G  G+ GLGR ++SLPSQ ++  +  R+
Sbjct: 141 GSV--TSPGTV----DLLFACTPPWLLRGLASGAQGVMGLGRAQISLPSQLAAETNERRR 194

Query: 198 FAICLTANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIG 257
             + L   S  +GV+             S+ L YTPL+T              S  Y I 
Sbjct: 195 LTVYL---SPLNGVVSTSSVEEVFGVAASRSLVYTPLLTG------------SSGNYVIN 239

Query: 258 VKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLG-APT 316
           VKSI+V+ + + +   L           ++STV PYT++E++IYK  A+A+ K+ G A +
Sbjct: 240 VKSIRVNGEKLSVEGPL---------AVELSTVVPYTILESSIYKVFAEAYAKAAGEATS 290

Query: 317 VSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNG-VEWPIIGANSMVQFDDVICLGFVD 375
           V PVAPFG CF T D+ F       PA+DL LQ+  V W I G N M           VD
Sbjct: 291 VPPVAPFGLCF-TSDVDF-------PAVDLALQSEMVRWRIHGKNLM-----------VD 331

Query: 376 AGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
            G   + S  G V+GGS  V  I +G  QLE  +L FDL  S +GF
Sbjct: 332 VGGGVRCS--GIVDGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGF 375


>AT5G19110.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:6411720-6413170 REVERSE LENGTH=405
          Length = 405

 Score =  160 bits (404), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 132/411 (32%), Positives = 192/411 (46%), Gaps = 49/411 (11%)

Query: 37  VLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVSSTFKPARCGS 96
           +LPITK   ++L  Y T         PV L LDLG    W++C   + +SS  +   C S
Sbjct: 27  LLPITKHEPTNL-FYTTFNVGSAAKSPVNLLLDLGTNLTWLDCRKLKSLSS-LRLVTCQS 84

Query: 97  SQCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLF 156
           S C      GC+G     + P+         G +  D  S+ +TDG      VSV +F F
Sbjct: 85  STCKSIPGNGCAGKSCLYKQPNPLGQNPVVTGRVVQDRASLYTTDGGKFLSQVSVRHFTF 144

Query: 157 ICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICL----TANSGADGVM 212
            C  +    GL   V G+  L     S   Q +SAF+   KF++CL    T +    G+ 
Sbjct: 145 SCAGEKALQGLPPPVDGVLALSPGSSSFTKQVTSAFNVIPKFSLCLPSSGTGHFYIAGIH 204

Query: 213 FFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNT 272
           +F   P+N +              NP+    +   G  S +Y I VKSI V    + LN 
Sbjct: 205 YFIP-PFNSSD-------------NPIPRTLTPIKGTDSGDYLITVKSIYVGGTALKLNP 250

Query: 273 TLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAF---VKSLGAPTVSPVAPFGTCFAT 329
            LL+      GG K+STV  YTV++T IY A+A +F    K++G   V  VAPF  CF +
Sbjct: 251 DLLT------GGAKLSTVVHYTVLQTDIYNALAQSFTLKAKAMGIAKVPSVAPFKHCFDS 304

Query: 330 KDISFS-RIGPGVPAIDLVLQ---NGVEWPIIGANSMVQFDD-VICLGFVDAGSNPKASQ 384
           +    +   GP VP I++ L      V+W   GAN++V+  + V+CL F+D G  PK   
Sbjct: 305 RTAGKNLTAGPNVPVIEIGLPGRIGEVKWGFYGANTVVKVKETVMCLAFIDGGKTPKDLM 364

Query: 385 VGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF-RSLFLEHDNCQNF 434
           V              IG HQL++++L+FD + + L F  SL L + +C  +
Sbjct: 365 V--------------IGTHQLQDHMLEFDFSGTVLAFSESLLLHNTSCSTW 401


>AT5G19100.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:6408242-6409417 REVERSE LENGTH=391
          Length = 391

 Score =  159 bits (402), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 124/361 (34%), Positives = 173/361 (47%), Gaps = 58/361 (16%)

Query: 65  KLTLDLGGGY-LWVNCENRQYVSSTFKPARCGSSQCSLFGLT-GCSGDKICGRSPSNTVT 122
           K  LDL G   L  NC      S+T+ P RCGS++C        C  + I  +    TV 
Sbjct: 56  KFVLDLNGAAPLLQNCPTAA-KSTTYHPIRCGSTRCKYANPNFPCPNNVIAKK---RTVC 111

Query: 123 GVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRV 182
             S    +  D V +  T     T+   + + L +       +G         GL  T +
Sbjct: 112 LSSDNSRLFRDTVPLLYTFNGVYTRDSEMSSSLTL----TCTDGAPALKQRTIGLANTHL 167

Query: 183 SLPSQFSSAFSFHRKFAICLTANSGA---DGVMFFGDGPYNL---NQDVSKVLTYTPLIT 236
           S+PSQ  S +    K A+CL +   +   +G ++ G G Y     ++DVSK+   TPLI 
Sbjct: 168 SIPSQLISMYQLPHKIALCLPSTERSQSHNGDLWIGKGEYYYLPYDKDVSKIFASTPLIG 227

Query: 237 NPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVM 296
           N             S EY I VKSI++  K VP+            G TKIST+ PYTV 
Sbjct: 228 N-----------GKSGEYLIDVKSIQIGAKTVPI----------PYGATKISTLAPYTVF 266

Query: 297 ETTIYKAVADAFVKSLGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPI 356
           +T++YKA+  AF +++       V PFG CF      +S  G GVP IDLVL  G +W I
Sbjct: 267 QTSLYKALLTAFTENIKIAKAPAVKPFGACF------YSNGGRGVPVIDLVLSGGAKWRI 320

Query: 357 IGANSMVQFD-DVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLA 415
            G+NS+V+ + +V+CLGFVD G  PK           +P   I IG  Q+E+NL++FDL 
Sbjct: 321 YGSNSLVKVNKNVVCLGFVDGGVKPK-----------YP---IVIGGFQMEDNLVEFDLE 366

Query: 416 A 416
           A
Sbjct: 367 A 367


>AT5G48430.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:19627892-19629112 REVERSE LENGTH=406
          Length = 406

 Score =  152 bits (385), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 131/426 (30%), Positives = 200/426 (46%), Gaps = 67/426 (15%)

Query: 31  FRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCEN---RQYVSS 87
           + PKALV  ++K+    +  +     Q       +  + +GG YL   C +   R  V  
Sbjct: 24  YPPKALVSTVSKNTILPIFTFTLNTNQ-------EFFIHIGGPYLVRKCNDGLPRPIVP- 75

Query: 88  TFKPARCGSSQCSL---FGLTGCS--GDKICGRSPSNTVTGVSSYGDI-HSDV-----VS 136
                 CGS  C+L   F    CS   +KI     +   T    +  I +SD      +S
Sbjct: 76  ------CGSPVCALTRRFTPHQCSLPSNKIINGVCACQATAFEPFQRICNSDQFTYGDLS 129

Query: 137 VNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSS-AFSFH 195
           ++S    +P+  V++ N  ++C  +        GV G+AGL  T ++  +Q +       
Sbjct: 130 ISSLKPISPS--VTINNVYYLCIPQPFLVDFPPGVFGLAGLAPTALATWNQLTRPRLGLE 187

Query: 196 RKFAICLTANSG--ADGVMFFGDGPYNL-NQDVSKVLTYTPLITNPVSTAPSAFLGEPSV 252
           +KFA+CL ++      G ++FG GPY L N D   +L+YT LITNP              
Sbjct: 188 KKFALCLPSDENPLKKGAIYFGGGPYKLRNIDARSMLSYTRLITNPRKLN---------- 237

Query: 253 EYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSL 312
            YF+G+K I V+   +       + ++NG GG  +ST+ P+T++ + IY+   +AF ++ 
Sbjct: 238 NYFLGLKGISVNGNRILFAPNAFAFDRNGDGGVTLSTIFPFTMLRSDIYRVFIEAFSQAT 297

Query: 313 -GAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVIC 370
            G P VS   PF  C +T   +F      VP IDL L NGV W +  AN+M +  DDV C
Sbjct: 298 SGIPRVSSTTPFEFCLSTTT-NFQ-----VPRIDLELANGVIWKLSPANAMKKVSDDVAC 351

Query: 371 LGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF-RSLFLEHD 429
           L               FVNGG     ++ IG HQ+EN L++FD+  S  GF  SL L   
Sbjct: 352 L--------------AFVNGGDAAAQAVMIGIHQMENTLVEFDVGRSAFGFSSSLGLVSA 397

Query: 430 NCQNFR 435
           +C +F+
Sbjct: 398 SCGDFQ 403


>AT3G52500.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:19465644-19467053 REVERSE LENGTH=469
          Length = 469

 Score = 80.5 bits (197), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 105/406 (25%), Positives = 167/406 (41%), Gaps = 84/406 (20%)

Query: 59  TPLVPVKLTLDLGGGYLWVNCENRQYVS--------------------STFKPARCGSSQ 98
           TP   +    D G   +W+ C +R   S                    S+ K   C S +
Sbjct: 98  TPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPK 157

Query: 99  CS-LFGLT-GCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFL- 155
           C  L+G    C G   C  +  N   G   Y   +     + ST G   T+ +  P+   
Sbjct: 158 CQFLYGPNVQCRG---CDPNTRNCTVGCPPYILQYG----LGSTAGVLITEKLDFPDLTV 210

Query: 156 --FICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTA------NSG 207
             F+ G  ++     +   G+AG GR  VSLPSQ +      ++F+ CL +      N  
Sbjct: 211 PDFVVGCSIIS---TRQPAGIAGFGRGPVSLPSQMN-----LKRFSHCLVSRRFDDTNVT 262

Query: 208 ADGVMFFGDGPYNLNQDVSKV--LTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSE 265
            D  +  G G    +   SK   LTYTP   NP + +  AFL      Y++ ++ I V  
Sbjct: 263 TDLDLDTGSG----HNSGSKTPGLTYTPFRKNP-NVSNKAFL----EYYYLNLRRIYVGR 313

Query: 266 KNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPT----VSPVA 321
           K+V +    L+   NG GG+ + + + +T ME  +++ VA+ F   +   T    +    
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373

Query: 322 PFGTCF---ATKDISFSRIGPGVPAIDLVLQNG--VEWPIIGANSMVQFDDVICLGFV-D 375
             G CF      D++       VP +    + G  +E P+    + V   D +CL  V D
Sbjct: 374 GLGPCFNISGKGDVT-------VPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSD 426

Query: 376 AGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
              NP        +GG+ P  +I +G+ Q +N L+++DL   R GF
Sbjct: 427 KTVNP--------SGGTGP--AIILGSFQQQNYLVEYDLENDRFGF 462


>AT1G09750.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:3157541-3158960 FORWARD LENGTH=449
          Length = 449

 Score = 70.1 bits (170), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 101/404 (25%), Positives = 164/404 (40%), Gaps = 59/404 (14%)

Query: 32  RPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVSST--- 88
           +PK   +P+       +  Y+ + K  TP   + + LD     +W+ C      S+    
Sbjct: 85  KPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTS 144

Query: 89  --------FKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVS-SYGDIHSDVVSVNS 139
                   +    C ++QC     T   G      SP  +V   + SYG   S   S+  
Sbjct: 145 FNTNSSSTYSTVSCSTAQC-----TQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQ 199

Query: 140 TDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFA 199
              T    V+  PNF F C +    N L     G+ GLGR  +SL SQ +S +S    F+
Sbjct: 200 DTLTLAPDVI--PNFSFGCINSASGNSLPP--QGLMGLGRGPMSLVSQTTSLYS--GVFS 253

Query: 200 ICLTANSGADGVMFFGDGPYNLNQ-DVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGV 258
            CL +        F+  G   L      K + YTPL+ NP           PS+ Y++ +
Sbjct: 254 YCLPSFRS-----FYFSGSLKLGLLGQPKSIRYTPLLRNP---------RRPSL-YYVNL 298

Query: 259 KSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPTVS 318
             + V    VP++   L+ + N   GT I +    T     +Y+A+ D F K +   + S
Sbjct: 299 TGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFS 358

Query: 319 PVAPFGTCFATKDISFS-RIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDVICLGFVDAG 377
            +  F TCF+  + + + +I   + ++DL L   +E  +I +++      + CL    AG
Sbjct: 359 TLGAFDTCFSADNENVAPKITLHMTSLDLKLP--MENTLIHSSA----GTLTCLSM--AG 410

Query: 378 SNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
               A+ V  V           I   Q +N  + FD+  SR+G 
Sbjct: 411 IRQNANAVLNV-----------IANLQQQNLRILFDVPNSRIGI 443


>AT3G25700.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:9358937-9360295 FORWARD LENGTH=452
          Length = 452

 Score = 70.1 bits (170), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 95/409 (23%), Positives = 161/409 (39%), Gaps = 62/409 (15%)

Query: 39  PITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVS------------ 86
           P+     S   QY   ++   P   + L  D G   +WV C   +  S            
Sbjct: 72  PVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRH 131

Query: 87  -STFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVS---SYGDIHSDVVSVNSTD- 141
            STF PA C    C L  +       IC  +  ++        + G + S + +  +T  
Sbjct: 132 SSTFSPAHCYDPVCRL--VPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSL 189

Query: 142 GTTPTKVVSVPNFLFICGSKVVQNGLA----KGVTGMAGLGRTRVSLPSQFSSAFSFHRK 197
            T+  K   + +  F CG ++    ++     G  G+ GLGR  +S  SQ    F    K
Sbjct: 190 KTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFG--NK 247

Query: 198 FAICL---TANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEY 254
           F+ CL   T +      +  G+G       +SK L +TPL+TNP+S         P+  Y
Sbjct: 248 FSYCLMDYTLSPPPTSYLIIGNG----GDGISK-LFFTPLLTNPLS---------PTF-Y 292

Query: 255 FIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGA 314
           ++ +KS+ V+   + ++ ++  I+ +G GGT + +      +    Y++V  A  + +  
Sbjct: 293 YVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKL 352

Query: 315 PTVSPVAP-FGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDVI-CLG 372
           P    + P F  C     +  ++    +P +      G  +     N  ++ ++ I CL 
Sbjct: 353 PIADALTPGFDLCVNVSGV--TKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLA 410

Query: 373 FVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
                 +PK   VGF            IG    +  L +FD   SRLGF
Sbjct: 411 IQSV--DPK---VGFS----------VIGNLMQQGFLFEFDRDRSRLGF 444


>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
           protease family protein | chr5:435322-436683 FORWARD
           LENGTH=453
          Length = 453

 Score = 69.7 bits (169), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 154/392 (39%), Gaps = 74/392 (18%)

Query: 64  VKLTLDLGGGYLWVNCENRQ-----------YVSSTFKPARCGSSQC-----SLFGLTGC 107
           + + +D G    W+ C NR              SS++ P  C S  C            C
Sbjct: 86  ISMVIDTGSELSWLRC-NRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASC 144

Query: 108 SGDKICGRSPSNTVTGVSSYGDIHSDVVSV-NSTDGTTPTKVVSVPNFLFICGSKVVQNG 166
             DK+C  + S      SS G++ +++    NST+ +         N +F C   V  + 
Sbjct: 145 DSDKLCHATLS-YADASSSEGNLAAEIFHFGNSTNDS---------NLIFGCMGSVSGSD 194

Query: 167 LAKGV--TGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGPYNLNQD 224
             +    TG+ G+ R  +S  SQ         KF+ C++      G +  GD     N  
Sbjct: 195 PEEDTKTTGLLGMNRGSLSFISQMGFP-----KFSYCISGTDDFPGFLLLGDS----NFT 245

Query: 225 VSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGG 284
               L YTPLI   +ST    F     V Y + +  IKV+ K +P+  ++L  +  G G 
Sbjct: 246 WLTPLNYTPLIR--ISTPLPYF---DRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQ 300

Query: 285 TKISTVNPYTVMETTIYKAVADAFV-KSLGAPTVSPVAPF---GTCFATKDISFSRIGPG 340
           T + +   +T +   +Y A+   F+ ++ G  TV     F   GT      IS  RI  G
Sbjct: 301 TMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSG 360

Query: 341 V----PAIDLVLQNGVEWPIIGANSMVQF-------DDVICLGFVDAGSNPKASQVGFVN 389
           +    P + LV + G E  + G   + +        D V C  F   G++       +V 
Sbjct: 361 ILHRLPTVSLVFE-GAEIAVSGQPLLYRVPHLTVGNDSVYCFTF---GNSDLMGMEAYV- 415

Query: 390 GGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
                     IG H  +N  ++FDL  SR+G 
Sbjct: 416 ----------IGHHHQQNMWIEFDLQRSRIGL 437


>AT2G39710.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:16562051-16563379 REVERSE LENGTH=442
          Length = 442

 Score = 68.2 bits (165), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/379 (23%), Positives = 154/379 (40%), Gaps = 52/379 (13%)

Query: 64  VKLTLDLGGGYLWVNCENRQYVSSTFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTG 123
           + + LD G    W++C+    + S F P        S +    CS   IC     +    
Sbjct: 78  ISMVLDTGSELSWLHCKKSPNLGSVFNPV-----SSSTYSPVPCS-SPICRTRTRDLPIP 131

Query: 124 VSSYGDIHSDVVSVNSTDGTT-------PTKV---VSVPNFLFICGSKVVQNGLAKGV-- 171
            S     H   V+++  D T+        T V   V+ P  LF C    + +   +    
Sbjct: 132 ASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKS 191

Query: 172 TGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGPYNLNQDVSKVLTY 231
           TG+ G+ R  +S  +Q    FS   KF+ C++  S + G +  GD  Y+    +     Y
Sbjct: 192 TGLMGMNRGSLSFVNQL--GFS---KFSYCISG-SDSSGFLLLGDASYSWLGPIQ----Y 241

Query: 232 TPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVN 291
           TPL+   + + P  +     V Y + ++ I+V  K + L  ++   +  G G T + +  
Sbjct: 242 TPLV---LQSTPLPYFDR--VAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGT 296

Query: 292 PYTVMETTIYKAVADAFVKSLGAPTVSPVAPFGTCFATKDISFSRIG-------PGVPAI 344
            +T +   +Y A+ + F+    +       P      T D+ + ++G        G+P +
Sbjct: 297 QFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCY-KVGSTTRPNFSGLPMV 355

Query: 345 DLVLQNGVEWPIIGANSMVQFDDVICLGFVDAGSNPKASQVGFVNGGSH--PVTSITIGA 402
            L+ + G E  + G   + + +         AGS  K     F  G S    + +  IG 
Sbjct: 356 SLMFR-GAEMSVSGQKLLYRVN--------GAGSEGKEEVYCFTFGNSDLLGIEAFVIGH 406

Query: 403 HQLENNLLKFDLAASRLGF 421
           H  +N  ++FDLA SR+GF
Sbjct: 407 HHQQNVWMEFDLAKSRVGF 425


>AT2G03200.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:966506-967891 REVERSE LENGTH=461
          Length = 461

 Score = 67.4 bits (163), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/349 (23%), Positives = 144/349 (41%), Gaps = 56/349 (16%)

Query: 50  QYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYV------------SSTFKPARCGSS 97
           +++ ++    P V     +D G   +W  C+                 SS++    C S 
Sbjct: 106 EFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSG 165

Query: 98  QCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFI 157
            C+    + C+ DK       +    + +YGD +S    + +T+  T     S+    F 
Sbjct: 166 LCNALPRSNCNEDK-------DACEYLYTYGD-YSSTRGLLATETFTFEDENSISGIGFG 217

Query: 158 CGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTA--NSGADGVMFFG 215
           CG +   +G ++G +G+ GLGR  +SL SQ         KF+ CLT+  +S A   +F G
Sbjct: 218 CGVENEGDGFSQG-SGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEASSSLFIG 271

Query: 216 D---GPYN-----LNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKN 267
               G  N     L+ +V+K ++   L+ NP          +PS  Y++ ++ I V  K 
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMS---LLRNP---------DQPSF-YYLELQGITVGAKR 318

Query: 268 VPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPT-VSPVAPFGTC 326
           + +  +   + ++G GG  I +    T +E T +K + + F   +  P   S       C
Sbjct: 319 LSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLC 378

Query: 327 FATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDD--VICLGF 373
           F   D + +     VP +    + G +  + G N MV      V+CL  
Sbjct: 379 FKLPDAAKN---IAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAM 423


>AT2G42980.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:17875005-17876588 REVERSE LENGTH=527
          Length = 527

 Score = 66.2 bits (160), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 101/415 (24%), Positives = 156/415 (37%), Gaps = 63/415 (15%)

Query: 33  PKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNC--------ENRQY 84
           P  L+  +   +T    +Y   +   TP     L LD G    W+ C        +N  +
Sbjct: 142 PGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMF 201

Query: 85  ----VSSTFKPARCGSSQCSLFGL----TGCSGDKICGRSPSNTVTGVSS--YGDIHSDV 134
                S++FK   C   +CSL         C  D      P     G  S   GD   + 
Sbjct: 202 YDPKTSASFKNITCNDPRCSLISSPDPPVQCESDN--QSCPYFYWYGDRSNTTGDFAVET 259

Query: 135 VSVNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSF 194
            +VN T     +    V N +F CG      GL  G +G+ GLGR  +S  SQ  S +  
Sbjct: 260 FTVNLTTTEGGSSEYKVGNMMFGCGH--WNRGLFSGASGLLGLGRGPLSFSSQLQSLYG- 316

Query: 195 HRKFAICL---TANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPS 251
              F+ CL    +N+     + FG+    LN      L +T  +             E S
Sbjct: 317 -HSFSYCLVDRNSNTNVSSKLIFGEDKDLLNH---TNLNFTSFVNGK----------ENS 362

Query: 252 VE--YFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFV 309
           VE  Y+I +KSI V  K + +     +I+ +G GGT I +    +      Y+ + + F 
Sbjct: 363 VETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFA 422

Query: 310 KSLGA--PTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-D 366
           + +    P          CF    I  + I   +P + +   +G  W     NS +   +
Sbjct: 423 EKMKENYPIFRDFPVLDPCFNVSGIEENNI--HLPELGIAFVDGTVWNFPAENSFIWLSE 480

Query: 367 DVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
           D++CL  +                G+   T   IG +Q +N  + +D   SRLGF
Sbjct: 481 DLVCLAIL----------------GTPKSTFSIIGNYQQQNFHILYDTKRSRLGF 519


>AT1G01300.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:117065-118522 FORWARD LENGTH=485
          Length = 485

 Score = 64.7 bits (156), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 106/409 (25%), Positives = 167/409 (40%), Gaps = 71/409 (17%)

Query: 32  RPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE--NRQYVSST- 88
           RP      +   ++    +Y T++   TP   V + LD G   +W+ C    R Y  S  
Sbjct: 123 RPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP 182

Query: 89  -FKPAR--------CGSSQCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNS 139
            F P +        C S  C      GC+  +        T     SYGD  S  V   S
Sbjct: 183 IFDPRKSKTYATIPCSSPHCRRLDSAGCNTRR-------KTCLYQVSYGD-GSFTVGDFS 234

Query: 140 TDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFA 199
           T+ T   +   V      CG      GL  G  G+ GLG+ ++S P Q  +   F++KF+
Sbjct: 235 TE-TLTFRRNRVKGVALGCGHD--NEGLFVGAAGLLGLGKGKLSFPGQ--TGHRFNQKFS 289

Query: 200 ICLTANSGAD--GVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIG 257
            CL   S +     + FG      N  VS++  +TPL++NP          +    Y++G
Sbjct: 290 YCLVDRSASSKPSSVVFG------NAAVSRIARFTPLLSNP----------KLDTFYYVG 333

Query: 258 VKSIKVSEKNVP-LNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPT 316
           +  I V    VP +  +L  +++ G GG  I +    T +    Y A+ DAF   +GA T
Sbjct: 334 LLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVGAKT 391

Query: 317 VSPVAP----FGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDVICLG 372
           +   AP    F TCF   +++  +    VP + L  + G +  +   N ++  D      
Sbjct: 392 LK-RAPDFSLFDTCFDLSNMNEVK----VPTVVLHFR-GADVSLPATNYLIPVDTNGKFC 445

Query: 373 FVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
           F  AG+    S +G +               Q +   + +DLA+SR+GF
Sbjct: 446 FAFAGTMGGLSIIGNI---------------QQQGFRVVYDLASSRVGF 479


>AT5G10770.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:3403331-3405331 REVERSE LENGTH=474
          Length = 474

 Score = 64.3 bits (155), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 132/311 (42%), Gaps = 46/311 (14%)

Query: 32  RPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE--------NRQ 83
             K+  LP     T     YI  +   TP   + L  D G    W  C+         ++
Sbjct: 113 ESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKE 172

Query: 84  YV-----SSTFKPARCGSSQC-SLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSV 137
            +     S+++    C S+ C SL   TG +G   C  S SN + G+  YGD  S  V  
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGS--C--SASNCIYGIQ-YGD-QSFSVGF 226

Query: 138 NSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRK 197
            + +  T T         F CG      GL  GV G+ GLGR ++S PSQ  +A ++++ 
Sbjct: 227 LAKEKFTLTNSDVFDGVYFGCGEN--NQGLFTGVAGLLGLGRDKLSFPSQ--TATAYNKI 282

Query: 198 FAICLTANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIG 257
           F+ CL +++   G + FG      +  +S+ + +TP+ T          + + +  Y + 
Sbjct: 283 FSYCLPSSASYTGHLTFG------SAGISRSVKFTPIST----------ITDGTSFYGLN 326

Query: 258 VKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGA-PT 316
           + +I V  + +P+ +T+ S       G  I +    T +    Y A+  +F   +   PT
Sbjct: 327 IVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPT 381

Query: 317 VSPVAPFGTCF 327
            S V+   TCF
Sbjct: 382 TSGVSILDTCF 392


>AT1G79720.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:29997259-29998951 REVERSE LENGTH=484
          Length = 484

 Score = 63.5 bits (153), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 158/386 (40%), Gaps = 82/386 (21%)

Query: 64  VKLTLDLGGGYLWVNCE------NRQ------YVSSTFKPARCGSSQCS-LFGLTG---- 106
           + L +D G    WV C+      N+Q       VSS++K   C SS C  L   T     
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205

Query: 107 CSGDKICGRSPSNTVTGVSSYGD---IHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVV 163
           C G+    ++P   V    SYGD      D+ S +   G T      + NF+F CG    
Sbjct: 206 CGGNNGVVKTPCEYVV---SYGDGSYTRGDLASESILLGDTK-----LENFVFGCGRN-- 255

Query: 164 QNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICL-TANSGADGVMFFGDGPYNLN 222
             GL  G +G+ GLGR+ VSL SQ  +  +F+  F+ CL +   GA G + FG+      
Sbjct: 256 NKGLFGGSSGLMGLGRSSVSLVSQ--TLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYT 313

Query: 223 QDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGV 282
              S  ++YTPL+ N            P +  F  +     S   V L ++         
Sbjct: 314 NSTS--VSYTPLVQN------------PQLRSFYILNLTGASIGGVELKSSSFGRGILID 359

Query: 283 GGTKISTVNPYTVMETTIYKAVADAFVKSL-GAPTVSPVAPFGTCF---ATKDISFSRIG 338
            GT I+ + P      +IYKAV   F+K   G PT    +   TCF   + +DIS     
Sbjct: 360 SGTVITRLPP------SIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDIS----- 408

Query: 339 PGVPAIDLVLQNGVEWP--IIGANSMVQFD-DVICLGFVDAGSNPKASQVGFVNGGSHPV 395
             +P I ++ Q   E    + G    V+ D  ++CL      S    ++VG         
Sbjct: 409 --IPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLAL---ASLSYENEVGI-------- 455

Query: 396 TSITIGAHQLENNLLKFDLAASRLGF 421
               IG +Q +N  + +D    RLG 
Sbjct: 456 ----IGNYQQKNQRVIYDTTQERLGI 477


>AT3G54400.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:20140291-20142599 REVERSE LENGTH=425
          Length = 425

 Score = 62.4 bits (150), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 71/290 (24%), Positives = 121/290 (41%), Gaps = 39/290 (13%)

Query: 49  PQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVSST--FKPAR--------CGSSQ 98
           P YI +    TP  P+ + LD      W+ C      SS+  F P++        C + Q
Sbjct: 86  PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQ 145

Query: 99  CSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFIC 158
           C       C+  K CG + +   + + +Y  +  D +++ S           +PN+ F C
Sbjct: 146 CKQAPNPSCTVSKSCGFNMTYGGSTIEAY--LTQDTLTLASD---------VIPNYTFGC 194

Query: 159 GSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGP 218
            +K   +G +    G+ GLGR  +SL SQ  S   +   F+ CL  +  ++       GP
Sbjct: 195 INKA--SGTSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLGP 250

Query: 219 YNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSIN 278
            N  Q +   +  TPL+ NP            S  Y++ +  I+V  K V + T+ L+ +
Sbjct: 251 KN--QPIR--IKTTPLLKNP----------RRSSLYYVNLVGIRVGNKIVDIPTSALAFD 296

Query: 279 KNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPTVSPVAPFGTCFA 328
                GT   +   YT +    Y AV + F + +     + +  F TC++
Sbjct: 297 PATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCYS 346


>AT3G12700.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:4037136-4039043 FORWARD LENGTH=461
          Length = 461

 Score = 62.4 bits (150), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 93/392 (23%), Positives = 149/392 (38%), Gaps = 61/392 (15%)

Query: 50  QYITQIKQRTPLVPVKLTLDLGGGYLWVNC-------ENRQYV----SSTFKPARCGSSQ 98
           QY T+I+  TP    ++ +D G    WVNC       +NR+      S +FK   C +  
Sbjct: 105 QYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQT 164

Query: 99  C-----SLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPN 153
           C     +LF LT C               G ++ G    + ++V  T+G    ++  +P 
Sbjct: 165 CKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNG----RMARLPG 220

Query: 154 FLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMF 213
            L  C S        +G  G+ GL  +  S  S  +S +    KF+ CL  +        
Sbjct: 221 HLIGCSSSFTGQSF-QGADGVLGLAFSDFSFTSTATSLYG--AKFSYCLVDHLS------ 271

Query: 214 FGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTT 273
                   N++VS  L +    +   +   +  L    +  F  +  I +S     L+  
Sbjct: 272 --------NKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIP 323

Query: 274 LLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSL-GAPTVSPVA-PFGTCFA-TK 330
               +    GGT + +    T++    YK V     + L     V P   P   CF+ T 
Sbjct: 324 SQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 383

Query: 331 DISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFD-DVICLGFVDAGSNPKASQVGFVN 389
             + S++    P +   L+ G  +     + +V     V CLGFV AG+           
Sbjct: 384 GFNVSKL----PQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGT----------- 428

Query: 390 GGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
               P T++ IG    +N L +FDL AS L F
Sbjct: 429 ----PATNV-IGNIMQQNYLWEFDLMASTLSF 455


>AT2G35615.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr2:14959391-14960734 FORWARD LENGTH=447
          Length = 447

 Score = 61.6 bits (148), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 149/355 (41%), Gaps = 59/355 (16%)

Query: 50  QYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYV------------SSTFKPARCGSS 97
           ++   I   TP + V    D G    WV C+  Q              SST+K   C S 
Sbjct: 84  EFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 143

Query: 98  QCSLFGLT--GC-SGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNF 154
            C     T  GC   + IC    S      S  GD+ ++ VS++S  G+     VS P  
Sbjct: 144 NCQALSSTERGCDESNNICKYRYSYGDQSFSK-GDVATETVSIDSASGSP----VSFPGT 198

Query: 155 LFICGSKVVQNG--LAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICL---TANSGAD 209
           +F CG     NG    +  +G+ GLG   +SL SQ  S+ S  +KF+ CL   +A +   
Sbjct: 199 VFGCG---YNNGGTFDETGSGIIGLGGGHLSLISQLGSSIS--KKFSYCLSHKSATTNGT 253

Query: 210 GVMFFGDG--PYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKN 267
            V+  G    P +L++D   V   TPL+             EP   Y++ +++I V +K 
Sbjct: 254 SVINLGTNSIPSSLSKDSGVV--STPLVDK-----------EPLTYYYLTLEAISVGKKK 300

Query: 268 VPLNTTLLSINKNGV-----GGTKISTVNPYTVMETTIYKAVADAFVKSL-GAPTVS-PV 320
           +P   +  + N +G+     G   I +    T++E   +   + A  +S+ GA  VS P 
Sbjct: 301 IPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQ 360

Query: 321 APFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICLGFV 374
                CF +          G+P I +    G +  +   N+ V+  +D++CL  V
Sbjct: 361 GLLSHCFKSGSAEI-----GLPEITVHF-TGADVRLSPINAFVKLSEDMVCLSMV 409


>AT3G02740.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:590561-593089 FORWARD LENGTH=488
          Length = 488

 Score = 60.1 bits (144), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 98/423 (23%), Positives = 155/423 (36%), Gaps = 92/423 (21%)

Query: 35  ALVLPITKDVT-SSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNC-------------E 80
           A+ +P+  D    S+  Y  +I   TP     + +D G   LWVNC             E
Sbjct: 68  AIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVE 127

Query: 81  NRQY---VSSTFKPARCGSSQCSLFG-LTGCSGDKICGRSPSNTVTGVSSYGD------- 129
              Y    SST K   C  + CS     + C     C          V  YGD       
Sbjct: 128 LTPYDVDASSTAKSVSCSDNFCSYVNQRSECHSGSTCQY--------VIMYGDGSSTNGY 179

Query: 130 -----IHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVVQNGL----AKGVTGMAGLGRT 180
                +H D+V+ N   G+T   ++      F CGSK  Q+G        V G+ G G++
Sbjct: 180 LVKDVVHLDLVTGNRQTGSTNGTII------FGCGSK--QSGQLGESQAAVDGIMGFGQS 231

Query: 181 RVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVS 240
             S  SQ +S     R FA CL  N+G         G + + + VS  +  TP+++    
Sbjct: 232 NSSFISQLASQGKVKRSFAHCLDNNNGG--------GIFAIGEVVSPKVKTTPMLS---- 279

Query: 241 TAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTI 300
                     S  Y + + +I+V    + L++   + +     G  I +      +   +
Sbjct: 280 ---------KSAHYSVNLNAIEVGNSVLELSSN--AFDSGDDKGVIIDSGTTLVYLPDAV 328

Query: 301 YKAVADAFVKSLGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGAN 360
           Y  + +  + S    T+  V    TCF   D    R     P +       V   +    
Sbjct: 329 YNPLLNEILASHPELTLHTVQESFTCFHYTD-KLDRF----PTVTFQFDKSVSLAVYPRE 383

Query: 361 SMVQF-DDVICLGFVDAGSNPKASQVGFVNGGSHPVTSITI-GAHQLENNLLKFDLAASR 418
            + Q  +D  C G+ + G   K        GG+    S+TI G   L N L+ +D+    
Sbjct: 384 YLFQVREDTWCFGWQNGGLQTK--------GGA----SLTILGDMALSNKLVVYDIENQV 431

Query: 419 LGF 421
           +G+
Sbjct: 432 IGW 434


>AT1G25510.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:8959372-8960823 REVERSE LENGTH=483
          Length = 483

 Score = 58.5 bits (140), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 98/403 (24%), Positives = 158/403 (39%), Gaps = 76/403 (18%)

Query: 39  PITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCEN------------RQYVS 86
           P+    T    +Y T++    P   V + LD G    W+ C                  S
Sbjct: 136 PLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSS 195

Query: 87  STFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVSSY--GDIHSDVVSVNSTDGTT 144
           S+++P  C + QC+   ++ C  +  C    S    G  SY  GD  ++ +++ ST    
Sbjct: 196 SSYEPLSCDTPQCNALEVSECR-NATCLYEVSY---GDGSYTVGDFATETLTIGST---- 247

Query: 145 PTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLT- 203
                 V N    CG      GL  G  G+ GLG   ++LPSQ ++       F+ CL  
Sbjct: 248 -----LVQNVAVGCGHS--NEGLFVGAAGLLGLGGGLLALPSQLNTT-----SFSYCLVD 295

Query: 204 ANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKV 263
            +S +   + FG         +S      PL+ N           +    Y++G+  I V
Sbjct: 296 RDSDSASTVDFGTS-------LSPDAVVAPLLRN----------HQLDTFYYLGLTGISV 338

Query: 264 SEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVK-SLGAPTVSPVAP 322
             + + +  +   ++++G GG  I +    T ++T IY ++ D+FVK +L     + VA 
Sbjct: 339 GGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAM 398

Query: 323 FGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDV--ICLGFVDAGSNP 380
           F TC+       ++    VP +      G    +   N M+  D V   CL F      P
Sbjct: 399 FDTCYNLS----AKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFA-----P 449

Query: 381 KASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGFRS 423
            AS +              IG  Q +   + FDLA S +GF S
Sbjct: 450 TASSLAI------------IGNVQQQGTRVTFDLANSLIGFSS 480


>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
           protein | chr5:12594474-12595787 FORWARD LENGTH=437
          Length = 437

 Score = 57.4 bits (137), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 89/352 (25%), Positives = 150/352 (42%), Gaps = 55/352 (15%)

Query: 43  DVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE--NRQY----------VSSTFK 90
           D+TS+  +Y+  +   TP  P+    D G   LW  C   +  Y           SST+K
Sbjct: 82  DLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYK 141

Query: 91  PARCGSSQC-SLFGLTGCS-GDKICGRSPSNTVTGVSSY--GDIHSDVVSVNSTDGTTPT 146
              C SSQC +L     CS  D  C  S S    G +SY  G+I  D +++ S+D    T
Sbjct: 142 DVSCSSSQCTALENQASCSTNDNTCSYSLS---YGDNSYTKGNIAVDTLTLGSSD----T 194

Query: 147 KVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANS 206
           + + + N +  CG         K  +G+ GLG   VSL  Q     S   KF+ CL   +
Sbjct: 195 RPMQLKNIIIGCGHNNA-GTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLT 251

Query: 207 GADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVE--YFIGVKSIKVS 264
                    D    +N   + +++ + +++ P+       + + S E  Y++ +KSI V 
Sbjct: 252 SK------KDQTSKINFGTNAIVSGSGVVSTPL-------IAKASQETFYYLTLKSISVG 298

Query: 265 EKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAP-TVSPVAPF 323
            K +  + +    ++  +    I +    T++ T  Y  + DA   S+ A     P +  
Sbjct: 299 SKQIQYSGSDSESSEGNI---IIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGL 355

Query: 324 GTCF-ATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICLGF 373
             C+ AT D+        VP I +   +G +  +  +N+ VQ  +D++C  F
Sbjct: 356 SLCYSATGDLK-------VPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAF 399


>AT3G18490.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:6349090-6350592 REVERSE LENGTH=500
          Length = 500

 Score = 56.6 bits (135), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 92/401 (22%), Positives = 153/401 (38%), Gaps = 64/401 (15%)

Query: 28  QTSFRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE-----NR 82
            T ++ + L  P+    +    +Y ++I   TP   + L LD G    W+ CE      +
Sbjct: 139 DTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQ 198

Query: 83  Q-------YVSSTFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVV 135
           Q         SST+K   C + QCSL   + C  +K   +          SYGD  S  V
Sbjct: 199 QSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQV---------SYGD-GSFTV 248

Query: 136 SVNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFH 195
              +TD  T      + N    CG      GL  G  G+ GLG   +S+ +Q  +     
Sbjct: 249 GELATDTVTFGNSGKINNVALGCGHD--NEGLFTGAAGLLGLGGGVLSITNQMKAT---- 302

Query: 196 RKFAICLT-ANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEY 254
             F+ CL   +SG    + F                 +  +    +TAP     +    Y
Sbjct: 303 -SFSYCLVDRDSGKSSSLDFN----------------SVQLGGGDATAPLLRNKKIDTFY 345

Query: 255 FIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVK---S 311
           ++G+    V  + V L   +  ++ +G GG  +      T ++T  Y ++ DAF+K   +
Sbjct: 346 YVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVN 405

Query: 312 LGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDD--VI 369
           L   + S ++ F TC+    +S  +    VP +      G    +   N ++  DD    
Sbjct: 406 LKKGS-SSISLFDTCYDFSSLSTVK----VPTVAFHFTGGKSLDLPAKNYLIPVDDSGTF 460

Query: 370 CLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLL 410
           C  F      P +S +  +       T IT   + L  N++
Sbjct: 461 CFAFA-----PTSSSLSIIGNVQQQGTRIT---YDLSKNVI 493


>AT5G36260.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:14285068-14288179 REVERSE LENGTH=482
          Length = 482

 Score = 55.8 bits (133), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 94/423 (22%), Positives = 160/423 (37%), Gaps = 71/423 (16%)

Query: 27  AQTSFRPKALV----LPITKDVTS-SLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE- 80
           +  SFR   ++    LP+  D  + S+  Y T+IK  +P     + +D G   LWVNC  
Sbjct: 49  SHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAP 108

Query: 81  ----------------NRQYVSSTFKPARCGSSQCSLFGLTG-CSGDKICGRSPSNTVTG 123
                                SST K   C    CS    +  C   K C         G
Sbjct: 109 CPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYG-DG 167

Query: 124 VSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVVQNG----LAKGVTGMAGLGR 179
            +S GD   D +++    G   T  ++    +F CG    Q+G        V G+ G G+
Sbjct: 168 STSDGDFIKDNITLEQVTGNLRTAPLA-QEVVFGCGKN--QSGQLGQTDSAVDGIMGFGQ 224

Query: 180 TRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPV 239
           +  S+ SQ ++  S  R F+ CL   +G         G + + +  S V+  TP++ N  
Sbjct: 225 SNTSIISQLAAGGSTKRIFSHCLDNMNGG--------GIFAVGEVESPVVKTTPIVPN-- 274

Query: 240 STAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETT 299
                       V Y + +K + V    + L  +L S   NG GGT I +      +   
Sbjct: 275 -----------QVHYNVILKGMDVDGDPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQN 321

Query: 300 IYKAVADAFVKSLGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGA 359
           +Y ++ +  + +     +  V     CF+      S      P ++L  ++ ++  +   
Sbjct: 322 LYNSLIEK-ITAKQQVKLHMVQETFACFSFT----SNTDKAFPVVNLHFEDSLKLSVYPH 376

Query: 360 NSMVQF-DDVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASR 418
           + +    +D+ C G+   G     +Q G           I +G   L N L+ +DL    
Sbjct: 377 DYLFSLREDMYCFGWQSGG---MTTQDG--------ADVILLGDLVLSNKLVVYDLENEV 425

Query: 419 LGF 421
           +G+
Sbjct: 426 IGW 428


>AT1G64830.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr1:24091271-24092566 REVERSE LENGTH=431
          Length = 431

 Score = 55.1 bits (131), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 143/350 (40%), Gaps = 55/350 (15%)

Query: 44  VTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYV------------SSTFKP 91
           +TS+  +Y+  I   TP VP+    D G   +W  C   +              SST++ 
Sbjct: 79  ITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRK 138

Query: 92  ARCGSSQCSLFGLTGCSGDKICGRSPSNTVT-GVSSY--GDIHSDVVSVNSTDGTTPTKV 148
             C SSQC       CS D+    + S T+T G +SY  GD+  D V++    G++  + 
Sbjct: 139 VSCSSSQCRALEDASCSTDE---NTCSYTITYGDNSYTKGDVAVDTVTM----GSSGRRP 191

Query: 149 VSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICL---TAN 205
           VS+ N +  CG +          +G+ GLG    SL SQ     S + KF+ CL   T+ 
Sbjct: 192 VSLRNMIIGCGHENT-GTFDPAGSGIIGLGGGSTSLVSQLRK--SINGKFSYCLVPFTSE 248

Query: 206 SGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSE 265
           +G    + FG                  +++     + S    +P+  YF+ +++I V  
Sbjct: 249 TGLTSKINFGTN---------------GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGS 293

Query: 266 KNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPTV-SPVAPFG 324
           K +   +T+      G G   I +    T++ +  Y  +      ++ A  V  P     
Sbjct: 294 KKIQFTSTIFG---TGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILS 350

Query: 325 TCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICLGF 373
            C+  +D S  +    VP I +  + G +  +   N+ V   +DV C  F
Sbjct: 351 LCY--RDSSSFK----VPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAF 393


>AT3G59080.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr3:21836812-21838419 FORWARD LENGTH=535
          Length = 535

 Score = 53.9 bits (128), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 98/413 (23%), Positives = 161/413 (38%), Gaps = 67/413 (16%)

Query: 36  LVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNC--------ENRQY--- 84
           LV  +   +T    +Y   +   +P     L LD G    W+ C        +N  +   
Sbjct: 155 LVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDP 214

Query: 85  -VSSTFKPARCGSSQCSLFGLTG----CSGDKICGRSPSNTVTGVSSY--GDIHSDVVSV 137
             S+++K   C   +C+L         C  D      P     G SS   GD   +  +V
Sbjct: 215 KASASYKNITCNDQRCNLVSSPDPPMPCKSDN--QSCPYYYWYGDSSNTTGDFAVETFTV 272

Query: 138 NSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRK 197
           N T     +++ +V N +F CG      GL  G  G+ GLGR  +S  SQ  S +     
Sbjct: 273 NLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYG--HS 328

Query: 198 FAICL---TANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEY 254
           F+ CL    +++     + FG+    L+       ++     N V T            Y
Sbjct: 329 FSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTF-----------Y 377

Query: 255 FIGVKSIKVSEK--NVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSL 312
           ++ +KSI V+ +  N+P  T   +I+ +G GGT I +    +      Y+     F+K+ 
Sbjct: 378 YVQIKSILVAGEVLNIPEET--WNISSDGAGGTIIDSGTTLSYFAEPAYE-----FIKNK 430

Query: 313 GAPTVSPVAPFGTCFATKDISFSRIGPG---VPAIDLVLQNGVEWPIIGANSMVQF-DDV 368
            A       P    F   D  F+  G     +P + +   +G  W     NS +   +D+
Sbjct: 431 IAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDL 490

Query: 369 ICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
           +CL  +     PK++             SI IG +Q +N  + +D   SRLG+
Sbjct: 491 VCLAML---GTPKSA------------FSI-IGNYQQQNFHILYDTKRSRLGY 527


>AT5G22850.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr5:7633717-7636298 REVERSE LENGTH=493
          Length = 493

 Score = 52.8 bits (125), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 90/399 (22%), Positives = 157/399 (39%), Gaps = 72/399 (18%)

Query: 51  YITQIKQRTPLVPVKLTLDLGGGYLWVNCEN---------RQYVSSTFKPARCGSSQCSL 101
           Y T+++  TP     + +D G   LWV+C +          Q   + F P   GSS  + 
Sbjct: 81  YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDP---GSSVTA- 136

Query: 102 FGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFICGSK 161
                CS D+ C     ++ +G S   ++ +         GT+   V  V  F  I GS 
Sbjct: 137 -SPISCS-DQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSS 194

Query: 162 VVQNGLA------------------KGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLT 203
           +V N  A                  + V G+ G G+  +S+ SQ +S     R F+ CL 
Sbjct: 195 LVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLK 254

Query: 204 ANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKV 263
             +G  G++  G       + V   + +TPL+       PS    +P   Y + + SI V
Sbjct: 255 GENGGGGILVLG-------EIVEPNMVFTPLV-------PS----QP--HYNVNLLSISV 294

Query: 264 SEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPTVSPVAPF 323
           + + +P+N ++ S   NG  GT I T      +    Y    +A   ++ + +V PV   
Sbjct: 295 NGQALPINPSVFS-TSNG-QGTIIDTGTTLAYLSEAAYVPFVEAITNAV-SQSVRPVVSK 351

Query: 324 GT-CFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDVICLGFVDAGSNPKA 382
           G  C+       + +G   P + L    G    +   + ++Q ++V              
Sbjct: 352 GNQCYVIT----TSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNV---------GGTAV 398

Query: 383 SQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
             +GF    +  +T   +G   L++ +  +DL   R+G+
Sbjct: 399 WCIGFQRIQNQGIT--ILGDLVLKDKIFVYDLVGQRIGW 435


>AT4G33490.2 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr4:16108781-16110679 REVERSE LENGTH=425
          Length = 425

 Score = 49.3 bits (116), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 85/213 (39%), Gaps = 27/213 (12%)

Query: 35  ALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCEN-----RQYVSSTF 89
           ++V P+  +V   L  Y   I    P  P  L LD G    W+ C+       +     +
Sbjct: 45  SVVFPVHGNVYP-LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 103

Query: 90  KPAR----CGSSQCSLFGLTG---CSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDG 142
           +P+     C    C    L     C   + C         G SS G +  DV S+N T G
Sbjct: 104 QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYE-VEYADGGSSLGVLVRDVFSMNYTQG 162

Query: 143 TTPTKVVSVPNFLFICGSKVVQNGLAKG-VTGMAGLGRTRVSLPSQFSSAFSFHRKFAIC 201
              T     P     CG   +    +   + G+ GLGR +VS+ SQ  S          C
Sbjct: 163 LRLT-----PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHC 217

Query: 202 LTANSGADGVMFFGDGPYNLNQDVSKVLTYTPL 234
           L++  G  G++FFGD  Y    D S+V ++TP+
Sbjct: 218 LSSLGG--GILFFGDDLY----DSSRV-SWTPM 243


>AT4G33490.1 | Symbols:  | Eukaryotic aspartyl protease family
           protein | chr4:16108928-16110670 REVERSE LENGTH=401
          Length = 401

 Score = 48.5 bits (114), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 85/213 (39%), Gaps = 27/213 (12%)

Query: 35  ALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCEN-----RQYVSSTF 89
           ++V P+  +V   L  Y   I    P  P  L LD G    W+ C+       +     +
Sbjct: 42  SVVFPVHGNVYP-LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 100

Query: 90  KPAR----CGSSQCSLFGLTG---CSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDG 142
           +P+     C    C    L     C   + C         G SS G +  DV S+N T G
Sbjct: 101 QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYE-VEYADGGSSLGVLVRDVFSMNYTQG 159

Query: 143 TTPTKVVSVPNFLFICGSKVVQNGLAKG-VTGMAGLGRTRVSLPSQFSSAFSFHRKFAIC 201
              T     P     CG   +    +   + G+ GLGR +VS+ SQ  S          C
Sbjct: 160 LRLT-----PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHC 214

Query: 202 LTANSGADGVMFFGDGPYNLNQDVSKVLTYTPL 234
           L++  G  G++FFGD  Y    D S+V ++TP+
Sbjct: 215 LSSLGG--GILFFGDDLY----DSSRV-SWTPM 240