Miyakogusa Predicted Gene
- Lj4g3v0772170.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0772170.2 tr|G7JCS6|G7JCS6_MEDTR Aspartic proteinase
nepenthesin-1 OS=Medicago truncatula GN=MTR_4g095270
PE=3,85.64,0,seg,NULL; CHLOROPLAST NUCLEIOD DNA-BINDING-RELATED,NULL;
ASPARTYL PROTEASES,Peptidase A1; Asp,Peptid,CUFF.48039.2
(590 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E value
AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 712 0.0
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 688 0.0
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 186 5e-47
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 173 4e-43
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 170 2e-42
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 170 3e-42
AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 149 4e-36
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 146 4e-35
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 144 2e-34
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 140 3e-33
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 135 1e-31
AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family pr... 126 4e-29
AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family pr... 126 5e-29
AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family pr... 124 1e-28
AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family pr... 124 2e-28
AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family pr... 124 2e-28
AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 119 4e-27
AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 116 5e-26
AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 116 6e-26
AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family pr... 115 8e-26
AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 115 1e-25
AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family pr... 113 4e-25
AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 8e-25
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 110 2e-24
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 5e-24
AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 6e-24
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 108 8e-24
AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 108 1e-23
AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 107 2e-23
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 107 2e-23
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 107 3e-23
AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family pr... 107 3e-23
AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 107 3e-23
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 105 1e-22
AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family pr... 105 1e-22
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 105 1e-22
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 104 2e-22
AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family pr... 103 4e-22
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 6e-22
AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 1e-21
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 101 1e-21
AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 5e-21
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 99 7e-21
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 7e-21
AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 98 2e-20
AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 97 3e-20
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 96 6e-20
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 96 7e-20
AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 8e-19
AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family pr... 91 2e-18
AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family pr... 91 3e-18
AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family pr... 88 2e-17
AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 2e-17
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 87 3e-17
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 86 5e-17
AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family pr... 82 1e-15
AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 79 1e-14
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 78 2e-14
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 77 3e-14
AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr... 69 9e-12
AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 65 1e-10
AT1G69100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 64 4e-10
AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family pr... 63 7e-10
AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 63 8e-10
AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family pr... 54 4e-07
AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 50 3e-06
AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 50 5e-06
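The hit summary above can be read programmatically. The following is a minimal sketch, not part of the BLAST output, assuming each summary line ends with the bit score and the E-value as its last two whitespace-separated fields. The helper name parse_hit_line and the 1e-40 cutoff are illustrative assumptions.

```python
def parse_hit_line(line):
    """Split one hit-summary line into (subject_id, bit_score, e_value)."""
    # The description may contain spaces, so split from the right:
    # the last two fields are the bit score and the E-value.
    head, score, evalue = line.rsplit(None, 2)
    subject_id = head.split()[0]
    # Legacy BLAST may print E-values such as "e-100" without a leading digit.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return subject_id, float(score), float(evalue)


# Three lines copied from the summary table above.
hits = [
    "AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 712 0.0",
    "AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 186 5e-47",
    "AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 50 5e-06",
]

parsed = [parse_hit_line(h) for h in hits]

# Keep only hits at or below an (arbitrary, illustrative) E-value cutoff.
strong = [sid for sid, score, e in parsed if e <= 1e-40]
print(strong)
```

Splitting from the right keeps the parser independent of how the description is truncated ("pr...", "asparty...", "famil..."), which varies from line to line in the table.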
>AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:17299264-17302718 FORWARD LENGTH=631
Length = 631
Score = 712 bits (1838), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/551 (61%), Positives = 426/551 (77%), Gaps = 9/551 (1%)
Query: 41 MVLPL-YLSASNSSTSELDPRRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQ 99
M+ PL Y S E RR+LH S+ PNA M+L+DDLL NGYYTTRLWIGTPPQ
Sbjct: 31 MIFPLSYSSLPPRPRVEDFRRRRLHQSQ---LPNAHMKLYDDLLSNGYYTTRLWIGTPPQ 87
Query: 100 MFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDDDRMQCVYE 159
FALIVDTGSTVTYVPCS+C+QCG+HQDPKFQP+LS++YQ +KC DCNCDD+ CVYE
Sbjct: 88 EFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCNPDCNCDDEGKLCVYE 147
Query: 160 RQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLS 219
R+YAEMS+SSGVL ED+ISFGN+S+L+PQRAVFGCEN ETGDL+SQ ADGIMGLGRG LS
Sbjct: 148 RRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLS 207
Query: 220 IMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEI 279
++DQLVDK V+ D FSLCYGGM+VGGGAMVLG ISPP MVF++SDP RSPYYNIDLK++
Sbjct: 208 VVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQM 267
Query: 280 HVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNY 339
HVAGK L LN VF+GKHGTVLDSGTTYAY P+ AF A KDA++KE+ SL +I GPDPNY
Sbjct: 268 HVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNY 327
Query: 340 HDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKD 399
D+CFSGAG DV++I FP + M FGNGQK LSPENY+FRH+KVRGAYCLG+F + +D
Sbjct: 328 DDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPD-RD 386
Query: 400 PTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAELWERLQTSVAPPQMAPNTEVRNSTKAL 459
TTLLGGIVVRNTLVTYDRE K+GF KTNC+++W RL +P +P ++ ++S +
Sbjct: 387 STTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDIWRRLAAPESPAPTSPISQNKSSNISP 446
Query: 460 APSVAP-PVSQHHVPAGELKIAQITIAISFNISYVDMKPHITELAGLMAHELDVNTSQVH 518
+P+ + P S H+P G ++ IT +S +++ +KP +E+A +AHELD+ ++QV
Sbjct: 447 SPATSESPTS--HLP-GVFRVGVITFEVSISVNNSSLKPKFSEIADFIAHELDIQSAQVR 503
Query: 519 LLNFTSFGNGSLSRWAITPRPNADYISDATATRIIAQLSEHHIQLPGTFGGYKLIDWNVE 578
LLNF+S GN +W + P +++YIS+ TA I+ L E+ ++LPG FG YKL++W E
Sbjct: 504 LLNFSSSGNEYRLKWGVFPPQSSEYISNTTALNIMLLLKENRLRLPGQFGSYKLLEWKAE 563
Query: 579 PAPKWNWWQQY 589
K +WW+++
Sbjct: 564 QKKKQSWWEKH 574
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 688 bits (1776), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/550 (61%), Positives = 419/550 (76%), Gaps = 24/550 (4%)
Query: 41 MVLPLYLSASNSSTSELD-PRRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQ 99
MV PL+LS NSS+ + P R+LH S+SK P++RMRL+DDLL+NGYYTTRLWIGTPPQ
Sbjct: 45 MVFPLFLSQPNSSSRSISIPHRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQ 104
Query: 100 MFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDDDRMQCVYE 159
MFALIVD+GSTVTYVPCS CEQCG+HQDPKFQP++SSTYQPVKC +DCNCDDDR QCVYE
Sbjct: 105 MFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMDCNCDDDREQCVYE 164
Query: 160 RQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLS 219
R+YAE S+S GVLGED+ISFGN+S+L PQRAVFGCE VETGDLYSQ ADGI+GLG+GDLS
Sbjct: 165 REYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLS 224
Query: 220 IMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEI 279
++DQLVDK ++S+SF LCYGGMDVGGG+M+LGG P+DMVF SDP RSPYYNIDL I
Sbjct: 225 LVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGI 284
Query: 280 HVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNY 339
VAGK+L L+S VFDG+HG VLDSGTTYAYLP+AAF AF++A+++E+ +L QI GPDPN+
Sbjct: 285 RVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNF 344
Query: 340 HDICFSGAGTD-VSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGK 398
D CF A ++ VS++SK FP V+MVF +GQ + LSPENYMFRHSKV GAYCLGVF NGK
Sbjct: 345 KDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGK 404
Query: 399 DPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAELWERLQTSVAPPQMAPNTEVRNSTKA 458
D TTLLGGIVVRNTLV YDRE +K+GFW+TNC+EL +RL APP
Sbjct: 405 DHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSDRLHIDGAPP-------------- 450
Query: 459 LAPSVAP------PVSQHHVPAGELKIAQITIAISFNISYVDMKPHITELAGLMAHELDV 512
P+ P + +G ++ QI + I ++ +KP I +L+ + + ELDV
Sbjct: 451 --PATLPSNDSNPSHNSSSNLSGVTQVGQINLDIQLTVNSSYLKPRIEDLSKIFSKELDV 508
Query: 513 NTSQVHLLNFTSFGNGSLSRWAITPRPNADYISDATATRIIAQLSEHHIQLPGTFGGYKL 572
+SQV L N TS GN SL R + P + + S+ TAT I+++ + H I+LP FG Y+L
Sbjct: 509 KSSQVSLSNLTSKGNESLVRMVVLPPEPSTWFSNVTATNIVSRFTNHQIKLPEIFGNYQL 568
Query: 573 IDWNVEPAPK 582
+++ +EP K
Sbjct: 569 VNYKLEPPRK 578
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 186 bits (471), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 125/377 (33%), Positives = 191/377 (50%), Gaps = 35/377 (9%)
Query: 81 DLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPK-----FQPDLS 135
D + G Y T+L +GTPP+ F + VDTGS V +V C+SC C + + F P S
Sbjct: 74 DPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSS 133
Query: 136 STYQPVKC----------TLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGN--QS 183
T P+ C + D C C Y QY + S +SG DV+ F S
Sbjct: 134 VTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGS 193
Query: 184 ELAPQR---AVFGCENVETGDLYS--QHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY 238
L P VFGC +TGDL + DGI G G+ +S++ QL + + FS C
Sbjct: 194 SLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 239 GGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKH- 297
G + GGG +VLG I P +MVF P + P+YN++L I V G+ LP+N +VF +
Sbjct: 254 KGENGGGGILVLGEIVEP-NMVFTPLVPSQ-PHYNVNLLSISVNGQALPINPSVFSTSNG 311
Query: 298 -GTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISK 356
GT++D+GTT AYL EAA+ F +AI +++Q P + + C+ + +
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITN---AVSQSVRPVVSKGNQCY----VITTSVGD 364
Query: 357 SFPVVDMVFGNGQKYSLSPENYMFRHSKVRG--AYCLGVFQNGKDPTTLLGGIVVRNTLV 414
FP V + F G L+P++Y+ + + V G +C+G + T+LG +V+++ +
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIF 424
Query: 415 TYDREQTKIGFWKTNCA 431
YD +IG+ +C+
Sbjct: 425 VYDLVGQRIGWANYDCS 441
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 173 bits (438), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 126/381 (33%), Positives = 194/381 (50%), Gaps = 45/381 (11%)
Query: 81 DLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQC----GRHQDPKF------ 130
D L G Y T++ +G+PP F + +DTGS + +V CSSC C G D F
Sbjct: 93 DPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGS 152
Query: 131 ---------QPDLSSTYQPVKCTLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFG- 180
P SS +Q T C ++ QC Y +Y + S +SG D F
Sbjct: 153 LTAGSVTCSDPICSSVFQ----TTAAQCSENN-QCGYSFRYGDGSGTSGYYMTDTFYFDA 207
Query: 181 --NQSELAPQRA--VFGCENVETGDLYS--QHADGIMGLGRGDLSIMDQLVDKNVVSDSF 234
+S +A A VFGC ++GDL + DGI G G+G LS++ QL + + F
Sbjct: 208 ILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVF 267
Query: 235 SLCYGGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFD 294
S C G GGG VLG I P MV++ P + P+YN++L I V G+ LPL++ VF+
Sbjct: 268 SHCLKGDGSGGGVFVLGEILVP-GMVYSPLVPSQ-PHYNLNLLSIGVNGQMLPLDAAVFE 325
Query: 295 GKH--GTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVS 352
+ GT++D+GTT YL + A+ F +AI S++Q+ P + + C+ + +
Sbjct: 326 ASNTRGTIVDTGTTLTYLVKEAYDLFLNAISN---SVSQLVTPIISNGEQCYLVS----T 378
Query: 353 QISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGA--YCLGVFQNGKDPTTLLGGIVVR 410
IS FP V + F G L P++Y+F + GA +C+G FQ + T+LG +V++
Sbjct: 379 SISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG-FQKAPEEQTILGDLVLK 437
Query: 411 NTLVTYDREQTKIGFWKTNCA 431
+ + YD + +IG+ +C+
Sbjct: 438 DKVFVYDLARQRIGWASYDCS 458
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 170 bits (431), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 191/374 (51%), Gaps = 45/374 (12%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQC----GRHQDPKF------------- 130
Y T++ +G+PP F + +DTGS + +V CSSC C G D F
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164
Query: 131 --QPDLSSTYQPVKCTLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFG---NQSEL 185
P SS +Q T C ++ QC Y +Y + S +SG D F +S +
Sbjct: 165 CSDPICSSVFQ----TTAAQCSENN-QCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLV 219
Query: 186 APQRA--VFGCENVETGDLYS--QHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGM 241
A A VFGC ++GDL + DGI G G+G LS++ QL + + FS C G
Sbjct: 220 ANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 279
Query: 242 DVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKH--GT 299
GGG VLG I P MV++ P + P+YN++L I V G+ LPL++ VF+ + GT
Sbjct: 280 GSGGGVFVLGEILVP-GMVYSPLVPSQ-PHYNLNLLSIGVNGQMLPLDAAVFEASNTRGT 337
Query: 300 VLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFP 359
++D+GTT YL + A+ F +AI S++Q+ P + + C+ + + IS FP
Sbjct: 338 IVDTGTTLTYLVKEAYDLFLNAIS---NSVSQLVTPIISNGEQCYLVS----TSISDMFP 390
Query: 360 VVDMVFGNGQKYSLSPENYMFRHSKVRGA--YCLGVFQNGKDPTTLLGGIVVRNTLVTYD 417
V + F G L P++Y+F + GA +C+G FQ + T+LG +V+++ + YD
Sbjct: 391 SVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG-FQKAPEEQTILGDLVLKDKVFVYD 449
Query: 418 REQTKIGFWKTNCA 431
+ +IG+ +C+
Sbjct: 450 LARQRIGWASYDCS 463
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 170 bits (431), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 129/399 (32%), Positives = 204/399 (51%), Gaps = 39/399 (9%)
Query: 81 DLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPK-----FQPDLS 135
D L G Y T++ +GTPP+ F + +DTGS V +V C+SC C + + + F P +S
Sbjct: 77 DPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVS 136
Query: 136 STYQPV-----KC----TLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGN--QSE 184
S+ V +C + C + + C Y +Y + S +SG D +SF S
Sbjct: 137 SSASLVSCSDRRCYSNFQTESGCSPNNL-CSYSFKYGDGSGTSGYYISDFMSFDTVITST 195
Query: 185 LAPQRA---VFGCENVETGDLY--SQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYG 239
LA + VFGC N+++GDL + DGI GLG+G LS++ QL + + FS C
Sbjct: 196 LAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLK 255
Query: 240 GMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFD--GKH 297
G GGG MVLG I P D V+ P + P+YN++L+ I V G+ LP++ +VF
Sbjct: 256 GDKSGGGIMVLGQIKRP-DTVYTPLVPSQ-PHYNVNLQSIAVNGQILPIDPSVFTIATGD 313
Query: 298 GTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKS 357
GT++D+GTT AYLP+ A++ F A+ +++Q P CF DV
Sbjct: 314 GTIIDTGTTLAYLPDEAYSPFIQAVAN---AVSQYGRPITYESYQCFEITAGDV----DV 366
Query: 358 FPVVDMVFGNGQKYSLSPENYM-FRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTY 416
FP V + F G L P Y+ S +C+G + T+LG +V+++ +V Y
Sbjct: 367 FPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVY 426
Query: 417 DREQTKIGFWKTNCAELWERLQTSVAPPQMAPNTEVRNS 455
D + +IG+ + +C+ L+ +V+ + + +V N+
Sbjct: 427 DLVRQRIGWAEYDCS-----LEVNVSASRGGRSKDVINT 460
>AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:1762843-1766150 REVERSE LENGTH=485
Length = 485
Score = 149 bits (377), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 121/419 (28%), Positives = 197/419 (47%), Gaps = 55/419 (13%)
Query: 47 LSASNSSTSELDPRRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVD 106
L S ++ E D RRQL P D + G Y ++ IGTP + + + VD
Sbjct: 42 LQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPD---IPGLYYAKIGIGTPAKSYYVQVD 98
Query: 107 TGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDDD-------------- 152
TGS + +V C C+QC R + L + + L +CDDD
Sbjct: 99 TGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKL-VSCDDDFCYQISGGPLSGCK 157
Query: 153 -RMQCVYERQYAEMSTSSGVLGEDVISFGN-----QSELAPQRAVFGCENVETGDLYSQH 206
M C Y Y + S+++G +DV+ + + +++ A +FGC ++GDL S +
Sbjct: 158 ANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSN 217
Query: 207 A---DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGG-ISPPTDMVFA 262
DGI+G G+ + S++ QL V F+ C G + GGG +G + P +M
Sbjct: 218 EEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN-GGGIFAIGRVVQPKVNMT-- 274
Query: 263 YSDPV--RSPYYNIDLKEIHVAGKRLPLNSNVFD--GKHGTVLDSGTTYAYLPEAAFTAF 318
P+ P+YN+++ + V + L + +++F + G ++DSGTT AYLPE +
Sbjct: 275 ---PLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPL 331
Query: 319 KDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENY 378
I + +L ++ D +Y CF +G ++ + FP V F N + P +Y
Sbjct: 332 VKKITSQEPAL-KVHIVDKDYK--CFQYSG----RVDEGFPNVTFHFENSVFLRVYPHDY 384
Query: 379 MFRHSKVRGAYCLGVFQNG------KDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCA 431
+F H G +C+G +QN + TLLG +V+ N LV YD E IG+ + NC+
Sbjct: 385 LFPH---EGMWCIG-WQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 146 bits (368), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 133/445 (29%), Positives = 205/445 (46%), Gaps = 57/445 (12%)
Query: 62 QLHGSESKRHPNARMRLHDDLLLNG--------YYTTRLWIGTPPQMFALIVDTGSTVTY 113
+L +S RH ARM + DL L G Y T++ +G+PP+ + + VDTGS + +
Sbjct: 46 ELKSHDSFRH--ARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILW 103
Query: 114 VPCSSCEQCGRHQD-----PKFQPDLSSTYQPVKCTLD-CN-------CDDDRMQCVYER 160
V C+ C +C D + SST + V C D C+ C + C Y
Sbjct: 104 VNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKK-PCSYHV 162
Query: 161 QYAEMSTSSGVLGEDVISF----GN-QSELAPQRAVFGCENVETGDL--YSQHADGIMGL 213
Y + STS G +D I+ GN ++ Q VFGC ++G L DGIMG
Sbjct: 163 VYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGF 222
Query: 214 GRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPTDMVFAYSDPV--RSPY 271
G+ + SI+ QL FS C M+ GGG +G + P + P+ +
Sbjct: 223 GQSNTSIISQLAAGGSTKRIFSHCLDNMN-GGGIFAVGEVESPV----VKTTPIVPNQVH 277
Query: 272 YNIDLKEIHVAGK--RLPLNSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSL 329
YN+ LK + V G LP + +G GT++DSGTT AYLP+ + +++++++ +
Sbjct: 278 YNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLY----NSLIEKITAK 333
Query: 330 NQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAY 389
Q+ CFS S K+FPVV++ F + K S+ P +Y+F S Y
Sbjct: 334 QQVKLHMVQETFACFSFT----SNTDKAFPVVNLHFEDSLKLSVYPHDYLF--SLREDMY 387
Query: 390 CLG------VFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAELWERLQTSVAP 443
C G Q+G D LLG +V+ N LV YD E IG+ NC+ + S A
Sbjct: 388 CFGWQSGGMTTQDGAD-VILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVKDGSGAA 446
Query: 444 PQMAPNTEVRNSTKALAPSVAPPVS 468
Q+ + ++ + ++ +S
Sbjct: 447 YQLGAENLISAASSVMNGTLVTLLS 471
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 144 bits (363), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 178/375 (47%), Gaps = 46/375 (12%)
Query: 86 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQD----PKFQPDLSSTYQPV 141
G Y ++ +GTP + F + VDTGS + +V C+ C +C R D + D SST + V
Sbjct: 83 GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSV 142
Query: 142 KCTLD-CNCDDDRMQ------CVYERQYAEMSTSSGVLGEDVISF----GN-QSELAPQR 189
C+ + C+ + R + C Y Y + S+++G L +DV+ GN Q+
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGT 202
Query: 190 AVFGCENVETGDLYSQHA--DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGA 247
+FGC + ++G L A DGIMG G+ + S + QL + V SF+ C + GGG
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN-GGGI 261
Query: 248 MVLGGISPPTDMVFAYSDPV--RSPYYNIDLKEIHVAGKRLPLNSNVFDG--KHGTVLDS 303
+G + P + P+ +S +Y+++L I V L L+SN FD G ++DS
Sbjct: 262 FAIGEVVSPK----VKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDS 317
Query: 304 GTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDM 363
GTT YLP+A + + I+ S P+ H + S + FP V
Sbjct: 318 GTTLVYLPDAVYNPLLNEILA--------SHPELTLHTVQESFTCFHYTDKLDRFPTVTF 369
Query: 364 VFGNGQKYSLSPENYMFRHSKVR-GAYCLGVFQNGKDPT------TLLGGIVVRNTLVTY 416
F ++ P Y+F +VR +C G +QNG T T+LG + + N LV Y
Sbjct: 370 QFDKSVSLAVYPREYLF---QVREDTWCFG-WQNGGLQTKGGASLTILGDMALSNKLVVY 425
Query: 417 DREQTKIGFWKTNCA 431
D E IG+ NC+
Sbjct: 426 DIENQVIGWTNHNCS 440
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 140 bits (352), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 173/374 (46%), Gaps = 41/374 (10%)
Query: 85 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCT 144
+G + L IG P ++ IVDTGS + + C C +C P F P+ SS+Y V C+
Sbjct: 104 SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCS 163
Query: 145 LD-------CNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 197
NC++D+ C Y Y + S++ G+L + +F +++ ++ FGC
Sbjct: 164 SGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISG--IGFGCGVE 221
Query: 198 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNV------VSD---SFSLCYGGMDVG---- 244
GD +SQ G++GLGRG LS++ QL + + D S SL G + G
Sbjct: 222 NEGDGFSQ-GSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNK 280
Query: 245 GGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVF----DGKHGTV 300
GA + G ++ ++ +P + +Y ++L+ I V KRL + + F DG G +
Sbjct: 281 TGASLDGEVTKTMSLL---RNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMI 337
Query: 301 LDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPV 360
+DSGTT YL E AF K+ + SG D+CF + +K+ V
Sbjct: 338 IDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG--STGLDLCFK-----LPDAAKNIAV 390
Query: 361 VDMVFG-NGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRE 419
M+F G L ENYM S G CL + + ++ G + +N V +D E
Sbjct: 391 PKMIFHFKGADLELPGENYMVADSST-GVLCLAM--GSSNGMSIFGNVQQQNFNVLHDLE 447
Query: 420 QTKIGFWKTNCAEL 433
+ + F T C +L
Sbjct: 448 KETVSFVPTECGKL 461
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 135 bits (339), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 123/433 (28%), Positives = 195/433 (45%), Gaps = 56/433 (12%)
Query: 41 MVLPLYLSASNSSTSE---LDPRRQLHGSESKRHPNARMR---LHDDLLLNGYYTTRLWI 94
+ LPL + S ++ LD RR LH +R P ++ + +G Y L I
Sbjct: 32 LKLPLLRKSPFPSPTQALALDTRR-LHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRI 90
Query: 95 GTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPK-FQPDLSSTYQPVKC---------- 143
G PPQ LI DTGS + +V CS+C C H F P SST+ P C
Sbjct: 91 GQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK 150
Query: 144 ---TLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQS--ELAPQRAVFGCENVE 198
CN C YE YA+ S +SG+ + S S E + FGC
Sbjct: 151 PDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRI 210
Query: 199 TGDLYS----QHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLC---YGGMDVGGGAMVLG 251
+G S A+G+MGLGRG +S QL + + FS C Y +++G
Sbjct: 211 SGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYCLMDYTLSPPPTSYLIIG 268
Query: 252 ----GISPPTDMVFA--YSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFD----GKHGTVL 301
GIS + F ++P+ +Y + LK + V G +L ++ ++++ G GTV+
Sbjct: 269 NGGDGIS---KLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVV 325
Query: 302 DSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVV 361
DSGTT A+L E A+ + A+ + ++ L P + D+C + +G V++ K P +
Sbjct: 326 DSGTTLAFLAEPAYRSVIAAVRRRVK-LPIADALTPGF-DLCVNVSG--VTKPEKILPRL 381
Query: 362 DMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPT---TLLGGIVVRNTLVTYDR 418
F G + P NY + CL + DP +++G ++ + L +DR
Sbjct: 382 KFEFSGGAVFVPPPRNYFIETEE--QIQCLAI--QSVDPKVGFSVIGNLMQQGFLFEFDR 437
Query: 419 EQTKIGFWKTNCA 431
+++++GF + CA
Sbjct: 438 DRSRLGFSRRGCA 450
>AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11930579-11931769 REVERSE LENGTH=396
Length = 396
Score = 126 bits (317), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 156/391 (39%), Gaps = 34/391 (8%)
Query: 54 TSELDPRRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTY 113
T +L RR S + + + N Y +L +GTPP I+DTGS +T+
Sbjct: 31 TMDLIHRRSNASSRVSNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITW 90
Query: 114 VPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDDDRMQCVYERQYAEMSTSSGVLG 173
C C C P F P SST++ +C D C YE Y + + + G L
Sbjct: 91 TQCLPCVHCYEQNAPIFDPSKSSTFKEKRC--------DGHSCPYEVDYFDHTYTMGTLA 142
Query: 174 EDVISFGNQS--ELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVS 231
+ I+ + S + GC + G++GL G S++ Q+ +
Sbjct: 143 TETITLHSTSGEPFVMPETIIGCG--HNNSWFKPSFSGMVGLNWGPSSLITQMGGE--YP 198
Query: 232 DSFSLCYGG-----MDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRL 286
S C+ G ++ G A+V G T M + P +Y ++L + V R+
Sbjct: 199 GLMSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMTTAKP---GFYYLNLDAVSVGNTRI 255
Query: 287 PLNSNVFDGKHGT-VLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHD-ICF 344
F G V+DSGTT Y P + + A+ + + + DP +D +C+
Sbjct: 256 ETMGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTA---VRAADPTGNDMLCY 312
Query: 345 SGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLL 404
+ D+ FPV+ M F G L N M+ S G +CL + N +
Sbjct: 313 NSDTIDI------FPVITMHFSGGVDLVLDKYN-MYMESNNGGVFCLAIICNSPTQEAIF 365
Query: 405 GGIVVRNTLVTYDREQTKIGFWKTNCAELWE 435
G N LV YD + F TNC+ LW
Sbjct: 366 GNRAQNNFLVGYDSSSLLVSFSPTNCSALWN 396
>AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24230963-24233349 REVERSE LENGTH=475
Length = 475
Score = 126 bits (316), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 114/406 (28%), Positives = 185/406 (45%), Gaps = 66/406 (16%)
Query: 67 ESKRHPNARMRLHDDLLLNG--------YYTTRLWIGTPPQMFALIVDTGSTVTYVPCSS 118
+++RH +RM DL L G Y T++ +G+PP+ + + VDTGS + ++ C
Sbjct: 47 DTRRH--SRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKP 104
Query: 119 CEQCGRHQDPKFQPDL-----SSTYQPVKCTLDCNCDDD-------------RMQCVYER 160
C +C + F+ L SST + V CDDD + C Y
Sbjct: 105 CPKCPTKTNLNFRLSLFDMNASSTSKKV------GCDDDFCSFISQSDSCQPALGCSYHI 158
Query: 161 QYAEMSTSSGVLGEDVISFGNQS---ELAP--QRAVFGCENVETGDLYS--QHADGIMGL 213
YA+ STS G D+++ + + P Q VFGC + ++G L + DG+MG
Sbjct: 159 VYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGF 218
Query: 214 GRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPTDMVFAYSDPV--RSPY 271
G+ + S++ QL FS C + GGG +G + P + P+ +
Sbjct: 219 GQSNTSVLSQLAATGDAKRVFSHCLDNVK-GGGIFAVGVVDSPK----VKTTPMVPNQMH 273
Query: 272 YNIDLKEIHVAGKRLPL-NSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLN 330
YN+ L + V G L L S V +G GT++DSGTT AY P+ + D++++ + +
Sbjct: 274 YNVMLMGMDVDGTSLDLPRSIVRNG--GTIVDSGTTLAYFPKVLY----DSLIETILARQ 327
Query: 331 QISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYC 390
+ CFS + + + ++FP V F + K ++ P +Y+F + YC
Sbjct: 328 PVKLHIVEETFQCFSFS----TNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEE--ELYC 381
Query: 391 LGVFQNG-----KDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCA 431
G G + LLG +V+ N LV YD + IG+ NC+
Sbjct: 382 FGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427
>AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114946-29117150 REVERSE LENGTH=432
Length = 432
Score = 124 bits (312), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/409 (27%), Positives = 182/409 (44%), Gaps = 51/409 (12%)
Query: 62 QLHGSESKRHPNARMRLHDDLLLN-------------GYYTTRLWIGTPPQMFALIVDTG 108
+ SE+ + +A+++L + L + GYY L IG PP++F L +DTG
Sbjct: 28 RFQTSEATKDSSAQVKLQNRRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTG 87
Query: 109 STVTYVPCSS-CEQCGRHQDPKFQPDLSST-YQPVKCT-----LDCNCDDDRMQCVYERQ 161
S +T+V C + C C + + +++P+ ++ + C+ D C D QC YE
Sbjct: 88 SDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIG 147
Query: 162 YAEMSTSSGVLGEDVISFG-NQSELAPQRAVFGC--ENVETGDLYSQHADGIMGLGRGDL 218
Y++ ++S G L D + + R FGC + G GI+GLGRG +
Sbjct: 148 YSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKV 207
Query: 219 SIMDQL----VDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNI 274
+ QL + KNV+ S G G + +G P+ V S SP N
Sbjct: 208 GLSTQLKSLGITKNVIVHCLS------HTGKGFLSIGDELVPSSGVTWTSLATNSPSKN- 260
Query: 275 DLKEIHVAGKRLPLNSNVFDGKHG--TVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQI 332
++AG L ++ G G V DSG++Y Y A+ A D I K+L
Sbjct: 261 -----YMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLT 315
Query: 333 SGPDPNYHDICFSGAG--TDVSQISKSFPVVDMVFG---NGQKYSLSPENYMFRHSKVRG 387
D +C+ G + ++ K F + + FG NGQ + + PE+Y+ K G
Sbjct: 316 DTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEK--G 373
Query: 388 AYCLGVFQN---GKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAEL 433
CLG+ G + ++G I + +V YD E+ +IG+ ++C +L
Sbjct: 374 RVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDCDKL 422
>AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114705-29117150 REVERSE LENGTH=466
Length = 466
Score = 124 bits (311), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 113/410 (27%), Positives = 184/410 (44%), Gaps = 53/410 (12%)
Query: 62 QLHGSESKRHPNARMRLHDDLLLN-------------GYYTTRLWIGTPPQMFALIVDTG 108
+ SE+ + +A+++L + L + GYY L IG PP++F L +DTG
Sbjct: 28 RFQTSEATKDSSAQVKLQNRRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTG 87
Query: 109 STVTYVPCSS-CEQCGRHQDPKFQPDLSST-YQPVKCT-----LDCNCDDDRMQCVYERQ 161
S +T+V C + C C + + +++P+ ++ + C+ D C D QC YE
Sbjct: 88 SDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIG 147
Query: 162 YAEMSTSSGVLGEDVI--SFGNQSELAPQRAVFGC--ENVETGDLYSQHADGIMGLGRGD 217
Y++ ++S G L D + N S + R FGC + G GI+GLGRG
Sbjct: 148 YSDHASSIGALVTDEVPLKLANGS-IMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGK 206
Query: 218 LSIMDQL----VDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYN 273
+ + QL + KNV+ S G G + +G P+ V S SP N
Sbjct: 207 VGLSTQLKSLGITKNVIVHCLS------HTGKGFLSIGDELVPSSGVTWTSLATNSPSKN 260
Query: 274 IDLKEIHVAGKRLPLNSNVFDGKHG--TVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQ 331
++AG L ++ G G V DSG++Y Y A+ A D I K+L
Sbjct: 261 ------YMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPL 314
Query: 332 ISGPDPNYHDICFSGAG--TDVSQISKSFPVVDMVFG---NGQKYSLSPENYMFRHSKVR 386
D +C+ G + ++ K F + + FG NGQ + + PE+Y+ K
Sbjct: 315 TDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEK-- 372
Query: 387 GAYCLGVFQN---GKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAEL 433
G CLG+ G + ++G I + +V YD E+ +IG+ ++C +L
Sbjct: 373 GRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDCDKL 422
>AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108781-16110679 REVERSE LENGTH=425
Length = 425
Score = 124 bits (310), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/415 (26%), Positives = 184/415 (44%), Gaps = 31/415 (7%)
Query: 41 MVLPLYLSASNSSTSELDPRRQLHGSESKRHPNAR----MRLHDDLLLNGYYTTRLWIGT 96
M++ + +S +S +D R + S R A +H ++ GYY + IG
Sbjct: 9 MIVLMVMSLVLGFSSAVDFRWRKTAGFSDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQ 68
Query: 97 PPQMFALIVDTGSTVTYVPCSS-CEQCGRHQDPKFQP--DLSSTYQPVKCTLDCNCD--- 150
PP+ + L +DTGS +T++ C + C +C P +QP DL P+ L N +
Sbjct: 69 PPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRC 128
Query: 151 DDRMQCVYERQYAEMSTSSGVLGEDVISFG-NQSELAPQRAVFGCENVETGDLYSQH-AD 208
+ QC YE +YA+ +S GVL DV S Q R GC + S H D
Sbjct: 129 ETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLD 188
Query: 209 GIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLG-GISPPTDMVFAYSDPV 267
G++GLGRG +SI+ QL + V + C + GGG + G + + + +
Sbjct: 189 GVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSL--GGGILFFGDDLYDSSRVSWTPMSRE 246
Query: 268 RSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQ 327
S +Y+ + + G R N+ TV DSG++Y Y A+ A + +EL
Sbjct: 247 YSKHYSPAMGGELLFGGRTTGLKNLL-----TVFDSGSSYTYFNSKAYQAVTYLLKRELS 301
Query: 328 SLNQISGPDPNYHDICFSGAG--TDVSQISKSFPVVDMVFGNGQK----YSLSPENYMFR 381
D + +C+ G + ++ K F + + F G + + + PE Y+
Sbjct: 302 GKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLI- 360
Query: 382 HSKVRGAYCLGVFQN---GKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAEL 433
++G CLG+ G L+G I +++ ++ YD E+ IG+ +C EL
Sbjct: 361 -ISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 414
>AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11936203-11937390 REVERSE LENGTH=395
Length = 395
Score = 119 bits (299), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 157/384 (40%), Gaps = 36/384 (9%)
Query: 61 RQLHGSESKRHPNARMRLHDDLLLNGY-YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSC 119
R+ + S S+ + D + + Y Y +L IGTPP ++DTGS + C C
Sbjct: 37 RRSNASSSRVFNTQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC 96
Query: 120 EQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISF 179
C P F P SST++ ++ CD C YE Y S + G L + ++
Sbjct: 97 VHCYNQTAPIFDPSKSSTFKEIR------CDTHDHSCPYELVYGGKSYTKGTLVTETVTI 150
Query: 180 GNQS--ELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLC 237
+ S + GC +G + G++GL RG S++ Q+ + S C
Sbjct: 151 HSTSGQPFVMPETIIGCGRNNSG--FKPGFAGVVGLDRGPKSLITQMGGE--YPGLMSYC 206
Query: 238 YGG-----MDVGGGAMVLG-GISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSN 291
+ G ++ G A+V G G+ T V + +Y ++L + V R+
Sbjct: 207 FAGKGTSKINFGANAIVAGDGVVSTTVFV----KTAKPGFYYLNLDAVSVGNTRIETVGT 262
Query: 292 VFDGKHGT-VLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTD 350
F G V+DSG+T Y PE+ + A+ + + ++ P +C+ D
Sbjct: 263 PFHALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRF-----PRSDILCYYSKTID 317
Query: 351 VSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVR 410
+ FPV+ M F G L N M+ S G +CL + N + G
Sbjct: 318 I------FPVITMHFSGGADLVLDKYN-MYVASNTGGVFCLAIICNSPIEEAIFGNRAQN 370
Query: 411 NTLVTYDREQTKIGFWKTNCAELW 434
N LV YD + F TNC+ LW
Sbjct: 371 NFLVGYDSSSLLVSFKPTNCSALW 394
>AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14682210-14683484 REVERSE LENGTH=424
Length = 424
Score = 116 bits (290), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 116/442 (26%), Positives = 192/442 (43%), Gaps = 65/442 (14%)
Query: 28 AILQNRHHGSLPAMVLPLYLSASNSSTSELDPRRQLHGSE--------SKRHPNARMRLH 79
+L R SL ++ L S +S S L R + S SK P +R+
Sbjct: 4 TLLPLRIRASLLIIIFALTCSKECTSHSRLTLRTKTQESSKIKIGYLHSKSTPASRL--- 60
Query: 80 DDLLLNGYYT---------TRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKF 130
D+L + T + IG PP L++DTGS +T++ C C +C P F
Sbjct: 61 DNLWTVSHVTPIPNPAAFLANISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFF 119
Query: 131 QPDLSSTYQPVKCTLDCNC------DDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSE 184
P SSTY+ C + D+ C Y +Y + S + G+L E+ ++F +
Sbjct: 120 HPSRSSTYRNASCVSAPHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDD 179
Query: 185 --LAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD 242
++ Q VFGC +G +++++ G++GLG G SI V +N S FS C+G +
Sbjct: 180 GLISKQNIVFGCGQDNSG--FTKYS-GVLGLGPGTFSI----VTRNFGS-KFSYCFGSLT 231
Query: 243 ----------VGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNV 292
+G GA + G +P + Y +DL+ I K L +
Sbjct: 232 NPTYPHNILILGNGAKIEGDPTPLQ---------IFQDRYYLDLQAISFGEKLLDIEPGT 282
Query: 293 FD---GKHGTVLDSGTTYAYLPEAAFTAFKDAIVKEL-QSLNQISGPDPNYHDICFSGAG 348
F + GTV+D+G + L A+ + I L + L ++ D Y C+ G
Sbjct: 283 FQRYRSQGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWD-QYTTPCYEG-- 339
Query: 349 TDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIV 408
++ FPVV F G + +L E+ +F S+ ++CL + N D +++G +
Sbjct: 340 -NLKLDLYGFPVVTFHFAGGAELALDVES-LFVSSESGDSFCLAMTMNTFDDMSVIGAMA 397
Query: 409 VRNTLVTYDREQTKIGFWKTNC 430
+N V Y+ K+ F +T+C
Sbjct: 398 QQNYNVGYNLRTMKVYFQRTDC 419
>AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18150638-18153186 FORWARD LENGTH=583
Length = 583
Score = 116 bits (290), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 104/386 (26%), Positives = 172/386 (44%), Gaps = 53/386 (13%)
Query: 85 NGYYTTRLWIGTPP--QMFALIVDTGSTVTYVPCSS-CEQCGRHQDPKFQPD-------- 133
+G Y TR+ +G P Q + L +DTGS +T++ C + C C + + ++P
Sbjct: 200 DGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSS 259
Query: 134 --LSSTYQPVKCTLDC-NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFG-NQSELAPQR 189
Q + T C NC QC YE +YA+ S S GVL +D + LA
Sbjct: 260 EAFCVEVQRNQLTEHCENCH----QCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESD 315
Query: 190 AVFGCENVETGDLYSQ--HADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGA 247
VFGC + G L + DGI+GL R +S+ QL + ++S+ C G G
Sbjct: 316 IVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGY 375
Query: 248 MVLGG-ISPPTDMVFA-YSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVL-DSG 304
+ +G + P M + R Y + + ++ L L+ +G+ G VL D+G
Sbjct: 376 IFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGE--NGRVGKVLFDTG 433
Query: 305 TTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHD------ICFSGAG----TDVSQI 354
++Y Y P A++ + SL ++SG + D IC+ + +S +
Sbjct: 434 SSYTYFPNQAYSQL-------VTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDV 486
Query: 355 SKSFPVVDMVFGN-----GQKYSLSPENYMFRHSKVRGAYCLGVFQNGK---DPTTLLGG 406
K F + + G+ +K + PE+Y+ +K G CLG+ T +LG
Sbjct: 487 KKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNK--GNVCLGILDGSSVHDGSTIILGD 544
Query: 407 IVVRNTLVTYDREQTKIGFWKTNCAE 432
I +R L+ YD + +IG+ K++C
Sbjct: 545 ISMRGHLIVYDNVKRRIGWMKSDCVR 570
>AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:14665728-14669135 REVERSE LENGTH=430
Length = 430
Score = 115 bits (288), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 116/430 (26%), Positives = 177/430 (41%), Gaps = 85/430 (19%)
Query: 40 AMVLPLYLSASNSSTSELDPRRQLHGSESKRHP-----------NARMRLHDDLLLNGYY 88
A VLPL S ELD QL +S RH N ++ +LL+ Y
Sbjct: 22 ATVLPLKRMIPPSH--ELD-LTQLMTFDSARHGRLLQSPVHGSFNWKVERDTSILLSALY 78
Query: 89 TTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDCN 148
T + IGTPP+ +++DTGS + +V C+SC C H F P SS+ + C+ D
Sbjct: 79 YTTVQIGTPPRELDVVIDTGSDLVWVSCNSCVGCPLHNVTFFDPGASSSAVKLACS-DKR 137
Query: 149 CDDDRMQ---------CVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVET 199
C D + C Y+ +Y + S +SG D+ISF S+
Sbjct: 138 CSSDLQKKSRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMSDWT------------- 184
Query: 200 GDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSF-SLCYGGMDVGGGAMVLGGISPPTD 258
+ D S V + + +F +LC S P
Sbjct: 185 ------------YIAFRDNSTWHPWVRQGAIIGTFPALC----------------STPCS 216
Query: 259 MVFAYSDPVRSPYYNIDLKE---IHVAGKRLPLNSNVFD--GKHGTVLDSGTTYAYLPEA 313
V S P+ YYN + V RLP++ +VF +GT++DSGTT + P
Sbjct: 217 TV--SSQPL---YYNPQFSHMMTVAVNDLRLPIDPSVFSVAKGYGTIIDSGTTLVHFPGE 271
Query: 314 AFTAFKDAIVKELQSLNQISGPDPNYHDICF---SGAGTDVSQISKSFPVVDMVFGNGQK 370
A+ AI L ++Q P P CF SG + + I+ FP V + F G
Sbjct: 272 AYDPLIQAI---LNVVSQYGRPIPYESFQCFNITSGISSHLV-IADMFPEVHLGFAGGAS 327
Query: 371 YSLSPENYMFRH--SKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKT 428
+ PE Y+F+ +CLG + + T++G + +R+ + YD + +IG+ +
Sbjct: 328 MVIKPEAYLFQKFLDLTNAIWCLGFYSSTSRRITIIGEVAIRDKMFVYDLDHQRIGWAEY 387
Query: 429 NCAELWERLQ 438
NC+ R Q
Sbjct: 388 NCSLDVTRAQ 397
>AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:12033953-12037527 FORWARD LENGTH=756
Length = 756
Score = 115 bits (288), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 150/359 (41%), Gaps = 36/359 (10%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDC 147
Y +L +GTPP +DTGS + + C C C P F P SST++ +C
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRC---- 476
Query: 148 NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQS--ELAPQRAVFGCENVETGDLYSQ 205
+ C YE YA+ + S G+L + ++ + S GC T YS
Sbjct: 477 ----NGNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSG 532
Query: 206 HA---DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGG-----MDVGGGAMVLGGISPPT 257
A GI+GL G LS++ Q+ S C+ G ++ G A+V G +
Sbjct: 533 FASSSSGIVGLNMGPLSLISQM--DLPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAA 590
Query: 258 DMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTV-LDSGTTYAYLPEAAFT 316
DM F D +P+Y ++L + V + F + G + +DSGTT Y P +
Sbjct: 591 DM-FIKKD---NPFYYLNLDAVSVEDNLIATLGTPFHAEDGNIFIDSGTTLTYFPMSYCN 646
Query: 317 AFKDAIVKELQSLNQISGPDPNYHD-ICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSP 375
++A+ Q + + PD + +C+ D+ FPV+ M F G L
Sbjct: 647 LVREAVE---QVVTAVKVPDMGSDNLLCYYSDTIDI------FPVITMHFSGGADLVLDK 697
Query: 376 ENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAELW 434
N M+ + G +CL + N + G N LV YD I F TNC+ LW
Sbjct: 698 YN-MYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCSALW 755
Score = 112 bits (279), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 160/381 (41%), Gaps = 48/381 (12%)
Query: 50 SNSSTSELDPRRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGS 109
SNSS+ L + QL G+ P A D L Y +L +GTPP A +DTGS
Sbjct: 54 SNSSSFRLS-KNQLQGAS----PYA-----DTLFDYNIYLMKLQVGTPPFEIAAEIDTGS 103
Query: 110 TVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDDDRMQCVYERQYAEMSTSS 169
+ + C C C DP F P SST+ +C C YE Y + + S
Sbjct: 104 DLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRC--------HGKSCHYEIIYEDNTYSK 155
Query: 170 GVLGEDVISFGNQS--ELAPQRAVFGCENVETGDL----YSQHADGIMGLGRGDLSIMDQ 223
G+L + ++ + S GC + DL ++ + GI+GL G S++ Q
Sbjct: 156 GILATETVTIHSTSGEPFVMAETTIGC-GLHNTDLDNSGFASSSSGIVGLNMGPRSLISQ 214
Query: 224 LVDKNVVSDSFSLCYGG-----MDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKE 278
+ S C+ G ++ G A+V G + DM F D +P+Y ++L
Sbjct: 215 M--DLPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADM-FIKKD---NPFYYLNLDA 268
Query: 279 IHVAGKRLPLNSNVFDGKHGT-VLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDP 337
+ V R+ F + G V+DSG+T Y P + + A+ Q + + PDP
Sbjct: 269 VSVEDNRIETLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVE---QVVTAVRVPDP 325
Query: 338 NYHD-ICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQN 396
+ +D +C+ D+ FPV+ M F G L N M+ S G +CL + N
Sbjct: 326 SGNDMLCYFSETIDI------FPVITMHFSGGADLVLDKYN-MYMESNSGGLFCLAIICN 378
Query: 397 GKDPTTLLGGIVVRNTLVTYD 417
+ G N LV YD
Sbjct: 379 SPTQEAIFGNRAQNNFLVGYD 399
>AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18151161-18153186 FORWARD LENGTH=410
Length = 410
Score = 113 bits (282), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 170/383 (44%), Gaps = 53/383 (13%)
Query: 88 YTTRLWIGTPP--QMFALIVDTGSTVTYVPCSS-CEQCGRHQDPKFQPD----------L 134
Y TR+ +G P Q + L +DTGS +T++ C + C C + + ++P
Sbjct: 30 YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAF 89
Query: 135 SSTYQPVKCTLDC-NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFG-NQSELAPQRAVF 192
Q + T C NC QC YE +YA+ S S GVL +D + LA VF
Sbjct: 90 CVEVQRNQLTEHCENCH----QCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVF 145
Query: 193 GCENVETGDLYSQ--HADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVL 250
GC + G L + DGI+GL R +S+ QL + ++S+ C G G + +
Sbjct: 146 GCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFM 205
Query: 251 GG-ISPPTDMVFA-YSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVL-DSGTTY 307
G + P M + R Y + + ++ L L+ +G+ G VL D+G++Y
Sbjct: 206 GSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGE--NGRVGKVLFDTGSSY 263
Query: 308 AYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHD------ICFSGAG----TDVSQISKS 357
Y P A++ + SL ++SG + D IC+ + +S + K
Sbjct: 264 TYFPNQAYSQL-------VTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKF 316
Query: 358 FPVVDMVFGN-----GQKYSLSPENYMFRHSKVRGAYCLGVFQNGK---DPTTLLGGIVV 409
F + + G+ +K + PE+Y+ +K G CLG+ T +LG I +
Sbjct: 317 FRPITLQIGSKWLIISRKLLIQPEDYLIISNK--GNVCLGILDGSSVHDGSTIILGDISM 374
Query: 410 RNTLVTYDREQTKIGFWKTNCAE 432
R L+ YD + +IG+ K++C
Sbjct: 375 RGHLIVYDNVKRRIGWMKSDCVR 397
>AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19060485-19063248 REVERSE LENGTH=528
Length = 528
Score = 112 bits (280), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 174/379 (45%), Gaps = 50/379 (13%)
Query: 83 LLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQD----PK------FQP 132
LL Y + +GTPP F + +DTGS + ++PC+ C R + P+ + P
Sbjct: 97 LLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTP 156
Query: 133 DLSSTYQPVKCT-LDC----NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSE-LA 186
+ S+T ++C+ C C C Y+ Y+ + + G L +DV+ + E L
Sbjct: 157 NASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLT 216
Query: 187 PQRA--VFGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDV 243
P +A GC +TG ++ +G++GLG S+ L N+ ++SFS+C+G
Sbjct: 217 PVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFG---- 272
Query: 244 GGGAMVLGGISPPTDMVFAYSDPVRSPY--------YNIDLKEIHVAGKRLPLNSNVFDG 295
V+G + + Y+D +P+ Y +++ + VAG P++ +F
Sbjct: 273 ----RVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNISGVSVAGD--PVDIRLF-- 324
Query: 296 KHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVS--Q 353
D+G+++ +L E A+ + + ++ + P+ + + C+ D+S
Sbjct: 325 ---AKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPF-EFCY-----DLSPNA 375
Query: 354 ISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTL 413
+ FP+V+M F G K L+ + R + YCLGV ++ ++G V
Sbjct: 376 TTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYR 435
Query: 414 VTYDREQTKIGFWKTNCAE 432
+ +DRE+ +G+ ++ C E
Sbjct: 436 IVFDRERMILGWKQSLCFE 454
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 110 bits (276), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 161/364 (44%), Gaps = 40/364 (10%)
Query: 85 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCT 144
+G Y +R+ +GTP + L++DTGS V ++ C C C + DP F P SSTY+ + C+
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCS 218
Query: 145 L-DCN------CDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 197
C+ C ++ C+Y+ Y + S + G L D ++FGN ++ GC +
Sbjct: 219 APQCSLLETSACRSNK--CLYQVSYGDGSFTVGELATDTVTFGNSGKI--NNVALGCGHD 274
Query: 198 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMV------LG 251
G L++ A + G LSI +Q+ + SFS C D G + + LG
Sbjct: 275 NEG-LFTGAAGLLGLGGGV-LSITNQM-----KATSFSYCLVDRDSGKSSSLDFNSVQLG 327
Query: 252 GISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFD----GKHGTVLDSGTTY 307
G ++ + +Y + L V G+++ L +FD G G +LD GT
Sbjct: 328 GGDATAPLL---RNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAV 384
Query: 308 AYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISK-SFPVVDMVFG 366
L A+ + +DA +K +L + S + D C+ D S +S P V F
Sbjct: 385 TRLQTQAYNSLRDAFLKLTVNLKKGSS-SISLFDTCY-----DFSSLSTVKVPTVAFHFT 438
Query: 367 NGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFW 426
G+ L +NY+ G +C F +++G + + T +TYD + IG
Sbjct: 439 GGKSLDLPAKNYLIPVDD-SGTFCFA-FAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLS 496
Query: 427 KTNC 430
C
Sbjct: 497 GNKC 500
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 109 bits (273), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 105/410 (25%), Positives = 178/410 (43%), Gaps = 31/410 (7%)
Query: 32 NRHHGSLPAMVLPLYLSASNSSTSELDPRRQLHGSESKRHPNARMRLHDDLLLNGYYTTR 91
NR + +++ L L+ +N S ++L P ++ +E ++ A + + +G Y TR
Sbjct: 94 NRDTARVKSLITRLDLAINNISKADLKPISTMYTTE-EQDIEAPL-ISGTTQGSGEYFTR 151
Query: 92 LWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKC-TLDCNC- 149
+ IG P + +++DTGS V ++ C+ C C +P F+P SS+Y+P+ C T CN
Sbjct: 152 VGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNAL 211
Query: 150 ---DDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETGDLYSQH 206
+ C+YE Y + S + G + ++ G S L +NV G +S
Sbjct: 212 EVSECRNATCLYEVSYGDGSYTVGDFATETLTIG--STLV--------QNVAVGCGHSNE 261
Query: 207 ADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPTDMVFA--YS 264
+ G L + + + SFS C D + V G S D V A
Sbjct: 262 GLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVDFGTSLSPDAVVAPLLR 321
Query: 265 DPVRSPYYNIDLKEIHVAGKRLPLNSNVFD----GKHGTVLDSGTTYAYLPEAAFTAFKD 320
+ +Y + L I V G+ L + + F+ G G ++DSGT L + + +D
Sbjct: 322 NHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRD 381
Query: 321 AIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMF 380
+ VK L + +G D C++ + ++ P V F G+ +L +NYM
Sbjct: 382 SFVKGTLDLEKAAG--VAMFDTCYNLSAKTTVEV----PTVAFHFPGGKMLALPAKNYMI 435
Query: 381 RHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNC 430
V G +CL F ++G + + T VT+D + IGF C
Sbjct: 436 PVDSV-GTFCLA-FAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:7713488-7716269 FORWARD LENGTH=513
Length = 513
Score = 109 bits (272), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 162/359 (45%), Gaps = 41/359 (11%)
Query: 94 IGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPK---------FQPDLSSTYQPVKC- 143
+GTP F + +DTGS + ++PC C C R + P+ SST V C
Sbjct: 110 VGTPSDWFMVALDTGSDLFWLPCD-CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCN 168
Query: 144 ----TLDCNCDDDRMQCVYERQYAEMSTSS-GVLGEDV---ISFGNQSELAPQRAVFGCE 195
T C C Y+ +Y TSS GVL EDV +S S+ P R FGC
Sbjct: 169 STLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCG 228
Query: 196 NVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLG--G 252
V+TG + A +G+ GLG D+S+ L + + ++SFS+C+G + G G + G G
Sbjct: 229 QVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG--NDGAGRISFGDKG 286
Query: 253 ISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPE 312
+ P P YNI + +I V G N D + V DSGT++ YL +
Sbjct: 287 SVDQRETPLNIRQP--HPTYNITVTKISVGG-------NTGDLEFDAVFDSGTSFTYLTD 337
Query: 313 AAFTAFKDAIVK-ELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKY 371
AA+T ++ L Q + + + + C++ ++ S +P V++ G Y
Sbjct: 338 AAYTLISESFNSLALDKRYQTTDSELPF-EYCYA---LSPNKDSFQYPAVNLTMKGGSSY 393
Query: 372 SLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNC 430
+ + K YCL + + + +++G + V +DRE+ +G+ +++C
Sbjct: 394 PVY-HPLVVIPMKDTDVYCLAIMK--IEDISIIGQNFMTGYRVVFDREKLILGWKESDC 449
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 108 bits (271), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 152/366 (41%), Gaps = 35/366 (9%)
Query: 83 LLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSC-EQCGRHQDPKFQPDLSSTYQPV 141
L +G Y + +GTP +LI DTGS +T+ C C C ++P F P S++Y V
Sbjct: 127 LGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNV 186
Query: 142 KCTLDCNCDD-----------DRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRA 190
C+ C C+Y QY + S S G L ++ + N
Sbjct: 187 SCS-SAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVF--DGV 243
Query: 191 VFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVL 250
FGC G L++ A G++GLGR LS Q + FS C G +
Sbjct: 244 YFGCGENNQG-LFTGVA-GLLGLGRDKLSFPSQTA--TAYNKIFSYCLPSSASYTGHLTF 299
Query: 251 G--GISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYA 308
G GIS + + +Y +++ I V G++LP+ S VF G ++DSGT
Sbjct: 300 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-TPGALIDSGTVIT 358
Query: 309 YLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNG 368
LP A+ A + + ++ SG + D CF +G I P V F G
Sbjct: 359 RLPPKAYAALRSSFKAKMSKYPTTSG--VSILDTCFDLSGFKTVTI----PKVAFSFSGG 412
Query: 369 QKYSLSPEN--YMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTL-VTYDREQTKIGF 425
L + Y+F+ S+V CL N D + G V + TL V YD ++GF
Sbjct: 413 AVVELGSKGIFYVFKISQV----CLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGF 468
Query: 426 WKTNCA 431
C+
Sbjct: 469 APNGCS 474
>AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108928-16110670 REVERSE LENGTH=401
Length = 401
Score = 108 bits (269), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 169/390 (43%), Gaps = 31/390 (7%)
Query: 41 MVLPLYLSASNSSTSELDPRRQLHGSESKRHPNAR----MRLHDDLLLNGYYTTRLWIGT 96
M++ + +S +S +D R + S R A +H ++ GYY + IG
Sbjct: 6 MIVLMVMSLVLGFSSAVDFRWRKTAGFSDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQ 65
Query: 97 PPQMFALIVDTGSTVTYVPCSS-CEQCGRHQDPKFQP--DLSSTYQPVKCTLDCNCD--- 150
PP+ + L +DTGS +T++ C + C +C P +QP DL P+ L N +
Sbjct: 66 PPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRC 125
Query: 151 DDRMQCVYERQYAEMSTSSGVLGEDVISFG-NQSELAPQRAVFGCENVETGDLYSQH-AD 208
+ QC YE +YA+ +S GVL DV S Q R GC + S H D
Sbjct: 126 ETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLD 185
Query: 209 GIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLG-GISPPTDMVFAYSDPV 267
G++GLGRG +SI+ QL + V + C + GGG + G + + + +
Sbjct: 186 GVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSL--GGGILFFGDDLYDSSRVSWTPMSRE 243
Query: 268 RSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQ 327
S +Y+ + + G R N+ TV DSG++Y Y A+ A + +EL
Sbjct: 244 YSKHYSPAMGGELLFGGRTTGLKNLL-----TVFDSGSSYTYFNSKAYQAVTYLLKRELS 298
Query: 328 SLNQISGPDPNYHDICFSGAG--TDVSQISKSFPVVDMVFGNGQK----YSLSPENYMFR 381
D + +C+ G + ++ K F + + F G + + + PE Y+
Sbjct: 299 GKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLI- 357
Query: 382 HSKVRGAYCLGVFQN---GKDPTTLLGGIV 408
++G CLG+ G L+GG V
Sbjct: 358 -ISMKGNVCLGILNGTEIGLQNLNLIGGTV 386
>AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11934208-11935386 REVERSE LENGTH=392
Length = 392
Score = 107 bits (268), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 149/364 (40%), Gaps = 32/364 (8%)
Query: 80 DDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQ 139
D L Y +L +GTPP +DTGS + + C C C P F P SST++
Sbjct: 53 DTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK 112
Query: 140 PVKCTLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSE---LAPQRAVFGCEN 196
+C + C Y+ YA+ + S G L + ++ + S + P+ + GC +
Sbjct: 113 EKRC--------NGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTI-GCGH 163
Query: 197 VETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGG---MDVGGGAMVLGGI 253
+ G++GL G S++ Q+ + S+ G ++ G A+V G
Sbjct: 164 --NSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG 221
Query: 254 SPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGT-VLDSGTTYAYLPE 312
T M + P Y ++L + V + F G ++DSGTT Y P
Sbjct: 222 VVSTTMFLTTAKP---GLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPV 278
Query: 313 AAFTAFKDAIVKELQSLNQISGPDPNYHD-ICFSGAGTDVSQISKSFPVVDMVFGNGQKY 371
+ ++A+ + ++ DP +D +C+ D+ FPV+ M F G
Sbjct: 279 SYCNLVREAVDHYVTAVRT---ADPTGNDMLCYYTDTIDI------FPVITMHFSGGADL 329
Query: 372 SLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCA 431
L N M+ + RG +CL + N + G N LV YD + F TNC+
Sbjct: 330 VLDKYN-MYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
Query: 432 ELWE 435
LW
Sbjct: 389 ALWN 392
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 107 bits (267), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 110/416 (26%), Positives = 173/416 (41%), Gaps = 49/416 (11%)
Query: 44 PLYLSASNSSTS-----ELDPRRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPP 98
P Y SA SS R L S PN+ G Y + IGTPP
Sbjct: 39 PFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSN--RGEYLMNISIGTPP 96
Query: 99 QMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTL-------DCNCDD 151
I DTGS + + C+ CE C + P F P SSTY+ V C+ D +C
Sbjct: 97 VPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCST 156
Query: 152 DRMQCVYERQYAEMSTSSGVLGEDVISFGNQSE--LAPQRAVFGCENVETGDLYSQHADG 209
D C Y Y + S + G + D ++ G+ ++ + + GC + TG + G
Sbjct: 157 DENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHENTG-TFDPAGSG 215
Query: 210 IMGLGRGDLSIMDQLVDKNVVSDSFSLCY----------GGMDVGGGAMVLGGISPPTDM 259
I+GLG G S++ QL + ++ FS C ++ G +V G T M
Sbjct: 216 IIGLGGGSTSLVSQL--RKSINGKFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSM 273
Query: 260 VFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVF-DGKHGTVLDSGTTYAYLPEAAFTAF 318
V DP + YY ++L+ I V K++ S +F G+ V+DSGTT LP +
Sbjct: 274 V--KKDP--ATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTLLPSNFYYEL 329
Query: 319 KDAIVKELQSLNQISGPDPNYHDICFSGAGT-DVSQISKSFPVVDMVFGNGQKYSLSPEN 377
+ + +++ ++ PD +C+ + + V I+ F D+ GN + E+
Sbjct: 330 ESVVASTIKA-ERVQDPD-GILSLCYRDSSSFKVPDITVHFKGGDVKLGNLNTFVAVSED 387
Query: 378 YMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAEL 433
C N + T+ G + N LV YD + F KT+C+++
Sbjct: 388 ----------VSCFAFAAN--EQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCSQM 431
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 107 bits (266), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 162/365 (44%), Gaps = 32/365 (8%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDC 147
Y T + +GTP + F ++VDTGS +T+V C + G+ F+ D S +++ V C L
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRAR-GKDNRRVFRADESKSFKTVGC-LTQ 163
Query: 148 NCDDDRMQ-------------CVYERQYAEMSTSSGVLGEDVISFG--NQSELAPQRAVF 192
C D M C Y+ +YA+ S + GV ++ I+ G N +
Sbjct: 164 TCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLI 223
Query: 193 GCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGG 252
GC + TG + Q ADG++GL D S S+ L + ++ G
Sbjct: 224 GCSSSFTGQSF-QGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFG 282
Query: 253 ISPPTDMVFAYSDPVR----SPYYNIDLKEIHVAGKRLPLNSNVFDGKH--GTVLDSGTT 306
S T F + P+ P+Y I++ I + L + S V+D GT+LDSGT+
Sbjct: 283 SSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTS 342
Query: 307 YAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFS-GAGTDVSQISKSFPVVDMVF 365
L +AA+ + + L L ++ P+ + CFS +G +VS++ P +
Sbjct: 343 LTLLADAAYKQVVTGLARYLVELKRVK-PEGVPIEYCFSFTSGFNVSKL----PQLTFHL 397
Query: 366 GNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGF 425
G ++ ++Y+ + G CLG G T ++G I+ +N L +D + + F
Sbjct: 398 KGGARFEPHRKSYLVDAAP--GVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSF 455
Query: 426 WKTNC 430
+ C
Sbjct: 456 APSAC 460
>AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19064294-19066560 REVERSE LENGTH=488
Length = 488
Score = 107 bits (266), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 102/395 (25%), Positives = 157/395 (39%), Gaps = 40/395 (10%)
Query: 94 IGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPK---------FQPDLSSTYQPVKC- 143
IGTP Q F + +DTGS + ++PC+ C R + + P S + V C
Sbjct: 95 IGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIKLNIYNPSKSKSSSKVTCN 154
Query: 144 ----TLDCNCDDDRMQCVYERQY-AEMSTSSGVLGEDVISFGNQS-ELAPQRAVFGCENV 197
L C C Y +Y + S S+GVL EDVI + E R FGC
Sbjct: 155 STLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDARITFGCSES 214
Query: 198 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPT 257
+ G +GIMGL D+++ + LV V SDSFS+C+G G G + G
Sbjct: 215 QLGLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPN--GKGTISFGDKGSSD 272
Query: 258 DMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPEAAFTA 317
+ S + +Y++ + + V + D + DSGT +L E +TA
Sbjct: 273 QLETPLSGTISPMFYDVSITKFKVGKVTV-------DTEFTATFDSGTAVTWLIEPYYTA 325
Query: 318 FKDAI---VKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLS 374
V + + + P + C+ T P V G Y +
Sbjct: 326 LTTNFHLSVPDRRLSKSVDSP----FEFCYIITSTSDED---KLPSVSFEMKGGAAYDVF 378
Query: 375 PENYMFRHSK-VRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAEL 433
+F S YCL V + +++G + N + +DRE+ +G+ K+NC +
Sbjct: 379 SPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTNYRIVHDRERRILGWKKSNCNDT 438
Query: 434 --WERLQTSVAPPQMAPNTEVR--NSTKALAPSVA 464
+ PP MAP + R N + L P A
Sbjct: 439 NGFTGPTALAKPPSMAPTSSPRTINLSSRLNPLAA 473
>AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3150843-3153380 FORWARD LENGTH=528
Length = 528
Score = 107 bits (266), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/391 (24%), Positives = 166/391 (42%), Gaps = 63/391 (16%)
Query: 94 IGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDP-----------KFQPDLSSTYQPVK 142
IGTP F + +DTGS + ++PC+ C QC ++ P SST +
Sbjct: 106 IGTPSVSFLVALDTGSNLLWIPCN-CVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFL 164
Query: 143 CTLD-----CNCDDDRMQCVYERQYAEMSTSS-GVLGEDVISFG--------NQSELAPQ 188
C+ +C+ + QC Y Y +TSS G+L ED++ N S
Sbjct: 165 CSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKA 224
Query: 189 RAVFGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGA 247
R V GC ++GD A DG+MGLG ++S+ L ++ +SFSLC+ D G
Sbjct: 225 RVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEED--SGR 282
Query: 248 MVLGGISPPTDMVFAYSDPVRSPYYNIDLKEI--HVAGKRLPL--NSNVFDGKHGTVLDS 303
+ G + P +P+ +D + ++ G NS + T +DS
Sbjct: 283 IYFGDMGPSIQQ--------STPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDS 334
Query: 304 GTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDI----CFSGAGTDVSQISKSFP 359
G ++ YLPE + I + + + ++ N+ + C+ S P
Sbjct: 335 GQSFTYLPEEIYRKVALEIDRHINATSK------NFEGVSWEYCYE------SSAEPKVP 382
Query: 360 VVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRE 419
+ + F + + + ++F+ S+ +CL + +G++ +G +R + +DRE
Sbjct: 383 AIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRGYRMVFDRE 442
Query: 420 QTKIGFWKTNCAELWERLQTSVAPPQMAPNT 450
K+G+ + C E + PPQ +P +
Sbjct: 443 NMKLGWSPSKCQE------DKIEPPQASPGS 467
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 105 bits (261), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 160/370 (43%), Gaps = 46/370 (12%)
Query: 85 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCT 144
+G Y TRL +GTP + +++DTGS + ++ C+ C +C DP F P S TY + C+
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198
Query: 145 ------LD-CNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 197
LD C+ R C+Y+ Y + S + G + ++F + GC +
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRV---KGVALGCGHD 255
Query: 198 ETGDLYSQHADGIMGLG----------RGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGA 247
G +G G R + LVD++ S S+ +G V
Sbjct: 256 NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAV---- 311
Query: 248 MVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLP-LNSNVFD----GKHGTVLD 302
I+ T ++ S+P +Y + L I V G R+P + +++F G G ++D
Sbjct: 312 ---SRIARFTPLL---SNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIID 365
Query: 303 SGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISK-SFPVV 361
SGT+ L A+ A +DA ++L + PD + D CF D+S +++ P V
Sbjct: 366 SGTSVTRLIRPAYIAMRDAFRVGAKTLKR--APDFSLFDTCF-----DLSNMNEVKVPTV 418
Query: 362 DMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQT 421
+ F G SL NY+ G +C F +++G I + V YD +
Sbjct: 419 VLHF-RGADVSLPATNYLI-PVDTNGKFCF-AFAGTMGGLSIIGNIQQQGFRVVYDLASS 475
Query: 422 KIGFWKTNCA 431
++GF CA
Sbjct: 476 RVGFAPGGCA 485
>AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:10185229-10186605 REVERSE LENGTH=458
Length = 458
Score = 105 bits (261), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 168/385 (43%), Gaps = 68/385 (17%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQ--DPKFQPDLSSTYQPVKCTL 145
+ +G PP I+DTGS++ ++ C C+ C P F P LSST+ +
Sbjct: 96 FLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTF------V 149
Query: 146 DCNCDD------------DRMQCVYERQYAEMSTSSGVLGEDVISFG--NQSELAPQRAV 191
+C+CDD +CVYE+ Y + S GVL ++ ++F N + + Q
Sbjct: 150 ECSCDDRFCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIA 209
Query: 192 FGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGM---DVGGGAM 248
FGC E G+ H GI+GLG S+ QL K FS C G + + G +
Sbjct: 210 FGC-GYENGEQLESHFTGILGLGAKPTSLAVQLGSK------FSYCIGDLANKNYGYNQL 262
Query: 249 VLGG----ISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDG---KHGTVL 301
VLG + PT + F + + Y ++L+ I V +L + VF + G +L
Sbjct: 263 VLGEDADILGDPTPIEFETENSI----YYMNLEGISVGDTQLNIEPVVFKRRGPRTGVIL 318
Query: 302 DSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPN-----YHD-ICFSGAGTDVSQIS 355
DSGT Y +L + A+ + I L DP + D +C+ G VS+
Sbjct: 319 DSGTLYTWLADIAYRELYNEIKSIL---------DPKLERFWFRDFLCYHGR---VSEEL 366
Query: 356 KSFPVVDMVFGNGQKYSLSPENYMFRHSK--VRGAYCLGV---FQNGKD--PTTLLGGIV 408
FPVV F G + ++ + + S+ +C+ V ++G + T +G +
Sbjct: 367 IGFPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMA 426
Query: 409 VRNTLVTYDREQTKIGFWKTNCAEL 433
+ + YD ++ I + +C +L
Sbjct: 427 QQYYNIGYDLKEKNIYLQRIDCVQL 451
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 105 bits (261), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 158/361 (43%), Gaps = 42/361 (11%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTL-D 146
Y R IGTP Q + +DT + ++PCS C C F P SS+ + ++C
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCS--SSVLFDPSKSSSSRTLQCEAPQ 145
Query: 147 C------NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETG 200
C +C + C + Y ST L +D ++ S++ P FGC N +G
Sbjct: 146 CKQAPNPSCTVSK-SCGFNMTYGG-STIEAYLTQDTLTLA--SDVIPNY-TFGCINKASG 200
Query: 201 DLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVG--GGAMVLGGISPPTD 258
S A G+MGLGRG LS++ Q +N+ +FS C G++ LG + P
Sbjct: 201 T--SLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR 256
Query: 259 MVFA--YSDPVRSPYYNIDLKEIHVAGK--RLPLNSNVFDGK--HGTVLDSGTTYAYLPE 312
+ +P RS Y ++L I V K +P ++ FD GT+ DSGT Y L E
Sbjct: 257 IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVE 316
Query: 313 AAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYS 372
A+ A ++ + +++ N S D C+SG S FP V +F G +
Sbjct: 317 PAYVAVRNEFRRRVKNANATS---LGGFDTCYSG--------SVVFPSVTFMFA-GMNVT 364
Query: 373 LSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVV---RNTLVTYDREQTKIGFWKTN 429
L P+N + HS CL + + ++L I +N V D +++G +
Sbjct: 365 LPPDNLLI-HSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRET 423
Query: 430 C 430
C
Sbjct: 424 C 424
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 104 bits (260), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 171/388 (44%), Gaps = 54/388 (13%)
Query: 83 LLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVK 142
L +G Y + +G+PP+ F+LI+DTGS + ++ C C C + + P S++Y+ +
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNIT 224
Query: 143 CT-LDCN----------CDDDRMQCVYERQYAEMSTSSGVLGEDVISF-----GNQSELA 186
C CN C D C Y Y + S ++G + + G SEL
Sbjct: 225 CNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELY 284
Query: 187 P-QRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY--GGMDV 243
+ +FGC + G + +G G S QL +++ SFS C D
Sbjct: 285 NVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFS--SQL--QSLYGHSFSYCLVDRNSDT 340
Query: 244 GGGAMVLGG-----ISPP----TDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVF- 293
+ ++ G +S P T V + V + YY + +K I VAG+ L + +
Sbjct: 341 NVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYY-VQIKSILVAGEVLNIPEETWN 399
Query: 294 ---DGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDI-----CFS 345
DG GT++DSGTT +Y E A+ K+ I ++ + G P Y D CF+
Sbjct: 400 ISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAK------GKYPVYRDFPILDPCFN 453
Query: 346 GAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLG 405
+G Q+ P + + F +G ++ EN ++ CL + K +++G
Sbjct: 454 VSGIHNVQL----PELGIAFADGAVWNFPTENSFIWLNE--DLVCLAMLGTPKSAFSIIG 507
Query: 406 GIVVRNTLVTYDREQTKIGFWKTNCAEL 433
+N + YD +++++G+ T CA++
Sbjct: 508 NYQQQNFHILYDTKRSRLGYAPTKCADI 535
>AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:16787508-16789318 REVERSE LENGTH=405
Length = 405
Score = 103 bits (257), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 165/369 (44%), Gaps = 34/369 (9%)
Query: 86 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSS-CEQCGRHQDPKFQP--DLSSTYQPVK 142
GYY+ + IG+PP+ F +DTGS +T+V C + C C + +++P ++ P+
Sbjct: 47 GYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNLQYKPKGNIIPCSNPIC 106
Query: 143 CTL----DCNCDDDRMQCVYERQYAEMSTSSGVLGED--VISFGNQSELAPQRAVFGCEN 196
L +C + + QC YE +YA+ +S G L D + N S + P A FGC
Sbjct: 107 TALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALVTDQFPLKLVNGSFMQPPVA-FGCGY 165
Query: 197 VETGDLYSQH----ADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGG 252
++ S H G++GLGRG + ++ QLV + + C GGG + G
Sbjct: 166 DQS--YPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSK--GGGFLFFGD 221
Query: 253 ISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPE 312
P+ V + +Y ++ GK L + D+G++Y Y
Sbjct: 222 NLVPSIGVAWTPLLSQDNHYTTGPADLLFNGKPTGLKG------LKLIFDTGSSYTYFNS 275
Query: 313 AAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAG--TDVSQISKSFPVVDMVFGNGQK 370
A+ + I +L+ + IC+ GA V ++ F + + F NG++
Sbjct: 276 KAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFTNGRR 335
Query: 371 YS---LSPENYMFRHSKVRGAYCLGVFQN---GKDPTTLLGGIVVRNTLVTYDREQTKIG 424
+ L+PE Y+ G CLG+ G + ++G I ++ ++ YD E+ ++G
Sbjct: 336 NTQLYLAPELYLIVSK--TGNVCLGLLNGSEVGLQNSNVIGDISMQGLMMIYDNEKQQLG 393
Query: 425 FWKTNCAEL 433
+ ++C +L
Sbjct: 394 WVSSDCNKL 402
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 102 bits (255), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 164/382 (42%), Gaps = 40/382 (10%)
Query: 83 LLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPV- 141
L +G Y + +GTPP+ F+LI+DTGS + ++ C C C + P S++++ +
Sbjct: 155 LGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNIT 214
Query: 142 ----KCTL------DCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISF------GNQSEL 185
+C+L C+ D C Y Y + S ++G + + G SE
Sbjct: 215 CNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEY 274
Query: 186 APQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQ----------LVDKNVVSDSFS 235
+FGC + G +G G S Q LVD+N ++ S
Sbjct: 275 KVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSS 334
Query: 236 LCYGGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVF-- 293
G D ++ T V + V + YY I +K I V GK L + +
Sbjct: 335 KLIFGED---KDLLNHTNLNFTSFVNGKENSVETFYY-IQIKSILVGGKALDIPEETWNI 390
Query: 294 --DGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDV 351
DG GT++DSGTT +Y E A+ K+ ++++ I P D CF+ +G +
Sbjct: 391 SSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFP-VLDPCFNVSGIEE 449
Query: 352 SQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRN 411
+ I P + + F +G ++ EN S+ CL + K +++G +N
Sbjct: 450 NNI--HLPELGIAFVDGTVWNFPAENSFIWLSE--DLVCLAILGTPKSTFSIIGNYQQQN 505
Query: 412 TLVTYDREQTKIGFWKTNCAEL 433
+ YD +++++GF T CA++
Sbjct: 506 FHILYDTKRSRLGFTPTKCADI 527
>AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6978746-6980158 REVERSE LENGTH=470
Length = 470
Score = 102 bits (253), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 162/362 (44%), Gaps = 35/362 (9%)
Query: 85 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCT 144
+G Y R+ +G+PP+ +++D+GS + +V C C+ C + DP F P S +Y V C
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187
Query: 145 LDCNCDDDRMQ--------CVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCEN 196
C DR++ C YE Y + S + G L + ++F ++ + GC +
Sbjct: 188 SSV-C--DRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF---AKTVVRNVAMGCGH 241
Query: 197 VETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPP 256
G ++G+G G +S + QL + + + L G D G++V G + P
Sbjct: 242 RNRGMFIGAAG--LLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTD-STGSLVFGREALP 298
Query: 257 TDMVFAYSDPVRSP----YYNIDLKEIHVAGKRLPLNSNVFD----GKHGTVLDSGTTYA 308
++ VR+P +Y + LK + V G R+PL VFD G G V+D+GT
Sbjct: 299 VGA--SWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVT 356
Query: 309 YLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNG 368
LP AA+ AF+D + +L + SG + D C+ +G +S P V F G
Sbjct: 357 RLPTAAYVAFRDGFKSQTANLPRASG--VSIFDTCYDLSGF----VSVRVPTVSFYFTEG 410
Query: 369 QKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKT 428
+L N++ G YC F +++G I V++D +GF
Sbjct: 411 PVLTLPARNFLMPVDD-SGTYCF-AFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPN 468
Query: 429 NC 430
C
Sbjct: 469 VC 470
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 101 bits (252), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 170/389 (43%), Gaps = 65/389 (16%)
Query: 95 GTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDP----KFQPDLSSTYQPVKCT------ 144
GTPPQ ++++DTGS ++++ +C R +P F P SS+Y P+ C+
Sbjct: 80 GTPPQNISMVIDTGSELSWL------RCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRT 133
Query: 145 ------LDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVE 198
+ +CD D++ C YA+ S+S G L ++ FGN + +FGC
Sbjct: 134 RTRDFLIPASCDSDKL-CHATLSYADASSSEGNLAAEIFHFGNSTN--DSNLIFGCMGSV 190
Query: 199 TGDLYSQ--HADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPP 256
+G + G++G+ RG LS + Q+ FS C G D G ++LG +
Sbjct: 191 SGSDPEEDTKTTGLLGMNRGSLSFISQMGFPK-----FSYCISGTDDFPGFLLLGDSNFT 245
Query: 257 TDMVFAYSDPVRS----PY-----YNIDLKEIHVAGKRLPLNSNVFDGKHG----TVLDS 303
Y+ +R PY Y + L I V GK LP+ +V H T++DS
Sbjct: 246 WLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDS 305
Query: 304 GTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNY-----HDICFSGAGTDV-SQISKS 357
GT + +L +TA + + + + DP++ D+C+ + + S I
Sbjct: 306 GTQFTFLLGPVYTALRSHFLNRTNGILTVY-EDPDFVFQGTMDLCYRISPVRIRSGILHR 364
Query: 358 FPVVDMVFGNGQKYSLSPENYMFR--HSKV--RGAYCLGVFQNGKDPTTLLGGIVV---- 409
P V +VF G + ++S + ++R H V YC F G + V+
Sbjct: 365 LPTVSLVF-EGAEIAVSGQPLLYRVPHLTVGNDSVYC---FTFGNSDLMGMEAYVIGHHH 420
Query: 410 -RNTLVTYDREQTKIGFWKTNCAELWERL 437
+N + +D ++++IG C +RL
Sbjct: 421 QQNMWIEFDLQRSRIGLAPVECDVSGQRL 449
>AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19053480-19056152 REVERSE LENGTH=529
Length = 529
Score = 99.8 bits (247), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 169/383 (44%), Gaps = 57/383 (14%)
Query: 83 LLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPK----------FQP 132
LL + + +GTP F + +DTGS + ++PC+ C R + P
Sbjct: 97 LLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSP 156
Query: 133 DLSSTYQPVKCTLDCNCDDDR-----------MQCVYERQYAEMST-SSGVLGEDVISFG 180
+ SST ++C+ DDR C Y+ QY T ++G L EDV+
Sbjct: 157 NTSSTSSSIRCS------DDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHLV 210
Query: 181 NQSE-LAPQRA--VFGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSL 236
+ E L P +A GC +TG L S A +G++GLG D S+ L + ++SFS+
Sbjct: 211 TEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSM 270
Query: 237 CYGGMDVGGGAMVLG--GISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFD 294
C+G + G + G G + + ++P SP Y + + E+ V G + +
Sbjct: 271 CFGNIIDVVGRISFGDKGYTDQMETPLLPTEP--SPTYAVSVTEVSVGGDAVGV------ 322
Query: 295 GKHGTVLDSGTTYAYLPEAAF----TAFKDAIVKELQSLNQISGPDPNY-HDICFSGAGT 349
+ + D+GT++ +L E + AF D + + + + DP + C+ +
Sbjct: 323 -QLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPI------DPELPFEFCYDLSPN 375
Query: 350 DVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVV 409
+ + FP V M F G + L ++ + YCLG+ ++ ++G +
Sbjct: 376 KTTIL---FPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFM 432
Query: 410 RNTLVTYDREQTKIGFWKTNCAE 432
+ +DRE+ +G+ +++C E
Sbjct: 433 SGYRIVFDRERMILGWKRSDCFE 455
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 99.4 bits (246), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 169/397 (42%), Gaps = 47/397 (11%)
Query: 60 RRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSC 119
R H +E P ++ L + +G Y + IGTPP I DTGS + + C+ C
Sbjct: 65 NRVFHFTEKDNTPQPQIDLTSN---SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC 121
Query: 120 EQCGRHQDPKFQPDLSSTYQPVKCTLD--------CNCDDDRMQCVYERQYAEMSTSSGV 171
+ C DP F P SSTY+ V C+ +C + C Y Y + S + G
Sbjct: 122 DDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGN 181
Query: 172 LGEDVISFGNQSELAP---QRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKN 228
+ D ++ G+ S+ P + + GC + G +++ GI+GLG G +S++ QL D
Sbjct: 182 IAVDTLTLGS-SDTRPMQLKNIIIGCGHNNAG-TFNKKGSGIVGLGGGPVSLIKQLGDS- 238
Query: 229 VVSDSFSLCY----------GGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKE 278
+ FS C ++ G A+V G T ++ S + +Y + LK
Sbjct: 239 -IDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKAS---QETFYYLTLKS 294
Query: 279 IHVAGKRLPL-NSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDP 337
I V K++ S+ + ++DSGTT LP ++ +DA+ S++ DP
Sbjct: 295 ISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVA---SSIDAEKKQDP 351
Query: 338 NYH-DICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQN 396
+C+S G PV+ M F +G L N + S+ +
Sbjct: 352 QSGLSLCYSATG------DLKVPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAF----R 400
Query: 397 GKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCAEL 433
G ++ G + N LV YD + F T+CA++
Sbjct: 401 GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 437
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 99.0 bits (245), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 110/447 (24%), Positives = 189/447 (42%), Gaps = 87/447 (19%)
Query: 31 QNRHHGSLPAMVLPLYLSASNSSTSELDPRRQLHGSESKRHPNARMRLHDDLLL--NGYY 88
+N S+ ++ PL ++S+ L L ++++ P + D L N
Sbjct: 14 KNFLRISVLLLIFPLTFCKTSSTNQTL-----LFSLKTQKLPQSS---SDKLSFRHNVTL 65
Query: 89 TTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCT---- 144
T L +G PPQ ++++DTGS ++++ C G F P SSTY PV C+
Sbjct: 66 TVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS----VFNPVSSSTYSPVPCSSPIC 121
Query: 145 --------LDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCEN 196
+ +CD C YA+ ++ G L + G+ + +FGC +
Sbjct: 122 RTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPG---TLFGCMD 178
Query: 197 --VETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGIS 254
+ + + G+MG+ RG LS ++QL FS C G D G ++LG
Sbjct: 179 SGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSK-----FSYCISGSD-SSGFLLLG--- 229
Query: 255 PPTDMVFAYSDPVRS----------PY-----YNIDLKEIHVAGKRLPLNSNVF----DG 295
D +++ P++ PY Y + L+ I V K L L +VF G
Sbjct: 230 ---DASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286
Query: 296 KHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQS-LNQISGPDPNYH---DICFSGAGTDV 351
T++DSGT + +L +TA K+ + + +S L + PD + D+C+ T
Sbjct: 287 AGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTR 346
Query: 352 SQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGA--------YCLGVFQNGKDPTTL 403
S P+V ++F G + S+S + ++R V GA YC F G
Sbjct: 347 PNFS-GLPMVSLMF-RGAEMSVSGQKLLYR---VNGAGSEGKEEVYC---FTFGNSDLLG 398
Query: 404 LGGIVV-----RNTLVTYDREQTKIGF 425
+ V+ +N + +D ++++GF
Sbjct: 399 IEAFVIGHHHQQNVWMEFDLAKSRVGF 425
>AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:22880074-22881525 REVERSE LENGTH=483
Length = 483
Score = 97.8 bits (242), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 154/369 (41%), Gaps = 40/369 (10%)
Query: 85 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKC- 143
+G Y RL +GTP +++DTGS V ++ CS C+ C D F P S T+ V C
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCG 191
Query: 144 TLDCNCDDDRMQCV--------YERQYAEMSTSSGVLGEDVISF-GNQSELAPQRAVFGC 194
+ C DD +CV Y+ Y + S + G + ++F G + + P GC
Sbjct: 192 SRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVP----LGC 247
Query: 195 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY------GGMDVGGGAM 248
+ G + GRG LS Q KN + FS C G +
Sbjct: 248 GHDNEGLFVGAAGLLGL--GRGGLSFPSQ--TKNRYNGKFSYCLVDRTSSGSSSKPPSTI 303
Query: 249 VLGGISPPTDMVFA--YSDPVRSPYYNIDLKEIHVAGKRLP-LNSNVFD----GKHGTVL 301
V G + P VF ++P +Y + L I V G R+P ++ + F G G ++
Sbjct: 304 VFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVII 363
Query: 302 DSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVV 361
DSGT+ L + A+ A +DA L + P + D CF +G ++ P V
Sbjct: 364 DSGTSVTRLTQPAYVALRDAF--RLGATKLKRAPSYSLFDTCFDLSGMTTVKV----PTV 417
Query: 362 DMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQT 421
FG G+ SL NY+ G +C F +++G I + V YD +
Sbjct: 418 VFHFGGGE-VSLPASNYLI-PVNTEGRFCFA-FAGTMGSLSIIGNIQQQGFRVAYDLVGS 474
Query: 422 KIGFWKTNC 430
++GF C
Sbjct: 475 RVGFLSRAC 483
>AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14685602-14686885 FORWARD LENGTH=427
Length = 427
Score = 97.4 bits (241), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 155/368 (42%), Gaps = 55/368 (14%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTL-- 145
+ + IG+PP L +DT S + ++ C C C P F P S T++ C
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144
Query: 146 ----DCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGN----QSELAPQRAVFGCENV 197
+ + C Y +Y + + S G+L +++ F S A VFGC +
Sbjct: 145 YSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHD 204
Query: 198 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD---------VGG--G 246
G+ GI+GLG G+ S++ + K FS C+G +D V G G
Sbjct: 205 NYGE--PLVGTGILGLGYGEFSLVHRFGKK------FSYCFGSLDDPSYPHNVLVLGDDG 256
Query: 247 AMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKH-----GTVL 301
A +LG +P + + +Y + ++ I V G LP++ VF+ H GT++
Sbjct: 257 ANILGDTTPLE---------IHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTII 307
Query: 302 DSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDI----CFSGAGTDVSQISKS 357
D+G + L E A+ K+ I + + + D + D+ C++G + +
Sbjct: 308 DTGNSLTSLVEEAYKPLKNRIEDIFE--GRFTAADVSQDDMIKMECYNG-NFERDLVESG 364
Query: 358 FPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 417
FP+V F G + SL ++ + S +CL V + +G ++ + YD
Sbjct: 365 FPIVTFHFSEGAELSLDVKSLFMKLSP--NVFCLAVTPGNLNS---IGATAQQSYNIGYD 419
Query: 418 REQTKIGF 425
E ++ F
Sbjct: 420 LEAMEVSF 427
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 95.9 bits (237), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 160/367 (43%), Gaps = 42/367 (11%)
Query: 86 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKC-T 144
G Y R +GTPPQ+ +++DT + ++PCS C C + F + SSTY V C T
Sbjct: 102 GNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCS-NASTSFNTNSSSTYSTVSCST 160
Query: 145 LDC------NCDDDRMQ---CVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCE 195
C C Q C + + Y S+ S L +D ++ ++ P + FGC
Sbjct: 161 AQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLA--PDVIPNFS-FGCI 217
Query: 196 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD--VGGGAMVLGGI 253
N +G+ S G+MGLGRG +S++ Q ++ S FS C G++ LG +
Sbjct: 218 NSASGN--SLPPQGLMGLGRGPMSLVSQ--TTSLYSGVFSYCLPSFRSFYFSGSLKLGLL 273
Query: 254 SPPTDMVFA--YSDPVRSPYYNIDLKEIHVAGKRLPLNSN--VFDGKH--GTVLDSGTTY 307
P + + +P R Y ++L + V ++P++ FD GT++DSGT
Sbjct: 274 GQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVI 333
Query: 308 AYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVS-QISKSFPVVDMVFG 366
+ + A +D K+ +N S D CFS +V+ +I+ +D+
Sbjct: 334 TRFAQPVYEAIRDEFRKQ---VNVSSFSTLGAFDTCFSADNENVAPKITLHMTSLDL--- 387
Query: 367 NGQKYSLSPENYMFRHSKVRGAYCL---GVFQNGKDPTTLLGGIVVRNTLVTYDREQTKI 423
L EN + HS CL G+ QN ++ + +N + +D ++I
Sbjct: 388 -----KLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 441
Query: 424 GFWKTNC 430
G C
Sbjct: 442 GIAPEPC 448
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 95.9 bits (237), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 110/406 (27%), Positives = 168/406 (41%), Gaps = 53/406 (13%)
Query: 61 RQLHGSESKRHPNARMRLHDDLL-LNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSC 119
R + S H ++ L L+ +G + + IGTPP I DTGS +T+V C C
Sbjct: 57 RSVSRSRRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC 116
Query: 120 EQCGRHQDPKFQPDLSSTYQPVKC-TLDCN--------CDDDRMQCVYERQYAEMSTSSG 170
+QC + P F SSTY+ C + +C CD+ C Y Y + S S G
Sbjct: 117 QQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKG 176
Query: 171 VLGEDVISF--GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKN 228
+ + +S + S ++ VFGC G + + GI+GLG G LS++ QL +
Sbjct: 177 DVATETVSIDSASGSPVSFPGTVFGC-GYNNGGTFDETGSGIIGLGGGHLSLISQL--GS 233
Query: 229 VVSDSFSLCYGGMDV---GGGAMVLGGISPPTDMVFAYSDPVRSP--------YYNIDLK 277
+S FS C G + LG S P+ + S V +P YY + L+
Sbjct: 234 SISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLS-KDSGVVSTPLVDKEPLTYYYLTLE 292
Query: 278 EIHVAGKRLPLNSNVF----DG-----KHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQS 328
I V K++P + + DG ++DSGTT L F F A+ + +
Sbjct: 293 AISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTG 352
Query: 329 LNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGA 388
++S P CF ++ P + + F G LSP N + S+
Sbjct: 353 AKRVSDPQ-GLLSHCFKSGSAEI-----GLPEITVHF-TGADVRLSPINAFVKLSE--DM 403
Query: 389 YCLGVFQNGKDPTT---LLGGIVVRNTLVTYDREQTKIGFWKTNCA 431
CL + PTT + G + LV YD E + F +C+
Sbjct: 404 VCLSMV-----PTTEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444
>AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16993339-16995721 FORWARD LENGTH=524
Length = 524
Score = 92.4 bits (228), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 168/377 (44%), Gaps = 42/377 (11%)
Query: 87 YYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCG---------RHQDPKFQPDLSST 137
+YTT + +GTP F + +DTGS + +VPC C +C + + P +S+T
Sbjct: 107 HYTT-VKLGTPGMRFMVALDTGSDLFWVPCD-CGKCAPTEGATYASEFELSIYNPKVSTT 164
Query: 138 YQPVKCTLDC-----NCDDDRMQCVYERQYAEMSTS-SGVLGEDVISFGNQSELAPQRA- 190
+ V C C C Y Y TS SG+L EDV+ + + P+R
Sbjct: 165 NKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDK-NPERVE 223
Query: 191 ---VFGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGG 246
FGC V++G A +G+ GLG +S+ L + +V+DSFS+C+G VG
Sbjct: 224 AYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRI 283
Query: 247 AMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTT 306
+ G S + F + P YNI + + V + + D + + D+GT+
Sbjct: 284 SFGDKGSSDQEETPFNLNPS--HPNYNITVTRVRVG-------TTLIDDEFTALFDTGTS 334
Query: 307 YAYLPEAAFTAFKDAIVKELQSLNQISGPDPNY-HDICFSGAGTDVSQISKSFPVVDMVF 365
+ YL + +T ++ + Q ++ PD + C+ + + + P + +
Sbjct: 335 FTYLVDPMYTTVSESFHSQAQ--DKRHSPDSRIPFEYCYDMSNDANASL---IPSLSLTM 389
Query: 366 GNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGF 425
++++ + + ++ YCL + ++ + ++G + V +DRE+ + +
Sbjct: 390 KGNSHFTIN-DPIIVISTEGELVYCLAIVKSSE--LNIIGQNYMTGYRVVFDREKLVLAW 446
Query: 426 WKTNCAELWERLQTSVA 442
K +C ++ E T+VA
Sbjct: 447 KKFDCYDI-EETNTTVA 462
>AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:11259872-11261209 REVERSE LENGTH=445
Length = 445
Score = 90.9 bits (224), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 106/394 (26%), Positives = 161/394 (40%), Gaps = 49/394 (12%)
Query: 70 RHPNARMRLHDDLLLNG-YYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDP 128
R + L L+ NG Y + IGTPP I DTGS +T+V C C+QC + P
Sbjct: 66 RRFTTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSP 125
Query: 129 KFQPDLSSTYQPVKC-TLDCN--------CDDDRMQCVYERQYAEMSTSSGVLGEDVISF 179
F SSTY+ C + C CD+ + C Y Y + S + G + + IS
Sbjct: 126 LFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISI 185
Query: 180 GNQSELAPQ--RAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLC 237
+ S + VFGC G + + GI+GLG G LS++ QL S+ L
Sbjct: 186 DSSSGSSVSFPGTVFGC-GYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLS 244
Query: 238 YGGMDVGGGAMV-LGGISPPTD---------MVFAYSDPVRSPYYNIDLKEIHVAGKRLP 287
+ G +++ LG S P++ DP YY + L+ + V +LP
Sbjct: 245 HTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDP--ETYYFLTLEAVTVGKTKLP 302
Query: 288 -------LNSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYH 340
LN ++DSGTT L + F A+ + + ++S P
Sbjct: 303 YTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQ-GLL 361
Query: 341 DICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDP 400
CF ++ P + M F N LSP N + ++ CL + P
Sbjct: 362 THCFKSGDKEI-----GLPAITMHFTNAD-VKLSPINAFVKLNE--DTVCLSMI-----P 408
Query: 401 TT---LLGGIVVRNTLVTYDREQTKIGFWKTNCA 431
TT + G +V + LV YD E + F + +C+
Sbjct: 409 TTEVAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24647221-24648513 FORWARD LENGTH=430
Length = 430
Score = 90.5 bits (223), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 162/379 (42%), Gaps = 62/379 (16%)
Query: 92 LWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDP-----KFQPDLSSTYQPVKC--- 143
L IGTPPQ +++DTGS ++++ QC R + P F P LSS++ + C
Sbjct: 76 LPIGTPPQAQQMVLDTGSQLSWI------QCHRKKLPPKPKTSFDPSLSSSFSTLPCSHP 129
Query: 144 ---------TLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGC 194
TL +CD +R+ C Y YA+ + + G L ++ I+F N +E+ P + GC
Sbjct: 130 LCKPRIPDFTLPTSCDSNRL-CHYSYFYADGTFAEGNLVKEKITFSN-TEITPP-LILGC 186
Query: 195 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGI- 253
+ D GI+G+ RG LS + Q FS C G G
Sbjct: 187 ATESSDD------RGILGMNRGRLSFVSQ-----AKISKFSYCIPPKSNRPGFTPTGSFY 235
Query: 254 --SPPTDMVFAYSDPVRSP-----------YYNIDLKEIHVAGKRLPLNSNVF----DGK 296
P F Y + P Y + + I K+L ++ +VF G
Sbjct: 236 LGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGS 295
Query: 297 HGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISK 356
T++DSG+ + +L +AA+ + I+ + + D+CF G +V+ I +
Sbjct: 296 GQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG---NVAMIPR 352
Query: 357 SFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGK--DPTTLLGGIVVRNTLV 414
+ VF G + + E + G +C+G+ ++ + ++G + +N V
Sbjct: 353 LIGDLVFVFTRGVEILVPKERVLVNVGG--GIHCVGIGRSSMLGAASNIIGNVHQQNLWV 410
Query: 415 TYDREQTKIGFWKTNCAEL 433
+D ++GF K +C+ +
Sbjct: 411 EFDVTNRRVGFAKADCSRV 429
>AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14912862-14914190 FORWARD LENGTH=442
Length = 442
Score = 87.8 bits (216), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 159/379 (41%), Gaps = 58/379 (15%)
Query: 92 LWIGTPPQMFALIVDTGSTVTYVPC--SSCEQCGRHQDPKFQPDLSSTYQPVKC------ 143
L IGTP Q L++DTGS ++++ C ++ F P LSS++ + C
Sbjct: 84 LPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCK 143
Query: 144 ------TLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 197
TL +CD +R+ C Y YA+ + + G L ++ +F N P + GC
Sbjct: 144 PRIPDFTLPTSCDSNRL-CHYSYFYADGTFAEGNLVKEKFTFSNSQTTPP--LILGCAKE 200
Query: 198 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD-----VGGGAMVLGG 252
T + GI+G+ G LS + Q FS C G+ LG
Sbjct: 201 STDE------KGILGMNLGRLSFISQ-----AKISKFSYCIPTRSNRPGLASTGSFYLG- 248
Query: 253 ISPPTDMVFAYSDPVRSPY-----------YNIDLKEIHVAGKRLPLNSNVF----DGKH 297
P F Y + P Y + L+ I + KRL + +VF G
Sbjct: 249 -DNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSG 307
Query: 298 GTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGA-GTDVSQISK 356
T++DSG+ + +L + A+ K+ IV+ + S + + D+CF G ++ ++
Sbjct: 308 QTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRL-- 365
Query: 357 SFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGK--DPTTLLGGIVVRNTLV 414
+ D+VF G+ + E + G +C+G+ ++ + ++G + +N V
Sbjct: 366 ---IGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWV 422
Query: 415 TYDREQTKIGFWKTNCAEL 433
+D ++GF K C L
Sbjct: 423 EFDVTNRRVGFSKAECRLL 441
>AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3400671-3402165 REVERSE LENGTH=464
Length = 464
Score = 87.4 bits (215), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 137/357 (38%), Gaps = 28/357 (7%)
Query: 83 LLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSC-EQCGRHQDPKFQPDLSSTYQPV 141
L +G Y + IGTP +L+ DTGS +T+ C C C ++PKF P SSTYQ V
Sbjct: 127 LGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNV 186
Query: 142 KCTLDCNCDD----DRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 197
C+ C+D CVY Y + S + G L ++ + N L + FGC
Sbjct: 187 SCSSP-MCEDAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVL--EDVYFGCGEN 243
Query: 198 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGG-GAMVLGGISPP 256
G +G G+ L N+ FS C G + G
Sbjct: 244 NQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNI----FSYCLPSFTSNSTGHLTFGSAGIS 299
Query: 257 TDMVFAYSDPVRSPY-YNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPEAAF 315
+ F S + Y ID+ I V K L + N F G ++DSGT + LP +
Sbjct: 300 ESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFS-TEGAIIDSGTVFTRLPTKVY 358
Query: 316 TAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSL-- 373
+ +++ S SG D C+ G D + ++P + F L
Sbjct: 359 AELRSVFKEKMSSYKSTSG--YGLFDTCYDFTGLD----TVTYPTIAFSFAGSTVVELDG 412
Query: 374 SPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNC 430
S + + S+V CL F D + G + V YD ++GF C
Sbjct: 413 SGISLPIKISQV----CLA-FAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 87.0 bits (214), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 160/377 (42%), Gaps = 68/377 (18%)
Query: 83 LLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVK 142
L +G Y + +G+PP+ F+LI+DTGS + ++ C C
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC----------------------- 201
Query: 143 CTLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISF-----GNQSELAP-QRAVFGCEN 196
DC +D C Y Y + S ++G + + G SEL + +FGC +
Sbjct: 202 --YDCFQQNDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH 259
Query: 197 VETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY--GGMDVGGGAMVLGG-- 252
G + +G G S QL +++ SFS C D + ++ G
Sbjct: 260 WNRGLFHGAAGLLGLGRGPLSFS--SQL--QSLYGHSFSYCLVDRNSDTNVSSKLIFGED 315
Query: 253 ---ISPP----TDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVF----DGKHGTVL 301
+S P T V + V + YY + +K I VAG+ L + + DG GT++
Sbjct: 316 KDLLSHPNLNFTSFVAGKENLVDTFYY-VQIKSILVAGEVLNIPEETWNISSDGAGGTII 374
Query: 302 DSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDI-----CFSGAGTDVSQISK 356
DSGTT +Y E A+ K+ I ++ + G P Y D CF+ +G Q+
Sbjct: 375 DSGTTLSYFAEPAYEFIKNKIAEKAK------GKYPVYRDFPILDPCFNVSGIHNVQL-- 426
Query: 357 SFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTY 416
P + + F +G ++ EN ++ CL + K +++G +N + Y
Sbjct: 427 --PELGIAFADGAVWNFPTENSFIWLNE--DLVCLAMLGTPKSAFSIIGNYQQQNFHILY 482
Query: 417 DREQTKIGFWKTNCAEL 433
D +++++G+ T CA++
Sbjct: 483 DTKRSRLGYAPTKCADI 499
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 86.3 bits (212), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 102/406 (25%), Positives = 160/406 (39%), Gaps = 85/406 (20%)
Query: 86 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSS---CEQCGRHQ-----DPKFQPDLSST 137
G Y+ L GTP Q + DTGS++ ++PC+S C C P+F P SS+
Sbjct: 88 GGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSS 147
Query: 138 YQPVKC-----------TLDCN-CDDDRMQCV-----YERQYAEMSTSSGVLGEDVISFG 180
+ + C + C CD + C Y QY + +++GVL + + F
Sbjct: 148 SKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYG-LGSTAGVLITEKLDF- 205
Query: 181 NQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLV--------------D 226
+L V GC + T + GI G GRG +S+ Q+ D
Sbjct: 206 --PDLTVPDFVVGCSIIST-----RQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFDD 258
Query: 227 KNVVSDSFSLCYGGMDVG----GGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVA 282
NV +D +D G G+ G P S+ YY ++L+ I+V
Sbjct: 259 TNVTTDL------DLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVG 312
Query: 283 GKRLPLNSNVF----DGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKEL------QSLNQI 332
K + + +G G+++DSG+T+ ++ F + ++ + L +
Sbjct: 313 RKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKE 372
Query: 333 SGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLG 392
+G P ++ SG G DV + P + F G K L NY F CL
Sbjct: 373 TGLGPCFN---ISGKG-DV-----TVPELIFEFKGGAKLELPLSNY-FTFVGNTDTVCLT 422
Query: 393 VFQN-------GKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNCA 431
V + G P +LG +N LV YD E + GF K C+
Sbjct: 423 VVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19057013-19059788 REVERSE LENGTH=530
Length = 530
Score = 82.0 bits (201), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 164/374 (43%), Gaps = 56/374 (14%)
Query: 94 IGTPPQMFALIVDTGSTVTYVPCSSCEQCGRH-QDPKFQ---------PDLSSTYQPVKC 143
+GTP F + +DTGS + ++PC+ C +D +F P+ S+T ++C
Sbjct: 109 LGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRC 168
Query: 144 T-LDC----NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSE-LAPQRA--VFGCE 195
+ C C C Y+ + + ++G L +DV+ + E L P A GC
Sbjct: 169 SDKRCFGSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCG 228
Query: 196 NVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGIS 254
+TG + A +G++GL + S+ L N+ ++SFS+C+G + V+G IS
Sbjct: 229 QNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRI-----ISVVGRIS 283
Query: 255 PPTDMVFAYSDPVRSPY--------YNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTT 306
Y+D +P Y +++ + V G +P++ +F + D+G++
Sbjct: 284 FGDK---GYTDQEETPLVSLETSTAYGVNVTGVSVGG--VPVDVPLF-----ALFDTGSS 333
Query: 307 YAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICF--------SGAGTDVSQISKSF 358
+ L E+A+ F A ++ + PD + + C+ S A Q
Sbjct: 334 FTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPF-EFCYDLREEHLNSDARPRHMQSKCYN 392
Query: 359 PVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 418
P D Q S +Y +K+ YCLG+ ++ ++G ++ + +DR
Sbjct: 393 PCRDDFRWRIQNDSQESVSYSNEGTKM---YCLGILKSIN--LNIIGQNLMSGHRIVFDR 447
Query: 419 EQTKIGFWKTNCAE 432
E+ +G+ ++NC E
Sbjct: 448 ERMILGWKQSNCFE 461
>AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:18241003-18242478 FORWARD LENGTH=491
Length = 491
Score = 78.6 bits (192), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 112/456 (24%), Positives = 174/456 (38%), Gaps = 85/456 (18%)
Query: 45 LYLSASNSSTSELDPRRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALI 104
L L+ + SS S P+ Q K + + + + Y L IGTPPQ +
Sbjct: 40 LVLTLTKSSVSLPTPKSQTQERIKKPLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVY 99
Query: 105 VDTGSTVTYVPCSS----CEQCG--RHQDPK----FQPDLSSTYQPVKCT----LDCNCD 150
+DTGS +T+VPC + C +C ++ D K F P SST C ++ +
Sbjct: 100 LDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSS 159
Query: 151 DD----------------RMQCV-----YERQYAEMSTSSGVLGEDVISFGNQSELAPQR 189
D+ + CV + Y E SG+L D++ ++ P R
Sbjct: 160 DNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILK--ARTRDVP-R 216
Query: 190 AVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY----------- 238
FGC + GI G GRG LS+ QL + FS C+
Sbjct: 217 FSFGCVTSTY-----REPIGIAGFGRGLLSLPSQL---GFLEKGFSHCFLPFKFVNNPNI 268
Query: 239 -GGMDVGGGAMVLGGISPPTDMVFAYSDPVRSPYYNIDLKEI----HVAGKRLPLNSNVF 293
+ +G A+ + ++ + P+ Y I L+ I ++ ++PL F
Sbjct: 269 SSPLILGASALSIN-LTDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQF 327
Query: 294 D--GKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICF------- 344
D G G ++DSGTTY +LPE ++ + + D+C+
Sbjct: 328 DSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNN 387
Query: 345 --SGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRG---AYCLGVFQNGKD 399
+ DV I FP + F N L N + S CL +FQN +D
Sbjct: 388 NLTSLENDVMMI---FPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCL-LFQNMED 443
Query: 400 ----PTTLLGGIVVRNTLVTYDREQTKIGFWKTNCA 431
P + G +N V YD E+ +IGF +C
Sbjct: 444 GDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 479
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 77.8 bits (190), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 144/355 (40%), Gaps = 43/355 (12%)
Query: 101 FALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDD--------- 151
+LIVDTGS +T+V C C C Q P + P +SS+Y+ V C C D
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCN-SSTCQDLVAATSNSG 204
Query: 152 --------DRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETGDLY 203
+ C Y Y + S + G L + I G+ + VFGC G
Sbjct: 205 PCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT---KLENFVFGCGRNNKGLFG 261
Query: 204 SQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGG-GAMVLGGISP--PTDMV 260
MGLGR +S++ Q + + FS C ++ G G++ G S
Sbjct: 262 GSSGL--MGLGRSSVSLVSQTL--KTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTS 317
Query: 261 FAYSDPVRSP----YYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPEAAFT 316
+Y+ V++P +Y ++L + G + L S+ F G ++DSGT LP + +
Sbjct: 318 VSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFG--RGILIDSGTVITRLPPSIYK 373
Query: 317 AFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPE 376
A K +K+ + P + D CF+ + S P++ M+F + +
Sbjct: 374 AVKIEFLKQFSGFP--TAPGYSILDTCFNLTSYE----DISIPIIKMIFQGNAELEVDVT 427
Query: 377 NYMFRHSKVRGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDREQTKIGFWKTNC 430
+ CL + + ++ ++G +N V YD Q ++G NC
Sbjct: 428 GVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 77.4 bits (189), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 37/368 (10%)
Query: 81 DLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQP 140
+L + Y + IGTP Q L +DT S V ++PCS C C + F P S++++
Sbjct: 108 QMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKN 165
Query: 141 VKCTL-DC----NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRA-VFGC 194
V C+ C N C + Y S ++ L +D I P +A FGC
Sbjct: 166 VSCSAPQCKQVPNPTCGARACSFNLTYGSSSIAAN-LSQDTIRLAAD----PIKAFTFGC 220
Query: 195 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD--VGGGAMVLGG 252
N G G++GLGRG LS+M Q +++ +FS C G++ LG
Sbjct: 221 VNKVAGGGTIPPPQGLLGLGRGPLSLMSQ--AQSIYKSTFSYCLPSFRSLTFSGSLRLGP 278
Query: 253 ISPPTDMVFA--YSDPVRSPYYNIDLKEIHVAGK--RLPLNSNVFDGK--HGTVLDSGTT 306
S P + + +P RS Y ++L I V K LP + F+ GT+ DSGT
Sbjct: 279 TSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTV 338
Query: 307 YAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFG 366
Y L + + A ++ K ++ + + D C+SG V I+ F V+M
Sbjct: 339 YTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGF-DTCYSGQ-VKVPTITFMFKGVNMTM- 395
Query: 367 NGQKYSLSPENYMFRHSKVRGAYCLGVF---QNGKDPTTLLGGIVVRNTLVTYDREQTKI 423
P + + HS CL + +N ++ + +N V D ++
Sbjct: 396 --------PADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRL 447
Query: 424 GFWKTNCA 431
G + C+
Sbjct: 448 GLARERCS 455
>AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:790110-791414 FORWARD LENGTH=434
Length = 434
Score = 68.9 bits (167), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 90/390 (23%), Positives = 151/390 (38%), Gaps = 50/390 (12%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDC 147
YTT + TP +++ D G +V C ++ P+ + S + C C
Sbjct: 45 YTTVINQRTPLVPASVVFDLGGREFWVDCDQGYVSTTYRSPRCNSAVCSRAGSIACG-TC 103
Query: 148 ------NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAV------FGCE 195
C ++ + +TS G DV+S + + P R V F C
Sbjct: 104 FSPPRPGCSNNTCGAFPDNSITGWATS-GEFALDVVSIQSTNGSNPGRFVKIPNLIFSCG 162
Query: 196 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY----GGMDVGGGAMV-L 250
+ ++ A G+ G+GR ++ + Q + F++C G G G V L
Sbjct: 163 STSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNRKFAVCLTSGRGVAFFGNGPYVFL 222
Query: 251 GGI-------SP----PTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLP-----LNSNVFD 294
GI +P P VF +S +SP Y I + I + K LP L N
Sbjct: 223 PGIQISRLQKTPLLINPGTTVFEFSKGEKSPEYFIGVTAIKIVEKTLPIDPTLLKINAST 282
Query: 295 GKHGTVLDSGTTYAYLPEAAFTAFKDAIVKE--LQSLNQISGPDPNYHDICFSGAGTDVS 352
G GT + S Y L + + AF +++ +S+ +++ P CFS V+
Sbjct: 283 GIGGTKISSVNPYTVLESSIYKAFTSEFIRQAAARSIKRVASVKP--FGACFSTKNVGVT 340
Query: 353 QISKSFPVVDMVF-GNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDP--TTLLGGIVV 409
++ + P + +V + + N M S CLG G +P + ++GG +
Sbjct: 341 RLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSD--DVICLGFVDGGVNPGASVVIGGFQL 398
Query: 410 RNTLVTYDREQTKIGF------WKTNCAEL 433
+ L+ +D K GF +TNCA
Sbjct: 399 EDNLIEFDLASNKFGFSSTLLGRQTNCANF 428
>AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=350
Length = 350
Score = 65.1 bits (157), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 53/126 (42%), Gaps = 16/126 (12%)
Query: 85 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPK-FQPDLSSTYQPVKC 143
+G Y L IG PPQ LI DTGS + +V CS+C C H F P SST+ P C
Sbjct: 81 SGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHC 140
Query: 144 -------------TLDCNCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQS--ELAPQ 188
CN C YE YA+ S +SG+ + S S E +
Sbjct: 141 YDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLK 200
Query: 189 RAVFGC 194
FGC
Sbjct: 201 SVAFGC 206
Score = 57.0 bits (136), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 70/141 (49%), Gaps = 11/141 (7%)
Query: 294 DGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQ 353
G GTV+DSGTT A+L E A+ + A+ + ++ L P + D+C + +G V++
Sbjct: 216 SGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK-LPIADALTPGF-DLCVNVSG--VTK 271
Query: 354 ISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPT---TLLGGIVVR 410
K P + F G + P NY + CL + DP +++G ++ +
Sbjct: 272 PEKILPRLKFEFSGGAVFVPPPRNYFIETEE--QIQCLAI--QSVDPKVGFSVIGNLMQQ 327
Query: 411 NTLVTYDREQTKIGFWKTNCA 431
L +DR+++++GF + CA
Sbjct: 328 GFLFEFDRDRSRLGFSRRGCA 348
>AT1G69100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:25979999-25981436 FORWARD LENGTH=367
Length = 367
Score = 63.5 bits (153), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 106/236 (44%), Gaps = 33/236 (13%)
Query: 94 IGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDDDR 153
+G+PPQ F ++ DTGST +VP + H+ PKF D S T + +K
Sbjct: 53 VGSPPQKFNVVFDTGSTDLWVPSKEWPEETDHKHPKFDKDASKTCRLMK--------GGE 104
Query: 154 MQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETGDLYSQHADGIMGL 213
+ Y E + G+L +D ++ G + + +F N +T S DG++GL
Sbjct: 105 VNIAY-----ETGSVVGILAQDNVNVGGV--VIKSQDLFLARNPDT-YFRSVKFDGVIGL 156
Query: 214 G------RGDLSIMDQLVDKNVVSDSFSLCY-------GGMDVGGGAMVLGGISPPTDMV 260
G +G +++ + +V + +++ Y GG D GG ++ GG P
Sbjct: 157 GIKSSRAQGSVTVWENMVKQKLITKPIFSLYLRPHKGDGGEDPNGGQIMFGGFDPKQFKG 216
Query: 261 FAYSDPVR--SPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAYLPEAA 314
P++ + I + +I++ GK P + D + ++DSG+T + P+ A
Sbjct: 217 EHVYVPMKLSDDRWKIKMSKIYINGK--PAINFCDDVECTAMVDSGSTDIFGPDEA 270
>AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:7568286-7569455 FORWARD LENGTH=389
Length = 389
Score = 62.8 bits (151), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 82/374 (21%), Positives = 148/374 (39%), Gaps = 75/374 (20%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQC-GRHQDPKFQPDLSSTYQPVKCTLD 146
+ + G+P + L +DTGS++T+ C C C + PK++P S TY+ D
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYR------D 111
Query: 147 CNCDDDRMQ-------------CVYERQYAEMSTSSGVLGEDVISFGNQSELAPQR--AV 191
C+D + C Y++ Y + + G L +++I+ +
Sbjct: 112 AMCEDSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVY 171
Query: 192 FGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLG 251
FGC + G ++ GI+GLG G SI+ + K FS C LG
Sbjct: 172 FGCNTLSDGSYFT--GTGILGLGVGKYSIIGEFGSK------FSFC------------LG 211
Query: 252 GISPPT---DMVFAYSDPVRSPYYNIDLKEIH--------VAGKRLPLNSNVFDGKHGTV 300
IS P +++ V+ I++ E H + G+ + L+ V
Sbjct: 212 EISEPKASHNLILGDGANVQGHPTVINITEGHTIFQLESIIVGEEITLDDPV-----QVF 266
Query: 301 LDSGTTYAYLPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPV 360
+D+G+T ++L + F DA + + G P ++ + ++ K
Sbjct: 267 VDTGSTLSHLSTNLYYKFVDA-------FDDLIGSRPLSYEPTLCYKADTIERLEKM--D 317
Query: 361 VDMVFGNGQKYSLSPENYMFRHS--KVRGAYCLGVFQNGKDPTT--LLGGIVVRNTLVTY 416
V F G + S++ N + ++R CL + QN K+ + ++G I ++ V Y
Sbjct: 318 VGFKFDVGAELSVNIHNIFIQQGPPEIR---CLAI-QNNKESFSHVIIGVIAMQGYNVGY 373
Query: 417 DREQTKIGFWKTNC 430
D K +C
Sbjct: 374 DLSAKTAYINKQDC 387
>AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:787143-788444 FORWARD LENGTH=433
Length = 433
Score = 62.8 bits (151), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 147/390 (37%), Gaps = 50/390 (12%)
Query: 88 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVKCTLDC 147
YTT + TP +++ D G +V C +Q P+ + S C C
Sbjct: 44 YTTVINQRTPLVPASVVFDLGGRELWVDCDKGYVSSTYQSPRCNSAVCSRAGSTSCG-TC 102
Query: 148 ------NCDDDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAV------FGCE 195
C ++ + + +TS G DV+S + + P R V F C
Sbjct: 103 FSPPRPGCSNNTCGGIPDNTVTGTATS-GEFALDVVSIQSTNGSNPGRVVKIPNLIFDCG 161
Query: 196 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY----GGMDVGGGAMV-L 250
++ G+ G+GR ++ + Q F++C G G G V L
Sbjct: 162 ATFLLKGLAKGTVGMAGMGRHNIGLPSQFAAAFSFHRKFAVCLTSGKGVAFFGNGPYVFL 221
Query: 251 GGI-------SP----PTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNS-----NVFD 294
GI +P P A+S +S Y I + I + K +P+N N
Sbjct: 222 PGIQISSLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINAST 281
Query: 295 GKHGTVLDSGTTYAYLPEAAFTAFKDAIVKEL--QSLNQISGPDPNYHDICFSGAGTDVS 352
G GT + S Y L + + AF VK+ +S+ +++ P CFS V+
Sbjct: 282 GIGGTKISSVNPYTVLESSIYNAFTSEFVKQAAARSIKRVASVKP--FGACFSTKNVGVT 339
Query: 353 QISKSFPVVDMVF-GNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPTT--LLGGIVV 409
++ + P +++V + + N M S CLG G + T ++GG +
Sbjct: 340 RLGYAVPEIELVLHSKDVVWRIFGANSMVSVSD--DVICLGFVDGGVNARTSVVIGGFQL 397
Query: 410 RNTLVTYDREQTKIGF------WKTNCAEL 433
+ L+ +D K GF +TNCA
Sbjct: 398 EDNLIEFDLASNKFGFSSTLLGRQTNCANF 427
>AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:19627892-19629112 REVERSE LENGTH=406
Length = 406
Score = 53.5 bits (127), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 96/236 (40%), Gaps = 30/236 (12%)
Query: 209 GIMGLGRGDLSIMDQLVDKNV-VSDSFSLCY---------GGMDVGGGAMVLGGISPPTD 258
G+ GL L+ +QL + + F+LC G + GGG L I +
Sbjct: 164 GLAGLAPTALATWNQLTRPRLGLEKKFALCLPSDENPLKKGAIYFGGGPYKLRNIDARSM 223
Query: 259 MVFA--YSDPVRSPYYNIDLKEIHVAGKRLPLNSNVF----DGKHGTVLDSGTTYAYLPE 312
+ + ++P + Y + LK I V G R+ N F +G G L + + L
Sbjct: 224 LSYTRLITNPRKLNNYFLGLKGISVNGNRILFAPNAFAFDRNGDGGVTLSTIFPFTMLRS 283
Query: 313 AAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQKYS 372
+ F +A + + ++S P + C S + + P +D+ NG +
Sbjct: 284 DIYRVFIEAFSQATSGIPRVSSTTP--FEFCLS------TTTNFQVPRIDLELANGVIWK 335
Query: 373 LSPENYMFRHSKVRGAYCLGVFQNGKDP---TTLLGGIVVRNTLVTYDREQTKIGF 425
LSP N M + S CL F NG D ++G + NTLV +D ++ GF
Sbjct: 336 LSPANAMKKVSD--DVACLA-FVNGGDAAAQAVMIGIHQMENTLVEFDVGRSAFGF 388
>AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6408242-6409417 REVERSE LENGTH=391
Length = 391
Score = 50.4 bits (119), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 69/318 (21%), Positives = 120/318 (37%), Gaps = 61/318 (19%)
Query: 135 SSTYQPVKC-TLDCNCDDDRMQC----VYERQYAEMSTSSGVLGEDVISFGNQSELAPQR 189
S+TY P++C + C + C + +++ +S+ + L D + L
Sbjct: 77 STTYHPIRCGSTRCKYANPNFPCPNNVIAKKRTVCLSSDNSRLFRDTVPL-----LYTFN 131
Query: 190 AVFGCENVETGDLYSQHADG-------IMGLGRGDLSIMDQLVDKNVVSDSFSLCY---- 238
V+ ++ + L DG +GL LSI QL+ + +LC
Sbjct: 132 GVYTRDSEMSSSLTLTCTDGAPALKQRTIGLANTHLSIPSQLISMYQLPHKIALCLPSTE 191
Query: 239 ------GGMDVGGGAMVLGGISPPTDMVFAYSDPV----RSPYYNIDLKEIHVAGKRLPL 288
G + +G G +FA S P+ +S Y ID+K I + K +P+
Sbjct: 192 RSQSHNGDLWIGKGEYYYLPYDKDVSKIFA-STPLIGNGKSGEYLIDVKSIQIGAKTVPI 250
Query: 289 NSNVFDGKHGTVLDSGTTYAYLPEAAFTAFKDAIVKEL-----QSLNQISGPDPNYHDIC 343
+G S A +T F+ ++ K L +++ P C
Sbjct: 251 -------PYGATKISTL-------APYTVFQTSLYKALLTAFTENIKIAKAPAVKPFGAC 296
Query: 344 FSGAGTDVSQISKSFPVVDMVFGNGQKYSLSPENYMFRHSKVRGAYCLGVFQNGKDPT-- 401
F G + PV+D+V G K+ + N + + +K CLG G P
Sbjct: 297 FYSNG------GRGVPVIDLVLSGGAKWRIYGSNSLVKVNK--NVVCLGFVDGGVKPKYP 348
Query: 402 TLLGGIVVRNTLVTYDRE 419
++GG + + LV +D E
Sbjct: 349 IVIGGFQMEDNLVEFDLE 366
>AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6414585-6415745 FORWARD LENGTH=386
Length = 386
Score = 49.7 bits (117), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 77/359 (21%), Positives = 138/359 (38%), Gaps = 39/359 (10%)
Query: 83 LLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSSCEQCGRHQDPKFQPDLSSTYQPVK 142
L G Y ++ +G P L+VD ++ + CSS RH S+ +K
Sbjct: 40 LPTGQYLAQIRLGDSPDPVKLVVDLAGSILWFDCSS-----RHVSSSRNLISGSSSGCLK 94
Query: 143 CTLD--------CNCDDDRMQC-VYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFG 193
+ + D C + + A T+ G L DV+S G+ + +F
Sbjct: 95 AKVGNERVSSSSSSRKDQNADCELLVKNDAFGITARGELFSDVMSVGSVTSPGTVDLLFA 154
Query: 194 CENVETGDLYSQHADGIMGLGRGDLSIMDQLV----DKNVVSDSFSLCYGGMDVGGGAMV 249
C + A G+MGLGR +S+ QL ++ ++ S G + V
Sbjct: 155 CTPPWLLRGLASGAQGVMGLGRAQISLPSQLAAETNERRRLTVYLSPLNGVVSTSSVEEV 214
Query: 250 LGGISPPTDMVFAYSDPVRSPYYNIDLKEIHVAGKRLPLNSNVFDGKHGTVLDSGTTYAY 309
G++ +V+ S Y I++K I V G++L + +G L + Y
Sbjct: 215 F-GVAASRSLVYTPLLTGSSGNYVINVKSIRVNGEKLSV-----EGPLAVELSTVVPYTI 268
Query: 310 LPEAAFTAFKDAIVKELQSLNQISGPDPNYHDICFSGAGTDVSQISKSFPVVDMVFGNGQ 369
L + + F +A K + P +CF+ +DV FP VD+ +
Sbjct: 269 LESSIYKVFAEAYAKAAGEATSV--PPVAPFGLCFT---SDV-----DFPAVDLALQSEM 318
Query: 370 -KYSLSPENYMFRHSKVRGAYCLGVFQNGKDPT--TLLGGIVVRNTLVTYDREQTKIGF 425
++ + +N M G C G+ G ++GG+ + ++ +D + +GF
Sbjct: 319 VRWRIHGKNLMVDVGG--GVRCSGIVDGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGF 375