Miyakogusa Predicted Gene
- Lj0g3v0159799.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0159799.1 Non Characterized Hit- tr|I1LZC5|I1LZC5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.4857
PE=,83.98,0,Asp,Peptidase A1; no description,Peptidase aspartic,
catalytic; seg,NULL; Acid proteases,Peptidase a,CUFF.9909.1
(478 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 537 e-153
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 536 e-152
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 496 e-140
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 418 e-117
AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 292 4e-79
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 274 1e-73
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 269 2e-72
AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family pr... 259 2e-69
AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 160 2e-39
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 144 1e-34
AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family pr... 144 1e-34
AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family pr... 143 2e-34
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 132 4e-31
AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 122 5e-28
AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 121 9e-28
AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family pr... 121 1e-27
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 120 2e-27
AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family pr... 120 2e-27
AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family pr... 120 2e-27
AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family pr... 119 5e-27
AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family pr... 119 6e-27
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 118 9e-27
AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family pr... 113 4e-25
AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 5e-25
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 106 4e-23
AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 105 9e-23
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 102 6e-22
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 2e-21
AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 9e-21
AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family pr... 98 1e-20
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 98 1e-20
AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 98 1e-20
AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family pr... 97 4e-20
AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 95 1e-19
AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 94 2e-19
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 94 2e-19
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 93 3e-19
AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 8e-19
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 9e-19
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 90 3e-18
AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family pr... 89 9e-18
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 88 1e-17
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 88 1e-17
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 3e-17
AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family pr... 86 7e-17
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 86 8e-17
AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family pr... 84 2e-16
AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 82 8e-16
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 2e-15
AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 4e-15
AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family pr... 79 5e-15
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 78 1e-14
AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 78 2e-14
AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family pr... 78 2e-14
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 77 3e-14
AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family pr... 73 5e-13
AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 71 2e-12
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 69 7e-12
AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 67 3e-11
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 3e-11
AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family pr... 60 4e-09
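The hit summary above can be read programmatically. The following is a minimal sketch, assuming each summary line ends with a bit score followed by an E-value, as it does in this report. The names `HIT_LINE` and `parse_hit_line` are hypothetical helpers, not part of BLAST. Legacy BLAST prints very small E-values without a leading digit (for example `e-153`), so the sketch prepends `1` before converting to a float.

```python
import re

# One summary line: sequence ID, free-text description, bit score, E-value.
# The description is matched lazily so that the last two whitespace-separated
# tokens are taken as the score and the E-value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|\s*(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)

def parse_hit_line(line):
    """Return (id, bit_score, e_value) for a summary line, or None if it does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    evalue = m.group("evalue")
    # Legacy BLAST writes "e-153" rather than "1e-153"; float() needs the mantissa.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return m.group("id"), float(m.group("bits")), float(evalue)

if __name__ == "__main__":
    line = "AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 537 e-153"
    print(parse_hit_line(line))
```

For anything beyond a quick look, a maintained BLAST parser is likely a better choice than a hand-written regular expression.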
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 537 bits (1384), Expect = e-153, Method: Compositional matrix adjust.
Identities = 264/390 (67%), Positives = 307/390 (78%), Gaps = 2/390 (0%)
Query: 49 PFILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFD 108
P+++ + LYFTKVK+GSPP EFNVQIDTGSDILWV C++CSNCP +SGLGI+L+FFD
Sbjct: 94 PYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFD 153
Query: 109 TVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDM 168
GS TAG V CSDPIC+S Q AA+CS + NQC Y+F+YGDGSGTSGYY++D YFD
Sbjct: 154 APGSLTAGSVTCSDPICSSVFQTTAAQCS-ENNQCGYSFRYGDGSGTSGYYMTDTFYFDA 212
Query: 169 ILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVF 228
ILG++ NSSA IVFGCSTYQSGDLTK+DKAVDGIFGFG G LSVVSQLSSRGITP VF
Sbjct: 213 ILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVF 272
Query: 229 SHCXXXXXXXXXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATS 288
SHC P +VYSPLVPSQPHYNLNL SI VNGQ+L ++ AVF S
Sbjct: 273 SHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEAS 332
Query: 289 NNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTV 348
N RGTIVD GTTL YLV+EAYD +NAI+ +VSQ +PIIS G QCYLVST + D+FP+V
Sbjct: 333 NTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSV 392
Query: 349 SLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYDLAN 408
SLNFAGG SM+L+P+ YL YG DGA+MWCIGFQK E TILGDLVLKDK+ VYDLA
Sbjct: 393 SLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLAR 452
Query: 409 QRIGWTNYDCSLSVNVSVTSSKDEYISAGQ 438
QRIGW +YDCS+SVNVS+TS KD +++GQ
Sbjct: 453 QRIGWASYDCSMSVNVSITSGKD-IVNSGQ 481
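The per-alignment statistics line (Identities, Positives, Gaps) can be extracted in the same way. This is a sketch under the assumption that the line keeps the `Name = n/d (p%)` layout seen in this report. `STATS` and `parse_stats` are hypothetical names. In this report the printed percentages are consistent with truncation rather than rounding (264/390 is about 67.7% and is shown as 67%), which the last line of the example checks with integer division.

```python
import re

# Matches each "Name = numerator/denominator (percent%)" field on the statistics line.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_stats(line):
    """Map each field name to (count, alignment_length, reported_percent)."""
    return {name: (int(n), int(d), int(p)) for name, n, d, p in STATS.findall(line)}

if __name__ == "__main__":
    line = "Identities = 264/390 (67%), Positives = 307/390 (78%), Gaps = 2/390 (0%)"
    stats = parse_stats(line)
    print(stats)
    count, length, reported = stats["Identities"]
    # Integer division reproduces the truncated percentage printed in the report.
    print(count * 100 // length == reported)
```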
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 536 bits (1381), Expect = e-152, Method: Compositional matrix adjust.
Identities = 263/380 (69%), Positives = 302/380 (79%), Gaps = 2/380 (0%)
Query: 59 LYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLV 118
LYFTKVK+GSPP EFNVQIDTGSDILWV C++CSNCP +SGLGI+L+FFD GS TAG V
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
CSDPIC+S Q AA+CS + NQC Y+F+YGDGSGTSGYY++D YFD ILG++ NS
Sbjct: 159 TCSDPICSSVFQTTAAQCS-ENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217
Query: 179 SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX 238
SA IVFGCSTYQSGDLTK+DKAVDGIFGFG G LSVVSQLSSRGITP VFSHC
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSG 277
Query: 239 XXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCG 298
P +VYSPLVPSQPHYNLNL SI VNGQ+L ++ AVF SN RGTIVD G
Sbjct: 278 GGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTG 337
Query: 299 TTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVSLNFAGGVSM 358
TTL YLV+EAYD +NAI+ +VSQ +PIIS G QCYLVST + D+FP+VSLNFAGG SM
Sbjct: 338 TTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASM 397
Query: 359 LLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYDLANQRIGWTNYDC 418
+L+P+ YL YG DGA+MWCIGFQK E TILGDLVLKDK+ VYDLA QRIGW +YDC
Sbjct: 398 MLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457
Query: 419 SLSVNVSVTSSKDEYISAGQ 438
S+SVNVS+TS KD +++GQ
Sbjct: 458 SMSVNVSITSGKD-IVNSGQ 476
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 496 bits (1278), Expect = e-140, Method: Compositional matrix adjust.
Identities = 249/421 (59%), Positives = 312/421 (74%), Gaps = 14/421 (3%)
Query: 28 ARD--HYGEVLDM---VTTCPLD-IISPFILEAIGFRLYFTKVKMGSPPREFNVQIDTGS 81
ARD +G +L V P+D PF++ LY+TK+++G+PPR+F VQ+DTGS
Sbjct: 48 ARDEARHGRLLQSLGGVIDFPVDGTFDPFVVG-----LYYTKLRLGTPPRDFYVQVDTGS 102
Query: 82 DILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVN 141
D+LWV+C +C+ CPQTSGL I+LNFFD S TA + CSD C+ G+Q + + CS Q N
Sbjct: 103 DVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNN 162
Query: 142 QCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSSANIVFGCSTYQSGDLTKADKAV 201
C+YTFQYGDGSGTSG+YVSD + FDMI+G + NS+A +VFGCST Q+GDL K+D+AV
Sbjct: 163 LCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAV 222
Query: 202 DGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXXXXXXXXXXXEPSIVYSPLVPSQ 261
DGIFGFG +SV+SQL+S+GI P+VFSHC EP++V++PLVPSQ
Sbjct: 223 DGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVFTPLVPSQ 282
Query: 262 PHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVS 321
PHYN+NL SI+VNGQ L IN +VF+TSN +GTI+D GTTLAYL + AY P V AIT AVS
Sbjct: 283 PHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVS 342
Query: 322 QSTSPIISKGNQCYLVSTRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIG 381
QS P++SKGNQCY+++T VGDIFP VSLNFAGG SM L P+ YL+ V G A+WCIG
Sbjct: 343 QSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIG 402
Query: 382 FQKVQ-EGVTILGDLVLKDKIVVYDLANQRIGWTNYDCSLSVNVSVTSS--KDEYISAGQ 438
FQ++Q +G+TILGDLVLKDKI VYDL QRIGW NYDCS SVNVS TSS + EY++AGQ
Sbjct: 403 FQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNVSATSSSGRSEYVNAGQ 462
Query: 439 L 439
Sbjct: 463 F 463
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 418 bits (1074), Expect = e-117, Method: Compositional matrix adjust.
Identities = 221/392 (56%), Positives = 274/392 (69%), Gaps = 8/392 (2%)
Query: 59 LYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLV 118
LY+TKVK+G+PPREFNVQIDTGSD+LWV+C +C+ CP+TS L I+L+FFD S++A LV
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
CSD C S Q + CSP N CSY+F+YGDGSGTSGYY+SD M FD ++ +NS
Sbjct: 143 SCSDRRCYSNFQTESG-CSPN-NLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200
Query: 179 SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX 238
SA VFGCS QSGDL + +AVDGIFG G G+LSV+SQL+ +G+ P+VFSHC
Sbjct: 201 SAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG 260
Query: 239 XXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCG 298
P VY+PLVPSQPHYN+NLQSIAVNGQ+L I+ +VF + GTI+D G
Sbjct: 261 GGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTG 320
Query: 299 TTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVSLNFAGGVSM 358
TTLAYL EAY P + A+ AVSQ PI + QC+ ++ D+FP VSL+FAGG SM
Sbjct: 321 TTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASM 380
Query: 359 LLKPEQYLMPYGFVDGAAMWCIGFQKV-QEGVTILGDLVLKDKIVVYDLANQRIGWTNYD 417
+L P YL + G+++WCIGFQ++ +TILGDLVLKDK+VVYDL QRIGW YD
Sbjct: 381 VLGPRAYLQIFS-SSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYD 439
Query: 418 CSLSVNVSVT---SSKDEYISAGQLRVSSSES 446
CSL VNVS + SKD I+ GQ R S SES
Sbjct: 440 CSLEVNVSASRGGRSKD-VINTGQWRESGSES 470
>AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:1762843-1766150 REVERSE LENGTH=485
Length = 485
Score = 292 bits (747), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 151/381 (39%), Positives = 221/381 (58%), Gaps = 19/381 (4%)
Query: 59 LYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLV 118
LY+ K+ +G+P + + VQ+DTGSDI+WVNC C CP+ S LGIEL ++ S + LV
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
C D C G + C ++ C Y YGDGS T+GY+V D + +D + G +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMS-CPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197
Query: 179 SANIVFGCSTYQSGDLTKA-DKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXX 237
+ +++FGC QSGDL + ++A+DGI GFG S++SQL+S G K+F+HC
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNG 257
Query: 238 XXXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDC 297
+P + +PLVP+QPHYN+N+ ++ V + L+I +F + +G I+D
Sbjct: 258 GGIFAIGRVV-QPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDS 316
Query: 298 GTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVSLNFAGGVS 357
GTTLAYL + Y+PLV IT+ I+ K +C+ S RV + FP V+ +F V
Sbjct: 317 GTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVF 376
Query: 358 MLLKPEQYLMPYGFVDGAAMWCIGFQ------KVQEGVTILGDLVLKDKIVVYDLANQRI 411
+ + P YL P+ MWCIG+Q + + +T+LGDLVL +K+V+YDL NQ I
Sbjct: 377 LRVYPHDYLFPH-----EGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLI 431
Query: 412 GWTNYDCSLSVNVSVTSSKDE 432
GWT Y+CS S+ V KDE
Sbjct: 432 GWTEYNCSSSIKV-----KDE 447
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 274 bits (700), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 155/407 (38%), Positives = 229/407 (56%), Gaps = 19/407 (4%)
Query: 53 EAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGS 112
E+IG LYF K+ +G+P R+F+VQ+DTGSDILWVNC C CP+ S L +EL +D S
Sbjct: 80 ESIG--LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VELTPYDVDAS 136
Query: 113 TTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQ 172
+TA V CSD C+ Q +EC + C Y YGDGS T+GY V D ++ D++ G
Sbjct: 137 STAKSVSCSDNFCSYVNQ--RSECHSG-STCQYVIMYGDGSSTNGYLVKDVVHLDLVTGN 193
Query: 173 APPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCX 232
+++ I+FGC + QSG L ++ AVDGI GFG S +SQL+S+G + F+HC
Sbjct: 194 RQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHC- 252
Query: 233 XXXXXXXXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRG 292
P + +P++ HY++NL +I V +L ++ F + +++G
Sbjct: 253 LDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKG 312
Query: 293 TIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVSLNF 352
I+D GTTL YL Y+PL+N I + + T + + C+ + ++ D FPTV+ F
Sbjct: 313 VIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFTCFHYTDKL-DRFPTVTFQF 371
Query: 353 AGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQ------KVQEGVTILGDLVLKDKIVVYDL 406
VS+ + P +YL F WC G+Q K +TILGD+ L +K+VVYD+
Sbjct: 372 DKSVSLAVYPREYL----FQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDI 427
Query: 407 ANQRIGWTNYDCSLSVNVSVTSSKDEY-ISAGQLRVSSSESVTGILS 452
NQ IGWTN++CS + V S Y + A L SSS ++T +L+
Sbjct: 428 ENQVIGWTNHNCSGGIQVKDEESGAIYTVGAHNLSWSSSLAITKLLT 474
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 269 bits (688), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 154/407 (37%), Positives = 228/407 (56%), Gaps = 15/407 (3%)
Query: 59 LYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLV 118
LYFTK+K+GSPP+E+ VQ+DTGSDILWVNC C CP + LGI L+ +D+ S+T+ V
Sbjct: 77 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNV 136
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
C D C+ +Q +E CSY YGDGS + G ++ D + + + G
Sbjct: 137 GCEDDFCSFIMQ---SETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPL 193
Query: 179 SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX 238
+ +VFGC QSG L + D AVDGI GFG S++SQL++ G T ++FSHC
Sbjct: 194 AQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHC-LDNMNG 252
Query: 239 XXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCG 298
P + +P+VP+Q HYN+ L+ + V+G + + ++ +T+ + GTI+D G
Sbjct: 253 GGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSG 312
Query: 299 TTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVSLNFAGGVSM 358
TTLAYL Q Y+ L+ I TA Q ++ + C+ ++ FP V+L+F + +
Sbjct: 313 TTLAYLPQNLYNSLIEKI-TAKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKL 371
Query: 359 LLKPEQYLMPYGFVDGAAMWCIGFQK----VQEG--VTILGDLVLKDKIVVYDLANQRIG 412
+ P YL F M+C G+Q Q+G V +LGDLVL +K+VVYDL N+ IG
Sbjct: 372 SVYPHDYL----FSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIG 427
Query: 413 WTNYDCSLSVNVSVTSSKDEYISAGQLRVSSSESVTGILSKLLPVSI 459
W +++CS S+ V S + A L ++S + G L LL + I
Sbjct: 428 WADHNCSSSIKVKDGSGAAYQLGAENLISAASSVMNGTLVTLLSILI 474
>AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24230963-24233349 REVERSE LENGTH=475
Length = 475
Score = 259 bits (663), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 154/416 (37%), Positives = 222/416 (53%), Gaps = 23/416 (5%)
Query: 59 LYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLV 118
LYFTK+K+GSPP+E++VQ+DTGSDILW+NC C CP + L L+ FD S+T+ V
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKV 132
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
C D C+ Q + C P + CSY Y D S + G ++ D + + + G
Sbjct: 133 GCDDDFCSFISQ--SDSCQPALG-CSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPL 189
Query: 179 SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX 238
+VFGC + QSG L D AVDG+ GFG SV+SQL++ G +VFSHC
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHC-LDNVKG 248
Query: 239 XXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCG 298
P + +P+VP+Q HYN+ L + V+G L + +++ N GTIVD G
Sbjct: 249 GGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV---RNGGTIVDSG 305
Query: 299 TTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVSLNFAGGVSM 358
TTLAY + YD L+ I A I+ + QC+ ST V + FP VS F V +
Sbjct: 306 TTLAYFPKVLYDSLIETI-LARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKL 364
Query: 359 LLKPEQYLMPYGFVDGAAMWCIGFQ------KVQEGVTILGDLVLKDKIVVYDLANQRIG 412
+ P YL F ++C G+Q + V +LGDLVL +K+VVYDL N+ IG
Sbjct: 365 TVYPHDYL----FTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIG 420
Query: 413 WTNYDCSLSVNVSVTSSKDEYISAGQLRVSSSES---VTGILSKLLPVSIVAALSM 465
W +++CS S+ + S S G +SS+ +T +L+ L P+ ++A S+
Sbjct: 421 WADHNCSSSIKIKDGSGG--VYSVGADNLSSAPRLLMITKLLTILSPLIVMAFTSL 474
>AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:17299264-17302718 FORWARD LENGTH=631
Length = 631
Score = 160 bits (404), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 179/369 (48%), Gaps = 35/369 (9%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
Y T++ +G+PP+EF + +DTGS + +V C+TC C + + ST+ +
Sbjct: 76 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPEL-----STSYQALK 130
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
C +P C C + C Y +Y + S +SG D + F +P
Sbjct: 131 C-NPDCN---------CDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSP----- 175
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCX-XXXXXX 238
VFGC ++GDL + DGI G G G LSVV QL +G+ VFS C
Sbjct: 176 QRAVFGCENEETGDL--FSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG 233
Query: 239 XXXXXXXXXXEPSIVYSPLVP-SQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDC 297
P +V+S P P+YN++L+ + V G+ L +N VF + GT++D
Sbjct: 234 GAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVF--NGKHGTVLDS 291
Query: 298 GTTLAYLVQEAYDPLVNAITTAV---SQSTSPIISKGNQCYLVSTR----VGDIFPTVSL 350
GTT AY +EA+ + +A+ + + P + + C+ + R + + FP +++
Sbjct: 292 GTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAM 351
Query: 351 NFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYDLANQR 410
F G ++L PE YL + V GA +C+G ++ T+LG +V+++ +V YD N +
Sbjct: 352 EFGNGQKLILSPENYLFRHTKVRGA--YCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDK 409
Query: 411 IGWTNYDCS 419
+G+ +CS
Sbjct: 410 LGFLKTNCS 418
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 144 bits (363), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 173/371 (46%), Gaps = 37/371 (9%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
Y T++ +G+PP+ F + +D+GS + +V C+ C C + F S+T V
Sbjct: 93 YTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPK-----FQPEMSSTYQPVK 147
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
C+ C QC Y +Y + S + G D + F P
Sbjct: 148 CN----------MDCNCDDDREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTP----- 192
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXX 239
VFGC T ++GDL + DGI G G G LS+V QL +G+ F C
Sbjct: 193 QRAVFGCETVETGDLYS--QRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGG 250
Query: 240 XXXXXXXXXEPS-IVYSPLVPSQ-PHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDC 297
PS +V++ P + P+YN++L I V G+ LS++ VF G ++D
Sbjct: 251 GSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVF--DGEHGAVLDS 308
Query: 298 GTTLAYLVQEAYDPLVNAITTAVS---QSTSPIISKGNQCYLVS-----TRVGDIFPTVS 349
GTT AYL A+ A+ VS Q P + + C+ V+ + + IFP+V
Sbjct: 309 GTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVE 368
Query: 350 LNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIG-FQKVQEGVTILGDLVLKDKIVVYDLAN 408
+ F G S LL PE Y+ + V GA +C+G F ++ T+LG +V+++ +VVYD N
Sbjct: 369 MVFKSGQSWLLSPENYMFRHSKVHGA--YCLGVFPNGKDHTTLLGGIVVRNTLVVYDREN 426
Query: 409 QRIGWTNYDCS 419
++G+ +CS
Sbjct: 427 SKVGFWRTNCS 437
>AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:14665728-14669135 REVERSE LENGTH=430
Length = 430
Score = 144 bits (363), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 126/208 (60%), Gaps = 20/208 (9%)
Query: 250 PSIVYSP--LVPSQP-HYN---LNLQSIAVNGQLLSINQAVFATSNNRGTIVDCGTTLAY 303
P++ +P V SQP +YN ++ ++AVN L I+ +VF+ + GTI+D GTTL +
Sbjct: 208 PALCSTPCSTVSSQPLYYNPQFSHMMTVAVNDLRLPIDPSVFSVAKGYGTIIDSGTTLVH 267
Query: 304 LVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTR------VGDIFPTVSLNFAGGVS 357
EAYDPL+ AI VSQ PI + QC+ +++ + D+FP V L FAGG S
Sbjct: 268 FPGEAYDPLIQAILNVVSQYGRPIPYESFQCFNITSGISSHLVIADMFPEVHLGFAGGAS 327
Query: 358 MLLKPEQYLMPYGFVD-GAAMWCIGF-QKVQEGVTILGDLVLKDKIVVYDLANQRIGWTN 415
M++KPE YL F+D A+WC+GF +TI+G++ ++DK+ VYDL +QRIGW
Sbjct: 328 MVIKPEAYLFQ-KFLDLTNAIWCLGFYSSTSRRITIIGEVAIRDKMFVYDLDHQRIGWAE 386
Query: 416 YDCSLSV-----NVSVTSSKDEYISAGQ 438
Y+CSL V N +T++K ++G+
Sbjct: 387 YNCSLDVTRAQQNKDITNTKHSTGNSGK 414
Score = 105 bits (263), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 52/111 (46%), Positives = 75/111 (67%), Gaps = 7/111 (6%)
Query: 59 LYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLV 118
LY+T V++G+PPRE +V IDTGSD++WV+CN+C CP + + FFD S++A +
Sbjct: 77 LYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSCVGCPLHN-----VTFFDPGASSSAVKL 131
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMI 169
CSD C+S +Q + CS + C+Y +YGDGS TSGYY+SD + FD +
Sbjct: 132 ACSDKRCSSDLQ-KKSRCS-LLESCTYKVEYGDGSVTSGYYISDLISFDTM 180
>AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:16787508-16789318 REVERSE LENGTH=405
Length = 405
Score = 143 bits (361), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 162/401 (40%), Gaps = 46/401 (11%)
Query: 38 MVTTCPLDIISPFILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNT-CSNCPQ 96
+ + P ++ P Y +++GSPP+ F IDTGSD+ WV C+ CS C
Sbjct: 27 FIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTL 86
Query: 97 TSGLGIELNFFDTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTS 156
L + ++PCS+PICT+ C QC Y +Y D +
Sbjct: 87 PPNLQYK---------PKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSM 137
Query: 157 GYYVSDAMYFDMILGQ--APPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSV 214
G V+D ++ G PPV FGC QS A G+ G G G + +
Sbjct: 138 GALVTDQFPLKLVNGSFMQPPV------AFGCGYDQSYPSAHPPPATAGVLGLGRGKIGL 191
Query: 215 VSQLSSRGITPKVFSHCXXXXXXXXXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVN 274
++QL S G+T V HC + ++PL+ HY + N
Sbjct: 192 LTQLVSAGLTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQDNHYTTGPADLLFN 251
Query: 275 GQLLSINQAVFATSNNRGTIVDCGTTLAYLVQEAYDPLVNAI---------TTAVSQSTS 325
G+ + I D G++ Y +AY ++N I A T
Sbjct: 252 GKPTGLKGLKL--------IFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTL 303
Query: 326 PIISKGNQCYLVSTRVGDIFPTVSLNFAGG---VSMLLKPEQYLMPYGFVDGAAMWCIGF 382
PI KG + + V + F T+++NF G + L PE YL+ V C+G
Sbjct: 304 PICWKGAKPFKSVLEVKNFFKTITINFTNGRRNTQLYLAPELYLI----VSKTGNVCLGL 359
Query: 383 QKVQE----GVTILGDLVLKDKIVVYDLANQRIGWTNYDCS 419
E ++GD+ ++ +++YD Q++GW + DC+
Sbjct: 360 LNGSEVGLQNSNVIGDISMQGLMMIYDNEKQQLGWVSSDCN 400
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 132 bits (333), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 167/381 (43%), Gaps = 33/381 (8%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
YF +++G PP+ + DTGSD++WV C+ C NC S + F S+T
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATV----FFPRHSSTFSPAH 139
Query: 120 CSDPICTSGVQGAAAECSPQVN---QCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPV 176
C DP+C + A C Y + Y DGS TSG + + G+ +
Sbjct: 140 CYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARL 199
Query: 177 NSSANIVFGCSTYQSGDLTKAD--KAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXX 234
S + FGC SG +G+ G G G +S SQL R FS+C
Sbjct: 200 KS---VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYCLMD 254
Query: 235 XXXXXXXXXXXXXXE-----PSIVYSPLV--PSQP-HYNLNLQSIAVNGQLLSINQAVFA 286
+ ++PL+ P P Y + L+S+ VNG L I+ +++
Sbjct: 255 YTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWE 314
Query: 287 --TSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG-NQCYLVS--TRV 341
S N GT+VD GTTLA+L + AY ++ A+ V + ++ G + C VS T+
Sbjct: 315 IDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKP 374
Query: 342 GDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKV--QEGVTILGDLVLKD 399
I P + F+GG + P Y + + C+ Q V + G +++G+L+ +
Sbjct: 375 EKILPRLKFEFSGGAVFVPPPRNYFI----ETEEQIQCLAIQSVDPKVGFSVIGNLMQQG 430
Query: 400 KIVVYDLANQRIGWTNYDCSL 420
+ +D R+G++ C+L
Sbjct: 431 FLFEFDRDRSRLGFSRRGCAL 451
>AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18150638-18153186 FORWARD LENGTH=583
Length = 583
Score = 122 bits (306), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 161/388 (41%), Gaps = 49/388 (12%)
Query: 59 LYFTKVKMGSPP--REFNVQIDTGSDILWVNCNT-CSNCPQTSGLGIELNFFDTVGSTTA 115
LY+T++ +G P + +++ IDTGS++ W+ C+ C++C + +
Sbjct: 202 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN--------QLYKPRKD 253
Query: 116 GLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPP 175
LV S+ C + E +QC Y +Y D S + G D + + G
Sbjct: 254 NLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSL-- 311
Query: 176 VNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXX 235
+ ++IVFGC Q G L DGI G +S+ SQL+SRGI V HC
Sbjct: 312 --AESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASD 369
Query: 236 XXXXXXXXXXXXXEPS--IVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGT 293
PS + + P++ ++ L + + +S Q + + G
Sbjct: 370 LNGEGYIFMGSDLVPSHGMTWVPML-----HDSRLDAYQMQVTKMSYGQGMLSLDGENGR 424
Query: 294 I----VDCGTTLAYLVQEAYDPLVNA--------ITTAVSQSTSPIISKGNQCYLVS--T 339
+ D G++ Y +AY LV + +T S T PI + + S +
Sbjct: 425 VGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLS 484
Query: 340 RVGDIFPTVSLNFAG-----GVSMLLKPEQYLMPYGFVDGAAMWCIGF---QKVQEGVT- 390
V F ++L +L++PE YL+ + C+G V +G T
Sbjct: 485 DVKKFFRPITLQIGSKWLIISRKLLIQPEDYLI----ISNKGNVCLGILDGSSVHDGSTI 540
Query: 391 ILGDLVLKDKIVVYDLANQRIGWTNYDC 418
ILGD+ ++ ++VYD +RIGW DC
Sbjct: 541 ILGDISMRGHLIVYDNVKRRIGWMKSDC 568
>AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:7713488-7716269 FORWARD LENGTH=513
Length = 513
Score = 121 bits (304), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 169/379 (44%), Gaps = 38/379 (10%)
Query: 52 LEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQT----SGLGIELNFF 107
++A+GF L++ V +G+P F V +DTGSD+ W+ C+ C+NC + G ++LN +
Sbjct: 97 VDALGF-LHYANVTVGTPSDWFMVALDTGSDLFWLPCD-CTNCVRELKAPGGSSLDLNIY 154
Query: 108 DTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQY-GDGSGTSGYYVSDAMYF 166
S+T+ VPC+ +CT G + A+ E + C Y +Y +G+ ++G V D ++
Sbjct: 155 SPNASSTSTKVPCNSTLCTRGDRCASPE-----SDCPYQIRYLSNGTSSTGVLVEDVLH- 208
Query: 167 DMILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPK 226
++ A + FGC Q+G A +G+FG G +SV S L+ GI
Sbjct: 209 -LVSNDKSSKAIPARVTFGCGQVQTGVFHDG-AAPNGLFGLGLEDISVPSVLAKEGIAAN 266
Query: 227 VFSHCXXXXXXXXXXXXXXXXXEPSIVYSPLVPSQPH--YNLNLQSIAVNGQLLSIN-QA 283
FS C + +PL QPH YN+ + I+V G + A
Sbjct: 267 SFSMCFGNDGAGRISFGDKGSVDQR--ETPLNIRQPHPTYNITVTKISVGGNTGDLEFDA 324
Query: 284 VFATSNNRGTIVDCGTTLAYLVQEAYDPL---VNAITTAVSQSTSPIISKGNQCYLVSTR 340
VF D GT+ YL AY + N++ T+ CY +S
Sbjct: 325 VF----------DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPN 374
Query: 341 VGDI-FPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKD 399
+P V+L GG S + ++P D ++C+ K+ E ++I+G +
Sbjct: 375 KDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTD---VYCLAIMKI-EDISIIGQNFMTG 430
Query: 400 KIVVYDLANQRIGWTNYDC 418
VV+D +GW DC
Sbjct: 431 YRVVFDREKLILGWKESDC 449
>AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18151161-18153186 FORWARD LENGTH=410
Length = 410
Score = 121 bits (304), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 161/388 (41%), Gaps = 49/388 (12%)
Query: 59 LYFTKVKMGSPP--REFNVQIDTGSDILWVNCNT-CSNCPQTSGLGIELNFFDTVGSTTA 115
LY+T++ +G P + +++ IDTGS++ W+ C+ C++C + +
Sbjct: 29 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN--------QLYKPRKD 80
Query: 116 GLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPP 175
LV S+ C + E +QC Y +Y D S + G D + + G
Sbjct: 81 NLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSL-- 138
Query: 176 VNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXX 235
+ ++IVFGC Q G L DGI G +S+ SQL+SRGI V HC
Sbjct: 139 --AESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASD 196
Query: 236 XXXXXXXXXXXXXEPS--IVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGT 293
PS + + P++ ++ L + + +S Q + + G
Sbjct: 197 LNGEGYIFMGSDLVPSHGMTWVPML-----HDSRLDAYQMQVTKMSYGQGMLSLDGENGR 251
Query: 294 I----VDCGTTLAYLVQEAYDPLVNA--------ITTAVSQSTSPIISKGNQCYLVS--T 339
+ D G++ Y +AY LV + +T S T PI + + S +
Sbjct: 252 VGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLS 311
Query: 340 RVGDIFPTVSLNFAG-----GVSMLLKPEQYLMPYGFVDGAAMWCIGF---QKVQEGVT- 390
V F ++L +L++PE YL+ + C+G V +G T
Sbjct: 312 DVKKFFRPITLQIGSKWLIISRKLLIQPEDYLI----ISNKGNVCLGILDGSSVHDGSTI 367
Query: 391 ILGDLVLKDKIVVYDLANQRIGWTNYDC 418
ILGD+ ++ ++VYD +RIGW DC
Sbjct: 368 ILGDISMRGHLIVYDNVKRRIGWMKSDC 395
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 120 bits (301), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 118/429 (27%), Positives = 196/429 (45%), Gaps = 51/429 (11%)
Query: 2 SPVGGVI-KVNFVAPILVKKVARLDFIARDHYGEVLDMVTTCPLDIISPFILEAI-GFRL 59
S V G++ K+ F V+ V R D + Y E T D+ +P + A G
Sbjct: 111 SRVAGIVAKIRFA----VEGVDRSDL--KPVYNEDTRYQTE---DLTTPVVSGASQGSGE 161
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
YF+++ +G+P +E + +DTGSD+ W+ C C++C Q S F+ S+T +
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSD-----PVFNPTSSSTYKSLT 216
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
CS P C S ++ +A + N+C Y YGDGS T G +D + F G + +N
Sbjct: 217 CSAPQC-SLLETSAC----RSNKCLYQVSYGDGSFTVGELATDTVTF----GNSGKIN-- 265
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXX 239
N+ GC G T A + G LS+ +Q+ + FS+C
Sbjct: 266 -NVALGCGHDNEGLFTGAAGLLGLG----GGVLSITNQMKATS-----FSYCLVDRDSGK 315
Query: 240 XXXXXXXXXE--PSIVYSPLVPSQP---HYNLNLQSIAVNGQLLSINQAVF--ATSNNRG 292
+ +PL+ ++ Y + L +V G+ + + A+F S + G
Sbjct: 316 SSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGG 375
Query: 293 TIVDCGTTLAYLVQEAYDPLVNA---ITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVS 349
I+DCGT + L +AY+ L +A +T + + +S IS + CY S+ PTV+
Sbjct: 376 VILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSS-ISLFDTCYDFSSLSTVKVPTVA 434
Query: 350 LNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYDLANQ 409
+F GG S+ L + YL+P VD + +C F ++I+G++ + + YDL+
Sbjct: 435 FHFTGGKSLDLPAKNYLIP---VDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKN 491
Query: 410 RIGWTNYDC 418
IG + C
Sbjct: 492 VIGLSGNKC 500
>AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108781-16110679 REVERSE LENGTH=425
Length = 425
Score = 120 bits (301), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 158/379 (41%), Gaps = 47/379 (12%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNT-CSNCPQTSGLGIELNFFDTVGSTTAGLV 118
Y + +G PPR + + +DTGSD+ W+ C+ C C L + ++ L+
Sbjct: 60 YNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC---------LEAPHPLYQPSSDLI 110
Query: 119 PCSDPICTSGVQGAAAEC-SPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVN 177
PC+DP+C + + C +P+ QC Y +Y DG + G V D + G +
Sbjct: 111 PCNDPLCKALHLNSNQRCETPE--QCDYEVEYADGGSSLGVLVRDVFSMNYTQG----LR 164
Query: 178 SSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXX 237
+ + GC Y + +DG+ G G G +S++SQL S+G V HC
Sbjct: 165 LTPRLALGCG-YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 223
Query: 238 XXXXXXXXXXXEPSIVYSPLVPS-QPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVD 296
+ ++P+ HY S A+ G+LL + N T+ D
Sbjct: 224 GILFFGDDLYDSSRVSWTPMSREYSKHY-----SPAMGGELLFGGRTT--GLKNLLTVFD 276
Query: 297 CGTTLAYLVQEAYDPLV---------NAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPT 347
G++ Y +AY + + A T P+ +G + ++ V F
Sbjct: 277 SGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKP 336
Query: 348 VSLNFAGGVS----MLLKPEQYLMPYGFVDGAAMWCIGFQKVQE----GVTILGDLVLKD 399
++L+F G + PE YL+ + C+G E + ++GD+ ++D
Sbjct: 337 LALSFKTGWRSKTLFEIPPEAYLI----ISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 392
Query: 400 KIVVYDLANQRIGWTNYDC 418
++++YD Q IGW DC
Sbjct: 393 QMIIYDNEKQSIGWMPVDC 411
>AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114705-29117150 REVERSE LENGTH=466
Length = 466
Score = 120 bits (300), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 40/383 (10%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNT-CSNCPQTSGLGIELNFFDTVGSTTAGLV 118
Y+ + +G+PP+ F++ IDTGSD+ WV C+ C+ C + + N +
Sbjct: 67 YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNH---------NTL 117
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
PCS +C+ C+ +QC Y Y D + + G V+D + + G
Sbjct: 118 PCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSI----M 173
Query: 179 SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX 238
+ + FGC Q GI G G G + + +QL S GIT V HC
Sbjct: 174 NLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKG 233
Query: 239 XXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCG 298
+ ++ L + P N +A +LL ++ N + D G
Sbjct: 234 FLSIGDELVPSSGVTWTSLATNSPSKNY----MAGPAELLFNDKTTGVKGIN--VVFDSG 287
Query: 299 TTLAYLVQEAYDPLVNAI---------TTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVS 349
++ Y EAY +++ I T + P+ KG + V F T++
Sbjct: 288 SSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTIT 347
Query: 350 LNFA---GGVSMLLKPEQYLMPYGFVDGAAMWCIGF----QKVQEGVTILGDLVLKDKIV 402
L F G + PE YL+ + C+G + EG I+GD+ + +V
Sbjct: 348 LRFGNQKNGQLFQVPPESYLI----ITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMV 403
Query: 403 VYDLANQRIGWTNYDCSLSVNVS 425
+YD QRIGW + DC NV+
Sbjct: 404 IYDNEKQRIGWISSDCDKLPNVN 426
>AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114946-29117150 REVERSE LENGTH=432
Length = 432
Score = 119 bits (297), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 151/376 (40%), Gaps = 40/376 (10%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNT-CSNCPQTSGLGIELNFFDTVGSTTAGLV 118
Y+ + +G+PP+ F++ IDTGSD+ WV C+ C+ C + + N +
Sbjct: 67 YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNH---------NTL 117
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
PCS +C+ C+ +QC Y Y D + + G V+D + + G
Sbjct: 118 PCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSI----M 173
Query: 179 SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX 238
+ + FGC Q GI G G G + + +QL S GIT V HC
Sbjct: 174 NLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKG 233
Query: 239 XXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCG 298
+ ++ L + P N +A +LL ++ N + D G
Sbjct: 234 FLSIGDELVPSSGVTWTSLATNSPSKNY----MAGPAELLFNDKTTGVKGIN--VVFDSG 287
Query: 299 TTLAYLVQEAYDPLVNAI---------TTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVS 349
++ Y EAY +++ I T + P+ KG + V F T++
Sbjct: 288 SSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTIT 347
Query: 350 LNFA---GGVSMLLKPEQYLMPYGFVDGAAMWCIGF----QKVQEGVTILGDLVLKDKIV 402
L F G + PE YL+ + C+G + EG I+GD+ + +V
Sbjct: 348 LRFGNQKNGQLFQVPPESYLI----ITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMV 403
Query: 403 VYDLANQRIGWTNYDC 418
+YD QRIGW + DC
Sbjct: 404 IYDNEKQRIGWISSDC 419
>AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19060485-19063248 REVERSE LENGTH=528
Length = 528
Score = 119 bits (297), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 157/375 (41%), Gaps = 38/375 (10%)
Query: 59 LYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNC---------PQTSGLGIELNFFDT 109
LY+ V +G+PP F V +DTGSD+ W+ CN + C PQ+ + LN +
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQS----VPLNLYTP 156
Query: 110 VGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMI 169
STT+ + CSD C + +CS + C Y Y + +GT G + D ++
Sbjct: 157 NASTTSSSIRCSDKRCF-----GSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATE 211
Query: 170 LGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFS 229
PV AN+ GC Q+G L + + +V+G+ G G SV S L+ IT FS
Sbjct: 212 DENLTPVK--ANVTLGCGQKQTG-LFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFS 268
Query: 230 HCXXXXXXXXXXXXXXXXXEPSIVYSPLVPSQPH--YNLNLQSIAVNGQLLSINQAVFAT 287
C +P + P Y +N+ ++V G + I +FA
Sbjct: 269 MCFGRVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNISGVSVAGDPVDIR--LFAK 326
Query: 288 SNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISK--GNQCYLVSTRVGDI- 344
D G++ +L + AY L + V P+ + CY +S I
Sbjct: 327 -------FDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPNATTIQ 379
Query: 345 FPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIG-FQKVQEGVTILGDLVLKDKIVV 403
FP V + F GG ++L + +G M+C+G + V + ++G + +V
Sbjct: 380 FPLVEMTFIGGSKIILNNPFFTART--QEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIV 437
Query: 404 YDLANQRIGWTNYDC 418
+D +GW C
Sbjct: 438 FDRERMILGWKQSLC 452
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 118 bits (296), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 111/413 (26%), Positives = 176/413 (42%), Gaps = 48/413 (11%)
Query: 21 VARLDF----IARDHYGEVLDMVTTCPLDIISPFILEAI-GFRLYFTKVKMGSPPREFNV 75
+ RLD I++ + M TT DI +P I G YFT+V +G P RE +
Sbjct: 104 ITRLDLAINNISKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYM 163
Query: 76 QIDTGSDILWVNCNTCSNC-PQTSGLGIELNFFDTVGSTTAGLVPCSDPICTSGVQGAAA 134
+DTGSD+ W+ C C++C QT + F+ S++ + C P C + +
Sbjct: 164 VLDTGSDVNWLQCTPCADCYHQTEPI------FEPSSSSSYEPLSCDTPQCNA---LEVS 214
Query: 135 ECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSSANIVFGCSTYQSGDL 194
EC C Y YGDGS T G + ++ + L Q N+ GC G
Sbjct: 215 ECRNAT--CLYEVSYGDGSYTVGDFATETLTIGSTLVQ--------NVAVGCGHSNEGLF 264
Query: 195 --TKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXXXXXXXX-XXXEPS 251
+ G P L+ S FS+C P
Sbjct: 265 VGAAGLLGLGGGLLALPSQLNTTS-----------FSYCLVDRDSDSASTVDFGTSLSPD 313
Query: 252 IVYSPLVPSQ---PHYNLNLQSIAVNGQLLSINQAVFA--TSNNRGTIVDCGTTLAYLVQ 306
V +PL+ + Y L L I+V G+LL I Q+ F S + G I+D GT + L
Sbjct: 314 AVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQT 373
Query: 307 EAYDPLVNA-ITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVSLNFAGGVSMLLKPEQY 365
E Y+ L ++ + + + ++ + CY +S + PTV+ +F GG + L + Y
Sbjct: 374 EIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNY 433
Query: 366 LMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYDLANQRIGWTNYDC 418
++P VD +C+ F + I+G++ + V +DLAN IG+++ C
Sbjct: 434 MIP---VDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19053480-19056152 REVERSE LENGTH=529
Length = 529
Score = 113 bits (282), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 170/379 (44%), Gaps = 32/379 (8%)
Query: 52 LEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQT-SGLGIE----LNF 106
++ +GF L++ V +G+P F V +DTGSD+ W+ CN S C + +G+ LN
Sbjct: 95 IDLLGF-LHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNL 153
Query: 107 FDTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQY-GDGSGTSGYYVSDAMY 165
+ S+T+ + CSD C ++ CS + C Y QY + T+G D ++
Sbjct: 154 YSPNTSSTSSSIRCSDDRCF-----GSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDVLH 208
Query: 166 FDMILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITP 225
PV ANI GC Q+G L ++ AV+G+ G G SV S L+ IT
Sbjct: 209 LVTEDEGLEPVK--ANITLGCGKNQTGFL-QSSAAVNGLLGLGLKDYSVPSILAKAKITA 265
Query: 226 KVFSHCXXXXXXXXXXXXXXXXXEPSIVYSPLVPSQPH--YNLNLQSIAVNGQLLSINQA 283
FS C + +PL+P++P Y +++ ++V G + +
Sbjct: 266 NSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQLL 325
Query: 284 VFATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISK--GNQCYLVS-TR 340
+ D GT+ +L++ Y + A V+ PI + CY +S +
Sbjct: 326 A---------LFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNK 376
Query: 341 VGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQK-VQEGVTILGDLVLKD 399
+FP V++ F GG M L+ +++ D +AM+C+G K V + I+G +
Sbjct: 377 TTILFPRVAMTFEGGSQMFLRNPLFIVWNE--DNSAMYCLGILKSVDFKINIIGQNFMSG 434
Query: 400 KIVVYDLANQRIGWTNYDC 418
+V+D +GW DC
Sbjct: 435 YRIVFDRERMILGWKRSDC 453
>AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16993339-16995721 FORWARD LENGTH=524
Length = 524
Score = 112 bits (281), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 159/379 (41%), Gaps = 39/379 (10%)
Query: 52 LEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGL----GIELNFF 107
+ ++GF L++T VK+G+P F V +DTGSD+ WV C+ C C T G EL+ +
Sbjct: 100 ISSLGF-LHYTTVKLGTPGMRFMVALDTGSDLFWVPCD-CGKCAPTEGATYASEFELSIY 157
Query: 108 DTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDG-SGTSGYYVSDAMYF 166
+ STT V C++ +C Q C + C Y Y + TSG + D M+
Sbjct: 158 NPKVSTTNKKVTCNNSLCAQRNQ-----CLGTFSTCPYMVSYVSAQTSTSGILMEDVMH- 211
Query: 167 DMILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPK 226
+ P A + FGC QSG A +G+FG G +SV S L+ G+
Sbjct: 212 -LTTEDKNPERVEAYVTFGCGQVQSGSFLDI-AAPNGLFGLGMEKISVPSVLAREGLVAD 269
Query: 227 VFSHCXXXXXXXXXXXXXXXXXEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLSINQAVFA 286
FS C + L PS P+YN+ + + V L+
Sbjct: 270 SFSMCFGHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTLI-------- 321
Query: 287 TSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAV-SQSTSPIISKGNQ-----CYLVSTR 340
+ + D GT+ YLV DP+ ++ + SQ+ S ++ CY +S
Sbjct: 322 -DDEFTALFDTGTSFTYLV----DPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSND 376
Query: 341 V-GDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKD 399
+ P++SL G + ++ +G ++C+ K E + I+G +
Sbjct: 377 ANASLIPSLSLTMKGNSHFTINDPIIVIS---TEGELVYCLAIVKSSE-LNIIGQNYMTG 432
Query: 400 KIVVYDLANQRIGWTNYDC 418
VV+D + W +DC
Sbjct: 433 YRVVFDREKLVLAWKKFDC 451
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 106 bits (264), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 163/380 (42%), Gaps = 56/380 (14%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
YFT++ +G+P R + +DTGSDI+W+ C C C S FD S T +P
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSD-----PIFDPRKSKTYATIP 196
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
CS P C + +A C+ + C Y YGDGS T G + ++ + F N
Sbjct: 197 CSSPHCR---RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRR--------NRV 245
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXX 239
+ GC G A + G LS Q R + FS+C
Sbjct: 246 KGVALGCGHDNEGLFVGAAGLLGLGK----GKLSFPGQTGHR--FNQKFSYC-------- 291
Query: 240 XXXXXXXXXEPSIV------------YSPLVPSQPH----YNLNLQSIAVNG-QLLSINQ 282
+PS V ++PL+ S P Y + L I+V G ++ +
Sbjct: 292 -LVDRSASSKPSSVVFGNAAVSRIARFTPLL-SNPKLDTFYYVGLLGISVGGTRVPGVTA 349
Query: 283 AVFATSN--NRGTIVDCGTTLAYLVQEAYDPLVNAITT-AVSQSTSPIISKGNQCYLVST 339
++F N G I+D GT++ L++ AY + +A A + +P S + C+ +S
Sbjct: 350 SLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSN 409
Query: 340 RVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKD 399
PTV L+F G + L YL+P VD +C F G++I+G++ +
Sbjct: 410 MNEVKVPTVVLHFR-GADVSLPATNYLIP---VDTNGKFCFAFAGTMGGLSIIGNIQQQG 465
Query: 400 KIVVYDLANQRIGWTNYDCS 419
VVYDLA+ R+G+ C+
Sbjct: 466 FRVVYDLASSRVGFAPGGCA 485
>AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3150843-3153380 FORWARD LENGTH=528
Length = 528
Score = 105 bits (261), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 162/379 (42%), Gaps = 43/379 (11%)
Query: 59 LYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGL-----GIELNFFDTVGST 113
L++T + +G+P F V +DTGS++LW+ CN P TS +LN ++ S+
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSS 158
Query: 114 TAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDG-SGTSGYYVSDAMYF-----D 167
T+ + CS +C S A++C QC YT Y G + +SG V D ++ +
Sbjct: 159 TSKVFLCSHKLCDS-----ASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNN 213
Query: 168 MILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKV 227
++ + V A +V GC QSGD A DG+ G GP +SV S LS G+
Sbjct: 214 RLMNGSSSVK--ARVVIGCGKKQSGDYLDG-VAPDGLMGLGPAEISVPSFLSKAGLMRNS 270
Query: 228 FSHCXXXXXXXXXXXXXXXXXEPSIVYSPLVPS--QPHYNLNLQSIAVNGQLLSINQAVF 285
FS C I + + PS Q L L + +G ++ +
Sbjct: 271 FSLCFDEEDSG------------RIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCI 318
Query: 286 ATS----NNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRV 341
S + T +D G + YL +E Y + I ++ +TS + Y +
Sbjct: 319 GNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHIN-ATSKNFEGVSWEYCYESSA 377
Query: 342 GDIFPTVSLNFAGGVSMLL-KPEQYLMPYGFVDGAAMWCIGFQKV-QEGVTILGDLVLKD 399
P + L F+ + ++ KP L + G +C+ QEG+ +G ++
Sbjct: 378 EPKVPAIKLKFSHNNTFVIHKP---LFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRG 434
Query: 400 KIVVYDLANQRIGWTNYDC 418
+V+D N ++GW+ C
Sbjct: 435 YRMVFDRENMKLGWSPSKC 453
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 102 bits (254), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 163/383 (42%), Gaps = 49/383 (12%)
Query: 69 PPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVPCSDPICTSG 128
PP+ ++ IDTGS++ W+ CN SN P +N FD S++ +PCS P C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSN-PN------PVNNFDPTRSSSYSPIPCSSPTCRTR 134
Query: 129 VQGAAAECSPQVNQ-CSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSSANIVFGCS 187
+ S ++ C T Y D S + G ++ +F G + + +N++FGC
Sbjct: 135 TRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHF----GNS---TNDSNLIFGCM 187
Query: 188 TYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXX---XXXXXXX 244
SG + D G+ G G+LS +SQ+ PK FS+C
Sbjct: 188 GSVSGSDPEEDTKTTGLLGMNRGSLSFISQMG----FPK-FSYCISGTDDFPGFLLLGDS 242
Query: 245 XXXXEPSIVYSPLVP-SQP-------HYNLNLQSIAVNGQLLSINQAVFATSNNRG--TI 294
+ Y+PL+ S P Y + L I VNG+LL I ++V + T+
Sbjct: 243 NFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTM 302
Query: 295 VDCGTTLAYLVQEAYDPL-------VNAITTAVSQSTSPIISKGNQCYLVS---TRVGDI 344
VD GT +L+ Y L N I T + CY +S R G +
Sbjct: 303 VDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGIL 362
Query: 345 --FPTVSLNFAGG-VSMLLKPEQYLMPYGFVDGAAMWCIGFQK---VQEGVTILGDLVLK 398
PTVSL F G +++ +P Y +P+ V +++C F + ++G +
Sbjct: 363 HRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQ 422
Query: 399 DKIVVYDLANQRIGWTNYDCSLS 421
+ + +DL RIG +C +S
Sbjct: 423 NMWIEFDLQRSRIGLAPVECDVS 445
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 100 bits (250), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 157/380 (41%), Gaps = 49/380 (12%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
+ ++ +G+P +++ +DTGSD++W C C+ C FD S++ V
Sbjct: 107 FLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTEC-----FDQPTPIFDPEKSSSYSKVG 161
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
CS +C + + + C+ + C Y + YGD S T G ++ F+ NS
Sbjct: 162 CSSGLCNALPR---SNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE-------DENSI 211
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXX 239
+ I FGC GD G+ G G G LS++SQL FS+C
Sbjct: 212 SGIGFGCGVENEGD---GFSQGSGLVGLGRGPLSLISQLKE-----TKFSYCLTSIEDSE 263
Query: 240 XXXX---------------XXXXXEPSIVYSPLV-PSQPH-YNLNLQSIAVNGQLLSINQ 282
E + S L P QP Y L LQ I V + LS+ +
Sbjct: 264 ASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEK 323
Query: 283 AVF--ATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG-NQCYLVST 339
+ F A G I+D GTT+ YL + A+ L T+ +S S G + C+ +
Sbjct: 324 STFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPD 383
Query: 340 RVGDI-FPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLK 398
+I P + +F G + L E Y++ + C+ G++I G++ +
Sbjct: 384 AAKNIAVPKMIFHFK-GADLELPGENYMVAD---SSTGVLCLAMGS-SNGMSIFGNVQQQ 438
Query: 399 DKIVVYDLANQRIGWTNYDC 418
+ V++DL + + + +C
Sbjct: 439 NFNVLHDLEKETVSFVPTEC 458
>AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108928-16110670 REVERSE LENGTH=401
Length = 401
Score = 98.6 bits (244), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 132/324 (40%), Gaps = 39/324 (12%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNT-CSNCPQTSGLGIELNFFDTVGSTTAGLV 118
Y + +G PPR + + +DTGSD+ W+ C+ C C + + ++ L+
Sbjct: 57 YNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAP---------HPLYQPSSDLI 107
Query: 119 PCSDPICTSGVQGAAAEC-SPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVN 177
PC+DP+C + + C +P+ QC Y +Y DG + G V D + G +
Sbjct: 108 PCNDPLCKALHLNSNQRCETPE--QCDYEVEYADGGSSLGVLVRDVFSMNYTQG----LR 161
Query: 178 SSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXX 237
+ + GC Y + +DG+ G G G +S++SQL S+G V HC
Sbjct: 162 LTPRLALGCG-YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 220
Query: 238 XXXXXXXXXXXEPSIVYSPLVPS-QPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVD 296
+ ++P+ HY S A+ G+LL + N T+ D
Sbjct: 221 GILFFGDDLYDSSRVSWTPMSREYSKHY-----SPAMGGELLFGGRT--TGLKNLLTVFD 273
Query: 297 CGTTLAYLVQEAYDPLV---------NAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPT 347
G++ Y +AY + + A T P+ +G + ++ V F
Sbjct: 274 SGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKP 333
Query: 348 VSLNFAGGVS----MLLKPEQYLM 367
++L+F G + PE YL+
Sbjct: 334 LALSFKTGWRSKTLFEIPPEAYLI 357
>AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19064294-19066560 REVERSE LENGTH=488
Length = 488
Score = 98.2 bits (243), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/432 (24%), Positives = 189/432 (43%), Gaps = 44/432 (10%)
Query: 53 EAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQT----SGLGIELNFFD 108
E I F L++ V +G+P + F V +DTGSD+ W+ CN S C ++ G I+LN ++
Sbjct: 83 EEISF-LHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIKLNIYN 141
Query: 109 TVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQY-GDGSGTSGYYVSDAMYFD 167
S ++ V C+ +C C V+ C Y +Y GS ++G V D ++
Sbjct: 142 PSKSKSSSKVTCNSTLC-----ALRNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMS 196
Query: 168 MILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKV 227
G+A A I FGCS Q G + AV+GI G ++V + L G+
Sbjct: 197 TEEGEA----RDARITFGCSESQLGLFKEV--AVNGIMGLAIADIAVPNMLVKAGVASDS 250
Query: 228 FSHCXXXXXXXXXXXXXXXXXEPSIVYSPL--VPSQPHYNLNLQSIAVNGQLLSINQAVF 285
FS C + + +PL S Y++++ V ++++
Sbjct: 251 FSMCFGPNGKGTISFGDKGSSDQ--LETPLSGTISPMFYDVSITKFKVGK--VTVDTEFT 306
Query: 286 ATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAV--SQSTSPIISKGNQCYLV-STRVG 342
AT D GT + +L++ Y L +V + + + S CY++ ST
Sbjct: 307 AT-------FDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDE 359
Query: 343 DIFPTVSLNFAGGVSM-LLKPEQYLMPYGFVDGA-AMWCIG-FQKVQEGVTILGDLVLKD 399
D P+VS GG + + P ++ + DG+ ++C+ ++V +I+G + +
Sbjct: 360 DKLPSVSFEMKGGAAYDVFSP---ILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTN 416
Query: 400 KIVVYDLANQRIGWTNYDCSLSVNVSVTSSKDEYISAGQLRVSSSESVTGILSKLLPVSI 459
+V+D + +GW +C+ + + + + +SS + S+L P +
Sbjct: 417 YRIVHDRERRILGWKKSNCN---DTNGFTGPTALAKPPSMAPTSSPRTINLSSRLNP--L 471
Query: 460 VAALSMHIVIFM 471
AA S+ I+ F+
Sbjct: 472 AAASSLFIICFI 483
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 98.2 bits (243), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 166/384 (43%), Gaps = 34/384 (8%)
Query: 55 IGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTT 114
+G YF V +GSPP+ F++ +DTGSD+ W+ C C +C Q +G F+D S +
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGA-----FYDPKASAS 219
Query: 115 AGLVPCSDPICT-SGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMIL-GQ 172
+ C+D C C C Y + YGD S T+G + + ++ G
Sbjct: 220 YKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGG 279
Query: 173 APPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCX 232
+ + + N++FGC + G A + G LS SQL S + FS+C
Sbjct: 280 SSELYNVENMMFGCGHWNRGLFHGAAGLLGLG----RGPLSFSSQLQS--LYGHSFSYCL 333
Query: 233 XXXXXXXXXXXXXXXXE-------PSIVYSPLVPSQPH-----YNLNLQSIAVNGQLLSI 280
E P++ ++ V + + Y + ++SI V G++L+I
Sbjct: 334 VDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNI 393
Query: 281 NQAVFATSNN--RGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG---NQCY 335
+ + S++ GTI+D GTTL+Y + AY+ + N I ++ P+ + C+
Sbjct: 394 PEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEK-AKGKYPVYRDFPILDPCF 452
Query: 336 LVSTRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDL 395
VS P + + FA G E + + D + +G K +I+G+
Sbjct: 453 NVSGIHNVQLPELGIAFADGAVWNFPTENSFI-WLNEDLVCLAMLGTPK--SAFSIIGNY 509
Query: 396 VLKDKIVVYDLANQRIGWTNYDCS 419
++ ++YD R+G+ C+
Sbjct: 510 QQQNFHILYDTKRSRLGYAPTKCA 533
>AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14682210-14683484 REVERSE LENGTH=424
Length = 424
Score = 98.2 bits (243), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 155/378 (41%), Gaps = 51/378 (13%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
+ + +G+PP + IDTGSD+ W++C C PQT + FF S+T
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQT------IPFFHPSRSSTYRNAS 131
Query: 120 CSDPICTSGVQGAAAECSPQV------NQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQA 173
C +A PQ+ C Y +Y D S T G + + F+
Sbjct: 132 CV----------SAPHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFET---SD 178
Query: 174 PPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXX 233
+ S NIVFGC SG TK G+ G GPG S+V +R K FS+C
Sbjct: 179 DGLISKQNIVFGCGQDNSG-FTK----YSGVLGLGPGTFSIV----TRNFGSK-FSYCFG 228
Query: 234 XXXXXXXXXXXXXXXEPSIVY---SPLVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNN 290
+ + +PL Q Y L+LQ+I+ +LL I F +
Sbjct: 229 SLTNPTYPHNILILGNGAKIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRS 288
Query: 291 R-GTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDI----- 344
+ GT++D G + L +EAY+ L I + + + K Y G++
Sbjct: 289 QGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRV--KDWDQYTTPCYEGNLKLDLY 346
Query: 345 -FPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGF-QKVQEGVTILGDLVLKDKIV 402
FP V+ +FAGG + L E + + +C+ + ++++G + ++ V
Sbjct: 347 GFPVVTFHFAGGAELALDVESLFVS---SESGDSFCLAMTMNTFDDMSVIGAMAQQNYNV 403
Query: 403 VYDLANQRIGWTNYDCSL 420
Y+L ++ + DC +
Sbjct: 404 GYNLRTMKVYFQRTDCEI 421
>AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:10185229-10186605 REVERSE LENGTH=458
Length = 458
Score = 96.7 bits (239), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 149/385 (38%), Gaps = 40/385 (10%)
Query: 51 ILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTV 110
+ +AI L+ +G PP +DTGS +LW+ C C +C + F+
Sbjct: 87 VEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIH---PVFNPA 143
Query: 111 GSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMIL 170
S+T C D C G N+C Y Y G+G+ G + + F
Sbjct: 144 LSSTFVECSCDDRFCRYAPNGHCGSS----NKCVYEQVYISGTGSKGVLAKERLTF---- 195
Query: 171 GQAPPVNS--SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVF 228
P N+ + I FGC Y++G+ + + GI G G S+ QL S+ F
Sbjct: 196 -TTPNGNTVVTQPIAFGCG-YENGE--QLESHFTGILGLGAKPTSLAVQLGSK------F 245
Query: 229 SHCXXXXXXXXXXXXXXXXXEPSIVYSPLVPSQPH-----YNLNLQSIAVNGQLLSINQA 283
S+C E + + P + Y +NL+ I+V L+I
Sbjct: 246 SYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPV 305
Query: 284 VFATSNNR-GTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVG 342
VF R G I+D GT +L AY L N I + + + CY RV
Sbjct: 306 VFKRRGPRTGVILDSGTLYTWLADIAYRELYNEIKSILDPKLERFWFRDFLCY--HGRVS 363
Query: 343 DI---FPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEG------VTILG 393
+ FP V+ +FAGG + ++ P + ++C+ + +E T +G
Sbjct: 364 EELIGFPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIG 423
Query: 394 DLVLKDKIVVYDLANQRIGWTNYDC 418
+ + + YDL + I DC
Sbjct: 424 LMAQQYYNIGYDLKEKNIYLQRIDC 448
>AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14685602-14686885 FORWARD LENGTH=427
Length = 427
Score = 94.7 bits (234), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 156/376 (41%), Gaps = 73/376 (19%)
Query: 21 VARLDFIARDHYGEVLDMVTTCPLDIISPFILEAIGFRLYFTKVKMGSPPREFNVQIDTG 80
V RL+++ G+++ ++ P I +A + + +GSPP + +DT
Sbjct: 56 VERLEYLKAKTTGDIIAHLSPN-----VPIIPQA-----FLVNISIGSPPITQLLHMDTA 105
Query: 81 SDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVPCSDPICTSGVQGAAAECSPQV 140
SD+LW+ C C NC S L FD S T C TS + + +
Sbjct: 106 SDLLWIQCLPCINCYAQS-----LPIFDPSRSYTHRNETCR----TSQYSMPSLKFNANT 156
Query: 141 NQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSSANIVFGCSTYQSGDLTKADKA 200
C Y+ +Y D +G+ G + + F+ I ++ + ++VFGC G+
Sbjct: 157 RSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSA-ALHDVVFGCGHDNYGEPLVG--- 212
Query: 201 VDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXXXXXXXXXXXEPSIVYSPLV-- 258
GI G G G S+V + K FS+C +PS ++ LV
Sbjct: 213 -TGILGLGYGEFSLVHRFG------KKFSYC------------FGSLDDPSYPHNVLVLG 253
Query: 259 -----------PSQPH---YNLNLQSIAVNGQLLSINQAVFATSNNR---GTIVDCGTTL 301
P + H Y + +++I+V+G +L I+ VF ++ GTI+D G +L
Sbjct: 254 DDGANILGDTTPLEIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSL 313
Query: 302 AYLVQEAYDPLVNAIT-------TAVSQSTSPIISKGNQCY---LVSTRVGDIFPTVSLN 351
LV+EAY PL N I TA S +I +CY V FP V+ +
Sbjct: 314 TSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKM--ECYNGNFERDLVESGFPIVTFH 371
Query: 352 FAGGVSMLLKPEQYLM 367
F+ G + L + M
Sbjct: 372 FSEGAELSLDVKSLFM 387
>AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:22880074-22881525 REVERSE LENGTH=483
Length = 483
Score = 94.4 bits (233), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 155/375 (41%), Gaps = 42/375 (11%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNC-PQTSGLGIELNFFDTVGSTTAGLV 118
YF ++ +G+P + +DTGSD++W+ C+ C C QT + FD S T V
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAI------FDPKKSKTFATV 188
Query: 119 PCSDPICTSGVQGAAAEC-SPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVN 177
PC +C ++EC + + C Y YGDGS T G + ++ + F P+
Sbjct: 189 PCGSRLCRR--LDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPL- 245
Query: 178 SSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXX 237
GC G A + G LS SQ +R FS+C
Sbjct: 246 -------GCGHDNEGLFVGAAGLLGLG----RGGLSFPSQTKNR--YNGKFSYCLVDRTS 292
Query: 238 XXXXXXXX-------XXXEPSIVYSPLVPS---QPHYNLNLQSIAVNGQL---LSINQAV 284
+ V++PL+ + Y L L I+V G +S +Q
Sbjct: 293 SGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFK 352
Query: 285 FATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQ-STSPIISKGNQCYLVSTRVGD 343
+ N G I+D GT++ L Q AY L +A ++ +P S + C+ +S
Sbjct: 353 LDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTV 412
Query: 344 IFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVV 403
PTV +F GG + L YL+P V+ +C F ++I+G++ + V
Sbjct: 413 KVPTVVFHFGGG-EVSLPASNYLIP---VNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVA 468
Query: 404 YDLANQRIGWTNYDC 418
YDL R+G+ + C
Sbjct: 469 YDLVGSRVGFLSRAC 483
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 94.0 bits (232), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 157/373 (42%), Gaps = 42/373 (11%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNC-PQTSGLGIELNFFDTVGSTTAGLV 118
Y + +G+PP DTGSD++W CN C +C QTS L FD S+T V
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPL------FDPKESSTYRKV 139
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
CS C + A CS N CSYT YGD S T G D + M PV S
Sbjct: 140 SCSSSQCRALED---ASCSTDENTCSYTITYGDNSYTKGDVAVDTV--TMGSSGRRPV-S 193
Query: 179 SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX 238
N++ GC +G D A GI G G G+ S+VSQL + I K FS+C
Sbjct: 194 LRNMIIGCGHENTGTF---DPAGSGIIGLGGGSTSLVSQL-RKSINGK-FSYCLVPFTSE 248
Query: 239 XXXXXXXXXXEPSI------VYSPLVPSQP--HYNLNLQSIAVNGQLLSINQAVFATSNN 290
I V + +V P +Y LNL++I+V + + +F T
Sbjct: 249 TGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEG 308
Query: 291 RGTIVDCGTTLAYLVQEAYDPLVNAITTAVS----QSTSPIISKGNQCYLVSTRVGDIFP 346
++D GTTL L Y L + + + + Q I+S CY S+ P
Sbjct: 309 -NIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILS---LCYRDSSSFK--VP 362
Query: 347 TVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYDL 406
++++F GG L ++ + C F E +TI G+L + +V YD
Sbjct: 363 DITVHFKGGDVKLGNLNTFVAV-----SEDVSCFAF-AANEQLTIFGNLAQMNFLVGYDT 416
Query: 407 ANQRIGWTNYDCS 419
+ + + DCS
Sbjct: 417 VSGTVSFKKTDCS 429
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 93.2 bits (230), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 159/385 (41%), Gaps = 44/385 (11%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
+F + +G+PP + DTGSD+ WV C C C + +G FD S+T P
Sbjct: 85 FFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENG-----PIFDKKKSSTYKSEP 139
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
C C + + C N C Y + YGD S + G ++ + D G PV S
Sbjct: 140 CDSRNCQA-LSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGS--PV-SF 195
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXX 239
VFGC G D+ GI G G G LS++SQL S K FS+C
Sbjct: 196 PGTVFGCGYNNGGTF---DETGSGIIGLGGGHLSLISQLGSS--ISKKFSYCLSHKSATT 250
Query: 240 XXXX----------XXXXXEPSIVYSPLVPSQP--HYNLNLQSIAV-------NGQLLSI 280
+ +V +PLV +P +Y L L++I+V G +
Sbjct: 251 NGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNP 310
Query: 281 NQAVFATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG--NQCYLV- 337
N + + I+D GTTL L +D +A+ +V+ + +G + C+
Sbjct: 311 NDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSG 370
Query: 338 STRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVL 397
S +G P ++++F G + L P + M C+ E V I G+
Sbjct: 371 SAEIG--LPEITVHFT-GADVRLSPINAFVKL----SEDMVCLSMVPTTE-VAIYGNFAQ 422
Query: 398 KDKIVVYDLANQRIGWTNYDCSLSV 422
D +V YDL + + + + DCS ++
Sbjct: 423 MDFLVGYDLETRTVSFQHMDCSANL 447
>AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:18241003-18242478 FORWARD LENGTH=491
Length = 491
Score = 92.0 bits (227), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 107/420 (25%), Positives = 169/420 (40%), Gaps = 77/420 (18%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNT----CSNC----------PQT-SGLGIEL 104
Y + +G+PP+ V +DTGSD+ WV C C C P S L
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142
Query: 105 NFFDTVGSTTAGLVPCSD----PICTSGVQGAAAECSPQVNQC-SYTFQYGDGSGTSGYY 159
+F D+ S+ + SD P +G + S V C S+ + YG+G SG
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202
Query: 160 VSDAMYFDMILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLS 219
D + FGC +T + GI GFG G LS+ SQL
Sbjct: 203 TRDIL--------KARTRDVPRFSFGC-------VTSTYREPIGIAGFGRGLLSLPSQL- 246
Query: 220 SRGITPKVFSHC----------XXXXXXXXXXXXXXXXXEPSIVYSPLV--PSQPH-YNL 266
G K FSHC S+ ++P++ P P+ Y +
Sbjct: 247 --GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYI 304
Query: 267 NLQSIAVNGQL------LSINQAVFATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAV 320
L+SI + + L++ Q F + N G +VD GTT +L + Y L+ + + +
Sbjct: 305 GLESITIGTNITPTQVPLTLRQ--FDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTI 362
Query: 321 S--QSTSPIISKG-NQCYLV----------STRVGDIFPTVSLNFAGGVSMLL-KPEQYL 366
+ ++T G + CY V V IFP+++ +F ++LL + +
Sbjct: 363 TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFY 422
Query: 367 MPYGFVDGAAMWCIGFQKVQEG----VTILGDLVLKDKIVVYDLANQRIGWTNYDCSLSV 422
DG+ + C+ FQ +++G + G ++ VVYDL +RIG+ DC L
Sbjct: 423 AMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCVLEA 482
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 91.7 bits (226), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 112/424 (26%), Positives = 170/424 (40%), Gaps = 56/424 (13%)
Query: 9 KVNFVAPILVKKVARLDFIARDHYGEVLDMVTTCPLDIISPFILEAIGFRLYFTKVKMGS 68
+VN + L KK +A DH E T P S +G Y V +G+
Sbjct: 94 RVNSIHSKLSKK------LATDHVSE--SKSTDLPAKDGS-----TLGSGNYIVTVGLGT 140
Query: 69 PPREFNVQIDTGSDILWVNCNTC-SNCPQTSGLGIELNFFDTVGSTTAGLVPCSDPICT- 126
P + ++ DTGSD+ W C C C + F+ ST+ V CS C
Sbjct: 141 PKNDLSLIFDTGSDLTWTQCQPCVRTCYDQ-----KEPIFNPSKSTSYYNVSCSSAACGS 195
Query: 127 -SGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYF---DMILGQAPPVNSSANI 182
S G A CS + C Y QYGD S + G+ + D+ G +
Sbjct: 196 LSSATGNAGSCS--ASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDG----------V 243
Query: 183 VFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCX-XXXXXXXXX 241
FGC G T V G+ G G LS SQ ++ K+FS+C
Sbjct: 244 YFGCGENNQGLFT----GVAGLLGLGRDKLSFPSQTAT--AYNKIFSYCLPSSASYTGHL 297
Query: 242 XXXXXXXEPSIVYSP---LVPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCG 298
S+ ++P + Y LN+ +I V GQ L I VF+T G ++D G
Sbjct: 298 TFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP---GALIDSG 354
Query: 299 TTLAYLVQEAYDPLVNAITTAVSQ-STSPIISKGNQCYLVSTRVGDIFPTVSLNFAGGVS 357
T + L +AY L ++ +S+ T+ +S + C+ +S P V+ +F+GG
Sbjct: 355 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAV 414
Query: 358 MLLKPEQYLMPYGFVDGAAMWCIGFQKVQE--GVTILGDLVLKDKIVVYDLANQRIGWTN 415
+ L + +V + C+ F + I G++ + VVYD A R+G+
Sbjct: 415 VELGSKGIF----YVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAP 470
Query: 416 YDCS 419
CS
Sbjct: 471 NGCS 474
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 90.1 bits (222), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/387 (24%), Positives = 166/387 (42%), Gaps = 36/387 (9%)
Query: 54 AIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGST 113
+G YF V +G+PP+ F++ +DTGSD+ W+ C C +C +G+ F+D S
Sbjct: 154 TLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGM-----FYDPKTSA 208
Query: 114 TAGLVPCSDPICTS-GVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQ 172
+ + C+DP C+ +C C Y + YGD S T+G + + ++ +
Sbjct: 209 SFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTE 268
Query: 173 APPVNSS-ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHC 231
N++FGC + G + A + G LS SQL S + FS+C
Sbjct: 269 GGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLG----RGPLSFSSQLQS--LYGHSFSYC 322
Query: 232 XXXXXXXXXXXXXXXXXEP-------SIVYSPLVPSQPH-----YNLNLQSIAVNGQLLS 279
E ++ ++ V + + Y + ++SI V G+ L
Sbjct: 323 LVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALD 382
Query: 280 INQAVFATSN--NRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG---NQC 334
I + + S+ + GTI+D GTTL+Y + AY+ + N + ++ PI + C
Sbjct: 383 IPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENY-PIFRDFPVLDPC 441
Query: 335 YLVS-TRVGDI-FPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTIL 392
+ VS +I P + + F G E + + D + +G K +I+
Sbjct: 442 FNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFI-WLSEDLVCLAILGTPK--STFSII 498
Query: 393 GDLVLKDKIVVYDLANQRIGWTNYDCS 419
G+ ++ ++YD R+G+T C+
Sbjct: 499 GNYQQQNFHILYDTKRSRLGFTPTKCA 525
>AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19057013-19059788 REVERSE LENGTH=530
Length = 530
Score = 88.6 bits (218), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 92/387 (23%), Positives = 151/387 (39%), Gaps = 43/387 (11%)
Query: 52 LEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQ-----TSGLGIELNF 106
L +GF L++ V +G+P F V +DTGSD+ W+ CN + C + LN
Sbjct: 96 LNFLGF-LHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNL 154
Query: 107 FDTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYF 166
+ STT+ + CSD C + +CS + C Y + T+G + D ++
Sbjct: 155 YTPNASTTSSSIRCSDKRCF-----GSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHL 209
Query: 167 DMILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPK 226
PVN AN+ GC Q+G + D AV+G+ G SV S L+ IT
Sbjct: 210 VTEDEDLKPVN--ANVTLGCGQNQTGAF-QTDIAVNGVLGLSMKEYSVPSLLAKANITAN 266
Query: 227 VFSHCXXXXXXXXXXXXXXXXXEPSIVYSPLV--PSQPHYNLNLQSIAVNGQLLSINQAV 284
FS C +PLV + Y +N+ ++V G + ++ +
Sbjct: 267 SFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGVNVTGVSVGG--VPVDVPL 324
Query: 285 FATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISK--GNQCYLVSTRV- 341
FA + D G++ L++ AY A + P+ CY +
Sbjct: 325 FA-------LFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHL 377
Query: 342 -GDIFPT---------VSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTI 391
D P +F + Q + Y +G M+C+G K + I
Sbjct: 378 NSDARPRHMQSKCYNPCRDDFRWRIQ---NDSQESVSYS-NEGTKMYCLGILK-SINLNI 432
Query: 392 LGDLVLKDKIVVYDLANQRIGWTNYDC 418
+G ++ +V+D +GW +C
Sbjct: 433 IGQNLMSGHRIVFDRERMILGWKQSNC 459
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 88.2 bits (217), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/409 (23%), Positives = 158/409 (38%), Gaps = 63/409 (15%)
Query: 52 LEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNT---CSNCPQTSGLGIEL-NFF 107
L A + Y + G+P + DTGS ++W+ C + CS C SGL L F
Sbjct: 82 LSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGC-DFSGLDPTLIPRF 140
Query: 108 DTVGSTTAGLVPCSDPICT--SGVQGAAAECSPQVNQCS-----YTFQYGDGSGTSGYYV 160
S+++ ++ C P C G C P C+ Y QYG GS T+G +
Sbjct: 141 IPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLI 199
Query: 161 SDAMYFDMILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSS 220
++ + F P + + V GCS + + GI GFG G +S+ SQ++
Sbjct: 200 TEKLDF--------PDLTVPDFVVGCSIIST-------RQPAGIAGFGRGPVSLPSQMNL 244
Query: 221 RGITPKVFSHCXXXXXXXXXXXXXXXXXE-----------PSIVYSPLVPSQ-------- 261
K FSHC + P + Y+P +
Sbjct: 245 -----KRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFL 299
Query: 262 PHYNLNLQSIAVNGQLLSINQAVFA--TSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTA 319
+Y LNL+ I V + + I A T+ + G+IVD G+T ++ + ++ + +
Sbjct: 300 EYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQ 359
Query: 320 VSQSTSPIISKGNQ----CYLVSTRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGA 375
+S T + C+ +S + P + F GG + L Y G D
Sbjct: 360 MSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTV 419
Query: 376 AMWCIGFQKVQEG-----VTILGDLVLKDKIVVYDLANQRIGWTNYDCS 419
+ + + V ILG ++ +V YDL N R G+ CS
Sbjct: 420 CLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 87.8 bits (216), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 158/372 (42%), Gaps = 38/372 (10%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
Y + K+G+PP+ + +DT +D +W+ C+ CS C S T S+T V
Sbjct: 104 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFN------TNSSSTYSTVS 157
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
CS CT SPQ + CS+ YG S S V D + AP V
Sbjct: 158 CSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL------APDV--I 209
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXX-- 237
N FGC SG+ G+ G G G +S+VSQ +S + VFS+C
Sbjct: 210 PNFSFGCINSASGN----SLPPQGLMGLGRGPMSLVSQTTS--LYSGVFSYCLPSFRSFY 263
Query: 238 -XXXXXXXXXXXEPSIVYSPLV--PSQPH-YNLNLQSIAVNGQLLSINQA--VFATSNNR 291
SI Y+PL+ P +P Y +NL ++V + ++ F ++
Sbjct: 264 FSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGA 323
Query: 292 GTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVSLN 351
GTI+D GT + Q Y+ + + V+ S+ + + C+ S ++ P ++L+
Sbjct: 324 GTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAFDTCF--SADNENVAPKITLH 381
Query: 352 FAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEG----VTILGDLVLKDKIVVYDLA 407
+ + L E L+ + C+ +++ + ++ +L ++ +++D+
Sbjct: 382 MT-SLDLKLPMENTLI---HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVP 437
Query: 408 NQRIGWTNYDCS 419
N RIG C+
Sbjct: 438 NSRIGIAPEPCN 449
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 87.0 bits (214), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 108/409 (26%), Positives = 164/409 (40%), Gaps = 73/409 (17%)
Query: 53 EAIGFRLYFT---KVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDT 109
+ + FR T + +G PP+ ++ +DTGS++ W++C N LG + F+
Sbjct: 55 DKLSFRHNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPN------LG---SVFNP 105
Query: 110 VGSTTAGLVPCSDPICTSGVQGA--AAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFD 167
V S+T VPCS PIC + + A C P+ + C Y D + G +
Sbjct: 106 VSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHET---- 161
Query: 168 MILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKV 227
++G + +FGC ++ D G+ G G+LS V+QL G +
Sbjct: 162 FVIGSV----TRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL---GFSK-- 212
Query: 228 FSHCXXXXXXXXXXXXXXXXXE--PSIVYSPLV-PSQP-------HYNLNLQSIAVNGQL 277
FS+C I Y+PLV S P Y + L+ I V ++
Sbjct: 213 FSYCISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKI 272
Query: 278 LSINQAVFATSNNRG--TIVDCGTTLAYLVQEAYDPLVNAITTAVSQ-----STSPIISK 330
LS+ ++VF + T+VD GT +L+ Y L N T + +
Sbjct: 273 LSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQ 332
Query: 331 GNQ--CYLVSTRVGDIF---PTVSLNFAG------GVSMLL---------KPEQYLMPYG 370
G CY V + F P VSL F G G +L K E Y +G
Sbjct: 333 GTMDLCYKVGSTTRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG 392
Query: 371 FVD--GAAMWCIGFQKVQEGVTILGDLVLKDKI-----VVYDLANQRIG 412
D G + IG Q+ V + DL K ++ V DLA+QR+G
Sbjct: 393 NSDLLGIEAFVIGHHH-QQNVWMEFDLA-KSRVGFAGNVRCDLASQRLG 439
>AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6978746-6980158 REVERSE LENGTH=470
Length = 470
Score = 85.5 bits (210), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 149/362 (41%), Gaps = 35/362 (9%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
YF ++ +GSPPR+ + ID+GSD++WV C C C + S FD S + V
Sbjct: 131 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSD-----PVFDPAKSGSYTGVS 185
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
C +C ++ + C Y YGDGS T G + + F + +
Sbjct: 186 CGSSVCDR-IENSGCHS----GGCRYEVMYGDGSYTKGTLALETLTFAKTVVR------- 233
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXX 239
N+ GC G A + G++S V QLS G T F +C
Sbjct: 234 -NVAMGCGHRNRGMFIGAAGLLGIGG----GSMSFVGQLS--GQTGGAFGYCLVSRGTDS 286
Query: 240 XXXXXXXXXEPSIVYS--PLV--PSQPH-YNLNLQSIAVNGQLLSINQAVF--ATSNNRG 292
+ S PLV P P Y + L+ + V G + + VF + + G
Sbjct: 287 TGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGG 346
Query: 293 TIVDCGTTLAYLVQEAYDPLVNAITTAVSQ-STSPIISKGNQCYLVSTRVGDIFPTVSLN 351
++D GT + L AY + + + + +S + CY +S V PTVS
Sbjct: 347 VVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 406
Query: 352 FAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYDLANQRI 411
F G + L +LMP VD + +C F G++I+G++ + V +D AN +
Sbjct: 407 FTEGPVLTLPARNFLMP---VDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFV 463
Query: 412 GW 413
G+
Sbjct: 464 GF 465
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 85.5 bits (210), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 90/374 (24%), Positives = 150/374 (40%), Gaps = 50/374 (13%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
Y + +G+P + V +DT +D W+ C+ C C + FD S+++ +
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV-------LFDPSKSSSSRTLQ 140
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYF--DMILGQAPPVN 177
C P C Q C+ C + YG GS Y D + D+I
Sbjct: 141 CEAPQCK---QAPNPSCTVS-KSCGFNMTYG-GSTIEAYLTQDTLTLASDVI-------- 187
Query: 178 SSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXX 237
N FGC SG A G+ G G G LS++SQ S+ + FS+C
Sbjct: 188 --PNYTFGCINKASGTSLPA----QGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKS 239
Query: 238 XXXXXXXXX--XXEP-SIVYSPLVPSQPH---YNLNLQSIAVNGQLLSINQAVFA--TSN 289
+P I +PL+ + Y +NL I V +++ I + A +
Sbjct: 240 SNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPAT 299
Query: 290 NRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVSTRVGDIFPTVS 349
GTI D GT LV+ AY + N V + + + + CY S +FP+V+
Sbjct: 300 GAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCYSGSV----VFPSVT 355
Query: 350 LNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQ----KVQEGVTILGDLVLKDKIVVYD 405
FA G+++ L P+ L+ + C+ V + ++ + ++ V+ D
Sbjct: 356 FMFA-GMNVTLPPDNLLI---HSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLID 411
Query: 406 LANQRIGWTNYDCS 419
+ N R+G + C+
Sbjct: 412 VPNSRLGISRETCT 425
>AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24647221-24648513 FORWARD LENGTH=430
Length = 430
Score = 83.6 bits (205), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/393 (23%), Positives = 158/393 (40%), Gaps = 72/393 (18%)
Query: 64 VKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVPCSDP 123
+ +G+PP+ + +DTGS + W+ C+ P+ FD S++ +PCS P
Sbjct: 76 LPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPK------TSFDPSLSSSFSTLPCSHP 129
Query: 124 ICTSGVQGAAAECSPQVNQ-CSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSSANI 182
+C + S N+ C Y++ Y DG+ G V + + F P +
Sbjct: 130 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPP-------L 182
Query: 183 VFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX---X 239
+ GC+T S D GI G G LS VSQ FS+C
Sbjct: 183 ILGCATESSDD--------RGILGMNRGRLSFVSQAKI-----SKFSYCIPPKSNRPGFT 229
Query: 240 XXXXXXXXXEP--------SIVYSPLVPSQPH-----YNLNLQSIAVNGQLLSINQAVFA 286
P S++ P P+ Y + + I + L+I+ +VF
Sbjct: 230 PTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFR 289
Query: 287 --TSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG-------NQCY-- 335
+ T+VD G+ +LV AYD + I T V + + KG + C+
Sbjct: 290 PDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRR----LKKGYVYGGTADMCFDG 345
Query: 336 ---LVSTRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQK---VQEGV 389
++ +GD+ F GV +L+ E+ L+ G + C+G + +
Sbjct: 346 NVAMIPRLIGDLV----FVFTRGVEILVPKERVLVNV----GGGIHCVGIGRSSMLGAAS 397
Query: 390 TILGDLVLKDKIVVYDLANQRIGWTNYDCSLSV 422
I+G++ ++ V +D+ N+R+G+ DCS V
Sbjct: 398 NIIGNVHQQNLWVEFDVTNRRVGFAKADCSRVV 430
>AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:12033953-12037527 FORWARD LENGTH=756
Length = 756
Score = 82.0 bits (201), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 88/378 (23%), Positives = 154/378 (40%), Gaps = 40/378 (10%)
Query: 48 SPFILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFF 107
SP+ + +Y K+++G+PP E +IDTGSDI+W C C NC
Sbjct: 409 SPYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNC------------- 455
Query: 108 DTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFD 167
+ P DP +S + E N C Y Y D + + G ++ +
Sbjct: 456 ------YSQFAPIFDPSKSSTFR----EQRCNGNSCHYEIIYADKTYSKGILATETVTIP 505
Query: 168 MILGQAPPVNSSANIVFGCSTYQSG-DLTKADKAVDGIFGFGPGALSVVSQLSSRGITPK 226
G+ P V + I GC + + + GI G G LS++SQ+ P
Sbjct: 506 STSGE-PFVMAETKI--GCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLP--YPG 560
Query: 227 VFSHCXXXXXXXXXX--XXXXXXXEPSIVYSPLVPS-QPHYNLNLQSIAVNGQLLSINQA 283
+ S+C + ++ + P Y LNL +++V L++
Sbjct: 561 LISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGT 620
Query: 284 VFATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGN-QCYLVSTRVG 342
F + +D GTTL Y + + A+ V+ P + N CY T
Sbjct: 621 PFHAEDGN-IFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTI-- 677
Query: 343 DIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGV-TILGDLVLKDKI 401
DIFP ++++F+GG ++L ++Y M + G ++C+ + + G+ + +
Sbjct: 678 DIFPVITMHFSGGADLVL--DKYNMYLETITG-GIFCLAIGCNDPSMPAVFGNRAQNNFL 734
Query: 402 VVYDLANQRIGWTNYDCS 419
V YD ++ I ++ +CS
Sbjct: 735 VGYDPSSNVISFSPTNCS 752
Score = 72.8 bits (177), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/366 (22%), Positives = 147/366 (40%), Gaps = 44/366 (12%)
Query: 48 SPFILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFF 107
SP+ + +Y K+++G+PP E +IDTGSD++W C C +C + F
Sbjct: 70 SPYADTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDC---------YSQF 120
Query: 108 DTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFD 167
D P DP +S E C Y Y D + + G ++ +
Sbjct: 121 D----------PIFDPSKSSTFN----EQRCHGKSCHYEIIYEDNTYSKGILATETVTIH 166
Query: 168 MILGQAPPVNSSANIVFGCSTYQSG-DLTKADKAVDGIFGFGPGALSVVSQLSSRGITPK 226
G+ P V + I GC + + D + + GI G G S++SQ+ P
Sbjct: 167 STSGE-PFVMAETTI--GCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLP--YPG 221
Query: 227 VFSHCXXXXXXXXXX--XXXXXXXEPSIVYSPLVPS-QPHYNLNLQSIAVNGQLLSINQA 283
+ S+C + ++ + P Y LNL +++V +
Sbjct: 222 LISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGT 281
Query: 284 VFATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQ-CYLVSTRVG 342
F + ++D G+T+ Y + + A+ V+ P S + CY T
Sbjct: 282 PFHAEDGN-IVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYFSETI-- 338
Query: 343 DIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGF---QKVQEGVTILGDLVLKD 399
DIFP ++++F+GG ++L ++Y M Y + ++C+ QE I G+ +
Sbjct: 339 DIFPVITMHFSGGADLVL--DKYNM-YMESNSGGLFCLAIICNSPTQEA--IFGNRAQNN 393
Query: 400 KIVVYD 405
+V YD
Sbjct: 394 FLVGYD 399
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 80.5 bits (197), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/377 (21%), Positives = 150/377 (39%), Gaps = 38/377 (10%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGL--GIELNFFDTVGSTTAGL 117
YFT++++G+P ++F V +DTGS++ WVNC + + E F TVG
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVG------ 159
Query: 118 VPCSDPICTSGVQG--AAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPP 175
C C + + C CSY ++Y DGS G + + + + G+
Sbjct: 160 --CLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMAR 217
Query: 176 VNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXX 235
+ + GCS+ +G ++ + DG+ G S S +S + FS+C
Sbjct: 218 LPGH---LIGCSSSFTG---QSFQGADGVLGLAFSDFSFTSTATS--LYGAKFSYCLVDH 269
Query: 236 XXXXXXXXXX---XXXEPSIVYSPLVPSQ-----PHYNLNLQSIAVNGQLLSINQAVFAT 287
+ P P Y +N+ I++ +L I V+
Sbjct: 270 LSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDA 329
Query: 288 SNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQ--STSPIISKGNQCYLVSTRVG-DI 344
++ GTI+D GT+L L AY +V + + + P C+ ++
Sbjct: 330 TSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSK 389
Query: 345 FPTVSLNFAGGVSMLLKPEQYLMPYGFVDGA-AMWCIGFQKVQEGVT-ILGDLVLKDKIV 402
P ++ + GG + YL VD A + C+GF T ++G+++ ++ +
Sbjct: 390 LPQLTFHLKGGARFEPHRKSYL-----VDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLW 444
Query: 403 VYDLANQRIGWTNYDCS 419
+DL + + C+
Sbjct: 445 EFDLMASTLSFAPSACT 461
>AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11934208-11935386 REVERSE LENGTH=392
Length = 392
Score = 79.7 bits (195), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 135/332 (40%), Gaps = 56/332 (16%)
Query: 48 SPFILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFF 107
SP+ + +Y K+++G+PP E +IDTGSD++W C C+NC
Sbjct: 49 SPYADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNC------------- 95
Query: 108 DTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFD 167
+ P DP +S + E N C Y Y D + + G ++ +
Sbjct: 96 ------YSQYAPIFDPSNSSTFK----EKRCNGNSCHYKIIYADTTYSKGTLATETVTIH 145
Query: 168 MILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKV 227
G+ P V I GC S G+ G G S+++Q+ G P +
Sbjct: 146 STSGE-PFVMPETTI--GCGHNSS----WFKPTFSGMVGLSWGPSSLITQMG--GEYPGL 196
Query: 228 FSHCXXXXXXXXXXXXXXXXXEPSIVYSP---LVPSQPH-YNLNLQSIAV-NGQLLSINQ 282
S+C V S L ++P Y LNL +++V + + ++
Sbjct: 197 MSYCFASQGTSKINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGT 256
Query: 283 AVFATSNNRGTIVDCGTTLAY-------LVQEAYDPLVNAITTAVSQSTSPIISKGNQCY 335
A N I+D GTTL Y LV+EA D V A+ TA P GN
Sbjct: 257 TFHALEGN--IIIDSGTTLTYFPVSYCNLVREAVDHYVTAVRTA-----DP---TGNDML 306
Query: 336 LVSTRVGDIFPTVSLNFAGGVSMLLKPEQYLM 367
T DIFP ++++F+GG ++L ++Y M
Sbjct: 307 CYYTDTIDIFPVITMHFSGGADLVL--DKYNM 336
>AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14912862-14914190 FORWARD LENGTH=442
Length = 442
Score = 79.3 bits (194), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 87/388 (22%), Positives = 150/388 (38%), Gaps = 62/388 (15%)
Query: 64 VKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVPCSDP 123
+ +G+P + + +DTGS + W+ C+ FD S++ +PCS P
Sbjct: 84 LPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTS---FDPSLSSSFSDLPCSHP 140
Query: 124 ICTSGVQGAAAECSPQVNQ-CSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSSANI 182
+C + S N+ C Y++ Y DG+ G V + F P +
Sbjct: 141 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPP-------L 193
Query: 183 VFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX---X 239
+ GC+ K GI G G LS +SQ FS+C
Sbjct: 194 ILGCA--------KESTDEKGILGMNLGRLSFISQAKI-----SKFSYCIPTRSNRPGLA 240
Query: 240 XXXXXXXXXEP--------SIVYSPLVPSQPH-----YNLNLQSIAVNGQLLSINQAVFA 286
P S++ P P+ Y + LQ I + + L+I +VF
Sbjct: 241 STGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFR 300
Query: 287 --TSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQ---------STSPIISKGNQCY 335
+ T+VD G+ +LV AYD + I V ST+ + GN
Sbjct: 301 PDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSM 360
Query: 336 LVSTRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQK---VQEGVTIL 392
+ +GD+ F GV +L++ + L+ G + C+G + + I+
Sbjct: 361 EIGRLIGDLV----FEFGRGVEILVEKQSLLVNVG----GGIHCVGIGRSSMLGAASNII 412
Query: 393 GDLVLKDKIVVYDLANQRIGWTNYDCSL 420
G++ ++ V +D+ N+R+G++ +C L
Sbjct: 413 GNVHQQNLWVEFDVTNRRVGFSKAECRL 440
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 78.2 bits (191), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 157/374 (41%), Gaps = 42/374 (11%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNC-PQTSGLGIELNFFDTVGSTTAGLV 118
Y V +G+PP DTGSD+LW C C +C Q L FD S+T V
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPL------FDPKTSSTYKDV 143
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAP--PV 176
CS CT+ A+ CS N CSY+ YGD S T G D + LG + P+
Sbjct: 144 SCSSSQCTALENQAS--CSTNDNTCSYSLSYGDNSYTKGNIAVDT----LTLGSSDTRPM 197
Query: 177 NSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXX 236
NI+ GC +G +K GI G G G +S++ QL I K FS+C
Sbjct: 198 Q-LKNIIIGCGHNNAGTF---NKKGSGIVGLGGGPVSLIKQLGD-SIDGK-FSYCLVPLT 251
Query: 237 XXXXXXXXXXXXEPSIVYSPLVPSQP---------HYNLNLQSIAVNGQLLSINQAVFAT 287
+IV V S P Y L L+SI+V + + + + +
Sbjct: 252 SKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSD-SE 310
Query: 288 SNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG-NQCYLVSTRVGDI-F 345
S+ I+D GTTL L E Y L +A+ +++ G + CY + GD+
Sbjct: 311 SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY---SATGDLKV 367
Query: 346 PTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYD 405
P ++++F G L ++ + C F + +I G++ + +V YD
Sbjct: 368 PVITMHFDGADVKLDSSNAFVQV-----SEDLVCFAF-RGSPSFSIYGNVAQMNFLVGYD 421
Query: 406 LANQRIGWTNYDCS 419
++ + + DC+
Sbjct: 422 TVSKTVSFKPTDCA 435
>AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11936203-11937390 REVERSE LENGTH=395
Length = 395
Score = 77.8 bits (190), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 158/384 (41%), Gaps = 57/384 (14%)
Query: 48 SPFILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFF 107
SP+ Y K+++G+PP E +DTGS+ +W C C +C +
Sbjct: 53 SPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTA-------- 104
Query: 108 DTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFD 167
P DP +S + C + C Y YG S T G V++ +
Sbjct: 105 -----------PIFDPSKSSTFK--EIRCDTHDHSCPYELVYGGKSYTKGTLVTETVTIH 151
Query: 168 MILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKV 227
GQ P I+ GC SG G+ G G S+++Q+ G P +
Sbjct: 152 STSGQ--PFVMPETII-GCGRNNSG----FKPGFAGVVGLDRGPKSLITQMG--GEYPGL 202
Query: 228 FSHCXXXXXXXXXXXXXXXXXEPSIVYSPLV---PSQP-HYNLNLQSIAV-NGQLLSINQ 282
S+C V S V ++P Y LNL +++V N ++ ++
Sbjct: 203 MSYCFAGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGT 262
Query: 283 AVFATSNNRGTIVDCGTTLAYLVQEAYDPLVNA----ITTAVSQSTSPIISKGNQCYLVS 338
A N ++D G+TL Y E+Y LV + TAV S I+ CY
Sbjct: 263 PFHALKGN--IVIDSGSTLTYF-PESYCNLVRKAVEQVVTAVRFPRSDIL-----CYY-- 312
Query: 339 TRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGF---QKVQEGVTILGDL 395
++ DIFP ++++F+GG ++L ++Y M Y + ++C+ ++E I G+
Sbjct: 313 SKTIDIFPVITMHFSGGADLVL--DKYNM-YVASNTGGVFCLAIICNSPIEEA--IFGNR 367
Query: 396 VLKDKIVVYDLANQRIGWTNYDCS 419
+ +V YD ++ + + +CS
Sbjct: 368 AQNNFLVGYDSSSLLVSFKPTNCS 391
>AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:11259872-11261209 REVERSE LENGTH=445
Length = 445
Score = 77.8 bits (190), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 148/383 (38%), Gaps = 48/383 (12%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNC-PQTSGLGIELNFFDTVGSTTAGLV 118
YF + +G+PP + DTGSD+ WV C C C Q S L FD S+T
Sbjct: 85 YFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPL------FDKKKSSTYKTE 138
Query: 119 PCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNS 178
C C + C + C Y + YGD S T G ++ + D G +
Sbjct: 139 SCDSKTC-QALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSF-- 195
Query: 179 SANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXX 238
VFGC Y +G + + G GP LS+VSQL S K FS+C
Sbjct: 196 -PGTVFGCG-YNNGGTFEETGSGIIGLGGGP--LSLVSQLGSS--IGKKFSYCLSHTAAT 249
Query: 239 XXXXXXXXXXEPSI----------VYSPLVPSQP--HYNLNLQSIAVNGQLLSINQAVFA 286
SI + +PL+ P +Y L L+++ V L +
Sbjct: 250 TNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYG 309
Query: 287 ----TSNNRGT-IVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG--NQCYLVST 339
+S G I+D GTTL L YD A+ +V+ + +G C+
Sbjct: 310 LNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCF---- 365
Query: 340 RVGDI---FPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLV 396
+ GD P ++++F + L P + C+ E V I G++V
Sbjct: 366 KSGDKEIGLPAITMHFT-NADVKLSPINAFVKL----NEDTVCLSMIPTTE-VAIYGNMV 419
Query: 397 LKDKIVVYDLANQRIGWTNYDCS 419
D +V YDL + + + DCS
Sbjct: 420 QMDFLVGYDLETKTVSFQRMDCS 442
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 76.6 bits (187), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/384 (23%), Positives = 156/384 (40%), Gaps = 70/384 (18%)
Query: 55 IGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTT 114
+G YF V +GSPP+ F++ +DTGSD+ W+ C C +C Q +
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND--------------- 209
Query: 115 AGLVPCSDPICTSGVQGAAAECSPQVNQ-CSYTFQYGDGSGTSGYYVSDAMYFDMIL-GQ 172
NQ C Y + YGD S T+G + + ++ G
Sbjct: 210 --------------------------NQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGG 243
Query: 173 APPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCX 232
+ + + N++FGC + G A + G LS SQL S + FS+C
Sbjct: 244 SSELYNVENMMFGCGHWNRGLFHGAAGLLGLG----RGPLSFSSQLQS--LYGHSFSYCL 297
Query: 233 XXXXXXXXXXXXXXXXE-------PSIVYSPLVPSQPH-----YNLNLQSIAVNGQLLSI 280
E P++ ++ V + + Y + ++SI V G++L+I
Sbjct: 298 VDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNI 357
Query: 281 NQAVFATSNNR--GTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG---NQCY 335
+ + S++ GTI+D GTTL+Y + AY+ + N I ++ P+ + C+
Sbjct: 358 PEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEK-AKGKYPVYRDFPILDPCF 416
Query: 336 LVSTRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDL 395
VS P + + FA G E + + D + +G K +I+G+
Sbjct: 417 NVSGIHNVQLPELGIAFADGAVWNFPTENSFI-WLNEDLVCLAMLGTPK--SAFSIIGNY 473
Query: 396 VLKDKIVVYDLANQRIGWTNYDCS 419
++ ++YD R+G+ C+
Sbjct: 474 QQQNFHILYDTKRSRLGYAPTKCA 497
>AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11930579-11931769 REVERSE LENGTH=396
Length = 396
Score = 72.8 bits (177), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 92/387 (23%), Positives = 160/387 (41%), Gaps = 62/387 (16%)
Query: 48 SPFILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFF 107
SP+ +Y K+++G+PP E IDTGS+I W C C +C + +
Sbjct: 53 SPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNA-------- 104
Query: 108 DTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFD 167
P DP +S + E + C Y Y D + T G ++ +
Sbjct: 105 -----------PIFDPSKSSTFK----EKRCDGHSCPYEVDYFDHTYTMGTLATETITLH 149
Query: 168 MILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKV 227
G+ P I+ GC S + G+ G G S+++Q+ G P +
Sbjct: 150 STSGE--PFVMPETII-GCGHNNS----WFKPSFSGMVGLNWGPSSLITQMG--GEYPGL 200
Query: 228 FSHCXXXXXXXXXXXXXXXXXEPSIVYSP---LVPSQP-HYNLNLQSIAV-NGQLLSINQ 282
S+C V S + ++P Y LNL +++V N ++ ++
Sbjct: 201 MSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGT 260
Query: 283 AVFATSNNRGTIVDCGTTLAY-------LVQEAYDPLVNAITTAVSQSTSPIISKGNQCY 335
A N ++D GTTL Y LV++A + +V A+ A P GN
Sbjct: 261 TFHALEGN--IVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAA-----DP---TGNDML 310
Query: 336 LVSTRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGF---QKVQEGVTIL 392
++ DIFP ++++F+GGV ++L ++Y M Y + ++C+ QE I
Sbjct: 311 CYNSDTIDIFPVITMHFSGGVDLVL--DKYNM-YMESNNGGVFCLAIICNSPTQEA--IF 365
Query: 393 GDLVLKDKIVVYDLANQRIGWTNYDCS 419
G+ + +V YD ++ + ++ +CS
Sbjct: 366 GNRAQNNFLVGYDSSSLLVSFSPTNCS 392
>AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3400671-3402165 REVERSE LENGTH=464
Length = 464
Score = 71.2 bits (173), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/369 (23%), Positives = 140/369 (37%), Gaps = 46/369 (12%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
Y + +G+P + ++ DTGSD+ W C C S + F+ S+T V
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLG----SCYSQKEPKFNPSSSSTYQNVS 187
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
CS P+C A CS + C Y+ YGD S T G+ + NS
Sbjct: 188 CSSPMCED-----AESCS--ASNCVYSIVYGDKSFTQGFLAKEKFTL---------TNSD 231
Query: 180 --ANIVFGCSTYQSG--DLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHC--XX 233
++ FGC G D + P + +FS+C
Sbjct: 232 VLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYN--------NIFSYCLPSF 283
Query: 234 XXXXXXXXXXXXXXXEPSIVYSPL--VPSQPHYNLNLQSIAVNGQLLSINQAVFATSNNR 291
S+ ++P+ PS +Y +++ I+V + L+I F+T
Sbjct: 284 TSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFST---E 340
Query: 292 GTIVDCGTTLAYLVQEAYDPLVNAITTAVS--QSTSPIISKGNQCYLVSTRVGDIFPTVS 349
G I+D GT L + Y L + +S +STS + CY + +PT++
Sbjct: 341 GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSG-YGLFDTCYDFTGLDTVTYPTIA 399
Query: 350 LNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVTILGDLVLKDKIVVYDLANQ 409
+FAG + L +P + C+ F + I G++ VVYD+A
Sbjct: 400 FSFAGSTVVELDGSGISLPIKI----SQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGG 455
Query: 410 RIGWTNYDC 418
R+G+ C
Sbjct: 456 RVGFAPNGC 464
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 68.9 bits (167), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 146/374 (39%), Gaps = 47/374 (12%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
Y K +G+P + + +DT SD+ W+ C+ C CP + F ST+ V
Sbjct: 115 YIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA-------FSPAKSTSFKNVS 167
Query: 120 CSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPVNSS 179
CS P C Q C + CS+ YG S ++ + D I A P+ +
Sbjct: 168 CSAPQCK---QVPNPTCGARA--CSFNLTYGSSS------IAANLSQDTIRLAADPIKA- 215
Query: 180 ANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXXXXXXXX 239
FGC +G T G G LS++SQ S I FS+C
Sbjct: 216 --FTFGCVNKVAGGGTIPPPQGLLGLGR--GPLSLMSQAQS--IYKSTFSYCLPSFRSLT 269
Query: 240 XXXXXXX--XXEPSIVYSPLVPSQPH----YNLNLQSIAVNGQLLSINQA--VFATSNNR 291
+P V + P Y +NL +I V +++ + A F S
Sbjct: 270 FSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGA 329
Query: 292 GTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKG--NQCYLVSTRVGDIFPTVS 349
GTI D GT L + Y+ + N V +T+ + S G + CY +V PT++
Sbjct: 330 GTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVKV----PTIT 385
Query: 350 LNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEG----VTILGDLVLKDKIVVYD 405
F GV+M + P LM + + C+ E V ++ + ++ V+ D
Sbjct: 386 FMFK-GVNMTM-PADNLMLHS--TAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLID 441
Query: 406 LANQRIGWTNYDCS 419
+ N R+G CS
Sbjct: 442 VPNGRLGLARERCS 455
>AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=350
Length = 350
Score = 67.0 bits (162), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 59/137 (43%), Gaps = 10/137 (7%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTSGLGIELNFFDTVGSTTAGLVP 119
YF +++G PP+ + DTGSD++WV C+ C NC S + F S+T
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATV----FFPRHSSTFSPAH 139
Query: 120 CSDPICTSGVQGAAAECSPQVN---QCSYTFQYGDGSGTSGYYVSDAMYFDMILGQAPPV 176
C DP+C + A C Y + Y DGS TSG + + G+ +
Sbjct: 140 CYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARL 199
Query: 177 NSSANIVFGCSTYQSGD 193
S + FGC SG
Sbjct: 200 KS---VAFGCGFRISGQ 213
Score = 63.9 bits (154), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 77/159 (48%), Gaps = 12/159 (7%)
Query: 267 NLQSIAVNGQLLSINQAVFATSNNRGTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSP 326
L+S+A Q+V S N GT+VD GTTLA+L + AY ++ A+ V +
Sbjct: 198 RLKSVAFGCGFRISGQSV---SGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD 254
Query: 327 IISKG-NQCYLVS--TRVGDIFPTVSLNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQ 383
++ G + C VS T+ I P + F+GG + P Y + + C+ Q
Sbjct: 255 ALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFI----ETEEQIQCLAIQ 310
Query: 384 KV--QEGVTILGDLVLKDKIVVYDLANQRIGWTNYDCSL 420
V + G +++G+L+ + + +D R+G++ C+L
Sbjct: 311 SVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCAL 349
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 67.0 bits (162), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/409 (22%), Positives = 160/409 (39%), Gaps = 68/409 (16%)
Query: 39 VTTCPLDIISPFILEAIGFRLYFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNCPQTS 98
V+ + + S LE++ Y V++G + ++ +DTGSD+ WV C C +C
Sbjct: 117 VSETQIPLTSGIKLESLN---YIVTVELGG--KNMSLIVDTGSDLTWVQCQPCRSCYNQQ 171
Query: 99 GLGIELNFFDTVGSTTAGLVPCSDPICTSGVQGAAAECSPQVNQ------CSYTFQYGDG 152
G +D S++ V C+ C V + N C Y YGDG
Sbjct: 172 G-----PLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDG 226
Query: 153 SGTSGYYVSDAMYFDMILGQAPPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGAL 212
S T G S++ ++LG N VFGC G + + ++
Sbjct: 227 SYTRGDLASES----ILLGDTKL----ENFVFGCGRNNKGLFGGSSGLMGLG----RSSV 274
Query: 213 SVVSQLSSRGITPKVFSHCX------XXXXXXXXXXXXXXXXEPSIVYSPLVPS---QPH 263
S+VSQ + VFS+C S+ Y+PLV + +
Sbjct: 275 SLVSQ--TLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSF 332
Query: 264 YNLNLQSIAVNGQLLSINQAVFATSNNRGTIVDCGTTLAYLVQEAYDPL-VNAITTAVSQ 322
Y LNL ++ G L ++S RG ++D GT + L Y + + +
Sbjct: 333 YILNLTGASIGGVELK------SSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGF 386
Query: 323 STSPIISKGNQCYLVSTRVGDIFPTVSLNFAG---------GVSMLLKPEQYLMPYGFVD 373
T+P S + C+ +++ P + + F G GV +KP+
Sbjct: 387 PTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPD---------- 436
Query: 374 GAAMWCIGFQKV--QEGVTILGDLVLKDKIVVYDLANQRIGWTNYDCSL 420
A++ C+ + + V I+G+ K++ V+YD +R+G +C +
Sbjct: 437 -ASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENCRV 484
>AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:7568286-7569455 FORWARD LENGTH=389
Length = 389
Score = 60.1 bits (144), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 81/373 (21%), Positives = 137/373 (36%), Gaps = 53/373 (14%)
Query: 60 YFTKVKMGSPPREFNVQIDTGSDILWVNCNTCSNC------PQTSGLGIELNFFDTVGST 113
+ ++ GSP ++ + +DTGS + W C CS+C P+ + S
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPK----------YRPAASI 107
Query: 114 TAGLVPCSDPICTSGVQGAAAECSPQVNQCSYTFQYGDGSGTSGYYVSDAMYFDMILGQA 173
T C D S A P C+Y Y D + G + + D G
Sbjct: 108 TYRDAMCEDSHPKSNPHFAF---DPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGF 164
Query: 174 PPVNSSANIVFGCSTYQSGDLTKADKAVDGIFGFGPGALSVVSQLSSRGITPKVFSHCXX 233
V+ + FGC+T G GI G G G S++ + S+ FS C
Sbjct: 165 KRVHG---VYFGCNTLSDGSYFTG----TGILGLGVGKYSIIGEFGSK------FSFCLG 211
Query: 234 XXXXXXXXXXXXXXXEPSIVYSPLV--PSQPHYNLNLQSIAVNGQLLSINQAVFATSNNR 291
++ P V ++ H L+SI V G+ ++++ V
Sbjct: 212 EISEPKASHNLILGDGANVQGHPTVINITEGHTIFQLESIIV-GEEITLDDPV------- 263
Query: 292 GTIVDCGTTLAYLVQEAYDPLVNAITTAVSQSTSPIISKGNQCYLVST--RVGDIFPTVS 349
VD G+TL++L Y V+A + + P+ + CY T R+ + V
Sbjct: 264 QVFVDTGSTLSHLSTNLYYKFVDAFDDLI--GSRPLSYEPTLCYKADTIERLEKM--DVG 319
Query: 350 LNFAGGVSMLLKPEQYLMPYGFVDGAAMWCIGFQKVQEGVT--ILGDLVLKDKIVVYDLA 407
F G + + + G + C+ Q +E + I+G + ++ V YDL+
Sbjct: 320 FKFDVGAELSVNIHNIFIQQG---PPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLS 376
Query: 408 NQRIGWTNYDCSL 420
+ DC +
Sbjct: 377 AKTAYINKQDCDM 389