Miyakogusa Predicted Gene
- Lj1g3v0715560.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0715560.1 Non Characterized Hit- tr|I1JCE0|I1JCE0_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max
PE=3,81.27,0,Asp,Peptidase A1; ASP_PROTEASE,Peptidase aspartic, active
site; Acid proteases,Peptidase aspartic; C,CUFF.26205.1
(515 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits), E Value
AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 669 0.0
AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 481 e-136
AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family pr... 398 e-111
AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family pr... 379 e-105
AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family pr... 364 e-101
AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family pr... 352 3e-97
AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 298 4e-81
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 128 9e-30
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 126 3e-29
AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family pr... 122 5e-28
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 120 2e-27
AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family pr... 119 5e-27
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 117 2e-26
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 115 5e-26
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 115 7e-26
AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 115 7e-26
AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 108 7e-24
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 104 2e-22
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 2e-21
AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 2e-21
AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family pr... 97 3e-20
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 95 1e-19
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 91 1e-18
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 91 2e-18
AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family pr... 90 5e-18
AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family pr... 89 5e-18
AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family pr... 88 1e-17
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 3e-17
AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 86 8e-17
AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 84 3e-16
AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 83 4e-16
AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family pr... 83 4e-16
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 79 6e-15
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 79 8e-15
AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 78 1e-14
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 78 1e-14
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 77 2e-14
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 77 2e-14
AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 76 6e-14
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 76 6e-14
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 75 9e-14
AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family pr... 73 4e-13
AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family pr... 73 5e-13
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 73 6e-13
AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family pr... 71 2e-12
AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 70 4e-12
AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 70 5e-12
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 69 8e-12
AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 68 1e-11
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 2e-11
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 67 3e-11
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 3e-11
AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family pr... 63 4e-10
AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 63 6e-10
AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 9e-10
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 61 2e-09
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 58 1e-08
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 55 1e-07
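
The summary rows above can be read programmatically. The following is a minimal Python sketch, assuming each row has the shape `<id> | Symbols: ... | <description>... <bits> <E-value>` with single spaces, as in this export. The names `HIT_LINE`, `parse_evalue`, and `parse_hit` are hypothetical helpers and not part of any BLAST tooling. Legacy BLAST prints very small E-values without a mantissa (`e-136`), which is treated here as `1e-136`.

```python
import re

# Row shape assumed from the hit table in this report:
#   <id> | Symbols: ... | <truncated description>... <bits> <E-value>
HIT_LINE = re.compile(
    r"^(?P<id>\S+) \| .* (?P<bits>\d+(?:\.\d+)?) (?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert a legacy BLAST E-value string to float ('e-136' is read as 1e-136)."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit(line):
    """Return (sequence_id, bit_score, e_value), or None if the line is not a hit row."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    return match["id"], float(match["bits"]), parse_evalue(match["evalue"])


# Three rows copied from the table above.
rows = [
    "AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 669 0.0",
    "AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 481 e-136",
    "AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family pr... 352 3e-97",
]
hits = [parse_hit(row) for row in rows]
```

Lines that do not match the assumed row shape, such as `Length = 524`, return `None`, so the same function can be applied to every line of the report and the non-matches filtered out.
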
>AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16993339-16995721 FORWARD LENGTH=524
Length = 524
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/508 (66%), Positives = 400/508 (78%), Gaps = 23/508 (4%)
Query: 18 WGCCHGQIYTFTMHHRYSEAVKKWSHSA---AGAPEKGTFEYYSELADRDRFLRGRKLSQ 74
+G C+G+I+TF MHHR+S+ VK+WS S A P KG+FEY++ L RD +RGR+LS+
Sbjct: 21 FGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFAKFPPKGSFEYFNALVLRDWLIRGRRLSE 80
Query: 75 IGAG----LAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRC 130
+ L FSDGNST RISSLGFLHYTTV+LGTPG++FMVALDTGSDLFWVPCDC +C
Sbjct: 81 SESESESSLTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKC 140
Query: 131 ATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVS 190
A T+ +AS +F+LS+Y+P VS+T+KKVTCNNSLC RNQCLGTFS CPYMVSYVS
Sbjct: 141 APTEGATYAS----EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVS 196
Query: 191 AETSTSGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKIS 250
A+TSTSGIL+EDV+HLT +D + VEA V FGCGQVQSGSFLD+AAPNGLFGLGMEKIS
Sbjct: 197 AQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKIS 256
Query: 251 VPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRV 310
VPS+L+REG ADSFSMCFG DG+GRISFGDKGS DQ+ETPFN+NPSHP YNITVT+VRV
Sbjct: 257 VPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRV 316
Query: 311 GTTLIDLEFTALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPD 370
GTTLID EFTALFD+GTSFTYLVDP YT VSESFHSQ QD+R DS+IPFEYCYDMS D
Sbjct: 317 GTTLIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSND 376
Query: 371 ANTSLLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKSSGLNIIGQNFMTGYRVV 430
AN SL+PS+SL M G SHF I+DP+I+IST+ ELVYCL++VKSS LNIIGQN+MTGYRVV
Sbjct: 377 ANASLIPSLSLTMKGNSHFTINDPIIVISTEGELVYCLAIVKSSELNIIGQNYMTGYRVV 436
Query: 431 FDREKLILGWKKSDCFDDIEDHNAVPTRPHSDNVPPALAAGL---GNYPATGKTNDNSQQ 487
FDREKL+L WKK DC+D E + V + V PA+AAG+ N KTN +
Sbjct: 437 FDREKLVLAWKKFDCYDIEETNTTVAGTNKTAAVAPAMAAGIKTHNNSSELHKTNQTISK 496
Query: 488 STASPS---------SHYSFGFLVLRFV 506
S +SP+ S + F F++L V
Sbjct: 497 SNSSPNQISKTVDVWSFFRFVFILLPLV 524
>AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:7713488-7716269 FORWARD LENGTH=513
Length = 513
Score = 481 bits (1237), Expect = e-136, Method: Compositional matrix adjust.
Identities = 234/418 (55%), Positives = 299/418 (71%), Gaps = 7/418 (1%)
Query: 31 HHRYSEAVKKWSHSAAGAPEKGTFEYYSELADRDRFLRGRKLSQIGAGLA-FSDGNSTFR 89
HHR+S+ V G P + + +YY +A RDR +RGR+L+ L FSDGN T R
Sbjct: 38 HHRFSDQVVG-VLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTFSDGNETVR 96
Query: 90 ISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALASDFDLS 149
+ +LGFLHY V +GTP FMVALDTGSDLFW+PCDCT C + S DL+
Sbjct: 97 VDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVR----ELKAPGGSSLDLN 152
Query: 150 VYSPNVSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTND 209
+YSPN SSTS KV CN++LC ++C S+CPY + Y+S TS++G+LVEDVLHL ++
Sbjct: 153 IYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSN 212
Query: 210 DNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF 269
D + A V FGCGQVQ+G F D AAPNGLFGLG+E ISVPS+L++EG A+SFSMCF
Sbjct: 213 DKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCF 272
Query: 270 GRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFTALFDSGTSF 329
G DG GRISFGDKGS+DQ ETP N+ HPTYNITVT++ VG DLEF A+FDSGTSF
Sbjct: 273 GNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSF 332
Query: 330 TYLVDPPYTNVSESFHSQVQDRR-RPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGSH 388
TYL D YT +SESF+S D+R + DS++PFEYCY +SP+ ++ P+V+L M GGS
Sbjct: 333 TYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSS 392
Query: 389 FAISDPVIIISTQSELVYCLSLVKSSGLNIIGQNFMTGYRVVFDREKLILGWKKSDCF 446
+ + P+++I + VYCL+++K ++IIGQNFMTGYRVVFDREKLILGWK+SDC+
Sbjct: 393 YPVYHPLVVIPMKDTDVYCLAIMKIEDISIIGQNFMTGYRVVFDREKLILGWKESDCY 450
>AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19053480-19056152 REVERSE LENGTH=529
Length = 529
Score = 398 bits (1022), Expect = e-111, Method: Compositional matrix adjust.
Identities = 205/441 (46%), Positives = 284/441 (64%), Gaps = 12/441 (2%)
Query: 18 WG---CCHGQIYTFTMHHRYSEAVKKWSHSAAGAPEKGTFEYYSELADRDRFLRGRKLSQ 74
WG C ++F +HH +S+ VK+ PEKG+ EY+ LA RDR +RGR L+
Sbjct: 18 WGLERCEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRLIRGRGLAS 77
Query: 75 IGAG--LAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRCAT 132
+ F GN T I LGFLHY V +GTP F+VALDTGSDLFW+PC+C
Sbjct: 78 NNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCI 137
Query: 133 TDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAE 192
D L+ L++YSPN SSTS + C++ C ++C S+CPY + Y+S +
Sbjct: 138 RDLKEVG--LSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKD 195
Query: 193 TSTSGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVP 252
T T+G L EDVLHL +D V+AN+ GCG+ Q+G AA NGL GLG++ SVP
Sbjct: 196 TFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVP 255
Query: 253 SMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRV 310
S+L++ TA+SFSMCFG D +GRISFGDKG DQ ETP PTY ++VT+V V
Sbjct: 256 SILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSV 315
Query: 311 GTTLIDLEFTALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPD 370
G + ++ ALFD+GTSFT+L++P Y ++++F V D+RRP+D ++PFE+CYD+SP+
Sbjct: 316 GGDAVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPN 375
Query: 371 ANTSLLPSVSLAMGGGSHFAISDPVIII-STQSELVYCLSLVKSS--GLNIIGQNFMTGY 427
T L P V++ GGS + +P+ I+ + + +YCL ++KS +NIIGQNFM+GY
Sbjct: 376 KTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGY 435
Query: 428 RVVFDREKLILGWKKSDCFDD 448
R+VFDRE++ILGWK+SDCF+D
Sbjct: 436 RIVFDRERMILGWKRSDCFED 456
>AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19060485-19063248 REVERSE LENGTH=528
Length = 528
Score = 379 bits (972), Expect = e-105, Method: Compositional matrix adjust.
Identities = 195/441 (44%), Positives = 273/441 (61%), Gaps = 13/441 (2%)
Query: 18 WG---CCHGQIYTFTMHHRYSEAVKKWSHSAAGAPEKGTFEYYSELADRDRFLRGRKLSQ 74
WG C + F +HH +S++VK+ PE+G+ EY+ LA RDR +RGR L+
Sbjct: 18 WGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRLIRGRGLAS 77
Query: 75 IG--AGLAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRCAT 132
+ F GN T + LG L+Y V +GTP F+VALDTGSDLFW+PC+C
Sbjct: 78 NNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCI 137
Query: 133 TDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAE 192
D + L++Y+PN S+TS + C++ C +C S CPY +SY S
Sbjct: 138 RDLEDIG--VPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-SNS 194
Query: 193 TSTSGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVP 252
T T G L++DVLHL +D + V+ANV GCGQ Q+G F + NG+ GLG++ SVP
Sbjct: 195 TGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVP 254
Query: 253 SMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRV 310
S+L++ TA+SFSMCFGR +GRISFGD+G DQ+ETPF Y + ++ V V
Sbjct: 255 SLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNISGVSV 314
Query: 311 GTTLIDLEFTALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPD 370
+D+ A FD+G+SFT+L +P Y +++SF V+DRRRPVD ++PFE+CYD+SP+
Sbjct: 315 AGDPVDIRLFAKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPN 374
Query: 371 ANTSLLPSVSLAMGGGSHFAISDPVIIISTQS-ELVYCLSLVKSSGL--NIIGQNFMTGY 427
A T P V + GGS +++P TQ ++YCL ++KS GL N+IGQNF+ GY
Sbjct: 375 ATTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGY 434
Query: 428 RVVFDREKLILGWKKSDCFDD 448
R+VFDRE++ILGWK+S CF+D
Sbjct: 435 RIVFDRERMILGWKQSLCFED 455
>AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19064294-19066560 REVERSE LENGTH=488
Length = 488
Score = 364 bits (934), Expect = e-101, Method: Compositional matrix adjust.
Identities = 201/452 (44%), Positives = 289/452 (63%), Gaps = 29/452 (6%)
Query: 27 TFTMHHRYSEAVKKWSHSAAGAPEKGTFEYYSELADRDRFLRGRKLSQIGAG---LAFSD 83
+F +HHR+SE VK G PE G+ +YY L RDR GR+L+ ++F+
Sbjct: 23 SFEIHHRFSEQVKT-VLGGHGLPEMGSLDYYKALVHRDR---GRQLTSNNNNQTTISFAQ 78
Query: 84 GNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALA 143
GNST IS FLHY V +GTP F+VALDTGSDLFW+PC+C +T + +
Sbjct: 79 GNSTEEIS---FLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCN---STCVRSMETDQG 132
Query: 144 SDFDLSVYSPNVSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDV 203
L++Y+P+ S +S KVTCN++LC RN+C+ S+CPY + Y+S + ++G+LVEDV
Sbjct: 133 ERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVEDV 192
Query: 204 LHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTAD 263
+H++ ++ +A + FGC + Q G F +VA NG+ GL + I+VP+ML + G +D
Sbjct: 193 IHMSTEEGEAR--DARITFGCSESQLGLFKEVAV-NGIMGLAIADIAVPNMLVKAGVASD 249
Query: 264 SFSMCFGRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFTALF 323
SFSMCFG +G G ISFGDKGS DQ ETP + S Y++++T+ +VG +D EFTA F
Sbjct: 250 SFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTATF 309
Query: 324 DSGTSFTYLVDPPYTNVSESFHSQVQDRR--RPVDSKIPFEYCYDMSPDANTSLLPSVSL 381
DSGT+ T+L++P YT ++ +FH V DRR + VDS PFE+CY ++ ++ LPSVS
Sbjct: 310 DSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDS--PFEFCYIITSTSDEDKLPSVSF 367
Query: 382 AMGGGSHFAISDPVIIISTQ--SELVYCLSLVK--SSGLNIIGQNFMTGYRVVFDREKLI 437
M GG+ + + P+++ T S VYCL+++K ++ +IIGQNFMT YR+V DRE+ I
Sbjct: 368 EMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTNYRIVHDRERRI 427
Query: 438 LGWKKSDCFDDIEDHNAVPTRPHSDNVPPALA 469
LGWKKS+C D N T P + PP++A
Sbjct: 428 LGWKKSNC----NDTNGF-TGPTALAKPPSMA 454
>AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19057013-19059788 REVERSE LENGTH=530
Length = 530
Score = 352 bits (903), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 189/461 (40%), Positives = 271/461 (58%), Gaps = 22/461 (4%)
Query: 4 FRTITSIFLFFFSVWGCCHGQIYTFTMHHRYSEAVKKWSHSAAGAPEKGTFEYYSELADR 63
F ++ + L F+ + C ++F +HH +S+ VK+ PE G+ EY+ LA R
Sbjct: 8 FVLLSMLVLIFWGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEYFKVLAHR 67
Query: 64 DRFLRGRKLSQIGAGLAFSD--GNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLF 121
DRF+RGR L+ + N T ++ LGFLHY V LGTP F+VALDTGSDLF
Sbjct: 68 DRFIRGRGLASNNEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLF 127
Query: 122 WVPCDCTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHRNQCLGTFSN 181
W+PC+C D + + L++Y+PN S+TS + C++ C +C S
Sbjct: 128 WLPCNCGTTCIHDLKD--ARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESI 185
Query: 182 CPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGL 241
CPY ++ +S+ T T+G L++DVLHL +D V ANV GCGQ Q+G+F A NG+
Sbjct: 186 CPYQIA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGV 244
Query: 242 FGLGMEKISVPSMLSREGFTADSFSMCFGR--DGIGRISFGDKGSLDQDETPFNMNPSHP 299
GL M++ SVPS+L++ TA+SFSMCFGR +GRISFGDKG DQ+ETP +
Sbjct: 245 LGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETST 304
Query: 300 TYNITVTQVRVGTTLIDLEFTALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKI 359
Y + VT V VG +D+ ALFD+G+SFT L++ Y +++F ++D+RRPVD
Sbjct: 305 AYGVNVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDF 364
Query: 360 PFEYCYDMSPD-ANTSLLPSVSLAMGGGSHFAISD-----------PVIIISTQSELVYC 407
PFE+CYD+ + N+ P M + D + S + +YC
Sbjct: 365 PFEFCYDLREEHLNSDARPR---HMQSKCYNPCRDDFRWRIQNDSQESVSYSNEGTKMYC 421
Query: 408 LSLVKSSGLNIIGQNFMTGYRVVFDREKLILGWKKSDCFDD 448
L ++KS LNIIGQN M+G+R+VFDRE++ILGWK+S+CF+D
Sbjct: 422 LGILKSINLNIIGQNLMSGHRIVFDRERMILGWKQSNCFED 462
>AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3150843-3153380 FORWARD LENGTH=528
Length = 528
Score = 298 bits (764), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 187/461 (40%), Positives = 259/461 (56%), Gaps = 28/461 (6%)
Query: 25 IYTFTMHHRYSE---AVKKWSHSAAGAPEKGTFEYYSELADRDRFLRGR-KLSQIGAGLA 80
+++ + HR+S+ A K S+ P K + EYY LA+ D F R R L L
Sbjct: 24 LFSSRLIHRFSDEGRASIKTPSSSDSLPNKQSLEYYRLLAESD-FRRQRMNLGAKVQSLV 82
Query: 81 FSDGNSTFRISS---LGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTA 137
S+G+ T ISS G+LHYT + +GTP V F+VALDTGS+L W+PC+C +CA ST
Sbjct: 83 PSEGSKT--ISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTY 140
Query: 138 FASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSG 197
++S LA+ DL+ Y+P+ SSTSK C++ LC + C CPY V+Y+S TS+SG
Sbjct: 141 YSS-LATK-DLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSG 198
Query: 198 ILVEDVLHLTNDDNHHGL-----VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVP 252
+LVED+LHLT + N+ + V+A V+ GCG+ QSG +LD AP+GL GLG +ISVP
Sbjct: 199 LLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVP 258
Query: 253 SMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNM--NPSHPTYNITVTQVRV 310
S LS+ G +SFS+CF + GRI FGD G Q TPF N + Y + V +
Sbjct: 259 SFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCI 318
Query: 311 GTT-LIDLEFTALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSP 369
G + L FT DSG SFTYL + Y V+ + + + + +EYCY+ S
Sbjct: 319 GNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKNFEG-VSWEYCYESSA 377
Query: 370 DANTSLLPSVSLAMGGGSHFAISDPVIIISTQSELV-YCLSLVKS--SGLNIIGQNFMTG 426
+ +P++ L + F I P+ + LV +CL + S G+ IGQN+M G
Sbjct: 378 EPK---VPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRG 434
Query: 427 YRVVFDREKLILGWKKSDCFDD-IEDHNAVPTRPHSDNVPP 466
YR+VFDRE + LGW S C +D IE A P S N P
Sbjct: 435 YRMVFDRENMKLGWSPSKCQEDKIEPPQASPGSTSSPNPLP 475
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 128 bits (322), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 126/442 (28%), Positives = 197/442 (44%), Gaps = 59/442 (13%)
Query: 44 SAAGAPEKGTFEYYSELADRDRFLRGRKLSQIGAGLAFSDGNSTFRISSLGF-------- 95
+AA A E FE S+ A G+++ +GA A + +S++
Sbjct: 27 TAATASENLVFEVRSKFA-------GKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQP 79
Query: 96 ----LHYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDF-DLS 149
L++ + LGTP F V +DTGSD+ WV C C RC SD +L+
Sbjct: 80 ESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPR----------KSDLVELT 129
Query: 150 VYSPNVSSTSKKVTCNNSLCMHRNQ---CLGTFSNCPYMVSYVSAETSTSGILVEDVLH- 205
Y + SST+K V+C+++ C + NQ C + S C Y++ Y +ST+G LV+DV+H
Sbjct: 130 PYDVDASSTAKSVSCSDNFCSYVNQRSEC-HSGSTCQYVIMYGDG-SSTNGYLVKDVVHL 187
Query: 206 -LTNDDNHHGLVEANVIFGCGQVQSGSFLD-VAAPNGLFGLGMEKISVPSMLSREGFTAD 263
L + G +IFGCG QSG + AA +G+ G G S S L+ +G
Sbjct: 188 DLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKR 247
Query: 264 SFSMCF-GRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFTA- 321
SF+ C +G G + G+ S TP +H Y++ + + VG ++++L A
Sbjct: 248 SFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLNAIEVGNSVLELSSNAF 305
Query: 322 --------LFDSGTSFTYLVDPPYTN-VSESFHSQVQDRRRPV-DSKIPFEYC--YDMSP 369
+ DSGT+ YL D Y ++E S + V +S F Y D P
Sbjct: 306 DSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFTCFHYTDKLDRFP 365
Query: 370 DANTSLLPSVSLAMGGGSH-FAISDPVIIISTQSELVYCLSLVKSSGLNIIGQNFMTGYR 428
SVSLA+ + F + + Q+ L + L I+G ++
Sbjct: 366 TVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNG---GLQTKGGASLTILGDMALSNKL 422
Query: 429 VVFDREKLILGWKKSDCFDDIE 450
VV+D E ++GW +C I+
Sbjct: 423 VVYDIENQVIGWTNHNCSGGIQ 444
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 126 bits (317), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 122/417 (29%), Positives = 183/417 (43%), Gaps = 49/417 (11%)
Query: 58 SELADRDRFLRGRKLSQIGAGLAFS-DGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDT 116
S+L RD GR L +G + F DG TF +G L+YT ++LGTP F V +DT
Sbjct: 44 SQLKARDEARHGRLLQSLGGVIDFPVDG--TFDPFVVG-LYYTKLRLGTPPRDFYVQVDT 100
Query: 117 GSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHRNQ- 174
GSD+ WV C C C T L+ + P S T+ ++C++ C Q
Sbjct: 101 GSDVLWVSCASCNGCPQTSGL--------QIQLNFFDPGSSVTASPISCSDQRCSWGIQS 152
Query: 175 ----CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHG--LVE---ANVIFGCG 225
C + C Y Y + TSG V DVL D G LV A V+FGC
Sbjct: 153 SDSGCSVQNNLCAYTFQYGDG-SGTSGFYVSDVLQF---DMIVGSSLVPNSTAPVVFGCS 208
Query: 226 QVQSGSFLDV-AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-GRDGIGRI-SFGDK 282
Q+G + A +G+FG G + +SV S L+ +G FS C G +G G I G+
Sbjct: 209 TSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEI 268
Query: 283 GSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFTA---------LFDSGTSFTYLV 333
+ TP + PS P YN+ + + V + + + + D+GT+ YL
Sbjct: 269 VEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLS 326
Query: 334 DPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGSHFAISD 393
+ Y E+ + V RPV SK CY ++ + P VSL GG+ ++
Sbjct: 327 EAAYVPFVEAITNAVSQSVRPVVSK--GNQCYVITTSVG-DIFPPVSLNFAGGASMFLNP 383
Query: 394 PVIIISTQS---ELVYCLSL--VKSSGLNIIGQNFMTGYRVVFDREKLILGWKKSDC 445
+I + V+C+ +++ G+ I+G + V+D +GW DC
Sbjct: 384 QDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24230963-24233349 REVERSE LENGTH=475
Length = 475
Score = 122 bits (307), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 124/436 (28%), Positives = 196/436 (44%), Gaps = 53/436 (12%)
Query: 38 VKKWSHSAAGAPEKGTFEYYSELADRDRFLRGRKLSQIGAGLAFSDGNSTFRISSLGFLH 97
V K H AG +K E++ D R L+ I L G+S R+ S+G L+
Sbjct: 26 VFKAQHKFAG--KKKNLEHFKS---HDTRRHSRMLASIDLPLG---GDS--RVDSVG-LY 74
Query: 98 YTTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNVS 156
+T ++LG+P ++ V +DTGSD+ W+ C C +C T + +F LS++ N S
Sbjct: 75 FTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNL--------NFRLSLFDMNAS 126
Query: 157 STSKKVTCNNSLCMHRNQ---CLGTFSNCPYMVSYVSAETSTSGILVEDVLHL--TNDDN 211
STSKKV C++ C +Q C C Y + Y E+++ G + D+L L D
Sbjct: 127 STSKKVGCDDDFCSFISQSDSCQPAL-GCSYHIVYAD-ESTSDGKFIRDMLTLEQVTGDL 184
Query: 212 HHGLVEANVIFGCGQVQSGSFLD-VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG 270
G + V+FGCG QSG + +A +G+ G G SV S L+ G FS C
Sbjct: 185 KTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLD 244
Query: 271 R-DGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFT------ALF 323
G G + G S TP M P+ YN+ + + V T +DL + +
Sbjct: 245 NVKGGGIFAVGVVDSPKVKTTP--MVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 302
Query: 324 DSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFE--YCYDMSPDANTSLLPSVSL 381
DSGT+ Y Y ++ E+ + R+PV I E C+ S + + + P VS
Sbjct: 303 DSGTTLAYFPKVLYDSLIETILA-----RQPVKLHIVEETFQCFSFSTNVDEA-FPPVSF 356
Query: 382 AMGGGSHFAISDPVIIISTQSELVYC-------LSLVKSSGLNIIGQNFMTGYRVVFDRE 434
+ P + T E +YC L+ + S + ++G ++ VV+D +
Sbjct: 357 EFEDSVKLTVY-PHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLD 415
Query: 435 KLILGWKKSDCFDDIE 450
++GW +C I+
Sbjct: 416 NEVIGWADHNCSSSIK 431
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 120 bits (301), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 112/421 (26%), Positives = 181/421 (42%), Gaps = 43/421 (10%)
Query: 52 GTFEYYSELADRDRFLRGRKLSQIGAGLAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFM 111
G + SEL D F R L+ I L G+S R S+G L++T ++LG+P ++
Sbjct: 39 GKEKQLSELKSHDSFRHARMLANIDLPLG---GDS--RADSIG-LYFTKIKLGSPPKEYY 92
Query: 112 VALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLC- 169
V +DTGSD+ WV C C +C LS+Y SSTSK V C + C
Sbjct: 93 VQVDTGSDILWVNCAPCPKCPVKTDLGIP--------LSLYDSKTSSTSKNVGCEDDFCS 144
Query: 170 -MHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDV-LHLTNDDNHHGLVEANVIFGCGQV 227
+ +++ G C Y V Y TS + +++ L + + V+FGCG+
Sbjct: 145 FIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKN 204
Query: 228 QSGSFLDV-AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-GRDGIGRISFGDKGSL 285
QSG +A +G+ G G S+ S L+ G T FS C +G G + G+ S
Sbjct: 205 QSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGEVESP 264
Query: 286 DQDETPFNMNPSHPTYNITVTQVRVGTTLIDL---------EFTALFDSGTSFTYLVDPP 336
TP N H YN+ + + V IDL + + DSGT+ YL
Sbjct: 265 VVKTTPIVPNQVH--YNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNL 322
Query: 337 YTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGSHFAISDPVI 396
Y ++ E ++ Q + V C+ + + + + P V+L ++
Sbjct: 323 YNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKA-FPVVNLHFEDSLKLSVYPHDY 378
Query: 397 IISTQSELVYCL-----SLVKSSGLNII--GQNFMTGYRVVFDREKLILGWKKSDCFDDI 449
+ S + ++ YC + G ++I G ++ VV+D E ++GW +C I
Sbjct: 379 LFSLREDM-YCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSI 437
Query: 450 E 450
+
Sbjct: 438 K 438
>AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18151161-18153186 FORWARD LENGTH=410
Length = 410
Score = 119 bits (298), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 112/403 (27%), Positives = 177/403 (43%), Gaps = 62/403 (15%)
Query: 92 SLGFLHYTTVQLGTP--GVKFMVALDTGSDLFWVPCD--CTRCATTDSTAFASALASDFD 147
+G L+YT + +G P G + + +DTGS+L W+ CD CT CA +
Sbjct: 25 QMGMLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN------------ 72
Query: 148 LSVYSPNVSSTSKKVTCNNSLCM--HRNQCLGTFSNC---PYMVSYVSAETSTSGILVED 202
+Y P + V + + C+ RNQ NC Y + Y S G+L +D
Sbjct: 73 -QLYKPRKDNL---VRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSM-GVLTKD 127
Query: 203 VLHLTNDDNHHG-LVEANVIFGCGQVQSGSFLD-VAAPNGLFGLGMEKISVPSMLSREGF 260
HL H+G L E++++FGCG Q G L+ + +G+ GL KIS+PS L+ G
Sbjct: 128 KFHLKL---HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGI 184
Query: 261 TADSFSMCFGRD--GIGRISFGDK--GSLDQDETPFNMNPSHPTYNITVTQVRVGTTLID 316
++ C D G G I G S P + Y + VT++ G ++
Sbjct: 185 ISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLS 244
Query: 317 LEF------TALFDSGTSFTYLVDPPYTNVSESFH--SQVQDRRRPVDSKIPFEYCYDMS 368
L+ LFD+G+S+TY + Y+ + S S ++ R D +P C+
Sbjct: 245 LDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPI--CWRAK 302
Query: 369 PDANTSLLPSVS-----LAMGGGSHFAISDPVIIISTQSELVY------CLSLVKSSGLN 417
+ S L V + + GS + I ++I + L+ CL ++ S ++
Sbjct: 303 TNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVH 362
Query: 418 -----IIGQNFMTGYRVVFDREKLILGWKKSDCFDDIE-DHNA 454
I+G M G+ +V+D K +GW KSDC E DHN
Sbjct: 363 DGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVRPREIDHNV 405
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 117 bits (293), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 120/427 (28%), Positives = 177/427 (41%), Gaps = 60/427 (14%)
Query: 58 SELADRDR------FLRGRKLSQIGAGLAFSDGNST---FRISSLGFLHYTTVQLGTPGV 108
SEL RDR L G + S +G + F S+ S + L++T V+LG+P
Sbjct: 57 SELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPT 116
Query: 109 KFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNS 167
+F V +DTGSD+ WV C C+ C + DL + S T+ VTC++
Sbjct: 117 EFNVQIDTGSDILWVTCSSCSNCPHSSGLG--------IDLHFFDAPGSLTAGSVTCSDP 168
Query: 168 LC-----MHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVE--ANV 220
+C QC + C Y Y + TSG + D + + A +
Sbjct: 169 ICSSVFQTTAAQC-SENNQCGYSFRYGDG-SGTSGYYMTDTFYFDAILGESLVANSSAPI 226
Query: 221 IFGCGQVQSGSFLDV-AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISF 279
+FGC QSG A +G+FG G K+SV S LS G T FS C DG G F
Sbjct: 227 VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVF 286
Query: 280 GDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLE---FTA------LFDSGTSFT 330
L + PS P YN+ + + V ++ L+ F A + D+GT+ T
Sbjct: 287 VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLT 346
Query: 331 YLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGS--- 387
YLV Y + + V P+ S E CY +S + + + PSVSL GG+
Sbjct: 347 YLVKEAYDLFLNAISNSVSQLVTPIISN--GEQCYLVS-TSISDMFPSVSLNFAGGASMM 403
Query: 388 --------HFAISDPVIIISTQSELVYCLSLVKS-SGLNIIGQNFMTGYRVVFDREKLIL 438
H+ I D ++C+ K+ I+G + V+D + +
Sbjct: 404 LRPQDYLFHYGIYD--------GASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRI 455
Query: 439 GWKKSDC 445
GW DC
Sbjct: 456 GWASYDC 462
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 115 bits (289), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 108/377 (28%), Positives = 170/377 (45%), Gaps = 46/377 (12%)
Query: 96 LHYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPN 154
L+YT V+LGTP +F V +DTGSD+ WV C C C T LS + P
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSEL--------QIQLSFFDPG 134
Query: 155 VSSTSKKVTCNNSLCMHRNQCLGTFSN---CPYMVSYVSAETSTSGILVEDVLHLTNDDN 211
VSS++ V+C++ C Q S C Y Y + TSG + D +
Sbjct: 135 VSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDG-SGTSGYYISDFMSFDTVIT 193
Query: 212 HHGLVE--ANVIFGCGQVQSGSFLDVAAP----NGLFGLGMEKISVPSMLSREGFTADSF 265
+ A +FGC +QSG D+ P +G+FGLG +SV S L+ +G F
Sbjct: 194 STLAINSSAPFVFGCSNLQSG---DLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVF 250
Query: 266 SMCFGRD--GIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLE---FT 320
S C D G G + G D TP + PS P YN+ + + V ++ ++ FT
Sbjct: 251 SHCLKGDKSGGGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFT 308
Query: 321 ------ALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFE--YCYDMSPDAN 372
+ D+GT+ YL D Y+ ++ + V RP I +E C++++ +
Sbjct: 309 IATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRP----ITYESYQCFEITA-GD 363
Query: 373 TSLLPSVSLAMGGGSHFAISDP--VIIISTQSELVYCLSLVKSS--GLNIIGQNFMTGYR 428
+ P VSL+ GG+ + + I S+ ++C+ + S + I+G +
Sbjct: 364 VDVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKV 423
Query: 429 VVFDREKLILGWKKSDC 445
VV+D + +GW + DC
Sbjct: 424 VVYDLVRQRIGWAEYDC 440
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 115 bits (288), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 121/424 (28%), Positives = 175/424 (41%), Gaps = 59/424 (13%)
Query: 58 SELADRDRFLRGRKLSQIGAGLAFSDGNST-FRISS-----LGFLHYTTVQLGTPGVKFM 111
SEL RDR R L +G G S G F + L L++T V+LG+P +F
Sbjct: 57 SELRARDRVRHARIL--LGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFN 114
Query: 112 VALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLC- 169
V +DTGSD+ WV C C+ C + DL + S T+ VTC++ +C
Sbjct: 115 VQIDTGSDILWVTCSSCSNCPHSSGLG--------IDLHFFDAPGSLTAGSVTCSDPICS 166
Query: 170 ----MHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVE--ANVIFG 223
QC + C Y Y + TSG + D + + A ++FG
Sbjct: 167 SVFQTTAAQC-SENNQCGYSFRYGDG-SGTSGYYMTDTFYFDAILGESLVANSSAPIVFG 224
Query: 224 CGQVQSGSFLDV-AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDK 282
C QSG A +G+FG G K+SV S LS G T FS C DG G F
Sbjct: 225 CSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLG 284
Query: 283 GSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLE---FTA------LFDSGTSFTYLV 333
L + PS P YN+ + + V ++ L+ F A + D+GT+ TYLV
Sbjct: 285 EILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLV 344
Query: 334 DPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGS------ 387
Y + + V P+ S E CY +S + + + PSVSL GG+
Sbjct: 345 KEAYDLFLNAISNSVSQLVTPIISN--GEQCYLVS-TSISDMFPSVSLNFAGGASMMLRP 401
Query: 388 -----HFAISDPVIIISTQSELVYCLSLVKS-SGLNIIGQNFMTGYRVVFDREKLILGWK 441
H+ I D ++C+ K+ I+G + V+D + +GW
Sbjct: 402 QDYLFHYGIYD--------GASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWA 453
Query: 442 KSDC 445
DC
Sbjct: 454 SYDC 457
>AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18150638-18153186 FORWARD LENGTH=583
Length = 583
Score = 115 bits (288), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 175/399 (43%), Gaps = 62/399 (15%)
Query: 96 LHYTTVQLGTP--GVKFMVALDTGSDLFWVPCD--CTRCATTDSTAFASALASDFDLSVY 151
L+YT + +G P G + + +DTGS+L W+ CD CT CA + +Y
Sbjct: 202 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN-------------QLY 248
Query: 152 SPNVSSTSKKVTCNNSLCM--HRNQCLGTFSNC---PYMVSYVSAETSTSGILVEDVLHL 206
P + V + + C+ RNQ NC Y + Y S G+L +D HL
Sbjct: 249 KPRKDNL---VRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSM-GVLTKDKFHL 304
Query: 207 TNDDNHHG-LVEANVIFGCGQVQSGSFLD-VAAPNGLFGLGMEKISVPSMLSREGFTADS 264
H+G L E++++FGCG Q G L+ + +G+ GL KIS+PS L+ G ++
Sbjct: 305 KL---HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNV 361
Query: 265 FSMCFGRD--GIGRISFGDKGSLDQDET--PFNMNPSHPTYNITVTQVRVGTTLIDLEF- 319
C D G G I G T P + Y + VT++ G ++ L+
Sbjct: 362 VGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGE 421
Query: 320 -----TALFDSGTSFTYLVDPPYTNVSESFH--SQVQDRRRPVDSKIPFEYCYDMSPDAN 372
LFD+G+S+TY + Y+ + S S ++ R D +P C+ +
Sbjct: 422 NGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPI--CWRAKTNFP 479
Query: 373 TSLLPSVS-----LAMGGGSHFAISDPVIIISTQSELVY------CLSLVKSSGLN---- 417
S L V + + GS + I ++I + L+ CL ++ S ++
Sbjct: 480 FSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGST 539
Query: 418 -IIGQNFMTGYRVVFDREKLILGWKKSDCFDDIE-DHNA 454
I+G M G+ +V+D K +GW KSDC E DHN
Sbjct: 540 IILGDISMRGHLIVYDNVKRRIGWMKSDCVRPREIDHNV 578
>AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:1762843-1766150 REVERSE LENGTH=485
Length = 485
Score = 108 bits (271), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 171/387 (44%), Gaps = 54/387 (13%)
Query: 96 LHYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPN 154
L+Y + +GTP + V +DTGSD+ WV C C +C + +L++Y+ +
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLG--------IELTLYNID 130
Query: 155 VSSTSKKVTCNNSLCMH-----RNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTN- 208
S + K V+C++ C + C S CPY+ Y +ST+G V+DV+ +
Sbjct: 131 ESDSGKLVSCDDDFCYQISGGPLSGCKANMS-CPYLEIYGDG-SSTAGYFVKDVVQYDSV 188
Query: 209 -DDNHHGLVEANVIFGCGQVQSGSF--LDVAAPNGLFGLGMEKISVPSMLSREGFTADSF 265
D +VIFGCG QSG + A +G+ G G S+ S L+ G F
Sbjct: 189 AGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIF 248
Query: 266 SMCF-GRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDL------- 317
+ C GR+G G + G + TP + P+ P YN+ +T V+VG + +
Sbjct: 249 AHCLDGRNGGGIFAIGRVVQPKVNMTP--LVPNQPHYNVNMTAVQVGQEFLTIPADLFQP 306
Query: 318 --EFTALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEY-CYDMSPDANTS 374
A+ DSGT+ YL + Y + + SQ + + K +Y C+ S +
Sbjct: 307 GDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDK---DYKCFQYSGRVDEG 363
Query: 375 LLPSVSLAMGGGSHFAISDPVIIISTQ----SELVYCLSLVKSS-------GLNIIGQNF 423
P+V+ HF S + + E ++C+ S+ + ++G
Sbjct: 364 -FPNVTF------HFENSVFLRVYPHDYLFPHEGMWCIGWQNSAMQSRDRRNMTLLGDLV 416
Query: 424 MTGYRVVFDREKLILGWKKSDCFDDIE 450
++ V++D E ++GW + +C I+
Sbjct: 417 LSNKLVLYDLENQLIGWTEYNCSSSIK 443
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 104 bits (259), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 176/394 (44%), Gaps = 59/394 (14%)
Query: 97 HYTT-VQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPN 154
+YTT + +GTP F + +D+GS + +VPC DC +C F P
Sbjct: 92 YYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKF-------------QPE 138
Query: 155 VSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHG 214
+SST + V CN M N C C Y Y + +S+ G+L ED++ N+
Sbjct: 139 MSSTYQPVKCN----MDCN-CDDDREQCVYEREY-AEHSSSKGVLGEDLISFGNESQ--- 189
Query: 215 LVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI 274
L +FGC V++G A +G+ GLG +S+ L +G ++SF +C+G +
Sbjct: 190 LTPQRAVFGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDV 248
Query: 275 GRISFGDKGSLDQDETPFNM-----NPSH-PTYNITVTQVRVG-------TTLIDLEFTA 321
G G L + P +M +P P YNI +T +RV + + D E A
Sbjct: 249 G----GGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGA 304
Query: 322 LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIP--FEYCYDMSPDANTSLL--- 376
+ DSGT++ YL D + E+ +V ++ +D P + C+ ++ S L
Sbjct: 305 VLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQ-IDGPDPNFKDTCFQVAASNYVSELSKI 363
Query: 377 -PSVSLAMGGGSHFAIS-DPVIIISTQSELVYCLSLVKSSG--LNIIGQNFMTGYRVVFD 432
PSV + G + +S + + ++ YCL + + ++G + VV+D
Sbjct: 364 FPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYD 423
Query: 433 REKLILGWKKSDCFDDIEDHNAVPTRPHSDNVPP 466
RE +G+ +++C + + R H D PP
Sbjct: 424 RENSKVGFWRTNC-------SELSDRLHIDGAPP 450
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 100 bits (250), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/411 (25%), Positives = 171/411 (41%), Gaps = 65/411 (15%)
Query: 60 LADRDRFLRGRKLSQI-GAGLAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGS 118
L D+ RFL L+ + + + + G + + + + +GTP +VALDT +
Sbjct: 54 LQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPT----YIVRANIGTPAQPMLVALDTSN 109
Query: 119 DLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHR-NQCL 176
D W+PC C C+++ ++ P+ SS+S+ + C C N
Sbjct: 110 DAAWIPCSGCVGCSSS---------------VLFDPSKSSSSRTLQCEAPQCKQAPNPSC 154
Query: 177 GTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVA 236
+C + ++Y ++ L +D L L +D V N FGC SG+ L
Sbjct: 155 TVSKSCGFNMTY--GGSTIEAYLTQDTLTLASD------VIPNYTFGCINKASGTSLPA- 205
Query: 237 APNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDG----IGRISFGDKGS-LDQDETP 291
GL GLG +S+ S + +FS C G + G K + TP
Sbjct: 206 --QGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261
Query: 292 FNMNPSHPT-YNITVTQVRVGTTLIDLEFTAL-----------FDSGTSFTYLVDPPYTN 339
NP + Y + + +RVG ++D+ +AL FDSGT +T LV+P Y
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVA 321
Query: 340 VSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGSHFAISDPVIIIS 399
V F +V++ S F+ CY + + + PSV+ G + D ++I S
Sbjct: 322 VRNEFRRRVKNAN--ATSLGGFDTCY-----SGSVVFPSVTFMFAGMNVTLPPDNLLIHS 374
Query: 400 TQSELVYCLSLVKS-----SGLNIIGQNFMTGYRVVFDREKLILGWKKSDC 445
+ L CL++ + S LN+I +RV+ D LG + C
Sbjct: 375 SAGNL-SCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:17299264-17302718 FORWARD LENGTH=631
Length = 631
Score = 100 bits (249), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 118/476 (24%), Positives = 192/476 (40%), Gaps = 78/476 (16%)
Query: 57 YSELADRDRF--LRGRKLSQIGAGLAFSDGNSTFRISSLGFLHYTT-VQLGTPGVKFMVA 113
YS L R R R R+L Q + + L +YTT + +GTP +F +
Sbjct: 37 YSSLPPRPRVEDFRRRRLHQS----QLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALI 92
Query: 114 LDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHR 172
+DTGS + +VPC C +C F P +S++ + + CN
Sbjct: 93 VDTGSTVTYVPCSTCKQCGKHQDPKF-------------QPELSTSYQALKCNPDC---- 135
Query: 173 NQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSF 232
C C Y Y +S+SG+L ED++ N+ L +FGC ++G
Sbjct: 136 -NCDDEGKLCVYERRYAEM-SSSSGVLSEDLISFGNESQ---LSPQRAVFGCENEETGDL 190
Query: 233 LDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPF 292
A +G+ GLG K+SV L +G D FS+C+G +G G L + P
Sbjct: 191 FSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVG----GGAMVLGKISPPP 245
Query: 293 NMNPSH------PTYNITVTQVRVGTTLIDL-------EFTALFDSGTSFTYLVDPPYTN 339
M SH P YNI + Q+ V + L + + DSGT++ Y +
Sbjct: 246 GMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIA 305
Query: 340 VSESFHSQVQDRRRPVDSKIPF--EYCY-----DMSPDANTSLLPSVSLAMGGGSHFAIS 392
+ ++ ++ +R + P + C+ D++ N P +++ G G +S
Sbjct: 306 IKDAVIKEIPSLKR-IHGPDPNYDDVCFSGAGRDVAEIHN--FFPEIAMEFGNGQKLILS 362
Query: 393 -DPVIIISTQSELVYCLSLVK-SSGLNIIGQNFMTGYRVVFDREKLILGWKKSDCFDDIE 450
+ + T+ YCL + ++G + V +DRE LG+ K++C D
Sbjct: 363 PENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDIWR 422
Query: 451 D----HNAVPTRPHSDNVPPALAAGLGNYPATGKTNDNSQQSTASPSSHYSFGFLV 502
+ PT P S N ++ + PAT + SP+SH F V
Sbjct: 423 RLAAPESPAPTSPISQNK----SSNISPSPAT----------SESPTSHLPGVFRV 464
>AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108781-16110679 REVERSE LENGTH=425
Length = 425
Score = 97.1 bits (240), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 159/389 (40%), Gaps = 64/389 (16%)
Query: 90 ISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCD--CTRCATTDSTAFASALASDFD 147
+ LG+ + T+ +G P + + LDTGSDL W+ CD C RC
Sbjct: 54 VYPLGYYN-VTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC----------------- 95
Query: 148 LSVYSPNVSSTSKKVTCNNSLC----MHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDV 203
L P +S + CN+ LC ++ NQ T C Y V Y +S G+LV DV
Sbjct: 96 LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSL-GVLVRDV 154
Query: 204 LHLTNDDNHHGL-VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTA 262
+ + GL + + GCG Q +G+ GLG K+S+ S L +G+
Sbjct: 155 FSM---NYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVK 211
Query: 263 DSFSMCFGRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFTAL 322
+ C G G + FGD D + + P Y+ + G L T L
Sbjct: 212 NVIGHCLSSLGGGILFFGDD---LYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGL 268
Query: 323 ------FDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPV-----DSKIPFEYCYD----- 366
FDSG+S+TY Y V+ ++ +P+ D +P C+
Sbjct: 269 KNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSG--KPLKEARDDHTLPL--CWQGRRPF 324
Query: 367 MSPDANTSLLPSVSLAMGGG----SHFAI-SDPVIIISTQSELVYCLSLVKSS-----GL 416
MS + ++L+ G + F I + +IIS + + CL ++ + L
Sbjct: 325 MSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNV--CLGILNGTEIGLQNL 382
Query: 417 NIIGQNFMTGYRVVFDREKLILGWKKSDC 445
N+IG M +++D EK +GW DC
Sbjct: 383 NLIGDISMQDQMIIYDNEKQSIGWMPVDC 411
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 95.1 bits (235), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 162/377 (42%), Gaps = 61/377 (16%)
Query: 101 VQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNVSSTS 159
+ +G P VK+ +DTGSDL W C CT C D ++ P SS+
Sbjct: 111 LSIGNPAVKYSAIVDTGSDLIWTQCKPCTECF-------------DQPTPIFDPEKSSSY 157
Query: 160 KKVTCNNSLC--MHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVE 217
KV C++ LC + R+ C C Y+ +Y +ST G+L + ++++ G
Sbjct: 158 SKVGCSSGLCNALPRSNCNEDKDACEYLYTY-GDYSSTRGLLATETFTFEDENSISG--- 213
Query: 218 ANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFT--------ADSFSMCF 269
+ FGCG G + GL GLG +S+ S L F+ +++ S F
Sbjct: 214 --IGFGCGVENEGDGFSQGS--GLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLF 269
Query: 270 GRDGIGRISFG----DKGSLDQDETP---FNMNPSHPT-YNITVTQVRVGTTLIDLEFTA 321
IG ++ G SLD + T NP P+ Y + + + VG + +E +
Sbjct: 270 ----IGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKST 325
Query: 322 -----------LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVD--SKIPFEYCYDMS 368
+ DSGT+ TYL + + + E F S++ PVD + C+ +
Sbjct: 326 FELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMS---LPVDDSGSTGLDLCFKL- 381
Query: 369 PDANTSLLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKSSGLNIIGQNFMTGYR 428
PDA ++ + G+ + +++ S V CL++ S+G++I G +
Sbjct: 382 PDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFN 441
Query: 429 VVFDREKLILGWKKSDC 445
V+ D EK + + ++C
Sbjct: 442 VLHDLEKETVSFVPTEC 458
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 91.3 bits (225), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/397 (25%), Positives = 162/397 (40%), Gaps = 79/397 (19%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALASDFDLSVYSPNVS 156
++ +++G P ++ DTGSDL WV C R + S A +V+ P S
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPA-----------TVFFPRHS 132
Query: 157 STSKKVTCNNSLCMHRNQ------CLGT--FSNCPYMVSYVSAETSTSGILVEDVLHLTN 208
ST C + +C + C T S C Y Y TSG+ + L
Sbjct: 133 STFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSL-TSGLFARETTSLKT 191
Query: 209 DDNHHGLVEANVIFGC-----GQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTAD 263
+++ V FGC GQ SG+ + A NG+ GLG IS S L R +
Sbjct: 192 SSGKEARLKS-VAFGCGFRISGQSVSGTSFNGA--NGVMGLGRGPISFASQLGRR--FGN 246
Query: 264 SFSMCF-----------------GRDGIGRISFGDKGSLDQDETPFNMNPSHPTYN---- 302
FS C G DGI ++ F TP NP PT+
Sbjct: 247 KFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFF----------TPLLTNPLSPTFYYVKL 296
Query: 303 ----ITVTQVRVGTTLIDLEFTA----LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRP 354
+ ++R+ ++ +++ + + DSGT+ +L +P Y +V + +V + P
Sbjct: 297 KSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRV---KLP 353
Query: 355 V-DSKIP-FEYCYDMSPDANT-SLLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLV 411
+ D+ P F+ C ++S +LP + GG+ F I T+ E + CL++
Sbjct: 354 IADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETE-EQIQCLAIQ 412
Query: 412 ---KSSGLNIIGQNFMTGYRVVFDREKLILGWKKSDC 445
G ++IG G+ FDR++ LG+ + C
Sbjct: 413 SVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 90.9 bits (224), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 161/374 (43%), Gaps = 44/374 (11%)
Query: 91 SSLGFLHY-TTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDL 148
S+LG +Y TV LGTP + DTGSDL W C C R T + D
Sbjct: 125 STLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR------TCY------DQKE 172
Query: 149 SVYSPNVSSTSKKVTCNNSLCMHRNQCLG-----TFSNCPYMVSYVSAETSTSGILVEDV 203
+++P+ S++ V+C+++ C + G + SNC Y + Y ++ + G L ++
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEK 231
Query: 204 LHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTAD 263
LTN D G V FGCG+ G F VA GL GLG +K+S PS +
Sbjct: 232 FTLTNSDVFDG-----VYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATAYNKIF 283
Query: 264 SFSMCFGRDGIGRISFGDKG-SLDQDETPFN-MNPSHPTYNITVTQVRVGTTLIDLEFT- 320
S+ + G ++FG G S TP + + Y + + + VG + + T
Sbjct: 284 SYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTV 343
Query: 321 -----ALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIP-FEYCYDMSPDANTS 374
AL DSGT T L Y + SF +++ + P S + + C+D+S T
Sbjct: 344 FSTPGALIDSGTVITRLPPKAYAALRSSFKAKMS--KYPTTSGVSILDTCFDLS-GFKTV 400
Query: 375 LLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKS---SGLNIIGQNFMTGYRVVF 431
+P V+ + GG+ + I + V CL+ + S I G VV+
Sbjct: 401 TIPKVAFSFSGGAVVELGSKGIFYVFKISQV-CLAFAGNSDDSNAAIFGNVQQQTLEVVY 459
Query: 432 DREKLILGWKKSDC 445
D +G+ + C
Sbjct: 460 DGAGGRVGFAPNGC 473
>AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114705-29117150 REVERSE LENGTH=466
Length = 466
Score = 89.7 bits (221), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 151/385 (39%), Gaps = 61/385 (15%)
Query: 93 LGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCD--CTRCATTDSTAFASALASDFDLSV 150
LG+ +Y + +G P F + +DTGSDL WV CD C C +
Sbjct: 64 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ------------- 109
Query: 151 YSPNVSSTSKKVTCNNSLCM-----HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDV-L 204
Y PN ++ + C++ LC C C Y + Y +S ++ ++V L
Sbjct: 110 YKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPL 165
Query: 205 HLTNDDNHHGLVEANVIFGCG-QVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTAD 263
L N ++ + FGCG Q+ G+ GLG K+ + + L G T +
Sbjct: 166 KLAN----GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKN 221
Query: 264 SFSMCFGRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRV----GTTLIDLEF 319
C G G +S GD+ T ++ + P+ N + TT +
Sbjct: 222 VIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVK-GI 280
Query: 320 TALFDSGTSFTYLVDPPYTNVSESFHSQVQ---------DRRRPV--DSKIPFEYCYDMS 368
+FDSG+S+TY Y + + + D+ PV K P + S
Sbjct: 281 NVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLK-----S 335
Query: 369 PDANTSLLPSVSLAMG---GGSHFAISDPVIIISTQSELVYCLSLVKSS-----GLNIIG 420
D +++L G G F + +I T+ V CL ++ + G NIIG
Sbjct: 336 LDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRV-CLGILNGTEIGLEGYNIIG 394
Query: 421 QNFMTGYRVVFDREKLILGWKKSDC 445
G V++D EK +GW SDC
Sbjct: 395 DISFQGIMVIYDNEKQRIGWISSDC 419
>AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114946-29117150 REVERSE LENGTH=432
Length = 432
Score = 89.4 bits (220), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 151/385 (39%), Gaps = 61/385 (15%)
Query: 93 LGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCD--CTRCATTDSTAFASALASDFDLSV 150
LG+ +Y + +G P F + +DTGSDL WV CD C C +
Sbjct: 64 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ------------- 109
Query: 151 YSPNVSSTSKKVTCNNSLCM-----HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDV-L 204
Y PN ++ + C++ LC C C Y + Y +S ++ ++V L
Sbjct: 110 YKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPL 165
Query: 205 HLTNDDNHHGLVEANVIFGCG-QVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTAD 263
L N ++ + FGCG Q+ G+ GLG K+ + + L G T +
Sbjct: 166 KLAN----GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKN 221
Query: 264 SFSMCFGRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRV----GTTLIDLEF 319
C G G +S GD+ T ++ + P+ N + TT +
Sbjct: 222 VIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVK-GI 280
Query: 320 TALFDSGTSFTYLVDPPYTNVSESFHSQVQ---------DRRRPV--DSKIPFEYCYDMS 368
+FDSG+S+TY Y + + + D+ PV K P + S
Sbjct: 281 NVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLK-----S 335
Query: 369 PDANTSLLPSVSLAMG---GGSHFAISDPVIIISTQSELVYCLSLVKSS-----GLNIIG 420
D +++L G G F + +I T+ V CL ++ + G NIIG
Sbjct: 336 LDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRV-CLGILNGTEIGLEGYNIIG 394
Query: 421 QNFMTGYRVVFDREKLILGWKKSDC 445
G V++D EK +GW SDC
Sbjct: 395 DISFQGIMVIYDNEKQRIGWISSDC 419
>AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:16787508-16789318 REVERSE LENGTH=405
Length = 405
Score = 87.8 bits (216), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 159/385 (41%), Gaps = 78/385 (20%)
Query: 101 VQLGTPGVKFMVALDTGSDLFWVPCD--CTRCATTDSTAFASALASDFDLSVYSPNVSST 158
+Q+G+P F +DTGSDL WV CD C+ C PN+
Sbjct: 53 MQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCT-------------------LPPNLQYK 93
Query: 159 SKK--VTCNNSLCM-----HRNQCLGTFSNCPYMVSYVSAETSTSGILVED--VLHLTND 209
K + C+N +C ++ C C Y V Y + S+ G LV D L L N
Sbjct: 94 PKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYAD-QGSSMGALVTDQFPLKLVNG 152
Query: 210 DNHHGLVEANVIFGCGQVQS-GSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMC 268
++ V FGCG QS S A G+ GLG KI + + L G T + C
Sbjct: 153 S----FMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHC 208
Query: 269 FGRDGIGRISFGDK--GSLDQDETP-------FNMNPSHPTYNITVTQVRVGTTLIDLEF 319
G G + FGD S+ TP + P+ +N T ++ G LI
Sbjct: 209 LSSKGGGFLFFGDNLVPSIGVAWTPLLSQDNHYTTGPADLLFNGKPTGLK-GLKLI---- 263
Query: 320 TALFDSGTSFTYLVDPPYTNVSESFHSQV---------QDRRRPVDSK--IPFEYCYDMS 368
FD+G+S+TY Y + + + +D+ P+ K PF+ ++
Sbjct: 264 ---FDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVK 320
Query: 369 PDANTSLLPSVSLAMGGG---SHFAISDPVIIISTQSELVYCLSLVKSS--GL---NIIG 420
+ ++++ G + ++ + +I +++ V CL L+ S GL N+IG
Sbjct: 321 -----NFFKTITINFTNGRRNTQLYLAPELYLIVSKTGNV-CLGLLNGSEVGLQNSNVIG 374
Query: 421 QNFMTGYRVVFDREKLILGWKKSDC 445
M G +++D EK LGW SDC
Sbjct: 375 DISMQGLMMIYDNEKQQLGWVSSDC 399
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 86.7 bits (213), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/375 (23%), Positives = 144/375 (38%), Gaps = 46/375 (12%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALASDFDLSVYSPNVS 156
++T +++GTP KF V +DTGS+L WV C A D + V+ + S
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCR------------YRARGKD-NRRVFRADES 152
Query: 157 STSKKVTCNNSLCMHRNQCLGTFSNCP-------YMVSYVSAETSTSGILVEDVLHLTND 209
+ K V C C L + + CP Y Y ++ G+ ++ + +
Sbjct: 153 KSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADG-SAAQGVFAKETITVGLT 211
Query: 210 DNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF 269
+ + ++I GC +G A +G+ GL S S + FS C
Sbjct: 212 NGRMARLPGHLI-GCSSSFTGQSFQGA--DGVLGLAFSDFSFTSTAT--SLYGAKFSYCL 266
Query: 270 -----GRDGIGRISFGDKGSLD---QDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFTA 321
++ + FG S + TP ++ P Y I V + +G ++D+
Sbjct: 267 VDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQV 326
Query: 322 ---------LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDAN 372
+ DSGTS T L D Y V + + +R +P EYC+ + N
Sbjct: 327 WDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFN 386
Query: 373 TSLLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKSS--GLNIIGQNFMTGYRVV 430
S LP ++ + GG+ F ++ + V CL V + N+IG Y
Sbjct: 387 VSKLPQLTFHLKGGARFEPHRKSYLVDA-APGVKCLGFVSAGTPATNVIGNIMQQNYLWE 445
Query: 431 FDREKLILGWKKSDC 445
FD L + S C
Sbjct: 446 FDLMASTLSFAPSAC 460
>AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14682210-14683484 REVERSE LENGTH=424
Length = 424
Score = 85.5 bits (210), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 160/368 (43%), Gaps = 53/368 (14%)
Query: 101 VQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALASDFDLSVYSPNVSSTSK 160
+ +G P V ++ +DTGSDL W+ C +C + + P+ SST +
Sbjct: 82 ISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQ-------------TIPFFHPSRSSTYR 128
Query: 161 KVTCNNS----LCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLV 216
+C ++ + R++ G NC Y + Y ++T GIL E+ L D+ GL+
Sbjct: 129 NASCVSAPHAMPQIFRDEKTG---NCQYHLRYRDF-SNTRGILAEEKLTFETSDD--GLI 182
Query: 217 -EANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI- 274
+ N++FGCGQ SG +G+ GLG S+ ++R FS CFG
Sbjct: 183 SKQNIVFGCGQDNSG----FTKYSGVLGLGPGTFSI---VTRN--FGSKFSYCFGSLTNP 233
Query: 275 ----GRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLE----------FT 320
+ G+ ++ D TP + Y + + + G L+D+E
Sbjct: 234 TYPHNILILGNGAKIEGDPTPLQI--FQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGG 291
Query: 321 ALFDSGTSFTYLVDPPYTNVSESFHSQVQD-RRRPVDSKIPFEYCYDMSPDANTSLLPSV 379
+ D+G S T L Y +SE + + RR D CY+ + + P V
Sbjct: 292 TVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVV 351
Query: 380 SLAMGGGSHFAISDPVIIISTQSELVYCLSLVKSS--GLNIIGQNFMTGYRVVFDREKLI 437
+ GG+ A+ + +S++S +CL++ ++ +++IG Y V ++ +
Sbjct: 352 TFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMK 411
Query: 438 LGWKKSDC 445
+ ++++DC
Sbjct: 412 VYFQRTDC 419
>AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:18241003-18242478 FORWARD LENGTH=491
Length = 491
Score = 83.6 bits (205), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 111/432 (25%), Positives = 162/432 (37%), Gaps = 105/432 (24%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPC-----DCTRCATTDSTAFASALASDFDLSVY 151
+ T+ +GTP V LDTGSDL WVPC DC C + S SV+
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSP-------SVF 135
Query: 152 SPNVSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETST---------------- 195
SP SSTS + +C +S C+ + F C VS +
Sbjct: 136 SPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEG 195
Query: 196 ---SGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVP 252
SGIL D+L D FGC V S P G+ G G +S+P
Sbjct: 196 GLISGILTRDILKARTRD------VPRFSFGC--VTS----TYREPIGIAGFGRGLLSLP 243
Query: 253 SMLSREGFTADSFSMCFGR--------------DGIGRISFGDKGSLDQDETPFNMNPSH 298
S L GF FS CF G +S SL TP P +
Sbjct: 244 SQL---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSL--QFTPMLNTPMY 298
Query: 299 P-TYNITVTQVRVGTTLIDLEFT-------------ALFDSGTSFTYLVDPPYTNVSESF 344
P +Y I + + +GT + + L DSGT++T+L +P Y+ + +
Sbjct: 299 PNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTL 358
Query: 345 HSQVQ-DRRRPVDSKIPFEYCYDMS-PDANTS--------LLPSVS---------LAMGG 385
S + R +S+ F+ CY + P+ N + + PS++ L G
Sbjct: 359 QSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQG 418
Query: 386 GSHFAISDPVIIISTQSELVYCLSLVKSSG-----LNIIGQNFMTGYRVVFDREKLILGW 440
S +A+S P + +V CL + G +VV+D EK +G+
Sbjct: 419 NSFYAMSAP-----SDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGF 473
Query: 441 KKSDCFDDIEDH 452
+ DC + H
Sbjct: 474 QAMDCVLEAASH 485
>AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108928-16110670 REVERSE LENGTH=401
Length = 401
Score = 83.2 bits (204), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 151/379 (39%), Gaps = 59/379 (15%)
Query: 90 ISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCD--CTRCATTDSTAFASALASDFD 147
+ LG+ + T+ +G P + + LDTGSDL W+ CD C RC
Sbjct: 51 VYPLGY-YNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC----------------- 92
Query: 148 LSVYSPNVSSTSKKVTCNNSLC----MHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDV 203
L P +S + CN+ LC ++ NQ T C Y V Y +S G+LV DV
Sbjct: 93 LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSL-GVLVRDV 151
Query: 204 LHLTNDDNHHGL-VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTA 262
+ + GL + + GCG Q +G+ GLG K+S+ S L +G+
Sbjct: 152 FSM---NYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVK 208
Query: 263 DSFSMCFGRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFTAL 322
+ C G G + FGD D + + P Y+ + G L T L
Sbjct: 209 NVIGHCLSSLGGGILFFGDD---LYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGL 265
Query: 323 ------FDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPV-----DSKIPFEYCYD----- 366
FDSG+S+TY Y V+ ++ +P+ D +P C+
Sbjct: 266 KNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSG--KPLKEARDDHTLPL--CWQGRRPF 321
Query: 367 MSPDANTSLLPSVSLAMGGG----SHFAI-SDPVIIISTQSELVYCLSLVKSSGLNIIGQ 421
MS + ++L+ G + F I + +IIS + + CL ++ + + +
Sbjct: 322 MSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNV--CLGILNGTEIGLQNL 379
Query: 422 NFMTGYRVVFDREKLILGW 440
N + G + + L W
Sbjct: 380 NLIGGTVFILHTLAISLSW 398
>AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:10185229-10186605 REVERSE LENGTH=458
Length = 458
Score = 83.2 bits (204), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 104/471 (22%), Positives = 183/471 (38%), Gaps = 62/471 (13%)
Query: 8 TSIFLF----FFSVWGCCHGQIYTFTMHHRYSEAVKKWSHSAAGAPEKGTFEYYSELADR 63
TS+ LF +F V + HR S A + PE + R
Sbjct: 7 TSLLLFITVSYFVVTESIKPNRMAMKLIHRESVARLNPNARVPITPEDHIKHLTDISSAR 66
Query: 64 DRFLRGRKLSQIGAGLAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWV 123
++L+ ++G+ D + S L +G P V + +DTGS L W+
Sbjct: 67 FKYLQNSIDKELGSSNFQVDVEQAIKTS----LFLVNFSVGQPPVPQLTIMDTGSSLLWI 122
Query: 124 PCD-CTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHR-NQCLGTFSN 181
C C C++ V++P +SST + +C++ C + N G+ +
Sbjct: 123 QCQPCKHCSSDHMIH-----------PVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNK 171
Query: 182 CPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGL 241
C Y Y+S T + G+L ++ L T N + +V + FGCG ++G L+ + G+
Sbjct: 172 CVYEQVYISG-TGSKGVLAKERLTFTT-PNGNTVVTQPIAFGCG-YENGEQLE-SHFTGI 227
Query: 242 FGLGMEKISVPSMLSREGFTADSFSMCFG-----RDGIGRISFGDKGSLDQDETPFNMNP 296
GLG + S+ L FS C G G ++ G+ + D TP
Sbjct: 228 LGLGAKPTSLAVQL------GSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFET 281
Query: 297 SHPTYNITVTQVRVGTTLIDLEFT----------ALFDSGTSFTYLVDPPYTNVSESFHS 346
+ Y + + + VG T +++E + DSGT +T+L D Y ++
Sbjct: 282 ENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAY----RELYN 337
Query: 347 QVQDRRRPVDSKIPFE--YCYDMSPDANTSLLPSVSLAMGGGSHFAISDPVI---IISTQ 401
+++ P + F CY P V+ GG+ A+ + +
Sbjct: 338 EIKSILDPKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLSEPN 397
Query: 402 SELVYCLSL--VKSSG-----LNIIGQNFMTGYRVVFDREKLILGWKKSDC 445
+ V+C+S+ K G IG Y + +D ++ + ++ DC
Sbjct: 398 TFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDC 448
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 79.3 bits (194), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 111/467 (23%), Positives = 189/467 (40%), Gaps = 96/467 (20%)
Query: 29 TMHHRYSEAVKKWSH----------SAAGAPEKGTFEYYSELADRDRFLRGRKLSQIGAG 78
T+H R++++ K+ + S GAPE G+ ++ + +G
Sbjct: 108 TLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSP---------------GKLIATLESG 152
Query: 79 LAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTA 137
+ G ++ V +GTP F + LDTGSDL W+ C C C +
Sbjct: 153 MTLGSGE-----------YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGM- 200
Query: 138 FASALASDFDLSVYSPNVSSTSKKVTCNNSLCMHRN------QCLGTFSNCPYMVSYVSA 191
Y P S++ K +TCN+ C + QC +CPY Y
Sbjct: 201 ------------FYDPKTSASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDR 248
Query: 192 ETSTSGILVED-VLHLTNDDNHHGLVE-ANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKI 249
+T VE ++LT + + N++FGCG G F + L GLG +
Sbjct: 249 SNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASG---LLGLGRGPL 305
Query: 250 SVPSMLSREGFTADSFSMCF-GRDGIGRIS----FG-DKGSLDQDETPF------NMNPS 297
S S L + SFS C R+ +S FG DK L+ F N
Sbjct: 306 SFSSQL--QSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSV 363
Query: 298 HPTYNITVTQVRVGTTLIDL-----------EFTALFDSGTSFTYLVDPPYTNVSESFHS 346
Y I + + VG +D+ + + DSGT+ +Y +P Y + F
Sbjct: 364 ETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAE 423
Query: 347 QVQDRRRPVDSKIP-FEYCYDMSP-DANTSLLPSVSLAMGGGS--HFAISDPVIIISTQS 402
++++ P+ P + C+++S + N LP + +A G+ +F + I + S
Sbjct: 424 KMKE-NYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWL---S 479
Query: 403 ELVYCLSLVKS--SGLNIIGQNFMTGYRVVFDREKLILGWKKSDCFD 447
E + CL+++ + S +IIG + +++D ++ LG+ + C D
Sbjct: 480 EDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCAD 526
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 79.0 bits (193), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 98/417 (23%), Positives = 163/417 (39%), Gaps = 54/417 (12%)
Query: 56 YYSELADRDRFLRGRKLSQIGAGLAFSDGNST----FRISSLGFLHYTTVQLGTPGVKFM 111
+Y+ + + LR + F++ ++T ++S + V +GTP M
Sbjct: 45 FYNPMETSSQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIM 104
Query: 112 VALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCM 170
DTGSDL W C C C T F P SST K V+C++S C
Sbjct: 105 AIADTGSDLLWTQCAPCDDCYTQVDPLF-------------DPKTSSTYKDVSCSSSQCT 151
Query: 171 ---HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVEANVIFGCGQV 227
++ C + C Y +SY + T G + D L L + D ++ N+I GCG
Sbjct: 152 ALENQASCSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRPMQLK-NIIIGCGHN 209
Query: 228 QSGSFLDVAAPNGLFGLGMEKISVP-SMLSREGFTAD-SFSMCF-----GRDGIGRISFG 280
+G+F N + P S++ + G + D FS C +D +I+FG
Sbjct: 210 NAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFG 263
Query: 281 DKGSLDQD---ETPFNMNPSHPT-YNITVTQVRVGTTLIDLEFTA--------LFDSGTS 328
+ TP S T Y +T+ + VG+ I + + DSGT+
Sbjct: 264 TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTT 323
Query: 329 FTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGSH 388
T L Y+ + ++ S + D + D + CY + D +P +++ G
Sbjct: 324 LTLLPTEFYSELEDAVASSI-DAEKKQDPQSGLSLCYSATGDLK---VPVITMHFDGADV 379
Query: 389 FAISDPVIIISTQSELVYCLSLVKSSGLNIIGQNFMTGYRVVFDREKLILGWKKSDC 445
S + SE + C + S +I G + V +D + +K +DC
Sbjct: 380 KLDSSNAFV--QVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434
>AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11936203-11937390 REVERSE LENGTH=395
Length = 395
Score = 78.2 bits (191), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/360 (24%), Positives = 150/360 (41%), Gaps = 53/360 (14%)
Query: 101 VQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNVSSTS 159
+Q+GTP + LDTGS+ W C C C + TA ++ P+ SST
Sbjct: 69 LQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCY--NQTA-----------PIFDPSKSSTF 115
Query: 160 KKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVEAN 219
K++ C+ H + +CPY + Y ++ T G LV + + + + + V
Sbjct: 116 KEIRCDT----HDH-------SCPYELVY-GGKSYTKGTLVTETVTI-HSTSGQPFVMPE 162
Query: 220 VIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISF 279
I GCG+ SG A G+ GL S+ + + E S CF G +I+F
Sbjct: 163 TIIGCGRNNSGFKPGFA---GVVGLDRGPKSLITQMGGE--YPGLMSYCFAGKGTSKINF 217
Query: 280 GDKGSLDQD----ETPFNMNPSHPTYNITVTQVRVGTTLID--------LEFTALFDSGT 327
G + D T F Y + + V VG T I+ L+ + DSG+
Sbjct: 218 GANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGS 277
Query: 328 SFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGS 387
+ TY + Y N+ QV R S I CY + P +++ GG+
Sbjct: 278 TLTYFPE-SYCNLVRKAVEQVVTAVRFPRSDI---LCYY---SKTIDIFPVITMHFSGGA 330
Query: 388 HFAISDPVIIISTQSELVYCLSLVKSSGLN--IIGQNFMTGYRVVFDREKLILGWKKSDC 445
+ + +++ + V+CL+++ +S + I G + V +D L++ +K ++C
Sbjct: 331 DLVLDKYNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 390
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 78.2 bits (191), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 156/367 (42%), Gaps = 49/367 (13%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFD-LSVYSPN 154
++T V +G P + + LDTGSD+ W+ C C C F + +S ++ LS +P
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 207
Query: 155 VSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHG 214
++ + C N+ C++ ++ + Y V + ET T G
Sbjct: 208 CNAL-EVSECRNATCLYEV----SYGDGSYTVGDFATETLTIG---------------ST 247
Query: 215 LVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF---GR 271
LV+ NV GCG G F+ A L GLG +++PS L+ SFS C
Sbjct: 248 LVQ-NVAVGCGHSNEGLFVGAAG---LLGLGGGLLALPSQLN-----TTSFSYCLVDRDS 298
Query: 272 DGIGRISFGDKGSLDQDETPFNMNPSHPT-YNITVTQVRVGTTLIDLEFTA--------- 321
D + FG S D P N T Y + +T + VG L+ + ++
Sbjct: 299 DSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSG 358
Query: 322 --LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSV 379
+ DSGT+ T L Y ++ +SF D + + F+ CY++S T +P+V
Sbjct: 359 GIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAM-FDTCYNLSAK-TTVEVPTV 416
Query: 380 SLAMGGGSHFAISDPVIIISTQSELVYCLSLV-KSSGLNIIGQNFMTGYRVVFDREKLIL 438
+ GG A+ +I S +CL+ +S L IIG G RV FD ++
Sbjct: 417 AFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLI 476
Query: 439 GWKKSDC 445
G+ + C
Sbjct: 477 GFSSNKC 483
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 77.4 bits (189), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/395 (22%), Positives = 154/395 (38%), Gaps = 69/395 (17%)
Query: 100 TVQLGTPGVKFMVALDTGSDLFWVPCDCTR-CATTDSTAFASALASDFDLSVYSPNVSST 158
++ GTP DTGS L W+PC C+ D + L F P SS+
Sbjct: 93 SLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRF-----IPKNSSS 147
Query: 159 SKKVTCNNSLCM----HRNQCLG---TFSNC-----PYMVSYVSAETSTSGILVEDVLHL 206
SK + C + C QC G NC PY++ Y ST+G+L+ + L
Sbjct: 148 SKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQY--GLGSTAGVLITEKLDF 205
Query: 207 TNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFS 266
+ L + + GC S + P G+ G G +S+PS ++ + F+ S
Sbjct: 206 PD------LTVPDFVVGC------SIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCLVS 253
Query: 267 MCFGRDGIGRISFGDKGSLDQDE--------TPFNMNPSHPT------YNITVTQVRVGT 312
F + D GS TPF NP+ Y + + ++ VG
Sbjct: 254 RRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGR 313
Query: 313 TLIDLEFT-----------ALFDSGTSFTYLVDPPYTNVSESFHSQVQD--RRRPVDSKI 359
+ + + ++ DSG++FT++ P + V+E F SQ+ + R + ++ +
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373
Query: 360 PFEYCYDMSPDANTSLLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKSSGLN-- 417
C+++S + + +P + GG+ + + CL++V +N
Sbjct: 374 GLGPCFNISGKGDVT-VPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPS 432
Query: 418 -------IIGQNFMTGYRVVFDREKLILGWKKSDC 445
I+G Y V +D E G+ K C
Sbjct: 433 GGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 77.4 bits (189), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 145/378 (38%), Gaps = 64/378 (16%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNV 155
++T + +GTP + LDTGSD+ W+ C C RC + ++ P
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSD-------------PIFDPRK 188
Query: 156 SSTSKKVTCNNSLCMHRNQ--CLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHH 213
S T + C++ C + C C Y VSY + E + N
Sbjct: 189 SKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNR---- 244
Query: 214 GLVEANVIFGCGQVQSGSFLDVAAPN----------GLFGLGMEKISVPSMLSREGFTAD 263
V GCG G F+ A G G + ++ R +
Sbjct: 245 ---VKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKP 301
Query: 264 SFSMCFGRDGIGRISFGDKGSLDQDETPFNMNPSHPT-YNITVTQVRVGTTLIDLEFTAL 322
S S+ FG + RI+ TP NP T Y + + + VG T + +L
Sbjct: 302 S-SVVFGNAAVSRIA---------RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASL 351
Query: 323 F------------DSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPD 370
F DSGTS T L+ P Y + ++F + +R D + F+ C+D+S +
Sbjct: 352 FKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSL-FDTCFDLS-N 409
Query: 371 ANTSLLPSVSLAMGGGSHFAISDPVI--IISTQSELVYCLSLVKS-SGLNIIGQNFMTGY 427
N +P+V L G +S P +I + +C + + GL+IIG G+
Sbjct: 410 MNEVKVPTVVLHFRGAD---VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGF 466
Query: 428 RVVFDREKLILGWKKSDC 445
RVV+D +G+ C
Sbjct: 467 RVVYDLASSRVGFAPGGC 484
>AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3400671-3402165 REVERSE LENGTH=464
Length = 464
Score = 75.9 bits (185), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 152/362 (41%), Gaps = 42/362 (11%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPCD-CT-RCATTDSTAFASALASDFDLSVYSPN 154
+ T+ +GTP + DTGSDL W C+ C C + F +P+
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKF-------------NPS 178
Query: 155 VSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHG 214
SST + V+C++ +C C + SNC Y + Y ++ T G L ++ LTN D
Sbjct: 179 SSSTYQNVSCSSPMCEDAESC--SASNCVYSIVY-GDKSFTQGFLAKEKFTLTNSD---- 231
Query: 215 LVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMC---FGR 271
V +V FGCG+ G F D A G G + + + FS C F
Sbjct: 232 -VLEDVYFGCGENNQGLF-DGVAGLLGLGPGKLSLPAQTTTTYNNI----FSYCLPSFTS 285
Query: 272 DGIGRISFGDKGSLDQDE-TPFNMNPSHPTYNITVTQVRVGTTLIDL---EFT---ALFD 324
+ G ++FG G + + TP + PS Y I + + VG + + F+ A+ D
Sbjct: 286 NSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIID 345
Query: 325 SGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMG 384
SGT FT L Y + F ++ + + F+ CYD + +T P+++ +
Sbjct: 346 SGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGL-FDTCYDFT-GLDTVTYPTIAFSFA 403
Query: 385 GGSHFAISDPVIIISTQSELVYCLSLVKSSGL-NIIGQNFMTGYRVVFDREKLILGWKKS 443
G + + I + + V CL+ + L I G T VV+D +G+ +
Sbjct: 404 GSTVVELDGSGISLPIKISQV-CLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPN 462
Query: 444 DC 445
C
Sbjct: 463 GC 464
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 75.9 bits (185), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 148/367 (40%), Gaps = 46/367 (12%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNV 155
+++ + +GTP + + LDTGSD+ W+ C+ C C SD V++P
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQ----------QSD---PVFNPTS 208
Query: 156 SSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGL 215
SST K +TC+ C + C Y VSY + T G L D + N +
Sbjct: 209 SSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDG-SFTVGELATDTVTFGNSGKIN-- 265
Query: 216 VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIG 275
NV GCG G F A V S+ ++ A SFS C G
Sbjct: 266 ---NVALGCGHDNEGLFTGAAGLL------GLGGGVLSITNQ--MKATSFSYCLVDRDSG 314
Query: 276 RISFGDKGSLD---QDET-PFNMNPSHPT--------YNITVTQVRVGTTLIDLEFTA-- 321
+ S D S+ D T P N T +++ +V + + D++ +
Sbjct: 315 KSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG 374
Query: 322 --LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSV 379
+ D GT+ T L Y ++ ++F + ++ S F+ CYD S +T +P+V
Sbjct: 375 GVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFS-SLSTVKVPTV 433
Query: 380 SLAMGGGSHFAISDPVIIISTQSELVYCLSLV-KSSGLNIIGQNFMTGYRVVFDREKLIL 438
+ GG + +I +C + SS L+IIG G R+ +D K ++
Sbjct: 434 AFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVI 493
Query: 439 GWKKSDC 445
G + C
Sbjct: 494 GLSGNKC 500
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 75.5 bits (184), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 159/386 (41%), Gaps = 56/386 (14%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNV 155
++ V +G+P F + LDTGSDL W+ C C C + + Y P
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNG-------------AFYDPKA 216
Query: 156 SSTSKKVTCNNSLCMHRNQ------CLGTFSNCPYMVSYVSAETSTSGILVED-VLHLTN 208
S++ K +TCN+ C + C +CPY Y + +T VE ++LT
Sbjct: 217 SASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 276
Query: 209 DDNHHGLVEA-NVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSM 267
+ L N++FGCG G F A L GLG +S S L + SFS
Sbjct: 277 NGGSSELYNVENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSY 331
Query: 268 CF-GRDGIGRIS----FG-DKGSLDQDETPF------NMNPSHPTYNITVTQVRVGTTLI 315
C R+ +S FG DK L F N Y + + + V ++
Sbjct: 332 CLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVL 391
Query: 316 DL-EFT----------ALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIP-FEY 363
++ E T + DSGT+ +Y +P Y + + + + PV P +
Sbjct: 392 NIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG-KYPVYRDFPILDP 450
Query: 364 CYDMSPDANTSLLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKS--SGLNIIGQ 421
C+++S N LP + +A G+ + I +LV CL+++ + S +IIG
Sbjct: 451 CFNVSGIHNVQ-LPELGIAFADGAVWNFPTENSFIWLNEDLV-CLAMLGTPKSAFSIIGN 508
Query: 422 NFMTGYRVVFDREKLILGWKKSDCFD 447
+ +++D ++ LG+ + C D
Sbjct: 509 YQQQNFHILYDTKRSRLGYAPTKCAD 534
>AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11930579-11931769 REVERSE LENGTH=396
Length = 396
Score = 73.2 bits (178), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 143/368 (38%), Gaps = 58/368 (15%)
Query: 96 LHYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPN 154
++ +Q+GTP + +DTGS++ W C C C ++ ++ P+
Sbjct: 64 VYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNA-------------PIFDPS 110
Query: 155 VSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHG 214
SST K+ C+ +CPY V Y T T G L + + L + +
Sbjct: 111 KSSTFKEKRCDG-------------HSCPYEVDYFD-HTYTMGTLATETITL-HSTSGEP 155
Query: 215 LVEANVIFGCGQVQS---GSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGR 271
V I GCG S SF +G+ GL S+ + + E S CF
Sbjct: 156 FVMPETIIGCGHNNSWFKPSF------SGMVGLNWGPSSLITQMGGE--YPGLMSYCFSG 207
Query: 272 DGIGRISFGDKGSLDQD---ETPFNMNPSHPT-YNITVTQVRVGTTLID--------LEF 319
G +I+FG + D T M + P Y + + V VG T I+ LE
Sbjct: 208 QGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEG 267
Query: 320 TALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSV 379
+ DSGT+ TY Y N+ V R D CY+ + P +
Sbjct: 268 NIVIDSGTTLTYF-PVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYN---SDTIDIFPVI 323
Query: 380 SLAMGGGSHFAISDPVIIISTQSELVYCLSLVKSSGLN--IIGQNFMTGYRVVFDREKLI 437
++ GG + + + + + V+CL+++ +S I G + V +D L+
Sbjct: 324 TMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLL 383
Query: 438 LGWKKSDC 445
+ + ++C
Sbjct: 384 VSFSPTNC 391
>AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6978746-6980158 REVERSE LENGTH=470
Length = 470
Score = 73.2 bits (178), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 139/367 (37%), Gaps = 45/367 (12%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNV 155
++ + +G+P + +D+GSD+ WV C C C V+ P
Sbjct: 131 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSD-------------PVFDPAK 177
Query: 156 SSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGL 215
S + V+C +S+C C Y V Y + T G L + L
Sbjct: 178 SGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDG-SYTKGTLALETLTFAK------T 230
Query: 216 VEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF---GRD 272
V NV GCG G F+ A G+ G M + S G T +F C G D
Sbjct: 231 VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLS-----GQTGGAFGYCLVSRGTD 285
Query: 273 GIGRISFGDKG-SLDQDETPFNMNPSHPTY--------NITVTQVRVGTTLIDLEFTA-- 321
G + FG + + P NP P++ + ++ + + DL T
Sbjct: 286 STGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDG 345
Query: 322 --LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSV 379
+ D+GT+ T L Y + F SQ + R I F+ CYD+S + +P+V
Sbjct: 346 GVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI-FDTCYDLSGFVSVR-VPTV 403
Query: 380 SLAMGGGSHFAISDPVIIISTQSELVYCLSLVKS-SGLNIIGQNFMTGYRVVFDREKLIL 438
S G + ++ YC + S +GL+IIG G +V FD +
Sbjct: 404 SFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFV 463
Query: 439 GWKKSDC 445
G+ + C
Sbjct: 464 GFGPNVC 470
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 72.8 bits (177), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 98/424 (23%), Positives = 170/424 (40%), Gaps = 77/424 (18%)
Query: 48 APEKGTFEYYSELADRDRFLRGRKLSQIGAGLAFSDGNSTFRISSLGFLHYTTVQLGTPG 107
A ++ +Y S L + GR + I +G ++T+ + +L +GTP
Sbjct: 82 AQDQARLQYLSSL------VAGRSVVPIASGRQMLQ-STTYIVKAL---------IGTPA 125
Query: 108 VKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNN 166
++A+DT SD+ W+PC C C + +TAF+ A ++ F K V+C+
Sbjct: 126 QPLLLAMDTSSDVAWIPCSGCVGCPS--NTAFSPAKSTSF-------------KNVSCSA 170
Query: 167 SLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVEANVIFGC-G 225
C C + ++Y S +S + L +D + L D ++A FGC
Sbjct: 171 PQCKQVPNPTCGARACSFNLTYGS--SSIAANLSQDTIRLAADP-----IKA-FTFGCVN 222
Query: 226 QVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSL 285
+V G + P LG+ + + M + +FS C ++F L
Sbjct: 223 KVAGGGTI----PPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCL--PSFRSLTFSGSLRL 276
Query: 286 DQDETPFNM-------NPSHPT-YNITVTQVRVGTTLIDLEFTA-----------LFDSG 326
P + NP + Y + + +RVG ++DL A +FDSG
Sbjct: 277 GPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSG 336
Query: 327 TSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGG 386
T +T L P Y V F +V+ V S F+ CY + +P+++ G
Sbjct: 337 TVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY-----SGQVKVPTITFMFKGV 391
Query: 387 SHFAISDPVIIISTQSELVYCLSLVKS-----SGLNIIGQNFMTGYRVVFDREKLILGWK 441
+ +D +++ ST CL++ + S +N+I +RV+ D LG
Sbjct: 392 NMTMPADNLMLHSTAGS-TSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLA 450
Query: 442 KSDC 445
+ C
Sbjct: 451 RERC 454
>AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:14665728-14669135 REVERSE LENGTH=430
Length = 430
Score = 70.9 bits (172), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/374 (23%), Positives = 150/374 (40%), Gaps = 71/374 (18%)
Query: 93 LGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVY 151
L L+YTTVQ+GTP + V +DTGSDL WV C+ C C + T F
Sbjct: 74 LSALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSCVGCPLHNVTFF------------- 120
Query: 152 SPNVSSTSKKVTCNNSLC---MHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHL-T 207
P SS++ K+ C++ C + + +C Y V Y TSG + D++ T
Sbjct: 121 DPGASSSAVKLACSDKRCSSDLQKKSRCSLLESCTYKVEYGDGSV-TSGYYISDLISFDT 179
Query: 208 NDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSM 267
D + N + V+ G+ + G F P++ S T S +
Sbjct: 180 MSDWTYIAFRDNSTWH-PWVRQGAII------GTF---------PALCSTPCSTVSSQPL 223
Query: 268 CFGRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLEFTALFDSGT 327
+ F ++ ++ ++PS V V G + + DSGT
Sbjct: 224 YYNPQ------FSHMMTVAVNDLRLPIDPS-------VFSVAKG-------YGTIIDSGT 263
Query: 328 SFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFE--YCYDMSPDANTSLL-----PSVS 380
+ + Y + ++ + V RP IP+E C++++ ++ L+ P V
Sbjct: 264 TLVHFPGEAYDPLIQAILNVVSQYGRP----IPYESFQCFNITSGISSHLVIADMFPEVH 319
Query: 381 LAMGGGSHFAISDPVIIISTQSEL---VYCLSLVKSSG--LNIIGQNFMTGYRVVFDREK 435
L GG+ I + +L ++CL S+ + IIG+ + V+D +
Sbjct: 320 LGFAGGASMVIKPEAYLFQKFLDLTNAIWCLGFYSSTSRRITIIGEVAIRDKMFVYDLDH 379
Query: 436 LILGWKKSDCFDDI 449
+GW + +C D+
Sbjct: 380 QRIGWAEYNCSLDV 393
>AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14685602-14686885 FORWARD LENGTH=427
Length = 427
Score = 70.1 bits (170), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 96/403 (23%), Positives = 158/403 (39%), Gaps = 62/403 (15%)
Query: 61 ADRDRFLRGRKLSQIGAGLAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDL 120
+R +L+ + I A L+ I FL + +G+P + ++ +DT SDL
Sbjct: 56 VERLEYLKAKTTGDIIAHLS-----PNVPIIPQAFL--VNISIGSPPITQLLHMDTASDL 108
Query: 121 FWVPC-DCTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNS-LCMHRNQCLGT 178
W+ C C C +A +L ++ P+ S T + TC S M +
Sbjct: 109 LWIQCLPCINC-------YAQSLP------IFDPSRSYTHRNETCRTSQYSMPSLKFNAN 155
Query: 179 FSNCPYMVSYVSAETSTSGILVEDVLHL-TNDDNHHGLVEANVIFGCGQVQSGSFLDVAA 237
+C Y + YV +T + GIL ++L T D +V+FGCG G L
Sbjct: 156 TRSCEYSMRYVD-DTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYGEPL---V 211
Query: 238 PNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDG-----IGRISFGDKGS-LDQDETP 291
G+ GLG + S+ ++ FS CFG + GD G+ + D TP
Sbjct: 212 GTGILGLGYGEFSLVHRFGKK------FSYCFGSLDDPSYPHNVLVLGDDGANILGDTTP 265
Query: 292 FNMNPSHPTYNITVTQVRVGTTLIDLEFTA------------LFDSGTSFTYLVDPPY-- 337
++ Y +T+ + V ++ ++ + D+G S T LV+ Y
Sbjct: 266 LEIHNGF--YYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKP 323
Query: 338 --TNVSESFHSQVQDRRRPVDSKIPFEYCY--DMSPDANTSLLPSVSLAMGGGSHFAISD 393
+ + F + D I E CY + D S P V+ G+ ++ D
Sbjct: 324 LKNRIEDIFEGRFTAADVSQDDMIKME-CYNGNFERDLVESGFPIVTFHFSEGAELSL-D 381
Query: 394 PVIIISTQSELVYCLSLVKSSGLNIIGQNFMTGYRVVFDREKL 436
+ S V+CL+ V LN IG Y + +D E +
Sbjct: 382 VKSLFMKLSPNVFCLA-VTPGNLNSIGATAQQSYNIGYDLEAM 423
>AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:22880074-22881525 REVERSE LENGTH=483
Length = 483
Score = 69.7 bits (169), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 144/378 (38%), Gaps = 58/378 (15%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRC-ATTDSTAFASALASDFDLSVYSPN 154
++ + +GTP + LDTGSD+ W+ C C C TD +++ P
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTD--------------AIFDPK 180
Query: 155 VSSTSKKVTCNNSLCMH---RNQCLGTFSN-CPYMVSYVSAETSTSGILVEDVLHLTNDD 210
S T V C + LC ++C+ S C Y VSY + E +
Sbjct: 181 KSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETL------- 233
Query: 211 NHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF- 269
HG +V GCG G F+ A GL +S PS FS C
Sbjct: 234 TFHGARVDHVPLGCGHDNEGLFVGAAGLLGLG---RGGLSFPSQTKNR--YNGKFSYCLV 288
Query: 270 -------GRDGIGRISFGDKGSLDQDE-TPFNMNPSHPT-YNITVTQVRVGTTLI----- 315
I FG+ TP NP T Y + + + VG + +
Sbjct: 289 DRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSE 348
Query: 316 ---DLEFTA----LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMS 368
L+ T + DSGTS T L P Y + ++F +R + F+ C+D+S
Sbjct: 349 SQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSL-FDTCFDLS 407
Query: 369 PDANTSLLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKSSG-LNIIGQNFMTGY 427
T +P+V GGG ++ +I +E +C + + G L+IIG G+
Sbjct: 408 -GMTTVKVPTVVFHFGGG-EVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGF 465
Query: 428 RVVFDREKLILGWKKSDC 445
RV +D +G+ C
Sbjct: 466 RVAYDLVGSRVGFLSRAC 483
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 68.9 bits (167), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 101/388 (26%), Positives = 154/388 (39%), Gaps = 77/388 (19%)
Query: 100 TVQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALASDFDLSVYSPNVSSTS 159
T+ +G P + LDTGS+L W+ C S SV++P SST
Sbjct: 68 TLAVGDPPQNISMVLDTGSELSWLHCK----------------KSPNLGSVFNPVSSSTY 111
Query: 160 KKVTCNNSLCMHRNQCLGTFSNCP-------YMVSYVSAETSTSGILVEDVLHLTNDDNH 212
V C++ +C R + L ++C +SY A TS G L + + +
Sbjct: 112 SPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADA-TSIEGNLAHETFVIGS---- 166
Query: 213 HGLVEANVIFGC--GQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF- 269
+ +FGC + S S D A GL G+ +S + L GF+ FS C
Sbjct: 167 --VTRPGTLFGCMDSGLSSNSEED-AKSTGLMGMNRGSLSFVNQL---GFS--KFSYCIS 218
Query: 270 GRDGIGRISFGDK-----GSLDQDETPFNMNP----SHPTYNITVTQVRVGTTLIDLEFT 320
G D G + GD G + P Y + + +RVG+ ++ L +
Sbjct: 219 GSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKS 278
Query: 321 A-----------LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDS-----KIPFEYC 364
+ DSGT FT+L+ P YT + F +Q + R VD + + C
Sbjct: 279 VFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLC 338
Query: 365 YDM--SPDANTSLLPSVSLAMGGGSHFAISDPVIII------STQSELVYCLSLVKSSGL 416
Y + + N S LP VSL M G+ ++S ++ S E VYC + S L
Sbjct: 339 YKVGSTTRPNFSGLPMVSL-MFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLL 397
Query: 417 NI----IGQNFMTGYRVVFDREKLILGW 440
I IG + + FD K +G+
Sbjct: 398 GIEAFVIGHHHQQNVWMEFDLAKSRVGF 425
>AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11934208-11935386 REVERSE LENGTH=392
Length = 392
Score = 68.2 bits (165), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 140/366 (38%), Gaps = 54/366 (14%)
Query: 96 LHYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPN 154
++ +Q+GTP + +DTGSDL W C CT C S A FD P+
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNC--------YSQYAPIFD-----PS 106
Query: 155 VSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDV-LHLTNDDNHH 213
SST K+ CN ++C Y + Y S + E V +H T+ +
Sbjct: 107 NSSTFKEKRCNG-------------NSCHYKIIYADTTYSKGTLATETVTIHSTSGE--- 150
Query: 214 GLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDG 273
V GCG S F + G+ GL S+ + + E S CF G
Sbjct: 151 PFVMPETTIGCGH-NSSWFKPTFS--GMVGLSWGPSSLITQMGGE--YPGLMSYCFASQG 205
Query: 274 IGRISFGDKGSLDQD---ETPFNMNPSHP-TYNITVTQVRVGTTLID--------LEFTA 321
+I+FG + D T + + P Y + + V VG T ++ LE
Sbjct: 206 TSKINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNI 265
Query: 322 LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSL 381
+ DSGT+ TY Y N+ R D CY + P +++
Sbjct: 266 IIDSGTTLTYF-PVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTD---TIDIFPVITM 321
Query: 382 AMGGGSHFAISDPVIIISTQSELVYCLSLVKSSGLN--IIGQNFMTGYRVVFDREKLILG 439
GG+ + + I T + +CL+++ ++ I G + V +D L++
Sbjct: 322 HFSGGADLVLDKYNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVS 381
Query: 440 WKKSDC 445
+ ++C
Sbjct: 382 FSPTNC 387
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 67.4 bits (163), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 148/364 (40%), Gaps = 44/364 (12%)
Query: 101 VQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNVSSTS 159
+ +GTP V + DTGSDL W C+ C C S F P SST
Sbjct: 90 ISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLF-------------DPKESSTY 136
Query: 160 KKVTCNNSLC--MHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVE 217
+KV+C++S C + C + C Y ++Y + + V+ V ++ L
Sbjct: 137 RKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSL-- 194
Query: 218 ANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF----GRDG 273
N+I GCG +G+F A +G+ GLG S+ S L + FS C G
Sbjct: 195 RNMIIGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETG 250
Query: 274 I-GRISFGDKGSLDQDE-TPFNMNPSHPT--YNITVTQVRVGTTLIDL--------EFTA 321
+ +I+FG G + D +M P Y + + + VG+ I E
Sbjct: 251 LTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNI 310
Query: 322 LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSL 381
+ DSGT+ T L Y + S ++ R I CY D+++ +P +++
Sbjct: 311 VIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGI-LSLCYR---DSSSFKVPDITV 366
Query: 382 AMGGGSHFAISDPVIIISTQSELVYCLSLVKSSGLNIIGQNFMTGYRVVFDREKLILGWK 441
GG + + ++ SE V C + + L I G + V +D + +K
Sbjct: 367 HFKGG-DVKLGNLNTFVAV-SEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFK 424
Query: 442 KSDC 445
K+DC
Sbjct: 425 KTDC 428
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 67.0 bits (162), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/386 (24%), Positives = 157/386 (40%), Gaps = 71/386 (18%)
Query: 106 PGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCN 165
P + +DTGS+L W+ C+ R + + ++FD P SS+ + C+
Sbjct: 82 PPQNISMVIDTGSELSWLRCN--RSSNPNP-------VNNFD-----PTRSSSYSPIPCS 127
Query: 166 NSLCMHRNQCLGTFSNCP------YMVSYVSAETSTSGILVEDVLHLTNDDNHHGLVEAN 219
+ C R + ++C +SY A +S+ G L ++ H N N ++N
Sbjct: 128 SPTCRTRTRDFLIPASCDSDKLCHATLSYADA-SSSEGNLAAEIFHFGNSTN-----DSN 181
Query: 220 VIFGC-GQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRIS 278
+IFGC G V + GL LGM + S+ S +S+ GF S+ + D G +
Sbjct: 182 LIFGCMGSVSGSDPEEDTKTTGL--LGMNRGSL-SFISQMGFPKFSYCISGTDDFPGFLL 238
Query: 279 FGDKGSLDQDETPFNMNP----SHP-------TYNITVTQVRVGTTLIDLEFTAL----- 322
GD TP N P S P Y + +T ++V L+ + + L
Sbjct: 239 LGDSNF--TWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHT 296
Query: 323 ------FDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDS-----KIPFEYCYDMSPDA 371
DSGT FT+L+ P YT + F ++ + + + CY +SP
Sbjct: 297 GAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVR 356
Query: 372 NTS----LLPSVSLAMGGGSHFAISDPVII----ISTQSELVYCLSLVKSSGLN----II 419
S LP+VSL G P++ ++ ++ VYC + S + +I
Sbjct: 357 IRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVI 416
Query: 420 GQNFMTGYRVVFDREKLILGWKKSDC 445
G + + FD ++ +G +C
Sbjct: 417 GHHHQQNMWIEFDLQRSRIGLAPVEC 442
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 67.0 bits (162), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 62/373 (16%)
Query: 102 QLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSVYSPNVSSTSK 160
+LGTP + LDT +D W+PC C+ C+ AS + S YS
Sbjct: 109 KLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSN------ASTSFNTNSSSTYS-------- 154
Query: 161 KVTCNNSLCMH-RNQCLGTFSNCPYMVSY---VSAETSTSGILVEDVLHLTNDDNHHGLV 216
V+C+ + C R + S P + S+ ++S S LV+D L L D V
Sbjct: 155 TVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPD------V 208
Query: 217 EANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDG--- 273
N FGC SG+ L P GL GLG +S+ S + + FS C
Sbjct: 209 IPNFSFGCINSASGNSL---PPQGLMGLGRGPMSLVSQTT--SLYSGVFSYCLPSFRSFY 263
Query: 274 -IGRISFGDKGSLDQDE-TPFNMNPSHPT-YNITVTQVRVGTTLID-----LEFTA---- 321
G + G G TP NP P+ Y + +T V VG+ + L F A
Sbjct: 264 FSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGA 323
Query: 322 --LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSV 379
+ DSGT T P Y + + F QV + F+ C+ N ++ P +
Sbjct: 324 GTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSS--FSTLGAFDTCFSAD---NENVAPKI 378
Query: 380 SLAMGGGSHFAISDPV--IIISTQSELVYCLSLV-----KSSGLNIIGQNFMTGYRVVFD 432
+L M + + P+ +I + + + CLS+ ++ LN+I R++FD
Sbjct: 379 TLHM---TSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFD 435
Query: 433 REKLILGWKKSDC 445
+G C
Sbjct: 436 VPNSRIGIAPEPC 448
>AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:11259872-11261209 REVERSE LENGTH=445
Length = 445
Score = 63.2 bits (152), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/396 (22%), Positives = 148/396 (37%), Gaps = 76/396 (19%)
Query: 92 SLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALASDFDLSV 150
S G ++ ++ +GTP K DTGSDL WV C C +C +S F
Sbjct: 80 SNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLF------------ 127
Query: 151 YSPNVSSTSKKVTCNNSLCM----HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHL 206
SST K +C++ C H C + C Y SY + + E +
Sbjct: 128 -DKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISID 186
Query: 207 TNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVP-SMLSREGFT-ADS 264
++ + +FGCG G+F + + + P S++S+ G +
Sbjct: 187 SSSGSSVSF--PGTVFGCGYNNGGTFEETGS------GIIGLGGGPLSLVSQLGSSIGKK 238
Query: 265 FSMCFGR-----DGIGRISFG----------DKGSLDQDETPFNMNPSHPTYNITVTQVR 309
FS C +G I+ G D +L TP Y +T+ V
Sbjct: 239 FSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATL---TTPLIQKDPETYYFLTLEAVT 295
Query: 310 VGTTLIDLEFTA----------------LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRR 353
VG T L +T + DSGT+ T L Y + + V +R
Sbjct: 296 VGKT--KLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKR 353
Query: 354 PVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGSHFAISD----PVIIISTQSELVYCLS 409
D + +C+ S D LP++++ HF +D P+ +E CLS
Sbjct: 354 VSDPQGLLTHCFK-SGDKEIG-LPAITM------HFTNADVKLSPINAFVKLNEDTVCLS 405
Query: 410 LVKSSGLNIIGQNFMTGYRVVFDREKLILGWKKSDC 445
++ ++ + I G + V +D E + +++ DC
Sbjct: 406 MIPTTEVAIYGNMVQMDFLVGYDLETKTVSFQRMDC 441
>AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:12033953-12037527 FORWARD LENGTH=756
Length = 756
Score = 62.8 bits (151), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 85/370 (22%), Positives = 143/370 (38%), Gaps = 58/370 (15%)
Query: 96 LHYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPN 154
++ +Q+GTP + + +DTGSD+ W C C C S A FD P+
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNC--------YSQFAPIFD-----PS 466
Query: 155 VSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHG 214
SST ++ CN ++C Y + Y +T + GIL + + + +
Sbjct: 467 KSSTFREQRCNG-------------NSCHYEIIYAD-KTYSKGILATETVTIPSTSGEP- 511
Query: 215 LVEANVIFGCG----QVQSGSFLDVAAPNGLFGLGMEKISVPSM--LSREGFTADSFSMC 268
V A GCG +Q F ++ +G+ GL M +S+ S L G S C
Sbjct: 512 FVMAETKIGCGLDNTNLQYSGF--ASSSSGIVGLNMGPLSLISQMDLPYPGL----ISYC 565
Query: 269 FGRDGIGRISFGDKGSLDQDET---PFNMNPSHPTYNITVTQVRVGTTLIDLEFTA---- 321
F G +I+FG + D T + +P Y + + V V LI T
Sbjct: 566 FSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAE 625
Query: 322 ----LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLP 377
DSGT+ TY Y N+ QV + D CY + P
Sbjct: 626 DGNIFIDSGTTLTYF-PMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYY---SDTIDIFP 681
Query: 378 SVSLAMGGGSHFAISDPVIIISTQSELVYCLSL--VKSSGLNIIGQNFMTGYRVVFDREK 435
+++ GG+ + + + T + ++CL++ S + G + V +D
Sbjct: 682 VITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSS 741
Query: 436 LILGWKKSDC 445
++ + ++C
Sbjct: 742 NVISFSPTNC 751
Score = 59.3 bits (142), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 85/363 (23%), Positives = 141/363 (38%), Gaps = 58/363 (15%)
Query: 96 LHYTTVQLGTPGVKFMVALDTGSDLFWVPC-DCTRCATTDSTAFASALASDFDLSVYSPN 154
++ +Q+GTP + +DTGSDL W C C C S FD ++ P+
Sbjct: 81 IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDC------------YSQFD-PIFDPS 127
Query: 155 VSSTSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTNDDNHHG 214
SST + +C G +C Y + Y T + GIL + + + + +
Sbjct: 128 KSST-----------FNEQRCHG--KSCHYEIIY-EDNTYSKGILATETVTI-HSTSGEP 172
Query: 215 LVEANVIFGCG----QVQSGSFLDVAAPNGLFGLGMEKISVPSM--LSREGFTADSFSMC 268
V A GCG + + F ++ +G+ GL M S+ S L G S C
Sbjct: 173 FVMAETTIGCGLHNTDLDNSGF--ASSSSGIVGLNMGPRSLISQMDLPYPGL----ISYC 226
Query: 269 FGRDGIGRISFGDKGSLDQDET---PFNMNPSHPTYNITVTQVRV--------GTTLIDL 317
F G +I+FG + D T + +P Y + + V V GT
Sbjct: 227 FSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAE 286
Query: 318 EFTALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLP 377
+ + DSG++ TY Y N+ QV R D CY + P
Sbjct: 287 DGNIVIDSGSTVTYF-PVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYF---SETIDIFP 342
Query: 378 SVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKSSGLN--IIGQNFMTGYRVVFDREK 435
+++ GG+ + + + + S ++CL+++ +S I G + V +D
Sbjct: 343 VITMHFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSS 402
Query: 436 LIL 438
L+L
Sbjct: 403 LLL 405
>AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24647221-24648513 FORWARD LENGTH=430
Length = 430
Score = 62.0 bits (149), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 87/393 (22%), Positives = 153/393 (38%), Gaps = 88/393 (22%)
Query: 100 TVQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALASDFDL-----SVYSPN 154
++ +GTP + LDTGS L W+ C + T+F +L+S F + P
Sbjct: 75 SLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPR 134
Query: 155 VSSTSKKVTCN-NSLCMHRNQCLGTFSNCPYMVSYVSAE-TSTSGILVEDVLHLTNDDNH 212
+ + +C+ N LC + SY A+ T G LV++ + +N +
Sbjct: 135 IPDFTLPTSCDSNRLCHY---------------SYFYADGTFAEGNLVKEKITFSNTE-- 177
Query: 213 HGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKIS-------------VPSMLSREG 259
+ +I GC S G+ G+ ++S +P +R G
Sbjct: 178 ---ITPPLILGCATESSDD-------RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPG 227
Query: 260 FT-ADSFSMCFGRDGIGRISFGDKGSLDQDETPFNMNPSHPTYNITVTQVRVGTTLIDLE 318
FT SF + + G + + + + N++P Y + + +R G +++
Sbjct: 228 FTPTGSFYLGDNPNSHG-FKYVSLLTFPESQRMPNLDPL--AYTVPMIGIRFGLKKLNIS 284
Query: 319 FTA-----------LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIPFEYCY-- 365
+ + DSG+ FT+LVD Y V ++V R + Y Y
Sbjct: 285 GSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKK-------GYVYGG 337
Query: 366 --DMSPDANTSLLPSVSLAMGGGSHFAISDPVIIISTQSEL-------VYCLSLVKSSGL 416
DM D N +++P + G F + V I+ + + ++C+ + +SS L
Sbjct: 338 TADMCFDGNVAMIPRLI----GDLVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSML 393
Query: 417 ----NIIGQNFMTGYRVVFDREKLILGWKKSDC 445
NIIG V FD +G+ K+DC
Sbjct: 394 GAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 60.8 bits (146), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 159/392 (40%), Gaps = 67/392 (17%)
Query: 86 STFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCD-CTRCATTDSTAFASALAS 144
S ++ SL ++ TV+LG + +V DTGSDL WV C C C
Sbjct: 126 SGIKLESLNYI--VTVELGGKNMSLIV--DTGSDLTWVQCQPCRSCYNQQG--------- 172
Query: 145 DFDLSVYSPNVSSTSKKVTCNNSLCM-------HRNQCLG----TFSNCPYMVSYVSAET 193
+Y P+VSS+ K V CN+S C + C G + C Y+VSY +
Sbjct: 173 ----PLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDG-S 227
Query: 194 STSGILVEDVLHLTNDDNHHGLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPS 253
T G L + + L + N +FGCG+ G F + S S
Sbjct: 228 YTRGDLASESILLGDTKLE------NFVFGCGRNNKGLFGGSSGLM------GLGRSSVS 275
Query: 254 MLSREGFTADS-FSMCFG--RDGI-GRISFGDKGSLDQDE-----TPFNMNPSHPT-YNI 303
++S+ T + FS C DG G +SFG+ S+ + TP NP + Y +
Sbjct: 276 LVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYIL 335
Query: 304 TVTQVRVGTTLIDLEFTA-----LFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSK 358
+T +G ++L+ ++ L DSGT T L Y V F Q
Sbjct: 336 NLTGASIGG--VELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYS 393
Query: 359 IPFEYCYDMSPDANTSLLPSVSLAMGGGSHFAISDPVI--IISTQSELVYCLSLVKSSGL 416
I + C++++ + S +P + + G + + + + + LV CL+L S
Sbjct: 394 I-LDTCFNLTSYEDIS-IPIIKMIFQGNAELEVDVTGVFYFVKPDASLV-CLALASLSYE 450
Query: 417 N---IIGQNFMTGYRVVFDREKLILGWKKSDC 445
N IIG RV++D + LG +C
Sbjct: 451 NEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 58.2 bits (139), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 106/476 (22%), Positives = 173/476 (36%), Gaps = 82/476 (17%)
Query: 11 FLFFFSVWGCCHGQIYTFTMH--HRYSEAVKKWSHSAAGAPEKGTFEYYSELADRDRFLR 68
F FFSV G F++ HR S ++ ++ + ++ RF
Sbjct: 9 FFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQIT-VTDRLNAAFLRSVSRSRRFNH 67
Query: 69 GRKLSQIGAGLAFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCD-C 127
+ + +GL +DG + ++ +GTP +K DTGSDL WV C C
Sbjct: 68 QLSQTDLQSGLIGADGE-----------FFMSITIGTPPIKVFAIADTGSDLTWVQCKPC 116
Query: 128 TRCATTDSTAFASALASDFDLSVYSPNVSSTSKKVTCNNSLCMH----RNQCLGTFSNCP 183
+C + ++ SST K C++ C C + + C
Sbjct: 117 QQCYKENG-------------PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICK 163
Query: 184 YMVSYVSAETSTSGILVEDVLHLTNDDNHHG--LVEANVIFGCGQVQSGSFLDVAAPNGL 241
Y SY S + E V + D+ G + +FGCG G+F + +
Sbjct: 164 YRYSYGDQSFSKGDVATETV----SIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIG 219
Query: 242 FGLGMEKISVPSMLSREGFTADSFSMCFGR-----DGIGRISFGDK---GSLDQD----E 289
LG +S+ S L + FS C +G I+ G SL +D
Sbjct: 220 --LGGGHLSLISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVS 275
Query: 290 TPFNMNPSHPTYNITVTQVRVGTTLI----------------DLEFTALFDSGTSFTYLV 333
TP Y +T+ + VG I + + DSGT+ T L
Sbjct: 276 TPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLE 335
Query: 334 DPPYTNVSESFHSQVQDRRRPVDSKIPFEYCYDMSPDANTSLLPSVSLAMGGGSHFAISD 393
+ S + V +R D + +C+ S A LP +++ HF +D
Sbjct: 336 AGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFK-SGSAEIG-LPEITV------HFTGAD 387
Query: 394 ----PVIIISTQSELVYCLSLVKSSGLNIIGQNFMTGYRVVFDREKLILGWKKSDC 445
P+ SE + CLS+V ++ + I G + V +D E + ++ DC
Sbjct: 388 VRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQHMDC 443
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 54.7 bits (130), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 147/379 (38%), Gaps = 78/379 (20%)
Query: 97 HYTTVQLGTPGVKFMVALDTGSDLFWVPCDCTRCATTDSTAFASALASDFDLSVYSPNVS 156
++ V +G+P F + LDTGSDL W+ C
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQC------------------------------- 198
Query: 157 STSKKVTCNNSLCMHRNQCLGTFSNCPYMVSYVSAETSTSGILVED-VLHLTNDDNHHGL 215
+ C + + NQ +CPY Y + +T VE ++LT + L
Sbjct: 199 -----LPCYDCFQQNDNQ------SCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSEL 247
Query: 216 VEA-NVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-GRDG 273
N++FGCG G F A L GLG +S S L + SFS C R+
Sbjct: 248 YNVENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNS 302
Query: 274 IGRIS----FG-DKGSLDQDETPF------NMNPSHPTYNITVTQVRVGTTLIDL-EFT- 320
+S FG DK L F N Y + + + V ++++ E T
Sbjct: 303 DTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETW 362
Query: 321 ---------ALFDSGTSFTYLVDPPYTNVSESFHSQVQDRRRPVDSKIP-FEYCYDMSPD 370
+ DSGT+ +Y +P Y + + + + PV P + C+++S
Sbjct: 363 NISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKY-PVYRDFPILDPCFNVSGI 421
Query: 371 ANTSLLPSVSLAMGGGSHFAISDPVIIISTQSELVYCLSLVKS--SGLNIIGQNFMTGYR 428
N L P + +A G+ + I +LV CL+++ + S +IIG +
Sbjct: 422 HNVQL-PELGIAFADGAVWNFPTENSFIWLNEDLV-CLAMLGTPKSAFSIIGNYQQQNFH 479
Query: 429 VVFDREKLILGWKKSDCFD 447
+++D ++ LG+ + C D
Sbjct: 480 ILYDTKRSRLGYAPTKCAD 498