Miyakogusa Predicted Gene
- Lj3g3v2061660.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2061660.1 Non Characterized Hit- tr|I3SAC8|I3SAC8_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2
SV=1,99.43,0,ASP_PROTEASE,Peptidase aspartic, active site;
Asp,Peptidase A1; Acid proteases,Peptidase aspartic; n,CUFF.43541.1
(492 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E value
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 511 e-145
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 308 4e-84
AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 242 4e-64
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 222 5e-58
AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family pr... 220 2e-57
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 215 6e-56
AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 211 1e-54
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 204 1e-52
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 202 4e-52
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 197 2e-50
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 183 2e-46
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 174 2e-43
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 163 2e-40
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 161 1e-39
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 160 2e-39
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 160 2e-39
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 153 3e-37
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 152 7e-37
AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family pr... 151 9e-37
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 148 8e-36
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 145 8e-35
AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 138 7e-33
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 135 7e-32
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 134 1e-31
AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family pr... 130 2e-30
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 129 4e-30
AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 129 6e-30
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 128 1e-29
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 127 3e-29
AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family pr... 126 4e-29
AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 123 2e-28
AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 118 9e-27
AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 116 3e-26
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 115 8e-26
AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 114 1e-25
AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 111 1e-24
AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 3e-24
AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 6e-24
AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 107 3e-23
AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family pr... 104 1e-22
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 104 1e-22
AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 103 3e-22
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 103 4e-22
AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family pr... 102 5e-22
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 96 7e-20
AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family pr... 95 1e-19
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 94 2e-19
AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 93 3e-19
AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 9e-19
AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family pr... 90 4e-18
AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 3e-17
AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family pr... 86 6e-17
AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 86 7e-17
AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family pr... 85 9e-17
AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family pr... 83 4e-16
AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family pr... 83 5e-16
AT5G24820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 81 1e-15
AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 4e-15
AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family pr... 76 6e-14
AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family pr... 74 3e-13
AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family pr... 70 2e-12
AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family pr... 70 3e-12
AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 70 4e-12
AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family pr... 69 5e-12
AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family pr... 69 7e-12
AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 3e-11
AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 8e-10
AT3G12700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 57 2e-08
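
The summary lines above share one layout: subject ID, a pipe-separated description, the bit score, and the E value. A minimal Python sketch for reading one such line follows. It assumes the whitespace-collapsed layout shown in this report; the names `Hit` and `parse_hit_line` are hypothetical and not part of BLAST. Legacy BLAST prints an E value of 1e-145 as `e-145`, so the sketch restores the leading `1` before converting.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'significant alignments' summary (hypothetical container)."""
    subject_id: str
    description: str
    bit_score: float
    e_value: float


# ID, a pipe, free-text description, bit score, then an E value that may be
# written as "e-145", "4e-84", or a plain decimal such as "0.0".
_HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|\s*(?P<desc>.*?)\s+"
    r"(?P<score>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>(?:\d+(?:\.\d+)?)?e-\d+|\d+(?:\.\d+)?)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    match = _HIT_LINE.match(line.strip())
    if match is None:
        return None
    raw_e = match.group("evalue")
    if raw_e.startswith("e"):
        # Legacy shorthand: "e-145" means 1e-145.
        raw_e = "1" + raw_e
    return Hit(
        subject_id=match.group("id"),
        description=match.group("desc"),
        bit_score=float(match.group("score")),
        e_value=float(raw_e),
    )


if __name__ == "__main__":
    sample = (
        "AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... "
        "511 e-145"
    )
    print(parse_hit_line(sample))
```

Lines that are not summary rows, such as the `Searching...done` progress line, return `None`, so the function can be applied to every line of a report and the non-matches filtered out.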
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 511 bits (1317), Expect = e-145, Method: Compositional matrix adjust.
Identities = 262/458 (57%), Positives = 336/458 (73%), Gaps = 9/458 (1%)
Query: 40 VSSFEEKKVFNLQ---ILQRKQQLGSLGCLHPESRQEKGAIILEMKDRSYCSKKKVNWHR 96
V +EKK+ ++ +K S C + + + LEMK R CS K ++ +
Sbjct: 26 VHGVDEKKILSVHNNIWSPKKSYEASTSCFSRSLGKGRESTTLEMKHRELCSGKTIDLGK 85
Query: 97 KLHNQLTLDDLHVRSMQNRLRKMVSSHSVE-VSQIQIPLASGVNFQTLNYIVTMELGGQN 155
K+ L LD++ V+S+Q +++ M SS + + VS+ QIPL SG+ ++LNYIVT+ELGG+N
Sbjct: 86 KMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKN 145
Query: 156 MTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGA 215
M++I+DTGSDLTWVQC+PC SCYNQQGP++ PS SSSY+++ CNSSTCQ L T N+G
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205
Query: 216 CESN----PSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSG 271
C N + C Y V+YGDGSYT G+L +E + G + NFVFGCG+NNKGLFGG SG
Sbjct: 206 CGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSSG 265
Query: 272 LMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVP 331
LMGLGRS++SL+SQT TF GVFSYCLP + GASGSL+ GN+SSV+ N T ++YT +V
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQ 325
Query: 332 NPQLSNFYMLNLTGIDVGGVAGQAPSFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFS 391
NPQL +FY+LNLTG +GGV ++ SFG G LIDSGTVITRL PS+YKA+K EFLKQFS
Sbjct: 326 NPQLRSFYILNLTGASIGGVELKSSSFGR-GILIDSGTVITRLPPSIYKAVKIEFLKQFS 384
Query: 392 GFPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLAL 451
GFP+APG+SILDTCFNLT E+++IP I M F+ N L VD TG+FY VK DAS VCLAL
Sbjct: 385 GFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLAL 444
Query: 452 ASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
ASLS E ++ IIGNYQQ+NQRVIYDT Q ++G GE+C
Sbjct: 445 ASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 308 bits (790), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 175/397 (44%), Positives = 240/397 (60%), Gaps = 18/397 (4%)
Query: 102 LTLDDLHVRSMQNRL-RKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELG--GQNMTV 158
L LD V S+ ++L +K+ + H E +P G + NYIVT+ LG ++++
Sbjct: 88 LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSL 147
Query: 159 IIDTGSDLTWVQCEPCM-SCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACE 217
I DTGSDLTW QC+PC+ +CY+Q+ P+F PS S+SY ++ C+S+ C SL TGNAG+C
Sbjct: 148 IFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCS 207
Query: 218 SNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFV-FGCGKNNKGLFGGVSGLMGLG 276
+ SNC Y + YGD S++ G L E + V + V FGCG+NN+GLF GV+GL+GLG
Sbjct: 208 A--SNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLG 265
Query: 277 RSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNES-SVFKNLTPIAYTRMVPNPQL 335
R LS SQT + + +FSYCLP + A +G L G+ S TPI+
Sbjct: 266 RDKLSFPSQTATAYNKIFSYCLP-SSASYTGHLTFGSAGISRSVKFTPISTIT-----DG 319
Query: 336 SNFYMLNLTGIDVGGVAGQAPS--FGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGF 393
++FY LN+ I VGG PS F G LIDSGTVITRL P Y AL++ F + S +
Sbjct: 320 TSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKY 379
Query: 394 PSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALAS 453
P+ G SILDTCF+L+G + V IP ++ +F V+ + + GIFY+ K SQVCLA A
Sbjct: 380 PTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFAG 437
Query: 454 LSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
SD+ + AI GN QQ+ V+YD +VGFA CS
Sbjct: 438 NSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
>AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3400671-3402165 REVERSE LENGTH=464
Length = 464
Score = 242 bits (618), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 159/402 (39%), Positives = 232/402 (57%), Gaps = 27/402 (6%)
Query: 96 RKLHNQLTL-DDLHVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELG-- 152
R H+++ D V S+ ++L K ++ E ++P SG+ + NYIVT+ +G
Sbjct: 82 RVDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTP 141
Query: 153 GQNMTVIIDTGSDLTWVQCEPCM-SCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTG 211
+++++ DTGSDLTW QCEPC+ SCY+Q+ P F PS+SS+YQ++ C+S C+
Sbjct: 142 KHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCE------- 194
Query: 212 NAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISV-SNFVFGCGKNNKGLFGGVS 270
+A +C + SNC Y++ YGD S+T G L E + V + FGCG+NN+GLF GV+
Sbjct: 195 DAESCSA--SNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVA 252
Query: 271 GLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNES-SVFKNLTPIAYTRM 329
GL+GLG LSL +QT +T+ +FSYCLP + ++G L G+ S TPI+
Sbjct: 253 GLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPIS---- 308
Query: 330 VPNPQLSNFYMLNLTGIDVGG--VAGQAPSFGNGGGLIDSGTVITRLAPSVYKALKAEFL 387
P N Y +++ GI VG +A SF G +IDSGTV TRL VY L++ F
Sbjct: 309 -SFPSAFN-YGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFK 366
Query: 388 KQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQV 447
++ S + S G+ + DTC++ TG + V PTI+ +F + V+ +D +GI +K SQV
Sbjct: 367 EKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIK--ISQV 424
Query: 448 CLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
CLA A D AI GN QQ V+YD +VGFA C
Sbjct: 425 CLAFAGNDDL--PAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 222 bits (565), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 145/403 (35%), Positives = 224/403 (55%), Gaps = 28/403 (6%)
Query: 97 KLHNQLTLDDLHVRSM-QNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELG--G 153
++ + +T DL + ++ + L+ + + ++ E I+ PL SG + Y + +G
Sbjct: 99 RVKSLITRLDLAINNISKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPA 158
Query: 154 QNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNA 213
+ + +++DTGSD+ W+QC PC CY+Q P+F+PS+SSSY+ + C++ C +L+++
Sbjct: 159 REVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVS---- 214
Query: 214 GACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSGLM 273
E + C Y V+YGDGSYT G+ E L+ G V N GCG +N+GLF G +GL+
Sbjct: 215 ---ECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLL 271
Query: 274 GLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNP 333
GLG L+L SQ N+T FSYCL D+ ++ ++ G S + P ++ N
Sbjct: 272 GLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVDFGTSLSPDAVVAP-----LLRNH 323
Query: 334 QLSNFYMLNLTGIDVGGVAGQAP--SF-----GNGGGLIDSGTVITRLAPSVYKALKAEF 386
QL FY L LTGI VGG Q P SF G+GG +IDSGT +TRL +Y +L+ F
Sbjct: 324 QLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSF 383
Query: 387 LKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQ 446
+K A G ++ DTC+NL+ V +PT++ +F +L + A + I +
Sbjct: 384 VKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKN-YMIPVDSVGT 442
Query: 447 VCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
CLA A + +AIIGN QQ+ RV +D S +GF+ C
Sbjct: 443 FCLAFAPTASS--LAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6978746-6980158 REVERSE LENGTH=470
Length = 470
Score = 220 bits (560), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 152/435 (34%), Positives = 225/435 (51%), Gaps = 31/435 (7%)
Query: 69 ESRQEKGAIILEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMV---SSHSV 125
+ K + L +DR + S N H +LH ++ D V ++ R+ V S
Sbjct: 53 DESSSKYTLRLLHRDR-FPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRY 111
Query: 126 EVSQIQIPLASGVNFQTLNYIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQQGP 183
EV+ + SG++ + Y V + +G ++ ++ID+GSD+ WVQC+PC CY Q P
Sbjct: 112 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDP 171
Query: 184 VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEH 243
VF P+ S SY + C SS C ++ N+G C S C Y V YGDGSYT G L E
Sbjct: 172 VFDPAKSGSYTGVSCGSSVCDRIE----NSG-CHSG--GCRYEVMYGDGSYTKGTLALET 224
Query: 244 LSFGGISVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDA 303
L+F V N GCG N+G+F G +GL+G+G ++S + Q + GG F YCL
Sbjct: 225 LTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGT 284
Query: 304 GASGSLAMGNESSVFKNLTPI--AYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPS---- 357
++GSL G E+ P+ ++ +V NP+ +FY + L G+ VGGV P
Sbjct: 285 DSTGSLVFGREA------LPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFD 338
Query: 358 ---FGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEV 414
G+GG ++D+GT +TRL + Y A + F Q + P A G SI DTC++L+G V
Sbjct: 339 LTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSV 398
Query: 415 NIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVI 474
+PT+S F + VL + A F + +D+ C A A + ++IIGN QQ +V
Sbjct: 399 RVPTVSFYFTEGPVLTLPARN-FLMPVDDSGTYCFAFA--ASPTGLSIIGNIQQEGIQVS 455
Query: 475 YDTKQSKVGFAGESC 489
+D VGF C
Sbjct: 456 FDGANGFVGFGPNVC 470
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 215 bits (547), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 148/422 (35%), Positives = 217/422 (51%), Gaps = 47/422 (11%)
Query: 98 LHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQIP----------------LASGVNFQ 141
L + T D+L +Q R++ S+ QIP + SG++
Sbjct: 82 LSSNKTPDELFSSRLQRDSRRV---KSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQG 138
Query: 142 TLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCN 199
+ Y + +G + + +++DTGSD+ W+QC PC CY+Q P+F P S +Y +IPC+
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198
Query: 200 SSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCG 259
S C+ L ++ C + C Y V+YGDGS+T G+ E L+F V GCG
Sbjct: 199 SPHCRRL-----DSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCG 253
Query: 260 KNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGAS-GSLAMGNESSVF 318
+N+GLF G +GL+GLG+ LS QT F FSYCL A + S+ GN +
Sbjct: 254 HDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAA--- 310
Query: 319 KNLTPIA-YTRMVPNPQLSNFYMLNLTGIDVGG--VAGQAPSF------GNGGGLIDSGT 369
++ IA +T ++ NP+L FY + L GI VGG V G S GNGG +IDSGT
Sbjct: 311 --VSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368
Query: 370 VITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVL 429
+TRL Y A++ F AP FS+ DTCF+L+ EV +PT+ ++F
Sbjct: 369 SVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG---A 425
Query: 430 NVDATGIFYIVKEDAS-QVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGES 488
+V Y++ D + + C A A ++IIGN QQ+ RV+YD S+VGFA
Sbjct: 426 DVSLPATNYLIPVDTNGKFCFAFAGTMG--GLSIIGNIQQQGFRVVYDLASSRVGFAPGG 483
Query: 489 CS 490
C+
Sbjct: 484 CA 485
>AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:22880074-22881525 REVERSE LENGTH=483
Length = 483
Score = 211 bits (536), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 142/368 (38%), Positives = 204/368 (55%), Gaps = 24/368 (6%)
Query: 136 SGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSY 193
SG++ + Y + + +G N+ +++DTGSD+ W+QC PC +CYNQ +F P S ++
Sbjct: 126 SGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTF 185
Query: 194 QSIPCNSSTCQSLQLTTGNAGACESNPSN-CSYAVNYGDGSYTNGELGAEHLSFGGISVS 252
++PC S C+ L ++ C + S C Y V+YGDGS+T G+ E L+F G V
Sbjct: 186 ATVPCGSRLCRRLD----DSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVD 241
Query: 253 NFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMG 312
+ GCG +N+GLF G +GL+GLGR LS SQT + + G FSYCL D +SGS +
Sbjct: 242 HVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCL--VDRTSSGSSSKP 299
Query: 313 NESSVFKNLT---PIAYTRMVPNPQLSNFYMLNLTGIDVGG--VAGQAPS------FGNG 361
+ VF N +T ++ NP+L FY L L GI VGG V G + S GNG
Sbjct: 300 PSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNG 359
Query: 362 GGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISM 421
G +IDSGT +TRL Y AL+ F + AP +S+ DTCF+L+G V +PT+
Sbjct: 360 GVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVF 419
Query: 422 NFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSK 481
+F V ++ A+ V + + C A A ++IIGN QQ+ RV YD S+
Sbjct: 420 HFGGGEV-SLPASNYLIPVNTEG-RFCFAFAGTMGS--LSIIGNIQQQGFRVAYDLVGSR 475
Query: 482 VGFAGESC 489
VGF +C
Sbjct: 476 VGFLSRAC 483
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 204 bits (518), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 147/485 (30%), Positives = 239/485 (49%), Gaps = 52/485 (10%)
Query: 36 IANGVSSFEEKKVFNLQILQRKQQLGSLGCLHPESRQE------KGAIILEMKDR-SYCS 88
+ + VSS ++ + IL SL PES + + LE+ R ++ +
Sbjct: 37 VLDVVSSLQQTQT----ILSLDPTRSSLTTTKPESLSDPVFFNSSSPLSLELHSRDTFVA 92
Query: 89 KKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVS-------------SHSVEVSQIQIPLA 135
+ ++ ++L D V + ++R V + + P+
Sbjct: 93 SQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVV 152
Query: 136 SGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSY 193
SG + + Y + +G + M +++DTGSD+ W+QCEPC CY Q PVF P++SS+Y
Sbjct: 153 SGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTY 212
Query: 194 QSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGI-SVS 252
+S+ C++ C L+ AC SN C Y V+YGDGS+T GEL + ++FG ++
Sbjct: 213 KSLTCSAPQCSLLE-----TSACRSN--KCLYQVSYGDGSFTVGELATDTVTFGNSGKIN 265
Query: 253 NFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMG 312
N GCG +N+GLF G +GL+GLG LS+ +Q +T FSYCL D+G S SL
Sbjct: 266 NVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVDRDSGKSSSLDFN 322
Query: 313 NESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAP-------SFGNGGGLI 365
+ + T ++ N ++ FY + L+G VGG P + G+GG ++
Sbjct: 323 SVQLGGGDAT----APLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378
Query: 366 DSGTVITRLAPSVYKALKAEFLKQFSGFPS-APGFSILDTCFNLTGNEEVNIPTISMNFE 424
D GT +TRL Y +L+ FLK + S+ DTC++ + V +PT++ +F
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFT 438
Query: 425 DNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGF 484
L++ A + I +D+ C A A S ++IIGN QQ+ R+ YD ++ +G
Sbjct: 439 GGKSLDLPAKN-YLIPVDDSGTFCFAFAPTSSS--LSIIGNVQQQGTRITYDLSKNVIGL 495
Query: 485 AGESC 489
+G C
Sbjct: 496 SGNKC 500
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 202 bits (514), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 149/413 (36%), Positives = 221/413 (53%), Gaps = 37/413 (8%)
Query: 110 RSMQNRLRKMVSSH-----SVEVS--QIQIPLASGVNFQTLNYIVTMELGG--QNMTVII 160
+ ++RK ++S + EVS ++ L SG+ + Y + + +G ++ ++I+
Sbjct: 118 KQKNEKVRKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLIL 177
Query: 161 DTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNP 220
DTGSDL W+QC PC C++Q G + P TS+S+++I CN C SL + CES+
Sbjct: 178 DTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPRC-SLISSPDPPVQCESDN 236
Query: 221 SNCSYAVNYGDGSYTNGELGAEHLSF------GGIS---VSNFVFGCGKNNKGLFGGVSG 271
+C Y YGD S T G+ E + GG S V N +FGCG N+GLF G SG
Sbjct: 237 QSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASG 296
Query: 272 LMGLGRSNLSLISQTNSTFGGVFSYCLPP--TDAGASGSLAMGNESSVFKNLTPIAYTRM 329
L+GLGR LS SQ S +G FSYCL ++ S L G + + N T + +T
Sbjct: 297 LLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLL-NHTNLNFTSF 355
Query: 330 VPNPQ--LSNFYMLNLTGIDVGGVAGQAP-------SFGNGGGLIDSGTVITRLAPSVYK 380
V + + FY + + I VGG A P S G+GG +IDSGT ++ A Y+
Sbjct: 356 VNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYE 415
Query: 381 ALKAEFLKQF-SGFPSAPGFSILDTCFNLTGNEEVNI--PTISMNFEDNVVLNVDATGIF 437
+K +F ++ +P F +LD CFN++G EE NI P + + F D V N A F
Sbjct: 416 IIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSF 475
Query: 438 YIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+ ED VCLA+ + + +IIGNYQQ+N ++YDTK+S++GF C+
Sbjct: 476 IWLSEDL--VCLAILG-TPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 525
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 197 bits (500), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 145/441 (32%), Positives = 226/441 (51%), Gaps = 35/441 (7%)
Query: 78 ILEMKDRSYCSKKKVNWHRKL---HNQLTLDDLHVRSMQNRLRKMVSSHSVE--VSQIQI 132
+LE++ R + + H+++ +NQ T+ ++ + + + SVE Q+
Sbjct: 100 VLELQIRDLTRIQTL--HKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVA 157
Query: 133 PLASGVNFQTLNYIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTS 190
L SG+ + Y + + +G ++ ++I+DTGSDL W+QC PC C+ Q G + P S
Sbjct: 158 TLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKAS 217
Query: 191 SSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSF---- 246
+SY++I CN C +L + C+S+ +C Y YGD S T G+ E +
Sbjct: 218 ASYKNITCNDQRC-NLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 276
Query: 247 -GGIS----VSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPP- 300
GG S V N +FGCG N+GLF G +GL+GLGR LS SQ S +G FSYCL
Sbjct: 277 NGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR 336
Query: 301 -TDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQ--LSNFYMLNLTGIDVGGVAGQAP- 356
+D S L G + + + + +T V + + FY + + I V G P
Sbjct: 337 NSDTNVSSKLIFGEDKDLLSHPN-LNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPE 395
Query: 357 ------SFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSG-FPSAPGFSILDTCFNLT 409
S G GG +IDSGT ++ A Y+ +K + ++ G +P F ILD CFN++
Sbjct: 396 ETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVS 455
Query: 410 GNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQR 469
G V +P + + F D V N F + ED VCLA+ + + +IIGNYQQ+
Sbjct: 456 GIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDL--VCLAMLG-TPKSAFSIIGNYQQQ 512
Query: 470 NQRVIYDTKQSKVGFAGESCS 490
N ++YDTK+S++G+A C+
Sbjct: 513 NFHILYDTKRSRLGYAPTKCA 533
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 183 bits (465), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 135/391 (34%), Positives = 198/391 (50%), Gaps = 39/391 (9%)
Query: 120 VSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQNM--TVIIDTGSDLTWVQCEPCMSC 177
V+S + + I+ P G +++ + +G + + I+DTGSDL W QC+PC C
Sbjct: 86 VASKPDDTNNIKAPTHGGSG----EFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTEC 141
Query: 178 YNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNG 237
++Q P+F P SSSY + C+S C +L + C + C Y YGD S T G
Sbjct: 142 FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSN-----CNEDKDACEYLYTYGDYSSTRG 196
Query: 238 ELGAEHLSFGGI-SVSNFVFGCGKNNKGL-FGGVSGLMGLGRSNLSLISQTNSTFGGVFS 295
L E +F S+S FGCG N+G F SGL+GLGR LSLISQ T FS
Sbjct: 197 LLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET---KFS 253
Query: 296 YCLPP-TDAGASGSLAMGNESSVFKNLTPIAYT-------RMVPNPQLSNFYMLNLTGID 347
YCL D+ AS SL +G+ +S N T + ++ NP +FY L L GI
Sbjct: 254 YCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGIT 313
Query: 348 VGG--VAGQAPSF-----GNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFS 400
VG ++ + +F G GG +IDSGT IT L + +K LK EF + S G +
Sbjct: 314 VGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGST 373
Query: 401 ILDTCFNL-TGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQV-CLALASLSDEY 458
LD CF L + + +P + +F+ +++ G Y+V + ++ V CLA+ S
Sbjct: 374 GLDLCFKLPDAAKNIAVPKMIFHFKG---ADLELPGENYMVADSSTGVLCLAMGS---SN 427
Query: 459 DIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
++I GN QQ+N V++D ++ V F C
Sbjct: 428 GMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 174 bits (440), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 208/402 (51%), Gaps = 37/402 (9%)
Query: 107 LHVRSMQ-NRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGG--QNMTVIIDTG 163
LH+ S +RL + S + + +P+ASG NY+V +LG Q M +++DT
Sbjct: 65 LHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTS 124
Query: 164 SDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTC-QSLQLTTGNAGACESNPSN 222
+D W+ C C C N ++SS+Y ++ C+++ C Q+ LT ++ PS
Sbjct: 125 NDAVWLPCSGCSGCSNASTSFNT-NSSSTYSTVSCSTAQCTQARGLTCPSS---SPQPSV 180
Query: 223 CSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSL 282
CS+ +YG S + L + L+ + NF FGC + G GLMGLGR +SL
Sbjct: 181 CSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSL 240
Query: 283 ISQTNSTFGGVFSYCLPPTDAGA-SGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYML 341
+SQT S + GVFSYCLP + SGSL +G + I YT ++ NP+ + Y +
Sbjct: 241 VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG----LLGQPKSIRYTPLLRNPRRPSLYYV 296
Query: 342 NLTGIDVGGVAGQAP---------SFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQ--F 390
NLTG+ VG V Q P + G +IDSGTVITR A VY+A++ EF KQ
Sbjct: 297 NLTGVSVGSV--QVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNV 354
Query: 391 SGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFED-NVVLNVDATGIFYIVKEDASQVCL 449
S F + F DTCF+ NE V P I+++ ++ L ++ T I + CL
Sbjct: 355 SSFSTLGAF---DTCFS-ADNENV-APKITLHMTSLDLKLPMENT---LIHSSAGTLTCL 406
Query: 450 ALASLSDEYD--IAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
++A + + + +I N QQ+N R+++D S++G A E C
Sbjct: 407 SMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 163 bits (413), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 122/389 (31%), Positives = 180/389 (46%), Gaps = 30/389 (7%)
Query: 114 NRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQNMTV--IIDTGSDLTWVQC 171
NR+ + QI + SG Y++ + +G + I DTGSDL W QC
Sbjct: 65 NRVFHFTEKDNTPQPQIDLTSNSG------EYLMNVSIGTPPFPIMAIADTGSDLLWTQC 118
Query: 172 EPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGD 231
PC CY Q P+F P TSS+Y+ + C+SS C +L+ N +C +N + CSY+++YGD
Sbjct: 119 APCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALE----NQASCSTNDNTCSYSLSYGD 174
Query: 232 GSYTNGELGAEHLSFGG-----ISVSNFVFGCGKNNKGLFG-GVSGLMGLGRSNLSLISQ 285
SYT G + + L+ G + + N + GCG NN G F SG++GLG +SLI Q
Sbjct: 175 NSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQ 234
Query: 286 TNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTG 345
+ G FSYCL P + + + ++ + + + T ++ FY L L
Sbjct: 235 LGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKS 294
Query: 346 IDVGGVAGQAPSFGNGGG----LIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSI 401
I VG Q + +IDSGT +T L Y L+ S
Sbjct: 295 ISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSG 354
Query: 402 LDTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIA 461
L C++ TG ++ +P I+M+F D + +D++ F V ED VC A +
Sbjct: 355 LSLCYSATG--DLKVPVITMHF-DGADVKLDSSNAFVQVSEDL--VCFAFRG---SPSFS 406
Query: 462 IIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
I GN Q N V YDT V F C+
Sbjct: 407 IYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 161 bits (407), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 129/398 (32%), Positives = 192/398 (48%), Gaps = 34/398 (8%)
Query: 110 RSMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLN---YIVTMELGGQNMTV--IIDTGS 164
+ M+N +R+ S +++ S S +F T N Y++ + +G + + I DTGS
Sbjct: 49 QRMRNAIRRSARS-TLQFSNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIADTGS 107
Query: 165 DLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCS 224
DL W QC PC CY Q P+F P SS+Y+ + C+SS C++L+ +C ++ + CS
Sbjct: 108 DLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQCRALE-----DASCSTDENTCS 162
Query: 225 YAVNYGDGSYTNGELGAEHLSFGG-----ISVSNFVFGCGKNNKGLFG-GVSGLMGLGRS 278
Y + YGD SYT G++ + ++ G +S+ N + GCG N G F SG++GLG
Sbjct: 163 YTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGG 222
Query: 279 NLSLISQTNSTFGGVFSYCLPP--TDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLS 336
+ SL+SQ + G FSYCL P ++ G + + G V + + T MV +
Sbjct: 223 STSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNGIVSGD--GVVSTSMV-KKDPA 279
Query: 337 NFYMLNLTGIDVGGVAGQAPS--FGNGGG--LIDSGTVITRLAPSVYKALKAEFLKQFSG 392
+Y LNL I VG Q S FG G G +IDSGT +T L + Y L++
Sbjct: 280 TYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKA 339
Query: 393 FPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALA 452
IL C+ + +P I+++F+ V + F V ED S C A A
Sbjct: 340 ERVQDPDGILSLCYR--DSSSFKVPDITVHFKGGDV-KLGNLNTFVAVSEDVS--CFAFA 394
Query: 453 SLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+ + I GN Q N V YDT V F CS
Sbjct: 395 A---NEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 160 bits (405), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 133/441 (30%), Positives = 208/441 (47%), Gaps = 71/441 (16%)
Query: 78 ILEMKDRSYCSKKKVNWHRKL---HNQLTLDDLHVRSMQNRLRKMVSSHSVE--VSQIQI 132
+LE++ R + + H+++ +NQ T+ ++ + + + SVE Q+
Sbjct: 100 VLELQIRDLTRIQTL--HKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVA 157
Query: 133 PLASGVNFQTLNYIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTS 190
L SG+ + Y + + +G ++ ++I+DTGSDL W+QC PC C+ Q
Sbjct: 158 TLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQ---------- 207
Query: 191 SSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSF---- 246
++ +C Y YGD S T G+ E +
Sbjct: 208 ---------------------------NDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 240
Query: 247 -GGIS----VSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPP- 300
GG S V N +FGCG N+GLF G +GL+GLGR LS SQ S +G FSYCL
Sbjct: 241 NGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR 300
Query: 301 -TDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQ--LSNFYMLNLTGIDVGGVAGQAP- 356
+D S L G + + + + +T V + + FY + + I V G P
Sbjct: 301 NSDTNVSSKLIFGEDKDLLSHPN-LNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPE 359
Query: 357 ------SFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSG-FPSAPGFSILDTCFNLT 409
S G GG +IDSGT ++ A Y+ +K + ++ G +P F ILD CFN++
Sbjct: 360 ETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVS 419
Query: 410 GNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQR 469
G V +P + + F D V N F + ED VCLA+ + + +IIGNYQQ+
Sbjct: 420 GIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDL--VCLAMLG-TPKSAFSIIGNYQQQ 476
Query: 470 NQRVIYDTKQSKVGFAGESCS 490
N ++YDTK+S++G+A C+
Sbjct: 477 NFHILYDTKRSRLGYAPTKCA 497
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 160 bits (404), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 130/427 (30%), Positives = 198/427 (46%), Gaps = 60/427 (14%)
Query: 85 SYCS--KKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNF-Q 141
S CS K V+W L +Q++ R + S V + +P+ASG Q
Sbjct: 38 SLCSPFKTSVSWADTL-------------LQDKARFLYLSSLAGVRKSSVPIASGRAIVQ 84
Query: 142 TLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCN 199
+ YIV +G Q M V +DT +D W+ C C+ C + +F PS SSS +++ C
Sbjct: 85 SPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCE 142
Query: 200 SSTCQSLQLTTGNAGACESNPS-----NCSYAVNYGDGSYTNGELGAEHLSFGGISVSNF 254
+ C+ NPS +C + + YG GS L + L+ + N+
Sbjct: 143 APQCKQ-----------APNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDVIPNY 190
Query: 255 VFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGA-SGSLAMGN 313
FGC G GLMGLGR LSLISQ+ + + FSYCLP + + SGSL +G
Sbjct: 191 TFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGP 250
Query: 314 ESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSFG-------NGGGLID 366
++ + I T ++ NP+ S+ Y +NL GI VG P+ G + D
Sbjct: 251 KNQPIR----IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFD 306
Query: 367 SGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFED- 425
SGTV TRL Y A++ EF ++ +A DTC+ + V P+++ F
Sbjct: 307 SGTVYTRLVEPAYVAVRNEFRRRVKN-ANATSLGGFDTCY----SGSVVFPSVTFMFAGM 361
Query: 426 NVVLNVDATGIFYIVKEDASQVCLALASLSDEYD--IAIIGNYQQRNQRVIYDTKQSKVG 483
NV L D I + CLA+A+ + + +I + QQ+N RV+ D S++G
Sbjct: 362 NVTLPPDN---LLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLG 418
Query: 484 FAGESCS 490
+ E+C+
Sbjct: 419 ISRETCT 425
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 153 bits (386), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 134/418 (32%), Positives = 200/418 (47%), Gaps = 45/418 (10%)
Query: 101 QLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQNMTV-- 158
Q+T+ D R LR + S Q L SG+ + +++ +G + V
Sbjct: 44 QITVTD---RLNAAFLRSVSRSRRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFA 100
Query: 159 IIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACES 218
I DTGSDLTWVQC+PC CY + GP+F SS+Y+S PC+S CQ+L T C+
Sbjct: 101 IADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSST---ERGCDE 157
Query: 219 NPSNCSYAVNYGDGSYTNGELGAEHLSFGG-----ISVSNFVFGCGKNNKGLFGGV-SGL 272
+ + C Y +YGD S++ G++ E +S +S VFGCG NN G F SG+
Sbjct: 158 SNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGI 217
Query: 273 MGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGS--LAMGNES--SVFKNLTPIAYTR 328
+GLG +LSLISQ S+ FSYCL A +G+ + +G S S + + T
Sbjct: 218 IGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTP 277
Query: 329 MVPNPQLSNFYMLNLTGIDVGGVAGQAPSFG--------------NGGGLIDSGTVITRL 374
+V L+ +Y L L I VG + P G +G +IDSGT +T L
Sbjct: 278 LVDKEPLT-YYYLTLEAISVG--KKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLL 334
Query: 375 APSVYKALKAEFLKQFSGFP--SAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVD 432
+ + + +G S P +L CF +G+ E+ +P I+++F +V
Sbjct: 335 EAGFFDKFSSAVEESVTGAKRVSDPQ-GLLSHCFK-SGSAEIGLPEITVHFTG---ADVR 389
Query: 433 ATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+ I VK VCL++ + ++AI GN+ Q + V YD + V F CS
Sbjct: 390 LSPINAFVKLSEDMVCLSMVPTT---EVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 152 bits (383), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 124/417 (29%), Positives = 187/417 (44%), Gaps = 51/417 (12%)
Query: 111 SMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTW 168
++ R +S + ++ P+ SG + Y V + +G Q++ +I DTGSDL W
Sbjct: 50 ALDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVW 109
Query: 169 VQCEPCMSC-YNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNP--SNCSY 225
V+C C +C ++ VF P SS++ C C+ L A C S C Y
Sbjct: 110 VKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCR-LVPKPDRAPICNHTRIHSTCHY 168
Query: 226 AVNYGDGSYTNGELGAEHLSFGGIS-----VSNFVFGCGKNNKGL------FGGVSGLMG 274
Y DGS T+G E S S + + FGCG G F G +G+MG
Sbjct: 169 EYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMG 228
Query: 275 LGRSNLSLISQTNSTFGGVFSYCL-------PPTDAGASGSLAMGNESSVFKNLTPIAYT 327
LGR +S SQ FG FSYCL PPT L +GN ++ + +T
Sbjct: 229 LGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSY-----LIIGNGGD---GISKLFFT 280
Query: 328 RMVPNPQLSNFYMLNLTGIDVGGVAGQA-PSF------GNGGGLIDSGTVITRLAPSVYK 380
++ NP FY + L + V G + PS GNGG ++DSGT + LA Y+
Sbjct: 281 PLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYR 340
Query: 381 ALKAEFLKQFSGFPSA----PGFSILDTCFNLTG--NEEVNIPTISMNFEDNVVLNVDAT 434
++ A ++ P A PGF D C N++G E +P + F V V
Sbjct: 341 SVIAAVRRRVK-LPIADALTPGF---DLCVNVSGVTKPEKILPRLKFEFSGGAVF-VPPP 395
Query: 435 GIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCSF 491
++I E+ Q CLA+ S+ + ++IGN Q+ +D +S++GF+ C+
Sbjct: 396 RNYFIETEEQIQ-CLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCAL 451
>AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:11259872-11261209 REVERSE LENGTH=445
Length = 445
Score = 151 bits (382), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 135/448 (30%), Positives = 208/448 (46%), Gaps = 54/448 (12%)
Query: 70 SRQEKGAIILEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQ 129
S + + +E+ R N H + ++L + +RS+ +R R+ +
Sbjct: 22 SSANRENLTVELIHRDSPHSPLYNPHHTVSDRL--NAAFLRSI-SRSRRFTT-------- 70
Query: 130 IQIPLASGVNFQTLNYIVTMELGGQNMTV--IIDTGSDLTWVQCEPCMSCYNQQGPVFKP 187
+ L SG+ Y +++ +G V I DTGSDLTWVQC+PC CY Q P+F
Sbjct: 71 -KTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDK 129
Query: 188 STSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFG 247
SS+Y++ C+S TCQ+L + + C+ + C Y +YGD S+T G++ E +S
Sbjct: 130 KKSSTYKTESCDSKTCQAL---SEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISID 186
Query: 248 GISVSNF-----VFGCGKNNKGLFGGVSGLMGLGRSN-LSLISQTNSTFGGVFSYCLPPT 301
S S+ VFGCG NN G F + LSL+SQ S+ G FSYCL T
Sbjct: 187 SSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHT 246
Query: 302 DAGASGSLAMG-NESSVFKNLTPIAYTRMVP----NPQLSNFYMLNLTGIDVGGVAGQAP 356
A +G+ + +S+ N + + T P +P+ +Y L L + VG + P
Sbjct: 247 AATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPE--TYYFLTLEAVTVGKT--KLP 302
Query: 357 SFGNGGGL------------IDSGTVITRLAPSVYKALKAEFLKQFSGFP--SAPGFSIL 402
G G GL IDSGT +T L Y + +G S P +L
Sbjct: 303 YTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQ-GLL 361
Query: 403 DTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAI 462
CF +G++E+ +P I+M+F + +V + I VK + VCL++ + ++AI
Sbjct: 362 THCFK-SGDKEIGLPAITMHFTN---ADVKLSPINAFVKLNEDTVCLSMIPTT---EVAI 414
Query: 463 IGNYQQRNQRVIYDTKQSKVGFAGESCS 490
GN Q + V YD + V F CS
Sbjct: 415 YGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 148 bits (374), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 139/458 (30%), Positives = 216/458 (47%), Gaps = 64/458 (13%)
Query: 62 SLGCLHPESRQEK----GAIILEMKDRSYCSKKK----VNWHRKLHNQLTLDDLHVRSMQ 113
+LG HP K G+ + S CS K ++W ++ L D ++
Sbjct: 33 ALGLNHPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQ--- 89
Query: 114 NRLRKMVSSHSVEVSQIQIPLASGVN-FQTLNYIVTMELG--GQNMTVIIDTGSDLTWVQ 170
L +V+ SV +P+ASG Q+ YIV +G Q + + +DT SD+ W+
Sbjct: 90 -YLSSLVAGRSV------VPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIP 142
Query: 171 CEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYG 230
C C+ C + F P+ S+S++++ C++ C+ + T A A CS+ + YG
Sbjct: 143 CSGCVGCPSNTA--FSPAKSTSFKNVSCSAPQCKQVPNPTCGARA-------CSFNLTYG 193
Query: 231 DGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGV----SGLMGLGRSNLSLISQT 286
S L + + + F FGC NK GG GL+GLGR LSL+SQ
Sbjct: 194 SSSIA-ANLSQDTIRLAADPIKAFTFGCV--NKVAGGGTIPPPQGLLGLGRGPLSLMSQA 250
Query: 287 NSTFGGVFSYCLPPTDAGA-SGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTG 345
S + FSYCLP + SGSL +G S + + YT+++ NP+ S+ Y +NL
Sbjct: 251 QSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQR----VKYTQLLRNPRRSSLYYVNLVA 306
Query: 346 IDVGGVAGQAPSFG-------NGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPG 398
I VG P G + DSGTV TRLA VY+A++ EF K+ P+
Sbjct: 307 IRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVK--PTTAV 364
Query: 399 FSIL---DTCFNLTGNEEVNIPTISMNFED-NVVLNVDATGIFYIVKEDASQVCLALASL 454
+ L DTC+ + +V +PTI+ F+ N+ + D + S CLA+A+
Sbjct: 365 VTSLGGFDTCY----SGQVKVPTITFMFKGVNMTMPADN---LMLHSTAGSTSCLAMAAA 417
Query: 455 SDEYD--IAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+ + + +I + QQ+N RV+ D ++G A E CS
Sbjct: 418 PENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 145 bits (365), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 118/403 (29%), Positives = 193/403 (47%), Gaps = 58/403 (14%)
Query: 123 HSVEVSQIQIPLASGVNFQTLN-YIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYN 179
H+ ++ I +PL ++ Y ++LG + V +DTGSD+ WV C PC C
Sbjct: 55 HARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPV 114
Query: 180 QQG-----PVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSY 234
+ ++ TSS+ +++ C C S + + GA + CSY V YGDGS
Sbjct: 115 KTDLGIPLSLYDSKTSSTSKNVGCEDDFC-SFIMQSETCGAKKP----CSYHVVYGDGST 169
Query: 235 TNGELGAEHLSFGGIS--------VSNFVFGCGKNNKGLFG----GVSGLMGLGRSNLSL 282
++G+ ++++ ++ VFGCGKN G G V G+MG G+SN S+
Sbjct: 170 SDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSI 229
Query: 283 ISQTNSTFGG----VFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNF 338
ISQ + GG +FS+CL + G G A+G S TPI VPN
Sbjct: 230 ISQLAA--GGSTKRIFSHCLDNMNGG--GIFAVGEVESPVVKTTPI-----VPN---QVH 277
Query: 339 YMLNLTGIDVGGVAGQAP-----SFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGF 393
Y + L G+DV G P + G+GG +IDSGT + L ++Y +L +++ +
Sbjct: 278 YNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSL----IEKITAK 333
Query: 394 PSAPGFSILDT--CFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLAL 451
+ +T CF+ T N + P ++++FED++ L+V + ++ED C
Sbjct: 334 QQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED--MYCFGW 391
Query: 452 AS----LSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
S D D+ ++G+ N+ V+YD + +G+A +CS
Sbjct: 392 QSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 434
>AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11936203-11937390 REVERSE LENGTH=395
Length = 395
Score = 138 bits (348), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 121/401 (30%), Positives = 180/401 (44%), Gaps = 51/401 (12%)
Query: 102 LTLDDLHVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQ--NMTVI 159
T+D +H RS SS V +Q+ P A V F T Y++ +++G + +
Sbjct: 30 FTIDLIHRRSN-------ASSSRVFNTQLGSPYADTV-FDTYEYLMKLQIGTPPFEIEAV 81
Query: 160 IDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESN 219
+DTGS+ W QC PC+ CYNQ P+F PS SS+++ I C+++
Sbjct: 82 LDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI------------------RCDTH 123
Query: 220 PSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFV-----FGCGKNNKGLFGGVSGLMG 274
+C Y + YG SYT G L E ++ S FV GCG+NN G G +G++G
Sbjct: 124 DHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKPGFAGVVG 183
Query: 275 LGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQ 334
L R SLI+Q + G+ SYC AG S +++ ++ T V +
Sbjct: 184 LDRGPKSLITQMGGEYPGLMSYCF----AGKGTSKINFGANAIVAGDGVVSTTVFVKTAK 239
Query: 335 LSNFYMLNLTGIDVGG----VAGQAPSFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQF 390
FY LNL + VG G G +IDSG+ +T P Y L + ++Q
Sbjct: 240 -PGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYF-PESYCNLVRKAVEQV 297
Query: 391 SGFPSAPGFSILDTCFNLTGNEEVNI-PTISMNFEDNVVLNVDATGIFYIVKEDASQVCL 449
P IL C+ ++ ++I P I+M+F L +D + Y+ CL
Sbjct: 298 VTAVRFPRSDIL--CYY---SKTIDIFPVITMHFSGGADLVLDKYNM-YVASNTGGVFCL 351
Query: 450 ALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
A+ S + AI GN Q N V YD+ V F +CS
Sbjct: 352 AIICNS-PIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 135 bits (340), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 175/374 (46%), Gaps = 47/374 (12%)
Query: 147 VTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQ 204
VT+ +G QN+++++DTGS+L+W+ C+ + G VF P +SS+Y +PC+S C+
Sbjct: 67 VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICR 122
Query: 205 SLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGK---- 260
+ +C+ C A++Y D + G L E G ++ +FGC
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182
Query: 261 NNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKN 320
+N +GLMG+ R +LS ++Q + FSYC+ +D +SG L +G+ S +
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSD--SSGFLLLGDAS--YSW 235
Query: 321 LTPIAYTRMV----PNPQLSNF-YMLNLTGIDVGGVAGQAPS-------FGNGGGLIDSG 368
L PI YT +V P P Y + L GI VG P G G ++DSG
Sbjct: 236 LGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 295
Query: 369 TVITRLAPSVYKALKAEFLKQFSG---FPSAPGF---SILDTCFNLTGNEEVN---IPTI 419
T T L VY ALK EF+ Q P F +D C+ + N +P +
Sbjct: 296 TQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMV 355
Query: 420 SMNFEDNVVLNVDATGIFYIVKEDASQ-----VCLALASLSDEYDIA--IIGNYQQRNQR 472
S+ F ++V + Y V S+ C + SD I +IG++ Q+N
Sbjct: 356 SLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGN-SDLLGIEAFVIGHHHQQNVW 413
Query: 473 VIYDTKQSKVGFAG 486
+ +D +S+VGFAG
Sbjct: 414 MEFDLAKSRVGFAG 427
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 134 bits (338), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 117/372 (31%), Positives = 180/372 (48%), Gaps = 49/372 (13%)
Query: 154 QNMTVIIDTGSDLTWVQCEPCMSCYNQQGPV--FKPSTSSSYQSIPCNSSTCQSLQLTTG 211
QN++++IDTGS+L+W++C + PV F P+ SSSY IPC+S TC++
Sbjct: 84 QNISMVIDTGSELSWLRCNRS----SNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139
Query: 212 NAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFG-GISVSNFVFGCGKNNKG----LF 266
+C+S+ C ++Y D S + G L AE FG + SN +FGC + G
Sbjct: 140 IPASCDSD-KLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEED 198
Query: 267 GGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAY 326
+GL+G+ R +LS ISQ FSYC+ TD G L +G+ S F LTP+ Y
Sbjct: 199 TKTTGLLGMNRGSLSFISQMGFP---KFSYCISGTD-DFPGFLLLGD--SNFTWLTPLNY 252
Query: 327 TRMV----PNPQLSNF-YMLNLTGIDVGGVAGQAPS-------FGNGGGLIDSGTVITRL 374
T ++ P P Y + LTGI V G P G G ++DSGT T L
Sbjct: 253 TPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFL 312
Query: 375 APSVYKALKAEFLKQFSGFPSA---PGF---SILDTCFNLTGNEEVN-----IPTISMNF 423
VY AL++ FL + +G + P F +D C+ ++ + +PT+S+ F
Sbjct: 313 LGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVF 372
Query: 424 EDNVVLNVDATGIFYIVKE----DASQVCLALASLSD--EYDIAIIGNYQQRNQRVIYDT 477
E + V + Y V + S C + SD + +IG++ Q+N + +D
Sbjct: 373 E-GAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGN-SDLMGMEAYVIGHHHQQNMWIEFDL 430
Query: 478 KQSKVGFAGESC 489
++S++G A C
Sbjct: 431 QRSRIGLAPVEC 442
>AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24230963-24233349 REVERSE LENGTH=475
Length = 475
Score = 130 bits (328), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 112/413 (27%), Positives = 185/413 (44%), Gaps = 57/413 (13%)
Query: 108 HVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLN-YIVTMELGG--QNMTVIIDTGS 164
H +S R HS ++ I +PL ++ Y ++LG + V +DTGS
Sbjct: 42 HFKSHDTR------RHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGS 95
Query: 165 DLTWVQCEPCMSC-----YNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESN 219
D+ W+ C+PC C N + +F + SS+ + + C+ C + + +C+
Sbjct: 96 DILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFIS----QSDSCQ-- 149
Query: 220 PS-NCSYAVNYGDGSYTNGELGAEHLSFGGISVS--------NFVFGCGKNNKGLFG--- 267
P+ CSY + Y D S ++G+ + L+ ++ VFGCG + G G
Sbjct: 150 PALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGD 209
Query: 268 -GVSGLMGLGRSNLSLISQTNSTFGG--VFSYCLPPTDAGASGSLAMGNESSVFKNLTPI 324
V G+MG G+SN S++SQ +T VFS+CL G G A+G S TP
Sbjct: 210 SAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG--GIFAVGVVDSPKVKTTP- 266
Query: 325 AYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPS--FGNGGGLIDSGTVITRLAPSVYKAL 382
MVPN Y + L G+DV G + P NGG ++DSGT + +Y +L
Sbjct: 267 ----MVPNQM---HYNVMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSL 319
Query: 383 KAEFLKQFSGFPSAPGFSILDT---CFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYI 439
L + I++ CF+ + N + P +S FED+V L V +
Sbjct: 320 IETILAR-----QPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFT 374
Query: 440 VKEDASQVCLALASLS--DEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
++E+ L+ + ++ ++G+ N+ V+YD +G+A +CS
Sbjct: 375 LEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 129 bits (325), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 171/387 (44%), Gaps = 42/387 (10%)
Query: 130 IQIPLASGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGP---- 183
+++ L SG+++ T Y + +G + V++DTGS+LTWV C Y +G
Sbjct: 91 VKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCR-----YRARGKDNRR 145
Query: 184 VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEH 243
VF+ S S++++ C + TC+ + + C + + CSY Y DGS G E
Sbjct: 146 VFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKET 205
Query: 244 LSFG-----GISVSNFVFGCGKNNKGL-FGGVSGLMGLGRSNLSLISQTNSTFGGVFSYC 297
++ G + + GC + G F G G++GL S+ S S S +G FSYC
Sbjct: 206 ITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYC 265
Query: 298 LPP--TDAGASGSLAMGNESS---VFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVA 352
L ++ S L G+ S F+ TP+ TR+ P FY +N+ GI +G
Sbjct: 266 LVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPP------FYAINVIGISLGYDM 319
Query: 353 GQAPS-----FGNGGGLIDSGTVITRLAPSVYKALK---AEFLKQFSGFPSAPGFSILDT 404
PS GG ++DSGT +T LA + YK + A +L + P ++
Sbjct: 320 LDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRV--KPEGVPIEY 377
Query: 405 CFNLTGNEEVN-IPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAII 463
CF+ T V+ +P ++ + + Y+V CL S +I
Sbjct: 378 CFSFTSGFNVSKLPQLTFHLKGGARFEPHRKS--YLVDAAPGVKCLGFVSAGTP-ATNVI 434
Query: 464 GNYQQRNQRVIYDTKQSKVGFAGESCS 490
GN Q+N +D S + FA +C+
Sbjct: 435 GNIMQQNYLWEFDLMASTLSFAPSACT 461
>AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14682210-14683484 REVERSE LENGTH=424
Length = 424
Score = 129 bits (323), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 166/360 (46%), Gaps = 61/360 (16%)
Query: 158 VIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACE 217
++IDTGSDLTW+ C PC CY Q P F PS SS+Y++ C S+ Q+ +
Sbjct: 93 LLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAPHAMPQIFR------D 145
Query: 218 SNPSNCSYAVNYGDGSYTNGELGAEHLSF-----GGISVSNFVFGCGKNNKGLFGGVSGL 272
NC Y + Y D S T G L E L+F G IS N VFGCG++N G F SG+
Sbjct: 146 EKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNSG-FTKYSGV 204
Query: 273 MGLGRSNLSLISQTNSTFGGVFSYCL----PPTDAGASGSLAMGNESSVFKNLTPIAYTR 328
+GLG S++++ FG FSYC PT L +GN + + + TP+
Sbjct: 205 LGLGPGTFSIVTR---NFGSKFSYCFGSLTNPT--YPHNILILGNGAKIEGDPTPLQI-- 257
Query: 329 MVPNPQLSNFYMLNLTGIDVGG--VAGQAPSF----GNGGGLIDSGTVITRLAPSVYKAL 382
+ Y L+L I G + + +F GG +ID+G T LA Y+ L
Sbjct: 258 ------FQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVIDTGCSPTILAREAYETL 311
Query: 383 KAEF----------LKQFSGFPSAPGFSILDTCFNLTGNEEVNI---PTISMNFEDNVVL 429
E +K + + + C+ GN ++++ P ++ +F L
Sbjct: 312 SEEIDFLLGEVLRRVKDWDQYTTP--------CYE--GNLKLDLYGFPVVTFHFAGGAEL 361
Query: 430 NVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
+D +F + E CLA+ +++ D+++IG Q+N V Y+ + KV F C
Sbjct: 362 ALDVESLF-VSSESGDSFCLAM-TMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 128 bits (321), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/429 (28%), Positives = 187/429 (43%), Gaps = 62/429 (14%)
Query: 108 HVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQNMTV--IIDTGSD 165
H S++ + S+ + + ++ PL++ Y V++ G + T+ + DTGS
Sbjct: 56 HGTSIKPDEDALSSTTTASATVVKSPLSAK---SYGGYSVSLSFGTPSQTIPFVFDTGSS 112
Query: 166 LTWVQCEPCMSCYNQQG-----------PVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAG 214
L W+ PC S Y G P F P SSS + I C S CQ L
Sbjct: 113 LVWL---PCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCR 169
Query: 215 ACESNPSNCS-----YAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGV 269
C+ N NC+ Y + YG GS T G L E L F ++V +FV GC +
Sbjct: 170 GCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDFPDLTVPDFVVGCSIISTRQ---P 225
Query: 270 SGLMGLGRSNLSLISQTNSTFGGVFSYCL-----PPTDAGASGSLAMGNESSVFKNLTPI 324
+G+ G GR +SL SQ N FS+CL T+ L G+ + +
Sbjct: 226 AGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGL 282
Query: 325 AYTRMVPNPQLSN-----FYMLNLTGIDVGGVAGQAP-------SFGNGGGLIDSGTVIT 372
YT NP +SN +Y LNL I VG + P + G+GG ++DSG+ T
Sbjct: 283 TYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFT 342
Query: 373 RLAPSVYKALKAEFLKQFSGFPSAPGFSI---LDTCFNLTGNEEVNIPTISMNFEDNVVL 429
+ V++ + EF Q S + L CFN++G +V +P + F+ L
Sbjct: 343 FMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKL 402
Query: 430 NVDATGIFYIVKEDASQVCLALASLSDEY--------DIAIIGNYQQRNQRVIYDTKQSK 481
+ + F V + VCL + +SD+ I+G++QQ+N V YD + +
Sbjct: 403 ELPLSNYFTFVG-NTDTVCLTV--VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDR 459
Query: 482 VGFAGESCS 490
GFA + CS
Sbjct: 460 FGFAKKKCS 468
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 127 bits (318), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 121/402 (30%), Positives = 179/402 (44%), Gaps = 51/402 (12%)
Query: 120 VSSHSVEVSQIQIPLASGVNFQTLN-YIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMS 176
V HS +S I IPL +++ Y + LG ++ V +DTGSD+ WV C C+
Sbjct: 59 VHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIR 118
Query: 177 CYNQQGPV----FKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDG 232
C + V + SS+ +S+ C+ + C + C S S C Y + YGDG
Sbjct: 119 CPRKSDLVELTPYDVDASSTAKSVSCSDNFCSYVN----QRSECHSG-STCQYVIMYGDG 173
Query: 233 SYTNGELGAE--HLSF------GGISVSNFVFGCGKNNKGLFG----GVSGLMGLGRSNL 280
S TNG L + HL G + +FGCG G G V G+MG G+SN
Sbjct: 174 SSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNS 233
Query: 281 SLISQTNSTFGGV---FSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSN 337
S ISQ S G V F++CL + G G A+G S TP+ S
Sbjct: 234 SFISQLASQ-GKVKRSFAHCLDNNNGG--GIFAIGEVVSPKVKTTPMLSK--------SA 282
Query: 338 FYMLNLTGIDVGGVAGQAPS--FGNG---GGLIDSGTVITRLAPSVYKALKAEFLKQFSG 392
Y +NL I+VG + S F +G G +IDSGT + L +VY L E L +
Sbjct: 283 HYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEIL---AS 339
Query: 393 FPSAPGFSILD--TCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLA 450
P ++ + TCF+ T + PT++ F+ +V L V + V+ED
Sbjct: 340 HPELTLHTVQESFTCFHYTDKLD-RFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQ 398
Query: 451 LASLSDE--YDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
L + + I+G+ N+ V+YD + +G+ +CS
Sbjct: 399 NGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11930579-11931769 REVERSE LENGTH=396
Length = 396
Score = 126 bits (316), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 115/402 (28%), Positives = 174/402 (43%), Gaps = 52/402 (12%)
Query: 102 LTLDDLHVRS-MQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQNMTV-- 158
T+D +H RS +R+ S S P A+ V F Y++ +++G +
Sbjct: 30 FTMDLIHRRSNASSRVSNTQSGSS--------PYANTV-FDNSVYLMKLQVGTPPFEIQA 80
Query: 159 IIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACES 218
IIDTGS++TW QC PC+ CY Q P+F PS SS+++ C+ +C
Sbjct: 81 IIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCDGHSCP-------------- 126
Query: 219 NPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVF-----GCGKNNKGLFGGVSGLM 273
Y V+Y D +YT G L E ++ S FV GCG NN SG++
Sbjct: 127 ------YEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNSWFKPSFSGMV 180
Query: 274 GLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNP 333
GL SLI+Q + G+ SYC + + G + V + + T M
Sbjct: 181 GLNWGPSSLITQMGGEYPGLMSYCF---SGQGTSKINFGANAIVAGD--GVVSTTMFMTT 235
Query: 334 QLSNFYMLNLTGIDVGG--VAGQAPSFG--NGGGLIDSGTVITRLAPSVYKALKAEFLKQ 389
FY LNL + VG + +F G +IDSGT +T S ++
Sbjct: 236 AKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHV 295
Query: 390 FSGFPSAPGFSILDTCFNLTGNEEVNI-PTISMNFEDNVVLNVDATGIFYIVKEDASQVC 448
+ +A C+N ++ ++I P I+M+F V L +D + Y+ + C
Sbjct: 296 VTAVRAADPTGNDMLCYN---SDTIDIFPVITMHFSGGVDLVLDKYNM-YMESNNGGVFC 351
Query: 449 LALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
LA+ S + AI GN Q N V YD+ V F+ +CS
Sbjct: 352 LAIICNSPTQE-AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392
>AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14685602-14686885 FORWARD LENGTH=427
Length = 427
Score = 123 bits (309), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 176/372 (47%), Gaps = 61/372 (16%)
Query: 145 YIVTMELGGQNMTVII--DTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSS- 201
++V + +G +T ++ DT SDL W+QC PC++CY Q P+F PS S ++++ C +S
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144
Query: 202 -TCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGI-------SVSN 253
+ SL+ +N +C Y++ Y D + + G L E L F I ++ +
Sbjct: 145 YSMPSLKFN--------ANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHD 196
Query: 254 FVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGA--SGSLAM 311
VFGCG +N G +G++GLG SL+ + FG FSYC D + L +
Sbjct: 197 VVFGCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGKKFSYCFGSLDDPSYPHNVLVL 252
Query: 312 GNE-SSVFKNLTPIAYTRMVPNPQLSN-FYMLNLTGIDVGG--------VAGQAPSFGNG 361
G++ +++ + TP+ ++ N FY + + I V G V + G G
Sbjct: 253 GDDGANILGDTTPL---------EIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLG 303
Query: 362 GGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDT----CFNLTGNEEVNI- 416
G +ID+G +T L YK LK F G +A S D C+N GN E ++
Sbjct: 304 GTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYN--GNFERDLV 361
Query: 417 ----PTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQR 472
P ++ +F + L++D +F +K + CLA+ ++ IG Q++
Sbjct: 362 ESGFPIVTFHFSEGAELSLDVKSLF--MKLSPNVFCLAVTP----GNLNSIGATAQQSYN 415
Query: 473 VIYDTKQSKVGF 484
+ YD + +V F
Sbjct: 416 IGYDLEAMEVSF 427
>AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:17299264-17302718 FORWARD LENGTH=631
Length = 631
Score = 118 bits (296), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 149/353 (42%), Gaps = 37/353 (10%)
Query: 154 QNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNA 213
Q +I+DTGS +T+V C C C Q P F+P S+SYQ++ CN C
Sbjct: 87 QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCNPD-CN--------- 136
Query: 214 GACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGG---ISVSNFVFGCGKNNKG-LFG-G 268
C+ C Y Y + S ++G L + +SFG +S VFGC G LF
Sbjct: 137 --CDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEETGDLFSQR 194
Query: 269 VSGLMGLGRSNLSLISQ--TNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAY 326
G+MGLGR LS++ Q VFS C + G G++ +G S P
Sbjct: 195 ADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG-GAMVLGKIS------PPPGM 247
Query: 327 TRMVPNPQLSNFYMLNLTGIDVGGVAGQA-PSFGNG--GGLIDSGTVITRLAPSVYKALK 383
+P S +Y ++L + V G + + P NG G ++DSGT + A+K
Sbjct: 248 VFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIK 307
Query: 384 AEFLKQFSGFPS--APGFSILDTCFNLTGNEEVNI----PTISMNFEDNVVLNVDATGIF 437
+K+ P + D CF+ G + I P I+M F + L +
Sbjct: 308 DAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYL 367
Query: 438 YIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+ + CL + D ++G RN V YD + K+GF +CS
Sbjct: 368 FRHTKVRGAYCLGI--FPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418
>AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:18241003-18242478 FORWARD LENGTH=491
Length = 491
Score = 116 bits (291), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 132/437 (30%), Positives = 190/437 (43%), Gaps = 77/437 (17%)
Query: 113 QNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGG--QNMTVIIDTGSDLTWVQ 170
Q R++K +SS V + PL + Y++T+ +G Q + V +DTGSDLTWV
Sbjct: 59 QERIKKPLSS----VDVVMEPLREVRD----GYLITLNIGTPPQAVQVYLDTGSDLTWVP 110
Query: 171 CE----PCMSCYN------QQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESN- 219
C C+ CY+ + VF P SS+ C SS C + + C
Sbjct: 111 CGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAG 170
Query: 220 -------PSNC-----SYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFG 267
S C S+A YG+G +G L + L V F FGC + +
Sbjct: 171 CSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCVTST---YR 227
Query: 268 GVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPP----TDAGASGSLAMGNESSVFKNLTP 323
G+ G GR LSL SQ G FS+C P + S L +G S++ NLT
Sbjct: 228 EPIGIAGFGRGLLSLPSQLGFLEKG-FSHCFLPFKFVNNPNISSPLILG-ASALSINLTD 285
Query: 324 -IAYTRMVPNPQLSNFYMLNLTGIDVGG--VAGQAP-------SFGNGGGLIDSGTVITR 373
+ +T M+ P N Y + L I +G Q P S GNGG L+DSGT T
Sbjct: 286 SLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTH 345
Query: 374 LAPSVYKALKAEFLKQFSGFPSAP------GFSILDTCF-------NLTGNEE---VNIP 417
L Y L L+ +P A GF D C+ NLT E + P
Sbjct: 346 LPEPFYSQLLTT-LQSTITYPRATETESRTGF---DLCYKVPCPNNNLTSLENDVMMIFP 401
Query: 418 TISMNFEDNVVLNVDATGIFYIV--KEDASQV-CLALASLSD-EYDIA-IIGNYQQRNQR 472
+I+ +F +N L + FY + D S V CL ++ D +Y A + G++QQ+N +
Sbjct: 402 SITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVK 461
Query: 473 VIYDTKQSKVGFAGESC 489
V+YD ++ ++GF C
Sbjct: 462 VVYDLEKERIGFQAMDC 478
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 115 bits (287), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 158/354 (44%), Gaps = 37/354 (10%)
Query: 154 QNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNA 213
Q +I+D+GS +T+V C C C Q P F+P SS+YQ + CN C
Sbjct: 104 QMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMD-CN--------- 153
Query: 214 GACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGG---ISVSNFVFGCGKNNKG-LFG-G 268
C+ + C Y Y + S + G LG + +SFG ++ VFGC G L+
Sbjct: 154 --CDDDREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQR 211
Query: 269 VSGLMGLGRSNLSLISQ--TNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAY 326
G++GLG+ +LSL+ Q F C D G GS+ +G F + + +
Sbjct: 212 ADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGG-GSMILGG----FDYPSDMVF 266
Query: 327 TRMVPNPQLSNFYMLNLTGIDVGG--VAGQAPSF-GNGGGLIDSGTVITRLAPSVYKALK 383
T +P S +Y ++LTGI V G ++ + F G G ++DSGT L + + A +
Sbjct: 267 TDS--DPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFE 324
Query: 384 AEFLKQFSGFP--SAPGFSILDTCFNLTGNEEVN-----IPTISMNFEDNVVLNVDATGI 436
+++ S P + DTCF + + V+ P++ M F+ +
Sbjct: 325 EAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENY 384
Query: 437 FYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+ + CL + ++ ++G RN V+YD + SKVGF +CS
Sbjct: 385 MFRHSKVHGAYCLGVFPNGKDH-TTLLGGIVVRNTLVVYDRENSKVGFWRTNCS 437
>AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11934208-11935386 REVERSE LENGTH=392
Length = 392
Score = 114 bits (286), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 100/340 (29%), Positives = 143/340 (42%), Gaps = 38/340 (11%)
Query: 160 IDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESN 219
IDTGSDL W QC PC +CY+Q P+F PS SS+++ CN ++C
Sbjct: 78 IDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNSCH--------------- 122
Query: 220 PSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVF-----GCGKNNKGLFGGVSGLMG 274
Y + Y D +Y+ G L E ++ S FV GCG N+ SG++G
Sbjct: 123 -----YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMVG 177
Query: 275 LGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQ 334
L SLI+Q + G+ SYC + + + G + V + + T M
Sbjct: 178 LSWGPSSLITQMGGEYPGLMSYCFA---SQGTSKINFGTNAIVAGD--GVVSTTMFLTTA 232
Query: 335 LSNFYMLNLTGIDVGG--VAGQAPSFG--NGGGLIDSGTVITRLAPSVYKALKAEFLKQF 390
Y LNL + VG V +F G +IDSGT +T P Y L E + +
Sbjct: 233 KPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF-PVSYCNLVREAVDHY 291
Query: 391 SGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLA 450
+ D T ++ P I+M+F L +D + YI CLA
Sbjct: 292 VTAVRTADPTGNDMLCYYTDTIDI-FPVITMHFSGGADLVLDKYNM-YIETITRGTFCLA 349
Query: 451 LASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+ + D AI GN Q N V YD+ V F+ +CS
Sbjct: 350 IICNNPPQD-AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:12033953-12037527 FORWARD LENGTH=756
Length = 756
Score = 111 bits (277), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 156/364 (42%), Gaps = 50/364 (13%)
Query: 145 YIVTMELGGQNMTVI--IDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSST 202
Y++ +++G ++ IDTGSD+ W QC PC +CY+Q P+F PS SS+++ CN ++
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNGNS 480
Query: 203 CQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFV-----FG 257
C Y + Y D +Y+ G L E ++ S FV G
Sbjct: 481 CH--------------------YEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIG 520
Query: 258 CGKNN-----KGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMG 312
CG +N G SG++GL LSLISQ + + G+ SYC + + G
Sbjct: 521 CGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCF---SGQGTSKINFG 577
Query: 313 NESSVFKNLTPIAYTRMVP--NPQLSNFYMLNLTGIDVG----GVAGQAPSFGNGGGLID 366
+ V + T +A + NP FY LNL + V G +G ID
Sbjct: 578 TNAIVAGDGT-VAADMFIKKDNP----FYYLNLDAVSVEDNLIATLGTPFHAEDGNIFID 632
Query: 367 SGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDN 426
SGT +T P Y L E ++Q P + + ++ P I+M+F
Sbjct: 633 SGTTLTYF-PMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTIDI-FPVITMHFSGG 690
Query: 427 VVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAG 486
L +D + Y+ CLA+ +D A+ GN Q N V YD + + F+
Sbjct: 691 ADLVLDKYNM-YLETITGGIFCLAIG-CNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSP 748
Query: 487 ESCS 490
+CS
Sbjct: 749 TNCS 752
Score = 99.8 bits (247), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 102/353 (28%), Positives = 151/353 (42%), Gaps = 54/353 (15%)
Query: 145 YIVTMELGGQNMTVI--IDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSST 202
Y++ +++G + IDTGSDL W QC PC CY+Q P+F PS SS++ C+ +
Sbjct: 82 YLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHGKS 141
Query: 203 CQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFV-----FG 257
C Y + Y D +Y+ G L E ++ S FV G
Sbjct: 142 CH--------------------YEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIG 181
Query: 258 CG-----KNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMG 312
CG +N G SG++GL SLISQ + + G+ SYC + + G
Sbjct: 182 CGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCF---SGQGTSKINFG 238
Query: 313 NESSVFKNLTPIAYTRMVP--NPQLSNFYMLNLTGIDVGG----VAGQAPSFGNGGGLID 366
+ V + T +A + NP FY LNL + V G +G +ID
Sbjct: 239 TNAIVAGDGT-VAADMFIKKDNP----FYYLNLDAVSVEDNRIETLGTPFHAEDGNIVID 293
Query: 367 SGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDT-CFNLTGNEEVNI-PTISMNFE 424
SG+ +T P Y L + ++Q P S D C+ +E ++I P I+M+F
Sbjct: 294 SGSTVTYF-PVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCY---FSETIDIFPVITMHFS 349
Query: 425 DNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDT 477
L +D + Y+ CLA+ S + AI GN Q N V YD+
Sbjct: 350 GGADLVLDKYNM-YMESNSGGLFCLAIICNSPTQE-AIFGNRAQNNFLVGYDS 400
>AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:10185229-10186605 REVERSE LENGTH=458
Length = 458
Score = 109 bits (273), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/409 (26%), Positives = 178/409 (43%), Gaps = 57/409 (13%)
Query: 110 RSMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQNMT--VIIDTGSDLT 167
+ +QN + K + S S Q+ + + +T ++V +G + I+DTGS L
Sbjct: 68 KYLQNSIDKELGS-----SNFQVDVEQAI--KTSLFLVNFSVGQPPVPQLTIMDTGSSLL 120
Query: 168 WVQCEPCMSCYNQQ--GPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSY 225
W+QC+PC C + PVF P+ SS++ C+ C+ G C S+ + C Y
Sbjct: 121 WIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCR-----YAPNGHCGSS-NKCVY 174
Query: 226 AVNYGDGSYTNGELGAEHLSF----GGISVSN-FVFGCG-KNNKGLFGGVSGLMGLGRSN 279
Y G+ + G L E L+F G V+ FGCG +N + L +G++GLG
Sbjct: 175 EQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKP 234
Query: 280 LSLISQTNSTFGGVFSYCLPPTDAGASG--SLAMGNESSVFKNLTPIAYTRMVPNPQLSN 337
SL Q G FSYC+ G L +G ++ + + TPI + ++
Sbjct: 235 TSLAVQ----LGSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETE------NS 284
Query: 338 FYMLNLTGIDVGGV---------AGQAPSFGNGGGLIDSGTVITRLAPSVYKALKAEFLK 388
Y +NL GI VG + P G ++DSGT+ T LA Y+ L E
Sbjct: 285 IYYMNLEGISVGDTQLNIEPVVFKRRGP---RTGVILDSGTLYTWLADIAYRELYNEIKS 341
Query: 389 QFSGFPSAPGFSILD-TCFNLTGNEE-VNIPTISMNFEDNVVLNVDATGIFYIVKEDAS- 445
P F D C++ +EE + P ++ +F L ++AT +FY + E +
Sbjct: 342 ILD--PKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLSEPNTF 399
Query: 446 -QVCLALASLSD---EY-DIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
C+++ + EY + IG Q+ + YD K+ + C
Sbjct: 400 NVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDC 448
>AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24647221-24648513 FORWARD LENGTH=430
Length = 430
Score = 109 bits (272), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 168/390 (43%), Gaps = 72/390 (18%)
Query: 142 TLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCN 199
++ I+++ +G Q +++DTGS L+W+QC + F PS SSS+ ++PC+
Sbjct: 69 SMALIISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCS 127
Query: 200 SSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVS-NFVFGC 258
C+ +C+SN C Y+ Y DG++ G L E ++F ++ + GC
Sbjct: 128 HPLCKPRIPDFTLPTSCDSN-RLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGC 186
Query: 259 GKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDA----GASGSLAMGN- 313
+ G++G+ R LS +SQ + FSYC+PP +GS +G+
Sbjct: 187 ATESS----DDRGILGMNRGRLSFVSQAKIS---KFSYCIPPKSNRPGFTPTGSFYLGDN 239
Query: 314 ---------------ESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSF 358
ES NL P+AYT VP + L +++ G + +
Sbjct: 240 PNSHGFKYVSLLTFPESQRMPNLDPLAYT--VPMIGIR----FGLKKLNISGSVFRPDAG 293
Query: 359 GNGGGLIDSGTVITRLAPSVYKALKAEFLKQ------------------FSGFPSAPGFS 400
G+G ++DSG+ T L + Y ++AE + + F G +
Sbjct: 294 GSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRL 353
Query: 401 ILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDI 460
I D F T E+ +P ++ V++NV G + V S + A ++
Sbjct: 354 IGDLVFVFTRGVEILVP------KERVLVNVG--GGIHCVGIGRSSMLGAASN------- 398
Query: 461 AIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
IIGN Q+N V +D +VGFA CS
Sbjct: 399 -IIGNVHQQNLWVEFDVTNRRVGFAKADCS 427
>AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:1762843-1766150 REVERSE LENGTH=485
Length = 485
Score = 107 bits (266), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 164/365 (44%), Gaps = 52/365 (14%)
Query: 158 VIIDTGSDLTWVQCEPCMSCYNQ-----QGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGN 212
V +DTGSD+ WV C C C + + ++ S S + + C+ C Q++ G
Sbjct: 95 VQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFC--YQISGGP 152
Query: 213 AGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVS--------NFVFGCGKNNKG 264
C++N S C Y YGDGS T G + + + ++ + +FGCG G
Sbjct: 153 LSGCKANMS-CPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSG 211
Query: 265 LFG-----GVSGLMGLGRSNLSLISQTNST--FGGVFSYCLPPTDAGASGSLAMGNESSV 317
+ G++G G++N S+ISQ S+ +F++CL + G G A+G
Sbjct: 212 DLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGG--GIFAIGRVVQP 269
Query: 318 FKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPS--FGNG---GGLIDSGTVIT 372
N+TP +VPN Y +N+T + VG P+ F G G +IDSGT +
Sbjct: 270 KVNMTP-----LVPN---QPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLA 321
Query: 373 RLAPSVYKALKAEFLKQFSGFPSAPGFSILD---TCFNLTGNEEVNIPTISMNFEDNVVL 429
L +Y+ L +K+ + A I+D CF +G + P ++ +FE++V L
Sbjct: 322 YLPEIIYEPL----VKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFL 377
Query: 430 NVDATGIFYIVKEDASQVCLALASLS----DEYDIAIIGNYQQRNQRVIYDTKQSKVGFA 485
V + C+ + + D ++ ++G+ N+ V+YD + +G+
Sbjct: 378 RVYPHDYLF---PHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWT 434
Query: 486 GESCS 490
+CS
Sbjct: 435 EYNCS 439
>AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14912862-14914190 FORWARD LENGTH=442
Length = 442
Score = 104 bits (260), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 167/375 (44%), Gaps = 40/375 (10%)
Query: 142 TLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGPV--FKPSTSSSYQSIP 197
++ I+++ +G Q+ +++DTGS L+W+QC P P F PS SSS+ +P
Sbjct: 77 SMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLP 136
Query: 198 CNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVS-NFVF 256
C+ C+ +C+SN C Y+ Y DG++ G L E +F + +
Sbjct: 137 CSHPLCKPRIPDFTLPTSCDSN-RLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLIL 195
Query: 257 GCGK---NNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTD----AGASGSL 309
GC K + KG+ G M LGR LS ISQ + FSYC+P ++GS
Sbjct: 196 GCAKESTDEKGILG-----MNLGR--LSFISQAKIS---KFSYCIPTRSNRPGLASTGSF 245
Query: 310 AMGN--ESSVFKNLTPIAYTRMVPNPQLSNF-YMLNLTGIDVG-------GVAGQAPSFG 359
+G+ S FK ++ + + + P L Y + L GI +G G + + G
Sbjct: 246 YLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGG 305
Query: 360 NGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGF---SILDTCFNLTGNEEVN- 415
+G ++DSG+ T L Y +K E ++ G G+ S D CF+ + E+
Sbjct: 306 SGQTMVDSGSEFTHLVDVAYDKVKEEIVR-LVGSRLKKGYVYGSTADMCFDGNHSMEIGR 364
Query: 416 -IPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVI 474
I + F V + V+ + V V + +S+ IIGN Q+N V
Sbjct: 365 LIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAAS-NIIGNVHQQNLWVE 423
Query: 475 YDTKQSKVGFAGESC 489
+D +VGF+ C
Sbjct: 424 FDVTNRRVGFSKAEC 438
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 104 bits (260), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 165/383 (43%), Gaps = 51/383 (13%)
Query: 142 TLNYIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQQG-----PVFKPSTSSSYQ 194
T+ Y ++LG V IDTGSD+ WV C C +C + G F S +
Sbjct: 102 TMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAG 161
Query: 195 SIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGI----- 249
S+ C+ C S+ TT A C N + C Y+ YGDGS T+G + F I
Sbjct: 162 SVTCSDPICSSVFQTT--AAQCSEN-NQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESL 218
Query: 250 ---SVSNFVFGCGKNNKGLF----GGVSGLMGLGRSNLSLISQTNS--TFGGVFSYCLPP 300
S + VFGC G V G+ G G+ LS++SQ +S VFS+CL
Sbjct: 219 VANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-K 277
Query: 301 TDAGASGSLAMGNESSVFKNLTP-IAYTRMVPNPQLSNFYMLNLTGIDVGGV-----AGQ 354
D G +G + L P + Y+ +VP+ Y LNL I V G A
Sbjct: 278 GDGSGGGVFVLG------EILVPGMVYSPLVPS---QPHYNLNLLSIGVNGQMLPLDAAV 328
Query: 355 APSFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPS---APGFSILDTCFNLTGN 411
+ G ++D+GT +T L Y FL S S P S + C+ ++ +
Sbjct: 329 FEASNTRGTIVDTGTTLTYLVKEAYDL----FLNAISNSVSQLVTPIISNGEQCYLVSTS 384
Query: 412 EEVNIPTISMNFED--NVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQR 469
P++S+NF +++L Y + + AS C+ +E I+G+ +
Sbjct: 385 ISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ--TILGDLVLK 442
Query: 470 NQRVIYDTKQSKVGFAGESCSFT 492
++ +YD + ++G+A CS +
Sbjct: 443 DKVFVYDLARQRIGWASYDCSMS 465
>AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18150638-18153186 FORWARD LENGTH=583
Length = 583
Score = 103 bits (256), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 117/444 (26%), Positives = 186/444 (41%), Gaps = 66/444 (14%)
Query: 96 RKLHNQLTLDDL------HVRSMQNRLRKMV--------SSHSVEVSQIQIPLASGVNFQ 141
R+ H ++ +DL V SM L V S+ S++ S P+ V
Sbjct: 141 REFHERILEEDLGLENENFVESMDLELVNPVKVNDVLSTSAGSIDSSTTIFPVGGNVYPD 200
Query: 142 TLNY---IVTMELGGQNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSIP 197
L Y +V GQ + IDTGS+LTW+QC+ PC SC ++KP + +S
Sbjct: 201 GLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSE 260
Query: 198 CNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAE--HLSF--GGISVSN 253
Q QLT CE N C Y + Y D SY+ G L + HL G ++ S+
Sbjct: 261 AFCVEVQRNQLTE----HCE-NCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESD 315
Query: 254 FVFGCGKNNKGLFGG----VSGLMGLGRSNLSLISQTNS--TFGGVFSYCLPPTDAGASG 307
VFGCG + +GL G++GL R+ +SL SQ S V +CL +D G
Sbjct: 316 IVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCL-ASDLNGEG 374
Query: 308 SLAMGNESSVFKNLTP---IAYTRMVPNPQLSNFYMLNLTGIDVG-GVAGQAPSFGN-GG 362
+ MG++ L P + + M+ + +L + Y + +T + G G+ G G
Sbjct: 375 YIFMGSD------LVPSHGMTWVPMLHDSRL-DAYQMQVTKMSYGQGMLSLDGENGRVGK 427
Query: 363 GLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMN 422
L D+G+ T Y L L++ SG S D + + N P S++
Sbjct: 428 VLFDTGSSYTYFPNQAYSQLVTS-LQEVSGLELTRDDS--DETLPICWRAKTNFPFSSLS 484
Query: 423 ----FEDNVVLNVDATGIF-----------YIVKEDASQVCLALASLSDEYD--IAIIGN 465
F + L + + + Y++ + VCL + S +D I+G+
Sbjct: 485 DVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGD 544
Query: 466 YQQRNQRVIYDTKQSKVGFAGESC 489
R ++YD + ++G+ C
Sbjct: 545 ISMRGHLIVYDNVKRRIGWMKSDC 568
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 103 bits (256), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 163/380 (42%), Gaps = 51/380 (13%)
Query: 145 YIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQQG-----PVFKPSTSSSYQSIP 197
Y ++LG V IDTGSD+ WV C C +C + G F S + S+
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 198 CNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGI-------- 249
C+ C S+ TT A C N + C Y+ YGDGS T+G + F I
Sbjct: 160 CSDPICSSVFQTT--AAQCSEN-NQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 216
Query: 250 SVSNFVFGCGKNNKGLF----GGVSGLMGLGRSNLSLISQTNS--TFGGVFSYCLPPTDA 303
S + VFGC G V G+ G G+ LS++SQ +S VFS+CL D
Sbjct: 217 SSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDG 275
Query: 304 GASGSLAMGNESSVFKNLTP-IAYTRMVPNPQLSNFYMLNLTGIDVGGV-----AGQAPS 357
G +G + L P + Y+ +VP+ Y LNL I V G A +
Sbjct: 276 SGGGVFVLG------EILVPGMVYSPLVPS---QPHYNLNLLSIGVNGQMLPLDAAVFEA 326
Query: 358 FGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPS---APGFSILDTCFNLTGNEEV 414
G ++D+GT +T L Y FL S S P S + C+ ++ +
Sbjct: 327 SNTRGTIVDTGTTLTYLVKEAYDL----FLNAISNSVSQLVTPIISNGEQCYLVSTSISD 382
Query: 415 NIPTISMNFED--NVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQR 472
P++S+NF +++L Y + + AS C+ +E I+G+ +++
Sbjct: 383 MFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ--TILGDLVLKDKV 440
Query: 473 VIYDTKQSKVGFAGESCSFT 492
+YD + ++G+A CS +
Sbjct: 441 FVYDLARQRIGWASYDCSMS 460
>AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18151161-18153186 FORWARD LENGTH=410
Length = 410
Score = 102 bits (254), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 160/370 (43%), Gaps = 49/370 (13%)
Query: 153 GQNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTG 211
GQ + IDTGS+LTW+QC+ PC SC ++KP + +S Q QLT
Sbjct: 42 GQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTE- 100
Query: 212 NAGACESNPSNCSYAVNYGDGSYTNGELGAE--HLSF--GGISVSNFVFGCGKNNKGLFG 267
CE N C Y + Y D SY+ G L + HL G ++ S+ VFGCG + +GL
Sbjct: 101 ---HCE-NCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLL 156
Query: 268 G----VSGLMGLGRSNLSLISQTNS--TFGGVFSYCLPPTDAGASGSLAMGNESSVFKNL 321
G++GL R+ +SL SQ S V +CL +D G + MG++ L
Sbjct: 157 NTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCL-ASDLNGEGYIFMGSD------L 209
Query: 322 TP---IAYTRMVPNPQLSNFYMLNLTGIDVG-GVAGQAPSFGN-GGGLIDSGTVITRLAP 376
P + + M+ + +L + Y + +T + G G+ G G L D+G+ T
Sbjct: 210 VPSHGMTWVPMLHDSRL-DAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 268
Query: 377 SVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMN----FEDNVVLNVD 432
Y L L++ SG S D + + N P S++ F + L +
Sbjct: 269 QAYSQLVTS-LQEVSGLELTRDDS--DETLPICWRAKTNFPFSSLSDVKKFFRPITLQIG 325
Query: 433 ATGIF-----------YIVKEDASQVCLALASLSDEYD--IAIIGNYQQRNQRVIYDTKQ 479
+ + Y++ + VCL + S +D I+G+ R ++YD +
Sbjct: 326 SKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIVYDNVK 385
Query: 480 SKVGFAGESC 489
++G+ C
Sbjct: 386 RRIGWMKSDC 395
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 95.5 bits (236), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 160/378 (42%), Gaps = 50/378 (13%)
Query: 145 YIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQ-----QGPVFKPSTSSSYQSIP 197
Y ++LG + V IDTGSD+ WV C C C Q F P SSS +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 198 CNSSTCQS-LQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSN--- 253
C+ C S Q +G C N + CSY+ YGDGS T+G ++ +SF + S
Sbjct: 144 CSDRRCYSNFQTESG----CSPN-NLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAI 198
Query: 254 -----FVFGCGKNNKGLF----GGVSGLMGLGRSNLSLISQ--TNSTFGGVFSYCLPPTD 302
FVFGC G V G+ GLG+ +LS+ISQ VFS+CL D
Sbjct: 199 NSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL-KGD 257
Query: 303 AGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGG----VAGQAPSF 358
G + +G YT +VP+ Y +NL I V G + +
Sbjct: 258 KSGGGIMVLGQIKR-----PDTVYTPLVPS---QPHYNVNLQSIAVNGQILPIDPSVFTI 309
Query: 359 GNGGG-LIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDT---CFNLTGNEEV 414
G G +ID+GT + L Y + F++ + S G I CF +T +
Sbjct: 310 ATGDGTIIDTGTTLAYLPDEAY----SPFIQAVANAVSQYGRPITYESYQCFEITAGDVD 365
Query: 415 NIPTISMNFEDNVVLNVDATGIFYIVKEDASQV-CLALASLSDEYDIAIIGNYQQRNQRV 473
P +S++F + + I S + C+ +S I I+G+ +++ V
Sbjct: 366 VFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHR-RITILGDLVLKDKVV 424
Query: 474 IYDTKQSKVGFAGESCSF 491
+YD + ++G+A CS
Sbjct: 425 VYDLVRQRIGWAEYDCSL 442
>AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108781-16110679 REVERSE LENGTH=425
Length = 425
Score = 95.1 bits (235), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/390 (25%), Positives = 175/390 (44%), Gaps = 46/390 (11%)
Query: 127 VSQIQIPLASGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTWVQCE-PCMSCYNQQGP 183
VS + P+ V + Y VT+ +G + + +DTGSDLTW+QC+ PC+ C P
Sbjct: 43 VSSVVFPVHGNV-YPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP 101
Query: 184 VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEH 243
+++PS+ IPCN C++L L + CE+ P C Y V Y DG + G L +
Sbjct: 102 LYQPSS----DLIPCNDPLCKALHLNSNQ--RCET-PEQCDYEVEYADGGSSLGVLVRDV 154
Query: 244 LSFG---GISVS-NFVFGCGKNN---KGLFGGVSGLMGLGRSNLSLISQTNST--FGGVF 294
S G+ ++ GCG + + G++GLGR +S++SQ +S V
Sbjct: 155 FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVI 214
Query: 295 SYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQ 354
+CL + G L G++ + + +++T M + + S Y + G + G G+
Sbjct: 215 GHCL---SSLGGGILFFGDD---LYDSSRVSWTPM--SREYSKHYSPAMGGELLFG--GR 264
Query: 355 APSFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFP--SAPGFSILDTCFN----L 408
N + DSG+ T Y+A+ ++ SG P A L C+
Sbjct: 265 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPF 324
Query: 409 TGNEEVN--IPTISMNFE----DNVVLNVDATGIFYIVKEDASQVCLALASLSD--EYDI 460
EEV ++++F+ + + Y++ VCL + + ++ ++
Sbjct: 325 MSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEA--YLIISMKGNVCLGILNGTEIGLQNL 382
Query: 461 AIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+IG+ ++Q +IYD ++ +G+ C
Sbjct: 383 NLIGDISMQDQMIIYDNEKQSIGWMPVDCD 412
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 94.4 bits (233), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 167/376 (44%), Gaps = 45/376 (11%)
Query: 145 YIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQQG-----PVFKPSTSSSYQSIP 197
Y + LG ++ V +DTGSD+ WV C C C G F P +S + I
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 198 CNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGI-------- 249
C+ C S + + ++G C + C+Y YGDGS T+G ++ L F I
Sbjct: 141 CSDQRC-SWGIQSSDSG-CSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPN 198
Query: 250 SVSNFVFGCGKNNKGLF----GGVSGLMGLGRSNLSLISQTNS--TFGGVFSYCLPPTDA 303
S + VFGC + G V G+ G G+ +S+ISQ S VFS+CL +
Sbjct: 199 STAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENG 258
Query: 304 GASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVA----GQAPSFG 359
G G L +G V N+ +T +VP+ Y +NL I V G A S
Sbjct: 259 GG-GILVLGE--IVEPNMV---FTPLVPS---QPHYNVNLLSISVNGQALPINPSVFSTS 309
Query: 360 NG-GGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPT 418
NG G +ID+GT + L+ + Y E + P S + C+ +T + P
Sbjct: 310 NGQGTIIDTGTTLAYLSEAAYVPF-VEAITNAVSQSVRPVVSKGNQCYVITTSVGDIFPP 368
Query: 419 ISMNFEDNVVLNVDATGIFYIVKED----ASQVCLALASLSDEYDIAIIGNYQQRNQRVI 474
+S+NF + ++ Y+++++ + C+ + ++ I I+G+ +++ +
Sbjct: 369 VSLNFAGGASMFLNPQD--YLIQQNNVGGTAVWCIGFQRIQNQ-GITILGDLVLKDKIFV 425
Query: 475 YDTKQSKVGFAGESCS 490
YD ++G+A CS
Sbjct: 426 YDLVGQRIGWANYDCS 441
>AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3150843-3153380 FORWARD LENGTH=528
Length = 528
Score = 93.2 bits (230), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 118/476 (24%), Positives = 201/476 (42%), Gaps = 65/476 (13%)
Query: 48 VFNLQILQRKQQLGSL---GCLHPESRQEKGAIILEMKDRSYCSKKKVNWHRKLHNQLTL 104
+F + L ++ L SL +H S + + +I S +K+ + ++R L
Sbjct: 9 LFCVLFLATEETLASLFSSRLIHRFSDEGRASIKTPSSSDSLPNKQSLEYYRLLAES--- 65
Query: 105 DDLHVRSMQ--NRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQNMT--VII 160
D + M +++ +V S + ++SG +F L+Y +++G +++ V +
Sbjct: 66 -DFRRQRMNLGAKVQSLVPSEGSKT------ISSGNDFGWLHY-TWIDIGTPSVSFLVAL 117
Query: 161 DTGSDLTW-----VQCEPCMSCY-----NQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTT 210
DTGS+L W VQC P S Y + + PS+SS+ + C+ C S
Sbjct: 118 DTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDS----- 172
Query: 211 GNAGACESNPSNCSYAVNYGDGSYTNGELGAE---HLSF--------GGISV-SNFVFGC 258
A CES C Y VNY G+ ++ L E HL++ G SV + V GC
Sbjct: 173 --ASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGC 230
Query: 259 GKNNKGLF-GGVS--GLMGLGRSNLSLISQTNST--FGGVFSYCLPPTDAGASGSLAMGN 313
GK G + GV+ GLMGLG + +S+ S + FS C D SG + G+
Sbjct: 231 GKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCF---DEEDSGRIYFGD 287
Query: 314 ESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSFGNGGGLIDSGTVITR 373
+ TP + N + S Y++ + +G + SF IDSG T
Sbjct: 288 MGPSIQQSTPFL---QLDNNKYSG-YIVGVEACCIGNSCLKQTSFTT---FIDSGQSFTY 340
Query: 374 LAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDA 433
L +Y+ + E + + ++ F + + + E +P I + F N +
Sbjct: 341 LPEEIYRKVALEIDRHINA--TSKNFEGVSWEYCYESSAEPKVPAIKLKFSHNNTFVIHK 398
Query: 434 TGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
+ + Q CL + S S + I IG R R+++D + K+G++ C
Sbjct: 399 PLFVFQQSQGLVQFCLPI-SPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKC 453
>AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:7713488-7716269 FORWARD LENGTH=513
Length = 513
Score = 92.0 bits (227), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 106/413 (25%), Positives = 176/413 (42%), Gaps = 56/413 (13%)
Query: 108 HVRSMQNRLRKMVSSHSVEVSQIQIPLASG---VNFQTLNYI----VTMELGGQNMTVII 160
+ R M +R R + Q + + G V L ++ VT+ V +
Sbjct: 62 YYRVMAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVAL 121
Query: 161 DTGSDLTWVQCEPCMSCYNQ-QGP--------VFKPSTSSSYQSIPCNSSTCQSLQLTTG 211
DTGSDL W+ C+ C +C + + P ++ P+ SS+ +PCNS+ C T G
Sbjct: 122 DTGSDLFWLPCD-CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLC-----TRG 175
Query: 212 NAGACESNPSNCSYAVNY-GDGSYTNGELGAE--HLSFGGISV----SNFVFGCGKNNKG 264
+ C S S+C Y + Y +G+ + G L + HL S + FGCG+ G
Sbjct: 176 DR--CASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQTG 233
Query: 265 LF---GGVSGLMGLGRSNLSLIS--QTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFK 319
+F +GL GLG ++S+ S FS C + GA G ++ G++ SV +
Sbjct: 234 VFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCF--GNDGA-GRISFGDKGSVDQ 290
Query: 320 NLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSFGNGGGLIDSGTVITRLAPSVY 379
TP+ + P+P Y + +T I VGG G + DSGT T L + Y
Sbjct: 291 RETPLNIRQ--PHPT----YNITVTKISVGGNTGDL----EFDAVFDSGTSFTYLTDAAY 340
Query: 380 KALKAEF--LKQFSGFPSAPGFSILDTCFNLTGNEE-VNIPTISMNFEDNVVLNVDATGI 436
+ F L + + + C+ L+ N++ P +++ + V +
Sbjct: 341 TLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPV-YHPL 399
Query: 437 FYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
I +D CLA+ + D I+IIG RV++D ++ +G+ C
Sbjct: 400 VVIPMKDTDVYCLAIMKIED---ISIIGQNFMTGYRVVFDREKLILGWKESDC 449
>AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19064294-19066560 REVERSE LENGTH=488
Length = 488
Score = 89.7 bits (221), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 115/449 (25%), Positives = 202/449 (44%), Gaps = 61/449 (13%)
Query: 75 GAIILEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQIPL 134
G++ E+ R K V L +LD ++R R++ S+++ +Q I
Sbjct: 20 GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDRGRQLTSNNN---NQTTISF 76
Query: 135 ASGVNFQTLNYI----VTMELGGQNMTVIIDTGSDLTWVQCEPCMSCYN----QQGP--- 183
A G + + ++++ VT+ Q V +DTGSDL W+ C +C QG
Sbjct: 77 AQGNSTEEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIK 136
Query: 184 --VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNY-GDGSYTNGELG 240
++ PS S S + CNS+ C +L+ C S S+C Y + Y GS + G L
Sbjct: 137 LNIYNPSKSKSSSKVTCNSTLC-ALR------NRCISPVSDCPYRIRYLSPGSKSTGVLV 189
Query: 241 AE--HLSF--GGISVSNFVFGCGKNNKGLFG--GVSGLMGLGRSNLSLISQTNSTFGGV- 293
+ H+S G + FGC ++ GLF V+G+MGL +++++ + GV
Sbjct: 190 EDVIHMSTEEGEARDARITFGCSESQLGLFKEVAVNGIMGLAIADIAVPNMLVK--AGVA 247
Query: 294 ---FSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGG 350
FS C P G+++ G++ S + TP++ T +P FY +++T VG
Sbjct: 248 SDSFSMCFGP---NGKGTISFGDKGSSDQLETPLSGTI---SPM---FYDVSITKFKVGK 298
Query: 351 VAGQAPSFGNGGGLIDSGTVITRLAPSVYKALKAEFL-----KQFSGFPSAPGFSILDTC 405
V DSGT +T L Y AL F ++ S +P + C
Sbjct: 299 VTVDT----EFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSP----FEFC 350
Query: 406 FNLTG-NEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQV-CLALASLSDEYDIAII 463
+ +T ++E +P++S + +V + + + + + QV CLA+ + D +II
Sbjct: 351 YIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNA-DFSII 409
Query: 464 GNYQQRNQRVIYDTKQSKVGFAGESCSFT 492
G N R+++D ++ +G+ +C+ T
Sbjct: 410 GQNFMTNYRIVHDRERRILGWKKSNCNDT 438
>AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:9329933-9331432 REVERSE LENGTH=499
Length = 499
Score = 86.7 bits (213), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 124/451 (27%), Positives = 189/451 (41%), Gaps = 93/451 (20%)
Query: 115 RLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQNMTVIIDTGSDLTWVQCEP- 173
R R+ H + Q+ +P++SG ++ + + +++ +DTGSDL W C P
Sbjct: 60 RFRR--HHHKQQQQQLSLPISSGSDYLISLSVGSSS---SAVSLYLDTGSDLVWFPCRPF 114
Query: 174 -CMSCYNQQGPVFKPSTSSS---------------YQSIP----CNSSTCQSLQLTTGNA 213
C+ C ++ P PS+ SS + S+P C S C + TG+
Sbjct: 115 TCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGD- 173
Query: 214 GACESNPSNCS-YAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSGL 272
C ++ C + YGDGS +L ++ LS +SVSNF FGC G+
Sbjct: 174 --CNTSSYPCPPFYYAYGDGSLV-AKLYSDSLSLPSVSVSNFTFGCAHTT---LAEPIGV 227
Query: 273 MGLGRSNLSLISQT---NSTFGGVFSYCLPPTDAGAS----------GSLAMGNESSV-- 317
G GR LSL +Q + G FSYCL + G E V
Sbjct: 228 AGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGT 287
Query: 318 ----------FKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSF-------GN 360
K +T M+ NP+ FY ++L GI +G AP+ G
Sbjct: 288 TDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGG 347
Query: 361 GGGLIDSGTVITRLAPSVYKALKAEFLKQFSGF--------PSAPGFSILDTCFNLTGNE 412
GG ++DSGT T L Y ++ EF + PS S + C+ L N+
Sbjct: 348 GGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPS----SGMSPCYYL--NQ 401
Query: 413 EVNIPTISMNFEDNV-VLNVDATGIFYIV------KEDASQV-CLALASLSDEYDI---- 460
V +P + ++F N + + FY KE+ ++ CL L + DE ++
Sbjct: 402 TVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGT 461
Query: 461 -AIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
AI+GNYQQ+ V+YD +VGFA C+
Sbjct: 462 GAILGNYQQQGFEVVYDLLNRRVGFAKRKCA 492
>AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:7568286-7569455 FORWARD LENGTH=389
Length = 389
Score = 85.9 bits (211), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 89/373 (23%), Positives = 163/373 (43%), Gaps = 39/373 (10%)
Query: 130 IQIPLASGVNFQTLNYIVTMELGG--QNMTVIIDTGSDLTWVQCEPCMSCYNQQ-GPVFK 186
+ +PL+S + + L ++ + G + + +DTGS LTW QC PC CY Q+ P ++
Sbjct: 43 VSLPLSSPHSQRGLAFMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYR 102
Query: 187 PSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSF 246
P+ S +Y+ C S +S A + C+Y +Y D + G L E ++
Sbjct: 103 PAASITYRDAMCEDSHPKS-----NPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITV 157
Query: 247 ----GGIS-VSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPP- 300
GG V FGC + G + +G++GLG S+I + FG FS+CL
Sbjct: 158 DTHDGGFKRVHGVYFGCNTLSDGSYFTGTGILGLGVGKYSIIGE----FGSKFSFCLGEI 213
Query: 301 TDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSFGN 360
++ AS +L +G+ ++V + T I T QL + + G+ + +
Sbjct: 214 SEPKASHNLILGDGANVQGHPTVINITEGHTIFQLESI------------IVGEEITLDD 261
Query: 361 GGGL-IDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILDT-CFNLTGNEEVNIPT 418
+ +D+G+ ++ L+ ++Y +F+ F + S T C+ E +
Sbjct: 262 PVQVFVDTGSTLSHLSTNLY----YKFVDAFDDLIGSRPLSYEPTLCYKADTIERLEKMD 317
Query: 419 ISMNFEDNVVLNVDATGIFYIVKEDASQV-CLALASLSDEYDIAIIGNYQQRNQRVIYDT 477
+ F+ L+V+ IF +++ ++ CLA+ + + + IIG + V YD
Sbjct: 318 VGFKFDVGAELSVNIHNIF--IQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDL 375
Query: 478 KQSKVGFAGESCS 490
+ C
Sbjct: 376 SAKTAYINKQDCD 388
>AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108928-16110670 REVERSE LENGTH=401
Length = 401
Score = 85.5 bits (210), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 131/280 (46%), Gaps = 30/280 (10%)
Query: 127 VSQIQIPLASGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTWVQCE-PCMSCYNQQGP 183
VS + P+ V + Y VT+ +G + + +DTGSDLTW+QC+ PC+ C P
Sbjct: 40 VSSVVFPVHGNV-YPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP 98
Query: 184 VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEH 243
+++PS+ IPCN C++L L + CE+ P C Y V Y DG + G L +
Sbjct: 99 LYQPSS----DLIPCNDPLCKALHLNSNQ--RCET-PEQCDYEVEYADGGSSLGVLVRDV 151
Query: 244 LSFG---GISVS-NFVFGCGKNN---KGLFGGVSGLMGLGRSNLSLISQTNST--FGGVF 294
S G+ ++ GCG + + G++GLGR +S++SQ +S V
Sbjct: 152 FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVI 211
Query: 295 SYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQ 354
+CL + G L G++ + + +++T M + + S Y + G + G G+
Sbjct: 212 GHCL---SSLGGGILFFGDD---LYDSSRVSWTPM--SREYSKHYSPAMGGELLFG--GR 261
Query: 355 APSFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFP 394
N + DSG+ T Y+A+ ++ SG P
Sbjct: 262 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKP 301
>AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:14665728-14669135 REVERSE LENGTH=430
Length = 430
Score = 85.1 bits (209), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 93/409 (22%), Positives = 156/409 (38%), Gaps = 77/409 (18%)
Query: 116 LRKMVS-SHSVEVSQI------------QIPLASGVNFQTLN---------YIVTMELGG 153
L++M+ SH ++++Q+ Q P+ N++ Y T+++G
Sbjct: 27 LKRMIPPSHELDLTQLMTFDSARHGRLLQSPVHGSFNWKVERDTSILLSALYYTTVQIGT 86
Query: 154 --QNMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTG 211
+ + V+IDTGSDL WV C C+ C F P SSS + C+ C S
Sbjct: 87 PPRELDVVIDTGSDLVWVSCNSCVGCPLHNVTFFDPGASSSAVKLACSDKRCSSDLQKKS 146
Query: 212 NAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSG 271
ES C+Y V YGDGS T+G ++ +SF ++S++ + ++N V
Sbjct: 147 RCSLLES----CTYKVEYGDGSVTSGYYISDLISFD--TMSDWTYIAFRDNSTWHPWV-- 198
Query: 272 LMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVP 331
+ + G + C P +S P+ Y
Sbjct: 199 -------------RQGAIIGTFPALCSTPCSTVSS---------------QPLYY----- 225
Query: 332 NPQLSNFYMLNLTGIDVGGVAGQAPSFGNG-GGLIDSGTVITRLAPSVYKALKAEFLKQF 390
NPQ S+ + + + + + S G G +IDSGT + Y L L
Sbjct: 226 NPQFSHMMTVAVNDLRL-PIDPSVFSVAKGYGTIIDSGTTLVHFPGEAYDPLIQAILNVV 284
Query: 391 SGFPSAPGFSILDTCFNLTGNEEVNI------PTISMNFEDNVVLNVDATGIFYIVKEDA 444
S + + CFN+T ++ P + + F + + + D
Sbjct: 285 SQYGRPIPYESFQ-CFNITSGISSHLVIADMFPEVHLGFAGGASMVIKPEAYLFQKFLDL 343
Query: 445 SQV--CLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESCSF 491
+ CL S S I IIG R++ +YD ++G+A +CS
Sbjct: 344 TNAIWCLGFYS-STSRRITIIGEVAIRDKMFVYDLDHQRIGWAEYNCSL 391
>AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114946-29117150 REVERSE LENGTH=432
Length = 432
Score = 83.2 bits (204), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 148/362 (40%), Gaps = 56/362 (15%)
Query: 160 IDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACES 218
IDTGSDLTWVQC+ PC C + +KP ++ ++PC+ C L L C
Sbjct: 84 IDTGSDLTWVQCDAPCNGCTKPRAKQYKP----NHNTLPCSHILCSGLDLPQDR--PCAD 137
Query: 219 NPSNCSYAVNYGDGSYTNGELGAEH----LSFGGISVSNFVFGCGKNNKG----LFGGVS 270
C Y + Y D + + G L + L+ G I FGCG + + +
Sbjct: 138 PEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTA 197
Query: 271 GLMGLGRSNLSLISQTNS--TFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTP---IA 325
G++GLGR + L +Q S V +CL T G L++G+E L P +
Sbjct: 198 GILGLGRGKVGLSTQLKSLGITKNVIVHCLSHT---GKGFLSIGDE------LVPSSGVT 248
Query: 326 YTRMVPNPQLSNFYMLN----LTGIDVGGVAGQAPSFGNGGGLIDSGTVITRLAPSVYKA 381
+T + N N YM L GV G F DSG+ T Y+A
Sbjct: 249 WTSLATNSPSKN-YMAGPAELLFNDKTTGVKGINVVF-------DSGSSYTYFNAEAYQA 300
Query: 382 LKAEFLKQFSGFP--SAPGFSILDTCFN----LTGNEEVN--IPTISMNF---EDNVVLN 430
+ K +G P L C+ L +EV TI++ F ++ +
Sbjct: 301 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQ 360
Query: 431 VDATGIFYIVKEDASQVCLALASLSD--EYDIAIIGNYQQRNQRVIYDTKQSKVGFAGES 488
V Y++ + +VCL + + ++ IIG+ + VIYD ++ ++G+
Sbjct: 361 VPPES--YLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSD 418
Query: 489 CS 490
C
Sbjct: 419 CD 420
>AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114705-29117150 REVERSE LENGTH=466
Length = 466
Score = 82.8 bits (203), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 148/362 (40%), Gaps = 56/362 (15%)
Query: 160 IDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACES 218
IDTGSDLTWVQC+ PC C + +KP ++ ++PC+ C L L C
Sbjct: 84 IDTGSDLTWVQCDAPCNGCTKPRAKQYKP----NHNTLPCSHILCSGLDLPQDR--PCAD 137
Query: 219 NPSNCSYAVNYGDGSYTNGELGAEH----LSFGGISVSNFVFGCGKNNKG----LFGGVS 270
C Y + Y D + + G L + L+ G I FGCG + + +
Sbjct: 138 PEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTA 197
Query: 271 GLMGLGRSNLSLISQTNS--TFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTP---IA 325
G++GLGR + L +Q S V +CL T G L++G+E L P +
Sbjct: 198 GILGLGRGKVGLSTQLKSLGITKNVIVHCLSHT---GKGFLSIGDE------LVPSSGVT 248
Query: 326 YTRMVPNPQLSNFYMLN----LTGIDVGGVAGQAPSFGNGGGLIDSGTVITRLAPSVYKA 381
+T + N N YM L GV G F DSG+ T Y+A
Sbjct: 249 WTSLATNSPSKN-YMAGPAELLFNDKTTGVKGINVVF-------DSGSSYTYFNAEAYQA 300
Query: 382 LKAEFLKQFSGFP--SAPGFSILDTCFN----LTGNEEVN--IPTISMNF---EDNVVLN 430
+ K +G P L C+ L +EV TI++ F ++ +
Sbjct: 301 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQ 360
Query: 431 VDATGIFYIVKEDASQVCLALASLSD--EYDIAIIGNYQQRNQRVIYDTKQSKVGFAGES 488
V Y++ + +VCL + + ++ IIG+ + VIYD ++ ++G+
Sbjct: 361 VPPES--YLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSD 418
Query: 489 CS 490
C
Sbjct: 419 CD 420
>AT5G24820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:8523406-8525297 FORWARD LENGTH=407
Length = 407
Score = 81.3 bits (199), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 21/247 (8%)
Query: 251 VSNFVFGCGKNNKGL----FGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGAS 306
V FVFGCG N+ GGV G + L SL+SQ T FS+CL P+ AG+
Sbjct: 166 VRGFVFGCGARNRATPEEDGGGVDGRLSLTTHRFSLLSQLRLT---RFSHCLWPSAAGSR 222
Query: 307 GSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSFGNGGGLID 366
+ +G+ +S ++ + M S Y + L GI +G + S + G ID
Sbjct: 223 NYIRLGSAASYGGDMVLVPMLNMTGTEAYS--YHVALFGISLG--QQRMRSNESSGIAID 278
Query: 367 SGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSILD-TCFNLTGNEEVN-IPTISMNFE 424
GT T L PS+Y+ +K E Q A + + + CF E++ +P ++++F+
Sbjct: 279 VGTYYTSLEPSLYEEVKEELTAQIG---PAVAYEVNELMCFTTEVGLEIDSLPKLTLHFQ 335
Query: 425 DNVVLNVDATGIFYIVKEDASQVCLAL--ASLSDEYDIAIIGNYQQRNQRVIYDTKQSKV 482
+ + G++ +++ S +C AL +S+ DE I ++G + V YDT Q +
Sbjct: 336 -GLDYTISNKGLY--LQDSPSSLCTALVRSSMKDEERINVLGASAFVDHAVGYDTSQRML 392
Query: 483 GFAGESC 489
F C
Sbjct: 393 AFQQRDC 399
>AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16993339-16995721 FORWARD LENGTH=524
Length = 524
Score = 79.7 bits (195), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 152/370 (41%), Gaps = 50/370 (13%)
Query: 145 YIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGP---------VFKPSTSSSY 193
+ T++LG G V +DTGSDL WV C+ C C +G ++ P S++
Sbjct: 107 HYTTVKLGTPGMRFMVALDTGSDLFWVPCD-CGKCAPTEGATYASEFELSIYNPKVSTTN 165
Query: 194 QSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDG-SYTNGELGAE--HLSFGGIS 250
+ + CN+S C C S C Y V+Y + T+G L + HL+ +
Sbjct: 166 KKVTCNNSLC-------AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKN 218
Query: 251 ---VSNFV-FGCGKNNKGLFGGVS---GLMGLGRSNLSLIS--QTNSTFGGVFSYCLPPT 301
V +V FGCG+ G F ++ GL GLG +S+ S FS C
Sbjct: 219 PERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCF--- 275
Query: 302 DAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSFGNG 361
G ++ G++ S + TP PN Y + +T + VG
Sbjct: 276 GHDGVGRISFGDKGSSDQEETPFNLNPSHPN------YNITVTRVRVGTTLID----DEF 325
Query: 362 GGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSAPGFSI-LDTCFNLTGNEEVN-IPTI 419
L D+GT T L +Y + F Q +P I + C++++ + + IP++
Sbjct: 326 TALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIPSL 385
Query: 420 SMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIGNYQQRNQRVIYDTKQ 479
S+ + N ++ I I E CLA+ S ++ IIG RV++D ++
Sbjct: 386 SLTMKGNSHFTINDP-IIVISTEGELVYCLAIVKSS---ELNIIGQNYMTGYRVVFDREK 441
Query: 480 SKVGFAGESC 489
+ + C
Sbjct: 442 LVLAWKKFDC 451
>AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19057013-19059788 REVERSE LENGTH=530
Length = 530
Score = 75.9 bits (185), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 156/392 (39%), Gaps = 60/392 (15%)
Query: 134 LASGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSC--------YNQQGP 183
L +NF + + LG V +DTGSDL W+ C +C +++ P
Sbjct: 92 LTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVP 151
Query: 184 --VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGA 241
++ P+ S++ SI C+ C +G C S S C Y + + T G L
Sbjct: 152 LNLYTPNASTTSSSIRCSDKRCFG-------SGKCSSPESICPYQIALSSNTVTTGTLLQ 204
Query: 242 E--HLSFGGISV----SNFVFGCGKNNKGLFG---GVSGLMGLGRSNL---SLISQTNST 289
+ HL + +N GCG+N G F V+G++GL SL+++ N T
Sbjct: 205 DVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANIT 264
Query: 290 FGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349
FS C G ++ G++ + TP+ + + S Y +N+TG+ VG
Sbjct: 265 -ANSFSMCFGRI-ISVVGRISFGDKGYTDQEETPLV------SLETSTAYGVNVTGVSVG 316
Query: 350 GVAGQAPSFGNGGGLIDSGTVITRLAPSVYKALKAEF--LKQFSGFPSAPGFSILDTCFN 407
GV P F L D+G+ T L S Y F L + P P F + C++
Sbjct: 317 GVPVDVPLF----ALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFP-FEFCYD 371
Query: 408 LTGNEEVNIPTISMNFE--------DNVVLNV--DATGIFYIVKEDASQVCLALASLSDE 457
L E +N + + D+ + D+ E CL +
Sbjct: 372 LR-EEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSYSNEGTKMYCLGILK---S 427
Query: 458 YDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
++ IIG R+++D ++ +G+ +C
Sbjct: 428 INLNIIGQNLMSGHRIVFDRERMILGWKQSNC 459
>AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19060485-19063248 REVERSE LENGTH=528
Length = 528
Score = 73.6 bits (179), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 149/360 (41%), Gaps = 52/360 (14%)
Query: 158 VIIDTGSDLTWVQCEPCMSCYN--------QQGP--VFKPSTSSSYQSIPCNSSTCQSLQ 207
V +DTGSDL W+ C +C Q P ++ P+ S++ SI C+ C
Sbjct: 117 VALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCF--- 173
Query: 208 LTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAE--HLSFGGISVS----NFVFGCGKN 261
+ C S S C Y ++Y + + T G L + HL+ +++ N GCG+
Sbjct: 174 ----GSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQK 229
Query: 262 NKGLF---GGVSGLMGL---GRSNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNES 315
GLF V+G++GL G S SL+++ N T FS C G G ++ G+
Sbjct: 230 QTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANIT-ANSFSMCFGRV-IGNVGRISFGDRG 287
Query: 316 SVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSFGNGGGLIDSGTVITRLA 375
+ TP + + P S Y +N++G+ V G F D+G+ T L
Sbjct: 288 YTDQEETP--FISVAP----STAYGVNISGVSVAGDPVDIRLFAK----FDTGSSFTHLR 337
Query: 376 PSVYKALKAEF--LKQFSGFPSAPGFSILDTCFNLTGNE-EVNIPTISMNF--EDNVVLN 430
Y L F L + P P + C++L+ N + P + M F ++LN
Sbjct: 338 EPAYGVLTKSFDELVEDRRRPVDPELP-FEFCYDLSPNATTIQFPLVEMTFIGGSKIILN 396
Query: 431 VDATGIFYIVKEDASQVCLALASL-SDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
F+ + V L L S I +IG R+++D ++ +G+ C
Sbjct: 397 ----NPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVFDRERMILGWKQSLC 452
>AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:19627892-19629112 REVERSE LENGTH=406
Length = 406
Score = 70.5 bits (171), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 104/233 (44%), Gaps = 19/233 (8%)
Query: 268 GVSGLMGLGRSNLSLISQTNSTFGGV---FSYCLPPTDAG-ASGSLAMGNESSVFKNL-- 321
GV GL GL + L+ +Q G+ F+ CLP + G++ G +N+
Sbjct: 161 GVFGLAGLAPTALATWNQLTRPRLGLEKKFALCLPSDENPLKKGAIYFGGGPYKLRNIDA 220
Query: 322 -TPIAYTRMVPNPQLSNFYMLNLTGIDVGG----VAGQAPSF---GNGGGLIDSGTVITR 373
+ ++YTR++ NP+ N Y L L GI V G A A +F G+GG + + T
Sbjct: 221 RSMLSYTRLITNPRKLNNYFLGLKGISVNGNRILFAPNAFAFDRNGDGGVTLSTIFPFTM 280
Query: 374 LAPSVYKALKAEFLKQFSGFPSAPGFSILDTCFNLTGNEEVNIPTISMNFEDNVVLNVDA 433
L +Y+ F + SG P + + C + T N +V P I + + V+ +
Sbjct: 281 LRSDIYRVFIEAFSQATSGIPRVSSTTPFEFCLSTTTNFQV--PRIDLELANGVIWKLSP 338
Query: 434 TGIFYIVKEDASQVCLALASLSDEYDIAI-IGNYQQRNQRVIYDTKQSKVGFA 485
V +D + CLA + D A+ IG +Q N V +D +S GF+
Sbjct: 339 ANAMKKVSDDVA--CLAFVNGGDAAAQAVMIGIHQMENTLVEFDVGRSAFGFS 389
>AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:16787508-16789318 REVERSE LENGTH=405
Length = 405
Score = 70.5 bits (171), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 151/391 (38%), Gaps = 51/391 (13%)
Query: 128 SQIQIPLASGVNFQTLNYIVTMELGG--QNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPV 184
S + PL+ V F Y V M++G + IDTGSDLTWVQC+ PC C
Sbjct: 33 SSVVFPLSGNV-FPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNLQ 91
Query: 185 FKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHL 244
+KP + IPC++ C +L N C + C Y V Y D + G L +
Sbjct: 92 YKPKGN----IIPCSNPICTALHWP--NKPHCPNPQEQCDYEVKYADQGSSMGALVTDQF 145
Query: 245 SF----GGISVSNFVFGCGKNNKGLFG----GVSGLMGLGRSNLSLISQTNST--FGGVF 294
G FGCG + +G++GLGR + L++Q S V
Sbjct: 146 PLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVV 205
Query: 295 SYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQ 354
+CL + G L G+ NL P P N Y + G
Sbjct: 206 GHCL---SSKGGGFLFFGD------NLVPSIGVAWTPLLSQDNHYTTGPADLLFNG---- 252
Query: 355 APSFGNGGGLI-DSGTVITRLAPSVYKA---LKAEFLKQFSGFPSAPGFSILDTCFN--- 407
P+ G LI D+G+ T Y+ L LK S A L C+
Sbjct: 253 KPTGLKGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLK-VSPLKVAKEDKTLPICWKGAK 311
Query: 408 -LTGNEEVN--IPTISMNFED---NVVLNVDATGIFYIVKEDASQVCLALASLSDE--YD 459
EV TI++NF + N L + Y++ VCL L + S+ +
Sbjct: 312 PFKSVLEVKNFFKTITINFTNGRRNTQLYLAPE--LYLIVSKTGNVCLGLLNGSEVGLQN 369
Query: 460 IAIIGNYQQRNQRVIYDTKQSKVGFAGESCS 490
+IG+ + +IYD ++ ++G+ C+
Sbjct: 370 SNVIGDISMQGLMMIYDNEKQQLGWVSSDCN 400
>AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=350
Length = 350
Score = 69.7 bits (169), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 70/159 (44%), Gaps = 11/159 (6%)
Query: 111 SMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTW 168
++ R +S + ++ P+ SG + Y V + +G Q++ +I DTGSDL W
Sbjct: 50 ALDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVW 109
Query: 169 VQCEPCMSC-YNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNP--SNCSY 225
V+C C +C ++ VF P SS++ C C+ L A C S C Y
Sbjct: 110 VKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCR-LVPKPDRAPICNHTRIHSTCHY 168
Query: 226 AVNYGDGSYTNGELGAEHLSFGGIS-----VSNFVFGCG 259
Y DGS T+G E S S + + FGCG
Sbjct: 169 EYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCG 207
Score = 62.4 bits (150), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 75/147 (51%), Gaps = 13/147 (8%)
Query: 351 VAGQAPSFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSGFPSA----PGFSILDTCF 406
++GQ+ S GNGG ++DSGT + LA Y+++ A ++ P A PGF D C
Sbjct: 210 ISGQSVS-GNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK-LPIADALTPGF---DLCV 264
Query: 407 NLTG--NEEVNIPTISMNFEDNVVLNVDATGIFYIVKEDASQVCLALASLSDEYDIAIIG 464
N++G E +P + F V V ++I E+ Q CLA+ S+ + ++IG
Sbjct: 265 NVSGVTKPEKILPRLKFEFSGGAVF-VPPPRNYFIETEEQIQ-CLAIQSVDPKVGFSVIG 322
Query: 465 NYQQRNQRVIYDTKQSKVGFAGESCSF 491
N Q+ +D +S++GF+ C+
Sbjct: 323 NLMQQGFLFEFDRDRSRLGFSRRGCAL 349
>AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19053480-19056152 REVERSE LENGTH=529
Length = 529
Score = 69.3 bits (168), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 151/358 (42%), Gaps = 47/358 (13%)
Query: 158 VIIDTGSDLTWVQCEPCMSC--------YNQQGP--VFKPSTSSSYQSIPCNSSTCQSLQ 207
V +DTGSDL W+ C +C +Q P ++ P+TSS+ SI C+ C
Sbjct: 117 VALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSS 176
Query: 208 LTTGNAGACESNPSNCSYAVNY-GDGSYTNGELGAEHLSF----GGIS--VSNFVFGCGK 260
+ A S+C Y + Y ++T G L + L G+ +N GCGK
Sbjct: 177 RCSSPA-------SSCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANITLGCGK 229
Query: 261 NNKGLF---GGVSGLMGLGRSNLSL--ISQTNSTFGGVFSYCLPPTDAGASGSLAMGNES 315
N G V+GL+GLG + S+ I FS C G ++ G++
Sbjct: 230 NQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNI-IDVVGRISFGDKG 288
Query: 316 SVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQAPSFGNGGGLIDSGTVITRLA 375
+ TP+ T P+P Y +++T + VGG A L D+GT T L
Sbjct: 289 YTDQMETPLLPTE--PSPT----YAVSVTEVSVGGDAVGVQLL----ALFDTGTSFTHLL 338
Query: 376 PSVYKALKAEFLKQFSG--FPSAPGFSILDTCFNLTGNE-EVNIPTISMNFEDNVVLNVD 432
Y + F + P P + C++L+ N+ + P ++M FE + +
Sbjct: 339 EPEYGLITKAFDDHVTDKRRPIDPELP-FEFCYDLSPNKTTILFPRVAMTFEGGSQMFL- 396
Query: 433 ATGIFYIVKEDASQV-CLALASLSDEYDIAIIGNYQQRNQRVIYDTKQSKVGFAGESC 489
+F + ED S + CL + S ++ I IIG R+++D ++ +G+ C
Sbjct: 397 RNPLFIVWNEDNSAMYCLGILK-SVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDC 453
>AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6411720-6413170 REVERSE LENGTH=405
Length = 405
Score = 68.9 bits (167), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 148/365 (40%), Gaps = 64/365 (17%)
Query: 156 MTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGA 215
+ +++D G++LTW+ C K + SS + + C SSTC+S+ GN A
Sbjct: 53 VNLLLDLGTNLTWLDCR-------------KLKSLSSLRLVTCQSSTCKSIP---GNGCA 96
Query: 216 CES-----------NPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCG--KNN 262
+S NP V YT G + LS +SV +F F C K
Sbjct: 97 GKSCLYKQPNPLGQNPVVTGRVVQDRASLYTTD--GGKFLS--QVSVRHFTFSCAGEKAL 152
Query: 263 KGLFGGVSGLMGLGRSNLSLISQTNSTFGGV--FSYCLPPTDAGASGSLAM--------G 312
+GL V G++ L + S Q S F + FS CLP + G +
Sbjct: 153 QGLPPPVDGVLALSPGSSSFTKQVTSAFNVIPKFSLCLPSSGTGHFYIAGIHYFIPPFNS 212
Query: 313 NESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGGVAGQA-PSFGNGGGLIDSGTVI 371
+++ + + LTPI T S Y++ + I VGG A + P GG + +
Sbjct: 213 SDNPIPRTLTPIKGTD-------SGDYLITVKSIYVGGTALKLNPDLLTGGAKLSTVVHY 265
Query: 372 TRLAPSVYKALKAEF-LK-QFSGFPSAPGFSILDTCF-------NLTGNEEVNIPTISMN 422
T L +Y AL F LK + G P + CF NLT N+P I +
Sbjct: 266 TVLQTDIYNALAQSFTLKAKAMGIAKVPSVAPFKHCFDSRTAGKNLTAGP--NVPVIEIG 323
Query: 423 FEDNV-VLNVDATGIFYIVKEDASQVCLA-LASLSDEYDIAIIGNYQQRNQRVIYDTKQS 480
+ + G +VK + +CLA + D+ +IG +Q ++ + +D +
Sbjct: 324 LPGRIGEVKWGFYGANTVVKVKETVMCLAFIDGGKTPKDLMVIGTHQLQDHMLEFDFSGT 383
Query: 481 KVGFA 485
+ F+
Sbjct: 384 VLAFS 388
>AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:787143-788444 FORWARD LENGTH=433
Length = 433
Score = 67.0 bits (162), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 145/381 (38%), Gaps = 76/381 (19%)
Query: 157 TVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGAC 216
+V+ D G WV C+ K SS+YQS CNS+ C T+ G C
Sbjct: 58 SVVFDLGGRELWVDCD-------------KGYVSSTYQSPRCNSAVCSRAGSTS--CGTC 102
Query: 217 ESNPS-NCSYAV------NYGDGSYTNGELGAEHLSFGG---------ISVSNFVFGCGK 260
S P CS N G+ T+GE + +S + + N +F CG
Sbjct: 103 FSPPRPGCSNNTCGGIPDNTVTGTATSGEFALDVVSIQSTNGSNPGRVVKIPNLIFDCGA 162
Query: 261 NN--KGLFGGVSGLMGLGRSNLSLISQTNSTFG--GVFSYCLPPTDAGASGSLAMGNESS 316
KGL G G+ G+GR N+ L SQ + F F+ CL G GN
Sbjct: 163 TFLLKGLAKGTVGMAGMGRHNIGLPSQFAAAFSFHRKFAVCL----TSGKGVAFFGNGPY 218
Query: 317 VFK---NLTPIAYTRMVPNP----------QLSNFYMLNLTGIDVGGVAGQAP------- 356
VF ++ + T ++ NP + S+ Y + +T I + V P
Sbjct: 219 VFLPGIQISSLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQI--VEKTVPINPTLLK 276
Query: 357 ---SFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSG--FPSAPGFSILDTCFNLTGN 411
S G GG I S T L S+Y A +EF+KQ + CF+ T N
Sbjct: 277 INASTGIGGTKISSVNPYTVLESSIYNAFTSEFVKQAAARSIKRVASVKPFGACFS-TKN 335
Query: 412 EEVN-----IPTISMNFE-DNVVLNVDATGIFYIVKEDASQVCLALASLS-DEYDIAIIG 464
V +P I + +VV + V +D +CL + +IG
Sbjct: 336 VGVTRLGYAVPEIELVLHSKDVVWRIFGANSMVSVSDDV--ICLGFVDGGVNARTSVVIG 393
Query: 465 NYQQRNQRVIYDTKQSKVGFA 485
+Q + + +D +K GF+
Sbjct: 394 GFQLEDNLIEFDLASNKFGFS 414
>AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:790110-791414 FORWARD LENGTH=434
Length = 434
Score = 62.0 bits (149), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 148/381 (38%), Gaps = 76/381 (19%)
Query: 157 TVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGAC 216
+V+ D G WV C+ QG V S++Y+S CNS+ C + + G C
Sbjct: 59 SVVFDLGGREFWVDCD--------QGYV-----STTYRSPRCNSAVCS--RAGSIACGTC 103
Query: 217 ESNPS-NCS------YAVNYGDGSYTNGELGAEHLSFGG---------ISVSNFVFGCGK 260
S P CS + N G T+GE + +S + + N +F CG
Sbjct: 104 FSPPRPGCSNNTCGAFPDNSITGWATSGEFALDVVSIQSTNGSNPGRFVKIPNLIFSCGS 163
Query: 261 NN--KGLFGGVSGLMGLGRSNLSLISQTNS--TFGGVFSYCLPPTDAGASGSLAMGNESS 316
+ KGL G G+ G+GR N+ L Q + +F F+ CL G GN
Sbjct: 164 TSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNRKFAVCL----TSGRGVAFFGNGPY 219
Query: 317 VFK---NLTPIAYTRMVPNPQLSNF----------YMLNLTGIDVGGVAGQAP------- 356
VF ++ + T ++ NP + F Y + +T I + V P
Sbjct: 220 VFLPGIQISRLQKTPLLINPGTTVFEFSKGEKSPEYFIGVTAIKI--VEKTLPIDPTLLK 277
Query: 357 ---SFGNGGGLIDSGTVITRLAPSVYKALKAEFLKQFSG--FPSAPGFSILDTCFNLTGN 411
S G GG I S T L S+YKA +EF++Q + CF+ T N
Sbjct: 278 INASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQAAARSIKRVASVKPFGACFS-TKN 336
Query: 412 EEVN-----IPTISMNFE-DNVVLNVDATGIFYIVKEDASQVCLALASLS-DEYDIAIIG 464
V +P I + +VV + V +D +CL + +IG
Sbjct: 337 VGVTRLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDV--ICLGFVDGGVNPGASVVIG 394
Query: 465 NYQQRNQRVIYDTKQSKVGFA 485
+Q + + +D +K GF+
Sbjct: 395 GFQLEDNLIEFDLASNKFGFS 415
>AT3G12700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4038387 FORWARD LENGTH=263
Length = 263
Score = 57.4 bits (137), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/145 (24%), Positives = 67/145 (46%), Gaps = 24/145 (16%)
Query: 130 IQIPLASGVNFQTLNYIVTMELG--GQNMTVIIDTGSDLTWVQCEPCMSCYNQQGP---- 183
+++ L SG+++ T Y + +G + V++DTGS+LTWV C Y +G
Sbjct: 91 VKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCR-----YRARGKDNRR 145
Query: 184 VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVN--YG----------- 230
VF+ S S++++ C + TC+ + + C + + CSY +G
Sbjct: 146 VFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYREFFGVAWIRCKCIAR 205
Query: 231 DGSYTNGELGAEHLSFGGISVSNFV 255
+G ++G +H ++ +S V
Sbjct: 206 EGEIKYMQMGQQHKAYSQRRLSQLV 230