Miyakogusa Predicted Gene
- Lj0g3v0315369.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0315369.1 Non Characterized Hit- tr|I3T3U8|I3T3U8_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,98.66,0,no
description,Peptidase aspartic, catalytic; seg,NULL; Acid
proteases,Peptidase aspartic; PEPSIN,Pe,CUFF.21308.1
(450 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E-value
AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 538 e-153
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 484 e-137
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 481 e-136
AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family pr... 442 e-124
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 332 2e-91
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 328 3e-90
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 323 2e-88
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 314 1e-85
AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 153 2e-37
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 144 2e-34
AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family pr... 135 5e-32
AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family pr... 135 7e-32
AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 133 2e-31
AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 126 3e-29
AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 119 6e-27
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 118 8e-27
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 118 1e-26
AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 117 1e-26
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 117 1e-26
AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family pr... 117 2e-26
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 117 2e-26
AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 114 1e-25
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 113 3e-25
AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 5e-25
AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 7e-25
AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 3e-24
AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 4e-24
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 4e-24
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 108 1e-23
AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family pr... 106 4e-23
AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family pr... 106 4e-23
AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 104 1e-22
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 103 3e-22
AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 4e-22
AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family pr... 101 1e-21
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 101 1e-21
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 1e-21
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 4e-21
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 4e-21
AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 4e-21
AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 5e-21
AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 5e-21
AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 6e-21
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 98 1e-20
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 97 2e-20
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 96 5e-20
AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family pr... 94 2e-19
AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 1e-18
AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family pr... 90 3e-18
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 88 1e-17
AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 2e-17
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 2e-17
AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 3e-17
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 80 3e-15
AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 3e-15
AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family pr... 79 7e-15
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 73 4e-13
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 73 4e-13
AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family pr... 71 2e-12
AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 63 4e-10
AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 1e-09
AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 55 1e-07
AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr... 49 5e-06
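
The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST report itself: it assumes the whitespace-collapsed layout shown here (hit identifier, a `|`-separated description, then bit score and E-value as the last two fields), and the names `HIT_LINE`, `parse_evalue`, and `parse_hit` are hypothetical. One detail worth handling is that this legacy BLAST version prints very small E-values without a mantissa (for example `e-153`), which Python's `float()` does not accept as written.

```python
import re

# Hypothetical pattern for one line of the hit summary table, assuming the
# collapsed layout above: "<id> | <description> <bits> <evalue>".
HIT_LINE = re.compile(
    r"^(?P<id>\S+) \|.*\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float.

    Legacy BLAST abbreviates values such as 1e-153 to "e-153";
    float() rejects that form, so a leading "1" is added first.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hit(line):
    """Return (hit_id, bit_score, e_value), or None if the line does not match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    return (
        match.group("id"),
        float(match.group("bits")),
        parse_evalue(match.group("evalue")),
    )


# Three lines copied from the table above.
sample = [
    "AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 538 e-153",
    "AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 98 1e-20",
    "AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr... 49 5e-06",
]
hits = [parse_hit(line) for line in sample]

# Keep only hits below an illustrative (arbitrary) E-value cutoff of 1e-10.
strong = [hit_id for hit_id, bits, evalue in hits if evalue < 1e-10]
print(strong)
```

The cutoff of 1e-10 is only an illustration; an appropriate threshold depends on the analysis. For full reports, an established parser is likely a safer choice than a hand-written pattern.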
>AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:1762843-1766150 REVERSE LENGTH=485
Length = 485
Score = 538 bits (1385), Expect = e-153, Method: Compositional matrix adjust.
Identities = 251/437 (57%), Positives = 327/437 (74%), Gaps = 2/437 (0%)
Query: 13 VEIFCVVANANLVFPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTK 72
V + C N+ + R Q SLT +K HD R+ IL+ +D LGG G P++ GLY+ K
Sbjct: 24 VSVSCNPGVFNVKYRYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAK 83
Query: 73 IGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHN 132
IG+G+P+K YYVQVDTGSDI+WVNC++C +CPR+S +GI LTLY+ S + + VSC+ +
Sbjct: 84 IGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDD 143
Query: 133 FCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIF 192
FC G + GCKA CPY YGDGS+T GY+V+D + ++ V G+L T T N S+IF
Sbjct: 144 FCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIF 203
Query: 193 GCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSI 252
GCGA QSG L SS+EEALDGI+GFG+ANSS++SQLA+SG+VKKIF+HCLD GGGIF+I
Sbjct: 204 GCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAI 263
Query: 253 GEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYL 312
G VV+PKV TPLVPN HYNV + ++V + L +P+D F + KG +IDSGTTLAYL
Sbjct: 264 GRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYL 323
Query: 313 PRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHD 372
P I+Y+ L+ K+ +++P LKV++V++ Y CFQY+G VD GFP V HFE+S+ L VYPHD
Sbjct: 324 PEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHD 383
Query: 373 YLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSS 432
YLF ++G WCIGWQ SA ++++ ++MTLLGD VLSNKLV+YDLEN IGWT+YNCSSS
Sbjct: 384 YLFPHEG--MWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSSS 441
Query: 433 IKVKDEKTGIVHTVGAH 449
IKVKDE TG VH VG+H
Sbjct: 442 IKVKDEGTGTVHLVGSH 458
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 484 bits (1246), Expect = e-137, Method: Compositional matrix adjust.
Identities = 235/433 (54%), Positives = 306/433 (70%), Gaps = 10/433 (2%)
Query: 21 NANLVFPVQRR----QASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLG 76
+ N VF V + + L+ +K+HDS R R+L+ +D LGG+ + GLYFTKI LG
Sbjct: 26 SGNFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLG 85
Query: 77 SPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSS 136
SP K+YYVQVDTGSDILWVNC C +CP K+D+GI L+LYD K S TS+ V CE +FCS
Sbjct: 86 SPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSF 145
Query: 137 TYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGA 196
+ C A+ PC Y + YGDGS + G +++D +T +V GNL TA ++FGCG
Sbjct: 146 IMQSET--CGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGK 203
Query: 197 AQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVV 256
QSG L + + A+DGI+GFGQ+N+S++SQLAA G K+IFSHCLD GGGIF++GEV
Sbjct: 204 NQSGQLGQT-DSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGEVE 262
Query: 257 EPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIV 316
P VKTTP+VPN HYNVILK ++VDGD + LP + GT+IDSGTTLAYLP+ +
Sbjct: 263 SPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNL 322
Query: 317 YDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFN 376
Y+ L+ K+ AKQ ++K+++V+E ++CF +T N D FP+V LHFEDSL L+VYPHDYLF+
Sbjct: 323 YNSLIEKITAKQ-QVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFS 381
Query: 377 YKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSIKVK 436
+ D Y C GWQ T++G D+ LLGD VLSNKLVVYDLEN IGW D+NCSSSIKVK
Sbjct: 382 LREDMY-CFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVK 440
Query: 437 DEKTGIVHTVGAH 449
D +G + +GA
Sbjct: 441 D-GSGAAYQLGAE 452
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 481 bits (1237), Expect = e-136, Method: Compositional matrix adjust.
Identities = 236/435 (54%), Positives = 306/435 (70%), Gaps = 11/435 (2%)
Query: 20 ANANLVFPVQ-----RRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIG 74
A+ NLVF V+ +R L ++AHD R R+LSA+D LGG+ P GLYF KIG
Sbjct: 31 ASENLVFEVRSKFAGKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIG 90
Query: 75 LGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFC 134
LG+PS+D++VQVDTGSDILWVNC C RCPRKSD+ + LT YD S T++ VSC NFC
Sbjct: 91 LGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VELTPYDVDASSTAKSVSCSDNFC 149
Query: 135 SSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGC 194
S Y + C + + C Y I YGDGS+T GY V+D + + V GN T + N +IIFGC
Sbjct: 150 S--YVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGC 207
Query: 195 GAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGE 254
G+ QSG L S + A+DGI+GFGQ+NSS +SQLA+ GKVK+ F+HCLD N GGGIF+IGE
Sbjct: 208 GSKQSGQLGES-QAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGE 266
Query: 255 VVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPR 314
VV PKVKTTP++ AHY+V L IEV +L+L S+ FDS + KG +IDSGTTL YLP
Sbjct: 267 VVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPD 326
Query: 315 IVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYL 374
VY+ L++++LA P L ++ V+E ++CF YT +D FP V F+ S+SL VYP +YL
Sbjct: 327 AVYNPLLNEILASHPELTLHTVQESFTCFHYTDKLDR-FPTVTFQFDKSVSLAVYPREYL 385
Query: 375 FNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSIK 434
F + D+ WC GWQ +TK G +T+LGD LSNKLVVYD+EN IGWT++NCS I+
Sbjct: 386 FQVREDT-WCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCSGGIQ 444
Query: 435 VKDEKTGIVHTVGAH 449
VKDE++G ++TVGAH
Sbjct: 445 VKDEESGAIYTVGAH 459
>AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24230963-24233349 REVERSE LENGTH=475
Length = 475
Score = 442 bits (1136), Expect = e-124, Method: Compositional matrix adjust.
Identities = 220/433 (50%), Positives = 296/433 (68%), Gaps = 13/433 (3%)
Query: 20 ANANLVFPVQRRQA----SLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGL 75
A+AN VF Q + A +L K+HD+ R R+L+++D LGG+ + GLYFTKI L
Sbjct: 21 ASANFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKL 80
Query: 76 GSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCS 135
GSP K+Y+VQVDTGSDILW+NC C +CP K+++ L+L+D S TS+ V C+ +FCS
Sbjct: 81 GSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCS 140
Query: 136 STYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCG 195
+ C+ C Y I Y D S + G +++D LT +V G+L T ++FGCG
Sbjct: 141 FISQSD--SCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCG 198
Query: 196 AAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEV 255
+ QSG L + + A+DG++GFGQ+N+SVLSQLAA+G K++FSHCLD GGGIF++G V
Sbjct: 199 SDQSGQLG-NGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVV 257
Query: 256 VEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRI 315
PKVKTTP+VPN HYNV+L ++VDG L LP NG GT++DSGTTLAY P++
Sbjct: 258 DSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV--RNG-GTIVDSGTTLAYFPKV 314
Query: 316 VYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLF 375
+YD L+ +LA+QP +K+++VEE + CF ++ NVD FP V FEDS+ LTVYPHDYLF
Sbjct: 315 LYDSLIETILARQP-VKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLF 373
Query: 376 NYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSIKV 435
+ + Y C GWQ T ++ LLGD VLSNKLVVYDL+N IGW D+NCSSSIK+
Sbjct: 374 TLEEELY-CFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKI 432
Query: 436 KDEKTGIVHTVGA 448
KD +G V++VGA
Sbjct: 433 KD-GSGGVYSVGA 444
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 332 bits (852), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 173/415 (41%), Positives = 246/415 (59%), Gaps = 15/415 (3%)
Query: 31 RQASLTGIKAHDSSRRGRILSA----VDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQV 86
+ L+ +KA D +R GR+L + +DF + G P V GLY+TK+ LG+P +D+YVQV
Sbjct: 39 HEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQV 98
Query: 87 DTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCK 146
DTGSD+LWV+C C CP+ S + I L +DP S T+ +SC CS + GC
Sbjct: 99 DTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCS 158
Query: 147 AENP-CPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASS 205
+N C Y+ YGDGS T+G+YV D L F+ + G+ + ++FGC +Q+G L
Sbjct: 159 VQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLV-K 217
Query: 206 SEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD-TNVGGGIFSIGEVVEPKVKTTP 264
S+ A+DGI GFGQ SV+SQLA+ G ++FSHCL N GGGI +GE+VEP + TP
Sbjct: 218 SDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVFTP 277
Query: 265 LVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKV 324
LVP+ HYNV L +I V+G L + F + NG+GT+ID+GTTLAYL Y + +
Sbjct: 278 LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAI 337
Query: 325 LAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLF---NYKGDS 381
+ +V + C+ T +V FP V L+F S+ + P DYL N G +
Sbjct: 338 TNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTA 397
Query: 382 YWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSIKVK 436
WCIG+Q+ ++ +T+LGD VL +K+ VYDL IGW +Y+CS+S+ V
Sbjct: 398 VWCIGFQRIQNQ-----GITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNVS 447
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 328 bits (842), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 173/436 (39%), Positives = 249/436 (57%), Gaps = 26/436 (5%)
Query: 20 ANANLVFPVQR-----RQASLTGIKAHDSSRRGRIL----------SAVDFNLGGNGLPN 64
A + P+QR L+ ++A D R RIL VDF + G+ P
Sbjct: 36 AGPTKILPLQRAFPLDELVELSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPY 95
Query: 65 VTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTS 124
+ GLYFTK+ LGSP ++ VQ+DTGSDILWV C C+ CP S +GI L +D S T+
Sbjct: 96 LVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTA 155
Query: 125 EFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTA 184
V+C CSS ++ C N C YS YGDGS T+GYY+ D F+ + G A
Sbjct: 156 GSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA 215
Query: 185 TQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTN 244
++ I+FGC QSG L + S++A+DGI GFG+ SV+SQL++ G +FSHCL +
Sbjct: 216 NSSAPIVFGCSTYQSGDL-TKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 245 -VGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVI 303
GGG+F +GE++ P + +PLVP+ HYN+ L +I V+G +L L + F++ N +GT++
Sbjct: 275 GSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIV 334
Query: 304 DSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDS 363
D+GTTL YL + YD ++ + +L ++ C+ + ++ FP V L+F
Sbjct: 335 DTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGG 394
Query: 364 LSLTVYPHDYLFN---YKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENM 420
S+ + P DYLF+ Y G S WCIG+QK+ E T+LGD VL +K+ VYDL
Sbjct: 395 ASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE------QTILGDLVLKDKVFVYDLARQ 448
Query: 421 TIGWTDYNCSSSIKVK 436
IGW Y+CS S+ V
Sbjct: 449 RIGWASYDCSMSVNVS 464
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 323 bits (827), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 249/441 (56%), Gaps = 31/441 (7%)
Query: 20 ANANLVFPVQR-----RQASLTGIKAHDSSRRGRIL----------SAVDFNLGGNGLPN 64
A + P+QR L+ ++A D R RIL VDF + G+ P
Sbjct: 36 AGPTKILPLQRAFPLDELVELSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPY 95
Query: 65 VTG-----LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPK 119
+ G LYFTK+ LGSP ++ VQ+DTGSDILWV C C+ CP S +GI L +D
Sbjct: 96 LVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAP 155
Query: 120 RSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNG 179
S T+ V+C CSS ++ C N C YS YGDGS T+GYY+ D F+ + G
Sbjct: 156 GSLTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILG 215
Query: 180 NLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSH 239
A ++ I+FGC QSG L + S++A+DGI GFG+ SV+SQL++ G +FSH
Sbjct: 216 ESLVANSSAPIVFGCSTYQSGDL-TKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSH 274
Query: 240 CLDTN-VGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENG 298
CL + GGG+F +GE++ P + +PLVP+ HYN+ L +I V+G +L L + F++ N
Sbjct: 275 CLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT 334
Query: 299 KGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKL 358
+GT++D+GTTL YL + YD ++ + +L ++ C+ + ++ FP V L
Sbjct: 335 RGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSL 394
Query: 359 HFEDSLSLTVYPHDYLFN---YKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVY 415
+F S+ + P DYLF+ Y G S WCIG+QK+ E T+LGD VL +K+ VY
Sbjct: 395 NFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE------QTILGDLVLKDKVFVY 448
Query: 416 DLENMTIGWTDYNCSSSIKVK 436
DL IGW Y+CS S+ V
Sbjct: 449 DLARQRIGWASYDCSMSVNVS 469
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 314 bits (804), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 172/416 (41%), Positives = 244/416 (58%), Gaps = 17/416 (4%)
Query: 35 LTGIKAHDSSRRGRILSA-----VDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTG 89
LT ++A DS+R GR+L + V+F + G P + GLY+TK+ LG+P +++ VQ+DTG
Sbjct: 45 LTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTG 104
Query: 90 SDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAEN 149
SD+LWV+C C CP+ S++ I L+ +DP S ++ VSC C S ++ GC N
Sbjct: 105 SDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTES-GCSPNN 163
Query: 150 PCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEA 209
C YS YGDGS T+GYY+ D+++F+ V + ++ +FGC QSG L A
Sbjct: 164 LCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDL-QRPRRA 222
Query: 210 LDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTN-VGGGIFSIGEVVEPKVKTTPLVPN 268
+DGI G GQ + SV+SQLA G ++FSHCL + GGGI +G++ P TPLVP+
Sbjct: 223 VDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVYTPLVPS 282
Query: 269 MAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQ 328
HYNV L++I V+G IL + F G GT+ID+GTTLAYLP Y + V
Sbjct: 283 QPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAV 342
Query: 329 PRLKVYLVEEQYSCFQYT-GNVDSGFPIVKLHFEDSLSLTVYPHDYL--FNYKGDSYWCI 385
+ + E Y CF+ T G+VD FP V L F S+ + P YL F+ G S WCI
Sbjct: 343 SQYGRPITYESYQCFEITAGDVDV-FPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCI 401
Query: 386 GWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSIKVKDEKTG 441
G+Q+ + + +T+LGD VL +K+VVYDL IGW +Y+CS + V + G
Sbjct: 402 GFQRMSH-----RRITILGDLVLKDKVVVYDLVRQRIGWAEYDCSLEVNVSASRGG 452
>AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:17299264-17302718 FORWARD LENGTH=631
Length = 631
Score = 153 bits (387), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 126/425 (29%), Positives = 202/425 (47%), Gaps = 55/425 (12%)
Query: 24 LVFPVQRRQASLTGIKAHDSSRRGRI----LSAVDFNLGGNGLPNVTGLYFTKIGLGSPS 79
++FP+ +SL + RR R+ L L + L N G Y T++ +G+P
Sbjct: 31 MIFPLS--YSSLPPRPRVEDFRRRRLHQSQLPNAHMKLYDDLLSN--GYYTTRLWIGTPP 86
Query: 80 KDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYE 139
+++ + VDTGS + +V C C +C + D + P+ S + + + C + C+ E
Sbjct: 87 QEFALIVDTGSTVTYVPCSTCKQCGKHQD-----PKFQPELSTSYQALKCNPD-CNCDDE 140
Query: 140 GRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQS 199
G++ C Y Y + S+++G +D ++F GN + +FGC ++
Sbjct: 141 GKL--------CVYERRYAEMSSSSGVLSEDLISF----GN-ESQLSPQRAVFGCENEET 187
Query: 200 GTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-DTNVGGGIFSIGEVVEP 258
G L S + DGI+G G+ SV+ QL G ++ +FS C VGGG +G++ P
Sbjct: 188 GDLFS---QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPP 244
Query: 259 K----VKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGK-GTVIDSGTTLAYLP 313
+ P +YN+ LK + V G L+L F NGK GTV+DSGTT AY P
Sbjct: 245 PGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVF---NGKHGTVLDSGTTYAYFP 299
Query: 314 RIVYDQLMSKVLAKQPRLK-VYLVEEQYS--CFQYTGN----VDSGFPIVKLHFEDSLSL 366
+ + + V+ + P LK ++ + Y CF G + + FP + + F + L
Sbjct: 300 KEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKL 359
Query: 367 TVYPHDYLFNY-KGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWT 425
+ P +YLF + K +C+G T TLLG V+ N LV YD EN +G+
Sbjct: 360 ILSPENYLFRHTKVRGAYCLGIFPDRDST------TLLGGIVVRNTLVTYDRENDKLGFL 413
Query: 426 DYNCS 430
NCS
Sbjct: 414 KTNCS 418
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 144 bits (362), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/380 (31%), Positives = 178/380 (46%), Gaps = 45/380 (11%)
Query: 65 VTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTS 124
+ G Y T++ +G+P + + + VD+GS + +V C +C +C + D + P+ S T
Sbjct: 89 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQD-----PKFQPEMSSTY 143
Query: 125 EFVSCEHNF-CSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHT 183
+ V C + C E C Y Y + S++ G +D ++F GN
Sbjct: 144 QPVKCNMDCNCDDDREQ----------CVYEREYAEHSSSKGVLGEDLISF----GNESQ 189
Query: 184 ATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-D 242
T + +FGC ++G L S + DGIIG GQ + S++ QL G + F C
Sbjct: 190 LTPQRA-VFGCETVETGDLYS---QRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGG 245
Query: 243 TNVGGGIFSIGEVVEPK--VKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKG 300
+VGGG +G P V T +YN+ L I V G L L S FD E+ G
Sbjct: 246 MDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEH--G 303
Query: 301 TVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEE---QYSCFQ-----YTGNVDSG 352
V+DSGTT AYLP + V+ + LK + + +CFQ Y +
Sbjct: 304 AVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKI 363
Query: 353 FPIVKLHFEDSLSLTVYPHDYLFNY-KGDSYWCIGWQKSASETKNGKD-MTLLGDFVLSN 410
FP V++ F+ S + P +Y+F + K +C+G NGKD TLLG V+ N
Sbjct: 364 FPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLG------VFPNGKDHTTLLGGIVVRN 417
Query: 411 KLVVYDLENMTIGWTDYNCS 430
LVVYD EN +G+ NCS
Sbjct: 418 TLVVYDRENSKVGFWRTNCS 437
>AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:14665728-14669135 REVERSE LENGTH=430
Length = 430
Score = 135 bits (340), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 114/451 (25%), Positives = 193/451 (42%), Gaps = 99/451 (21%)
Query: 25 VFPVQR-----RQASLTGIKAHDSSRRGRILSA-----VDFNLGGNGLPNVTGLYFTKIG 74
V P++R + LT + DS+R GR+L + ++ + + ++ LY+T +
Sbjct: 24 VLPLKRMIPPSHELDLTQLMTFDSARHGRLLQSPVHGSFNWKVERDTSILLSALYYTTVQ 83
Query: 75 LGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFC 134
+G+P ++ V +DTGSD++WV+C C CP + +T +DP S ++ ++C C
Sbjct: 84 IGTPPRELDVVIDTGSDLVWVSCNSCVGCPLHN-----VTFFDPGASSSAVKLACSDKRC 138
Query: 135 SSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFN----------RVNGNLHTA 184
SS + + C C Y + YGDGS T+GYY+ D ++F+ R N H
Sbjct: 139 SSDLQKKS-RCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMSDWTYIAFRDNSTWHPW 197
Query: 185 TQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTN 244
+ +II A S ++ S + L FSH +
Sbjct: 198 VRQGAIIGTFPALCSTPCSTVSSQPL---------------------YYNPQFSHMMTVA 236
Query: 245 VGGGIFSIGEVVEPKVKTTPLVPNMAH-YNVILKNIEVDGDILQLPSDTFDSENGKGTVI 303
V + ++ P V ++A Y I I+ ++ P + +D
Sbjct: 237 VN----------DLRLPIDPSVFSVAKGYGTI---IDSGTTLVHFPGEAYD--------- 274
Query: 304 DSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSG------FPIVK 357
P I Q + V+++ R Y E + CF T + S FP V
Sbjct: 275 ---------PLI---QAILNVVSQYGRPIPY---ESFQCFNITSGISSHLVIADMFPEVH 319
Query: 358 LHFEDSLSLTVYPHDYLFNYKGD---SYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVV 414
L F S+ + P YLF D + WC+G+ S S + +T++G+ + +K+ V
Sbjct: 320 LGFAGGASMVIKPEAYLFQKFLDLTNAIWCLGFYSSTS-----RRITIIGEVAIRDKMFV 374
Query: 415 YDLENMTIGWTDYNCSSSIKVKDEKTGIVHT 445
YDL++ IGW +YNCS + + I +T
Sbjct: 375 YDLDHQRIGWAEYNCSLDVTRAQQNKDITNT 405
>AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108781-16110679 REVERSE LENGTH=425
Length = 425
Score = 135 bits (339), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 116/413 (28%), Positives = 185/413 (44%), Gaps = 49/413 (11%)
Query: 43 SSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNC-VECT 101
S R R +S+V F + GN P G Y I +G P + YY+ +DTGSD+ W+ C C
Sbjct: 36 SDRFTRAVSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCV 93
Query: 102 RCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGS 161
RC LY P +S+ + C C + + C+ C Y + Y DG
Sbjct: 94 RCLEAPH-----PLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGG 144
Query: 162 ATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANS 221
++ G V+D + N G T + GCG Q +SS LDG++G G+
Sbjct: 145 SSLGVLVRDVFSMNYTQGLRLTP----RLALGCGYDQ--IPGASSHHPLDGVLGLGRGKV 198
Query: 222 SVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVV--EPKVKTTPLVPNMA-HYNVILKN 278
S+LSQL + G VK + HCL +++GGGI G+ + +V TP+ + HY+ +
Sbjct: 199 SILSQLHSQGYVKNVIGHCL-SSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAM-- 255
Query: 279 IEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQ---LMSKVLAKQPRLK--- 332
G L T +N TV DSG++ Y Y L+ + L+ +P +
Sbjct: 256 ----GGELLFGGRTTGLKN-LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARD 310
Query: 333 ---VYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLS----LTVYPHDYL-FNYKGDSYWC 384
+ L + F V F + L F+ + P YL + KG+ C
Sbjct: 311 DHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNV--C 368
Query: 385 IGWQKSASETKNG-KDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSIKVK 436
+G + T+ G +++ L+GD + +++++YD E +IGW +C +K
Sbjct: 369 LG---ILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDELASLK 418
>AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:7713488-7716269 FORWARD LENGTH=513
Length = 513
Score = 133 bits (335), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 121/412 (29%), Positives = 183/412 (44%), Gaps = 47/412 (11%)
Query: 38 IKAH-DSSRRGRILSAVDFNL----GGNGLPNVTGL---YFTKIGLGSPSKDYYVQVDTG 89
+ AH D RGR L+ D +L GN V L ++ + +G+PS + V +DTG
Sbjct: 65 VMAHRDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTG 124
Query: 90 SDILWVNCVECTRCPRKSDI----GIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGC 145
SD+ W+ C +CT C R+ + L +Y P S TS V C C+ R C
Sbjct: 125 SDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCT-----RGDRC 178
Query: 146 KA-ENPCPYSISY-GDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLA 203
+ E+ CPY I Y +G+++TG V+D L V+ + + + + FGCG Q+G
Sbjct: 179 ASPESDCPYQIRYLSNGTSSTGVLVEDVLHL--VSNDKSSKAIPARVTFGCGQVQTGVFH 236
Query: 204 SSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPKVKTT 263
+ A +G+ G G + SV S LA G FS C N G G S G+ + T
Sbjct: 237 DGA--APNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG-NDGAGRISFGDKGSVDQRET 293
Query: 264 PLVPNMAH--YNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLM 321
PL H YN+ + I V G+ L D V DSGT+ YL Y +
Sbjct: 294 PLNIRQPHPTYNITVTKISVGGNTGDLEFD---------AVFDSGTSFTYLTDAAYTLIS 344
Query: 322 SKV--LAKQPRLKVYLVEEQYS-CFQYTGNVDS-GFPIVKLHFEDSLSLTVYPHDYLFNY 377
LA R + E + C+ + N DS +P V L + S VY +
Sbjct: 345 ESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPM 404
Query: 378 KGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNC 429
K +C+ K +D++++G ++ VV+D E + +GW + +C
Sbjct: 405 KDTDVYCLAIMKI-------EDISIIGQNFMTGYRVVFDREKLILGWKESDC 449
>AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18150638-18153186 FORWARD LENGTH=583
Length = 583
Score = 126 bits (317), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 123/443 (27%), Positives = 196/443 (44%), Gaps = 58/443 (13%)
Query: 14 EIFCVVANANLVFPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKI 73
E F + LV PV+ T + DSS F +GGN P+ GLY+T+I
Sbjct: 157 ENFVESMDLELVNPVKVNDVLSTSAGSIDSS-------TTIFPVGGNVYPD--GLYYTRI 207
Query: 74 GLGSPS--KDYYVQVDTGSDILWVNC-VECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCE 130
+G P + Y++ +DTGS++ W+ C CT C + ++ LY P++ V
Sbjct: 208 LVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN-----QLYKPRKDN---LVRSS 259
Query: 131 HNFCSSTYEGRIL-GCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSS 189
FC ++ C+ + C Y I Y D S + G +D NG+L S
Sbjct: 260 EAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSL----AESD 315
Query: 190 IIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGG-G 248
I+FGCG Q G L ++ + DGI+G +A S+ SQLA+ G + + HCL +++ G G
Sbjct: 316 IVFGCGYDQQGLLLNTLLKT-DGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEG 374
Query: 249 IFSIGEVVEPKVKTT--PLVPN--MAHYNVILKNIEVDGDILQLPSDTFDSENGK-GTVI 303
+G + P T P++ + + Y + + + +L L D ENG+ G V+
Sbjct: 375 YIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSL-----DGENGRVGKVL 429
Query: 304 -DSGTTLAYLPRIVYDQLMSKVLA----------KQPRLKVYLVEEQYSCFQYTGNVDSG 352
D+G++ Y P Y QL++ + L + + F +V
Sbjct: 430 FDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKF 489
Query: 353 FPIVKLHFED-----SLSLTVYPHDYL-FNYKGDSYWCIGWQKSASETKNGKDMTLLGDF 406
F + L S L + P DYL + KG+ C+G +S +G + +LGD
Sbjct: 490 FRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNV--CLGILDGSS-VHDGSTI-ILGDI 545
Query: 407 VLSNKLVVYDLENMTIGWTDYNC 429
+ L+VYD IGW +C
Sbjct: 546 SMRGHLIVYDNVKRRIGWMKSDC 568
>AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3150843-3153380 FORWARD LENGTH=528
Length = 528
Score = 119 bits (297), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 181/377 (48%), Gaps = 37/377 (9%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWV--NCVEC---TRCPRKSDIGIGLTLYDPKRSK 122
L++T I +G+PS + V +DTGS++LW+ NCV+C T S L Y+P S
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSS 158
Query: 123 TSEFVSCEHNFCSSTYEGRILGCKA-ENPCPYSISYGDG-SATTGYYVQD--YLTFNRVN 178
TS+ C H C S + C++ + CPY+++Y G ++++G V+D +LT+N N
Sbjct: 159 TSKVFLCSHKLCDSASD-----CESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNN 213
Query: 179 GNLH-TATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIF 237
++ +++ + ++ GCG QSG A DG++G G A SV S L+ +G ++ F
Sbjct: 214 RLMNGSSSVKARVVIGCGKKQSGDYLDGV--APDGLMGLGPAEISVPSFLSKAGLMRNSF 271
Query: 238 SHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPS----DTF 293
S C D G I+ G++ ++TP + L N + G I+ + + ++
Sbjct: 272 SLCFDEEDSGRIY-FGDMGPSIQQSTPFLQ--------LDNNKYSGYIVGVEACCIGNSC 322
Query: 294 DSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGF 353
+ T IDSG + YLP +Y ++ ++ + E + Y + +
Sbjct: 323 LKQTSFTTFIDSGQSFTYLPEEIYRKVALEI-DRHINATSKNFEGVSWEYCYESSAEPKV 381
Query: 354 PIVKLHFEDSLSLTVYPHDYLFNY-KGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKL 412
P +KL F + + ++ ++F +G +C+ S E + +G +
Sbjct: 382 PAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQE-----GIGSIGQNYMRGYR 436
Query: 413 VVYDLENMTIGWTDYNC 429
+V+D ENM +GW+ C
Sbjct: 437 MVFDRENMKLGWSPSKC 453
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 118 bits (296), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 113/440 (25%), Positives = 201/440 (45%), Gaps = 59/440 (13%)
Query: 21 NANLVFPVQRRQ----ASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLG 76
N N V Q++ + T + + + G++++ ++ +G+ +G YF + +G
Sbjct: 123 NQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVATLE-----SGMTLGSGEYFMDVLVG 177
Query: 77 SPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCS- 135
SP K + + +DTGSD+ W+ C+ C C +++ YDPK S + + ++C C+
Sbjct: 178 SPPKHFSLILDTGSDLNWIQCLPCYDCFQQNG-----AFYDPKASASYKNITCNDQRCNL 232
Query: 136 STYEGRILGCKAEN-PCPYSISYGDGSATTGYYVQDYLTFN-RVNGNLHTATQNSSIIFG 193
+ + CK++N CPY YGD S TTG + + T N NG +++FG
Sbjct: 233 VSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFG 292
Query: 194 CGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-----DTNVGGG 248
CG G ++ G + SS L L FS+CL DTNV
Sbjct: 293 CGHWNRGLFHGAAGLLGLGRG--PLSFSSQLQSLYGHS-----FSYCLVDRNSDTNVSSK 345
Query: 249 IFSIGE----VVEPKVKTTPLVPNMAH-----YNVILKNIEVDGDILQLPSDTFD--SEN 297
+ GE + P + T V + Y V +K+I V G++L +P +T++ S+
Sbjct: 346 LI-FGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDG 404
Query: 298 GKGTVIDSGTTLAYLPRIVYD----QLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGF 353
GT+IDSGTTL+Y Y+ ++ K K P + + + + CF +G +
Sbjct: 405 AGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILD--PCFNVSGIHNVQL 462
Query: 354 PIVKLHFEDSLSLTVYPHDYLFNYKGDSYWC---IGWQKSASETKNGKDMTLLGDFVLSN 410
P + + F D ++ +P + F + + C +G KSA +++G++ N
Sbjct: 463 PELGIAFADG-AVWNFPTENSFIWLNEDLVCLAMLGTPKSA--------FSIIGNYQQQN 513
Query: 411 KLVVYDLENMTIGWTDYNCS 430
++YD + +G+ C+
Sbjct: 514 FHILYDTKRSRLGYAPTKCA 533
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 118 bits (295), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 109/406 (26%), Positives = 181/406 (44%), Gaps = 45/406 (11%)
Query: 44 SRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRC 103
SR+ V +LG +G+ T YFT+I +G+P+K + V VDTGS++ WVNC R
Sbjct: 82 SRKRNSTVGVKMDLG-SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG 140
Query: 104 PRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEG--RILGCKAEN-PCPYSISYGDG 160
+ + SK+ + V C C + C + PC Y Y DG
Sbjct: 141 KDNRRV------FRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADG 194
Query: 161 SATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQAN 220
SA G + ++ +T NG + + + GC ++ +G S + DG++G ++
Sbjct: 195 SAAQGVFAKETITVGLTNGRMARLPGH---LIGCSSSFTG----QSFQGADGVLGLAFSD 247
Query: 221 SSVLSQLAASGKVKKIFSHCL-----DTNVGGGIFSIGEVVEPKV---KTTPL----VPN 268
S S A+ FS+CL + NV + G K +TTPL +P
Sbjct: 248 FSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLI-FGSSRSTKTAFRRTTPLDLTRIPP 304
Query: 269 MAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQL---MSKVL 325
NVI I + D+L +PS +D+ +G GT++DSGT+L L Y Q+ +++ L
Sbjct: 305 FYAINVI--GISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYL 362
Query: 326 AKQPRLKVYLVEEQYSCFQYTGNVD-SGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWC 384
+ R+K V +Y CF +T + S P + H + + YL + C
Sbjct: 363 VELKRVKPEGVPIEY-CFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVD-AAPGVKC 420
Query: 385 IGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCS 430
+G+ + + N ++G+ + N L +DL T+ + C+
Sbjct: 421 LGFVSAGTPATN-----VIGNIMQQNYLWEFDLMASTLSFAPSACT 461
>AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108928-16110670 REVERSE LENGTH=401
Length = 401
Score = 117 bits (294), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 90/294 (30%), Positives = 135/294 (45%), Gaps = 32/294 (10%)
Query: 43 SSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNC-VECT 101
S R R +S+V F + GN P G Y I +G P + YY+ +DTGSD+ W+ C C
Sbjct: 33 SDRFTRAVSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCV 90
Query: 102 RCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGS 161
RC LY P +S+ + C C + + C+ C Y + Y DG
Sbjct: 91 RCLEAPH-----PLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGG 141
Query: 162 ATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANS 221
++ G V+D + N G T + GCG Q +SS LDG++G G+
Sbjct: 142 SSLGVLVRDVFSMNYTQGLRLTP----RLALGCGYDQ--IPGASSHHPLDGVLGLGRGKV 195
Query: 222 SVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVV--EPKVKTTPLVPNMA-HYNVILKN 278
S+LSQL + G VK + HCL +++GGGI G+ + +V TP+ + HY+ +
Sbjct: 196 SILSQLHSQGYVKNVIGHCL-SSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAM-- 252
Query: 279 IEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQ---LMSKVLAKQP 329
G L T +N TV DSG++ Y Y L+ + L+ +P
Sbjct: 253 ----GGELLFGGRTTGLKN-LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKP 301
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 117 bits (293), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 165/373 (44%), Gaps = 43/373 (11%)
Query: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSE 125
+G YFT++G+G P+++ Y+ +DTGSD+ W+ C C C +++ +++P S + E
Sbjct: 145 SGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTE-----PIFEPSSSSSYE 199
Query: 126 FVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTAT 185
+SC+ C++ + C+ C Y +SYGDGS T G + + LT T
Sbjct: 200 PLSCDTPQCNAL---EVSECRNAT-CLYEVSYGDGSYTVGDFATETLTIGS------TLV 249
Query: 186 QNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL---D 242
QN ++ GCG + G ++ G ++ SQL + FS+CL D
Sbjct: 250 QNVAV--GCGHSNEGLFVGAAGLLGL-----GGGLLALPSQLNTTS-----FSYCLVDRD 297
Query: 243 TNVGGGIFSIGEVVEPKVKTTPLVPNM---AHYNVILKNIEVDGDILQLPSDTF--DSEN 297
++ + G + P PL+ N Y + L I V G++LQ+P +F D
Sbjct: 298 SDSASTV-DFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESG 356
Query: 298 GKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRL-KVYLVEEQYSCFQYTGNVDSGFPIV 356
G +IDSGT + L +Y+ L + L K V +C+ + P V
Sbjct: 357 SGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTV 416
Query: 357 KLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYD 416
HF L + +Y+ +C+ + +AS + ++G+ V +D
Sbjct: 417 AFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASS------LAIIGNVQQQGTRVTFD 470
Query: 417 LENMTIGWTDYNC 429
L N IG++ C
Sbjct: 471 LANSLIGFSSNKC 483
>AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18151161-18153186 FORWARD LENGTH=410
Length = 410
Score = 117 bits (293), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 171/385 (44%), Gaps = 41/385 (10%)
Query: 68 LYFTKIGLGSPS--KDYYVQVDTGSDILWVNC-VECTRCPRKSDIGIGLTLYDPKRSKTS 124
LY+T+I +G P + Y++ +DTGS++ W+ C CT C + ++ LY P++
Sbjct: 29 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN-----QLYKPRKDN-- 81
Query: 125 EFVSCEHNFCSSTYEGRIL-GCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHT 183
V FC ++ C+ + C Y I Y D S + G +D NG+L
Sbjct: 82 -LVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSL-- 138
Query: 184 ATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDT 243
S I+FGCG Q G L ++ + DGI+G +A S+ SQLA+ G + + HCL +
Sbjct: 139 --AESDIVFGCGYDQQGLLLNTLLKT-DGILGLSRAKISLPSQLASRGIISNVVGHCLAS 195
Query: 244 NVGG-GIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGK-GT 301
++ G G +G + P T VP + + ++V + D ENG+ G
Sbjct: 196 DLNGEGYIFMGSDLVPSHGMT-WVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGK 254
Query: 302 VI-DSGTTLAYLPRIVYDQLMSKVLA----------KQPRLKVYLVEEQYSCFQYTGNVD 350
V+ D+G++ Y P Y QL++ + L + + F +V
Sbjct: 255 VLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVK 314
Query: 351 SGFPIVKLHFED-----SLSLTVYPHDYL-FNYKGDSYWCIGWQKSASETKNGKDMTLLG 404
F + L S L + P DYL + KG+ C+G +S +G + +LG
Sbjct: 315 KFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNV--CLGILDGSS-VHDGSTI-ILG 370
Query: 405 DFVLSNKLVVYDLENMTIGWTDYNC 429
D + L+VYD IGW +C
Sbjct: 371 DISMRGHLIVYDNVKRRIGWMKSDC 395
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 117 bits (292), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/398 (27%), Positives = 169/398 (42%), Gaps = 72/398 (18%)
Query: 60 NGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPK 119
+GL +G YFT++G+G+P++ Y+ +DTGSDI+W+ C C RC +SD ++DP+
Sbjct: 133 SGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSD-----PIFDPR 187
Query: 120 RSKTSEFVSCEHNFCSSTYEGRILGCKAENP-CPYSISYGDGSATTGYYVQDYLTF--NR 176
+SKT + C C GC C Y +SYGDGS T G + + LTF NR
Sbjct: 188 KSKTYATIPCSSPHCRRLDSA---GCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNR 244
Query: 177 VNGNLHTATQNSSIIFGCGAAQSGTLA--------SSSEEALDGIIG--FGQANSSVLSQ 226
V G + GCG G + + G G F Q S L
Sbjct: 245 VKG----------VALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVD 294
Query: 227 LAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNM---AHYNVILKNIEVDG 283
+AS K + +F V + TPL+ N Y V L I V G
Sbjct: 295 RSASSKPSSV------------VFG-NAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGG 341
Query: 284 DILQLPSDT-----FDSENGKGTVIDSGTTLAYLPRIVYDQLM------SKVLAKQPRLK 332
++P T D G +IDSGT++ L R Y + +K L + P
Sbjct: 342 T--RVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFS 399
Query: 333 VYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSAS 392
++ +CF + + P V LHF + +++ +YL + +C + +
Sbjct: 400 LF-----DTCFDLSNMNEVKVPTVVLHFRGA-DVSLPATNYLIPVDTNGKFCFAF----A 449
Query: 393 ETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCS 430
T G ++++G+ VVYDL + +G+ C+
Sbjct: 450 GTMGG--LSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16993339-16995721 FORWARD LENGTH=524
Length = 524
Score = 114 bits (286), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 176/373 (47%), Gaps = 38/373 (10%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIG----LTLYDPKRSKT 123
L++T + LG+P + V +DTGSD+ WV C +C +C L++Y+PK S T
Sbjct: 106 LHYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKVSTT 164
Query: 124 SEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDG-SATTGYYVQDYLTFNRVNGNLH 182
++ V+C ++ C+ + LG + CPY +SY ++T+G ++D + + N
Sbjct: 165 NKKVTCNNSLCAQ--RNQCLGTFST--CPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPE 220
Query: 183 TATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD 242
+ + FGCG QSG+ + A +G+ G G SV S LA G V FS C
Sbjct: 221 RV--EAYVTFGCGQVQSGSFLDIA--APNGLFGLGMEKISVPSVLAREGLVADSFSMCFG 276
Query: 243 TNVGGGIFSIGEVVEPKVKTTP--LVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKG 300
+ G G S G+ + TP L P+ +YN+ + + V ++ D F +
Sbjct: 277 HD-GVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTLID---DEFTA----- 327
Query: 301 TVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYS---CFQYTGNVDSGF-PIV 356
+ D+GT+ YL +Y + S+ Q + K + + + C+ + + ++ P +
Sbjct: 328 -LFDTGTSFTYLVDPMYTTV-SESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLIPSL 385
Query: 357 KLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYD 416
L + + T+ + + +G+ +C+ KS+ ++ ++G ++ VV+D
Sbjct: 386 SLTMKGNSHFTINDPIIVISTEGELVYCLAIVKSS-------ELNIIGQNYMTGYRVVFD 438
Query: 417 LENMTIGWTDYNC 429
E + + W ++C
Sbjct: 439 REKLVLAWKKFDC 451
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 113 bits (283), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/430 (26%), Positives = 192/430 (44%), Gaps = 46/430 (10%)
Query: 26 FPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQ 85
FP + +L + H S R + + V + +G + +G YF + +G P + +
Sbjct: 42 FPSPTQALALDTRRLHFLSLRRKPIPFVKSPVV-SGAASGSGQYFVDLRIGQPPQSLLLI 100
Query: 86 VDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEG-RILG 144
DTGSD++WV C C C S T++ P+ S T C C + R
Sbjct: 101 ADTGSDLVWVKCSACRNCSHHSPA----TVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPI 156
Query: 145 C---KAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSG- 200
C + + C Y Y DGS T+G + ++ + +G + S+ FGCG SG
Sbjct: 157 CNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGK---EARLKSVAFGCGFRISGQ 213
Query: 201 TLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL--------DTN---VGGGI 249
+++ +S +G++G G+ S SQL + FS+CL T+ +G G
Sbjct: 214 SVSGTSFNGANGVMGLGRGPISFASQLGR--RFGNKFSYCLMDYTLSPPPTSYLIIGNGG 271
Query: 250 FSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQL-PS--DTFDSENGKGTVIDSG 306
I ++ + T PL P Y V LK++ V+G L++ PS + DS NG GTV+DSG
Sbjct: 272 DGISKLFFTPLLTNPLSPTF--YYVKLKSVFVNGAKLRIDPSIWEIDDSGNG-GTVVDSG 328
Query: 307 TTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNV------DSGFPIVKLHF 360
TTLA+L Y +++ V + R+K+ + + F NV + P +K F
Sbjct: 329 TTLAFLAEPAYRSVIAAV---RRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEF 385
Query: 361 EDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENM 420
P +Y + + C+ Q + + K G +++G+ + L +D +
Sbjct: 386 SGGAVFVPPPRNYFIETE-EQIQCLAIQ--SVDPKVG--FSVIGNLMQQGFLFEFDRDRS 440
Query: 421 TIGWTDYNCS 430
+G++ C+
Sbjct: 441 RLGFSRRGCA 450
>AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19060485-19063248 REVERSE LENGTH=528
Length = 528
Score = 112 bits (280), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 163/376 (43%), Gaps = 38/376 (10%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRK-SDIG----IGLTLYDPKRSK 122
LY+ + +G+P + V +DTGSD+ W+ C T C R DIG + L LY P S
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNAST 160
Query: 123 TSEFVSCEHNFCSSTYEGRILGCKAENP-CPYSISYGDGSATTGYYVQDYLTFNRVNGNL 181
TS + C C + + C + + CPY ISY + + T G +QD L + NL
Sbjct: 161 TSSSIRCSDKRCFGSKK-----CSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENL 215
Query: 182 HTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL 241
N + GCG Q+G ++ +++G++G G SV S LA + FS C
Sbjct: 216 TPVKAN--VTLGCGQKQTGLFQRNN--SVNGVLGLGIKGYSVPSLLAKANITANSFSMCF 271
Query: 242 DTNVGG-GIFSIGEVVEPKVKTTPLV---PNMAHYNVILKNIEVDGDILQLPSDTFDSEN 297
+G G S G+ + TP + P+ A Y V + + V GD + + F
Sbjct: 272 GRVIGNVGRISFGDRGYTDQEETPFISVAPSTA-YGVNISGVSVAGDPVDI--RLFAK-- 326
Query: 298 GKGTVIDSGTTLAYLPRIVYDQLMSKV--LAKQPRLKVYLVEEQYSCFQYTGNVDS-GFP 354
D+G++ +L Y L L + R V C+ + N + FP
Sbjct: 327 -----FDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPNATTIQFP 381
Query: 355 IVKLHFEDSLSLTV-YPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLV 413
+V++ F + + P +G+ +C+G KS G + ++G ++ +
Sbjct: 382 LVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSV-----GLKINVIGQNFVAGYRI 436
Query: 414 VYDLENMTIGWTDYNC 429
V+D E M +GW C
Sbjct: 437 VFDRERMILGWKQSLC 452
>AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19064294-19066560 REVERSE LENGTH=488
Length = 488
Score = 112 bits (279), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 185/417 (44%), Gaps = 44/417 (10%)
Query: 31 RQASLTGIKAHDSSRRGRILSAVDFN------LGGNGLPNVTGLYFTKIGLGSPSKDYYV 84
SL KA RGR L++ + N GN ++ L++ + +G+P++ + V
Sbjct: 45 EMGSLDYYKALVHRDRGRQLTSNNNNQTTISFAQGNSTEEISFLHYANVTIGTPAQWFLV 104
Query: 85 QVDTGSDILWVNCVECTRCPR--KSDIG--IGLTLYDPKRSKTSEFVSCEHNFCSSTYEG 140
+DTGSD+ W+ C + C R ++D G I L +Y+P +SK+S V+C C+
Sbjct: 105 ALDTGSDLFWLPCNCNSTCVRSMETDQGERIKLNIYNPSKSKSSSKVTCNSTLCA--LRN 162
Query: 141 RILGCKAENPCPYSISY-GDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQS 199
R + ++ CPY I Y GS +TG V+D + + G +++ I FGC +Q
Sbjct: 163 RCISPVSD--CPYRIRYLSPGSKSTGVLVEDVIHMSTEEGE----ARDARITFGCSESQL 216
Query: 200 GTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPK 259
G E A++GI+G A+ +V + L +G FS C N G G S G+
Sbjct: 217 GLF---KEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPN-GKGTISFGDKGSSD 272
Query: 260 VKTTPLVPNMA--HYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVY 317
TPL ++ Y+V + +V G + T D+E DSGT + +L Y
Sbjct: 273 QLETPLSGTISPMFYDVSITKFKV-GKV------TVDTE--FTATFDSGTAVTWLIEPYY 323
Query: 318 DQLMSKVLAKQP--RLKVYLVEEQYSCFQYTGNVDSG-FPIVKLHFEDSLSLTVYPHDYL 374
L + P RL + C+ T D P V + + V+ +
Sbjct: 324 TALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILV 383
Query: 375 FNYKGDSY--WCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNC 429
F+ S+ +C+ K + D +++G ++N +V+D E +GW NC
Sbjct: 384 FDTSDGSFQVYCLAVLKQVN-----ADFSIIGQNFMTNYRIVHDRERRILGWKKSNC 435
>AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:16787508-16789318 REVERSE LENGTH=405
Length = 405
Score = 109 bits (273), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 102/424 (24%), Positives = 180/424 (42%), Gaps = 54/424 (12%)
Query: 24 LVFPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYY 83
++ P+ + T IK+ SS V F L GN P G Y + +GSP K +
Sbjct: 14 VIVPLSKSSIFKTFIKSSPSS--------VVFPLSGNVFP--LGYYSVLMQIGSPPKAFQ 63
Query: 84 VQVDTGSDILWVNC-VECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRI 142
+DTGSD+ WV C C+ C ++ Y PK + C + C++ +
Sbjct: 64 FDIDTGSDLTWVQCDAPCSGCTLPPNLQ-----YKPK----GNIIPCSNPICTALHWPNK 114
Query: 143 LGCK-AENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGT 201
C + C Y + Y D ++ G V D VNG+ + FGCG QS
Sbjct: 115 PHCPNPQEQCDYEVKYADQGSSMGALVTDQFPLKLVNGSF----MQPPVAFGCGYDQSYP 170
Query: 202 LASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPK-- 259
++ A G++G G+ +L+QL ++G + + HCL + GGG G+ + P
Sbjct: 171 -SAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSK-GGGFLFFGDNLVPSIG 228
Query: 260 VKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQ 319
V TPL+ HY ++ +G L G + D+G++ Y Y
Sbjct: 229 VAWTPLLSQDNHYTTGPADLLFNGKPTGL--------KGLKLIFDTGSSYTYFNSKAYQT 280
Query: 320 LMSKV----------LAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFED---SLSL 366
+++ + +AK+ + + + + F+ V + F + ++F + + L
Sbjct: 281 IINLIGNDLKVSPLKVAKEDK-TLPICWKGAKPFKSVLEVKNFFKTITINFTNGRRNTQL 339
Query: 367 TVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTD 426
+ P YL K + C+G + SE ++ ++GD + +++YD E +GW
Sbjct: 340 YLAPELYLIVSKTGNV-CLGLL-NGSEV-GLQNSNVIGDISMQGLMMIYDNEKQQLGWVS 396
Query: 427 YNCS 430
+C+
Sbjct: 397 SDCN 400
>AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6978746-6980158 REVERSE LENGTH=470
Length = 470
Score = 109 bits (273), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 169/382 (44%), Gaps = 45/382 (11%)
Query: 60 NGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPK 119
+G+ +G YF +IG+GSP +D Y+ +D+GSD++WV C C C ++SD ++DP
Sbjct: 122 SGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSD-----PVFDPA 176
Query: 120 RSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNG 179
+S + VSC + C GC + C Y + YGDGS T G + LTF +
Sbjct: 177 KSGSYTGVSCGSSVCDRIENS---GCHSGG-CRYEVMYGDGSYTKGTLALETLTFAK--- 229
Query: 180 NLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSH 239
T +N + GCG G ++ G + S + QL SG+ F +
Sbjct: 230 ---TVVRN--VAMGCGHRNRGMFIGAAGLLGI-----GGGSMSFVGQL--SGQTGGAFGY 277
Query: 240 CL---DTNVGGGIFSIGEVVEPKVKTTPLVPN---MAHYNVILKNIEVDGDILQLPSDTF 293
CL T+ G + E + PLV N + Y V LK + V G + LP F
Sbjct: 278 CLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVF 337
Query: 294 D-SENGK-GTVIDSGTTLAYLPRIVY----DQLMSKVLAKQPRLKVYLVEEQYSCFQYTG 347
D +E G G V+D+GT + LP Y D S+ A PR + + +C+ +G
Sbjct: 338 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQT-ANLPRASGVSIFD--TCYDLSG 394
Query: 348 NVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFV 407
V P V +F + LT+ ++L +C + S + ++++G+
Sbjct: 395 FVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPT------GLSIIGNIQ 448
Query: 408 LSNKLVVYDLENMTIGWTDYNC 429
V +D N +G+ C
Sbjct: 449 QEGIQVSFDGANGFVGFGPNVC 470
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 109 bits (272), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 162/380 (42%), Gaps = 54/380 (14%)
Query: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSE 125
+G YF++IG+G+P+K+ Y+ +DTGSD+ W+ C C C ++SD +++P S T +
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSD-----PVFNPTSSSTYK 213
Query: 126 FVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTAT 185
++C CS C++ N C Y +SYGDGS T G D +TF +G ++
Sbjct: 214 SLTCSAPQCSLLETS---ACRS-NKCLYQVSYGDGSFTVGELATDTVTFGN-SGKIN--- 265
Query: 186 QNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNV 245
++ GCG G ++ G S+ +Q+ A+ FS+CL
Sbjct: 266 ---NVALGCGHDNEGLFTGAAGLLGLGGGVL-----SITNQMKATS-----FSYCLVDRD 312
Query: 246 GGGIFSI---------GEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTF--D 294
G S+ G+ P ++ + Y V L V G+ + LP F D
Sbjct: 313 SGKSSSLDFNSVQLGGGDATAPLLRNKKI---DTFYYVGLSGFSVGGEKVVLPDAIFDVD 369
Query: 295 SENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLK-----VYLVEEQYSCFQYTGNV 349
+ G ++D GT + L Y+ L L LK + L + +C+ ++
Sbjct: 370 ASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFD---TCYDFSSLS 426
Query: 350 DSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLS 409
P V HF SL + +YL +C + ++S ++++G+
Sbjct: 427 TVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSS------LSIIGNVQQQ 480
Query: 410 NKLVVYDLENMTIGWTDYNC 429
+ YDL IG + C
Sbjct: 481 GTRITYDLSKNVIGLSGNKC 500
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 108 bits (269), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 157/374 (41%), Gaps = 37/374 (9%)
Query: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTR-CPRKSDIGIGLTLYDPKRSKTS 124
+G Y +GLG+P D + DTGSD+ W C C R C + + +++P +S +
Sbjct: 129 SGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKE-----PIFNPSKSTSY 183
Query: 125 EFVSCEHNFCS--STYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLH 182
VSC C S+ G C A N C Y I YGD S + G+ ++ T N ++
Sbjct: 184 YNVSCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKEKFTL--TNSDVF 240
Query: 183 TATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD 242
+ FGCG G + G++G G+ S SQ A + KIFS+CL
Sbjct: 241 DG-----VYFGCGENNQGLFT-----GVAGLLGLGRDKLSFPSQTATA--YNKIFSYCLP 288
Query: 243 TNVG-GGIFSIGEV-VEPKVKTTP---LVPNMAHYNVILKNIEVDGDILQLPSDTFDSEN 297
++ G + G + VK TP + + Y + + I V G L +PS F +
Sbjct: 289 SSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP- 347
Query: 298 GKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYL-VEEQYSCFQYTGNVDSGFPIV 356
G +IDSGT + LP Y L S AK + V +CF +G P V
Sbjct: 348 --GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKV 405
Query: 357 KLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYD 416
F + + + +K S C+ + ++ ++ + + G+ VVYD
Sbjct: 406 AFSFSGGAVVELGSKGIFYVFK-ISQVCLAFAGNSDDS----NAAIFGNVQQQTLEVVYD 460
Query: 417 LENMTIGWTDYNCS 430
+G+ CS
Sbjct: 461 GAGGRVGFAPNGCS 474
>AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114946-29117150 REVERSE LENGTH=432
Length = 432
Score = 106 bits (264), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 106/397 (26%), Positives = 160/397 (40%), Gaps = 40/397 (10%)
Query: 48 RILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNC-VECTRC--P 104
R+ S V F + GN P G Y+ + +G+P K + + +DTGSD+ WV C C C P
Sbjct: 48 RLSSTVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKP 105
Query: 105 RKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCK-AENPCPYSISYGDGSAT 163
R Y P + + C H CS + C E+ C Y I Y D +++
Sbjct: 106 RAKQ-------YKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASS 154
Query: 164 TGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSV 223
G V D + NG++ N + FGCG Q GI+G G+ +
Sbjct: 155 IGALVTDEVPLKLANGSI----MNLRLTFGCGYDQQNPGPHPPPPTA-GILGLGRGKVGL 209
Query: 224 LSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPK--VKTTPLVPNMAHYNVILKNIEV 281
+QL + G K + HCL ++ G G SIG+ + P V T L N N + E+
Sbjct: 210 STQLKSLGITKNVIVHCL-SHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAEL 268
Query: 282 DGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYD---QLMSKVLAKQPRLKVYLVEE 338
L +D G V DSG++ Y Y L+ K L +P +
Sbjct: 269 ------LFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKS 322
Query: 339 QYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQ-KSASETKNG 397
C++ + S + K +L + LF +SY I + + NG
Sbjct: 323 LPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNG 382
Query: 398 KDMTL-----LGDFVLSNKLVVYDLENMTIGWTDYNC 429
++ L +GD +V+YD E IGW +C
Sbjct: 383 TEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDC 419
>AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114705-29117150 REVERSE LENGTH=466
Length = 466
Score = 106 bits (264), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 106/397 (26%), Positives = 160/397 (40%), Gaps = 40/397 (10%)
Query: 48 RILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNC-VECTRC--P 104
R+ S V F + GN P G Y+ + +G+P K + + +DTGSD+ WV C C C P
Sbjct: 48 RLSSTVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKP 105
Query: 105 RKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCK-AENPCPYSISYGDGSAT 163
R Y P + + C H CS + C E+ C Y I Y D +++
Sbjct: 106 RAKQ-------YKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASS 154
Query: 164 TGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSV 223
G V D + NG++ N + FGCG Q GI+G G+ +
Sbjct: 155 IGALVTDEVPLKLANGSI----MNLRLTFGCGYDQQNPGPHPPPPTA-GILGLGRGKVGL 209
Query: 224 LSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPK--VKTTPLVPNMAHYNVILKNIEV 281
+QL + G K + HCL ++ G G SIG+ + P V T L N N + E+
Sbjct: 210 STQLKSLGITKNVIVHCL-SHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAEL 268
Query: 282 DGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYD---QLMSKVLAKQPRLKVYLVEE 338
L +D G V DSG++ Y Y L+ K L +P +
Sbjct: 269 ------LFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKS 322
Query: 339 QYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQ-KSASETKNG 397
C++ + S + K +L + LF +SY I + + NG
Sbjct: 323 LPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNG 382
Query: 398 KDMTL-----LGDFVLSNKLVVYDLENMTIGWTDYNC 429
++ L +GD +V+YD E IGW +C
Sbjct: 383 TEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDC 419
>AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:22880074-22881525 REVERSE LENGTH=483
Length = 483
Score = 104 bits (260), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 157/385 (40%), Gaps = 42/385 (10%)
Query: 60 NGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPK 119
+GL +G YF ++G+G+P+ + Y+ +DTGSD++W+ C C C ++D ++DPK
Sbjct: 126 SGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTD-----AIFDPK 180
Query: 120 RSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNG 179
+SKT V C C + + C Y +SYGDGS T G + + LTF+
Sbjct: 181 KSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFH---- 236
Query: 180 NLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSH 239
+ + GCG G ++ G+ S SQ + FS+
Sbjct: 237 ----GARVDHVPLGCGHDNEGLFVGAAGLLGL-----GRGGLSFPSQ--TKNRYNGKFSY 285
Query: 240 CL-------DTNVGGGIFSIGEVVEPKVKT-TPLVPNM---AHYNVILKNIEVDGDILQL 288
CL ++ G PK TPL+ N Y + L I V G +
Sbjct: 286 CLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 345
Query: 289 PSDT---FDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQY-SCFQ 344
S++ D+ G +IDSGT++ L + Y L +LK + +CF
Sbjct: 346 VSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFD 405
Query: 345 YTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLG 404
+G P V HF +++ +YL + +C + + ++++G
Sbjct: 406 LSGMTTVKVPTVVFHFGGG-EVSLPASNYLIPVNTEGRFCFAFAGTMGS------LSIIG 458
Query: 405 DFVLSNKLVVYDLENMTIGWTDYNC 429
+ V YDL +G+ C
Sbjct: 459 NIQQQGFRVAYDLVGSRVGFLSRAC 483
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 103 bits (256), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 185/417 (44%), Gaps = 57/417 (13%)
Query: 43 SSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTR 102
S R LS D +GL G +F I +G+P + DTGSD+ WV C C +
Sbjct: 62 SRRFNHQLSQTDLQ---SGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQ 118
Query: 103 CPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGC-KAENPCPYSISYGDGS 161
C +++ ++D K+S T + C+ C + GC ++ N C Y SYGD S
Sbjct: 119 CYKENG-----PIFDKKKSSTYKSEPCDSRNCQALSSTE-RGCDESNNICKYRYSYGDQS 172
Query: 162 ATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANS 221
+ G + ++ + +G + +FGCG GT +E GIIG G +
Sbjct: 173 FSKGDVATETVSIDSASG---SPVSFPGTVFGCGYNNGGTF----DETGSGIIGLGGGHL 225
Query: 222 SVLSQLAASGKVKKIFSHCLD----TNVGGGIFSIGEVVEPK-------VKTTPLVPN-- 268
S++SQL +S + K FS+CL T G + ++G P V +TPLV
Sbjct: 226 SLISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEP 283
Query: 269 MAHYNVILKNIEVDGDILQLPSDTFD-------SENGKGTVIDSGTTLAYLPRIVYDQLM 321
+ +Y + L+ I V + +++ SE +IDSGTTL L +D+
Sbjct: 284 LTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFS 343
Query: 322 SKVL-----AKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFN 376
S V AK+ L+ CF+ +G+ + G P + +HF + + + P +
Sbjct: 344 SAVEESVTGAKRVSDPQGLLSH---CFK-SGSAEIGLPEITVHFTGA-DVRLSPINAFVK 398
Query: 377 YKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSI 433
D C+ + ++ + G+F + LV YDLE T+ + +CS+++
Sbjct: 399 LSED-MVCLSMVPTT-------EVAIYGNFAQMDFLVGYDLETRTVSFQHMDCSANL 447
>AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14685602-14686885 FORWARD LENGTH=427
Length = 427
Score = 102 bits (255), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 158/357 (44%), Gaps = 47/357 (13%)
Query: 31 RQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTGS 90
++AS+ ++ + G I++ + N+ P + + I +GSP + +DT S
Sbjct: 52 KEASVERLEYLKAKTTGDIIAHLSPNV-----PIIPQAFLVNISIGSPPITQLLHMDTAS 106
Query: 91 DILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAE-N 149
D+LW+ C+ C C +S L ++DP RS T +C +S Y L A
Sbjct: 107 DLLWIQCLPCINCYAQS-----LPIFDPSRSYTHRNETCR----TSQYSMPSLKFNANTR 157
Query: 150 PCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEA 209
C YS+ Y D + + G ++ L FN + +A + ++FGCG G +
Sbjct: 158 SCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALH-DVVFGCGHDNYGEPLVGT--- 213
Query: 210 LDGIIGFGQANSSVLSQLAASGKVKKIFSHCL----DTNVGGGIFSIGEVVEPKV-KTTP 264
GI+G G S++ + K FS+C D + + +G+ + TTP
Sbjct: 214 --GILGLGYGEFSLVHRFG------KKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTP 265
Query: 265 LVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGK---GTVIDSGTTLAYLPRIVYDQLM 321
L + Y V ++ I VDG IL + F+ + GT+ID+G +L L Y L
Sbjct: 266 LEIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLK 325
Query: 322 SKVL-AKQPRLKVYLVEE----QYSCFQYTGN-----VDSGFPIVKLHFEDSLSLTV 368
+++ + R V + + C Y GN V+SGFPIV HF + L++
Sbjct: 326 NRIEDIFEGRFTAADVSQDDMIKMEC--YNGNFERDLVESGFPIVTFHFSEGAELSL 380
>AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:10185229-10186605 REVERSE LENGTH=458
Length = 458
Score = 101 bits (252), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 152/378 (40%), Gaps = 36/378 (9%)
Query: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSE 125
T L+ +G P +DTGS +LW+ C C C I +++P S T
Sbjct: 93 TSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIH---PVFNPALSSTFV 149
Query: 126 FVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTAT 185
SC+ FC G C + N C Y Y G+ + G ++ LTF NGN T
Sbjct: 150 ECSCDDRFCRYAPNGH---CGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNT-VVT 205
Query: 186 QNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL---- 241
Q I FGCG L E GI+G G +S+ QL + FS+C+
Sbjct: 206 Q--PIAFGCGYENGEQL----ESHFTGILGLGAKPTSLAVQLGSK------FSYCIGDLA 253
Query: 242 DTNVGGGIFSIGEVVEPKVKTTPLVPNMAH--YNVILKNIEVDGDILQLPSDTFDSENGK 299
+ N G +GE + TP+ + Y + L+ I V L + F +
Sbjct: 254 NKNYGYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPR 313
Query: 300 -GTVIDSGTTLAYLPRIVYDQLMSKVLA-KQPRLKVYLVEEQYSCFQYTGNVDS---GFP 354
G ++DSGT +L I Y +L +++ + P+L+ + + + C Y G V GFP
Sbjct: 314 TGVILDSGTLYTWLADIAYRELYNEIKSILDPKLERFWFRD-FLC--YHGRVSEELIGFP 370
Query: 355 IVKLHFEDSLSLTVYPHDYLFNYKGD---SYWCIGWQKSASETKNGKDMTLLGDFVLSNK 411
+V HF L + + + +C+ + + K+ T +G
Sbjct: 371 VVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYY 430
Query: 412 LVVYDLENMTIGWTDYNC 429
+ YDL+ I +C
Sbjct: 431 NIGYDLKEKNIYLQRIDC 448
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 101 bits (252), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 170/393 (43%), Gaps = 67/393 (17%)
Query: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSE 125
+G + ++ +G+P+ Y VDTGSD++W C CT C + ++DP++S +
Sbjct: 104 SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQP-----TPIFDPEKSSSYS 158
Query: 126 FVSCEHNFCSSTYEGRILGCKAE-NPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTA 184
V C C++ C + + C Y +YGD S+T G + TF N
Sbjct: 159 KVGCSSGLCNALPRSN---CNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN------ 209
Query: 185 TQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL--- 241
S I FGCG G S G++G G+ S++SQL + FS+CL
Sbjct: 210 -SISGIGFGCGVENEGDGFSQGS----GLVGLGRGPLSLISQLKETK-----FSYCLTSI 259
Query: 242 -DTNVGGGIFSIGEVVEPKV------------KTTPLVPN---MAHYNVILKNIEVDGDI 285
D+ +F IG + V KT L+ N + Y + L+ I V
Sbjct: 260 EDSEASSSLF-IGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKR 318
Query: 286 LQLPSDTFD-SENGK-GTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYL-VEEQYS- 341
L + TF+ +E+G G +IDSGTT+ YL + KVL ++ ++ L V++ S
Sbjct: 319 LSVEKSTFELAEDGTGGMIIDSGTTITYLEETAF-----KVLKEEFTSRMSLPVDDSGST 373
Query: 342 ----CFQYTGNVDS-GFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKN 396
CF+ + P + HF+ + L + +Y+ C+ S N
Sbjct: 374 GLDLCFKLPDAAKNIAVPKMIFHFKGA-DLELPGENYMVADSSTGVLCLAMGSS-----N 427
Query: 397 GKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNC 429
G M++ G+ N V++DLE T+ + C
Sbjct: 428 G--MSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 100 bits (250), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 155/371 (41%), Gaps = 42/371 (11%)
Query: 69 YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128
Y + +G+P++ V +DT +D W+ C C C L+DP +S +S +
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSS-------VLFDPSKSSSSRTLQ 140
Query: 129 CEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNS 188
CE C + C C ++++YG GS Y QD LT +
Sbjct: 141 CEAPQCK---QAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTL--------ASDVIP 188
Query: 189 SIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVG-- 246
+ FGC SGT G++G G+ S++SQ + + FS+CL +
Sbjct: 189 NYTFGCINKASGT-----SLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSN 241
Query: 247 -GGIFSIGEVVEP-KVKTTPLVPNMAH---YNVILKNIEVDGDILQLPSD--TFDSENGK 299
G +G +P ++KTTPL+ N Y V L I V I+ +P+ FD G
Sbjct: 242 FSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGA 301
Query: 300 GTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLH 359
GT+ DSGT L Y + ++ + + +C Y+G+V FP V
Sbjct: 302 GTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTC--YSGSVV--FPSVTFM 357
Query: 360 FEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLEN 419
F +++T+ P + L + + C+ +A+ + ++ N V+ D+ N
Sbjct: 358 FAG-MNVTLPPDNLLIHSSAGNLSCLAM--AAAPVNVNSVLNVIASMQQQNHRVLIDVPN 414
Query: 420 MTIGWTDYNCS 430
+G + C+
Sbjct: 415 SRLGISRETCT 425
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 99.8 bits (247), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 115/386 (29%), Positives = 167/386 (43%), Gaps = 62/386 (16%)
Query: 67 GLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEF 126
G Y I +G+P DTGSD++W C C C +++ L+DPK S T
Sbjct: 84 GEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTS-----PLFDPKESSTYRK 138
Query: 127 VSCEHNFCSSTYEGRILGCKA-ENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTAT 185
VSC + C + + C EN C Y+I+YGD S T G D +T +G +
Sbjct: 139 VSCSSSQCRALEDA---SCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGS-SGRRPVSL 194
Query: 186 QNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL---- 241
+N +I GCG +GT + A GIIG G ++S++SQL S + FS+CL
Sbjct: 195 RN--MIIGCGHENTGTF----DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFT 246
Query: 242 -------DTNVG-GGIFS-IGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDT 292
N G GI S G V VK P +Y + L+ I V +Q S
Sbjct: 247 SETGLTSKINFGTNGIVSGDGVVSTSMVKKDP----ATYYFLNLEAISVGSKKIQFTSTI 302
Query: 293 FDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSG 352
F + G VIDSGTTL LP Y +L S V + +K V++
Sbjct: 303 FGTGEGN-IVIDSGTTLTLLPSNFYYELESVVAST---IKAERVQDPDG----------- 347
Query: 353 FPIVKLHFEDSLSLTVYPHDYLFNYKG--------DSYWCIGWQKSASETKNGKDMTLLG 404
I+ L + DS S V D ++KG +++ + S + +T+ G
Sbjct: 348 --ILSLCYRDSSSFKV--PDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAANEQLTIFG 403
Query: 405 DFVLSNKLVVYDLENMTIGWTDYNCS 430
+ N LV YD + T+ + +CS
Sbjct: 404 NLAQMNFLVGYDTVSGTVSFKKTDCS 429
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 99.4 bits (246), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 105/401 (26%), Positives = 185/401 (46%), Gaps = 68/401 (16%)
Query: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSE 125
+G YF + +G+P K + + +DTGSD+ W+ C+ C C ++ + YDPK S + +
Sbjct: 157 SGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGM-----FYDPKTSASFK 211
Query: 126 FVSCEHNFCSS-TYEGRILGCKAEN-PCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHT 183
++C CS + + C+++N CPY YGD S TTG + + T NL T
Sbjct: 212 NITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTV-----NLTT 266
Query: 184 ATQNSS------IIFGCGAAQSGTLA---------------SSSEEALDG-----IIGFG 217
SS ++FGCG G + SS ++L G +
Sbjct: 267 TEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR 326
Query: 218 QANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILK 277
+N++V S+L G+ K + +H TN+ F G+ E V+T Y + +K
Sbjct: 327 NSNTNVSSKLIF-GEDKDLLNH---TNLNFTSFVNGK--ENSVET--------FYYIQIK 372
Query: 278 NIEVDGDILQLPSDTFD--SENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQ----PRL 331
+I V G L +P +T++ S+ GT+IDSGTTL+Y Y+ + +K K P
Sbjct: 373 SILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIF 432
Query: 332 KVYLVEEQYSCFQYTGNVDSG--FPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQK 389
+ + V + CF +G ++ P + + F D ++ +P + F + + C+
Sbjct: 433 RDFPVLD--PCFNVSGIEENNIHLPELGIAFVDG-TVWNFPAENSFIWLSEDLVCLAILG 489
Query: 390 SASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCS 430
+ T +++G++ N ++YD + +G+T C+
Sbjct: 490 TPKST-----FSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 525
>AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:11259872-11261209 REVERSE LENGTH=445
Length = 445
Score = 99.4 bits (246), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 115/415 (27%), Positives = 186/415 (44%), Gaps = 57/415 (13%)
Query: 44 SRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRC 103
SR R + D +GL + G YF I +G+P + DTGSD+ WV C C +C
Sbjct: 63 SRSRRFTTKTDLQ---SGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQC 119
Query: 104 PRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGC-KAENPCPYSISYGDGSA 162
+++ L+D K+S T + SC+ C + E GC ++++ C Y SYGD S
Sbjct: 120 YKQNS-----PLFDKKKSSTYKTESCDSKTCQALSEHE-EGCDESKDICKYRYSYGDNSF 173
Query: 163 TTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSS 222
T G + ++ + + ++ +FGCG GT EE GIIG G S
Sbjct: 174 TKGDVATETISIDSSS---GSSVSFPGTVFGCGYNNGGTF----EETGSGIIGLGGGPLS 226
Query: 223 VLSQLAASGKVKKIFSHCLD----TNVGGGIFSIGEVVEPK-------VKTTPLVPN--M 269
++SQL +S + K FS+CL T G + ++G P TTPL+
Sbjct: 227 LVSQLGSS--IGKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPE 284
Query: 270 AHYNVILKNIEVDGDILQLPSDTFDSENGKGT------VIDSGTTLAYLPRIVYDQLMSK 323
+Y + L+ + V L + NGK + +IDSGTTL L YD +
Sbjct: 285 TYYFLTLEAVTVGKTKLPYTGGGY-GLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTA 343
Query: 324 VL-----AKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYK 378
V AK+ L+ CF+ +G+ + G P + +HF ++ + + P +
Sbjct: 344 VEESVTGAKRVSDPQGLLTH---CFK-SGDKEIGLPAITMHFTNA-DVKLSPINAFVKLN 398
Query: 379 GDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSI 433
D+ C+ + ++ + G+ V + LV YDLE T+ + +CS ++
Sbjct: 399 EDTV-CLSMIPTT-------EVAIYGNMVQMDFLVGYDLETKTVSFQRMDCSGNL 445
>AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:18241003-18242478 FORWARD LENGTH=491
Length = 491
Score = 99.4 bits (246), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 118/466 (25%), Positives = 192/466 (41%), Gaps = 87/466 (18%)
Query: 24 LVFPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYY 83
LV + + SL K+ R + LS+VD + L V Y + +G+P +
Sbjct: 40 LVLTLTKSSVSLPTPKSQTQERIKKPLSSVDVVM--EPLREVRDGYLITLNIGTPPQAVQ 97
Query: 84 VQVDTGSDILWVNC----VECTRC--PRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSST 137
V +DTGSD+ WV C +C C + +D+ +++ P S TS SC +FC
Sbjct: 98 VYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLK-SPSVFSPLHSSTSFRDSCASSFCVEI 156
Query: 138 YEG-------RILGCKAE--------NPCP-YSISYGDGSATTGYYVQDYLTFNRVNGNL 181
+ + GC PCP ++ +YG+G +G +D L
Sbjct: 157 HSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDIL--------- 207
Query: 182 HTATQN-SSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHC 240
T++ FGC + S+ E + GI GFG+ S+ SQL G ++K FSHC
Sbjct: 208 KARTRDVPRFSFGC-------VTSTYREPI-GIAGFGRGLLSLPSQL---GFLEKGFSHC 256
Query: 241 L-------DTNV------GGGIFSIG---EVVEPKVKTTPLVPNMAHYNVILKNIEVDGD 284
+ N+ G SI + + TP+ PN Y + L++I + +
Sbjct: 257 FLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPN--SYYIGLESITIGTN 314
Query: 285 I----LQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQY 340
I + L FDS+ G ++DSGTT +LP Y QL++ + + + E +
Sbjct: 315 ITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITYPRATETESRT 374
Query: 341 S---CFQ----------YTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDS----YW 383
C++ +V FP + HF ++ +L + + + S
Sbjct: 375 GFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQ 434
Query: 384 CIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNC 429
C+ +Q E + + G F N VVYDLE IG+ +C
Sbjct: 435 CLLFQN--MEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478
>AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19053480-19056152 REVERSE LENGTH=529
Length = 529
Score = 99.4 bits (246), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 164/378 (43%), Gaps = 41/378 (10%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRK-SDIGIG----LTLYDPKRSK 122
L++ + +G+P+ + V +DTGSD+ W+ C + C R ++G+ L LY P S
Sbjct: 101 LHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSS 160
Query: 123 TSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISY-GDGSATTGYYVQDYLTFNRVNGNL 181
TS + C + C G + CPY I Y + TTG +D L + L
Sbjct: 161 TSSSIRCSDDRCF----GSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGL 216
Query: 182 HTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL 241
N I GCG Q+G L SS+ A++G++G G + SV S LA + FS C
Sbjct: 217 EPVKAN--ITLGCGKNQTGFLQSSA--AVNGLLGLGLKDYSVPSILAKAKITANSFSMCF 272
Query: 242 DTNVG-GGIFSIGEVVEPKVKTTPLVPNMAH--YNVILKNIEVDGDI--LQLPSDTFDSE 296
+ G S G+ TPL+P Y V + + V GD +QL +
Sbjct: 273 GNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQLLA------ 326
Query: 297 NGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYS---CFQYTGNVDS-G 352
+ D+GT+ +L Y L++K K ++ + C+ + N +
Sbjct: 327 -----LFDTGTSFTHLLEPEYG-LITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTIL 380
Query: 353 FPIVKLHFEDSLSLTVY-PHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNK 411
FP V + FE + + P ++N + +C+G KS N ++G +S
Sbjct: 381 FPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKIN-----IIGQNFMSGY 435
Query: 412 LVVYDLENMTIGWTDYNC 429
+V+D E M +GW +C
Sbjct: 436 RIVFDRERMILGWKRSDC 453
>AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14682210-14683484 REVERSE LENGTH=424
Length = 424
Score = 99.0 bits (245), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 157/379 (41%), Gaps = 42/379 (11%)
Query: 62 LPNVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRS 121
+PN + I +G+P + +DTGSD+ W++C+ C P+ + + P RS
Sbjct: 72 IPNPAA-FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQT------IPFFHPSRS 124
Query: 122 KTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNL 181
T SC + +I + C Y + Y D S T G ++ LTF + L
Sbjct: 125 STYRNASC---VSAPHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGL 181
Query: 182 HTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL 241
+ QN I+FGCG SG S G++G G S++++ S K F
Sbjct: 182 -ISKQN--IVFGCGQDNSGFTKYS------GVLGLGPGTFSIVTRNFGS-KFSYCFGSLT 231
Query: 242 DTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGK-G 300
+ I +G + + TPL Y + L+ I +L + TF + G
Sbjct: 232 NPTYPHNILILGNGAKIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGG 291
Query: 301 TVIDSGTTLAYLPRIVY-------DQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVD--- 350
TVID+G + L R Y D L+ +VL R+K + +QY+ Y GN+
Sbjct: 292 TVIDTGCSPTILAREAYETLSEEIDFLLGEVLR---RVKDW---DQYTTPCYEGNLKLDL 345
Query: 351 SGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSN 410
GFP+V HF L + + + +C+ A DM+++G N
Sbjct: 346 YGFPVVTFHFAGGAELALDVESLFVSSESGDSFCL-----AMTMNTFDDMSVIGAMAQQN 400
Query: 411 KLVVYDLENMTIGWTDYNC 429
V Y+L M + + +C
Sbjct: 401 YNVGYNLRTMKVYFQRTDC 419
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 97.8 bits (242), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 163/378 (43%), Gaps = 42/378 (11%)
Query: 66 TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSE 125
+G Y + +G+P DTGSD+LW C C C + D L+DPK S T +
Sbjct: 87 SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVD-----PLFDPKTSSTYK 141
Query: 126 FVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTAT 185
VSC + C++ E + +N C YS+SYGD S T G D LT +
Sbjct: 142 DVSCSSSQCTA-LENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTR---PM 197
Query: 186 QNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL---- 241
Q +II GCG +GT + GI+G G S++ QL S + FS+CL
Sbjct: 198 QLKNIIIGCGHNNAGTF----NKKGSGIVGLGGGPVSLIKQLGDS--IDGKFSYCLVPLT 251
Query: 242 -----DTNVGGGIFSIGEVVEPKVKTTPLVPNMAH---YNVILKNIEVDGDILQLPSDTF 293
+ + G +I V V +TPL+ + Y + LK+I V +Q S +
Sbjct: 252 SKKDQTSKINFGTNAI--VSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY-SGSD 308
Query: 294 DSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYS-CFQYTGNVDSG 352
+ +IDSGTTL LP Y +L V + K + S C+ TG D
Sbjct: 309 SESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLK 366
Query: 353 FPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKL 412
P++ +HF D + + + D C ++ S S ++ G+ N L
Sbjct: 367 VPVITMHF-DGADVKLDSSNAFVQVSED-LVCFAFRGSPS-------FSIYGNVAQMNFL 417
Query: 413 VVYDLENMTIGWTDYNCS 430
V YD + T+ + +C+
Sbjct: 418 VGYDTVSKTVSFKPTDCA 435
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 97.1 bits (240), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 118/427 (27%), Positives = 182/427 (42%), Gaps = 56/427 (13%)
Query: 24 LVFPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYY 83
LV R Q+ IKA SS + +S L +G+ + Y + LG K+
Sbjct: 91 LVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLT-SGIKLESLNYIVTVELGG--KNMS 147
Query: 84 VQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFC-----SSTY 138
+ VDTGSD+ WV C C C + LYDP S + + V C + C +++
Sbjct: 148 LIVDTGSDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATSN 202
Query: 139 EGRILGCKA--ENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGA 196
G G + PC Y +SYGDGS T G + + L T+ + +FGCG
Sbjct: 203 SGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESI--------LLGDTKLENFVFGCGR 254
Query: 197 AQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGG--GIFSIGE 254
G SS G+++ S++SQ + +FS+CL + G G S G
Sbjct: 255 NNKGLFGGSSGLMGL-----GRSSVSLVSQTLKT--FNGVFSYCLPSLEDGASGSLSFGN 307
Query: 255 -----VVEPKVKTTPLVPN---MAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSG 306
V TPLV N + Y + L + G ++L S +F G+G +IDSG
Sbjct: 308 DSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSF----GRGILIDSG 361
Query: 307 TTLAYLPRIVYDQLMSKVLAK---QPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDS 363
T + LP +Y + + L + P Y + + +CF T D PI+K+ F+ +
Sbjct: 362 TVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILD--TCFNLTSYEDISIPIIKMIFQGN 419
Query: 364 LSLTVYPHDYLFNYKGD-SYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTI 422
L V + K D S C+ + E + G ++G++ N+ V+YD +
Sbjct: 420 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVG----IIGNYQQKNQRVIYDTTQERL 475
Query: 423 GWTDYNC 429
G NC
Sbjct: 476 GIVGENC 482
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 95.9 bits (237), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 104/438 (23%), Positives = 183/438 (41%), Gaps = 91/438 (20%)
Query: 21 NANLVFPVQRRQ----ASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLG 76
N N V Q++ + T + + + G++++ ++ +G+ +G YF + +G
Sbjct: 123 NQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVATLE-----SGMTLGSGEYFMDVLVG 177
Query: 77 SPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSS 136
SP K + + +DTGSD+ W+ C+ C C +++D
Sbjct: 178 SPPKHFSLILDTGSDLNWIQCLPCYDCFQQND---------------------------- 209
Query: 137 TYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFN-RVNGNLHTATQNSSIIFGCG 195
CPY YGD S TTG + + T N NG +++FGCG
Sbjct: 210 -----------NQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCG 258
Query: 196 AAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-----DTNVGGGIF 250
G ++ G + SS L L FS+CL DTNV +
Sbjct: 259 HWNRGLFHGAAGLLGLGRGP--LSFSSQLQSLYGHS-----FSYCLVDRNSDTNVSSKLI 311
Query: 251 SIGE----VVEPKVKTTPLVPNMAH-----YNVILKNIEVDGDILQLPSDTFD--SENGK 299
GE + P + T V + Y V +K+I V G++L +P +T++ S+
Sbjct: 312 -FGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAG 370
Query: 300 GTVIDSGTTLAYLPRIVYD----QLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPI 355
GT+IDSGTTL+Y Y+ ++ K K P + + + + CF +G + P
Sbjct: 371 GTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILD--PCFNVSGIHNVQLPE 428
Query: 356 VKLHFEDSLSLTVYPHDYLFNYKGDSYWC---IGWQKSASETKNGKDMTLLGDFVLSNKL 412
+ + F D ++ +P + F + + C +G KSA +++G++ N
Sbjct: 429 LGIAFADG-AVWNFPTENSFIWLNEDLVCLAMLGTPKSA--------FSIIGNYQQQNFH 479
Query: 413 VVYDLENMTIGWTDYNCS 430
++YD + +G+ C+
Sbjct: 480 ILYDTKRSRLGYAPTKCA 497
>AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19057013-19059788 REVERSE LENGTH=530
Length = 530
Score = 94.4 bits (233), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/391 (25%), Positives = 161/391 (41%), Gaps = 62/391 (15%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRC------PRKSDIGIGLTLYDPKRS 121
L++ + LG+P+ + V +DTGSD+ W+ C T C R S+ + L LY P S
Sbjct: 102 LHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSE-SVPLNLYTPNAS 160
Query: 122 KTSEFVSCEHNFCSSTYEGRILGCKA-ENPCPYSISYGDGSATTGYYVQDYLTFNRVNGN 180
TS + C C + + C + E+ CPY I+ + TTG +QD L + +
Sbjct: 161 TTSSSIRCSDKRCFGSGK-----CSSPESICPYQIALSSNTVTTGTLLQDVLHLVTEDED 215
Query: 181 LHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHC 240
L N+++ GCG Q+G A ++ A++G++G SV S LA + FS C
Sbjct: 216 LKPV--NANVTLGCGQNQTG--AFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMC 271
Query: 241 LDTNVG-GGIFSIGEVVEPKVKTTPLV--PNMAHYNVILKNIEVDGDILQLPSDTFDSEN 297
+ G S G+ + TPLV Y V + + V G + +P
Sbjct: 272 FGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGVNVTGVSVGGVPVDVP-------- 323
Query: 298 GKGTVIDSGTTLAYLPRIVY-------DQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVD 350
+ D+G++ L Y D LM ++P + E Y + N D
Sbjct: 324 -LFALFDTGSSFTLLLESAYGVFTKAFDDLMED--KRRPVDPDFPFEFCYDLREEHLNSD 380
Query: 351 SGFPIVKLHFEDSLSLTVYPHDYLFNYKGDS------------YWCIGWQKSASETKNGK 398
+ + S D+ + + DS +C+G KS
Sbjct: 381 A-----RPRHMQSKCYNPCRDDFRWRIQNDSQESVSYSNEGTKMYCLGILKSI------- 428
Query: 399 DMTLLGDFVLSNKLVVYDLENMTIGWTDYNC 429
++ ++G ++S +V+D E M +GW NC
Sbjct: 429 NLNIIGQNLMSGHRIVFDRERMILGWKQSNC 459
>AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11936203-11937390 REVERSE LENGTH=395
Length = 395
Score = 91.7 bits (226), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 153/370 (41%), Gaps = 49/370 (13%)
Query: 69 YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128
Y K+ +G+P + +DTGS+ +W C+ C C ++ ++DP +S T + +
Sbjct: 65 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTA-----PIFDPSKSSTFKEIR 119
Query: 129 CEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNS 188
C+ + ++ CPY + YG S T G V + +T + +G +
Sbjct: 120 CDTH---------------DHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPET- 163
Query: 189 SIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL------D 242
I GCG SG + G++G + S+++Q+ G+ + S+C
Sbjct: 164 --IIGCGRNNSGF-----KPGFAGVVGLDRGPKSLITQMG--GEYPGLMSYCFAGKGTSK 214
Query: 243 TNVGGGIFSIGE-VVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGT 301
N G G+ VV V P + N L + V ++ F + G
Sbjct: 215 INFGANAIVAGDGVVSTTVFVKTAKPGFYYLN--LDAVSVGNTRIETVGTPFHALKGN-I 271
Query: 302 VIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFE 361
VIDSG+TL Y P Y L+ K + +Q V C+ Y+ +D FP++ +HF
Sbjct: 272 VIDSGSTLTYFPE-SYCNLVRKAV-EQVVTAVRFPRSDILCY-YSKTIDI-FPVITMHFS 327
Query: 362 DSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMT 421
L + ++ +C+ A + + + G+ +N LV YD ++
Sbjct: 328 GGADLVLDKYNMYVASNTGGVFCL-----AIICNSPIEEAIFGNRAQNNFLVGYDSSSLL 382
Query: 422 IGWTDYNCSS 431
+ + NCS+
Sbjct: 383 VSFKPTNCSA 392
>AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11930579-11931769 REVERSE LENGTH=396
Length = 396
Score = 90.1 bits (222), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/378 (22%), Positives = 150/378 (39%), Gaps = 62/378 (16%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFV 127
+Y K+ +G+P + +DTGS+I W C+ C C ++ ++DP +S T +
Sbjct: 64 VYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQN-----APIFDPSKSSTFKEK 118
Query: 128 SCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQN 187
C+ + CPY + Y D + T G + +T + +G +
Sbjct: 119 RCD-----------------GHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPET 161
Query: 188 SSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD----- 242
I GCG S + + G++G SS+++Q+ G+ + S+C
Sbjct: 162 ---IIGCGHNNSWF-----KPSFSGMVGLNWGPSSLITQMG--GEYPGLMSYCFSGQGTS 211
Query: 243 -TNVGGGIFSIGE-VVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKG 300
N G G+ VV + T P + N L + V ++ TF + G
Sbjct: 212 KINFGANAIVAGDGVVSTTMFMTTAKPGFYYLN--LDAVSVGNTRIETMGTTFHALEGN- 268
Query: 301 TVIDSGTTLAYLP-------RIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGF 353
VIDSGTTL Y P R + +++ V A P L Y + F
Sbjct: 269 IVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLC--------YNSDTIDIF 320
Query: 354 PIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLV 413
P++ +HF + L + ++ +C+ A + + G+ +N LV
Sbjct: 321 PVITMHFSGGVDLVLDKYNMYMESNNGGVFCL-----AIICNSPTQEAIFGNRAQNNFLV 375
Query: 414 VYDLENMTIGWTDYNCSS 431
YD ++ + ++ NCS+
Sbjct: 376 GYDSSSLLVSFSPTNCSA 393
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 88.2 bits (217), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 109/410 (26%), Positives = 171/410 (41%), Gaps = 75/410 (18%)
Query: 67 GLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTL---YDPKRSKT 123
G Y + G+PS+ DTGS ++W+ C C G+ TL + PK S +
Sbjct: 88 GGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSS 147
Query: 124 SEFVSCEHNFCSSTYEGRILGCKAENP----C-----PYSISYGDGSATTGYYVQDYLTF 174
S+ + C+ C Y G + C+ +P C PY + YG GS T G + + L F
Sbjct: 148 SKIIGCQSPKCQFLY-GPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDF 205
Query: 175 NRVNGNLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVK 234
+ + GC ++ S+ + A GI GFG+ S+ SQ+
Sbjct: 206 PDLT--------VPDFVVGC------SIISTRQPA--GIAGFGRGPVSLPSQMNL----- 244
Query: 235 KIFSHCL------DTNV--------GGGIFSIGEVVEPKVKTTPLVPN--------MAHY 272
K FSHCL DTNV G G S + P + TP N + +Y
Sbjct: 245 KRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKT--PGLTYTPFRKNPNVSNKAFLEYY 302
Query: 273 NVILKNIEVDGDILQLPSDTF-DSENGK-GTVIDSGTTLAYLPRIVYDQLMSKVLAKQPR 330
+ L+ I V +++P NG G+++DSG+T ++ R V+ +L+++ A Q
Sbjct: 303 YLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVF-ELVAEEFASQ-- 359
Query: 331 LKVYLVEEQYS-------CFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDS-- 381
+ Y E+ CF +G D P + F+ L + P F + G++
Sbjct: 360 MSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLEL-PLSNYFTFVGNTDT 418
Query: 382 -YWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCS 430
+ K+ + + +LG F N LV YDLEN G+ CS
Sbjct: 419 VCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3400671-3402165 REVERSE LENGTH=464
Length = 464
Score = 87.0 bits (214), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 137/348 (39%), Gaps = 43/348 (12%)
Query: 28 VQRRQASLTGIKAHDSSRRGRILSAVDFN--LGGNGLPNVTGLYFTKIGLGSPSKDYYVQ 85
++R QA + I + S +S +G+ +G Y IG+G+P D +
Sbjct: 89 IRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLV 148
Query: 86 VDTGSDILWVNCVECT-RCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILG 144
DTGSD+ W C C C + + ++P S T + VSC C
Sbjct: 149 FDTGSDLTWTQCEPCLGSCYSQKE-----PKFNPSSSSTYQNVSCSSPMCEDAES----- 198
Query: 145 CKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSGTLAS 204
C A N C YSI YGD S T G+ ++ T L + + FGCG G
Sbjct: 199 CSASN-CVYSIVYGDKSFTQGFLAKEKFT-------LTNSDVLEDVYFGCGENNQGLFDG 250
Query: 205 SSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD--TNVGGGIFSIGEV-VEPKVK 261
+ + + IFS+CL T+ G + G + VK
Sbjct: 251 VAGLLG-------LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVK 303
Query: 262 TTPL--VPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQ 319
TP+ P+ +Y + + I V L + ++F +E G +IDSGT LP VY +
Sbjct: 304 FTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE---GAIIDSGTVFTRLPTKVYAE 360
Query: 320 LMSKVLAKQPRLKVYLVEEQY----SCFQYTGNVDSGFPIVKLHFEDS 363
L S + ++ Y Y +C+ +TG +P + F S
Sbjct: 361 LRSVF---KEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGS 405
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 87.0 bits (214), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 170/383 (44%), Gaps = 56/383 (14%)
Query: 67 GLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEF 126
G Y + LG+P + ++ +DT +D +W+ C C+ C S S T
Sbjct: 102 GNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNT------NSSSTYST 155
Query: 127 VSCEHNFCSSTYEGRILGCKAENP----CPYSISYGDGSATTGYYVQDYLTFNR-VNGNL 181
VSC C+ + R L C + +P C ++ SYG S+ + VQD LT V N
Sbjct: 156 VSCSTAQCT---QARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNF 212
Query: 182 HTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL 241
FGC + SG + G++G G+ S++SQ + +FS+CL
Sbjct: 213 S---------FGCINSASG-----NSLPPQGLMGLGRGPMSLVSQ--TTSLYSGVFSYCL 256
Query: 242 DTNVG---GGIFSIGEVVEPK-VKTTPLVPNMAH---YNVILKNIEVDGDILQLPSD--- 291
+ G +G + +PK ++ TPL+ N Y V L + V +Q+P D
Sbjct: 257 PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGS--VQVPVDPVY 314
Query: 292 -TFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQY-SCFQYTGNV 349
TFD+ +G GT+IDSGT + + VY+ + + KQ + + + +CF + +
Sbjct: 315 LTFDANSGAGTIIDSGTVITRFAQPVYEAIRDE-FRKQVNVSSFSTLGAFDTCF--SADN 371
Query: 350 DSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCI---GWQKSASETKNGKDMTLLGDF 406
++ P + LH SL L + + L + + C+ G +++A+ N ++ +
Sbjct: 372 ENVAPKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN-----VIANL 425
Query: 407 VLSNKLVVYDLENMTIGWTDYNC 429
N +++D+ N IG C
Sbjct: 426 QQQNLRILFDVPNSRIGIAPEPC 448
>AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11934208-11935386 REVERSE LENGTH=392
Length = 392
Score = 86.7 bits (213), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 154/378 (40%), Gaps = 62/378 (16%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFV 127
+Y K+ +G+P + ++DTGSD++W C+ CT C + ++DP S T +
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQY-----APIFDPSNSSTFK-- 112
Query: 128 SCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQN 187
E R G N C Y I Y D + + G + +T + +G +
Sbjct: 113 -----------EKRCNG----NSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPET 157
Query: 188 SSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDT---- 243
+ GCG +S + G++G SS+++Q+ G+ + S+C +
Sbjct: 158 T---IGCGHN-----SSWFKPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQGTS 207
Query: 244 --NVGGGIFSIGE-VVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKG 300
N G G+ VV + T P + + N L + V ++ TF + G
Sbjct: 208 KINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLN--LDAVSVGDTHVETMGTTFHALEGN- 264
Query: 301 TVIDSGTTLAYLP-------RIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGF 353
+IDSGTTL Y P R D ++ V P L C+ YT +D F
Sbjct: 265 IIIDSGTTLTYFPVSYCNLVREAVDHYVTAVRTADPTGNDML------CY-YTDTIDI-F 316
Query: 354 PIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLV 413
P++ +HF L + ++ +C+ A N + G+ +N LV
Sbjct: 317 PVITMHFSGGADLVLDKYNMYIETITRGTFCL-----AIICNNPPQDAIFGNRAQNNFLV 371
Query: 414 VYDLENMTIGWTDYNCSS 431
YD ++ + ++ NCS+
Sbjct: 372 GYDSSSLLVSFSPTNCSA 389
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 80.1 bits (196), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/399 (24%), Positives = 170/399 (42%), Gaps = 63/399 (15%)
Query: 78 PSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSST 137
P ++ + +DTGS++ W+ RC R S+ + +DP RS + + C C +
Sbjct: 82 PPQNISMVIDTGSELSWL------RCNRSSNPN-PVNNFDPTRSSSYSPIPCSSPTCRTR 134
Query: 138 YEGRIL--GCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCG 195
++ C ++ C ++SY D S++ G + F GN +T +S++IFGC
Sbjct: 135 TRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHF----GN---STNDSNLIFGCM 187
Query: 196 AAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD-TNVGGGIFSIGE 254
+ SG+ + G++G + + S +SQ+ FS+C+ T+ G +G+
Sbjct: 188 GSVSGS-DPEEDTKTTGLLGMNRGSLSFISQMGFPK-----FSYCISGTDDFPGFLLLGD 241
Query: 255 V---------VEPKVKTTPLVP--NMAHYNVILKNIEVDGDILQLPSDTF--DSENGKGT 301
P ++ + +P + Y V L I+V+G +L +P D T
Sbjct: 242 SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQT 301
Query: 302 VIDSGTTLAYLPRIVYDQLMSKVLAK-QPRLKVYLVEEQYSCFQYTGN---------VDS 351
++DSGT +L VY L S L + L VY E+ FQ T + + S
Sbjct: 302 MVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVY--EDPDFVFQGTMDLCYRISPVRIRS 359
Query: 352 G----FPIVKLHFEDS-LSLTVYPHDYLFNY---KGDSYWCIGWQKSASETKNGKDMTLL 403
G P V L FE + ++++ P Y + DS +C + S G + ++
Sbjct: 360 GILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNS---DLMGMEAYVI 416
Query: 404 GDFVLSNKLVVYDLENMTIGWTDYNCSSSIKVKDEKTGI 442
G N + +DL+ IG C V ++ GI
Sbjct: 417 GHHHQQNMWIEFDLQRSRIGLAPVEC----DVSGQRLGI 451
>AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:12033953-12037527 FORWARD LENGTH=756
Length = 756
Score = 80.1 bits (196), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/373 (23%), Positives = 149/373 (39%), Gaps = 48/373 (12%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFV 127
+Y K+ +G+P + ++DTGSDI+W C+ C C + ++DP +S T
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQF-----APIFDPSKSSTFR-- 472
Query: 128 SCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQN 187
E R G N C Y I Y D + + G + +T +G +
Sbjct: 473 -----------EQRCNG----NSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAET 517
Query: 188 SSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQ--LAASGKVKKIFSHCLDTNV 245
GCG + S + GI+G S++SQ L G + FS + +
Sbjct: 518 K---IGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGTSKI 574
Query: 246 GGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDS 305
G +I + + Y + L + V+ +++ F +E+G IDS
Sbjct: 575 NFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDGN-IFIDS 633
Query: 306 GTTLAYLP-------RIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKL 358
GTTL Y P R +Q+++ V K P + + C+ Y+ +D FP++ +
Sbjct: 634 GTTLTYFPMSYCNLVREAVEQVVTAV--KVPDMG----SDNLLCY-YSDTIDI-FPVITM 685
Query: 359 HFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLE 418
HF L + ++ +C+ A + + G+ +N LV YD
Sbjct: 686 HFSGGADLVLDKYNMYLETITGGIFCL-----AIGCNDPSMPAVFGNRAQNNFLVGYDPS 740
Query: 419 NMTIGWTDYNCSS 431
+ I ++ NCS+
Sbjct: 741 SNVISFSPTNCSA 753
Score = 73.6 bits (179), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/364 (21%), Positives = 146/364 (40%), Gaps = 48/364 (13%)
Query: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFV 127
+Y K+ +G+P + ++DTGSD++W C+ C C + D ++DP +S T
Sbjct: 81 IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFD-----PIFDPSKSSTFN-- 133
Query: 128 SCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQN 187
E R G C Y I Y D + + G + +T + +G +
Sbjct: 134 -----------EQRCHG----KSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAET 178
Query: 188 SSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQ--LAASGKVKKIFSHCLDTNV 245
+ GCG + S + GI+G S++SQ L G + FS + +
Sbjct: 179 T---IGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQGTSKI 235
Query: 246 GGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDS 305
G +I + + Y + L + V+ + ++ F +E+G VIDS
Sbjct: 236 NFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGN-IVIDS 294
Query: 306 GTTLAYLP-------RIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKL 358
G+T+ Y P R +Q+++ V P L C+ ++ +D FP++ +
Sbjct: 295 GSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDML------CY-FSETIDI-FPVITM 346
Query: 359 HFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLE 418
HF L + ++ +C+ ++ + + G+ +N LV YD
Sbjct: 347 HFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQE-----AIFGNRAQNNFLVGYDSS 401
Query: 419 NMTI 422
++ +
Sbjct: 402 SLLL 405
>AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14912862-14914190 FORWARD LENGTH=442
Length = 442
Score = 78.6 bits (192), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 159/384 (41%), Gaps = 56/384 (14%)
Query: 73 IGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHN 132
+ +G+PS+ + +DTGS + W ++C K + T +DP S + + C H
Sbjct: 84 LPIGTPSQSQELVLDTGSQLSW---IQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 140
Query: 133 FCSSTYEGRIL--GCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSI 190
C L C + C YS Y DG+ G V++ TF+ + +
Sbjct: 141 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFS-------NSQTTPPL 193
Query: 191 IFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTN------ 244
I GC S++E GI+G S +SQ K+ K FS+C+ T
Sbjct: 194 ILGCA-------KESTDE--KGILGMNLGRLSFISQ----AKISK-FSYCIPTRSNRPGL 239
Query: 245 VGGGIFSIGE------------VVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDT 292
G F +G+ + P+ + P + +A Y V L+ I + L +P
Sbjct: 240 ASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLA-YTVPLQGIRIGQKRLNIPGSV 298
Query: 293 FDSENGKG--TVIDSGTTLAYLPRIVYDQLMSKVLAK-QPRLKVYLVEEQYS--CFQYTG 347
F + G T++DSG+ +L + YD++ +++ RLK V + CF
Sbjct: 299 FRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNH 358
Query: 348 NVDSGFPIVKLHFE--DSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGD 405
+++ G I L FE + + V L N G + C+G +S+ G ++G+
Sbjct: 359 SMEIGRLIGDLVFEFGRGVEILVEKQSLLVNVGGGIH-CVGIGRSSML---GAASNIIGN 414
Query: 406 FVLSNKLVVYDLENMTIGWTDYNC 429
N V +D+ N +G++ C
Sbjct: 415 VHQQNLWVEFDVTNRRVGFSKAEC 438
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 73.2 bits (178), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 134/319 (42%), Gaps = 57/319 (17%)
Query: 73 IGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHN 132
+ +G P ++ + +DTGS++ W++C K +G ++++P S T V C
Sbjct: 69 LAVGDPPQNISMVLDTGSELSWLHC--------KKSPNLG-SVFNPVSSSTYSPVPCSSP 119
Query: 133 FCSSTYEGRILGCKAE---NPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSS 189
C + + + + C +ISY D ++ G + V
Sbjct: 120 ICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVT--------RPG 171
Query: 190 IIFGCGAAQSGTLASSSEEALD-GIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGG 248
+FGC SG ++S E+A G++G + + S ++QL S FS+C+ + G
Sbjct: 172 TLFGC--MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSK-----FSYCISGSDSSG 224
Query: 249 IFSIGEVVEP---KVKTTPLV----P----NMAHYNVILKNIEVDGDILQLPSDTF--DS 295
+G+ ++ TPLV P + Y V L+ I V IL LP F D
Sbjct: 225 FLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDH 284
Query: 296 ENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVD----- 350
T++DSGT +L VY L ++ + Q + + LV++ FQ G +D
Sbjct: 285 TGAGQTMVDSGTQFTFLMGPVYTALKNEFIT-QTKSVLRLVDDPDFVFQ--GTMDLCYKV 341
Query: 351 --------SGFPIVKLHFE 361
SG P+V L F
Sbjct: 342 GSTTRPNFSGLPMVSLMFR 360
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 73.2 bits (178), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 153/375 (40%), Gaps = 47/375 (12%)
Query: 69 YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128
Y K +G+P++ + +DT SD+ W+ C C CP T + P +S + + VS
Sbjct: 115 YIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-------TAFSPAKSTSFKNVS 167
Query: 129 CEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNS 188
C C G +A C ++++YG S+ QD + A
Sbjct: 168 CSAPQCKQV-PNPTCGARA---CSFNLTYGS-SSIAANLSQDTIRL--------AADPIK 214
Query: 189 SIIFGC--GAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVG 246
+ FGC A GT+ G+ S++SQ A K FS+CL +
Sbjct: 215 AFTFGCVNKVAGGGTIPPPQGLLGL-----GRGPLSLMSQ--AQSIYKSTFSYCLPSFRS 267
Query: 247 ---GGIFSIGEVVEP-KVKTTPLVPNMAH---YNVILKNIEVDGDILQLPSD--TFDSEN 297
G +G +P +VK T L+ N Y V L I V ++ LP F+
Sbjct: 268 LTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPST 327
Query: 298 GKGTVIDSGTTLAYLPRIVYDQLMSKVLAK-QPRLKVYLVEEQY-SCFQYTGNVDSGFPI 355
G GT+ DSGT L + VY+ + ++ + +P V + +C Y+G V P
Sbjct: 328 GAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTC--YSGQVK--VPT 383
Query: 356 VKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVY 415
+ F+ +++T+ + + + S C+ +A E N + ++ N V+
Sbjct: 384 ITFMFKG-VNMTMPADNLMLHSTAGSTSCLAM-AAAPENVNSV-VNVIASMQQQNHRVLI 440
Query: 416 DLENMTIGWTDYNCS 430
D+ N +G CS
Sbjct: 441 DVPNGRLGLARERCS 455
>AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24647221-24648513 FORWARD LENGTH=430
Length = 430
Score = 71.2 bits (173), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 88/387 (22%), Positives = 154/387 (39%), Gaps = 58/387 (14%)
Query: 73 IGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHN 132
+ +G+P + + +DTGS + W+ C P+ T +DP S + + C H
Sbjct: 76 LPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPK------TSFDPSLSSSFSTLPCSHP 129
Query: 133 FCSSTYEGRIL--GCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSI 190
C L C + C YS Y DG+ G V++ +TF+ +
Sbjct: 130 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE-------ITPPL 182
Query: 191 IFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL--DTNVGG- 247
I GC S GI+G + S +SQ K+ K FS+C+ +N G
Sbjct: 183 ILGCATESSDD---------RGILGMNRGRLSFVSQ----AKISK-FSYCIPPKSNRPGF 228
Query: 248 ---GIFSIGE------------VVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDT 292
G F +G+ + P+ + P + +A Y V + I L +
Sbjct: 229 TPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA-YTVPMIGIRFGLKKLNISGSV 287
Query: 293 FDSENGKG--TVIDSGTTLAYLPRIVYDQLMSKVLAK-QPRLKVYLVEEQYSCFQYTGNV 349
F + G T++DSG+ +L YD++ ++++ + RLK V + + GNV
Sbjct: 288 FRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNV 347
Query: 350 DSGFPIVK---LHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDF 406
++ F + + V L N G + C+G +S+ G ++G+
Sbjct: 348 AMIPRLIGDLVFVFTRGVEILVPKERVLVNVGGGIH-CVGIGRSSML---GAASNIIGNV 403
Query: 407 VLSNKLVVYDLENMTIGWTDYNCSSSI 433
N V +D+ N +G+ +CS +
Sbjct: 404 HQQNLWVEFDVTNRRVGFAKADCSRVV 430
>AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=350
Length = 350
Score = 62.8 bits (151), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 77/179 (43%), Gaps = 12/179 (6%)
Query: 26 FPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPNVTGLYFTKIGLGSPSKDYYVQ 85
FP + +L + H S R + + V + +G + +G YF + +G P + +
Sbjct: 42 FPSPTQALALDTRRLHFLSLRRKPIPFVKSPVV-SGAASGSGQYFVDLRIGQPPQSLLLI 100
Query: 86 VDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEG-RILG 144
DTGSD++WV C C C S T++ P+ S T C C + R
Sbjct: 101 ADTGSDLVWVKCSACRNCSHHSPA----TVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPI 156
Query: 145 C---KAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTATQNSSIIFGCGAAQSG 200
C + + C Y Y DGS T+G + ++ + +G + S+ FGCG SG
Sbjct: 157 CNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGK---EARLKSVAFGCGFRISG 212
>AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:7568286-7569455 FORWARD LENGTH=389
Length = 389
Score = 61.6 bits (148), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 89/383 (23%), Positives = 139/383 (36%), Gaps = 75/383 (19%)
Query: 69 YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128
+ +I GSP K ++ +DTGS + W C C+ C + Y P S T
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKI----YPKYRPAASITYRDAM 113
Query: 129 CEHNFCSSTYEGRILGCKAENP----CPYSISYGDGSATTGYYVQDYLTFNRVNGNLHTA 184
CE + S A +P C Y Y D + G Q+ +T + +G
Sbjct: 114 CEDSHPKSNPH------FAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRV 167
Query: 185 TQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTN 244
+ FGC G+ + + GI+G G S++ + + FS CL
Sbjct: 168 ---HGVYFGCNTLSDGSYFTGT-----GILGLGVGKYSIIGEFGSK------FSFCL--- 210
Query: 245 VGGGIFSIGEVVEPKVKTTPLVPNMA----HYNVILKNIEVDGDILQLPSDTFDSE---- 296
GE+ EPK ++ + A H VI NI I QL S E
Sbjct: 211 --------GEISEPKASHNLILGDGANVQGHPTVI--NITEGHTIFQLESIIVGEEITLD 260
Query: 297 NGKGTVIDSGTTLAYLPRIVY--------DQLMSKVLAKQPRL--KVYLVEEQYSCFQYT 346
+ +D+G+TL++L +Y D + S+ L+ +P L K +E
Sbjct: 261 DPVQVFVDTGSTLSHLSTNLYYKFVDAFDDLIGSRPLSYEPTLCYKADTIER-------L 313
Query: 347 GNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDF 406
+D GF F+ L+V H+ C+ Q + K ++G
Sbjct: 314 EKMDVGF-----KFDVGAELSVNIHNIFIQQGPPEIRCLAIQNN----KESFSHVIIGVI 364
Query: 407 VLSNKLVVYDLENMTIGWTDYNC 429
+ V YDL T +C
Sbjct: 365 AMQGYNVGYDLSAKTAYINKQDC 387
>AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:787143-788444 FORWARD LENGTH=433
Length = 433
Score = 54.7 bits (130), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 86/397 (21%), Positives = 150/397 (37%), Gaps = 64/397 (16%)
Query: 69 YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128
Y T I +P V D G LWV+C D G + Y R ++
Sbjct: 44 YTTVINQRTPLVPASVVFDLGGRELWVDC----------DKGYVSSTYQSPRCNSAVCSR 93
Query: 129 CEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNG-NLHTATQN 187
C + + GC + G+AT+G + D ++ NG N +
Sbjct: 94 AGSTSCGTCFSPPRPGCSNNTCGGIPDNTVTGTATSGEFALDVVSIQSTNGSNPGRVVKI 153
Query: 188 SSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGG 247
++IF CGA L + G+ G G+ N + SQ AA+ + F+ CL + G
Sbjct: 154 PNLIFDCGAT---FLLKGLAKGTVGMAGMGRHNIGLPSQFAAAFSFHRKFAVCLTSGKGV 210
Query: 248 GIFSIGE-VVEPKV-----KTTPLVPN--------------------MAHYNVILKNIEV 281
F G V P + +TTPL+ N + ++ K + +
Sbjct: 211 AFFGNGPYVFLPGIQISSLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPI 270
Query: 282 DGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVL---AKQPRLKVYLVEE 338
+ +L++ + T GT I S L +Y+ S+ + A + +V V+
Sbjct: 271 NPTLLKINAST----GIGGTKISSVNPYTVLESSIYNAFTSEFVKQAAARSIKRVASVKP 326
Query: 339 QYSCFQYTGNVDSGFPIVKLHFE-DSLSLTVYPHDYLFNYKG--------DSYWCIGWQK 389
+CF T NV + +L + + L ++ D ++ G D C+G+
Sbjct: 327 FGACFS-TKNVG----VTRLGYAVPEIELVLHSKDVVWRIFGANSMVSVSDDVICLGFVD 381
Query: 390 SASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTD 426
N + ++G F L + L+ +DL + G++
Sbjct: 382 GG---VNARTSVVIGGFQLEDNLIEFDLASNKFGFSS 415
>AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:790110-791414 FORWARD LENGTH=434
Length = 434
Score = 49.3 bits (116), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 89/410 (21%), Positives = 149/410 (36%), Gaps = 90/410 (21%)
Query: 69 YFTKIGLGSPSKDYYVQVDTGSDILWVNC-----------VECTR--CPRKSDIGIGLTL 115
Y T I +P V D G WV+C C C R I G T
Sbjct: 45 YTTVINQRTPLVPASVVFDLGGREFWVDCDQGYVSTTYRSPRCNSAVCSRAGSIACG-TC 103
Query: 116 YDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFN 175
+ P R C +N C + + I G AT+G + D ++
Sbjct: 104 FSPPRP------GCSNNTCGAFPDNSI----------------TGWATSGEFALDVVSIQ 141
Query: 176 RVNG-NLHTATQNSSIIFGCGAAQSGTLASSSEEALDGIIGFGQANSSVLSQLAASGKVK 234
NG N + ++IF CG S +L + G+ G G+ N + Q AA+
Sbjct: 142 STNGSNPGRFVKIPNLIFSCG---STSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFN 198
Query: 235 KIFSHCLDTNVGGGIFSIGE-VVEP-----KVKTTPLVPN-------------------- 268
+ F+ CL + G F G V P +++ TPL+ N
Sbjct: 199 RKFAVCLTSGRGVAFFGNGPYVFLPGIQISRLQKTPLLINPGTTVFEFSKGEKSPEYFIG 258
Query: 269 MAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVL--- 325
+ ++ K + +D +L++ + T GT I S L +Y S+ +
Sbjct: 259 VTAIKIVEKTLPIDPTLLKINAST----GIGGTKISSVNPYTVLESSIYKAFTSEFIRQA 314
Query: 326 AKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFE-DSLSLTVYPHDYLFNYKG----- 379
A + +V V+ +CF T NV + +L + + L ++ D ++ G
Sbjct: 315 AARSIKRVASVKPFGACFS-TKNVG----VTRLGYAVPEIQLVLHSKDVVWRIFGANSMV 369
Query: 380 ---DSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTD 426
D C+G+ N ++G F L + L+ +DL + G++
Sbjct: 370 SVSDDVICLGFVDGGV---NPGASVVIGGFQLEDNLIEFDLASNKFGFSS 416