Miyakogusa Predicted Gene
- Lj5g3v1203340.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1203340.1 Non Characterized Hit- tr|I3SQ74|I3SQ74_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,99.55,0,seg,NULL;
BASIC 7S GLOBULIN-RELATED,NULL; ASPARTYL PROTEASES,Peptidase A1; Acid
proteases,Peptidase ,CUFF.54970.1
(440 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E value
AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 397 e-111
AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr... 374 e-104
AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 175 5e-44
AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family pr... 160 2e-39
AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 159 3e-39
AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family pr... 152 3e-37
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 2e-15
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 70 3e-12
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 70 3e-12
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 70 4e-12
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 68 1e-11
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 2e-11
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 66 4e-11
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 65 1e-10
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 64 2e-10
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 64 3e-10
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 6e-10
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 6e-10
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 1e-09
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 60 3e-09
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 59 8e-09
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 57 2e-08
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 57 3e-08
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 56 6e-08
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 55 9e-08
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 54 2e-07
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 53 5e-07
AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family pr... 49 5e-06
AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 49 8e-06
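
The summary rows above all share one shape: a subject identifier, a truncated description, a bit score, and an E-value. Legacy BLAST prints very small E-values without a mantissa (for example "e-111"). The following is a minimal Python sketch for reading such rows, assuming the whitespace-collapsed layout shown here. The names HIT_LINE, parse_evalue and parse_hit_line are illustrative and not part of any BLAST tool.

```python
import re

# One summary row: "<id> | <description...> <bits> <evalue>".
# The E-value is either in exponent form ("5e-44", or "e-111" with the
# mantissa omitted, as legacy BLAST prints it) or a plain decimal ("0.0").
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|.*?\s"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d*\.?\d*e[-+]?\d+|\d+(?:\.\d+)?)\s*$"
)

def parse_evalue(text):
    """Convert a legacy BLAST E-value string to float ("e-111" -> 1e-111)."""
    return float("1" + text) if text.startswith("e") else float(text)

def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None if the line is not a hit row."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return m.group("id"), float(m.group("bits")), parse_evalue(m.group("evalue"))
```

Lines that do not match the row shape (headers, alignment text) return None, so the function can be mapped over the whole report and the None results filtered out.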
>AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:787143-788444 FORWARD LENGTH=433
Length = 433
Score = 397 bits (1021), Expect = e-111, Method: Compositional matrix adjust.
Identities = 220/431 (51%), Positives = 279/431 (64%), Gaps = 38/431 (8%)
Query: 27 AQTSFRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVS 86
AQT FRPKAL+LP+TKD S QY T I QRTPLVP + DLGG LWV+C+ + YVS
Sbjct: 22 AQTPFRPKALLLPVTKD--QSTLQYTTVINQRTPLVPASVVFDLGGRELWVDCD-KGYVS 78
Query: 87 STFKPARCGSSQCSLFGLTGCS----------GDKICGRSPSNTVTGVSSYGDIHSDVVS 136
ST++ RC S+ CS G T C + CG P NTVTG ++ G+ DVVS
Sbjct: 79 STYQSPRCNSAVCSRAGSTSCGTCFSPPRPGCSNNTCGGIPDNTVTGTATSGEFALDVVS 138
Query: 137 VNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHR 196
+ ST+G+ P +VV +PN +F CG+ + GLAKG GMAG+GR + LPSQF++AFSFHR
Sbjct: 139 IQSTNGSNPGRVVKIPNLIFDCGATFLLKGLAKGTVGMAGMGRHNIGLPSQFAAAFSFHR 198
Query: 197 KFAICLTANSGADGVMFFGDGPYNLNQDVS-KVLTYTPLITNPVSTAPSAFLGEPSVEYF 255
KFA+CLT+ GV FFG+GPY + L TPL+ NPVSTA + GE S EYF
Sbjct: 199 KFAVCLTS---GKGVAFFGNGPYVFLPGIQISSLQTTPLLINPVSTASAFSQGEKSSEYF 255
Query: 256 IGVKSIKVSEKNVPLNTTLLSINKN-GVGGTKISTVNPYTVMETTIYKAVADAFVKSLGA 314
IGV +I++ EK VP+N TLL IN + G+GGTKIS+VNPYTV+E++IY A FVK A
Sbjct: 256 IGVTAIQIVEKTVPINPTLLKINASTGIGGTKISSVNPYTVLESSIYNAFTSEFVKQAAA 315
Query: 315 PTVSPVA---PFGTCFATKDISFSRIGPGVPAIDLVLQN-GVEWPIIGANSMVQF-DDVI 369
++ VA PFG CF+TK++ +R+G VP I+LVL + V W I GANSMV DDVI
Sbjct: 316 RSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHSKDVVWRIFGANSMVSVSDDVI 375
Query: 370 CLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGFRSLFL-EH 428
CLGFVD G N + TS+ IG QLE+NL++FDLA+++ GF S L
Sbjct: 376 CLGFVDGGVNAR--------------TSVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQ 421
Query: 429 DNCQNFRFTSS 439
NC NF FTS+
Sbjct: 422 TNCANFNFTST 432
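
Each alignment record carries a statistics line of the form "Identities = 220/431 (51%), Positives = 279/431 (64%), Gaps = 38/431 (8%)". The printed percentages in this report are consistent with truncation rather than rounding (279/431 is about 64.7% but is shown as 64%). The following Python sketch extracts the counts and recomputes the percentages under that assumption. The function names are illustrative only.

```python
import re

# Matches each "Name = num/den (pct%)" field on a statistics line.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_alignment_stats(line):
    """Return {name: (count, alignment_length, printed_percent)} for one line."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STAT.findall(line)
    }

def truncated_percent(count, length):
    """Integer percentage, truncated toward zero (assumed to mirror the report)."""
    return (100 * count) // length
```

Comparing truncated_percent(count, length) with the printed percentage is a quick sanity check that a statistics line was not damaged during extraction.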
>AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:790110-791414 FORWARD LENGTH=434
Length = 434
Score = 374 bits (961), Expect = e-104, Method: Compositional matrix adjust.
Identities = 209/431 (48%), Positives = 272/431 (63%), Gaps = 38/431 (8%)
Query: 27 AQTSFRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVS 86
AQ SFRPKAL+LP+TKD S QY T I QRTPLVP + DLGG WV+C+ + YVS
Sbjct: 23 AQPSFRPKALLLPVTKD--PSTLQYTTVINQRTPLVPASVVFDLGGREFWVDCD-QGYVS 79
Query: 87 STFKPARCGSSQCSLFGLTGCS----------GDKICGRSPSNTVTGVSSYGDIHSDVVS 136
+T++ RC S+ CS G C + CG P N++TG ++ G+ DVVS
Sbjct: 80 TTYRSPRCNSAVCSRAGSIACGTCFSPPRPGCSNNTCGAFPDNSITGWATSGEFALDVVS 139
Query: 137 VNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHR 196
+ ST+G+ P + V +PN +F CGS + GLAKG GMAG+GR + LP QF++AFSF+R
Sbjct: 140 IQSTNGSNPGRFVKIPNLIFSCGSTSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNR 199
Query: 197 KFAICLTANSGADGVMFFGDGPYNLNQDVS-KVLTYTPLITNPVSTAPSAFLGEPSVEYF 255
KFA+CLT+ GV FFG+GPY + L TPL+ NP +T GE S EYF
Sbjct: 200 KFAVCLTS---GRGVAFFGNGPYVFLPGIQISRLQKTPLLINPGTTVFEFSKGEKSPEYF 256
Query: 256 IGVKSIKVSEKNVPLNTTLLSINKN-GVGGTKISTVNPYTVMETTIYKAVADAFVKSLGA 314
IGV +IK+ EK +P++ TLL IN + G+GGTKIS+VNPYTV+E++IYKA F++ A
Sbjct: 257 IGVTAIKIVEKTLPIDPTLLKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQAAA 316
Query: 315 PTVSPVA---PFGTCFATKDISFSRIGPGVPAIDLVLQN-GVEWPIIGANSMVQF-DDVI 369
++ VA PFG CF+TK++ +R+G VP I LVL + V W I GANSMV DDVI
Sbjct: 317 RSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDVI 376
Query: 370 CLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGFRSLFL-EH 428
CLGFVD G NP A S+ IG QLE+NL++FDLA+++ GF S L
Sbjct: 377 CLGFVDGGVNPGA--------------SVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQ 422
Query: 429 DNCQNFRFTSS 439
NC NF FTS+
Sbjct: 423 TNCANFNFTST 433
>AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6414585-6415745 FORWARD LENGTH=386
Length = 386
Score = 175 bits (444), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 138/406 (33%), Positives = 197/406 (48%), Gaps = 64/406 (15%)
Query: 27 AQTSFRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVS 86
+Q S +V P+ KD+ + QY+ QI+ PVKL +DL G LW +C +R S
Sbjct: 23 SQISDSVNGVVFPVVKDLPTG--QYLAQIRLGDSPDPVKLVVDLAGSILWFDCSSRHVSS 80
Query: 87 ST---------FKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSV 137
S A+ G+ + S + + C N G+++ G++ SDV+SV
Sbjct: 81 SRNLISGSSSGCLKAKVGNERVSSSSSSRKDQNADCELLVKNDAFGITARGELFSDVMSV 140
Query: 138 NSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRK 197
S T+P V + LF C + GLA G G+ GLGR ++SLPSQ ++ + R+
Sbjct: 141 GSV--TSPGTV----DLLFACTPPWLLRGLASGAQGVMGLGRAQISLPSQLAAETNERRR 194
Query: 198 FAICLTANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIG 257
+ L S +GV+ S+ L YTPL+T S Y I
Sbjct: 195 LTVYL---SPLNGVVSTSSVEEVFGVAASRSLVYTPLLTG------------SSGNYVIN 239
Query: 258 VKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLG-APT 316
VKSI+V+ + + + L ++STV PYT++E++IYK A+A+ K+ G A +
Sbjct: 240 VKSIRVNGEKLSVEGPL---------AVELSTVVPYTILESSIYKVFAEAYAKAAGEATS 290
Query: 317 VSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNG-VEWPIIGANSMVQFDDVICLGFVD 375
V PVAPFG CF T D+ F PA+DL LQ+ V W I G N M VD
Sbjct: 291 VPPVAPFGLCF-TSDVDF-------PAVDLALQSEMVRWRIHGKNLM-----------VD 331
Query: 376 AGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
G + S G V+GGS V I +G QLE +L FDL S +GF
Sbjct: 332 VGGGVRCS--GIVDGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGF 375
>AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6411720-6413170 REVERSE LENGTH=405
Length = 405
Score = 160 bits (404), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 132/411 (32%), Positives = 192/411 (46%), Gaps = 49/411 (11%)
Query: 37 VLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVSSTFKPARCGS 96
+LPITK ++L Y T PV L LDLG W++C + +SS + C S
Sbjct: 27 LLPITKHEPTNL-FYTTFNVGSAAKSPVNLLLDLGTNLTWLDCRKLKSLSS-LRLVTCQS 84
Query: 97 SQCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLF 156
S C GC+G + P+ G + D S+ +TDG VSV +F F
Sbjct: 85 STCKSIPGNGCAGKSCLYKQPNPLGQNPVVTGRVVQDRASLYTTDGGKFLSQVSVRHFTF 144
Query: 157 ICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICL----TANSGADGVM 212
C + GL V G+ L S Q +SAF+ KF++CL T + G+
Sbjct: 145 SCAGEKALQGLPPPVDGVLALSPGSSSFTKQVTSAFNVIPKFSLCLPSSGTGHFYIAGIH 204
Query: 213 FFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNT 272
+F P+N + NP+ + G S +Y I VKSI V + LN
Sbjct: 205 YFIP-PFNSSD-------------NPIPRTLTPIKGTDSGDYLITVKSIYVGGTALKLNP 250
Query: 273 TLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAF---VKSLGAPTVSPVAPFGTCFAT 329
LL+ GG K+STV YTV++T IY A+A +F K++G V VAPF CF +
Sbjct: 251 DLLT------GGAKLSTVVHYTVLQTDIYNALAQSFTLKAKAMGIAKVPSVAPFKHCFDS 304
Query: 330 KDISFS-RIGPGVPAIDLVLQ---NGVEWPIIGANSMVQFDD-VICLGFVDAGSNPKASQ 384
+ + GP VP I++ L V+W GAN++V+ + V+CL F+D G PK
Sbjct: 305 RTAGKNLTAGPNVPVIEIGLPGRIGEVKWGFYGANTVVKVKETVMCLAFIDGGKTPKDLM 364
Query: 385 VGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF-RSLFLEHDNCQNF 434
V IG HQL++++L+FD + + L F SL L + +C +
Sbjct: 365 V--------------IGTHQLQDHMLEFDFSGTVLAFSESLLLHNTSCSTW 401
>AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6408242-6409417 REVERSE LENGTH=391
Length = 391
Score = 159 bits (402), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 124/361 (34%), Positives = 173/361 (47%), Gaps = 58/361 (16%)
Query: 65 KLTLDLGGGY-LWVNCENRQYVSSTFKPARCGSSQCSLFGLT-GCSGDKICGRSPSNTVT 122
K LDL G L NC S+T+ P RCGS++C C + I + TV
Sbjct: 56 KFVLDLNGAAPLLQNCPTAA-KSTTYHPIRCGSTRCKYANPNFPCPNNVIAKK---RTVC 111
Query: 123 GVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRV 182
S + D V + T T+ + + L + +G GL T +
Sbjct: 112 LSSDNSRLFRDTVPLLYTFNGVYTRDSEMSSSLTL----TCTDGAPALKQRTIGLANTHL 167
Query: 183 SLPSQFSSAFSFHRKFAICLTANSGA---DGVMFFGDGPYNL---NQDVSKVLTYTPLIT 236
S+PSQ S + K A+CL + + +G ++ G G Y ++DVSK+ TPLI
Sbjct: 168 SIPSQLISMYQLPHKIALCLPSTERSQSHNGDLWIGKGEYYYLPYDKDVSKIFASTPLIG 227
Query: 237 NPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVM 296
N S EY I VKSI++ K VP+ G TKIST+ PYTV
Sbjct: 228 N-----------GKSGEYLIDVKSIQIGAKTVPI----------PYGATKISTLAPYTVF 266
Query: 297 ETTIYKAVADAFVKSLGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPI 356
+T++YKA+ AF +++ V PFG CF +S G GVP IDLVL G +W I
Sbjct: 267 QTSLYKALLTAFTENIKIAKAPAVKPFGACF------YSNGGRGVPVIDLVLSGGAKWRI 320
Query: 357 IGANSMVQFD-DVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLA 415
G+NS+V+ + +V+CLGFVD G PK +P I IG Q+E+NL++FDL
Sbjct: 321 YGSNSLVKVNKNVVCLGFVDGGVKPK-----------YP---IVIGGFQMEDNLVEFDLE 366
Query: 416 A 416
A
Sbjct: 367 A 367
>AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:19627892-19629112 REVERSE LENGTH=406
Length = 406
Score = 152 bits (385), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 131/426 (30%), Positives = 200/426 (46%), Gaps = 67/426 (15%)
Query: 31 FRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCEN---RQYVSS 87
+ PKALV ++K+ + + Q + + +GG YL C + R V
Sbjct: 24 YPPKALVSTVSKNTILPIFTFTLNTNQ-------EFFIHIGGPYLVRKCNDGLPRPIVP- 75
Query: 88 TFKPARCGSSQCSL---FGLTGCS--GDKICGRSPSNTVTGVSSYGDI-HSDV-----VS 136
CGS C+L F CS +KI + T + I +SD +S
Sbjct: 76 ------CGSPVCALTRRFTPHQCSLPSNKIINGVCACQATAFEPFQRICNSDQFTYGDLS 129
Query: 137 VNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSS-AFSFH 195
++S +P+ V++ N ++C + GV G+AGL T ++ +Q +
Sbjct: 130 ISSLKPISPS--VTINNVYYLCIPQPFLVDFPPGVFGLAGLAPTALATWNQLTRPRLGLE 187
Query: 196 RKFAICLTANSG--ADGVMFFGDGPYNL-NQDVSKVLTYTPLITNPVSTAPSAFLGEPSV 252
+KFA+CL ++ G ++FG GPY L N D +L+YT LITNP
Sbjct: 188 KKFALCLPSDENPLKKGAIYFGGGPYKLRNIDARSMLSYTRLITNPRKLN---------- 237
Query: 253 EYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSL 312
YF+G+K I V+ + + ++NG GG +ST+ P+T++ + IY+ +AF ++
Sbjct: 238 NYFLGLKGISVNGNRILFAPNAFAFDRNGDGGVTLSTIFPFTMLRSDIYRVFIEAFSQAT 297
Query: 313 -GAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVIC 370
G P VS PF C +T +F VP IDL L NGV W + AN+M + DDV C
Sbjct: 298 SGIPRVSSTTPFEFCLSTTT-NFQ-----VPRIDLELANGVIWKLSPANAMKKVSDDVAC 351
Query: 371 LGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF-RSLFLEHD 429
L FVNGG ++ IG HQ+EN L++FD+ S GF SL L
Sbjct: 352 L--------------AFVNGGDAAAQAVMIGIHQMENTLVEFDVGRSAFGFSSSLGLVSA 397
Query: 430 NCQNFR 435
+C +F+
Sbjct: 398 SCGDFQ 403
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 80.5 bits (197), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/406 (25%), Positives = 167/406 (41%), Gaps = 84/406 (20%)
Query: 59 TPLVPVKLTLDLGGGYLWVNCENRQYVS--------------------STFKPARCGSSQ 98
TP + D G +W+ C +R S S+ K C S +
Sbjct: 98 TPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPK 157
Query: 99 CS-LFGLT-GCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFL- 155
C L+G C G C + N G Y + + ST G T+ + P+
Sbjct: 158 CQFLYGPNVQCRG---CDPNTRNCTVGCPPYILQYG----LGSTAGVLITEKLDFPDLTV 210
Query: 156 --FICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTA------NSG 207
F+ G ++ + G+AG GR VSLPSQ + ++F+ CL + N
Sbjct: 211 PDFVVGCSIIS---TRQPAGIAGFGRGPVSLPSQMN-----LKRFSHCLVSRRFDDTNVT 262
Query: 208 ADGVMFFGDGPYNLNQDVSKV--LTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSE 265
D + G G + SK LTYTP NP + + AFL Y++ ++ I V
Sbjct: 263 TDLDLDTGSG----HNSGSKTPGLTYTPFRKNP-NVSNKAFL----EYYYLNLRRIYVGR 313
Query: 266 KNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPT----VSPVA 321
K+V + L+ NG GG+ + + + +T ME +++ VA+ F + T +
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373
Query: 322 PFGTCF---ATKDISFSRIGPGVPAIDLVLQNG--VEWPIIGANSMVQFDDVICLGFV-D 375
G CF D++ VP + + G +E P+ + V D +CL V D
Sbjct: 374 GLGPCFNISGKGDVT-------VPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSD 426
Query: 376 AGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
NP +GG+ P +I +G+ Q +N L+++DL R GF
Sbjct: 427 KTVNP--------SGGTGP--AIILGSFQQQNYLVEYDLENDRFGF 462
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 70.1 bits (170), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 101/404 (25%), Positives = 164/404 (40%), Gaps = 59/404 (14%)
Query: 32 RPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVSST--- 88
+PK +P+ + Y+ + K TP + + LD +W+ C S+
Sbjct: 85 KPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTS 144
Query: 89 --------FKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVS-SYGDIHSDVVSVNS 139
+ C ++QC T G SP +V + SYG S S+
Sbjct: 145 FNTNSSSTYSTVSCSTAQC-----TQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQ 199
Query: 140 TDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFA 199
T V+ PNF F C + N L G+ GLGR +SL SQ +S +S F+
Sbjct: 200 DTLTLAPDVI--PNFSFGCINSASGNSLPP--QGLMGLGRGPMSLVSQTTSLYS--GVFS 253
Query: 200 ICLTANSGADGVMFFGDGPYNLNQ-DVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGV 258
CL + F+ G L K + YTPL+ NP PS+ Y++ +
Sbjct: 254 YCLPSFRS-----FYFSGSLKLGLLGQPKSIRYTPLLRNP---------RRPSL-YYVNL 298
Query: 259 KSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPTVS 318
+ V VP++ L+ + N GT I + T +Y+A+ D F K + + S
Sbjct: 299 TGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFS 358
Query: 319 PVAPFGTCFATKDISFS-RIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDVICLGFVDAG 377
+ F TCF+ + + + +I + ++DL L +E +I +++ + CL AG
Sbjct: 359 TLGAFDTCFSADNENVAPKITLHMTSLDLKLP--MENTLIHSSA----GTLTCLSM--AG 410
Query: 378 SNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
A+ V V I Q +N + FD+ SR+G
Sbjct: 411 IRQNANAVLNV-----------IANLQQQNLRILFDVPNSRIGI 443
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 70.1 bits (170), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 95/409 (23%), Positives = 161/409 (39%), Gaps = 62/409 (15%)
Query: 39 PITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVS------------ 86
P+ S QY ++ P + L D G +WV C + S
Sbjct: 72 PVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRH 131
Query: 87 -STFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVS---SYGDIHSDVVSVNSTD- 141
STF PA C C L + IC + ++ + G + S + + +T
Sbjct: 132 SSTFSPAHCYDPVCRL--VPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSL 189
Query: 142 GTTPTKVVSVPNFLFICGSKVVQNGLA----KGVTGMAGLGRTRVSLPSQFSSAFSFHRK 197
T+ K + + F CG ++ ++ G G+ GLGR +S SQ F K
Sbjct: 190 KTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFG--NK 247
Query: 198 FAICL---TANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEY 254
F+ CL T + + G+G +SK L +TPL+TNP+S P+ Y
Sbjct: 248 FSYCLMDYTLSPPPTSYLIIGNG----GDGISK-LFFTPLLTNPLS---------PTF-Y 292
Query: 255 FIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGA 314
++ +KS+ V+ + ++ ++ I+ +G GGT + + + Y++V A + +
Sbjct: 293 YVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKL 352
Query: 315 PTVSPVAP-FGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDVI-CLG 372
P + P F C + ++ +P + G + N ++ ++ I CL
Sbjct: 353 PIADALTPGFDLCVNVSGV--TKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLA 410
Query: 373 FVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
+PK VGF IG + L +FD SRLGF
Sbjct: 411 IQSV--DPK---VGFS----------VIGNLMQQGFLFEFDRDRSRLGF 444
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 69.7 bits (169), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 154/392 (39%), Gaps = 74/392 (18%)
Query: 64 VKLTLDLGGGYLWVNCENRQ-----------YVSSTFKPARCGSSQC-----SLFGLTGC 107
+ + +D G W+ C NR SS++ P C S C C
Sbjct: 86 ISMVIDTGSELSWLRC-NRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASC 144
Query: 108 SGDKICGRSPSNTVTGVSSYGDIHSDVVSV-NSTDGTTPTKVVSVPNFLFICGSKVVQNG 166
DK+C + S SS G++ +++ NST+ + N +F C V +
Sbjct: 145 DSDKLCHATLS-YADASSSEGNLAAEIFHFGNSTNDS---------NLIFGCMGSVSGSD 194
Query: 167 LAKGV--TGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGPYNLNQD 224
+ TG+ G+ R +S SQ KF+ C++ G + GD N
Sbjct: 195 PEEDTKTTGLLGMNRGSLSFISQMGFP-----KFSYCISGTDDFPGFLLLGDS----NFT 245
Query: 225 VSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGG 284
L YTPLI +ST F V Y + + IKV+ K +P+ ++L + G G
Sbjct: 246 WLTPLNYTPLIR--ISTPLPYF---DRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQ 300
Query: 285 TKISTVNPYTVMETTIYKAVADAFV-KSLGAPTVSPVAPF---GTCFATKDISFSRIGPG 340
T + + +T + +Y A+ F+ ++ G TV F GT IS RI G
Sbjct: 301 TMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSG 360
Query: 341 V----PAIDLVLQNGVEWPIIGANSMVQF-------DDVICLGFVDAGSNPKASQVGFVN 389
+ P + LV + G E + G + + D V C F G++ +V
Sbjct: 361 ILHRLPTVSLVFE-GAEIAVSGQPLLYRVPHLTVGNDSVYCFTF---GNSDLMGMEAYV- 415
Query: 390 GGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
IG H +N ++FDL SR+G
Sbjct: 416 ----------IGHHHQQNMWIEFDLQRSRIGL 437
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 68.2 bits (165), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/379 (23%), Positives = 154/379 (40%), Gaps = 52/379 (13%)
Query: 64 VKLTLDLGGGYLWVNCENRQYVSSTFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTG 123
+ + LD G W++C+ + S F P S + CS IC +
Sbjct: 78 ISMVLDTGSELSWLHCKKSPNLGSVFNPV-----SSSTYSPVPCS-SPICRTRTRDLPIP 131
Query: 124 VSSYGDIHSDVVSVNSTDGTT-------PTKV---VSVPNFLFICGSKVVQNGLAKGV-- 171
S H V+++ D T+ T V V+ P LF C + + +
Sbjct: 132 ASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKS 191
Query: 172 TGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGPYNLNQDVSKVLTY 231
TG+ G+ R +S +Q FS KF+ C++ S + G + GD Y+ + Y
Sbjct: 192 TGLMGMNRGSLSFVNQL--GFS---KFSYCISG-SDSSGFLLLGDASYSWLGPIQ----Y 241
Query: 232 TPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVN 291
TPL+ + + P + V Y + ++ I+V K + L ++ + G G T + +
Sbjct: 242 TPLV---LQSTPLPYFDR--VAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGT 296
Query: 292 PYTVMETTIYKAVADAFVKSLGAPTVSPVAPFGTCFATKDISFSRIG-------PGVPAI 344
+T + +Y A+ + F+ + P T D+ + ++G G+P +
Sbjct: 297 QFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCY-KVGSTTRPNFSGLPMV 355
Query: 345 DLVLQNGVEWPIIGANSMVQFDDVICLGFVDAGSNPKASQVGFVNGGSH--PVTSITIGA 402
L+ + G E + G + + + AGS K F G S + + IG
Sbjct: 356 SLMFR-GAEMSVSGQKLLYRVN--------GAGSEGKEEVYCFTFGNSDLLGIEAFVIGH 406
Query: 403 HQLENNLLKFDLAASRLGF 421
H +N ++FDLA SR+GF
Sbjct: 407 HHQQNVWMEFDLAKSRVGF 425
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 67.4 bits (163), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/349 (23%), Positives = 144/349 (41%), Gaps = 56/349 (16%)
Query: 50 QYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYV------------SSTFKPARCGSS 97
+++ ++ P V +D G +W C+ SS++ C S
Sbjct: 106 EFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSG 165
Query: 98 QCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFI 157
C+ + C+ DK + + +YGD +S + +T+ T S+ F
Sbjct: 166 LCNALPRSNCNEDK-------DACEYLYTYGD-YSSTRGLLATETFTFEDENSISGIGFG 217
Query: 158 CGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTA--NSGADGVMFFG 215
CG + +G ++G +G+ GLGR +SL SQ KF+ CLT+ +S A +F G
Sbjct: 218 CGVENEGDGFSQG-SGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEASSSLFIG 271
Query: 216 D---GPYN-----LNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKN 267
G N L+ +V+K ++ L+ NP +PS Y++ ++ I V K
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMS---LLRNP---------DQPSF-YYLELQGITVGAKR 318
Query: 268 VPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPT-VSPVAPFGTC 326
+ + + + ++G GG I + T +E T +K + + F + P S C
Sbjct: 319 LSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLC 378
Query: 327 FATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDD--VICLGF 373
F D + + VP + + G + + G N MV V+CL
Sbjct: 379 FKLPDAAKN---IAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAM 423
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 66.2 bits (160), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 101/415 (24%), Positives = 156/415 (37%), Gaps = 63/415 (15%)
Query: 33 PKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNC--------ENRQY 84
P L+ + +T +Y + TP L LD G W+ C +N +
Sbjct: 142 PGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMF 201
Query: 85 ----VSSTFKPARCGSSQCSLFGL----TGCSGDKICGRSPSNTVTGVSS--YGDIHSDV 134
S++FK C +CSL C D P G S GD +
Sbjct: 202 YDPKTSASFKNITCNDPRCSLISSPDPPVQCESDN--QSCPYFYWYGDRSNTTGDFAVET 259
Query: 135 VSVNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSF 194
+VN T + V N +F CG GL G +G+ GLGR +S SQ S +
Sbjct: 260 FTVNLTTTEGGSSEYKVGNMMFGCGH--WNRGLFSGASGLLGLGRGPLSFSSQLQSLYG- 316
Query: 195 HRKFAICL---TANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPS 251
F+ CL +N+ + FG+ LN L +T + E S
Sbjct: 317 -HSFSYCLVDRNSNTNVSSKLIFGEDKDLLNH---TNLNFTSFVNGK----------ENS 362
Query: 252 VE--YFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFV 309
VE Y+I +KSI V K + + +I+ +G GGT I + + Y+ + + F
Sbjct: 363 VETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFA 422
Query: 310 KSLGA--PTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-D 366
+ + P CF I + I +P + + +G W NS + +
Sbjct: 423 EKMKENYPIFRDFPVLDPCFNVSGIEENNI--HLPELGIAFVDGTVWNFPAENSFIWLSE 480
Query: 367 DVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
D++CL + G+ T IG +Q +N + +D SRLGF
Sbjct: 481 DLVCLAIL----------------GTPKSTFSIIGNYQQQNFHILYDTKRSRLGF 519
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 64.7 bits (156), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 106/409 (25%), Positives = 167/409 (40%), Gaps = 71/409 (17%)
Query: 32 RPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE--NRQYVSST- 88
RP + ++ +Y T++ TP V + LD G +W+ C R Y S
Sbjct: 123 RPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP 182
Query: 89 -FKPAR--------CGSSQCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNS 139
F P + C S C GC+ + T SYGD S V S
Sbjct: 183 IFDPRKSKTYATIPCSSPHCRRLDSAGCNTRR-------KTCLYQVSYGD-GSFTVGDFS 234
Query: 140 TDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFA 199
T+ T + V CG GL G G+ GLG+ ++S P Q + F++KF+
Sbjct: 235 TE-TLTFRRNRVKGVALGCGHD--NEGLFVGAAGLLGLGKGKLSFPGQ--TGHRFNQKFS 289
Query: 200 ICLTANSGAD--GVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIG 257
CL S + + FG N VS++ +TPL++NP + Y++G
Sbjct: 290 YCLVDRSASSKPSSVVFG------NAAVSRIARFTPLLSNP----------KLDTFYYVG 333
Query: 258 VKSIKVSEKNVP-LNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPT 316
+ I V VP + +L +++ G GG I + T + Y A+ DAF +GA T
Sbjct: 334 LLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVGAKT 391
Query: 317 VSPVAP----FGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDVICLG 372
+ AP F TCF +++ + VP + L + G + + N ++ D
Sbjct: 392 LK-RAPDFSLFDTCFDLSNMNEVK----VPTVVLHFR-GADVSLPATNYLIPVDTNGKFC 445
Query: 373 FVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
F AG+ S +G + Q + + +DLA+SR+GF
Sbjct: 446 FAFAGTMGGLSIIGNI---------------QQQGFRVVYDLASSRVGF 479
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 64.3 bits (155), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 132/311 (42%), Gaps = 46/311 (14%)
Query: 32 RPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE--------NRQ 83
K+ LP T YI + TP + L D G W C+ ++
Sbjct: 113 ESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKE 172
Query: 84 YV-----SSTFKPARCGSSQC-SLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSV 137
+ S+++ C S+ C SL TG +G C S SN + G+ YGD S V
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGS--C--SASNCIYGIQ-YGD-QSFSVGF 226
Query: 138 NSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRK 197
+ + T T F CG GL GV G+ GLGR ++S PSQ +A ++++
Sbjct: 227 LAKEKFTLTNSDVFDGVYFGCGEN--NQGLFTGVAGLLGLGRDKLSFPSQ--TATAYNKI 282
Query: 198 FAICLTANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIG 257
F+ CL +++ G + FG + +S+ + +TP+ T + + + Y +
Sbjct: 283 FSYCLPSSASYTGHLTFG------SAGISRSVKFTPIST----------ITDGTSFYGLN 326
Query: 258 VKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGA-PT 316
+ +I V + +P+ +T+ S G I + T + Y A+ +F + PT
Sbjct: 327 IVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPT 381
Query: 317 VSPVAPFGTCF 327
S V+ TCF
Sbjct: 382 TSGVSILDTCF 392
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 63.5 bits (153), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 158/386 (40%), Gaps = 82/386 (21%)
Query: 64 VKLTLDLGGGYLWVNCE------NRQ------YVSSTFKPARCGSSQCS-LFGLTG---- 106
+ L +D G WV C+ N+Q VSS++K C SS C L T
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205
Query: 107 CSGDKICGRSPSNTVTGVSSYGD---IHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVV 163
C G+ ++P V SYGD D+ S + G T + NF+F CG
Sbjct: 206 CGGNNGVVKTPCEYVV---SYGDGSYTRGDLASESILLGDTK-----LENFVFGCGRN-- 255
Query: 164 QNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICL-TANSGADGVMFFGDGPYNLN 222
GL G +G+ GLGR+ VSL SQ + +F+ F+ CL + GA G + FG+
Sbjct: 256 NKGLFGGSSGLMGLGRSSVSLVSQ--TLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYT 313
Query: 223 QDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGV 282
S ++YTPL+ N P + F + S V L ++
Sbjct: 314 NSTS--VSYTPLVQN------------PQLRSFYILNLTGASIGGVELKSSSFGRGILID 359
Query: 283 GGTKISTVNPYTVMETTIYKAVADAFVKSL-GAPTVSPVAPFGTCF---ATKDISFSRIG 338
GT I+ + P +IYKAV F+K G PT + TCF + +DIS
Sbjct: 360 SGTVITRLPP------SIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDIS----- 408
Query: 339 PGVPAIDLVLQNGVEWP--IIGANSMVQFD-DVICLGFVDAGSNPKASQVGFVNGGSHPV 395
+P I ++ Q E + G V+ D ++CL S ++VG
Sbjct: 409 --IPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLAL---ASLSYENEVGI-------- 455
Query: 396 TSITIGAHQLENNLLKFDLAASRLGF 421
IG +Q +N + +D RLG
Sbjct: 456 ----IGNYQQKNQRVIYDTTQERLGI 477
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 62.4 bits (150), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 71/290 (24%), Positives = 121/290 (41%), Gaps = 39/290 (13%)
Query: 49 PQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYVSST--FKPAR--------CGSSQ 98
P YI + TP P+ + LD W+ C SS+ F P++ C + Q
Sbjct: 86 PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQ 145
Query: 99 CSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFIC 158
C C+ K CG + + + + +Y + D +++ S +PN+ F C
Sbjct: 146 CKQAPNPSCTVSKSCGFNMTYGGSTIEAY--LTQDTLTLASD---------VIPNYTFGC 194
Query: 159 GSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGP 218
+K +G + G+ GLGR +SL SQ S + F+ CL + ++ GP
Sbjct: 195 INKA--SGTSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLGP 250
Query: 219 YNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSIN 278
N Q + + TPL+ NP S Y++ + I+V K V + T+ L+ +
Sbjct: 251 KN--QPIR--IKTTPLLKNP----------RRSSLYYVNLVGIRVGNKIVDIPTSALAFD 296
Query: 279 KNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPTVSPVAPFGTCFA 328
GT + YT + Y AV + F + + + + F TC++
Sbjct: 297 PATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCYS 346
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 62.4 bits (150), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 93/392 (23%), Positives = 149/392 (38%), Gaps = 61/392 (15%)
Query: 50 QYITQIKQRTPLVPVKLTLDLGGGYLWVNC-------ENRQYV----SSTFKPARCGSSQ 98
QY T+I+ TP ++ +D G WVNC +NR+ S +FK C +
Sbjct: 105 QYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQT 164
Query: 99 C-----SLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPN 153
C +LF LT C G ++ G + ++V T+G ++ +P
Sbjct: 165 CKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNG----RMARLPG 220
Query: 154 FLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGVMF 213
L C S +G G+ GL + S S +S + KF+ CL +
Sbjct: 221 HLIGCSSSFTGQSF-QGADGVLGLAFSDFSFTSTATSLYG--AKFSYCLVDHLS------ 271
Query: 214 FGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTT 273
N++VS L + + + + L + F + I +S L+
Sbjct: 272 --------NKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIP 323
Query: 274 LLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSL-GAPTVSPVA-PFGTCFA-TK 330
+ GGT + + T++ YK V + L V P P CF+ T
Sbjct: 324 SQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 383
Query: 331 DISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFD-DVICLGFVDAGSNPKASQVGFVN 389
+ S++ P + L+ G + + +V V CLGFV AG+
Sbjct: 384 GFNVSKL----PQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGT----------- 428
Query: 390 GGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
P T++ IG +N L +FDL AS L F
Sbjct: 429 ----PATNV-IGNIMQQNYLWEFDLMASTLSF 455
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 61.6 bits (148), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 149/355 (41%), Gaps = 59/355 (16%)
Query: 50 QYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYV------------SSTFKPARCGSS 97
++ I TP + V D G WV C+ Q SST+K C S
Sbjct: 84 EFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 143
Query: 98 QCSLFGLT--GC-SGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNF 154
C T GC + IC S S GD+ ++ VS++S G+ VS P
Sbjct: 144 NCQALSSTERGCDESNNICKYRYSYGDQSFSK-GDVATETVSIDSASGSP----VSFPGT 198
Query: 155 LFICGSKVVQNG--LAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICL---TANSGAD 209
+F CG NG + +G+ GLG +SL SQ S+ S +KF+ CL +A +
Sbjct: 199 VFGCG---YNNGGTFDETGSGIIGLGGGHLSLISQLGSSIS--KKFSYCLSHKSATTNGT 253
Query: 210 GVMFFGDG--PYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSEKN 267
V+ G P +L++D V TPL+ EP Y++ +++I V +K
Sbjct: 254 SVINLGTNSIPSSLSKDSGVV--STPLVDK-----------EPLTYYYLTLEAISVGKKK 300
Query: 268 VPLNTTLLSINKNGV-----GGTKISTVNPYTVMETTIYKAVADAFVKSL-GAPTVS-PV 320
+P + + N +G+ G I + T++E + + A +S+ GA VS P
Sbjct: 301 IPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQ 360
Query: 321 APFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICLGFV 374
CF + G+P I + G + + N+ V+ +D++CL V
Sbjct: 361 GLLSHCFKSGSAEI-----GLPEITVHF-TGADVRLSPINAFVKLSEDMVCLSMV 409
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 60.1 bits (144), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 98/423 (23%), Positives = 155/423 (36%), Gaps = 92/423 (21%)
Query: 35 ALVLPITKDVT-SSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNC-------------E 80
A+ +P+ D S+ Y +I TP + +D G LWVNC E
Sbjct: 68 AIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVE 127
Query: 81 NRQY---VSSTFKPARCGSSQCSLFG-LTGCSGDKICGRSPSNTVTGVSSYGD------- 129
Y SST K C + CS + C C V YGD
Sbjct: 128 LTPYDVDASSTAKSVSCSDNFCSYVNQRSECHSGSTCQY--------VIMYGDGSSTNGY 179
Query: 130 -----IHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVVQNGL----AKGVTGMAGLGRT 180
+H D+V+ N G+T ++ F CGSK Q+G V G+ G G++
Sbjct: 180 LVKDVVHLDLVTGNRQTGSTNGTII------FGCGSK--QSGQLGESQAAVDGIMGFGQS 231
Query: 181 RVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVS 240
S SQ +S R FA CL N+G G + + + VS + TP+++
Sbjct: 232 NSSFISQLASQGKVKRSFAHCLDNNNGG--------GIFAIGEVVSPKVKTTPMLS---- 279
Query: 241 TAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTI 300
S Y + + +I+V + L++ + + G I + + +
Sbjct: 280 ---------KSAHYSVNLNAIEVGNSVLELSSN--AFDSGDDKGVIIDSGTTLVYLPDAV 328
Query: 301 YKAVADAFVKSLGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGAN 360
Y + + + S T+ V TCF D R P + V +
Sbjct: 329 YNPLLNEILASHPELTLHTVQESFTCFHYTD-KLDRF----PTVTFQFDKSVSLAVYPRE 383
Query: 361 SMVQF-DDVICLGFVDAGSNPKASQVGFVNGGSHPVTSITI-GAHQLENNLLKFDLAASR 418
+ Q +D C G+ + G K GG+ S+TI G L N L+ +D+
Sbjct: 384 YLFQVREDTWCFGWQNGGLQTK--------GGA----SLTILGDMALSNKLVVYDIENQV 431
Query: 419 LGF 421
+G+
Sbjct: 432 IGW 434
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 58.5 bits (140), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 98/403 (24%), Positives = 158/403 (39%), Gaps = 76/403 (18%)
Query: 39 PITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCEN------------RQYVS 86
P+ T +Y T++ P V + LD G W+ C S
Sbjct: 136 PLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSS 195
Query: 87 STFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVSSY--GDIHSDVVSVNSTDGTT 144
S+++P C + QC+ ++ C + C S G SY GD ++ +++ ST
Sbjct: 196 SSYEPLSCDTPQCNALEVSECR-NATCLYEVSY---GDGSYTVGDFATETLTIGST---- 247
Query: 145 PTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLT- 203
V N CG GL G G+ GLG ++LPSQ ++ F+ CL
Sbjct: 248 -----LVQNVAVGCGHS--NEGLFVGAAGLLGLGGGLLALPSQLNTT-----SFSYCLVD 295
Query: 204 ANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKV 263
+S + + FG +S PL+ N + Y++G+ I V
Sbjct: 296 RDSDSASTVDFGTS-------LSPDAVVAPLLRN----------HQLDTFYYLGLTGISV 338
Query: 264 SEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVK-SLGAPTVSPVAP 322
+ + + + ++++G GG I + T ++T IY ++ D+FVK +L + VA
Sbjct: 339 GGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAM 398
Query: 323 FGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDV--ICLGFVDAGSNP 380
F TC+ ++ VP + G + N M+ D V CL F P
Sbjct: 399 FDTCYNLS----AKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFA-----P 449
Query: 381 KASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGFRS 423
AS + IG Q + + FDLA S +GF S
Sbjct: 450 TASSLAI------------IGNVQQQGTRVTFDLANSLIGFSS 480
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 57.4 bits (137), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/352 (25%), Positives = 150/352 (42%), Gaps = 55/352 (15%)
Query: 43 DVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE--NRQY----------VSSTFK 90
D+TS+ +Y+ + TP P+ D G LW C + Y SST+K
Sbjct: 82 DLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYK 141
Query: 91 PARCGSSQC-SLFGLTGCS-GDKICGRSPSNTVTGVSSY--GDIHSDVVSVNSTDGTTPT 146
C SSQC +L CS D C S S G +SY G+I D +++ S+D T
Sbjct: 142 DVSCSSSQCTALENQASCSTNDNTCSYSLS---YGDNSYTKGNIAVDTLTLGSSD----T 194
Query: 147 KVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANS 206
+ + + N + CG K +G+ GLG VSL Q S KF+ CL +
Sbjct: 195 RPMQLKNIIIGCGHNNA-GTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLT 251
Query: 207 GADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVE--YFIGVKSIKVS 264
D +N + +++ + +++ P+ + + S E Y++ +KSI V
Sbjct: 252 SK------KDQTSKINFGTNAIVSGSGVVSTPL-------IAKASQETFYYLTLKSISVG 298
Query: 265 EKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAP-TVSPVAPF 323
K + + + ++ + I + T++ T Y + DA S+ A P +
Sbjct: 299 SKQIQYSGSDSESSEGNI---IIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGL 355
Query: 324 GTCF-ATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICLGF 373
C+ AT D+ VP I + +G + + +N+ VQ +D++C F
Sbjct: 356 SLCYSATGDLK-------VPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAF 399
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 56.6 bits (135), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 92/401 (22%), Positives = 153/401 (38%), Gaps = 64/401 (15%)
Query: 28 QTSFRPKALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE-----NR 82
T ++ + L P+ + +Y ++I TP + L LD G W+ CE +
Sbjct: 139 DTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQ 198
Query: 83 Q-------YVSSTFKPARCGSSQCSLFGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVV 135
Q SST+K C + QCSL + C +K + SYGD S V
Sbjct: 199 QSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQV---------SYGD-GSFTV 248
Query: 136 SVNSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFH 195
+TD T + N CG GL G G+ GLG +S+ +Q +
Sbjct: 249 GELATDTVTFGNSGKINNVALGCGHD--NEGLFTGAAGLLGLGGGVLSITNQMKAT---- 302
Query: 196 RKFAICLT-ANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEY 254
F+ CL +SG + F + + +TAP + Y
Sbjct: 303 -SFSYCLVDRDSGKSSSLDFN----------------SVQLGGGDATAPLLRNKKIDTFY 345
Query: 255 FIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVK---S 311
++G+ V + V L + ++ +G GG + T ++T Y ++ DAF+K +
Sbjct: 346 YVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVN 405
Query: 312 LGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDD--VI 369
L + S ++ F TC+ +S + VP + G + N ++ DD
Sbjct: 406 LKKGS-SSISLFDTCYDFSSLSTVK----VPTVAFHFTGGKSLDLPAKNYLIPVDDSGTF 460
Query: 370 CLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLL 410
C F P +S + + T IT + L N++
Sbjct: 461 CFAFA-----PTSSSLSIIGNVQQQGTRIT---YDLSKNVI 493
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 55.8 bits (133), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 94/423 (22%), Positives = 160/423 (37%), Gaps = 71/423 (16%)
Query: 27 AQTSFRPKALV----LPITKDVTS-SLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCE- 80
+ SFR ++ LP+ D + S+ Y T+IK +P + +D G LWVNC
Sbjct: 49 SHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAP 108
Query: 81 ----------------NRQYVSSTFKPARCGSSQCSLFGLTG-CSGDKICGRSPSNTVTG 123
SST K C CS + C K C G
Sbjct: 109 CPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYG-DG 167
Query: 124 VSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFICGSKVVQNG----LAKGVTGMAGLGR 179
+S GD D +++ G T ++ +F CG Q+G V G+ G G+
Sbjct: 168 STSDGDFIKDNITLEQVTGNLRTAPLA-QEVVFGCGKN--QSGQLGQTDSAVDGIMGFGQ 224
Query: 180 TRVSLPSQFSSAFSFHRKFAICLTANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPV 239
+ S+ SQ ++ S R F+ CL +G G + + + S V+ TP++ N
Sbjct: 225 SNTSIISQLAAGGSTKRIFSHCLDNMNGG--------GIFAVGEVESPVVKTTPIVPN-- 274
Query: 240 STAPSAFLGEPSVEYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETT 299
V Y + +K + V + L +L S NG GGT I + +
Sbjct: 275 -----------QVHYNVILKGMDVDGDPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQN 321
Query: 300 IYKAVADAFVKSLGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGA 359
+Y ++ + + + + V CF+ S P ++L ++ ++ +
Sbjct: 322 LYNSLIEK-ITAKQQVKLHMVQETFACFSFT----SNTDKAFPVVNLHFEDSLKLSVYPH 376
Query: 360 NSMVQF-DDVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASR 418
+ + +D+ C G+ G +Q G I +G L N L+ +DL
Sbjct: 377 DYLFSLREDMYCFGWQSGG---MTTQDG--------ADVILLGDLVLSNKLVVYDLENEV 425
Query: 419 LGF 421
+G+
Sbjct: 426 IGW 428
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 55.1 bits (131), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 143/350 (40%), Gaps = 55/350 (15%)
Query: 44 VTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCENRQYV------------SSTFKP 91
+TS+ +Y+ I TP VP+ D G +W C + SST++
Sbjct: 79 ITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRK 138
Query: 92 ARCGSSQCSLFGLTGCSGDKICGRSPSNTVT-GVSSY--GDIHSDVVSVNSTDGTTPTKV 148
C SSQC CS D+ + S T+T G +SY GD+ D V++ G++ +
Sbjct: 139 VSCSSSQCRALEDASCSTDE---NTCSYTITYGDNSYTKGDVAVDTVTM----GSSGRRP 191
Query: 149 VSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICL---TAN 205
VS+ N + CG + +G+ GLG SL SQ S + KF+ CL T+
Sbjct: 192 VSLRNMIIGCGHENT-GTFDPAGSGIIGLGGGSTSLVSQLRK--SINGKFSYCLVPFTSE 248
Query: 206 SGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKVSE 265
+G + FG +++ + S +P+ YF+ +++I V
Sbjct: 249 TGLTSKINFGTN---------------GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGS 293
Query: 266 KNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPTV-SPVAPFG 324
K + +T+ G G I + T++ + Y + ++ A V P
Sbjct: 294 KKIQFTSTIFG---TGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILS 350
Query: 325 TCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICLGF 373
C+ +D S + VP I + + G + + N+ V +DV C F
Sbjct: 351 LCY--RDSSSFK----VPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAF 393
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 53.9 bits (128), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 98/413 (23%), Positives = 161/413 (38%), Gaps = 67/413 (16%)
Query: 36 LVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNC--------ENRQY--- 84
LV + +T +Y + +P L LD G W+ C +N +
Sbjct: 155 LVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDP 214
Query: 85 -VSSTFKPARCGSSQCSLFGLTG----CSGDKICGRSPSNTVTGVSSY--GDIHSDVVSV 137
S+++K C +C+L C D P G SS GD + +V
Sbjct: 215 KASASYKNITCNDQRCNLVSSPDPPMPCKSDN--QSCPYYYWYGDSSNTTGDFAVETFTV 272
Query: 138 NSTDGTTPTKVVSVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRK 197
N T +++ +V N +F CG GL G G+ GLGR +S SQ S +
Sbjct: 273 NLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYG--HS 328
Query: 198 FAICL---TANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEY 254
F+ CL +++ + FG+ L+ ++ N V T Y
Sbjct: 329 FSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTF-----------Y 377
Query: 255 FIGVKSIKVSEK--NVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSL 312
++ +KSI V+ + N+P T +I+ +G GGT I + + Y+ F+K+
Sbjct: 378 YVQIKSILVAGEVLNIPEET--WNISSDGAGGTIIDSGTTLSYFAEPAYE-----FIKNK 430
Query: 313 GAPTVSPVAPFGTCFATKDISFSRIGPG---VPAIDLVLQNGVEWPIIGANSMVQF-DDV 368
A P F D F+ G +P + + +G W NS + +D+
Sbjct: 431 IAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDL 490
Query: 369 ICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
+CL + PK++ SI IG +Q +N + +D SRLG+
Sbjct: 491 VCLAML---GTPKSA------------FSI-IGNYQQQNFHILYDTKRSRLGY 527
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 52.8 bits (125), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 90/399 (22%), Positives = 157/399 (39%), Gaps = 72/399 (18%)
Query: 51 YITQIKQRTPLVPVKLTLDLGGGYLWVNCEN---------RQYVSSTFKPARCGSSQCSL 101
Y T+++ TP + +D G LWV+C + Q + F P GSS +
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDP---GSSVTA- 136
Query: 102 FGLTGCSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFLFICGSK 161
CS D+ C ++ +G S ++ + GT+ V V F I GS
Sbjct: 137 -SPISCS-DQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSS 194
Query: 162 VVQNGLA------------------KGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLT 203
+V N A + V G+ G G+ +S+ SQ +S R F+ CL
Sbjct: 195 LVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLK 254
Query: 204 ANSGADGVMFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSIKV 263
+G G++ G + V + +TPL+ PS +P Y + + SI V
Sbjct: 255 GENGGGGILVLG-------EIVEPNMVFTPLV-------PS----QP--HYNVNLLSISV 294
Query: 264 SEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLGAPTVSPVAPF 323
+ + +P+N ++ S NG GT I T + Y +A ++ + +V PV
Sbjct: 295 NGQALPINPSVFS-TSNG-QGTIIDTGTTLAYLSEAAYVPFVEAITNAV-SQSVRPVVSK 351
Query: 324 GT-CFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQFDDVICLGFVDAGSNPKA 382
G C+ + +G P + L G + + ++Q ++V
Sbjct: 352 GNQCYVIT----TSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNV---------GGTAV 398
Query: 383 SQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
+GF + +T +G L++ + +DL R+G+
Sbjct: 399 WCIGFQRIQNQGIT--ILGDLVLKDKIFVYDLVGQRIGW 435
>AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108781-16110679 REVERSE LENGTH=425
Length = 425
Score = 49.3 bits (116), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 85/213 (39%), Gaps = 27/213 (12%)
Query: 35 ALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCEN-----RQYVSSTF 89
++V P+ +V L Y I P P L LD G W+ C+ + +
Sbjct: 45 SVVFPVHGNVYP-LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 103
Query: 90 KPAR----CGSSQCSLFGLTG---CSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDG 142
+P+ C C L C + C G SS G + DV S+N T G
Sbjct: 104 QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYE-VEYADGGSSLGVLVRDVFSMNYTQG 162
Query: 143 TTPTKVVSVPNFLFICGSKVVQNGLAKG-VTGMAGLGRTRVSLPSQFSSAFSFHRKFAIC 201
T P CG + + + G+ GLGR +VS+ SQ S C
Sbjct: 163 LRLT-----PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHC 217
Query: 202 LTANSGADGVMFFGDGPYNLNQDVSKVLTYTPL 234
L++ G G++FFGD Y D S+V ++TP+
Sbjct: 218 LSSLGG--GILFFGDDLY----DSSRV-SWTPM 243
>AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108928-16110670 REVERSE LENGTH=401
Length = 401
Score = 48.5 bits (114), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 85/213 (39%), Gaps = 27/213 (12%)
Query: 35 ALVLPITKDVTSSLPQYITQIKQRTPLVPVKLTLDLGGGYLWVNCEN-----RQYVSSTF 89
++V P+ +V L Y I P P L LD G W+ C+ + +
Sbjct: 42 SVVFPVHGNVYP-LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 100
Query: 90 KPAR----CGSSQCSLFGLTG---CSGDKICGRSPSNTVTGVSSYGDIHSDVVSVNSTDG 142
+P+ C C L C + C G SS G + DV S+N T G
Sbjct: 101 QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYE-VEYADGGSSLGVLVRDVFSMNYTQG 159
Query: 143 TTPTKVVSVPNFLFICGSKVVQNGLAKG-VTGMAGLGRTRVSLPSQFSSAFSFHRKFAIC 201
T P CG + + + G+ GLGR +VS+ SQ S C
Sbjct: 160 LRLT-----PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHC 214
Query: 202 LTANSGADGVMFFGDGPYNLNQDVSKVLTYTPL 234
L++ G G++FFGD Y D S+V ++TP+
Sbjct: 215 LSSLGG--GILFFGDDLY----DSSRV-SWTPM 240