Miyakogusa Predicted Gene
- Lj0g3v0160329.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0160329.1 Non Characterized Hit- tr|I1LMG6|I1LMG6_SOYBN
Uncharacterized protein OS=Glycine max PE=3 SV=1,78.93,0,no
description,Peptidase aspartic, catalytic; Acid proteases,Peptidase
aspartic; Asp,Peptidase A1; s,CUFF.9946.1
(449 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments: Score (bits) E Value
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 596 e-170
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 496 e-140
AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family pr... 256 2e-68
AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family pr... 248 5e-66
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 150 2e-36
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 130 3e-30
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 127 1e-29
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 126 3e-29
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 118 8e-27
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 117 1e-26
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 5e-25
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 6e-25
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 108 6e-24
AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 106 3e-23
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 103 3e-22
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 3e-21
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 7e-21
AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 98 1e-20
AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 97 2e-20
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 97 2e-20
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 96 5e-20
AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family pr... 96 5e-20
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 96 6e-20
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 94 1e-19
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 94 2e-19
AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 94 2e-19
AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 5e-19
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 92 9e-19
AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family pr... 91 2e-18
AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family pr... 90 4e-18
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 90 4e-18
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 89 4e-18
AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family pr... 89 6e-18
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 89 8e-18
AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 2e-17
AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 86 4e-17
AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 2e-15
AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 2e-15
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 3e-15
AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 79 7e-15
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 79 9e-15
AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family pr... 77 2e-14
AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family pr... 76 4e-14
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 76 5e-14
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 74 2e-13
AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 68 1e-11
AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family pr... 67 2e-11
AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 2e-11
AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family pr... 67 3e-11
AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 66 4e-11
AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 66 6e-11
AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 64 2e-10
AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family pr... 63 4e-10
AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family pr... 63 4e-10
AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 61 2e-09
AT5G24820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 57 2e-08
AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family pr... 57 2e-08
AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family pr... 55 7e-08
AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family pr... 54 3e-07
AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 53 4e-07
AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 53 4e-07
AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family pr... 52 6e-07
AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr... 49 7e-06
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 596 bits (1536), Expect = e-170, Method: Compositional matrix adjust.
Identities = 286/417 (68%), Positives = 342/417 (82%), Gaps = 12/417 (2%)
Query: 33 ASPQTLLILPLKVQTHPHGSVSIPIPSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGS 92
+S L+ LK Q +P SS KLSF+HNVTLTV+L VG PPQ+++MVLDTGS
Sbjct: 34 SSTNQTLLFSLKTQ-------KLPQSSSDKLSFRHNVTLTVTLAVGDPPQNISMVLDTGS 86
Query: 93 ELSWLHCKKLPNLNSVFNPQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDPK-NLCHATVS 151
ELSWLHCKK PNL SVFNP SS+Y+P PC+SP+C+TRTRD PIP SCDPK +LCH +S
Sbjct: 87 ELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAIS 146
Query: 152 YADATSIEGNLATETFFVAGSPQPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVA 211
YADATSIEGNLA ETF + +PGT FGCMDSG +SN++ED+K+TGLMGMNRGSLSFV
Sbjct: 147 YADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVN 206
Query: 212 QMGLPKFSYCISGSDSSGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGI 271
Q+G KFSYCISGSDSSG LL GDA ++WLGP++YTP+V +STPLPYFDRVAYTV+L+GI
Sbjct: 207 QLGFSKFSYCISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGI 266
Query: 272 RVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDD 331
RVG K+L L KS+FVPDHTG+GQTMVDSGTQFTFL+GPVY AL+ EF+ QTK VL L+DD
Sbjct: 267 RVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDD 326
Query: 332 PNFVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSE 391
P+FVFQG MDLCY+VGS + V+L+F GAEMSVSG++LLY+V + A ++G E
Sbjct: 327 PDFVFQGTMDLCYKVGSTTRPNFSGLPM-VSLMFRGAEMSVSGQKLLYRV-NGAGSEGKE 384
Query: 392 DTVYCFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFA-DTRCELASQRLGM 447
+ VYCFTFGNS+L+GIEA+VIGHHHQQNVWMEFDL SRVGFA + RC+LASQRLG+
Sbjct: 385 E-VYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRCDLASQRLGL 440
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 496 bits (1277), Expect = e-140, Method: Compositional matrix adjust.
Identities = 254/416 (61%), Positives = 311/416 (74%), Gaps = 13/416 (3%)
Query: 39 LILPLKVQTHPHGSVSIPIPSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLH 98
L+LPLK + P + KL F HNVTLTV+LTVG+PPQ+++MV+DTGSELSWL
Sbjct: 46 LVLPLKTRITPTDHRP-----TDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLR 100
Query: 99 CKKLPNLNSV--FNPQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADAT 156
C + N N V F+P SSSY+P PC+SP C+TRTRDF IP SCD LCHAT+SYADA+
Sbjct: 101 CNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADAS 160
Query: 157 SIEGNLATETFFVAGSPQPGT-TFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGL 215
S EGNLA E F S FGCM S S+ +ED+KTTGL+GMNRGSLSF++QMG
Sbjct: 161 SSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGF 220
Query: 216 PKFSYCISGSDS-SGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVG 274
PKFSYCISG+D G LL GD+ F WL PL YTP+++ STPLPYFDRVAYTV+L GI+V
Sbjct: 221 PKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVN 280
Query: 275 KKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNF 334
KLL + KS+ VPDHTG+GQTMVDSGTQFTFLLGPVY ALR F+ +T G+LT+ +DP+F
Sbjct: 281 GKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDF 340
Query: 335 VFQGAMDLCYRVGSNR-KSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDT 393
VFQG MDLCYR+ R +S V+LVFEGAE++VSG+ LLY+V + D+
Sbjct: 341 VFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGN---DS 397
Query: 394 VYCFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCELASQRLGMGS 449
VYCFTFGNS+L+G+EAYVIGHHHQQN+W+EFDL SR+G A C+++ QRLG+GS
Sbjct: 398 VYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVSGQRLGIGS 453
>AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24647221-24648513 FORWARD LENGTH=430
Length = 430
Score = 256 bits (655), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 154/395 (38%), Positives = 210/395 (53%), Gaps = 41/395 (10%)
Query: 58 PSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHC--KKLP-NLNSVFNPQLS 114
P + + F++++ L +SL +G+PPQ+ MVLDTGS+LSW+ C KKLP + F+P LS
Sbjct: 59 PYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLS 118
Query: 115 SSYNPTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETF-FVAGSP 173
SS++ PC+ P+CK R DF +P SCD LCH + YAD T EGNL E F
Sbjct: 119 SSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEI 178
Query: 174 QPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCI------SGSDS 227
P GC A E S G++GMNRG LSFV+Q + KFSYCI G
Sbjct: 179 TPPLILGC--------ATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTP 230
Query: 228 SGVLLFGDAK----FAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKS 283
+G GD F ++ L + ES +P D +AYTV + GIR G K L + S
Sbjct: 231 TGSFYLGDNPNSHGFKYVSLLTF----PESQRMPNLDPLAYTVPMIGIRFGLKKLNISGS 286
Query: 284 IFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLC 343
+F PD GSGQTMVDSG++FT L+ Y +R E + + L +V+ G D+C
Sbjct: 287 VFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRL----KKGYVYGGTADMC 342
Query: 344 YRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSE 403
+ N V + G E+ V ER+L VG ++C G S
Sbjct: 343 F--DGNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNVG---------GGIHCVGIGRSS 391
Query: 404 LVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
++G + +IG+ HQQN+W+EFD+ N RVGFA C
Sbjct: 392 MLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426
>AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14912862-14914190 FORWARD LENGTH=442
Length = 442
Score = 248 bits (634), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 153/409 (37%), Positives = 210/409 (51%), Gaps = 53/409 (12%)
Query: 56 PIPSSRKLSFQHNV----TLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNL------ 105
P P S +F+ N+ L +SL +G+P QS +VLDTGS+LSW+ C
Sbjct: 61 PSPPSSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPP 120
Query: 106 NSVFNPQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATE 165
+ F+P LSSS++ PC+ P+CK R DF +P SCD LCH + YAD T EGNL E
Sbjct: 121 TTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKE 180
Query: 166 TF-FVAGSPQPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCI-- 222
F F P GC A E + G++GMN G LSF++Q + KFSYCI
Sbjct: 181 KFTFSNSQTTPPLILGC--------AKESTDEKGILGMNLGRLSFISQAKISKFSYCIPT 232
Query: 223 ----SGSDSSGVLLFGD----AKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVG 274
G S+G GD F ++ L + +S +P D +AYTV LQGIR+G
Sbjct: 233 RSNRPGLASTGSFYLGDNPNSRGFKYVSLLTF----PQSQRMPNLDPLAYTVPLQGIRIG 288
Query: 275 KKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNF 334
+K L + S+F PD GSGQTMVDSG++FT L+ Y ++EE V L +
Sbjct: 289 QKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRL----KKGY 344
Query: 335 VFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFE---GAEMSVSGERLLYKVGDVAAAKGSE 391
V+ D+C+ + LVFE G E+ V + LL VG
Sbjct: 345 VYGSTADMCF----DGNHSMEIGRLIGDLVFEFGRGVEILVEKQSLLVNVG--------- 391
Query: 392 DTVYCFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCEL 440
++C G S ++G + +IG+ HQQN+W+EFD+ N RVGF+ C L
Sbjct: 392 GGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAECRL 440
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 150 bits (378), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 180/383 (46%), Gaps = 50/383 (13%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNL----NSVFNPQLSSSYNPTPCTSPVCK 128
+ L++G+P + ++DTGS+L W CK +F+P+ SSSY+ C+S +C
Sbjct: 109 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 168
Query: 129 TRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETF-FVAGSPQPGTTFGCMDSGFT 187
P + K+ C +Y D +S G LATETF F + G FGC G
Sbjct: 169 A----LPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGC---GVE 221
Query: 188 SNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYC---ISGSDSSGVLLFGDAKFAWL--- 241
+ D S+ +GL+G+ RG LS ++Q+ KFSYC I S++S L G +
Sbjct: 222 NEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKT 281
Query: 242 -----GPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTM 296
G + T + + P F Y + LQGI VG K L +EKS F G+G +
Sbjct: 282 GASLDGEVTKTMSLLRNPDQPSF----YYLELQGITVGAKRLSVEKSTFELAEDGTGGMI 337
Query: 297 VDSGTQFTFLLGPVYKALREEFVAQTKGVLTL-LDDPNFVFQGAMDLCYRVGSNRKSXXX 355
+DSGT T+L +K L+EEF ++ ++L +DD +DLC+++ K+
Sbjct: 338 IDSGTTITYLEETAFKVLKEEFTSR----MSLPVDDSGST---GLDLCFKLPDAAKN--- 387
Query: 356 XXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHH 415
+ F+GA++ + GE Y V D + V C G+S + I G+
Sbjct: 388 IAVPKMIFHFKGADLELPGEN--YMVADSSTG------VLCLAMGSSNGMSI----FGNV 435
Query: 416 HQQNVWMEFDLVNSRVGFADTRC 438
QQN + DL V F T C
Sbjct: 436 QQQNFNVLHDLEKETVSFVPTEC 458
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 130 bits (326), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 117/393 (29%), Positives = 173/393 (44%), Gaps = 49/393 (12%)
Query: 54 SIPIPSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS--VFNP 111
S+PI S R + + T V +G+P Q + + LDT ++ +W+ C +S +F+P
Sbjct: 73 SVPIASGRAI--VQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDP 130
Query: 112 QLSSSYNPTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAG 171
SSS C +P CK P P SC C ++Y +T IE L +T +A
Sbjct: 131 SKSSSSRTLQCEAPQCK----QAPNP-SCTVSKSCGFNMTYGGST-IEAYLTQDTLTLAS 184
Query: 172 SPQPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQ---MGLPKFSYCISGSDSS 228
P TFGC++ + GLMG+ RG LS ++Q + FSYC+ S SS
Sbjct: 185 DVIPNYTFGCINKA----SGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSS 240
Query: 229 GVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVA--YTVRLQGIRVGKKLLQLEKSIFV 286
G + LGP + P+ ++TPL R + Y V L GIRVG K++ + S
Sbjct: 241 N--FSGSLR---LGP-KNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALA 294
Query: 287 PDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRV 346
D T+ DSGT +T L+ P Y A+R EF + K + N G D CY
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVK-------NANATSLGGFDTCY-- 345
Query: 347 GSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSEL-V 405
+VT +F G +++ + LL S + C + + V
Sbjct: 346 ------SGSVVFPSVTFMFAGMNVTLPPDNLLI--------HSSAGNLSCLAMAAAPVNV 391
Query: 406 GIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
VI QQN + D+ NSR+G + C
Sbjct: 392 NSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 127 bits (320), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 117/395 (29%), Positives = 178/395 (45%), Gaps = 56/395 (14%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLN-----SVFNPQLSSSYNPTPCTSPVC 127
V L +G PPQS+ ++ DTGS+L W+ C N + +VF P+ SS+++P C PVC
Sbjct: 86 VDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVC 145
Query: 128 KTRTRDFPIPVSCDPKNL---CHATVSYADATSIEGNLATETFFVAGSPQP-----GTTF 179
+ + P+ C+ + CH YAD + G A ET + S F
Sbjct: 146 RLVPKPDRAPI-CNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAF 204
Query: 180 GCMDSGFTSNADEDSKTT-----GLMGMNRGSLSFVAQMGLP---KFSYCIS----GSDS 227
GC GF + S T+ G+MG+ RG +SF +Q+G KFSYC+
Sbjct: 205 GC---GFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP 261
Query: 228 SGVLLFGDAKFAWLGPLRYTPMVKESTPL-PYFDRVAYTVRLQGIRVGKKLLQLEKSIFV 286
+ L+ G+ + L +TP++ + PL P F Y V+L+ + V L+++ SI+
Sbjct: 262 TSYLIIGNGG-DGISKLFFTPLL--TNPLSPTF----YYVKLKSVFVNGAKLRIDPSIWE 314
Query: 287 PDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRV 346
D +G+G T+VDSGT FL P Y+++ + K + P F DLC V
Sbjct: 315 IDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGF------DLCVNV 368
Query: 347 GSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSE-LV 405
K + F G + V R + +E+ + C + + V
Sbjct: 369 SGVTKPEKILPR--LKFEFSGGAVFVPPPRNYF--------IETEEQIQCLAIQSVDPKV 418
Query: 406 GIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCEL 440
G VIG+ QQ EFD SR+GF+ C L
Sbjct: 419 GFS--VIGNLMQQGFLFEFDRDRSRLGFSRRGCAL 451
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 126 bits (316), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 174/376 (46%), Gaps = 49/376 (13%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS----VFNPQLSSSYNPTPCTSPVCKTR 130
L VG+P + V MVLDTGS++ WL C S +F+P+ S +Y PC+SP C+
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRL 205
Query: 131 TRDFPIPVSCDP-KNLCHATVSYADATSIEGNLATETFFVAGSPQPGTTFGCMDSGFTSN 189
C+ + C VSY D + G+ +TET + G GC N
Sbjct: 206 D-----SAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGH----DN 256
Query: 190 ADEDSKTTGLMGMNRGSLSFVAQMGL---PKFSYCI---SGSDSSGVLLFGDAKFAWLGP 243
GL+G+ +G LSF Q G KFSYC+ S S ++FG+A + +
Sbjct: 257 EGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA- 315
Query: 244 LRYTPMVKESTPLPYFDRVAYTVRLQGIRV-GKKLLQLEKSIFVPDHTGSGQTMVDSGTQ 302
R+TP++ P D Y V L GI V G ++ + S+F D G+G ++DSGT
Sbjct: 316 -RFTPLLSN----PKLD-TFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTS 369
Query: 303 FTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXXXAVT 362
T L+ P Y A+R+ F G TL P+F D C+ + + + V
Sbjct: 370 VTRLIRPAYIAMRDAFRV---GAKTLKRAPDFSL---FDTCFDLSNMNE----VKVPTVV 419
Query: 363 LVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHHQQNVWM 422
L F GA++S+ L V + +CF F + + G+ +IG+ QQ +
Sbjct: 420 LHFRGADVSLPATNYLIPV--------DTNGKFCFAFAGT-MGGLS--IIGNIQQQGFRV 468
Query: 423 EFDLVNSRVGFADTRC 438
+DL +SRVGFA C
Sbjct: 469 VYDLASSRVGFAPGGC 484
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 118 bits (296), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 112/396 (28%), Positives = 174/396 (43%), Gaps = 46/396 (11%)
Query: 54 SIPIPSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWL---HCKKLPNLNSVFN 110
S+P+ S +L H V +G+PPQ + MVLDT ++ WL C N ++ FN
Sbjct: 90 SVPVASGNQL---HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFN 146
Query: 111 PQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVA 170
SS+Y+ C++ C T+ R P S ++C SY +S +L +T +A
Sbjct: 147 TNSSSTYSTVSCSTAQC-TQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLA 205
Query: 171 GSPQPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQ---MGLPKFSYCISGSDS 227
P +FGC++S + GLMG+ RG +S V+Q + FSYC+ S
Sbjct: 206 PDVIPNFSFGCINSA----SGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRS 261
Query: 228 SGVLLFGDAKFAWLG---PLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSI 284
G K LG +RYTP+++ P + Y V L G+ VG + ++
Sbjct: 262 --FYFSGSLKLGLLGQPKSIRYTPLLRN----PRRPSL-YYVNLTGVSVGSVQVPVDPVY 314
Query: 285 FVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCY 344
D T++DSGT T PVY+A+R+EF Q ++ +F GA D C+
Sbjct: 315 LTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQ-------VNVSSFSTLGAFDTCF 367
Query: 345 RVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTF-GNSE 403
+ + +TL ++ + E L S T+ C + G +
Sbjct: 368 SADNENVAPK------ITLHMTSLDLKLPMENTLI--------HSSAGTLTCLSMAGIRQ 413
Query: 404 LVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCE 439
VI + QQN+ + FD+ NSR+G A C
Sbjct: 414 NANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 117 bits (294), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 160/369 (43%), Gaps = 46/369 (12%)
Query: 77 VGSPPQSVTMVLDTGSELSWLHCKKLPNL----NSVFNPQLSSSYNPTPCTSPVCKTRTR 132
+G P + V MVLDTGS+++WL C + +F P SSSY P C +P C
Sbjct: 154 IGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNA--- 210
Query: 133 DFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQPGTTFGCMDSGFTSNADE 192
+ VS C VSY D + G+ ATET + + GC S N
Sbjct: 211 ---LEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVGCGHS----NEGL 263
Query: 193 DSKTTGLMGMNRGSLSFVAQMGLPKFSYCI--SGSDSSGVLLFGDAKFAWLGPLRYTPMV 250
GL+G+ G L+ +Q+ FSYC+ SDS+ + FG + L P +
Sbjct: 264 FVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVDFGTS----LSPDAVVAPL 319
Query: 251 KESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPV 310
+ L F Y + L GI VG +LLQ+ +S F D +GSG ++DSGT T L +
Sbjct: 320 LRNHQLDTF----YYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEI 375
Query: 311 YKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEM 370
Y +LR+ FV KG L L D CY N + V F G +M
Sbjct: 376 YNSLRDSFV---KGTLDLEKAAGVAM---FDTCY----NLSAKTTVEVPTVAFHFPGGKM 425
Query: 371 -SVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNS 429
++ + + V V +C F + +IG+ QQ + FDL NS
Sbjct: 426 LALPAKNYMIPVDSVG--------TFCLAFAPT---ASSLAIIGNVQQQGTRVTFDLANS 474
Query: 430 RVGFADTRC 438
+GF+ +C
Sbjct: 475 LIGFSSNKC 483
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 112 bits (280), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 121/422 (28%), Positives = 186/422 (44%), Gaps = 81/422 (19%)
Query: 60 SRKLSFQHNVTLT--------------VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNL 105
SR F H ++ T +S+T+G+PP V + DTGS+L+W+ CK
Sbjct: 60 SRSRRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQC 119
Query: 106 NS----VFNPQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDP-KNLCHATVSYADATSIEG 160
+F+ + SS+Y PC S C+ + CD N+C SY D + +G
Sbjct: 120 YKENGPIFDKKKSSTYKSEPCDSRNCQALSS---TERGCDESNNICKYRYSYGDQSFSKG 176
Query: 161 NLATETFFV---AGSPQ--PGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMG- 214
++ATET + +GSP PGT FGC G+ + D +G++G+ G LS ++Q+G
Sbjct: 177 DVATETVSIDSASGSPVSFPGTVFGC---GYNNGGTFDETGSGIIGLGGGHLSLISQLGS 233
Query: 215 --LPKFSYCIS----GSDSSGVLLFGD----AKFAWLGPLRYTPMVKESTPLPYFDRVAY 264
KFSYC+S ++ + V+ G + + + TP+V + PL Y Y
Sbjct: 234 SISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLV-DKEPLTY-----Y 287
Query: 265 TVRLQGIRVGKKLLQLEKSIFVPDHTG-----SGQTMVDSGTQFTFLLGPVYKALR---E 316
+ L+ I VGKK + S + P+ G SG ++DSGT T L + E
Sbjct: 288 YLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVE 347
Query: 317 EFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGER 376
E V K V DP QG + C++ GS +T+ F GA++ +S
Sbjct: 348 ESVTGAKRV----SDP----QGLLSHCFKSGSAE-----IGLPEITVHFTGADVRLSPIN 394
Query: 377 LLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADT 436
K+ SED V C S + E + G+ Q + + +DL V F
Sbjct: 395 AFVKL--------SEDMV-CL----SMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQHM 441
Query: 437 RC 438
C
Sbjct: 442 DC 443
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 112 bits (280), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 169/397 (42%), Gaps = 46/397 (11%)
Query: 51 GSVSIPIPSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNL--NSV 108
G +PI S R++ + T V +G+P Q + + +DT S+++W+ C N+
Sbjct: 97 GRSVVPIASGRQM--LQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA 154
Query: 109 FNPQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFF 168
F+P S+S+ C++P CK P P +C + C ++Y ++SI NL+ +T
Sbjct: 155 FSPAKSTSFKNVSCSAPQCK----QVPNP-TCGAR-ACSFNLTYG-SSSIAANLSQDTIR 207
Query: 169 VAGSPQPGTTFGCMDS-GFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCISGSDS 227
+A P TFGC++ GL +S + FSYC+ S
Sbjct: 208 LAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRS 267
Query: 228 SGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVA-YTVRLQGIRVGKKLLQLEKSIFV 286
L F + LGP VK + L R + Y V L IRVG+K++ L +
Sbjct: 268 ---LTFSGS--LRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIA 322
Query: 287 PDHTGSGQTMVDSGTQFTFLLGPVYKALREEF---VAQTKGVLTLLDDPNFVFQGAMDLC 343
+ + T+ DSGT +T L PVY+A+R EF V T V+T L G D C
Sbjct: 323 FNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSL--------GGFDTC 374
Query: 344 YRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNS- 402
Y +T +F+G M++ + L+ + + C +
Sbjct: 375 Y--------SGQVKVPTITFMFKGVNMTMPADNLML--------HSTAGSTSCLAMAAAP 418
Query: 403 ELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCE 439
E V VI QQN + D+ N R+G A RC
Sbjct: 419 ENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 108 bits (271), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 108/412 (26%), Positives = 159/412 (38%), Gaps = 80/412 (19%)
Query: 72 TVSLTVGSPPQSVTMVLDTGSELSWLHCKKL------------PNLNSVFNPQLSSSYNP 119
+VSL+ G+P Q++ V DTGS L WL C P L F P+ SSS
Sbjct: 91 SVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKI 150
Query: 120 TPCTSPVCKTRTRDFPIPVSCDPKNL-----CHATVSYADATSIEGNLATETFFVAGSPQ 174
C SP C+ CDP C + S G L TE
Sbjct: 151 IGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLTV 210
Query: 175 PGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCI------------ 222
P GC + + G+ G RG +S +QM L +FS+C+
Sbjct: 211 PDFVVGC-------SIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFDDTNVTT 263
Query: 223 -----------SGSDSSGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGI 271
SGS + G+ P R P V L Y Y + L+ I
Sbjct: 264 DLDLDTGSGHNSGSKTPGLTY---------TPFRKNPNVSNKAFLEY-----YYLNLRRI 309
Query: 272 RVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDD 331
VG+K +++ P G G ++VDSG+ FTF+ PV++ + EEF +Q + +
Sbjct: 310 YVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQ---MSNYTRE 366
Query: 332 PNFVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSE 391
+ + + C+ + L+FE G +L + + G+
Sbjct: 367 KDLEKETGLGPCFNISGK------GDVTVPELIFEFK----GGAKLELPLSNYFTFVGNT 416
Query: 392 DTVYCFTFGNSELVGIE-----AYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
DTV C T + + V A ++G QQN +E+DL N R GFA +C
Sbjct: 417 DTV-CLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:22880074-22881525 REVERSE LENGTH=483
Length = 483
Score = 106 bits (265), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 170/382 (44%), Gaps = 55/382 (14%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHC---KKLPN-LNSVFNPQLSSSYNPTPCTSPVCKTR 130
L VG+P +V MVLDTGS++ WL C K N +++F+P+ S ++ PC S +C+ R
Sbjct: 139 LGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCR-R 197
Query: 131 TRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQPGTTFGCMDSGFTSNA 190
D V+ K C VSY D + EG+ +TET G+ GC N
Sbjct: 198 LDDSSECVTRRSKT-CLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHD----NE 252
Query: 191 DEDSKTTGLMGMNRGSLSFVAQMGLP---KFSYCI-------SGSDSSGVLLFGDAKFAW 240
GL+G+ RG LSF +Q KFSYC+ S S ++FG+A
Sbjct: 253 GLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAA--- 309
Query: 241 LGPLRYTPMVKESTPL---PYFDRVAYTVRLQGIRVG-KKLLQLEKSIFVPDHTGSGQTM 296
P TPL P D Y ++L GI VG ++ + +S F D TG+G +
Sbjct: 310 ------VPKTSVFTPLLTNPKLDTFYY-LQLLGISVGGSRVPGVSESQFKLDATGNGGVI 362
Query: 297 VDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXX 356
+DSGT T L P Y ALR+ F G L P++ D C+ +
Sbjct: 363 IDSGTSVTRLTQPAYVALRDAF---RLGATKLKRAPSYSL---FDTCFDL----SGMTTV 412
Query: 357 XXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHH 416
V F G E+S+ L V + + +CF F + +G +IG+
Sbjct: 413 KVPTVVFHFGGGEVSLPASNYLIPV--------NTEGRFCFAFAGT--MG-SLSIIGNIQ 461
Query: 417 QQNVWMEFDLVNSRVGFADTRC 438
QQ + +DLV SRVGF C
Sbjct: 462 QQGFRVAYDLVGSRVGFLSRAC 483
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 103 bits (257), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 171/383 (44%), Gaps = 59/383 (15%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNL----NSVFNPQLSSSYNPTPCTSPVCK 128
+++++G+PP + + DTGS+L W C + + +F+P+ SS+Y C+S C+
Sbjct: 88 MNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQCR 147
Query: 129 TRTRDFPIPVSCD-PKNLCHATVSYADATSIEGNLATETFFVAGSPQ-----PGTTFGCM 182
SC +N C T++Y D + +G++A +T + S + GC
Sbjct: 148 ALE-----DASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGC- 201
Query: 183 DSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLP---KFSYCI----SGSDSSGVLLFGD 235
G + D +G++G+ GS S V+Q+ KFSYC+ S + + + FG
Sbjct: 202 --GHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGT 259
Query: 236 AKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQT 295
+ T MVK+ YF + L+ I VG K +Q +IF TG G
Sbjct: 260 NGIVSGDGVVSTSMVKKDPATYYF------LNLEAISVGSKKIQFTSTIF---GTGEGNI 310
Query: 296 MVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXX 355
++DSGT T L Y L E VA T + DP+ G + LCYR S+ K
Sbjct: 311 VIDSGTTLTLLPSNFYYEL-ESVVASTIKA-ERVQDPD----GILSLCYRDSSSFK---- 360
Query: 356 XXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHH 415
+T+ F+G ++ K+G++ + V CF F +E + I G+
Sbjct: 361 --VPDITVHFKGGDV---------KLGNLNTFVAVSEDVSCFAFAANEQLTI----FGNL 405
Query: 416 HQQNVWMEFDLVNSRVGFADTRC 438
Q N + +D V+ V F T C
Sbjct: 406 AQMNFLVGYDTVSGTVSFKKTDC 428
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 99.8 bits (247), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 160/378 (42%), Gaps = 48/378 (12%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKK-----LPNLNSVFNPQLSSSYNPTPCTSPVC 127
V++ +G+P ++++ DTGS+L+W C+ +FNP S+SY C+S C
Sbjct: 134 VTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAAC 193
Query: 128 KTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQ-PGTTFGCMDSGF 186
+ + SC N C + Y D + G LA E F + S G FGC +
Sbjct: 194 GSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGE--- 249
Query: 187 TSNADEDSKTTGLMGMNRGSLSFVAQMGLPK---FSYCISGSDS-SGVLLFGDAKFAWLG 242
+N + GL+G+ R LSF +Q FSYC+ S S +G L FG A +
Sbjct: 250 -NNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSAGISR-- 306
Query: 243 PLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQ 302
+++TP+ + + Y + + I VG + L + ++F + ++DSGT
Sbjct: 307 SVKFTPISTITDGTSF-----YGLNIVAITVGGQKLPIPSTVF-----STPGALIDSGTV 356
Query: 303 FTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXXXAVT 362
T L Y ALR F A+ + P +D C+ + + V
Sbjct: 357 ITRLPPKAYAALRSSFKAK------MSKYPTTSGVSILDTCFDLSGFK----TVTIPKVA 406
Query: 363 LVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTF-GNSELVGIEAYVIGHHHQQNVW 421
F G + G + ++ V ++ C F GNS+ A + G+ QQ +
Sbjct: 407 FSFSGGAVVELGSKGIFYVFKISQV--------CLAFAGNSD--DSNAAIFGNVQQQTLE 456
Query: 422 MEFDLVNSRVGFADTRCE 439
+ +D RVGFA C
Sbjct: 457 VVYDGAGGRVGFAPNGCS 474
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 99.0 bits (245), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 104/398 (26%), Positives = 170/398 (42%), Gaps = 67/398 (16%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLP-----NLNSVF-NPQLSSSYNPTPCTSPV 126
+ + VG+PP+ +++LDTGS+L+WL C LP + N +F +P+ S+S+ C P
Sbjct: 162 MDVLVGTPPKHFSLILDTGSDLNWLQC--LPCYDCFHQNGMFYDPKTSASFKNITCNDPR 219
Query: 127 CKTRTRDFPIPVSCDPKNL-CHATVSYADATSIEGNLATETFFVAGSPQPGTTFGCMDSG 185
C + P PV C+ N C Y D ++ G+ A ETF V + G +
Sbjct: 220 CSLISSPDP-PVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGS------- 271
Query: 186 FTSNADEDSKTTGLMGMNRG--------------SLSFVAQMGL---PKFSYCI----SG 224
S + G NRG LSF +Q+ FSYC+ S
Sbjct: 272 --SEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSN 329
Query: 225 SDSSGVLLFG-DAKFAWLGPLRYTPMV--KESTPLPYFDRVAYTVRLQGIRVGKKLLQLE 281
++ S L+FG D L +T V KE++ + Y ++++ I VG K L +
Sbjct: 330 TNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETF-----YYIQIKSILVGGKALDIP 384
Query: 282 KSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDD-PNFVFQGAM 340
+ + G G T++DSGT ++ P Y+ ++ +F + K + D P +
Sbjct: 385 EETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFP------VL 438
Query: 341 DLCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFG 400
D C+ V ++ + V +G + E + SED V G
Sbjct: 439 DPCFNVSGIEENNIHLPELGIAFV-DGTVWNFPAENSFIWL--------SEDLVCLAILG 489
Query: 401 NSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
+ +IG++ QQN + +D SR+GF T+C
Sbjct: 490 TPKST---FSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524
>AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:18241003-18242478 FORWARD LENGTH=491
Length = 491
Score = 97.8 bits (242), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 114/424 (26%), Positives = 179/424 (42%), Gaps = 64/424 (15%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSW----------LHCKKLPNLN----SVFNPQLSSSYN 118
++L +G+PPQ+V + LDTGS+L+W + C L N + SVF+P SS+
Sbjct: 85 ITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSF 144
Query: 119 PTPCTSPVC-KTRTRDFPIP----VSCDPKNLCHATV---------SYADATSIEGNLAT 164
C S C + + D P C L +T +Y + I G L
Sbjct: 145 RDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTR 204
Query: 165 ETFFVAGSPQPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPK--FSYC- 221
+ P +FGC+ S + + G+ G RG LS +Q+G + FS+C
Sbjct: 205 DILKARTRDVPRFSFGCVTSTY-------REPIGIAGFGRGLLSLPSQLGFLEKGFSHCF 257
Query: 222 -----ISGSDSSGVLLFGDAKFA--WLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVG 274
++ + S L+ G + + L++TPM+ +TP+ Y + +Y + L+ I +G
Sbjct: 258 LPFKFVNNPNISSPLILGASALSINLTDSLQFTPML--NTPM-YPN--SYYIGLESITIG 312
Query: 275 KKL--LQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDP 332
+ Q+ ++ D G+G +VDSGT +T L P Y L + + +T
Sbjct: 313 TNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQL----LTTLQSTITYPRAT 368
Query: 333 NFVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEMS-VSGERLLYKVGD---VAAAK 388
+ DLCY+V + V ++F ++ LL G+ +A
Sbjct: 369 ETESRTGFDLCYKVPCPNNNLTSLEND-VMMIFPSITFHFLNNATLLLPQGNSFYAMSAP 427
Query: 389 GSEDTVYCFTFGNSELVGI-EAYVIGHHHQQNVWMEFDLVNSRVGFADTRC--ELASQRL 445
V C F N E A V G QQNV + +DL R+GF C E AS L
Sbjct: 428 SDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCVLEAASHGL 487
Query: 446 GMGS 449
GS
Sbjct: 488 NQGS 491
>AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:12033953-12037527 FORWARD LENGTH=756
Length = 756
Score = 97.4 bits (241), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 163/381 (42%), Gaps = 67/381 (17%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS----VFNPQLSSSYNPTPCTSPVCK 128
+ L VG+PP + +DTGS++ W C PN S +F+P SS++ C
Sbjct: 423 MKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNG---- 478
Query: 129 TRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFV---AGSP--QPGTTFGC-M 182
N CH + YAD T +G LATET + +G P T GC +
Sbjct: 479 ---------------NSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGL 523
Query: 183 DSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPK---FSYCISGSDSSGVLLFGDAKFA 239
D+ + S ++G++G+N G LS ++QM LP SYC SG +S + +A A
Sbjct: 524 DNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGTSKINFGTNAIVA 583
Query: 240 WLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDS 299
G + +K+ P Y + L + V L+ ++ P H G +DS
Sbjct: 584 GDGTVAADMFIKKDNPF-------YYLNLDAVSVEDNLI---ATLGTPFHAEDGNIFIDS 633
Query: 300 GTQFTFLLGPV-YKALREEFVAQTKGVLTLLDDPNFVFQGAMD-LCYRVGSNRKSXXXXX 357
GT T+ P+ Y L E V Q V+T + P+ G+ + LCY S
Sbjct: 634 GTTLTYF--PMSYCNLVREAVEQ---VVTAVKVPD---MGSDNLLCYY------SDTIDI 679
Query: 358 XXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHHQ 417
+T+ F G V + +Y + G ++C G ++ + A V G+ Q
Sbjct: 680 FPVITMHFSGGADLVLDKYNMY----LETITGG---IFCLAIGCND-PSMPA-VFGNRAQ 730
Query: 418 QNVWMEFDLVNSRVGFADTRC 438
N + +D ++ + F+ T C
Sbjct: 731 NNFLVGYDPSSNVISFSPTNC 751
Score = 91.3 bits (225), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 121/271 (44%), Gaps = 46/271 (16%)
Query: 65 FQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS----VFNPQLSSSYNPT 120
F +N+ L + L VG+PP + +DTGS+L W C P+ S +F+P SS++N
Sbjct: 77 FDYNIYL-MKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQ 135
Query: 121 PCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFV---AGSP--QP 175
C CH + Y D T +G LATET + +G P
Sbjct: 136 RCHGKS-------------------CHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMA 176
Query: 176 GTTFGC-MDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPK---FSYCISGSDSSGVL 231
TT GC + + N+ S ++G++G+N G S ++QM LP SYC SG +S +
Sbjct: 177 ETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQGTSKIN 236
Query: 232 LFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTG 291
+A A G + +K+ P Y + A +V I +++ P H
Sbjct: 237 FGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRI----------ETLGTPFHAE 286
Query: 292 SGQTMVDSGTQFTFLLGPV-YKALREEFVAQ 321
G ++DSG+ T+ PV Y L + V Q
Sbjct: 287 DGNIVIDSGSTVTYF--PVSYCNLVRKAVEQ 315
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 97.4 bits (241), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 155/374 (41%), Gaps = 53/374 (14%)
Query: 77 VGSPPQSVTMVLDTGSELSWLHCKKLPNL----NSVFNPQLSSSYNPTPCTSPVCKTRTR 132
VG+P + + +VLDTGS+++W+ C+ + + VFNP SS+Y C++P C
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSL--- 224
Query: 133 DFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQ-PGTTFGCMDSGFTSNAD 191
+ S N C VSY D + G LAT+T S + GC N
Sbjct: 225 ---LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHD----NEG 277
Query: 192 EDSKTTGLMGMNRGSLSFVAQMGLPKFSYCISGSDS--SGVLLFGDAKFAWLGPLRYTPM 249
+ GL+G+ G LS QM FSYC+ DS S L F + G P+
Sbjct: 278 LFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLG--GGDATAPL 335
Query: 250 VKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGP 309
++ + Y V L G VG + + L +IF D +GSG ++D GT T L
Sbjct: 336 LRNKKIDTF-----YYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQ 390
Query: 310 VYKALREEFVAQT----KGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVF 365
Y +LR+ F+ T KG ++ D CY + S V F
Sbjct: 391 AYNSLRDAFLKLTVNLKKGSSSI---------SLFDTCY----DFSSLSTVKVPTVAFHF 437
Query: 366 EGAE-MSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHHQQNVWMEF 424
G + + + + L V D +CF F + +IG+ QQ + +
Sbjct: 438 TGGKSLDLPAKNYLIPVDDSG--------TFCFAFAPTS---SSLSIIGNVQQQGTRITY 486
Query: 425 DLVNSRVGFADTRC 438
DL + +G + +C
Sbjct: 487 DLSKNVIGLSGNKC 500
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 95.9 bits (237), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 164/383 (42%), Gaps = 51/383 (13%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHCK---KLPNLNSVFNPQLSSSYNPTPCTSPVCKTRT 131
+ VG+P + +V+DTGSEL+W++C+ + + VF S S+ C + CK
Sbjct: 110 IRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDL 169
Query: 132 RDFPIPVSC-DPKNLCHATVSYADATSIEGNLATETFFVA-----GSPQPGTTFGCMDSG 185
+ +C P C YAD ++ +G A ET V + PG GC S
Sbjct: 170 MNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGC-SSS 228
Query: 186 FTSNADEDSKTTGLMGMNRGSLSFVA---QMGLPKFSYC----ISGSDSSGVLLFGDAKF 238
FT + + + G++G+ SF + + KFSYC +S + S L+FG ++
Sbjct: 229 FTGQSFQGAD--GVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRS 286
Query: 239 AWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVD 298
R TP+ + T +P F Y + + GI +G +L + ++ D T G T++D
Sbjct: 287 TKTAFRRTTPL--DLTRIPPF----YAINVIGISLGYDMLDIPSQVW--DATSGGGTILD 338
Query: 299 SGTQFTFLLGPVYKAL---REEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXX 355
SGT T L YK + ++ + K V P V ++ C+ S
Sbjct: 339 SGTSLTLLADAAYKQVVTGLARYLVELKRV-----KPEGV---PIEYCFSFTSG---FNV 387
Query: 356 XXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHH 415
+T +G R Y V AA G V C F ++ VIG+
Sbjct: 388 SKLPQLTFHLKGGA-RFEPHRKSYLVD---AAPG----VKCLGFVSAGTPATN--VIGNI 437
Query: 416 HQQNVWMEFDLVNSRVGFADTRC 438
QQN EFDL+ S + FA + C
Sbjct: 438 MQQNYLWEFDLMASTLSFAPSAC 460
>AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6978746-6980158 REVERSE LENGTH=470
Length = 470
Score = 95.9 bits (237), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 156/376 (41%), Gaps = 48/376 (12%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCK--KLPNLNS--VFNPQLSSSYNPTPCTSPVCK 128
V + VGSPP+ MV+D+GS++ W+ C+ KL S VF+P S SY C S VC
Sbjct: 133 VRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCD 192
Query: 129 TRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQPGTTFGCMDSGFTS 188
I S C V Y D + +G LA ET A + GC
Sbjct: 193 R------IENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHR---- 242
Query: 189 NADEDSKTTGLMGMNRGSLSFVAQMGLP---KFSYCI--SGSDSSGVLLFGDAKFAWLGP 243
N GL+G+ GS+SFV Q+ F YC+ G+DS+G L+FG + A
Sbjct: 243 NRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFG--REALPVG 300
Query: 244 LRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQF 303
+ P+V+ ++ + + G+R+ L +F TG G ++D+GT
Sbjct: 301 ASWVPLVRNPRAPSFYYVGLKGLGVGGVRI-----PLPDGVFDLTETGDGGVVMDTGTAV 355
Query: 304 TFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXXXAVTL 363
T L Y A R+ F +QT + P D CY + V+
Sbjct: 356 TRLPTAAYVAFRDGFKSQTANL------PRASGVSIFDTCYDL----SGFVSVRVPTVSF 405
Query: 364 VF-EGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHHQQNVWM 422
F EG +++ L V D YCF F S G+ +IG+ Q+ + +
Sbjct: 406 YFTEGPVLTLPARNFLMPVDDSG--------TYCFAFAASP-TGLS--IIGNIQQEGIQV 454
Query: 423 EFDLVNSRVGFADTRC 438
FD N VGF C
Sbjct: 455 SFDGANGFVGFGPNVC 470
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 95.5 bits (236), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 176/392 (44%), Gaps = 57/392 (14%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLN------SVFNPQLSSSYNPTPCTSPV 126
+ + VGSPP+ +++LDTGS+L+W+ C LP + + ++P+ S+SY C
Sbjct: 172 MDVLVGSPPKHFSLILDTGSDLNWIQC--LPCYDCFQQNGAFYDPKASASYKNITCNDQR 229
Query: 127 CK-TRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQPGTT------- 178
C + D P+P D ++ C Y D+++ G+ A ETF V + G++
Sbjct: 230 CNLVSSPDPPMPCKSDNQS-CPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVEN 288
Query: 179 --FGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGL---PKFSYCI----SGSDSSG 229
FGC N GL+G+ RG LSF +Q+ FSYC+ S ++ S
Sbjct: 289 MMFGCGH----WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 344
Query: 230 VLLFGDAKFAWLGP-LRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPD 288
L+FG+ K P L +T V L Y V+++ I V ++L + + +
Sbjct: 345 KLIFGEDKDLLSHPNLNFTSFVAGKENLV---DTFYYVQIKSILVAGEVLNIPEETWNIS 401
Query: 289 HTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDD-PNFVFQGAMDLCYRVG 347
G+G T++DSGT ++ P Y+ ++ + + KG + D P +D C+ V
Sbjct: 402 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFP------ILDPCFNVS 455
Query: 348 SNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGI 407
A +GA + E + +ED V G +
Sbjct: 456 GIHNVQLPELGIAFA---DGAVWNFPTENSFIWL--------NEDLVCLAMLGTPK---- 500
Query: 408 EAY-VIGHHHQQNVWMEFDLVNSRVGFADTRC 438
A+ +IG++ QQN + +D SR+G+A T+C
Sbjct: 501 SAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 94.4 bits (233), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 109/405 (26%), Positives = 182/405 (44%), Gaps = 61/405 (15%)
Query: 55 IPIPSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS----VFN 110
IP+ S KL +L +TV +++++++DTGS+L+W+ C+ + + +++
Sbjct: 122 IPLTSGIKLE-----SLNYIVTVELGGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYD 176
Query: 111 PQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDPKNL-----CHATVSYADATSIEGNLATE 165
P +SSSY C S C+ C N C VSY D + G+LA+E
Sbjct: 177 PSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASE 236
Query: 166 TFFVAGSPQPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQM-----GLPKFSY 220
+ + + FGC +N ++GLMG+ R S+S V+Q G+ FSY
Sbjct: 237 SILLGDTKLENFVFGCG----RNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGV--FSY 290
Query: 221 CISGSD--SSGVLLFGDAKFAWLG--PLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKK 276
C+ + +SG L FG+ + + YTP+V+ P R Y + L G +G
Sbjct: 291 CLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQN----PQL-RSFYILNLTGASIGG- 344
Query: 277 LLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVF 336
++L+ S F G G ++DSGT T L +YKA++ EF+ Q G T P +
Sbjct: 345 -VELKSSSF-----GRG-ILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTA---PGYSI 394
Query: 337 QGAMDLCYRVGSNRKSXXXXXXXAVTLVFEG-AEMSVSGERLLYKVGDVAAAKGSEDTVY 395
+D C+ N S + ++F+G AE+ V + Y V A+ +
Sbjct: 395 ---LDTCF----NLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDAS-------LV 440
Query: 396 CFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCEL 440
C + E +IG++ Q+N + +D R+G C +
Sbjct: 441 CLALASLSYEN-EVGIIGNYQQKNQRVIYDTTQERLGIVGENCRV 484
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 94.0 bits (232), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 114/424 (26%), Positives = 179/424 (42%), Gaps = 77/424 (18%)
Query: 50 HGSVSIPIPSSRKLSFQH---NVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKL---- 102
H S S +P SR + N T L +G+PPQ +++D+GS ++++ C
Sbjct: 69 HKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCG 128
Query: 103 PNLNSVFNPQLSSSYNPTPCTSPVCKTRTRDFPIPVSC-DPKNLCHATVSYADATSIEGN 161
+ + F P++SS+Y P C + +C D + C YA+ +S +G
Sbjct: 129 KHQDPKFQPEMSSTYQPVKCN------------MDCNCDDDREQCVYEREYAEHSSSKGV 176
Query: 162 LATETFFVAGSPQ---PGTTFGC--MDSG--FTSNADEDSKTTGLMGMNRGSLSFVAQM- 213
L + Q FGC +++G ++ AD G++G+ +G LS V Q+
Sbjct: 177 LGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRAD------GIIGLGQGDLSLVDQLV 230
Query: 214 --GL--PKFSYCISGSD-SSGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRL 268
GL F C G D G ++ G F + + +T + +P Y + L
Sbjct: 231 DKGLISNSFGLCYGGMDVGGGSMILG--GFDYPSDMVFTDSDPDRSPY-------YNIDL 281
Query: 269 QGIRVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTL 328
GIRV K L L +F +H ++DSGT + +L A EE V + L
Sbjct: 282 TGIRVAGKQLSLHSRVFDGEHGA----VLDSGTTYAYLPD-AAFAAFEEAVMREVSTLKQ 336
Query: 329 LD--DPNFVFQGAMDLCYRV-GSNRKSXXXXXXXAVTLVFE-GAEMSVSGERLLYKVGDV 384
+D DPNF D C++V SN S +V +VF+ G +S E +++ V
Sbjct: 337 IDGPDPNF-----KDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKV 391
Query: 385 AAAKGSEDTVYC---FTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCELA 441
A YC F G + V+ +N + +D NS+VGF T C
Sbjct: 392 HGA-------YCLGVFPNGKDHTTLLGGIVV-----RNTLVVYDRENSKVGFWRTNCSEL 439
Query: 442 SQRL 445
S RL
Sbjct: 440 SDRL 443
>AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14682210-14683484 REVERSE LENGTH=424
Length = 424
Score = 94.0 bits (232), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 159/384 (41%), Gaps = 58/384 (15%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLH---CKKLPNLNSVFNPQLSSSYNPTPCTSPVCKT 129
++++G+PP +++DTGS+L+W+H CK P F+P SS+Y C S
Sbjct: 80 ANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQTIPFFHPSRSSTYRNASCVSA---- 135
Query: 130 RTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGS-----PQPGTTFGC--M 182
P + C + Y D ++ G LA E S + FGC
Sbjct: 136 -PHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQD 194
Query: 183 DSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCISG----SDSSGVLLFGDAKF 238
+SGFT K +G++G+ G+ S V + KFSYC + +L+ G+
Sbjct: 195 NSGFT-------KYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLTNPTYPHNILILGNG-- 245
Query: 239 AWLGPLRYTPMVKESTPLPYF-DRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMV 297
+ + TPL F DR Y + LQ I G+KLL +E F + G T++
Sbjct: 246 --------AKIEGDPTPLQIFQDR--YYLDLQAISFGEKLLDIEPGTF-QRYRSQGGTVI 294
Query: 298 DSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXX 357
D+G T L Y+ L EE VL + D + CY N K
Sbjct: 295 DTGCSPTILAREAYETLSEEIDFLLGEVLRRVKD----WDQYTTPCYE--GNLK-LDLYG 347
Query: 358 XXAVTLVFE-GAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHH 416
VT F GAE+++ E L V++ G +C + + VIG
Sbjct: 348 FPVVTFHFAGGAELALDVESLF-----VSSESGDS---FCLAMTMNTFDDMS--VIGAMA 397
Query: 417 QQNVWMEFDLVNSRVGFADTRCEL 440
QQN + ++L +V F T CE+
Sbjct: 398 QQNYNVGYNLRTMKVYFQRTDCEI 421
>AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14685602-14686885 FORWARD LENGTH=427
Length = 427
Score = 92.4 bits (228), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 160/386 (41%), Gaps = 70/386 (18%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS------VFNPQLSSSYNPTPCTSPV 126
V++++GSPP + + +DT S+L W+ C LP +N +F+P S ++ C
Sbjct: 87 VNISIGSPPITQLLHMDTASDLLWIQC--LPCINCYAQSLPIFDPSRSYTHRNETC---- 140
Query: 127 CKTRTRDFPIPVSCDPKNL--CHATVSYADATSIEGNLATETFFV-------AGSPQPGT 177
RT + +P N C ++ Y D T +G LA E + +
Sbjct: 141 ---RTSQYSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDV 197
Query: 178 TFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCISGSDS----SGVLLF 233
FGC N E TG++G+ G S V + G KFSYC D VL+
Sbjct: 198 VFGCGH----DNYGEPLVGTGILGLGYGEFSLVHRFG-KKFSYCFGSLDDPSYPHNVLVL 252
Query: 234 GDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDH-TGS 292
GD LG ++TPL + Y V ++ I V +L ++ +F +H TG
Sbjct: 253 GDDGANILG---------DTTPLEIHNGFYY-VTIEAISVDGIILPIDPRVFNRNHQTGL 302
Query: 293 GQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDL--CYRVGSNR 350
G T++D+G T L+ YK L+ +G T D V Q M CY G+
Sbjct: 303 GGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAAD----VSQDDMIKMECYN-GNFE 357
Query: 351 KSXXXXXXXAVTLVF-EGAEMSVSGERLLYKVGDVAAAKGSEDTVYCF--TFGNSELVGI 407
+ VT F EGAE+S+ + L K+ V+C T GN +G
Sbjct: 358 RDLVESGFPIVTFHFSEGAELSLDVKSLFMKL---------SPNVFCLAVTPGNLNSIGA 408
Query: 408 EAYVIGHHHQQNVWMEFDLVNSRVGF 433
A QQ+ + +DL V F
Sbjct: 409 TA-------QQSYNIGYDLEAMEVSF 427
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 91.7 bits (226), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 162/383 (42%), Gaps = 57/383 (14%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPN----LNSVFNPQLSSSYNPTPCTSPVCK 128
+++++G+PP + + DTGS+L W C + ++ +F+P+ SS+Y C+S C
Sbjct: 92 MNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCT 151
Query: 129 TRTRDFPIPVSCDPK-NLCHATVSYADATSIEGNLATETFFVAGSPQ-----PGTTFGCM 182
SC N C ++SY D + +GN+A +T + S GC
Sbjct: 152 ALENQ----ASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC- 206
Query: 183 DSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLP---KFSYCI----SGSDSSGVLLFGD 235
G + + K +G++G+ G +S + Q+G KFSYC+ S D + + FG
Sbjct: 207 --GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGT 264
Query: 236 AKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQT 295
+ TP++ +++ + Y + L+ I VG K +Q S + G
Sbjct: 265 NAIVSGSGVVSTPLIAKASQETF-----YYLTLKSISVGSKQIQYSGSDSE---SSEGNI 316
Query: 296 MVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXX 355
++DSGT T L Y L + + DP Q + LCY + K
Sbjct: 317 IIDSGTTLTLLPTEFYSELEDAVASSIDAEKK--QDP----QSGLSLCYSATGDLK---- 366
Query: 356 XXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHH 415
+T+ F+GA++ + +V SED V CF F S I G+
Sbjct: 367 --VPVITMHFDGADVKLDSSNAFVQV--------SEDLV-CFAFRGSPSFSI----YGNV 411
Query: 416 HQQNVWMEFDLVNSRVGFADTRC 438
Q N + +D V+ V F T C
Sbjct: 412 AQMNFLVGYDTVSKTVSFKPTDC 434
>AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:11259872-11261209 REVERSE LENGTH=445
Length = 445
Score = 90.9 bits (224), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 173/390 (44%), Gaps = 59/390 (15%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCK---KLPNLNS-VFNPQLSSSYNPTPCTSPVCK 128
+S+++G+PP V + DTGS+L+W+ CK + NS +F+ + SS+Y C S C+
Sbjct: 87 MSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQ 146
Query: 129 TRTRDFPIPVSCDP-KNLCHATVSYADATSIEGNLATETFFVAGSPQ-----PGTTFGCM 182
+ CD K++C SY D + +G++ATET + S PGT FGC
Sbjct: 147 ALSEH---EEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGC- 202
Query: 183 DSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGL---PKFSYCIS----GSDSSGVLLFGD 235
G+ + + +G++G+ G LS V+Q+G KFSYC+S ++ + V+ G
Sbjct: 203 --GYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSVINLGT 260
Query: 236 AKFAWLGPLRYTPMVKESTPLPYFD-RVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGS-- 292
P + + + +TPL D Y + L+ + VGK L + + S
Sbjct: 261 NSIPS-NPSKDSATL--TTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKR 317
Query: 293 -GQTMVDSGTQFTFLLGPVYKALR---EEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGS 348
G ++DSGT T L Y EE V K V DP QG + C++ G
Sbjct: 318 TGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRV----SDP----QGLLTHCFKSGD 369
Query: 349 NRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIE 408
A+T+ F A++ +S K+ +EDTV C S + E
Sbjct: 370 KE-----IGLPAITMHFTNADVKLSPINAFVKL--------NEDTV-CL----SMIPTTE 411
Query: 409 AYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
+ G+ Q + + +DL V F C
Sbjct: 412 VAIYGNMVQMDFLVGYDLETKTVSFQRMDC 441
>AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:10185229-10186605 REVERSE LENGTH=458
Length = 458
Score = 89.7 bits (221), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 156/383 (40%), Gaps = 49/383 (12%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS------VFNPQLSSSYNPTPCTSPV 126
V+ +VG PP ++DTGS L W+ C+ + +S VFNP LSS++ C
Sbjct: 98 VNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRF 157
Query: 127 CKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQPGTT------FG 180
C+ C N C Y T +G LA E +P T FG
Sbjct: 158 CRYAPNG-----HCGSSNKCVYEQVYISGTGSKGVLAKERLTFT-TPNGNTVVTQPIAFG 211
Query: 181 CMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCISGSDSSGVLLFGDAKFAW 240
C G+ + +S TG++G+ S Q+G KFSYCI G L + +
Sbjct: 212 C---GYENGEQLESHFTGILGLGAKPTSLAVQLG-SKFSYCI------GDLANKNYGYNQ 261
Query: 241 LGPLRYTPMVKESTPLPY-FDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDS 299
L ++ + TP+ + + Y + L+GI VG L +E +F +G ++DS
Sbjct: 262 LVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTG-VILDS 320
Query: 300 GTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXXX 359
GT +T+L Y+ L E + ++LD F LCY R S
Sbjct: 321 GTLYTWLADIAYRELYNE-------IKSILDPKLERFWFRDFLCYH---GRVSEELIGFP 370
Query: 360 AVTLVFE-GAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVG---IEAYVIGHH 415
VT F GAE+++ + Y + ++ + V+C + ++ G E IG
Sbjct: 371 VVTFHFAGGAELAMEATSMFYPL-----SEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLM 425
Query: 416 HQQNVWMEFDLVNSRVGFADTRC 438
QQ + +DL + C
Sbjct: 426 AQQYYNIGYDLKEKNIYLQRIDC 448
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 89.7 bits (221), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 166/392 (42%), Gaps = 68/392 (17%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHCKKLPNLN---------SVFNPQLSSSYNPTPCTSP 125
+ +GSPP+ + +DTGS++ W++C P S+++ + SS+ C
Sbjct: 82 IKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDD 141
Query: 126 VCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFF---VAGSPQPG-----T 177
C + +C K C V Y D ++ +G+ + V G+ +
Sbjct: 142 FCSFIMQ----SETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEV 197
Query: 178 TFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQM---GLPK--FSYCISGSDSSGVLL 232
FGC + DS G+MG + + S ++Q+ G K FS+C+ + G+
Sbjct: 198 VFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFA 257
Query: 233 FGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGS 292
G+ + +P+VK + +P ++V Y V L+G+ V + L S+ G
Sbjct: 258 VGEVE---------SPVVKTTPIVP--NQVHYNVILKGMDVDGDPIDLPPSL--ASTNGD 304
Query: 293 GQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDD--PNFVFQGAMDLCYRVGSNR 350
G T++DSGT +L +Y +L E+ A+ + L ++ + F F D + V
Sbjct: 305 GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFACFSFTSNTDKAFPV---- 360
Query: 351 KSXXXXXXXAVTLVFEGA-EMSVSGERLLYKVGDVAAAKGSEDTVYCFTF---GNSELVG 406
V L FE + ++SV L+ + + +YCF + G + G
Sbjct: 361 ----------VNLHFEDSLKLSVYPHDYLFSL---------REDMYCFGWQSGGMTTQDG 401
Query: 407 IEAYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
+ ++G N + +DL N +G+AD C
Sbjct: 402 ADVILLGDLVLSNKLVVYDLENEVIGWADHNC 433
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 89.4 bits (220), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 105/444 (23%), Positives = 182/444 (40%), Gaps = 65/444 (14%)
Query: 27 QIQTSDASPQTLLILPLKVQTHPHGSVSIPIPSSRK---LSFQHNVTLTVSLTVGSPPQS 83
+++ D ++L Q+ G V P+ S + + + + +GSPP
Sbjct: 58 ELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTE 117
Query: 84 VTMVLDTGSELSWLHCKKLPNLN---------SVFNPQLSSSYNPTPCTSPVCKTRTRDF 134
+ +DTGS++ W+ C N F+ S + C+ P+C + +
Sbjct: 118 FNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQT- 176
Query: 135 PIPVSCDPKNLCHATVSYADATSIEGNLATETFF---------VAGSPQPGTTFGCMDSG 185
C N C + Y D + G T+TF+ VA S P FGC
Sbjct: 177 -TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAP-IVFGCSTYQ 234
Query: 186 FTSNADEDSKTTGLMGMNRGSLSFVAQM---GL--PKFSYCISGSDS-SGVLLFGDAKFA 239
D G+ G +G LS V+Q+ G+ P FS+C+ G S GV + G+
Sbjct: 235 SGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEI--- 291
Query: 240 WLGP-LRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVD 298
L P + Y+P+V P+ Y + L I V ++L L+ ++F +T T+VD
Sbjct: 292 -LVPGMVYSPLVPSQ---PH-----YNLNLLSIGVNGQMLPLDAAVFEASNTRG--TIVD 340
Query: 299 SGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXX 358
+GT T+L+ Y + F+ ++ L P + G + CY V ++
Sbjct: 341 TGTTLTYLVKEAY----DLFLNAISNSVSQLVTP-IISNG--EQCYLVSTSISD----MF 389
Query: 359 XAVTLVFE-GAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHHQ 417
+V+L F GA M + + L+ G A +++C F + E ++G
Sbjct: 390 PSVSLNFAGGASMMLRPQDYLFHYGIYDGA-----SMWCIGFQKAPE---EQTILGDLVL 441
Query: 418 QNVWMEFDLVNSRVGFADTRCELA 441
++ +DL R+G+A C ++
Sbjct: 442 KDKVFVYDLARQRIGWASYDCSMS 465
>AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24230963-24233349 REVERSE LENGTH=475
Length = 475
Score = 89.0 bits (219), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/390 (25%), Positives = 166/390 (42%), Gaps = 67/390 (17%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHCKKLP------NLN---SVFNPQLSSSYNPTPCTSP 125
+ +GSPP+ + +DTGS++ W++CK P NLN S+F+ SS+ C
Sbjct: 78 IKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDD 137
Query: 126 VCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFF---VAGSPQPG-----T 177
C ++ SC P C + YAD ++ +G + V G + G
Sbjct: 138 FCSFISQ----SDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEV 193
Query: 178 TFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQM---GLPK--FSYCISGSDSSGVLL 232
FGC + DS G+MG + + S ++Q+ G K FS+C+ G+
Sbjct: 194 VFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFA 253
Query: 233 FGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGS 292
G + +P VK + +P +++ Y V L G+ V L L +SI +
Sbjct: 254 VG---------VVDSPKVKTTPMVP--NQMHYNVMLMGMDVDGTSLDLPRSI-----VRN 297
Query: 293 GQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKS 352
G T+VDSGT + +Y +L E +A+ L ++++ FQ C+ +N
Sbjct: 298 GGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEE---TFQ-----CFSFSTNVDE 349
Query: 353 XXXXXXXAVTLVFEGA-EMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVG---IE 408
V+ FE + +++V L+ + E+ +YCF + L E
Sbjct: 350 ----AFPPVSFEFEDSVKLTVYPHDYLFTL---------EEELYCFGWQAGGLTTDERSE 396
Query: 409 AYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
++G N + +DL N +G+AD C
Sbjct: 397 VILLGDLVLSNKLVVYDLDNEVIGWADHNC 426
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 88.6 bits (218), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 98/393 (24%), Positives = 165/393 (41%), Gaps = 62/393 (15%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHCKKLPNLN---------SVFNPQLSSSYNPTPCTSP 125
+ +GSPP + +DTGS++ W+ C N F+ S + C+ P
Sbjct: 104 VKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDP 163
Query: 126 VCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFF---------VAGSPQPG 176
+C + + C N C + Y D + G T+TF+ VA S P
Sbjct: 164 ICSSVFQT--TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAP- 220
Query: 177 TTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQM---GL--PKFSYCISGSDS-SGV 230
FGC D G+ G +G LS V+Q+ G+ P FS+C+ G S GV
Sbjct: 221 IVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGV 280
Query: 231 LLFGDAKFAWLGP-LRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDH 289
+ G+ L P + Y+P+V P+ Y + L I V ++L L+ ++F +
Sbjct: 281 FVLGEI----LVPGMVYSPLVPSQ---PH-----YNLNLLSIGVNGQMLPLDAAVFEASN 328
Query: 290 TGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSN 349
T T+VD+GT T+L+ Y + F+ ++ L P + G + CY V ++
Sbjct: 329 TRG--TIVDTGTTLTYLVKEAY----DLFLNAISNSVSQLVTP-IISNG--EQCYLVSTS 379
Query: 350 RKSXXXXXXXAVTLVFE-GAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIE 408
+V+L F GA M + + L+ G A +++C F + E
Sbjct: 380 ISD----MFPSVSLNFAGGASMMLRPQDYLFHYGIYDGA-----SMWCIGFQKAPE---E 427
Query: 409 AYVIGHHHQQNVWMEFDLVNSRVGFADTRCELA 441
++G ++ +DL R+G+A C ++
Sbjct: 428 QTILGDLVLKDKVFVYDLARQRIGWASYDCSMS 460
>AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3400671-3402165 REVERSE LENGTH=464
Length = 464
Score = 87.0 bits (214), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 159/383 (41%), Gaps = 69/383 (18%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSV-------FNPQLSSSYNPTPCTSP 125
V++ +G+P +++V DTGS+L+W C+ P L S FNP SS+Y C+SP
Sbjct: 134 VTIGIGTPKHDLSLVFDTGSDLTWTQCE--PCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 126 VCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQ-PGTTFGCMDS 184
+C+ SC N C ++ Y D + +G LA E F + S FGC ++
Sbjct: 192 MCEDAE-------SCSASN-CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGEN 243
Query: 185 G---FTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCISG--SDSSGVLLFGDAKFA 239
F A G + + + + + FSYC+ S+S+G L FG A +
Sbjct: 244 NQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNI----FSYCLPSFTSNSTGHLTFGSAGIS 299
Query: 240 WLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDS 299
+++TP+ S P + Y + + GI VG K L + P+ + ++DS
Sbjct: 300 E--SVKFTPI--SSFPSAF----NYGIDIIGISVGDKELAI-----TPNSFSTEGAIIDS 346
Query: 300 GTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXXXX 359
GT FT L VY LR F + + G D CY +
Sbjct: 347 GTVFTRLPTKVYAELRSVFKEKMSSYKSTSG------YGLFDTCY----DFTGLDTVTYP 396
Query: 360 AVTLVFEGA---EMSVSGERLLYKVGDVAAAKGSEDTVYCFTF-GNSELVGIEAYVIGHH 415
+ F G+ E+ SG L K+ V C F GN +L I G+
Sbjct: 397 TIAFSFAGSTVVELDGSGISLPIKISQV-----------CLAFAGNDDLPAI----FGNV 441
Query: 416 HQQNVWMEFDLVNSRVGFADTRC 438
Q + + +D+ RVGFA C
Sbjct: 442 QQTTLDVVYDVAGGRVGFAPNGC 464
>AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:17299264-17302718 FORWARD LENGTH=631
Length = 631
Score = 86.3 bits (212), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 99/400 (24%), Positives = 168/400 (42%), Gaps = 68/400 (17%)
Query: 68 NVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKL----PNLNSVFNPQLSSSYNPTPCT 123
N T L +G+PPQ +++DTGS ++++ C + + F P+LS+SY C
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC- 131
Query: 124 SPVCKTRTRDFPIPVSCDPK-NLCHATVSYADATSIEGNLATETFFVAG----SPQPGTT 178
+P C +CD + LC YA+ +S G L+ + SPQ
Sbjct: 132 NPDC-----------NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQ-RAV 179
Query: 179 FGCMDSG----FTSNADEDSKTTGLMGMNRGSLSFVAQM---GLPK--FSYCISGSDSSG 229
FGC + F+ AD G+MG+ RG LS V Q+ G+ + FS C G + G
Sbjct: 180 FGCENEETGDLFSQRAD------GIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG 233
Query: 230 VLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDH 289
+ LG + P + S P F Y + L+ + V K L+L +F
Sbjct: 234 GAMV-------LGKISPPPGMVFSHSDP-FRSPYYNIDLKQMHVAGKSLKLNPKVF---- 281
Query: 290 TGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLD-DPNFVFQGAMDLCYRVGS 348
G T++DSGT + + + A+++ + + + + DPN+ D+C+
Sbjct: 282 NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNY-----DDVCFSGAG 336
Query: 349 NRKSXXXXXXXAVTLVF-EGAEMSVSGERLLYKVGDVAAAKGSEDTVYCF-TFGNSELVG 406
+ + + F G ++ +S E L++ V A YC F + +
Sbjct: 337 RDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGA-------YCLGIFPDRD--- 386
Query: 407 IEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCELASQRLG 446
++G +N + +D N ++GF T C +RL
Sbjct: 387 -STTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDIWRRLA 425
>AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11934208-11935386 REVERSE LENGTH=392
Length = 392
Score = 80.5 bits (197), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 124/292 (42%), Gaps = 51/292 (17%)
Query: 65 FQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS----VFNPQLSSSYNPT 120
F +N+ L + L VG+PP + +DTGS+L W C N S +F+P SS++
Sbjct: 56 FDYNIYL-MKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEK 114
Query: 121 PCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFV---AGSP--QP 175
C N CH + YAD T +G LATET + +G P P
Sbjct: 115 RCNG-------------------NSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMP 155
Query: 176 GTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMG--LPKF-SYCISGSDSSGVLL 232
TT GC G S+ + + +G++G++ G S + QMG P SYC + +S +
Sbjct: 156 ETTIGC---GHNSSWFKPTF-SGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINF 211
Query: 233 FGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGS 292
+A A G + T + + P Y+ + L + VG ++ + F H
Sbjct: 212 GTNAIVAGDGVVSTTMFLTTAKPGLYY------LNLDAVSVGDTHVETMGTTF---HALE 262
Query: 293 GQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCY 344
G ++DSGT T+ +RE V T DP G LCY
Sbjct: 263 GNIIIDSGTTLTYFPVSYCNLVREAVDHYVTAVRTA--DPT----GNDMLCY 308
>AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:16787508-16789318 REVERSE LENGTH=405
Length = 405
Score = 80.5 bits (197), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/404 (24%), Positives = 161/404 (39%), Gaps = 57/404 (14%)
Query: 58 PSSRKLSFQHNV----TLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNPQL 113
PSS NV +V + +GSPP++ +DTGS+L+W+ C P P L
Sbjct: 32 PSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDA-PCSGCTLPPNL 90
Query: 114 SSSYNP----TPCTSPVCKTRTRDFPIPVSC-DPKNLCHATVSYADATSIEGNLATETF- 167
Y P PC++P+C +P C +P+ C V YAD S G L T+ F
Sbjct: 91 --QYKPKGNIIPCSNPIC--TALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALVTDQFP 146
Query: 168 --FVAGS-PQPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQM---GLPK--FS 219
V GS QP FGC +A T G++G+ RG + + Q+ GL +
Sbjct: 147 LKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVG 206
Query: 220 YCISGSDSSGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQ 279
+C+S S G L FGD +G + +TP++ + ++ + G G K L+
Sbjct: 207 HCLS-SKGGGFLFFGDNLVPSIG-VAWTPLLSQDN---HYTTGPADLLFNGKPTGLKGLK 261
Query: 280 LEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTK-GVLTLLDDPNFVFQG 338
L + D+G+ +T+ Y+ + K L + +
Sbjct: 262 L---------------IFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKE-----DK 301
Query: 339 AMDLCYRVGSNRKSXXXXXX--XAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYC 396
+ +C++ KS +T+ F + LY ++ V C
Sbjct: 302 TLPICWKGAKPFKSVLEVKNFFKTITINFTNGRRNTQ----LYLAPELYLIVSKTGNV-C 356
Query: 397 FTFGNSELVGIE-AYVIGHHHQQNVWMEFDLVNSRVGFADTRCE 439
N VG++ + VIG Q + M +D ++G+ + C
Sbjct: 357 LGLLNGSEVGLQNSNVIGDISMQGLMMIYDNEKQQLGWVSSDCN 400
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 80.1 bits (196), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 93/389 (23%), Positives = 164/389 (42%), Gaps = 57/389 (14%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHC---------KKLPNLNSVFNPQLSSSYNPTPCTSP 125
+ +G+PP+ + +DTGS++ W+ C +L S F+P +SSS + C+
Sbjct: 88 VKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDR 147
Query: 126 VCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATE--------TFFVAGSPQPGT 177
C + +F C P NLC + Y D + G ++ T +A +
Sbjct: 148 RCYS---NFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPF 204
Query: 178 TFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGL----PK-FSYCISGSDS-SGVL 231
FGC + G+ G+ +GSLS ++Q+ + P+ FS+C+ G S G++
Sbjct: 205 VFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIM 264
Query: 232 LFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTG 291
+ G K YTP+V P+ Y V LQ I V ++L ++ S+F TG
Sbjct: 265 VLGQIKRP---DTVYTPLVPSQ---PH-----YNVNLQSIAVNGQILPIDPSVFTI-ATG 312
Query: 292 SGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRK 351
G T++D+GT +L Y F+ ++ P ++ C+ +
Sbjct: 313 DG-TIIDTGTTLAYLPDEAY----SPFIQAVANAVSQYGRP-ITYESYQ--CFEI----T 360
Query: 352 SXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYV 411
+ V+L F G V G R ++ S +++C F + +
Sbjct: 361 AGDVDVFPQVSLSFAGGASMVLGPRAYLQI-----FSSSGSSIWCIGF--QRMSHRRITI 413
Query: 412 IGHHHQQNVWMEFDLVNSRVGFADTRCEL 440
+G ++ + +DLV R+G+A+ C L
Sbjct: 414 LGDLVLKDKVVVYDLVRQRIGWAEYDCSL 442
>AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11936203-11937390 REVERSE LENGTH=395
Length = 395
Score = 79.0 bits (193), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 159/383 (41%), Gaps = 76/383 (19%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS------VFNPQLSSSYNPTPCTSPV 126
+ L +G+PP + VLDTGSE W C LP ++ +F+P SS++ C
Sbjct: 67 MKLQIGTPPFEIEAVLDTGSEHIWTQC--LPCVHCYNQTAPIFDPSKSSTFKEIRC---- 120
Query: 127 CKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFV---AGSP--QPGTTFGC 181
T D SC P L + SY +G L TET + +G P P T GC
Sbjct: 121 ---DTHDH----SC-PYELVYGGKSYT-----KGTLVTETVTIHSTSGQPFVMPETIIGC 167
Query: 182 --MDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMG--LPKF-SYCISGSDSSGVLLFGDA 236
+SGF G++G++RG S + QMG P SYC +G +S + +A
Sbjct: 168 GRNNSGFKPGF------AGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANA 221
Query: 237 KFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTM 296
A G + T VK + P Y+ + L + VG ++ ++ P H G +
Sbjct: 222 IVAGDGVVSTTVFVKTAKPGFYY------LNLDAVSVGNTRIE---TVGTPFHALKGNIV 272
Query: 297 VDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSNRKSXXXX 356
+DSG+ T+ Y L + V Q V+T + P + LCY S
Sbjct: 273 IDSGSTLTY-FPESYCNLVRKAVEQ---VVTAVRFPR-----SDILCYY------SKTID 317
Query: 357 XXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFT-FGNSELVGIEAYVIGHH 415
+T+ F G V + +Y VA+ G V+C NS IE + G+
Sbjct: 318 IFPVITMHFSGGADLVLDKYNMY----VASNTGG---VFCLAIICNSP---IEEAIFGNR 367
Query: 416 HQQNVWMEFDLVNSRVGFADTRC 438
Q N + +D + V F T C
Sbjct: 368 AQNNFLVGYDSSSLLVSFKPTNC 390
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 78.6 bits (192), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 84/392 (21%), Positives = 154/392 (39%), Gaps = 69/392 (17%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHCK---KLPNLNSV-----FNPQLSSSYNPTPCTSPV 126
+ +G+P + + +DTGS++ W++C + P + + ++ SS+ C+
Sbjct: 89 IGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSVSCSDNF 148
Query: 127 CKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETF---FVAGSPQPG-----TT 178
C + C + C + Y D +S G L + V G+ Q G
Sbjct: 149 CSYVNQ----RSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTII 204
Query: 179 FGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMG-----LPKFSYCISGSDSSGVLLF 233
FGC + + G+MG + + SF++Q+ F++C+ ++ G+
Sbjct: 205 FGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAI 264
Query: 234 GDAKFAWLGP-LRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGS 292
G+ + P ++ TPM+ +S Y+V L I VG +L+L + F D
Sbjct: 265 GEV----VSPKVKTTPMLSKSAH--------YSVNLNAIEVGNSVLELSSNAF--DSGDD 310
Query: 293 GQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMD--LCYRVGSNR 350
++DSGT +L VY L E +A P + C+
Sbjct: 311 KGVIIDSGTTLVYLPDAVYNPLLNEILAS---------HPELTLHTVQESFTCFHY---- 357
Query: 351 KSXXXXXXXAVTLVFEGA-EMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELV---G 406
+ VT F+ + ++V L++V EDT +CF + N L G
Sbjct: 358 -TDKLDRFPTVTFQFDKSVSLAVYPREYLFQV--------REDT-WCFGWQNGGLQTKGG 407
Query: 407 IEAYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
++G N + +D+ N +G+ + C
Sbjct: 408 ASLTILGDMALSNKLVVYDIENQVIGWTNHNC 439
>AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11930579-11931769 REVERSE LENGTH=396
Length = 396
Score = 77.0 bits (188), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/390 (25%), Positives = 155/390 (39%), Gaps = 74/390 (18%)
Query: 65 FQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS------VFNPQLSSSYN 118
F ++V L + L VG+PP + ++DTGSE++W C LP ++ +F+P SS++
Sbjct: 60 FDNSVYL-MKLQVGTPPFEIQAIIDTGSEITWTQC--LPCVHCYEQNAPIFDPSKSSTFK 116
Query: 119 PTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFV---AGSP-- 173
C C V Y D T G LATET + +G P
Sbjct: 117 EKRCDGHSCPYE-------------------VDYFDHTYTMGTLATETITLHSTSGEPFV 157
Query: 174 QPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMG--LPKF-SYCISGSDSSGV 230
P T GC +N+ +G++G+N G S + QMG P SYC SG +S +
Sbjct: 158 MPETIIGCGH----NNSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGTSKI 213
Query: 231 LLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHT 290
+A A G + T + + P Y+ + L + VG ++ + F H
Sbjct: 214 NFGANAIVAGDGVVSTTMFMTTAKPGFYY------LNLDAVSVGNTRIETMGTTF---HA 264
Query: 291 GSGQTMVDSGTQFTFLLGPV-YKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSN 349
G ++DSGT T+ PV Y L + V + D G LCY
Sbjct: 265 LEGNIVIDSGTTLTYF--PVSYCNLVRQAVEHVVTAVRAADP-----TGNDMLCY----- 312
Query: 350 RKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFT-FGNSELVGIE 408
S +T+ F G V + +Y + + V+C NS +
Sbjct: 313 -NSDTIDIFPVITMHFSGGVDLVLDKYNMYM-------ESNNGGVFCLAIICNSP---TQ 361
Query: 409 AYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
+ G+ Q N + +D + V F+ T C
Sbjct: 362 EAIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391
>AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:9329933-9331432 REVERSE LENGTH=499
Length = 499
Score = 76.3 bits (186), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 126/301 (41%), Gaps = 62/301 (20%)
Query: 178 TFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGL------PKFSYCISGS--DSSG 229
TFGC + ++ G+ G RG LS AQ+ + FSYC+ DS
Sbjct: 213 TFGCAHTTL-------AEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDR 265
Query: 230 V-----LLFG---DAKFAWLG----------------PLRYTPMVKESTPLPYFDRVAYT 265
V L+ G D K +G +T M+ E+ PYF Y+
Sbjct: 266 VRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEML-ENPKHPYF----YS 320
Query: 266 VRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGV 325
V LQGI +GK+ + + D G G +VDSGT FT L Y ++ EEF ++ V
Sbjct: 321 VSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRV 380
Query: 326 LTLLD--DPNFVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGER--LLYKV 381
D +P+ M CY + K A+ L F G SV+ R Y+
Sbjct: 381 HERADRVEPS----SGMSPCYYLNQTVK------VPALVLHFAGNRSSVTLPRRNYFYEF 430
Query: 382 GDVAAAKGSEDTVYCFTFGN----SELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTR 437
D K + + C N SEL G ++G++ QQ + +DL+N RVGFA +
Sbjct: 431 MDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRK 490
Query: 438 C 438
C
Sbjct: 491 C 491
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 76.3 bits (186), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 96/394 (24%), Positives = 164/394 (41%), Gaps = 66/394 (16%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWLHCKK---LPNLNSV------FNPQLSSSYNPTPCTSP 125
L +G+PP+ + +DTGS++ W+ C P + + F+P S + +P C+
Sbjct: 85 LRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQ 144
Query: 126 VCK--TRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETF----FVAGSPQPGTT- 178
C ++ D V NLC T Y D + G ++ V S P +T
Sbjct: 145 RCSWGIQSSDSGCSVQ---NNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTA 201
Query: 179 ---FGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMG----LPK-FSYCISGSD-SSG 229
FGC S D G+ G + +S ++Q+ P+ FS+C+ G + G
Sbjct: 202 PVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGG 261
Query: 230 VLLFGDAKFAWLGP-LRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPD 288
+L+ G+ + P + +TP+V P+ Y V L I V + L + S+F
Sbjct: 262 ILVLGEI----VEPNMVFTPLVPSQ---PH-----YNVNLLSISVNGQALPINPSVF--- 306
Query: 289 HTGSGQ-TMVDSGTQFTFLLGPVYKALREEFV-AQTKGVLTLLDDPNFVFQGAMDLCYRV 346
T +GQ T++D+GT +L Y E A ++ V ++ N CY +
Sbjct: 307 STSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGN--------QCYVI 358
Query: 347 GSNRKSXXXXXXXAVTLVFE-GAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELV 405
+ V+L F GA M ++ + L + +V V+C F +
Sbjct: 359 ----TTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGT-----AVWCIGFQRIQNQ 409
Query: 406 GIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCE 439
GI ++G ++ +DLV R+G+A+ C
Sbjct: 410 GIT--ILGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 158/385 (41%), Gaps = 79/385 (20%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNPQLSSSYNPTPCTSPVCKTRTR 132
+ + VGSPP+ +++LDTGS+L+W+ C PC + +
Sbjct: 172 MDVLVGSPPKHFSLILDTGSDLNWIQC--------------------LPCYDCFQQNDNQ 211
Query: 133 DFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQPGTT---------FGCMD 183
C Y D+++ G+ A ETF V + G++ FGC
Sbjct: 212 S------------CPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH 259
Query: 184 SGFTSNADEDSKTTGLMGMNRGSLSFVAQMGL---PKFSYCI----SGSDSSGVLLFGDA 236
N GL+G+ RG LSF +Q+ FSYC+ S ++ S L+FG+
Sbjct: 260 W----NRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGED 315
Query: 237 KFAWLGP-LRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQT 295
K P L +T V L Y V+++ I V ++L + + + G+G T
Sbjct: 316 KDLLSHPNLNFTSFVAGKENLV---DTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGT 372
Query: 296 MVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDD-PNFVFQGAMDLCYRVGSNRKSXX 354
++DSGT ++ P Y+ ++ + + KG + D P +D C+ V
Sbjct: 373 IIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFP------ILDPCFNVSGIHNVQL 426
Query: 355 XXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAY-VIG 413
A +GA + E + +ED V G + A+ +IG
Sbjct: 427 PELGIAFA---DGAVWNFPTENSFIWL--------NEDLVCLAMLGTPK----SAFSIIG 471
Query: 414 HHHQQNVWMEFDLVNSRVGFADTRC 438
++ QQN + +D SR+G+A T+C
Sbjct: 472 NYQQQNFHILYDTKRSRLGYAPTKC 496
>AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:1762843-1766150 REVERSE LENGTH=485
Length = 485
Score = 67.8 bits (164), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 133/295 (45%), Gaps = 45/295 (15%)
Query: 75 LTVGSPPQSVTMVLDTGSELSW---LHCKKLPNLNSVFNPQLSSSYNPTPCTSPVCKTRT 131
+ +G+P +S + +DTGS++ W + CK+ P S +L + YN S +
Sbjct: 84 IGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPR-RSTLGIEL-TLYNIDESDSGKLVSCD 141
Query: 132 RDFPIPVSCDPKNLCHATVS------YADATSIEGNLATETF---FVAGSPQPGTT---- 178
DF +S P + C A +S Y D +S G + VAG + T
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201
Query: 179 -FGC--MDSGFTSNADEDSKTTGLMGMNRGSLSFVAQM---GLPK--FSYCISGSDSSGV 230
FGC SG +++E++ G++G + + S ++Q+ G K F++C+ G + G+
Sbjct: 202 IFGCGARQSGDLDSSNEEA-LDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGI 260
Query: 231 LLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVP-DH 289
G P V + +P ++ Y V + ++VG++ L + +F P D
Sbjct: 261 FAIGRV---------VQPKVNMTPLVP--NQPHYNVNMTAVQVGQEFLTIPADLFQPGDR 309
Query: 290 TGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPN---FVFQGAMD 341
G+ ++DSGT +L +Y+ L ++ +Q + + D + F + G +D
Sbjct: 310 KGA---IIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVD 361
>AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114946-29117150 REVERSE LENGTH=432
Length = 432
Score = 67.4 bits (163), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 156/385 (40%), Gaps = 51/385 (13%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNP---QLSSSYNPTPCTSPVCKT 129
V L +G+PP+ + +DTGS+L+W+ C N P Q ++N PC+ +C
Sbjct: 69 VLLNIGNPPKLFDLDIDTGSDLTWVQCDA--PCNGCTKPRAKQYKPNHNTLPCSHILCS- 125
Query: 130 RTRDFPIPVSC-DPKNLCHATVSYADATSIEGNLATETF---FVAGSPQP-GTTFGCMDS 184
D P C DP++ C + Y+D S G L T+ GS TFGC
Sbjct: 126 -GLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYD 184
Query: 185 GFTSNADEDSKTTGLMGMNRGSLSFVAQ---MGLPK--FSYCISGSDSSGVLLFGDAKFA 239
T G++G+ RG + Q +G+ K +C+S + G L GD
Sbjct: 185 QQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHT-GKGFLSIGDELVP 243
Query: 240 WLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMV-D 298
G + +T + S Y A +LL +K+ V G +V D
Sbjct: 244 SSG-VTWTSLATNSPSKNYMAGPA------------ELLFNDKTTGV-----KGINVVFD 285
Query: 299 SGTQFTFLLGPVYKALREEFVAQTKGV-LTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXX 357
SG+ +T+ Y+A+ + G LT D ++ +C++ KS
Sbjct: 286 SGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD-----DKSLPVCWKGKKPLKSLDEVK 340
Query: 358 X--XAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAY-VIGH 414
+TL F + +G+ L++V + +E C N +G+E Y +IG
Sbjct: 341 KYFKTITLRFGNQK---NGQ--LFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGD 395
Query: 415 HHQQNVWMEFDLVNSRVGFADTRCE 439
Q + + +D R+G+ + C+
Sbjct: 396 ISFQGIMVIYDNEKQRIGWISSDCD 420
>AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114705-29117150 REVERSE LENGTH=466
Length = 466
Score = 67.0 bits (162), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 156/385 (40%), Gaps = 51/385 (13%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNP---QLSSSYNPTPCTSPVCKT 129
V L +G+PP+ + +DTGS+L+W+ C N P Q ++N PC+ +C
Sbjct: 69 VLLNIGNPPKLFDLDIDTGSDLTWVQCD--APCNGCTKPRAKQYKPNHNTLPCSHILCS- 125
Query: 130 RTRDFPIPVSC-DPKNLCHATVSYADATSIEGNLATETF---FVAGSPQP-GTTFGCMDS 184
D P C DP++ C + Y+D S G L T+ GS TFGC
Sbjct: 126 -GLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYD 184
Query: 185 GFTSNADEDSKTTGLMGMNRGSLSFVAQ---MGLPK--FSYCISGSDSSGVLLFGDAKFA 239
T G++G+ RG + Q +G+ K +C+S + G L GD
Sbjct: 185 QQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHT-GKGFLSIGDELVP 243
Query: 240 WLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMV-D 298
G + +T + S Y A +LL +K+ V G +V D
Sbjct: 244 SSG-VTWTSLATNSPSKNYMAGPA------------ELLFNDKTTGV-----KGINVVFD 285
Query: 299 SGTQFTFLLGPVYKALREEFVAQTKGV-LTLLDDPNFVFQGAMDLCYRVGSNRKSXXXXX 357
SG+ +T+ Y+A+ + G LT D ++ +C++ KS
Sbjct: 286 SGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDK-----SLPVCWKGKKPLKSLDEVK 340
Query: 358 X--XAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAY-VIGH 414
+TL F + +G+ L++V + +E C N +G+E Y +IG
Sbjct: 341 KYFKTITLRFGNQK---NGQ--LFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGD 395
Query: 415 HHQQNVWMEFDLVNSRVGFADTRCE 439
Q + + +D R+G+ + C+
Sbjct: 396 ISFQGIMVIYDNEKQRIGWISSDCD 420
>AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108781-16110679 REVERSE LENGTH=425
Length = 425
Score = 66.6 bits (161), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 108/256 (42%), Gaps = 37/256 (14%)
Query: 72 TVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNPQ--LSSSYNPTPCTSPVCKT 129
V++ +G PP+ + LDTGS+L+WL C P + + P S + PC P+CK
Sbjct: 61 NVTINIGQPPRPYYLDLDTGSDLTWLQCDA-PCVRCLEAPHPLYQPSSDLIPCNDPLCKA 119
Query: 130 RTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETF---FVAG-SPQPGTTFGCMDSG 185
+ C+ C V YAD S G L + F + G P GC G
Sbjct: 120 LHLN--SNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGC---G 174
Query: 186 FTS--NADEDSKTTGLMGMNRGSLSFVAQM---GLPK--FSYCISGSDSSGVLLFGDAKF 238
+ A G++G+ RG +S ++Q+ G K +C+S S G+L FGD +
Sbjct: 175 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLS-SLGGGILFFGDDLY 233
Query: 239 AWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVD 298
+ +TPM +E + Y + + G G K L T+ D
Sbjct: 234 D-SSRVSWTPMSREYSK-HYSPAMGGELLFGGRTTGLKNL---------------LTVFD 276
Query: 299 SGTQFTFLLGPVYKAL 314
SG+ +T+ Y+A+
Sbjct: 277 SGSSYTYFNSKAYQAV 292
>AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108928-16110670 REVERSE LENGTH=401
Length = 401
Score = 66.2 bits (160), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 111/257 (43%), Gaps = 39/257 (15%)
Query: 72 TVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNPQ--LSSSYNPTPCTSPVCKT 129
V++ +G PP+ + LDTGS+L+WL C P + + P S + PC P+CK
Sbjct: 58 NVTINIGQPPRPYYLDLDTGSDLTWLQCDA-PCVRCLEAPHPLYQPSSDLIPCNDPLCKA 116
Query: 130 RTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETF---FVAG-SPQPGTTFGCMDSG 185
+ C+ C V YAD S G L + F + G P GC G
Sbjct: 117 LHLN--SNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGC---G 171
Query: 186 FTS--NADEDSKTTGLMGMNRGSLSFVAQM---GLPK--FSYCISGSDSSGVLLFGDAKF 238
+ A G++G+ RG +S ++Q+ G K +C+S S G+L FGD +
Sbjct: 172 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLS-SLGGGILFFGDDLY 230
Query: 239 AWLGPLRYTPMVKESTPLPYFDRVAYTVRLQG-IRVGKKLLQLEKSIFVPDHTGSGQTMV 297
+ +TPM +E + Y+ + G + G + L+ + T+
Sbjct: 231 D-SSRVSWTPMSREYSK-------HYSPAMGGELLFGGRTTGLKNLL----------TVF 272
Query: 298 DSGTQFTFLLGPVYKAL 314
DSG+ +T+ Y+A+
Sbjct: 273 DSGSSYTYFNSKAYQAV 289
>AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=350
Length = 350
Score = 65.9 bits (159), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 80/177 (45%), Gaps = 20/177 (11%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLN-----SVFNPQLSSSYNPTPCTSPVC 127
V L +G PPQS+ ++ DTGS+L W+ C N + +VF P+ SS+++P C PVC
Sbjct: 86 VDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVC 145
Query: 128 KTRTRDFPIPVSCDPKNL---CHATVSYADATSIEGNLATETFFVAGSPQ-----PGTTF 179
+ + P+ C+ + CH YAD + G A ET + S F
Sbjct: 146 RLVPKPDRAPI-CNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAF 204
Query: 180 GCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCISGSDSSGVLLFGDA 236
GC GF + S G + + +L+F+A+ P + I+ L DA
Sbjct: 205 GC---GFRISGQSVSGNGGTVVDSGTTLAFLAE---PAYRSVIAAVRRRVKLPIADA 255
>AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:7713488-7716269 FORWARD LENGTH=513
Length = 513
Score = 64.3 bits (155), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 96/392 (24%), Positives = 148/392 (37%), Gaps = 74/392 (18%)
Query: 73 VSLTVGSPPQSVTMVLDTGSELSWLHCK--------KLPNLNS----VFNPQLSSSYNPT 120
++TVG+P + LDTGS+L WL C K P +S +++P SS+
Sbjct: 106 ANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKV 165
Query: 121 PCTSPVCKTRTRDFPIPVSCDPKNLCHATVSY-ADATSIEGNLATETFFVAGSPQPG--- 176
PC S +C R P++ C + Y ++ TS G L + + + +
Sbjct: 166 PCNSTLCTRGDR------CASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAI 219
Query: 177 ---TTFGCMDSGFTSNADEDSKTTGLMGMNRGSL---SFVAQMGLP--KFSYCISGSDSS 228
TFGC T + + GL G+ + S +A+ G+ FS C G+D +
Sbjct: 220 PARVTFGCGQVQ-TGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCF-GNDGA 277
Query: 229 GVLLFGDAKFAWLGPLRYTPM-VKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVP 287
G + FGD R TP+ +++ P Y + + I VG LE
Sbjct: 278 GRISFGDKGSV---DQRETPLNIRQPHP-------TYNITVTKISVGGNTGDLEF----- 322
Query: 288 DHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVG 347
+ DSGT FT+L Y + E F + D F+ CY +
Sbjct: 323 ------DAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFE----YCYALS 372
Query: 348 SNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDT-VYCFTFGNSELVG 406
N+ S AV L +G Y V +DT VYC E +
Sbjct: 373 PNKDS---FQYPAVNLTMKGGSS--------YPVYHPLVVIPMKDTDVYCLAIMKIEDIS 421
Query: 407 IEAYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
I IG + + FD +G+ ++ C
Sbjct: 422 I----IGQNFMTGYRVVFDREKLILGWKESDC 449
>AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19060485-19063248 REVERSE LENGTH=528
Length = 528
Score = 63.2 bits (152), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 88/391 (22%), Positives = 152/391 (38%), Gaps = 69/391 (17%)
Query: 74 SLTVGSPPQSVTMVLDTGSELSWLHC-------KKLPNLN-------SVFNPQLSSSYNP 119
+++VG+PP S + LDTGS+L WL C + L ++ +++ P S++ +
Sbjct: 105 NVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTTSSS 164
Query: 120 TPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVAGSPQ----- 174
C+ C F P ++C +SY+++T +G L + +A +
Sbjct: 165 IRCSDKRC------FGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPV 218
Query: 175 -PGTTFGC--MDSGF--TSNADEDSKTTGLMGMNRGSLSFVAQMGLPKFSYCISGSDSSG 229
T GC +G +N+ G+ G + SL A + FS C
Sbjct: 219 KANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFG------ 272
Query: 230 VLLFGDAKFAWLGPLRYTPMVKESTP-LPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPD 288
+ G+ G YT +E TP + AY V + G+ V + + +F
Sbjct: 273 -RVIGNVGRISFGDRGYTD--QEETPFISVAPSTAYGVNISGVSVAGDPVDIR--LFA-- 325
Query: 289 HTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGS 348
D+G+ FT L P Y L + F + + DP F+ CY +
Sbjct: 326 -------KFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPV-DPELPFE----FCYDLSP 373
Query: 349 NRKSXXXXXXXAVTLVFEGAEMS-VSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGI 407
N A T+ F EM+ + G +++ A + +YC G + VG+
Sbjct: 374 N----------ATTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYC--LGVLKSVGL 421
Query: 408 EAYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
+ VIG + + FD +G+ + C
Sbjct: 422 KINVIGQNFVAGYRIVFDRERMILGWKQSLC 452
>AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6411720-6413170 REVERSE LENGTH=405
Length = 405
Score = 62.8 bits (151), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 89/377 (23%), Positives = 147/377 (38%), Gaps = 42/377 (11%)
Query: 74 SLTVGSPPQS-VTMVLDTGSELSWLHCKKLPNLNSVFNPQLSSSYNPTPCTSPVCKTRTR 132
+ VGS +S V ++LD G+ L+WL C+KL +L SS C S CK+
Sbjct: 42 TFNVGSAAKSPVNLLLDLGTNLTWLDCRKLKSL---------SSLRLVTCQSSTCKSIPG 92
Query: 133 DFPIPVSC---DPKNLCHATVSYADATSIEGNLATET--FFVAGSPQPGTTFGCMDSGFT 187
+ SC P L V +L T F++ TF C +G
Sbjct: 93 NGCAGKSCLYKQPNPLGQNPVVTGRVVQDRASLYTTDGGKFLSQVSVRHFTFSC--AGEK 150
Query: 188 SNADEDSKTTGLMGMNRGSLSFVAQMG-----LPKFSYCISGSDSSGVLLFGDAKFAWLG 242
+ G++ ++ GS SF Q+ +PKFS C+ S + + G F +
Sbjct: 151 ALQGLPPPVDGVLALSPGSSSFTKQVTSAFNVIPKFSLCLPSSGTGHFYIAGIHYF--IP 208
Query: 243 PLRYT--PMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQTMVDSG 300
P + P+ + TP+ D Y + ++ I VG L+L + +G + +
Sbjct: 209 PFNSSDNPIPRTLTPIKGTDSGDYLITVKSIYVGGTALKLNPDLL------TGGAKLSTV 262
Query: 301 TQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFV-FQGAMDLCYRVGSNRKSXXXXXXX 359
+T L +Y AL + F + K + + P+ F+ D G N +
Sbjct: 263 VHYTVLQTDIYNALAQSFTLKAKA-MGIAKVPSVAPFKHCFD-SRTAGKNLTAGPNVPVI 320
Query: 360 AVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYVIGHHHQQN 419
+ L E+ Y V K +TV C F + + VIG H Q+
Sbjct: 321 EIGLPGRIGEVKWG----FYGANTVVKVK---ETVMCLAFIDGGKTPKDLMVIGTHQLQD 373
Query: 420 VWMEFDLVNSRVGFADT 436
+EFD + + F+++
Sbjct: 374 HMLEFDFSGTVLAFSES 390
>AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3150843-3153380 FORWARD LENGTH=528
Length = 528
Score = 60.8 bits (146), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 93/404 (23%), Positives = 158/404 (39%), Gaps = 92/404 (22%)
Query: 75 LTVGSPPQSVTMVLDTGSELSWL--HCKKLPNLNSVFNPQLSS----SYNPTP------- 121
+ +G+P S + LDTGS L W+ +C + L S + L++ YNP+
Sbjct: 104 IDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 122 -CTSPVCKTRTRDFPIPVSCD-PKNLCHATVSYADATSIEGNLATETFFV---------- 169
C+ +C + + C+ PK C TV+Y + L E
Sbjct: 164 LCSHKLCDSAS-------DCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLM 216
Query: 170 --AGSPQPGTTFGCMDSGFTSNAD--EDSKTTGLMGMNRGSL---SFVAQMGLPK--FSY 220
+ S + GC G + D + GLMG+ + SF+++ GL + FS
Sbjct: 217 NGSSSVKARVVIGC---GKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSL 273
Query: 221 CISGSDSSGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIR---VGKKL 277
C D SG + FGD P +++STP D Y+ + G+ +G
Sbjct: 274 CFDEED-SGRIYFGDMG----------PSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSC 322
Query: 278 LQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQ 337
L+ S T +DSG FT+L +Y+ + E ++ + F+
Sbjct: 323 LK----------QTSFTTFIDSGQSFTYLPEEIYRKVALEIDRH-------INATSKNFE 365
Query: 338 G-AMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTV-Y 395
G + + CY + K A+ L F V + L + S+ V +
Sbjct: 366 GVSWEYCYESSAEPK------VPAIKLKFSHNNTFVIHKPLF-------VFQQSQGLVQF 412
Query: 396 CFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRCE 439
C S GI + IG ++ + M FD N ++G++ ++C+
Sbjct: 413 CLPISPSGQEGIGS--IGQNYMRGYRMVFDRENMKLGWSPSKCQ 454
>AT5G24820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:8523406-8525297 FORWARD LENGTH=407
Length = 407
Score = 57.4 bits (137), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/290 (21%), Positives = 124/290 (42%), Gaps = 47/290 (16%)
Query: 57 IPSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNPQLSSS 116
IP+ L F+ N L V +T+G+P ++ + LD+ + L+ L + + + S++
Sbjct: 34 IPNGFFLPFESN--LYVEITIGTPTRTFNLKLDSSTHLTCLDNDD--DHQCSLSDKSSNT 89
Query: 117 YNPTPCT-SPVCKTRTRDF-------------PIPVSCDPKNLCHATVSYADATSIEGNL 162
++ C S +C + ++ + + C P + C Y + S G L
Sbjct: 90 FSTISCNNSSLCPHVSTNYTNYFNATTTNTTTSVSLLCTPSDFCR----YEASPSSSGYL 145
Query: 163 ATETFFVAGSPQP---------GTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQM 213
++T + S G FGC + ++ G + + S ++Q+
Sbjct: 146 VSDTLQLTSSITDQENSLSIVRGFVFGCGARNRATPEEDGGGVDGRLSLTTHRFSLLSQL 205
Query: 214 GLPKFSYCI--SGSDSSGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGI 271
L +FS+C+ S + S + G A ++ G + PM+ + Y +Y V L GI
Sbjct: 206 RLTRFSHCLWPSAAGSRNYIRLGSAA-SYGGDMVLVPMLNMTGTEAY----SYHVALFGI 260
Query: 272 RVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQ 321
+G++ ++ +S + +D GT +T L +Y+ ++EE AQ
Sbjct: 261 SLGQQRMRSNESSGIA---------IDVGTYYTSLEPSLYEEVKEELTAQ 301
>AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:19627892-19629112 REVERSE LENGTH=406
Length = 406
Score = 57.0 bits (136), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/347 (20%), Positives = 119/347 (34%), Gaps = 61/347 (17%)
Query: 113 LSSSYNPTPCTSP-------VCKTRTRDF-PIPVSCDPKNLCHATVSYADATSIEGNLAT 164
L+ + P C+ P VC + F P C+ + +S + I ++
Sbjct: 83 LTRRFTPHQCSLPSNKIINGVCACQATAFEPFQRICNSDQFTYGDLSISSLKPISPSVTI 142
Query: 165 ETFFVAGSPQPGTTFGCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLP------KF 218
+ PQP D GL G+ +L+ Q+ P KF
Sbjct: 143 NNVYYLCIPQPFLV------------DFPPGVFGLAGLAPTALATWNQLTRPRLGLEKKF 190
Query: 219 SYCISGSDS---SGVLLFGDAKFAWLGP-----LRYTPMVKESTPLPYFDRVAYTVRLQG 270
+ C+ ++ G + FG + L YT ++ L Y + L+G
Sbjct: 191 ALCLPSDENPLKKGAIYFGGGPYKLRNIDARSMLSYTRLITNPRKLN-----NYFLGLKG 245
Query: 271 IRVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLD 330
I V + + F D G G + + FT L +Y+ E F T G+
Sbjct: 246 ISVNGNRILFAPNAFAFDRNGDGGVTLSTIFPFTMLRSDIYRVFIEAFSQATSGI----- 300
Query: 331 DPNFVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVS-GERLLYKVGDVAAAKG 389
P + C +N F+ + + +++K+ A K
Sbjct: 301 -PRVSSTTPFEFCLSTTTN---------------FQVPRIDLELANGVIWKLSPANAMKK 344
Query: 390 SEDTVYCFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADT 436
D V C F N +A +IG H +N +EFD+ S GF+ +
Sbjct: 345 VSDDVACLAFVNGGDAAAQAVMIGIHQMENTLVEFDVGRSAFGFSSS 391
>AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19064294-19066560 REVERSE LENGTH=488
Length = 488
Score = 55.5 bits (132), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 73/288 (25%), Positives = 113/288 (39%), Gaps = 66/288 (22%)
Query: 59 SSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNS----------- 107
S+ ++SF H ++T+G+P Q + LDTGS+L WL C N NS
Sbjct: 81 STEEISFLH----YANVTIGTPAQWFLVALDTGSDLFWLPC----NCNSTCVRSMETDQG 132
Query: 108 ------VFNPQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGN 161
++NP S S + C S +C R R P + C + Y S
Sbjct: 133 ERIKLNIYNPSKSKSSSKVTCNSTLCALRNR------CISPVSDCPYRIRYLSPGSKSTG 186
Query: 162 LATETFFVAGSPQPG------TTFGCMDSGFTSNADEDSKTTGLMGMNRGSLS---FVAQ 212
+ E + S + G TFGC +S ++ G+MG+ ++ + +
Sbjct: 187 VLVED-VIHMSTEEGEARDARITFGCSESQL--GLFKEVAVNGIMGLAIADIAVPNMLVK 243
Query: 213 MGLP--KFSYCISGSDSSGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQG 270
G+ FS C G + G + FGD + TP+ +P+ Y V +
Sbjct: 244 AGVASDSFSMCF-GPNGKGTISFGDKGSS---DQLETPLSGTISPM------FYDVSITK 293
Query: 271 IRVGKKLLQLEKSIFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEF 318
+VGK + E + DSGT T+L+ P Y AL F
Sbjct: 294 FKVGKVTVDTEFT-----------ATFDSGTAVTWLIEPYYTALTTNF 330
>AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18151161-18153186 FORWARD LENGTH=410
Length = 410
Score = 53.5 bits (127), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/399 (19%), Positives = 156/399 (39%), Gaps = 48/399 (12%)
Query: 63 LSFQHNVTLTVSLTVGSPP--QSVTMVLDTGSELSWLHCKK-----LPNLNSVFNPQLSS 115
+ Q + + VG P Q + +DTGSEL+W+ C N ++ P+
Sbjct: 22 MCIQMGMLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKD- 80
Query: 116 SYNPTPCTSPVCKTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVA---GS 172
N + C R+ + C+ + C + YAD + G L + F + GS
Sbjct: 81 --NLVRSSEAFCVEVQRN-QLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGS 137
Query: 173 -PQPGTTFGCMDSGFTSNA---DEDSKTTGLMGMNRGSLSFVAQMGLPKF-----SYCIS 223
+ FGC G+ + KT G++G++R +S +Q+ +C++
Sbjct: 138 LAESDIVFGC---GYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLA 194
Query: 224 GSDSSGVLLFGDAKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKS 283
+ +F + + + PM+ +S D AY +++ + G+ +L L+
Sbjct: 195 SDLNGEGYIFMGSDLVPSHGMTWVPMLHDSR----LD--AYQMQVTKMSYGQGMLSLDG- 247
Query: 284 IFVPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLC 343
++ G+ + D+G+ +T+ Y L + + G+ DD + + +C
Sbjct: 248 ----ENGRVGKVLFDTGSSYTYFPNQAYSQLVTS-LQEVSGLELTRDDSD----ETLPIC 298
Query: 344 YRVGSN----RKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTF 399
+R +N S +TL G++ + +LL + D +
Sbjct: 299 WRAKTNFPFSSLSDVKKFFRPITLQI-GSKWLIISRKLLIQPEDYLIISNKGNVCLGILD 357
Query: 400 GNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTRC 438
G+S G ++G + + +D V R+G+ + C
Sbjct: 358 GSSVHDG-STIILGDISMRGHLIVYDNVKRRIGWMKSDC 395
>AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:787143-788444 FORWARD LENGTH=433
Length = 433
Score = 53.1 bits (126), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 88/403 (21%), Positives = 146/403 (36%), Gaps = 57/403 (14%)
Query: 66 QHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNPQLSSSYNPTPCTSP 125
Q + T + +P ++V D G W+ C K +SS+Y C S
Sbjct: 39 QSTLQYTTVINQRTPLVPASVVFDLGGRELWVDCDK---------GYVSSTYQSPRCNSA 89
Query: 126 VCKTRTRD-----FPIPVSCDPKNLCHA----TVSYADATSIEGNLATETFFVAGSPQPG 176
VC F P N C TV+ ATS E L + PG
Sbjct: 90 VCSRAGSTSCGTCFSPPRPGCSNNTCGGIPDNTVT-GTATSGEFALDVVSIQSTNGSNPG 148
Query: 177 TTFGC----MDSGFTSNADEDSK-TTGLMGMNRGSLSFVAQMGLP-----KFSYCISGSD 226
D G T +K T G+ GM R ++ +Q KF+ C++
Sbjct: 149 RVVKIPNLIFDCGATFLLKGLAKGTVGMAGMGRHNIGLPSQFAAAFSFHRKFAVCLT--S 206
Query: 227 SSGVLLFGDAKFAWL----------GPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKK 276
GV FG+ + +L PL P+ S Y + + I++ +K
Sbjct: 207 GKGVAFFGNGPYVFLPGIQISSLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEK 266
Query: 277 LLQLEKSIF-VPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQT--KGVLTLLDDPN 333
+ + ++ + TG G T + S +T L +Y A EFV Q + + +
Sbjct: 267 TVPINPTLLKINASTGIGGTKISSVNPYTVLESSIYNAFTSEFVKQAAARSIKRVASVKP 326
Query: 334 FVFQGAMDLCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDT 393
F GA VG R + E+ + + +++++ + D
Sbjct: 327 F---GACFSTKNVGVTR----------LGYAVPEIELVLHSKDVVWRIFGANSMVSVSDD 373
Query: 394 VYCFTFGNSELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADT 436
V C F + + + VIG ++ +EFDL +++ GF+ T
Sbjct: 374 VICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNKFGFSST 416
>AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18150638-18153186 FORWARD LENGTH=583
Length = 583
Score = 53.1 bits (126), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/387 (19%), Positives = 153/387 (39%), Gaps = 48/387 (12%)
Query: 75 LTVGSPP--QSVTMVLDTGSELSWLHCKK-----LPNLNSVFNPQLSSSYNPTPCTSPVC 127
+ VG P Q + +DTGSEL+W+ C N ++ P+ N + C
Sbjct: 207 ILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKD---NLVRSSEAFC 263
Query: 128 KTRTRDFPIPVSCDPKNLCHATVSYADATSIEGNLATETFFVA---GS-PQPGTTFGCMD 183
R+ + C+ + C + YAD + G L + F + GS + FGC
Sbjct: 264 VEVQRN-QLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGC-- 320
Query: 184 SGFTSNA---DEDSKTTGLMGMNRGSLSFVAQMGLPKF-----SYCISGSDSSGVLLFGD 235
G+ + KT G++G++R +S +Q+ +C++ + +F
Sbjct: 321 -GYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMG 379
Query: 236 AKFAWLGPLRYTPMVKESTPLPYFDRVAYTVRLQGIRVGKKLLQLEKSIFVPDHTGSGQT 295
+ + + PM+ +S D AY +++ + G+ +L L+ ++ G+
Sbjct: 380 SDLVPSHGMTWVPMLHDSR----LD--AYQMQVTKMSYGQGMLSLDG-----ENGRVGKV 428
Query: 296 MVDSGTQFTFLLGPVYKALREEFVAQTKGVLTLLDDPNFVFQGAMDLCYRVGSN----RK 351
+ D+G+ +T+ Y L + + G+ DD + + +C+R +N
Sbjct: 429 LFDTGSSYTYFPNQAYSQLVTS-LQEVSGLELTRDDSD----ETLPICWRAKTNFPFSSL 483
Query: 352 SXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGNSELVGIEAYV 411
S +TL G++ + +LL + D + G+S G +
Sbjct: 484 SDVKKFFRPITLQI-GSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDG-STII 541
Query: 412 IGHHHQQNVWMEFDLVNSRVGFADTRC 438
+G + + +D V R+G+ + C
Sbjct: 542 LGDISMRGHLIVYDNVKRRIGWMKSDC 568
>AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:7568286-7569455 FORWARD LENGTH=389
Length = 389
Score = 52.4 bits (124), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 82/202 (40%), Gaps = 35/202 (17%)
Query: 53 VSIPIPSSRKLSFQHNVTLTVSLTVGSPPQSVTMVLDTGSELSWLHCKKLPN-----LNS 107
VS+P+ S Q + + GSP + + +DTGS L+W C + +
Sbjct: 43 VSLPLSSPHS---QRGLAFMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYP 99
Query: 108 VFNPQLSSSYNPTPCTSPVCKTRTRDFPIPVSCDP-KNLCHATVSYADATSIEGNLATET 166
+ P S +Y C K+ + DP +C Y D T+I+G LA E
Sbjct: 100 KYRPAASITYRDAMCEDSHPKSNPH-----FAFDPLTRICTYQQHYLDETNIKGTLAQEM 154
Query: 167 FFV----AGSPQ-PGTTFGCM----DSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLPK 217
V G + G FGC S FT TG++G+ G S + + G K
Sbjct: 155 ITVDTHDGGFKRVHGVYFGCNTLSDGSYFTG--------TGILGLGVGKYSIIGEFG-SK 205
Query: 218 FSYC---ISGSDSSGVLLFGDA 236
FS+C IS +S L+ GD
Sbjct: 206 FSFCLGEISEPKASHNLILGDG 227
>AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:790110-791414 FORWARD LENGTH=434
Length = 434
Score = 48.9 bits (115), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 82/395 (20%), Positives = 148/395 (37%), Gaps = 67/395 (16%)
Query: 79 SPPQSVTMVLDTGSELSWLHCKKLPNLNSVFNPQLSSSYNPTPCTSPVCKTRTRDFPIPV 138
+P ++V D G W+ C + +S++Y C S VC +R
Sbjct: 53 TPLVPASVVFDLGGREFWVDCDQ---------GYVSTTYRSPRCNSAVC-SRAGSIACGT 102
Query: 139 SCDP------KNLCHATVSYAD------ATSIEGNLATETFFVAGSPQPGT-------TF 179
P N C A + D ATS E L + PG F
Sbjct: 103 CFSPPRPGCSNNTCGA---FPDNSITGWATSGEFALDVVSIQSTNGSNPGRFVKIPNLIF 159
Query: 180 GCMDSGFTSNADEDSKTTGLMGMNRGSLSFVAQMGLP-----KFSYCISGSDSSGVLLFG 234
C + + + G+ GM R ++ Q KF+ C++ GV FG
Sbjct: 160 SCGSTSLLKGLAKGA--VGMAGMGRHNIGLPLQFAAAFSFNRKFAVCLT--SGRGVAFFG 215
Query: 235 DAKFAWL-----GPLRYTPM-VKESTPLPYFDR----VAYTVRLQGIRVGKKLLQLEKSI 284
+ + +L L+ TP+ + T + F + Y + + I++ +K L ++ ++
Sbjct: 216 NGPYVFLPGIQISRLQKTPLLINPGTTVFEFSKGEKSPEYFIGVTAIKIVEKTLPIDPTL 275
Query: 285 F-VPDHTGSGQTMVDSGTQFTFLLGPVYKALREEFVAQT--KGVLTLLDDPNFVFQGAMD 341
+ TG G T + S +T L +YKA EF+ Q + + + F GA
Sbjct: 276 LKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQAAARSIKRVASVKPF---GACF 332
Query: 342 LCYRVGSNRKSXXXXXXXAVTLVFEGAEMSVSGERLLYKVGDVAAAKGSEDTVYCFTFGN 401
VG R + ++ + + +++++ + D V C F +
Sbjct: 333 STKNVGVTR----------LGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDVICLGFVD 382
Query: 402 SELVGIEAYVIGHHHQQNVWMEFDLVNSRVGFADT 436
+ + VIG ++ +EFDL +++ GF+ T
Sbjct: 383 GGVNPGASVVIGGFQLEDNLIEFDLASNKFGFSST 417