Miyakogusa Predicted Gene
- Lj3g3v0937980.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0937980.1 Non Characterized Hit- tr|I1MJG1|I1MJG1_SOYBN
Uncharacterized protein OS=Glycine max PE=3
SV=1,66.52,0,PEPSIN,Peptidase A1; no description,Peptidase aspartic,
catalytic; Asp,Peptidase A1; CHLOROPLAST NUC,CUFF.41703.1
(500 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments: Score (bits) E Value
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 361 e-100
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 197 2e-50
AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 169 6e-42
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 166 3e-41
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 162 5e-40
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 158 9e-39
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 156 3e-38
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 153 3e-37
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 145 8e-35
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 138 1e-32
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 135 9e-32
AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 134 2e-31
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 133 3e-31
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 127 3e-29
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 119 6e-27
AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family pr... 117 1e-26
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 117 3e-26
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 115 5e-26
AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family pr... 113 4e-25
AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 6e-25
AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 4e-24
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 104 1e-22
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 103 2e-22
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 103 3e-22
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 5e-22
AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 101 1e-21
AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 2e-21
AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 2e-21
AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 3e-21
AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 97 2e-20
AT3G12700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 97 2e-20
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 97 3e-20
AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family pr... 96 8e-20
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 95 1e-19
AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 6e-19
AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 6e-19
AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 92 1e-18
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 91 2e-18
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 91 2e-18
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 91 2e-18
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 89 5e-18
AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 88 1e-17
AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family pr... 87 2e-17
AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family pr... 87 4e-17
AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 86 5e-17
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 85 1e-16
AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family pr... 84 2e-16
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 79 5e-15
AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 76 6e-14
AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 2e-11
AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 3e-11
AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family pr... 66 4e-11
AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 64 3e-10
AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 1e-09
AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 61 2e-09
AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family pr... 56 5e-08
AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family pr... 54 3e-07
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 361 bits (927), Expect = e-100, Method: Compositional matrix adjust.
Identities = 195/439 (44%), Positives = 264/439 (60%), Gaps = 40/439 (9%)
Query: 69 RDTLRRQSMNQRFGLRNSNNGSHR---RKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTP 125
RDTL + +++ + ++ H RK + V ++ + SG DYG +YF +++VGTP
Sbjct: 56 RDTLLPKPLSRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTP 115
Query: 126 GQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGV 185
+KF + DTGSE TW N ++ K V
Sbjct: 116 AKKFRVVVDTGSELTWVNCRYRARGKDN-----------------------------RRV 146
Query: 186 FCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTI 245
F S++FKTV C ++ CKV+L +LFSLT CP PS PC YD Y DGS+A+G F +TI
Sbjct: 147 FRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETI 206
Query: 246 TVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSY 305
TV L+NGR +L IGC+ + G +F + G+LGL ++ +F A YG KFSY
Sbjct: 207 TVGLTNGRMARLPGHLIGCSSSF-TGQSF-QGADGVLGLAFSDFSFTSTATSLYGAKFSY 264
Query: 306 CLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLA--APFYGVNVVGISVGGQMLKIPS 363
CLVDHLS++NVS+YL FG+ + + R T L L PFY +NV+GIS+G ML IPS
Sbjct: 265 CLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPS 324
Query: 364 QVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRV-PAGDFGGLDYCFD-A 421
QVWD + GGTI+DSGT+LT LA AY+Q+ L + L ++KRV P G ++YCF
Sbjct: 325 QVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGV--PIEYCFSFT 382
Query: 422 KGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAINGPGASVIGNIMQQNH 481
GF+ S +P+L FH GG RFEP KSY++D AP VKC+G ++ P +VIGNIMQQN+
Sbjct: 383 SGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNY 442
Query: 482 LWEFDLAHNTVGFAPSACN 500
LWEFDL +T+ FAPSAC
Sbjct: 443 LWEFDLMASTLSFAPSACT 461
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 197 bits (500), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 135/428 (31%), Positives = 208/428 (48%), Gaps = 56/428 (13%)
Query: 90 SHRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSV---H 146
S RRK V+ P+ SG G G+YFV +++G P Q L ADTGS+ W +
Sbjct: 60 SLRRKPIPFVK--SPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRN 117
Query: 147 KTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCK- 205
+H+ T VF P+ S TF C C+
Sbjct: 118 CSHHSPAT------------------------------VFFPRHSSTFSPAHCYDPVCRL 147
Query: 206 VELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCT 265
V D + + C Y+ Y DGS G F +T +++ S+G++ +L ++ GC
Sbjct: 148 VPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCG 207
Query: 266 KTI----VNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLT 321
I V+G +FN G++GLG +F + ++G KFSYCL+D+ +SYL
Sbjct: 208 FRISGQSVSGTSFN-GANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLI 266
Query: 322 FGTPKVKLLSEMRRTELF---LAAPFYGVNVVGISVGGQMLKIPSQVWDFN--AQGGTII 376
G +S++ T L L+ FY V + + V G L+I +W+ + GGT++
Sbjct: 267 IGN-GGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVV 325
Query: 377 DSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDF--GGLDYCFDAKGF--DESSVPRL 432
DSGTTL LA PAY + A+++ + ++P D G D C + G E +PRL
Sbjct: 326 DSGTTLAFLAEPAYRSVIAAVRRRV----KLPIADALTPGFDLCVNVSGVTKPEKILPRL 381
Query: 433 VFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAINGP-GASVIGNIMQQNHLWEFDLAHNT 491
F F+GG F PP ++Y I+ Q++C+ + +++ G SVIGN+MQQ L+EFD +
Sbjct: 382 KFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSR 441
Query: 492 VGFAPSAC 499
+GF+ C
Sbjct: 442 LGFSRRGC 449
>AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:22880074-22881525 REVERSE LENGTH=483
Length = 483
Score = 169 bits (427), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 141/453 (31%), Positives = 203/453 (44%), Gaps = 80/453 (17%)
Query: 69 RDTLRRQSMNQRFGLRNSNNGSHRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQK 128
RD+LR +S+ + N + +R F + SG G GEYF+++ VGTP
Sbjct: 89 RDSLRVKSITSLAAVSTGRNAT-KRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATN 147
Query: 129 FWLAADTGSEFTWF--NSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVF 186
++ DTGS+ W + +N+T + +F
Sbjct: 148 VYMVLDTGSDVVWLQCSPCKACYNQT------------------------------DAIF 177
Query: 187 CPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTIT 246
P++S+TF TV C SR C+ L D S + S CLY +SY DGS +G F ++T+T
Sbjct: 178 DPKKSKTFATVPCGSRLCR-RLDD--SSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT 234
Query: 247 VELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGIL-------GLGYAKDAFVDKAALQY 299
++ ++ +GC D G+ GLG +F + +Y
Sbjct: 235 FH-----GARVDHVPLGC----------GHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRY 279
Query: 300 GGKFSYCLVDHLSHQNVSSY---LTFG---TPKVKLLSEMRRTELFLAAP----FYGVNV 349
GKFSYCLVD S + S + FG PK + + + L P FY + +
Sbjct: 280 NGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPL------LTNPKLDTFYYLQL 333
Query: 350 VGISVGGQMLKIPSQV---WDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKR 406
+GISVGG + S+ D GG IIDSGT++T L PAY L +A + TK+KR
Sbjct: 334 LGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKR 393
Query: 407 VPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAIN 466
P+ + D CFD G VP +VFHF GG P +Y+I V + + A
Sbjct: 394 APS--YSLFDTCFDLSGMTTVKVPTVVFHFGGG-EVSLPASNYLIPVNTEGRFCFAFAGT 450
Query: 467 GPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
S+IGNI QQ +DL + VGF AC
Sbjct: 451 MGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 166 bits (421), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 149/512 (29%), Positives = 220/512 (42%), Gaps = 80/512 (15%)
Query: 28 GFNDLEEEEVQGMSME--LVHRHDARRFAGEVDQV--EAIKGFILRDTLRRQSMNQRFGL 83
GF+ E+E + + E V H RR ++ ++ +RD R Q++++R
Sbjct: 61 GFSSPEKEPTKERTGENKTVKFHLKRRETTTTEKATTNSVLELQIRDLTRIQTLHKRVLE 120
Query: 84 RNSNNG---SHRRKDSEMV--------------QFQLPMHSGRDYGLGEYFVQVKVGTPG 126
+N+ N ++ D E+V Q + SG G GEYF+ V VG+P
Sbjct: 121 KNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPP 180
Query: 127 QKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPC---- 182
+ F L DTGS+ W + PC
Sbjct: 181 KHFSLILDTGSDLNWIQCL-----------------------------------PCYDCF 205
Query: 183 --NGVFC-PQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGF 239
NG F P+ S ++K +TC+ ++C + +S C + C Y Y D S+ G
Sbjct: 206 QQNGAFYDPKASASYKNITCNDQRCNL-VSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGD 264
Query: 240 FGSDTITVELS-NGRKGKLHN---LTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKA 295
F +T TV L+ NG +L+N + GC N F+ G + +F +
Sbjct: 265 FAVETFTVNLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAGLLGLGR-GPLSFSSQL 321
Query: 296 ALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAP------FYGVNV 349
YG FSYCLVD S NVSS L FG K L F+A FY V +
Sbjct: 322 QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQI 381
Query: 350 VGISVGGQMLKIPSQVWDFNAQG--GTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRV 407
I V G++L IP + W+ ++ G GTIIDSGTTL+ A PAYE + + + K K
Sbjct: 382 KSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEK-AKGKYP 440
Query: 408 PAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAING 467
DF LD CF+ G +P L FA G + P ++ I + + C+ +L
Sbjct: 441 VYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPK 500
Query: 468 PGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
S+IGN QQN +D + +G+AP+ C
Sbjct: 501 SAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 162 bits (410), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 129/413 (31%), Positives = 185/413 (44%), Gaps = 69/413 (16%)
Query: 101 FQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXX 160
F + SG G GEYF ++ VGTP + ++ DTGS+ W +Q+
Sbjct: 127 FSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQS------ 180
Query: 161 XXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKP 220
+ +F P++S+T+ T+ CSS C+ C
Sbjct: 181 ----------------------DPIFDPRKSKTYATIPCSSPHCR-----RLDSAGCNTR 213
Query: 221 SDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGG 280
CLY +SY DGS G F ++T+T R+ ++ + +GC D G
Sbjct: 214 RKTCLYQVSYGDGSFTVGDFSTETLTF-----RRNRVKGVALGC----------GHDNEG 258
Query: 281 IL-------GLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEM 333
+ GLG K +F + ++ KFSYCLVD + SS + FG V S +
Sbjct: 259 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS-VVFGNAAV---SRI 314
Query: 334 RRTELFLAAP----FYGVNVVGISVGGQMLK-IPSQVWDFN--AQGGTIIDSGTTLTNLA 386
R L+ P FY V ++GISVGG + + + ++ + GG IIDSGT++T L
Sbjct: 315 ARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLI 374
Query: 387 LPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPV 446
PAY + +A + +KR P DF D CFD +E VP +V HF G P
Sbjct: 375 RPAYIAMRDAFRVGAKTLKRAP--DFSLFDTCFDLSNMNEVKVPTVVLHFRGA-DVSLPA 431
Query: 447 KSYIIDVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
+Y+I V K A G S+IGNI QQ +DLA + VGFAP C
Sbjct: 432 TNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 158 bits (399), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 135/465 (29%), Positives = 200/465 (43%), Gaps = 73/465 (15%)
Query: 68 LRDTLRRQSMNQRFGLRNSNNGSHRRK----DSEMV--------QFQLPMHSGRDYGLGE 115
++D R ++++ RF RK D +V + + SG G GE
Sbjct: 100 IQDLTRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIATLESGMTLGSGE 159
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXX 175
YF+ V VGTP + F L DTGS+ W +
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCL------------------------------ 189
Query: 176 XXXNNPC------NGVFC-PQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDI 228
PC NG+F P+ S +FK +TC+ +C + +S C + C Y
Sbjct: 190 -----PCYDCFHQNGMFYDPKTSASFKNITCNDPRCSL-ISSPDPPVQCESDNQSCPYFY 243
Query: 229 SYVDGSSAKGFFGSDTITVELSNGRKG----KLHNLTIGCTKTIVNGVTFNEDTGGILGL 284
Y D S+ G F +T TV L+ G K+ N+ GC N F+ +G +
Sbjct: 244 WYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGH--WNRGLFSGASGLLGLG 301
Query: 285 GYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFL---- 340
+F + YG FSYCLVD S+ NVSS L FG K L F+
Sbjct: 302 RGPL-SFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKE 360
Query: 341 --AAPFYGVNVVGISVGGQMLKIPSQVWDFNA--QGGTIIDSGTTLTNLALPAYEQLFEA 396
FY + + I VGG+ L IP + W+ ++ GGTIIDSGTTL+ A PAYE +
Sbjct: 361 NSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNK 420
Query: 397 LKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSV--PRLVFHFAGGVRFEPPVKSYIIDVA 454
+ + + + DF LD CF+ G +E+++ P L F G + P ++ I ++
Sbjct: 421 FAEKMKENYPI-FRDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLS 479
Query: 455 PQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
+ C+ +L S+IGN QQN +D + +GF P+ C
Sbjct: 480 EDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 156 bits (395), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 131/477 (27%), Positives = 205/477 (42%), Gaps = 80/477 (16%)
Query: 33 EEEEVQGMSMELVHRHDARRFAGEVDQVE-AIKGFILRDTLRRQSMNQRFGLRNSNNGSH 91
+ ++ + +++ + R D+ R AG V ++ A++G R L+ N
Sbjct: 94 QHKDYKSLTLSRLER-DSSRVAGIVAKIRFAVEGV------------DRSDLKPVYNEDT 140
Query: 92 RRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNK 151
R + ++ P+ SG G GEYF ++ VGTP ++ +L DTGS+ W
Sbjct: 141 RYQTEDLTT---PVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCY 197
Query: 152 TQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDL 211
Q+ + VF P S T+K++TCS+ +C L
Sbjct: 198 QQS----------------------------DPVFNPTSSSTYKSLTCSAPQCS-----L 224
Query: 212 FSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNG 271
+ C S+ CLY +SY DGS G +DT+T G GK++N+ +GC
Sbjct: 225 LETSACR--SNKCLYQVSYGDGSFTVGELATDTVTF----GNSGKINNVALGC------- 271
Query: 272 VTFNEDTGGIL----GLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKV 327
D G+ GL + FSYCLVD S + SS L F + ++
Sbjct: 272 ---GHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGK--SSSLDFNSVQL 326
Query: 328 ---KLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQG--GTIIDSGTTL 382
+ + R + FY V + G SVGG+ + +P ++D +A G G I+D GT +
Sbjct: 327 GGGDATAPLLRNKKI--DTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAV 384
Query: 383 TNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRF 442
T L AY L +A K +K+ + D C+D VP + FHF GG
Sbjct: 385 TRLQTQAYNSLRDAFLKLTVNLKK-GSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSL 443
Query: 443 EPPVKSYIIDVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
+ P K+Y+I V A S+IGN+ QQ +DL+ N +G + + C
Sbjct: 444 DLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 153 bits (386), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 133/469 (28%), Positives = 202/469 (43%), Gaps = 64/469 (13%)
Query: 41 SMELVHRHD--ARRFAGEVDQVEAIKGFILR-DTLRRQSMNQRFGLRNSNNGSHRRKDSE 97
S+ + HRH +R G+ + ++ ILR D R S++ + + + + K ++
Sbjct: 61 SLHVTHRHGTCSRLNNGKATSPDHVE--ILRLDQARVNSIHSKLSKKLATDHVSESKSTD 118
Query: 98 MVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFN---SVHKTHNKTQT 154
LP G G G Y V V +GTP L DTGS+ TW V +++ +
Sbjct: 119 -----LPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP 173
Query: 155 XXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSL 214
+F P +S ++ V+CSS C S +
Sbjct: 174 ------------------------------IFNPSKSTSYYNVSCSSAACGSLSSATGNA 203
Query: 215 TYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTF 274
C + C+Y I Y D S + GF + T+ S+ G + GC + N
Sbjct: 204 GSCSASN--CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDG----VYFGCGE---NNQGL 254
Query: 275 NEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMR 334
G+LGLG K +F + A Y FSYCL S+ + +LTFG+ + +
Sbjct: 255 FTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASY---TGHLTFGSAGISRSVKFT 311
Query: 335 RTELFL-AAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQL 393
FYG+N+V I+VGGQ L IPS V+ + G +IDSGT +T L AY L
Sbjct: 312 PISTITDGTSFYGLNIVAITVGGQKLPIPSTVF---STPGALIDSGTVITRLPPKAYAAL 368
Query: 394 FEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKS--YII 451
+ K ++K LD CFD GF ++P++ F F+GG E K Y+
Sbjct: 369 RSSFKAKMSKYPTTSGVSI--LDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVF 426
Query: 452 DVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
++ QV + A++ GN+ QQ +D A VGFAP+ C+
Sbjct: 427 KIS-QVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 145 bits (365), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 190/444 (42%), Gaps = 68/444 (15%)
Query: 69 RDTLRRQSMNQRFGLRNSNNGSHRRK------DSEMVQFQLPMHSGRDYGLGEYFVQVKV 122
RDT R +S+ R L +N K +E + P+ SG G GEYF +V +
Sbjct: 95 RDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGI 154
Query: 123 GTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPC 182
G P ++ ++ DTGS+ W QT
Sbjct: 155 GKPAREVYMVLDTGSDVNWLQCTPCADCYHQT---------------------------- 186
Query: 183 NGVFCPQRSRTFKTVTCSSRKCK-VELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFG 241
+F P S +++ ++C + +C +E+S+ + T CLY++SY DGS G F
Sbjct: 187 EPIFEPSSSSSYEPLSCDTPQCNALEVSECRNAT--------CLYEVSYGDGSYTVGDFA 238
Query: 242 SDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYG- 300
++T+T+ + N+ +GC + NE + Q
Sbjct: 239 TETLTI-----GSTLVQNVAVGCGHS-------NEGLFVGAAGLLGLGGGLLALPSQLNT 286
Query: 301 GKFSYCLVDHLSHQNVSSYLTFGT---PKVKLLSEMRRTELFLAAPFYGVNVVGISVGGQ 357
FSYCLVD S + +S + FGT P + +R +L FY + + GISVGG+
Sbjct: 287 TSFSYCLVDRDS--DSASTVDFGTSLSPDAVVAPLLRNHQL---DTFYYLGLTGISVGGE 341
Query: 358 MLKIPSQVW--DFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGL 415
+L+IP + D + GG IIDSGT +T L Y L ++ K +++ A
Sbjct: 342 LLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEK--AAGVAMF 399
Query: 416 DYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAINGPGASVIGN 475
D C++ VP + FHF GG P K+Y+I V A ++IGN
Sbjct: 400 DTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGN 459
Query: 476 IMQQNHLWEFDLAHNTVGFAPSAC 499
+ QQ FDLA++ +GF+ + C
Sbjct: 460 VQQQGTRVTFDLANSLIGFSSNKC 483
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 138 bits (347), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 134/465 (28%), Positives = 202/465 (43%), Gaps = 64/465 (13%)
Query: 39 GMSMELVHRHDARRFAGEVDQVEAIKGFILRDTLRRQSMNQRFGLRNSNNGSHRRKDSEM 98
G + +L+HR + + + + LR+ + R S+N+ F H +
Sbjct: 30 GFTADLIHRDSPKSPFYNPMETSSQR---LRNAIHR-SVNRVF---------HFTEKDNT 76
Query: 99 VQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXX 158
Q Q+ + S GEY + V +GTP ADTGS+ W TQ
Sbjct: 77 PQPQIDLTSNS----GEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQV---- 128
Query: 159 XXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCP 218
+ +F P+ S T+K V+CSS +C L + C
Sbjct: 129 ------------------------DPLFDPKTSSTYKDVSCSSSQCTA----LENQASCS 160
Query: 219 KPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDT 278
+ C Y +SY D S KG DT+T+ S+ R +L N+ IGC N TFN+
Sbjct: 161 TNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHN--NAGTFNKKG 218
Query: 279 GGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTEL 338
GI+GLG + + + GKFSYCLV S ++ +S + FGT + S + T L
Sbjct: 219 SGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPL 278
Query: 339 FLAAP---FYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFE 395
A FY + + ISVG + ++ S +++G IIDSGTTLT L Y +L +
Sbjct: 279 IAKASQETFYYLTLKSISVGSKQIQY-SGSDSESSEGNIIIDSGTTLTLLPTEFYSELED 337
Query: 396 ALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGG-VRFEPPVKSYIIDVA 454
A+ S+ K+ GL C+ A G + VP + HF G V+ + + + V+
Sbjct: 338 AVASSIDAEKKQDPQ--SGLSLCYSATG--DLKVPVITMHFDGADVKLDS--SNAFVQVS 391
Query: 455 PQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
+ C P S+ GN+ Q N L +D TV F P+ C
Sbjct: 392 EDLVCFAFRG--SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 135 bits (339), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 136/472 (28%), Positives = 206/472 (43%), Gaps = 77/472 (16%)
Query: 39 GMSMELVHRHDARRFAGEVDQVEAIKGFILRDTLRRQSMNQRFGLRNSNNGSHRRKDSEM 98
G +++L+HR + + + + +R+ +RR + R L+ SN+ D+
Sbjct: 25 GFTIDLIHRDSPKSPFYNSAETSSQR---MRNAIRRSA---RSTLQFSND------DASP 72
Query: 99 VQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXX 158
Q + S R GEY + + +GTP ADTGS+ W TQ
Sbjct: 73 NSPQSFITSNR----GEYLMNISIGTPPVPILAIADTGSDLIW----------TQC---- 114
Query: 159 XXXXXXXXXXXXXXXXXXXXNNPCNG-------VFCPQRSRTFKTVTCSSRKCKVELSDL 211
NPC +F P+ S T++ V+CSS +C+ L D
Sbjct: 115 ---------------------NPCEDCYQQTSPLFDPKESSTYRKVSCSSSQCRA-LED- 151
Query: 212 FSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNG 271
C + C Y I+Y D S KG DT+T+ S R L N+ IGC N
Sbjct: 152 ---ASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHE--NT 206
Query: 272 VTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLS 331
TF+ GI+GLG + V + GKFSYCLV S ++S + FGT +
Sbjct: 207 GTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNGIVSGD 266
Query: 332 EMRRTELFLAAP--FYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPA 389
+ T + P +Y +N+ ISVG + ++ S ++ +G +IDSGTTLT L
Sbjct: 267 GVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFG-TGEGNIVIDSGTTLTLLPSNF 325
Query: 390 YEQLFEALKKSLTKVKRVPAGDFGGLDYCF-DAKGFDESSVPRLVFHFAGGVRFEPPVKS 448
Y +L E++ S K +RV D G L C+ D+ F VP + HF GG + +
Sbjct: 326 YYEL-ESVVASTIKAERVQDPD-GILSLCYRDSSSF---KVPDITVHFKGGDVKLGNLNT 380
Query: 449 YIIDVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
++ V+ V C A ++ GN+ Q N L +D TV F + C+
Sbjct: 381 FVA-VSEDVSCFAFAA--NEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429
>AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3400671-3402165 REVERSE LENGTH=464
Length = 464
Score = 134 bits (336), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 134/470 (28%), Positives = 192/470 (40%), Gaps = 80/470 (17%)
Query: 41 SMELVHRHDARRFA---GEVDQVEAIKGFILRDTLRRQSMNQRFGLRNSNNGSHRRKDSE 97
S+ +VH H A VD E I+ RD R +S+ + +NS N K +E
Sbjct: 64 SLRVVHMHGACSHLSSDARVDHDEIIR----RDQARVESIYSKLS-KNSANEVSEAKSTE 118
Query: 98 MVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXX 157
LP SG G G Y V + +GTP L DTGS+ TW TQ
Sbjct: 119 -----LPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTW----------TQC--- 160
Query: 158 XXXXXXXXXXXXXXXXXXXXXNNPCNGV--------FCPQRSRTFKTVTCSSRKCKVELS 209
PC G F P S T++ V+CSS C+ S
Sbjct: 161 ----------------------EPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAES 198
Query: 210 DLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIV 269
C + C+Y I Y D S +GF + T+ S+ L ++ GC +
Sbjct: 199 -------CSASN--CVYSIVYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGE--- 242
Query: 270 NGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKL 329
N + G+LGLG K + + Y FSYCL S N + +LTFG+ +
Sbjct: 243 NNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTS--NSTGHLTFGSAGISE 300
Query: 330 LSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPA 389
+ F +A YG++++GISVG + L I + + G IIDSGT T L
Sbjct: 301 SVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSF---STEGAIIDSGTVFTRLPTKV 357
Query: 390 YEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSY 449
Y +L K+ ++ K +G D C+D G D + P + F FAG E
Sbjct: 358 YAELRSVFKEKMSSYKSTSG--YGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGI 415
Query: 450 IIDVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
+ + C+ A N ++ GN+ Q +D+A VGFAP+ C
Sbjct: 416 SLPIKISQVCLA-FAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 133 bits (334), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 122/429 (28%), Positives = 181/429 (42%), Gaps = 82/429 (19%)
Query: 94 KDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQ 153
K + + P H G GE+ +++ +G P K+ DTGS+ W T Q
Sbjct: 89 KPDDTNNIKAPTHGGS----GEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQ 144
Query: 154 TXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFS 213
+F P++S ++ V CSS C
Sbjct: 145 PTP----------------------------IFDPEKSSSYSKVGCSSGLCNA-----LP 171
Query: 214 LTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVT 273
+ C + D C Y +Y D SS +G ++T T E N G + GC GV
Sbjct: 172 RSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISG----IGFGC------GVE 221
Query: 274 FNEDTG-----GILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGT---- 324
NE G G++GLG + + + KFSYCL + SS L G+
Sbjct: 222 -NEGDGFSQGSGLVGLGRGPLSLISQLKET---KFSYCLT-SIEDSEASSSLFIGSLASG 276
Query: 325 ----PKVKLLSEMRRTELFLAAP----FYGVNVVGISVGGQMLKIPSQVWDF--NAQGGT 374
L E+ +T L P FY + + GI+VG + L + ++ + GG
Sbjct: 277 IVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGM 336
Query: 375 IIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFG--GLDYCFDAKGFDES-SVPR 431
IIDSGTT+T L E F+ LK+ T +P D G GLD CF ++ +VP+
Sbjct: 337 IIDSGTTITYLE----ETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPK 392
Query: 432 LVFHFAGGVRFEPPVKSYII-DVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHN 490
++FHF G E P ++Y++ D + V C+ + + NG S+ GN+ QQN DL
Sbjct: 393 MIFHFKG-ADLELPGENYMVADSSTGVLCLAMGSSNG--MSIFGNVQQQNFNVLHDLEKE 449
Query: 491 TVGFAPSAC 499
TV F P+ C
Sbjct: 450 TVSFVPTEC 458
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 127 bits (318), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 135/288 (46%), Gaps = 16/288 (5%)
Query: 224 CLYDISYVDGSSAKGFFGSDTITVELS-NGRKGKLHN---LTIGCTKTIVNGVTFNEDTG 279
C Y Y D S+ G F +T TV L+ NG +L+N + GC N F+ G
Sbjct: 213 CPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAG 270
Query: 280 GILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELF 339
+ +F + YG FSYCLVD S NVSS L FG K L F
Sbjct: 271 LLGLGR-GPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSF 329
Query: 340 LAAP------FYGVNVVGISVGGQMLKIPSQVWDFNAQG--GTIIDSGTTLTNLALPAYE 391
+A FY V + I V G++L IP + W+ ++ G GTIIDSGTTL+ A PAYE
Sbjct: 330 VAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYE 389
Query: 392 QLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYII 451
+ + + K K DF LD CF+ G +P L FA G + P ++ I
Sbjct: 390 FIKNKIAEK-AKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFI 448
Query: 452 DVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
+ + C+ +L S+IGN QQN +D + +G+AP+ C
Sbjct: 449 WLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 496
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 119 bits (297), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 122/489 (24%), Positives = 203/489 (41%), Gaps = 94/489 (19%)
Query: 38 QGMSMELVHRHDARR--FAGEVDQVEAIKGFILRDTLRRQSMNQRFGLRNSNNGSHRRKD 95
+ S+EL+HR + ++ + + LR R + N +
Sbjct: 24 KNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQLS------------- 70
Query: 96 SEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTX 155
Q + SG GE+F+ + +GTP K + ADTGS+ TW
Sbjct: 71 ------QTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQC----------- 113
Query: 156 XXXXXXXXXXXXXXXXXXXXXXXNNPC------NG-VFCPQRSRTFKTVTCSSRKCKVEL 208
PC NG +F ++S T+K+ C SR C+
Sbjct: 114 ------------------------KPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALS 149
Query: 209 SDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTI 268
S + C + ++ C Y SY D S +KG ++T++++ ++G GC
Sbjct: 150 S---TERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYN- 205
Query: 269 VNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVK 328
NG TF+E GI+GLG + + + KFSYCL + N +S + GT +
Sbjct: 206 -NGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIP 264
Query: 329 LLSEMRRTELFLAAP--------FYGVNVVGISVGGQMLKIPSQVWDFN---------AQ 371
S + + ++ P +Y + + ISVG + KIP +N
Sbjct: 265 --SSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKK--KIPYTGSSYNPNDDGILSETS 320
Query: 372 GGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPR 431
G IIDSGTTLT L +++ A+++S+T KRV + G L +CF + G E +P
Sbjct: 321 GNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRV-SDPQGLLSHCFKS-GSAEIGLPE 378
Query: 432 LVFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNT 491
+ HF G P+ ++ + ++ + C+ ++ ++ GN Q + L +DL T
Sbjct: 379 ITVHFTGADVRLSPINAF-VKLSEDMVCLSMVPTT--EVAIYGNFAQMDFLVGYDLETRT 435
Query: 492 VGFAPSACN 500
V F C+
Sbjct: 436 VSFQHMDCS 444
>AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6978746-6980158 REVERSE LENGTH=470
Length = 470
Score = 117 bits (294), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 127/478 (26%), Positives = 195/478 (40%), Gaps = 68/478 (14%)
Query: 32 LEEEEVQGMSMELVHRHDARRFAGEV--DQVEAIKGFILRDTLRRQSMNQRFGLRNSNNG 89
+E ++ L+HR RF + + + RDT R ++ +R + +
Sbjct: 51 FSDESSSKYTLRLLHRD---RFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSS 107
Query: 90 SHRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTH 149
R E+ F + SG D G GEYFV++ VG+P + ++ D+GS+ W
Sbjct: 108 DSRY---EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKL 164
Query: 150 NKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKC-KVEL 208
Q+ + VF P +S ++ V+C S C ++E
Sbjct: 165 CYKQS----------------------------DPVFDPAKSGSYTGVSCGSSVCDRIEN 196
Query: 209 SDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTI 268
S S C Y++ Y DGS KG +T+T K + N+ +GC
Sbjct: 197 SGCHS--------GGCRYEVMYGDGSYTKGTLALETLTFA-----KTVVRNVAMGCGHR- 242
Query: 269 VNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVS-----SYLTFG 323
N F G + G + +FV + + Q GG F YCLV + S L G
Sbjct: 243 -NRGMFIGAAGLLGIGGGSM-SFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVG 300
Query: 324 TPKVKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFN--AQGGTIIDSGTT 381
V L+ R A FY V + G+ VGG + +P V+D GG ++D+GT
Sbjct: 301 ASWVPLVRNPR------APSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTA 354
Query: 382 LTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVR 441
+T L AY + K + R A D C+D GF VP + F+F G
Sbjct: 355 VTRLPTAAYVAFRDGFKSQTANLPR--ASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPV 412
Query: 442 FEPPVKSYIIDVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
P +++++ V A + G S+IGNI Q+ FD A+ VGF P+ C
Sbjct: 413 LTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 117 bits (292), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 113/421 (26%), Positives = 173/421 (41%), Gaps = 83/421 (19%)
Query: 102 QLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXX 161
+P+ SG +G Y V+ K+GTP Q ++ DT ++ W
Sbjct: 90 SVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWL------------------- 130
Query: 162 XXXXXXXXXXXXXXXXXNNPCNGVF-CPQRSRT--------FKTVTCSSRKCKVELSDLF 212
PC+G C S + + TV+CS+ +C + L
Sbjct: 131 -------------------PCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQC-TQARGLT 170
Query: 213 SLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGV 272
+ P+PS C ++ SY SS DT+T+ + N + GC +N
Sbjct: 171 CPSSSPQPSV-CSFNQSYGGDSSFSASLVQDTLTLA-----PDVIPNFSFGC----INSA 220
Query: 273 TFNE-DTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLT--FGTPK--- 326
+ N G++GLG + V + Y G FSYCL S S G PK
Sbjct: 221 SGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIR 280
Query: 327 -VKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPS--QVWDFNAQGGTIIDSGTTLT 383
LL RR L Y VN+ G+SVG + + +D N+ GTIIDSGT +T
Sbjct: 281 YTPLLRNPRRPSL------YYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVIT 334
Query: 384 NLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFE 443
A P YE + + +K +V G D CF A +E+ P++ H + +
Sbjct: 335 RFAQPVYEAIRDEFRK---QVNVSSFSTLGAFDTCFSAD--NENVAPKITLHMT-SLDLK 388
Query: 444 PPVKSYII-DVAPQVKCIGVLAINGPGAS---VIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
P+++ +I A + C+ + I + VI N+ QQN FD+ ++ +G AP C
Sbjct: 389 LPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
Query: 500 N 500
N
Sbjct: 449 N 449
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 115 bits (289), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 112/424 (26%), Positives = 171/424 (40%), Gaps = 100/424 (23%)
Query: 103 LPMHSGRDYGLGE-YFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXX 161
+P+ SGR Y V+ +GTP Q +A DT ++ W
Sbjct: 74 VPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWI------------------- 114
Query: 162 XXXXXXXXXXXXXXXXXNNPCNG--------VFCPQRSRTFKTVTCSSRKCKVELSDLFS 213
PC+G +F P +S + +T+ C + +CK
Sbjct: 115 -------------------PCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCK-------- 147
Query: 214 LTYCPKPS----DPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIV 269
P PS C ++++Y GS+ + + DT+T+ + N T GC
Sbjct: 148 --QAPNPSCTVSKSCGFNMTY-GGSTIEAYLTQDTLTLA-----SDVIPNYTFGCINK-A 198
Query: 270 NGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPK--- 326
+G + G++GLG + + ++ Y FSYCL + S N S L G PK
Sbjct: 199 SGTSLPAQ--GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKS-SNFSGSLRLG-PKNQP 254
Query: 327 -----VKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQ--GGTIIDSG 379
LL RR+ L Y VN+VGI VG +++ IP+ F+ GTI DSG
Sbjct: 255 IRIKTTPLLKNPRRSSL------YYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308
Query: 380 TTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGG 439
T T L PAY + ++ +VK A GG D C+ P + F FAG
Sbjct: 309 TVYTRLVEPAYVAVRNEFRR---RVKNANATSLGGFDTCYSGSVV----FPSVTFMFAGM 361
Query: 440 VRFEPPVKSYIIDVAPQVKCIGVLA----INGPGASVIGNIMQQNHLWEFDLAHNTVGFA 495
PP I A + C+ + A +N +VI ++ QQNH D+ ++ +G +
Sbjct: 362 NVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNS-VLNVIASMQQQNHRVLIDVPNSRLGIS 420
Query: 496 PSAC 499
C
Sbjct: 421 RETC 424
>AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:11259872-11261209 REVERSE LENGTH=445
Length = 445
Score = 113 bits (282), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 123/450 (27%), Positives = 191/450 (42%), Gaps = 55/450 (12%)
Query: 66 FILRDTLRRQSMNQRFGLRNSNNGSHRRKDSEMVQF--QLPMHSGRDYGLGEYFVQVKVG 123
I RD+ N + + N + R S +F + + SG GEYF+ + +G
Sbjct: 33 LIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRRFTTKTDLQSGLISNGGEYFMSISIG 92
Query: 124 TPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCN 183
TP K + ADTGS+ TW Q N+P
Sbjct: 93 TPPSKVFAIADTGSDLTWVQCKPCQQCYKQ-------------------------NSP-- 125
Query: 184 GVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSD 243
+F ++S T+KT +C S+ C+ LS+ C + D C Y SY D S KG ++
Sbjct: 126 -LFDKKKSSTYKTESCDSKTCQA-LSE--HEEGCDESKDICKYRYSYGDNSFTKGDVATE 181
Query: 244 TITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKF 303
TI+++ S+G GC NG TF E GI+GLG + V + G KF
Sbjct: 182 TISIDSSSGSSVSFPGTVFGCGYN--NGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKF 239
Query: 304 SYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAP--------FYGVNVVGISVG 355
SYCL + N +S + GT + S + L P +Y + + ++VG
Sbjct: 240 SYCLSHTAATTNGTSVINLGTNSIP--SNPSKDSATLTTPLIQKDPETYYFLTLEAVTVG 297
Query: 356 GQMLKIPSQVWDFNAQ-----GGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAG 410
L + N + G IIDSGTTLT L Y+ A+++S+T KRV +
Sbjct: 298 KTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRV-SD 356
Query: 411 DFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAINGPGA 470
G L +CF + G E +P + HF P+ ++ + + C+ ++
Sbjct: 357 PQGLLTHCFKS-GDKEIGLPAITMHFTNADVKLSPINAF-VKLNEDTVCLSMIPTT--EV 412
Query: 471 SVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
++ GN++Q + L +DL TV F C+
Sbjct: 413 AIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24647221-24648513 FORWARD LENGTH=430
Length = 430
Score = 112 bits (280), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 120/444 (27%), Positives = 189/444 (42%), Gaps = 79/444 (17%)
Query: 85 NSNNGSHRRKDSEMVQFQLPMHSGRDYGLGEYF-------VQVKVGTPGQKFWLAADTGS 137
++ SHR S ++ + P S Y F + + +GTP Q + DTGS
Sbjct: 35 STTTNSHRFTTS-LLSRKNPSPSSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGS 93
Query: 138 EFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTV 197
+ +W + H K F P S +F T+
Sbjct: 94 QLSWI----QCHRKKLPPKPKTS-------------------------FDPSLSSSFSTL 124
Query: 198 TCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKL 257
CS CK + D T C + C Y Y DG+ A+G + IT SN
Sbjct: 125 PCSHPLCKPRIPDFTLPTSC-DSNRLCHYSYFYADGTFAEGNLVKEKIT--FSNTEITP- 180
Query: 258 HNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVS 317
L +GC T + D GILG+ + +FV +A + KFSYC+ + +
Sbjct: 181 -PLILGC-------ATESSDDRGILGMNRGRLSFVSQAKIS---KFSYCIPPKSNRPGFT 229
Query: 318 SYLTF---------GTPKVKLLSEMRRTELFLAAPF-YGVNVVGISVGGQMLKIPSQVW- 366
+F G V LL+ + P Y V ++GI G + L I V+
Sbjct: 230 PTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFR 289
Query: 367 -DFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKV-KRVPAG-DFGGL-DYCFDAK 422
D G T++DSG+ T+L AY+++ + +T+V +R+ G +GG D CFD
Sbjct: 290 PDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEI---MTRVGRRLKKGYVYGGTADMCFDG- 345
Query: 423 GFDESSVPRL----VFHFAGGVRFEPPVKSYIIDVAPQVKCIGV--LAINGPGASVIGNI 476
+ + +PRL VF F GV P + +++V + C+G+ ++ G +++IGN+
Sbjct: 346 --NVAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGAASNIIGNV 403
Query: 477 MQQNHLWEFDLAHNTVGFAPSACN 500
QQN EFD+ + VGFA + C+
Sbjct: 404 HQQNLWVEFDVTNRRVGFAKADCS 427
>AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14912862-14914190 FORWARD LENGTH=442
Length = 442
Score = 109 bits (273), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 108/400 (27%), Positives = 171/400 (42%), Gaps = 61/400 (15%)
Query: 118 VQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXX 177
+ + +GTP Q L DTGS+ +W + H K
Sbjct: 82 LSLPIGTPSQSQELVLDTGSQLSWI----QCHPKKIKKPLPPPTTS-------------- 123
Query: 178 XNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAK 237
F P S +F + CS CK + D T C + C Y Y DG+ A+
Sbjct: 124 --------FDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSC-DSNRLCHYSYFYADGTFAE 174
Query: 238 GFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAAL 297
G + T SN + L +GC K + D GILG+ + +F+ +A +
Sbjct: 175 GNLVKEKFT--FSNSQTTP--PLILGCAKE-------STDEKGILGMNLGRLSFISQAKI 223
Query: 298 QYGGKFSYCLVDHLSHQNVSSYLTF---------GTPKVKLLSEMRRTELFLAAPF-YGV 347
KFSYC+ + ++S +F G V LL+ + + P Y V
Sbjct: 224 S---KFSYCIPTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTV 280
Query: 348 NVVGISVGGQMLKIPSQVW--DFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVK 405
+ GI +G + L IP V+ D G T++DSG+ T+L AY+++ E + + +
Sbjct: 281 PLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGS-- 338
Query: 406 RVPAGDFGG--LDYCFDAKGFDESS--VPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCIG 461
R+ G G D CFD E + LVF F GV +S +++V + C+G
Sbjct: 339 RLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVG 398
Query: 462 V--LAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
+ ++ G +++IGN+ QQN EFD+ + VGF+ + C
Sbjct: 399 IGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAEC 438
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 104 bits (260), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 167/400 (41%), Gaps = 48/400 (12%)
Query: 113 LGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXX 172
+G Y+ ++++GTP + F++ DTGS+ W S + QT
Sbjct: 78 VGLYYTKLRLGTPPRDFYVQVDTGSDVLWV-SCASCNGCPQTSGLQIQL----------- 125
Query: 173 XXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCK--VELSDLFSLTYCPKPSDPCLYDISY 230
F P S T ++CS ++C ++ SD + C ++ C Y Y
Sbjct: 126 -----------NFFDPGSSVTASPISCSDQRCSWGIQSSD----SGCSVQNNLCAYTFQY 170
Query: 231 VDGSSAKGFFGSDTITVELSNGRK---GKLHNLTIGC-TKTIVNGVTFNEDTGGILGLGY 286
DGS GF+ SD + ++ G + GC T + V + GI G G
Sbjct: 171 GDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQ 230
Query: 287 AKDAFVDKAALQYGGK--FSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAPF 344
+ + + A Q FS+CL + L G + M T L + P
Sbjct: 231 QGMSVISQLASQGIAPRVFSHCLKGENGGGGI---LVLGE---IVEPNMVFTPLVPSQPH 284
Query: 345 YGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKV 404
Y VN++ ISV GQ L I V+ + GTIID+GTTL L+ AY EA+ ++++
Sbjct: 285 YNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQS 344
Query: 405 KRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYIIDV----APQVKCI 460
R P G Y D P + +FAGG + Y+I V CI
Sbjct: 345 VR-PVVSKGNQCYVITTSVGD--IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCI 401
Query: 461 GVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
G I G +++G+++ ++ ++ +DL +G+A C+
Sbjct: 402 GFQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 103 bits (257), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/462 (23%), Positives = 183/462 (39%), Gaps = 74/462 (16%)
Query: 51 RRFAGEVDQVEAIKGFILRDTLRRQSMNQRFGLRNSNNGSHRRKDSEMVQFQLPMHSGRD 110
+FAG+ Q+ +K D+ R M L G R DS
Sbjct: 35 HKFAGKEKQLSELKS---HDSFRHARMLANIDLPL---GGDSRADS-------------- 74
Query: 111 YGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXX 170
+G YF ++K+G+P +++++ DTGS+ W N +T
Sbjct: 75 --IGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSST 132
Query: 171 XXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISY 230
N C FC S ++ TC ++K PC Y + Y
Sbjct: 133 SK------NVGCEDDFC---SFIMQSETCGAKK-------------------PCSYHVVY 164
Query: 231 VDGSSAKGFFGSDTITVELSNG--RKGKL-HNLTIGCTKTIVNGVTFNEDTG--GILGLG 285
DGS++ G F D IT+E G R L + GC K +G D+ GI+G G
Sbjct: 165 GDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKN-QSGQLGQTDSAVDGIMGFG 223
Query: 286 YAKDAFVDKAALQYGGK--FSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAP 343
+ + + + A K FS+CL D+++ + + +P VK T +
Sbjct: 224 QSNTSIISQLAAGGSTKRIFSHCL-DNMNGGGIFAVGEVESPVVK------TTPIVPNQV 276
Query: 344 FYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTK 403
Y V + G+ V G + +P + N GGTIIDSGTTL L + L+ +L + +T
Sbjct: 277 HYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLP----QNLYNSLIEKITA 332
Query: 404 VKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCI--- 460
++V CF + + P + HF ++ Y+ + + C
Sbjct: 333 KQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQ 392
Query: 461 --GVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
G+ +G ++G+++ N L +DL + +G+A C+
Sbjct: 393 SGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 434
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 103 bits (256), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 114/416 (27%), Positives = 163/416 (39%), Gaps = 79/416 (18%)
Query: 103 LPMHSGRD-YGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXX 161
+P+ SGR Y V+ +GTP Q LA DT S+ W
Sbjct: 101 VPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWI------------------- 141
Query: 162 XXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPS 221
P N F P +S +FK V+CS+ +CK P P+
Sbjct: 142 -----------PCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCK----------QVPNPT 180
Query: 222 ---DPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDT 278
C ++++Y SS DTI + + T GC + G T
Sbjct: 181 CGARACSFNLTY-GSSSIAANLSQDTIRLA-----ADPIKAFTFGCVNKVAGGGTI-PPP 233
Query: 279 GGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGT---PK----VKLLS 331
G+LGLG + + +A Y FSYCL S S L G P+ +LL
Sbjct: 234 QGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSL-TFSGSLRLGPTSQPQRVKYTQLLR 292
Query: 332 EMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQ--GGTIIDSGTTLTNLALPA 389
RR+ L Y VN+V I VG +++ +P FN GTI DSGT T LA P
Sbjct: 293 NPRRSSL------YYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPV 346
Query: 390 YEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSY 449
YE + +K + V GG D C+ + VP + F F GV P +
Sbjct: 347 YEAVRNEFRKRVKPTTAV-VTSLGGFDTCYSG----QVKVPTITFMFK-GVNMTMPADNL 400
Query: 450 II-DVAPQVKCIGVLA----INGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
++ A C+ + A +N +VI ++ QQNH D+ + +G A C+
Sbjct: 401 MLHSTAGSTSCLAMAAAPENVNS-VVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 102 bits (255), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 114/407 (28%), Positives = 167/407 (41%), Gaps = 76/407 (18%)
Query: 118 VQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXX 177
V + VG P Q + DTGSE +W + K+ N
Sbjct: 67 VTLAVGDPPQNISMVLDTGSELSWLH-CKKSPN--------------------------- 98
Query: 178 XNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAK 237
VF P S T+ V CSS C+ DL C + C ISY D +S +
Sbjct: 99 ----LGSVFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIE 154
Query: 238 GFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNED----TGGILGLGYAKDAFVD 293
G +T + S R G L GC + G++ N + + G++G+ +FV+
Sbjct: 155 GNLAHETFVIG-SVTRPGTL----FGCMDS---GLSSNSEEDAKSTGLMGMNRGSLSFVN 206
Query: 294 KAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAA---PF-----Y 345
+ KFSYC +S + S +L G L ++ T L L + P+ Y
Sbjct: 207 QLGFS---KFSYC----ISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAY 259
Query: 346 GVNVVGISVGGQMLKIPSQVW--DFNAQGGTIIDSGTTLTNLALPAYEQL-FEALKKSLT 402
V + GI VG ++L +P V+ D G T++DSGT T L P Y L E + ++ +
Sbjct: 260 TVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKS 319
Query: 403 KVKRVPAGDF---GGLDYCFDAKGFDE---SSVPRLVFHFAG------GVRFEPPVKSYI 450
++ V DF G +D C+ S +P + F G G + V
Sbjct: 320 VLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAG 379
Query: 451 IDVAPQVKC--IGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFA 495
+ +V C G + G A VIG+ QQN EFDLA + VGFA
Sbjct: 380 SEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFA 426
>AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:12033953-12037527 FORWARD LENGTH=756
Length = 756
Score = 101 bits (252), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/441 (23%), Positives = 173/441 (39%), Gaps = 65/441 (14%)
Query: 66 FILRDTLRRQSMNQRFGLRNSNNGSHRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTP 125
F L + FG R NN S ++ ++ Y Y ++++VGTP
Sbjct: 371 FCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLLQGASPYADTLYDYSIYLMKLQVGTP 430
Query: 126 GQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGV 185
+ DTGS+ W + + +Q +
Sbjct: 431 PFEIVAEIDTGSDIIWTQCMPCPNCYSQFAP----------------------------I 462
Query: 186 FCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTI 245
F P +S TF+ C+ C Y+I Y D + +KG ++T+
Sbjct: 463 FDPSKSSTFREQRCNGNSCH--------------------YEIIYADKTYSKGILATETV 502
Query: 246 TVELSNGRKGKLHNLTIGC--TKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKF 303
T+ ++G + IGC T + F + GI+GL + + + L Y G
Sbjct: 503 TIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLI 562
Query: 304 SYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAA--PFYGVNVVGISVGGQMLKI 361
SYC S Q S + FGT + ++F+ PFY +N+ +SV + I
Sbjct: 563 SYC----FSGQGTSK-INFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNL--I 615
Query: 362 PSQVWDFNAQGGTI-IDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFD 420
+ F+A+ G I IDSGTTLT + + EA+++ +T VK G L C+
Sbjct: 616 ATLGTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLL--CYY 673
Query: 421 AKGFDESSVPRLVFHFAGGVRFE-PPVKSYIIDVAPQVKCIGVLAINGPGASVIGNIMQQ 479
+ D P + HF+GG Y+ + + C+ + + +V GN Q
Sbjct: 674 SDTID--IFPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQN 731
Query: 480 NHLWEFDLAHNTVGFAPSACN 500
N L +D + N + F+P+ C+
Sbjct: 732 NFLVGYDPSSNVISFSPTNCS 752
Score = 90.1 bits (222), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/410 (23%), Positives = 169/410 (41%), Gaps = 69/410 (16%)
Query: 84 RNSNNGSHRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFN 143
R SN+ S R +++ + DY + Y ++++VGTP + DTGS+ W
Sbjct: 52 RRSNSSSFRLSKNQLQGASPYADTLFDYNI--YLMKLQVGTPPFEIAAEIDTGSDLIWTQ 109
Query: 144 SVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRK 203
+ +Q + +F P +S TF C +
Sbjct: 110 CMPCPDCYSQF----------------------------DPIFDPSKSSTFNEQRCHGKS 141
Query: 204 CKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIG 263
C Y+I Y D + +KG ++T+T+ ++G + TIG
Sbjct: 142 CH--------------------YEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIG 181
Query: 264 C--TKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLT 321
C T ++ F + GI+GL + + + L Y G SYC S Q S +
Sbjct: 182 CGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYC----FSGQGTSK-IN 236
Query: 322 FGTPKVKLLSEMRRTELFLAA--PFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTI-IDS 378
FGT + ++F+ PFY +N+ +SV +I + F+A+ G I IDS
Sbjct: 237 FGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDN--RIETLGTPFHAEDGNIVIDS 294
Query: 379 GTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLD-YCFDAKGFDESSVPRLVFHFA 437
G+T+T + + +A+++ +T V RVP D G D C+ ++ D P + HF+
Sbjct: 295 GSTVTYFPVSYCNLVRKAVEQVVTAV-RVP--DPSGNDMLCYFSETID--IFPVITMHFS 349
Query: 438 GGVRFE-PPVKSYIIDVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFD 486
GG Y+ + + C+ ++ + ++ GN Q N L +D
Sbjct: 350 GGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYD 399
>AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14685602-14686885 FORWARD LENGTH=427
Length = 427
Score = 100 bits (250), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 164/397 (41%), Gaps = 72/397 (18%)
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXX 175
+ V + +G+P L DT S+ W + + Q+
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLP------------------- 125
Query: 176 XXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSS 235
+F P RS T + TC + + + SL + + C Y + YVD +
Sbjct: 126 ---------IFDPSRSYTHRNETCRTSQYSMP-----SLKFNAN-TRSCEYSMRYVDDTG 170
Query: 236 AKGFFGSDTITVE--LSNGRKGKLHNLTIGCTKTIVNGVTFNEDT--GGILGLGYAKDAF 291
+KG + + LH++ GC + E GILGLGY + +
Sbjct: 171 SKGILAREMLLFNTIYDESSSAALHDVVFGCGHD-----NYGEPLVGTGILGLGYGEFSL 225
Query: 292 VDKAALQYGGKFSYCL--VDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAPFYGVNV 349
V + +G KFSYC +D S+ + + L G +L + T L + FY V +
Sbjct: 226 VHR----FGKKFSYCFGSLDDPSYPH--NVLVLGDDGANILGD--TTPLEIHNGFYYVTI 277
Query: 350 VGISVGGQMLKIPSQVWDFNAQ---GGTIIDSGTTLTNLALPAYEQLFEALKKSLTKV-- 404
ISV G +L I +V++ N Q GGTIID+G +LT+L E+ ++ LK + +
Sbjct: 278 EAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLV----EEAYKPLKNRIEDIFE 333
Query: 405 KRVPAGDFGGLDY----CFDA---KGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQV 457
R A D D C++ + ES P + FHF+ G VKS + ++P V
Sbjct: 334 GRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNV 393
Query: 458 KCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGF 494
C LA+ + IG QQ++ +DL V F
Sbjct: 394 FC---LAVTPGNLNSIGATAQQSYNIGYDLEAMEVSF 427
>AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11934208-11935386 REVERSE LENGTH=392
Length = 392
Score = 100 bits (250), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/396 (25%), Positives = 158/396 (39%), Gaps = 69/396 (17%)
Query: 110 DYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXX 169
DY + Y ++++VGTP + DTGS+ W + T+ +Q
Sbjct: 57 DYNI--YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAP------------- 101
Query: 170 XXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDIS 229
+F P S TFK C+ C Y I
Sbjct: 102 ---------------IFDPSNSSTFKEKRCNGNSCH--------------------YKII 126
Query: 230 YVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKD 289
Y D + +KG ++T+T+ ++G + TIGC N F G++GL +
Sbjct: 127 YADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGH---NSSWFKPTFSGMVGLSWGPS 183
Query: 290 AFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFL--AAP-FYG 346
+ + + +Y G SYC +S + FGT + + T +FL A P Y
Sbjct: 184 SLITQMGGEYPGLMSYCFASQ-----GTSKINFGTNAIVAGDGVVSTTMFLTTAKPGLYY 238
Query: 347 VNVVGISVGGQMLKIPSQVWDFNA-QGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVK 405
+N+ +SVG ++ F+A +G IIDSGTTLT + + EA+ +T V+
Sbjct: 239 LNLDAVSVGDTHVETMGTT--FHALEGNIIIDSGTTLTYFPVSYCNLVREAVDHYVTAVR 296
Query: 406 RVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFE-PPVKSYIIDVAPQVKCIGVLA 464
A G C+ D P + HF+GG YI + C+ ++
Sbjct: 297 T--ADPTGNDMLCYYTDTID--IFPVITMHFSGGADLVLDKYNMYIETITRGTFCLAIIC 352
Query: 465 INGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
N P ++ GN Q N L +D + V F+P+ C+
Sbjct: 353 NNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11930579-11931769 REVERSE LENGTH=396
Length = 396
Score = 100 bits (248), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 161/391 (41%), Gaps = 69/391 (17%)
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXX 175
Y ++++VGTP + DTGSE TW + H Q
Sbjct: 65 YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQ---------------------- 102
Query: 176 XXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSS 235
N P +F P +S TFK C C Y++ Y D +
Sbjct: 103 ---NAP---IFDPSKSSTFKEKRCDGHSCP--------------------YEVDYFDHTY 136
Query: 236 AKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKA 295
G ++TIT+ ++G + IGC N F G++GL + + + +
Sbjct: 137 TMGTLATETITLHSTSGEPFVMPETIIGCGH---NNSWFKPSFSGMVGLNWGPSSLITQM 193
Query: 296 ALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFL--AAP-FYGVNVVGI 352
+Y G SYC S Q S + FG + + T +F+ A P FY +N+ +
Sbjct: 194 GGEYPGLMSYC----FSGQGTSK-INFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAV 248
Query: 353 SVGGQMLKIPSQVWDFNA-QGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGD 411
SVG +I + F+A +G +IDSGTTLT + + +A++ +T V+ A D
Sbjct: 249 SVGNT--RIETMGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVR---AAD 303
Query: 412 FGGLD-YCFDAKGFDESSVPRLVFHFAGGVRFE-PPVKSYIIDVAPQVKCIGVLAINGPG 469
G D C+++ D P + HF+GGV Y+ V C+ ++ +
Sbjct: 304 PTGNDMLCYNSDTID--IFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQ 361
Query: 470 ASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
++ GN Q N L +D + V F+P+ C+
Sbjct: 362 EAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392
>AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=350
Length = 350
Score = 97.4 bits (241), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 80/136 (58%), Gaps = 9/136 (6%)
Query: 369 NAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDF--GGLDYCFDAKGF-- 424
+ GGT++DSGTTL LA PAY + A+++ + ++P D G D C + G
Sbjct: 216 SGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRV----KLPIADALTPGFDLCVNVSGVTK 271
Query: 425 DESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAINGP-GASVIGNIMQQNHLW 483
E +PRL F F+GG F PP ++Y I+ Q++C+ + +++ G SVIGN+MQQ L+
Sbjct: 272 PEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLF 331
Query: 484 EFDLAHNTVGFAPSAC 499
EFD + +GF+ C
Sbjct: 332 EFDRDRSRLGFSRRGC 347
>AT3G12700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4038387 FORWARD LENGTH=263
Length = 263
Score = 97.4 bits (241), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 94/198 (47%), Gaps = 41/198 (20%)
Query: 69 RDTLRRQSMNQRFGLRNSNNGSHR---RKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTP 125
RDTL + +++ + ++ H RK + V ++ + SG DYG +YF +++VGTP
Sbjct: 56 RDTLLPKPLSRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTP 115
Query: 126 GQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGV 185
+KF + DTGSE TW N ++ K V
Sbjct: 116 AKKFRVVVDTGSELTWVNCRYRARGKDN-----------------------------RRV 146
Query: 186 FCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTI 245
F S++FKTV C ++ CKV+L +LFSLT CP PS PC YD + FFG I
Sbjct: 147 FRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDY--------REFFGVAWI 198
Query: 246 TVELSNGRKGKLHNLTIG 263
+ R+G++ + +G
Sbjct: 199 RCKCI-AREGEIKYMQMG 215
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 96.7 bits (239), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 119/487 (24%), Positives = 189/487 (38%), Gaps = 93/487 (19%)
Query: 38 QGMSMELVHRHDARRFAGEVDQVEAIKGFILRDTLRRQSMNQRFGLRNSNNGSHRRKDSE 97
+ ++E+ HR +D + ++ ++ D +R QS+ + S+ +
Sbjct: 64 ESTTLEMKHRELCS--GKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSST-----TEQS 116
Query: 98 MVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXX 157
+ + Q+P+ SG Y V V++G G+ L DTGS+ TW
Sbjct: 117 VSETQIPLTSGIKLESLNYIVTVELG--GKNMSLIVDTGSDLTWVQC------------- 161
Query: 158 XXXXXXXXXXXXXXXXXXXXXNNPCNG-------VFCPQRSRTFKTVTCSSRKCKVELSD 210
PC ++ P S ++KTV C+S C+ D
Sbjct: 162 ----------------------QPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQ----D 195
Query: 211 LFSLTYCPKP--------SDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTI 262
L + T P PC Y +SY DGS +G S++I + KL N
Sbjct: 196 LVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL-----GDTKLENFVF 250
Query: 263 GCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVD-------HLSHQN 315
GC + N + G++GLG + + V + + G FSYCL LS N
Sbjct: 251 GCGR---NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGN 307
Query: 316 VSSYLTFGTPKVKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTI 375
SS T T V ++ +L FY +N+ G S+GG LK S F G +
Sbjct: 308 DSSVYTNST-SVSYTPLVQNPQL---RSFYILNLTGASIGGVELKSSS----FGR--GIL 357
Query: 376 IDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFH 435
IDSGT +T L Y+ + K + P + LD CF+ +++ S+P +
Sbjct: 358 IDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPG--YSILDTCFNLTSYEDISIPIIKMI 415
Query: 436 FAGGVRFEPPVKSYIIDVAPQVK--CIGVLAINGPG-ASVIGNIMQQNHLWEFDLAHNTV 492
F G E V V P C+ + +++ +IGN Q+N +D +
Sbjct: 416 FQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERL 475
Query: 493 GFAPSAC 499
G C
Sbjct: 476 GIVGENC 482
>AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:10185229-10186605 REVERSE LENGTH=458
Length = 458
Score = 95.5 bits (236), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 120/493 (24%), Positives = 179/493 (36%), Gaps = 88/493 (17%)
Query: 24 VVVHGFNDLEEEEVQGMSMELVHRHDARRFAGE----VDQVEAIKGFILRDTLR----RQ 75
+ V F E + M+M+L+HR R + + IK + R +
Sbjct: 13 ITVSYFVVTESIKPNRMAMKLIHRESVARLNPNARVPITPEDHIKHLTDISSARFKYLQN 72
Query: 76 SMNQRFGLRNSNNGSHRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADT 135
S+++ G N FQ+ + L + V VG P DT
Sbjct: 73 SIDKELGSSN---------------FQVDVEQAIKTSL--FLVNFSVGQPPVPQLTIMDT 115
Query: 136 GSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFK 195
GS W H + ++ + VF P S TF
Sbjct: 116 GSSLLWIQCQPCKHCSS--------------------------DHMIHPVFNPALSSTFV 149
Query: 196 TVTCSSRKCKVELSDLFSLTYCPK----PSDPCLYDISYVDGSSAKGFFGSDTITVELSN 251
+C R C+ Y P S+ C+Y+ Y+ G+ +KG + +T N
Sbjct: 150 ECSCDDRFCR----------YAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPN 199
Query: 252 GRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHL 311
G + GC NG GILGLG + A+Q G KFSYC+ D
Sbjct: 200 GNTVVTQPIAFGCGYE--NGEQLESHFTGILGLGAKPTSL----AVQLGSKFSYCIGDLA 253
Query: 312 SHQNVSSYLTFGTPKVKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNA- 370
+ + L G +L + E Y +N+ GISVG L I V+
Sbjct: 254 NKNYGYNQLVLGE-DADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGP 312
Query: 371 QGGTIIDSGTTLTNLALPAYEQLFEALKKSL-TKVKRVPAGDFGGLDYCFDAKGFDE-SS 428
+ G I+DSGT T LA AY +L+ +K L K++R DF C+ + +E
Sbjct: 313 RTGVILDSGTLYTWLADIAYRELYNEIKSILDPKLERFWFRDF----LCYHGRVSEELIG 368
Query: 429 VPRLVFHFAGGVRFEPPVKSYIIDVAP----QVKCIGVLAINGPGA-----SVIGNIMQQ 479
P + FHFAGG S ++ V C+ V G + IG + QQ
Sbjct: 369 FPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQ 428
Query: 480 NHLWEFDLAHNTV 492
+ +DL +
Sbjct: 429 YYNIGYDLKEKNI 441
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 94.7 bits (234), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 108/404 (26%), Positives = 155/404 (38%), Gaps = 70/404 (17%)
Query: 125 PGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNG 184
P Q + DTGSE +W +++ N NP N
Sbjct: 82 PPQNISMVIDTGSELSWLR-CNRSSNP----------------------------NPVNN 112
Query: 185 VFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDT 244
F P RS ++ + CSS C+ D C C +SY D SS++G ++
Sbjct: 113 -FDPTRSSSYSPIPCSSPTCRTRTRDFLIPASC-DSDKLCHATLSYADASSSEGNLAAEI 170
Query: 245 ITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDT--GGILGLGYAKDAFVDKAALQYGGK 302
G NL GC + V+G EDT G+LG+ +F+ + K
Sbjct: 171 FHF----GNSTNDSNLIFGCMGS-VSGSDPEEDTKTTGLLGMNRGSLSFISQMGFP---K 222
Query: 303 FSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELF-LAAPF-------YGVNVVGISV 354
FSYC+ + +L G L+ + T L ++ P Y V + GI V
Sbjct: 223 FSYCIS---GTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKV 279
Query: 355 GGQMLKIPSQVW--DFNAQGGTIIDSGTTLTNLALPAYEQL---FEALKKSLTKVKRVPA 409
G++L IP V D G T++DSGT T L P Y L F + V P
Sbjct: 280 NGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPD 339
Query: 410 GDF-GGLDYCFDAKGFDESS-----VPRLVFHFAG---GVRFEPPVK--SYIIDVAPQVK 458
F G +D C+ S +P + F G V +P + ++ V
Sbjct: 340 FVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVY 399
Query: 459 C--IGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
C G + G A VIG+ QQN EFDL + +G AP C+
Sbjct: 400 CFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECD 443
>AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11936203-11937390 REVERSE LENGTH=395
Length = 395
Score = 92.4 bits (228), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 101/424 (23%), Positives = 173/424 (40%), Gaps = 76/424 (17%)
Query: 84 RNSNNGSHRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFN 143
R SN S R ++++ ++ + EY +++++GTP + DTGSE W
Sbjct: 37 RRSNASSSRVFNTQLGS----PYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQ 92
Query: 144 SVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRK 203
+ H QT +F P +S TFK + C +
Sbjct: 93 CLPCVHCYNQTAP----------------------------IFDPSKSSTFKEIRCDTHD 124
Query: 204 CKVELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIG 263
C Y++ Y S KG ++T+T+ ++G+ + IG
Sbjct: 125 ------------------HSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIG 166
Query: 264 CTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFG 323
C + N F G++GL + + + +Y G SYC +S + FG
Sbjct: 167 CGR---NNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG-----KGTSKINFG 218
Query: 324 TPKVKLLSEMRRTELFL--AAP-FYGVNVVGISVGGQMLKIPSQVWDFNA-QGGTIIDSG 379
+ + T +F+ A P FY +N+ +SVG +I + F+A +G +IDSG
Sbjct: 219 ANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNT--RIETVGTPFHALKGNIVIDSG 276
Query: 380 TTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGG 439
+TLT + +A+++ +T V R P D C+ +K D P + HF+GG
Sbjct: 277 STLTYFPESYCNLVRKAVEQVVTAV-RFPRSDI----LCYYSKTID--IFPVITMHFSGG 329
Query: 440 VRFEPPVKSYIIDVAPQ---VKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAP 496
+ Y + VA V C+ ++ + ++ GN Q N L +D + V F P
Sbjct: 330 ADLV--LDKYNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKP 387
Query: 497 SACN 500
+ C+
Sbjct: 388 TNCS 391
>AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24230963-24233349 REVERSE LENGTH=475
Length = 475
Score = 92.4 bits (228), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 100/422 (23%), Positives = 166/422 (39%), Gaps = 56/422 (13%)
Query: 92 RRKDSEMVQFQLPMH-SGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHN 150
RR + LP+ R +G YF ++K+G+P +++ + DTGS+ W N
Sbjct: 49 RRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKC 108
Query: 151 KTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSD 210
T+T +F S T K V C C
Sbjct: 109 PTKTNLNFRL-----------------------SLFDMNASSTSKKVGCDDDFCS----- 140
Query: 211 LFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNG--RKGKL-HNLTIGCTKT 267
S + +P+ C Y I Y D S++ G F D +T+E G + G L + GC
Sbjct: 141 FISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSD 200
Query: 268 IVNGVTFNEDTG--GILGLGYAKDAFVDKAALQYGGK--FSYCLVDHLSHQNVSSYLTFG 323
+G N D+ G++G G + + + + A K FS+CL D++ + +
Sbjct: 201 -QSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DNVKGGGIFAVGVVD 258
Query: 324 TPKVKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLT 383
+PKVK T + Y V ++G+ V G L +P + GGTI+DSGTTL
Sbjct: 259 SPKVK------TTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV---RNGGTIVDSGTTLA 309
Query: 384 NLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFE 443
Y+ L E + + F CF + + P + F F V+
Sbjct: 310 YFPKVLYDSLIETILARQPVKLHIVEETF----QCFSFSTNVDEAFPPVSFEFEDSVKLT 365
Query: 444 PPVKSYIIDVAPQVKCI-----GVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSA 498
Y+ + ++ C G+ ++G+++ N L +DL + +G+A
Sbjct: 366 VYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHN 425
Query: 499 CN 500
C+
Sbjct: 426 CS 427
>AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:1762843-1766150 REVERSE LENGTH=485
Length = 485
Score = 91.7 bits (226), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/424 (24%), Positives = 171/424 (40%), Gaps = 54/424 (12%)
Query: 92 RRKDSEMVQFQLPMH-SGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHN 150
RR+ + + LP+ +GR G Y+ ++ +GTP + +++ DTGS+ W N +
Sbjct: 55 RRQLTILAGIDLPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQC 114
Query: 151 KTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSD 210
++ ++ S + K V+C C ++S
Sbjct: 115 PRRSTLGIELT-----------------------LYNIDESDSGKLVSCDDDFC-YQISG 150
Query: 211 LFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVE-LSNGRKGKLHN--LTIGCTKT 267
L+ C K + C Y Y DGSS G+F D + + ++ K + N + GC
Sbjct: 151 -GPLSGC-KANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGAR 208
Query: 268 IVNGV--TFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFG-- 323
+ + E GILG G A + + + L G+ L +N G
Sbjct: 209 QSGDLDSSNEEALDGILGFGKANSSMISQ--LASSGRVKKIFAHCLDGRNGGGIFAIGRV 266
Query: 324 -TPKVKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTL 382
PKV + T L P Y VN+ + VG + L IP+ ++ + G IIDSGTTL
Sbjct: 267 VQPKVNM------TPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTL 320
Query: 383 TNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDY-CFDAKGFDESSVPRLVFHFAGGVR 441
L E ++E L K +T + DY CF G + P + FHF V
Sbjct: 321 AYLP----EIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVF 376
Query: 442 FEPPVKSYIIDVAPQVKCIG-----VLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAP 496
Y+ + CIG + + + +++G+++ N L +DL + +G+
Sbjct: 377 LRVYPHDYLFP-HEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTE 435
Query: 497 SACN 500
C+
Sbjct: 436 YNCS 439
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 90.9 bits (224), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/411 (24%), Positives = 156/411 (37%), Gaps = 75/411 (18%)
Query: 113 LGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXX 172
+G YF ++ +GTP + F + DTGS+ W N
Sbjct: 82 IGLYFAKIGLGTPSRDFHVQVDTGSDILWVN----------------------------- 112
Query: 173 XXXXXXNNPCNG-VFCPQRSRTFKT----VTCSSRKCKVELSDLFSLTYCPKPSD----- 222
C G + CP++S + V SS V SD F +Y + S+
Sbjct: 113 ---------CAGCIRCPRKSDLVELTPYDVDASSTAKSVSCSDNFC-SYVNQRSECHSGS 162
Query: 223 PCLYDISYVDGSSAKGFFGSDTITVELSNG-RKGKLHNLTI--GCTKTIVNGVTFNEDT- 278
C Y I Y DGSS G+ D + ++L G R+ N TI GC + ++
Sbjct: 163 TCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAV 222
Query: 279 GGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFG---TPKVKLLSEMRR 335
GI+G G + +F+ + A Q GK L + N G +PKVK
Sbjct: 223 DGIMGFGQSNSSFISQLASQ--GKVKRSFAHCLDNNNGGGIFAIGEVVSPKVK------T 274
Query: 336 TELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFE 395
T + + Y VN+ I VG +L++ S +D G IIDSGTTL L Y L
Sbjct: 275 TPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLN 334
Query: 396 ALKKSLTKVK-RVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVA 454
+ S ++ F Y F P + F F V + Y+ V
Sbjct: 335 EILASHPELTLHTVQESFTCFHYTDKLDRF-----PTVTFQFDKSVSLAVYPREYLFQVR 389
Query: 455 PQVKCI-----GVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
C G+ G +++G++ N L +D+ + +G+ C+
Sbjct: 390 EDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 90.5 bits (223), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/467 (23%), Positives = 193/467 (41%), Gaps = 56/467 (11%)
Query: 50 ARRFAGEVDQVEAIKGFILRDTLRRQSMNQRFGLRNSN---NGSHRRKDSEMVQFQLPMH 106
A+ AG + + F L + + + R +R++ G + +V F P+
Sbjct: 32 AKYAAGPTKILPLQRAFPLDELVELSELRARDRVRHARILLGGGRQSSVGGVVDF--PVQ 89
Query: 107 SGRD-YGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXX 165
D Y +G YF +VK+G+P +F + DTGS+ W ++ +
Sbjct: 90 GSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLH---- 145
Query: 166 XXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCP-KPSDPC 224
F S T +VTCS C S +F T ++ C
Sbjct: 146 -------------------FFDAPGSLTAGSVTCSDPIC----SSVFQTTAAQCSENNQC 182
Query: 225 LYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHN---LTIGCTKTIVNGVTFNED-TGG 280
Y Y DGS G++ +DT + G ++ + GC+ +T ++ G
Sbjct: 183 GYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDG 242
Query: 281 ILGLGYAKDAFVDKAALQ--YGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTEL 338
I G G K + V + + + FS+CL S V G L+ M + L
Sbjct: 243 IFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGV---FVLGE---ILVPGMVYSPL 296
Query: 339 FLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALK 398
+ P Y +N++ I V GQML + + V++ + GTI+D+GTTLT L AY+ A+
Sbjct: 297 VPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAIS 356
Query: 399 KSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYI----IDVA 454
S++++ P G + C+ P + +FAGG + Y+ I
Sbjct: 357 NSVSQLV-TPIISNG--EQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDG 413
Query: 455 PQVKCIGVLAINGPGA-SVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
+ CIG P +++G+++ ++ ++ +DLA +G+A C+
Sbjct: 414 ASMWCIGFQ--KAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 458
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 90.5 bits (223), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/430 (23%), Positives = 163/430 (37%), Gaps = 92/430 (21%)
Query: 114 GEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXX 173
G Y V + GTP Q DTGS W +
Sbjct: 88 GGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYL------------------------ 123
Query: 174 XXXXXNNPCNGV------------FCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPS 221
C+G F P+ S + K + C S KC+ C +
Sbjct: 124 --------CSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNT 175
Query: 222 DPCL-----YDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNE 276
C Y + Y GS+A +L+ + + +GC+ +
Sbjct: 176 RNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLT------VPDFVVGCS------IISTR 223
Query: 277 DTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDH-LSHQNVSSYLTFGT----------P 325
GI G G + + L+ +FS+CLV NV++ L T P
Sbjct: 224 QPAGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTP 280
Query: 326 KVKLLSEMRRTELFLAA--PFYGVNVVGISVGGQMLKIPSQVWD--FNAQGGTIIDSGTT 381
+ + + A +Y +N+ I VG + +KIP + N GG+I+DSG+T
Sbjct: 281 GLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGST 340
Query: 382 LTNLALPAYEQLFEALKKSLTKVKRVPAGDF---GGLDYCFDAKGFDESSVPRLVFHFAG 438
T + P +E + E ++ R D GL CF+ G + +VP L+F F G
Sbjct: 341 FTFMERPVFELVAEEFASQMSNYTR--EKDLEKETGLGPCFNISGKGDVTVPELIFEFKG 398
Query: 439 GVRFEPPVKSYIIDVA-PQVKCIGVLA---INGPG----ASVIGNIMQQNHLWEFDLAHN 490
G + E P+ +Y V C+ V++ +N G A ++G+ QQN+L E+DL ++
Sbjct: 399 GAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLEND 458
Query: 491 TVGFAPSACN 500
GFA C+
Sbjct: 459 RFGFAKKKCS 468
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 89.4 bits (220), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 103/431 (23%), Positives = 173/431 (40%), Gaps = 92/431 (21%)
Query: 103 LPMHSGRD-YGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXX 161
P+ D + +G Y+ +VK+GTP ++F + DTGS+ W +
Sbjct: 70 FPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTS--------------- 114
Query: 162 XXXXXXXXXXXXXXXXXNNPCNGVFCPQRSR--------------TFKTVTCSSRKCKVE 207
CNG CP+ S + V+CS R+C
Sbjct: 115 --------------------CNG--CPKTSELQIQLSFFDPGVSSSASLVSCSDRRC--- 149
Query: 208 LSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTI---TVELSNGRKGKLHNLTIGC 264
S+ + + C P++ C Y Y DGS G++ SD + TV S GC
Sbjct: 150 YSNFQTESGC-SPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGC 208
Query: 265 TKTIVNGVTF-NEDTGGILGLGYAKDAFVDKAALQYGGK--FSYCLVDHLSHQNVSSYLT 321
+ + GI GLG + + + A+Q FS+CL S +
Sbjct: 209 SNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGI----- 263
Query: 322 FGTPKVKLLSEMRR-----TELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTII 376
+L +++R T L + P Y VN+ I+V GQ+L I V+ GTII
Sbjct: 264 ------MVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTII 317
Query: 377 DSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDY----CFDAKGFDESSVPRL 432
D+GTTL L AY +A+ ++++ R + Y CF+ D P++
Sbjct: 318 DTGTTLAYLPDEAYSPFIQAVANAVSQYGR-------PITYESYQCFEITAGDVDVFPQV 370
Query: 433 VFHFAGGVRFEPPVKSYI---IDVAPQVKCIGVLAINGPGASVIGNIMQQNHLWEFDLAH 489
FAGG ++Y+ + CIG ++ +++G+++ ++ + +DL
Sbjct: 371 SLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVR 430
Query: 490 NTVGFAPSACN 500
+G+A C+
Sbjct: 431 QRIGWAEYDCS 441
>AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14682210-14683484 REVERSE LENGTH=424
Length = 424
Score = 88.2 bits (217), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/389 (25%), Positives = 157/389 (40%), Gaps = 52/389 (13%)
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWFNSVH-KTHNKTQTXXXXXXXXXXXXXXXXXXXX 174
+ + +G P L DTGS+ TW + + K + +T
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQTIP-------------------- 117
Query: 175 XXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGS 234
F P RS T++ +C V + + + C Y + Y D S
Sbjct: 118 ----------FFHPSRSSTYRNASC------VSAPHAMPQIFRDEKTGNCQYHLRYRDFS 161
Query: 235 SAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDK 294
+ +G + +T E S+ N+ GC + ++ G+LGLG + V +
Sbjct: 162 NTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNSGFTKYS----GVLGLGPGTFSIVTR 217
Query: 295 AALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAPFYGVNVVGISV 354
+G KFSYC + + L G K+ E T L + Y +++ IS
Sbjct: 218 ---NFGSKFSYCFGSLTNPTYPHNILILGN-GAKI--EGDPTPLQIFQDRYYLDLQAISF 271
Query: 355 GGQMLKI-PSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKV-KRVPAGDF 412
G ++L I P + +QGGT+ID+G + T LA AYE L E + L +V +RV D
Sbjct: 272 GEKLLDIEPGTFQRYRSQGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWD- 330
Query: 413 GGLDYCFDAK-GFDESSVPRLVFHFAGGVRFEPPVKS-YIIDVAPQVKCIGVLAINGPGA 470
C++ D P + FHFAGG V+S ++ + C+ +
Sbjct: 331 QYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDM 390
Query: 471 SVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
SVIG + QQN+ ++L V F + C
Sbjct: 391 SVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114705-29117150 REVERSE LENGTH=466
Length = 466
Score = 87.0 bits (214), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 111/433 (25%), Positives = 167/433 (38%), Gaps = 80/433 (18%)
Query: 91 HRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHN 150
R+ S V F + SG Y LG Y+V + +G P + F L DTGS+ TW
Sbjct: 45 QNRRLSSTVVFPV---SGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQC------ 95
Query: 151 KTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFK----TVTCSSRKCKV 206
+ PCNG P R++ +K T+ CS C
Sbjct: 96 ----------------------------DAPCNGCTKP-RAKQYKPNHNTLPCSHILCSG 126
Query: 207 ELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTK 266
DL C P D C Y+I Y D +S+ G +D + ++L+NG L LT GC
Sbjct: 127 --LDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLR-LTFGCGY 183
Query: 267 TIVN-GVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTP 325
N G T GILGLG K L+ G +V LSH +L+ G
Sbjct: 184 DQQNPGPHPPPPTAGILGLGRGKVGL--STQLKSLGITKNVIVHCLSHTG-KGFLSIG-- 238
Query: 326 KVKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQG------GTIIDSG 379
EL ++ ++ S + P+++ FN + + DSG
Sbjct: 239 ----------DELVPSSGVTWTSLATNSPSKNYMAGPAELL-FNDKTTGVKGINVVFDSG 287
Query: 380 TTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDA----KGFDESS--VPRLV 433
++ T AY+ + + ++K L D L C+ K DE +
Sbjct: 288 SSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTIT 347
Query: 434 FHFA---GGVRFEPPVKSYIIDVAPQVKCIGVL---AINGPGASVIGNIMQQNHLWEFDL 487
F G F+ P +SY+I C+G+L I G ++IG+I Q + +D
Sbjct: 348 LRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDN 407
Query: 488 AHNTVGFAPSACN 500
+G+ S C+
Sbjct: 408 EKQRIGWISSDCD 420
>AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114946-29117150 REVERSE LENGTH=432
Length = 432
Score = 86.7 bits (213), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 111/433 (25%), Positives = 167/433 (38%), Gaps = 80/433 (18%)
Query: 91 HRRKDSEMVQFQLPMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHN 150
R+ S V F + SG Y LG Y+V + +G P + F L DTGS+ TW
Sbjct: 45 QNRRLSSTVVFPV---SGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQC------ 95
Query: 151 KTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFK----TVTCSSRKCKV 206
+ PCNG P R++ +K T+ CS C
Sbjct: 96 ----------------------------DAPCNGCTKP-RAKQYKPNHNTLPCSHILCSG 126
Query: 207 ELSDLFSLTYCPKPSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTK 266
DL C P D C Y+I Y D +S+ G +D + ++L+NG L LT GC
Sbjct: 127 --LDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLR-LTFGCGY 183
Query: 267 TIVN-GVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTP 325
N G T GILGLG K L+ G +V LSH +L+ G
Sbjct: 184 DQQNPGPHPPPPTAGILGLGRGKVGL--STQLKSLGITKNVIVHCLSHTG-KGFLSIG-- 238
Query: 326 KVKLLSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQG------GTIIDSG 379
EL ++ ++ S + P+++ FN + + DSG
Sbjct: 239 ----------DELVPSSGVTWTSLATNSPSKNYMAGPAELL-FNDKTTGVKGINVVFDSG 287
Query: 380 TTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDA----KGFDESS--VPRLV 433
++ T AY+ + + ++K L D L C+ K DE +
Sbjct: 288 SSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTIT 347
Query: 434 FHFA---GGVRFEPPVKSYIIDVAPQVKCIGVL---AINGPGASVIGNIMQQNHLWEFDL 487
F G F+ P +SY+I C+G+L I G ++IG+I Q + +D
Sbjct: 348 LRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDN 407
Query: 488 AHNTVGFAPSACN 500
+G+ S C+
Sbjct: 408 EKQRIGWISSDCD 420
>AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:17299264-17302718 FORWARD LENGTH=631
Length = 631
Score = 86.3 bits (212), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 94/399 (23%), Positives = 162/399 (40%), Gaps = 66/399 (16%)
Query: 114 GEYFVQVKVGTPGQKFWLAADTGSEFTWFN-SVHKTHNKTQTXXXXXXXXXXXXXXXXXX 172
G Y ++ +GTP Q+F L DTGS T+ S K K Q
Sbjct: 74 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQ------------------- 114
Query: 173 XXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVD 232
+ F P+ S +++ + C+ C C C+Y+ Y +
Sbjct: 115 ----------DPKFQPELSTSYQALKCNP-DCN-----------CDDEGKLCVYERRYAE 152
Query: 233 GSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAK---- 288
SS+ G D I+ N + GC G F++ GI+GLG K
Sbjct: 153 MSSSSGVLSEDLIS--FGNESQLSPQRAVFGCENE-ETGDLFSQRADGIMGLGRGKLSVV 209
Query: 289 DAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAPFYGVN 348
D VDK ++ FS C P + S ++ F +P+Y ++
Sbjct: 210 DQLVDKGVIE--DVFSLCYGGMEVGGGAMVLGKISPPPGMVFS---HSDPF-RSPYYNID 263
Query: 349 VVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVP 408
+ + V G+ LK+ +V FN + GT++DSGTT A+ + +A+ K + +KR+
Sbjct: 264 LKQMHVAGKSLKLNPKV--FNGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIH 321
Query: 409 AGDFGGLDYCFDAKGFDESSV----PRLVFHFAGGVRFEPPVKSYIIDVAPQVK---CIG 461
D D CF G D + + P + F G + ++Y+ +V+ C+G
Sbjct: 322 GPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHT-KVRGAYCLG 380
Query: 462 VLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
+ + +++G I+ +N L +D ++ +GF + C+
Sbjct: 381 IFP-DRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 85.1 bits (209), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 111/472 (23%), Positives = 193/472 (40%), Gaps = 61/472 (12%)
Query: 50 ARRFAGEVDQVEAIKGFILRDTLRRQSMNQRFGLRNSN---NGSHRRKDSEMVQFQLPMH 106
A+ AG + + F L + + + R +R++ G + +V F P+
Sbjct: 32 AKYAAGPTKILPLQRAFPLDELVELSELRARDRVRHARILLGGGRQSSVGGVVDF--PVQ 89
Query: 107 SGRD-YGLGE-----YFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXX 160
D Y +G YF +VK+G+P +F + DTGS+ W ++ +
Sbjct: 90 GSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDL 149
Query: 161 XXXXXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCP-K 219
F S T +VTCS C S +F T
Sbjct: 150 H-----------------------FFDAPGSLTAGSVTCSDPIC----SSVFQTTAAQCS 182
Query: 220 PSDPCLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHN---LTIGCTKTIVNGVTFNE 276
++ C Y Y DGS G++ +DT + G ++ + GC+ +T ++
Sbjct: 183 ENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSD 242
Query: 277 D-TGGILGLGYAKDAFVDKAALQ--YGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEM 333
GI G G K + V + + + FS+CL S V G L+ M
Sbjct: 243 KAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGV---FVLGE---ILVPGM 296
Query: 334 RRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQL 393
+ L + P Y +N++ I V GQML + + V++ + GTI+D+GTTLT L AY+
Sbjct: 297 VYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLF 356
Query: 394 FEALKKSLTKVKRVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGGVRFEPPVKSYI--- 450
A+ S++++ P G + C+ P + +FAGG + Y+
Sbjct: 357 LNAISNSVSQLV-TPIISNG--EQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHY 413
Query: 451 -IDVAPQVKCIGVLAINGPGA-SVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
I + CIG P +++G+++ ++ ++ +DLA +G+A C+
Sbjct: 414 GIYDGASMWCIGFQ--KAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 463
>AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19060485-19063248 REVERSE LENGTH=528
Length = 528
Score = 84.3 bits (207), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/393 (24%), Positives = 152/393 (38%), Gaps = 51/393 (12%)
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXX 175
Y+ V VGTP F +A DTGS+ W T
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGT-----------------TCIRDLEDIG 144
Query: 176 XXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDGSS 235
+ P N ++ P S T ++ CS ++C F C PS C Y ISY + +
Sbjct: 145 VPQSVPLN-LYTPNASTTSSSIRCSDKRC-------FGSKKCSSPSSICPYQISYSNSTG 196
Query: 236 AKGFFGSDTITVELSNGRKGKLH-NLTIGCTKTIVNGVTFNEDTGGILGL---GYAKDAF 291
KG D + + + + N+T+GC + N G+LGL GY+ +
Sbjct: 197 TKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSL 256
Query: 292 VDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAP--FYGVNV 349
+ KA + FS C + + ++FG + ++ T AP YGVN+
Sbjct: 257 LAKANIT-ANSFSMCFGRVIGNVG---RISFGD---RGYTDQEETPFISVAPSTAYGVNI 309
Query: 350 VGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPA 409
G+SV G P + F D+G++ T+L PAY L ++ + L + +R P
Sbjct: 310 SGVSVAGD----PVDIRLFAK-----FDTGSSFTHLREPAYGVLTKSFDE-LVEDRRRPV 359
Query: 410 GDFGGLDYCFD-AKGFDESSVPRLVFHFAGGVR--FEPPVKSYIIDVAPQVKCIGVLAIN 466
++C+D + P + F GG + P + + C+GVL
Sbjct: 360 DPELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSV 419
Query: 467 GPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
G +VIG + FD +G+ S C
Sbjct: 420 GLKINVIGQNFVAGYRIVFDRERMILGWKQSLC 452
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 79.3 bits (194), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 94/399 (23%), Positives = 162/399 (40%), Gaps = 64/399 (16%)
Query: 114 GEYFVQVKVGTPGQKFWLAADTGSEFTWFN-SVHKTHNKTQTXXXXXXXXXXXXXXXXXX 172
G Y ++ +GTP Q F L D+GS T+ S + K Q
Sbjct: 91 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQ------------------- 131
Query: 173 XXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVD 232
+ F P+ S T++ V C+ C C + C+Y+ Y +
Sbjct: 132 ----------DPKFQPEMSSTYQPVKCN-MDCN-----------CDDDREQCVYEREYAE 169
Query: 233 GSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAK---- 288
SS+KG G D I+ N + GC +T+ G +++ GI+GLG
Sbjct: 170 HSSSKGVLGEDLIS--FGNESQLTPQRAVFGC-ETVETGDLYSQRADGIIGLGQGDLSLV 226
Query: 289 DAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLAAPFYGVN 348
D VDK + Y +D + + + V S+ R +P+Y ++
Sbjct: 227 DQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSDPDR------SPYYNID 280
Query: 349 VVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVP 408
+ GI V G+ L + S+V F+ + G ++DSGTT L A+ EA+ + ++ +K++
Sbjct: 281 LTGIRVAGKQLSLHSRV--FDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQID 338
Query: 409 AGDFGGLDYCFDAKGFDESS-----VPRLVFHFAGGVRFEPPVKSYIIDVAPQ--VKCIG 461
D D CF + S P + F G + ++Y+ + C+G
Sbjct: 339 GPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLG 398
Query: 462 VLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
V +++G I+ +N L +D ++ VGF + C+
Sbjct: 399 VFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCS 437
>AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:18241003-18242478 FORWARD LENGTH=491
Length = 491
Score = 75.9 bits (185), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 101/442 (22%), Positives = 166/442 (37%), Gaps = 84/442 (19%)
Query: 104 PMHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXX 163
P+ RD Y + + +GTP Q + DTGS+ TW + + + +
Sbjct: 75 PLREVRD----GYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLK 130
Query: 164 XXXXXXXXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKC-KVELSDLFSLTYCPKPSD 222
VF P S T +C+S C ++ SD P D
Sbjct: 131 SP------------------SVFSPLHSSTSFRDSCASSFCVEIHSSD--------NPFD 164
Query: 223 PCLY---DISYVDGSSA------------KGFFGSDTITVELSNGRKGKLHNLTIGCTKT 267
PC +S + S+ +G S +T ++ R + + GC +
Sbjct: 165 PCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCVTS 224
Query: 268 IVNGVTFNEDTGGILGLGYAKDAFVDKAALQYGGKFSYCLV--DHLSHQNVSSYLTFGTP 325
T+ E G I G G + + G FS+C + +++ N+SS L G
Sbjct: 225 -----TYREPIG-IAGFGRGLLSLPSQLGFLEKG-FSHCFLPFKFVNNPNISSPLILGAS 277
Query: 326 KVKL-------LSEMRRTELFLAAPFYGVNVVGISVGGQMLKIPSQVWDFNAQG--GTII 376
+ + + M T ++ + + G+ + I ++P + F++QG G ++
Sbjct: 278 ALSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLV 337
Query: 377 DSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGDFGGLDYCFDAKGFD------ESSV- 429
DSGTT T+L P Y QL L+ ++T + G D C+ + E+ V
Sbjct: 338 DSGTTYTHLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVM 397
Query: 430 ---PRLVFHFAGGVRFEPPV-KSYIIDVAPQ----VKCIGVLAIN----GPGASVIGNIM 477
P + FHF P S+ AP V+C+ + GP A V G+
Sbjct: 398 MIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGP-AGVFGSFQ 456
Query: 478 QQNHLWEFDLAHNTVGFAPSAC 499
QQN +DL +GF C
Sbjct: 457 QQNVKVVYDLEKERIGFQAMDC 478
>AT3G51360.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19064294-19066560 REVERSE LENGTH=488
Length = 488
Score = 67.4 bits (163), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 96/399 (24%), Positives = 150/399 (37%), Gaps = 65/399 (16%)
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWF----NSVHKTHNKTQTXXXXXXXXXXXXXXXXX 171
++ V +GTP Q F +A DTGS+ W NS +T
Sbjct: 89 HYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIKL----------- 137
Query: 172 XXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYV 231
++ P +S++ VTC+S C + C P C Y I Y+
Sbjct: 138 ------------NIYNPSKSKSSSKVTCNSTLCALR-------NRCISPVSDCPYRIRYL 178
Query: 232 D-GSSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDA 290
GS + G D I + G + + +T GC+++ + G+ GI+GL A D
Sbjct: 179 SPGSKSTGVLVEDVIHMSTEEG-EARDARITFGCSESQL-GLFKEVAVNGIMGLAIA-DI 235
Query: 291 FVDKAALQYG---GKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTEL--FLAAPFY 345
V ++ G FS C N ++FG K S+ T L ++ FY
Sbjct: 236 AVPNMLVKAGVASDSFSMCF-----GPNGKGTISFGD---KGSSDQLETPLSGTISPMFY 287
Query: 346 GVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVK 405
V++ VG + +F A DSGT +T L P Y L S+ +
Sbjct: 288 DVSITKFKVGKVTVDT-----EFTAT----FDSGTAVTWLIEPYYTALTTNFHLSVPD-R 337
Query: 406 RVPAGDFGGLDYCFDAKGF-DESSVPRLVFHFAGGVRFEPPVKSYIIDVAP---QVKCIG 461
R+ ++C+ DE +P + F GG ++ + D + QV C+
Sbjct: 338 RLSKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLA 397
Query: 462 VLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
VL S+IG N+ D +G+ S CN
Sbjct: 398 VLKQVNADFSIIGQNFMTNYRIVHDRERRILGWKKSNCN 436
>AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19053480-19056152 REVERSE LENGTH=529
Length = 529
Score = 66.6 bits (161), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 90/394 (22%), Positives = 150/394 (38%), Gaps = 52/394 (13%)
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXX 175
++ V VGTP F +A DTGS+ W
Sbjct: 102 HYANVSVGTPATWFLVALDTGSDLFWLPC-----------------NCGSTCIRDLKEVG 144
Query: 176 XXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYV--DG 233
+ P N ++ P S T ++ CS +C + CP Y I Y+ D
Sbjct: 145 LSQSRPLN-LYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCP-------YQIQYLSKDT 196
Query: 234 SSAKGFFGSDTITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLG---YAKDA 290
+ F V G + N+T+GC K + + G+LGLG Y+ +
Sbjct: 197 FTTGTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPS 256
Query: 291 FVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFLA--APFYGVN 348
+ KA + FS C + + +V ++FG K ++ T L +P Y V+
Sbjct: 257 ILAKAKIT-ANSFSMCFGNII---DVVGRISFGD---KGYTDQMETPLLPTEPSPTYAVS 309
Query: 349 VVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVP 408
V +SVGG + + Q + D+GT+ T+L P Y + +A +T KR P
Sbjct: 310 VTEVSVGGDAVGV---------QLLALFDTGTSFTHLLEPEYGLITKAFDDHVTD-KRRP 359
Query: 409 AGDFGGLDYCFDAKGFDESSV-PRLVFHFAGGVRFEPPVKSYII--DVAPQVKCIGVLAI 465
++C+D + + PR+ F GG + +I+ + + C+G+L
Sbjct: 360 IDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKS 419
Query: 466 NGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
++IG + FD +G+ S C
Sbjct: 420 VDFKINIIGQNFMSGYRIVFDRERMILGWKRSDC 453
>AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19057013-19059788 REVERSE LENGTH=530
Length = 530
Score = 66.2 bits (160), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 91/408 (22%), Positives = 153/408 (37%), Gaps = 75/408 (18%)
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWF------NSVHKTHNKTQTXXXXXXXXXXXXXXX 169
++ V +GTP F +A DTGS+ W +H + +
Sbjct: 103 HYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESV------------ 150
Query: 170 XXXXXXXXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDIS 229
P N ++ P S T ++ CS ++C F C P C Y I+
Sbjct: 151 -----------PLN-LYTPNASTTSSSIRCSDKRC-------FGSGKCSSPESICPYQIA 191
Query: 230 YVDGSSAKGFFGSDTI-TVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLG--- 285
+ G D + V K N+T+GC + + G+LGL
Sbjct: 192 LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKE 251
Query: 286 YAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTEL--FLAAP 343
Y+ + + KA + FS C +S V ++FG K ++ T L +
Sbjct: 252 YSVPSLLAKANIT-ANSFSMCFGRIIS---VVGRISFGD---KGYTDQEETPLVSLETST 304
Query: 344 FYGVNVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTK 403
YGVNV G+SVGG + +P + D+G++ T L AY +A L +
Sbjct: 305 AYGVNVTGVSVGGVPVDVPLFA---------LFDTGSSFTLLLESAYGVFTKAFDD-LME 354
Query: 404 VKRVPAGDFGGLDYCFDAKG--FDESSVPRLVFH---------FAGGVRFEPPVKSYIID 452
KR P ++C+D + + + PR + F ++ + +
Sbjct: 355 DKRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSYSN 414
Query: 453 VAPQVKCIGVL-AINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
++ C+G+L +IN ++IG + H FD +G+ S C
Sbjct: 415 EGTKMYCLGILKSIN---LNIIGQNLMSGHRIVFDRERMILGWKQSNC 459
>AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3150843-3153380 FORWARD LENGTH=528
Length = 528
Score = 63.9 bits (154), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 114/512 (22%), Positives = 177/512 (34%), Gaps = 121/512 (23%)
Query: 33 EEEEVQGMSMELVHRHDARRFAGEVDQVEAIKGFILRDTL-RRQSMNQRFGLRNSNNGSH 91
EE S L+HR A +IK D+L +QS+ L S+
Sbjct: 18 EETLASLFSSRLIHRFSDEGRA-------SIKTPSSSDSLPNKQSLEYYRLLAESDFRRQ 70
Query: 92 RRKDSEMVQFQLP------MHSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSV 145
R VQ +P + SG D+G Y + +GTP F +A DTGS W
Sbjct: 71 RMNLGAKVQSLVPSEGSKTISSGNDFGWLHY-TWIDIGTPSVSFLVALDTGSNLLWI--- 126
Query: 146 HKTHNKTQTXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFC------------------ 187
PCN V C
Sbjct: 127 -----------------------------------PCNCVQCAPLTSTYYSSLATKDLNE 151
Query: 188 --PQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDG-SSAKGFFGSDT 244
P S T K CS + C S + C P + C Y ++Y+ G +S+ G D
Sbjct: 152 YNPSSSSTSKVFLCSHKLCD-------SASDCESPKEQCPYTVNYLSGNTSSSGLLVEDI 204
Query: 245 ITV------ELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAK---DAFVDKA 295
+ + L NG + IGC K G++GLG A+ +F+ KA
Sbjct: 205 LHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKA 264
Query: 296 ALQYGGKFSYCLVDHLSHQ----NVSSYLTFGTPKVKLLSEMRRTELFLAAPFYGVNVVG 351
L FS C + S + ++ + TP ++L + Y V V
Sbjct: 265 GLMR-NSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSG--------YIVGVEA 315
Query: 352 ISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVPAGD 411
+G LK S T IDSG + T L E+++ + + + + +
Sbjct: 316 CCIGNSCLKQTSFT--------TFIDSGQSFTYLP----EEIYRKVALEIDRHINATSKN 363
Query: 412 FGGL--DYCFDAKGFDESSVPRLVFHFAGGVRF--EPPVKSYIIDVAPQVKCIGVLAING 467
F G+ +YC+++ E VP + F+ F P+ + C+ +
Sbjct: 364 FEGVSWEYCYESSA--EPKVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQ 421
Query: 468 PGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
G IG + + FD + +G++PS C
Sbjct: 422 EGIGSIGQNYMRGYRMVFDRENMKLGWSPSKC 453
>AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:7568286-7569455 FORWARD LENGTH=389
Length = 389
Score = 62.0 bits (149), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 104/435 (23%), Positives = 163/435 (37%), Gaps = 114/435 (26%)
Query: 95 DSEMVQFQLPM-HSGRDYGLGEYFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQ 153
DS++V L HS R GL + ++ G+P +K +L DTGS TW TQ
Sbjct: 39 DSKVVSLPLSSPHSQR--GLA-FMAEIHFGSPQKKQFLHMDTGSSLTW----------TQ 85
Query: 154 TXXXXXXXXXXXXXXXXXXXXXXXXNNPCNGVFC--------PQRSRTFKTVTCSSRKCK 205
PC+ + P S T++ C K
Sbjct: 86 CF-------------------------PCSDCYAQKIYPKYRPAASITYRDAMCEDSHPK 120
Query: 206 VELSDLFSLTYCPKPSDP----CLYDISYVDGSSAKGFFGSDTITVELSNGRKGKLHNLT 261
F DP C Y Y+D ++ KG + ITV+ +G ++H +
Sbjct: 121 SNPHFAF---------DPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVY 171
Query: 262 IGCTKTIVNGVTFNEDTG-GILGLGYAKDAFVDKAALQYGGKFSYCLVDHLSHQNVSSYL 320
GC T+ +G F TG GILGLG K + + ++G KFS+CL + +S S L
Sbjct: 172 FGC-NTLSDGSYF---TGTGILGLGVGKYSIIG----EFGSKFSFCLGE-ISEPKASHNL 222
Query: 321 TFGT-------PKVKLLSEMRRTELFLAAPFYGVNVVGISVGGQM-LKIPSQVWDFNAQG 372
G P V ++E + I VG ++ L P QV+
Sbjct: 223 ILGDGANVQGHPTVINITEGHTI----------FQLESIIVGEEITLDDPVQVF------ 266
Query: 373 GTIIDSGTTLTNLALPAYEQLFEALKKSL--TKVKRVPAGDFGGLDYCFDAKGFDESSVP 430
+D+G+TL++L+ Y + +A + + P C+ A +
Sbjct: 267 ---VDTGSTLSHLSTNLYYKFVDAFDDLIGSRPLSYEPT-------LCYKADTIERLEKM 316
Query: 431 RLVFHFAGGVRFEPPVKSYIIDVA-PQVKCIGVLAINGPGAS----VIGNIMQQNHLWEF 485
+ F F G + + I P+++C LAI S +IG I Q + +
Sbjct: 317 DVGFKFDVGAELSVNIHNIFIQQGPPEIRC---LAIQNNKESFSHVIIGVIAMQGYNVGY 373
Query: 486 DLAHNTVGFAPSACN 500
DL+ T C+
Sbjct: 374 DLSAKTAYINKQDCD 388
>AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16993339-16995721 FORWARD LENGTH=524
Length = 524
Score = 61.2 bits (147), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/375 (23%), Positives = 142/375 (37%), Gaps = 59/375 (15%)
Query: 116 YFVQVKVGTPGQKFWLAADTGSEFTWFNSVHKTHNKTQTXXXXXXXXXXXXXXXXXXXXX 175
++ VK+GTPG +F +A DTGS+ W T+
Sbjct: 107 HYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFEL------------ 154
Query: 176 XXXNNPCNGVFCPQRSRTFKTVTCSSRKCKVELSDLFSLTYCPKPSDPCLYDISYVDG-S 234
++ P+ S T K VTC++ C L + + CP Y +SYV +
Sbjct: 155 --------SIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCP-------YMVSYVSAQT 199
Query: 235 SAKGFFGSDT--ITVELSNGRKGKLHNLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFV 292
S G D +T E N + + + +T GC + G+ GLG K +
Sbjct: 200 STSGILMEDVMHLTTEDKNPERVEAY-VTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVP 258
Query: 293 DKAALQ--YGGKFSYCLVDHLSHQNVSSYLTFGTPKVKLLSEMRRTELFL--AAPFYGVN 348
A + FS C H V ++FG K S+ T L + P Y +
Sbjct: 259 SVLAREGLVADSFSMC----FGHDGVGR-ISFGD---KGSSDQEETPFNLNPSHPNYNIT 310
Query: 349 VVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKRVP 408
V + VG ++ + + + D+GT+ T L P Y + E+ + P
Sbjct: 311 VTRVRVGTTLI---------DDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSP 361
Query: 409 AGDFGGLDYCFD-AKGFDESSVPRLVFHFAGGVRFEPPVKSYIIDVAPQVKCIGVLAING 467
+YC+D + + S +P L G F + II ++ + + + LAI
Sbjct: 362 DSRI-PFEYCYDMSNDANASLIPSLSLTMKGNSHFT--INDPIIVISTEGELVYCLAI-- 416
Query: 468 PGASVIGNIMQQNHL 482
S NI+ QN++
Sbjct: 417 -VKSSELNIIGQNYM 430
>AT3G42550.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:14665728-14669135 REVERSE LENGTH=430
Length = 430
Score = 56.2 bits (134), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 77/164 (46%), Gaps = 15/164 (9%)
Query: 348 NVVGISVGGQMLKIPSQVWDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVKR- 406
+++ ++V L I V+ GTIIDSGTTL + AY+ L +A+ +++ R
Sbjct: 231 HMMTVAVNDLRLPIDPSVFSVAKGYGTIIDSGTTLVHFPGEAYDPLIQAILNVVSQYGRP 290
Query: 407 VPAGDFGGLDYCFDAKGFDESSV------PRLVFHFAGGVRFEPPVKSYI----IDVAPQ 456
+P F CF+ S + P + FAGG ++Y+ +D+
Sbjct: 291 IPYESFQ----CFNITSGISSHLVIADMFPEVHLGFAGGASMVIKPEAYLFQKFLDLTNA 346
Query: 457 VKCIGVLAINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSACN 500
+ C+G + ++IG + ++ ++ +DL H +G+A C+
Sbjct: 347 IWCLGFYSSTSRRITIIGEVAIRDKMFVYDLDHQRIGWAEYNCS 390
>AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:9329933-9331432 REVERSE LENGTH=499
Length = 499
Score = 53.5 bits (127), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 113/292 (38%), Gaps = 62/292 (21%)
Query: 259 NLTIGCTKTIVNGVTFNEDTGGILGLGYAKDAFVDKAALQ---YGGKFSYCLVDHLSHQN 315
N T GC T + + G+ G G + + + A+ G FSYCLV H +
Sbjct: 211 NFTFGCAHTTL------AEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSD 264
Query: 316 V---SSYLTFG-------------------TPKVKLLSEMRRTELFLAAP----FYGVNV 349
S L G + K +E TE+ L P FY V++
Sbjct: 265 RVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEM-LENPKHPYFYSVSL 323
Query: 350 VGISVGGQMLKIPSQV--WDFNAQGGTIIDSGTTLTNLALPAYEQLFEALKKSLTKVK-- 405
GIS+G + + P+ + D N GG ++DSGTT T L Y + E + +V
Sbjct: 324 QGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHER 383
Query: 406 --RVPAGDFGGLDYCFDAKGFDESSVPRLVFHFAGG-VRFEPPVKSYII------DVAPQ 456
RV G+ C+ VP LV HFAG P ++Y D +
Sbjct: 384 ADRVEPS--SGMSPCYYLN--QTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEE 439
Query: 457 VKCIGVL---------AINGPGASVIGNIMQQNHLWEFDLAHNTVGFAPSAC 499
+ IG L + G +++GN QQ +DL + VGFA C
Sbjct: 440 KRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 491