Miyakogusa Predicted Gene
- Lj0g3v0281939.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0281939.1 Non Chatacterized Hit- tr|I1KD66|I1KD66_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max PE=4,71.47,0,Acid
proteases,Peptidase aspartic; Asp,Peptidase A1; CHLOROPLAST NUCLEIOD
DNA-BINDING-RELATED,NULL; ,gene.g21904.t1.1
(375 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 160 1e-39
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 158 7e-39
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 157 1e-38
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 137 1e-32
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 133 2e-31
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 132 3e-31
AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family pr... 128 5e-30
AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family pr... 124 2e-28
AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family pr... 121 7e-28
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 117 2e-26
AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 115 6e-26
AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 114 9e-26
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 113 2e-25
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 4e-24
AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family pr... 109 4e-24
AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 107 1e-23
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 106 3e-23
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 106 3e-23
AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 103 2e-22
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 103 3e-22
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 3e-22
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 4e-22
AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 4e-22
AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 5e-22
AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 102 6e-22
AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family pr... 101 7e-22
AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 1e-21
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 100 1e-21
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 99 4e-21
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 96 5e-20
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 93 3e-19
AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 91 2e-18
AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 90 2e-18
AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family pr... 89 4e-18
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 88 1e-17
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 86 4e-17
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 85 8e-17
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 83 3e-16
AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 82 8e-16
AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family pr... 80 2e-15
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 79 5e-15
AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family pr... 77 2e-14
AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family pr... 76 4e-14
AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family pr... 76 5e-14
AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family pr... 74 2e-13
AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family pr... 74 2e-13
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 73 4e-13
AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 71 1e-12
AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family pr... 71 1e-12
AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 71 1e-12
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 67 2e-11
AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family pr... 66 4e-11
AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family pr... 65 8e-11
AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family pr... 65 1e-10
AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family pr... 64 3e-10
AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family pr... 62 5e-10
AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 8e-10
AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 9e-10
AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 9e-10
AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family pr... 58 1e-08
AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family pr... 53 4e-07
AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 52 7e-07
AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family pr... 50 3e-06
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 160 bits (405), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 117/358 (32%), Positives = 175/358 (48%), Gaps = 24/358 (6%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y T + IG P + V++ +D GS ++W QC PC+ CY P+F +SS+Y+ L C DT
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSC--DT 205
Query: 80 CLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSY 139
P C C Y V G+ S + G T+TL + ++N +GCG S
Sbjct: 206 ---PQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTI--GSTLVQNVAVGCGHSN 260
Query: 140 KGPFMTQFSGVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYDHTLPFIEDN 199
+G F+ + G L++ SQLN +FS+C V SD S+++F P
Sbjct: 261 EGLFVGAAGLLGLGGGL-LALPSQLNTTSFSYCLV--DRDSDSASTVDFGTSLSP----- 312
Query: 200 NSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLP 259
++V+ PL N +Y+L GIS+ G +L I + + GGIIID GT +T L
Sbjct: 313 DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQ 372
Query: 260 SDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNV--YPTIEFYFENGNIAGENFVS 317
++ Y+ R + DL K G + CY PT+ F+F G + +
Sbjct: 373 TEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKN 432
Query: 318 YKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
Y + + GT CL+FA SS L +IG+ Q QGT +T+DL N ++ F+ NKC
Sbjct: 433 YMIPVDSV------GTFCLAFAPTASS-LAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 158 bits (399), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 174/369 (47%), Gaps = 38/369 (10%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTC-LI 82
L IG P VD GS + W QC PC+ C+ P+F SS+Y ++GC S C +
Sbjct: 111 LSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNAL 170
Query: 83 PMMRDQVFGNCTGWK--CRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYK 140
P NC K C Y G+ S + G++ T+T FE N+ I GCG +
Sbjct: 171 PR------SNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS-ISGIGFGCGVENE 223
Query: 141 GPFMTQFSGVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFY---------DH 191
G +Q SG+ GLGRGPLS+ SQL FS+C L S D +S + +
Sbjct: 224 GDGFSQGSGLVGLGRGPLSLISQLKETKFSYC---LTSIEDSEASSSLFIGSLASGIVNK 280
Query: 192 TLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDI 251
T ++ + + L N P +Y+L+ GI++ L ++ + + GG+IID
Sbjct: 281 TGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDS 340
Query: 252 GTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYK--DDPSNV-YPTIEFYFENG 308
GT +TYL A+ V + E V G GL+ C+K D N+ P + F+F+
Sbjct: 341 GTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGA 400
Query: 309 NIA--GENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNE 366
++ GEN++ + + G +CL A G S+ +++ G+ Q Q + +DL E
Sbjct: 401 DLELPGENYM---------VADSSTGVLCL--AMGSSNGMSIFGNVQQQNFNVLHDLEKE 449
Query: 367 VLVFTYNKC 375
+ F +C
Sbjct: 450 TVSFVPTEC 458
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 157 bits (397), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/354 (33%), Positives = 179/354 (50%), Gaps = 25/354 (7%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIPMM 85
+GTP + ++L +D GS ++W QC PC+ CY P+F +SSTYK L C + C +
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLET 227
Query: 86 RDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYKGPFMT 145
C KC Y V G+ S + G + TDT+ F +S +I N +GCG +G F T
Sbjct: 228 -----SACRSNKCLYQVSYGDGSFTVGELATDTVTFGNS-GKINNVALGCGHDNEGLF-T 280
Query: 146 QFSGVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVP 205
+G+ GLG G LS+ +Q+ A +FS+C V SG + SS++F L P
Sbjct: 281 GAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSG--KSSSLDFNSVQL----GGGDATAP 334
Query: 206 LKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSV 265
L N +Y++ G S+ G + + ++ + GG+I+D GT +T L + AY+
Sbjct: 335 LLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNS 394
Query: 266 FRSEVRRVDHDLVKKPGYDGL-EFCYK-DDPSNV-YPTIEFYFENGNIAGENFVSYKLNN 322
R ++ +L K L + CY S V PT+ F+F G S L
Sbjct: 395 LRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGK-------SLDLPA 447
Query: 323 NQTLFQAEE-GTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
L ++ GT C +FA SS+L++IG+ Q QGT +TYDL V+ + NKC
Sbjct: 448 KNYLIPVDDSGTFCFAFAP-TSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 137 bits (346), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 158/356 (44%), Gaps = 23/356 (6%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIPMM 85
IGTP + D GS + W QCAPC CY PLF + SSTYK++ C S C +
Sbjct: 96 IGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCT--AL 153
Query: 86 RDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNA---EIKNFIMGCGDSYKGP 142
+Q + C Y++ G+ S + G + DTL S+ ++KN I+GCG + G
Sbjct: 154 ENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGT 213
Query: 143 FMTQFSGVFGLGRGPLSVQSQLNAK---AFSFCPVRLGSGSDQPSSIEFYDHTLPFIEDN 199
F + SG+ GLG GP+S+ QL FS+C V L S DQ S I F T + +
Sbjct: 214 FNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINF--GTNAIVSGS 271
Query: 200 NSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLP 259
V PL + +Y+L IS+ + IIID GT LT LP
Sbjct: 272 GVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEG---NIIIDSGTTLTLLP 328
Query: 260 SDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENGNIAGENFVSYK 319
++ YS V K+ GL CY P I +F+ ++ K
Sbjct: 329 TEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADV--------K 380
Query: 320 LNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
L+++ Q E +C +F S + ++ G+ L+ YD V++ + F C
Sbjct: 381 LDSSNAFVQVSEDLVCFAFR--GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 133 bits (335), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 163/365 (44%), Gaps = 31/365 (8%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y T L +GTP + V++ +D GS I W QCAPC CY P+F R S TY + C S
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPH 201
Query: 80 CLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSY 139
C D N C Y V G+ S + G T+TL F + +K +GCG
Sbjct: 202 C---RRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN--RVKGVALGCGHDN 256
Query: 140 KGPFMTQFSGVFGLGRG---PLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYDHTLPFI 196
+G F+ + P + N K FS+C V S S +PSS+ F + + I
Sbjct: 257 EGLFVGAAGLLGLGKGKLSFPGQTGHRFNQK-FSYCLVDR-SASSKPSSVVFGNAAVSRI 314
Query: 197 EDNNSVMVPLKENDAHPYYYFLQFVGISINGFML-DIQSKVWGYGLNYDGGIIIDIGTNL 255
+ PL N +Y++ +GIS+ G + + + ++ +GG+IID GT++
Sbjct: 315 ----ARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSV 370
Query: 256 TYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNV----YPTIEFYFENGNIA 311
T L AY R R L + P + + C+ D SN+ PT+ +F +++
Sbjct: 371 TRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCF--DLSNMNEVKVPTVVLHFRGADVS 428
Query: 312 GENFVSYKLNNNQTLFQAE-EGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVF 370
L L + G C +FA G L++IG+ Q QG + YDL + + F
Sbjct: 429 --------LPATNYLIPVDTNGKFCFAFA-GTMGGLSIIGNIQQQGFRVVYDLASSRVGF 479
Query: 371 TYNKC 375
C
Sbjct: 480 APGGC 484
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 132 bits (333), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 116/366 (31%), Positives = 164/366 (44%), Gaps = 33/366 (9%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y + IGTP + D GS + W QC PC CY PLF + SSTY+++ C S
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQ 145
Query: 80 CLIPMMRDQVFGNCTGWK--CRYNVRSGNESRSFGVMVTDTLIFEHSN---AEIKNFIMG 134
C R +C+ + C Y + G+ S + G + DT+ S ++N I+G
Sbjct: 146 C-----RALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIG 200
Query: 135 CGDSYKGPFMTQFSGVFGLGRGPLSVQSQLNAK---AFSFCPVRLGSGSDQPSSIEFYDH 191
CG G F SG+ GLG G S+ SQL FS+C V S + S I F
Sbjct: 201 CGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINF--G 258
Query: 192 TLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDI 251
T + + V + + D YYFL IS+ + S ++G G +G I+ID
Sbjct: 259 TNGIVSGDGVVSTSMVKKDPAT-YYFLNLEAISVGSKKIQFTSTIFGTG---EGNIVIDS 314
Query: 252 GTNLTYLPSDAYSVFRSEV-RRVDHDLVKKPGYDG-LEFCYKDDPSNVYPTIEFYFENGN 309
GT LT LPS+ Y S V + + V+ P DG L CY+D S P I +F+ G+
Sbjct: 315 GTTLTLLPSNFYYELESVVASTIKAERVQDP--DGILSLCYRDSSSFKVPDITVHFKGGD 372
Query: 310 IAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLV 369
+ KL N T E C +FA + LT+ G+ L+ YD V+ +
Sbjct: 373 V--------KLGNLNTFVAVSEDVSCFAFAANEQ--LTIFGNLAQMNFLVGYDTVSGTVS 422
Query: 370 FTYNKC 375
F C
Sbjct: 423 FKKTDC 428
>AT3G20015.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6978746-6980158 REVERSE LENGTH=470
Length = 470
Score = 128 bits (322), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 164/359 (45%), Gaps = 34/359 (9%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIPMM 85
+G+P + ++ +D GS + W QC PC CY P+F S +Y + C S C
Sbjct: 137 VGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVC----- 191
Query: 86 RDQVFGN-CTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYKGPFM 144
D++ + C CRY V G+ S + G + +TL F + ++N MGCG +G F+
Sbjct: 192 -DRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF--AKTVVRNVAMGCGHRNRGMFI 248
Query: 145 TQFSGVFGLGRGPLSVQSQLNAK---AFSFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNS 201
+ G +S QL+ + AF +C V G+D S+ F LP +
Sbjct: 249 GAAGLLGIGGGS-MSFVGQLSGQTGGAFGYCLV--SRGTDSTGSLVFGREALPV----GA 301
Query: 202 VMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSD 261
VPL N P +Y++ G+ + G + + V+ DGG+++D GT +T LP+
Sbjct: 302 SWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTA 361
Query: 262 AYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDP--SNVYPTIEFYFENG---NIAGENFV 316
AY FR + +L + G + CY S PT+ FYF G + NF+
Sbjct: 362 AYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFL 421
Query: 317 SYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+ + GT C +FA + L++IG+ Q +G +++D N + F N C
Sbjct: 422 ---------MPVDDSGTYCFAFA-ASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
>AT3G61820.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:22880074-22881525 REVERSE LENGTH=483
Length = 483
Score = 124 bits (310), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 166/366 (45%), Gaps = 35/366 (9%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIP 83
L +GTP V++ +D GS + W QC+PC +CY +F + S T+ + C S C
Sbjct: 139 LGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLC--- 195
Query: 84 MMRDQVFGNCTGWK---CRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYK 140
R C + C Y V G+ S + G T+TL F A + + +GCG +
Sbjct: 196 -RRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFH--GARVDHVPLGCGHDNE 252
Query: 141 GPFMTQFSGVFGLGRG---PLSVQSQLNAKAFSFCPV---RLGSGSDQPSSIEFYDHTLP 194
G F+ + G P +++ N K FS+C V GS S PS+I F + +P
Sbjct: 253 GLFVGAAGLLGLGRGGLSFPSQTKNRYNGK-FSYCLVDRTSSGSSSKPPSTIVFGNAAVP 311
Query: 195 FIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFML-DIQSKVWGYGLNYDGGIIIDIGT 253
SV PL N +Y+LQ +GIS+ G + + + +GG+IID GT
Sbjct: 312 ----KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGT 367
Query: 254 NLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNV----YPTIEFYFENGN 309
++T L AY R R L + P Y + C+ D S + PT+ F+F G
Sbjct: 368 SVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCF--DLSGMTTVKVPTVVFHFGGGE 425
Query: 310 IAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLV 369
++ +N + EG C +FA G +L++IG+ Q QG + YDLV +
Sbjct: 426 VSLP-------ASNYLIPVNTEGRFCFAFA-GTMGSLSIIGNIQQQGFRVAYDLVGSRVG 477
Query: 370 FTYNKC 375
F C
Sbjct: 478 FLSRAC 483
>AT1G31450.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:11259872-11261209 REVERSE LENGTH=445
Length = 445
Score = 121 bits (304), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 113/367 (30%), Positives = 165/367 (44%), Gaps = 33/367 (8%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIPMM 85
IGTP VF D GS ++W QC PC CY PLF + SSTYK C S TC
Sbjct: 91 IGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSE 150
Query: 86 RDQVFGNCTGWK--CRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNF---IMGCGDSYK 140
++ C K C+Y G+ S + G + T+T+ + S+ +F + GCG +
Sbjct: 151 HEE---GCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCGYNNG 207
Query: 141 GPFMTQFSGVFGLGRGPLSVQSQLNA---KAFSFCPVRLGSGSDQPSSIEFYDHTLPF-- 195
G F SG+ GLG GPLS+ SQL + K FS+C + ++ S I +++P
Sbjct: 208 GTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSVINLGTNSIPSNP 267
Query: 196 IEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLN-----YDGGIIID 250
+D+ ++ PL + D YYFL +++ L GYGLN G IIID
Sbjct: 268 SKDSATLTTPLIQKDPET-YYFLTLEAVTVGKTKLPYTG--GGYGLNGKSSKRTGNIIID 324
Query: 251 IGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDG-LEFCYKDDPSNV-YPTIEFYFENG 308
GT LT L S Y F + V + G L C+K + P I +F N
Sbjct: 325 SGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGDKEIGLPAITMHFTNA 384
Query: 309 NIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVL 368
++ KL+ + E T+CLS ++ + + G+ L+ YDL + +
Sbjct: 385 DV--------KLSPINAFVKLNEDTVCLSMIP--TTEVAIYGNMVQMDFLVGYDLETKTV 434
Query: 369 VFTYNKC 375
F C
Sbjct: 435 SFQRMDC 441
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 117 bits (292), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/388 (28%), Positives = 157/388 (40%), Gaps = 62/388 (15%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSC-YPMQRPLFITRASSTYKELGCYSDTC-L 81
L IG P Q + L D GS + W +C+ C +C + +F R SST+ CY C L
Sbjct: 88 LRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRL 147
Query: 82 IPMMRDQVFGNCTGWK--CRYNVRSGNESRSFGVMVTDTLIFEHSN---AEIKNFIMGCG 136
+P N T C Y + S + G+ +T + S+ A +K+ GCG
Sbjct: 148 VPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCG 207
Query: 137 -----DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAK---AFSFC---------PVR---L 176
S G +GV GLGRGP+S SQL + FS+C P +
Sbjct: 208 FRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLII 267
Query: 177 GSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKV 236
G+G D S + F PL N P +Y+++ + +NG L I +
Sbjct: 268 GNGGDGISKLFF---------------TPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSI 312
Query: 237 WGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYD----GLEFCYK- 291
W + +GG ++D GT L +L AY + VRR VK P D G + C
Sbjct: 313 WEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRR----RVKLPIADALTPGFDLCVNV 368
Query: 292 ---DDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAE-GKSSALT 347
P + P ++F F G + FV N + EE CL+ +
Sbjct: 369 SGVTKPEKILPRLKFEFSGGAV----FVPPPRN---YFIETEEQIQCLAIQSVDPKVGFS 421
Query: 348 VIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
VIG+ QG L +D L F+ C
Sbjct: 422 VIGNLMQQGFLFEFDRDRSRLGFSRRGC 449
>AT4G30030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14682210-14683484 REVERSE LENGTH=424
Length = 424
Score = 115 bits (287), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 163/380 (42%), Gaps = 52/380 (13%)
Query: 15 PVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELG 74
P A+ + IG P L +D GS ++W C PC CYP P F SSTY+
Sbjct: 73 PNPAAFLANISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNAS 131
Query: 75 CYSDTCLIPMM-RDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEI---KN 130
C S +P + RD+ GN C+Y++R + S + G++ + L FE S+ + +N
Sbjct: 132 CVSAPHAMPQIFRDEKTGN-----CQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQN 186
Query: 131 FIMGCGDSYKGPFMTQFSGVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYD 190
+ GCG G T++SGV GLG G S+ ++ FS+C GS ++ P+ Y
Sbjct: 187 IVFGCGQDNSG--FTKYSGVLGLGPGTFSIVTRNFGSKFSYC---FGSLTN-PT----YP 236
Query: 191 HTLPFIEDNNSVMVPLKENDAHPY-----YYFLQFVGISINGFMLDIQSKVWGYGLNYDG 245
H + + + + E D P Y+L IS +LDI+ + G
Sbjct: 237 HNILILGNGAKI-----EGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQR-YRSQG 290
Query: 246 GIIIDIGTNLTYLPSDAYSVFRSEV--------RRV-DHDLVKKPGYDGLEFCYKDDPSN 296
G +ID G + T L +AY E+ RRV D D P Y+G K D
Sbjct: 291 GTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEG---NLKLDLYG 347
Query: 297 VYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEG-TICLSFAEGKSSALTVIGSNQLQ 355
+P + F+F G L+ +E G + CL+ ++VIG+ Q
Sbjct: 348 -FPVVTFHFAGG-------AELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQ 399
Query: 356 GTLLTYDLVNEVLVFTYNKC 375
+ Y+L + F C
Sbjct: 400 NYNVGYNLRTMKVYFQRTDC 419
>AT4G30040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:14685602-14686885 FORWARD LENGTH=427
Length = 427
Score = 114 bits (286), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 168/382 (43%), Gaps = 48/382 (12%)
Query: 2 TNNTIMEATTNAYPVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPL 61
T + I + N + A+ + IG+P L +D S + W QC PC +CY P+
Sbjct: 67 TGDIIAHLSPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPI 126
Query: 62 FITRASSTYKELGCYSDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIF 121
F S T++ C + +P ++ N C Y++R +++ S G++ + L+F
Sbjct: 127 FDPSRSYTHRNETCRTSQYSMPSLK----FNANTRSCEYSMRYVDDTGSKGILAREMLLF 182
Query: 122 -----EHSNAEIKNFIMGCG-DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAKAFSFCPVR 175
E S+A + + + GCG D+Y P + +G+ GLG G S+ + K FS+C
Sbjct: 183 NTIYDESSSAALHDVVFGCGHDNYGEPLVG--TGILGLGYGEFSLVHRF-GKKFSYC--- 236
Query: 176 LGSGSDQPSSIEFYDHTLPFI-EDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQS 234
GS D PS Y H + + +D +++ + H +Y++ IS++G +L I
Sbjct: 237 FGS-LDDPS----YPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISVDGIILPIDP 291
Query: 235 KVWGYGLNYD-GGIIIDIGTNLTYLPSDAYSVFRSEVRRV-----------DHDLVKKPG 282
+V+ GG IID G +LT L +AY ++ + + D++K
Sbjct: 292 RVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMEC 351
Query: 283 YDG-LEFCYKDDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEG 341
Y+G E +D + +P + F+F G L+ + CL+ G
Sbjct: 352 YNGNFE---RDLVESGFPIVTFHFSEG-------AELSLDVKSLFMKLSPNVFCLAVTPG 401
Query: 342 KSSALTVIGSNQLQGTLLTYDL 363
L IG+ Q + YDL
Sbjct: 402 N---LNSIGATAQQSYNIGYDL 420
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 113 bits (283), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 161/392 (41%), Gaps = 41/392 (10%)
Query: 1 MTNNTIMEATTNAYPVIDA-------YATFLLIGTPVQIVFLRVDIGSPISWFQCAPC-S 52
+ + + E+ + P D Y + +GTP + L D GS ++W QC PC
Sbjct: 106 LATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR 165
Query: 53 SCYPMQRPLFITRASSTYKELGCYSDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFG 112
+CY + P+F S++Y + C S C G+C+ C Y ++ G++S S G
Sbjct: 166 TCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVG 225
Query: 113 VMVTDTLIFEHSNAEIKNFIMGCGDSYKGPFMTQFSGVFGLGRGPLSVQSQLNA---KAF 169
+ + +S+ GCG++ +G F T +G+ GLGR LS SQ K F
Sbjct: 226 FLAKEKFTLTNSDV-FDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIF 283
Query: 170 SFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMV---PLKENDAHPYYYFLQFVGISIN 226
S+C PSS + H L F S V P+ +Y L V I++
Sbjct: 284 SYC---------LPSSASYTGH-LTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVG 333
Query: 227 GFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGL 286
G L I S V+ G +ID GT +T LP AY+ RS + G L
Sbjct: 334 GQKLPIPSTVFS-----TPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSIL 388
Query: 287 EFCYKDD--PSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSS 344
+ C+ + P + F F G + +L + + + +CL+FA
Sbjct: 389 DTCFDLSGFKTVTIPKVAFSFSGGAVV-------ELGSKGIFYVFKISQVCLAFAGNSDD 441
Query: 345 ALTVIGSNQLQGTL-LTYDLVNEVLVFTYNKC 375
+ I N Q TL + YD + F N C
Sbjct: 442 SNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 109 bits (272), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 159/355 (44%), Gaps = 34/355 (9%)
Query: 35 LRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIPMMRDQVFGNCT 94
L VD GS ++W QC PC SCY Q PL+ SS+YK + C S TC + G C
Sbjct: 148 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCG 207
Query: 95 G------WKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYKGPFMTQFS 148
G C Y V G+ S + G + +++++ + +++NF+ GCG + KG F
Sbjct: 208 GNNGVVKTPCEYVVSYGDGSYTRGDLASESILL--GDTKLENFVFGCGRNNKGLFGGSSG 265
Query: 149 GVFGLGRGPLSVQSQLNAK---AFSFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVP 205
+ GLGR +S+ SQ FS+C L G+ S+ F + + + + P
Sbjct: 266 -LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGAS--GSLSFGNDSSVYTNSTSVSYTP 322
Query: 206 LKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSV 265
L +N +Y L G SI G ++++S +G GI+ID GT +T LP Y
Sbjct: 323 LVQNPQLRSFYILNLTGASIGG--VELKSSSFGR------GILIDSGTVITRLPPSIYKA 374
Query: 266 FRSEVRRVDHDLVKKPGYDGLEFCYK----DDPSNVYPTIEFYFENGNIAGENFVSYKLN 321
+ E + PGY L+ C+ +D S P I+ F+ GN E V+
Sbjct: 375 VKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDIS--IPIIKMIFQ-GNAELEVDVTGVF- 430
Query: 322 NNQTLFQAEEGTICLSFAE-GKSSALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+ + +CL+ A + + +IG+ Q + + YD E L C
Sbjct: 431 ---YFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>AT2G28010.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11930579-11931769 REVERSE LENGTH=396
Length = 396
Score = 109 bits (272), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/363 (25%), Positives = 152/363 (41%), Gaps = 43/363 (11%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y L +GTP + +D GS I+W QC PC CY P+F SST+KE
Sbjct: 65 YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKE------- 117
Query: 80 CLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAE---IKNFIMGCG 136
C G C Y V + + + G + T+T+ ++ E + I+GCG
Sbjct: 118 -----------KRCDGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCG 166
Query: 137 DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAK---AFSFCPVRLGSGSDQPSSIEFYDHTL 193
+ F FSG+ GL GP S+ +Q+ + S+C G+ S I F + +
Sbjct: 167 HN-NSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGT-----SKINFGANAI 220
Query: 194 PFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGT 253
+ + V + A P +Y+L +S+ ++ + + L +G I+ID GT
Sbjct: 221 --VAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTF-HAL--EGNIVIDSGT 275
Query: 254 NLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENG-NIAG 312
LTY P ++ R V V + CY D +++P I +F G ++
Sbjct: 276 TLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTIDIFPVITMHFSGGVDLVL 335
Query: 313 ENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTY 372
+ + Y +NN +F CL+ + + G+ L+ YD + ++ F+
Sbjct: 336 DKYNMYMESNNGGVF-------CLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSP 388
Query: 373 NKC 375
C
Sbjct: 389 TNC 391
>AT2G28220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:12033953-12037527 FORWARD LENGTH=756
Length = 756
Score = 107 bits (267), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 154/369 (41%), Gaps = 51/369 (13%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y L +GTP + +D GS I W QC PC +CY P+F SST++E
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFRE------- 473
Query: 80 CLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAE---IKNFIMGCG 136
C G C Y + +++ S G++ T+T+ ++ E + +GCG
Sbjct: 474 -----------QRCNGNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCG 522
Query: 137 -----DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAK---AFSFCPVRLGSGSDQPSSIEF 188
Y G F + SG+ GL GPLS+ SQ++ S+C G+ S I F
Sbjct: 523 LDNTNLQYSG-FASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGT-----SKINF 576
Query: 189 YDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGII 248
T + + +V + +P+YY L +S+ ++ + DG I
Sbjct: 577 --GTNAIVAGDGTVAADMFIKKDNPFYY-LNLDAVSVEDNLIATLGTPFH---AEDGNIF 630
Query: 249 IDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKP--GYDGLEFCYKDDPSNVYPTIEFYFE 306
ID GT LTY P ++ R V +V VK P G D L CY D +++P I +F
Sbjct: 631 IDSGTTLTYFPMSYCNLVREAVEQV-VTAVKVPDMGSDNL-LCYYSDTIDIFPVITMHFS 688
Query: 307 NGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNE 366
G + V K N L G CL+ S V G+ L+ YD +
Sbjct: 689 ----GGADLVLDKY--NMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSN 742
Query: 367 VLVFTYNKC 375
V+ F+ C
Sbjct: 743 VISFSPTNC 751
Score = 94.7 bits (234), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 159/377 (42%), Gaps = 53/377 (14%)
Query: 1 MTNNTIMEATTNAYPVID--AYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQ 58
++ N + A+ A + D Y L +GTP + +D GS + W QC PC CY
Sbjct: 61 LSKNQLQGASPYADTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQF 120
Query: 59 RPLFITRASSTYKELGCYSDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDT 118
P+F SST+ E C+ G C Y + + + S G++ T+T
Sbjct: 121 DPIFDPSKSSTFNEQRCH------------------GKSCHYEIIYEDNTYSKGILATET 162
Query: 119 LIFEHSNAE---IKNFIMGCG----DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAK---A 168
+ ++ E + +GCG D F + SG+ GL GP S+ SQ++
Sbjct: 163 VTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGL 222
Query: 169 FSFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGF 228
S+C G+ S I F T + + +V + +P+YY L +S+
Sbjct: 223 ISYCFSGQGT-----SKINF--GTNAIVAGDGTVAADMFIKKDNPFYY-LNLDAVSVE-- 272
Query: 229 MLDIQSKVWGYGLNY-DGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLE 287
D + + G + DG I+ID G+ +TY P ++ R V +V V+ P G +
Sbjct: 273 --DNRIETLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVEQV-VTAVRVPDPSGND 329
Query: 288 -FCYKDDPSNVYPTIEFYFENG-NIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSA 345
CY + +++P I +F G ++ + + Y +N+ LF CL+ +
Sbjct: 330 MLCYFSETIDIFPVITMHFSGGADLVLDKYNMYMESNSGGLF-------CLAIICNSPTQ 382
Query: 346 LTVIGSNQLQGTLLTYD 362
+ G+ L+ YD
Sbjct: 383 EAIFGNRAQNNFLVGYD 399
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 106 bits (265), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 156/378 (41%), Gaps = 45/378 (11%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y T + +GTP + + VD GS ++W C + +R +F S ++K +GC + T
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRR-VFRADESKSFKTVGCLTQT 164
Query: 80 CLIPMMRDQVFGNC--TGWKCRYNVRSGNESRSFGVMVTDTLIFEHSN---AEIKNFIMG 134
C + +M C C Y+ R + S + GV +T+ +N A + ++G
Sbjct: 165 CKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIG 224
Query: 135 CGDSYKGPFMTQFSGVFGLGRGPLSVQS---QLNAKAFSFCPVRLGS----------GSD 181
C S+ G GV GL S S L FS+C V S GS
Sbjct: 225 CSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSS 284
Query: 182 QPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGL 241
+ + F T PL P+Y + +GIS+ MLDI S+VW
Sbjct: 285 RSTKTAFRRTT------------PLDLTRIPPFYA-INVIGISLGYDMLDIPSQVW--DA 329
Query: 242 NYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVK-KPGYDGLEFCYKDDPS-NV-- 297
GG I+D GT+LT L AY + + R +L + KP +E+C+ NV
Sbjct: 330 TSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSK 389
Query: 298 YPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGT 357
P + F+ + G ++ + L A G CL F + A VIG+ Q
Sbjct: 390 LPQLTFHLKGG-------ARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNY 442
Query: 358 LLTYDLVNEVLVFTYNKC 375
L +DL+ L F + C
Sbjct: 443 LWEFDLMASTLSFAPSAC 460
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 106 bits (264), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/388 (26%), Positives = 161/388 (41%), Gaps = 58/388 (14%)
Query: 17 IDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFI------TRASSTY 70
I Y T + +G+P + +++VD GS I W CAPC C P++ L I ++ SST
Sbjct: 75 IGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVKTDLGIPLSLYDSKTSSTS 133
Query: 71 KELGCYSDTCLIPMMRDQVFGNCTGWK-CRYNVRSGNESRSFGVMVTDTLIFEHSNAEIK 129
K +GC D C M + C K C Y+V G+ S S G + D + E ++
Sbjct: 134 KNVGCEDDFCSFIMQSE----TCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLR 189
Query: 130 ------NFIMGCGDSYKGPFM---TQFSGVFGLGRGPLSVQSQLNA-----KAFSFCPVR 175
+ GCG + G + G+ G G+ S+ SQL A + FS C
Sbjct: 190 TAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDN 249
Query: 176 L-GSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQS 234
+ G G +E P ++ P+ N H Y + G+ ++G +D+
Sbjct: 250 MNGGGIFAVGEVES-----PVVK-----TTPIVPNQVH---YNVILKGMDVDGDPIDLPP 296
Query: 235 KVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDP 294
+ N DGG IID GT L YLP + Y+ ++ + F + +
Sbjct: 297 SL--ASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFACFSFTSNT 354
Query: 295 SNVYPTIEFYFENGNIAGENFVSYKLN--NNQTLFQAEEGTICLSFAEG-----KSSALT 347
+P + +FE+ S KL+ + LF E C + G + +
Sbjct: 355 DKAFPVVNLHFED---------SLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVI 405
Query: 348 VIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
++G L L+ YDL NEV+ + + C
Sbjct: 406 LLGDLVLSNKLVVYDLENEVIGWADHNC 433
>AT5G43100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:17299264-17302718 FORWARD LENGTH=631
Length = 631
Score = 103 bits (257), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 156/387 (40%), Gaps = 76/387 (19%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y T L IGTP Q L VD GS +++ C+ C C Q P F S++Y+ L C D
Sbjct: 76 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCNPDC 135
Query: 80 CLIPMMRDQVFGNC--TGWKCRYNVRSGNESRSFGVMVTDTLIF-EHSNAEIKNFIMGCG 136
NC G C Y R S S GV+ D + F S + + GC
Sbjct: 136 ------------NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCE 183
Query: 137 DSYKGPFMTQFS-GVFGLGRGPLSVQSQLNAKA-----FSFC---------PVRLGSGSD 181
+ G +Q + G+ GLGRG LSV QL K FS C + LG S
Sbjct: 184 NEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISP 243
Query: 182 QPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGL 241
P + + H+ PF YY + + + G L + KV+
Sbjct: 244 PPGMV--FSHSDPF----------------RSPYYNIDLKQMHVAGKSLKLNPKVF---- 281
Query: 242 NYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVK----KPGYDGLEF--CYKD--D 293
N G ++D GT Y P +A+ + V + L + P YD + F +D +
Sbjct: 282 NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAE 341
Query: 294 PSNVYPTIEFYFENGN---IAGENFVSYKLNNNQTLFQAEE--GTICLSFAEGKSSALTV 348
N +P I F NG ++ EN+ LF+ + G CL + S T+
Sbjct: 342 IHNFFPEIAMEFGNGQKLILSPENY----------LFRHTKVRGAYCLGIFPDRDST-TL 390
Query: 349 IGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+G ++ TL+TYD N+ L F C
Sbjct: 391 LGGIVVRNTLVTYDRENDKLGFLKTNC 417
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 103 bits (256), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 164/385 (42%), Gaps = 70/385 (18%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y T L IGTP Q+ L VD GS +++ C+ C C Q P F SSTY+ + C D
Sbjct: 93 YTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMD- 151
Query: 80 CLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIF-EHSNAEIKNFIMGCGDS 138
C R+Q C Y S S GV+ D + F S + + GC
Sbjct: 152 CNCDDDREQ---------CVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 202
Query: 139 YKGPFMTQFS-GVFGLGRGPLSVQSQLNAK-----AFSFC--PVRLGSGS------DQPS 184
G +Q + G+ GLG+G LS+ QL K +F C + +G GS D PS
Sbjct: 203 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPS 262
Query: 185 SIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYD 244
+ F D + D PYY + GI + G L + S+V +D
Sbjct: 263 DMVFTD----------------SDPDRSPYYN-IDLTGIRVAGKQLSLHSRV------FD 299
Query: 245 G--GIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDG--LEFCYKDDPSN---- 296
G G ++D GT YLP A++ F V R L + G D + C++ SN
Sbjct: 300 GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSE 359
Query: 297 ---VYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEE--GTICLS-FAEGKSSALTVIG 350
++P++E F++G S+ L+ +F+ + G CL F GK T++G
Sbjct: 360 LSKIFPSVEMVFKSGQ-------SWLLSPENYMFRHSKVHGAYCLGVFPNGKDHT-TLLG 411
Query: 351 SNQLQGTLLTYDLVNEVLVFTYNKC 375
++ TL+ YD N + F C
Sbjct: 412 GIVVRNTLVVYDRENSKVGFWRTNC 436
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 102 bits (255), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 117/422 (27%), Positives = 170/422 (40%), Gaps = 68/422 (16%)
Query: 1 MTNNTIMEATTNAYPV----IDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAP---CSS 53
+++ T AT P+ Y+ L GTP Q + D GS + W C CS
Sbjct: 67 LSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSG 126
Query: 54 CY-----PMQRPLFITRASSTYKELGCYSDTCLI---PMMR----DQVFGNCTGWKCRYN 101
C P P FI + SS+ K +GC S C P ++ D NCT Y
Sbjct: 127 CDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYI 186
Query: 102 VRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYKGPFMTQFSGVFGLGRGPLSVQ 161
++ G S + GV++T+ L F + + +F++GC Q +G+ G GRGP+S+
Sbjct: 187 LQYGLGSTA-GVLITEKLDFP--DLTVPDFVVGCSIIS----TRQPAGIAGFGRGPVSLP 239
Query: 162 SQLNAKAFSFCPVR---------------LGSGSDQPSSIEFYDHTLPFIEDNNSVMVPL 206
SQ+N K FS C V GSG + S +T PF ++ N
Sbjct: 240 SQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYT-PFRKNPNV----- 293
Query: 207 KENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVF 266
N A YY+L I + + I K G N DGG I+D G+ T++ + +
Sbjct: 294 -SNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELV 352
Query: 267 RSEVRR------VDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENGNIAGENFVSYKL 320
E + DL K+ GL C+ E FE G + L
Sbjct: 353 AEEFASQMSNYTREKDLEKE---TGLGPCFNISGKGDVTVPELIFE---FKGGAKLELPL 406
Query: 321 NNNQTLFQAEEGTICLSFA-------EGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTYN 373
+N T F T+CL+ G + ++GS Q Q L+ YDL N+ F
Sbjct: 407 SNYFT-FVGNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKK 465
Query: 374 KC 375
KC
Sbjct: 466 KC 467
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 102 bits (254), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 152/369 (41%), Gaps = 25/369 (6%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTC-LI 82
+L+GTP + L +D GS ++W QC PC C+ + + S+++K + C C LI
Sbjct: 164 VLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPRCSLI 223
Query: 83 PMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAE-------IKNFIMGC 135
V C Y G+ S + G +T + E + N + GC
Sbjct: 224 SSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGC 283
Query: 136 GDSYKGPFMTQFSGVFGLGRGP-LSVQSQ-LNAKAFSFCPVRLGSGSDQPSSIEFYDHT- 192
G +G F + S Q Q L +FS+C V S ++ S + F +
Sbjct: 284 GHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKD 343
Query: 193 -LPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDI 251
L N + V KEN +YY +Q I + G LDI + W + DGG IID
Sbjct: 344 LLNHTNLNFTSFVNGKENSVETFYY-IQIKSILVGGKALDIPEETWNISSDGDGGTIIDS 402
Query: 252 GTNLTYLPSDAYSVFRSE-VRRVDHDLVKKPGYDGLEFCYK---DDPSNVY-PTIEFYFE 306
GT L+Y AY + +++ ++ + + L+ C+ + +N++ P + F
Sbjct: 403 GTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFV 462
Query: 307 NGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNE 366
+G + + + E +CL+ S ++IG+ Q Q + YD
Sbjct: 463 DGTV-------WNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRS 515
Query: 367 VLVFTYNKC 375
L FT KC
Sbjct: 516 RLGFTPTKC 524
>AT1G65240.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24230963-24233349 REVERSE LENGTH=475
Length = 475
Score = 102 bits (254), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 152/387 (39%), Gaps = 65/387 (16%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSC-----YPMQRPLFITRASSTYKELG 74
Y T + +G+P + ++VD GS I W C PC C + LF ASST K++G
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 75 CYSDTCLIPMMRDQVFGNCT-GWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIK---- 129
C D C D +C C Y++ +ES S G + D L E ++K
Sbjct: 134 CDDDFCSFISQSD----SCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPL 189
Query: 130 --NFIMGCGDSYKGPFM---TQFSGVFGLGRGPLSVQSQLNA-----KAFSFCPVRLGSG 179
+ GCG G + GV G G+ SV SQL A + FS C + G
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249
Query: 180 SDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGY 239
+ ++ P+ N H Y + +G+ ++G LD+ +
Sbjct: 250 G---------IFAVGVVDSPKVKTTPMVPNQMH---YNVMLMGMDVDGTSLDLPRSIV-- 295
Query: 240 GLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLE------FCYKDD 293
+GG I+D GT L Y P Y + L ++P + F + +
Sbjct: 296 ---RNGGTIVDSGTTLAYFPKVLYDSLIETI------LARQPVKLHIVEETFQCFSFSTN 346
Query: 294 PSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEG-----KSSALTV 348
+P + F FE+ V + + LF EE C + G + S + +
Sbjct: 347 VDEAFPPVSFEFEDS-------VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVIL 399
Query: 349 IGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+G L L+ YDL NEV+ + + C
Sbjct: 400 LGDLVLSNKLVVYDLDNEVIGWADHNC 426
>AT2G35615.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:14959391-14960734 FORWARD LENGTH=447
Length = 447
Score = 102 bits (254), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 164/373 (43%), Gaps = 39/373 (10%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIP 83
+ IGTP VF D GS ++W QC PC CY P+F + SSTYK C S C
Sbjct: 89 ITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQ-A 147
Query: 84 MMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNF---IMGCGDSYK 140
+ + + + C+Y G++S S G + T+T+ + ++ +F + GCG +
Sbjct: 148 LSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNG 207
Query: 141 GPFMTQFSGVFGLGRGPLSVQSQLN---AKAFSFCPVRLGSGSDQPSSIEFYDHTLP--F 195
G F SG+ GLG G LS+ SQL +K FS+C + ++ S I +++P
Sbjct: 208 GTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIPSSL 267
Query: 196 IEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYD----------- 244
+D+ V PL + + YYY L IS+ + K+ G +Y+
Sbjct: 268 SKDSGVVSTPLVDKEPLTYYY-LTLEAISVG------KKKIPYTGSSYNPNDDGILSETS 320
Query: 245 GGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDG-LEFCYKDDPSNV-YPTIE 302
G IIID GT LT L + + F S V + G L C+K + + P I
Sbjct: 321 GNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIGLPEIT 380
Query: 303 FYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYD 362
+F ++ +L+ + E +CLS ++ + + G+ L+ YD
Sbjct: 381 VHFTGADV--------RLSPINAFVKLSEDMVCLSMV--PTTEVAIYGNFAQMDFLVGYD 430
Query: 363 LVNEVLVFTYNKC 375
L + F + C
Sbjct: 431 LETRTVSFQHMDC 443
>AT2G28030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11934208-11935386 REVERSE LENGTH=392
Length = 392
Score = 102 bits (253), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 147/366 (40%), Gaps = 49/366 (13%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y L +GTP + +D GS + W QC PC++CY P+F SST+KE
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKE------- 113
Query: 80 CLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAE---IKNFIMGCG 136
C G C Y + + + S G + T+T+ ++ E + +GCG
Sbjct: 114 -----------KRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCG 162
Query: 137 DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAK---AFSFCPVRLGSGSDQPSSIEFYDHTL 193
+ F FSG+ GL GP S+ +Q+ + S+C G+ S I F T
Sbjct: 163 HN-SSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGT-----SKINF--GTN 214
Query: 194 PFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLN-YDGGIIIDIG 252
+ + V + A P Y+L +S+ D + G + +G IIID G
Sbjct: 215 AIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVG----DTHVETMGTTFHALEGNIIIDSG 270
Query: 253 TNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGL---EFCYKDDPSNVYPTIEFYFENGN 309
T LTY P ++ R VDH + D CY D +++P I +F
Sbjct: 271 TTLTYFPVSYCNLVR---EAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFS--- 324
Query: 310 IAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLV 369
G + V K N + GT CL+ + G+ L+ YD + ++
Sbjct: 325 -GGADLVLDKY--NMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVS 381
Query: 370 FTYNKC 375
F+ C
Sbjct: 382 FSPTNC 387
>AT2G28040.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:11936203-11937390 REVERSE LENGTH=395
Length = 395
Score = 101 bits (252), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 157/363 (43%), Gaps = 44/363 (12%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y L IGTP + +D GS W QC PC CY P+F SST+KE+ C DT
Sbjct: 65 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRC--DT 122
Query: 80 CLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAE---IKNFIMGCG 136
D C Y + G +S + G +VT+T+ ++ + + I+GCG
Sbjct: 123 ------HDH--------SCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCG 168
Query: 137 DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAK---AFSFCPVRLGSGSDQPSSIEFYDHTL 193
+ G F F+GV GL RGP S+ +Q+ + S+C G+ S I F + +
Sbjct: 169 RNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGT-----SKINFGANAI 222
Query: 194 PFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGT 253
+ + V + A P +Y+L +S+ ++ + + L G I+ID G+
Sbjct: 223 --VAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPF-HALK--GNIVIDSGS 277
Query: 254 NLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENG-NIAG 312
LTY P ++ R V +V V+ P D L CY +++P I +F G ++
Sbjct: 278 TLTYFPESYCNLVRKAVEQVV-TAVRFPRSDIL--CYYSKTIDIFPVITMHFSGGADLVL 334
Query: 313 ENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTY 372
+ + Y +N +F CL+ + G+ L+ YD + ++ F
Sbjct: 335 DKYNMYVASNTGGVF-------CLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKP 387
Query: 373 NKC 375
C
Sbjct: 388 TNC 390
>AT3G02740.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:590561-593089 FORWARD LENGTH=488
Length = 488
Score = 100 bits (249), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 156/383 (40%), Gaps = 49/383 (12%)
Query: 17 IDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPL-----FITRASSTYK 71
I Y + +GTP + ++VD GS I W CA C C P + L + ASST K
Sbjct: 82 IGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-PRKSDLVELTPYDVDASSTAK 140
Query: 72 ELGCYSDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFE------HSN 125
+ C + C R + +G C+Y + G+ S + G +V D + + +
Sbjct: 141 SVSCSDNFCSYVNQRSECH---SGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTG 197
Query: 126 AEIKNFIMGCGDSYKGPF---MTQFSGVFGLGRGPLSVQSQLNA-----KAFSFCPVRLG 177
+ I GCG G G+ G G+ S SQL + ++F+ C
Sbjct: 198 STNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN 257
Query: 178 SGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVW 237
G + P ++ P+ AH Y + I + +L++ S +
Sbjct: 258 GGG----IFAIGEVVSPKVK-----TTPMLSKSAH---YSVNLNAIEVGNSVLELSSNAF 305
Query: 238 GYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNV 297
G D G+IID GT L YLP Y+ +E+ +L + + D +
Sbjct: 306 DSG--DDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFTCFHYTDKLDR 363
Query: 298 YPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEG-----KSSALTVIGSN 352
+PT+ F F+ VS + + LFQ E T C + G ++LT++G
Sbjct: 364 FPTVTFQFDKS-------VSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDM 416
Query: 353 QLQGTLLTYDLVNEVLVFTYNKC 375
L L+ YD+ N+V+ +T + C
Sbjct: 417 ALSNKLVVYDIENQVIGWTNHNC 439
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 100 bits (249), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 148/370 (40%), Gaps = 33/370 (8%)
Query: 17 IDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCY 76
I Y +GTP Q++F+ +D + W C+ CS C F T +SSTY + C
Sbjct: 101 IGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNAST-SFNTNSSSTYSTVSCS 159
Query: 77 SDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCG 136
+ C + C +N G +S +V DTL I NF GC
Sbjct: 160 TAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPD--VIPNFSFGCI 217
Query: 137 DSYKGPFMTQFSGVFGLGRGPLSVQSQ---LNAKAFSFCPVRLGSGSDQPSSIEFY---D 190
+S G + G+ GLGRGP+S+ SQ L + FS+C PS FY
Sbjct: 218 NSASGNSLPP-QGLMGLGRGPMSLVSQTTSLYSGVFSYC---------LPSFRSFYFSGS 267
Query: 191 HTLPFIEDNNSVM-VPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIII 249
L + S+ PL N P Y++ G+S+ + + + N G II
Sbjct: 268 LKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTII 327
Query: 250 DIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENGN 309
D GT +T Y R E R+ ++ + C+ D NV P I + +
Sbjct: 328 DSGTVITRFAQPVYEAIRDEFRK-QVNVSSFSTLGAFDTCFSADNENVAPKITLHMTS-- 384
Query: 310 IAGENFVSYKLNNNQTLFQAEEGTI-CLSFA---EGKSSALTVIGSNQLQGTLLTYDLVN 365
+ KL TL + GT+ CLS A + ++ L VI + Q Q + +D+ N
Sbjct: 385 ------LDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPN 438
Query: 366 EVLVFTYNKC 375
+ C
Sbjct: 439 SRIGIAPEPC 448
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 99.0 bits (245), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 149/369 (40%), Gaps = 57/369 (15%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIPMM 85
IGTP Q + + +D + +W C+ C C LF SS+ + L C + C
Sbjct: 94 IGTPAQPMLVALDTSNDAAWIPCSGCVGCS--SSVLFDPSKSSSSRTLQCEAPQC----- 146
Query: 86 RDQVFGNCTGWK-CRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYKGPFM 144
+ +CT K C +N+ G + + DTL ++ I N+ GC + G +
Sbjct: 147 KQAPNPSCTVSKSCGFNMTYGGSTIE-AYLTQDTLTL--ASDVIPNYTFGCINKASGTSL 203
Query: 145 TQFSGVFGLGRGPLSVQSQ---LNAKAFSFC-----------PVRLGSGSDQPSSIEFYD 190
G+ GLGRGPLS+ SQ L FS+C +RLG +QP I+
Sbjct: 204 PA-QGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGP-KNQPIRIK--- 258
Query: 191 HTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIID 250
PL +N Y++ VGI + ++DI + + G I D
Sbjct: 259 ------------TTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFD 306
Query: 251 IGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENGNI 310
GT T L AY R+E RR + G + CY S V+P++ F F N+
Sbjct: 307 SGTVYTRLVEPAYVAVRNEFRRRVKN-ANATSLGGFDTCYSG--SVVFPSVTFMFAGMNV 363
Query: 311 AGENFVSYKLNNNQTLFQAEEGTI-CLSFAEGK---SSALTVIGSNQLQGTLLTYDLVNE 366
L + L + G + CL+ A +S L VI S Q Q + D+ N
Sbjct: 364 T--------LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNS 415
Query: 367 VLVFTYNKC 375
L + C
Sbjct: 416 RLGISRETC 424
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 95.5 bits (236), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 148/365 (40%), Gaps = 19/365 (5%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTC-LI 82
+L+G+P + L +D GS ++W QC PC C+ + +AS++YK + C C L+
Sbjct: 174 VLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQRCNLV 233
Query: 83 PMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNA-------EIKNFIMGC 135
+ C Y G+ S + G +T + ++N + GC
Sbjct: 234 SSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGC 293
Query: 136 GDSYKGPFMTQFSGVFGLGRGPLSVQSQLNA---KAFSFCPVRLGSGSDQPSSIEFYDHT 192
G +G F + LS SQL + +FS+C V S ++ S + F +
Sbjct: 294 GHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDK 352
Query: 193 --LPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIID 250
L N + V KEN +YY +Q I + G +L+I + W + GG IID
Sbjct: 353 DLLSHPNLNFTSFVAGKENLVDTFYY-VQIKSILVAGEVLNIPEETWNISSDGAGGTIID 411
Query: 251 IGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENGNI 310
GT L+Y AY ++++ + K P Y +F D NV I
Sbjct: 412 SGTTLSYFAEPAYEFIKNKI--AEKAKGKYPVY--RDFPILDPCFNVSGIHNVQLPELGI 467
Query: 311 AGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVF 370
A + + + E +CL+ SA ++IG+ Q Q + YD L +
Sbjct: 468 AFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGY 527
Query: 371 TYNKC 375
KC
Sbjct: 528 APTKC 532
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 93.2 bits (230), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 153/381 (40%), Gaps = 38/381 (9%)
Query: 16 VIDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSC-----YPMQRPLFITRASSTY 70
V+ Y T L +GTP + +++VD GS + W CA C+ C +Q F +S T
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 71 KELGCYSDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEH--SNAEI 128
+ C C + + C Y + G+ S + G V+D L F+ ++ +
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV 196
Query: 129 KN----FIMGCGDSYKGPFMTQ---FSGVFGLGRGPLSVQSQLNA-----KAFSFCPVRL 176
N + GC S G + G+FG G+ +SV SQL + + FS C
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL--- 253
Query: 177 GSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKV 236
G + I L I + N V PL + H Y + + IS+NG L I V
Sbjct: 254 -KGENGGGGI----LVLGEIVEPNMVFTPLVPSQPH---YNVNLLSISVNGQALPINPSV 305
Query: 237 WGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPS- 295
+ + G IID GT L YL AY F + V+ G + CY S
Sbjct: 306 --FSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-CYVITTSV 362
Query: 296 -NVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQL 354
+++P + F G N Y + N C+ F ++ +T++G L
Sbjct: 363 GDIFPPVSLNFAGGASMFLNPQDYLIQQNNV---GGTAVWCIGFQRIQNQGITILGDLVL 419
Query: 355 QGTLLTYDLVNEVLVFTYNKC 375
+ + YDLV + + + C
Sbjct: 420 KDKIFVYDLVGQRIGWANYDC 440
>AT5G10760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3400671-3402165 REVERSE LENGTH=464
Length = 464
Score = 90.5 bits (223), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 152/387 (39%), Gaps = 46/387 (11%)
Query: 4 NTIMEATTNAYPVIDA-------YATFLLIGTPVQIVFLRVDIGSPISWFQCAPC-SSCY 55
N + EA + P Y + IGTP + L D GS ++W QC PC SCY
Sbjct: 109 NEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCY 168
Query: 56 PMQRPLFITRASSTYKELGCYSDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMV 115
+ P F +SSTY+ + C S PM D +C+ C Y++ G++S + G +
Sbjct: 169 SQKEPKFNPSSSSTYQNVSCSS-----PMCEDA--ESCSASNCVYSIVYGDKSFTQGFLA 221
Query: 116 TDTLIFEHSNAEIKNFIMGCGDSYKGPF---MTQFSGVFGLGRGPLSVQSQLNAKAFSFC 172
+ +S+ +++ GCG++ +G F G P + N FS+C
Sbjct: 222 KEKFTLTNSDV-LEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYN-NIFSYC 279
Query: 173 PVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHP--YYYFLQFVGISINGFML 230
PS L F S V + P + Y + +GIS+ L
Sbjct: 280 ---------LPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKEL 330
Query: 231 DIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCY 290
I + G IID GT T LP+ Y+ RS + GY + CY
Sbjct: 331 AITPNSFS-----TEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCY 385
Query: 291 K--DDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTV 348
+ YPTI F F AG V +L+ + + +CL+FA G +
Sbjct: 386 DFTGLDTVTYPTIAFSF-----AGSTVV--ELDGSGISLPIKISQVCLAFA-GNDDLPAI 437
Query: 349 IGSNQLQGTLLTYDLVNEVLVFTYNKC 375
G+ Q + YD+ + F N C
Sbjct: 438 FGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>AT1G49050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18150638-18153186 FORWARD LENGTH=583
Length = 583
Score = 90.1 bits (222), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/395 (23%), Positives = 168/395 (42%), Gaps = 33/395 (8%)
Query: 2 TNNTIMEATTNAYPVIDAYATFLLIGTPV--QIVFLRVDIGSPISWFQC-APCSSCYPMQ 58
++ TI N YP Y T +L+G P Q L +D GS ++W QC APC+SC
Sbjct: 186 SSTTIFPVGGNVYPD-GLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGA 244
Query: 59 RPLFITRASSTYKELGCYSDTCLIPMMRDQVFGNCTG-WKCRYNVRSGNESRSFGVMVTD 117
L+ R + + S+ + + R+Q+ +C +C Y + + S S GV+ D
Sbjct: 245 NQLYKPRKDNLVRS----SEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKD 300
Query: 118 TLIFEHSNAEI--KNFIMGCGDSYKGPFMTQF---SGVFGLGRGPLSVQSQLNAKAFSFC 172
+ N + + + GCG +G + G+ GL R +S+ SQL ++
Sbjct: 301 KFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISN 360
Query: 173 PVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDI 232
V SD + + + + VP+ +D+ Y +Q +S ML +
Sbjct: 361 VVGHCLASDLNGEGYIFMGS-DLVPSHGMTWVPML-HDSRLDAYQMQVTKMSYGQGMLSL 418
Query: 233 QSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDH-DLVKKPGYDGLEFCYK 291
+ G ++ D G++ TY P+ AYS + ++ V +L + + L C++
Sbjct: 419 DGENGRVG-----KVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWR 473
Query: 292 DD---PSNVYPTIEFYFENGNIA-GENF--VSYKL--NNNQTLFQAEEGTICLSFAEGKS 343
P + ++ +F + G + +S KL L + +G +CL +G S
Sbjct: 474 AKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSS 533
Query: 344 ---SALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+ ++G ++G L+ YD V + + + C
Sbjct: 534 VHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 568
>AT1G49050.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:18151161-18153186 FORWARD LENGTH=410
Length = 410
Score = 89.4 bits (220), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 86/379 (22%), Positives = 161/379 (42%), Gaps = 36/379 (9%)
Query: 20 YATFLLIGTPV--QIVFLRVDIGSPISWFQC-APCSSCYPMQRPLFITRASSTYKELGCY 76
Y T +L+G P Q L +D GS ++W QC APC+SC L+ R + +
Sbjct: 30 YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRS---- 85
Query: 77 SDTCLIPMMRDQVFGNCTG-WKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEI--KNFIM 133
S+ + + R+Q+ +C +C Y + + S S GV+ D + N + + +
Sbjct: 86 SEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVF 145
Query: 134 GCGDSYKGPFMTQF---SGVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYD 190
GCG +G + G+ GL R +S+ SQL ++ V SD +
Sbjct: 146 GCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFM 205
Query: 191 HTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIID 250
+ + + VP+ +D+ Y +Q +S ML + + G ++ D
Sbjct: 206 GS-DLVPSHGMTWVPML-HDSRLDAYQMQVTKMSYGQGMLSLDGEN-----GRVGKVLFD 258
Query: 251 IGTNLTYLPSDAYSVFRSEVRRVDH-DLVKKPGYDGLEFCYKDD---PSNVYPTIEFYFE 306
G++ TY P+ AYS + ++ V +L + + L C++ P + ++ +F
Sbjct: 259 TGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFR 318
Query: 307 NGNIAGENFVSYKLNNNQTLFQAEE-------GTICLSFAEGKS---SALTVIGSNQLQG 356
I + + + + + L Q E+ G +CL +G S + ++G ++G
Sbjct: 319 --PITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRG 376
Query: 357 TLLTYDLVNEVLVFTYNKC 375
L+ YD V + + + C
Sbjct: 377 HLIVYDNVKRRIGWMKSDC 395
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 87.8 bits (216), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 150/360 (41%), Gaps = 36/360 (10%)
Query: 29 PVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIPMMRDQ 88
P Q + + +D GS +SW +C S+ P+ F SS+Y + C S TC
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNN--FDPTRSSSYSPIPCSSPTCRTRTRDFL 139
Query: 89 VFGNCTGWK-CRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYKGPFM--- 144
+ +C K C + + S S G + + F +S + N I GC S G
Sbjct: 140 IPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTND-SNLIFGCMGSVSGSDPEED 198
Query: 145 TQFSGVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMV 204
T+ +G+ G+ RG LS SQ+ FS+C + D P + D ++ N +
Sbjct: 199 TKTTGLLGMNRGSLSFISQMGFPKFSYC---ISGTDDFPGFLLLGDSNFTWLTPLNYTPL 255
Query: 205 PLKENDAHPYY----YFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPS 260
++ + PY+ Y +Q GI +NG +L I V G ++D GT T+L
Sbjct: 256 -IRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLG 314
Query: 261 DAYSVFRSE-VRRVDHDLV--KKPGY---DGLEFCYKDDPSNV-------YPTIEFYFEN 307
Y+ RS + R + L + P + ++ CY+ P + PT+ FE
Sbjct: 315 PVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEG 374
Query: 308 GNIA--GENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALT--VIGSNQLQGTLLTYDL 363
IA G+ + Y++ + L + C +F + VIG + Q + +DL
Sbjct: 375 AEIAVSGQPLL-YRVPH---LTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 430
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 85.9 bits (211), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 166/397 (41%), Gaps = 55/397 (13%)
Query: 7 MEATTNAYPVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPL----- 61
++ +++ Y ++ Y T + +G+P +++D GS I W C+ CS+C P L
Sbjct: 88 VQGSSDPY-LVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNC-PHSSGLGIDLH 145
Query: 62 -FITRASSTYKELGCYSDTCLIPMMRDQVFGNCT-GWKCRYNVRSGNESRSFGVMVTDTL 119
F S T + C C + C+ +C Y+ R G+ S + G +TDT
Sbjct: 146 FFDAPGSLTAGSVTCSDPIC--SSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTF 203
Query: 120 IFEHSNAE--IKN----FIMGCGDSYKGPFMTQ----FSGVFGLGRGPLSVQSQLNAKA- 168
F+ E + N + GC +Y+ +T+ G+FG G+G LSV SQL+++
Sbjct: 204 YFDAILGESLVANSSAPIVFGC-STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGI 262
Query: 169 ----FSFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGIS 224
FS C GSG L I V PL + H Y L + I
Sbjct: 263 TPPVFSHCLKGDGSGGGV--------FVLGEILVPGMVYSPLVPSQPH---YNLNLLSIG 311
Query: 225 INGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYD 284
+NG ML + + V + + G I+D GT LTYL +AY +F + + LV +
Sbjct: 312 VNGQMLPLDAAV--FEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISN 369
Query: 285 GLEFCY--KDDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQ----AEEGTICLSF 338
G E CY S+++P++ F G S L LF C+ F
Sbjct: 370 G-EQCYLVSTSISDMFPSVSLNFAGG-------ASMMLRPQDYLFHYGIYDGASMWCIGF 421
Query: 339 AEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+ T++G L+ + YDL + + + C
Sbjct: 422 QKAPEEQ-TILGDLVLKDKVFVYDLARQRIGWASYDC 457
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 85.1 bits (209), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 158/384 (41%), Gaps = 54/384 (14%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPL------FITRASSTYKEL 73
Y T + +G+P +++D GS I W C+ CS+C P L F S T +
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNC-PHSSGLGIDLHFFDAPGSLTAGSV 163
Query: 74 GCYSDTCLIPMMRDQVFGNCT-GWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAE--IKN 130
C C + C+ +C Y+ R G+ S + G +TDT F+ E + N
Sbjct: 164 TCSDPIC--SSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 221
Query: 131 ----FIMGCGDSYKGPFMTQ----FSGVFGLGRGPLSVQSQLNAKA-----FSFCPVRLG 177
+ GC +Y+ +T+ G+FG G+G LSV SQL+++ FS C G
Sbjct: 222 SSAPIVFGC-STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 280
Query: 178 SGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVW 237
SG L I V PL + H Y L + I +NG ML + + V
Sbjct: 281 SGGGV--------FVLGEILVPGMVYSPLVPSQPH---YNLNLLSIGVNGQMLPLDAAV- 328
Query: 238 GYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCY--KDDPS 295
+ + G I+D GT LTYL +AY +F + + LV +G E CY S
Sbjct: 329 -FEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG-EQCYLVSTSIS 386
Query: 296 NVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQ----AEEGTICLSFAEGKSSALTVIGS 351
+++P++ F G S L LF C+ F + T++G
Sbjct: 387 DMFPSVSLNFAGG-------ASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ-TILGD 438
Query: 352 NQLQGTLLTYDLVNEVLVFTYNKC 375
L+ + YDL + + + C
Sbjct: 439 LVLKDKVFVYDLARQRIGWASYDC 462
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 83.2 bits (204), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/387 (24%), Positives = 154/387 (39%), Gaps = 53/387 (13%)
Query: 16 VIDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRA--------- 66
++ Y T + +GTP + +++D GS + W C C+ C P L I +
Sbjct: 80 LVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGC-PKTSELQIQLSFFDPGVSSS 138
Query: 67 --SSTYKELGCYSDTCLIPMMRDQVFGNCTGWK-CRYNVRSGNESRSFGVMVTDTLIFE- 122
+ + CYS+ Q C+ C Y+ + G+ S + G ++D + F+
Sbjct: 139 ASLVSCSDRRCYSNF--------QTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDT 190
Query: 123 --HSNAEIKN---FIMGCGDSYKGPFM---TQFSGVFGLGRGPLSVQSQLNA-----KAF 169
S I + F+ GC + G G+FGLG+G LSV SQL + F
Sbjct: 191 VITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVF 250
Query: 170 SFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFM 229
S C SG L I+ ++V PL + H Y + I++NG +
Sbjct: 251 SHCLKGDKSGGGI--------MVLGQIKRPDTVYTPLVPSQPH---YNVNLQSIAVNGQI 299
Query: 230 LDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLE-F 288
L I V + + G IID GT L YLP +AYS F V + Y+ + F
Sbjct: 300 LPIDPSV--FTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQCF 357
Query: 289 CYKDDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTV 348
+V+P + F G + V Q + C+ F +T+
Sbjct: 358 EITAGDVDVFPQVSLSFA----GGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITI 413
Query: 349 IGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+G L+ ++ YDLV + + + C
Sbjct: 414 LGDLVLKDKVVVYDLVRQRIGWAEYDC 440
>AT5G45120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:18241003-18242478 FORWARD LENGTH=491
Length = 491
Score = 81.6 bits (200), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 103/413 (24%), Positives = 157/413 (38%), Gaps = 66/413 (15%)
Query: 16 VIDAYATFLLIGTPVQIVFLRVDIGSPISWFQCA----PCSSCYPMQR------PLFITR 65
V D Y L IGTP Q V + +D GS ++W C C CY ++ +F
Sbjct: 79 VRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPL 138
Query: 66 ASSTYKELGCYSDTCLIPMMRDQVFGNCTGWKCRYN-------VRS--------GNESRS 110
SST C S C+ D F C C + VR G
Sbjct: 139 HSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLI 198
Query: 111 FGVMVTDTLIFEHSNAEIKNFIMGCGDS-YKGPFMTQFSGVFGLGRGPLSVQSQLN--AK 167
G++ D I + ++ F GC S Y+ P G+ G GRG LS+ SQL K
Sbjct: 199 SGILTRD--ILKARTRDVPRFSFGCVTSTYREPI-----GIAGFGRGLLSLPSQLGFLEK 251
Query: 168 AFSFC--PVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHP--YYYFLQFVGI 223
FS C P + + + S + L ++ P+ +P YY L+ + I
Sbjct: 252 GFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLESITI 311
Query: 224 SINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDH--DLVKKP 281
N + + + +GG+++D GT T+LP YS + ++ +
Sbjct: 312 GTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITYPRATETE 371
Query: 282 GYDGLEFCYK------------DDPSNVYPTIEFYFENGNI----AGENFVSYKLNNNQT 325
G + CYK +D ++P+I F+F N G +F + ++ +
Sbjct: 372 SRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGS 431
Query: 326 LFQAEEGTICLSF---AEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+ Q CL F +G V GS Q Q + YDL E + F C
Sbjct: 432 VVQ------CLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478
>AT2G23945.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:10185229-10186605 REVERSE LENGTH=458
Length = 458
Score = 80.5 bits (197), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/307 (26%), Positives = 124/307 (40%), Gaps = 53/307 (17%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSCYP--MQRPLFITRASSTYKELGCYSDTCLIP 83
+G P +D GS + W QC PC C M P+F SST+ E C C
Sbjct: 102 VGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFC--- 158
Query: 84 MMRDQVFGNC-TGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAE---IKNFIMGCGDSY 139
R G+C + KC Y + + S GV+ + L F N + GCG
Sbjct: 159 --RYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYEN 216
Query: 140 KGPFMTQFSGVFGLGRGPLSVQSQLNAKAFSFC------------PVRLGSGSD---QPS 184
+ F+G+ GLG P S+ QL +K FS+C + LG +D P+
Sbjct: 217 GEQLESHFTGILGLGAKPTSLAVQLGSK-FSYCIGDLANKNYGYNQLVLGEDADILGDPT 275
Query: 185 SIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYD 244
IEF E NS+ Y++ GIS+ L+I+ V+
Sbjct: 276 PIEF--------ETENSI-------------YYMNLEGISVGDTQLNIEPVVFKR-RGPR 313
Query: 245 GGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNV---YPTI 301
G+I+D GT T+L AY +E++ + +++ + CY S +P +
Sbjct: 314 TGVILDSGTLYTWLADIAYRELYNEIKSILDPKLERFWFRDF-LCYHGRVSEELIGFPVV 372
Query: 302 EFYFENG 308
F+F G
Sbjct: 373 TFHFAGG 379
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 79.0 bits (193), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 134/375 (35%), Gaps = 54/375 (14%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDT 79
Y LIGTP Q + L +D S ++W C+ C C F S+++K + C +
Sbjct: 115 YIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCP--SNTAFSPAKSTSFKNVSCSAPQ 172
Query: 80 CLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSY 139
C + C C +N+ G+ S + + DT+ IK F GC +
Sbjct: 173 C-----KQVPNPTCGARACSFNLTYGSSSIAAN-LSQDTIRLAAD--PIKAFTFGCVNKV 224
Query: 140 KG----PFMTQFSGVFGLGRGPLSVQSQLNAKAFSFC-----------PVRLGSGSDQPS 184
G P G+ +S + FS+C +RLG S QP
Sbjct: 225 AGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTS-QPQ 283
Query: 185 SIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYD 244
+++ L N Y++ V I + ++D+ + +
Sbjct: 284 RVKYTQ---------------LLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTG 328
Query: 245 GGIIIDIGTNLTYLPSDAYSVFRSEVR-RVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEF 303
G I D GT T L Y R+E R RV G + CY PTI F
Sbjct: 329 AGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVK--VPTITF 386
Query: 304 YFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFA---EGKSSALTVIGSNQLQGTLLT 360
F+ N+ +N L T CL+ A E +S + VI S Q Q +
Sbjct: 387 MFKGVNMTMP-------ADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVL 439
Query: 361 YDLVNEVLVFTYNKC 375
D+ N L +C
Sbjct: 440 IDVPNGRLGLARERC 454
>AT1G44130.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:16787508-16789318 REVERSE LENGTH=405
Length = 405
Score = 76.6 bits (187), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/401 (24%), Positives = 157/401 (39%), Gaps = 62/401 (15%)
Query: 3 NNTIMEATTNAYPVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQC-APCSSCYPMQRPL 61
++ + + N +P + Y+ + IG+P + +D GS ++W QC APCS C
Sbjct: 33 SSVVFPLSGNVFP-LGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGC------- 84
Query: 62 FITRASSTYKELG----CYSDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTD 117
+ YK G C + C ++ +C Y V+ ++ S G +VTD
Sbjct: 85 -TLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALVTD 143
Query: 118 TLIFEHSNAEIKN--FIMGCG-----DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAKAFS 170
+ N GCG S P T +GV GLGRG + + +QL + +
Sbjct: 144 QFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPAT--AGVLGLGRGKIGLLTQLVSAGLT 201
Query: 171 FCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFML 230
V S + F D+ +P I PL D H Y + NG
Sbjct: 202 RNVVGHCLSSKGGGFLFFGDNLVPSI---GVAWTPLLSQDNH---YTTGPADLLFNGKPT 255
Query: 231 DIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGY-----DG 285
++ GL +I D G++ TY S AY ++ + + +DL P
Sbjct: 256 GLK------GLK----LIFDTGSSYTYFNSKAY---QTIINLIGNDLKVSPLKVAKEDKT 302
Query: 286 LEFCYKD--------DPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLS 337
L C+K + N + TI F NG + +++ +L L ++ G +CL
Sbjct: 303 LPICWKGAKPFKSVLEVKNFFKTITINFTNGRRNTQLYLAPEL----YLIVSKTGNVCLG 358
Query: 338 FAEGKSSAL---TVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
G L VIG +QG ++ YD + L + + C
Sbjct: 359 LLNGSEVGLQNSNVIGDISMQGLMMIYDNEKQQLGWVSSDC 399
>AT1G77480.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114946-29117150 REVERSE LENGTH=432
Length = 432
Score = 75.9 bits (185), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 89/406 (21%), Positives = 155/406 (38%), Gaps = 66/406 (16%)
Query: 1 MTNNTIMEATTNAYPVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQC-APCSSCYPMQR 59
+++ + + N YP + Y L IG P ++ L +D GS ++W QC APC+ C +
Sbjct: 49 LSSTVVFPVSGNVYP-LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRA 107
Query: 60 PLFITRASSTYKELGCYSDTCL-IPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDT 118
+ + L C C + + +D+ + +C Y + + + S G +VTD
Sbjct: 108 KQY----KPNHNTLPCSHILCSGLDLPQDRPCADPED-QCDYEIGYSDHASSIGALVTDE 162
Query: 119 LIFEHSNAEIKNFIM--GCG---DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAKAFS--- 170
+ + +N I N + GCG + +G+ GLGRG + + +QL + +
Sbjct: 163 VPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNV 222
Query: 171 --FCPVRLGSGSDQPSSIEFYDHTLP---FIEDNNSVMVPLKENDAHPYYYFLQFVGISI 225
C G G + D +P + + P K A P +
Sbjct: 223 IVHCLSHTGKG-----FLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGV 277
Query: 226 NGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYD- 284
G ++ D G++ TY ++AY +R+ DL KP D
Sbjct: 278 KGI-----------------NVVFDSGSSYTYFNAEAYQAILDLIRK---DLNGKPLTDT 317
Query: 285 ----GLEFCYK--------DDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEG 332
L C+K D+ + TI F N +N +++ L E+G
Sbjct: 318 KDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGN----QKNGQLFQVPPESYLIITEKG 373
Query: 333 TICLSFAEGKSSAL---TVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+CL G L +IG QG ++ YD + + + + C
Sbjct: 374 RVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDC 419
>AT1G77480.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29114705-29117150 REVERSE LENGTH=466
Length = 466
Score = 75.9 bits (185), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 89/406 (21%), Positives = 155/406 (38%), Gaps = 66/406 (16%)
Query: 1 MTNNTIMEATTNAYPVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQC-APCSSCYPMQR 59
+++ + + N YP + Y L IG P ++ L +D GS ++W QC APC+ C +
Sbjct: 49 LSSTVVFPVSGNVYP-LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRA 107
Query: 60 PLFITRASSTYKELGCYSDTCL-IPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDT 118
+ + L C C + + +D+ + +C Y + + + S G +VTD
Sbjct: 108 KQY----KPNHNTLPCSHILCSGLDLPQDRPCADPED-QCDYEIGYSDHASSIGALVTDE 162
Query: 119 LIFEHSNAEIKNFIM--GCG---DSYKGPFMTQFSGVFGLGRGPLSVQSQLNAKAFS--- 170
+ + +N I N + GCG + +G+ GLGRG + + +QL + +
Sbjct: 163 VPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNV 222
Query: 171 --FCPVRLGSGSDQPSSIEFYDHTLP---FIEDNNSVMVPLKENDAHPYYYFLQFVGISI 225
C G G + D +P + + P K A P +
Sbjct: 223 IVHCLSHTGKG-----FLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGV 277
Query: 226 NGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYD- 284
G ++ D G++ TY ++AY +R+ DL KP D
Sbjct: 278 KGI-----------------NVVFDSGSSYTYFNAEAYQAILDLIRK---DLNGKPLTDT 317
Query: 285 ----GLEFCYK--------DDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEG 332
L C+K D+ + TI F N +N +++ L E+G
Sbjct: 318 KDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGN----QKNGQLFQVPPESYLIITEKG 373
Query: 333 TICLSFAEGKSSAL---TVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+CL G L +IG QG ++ YD + + + + C
Sbjct: 374 RVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDC 419
>AT1G66180.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24647221-24648513 FORWARD LENGTH=430
Length = 430
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 148/375 (39%), Gaps = 47/375 (12%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIP 83
L IGTP Q + +D GS +SW QC P + F SS++ L C C
Sbjct: 76 LPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPR 134
Query: 84 MMRDQVFGNCTGWK-CRYNVRSGNESRSFGVMVTDTLIFEHSNAEI-KNFIMGCGDSYKG 141
+ + +C + C Y+ + + + G +V + + F SN EI I+GC
Sbjct: 135 IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITF--SNTEITPPLILGCATES-- 190
Query: 142 PFMTQFSGVFGLGRGPLSVQSQLNAKAFSFC-------PVRLGSGS----DQPSSIEF-Y 189
+ G+ G+ RG LS SQ FS+C P +GS D P+S F Y
Sbjct: 191 ---SDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKY 247
Query: 190 DHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIII 249
L F E + + P Y + +GI L+I V+ G ++
Sbjct: 248 VSLLTFPESQ-------RMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMV 300
Query: 250 DIGTNLTYLPSDAYSVFRSEV-RRVDHDLVKKPGYDG-LEFCYKDDPSNVYPTIE---FY 304
D G+ T+L AY R+E+ RV L K Y G + C+ + + + I F
Sbjct: 301 DSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFV 360
Query: 305 FENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSAL----TVIGSNQLQGTLLT 360
F G V + + L G C+ G+SS L +IG+ Q +
Sbjct: 361 FTRG-------VEILVPKERVLVNVGGGIHCVGI--GRSSMLGAASNIIGNVHQQNLWVE 411
Query: 361 YDLVNEVLVFTYNKC 375
+D+ N + F C
Sbjct: 412 FDVTNRRVGFAKADC 426
>AT1G05840.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:1762843-1766150 REVERSE LENGTH=485
Length = 485
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 145/377 (38%), Gaps = 51/377 (13%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSC-----YPMQRPLFITRASSTYKELGCYSDTC 80
IGTP + +++VD GS I W C C C ++ L+ S + K + C D C
Sbjct: 86 IGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFC 145
Query: 81 LIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIK------NFIMG 134
+ + G C Y G+ S + G V D + ++ ++K + I G
Sbjct: 146 Y-QISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFG 204
Query: 135 CGDSYKGPFMTQ----FSGVFGLGRGPLSVQSQLNA-----KAFSFC-PVRLGSGSDQPS 184
CG G + G+ G G+ S+ SQL + K F+ C R G G
Sbjct: 205 CGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGI---- 260
Query: 185 SIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYD 244
+ + M PL N H Y + + + L I + ++ G
Sbjct: 261 ------FAIGRVVQPKVNMTPLVPNQPH---YNVNMTAVQVGQEFLTIPADLFQPGDRK- 310
Query: 245 GGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDL-VKKPGYDGLEFCYKDDPSNVYPTIEF 303
G IID GT L YLP Y ++ + L V D F Y +P + F
Sbjct: 311 -GAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPNVTF 369
Query: 304 YFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSA-----LTVIGSNQLQGTL 358
+FEN V ++ + LF EG C+ + + +T++G L L
Sbjct: 370 HFENS-------VFLRVYPHDYLF-PHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKL 421
Query: 359 LTYDLVNEVLVFTYNKC 375
+ YDL N+++ +T C
Sbjct: 422 VLYDLENQLIGWTEYNC 438
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 72.8 bits (177), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/364 (23%), Positives = 134/364 (36%), Gaps = 53/364 (14%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIP 83
+L+G+P + L +D GS ++W QC PC C+
Sbjct: 174 VLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQN------------------------- 208
Query: 84 MMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNA-------EIKNFIMGCG 136
C Y G+ S + G +T + ++N + GCG
Sbjct: 209 ----------DNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCG 258
Query: 137 DSYKGPFMTQFSGVFGLGRGPLSVQSQLNA---KAFSFCPVRLGSGSDQPSSIEFYDHT- 192
+G F + LS SQL + +FS+C V S ++ S + F +
Sbjct: 259 HWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKD 317
Query: 193 -LPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDI 251
L N + V KEN +YY +Q I + G +L+I + W + GG IID
Sbjct: 318 LLSHPNLNFTSFVAGKENLVDTFYY-VQIKSILVAGEVLNIPEETWNISSDGAGGTIIDS 376
Query: 252 GTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENGNIA 311
GT L+Y AY ++++ + K P Y +F D NV IA
Sbjct: 377 GTTLSYFAEPAYEFIKNKI--AEKAKGKYPVYR--DFPILDPCFNVSGIHNVQLPELGIA 432
Query: 312 GENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQGTLLTYDLVNEVLVFT 371
+ + + E +CL+ SA ++IG+ Q Q + YD L +
Sbjct: 433 FADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYA 492
Query: 372 YNKC 375
KC
Sbjct: 493 PTKC 496
>AT5G10080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3150843-3153380 FORWARD LENGTH=528
Length = 528
Score = 71.2 bits (173), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 153/391 (39%), Gaps = 76/391 (19%)
Query: 22 TFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRA-----------SSTY 70
T++ IGTP + +D GS + W C C C P+ + + A SST
Sbjct: 102 TWIDIGTPSVSFLVALDTGSNLLWIPCN-CVQCAPLTSTYYSSLATKDLNEYNPSSSSTS 160
Query: 71 KELGCYSDTCLIPMMRDQVFGNCTGWK--CRYNVR--SGNESRSFGVMVTDTL------- 119
K C C +C K C Y V SGN S S G++V D L
Sbjct: 161 KVFLCSHKLC-------DSASDCESPKEQCPYTVNYLSGNTSSS-GLLVEDILHLTYNTN 212
Query: 120 --IFEHSNAEIKNFIMGCGDSYKGPFMTQFS--GVFGLGRGPLSVQSQLNA-----KAFS 170
+ S++ ++GCG G ++ + G+ GLG +SV S L+ +FS
Sbjct: 213 NRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFS 272
Query: 171 FCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFML 230
C SG I F D P I+ + FLQ +G+++
Sbjct: 273 LCFDEEDSGR-----IYFGDMG-PSIQQSTP---------------FLQLDNNKYSGYIV 311
Query: 231 DIQSKVWGYGLNYDGGII--IDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGL-- 286
+++ G ID G + TYLP + Y E+ R H ++G+
Sbjct: 312 GVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDR--HINATSKNFEGVSW 369
Query: 287 EFCYKDDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTI--CLSFAEGKSS 344
E+CY+ P I+ F + N ++ ++ +FQ +G + CL +
Sbjct: 370 EYCYESSAEPKVPAIKLKFSHNN-------TFVIHKPLFVFQQSQGLVQFCLPISPSGQE 422
Query: 345 ALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
+ IG N ++G + +D N L ++ +KC
Sbjct: 423 GIGSIGQNYMRGYRMVFDRENMKLGWSPSKC 453
>AT4G12920.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:7568286-7569455 FORWARD LENGTH=389
Length = 389
Score = 70.9 bits (172), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 148/367 (40%), Gaps = 46/367 (12%)
Query: 19 AYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQ-RPLFITRASSTYKELGCYS 77
A+ + G+P + FL +D GS ++W QC PCS CY + P + AS TY++ C
Sbjct: 57 AFMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDAMCED 116
Query: 78 DTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNA---EIKNFIMG 134
P + C Y +E+ G + + + + + + G
Sbjct: 117 SH---PKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFG 173
Query: 135 CGDSYKGPFMTQFSGVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYDHTLP 194
C G + T +G+ GLG G S+ + +K FSFC LG S+ +S H L
Sbjct: 174 CNTLSDGSYFTG-TGILGLGVGKYSIIGEFGSK-FSFC---LGEISEPKAS-----HNL- 222
Query: 195 FIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDG--GIIIDIG 252
+ D +V HP + I+ + ++S + G + D + +D G
Sbjct: 223 ILGDGANVQ-------GHP-----TVINITEGHTIFQLESIIVGEEITLDDPVQVFVDTG 270
Query: 253 TNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENGNIAG 312
+ L++L ++ Y F V D + +P CYK D TIE E ++
Sbjct: 271 STLSHLSTNLYYKF---VDAFDDLIGSRPLSYEPTLCYKAD------TIE-RLEKMDVGF 320
Query: 313 ENFVSYKLNNN-QTLF--QAEEGTICLSFAEGKSS-ALTVIGSNQLQGTLLTYDLVNEVL 368
+ V +L+ N +F Q CL+ K S + +IG +QG + YDL +
Sbjct: 321 KFDVGAELSVNIHNIFIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTA 380
Query: 369 VFTYNKC 375
C
Sbjct: 381 YINKQDC 387
>AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16993339-16995721 FORWARD LENGTH=524
Length = 524
Score = 70.9 bits (172), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 149/383 (38%), Gaps = 69/383 (18%)
Query: 20 YATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFIT---------RASSTY 70
+ T + +GTP + +D GS + W C C C P + + + + S+T
Sbjct: 107 HYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKVSTTN 165
Query: 71 KELGCYSDTCLIPMMRDQVFGNCTGWKCRYNVR-SGNESRSFGVMVTDTLIFEHSNAEIK 129
K++ C + C R+Q G T C Y V ++ + G+++ D + H E K
Sbjct: 166 KKVTCNNSLC---AQRNQCLG--TFSTCPYMVSYVSAQTSTSGILMEDVM---HLTTEDK 217
Query: 130 N-------FIMGCGDSYKGPFM--TQFSGVFGLGRGPLSVQSQLN-----AKAFSFCPVR 175
N GCG G F+ +G+FGLG +SV S L A +FS C
Sbjct: 218 NPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMC--- 274
Query: 176 LGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSK 235
G D I F D E+ P N +HP Y + + + ++D +
Sbjct: 275 --FGHDGVGRISFGDKGSSDQEE-----TPFNLNPSHPNYN-ITVTRVRVGTTLIDDEFT 326
Query: 236 VWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYD-GLEFCY---K 291
+ D GT+ TYL Y+ D P E+CY
Sbjct: 327 A-----------LFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSN 375
Query: 292 DDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGS 351
D +++ P++ + GN S+ N+ + + EG + A KSS L +IG
Sbjct: 376 DANASLIPSLSLTMK-GN-------SHFTINDPIIVISTEGELVYCLAIVKSSELNIIGQ 427
Query: 352 NQLQGTLLTYDLVNEVLVFTYNK 374
N + G + +D E LV + K
Sbjct: 428 NYMTGYRVVFD--REKLVLAWKK 448
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 67.0 bits (162), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/376 (22%), Positives = 148/376 (39%), Gaps = 42/376 (11%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPLFITRASSTYKELGCYSDTCLIP 83
L +G P Q + + +D GS +SW C P +F +SSTY + C S C
Sbjct: 69 LAVGDPPQNISMVLDTGSELSWLHCKKS----PNLGSVFNPVSSSTYSPVPCSSPICRTR 124
Query: 84 MMRDQVFGNC--TGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDS--- 138
+ +C C + + + G + +T + + + GC DS
Sbjct: 125 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI--GSVTRPGTLFGCMDSGLS 182
Query: 139 YKGPFMTQFSGVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYDHTLPFIED 198
+ +G+ G+ RG LS +QL FS+C SGSD + D + ++
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI----SGSDSSGFLLLGDASYSWLGP 238
Query: 199 NNSVMVPLKENDAHPYY----YFLQFVGISINGFMLDIQSKVWGYGLNYDGGIIIDIGTN 254
+ L+ PY+ Y +Q GI + +L + V+ G ++D GT
Sbjct: 239 IQYTPLVLQSTPL-PYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQ 297
Query: 255 LTYLPSDAYSVFRSEV---RRVDHDLVKKPGY---DGLEFCYKDDPSNV-----YPTIEF 303
T+L Y+ ++E + LV P + ++ CYK + P +
Sbjct: 298 FTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSL 357
Query: 304 YFENG--NIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALT----VIGSNQLQGT 357
F +++G+ + Y++N + + +E C +F G S L VIG + Q
Sbjct: 358 MFRGAEMSVSGQKLL-YRVNGAGS--EGKEEVYCFTF--GNSDLLGIEAFVIGHHHQQNV 412
Query: 358 LLTYDLVNEVLVFTYN 373
+ +DL + F N
Sbjct: 413 WMEFDLAKSRVGFAGN 428
>AT3G51350.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19060485-19063248 REVERSE LENGTH=528
Length = 528
Score = 65.9 bits (159), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 142/382 (37%), Gaps = 69/382 (18%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSC--------YPMQRP--LFITRASSTYKELGC 75
+GTP + +D GS + W C ++C P P L+ AS+T + C
Sbjct: 108 VGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRC 167
Query: 76 YSDTCLIPMMRDQVFGN--CTGWK--CRYNVRSGNESRSFGVMVTDTLIFEHSNAEI--- 128
C FG+ C+ C Y + N + + G ++ D L + +
Sbjct: 168 SDKRC---------FGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPV 218
Query: 129 -KNFIMGCGDSYKGPFMTQFS--GVFGLGRGPLSVQSQL-----NAKAFSFCPVRLGSGS 180
N +GCG G F S GV GLG SV S L A +FS C G
Sbjct: 219 KANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMC---FGRVI 275
Query: 181 DQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYG 240
I F D E+ + V A Y + G+S+ G +DI+
Sbjct: 276 GNVGRISFGDRGYTDQEETPFISV------APSTAYGVNISGVSVAGDPVDIRLFAK--- 326
Query: 241 LNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGL--EFCYKDDPSNV- 297
D G++ T+L AY V + D ++P L EFCY P+
Sbjct: 327 --------FDTGSSFTHLREPAYGVLTKSFDELVEDR-RRPVDPELPFEFCYDLSPNATT 377
Query: 298 --YPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTI--CLSFAEGKSSALTVIGSNQ 353
+P +E F I G + LNN + +EG + CL + + VIG N
Sbjct: 378 IQFPLVEMTF----IGGSKII---LNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNF 430
Query: 354 LQGTLLTYDLVNEVLVFTYNKC 375
+ G + +D +L + + C
Sbjct: 431 VAGYRIVFDRERMILGWKQSLC 452
>AT5G37540.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14912862-14914190 FORWARD LENGTH=442
Length = 442
Score = 65.1 bits (157), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 149/380 (39%), Gaps = 53/380 (13%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPL--FITRASSTYKELGCYSDTCL 81
L IGTP Q L +D GS +SW QC P P+ P F SS++ +L C C
Sbjct: 84 LPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCK 143
Query: 82 IPMMRDQVFGNCTGWK-CRYNVRSGNESRSFGVMVTDTLIFEHSNAEIKNFIMGCGDSYK 140
+ + +C + C Y+ + + + G +V + F +S I+GC
Sbjct: 144 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQT-TPPLILGCAKE-- 200
Query: 141 GPFMTQFSGVFGLGRGPLSVQSQLNAKAFSFC-PVR------LGSGS----DQPSSIEF- 188
T G+ G+ G LS SQ FS+C P R +GS D P+S F
Sbjct: 201 ---STDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNPNSRGFK 257
Query: 189 YDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYDGGII 248
Y L F + + + P Y + GI I L+I V+ G +
Sbjct: 258 YVSLLTFPQSQ-------RMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTM 310
Query: 249 IDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGY---DGLEFCYKDDPSNVYPTIEFYF 305
+D G+ T+L AY + E+ R+ +KK GY + C+ + S
Sbjct: 311 VDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKK-GYVYGSTADMCFDGNHS---------M 360
Query: 306 ENGNIAGE------NFVSYKLNNNQTLFQAEEGTICLSFAEGKSSAL----TVIGSNQLQ 355
E G + G+ V + L G C+ G+SS L +IG+ Q
Sbjct: 361 EIGRLIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGI--GRSSMLGAASNIIGNVHQQ 418
Query: 356 GTLLTYDLVNEVLVFTYNKC 375
+ +D+ N + F+ +C
Sbjct: 419 NLWVEFDVTNRRVGFSKAEC 438
>AT3G51340.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19057013-19059788 REVERSE LENGTH=530
Length = 530
Score = 64.7 bits (156), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/383 (22%), Positives = 152/383 (39%), Gaps = 65/383 (16%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSC--------YPMQRPL--FITRASSTYKELGC 75
+GTP + +D GS + W C ++C + PL + AS+T + C
Sbjct: 109 LGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRC 168
Query: 76 YSDTCLIPMMRDQVFGNCTGWK--CRYNVRSGNESRSFGVMVTDTLIFEHSNAEIK---- 129
C G C+ + C Y + + + + G ++ D L + ++K
Sbjct: 169 SDKRCFGS-------GKCSSPESICPYQIALSSNTVTTGTLLQDVLHLVTEDEDLKPVNA 221
Query: 130 NFIMGCGDSYKGPFMTQFS--GVFGLGRGPLSV-----QSQLNAKAFSFCPVRLGSGSDQ 182
N +GCG + G F T + GV GL SV ++ + A +FS C R+ S +
Sbjct: 222 NVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGR 281
Query: 183 PSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLN 242
I F D + + + +V L+ + A Y + G+S+ G +D+
Sbjct: 282 ---ISFGDKG--YTDQEETPLVSLETSTA----YGVNVTGVSVGGVPVDVPLFA------ 326
Query: 243 YDGGIIIDIGTNLTYLPSDAYSVF--------RSEVRRVDHDLVKKPGYDGLEFCYKDDP 294
+ D G++ T L AY VF + R VD D + YD E D
Sbjct: 327 -----LFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDA 381
Query: 295 SNVYPTIEFYFENGNIAGENFVSYKLNNN--QTLFQAEEGTICLSFAEGKSSALTVIGSN 352
+ + Y N ++F +++ N+ +++ + EGT KS L +IG N
Sbjct: 382 RPRHMQSKCY----NPCRDDF-RWRIQNDSQESVSYSNEGTKMYCLGILKSINLNIIGQN 436
Query: 353 QLQGTLLTYDLVNEVLVFTYNKC 375
+ G + +D +L + + C
Sbjct: 437 LMSGHRIVFDRERMILGWKQSNC 459
>AT2G17760.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:7713488-7716269 FORWARD LENGTH=513
Length = 513
Score = 63.5 bits (153), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/380 (23%), Positives = 147/380 (38%), Gaps = 66/380 (17%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSCY-PMQRP--------LFITRASSTYKELG 74
+ +GTP + +D GS + W C C++C ++ P ++ ASST ++
Sbjct: 108 VTVGTPSDWFMVALDTGSDLFWLPCD-CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVP 166
Query: 75 CYS------DTCLIPMMRDQVFGNCTGWKCRYNVRS-GNESRSFGVMVTDTLIF----EH 123
C S D C P C Y +R N + S GV+V D L +
Sbjct: 167 CNSTLCTRGDRCASPES-----------DCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKS 215
Query: 124 SNAEIKNFIMGCGDSYKGPFMTQFS--GVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSD 181
S A GCG G F + G+FGLG +SV S L + + + G+D
Sbjct: 216 SKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGND 275
Query: 182 QPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGL 241
I F D + PL HP Y + IS+ G D++
Sbjct: 276 GAGRISFGDKG-----SVDQRETPLNIRQPHPTYN-ITVTKISVGGNTGDLE-------- 321
Query: 242 NYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGL--EFCYKDDPSN--- 296
+D + D GT+ TYL AY++ + D + L E+CY P+
Sbjct: 322 -FDA--VFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSF 378
Query: 297 VYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTI-CLSFAEGKSSALTVIGSNQLQ 355
YP + + G+ SY + + + ++ + CL+ K +++IG N +
Sbjct: 379 QYPAVNLTMKGGS-------SYPVYHPLVVIPMKDTDVYCLAIM--KIEDISIIGQNFMT 429
Query: 356 GTLLTYDLVNEVLVFTYNKC 375
G + +D +L + + C
Sbjct: 430 GYRVVFDREKLILGWKESDC 449
>AT4G33490.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108781-16110679 REVERSE LENGTH=425
Length = 425
Score = 62.4 bits (150), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 86/388 (22%), Positives = 146/388 (37%), Gaps = 59/388 (15%)
Query: 2 TNNTIMEATTNAYPVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQC-APCSSCYPMQRP 60
++ + N YP + Y + IG P + +L +D GS ++W QC APC C P
Sbjct: 43 VSSVVFPVHGNVYP-LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP 101
Query: 61 LFITRASSTYKELGCYSDTCLIPMMRDQVFGNC-TGWKCRYNVRSGNESRSFGVMVTDTL 119
L+ + + C C + C T +C Y V + S GV+V D
Sbjct: 102 LY----QPSSDLIPCNDPLCKALHLNSN--QRCETPEQCDYEVEYADGGSSLGVLVRDVF 155
Query: 120 IFEHSNA--EIKNFIMGCG-DSYKGPFMTQ-FSGVFGLGRGPLSVQSQLNAKAF-----S 170
++ +GCG D G GV GLGRG +S+ SQL+++ +
Sbjct: 156 SMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 215
Query: 171 FCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFML 230
C LG G I F+ L D++ V + +Y + G
Sbjct: 216 HCLSSLGGG------ILFFGDDL---YDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTT 266
Query: 231 DIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDG----- 285
+++ + + D G++ TY S AY ++R +L KP +
Sbjct: 267 GLKNLL----------TVFDSGSSYTYFNSKAYQAVTYLLKR---ELSGKPLKEARDDHT 313
Query: 286 LEFCYK--------DDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGTICLS 337
L C++ ++ + + F+ G + F ++ L + +G +CL
Sbjct: 314 LPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLF---EIPPEAYLIISMKGNVCLG 370
Query: 338 FAEGKS---SALTVIGSNQLQGTLLTYD 362
G L +IG +Q ++ YD
Sbjct: 371 ILNGTEIGLQNLNLIGDISMQDQMIIYD 398
>AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:9329933-9331432 REVERSE LENGTH=499
Length = 499
Score = 61.6 bits (148), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 113/286 (39%), Gaps = 45/286 (15%)
Query: 130 NFIMGCGDSYKGPFMTQFSGVFGLGRGPLSVQSQLNAKA------FSFCPVRLGSGSDQ- 182
NF GC + + + GV G GRG LS+ +QL + FS+C V SD+
Sbjct: 211 NFTFGCAHTT----LAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRV 266
Query: 183 --PSSI---EFYDHTLPFI--------------EDNNSVMVPLKENDAHPYYYFLQFVGI 223
PS + F D + + N V + EN HPY+Y + GI
Sbjct: 267 RRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVSLQGI 326
Query: 224 SINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSV----FRSEVRRVDHDLVK 279
SI + + + N GG+++D GT T LP+ Y+ F S V RV +
Sbjct: 327 SIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADR 386
Query: 280 KPGYDGLEFCYKDDPSNVYPTIEFYFENGN----IAGENFVSYKLNNNQTLFQAEEGTIC 335
G+ CY + + P + +F + N+ Y+ + + + C
Sbjct: 387 VEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYF-YEFMDGGDGKEEKRKIGC 445
Query: 336 LSFAEG------KSSALTVIGSNQLQGTLLTYDLVNEVLVFTYNKC 375
L G + ++G+ Q QG + YDL+N + F KC
Sbjct: 446 LMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 491
>AT4G33490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16108928-16110670 REVERSE LENGTH=401
Length = 401
Score = 61.6 bits (148), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 78/189 (41%), Gaps = 18/189 (9%)
Query: 2 TNNTIMEATTNAYPVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQC-APCSSCYPMQRP 60
++ + N YP + Y + IG P + +L +D GS ++W QC APC C P
Sbjct: 40 VSSVVFPVHGNVYP-LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP 98
Query: 61 LFITRASSTYKELGCYSDTCLIPMMRDQVFGNC-TGWKCRYNVRSGNESRSFGVMVTDTL 119
L+ + + C C + C T +C Y V + S GV+V D
Sbjct: 99 LY----QPSSDLIPCNDPLCKALHLNSN--QRCETPEQCDYEVEYADGGSSLGVLVRDVF 152
Query: 120 IFEHSNA--EIKNFIMGCG-DSYKGPFMTQ-FSGVFGLGRGPLSVQSQLNAKAF-----S 170
++ +GCG D G GV GLGRG +S+ SQL+++ +
Sbjct: 153 SMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 212
Query: 171 FCPVRLGSG 179
C LG G
Sbjct: 213 HCLSSLGGG 221
>AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:19627892-19629112 REVERSE LENGTH=406
Length = 406
Score = 61.6 bits (148), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 101/250 (40%), Gaps = 33/250 (13%)
Query: 142 PFMTQFS-GVFGL-GRGP--LSVQSQLN------AKAFSFC------PVRLGS--GSDQP 183
PF+ F GVFGL G P L+ +QL K F+ C P++ G+ P
Sbjct: 153 PFLVDFPPGVFGLAGLAPTALATWNQLTRPRLGLEKKFALCLPSDENPLKKGAIYFGGGP 212
Query: 184 SSIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNY 243
+ D + + P K N+ YFL GIS+NG + + + N
Sbjct: 213 YKLRNIDAR-SMLSYTRLITNPRKLNN-----YFLGLKGISVNGNRILFAPNAFAFDRNG 266
Query: 244 DGGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEF 303
DGG+ + T L SD Y VF + + + EFC + P I+
Sbjct: 267 DGGVTLSTIFPFTMLRSDIYRVFIEAFSQATSGIPRVSSTTPFEFCLSTTTNFQVPRIDL 326
Query: 304 YFENGNIAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSA--LTVIGSNQLQGTLLTY 361
NG V +KL+ + + + CL+F G +A +IG +Q++ TL+ +
Sbjct: 327 ELANG-------VIWKLSPANAMKKVSDDVACLAFVNGGDAAAQAVMIGIHQMENTLVEF 379
Query: 362 DLVNEVLVFT 371
D+ F+
Sbjct: 380 DVGRSAFGFS 389
>AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6411720-6413170 REVERSE LENGTH=405
Length = 405
Score = 57.8 bits (138), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/400 (22%), Positives = 150/400 (37%), Gaps = 63/400 (15%)
Query: 2 TNNTIMEATTNAYPVIDAYATFLLIGTPVQIVFLRVDIGSPISWFQCAPCSSCYPMQRPL 61
+N+ + T P Y TF + V L +D+G+ ++W C R L
Sbjct: 22 SNSQYLLPITKHEPTNLFYTTFNVGSAAKSPVNLLLDLGTNLTWLDC----------RKL 71
Query: 62 FITRASSTYKELGCYSDTCLIPMMRDQVFGN-CTGWKCRYNVRS--GNESRSFGVMVTDT 118
++ S+ + + C S TC + GN C G C Y + G G +V D
Sbjct: 72 ---KSLSSLRLVTCQSSTC------KSIPGNGCAGKSCLYKQPNPLGQNPVVTGRVVQDR 122
Query: 119 LIFEH-------SNAEIKNFIMGC-GDSYKGPFMTQFSGVFGLGRGPLSVQSQLNAK--- 167
S +++F C G+ GV L G S Q+ +
Sbjct: 123 ASLYTTDGGKFLSQVSVRHFTFSCAGEKALQGLPPPVDGVLALSPGSSSFTKQVTSAFNV 182
Query: 168 --AFSFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSV---MVPLKENDAHPYYYFLQFVG 222
FS C G+G + I ++ PF +N + + P+K D+ Y ++
Sbjct: 183 IPKFSLCLPSSGTGHFYIAGIHYF--IPPFNSSDNPIPRTLTPIKGTDSGDYLITVK--S 238
Query: 223 ISINGFMLDIQSKVWGYGLNYDGGIIIDIGTNLTYLPSDAYSVFRSE--VRRVDHDLVKK 280
I + G L + + GG + + T L +D Y+ ++ + K
Sbjct: 239 IYVGGTALKLNPDL------LTGGAKLSTVVHYTVLQTDIYNALAQSFTLKAKAMGIAKV 292
Query: 281 PGYDGLEFCYKDDPSNV-------YPTIEFYFENGNIAGENFVSYKLNNNQTLFQAEEGT 333
P + C+ + P IE G I GE V + T+ + +E
Sbjct: 293 PSVAPFKHCFDSRTAGKNLTAGPNVPVIEIGLP-GRI-GE--VKWGFYGANTVVKVKETV 348
Query: 334 ICLSFAEGKSSA--LTVIGSNQLQGTLLTYDLVNEVLVFT 371
+CL+F +G + L VIG++QLQ +L +D VL F+
Sbjct: 349 MCLAFIDGGKTPKDLMVIGTHQLQDHMLEFDFSGTVLAFS 388
>AT3G51330.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19053480-19056152 REVERSE LENGTH=529
Length = 529
Score = 52.8 bits (125), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/380 (21%), Positives = 143/380 (37%), Gaps = 64/380 (16%)
Query: 26 IGTPVQIVFLRVDIGSPISWFQCAPCSSC--------YPMQRP--LFITRASSTYKELGC 75
+GTP + +D GS + W C S+C RP L+ SST + C
Sbjct: 108 VGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRC 167
Query: 76 YSDTCLIPMMRDQVFGNCTGWKCRYNVRSGNESRSFGVMVTDTLIFEHSNAEIK----NF 131
D C +C ++ +Y + ++ + G + D L + ++ N
Sbjct: 168 SDDRCFGSSRCSSPASSCP-YQIQYLSK---DTFTTGTLFEDVLHLVTEDEGLEPVKANI 223
Query: 132 IMGCGDSYKGPFMTQ--FSGVFGLGRGPLSV-----QSQLNAKAFSFCPVRLGSGSDQPS 184
+GCG + G + +G+ GLG SV ++++ A +FS C G+ D
Sbjct: 224 TLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSMC---FGNIIDVVG 280
Query: 185 SIEFYDHTLPFIEDNNSVMVPLKENDAHPYYYFLQFVGISINGFMLDIQSKVWGYGLNYD 244
I F D + + PL + P Y + +S+ G + +Q
Sbjct: 281 RISFGDKGY-----TDQMETPLLPTEPSPTYA-VSVTEVSVGGDAVGVQLLA-------- 326
Query: 245 GGIIIDIGTNLTYLPSDAYSVFRSEVRRVDHDLVKKPGYD---GLEFCYKDDPSN---VY 298
+ D GT+ T+L Y + DH K+ D EFCY P+ ++
Sbjct: 327 ---LFDTGTSFTHLLEPEYGLITKAFD--DHVTDKRRPIDPELPFEFCYDLSPNKTTILF 381
Query: 299 PTIEFYFENGN---IAGENFVSYKLNNNQTLFQAEEGTICLSFAEGKSSALTVIGSNQLQ 355
P + FE G+ + F+ + +N+ CL + + +IG N +
Sbjct: 382 PRVAMTFEGGSQMFLRNPLFIVWNEDNS--------AMYCLGILKSVDFKINIIGQNFMS 433
Query: 356 GTLLTYDLVNEVLVFTYNKC 375
G + +D +L + + C
Sbjct: 434 GYRIVFDRERMILGWKRSDC 453
>AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6414585-6415745 FORWARD LENGTH=386
Length = 386
Score = 52.0 bits (123), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 51/227 (22%), Positives = 96/227 (42%), Gaps = 26/227 (11%)
Query: 149 GVFGLGRGPLSVQSQLNAKAFSFCPVRLGSGSDQPSSIEFYDHTLPFIEDNNSVMVPLKE 208
GV GLGR +S+ SQL A+ +++ + Y L + +SV
Sbjct: 170 GVMGLGRAQISLPSQLAAE------------TNERRRLTVYLSPLNGVVSTSSVEEVFGV 217
Query: 209 NDAHPYYYFLQFVGISINGFMLDIQS-KVWGYGLNYDGGIIIDIGTNLTY--LPSDAYSV 265
+ Y G S N ++++++S +V G L+ +G + +++ T + Y L S Y V
Sbjct: 218 AASRSLVYTPLLTGSSGN-YVINVKSIRVNGEKLSVEGPLAVELSTVVPYTILESSIYKV 276
Query: 266 FRSEVRRVDHDLVKKPGYDGLEFCYKDDPSNVYPTIEFYFENGNIAGENFVSYKLNNNQT 325
F + + P C+ D +P ++ ++ V ++++
Sbjct: 277 FAEAYAKAAGEATSVPPVAPFGLCFTSDVD--FPAVDLALQS------EMVRWRIHGKNL 328
Query: 326 LFQAEEGTICLSFAEGKSSALT--VIGSNQLQGTLLTYDLVNEVLVF 370
+ G C +G SS + V+G QL+G +L +DL N ++ F
Sbjct: 329 MVDVGGGVRCSGIVDGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGF 375
>AT3G25700.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=350
Length = 350
Score = 50.1 bits (118), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 53/120 (44%), Gaps = 7/120 (5%)
Query: 24 LLIGTPVQIVFLRVDIGSPISWFQCAPCSSC-YPMQRPLFITRASSTYKELGCYSDTC-L 81
L IG P Q + L D GS + W +C+ C +C + +F R SST+ CY C L
Sbjct: 88 LRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRL 147
Query: 82 IPMMRDQVFGNCTGWK--CRYNVRSGNESRSFGVMVTDTLIFEHSN---AEIKNFIMGCG 136
+P N T C Y + S + G+ +T + S+ A +K+ GCG
Sbjct: 148 VPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCG 207