Miyakogusa Predicted Gene
- Lj6g3v1880290.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1880290.1 Non Chatacterized Hit- tr|B9RTU6|B9RTU6_RICCO
Basic 7S globulin 2 small subunit, putative OS=Ricinus,61.93,0,BASIC
7S GLOBULIN-RELATED,NULL; ASPARTYL PROTEASES,Peptidase A1; no
description,Peptidase aspartic, ,CUFF.60060.1
(427 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 410 e-115
AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr... 399 e-111
AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 189 2e-48
AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family pr... 165 6e-41
AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family pr... 150 2e-36
AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 148 6e-36
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 75 8e-14
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 74 2e-13
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 74 2e-13
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 70 3e-12
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 69 7e-12
AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family pr... 65 1e-10
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 63 3e-10
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 59 7e-09
AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease famil... 54 2e-07
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 54 2e-07
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 53 4e-07
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 53 5e-07
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 52 6e-07
AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family pr... 52 6e-07
AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family pr... 52 1e-06
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 51 1e-06
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 51 1e-06
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 50 2e-06
>AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:787143-788444 FORWARD LENGTH=433
Length = 433
Score = 410 bits (1055), Expect = e-115, Method: Compositional matrix adjust.
Identities = 217/434 (50%), Positives = 287/434 (66%), Gaps = 20/434 (4%)
Query: 8 LLFHTLIIPFIYP---SIASTFHPNALVLPVTRDPATNQYVTLLHQRTPLVPVKLTLDLS 64
++F L++ FI+ S + F P AL+LPVT+D +T QY T+++QRTPLVP + DL
Sbjct: 6 IIFSVLLL-FIFSLSSSAQTPFRPKALLLPVTKDQSTLQYTTVINQRTPLVPASVVFDLG 64
Query: 65 GQFLWVDCEEGYVSSTYHPAHCHTPQCSITRSKSCVDCYLS-KPGCNINTCNLFPNNIFT 123
G+ LWVDC++GYVSSTY C++ CS S SC C+ +PGC+ NTC P+N T
Sbjct: 65 GRELWVDCDKGYVSSTYQSPRCNSAVCSRAGSTSCGTCFSPPRPGCSNNTCGGIPDNTVT 124
Query: 124 HTNQIGEVALDVVAVHSTDGSNPGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNE 183
T GE ALDVV++ ST+GSNPG++V +PN +F CG T LLKGLA G GMAG+GR+N
Sbjct: 125 GTATSGEFALDVVSIQSTNGSNPGRVVKIPNLIFDCGATFLLKGLAKGTVGMAGMGRHN- 183
Query: 184 ISVPXXXXXXXXXXXXXXXCLSSSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPD 243
I +P CL+S GV FFG+GPYVFLPG+ +S SL TPL+ NP
Sbjct: 184 IGLPSQFAAAFSFHRKFAVCLTSG---KGVAFFGNGPYVFLPGIQIS-SLQTTPLLINPV 239
Query: 244 NSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSI-GDEGEGGTKISTVNPYTTMET 302
++A G ++EYFIGV I+I EK + +N +LL I G GGTKIS+VNPYT +E+
Sbjct: 240 STASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGIGGTKISSVNPYTVLES 299
Query: 303 SIYHAFVNAFANEL--EDVPQEKPIAPFKLCFNSKNL-------EVPAIDFVLQGKGVFW 353
SIY+AF + F + + + + PF CF++KN+ VP I+ VL K V W
Sbjct: 300 SIYNAFTSEFVKQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHSKDVVW 359
Query: 354 RILGGNSMVQVSREVSCLAFVDGGIDATTSIVIGGYQLEDNLLQFDLVNSRLGFSSSLLL 413
RI G NSMV VS +V CL FVDGG++A TS+VIGG+QLEDNL++FDL +++ GFSS+LL
Sbjct: 360 RIFGANSMVSVSDDVICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNKFGFSSTLLG 419
Query: 414 TQTTCANFNFTSSA 427
QT CANFNFTS+A
Sbjct: 420 RQTNCANFNFTSTA 433
>AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:790110-791414 FORWARD LENGTH=434
Length = 434
Score = 399 bits (1026), Expect = e-111, Method: Compositional matrix adjust.
Identities = 205/414 (49%), Positives = 271/414 (65%), Gaps = 16/414 (3%)
Query: 25 TFHPNALVLPVTRDPATNQYVTLLHQRTPLVPVKLTLDLSGQFLWVDCEEGYVSSTYHPA 84
+F P AL+LPVT+DP+T QY T+++QRTPLVP + DL G+ WVDC++GYVS+TY
Sbjct: 26 SFRPKALLLPVTKDPSTLQYTTVINQRTPLVPASVVFDLGGREFWVDCDQGYVSTTYRSP 85
Query: 85 HCHTPQCSITRSKSCVDCYLS-KPGCNINTCNLFPNNIFTHTNQIGEVALDVVAVHSTDG 143
C++ CS S +C C+ +PGC+ NTC FP+N T GE ALDVV++ ST+G
Sbjct: 86 RCNSAVCSRAGSIACGTCFSPPRPGCSNNTCGAFPDNSITGWATSGEFALDVVSIQSTNG 145
Query: 144 SNPGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXC 203
SNPG+ V +PN +F+CG T+LLKGLA G GMAG+GR+N I +P C
Sbjct: 146 SNPGRFVKIPNLIFSCGSTSLLKGLAKGAVGMAGMGRHN-IGLPLQFAAAFSFNRKFAVC 204
Query: 204 LSSSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVK 263
L+S GV FFG+GPYVFLPG+ +S+ L TPL+ NP + G + EYFIGV
Sbjct: 205 LTSG---RGVAFFGNGPYVFLPGIQISR-LQKTPLLINPGTTVFEFSKGEKSPEYFIGVT 260
Query: 264 GIRINEKLIQLNTSLLSI-GDEGEGGTKISTVNPYTTMETSIYHAFVNAFANEL--EDVP 320
I+I EK + ++ +LL I G GGTKIS+VNPYT +E+SIY AF + F + +
Sbjct: 261 AIKIVEKTLPIDPTLLKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQAAARSIK 320
Query: 321 QEKPIAPFKLCFNSKNL-------EVPAIDFVLQGKGVFWRILGGNSMVQVSREVSCLAF 373
+ + PF CF++KN+ VP I VL K V WRI G NSMV VS +V CL F
Sbjct: 321 RVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDVICLGF 380
Query: 374 VDGGIDATTSIVIGGYQLEDNLLQFDLVNSRLGFSSSLLLTQTTCANFNFTSSA 427
VDGG++ S+VIGG+QLEDNL++FDL +++ GFSS+LL QT CANFNFTS+A
Sbjct: 381 VDGGVNPGASVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQTNCANFNFTSTA 434
>AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6414585-6415745 FORWARD LENGTH=386
Length = 386
Score = 189 bits (481), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 196/402 (48%), Gaps = 37/402 (9%)
Query: 8 LLFHTLIIPFIYPSIASTFHPNALVLPVTRDPATNQYVTLLHQRTPLVPVKLTLDLSGQF 67
L F + + I + N +V PV +D T QY+ + PVKL +DL+G
Sbjct: 9 LFFFSFLSALIISKSQISDSVNGVVFPVVKDLPTGQYLAQIRLGDSPDPVKLVVDLAGSI 68
Query: 68 LWVDCEEGYVSSTYHPAHCHTPQCSITR--SKSCVDCYLSKPGCNINTCNLFPNNIFTHT 125
LW DC +VSS+ + + C + ++ S+ N + L N+ F T
Sbjct: 69 LWFDCSSRHVSSSRNLISGSSSGCLKAKVGNERVSSSSSSRKDQNADCELLVKNDAFGIT 128
Query: 126 NQIGEVALDVVAVHSTDGSNPGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEIS 185
+ GE+ DV++V S ++PG + + LF C LL+GLASG +G+ GLGR +IS
Sbjct: 129 AR-GELFSDVMSVGSV--TSPGTV----DLLFACTPPWLLRGLASGAQGVMGLGRA-QIS 180
Query: 186 VPXXXXXXXXXXXXXXXCLSSSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNS 245
+P LS +GV+ VF GV S+SL+YTPL+T
Sbjct: 181 LPSQLAAETNERRRLTVYLS---PLNGVVSTSSVEEVF--GVAASRSLVYTPLLTGS--- 232
Query: 246 AGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIY 305
+ Y I VK IR+N + + + EG ++STV PYT +E+SIY
Sbjct: 233 ---------SGNYVINVKSIRVNGEKLSV---------EGPLAVELSTVVPYTILESSIY 274
Query: 306 HAFVNAFANELEDVPQEKPIAPFKLCFNSKNLEVPAIDFVLQGKGVFWRILGGNSMVQVS 365
F A+A + P+APF LCF S +++ PA+D LQ + V WRI G N MV V
Sbjct: 275 KVFAEAYAKAAGEATSVPPVAPFGLCFTS-DVDFPAVDLALQSEMVRWRIHGKNLMVDVG 333
Query: 366 REVSCLAFVDGGIDATTSIVIGGYQLEDNLLQFDLVNSRLGF 407
V C VDGG IV+GG QLE +L FDL NS +GF
Sbjct: 334 GGVRCSGIVDGGSSRVNPIVMGGLQLEGFILDFDLGNSMMGF 375
>AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:19627892-19629112 REVERSE LENGTH=406
Length = 406
Score = 165 bits (417), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 121/424 (28%), Positives = 191/424 (45%), Gaps = 34/424 (8%)
Query: 8 LLFHTLIIPFIYPSIASTFHP-NALVLPVTRDPATNQYVTLLHQRTPLVPVKLTLDLSGQ 66
LL LI+ F Y +++ ++P ALV V+++ + L+ + + + G
Sbjct: 5 LLVLCLILFFTYSYVSANYYPPKALVSTVSKNTILPIFTFTLNTNQ-----EFFIHIGGP 59
Query: 67 FLWVDCEEGYVSSTYHPAHCHTPQCSITRSKSCVDCYLSKPG-----CNINTCNLFPNNI 121
+L C +G C +P C++TR + C L C P
Sbjct: 60 YLVRKCNDGLPRPI---VPCGSPVCALTRRFTPHQCSLPSNKIINGVCACQATAFEPFQR 116
Query: 122 FTHTNQIGEVALDVVAVHSTDGSNPGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRN 181
+++Q L + ++ S V + N + C L GV G+AGL
Sbjct: 117 ICNSDQFTYGDLSISSLKPISPS-----VTINNVYYLCIPQPFLVDFPPGVFGLAGLAPT 171
Query: 182 NEISVPXXXXXXXXXXXXXXXCLSSSTK--SSGVLFFGDGPYVFLPGVDVSKSLIYTPLI 239
+ CL S G ++FG GPY L +D L YT LI
Sbjct: 172 ALATWNQLTRPRLGLEKKFALCLPSDENPLKKGAIYFGGGPYK-LRNIDARSMLSYTRLI 230
Query: 240 TNPDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTT 299
TNP R YF+G+KGI +N I + + G+GG +ST+ P+T
Sbjct: 231 TNP----------RKLNNYFLGLKGISVNGNRILFAPNAFAFDRNGDGGVTLSTIFPFTM 280
Query: 300 METSIYHAFVNAFANELEDVPQEKPIAPFKLCFNSK-NLEVPAIDFVLQGKGVFWRILGG 358
+ + IY F+ AF+ +P+ PF+ C ++ N +VP ID L GV W++
Sbjct: 281 LRSDIYRVFIEAFSQATSGIPRVSSTTPFEFCLSTTTNFQVPRIDLEL-ANGVIWKLSPA 339
Query: 359 NSMVQVSREVSCLAFVDGGIDATTSIVIGGYQLEDNLLQFDLVNSRLGFSSSLLLTQTTC 418
N+M +VS +V+CLAFV+GG A +++IG +Q+E+ L++FD+ S GFSSSL L +C
Sbjct: 340 NAMKKVSDDVACLAFVNGGDAAAQAVMIGIHQMENTLVEFDVGRSAFGFSSSLGLVSASC 399
Query: 419 ANFN 422
+F
Sbjct: 400 GDFQ 403
>AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6411720-6413170 REVERSE LENGTH=405
Length = 405
Score = 150 bits (379), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/405 (30%), Positives = 183/405 (45%), Gaps = 45/405 (11%)
Query: 32 VLPVTRDPATNQYVTLLHQRTPL-VPVKLTLDLSGQFLWVDCEEGYVSSTYHPAHCHTPQ 90
+LP+T+ TN + T + + PV L LDL W+DC + S+ C +
Sbjct: 27 LLPITKHEPTNLFYTTFNVGSAAKSPVNLLLDLGTNLTWLDCRKLKSLSSLRLVTCQSST 86
Query: 91 CSITRSKSCV--DCYLSKPGCNINTCNLFPNNIFTHTNQIGEVALDVVAVHSTDGSNPGK 148
C C C +P L N + T G V D ++++TDG
Sbjct: 87 CKSIPGNGCAGKSCLYKQPN------PLGQNPVVT-----GRVVQDRASLYTTDGGKFLS 135
Query: 149 MVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLSSST 208
V V +F F+C L+GL V G+ L + S CL SS
Sbjct: 136 QVSVRHFTFSCAGEKALQGLPPPVDGVLALSPGSS-SFTKQVTSAFNVIPKFSLCLPSSG 194
Query: 209 KSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRIN 268
F+ G + F+P + S + I P P G + +Y I VK I +
Sbjct: 195 TGH---FYIAGIHYFIPPFNSSDNPI--PRTLTP-------IKGTDSGDYLITVKSIYVG 242
Query: 269 EKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQEK--PIA 326
++LN LL+ GG K+STV YT ++T IY+A +F + + + K +A
Sbjct: 243 GTALKLNPDLLT------GGAKLSTVVHYTVLQTDIYNALAQSFTLKAKAMGIAKVPSVA 296
Query: 327 PFKLCFNS----KNL----EVPAIDFVLQGK--GVFWRILGGNSMVQVSREVSCLAFVDG 376
PFK CF+S KNL VP I+ L G+ V W G N++V+V V CLAF+DG
Sbjct: 297 PFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRIGEVKWGFYGANTVVKVKETVMCLAFIDG 356
Query: 377 GIDATTSIVIGGYQLEDNLLQFDLVNSRLGFSSSLLLTQTTCANF 421
G +VIG +QL+D++L+FD + L FS SLLL T+C+ +
Sbjct: 357 GKTPKDLMVIGTHQLQDHMLEFDFSGTVLAFSESLLLHNTSCSTW 401
>AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6408242-6409417 REVERSE LENGTH=391
Length = 391
Score = 148 bits (374), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 122/380 (32%), Positives = 171/380 (45%), Gaps = 60/380 (15%)
Query: 34 PVTRDPATNQYVTLLHQRTPLVPVKLTLDLSGQF-LWVDCEEGYVSSTYHPAHCHTPQCS 92
P+ +D A N Y L + K LDL+G L +C S+TYHP C + +C
Sbjct: 33 PIYKDTAKNIYTIPLSIGS-TSSEKFVLDLNGAAPLLQNCPTAAKSTTYHPIRCGSTRCK 91
Query: 93 ITRSKSCVDCYLSKPGCNINTCNLFPNNIFTHTNQI------GEVALDVVAV-HSTDGSN 145
C PNN+ + + D V + ++ +G
Sbjct: 92 YANPN--FPC---------------PNNVIAKKRTVCLSSDNSRLFRDTVPLLYTFNGVY 134
Query: 146 PGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLS 205
+ + TC T+ L G+A N +S+P CL
Sbjct: 135 TRDSEMSSSLTLTC--TDGAPALKQRTIGLA----NTHLSIPSQLISMYQLPHKIALCLP 188
Query: 206 SSTKS---SGVLFFGDGPYVFLP-GVDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIG 261
S+ +S +G L+ G G Y +LP DVSK TPLI N G+ + EY I
Sbjct: 189 STERSQSHNGDLWIGKGEYYYLPYDKDVSKIFASTPLIGN----------GK-SGEYLID 237
Query: 262 VKGIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQ 321
VK I+I K + + G TKIST+ PYT +TS+Y A + AF ++ + +
Sbjct: 238 VKSIQIGAKTVPI----------PYGATKISTLAPYTVFQTSLYKALLTAFTENIK-IAK 286
Query: 322 EKPIAPFKLCFNSKNLE-VPAIDFVLQGKGVFWRILGGNSMVQVSREVSCLAFVDGGIDA 380
+ PF CF S VP ID VL G G WRI G NS+V+V++ V CL FVDGG+
Sbjct: 287 APAVKPFGACFYSNGGRGVPVIDLVLSG-GAKWRIYGSNSLVKVNKNVVCLGFVDGGVKP 345
Query: 381 TTSIVIGGYQLEDNLLQFDL 400
IVIGG+Q+EDNL++FDL
Sbjct: 346 KYPIVIGGFQMEDNLVEFDL 365
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 75.1 bits (183), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 97/402 (24%), Positives = 153/402 (38%), Gaps = 69/402 (17%)
Query: 40 ATNQYVTLLHQRTPLVPVKLTLDLSGQFLWVDCEE--------------GYVSSTYHPAH 85
+ QY L P + L D +WV C SST+ PAH
Sbjct: 80 GSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAH 139
Query: 86 CHTPQCSITRSKSCVDCYLSKPGCNI----NTCNLFPNNIFTHTNQIGEVALDVVAVHST 141
C+ P C + P CN +TC+ + + G + + A +T
Sbjct: 140 CYDPVCRLVPKPD------RAPICNHTRIHSTCH------YEYGYADGSLTSGLFARETT 187
Query: 142 D-GSNPGKMVIVPNFLFTCG---RTNLLKGLA-SGVKGMAGLGRNNEISVPXXXXXXXXX 196
++ GK + + F CG + G + +G G+ GLGR +
Sbjct: 188 SLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRG---PISFASQLGRRF 244
Query: 197 XXXXXXCLSSSTKS---SGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGR 253
CL T S + L G+G G +SK L +TPL+TNP P F
Sbjct: 245 GNKFSYCLMDYTLSPPPTSYLIIGNG------GDGISK-LFFTPLLTNP---LSPTF--- 291
Query: 254 PAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFA 313
Y++ +K + +N ++++ S+ I D G GGT + + + Y + + A
Sbjct: 292 ----YYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVR 347
Query: 314 NELEDVPQEKPIAP-FKLCFNSKNLE-----VPAIDFVLQGKGVFWRILGGNSMVQVSRE 367
++ +P + P F LC N + +P + F G VF N ++ +
Sbjct: 348 RRVK-LPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPP-PRNYFIETEEQ 405
Query: 368 VSCLAFVDGGIDATTSI-VIGGYQLEDNLLQFDLVNSRLGFS 408
+ CLA +D VIG + L +FD SRLGFS
Sbjct: 406 IQCLAIQS--VDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFS 445
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 101/441 (22%), Positives = 172/441 (39%), Gaps = 83/441 (18%)
Query: 13 LIIPFIYPSIASTFHPNALVLPVTRDP-ATNQYVTLLHQRT---------PLVPVKLTLD 62
LI P + +ST L + P +++ ++ H T P + + LD
Sbjct: 24 LIFPLTFCKTSSTNQTLLFSLKTQKLPQSSSDKLSFRHNVTLTVTLAVGDPPQNISMVLD 83
Query: 63 LSGQFLWVDCEE----GYV-----SSTYHPAHCHTPQCSITRSKSCVDCYLSKPGCNINT 113
+ W+ C++ G V SSTY P C +P C TR++ L P +
Sbjct: 84 TGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICR-TRTRD-----LPIPA----S 133
Query: 114 CNLFPNNIFTHTNQIGEVALDVVAVHSTDGSNPGKMVIV-----PNFLFTCGRTNLLKGL 168
C+ P H VA+ S +G+ + ++ P LF C + L
Sbjct: 134 CD--PKTHLCH------VAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNS 185
Query: 169 ASGVK--GMAGLGRNNEISVPXXXXXXXXXXXXXXXCLSSSTKSSGVLFFGDGPYVFLPG 226
K G+ G+ R + V C+S S SSG L GD Y +L
Sbjct: 186 EEDAKSTGLMGMNRGSLSFV------NQLGFSKFSYCISGS-DSSGFLLLGDASYSWLGP 238
Query: 227 VDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGE 286
+ YTPL+ ++ P F Y + ++GIR+ K++ L S+ G
Sbjct: 239 IQ------YTPLVLQ--STPLPYFD---RVAYTVQLEGIRVGSKILSLPKSVFVPDHTGA 287
Query: 287 GGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQ--EKPIAPFK----LCFNSKNLEVP 340
G T + + +T + +Y A N F + + V + + P F+ LC+ + P
Sbjct: 288 GQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRP 347
Query: 341 -------------AIDFVLQGKGVFWRILGGNSMVQVSREVSCLAFVDGGIDATTSIVIG 387
+ + G+ + +R+ G S + EV C F + + + VIG
Sbjct: 348 NFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGS--EGKEEVYCFTFGNSDLLGIEAFVIG 405
Query: 388 GYQLEDNLLQFDLVNSRLGFS 408
+ ++ ++FDL SR+GF+
Sbjct: 406 HHHQQNVWMEFDLAKSRVGFA 426
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 73.6 bits (179), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/387 (24%), Positives = 144/387 (37%), Gaps = 69/387 (17%)
Query: 41 TNQYVTLLHQRTPLVPVKLTLDLSGQFLWVDCE-------------EGYVSSTYHPAHCH 87
+ +Y T + P V + LD W+ C E SS+Y P C
Sbjct: 145 SGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCD 204
Query: 88 TPQCSITRSKSCVDCYLSKPGCNINTCNLFPNNIFTHTNQIGEVALDVVAVHSTDGSNPG 147
TPQC+ L C TC L+ + + +G+ A + + + ST
Sbjct: 205 TPQCNA----------LEVSECRNATC-LYEVSYGDGSYTVGDFATETLTIGST------ 247
Query: 148 KMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLSSS 207
+V N CG +N +GL G + CL
Sbjct: 248 ---LVQNVAVGCGHSN--EGL------FVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDR 296
Query: 208 TKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPAAE-YFIGVKGIR 266
S VD SL +PD P+ Y++G+ GI
Sbjct: 297 DSDSA------------STVDFGTSL-------SPDAVVAPLLRNHQLDTFYYLGLTGIS 337
Query: 267 INEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQEKPIA 326
+ +L+Q+ S + + G GG I + T ++T IY++ ++F D+ + +A
Sbjct: 338 VGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVA 397
Query: 327 PFKLCFN---SKNLEVPAIDFVLQGKGVFWRILGGNSMVQV-SREVSCLAFVDGGIDATT 382
F C+N +EVP + F G G + N M+ V S CLAF A++
Sbjct: 398 MFDTCYNLSAKTTVEVPTVAFHFPG-GKMLALPAKNYMIPVDSVGTFCLAFAP---TASS 453
Query: 383 SIVIGGYQLEDNLLQFDLVNSRLGFSS 409
+IG Q + + FDL NS +GFSS
Sbjct: 454 LAIIGNVQQQGTRVTFDLANSLIGFSS 480
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 70.1 bits (170), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 148/382 (38%), Gaps = 66/382 (17%)
Query: 44 YVTLLHQRTPLVPVKLTLDLSGQFLWVDCE-----------EGYVSSTYHPAHCHTPQCS 92
Y+ + TP P+ + LD S W+ C + SS+ C PQC
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCK 147
Query: 93 ITRSKSCVDCYLSKPGCNINTCNLFPNNIFTHTNQIGEVAL--DVVAVHSTDGSNPGKMV 150
+ SC +SK C N T+ E L D + + S
Sbjct: 148 QAPNPSCT---VSK-SCGFN---------MTYGGSTIEAYLTQDTLTLASD--------- 185
Query: 151 IVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLSSSTKS 210
++PN+ F C N G + +G+ GLGR + CL +S S
Sbjct: 186 VIPNYTFGC--INKASGTSLPAQGLMGLGRG---PLSLISQSQNLYQSTFSYCLPNSKSS 240
Query: 211 --SGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRIN 268
SG L G + + TPL+ NP R ++ Y++ + GIR+
Sbjct: 241 NFSGSLRLGPK--------NQPIRIKTTPLLKNP----------RRSSLYYVNLVGIRVG 282
Query: 269 EKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQEKPIAPF 328
K++ + TS L+ GT + YT + Y A N F +++ + F
Sbjct: 283 NKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNA-NATSLGGF 341
Query: 329 KLCFNSKNLEVPAIDFVLQGKGVFWRILGGNSMVQVSR-EVSCLAFVDGGIDATTSI-VI 386
C+ S ++ P++ F+ G V + N ++ S +SCLA ++ + + VI
Sbjct: 342 DTCY-SGSVVFPSVTFMFAGMNV--TLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVI 398
Query: 387 GGYQLEDNLLQFDLVNSRLGFS 408
Q +++ + D+ NSRLG S
Sbjct: 399 ASMQQQNHRVLIDVPNSRLGIS 420
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 68.9 bits (167), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 146/390 (37%), Gaps = 52/390 (13%)
Query: 52 TPLVPVKLTLDLSGQFLWVDCEEGY-------------VSSTYHPAHCHTPQCSITRSKS 98
TP L LD W+ C Y S+++ C+ P+CS+ S
Sbjct: 168 TPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPRCSLISSPD 227
Query: 99 C-VDCYLSKPGCNINTCNLFPNNIF--THTNQIGEVALDVVAVHSTDGSNPGKMVIVPNF 155
V C C P + +N G+ A++ V+ T V N
Sbjct: 228 PPVQCESDNQSC--------PYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNM 279
Query: 156 LFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLSSSTKSSGVLF 215
+F CG N +GL SG G+ GLGR +S+T S L
Sbjct: 280 MFGCGHWN--RGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLI 337
Query: 216 FGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLN 275
FG+ + +L +T + +NS Y+I +K I + K + +
Sbjct: 338 FGEDKDLL-----NHTNLNFTSFVNGKENSVETF--------YYIQIKSILVGGKALDIP 384
Query: 276 TSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANEL-EDVPQEKPIAPFKLCFNS 334
+I +G+GGT I + + Y N FA ++ E+ P + CFN
Sbjct: 385 EETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNV 444
Query: 335 KNLEVPAIDFVLQG----KGVFWRILGGNSMVQVSREVSCLAFVDGGIDATTSIVIGGYQ 390
+E I G G W NS + +S ++ CLA + G +T +IG YQ
Sbjct: 445 SGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAIL--GTPKSTFSIIGNYQ 502
Query: 391 LEDNLLQFDLVNSRLGFSSSLLLTQTTCAN 420
++ + +D SRLGF T T CA+
Sbjct: 503 QQNFHILYDTKRSRLGF------TPTKCAD 526
>AT1G64830.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:24091271-24092566 REVERSE LENGTH=431
Length = 431
Score = 65.1 bits (157), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 148/386 (38%), Gaps = 68/386 (17%)
Query: 43 QYVTLLHQRTPLVPVKLTLDLSGQFLWVDC---EEGYV----------SSTYHPAHCHTP 89
+Y+ + TP VP+ D +W C E+ Y SSTY C +
Sbjct: 85 EYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSS 144
Query: 90 QCSITRSKSCVDCYLSKPGCNINTCNL---FPNNIFTHTNQIGEVALDVVAVHSTDGSNP 146
QC SC + NTC+ + +N +T G+VA+D V + GS+
Sbjct: 145 QCRALEDASC--------STDENTCSYTITYGDNSYTK----GDVAVDTVTM----GSSG 188
Query: 147 GKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLSS 206
+ V + N + CG N G S+ CL
Sbjct: 189 RRPVSLRNMIIGCGHEN--TGTFDPAGSGIIGLGGGSTSL--VSQLRKSINGKFSYCLVP 244
Query: 207 STKSSGV---LFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVK 263
T +G+ + FG V GV VS S++ PA YF+ ++
Sbjct: 245 FTSETGLTSKINFGTNGIVSGDGV-VSTSMV----------------KKDPATYYFLNLE 287
Query: 264 GIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQEK 323
I + K IQ +++ GEG I + T + ++ Y+ + A+ ++ +
Sbjct: 288 AISVGSKKIQFTSTIFGT---GEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQD 344
Query: 324 PIAPFKLCF-NSKNLEVPAIDFVLQGKGVFWRILGGNSMVQVSREVSCLAFVDGGIDATT 382
P LC+ +S + +VP D + KG ++ N+ V VS +VSC AF A
Sbjct: 345 PDGILSLCYRDSSSFKVP--DITVHFKGGDVKLGNLNTFVAVSEDVSCFAFA-----ANE 397
Query: 383 SIVIGGYQLEDN-LLQFDLVNSRLGF 407
+ I G + N L+ +D V+ + F
Sbjct: 398 QLTIFGNLAQMNFLVGYDTVSGTVSF 423
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 63.2 bits (152), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/187 (23%), Positives = 87/187 (46%), Gaps = 16/187 (8%)
Query: 233 LIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEGGTKIS 292
L YTP NP+ S Y++ ++ I + K +++ L+ G G+GG+ +
Sbjct: 282 LTYTPFRKNPNVSNKAFLE-----YYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVD 336
Query: 293 TVNPYTTMETSIYHAFVNAFANELEDVPQEKPIAP---FKLCFN---SKNLEVPAIDFVL 346
+ + +T ME ++ FA+++ + +EK + CFN ++ VP + F
Sbjct: 337 SGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEF 396
Query: 347 QGKGVFWRILGGNSMVQVSREVSCLAFV-DGGIDAT----TSIVIGGYQLEDNLLQFDLV 401
+G L + + CL V D ++ + +I++G +Q ++ L+++DL
Sbjct: 397 KGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLE 456
Query: 402 NSRLGFS 408
N R GF+
Sbjct: 457 NDRFGFA 463
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 58.9 bits (141), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 82/387 (21%), Positives = 147/387 (37%), Gaps = 63/387 (16%)
Query: 40 ATNQYVTLLHQRTPLVPVKLTLDLSGQFLWVDCE--------------EGYVSSTYHPAH 85
+ Y+ + TP + L D W C+ S++Y+
Sbjct: 128 GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVS 187
Query: 86 CHTPQCSITRSKSCVDCYLSKPGCNINTCNLFPNNIFTHTNQIGEVALDVVAVHSTDGSN 145
C + C S + + C+ + C ++ + +G +A + + ++D
Sbjct: 188 CSSAACGSLSSATG-----NAGSCSASNC-IYGIQYGDQSFSVGFLAKEKFTLTNSD--- 238
Query: 146 PGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLS 205
+ F CG N +GL +GV G+ GLGR +++S P CL
Sbjct: 239 -----VFDGVYFGCGENN--QGLFTGVAGLLGLGR-DKLSFP--SQTATAYNKIFSYCLP 288
Query: 206 SSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGI 265
SS +G L FG +S+S+ +TP+ T D G F+G +G + +
Sbjct: 289 SSASYTGHLTFGS--------AGISRSVKFTPISTITD---GTSFYGLNIVAITVGGQKL 337
Query: 266 RINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQEKPI 325
I + +L+ + GT I+ + P Y A ++F ++ P +
Sbjct: 338 PIPSTVFSTPGALI------DSGTVITRLPP------KAYAALRSSFKAKMSKYPTTSGV 385
Query: 326 APFKLCFNS---KNLEVPAIDFVLQGKGVFWRILGGNSMVQVSR-EVSCLAFVDGGIDAT 381
+ CF+ K + +P + F G V LG + V + CLAF G D +
Sbjct: 386 SILDTCFDLSGFKTVTIPKVAFSFSGGAVVE--LGSKGIFYVFKISQVCLAFA-GNSDDS 442
Query: 382 TSIVIGGYQLEDNLLQFDLVNSRLGFS 408
+ + G Q + + +D R+GF+
Sbjct: 443 NAAIFGNVQQQTLEVVYDGAGGRVGFA 469
>AT5G33340.1 | Symbols: CDR1 | Eukaryotic aspartyl protease family
protein | chr5:12594474-12595787 FORWARD LENGTH=437
Length = 437
Score = 54.3 bits (129), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/374 (21%), Positives = 140/374 (37%), Gaps = 106/374 (28%)
Query: 43 QYVTLLHQRTPLVPVKLTLDLSGQFLWVDC---EEGYV----------SSTYHPAHCHTP 89
+Y+ + TP P+ D LW C ++ Y SSTY C +
Sbjct: 89 EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 90 QC-SITRSKSCVDCYLSKPGCNINTCNL---FPNNIFTHTNQIGEVALDVVAVHSTDGSN 145
QC ++ SC N NTC+ + +N +T G +A+D + + S+D
Sbjct: 149 QCTALENQASC--------STNDNTCSYSLSYGDNSYTK----GNIAVDTLTLGSSDT-- 194
Query: 146 PGKMVIVPNFLFTCGRTN--------------------LLKGLASGVKGMAGLGRNNEIS 185
+ + + N + CG N L+K L + G + +
Sbjct: 195 --RPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG-----KFSYCL 247
Query: 186 VPXXXXXXXXXXXXXXXCLSSSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNS 245
VP L+S + + FG V GV + TPLI
Sbjct: 248 VP----------------LTSKKDQTSKINFGTNAIVSGSGV------VSTPLI------ 279
Query: 246 AGPIFHGRPAAE--YFIGVKGIRINEKLIQLNTSLLSIGDEG---EGGTKISTVNPYTTM 300
+ + E Y++ +K I + K IQ + S + + GT + T +
Sbjct: 280 ------AKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTL------TLL 327
Query: 301 ETSIYHAFVNAFANELEDVPQEKPIAPFKLCFNSK-NLEVPAIDFVLQGKGVFWRILGGN 359
T Y +A A+ ++ ++ P + LC+++ +L+VP I G V ++ N
Sbjct: 328 PTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADV--KLDSSN 385
Query: 360 SMVQVSREVSCLAF 373
+ VQVS ++ C AF
Sbjct: 386 AFVQVSEDLVCFAF 399
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 53.9 bits (128), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/381 (21%), Positives = 147/381 (38%), Gaps = 57/381 (14%)
Query: 57 VKLTLDLSGQFLWVDCEEGYVSSTYHPAHCHTPQCSITRSKSCVDCYLSKPGCNINTCN- 115
+ + +D + W+ C SS +P + P TRS S S P C T +
Sbjct: 86 ISMVIDTGSELSWLRCNR---SSNPNPVNNFDP----TRSSSYSPIPCSSPTCRTRTRDF 138
Query: 116 LFPNNIFTHTNQIGEVALDVVAVHSTDGSNPGKMVIVPNFLFTCGRTNLLKGLASGVKG- 174
L P + ++++ L S++G+ ++ +F + +NL+ G V G
Sbjct: 139 LIPASC--DSDKLCHATLSYADASSSEGNLAAEIF---HFGNSTNDSNLIFGCMGSVSGS 193
Query: 175 -------MAGLGRNNEISVPXXXXXXXXXXXXXXXCLSSSTKSSGVLFFGDGPYVFLPGV 227
GL N S+ C+S + G L GD + +L
Sbjct: 194 DPEEDTKTTGLLGMNRGSL---SFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWL--- 247
Query: 228 DVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEG 287
L YTPLI ++ P F Y + + GI++N KL+ + S+L G G
Sbjct: 248 ---TPLNYTPLIR--ISTPLPYFD---RVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAG 299
Query: 288 GTKISTVNPYTTMETSIYHAFVNAFANELEDV--PQEKPIAPFK----LCFNSKNLEV-- 339
T + + +T + +Y A + F N + E P F+ LC+ + +
Sbjct: 300 QTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRS 359
Query: 340 ------PAIDFVLQGKGVFWRILGGNSMVQV------SREVSCLAFVDGGIDATTSIVIG 387
P + V +G + + G + +V + V C F + + + VIG
Sbjct: 360 GILHRLPTVSLVFEGAEI--AVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIG 417
Query: 388 GYQLEDNLLQFDLVNSRLGFS 408
+ ++ ++FDL SR+G +
Sbjct: 418 HHHQQNMWIEFDLQRSRIGLA 438
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 53.1 bits (126), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/177 (24%), Positives = 77/177 (43%), Gaps = 16/177 (9%)
Query: 235 YTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEGGTKISTV 294
YT L+ NP R ++ Y++ + IR+ K++ L + ++ GT +
Sbjct: 287 YTQLLRNP----------RRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSG 336
Query: 295 NPYTTMETSIYHAFVNAFANELEDVPQ-EKPIAPFKLCFNSKNLEVPAIDFVLQGKGVFW 353
YT + +Y A N F ++ + F C+ S ++VP I F+ KGV
Sbjct: 337 TVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY-SGQVKVPTITFMF--KGVNM 393
Query: 354 RILGGNSMVQ-VSREVSCLAFVDGGIDATTSI-VIGGYQLEDNLLQFDLVNSRLGFS 408
+ N M+ + SCLA + + + VI Q +++ + D+ N RLG +
Sbjct: 394 TMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLA 450
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 52.8 bits (125), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 90/392 (22%), Positives = 144/392 (36%), Gaps = 52/392 (13%)
Query: 38 DPATNQYVTLLHQRTPLVPVKLTLDLSGQFLWVDCEEGYVSSTYHPAHCHTPQCSITRSK 97
D T QY T + TP ++ +D + WV+C + + ++S
Sbjct: 100 DYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRY----RARGKDNRRVFRADESKSF 155
Query: 98 SCVDCYLSKPGCNINTCNLFPNNI---------FTHTNQIGEVALDVVAVHS-TDGSNPG 147
V C C ++ NLF + + G A V A + T G G
Sbjct: 156 KTVGCLTQT--CKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNG 213
Query: 148 KMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLS-- 205
+M +P L C + + G G+ GL ++ CL
Sbjct: 214 RMARLPGHLIGCSSSFTGQSF-QGADGVLGLAFSD---FSFTSTATSLYGAKFSYCLVDH 269
Query: 206 -SSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNP-DNSAGPIFHGRPAAEYFIGVK 263
S+ S L FG S+S T P D + P F Y I V
Sbjct: 270 LSNKNVSNYLIFGS-----------SRSTKTAFRRTTPLDLTRIPPF-------YAINVI 311
Query: 264 GIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQEK 323
GI + ++ + + + GGT + + T + + Y V A L ++ + K
Sbjct: 312 GISLGYDMLDIPSQVWDA--TSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVK 369
Query: 324 PIA-PFKLCFNSKN----LEVPAIDFVLQGKGVFWRILGGNSMVQVSREVSCLAFVDGGI 378
P P + CF+ + ++P + F L+G G + + +V + V CL FV G
Sbjct: 370 PEGVPIEYCFSFTSGFNVSKLPQLTFHLKG-GARFEPHRKSYLVDAAPGVKCLGFVSAGT 428
Query: 379 DATTSIVIGGYQLEDNLLQFDLVNSRLGFSSS 410
AT VIG ++ L +FDL+ S L F+ S
Sbjct: 429 PATN--VIGNIMQQNYLWEFDLMASTLSFAPS 458
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 52.4 bits (124), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 91/398 (22%), Positives = 141/398 (35%), Gaps = 84/398 (21%)
Query: 40 ATNQYVTLLHQRTPLVPVKLTLDLSGQFLWVDCE-------------EGYVSSTYHPAHC 86
+ +Y T L TP V + LD +W+ C + S TY C
Sbjct: 138 GSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPC 197
Query: 87 HTPQCSI-------TRSKSCVDCYLSKPGCNINTCNLFPNNIFT-HTNQIGEVALDVVAV 138
+P C TR K+C+ Y G T F T N++ VAL
Sbjct: 198 SSPHCRRLDSAGCNTRRKTCL--YQVSYGDGSFTVGDFSTETLTFRRNRVKGVALG--CG 253
Query: 139 HSTDGSNPGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXX 198
H +G L G G N + S
Sbjct: 254 HDNEG-----------LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY------------ 290
Query: 199 XXXXCL---SSSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPA 255
CL S+S+K S V+F VS+ +TPL++NP +
Sbjct: 291 ----CLVDRSASSKPSSVVF---------GNAAVSRIARFTPLLSNP----------KLD 327
Query: 256 AEYFIGVKGIRIN-EKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFAN 314
Y++G+ GI + ++ + SL + G GG I + T + Y A +AF
Sbjct: 328 TFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRV 387
Query: 315 ELEDVPQEKPIAPFKLCFNSKNL-EVPAIDFVLQGKGVFWRILGGNSMVQVSREVS-CLA 372
+ + + + F CF+ N+ EV VL +G + N ++ V C A
Sbjct: 388 GAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFA 447
Query: 373 FVD--GGIDATTSIVIGGYQLEDNLLQFDLVNSRLGFS 408
F GG+ +IG Q + + +DL +SR+GF+
Sbjct: 448 FAGTMGGLS-----IIGNIQQQGFRVVYDLASSRVGFA 480
>AT4G16563.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:9329933-9331432 REVERSE LENGTH=499
Length = 499
Score = 52.4 bits (124), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/194 (22%), Positives = 88/194 (45%), Gaps = 28/194 (14%)
Query: 233 LIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEGGTKIS 292
++T ++ NP + P F Y + ++GI I ++ I L I G GG +
Sbjct: 304 FVFTEMLENPKH---PYF-------YSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 353
Query: 293 TVNPYTTMETSIYHAFVNAFANEL----EDVPQEKPIAPFKLCFN-SKNLEVPAIDFVLQ 347
+ +T + Y++ V F + + E + +P + C+ ++ ++VPA+
Sbjct: 354 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLNQTVKVPALVLHFA 413
Query: 348 G---------KGVFWRILGGNSMVQVSREVSCLAFVDGGIDAT----TSIVIGGYQLEDN 394
G + F+ + G + R++ CL ++GG ++ T ++G YQ +
Sbjct: 414 GNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGF 473
Query: 395 LLQFDLVNSRLGFS 408
+ +DL+N R+GF+
Sbjct: 474 EVVYDLLNRRVGFA 487
>AT1G79720.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:29997259-29998951 REVERSE LENGTH=484
Length = 484
Score = 51.6 bits (122), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/268 (27%), Positives = 107/268 (39%), Gaps = 47/268 (17%)
Query: 152 VPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLSS-STKS 210
+ NF+F CGR N GL G G+ GLGR+ SV CL S +
Sbjct: 245 LENFVFGCGRNNK--GLFGGSSGLMGLGRS---SVSLVSQTLKTFNGVFSYCLPSLEDGA 299
Query: 211 SGVLFFGDGPYVFLPGVDVSKSLIYTPLITNPDNSAGPIFHGRPAAEYFIGVKGIRINEK 270
SG L FG+ V+ VS YTPL+ NP + + Y + + G I
Sbjct: 300 SGSLSFGNDSSVYTNSTSVS----YTPLVQNP----------QLRSFYILNLTGASIGG- 344
Query: 271 LIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQEKPIAPFKL 330
++L +S G + GT I+ + P SIY A F + P +
Sbjct: 345 -VELKSSSFGRGILIDSGTVITRLPP------SIYKAVKIEFLKQFSGFPTAPGYSILDT 397
Query: 331 CFN---SKNLEVPAIDFVLQGK--------GVFWRILGGNSMVQVSREVSCLAFVDGGID 379
CFN +++ +P I + QG GVF+ + S+V CLA +
Sbjct: 398 CFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLV-------CLALASLSYE 450
Query: 380 ATTSIVIGGYQLEDNLLQFDLVNSRLGF 407
I IG YQ ++ + +D RLG
Sbjct: 451 NEVGI-IGNYQQKNQRVIYDTTQERLGI 477
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 51.2 bits (121), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/355 (21%), Positives = 125/355 (35%), Gaps = 55/355 (15%)
Query: 40 ATNQYVTLLHQRTPLVPVKLTLDLSGQFLWVDCEEGYVSSTYHPAHCHTPQCSITRSKSC 99
+ +++ L P V +D +W C+ P P+ S +
Sbjct: 103 GSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKP-CTECFDQPTPIFDPEKSSS----- 156
Query: 100 VDCYLSKPGCNINTCNLFPNN-----------IFTHTNQIGEVALDVVAVHSTDGSNPGK 148
SK GC+ CN P + ++T+ + L + + N
Sbjct: 157 ----YSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS-- 210
Query: 149 MVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEISVPXXXXXXXXXXXXXXXCLSS-- 206
+ F CG N G + G G+ GLGR CL+S
Sbjct: 211 ---ISGIGFGCGVENEGDGFSQG-SGLVGLGRG------PLSLISQLKETKFSYCLTSIE 260
Query: 207 STKSSGVLFFGD--GPYVFLPGVDVSKSLIYT-PLITNPDNSAGPIFHGRPAAEYFIGVK 263
+++S LF G V G + + T L+ NPD P F Y++ ++
Sbjct: 261 DSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQ---PSF-------YYLELQ 310
Query: 264 GIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTMETSIYHAFVNAFANELEDVPQEK 323
GI + K + + S + ++G GG I + T +E + + F + + +
Sbjct: 311 GITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDS 370
Query: 324 PIAPFKLCFN----SKNLEVPAIDFVLQGKGVFWRILGGNSMV-QVSREVSCLAF 373
LCF +KN+ VP + F KG + G N MV S V CLA
Sbjct: 371 GSTGLDLCFKLPDAAKNIAVPKMIFHF--KGADLELPGENYMVADSSTGVLCLAM 423
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 51.2 bits (121), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 113/291 (38%), Gaps = 28/291 (9%)
Query: 125 TNQIGEVALDVVAVHSTDGSNPGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEI 184
+N G+ A++ V+ T ++ V N +F CG N +GL G AGL
Sbjct: 259 SNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWN--RGL---FHGAAGLLGLGRG 313
Query: 185 SVPXXXXXXXXXXXXXXXCL---SSSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITN 241
+ CL +S T S L FG+ + +L +T +
Sbjct: 314 PLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL-----SHPNLNFTSFVAG 368
Query: 242 PDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTME 301
+N Y++ +K I + +++ + +I +G GGT I + +
Sbjct: 369 KENLVDTF--------YYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFA 420
Query: 302 TSIYHAFVNAFANELE-DVPQEKPIAPFKLCFNSK---NLEVPAIDFVLQGKGVFWRILG 357
Y N A + + P + CFN N+++P + G W
Sbjct: 421 EPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAF-ADGAVWNFPT 479
Query: 358 GNSMVQVSREVSCLAFVDGGIDATTSIVIGGYQLEDNLLQFDLVNSRLGFS 408
NS + ++ ++ CLA + G + +IG YQ ++ + +D SRLG++
Sbjct: 480 ENSFIWLNEDLVCLAML--GTPKSAFSIIGNYQQQNFHILYDTKRSRLGYA 528
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 50.4 bits (119), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 113/291 (38%), Gaps = 28/291 (9%)
Query: 125 TNQIGEVALDVVAVHSTDGSNPGKMVIVPNFLFTCGRTNLLKGLASGVKGMAGLGRNNEI 184
+N G+ A++ V+ T ++ V N +F CG N +GL G AGL
Sbjct: 223 SNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWN--RGL---FHGAAGLLGLGRG 277
Query: 185 SVPXXXXXXXXXXXXXXXCL---SSSTKSSGVLFFGDGPYVFLPGVDVSKSLIYTPLITN 241
+ CL +S T S L FG+ + +L +T +
Sbjct: 278 PLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL-----SHPNLNFTSFVAG 332
Query: 242 PDNSAGPIFHGRPAAEYFIGVKGIRINEKLIQLNTSLLSIGDEGEGGTKISTVNPYTTME 301
+N Y++ +K I + +++ + +I +G GGT I + +
Sbjct: 333 KENLVDTF--------YYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFA 384
Query: 302 TSIYHAFVNAFANELE-DVPQEKPIAPFKLCFNSK---NLEVPAIDFVLQGKGVFWRILG 357
Y N A + + P + CFN N+++P + G W
Sbjct: 385 EPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAF-ADGAVWNFPT 443
Query: 358 GNSMVQVSREVSCLAFVDGGIDATTSIVIGGYQLEDNLLQFDLVNSRLGFS 408
NS + ++ ++ CLA + G + +IG YQ ++ + +D SRLG++
Sbjct: 444 ENSFIWLNEDLVCLAML--GTPKSAFSIIGNYQQQNFHILYDTKRSRLGYA 492