Miyakogusa Predicted Gene
- Lj6g3v1880270.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1880270.1 Non Chatacterized Hit- tr|K4AXN7|K4AXN7_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,44.63,5e-16,Asp,Peptidase A1; seg,NULL; BASIC 7S
GLOBULIN-RELATED,NULL; ASPARTYL PROTEASES,Peptidase A1; Acid
pr,CUFF.60099.1
(427 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family pr... 234 7e-62
AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr... 227 1e-59
AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family pr... 151 7e-37
AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family pr... 150 2e-36
AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 3e-25
AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family pr... 112 5e-25
AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family pr... 74 2e-13
AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family pr... 72 7e-13
AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family pr... 72 1e-12
AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 70 2e-12
AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family pr... 69 5e-12
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr... 68 2e-11
AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic asparty... 65 1e-10
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr... 64 2e-10
AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family pr... 62 1e-09
AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family pr... 61 1e-09
AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family pr... 60 3e-09
AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family pr... 60 3e-09
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr... 58 1e-08
AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family pr... 58 2e-08
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr... 57 2e-08
AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family pr... 57 2e-08
AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family pr... 57 3e-08
AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family pr... 56 5e-08
AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family pr... 56 6e-08
AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family pr... 55 1e-07
AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family pr... 54 2e-07
AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family pr... 50 3e-06
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr... 50 4e-06
>AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:787143-788444 FORWARD LENGTH=433
Length = 433
Score = 234 bits (598), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 154/438 (35%), Positives = 215/438 (49%), Gaps = 38/438 (8%)
Query: 11 LFSIALFSVPCLSISHSPNSKPHPFLLPIKKDPATNVFYTSIGIGTPQQNFNVAIDLAGE 70
+FS+ L + LS S +P LLP+ KD +T + T I TP +V DL G
Sbjct: 7 IFSVLLLFIFSLSSSAQTPFRPKALLLPVTKDQSTLQYTTVINQRTPLVPASVVFDLGGR 66
Query: 71 NLWYECNNHYNSSSFHPIICESNKCPK-NTHACSFCQGQFRPXXXXXXXXXXXXXPLAQV 129
LW +C+ Y SS++ C S C + + +C C RP +
Sbjct: 67 ELWVDCDKGYVSSTYQSPRCNSAVCSRAGSTSCGTCFSPPRPGCSNNTCGGIPDNTVTGT 126
Query: 130 LFPGDLAEDVVSISQ-------------NQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIG 176
G+ A DVVSI N +F G LL+ L K + G+ G
Sbjct: 127 ATSGEFALDVVSIQSTNGSNPGRVVKIPNLIFDC--------GATFLLKGLAKGTVGMAG 178
Query: 177 LARSQLALPTQLALLKKLPPKFSLCLPSSNNIGFTN-----LLIGTEEHPLSKYMQTTPL 231
+ R + LP+Q A KF++CL S + F L G + L QTTPL
Sbjct: 179 MGRHNIGLPSQFAAAFSFHRKFAVCLTSGKGVAFFGNGPYVFLPGIQISSL----QTTPL 234
Query: 232 ILNPVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKD-GNGGTRMSTMTRF 290
++NPV T F +G S+E+FI VT+++I + V + P+LL I G GGT++S++ +
Sbjct: 235 LINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGIGGTKISSVNPY 294
Query: 291 AELQSSVYKPFILDFIKKASDRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRG 350
L+SS+Y F +F+K+A+ R +KRVASV PF ACF +G +R G AVP I+LVL
Sbjct: 295 TVLESSIYNAFTSEFVKQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHS 354
Query: 351 -GAVWTIHGANSMVMVKKNVACLGFVDGGTIGTMSFVKASIVLGAHQLEENLLMFDXXXX 409
VW I GANSMV V +V CLGFVDGG + S+V+G QLE+NL+ FD
Sbjct: 355 KDVVWRIFGANSMVSVSDDVICLGFVDGGVNA-----RTSVVIGGFQLEDNLIEFDLASN 409
Query: 410 XXXXXXXXXXXXXXCSNF 427
C+NF
Sbjct: 410 KFGFSSTLLGRQTNCANF 427
>AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:790110-791414 FORWARD LENGTH=434
Length = 434
Score = 227 bits (579), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 151/415 (36%), Positives = 209/415 (50%), Gaps = 26/415 (6%)
Query: 28 PNSKPHPFLLPIKKDPATNVFYTSIGIGTPQQNFNVAIDLAGENLWYECNNHYNSSSFHP 87
P+ +P LLP+ KDP+T + T I TP +V DL G W +C+ Y S+++
Sbjct: 25 PSFRPKALLLPVTKDPSTLQYTTVINQRTPLVPASVVFDLGGREFWVDCDQGYVSTTYRS 84
Query: 88 IICESNKCPK-NTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSISQNQ 146
C S C + + AC C RP + G+ A DVVSI
Sbjct: 85 PRCNSAVCSRAGSIACGTCFSPPRPGCSNNTCGAFPDNSITGWATSGEFALDVVSIQSTN 144
Query: 147 VFGVSSG-------CTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFS 199
G + G S G LL+ L K + G+ G+ R + LP Q A KF+
Sbjct: 145 --GSNPGRFVKIPNLIFSCGSTSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNRKFA 202
Query: 200 LCLPSSNNIGFTN-----LLIGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFID 254
+CL S + F L G + +S+ +Q TPL++NP T EF +G S E+FI
Sbjct: 203 VCLTSGRGVAFFGNGPYVFLPGIQ---ISR-LQKTPLLINPGTTVFEFSKGEKSPEYFIG 258
Query: 255 VTSVKIDGQVVNLKPSLLSIKKD-GNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRR 313
VT++KI + + + P+LL I G GGT++S++ + L+SS+YK F +FI++A+ R
Sbjct: 259 VTAIKIVEKTLPIDPTLLKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQAAARS 318
Query: 314 MKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRG-GAVWTIHGANSMVMVKKNVACL 372
+KRVASV PF ACF +G +R G AVP I LVL VW I GANSMV V +V CL
Sbjct: 319 IKRVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDVICL 378
Query: 373 GFVDGGTIGTMSFVKASIVLGAHQLEENLLMFDXXXXXXXXXXXXXXXXXXCSNF 427
GFVDGG AS+V+G QLE+NL+ FD C+NF
Sbjct: 379 GFVDGGVN-----PGASVVIGGFQLEDNLIEFDLASNKFGFSSTLLGRQTNCANF 428
>AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6411720-6413170 REVERSE LENGTH=405
Length = 405
Score = 151 bits (382), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 120/389 (30%), Positives = 177/389 (45%), Gaps = 53/389 (13%)
Query: 35 FLLPIKKDPATNVFYTSIGIGTPQQN-FNVAIDLAGENLWYECNNHYNSSSFHPIICESN 93
+LLPI K TN+FYT+ +G+ ++ N+ +DL W +C + SS + C+S+
Sbjct: 26 YLLPITKHEPTNLFYTTFNVGSAAKSPVNLLLDLGTNLTWLDCRKLKSLSSLRLVTCQSS 85
Query: 94 KC---PKNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQ-VLFPGDLAEDVVSI------- 142
C P N A C PL Q + G + +D S+
Sbjct: 86 TCKSIPGNGCAGKSC-------------LYKQPNPLGQNPVVTGRVVQDRASLYTTDGGK 132
Query: 143 --SQNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSL 200
SQ V + C GL P G++ L+ + Q+ + PKFSL
Sbjct: 133 FLSQVSVRHFTFSCAGEKALQGL----PPPVDGVLALSPGSSSFTKQVTSAFNVIPKFSL 188
Query: 201 CLPSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVKI 260
CLPSS F I P + P L P+ +G S ++ I V S+ +
Sbjct: 189 CLPSSGTGHFYIAGIHYFIPPFNSSDNPIPRTLTPI-------KGTDSGDYLITVKSIYV 241
Query: 261 DGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASV 320
G + L P LL+ GG ++ST+ + LQ+ +Y F KA + +V SV
Sbjct: 242 GGTALKLNPDLLT------GGAKLSTVVHYTVLQTDIYNALAQSFTLKAKAMGIAKVPSV 295
Query: 321 APFEACFDVTTIGNSRT-GLAVPSIDLVLRG--GAV-WTIHGANSMVMVKKNVACLGFVD 376
APF+ CFD T G + T G VP I++ L G G V W +GAN++V VK+ V CL F+D
Sbjct: 296 APFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRIGEVKWGFYGANTVVKVKETVMCLAFID 355
Query: 377 GGTIGTMSFVKASIVLGAHQLEENLLMFD 405
GG K +V+G HQL++++L FD
Sbjct: 356 GGKT-----PKDLMVIGTHQLQDHMLEFD 379
>AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6408242-6409417 REVERSE LENGTH=391
Length = 391
Score = 150 bits (378), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 127/385 (32%), Positives = 179/385 (46%), Gaps = 64/385 (16%)
Query: 35 FLLPIKKDPATNVFYTSIGIG-TPQQNFNVAIDLAGEN-LWYECNNHYNSSSFHPIICES 92
FL PI KD A N++ + IG T + F +DL G L C S+++HPI C S
Sbjct: 30 FLHPIYKDTAKNIYTIPLSIGSTSSEKF--VLDLNGAAPLLQNCPTAAKSTTYHPIRCGS 87
Query: 93 NKCPKNTHACSFCQGQFR-PXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSI--SQNQVFG 149
+C + F P LF D V + + N V+
Sbjct: 88 TRC-------KYANPNFPCPNNVIAKKRTVCLSSDNSRLF-----RDTVPLLYTFNGVYT 135
Query: 150 VSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPSSNNIG 209
S ++S + P Q IGLA + L++P+QL + +LP K +LCLPS+
Sbjct: 136 RDSEMSSSLTLT-CTDGAPALKQRTIGLANTHLSIPSQLISMYQLPHKIALCLPSTERSQ 194
Query: 210 FTN--LLIGTEEH-------PLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVKI 260
N L IG E+ +SK +TPLI N S E+ IDV S++I
Sbjct: 195 SHNGDLWIGKGEYYYLPYDKDVSKIFASTPLIGNG-----------KSGEYLIDVKSIQI 243
Query: 261 DGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASV 320
+ V + G T++ST+ + Q+S+YK + F + + ++ + +V
Sbjct: 244 GAKTVPIP----------YGATKISTLAPYTVFQTSLYKALLTAFTE---NIKIAKAPAV 290
Query: 321 APFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKNVACLGFVDGGTI 380
PF ACF S G VP IDLVL GGA W I+G+NS+V V KNV CLGFVDGG
Sbjct: 291 KPFGACF------YSNGGRGVPVIDLVLSGGAKWRIYGSNSLVKVNKNVVCLGFVDGGVK 344
Query: 381 GTMSFVKASIVLGAHQLEENLLMFD 405
K IV+G Q+E+NL+ FD
Sbjct: 345 P-----KYPIVIGGFQMEDNLVEFD 364
>AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:19627892-19629112 REVERSE LENGTH=406
Length = 406
Score = 112 bits (281), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 79/253 (31%), Positives = 117/253 (46%), Gaps = 36/253 (14%)
Query: 163 LLEKLPKSSQGIIGLARSQLALPTQLALLK-KLPPKFSLCLPSSNNIGFTNLLIGTEEHP 221
L P G+ GLA + LA QL + L KF+LCLPS +E+P
Sbjct: 154 FLVDFPPGVFGLAGLAPTALATWNQLTRPRLGLEKKFALCLPS-------------DENP 200
Query: 222 LSK---YMQTTPLILNPVDTGPEFEEGVPSTE------HFIDVTSVKIDGQVVNLKPSLL 272
L K Y P L +D T +F+ + + ++G + P+
Sbjct: 201 LKKGAIYFGGGPYKLRNIDARSMLSYTRLITNPRKLNNYFLGLKGISVNGNRILFAPNAF 260
Query: 273 SIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASVAPFEACFDVTTI 332
+ ++G+GG +ST+ F L+S +Y+ FI F + S + RV+S PFE C TT
Sbjct: 261 AFDRNGDGGVTLSTIFPFTMLRSDIYRVFIEAFSQATSG--IPRVSSTTPFEFCLSTTT- 317
Query: 333 GNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKNVACLGFVDGGTIGTMSFVKASIVL 392
VP IDL L G +W + AN+M V +VACL FV+GG ++++
Sbjct: 318 -----NFQVPRIDLELANGVIWKLSPANAMKKVSDDVACLAFVNGGDAAAQ-----AVMI 367
Query: 393 GAHQLEENLLMFD 405
G HQ+E L+ FD
Sbjct: 368 GIHQMENTLVEFD 380
>AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6414585-6415745 FORWARD LENGTH=386
Length = 386
Score = 112 bits (280), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 117/411 (28%), Positives = 180/411 (43%), Gaps = 50/411 (12%)
Query: 1 MTYSSVIHFFLFSIALFSVPCLSISHSPNSKP-HPFLLPIKKDPATNVFYTSIGIGTPQQ 59
M SS ++ F FS + L IS S S + + P+ KD T + I +G
Sbjct: 1 MASSSCLNLFFFSF----LSALIISKSQISDSVNGVVFPVVKDLPTGQYLAQIRLGDSPD 56
Query: 60 NFNVAIDLAGENLWYECNNHYNSSSFHPIICESNKCPKNTHACSFCQGQFRPXXXXXXXX 119
+ +DLAG LW++C++ + SSS + I S+ C K
Sbjct: 57 PVKLVVDLAGSILWFDCSSRHVSSSRNLISGSSSGCLKAKVGNERVSSSSSSRKDQNADC 116
Query: 120 XXXXXPLA-QVLFPGDLAEDVVSISQNQVFGVSS---GCTNSDGFNGLLEKLPKSSQGII 175
A + G+L DV+S+ G CT LL L +QG++
Sbjct: 117 ELLVKNDAFGITARGELFSDVMSVGSVTSPGTVDLLFACTPP----WLLRGLASGAQGVM 172
Query: 176 GLARSQLALPTQLALLKKLPPKFSLCLPSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNP 235
GL R+Q++LP+QLA + ++ L N + T+ + S+ + TPL+
Sbjct: 173 GLGRAQISLPSQLAAETNERRRLTVYLSPLNGVVSTSSVEEVFGVAASRSLVYTPLL--- 229
Query: 236 VDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQS 295
TG S + I+V S++++G+ ++++ L +ST+ + L+S
Sbjct: 230 --TGS-------SGNYVINVKSIRVNGEKLSVEGPL---------AVELSTVVPYTILES 271
Query: 296 SVYKPFILDFIKKASDRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAV-W 354
S+YK F + K A + V VAPF CF + + P++DL L+ V W
Sbjct: 272 SIYKVFAEAYAKAAGE--ATSVPPVAPFGLCFT--------SDVDFPAVDLALQSEMVRW 321
Query: 355 TIHGANSMVMVKKNVACLGFVDGGTIGTMSFVKASIVLGAHQLEENLLMFD 405
IHG N MV V V C G VDGG+ S V IV+G QLE +L FD
Sbjct: 322 RIHGKNLMVDVGGGVRCSGIVDGGS----SRVNP-IVMGGLQLEGFILDFD 367
>AT2G39710.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:16562051-16563379 REVERSE LENGTH=442
Length = 442
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 101/432 (23%), Positives = 173/432 (40%), Gaps = 63/432 (14%)
Query: 8 HFFLFSIALFSVPCLSISHSPNSKPHPFLLPIKKDPAT---------NVFYT-SIGIGTP 57
+F S+ L P S ++ F L +K P + NV T ++ +G P
Sbjct: 15 NFLRISVLLLIFPLTFCKTSSTNQTLLFSLKTQKLPQSSSDKLSFRHNVTLTVTLAVGDP 74
Query: 58 QQNFNVAIDLAGENLWYECN---------NHYNSSSFHPIICESNKCPKNTHACSFCQGQ 108
QN ++ +D E W C N +SS++ P+ C S C T
Sbjct: 75 PQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTRDLPI-PAS 133
Query: 109 FRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSISQNQVFGVSSGCTNSDGFNGLLEKLP 168
P + G+LA + I G GC +S G + E+
Sbjct: 134 CDPKTHLCHVAISYADATS---IEGNLAHETFVIGSVTRPGTLFGCMDS-GLSSNSEEDA 189
Query: 169 KSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPSSNNIGFTNLLIGTEEHPLSKYMQT 228
KS+ G++G+ R L+ QL KFS C+ S++ GF LL+G + +Q
Sbjct: 190 KST-GLMGMNRGSLSFVNQLGF-----SKFSYCISGSDSSGF--LLLGDASYSWLGPIQY 241
Query: 229 TPLILNPVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMT 288
TPL+L P F+ + + + +++ ++++L S+ G G T + + T
Sbjct: 242 TPLVLQSTPL-PYFDR----VAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGT 296
Query: 289 RFAELQSSVYKPFILDFIKKASDRRMKRVASVAPF------EACFDV-TTIGNSRTGLAV 341
+F L VY +FI + + + R+ F + C+ V +T + +GL
Sbjct: 297 QFTFLMGPVYTALKNEFITQT--KSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL-- 352
Query: 342 PSIDLVLRGGA--------VWTIHGANSMVMVKKNVACLGFVDGGTIGTMSFVKASIVLG 393
P + L+ RG ++ ++GA S K+ V C F + +G +F V+G
Sbjct: 353 PMVSLMFRGAEMSVSGQKLLYRVNGAGS--EGKEEVYCFTFGNSDLLGIEAF-----VIG 405
Query: 394 AHQLEENLLMFD 405
H + + FD
Sbjct: 406 HHHQQNVWMEFD 417
>AT5G36260.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:14285068-14288179 REVERSE LENGTH=482
Length = 482
Score = 72.0 bits (175), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 92/402 (22%), Positives = 162/402 (40%), Gaps = 78/402 (19%)
Query: 37 LPIKKDPATN---VFYTSIGIGTPQQNFNVAIDLAGENLWYECN---------------- 77
LP+ D + +++T I +G+P + + V +D + LW C
Sbjct: 64 LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLS 123
Query: 78 --NHYNSSSFHPIICESNKCP--KNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPG 133
+ SS+ + CE + C + C G +P G
Sbjct: 124 LYDSKTSSTSKNVGCEDDFCSFIMQSETC----GAKKPCSYHVVYGDGSTSD-------G 172
Query: 134 DLAEDVVSISQNQVFG----------VSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLA 183
D +D +I+ QV G V GC + +G L + + GI+G +S +
Sbjct: 173 DFIKD--NITLEQVTGNLRTAPLAQEVVFGCGKNQ--SGQLGQTDSAVDGIMGFGQSNTS 228
Query: 184 LPTQLALLKKLPPKFSLCLPSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNPVDTGPEFE 243
+ +QLA FS CL + N G +G E P+ K TTP++ N V
Sbjct: 229 IISQLAAGGSTKRIFSHCLDNMNGGGI--FAVGEVESPVVK---TTPIVPNQV------- 276
Query: 244 EGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFIL 303
+ + + + +DG ++L PSL S +G+GGT + + T A L ++Y
Sbjct: 277 ------HYNVILKGMDVDGDPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNS--- 325
Query: 304 DFIKKASDRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMV 363
I+K + ++ ++ V ACF T S T A P ++L +++ + +
Sbjct: 326 -LIEKITAKQQVKLHMVQETFACFSFT----SNTDKAFPVVNLHFEDSLKLSVYPHDYLF 380
Query: 364 MVKKNVACLGFVDGGTIGTMSFVKASIVLGAHQLEENLLMFD 405
+++++ C G+ GG T I+LG L L+++D
Sbjct: 381 SLREDMYCFGWQSGGM--TTQDGADVILLGDLVLSNKLVVYD 420
>AT1G08210.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:2577119-2580581 REVERSE LENGTH=492
Length = 492
Score = 71.6 bits (174), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 154/387 (39%), Gaps = 68/387 (17%)
Query: 46 NVFYTSIGIGTPQQNFNVAIDLAGENLWYECNN----------HYNSSSFHPIICE---- 91
++YT + +GTP + FNV ID + LW C + S F P +
Sbjct: 82 GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141
Query: 92 ----SNKCPKNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSISQNQV 147
+C N S C P + D I+
Sbjct: 142 VSCSDRRCYSNFQTESGCS----PNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLA 197
Query: 148 FGVSS----GCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLP 203
S+ GC+N +G L++ ++ GI GL + L++ +QLA+ P FS CL
Sbjct: 198 INSSAPFVFGCSNLQ--SGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLK 255
Query: 204 SSNNIGFTNLLIGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHF-IDVTSVKIDG 262
+ G +++G + P + Y TPL VPS H+ +++ S+ ++G
Sbjct: 256 GDKSGGGI-MVLGQIKRPDTVY---TPL--------------VPSQPHYNVNLQSIAVNG 297
Query: 263 QVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASVAP 322
Q++ + PS+ +I GT + T T A L Y PFI S + R +
Sbjct: 298 QILPIDPSVFTIAT--GDGTIIDTGTTLAYLPDEAYSPFIQAVANAVS--QYGRPITYES 353
Query: 323 FEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKNVACLGFVDGGTIGT 382
++ CF++T G+ P + L GGA SMV+ + + G +I
Sbjct: 354 YQ-CFEITA-GDVDV---FPQVSLSFAGGA--------SMVLGPRAYLQIFSSSGSSIWC 400
Query: 383 MSFVKAS----IVLGAHQLEENLLMFD 405
+ F + S +LG L++ ++++D
Sbjct: 401 IGFQRMSHRRITILGDLVLKDKVVVYD 427
>AT3G12700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:4037136-4039043 FORWARD LENGTH=461
Length = 461
Score = 70.5 bits (171), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 96/387 (24%), Positives = 155/387 (40%), Gaps = 62/387 (16%)
Query: 42 DPATNVFYTSIGIGTPQQNFNVAIDLAGENLWYECNNHYN------------SSSFHPII 89
D T ++T I +GTP + F V +D E W C S SF +
Sbjct: 100 DYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVG 159
Query: 90 CESNKCP---KNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSI---- 142
C + C N + + C P AQ +F A++ +++
Sbjct: 160 CLTQTCKVDLMNLFSLTTCP---TPSTPCSYDYRYADGSAAQGVF----AKETITVGLTN 212
Query: 143 -SQNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLC 201
++ G GC++S F G + + + G++GLA S + + L KFS C
Sbjct: 213 GRMARLPGHLIGCSSS--FTG---QSFQGADGVLGLAFSDFSFTSTATSL--YGAKFSYC 265
Query: 202 LPSS-NNIGFTNLLI-GTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVK 259
L +N +N LI G+ + + +TTPL L + P F + I+V +
Sbjct: 266 LVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRI---PPF--------YAINVIGIS 314
Query: 260 IDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVA- 318
+ ++++ PS + G GGT + + T L + YK + + + +KRV
Sbjct: 315 LGYDMLDI-PSQVWDATSG-GGTILDSGTSLTLLADAAYKQVVTGLARYLVE--LKRVKP 370
Query: 319 SVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKNVACLGFVDGG 378
P E CF T+ N +P + L+GGA + H + +V V CLGFV G
Sbjct: 371 EGVPIEYCFSFTSGFNVS---KLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAG 427
Query: 379 TIGTMSFVKASIVLGAHQLEENLLMFD 405
T A+ V+G + L FD
Sbjct: 428 T-------PATNVIGNIMQQNYLWEFD 447
>AT5G22850.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:7633717-7636298 REVERSE LENGTH=493
Length = 493
Score = 69.3 bits (168), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 93/367 (25%), Positives = 146/367 (39%), Gaps = 71/367 (19%)
Query: 42 DP-ATNVFYTSIGIGTPQQNFNVAIDLAGENLWYECN---------------NHYN---S 82
DP ++YT + +GTP ++F V +D + LW C N ++ S
Sbjct: 74 DPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSS 133
Query: 83 SSFHPIICESNKCPKNTHA----CS----FCQGQFRPXXXXXXXXXXXXXPLAQVLFPGD 134
+ PI C +C + CS C F+ L + G
Sbjct: 134 VTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGS 193
Query: 135 LAEDVVSISQNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKL 194
S+ N V GC+ S G L K ++ GI G + +++ +QLA
Sbjct: 194 ------SLVPNSTAPVVFGCSTSQ--TGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245
Query: 195 PPKFSLCLPSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHF-I 253
P FS CL N G L++G P M TPL VPS H+ +
Sbjct: 246 PRVFSHCLKGENGGGGI-LVLGEIVEP---NMVFTPL--------------VPSQPHYNV 287
Query: 254 DVTSVKIDGQVVNLKPSLLSIKKDGNG-GTRMSTMTRFAELQSSVYKPFILDFIKKASDR 312
++ S+ ++GQ + + PS+ S NG GT + T T A L + Y PF+ + I A +
Sbjct: 288 NLLSISVNGQALPINPSVFSTS---NGQGTIIDTGTTLAYLSEAAYVPFV-EAITNAVSQ 343
Query: 313 RMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKNVA-- 370
++ V S C+ +TT G P + L GGA ++ + ++ + NV
Sbjct: 344 SVRPVVSKG--NQCYVITT----SVGDIFPPVSLNFAGGASMFLNPQDYLIQ-QNNVGGT 396
Query: 371 ---CLGF 374
C+GF
Sbjct: 397 AVWCIGF 403
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 67.8 bits (164), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 91/382 (23%), Positives = 157/382 (41%), Gaps = 74/382 (19%)
Query: 31 KPHPFLL-PIKKDPATNV--FYTSIGIGTPQQNFNVAIDLAGENLWYECN-----NHY-- 80
KP PF+ P+ A+ ++ + IG P Q+ + D + +W +C+ +H+
Sbjct: 64 KPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSP 123
Query: 81 -------NSSSFHPIICESNKC---PK--------NTHACSFCQGQFRPXXXXXXXXXXX 122
+SS+F P C C PK +T S C ++
Sbjct: 124 ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFA 183
Query: 123 XXPLAQVLFPGDLAEDVVSISQNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQL 182
+ G A + S++ F +S + FNG + G++GL R +
Sbjct: 184 RETTSLKTSSGKEAR-LKSVAFGCGFRISGQSVSGTSFNG--------ANGVMGLGRGPI 234
Query: 183 ALPTQLALLKKLPPKFSLCL--------PSSNNIGFTNLLIGTEEHPLSKYMQTTPLILN 234
+ +QL ++ KFS CL P+S L+IG +SK + TPL+ N
Sbjct: 235 SFASQLG--RRFGNKFSYCLMDYTLSPPPTSY------LIIGNGGDGISK-LFFTPLLTN 285
Query: 235 PVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQ 294
P+ P F +++ + SV ++G + + PS+ I GNGGT + + T A L
Sbjct: 286 PLS--PTF--------YYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLA 335
Query: 295 SSVYKPFILDFIKKASDRRMKR--VASVAP-FEACFDVTTIGNSRTGLAVPSIDLVLRGG 351
Y+ I A RR+K ++ P F+ C +V+ G ++ +P + GG
Sbjct: 336 EPAYRSVI-----AAVRRRVKLPIADALTPGFDLCVNVS--GVTKPEKILPRLKFEFSGG 388
Query: 352 AVWTIHGANSMVMVKKNVACLG 373
AV+ N + ++ + CL
Sbjct: 389 AVFVPPPRNYFIETEEQIQCLA 410
>AT5G02190.1 | Symbols: EMB24, ATASP38, PCS1 | Eukaryotic aspartyl
protease family protein | chr5:435322-436683 FORWARD
LENGTH=453
Length = 453
Score = 64.7 bits (156), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/383 (21%), Positives = 149/383 (38%), Gaps = 69/383 (18%)
Query: 57 PQQNFNVAIDLAGENLWYECNNHYN-----------SSSFHPIICESNKC---------P 96
P QN ++ ID E W CN N SSS+ PI C S C P
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141
Query: 97 KNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSISQNQVFGVSSGCTN 156
+ + C A++ G+ D N +FG +
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLA----AEIFHFGNSTND-----SNLIFGCMGSVSG 192
Query: 157 SDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPSSNNI-GFTNLLI 215
SD E+ K++ G++G+ R L+ +Q+ PKFS C+ +++ GF LL+
Sbjct: 193 SDP-----EEDTKTT-GLLGMNRGSLSFISQMGF-----PKFSYCISGTDDFPGF--LLL 239
Query: 216 GTEEHPLSKYMQTTPLIL--NPVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLS 273
G + TPLI P+ P F+ + + +T +K++G+++ + S+L
Sbjct: 240 GDSNFTWLTPLNYTPLIRISTPL---PYFDR----VAYTVQLTGIKVNGKLLPIPKSVLV 292
Query: 274 IKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASD----RRMKRVASVAPFEACFDV 329
G G T + + T+F L VY F+ + + + C+ +
Sbjct: 293 PDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRI 352
Query: 330 TTIGNSRTGL--AVPSIDLVLRGGAVWT-----IHGANSMVMVKKNVACLGFVDGGTIGT 382
+ + R+G+ +P++ LV G + ++ + + +V C F + +G
Sbjct: 353 SPV-RIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGM 411
Query: 383 MSFVKASIVLGAHQLEENLLMFD 405
++ V+G H + + FD
Sbjct: 412 EAY-----VIGHHHQQNMWIEFD 429
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 64.3 bits (155), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/349 (22%), Positives = 148/349 (42%), Gaps = 69/349 (19%)
Query: 48 FYTSIGIGTPQQNFNVAIDLAGENLWYECN---NHYNSSS----------FHPIICESNK 94
++T +GIG P + + +D + W +C + Y+ + + P+ C++ +
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 207
Query: 95 CPKNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFP------GDLAEDVVSISQNQVF 148
C N S C+ L +V + GD A + ++I V
Sbjct: 208 C--NALEVSECRN---------------ATCLYEVSYGDGSYTVGDFATETLTIGSTLVQ 250
Query: 149 GVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPSSNNI 208
V+ GC +S+ E L + G++GL LALP+QL FS CL ++
Sbjct: 251 NVAVGCGHSN------EGLFVGAAGLLGLGGGLLALPSQLN-----TTSFSYCLVDRDSD 299
Query: 209 GFTNLLIGTEEHPLSKYMQTTPLILN-PVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNL 267
+ + GT LS PL+ N +DT +++ +T + + G+++ +
Sbjct: 300 SASTVDFGTS---LSPDAVVAPLLRNHQLDTF-----------YYLGLTGISVGGELLQI 345
Query: 268 KPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASVAPFEACF 327
S + + G+GG + + T LQ+ +Y F+K D +++ A VA F+ C+
Sbjct: 346 PQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLD--LEKAAGVAMFDTCY 403
Query: 328 DVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKK-NVACLGFV 375
+++ ++T + VP++ GG + + N M+ V CL F
Sbjct: 404 NLS----AKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFA 448
>AT3G54400.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:20140291-20142599 REVERSE LENGTH=425
Length = 425
Score = 61.6 bits (148), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/298 (20%), Positives = 122/298 (40%), Gaps = 60/298 (20%)
Query: 54 IGTPQQNFNVAIDLAGENLWYECN-----------NHYNSSSFHPIICESNKCPKN---- 98
IGTP Q VA+D + + W C+ + SSS + CE+ +C +
Sbjct: 94 IGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPS 153
Query: 99 ---THACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSISQNQVFGVSSGCT 155
+ +C F L +D ++++ + + + GC
Sbjct: 154 CTVSKSCGF------------------NMTYGGSTIEAYLTQDTLTLASDVIPNYTFGCI 195
Query: 156 NSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPSSNNIGFT-NLL 214
N L +QG++GL R L+L +Q L + FS CLP+S + F+ +L
Sbjct: 196 NKASGTSL------PAQGLMGLGRGPLSLISQSQNLYQ--STFSYCLPNSKSSNFSGSLR 247
Query: 215 IGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSI 274
+G + P+ ++TTPL+ NP S+ +++++ +++ ++V++ S L+
Sbjct: 248 LGPKNQPIR--IKTTPLLKNPRR----------SSLYYVNLVGIRVGNKIVDIPTSALAF 295
Query: 275 KKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASVAPFEACFDVTTI 332
GT + T + L Y +F ++ + S+ F+ C+ + +
Sbjct: 296 DPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKN---ANATSLGGFDTCYSGSVV 350
>AT1G01300.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:117065-118522 FORWARD LENGTH=485
Length = 485
Score = 61.2 bits (147), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 163/380 (42%), Gaps = 46/380 (12%)
Query: 19 VPCLSISHSPNSKPHPFLLPIKK--DPATNVFYTSIGIGTPQQNFNVAIDLAGENLWYEC 76
+P +++H+P +P F + + ++T +G+GTP + + +D + +W +C
Sbjct: 113 IPGRNVTHAP--RPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC 170
Query: 77 NNHYNSSSFHPIICESNKCPKNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFP---- 132
S I + K T+A C L QV +
Sbjct: 171 APCRRCYSQSDPIFDPRK--SKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSF 228
Query: 133 --GDLAEDVVSISQNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLAL 190
GD + + ++ +N+V GV+ GC + + E L + G++GL + +L+ P Q
Sbjct: 229 TVGDFSTETLTFRRNRVKGVALGCGHDN------EGLFVGAAGLLGLGKGKLSFPGQTG- 281
Query: 191 LKKLPPKFSLCL-PSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNP-VDTGPEFEEGVPS 248
+ KFS CL S + ++++ G +S+ + TPL+ NP +D
Sbjct: 282 -HRFNQKFSYCLVDRSASSKPSSVVFGNAA--VSRIARFTPLLSNPKLD----------- 327
Query: 249 TEHFIDVTSVKIDG-QVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIK 307
T +++ + + + G +V + SL + + GNGG + + T L Y F
Sbjct: 328 TFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF-- 385
Query: 308 KASDRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKK 367
+ + +KR + F+ CFD++ + + VP++ L RG V ++ N ++ V
Sbjct: 386 RVGAKTLKRAPDFSLFDTCFDLSNMNE----VKVPTVVLHFRGADV-SLPATNYLIPVDT 440
Query: 368 NVA-CLGFVDGGTIGTMSFV 386
N C F GT+G +S +
Sbjct: 441 NGKFCFAFA--GTMGGLSII 458
>AT4G35880.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr4:16993339-16995721 FORWARD LENGTH=524
Length = 524
Score = 60.1 bits (144), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/347 (22%), Positives = 139/347 (40%), Gaps = 57/347 (16%)
Query: 49 YTSIGIGTPQQNFNVAIDLAGENLWYECN-------------NHYNSSSFHPIICESNKC 95
YT++ +GTP F VA+D + W C+ + + S ++P + +NK
Sbjct: 108 YTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNK- 166
Query: 96 PKNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSIS------QNQVFG 149
K T S C + + AQ G L EDV+ ++ +
Sbjct: 167 -KVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAY 225
Query: 150 VSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPSSNNIG 209
V+ GC + L P G+ GL ++++P+ LA + FS+C + G
Sbjct: 226 VTFGCGQVQSGSFLDIAAP---NGLFGLGMEKISVPSVLAREGLVADSFSMCF---GHDG 279
Query: 210 FTNLLIGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKP 269
+ G + S + TP LNP + P + I VT V++ +++ +
Sbjct: 280 VGRISFGDKG---SSDQEETPFNLNP--SHPNYN---------ITVTRVRVGTTLIDDEF 325
Query: 270 SLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASVAPFEACFDV 329
+ L T T F L +Y F +A D+R + + PFE C+D+
Sbjct: 326 TAL-----------FDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRI-PFEYCYDM 373
Query: 330 TTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKN-VACLGFV 375
+ N+ +PS+ L ++G + +TI+ ++ + V CL V
Sbjct: 374 SNDANAS---LIPSLSLTMKGNSHFTINDPIIVISTEGELVYCLAIV 417
>AT1G09750.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:3157541-3158960 FORWARD LENGTH=449
Length = 449
Score = 60.1 bits (144), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 70/306 (22%), Positives = 126/306 (41%), Gaps = 32/306 (10%)
Query: 31 KPHPFLLPIKKDPATNV--FYTSIGIGTPQQNFNVAIDLAGENLWYECNNHYNSSSFHPI 88
KP P +P+ ++ + +GTP Q + +D + + +W C+ S+
Sbjct: 85 KPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTS 144
Query: 89 ICESNKCPKNTHACSFCQ-GQFRPXXXXXXXXXXXXXPLAQVL-----FPGDLAEDVVSI 142
++ +T +CS Q Q R Q F L +D +++
Sbjct: 145 FNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL 204
Query: 143 SQNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCL 202
+ + + S GC NS N L QG++GL R ++L +Q L FS CL
Sbjct: 205 APDVIPNFSFGCINSASGNSL------PPQGLMGLGRGPMSLVSQTTSLYS--GVFSYCL 256
Query: 203 PSSNNIGFT-NLLIGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVKID 261
PS + F+ +L +G P K ++ TPL+ NP PS +++++T V +
Sbjct: 257 PSFRSFYFSGSLKLGLLGQP--KSIRYTPLLRNPRR---------PSL-YYVNLTGVSVG 304
Query: 262 GQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASVA 321
V + P L+ + GT + + T VY+ +F K+ + + +++
Sbjct: 305 SVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVN---VSSFSTLG 361
Query: 322 PFEACF 327
F+ CF
Sbjct: 362 AFDTCF 367
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 58.2 bits (139), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/163 (22%), Positives = 77/163 (47%), Gaps = 12/163 (7%)
Query: 243 EEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFI 302
+E + T +++ + S+ + G+V+N+ +I DG GGT + + T + Y+ FI
Sbjct: 369 KENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYE-FI 427
Query: 303 LDFIKKASDRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSM 362
+ I + + + + CF+V+ I N + +P + + GAVW NS
Sbjct: 428 KNKIAEKAKGKYPVYRDFPILDPCFNVSGIHN----VQLPELGIAFADGAVWNFPTENSF 483
Query: 363 VMVKKNVACLGFVDGGTIGTMSFVKASIVLGAHQLEENLLMFD 405
+ + +++ CL +GT A ++G +Q + +++D
Sbjct: 484 IWLNEDLVCLAM-----LGTPK--SAFSIIGNYQQQNFHILYD 519
>AT3G50050.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:18554138-18557115 REVERSE LENGTH=632
Length = 632
Score = 57.8 bits (138), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/409 (21%), Positives = 163/409 (39%), Gaps = 83/409 (20%)
Query: 26 HSPNSKPHPF-LLPIKKDPATNVFYTS-IGIGTPQQNFNVAIDLAGENLWY-------EC 76
H +SK P + + D N +YT+ + IGTP Q F + +D +G + Y +C
Sbjct: 69 HKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVD-SGSTVTYVPCSDCEQC 127
Query: 77 NNHYN-------SSSFHPIICESN-KCPKNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQ 128
H + SS++ P+ C + C + C + + ++
Sbjct: 128 GKHQDPKFQPEMSSTYQPVKCNMDCNCDDDREQCVY-EREYAEHSSSK------------ 174
Query: 129 VLFPGDLAEDVVSIS-------QNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQ 181
G L ED++S Q VFG + T + + + GIIGL +
Sbjct: 175 ----GVLGEDLISFGNESQLTPQRAVFGCETVETG--------DLYSQRADGIIGLGQGD 222
Query: 182 LALPTQLALLKKLPPKFSLCLPSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNPVDTGPE 241
L+L QL + F LC ++G ++++G ++P + ++ D+ P+
Sbjct: 223 LSLVDQLVDKGLISNSFGLCY-GGMDVGGGSMILGGFDYP-------SDMVF--TDSDPD 272
Query: 242 FEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPF 301
S + ID+T +++ G+ ++L + DG G + + T +A L + + F
Sbjct: 273 R-----SPYYNIDLTGIRVAGKQLSLHSRVF----DGEHGAVLDSGTTYAYLPDAAFAAF 323
Query: 302 ILDFIKKASDRRMKRVASVAP--FEACFDVTTIGN-SRTGLAVPSIDLVLRGGAVWTIHG 358
+++ S +K++ P + CF V S PS+++V + G W +
Sbjct: 324 EEAVMREVS--TLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSP 381
Query: 359 ANSMVMVKK--NVACLGFVDGGTIGTMSFVKASIVLGAHQLEENLLMFD 405
N M K CLG G T +LG + L+++D
Sbjct: 382 ENYMFRHSKVHGAYCLGVFPNGKDHT-------TLLGGIVVRNTLVVYD 423
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 57.4 bits (137), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/163 (22%), Positives = 77/163 (47%), Gaps = 12/163 (7%)
Query: 243 EEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFI 302
+E + T +++ + S+ + G+V+N+ +I DG GGT + + T + Y+ FI
Sbjct: 333 KENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYE-FI 391
Query: 303 LDFIKKASDRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSM 362
+ I + + + + CF+V+ I N + +P + + GAVW NS
Sbjct: 392 KNKIAEKAKGKYPVYRDFPILDPCFNVSGIHN----VQLPELGIAFADGAVWNFPTENSF 447
Query: 363 VMVKKNVACLGFVDGGTIGTMSFVKASIVLGAHQLEENLLMFD 405
+ + +++ CL +GT A ++G +Q + +++D
Sbjct: 448 IWLNEDLVCLAM-----LGTPK--SAFSIIGNYQQQNFHILYD 483
>AT2G36670.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=512
Length = 512
Score = 57.0 bits (136), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/405 (22%), Positives = 156/405 (38%), Gaps = 82/405 (20%)
Query: 34 PFLLPIKKDPATNVFYTSIGIGTPQQNFNVAIDLAGENLWYECNNHYN------------ 81
P+L+ K T +++T + +G+P FNV ID + LW C++ N
Sbjct: 94 PYLVGSKM---TMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLH 150
Query: 82 -----------SSSFHPIICES------NKCPKNTHACSFCQGQFRPXXXXXXXXXXXXX 124
S + IC S +C +N C FR
Sbjct: 151 FFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQ----CGYSFRYGDGSGTSGYYMTD 206
Query: 125 PLAQVLFPGDLAEDVVSISQNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLAL 184
F L E +V+ N + GC+ +G L K K+ GI G + +L++
Sbjct: 207 TF---YFDAILGESLVA---NSSAPIVFGCSTYQ--SGDLTKSDKAVDGIFGFGKGKLSV 258
Query: 185 PTQLALLKKLPPKFSLCLPSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNPVDTGPEFEE 244
+QL+ PP FS CL + G +L L M +PL
Sbjct: 259 VSQLSSRGITPPVFSHCLKGDGSGGGVFVL----GEILVPGMVYSPL------------- 301
Query: 245 GVPSTEHF-IDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFIL 303
VPS H+ +++ S+ ++GQ++ L ++ + GT + T T L Y F L
Sbjct: 302 -VPSQPHYNLNLLSIGVNGQMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDLF-L 357
Query: 304 DFIKKASDRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMV 363
+ I + + + + S E C+ V+T PS+ L G GA+ M+
Sbjct: 358 NAISNSVSQLVTPIISNG--EQCYLVST----SISDMFPSVSLNFAG-------GASMML 404
Query: 364 MVKKNVACLGFVDGGTIGTMSFVKA---SIVLGAHQLEENLLMFD 405
+ + G DG ++ + F KA +LG L++ + ++D
Sbjct: 405 RPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYD 449
>AT5G10770.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:3403331-3405331 REVERSE LENGTH=474
Length = 474
Score = 56.6 bits (135), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/377 (21%), Positives = 147/377 (38%), Gaps = 69/377 (18%)
Query: 22 LSISHSPNSKPHPFLLPIKKDP--ATNVFYTSIGIGTPQQNFNVAIDLAGENLWYECN-- 77
L+ H SK LP K + + ++G+GTP+ + ++ D + W +C
Sbjct: 106 LATDHVSESKSTD--LPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPC 163
Query: 78 ------------NHYNSSSFHPIICESNKCPK------NTHACSFCQGQFRPXXXXXXXX 119
N S+S++ + C S C N +CS +
Sbjct: 164 VRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGD---- 219
Query: 120 XXXXXPLAQVLFPGDLAEDVVSISQNQVF-GVSSGCTNSDGFNGLLEKLPKSSQGIIGLA 178
Q G LA++ +++ + VF GV GC ++ GL + G++GL
Sbjct: 220 --------QSFSVGFLAKEKFTLTNSDVFDGVYFGCGENN--QGLFTGVA----GLLGLG 265
Query: 179 RSQLALPTQLALLKKLPPKFSLCLPSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNPVDT 238
R +L+ P+Q A FS CLPSS + +L G+ +S+ ++ TP ++ +
Sbjct: 266 RDKLSFPSQTA--TAYNKIFSYCLPSSASY-TGHLTFGSAG--ISRSVKFTP--ISTITD 318
Query: 239 GPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVY 298
G F + +++ ++ + GQ + + ++ S G + + T L Y
Sbjct: 319 GTSF--------YGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAY 365
Query: 299 KPFILDFIKKASDRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHG 358
F KA + + V+ + CFD++ + +P + GGAV +
Sbjct: 366 AALRSSF--KAKMSKYPTTSGVSILDTCFDLSGFKT----VTIPKVAFSFSGGAVVELGS 419
Query: 359 ANSMVMVKKNVACLGFV 375
+ K + CL F
Sbjct: 420 KGIFYVFKISQVCLAFA 436
>AT2G36670.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:15364949-15368016 REVERSE LENGTH=507
Length = 507
Score = 55.8 bits (133), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 89/393 (22%), Positives = 150/393 (38%), Gaps = 79/393 (20%)
Query: 46 NVFYTSIGIGTPQQNFNVAIDLAGENLWYECNNHYN-----------------------S 82
+++T + +G+P FNV ID + LW C++ N S
Sbjct: 98 GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 83 SSFHPIICES------NKCPKNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLA 136
+ IC S +C +N C FR F L
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENNQ----CGYSFRYGDGSGTSGYYMTDTF---YFDAILG 210
Query: 137 EDVVSISQNQVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPP 196
E +V+ N + GC+ +G L K K+ GI G + +L++ +QL+ PP
Sbjct: 211 ESLVA---NSSAPIVFGCSTYQ--SGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPP 265
Query: 197 KFSLCLPSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHF-IDV 255
FS CL + G +L L M +PL VPS H+ +++
Sbjct: 266 VFSHCLKGDGSGGGVFVL----GEILVPGMVYSPL--------------VPSQPHYNLNL 307
Query: 256 TSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMK 315
S+ ++GQ++ L ++ + GT + T T L Y F L+ I + + +
Sbjct: 308 LSIGVNGQMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDLF-LNAISNSVSQLVT 364
Query: 316 RVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKNVACLGFV 375
+ S E C+ V+T PS+ L G GA+ M+ + + G
Sbjct: 365 PIISNG--EQCYLVST----SISDMFPSVSLNFAG-------GASMMLRPQDYLFHYGIY 411
Query: 376 DGGTIGTMSFVKA---SIVLGAHQLEENLLMFD 405
DG ++ + F KA +LG L++ + ++D
Sbjct: 412 DGASMWCIGFQKAPEEQTILGDLVLKDKVFVYD 444
>AT3G52500.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:19465644-19467053 REVERSE LENGTH=469
Length = 469
Score = 55.8 bits (133), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 108/246 (43%), Gaps = 23/246 (9%)
Query: 169 KSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPS-----SNNIGFTNLLIGTEEHPLS 223
+ GI G R ++LP+Q+ L +FS CL S +N +L G+ + S
Sbjct: 223 RQPAGIAGFGRGPVSLPSQMNL-----KRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGS 277
Query: 224 KY--MQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIKKDGNGG 281
K + TP NP + F E +++++ + + + V + L+ +G+GG
Sbjct: 278 KTPGLTYTPFRKNPNVSNKAFLE-----YYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGG 332
Query: 282 TRMSTMTRFAELQSSVYKPFILDFIKKASD-RRMKRVASVAPFEACFDVTTIGNSRTGLA 340
+ + + + F ++ V++ +F + S+ R K + CF+++ G+ +
Sbjct: 333 SIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNISGKGD----VT 388
Query: 341 VPSIDLVLRGGAVWTIHGANSMVMV-KKNVACLGFVDGGTIGTMSFVKASIVLGAHQLEE 399
VP + +GGA + +N V + CL V T+ +I+LG+ Q +
Sbjct: 389 VPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQN 448
Query: 400 NLLMFD 405
L+ +D
Sbjct: 449 YLVEYD 454
>AT5G07030.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:2183600-2185717 REVERSE LENGTH=455
Length = 455
Score = 54.7 bits (130), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 61/315 (19%), Positives = 122/315 (38%), Gaps = 63/315 (20%)
Query: 54 IGTPQQNFNVAIDLAGENLWYECN-----------NHYNSSSFHPIICESNKCPK----- 97
IGTP Q +A+D + + W C+ + S+SF + C + +C +
Sbjct: 121 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPT 180
Query: 98 -NTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSISQNQVFGVSSGCTN 156
ACSF +L++D + ++ + + + GC N
Sbjct: 181 CGARACSF------------------NLTYGSSSIAANLSQDTIRLAADPIKAFTFGCVN 222
Query: 157 SDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPSSNNIGFT-NLLI 215
G +P + + ++ K FS CLPS ++ F+ +L +
Sbjct: 223 KVAGGG---TIPPPQGLLGLGRGPLSLMSQAQSIYKS---TFSYCLPSFRSLTFSGSLRL 276
Query: 216 GTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLKPSLLSIK 275
G P + ++ T L+ NP S+ +++++ ++++ +VV+L P+ ++
Sbjct: 277 GPTSQP--QRVKYTQLLRNPRR----------SSLYYVNLVAIRVGRKVVDLPPAAIAFN 324
Query: 276 KDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRVASVAPFEACFDVTTIGNS 335
GT + T + L VY+ +F K+ V S+ F+ C+
Sbjct: 325 PSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKP-TTAVVTSLGGFDTCYS------- 376
Query: 336 RTGLAVPSIDLVLRG 350
+ VP+I + +G
Sbjct: 377 -GQVKVPTITFMFKG 390
>AT2G42980.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:17875005-17876588 REVERSE LENGTH=527
Length = 527
Score = 53.9 bits (128), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/381 (21%), Positives = 144/381 (37%), Gaps = 52/381 (13%)
Query: 48 FYTSIGIGTPQQNFNVAIDLAGENLWYECNNHYN-------------SSSFHPIICESNK 94
++ + +GTP ++F++ +D + W +C Y+ S+SF I C +
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPR 219
Query: 95 C-----PKNTHACSFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSISQNQVFG 149
C P C Q P A F +L S+ +V
Sbjct: 220 CSLISSPDPPVQCE-SDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGN 278
Query: 150 VSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCLPSSN-NI 208
+ GC + + GL L R L+ +QL L FS CL N N
Sbjct: 279 MMFGCGHWN--RGLFSGASGLLG----LGRGPLSFSSQLQSL--YGHSFSYCLVDRNSNT 330
Query: 209 GFTNLLIGTEEHPLSKYMQTTPLILNPVDTGPEFEEGVPSTEHFIDVTSVKIDGQVVNLK 268
++ LI E+ L + T L G +E T ++I + S+ + G+ +++
Sbjct: 331 NVSSKLIFGEDKDL---LNHTNLNFTSFVNG---KENSVETFYYIQIKSILVGGKALDIP 384
Query: 269 PSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKR----VASVAPFE 324
+I DG+GGT + + T + Y + IK +MK +
Sbjct: 385 EETWNISSDGDGGTIIDSGTTLSYFAEPAY-----EIIKNKFAEKMKENYPIFRDFPVLD 439
Query: 325 ACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKNVACLGFVDGGTIGTMS 384
CF+V+ I + +P + + G VW NS + + +++ CL + G T S
Sbjct: 440 PCFNVSGI--EENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAIL-GTPKSTFS 496
Query: 385 FVKASIVLGAHQLEENLLMFD 405
++G +Q + +++D
Sbjct: 497 ------IIGNYQQQNFHILYD 511
>AT2G03200.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr2:966506-967891 REVERSE LENGTH=461
Length = 461
Score = 50.1 bits (118), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 79/395 (20%), Positives = 147/395 (37%), Gaps = 86/395 (21%)
Query: 48 FYTSIGIGTPQQNFNVAIDLAGENLWYECN-------------NHYNSSSFHPIICES-- 92
F + IG P ++ +D + +W +C + SSS+ + C S
Sbjct: 107 FLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGL 166
Query: 93 ------NKCPKNTHACSF--CQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSIS- 143
+ C ++ AC + G + G LA + +
Sbjct: 167 CNALPRSNCNEDKDACEYLYTYGDYSSTR-------------------GLLATETFTFED 207
Query: 144 QNQVFGVSSGC---TNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSL 200
+N + G+ GC DGF+ G++GL R L+L +QL KFS
Sbjct: 208 ENSISGIGFGCGVENEGDGFS--------QGSGLVGLGRGPLSLISQLK-----ETKFSY 254
Query: 201 CLPS-SNNIGFTNLLIGTEEHPL---------SKYMQTTPLILNPVDTGPEFEEGVPSTE 250
CL S ++ ++L IG+ + + +T L+ NP P F
Sbjct: 255 CLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQ--PSF-------- 304
Query: 251 HFIDVTSVKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKAS 310
+++++ + + + ++++ S + +DG GG + + T L+ + +K +F + S
Sbjct: 305 YYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMS 364
Query: 311 DRRMKRVASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVM-VKKNV 369
+ S + CF + ++ +AVP + + GA + G N MV V
Sbjct: 365 -LPVDDSGSTG-LDLCFKLP---DAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGV 418
Query: 370 ACLGFVDGGTIGTMSFVKASIVLGAHQLEENLLMF 404
CL + V+ H LE+ + F
Sbjct: 419 LCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSF 453
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 49.7 bits (117), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 72/359 (20%), Positives = 142/359 (39%), Gaps = 86/359 (23%)
Query: 48 FYTSIGIGTPQQNFNVAIDLAGENLWYECN-------------NHYNSSSFHPIICESNK 94
+++ IG+GTP + + +D + W +C N +SS++ + C + +
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 95 CPK-NTHAC---------SFCQGQFRPXXXXXXXXXXXXXPLAQVLFPGDLAEDVVSISQ 144
C T AC S+ G F G+LA D V+
Sbjct: 222 CSLLETSACRSNKCLYQVSYGDGSFTV---------------------GELATDTVTFGN 260
Query: 145 N-QVFGVSSGCTNSDGFNGLLEKLPKSSQGIIGLARSQLALPTQLALLKKLPPKFSLCL- 202
+ ++ V+ GC + + E L + G++GL L++ Q+ FS CL
Sbjct: 261 SGKINNVALGCGHDN------EGLFTGAAGLLGLGGGVLSITNQMK-----ATSFSYCLV 309
Query: 203 ----PSSNNIGFTNLLIGTEEHPLSKYMQTTPLILNP-VDTGPEFEEGVPSTEHFIDVTS 257
S+++ F ++ +G + T PL+ N +DT +++ ++
Sbjct: 310 DRDSGKSSSLDFNSVQLGGGD-------ATAPLLRNKKIDT-----------FYYVGLSG 351
Query: 258 VKIDGQVVNLKPSLLSIKKDGNGGTRMSTMTRFAELQSSVYKPFILDFIKKASDRRMKRV 317
+ G+ V L ++ + G+GG + T LQ+ Y F+K + + K
Sbjct: 352 FSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLK-KGS 410
Query: 318 ASVAPFEACFDVTTIGNSRTGLAVPSIDLVLRGGAVWTIHGANSMVMVKKN-VACLGFV 375
+S++ F+ C+D +++ + VP++ GG + N ++ V + C F
Sbjct: 411 SSISLFDTCYDFSSLST----VKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFA 465