Miyakogusa Predicted Gene
- Lj1g3v4047290.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4047290.1 tr|A4PIZ4|A4PIZ4_LOTJA Cysteine proteinase
OS=Lotus japonicus GN=LjCyp4 PE=2 SV=1,97.07,0,CYSTEINE PROTEASE,NULL;
CYSTEINE PROTEASE FAMILY C1-RELATED,Peptidase C1A, papain;
THIOL_PROTEASE_CY,gene.g35963.t1.1
(307 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
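The query length (307 letters) and the database size (14,482,855 total letters) reported above define the search space behind every E-value in this report. As a rough illustration only, the Python sketch below applies the standard bit-score relation E ≈ m · n · 2^(−S'). The function name `approx_evalue` is hypothetical, and the sketch uses raw lengths, whereas BLAST applies effective (length-corrected) values and unrounded bit scores, so the result only approximates what the report prints.

```python
def approx_evalue(bit_score, query_len, db_letters):
    """Approximate expected number of chance hits scoring at least bit_score.

    Assumption: raw query and database lengths stand in for the search
    space. BLAST itself uses effective lengths, so reported E-values
    typically differ slightly from this estimate.
    """
    return query_len * db_letters * 2.0 ** (-bit_score)


# Values taken from this report: 307-letter query, 14,482,855 database
# letters, and a top hit scoring 384 bits.
estimate = approx_evalue(384, 307, 14_482_855)
print(f"{estimate:.1e}")
```

The estimate lands within an order of magnitude of the reported E-value for the top hit, which is about as close as raw lengths and a rounded bit score allow.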
Sequences producing significant alignments: Score (bits) E-value
AT5G45890.1 | Symbols: SAG12 | senescence-associated gene 12 | c... 384 e-107
AT5G50260.1 | Symbols: | Cysteine proteinases superfamily prote... 362 e-100
AT2G34080.1 | Symbols: | Cysteine proteinases superfamily prote... 349 1e-96
AT3G48340.1 | Symbols: | Cysteine proteinases superfamily prote... 348 2e-96
AT3G49340.1 | Symbols: | Cysteine proteinases superfamily prote... 347 4e-96
AT3G19390.1 | Symbols: | Granulin repeat cysteine protease fami... 343 7e-95
AT4G35350.1 | Symbols: XCP1 | xylem cysteine peptidase 1 | chr4:... 342 1e-94
AT1G20850.1 | Symbols: XCP2 | xylem cysteine peptidase 2 | chr1:... 336 1e-92
AT2G27420.1 | Symbols: | Cysteine proteinases superfamily prote... 336 1e-92
AT5G43060.1 | Symbols: | Granulin repeat cysteine protease fami... 335 2e-92
AT1G29090.1 | Symbols: | Cysteine proteinases superfamily prote... 332 3e-91
AT3G48350.1 | Symbols: | Cysteine proteinases superfamily prote... 331 5e-91
AT4G36880.1 | Symbols: CP1 | cysteine proteinase1 | chr4:1737469... 330 7e-91
AT1G47128.1 | Symbols: RD21, RD21A | Granulin repeat cysteine pr... 328 4e-90
AT1G29080.1 | Symbols: | Papain family cysteine protease | chr1... 326 1e-89
AT3G19400.1 | Symbols: | Cysteine proteinases superfamily prote... 324 5e-89
AT1G06260.1 | Symbols: | Cysteine proteinases superfamily prote... 318 3e-87
AT4G11310.1 | Symbols: | Papain family cysteine protease | chr4... 306 1e-83
AT4G11320.1 | Symbols: | Papain family cysteine protease | chr4... 304 5e-83
AT4G23520.1 | Symbols: | Cysteine proteinases superfamily prote... 301 4e-82
AT1G09850.1 | Symbols: XBCP3 | xylem bark cysteine peptidase 3 |... 294 5e-80
AT1G29110.1 | Symbols: | Cysteine proteinases superfamily prote... 280 7e-76
AT3G43960.1 | Symbols: | Cysteine proteinases superfamily prote... 257 9e-69
AT4G35350.2 | Symbols: XCP1 | xylem cysteine peptidase 1 | chr4:... 244 5e-65
AT3G19400.2 | Symbols: | Cysteine proteinases superfamily prote... 231 5e-61
AT5G60360.1 | Symbols: SAG2, AALP, ALP | aleurain-like protease ... 229 2e-60
AT5G60360.2 | Symbols: AALP, ALP | aleurain-like protease | chr5... 222 3e-58
AT3G45310.1 | Symbols: | Cysteine proteinases superfamily prote... 219 2e-57
AT5G60360.3 | Symbols: AALP, ALP | aleurain-like protease | chr5... 217 7e-57
AT3G45310.2 | Symbols: | Cysteine proteinases superfamily prote... 213 2e-55
AT4G39090.1 | Symbols: RD19, RD19A | Papain family cysteine prot... 209 2e-54
AT2G21430.1 | Symbols: | Papain family cysteine protease | chr2... 206 2e-53
AT4G16190.1 | Symbols: | Papain family cysteine protease | chr4... 190 1e-48
AT3G54940.2 | Symbols: | Papain family cysteine protease | chr3... 190 1e-48
AT1G02305.1 | Symbols: | Cysteine proteinases superfamily prote... 94 2e-19
AT4G01610.1 | Symbols: | Cysteine proteinases superfamily prote... 92 5e-19
AT4G01610.2 | Symbols: | Cysteine proteinases superfamily prote... 89 5e-18
AT1G02300.1 | Symbols: | Cysteine proteinases superfamily prote... 74 1e-13
AT2G22160.1 | Symbols: | Cysteine proteinases superfamily prote... 63 2e-10
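The summary table above can be read programmatically. The following is a minimal Python sketch, assuming each hit line has the layout shown here (identifier, a "|"-separated description, bit score, E-value). The names `HIT_LINE`, `parse_evalue`, and `parse_hit_line` are hypothetical helpers, not part of BLAST. Note that this legacy output abbreviates 1e-107 as "e-107", which Python's float() does not accept without a leading digit.

```python
import re

# Hypothetical pattern for one line of the legacy hit summary table:
# <id> | <description...> <bits> <evalue>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_evalue(text):
    """Convert an E-value string to float, accepting the short form 'e-107'."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_line(line):
    """Return a dict for a hit line, or None if the line is not a hit."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    return {
        "id": match.group("id"),
        "description": match.group("desc").strip(),
        "bits": float(match.group("bits")),
        "evalue": parse_evalue(match.group("evalue")),
    }


hit = parse_hit_line(
    "AT5G45890.1 | Symbols: SAG12 | senescence-associated gene 12 | c... 384 e-107"
)
print(hit["id"], hit["bits"], hit["evalue"])
```

Lines that do not match the pattern, such as the header or the progress line, return None and can be skipped.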
>AT5G45890.1 | Symbols: SAG12 | senescence-associated gene 12 |
chr5:18613300-18614759 FORWARD LENGTH=346
Length = 346
Score = 384 bits (986), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/314 (59%), Positives = 233/314 (74%), Gaps = 10/314 (3%)
Query: 1 MHERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNN-AGNKSYKLGINHFADLTN 59
M +RH +WM + G+VY D E+ RY +FK NV+RIE N+ +++KL +N FADLTN
Sbjct: 34 MQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTN 93
Query: 60 EEFKAR-NRFKGHMCSNS---TKTPTFKYERVTS--VPASLDWRQKGAVTPIKNQGQCGC 113
+EF++ FKG +S TK F+Y+ V+S +P S+DWR+KGAVTPIKNQG CGC
Sbjct: 94 DEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGC 153
Query: 114 CWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLN 173
CWAFSAVAA EG T++ GKLISLSEQ+LVDCDT D GCEGGLMD AF+ I GL
Sbjct: 154 CWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCEGGLMDTAFEHIKATGGLT 211
Query: 174 TEAKYPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEF 233
TE+ YPYKG DATCN+ A SI G+EDVP N E AL+KAVA+QP+SV I+ G +F
Sbjct: 212 TESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDF 271
Query: 234 QFYSSGVFTGSCGTELDHGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAE 292
QFYSSGVFTG C T LDH VTA+GYG S G+KYW++KNSWG +WGE GY+R+Q+DV +
Sbjct: 272 QFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDK 331
Query: 293 EGLCGIAMQASYPT 306
+GLCG+AM+ASYPT
Sbjct: 332 QGLCGLAMKASYPT 345
>AT5G50260.1 | Symbols: | Cysteine proteinases superfamily protein
| chr5:20455605-20456862 FORWARD LENGTH=361
Length = 361
Score = 362 bits (929), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/309 (58%), Positives = 218/309 (70%), Gaps = 9/309 (2%)
Query: 3 ERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEF 62
E +E+W + V + EK R+ +FK NV+ I N +KSYKL +N F D+T+EEF
Sbjct: 36 ELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHE-TNKKDKSYKLKLNKFGDMTSEEF 93
Query: 63 K---ARNRFKGHMCSNSTK--TPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAF 117
+ A + K H K T +F Y V ++P S+DWR+ GAVTP+KNQGQCG CWAF
Sbjct: 94 RRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAF 153
Query: 118 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAK 177
S V A EGI ++ T KL SLSEQELVDCDT +QGC GGLMD AF+FI + GL +E
Sbjct: 154 STVVAVEGINQIRTKKLTSLSEQELVDCDTNQ-NQGCNGGLMDLAFEFIKEKGGLTSELV 212
Query: 178 YPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYS 237
YPYK D TC+ N E SI G EDVP NSE L+KAVANQP+SVAIDA GS+FQFYS
Sbjct: 213 YPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYS 272
Query: 238 SGVFTGSCGTELDHGVTAVGYGSD-GGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLC 296
GVFTG CGTEL+HGV VGYG+ GTKYW+VKNSWGE+WGE+GYIRMQR + +EGLC
Sbjct: 273 EGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLC 332
Query: 297 GIAMQASYP 305
GIAM+ASYP
Sbjct: 333 GIAMEASYP 341
>AT2G34080.1 | Symbols: | Cysteine proteinases superfamily protein
| chr2:14393431-14394777 REVERSE LENGTH=345
Length = 345
Score = 349 bits (895), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 171/314 (54%), Positives = 222/314 (70%), Gaps = 10/314 (3%)
Query: 1 MHERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNE 60
M ++HEQWMA+F + Y+D EK +R +FK+N++ IE FN GNKSYKLG+N FAD TNE
Sbjct: 35 MVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNE 94
Query: 61 EFKA-RNRFKGHMCSNSTKTPTFKYERVT-----SVPASLDWRQKGAVTPIKNQGQCGCC 114
EF A KG + +K T V S DWR +GAVTP+K QGQCGCC
Sbjct: 95 EFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDMVVESKDWRAEGAVTPVKYQGQCGCC 154
Query: 115 WAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNT 174
WAFSAVAA EG+ K++ G L+SLSEQ+L+DCD + D+GC+GG+M DAF +++QN+G+ +
Sbjct: 155 WAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCD-REYDRGCDGGIMSDAFNYVVQNRGIAS 213
Query: 175 EAKYPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQ 234
E Y Y+G D C +N A+ AA I GF+ VP+N+E ALL+AV+ QP+SV++DA+G F
Sbjct: 214 ENDYSYQGSDGGCRSN--ARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFM 271
Query: 235 FYSSGVFTGSCGTELDHGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEE 293
YS GV+ G CGT +H VT VGYG S GTKYWL KNSWGE WGE+GYIR++RDVA +
Sbjct: 272 HYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQ 331
Query: 294 GLCGIAMQASYPTA 307
G+CG+A A YP A
Sbjct: 332 GMCGVAQYAFYPVA 345
>AT3G48340.1 | Symbols: | Cysteine proteinases superfamily protein
| chr3:17897739-17899074 FORWARD LENGTH=361
Length = 361
Score = 348 bits (894), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 170/308 (55%), Positives = 221/308 (71%), Gaps = 10/308 (3%)
Query: 5 HEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFK- 63
+++W + V + E+E R+ +F+ NV + N N+SYKL +N FADLT EFK
Sbjct: 38 YDRWRSHHS-VPRSLNEREKRFNVFRHNVMHVHN-TNKKNRSYKLKLNKFADLTINEFKN 95
Query: 64 --ARNRFKGHMC----SNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAF 117
+ K H +K + +E ++ +P+S+DWR+KGAVT IKNQG+CG CWAF
Sbjct: 96 AYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAF 155
Query: 118 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAK 177
S VAA EGI K+ T KL+SLSEQELVDCDTK ++GC GGLM+ AF+FI +N G+ TE
Sbjct: 156 STVAAVEGINKIKTNKLVSLSEQELVDCDTKQ-NEGCNGGLMEIAFEFIKKNGGITTEDS 214
Query: 178 YPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYS 237
YPY+G+D C+A+ + +I G EDVP N E+ALLKAVANQP+SVAIDA S+FQFYS
Sbjct: 215 YPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYS 274
Query: 238 SGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCG 297
GVFTGSCGTEL+HGV AVGYGS+ G KYW+V+NSWG +WGE GYI+++R++ EG CG
Sbjct: 275 EGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCG 334
Query: 298 IAMQASYP 305
IAM+ASYP
Sbjct: 335 IAMEASYP 342
>AT3G49340.1 | Symbols: | Cysteine proteinases superfamily protein
| chr3:18293347-18294577 REVERSE LENGTH=341
Length = 341
Score = 347 bits (891), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 168/315 (53%), Positives = 224/315 (71%), Gaps = 16/315 (5%)
Query: 3 ERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEF 62
E+HEQWM++F +VY D EK R++IF N++ +E+ N NK+Y L +N F+DLT+EEF
Sbjct: 33 EKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEEF 92
Query: 63 KARNRFKGHMC---------SNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGC 113
KAR + G + ++S +T +F+YE V S+DW Q+GAVT +K+Q QCGC
Sbjct: 93 KAR--YTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEGAVTSVKHQQQCGC 150
Query: 114 CWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLN 173
CWAFSAVAA EG+TK++ G+L+SLSEQ+L+DC T+ + GC GG+M AF +I +N+G+
Sbjct: 151 CWAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTE--NNGCGGGIMWKAFDYIKENQGIT 208
Query: 174 TEAKYPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEF 233
TE YPY+G TC +N A AA+I G+E VP N E ALLKAV+ QP+SVAI+ SG EF
Sbjct: 209 TEDNYPYQGAQQTCESNHLA--AATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYEF 266
Query: 234 QFYSSGVFTGSCGTELDHGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAE 292
YS G+F G CGT+L H VT VGYG S+ G KYWL+KNSWGE WGE GY+R+ RDV +
Sbjct: 267 IHYSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRDVDSP 326
Query: 293 EGLCGIAMQASYPTA 307
+G+CG+A A YP A
Sbjct: 327 QGMCGLASLAYYPVA 341
>AT3G19390.1 | Symbols: | Granulin repeat cysteine protease family
protein | chr3:6723024-6724768 FORWARD LENGTH=452
Length = 452
Score = 343 bits (881), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 162/304 (53%), Positives = 216/304 (71%), Gaps = 3/304 (0%)
Query: 5 HEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKA 64
+E+W+ + K Y EKE R++IFK+N++ +E ++ N++Y++G+ FADLTN+EF+A
Sbjct: 43 YERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRA 102
Query: 65 -RNRFKGHMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAAT 123
R K K + Y+ S+P ++DWR KGAV P+K+QG CG CWAFSA+ A
Sbjct: 103 IYLRSKMERTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAV 162
Query: 124 EGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYKGV 183
EGI ++ TG+LISLSEQELVDCDT + GC GGLMD AFKFI++N G++TE YPY
Sbjct: 163 EGINQIKTGELISLSEQELVDCDTS-YNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIAT 221
Query: 184 DA-TCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGVFT 242
D CN++ + +I G+EDVP N E +L KA+ANQPISVAI+A G FQ Y+SGVFT
Sbjct: 222 DVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFT 281
Query: 243 GSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGIAMQA 302
G+CGT LDHGV AVGYGS+GG YW+V+NSWG WGE GY +++R++ G CG+AM A
Sbjct: 282 GTCGTSLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMA 341
Query: 303 SYPT 306
SYPT
Sbjct: 342 SYPT 345
>AT4G35350.1 | Symbols: XCP1 | xylem cysteine peptidase 1 |
chr4:16810529-16811875 FORWARD LENGTH=355
Length = 355
Score = 342 bits (878), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 164/308 (53%), Positives = 212/308 (68%), Gaps = 4/308 (1%)
Query: 1 MHERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNE 60
+ E E WM++ K YK EK R+++F+EN+ I+ NN N SY LG+N FADLT+E
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 61 EFKAR--NRFKGHMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFS 118
EFK R K + F+Y +T +P S+DWR+KGAV P+K+QGQCG CWAFS
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFS 165
Query: 119 AVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKY 178
VAA EGI +++TG L SLSEQEL+DCDT + GC GGLMD AF++I+ GL+ E Y
Sbjct: 166 TVAAVEGINQITTGNLSSLSEQELIDCDTT-FNSGCNGGLMDYAFQYIISTGGLHKEDDY 224
Query: 179 PYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSS 238
PY + C E + +I G+EDVP N + +L+KA+A+QP+SVAI+ASG +FQFY
Sbjct: 225 PYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKG 284
Query: 239 GVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGI 298
GVF G CGT+LDHGV AVGYGS G+ Y +VKNSWG +WGE+G+IRM+R+ EGLCGI
Sbjct: 285 GVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGI 344
Query: 299 AMQASYPT 306
ASYPT
Sbjct: 345 NKMASYPT 352
>AT1G20850.1 | Symbols: XCP2 | xylem cysteine peptidase 2 |
chr1:7252208-7253537 FORWARD LENGTH=356
Length = 356
Score = 336 bits (862), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 165/309 (53%), Positives = 211/309 (68%), Gaps = 5/309 (1%)
Query: 1 MHERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNE 60
+ E E W++ F K Y+ EK LR+++FK+N++ I+ N G KSY LG+N FADL++E
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105
Query: 61 EFKARNR-FKGHMCSNSTKTP--TFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAF 117
EFK K + + F Y V +VP S+DWR+KGAV +KNQG CG CWAF
Sbjct: 106 EFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAF 165
Query: 118 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAK 177
S VAA EGI K+ TG L +LSEQEL+DCDT + GC GGLMD AF++I++N GL E
Sbjct: 166 STVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKEED 224
Query: 178 YPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYS 237
YPY + TC + + +I G +DVP N E +LLKA+A+QP+SVAIDASG EFQFYS
Sbjct: 225 YPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284
Query: 238 SGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCG 297
GVF G CG +LDHGV AVGYGS G+ Y +VKNSWG +WGE+GYIR++R+ EGLCG
Sbjct: 285 GGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCG 344
Query: 298 IAMQASYPT 306
I AS+PT
Sbjct: 345 INKMASFPT 353
>AT2G27420.1 | Symbols: | Cysteine proteinases superfamily protein
| chr2:11726311-11727519 REVERSE LENGTH=348
Length = 348
Score = 336 bits (861), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 164/317 (51%), Positives = 226/317 (71%), Gaps = 13/317 (4%)
Query: 3 ERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEF 62
E+HEQWMA+F +VY D EK R+ IFK+N++ ++ FN +YK+ IN F+DLT+EEF
Sbjct: 33 EKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEF 92
Query: 63 KARNR--------FKGHMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCC 114
+A + + S+ T F+Y V+ S+DWRQ+GAVTP+K QG+CG C
Sbjct: 93 RATHTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGC 152
Query: 115 WAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNT 174
WAFSAVAA EGITK++ G+L+SLSEQ+L+DCD + +QGC GG+M AF++I++N+G+ T
Sbjct: 153 WAFSAVAAVEGITKITKGELVSLSEQQLLDCD-RDYNQGCRGGIMSKAFEYIIKNQGITT 211
Query: 175 EAKYPYKGVDATCNANAEAKD---AASIKGFEDVPANSESALLKAVANQPISVAIDASGS 231
E YPY+ TC+++ AA+I G+E VP N+E ALL+AV+ QP+SV I+ +G+
Sbjct: 212 EDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGA 271
Query: 232 EFQFYSSGVFTGSCGTELDHGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVA 290
F+ YS GVF G CGT+L H VT VGYG S+ GTKYW+VKNSWGE WGE GY+R++RDV
Sbjct: 272 AFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVD 331
Query: 291 AEEGLCGIAMQASYPTA 307
A +G+CG+A+ A YP A
Sbjct: 332 APQGMCGLAILAFYPLA 348
>AT5G43060.1 | Symbols: | Granulin repeat cysteine protease family
protein | chr5:17269784-17272117 REVERSE LENGTH=463
Length = 463
Score = 335 bits (860), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 167/307 (54%), Positives = 219/307 (71%), Gaps = 10/307 (3%)
Query: 5 HEQWMAQFGKVYKDS----YEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNE 60
+E WM + GK + EK+ R++IFK+N++ I+ +N N SYKLG+ FADLTNE
Sbjct: 50 YEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDE-HNTKNLSYKLGLTRFADLTNE 108
Query: 61 EFKARNRFKGHMCSNSTKTPTFKYE-RV-TSVPASLDWRQKGAVTPIKNQGQCGCCWAFS 118
E+ R+ + G + + +Y+ RV ++P S+DWR++GAV +K+QG CG CWAFS
Sbjct: 109 EY--RSMYLGAKPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFS 166
Query: 119 AVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKY 178
+ A EGI K+ TG LISLSEQELVDCDT +QGC GGLMD AF+FI++N G++TEA Y
Sbjct: 167 TIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIIKNGGIDTEADY 225
Query: 179 PYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSS 238
PYK D C+ N + +I +EDVP NSE++L KA+A+QPISVAI+A G FQ YSS
Sbjct: 226 PYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSS 285
Query: 239 GVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGI 298
GVF G CGTELDHGV AVGYG++ G YW+V+NSWG +WGE GYI+M R++ A G CGI
Sbjct: 286 GVFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGKCGI 345
Query: 299 AMQASYP 305
AM+ASYP
Sbjct: 346 AMEASYP 352
>AT1G29090.1 | Symbols: | Cysteine proteinases superfamily protein
| chr1:10163103-10164385 REVERSE LENGTH=355
Length = 355
Score = 332 bits (850), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 163/316 (51%), Positives = 217/316 (68%), Gaps = 16/316 (5%)
Query: 3 ERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEF 62
E H+QWM +F +VY D EK++R+ +FK+N++ IE FN G+++YKLG+N FAD T EEF
Sbjct: 45 EHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEF 104
Query: 63 KARNRFKGHMCSNSTKTPTFKYERVTSVPASL---------DWRQKGAVTPIKNQGQCGC 113
A + G N + F E + S ++ DWR +GAVTP+K QGQCGC
Sbjct: 105 IATH--TGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGC 162
Query: 114 CWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLN 173
CWAFS+VAA EG+TK+ L+SLSEQ+L+DCD + D GC GG+M DAF +I++N+G+
Sbjct: 163 CWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCD-RERDNGCNGGIMSDAFSYIIKNRGIA 221
Query: 174 TEAKYPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEF 233
+EA YPY+ + TC N K +A I+GF+ VP+N+E ALL+AV+ QP+SV+IDA G F
Sbjct: 222 SEASYPYQAAEGTCRYN--GKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGF 279
Query: 234 QFYSSGVFTGS-CGTELDHGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAA 291
YS GV+ CGT ++H VT VGYG S G KYWL KNSWGE WGE GYIR++RDVA
Sbjct: 280 MHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAW 339
Query: 292 EEGLCGIAMQASYPTA 307
+G+CG+A A YP A
Sbjct: 340 PQGMCGVAQYAFYPVA 355
>AT3G48350.1 | Symbols: | Cysteine proteinases superfamily protein
| chr3:17905752-17907370 FORWARD LENGTH=364
Length = 364
Score = 331 bits (848), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 167/309 (54%), Positives = 209/309 (67%), Gaps = 10/309 (3%)
Query: 5 HEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFK- 63
+E+W V + S+E R+ +F+ NV + N NK YKL IN FAD+T+ EF+
Sbjct: 38 YERWRGHHS-VSRASHEAIKRFNVFRHNVLHVHR-TNKKNKPYKLKINRFADITHHEFRS 95
Query: 64 --ARNRFKGHMCSNSTKTPT--FKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFSA 119
A + K H K + F YE VT VP+S+DWR+KGAVT +KNQ CG CWAFS
Sbjct: 96 SYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFST 155
Query: 120 VAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYP 179
VAA EGI K+ T KL+SLSEQELVDCDT+ +QGC GGLM+ AF+FI N G+ TE YP
Sbjct: 156 VAAVEGINKIRTNKLVSLSEQELVDCDTEE-NQGCAGGLMEPAFEFIKNNGGIKTEETYP 214
Query: 180 YKGVDAT-CNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSS 238
Y D C AN+ + +I G E VP N E LLKAVA+QP+SVAIDA S+FQ YS
Sbjct: 215 YDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSE 274
Query: 239 GVFTGSCGTELDHGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCG 297
GVF G CGT+L+HGV VGYG + GTKYW+V+NSWG +WGE GY+R++R ++ EG CG
Sbjct: 275 GVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCG 334
Query: 298 IAMQASYPT 306
IAM+ASYPT
Sbjct: 335 IAMEASYPT 343
>AT4G36880.1 | Symbols: CP1 | cysteine proteinase1 |
chr4:17374692-17376180 REVERSE LENGTH=376
Length = 376
Score = 330 bits (846), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 168/314 (53%), Positives = 215/314 (68%), Gaps = 17/314 (5%)
Query: 7 QWMAQFGKVYKDSY----EKELRYQIFKENVQRIEAFN-NAGNKSYKLGINHFADLTNEE 61
QW A+ GK ++ +++ R+ IFK+N++ I+ N N N +YKLG+ F DLTN+E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110
Query: 62 FK-----ARNRFKGHMCSNSTKTPTFKYERVTS---VPASLDWRQKGAVTPIKNQGQCGC 113
++ AR + K KY + VP ++DWRQKGAV PIK+QG CG
Sbjct: 111 YRKLYLGARTEPARRIAK--AKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGS 168
Query: 114 CWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLN 173
CWAFS AA EGI K+ TG+LISLSEQELVDCD K +QGC GGLMD AF+FIM+N GLN
Sbjct: 169 CWAFSTTAAVEGINKIVTGELISLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGGLN 227
Query: 174 TEAKYPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEF 233
TE YPY+G CN+ + SI G+EDVP E+AL KA++ QP+SVAI+A G F
Sbjct: 228 TEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIF 287
Query: 234 QFYSSGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAA-E 292
Q Y SG+FTGSCGT LDH V AVGYGS+ G YW+V+NSWG +WGE+GYIRM+R++AA +
Sbjct: 288 QHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASK 347
Query: 293 EGLCGIAMQASYPT 306
G CGIA++ASYP
Sbjct: 348 SGKCGIAVEASYPV 361
>AT1G47128.1 | Symbols: RD21, RD21A | Granulin repeat cysteine
protease family protein | chr1:17283139-17285609 REVERSE
LENGTH=462
Length = 462
Score = 328 bits (840), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 156/305 (51%), Positives = 217/305 (71%), Gaps = 7/305 (2%)
Query: 5 HEQWMAQFGKVYKDS--YEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEF 62
+E W+ + GK + EK+ R++IFK+N++ ++ +N N SY+LG+ FADLTN+E+
Sbjct: 50 YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEY 108
Query: 63 KARNRFKGHMCSNSTKTPTFKYE-RV-TSVPASLDWRQKGAVTPIKNQGQCGCCWAFSAV 120
+++ M + + +YE RV +P S+DWR+KGAV +K+QG CG CWAFS +
Sbjct: 109 RSK-YLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTI 167
Query: 121 AATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPY 180
A EGI ++ TG LI+LSEQELVDCDT ++GC GGLMD AF+FI++N G++T+ YPY
Sbjct: 168 GAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGIDTDKDYPY 226
Query: 181 KGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGV 240
KGVD TC+ + +I +EDVP SE +L KAVA+QPIS+AI+A G FQ Y SG+
Sbjct: 227 KGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI 286
Query: 241 FTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGIAM 300
F GSCGT+LDHGV AVGYG++ G YW+V+NSWG+ WGE GY+RM R++A+ G CGIA+
Sbjct: 287 FDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAI 346
Query: 301 QASYP 305
+ SYP
Sbjct: 347 EPSYP 351
>AT1G29080.1 | Symbols: | Papain family cysteine protease |
chr1:10157494-10158674 REVERSE LENGTH=346
Length = 346
Score = 326 bits (836), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 220/312 (70%), Gaps = 13/312 (4%)
Query: 5 HEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKA 64
H+QWM QF +VY D +EK+LR Q+ EN++ IE+FNN GN+SYKLG+N F D T EEF A
Sbjct: 39 HQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKEEFLA 98
Query: 65 R-------NRFKGHMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAF 117
N N TK P + + + + DWR +GAVTP+K+QG+CG CWAF
Sbjct: 99 TYTGLRGVNVTSPFEVVNETK-PAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAF 157
Query: 118 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAK 177
SA+AA EG+TK++ G LISLSEQ+L+DC T+ + GC+GG +AF +I++++G+++E +
Sbjct: 158 SAIAAVEGLTKIARGNLISLSEQQLLDC-TREQNNGCKGGTFVNAFNYIIKHRGISSENE 216
Query: 178 YPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYS 237
YPY+ + C +N A+ A I+GFE+VP+N+E ALL+AV+ QP++VAIDAS + F YS
Sbjct: 217 YPYQVKEGPCRSN--ARPAILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVHYS 274
Query: 238 SGVFTG-SCGTELDHGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGL 295
GV+ +CGT ++H VT VGYG S G KYWL KNSWG+ WGE GYIR++RDV +G+
Sbjct: 275 GGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGM 334
Query: 296 CGIAMQASYPTA 307
CG+A ASYP A
Sbjct: 335 CGVAQYASYPVA 346
>AT3G19400.1 | Symbols: | Cysteine proteinases superfamily protein
| chr3:6725510-6726878 FORWARD LENGTH=362
Length = 362
Score = 324 bits (830), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 153/305 (50%), Positives = 209/305 (68%), Gaps = 3/305 (0%)
Query: 5 HEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKA 64
+EQW+ + K Y EKE R++IFK+N++ ++ N+ ++++++G+ FADLTNEEF+A
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRA 103
Query: 65 RN-RFKGHMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAAT 123
R K +S KT + Y+ +P +DWR GAV +K+QG CG CWAFSAV A
Sbjct: 104 IYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAV 163
Query: 124 EGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYKGV 183
EGI +++TG+LISLSEQELVDCD V+ GC+GG+M+ AF+FIM+N G+ T+ YPY
Sbjct: 164 EGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 184 D-ATCNANAEAKD-AASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGVF 241
D CNA+ +I G+EDVP + E +L KAVA+QP+SVAI+AS FQ Y SGV
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVM 283
Query: 242 TGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGIAMQ 301
TG+CG LDHGV VGYGS G YW+++NSWG WG+ GY+++QR++ G CGIAM
Sbjct: 284 TGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMM 343
Query: 302 ASYPT 306
SYPT
Sbjct: 344 PSYPT 348
>AT1G06260.1 | Symbols: | Cysteine proteinases superfamily protein
| chr1:1916449-1917585 FORWARD LENGTH=343
Length = 343
Score = 318 bits (815), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 160/310 (51%), Positives = 208/310 (67%), Gaps = 12/310 (3%)
Query: 1 MHERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNE 60
+ +R E+W+ K+Y E LR+ I++ NVQ I+ N+ + +KL N FAD+TN
Sbjct: 39 LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSL-HLPFKLTDNRFADMTNS 97
Query: 61 EFKARNRFKGHMCSNSTKTPTFKYERVT-----SVPASLDWRQKGAVTPIKNQGQCGCCW 115
EFKA F G N++ K +R +VP ++DWR +GAVTPI+NQG+CG CW
Sbjct: 98 EFKAH--FLG---LNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCW 152
Query: 116 AFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTE 175
AFSAVAA EGI K+ TG L+SLSEQ+L+DCD ++GC GGLM+ AF+FI N GL TE
Sbjct: 153 AFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATE 212
Query: 176 AKYPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQF 235
YPY G++ TC+ +I+G++ V A +E++L A A QP+SV IDA G FQ
Sbjct: 213 TDYPYTGIEGTCDQEKSKNKVVTIQGYQKV-AQNEASLQIAAAQQPVSVGIDAGGFIFQL 271
Query: 236 YSSGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGL 295
YSSGVFT CGT L+HGVT VGYG +G KYW+VKNSWG WGE+GYIRM+R V+ + G
Sbjct: 272 YSSGVFTNYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGK 331
Query: 296 CGIAMQASYP 305
CGIAM ASYP
Sbjct: 332 CGIAMMASYP 341
>AT4G11310.1 | Symbols: | Papain family cysteine protease |
chr4:6883594-6885318 FORWARD LENGTH=364
Length = 364
Score = 306 bits (783), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 151/312 (48%), Positives = 204/312 (65%), Gaps = 22/312 (7%)
Query: 6 EQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKAR 65
E WM + GKVY EKE R IF++N++ I NA N SY+LG+ FADL+ E+K
Sbjct: 50 ESWMVKHGKVYGSVAEKERRLTIFEDNLRFINN-RNAENLSYRLGLTGFADLSLHEYK-- 106
Query: 66 NRFKGHMCSNSTKTPTFKYERVTS-----------VPASLDWRQKGAVTPIKNQGQCGCC 114
+C + P + +TS +P S+DWR +GAVT +K+QG C C
Sbjct: 107 -----EVCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSC 161
Query: 115 WAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNT 174
WAFS V A EG+ K+ TG+L++LSEQ+L++C+ + + GC GG ++ A++FIM+N GL T
Sbjct: 162 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE--NNGCGGGKLETAYEFIMKNGGLGT 219
Query: 175 EAKYPYKGVDATCNANA-EAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEF 233
+ YPYK V+ C+ E I G+E++PAN ESAL+KAVA+QP++ ID+S EF
Sbjct: 220 DNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREF 279
Query: 234 QFYSSGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEE 293
Q Y SGVF GSCGT L+HGV VGYG++ G YWLVKNS G WGE GY++M R++A
Sbjct: 280 QLYESGVFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPR 339
Query: 294 GLCGIAMQASYP 305
GLCGIAM+ASYP
Sbjct: 340 GLCGIAMRASYP 351
>AT4G11320.1 | Symbols: | Papain family cysteine protease |
chr4:6887336-6888827 FORWARD LENGTH=371
Length = 371
Score = 304 bits (778), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 204/312 (65%), Gaps = 22/312 (7%)
Query: 6 EQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKAR 65
E WM + GKVY EKE R IF++N++ I NA N SY+LG+N FADL+ E+
Sbjct: 57 ESWMVKHGKVYDSVAEKERRLTIFEDNLRFITN-RNAENLSYRLGLNRFADLSLHEY--- 112
Query: 66 NRFKGHMCSNSTKTPTFKYERVTS-----------VPASLDWRQKGAVTPIKNQGQCGCC 114
G +C + P + +TS +P S+DWR +GAVT +K+QG C C
Sbjct: 113 ----GEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSC 168
Query: 115 WAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNT 174
WAFS V A EG+ K+ TG+L++LSEQ+L++C+ + + GC GG ++ A++FIM N GL T
Sbjct: 169 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE--NNGCGGGKVETAYEFIMNNGGLGT 226
Query: 175 EAKYPYKGVDATCNANA-EAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEF 233
+ YPYK ++ C E I G+E++PAN E+AL+KAVA+QP++ +D+S EF
Sbjct: 227 DNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREF 286
Query: 234 QFYSSGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEE 293
Q Y SGVF G+CGT L+HGV VGYG++ G YW+VKNS G+ WGE GY++M R++A
Sbjct: 287 QLYESGVFDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR 346
Query: 294 GLCGIAMQASYP 305
GLCGIAM+ASYP
Sbjct: 347 GLCGIAMRASYP 358
>AT4G23520.1 | Symbols: | Cysteine proteinases superfamily protein
| chr4:12274457-12276219 REVERSE LENGTH=356
Length = 356
Score = 301 bits (771), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 153/309 (49%), Positives = 205/309 (66%), Gaps = 17/309 (5%)
Query: 6 EQWMAQFGKVYKDSY-EKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKA 64
+ WM++ GK Y ++ EKE R+Q FK+N++ I+ +NA N SY+LG+ FADLT +E+
Sbjct: 48 QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQ-HNAKNLSYQLGLTRFADLTVQEY-- 104
Query: 65 RNRFKGHMCSNSTKTPTFKYERV------TSVPASLDWRQKGAVTPIKNQGQCGCCWAFS 118
R+ F G S K K R +P S+DWRQ+GAV+ IK+QG C CWAFS
Sbjct: 105 RDLFPG---SPKPKQRNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFS 161
Query: 119 AVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEG-GLMDDAFKFIMQNKGLNTEAK 177
VAA EG+ K+ TG+LISLSEQELVDC+ V+ GC G GLMD AF+F++ N GL++E
Sbjct: 162 TVAAVEGLNKIVTGELISLSEQELVDCNL--VNNGCYGSGLMDTAFQFLINNNGLDSEKD 219
Query: 178 YPYKGVDATCN-ANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFY 236
YPY+G +CN + + +I +EDVPAN E +L KAVA+QP+SV +D EF Y
Sbjct: 220 YPYQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLY 279
Query: 237 SSGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLC 296
S ++ G CGT LDH + VGYGS+ G YW+V+NSWG WG+ GYI++ R+ +GLC
Sbjct: 280 RSCIYNGPCGTNLDHALVIVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLC 339
Query: 297 GIAMQASYP 305
GIAM ASYP
Sbjct: 340 GIAMLASYP 348
>AT1G09850.1 | Symbols: XBCP3 | xylem bark cysteine peptidase 3 |
chr1:3201848-3203875 FORWARD LENGTH=437
Length = 437
Score = 294 bits (753), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 193/308 (62%), Gaps = 6/308 (1%)
Query: 1 MHERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNE 60
+ E + W + GK Y E++ R QIFK+N + N N +Y L +N FADLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 61 EFKARNRFKGHMCSNSTKTPTFKYERV---TSVPASLDWRQKGAVTPIKNQGQCGCCWAF 117
EFKA G S + K + + VP S+DWR+KGAVT +K+QG CG CW+F
Sbjct: 88 EFKASRL--GLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145
Query: 118 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAK 177
SA A EGI ++ TG LISLSEQEL+DCD K + GC GGLMD AF+F+++N G++TE
Sbjct: 146 SATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204
Query: 178 YPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYS 237
YPY+ D TC + + +I + V +N E AL++AVA QP+SV I S FQ YS
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264
Query: 238 SGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCG 297
SG+F+G C T LDH V VGYGS G YW+VKNSWG+ WG G++ MQR+ +G+CG
Sbjct: 265 SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCG 324
Query: 298 IAMQASYP 305
I M ASYP
Sbjct: 325 INMLASYP 332
>AT1G29110.1 | Symbols: | Cysteine proteinases superfamily protein
| chr1:10171683-10173071 FORWARD LENGTH=334
Length = 334
Score = 280 bits (717), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 198/310 (63%), Gaps = 22/310 (7%)
Query: 5 HEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKA 64
H+QWM QF +VYKD EKE+R ++FK+N++ IE FNN GN+SY LG+N F D EEF A
Sbjct: 38 HQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLA 97
Query: 65 RNR-FKGHMCS-----NSTK-TPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAF 117
+ + ++ S N TK + + + S DWR +GAVTP+K QG C
Sbjct: 98 THTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEGAVTPVKYQGACR----- 152
Query: 118 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAK 177
+TK+S L++LSEQ+L+DCD + + GC GG ++AFK+I++N G++ E +
Sbjct: 153 --------LTKISGKNLLTLSEQQLIDCDIEK-NGGCNGGEFEEAFKYIIKNGGVSLETE 203
Query: 178 YPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYS 237
YPY+ +C ANA I+GF+ VP+++E ALL+AV QP+SV IDA F Y
Sbjct: 204 YPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYK 263
Query: 238 SGVFTG-SCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLC 296
GV+ G CGT+++H VT VGYG+ G YW++KNSWGE WGE GY+R++RDV +G+C
Sbjct: 264 GGVYAGLDCGTDVNHAVTIVGYGTMSGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMC 323
Query: 297 GIAMQASYPT 306
GIA A+YP
Sbjct: 324 GIAQVAAYPV 333
>AT3G43960.1 | Symbols: | Cysteine proteinases superfamily protein
| chr3:15774122-15775628 REVERSE LENGTH=376
Length = 376
Score = 257 bits (656), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 194/309 (62%), Gaps = 12/309 (3%)
Query: 5 HEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKA 64
+EQW+ + GK Y EKE R++IFK+N++RIE N+ N+SY+ G+N F+DLT +EF+A
Sbjct: 41 YEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQA 100
Query: 65 RNRFKGHM--CSNSTKTPTFKYERVTSVPASLDWRQKGAVTP-IKNQGQCGCCWAFSAVA 121
+ G M S S ++Y+ +P +DWR++GAV P +K QG+CG CWAF+A
Sbjct: 101 -SYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATG 159
Query: 122 ATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYK 181
A EGI +++TG+L+SLSEQEL+DCD + GC GG AF+FI +N G+ ++ Y Y
Sbjct: 160 AVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYT 219
Query: 182 GVD-ATCNA-NAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSG 239
G D A C A + +I G E VP N E +L KAVA QPISV I S + Y SG
Sbjct: 220 GEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMI--SAANMSDYKSG 277
Query: 240 VFTGSCGTEL-DHGVTAVGYG--SDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLC 296
V+ G+C DH V VGYG SD G YWL++NSWG +WGE GY+R+QR+ G C
Sbjct: 278 VYKGACSNLWGDHNVLIVGYGTSSDEG-DYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKC 336
Query: 297 GIAMQASYP 305
+A+ YP
Sbjct: 337 AVAVAPVYP 345
>AT4G35350.2 | Symbols: XCP1 | xylem cysteine peptidase 1 |
chr4:16810529-16811578 FORWARD LENGTH=288
Length = 288
Score = 244 bits (623), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 121/243 (49%), Positives = 162/243 (66%), Gaps = 5/243 (2%)
Query: 1 MHERHEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNE 60
+ E E WM++ K YK EK R+++F+EN+ I+ NN N SY LG+N FADLT+E
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 61 EFKAR--NRFKGHMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFS 118
EFK R K + F+Y +T +P S+DWR+KGAV P+K+QGQCG CWAFS
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFS 165
Query: 119 AVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKY 178
VAA EGI +++TG L SLSEQEL+DCDT + GC GGLMD AF++I+ GL+ E Y
Sbjct: 166 TVAAVEGINQITTGNLSSLSEQELIDCDTT-FNSGCNGGLMDYAFQYIISTGGLHKEDDY 224
Query: 179 PYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSS 238
PY + C E + +I G+EDVP N + +L+KA+A+QP+SVAI+ASG +FQFY
Sbjct: 225 PYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFY-K 283
Query: 239 GVF 241
GV+
Sbjct: 284 GVY 286
>AT3G19400.2 | Symbols: | Cysteine proteinases superfamily protein
| chr3:6725510-6726557 FORWARD LENGTH=290
Length = 290
Score = 231 bits (589), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 115/240 (47%), Positives = 161/240 (67%), Gaps = 3/240 (1%)
Query: 5 HEQWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKA 64
+EQW+ + K Y EKE R++IFK+N++ ++ N+ ++++++G+ FADLTNEEF+A
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRA 103
Query: 65 RN-RFKGHMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAAT 123
R K +S KT + Y+ +P +DWR GAV +K+QG CG CWAFSAV A
Sbjct: 104 IYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAV 163
Query: 124 EGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYKGV 183
EGI +++TG+LISLSEQELVDCD V+ GC+GG+M+ AF+FIM+N G+ T+ YPY
Sbjct: 164 EGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 184 D-ATCNANAEAKD-AASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGVF 241
D CNA+ +I G+EDVP + E +L KAVA+QP+SVAI+AS FQ Y S F
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSVNF 283
>AT5G60360.1 | Symbols: SAG2, AALP, ALP | aleurain-like protease |
chr5:24280044-24282152 FORWARD LENGTH=358
Length = 358
Score = 229 bits (583), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 182/315 (57%), Gaps = 22/315 (6%)
Query: 4 RHEQWMAQF----GKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTN 59
RH A+F GK Y++ E +LR+ IFKEN+ I + N G SYKLG+N FADLT
Sbjct: 54 RHVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKG-LSYKLGVNQFADLTW 112
Query: 60 EEFKARNRFKGHMCSNSTKTPTFKYERVT--SVPASLDWRQKGAVTPIKNQGQCGCCWAF 117
+EF+ CS + K +VT ++P + DWR+ G V+P+K+QG CG CW F
Sbjct: 113 QEFQRTKLGAAQNCSATLKGS----HKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTF 168
Query: 118 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAK 177
S A E + GK ISLSEQ+LVDC + GC GGL AF++I N GL+TE
Sbjct: 169 STTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKA 228
Query: 178 YPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVA-NQPISVAIDASGSEFQFY 236
YPY G D TC +AE + ++ +E L AV +P+S+A + S F+ Y
Sbjct: 229 YPYTGKDETCKFSAENVGVQVLNSV-NITLGAEDELKHAVGLVRPVSIAFEVIHS-FRLY 286
Query: 237 SSGVFTGS-CGT---ELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAE 292
SGV+T S CG+ +++H V AVGYG + G YWL+KNSWG WG++GY +M+
Sbjct: 287 KSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMG---- 342
Query: 293 EGLCGIAMQASYPTA 307
+ +CGIA ASYP
Sbjct: 343 KNMCGIATCASYPVV 357
>AT5G60360.2 | Symbols: AALP, ALP | aleurain-like protease |
chr5:24280044-24282152 FORWARD LENGTH=357
Length = 357
Score = 222 bits (565), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 181/315 (57%), Gaps = 23/315 (7%)
Query: 4 RHEQWMAQF----GKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTN 59
RH A+F GK Y++ E +LR+ IFKEN+ I + N G SYKLG+N FADLT
Sbjct: 54 RHVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKG-LSYKLGVNQFADLTW 112
Query: 60 EEFKARNRFKGHMCSNSTKTPTFKYERVT--SVPASLDWRQKGAVTPIKNQGQCGCCWAF 117
+EF+ CS + K +VT ++P + DWR+ G V+P+K+QG CG CW F
Sbjct: 113 QEFQRTKLGAAQNCSATLKGS----HKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTF 168
Query: 118 SAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAK 177
S A E + GK ISLSEQ+LVDC + GC GGL AF++I N GL+TE
Sbjct: 169 STTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKA 228
Query: 178 YPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVA-NQPISVAIDASGSEFQFY 236
YPY G D TC +AE + ++ +E L AV +P+S+A + S F+ Y
Sbjct: 229 YPYTGKDETCKFSAENVGVQVLNSV-NITLGAEDELKHAVGLVRPVSIAFEVIHS-FRLY 286
Query: 237 SSGVFTGS-CGT---ELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAE 292
SGV+T S CG+ +++H V AVGYG + G YWL+KNSWG WG++GY +M+
Sbjct: 287 KSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMG---- 342
Query: 293 EGLCGIAMQASYPTA 307
+ +C IA ASYP
Sbjct: 343 KNMC-IATCASYPVV 356
>AT3G45310.1 | Symbols: | Cysteine proteinases superfamily protein
| chr3:16628704-16630473 REVERSE LENGTH=358
Length = 358
Score = 219 bits (558), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 176/308 (57%), Gaps = 18/308 (5%)
Query: 7 QWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKARN 66
++ ++GK Y+ E +LR+ +FKEN+ I + N G SYKL +N FADLT +EF+
Sbjct: 61 RFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKG-LSYKLSLNQFADLTWQEFQRYK 119
Query: 67 RFKGHMCSNSTKTPTFKYERVT--SVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAATE 124
CS + K ++T +VP + DWR+ G V+P+K QG CG CW FS A E
Sbjct: 120 LGAAQNCSATLKGS----HKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALE 175
Query: 125 GITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYKGVD 184
+ GK ISLSEQ+LVDC + GC GGL AF++I N GL+TE YPY G D
Sbjct: 176 AAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 235
Query: 185 ATCNANAEAKDAASIKGFEDVPANSESALLKAVA-NQPISVAIDASGSEFQFYSSGVFTG 243
C +A+ ++ ++ +E L AV +P+SVA + EF+FY GVFT
Sbjct: 236 GGCKFSAK-NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVV-HEFRFYKKGVFTS 293
Query: 244 -SCGT---ELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGIA 299
+CG +++H V AVGYG + YWL+KNSWG +WG+ GY +M+ + +CG+A
Sbjct: 294 NTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMG----KNMCGVA 349
Query: 300 MQASYPTA 307
+SYP
Sbjct: 350 TCSSYPVV 357
>AT5G60360.3 | Symbols: AALP, ALP | aleurain-like protease |
chr5:24280044-24282157 FORWARD LENGTH=361
Length = 361
Score = 217 bits (553), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 171/287 (59%), Gaps = 14/287 (4%)
Query: 7 QWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKARN 66
++ ++GK Y++ E +LR+ IFKEN+ I + N G SYKLG+N FADLT +EF+
Sbjct: 61 RFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKG-LSYKLGVNQFADLTWQEFQRTK 119
Query: 67 RFKGHMCSNSTKTPTFKYERVT--SVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAATE 124
CS + K +VT ++P + DWR+ G V+P+K+QG CG CW FS A E
Sbjct: 120 LGAAQNCSATLKGS----HKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALE 175
Query: 125 GITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYKGVD 184
+ GK ISLSEQ+LVDC + GC GGL AF++I N GL+TE YPY G D
Sbjct: 176 AAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD 235
Query: 185 ATCNANAEAKDAASIKGFEDVPANSESALLKAVA-NQPISVAIDASGSEFQFYSSGVFTG 243
TC +AE + ++ +E L AV +P+S+A + S F+ Y SGV+T
Sbjct: 236 ETCKFSAENVGVQVLNSV-NITLGAEDELKHAVGLVRPVSIAFEVIHS-FRLYKSGVYTD 293
Query: 244 S-CGT---ELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQ 286
S CG+ +++H V AVGYG + G YWL+KNSWG WG++GY +M+
Sbjct: 294 SHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKME 340
>AT3G45310.2 | Symbols: | Cysteine proteinases superfamily protein
| chr3:16628704-16630473 REVERSE LENGTH=357
Length = 357
Score = 213 bits (541), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 123/308 (39%), Positives = 175/308 (56%), Gaps = 19/308 (6%)
Query: 7 QWMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKARN 66
++ ++GK Y+ E +LR+ +FKEN+ I + N G SYKL +N FADLT +EF+
Sbjct: 61 RFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKG-LSYKLSLNQFADLTWQEFQRYK 119
Query: 67 RFKGHMCSNSTKTPTFKYERVT--SVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAATE 124
CS + K ++T +VP + DWR+ G V+P+K QG CG CW FS A E
Sbjct: 120 LGAAQNCSATLKGS----HKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALE 175
Query: 125 GITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYKGVD 184
+ GK ISLSEQ+LVDC + GC GGL AF++I N GL+TE YPY G D
Sbjct: 176 AAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 235
Query: 185 ATCNANAEAKDAASIKGFEDVPANSESALLKAVA-NQPISVAIDASGSEFQFYSSGVFTG 243
C +A+ ++ ++ +E L AV +P+SVA + EF+FY GVFT
Sbjct: 236 GGCKFSAK-NIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVV-HEFRFYKKGVFTS 293
Query: 244 -SCGT---ELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGIA 299
+CG +++H V AVGYG + YWL+KNSWG +WG+ GY +M+ + +C +A
Sbjct: 294 NTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMG----KNMC-VA 348
Query: 300 MQASYPTA 307
+SYP
Sbjct: 349 TCSSYPVV 356
>AT4G39090.1 | Symbols: RD19, RD19A | Papain family cysteine
protease | chr4:18215826-18217326 REVERSE LENGTH=368
Length = 368
Score = 209 bits (532), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 169/314 (53%), Gaps = 34/314 (10%)
Query: 11 QFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFK-----AR 65
+FGKVY + E + R+ +FK N++R + S G+ F+DLT EF+ R
Sbjct: 57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKL-DPSATHGVTQFSDLTRSEFRKKHLGVR 115
Query: 66 NRFKGHMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAATEG 125
+ FK + ++ K P E ++P DWR GAVTP+KNQG CG CW+FSA A EG
Sbjct: 116 SGFK--LPKDANKAPILPTE---NLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEG 170
Query: 126 ITKLSTGKLISLSEQELVDCDTK-------GVDQGCEGGLMDDAFKFIMQNKGLNTEAKY 178
L+TGKL+SLSEQ+LVDCD + D GC GGLM+ AF++ ++ GL E Y
Sbjct: 171 ANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDY 230
Query: 179 PYKGVDA-TCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYS 237
PY G D TC + ++K AS+ F + + E V N P++VAI+A Q Y
Sbjct: 231 PYTGKDGKTCKLD-KSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINA--GYMQTYI 287
Query: 238 SGVFTGS-CGTELDHGVTAVGYGSDG-------GTKYWLVKNSWGEQWGEQGYIRMQRDV 289
GV C L+HGV VGYG+ G YW++KNSWGE WGE G+ + +
Sbjct: 288 GGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYK----I 343
Query: 290 AAEEGLCGIAMQAS 303
+CG+ S
Sbjct: 344 CKGRNICGVDSMVS 357
>AT2G21430.1 | Symbols: | Papain family cysteine protease |
chr2:9171964-9173301 REVERSE LENGTH=361
Length = 361
Score = 206 bits (523), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 165/306 (53%), Gaps = 28/306 (9%)
Query: 11 QFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKARNR-FK 69
+FGKVY E R+ +FK N+ R + S + G+ F+DLT EF+ ++ K
Sbjct: 54 KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKM-DPSARHGVTQFSDLTRSEFRRKHLGVK 112
Query: 70 G--HMCSNSTKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAATEGIT 127
G + ++ + P + ++P DWR +GAVTP+KNQG CG CW+FS A EG
Sbjct: 113 GGFKLPKDANQAPILPTQ---NLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAH 169
Query: 128 KLSTGKLISLSEQELVDCDTK-------GVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPY 180
L+TGKL+SLSEQ+LVDCD + D GC GGLM+ AF++ ++ GL E YPY
Sbjct: 170 FLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPY 229
Query: 181 KGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGV 240
G D +K AS+ F V N + + N P++VAI+A + Q Y GV
Sbjct: 230 TGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINA--AYMQTYIGGV 287
Query: 241 FTG-SCGTELDHGVTAVGYGSDGGTK-------YWLVKNSWGEQWGEQGYIRMQRDVAAE 292
C L+HGV VGYGS G ++ YW++KNSWGE WGE G+ + +
Sbjct: 288 SCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYK----ICKG 343
Query: 293 EGLCGI 298
+CG+
Sbjct: 344 RNICGV 349
>AT4G16190.1 | Symbols: | Papain family cysteine protease |
chr4:9171512-9172877 FORWARD LENGTH=373
Length = 373
Score = 190 bits (482), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 117/312 (37%), Positives = 166/312 (53%), Gaps = 26/312 (8%)
Query: 10 AQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKARNRFK 69
+++ K Y E + R+++FK N++R N + S G+ F+DLT +EF R +F
Sbjct: 60 SKYEKTYATQVEHDHRFRVFKANLRRARR-NQLLDPSAVHGVTQFSDLTPKEF--RRKFL 116
Query: 70 GHMCSN---STKTPTFKYERVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAATEGI 126
G T T T + +P DWR++GAVTP+KNQG CG CW+FSA+ A EG
Sbjct: 117 GLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGA 176
Query: 127 TKLSTGKLISLSEQELVDCD-------TKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYP 179
L+T +L+SLSEQ+LVDCD D GC GGLM++AF++ ++ GL E YP
Sbjct: 177 HFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYP 236
Query: 180 YKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSG 239
Y G D T ++K AS+ F V ++ + V + P+++AI+A Q Y G
Sbjct: 237 YTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMW--MQTYIGG 294
Query: 240 VFTG-SCGTELDHGVTAVGYGSDG-------GTKYWLVKNSWGEQWGEQGYIRMQRDVAA 291
V C DHGV VG+GS G YW++KNSWG WGE GY ++ R
Sbjct: 295 VSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICR---G 351
Query: 292 EEGLCGIAMQAS 303
+CG+ S
Sbjct: 352 PHNMCGMDTMVS 363
>AT3G54940.2 | Symbols: | Papain family cysteine protease |
chr3:20354402-20356127 FORWARD LENGTH=367
Length = 367
Score = 190 bits (482), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 163/319 (51%), Gaps = 36/319 (11%)
Query: 8 WMAQFGKVYKDSYEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKARNR 67
+M+ +GK Y E R IF +NV + A + + S G+ F+DLT EEFK
Sbjct: 54 FMSDYGKNYSTREEYIHRLGIFAKNVLK-AAEHQMMDPSAVHGVTQFSDLTEEEFK--RM 110
Query: 68 FKGHMCSNSTKTPTFKYE----RVTSVPASLDWRQKGAVTPIKNQGQCGCCWAFSAVAAT 123
+ G ++ T E V +P DWR+KG VT +KNQG CG CWAFS A
Sbjct: 111 YTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAA 170
Query: 124 EGITKLSTGKLISLSEQELVDC-------DTKGVDQGCEGGLMDDAFKFIMQNKGLNTEA 176
EG +STGKL+SLSEQ+LVDC D K D GC GGLM +A++++M+ GL E
Sbjct: 171 EGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEER 230
Query: 177 KYPYKGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFY 236
YPY G C + E K A + F +P + V + P++V ++A Q Y
Sbjct: 231 SYPYTGKRGHCKFDPE-KVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNA--VFMQTY 287
Query: 237 SSGVFTGSCGT-----ELDHGVTAVGYGSDG-------GTKYWLVKNSWGEQWGEQGYIR 284
GV SC ++HGV VGYGS G YW++KNSWG++WGE GY +
Sbjct: 288 IGGV---SCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYK 344
Query: 285 MQRDVAAEEGLCGIAMQAS 303
+ R +CGI S
Sbjct: 345 LCRG----HDICGINSMVS 359
>AT1G02305.1 | Symbols: | Cysteine proteinases superfamily protein
| chr1:455816-457974 FORWARD LENGTH=362
Length = 362
Score = 93.6 bits (231), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 118/279 (42%), Gaps = 30/279 (10%)
Query: 36 IEAFNNAGNKSYKLGIN-HFADLTNEEFKARNRFKGHMCSNSTKTPTFKYERVTSVPASL 94
++ N N +K N FA+ T EFK K + P ++ +P
Sbjct: 51 VKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEF 110
Query: 95 D----WRQKGAVTPIKNQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGV 150
D W Q ++ I +QG CG CWAF AV + + +SLS +L+ C
Sbjct: 111 DARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLC 170
Query: 151 DQGCEGGLMDDAFKFIMQNKGLNTEAKYPY--------KGVDATCNANAEAKDAAS---- 198
QGC GG A+++ ++ G+ TE PY G + A+ S
Sbjct: 171 GQGCNGGYPIAAWRY-FKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQL 229
Query: 199 --------IKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELD 250
+ ++ V ++ + + + N P+ VA +F Y SGV+ GT +
Sbjct: 230 WRESKHYGVSAYK-VRSHPDDIMAEVYKNGPVEVAFTVY-EDFAHYKSGVYKHITGTNIG 287
Query: 251 -HGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQR 287
H V +G+G SD G YWL+ N W WG+ GY +++R
Sbjct: 288 GHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRR 326
>AT4G01610.1 | Symbols: | Cysteine proteinases superfamily protein
| chr4:694857-696937 FORWARD LENGTH=359
Length = 359
Score = 92.0 bits (227), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 128/294 (43%), Gaps = 32/294 (10%)
Query: 21 EKELRYQIFKENVQRIEAFNNAGNKSYKLGIN-HFADLTNEEFKARNRFKGHMCSNSTKT 79
+++L +I ++ + ++ N N +K IN F++ T EFK K +
Sbjct: 35 KQKLDSKILQDEI--VKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92
Query: 80 PTFKYERVTSVPASLD----WRQKGAVTPIKNQGQCGCCWAFSAVAATEGITKLSTGKLI 135
P ++ +P + D W Q ++ I +QG CG CWAF AV + + G I
Sbjct: 93 PIVSHDPSLKLPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNI 152
Query: 136 SLSEQELVDCDTKGVDQGCEGGLMDDAFKFI-------------MQNKGLN---TEAKYP 179
SLS +L+ C GC+GG A+++ N G + E YP
Sbjct: 153 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYP 212
Query: 180 YKGVDATCNAN----AEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQF 235
C ++ +E+K S+ + V +N + + + N P+ V+ +F
Sbjct: 213 TPKCSRKCVSDNKLWSESKHY-SVSTYT-VKSNPQDIMAEVYKNGPVEVSFTVY-EDFAH 269
Query: 236 YSSGVFTGSCGTELD-HGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQR 287
Y SGV+ G+ + H V +G+G S G YWL+ N W WG+ GY ++R
Sbjct: 270 YKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRR 323
>AT4G01610.2 | Symbols: | Cysteine proteinases superfamily protein
| chr4:694857-696937 FORWARD LENGTH=359
Length = 359
Score = 88.6 bits (218), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 126/294 (42%), Gaps = 32/294 (10%)
Query: 21 EKELRYQIFKENVQRIEAFNNAGNKSYKLGIN-HFADLTNEEFKARNRFKGHMCSNSTKT 79
+++L +I ++ + ++ N N +K IN F++ T EFK K +
Sbjct: 35 KQKLDSKILQDEI--VKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92
Query: 80 PTFKYERVTSVPASLD----WRQKGAVTPIKNQGQCGCCWAFSAVAATEGITKLSTGKLI 135
P ++ +P + D W Q ++ I G CG CWAF AV + + G I
Sbjct: 93 PIVSHDPSLKLPKAFDARTAWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFGMNI 152
Query: 136 SLSEQELVDCDTKGVDQGCEGGLMDDAFKFI-------------MQNKGLN---TEAKYP 179
SLS +L+ C GC+GG A+++ N G + E YP
Sbjct: 153 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYP 212
Query: 180 YKGVDATCNAN----AEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQF 235
C ++ +E+K S+ + V +N + + + N P+ V+ +F
Sbjct: 213 TPKCSRKCVSDNKLWSESKHY-SVSTYT-VKSNPQDIMAEVYKNGPVEVSFTVY-EDFAH 269
Query: 236 YSSGVFTGSCGTELD-HGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQR 287
Y SGV+ G+ + H V +G+G S G YWL+ N W WG+ GY ++R
Sbjct: 270 YKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRR 323
>AT1G02300.1 | Symbols: | Cysteine proteinases superfamily protein
| chr1:453288-455376 FORWARD LENGTH=379
Length = 379
Score = 74.3 bits (181), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/300 (24%), Positives = 115/300 (38%), Gaps = 52/300 (17%)
Query: 36 IEAFNNAGNKSYKLGIN-HFADLTNEEFKARNRFKGHMCSNSTK---TPTFKYERVTSVP 91
++ N N +K N FA+ T EFK R G + + T P +++ +P
Sbjct: 48 VKEVNENPNAGWKAAFNDRFANATVAEFK---RLLGVIQTPKTAYLGVPIVRHDLSLKLP 104
Query: 92 ASLDWR---------QKGAVTPIKNQ---------------GQCGCCWAFSAVAATEGIT 127
D R ++ V I N G CG CWAF AV +
Sbjct: 105 KEFDARTAWSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAVESLSDRF 164
Query: 128 KLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFI-------------MQNKGLN- 173
+ +SLS +++ C GC GG A+ + N G +
Sbjct: 165 CIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSH 224
Query: 174 --TEAKYPYKGVDATCNANAEAKDAASIKGF--EDVPANSESALLKAVANQPISVAIDAS 229
E YP + C + + + G + + + + + N P+ VA
Sbjct: 225 PGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVY 284
Query: 230 GSEFQFYSSGVFTGSCGTELD-HGVTAVGYG-SDGGTKYWLVKNSWGEQWGEQGYIRMQR 287
+F Y SGV+ GT++ H V +G+G SD G YWL+ N W WG+ GY +++R
Sbjct: 285 -EDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRR 343
>AT2G22160.1 | Symbols: | Cysteine proteinases superfamily protein
| chr2:9425143-9425460 REVERSE LENGTH=105
Length = 105
Score = 63.2 bits (152), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 53/96 (55%), Gaps = 12/96 (12%)
Query: 20 YEKELRYQIFKENVQRIEAFNNAGNKSYKLGINHFADLTNEEFKARNRFKGHMCSNSTK- 78
++ E + +FK+N + I N K YKL +N FA+LT+ EF H C + +
Sbjct: 9 HQTESSFDVFKKNAEYIVK-TNKERKPYKLKLNKFANLTDVEF-----VNAHTCFDMSDH 62
Query: 79 -----TPTFKYERVTSVPASLDWRQKGAVTPIKNQG 109
+ F YE +T P SLDWR+KGAVT +K+QG
Sbjct: 63 KKILDSKPFFYENMTQAPDSLDWREKGAVTNVKDQG 98