[1][TOP] >UniRef100_A8JGJ0 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8JGJ0_CHLRE Length = 272 Score = 218 bits (556), Expect = 2e-55 Identities = 100/100 (100%), Positives = 100/100 (100%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW Sbjct: 173 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 232 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDR 301 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDR Sbjct: 233 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDR 272 [2][TOP] >UniRef100_A7SGN6 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7SGN6_NEMVE Length = 253 Score = 153 bits (386), Expect = 1e-35 Identities = 69/98 (70%), Positives = 79/98 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ RGPI+CGI+AT +LE YTGGVY E NP INH+VSVVGWGVD VEYWV+RNSW Sbjct: 156 IFARGPIACGIEATSRLEQYTGGVYTEYDPNPQINHIVSVVGWGVDD--GVEYWVVRNSW 213 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G PWGE+GFLK+VTS+Y +G GN YNLAIE DC F VP Sbjct: 214 GTPWGENGFLKIVTSSYKNGQGNDYNLAIEQDCAFAVP 251 [3][TOP] >UniRef100_A9V4B3 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V4B3_MONBE Length = 321 Score = 150 bits (379), Expect = 6e-35 Identities = 64/99 (64%), Positives = 79/99 (79%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPISCGI A +L+ YTGG++AE N SINH++S+VGWG+D + VEYWV+RNSW Sbjct: 222 IYHRGPISCGIAADTKLDDYTGGIFAEYVPNASINHIISIVGWGLDAASGVEYWVVRNSW 281 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPD 298 G+PWGE GF ++VTS Y +G GN YNLAIE DC +GVPD Sbjct: 282 GQPWGEQGFFRIVTSKYMNGTGNDYNLAIEQDCAWGVPD 320 [4][TOP] >UniRef100_Q58HG7 Cathepsin Z n=1 Tax=Cyprinus carpio RepID=Q58HG7_CYPCA Length = 301 Score = 147 bits (372), Expect = 4e-34 Identities = 65/98 (66%), Positives = 80/98 (81%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI 
ATD+L+ YTGGVY+E +P INH++SV GWGVD E VEYWV+RNSW Sbjct: 198 IYSGGPISCGIMATDKLDAYTGGVYSEYVQDPYINHIISVAGWGVD-ENGVEYWVVRNSW 256 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLAIE++C +G P Sbjct: 257 GEPWGEKGWLRIVTSAYKGGSGSQYNLAIEENCMYGDP 294 [5][TOP] >UniRef100_Q4SS50 Chromosome 11 SCAF14479, whole genome shotgun sequence. (Fragment) n=2 Tax=Tetraodon nigroviridis RepID=Q4SS50_TETNG Length = 297 Score = 147 bits (371), Expect = 5e-34 Identities = 64/98 (65%), Positives = 81/98 (82%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPISCGI ATD+L+ Y+GGVY+E + +P INH+VSV GWG+ E VEYW++RNSW Sbjct: 200 IYARGPISCGIMATDKLDAYSGGVYSEYQESPFINHIVSVAGWGM--EDGVEYWIVRNSW 257 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLA+E+DC FG P Sbjct: 258 GEPWGERGWLRIVTSAYKGGSGSSYNLAVEEDCMFGDP 295 [6][TOP] >UniRef100_UPI0000E49DA9 PREDICTED: similar to cathepsin Z precursor n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49DA9 Length = 219 Score = 147 bits (370), Expect = 7e-34 Identities = 62/98 (63%), Positives = 74/98 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY +GPISCGIDAT +LE YTGG+Y E K NH++SV GWGVD T EYW++RNSW Sbjct: 119 IYAKGPISCGIDATSKLEAYTGGIYEEFKIVAISNHIISVAGWGVDNSTGTEYWIVRNSW 178 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+ ++VTS Y DG GN YNLAIE +C +G P Sbjct: 179 GEPWGEQGWFRIVTSRYKDGEGNWYNLAIEGECRYGDP 216 [7][TOP] >UniRef100_UPI0000D8DB68 hypothetical protein LOC450022 n=1 Tax=Danio rerio RepID=UPI0000D8DB68 Length = 301 Score = 147 bits (370), Expect = 7e-34 Identities = 65/98 (66%), Positives = 79/98 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI ATD+L+ YTGG+Y+E P INH+VSV GWGVD E VE+WV+RNSW Sbjct: 198 IYSGGPISCGIMATDKLDAYTGGLYSEYVQEPYINHIVSVAGWGVD-ENGVEFWVVRNSW 256 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLAIE+DC +G P 
Sbjct: 257 GEPWGEKGWLRIVTSAYKGGSGSQYNLAIEEDCMYGDP 294 [8][TOP] >UniRef100_Q5XJD4 Zgc:103420 n=1 Tax=Danio rerio RepID=Q5XJD4_DANRE Length = 301 Score = 147 bits (370), Expect = 7e-34 Identities = 65/98 (66%), Positives = 79/98 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI ATD+L+ YTGG+Y+E P INH+VSV GWGVD E VE+WV+RNSW Sbjct: 198 IYSGGPISCGIMATDKLDAYTGGLYSEYVQEPYINHIVSVAGWGVD-ENGVEFWVVRNSW 256 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLAIE+DC +G P Sbjct: 257 GEPWGEKGWLRIVTSAYKGGSGSQYNLAIEEDCMYGDP 294 [9][TOP] >UniRef100_Q6JZV5 Cathepsin Z n=1 Tax=Fundulus heteroclitus RepID=Q6JZV5_FUNHE Length = 303 Score = 146 bits (369), Expect = 9e-34 Identities = 69/107 (64%), Positives = 83/107 (77%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPISCGI ATDQL+ YTGG+Y+E + INH+VSV GWGV E VEYWV+RNSW Sbjct: 201 IYARGPISCGIMATDQLDAYTGGLYSEYQEEAFINHIVSVAGWGV--EDGVEYWVVRNSW 258 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRWVPADDL 322 GEPWGE G+L++VTSAY G+G+ YNLA+E+DC +G D VP D L Sbjct: 259 GEPWGEKGWLRIVTSAYKGGSGSKYNLALEEDCMYG--DPIVPKDYL 303 [10][TOP] >UniRef100_Q6INK5 MGC82409 protein n=1 Tax=Xenopus laevis RepID=Q6INK5_XENLA Length = 296 Score = 145 bits (366), Expect = 2e-33 Identities = 64/98 (65%), Positives = 78/98 (79%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYK GPISCGI ATD+L+ YTGG+YAE + INH++SV GWG+D E VEYW++RNSW Sbjct: 197 IYKNGPISCGIMATDKLDAYTGGLYAEYQPRAMINHIISVAGWGLD-ENGVEYWIVRNSW 255 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G G YNLAIE+DC +G P Sbjct: 256 GEPWGERGWLRIVTSAYKGGKGADYNLAIEEDCAYGDP 293 [11][TOP] >UniRef100_C1BJN5 Cathepsin Z n=1 Tax=Osmerus mordax RepID=C1BJN5_OSMMO Length = 300 Score = 144 bits (364), Expect = 3e-33 Identities = 62/98 (63%), Positives = 80/98 (81%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 +Y 
GPISCGI AT++L+ YTGG+Y+E +PSINH+VSV GWGV E VEYW++RNSW Sbjct: 198 LYAGGPISCGIMATEKLDAYTGGLYSEYVESPSINHIVSVAGWGV--ENGVEYWIVRNSW 255 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLA+E+DC +G P Sbjct: 256 GEPWGEKGWLRIVTSAYKGGSGSQYNLALEEDCMYGDP 293 [12][TOP] >UniRef100_Q63ZI5 LOC494800 protein n=1 Tax=Xenopus laevis RepID=Q63ZI5_XENLA Length = 296 Score = 144 bits (363), Expect = 4e-33 Identities = 64/98 (65%), Positives = 79/98 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYK GPISCGI AT++L+ YTGG+YAE + + INH+VSV GWG+D E VEYW++RNSW Sbjct: 197 IYKNGPISCGIMATEKLDAYTGGLYAEFQPSAMINHIVSVAGWGLD-ENGVEYWIVRNSW 255 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G G YNLAIE+DC +G P Sbjct: 256 GEPWGERGWLRIVTSAYKGGKGADYNLAIEEDCAYGDP 293 [13][TOP] >UniRef100_Q64HX9 Cathepsin Y n=1 Tax=Oncorhynchus mykiss RepID=Q64HX9_ONCMY Length = 290 Score = 144 bits (362), Expect = 6e-33 Identities = 62/98 (63%), Positives = 79/98 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI ATD+L+ YTGG+Y+E P INH+VSV GWG+D E VE+W++RNSW Sbjct: 187 IYAGGPISCGIMATDKLDAYTGGLYSEYIQEPYINHIVSVAGWGLD-ENGVEFWIVRNSW 245 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLA+E+DC +G P Sbjct: 246 GEPWGEKGWLRIVTSAYKGGSGSQYNLALEEDCMYGDP 283 [14][TOP] >UniRef100_C1BFQ4 Cathepsin Z n=1 Tax=Oncorhynchus mykiss RepID=C1BFQ4_ONCMY Length = 300 Score = 144 bits (362), Expect = 6e-33 Identities = 62/98 (63%), Positives = 79/98 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI ATD+L+ YTGG+Y+E P INH+VSV GWG+D E VE+W++RNSW Sbjct: 197 IYAGGPISCGIMATDKLDAYTGGLYSEYIQEPYINHIVSVAGWGLD-ENGVEFWIVRNSW 255 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLA+E+DC +G P Sbjct: 256 GEPWGEKGWLRIVTSAYKGGSGSQYNLALEEDCMYGDP 293 [15][TOP] >UniRef100_C0PUU4 Cathepsin Z (Fragment) n=1 Tax=Salmo 
salar RepID=C0PUU4_SALSA Length = 298 Score = 144 bits (362), Expect = 6e-33 Identities = 62/98 (63%), Positives = 79/98 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI ATD+L+ YTGG+Y+E P INH+VSV GWG+D E VE+W++RNSW Sbjct: 195 IYAGGPISCGIMATDKLDAYTGGLYSEYIEEPFINHIVSVAGWGMD-ENGVEFWIVRNSW 253 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLA+E+DC +G P Sbjct: 254 GEPWGEKGWLRIVTSAYKGGSGSQYNLALEEDCMYGDP 291 [16][TOP] >UniRef100_C0PUQ5 Cathepsin Z (Fragment) n=1 Tax=Salmo salar RepID=C0PUQ5_SALSA Length = 296 Score = 143 bits (361), Expect = 8e-33 Identities = 61/98 (62%), Positives = 80/98 (81%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT++L+ YTGG+Y+E P+INH+VSV GWG+D E VE+W++RNSW Sbjct: 193 IYAGGPISCGIMATEKLDAYTGGLYSEYIEEPNINHIVSVAGWGLD-ENGVEFWIVRNSW 251 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLA+E+DC +G P Sbjct: 252 GEPWGEKGWLRIVTSAYKGGSGSQYNLALEEDCMYGDP 289 [17][TOP] >UniRef100_A8E5S3 LOC100127597 protein n=3 Tax=Xenopus (Silurana) tropicalis RepID=A8E5S3_XENTR Length = 296 Score = 143 bits (361), Expect = 8e-33 Identities = 63/98 (64%), Positives = 79/98 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYK GPISCGI AT++L+ YTGG+YAE + + INH+VSV GWG+D E+ EYW++RNSW Sbjct: 197 IYKNGPISCGIMATEKLDAYTGGLYAEYQPSAMINHIVSVAGWGLD-ESGAEYWIVRNSW 255 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G G YNLAIE+DC +G P Sbjct: 256 GEPWGERGWLRIVTSAYKGGKGADYNLAIEEDCAYGDP 293 [18][TOP] >UniRef100_A7SGN5 Predicted protein (Fragment) n=1 Tax=Nematostella vectensis RepID=A7SGN5_NEMVE Length = 252 Score = 142 bits (359), Expect = 1e-32 Identities = 64/98 (65%), Positives = 74/98 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ RGPI+CGI AT L+ YT GVY E ++P INH+VSV+GWGV E EYWV+RNSW Sbjct: 155 
IFARGPIACGIMATPNLDNYTKGVYKEHNTSPEINHIVSVMGWGV--ENGEEYWVVRNSW 212 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G PWGE GF ++VTSAY DG GN YNLAIE DC F VP Sbjct: 213 GTPWGEEGFFRIVTSAYKDGQGNDYNLAIEQDCAFAVP 250 [19][TOP] >UniRef100_UPI000065DA49 UPI000065DA49 related cluster n=1 Tax=Takifugu rubripes RepID=UPI000065DA49 Length = 302 Score = 141 bits (356), Expect = 3e-32 Identities = 61/98 (62%), Positives = 80/98 (81%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPISCGI ATD+L+ Y+GGVY+E + +P INH+VSV GWG+ E E+W++RNSW Sbjct: 200 IYARGPISCGIMATDKLDAYSGGVYSEYQESPLINHIVSVAGWGM--EDGDEFWIVRNSW 257 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G+G+ YNLA+E+DC +G P Sbjct: 258 GEPWGERGWLRIVTSAYKGGSGSSYNLALEEDCMYGDP 295 [20][TOP] >UniRef100_UPI000194DA0C PREDICTED: hypothetical protein, partial n=1 Tax=Taeniopygia guttata RepID=UPI000194DA0C Length = 141 Score = 141 bits (355), Expect = 4e-32 Identities = 61/98 (62%), Positives = 75/98 (76%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT++L+ YTGG+Y E P++NH+VSV GWGV E EYW++RNSW Sbjct: 43 IYTNGPISCGIMATEKLDAYTGGLYTEYNPTPTVNHIVSVAGWGV--ENGTEYWIVRNSW 100 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTSAY G G YNLAIE+DC +G P Sbjct: 101 GEPWGERGWLRIVTSAYKGGRGADYNLAIEEDCTYGDP 138 [21][TOP] >UniRef100_UPI0000F2B676 PREDICTED: similar to CTSZ protein n=1 Tax=Monodelphis domestica RepID=UPI0000F2B676 Length = 309 Score = 140 bits (353), Expect = 6e-32 Identities = 60/98 (61%), Positives = 74/98 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT+ L+ YTGG+Y E P INH++SV GWGV + EYW++RNSW Sbjct: 211 IYANGPISCGIMATEALDNYTGGIYFEYNPQPMINHIISVAGWGVS-DNGTEYWIVRNSW 269 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTS Y DG G+ YNLAIE+ C+FG P Sbjct: 270 GEPWGEKGWLRIVTSRYKDGQGSNYNLAIEETCSFGDP 307 [22][TOP] >UniRef100_UPI00017977C1 
PREDICTED: similar to cathepsin Z n=1 Tax=Equus caballus RepID=UPI00017977C1 Length = 317 Score = 140 bits (352), Expect = 8e-32 Identities = 60/98 (61%), Positives = 73/98 (74%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYK GPISCGI AT+++ YTGG+YAE INH+VSV GWGV +EYW++RNSW Sbjct: 220 IYKNGPISCGIMATEKMANYTGGIYAEYHDTAFINHIVSVAGWGVS--NGIEYWIVRNSW 277 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y DG GN YNL IE+ C FG P Sbjct: 278 GEPWGERGWMRIVTSTYKDGKGNYYNLHIEESCTFGDP 315 [23][TOP] >UniRef100_UPI0000ECA906 Cathepsin Z precursor (EC 3.4.22.-) (Cathepsin X) (Cathepsin P). n=2 Tax=Gallus gallus RepID=UPI0000ECA906 Length = 305 Score = 139 bits (350), Expect = 1e-31 Identities = 59/96 (61%), Positives = 75/96 (78%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT++L+ YTGG+Y E +P++NH+VSV GWGV E EYW++RNSW Sbjct: 207 IYANGPISCGIMATEKLDAYTGGLYTEYNPSPTVNHIVSVAGWGV--ENGTEYWIVRNSW 264 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFG 289 GEPWGE G+L++VTSAY G G YNLA+E+DC +G Sbjct: 265 GEPWGERGWLRIVTSAYKGGRGAEYNLAVEEDCAYG 300 [24][TOP] >UniRef100_A5GFX7 Cathepsin Z n=1 Tax=Sus scrofa RepID=A5GFX7_PIG Length = 304 Score = 139 bits (350), Expect = 1e-31 Identities = 61/98 (62%), Positives = 74/98 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT+++ YTGG+YAE K INH+VSV GWGV T EYW++RNSW Sbjct: 207 IYANGPISCGIMATEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGT--EYWIVRNSW 264 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y DG G YNLAIE++C FG P Sbjct: 265 GEPWGERGWMRIVTSTYKDGRGAHYNLAIEENCTFGDP 302 [25][TOP] >UniRef100_Q4S3W7 Chromosome 20 SCAF14744, whole genome shotgun sequence. 
(Fragment) n=2 Tax=Tetraodon nigroviridis RepID=Q4S3W7_TETNG Length = 288 Score = 139 bits (349), Expect = 2e-31 Identities = 60/96 (62%), Positives = 77/96 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ GPISCGI AT++L+ YTGG+Y+E +P INH+VSV GWGVD T EYW++RNSW Sbjct: 190 IHSGGPISCGIMATEKLDDYTGGLYSEYVESPEINHIVSVAGWGVDNGT--EYWIVRNSW 247 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFG 289 GEPWGE G+L++VTS Y G+G+ YNLA+EDDC +G Sbjct: 248 GEPWGERGWLRIVTSLYKGGSGSKYNLALEDDCMYG 283 [26][TOP] >UniRef100_Q9UBR2 Cathepsin Z n=1 Tax=Homo sapiens RepID=CATZ_HUMAN Length = 303 Score = 139 bits (349), Expect = 2e-31 Identities = 62/98 (63%), Positives = 73/98 (74%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT++L YTGG+YAE + INHVVSV GWG+ T EYW++RNSW Sbjct: 206 IYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGISDGT--EYWIVRNSW 263 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTS Y DG G YNLAIE+ C FG P Sbjct: 264 GEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDP 301 [27][TOP] >UniRef100_Q5U000 Cathepsin Z n=1 Tax=Homo sapiens RepID=Q5U000_HUMAN Length = 303 Score = 138 bits (347), Expect = 3e-31 Identities = 61/98 (62%), Positives = 73/98 (74%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT++L YTGG+YAE + INHVVSV GWG+ T EYW++RNSW Sbjct: 206 IYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGISDGT--EYWIVRNSW 263 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+L++VTS Y DG G YN+AIE+ C FG P Sbjct: 264 GEPWGERGWLRIVTSTYKDGKGARYNIAIEEHCTFGDP 301 [28][TOP] >UniRef100_Q9EPP7 Cathepsin Z n=1 Tax=Cricetulus griseus RepID=Q9EPP7_CRIGR Length = 306 Score = 135 bits (341), Expect = 2e-30 Identities = 57/98 (58%), Positives = 74/98 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI T++++ YTGG+YAE + SINH+VSV GWGV + +EYW++RNSW Sbjct: 208 
IYANGPISCGIMVTEKMDNYTGGIYAELQEQTSINHIVSVAGWGVSSD-GIEYWIVRNSW 266 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y G G YNLAIE+ C +G P Sbjct: 267 GEPWGERGWMRIVTSTYKGGTGASYNLAIEEACTYGDP 304 [29][TOP] >UniRef100_Q9WUU7 Cathepsin Z n=2 Tax=Mus musculus RepID=CATZ_MOUSE Length = 306 Score = 135 bits (341), Expect = 2e-30 Identities = 57/98 (58%), Positives = 72/98 (73%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT+ + YTGG+YAE + INH++SV GWGV + +EYW++RNSW Sbjct: 208 IYANGPISCGIMATEMMSNYTGGIYAEHQDQAVINHIISVAGWGVSND-GIEYWIVRNSW 266 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y G G+ YNLAIE C FG P Sbjct: 267 GEPWGEKGWMRIVTSTYKGGTGDSYNLAIESACTFGDP 304 [30][TOP] >UniRef100_UPI00005A4607 PREDICTED: similar to Cathepsin Z precursor (Cathepsin X) (Cathepsin P) n=1 Tax=Canis lupus familiaris RepID=UPI00005A4607 Length = 375 Score = 135 bits (340), Expect = 2e-30 Identities = 59/98 (60%), Positives = 74/98 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT+++ YTGG++AE + INHV+SVVGWGV T EYW++RNSW Sbjct: 278 IYANGPISCGIMATEKMVNYTGGIHAEYQEQAYINHVISVVGWGVSDGT--EYWIVRNSW 335 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y DG G YNLA+E+ C FG P Sbjct: 336 GEPWGERGWMRIVTSTYKDGKGASYNLAVEEYCTFGDP 373 [31][TOP] >UniRef100_UPI00004BE249 Cathepsin Z precursor (EC 3.4.22.-) (Cathepsin X) (Cathepsin P). 
n=1 Tax=Canis lupus familiaris RepID=UPI00004BE249 Length = 260 Score = 135 bits (340), Expect = 2e-30 Identities = 59/98 (60%), Positives = 74/98 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT+++ YTGG++AE + INHV+SVVGWGV T EYW++RNSW Sbjct: 163 IYANGPISCGIMATEKMVNYTGGIHAEYQEQAYINHVISVVGWGVSDGT--EYWIVRNSW 220 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y DG G YNLA+E+ C FG P Sbjct: 221 GEPWGERGWMRIVTSTYKDGKGASYNLAVEEYCTFGDP 258 [32][TOP] >UniRef100_Q9ES94 Cathepsin Z n=1 Tax=Mus musculus RepID=Q9ES94_MOUSE Length = 307 Score = 134 bits (338), Expect = 4e-30 Identities = 56/98 (57%), Positives = 72/98 (73%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 +Y GPISCGI AT+ + YTGG+YAE + INH++SV GWGV + +EYW++RNSW Sbjct: 208 MYANGPISCGIMATEMMSNYTGGIYAEHQDQAVINHIISVAGWGVSND-GIEYWIVRNSW 266 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y G G+ YNLAIE C FG P Sbjct: 267 GEPWGEKGWMRIVTSTYKGGTGDSYNLAIESACTFGDP 304 [33][TOP] >UniRef100_Q9R1T3 Cathepsin Z n=1 Tax=Rattus norvegicus RepID=CATZ_RAT Length = 306 Score = 134 bits (338), Expect = 4e-30 Identities = 56/98 (57%), Positives = 74/98 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT+++ YTGG+Y E ++ INH++SV GWGV + +EYW++RNSW Sbjct: 208 IYANGPISCGIMATERMSNYTGGIYTEYQNQAIINHIISVAGWGVSND-GIEYWIVRNSW 266 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y G G+ YNLAIE+ C FG P Sbjct: 267 GEPWGERGWMRIVTSTYKGGTGSSYNLAIEEACTFGDP 304 [34][TOP] >UniRef100_UPI00005BDF98 Cathepsin Z (EC 3.4.22.-) n=1 Tax=Bos taurus RepID=UPI00005BDF98 Length = 304 Score = 132 bits (333), Expect = 1e-29 Identities = 57/98 (58%), Positives = 71/98 (72%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT+++ YTGG+Y+E INH+VSV GWGV +EYW++RNSW Sbjct: 207 
IYTNGPISCGIMATEKMSNYTGGIYSEYNDQAFINHIVSVAGWGVSD--GMEYWIVRNSW 264 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y G G YNLAIE+ C FG P Sbjct: 265 GEPWGEHGWMRIVTSTYKGGEGARYNLAIEESCTFGDP 302 [35][TOP] >UniRef100_P05689 Cathepsin Z n=1 Tax=Bos taurus RepID=CATZ_BOVIN Length = 304 Score = 132 bits (333), Expect = 1e-29 Identities = 57/98 (58%), Positives = 71/98 (72%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT+++ YTGG+Y+E INH+VSV GWGV +EYW++RNSW Sbjct: 207 IYTNGPISCGIMATEKMSNYTGGIYSEYNDQAFINHIVSVAGWGVSD--GMEYWIVRNSW 264 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G++++VTS Y G G YNLAIE+ C FG P Sbjct: 265 GEPWGEHGWMRIVTSTYKGGEGARYNLAIEESCTFGDP 302 [36][TOP] >UniRef100_UPI0001925E05 PREDICTED: similar to cathepsin Y n=1 Tax=Hydra magnipapillata RepID=UPI0001925E05 Length = 769 Score = 132 bits (331), Expect = 2e-29 Identities = 60/98 (61%), Positives = 75/98 (76%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYKRGPI+CGI AT + YTGGVY E ++P NH++SV GWGVD E VE+WV RNSW Sbjct: 666 IYKRGPIACGIMATPSFDKYTGGVYTEY-TDPFENHIISVHGWGVD-ENGVEFWVGRNSW 723 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G+PWGE+G+ ++VTS Y+ G G MYNL IE++C F VP Sbjct: 724 GQPWGENGWFRIVTSLYEGGKGGMYNLGIENNCAFAVP 761 [37][TOP] >UniRef100_A8J8M1 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8J8M1_CHLRE Length = 268 Score = 132 bits (331), Expect = 2e-29 Identities = 57/100 (57%), Positives = 74/100 (74%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPISCGI ++ ++ Y GGVYAE + P ++H V+VVGWG + E +E+WV+RN+W Sbjct: 170 IYARGPISCGIASSKGVQAYKGGVYAEYRERPQVSHTVTVVGWGGE-EGGMEFWVVRNNW 228 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDR 301 GE WGE GF++LVTSAY G YNL +E DC+F VPDR Sbjct: 229 GEAWGERGFMRLVTSAYSPGRAAHYNLGVETDCSFAVPDR 268 [38][TOP] >UniRef100_UPI00005893B2 PREDICTED: similar to LOC494800 protein n=1 
Tax=Strongylocentrotus purpuratus RepID=UPI00005893B2 Length = 293 Score = 130 bits (328), Expect = 5e-29 Identities = 59/99 (59%), Positives = 75/99 (75%), Gaps = 1/99 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGG-VYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 I RGPISCG+ T+ + + GG VYAE +S SINH+VSV GWGVD ET VEYW++RNS Sbjct: 193 ISARGPISCGVMVTEAFDAFQGGKVYAEYQSTISINHIVSVAGWGVD-ETGVEYWIVRNS 251 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WG+PW E G++++VTSAY G G+ YNL IE +C +GVP Sbjct: 252 WGQPWAEQGWVRIVTSAYKSGAGDSYNLGIETECAYGVP 290 [39][TOP] >UniRef100_C1BLW5 Cathepsin Z n=1 Tax=Osmerus mordax RepID=C1BLW5_OSMMO Length = 304 Score = 129 bits (325), Expect = 1e-28 Identities = 58/98 (59%), Positives = 71/98 (72%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCG+ AT LE YTGG++AE + +NH++SV GWGV E EYWV+RNSW Sbjct: 205 IYTNGPISCGVMATAGLEAYTGGLFAEFHALSLMNHIISVAGWGVT-EDGTEYWVVRNSW 263 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+ ++VTSAY G GN YNLA+E C +G P Sbjct: 264 GEPWGEYGWARIVTSAYKGGKGNFYNLAVEKKCAYGDP 301 [40][TOP] >UniRef100_UPI0001926221 PREDICTED: similar to cathepsin Z n=1 Tax=Hydra magnipapillata RepID=UPI0001926221 Length = 304 Score = 127 bits (320), Expect = 4e-28 Identities = 58/98 (59%), Positives = 70/98 (71%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+KRGPI+C I AT + YTGG+Y+E NH +SV G+GVD E VE+W+ RNSW Sbjct: 202 IFKRGPIACVIMATPLFDKYTGGIYSEYNEVSIANHAISVHGYGVD-ENGVEFWIGRNSW 260 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+ ++VTS Y DG G YNL IEDDC FGVP Sbjct: 261 GEPWGERGWFRMVTSLYKDGKGGFYNLGIEDDCAFGVP 298 [41][TOP] >UniRef100_UPI0001925E06 PREDICTED: similar to cathepsin Z n=1 Tax=Hydra magnipapillata RepID=UPI0001925E06 Length = 304 Score = 127 bits (320), Expect = 4e-28 Identities = 58/98 (59%), Positives = 70/98 (71%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+KRGPI+C I AT + YTGG+Y+E NH 
+SV G+GVD E VE+W+ RNSW Sbjct: 202 IFKRGPIACVIMATPLFDKYTGGIYSEYNEVSIANHAISVHGYGVD-ENGVEFWIGRNSW 260 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+ ++VTS Y DG G YNL IEDDC FGVP Sbjct: 261 GEPWGERGWFRMVTSLYKDGKGGFYNLGIEDDCAFGVP 298 [42][TOP] >UniRef100_UPI0001863518 hypothetical protein BRAFLDRAFT_77191 n=1 Tax=Branchiostoma floridae RepID=UPI0001863518 Length = 302 Score = 127 bits (320), Expect = 4e-28 Identities = 57/96 (59%), Positives = 71/96 (73%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 +YK GPISCGI AT LE YTGGVY+E + NHV+S+ GWGVD + EYW+ RNSW Sbjct: 199 VYKNGPISCGIMATSGLEKYTGGVYSEFHAISRENHVLSIAGWGVDDD-GTEYWIGRNSW 257 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFG 289 G PWGESG+ K+VTS Y +G G+ YNL IE++C +G Sbjct: 258 GTPWGESGWFKIVTSLYKNGEGDKYNLGIEENCAYG 293 [43][TOP] >UniRef100_UPI000155C10C PREDICTED: similar to LOC548400 protein, partial n=1 Tax=Ornithorhynchus anatinus RepID=UPI000155C10C Length = 91 Score = 127 bits (320), Expect = 4e-28 Identities = 55/90 (61%), Positives = 69/90 (76%) Frame = +2 Query: 26 CGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWGESG 205 CGI AT+ L+ YTGG+Y+E NP INH+VSV GW VD + EYW++RNSWGEPWGE G Sbjct: 1 CGIMATEGLDAYTGGIYSEYNQNPLINHIVSVAGWDVDSD-GTEYWIVRNSWGEPWGERG 59 Query: 206 FLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 +L++VTS Y G G+ YNLAIE+ C+FG P Sbjct: 60 WLRIVTSTYKGGKGSDYNLAIEERCSFGDP 89 [44][TOP] >UniRef100_UPI000175F27B PREDICTED: similar to cathepsin Z cysteine protease n=1 Tax=Danio rerio RepID=UPI000175F27B Length = 301 Score = 125 bits (314), Expect = 2e-27 Identities = 58/98 (59%), Positives = 70/98 (71%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+K GPISC I AT LE Y GGV+AE NH++SV GWGV E EYW++RNSW Sbjct: 202 IFKNGPISCAIMATKGLEAYDGGVFAEFHILSMPNHIISVAGWGVT-EDGTEYWIVRNSW 260 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GE WGESG+ ++VTSAY G GN YN+AIE+DC +G P Sbjct: 261 GEFWGESGWARIVTSAYKGGKGNWYNVAIENDCAYGDP 298 [45][TOP] 
>UniRef100_UPI0001A2D48A UPI0001A2D48A related cluster n=1 Tax=Danio rerio RepID=UPI0001A2D48A Length = 272 Score = 125 bits (314), Expect = 2e-27 Identities = 58/98 (59%), Positives = 70/98 (71%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+K GPISC I AT LE Y GGV+AE NH++SV GWGV E EYW++RNSW Sbjct: 166 IFKNGPISCAIMATKGLEAYDGGVFAEFHILSMPNHIISVAGWGVT-EDGTEYWIVRNSW 224 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GE WGESG+ ++VTSAY G GN YN+AIE+DC +G P Sbjct: 225 GEFWGESGWARIVTSAYKGGKGNWYNVAIENDCAYGDP 262 [46][TOP] >UniRef100_A8WW81 C. briggsae CBR-CPZ-1 protein n=1 Tax=Caenorhabditis briggsae RepID=A8WW81_CAEBR Length = 306 Score = 124 bits (311), Expect = 5e-27 Identities = 57/98 (58%), Positives = 73/98 (74%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY +GPI+CGI AT ETY GG+Y E ++ I+H++SV GWGVD ET VEYW+ RNSW Sbjct: 209 IYHKGPIACGIAATKAFETYAGGIYKE-VTDEDIDHIISVHGWGVDHETGVEYWIGRNSW 267 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+ K+VTS Y + +G+ YNL IE+DC + P Sbjct: 268 GEPWGERGWFKIVTSQYKN-SGSKYNLKIEEDCVWADP 304 [47][TOP] >UniRef100_Q27125 Cathepsin B-like protease n=1 Tax=Urechis caupo RepID=Q27125_URECA Length = 294 Score = 123 bits (309), Expect = 8e-27 Identities = 53/98 (54%), Positives = 70/98 (71%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I + GPISCG+ A + YTGGV+ E INH++SV GWGV E VE+W+ RNSW Sbjct: 196 IMENGPISCGVMADAAFDAYTGGVFKEYHEQADINHIISVAGWGV--ENGVEFWIGRNSW 253 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G+PWGE+G+ ++VTS Y +G+GN YNL IE++C FG P Sbjct: 254 GQPWGENGWFRMVTSKYKNGDGNKYNLGIENECAFGDP 291 [48][TOP] >UniRef100_O01850 Cathepsin Z-like enzyme n=1 Tax=Caenorhabditis elegans RepID=O01850_CAEEL Length = 306 Score = 122 bits (306), Expect = 2e-26 Identities = 56/98 (57%), Positives = 72/98 (73%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY +GPI+CGI AT ETY GG+Y 
E ++ I+H++SV GWGVD E+ VEYW+ RNSW Sbjct: 209 IYHKGPIACGIAATKAFETYAGGIYKE-VTDEDIDHIISVHGWGVDHESGVEYWIGRNSW 267 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GEPWGE G+ K+VTS Y + G+ YNL IE+DC + P Sbjct: 268 GEPWGEHGWFKIVTSQYKNA-GSKYNLKIEEDCVWADP 304 [49][TOP] >UniRef100_C3KJR8 Cathepsin Z n=1 Tax=Anoplopoma fimbria RepID=C3KJR8_9PERC Length = 301 Score = 120 bits (302), Expect = 5e-26 Identities = 54/98 (55%), Positives = 69/98 (70%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISC + ATD LE YTGG+++E NH+VSV GWGV + + EYW++RNSW Sbjct: 202 IYTHGPISCALMATDGLEEYTGGIFSEFHPLSLPNHIVSVAGWGV-ADDETEYWIVRNSW 260 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GE WGE G+ ++VTSAY G GN +NL IE +C +G P Sbjct: 261 GEFWGEHGWARIVTSAYKGGKGNWFNLGIEKNCAYGDP 298 [50][TOP] >UniRef100_Q6E7B0 Cathepsin Z-like cysteine proteinase n=1 Tax=Brugia malayi RepID=Q6E7B0_BRUMA Length = 311 Score = 119 bits (298), Expect = 2e-25 Identities = 55/98 (56%), Positives = 72/98 (73%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPI+CGI AT ETY GG+Y E K+ SI+HV+SV GWGVD ++ V YW+ RNSW Sbjct: 214 IYHHGPIACGIAATKAFETYGGGIYKE-KTEESIDHVISVHGWGVDRDSGVPYWIGRNSW 272 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G PWGE+G+ ++VTS Y + +G+ YNL IE+DC + P Sbjct: 273 GTPWGENGWFRIVTSEYKN-SGSKYNLKIEEDCVWADP 309 [51][TOP] >UniRef100_P91771 Cysteine protease n=1 Tax=Onchocerca volvulus RepID=P91771_ONCVO Length = 306 Score = 119 bits (298), Expect = 2e-25 Identities = 54/98 (55%), Positives = 71/98 (72%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPI+CGI AT ETY GG+Y ER +N I+H++SV GWGVD E+ V YW+ RNSW Sbjct: 209 IYHHGPIACGIAATKAFETYAGGIYNER-TNEDIDHIISVHGWGVDSESGVPYWIGRNSW 267 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G PWGE+G+ ++VTS Y + + + YNL IE+DC + P Sbjct: 268 GTPWGENGWFRIVTSEYKN-SSSKYNLKIEEDCVWADP 304 [52][TOP] >UniRef100_A8QHC0 Cathepsin Z-like cysteine proteinase, 
putative (Fragment) n=1 Tax=Brugia malayi RepID=A8QHC0_BRUMA Length = 177 Score = 119 bits (298), Expect = 2e-25 Identities = 55/98 (56%), Positives = 72/98 (73%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPI+CGI AT ETY GG+Y E K+ SI+HV+SV GWGVD ++ V YW+ RNSW Sbjct: 80 IYHHGPIACGIAATKAFETYGGGIYKE-KTEESIDHVISVHGWGVDRDSGVPYWIGRNSW 138 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G PWGE+G+ ++VTS Y + +G+ YNL IE+DC + P Sbjct: 139 GTPWGENGWFRIVTSEYKN-SGSKYNLKIEEDCVWADP 175 [53][TOP] >UniRef100_Q58HF4 Cathepsin Z cysteine protease n=1 Tax=Paralichthys olivaceus RepID=Q58HF4_PAROL Length = 300 Score = 119 bits (297), Expect = 2e-25 Identities = 54/98 (55%), Positives = 69/98 (70%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 +YK GPISC + AT LE+Y+GGV++E NH+VSV GWGV E EYW+IRNSW Sbjct: 201 LYKNGPISCALMATSGLESYSGGVFSEFHLLSLPNHIVSVAGWGVSEE-GTEYWIIRNSW 259 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GE WGE G+ ++VTS+Y +G GN +NL IE C +G P Sbjct: 260 GEFWGEHGWARIVTSSYKEGEGNWFNLGIEKHCVYGDP 297 [54][TOP] >UniRef100_A5HC51 Cathepsin Z (Fragment) n=1 Tax=Oryctolagus cuniculus RepID=A5HC51_RABIT Length = 173 Score = 118 bits (296), Expect = 3e-25 Identities = 52/82 (63%), Positives = 62/82 (75%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPISCGI AT++L YTGG+YAE + INHVVSV GWG+ T EYW++RNSW Sbjct: 93 IYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGISDGT--EYWIVRNSW 150 Query: 182 GEPWGESGFLKLVTSAYDDGNG 247 GEPWGE G+L++VTS Y DG G Sbjct: 151 GEPWGERGWLRIVTSTYKDGKG 172 [55][TOP] >UniRef100_Q6PN98 Cathepsin Z n=1 Tax=Onchocerca volvulus RepID=Q6PN98_ONCVO Length = 306 Score = 117 bits (294), Expect = 4e-25 Identities = 53/98 (54%), Positives = 70/98 (71%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GPI+CGI AT ETY GG+Y ER +N I+H++S GWGVD E+ V YW+ RNSW Sbjct: 209 
IYHHGPIACGIAATKAFETYAGGIYNER-TNEDIDHIISAHGWGVDSESGVPYWIGRNSW 267 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G PWGE+G+ ++VTS Y + + + YNL IE+DC + P Sbjct: 268 GTPWGENGWFRIVTSEYKN-SSSKYNLKIEEDCVWADP 304 [56][TOP] >UniRef100_A4VE98 Cathepsin z n=1 Tax=Tetrahymena thermophila SB210 RepID=A4VE98_TETTH Length = 585 Score = 117 bits (294), Expect = 4e-25 Identities = 53/98 (54%), Positives = 67/98 (68%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPISCGI T++ E YTGG+Y E + P INH ++VVGWG DP+T VEYW+ RNSW Sbjct: 493 IYARGPISCGIYVTNKFEAYTGGIYKESTAFPMINHEIAVVGWGTDPQTGVEYWIGRNSW 552 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WGE+GF ++ + NLAIE DC++G P Sbjct: 553 GTYWGENGFFRI--------QMHKQNLAIETDCSWGEP 582 Score = 93.2 bits (230), Expect = 1e-17 Identities = 48/103 (46%), Positives = 62/103 (60%), Gaps = 2/103 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLE-TYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 I+ RGPI+C I AT+ L YTGG+Y + S P NHV+ VVGWG E + +YW+IRNS Sbjct: 192 IFNRGPIACYIYATEYLRYNYTGGIYNDTSSYPGTNHVIEVVGWG--EENNEKYWIIRNS 249 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP-DRW 304 WG WGE GF + + NM N+ +CN+ VP D W Sbjct: 250 WGSYWGEKGFYRQLRGV------NMLNIE-SSNCNWAVPLDTW 285 [57][TOP] >UniRef100_Q2M436 Cathepsin-like cysteine protease n=1 Tax=Phytophthora infestans RepID=Q2M436_PHYIN Length = 635 Score = 117 bits (292), Expect = 8e-25 Identities = 52/98 (53%), Positives = 66/98 (67%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYKRGPI CG+ AT + E+YTGG+Y+E P INH +SV GWG D ETD EYW+ RNSW Sbjct: 512 IYKRGPIGCGVHATSKFESYTGGIYSEHVMFPLINHEISVAGWGYDEETDTEYWIGRNSW 571 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WGE+G+ ++ + NL IE DC++GVP Sbjct: 572 GTYWGENGWFRI--------QMHHNNLGIEQDCDWGVP 601 Score = 96.7 bits (239), Expect = 1e-18 Identities = 49/142 (34%), Positives = 77/142 (54%), Gaps = 2/142 (1%) Frame = +2 Query: 2 
IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPI+C + TD Y+GG++ ++ + ++H +S+VGWG E V +WV+RNSW Sbjct: 214 IYARGPIACSVAVTDGFLKYSGGIFDDKTNATDVDHAISIVGWG--EENGVPFWVLRNSW 271 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP--DRWVPADDLGFGRPDDMSQQ 355 G WGESG+++LV + N+ +E +C FGVP D W + D + + Sbjct: 272 GSFWGESGWMRLVR--------GVNNVGVEGECAFGVPRDDGWPTPTKIEEKEEDKVKEP 323 Query: 356 QPKLDADFIAKAKTGGGSRKMI 421 Q + + T GG R+ + Sbjct: 324 QEETSVE-----STLGGCRQKL 340 [58][TOP] >UniRef100_C3YFK2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YFK2_BRAFL Length = 278 Score = 115 bits (289), Expect = 2e-24 Identities = 52/89 (58%), Positives = 65/89 (73%) Frame = +2 Query: 23 SCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWGES 202 SCGI AT LE YTGGVY+E + NHV+S+ GWGVD + EYW+ RNSWG PWGES Sbjct: 182 SCGIMATSGLEKYTGGVYSEFHAMSRENHVLSIAGWGVDDD-GTEYWIGRNSWGTPWGES 240 Query: 203 GFLKLVTSAYDDGNGNMYNLAIEDDCNFG 289 G+ K+VTS Y +G G+ YNL IE++C +G Sbjct: 241 GWFKIVTSLYKNGEGDKYNLGIEENCAYG 269 [59][TOP] >UniRef100_Q9XZI2 Cathepsin Z1 preproprotein n=1 Tax=Toxocara canis RepID=Q9XZI2_TOXCA Length = 307 Score = 114 bits (284), Expect = 6e-24 Identities = 51/98 (52%), Positives = 68/98 (69%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ GPI+CGI AT E Y+GG+Y E S I+H+++V GWGVD ++ V YW+ RNSW Sbjct: 210 IFHNGPIACGIAATKAFEMYSGGIYTEETSE-EIDHIIAVYGWGVDHDSSVPYWIGRNSW 268 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G PWGESG+ ++VTS Y G+ YNL IE+DC + P Sbjct: 269 GTPWGESGWFRVVTSEYKHA-GSRYNLKIEEDCVWADP 305 [60][TOP] >UniRef100_Q234M1 Papain family cysteine protease containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q234M1_TETTH Length = 581 Score = 110 bits (274), Expect = 9e-23 Identities = 53/98 (54%), Positives = 66/98 (67%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ RGPISCGI T++ E YTGGVY+E KS INH ++VVGWGVD T+ EYW+ RNSW Sbjct: 489 
IFARGPISCGIAVTNKFEAYTGGVYSE-KSLTRINHEIAVVGWGVDETTNTEYWIGRNSW 547 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WGE GF ++ + NL IE DC++GVP Sbjct: 548 GTYWGEDGFFRI--------KMHSENLKIETDCSWGVP 577 Score = 92.8 bits (229), Expect = 2e-17 Identities = 47/103 (45%), Positives = 59/103 (57%), Gaps = 2/103 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLE-TYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 I+ RGPI CGI + D L YTGG+Y NH +SVVGWGV E +YW++RNS Sbjct: 191 IFNRGPIGCGIASNDYLRYNYTGGIYVNTTEVDYHNHAISVVGWGV--ENGTKYWIVRNS 248 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP-DRW 304 WG WGE G+ +LV + +L IE DC + VP D W Sbjct: 249 WGSYWGEKGYFRLVR--------GINSLNIESDCAWAVPKDTW 283 [61][TOP] >UniRef100_P92005 Protein M04G12.2, confirmed by transcript evidence n=1 Tax=Caenorhabditis elegans RepID=P92005_CAEEL Length = 467 Score = 110 bits (274), Expect = 9e-23 Identities = 53/100 (53%), Positives = 68/100 (68%), Gaps = 1/100 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLE-TYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 I K GPI+C I AT + E Y GVY+E KS+ NH++S+ GWGVD E VEYW+ RNS Sbjct: 364 IKKGGPIACAIGATKKFEYEYVKGVYSE-KSDLESNHIISLTGWGVD-ENGVEYWIARNS 421 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPD 298 WGE WGE G+ ++VTS + DG G+ YN+ IE DC + D Sbjct: 422 WGEAWGELGWFRVVTSKFKDGQGDQYNMGIERDCYYADVD 461 [62][TOP] >UniRef100_C5LNV7 Cathepsin z, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LNV7_9ALVE Length = 846 Score = 110 bits (274), Expect = 9e-23 Identities = 47/100 (47%), Positives = 68/100 (68%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+KRGP+SCG+DAT Q++ YTGGV+ + K+ P INH V +VGWG + T+ EYW++RNSW Sbjct: 705 IWKRGPVSCGVDATKQMDDYTGGVFYQNKAEPKINHEVGLVGWGREEGTNDEYWIMRNSW 764 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDR 301 G WGE+GF+++ NL I+ DC++ P + Sbjct: 765 GTFWGENGFMRI----------KFGNLKIDSDCSWVEPSK 794 [63][TOP] >UniRef100_C5KCV4 Cathepsin Z, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KCV4_9ALVE Length = 394 Score = 109 bits 
(272), Expect = 2e-22 Identities = 47/98 (47%), Positives = 67/98 (68%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+KRGP+SCG+DAT Q++ YTGGV+ + K+ P INH V +VGWG + T+ EYW++RNSW Sbjct: 253 IWKRGPVSCGVDATKQMDDYTGGVFYQHKAEPKINHEVGLVGWGREEGTNDEYWIMRNSW 312 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WGE+GF+++ NL I+ DC++ P Sbjct: 313 GTFWGENGFMRI----------KFGNLKIDSDCSWVEP 340 [64][TOP] >UniRef100_A8X2Y6 C. briggsae CBR-CPZ-2 protein n=1 Tax=Caenorhabditis briggsae AF16 RepID=A8X2Y6_CAEBR Length = 479 Score = 108 bits (271), Expect = 2e-22 Identities = 53/100 (53%), Positives = 68/100 (68%), Gaps = 1/100 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLE-TYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 I K GPI+C I AT + E Y GVY+E KS+ NH++S+ GWGVD E VEYW+ RNS Sbjct: 376 IKKGGPIACAIGATKKFEYEYVKGVYSE-KSDLESNHIISLTGWGVD-ENGVEYWIARNS 433 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPD 298 WGE WGE G+ ++VTS + +G G+ YN+ IE DC F D Sbjct: 434 WGEAWGELGWFRVVTSKFQNGEGDHYNMGIERDCYFADVD 473 [65][TOP] >UniRef100_Q54R55 Cathepsin Z n=1 Tax=Dictyostelium discoideum RepID=Q54R55_DICDI Length = 296 Score = 106 bits (264), Expect = 1e-21 Identities = 50/99 (50%), Positives = 66/99 (66%), Gaps = 1/99 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPI+C IDAT +LE YT G++ E K +P NH++SV+GWGV T YW++RNSW Sbjct: 202 IYARGPIACSIDATSKLEAYTSGIFKEFKLDPLPNHIISVIGWGVQDST--PYWIVRNSW 259 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMY-NLAIEDDCNFGVP 295 G +GE GF +V G+++ NL IE DCN+ VP Sbjct: 260 GSYYGEGGFFNIV-------QGSLFENLGIELDCNWAVP 291 [66][TOP] >UniRef100_UPI00006D00EE Papain family cysteine protease containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006D00EE Length = 591 Score = 105 bits (263), Expect = 2e-21 Identities = 48/98 (48%), Positives = 67/98 (68%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ RGPI CGI+AT +LE Y+GG++ + S+NH V+VVGWGVD T VEYW+ RNSW Sbjct: 490 
IFARGPIGCGIEATLKLENYSGGIFEQNLLFTSLNHEVAVVGWGVDEATGVEYWIARNSW 549 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WGE+G+ ++ + + NG IE +C++GVP Sbjct: 550 GSYWGENGYFRI--RMHKNNNG------IEKECDWGVP 579 Score = 90.1 bits (222), Expect = 1e-16 Identities = 44/102 (43%), Positives = 62/102 (60%), Gaps = 1/102 (0%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY+RGPI+CGI D L YTGG++ +R + I H +SVVG+G + +YW++RNSW Sbjct: 198 IYQRGPITCGIAVPDALLNYTGGIFYDRTGDLEIEHDISVVGYGT-LKNGTKYWMVRNSW 256 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP-DRW 304 G WGE+GF +++ + NL IE C + VP D W Sbjct: 257 GTYWGENGFFRIIR--------GVNNLNIESACAWAVPRDTW 290 [67][TOP] >UniRef100_UPI00006CBB5F Papain family cysteine protease containing protein n=1 Tax=Tetrahymena thermophila RepID=UPI00006CBB5F Length = 1367 Score = 103 bits (257), Expect = 9e-21 Identities = 55/100 (55%), Positives = 65/100 (65%), Gaps = 1/100 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLET-YTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 IY RGPISC IDATD LE YTGG+Y+E+ P NH VSVVGWG E + EYW++RNS Sbjct: 1274 IYSRGPISCTIDATDNLENNYTGGIYSEKVKLPIPNHYVSVVGWGQTLEGE-EYWIVRNS 1332 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPD 298 WG WGE GF KL + D NL +E C++GV D Sbjct: 1333 WGTYWGEEGFFKL--KMHKD------NLGLEFGCSWGVID 1364 Score = 85.1 bits (209), Expect = 3e-15 Identities = 47/138 (34%), Positives = 75/138 (54%), Gaps = 7/138 (5%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ GPISC I++T+ YTGG+ S I H +S+VGWG D E +YW+ RNS Sbjct: 940 IFNHGPISCVINSTEDFRNYTGGILNPPDSPVQITHSLSIVGWGED-EKQTKYWIARNSL 998 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFG-VPDRWVP--ADDLGFGRPDDM-- 346 G WGE+GF++++ L IE DC++G + D W ++ F + + Sbjct: 999 GTFWGENGFIRIIR--------GKNALKIESDCSYGRIRDTWSQQIRNNTSFSKSNSSYT 1050 Query: 347 --SQQQPKLDADFIAKAK 394 +QQ+ K++ D +K++ Sbjct: 1051 SNNQQEHKINLDINSKSE 1068 [68][TOP] >UniRef100_C1N8M6 Papain family cysteine protease n=1 Tax=Micromonas pusilla CCMP1545 
RepID=C1N8M6_9CHLO Length = 553 Score = 102 bits (254), Expect = 2e-20 Identities = 50/99 (50%), Positives = 61/99 (61%), Gaps = 1/99 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSI-NHVVSVVGWGVDPETDVEYWVIRNS 178 I RGPISCGI TD E Y GG+Y+ERK P NH +S+VG+GVD ++ EYW+ RNS Sbjct: 134 IATRGPISCGIHVTDGFEAYAGGIYSERKWTPYFPNHELSLVGYGVDDDSGEEYWIGRNS 193 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WG WGE GF ++ NL IE DC +GVP Sbjct: 194 WGTYWGEGGFFRIKMHG--------QNLGIETDCTWGVP 224 Score = 86.7 bits (213), Expect = 1e-15 Identities = 43/113 (38%), Positives = 64/113 (56%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I RGPI+CG+ T + E Y GGV+ + +H++S+ G+G E +YW+ RNSW Sbjct: 449 IATRGPIACGLCVTPEFEAYAGGVFEDTTGCTQEDHIISIAGYGT-TEDGTKYWIGRNSW 507 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRWVPADDLGFGRPD 340 G WGESG+ K++ + NL +ED+C++ VP VP + G G D Sbjct: 508 GTYWGESGWFKIIR--------GVDNLGVEDNCDWAVP--IVPETNQGIGGRD 550 [69][TOP] >UniRef100_C5LAI7 Cathepsin z, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LAI7_9ALVE Length = 1140 Score = 99.8 bits (247), Expect = 1e-19 Identities = 49/98 (50%), Positives = 61/98 (62%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ GPISCG+ AT++ E YTGGVY E S NH +SV GWGV E +EYW+ RNSW Sbjct: 1036 IFTNGPISCGVFATERFEAYTGGVYEEEVSTVGTNHEISVAGWGVS-EDGIEYWIGRNSW 1094 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WGE G+ ++ D N NL IE DC +G+P Sbjct: 1095 GTYWGEDGWFRMKM----DEN----NLNIETDCYWGIP 1124 Score = 63.5 bits (153), Expect = 1e-08 Identities = 37/98 (37%), Positives = 50/98 (51%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I RGPI+C I A ++ G + P H ++VVG+G PE + YW+ RNSW Sbjct: 745 IADRGPIACSI-AVRRVALVCGQYAVAGATEPE--HEIAVVGYGTTPE-GIPYWIGRNSW 800 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WG +GF ++V NL IE DC + VP Sbjct: 801 GHYWGHNGFFRIVR--------GKNNLGIEGDCAWAVP 830 [70][TOP] >UniRef100_A9VD33 
Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VD33_MONBE Length = 624 Score = 97.8 bits (242), Expect = 5e-19 Identities = 50/98 (51%), Positives = 62/98 (63%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGPI GIDATD+LE YT G+++E K P NH +S+VGWGV+ T EYWV+RNSW Sbjct: 201 IYARGPIVAGIDATDKLEAYTHGIFSEEKILPVPNHEISIVGWGVEDGT--EYWVVRNSW 258 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G +GE GF ++ N + NLAI F VP Sbjct: 259 GTYFGEEGFFRI--------NMHENNLAINSQPAFAVP 288 Score = 72.4 bits (176), Expect = 2e-11 Identities = 37/104 (35%), Positives = 58/104 (55%), Gaps = 1/104 (0%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTG-GVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 IY RGPI+ I LET+ G G++ + + S++H + + GWGV E +V YW+IRNS Sbjct: 521 IYARGPIAGTIAVPPALETWNGQGIFNDTTGDVSLDHEIEIAGWGV--ENNVPYWIIRNS 578 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRWVP 310 WG W ++ + L+ NL +E +C++ V D +P Sbjct: 579 WGTYWADTNWFYLIRGT--------NNLGVEANCDWAVWDGKMP 614 [71][TOP] >UniRef100_Q8IT40 Cathepsin Z (Fragment) n=1 Tax=Theromyzon tessulatum RepID=Q8IT40_THETS Length = 106 Score = 95.1 bits (235), Expect = 3e-18 Identities = 40/65 (61%), Positives = 48/65 (73%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ GPISCGI AT++ + Y+GGVY E INH++SV GWGVDP+T VEYWV RNSW Sbjct: 42 IFTSGPISCGIMATEKFDQYSGGVYLEFHEQSFINHIISVAGWGVDPQTGVEYWVGRNSW 101 Query: 182 GEPWG 196 G WG Sbjct: 102 GTTWG 106 [72][TOP] >UniRef100_A0DIY3 Chromosome undetermined scaffold_52, whole genome shotgun sequence n=1 Tax=Paramecium tetraurelia RepID=A0DIY3_PARTE Length = 512 Score = 95.1 bits (235), Expect = 3e-18 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 1/99 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSI-NHVVSVVGWGVDPETDVEYWVIRNS 178 I+ RGPI CG+ AT +L+ Y GG +K+N +I NH VSVVGWGV E VEYW++RNS Sbjct: 422 IFNRGPIVCGVYATQELDDYEGGYIFSQKTNKTILNHYVSVVGWGV--EDGVEYWIVRNS 479 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 
WG WG+ G+ K+ + D NL +E C++GVP Sbjct: 480 WGSYWGDMGYAKM--KMHSD------NLLLEHYCSWGVP 510 [73][TOP] >UniRef100_C1E7G2 Cysteine endopeptidase n=1 Tax=Micromonas sp. RCC299 RepID=C1E7G2_9CHLO Length = 670 Score = 94.7 bits (234), Expect = 4e-18 Identities = 47/96 (48%), Positives = 60/96 (62%), Gaps = 1/96 (1%) Frame = +2 Query: 11 RGPISCGIDATDQLET-YTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 RGPI+CGI TD+ + Y GG+YAE +NH ++VVG+GVD + EYW+ RNSWG Sbjct: 255 RGPIACGIHVTDKFYSDYKGGIYAESHLLNFMNHELAVVGYGVDEASGEEYWIGRNSWGT 314 Query: 188 PWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WGESGF ++ + NL IE DC FGVP Sbjct: 315 YWGESGFFRI--------KMHHQNLGIESDCTFGVP 342 Score = 79.7 bits (195), Expect = 1e-13 Identities = 38/98 (38%), Positives = 57/98 (58%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ RGPI+CG+ T++ E Y GG++ + +H +S+ G+G D E + +YWV RNSW Sbjct: 569 IFARGPIACGLCVTEEFEAYKGGIFTDATGCKDQDHEISIAGFGEDEEGN-KYWVGRNSW 627 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WGE G+ +L + L IED C++ VP Sbjct: 628 GTFWGEDGWFRL--------QRGVNALGIEDACDWAVP 657 [74][TOP] >UniRef100_C5KWJ6 Cathepsin z, putative (Fragment) n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KWJ6_9ALVE Length = 658 Score = 91.7 bits (226), Expect = 3e-17 Identities = 36/61 (59%), Positives = 49/61 (80%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+KRGP+SCG+DAT Q++ YTGGV+ + K+ P INH V +VGWG + T+ EYW++RNSW Sbjct: 576 IWKRGPVSCGVDATKQMDDYTGGVFYQNKAEPKINHEVGLVGWGREEGTNDEYWIMRNSW 635 Query: 182 G 184 G Sbjct: 636 G 636 [75][TOP] >UniRef100_A4RRS0 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4RRS0_OSTLU Length = 316 Score = 88.2 bits (217), Expect = 4e-16 Identities = 45/103 (43%), Positives = 61/103 (59%), Gaps = 2/103 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPS--INHVVSVVGWGVDPETDVEYWVIRN 175 I+ RGP+S GIDA D L Y GG+Y K P INH+VS+VGWG + +YWV+RN Sbjct: 189 
IFARGPVSAGIDA-DGLRGYVGGIY---KDTPDFEINHIVSIVGWGT-ADDGTKYWVVRN 243 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRW 304 SWG+ WGE GF +++ + +L IED+ + P W Sbjct: 244 SWGQYWGEMGFFRIIR--------GVNSLGIEDEVAWATPGSW 278 [76][TOP] >UniRef100_C5LYL7 Putative uncharacterized protein n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5LYL7_9ALVE Length = 965 Score = 88.2 bits (217), Expect = 4e-16 Identities = 41/99 (41%), Positives = 57/99 (57%), Gaps = 1/99 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLET-YTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 I RGP+ C + + Y GGV+ + K N INH VS+VGWG DPET+ EYW++RNS Sbjct: 823 IKARGPVVCHMHVDEAFRANYEGGVFYQDKPNAEINHEVSLVGWGKDPETNEEYWIMRNS 882 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WG WGE G++++ L I+ C++G P Sbjct: 883 WGSFWGEHGYMRI----------KFGTLLIDSQCSWGEP 911 [77][TOP] >UniRef100_C1FFA0 Cysteine endopeptidase n=1 Tax=Micromonas sp. RCC299 RepID=C1FFA0_9CHLO Length = 388 Score = 87.4 bits (215), Expect = 6e-16 Identities = 42/102 (41%), Positives = 62/102 (60%), Gaps = 1/102 (0%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y RGP++ GIDA + L+ YTGG+ + INH+V++VGWG + + +YW++RNS Sbjct: 256 VYARGPVAAGIDA-NLLDEYTGGILDQPADYEYEINHIVAIVGWG-ETKKGEKYWIVRNS 313 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRW 304 WGE WGE GF ++V L IED+C++ P W Sbjct: 314 WGEYWGEMGFFRIVR--------GKKALGIEDECSWATPASW 347 [78][TOP] >UniRef100_Q01FU9 Cathepsin Z (ISS) n=1 Tax=Ostreococcus tauri RepID=Q01FU9_OSTTA Length = 387 Score = 86.3 bits (212), Expect = 1e-15 Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 2/103 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPS--INHVVSVVGWGVDPETDVEYWVIRN 175 IY RGP++ GIDA D L Y GG+Y K PS INH+VS+VGWG + +YW++RN Sbjct: 260 IYARGPVAAGIDA-DGLRGYVGGIY---KDTPSFEINHIVSIVGWGTAKD-GTKYWIVRN 314 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRW 304 SWG+ WGE G+ +++ + L +ED+ + P W Sbjct: 315 SWGQYWGEMGYFRIIR--------GVNALGLEDEVAWATPGSW 349 [79][TOP] >UniRef100_Q86GK0 Cathepsin 
Z-like cysteine proteinase n=1 Tax=Myxobolus cerebralis RepID=Q86GK0_9CNID Length = 297 Score = 86.3 bits (212), Expect = 1e-15 Identities = 45/99 (45%), Positives = 60/99 (60%), Gaps = 4/99 (4%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLE-TYTGGVYAERKSNPSINHVVSVVGWGVDPETDVE---YWVI 169 ++ RGP+SC + A++ YTGGVY E SN NH+VS++GWG D + + YW+I Sbjct: 193 MFARGPLSCSMYASENFVFNYTGGVYVEN-SNSLPNHLVSILGWGEDVDEHDKVRPYWII 251 Query: 170 RNSWGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNF 286 RNSWG WGE GF ++ S+Y DG Y L IE F Sbjct: 252 RNSWGTNWGEKGFFRIPRSSYKDGR---YTLNIEKGLYF 287 [80][TOP] >UniRef100_B7FSC8 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FSC8_PHATR Length = 237 Score = 83.6 bits (205), Expect = 9e-15 Identities = 36/74 (48%), Positives = 52/74 (70%), Gaps = 1/74 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKS-NPSINHVVSVVGWGVDPETDVEYWVIRNS 178 IY RGP++ G++A + + Y GGV K N +NHVVS+VGW +D ET ++W++RNS Sbjct: 148 IYARGPVATGVNA-EPIVNYAGGVVNNTKIWNMMVNHVVSIVGWDMDEETGQQHWIVRNS 206 Query: 179 WGEPWGESGFLKLV 220 WG+ WGE GF ++V Sbjct: 207 WGQYWGEMGFFRIV 220 [81][TOP] >UniRef100_Q54VR1 Peptidase C1A family protein n=1 Tax=Dictyostelium discoideum RepID=Q54VR1_DICDI Length = 291 Score = 82.8 bits (203), Expect = 2e-14 Identities = 41/99 (41%), Positives = 59/99 (59%), Gaps = 1/99 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAER-KSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 I+ RGPI+CG++ TD E+YT GV+ S INH +S++GWG E V+YW+ RNS Sbjct: 199 IFARGPIACGMEVTDAFESYTSGVFTSSVGSTGEINHEISIIGWGT--ENGVDYWIGRNS 256 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WG +GE GF ++ + L+IE C++ VP Sbjct: 257 WGTYFGELGFFRI--------QRGIDLLSIESACDWAVP 287 [82][TOP] >UniRef100_B7FSD0 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FSD0_PHATR Length = 256 Score = 81.6 bits (200), Expect = 4e-14 Identities = 33/72 (45%), Positives = 49/72 (68%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 +Y RGP++ I+A +E Y GGV+ E + 
NH+VS+ GWG D +T YW++RNSW Sbjct: 168 LYVRGPVAATINAEPIVE-YAGGVFGEDGHSQRTNHIVSITGWGTDEDTGKLYWIVRNSW 226 Query: 182 GEPWGESGFLKL 217 G+ WGE GF+++ Sbjct: 227 GQYWGEMGFMRI 238 [83][TOP] >UniRef100_B8LDQ9 Predicted protein (Fragment) n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8LDQ9_THAPS Length = 262 Score = 81.3 bits (199), Expect = 5e-14 Identities = 35/72 (48%), Positives = 49/72 (68%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY RGP++ I+A D L Y GG+ + + + NH+VS+VG+G D + +YW+IRNSW Sbjct: 174 IYARGPVATTINA-DPLRDYEGGILDDETAGTNTNHIVSIVGYGKDETSGKDYWIIRNSW 232 Query: 182 GEPWGESGFLKL 217 GE WGE GF K+ Sbjct: 233 GEYWGEMGFAKI 244 [84][TOP] >UniRef100_Q5YER6 Cathepsin Z n=1 Tax=Bigelowiella natans RepID=Q5YER6_BIGNA Length = 325 Score = 79.3 bits (194), Expect = 2e-13 Identities = 42/93 (45%), Positives = 55/93 (59%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I RGPISCGIDA L YT G+ R ++HV+SVVGWG D +T YW++RNSW Sbjct: 220 IAARGPISCGIDAAPILN-YTSGIADMR--GEMVDHVISVVGWGKD-DTKGSYWIVRNSW 275 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDC 280 GE WGE G++++ A L +E+ C Sbjct: 276 GEYWGEMGYIRVAFGA----------LKVEEQC 298 [85][TOP] >UniRef100_UPI000186E648 Cathepsin L precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E648 Length = 346 Score = 78.2 bits (191), Expect = 4e-13 Identities = 34/74 (45%), Positives = 49/74 (66%), Gaps = 2/74 (2%) Frame = +2 Query: 14 GPISCGIDATDQ-LETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S +DA+ + Y GGVY + K S+ +NH V VG+G DP+T +YW++RNSWG Sbjct: 258 GPVSVSMDASSPAFKKYKGGVYTDDKCSSMKLNHAVVAVGYGTDPDTKQDYWLVRNSWGT 317 Query: 188 PWGESGFLKLVTSA 229 WGE G+ K+ +A Sbjct: 318 AWGERGYFKIARNA 331 [86][TOP] >UniRef100_UPI000065E4AD UPI000065E4AD related cluster n=1 Tax=Takifugu rubripes RepID=UPI000065E4AD Length = 247 Score = 78.2 bits (191), Expect = 4e-13 Identities = 44/93 (47%), Positives = 59/93 (63%) Frame = +2 Query: 11 
RGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 R I CG+ T+ L+ TGG+ +E +P IN SV GV T EYW++ NSWGEP Sbjct: 162 RNLIHCGVITTENLDDCTGGLSSEYLESPEIN---SVAARGVANGT--EYWIV-NSWGEP 215 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFG 289 WGE G L++VTSAY +G+ +LA+E DC +G Sbjct: 216 WGERG-LQVVTSAYKGESGSKNSLALEKDCVYG 247 [87][TOP] >UniRef100_Q4DV75 Cysteine protease, putative n=1 Tax=Trypanosoma cruzi RepID=Q4DV75_TRYCR Length = 434 Score = 77.4 bits (189), Expect = 7e-13 Identities = 33/82 (40%), Positives = 58/82 (70%), Gaps = 3/82 (3%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVY---AERKSNPSINHVVSVVGWGVDPETDVEYWVIR 172 + ++GP++ + A+D + YTGGV+ + N +I+H V +VG+G D +T+ +YWV+R Sbjct: 262 LVQKGPLAVSVAASDWM-FYTGGVFDGCGKDGENITISHAVQLVGYGTDNKTNQDYWVVR 320 Query: 173 NSWGEPWGESGFLKLVTSAYDD 238 NSWGE WGE+GF++L+ +++ Sbjct: 321 NSWGEGWGENGFIRLLRKKHNE 342 [88][TOP] >UniRef100_A1Z0R3 Cysteine protease n=1 Tax=Theileria annulata RepID=A1Z0R3_THEAN Length = 441 Score = 77.0 bits (188), Expect = 9e-13 Identities = 34/67 (50%), Positives = 46/67 (68%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 P GI T +L+ Y+GG++ K +NH V +VG GVD ET + YW+I+NSWGE WG Sbjct: 352 PTVVGIAVTKELKLYSGGIFTG-KCGGELNHAVLLVGEGVDHETGMRYWIIKNSWGEDWG 410 Query: 197 ESGFLKL 217 E+GFL+L Sbjct: 411 ENGFLRL 417 [89][TOP] >UniRef100_P25781 Cysteine proteinase n=1 Tax=Theileria annulata RepID=CYSP_THEAN Length = 441 Score = 77.0 bits (188), Expect = 9e-13 Identities = 34/67 (50%), Positives = 46/67 (68%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 P GI T +L+ Y+GG++ K +NH V +VG GVD ET + YW+I+NSWGE WG Sbjct: 352 PTVVGIAVTKELKLYSGGIFTG-KCGGELNHAVLLVGEGVDHETGMRYWIIKNSWGEDWG 410 Query: 197 ESGFLKL 217 E+GFL+L Sbjct: 411 ENGFLRL 417 [90][TOP] >UniRef100_UPI000186E71E Cathepsin L precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186E71E Length = 345 Score = 76.6 bits (187), Expect = 1e-12 Identities = 35/76 (46%), Positives = 54/76 
(71%), Gaps = 4/76 (5%) Frame = +2 Query: 14 GPISCGIDATDQ-LETYTGGVYAER--KSNP-SINHVVSVVGWGVDPETDVEYWVIRNSW 181 GP+S IDA+ + + Y+ GVY E K+ P S++H V VVG+G D ET +YW+++NSW Sbjct: 255 GPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSW 314 Query: 182 GEPWGESGFLKLVTSA 229 G+ WGE+G++K+ +A Sbjct: 315 GDSWGENGYIKMARNA 330 [91][TOP] >UniRef100_B7FS80 Predicted protein (Fragment) n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FS80_PHATR Length = 259 Score = 76.3 bits (186), Expect = 1e-12 Identities = 34/74 (45%), Positives = 47/74 (63%), Gaps = 2/74 (2%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPS--INHVVSVVGWGVDPETDVEYWVIRN 175 IY RGP++ IDA + Y GGV + S NH VS+VGWG D + + +YW++RN Sbjct: 169 IYLRGPVTASIDA-GPIHKYPGGVLWDNPKYHSDKTNHAVSIVGWGYDYDEEKQYWIVRN 227 Query: 176 SWGEPWGESGFLKL 217 SWG+ WGE GF ++ Sbjct: 228 SWGQYWGEMGFFRI 241 [92][TOP] >UniRef100_Q4N068 Cysteine proteinase, putative n=1 Tax=Theileria parva RepID=Q4N068_THEPA Length = 441 Score = 76.3 bits (186), Expect = 1e-12 Identities = 35/67 (52%), Positives = 45/67 (67%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 P I AT +L+ Y GGVY K ++NH V +VG G D ET + YW+I+NSWGE WG Sbjct: 352 PTVVAIAATRELKLYQGGVYTG-KCGDALNHAVLLVGEGYDEETGLRYWIIKNSWGEDWG 410 Query: 197 ESGFLKL 217 E+GFL+L Sbjct: 411 ENGFLRL 417 [93][TOP] >UniRef100_Q156I2 Phytophthora-inhibited protease 1 n=1 Tax=Solanum lycopersicum RepID=Q156I2_SOLLC Length = 345 Score = 75.5 bits (184), Expect = 3e-12 Identities = 30/75 (40%), Positives = 45/75 (60%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 PIS GI A D+ Y G+Y + N +NH V+V+G+G E +YW+++NSWG WG Sbjct: 258 PISVGIAANDEFHMYGSGIY-DGSCNSRLNHAVTVIGYGTSEEDGTKYWIVKNSWGSDWG 316 Query: 197 ESGFLKLVTSAYDDG 241 E G++++ DG Sbjct: 317 EEGYMRIARDVGVDG 331 [94][TOP] >UniRef100_A8JGQ3 Papain-type cysteine protease n=1 Tax=Chlamydomonas reinhardtii RepID=A8JGQ3_CHLRE Length = 382 Score = 75.5 bits (184), Expect = 3e-12 Identities = 
41/112 (36%), Positives = 58/112 (51%), Gaps = 1/112 (0%) Frame = +2 Query: 2 IYKRGPISCG-IDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 IY RGPI+CG + D Y GG+Y + + ++H V VVGWG E +YW++RNS Sbjct: 224 IYHRGPITCGQVCPEDFTWHYNGGIYKDTSGDTELDHDVEVVGWG--EEDGEKYWIVRNS 281 Query: 179 WGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRWVPADDLGFGR 334 WG WGE GF ++ G+ DC +G P+ W D+ G+ Sbjct: 282 WGTYWGERGFFRV-------RRGDNSLQLESGDCWYGEPE-WQMEQDVRTGK 325 [95][TOP] >UniRef100_Q6A1H9 Cathepsin X/O n=1 Tax=Suberites domuncula RepID=Q6A1H9_SUBDO Length = 298 Score = 75.5 bits (184), Expect = 3e-12 Identities = 34/73 (46%), Positives = 47/73 (64%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDA-TDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 ++ RGPI+C + A + E YTGGV + S HVV+V GWG D +T ++YW+ RNS Sbjct: 205 VFARGPIACSVYAHSAAFEEYTGGVIHDPVQYNSTTHVVAVTGWGTDEKTGMKYWIGRNS 264 Query: 179 WGEPWGESGFLKL 217 +G WGE G+ KL Sbjct: 265 FGTAWGEDGWFKL 277 [96][TOP] >UniRef100_Q7YXL2 Cathepsin-L-like midgut cysteine proteinase n=1 Tax=Tenebrio molitor RepID=Q7YXL2_TENMO Length = 330 Score = 75.1 bits (183), Expect = 3e-12 Identities = 34/70 (48%), Positives = 50/70 (71%), Gaps = 1/70 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP++ IDATD+L+ Y+GG++ ++ N S +NH V VVG+G D D YW+++NSWG Sbjct: 245 GPVAVAIDATDELQFYSGGLFYDQTCNQSDLNHGVLVVGYGSDNGQD--YWILKNSWGSG 302 Query: 191 WGESGFLKLV 220 WGESG+ + V Sbjct: 303 WGESGYWRQV 312 [97][TOP] >UniRef100_A1XG95 Putative cathepsin L-like proteinase n=1 Tax=Tenebrio molitor RepID=A1XG95_TENMO Length = 328 Score = 75.1 bits (183), Expect = 3e-12 Identities = 34/70 (48%), Positives = 50/70 (71%), Gaps = 1/70 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP++ IDATD+L+ Y+GG++ ++ N S +NH V VVG+G D D YW+++NSWG Sbjct: 243 GPVAVAIDATDELQFYSGGLFYDQTCNQSDLNHGVLVVGYGSDNGQD--YWILKNSWGSG 300 Query: 191 WGESGFLKLV 220 WGESG+ + V Sbjct: 301 WGESGYWRQV 310 [98][TOP] >UniRef100_UPI0001926446 
PREDICTED: similar to cathepsin C, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926446 Length = 374 Score = 74.7 bits (182), Expect = 4e-12 Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 10/82 (12%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAE--------RKSNPSI--NHVVSVVGWGVDPETD 151 + + GP+S GI+ T + Y GG++ + K NP NH V VVG+GVD + Sbjct: 271 LIRYGPLSVGINVTSEFLHYKGGIFYQPGKHNGIGSKFNPFYLTNHAVLVVGYGVDHDNG 330 Query: 152 VEYWVIRNSWGEPWGESGFLKL 217 V+YW+++NSWGE WGE GF ++ Sbjct: 331 VKYWIVKNSWGEGWGEGGFFRI 352 [99][TOP] >UniRef100_B8C725 Probable papain cysteine protease (Fragment) n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8C725_THAPS Length = 244 Score = 74.3 bits (181), Expect = 6e-12 Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 5/77 (6%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSI-----NHVVSVVGWGVDPETDVEYWV 166 IY RGPI I+A + L YTGG+ +P++ NH VS+VGWG D E ++W+ Sbjct: 170 IYARGPIKAAINA-NPLRNYTGGILGS-DDDPAMLDTHHNHGVSIVGWGYDEERKTQHWI 227 Query: 167 IRNSWGEPWGESGFLKL 217 +RNSWG WGE GF ++ Sbjct: 228 VRNSWGVYWGEMGFFRI 244 [100][TOP] >UniRef100_B7FS79 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FS79_PHATR Length = 353 Score = 74.3 bits (181), Expect = 6e-12 Identities = 34/74 (45%), Positives = 45/74 (60%), Gaps = 2/74 (2%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVY--AERKSNPSINHVVSVVGWGVDPETDVEYWVIRN 175 IY RGP+ + A + Y+GGV A NH VS++GWG D E D +YW++RN Sbjct: 223 IYARGPVKASVYAKP-IYNYSGGVLWDAPEYQADKHNHGVSIIGWGYDDEMDRQYWIVRN 281 Query: 176 SWGEPWGESGFLKL 217 SWG+ WGE GF +L Sbjct: 282 SWGQYWGEMGFFRL 295 [101][TOP] >UniRef100_Q4DC63 Cysteine proteinase, putative n=1 Tax=Trypanosoma cruzi RepID=Q4DC63_TRYCR Length = 392 Score = 73.9 bits (180), Expect = 7e-12 Identities = 31/73 (42%), Positives = 50/73 (68%), Gaps = 2/73 (2%) Frame = +2 Query: 8 KRGPISCGIDATDQLETYTGGVY--AERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 K GP+S +DAT Y GG++ + N +INHVV +VG+G D + +++YW++RNSW Sbjct: 277 
KNGPLSVNVDAT-YWSAYAGGIFNGCDYSKNITINHVVQLVGYGHDNKLNLDYWILRNSW 335 Query: 182 GEPWGESGFLKLV 220 WGE+G+++L+ Sbjct: 336 SPSWGENGYMRLL 348 [102][TOP] >UniRef100_Q4N640 Cysteine protease, putative n=1 Tax=Theileria parva RepID=Q4N640_THEPA Length = 612 Score = 73.2 bits (178), Expect = 1e-11 Identities = 34/84 (40%), Positives = 50/84 (59%), Gaps = 1/84 (1%) Frame = +2 Query: 8 KRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 K GP I + Y G++ + + + NH V VVG G DP+ V YW++RNSWGE Sbjct: 392 KVGPFQLSIHVAKDMSFYKEGIF-DGECSKKPNHSVVVVGHGYDPDLKVHYWIVRNSWGE 450 Query: 188 PWGESGFLKLVTSAYD-DGNGNMY 256 WGESG+++L+ + Y+ +G G Y Sbjct: 451 DWGESGYMRLLNANYNYNGIGAYY 474 [103][TOP] >UniRef100_A1XG94 Putative cathepsin L-like proteinase n=1 Tax=Tenebrio molitor RepID=A1XG94_TENMO Length = 328 Score = 73.2 bits (178), Expect = 1e-11 Identities = 32/66 (48%), Positives = 48/66 (72%), Gaps = 1/66 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP++ IDATD+L+ Y+GG++ ++ N S +NH V VVG+G D D YW+++NSWG Sbjct: 243 GPVAVAIDATDELQFYSGGLFYDQTCNQSDLNHGVFVVGYGSDNGQD--YWILKNSWGSG 300 Query: 191 WGESGF 208 WGE+G+ Sbjct: 301 WGENGY 306 [104][TOP] >UniRef100_C4QB46 SmCL2-like peptidase (C01 family) n=1 Tax=Schistosoma mansoni RepID=C4QB46_SCHMA Length = 342 Score = 72.8 bits (177), Expect = 2e-11 Identities = 30/73 (41%), Positives = 46/73 (63%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GP+S GI+ Q Y G+Y + S+ +NH V +VG+G E V+YW I+NS Sbjct: 253 LYEHGPVSAGINVEQQFMRYKSGIYQSQSCSSTEVNHAVLIVGYG--EENGVQYWTIKNS 310 Query: 179 WGEPWGESGFLKL 217 WG WGE G++++ Sbjct: 311 WGTSWGEEGYVRM 323 [105][TOP] >UniRef100_B4KNQ5 GI20850 n=1 Tax=Drosophila mojavensis RepID=B4KNQ5_DROMO Length = 329 Score = 72.8 bits (177), Expect = 2e-11 Identities = 29/72 (40%), Positives = 48/72 (66%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 + +GPIS GI A++ Y GV +R+ N + NH V VVG+G DP+ 
++W+++NSW Sbjct: 240 VASKGPISVGIHASNNFRNYRDGVLNDRQCNKAANHAVLVVGFGRDPQGG-DFWLVKNSW 298 Query: 182 GEPWGESGFLKL 217 G WG+ G++++ Sbjct: 299 GASWGDGGYIRM 310 [106][TOP] >UniRef100_O18455 Cathepsin L-like cysteine proteinase n=3 Tax=Heterodera glycines RepID=O18455_HETGL Length = 374 Score = 72.4 bits (176), Expect = 2e-11 Identities = 32/71 (45%), Positives = 51/71 (71%), Gaps = 2/71 (2%) Frame = +2 Query: 11 RGPISCGIDATDQ-LETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRNSWG 184 +GP+S IDA + + YT GVY E++ +P +++H V VVG+G DP T +YW+++NSWG Sbjct: 286 QGPVSVAIDAGHRSFQLYTNGVYFEKECDPENLDHGVLVVGYGTDP-TQGDYWIVKNSWG 344 Query: 185 EPWGESGFLKL 217 WGE G++++ Sbjct: 345 TRWGEQGYIRM 355 [107][TOP] >UniRef100_Q9LP39 Putative cysteine proteinase n=1 Tax=Arabidopsis thaliana RepID=Q9LP39_ARATH Length = 346 Score = 72.0 bits (175), Expect = 3e-11 Identities = 30/70 (42%), Positives = 49/70 (70%), Gaps = 1/70 (1%) Frame = +2 Query: 11 RGPISCGIDATDQ-LETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 R P++ IDA++ Y+GGVY R S+NH V++VG+G PE ++YW+ +NSWG+ Sbjct: 256 RQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPE-GMKYWLAKNSWGK 314 Query: 188 PWGESGFLKL 217 WGE+G++++ Sbjct: 315 TWGENGYIRI 324 [108][TOP] >UniRef100_Q8S335 Cysteine protease n=1 Tax=Solanum pimpinellifolium RepID=Q8S335_SOLPI Length = 344 Score = 72.0 bits (175), Expect = 3e-11 Identities = 31/73 (42%), Positives = 47/73 (64%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 P+S GI A+ L+ Y GG Y + INH V+ +G+G D E +YW+++NSWG WG Sbjct: 257 PVSIGIAASQDLQFYAGGTY-DGNCADQINHAVTAIGYGTDEEGQ-KYWLLKNSWGTSWG 314 Query: 197 ESGFLKLVTSAYD 235 E+GF+K++ + D Sbjct: 315 ENGFMKIIRDSGD 327 [109][TOP] >UniRef100_Q8S334 Cysteine protease n=1 Tax=Solanum pennellii RepID=Q8S334_SOLPN Length = 337 Score = 72.0 bits (175), Expect = 3e-11 Identities = 34/76 (44%), Positives = 48/76 (63%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 P+S GI A+ L+ Y GG Y +N INH V+ +G+G D E +YW+++NSWG WG Sbjct: 
250 PVSIGIAASQDLQFYAGGTYDGSCAN-RINHAVTAIGYGTD-EKGQKYWLLKNSWGTSWG 307 Query: 197 ESGFLKLVTSAYDDGN 244 E GF+K++ D GN Sbjct: 308 EDGFMKIIR---DSGN 320 [110][TOP] >UniRef100_Q650Y2 Putative cysteine proteinase n=1 Tax=Oryza sativa Japonica Group RepID=Q650Y2_ORYSJ Length = 416 Score = 72.0 bits (175), Expect = 3e-11 Identities = 31/68 (45%), Positives = 46/68 (67%), Gaps = 1/68 (1%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETD-VEYWVIRNSWGEPW 193 PIS GIDA+ L+ Y GV+ R +NH V VVG+GV+ D +YW+++NSWG+ W Sbjct: 247 PISVGIDASADLQHYKKGVFTGRCKTAPLNHGVVVVGYGVNTTPDKTKYWIVKNSWGKGW 306 Query: 194 GESGFLKL 217 GE G++++ Sbjct: 307 GEGGYIRM 314 Score = 54.7 bits (130), Expect = 5e-06 Identities = 20/40 (50%), Positives = 31/40 (77%) Frame = +2 Query: 98 SINHVVSVVGWGVDPETDVEYWVIRNSWGEPWGESGFLKL 217 S+NH V+ VG+GV + ++ YW+ RNSWG WGESG++++ Sbjct: 341 SVNHAVTTVGYGVTQD-NINYWIARNSWGPRWGESGYIRM 379 [111][TOP] >UniRef100_B9G543 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=B9G543_ORYSJ Length = 351 Score = 72.0 bits (175), Expect = 3e-11 Identities = 31/68 (45%), Positives = 46/68 (67%), Gaps = 1/68 (1%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETD-VEYWVIRNSWGEPW 193 PIS GIDA+ L+ Y GV+ R +NH V VVG+GV+ D +YW+++NSWG+ W Sbjct: 247 PISVGIDASADLQHYKKGVFTGRCKTAPLNHGVVVVGYGVNTTPDKTKYWIVKNSWGKGW 306 Query: 194 GESGFLKL 217 GE G++++ Sbjct: 307 GEGGYIRM 314 [112][TOP] >UniRef100_B5KVP9 Cysteine protease n=1 Tax=Zea mays RepID=B5KVP9_MAIZE Length = 352 Score = 72.0 bits (175), Expect = 3e-11 Identities = 29/85 (34%), Positives = 55/85 (64%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 P++ I+ L+ Y+GGV++ + +NH ++VVG+G D + ++YW+++NSWG+ WG Sbjct: 266 PVAAAIEMGGSLQFYSGGVFSGQ-CGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWG 324 Query: 197 ESGFLKLVTSAYDDGNGNMYNLAIE 271 E G+L++ D G G + +A++ Sbjct: 325 ERGYLRM---RRDVGRGGLCGIALD 346 [113][TOP] >UniRef100_Q9U938 Necpain n=1 Tax=Necator americanus RepID=Q9U938_NECAM 
Length = 339 Score = 72.0 bits (175), Expect = 3e-11 Identities = 30/78 (38%), Positives = 43/78 (55%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I K GP+ D + + Y G+Y ++ + H V ++GWG D TD YW+I NSW Sbjct: 250 IMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKDNGTD--YWLIANSW 307 Query: 182 GEPWGESGFLKLVTSAYD 235 + WGESGF ++V D Sbjct: 308 SKDWGESGFFRMVRGEND 325 [114][TOP] >UniRef100_Q4D659 Cysteine proteinase, putative n=1 Tax=Trypanosoma cruzi RepID=Q4D659_TRYCR Length = 392 Score = 72.0 bits (175), Expect = 3e-11 Identities = 31/73 (42%), Positives = 49/73 (67%), Gaps = 2/73 (2%) Frame = +2 Query: 8 KRGPISCGIDATDQLETYTGGVY--AERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 K GP+S +DAT Y GG++ N +INHVV +VG+G D + +++YW++RNSW Sbjct: 277 KNGPLSVNVDAT-YWAAYAGGIFNGCGYNKNITINHVVQLVGYGHDNKLNLDYWILRNSW 335 Query: 182 GEPWGESGFLKLV 220 WGE+G+++L+ Sbjct: 336 SPSWGENGYMRLL 348 [115][TOP] >UniRef100_Q1KYN0 Cathepsin B (Fragment) n=1 Tax=Streblomastix strix RepID=Q1KYN0_9EUKA Length = 283 Score = 71.6 bits (174), Expect = 4e-11 Identities = 31/73 (42%), Positives = 43/73 (58%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY+ GP+S G +Y GVY + H V +VGWGV+ E V YW+++NSW Sbjct: 192 IYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVEDE--VPYWLVQNSW 249 Query: 182 GEPWGESGFLKLV 220 G WGE+GF K++ Sbjct: 250 GTDWGENGFFKIL 262 [116][TOP] >UniRef100_A8D0K2 Cathepsin C n=1 Tax=Ixodes ricinus RepID=A8D0K2_IXORI Length = 466 Score = 71.6 bits (174), Expect = 4e-11 Identities = 35/102 (34%), Positives = 56/102 (54%), Gaps = 12/102 (11%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSIN------------HVVSVVGWGVDPE 145 + + GP++ G + + Y GGVY + S+N H V V G+GVD E Sbjct: 363 LVRGGPVAVGFEVYPDFQMYQGGVYRHTGVHRSLNLGSPFDPFELTNHAVLVTGYGVDKE 422 Query: 146 TDVEYWVIRNSWGEPWGESGFLKLVTSAYDDGNGNMYNLAIE 271 T ++YW ++NSWG WGESG+ +++ A + G + +LA+E Sbjct: 423 TGLKYWSVKNSWGPGWGESGYFRILRGADECG---IESLAVE 461 [117][TOP] >UniRef100_A1XG96 Putative cathepsin 
L-like proteinase n=1 Tax=Tenebrio molitor RepID=A1XG96_TENMO Length = 416 Score = 71.6 bits (174), Expect = 4e-11 Identities = 33/70 (47%), Positives = 49/70 (70%), Gaps = 1/70 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP++ IDA D+L+ Y+GG++ ++ N S +NH V VVG+G D D YW+++NSWG Sbjct: 331 GPVAVAIDAPDELQFYSGGLFYDQTCNQSDLNHGVFVVGYGSDNGQD--YWILKNSWGFG 388 Query: 191 WGESGFLKLV 220 WGESG+ + V Sbjct: 389 WGESGYWRQV 398 [118][TOP] >UniRef100_Q4UDG1 Cysteine protease, putative n=1 Tax=Theileria annulata RepID=Q4UDG1_THEAN Length = 487 Score = 70.9 bits (172), Expect = 6e-11 Identities = 30/83 (36%), Positives = 48/83 (57%) Frame = +2 Query: 8 KRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 K GP I + ++ Y G++ + + + NH V VVG G DP+ V YW+++NSWGE Sbjct: 393 KTGPFQVSIHVSKEMSFYKEGIF-DGECSQRENHSVVVVGHGYDPDLKVYYWIVKNSWGE 451 Query: 188 PWGESGFLKLVTSAYDDGNGNMY 256 WGE+G+++L+ + Y Y Sbjct: 452 DWGENGYMRLLNANYKQNGITSY 474 [119][TOP] >UniRef100_Q22DX2 Papain family cysteine protease containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22DX2_TETTH Length = 358 Score = 70.9 bits (172), Expect = 6e-11 Identities = 31/77 (40%), Positives = 46/77 (59%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I + G +S +DAT Y G++ + K P INH V+++GWG D YW++RNSW Sbjct: 277 IMQNGALSIAVDAT-YWANYKSGIFTQ-KEKPQINHAVTLIGWGSD------YWLLRNSW 328 Query: 182 GEPWGESGFLKLVTSAY 232 G WGE G++K+ + Y Sbjct: 329 GSSWGEQGYIKVTNTGY 345 [120][TOP] >UniRef100_Q8S333 Cysteine protease n=1 Tax=Solanum lycopersicum RepID=Q8S333_SOLLC Length = 345 Score = 70.5 bits (171), Expect = 8e-11 Identities = 30/73 (41%), Positives = 47/73 (64%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 P+S GI A+ L+ Y GG Y + INH V+ +G+G D E +YW+++NSWG WG Sbjct: 258 PVSIGIAASQDLQFYAGGTY-DGNCADRINHAVTAIGYGTDEEGQ-KYWLLKNSWGTSWG 315 Query: 197 ESGFLKLVTSAYD 235 E+G++K++ + D Sbjct: 316 ENGYMKIIRDSGD 328 
[121][TOP] >UniRef100_Q7YXL4 Cathepsin-L-like cysteine peptidase 02 n=1 Tax=Tenebrio molitor RepID=Q7YXL4_TENMO Length = 337 Score = 70.5 bits (171), Expect = 8e-11 Identities = 31/70 (44%), Positives = 51/70 (72%), Gaps = 2/70 (2%) Frame = +2 Query: 14 GPISCGIDATDQ-LETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA+ Q + Y+GGVY E + +PS ++H V VVG+G + + +YW+++NSWG+ Sbjct: 250 GPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDD-GTDYWLVKNSWGK 308 Query: 188 PWGESGFLKL 217 WG+ G++K+ Sbjct: 309 SWGDQGYIKM 318 [122][TOP] >UniRef100_B5G4X5 Putative cathepsin L preprotein n=1 Tax=Clonorchis sinensis RepID=B5G4X5_CLOSI Length = 371 Score = 70.5 bits (171), Expect = 8e-11 Identities = 33/71 (46%), Positives = 49/71 (69%), Gaps = 2/71 (2%) Frame = +2 Query: 14 GPISCGIDA-TDQLETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GPIS GI+A Y G+Y++ + NP ++H V VVG+GVD V YW+I+NSWGE Sbjct: 285 GPISVGINAGLPSFMAYESGIYSDHRCNPHDLDHGVLVVGYGVD--NGVPYWLIKNSWGE 342 Query: 188 PWGESGFLKLV 220 WGE+G+++++ Sbjct: 343 DWGENGYVRIL 353 [123][TOP] >UniRef100_UPI00015B62BC PREDICTED: similar to cathepsin L-like protease n=1 Tax=Nasonia vitripennis RepID=UPI00015B62BC Length = 353 Score = 70.1 bits (170), Expect = 1e-10 Identities = 30/70 (42%), Positives = 45/70 (64%), Gaps = 2/70 (2%) Frame = +2 Query: 14 GPISCGIDAT-DQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP S ID + D Y+ GVY + + N ++H V +VG+G D TD ++W+++NSWGE Sbjct: 265 GPFSAAIDGSHDTFRFYSEGVYYQPECNEDDLDHAVLIVGYGTDNRTDQDFWLVKNSWGE 324 Query: 188 PWGESGFLKL 217 WGE G+ K+ Sbjct: 325 TWGEGGYFKV 334 [124][TOP] >UniRef100_UPI0000D57336 PREDICTED: similar to cathepsin-L-like midgut cysteine proteinase n=1 Tax=Tribolium castaneum RepID=UPI0000D57336 Length = 314 Score = 70.1 bits (170), Expect = 1e-10 Identities = 31/72 (43%), Positives = 47/72 (65%), Gaps = 3/72 (4%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSN---PSINHVVSVVGWGVDPETDVEYWVIRNSWG 184 GPI+ I+AT++L+ Y GG+ + K N P +NH V VVG+G E ++W+++NSWG Sbjct: 227 
GPIAATIEATEELQFYKGGILLDEKCNSKVPDLNHGVLVVGYG--SENGGDFWIVKNSWG 284 Query: 185 EPWGESGFLKLV 220 WGE G+ + V Sbjct: 285 SDWGEGGYYRPV 296 [125][TOP] >UniRef100_Q1LUB5 Novel protein similar to vertebrate cathepsin L (CTSL) (Fragment) n=2 Tax=Danio rerio RepID=Q1LUB5_DANRE Length = 334 Score = 70.1 bits (170), Expect = 1e-10 Identities = 32/71 (45%), Positives = 47/71 (66%), Gaps = 2/71 (2%) Frame = +2 Query: 14 GPISCGIDATD-QLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA + Y+ G+Y E NP+ +NH V VVG+G + TD YW+I+NSWG Sbjct: 248 GPVSVAIDADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGSEEGTD--YWIIKNSWGT 305 Query: 188 PWGESGFLKLV 220 WGE G+++++ Sbjct: 306 GWGEGGYMRMI 316 [126][TOP] >UniRef100_Q1LXE8 Novel protein similar to vertebrate cathepsin family (Fragment) n=1 Tax=Danio rerio RepID=Q1LXE8_DANRE Length = 263 Score = 70.1 bits (170), Expect = 1e-10 Identities = 32/71 (45%), Positives = 47/71 (66%), Gaps = 2/71 (2%) Frame = +2 Query: 14 GPISCGIDATD-QLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA + Y+ G+Y E NP+ +NH V VVG+G + TD YW+I+NSWG Sbjct: 177 GPVSVAIDADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGSEEGTD--YWIIKNSWGT 234 Query: 188 PWGESGFLKLV 220 WGE G+++++ Sbjct: 235 GWGEGGYMRMI 245 [127][TOP] >UniRef100_Q6Q7A8 Cathepsin L-like cysteine proteinase I variant form n=2 Tax=Heterodera glycines RepID=Q6Q7A8_HETGL Length = 374 Score = 70.1 bits (170), Expect = 1e-10 Identities = 31/71 (43%), Positives = 50/71 (70%), Gaps = 2/71 (2%) Frame = +2 Query: 11 RGPISCGIDATDQ-LETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRNSWG 184 +GP+S IDA + + YT GVY E++ +P +++H V V G+G DP T +YW+++NSWG Sbjct: 286 QGPVSVAIDAGHRSFQLYTNGVYFEKECDPQNLDHGVLVEGYGTDP-TQGDYWIVKNSWG 344 Query: 185 EPWGESGFLKL 217 WGE G++++ Sbjct: 345 TRWGEQGYIRM 355 [128][TOP] >UniRef100_C3XVY0 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XVY0_BRAFL Length = 327 Score = 70.1 bits (170), Expect = 1e-10 Identities = 34/70 (48%), Positives = 48/70 (68%), Gaps = 2/70 (2%) Frame = +2 Query: 14 
GPISCGIDATD-QLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GPIS GIDA+ + Y GVY E++ S+ ++H V VVG+G D E D YW+++NSWGE Sbjct: 241 GPISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGTDGEKD--YWLVKNSWGE 298 Query: 188 PWGESGFLKL 217 WG G++K+ Sbjct: 299 EWGMEGYIKM 308 [129][TOP] >UniRef100_B5LBH9 Cathepsin L-like cysteine proteinase n=1 Tax=Bursaphelenchus xylophilus RepID=B5LBH9_BURXY Length = 282 Score = 70.1 bits (170), Expect = 1e-10 Identities = 32/74 (43%), Positives = 49/74 (66%), Gaps = 2/74 (2%) Frame = +2 Query: 2 IYKRGPISCGIDATDQ-LETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRN 175 + +GP+S IDA + Y GVY E+ +P ++H V VVG+G DPE +YW+++N Sbjct: 191 VASQGPVSVAIDAGHRSFRLYKTGVYYEKHCSPEQLDHGVLVVGYGTDPEHG-DYWIVKN 249 Query: 176 SWGEPWGESGFLKL 217 SWGE WGE G++++ Sbjct: 250 SWGEEWGEKGYVRI 263 [130][TOP] >UniRef100_UPI000180B6FF PREDICTED: similar to Dipeptidyl-peptidase 1 precursor (Dipeptidyl-peptidase I) (DPP-I) (DPPI) (Cathepsin C) (Cathepsin J) (Dipeptidyl transferase), partial n=1 Tax=Ciona intestinalis RepID=UPI000180B6FF Length = 220 Score = 69.7 bits (169), Expect = 1e-10 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 12/80 (15%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVY-----AERKS-----NPS--INHVVSVVGWGVDPETDVE 157 GPI+ GI+ + Y G+Y ERK NP NH V VVG+G D T + Sbjct: 118 GPIAVGIEVYPDFQQYKSGIYHHVTSVERKLPVTGYNPFELTNHAVLVVGYGADERTGEK 177 Query: 158 YWVIRNSWGEPWGESGFLKL 217 YW+++NSWGE WGE GF+++ Sbjct: 178 YWIVKNSWGESWGEKGFVRI 197 [131][TOP] >UniRef100_Q812A9 Cathepsin P n=1 Tax=Meriones unguiculatus RepID=Q812A9_MERUN Length = 334 Score = 69.7 bits (169), Expect = 1e-10 Identities = 35/72 (48%), Positives = 49/72 (68%), Gaps = 4/72 (5%) Frame = +2 Query: 14 GPISCGIDAT-DQLETYTGGVYAERK-SNPSINHVVSVVGWGVDP-ETD-VEYWVIRNSW 181 GP++ +DA+ D Y GG+Y E K S S+NH V VVG+G + ETD +YW+I+NSW Sbjct: 243 GPVAAAVDASQDSFRFYRGGIYYEPKCSQYSVNHAVLVVGYGYEGNETDGKDYWLIKNSW 302 Query: 182 GEPWGESGFLKL 217 GE WG G++K+ Sbjct: 303 GENWGMRGYMKI 314 [132][TOP] >UniRef100_B8BWD8 Probable papain cysteine 
protease n=1 Tax=Thalassiosira pseudonana CCMP1335 RepID=B8BWD8_THAPS Length = 336 Score = 69.7 bits (169), Expect = 1e-10 Identities = 28/72 (38%), Positives = 45/72 (62%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I RGP++ I A + G ++++ + NH+V++VGWG D E++ +WV+RNSW Sbjct: 193 IKARGPVAATIQAGPLRDFMGGSIFSDDSAPKFPNHIVAIVGWGKDVESNKSFWVVRNSW 252 Query: 182 GEPWGESGFLKL 217 G WGE GF ++ Sbjct: 253 GYYWGEEGFFRV 264 [133][TOP] >UniRef100_Q6T9Z7 Fibroinase n=1 Tax=Bombyx mori RepID=Q6T9Z7_BOMMO Length = 341 Score = 69.7 bits (169), Expect = 1e-10 Identities = 32/71 (45%), Positives = 49/71 (69%), Gaps = 2/71 (2%) Frame = +2 Query: 14 GPISCGIDATD-QLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA+ + Y+ GVY E + S+ ++H V VVG+G D E V+YW+++NSWG Sbjct: 254 GPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWLVKNSWGR 312 Query: 188 PWGESGFLKLV 220 WGE G++K++ Sbjct: 313 SWGELGYIKMI 323 [134][TOP] >UniRef100_Q26559 Cysteine protease (Fragment) n=1 Tax=Spirometra mansonoides RepID=Q26559_9CEST Length = 216 Score = 69.7 bits (169), Expect = 1e-10 Identities = 33/70 (47%), Positives = 46/70 (65%), Gaps = 2/70 (2%) Frame = +2 Query: 14 GPISCGIDATDQ-LETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GPIS GIDA D +Y+ GV+ + +P INH V V+G+G E D YW+++NSWG Sbjct: 130 GPISVGIDANDPGFMSYSHGVFVSKTCSPDDINHGVLVIGYGT--ENDEPYWLVKNSWGR 187 Query: 188 PWGESGFLKL 217 WGE G++K+ Sbjct: 188 SWGEQGYVKM 197 [135][TOP] >UniRef100_Q26425 Cysteine proteinase n=1 Tax=Bombyx mori RepID=Q26425_BOMMO Length = 344 Score = 69.7 bits (169), Expect = 1e-10 Identities = 32/71 (45%), Positives = 49/71 (69%), Gaps = 2/71 (2%) Frame = +2 Query: 14 GPISCGIDATD-QLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA+ + Y+ GVY E + S+ ++H V VVG+G D E V+YW+++NSWG Sbjct: 257 GPVSVAIDASHTHFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD-EQGVDYWLVKNSWGR 315 Query: 188 PWGESGFLKLV 220 WGE G++K++ Sbjct: 316 SWGELGYIKMI 326 [136][TOP] >UniRef100_C1BMV4 Cathepsin L n=1 Tax=Caligus rogercresseyi 
RepID=C1BMV4_9MAXI Length = 332 Score = 69.7 bits (169), Expect = 1e-10 Identities = 30/70 (42%), Positives = 48/70 (68%), Gaps = 2/70 (2%) Frame = +2 Query: 14 GPISCGIDATDQ-LETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA+ + Y+ GVY E K +P +++H V VVG+G D + +YW+++NSW E Sbjct: 244 GPVSVAIDASHMSFQFYSHGVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSE 303 Query: 188 PWGESGFLKL 217 WG+ G++K+ Sbjct: 304 NWGDQGYIKM 313 [137][TOP] >UniRef100_B7SP43 Putative cathepsin B-like cysteine protease form 1 (Fragment) n=1 Tax=Dermacentor variabilis RepID=B7SP43_DERVA Length = 192 Score = 69.7 bits (169), Expect = 1e-10 Identities = 33/98 (33%), Positives = 45/98 (45%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYK GP+ +Y GVY H + ++GWG E V YW++ NSW Sbjct: 100 IYKNGPVEADFSVYADFPSYKSGVYQRHSEEMLGGHAIRILGWGT--EDGVPYWLVANSW 157 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 E WG+ G+ K+ + G IEDD N G+P Sbjct: 158 NEDWGDKGYFKIRRGNDECG--------IEDDINAGIP 187 [138][TOP] >UniRef100_UPI0000D57338 PREDICTED: similar to putative cathepsin L-like proteinase n=1 Tax=Tribolium castaneum RepID=UPI0000D57338 Length = 328 Score = 69.3 bits (168), Expect = 2e-10 Identities = 32/70 (45%), Positives = 46/70 (65%), Gaps = 1/70 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGV-YAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 + GPI+ +DATD+L+ Y+GGV Y S ++NH V VVG+G + D YW+++NS Sbjct: 239 VANNGPIAVALDATDELQFYSGGVLYDTTCSAQALNHGVLVVGYGSEGGQD--YWIVKNS 296 Query: 179 WGEPWGESGF 208 WG WGE G+ Sbjct: 297 WGSGWGEQGY 306 [139][TOP] >UniRef100_UPI0000D559FA PREDICTED: similar to putative cathepsin B-like proteinase n=1 Tax=Tribolium castaneum RepID=UPI0000D559FA Length = 319 Score = 69.3 bits (168), Expect = 2e-10 Identities = 33/83 (39%), Positives = 41/83 (49%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I GPI Y GVY N + NH+V +VGWG + E D YW+I NSW Sbjct: 230 IMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGTEKEQD--YWLIANSW 287 Query: 182 
GEPWGESGFLKLVTSAYDDGNGN 250 G WGE GF K++ + G N Sbjct: 288 GSSWGEHGFFKILRGKNECGIEN 310 [140][TOP] >UniRef100_UPI0000162F06 cysteine proteinase, putative n=1 Tax=Arabidopsis thaliana RepID=UPI0000162F06 Length = 334 Score = 69.3 bits (168), Expect = 2e-10 Identities = 31/71 (43%), Positives = 46/71 (64%), Gaps = 1/71 (1%) Frame = +2 Query: 8 KRGPISCGIDA-TDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWG 184 +R P+S IDA D Y GGVYA +NH V++VG+G + + YWV++NSWG Sbjct: 244 RRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGT--MSGLNYWVLKNSWG 301 Query: 185 EPWGESGFLKL 217 E WGE+G++++ Sbjct: 302 ESWGENGYMRI 312 [141][TOP] >UniRef100_C3UWE0 Cathepsin K n=1 Tax=Lutjanus argentimaculatus RepID=C3UWE0_9PERO Length = 330 Score = 69.3 bits (168), Expect = 2e-10 Identities = 35/90 (38%), Positives = 54/90 (60%), Gaps = 2/90 (2%) Frame = +2 Query: 2 IYKRGPISCGIDAT-DQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRN 175 ++K GP+S GIDAT + Y+ GVY + N INH V VG+GV + +YW+++N Sbjct: 239 LFKAGPVSVGIDATLSSFQFYSKGVYYDPSCNKEDINHAVLAVGYGVTGKGK-KYWIVKN 297 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLA 265 SWGE WG+ G++ + + GN+ +A Sbjct: 298 SWGESWGKGGYILMARN-----RGNLCGIA 322 [142][TOP] >UniRef100_C1BL99 Cathepsin K n=1 Tax=Osmerus mordax RepID=C1BL99_OSMMO Length = 331 Score = 69.3 bits (168), Expect = 2e-10 Identities = 36/90 (40%), Positives = 52/90 (57%), Gaps = 2/90 (2%) Frame = +2 Query: 2 IYKRGPISCGIDAT-DQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRN 175 + K GP+S GIDAT + Y GVY +R N INH V VG+GV P+ +YW+++N Sbjct: 240 VAKVGPVSVGIDATLSTFQFYQKGVYYDRNCNKDDINHAVLAVGYGVTPKGK-KYWIVKN 298 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLA 265 SW E WG G++ + + GN+ +A Sbjct: 299 SWSESWGNKGYILMARN-----RGNLCGIA 323 [143][TOP] >UniRef100_Q9LP42 Putative cysteine proteinase n=1 Tax=Arabidopsis thaliana RepID=Q9LP42_ARATH Length = 365 Score = 69.3 bits (168), Expect = 2e-10 Identities = 31/71 (43%), Positives = 46/71 (64%), Gaps = 1/71 (1%) Frame = +2 Query: 8 KRGPISCGIDA-TDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWG 184 +R P+S IDA D Y 
GGVYA +NH V++VG+G + + YWV++NSWG Sbjct: 275 RRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGT--MSGLNYWVLKNSWG 332 Query: 185 EPWGESGFLKL 217 E WGE+G++++ Sbjct: 333 ESWGENGYMRI 343 [144][TOP] >UniRef100_Q84W75 Putative cysteine proteinase n=2 Tax=Arabidopsis thaliana RepID=Q84W75_ARATH Length = 355 Score = 69.3 bits (168), Expect = 2e-10 Identities = 30/68 (44%), Positives = 45/68 (66%), Gaps = 1/68 (1%) Frame = +2 Query: 17 PISCGIDATDQ-LETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPW 193 P+S IDA Y+GGVY E ++NH V+ VG+G PE ++YW+ +NSWGE W Sbjct: 267 PVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPE-GIKYWLAKNSWGETW 325 Query: 194 GESGFLKL 217 GE+G++++ Sbjct: 326 GENGYIRI 333 [145][TOP] >UniRef100_Q015J8 Cathepsin (ISS) n=1 Tax=Ostreococcus tauri RepID=Q015J8_OSTTA Length = 556 Score = 69.3 bits (168), Expect = 2e-10 Identities = 35/78 (44%), Positives = 50/78 (64%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNP------SINHVVSVVGWGVDPETDVEYW 163 IY+RGP++ GI+A ++L+ Y GV +P SINH V VVGWGV + ++YW Sbjct: 299 IYERGPVAVGINA-NRLQAYDDGVIMMDDCHPLGRGISSINHAVLVVGWGVTKD-GIKYW 356 Query: 164 VIRNSWGEPWGESGFLKL 217 ++NS+G WG+ GF KL Sbjct: 357 ELKNSYGPKWGDQGFFKL 374 [146][TOP] >UniRef100_B9S9A3 Cysteine protease, putative n=1 Tax=Ricinus communis RepID=B9S9A3_RICCO Length = 340 Score = 69.3 bits (168), Expect = 2e-10 Identities = 28/71 (39%), Positives = 47/71 (66%), Gaps = 2/71 (2%) Frame = +2 Query: 11 RGPISCGIDATDQL--ETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWG 184 R P+S +DAT +L + Y GG++ + ++ H +++VG+G E +YW+I+NSWG Sbjct: 249 RQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAE-GTKYWLIKNSWG 307 Query: 185 EPWGESGFLKL 217 E WGE G+++L Sbjct: 308 EGWGEGGYMRL 318 [147][TOP] >UniRef100_Q9NGW4 Cathepsin L n=1 Tax=Fasciola gigantica RepID=Q9NGW4_FASGI Length = 326 Score = 69.3 bits (168), Expect = 2e-10 Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 1/85 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP + +D Y+GG+Y R S+ +NH V VG+G 
TD YW+++NSWG Sbjct: 237 GPAAVAVDVESDFTMYSGGIYQSRTCSSLRVNHAVLAVGYGTQGGTD--YWIVKNSWGSS 294 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLA 265 WGE G++++V + GNM +A Sbjct: 295 WGERGYIRMVRN-----RGNMCGIA 314 [148][TOP] >UniRef100_B4LJE9 GJ20806 n=1 Tax=Drosophila virilis RepID=B4LJE9_DROVI Length = 339 Score = 69.3 bits (168), Expect = 2e-10 Identities = 31/74 (41%), Positives = 52/74 (70%), Gaps = 2/74 (2%) Frame = +2 Query: 14 GPISCGIDAT-DQLETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA+ + + Y+ G+Y E + +P +++H V VVG+G D E+ +YW+++NSWG Sbjct: 252 GPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTD-ESGQDYWLVKNSWGT 310 Query: 188 PWGESGFLKLVTSA 229 WG+ GF+K+ +A Sbjct: 311 TWGDKGFIKMARNA 324 [149][TOP] >UniRef100_A5HKY1 Cathepsin n=1 Tax=Fasciola gigantica RepID=A5HKY1_FASGI Length = 326 Score = 69.3 bits (168), Expect = 2e-10 Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 1/85 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP + +D Y+GG+Y R S+ +NH V VG+G TD YW+++NSWG Sbjct: 237 GPAAVAVDVESDFMMYSGGIYQSRTCSSLRVNHAVLAVGYGTQSGTD--YWIVKNSWGSS 294 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLA 265 WGE G++++V + GNM +A Sbjct: 295 WGERGYIRMVRN-----RGNMCGIA 314 [150][TOP] >UniRef100_Q63088 Cathepsin J n=1 Tax=Rattus norvegicus RepID=CATJ_RAT Length = 334 Score = 69.3 bits (168), Expect = 2e-10 Identities = 37/72 (51%), Positives = 49/72 (68%), Gaps = 4/72 (5%) Frame = +2 Query: 14 GPISCGIDAT-DQLETYTGGVYAERK-SNPSINHVVSVVGWGVDP-ETDVE-YWVIRNSW 181 GP+S IDA+ D Y+GGVY E S+ +NH V VVG+G + ETD YW+I+NSW Sbjct: 243 GPVSAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNSW 302 Query: 182 GEPWGESGFLKL 217 GE WG +GF+K+ Sbjct: 303 GEEWGINGFMKI 314 [151][TOP] >UniRef100_Q9XYL8 Cathepsin n=1 Tax=Fasciola gigantica RepID=Q9XYL8_FASGI Length = 326 Score = 68.9 bits (167), Expect = 2e-10 Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 1/85 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP + +D Y+GG+Y R S+ 
+NH V VG+G TD YW+++NSWG Sbjct: 237 GPAAVAVDVESDFMMYSGGIYQSRTCSSLRVNHAVLAVGYGTQGGTD--YWIVKNSWGSS 294 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLA 265 WGE G++++V + GNM +A Sbjct: 295 WGERGYIRMVRN-----RGNMCGIA 314 [152][TOP] >UniRef100_Q9NGW1 Cathepsin L (Fragment) n=1 Tax=Fasciola gigantica RepID=Q9NGW1_FASGI Length = 219 Score = 68.9 bits (167), Expect = 2e-10 Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 1/85 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP + +D Y+GG+Y R S+ +NH V VG+G TD YW+++NSWG Sbjct: 130 GPAAVAVDVESDFMMYSGGIYQSRTCSSLRVNHAVLAVGYGTQGGTD--YWIVKNSWGSS 187 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLA 265 WGE G++++V + GNM +A Sbjct: 188 WGERGYIRMVRN-----RGNMCGIA 207 [153][TOP] >UniRef100_Q8MUT6 Cathepsin L2 n=1 Tax=Fasciola gigantica RepID=Q8MUT6_FASGI Length = 326 Score = 68.9 bits (167), Expect = 2e-10 Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 1/85 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP + +D Y+GG+Y R S+ +NH V VG+G TD YW+++NSWG Sbjct: 237 GPAAVAVDVESDFMMYSGGIYQSRTCSSLHVNHAVLAVGYGTQGGTD--YWIVKNSWGSS 294 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLA 265 WGE G++++V + GNM +A Sbjct: 295 WGERGYIRMVRN-----RGNMCGIA 314 [154][TOP] >UniRef100_Q5CWJ0 Cryptopain-cysteine proteinase secreted, possible transmembrane domain near N-terminus n=2 Tax=Cryptosporidium parvum RepID=Q5CWJ0_CRYPV Length = 401 Score = 68.9 bits (167), Expect = 2e-10 Identities = 32/71 (45%), Positives = 45/71 (63%), Gaps = 1/71 (1%) Frame = +2 Query: 8 KRGPISCGIDATDQ-LETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWG 184 K GPIS I A + Y GV+ + +NH V +VG+ +D +T+ EYW++RNSWG Sbjct: 307 KYGPISVAIQADQTPFQFYKSGVF-DAPCGTKVNHGVVLVGYDMDEDTNKEYWLVRNSWG 365 Query: 185 EPWGESGFLKL 217 E WGE G++KL Sbjct: 366 EAWGEKGYIKL 376 [155][TOP] >UniRef100_Q5CIN4 Cryptopain n=1 Tax=Cryptosporidium hominis RepID=Q5CIN4_CRYHO Length = 401 Score = 68.9 bits (167), Expect = 2e-10 Identities = 32/71 (45%), Positives = 45/71 
(63%), Gaps = 1/71 (1%) Frame = +2 Query: 8 KRGPISCGIDATDQ-LETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWG 184 K GPIS I A + Y GV+ + +NH V +VG+ +D +T+ EYW++RNSWG Sbjct: 307 KYGPISVAIQADQTPFQFYKSGVF-DAPCGTKVNHGVVLVGYDMDEDTNKEYWLVRNSWG 365 Query: 185 EPWGESGFLKL 217 E WGE G++KL Sbjct: 366 EAWGEKGYIKL 376 [156][TOP] >UniRef100_C5KY31 Preprocathepsin c, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KY31_9ALVE Length = 326 Score = 68.9 bits (167), Expect = 2e-10 Identities = 40/118 (33%), Positives = 64/118 (54%), Gaps = 3/118 (2%) Frame = +2 Query: 2 IYKRGPISCGIDATD-QLETYTGG--VYAERKSNPSINHVVSVVGWGVDPETDVEYWVIR 172 +Y+RGPISCG+D+ + + Y G V A N S++H + +VGW D + + EY++++ Sbjct: 198 VYRRGPISCGVDSGEVENGKYHPGDIVRATTTGNWSLDHDIVIVGWSQDEDGE-EYYIVK 256 Query: 173 NSWGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRWVPADDLGFGRPDDM 346 NSWG WG+ G+ + + + IE DC++ V D D +G PD M Sbjct: 257 NSWGTFWGDQGYFHV--------QSGINAMGIEQDCSWAVVDPEPRVAD--YGPPDWM 304 [157][TOP] >UniRef100_B6AJH9 Papain family cysteine protease, putative n=1 Tax=Cryptosporidium muris RN66 RepID=B6AJH9_9CRYT Length = 400 Score = 68.9 bits (167), Expect = 2e-10 Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 1/83 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQ-LETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 + K GPIS I A + Y GV+ + +NH V +VG+ +D + EYW++RNS Sbjct: 304 VAKYGPISVAIQADQAPFQFYKKGVF-DAPCGTDVNHAVVLVGYDLDIYSGKEYWLVRNS 362 Query: 179 WGEPWGESGFLKLVTSAYDDGNG 247 WGE WGE+G++KL A G G Sbjct: 363 WGENWGENGYIKLAIQAGKKGKG 385 [158][TOP] >UniRef100_Q7T0N7 Ctsc protein n=1 Tax=Xenopus laevis RepID=Q7T0N7_XENLA Length = 458 Score = 68.6 bits (166), Expect = 3e-10 Identities = 31/74 (41%), Positives = 43/74 (58%), Gaps = 6/74 (8%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVY----AERKSNPS--INHVVSVVGWGVDPETDVEYWVIRN 175 GP+S + D Y GVY + K NP NH V +VG+G D +T +YW+++N Sbjct: 363 GPLSVAFEVYDDFIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKN 422 Query: 176 SWGEPWGESGFLKL 217 SWGE WGE GF ++ Sbjct: 423 SWGESWGEKGFFRI 436 [159][TOP] 
>UniRef100_Q650Y1 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=Q650Y1_ORYSJ Length = 385 Score = 68.6 bits (166), Expect = 3e-10 Identities = 31/69 (44%), Positives = 50/69 (72%), Gaps = 2/69 (2%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVY-AERKSNPSI-NHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 P+S I +D+ +Y GGV+ SNP++ NHVV VVG+GV + +++YW+I+NSWG+ Sbjct: 279 PVSVVITISDEFRSYRGGVFRGPCGSNPNVDNHVVLVVGYGVTTD-NIKYWIIKNSWGKT 337 Query: 191 WGESGFLKL 217 WGE G++++ Sbjct: 338 WGEYGYIRM 346 [160][TOP] >UniRef100_B4ESG0 Papain-like cysteine proteinase (Fragment) n=1 Tax=Hordeum vulgare subsp. vulgare RepID=B4ESG0_HORVD Length = 185 Score = 68.6 bits (166), Expect = 3e-10 Identities = 28/67 (41%), Positives = 45/67 (67%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 PI I+ ++ Y GVY+ ++ H V+VVG+GVD T V+YW+++NSWG+ WG Sbjct: 97 PIGVAIEVGGGMQFYRSGVYSG-PCGTALAHAVTVVGYGVDAATGVKYWLVKNSWGQTWG 155 Query: 197 ESGFLKL 217 ESG++++ Sbjct: 156 ESGYIRM 162 [161][TOP] >UniRef100_Q86GS8 Cathepsin H n=1 Tax=Sterkiella histriomuscorum RepID=Q86GS8_OXYTR Length = 366 Score = 68.6 bits (166), Expect = 3e-10 Identities = 33/75 (44%), Positives = 43/75 (57%), Gaps = 3/75 (4%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAER--KSNPS-INHVVSVVGWGVDPETDVEYWVIR 172 IY GP+S D Y GVYA + P+ +NH V VG+G D E V+YW+I+ Sbjct: 261 IYLHGPVSVAFRVIDGFRDYKSGVYAVEGCANGPNDVNHAVLAVGFGTD-ENKVDYWIIK 319 Query: 173 NSWGEPWGESGFLKL 217 NSWG WG+ GF K+ Sbjct: 320 NSWGAAWGDQGFFKM 334 [162][TOP] >UniRef100_Q86GF6 Cathepsin L n=1 Tax=Pandalus borealis RepID=Q86GF6_PANBO Length = 318 Score = 68.6 bits (166), Expect = 3e-10 Identities = 35/74 (47%), Positives = 48/74 (64%), Gaps = 2/74 (2%) Frame = +2 Query: 2 IYKRGPISCGIDAT-DQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRN 175 ++ GP+S IDA + + Y+ GVY E NPS INH V VG+G + +D YW+I+N Sbjct: 228 VHDDGPVSVCIDAGHNSFQLYSSGVYYEPNCNPSSINHAVLPVGYGTEEGSD--YWLIKN 285 Query: 176 SWGEPWGESGFLKL 217 SWG WG SG++KL Sbjct: 286 SWGTGWGLSGYMKL 299 
[163][TOP] >UniRef100_Q6E7B2 Cathepsin F-like cysteine proteinase (Fragment) n=1 Tax=Brugia malayi RepID=Q6E7B2_BRUMA Length = 461 Score = 68.6 bits (166), Expect = 3e-10 Identities = 39/92 (42%), Positives = 56/92 (60%), Gaps = 3/92 (3%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN--PS-INHVVSVVGWGVDPETDVEYWVIR 172 I +RGP+S GIDA + L Y G+ KS PS INH V + G+G+ E ++ YW I+ Sbjct: 371 IAQRGPLSVGIDA-ELLSYYKSGILHPSKSRCPPSKINHGVLITGYGI--ENNLPYWTIK 427 Query: 173 NSWGEPWGESGFLKLVTSAYDDGNGNMYNLAI 268 NSWGE WGE+G+ +L+ G ++ + AI Sbjct: 428 NSWGEQWGENGYFQLMRGKNICGVSDLVSSAI 459 [164][TOP] >UniRef100_Q22AB1 Papain family cysteine protease containing protein n=1 Tax=Tetrahymena thermophila SB210 RepID=Q22AB1_TETTH Length = 344 Score = 68.6 bits (166), Expect = 3e-10 Identities = 31/73 (42%), Positives = 45/73 (61%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 + K GP++ GI+A L+ Y GG+ + + INH V +VG+GV E + YW+I+N W Sbjct: 255 LVKNGPVAVGINART-LQFYEGGIVDPKNCDDKINHAVLIVGYGV--EEGIPYWLIKNQW 311 Query: 182 GEPWGESGFLKLV 220 G WG GF KL+ Sbjct: 312 GAEWGIKGFFKLI 324 [165][TOP] >UniRef100_Q171L9 Cathepsin b n=1 Tax=Aedes aegypti RepID=Q171L9_AEDAE Length = 342 Score = 68.6 bits (166), Expect = 3e-10 Identities = 34/98 (34%), Positives = 49/98 (50%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GP+ L Y GVY + + H V ++GWGV E ++YW++ NSW Sbjct: 242 IYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV--ENGLKYWLVANSW 299 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G+ WG++GF K+V G IE D + G+P Sbjct: 300 GDDWGDNGFFKIVRGENHCG--------IEKDVHAGLP 329 [166][TOP] >UniRef100_Q16920 Cathepsin B-like thiol protease n=1 Tax=Aedes aegypti RepID=Q16920_AEDAE Length = 342 Score = 68.6 bits (166), Expect = 3e-10 Identities = 34/98 (34%), Positives = 49/98 (50%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY GP+ L Y GVY + + H V ++GWGV E ++YW++ NSW Sbjct: 242 
IYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV--ENGLKYWLVANSW 299 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G+ WG++GF K+V G IE D + G+P Sbjct: 300 GDDWGDNGFFKIVRGENHCG--------IEKDVHAGLP 329 [167][TOP] >UniRef100_C4WSL6 ACYPI000014 protein n=1 Tax=Acyrthosiphon pisum RepID=C4WSL6_ACYPI Length = 335 Score = 68.6 bits (166), Expect = 3e-10 Identities = 37/96 (38%), Positives = 47/96 (48%), Gaps = 2/96 (2%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSI--NHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GPI D D Y GVY +R N S H V ++GWGV+ T YW++ NSWGE Sbjct: 249 GPIEASFDVYDDFMNYESGVY-QRTGNASYLGGHAVKMIGWGVEEGTP--YWLMVNSWGE 305 Query: 188 PWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WG+ G K++ + G IE C GVP Sbjct: 306 QWGDKGMFKILRGTDECG--------IESSCTAGVP 333 [168][TOP] >UniRef100_C1C0V0 Cathepsin L n=1 Tax=Caligus clemensi RepID=C1C0V0_9MAXI Length = 336 Score = 68.6 bits (166), Expect = 3e-10 Identities = 31/70 (44%), Positives = 47/70 (67%), Gaps = 2/70 (2%) Frame = +2 Query: 14 GPISCGIDATDQ-LETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GPIS IDA+ + Y+ GVY E K S+ ++H V VVG+G D + +YW+++NSW E Sbjct: 248 GPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSVSGEDYWLVKNSWSE 307 Query: 188 PWGESGFLKL 217 WG+ G++K+ Sbjct: 308 KWGDQGYIKM 317 [169][TOP] >UniRef100_B7QME5 Cysteine proteinase cathepsin L, putative n=1 Tax=Ixodes scapularis RepID=B7QME5_IXOSC Length = 127 Score = 68.6 bits (166), Expect = 3e-10 Identities = 33/98 (33%), Positives = 53/98 (54%), Gaps = 12/98 (12%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSIN------------HVVSVVGWGVDPETDVE 157 GP++ G + + Y GGVY + S+N H V V G+GVD ET ++ Sbjct: 28 GPVAVGFEVYPDFQMYQGGVYRHTGVHRSLNLGSPFDPFELTNHAVLVTGYGVDKETGLK 87 Query: 158 YWVIRNSWGEPWGESGFLKLVTSAYDDGNGNMYNLAIE 271 YW ++NSWG WGE+G+ +++ + G + +LA+E Sbjct: 88 YWSVKNSWGPGWGENGYFRILRGTDECG---IESLAVE 122 [170][TOP] >UniRef100_B1NHV8 Cathepsin B5 cysteine protease n=1 Tax=Monocercomonoides sp. 
PA RepID=B1NHV8_9EUKA Length = 281 Score = 68.6 bits (166), Expect = 3e-10 Identities = 31/80 (38%), Positives = 41/80 (51%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 +Y RGP + ++Y GVY H V VVGWGV+ T YW+I+NSW Sbjct: 192 LYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHAVMVVGWGVEDGTP--YWLIQNSW 249 Query: 182 GEPWGESGFLKLVTSAYDDG 241 G WGE GF K++ + G Sbjct: 250 GTTWGEQGFFKILRGKNECG 269 [171][TOP] >UniRef100_A9JSG3 Cathepsin B n=1 Tax=Acyrthosiphon pisum RepID=A9JSG3_ACYPI Length = 335 Score = 68.6 bits (166), Expect = 3e-10 Identities = 37/96 (38%), Positives = 47/96 (48%), Gaps = 2/96 (2%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSI--NHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GPI D D Y GVY +R N S H V ++GWGV+ T YW++ NSWGE Sbjct: 249 GPIEASFDVYDDFMNYESGVY-QRTGNASYLGGHAVKMIGWGVEEGTP--YWLMVNSWGE 305 Query: 188 PWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WG+ G K++ + G IE C GVP Sbjct: 306 QWGDKGMFKILRGTDECG--------IESSCTAGVP 333 [172][TOP] >UniRef100_A7XH36 Digestive cysteine protease n=1 Tax=Dermestes frischii RepID=A7XH36_9COLE Length = 339 Score = 68.6 bits (166), Expect = 3e-10 Identities = 31/70 (44%), Positives = 49/70 (70%), Gaps = 2/70 (2%) Frame = +2 Query: 14 GPISCGIDAT-DQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA+ + + Y+ GVY + + S+ ++H V VVG+G D E +YW+++NSWGE Sbjct: 252 GPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTD-ENGQDYWIVKNSWGE 310 Query: 188 PWGESGFLKL 217 WGE G++K+ Sbjct: 311 SWGEQGYIKM 320 [173][TOP] >UniRef100_A2E6N1 Clan CA, family C1, cathepsin B-like cysteine peptidase n=1 Tax=Trichomonas vaginalis G3 RepID=A2E6N1_TRIVA Length = 255 Score = 68.6 bits (166), Expect = 3e-10 Identities = 34/92 (36%), Positives = 50/92 (54%), Gaps = 2/92 (2%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAE--RKSNPSINHVVSVVGWGVDPETDVEYWVIRN 175 IY GP+S + TD+L+ YTGG++ + R H V ++GWG E + YW+I N Sbjct: 162 IYLHGPVSASVAVTDRLKYYTGGLFEDPPRDYIADRTHTVEIIGWG--QEKGIPYWIILN 219 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLAIE 271 +G WGE+G ++ + DD Y LA 
E Sbjct: 220 QYGRLWGENGMMR-IRMGRDDARVESYVLAAE 250 [174][TOP] >UniRef100_C1MQP3 Cathepsin n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MQP3_9CHLO Length = 583 Score = 68.2 bits (165), Expect = 4e-10 Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNP------SINHVVSVVGWGVDPETDVEYW 163 IY+ GP++ GI+ ++L Y GGV ++ P SINH VVGWGV E ++YW Sbjct: 375 IYETGPVAVGING-ERLHFYDGGVITAKECPPAGAGISSINHAALVVGWGV--ENGMKYW 431 Query: 164 VIRNSWGEPWGESGFLKL 217 ++RN++GE +GE G+ KL Sbjct: 432 LVRNTYGEDFGEKGYFKL 449 [175][TOP] >UniRef100_Q171L8 Cathepsin b n=1 Tax=Aedes aegypti RepID=Q171L8_AEDAE Length = 313 Score = 68.2 bits (165), Expect = 4e-10 Identities = 32/98 (32%), Positives = 50/98 (51%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+ GP+ + + Y+GGVY + H V ++GWGV+ T +YW++ NSW Sbjct: 221 IFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWGVEDGT--KYWLVANSW 278 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 G WG+ GF K+V G IE++ + G+P Sbjct: 279 GRVWGDDGFFKMVRGENHCG--------IEENVHAGLP 308 [176][TOP] >UniRef100_C5KKU1 Preprocathepsin c, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KKU1_9ALVE Length = 326 Score = 68.2 bits (165), Expect = 4e-10 Identities = 39/118 (33%), Positives = 63/118 (53%), Gaps = 3/118 (2%) Frame = +2 Query: 2 IYKRGPISCGIDA--TDQLETYTGGVY-AERKSNPSINHVVSVVGWGVDPETDVEYWVIR 172 +Y+RGPISCG+D+ + + + G + A N S++H + +VGW D E EY++++ Sbjct: 198 VYRRGPISCGVDSGEVENGKFHPGDIVRATTTGNWSLDHDIVIVGWSQD-EDGKEYYIVK 256 Query: 173 NSWGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRWVPADDLGFGRPDDM 346 NSWG WG+ G+ + + + IE DC++ V D D +G PD M Sbjct: 257 NSWGTFWGDQGYFHV--------QSGINAMGIEQDCSWAVVDPEPRVAD--YGPPDWM 304 [177][TOP] >UniRef100_C5KBM2 Preprocathepsin c, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5KBM2_9ALVE Length = 326 Score = 68.2 bits (165), Expect = 4e-10 Identities = 41/118 (34%), Positives = 67/118 (56%), Gaps = 3/118 (2%) Frame = +2 Query: 2 
IYKRGPISCGIDAT--DQLETYTGGVY-AERKSNPSINHVVSVVGWGVDPETDVEYWVIR 172 IY+RGPISC +D++ + + + G + A N S++H + ++GWG D E EY++++ Sbjct: 198 IYRRGPISCAVDSSVVENGKYHPGDIVRATTSGNWSLDHDIVIIGWGQD-EDGKEYYIVK 256 Query: 173 NSWGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRWVPADDLGFGRPDDM 346 NSWG WG+ G+ LV S + ++ IE++C + V D D +G PD M Sbjct: 257 NSWGTFWGDQGYF-LVQSGIN-------SMGIEENCAWAVVDPEPRVAD--YGPPDWM 304 [178][TOP] >UniRef100_C3Y5H3 Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3Y5H3_BRAFL Length = 470 Score = 68.2 bits (165), Expect = 4e-10 Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPS-----INHVVSVVGWGVDPETDVEYW 163 + K GP++ + Y GGVY S+P NH V +VG+G DPET ++W Sbjct: 370 LVKNGPMAVAFEVYSDFMHYKGGVYEHTGLSDPFNPFEITNHAVLLVGYGRDPETGAKFW 429 Query: 164 VIRNSWGEPWGESGFLKL 217 ++NSWGE WGE GF ++ Sbjct: 430 TVKNSWGEKWGEEGFFRI 447 [179][TOP] >UniRef100_B7P3P0 Cathepsin B endopeptidase, putative n=1 Tax=Ixodes scapularis RepID=B7P3P0_IXOSC Length = 337 Score = 68.2 bits (165), Expect = 4e-10 Identities = 31/98 (31%), Positives = 47/98 (47%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+K GP+ +Y GVY + + H + ++GWG E YW++ NSW Sbjct: 247 IFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGGHAIRILGWGT--ENGTPYWLVANSW 304 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 E WG+ G+ K++ + G IEDD N G+P Sbjct: 305 NEDWGDHGYFKILRGKDECG--------IEDDINAGIP 334 [180][TOP] >UniRef100_B4LM55 GJ20585 n=1 Tax=Drosophila virilis RepID=B4LM55_DROVI Length = 333 Score = 68.2 bits (165), Expect = 4e-10 Identities = 30/69 (43%), Positives = 42/69 (60%) Frame = +2 Query: 11 RGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 +GPIS I A Y GVY + N NH V VVG+G D + +YW+++NSWG Sbjct: 247 KGPISVLIHAGQTFMQYRSGVYKDNSCNKYFNHAVLVVGYGHDSR-EGDYWLVKNSWGSK 305 Query: 191 WGESGFLKL 217 WGESG++++ Sbjct: 306 WGESGYIRM 314 [181][TOP] >UniRef100_B3TM69 Cathepsin L4 (Fragment) n=1 Tax=Fasciola hepatica 
RepID=B3TM69_FASHE Length = 303 Score = 68.2 bits (165), Expect = 4e-10 Identities = 28/70 (40%), Positives = 44/70 (62%), Gaps = 1/70 (1%) Frame = +2 Query: 11 RGPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 +GP + +D Y GG+YA R S+ S+NH + VVG+G TD YW+++NSWG Sbjct: 213 KGPAAVAVDVESDFLMYRGGIYASRNCSSESLNHGILVVGYGTQDGTD--YWIVKNSWGS 270 Query: 188 PWGESGFLKL 217 WG+ G++++ Sbjct: 271 LWGDHGYIRM 280 [182][TOP] >UniRef100_B3NRG9 GG20405 n=1 Tax=Drosophila erecta RepID=B3NRG9_DROER Length = 352 Score = 68.2 bits (165), Expect = 4e-10 Identities = 31/74 (41%), Positives = 50/74 (67%), Gaps = 2/74 (2%) Frame = +2 Query: 14 GPISCGIDA-TDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP++C ++A T E Y GG+Y + + N +NH V+VVG+G E +YW+I+NS+ + Sbjct: 266 GPLACSMNADTISFEQYGGGIYEDEECNQGEVNHSVTVVGYG--SENGRDYWIIKNSYSQ 323 Query: 188 PWGESGFLKLVTSA 229 WGE GF++L+ +A Sbjct: 324 NWGEGGFMRLIRNA 337 [183][TOP] >UniRef100_B0X8Q0 Cathepsin B n=1 Tax=Culex quinquefasciatus RepID=B0X8Q0_CULQU Length = 341 Score = 68.2 bits (165), Expect = 4e-10 Identities = 35/98 (35%), Positives = 46/98 (46%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I++ GP+ D + Y GVY H V ++GWGV E +YW+ NSW Sbjct: 244 IFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGV--ENGTKYWLCSNSW 301 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 GE WGE GF K+V G IE D + G+P Sbjct: 302 GEDWGERGFFKIVRGENHCG--------IESDVHAGLP 331 [184][TOP] >UniRef100_A8P5T6 Cathepsin F-like cysteine proteinase, putative n=1 Tax=Brugia malayi RepID=A8P5T6_BRUMA Length = 137 Score = 68.2 bits (165), Expect = 4e-10 Identities = 39/92 (42%), Positives = 56/92 (60%), Gaps = 3/92 (3%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN--PS-INHVVSVVGWGVDPETDVEYWVIR 172 I +RGP+S GIDA + L Y G+ KS PS INH V + G+G+ E ++ YW I+ Sbjct: 47 IAQRGPLSVGIDA-ELLSYYKSGILHPSKSRCPPSKINHGVLITGYGI--EDNLPYWTIK 103 Query: 173 NSWGEPWGESGFLKLVTSAYDDGNGNMYNLAI 268 NSWGE WGE+G+ +L+ G ++ + AI Sbjct: 104 
NSWGEQWGENGYFRLMRGKDICGVSDLVSSAI 135 [185][TOP] >UniRef100_Q8V5U0 Viral cathepsin n=2 Tax=Alphabaculovirus RepID=CATV_NPVHZ Length = 367 Score = 68.2 bits (165), Expect = 4e-10 Identities = 30/72 (41%), Positives = 47/72 (65%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 +Y GP++ +DA D + Y G+ + +NH V ++GWG+ E +V YW+I+NSW Sbjct: 280 VYTTGPVAIAVDAMDIIN-YRRGILNQCHIY-DLNHAVLLIGWGI--ENNVPYWIIKNSW 335 Query: 182 GEPWGESGFLKL 217 GE WGE+GFL++ Sbjct: 336 GEDWGENGFLRV 347 [186][TOP] >UniRef100_UPI00015B62BB PREDICTED: similar to cathepsin L-like cysteine proteinase n=1 Tax=Nasonia vitripennis RepID=UPI00015B62BB Length = 346 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/77 (38%), Positives = 49/77 (63%), Gaps = 2/77 (2%) Frame = +2 Query: 14 GPISCGIDA-TDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GPIS IDA +D Y GVY E S ++H V +G+G D +T ++W+++NSWGE Sbjct: 257 GPISVAIDADSDSFMFYHSGVYYEPDCSRTDLDHGVLAIGYGTDSKTGKQFWLVKNSWGE 316 Query: 188 PWGESGFLKLVTSAYDD 238 WGE G++++ + +++ Sbjct: 317 DWGEKGYIRMSRNRHNN 333 [187][TOP] >UniRef100_UPI0000D57337 PREDICTED: similar to putative cathepsin L-like proteinase n=1 Tax=Tribolium castaneum RepID=UPI0000D57337 Length = 328 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/70 (42%), Positives = 46/70 (65%), Gaps = 1/70 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGV-YAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 + GP++ +DAT++L+ Y+GGV Y S ++NH V VVG+G + D YW+++NS Sbjct: 239 VANNGPVAVALDATEELQLYSGGVLYDTTCSAQALNHGVLVVGYGSEGGQD--YWIVKNS 296 Query: 179 WGEPWGESGF 208 WG WGE G+ Sbjct: 297 WGSGWGEQGY 306 [188][TOP] >UniRef100_UPI000069FAF7 LOC407938 protein n=1 Tax=Xenopus (Silurana) tropicalis RepID=UPI000069FAF7 Length = 458 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 6/74 (8%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVY----AERKSNPS--INHVVSVVGWGVDPETDVEYWVIRN 175 GP+S + D Y GVY + K NP NH V +VG+G D +T +YW+++N Sbjct: 363 
GPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKN 422 Query: 176 SWGEPWGESGFLKL 217 SWGE WGE G+ ++ Sbjct: 423 SWGESWGEKGYFRI 436 [189][TOP] >UniRef100_UPI00016E35D8 UPI00016E35D8 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E35D8 Length = 336 Score = 67.8 bits (164), Expect = 5e-10 Identities = 34/90 (37%), Positives = 53/90 (58%), Gaps = 2/90 (2%) Frame = +2 Query: 2 IYKRGPISCGIDAT-DQLETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRN 175 ++K GPI+ GIDAT + Y+ GVY + NP +INH V +VG+GV+ YW+++N Sbjct: 245 LFKHGPIAVGIDATLSTFQLYSKGVYYDPNCNPENINHAVLLVGYGVNSRGQ-HYWIVKN 303 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLA 265 SW WG G++ + + GN+ +A Sbjct: 304 SWSTNWGNGGYVLMARN-----RGNLCGIA 328 [190][TOP] >UniRef100_UPI00016E35D7 UPI00016E35D7 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E35D7 Length = 333 Score = 67.8 bits (164), Expect = 5e-10 Identities = 34/90 (37%), Positives = 53/90 (58%), Gaps = 2/90 (2%) Frame = +2 Query: 2 IYKRGPISCGIDAT-DQLETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRN 175 ++K GPI+ GIDAT + Y+ GVY + NP +INH V +VG+GV+ YW+++N Sbjct: 242 LFKHGPIAVGIDATLSTFQLYSKGVYYDPNCNPENINHAVLLVGYGVNSRGQ-HYWIVKN 300 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLA 265 SW WG G++ + + GN+ +A Sbjct: 301 SWSTNWGNGGYVLMARN-----RGNLCGIA 325 [191][TOP] >UniRef100_Q70B20 Cysteine proteinase (Fragment) n=1 Tax=Platichthys flesus RepID=Q70B20_PLAFE Length = 177 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/69 (43%), Positives = 42/69 (60%), Gaps = 1/69 (1%) Frame = +2 Query: 14 GPISCGIDAT-DQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GPIS GID + D Y GVY E + NH +VG+G E +YW+++NSWG+ Sbjct: 93 GPISVGIDGSLDSFRNYVSGVYDESSCSTFANHYALIVGYG--NENGKDYWLVKNSWGKV 150 Query: 191 WGESGFLKL 217 WGE G++K+ Sbjct: 151 WGEEGYIKM 159 [192][TOP] >UniRef100_Q6NVR4 LOC407938 protein (Fragment) n=1 Tax=Xenopus (Silurana) tropicalis RepID=Q6NVR4_XENTR Length = 470 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 6/74 (8%) Frame = +2 
Query: 14 GPISCGIDATDQLETYTGGVY----AERKSNPS--INHVVSVVGWGVDPETDVEYWVIRN 175 GP+S + D Y GVY + K NP NH V +VG+G D +T +YW+++N Sbjct: 363 GPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKN 422 Query: 176 SWGEPWGESGFLKL 217 SWGE WGE G+ ++ Sbjct: 423 SWGESWGEKGYFRI 436 [193][TOP] >UniRef100_Q4QQR0 Putative uncharacterized protein n=1 Tax=Xenopus (Silurana) tropicalis RepID=Q4QQR0_XENTR Length = 458 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 6/74 (8%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVY----AERKSNPS--INHVVSVVGWGVDPETDVEYWVIRN 175 GP+S + D Y GVY + K NP NH V +VG+G D +T +YW+++N Sbjct: 363 GPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKN 422 Query: 176 SWGEPWGESGFLKL 217 SWGE WGE G+ ++ Sbjct: 423 SWGESWGEKGYFRI 436 [194][TOP] >UniRef100_Q9DCV1 Putative uncharacterized protein n=1 Tax=Mus musculus RepID=Q9DCV1_MOUSE Length = 461 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPS-----INHVVSVVGWGVDPETDVEYW 163 + K GP++ + D Y G+Y S+P NH V +VG+G DP T +EYW Sbjct: 362 LVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYW 421 Query: 164 VIRNSWGEPWGESGFLKL 217 +I+NSWG WGESG+ ++ Sbjct: 422 IIKNSWGSNWGESGYFRI 439 [195][TOP] >UniRef100_Q8BQL3 Putative uncharacterized protein n=1 Tax=Mus musculus RepID=Q8BQL3_MOUSE Length = 462 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPS-----INHVVSVVGWGVDPETDVEYW 163 + K GP++ + D Y G+Y S+P NH V +VG+G DP T +EYW Sbjct: 363 LVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYW 422 Query: 164 VIRNSWGEPWGESGFLKL 217 +I+NSWG WGESG+ ++ Sbjct: 423 IIKNSWGSNWGESGYFRI 440 [196][TOP] >UniRef100_Q3UAF5 Putative uncharacterized protein n=1 Tax=Mus musculus RepID=Q3UAF5_MOUSE Length = 462 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/78 
(39%), Positives = 45/78 (57%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPS-----INHVVSVVGWGVDPETDVEYW 163 + K GP++ + D Y G+Y S+P NH V +VG+G DP T +EYW Sbjct: 363 LVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYW 422 Query: 164 VIRNSWGEPWGESGFLKL 217 +I+NSWG WGESG+ ++ Sbjct: 423 IIKNSWGSNWGESGYFRI 440 [197][TOP] >UniRef100_Q3U8J6 Putative uncharacterized protein (Fragment) n=1 Tax=Mus musculus RepID=Q3U8J6_MOUSE Length = 191 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPS-----INHVVSVVGWGVDPETDVEYW 163 + K GP++ + D Y G+Y S+P NH V +VG+G DP T +EYW Sbjct: 92 LVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYW 151 Query: 164 VIRNSWGEPWGESGFLKL 217 +I+NSWG WGESG+ ++ Sbjct: 152 IIKNSWGSNWGESGYFRI 169 [198][TOP] >UniRef100_Q3TIF1 Putative uncharacterized protein n=1 Tax=Mus musculus RepID=Q3TIF1_MOUSE Length = 462 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPS-----INHVVSVVGWGVDPETDVEYW 163 + K GP++ + D Y G+Y S+P NH V +VG+G DP T +EYW Sbjct: 363 LVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYW 422 Query: 164 VIRNSWGEPWGESGFLKL 217 +I+NSWG WGESG+ ++ Sbjct: 423 IIKNSWGSNWGESGYFRI 440 [199][TOP] >UniRef100_Q7K0S6 CG6347 n=1 Tax=Drosophila melanogaster RepID=Q7K0S6_DROME Length = 352 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/74 (40%), Positives = 51/74 (68%), Gaps = 2/74 (2%) Frame = +2 Query: 14 GPISCGIDA-TDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP++C ++A T E Y+GG+Y + + N +NH V+VVG+G + D YW+I+NS+ + Sbjct: 266 GPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGTENGRD--YWIIKNSYSQ 323 Query: 188 PWGESGFLKLVTSA 229 WGE GF++++ +A Sbjct: 324 NWGEGGFMRILRNA 337 [200][TOP] >UniRef100_Q5DP45 Cathepsin B-like proteinase n=1 Tax=Triatoma vitticeps RepID=Q5DP45_9HEMI Length = 332 Score = 
67.8 bits (164), Expect = 5e-10 Identities = 35/98 (35%), Positives = 48/98 (48%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I K GPI I + L +Y GVY HV+ ++GWGV E D YW++ NSW Sbjct: 243 ILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGGHVIKILGWGV--ENDTPYWLVANSW 300 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WG +GF K++ + + G IED G+P Sbjct: 301 NTDWGNNGFFKILRGSDECG--------IEDQIVAGIP 330 [201][TOP] >UniRef100_Q3ZCX7 Cathepsin L-like cysteine proteinase n=1 Tax=Rotylenchulus reniformis RepID=Q3ZCX7_ROTRE Length = 369 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/71 (43%), Positives = 47/71 (66%), Gaps = 2/71 (2%) Frame = +2 Query: 11 RGPISCGIDATDQ-LETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWG 184 +GP+S IDA + + Y GVY E + NP ++H V VVG+G DPE +YW+++NSW Sbjct: 281 QGPVSVAIDAGHRSFQLYKHGVYFEEECNPEELDHGVLVVGYGTDPEHG-DYWIVKNSWS 339 Query: 185 EPWGESGFLKL 217 WGE G++++ Sbjct: 340 THWGEQGYIRM 350 [202][TOP] >UniRef100_C5L947 Preprocathepsin c, putative n=1 Tax=Perkinsus marinus ATCC 50983 RepID=C5L947_9ALVE Length = 326 Score = 67.8 bits (164), Expect = 5e-10 Identities = 41/118 (34%), Positives = 67/118 (56%), Gaps = 3/118 (2%) Frame = +2 Query: 2 IYKRGPISCGIDAT--DQLETYTGGVY-AERKSNPSINHVVSVVGWGVDPETDVEYWVIR 172 IY+RGPISC +D++ + + + G + A N S++H + ++GWG D E EY++++ Sbjct: 198 IYRRGPISCAVDSSVVENGKYHPGDIVRATTPGNWSLDHDIVIIGWGQD-EDGKEYYIVK 256 Query: 173 NSWGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVPDRWVPADDLGFGRPDDM 346 NSWG WG+ G+ LV S + ++ IE++C + V D D +G PD M Sbjct: 257 NSWGTFWGDQGYF-LVQSGIN-------SMGIEENCAWAVVDPEPRVAD--YGPPDWM 304 [203][TOP] >UniRef100_B5G4Z2 Cathepsin B-like cysteine proteinase n=1 Tax=Clonorchis sinensis RepID=B5G4Z2_CLOSI Length = 343 Score = 67.8 bits (164), Expect = 5e-10 Identities = 35/98 (35%), Positives = 46/98 (46%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I RGP+ + Y GVY P H + ++GWG E DV YW+I NSW Sbjct: 248 IMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWG--EEGDVPYWLIANSW 305 
Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 E WGE G++K + + G IEDD G+P Sbjct: 306 NEDWGEKGYMKFLRGLNECG--------IEDDVTAGLP 335 [204][TOP] >UniRef100_B4QEH9 GD10987 n=1 Tax=Drosophila simulans RepID=B4QEH9_DROSI Length = 352 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/74 (40%), Positives = 51/74 (68%), Gaps = 2/74 (2%) Frame = +2 Query: 14 GPISCGIDA-TDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP++C ++A T E Y+GG+Y + + N +NH V+VVG+G + D YW+I+NS+ + Sbjct: 266 GPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGTENGRD--YWIIKNSYSQ 323 Query: 188 PWGESGFLKLVTSA 229 WGE GF++++ +A Sbjct: 324 NWGEGGFMRILRNA 337 [205][TOP] >UniRef100_B4P7Y3 GE11675 n=1 Tax=Drosophila yakuba RepID=B4P7Y3_DROYA Length = 384 Score = 67.8 bits (164), Expect = 5e-10 Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 1/94 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSI-NHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GPI+C ++ + L+ Y GG+Y + + N NH + VVG+G E +YW+++NSW + Sbjct: 300 GPIACSVNGLETLKNYAGGIYNDDECNQGEPNHSILVVGYG--SENGQDYWIVKNSWDDT 357 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGV 292 WGE G+ +L G Y I D+C++ V Sbjct: 358 WGEQGYFRL-------PRGQNY-CFIADECSYPV 383 [206][TOP] >UniRef100_B4JW16 GH22826 n=1 Tax=Drosophila grimshawi RepID=B4JW16_DROGR Length = 340 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/70 (44%), Positives = 49/70 (70%), Gaps = 2/70 (2%) Frame = +2 Query: 14 GPISCGIDAT-DQLETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA+ + + Y+ GVY E + +P +++H V VVG+G D E +YW+++NSWG Sbjct: 253 GPVSVAIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTD-ENGKDYWLVKNSWGT 311 Query: 188 PWGESGFLKL 217 WG+ GF+K+ Sbjct: 312 TWGDKGFIKM 321 [207][TOP] >UniRef100_B4HQX2 GM21493 n=1 Tax=Drosophila sechellia RepID=B4HQX2_DROSE Length = 352 Score = 67.8 bits (164), Expect = 5e-10 Identities = 30/74 (40%), Positives = 51/74 (68%), Gaps = 2/74 (2%) Frame = +2 Query: 14 GPISCGIDA-TDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP++C ++A T E Y+GG+Y + + N +NH V+VVG+G + D YW+I+NS+ 
+ Sbjct: 266 GPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGTENGRD--YWIIKNSYSQ 323 Query: 188 PWGESGFLKLVTSA 229 WGE GF++++ +A Sbjct: 324 NWGEGGFMRILRNA 337 [208][TOP] >UniRef100_A9JSH4 Cathepsin B n=1 Tax=Myzus persicae RepID=A9JSH4_MYZPE Length = 335 Score = 67.8 bits (164), Expect = 5e-10 Identities = 33/95 (34%), Positives = 48/95 (50%), Gaps = 1/95 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSIN-HVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GPI D D +Y GVY + ++ + H V ++GWGV+ T YW++ NSWGE Sbjct: 249 GPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGVEEGTP--YWLMVNSWGEQ 306 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 WG+ G K++ + G +E C GVP Sbjct: 307 WGDKGMFKILRGTDECG--------VESSCTAGVP 333 [209][TOP] >UniRef100_A4GTA7 Cathepsin B-like cysteine protease form 1 n=1 Tax=Ixodes ricinus RepID=A4GTA7_IXORI Length = 337 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/98 (31%), Positives = 46/98 (46%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 I+K GP+ +Y GVY + H + ++GWG E YW++ NSW Sbjct: 247 IFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGGHAIRILGWGT--ENGTPYWLVANSW 304 Query: 182 GEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDCNFGVP 295 E WG+ G+ K++ + G IEDD N G+P Sbjct: 305 NEDWGDHGYFKILRGKDECG--------IEDDINAGIP 334 [210][TOP] >UniRef100_P97821 Dipeptidyl-peptidase 1 light chain n=3 Tax=Mus musculus RepID=CATC_MOUSE Length = 462 Score = 67.8 bits (164), Expect = 5e-10 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPS-----INHVVSVVGWGVDPETDVEYW 163 + K GP++ + D Y G+Y S+P NH V +VG+G DP T +EYW Sbjct: 363 LVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYW 422 Query: 164 VIRNSWGEPWGESGFLKL 217 +I+NSWG WGESG+ ++ Sbjct: 423 IIKNSWGSNWGESGYFRI 440 [211][TOP] >UniRef100_UPI00017B4BDB UPI00017B4BDB related cluster n=1 Tax=Tetraodon nigroviridis RepID=UPI00017B4BDB Length = 334 Score = 67.4 bits (163), Expect = 7e-10 Identities = 34/90 (37%), Positives = 52/90 (57%), Gaps = 2/90 (2%) Frame 
= +2 Query: 2 IYKRGPISCGIDAT-DQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRN 175 ++K GP++ GIDAT Y+ GVY + NP INH V +VG+GV +YW+++N Sbjct: 243 LFKHGPVAIGIDATLTTFHLYSKGVYYDPDCNPEDINHAVLLVGYGVTRRGQ-QYWIVKN 301 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLA 265 SWG WG G++ + + GN+ +A Sbjct: 302 SWGTGWGTEGYILMARN-----RGNLCGIA 326 [212][TOP] >UniRef100_Q4TI44 Chromosome undetermined SCAF2412, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4TI44_TETNG Length = 123 Score = 67.4 bits (163), Expect = 7e-10 Identities = 34/90 (37%), Positives = 52/90 (57%), Gaps = 2/90 (2%) Frame = +2 Query: 2 IYKRGPISCGIDAT-DQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRN 175 ++K GP++ GIDAT Y+ GVY + NP INH V +VG+GV +YW+++N Sbjct: 32 LFKHGPVAIGIDATLTTFHLYSKGVYYDPDCNPEDINHAVLLVGYGVTRRGQ-QYWIVKN 90 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLA 265 SWG WG G++ + + GN+ +A Sbjct: 91 SWGTGWGTEGYILMARN-----RGNLCGIA 115 [213][TOP] >UniRef100_Q4SW27 Chromosome undetermined SCAF13692, whole genome shotgun sequence. 
(Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4SW27_TETNG Length = 336 Score = 67.4 bits (163), Expect = 7e-10 Identities = 34/90 (37%), Positives = 52/90 (57%), Gaps = 2/90 (2%) Frame = +2 Query: 2 IYKRGPISCGIDAT-DQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRN 175 ++K GP++ GIDAT Y+ GVY + NP INH V +VG+GV +YW+++N Sbjct: 245 LFKHGPVAIGIDATLTTFHLYSKGVYYDPDCNPEDINHAVLLVGYGVTRRGQ-QYWIVKN 303 Query: 176 SWGEPWGESGFLKLVTSAYDDGNGNMYNLA 265 SWG WG G++ + + GN+ +A Sbjct: 304 SWGTGWGTEGYILMARN-----RGNLCGIA 328 [214][TOP] >UniRef100_B5XGB8 Digestive cysteine proteinase 2 n=1 Tax=Salmo salar RepID=B5XGB8_SALSA Length = 367 Score = 67.4 bits (163), Expect = 7e-10 Identities = 40/110 (36%), Positives = 59/110 (53%), Gaps = 18/110 (16%) Frame = +2 Query: 2 IYKRGPISCGIDATDQ-LETYTGGVYAERKSNPS-------INHVVSVVGWGVDPETDVE 157 IYKRGPIS IDA+ + Y GVY NPS ++H V VG+GVD Sbjct: 246 IYKRGPISVAIDASSSDFQFYHSGVY----QNPSCGSAVSELDHAVLAVGFGVDKVHKTP 301 Query: 158 YWVIRNSWGEPWGESGFLKLVTSAYDDGN-------GNMYNL---AIEDD 277 Y++++NSW WG+ G++K++ + ++ N+Y L A+EDD Sbjct: 302 YYIVKNSWSSGWGDHGYIKMIRNGKNNCGIATFATYPNLYRLNRRALEDD 351 [215][TOP] >UniRef100_Q6IE73 Cathepsin Q-like 2 n=1 Tax=Rattus norvegicus RepID=Q6IE73_RAT Length = 342 Score = 67.4 bits (163), Expect = 7e-10 Identities = 31/71 (43%), Positives = 46/71 (64%), Gaps = 2/71 (2%) Frame = +2 Query: 11 RGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVD-PETD-VEYWVIRNSWG 184 +GP++ GI A+ + G+Y E K N +NH V VVG+G + ETD YW+I+NSWG Sbjct: 253 KGPVAAGIHASHGSFHFVSGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWG 312 Query: 185 EPWGESGFLKL 217 + WG G++K+ Sbjct: 313 KQWGLKGYMKI 323 [216][TOP] >UniRef100_O23801 FB1035 (Fragment) n=1 Tax=Ananas comosus RepID=O23801_ANACO Length = 324 Score = 67.4 bits (163), Expect = 7e-10 Identities = 26/67 (38%), Positives = 46/67 (68%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 PI+ IDA++ + Y GGV++ S+NH ++++G+G D + +YW++RNSWG WG Sbjct: 223 PIAALIDASENFQYYNGGVFSG-PCGTSLNHAITIIGYGQD-SSGTKYWIVRNSWGSSWG 280 
Query: 197 ESGFLKL 217 E G++++ Sbjct: 281 EGGYVRM 287 [217][TOP] >UniRef100_Q9Y0D2 Cysteine proteinase n=1 Tax=Hypera postica RepID=Q9Y0D2_9CUCU Length = 324 Score = 67.4 bits (163), Expect = 7e-10 Identities = 29/69 (42%), Positives = 46/69 (66%), Gaps = 1/69 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP+S G+DA+ L +Y G+Y ++ +P+ +NH + VG+G E +YW+I+NSWG Sbjct: 240 GPVSVGMDAS-YLSSYDSGIYEDQDCSPAGLNHAILAVGYGT--ENGKDYWIIKNSWGAS 296 Query: 191 WGESGFLKL 217 WGE G+ +L Sbjct: 297 WGEQGYFRL 305 [218][TOP] >UniRef100_Q5DGQ6 Putative uncharacterized protein n=1 Tax=Schistosoma japonicum RepID=Q5DGQ6_SCHJA Length = 331 Score = 67.4 bits (163), Expect = 7e-10 Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN-PSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GPIS GI A D L Y GV+ + INH V VVG+G + D YW+I+NS Sbjct: 241 VYQYGPISVGIVAVDSLIMYKSGVFESNECKYGDINHGVLVVGYGKEHGKD--YWLIKNS 298 Query: 179 WGEPWGESGFLKL 217 WG+ WG G+ KL Sbjct: 299 WGDLWGSKGYFKL 311 [219][TOP] >UniRef100_Q5DFT0 Putative uncharacterized protein n=1 Tax=Schistosoma japonicum RepID=Q5DFT0_SCHJA Length = 342 Score = 67.4 bits (163), Expect = 7e-10 Identities = 34/73 (46%), Positives = 43/73 (58%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN-PSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GPIS GI A D L Y GV+ INH V VVG+G + D YW+I+NS Sbjct: 252 VYQYGPISVGIVALDSLTMYKSGVFESNDCKYADINHGVLVVGYGKEHGKD--YWLIKNS 309 Query: 179 WGEPWGESGFLKL 217 WG+ WG G+ KL Sbjct: 310 WGDLWGSKGYFKL 322 [220][TOP] >UniRef100_Q5DBA1 Putative uncharacterized protein n=1 Tax=Schistosoma japonicum RepID=Q5DBA1_SCHJA Length = 331 Score = 67.4 bits (163), Expect = 7e-10 Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GPIS GI A D L Y GV+ + INH V VVG+G + D YW+I+NS Sbjct: 241 
VYQYGPISVGIVALDSLIMYKSGVFESNDCKHADINHGVLVVGYGKEHGKD--YWLIKNS 298 Query: 179 WGEPWGESGFLKL 217 WG+ WG G+ KL Sbjct: 299 WGDLWGSKGYFKL 311 [221][TOP] >UniRef100_Q26986 TFCP2 protein (Fragment) n=1 Tax=Tritrichomonas foetus RepID=Q26986_TRIFO Length = 270 Score = 67.4 bits (163), Expect = 7e-10 Identities = 31/74 (41%), Positives = 45/74 (60%), Gaps = 3/74 (4%) Frame = +2 Query: 2 IYKRGPISCGIDATD-QLETYTGGVYAER--KSNPSINHVVSVVGWGVDPETDVEYWVIR 172 I GP+SC +DA + Y GG+Y ++ NH + +VG+GV E EYW++R Sbjct: 179 IAANGPVSCNVDAGHYSFQLYQGGIYWSWFCRTQYIYNHAMGIVGYGV--EGSEEYWIVR 236 Query: 173 NSWGEPWGESGFLK 214 NSWGE WGE G+++ Sbjct: 237 NSWGESWGEQGYIR 250 [222][TOP] >UniRef100_C5IIM1 Cathepsin L-like cysteine proteinase n=1 Tax=Haliotis diversicolor supertexta RepID=C5IIM1_HALDV Length = 347 Score = 67.4 bits (163), Expect = 7e-10 Identities = 33/70 (47%), Positives = 47/70 (67%), Gaps = 2/70 (2%) Frame = +2 Query: 14 GPISCGIDATDQ-LETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP+S IDA+ Q + Y+GGVY E K S+ ++H V VVG+G D D YW+++NSWG Sbjct: 261 GPVSIAIDASHQSFQLYSGGVYDEPKCSSTELDHGVLVVGYGTDDGQD--YWLVKNSWGT 318 Query: 188 PWGESGFLKL 217 WG G++K+ Sbjct: 319 TWGLEGYVKM 328 [223][TOP] >UniRef100_B4P464 GE12565 n=1 Tax=Drosophila yakuba RepID=B4P464_DROYA Length = 353 Score = 67.4 bits (163), Expect = 7e-10 Identities = 31/74 (41%), Positives = 50/74 (67%), Gaps = 2/74 (2%) Frame = +2 Query: 14 GPISCGIDA-TDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GP++C ++A T E Y+GG+Y + + N +NH V+VVG+G + D YW+I+NS+ + Sbjct: 267 GPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGTENGRD--YWIIKNSYSQ 324 Query: 188 PWGESGFLKLVTSA 229 WGE GF++L +A Sbjct: 325 NWGEGGFMRLPRNA 338 [224][TOP] >UniRef100_B4NND9 GK22908 n=1 Tax=Drosophila willistoni RepID=B4NND9_DROWI Length = 381 Score = 67.4 bits (163), Expect = 7e-10 Identities = 30/70 (42%), Positives = 43/70 (61%), Gaps = 1/70 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSN-PSINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP+ C + A + L Y G+++ N +NH V VVG+G E +YW I+NSWGE 
Sbjct: 297 GPVGCSLFADEALLHYEKGIFSNETCNGQELNHAVLVVGYG--SENGQDYWTIKNSWGEN 354 Query: 191 WGESGFLKLV 220 WGESG+ +L+ Sbjct: 355 WGESGYFRLI 364 [225][TOP] >UniRef100_B3S1V6 Putative uncharacterized protein n=1 Tax=Trichoplax adhaerens RepID=B3S1V6_TRIAD Length = 466 Score = 67.4 bits (163), Expect = 7e-10 Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 6/79 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSIN------HVVSVVGWGVDPETDVEYW 163 + K GPIS + + Y GG+Y S N H V +VG+G D ++ +YW Sbjct: 369 LVKNGPISISFEVYGDFKHYKGGIYQHTGLGDSYNPWQITNHAVLLVGYGTDQKSGKDYW 428 Query: 164 VIRNSWGEPWGESGFLKLV 220 +++NSWG WGE+GF +++ Sbjct: 429 IVKNSWGTKWGENGFFRIL 447 [226][TOP] >UniRef100_B2KSE0 Cathepsin B (Fragment) n=1 Tax=Samia cynthia ricini RepID=B2KSE0_SAMCR Length = 283 Score = 67.4 bits (163), Expect = 7e-10 Identities = 26/73 (35%), Positives = 41/73 (56%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 ++K GP+ L +Y GVY + N H + ++GWGV E + +YW+I NSW Sbjct: 205 LFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGV--ENNNKYWLIANSW 262 Query: 182 GEPWGESGFLKLV 220 WG++GF K++ Sbjct: 263 NSDWGDNGFFKIL 275 [227][TOP] >UniRef100_B0W2J0 Cathepsin B-like thiol protease n=1 Tax=Culex quinquefasciatus RepID=B0W2J0_CULQU Length = 288 Score = 67.4 bits (163), Expect = 7e-10 Identities = 32/73 (43%), Positives = 39/73 (53%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IY+ GPI D Y GVY +H V V+GWGV E V+YW+ NSW Sbjct: 199 IYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGV--ENGVKYWLCANSW 256 Query: 182 GEPWGESGFLKLV 220 E WGE+GF K+V Sbjct: 257 NERWGENGFFKIV 269 [228][TOP] >UniRef100_O91466 Viral cathepsin n=1 Tax=Cydia pomonella granulovirus RepID=CATV_GVCP Length = 333 Score = 67.4 bits (163), Expect = 7e-10 Identities = 33/81 (40%), Positives = 47/81 (58%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPW 193 GPIS ID +D L Y G+ ++N +NH V +VG+GV + DV YW+++NSWG W Sbjct: 250 
GPISVAIDVSD-LINYKAGIADICENNEGLNHAVLLVGYGV--KNDVPYWILKNSWGAEW 306 Query: 194 GESGFLKLVTSAYDDGNGNMY 256 GE G+ ++ G N Y Sbjct: 307 GEEGYFRVQRDKNSCGMMNEY 327 [229][TOP] >UniRef100_O23791 Fruit bromelain n=1 Tax=Ananas comosus RepID=BROM1_ANACO Length = 351 Score = 67.4 bits (163), Expect = 7e-10 Identities = 26/67 (38%), Positives = 46/67 (68%) Frame = +2 Query: 17 PISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPWG 196 PI+ IDA++ + Y GGV++ S+NH ++++G+G D + +YW++RNSWG WG Sbjct: 250 PIAALIDASENFQYYNGGVFSG-PCGTSLNHAITIIGYGQD-SSGTKYWIVRNSWGSSWG 307 Query: 197 ESGFLKL 217 E G++++ Sbjct: 308 EGGYVRM 314 [230][TOP] >UniRef100_UPI0001868BDC hypothetical protein BRAFLDRAFT_285529 n=1 Tax=Branchiostoma floridae RepID=UPI0001868BDC Length = 456 Score = 67.0 bits (162), Expect = 9e-10 Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSIN------HVVSVVGWGVDPETDVEYW 163 + K GPI D Y G+Y N HVV++VG+G+DP +D +YW Sbjct: 356 LVKNGPIPVSFQVYDDFRQYHNGIYHHTGLKDGWNPWKVTDHVVTIVGYGIDPSSDEKYW 415 Query: 164 VIRNSWGEPWGESGFLKL 217 + +N+WG WGE G+ K+ Sbjct: 416 IAQNTWGTDWGELGYFKI 433 [231][TOP] >UniRef100_UPI00006A4C13 PREDICTED: similar to predicted protein n=1 Tax=Ciona intestinalis RepID=UPI00006A4C13 Length = 340 Score = 67.0 bits (162), Expect = 9e-10 Identities = 33/74 (44%), Positives = 50/74 (67%), Gaps = 2/74 (2%) Frame = +2 Query: 14 GPISCGIDATD-QLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 GPIS IDA+ + Y+ GVY+E S+ ++H V VVG+G E +YW+++NSWGE Sbjct: 254 GPISVAIDASHTSFQFYSNGVYSETACSSTMLDHGVLVVGYGT--ENGKDYWLVKNSWGE 311 Query: 188 PWGESGFLKLVTSA 229 WGE+G++K+ +A Sbjct: 312 GWGEAGYIKMSRNA 325 [232][TOP] >UniRef100_Q6GP75 MGC80629 protein n=1 Tax=Xenopus laevis RepID=Q6GP75_XENLA Length = 256 Score = 67.0 bits (162), Expect = 9e-10 Identities = 26/73 (35%), Positives = 49/73 (67%), Gaps = 5/73 (6%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDP-----ETDVEYWVIRNS 178 GPI+ I +++L+ Y G++ + + +NH 
V++VG+G + E D +YW+I+NS Sbjct: 167 GPITVAIGVSEELQNYEKGIF-DGECAEEVNHAVTIVGYGTEAAKNEGEEDEDYWIIKNS 225 Query: 179 WGEPWGESGFLKL 217 WG+ WGE+G++++ Sbjct: 226 WGKDWGENGYIRM 238 [233][TOP] >UniRef100_Q2KKU6 Cathepsin (Fragment) n=1 Tax=Siniperca chuatsi RepID=Q2KKU6_SINCH Length = 38 Score = 67.0 bits (162), Expect = 9e-10 Identities = 26/37 (70%), Positives = 33/37 (89%) Frame = +2 Query: 170 RNSWGEPWGESGFLKLVTSAYDDGNGNMYNLAIEDDC 280 RNSWGEPWGE G+L++VTSAY G+G+ YNLA+E+DC Sbjct: 1 RNSWGEPWGEKGWLRIVTSAYKGGSGSQYNLALEEDC 37 [234][TOP] >UniRef100_Q77LX9 Cathepsin n=2 Tax=Alphabaculovirus RepID=Q77LX9_9ABAC Length = 365 Score = 67.0 bits (162), Expect = 9e-10 Identities = 29/72 (40%), Positives = 47/72 (65%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 +Y GP++ +DA D + Y G+ + +NH V ++GWG+ E +V YW+I+NSW Sbjct: 278 VYTTGPVAIAVDAMDIIN-YRRGILNQCHIY-DLNHAVLLIGWGI--ENNVPYWIIKNSW 333 Query: 182 GEPWGESGFLKL 217 GE WGE+G+L++ Sbjct: 334 GEDWGENGYLRV 345 [235][TOP] >UniRef100_Q9ET52 Cathepsin-6 n=1 Tax=Mus musculus RepID=Q9ET52_MOUSE Length = 334 Score = 67.0 bits (162), Expect = 9e-10 Identities = 33/73 (45%), Positives = 50/73 (68%), Gaps = 4/73 (5%) Frame = +2 Query: 14 GPISCGIDAT-DQLETYTGGVYAERK-SNPSINHVVSVVGWGVDP-ETDV-EYWVIRNSW 181 GPIS +DA+ ++ Y GG+Y + SN ++NH V VVG+G + ETD +YW+I+NSW Sbjct: 244 GPISAAVDASFNRFSFYDGGIYHQPNCSNNTVNHAVLVVGYGTEGNETDGNKYWLIKNSW 303 Query: 182 GEPWGESGFLKLV 220 G WG G++K++ Sbjct: 304 GRRWGIGGYMKII 316 [236][TOP] >UniRef100_Q6T857 Cathepsin L n=1 Tax=Fasciola gigantica RepID=Q6T857_FASGI Length = 326 Score = 67.0 bits (162), Expect = 9e-10 Identities = 27/70 (38%), Positives = 43/70 (61%), Gaps = 1/70 (1%) Frame = +2 Query: 11 RGPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNSWGE 187 +GP + +D Y GG+YA R S+ +NH + VVG+G TD YW+++NSWG Sbjct: 236 KGPAAVAVDVESDFLMYRGGIYASRNCSSEKLNHAMLVVGYGTQDGTD--YWIVKNSWGS 293 Query: 188 PWGESGFLKL 217 WG+ G++++ Sbjct: 294 LWGDHGYIRM 303 [237][TOP] >UniRef100_Q6R018 Cathepsin L 
protein n=1 Tax=Fasciola hepatica RepID=Q6R018_FASHE Length = 326 Score = 67.0 bits (162), Expect = 9e-10 Identities = 31/85 (36%), Positives = 47/85 (55%), Gaps = 1/85 (1%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNP-SINHVVSVVGWGVDPETDVEYWVIRNSWGEP 190 GP + +D Y G+Y + +P S+NH V VG+G TD YW+++NSWG Sbjct: 237 GPAAVAVDVESDFMMYRSGIYQSQTCSPLSVNHAVLAVGYGTQGGTD--YWIVKNSWGLS 294 Query: 191 WGESGFLKLVTSAYDDGNGNMYNLA 265 WGE G++++V + GNM +A Sbjct: 295 WGERGYIRMVRN-----RGNMCGIA 314 [238][TOP] >UniRef100_Q5DHW7 Putative uncharacterized protein n=1 Tax=Schistosoma japonicum RepID=Q5DHW7_SCHJA Length = 241 Score = 67.0 bits (162), Expect = 9e-10 Identities = 34/73 (46%), Positives = 43/73 (58%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN-PSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GPIS GI A D L Y GV+ INH V VVG+G + D YW+I+NS Sbjct: 151 VYQYGPISVGIVAVDSLIMYKSGVFESNDCKYADINHGVLVVGYGKEHGKD--YWLIKNS 208 Query: 179 WGEPWGESGFLKL 217 WG+ WG G+ KL Sbjct: 209 WGDLWGSKGYFKL 221 [239][TOP] >UniRef100_Q5DHR5 Putative uncharacterized protein n=1 Tax=Schistosoma japonicum RepID=Q5DHR5_SCHJA Length = 331 Score = 67.0 bits (162), Expect = 9e-10 Identities = 34/73 (46%), Positives = 43/73 (58%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN-PSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GPIS GI A D L Y GV+ INH V VVG+G + D YW+I+NS Sbjct: 241 VYQYGPISVGIVAVDSLIMYKSGVFESNDCKYADINHGVLVVGYGKEHGKD--YWLIKNS 298 Query: 179 WGEPWGESGFLKL 217 WG+ WG G+ KL Sbjct: 299 WGDLWGSKGYFKL 311 [240][TOP] >UniRef100_Q5DCE5 Putative uncharacterized protein n=1 Tax=Schistosoma japonicum RepID=Q5DCE5_SCHJA Length = 331 Score = 67.0 bits (162), Expect = 9e-10 Identities = 34/73 (46%), Positives = 43/73 (58%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN-PSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GPIS GI A D L Y GV+ INH V VVG+G + D YW+I+NS Sbjct: 241 VYQYGPISVGIVALDSLTMYKSGVFESNDCKYGDINHGVLVVGYGKEHGKD--YWLIKNS 298 Query: 179 WGEPWGESGFLKL 217 WG+ WG G+ KL Sbjct: 299 
WGDLWGSKGYFKL 311 [241][TOP] >UniRef100_Q5D8Y1 Putative uncharacterized protein n=1 Tax=Schistosoma japonicum RepID=Q5D8Y1_SCHJA Length = 331 Score = 67.0 bits (162), Expect = 9e-10 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN-PSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GPIS GI A D L Y GV+ INH V +VG+G E +YW+I+NS Sbjct: 241 VYQYGPISVGIVAVDSLIMYKSGVFESNDCKYGDINHGVLIVGYG--KENGKDYWLIKNS 298 Query: 179 WGEPWGESGFLKL 217 WG+ WG G+ KL Sbjct: 299 WGDLWGSKGYFKL 311 [242][TOP] >UniRef100_Q26564 Cathepsin L (Fragment) n=1 Tax=Schistosoma mansoni RepID=Q26564_SCHMA Length = 317 Score = 67.0 bits (162), Expect = 9e-10 Identities = 33/73 (45%), Positives = 45/73 (61%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y GPIS IDA D L Y G+Y ++ S+ +NH V VG+G + D YW+I+NS Sbjct: 228 LYHYGPISVAIDALDDLILYKSGIYESKQCSSFLLNHGVLAVGYGRENRKD--YWLIKNS 285 Query: 179 WGEPWGESGFLKL 217 WG WG +G+ KL Sbjct: 286 WGTTWGMNGYFKL 298 [243][TOP] >UniRef100_Q11007 Cathepsin B proteinase (Fragment) n=1 Tax=Ancylostoma caninum RepID=Q11007_ANCCA Length = 340 Score = 67.0 bits (162), Expect = 9e-10 Identities = 29/80 (36%), Positives = 41/80 (51%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSW 181 IYK GP+ Y GG+Y + + H V VVGWG + TD YW+I NSW Sbjct: 251 IYKNGPVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHAVKVVGWGRENGTD--YWLIANSW 308 Query: 182 GEPWGESGFLKLVTSAYDDG 241 WGE+G+ ++ + + G Sbjct: 309 NTDWGENGYFRIARGSNECG 328 [244][TOP] >UniRef100_Q10834 Preprocathepsin cathepsin L n=1 Tax=Schistosoma japonicum RepID=Q10834_SCHJA Length = 331 Score = 67.0 bits (162), Expect = 9e-10 Identities = 33/73 (45%), Positives = 44/73 (60%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSN-PSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y+ GPIS GI A D L Y G+Y + INH V VG+G E +YW+I+NS Sbjct: 241 VYQYGPISVGIVALDSLILYKSGIYESKDCKYADINHGVLAVGYG--RENGKDYWLIKNS 298 Query: 179 WGEPWGESGFLKL 
217 WG+ WG +G+ KL Sbjct: 299 WGDLWGMNGYFKL 311 [245][TOP] >UniRef100_C6LMY0 Cathepsin B n=1 Tax=Giardia intestinalis ATCC 50581 RepID=C6LMY0_GIALA Length = 298 Score = 67.0 bits (162), Expect = 9e-10 Identities = 29/76 (38%), Positives = 39/76 (51%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPW 193 GP+ Y GGVY H V +VG+G D E DV+YW+IRNSWG W Sbjct: 212 GPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVEMVGYGTD-EYDVDYWIIRNSWGPDW 270 Query: 194 GESGFLKLVTSAYDDG 241 GE G+ +++ + G Sbjct: 271 GEDGYFRIIRMTNECG 286 [246][TOP] >UniRef100_C3ZQU4 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZQU4_BRAFL Length = 456 Score = 67.0 bits (162), Expect = 9e-10 Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 6/78 (7%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERKSNPSIN------HVVSVVGWGVDPETDVEYW 163 + K GPI D Y G+Y N HVV++VG+G+DP +D +YW Sbjct: 356 LVKNGPIPVSFQVYDDFRQYHNGIYHHTGLKDGWNPWKVTDHVVTIVGYGIDPSSDEKYW 415 Query: 164 VIRNSWGEPWGESGFLKL 217 + +N+WG WGE G+ K+ Sbjct: 416 IAQNTWGTDWGELGYFKI 433 [247][TOP] >UniRef100_C1LZA3 SmCL2-like peptidase (C01 family) n=1 Tax=Schistosoma mansoni RepID=C1LZA3_SCHMA Length = 256 Score = 67.0 bits (162), Expect = 9e-10 Identities = 33/73 (45%), Positives = 45/73 (61%), Gaps = 1/73 (1%) Frame = +2 Query: 2 IYKRGPISCGIDATDQLETYTGGVYAERK-SNPSINHVVSVVGWGVDPETDVEYWVIRNS 178 +Y GPIS IDA D L Y G+Y ++ S+ +NH V VG+G + D YW+I+NS Sbjct: 167 LYHYGPISVAIDALDDLILYKSGIYESKQCSSFLLNHGVLAVGYGRENRKD--YWLIKNS 224 Query: 179 WGEPWGESGFLKL 217 WG WG +G+ KL Sbjct: 225 WGTTWGMNGYFKL 237 [248][TOP] >UniRef100_B5DPR9 GA23505 n=1 Tax=Drosophila pseudoobscura pseudoobscura RepID=B5DPR9_DROPS Length = 323 Score = 67.0 bits (162), Expect = 9e-10 Identities = 28/72 (38%), Positives = 47/72 (65%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPW 193 GP++ I+A ++ Y GV+ + + S+NH V VG+G DP +YW+I+NSWG W Sbjct: 239 GPLAVSINAAT-IQFYKSGVFRDSSCSRSVNHAVLAVGYGTDPSYG-DYWLIKNSWGTGW 296 Query: 194 
GESGFLKLVTSA 229 GESG++++ ++ Sbjct: 297 GESGYIRMARNS 308 [249][TOP] >UniRef100_B4LPU4 GJ20385 n=1 Tax=Drosophila virilis RepID=B4LPU4_DROVI Length = 370 Score = 67.0 bits (162), Expect = 9e-10 Identities = 26/70 (37%), Positives = 45/70 (64%), Gaps = 1/70 (1%) Frame = +2 Query: 11 RGPISCGIDATDQLETYTGGVYAERKSNPS-INHVVSVVGWGVDPETDVEYWVIRNSWGE 187 +GP++C ++ + L Y G+YA+ + N +NH + VVG+G + D YW+++NSW + Sbjct: 285 QGPLACSVNGLESLLLYKRGIYADEECNKGEVNHSILVVGYGTEDGQD--YWIVKNSWDK 342 Query: 188 PWGESGFLKL 217 WGE G+ +L Sbjct: 343 AWGEDGYFRL 352 [250][TOP] >UniRef100_B4H900 GL15686 n=1 Tax=Drosophila persimilis RepID=B4H900_DROPE Length = 323 Score = 67.0 bits (162), Expect = 9e-10 Identities = 28/72 (38%), Positives = 47/72 (65%) Frame = +2 Query: 14 GPISCGIDATDQLETYTGGVYAERKSNPSINHVVSVVGWGVDPETDVEYWVIRNSWGEPW 193 GP++ I+A ++ Y GV+ + + S+NH V VG+G DP +YW+I+NSWG W Sbjct: 239 GPLAVSINAAT-IQFYKSGVFRDSSCSRSVNHAVLAVGYGTDPSYG-DYWLIKNSWGTGW 296 Query: 194 GESGFLKLVTSA 229 GESG++++ ++ Sbjct: 297 GESGYIRMARNS 308