
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149038.5 + phase: 0 /partial
(345 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_563648.1| cathepsin B-like cysteine protease, putative [A... 538 e-152
gb|AAN60355.1| unknown [Arabidopsis thaliana] gi|21554165|gb|AAM... 524 e-147
emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis ... 524 e-147
gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tub... 517 e-145
emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana ... 514 e-144
pir||S60479 cathepsin B-like cysteine proteinase (EC 3.4.22.-) -... 514 e-144
ref|NP_563647.1| cathepsin B-like cysteine protease, putative [A... 508 e-143
gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa ... 507 e-142
gb|AAK69541.1| cathepsin B-like cysteine proteinase [Ipomoea bat... 502 e-141
gb|AAF04727.1| cathepsin B-like cysteine proteinase [Ipomoea bat... 498 e-140
emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare] 485 e-136
emb|CAA46811.1| cathepsin B [Triticum aestivum] gi|7435782|pir||... 477 e-133
emb|CAA46810.1| cathepsin B [Triticum aestivum] gi|7435783|pir||... 471 e-131
emb|CAA46812.1| cathepsin B [Triticum aestivum] 424 e-117
emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum] 342 1e-92
dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protei... 308 1e-82
emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum] 304 2e-81
gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tub... 303 6e-81
dbj|BAE01603.1| unnamed protein product [Macaca fascicularis] 278 1e-73
gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct] gi... 277 3e-73
>ref|NP_563648.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1| At1g02300/T6A9_10
[Arabidopsis thaliana] gi|25090140|gb|AAN72238.1|
At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10
[Arabidopsis thaliana]
Length = 362
Score = 538 bits (1386), Expect = e-152
Identities = 244/332 (73%), Positives = 274/332 (82%), Gaps = 10/332 (3%)
Query: 8 EKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELL 67
E L+ KL S ILQ I K++NENP AGW+A+ N RF+N TV +FKRLLGVK PK E L
Sbjct: 34 ENLSKQKLTSWILQNEIVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFL 93
Query: 68 STPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESL 127
P+V+H SLKLPKEFDARTAWSQC++IG+IL QGHCGSCWAFGAVESL
Sbjct: 94 GVPIVSHDISLKLPKEFDARTAWSQCTSIGRILD----------QGHCGSCWAFGAVESL 143
Query: 128 QDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQI 187
DRFCI ++MN+SLSVNDLLACCGFLCG GC+GG PI AWRY HHGVVTEECDPYFD
Sbjct: 144 SDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNT 203
Query: 188 GCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA 247
GCSHPGCEPAY TPKC RKCV GNQ+W+ SKHY V AY+V+S P DIMAEVYKNGPVEVA
Sbjct: 204 GCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVA 263
Query: 248 FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFK 307
FTV+EDFAHYKSGVYKHITG+ +GGHAVKLIGWGTSD+GEDYWLLANQWN +WGDDGYFK
Sbjct: 264 FTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFK 323
Query: 308 IKRGTNECGIEDDVTAGLPSTKNIVREVTDMD 339
I+RGTNECGIE V AGLPS +N+V+ +T D
Sbjct: 324 IRRGTNECGIEHGVVAGLPSDRNVVKGITTSD 355
>gb|AAN60355.1| unknown [Arabidopsis thaliana] gi|21554165|gb|AAM63244.1| cathepsin
B-like cysteine protease, putative [Arabidopsis
thaliana] gi|21281113|gb|AAM45063.1| putative cathepsin
B cysteine protease [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1| putative cathepsin B cysteine
protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis
thaliana] gi|18411686|ref|NP_567215.1| cathepsin B-like
cysteine protease, putative [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine
protease [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis
thaliana]
Length = 359
Score = 524 bits (1349), Expect = e-147
Identities = 241/326 (73%), Positives = 265/326 (80%), Gaps = 10/326 (3%)
Query: 8 EKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELL 67
E L KL+S ILQ+ I K++NENP AGW+AAIN RFSN TV +FKRLLGVK PKK L
Sbjct: 31 ESLTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFL 90
Query: 68 STPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESL 127
P+V+H SLKLPK FDARTAW QC++IG IL QGHCGSCWAFGAVESL
Sbjct: 91 GVPIVSHDPSLKLPKAFDARTAWPQCTSIGNILD----------QGHCGSCWAFGAVESL 140
Query: 128 QDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQI 187
DRFCI F MNISLSVNDLLACCGF CG GCDGG PI AW+Y ++ GVVTEECDPYFD
Sbjct: 141 SDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNT 200
Query: 188 GCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA 247
GCSHPGCEPAY TPKC RKCV N++W SKHYSV Y VKS+PQDIMAEVYKNGPVEV+
Sbjct: 201 GCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVS 260
Query: 248 FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFK 307
FTV+EDFAHYKSGVYKHITGS +GGHAVKLIGWGTS EGEDYWL+ANQWN WGDDGYF
Sbjct: 261 FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFM 320
Query: 308 IKRGTNECGIEDDVTAGLPSTKNIVR 333
I+RGTNECGIED+ AGLPS+KN+ R
Sbjct: 321 IRRGTNECGIEDEPVAGLPSSKNVFR 346
>emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to
cysteine proteases (Pfam: PF00112, E=1.3e-79, N=1)
[Arabidopsis thaliana] gi|30678927|ref|NP_849281.1|
cathepsin B-like cysteine protease, putative
[Arabidopsis thaliana] gi|7435784|pir||T02011 probable
cathepsin B-like cysteine proteinase (EC 3.4.22.-)
T15B16.17a - Arabidopsis thaliana
Length = 359
Score = 524 bits (1349), Expect = e-147
Identities = 241/326 (73%), Positives = 265/326 (80%), Gaps = 10/326 (3%)
Query: 8 EKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELL 67
E L KL+S ILQ+ I K++NENP AGW+AAIN RFSN TV +FKRLLGVK PKK L
Sbjct: 31 ESLTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFL 90
Query: 68 STPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESL 127
P+V+H SLKLPK FDARTAW QC++IG ILG GHCGSCWAFGAVESL
Sbjct: 91 GVPIVSHDPSLKLPKAFDARTAWPQCTSIGNILGL----------GHCGSCWAFGAVESL 140
Query: 128 QDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQI 187
DRFCI F MNISLSVNDLLACCGF CG GCDGG PI AW+Y ++ GVVTEECDPYFD
Sbjct: 141 SDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNT 200
Query: 188 GCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA 247
GCSHPGCEPAY TPKC RKCV N++W SKHYSV Y VKS+PQDIMAEVYKNGPVEV+
Sbjct: 201 GCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVS 260
Query: 248 FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFK 307
FTV+EDFAHYKSGVYKHITGS +GGHAVKLIGWGTS EGEDYWL+ANQWN WGDDGYF
Sbjct: 261 FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFM 320
Query: 308 IKRGTNECGIEDDVTAGLPSTKNIVR 333
I+RGTNECGIED+ AGLPS+KN+ R
Sbjct: 321 IRRGTNECGIEDEPVAGLPSSKNVFR 346
>gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 517 bits (1331), Expect = e-145
Identities = 236/338 (69%), Positives = 272/338 (79%), Gaps = 10/338 (2%)
Query: 7 DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKEL 66
++ ++ KL S ILQ+SI K++NEN EAGW+AA NP+ SNFTV QFKRLLGVK A + +L
Sbjct: 27 EKPISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFTVSQFKRLLGVKPAREGDL 86
Query: 67 LSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVES 126
PV+THP+ +LPKEFDAR AW QCSTIGKIL QGHCGSCWAFGAVES
Sbjct: 87 EGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILD----------QGHCGSCWAFGAVES 136
Query: 127 LQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQ 186
L DRFCIH++++ISLSVNDLLACC FLCG+GCDGG PI AWRY GVVTEECDPYFD
Sbjct: 137 LSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDT 196
Query: 187 IGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEV 246
GCSHPGCEP Y TPKC RKCVKGN +W++SKHY V AYRV DPQ IMAEVYKNGPVEV
Sbjct: 197 TGCSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHDPQSIMAEVYKNGPVEV 256
Query: 247 AFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYF 306
+FTV+EDFAHYKSGVYKH+TG +GGHAVKLIGWGTS++GEDYWL+ N WN WG+DGYF
Sbjct: 257 SFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWGEDGYF 316
Query: 307 KIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDVDAGV 344
KI+RGTNECGIE V AGLPS +N+ E+ D +DA +
Sbjct: 317 KIRRGTNECGIEHSVVAGLPSARNLNVELGDAVLDASM 354
>emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 514 bits (1325), Expect = e-144
Identities = 234/340 (68%), Positives = 271/340 (78%), Gaps = 12/340 (3%)
Query: 7 DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKEL 66
++ ++ K S ILQ+SI KQ+NEN +AGW+AA+NPRFSNFTV QFKRLLGVK K +L
Sbjct: 27 EQPISQAKAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPTRKGDL 86
Query: 67 LSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVES 126
P++THPK L+LP+EFDAR AWS CSTIG+IL QGHCGSCWAFGAVES
Sbjct: 87 KGIPILTHPKLLELPQEFDARVAWSNCSTIGRILD----------QGHCGSCWAFGAVES 136
Query: 127 LQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQ 186
L DRFCIH+ +NISLS NDL ACCGFLCG GCDGG P+ AW+Y GVVT+ECDPYFD
Sbjct: 137 LSDRFCIHYGLNISLSANDLYACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDN 196
Query: 187 IGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEV 246
GCSHPGCEPAY TPKC RKCVK N +W RSKH+ V AY + SDP IM EVYKNGPVEV
Sbjct: 197 EGCSHPGCEPAYPTPKCHRKCVKQNLLWSRSKHFGVNAYMISSDPHSIMTEVYKNGPVEV 256
Query: 247 AFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYF 306
+FTV+EDFAHYKSGVYKH+TG +GGHAVKLIGWGTS++GEDYWLLANQWN WGDDGYF
Sbjct: 257 SFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYF 316
Query: 307 KIKRGTNECGIEDDVTAGLPSTK--NIVREVTDMDVDAGV 344
KI+RGTNEC IED+V AGLPS + N+ +V+D +DA +
Sbjct: 317 KIRRGTNECEIEDEVVAGLPSARNLNVELDVSDAFLDAAM 356
>pir||S60479 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - Aztec tobacco
Length = 356
Score = 514 bits (1323), Expect = e-144
Identities = 234/340 (68%), Positives = 271/340 (78%), Gaps = 12/340 (3%)
Query: 7 DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKEL 66
++ ++ K S ILQ+SI KQ+NEN +AGW+AA+NPRFSNFTV QFKRLLGVK K +L
Sbjct: 27 EQPISQAKAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPTRKGDL 86
Query: 67 LSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVES 126
P++THPK L+LP+EFDAR AWS CSTIG+IL QGHCGSCWAFGAVES
Sbjct: 87 KGIPILTHPKLLELPQEFDARVAWSNCSTIGRILD----------QGHCGSCWAFGAVES 136
Query: 127 LQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQ 186
L DRFCIH+ +NISLS NDL ACCGFLCG GCDGG P+ AW+Y GVVT+ECDPYFD
Sbjct: 137 LSDRFCIHYGLNISLSANDLYACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDN 196
Query: 187 IGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEV 246
GCSHPGCEPAY TPKC RKCVK N +W RSKH+ V AY + SDP IM EVYKNGPVEV
Sbjct: 197 EGCSHPGCEPAYPTPKCHRKCVKQNLLWSRSKHFGVNAYMITSDPLSIMTEVYKNGPVEV 256
Query: 247 AFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYF 306
+FTV+EDFAHYKSGVYKH+TG +GGHAVKLIGWGTS++GEDYWLLANQWN WGDDGYF
Sbjct: 257 SFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYF 316
Query: 307 KIKRGTNECGIEDDVTAGLPSTK--NIVREVTDMDVDAGV 344
KI+RGTNEC IED+V AGLPS + N+ +V+D +DA +
Sbjct: 317 KIRRGTNECEIEDEVVAGLPSARNLNVELDVSDAYLDAAM 356
>ref|NP_563647.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
Length = 379
Score = 508 bits (1309), Expect = e-143
Identities = 232/342 (67%), Positives = 266/342 (76%), Gaps = 10/342 (2%)
Query: 8 EKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELL 67
E L+ KL S ILQ I K++NENP AGW+AA N RF+N TV +FKRLLGV Q PK L
Sbjct: 31 ENLSKQKLTSLILQNEIVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVIQTPKTAYL 90
Query: 68 STPVVTHPKSLKLPKEFDARTAWSQCSTIGKIL----------GSNLILMLMMIQGHCGS 117
P+V H SLKLPKEFDARTAWS C++I +IL S + L + GHCGS
Sbjct: 91 GVPIVRHDLSLKLPKEFDARTAWSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGS 150
Query: 118 CWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVT 177
CWAFGAVESL DRFCI +++N+SLS ND++ACCG LCG GC+GG P+ AW Y +HGVVT
Sbjct: 151 CWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVT 210
Query: 178 EECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAE 237
+ECDPYFD GCSHPGCEP Y TPKC RKCV NQ+W SKHY V AYR+ DPQDIMAE
Sbjct: 211 QECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAE 270
Query: 238 VYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWN 297
VYKNGPVEVAFTV+EDFAHYKSGVYK+ITG+ +GGHAVKLIGWGTSD+GEDYWLLANQWN
Sbjct: 271 VYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWN 330
Query: 298 TNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMD 339
+WGDDGYFKI+RGTNECGIE V AGLPS KN+ + +T D
Sbjct: 331 RSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 372
>gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa (japonica
cultivar-group)]
Length = 358
Score = 507 bits (1306), Expect = e-142
Identities = 230/318 (72%), Positives = 258/318 (80%), Gaps = 10/318 (3%)
Query: 16 NSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHP 75
+S I+Q+ I K IN++P AGW AA NP F+N+T QFK +LGVK P L PV T+P
Sbjct: 38 SSRIIQDDIIKAINKHPNAGWTAARNPYFANYTTAQFKHILGVKPTPHSVLNDVPVKTYP 97
Query: 76 KSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHF 135
+SL LPKEFDAR+AWSQC+TIG IL QGHCGSCWAFGAVE LQDRFCIHF
Sbjct: 98 RSLMLPKEFDARSAWSQCNTIGTILD----------QGHCGSCWAFGAVECLQDRFCIHF 147
Query: 136 DMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCE 195
+MNISLSVNDL+ACCGF+CG GCDGG PI AWRY +GVVT+ECDPYFDQ+GC HPGCE
Sbjct: 148 NMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCE 207
Query: 196 PAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFA 255
PAY TP C +KC NQ+W KH+SV AYRV SDP DIMAEVY+NGPVEVAFTV+EDFA
Sbjct: 208 PAYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFA 267
Query: 256 HYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNEC 315
HYKSGVYKHITG +GGHAVKLIGWGT+D GEDYWLLANQWN WGDDGYFKI RGTNEC
Sbjct: 268 HYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNEC 327
Query: 316 GIEDDVTAGLPSTKNIVR 333
GIE+DV AG+PSTKN+VR
Sbjct: 328 GIEEDVVAGMPSTKNMVR 345
>gb|AAK69541.1| cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 502 bits (1293), Expect = e-141
Identities = 233/331 (70%), Positives = 264/331 (79%), Gaps = 18/331 (5%)
Query: 14 KLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVT 73
+++ ILQ+ I K +NENPEAGW+A +NPRFS+FTV QFKRLLGVK+APK L TPVVT
Sbjct: 30 EVDPKILQDEIVKTVNENPEAGWKADMNPRFSDFTVSQFKRLLGVKKAPKSLLKRTPVVT 89
Query: 74 HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
H K ++LPK FDARTAW QC +I IL QGHCGSCWAFGAVESL DRFCI
Sbjct: 90 HSKEIELPKTFDARTAWPQCLSIADILD----------QGHCGSCWAFGAVESLTDRFCI 139
Query: 134 HFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPG 193
H+ N++LSVNDLLACCGFLCG GCDGG PI AW+Y GVVT ECDPYFDQ GCSHPG
Sbjct: 140 HYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCSHPG 199
Query: 194 CEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFED 253
CEPAY TP C +KCVK N +W SKH+SV AYRV SD IM EVY NGP EV+FTV+ED
Sbjct: 200 CEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYED 259
Query: 254 FAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTN 313
FAHYKSGVYKH+TGS +GGHAVKLIGWGTS++GEDYWLLANQWN +WGDDGYFKI RGTN
Sbjct: 260 FAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLANQWNRSWGDDGYFKIIRGTN 319
Query: 314 ECGIEDDVTAGLPSTKNIVREVTDMDVDAGV 344
ECGIE DVTAG+PSTKN +D+++GV
Sbjct: 320 ECGIE-DVTAGMPSTKN-------LDIESGV 342
>gb|AAF04727.1| cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 498 bits (1283), Expect = e-140
Identities = 232/331 (70%), Positives = 262/331 (79%), Gaps = 18/331 (5%)
Query: 14 KLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVT 73
+++ ILQ+ I K +NENPEAGW+A +NPRFS+FTV QFKRLLGVK+APK L TPVVT
Sbjct: 30 EVDPKILQDEIVKTVNENPEAGWKADMNPRFSDFTVSQFKRLLGVKKAPKSLLKRTPVVT 89
Query: 74 HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
H K ++LPK FDARTAW QC +I IL QGHCGSCWAFGAVESL DRFCI
Sbjct: 90 HSKEIELPKTFDARTAWPQCLSIADILD----------QGHCGSCWAFGAVESLTDRFCI 139
Query: 134 HFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPG 193
H+ N++LSVNDLLACCGFLCG GCDGG PI AW+Y GVVT ECDPYFDQ GCSHPG
Sbjct: 140 HYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCSHPG 199
Query: 194 CEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFED 253
CEPAY TP C +KCVK N +W SKH+SV AYRV SD IM EVY NGP EV+FTV+ED
Sbjct: 200 CEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYED 259
Query: 254 FAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTN 313
FAHYKSGVYKH+TGS +GGHAVKLIGWGTS++GEDYWLLANQWN +WG DGYFKI RGTN
Sbjct: 260 FAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLANQWNRSWGGDGYFKIIRGTN 319
Query: 314 ECGIEDDVTAGLPSTKNIVREVTDMDVDAGV 344
ECGIE DVTAG PSTKN +D+++GV
Sbjct: 320 ECGIE-DVTAGTPSTKN-------LDIESGV 342
>emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 485 bits (1248), Expect = e-136
Identities = 219/313 (69%), Positives = 247/313 (77%), Gaps = 10/313 (3%)
Query: 19 ILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSL 78
I+Q+ I + +N +P AGW A NP +N+T+ QFK +LGVK P L THP+S
Sbjct: 35 IIQKGIIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLLAGVRTKTHPRSE 94
Query: 79 KLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMN 138
+LPKEFDAR+ WS CSTIGKIL QGHCGSCWAFGAVE LQDRFCIH +MN
Sbjct: 95 QLPKEFDARSKWSGCSTIGKILD----------QGHCGSCWAFGAVECLQDRFCIHHNMN 144
Query: 139 ISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY 198
ISLS NDL+ACCGF+CG GCDGG PI AW+Y +GVVTEECDPYFDQ+GC HPGCEPAY
Sbjct: 145 ISLSANDLVACCGFMCGDGCDGGYPISAWQYFVQNGVVTEECDPYFDQVGCKHPGCEPAY 204
Query: 199 QTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYK 258
TP C +KC NQ+W+ KH+S+ AY+V SDP DIMAEVYKNGPVEVAFTV+EDFAHYK
Sbjct: 205 PTPVCEKKCKVQNQVWQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYK 264
Query: 259 SGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 318
SGVYKHITG +GGHAVKLIGWGTSD GEDYWLLANQWN WGDDGYFKI RG NECGIE
Sbjct: 265 SGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIE 324
Query: 319 DDVTAGLPSTKNI 331
+DVTAG+PS KNI
Sbjct: 325 EDVTAGMPSMKNI 337
>emb|CAA46811.1| cathepsin B [Triticum aestivum] gi|7435782|pir||T06466 cathepsin
B-like cysteine proteinase (EC 3.4.22.-) (clone A116) -
wheat (fragment)
Length = 353
Score = 477 bits (1228), Expect = e-133
Identities = 221/317 (69%), Positives = 247/317 (77%), Gaps = 13/317 (4%)
Query: 19 ILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSL 78
I+Q+ I + +N++P AGW A NP F+N+T+ QFK +LGVK P L P+ HP+ +
Sbjct: 37 IIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE-M 95
Query: 79 KLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMN 138
LPKEFDART WS CSTIG IL QGHCG+CWAF AVE+LQDRFCIH +M+
Sbjct: 96 DLPKEFDARTQWSSCSTIGNILD----------QGHCGACWAFAAVEALQDRFCIHLNMS 145
Query: 139 ISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY 198
+SLSVNDLLACCGFLCG+GC+GG PI AWRY GVVTEECDPYFDQ GC HPGCEPAY
Sbjct: 146 VSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAY 205
Query: 199 QTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFE--DFAH 256
TPKC RKC NQ WK +KH+SV AYRV S+P DIMAEVYKNGPVEVAFT + DFAH
Sbjct: 206 PTPKCQRKCKVENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAH 265
Query: 257 YKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECG 316
YKSGVYKHITG +GGHAVKLIGWGTSD GEDYWLLANQWN WGDDGYFKI RG NECG
Sbjct: 266 YKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGENECG 325
Query: 317 IEDDVTAGLPSTKNIVR 333
IE DVTAG+PSTKN R
Sbjct: 326 IEGDVTAGMPSTKNTAR 342
>emb|CAA46810.1| cathepsin B [Triticum aestivum] gi|7435783|pir||T06413 cathepsin
B-like cysteine proteinase (EC 3.4.22.-) - wheat
(fragment)
Length = 305
Score = 471 bits (1212), Expect = e-131
Identities = 212/308 (68%), Positives = 241/308 (77%), Gaps = 10/308 (3%)
Query: 24 IAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKLPKE 83
I + +N +P AGW A NP +N+T+ QFK +LGVK P + TH +S +LPK
Sbjct: 1 IIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLRAAVRTKTHSRSEQLPKV 60
Query: 84 FDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMNISLSV 143
FDAR+ WS CSTIGKIL QGHCGSCWAFGAVE LQDRFCIH +MNI+LS
Sbjct: 61 FDARSKWSGCSTIGKILD----------QGHCGSCWAFGAVECLQDRFCIHHNMNITLSA 110
Query: 144 NDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKC 203
NDL+ACCGF+CG GCDGG PI AW+Y +GVVT+ECDPYFDQ+GC HPGCEPAY TP C
Sbjct: 111 NDLVACCGFMCGDGCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCKHPGCEPAYPTPVC 170
Query: 204 VRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYK 263
+KC NQ+W+ KH+S+ AY+V SDP DIMAEVY NGPVEVAFTV+EDFAHYKSGVYK
Sbjct: 171 EKKCKVQNQVWEEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHYKSGVYK 230
Query: 264 HITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTA 323
HITG +GGHAVKLIGWGTSD GEDYWLLANQWN WGDDGYFKI RG NECGIE+DVTA
Sbjct: 231 HITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTA 290
Query: 324 GLPSTKNI 331
G+PSTKNI
Sbjct: 291 GMPSTKNI 298
>emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 424 bits (1089), Expect = e-117
Identities = 195/285 (68%), Positives = 220/285 (76%), Gaps = 13/285 (4%)
Query: 19 ILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSL 78
I+Q+ I + +N++P AGW A NP F+N+T+ QFK +LGVK P L P+ HP+ +
Sbjct: 37 IIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE-M 95
Query: 79 KLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMN 138
LPKEFDART WS CSTIG IL QGHCG+CWAF AVE+LQDRFCIH +M+
Sbjct: 96 DLPKEFDARTQWSSCSTIGNILD----------QGHCGACWAFAAVEALQDRFCIHLNMS 145
Query: 139 ISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY 198
+SLSVNDLLACCGFLCG+GC+GG PI AWRY GVVTEECDPYFDQ GC HPGCEPAY
Sbjct: 146 VSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAY 205
Query: 199 QTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFE--DFAH 256
TPKC RKC NQ WK +KH+SV AYRV S+P DIMAEVYKNGPVEVAFT + DFAH
Sbjct: 206 PTPKCQRKCKVENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAH 265
Query: 257 YKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWG 301
YKSGVYKHITG +GGHAVKLIGWGTSD GEDYWLLANQWN WG
Sbjct: 266 YKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWG 310
>emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 342 bits (876), Expect = 1e-92
Identities = 150/166 (90%), Positives = 158/166 (94%)
Query: 158 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRS 217
CDGG PI AW+Y AHHGVVTEECDPYFDQIGCSHPGCEP YQTPKCVRKCVKGNQ+WK+S
Sbjct: 1 CDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQVWKKS 60
Query: 218 KHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKL 277
KHYSVK Y+V SDPQ+IM EVYKNGPVEVAF+V+EDFAHYKSGVYKHITGSALGGHAVKL
Sbjct: 61 KHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKL 120
Query: 278 IGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTA 323
GWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE+DVTA
Sbjct: 121 NGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVTA 166
>dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 308 bits (790), Expect = 1e-82
Identities = 134/174 (77%), Positives = 148/174 (85%)
Query: 166 AWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAY 225
AW Y +HGVVT+ECDPYFD GCSHPGCEP Y TPKC RKCV NQ+W SKHY V AY
Sbjct: 3 AWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAY 62
Query: 226 RVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDE 285
R+ DPQDIMAEVYKNGPVEVAFTV+EDFAHYKSGVYK+ITG+ +GGHAVKLIGWGTSD+
Sbjct: 63 RINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDD 122
Query: 286 GEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMD 339
GEDYWLLANQWN +WGDDGYFKI+RGTNECGIE V AGLPS KN+ + +T D
Sbjct: 123 GEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 176
>emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
Length = 206
Score = 304 bits (779), Expect = 2e-81
Identities = 140/176 (79%), Positives = 151/176 (85%), Gaps = 10/176 (5%)
Query: 19 ILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSL 78
+LQESIAK++NENP AGW+AAINPRFSN TVGQFKRLLGVKQ P+ EL S PVVTHPKSL
Sbjct: 41 LLQESIAKEVNENPGAGWKAAINPRFSNSTVGQFKRLLGVKQTPRNELSSIPVVTHPKSL 100
Query: 79 KLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMN 138
LPKEFDARTAW QCSTIG+IL QGHCGSCWAFGAVESL DRFCIHF ++
Sbjct: 101 NLPKEFDARTAWPQCSTIGRILD----------QGHCGSCWAFGAVESLSDRFCIHFGVD 150
Query: 139 ISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGC 194
+ LSVNDLLACCGFLCG+GCDGG PI AW+Y AHHGVVTEECDPYFDQIGCSHPGC
Sbjct: 151 VPLSVNDLLACCGFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGC 206
>gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 218
Score = 303 bits (775), Expect = 6e-81
Identities = 141/204 (69%), Positives = 160/204 (78%), Gaps = 10/204 (4%)
Query: 7 DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKEL 66
++ ++ KL S ILQ+SI K++NEN EAGW+AA NP+ SNFTV QFKRLLGVK A + +L
Sbjct: 25 EKPISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFTVSQFKRLLGVKPAREGDL 84
Query: 67 LSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVES 126
PV+THP+ +LPKEFDAR AW QCSTIGKIL QGHCGSCWAFGAVES
Sbjct: 85 EGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILD----------QGHCGSCWAFGAVES 134
Query: 127 LQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQ 186
L DRFCIH++++ISLSVNDLLACC FLCG+GCDGG PI AWRY GVVTEECDPYFD
Sbjct: 135 LSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDT 194
Query: 187 IGCSHPGCEPAYQTPKCVRKCVKG 210
GCSHPGCEP Y TPKC RKCVKG
Sbjct: 195 TGCSHPGCEPLYPTPKCHRKCVKG 218
>dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
Length = 339
Score = 278 bits (712), Expect = 1e-73
Identities = 144/330 (43%), Positives = 195/330 (58%), Gaps = 40/330 (12%)
Query: 18 HILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRL----LGVKQAPKKELLSTPVVT 73
H L + + +N+ W+A N F N V KRL LG + P++ + +
Sbjct: 24 HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLKRLCGTFLGGPKPPQRVMFT----- 75
Query: 74 HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
+ LKLP+ FDAR W QC TI +I QG CGSCWAFGAVE++ DR CI
Sbjct: 76 --EDLKLPESFDAREQWPQCPTIKEIRD----------QGSCGSCWAFGAVEAISDRICI 123
Query: 134 HFDMNISLSVN--DLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEE-------CDPYF 184
H + ++S+ V+ DLL CCG +CG GC+GG P AW + G+V+ C PY
Sbjct: 124 HTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYS 183
Query: 185 -----DQIGCSHPGCEPAYQTPKCVRKCVKG-NQIWKRSKHYSVKAYRVKSDPQDIMAEV 238
+ S P C TPKC + C G + +K+ KHY +Y V + +DIMAE+
Sbjct: 184 IPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 243
Query: 239 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNT 298
YKNGPVE AF+V+ DF YKSGVY+H+TG +GGHA++++GWG + G YWL+AN WNT
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNT 302
Query: 299 NWGDDGYFKIKRGTNECGIEDDVTAGLPST 328
+WGD+G+FKI RG + CGIE +V AG+P T
Sbjct: 303 DWGDNGFFKILRGQDHCGIESEVVAGIPRT 332
>gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic
construct]
Length = 340
Score = 277 bits (709), Expect = 3e-73
Identities = 143/330 (43%), Positives = 195/330 (58%), Gaps = 40/330 (12%)
Query: 18 HILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRL----LGVKQAPKKELLSTPVVT 73
H + + + +N+ W+A N F N +G KRL LG + P++ + +
Sbjct: 24 HPVSDELVNYVNKR-NTTWQAGHN--FYNVDMGYLKRLCGTFLGGPKPPQRVMFT----- 75
Query: 74 HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
+ LKLP FDAR W QC TI +I QG CGSCWAFGAVE++ DR CI
Sbjct: 76 --EDLKLPASFDAREQWPQCPTIKEIRD----------QGSCGSCWAFGAVEAISDRICI 123
Query: 134 HFDMNISLSVN--DLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEE-------CDPYF 184
H + ++S+ V+ DLL CCG +CG GC+GG P AW + G+V+ C PY
Sbjct: 124 HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYS 183
Query: 185 -----DQIGCSHPGCEPAYQTPKCVRKCVKG-NQIWKRSKHYSVKAYRVKSDPQDIMAEV 238
+ S P C TPKC + C G + +K+ KHY +Y V + +DIMAE+
Sbjct: 184 IPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 243
Query: 239 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNT 298
YKNGPVE AF+V+ DF YKSGVY+H+TG +GGHA++++GWG + G YWL+AN WNT
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNT 302
Query: 299 NWGDDGYFKIKRGTNECGIEDDVTAGLPST 328
+WGD+G+FKI RG + CGIE +V AG+P T
Sbjct: 303 DWGDNGFFKILRGQDHCGIESEVVAGIPRT 332
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.319 0.137 0.440
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 656,065,370
Number of Sequences: 2540612
Number of extensions: 29550959
Number of successful extensions: 64650
Number of sequences better than 10.0: 1787
Number of HSP's better than 10.0 without gapping: 1629
Number of HSP's successfully gapped in prelim test: 158
Number of HSP's that attempted gapping in prelim test: 59400
Number of HSP's gapped (non-prelim): 2208
length of query: 345
length of database: 863,360,394
effective HSP length: 129
effective length of query: 216
effective length of database: 535,621,446
effective search space: 115694232336
effective search space used: 115694232336
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)
Medicago: description of AC149038.5