Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC149038.5 + phase: 0 /partial
         (345 letters)

Database: uniref100 
           2,790,947 sequences; 848,049,833 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef100_Q93VC9 At1g02300/T6A9_10 [Arabidopsis thaliana]             538  e-152
UniRef100_Q94K85 Putative cathepsin B cysteine protease [Arabido...   524  e-147
UniRef100_Q9ZSI0 T15B16.17a protein [Arabidopsis thaliana]            524  e-147
UniRef100_Q6ST24 Cathepsin B-like cysteine proteinase [Solanum t...   517  e-145
UniRef100_Q40413 Cathepsin B-like cysteine proteinase [Nicotiana...   514  e-144
UniRef100_O23681 Cathepsin B-like cysteine proteinase [Arabidops...   506  e-142
UniRef100_Q94G21 Cathepsin B-like cysteine proteinase [Ipomoea b...   502  e-141
UniRef100_Q9SQ82 Cathepsin B-like cysteine proteinase [Ipomoea b...   498  e-140
UniRef100_Q711Q3 Cathepsin B [Hordeum vulgare]                        485  e-136
UniRef100_Q03107 Cathepsin B [Triticum aestivum]                      477  e-133
UniRef100_Q03106 Cathepsin B [Triticum aestivum]                      471  e-131
UniRef100_Q9SC35 Putative cathepsin B-like protease [Pisum sativum]   342  1e-92
UniRef100_Q9SC36 Putative cathepsin B-like protease [Pisum sativum]   304  2e-81
UniRef100_UPI000036E1D7 UPI000036E1D7 UniRef100 entry                 277  3e-73
UniRef100_Q5R6D1 Hypothetical protein DKFZp459D187 [Pongo pygmaeus]   277  3e-73
UniRef100_P07858 Cathepsin B precursor [Homo sapiens]                 275  1e-72
UniRef100_P00787 Cathepsin B precursor [Rattus norvegicus]            273  7e-72
UniRef100_Q6IN22 Cathepsin B, preproprotein [Rattus norvegicus]       271  3e-71
UniRef100_P10605 Cathepsin B precursor [Mus musculus]                 267  4e-70
UniRef100_Q95PM1 Cathepsin B endopeptidase precursor [Schistosom...   265  1e-69

>UniRef100_Q93VC9 At1g02300/T6A9_10 [Arabidopsis thaliana]
          Length = 362

 Score =  538 bits (1386), Expect = e-152
 Identities = 244/332 (73%), Positives = 274/332 (82%), Gaps = 10/332 (3%)

Query: 8   EKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELL 67
           E L+  KL S ILQ  I K++NENP AGW+A+ N RF+N TV +FKRLLGVK  PK E L
Sbjct: 34  ENLSKQKLTSWILQNEIVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFL 93

Query: 68  STPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESL 127
             P+V+H  SLKLPKEFDARTAWSQC++IG+IL           QGHCGSCWAFGAVESL
Sbjct: 94  GVPIVSHDISLKLPKEFDARTAWSQCTSIGRILD----------QGHCGSCWAFGAVESL 143

Query: 128 QDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQI 187
            DRFCI ++MN+SLSVNDLLACCGFLCG GC+GG PI AWRY  HHGVVTEECDPYFD  
Sbjct: 144 SDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNT 203

Query: 188 GCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA 247
           GCSHPGCEPAY TPKC RKCV GNQ+W+ SKHY V AY+V+S P DIMAEVYKNGPVEVA
Sbjct: 204 GCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVA 263

Query: 248 FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFK 307
           FTV+EDFAHYKSGVYKHITG+ +GGHAVKLIGWGTSD+GEDYWLLANQWN +WGDDGYFK
Sbjct: 264 FTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFK 323

Query: 308 IKRGTNECGIEDDVTAGLPSTKNIVREVTDMD 339
           I+RGTNECGIE  V AGLPS +N+V+ +T  D
Sbjct: 324 IRRGTNECGIEHGVVAGLPSDRNVVKGITTSD 355


>UniRef100_Q94K85 Putative cathepsin B cysteine protease [Arabidopsis thaliana]
          Length = 359

 Score =  524 bits (1349), Expect = e-147
 Identities = 241/326 (73%), Positives = 265/326 (80%), Gaps = 10/326 (3%)

Query: 8   EKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELL 67
           E L   KL+S ILQ+ I K++NENP AGW+AAIN RFSN TV +FKRLLGVK  PKK  L
Sbjct: 31  ESLTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFL 90

Query: 68  STPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESL 127
             P+V+H  SLKLPK FDARTAW QC++IG IL           QGHCGSCWAFGAVESL
Sbjct: 91  GVPIVSHDPSLKLPKAFDARTAWPQCTSIGNILD----------QGHCGSCWAFGAVESL 140

Query: 128 QDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQI 187
            DRFCI F MNISLSVNDLLACCGF CG GCDGG PI AW+Y ++ GVVTEECDPYFD  
Sbjct: 141 SDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNT 200

Query: 188 GCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA 247
           GCSHPGCEPAY TPKC RKCV  N++W  SKHYSV  Y VKS+PQDIMAEVYKNGPVEV+
Sbjct: 201 GCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVS 260

Query: 248 FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFK 307
           FTV+EDFAHYKSGVYKHITGS +GGHAVKLIGWGTS EGEDYWL+ANQWN  WGDDGYF 
Sbjct: 261 FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFM 320

Query: 308 IKRGTNECGIEDDVTAGLPSTKNIVR 333
           I+RGTNECGIED+  AGLPS+KN+ R
Sbjct: 321 IRRGTNECGIEDEPVAGLPSSKNVFR 346


>UniRef100_Q9ZSI0 T15B16.17a protein [Arabidopsis thaliana]
          Length = 359

 Score =  524 bits (1349), Expect = e-147
 Identities = 241/326 (73%), Positives = 265/326 (80%), Gaps = 10/326 (3%)

Query: 8   EKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELL 67
           E L   KL+S ILQ+ I K++NENP AGW+AAIN RFSN TV +FKRLLGVK  PKK  L
Sbjct: 31  ESLTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFL 90

Query: 68  STPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESL 127
             P+V+H  SLKLPK FDARTAW QC++IG ILG           GHCGSCWAFGAVESL
Sbjct: 91  GVPIVSHDPSLKLPKAFDARTAWPQCTSIGNILGL----------GHCGSCWAFGAVESL 140

Query: 128 QDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQI 187
            DRFCI F MNISLSVNDLLACCGF CG GCDGG PI AW+Y ++ GVVTEECDPYFD  
Sbjct: 141 SDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNT 200

Query: 188 GCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA 247
           GCSHPGCEPAY TPKC RKCV  N++W  SKHYSV  Y VKS+PQDIMAEVYKNGPVEV+
Sbjct: 201 GCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVS 260

Query: 248 FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFK 307
           FTV+EDFAHYKSGVYKHITGS +GGHAVKLIGWGTS EGEDYWL+ANQWN  WGDDGYF 
Sbjct: 261 FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFM 320

Query: 308 IKRGTNECGIEDDVTAGLPSTKNIVR 333
           I+RGTNECGIED+  AGLPS+KN+ R
Sbjct: 321 IRRGTNECGIEDEPVAGLPSSKNVFR 346


>UniRef100_Q6ST24 Cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score =  517 bits (1331), Expect = e-145
 Identities = 236/338 (69%), Positives = 272/338 (79%), Gaps = 10/338 (2%)

Query: 7   DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKEL 66
           ++ ++  KL S ILQ+SI K++NEN EAGW+AA NP+ SNFTV QFKRLLGVK A + +L
Sbjct: 27  EKPISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFTVSQFKRLLGVKPAREGDL 86

Query: 67  LSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVES 126
              PV+THP+  +LPKEFDAR AW QCSTIGKIL           QGHCGSCWAFGAVES
Sbjct: 87  EGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILD----------QGHCGSCWAFGAVES 136

Query: 127 LQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQ 186
           L DRFCIH++++ISLSVNDLLACC FLCG+GCDGG PI AWRY    GVVTEECDPYFD 
Sbjct: 137 LSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDT 196

Query: 187 IGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEV 246
            GCSHPGCEP Y TPKC RKCVKGN +W++SKHY V AYRV  DPQ IMAEVYKNGPVEV
Sbjct: 197 TGCSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHDPQSIMAEVYKNGPVEV 256

Query: 247 AFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYF 306
           +FTV+EDFAHYKSGVYKH+TG  +GGHAVKLIGWGTS++GEDYWL+ N WN  WG+DGYF
Sbjct: 257 SFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWGEDGYF 316

Query: 307 KIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDVDAGV 344
           KI+RGTNECGIE  V AGLPS +N+  E+ D  +DA +
Sbjct: 317 KIRRGTNECGIEHSVVAGLPSARNLNVELGDAVLDASM 354


>UniRef100_Q40413 Cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score =  514 bits (1325), Expect = e-144
 Identities = 234/340 (68%), Positives = 271/340 (78%), Gaps = 12/340 (3%)

Query: 7   DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKEL 66
           ++ ++  K  S ILQ+SI KQ+NEN +AGW+AA+NPRFSNFTV QFKRLLGVK   K +L
Sbjct: 27  EQPISQAKAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPTRKGDL 86

Query: 67  LSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVES 126
              P++THPK L+LP+EFDAR AWS CSTIG+IL           QGHCGSCWAFGAVES
Sbjct: 87  KGIPILTHPKLLELPQEFDARVAWSNCSTIGRILD----------QGHCGSCWAFGAVES 136

Query: 127 LQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQ 186
           L DRFCIH+ +NISLS NDL ACCGFLCG GCDGG P+ AW+Y    GVVT+ECDPYFD 
Sbjct: 137 LSDRFCIHYGLNISLSANDLYACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDN 196

Query: 187 IGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEV 246
            GCSHPGCEPAY TPKC RKCVK N +W RSKH+ V AY + SDP  IM EVYKNGPVEV
Sbjct: 197 EGCSHPGCEPAYPTPKCHRKCVKQNLLWSRSKHFGVNAYMISSDPHSIMTEVYKNGPVEV 256

Query: 247 AFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYF 306
           +FTV+EDFAHYKSGVYKH+TG  +GGHAVKLIGWGTS++GEDYWLLANQWN  WGDDGYF
Sbjct: 257 SFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYF 316

Query: 307 KIKRGTNECGIEDDVTAGLPSTK--NIVREVTDMDVDAGV 344
           KI+RGTNEC IED+V AGLPS +  N+  +V+D  +DA +
Sbjct: 317 KIRRGTNECEIEDEVVAGLPSARNLNVELDVSDAFLDAAM 356


>UniRef100_O23681 Cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score =  506 bits (1302), Expect = e-142
 Identities = 230/332 (69%), Positives = 262/332 (78%), Gaps = 12/332 (3%)

Query: 8   EKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELL 67
           E L+  KL S ILQ  I K++NENP AGW+AA N RF+N TV +FKRLLGV Q PK   L
Sbjct: 31  ENLSKQKLTSLILQNEIVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVIQTPKTAYL 90

Query: 68  STPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESL 127
             P+V H  SLKLPKEFDARTAWS C++I +ILG            HCGSCWAFGAVESL
Sbjct: 91  GVPIVRHDLSLKLPKEFDARTAWSHCTSIRRILG------------HCGSCWAFGAVESL 138

Query: 128 QDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQI 187
            DRFCI +++N+SLS ND++ACCG LCG GC+GG P+ AW Y  +HGVVT+ECDPYFD  
Sbjct: 139 SDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNT 198

Query: 188 GCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA 247
           GCSHPGCEP Y TPKC RKCV  NQ+W  SKHY V AYR+  DPQDIMAEVYKNGPVEVA
Sbjct: 199 GCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVA 258

Query: 248 FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFK 307
           FTV+EDFAHYKSGVYK+ITG+ +GGHAVKLIGWGTSD+GEDYWLLANQWN +WGDDGYFK
Sbjct: 259 FTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFK 318

Query: 308 IKRGTNECGIEDDVTAGLPSTKNIVREVTDMD 339
           I+RGTNECGIE  V AGLPS KN+ + +T  D
Sbjct: 319 IRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 350


>UniRef100_Q94G21 Cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  502 bits (1293), Expect = e-141
 Identities = 233/331 (70%), Positives = 264/331 (79%), Gaps = 18/331 (5%)

Query: 14  KLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVT 73
           +++  ILQ+ I K +NENPEAGW+A +NPRFS+FTV QFKRLLGVK+APK  L  TPVVT
Sbjct: 30  EVDPKILQDEIVKTVNENPEAGWKADMNPRFSDFTVSQFKRLLGVKKAPKSLLKRTPVVT 89

Query: 74  HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
           H K ++LPK FDARTAW QC +I  IL           QGHCGSCWAFGAVESL DRFCI
Sbjct: 90  HSKEIELPKTFDARTAWPQCLSIADILD----------QGHCGSCWAFGAVESLTDRFCI 139

Query: 134 HFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPG 193
           H+  N++LSVNDLLACCGFLCG GCDGG PI AW+Y    GVVT ECDPYFDQ GCSHPG
Sbjct: 140 HYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCSHPG 199

Query: 194 CEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFED 253
           CEPAY TP C +KCVK N +W  SKH+SV AYRV SD   IM EVY NGP EV+FTV+ED
Sbjct: 200 CEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYED 259

Query: 254 FAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTN 313
           FAHYKSGVYKH+TGS +GGHAVKLIGWGTS++GEDYWLLANQWN +WGDDGYFKI RGTN
Sbjct: 260 FAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLANQWNRSWGDDGYFKIIRGTN 319

Query: 314 ECGIEDDVTAGLPSTKNIVREVTDMDVDAGV 344
           ECGIE DVTAG+PSTKN       +D+++GV
Sbjct: 320 ECGIE-DVTAGMPSTKN-------LDIESGV 342


>UniRef100_Q9SQ82 Cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  498 bits (1283), Expect = e-140
 Identities = 232/331 (70%), Positives = 262/331 (79%), Gaps = 18/331 (5%)

Query: 14  KLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVT 73
           +++  ILQ+ I K +NENPEAGW+A +NPRFS+FTV QFKRLLGVK+APK  L  TPVVT
Sbjct: 30  EVDPKILQDEIVKTVNENPEAGWKADMNPRFSDFTVSQFKRLLGVKKAPKSLLKRTPVVT 89

Query: 74  HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
           H K ++LPK FDARTAW QC +I  IL           QGHCGSCWAFGAVESL DRFCI
Sbjct: 90  HSKEIELPKTFDARTAWPQCLSIADILD----------QGHCGSCWAFGAVESLTDRFCI 139

Query: 134 HFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPG 193
           H+  N++LSVNDLLACCGFLCG GCDGG PI AW+Y    GVVT ECDPYFDQ GCSHPG
Sbjct: 140 HYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCSHPG 199

Query: 194 CEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFED 253
           CEPAY TP C +KCVK N +W  SKH+SV AYRV SD   IM EVY NGP EV+FTV+ED
Sbjct: 200 CEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYED 259

Query: 254 FAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTN 313
           FAHYKSGVYKH+TGS +GGHAVKLIGWGTS++GEDYWLLANQWN +WG DGYFKI RGTN
Sbjct: 260 FAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLANQWNRSWGGDGYFKIIRGTN 319

Query: 314 ECGIEDDVTAGLPSTKNIVREVTDMDVDAGV 344
           ECGIE DVTAG PSTKN       +D+++GV
Sbjct: 320 ECGIE-DVTAGTPSTKN-------LDIESGV 342


>UniRef100_Q711Q3 Cathepsin B [Hordeum vulgare]
          Length = 344

 Score =  485 bits (1248), Expect = e-136
 Identities = 219/313 (69%), Positives = 247/313 (77%), Gaps = 10/313 (3%)

Query: 19  ILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSL 78
           I+Q+ I + +N +P AGW A  NP  +N+T+ QFK +LGVK  P   L      THP+S 
Sbjct: 35  IIQKGIIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLLAGVRTKTHPRSE 94

Query: 79  KLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMN 138
           +LPKEFDAR+ WS CSTIGKIL           QGHCGSCWAFGAVE LQDRFCIH +MN
Sbjct: 95  QLPKEFDARSKWSGCSTIGKILD----------QGHCGSCWAFGAVECLQDRFCIHHNMN 144

Query: 139 ISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY 198
           ISLS NDL+ACCGF+CG GCDGG PI AW+Y   +GVVTEECDPYFDQ+GC HPGCEPAY
Sbjct: 145 ISLSANDLVACCGFMCGDGCDGGYPISAWQYFVQNGVVTEECDPYFDQVGCKHPGCEPAY 204

Query: 199 QTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYK 258
            TP C +KC   NQ+W+  KH+S+ AY+V SDP DIMAEVYKNGPVEVAFTV+EDFAHYK
Sbjct: 205 PTPVCEKKCKVQNQVWQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYK 264

Query: 259 SGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 318
           SGVYKHITG  +GGHAVKLIGWGTSD GEDYWLLANQWN  WGDDGYFKI RG NECGIE
Sbjct: 265 SGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIE 324

Query: 319 DDVTAGLPSTKNI 331
           +DVTAG+PS KNI
Sbjct: 325 EDVTAGMPSMKNI 337


>UniRef100_Q03107 Cathepsin B [Triticum aestivum]
          Length = 353

 Score =  477 bits (1228), Expect = e-133
 Identities = 221/317 (69%), Positives = 247/317 (77%), Gaps = 13/317 (4%)

Query: 19  ILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSL 78
           I+Q+ I + +N++P AGW A  NP F+N+T+ QFK +LGVK  P   L   P+  HP+ +
Sbjct: 37  IIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE-M 95

Query: 79  KLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMN 138
            LPKEFDART WS CSTIG IL           QGHCG+CWAF AVE+LQDRFCIH +M+
Sbjct: 96  DLPKEFDARTQWSSCSTIGNILD----------QGHCGACWAFAAVEALQDRFCIHLNMS 145

Query: 139 ISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY 198
           +SLSVNDLLACCGFLCG+GC+GG PI AWRY    GVVTEECDPYFDQ GC HPGCEPAY
Sbjct: 146 VSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAY 205

Query: 199 QTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFE--DFAH 256
            TPKC RKC   NQ WK +KH+SV AYRV S+P DIMAEVYKNGPVEVAFT  +  DFAH
Sbjct: 206 PTPKCQRKCKVENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAH 265

Query: 257 YKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECG 316
           YKSGVYKHITG  +GGHAVKLIGWGTSD GEDYWLLANQWN  WGDDGYFKI RG NECG
Sbjct: 266 YKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGENECG 325

Query: 317 IEDDVTAGLPSTKNIVR 333
           IE DVTAG+PSTKN  R
Sbjct: 326 IEGDVTAGMPSTKNTAR 342


>UniRef100_Q03106 Cathepsin B [Triticum aestivum]
          Length = 305

 Score =  471 bits (1212), Expect = e-131
 Identities = 212/308 (68%), Positives = 241/308 (77%), Gaps = 10/308 (3%)

Query: 24  IAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKLPKE 83
           I + +N +P AGW A  NP  +N+T+ QFK +LGVK  P     +    TH +S +LPK 
Sbjct: 1   IIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLRAAVRTKTHSRSEQLPKV 60

Query: 84  FDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMNISLSV 143
           FDAR+ WS CSTIGKIL           QGHCGSCWAFGAVE LQDRFCIH +MNI+LS 
Sbjct: 61  FDARSKWSGCSTIGKILD----------QGHCGSCWAFGAVECLQDRFCIHHNMNITLSA 110

Query: 144 NDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKC 203
           NDL+ACCGF+CG GCDGG PI AW+Y   +GVVT+ECDPYFDQ+GC HPGCEPAY TP C
Sbjct: 111 NDLVACCGFMCGDGCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCKHPGCEPAYPTPVC 170

Query: 204 VRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYK 263
            +KC   NQ+W+  KH+S+ AY+V SDP DIMAEVY NGPVEVAFTV+EDFAHYKSGVYK
Sbjct: 171 EKKCKVQNQVWEEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHYKSGVYK 230

Query: 264 HITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTA 323
           HITG  +GGHAVKLIGWGTSD GEDYWLLANQWN  WGDDGYFKI RG NECGIE+DVTA
Sbjct: 231 HITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTA 290

Query: 324 GLPSTKNI 331
           G+PSTKNI
Sbjct: 291 GMPSTKNI 298


>UniRef100_Q9SC35 Putative cathepsin B-like protease [Pisum sativum]
          Length = 174

 Score =  342 bits (876), Expect = 1e-92
 Identities = 150/166 (90%), Positives = 158/166 (94%)

Query: 158 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRS 217
           CDGG PI AW+Y AHHGVVTEECDPYFDQIGCSHPGCEP YQTPKCVRKCVKGNQ+WK+S
Sbjct: 1   CDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQVWKKS 60

Query: 218 KHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKL 277
           KHYSVK Y+V SDPQ+IM EVYKNGPVEVAF+V+EDFAHYKSGVYKHITGSALGGHAVKL
Sbjct: 61  KHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKL 120

Query: 278 IGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTA 323
            GWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE+DVTA
Sbjct: 121 NGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVTA 166


>UniRef100_Q9SC36 Putative cathepsin B-like protease [Pisum sativum]
          Length = 206

 Score =  304 bits (779), Expect = 2e-81
 Identities = 140/176 (79%), Positives = 151/176 (85%), Gaps = 10/176 (5%)

Query: 19  ILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSL 78
           +LQESIAK++NENP AGW+AAINPRFSN TVGQFKRLLGVKQ P+ EL S PVVTHPKSL
Sbjct: 41  LLQESIAKEVNENPGAGWKAAINPRFSNSTVGQFKRLLGVKQTPRNELSSIPVVTHPKSL 100

Query: 79  KLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMN 138
            LPKEFDARTAW QCSTIG+IL           QGHCGSCWAFGAVESL DRFCIHF ++
Sbjct: 101 NLPKEFDARTAWPQCSTIGRILD----------QGHCGSCWAFGAVESLSDRFCIHFGVD 150

Query: 139 ISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGC 194
           + LSVNDLLACCGFLCG+GCDGG PI AW+Y AHHGVVTEECDPYFDQIGCSHPGC
Sbjct: 151 VPLSVNDLLACCGFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGC 206


>UniRef100_UPI000036E1D7 UPI000036E1D7 UniRef100 entry
          Length = 339

 Score =  277 bits (708), Expect = 3e-73
 Identities = 143/330 (43%), Positives = 195/330 (58%), Gaps = 40/330 (12%)

Query: 18  HILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRL----LGVKQAPKKELLSTPVVT 73
           H L + +   +N+     W+A  N  F N  +   KRL    LG  + P++ + +     
Sbjct: 24  HPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGAFLGGPKPPQRVMFT----- 75

Query: 74  HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
             + LKLP+ FDAR  W QC TI +I            QG CGSCWAFGAVE++ DR CI
Sbjct: 76  --EDLKLPESFDAREQWPQCPTIKEIRD----------QGSCGSCWAFGAVEAISDRICI 123

Query: 134 HFDMNISLSVN--DLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEE-------CDPYF 184
           H + ++S+ V+  DLL CCG +CG GC+GG P  AW +    G+V+         C PY 
Sbjct: 124 HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYS 183

Query: 185 -----DQIGCSHPGCEPAYQTPKCVRKCVKG-NQIWKRSKHYSVKAYRVKSDPQDIMAEV 238
                  +  S P C     TPKC + C  G +  +K+ KHY   +Y V +  +DIMAE+
Sbjct: 184 IPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 243

Query: 239 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNT 298
           YKNGPVE AF+V+ DF  YKSGVY+H+TG  +GGHA++++GWG  + G  YWL+AN WNT
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNT 302

Query: 299 NWGDDGYFKIKRGTNECGIEDDVTAGLPST 328
           +WGD+G+FKI RG + CGIE +V AG+P T
Sbjct: 303 DWGDNGFFKILRGQDHCGIESEVVAGIPRT 332


>UniRef100_Q5R6D1 Hypothetical protein DKFZp459D187 [Pongo pygmaeus]
          Length = 339

 Score =  277 bits (708), Expect = 3e-73
 Identities = 143/330 (43%), Positives = 195/330 (58%), Gaps = 40/330 (12%)

Query: 18  HILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRL----LGVKQAPKKELLSTPVVT 73
           H L + +   +N+     W+A  N  F N  V   K+L    LG  + P++ + +     
Sbjct: 24  HPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGTFLGGPKPPQRVMFT----- 75

Query: 74  HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
             + LKLP+ FDAR  W QC TI +I            QG CGSCWAFGAVE++ DR CI
Sbjct: 76  --EDLKLPESFDAREQWPQCPTIKEIRD----------QGSCGSCWAFGAVEAISDRICI 123

Query: 134 HFDMNISLSVN--DLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEE-------CDPYF 184
           H + ++S+ V+  DLL CCG +CG GC+GG P  AW +    G+V+         C PY 
Sbjct: 124 HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYS 183

Query: 185 -----DQIGCSHPGCEPAYQTPKCVRKCVKG-NQIWKRSKHYSVKAYRVKSDPQDIMAEV 238
                  +  S P C     TPKC + C  G +  +K+ KHY   +Y V +  +DIMAE+
Sbjct: 184 IPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSERDIMAEI 243

Query: 239 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNT 298
           YKNGPVE AF+V+ DF  YKSGVY+H+TG  +GGHA++++GWG  + G  YWL+AN WNT
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNT 302

Query: 299 NWGDDGYFKIKRGTNECGIEDDVTAGLPST 328
           +WGD+G+FKI RG + CGIE +V AG+P T
Sbjct: 303 DWGDNGFFKILRGQDHCGIESEVVAGIPRT 332


>UniRef100_P07858 Cathepsin B precursor [Homo sapiens]
          Length = 339

 Score =  275 bits (703), Expect = 1e-72
 Identities = 142/330 (43%), Positives = 194/330 (58%), Gaps = 40/330 (12%)

Query: 18  HILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRL----LGVKQAPKKELLSTPVVT 73
           H + + +   +N+     W+A  N  F N  +   KRL    LG  + P++ + +     
Sbjct: 24  HPVSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGTFLGGPKPPQRVMFT----- 75

Query: 74  HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCI 133
             + LKLP  FDAR  W QC TI +I            QG CGSCWAFGAVE++ DR CI
Sbjct: 76  --EDLKLPASFDAREQWPQCPTIKEIRD----------QGSCGSCWAFGAVEAISDRICI 123

Query: 134 HFDMNISLSVN--DLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEE-------CDPYF 184
           H + ++S+ V+  DLL CCG +CG GC+GG P  AW +    G+V+         C PY 
Sbjct: 124 HTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYS 183

Query: 185 -----DQIGCSHPGCEPAYQTPKCVRKCVKG-NQIWKRSKHYSVKAYRVKSDPQDIMAEV 238
                  +  S P C     TPKC + C  G +  +K+ KHY   +Y V +  +DIMAE+
Sbjct: 184 IPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 243

Query: 239 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNT 298
           YKNGPVE AF+V+ DF  YKSGVY+H+TG  +GGHA++++GWG  + G  YWL+AN WNT
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNT 302

Query: 299 NWGDDGYFKIKRGTNECGIEDDVTAGLPST 328
           +WGD+G+FKI RG + CGIE +V AG+P T
Sbjct: 303 DWGDNGFFKILRGQDHCGIESEVVAGIPRT 332


>UniRef100_P00787 Cathepsin B precursor [Rattus norvegicus]
          Length = 339

 Score =  273 bits (697), Expect = 7e-72
 Identities = 145/337 (43%), Positives = 189/337 (56%), Gaps = 44/337 (13%)

Query: 14  KLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRL----LGVKQAPKKELLST 69
           K +SH L + +   IN+     W+A  N  F N  +   K+L    LG    P++     
Sbjct: 20  KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPNLPER----- 71

Query: 70  PVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQD 129
             V   + + LP+ FDAR  WS C TI +I            QG CGSCWAFGAVE++ D
Sbjct: 72  --VGFSEDINLPESFDAREQWSNCPTIAQIRD----------QGSCGSCWAFGAVEAMSD 119

Query: 130 RFCIHFD--MNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQI 187
           R CIH +  +N+ +S  DLL CCG  CG GC+GG P  AW +    G+V+     Y   I
Sbjct: 120 RICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGV--YNSHI 177

Query: 188 GC--------------SHPGCEPAYQTPKCVRKCVKG-NQIWKRSKHYSVKAYRVKSDPQ 232
           GC              S P C     TPKC + C  G +  +K  KHY   +Y V    +
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEK 237

Query: 233 DIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLL 292
           +IMAE+YKNGPVE AFTVF DF  YKSGVYKH  G  +GGHA++++GWG  + G  YWL+
Sbjct: 238 EIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLV 296

Query: 293 ANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTK 329
           AN WN +WGD+G+FKI RG N CGIE ++ AG+P T+
Sbjct: 297 ANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPRTQ 333


>UniRef100_Q6IN22 Cathepsin B, preproprotein [Rattus norvegicus]
          Length = 339

 Score =  271 bits (692), Expect = 3e-71
 Identities = 145/334 (43%), Positives = 187/334 (55%), Gaps = 38/334 (11%)

Query: 14  KLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLG-VKQAPKKELLSTPVV 72
           K + H L + +   IN+     W+A  N  F N  +   K+L G V   PK        V
Sbjct: 20  KPSFHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLP----ERV 72

Query: 73  THPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFC 132
              + + LP+ FDAR  WS C TI +I            QG CGSCWAFGAVE++ DR C
Sbjct: 73  GFSEDINLPESFDAREQWSNCPTIAQIRD----------QGSCGSCWAFGAVEAMSDRIC 122

Query: 133 IHFD--MNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC- 189
           IH +  +N+ +S  DLL CCG  CG GC+GG P  AW +    G+V+     Y   IGC 
Sbjct: 123 IHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGV--YNSHIGCL 180

Query: 190 -------------SHPGCEPAYQTPKCVRKCVKG-NQIWKRSKHYSVKAYRVKSDPQDIM 235
                        S P C     TPKC + C  G +  +K  KHY   +Y V    ++IM
Sbjct: 181 PYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIM 240

Query: 236 AEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQ 295
           AE+YKNGPVE AFTVF DF  YKSGVYKH  G  +GGHA++++GWG  + G  YWL+AN 
Sbjct: 241 AEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANS 299

Query: 296 WNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTK 329
           WN +WGD+G+FKI RG N CGIE ++ AG+P T+
Sbjct: 300 WNVDWGDNGFFKILRGENHCGIESEIVAGIPRTQ 333


>UniRef100_P10605 Cathepsin B precursor [Mus musculus]
          Length = 339

 Score =  267 bits (682), Expect = 4e-70
 Identities = 143/333 (42%), Positives = 187/333 (55%), Gaps = 38/333 (11%)

Query: 14  KLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLG-VKQAPKKELLSTPVV 72
           K + H L + +   IN+     W+A  N  F N  +   K+L G V   PK        V
Sbjct: 20  KPSFHPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLP----GRV 72

Query: 73  THPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFC 132
              + + LP+ FDAR  WS C TIG+I            QG CGSCWAFGAVE++ DR C
Sbjct: 73  AFGEDIDLPETFDAREQWSNCPTIGQIRD----------QGSCGSCWAFGAVEAISDRTC 122

Query: 133 IHFD--MNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC- 189
           IH +  +N+ +S  DLL CCG  CG GC+GG P  AW +    G+V+     Y   +GC 
Sbjct: 123 IHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCL 180

Query: 190 -------------SHPGCEPAYQTPKCVRKCVKG-NQIWKRSKHYSVKAYRVKSDPQDIM 235
                        S P C     TP+C + C  G +  +K  KH+   +Y V +  ++IM
Sbjct: 181 PYTIPPCEHHVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIM 240

Query: 236 AEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQ 295
           AE+YKNGPVE AFTVF DF  YKSGVYKH  G  +GGHA++++GWG  + G  YWL AN 
Sbjct: 241 AEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANS 299

Query: 296 WNTNWGDDGYFKIKRGTNECGIEDDVTAGLPST 328
           WN +WGD+G+FKI RG N CGIE ++ AG+P T
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIESEIVAGIPRT 332


>UniRef100_Q95PM1 Cathepsin B endopeptidase precursor [Schistosoma mansoni]
          Length = 347

 Score =  265 bits (678), Expect = 1e-69
 Identities = 144/319 (45%), Positives = 184/319 (57%), Gaps = 32/319 (10%)

Query: 28  INENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKLPKEFDAR 87
           IN      W+AA   RF   TV   +R+LG    P  E L T + T   S +LPK FDAR
Sbjct: 45  INYEANTTWKAAPTTRFR--TVSDIRRMLGALPDPNGEQLET-LCTGYISDELPKSFDAR 101

Query: 88  TAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMNIS--LSVND 145
             W  C +I +I            Q  CGSCWAFGAVE++ DR CI         LS  +
Sbjct: 102 VEWPHCPSISEIRD----------QSSCGSCWAFGAVEAMSDRICIKSKGKHKPFLSAEN 151

Query: 146 LLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEE-------CDPYFDQIGCSH------P 192
           L++CC   CG GC+GG P  AW Y  + G+VT +       C PY +   C H      P
Sbjct: 152 LVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHVIGPLP 209

Query: 193 GCEPAYQTPKCVRKCVKGNQI-WKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVF 251
            C+   +TP C   C  G  I +++ K Y  K YR+ S+P+ IM E+ +NGPVEV F V+
Sbjct: 210 SCDGDVETPSCKTNCQPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVEVDFEVY 269

Query: 252 EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRG 311
            DF +YKSGVY+H++G+ LGGHAV+L+GWG  +    YWL+AN WN++WGD GYFKI RG
Sbjct: 270 ADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNSDWGDKGYFKIVRG 328

Query: 312 TNECGIEDDVTAGLPSTKN 330
            NECGIE DV AG+P  KN
Sbjct: 329 KNECGIESDVNAGIPKIKN 347


  Database: uniref100
    Posted date:  Jan 5, 2005  1:24 AM
  Number of letters in database: 848,049,833
  Number of sequences in database:  2,790,947
  
Lambda     K      H
   0.319    0.137    0.440 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 643,976,089
Number of Sequences: 2790947
Number of extensions: 28875419
Number of successful extensions: 61440
Number of sequences better than 10.0: 1469
Number of HSP's better than 10.0 without gapping: 1332
Number of HSP's successfully gapped in prelim test: 137
Number of HSP's that attempted gapping in prelim test: 57274
Number of HSP's gapped (non-prelim): 1826
length of query: 345
length of database: 848,049,833
effective HSP length: 128
effective length of query: 217
effective length of database: 490,808,617
effective search space: 106505469889
effective search space used: 106505469889
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)


Medicago: description of AC149038.5