
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149038.5 + phase: 0 /partial
(345 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC85741 similar to GP|14532526|gb|AAK63991.1 At1g02300/T6A9_10 {... 703 0.0
BF636528 similar to GP|14532526|gb At1g02300/T6A9_10 {Arabidopsi... 402 e-112
BQ079377 similar to GP|6562770|emb| putative cathepsin B-like pr... 168 3e-42
TC85449 homologue to PIR|S71923|S71923 cysteine proteinase (EC 3... 127 6e-30
TC89773 similar to PIR|S51817|S47312 cysteine proteinase (EC 3.4... 104 6e-23
TC78139 similar to PIR|S22502|S22502 cysteine proteinase (EC 3.4... 99 2e-21
TC85911 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 98 4e-21
TC76560 homologue to SP|P25804|CYSP_PEA Cysteine proteinase 15A ... 97 7e-21
TC85913 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 95 3e-20
TC85914 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 94 8e-20
TC76652 similar to EGAD|143257|152780 thiolprotease {Pisum sativ... 86 3e-17
TC86539 similar to GP|22759715|dbj|BAC10906. cysteine proteinase... 82 3e-16
TC85915 homologue to GP|10336513|dbj|BAB13759. cysteine proteina... 82 4e-16
TC92862 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 71 5e-13
AJ497905 similar to SP|Q26636|CATL_ Cathepsin L precursor (EC 3.... 71 7e-13
BE999342 similar to GP|10336513|dbj cysteine proteinase {Astraga... 65 3e-11
TC87982 similar to PIR|T12041|T12041 cysteine proteinase (EC 3.4... 60 1e-09
CB893145 similar to GP|7381223|gb| papain-like cysteine proteina... 55 4e-08
TC89224 similar to GP|7381221|gb|AAF61441.1| papain-like cystein... 52 3e-07
TC81254 similar to GP|600111|emb|CAA84378.1| cysteine proteinase... 52 3e-07
>TC85741 similar to GP|14532526|gb|AAK63991.1 At1g02300/T6A9_10 {Arabidopsis
thaliana}, partial (85%)
Length = 1459
Score = 703 bits (1815), Expect = 0.0
Identities = 334/345 (96%), Positives = 334/345 (96%)
Frame = +1
Query: 1 IGDAETDEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQ 60
IGDAETDEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQ
Sbjct: 154 IGDAETDEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQ 333
Query: 61 APKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWA 120
APKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKIL QGHCGSCWA
Sbjct: 334 APKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILD----------QGHCGSCWA 483
Query: 121 FGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEEC 180
FGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEEC
Sbjct: 484 FGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEEC 663
Query: 181 DPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYK 240
DPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYK
Sbjct: 664 DPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYK 843
Query: 241 NGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNW 300
NGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNW
Sbjct: 844 NGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNW 1023
Query: 301 GDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDVDAGVS 345
GDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDVDAGVS
Sbjct: 1024 GDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDVDAGVS 1158
>BF636528 similar to GP|14532526|gb At1g02300/T6A9_10 {Arabidopsis thaliana},
partial (59%)
Length = 656
Score = 402 bits (1033), Expect = e-112
Identities = 185/228 (81%), Positives = 195/228 (85%)
Frame = +2
Query: 59 KQAPKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSC 118
KQ P+ EL S PVVTHPKSLKLPK+FDARTAWSQCSTIG+IL QGHCGSC
Sbjct: 2 KQTPRSELSSAPVVTHPKSLKLPKDFDARTAWSQCSTIGRILD----------QGHCGSC 151
Query: 119 WAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTE 178
WAFGAVESL DRFCIHFDMN+SLSVND+LACCG LCGAGC GGTP AW YLAHHGVVTE
Sbjct: 152 WAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLAHHGVVTE 331
Query: 179 ECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEV 238
ECDPYFDQIGCSHPGCEP Y+TPKCV+KCV GNQ+W+ SKHYSVKAY V SDPQDIMAEV
Sbjct: 332 ECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNSDPQDIMAEV 511
Query: 239 YKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEG 286
YKNGPVEVAFTV+ DFAHYKS VYKHITG AL GHA KL GWGTS EG
Sbjct: 512 YKNGPVEVAFTVYXDFAHYKSXVYKHITGFALXGHAXKLXGWGTSHEG 655
>BQ079377 similar to GP|6562770|emb| putative cathepsin B-like protease
{Pisum sativum}, partial (28%)
Length = 476
Score = 168 bits (425), Expect = 3e-42
Identities = 97/134 (72%), Positives = 104/134 (77%), Gaps = 5/134 (3%)
Frame = +2
Query: 1 IGDAETDEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQ 60
IGDAETDEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQ
Sbjct: 65 IGDAETDEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQ 244
Query: 61 APKKELLSTPVVTHPKSLKL-PKEFDARTAWSQC--STIGKILGSNLILMLMMIQGHCGS 117
APKKELLSTPVVTHPKSLK+ + FDA+ ++ Q +GKIL I G G
Sbjct: 245 APKKELLSTPVVTHPKSLKIAQRNFDAKDSFGQQW*PLLGKIL---------RIXGPWGV 397
Query: 118 C-WAFGA-VESLQD 129
WAF VES Q+
Sbjct: 398 LGWAFXXPVESFQE 439
>TC85449 homologue to PIR|S71923|S71923 cysteine proteinase (EC 3.4.22.-) -
garden pea, complete
Length = 1534
Score = 127 bits (319), Expect = 6e-30
Identities = 99/317 (31%), Positives = 137/317 (42%), Gaps = 6/317 (1%)
Frame = +3
Query: 7 DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFK-RLLGVKQAPKKE 65
DE K+ S LQ + K N+ G+ +N F+++T +F+ LG Q
Sbjct: 264 DEMKRRFKIFSENLQ--LIKSTNKK-RLGYTLGVN-HFADWTWEEFRSHRLGAAQNCSAT 431
Query: 66 LLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCWAFGAVE 125
L +T + LP E D W + + ++ QGHCGSCW F
Sbjct: 432 LKGNHRIT---DVVLPAEKD----WRKEGIVSEVKD----------QGHCGSCWTFSTTG 560
Query: 126 SLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHG-VVTEECDPYF 184
+L+ + F NISLS L+ C G GC+GG P A+ Y+ ++G + TEE PY
Sbjct: 561 ALESAYAQAFGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYT 740
Query: 185 DQIG-CSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGP 243
Q G C A Q V + K + ++ P
Sbjct: 741 GQNGLCKFTSENVAVQVLGSVNITLGAEDELKHAVAFA--------------------RP 860
Query: 244 VEVAFTVFEDFAHYKSGVYKHITGSALG---GHAVKLIGWGTSDEGEDYWLLANQWNTNW 300
V VAF V +DF YK GVY T + HAV +G+G D G YWL+ N W W
Sbjct: 861 VSVAFQVVDDFRLYKKGVYTSTTCGSTPMDVNHAVLAVGYGIED-GVPYWLIKNSWGGEW 1037
Query: 301 GDDGYFKIKRGTNECGI 317
GD GYFK++ G N CG+
Sbjct: 1038 GDHGYFKMEMGKNMCGV 1088
>TC89773 similar to PIR|S51817|S47312 cysteine proteinase (EC 3.4.22.-)
precursor - spring vetch, partial (87%)
Length = 1484
Score = 104 bits (259), Expect = 6e-23
Identities = 89/325 (27%), Positives = 143/325 (43%), Gaps = 9/325 (2%)
Frame = +2
Query: 2 GDAETDEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRL-LGVKQ 60
G E D++ K N + E A+ ++ +N +F++ T +++ + LG K
Sbjct: 332 GLGEKDQRFEIFKDNLGFIDEHNAQNYT------YKVGLN-KFADMTNEEYRNMYLGTKN 490
Query: 61 APKKELLSTPVVT-HPKSLKLPKEFDARTAWSQCSTIGKILGSNLILMLMMIQGHCGSCW 119
K+ ++ + T H + W + I QG CGSCW
Sbjct: 491 DAKRNVMKIKITTGHRYAFNSGDRLPVHVDWRSKGAVAHIKD----------QGSCGSCW 640
Query: 120 AFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEE 179
AF + +++ I +SLS +L+ C GC+GG YA+ ++ +G + E
Sbjct: 641 AFSTIATVEAINKIVTGKLVSLSEQELVDCDRAF-NEGCNGGLMDYAFEFIGENGGIDTE 817
Query: 180 CD-PYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEV 238
D PY G C+P + K V + G + V AY ++ + +
Sbjct: 818 QDYPYKGFEG----RCDPTRKNAKVVS--IDGYE--------DVPAYN-----ENALKKA 940
Query: 239 YKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWN 297
+ PV VA Y+SGV+ G+ L H V ++G+G S+ G DYWL+ N W
Sbjct: 941 VSHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLD-HGVVVVGYG-SENGVDYWLVRNSWG 1114
Query: 298 TNWGDDGYFKIKR-----GTNECGI 317
TNWG+DGYFK++R T +CGI
Sbjct: 1115 TNWGEDGYFKLERNVKKINTGKCGI 1189
>TC78139 similar to PIR|S22502|S22502 cysteine proteinase (EC 3.4.22.-) -
kidney bean, complete
Length = 1720
Score = 99.0 bits (245), Expect = 2e-21
Identities = 64/212 (30%), Positives = 98/212 (46%), Gaps = 6/212 (2%)
Frame = +3
Query: 112 QGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLA 171
QG CGSCWAF V +++ I + + LS +L+ C GC+GG YA+ Y+
Sbjct: 867 QGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCDNQE-NQGCNGGLMEYAFEYIK 1043
Query: 172 HHG-VVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSD 230
G + TE PY G C+ + V + G++ V ++
Sbjct: 1044 QKGGITTESYYPYTANDG----SCDATKENVPAVS--IDGHET-------------VPAN 1166
Query: 231 PQDIMAEVYKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDY 289
+D + + N PV VA DF Y GV+ G L H V ++G+GT+ +G +Y
Sbjct: 1167 DEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELN-HGVAIVGYGTTVDGTNY 1343
Query: 290 WLLANQWNTNWGDDGYFKIKRGTNE----CGI 317
W++ N W WG+ GY ++KR + CGI
Sbjct: 1344 WIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGI 1439
>TC85911 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (96%)
Length = 1315
Score = 98.2 bits (243), Expect = 4e-21
Identities = 64/213 (30%), Positives = 98/213 (45%), Gaps = 7/213 (3%)
Frame = +3
Query: 112 QGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRY-L 170
QG CG CWAF AV + + + +SLS +L+ C GC+GG A+++ +
Sbjct: 519 QGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFII 698
Query: 171 AHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYR-VKS 229
+HG+ TE PY G + T ++ Y V +
Sbjct: 699 QNHGLNTEAQYPYQGVDGTCNANKASIQAT--------------------TITGYEDVPA 818
Query: 230 DPQDIMAEVYKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGED 288
+ + + + N P+ VA DF YKSGV+ G+ L H V +G+G S++G
Sbjct: 819 NNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTEL-DHGVTAVGYGVSNDGTK 995
Query: 289 YWLLANQWNTNWGDDGYFKIKRGTNE----CGI 317
YWL+ N W T+WG++GY ++RG CGI
Sbjct: 996 YWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGI 1094
>TC76560 homologue to SP|P25804|CYSP_PEA Cysteine proteinase 15A precursor
(EC 3.4.22.-) (Turgor-responsive protein 15A). [Garden
pea], partial (98%)
Length = 1642
Score = 97.4 bits (241), Expect = 7e-21
Identities = 81/295 (27%), Positives = 128/295 (42%), Gaps = 15/295 (5%)
Frame = +3
Query: 43 RFSNFTVGQFKR-LLGVKQAPKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILG 101
+FS+ T +F+R LG+ + + + P + LP++FD W + + +
Sbjct: 450 KFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTN-NLPEDFD----WREKGAVTPVKD 614
Query: 102 SNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFL-------C 154
QG CGSCWAF +L+ + SLS L+ C C
Sbjct: 615 ----------QGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGSC 764
Query: 155 GAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIW 214
+GC+GG A+ Y+ G V E D ++ G + + + K +++
Sbjct: 765 DSGCNGGLMNNAFEYILQSGGVVSEKD-------YAYTGRDGSCKFDK--------SKVV 899
Query: 215 KRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVY-KHITGSALGGH 273
++SV V D I A + KNGP+ VA Y SGV +I A H
Sbjct: 900 ASVSNFSV----VSLDEDQIAANLVKNGPLAVAINAAW-MQTYMSGVSCPYICAKARLDH 1064
Query: 274 AVKLIGWGTSD------EGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVT 322
V L+G+G + + YW++ N W NWG++GY+KI RG N CG++ V+
Sbjct: 1065 GVLLVGFGQGGYAPIRLKEKPYWIIKNSWGQNWGEEGYYKICRGRNVCGVDSMVS 1229
>TC85913 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (94%)
Length = 1258
Score = 95.1 bits (235), Expect = 3e-20
Identities = 65/214 (30%), Positives = 101/214 (46%), Gaps = 8/214 (3%)
Frame = +3
Query: 112 QGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLA 171
QG CG CWAF AV + + + +SLS +L+ C GC+GG A++++
Sbjct: 534 QGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFII 713
Query: 172 -HHGVVTEECDPYFDQIG-CSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYR-VK 228
+HG+ TE PY G CS TP ++ Y V
Sbjct: 714 QNHGLHTEAQYPYQGVDGTCS----ANETSTPAA-----------------TIAGYEDVP 830
Query: 229 SDPQDIMAEVYKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGE 287
++ ++ + + N P+ VA DF YKSGV+ G+ L H V +G+G S++G
Sbjct: 831 ANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTQLD-HGVTAVGYGISNDGT 1007
Query: 288 DYWLLANQWNTNWGDDGYFKIKRGTNE----CGI 317
YWL+ N W +WG++GY +++R + CGI
Sbjct: 1008 KYWLVKNSWGNDWGEEGYIRMQRSVDAAQGLCGI 1109
>TC85914 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (58%)
Length = 773
Score = 94.0 bits (232), Expect = 8e-20
Identities = 70/225 (31%), Positives = 106/225 (47%), Gaps = 12/225 (5%)
Frame = +3
Query: 115 CGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRY-LAHH 173
CG CWAF AV + + + +SLS +L+ C GC+GG A+++ + +H
Sbjct: 21 CGCCWAFSAVPAPEGIHKLSTGRLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH 200
Query: 174 GVVTEECDPYFDQIG-CSHPGCEPAYQTPKCVRKCVKGNQIWKRSKH-YSVKAYR-VKSD 230
G+ TE PY G CS K S H ++ Y V ++
Sbjct: 201 GLNTEAQYPYQGVDGTCSAN----------------------KASIHAVTITGYEDVPAN 314
Query: 231 PQDIMAEVYKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDY 289
+ + + N P+ VA DF YKSGV+ G+ L H V +G+G ++G Y
Sbjct: 315 NEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTEL-DHGVTAVGYGVGNDGTKY 491
Query: 290 WLLANQWNTNWGDDGYFKIKRGTNE----CGIEDDV---TAGLPS 327
WL+ N W T+WG++GY K++RG + CGI + TA LP+
Sbjct: 492 WLVKNLWGTDWGEEGYIKMQRGVDAAEGLCGIAMEASYPTA*LPN 626
>TC76652 similar to EGAD|143257|152780 thiolprotease {Pisum sativum},
partial (96%)
Length = 1818
Score = 85.5 bits (210), Expect = 3e-17
Identities = 65/216 (30%), Positives = 102/216 (47%), Gaps = 10/216 (4%)
Frame = +1
Query: 112 QGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLA 171
Q CGSCWAF A+ +++ I ISLS +L+ C GC+GG YA+ ++
Sbjct: 520 QASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDC-DTSYNEGCNGGLMDYAFEFII 696
Query: 172 HHGVVTEECD-PYFDQIGCSHPGCEPAYQTPKCVRKCVKG--NQIWKRSKHYSVKAYR-V 227
+G + E D PY K V G +Q K +K ++ Y V
Sbjct: 697 SNGGIDSEDDYPY----------------------KAVDGRCDQNRKNAKVVTIDDYEDV 810
Query: 228 KSDPQDIMAEVYKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEG 286
+ + + + N P+ VA +F Y+ GV+ G+AL H V +G+GT + G
Sbjct: 811 PAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALD-HGVAAVGYGT-ENG 984
Query: 287 EDYWLLANQWNTNWGDDGYFKIKRG-----TNECGI 317
+DYW++ N W +WG+ GY +++R +CGI
Sbjct: 985 KDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGI 1092
>TC86539 similar to GP|22759715|dbj|BAC10906. cysteine proteinase {Zinnia
elegans}, partial (88%)
Length = 1265
Score = 82.0 bits (201), Expect = 3e-16
Identities = 62/213 (29%), Positives = 96/213 (44%), Gaps = 7/213 (3%)
Frame = +2
Query: 112 QGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLA 171
QG CGSCWAF V +++ I SLS +L+ C GC+GG YA+ ++
Sbjct: 500 QGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDC-DTTYNNGCNGGLMDYAFSFIV 676
Query: 172 HHGVVTEECD-PYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYR-VKS 229
+G + +E D PY + CE + + V ++ Y V
Sbjct: 677 KNGGLHKEEDYPYIME----ESTCEMKKEVSEVV----------------TINGYHDVPQ 796
Query: 230 DPQDIMAEVYKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGED 288
+ + + + N P+ VA DF Y GV+ GS L H V +G+GTS +G D
Sbjct: 797 NNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSELD-HGVSAVGYGTS-KGLD 970
Query: 289 YWLLANQWNTNWGDDGYFKIKRGTNE----CGI 317
Y ++ N W WG+ G+ ++KR + CG+
Sbjct: 971 YIIVKNSWGAKWGEKGFIRMKRNIGKSEGICGL 1069
>TC85915 homologue to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (54%)
Length = 709
Score = 81.6 bits (200), Expect = 4e-16
Identities = 59/206 (28%), Positives = 94/206 (44%), Gaps = 8/206 (3%)
Frame = +1
Query: 120 AFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLA-HHGVVTE 178
AF AV + + + +SLS +L+ C GC+GG A++++ +HG+ TE
Sbjct: 1 AFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTE 180
Query: 179 ECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYS-VKAYR-VKSDPQDIMA 236
PY G + K S H + + Y V ++ + +
Sbjct: 181 AAYPYQGVDGTCNAN---------------------KASIHAATITGYEDVPANNEQALQ 297
Query: 237 EVYKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQ 295
+ N P+ VA DF YKSGV+ G+ L H V +G+G ++G YWL+ N
Sbjct: 298 KAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELD-HGVTAVGYGVGNDGTKYWLVKNS 474
Query: 296 WNTNWGDDGYFKIKRGTNE----CGI 317
W T+WG +GY +++RG + CGI
Sbjct: 475 WGTDWGQEGYIRMQRGMDAPEXLCGI 552
>TC92862 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (41%)
Length = 581
Score = 71.2 bits (173), Expect = 5e-13
Identities = 38/109 (34%), Positives = 59/109 (53%), Gaps = 8/109 (7%)
Frame = +2
Query: 227 VKSDPQDIMAEVYKNGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDE 285
V ++ + + + N P+ V DF YKSGV+ G+ L H V +G+G ++
Sbjct: 116 VPANNEQALQKAVANQPISVTIDASGSDFQFYKSGVFTGSCGTELD-HGVTAVGYGVGND 292
Query: 286 GEDYWLLANQWNTNWGDDGYFKIKRGTNE----CGIEDDV---TAGLPS 327
G YWL+ N W T+WG++GY K++RG + CGI + TA LP+
Sbjct: 293 GTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEASYPTA*LPN 439
>AJ497905 similar to SP|Q26636|CATL_ Cathepsin L precursor (EC 3.4.22.15).
[Flesh fly Boettcherisca peregrina] {Sarcophaga
peregrina}, partial (26%)
Length = 306
Score = 70.9 bits (172), Expect = 7e-13
Identities = 35/79 (44%), Positives = 46/79 (57%), Gaps = 3/79 (3%)
Frame = +1
Query: 242 GPVEVAFTV-FEDFAHYKSGVYKHITGSALG-GHAVKLIGWGTSDEGEDYWLLANQWNTN 299
GPV VA E F YK GVY S+ H V ++G+GT ++G DYW++ N W T+
Sbjct: 70 GPVSVAIDASHESFQFYKEGVYYEPECSSENLDHGVLVVGYGTDEDGNDYWIVKNSWCTS 249
Query: 300 WGDDGYFKIKRG-TNECGI 317
WG DG+ K+ R N CGI
Sbjct: 250 WGQDGFIKMARNRDNHCGI 306
>BE999342 similar to GP|10336513|dbj cysteine proteinase {Astragalus
sinicus}, partial (27%)
Length = 332
Score = 65.5 bits (158), Expect = 3e-11
Identities = 32/82 (39%), Positives = 45/82 (54%), Gaps = 5/82 (6%)
Frame = +1
Query: 241 NGPVEVAFTVF-EDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTN 299
N P+ V DF YKSG + G+ L H V +G+G S++G YWL+ N W T
Sbjct: 10 NQPISVFIDASGSDFRFYKSGGFTRSCGTELD-HGVTAVGYGVSNDGTKYWLVKNSWGTE 186
Query: 300 WGDDGYFKIKRGTNE----CGI 317
WG++GY ++RG + CGI
Sbjct: 187 WGEEGYIMMQRGVDAAEGLCGI 252
>TC87982 similar to PIR|T12041|T12041 cysteine proteinase (EC 3.4.22.-) 3
precursor - kidney bean, partial (93%)
Length = 1675
Score = 60.1 bits (144), Expect = 1e-09
Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 7/116 (6%)
Frame = +1
Query: 209 KGNQIWKRSKHYSVKAY-RVKSDPQDIMAEVYKNGPVEVAFTVF-EDFAHYKSGVYKHIT 266
K +Q K ++ S+ Y +V + + + + N P+ VA +F Y SG++
Sbjct: 826 KCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKC 1005
Query: 267 GSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRG-----TNECGI 317
G+AL H V +G+GT + G DYW++ N W +WG+ GY +++R +CGI
Sbjct: 1006 GTALD-HGVTAVGYGT-ENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGI 1167
Score = 42.7 bits (99), Expect = 2e-04
Identities = 23/55 (41%), Positives = 30/55 (53%)
Frame = +2
Query: 112 QGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYA 166
QG CGSCWAF + +++ I +SLS +L+ C AGCDGG YA
Sbjct: 593 QGSCGSCWAFSTIAAVEGINKIVTGELVSLSEQELVD-CDRTVNAGCDGGLMEYA 754
>CB893145 similar to GP|7381223|gb| papain-like cysteine proteinase isoform
III {Ipomoea batatas}, partial (46%)
Length = 782
Score = 55.1 bits (131), Expect = 4e-08
Identities = 29/102 (28%), Positives = 49/102 (47%), Gaps = 6/102 (5%)
Frame = +1
Query: 227 VKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTS--- 283
+ D I A + K+GP+ A Y G+ + V L+G+G+
Sbjct: 265 ISVDDNQITANLVKHGPLAAAINAVY-MQTYVGGISCPYICTRRLDLGVLLVGYGSGAGA 441
Query: 284 ---DEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVT 322
++ + YW++ N W WG++GY+KI RG N CG++ V+
Sbjct: 442 DMKEKEKPYWIVKNSWGETWGENGYYKICRGRNICGVDSMVS 567
>TC89224 similar to GP|7381221|gb|AAF61441.1| papain-like cysteine
proteinase isoform II {Ipomoea batatas}, partial (63%)
Length = 767
Score = 52.4 bits (124), Expect = 3e-07
Identities = 45/151 (29%), Positives = 69/151 (44%), Gaps = 10/151 (6%)
Frame = +1
Query: 43 RFSNFTVGQFKR-LLGVKQAP-KKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKIL 100
RFS+ T +F++ +LG++ K+ + P++ LPK+FD W + + +
Sbjct: 265 RFSDLTPREFRKSVLGLRGVGLPKDANAAPILPTDN---LPKDFD----WREKGAVTAVK 423
Query: 101 GSNLILMLMMIQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLAC---CG----FL 153
QG CGSCW+F +L+ + +SLS L+ C C
Sbjct: 424 N----------QGSCGSCWSFSTTGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEQPGS 573
Query: 154 CGAGCDGGTPIYAWRY-LAHHGVVTEECDPY 183
C AGC+GG A+ Y L GV+ EE PY
Sbjct: 574 CDAGCNGGLMNSAFEYILKSGGVMREEDYPY 666
>TC81254 similar to GP|600111|emb|CAA84378.1| cysteine proteinase {Vicia
sativa}, partial (61%)
Length = 692
Score = 52.4 bits (124), Expect = 3e-07
Identities = 26/72 (36%), Positives = 39/72 (54%)
Frame = +1
Query: 112 QGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLA 171
QG CGSCWAF + +++ I +SLS +L+ C GC+GG YA+ ++
Sbjct: 442 QGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVD-CDTEVNQGCNGGLMEYAFEFIK 618
Query: 172 HHGVVTEECDPY 183
+G+ TE PY
Sbjct: 619 QNGITTETNYPY 654
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.319 0.137 0.440
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,686,860
Number of Sequences: 36976
Number of extensions: 213231
Number of successful extensions: 1056
Number of sequences better than 10.0: 56
Number of HSP's better than 10.0 without gapping: 996
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1027
length of query: 345
length of database: 9,014,727
effective HSP length: 97
effective length of query: 248
effective length of database: 5,428,055
effective search space: 1346157640
effective search space used: 1346157640
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)
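The statistics above are linked by simple arithmetic, and a short sketch can make that explicit. The Python below is an illustration, not part of the BLAST output. It assumes the usual Karlin-Altschul relations: bit score = (lambda * raw - ln K) / ln 2, and E = search space * 2^(-bits). It also assumes that the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence. All variable and function names are hypothetical; the numbers are copied from this report.

```python
import math

# Values copied from the statistics footer of this report.
DB_LETTERS_NT = 27_044_181   # nucleotide letters in MTGI
DB_SEQUENCES = 36_976        # sequences in MTGI
QUERY_LEN = 345              # query length in amino acids
HSP_LEN = 97                 # "effective HSP length"
LAMBDA_GAPPED = 0.267        # gapped Lambda
K_GAPPED = 0.0410            # gapped K

# TBLASTN compares a protein query with translated nucleotides; the
# "length of database" line in this report equals the letter count / 3.
db_len = DB_LETTERS_NT // 3

# Assumed edge correction: remove the HSP length from the query once
# and from the database once per sequence.
eff_query = QUERY_LEN - HSP_LEN
eff_db = db_len - DB_SEQUENCES * HSP_LEN
search_space = eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits with the gapped parameters."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(bits):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


print(db_len, eff_query, eff_db, search_space)
print(round(bit_score(1815), 1), round(bit_score(1033), 1))
print(f"{expect(402):.1e}")
```

The integer results reproduce the footer lines "length of database", "effective length of query", "effective length of database" and "effective search space". The bit scores and E-value agree with the reported figures only approximately, because Lambda, K and the bit scores are printed in rounded form.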