
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0258b.5
(360 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC76560 homologue to SP|P25804|CYSP_PEA Cysteine proteinase 15A ... 503 e-143
TC89224 similar to GP|7381221|gb|AAF61441.1| papain-like cystein... 411 e-115
CB893145 similar to GP|7381223|gb| papain-like cysteine proteina... 290 5e-79
TC85913 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 208 2e-54
TC85911 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 207 7e-54
TC86539 similar to GP|22759715|dbj|BAC10906. cysteine proteinase... 204 6e-53
TC85449 homologue to PIR|S71923|S71923 cysteine proteinase (EC 3... 201 5e-52
TC76652 similar to EGAD|143257|152780 thiolprotease {Pisum sativ... 197 7e-51
TC89773 similar to PIR|S51817|S47312 cysteine proteinase (EC 3.4... 192 2e-49
BI312054 similar to PIR|S55923|S55 cysteine proteinase (EC 3.4.2... 192 2e-49
TC87304 similar to GP|13897890|gb|AAK48495.1 putative cysteine p... 188 3e-48
TC78139 similar to PIR|S22502|S22502 cysteine proteinase (EC 3.4... 182 2e-46
TC87982 similar to PIR|T12041|T12041 cysteine proteinase (EC 3.4... 113 9e-45
TC85914 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 154 7e-38
BG644861 similar to GP|1401242|gb| pre-pro-cysteine proteinase {... 152 1e-37
TC85912 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 149 2e-36
TC85915 homologue to GP|10336513|dbj|BAB13759. cysteine proteina... 140 6e-34
BG583083 similar to PIR|B84752|B847 probable cysteine proteinase... 137 5e-33
TC81601 similar to GP|13491750|gb|AAK27968.1 cysteine protease {... 135 3e-32
TC90718 similar to GP|13491750|gb|AAK27968.1 cysteine protease {... 125 3e-29
>TC76560 homologue to SP|P25804|CYSP_PEA Cysteine proteinase 15A precursor
(EC 3.4.22.-) (Turgor-responsive protein 15A). [Garden
pea], partial (98%)
Length = 1642
Score = 503 bits (1296), Expect = e-143
Identities = 246/360 (68%), Positives = 290/360 (80%), Gaps = 9/360 (2%)
Frame = +3
Query: 4 HRILFLMFVFFLFFSVVSS----DGGVDPLIRQVVDGEG---LGAEHHFLEFKRRFGKVY 56
HR L +F+F + ++ D LIRQVVD L AEHHF FK +F K Y
Sbjct: 165 HRFLIALFLFATVATAATTLSDDTNSDDLLIRQVVDTAEDHILNAEHHFTSFKSKFSKNY 344
Query: 57 VSEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPS 115
++EEH YRF VFKSN+ +A+ HQ LDPSA HG+T+FSDLT EFR LGL + + LP+
Sbjct: 345 ATKEEHDYRFGVFKSNLIKAKLHQKLDPSAQHGITKFSDLTASEFRRQFLGLNKRLRLPA 524
Query: 116 DADSAPILRTDNLPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSL 175
A APIL T+NLP+DFDWRE GAVTPVK+QGSCG+CW+FS TGALEGA++L+TGKL SL
Sbjct: 525 HAQKAPILPTNNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLTSL 704
Query: 176 SEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKF 235
SEQQLVDCDH CDPEE GSCDSGC GGLMN+AFEYIL +GGV+ E+DY Y+G G+CKF
Sbjct: 705 SEQQLVDCDHVCDPEERGSCDSGCNGGLMNNAFEYILQSGGVVSEKDYAYTG-RDGSCKF 881
Query: 236 DQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSK-KLN 294
D++K+ ASV+NFSVVS DEDQIAANLVKNGPLAVAINA +MQTY+ GVSCPY+C+K +L+
Sbjct: 882 DKSKVVASVSNFSVVSLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYICAKARLD 1061
Query: 295 HGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVSTVAA 354
HGVLLVG+G YAPIR+K+KPYWIIKNSWG+NWGE GYYKICRGRNVCGVDSMVSTVAA
Sbjct: 1062HGVLLVGFGQGGYAPIRLKEKPYWIIKNSWGQNWGEEGYYKICRGRNVCGVDSMVSTVAA 1241
>TC89224 similar to GP|7381221|gb|AAF61441.1| papain-like cysteine
proteinase isoform II {Ipomoea batatas}, partial (63%)
Length = 767
Score = 411 bits (1056), Expect = e-115
Identities = 202/255 (79%), Positives = 221/255 (86%), Gaps = 5/255 (1%)
Frame = +1
Query: 9 LMFVFFLFFSVVS---SDGGVDPLIRQVVDGEG--LGAEHHFLEFKRRFGKVYVSEEEHG 63
L+FV FSV + D G DP+IRQVVD EG LGAEHHF FK +FGKVY S++EH
Sbjct: 1 LLFVVLFIFSVSAFSTPDEGEDPIIRQVVDEEGVRLGAEHHFNLFKHKFGKVYSSKDEHD 180
Query: 64 YRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGLRGVGLPSDADSAPIL 123
YRF +FKSN++RA+RHQL+DPSAVHGVTRFSDLTP EFR SVLGLRGVGLP DA++APIL
Sbjct: 181 YRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSVLGLRGVGLPKDANAAPIL 360
Query: 124 RTDNLPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDC 183
TDNLPKDFDWRE GAVT VKNQGSCG+CWSFS TGALEGAHFLSTGKLVSLSEQQLVDC
Sbjct: 361 PTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGAHFLSTGKLVSLSEQQLVDC 540
Query: 184 DHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAAS 243
DHECDPE+ GSCD+GC GGLMNSAFEYIL +GGVMREEDYPYSGT G+CKFD+ KIAAS
Sbjct: 541 DHECDPEQPGSCDAGCNGGLMNSAFEYILKSGGVMREEDYPYSGTDRGSCKFDKKKIAAS 720
Query: 244 VANFSVVSRDEDQIA 258
VANFSVVS DEDQIA
Sbjct: 721 VANFSVVSLDEDQIA 765
>CB893145 similar to GP|7381223|gb| papain-like cysteine proteinase isoform
III {Ipomoea batatas}, partial (46%)
Length = 782
Score = 290 bits (743), Expect = 5e-79
Identities = 146/230 (63%), Positives = 169/230 (73%), Gaps = 1/230 (0%)
Frame = +1
Query: 130 KDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDP 189
+DFDWRE GAVTPV+NQG CG+ WSFS GALEGAHFLS+G+LVSLSEQ VDCDHE
Sbjct: 1 RDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGAHFLSSGELVSLSEQHHVDCDHE--- 171
Query: 190 EEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVA-NFS 248
YI GG+MR EDY Y +T A SVA NFS
Sbjct: 172 --------------------YIQKYGGLMRVEDYTYY----------KTNTARSVAANFS 261
Query: 249 VVSRDEDQIAANLVKNGPLAVAINAVYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYA 308
+S D++QI ANLVK+GPLA AINAVYMQTYVGG+SCPY+C+++L+ GVLLVGYGS + A
Sbjct: 262 SISVDDNQITANLVKHGPLAAAINAVYMQTYVGGISCPYICTRRLDLGVLLVGYGSGAGA 441
Query: 309 PIRMKQKPYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVSTVAALHTT 358
++ K+KPYWI+KNSWGE WGENGYYKICRGRN+CGVDSMVSTVAA HTT
Sbjct: 442 DMKEKEKPYWIVKNSWGETWGENGYYKICRGRNICGVDSMVSTVAAAHTT 591
>TC85913 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (94%)
Length = 1258
Score = 208 bits (530), Expect = 2e-54
Identities = 124/310 (40%), Positives = 171/310 (55%), Gaps = 11/310 (3%)
Frame = +3
Query: 52 FGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDPSAVH--GVTRFSDLTPMEFRHSVLGLR 109
+GKVY +E R +FK N++ + ++ G+ +F+D+T EF S +
Sbjct: 243 YGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFADITNEEFIASRNKFK 422
Query: 110 GVGLPSDADSAPILRTDN--LPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFL 167
G + S + +N +P DWR+ GAVTPVKNQG CG CW+FSA A EG H L
Sbjct: 423 G-HMCSSITKTSTFKYENASVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKL 599
Query: 168 STGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSG 227
STGKLVSLSEQ+LVDCD + D GC+GGLM+ AF++I+ N G+ E YPY G
Sbjct: 600 STGKLVSLSEQELVDCDTK-------GVDQGCEGGLMDDAFKFIIQNHGLHTEAQYPYQG 758
Query: 228 TAGGTCKFDQTKI-AASVANFSVVSRDEDQIAANLVKNGPLAVAINA--VYMQTYVGGVS 284
GTC ++T AA++A + V + + V N P++VAI+A Q Y GV
Sbjct: 759 -VDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISVAIDASGSDFQFYKSGVF 935
Query: 285 CPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGENGYYKICR----GR 340
C +L+HGV VGYG I YW++KNSWG +WGE GY ++ R +
Sbjct: 936 TG-SCGTQLDHGVTAVGYG------ISNDGTKYWLVKNSWGNDWGEEGYIRMQRSVDAAQ 1094
Query: 341 NVCGVDSMVS 350
+CG+ M S
Sbjct: 1095GLCGIAMMAS 1124
>TC85911 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (96%)
Length = 1315
Score = 207 bits (526), Expect = 7e-54
Identities = 125/311 (40%), Positives = 167/311 (53%), Gaps = 12/311 (3%)
Frame = +3
Query: 52 FGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDPSAVH--GVTRFSDLTPMEFRHSVLGLR 109
+GKVY +E RF +F NM D + + G+ +F+DLT EF S +
Sbjct: 225 YGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEEFVASRNKFK 404
Query: 110 GVGLPSDADSAPILRTDN---LPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHF 166
G + S + +N +P DWR+ GAVTPVKNQG CG CW+FSA A EG H
Sbjct: 405 G-HMCSSIIRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHK 581
Query: 167 LSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYS 226
LSTGKLVSLSEQ+LVDCD + D GC+GGLM+ AF++I+ N G+ E YPY
Sbjct: 582 LSTGKLVSLSEQELVDCDTK-------GVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQ 740
Query: 227 GTAGGTCKFDQTKI-AASVANFSVVSRDEDQIAANLVKNGPLAVAINA--VYMQTYVGGV 283
G GTC ++ I A ++ + V + +Q V N P++VAI+A Q Y GV
Sbjct: 741 G-VDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGV 917
Query: 284 SCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGENGYYKICRG---- 339
C +L+HGV VGYG + YW++KNSWG +WGE GY + RG
Sbjct: 918 FTG-SCGTELDHGVTAVGYG------VSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAA 1076
Query: 340 RNVCGVDSMVS 350
+CG+ S
Sbjct: 1077EGLCGIAMQAS 1109
>TC86539 similar to GP|22759715|dbj|BAC10906. cysteine proteinase {Zinnia
elegans}, partial (88%)
Length = 1265
Score = 204 bits (518), Expect = 6e-53
Identities = 123/316 (38%), Positives = 174/316 (54%), Gaps = 10/316 (3%)
Frame = +2
Query: 45 FLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHS 104
F + R GK+Y + EE RF VFK N+ + + G+ F+DL+ EF++
Sbjct: 191 FESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNK 370
Query: 105 VLGLRGVGLPSDADSAP---ILRTDNLPKDFDWREHGAVTPVKNQGSCGACWSFSATGAL 161
LGL+ V L +S+ R +LPK DWR+ GAVTPVKNQG CG+CW+FS A+
Sbjct: 371 YLGLK-VDLSQRRESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAV 547
Query: 162 EGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREE 221
EG + + TG L SLSEQ+L+DCD + ++GC GGLM+ AF +I+ NGG+ +EE
Sbjct: 548 EGINQIVTGNLTSLSEQELIDCD--------TTYNNGCNGGLMDYAFSFIVKNGGLHKEE 703
Query: 222 DYPYSGTAGGTCKF-DQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAV--YMQT 278
DYPY TC+ + ++ + V ++ +Q + N PL+VAI A Q
Sbjct: 704 DYPYI-MEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 880
Query: 279 YVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGENGYYK--- 335
Y GGV + C +L+HGV VGYG+ K Y I+KNSWG WGE G+ +
Sbjct: 881 YSGGVFDGH-CGSELDHGVSAVGYGTS-------KGLDYIIVKNSWGAKWGEKGFIRMKR 1036
Query: 336 -ICRGRNVCGVDSMVS 350
I + +CG+ M S
Sbjct: 1037NIGKSEGICGLYKMAS 1084
>TC85449 homologue to PIR|S71923|S71923 cysteine proteinase (EC 3.4.22.-) -
garden pea, complete
Length = 1534
Score = 201 bits (510), Expect = 5e-52
Identities = 133/355 (37%), Positives = 182/355 (50%), Gaps = 21/355 (5%)
Frame = +3
Query: 12 VFFLFFSVVSSDGGVD----PLIRQVVDGEG-----LGAEHH---FLEFKRRFGKVYVSE 59
+ +FF V ++ G+ IR V D E +G H F F R+GK Y +
Sbjct: 84 LLIVFFCVATAAAGLSFHDSNPIRMVSDMEEQLLQVIGESRHAVSFARFANRYGKRYDTV 263
Query: 60 EEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGLR---GVGLPSD 116
+E RF +F N+ + GV F+D T EFR LG L +
Sbjct: 264 DEMKRRFKIFSENLQLIKSTNKKRLGYTLGVNHFADWTWEEFRSHRLGAAQNCSATLKGN 443
Query: 117 ADSAPILRTDNLPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLS 176
++ LP + DWR+ G V+ VK+QG CG+CW+FS TGALE A+ + GK +SLS
Sbjct: 444 HRITDVV----LPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNISLS 611
Query: 177 EQQLVDCDHECDPEEAGSCDS-GCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKF 235
EQQLVDC AG+ ++ GC GGL + AFEYI NGG+ EE YPY+G G CKF
Sbjct: 612 EQQLVDC--------AGAYNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTG-QNGLCKF 764
Query: 236 DQTKIAASV-ANFSVVSRDEDQIAANLVKNGPLAVAINAV-YMQTYVGGVSCPYVCSK-- 291
+A V + ++ ED++ + P++VA V + Y GV C
Sbjct: 765 TSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFQVVDDFRLYKKGVYTSTTCGSTP 944
Query: 292 -KLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGENGYYKICRGRNVCGV 345
+NH VL VGYG E PYW+IKNSWG WG++GY+K+ G+N+CGV
Sbjct: 945 MDVNHAVLAVGYGIEDGV-------PYWLIKNSWGGEWGDHGYFKMEMGKNMCGV 1088
>TC76652 similar to EGAD|143257|152780 thiolprotease {Pisum sativum},
partial (96%)
Length = 1818
Score = 197 bits (500), Expect = 7e-51
Identities = 117/302 (38%), Positives = 165/302 (53%), Gaps = 10/302 (3%)
Frame = +1
Query: 47 EFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVL 106
E+ + GK Y E RF +FK N+ H L+ + G+TRF+DLT E+R L
Sbjct: 202 EWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFL 381
Query: 107 G--------LRGVGLPSDADSAPILRTDNLPKDFDWREHGAVTPVKNQGSCGACWSFSAT 158
G ++ +G AP + D LP+ DWR+ GAV VK+Q SCG+CW+FSA
Sbjct: 382 GTKIDPNRRMKKLGGSKSNRYAPRVG-DKLPESVDWRKEGAVVGVKDQASCGSCWAFSAI 558
Query: 159 GALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVM 218
A+EG + + TG L+SLSEQ+LVDCD S + GC GGLM+ AFE+I++NGG+
Sbjct: 559 AAVEGINKIVTGDLISLSEQELVDCD--------TSYNEGCNGGLMDYAFEFIISNGGID 714
Query: 219 REEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAI--NAVYM 276
E+DYPY G + + ++ ++ V ++ V N P+AVA+
Sbjct: 715 SEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREF 894
Query: 277 QTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGENGYYKI 336
Q Y GV C L+HGV VGYG+E+ K YWI++NSWG +WGE GY ++
Sbjct: 895 QLYEYGVFTGR-CGTALDHGVAAVGYGTEN-------GKDYWIVRNSWGGSWGEQGYIRL 1050
Query: 337 CR 338
R
Sbjct: 1051ER 1056
>TC89773 similar to PIR|S51817|S47312 cysteine proteinase (EC 3.4.22.-)
precursor - spring vetch, partial (87%)
Length = 1484
Score = 192 bits (487), Expect = 2e-49
Identities = 117/333 (35%), Positives = 171/333 (51%), Gaps = 9/333 (2%)
Frame = +2
Query: 15 LFFSVVSSDGGVDPLIRQVVDGEGLGAEHHFLEFKRRFGKVYVSEEEHGYRFNVFKSNMH 74
LFFS+++ +D +R + + + E+ + KVY E RF +FK N+
Sbjct: 218 LFFSLITLSLAMDTSMRSNEEVMTM-----YEEWLVKHHKVYNGLGEKDQRFEIFKDNLG 382
Query: 75 RARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGLRGVGLPS-------DADSAPILRTDN 127
H + + G+ +F+D+T E+R+ LG + + D
Sbjct: 383 FIDEHNAQNYTYKVGLNKFADMTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAFNSGDR 562
Query: 128 LPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHEC 187
LP DWR GAV +K+QGSCG+CW+FS +E + + TGKLVSLSEQ+LVDCD
Sbjct: 563 LPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDR-- 736
Query: 188 DPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVANF 247
+ + GC GGLM+ AFE+I NGG+ E+DYPY G G + S+ +
Sbjct: 737 ------AFNEGCNGGLMDYAFEFIGENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGY 898
Query: 248 SVVSRDEDQIAANLVKNGPLAVAINA--VYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSE 305
V + V + P++VAI A +Q Y GV C L+HGV++VGYGSE
Sbjct: 899 EDVPAYNENALKKAVSHQPVSVAIEAGGRALQLYQSGVFTGR-CGTNLDHGVVVVGYGSE 1075
Query: 306 SYAPIRMKQKPYWIIKNSWGENWGENGYYKICR 338
+ YW+++NSWG NWGE+GY+K+ R
Sbjct: 1076NGV-------DYWLVRNSWGTNWGEDGYFKLER 1153
>BI312054 similar to PIR|S55923|S55 cysteine proteinase (EC 3.4.22.-)
precursor - soybean, partial (39%)
Length = 447
Score = 192 bits (487), Expect = 2e-49
Identities = 89/146 (60%), Positives = 114/146 (77%)
Frame = +3
Query: 138 GAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDS 197
GAVT VK QG+CG+CW+FS TG++EGA+FL+TGKL+SLSEQQLVDCD +CD + SCD+
Sbjct: 12 GAVTGVKMQGTCGSCWAFSTTGSIEGANFLATGKLLSLSEQQLVDCDSKCDITDKTSCDN 191
Query: 198 GCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQI 257
GC GGLM +A+ Y+L GG+ E YPY+G G CKFD KIA + NF+ + DE+QI
Sbjct: 192 GCNGGLMTNAYNYLLEAGGLEEENAYPYTG-GKGECKFDPKKIAVKITNFTNIPVDENQI 368
Query: 258 AANLVKNGPLAVAINAVYMQTYVGGV 283
AA LV +GPLA+ +NAV+MQTY+GGV
Sbjct: 369 AAYLVNHGPLAMGVNAVFMQTYIGGV 446
>TC87304 similar to GP|13897890|gb|AAK48495.1 putative cysteine protease
{Ipomoea batatas}, partial (24%)
Length = 1325
Score = 188 bits (477), Expect = 3e-48
Identities = 119/324 (36%), Positives = 177/324 (53%), Gaps = 19/324 (5%)
Frame = +3
Query: 45 FLEFKRRFGKVYV-SEEEHGYRFNVFKSN------MHRARRHQLLDPSAVHGVTRFSDLT 97
F +K+ G+ Y SEEE+ RF +FK+N M+ R+ Q +++ +F+D++
Sbjct: 180 FQMWKKEHGRDYANSEEENAKRFEIFKTNFKYINEMNAKRKSQTQHRLSLN---KFADMS 350
Query: 98 PMEFRHSVLGLRGVGLPSDADSAPILRTD---NLPKDFDWREHGAVTPVKNQGSCGACWS 154
P EF + L + +PS+ D+A + D NLP DWRE GAVT V++QG C + W+
Sbjct: 351 PEEFSKTYLPKIEMQVPSNRDNAKLKDDDDCENLPTSVDWREKGAVTEVRDQGDCQSHWA 530
Query: 155 FSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNN 214
FS TGA+EG + + TG L++LS Q+LVDCD GC GG +AF Y++ N
Sbjct: 531 FSVTGAIEGLNKIVTGNLINLSAQELVDCD---------PASKGCAGGFYFNAFGYVIEN 683
Query: 215 GGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINAV 274
GG+ E +YPY GTCK + K+ S+ N V+ E+ + K P++V+++A
Sbjct: 684 GGIDTEANYPYL-AKNGTCKENANKV-VSIDNLLVLDGTEEALLCRTSKQ-PVSVSLDAT 854
Query: 275 YMQTYVGGVSCPYVC---SKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGEN 331
+Q Y GGV C S+ N L+VGY S + + YWI+KNSWG++WGE
Sbjct: 855 GLQFYAGGVYGGENCKKESRNANLVGLIVGYDS-------VNGEDYWIVKNSWGKDWGEK 1013
Query: 332 GYYKICRG------RNVCGVDSMV 349
GY I R VC +++ V
Sbjct: 1014GYLFIKRNVFEDWPFGVCAINAAV 1085
>TC78139 similar to PIR|S22502|S22502 cysteine proteinase (EC 3.4.22.-) -
kidney bean, complete
Length = 1720
Score = 182 bits (462), Expect = 2e-46
Identities = 107/293 (36%), Positives = 151/293 (51%), Gaps = 12/293 (4%)
Frame = +3
Query: 65 RFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGLRGV------GLPSDAD 118
RFNVFKSN+ +D + +F+D+T EF+ + G + G P +
Sbjct: 606 RFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSG 785
Query: 119 SAPILRTDNLPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQ 178
+ P DWR+ GAVT VK+QG CG+CW+FS A+EG + + T +LV LSEQ
Sbjct: 786 TFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 965
Query: 179 QLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQT 238
+L+DCD++ + GC GGLM AFEYI GG+ E YPY+ G +
Sbjct: 966 ELIDCDNQ--------ENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKEN 1121
Query: 239 KIAASVANFSVVSRDEDQIAANLVKNGPLAVAINA--VYMQTYVGGVSCPYVCSKKLNHG 296
A S+ V +++ V N P++VAI+A Q Y GV C K+LNHG
Sbjct: 1122VPAVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTG-DCGKELNHG 1298
Query: 297 VLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGENGYYKICRG----RNVCGV 345
V +VGYG+ + YWI++NSWG WGE GY ++ R +CG+
Sbjct: 1299VAIVGYGT------TVDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGI 1439
>TC87982 similar to PIR|T12041|T12041 cysteine proteinase (EC 3.4.22.-) 3
precursor - kidney bean, partial (93%)
Length = 1675
Score = 113 bits (282), Expect(2) = 9e-45
Identities = 76/213 (35%), Positives = 111/213 (51%), Gaps = 11/213 (5%)
Frame = +2
Query: 6 ILFLMFV--FFLFFSVVSSDGGVDPLIRQVVDGEGLGAEHHFLEFKRRFGKVY--VSEEE 61
I+F +F F L S++S D + D E ++ + E++ + GK+ + E
Sbjct: 149 IVFTLFTATFALDMSIISYDKTHSDKSSRRSDKE---VKNIYEEWRVKHGKLNNNIDGSE 319
Query: 62 HGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGLR--GVGLPSDADS 119
RF +FK N+ H + + G+ RF+DL+ E+R LG + +G+
Sbjct: 320 KDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMARTK 499
Query: 120 APILRT-----DNLPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVS 174
R D LPK DWR GAV VK+QGSCG+CW+FS A+EG + + TG+LVS
Sbjct: 500 TRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTGELVS 679
Query: 175 LSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSA 207
LSEQ+LVDCD + ++GC GGLM A
Sbjct: 680 LSEQELVDCDR--------TVNAGCDGGLMEYA 754
Score = 85.1 bits (209), Expect(2) = 9e-45
Identities = 46/127 (36%), Positives = 69/127 (54%), Gaps = 2/127 (1%)
Frame = +1
Query: 214 NGGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAINA 273
NGG+ +EDYPY G G ++ + S+ ++ V ++ V N P++VAI A
Sbjct: 775 NGGIDSDEDYPYRGVDGKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEA 954
Query: 274 V--YMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGEN 331
Q YV G+ C L+HGV VGYG+E+ YWI++NSWG++WGE+
Sbjct: 955 GGREFQLYVSGIFTGK-CGTALDHGVTAVGYGTENGVD-------YWIVRNSWGKSWGES 1110
Query: 332 GYYKICR 338
GY ++ R
Sbjct: 1111GYVRMER 1131
>TC85914 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (58%)
Length = 773
Score = 154 bits (388), Expect = 7e-38
Identities = 87/204 (42%), Positives = 116/204 (56%), Gaps = 7/204 (3%)
Frame = +3
Query: 149 CGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAF 208
CG CW+FSA A EG H LSTG+LVSLSEQ+LVDCD + D GC+GGLM+ AF
Sbjct: 21 CGCCWAFSAVPAPEGIHKLSTGRLVSLSEQELVDCDTK-------GVDQGCEGGLMDDAF 179
Query: 209 EYILNNGGVMREEDYPYSGTAGGTCKFDQTKI-AASVANFSVVSRDEDQIAANLVKNGPL 267
++I+ N G+ E YPY G GTC ++ I A ++ + V + +Q V N P+
Sbjct: 180 KFIIQNHGLNTEAQYPYQG-VDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPI 356
Query: 268 AVAINA--VYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWG 325
+VAI+A Q Y GV C +L+HGV VGYG + YW++KN WG
Sbjct: 357 SVAIDASGSDFQFYKSGVFTG-SCGTELDHGVTAVGYG------VGNDGTKYWLVKNLWG 515
Query: 326 ENWGENGYYKICRG----RNVCGV 345
+WGE GY K+ RG +CG+
Sbjct: 516 TDWGEEGYIKMQRGVDAAEGLCGI 587
>BG644861 similar to GP|1401242|gb| pre-pro-cysteine proteinase {Vicia faba},
partial (47%)
Length = 764
Score = 152 bits (385), Expect = 1e-37
Identities = 82/151 (54%), Positives = 101/151 (66%), Gaps = 8/151 (5%)
Frame = +1
Query: 4 HRILFLMFVFFLFFSVVSS----DGGVDPLIRQVVDGEG---LGAEHHFLEFKRRFGKVY 56
HR L +F+F + ++ D LIRQVVD L AEHHF FK +F K Y
Sbjct: 4 HRFLIALFLFATVATAATTLSDDTNSDDLLIRQVVDTAEDHILNAEHHFTSFKSKFSKNY 183
Query: 57 VSEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPS 115
++EEH YRF VFKSN+ +A+ HQ LDPSA HG+T+FSDLT EFR LGL + + LP+
Sbjct: 184 ATKEEHDYRFGVFKSNLIKAKLHQKLDPSAQHGITKFSDLTASEFRRQFLGLNKRLRLPA 363
Query: 116 DADSAPILRTDNLPKDFDWREHGAVTPVKNQ 146
A APIL T+NLP+DFDWRE GAVTPVK+Q
Sbjct: 364 HAQKAPILPTNNLPEDFDWREKGAVTPVKDQ 456
Score = 72.4 bits (176), Expect = 3e-13
Identities = 31/38 (81%), Positives = 36/38 (94%)
Frame = +2
Query: 147 GSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCD 184
GSCG+CW+FS TGALEGA++L+TGKL SLSEQQLVDCD
Sbjct: 650 GSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCD 763
>TC85912 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (60%)
Length = 718
Score = 149 bits (376), Expect = 2e-36
Identities = 82/192 (42%), Positives = 112/192 (57%), Gaps = 5/192 (2%)
Frame = +2
Query: 47 EFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDPSAVH--GVTRFSDLTPMEFRHS 104
++ ++GKVY +E RF +F N++ D + ++ GV +F+DLT EF S
Sbjct: 170 QWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADLTNDEFTSS 349
Query: 105 VLGLRGVGLPSDADSAPILRTDN---LPKDFDWREHGAVTPVKNQGSCGACWSFSATGAL 161
+G + S + +N +P DWR+ GAVTPVKNQG CG CW+FSA A
Sbjct: 350 RNKFKG-HMCSSITRTSTFKYENASAIPSSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAT 526
Query: 162 EGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREE 221
EG H LSTGKL+SLSEQ+LVDCD + D C+GGLM+ AF++I+ N G+ E
Sbjct: 527 EGIHKLSTGKLISLSEQELVDCDTK-------GVDQSCEGGLMDDAFKFIIQNHGLNTEA 685
Query: 222 DYPYSGTAGGTC 233
+YPY G GTC
Sbjct: 686 NYPYQG-VDGTC 718
>TC85915 homologue to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (54%)
Length = 709
Score = 140 bits (354), Expect = 6e-34
Identities = 84/206 (40%), Positives = 116/206 (55%), Gaps = 7/206 (3%)
Frame = +1
Query: 154 SFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILN 213
+FSA A EG LSTGKLVSLSEQ+LVDCD + D GC+GGLM+ AF++I+
Sbjct: 1 AFSAVAATEGITKLSTGKLVSLSEQELVDCDTK-------GVDQGCEGGLMDDAFKFIIQ 159
Query: 214 NGGVMREEDYPYSGTAGGTCKFDQTKI-AASVANFSVVSRDEDQIAANLVKNGPLAVAIN 272
N G+ E YPY G GTC ++ I AA++ + V + +Q V N P++VAI+
Sbjct: 160 NHGLSTEAAYPYQG-VDGTCNANKASIHAATITGYEDVPANNEQALQKAVANQPISVAID 336
Query: 273 A--VYMQTYVGGVSCPYVCSKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWGE 330
A Q Y GV C +L+HGV VGYG + YW++KNSWG +WG+
Sbjct: 337 ASGSDFQFYKSGVFSG-SCGTELDHGVTAVGYG------VGNDGTKYWLVKNSWGTDWGQ 495
Query: 331 NGYYKICRGRN----VCGVDSMVSTV 352
GY ++ RG + +CG+ T+
Sbjct: 496 EGYIRMQRGMDAPEXLCGIAMQAFTL 573
>BG583083 similar to PIR|B84752|B847 probable cysteine proteinase [imported]
- Arabidopsis thaliana, partial (27%)
Length = 740
Score = 137 bits (346), Expect = 5e-33
Identities = 82/198 (41%), Positives = 115/198 (57%), Gaps = 6/198 (3%)
Frame = +2
Query: 65 RFNVFKSNMHRARR-HQLLDPSAVHGVTRFSDLTPMEFRHSVLGLRGVGLPSDAD----S 119
R +FK+N+ + + S G+ ++SDLT EF S GL+ S + +
Sbjct: 140 RKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSSKMRSAA 319
Query: 120 APILRTDNLPKDFDWREHGAVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQ 179
P D++P +FDWR+ GAVT VK+QGSCG CW+FS A+EGA ++TG+L+SLSEQQ
Sbjct: 320 VPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGELISLSEQQ 499
Query: 180 LVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKF-DQT 238
LVDCD +SGC GG M+SAF+YI+ G++ E DYPY TC+ DQ
Sbjct: 500 LVDCDER---------NSGCHGGNMDSAFKYIIQK-GIVSEADYPYQ-EGSQTCQLNDQM 646
Query: 239 KIAASVANFSVVSRDEDQ 256
K A + N V +++Q
Sbjct: 647 KFEAQITNLLDVPANDEQ 700
>TC81601 similar to GP|13491750|gb|AAK27968.1 cysteine protease {Ipomoea
batatas}, partial (49%)
Length = 725
Score = 135 bits (339), Expect = 3e-32
Identities = 74/189 (39%), Positives = 99/189 (52%), Gaps = 1/189 (0%)
Frame = +1
Query: 37 EGLGAEHHFLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDPSAVH-GVTRFSD 95
E L + ++ GKVY E RF +FK N+ D V +D
Sbjct: 178 ESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLAD 357
Query: 96 LTPMEFRHSVLGLRGVGLPSDADSAPILRTDNLPKDFDWREHGAVTPVKNQGSCGACWSF 155
LT EF+ S G + + S +P DWR GAVTP+K+QG CG+CW+F
Sbjct: 358 LTLDEFKASRNGYKKIDREFTTTSFKYENVTAIPAAVDWRVKGAVTPIKDQGQCGSCWAF 537
Query: 156 SATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNG 215
S A EG + ++TGKLVSLSEQ+LVDCD + + D GC+GGLM FE+I+ NG
Sbjct: 538 STVAATEGINQITTGKLVSLSEQELVDCDTKGE-------DQGCEGGLMEDGFEFIIKNG 696
Query: 216 GVMREEDYP 224
G+ E +YP
Sbjct: 697 GITSETNYP 723
>TC90718 similar to GP|13491750|gb|AAK27968.1 cysteine protease {Ipomoea
batatas}, partial (44%)
Length = 796
Score = 125 bits (314), Expect = 3e-29
Identities = 77/205 (37%), Positives = 108/205 (52%), Gaps = 9/205 (4%)
Frame = +2
Query: 47 EFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDPSAVH--GVTRFSDLTPMEFRHS 104
++ ++ KVY +E R +F +N++ + ++ G+ +F+DLT EF S
Sbjct: 191 QWMSQYSKVYKDPQEREERHKIFTANVNYIEVFNNDANNKLYKLGINQFADLTNEEFIAS 370
Query: 105 VLGLRGVGLPSDADSAPIL--RTDNLPKDFDWREHGAVTPVKNQGSCGACWSFSATGALE 162
+G S A + +P DWR+ GAVTPVKNQG CG CW+FSA A E
Sbjct: 371 RNKFKGHMCSSIAKTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATE 550
Query: 163 GAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILNNGGVMRE-- 220
G LSTGKLVSLSE +LVDCD + D GC+GGLM+ AF++I+ N G+ E
Sbjct: 551 GITKLSTGKLVSLSEXELVDCDTK-------GVDQGCEGGLMDXAFKFIIQNLGLXTEAA 709
Query: 221 ---EDYPYSGTAGGTCKFDQTKIAA 242
Y + G C+ + KI A
Sbjct: 710 XPSXXYGHPCXXGSFCRXYRXKIPA 784
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.320 0.138 0.428
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,993,772
Number of Sequences: 36976
Number of extensions: 179384
Number of successful extensions: 1141
Number of sequences better than 10.0: 61
Number of HSP's better than 10.0 without gapping: 1058
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1068
length of query: 360
length of database: 9,014,727
effective HSP length: 97
effective length of query: 263
effective length of database: 5,428,055
effective search space: 1427578465
effective search space used: 1427578465
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (27.3 bits)
Lotus: description of TM0258b.5