
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0173b.5
(355 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value
TC78139 similar to PIR|S22502|S22502 cysteine proteinase (EC 3.4... 535 e-152
TC85911 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 355 2e-98
TC81254 similar to GP|600111|emb|CAA84378.1| cysteine proteinase... 347 5e-96
TC86539 similar to GP|22759715|dbj|BAC10906. cysteine proteinase... 346 9e-96
TC85913 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 345 2e-95
TC76652 similar to EGAD|143257|152780 thiolprotease {Pisum sativ... 330 5e-91
TC89773 similar to PIR|S51817|S47312 cysteine proteinase (EC 3.4... 325 1e-89
TC85914 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 269 1e-72
TC85915 homologue to GP|10336513|dbj|BAB13759. cysteine proteina... 246 1e-65
TC87304 similar to GP|13897890|gb|AAK48495.1 putative cysteine p... 223 9e-59
TC92862 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 205 2e-53
TC85449 homologue to PIR|S71923|S71923 cysteine proteinase (EC 3... 189 2e-48
TC76560 homologue to SP|P25804|CYSP_PEA Cysteine proteinase 15A ... 185 3e-47
TC81601 similar to GP|13491750|gb|AAK27968.1 cysteine protease {... 177 4e-45
TC85912 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 172 2e-43
TC87982 similar to PIR|T12041|T12041 cysteine proteinase (EC 3.4... 166 1e-41
AW775259 similar to GP|13491750|gb cysteine protease {Ipomoea ba... 165 2e-41
BI311691 similar to PIR|T03941|T039 cysteine proteinase (EC 3.4.... 158 3e-39
TC90718 similar to GP|13491750|gb|AAK27968.1 cysteine protease {... 153 9e-38
BG583083 similar to PIR|B84752|B847 probable cysteine proteinase... 149 2e-36
>TC78139 similar to PIR|S22502|S22502 cysteine proteinase (EC 3.4.22.-) -
kidney bean, complete
Length = 1720
Score = 535 bits (1378), Expect = e-152
Identities = 253/349 (72%), Positives = 293/349 (83%), Gaps = 1/349 (0%)
Frame = +3
Query: 1 MGMKNLFCVALFSLALFLSVAESFDYHEKDLASEEGLWDLYERWRSHHTVSRSLDEKHKR 60
M K L + L S+AL L V+ESFD+H+KD++S+E LWDLYERWRSHHTVSR+L+EK KR
Sbjct: 432 MTTKKLLLIVL-SIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVSRNLNEKQKR 608
Query: 61 FNVFKANVMHVHETNKLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNGS 120
FNVFK+NVMHVH TNK+DKPYKLKLNKFAD+TN+EFK+ YA SK++HHRMF G PR +G+
Sbjct: 609 FNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGT 788
Query: 121 FMYENVDSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQE 180
FMYEN P S DWR GAVT VKDQGQCGSCWAFS + AVEGINQI+T+ LV LSEQE
Sbjct: 789 FMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQE 968
Query: 181 LVDCDVEVNEGCNGGFMAKALEFIKEK-GITSESIYPYTATDGTCNTQKMVNEPAVSIDG 239
L+DCD + N+GCNGG M A E+IK+K GIT+ES YPYTA DG+C+ K N PAVSIDG
Sbjct: 969 LIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATK-ENVPAVSIDG 1145
Query: 240 YEIVPENNEVALLKAAANQPISVGIDAGGSDFQFYSEGVFTGDCGTELDHGVAIVGYGAA 299
+E VP N+E ALLKA ANQP+SV IDAGGSDFQFYSEGVFTGDCG EL+HGVAIVGYG
Sbjct: 1146 HETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT 1325
Query: 300 QDGTKYWIVKNSWGADWGEQGYIRMKRGISKKQGLCGIAMEASYPIKSS 348
DGT YWIV+NSWGA+WGEQGYIRMKR +S K+GLCGIAMEASYP+K+S
Sbjct: 1326 VDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPVKNS 1472
>TC85911 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (96%)
Length = 1315
Score = 355 bits (910), Expect = 2e-98
Identities = 185/351 (52%), Positives = 234/351 (65%), Gaps = 5/351 (1%)
Frame = +3
Query: 6 LFCVALFSLALFLSVAESFDYHEKDLASEEGLWDLYERWRSHH-TVSRSLDEKHKRFNVF 64
+FC+ L+++ + + HE+ +ERW +H+ V + E+ KRF +F
Sbjct: 132 VFCLGLWAIQVTSRTLQDGSMHER-----------HERWMNHYGKVYKDHQEREKRFKIF 278
Query: 65 KANVMHVHETNKLD--KPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNGSFM 122
N+ ++ N D + YKL +N+FADLTN EF + + +K H M I R +F
Sbjct: 279 TENMKYIEAFNNGDNNESYKLGINQFADLTNEEF--VASRNKFKGH-MCSSIIRTT-TFK 446
Query: 123 YENVDSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQELV 182
YENV ++P + DWR GAVTPVK+QGQCG CWAFSA+AA EGI+++ T LVSLSEQELV
Sbjct: 447 YENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELV 626
Query: 183 DCDVE-VNEGCNGGFMAKALEFI-KEKGITSESIYPYTATDGTCNTQKMVNEPAVSIDGY 240
DCD + V++GC GG M A +FI + G+ +E+ YPY DGTCN K + A +I GY
Sbjct: 627 DCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCNANK-ASIQATTITGY 803
Query: 241 EIVPENNEVALLKAAANQPISVGIDAGGSDFQFYSEGVFTGDCGTELDHGVAIVGYGAAQ 300
E VP NNE AL KA ANQPISV IDA GSDFQFY GVFTG CGTELDHGV VGYG +
Sbjct: 804 EDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSN 983
Query: 301 DGTKYWIVKNSWGADWGEQGYIRMKRGISKKQGLCGIAMEASYPIKSSPTL 351
DGTKYW+VKNSWG DWGE+GYI M+RG+ +GLCGIAM+ASYP P L
Sbjct: 984 DGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA*MPNL 1136
>TC81254 similar to GP|600111|emb|CAA84378.1| cysteine proteinase {Vicia
sativa}, partial (61%)
Length = 692
Score = 347 bits (889), Expect = 5e-96
Identities = 167/222 (75%), Positives = 185/222 (83%)
Frame = +1
Query: 1 MGMKNLFCVALFSLALFLSVAESFDYHEKDLASEEGLWDLYERWRSHHTVSRSLDEKHKR 60
M MK L V+L SLAL L +A+SFD+ E DLASE+ LWDLYERWRSHHTV+RSLDEK+ R
Sbjct: 7 MEMKKLLFVSL-SLALVLGIAKSFDFEENDLASEKSLWDLYERWRSHHTVTRSLDEKNNR 183
Query: 61 FNVFKANVMHVHETNKLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNGS 120
FNVFKANVMHVH TNKLDKPYKLKLNKFAD+TNYEF+SIYA SK++HHRMF G+ NG
Sbjct: 184 FNVFKANVMHVHNTNKLDKPYKLKLNKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNGP 363
Query: 121 FMYENVDSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQE 180
FMYENV+ VP S DWR GAVT VKDQGQCGSCWAFS I AVEGINQI+T LVSLSEQE
Sbjct: 364 FMYENVEGVPSSIDWRKIGAVTGVKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQE 543
Query: 181 LVDCDVEVNEGCNGGFMAKALEFIKEKGITSESIYPYTATDG 222
LVDCD EVN+GCNGG M A EFIK+ GIT+E+ YPY A DG
Sbjct: 544 LVDCDTEVNQGCNGGLMEYAFEFIKQNGITTETNYPYAAKDG 669
>TC86539 similar to GP|22759715|dbj|BAC10906. cysteine proteinase {Zinnia
elegans}, partial (88%)
Length = 1265
Score = 346 bits (887), Expect = 9e-96
Identities = 185/341 (54%), Positives = 229/341 (66%), Gaps = 7/341 (2%)
Frame = +2
Query: 13 SLALFLSVAESFD-----YHEKDLASEEGLWDLYERWRSHH-TVSRSLDEKHKRFNVFKA 66
SL LFLS+A D Y +DL S + L +L+E W S H + +++EK RF VFK
Sbjct: 92 SLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKD 271
Query: 67 NVMHVHETNKLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNGSFMYENV 126
N+ H+ + NK+ Y L LN+FADL++ EFK+ Y K+D + F Y +V
Sbjct: 272 NLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRES---SEEEFTYRDV 442
Query: 127 DSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQELVDCDV 186
D +P S DWR GAVTPVK+QGQCGSCWAFS +AAVEGINQI T +L SLSEQEL+DCD
Sbjct: 443 D-LPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDT 619
Query: 187 EVNEGCNGGFMAKALEFI-KEKGITSESIYPYTATDGTCNTQKMVNEPAVSIDGYEIVPE 245
N GCNGG M A FI K G+ E YPY + TC +K V+E V+I+GY VP+
Sbjct: 620 TYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMKKEVSE-VVTINGYHDVPQ 796
Query: 246 NNEVALLKAAANQPISVGIDAGGSDFQFYSEGVFTGDCGTELDHGVAIVGYGAAQDGTKY 305
NNE +LLKA ANQP+SV I+A G DFQFYS GVF G CG+ELDHGV+ VGYG ++ G Y
Sbjct: 797 NNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSELDHGVSAVGYGTSK-GLDY 973
Query: 306 WIVKNSWGADWGEQGYIRMKRGISKKQGLCGIAMEASYPIK 346
IVKNSWGA WGE+G+IRMKR I K +G+CG+ ASYP K
Sbjct: 974 IIVKNSWGAKWGEKGFIRMKRNIGKSEGICGLYKMASYPTK 1096
>TC85913 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (94%)
Length = 1258
Score = 345 bits (884), Expect = 2e-95
Identities = 182/344 (52%), Positives = 232/344 (66%), Gaps = 5/344 (1%)
Frame = +3
Query: 6 LFCVALFSLALFLSVAESFDYHEKDLASEEGLWDLYERWRSHH-TVSRSLDEKHKRFNVF 64
L C+ LF++ + + L + +++ +E+W H+ V + L E+ R +F
Sbjct: 147 LLCLGLFAIQVT----------SRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIF 296
Query: 65 KANVMHVHETNKL--DKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNGSFM 122
K NV ++ +N +K YKL +N+FAD+TN EF I + +K H M I + +F
Sbjct: 297 KENVNYIEASNNAGNNKLYKLGINQFADITNEEF--IASRNKFKGH-MCSSITK-TSTFK 464
Query: 123 YENVDSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQELV 182
YEN SVP + DWR GAVTPVK+QGQCG CWAFSA+AA EGI+++ T LVSLSEQELV
Sbjct: 465 YENA-SVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELV 641
Query: 183 DCDVE-VNEGCNGGFMAKALEFI-KEKGITSESIYPYTATDGTCNTQKMVNEPAVSIDGY 240
DCD + V++GC GG M A +FI + G+ +E+ YPY DGTC+ + + PA +I GY
Sbjct: 642 DCDTKGVDQGCEGGLMDDAFKFIIQNHGLHTEAQYPYQGVDGTCSANE-TSTPAATIAGY 818
Query: 241 EIVPENNEVALLKAAANQPISVGIDAGGSDFQFYSEGVFTGDCGTELDHGVAIVGYGAAQ 300
E VP NNE AL KA ANQPISV IDA GSDFQFY GVFTG CGT+LDHGV VGYG +
Sbjct: 819 EDVPANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISN 998
Query: 301 DGTKYWIVKNSWGADWGEQGYIRMKRGISKKQGLCGIAMEASYP 344
DGTKYW+VKNSWG DWGE+GYIRM+R + QGLCGIAM ASYP
Sbjct: 999 DGTKYWLVKNSWGNDWGEEGYIRMQRSVDAAQGLCGIAMMASYP 1130
>TC76652 similar to EGAD|143257|152780 thiolprotease {Pisum sativum},
partial (96%)
Length = 1818
Score = 330 bits (846), Expect = 5e-91
Identities = 181/343 (52%), Positives = 222/343 (63%), Gaps = 8/343 (2%)
Frame = +1
Query: 13 SLALFLSVAESFDYHEKDLASEEG----LWDLYERWRSHHTVS-RSLDEKHKRFNVFKAN 67
SLAL +S+ S+D D ++ + + +YE W H S L EK KRF +FK N
Sbjct: 103 SLALDMSII-SYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDN 279
Query: 68 VMHVHETNKLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNGSFMYENV- 126
+ + E N L+ Y+L L +FADLTN E++S + +KID +R + + V
Sbjct: 280 LKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKLGGSKSNRYAPRVG 459
Query: 127 DSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQELVDCDV 186
D +P S DWR GAV VKDQ CGSCWAFSAIAAVEGIN+I T L+SLSEQELVDCD
Sbjct: 460 DKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDT 639
Query: 187 EVNEGCNGGFMAKALEF-IKEKGITSESIYPYTATDGTCNTQKMVNEPAVSIDGYEIVPE 245
NEGCNGG M A EF I GI SE YPY A DG C+ Q N V+ID YE VP
Sbjct: 640 SYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCD-QNRKNAKVVTIDDYEDVPA 816
Query: 246 NNEVALLKAAANQPISVGIDAGGSDFQFYSEGVFTGDCGTELDHGVAIVGYGAAQDGTKY 305
+E+AL KA ANQPI+V ++ GG +FQ Y GVFTG CGT LDHGVA VGYG ++G Y
Sbjct: 817 YDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAAVGYG-TENGKDY 993
Query: 306 WIVKNSWGADWGEQGYIRMKRGI-SKKQGLCGIAMEASYPIKS 347
WIV+NSWG WGEQGYIR++R + S + G CGIA+E SYPIK+
Sbjct: 994 WIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKN 1122
>TC89773 similar to PIR|S51817|S47312 cysteine proteinase (EC 3.4.22.-)
precursor - spring vetch, partial (87%)
Length = 1484
Score = 325 bits (834), Expect = 1e-89
Identities = 169/335 (50%), Positives = 215/335 (63%), Gaps = 4/335 (1%)
Frame = +2
Query: 16 LFLSVAESFDYHEKDLASEEGLWDLYERWR-SHHTVSRSLDEKHKRFNVFKANVMHVHET 74
LF S+ + + S E + +YE W HH V L EK +RF +FK N+ + E
Sbjct: 218 LFFSLITLSLAMDTSMRSNEEVMTMYEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFIDEH 397
Query: 75 NKLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNGS-FMYENVDSVPISK 133
N + YK+ LNKFAD+TN E++++Y +K D R I G + + + D +P+
Sbjct: 398 NAQNYTYKVGLNKFADMTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAFNSGDRLPVHV 577
Query: 134 DWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQELVDCDVEVNEGCN 193
DWR+ GAV +KDQG CGSCWAFS IA VE IN+I T LVSLSEQELVDCD NEGCN
Sbjct: 578 DWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEGCN 757
Query: 194 GGFMAKALEFIKEK-GITSESIYPYTATDGTCNTQKMVNEPAVSIDGYEIVPENNEVALL 252
GG M A EFI E GI +E YPY +G C+ + N VSIDGYE VP NE AL
Sbjct: 758 GGLMDYAFEFIGENGGIDTEQDYPYKGFEGRCDPTRK-NAKVVSIDGYEDVPAYNENALK 934
Query: 253 KAAANQPISVGIDAGGSDFQFYSEGVFTGDCGTELDHGVAIVGYGAAQDGTKYWIVKNSW 312
KA ++QP+SV I+AGG Q Y GVFTG CGT LDHGV +VGYG +++G YW+V+NSW
Sbjct: 935 KAVSHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYG-SENGVDYWLVRNSW 1111
Query: 313 GADWGEQGYIRMKRGISK-KQGLCGIAMEASYPIK 346
G +WGE GY +++R + K G CGIAM+ASYP+K
Sbjct: 1112 GTNWGEDGYFKLERNVKKINTGKCGIAMQASYPVK 1216
>TC85914 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (58%)
Length = 773
Score = 269 bits (688), Expect = 1e-72
Identities = 131/205 (63%), Positives = 152/205 (73%), Gaps = 2/205 (0%)
Frame = +3
Query: 149 QCGSCWAFSAIAAVEGINQIRTHHLVSLSEQELVDCDVE-VNEGCNGGFMAKALEFI-KE 206
QCG CWAFSA+ A EGI+++ T LVSLSEQELVDCD + V++GC GG M A +FI +
Sbjct: 18 QCGCCWAFSAVPAPEGIHKLSTGRLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQN 197
Query: 207 KGITSESIYPYTATDGTCNTQKMVNEPAVSIDGYEIVPENNEVALLKAAANQPISVGIDA 266
G+ +E+ YPY DGTC+ K + AV+I GYE VP NNE AL KA ANQPISV IDA
Sbjct: 198 HGLNTEAQYPYQGVDGTCSANK-ASIHAVTITGYEDVPANNEQALQKAVANQPISVAIDA 374
Query: 267 GGSDFQFYSEGVFTGDCGTELDHGVAIVGYGAAQDGTKYWIVKNSWGADWGEQGYIRMKR 326
GSDFQFY GVFTG CGTELDHGV VGYG DGTKYW+VKN WG DWGE+GYI+M+R
Sbjct: 375 SGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNLWGTDWGEEGYIKMQR 554
Query: 327 GISKKQGLCGIAMEASYPIKSSPTL 351
G+ +GLCGIAMEASYP P L
Sbjct: 555 GVDAAEGLCGIAMEASYPTA*LPNL 629
>TC85915 homologue to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (54%)
Length = 709
Score = 246 bits (627), Expect = 1e-65
Identities = 120/189 (63%), Positives = 142/189 (74%), Gaps = 2/189 (1%)
Frame = +1
Query: 155 AFSAIAAVEGINQIRTHHLVSLSEQELVDCDVE-VNEGCNGGFMAKALEFI-KEKGITSE 212
AFSA+AA EGI ++ T LVSLSEQELVDCD + V++GC GG M A +FI + G+++E
Sbjct: 1 AFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTE 180
Query: 213 SIYPYTATDGTCNTQKMVNEPAVSIDGYEIVPENNEVALLKAAANQPISVGIDAGGSDFQ 272
+ YPY DGTCN K + A +I GYE VP NNE AL KA ANQPISV IDA GSDFQ
Sbjct: 181 AAYPYQGVDGTCNANK-ASIHAATITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQ 357
Query: 273 FYSEGVFTGDCGTELDHGVAIVGYGAAQDGTKYWIVKNSWGADWGEQGYIRMKRGISKKQ 332
FY GVF+G CGTELDHGV VGYG DGTKYW+VKNSWG DWG++GYIRM+RG+ +
Sbjct: 358 FYKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGQEGYIRMQRGMDAPE 537
Query: 333 GLCGIAMEA 341
LCGIAM+A
Sbjct: 538 XLCGIAMQA 564
>TC87304 similar to GP|13897890|gb|AAK48495.1 putative cysteine protease
{Ipomoea batatas}, partial (24%)
Length = 1325
Score = 223 bits (568), Expect = 9e-59
Identities = 142/359 (39%), Positives = 204/359 (56%), Gaps = 17/359 (4%)
Frame = +3
Query: 6 LFCVALFSLALFLSVAESFDYHEK--DLASEEGLWDLYERWRSHH--TVSRSLDEKHKRF 61
L +++ L+ LS S H K +S+E +++L++ W+ H + S +E KRF
Sbjct: 69 LILISITCLSFALSSEYSISSHGKLDKFSSDEEVFELFQMWKKEHGRDYANSEEENAKRF 248
Query: 62 NVFKANVMHVHETN---KLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGN 118
+FK N +++E N K ++L LNKFAD++ EF Y KI+ +P
Sbjct: 249 EIFKTNFKYINEMNAKRKSQTQHRLSLNKFADMSPEEFSKTYLP-KIEMQ-----VPSNR 410
Query: 119 GSFMYENVD---SVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVS 175
+ ++ D ++P S DWR GAVT V+DQG C S WAFS A+EG+N+I T +L++
Sbjct: 411 DNAKLKDDDDCENLPTSVDWREKGAVTEVRDQGDCQSHWAFSVTGAIEGLNKIVTGNLIN 590
Query: 176 LSEQELVDCDVEVNEGCNGGFMAKALEFIKEK-GITSESIYPYTATDGTCNTQKMVNEPA 234
LS QELVDCD ++GC GGF A ++ E GI +E+ YPY A +GTC K
Sbjct: 591 LSAQELVDCD-PASKGCAGGFYFNAFGYVIENGGIDTEANYPYLAKNGTC---KENANKV 758
Query: 235 VSIDGYEIVPENNEVALLKAAANQPISVGIDAGGSDFQFYSEGVFTGD-CGTELDHG--- 290
VSID +V + E ALL + QP+SV +DA G QFY+ GV+ G+ C E +
Sbjct: 759 VSIDNL-LVLDGTEEALLCRTSKQPVSVSLDATG--LQFYAGGVYGGENCKKESRNANLV 929
Query: 291 VAIVGYGAAQDGTKYWIVKNSWGADWGEQGYIRMKRGISKKQ--GLCGIAMEASYPIKS 347
IVGY + +G YWIVKNSWG DWGE+GY+ +KR + + G+C I YP+K+
Sbjct: 930 GLIVGYDSV-NGEDYWIVKNSWGKDWGEKGYLFIKRNVFEDWPFGVCAINAAVGYPVKT 1103
>TC92862 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (41%)
Length = 581
Score = 205 bits (522), Expect = 2e-53
Identities = 97/148 (65%), Positives = 110/148 (73%)
Frame = +2
Query: 204 IKEKGITSESIYPYTATDGTCNTQKMVNEPAVSIDGYEIVPENNEVALLKAAANQPISVG 263
I+ G+ +E+ YPY DGTC+ K + AV+I GYE VP NNE AL KA ANQPISV
Sbjct: 2 IQNHGLNTEAQYPYQGVDGTCSANK-ASIHAVTITGYEDVPANNEQALQKAVANQPISVT 178
Query: 264 IDAGGSDFQFYSEGVFTGDCGTELDHGVAIVGYGAAQDGTKYWIVKNSWGADWGEQGYIR 323
IDA GSDFQFY GVFTG CGTELDHGV VGYG DGTKYW+VKNSWG DWGE+GYI+
Sbjct: 179 IDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIK 358
Query: 324 MKRGISKKQGLCGIAMEASYPIKSSPTL 351
M+RG+ +GLCGIAMEASYP P L
Sbjct: 359 MQRGVDAAEGLCGIAMEASYPTA*LPNL 442
>TC85449 homologue to PIR|S71923|S71923 cysteine proteinase (EC 3.4.22.-) -
garden pea, complete
Length = 1534
Score = 189 bits (479), Expect = 2e-48
Identities = 122/313 (38%), Positives = 169/313 (53%), Gaps = 12/313 (3%)
Frame = +3
Query: 53 SLDEKHKRFNVFKANVMHVHETNKLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFH 112
++DE +RF +F N+ + TNK Y L +N FAD T EF+S HR+
Sbjct: 258 TVDEMKRRFKIFSENLQLIKSTNKKRLGYTLGVNHFADWTWEEFRS---------HRL-- 404
Query: 113 GIPRGNGSFMYEN---VDSV-PISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQI 168
G + + + N D V P KDWR G V+ VKDQG CGSCW FS A+E
Sbjct: 405 GAAQNCSATLKGNHRITDVVLPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQ 584
Query: 169 RTHHLVSLSEQELVDCDVEVNE-GCNGGFMAKALEFIK-EKGITSESIYPYTATDGTCNT 226
+SLSEQ+LVDC N GCNGG ++A E+IK G+ +E YPYT +G C
Sbjct: 585 AFGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTGQNGLC-- 758
Query: 227 QKMVNE-PAVSIDGYEIVPENNEVALLKAAA-NQPISVGIDAGGSDFQFYSEGVFTG-DC 283
K +E AV + G + E L A A +P+SV DF+ Y +GV+T C
Sbjct: 759 -KFTSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFQV-VDDFRLYKKGVYTSTTC 932
Query: 284 GT---ELDHGVAIVGYGAAQDGTKYWIVKNSWGADWGEQGYIRMKRGISKKQGLCGIAME 340
G+ +++H V VGYG +DG YW++KNSWG +WG+ GY +M+ G + +CG+A
Sbjct: 933 GSTPMDVNHAVLAVGYG-IEDGVPYWLIKNSWGGEWGDHGYFKMEMG----KNMCGVATC 1097
Query: 341 ASYPIKSSPTLKD 353
+SYP+ + KD
Sbjct: 1098 SSYPVVA*MVRKD 1136
>TC76560 homologue to SP|P25804|CYSP_PEA Cysteine proteinase 15A precursor
(EC 3.4.22.-) (Turgor-responsive protein 15A). [Garden
pea], partial (98%)
Length = 1642
Score = 185 bits (469), Expect = 3e-47
Identities = 129/373 (34%), Positives = 184/373 (48%), Gaps = 38/373 (10%)
Frame = +3
Query: 3 MKNLFCVALFSLALFLSVAESF--DYHEKDL-------ASEEGLWDL---YERWRSHHTV 50
M + F +ALF A + A + D + DL +E+ + + + ++S +
Sbjct: 159 MAHRFLIALFLFATVATAATTLSDDTNSDDLLIRQVVDTAEDHILNAEHHFTSFKSKFSK 338
Query: 51 SRSLDEKHK-RFNVFKANVMHVHETNKLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHR 109
+ + E+H RF VFK+N++ KLD + + KF+DLT EF+ R
Sbjct: 339 NYATKEEHDYRFGVFKSNLIKAKLHQKLDPSAQHGITKFSDLTASEFR-----------R 485
Query: 110 MFHGI------PRGNGSFMYENVDSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVE 163
F G+ P +++P DWR GAVTPVKDQG CGSCWAFS A+E
Sbjct: 486 QFLGLNKRLRLPAHAQKAPILPTNNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALE 665
Query: 164 GINQIRTHHLVSLSEQELVDCD--------VEVNEGCNGGFMAKALEFIKEK-GITSESI 214
G N + T L SLSEQ+LVDCD + GCNGG M A E+I + G+ SE
Sbjct: 666 GANYLATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGLMNNAFEYILQSGGVVSEKD 845
Query: 215 YPYTATDGTCNTQKMVNEPAVSIDGYEIVPENNEVALLKAAANQPISVGIDAGGSDFQFY 274
Y YT DG+C K ++ S+ + +V + + N P++V I+A Q Y
Sbjct: 846 YAYTGRDGSCKFDK--SKVVASVSNFSVVSLDEDQIAANLVKNGPLAVAINAAW--MQTY 1013
Query: 275 SEGVFTG--DCGTELDHGVAIVGYGAAQDG--------TKYWIVKNSWGADWGEQGYIRM 324
GV LDHGV +VG+G Q G YWI+KNSWG +WGE+GY ++
Sbjct: 1014 MSGVSCPYICAKARLDHGVLLVGFG--QGGYAPIRLKEKPYWIIKNSWGQNWGEEGYYKI 1187
Query: 325 KRGISKKQGLCGI 337
RG + +CG+
Sbjct: 1188 CRG----RNVCGV 1214
>TC81601 similar to GP|13491750|gb|AAK27968.1 cysteine protease {Ipomoea
batatas}, partial (49%)
Length = 725
Score = 177 bits (450), Expect = 4e-45
Identities = 108/218 (49%), Positives = 130/218 (59%), Gaps = 5/218 (2%)
Frame = +1
Query: 4 KNLFCVALFSLALFLSVAESFDYHEKDLASEEGLWDLYERWRSHH-TVSRSLDEKHKRFN 62
KN + +ALF L L+VA + + L L + +E+W + H V EK KRF
Sbjct: 100 KNQYILALF---LLLAVAGITNVMSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFM 270
Query: 63 VFKANVMHVHETNKLD-KPYKLKLNKFADLTNYEFK-SIYASSKIDHHRMFHGIPRGNGS 120
+FK NV + N D +PYKL +N ADLT EFK S KID S
Sbjct: 271 IFKDNVEFIESFNAADNQPYKLSVNHLADLTLDEFKASRNGYKKIDREFT-------TTS 429
Query: 121 FMYENVDSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQE 180
F YENV ++P + DWR GAVTP+KDQGQCGSCWAFS +AA EGINQI T LVSLSEQE
Sbjct: 430 FKYENVTAIPAAVDWRVKGAVTPIKDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQE 609
Query: 181 LVDCDVE-VNEGCNGGFMAKALEF-IKEKGITSESIYP 216
LVDCD + ++GC GG M EF IK GITSE+ YP
Sbjct: 610 LVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNYP 723
>TC85912 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (60%)
Length = 718
Score = 172 bits (435), Expect = 2e-43
Identities = 99/217 (45%), Positives = 134/217 (61%), Gaps = 5/217 (2%)
Frame = +2
Query: 13 SLALFLSVAESFDYHEKDLASEEGLWDLYERWRSHH-TVSRSLDEKHKRFNVFKANVMHV 71
SLAL L + F ++ +++ + +W S + V + E+ KRF +F NV ++
Sbjct: 83 SLALLLCLG-LFAIQVTSRTLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYI 259
Query: 72 HETNKLD--KPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNGSFMYENVDSV 129
NK D K Y L +N+FADLTN EF S + +K H M I R +F YEN ++
Sbjct: 260 EAFNKGDNNKLYTLGVNQFADLTNDEFTS--SRNKFKGH-MCSSITR-TSTFKYENASAI 427
Query: 130 PISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQELVDCDVE-V 188
P S DWR GAVTPVK+QGQCG CWAFSA+AA EGI+++ T L+SLSEQELVDCD + V
Sbjct: 428 PSSVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGV 607
Query: 189 NEGCNGGFMAKALEF-IKEKGITSESIYPYTATDGTC 224
++ C GG M A +F I+ G+ +E+ YPY DGTC
Sbjct: 608 DQSCEGGLMDDAFKFIIQNHGLNTEANYPYQGVDGTC 718
>TC87982 similar to PIR|T12041|T12041 cysteine proteinase (EC 3.4.22.-) 3
precursor - kidney bean, partial (93%)
Length = 1675
Score = 166 bits (420), Expect = 1e-41
Identities = 81/140 (57%), Positives = 97/140 (68%), Gaps = 1/140 (0%)
Frame = +1
Query: 208 GITSESIYPYTATDGTCNTQKMVNEPAVSIDGYEIVPENNEVALLKAAANQPISVGIDAG 267
GI S+ YPY DG C+ K N VSID YE VP +E+AL KA ANQPISV I+AG
Sbjct: 781 GIDSDEDYPYRGVDGKCDQYKK-NARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAG 957
Query: 268 GSDFQFYSEGVFTGDCGTELDHGVAIVGYGAAQDGTKYWIVKNSWGADWGEQGYIRMKRG 327
G +FQ Y G+FTG CGT LDHGV VGYG ++G YWIV+NSWG WGE GY+RM+R
Sbjct: 958 GREFQLYVSGIFTGKCGTALDHGVTAVGYG-TENGVDYWIVRNSWGKSWGESGYVRMERN 1134
Query: 328 ISKK-QGLCGIAMEASYPIK 346
++ G CGI M++SYPIK
Sbjct: 1135 LAASVAGKCGIVMQSSYPIK 1194
Score = 166 bits (420), Expect = 1e-41
Identities = 103/235 (43%), Positives = 142/235 (59%), Gaps = 10/235 (4%)
Frame = +2
Query: 6 LFCVALFSLALFLSVAESFDYHEKDLAS---EEGLWDLYERWR-SHHTVSRSLD--EKHK 59
+F + + AL +S+ S+D D +S ++ + ++YE WR H ++ ++D EK K
Sbjct: 152 VFTLFTATFALDMSII-SYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNIDGSEKDK 328
Query: 60 RFNVFKANVMHVHETNKLDKPYKLKLNKFADLTNYEFKSIYASSKIDHHRMFHGIPRGNG 119
RF +FK N+ + E N ++ YK+ LN+FADL+N E++S Y +KID M +
Sbjct: 329 RFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMARTKTRS 508
Query: 120 SFMYENV-DSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSE 178
+ +V D +P S DWR+ GAV VKDQG CGSCWAFS IAAVEGIN+I T LVSLSE
Sbjct: 509 NRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTGELVSLSE 688
Query: 179 QELVDCDVEVNEGCNGGFMAKAL--EFIKEKGITSESIYPYTA-TDGTCNTQKMV 230
QELVDCD VN GC+GG M AL + +T+ I P A NT+KM+
Sbjct: 689 QELVDCDRTVNAGCDGGLMEYAL*VHYKPMVVLTAMRIIPIVALMVNVINTRKML 853
>AW775259 similar to GP|13491750|gb cysteine protease {Ipomoea batatas},
partial (46%)
Length = 654
Score = 165 bits (418), Expect = 2e-41
Identities = 96/195 (49%), Positives = 120/195 (61%), Gaps = 4/195 (2%)
Frame = +3
Query: 14 LALFLSVAESFDYHEKDLASEEGLWDLYERWRSHH-TVSRSLDEKHKRFNVFKANVMHVH 72
L L L++A+ + + L L + +E+W S + + + EK KRF +FK NV +
Sbjct: 81 LFLLLALADITNVMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIE 260
Query: 73 ETNKLD-KPYKLKLNKFADLTNYEFK-SIYASSKIDHHRMFHGIPRGNGSFMYENVDSVP 130
N D KPYKL +N ADLT EFK S KID R F SF YENV ++P
Sbjct: 261 SFNAADNKPYKLSVNHLADLTLDEFKASRNGYKKID--REF-----ATTSFKYENVTAIP 419
Query: 131 ISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQIRTHHLVSLSEQELVDCDVE-VN 189
+ DWR GAVTP+KDQGQCGSCWAFS +AA+EGINQI T L+SLSEQELVDCD + +
Sbjct: 420 EAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGED 599
Query: 190 EGCNGGFMAKALEFI 204
+GC GG M EFI
Sbjct: 600 QGCEGGLMEDGFEFI 644
>BI311691 similar to PIR|T03941|T039 cysteine proteinase (EC 3.4.22.-)
precursor - common tobacco, partial (13%)
Length = 781
Score = 158 bits (399), Expect = 3e-39
Identities = 106/257 (41%), Positives = 140/257 (54%), Gaps = 14/257 (5%)
Frame = +3
Query: 41 YERWRSHHTVSRSLD-EKHKRFNVFKANVMHVHETNKLDKP-YKLKLNKFADLTNYEFKS 98
YE+W + + D EK KRF +F N+ ++ N+ K YKL LN+F+DLTN EF +
Sbjct: 33 YEQWMKEFERNYADDAEKEKRFKIFAENLEYIENFNRAGKQTYKLGLNQFSDLTNEEFAA 212
Query: 99 IYASSKIDHHRMFHGIPRGNGSFMYENVDS----------VPISKDWRTSGAVTPVKDQG 148
+Y + + F + +P S DW+ SGAVT VK QG
Sbjct: 213 LYNCVDLKRELESSMVSTAGPIFNMSEISPTNSPKGKRKPIPDSVDWKESGAVTNVKRQG 392
Query: 149 QCGSCWAFSAIAAVEGINQIRTH-HLVSLSEQELVDCDVEVNEGCNGGFMAKALEFIKEK 207
C+AF+ AAVEGI +I+T L SLS QELVDCD + N GC GG + KALE++K
Sbjct: 393 ---CCYAFATTAAVEGIMKIKTDKELTSLSMQELVDCD-KANGGCEGGSVRKALEYMKTN 560
Query: 208 GITSESIYPYTATDGTCNTQKMVNEPAVSIDGYEIVPENNEVALLKAAANQPISVGIDAG 267
GI + YPYT GTC + K + A IDGY IV E LL+A A QP++V I A
Sbjct: 561 GIAKDVDYPYTEKVGTCLSNK--KDRAAKIDGYVIV-SPGEQNLLEAVAQQPVTVAI-AI 728
Query: 268 GSDFQFYSEGVF-TGDC 283
DF+ Y G+F +G C
Sbjct: 729 NDDFKKYESGIFGSGPC 779
>TC90718 similar to GP|13491750|gb|AAK27968.1 cysteine protease {Ipomoea
batatas}, partial (44%)
Length = 796
Score = 153 bits (387), Expect = 9e-38
Identities = 87/195 (44%), Positives = 123/195 (62%), Gaps = 5/195 (2%)
Frame = +2
Query: 35 EGLWDLYERWRSHHT-VSRSLDEKHKRFNVFKANVMHVHETNK--LDKPYKLKLNKFADL 91
+ +++ +E+W S ++ V + E+ +R +F ANV ++ N +K YKL +N+FADL
Sbjct: 167 DSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYIEVFNNDANNKLYKLGINQFADL 346
Query: 92 TNYEFKSIYASSKIDHHRMFHGIPRGNGSFMYENVDSVPISKDWRTSGAVTPVKDQGQCG 151
TN EF I + +K H M I + +F YENV ++P + DWR GAVTPVK+QGQCG
Sbjct: 347 TNEEF--IASRNKFKGH-MCSSIAKTT-TFKYENVSAIPSTVDWRKKGAVTPVKNQGQCG 514
Query: 152 SCWAFSAIAAVEGINQIRTHHLVSLSEQELVDCDVE-VNEGCNGGFMAKALEF-IKEKGI 209
CWAFSA+AA EGI ++ T LVSLSE ELVDCD + V++GC GG M A +F I+ G+
Sbjct: 515 CCWAFSAVAATEGITKLSTGKLVSLSEXELVDCDTKGVDQGCEGGLMDXAFKFIIQNLGL 694
Query: 210 TSESIYPYTATDGTC 224
+E+ P C
Sbjct: 695 XTEAAXPSXXYGHPC 739
>BG583083 similar to PIR|B84752|B847 probable cysteine proteinase [imported]
- Arabidopsis thaliana, partial (27%)
Length = 740
Score = 149 bits (376), Expect = 2e-36
Identities = 85/201 (42%), Positives = 115/201 (56%), Gaps = 1/201 (0%)
Frame = +2
Query: 49 TVSRSLDEKHKRFNVFKANVMHVHETNKL-DKPYKLKLNKFADLTNYEFKSIYASSKIDH 107
T + + E KR +FK N+ ++ N +K YKL LN+++DLT+ EF + + K+
Sbjct: 107 TQNDKISELEKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVS- 283
Query: 108 HRMFHGIPRGNGSFMYENVDSVPISKDWRTSGAVTPVKDQGQCGSCWAFSAIAAVEGINQ 167
+ + + + D VP + DWR GAVT VKDQG CG CWAFS +AAVEG +
Sbjct: 284 -KQLSSSKMRSAAVPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVK 460
Query: 168 IRTHHLVSLSEQELVDCDVEVNEGCNGGFMAKALEFIKEKGITSESIYPYTATDGTCNTQ 227
I T L+SLSEQ+LVDCD E N GC+GG M A ++I +KGI SE+ YPY TC
Sbjct: 461 INTGELISLSEQQLVDCD-ERNSGCHGGNMDSAFKYIIQKGIVSEADYPYQEGSQTCQLN 637
Query: 228 KMVNEPAVSIDGYEIVPENNE 248
+ A I VP N+E
Sbjct: 638 DQMKFEA-QITNLLDVPANDE 697
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.317 0.134 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,412,068
Number of Sequences: 36976
Number of extensions: 155037
Number of successful extensions: 868
Number of sequences better than 10.0: 66
Number of HSP's better than 10.0 without gapping: 786
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 790
length of query: 355
length of database: 9,014,727
effective HSP length: 97
effective length of query: 258
effective length of database: 5,428,055
effective search space: 1400438190
effective search space used: 1400438190
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (27.3 bits)
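The statistics footer above can be cross-checked with a short calculation. The sketch below is not part of the BLAST output and is not how BLAST itself is implemented. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, E = search space * 2^-bits) and assumes that the nucleotide database length is counted in translated residues (letters / 3), which is consistent with the "length of database" line. The function names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 355            # length of query (residues)
DB_LETTERS = 27_044_181    # nucleotide letters in MTGI
NUM_SEQS = 36_976          # sequences in MTGI
HSP_LEN = 97               # effective HSP length
LAMBDA, K = 0.267, 0.0410  # gapped Lambda and K


def effective_search_space(query_len, db_letters, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    # Assumption: the nucleotide database is counted in translated residues.
    db_len = db_letters // 3
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, NUM_SEQS, HSP_LEN
    )
    print(eff_q, eff_db, space)
    bits = bit_score(1378)  # raw score of the top hit, TC78139
    print(round(bits), expect_value(bits, space))
```

With the footer values, the three effective lengths reproduce the reported 258, 5,428,055 and 1400438190, and a raw score of 1378 converts to about 535 bits. The E-value computed this way may differ slightly in the last digit from the reported one because of rounding in the printed Lambda and K.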