
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC140034.6 + phase: 0
(360 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC85911 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 357 5e-99
TC85913 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 344 3e-95
TC78139 similar to PIR|S22502|S22502 cysteine proteinase (EC 3.4... 341 3e-94
TC86539 similar to GP|22759715|dbj|BAC10906. cysteine proteinase... 322 1e-88
TC89773 similar to PIR|S51817|S47312 cysteine proteinase (EC 3.4... 322 1e-88
TC76652 similar to EGAD|143257|152780 thiolprotease {Pisum sativ... 318 2e-87
TC87982 similar to PIR|T12041|T12041 cysteine proteinase (EC 3.4... 169 9e-83
TC85914 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 270 8e-73
TC85915 homologue to GP|10336513|dbj|BAB13759. cysteine proteina... 254 3e-68
TC87304 similar to GP|13897890|gb|AAK48495.1 putative cysteine p... 235 2e-62
TC85449 homologue to PIR|S71923|S71923 cysteine proteinase (EC 3... 223 9e-59
TC76560 homologue to SP|P25804|CYSP_PEA Cysteine proteinase 15A ... 209 1e-54
TC85912 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 203 1e-52
TC92862 similar to GP|10336513|dbj|BAB13759. cysteine proteinase... 188 3e-48
TC81601 similar to GP|13491750|gb|AAK27968.1 cysteine protease {... 184 6e-47
TC90718 similar to GP|13491750|gb|AAK27968.1 cysteine protease {... 181 3e-46
AW775259 similar to GP|13491750|gb cysteine protease {Ipomoea ba... 172 2e-43
TC81254 similar to GP|600111|emb|CAA84378.1| cysteine proteinase... 171 4e-43
BG583083 similar to PIR|B84752|B847 probable cysteine proteinase... 161 4e-40
BI311691 similar to PIR|T03941|T039 cysteine proteinase (EC 3.4.... 148 3e-36
>TC85911 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (96%)
Length = 1315
Score = 357 bits (915), Expect = 5e-99
Identities = 172/309 (55%), Positives = 221/309 (70%), Gaps = 6/309 (1%)
Frame = +3
Query: 56 AMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKN--SYNLTDNKFADLT 113
+M +R + W+ +G+ YK + ERE RF I+ N++YI+ N N SY L N+FADLT
Sbjct: 189 SMHERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLT 368
Query: 114 NEEF---QSTYMGLSTRLRSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAF 170
NEEF ++ + G T F+Y+ +P + DWRK+GAVT + +QGQCG CWAF
Sbjct: 369 NEEFVASRNKFKGHMCSSIIRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAF 548
Query: 171 AAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQD 230
+AVAA EGI+K+ +GKL+SLSEQEL+DCD K +QGC+GGLM+ A+ FII+N GL TE
Sbjct: 549 SAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQ 728
Query: 231 YPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYS 290
YPY+GVDGTC KA+ A +I+GYE+VPA+NE L+ A A+QP+SVAIDA G FQFY
Sbjct: 729 YPYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYK 908
Query: 291 EGVFSGICGKQLNHGVTVVGYG-KETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMC 349
GVF+G CG +L+HGVT VGYG KYW+VKNSWG DWGE GYI M+R + EG+C
Sbjct: 909 SGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLC 1088
Query: 350 GIAMQASYP 358
GIAMQASYP
Sbjct: 1089 GIAMQASYP 1115
>TC85913 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (94%)
Length = 1258
Score = 344 bits (882), Expect = 3e-95
Identities = 168/312 (53%), Positives = 221/312 (69%), Gaps = 6/312 (1%)
Frame = +3
Query: 53 DVEAMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNS--YNLTDNKFA 110
D + ++ + W+ +G+ YK ERE R I++ NV YI+ N N+ Y L N+FA
Sbjct: 198 DDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFA 377
Query: 111 DLTNEEF---QSTYMGLSTRLRSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGC 167
D+TNEEF ++ + G + + F+Y E+ +P + DWRK+GAVT + +QGQCG C
Sbjct: 378 DITNEEFIASRNKFKGHMCSSITKTSTFKY-ENASVPSTVDWRKKGAVTPVKNQGQCGCC 554
Query: 168 WAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTT 227
WAF+AVAA EGI+K+ +GKL+SLSEQEL+DCD K +QGC+GGLM+ A+ FII+N GL T
Sbjct: 555 WAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLHT 734
Query: 228 EQDYPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQ 287
E YPY+GVDGTC + + AA+I+GYE+VPA+NE L+ A A+QP+SVAIDA G FQ
Sbjct: 735 EAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISVAIDASGSDFQ 914
Query: 288 FYSEGVFSGICGKQLNHGVTVVGYG-KETINKYWIVKNSWGADWGESGYIRMKRDTLSKE 346
FY GVF+G CG QL+HGVT VGYG KYW+VKNSWG DWGE GYIRM+R + +
Sbjct: 915 FYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYIRMQRSVDAAQ 1094
Query: 347 GMCGIAMQASYP 358
G+CGIAM ASYP
Sbjct: 1095 GLCGIAMMASYP 1130
>TC78139 similar to PIR|S22502|S22502 cysteine proteinase (EC 3.4.22.-) -
kidney bean, complete
Length = 1720
Score = 341 bits (874), Expect = 3e-94
Identities = 173/345 (50%), Positives = 228/345 (65%), Gaps = 8/345 (2%)
Frame = +3
Query: 23 TTIFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRF 82
TT +LL++ + ++ SE H + S E++ ++ W + H ++ +E++ RF
Sbjct: 435 TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERW-RSHHTVSRNLNEKQKRF 611
Query: 83 GIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTRLRSHNTG-------F 135
++++NV ++ N Y L NKFAD+TN EF++TY G G F
Sbjct: 612 NVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTF 791
Query: 136 RYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSEQEL 195
Y+ P S DWRK+GAVT++ DQGQCG CWAF+ V AVEGIN+IK+ +L+ LSEQEL
Sbjct: 792 MYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQEL 971
Query: 196 IDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAASISGY 255
IDCD + NQGC GGLME A+ +I + GG+TTE YPY DG+C K A SI G+
Sbjct: 972 IDCDNQE-NQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVSIDGH 1148
Query: 256 EEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYGKET 315
E VPA++E L A A+QPVSVAIDAGG FQFYSEGVF+G CGK+LNHGV +VGYG
Sbjct: 1149 ETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTV 1328
Query: 316 I-NKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 359
YWIV+NSWGA+WGE GYIRMKR+ +KEG+CGIAM+ASYP+
Sbjct: 1329 DGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPV 1463
>TC86539 similar to GP|22759715|dbj|BAC10906. cysteine proteinase {Zinnia
elegans}, partial (88%)
Length = 1265
Score = 322 bits (826), Expect = 1e-88
Identities = 162/315 (51%), Positives = 220/315 (69%), Gaps = 7/315 (2%)
Frame = +2
Query: 51 SSDVEAMKKR---FDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDN 107
S D+++M K F+ W+ RHG+ Y+ +E+ +RF +++ N+++I +N ++Y L N
Sbjct: 152 SEDLKSMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVSNYWLGLN 331
Query: 108 KFADLTNEEFQSTYMGL----STRLRSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQ 163
+FADL+++EF++ Y+GL S R S F Y + DLP+S DWRK+GAVT + +QGQ
Sbjct: 332 EFADLSHQEFKNKYLGLKVDLSQRRESSEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQ 508
Query: 164 CGGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENG 223
CG CWAF+ VAAVEGIN+I +G L SLSEQELIDCD + N GC GGLM+ A++FI++NG
Sbjct: 509 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDT-TYNNGCNGGLMDYAFSFIVKNG 685
Query: 224 GLTTEQDYPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGG 283
GL E+DYPY + TC+M+K +I+GY +VP +NE L A A+QP+SVAI+A G
Sbjct: 686 GLHKEEDYPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASG 865
Query: 284 YSFQFYSEGVFSGICGKQLNHGVTVVGYGKETINKYWIVKNSWGADWGESGYIRMKRDTL 343
FQFYS GVF G CG +L+HGV+ VGYG Y IVKNSWGA WGE G+IRMKR+
Sbjct: 866 RDFQFYSGGVFDGHCGSELDHGVSAVGYGTSKGLDYIIVKNSWGAKWGEKGFIRMKRNIG 1045
Query: 344 SKEGMCGIAMQASYP 358
EG+CG+ ASYP
Sbjct: 1046 KSEGICGLYKMASYP 1090
>TC89773 similar to PIR|S51817|S47312 cysteine proteinase (EC 3.4.22.-)
precursor - spring vetch, partial (87%)
Length = 1484
Score = 322 bits (825), Expect = 1e-88
Identities = 169/356 (47%), Positives = 225/356 (62%), Gaps = 9/356 (2%)
Frame = +2
Query: 13 LAMMTSTILTTTIFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKY 72
L T T+ + TI LL + + S T + + +V M ++ W+ +H + Y
Sbjct: 170 LLSQTPTMASITITSLLFFS----LITLSLAMDTSMRSNEEVMTM---YEEWLVKHHKVY 328
Query: 73 KHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTRLRSH- 131
E++ RF I++ N+ +I NAQ +Y + NKFAD+TNEE+++ Y+G + +
Sbjct: 329 NGLGEKDQRFEIFKDNLGFIDEHNAQNYTYKVGLNKFADMTNEEYRNMYLGTKNDAKRNV 508
Query: 132 -----NTGFRYD-EHGD-LPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKS 184
TG RY GD LP DWR +GAV I DQG CG CWAF+ +A VE INKI +
Sbjct: 509 MKIKITTGHRYAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVT 688
Query: 185 GKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEK 244
GKL+SLSEQEL+DCD ++ N+GC GGLM+ A+ FI ENGG+ TEQDYPY+G +G C +
Sbjct: 689 GKLVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIGENGGIDTEQDYPYKGFEGRCDPTR 865
Query: 245 AAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNH 304
SI GYE+VPA NE LK A +HQPVSVAI+AGG + Q Y GVF+G CG L+H
Sbjct: 866 KNAKVVSIDGYEDVPAYNENALKKAVSHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDH 1045
Query: 305 GVTVVGYGKETINKYWIVKNSWGADWGESGYIRMKRDTLS-KEGMCGIAMQASYPL 359
GV VVGYG E YW+V+NSWG +WGE GY +++R+ G CGIAMQASYP+
Sbjct: 1046 GVVVVGYGSENGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 1213
>TC76652 similar to EGAD|143257|152780 thiolprotease {Pisum sativum},
partial (96%)
Length = 1818
Score = 318 bits (815), Expect = 2e-87
Identities = 160/355 (45%), Positives = 223/355 (62%), Gaps = 9/355 (2%)
Frame = +1
Query: 14 AMMTSTILTTTIFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYK 73
AM +L + F + + + +I+ + P K ++ E + ++ W+ +HG+ Y
Sbjct: 58 AMKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTM-YEEWLVKHGKSYN 234
Query: 74 HNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLST------- 126
E++ RF I++ N+++I N ++Y L +FADLTNEE++S ++G
Sbjct: 235 GLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMK 414
Query: 127 RLRSHNTGFRYDEHGD-LPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSG 185
+L + GD LPES DWRKEGAV + DQ CG CWAF+A+AAVEGINKI +G
Sbjct: 415 KLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTG 594
Query: 186 KLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKA 245
LISLSEQEL+DCD S N+GC GGLM+ A+ FII NGG+ +E DYPY+ VDG C +
Sbjct: 595 DLISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRK 771
Query: 246 AHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHG 305
+I YE+VPA +E L+ A A+QP++VA++ GG FQ Y GVF+G CG L+HG
Sbjct: 772 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 951
Query: 306 VTVVGYGKETINKYWIVKNSWGADWGESGYIRMKRD-TLSKEGMCGIAMQASYPL 359
V VGYG E YWIV+NSWG WGE GYIR++R+ S+ G CGIA++ SYP+
Sbjct: 952 VAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 1116
>TC87982 similar to PIR|T12041|T12041 cysteine proteinase (EC 3.4.22.-) 3
precursor - kidney bean, partial (93%)
Length = 1675
Score = 169 bits (428), Expect(2) = 9e-83
Identities = 79/139 (56%), Positives = 97/139 (68%), Gaps = 1/139 (0%)
Frame = +1
Query: 222 NGGLTTEQDYPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDA 281
NGG+ +++DYPY GVDG C K SI YE+VPA +E LK A A+QP+SVAI+A
Sbjct: 775 NGGIDSDEDYPYRGVDGKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEA 954
Query: 282 GGYSFQFYSEGVFSGICGKQLNHGVTVVGYGKETINKYWIVKNSWGADWGESGYIRMKRD 341
GG FQ Y G+F+G CG L+HGVT VGYG E YWIV+NSWG WGESGY+RM+R+
Sbjct: 955 GGREFQLYVSGIFTGKCGTALDHGVTAVGYGTENGVDYWIVRNSWGKSWGESGYVRMERN 1134
Query: 342 -TLSKEGMCGIAMQASYPL 359
S G CGI MQ+SYP+
Sbjct: 1135 LAASVAGKCGIVMQSSYPI 1191
Score = 155 bits (393), Expect(2) = 9e-83
Identities = 83/212 (39%), Positives = 129/212 (60%), Gaps = 12/212 (5%)
Frame = +2
Query: 16 MTSTILTTTIFILLMLCNTCVIASESECPPTHKQKSS--DVEAMKKRFDGWVKRHGRKYK 73
M ++ T+F + +I+ + TH KSS + +K ++ W +HG+
Sbjct: 134 MLVILIVFTLFTATFALDMSIISYDK----THSDKSSRRSDKEVKNIYEEWRVKHGKLNN 301
Query: 74 HND--EREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMG-------- 123
+ D E++ RF I++ N+++I NA+ +Y + N+FADL+NEE++S Y+G
Sbjct: 302 NIDGSEKDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGM 481
Query: 124 LSTRLRSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIK 183
+ R ++ + + LP+S DWR +GAV ++ DQG CG CWAF+ +AAVEGINKI
Sbjct: 482 MMARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIV 661
Query: 184 SGKLISLSEQELIDCDVKSGNQGCQGGLMETA 215
+G+L+SLSEQEL+DCD ++ N GC GGLME A
Sbjct: 662 TGELVSLSEQELVDCD-RTVNAGCDGGLMEYA 754
>TC85914 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (58%)
Length = 773
Score = 270 bits (689), Expect = 8e-73
Identities = 123/197 (62%), Positives = 154/197 (77%), Gaps = 1/197 (0%)
Frame = +3
Query: 163 QCGGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIEN 222
QCG CWAF+AV A EGI+K+ +G+L+SLSEQEL+DCD K +QGC+GGLM+ A+ FII+N
Sbjct: 18 QCGCCWAFSAVPAPEGIHKLSTGRLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQN 197
Query: 223 GGLTTEQDYPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAG 282
GL TE YPY+GVDGTC KA+ +A +I+GYE+VPA+NE L+ A A+QP+SVAIDA
Sbjct: 198 HGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVAIDAS 377
Query: 283 GYSFQFYSEGVFSGICGKQLNHGVTVVGYG-KETINKYWIVKNSWGADWGESGYIRMKRD 341
G FQFY GVF+G CG +L+HGVT VGYG KYW+VKN WG DWGE GYI+M+R
Sbjct: 378 GSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNLWGTDWGEEGYIKMQRG 557
Query: 342 TLSKEGMCGIAMQASYP 358
+ EG+CGIAM+ASYP
Sbjct: 558 VDAAEGLCGIAMEASYP 608
>TC85915 homologue to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (54%)
Length = 709
Score = 254 bits (650), Expect = 3e-68
Identities = 120/188 (63%), Positives = 148/188 (77%), Gaps = 1/188 (0%)
Frame = +1
Query: 169 AFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTE 228
AF+AVAA EGI K+ +GKL+SLSEQEL+DCD K +QGC+GGLM+ A+ FII+N GL+TE
Sbjct: 1 AFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTE 180
Query: 229 QDYPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQF 288
YPY+GVDGTC KA+ +AA+I+GYE+VPA+NE L+ A A+QP+SVAIDA G FQF
Sbjct: 181 AAYPYQGVDGTCNANKASIHAATITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQF 360
Query: 289 YSEGVFSGICGKQLNHGVTVVGYG-KETINKYWIVKNSWGADWGESGYIRMKRDTLSKEG 347
Y GVFSG CG +L+HGVT VGYG KYW+VKNSWG DWG+ GYIRM+R + E
Sbjct: 361 YKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGQEGYIRMQRGMDAPEX 540
Query: 348 MCGIAMQA 355
+CGIAMQA
Sbjct: 541 LCGIAMQA 564
>TC87304 similar to GP|13897890|gb|AAK48495.1 putative cysteine protease
{Ipomoea batatas}, partial (24%)
Length = 1325
Score = 235 bits (599), Expect = 2e-62
Identities = 145/362 (40%), Positives = 208/362 (57%), Gaps = 15/362 (4%)
Frame = +3
Query: 13 LAMMTSTILTTTIFILLMLCNTCVIASESECPPTHK-QKSSDVEAMKKRFDGWVKRHGRK 71
++ MT IL+ I I + C + ++SE K K S E + + F W K HGR
Sbjct: 36 ISKMTKFILSFLILISIT-CLSFALSSEYSISSHGKLDKFSSDEEVFELFQMWKKEHGRD 212
Query: 72 YKHNDEREV-RFGIYQANVQYIQCKNAQKNS---YNLTDNKFADLTNEEFQSTYMG-LST 126
Y +++E RF I++ N +YI NA++ S + L+ NKFAD++ EEF TY+ +
Sbjct: 213 YANSEEENAKRFEIFKTNFKYINEMNAKRKSQTQHRLSLNKFADMSPEEFSKTYLPKIEM 392
Query: 127 RLRSHNTGFRYDEHGD---LPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIK 183
++ S+ + + D LP S DWR++GAVTE+ DQG C WAF+ A+EG+NKI
Sbjct: 393 QVPSNRDNAKLKDDDDCENLPTSVDWREKGAVTEVRDQGDCQSHWAFSVTGAIEGLNKIV 572
Query: 184 SGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKME 243
+G LI+LS QEL+DCD S +GC GG A+ ++IENGG+ TE +YPY +GTCK
Sbjct: 573 TGNLINLSAQELVDCDPAS--KGCAGGFYFNAFGYVIENGGIDTEANYPYLAKNGTCK-- 740
Query: 244 KAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLN 303
+ A+ SI + EA L + QPVSV++DA G QFY+ GV+ G K+ +
Sbjct: 741 ENANKVVSIDNLLVLDGTEEA-LLCRTSKQPVSVSLDATG--LQFYAGGVYGGENCKKES 911
Query: 304 HGVTVVG--YGKETIN--KYWIVKNSWGADWGESGYIRMKRDTLS--KEGMCGIAMQASY 357
+VG G +++N YWIVKNSWG DWGE GY+ +KR+ G+C I Y
Sbjct: 912 RNANLVGLIVGYDSVNGEDYWIVKNSWGKDWGEKGYLFIKRNVFEDWPFGVCAINAAVGY 1091
Query: 358 PL 359
P+
Sbjct: 1092 PV 1097
>TC85449 homologue to PIR|S71923|S71923 cysteine proteinase (EC 3.4.22.-) -
garden pea, complete
Length = 1534
Score = 223 bits (568), Expect = 9e-59
Identities = 126/305 (41%), Positives = 170/305 (55%), Gaps = 5/305 (1%)
Frame = +3
Query: 61 FDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQST 120
F + R+G++Y DE + RF I+ N+Q I+ N ++ Y L N FAD T EEF+S
Sbjct: 219 FARFANRYGKRYDTVDEMKRRFKIFSENLQLIKSTNKKRLGYTLGVNHFADWTWEEFRSH 398
Query: 121 YMGLSTRLRSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGIN 180
+G + + G LP KDWRKEG V+E+ DQG CG CW F+ A+E
Sbjct: 399 RLGAAQNCSATLKGNHRITDVVLPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAY 578
Query: 181 KIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTC 240
GK ISLSEQ+L+DC N GC GGL A+ +I NGGL TE+ YPY G +G C
Sbjct: 579 AQAFGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTGQNGLC 758
Query: 241 KMEKAAHYAASISGYEEVPADNEAKLKAAAAH-QPVSVAIDAGGYSFQFYSEGVF-SGIC 298
K + + A + G + E +LK A A +PVSVA F+ Y +GV+ S C
Sbjct: 759 KF-TSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFQVVD-DFRLYKKGVYTSTTC 932
Query: 299 GK---QLNHGVTVVGYGKETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQA 355
G +NH V VGYG E YW++KNSWG +WG+ GY +M+ + MCG+A +
Sbjct: 933 GSTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGGEWGDHGYFKMEMG----KNMCGVATCS 1100
Query: 356 SYPLV 360
SYP+V
Sbjct: 1101 SYPVV 1115
>TC76560 homologue to SP|P25804|CYSP_PEA Cysteine proteinase 15A precursor
(EC 3.4.22.-) (Turgor-responsive protein 15A). [Garden
pea], partial (98%)
Length = 1642
Score = 209 bits (532), Expect = 1e-54
Identities = 118/314 (37%), Positives = 175/314 (55%), Gaps = 18/314 (5%)
Frame = +3
Query: 61 FDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQST 120
F + + + Y +E + RFG++++N+ + S KF+DLT EF+
Sbjct: 309 FTSFKSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLDPSAQHGITKFSDLTASEFRRQ 488
Query: 121 YMGLSTRLR--SHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEG 178
++GL+ RLR +H +LPE DWR++GAVT + DQG CG CWAF+ A+EG
Sbjct: 489 FLGLNKRLRLPAHAQKAPILPTNNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEG 668
Query: 179 INKIKSGKLISLSEQELIDCD-------VKSGNQGCQGGLMETAYTFIIENGGLTTEQDY 231
N + +GKL SLSEQ+L+DCD S + GC GGLM A+ +I+++GG+ +E+DY
Sbjct: 669 ANYLATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGLMNNAFEYILQSGGVVSEKDY 848
Query: 232 PYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSE 291
Y G DG+CK +K + AS+S + V D + + P++VAI+A Q Y
Sbjct: 849 AYTGRDGSCKFDK-SKVVASVSNFSVVSLDEDQIAANLVKNGPLAVAINAAW--MQTYMS 1019
Query: 292 GVFSG-ICGK-QLNHGVTVVGYGKETI-------NKYWIVKNSWGADWGESGYIRMKRDT 342
GV IC K +L+HGV +VG+G+ YWI+KNSWG +WGE GY ++ R
Sbjct: 1020 GVSCPYICAKARLDHGVLLVGFGQGGYAPIRLKEKPYWIIKNSWGQNWGEEGYYKICRG- 1196
Query: 343 LSKEGMCGIAMQAS 356
+CG+ S
Sbjct: 1197 ---RNVCGVDSMVS 1229
>TC85912 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (60%)
Length = 718
Score = 203 bits (516), Expect = 1e-52
Identities = 99/189 (52%), Positives = 132/189 (69%), Gaps = 5/189 (2%)
Frame = +2
Query: 57 MKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNS--YNLTDNKFADLTN 114
M +R W+ ++G+ YK + ERE RF I+ NV YI+ N N+ Y L N+FADLTN
Sbjct: 152 MYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADLTN 331
Query: 115 EEFQST---YMGLSTRLRSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFA 171
+EF S+ + G + + F+Y+ +P S DWRK+GAVT + +QGQCG CWAF+
Sbjct: 332 DEFTSSRNKFKGHMCSSITRTSTFKYENASAIPSSVDWRKKGAVTPVKNQGQCGCCWAFS 511
Query: 172 AVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDY 231
AVAA EGI+K+ +GKLISLSEQEL+DCD K +Q C+GGLM+ A+ FII+N GL TE +Y
Sbjct: 512 AVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQSCEGGLMDDAFKFIIQNHGLNTEANY 691
Query: 232 PYEGVDGTC 240
PY+GVDGTC
Sbjct: 692 PYQGVDGTC 718
>TC92862 similar to GP|10336513|dbj|BAB13759. cysteine proteinase
{Astragalus sinicus}, partial (41%)
Length = 581
Score = 188 bits (477), Expect = 3e-48
Identities = 86/140 (61%), Positives = 106/140 (75%), Gaps = 1/140 (0%)
Frame = +2
Query: 220 IENGGLTTEQDYPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAI 279
I+N GL TE YPY+GVDGTC KA+ +A +I+GYE+VPA+NE L+ A A+QP+SV I
Sbjct: 2 IQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVTI 181
Query: 280 DAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYG-KETINKYWIVKNSWGADWGESGYIRM 338
DA G FQFY GVF+G CG +L+HGVT VGYG KYW+VKNSWG DWGE GYI+M
Sbjct: 182 DASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKM 361
Query: 339 KRDTLSKEGMCGIAMQASYP 358
+R + EG+CGIAM+ASYP
Sbjct: 362 QRGVDAAEGLCGIAMEASYP 421
>TC81601 similar to GP|13491750|gb|AAK27968.1 cysteine protease {Ipomoea
batatas}, partial (49%)
Length = 725
Score = 184 bits (466), Expect = 6e-47
Identities = 89/179 (49%), Positives = 125/179 (69%), Gaps = 2/179 (1%)
Frame = +1
Query: 56 AMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNS-YNLTDNKFADLTN 114
++++R + W+ HG+ Y+ E+E RF I++ NV++I+ NA N Y L+ N ADLT
Sbjct: 187 SLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTL 366
Query: 115 EEFQSTYMGLSTRLRSHNT-GFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAV 173
+EF+++ G R T F+Y+ +P + DWR +GAVT I DQGQCG CWAF+ V
Sbjct: 367 DEFKASRNGYKKIDREFTTTSFKYENVTAIPAAVDWRVKGAVTPIKDQGQCGSCWAFSTV 546
Query: 174 AAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYP 232
AA EGIN+I +GKL+SLSEQEL+DCD K +QGC+GGLME + FII+NGG+T+E +YP
Sbjct: 547 AATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNYP 723
>TC90718 similar to GP|13491750|gb|AAK27968.1 cysteine protease {Ipomoea
batatas}, partial (44%)
Length = 796
Score = 181 bits (460), Expect = 3e-46
Identities = 91/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Frame = +2
Query: 50 KSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNS--YNLTDN 107
+S V++M +R + W+ ++ + YK ERE R I+ ANV YI+ N N+ Y L N
Sbjct: 152 RSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYIEVFNNDANNKLYKLGIN 331
Query: 108 KFADLTNEEF---QSTYMGLSTRLRSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQC 164
+FADLTNEEF ++ + G + T F+Y+ +P + DWRK+GAVT + +QGQC
Sbjct: 332 QFADLTNEEFIASRNKFKGHMCSSIAKTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQC 511
Query: 165 GGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGG 224
G CWAF+AVAA EGI K+ +GKL+SLSE EL+DCD K +QGC+GGLM+ A+ FII+N G
Sbjct: 512 GCCWAFSAVAATEGITKLSTGKLVSLSEXELVDCDTKGVDQGCEGGLMDXAFKFIIQNLG 691
Query: 225 LTTEQDYP 232
L TE P
Sbjct: 692 LXTEAAXP 715
>AW775259 similar to GP|13491750|gb cysteine protease {Ipomoea batatas},
partial (46%)
Length = 654
Score = 172 bits (435), Expect = 2e-43
Identities = 91/200 (45%), Positives = 129/200 (64%), Gaps = 2/200 (1%)
Frame = +3
Query: 25 IFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGI 84
+F+LL L + + S +K + ++++R + W+ +G+ YK E+E RF I
Sbjct: 81 LFLLLALADITNVMS---------RKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMI 233
Query: 85 YQANVQYIQCKNAQKNS-YNLTDNKFADLTNEEFQSTYMGLSTRLRSH-NTGFRYDEHGD 142
++ NV++I+ NA N Y L+ N ADLT +EF+++ G R T F+Y+
Sbjct: 234 FKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKASRNGYKKIDREFATTSFKYENVTA 413
Query: 143 LPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKS 202
+PE+ DWR +GAVT I DQGQCG CWAF+ VAA+EGIN+I +GKLISLSEQEL+DCD K
Sbjct: 414 IPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKG 593
Query: 203 GNQGCQGGLMETAYTFIIEN 222
+QGC+GGLME + FII N
Sbjct: 594 EDQGCEGGLMEDGFEFII*N 653
>TC81254 similar to GP|600111|emb|CAA84378.1| cysteine proteinase {Vicia
sativa}, partial (61%)
Length = 692
Score = 171 bits (433), Expect = 4e-43
Identities = 91/185 (49%), Positives = 118/185 (63%), Gaps = 7/185 (3%)
Frame = +1
Query: 61 FDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQST 120
++ W + H + DE+ RF +++ANV ++ N Y L NKFAD+TN EF+S
Sbjct: 124 YERW-RSHHTVTRSLDEKNNRFNVFKANVMHVHNTNKLDKPYKLKLNKFADMTNYEFRSI 300
Query: 121 YMGLST------RLRSHNTG-FRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAV 173
Y R SH+ G F Y+ +P S DWRK GAVT + DQGQCG CWAF+ +
Sbjct: 301 YADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAVTGVKDQGQCGSCWAFSTI 480
Query: 174 AAVEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPY 233
AVEGIN+IK+ KL+SLSEQEL+DCD + NQGC GGLME A+ FI +N G+TTE +YPY
Sbjct: 481 VAVEGINQIKTQKLVSLSEQELVDCDTEV-NQGCNGGLMEYAFEFIKQN-GITTETNYPY 654
Query: 234 EGVDG 238
DG
Sbjct: 655 AAKDG 669
>BG583083 similar to PIR|B84752|B847 probable cysteine proteinase [imported]
- Arabidopsis thaliana, partial (27%)
Length = 740
Score = 161 bits (407), Expect = 4e-40
Identities = 85/195 (43%), Positives = 130/195 (66%), Gaps = 8/195 (4%)
Frame = +2
Query: 77 EREVRFGIYQANVQYIQ-CKNAQKNSYNLTDNKFADLTNEEFQSTYMGL-------STRL 128
E E R I++ N++YI+ NA SY L N+++DLT++EF +++ GL S+++
Sbjct: 128 ELEKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSSKM 307
Query: 129 RSHNTGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLI 188
RS F ++ D+P + DWR++GAVT++ DQG CG CWAF+ VAAVEG KI +G+LI
Sbjct: 308 RSAAVPFNLND--DVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGELI 481
Query: 189 SLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHY 248
SLSEQ+L+DCD + N GC GG M++A+ +II+ G+ +E DYPY+ TC++ +
Sbjct: 482 SLSEQQLVDCDER--NSGCHGGNMDSAFKYIIQK-GIVSEADYPYQEGSQTCQLNDQMKF 652
Query: 249 AASISGYEEVPADNE 263
A I+ +VPA++E
Sbjct: 653 EAQITNLLDVPANDE 697
>BI311691 similar to PIR|T03941|T039 cysteine proteinase (EC 3.4.22.-)
precursor - common tobacco, partial (13%)
Length = 781
Score = 148 bits (374), Expect = 3e-36
Identities = 107/263 (40%), Positives = 142/263 (53%), Gaps = 20/263 (7%)
Frame = +3
Query: 56 AMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCKN-AQKNSYNLTDNKFADLTN 114
A + R++ W+K R Y + E+E RF I+ N++YI+ N A K +Y L N+F+DLTN
Sbjct: 18 ATETRYEQWMKEFERNYADDAEKEKRFKIFAENLEYIENFNRAGKQTYKLGLNQFSDLTN 197
Query: 115 EEFQSTY--MGLSTRLRS---------------HNTGFRYDEHGDLPESKDWRKEGAVTE 157
EEF + Y + L L S T + +P+S DW++ GAVT
Sbjct: 198 EEFAALYNCVDLKRELESSMVSTAGPIFNMSEISPTNSPKGKRKPIPDSVDWKESGAVTN 377
Query: 158 IMDQGQCGGCWAFAAVAAVEGINKIKSGK-LISLSEQELIDCDVKSGNQGCQGGLMETAY 216
+ QG C C+AFA AAVEGI KIK+ K L SLS QEL+DCD N GC+GG + A
Sbjct: 378 VKRQG-C--CYAFATTAAVEGIMKIKTDKELTSLSMQELVDCD--KANGGCEGGSVRKAL 542
Query: 217 TFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVS 276
++ N G+ + DYPY GTC K AA I GY + + E L A A QPV+
Sbjct: 543 EYMKTN-GIAKDVDYPYTEKVGTCLSNKKDR-AAKIDGY-VIVSPGEQNLLEAVAQQPVT 713
Query: 277 VAIDAGGYSFQFYSEGVF-SGIC 298
VAI A F+ Y G+F SG C
Sbjct: 714 VAI-AINDDFKKYESGIFGSGPC 779
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.317 0.132 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,609,731
Number of Sequences: 36976
Number of extensions: 160874
Number of successful extensions: 1002
Number of sequences better than 10.0: 63
Number of HSP's better than 10.0 without gapping: 933
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 937
length of query: 360
length of database: 9,014,727
effective HSP length: 97
effective length of query: 263
effective length of database: 5,428,055
effective search space: 1427578465
effective search space used: 1427578465
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (27.3 bits)
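The search-space and score figures in the statistics block above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the BLAST output: the function names are hypothetical, the division of the database length by 3 is inferred from the numbers in this report (27,044,181 nucleotide letters against a "length of database" of 9,014,727 for a translated search), and the bit-score and E-value formulas are the standard Karlin-Altschul conversions applied to the rounded gapped Lambda and K printed here, so the results are approximate.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 360            # "length of query"
DB_LETTERS = 27_044_181    # "Number of letters in database" (nucleotides)
DB_SEQS = 36_976           # "Number of sequences in database"
HSP_LEN = 97               # "effective HSP length"
LAMBDA_GAPPED = 0.267      # gapped Lambda (rounded as printed)
K_GAPPED = 0.0410          # gapped K (rounded as printed)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: for this translated search the database length is the
    nucleotide letter count divided by 3, as the report's own numbers suggest.
    """
    db_len = db_letters // 3
    eff_query = query_len - hsp_len
    eff_db = db_len - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
    )
    print(eff_q, eff_db, space)
    # Top hit TC85911: raw score 915.
    bits = bit_score(915)
    print(round(bits), expect_value(bits, space))
```

Run against the top hit (raw score 915), this gives a bit score that rounds to 357 and an E-value on the order of 5e-99, in line with the values reported for TC85911; small differences are expected because Lambda and K are printed with limited precision.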