Miyakogusa Predicted Gene
- Lj6g3v0315180.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0315180.2 Non Chatacterized Hit- tr|I1MZR2|I1MZR2_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.44312
PE,81.94,0,Peptidase_M16,Peptidase M16, N-terminal;
Peptidase_M16_C,Peptidase M16, C-terminal; seg,NULL; no
des,CUFF.57708.2
(1040 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G06900.1 | Symbols: | Insulinase (Peptidase family M16) fami... 1390 0.0
AT2G41790.1 | Symbols: | Insulinase (Peptidase family M16) fami... 486 e-137
AT3G57470.1 | Symbols: | Insulinase (Peptidase family M16) fami... 434 e-121
AT3G57470.2 | Symbols: | Insulinase (Peptidase family M16) fami... 431 e-121
AT3G57470.3 | Symbols: | Insulinase (Peptidase family M16) fami... 425 e-119
AT5G01440.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 137 5e-32
AT3G02090.1 | Symbols: MPPBETA | Insulinase (Peptidase family M1... 60 7e-09
AT3G02090.2 | Symbols: MPPBETA | Insulinase (Peptidase family M1... 60 7e-09
AT3G57460.1 | Symbols: | catalytics;metal ion binding | chr3:21... 58 3e-08
>AT1G06900.1 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr1:2115155-2120635 REVERSE LENGTH=1024
Length = 1024
Score = 1390 bits (3598), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 655/916 (71%), Positives = 770/916 (84%), Gaps = 2/916 (0%)
Query: 125 MCVGIGSFSDPNEAQGLAHFLEHMLFMGSDEFPDENEYDSYLSKHGGSSNAYTETEYTCY 184
MCV +GSF DP EAQGLAHFLEHMLFMGS EFPDENEYDSYLSKHGGSSNAYTE E+TCY
Sbjct: 111 MCVSMGSFLDPPEAQGLAHFLEHMLFMGSTEFPDENEYDSYLSKHGGSSNAYTEMEHTCY 170
Query: 185 HFEVKREFLKGALGRFSQFFISPLVKMEAMEREVLAVDSEFNQVLQSDACRLQQLQCHTS 244
HFEVKREFL+GAL RFSQFF++PL+K EAMEREVLAVDSEFNQ LQ+DACRLQQLQC+TS
Sbjct: 171 HFEVKREFLQGALKRFSQFFVAPLMKTEAMEREVLAVDSEFNQALQNDACRLQQLQCYTS 230
Query: 245 ALNHPLNKFFWGNKKSLVDAMEKGINLREKILKLYEDYYHGGLMKLVVIGGESLDVLESW 304
A HP N+F WGNKKSL AME G++LRE I+KLY++YYHGGLMKLVVIGGESLD+LESW
Sbjct: 231 AKGHPFNRFAWGNKKSLSGAMENGVDLRECIVKLYKEYYHGGLMKLVVIGGESLDMLESW 290
Query: 305 VVELFSAVKKGPQVNPEFTVEGPIWKSGKVYRLEAVKDVHILDLAWTLPCLHKEYLKKPQ 364
VVELF VK G ++ P EGPIWK GK+YRLEAVKDVHILDL WTLP L Y+KKP+
Sbjct: 291 VVELFGDVKNGSKIRPTLEAEGPIWKGGKLYRLEAVKDVHILDLTWTLPPLRSAYVKKPE 350
Query: 365 DYLAHLLGHEGRGSLHFFLKARGWATSLSAGVGDEGIYCSTIAYVFVMSMHLTDSGIEKI 424
DYLAHLLGHEGRGSLH FLKA+GWATSLSAGVGD+GI S++AYVF MS+HLTDSG+EKI
Sbjct: 351 DYLAHLLGHEGRGSLHSFLKAKGWATSLSAGVGDDGINRSSLAYVFGMSIHLTDSGLEKI 410
Query: 425 YDIIGFVYQYLKLLRQVSPQEWIFKELQNIGNMEFKFAEEQPQDDYAAELAENLNFYPPE 484
YDIIG++YQYLKLLR VSPQEWIFKELQ+IGNM+F+FAEEQP DDYAAEL+EN+ YP E
Sbjct: 411 YDIIGYIYQYLKLLRDVSPQEWIFKELQDIGNMDFRFAEEQPADDYAAELSENMLAYPVE 470
Query: 485 HVIYGDYLYKTWDGQLLEQVLGFFIPENMRVDVVSKVLKSEDFKYETWFGSRYIVEDIAQ 544
HVIYGDY+Y+TWD +L+E ++GFF P+NMR+DVVSK +KSE+F+ E WFGS YI ED+
Sbjct: 471 HVIYGDYVYQTWDPKLIEDLMGFFTPQNMRIDVVSKSIKSEEFQQEPWFGSSYIEEDVPL 530
Query: 545 NLMELWRNPPEIDASLHLPSKNEFIPSDFSIRAGDTSDDDSANLTSPKCIADEALIKFWY 604
+LME W NP E+D SLHLPSKN+FIP DFSIRA + SD D + + P+CI DE +KFWY
Sbjct: 531 SLMESWSNPSEVDNSLHLPSKNQFIPCDFSIRAIN-SDVDPKSQSPPRCIIDEPFMKFWY 589
Query: 605 KLDSTFKVPRANTYFRINLKGGYDNVKSCVLSELFIHLLKDELNEVIYQASVAKLETSVS 664
KLD TFKVPRANTYFRINLKG Y +VK+C+L+EL+I+LLKDELNE+IYQAS+AKLETS+S
Sbjct: 590 KLDETFKVPRANTYFRINLKGAYASVKNCLLTELYINLLKDELNEIIYQASIAKLETSLS 649
Query: 665 YFGDMLELKVYGFNEKLPVLLSKILSVARSFMPTEDRYKVIKEDVERTLKNTNMKPLSHS 724
+GD LELKVYGFNEK+P LLSKIL++A+SFMP +R+KVIKE++ER +NTNMKPL+HS
Sbjct: 650 MYGDKLELKVYGFNEKIPALLSKILAIAKSFMPNLERFKVIKENMERGFRNTNMKPLNHS 709
Query: 725 SYLRLQILCESFYDADDKLHCLNDLFLDDLKAFIPELRSQLYIEGLCHGNLSEDEAINIS 784
+YLRLQ+LC+ YD+D+KL LNDL LDDL +FIPELRSQ++IE LCHGNLSEDEA+NIS
Sbjct: 710 TYLRLQLLCKRIYDSDEKLSVLNDLSLDDLNSFIPELRSQIFIEALCHGNLSEDEAVNIS 769
Query: 785 NIFKMNFPVNPLPIKLRHAERVICLPSNANLVRDVGVKNKLDKNSVVELYFQIEQDFGMG 844
NIFK + V PLP K RH E++ C P A LVRDV VKNK + NSVVELY+QIE +
Sbjct: 770 NIFKDSLTVEPLPSKCRHGEQITCFPMGAKLVRDVNVKNKSETNSVVELYYQIEPEEAQ- 828
Query: 845 SMKLKALIDLVDEIVEEPLYNQLRTKEQLGYVVECSPRVTYRVFGFCFCIQSSEYNPVYL 904
S + KA++DL EI+EEPL+NQLRTKEQLGYVVEC PR+TYRV GFCFC+QSS+Y PV+L
Sbjct: 829 STRTKAVLDLFHEIIEEPLFNQLRTKEQLGYVVECGPRLTYRVHGFCFCVQSSKYGPVHL 888
Query: 905 QGRVDNFINXXXXXXXXXXXXSFENYKSGLTAKLLEKDPSLTYESNRLWNQIVDKRYIFD 964
GRVDNFI S+E+Y+SG+ A+LLEKDPSL E+N LW+QIVDKRY+FD
Sbjct: 889 LGRVDNFIKDIEGLLEQLDDESYEDYRSGMIARLLEKDPSLLSETNDLWSQIVDKRYMFD 948
Query: 965 VSEKEAEELRNISKNDVVEWYKTYLKPSSPKCRRLLVRVWGCNTDLKDAEALSKSEQVIT 1024
S KEAEELR+I K DV+ WYKTY + SSPKCRRL VRVWGC+T++K+ + K+ QVI
Sbjct: 949 FSHKEAEELRSIQKKDVISWYKTYFRESSPKCRRLAVRVWGCDTNMKETQTDQKAVQVIA 1008
Query: 1025 DPAAFKKESEFYPSFC 1040
D AFK S+FYPS C
Sbjct: 1009 DAVAFKSTSKFYPSLC 1024
Score = 66.2 bits (160), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/37 (78%), Positives = 34/37 (91%)
Query: 12 DDVVVKSPNDLRLYRLIELHNGLQALLVHDPEIYPDG 48
D+VVVKSPND RLYR+IEL NGL ALL+HDP+IYP+G
Sbjct: 11 DNVVVKSPNDRRLYRVIELENGLCALLIHDPDIYPEG 47
>AT2G41790.1 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr2:17429453-17436110 REVERSE LENGTH=970
Length = 970
Score = 486 bits (1252), Expect = e-137, Method: Compositional matrix adjust.
Identities = 310/934 (33%), Positives = 491/934 (52%), Gaps = 41/934 (4%)
Query: 125 MCVGIGSFSDPNEAQGLAHFLEHMLFMGSDEFPDENEYDSYLSKHGGSSNAYTETEYTCY 184
M V +GSFSDP +GLAHFLEHMLF S+++P+E+ Y Y+++HGGS+NAYT +E T Y
Sbjct: 51 MSVSVGSFSDPQGLEGLAHFLEHMLFYASEKYPEEDSYSKYITEHGGSTNAYTASEETNY 110
Query: 185 HFEVKREFLKGALGRFSQFFISPLVKMEAMEREVLAVDSEFNQVLQSDACRLQQLQCHTS 244
HF+V + AL RF+QFFI PL+ +A RE+ AVDSE + L SD R++QLQ H S
Sbjct: 111 HFDVNADCFDEALDRFAQFFIKPLMSADATMREIKAVDSENQKNLLSDGWRIRQLQKHLS 170
Query: 245 ALNHPLNKFFWGNKKSL-VDAMEKGINLREKILKLYEDYYHGGLMKLVVIGGESLDVLES 303
+HP +KF GN +L V KG++ R +++K YE++Y +M LVV G ESLD ++
Sbjct: 171 KEDHPYHKFSTGNMDTLHVRPQAKGVDTRSELIKFYEEHYSANIMHLVVYGKESLDKIQD 230
Query: 304 WVVELFSAVKKGPQVNPEFTVEGPIWKSGKV-YRLEAVKDVHILDLAWTL-PCLHKEYLK 361
V +F ++ +V P F + ++ + +K H L ++W + P +H Y +
Sbjct: 231 LVERMFQEIQNTNKVVPRFPGQPCTADHLQILVKAIPIKQGHKLGVSWPVTPSIH-HYDE 289
Query: 362 KPQDYLAHLLGHEGRGSLHFFLKARGWATSLSAGVGDEGIYCSTIAYVFV-MSMHLTDSG 420
P YL HL+GHEG GSL LK GWAT LSAG G+ T+ Y F +S+ LTD+G
Sbjct: 290 APSQYLGHLIGHEGEGSLFHALKTLGWATGLSAGEGE-----WTLDYSFFKVSIDLTDAG 344
Query: 421 IEKIYDIIGFVYQYLKLLRQVSPQEWIFKELQNIGNMEFKFAEEQPQDDYAAELAENLNF 480
E + +I+G ++ Y++LL+Q +WIF EL I +F + ++ P Y ++A N+
Sbjct: 345 HEHMQEILGLLFNYIQLLQQTGVCQWIFDELSAICETKFHYQDKIPPMSYIVDIASNMQI 404
Query: 481 YPPEHVIYGDYLYKTWDGQLLEQVLGFFIPENMRVDVVSKVLKSEDFKYETWFGSRYIVE 540
YP + + G L ++ ++++V+ P N R+ S+ + + K E W+ + Y +E
Sbjct: 405 YPTKDWLVGSSLPTKFNPAIVQKVVDELSPSNFRIFWESQKFEGQTDKAEPWYNTAYSLE 464
Query: 541 DIAQNLMELW-RNPPEIDASLHLPSKNEFIPSDFSIRAGDTSDDDSANLTSPKCIADEAL 599
I + ++ W ++ P D LHLP+ N FIP+D S++ D + T P +
Sbjct: 465 KITSSTIQEWVQSAP--DVHLHLPAPNVFIPTDLSLKDADDKE------TVPVLLRKTPF 516
Query: 600 IKFWYKLDSTFKVPRANTYFRINLKGGYDNVKSCVLSELFIHLLKDELNEVIYQASVAKL 659
+ WYK D+ F P+A N + + VL+++F LL D LNE Y A VA L
Sbjct: 517 SRLWYKPDTMFSKPKAYVKMDFNCPLAVSSPDAAVLTDIFTRLLMDYLNEYAYYAQVAGL 576
Query: 660 ETSVSYFGDMLELKVYGFNEKLPVLLSKILSVARSFMPTEDRYKVIKEDVERTLKNTNMK 719
VS + EL + G+N KL +LL ++ +F DR+ VIKE V + +N +
Sbjct: 577 YYGVSLSDNGFELTLLGYNHKLRILLETVVGKIANFEVKPDRFAVIKETVTKEYQNYKFR 636
Query: 720 -PLSHSSYLRLQILCESFYDADDKLHCLNDLFLDDLKAFIPELRSQLYIEGLCHGNLSED 778
P + Y IL + + ++L L+ L +D+ F+P L S+ +IE GN+ +
Sbjct: 637 QPYHQAMYYCSLILQDQTWPWTEELDVLSHLEAEDVAKFVPMLLSRTFIECYIAGNVENN 696
Query: 779 EAINISNIFKMNFPVNPLPI-----KLRH-AERVICLPSNANLVRDVGVKNKLDKNSVVE 832
EA ++ + +P PI +H RV+ L N D+NS +
Sbjct: 697 EAESMVKHIEDVLFNDPKPICRPLFPSQHLTNRVVKLGEGMKYFYHQDGSNPSDENSALV 756
Query: 833 LYFQIEQDFGMGSMKLKALIDLVDEIVEEPLYNQLRTKEQLGYVVECSPRVTYRVFGFCF 892
Y Q+ +D ++KL+ L + ++ ++QLRT EQLGY+ + R ++G F
Sbjct: 757 HYIQVHRDDFSMNIKLQ----LFGLVAKQATFHQLRTVEQLGYITALAQRNDSGIYGVQF 812
Query: 893 CIQSSEYNPVYLQGRVDNFINXXXXXXXXXXXXSFENYKSGLTAKL---LEKDPSLTYES 949
IQSS P ++ RV++ + S E++KS +TA + LEK +L ES
Sbjct: 813 IIQSSVKGPGHIDSRVESLLK---NFESKLYEMSNEDFKSNVTALIDMKLEKHKNLKEES 869
Query: 950 NRLWNQIVDKRYIFDVSEKEAEELRNISKNDVVEWYKTYLKPSSPKCRRLLVRVWGCNTD 1009
W +I F+ E E L+ + K ++++++ Y+K + + + L +RV+G
Sbjct: 870 RFYWREIQSGTLKFNRKEAEVSALKQLQKQELIDFFDEYIKVGAARKKSLSIRVYGSQ-H 928
Query: 1010 LK----DAEALSKSEQVITDPAAFKKESEFYPSF 1039
LK D + + I D F+K + SF
Sbjct: 929 LKEMASDKDEVPSPSVEIEDIVGFRKSQPLHGSF 962
>AT3G57470.1 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr3:21269828-21275886 REVERSE LENGTH=851
Length = 851
Score = 434 bits (1115), Expect = e-121, Method: Compositional matrix adjust.
Identities = 274/799 (34%), Positives = 424/799 (53%), Gaps = 28/799 (3%)
Query: 125 MCVGIGSFSDPNEAQGLAHFLEHMLFMGSDEFPDENEYDSYLSKHGGSSNAYTETEYTCY 184
M V +GSF+DP +GLAHFLEHMLF S+++P+E+ Y Y+++HGGS+NAYT +E T Y
Sbjct: 53 MNVSVGSFTDPEGLEGLAHFLEHMLFYASEKYPEEDSYSKYITEHGGSTNAYTSSEDTNY 112
Query: 185 HFEVKREFLKGALGRFSQFFISPLVKMEAMEREVLAVDSEFNQVLQSDACRLQQLQCHTS 244
HF++ + AL RF+QFFI PL+ +A RE+ AVDSE L SD+ R+ QLQ H S
Sbjct: 113 HFDINTDSFYEALDRFAQFFIQPLMSTDATMREIKAVDSEHQNNLLSDSWRMAQLQKHLS 172
Query: 245 ALNHPLNKFFWGNKKSL-VDAMEKGINLREKILKLYEDYYHGGLMKLVVIGGESLDVLES 303
+HP +KF GN +L V E G++ R +++K Y+++Y +M LVV G E+LD +
Sbjct: 173 REDHPYHKFSTGNMDTLHVRPEENGVDTRSELIKFYDEHYSANIMHLVVYGKENLDKTQG 232
Query: 304 WVVELFSAVKKGPQVNPEFTVEGPIWKSGKV-YRLEAVKDVHILDLAWTLPCLHKEYLKK 362
V LF ++ Q P F + +V + + H L ++W + Y +
Sbjct: 233 LVEALFQGIRNTNQGIPRFPGQPCTLDHLQVLVKAVPIMQGHELSVSWPVTPSISHYEEA 292
Query: 363 PQDYLAHLLGHEGRGSLHFFLKARGWATSLSAGVGDEGIYCSTIAYVFVMSMHLTDSGIE 422
P YL L+GHEG GSL LK GWAT L AG D + S F +S+ LTD+G E
Sbjct: 293 PCRYLGDLIGHEGEGSLFHALKILGWATGLYAGEADWSMEYS----FFNVSIDLTDAGHE 348
Query: 423 KIYDIIGFVYQYLKLLRQVSPQEWIFKELQNIGNMEFKFAEEQPQDDYAAELAENLNFYP 482
+ DI+G +++Y+K+L+Q +WIF EL I EF + + YA +++ N+ YP
Sbjct: 349 HMQDILGLLFEYIKVLQQSGVSQWIFDELSAICEAEFHYQAKIDPISYAVDISSNMKIYP 408
Query: 483 PEHVIYGDYLYKTWDGQLLEQVLGFFIPENMRVDVVSKVLKSEDFKYETWFGSRYIVEDI 542
+H + G L ++ ++++VL P N+R+ S + + K E W+ + Y +E I
Sbjct: 409 TKHWLVGSSLPSKFNPAIVQKVLDELSPNNVRIFWESNKFEGQTDKVEPWYNTAYSLEKI 468
Query: 543 AQNLMELW-RNPPEIDASLHLPSKNEFIPSDFSIRAGDTSDDDSANLTSPKCIADEALIK 601
+ ++ W ++ P D +L LP+ N FIP+DFS++ D D D P + + +
Sbjct: 469 TKFTIQEWMQSAP--DVNLLLPTPNVFIPTDFSLK--DLKDKD----IFPVLLRKTSYSR 520
Query: 602 FWYKLDSTFKVPRANTYFRINLKGGYDNVKSCVLSELFIHLLKDELNEVIYQASVAKLET 661
WYK D+ F P+A N + + VLS++F+ LL D LNE Y A A L+
Sbjct: 521 LWYKPDTKFFKPKAYVKMDFNCPLAVSSPDAAVLSDIFVWLLVDYLNEYAYYAQAAGLDY 580
Query: 662 SVSYFGDMLELKVYGFNEKLPVLLSKILSVARSFMPTEDRYKVIKEDVERTLKNTNM-KP 720
+S + EL + GFN KL +LL ++ F DR+ VIKE V + +N +P
Sbjct: 581 GLSLSDNGFELSLAGFNHKLRILLEAVIQKIAKFEVKPDRFSVIKETVTKAYQNNKFQQP 640
Query: 721 LSHSSYLRLQILCESFYDADDKLHCLNDLFLDDLKAFIPELRSQLYIEGLCHGNLSEDEA 780
++ +L + + ++L L+ L +DL F+P L S+ ++E GN+ +DEA
Sbjct: 641 HEQATNYCSLVLQDQIWPWTEELDALSHLEAEDLANFVPMLLSRTFVECYIAGNVEKDEA 700
Query: 781 INISN-----IFKMNFPV-NPLPIKLRHAERVICLPSNANLVRDVGVKNKLDKNSVVELY 834
++ +F + P+ PL RV L + N D+NS + Y
Sbjct: 701 ESMVKHIEDVLFTDSKPICRPLFPSQFLTNRVTELGTGMKHFYYQEGSNSSDENSALVHY 760
Query: 835 FQIEQD-FGMGSMKLKALIDLVDEIVEEPLYNQLRTKEQLGYVVECSPRVTYRVFGFCFC 893
Q+ +D F M S + L + I ++ ++QLRT EQLGY+ S V+G F
Sbjct: 761 IQVHKDEFSMNSK-----LQLFELIAKQDTFHQLRTIEQLGYITSLSLSNDSGVYGVQFI 815
Query: 894 IQSSEYNPVYLQGRVDNFI 912
IQSS P ++ RV++ +
Sbjct: 816 IQSSVKGPGHIDSRVESLL 834
>AT3G57470.2 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr3:21269428-21275400 REVERSE LENGTH=891
Length = 891
Score = 431 bits (1109), Expect = e-121, Method: Compositional matrix adjust.
Identities = 291/907 (32%), Positives = 461/907 (50%), Gaps = 33/907 (3%)
Query: 148 MLFMGSDEFPDENEYDSYLSKHGGSSNAYTETEYTCYHFEVKREFLKGALGRFSQFFISP 207
MLF S+++P+E+ Y Y+++HGGS+NAYT +E T YHF++ + AL RF+QFFI P
Sbjct: 1 MLFYASEKYPEEDSYSKYITEHGGSTNAYTSSEDTNYHFDINTDSFYEALDRFAQFFIQP 60
Query: 208 LVKMEAMEREVLAVDSEFNQVLQSDACRLQQLQCHTSALNHPLNKFFWGNKKSL-VDAME 266
L+ +A RE+ AVDSE L SD+ R+ QLQ H S +HP +KF GN +L V E
Sbjct: 61 LMSTDATMREIKAVDSEHQNNLLSDSWRMAQLQKHLSREDHPYHKFSTGNMDTLHVRPEE 120
Query: 267 KGINLREKILKLYEDYYHGGLMKLVVIGGESLDVLESWVVELFSAVKKGPQVNPEFTVEG 326
G++ R +++K Y+++Y +M LVV G E+LD + V LF ++ Q P F +
Sbjct: 121 NGVDTRSELIKFYDEHYSANIMHLVVYGKENLDKTQGLVEALFQGIRNTNQGIPRFPGQP 180
Query: 327 PIWKSGKV-YRLEAVKDVHILDLAWTLPCLHKEYLKKPQDYLAHLLGHEGRGSLHFFLKA 385
+V + + H L ++W + Y + P YL L+GHEG GSL LK
Sbjct: 181 CTLDHLQVLVKAVPIMQGHELSVSWPVTPSISHYEEAPCRYLGDLIGHEGEGSLFHALKI 240
Query: 386 RGWATSLSAGVGDEGIYCSTIAYVFVMSMHLTDSGIEKIYDIIGFVYQYLKLLRQVSPQE 445
GWAT L AG D + S F +S+ LTD+G E + DI+G +++Y+K+L+Q +
Sbjct: 241 LGWATGLYAGEADWSMEYS----FFNVSIDLTDAGHEHMQDILGLLFEYIKVLQQSGVSQ 296
Query: 446 WIFKELQNIGNMEFKFAEEQPQDDYAAELAENLNFYPPEHVIYGDYLYKTWDGQLLEQVL 505
WIF EL I EF + + YA +++ N+ YP +H + G L ++ ++++VL
Sbjct: 297 WIFDELSAICEAEFHYQAKIDPISYAVDISSNMKIYPTKHWLVGSSLPSKFNPAIVQKVL 356
Query: 506 GFFIPENMRVDVVSKVLKSEDFKYETWFGSRYIVEDIAQNLMELW-RNPPEIDASLHLPS 564
P N+R+ S + + K E W+ + Y +E I + ++ W ++ P D +L LP+
Sbjct: 357 DELSPNNVRIFWESNKFEGQTDKVEPWYNTAYSLEKITKFTIQEWMQSAP--DVNLLLPT 414
Query: 565 KNEFIPSDFSIRAGDTSDDDSANLTSPKCIADEALIKFWYKLDSTFKVPRANTYFRINLK 624
N FIP+DFS++ D D D P + + + WYK D+ F P+A N
Sbjct: 415 PNVFIPTDFSLK--DLKDKD----IFPVLLRKTSYSRLWYKPDTKFFKPKAYVKMDFNCP 468
Query: 625 GGYDNVKSCVLSELFIHLLKDELNEVIYQASVAKLETSVSYFGDMLELKVYGFNEKLPVL 684
+ + VLS++F+ LL D LNE Y A A L+ +S + EL + GFN KL +L
Sbjct: 469 LAVSSPDAAVLSDIFVWLLVDYLNEYAYYAQAAGLDYGLSLSDNGFELSLAGFNHKLRIL 528
Query: 685 LSKILSVARSFMPTEDRYKVIKEDVERTLKNTNM-KPLSHSSYLRLQILCESFYDADDKL 743
L ++ F DR+ VIKE V + +N +P ++ +L + + ++L
Sbjct: 529 LEAVIQKIAKFEVKPDRFSVIKETVTKAYQNNKFQQPHEQATNYCSLVLQDQIWPWTEEL 588
Query: 744 HCLNDLFLDDLKAFIPELRSQLYIEGLCHGNLSEDEAINISN-----IFKMNFPV-NPLP 797
L+ L +DL F+P L S+ ++E GN+ +DEA ++ +F + P+ PL
Sbjct: 589 DALSHLEAEDLANFVPMLLSRTFVECYIAGNVEKDEAESMVKHIEDVLFTDSKPICRPLF 648
Query: 798 IKLRHAERVICLPSNANLVRDVGVKNKLDKNSVVELYFQIEQD-FGMGSMKLKALIDLVD 856
RV L + N D+NS + Y Q+ +D F M S + L +
Sbjct: 649 PSQFLTNRVTELGTGMKHFYYQEGSNSSDENSALVHYIQVHKDEFSMNSK-----LQLFE 703
Query: 857 EIVEEPLYNQLRTKEQLGYVVECSPRVTYRVFGFCFCIQSSEYNPVYLQGRVDNFINXXX 916
I ++ ++QLRT EQLGY+ S V+G F IQSS P ++ RV++ +
Sbjct: 704 LIAKQDTFHQLRTIEQLGYITSLSLSNDSGVYGVQFIIQSSVKGPGHIDSRVESLLKDLE 763
Query: 917 XXXXXXXXXSFENYKSGLTAKLLEKDPSLTYESNRLWNQIVDKRYIFDVSEKEAEELRNI 976
F++ + L LEKD +L ES W +I F+ + E LR +
Sbjct: 764 SKFYNMSDEEFKSNVTNLIDMKLEKDKNLDEESWFYWAEIQTGTLKFNRIDAEVAALRLL 823
Query: 977 SKNDVVEWYKTYLKPSSPKCRRLLVRVWGCNTDLK----DAEALSKSEQVITDPAAFKKE 1032
K++ ++++ Y+K +P + L + V+G N LK D + + + I D F+K
Sbjct: 824 KKDEWIDFFDEYIKVDAPNKKSLSICVYG-NQHLKEMRNDKDKIPSTSIEIEDIVCFRKS 882
Query: 1033 SEFYPSF 1039
Y S
Sbjct: 883 QPLYGSL 889
>AT3G57470.3 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr3:21269428-21275400 REVERSE LENGTH=881
Length = 881
Score = 425 bits (1093), Expect = e-119, Method: Compositional matrix adjust.
Identities = 290/907 (31%), Positives = 458/907 (50%), Gaps = 43/907 (4%)
Query: 148 MLFMGSDEFPDENEYDSYLSKHGGSSNAYTETEYTCYHFEVKREFLKGALGRFSQFFISP 207
MLF S+++P+E+ Y Y+++HGGS+NAYT +E T YHF++ + AL RF+QFFI P
Sbjct: 1 MLFYASEKYPEEDSYSKYITEHGGSTNAYTSSEDTNYHFDINTDSFYEALDRFAQFFIQP 60
Query: 208 LVKMEAMEREVLAVDSEFNQVLQSDACRLQQLQCHTSALNHPLNKFFWGNKKSL-VDAME 266
L+ +A RE+ AVDSE L SD+ R+ QLQ H S +HP +KF GN +L V E
Sbjct: 61 LMSTDATMREIKAVDSEHQNNLLSDSWRMAQLQKHLSREDHPYHKFSTGNMDTLHVRPEE 120
Query: 267 KGINLREKILKLYEDYYHGGLMKLVVIGGESLDVLESWVVELFSAVKKGPQVNPEFTVEG 326
G++ R +++K Y+++Y +M LVV G E+LD + V LF ++ Q P F +
Sbjct: 121 NGVDTRSELIKFYDEHYSANIMHLVVYGKENLDKTQGLVEALFQGIRNTNQGIPRFPGQP 180
Query: 327 PIWKSGKV-YRLEAVKDVHILDLAWTLPCLHKEYLKKPQDYLAHLLGHEGRGSLHFFLKA 385
+V + + H L ++W + Y + P YL L+GHEG GSL LK
Sbjct: 181 CTLDHLQVLVKAVPIMQGHELSVSWPVTPSISHYEEAPCRYLGDLIGHEGEGSLFHALKI 240
Query: 386 RGWATSLSAGVGDEGIYCSTIAYVFVMSMHLTDSGIEKIYDIIGFVYQYLKLLRQVSPQE 445
GWAT L AG D + S F +S+ LTD+G E + DI+G +++Y+K+L+Q +
Sbjct: 241 LGWATGLYAGEADWSMEYS----FFNVSIDLTDAGHEHMQDILGLLFEYIKVLQQSGVSQ 296
Query: 446 WIFKELQNIGNMEFKFAEEQPQDDYAAELAENLNFYPPEHVIYGDYLYKTWDGQLLEQVL 505
WIF EL I EF + + YA +++ N+ YP +H + G L ++ ++++VL
Sbjct: 297 WIFDELSAICEAEFHYQAKIDPISYAVDISSNMKIYPTKHWLVGSSLPSKFNPAIVQKVL 356
Query: 506 GFFIPENMRVDVVSKVLKSEDFKYETWFGSRYIVEDIAQNLMELW-RNPPEIDASLHLPS 564
P N+R+ S + + K E W+ + Y +E I + ++ W ++ P D +L LP+
Sbjct: 357 DELSPNNVRIFWESNKFEGQTDKVEPWYNTAYSLEKITKFTIQEWMQSAP--DVNLLLPT 414
Query: 565 KNEFIPSDFSIRAGDTSDDDSANLTSPKCIADEALIKFWYKLDSTFKVPRANTYFRINLK 624
N FIP+DFS++ D D D P + + + WYK D+ F P+A N
Sbjct: 415 PNVFIPTDFSLK--DLKDKD----IFPVLLRKTSYSRLWYKPDTKFFKPKAYVKMDFNCP 468
Query: 625 GGYDNVKSCVLSELFIHLLKDELNEVIYQASVAKLETSVSYFGDMLELKVYGFNEKLPVL 684
+ + VLS++F+ LL D LNE Y A A L+ +S + EL + GFN KL +L
Sbjct: 469 LAVSSPDAAVLSDIFVWLLVDYLNEYAYYAQAAGLDYGLSLSDNGFELSLAGFNHKLRIL 528
Query: 685 LSKILSVARSFMPTEDRYKVIKEDVERTLKNTNM-KPLSHSSYLRLQILCESFYDADDKL 743
L ++ F DR+ VIKE V + +N +P ++ +L + + ++L
Sbjct: 529 LEAVIQKIAKFEVKPDRFSVIKETVTKAYQNNKFQQPHEQATNYCSLVLQDQIWPWTEEL 588
Query: 744 HCLNDLFLDDLKAFIPELRSQLYIEGLCHGNLSEDEAINISN-----IFKMNFPV-NPLP 797
L+ L +DL F+P L S+ ++E GN+ +DEA ++ +F + P+ PL
Sbjct: 589 DALSHLEAEDLANFVPMLLSRTFVECYIAGNVEKDEAESMVKHIEDVLFTDSKPICRPLF 648
Query: 798 IKLRHAERVICLPSNANLVRDVGVKNKLDKNSVVELYFQIEQD-FGMGSMKLKALIDLVD 856
RV L + N D+NS + Y Q+ +D F M S + L +
Sbjct: 649 PSQFLTNRVTELGTGMKHFYYQEGSNSSDENSALVHYIQVHKDEFSMNSK-----LQLFE 703
Query: 857 EIVEEPLYNQLRTKEQLGYVVECSPRVTYRVFGFCFCIQSSEYNPVYLQGRVDNFINXXX 916
I ++ ++QLRT EQLGY+ S V+G F IQSS P ++ RV++ +
Sbjct: 704 LIAKQDTFHQLRTIEQLGYITSLSLSNDSGVYGVQFIIQSSVKGPGHIDSRVESLLKDLE 763
Query: 917 XXXXXXXXXSFENYKSGLTAKLLEKDPSLTYESNRLWNQIVDKRYIFDVSEKEAEELRNI 976
F+ LEKD +L ES W +I F+ + E LR +
Sbjct: 764 SKFYNMSDEEFK----------LEKDKNLDEESWFYWAEIQTGTLKFNRIDAEVAALRLL 813
Query: 977 SKNDVVEWYKTYLKPSSPKCRRLLVRVWGCNTDLK----DAEALSKSEQVITDPAAFKKE 1032
K++ ++++ Y+K +P + L + V+G N LK D + + + I D F+K
Sbjct: 814 KKDEWIDFFDEYIKVDAPNKKSLSICVYG-NQHLKEMRNDKDKIPSTSIEIEDIVCFRKS 872
Query: 1033 SEFYPSF 1039
Y S
Sbjct: 873 QPLYGSL 879
>AT5G01440.1 | Symbols: | BEST Arabidopsis thaliana protein match is:
Insulinase (Peptidase family M16) family protein
(TAIR:AT1G06900.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:179822-181283 FORWARD
LENGTH=291
Length = 291
Score = 137 bits (344), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 75/191 (39%), Positives = 122/191 (63%), Gaps = 10/191 (5%)
Query: 818 DVGVKNKLDKNSVVELYFQIEQDFGMGSMKLKALIDLVDEIVEEPLYNQLRTKEQLGYVV 877
DV VKNK + NS+ ++YF+I+ + + + AL++L I+ + +YN+LR +E+LGY+V
Sbjct: 17 DVKVKNKFESNSLAKVYFRIKCEKAQEARQT-ALLNLFVSIISDSVYNKLRIEEKLGYLV 75
Query: 878 ECSPRVTYRVFGFCFCIQSSEYNPVYLQGRVDNFINXXXXXXXXXXXXSFENYKSGLTAK 937
EC R+ + V GF C+ SS+YNP +L R+ F+N F+++K+G+++K
Sbjct: 76 ECETRLIHGV-GFYVCVVSSDYNPCHLVRRIYKFMNGIRLEGLFDKM--FKDFKNGVSSK 132
Query: 938 LLEKDPSLTYESNRLWNQIVDKRYIFDVSEKEAEELRNISKNDVVEWYKTYLKPSSPKCR 997
L P + ++ W++IV + IFD +E +EL I+KND++EWYK Y++ SSPKC
Sbjct: 133 L----PHDSGKTK--WSEIVRESCIFDFYSEEKKELSLITKNDLIEWYKRYVRLSSPKCC 186
Query: 998 RLLVRVWGCNT 1008
+V +WGCNT
Sbjct: 187 SFVVSIWGCNT 197
>AT3G02090.1 | Symbols: MPPBETA | Insulinase (Peptidase family M16)
protein | chr3:365624-368526 FORWARD LENGTH=531
Length = 531
Score = 60.5 bits (145), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 89/213 (41%), Gaps = 11/213 (5%)
Query: 127 VGIGSFSDPNEAQGLAHFLEHMLFMGSDEFPDENEYDSYLSKHGGSSNAYTETEYTCYHF 186
+ GS + +E G AHFLEHM+F G+D + + GG NAYT E T Y+
Sbjct: 125 IDAGSRFESDETNGTAHFLEHMIFKGTDRRT-VRALEEEIEDIGGHLNAYTSREQTTYYA 183
Query: 187 EVKREFLKGALGRFSQFFISPLVKMEAMEREVLAVDSEFNQVL-QSDACRLQQLQCHTSA 245
+V + AL + + + + + RE + E +V Q+D L L H +A
Sbjct: 184 KVLDSNVNQALDVLADILQNSKFEEQRINRERDVILREMQEVEGQTDEVVLDHL--HATA 241
Query: 246 LNH-PLNKFFWGNKKSLVDAMEKGINLREKILKLYEDYYHGGLMKLVVIGGESLDVLESW 304
+ PL + G +++ K I RE + + +Y M + G + +
Sbjct: 242 FQYTPLGRTILGPAQNV-----KSIT-REDLQNYIKTHYTASRMVIAAAGAVKHEEVVEQ 295
Query: 305 VVELFSAVKKGPQVNPEFTVEGPIWKSGKVYRL 337
V +LF+ + P + P +G R+
Sbjct: 296 VKKLFTKLSSDPTTTSQLVANEPASFTGSEVRM 328
>AT3G02090.2 | Symbols: MPPBETA | Insulinase (Peptidase family M16)
protein | chr3:365624-368534 FORWARD LENGTH=535
Length = 535
Score = 60.1 bits (144), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 89/213 (41%), Gaps = 11/213 (5%)
Query: 127 VGIGSFSDPNEAQGLAHFLEHMLFMGSDEFPDENEYDSYLSKHGGSSNAYTETEYTCYHF 186
+ GS + +E G AHFLEHM+F G+D + + GG NAYT E T Y+
Sbjct: 125 IDAGSRFESDETNGTAHFLEHMIFKGTDRR-TVRALEEEIEDIGGHLNAYTSREQTTYYA 183
Query: 187 EVKREFLKGALGRFSQFFISPLVKMEAMEREVLAVDSEFNQVL-QSDACRLQQLQCHTSA 245
+V + AL + + + + + RE + E +V Q+D L L H +A
Sbjct: 184 KVLDSNVNQALDVLADILQNSKFEEQRINRERDVILREMQEVEGQTDEVVLDHL--HATA 241
Query: 246 LNH-PLNKFFWGNKKSLVDAMEKGINLREKILKLYEDYYHGGLMKLVVIGGESLDVLESW 304
+ PL + G +++ K I RE + + +Y M + G + +
Sbjct: 242 FQYTPLGRTILGPAQNV-----KSIT-REDLQNYIKTHYTASRMVIAAAGAVKHEEVVEQ 295
Query: 305 VVELFSAVKKGPQVNPEFTVEGPIWKSGKVYRL 337
V +LF+ + P + P +G R+
Sbjct: 296 VKKLFTKLSSDPTTTSQLVANEPASFTGSEVRM 328
>AT3G57460.1 | Symbols: | catalytics;metal ion binding |
chr3:21263086-21265797 REVERSE LENGTH=356
Length = 356
Score = 58.2 bits (139), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 57/104 (54%), Gaps = 7/104 (6%)
Query: 378 SLHFFLKARG-WATSLSAGVGDEGIYCSTIAYVFV-MSMHLTDSGIEKIYDIIGFVYQYL 435
S+H + +A WAT L AG D T+ Y F +S++LTD+ E + DI+G +++ +
Sbjct: 59 SIHHYEEAHARWATGLYAGEPDW-----TVEYSFFNVSINLTDACHEHMKDILGLLFRQI 113
Query: 436 KLLRQVSPQEWIFKELQNIGNMEFKFAEEQPQDDYAAELAENLN 479
KLL+Q +WIF EL I EF + + YA ++ N+
Sbjct: 114 KLLQQSGVSQWIFDELSAIFEAEFHYQAKIDPISYAVNISSNMT 157
Score = 54.7 bits (130), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 13/118 (11%)
Query: 597 EALIKFWYKLDSTFKVPRANTYFRINLKGGYDNVKSCVLSELFIHLLKDELNEVIYQASV 656
E +IKF ++D F P A + + VLS + LL D LNE Y A
Sbjct: 179 EKIIKFTIQMD--FNCPLA-----------VSSPATVVLSNSSVWLLVDYLNEYAYYAQA 225
Query: 657 AKLETSVSYFGDMLELKVYGFNEKLPVLLSKILSVARSFMPTEDRYKVIKEDVERTLK 714
A+L +S + EL + GFN KL +LL ++ +F DR+ V+K V R LK
Sbjct: 226 ARLHYGLSLSDNGFELSLTGFNHKLRILLEAVIQKMANFQVKPDRFSVVKVTVLRELK 283