Miyakogusa Predicted Gene
- Lj4g3v1983560.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1983560.1 Non Chatacterized Hit- tr|I1MZR2|I1MZR2_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.44312
PE,76.03,0,Peptidase_M16,Peptidase M16, N-terminal;
Peptidase_M16_C,Peptidase M16, C-terminal; N-ARGININE
DIBAS,gene.g55940.t1.1
(1013 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G06900.1 | Symbols: | Insulinase (Peptidase family M16) fami... 1217 0.0
AT2G41790.1 | Symbols: | Insulinase (Peptidase family M16) fami... 452 e-127
AT3G57470.1 | Symbols: | Insulinase (Peptidase family M16) fami... 406 e-113
AT3G57470.2 | Symbols: | Insulinase (Peptidase family M16) fami... 396 e-110
AT3G57470.3 | Symbols: | Insulinase (Peptidase family M16) fami... 389 e-108
AT5G01440.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 129 2e-29
AT3G02090.1 | Symbols: MPPBETA | Insulinase (Peptidase family M1... 59 2e-08
AT3G02090.2 | Symbols: MPPBETA | Insulinase (Peptidase family M1... 59 2e-08
AT3G57460.1 | Symbols: | catalytics;metal ion binding | chr3:21... 54 8e-07
>AT1G06900.1 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr1:2115155-2120635 REVERSE LENGTH=1024
Length = 1024
Score = 1217 bits (3150), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 594/923 (64%), Positives = 731/923 (79%), Gaps = 13/923 (1%)
Query: 93 MCVGVGSFSDPNEAQGLAHFLEHMLFMGSEEFPDENEYDSYLSKHGGSSNAHTEAEHTCY 152
MCV +GSF DP EAQGLAHFLEHMLFMGS EFPDENEYDSYLSKHGGSSNA+TE EHTCY
Sbjct: 111 MCVSMGSFLDPPEAQGLAHFLEHMLFMGSTEFPDENEYDSYLSKHGGSSNAYTEMEHTCY 170
Query: 153 KFEVKREYLKGALRRFSQFFISPLVKIEAMEREVLAVDSEFNKVLQYDHCRLQQLQRHTS 212
FEVKRE+L+GAL+RFSQFF++PL+K EAMEREVLAVDSEFN+ LQ D CRLQQLQ +TS
Sbjct: 171 HFEVKREFLQGALKRFSQFFVAPLMKTEAMEREVLAVDSEFNQALQNDACRLQQLQCYTS 230
Query: 213 ALNHPLNKFSCGNKKSLVDAMENGSITNLREKLLKFYEDYYHAGLMKLVVIGGESLNVLE 272
A HP N+F+ GNKKSL AMENG +LRE ++K Y++YYH GLMKLVVIGGESL++LE
Sbjct: 231 AKGHPFNRFAWGNKKSLSGAMENG--VDLRECIVKLYKEYYHGGLMKLVVIGGESLDMLE 288
Query: 273 SWVVELFSTVKNGPQVNPEFIVEGPMWKSGKVYRLEAVKDINILSLAWTLPSLDQDYLEK 332
SWVVELF VKNG ++ P EGP+WK GK+YRLEAVKD++IL L WTLP L Y++K
Sbjct: 289 SWVVELFGDVKNGSKIRPTLEAEGPIWKGGKLYRLEAVKDVHILDLTWTLPPLRSAYVKK 348
Query: 333 PDDYLAYLLRNEGRGSLIFFLRARGWATSLFAGIGNDGMYWSSIAYVFVISICLTDSGIE 392
P+DYLA+LL +EGRGSL FL+A+GWATSL AG+G+DG+ SS+AYVF +SI LTDSG+E
Sbjct: 349 PEDYLAHLLGHEGRGSLHSFLKAKGWATSLSAGVGDDGINRSSLAYVFGMSIHLTDSGLE 408
Query: 393 KMYDIIGFVYQYLKLLRQVSPQEWIFKELQNMGNMKFRFAEEQPQDDXXXXXXXXXXXXX 452
K+YDIIG++YQYLKLLR VSPQEWIFKELQ++GNM FRFAEEQP DD
Sbjct: 409 KIYDIIGYIYQYLKLLRDVSPQEWIFKELQDIGNMDFRFAEEQPADDYAAELSENMLAYP 468
Query: 453 PEHVIYGDYMYETWDEQLLQQVLGFFIPENMRVDVVSKFLKSSQDFKYETWFGSRYVEED 512
EHVIYGDY+Y+TWD +L++ ++GFF P+NMR+DVVSK +K S++F+ E WFGS Y+EED
Sbjct: 469 VEHVIYGDYVYQTWDPKLIEDLMGFFTPQNMRIDVVSKSIK-SEEFQQEPWFGSSYIEED 527
Query: 513 ISQNLMKLWRNPPEIDASFHLPSKNEFIPSDFSIRA--GEDDSANLTSPRCIVDEALIKF 570
+ +LM+ W NP E+D S HLPSKN+FIP DFSIRA + D + + PRCI+DE +KF
Sbjct: 528 VPLSLMESWSNPSEVDNSLHLPSKNQFIPCDFSIRAINSDVDPKSQSPPRCIIDEPFMKF 587
Query: 571 WYKPDSTFKVPRSNTYFQINLKGSRYDNVKSCVLSELFIHLLKDELNEIIYQASVAKLET 630
WYK D TFKVPR+NTYF+INLKG+ Y +VK+C+L+EL+I+LLKDELNEIIYQAS+AKLET
Sbjct: 588 WYKLDETFKVPRANTYFRINLKGA-YASVKNCLLTELYINLLKDELNEIIYQASIAKLET 646
Query: 631 RISYVGDHMLELKVSGFNEKLPVLLSKILSVTRSFMPTEDRYKVIKEDVMKRALKNTNMK 690
+S GD LELKV GFNEK+P LLSKIL++ +SFMP +R+KVIKE+ M+R +NTNMK
Sbjct: 647 SLSMYGDK-LELKVYGFNEKIPALLSKILAIAKSFMPNLERFKVIKEN-MERGFRNTNMK 704
Query: 691 PLSHSSYLRLQILCESFYDADDKLHCXXXXXXXXXXAFIPELRSQLYIEGLCHGNLSEDE 750
PL+HS+YLRLQ+LC+ YD+D+KL +FIPELRSQ++IE LCHGNLSEDE
Sbjct: 705 PLNHSTYLRLQLLCKRIYDSDEKLSVLNDLSLDDLNSFIPELRSQIFIEALCHGNLSEDE 764
Query: 751 AINISNIFRINFPLNINPPLIKLRHARRIVCLPSSANLVRDVGVKNKFDKNSAVELYFQI 810
A+NISNIF+ + L + P K RH +I C P A LVRDV VKNK + NS VELY+QI
Sbjct: 765 AVNISNIFKDS--LTVEPLPSKCRHGEQITCFPMGAKLVRDVNVKNKSETNSVVELYYQI 822
Query: 811 EQDFGMRSMKLKALIDLVEEIVKEPLFNQLRTKEQLGYVVDCSSRVTIRVLGFCFHIQSS 870
E + +S + KA++DL EI++EPLFNQLRTKEQLGYVV+C R+T RV GFCF +QSS
Sbjct: 823 EPEEA-QSTRTKAVLDLFHEIIEEPLFNQLRTKEQLGYVVECGPRLTYRVHGFCFCVQSS 881
Query: 871 EYNPVYLQGRIDNFINXXXXXXXXXXXXSFENYKSGLMAKLLEKDPSLTYESNRLWNQIL 930
+Y PV+L GR+DNFI S+E+Y+SG++A+LLEKDPSL E+N LW+QI+
Sbjct: 882 KYGPVHLLGRVDNFIKDIEGLLEQLDDESYEDYRSGMIARLLEKDPSLLSETNDLWSQIV 941
Query: 931 DKRYIFDVSKKKAEELRNISKNDVVEWYKTYLKPSSPKCRRLLVRVWGCNTDLKDNAEAL 990
DKRY+FD S K+AEELR+I K DV+ WYKTY + SSPKCRRL VRVWGC+T++K+ +
Sbjct: 942 DKRYMFDFSHKEAEELRSIQKKDVISWYKTYFRESSPKCRRLAVRVWGCDTNMKE-TQTD 1000
Query: 991 SKSMQVIITDPTAFKKESVFYPS 1013
K++QV I D AFK S FYPS
Sbjct: 1001 QKAVQV-IADAVAFKSTSKFYPS 1022
Score = 68.9 bits (167), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/37 (81%), Positives = 34/37 (91%)
Query: 14 DDVVVKSPNDRRLYRFIQLHNGLQALLVHDPEIYPEG 50
D+VVVKSPNDRRLYR I+L NGL ALL+HDP+IYPEG
Sbjct: 11 DNVVVKSPNDRRLYRVIELENGLCALLIHDPDIYPEG 47
>AT2G41790.1 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr2:17429453-17436110 REVERSE LENGTH=970
Length = 970
Score = 452 bits (1162), Expect = e-127, Method: Compositional matrix adjust.
Identities = 318/1009 (31%), Positives = 492/1009 (48%), Gaps = 74/1009 (7%)
Query: 17 VVKSPNDRRLYRFIQLHNGLQALLVHDPEIYPEGPPKPVPTDNXXXXXXXXXXXXXXXXX 76
++K D R YR I L N LQ LL+ DP+ TD
Sbjct: 15 ILKPRTDNREYRMIVLKNLLQVLLISDPD-----------TDKCAAS------------- 50
Query: 77 XXXXXXXXXXXXXXXXMCVGVGSFSDPNEAQGLAHFLEHMLFMGSEEFPDENEYDSYLSK 136
M V VGSFSDP +GLAHFLEHMLF SE++P+E+ Y Y+++
Sbjct: 51 ----------------MSVSVGSFSDPQGLEGLAHFLEHMLFYASEKYPEEDSYSKYITE 94
Query: 137 HGGSSNAHTEAEHTCYKFEVKREYLKGALRRFSQFFISPLVKIEAMEREVLAVDSEFNKV 196
HGGS+NA+T +E T Y F+V + AL RF+QFFI PL+ +A RE+ AVDSE K
Sbjct: 95 HGGSTNAYTASEETNYHFDVNADCFDEALDRFAQFFIKPLMSADATMREIKAVDSENQKN 154
Query: 197 LQYDHCRLQQLQRHTSALNHPLNKFSCGNKKSL-VDAMENGSITNLREKLLKFYEDYYHA 255
L D R++QLQ+H S +HP +KFS GN +L V G + R +L+KFYE++Y A
Sbjct: 155 LLSDGWRIRQLQKHLSKEDHPYHKFSTGNMDTLHVRPQAKG--VDTRSELIKFYEEHYSA 212
Query: 256 GLMKLVVIGGESLNVLESWVVELFSTVKNGPQVNPEFIVEGPMWKSGKV-YRLEAVKDIN 314
+M LVV G ESL+ ++ V +F ++N +V P F + ++ + +K +
Sbjct: 213 NIMHLVVYGKESLDKIQDLVERMFQEIQNTNKVVPRFPGQPCTADHLQILVKAIPIKQGH 272
Query: 315 ILSLAWTLPSLDQDYLEKPDDYLAYLLRNEGRGSLIFFLRARGWATSLFAGIGNDGMYWS 374
L ++W + Y E P YL +L+ +EG GSL L+ GWAT L AG G W+
Sbjct: 273 KLGVSWPVTPSIHHYDEAPSQYLGHLIGHEGEGSLFHALKTLGWATGLSAGEGE----WT 328
Query: 375 SIAYVFVISICLTDSGIEKMYDIIGFVYQYLKLLRQVSPQEWIFKELQNMGNMKFRFAEE 434
F +SI LTD+G E M +I+G ++ Y++LL+Q +WIF EL + KF + ++
Sbjct: 329 LDYSFFKVSIDLTDAGHEHMQEILGLLFNYIQLLQQTGVCQWIFDELSAICETKFHYQDK 388
Query: 435 QPQDDXXXXXXXXXXXXXPEHVIYGDYMYETWDEQLLQQVLGFFIPENMRVDVVSKFLKS 494
P + + G + ++ ++Q+V+ P N R+ S+ +
Sbjct: 389 IPPMSYIVDIASNMQIYPTKDWLVGSSLPTKFNPAIVQKVVDELSPSNFRIFWESQKFEG 448
Query: 495 SQDFKYETWFGSRYVEEDISQNLMKLW-RNPPEIDASFHLPSKNEFIPSDFSIRAGEDDS 553
D K E W+ + Y E I+ + ++ W ++ P D HLP+ N FIP+D S++ +D
Sbjct: 449 QTD-KAEPWYNTAYSLEKITSSTIQEWVQSAP--DVHLHLPAPNVFIPTDLSLKDADDKE 505
Query: 554 ANLTSPRCIVDEALIKFWYKPDSTFKVPRSNTYFQIN--LKGSRYDNVKSCVLSELFIHL 611
T P + + WYKPD+ F P++ N L S D + VL+++F L
Sbjct: 506 ---TVPVLLRKTPFSRLWYKPDTMFSKPKAYVKMDFNCPLAVSSPD---AAVLTDIFTRL 559
Query: 612 LKDELNEIIYQASVAKLETRISYVGDHMLELKVSGFNEKLPVLLSKILSVTRSFMPTEDR 671
L D LNE Y A VA L +S + D+ EL + G+N KL +LL ++ +F DR
Sbjct: 560 LMDYLNEYAYYAQVAGLYYGVS-LSDNGFELTLLGYNHKLRILLETVVGKIANFEVKPDR 618
Query: 672 YKVIKEDVMKRALKNTNMKPLSHSSYLRLQILCESFYDADDKLHCXXXXXXXXXXAFIPE 731
+ VIKE V K +P + Y IL + + ++L F+P
Sbjct: 619 FAVIKETVTKEYQNYKFRQPYHQAMYYCSLILQDQTWPWTEELDVLSHLEAEDVAKFVPM 678
Query: 732 LRSQLYIEGLCHGNLSEDEA----INISNIFRINFPLNINPPLIKLRH-ARRIVCLPSSA 786
L S+ +IE GN+ +EA +I ++ N P I PL +H R+V L
Sbjct: 679 LLSRTFIECYIAGNVENNEAESMVKHIEDVL-FNDPKPICRPLFPSQHLTNRVVKLGEGM 737
Query: 787 NLVRDVGVKNKFDKNSAVELYFQIEQDFGMRSMKLKALIDLVEEIVKEPLFNQLRTKEQL 846
N D+NSA+ Y Q+ +D ++KL+ L + K+ F+QLRT EQL
Sbjct: 738 KYFYHQDGSNPSDENSALVHYIQVHRDDFSMNIKLQ----LFGLVAKQATFHQLRTVEQL 793
Query: 847 GYVVDCSSRVTIRVLGFCFHIQSSEYNPVYLQGRIDNFINXXXXXXXXXXXXSFENYKSG 906
GY+ + R + G F IQSS P ++ R+++ + F++ +
Sbjct: 794 GYITALAQRNDSGIYGVQFIIQSSVKGPGHIDSRVESLLKNFESKLYEMSNEDFKSNVTA 853
Query: 907 LMAKLLEKDPSLTYESNRLWNQILDKRYIFDVSKKKAEELRNISKNDVVEWYKTYLKPSS 966
L+ LEK +L ES W +I F+ + + L+ + K ++++++ Y+K +
Sbjct: 854 LIDMKLEKHKNLKEESRFYWREIQSGTLKFNRKEAEVSALKQLQKQELIDFFDEYIKVGA 913
Query: 967 PKCRRLLVRVWGCNTDLKDNAEALSK--SMQVIITDPTAFKKESVFYPS 1013
+ + L +RV+G LK+ A + S V I D F+K + S
Sbjct: 914 ARKKSLSIRVYGSQ-HLKEMASDKDEVPSPSVEIEDIVGFRKSQPLHGS 961
>AT3G57470.1 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr3:21269828-21275886 REVERSE LENGTH=851
Length = 851
Score = 406 bits (1044), Expect = e-113, Method: Compositional matrix adjust.
Identities = 289/883 (32%), Positives = 434/883 (49%), Gaps = 79/883 (8%)
Query: 17 VVKSPNDRRLYRFIQLHNGLQALLVHDPEIYPEGPPKPVPTDNXXXXXXXXXXXXXXXXX 76
++K D+R YR I L N L+ LL+ DPE TD
Sbjct: 17 ILKPRTDKREYRRIVLKNSLEVLLISDPE-----------TDKCAAS------------- 52
Query: 77 XXXXXXXXXXXXXXXXMCVGVGSFSDPNEAQGLAHFLEHMLFMGSEEFPDENEYDSYLSK 136
M V VGSF+DP +GLAHFLEHMLF SE++P+E+ Y Y+++
Sbjct: 53 ----------------MNVSVGSFTDPEGLEGLAHFLEHMLFYASEKYPEEDSYSKYITE 96
Query: 137 HGGSSNAHTEAEHTCYKFEVKREYLKGALRRFSQFFISPLVKIEAMEREVLAVDSEFNKV 196
HGGS+NA+T +E T Y F++ + AL RF+QFFI PL+ +A RE+ AVDSE
Sbjct: 97 HGGSTNAYTSSEDTNYHFDINTDSFYEALDRFAQFFIQPLMSTDATMREIKAVDSEHQNN 156
Query: 197 LQYDHCRLQQLQRHTSALNHPLNKFSCGNKKSL-VDAMENGSITNLREKLLKFYEDYYHA 255
L D R+ QLQ+H S +HP +KFS GN +L V ENG + R +L+KFY+++Y A
Sbjct: 157 LLSDSWRMAQLQKHLSREDHPYHKFSTGNMDTLHVRPEENG--VDTRSELIKFYDEHYSA 214
Query: 256 GLMKLVVIGGESLNVLESWVVELFSTVKNGPQVNPEFIVEGPMWKSGKVYRLEAVKDINI 315
+M LVV G E+L+ + V LF ++N Q P F + +V VK + I
Sbjct: 215 NIMHLVVYGKENLDKTQGLVEALFQGIRNTNQGIPRFPGQPCTLDHLQVL----VKAVPI 270
Query: 316 -----LSLAWTLPSLDQDYLEKPDDYLAYLLRNEGRGSLIFFLRARGWATSLFAGIGNDG 370
LS++W + Y E P YL L+ +EG GSL L+ GWAT L+AG +
Sbjct: 271 MQGHELSVSWPVTPSISHYEEAPCRYLGDLIGHEGEGSLFHALKILGWATGLYAGEAD-- 328
Query: 371 MYWSSIAYVFVISICLTDSGIEKMYDIIGFVYQYLKLLRQVSPQEWIFKELQNMGNMKFR 430
WS F +SI LTD+G E M DI+G +++Y+K+L+Q +WIF EL + +F
Sbjct: 329 --WSMEYSFFNVSIDLTDAGHEHMQDILGLLFEYIKVLQQSGVSQWIFDELSAICEAEFH 386
Query: 431 FAEEQPQDDXXXXXXXXXXXXXPEHVIYGDYMYETWDEQLLQQVLGFFIPENMRVDVVSK 490
+ + +H + G + ++ ++Q+VL P N+R+ S
Sbjct: 387 YQAKIDPISYAVDISSNMKIYPTKHWLVGSSLPSKFNPAIVQKVLDELSPNNVRIFWESN 446
Query: 491 FLKSSQDFKYETWFGSRYVEEDISQNLMKLW-RNPPEIDASFHLPSKNEFIPSDFSIRAG 549
+ D K E W+ + Y E I++ ++ W ++ P D + LP+ N FIP+DFS++
Sbjct: 447 KFEGQTD-KVEPWYNTAYSLEKITKFTIQEWMQSAP--DVNLLLPTPNVFIPTDFSLKDL 503
Query: 550 EDDSANLTSPRCIVDEALIKFWYKPDSTFKVPRSNTYFQIN--LKGSRYDNVKSCVLSEL 607
+D P + + + WYKPD+ F P++ N L S D + VLS++
Sbjct: 504 KDKD---IFPVLLRKTSYSRLWYKPDTKFFKPKAYVKMDFNCPLAVSSPD---AAVLSDI 557
Query: 608 FIHLLKDELNEIIYQASVAKLETRISYVGDHMLELKVSGFNEKLPVLLSKILSVTRSFMP 667
F+ LL D LNE Y A A L+ +S + D+ EL ++GFN KL +LL ++ F
Sbjct: 558 FVWLLVDYLNEYAYYAQAAGLDYGLS-LSDNGFELSLAGFNHKLRILLEAVIQKIAKFEV 616
Query: 668 TEDRYKVIKEDVMKRALKNTNMKPLSHSSYLRLQILCESFYDADDKLHCXXXXXXXXXXA 727
DR+ VIKE V K N +P ++ +L + + ++L
Sbjct: 617 KPDRFSVIKETVTKAYQNNKFQQPHEQATNYCSLVLQDQIWPWTEELDALSHLEAEDLAN 676
Query: 728 FIPELRSQLYIEGLCHGNLSEDEA----INISNIFRINFPLNINPPLIKLRHARRIVCLP 783
F+P L S+ ++E GN+ +DEA +I ++ + P R+ L
Sbjct: 677 FVPMLLSRTFVECYIAGNVEKDEAESMVKHIEDVLFTDSKPICRPLFPSQFLTNRVTELG 736
Query: 784 SSANLVRDVGVKNKFDKNSAVELYFQIEQD-FGMRSMKLKALIDLVEEIVKEPLFNQLRT 842
+ N D+NSA+ Y Q+ +D F M S + L E I K+ F+QLRT
Sbjct: 737 TGMKHFYYQEGSNSSDENSALVHYIQVHKDEFSMNSK-----LQLFELIAKQDTFHQLRT 791
Query: 843 KEQLGYVVDCSSRVTIRVLGFCFHIQSSEYNPVYLQGRIDNFI 885
EQLGY+ S V G F IQSS P ++ R+++ +
Sbjct: 792 IEQLGYITSLSLSNDSGVYGVQFIIQSSVKGPGHIDSRVESLL 834
>AT3G57470.2 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr3:21269428-21275400 REVERSE LENGTH=891
Length = 891
Score = 396 bits (1017), Expect = e-110, Method: Compositional matrix adjust.
Identities = 289/914 (31%), Positives = 451/914 (49%), Gaps = 42/914 (4%)
Query: 116 MLFMGSEEFPDENEYDSYLSKHGGSSNAHTEAEHTCYKFEVKREYLKGALRRFSQFFISP 175
MLF SE++P+E+ Y Y+++HGGS+NA+T +E T Y F++ + AL RF+QFFI P
Sbjct: 1 MLFYASEKYPEEDSYSKYITEHGGSTNAYTSSEDTNYHFDINTDSFYEALDRFAQFFIQP 60
Query: 176 LVKIEAMEREVLAVDSEFNKVLQYDHCRLQQLQRHTSALNHPLNKFSCGNKKSL-VDAME 234
L+ +A RE+ AVDSE L D R+ QLQ+H S +HP +KFS GN +L V E
Sbjct: 61 LMSTDATMREIKAVDSEHQNNLLSDSWRMAQLQKHLSREDHPYHKFSTGNMDTLHVRPEE 120
Query: 235 NGSITNLREKLLKFYEDYYHAGLMKLVVIGGESLNVLESWVVELFSTVKNGPQVNPEFIV 294
NG + R +L+KFY+++Y A +M LVV G E+L+ + V LF ++N Q P F
Sbjct: 121 NG--VDTRSELIKFYDEHYSANIMHLVVYGKENLDKTQGLVEALFQGIRNTNQGIPRFPG 178
Query: 295 EGPMWKSGKVYRLEAVKDINI-----LSLAWTLPSLDQDYLEKPDDYLAYLLRNEGRGSL 349
+ +V VK + I LS++W + Y E P YL L+ +EG GSL
Sbjct: 179 QPCTLDHLQV----LVKAVPIMQGHELSVSWPVTPSISHYEEAPCRYLGDLIGHEGEGSL 234
Query: 350 IFFLRARGWATSLFAGIGNDGMYWSSIAYVFVISICLTDSGIEKMYDIIGFVYQYLKLLR 409
L+ GWAT L+AG + WS F +SI LTD+G E M DI+G +++Y+K+L+
Sbjct: 235 FHALKILGWATGLYAGEAD----WSMEYSFFNVSIDLTDAGHEHMQDILGLLFEYIKVLQ 290
Query: 410 QVSPQEWIFKELQNMGNMKFRFAEEQPQDDXXXXXXXXXXXXXPEHVIYGDYMYETWDEQ 469
Q +WIF EL + +F + + +H + G + ++
Sbjct: 291 QSGVSQWIFDELSAICEAEFHYQAKIDPISYAVDISSNMKIYPTKHWLVGSSLPSKFNPA 350
Query: 470 LLQQVLGFFIPENMRVDVVSKFLKSSQDFKYETWFGSRYVEEDISQNLMKLW-RNPPEID 528
++Q+VL P N+R+ S + D K E W+ + Y E I++ ++ W ++ P D
Sbjct: 351 IVQKVLDELSPNNVRIFWESNKFEGQTD-KVEPWYNTAYSLEKITKFTIQEWMQSAP--D 407
Query: 529 ASFHLPSKNEFIPSDFSIRAGEDDSANLTSPRCIVDEALIKFWYKPDSTFKVPRSNTYFQ 588
+ LP+ N FIP+DFS++ +D P + + + WYKPD+ F P++
Sbjct: 408 VNLLLPTPNVFIPTDFSLKDLKDKD---IFPVLLRKTSYSRLWYKPDTKFFKPKAYVKMD 464
Query: 589 IN--LKGSRYDNVKSCVLSELFIHLLKDELNEIIYQASVAKLETRISYVGDHMLELKVSG 646
N L S D + VLS++F+ LL D LNE Y A A L+ +S + D+ EL ++G
Sbjct: 465 FNCPLAVSSPD---AAVLSDIFVWLLVDYLNEYAYYAQAAGLDYGLS-LSDNGFELSLAG 520
Query: 647 FNEKLPVLLSKILSVTRSFMPTEDRYKVIKEDVMKRALKNTNMKPLSHSSYLRLQILCES 706
FN KL +LL ++ F DR+ VIKE V K N +P ++ +L +
Sbjct: 521 FNHKLRILLEAVIQKIAKFEVKPDRFSVIKETVTKAYQNNKFQQPHEQATNYCSLVLQDQ 580
Query: 707 FYDADDKLHCXXXXXXXXXXAFIPELRSQLYIEGLCHGNLSEDEA----INISNIFRINF 762
+ ++L F+P L S+ ++E GN+ +DEA +I ++ +
Sbjct: 581 IWPWTEELDALSHLEAEDLANFVPMLLSRTFVECYIAGNVEKDEAESMVKHIEDVLFTDS 640
Query: 763 PLNINPPLIKLRHARRIVCLPSSANLVRDVGVKNKFDKNSAVELYFQIEQD-FGMRSMKL 821
P R+ L + N D+NSA+ Y Q+ +D F M S
Sbjct: 641 KPICRPLFPSQFLTNRVTELGTGMKHFYYQEGSNSSDENSALVHYIQVHKDEFSMNSK-- 698
Query: 822 KALIDLVEEIVKEPLFNQLRTKEQLGYVVDCSSRVTIRVLGFCFHIQSSEYNPVYLQGRI 881
+ L E I K+ F+QLRT EQLGY+ S V G F IQSS P ++ R+
Sbjct: 699 ---LQLFELIAKQDTFHQLRTIEQLGYITSLSLSNDSGVYGVQFIIQSSVKGPGHIDSRV 755
Query: 882 DNFINXXXXXXXXXXXXSFENYKSGLMAKLLEKDPSLTYESNRLWNQILDKRYIFDVSKK 941
++ + F++ + L+ LEKD +L ES W +I F+
Sbjct: 756 ESLLKDLESKFYNMSDEEFKSNVTNLIDMKLEKDKNLDEESWFYWAEIQTGTLKFNRIDA 815
Query: 942 KAEELRNISKNDVVEWYKTYLKPSSPKCRRLLVRVWGCNTDLKD--NAEALSKSMQVIIT 999
+ LR + K++ ++++ Y+K +P + L + V+G N LK+ N + S + I
Sbjct: 816 EVAALRLLKKDEWIDFFDEYIKVDAPNKKSLSICVYG-NQHLKEMRNDKDKIPSTSIEIE 874
Query: 1000 DPTAFKKESVFYPS 1013
D F+K Y S
Sbjct: 875 DIVCFRKSQPLYGS 888
>AT3G57470.3 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr3:21269428-21275400 REVERSE LENGTH=881
Length = 881
Score = 389 bits (999), Expect = e-108, Method: Compositional matrix adjust.
Identities = 288/914 (31%), Positives = 447/914 (48%), Gaps = 52/914 (5%)
Query: 116 MLFMGSEEFPDENEYDSYLSKHGGSSNAHTEAEHTCYKFEVKREYLKGALRRFSQFFISP 175
MLF SE++P+E+ Y Y+++HGGS+NA+T +E T Y F++ + AL RF+QFFI P
Sbjct: 1 MLFYASEKYPEEDSYSKYITEHGGSTNAYTSSEDTNYHFDINTDSFYEALDRFAQFFIQP 60
Query: 176 LVKIEAMEREVLAVDSEFNKVLQYDHCRLQQLQRHTSALNHPLNKFSCGNKKSL-VDAME 234
L+ +A RE+ AVDSE L D R+ QLQ+H S +HP +KFS GN +L V E
Sbjct: 61 LMSTDATMREIKAVDSEHQNNLLSDSWRMAQLQKHLSREDHPYHKFSTGNMDTLHVRPEE 120
Query: 235 NGSITNLREKLLKFYEDYYHAGLMKLVVIGGESLNVLESWVVELFSTVKNGPQVNPEFIV 294
NG + R +L+KFY+++Y A +M LVV G E+L+ + V LF ++N Q P F
Sbjct: 121 NG--VDTRSELIKFYDEHYSANIMHLVVYGKENLDKTQGLVEALFQGIRNTNQGIPRFPG 178
Query: 295 EGPMWKSGKVYRLEAVKDINI-----LSLAWTLPSLDQDYLEKPDDYLAYLLRNEGRGSL 349
+ +V VK + I LS++W + Y E P YL L+ +EG GSL
Sbjct: 179 QPCTLDHLQV----LVKAVPIMQGHELSVSWPVTPSISHYEEAPCRYLGDLIGHEGEGSL 234
Query: 350 IFFLRARGWATSLFAGIGNDGMYWSSIAYVFVISICLTDSGIEKMYDIIGFVYQYLKLLR 409
L+ GWAT L+AG + WS F +SI LTD+G E M DI+G +++Y+K+L+
Sbjct: 235 FHALKILGWATGLYAGEAD----WSMEYSFFNVSIDLTDAGHEHMQDILGLLFEYIKVLQ 290
Query: 410 QVSPQEWIFKELQNMGNMKFRFAEEQPQDDXXXXXXXXXXXXXPEHVIYGDYMYETWDEQ 469
Q +WIF EL + +F + + +H + G + ++
Sbjct: 291 QSGVSQWIFDELSAICEAEFHYQAKIDPISYAVDISSNMKIYPTKHWLVGSSLPSKFNPA 350
Query: 470 LLQQVLGFFIPENMRVDVVSKFLKSSQDFKYETWFGSRYVEEDISQNLMKLW-RNPPEID 528
++Q+VL P N+R+ S + D K E W+ + Y E I++ ++ W ++ P D
Sbjct: 351 IVQKVLDELSPNNVRIFWESNKFEGQTD-KVEPWYNTAYSLEKITKFTIQEWMQSAP--D 407
Query: 529 ASFHLPSKNEFIPSDFSIRAGEDDSANLTSPRCIVDEALIKFWYKPDSTFKVPRSNTYFQ 588
+ LP+ N FIP+DFS++ +D P + + + WYKPD+ F P++
Sbjct: 408 VNLLLPTPNVFIPTDFSLKDLKDKD---IFPVLLRKTSYSRLWYKPDTKFFKPKAYVKMD 464
Query: 589 IN--LKGSRYDNVKSCVLSELFIHLLKDELNEIIYQASVAKLETRISYVGDHMLELKVSG 646
N L S D + VLS++F+ LL D LNE Y A A L+ +S + D+ EL ++G
Sbjct: 465 FNCPLAVSSPD---AAVLSDIFVWLLVDYLNEYAYYAQAAGLDYGLS-LSDNGFELSLAG 520
Query: 647 FNEKLPVLLSKILSVTRSFMPTEDRYKVIKEDVMKRALKNTNMKPLSHSSYLRLQILCES 706
FN KL +LL ++ F DR+ VIKE V K N +P ++ +L +
Sbjct: 521 FNHKLRILLEAVIQKIAKFEVKPDRFSVIKETVTKAYQNNKFQQPHEQATNYCSLVLQDQ 580
Query: 707 FYDADDKLHCXXXXXXXXXXAFIPELRSQLYIEGLCHGNLSEDEA----INISNIFRINF 762
+ ++L F+P L S+ ++E GN+ +DEA +I ++ +
Sbjct: 581 IWPWTEELDALSHLEAEDLANFVPMLLSRTFVECYIAGNVEKDEAESMVKHIEDVLFTDS 640
Query: 763 PLNINPPLIKLRHARRIVCLPSSANLVRDVGVKNKFDKNSAVELYFQIEQD-FGMRSMKL 821
P R+ L + N D+NSA+ Y Q+ +D F M S
Sbjct: 641 KPICRPLFPSQFLTNRVTELGTGMKHFYYQEGSNSSDENSALVHYIQVHKDEFSMNSK-- 698
Query: 822 KALIDLVEEIVKEPLFNQLRTKEQLGYVVDCSSRVTIRVLGFCFHIQSSEYNPVYLQGRI 881
+ L E I K+ F+QLRT EQLGY+ S V G F IQSS P ++ R+
Sbjct: 699 ---LQLFELIAKQDTFHQLRTIEQLGYITSLSLSNDSGVYGVQFIIQSSVKGPGHIDSRV 755
Query: 882 DNFINXXXXXXXXXXXXSFENYKSGLMAKLLEKDPSLTYESNRLWNQILDKRYIFDVSKK 941
++ + F+ LEKD +L ES W +I F+
Sbjct: 756 ESLLKDLESKFYNMSDEEFK----------LEKDKNLDEESWFYWAEIQTGTLKFNRIDA 805
Query: 942 KAEELRNISKNDVVEWYKTYLKPSSPKCRRLLVRVWGCNTDLKD--NAEALSKSMQVIIT 999
+ LR + K++ ++++ Y+K +P + L + V+G N LK+ N + S + I
Sbjct: 806 EVAALRLLKKDEWIDFFDEYIKVDAPNKKSLSICVYG-NQHLKEMRNDKDKIPSTSIEIE 864
Query: 1000 DPTAFKKESVFYPS 1013
D F+K Y S
Sbjct: 865 DIVCFRKSQPLYGS 878
>AT5G01440.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: Insulinase (Peptidase family M16) family protein
(TAIR:AT1G06900.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:179822-181283 FORWARD
LENGTH=291
Length = 291
Score = 129 bits (323), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/191 (37%), Positives = 121/191 (63%), Gaps = 10/191 (5%)
Query: 791 DVGVKNKFDKNSAVELYFQIEQDFGMRSMKLKALIDLVEEIVKEPLFNQLRTKEQLGYVV 850
DV VKNKF+ NS ++YF+I+ + + + AL++L I+ + ++N+LR +E+LGY+V
Sbjct: 17 DVKVKNKFESNSLAKVYFRIKCEKAQEARQT-ALLNLFVSIISDSVYNKLRIEEKLGYLV 75
Query: 851 DCSSRVTIRVLGFCFHIQSSEYNPVYLQGRIDNFINXXXXXXXXXXXXSFENYKSGLMAK 910
+C +R+ I +GF + SS+YNP +L RI F+N F+++K+G+ +K
Sbjct: 76 ECETRL-IHGVGFYVCVVSSDYNPCHLVRRIYKFMNGIRLEGLFDKM--FKDFKNGVSSK 132
Query: 911 LLEKDPSLTYESNRLWNQILDKRYIFDVSKKKAEELRNISKNDVVEWYKTYLKPSSPKCR 970
L P + ++ W++I+ + IFD ++ +EL I+KND++EWYK Y++ SSPKC
Sbjct: 133 L----PHDSGKTK--WSEIVRESCIFDFYSEEKKELSLITKNDLIEWYKRYVRLSSPKCC 186
Query: 971 RLLVRVWGCNT 981
+V +WGCNT
Sbjct: 187 SFVVSIWGCNT 197
>AT3G02090.1 | Symbols: MPPBETA | Insulinase (Peptidase family M16)
protein | chr3:365624-368526 FORWARD LENGTH=531
Length = 531
Score = 58.5 bits (140), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 91/215 (42%), Gaps = 13/215 (6%)
Query: 95 VGVGSFSDPNEAQGLAHFLEHMLFMGSEEFPDENEYDSYLSKHGGSSNAHTEAEHTCYKF 154
+ GS + +E G AHFLEHM+F G++ + + GG NA+T E T Y
Sbjct: 125 IDAGSRFESDETNGTAHFLEHMIFKGTDRRT-VRALEEEIEDIGGHLNAYTSREQTTYYA 183
Query: 155 EVKREYLKGALRRFSQFFISPLVKIEAMEREVLAVDSEFNKV-LQYDHCRLQQLQRHTSA 213
+V + AL + + + + + RE + E +V Q D L L H +A
Sbjct: 184 KVLDSNVNQALDVLADILQNSKFEEQRINRERDVILREMQEVEGQTDEVVLDHL--HATA 241
Query: 214 LNH-PLNKFSCGNKKSLVDAMENGSITNLREKLLKFYEDYYHAGLMKLVVIGGESLNVLE 272
+ PL + G +++ SIT RE L + + +Y A M + G +
Sbjct: 242 FQYTPLGRTILGPAQNV------KSIT--REDLQNYIKTHYTASRMVIAAAGAVKHEEVV 293
Query: 273 SWVVELFSTVKNGPQVNPEFIVEGPMWKSGKVYRL 307
V +LF+ + + P + + P +G R+
Sbjct: 294 EQVKKLFTKLSSDPTTTSQLVANEPASFTGSEVRM 328
>AT3G02090.2 | Symbols: MPPBETA | Insulinase (Peptidase family M16)
protein | chr3:365624-368534 FORWARD LENGTH=535
Length = 535
Score = 58.5 bits (140), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 91/215 (42%), Gaps = 13/215 (6%)
Query: 95 VGVGSFSDPNEAQGLAHFLEHMLFMGSEEFPDENEYDSYLSKHGGSSNAHTEAEHTCYKF 154
+ GS + +E G AHFLEHM+F G++ + + GG NA+T E T Y
Sbjct: 125 IDAGSRFESDETNGTAHFLEHMIFKGTDRRT-VRALEEEIEDIGGHLNAYTSREQTTYYA 183
Query: 155 EVKREYLKGALRRFSQFFISPLVKIEAMEREVLAVDSEFNKVL-QYDHCRLQQLQRHTSA 213
+V + AL + + + + + RE + E +V Q D L L H +A
Sbjct: 184 KVLDSNVNQALDVLADILQNSKFEEQRINRERDVILREMQEVEGQTDEVVLDHL--HATA 241
Query: 214 LNH-PLNKFSCGNKKSLVDAMENGSITNLREKLLKFYEDYYHAGLMKLVVIGGESLNVLE 272
+ PL + G +++ SIT RE L + + +Y A M + G +
Sbjct: 242 FQYTPLGRTILGPAQNV------KSIT--REDLQNYIKTHYTASRMVIAAAGAVKHEEVV 293
Query: 273 SWVVELFSTVKNGPQVNPEFIVEGPMWKSGKVYRL 307
V +LF+ + + P + + P +G R+
Sbjct: 294 EQVKKLFTKLSSDPTTTSQLVANEPASFTGSEVRM 328
>AT3G57460.1 | Symbols: | catalytics;metal ion binding |
chr3:21263086-21265797 REVERSE LENGTH=356
Length = 356
Score = 53.5 bits (127), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 43/74 (58%), Gaps = 4/74 (5%)
Query: 358 WATSLFAGIGNDGMYWSSIAYVFVISICLTDSGIEKMYDIIGFVYQYLKLLRQVSPQEWI 417
WAT L+AG + W+ F +SI LTD+ E M DI+G +++ +KLL+Q +WI
Sbjct: 70 WATGLYAGEPD----WTVEYSFFNVSINLTDACHEHMKDILGLLFRQIKLLQQSGVSQWI 125
Query: 418 FKELQNMGNMKFRF 431
F EL + +F +
Sbjct: 126 FDELSAIFEAEFHY 139
Score = 52.0 bits (123), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 46/81 (56%), Gaps = 1/81 (1%)
Query: 601 SCVLSELFIHLLKDELNEIIYQASVAKLETRISYVGDHMLELKVSGFNEKLPVLLSKILS 660
+ VLS + LL D LNE Y A A+L +S + D+ EL ++GFN KL +LL ++
Sbjct: 201 TVVLSNSSVWLLVDYLNEYAYYAQAARLHYGLS-LSDNGFELSLTGFNHKLRILLEAVIQ 259
Query: 661 VTRSFMPTEDRYKVIKEDVMK 681
+F DR+ V+K V++
Sbjct: 260 KMANFQVKPDRFSVVKVTVLR 280