Miyakogusa Predicted Gene
- Lj4g3v3019490.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v3019490.1 Non Chatacterized Hit- tr|I1KPT8|I1KPT8_SOYBN
Uncharacterized protein OS=Glycine max PE=3 SV=1,86.53,0,ZINC
PROTEASE-RELATED,NULL; METALLOPROTEASE,NULL; no description,Peptidase
M16, core; LuxS/MPP-like ,CUFF.52150.1
(930 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G56730.1 | Symbols: | Insulinase (Peptidase family M16) prot... 1316 0.0
AT5G42390.1 | Symbols: | Insulinase (Peptidase family M16) fami... 94 3e-19
AT3G02090.2 | Symbols: MPPBETA | Insulinase (Peptidase family M1... 59 2e-08
AT3G02090.1 | Symbols: MPPBETA | Insulinase (Peptidase family M1... 58 3e-08
AT1G06900.1 | Symbols: | Insulinase (Peptidase family M16) fami... 50 9e-06
>AT5G56730.1 | Symbols: | Insulinase (Peptidase family M16) protein
| chr5:22946906-22952576 REVERSE LENGTH=956
Length = 956
Score = 1316 bits (3405), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 634/894 (70%), Positives = 731/894 (81%), Gaps = 54/894 (6%)
Query: 39 ATPIPKKQRFRSLKLVNVDMEQLLPHQ--PVGVDYGTLDNGLRYYVRCNSKPRMXXXXXX 96
++ + +KQ FRSLKL++VDMEQ L ++ P G DYG LDNGL YYVR NSKPRM
Sbjct: 8 SSKVLRKQGFRSLKLMSVDMEQELGNELEPFGADYGRLDNGLIYYVRRNSKPRMRAALAL 67
Query: 97 XXXXGSVLEEEDERGVAHIVEHLAFSATTKYTNHDIVKFLESIGAEFGACQNAVTSSDDT 156
GSVLEEED+RGVAHIVEHLAFSATT+YTNHDIVKFLESIGAEFG CQNA+T++D+T
Sbjct: 68 AVKVGSVLEEEDQRGVAHIVEHLAFSATTRYTNHDIVKFLESIGAEFGPCQNAMTTADET 127
Query: 157 VYELLVPVDKPELLSQAISILAEFSSEIRVSKDDLEKERGAVMEEYRGSRNATGRLQDAH 216
+YEL VPVDKPELLSQAISILAEFSSEIRVSK+DLEKERGAVMEEYRG+RNATGR+QD+H
Sbjct: 128 IYELFVPVDKPELLSQAISILAEFSSEIRVSKEDLEKERGAVMEEYRGNRNATGRMQDSH 187
Query: 217 WILLMEGSKYAERLPIGLEKVIRTVPHETVKNFYKKWYHLCNMAVIAVGDFPDTQSVVEL 276
W L+MEGSKYAERLPIGLEKVIR+VP TVK FY+KWYHLCNMAV+AVGDFPDT++VV+L
Sbjct: 188 WQLMMEGSKYAERLPIGLEKVIRSVPAATVKQFYQKWYHLCNMAVVAVGDFPDTKTVVDL 247
Query: 277 IKTHFGQKIPAPDPPLIPTFHIPSHDEPRFSCFVESEAGGSAVMISYKMQAEELKTVKDY 336
IKTHF K + +PP IP F +PSH+E RFSCFVESEA GSAVMISYKM +LKTVKDY
Sbjct: 248 IKTHFEDKRSSSEPPQIPVFPVPSHEETRFSCFVESEAAGSAVMISYKMPVSDLKTVKDY 307
Query: 337 RNLLAESMFFHALNQRFFKISRRKDPPFFSCSASADNLVRPLKANIMTSSCKGKGTLQAL 396
R++LAESMF HALNQR FKISRRKDPPFF+CS +AD L
Sbjct: 308 RDMLAESMFLHALNQRLFKISRRKDPPFFACSVAADVL---------------------- 345
Query: 397 ESMLIEVARVRLHGFSEREVSIVRSLLMSEIESAFLERDQVQSTSMRDEYLQHFLHNEPV 456
VARVRLHGFSERE+S+VR+L+MSEIESA+LERDQVQSTS+RDEY+QHFLH EPV
Sbjct: 346 ------VARVRLHGFSEREISVVRALMMSEIESAYLERDQVQSTSLRDEYIQHFLHKEPV 399
Query: 457 VGIEYEAQLQKTLLPHISALEVSKCSERLRTSNSCVIKTIEPQAFAXXXXXXXXXXXXXX 516
+GIEYEAQLQKTLLP ISA +VS+ SE+LRTS CVIK++EP++ A
Sbjct: 400 IGIEYEAQLQKTLLPQISASDVSRYSEKLRTSCGCVIKSMEPKSAATIDHMRNVVSKVNS 459
Query: 517 XXXXGEISSWDDEYIPEEIVTTKPNMGHVMKELEYSNIGATELVLSNGMRVCYKRTDFLD 576
I+ WD+E IPEEIV+ KP G + +LEY +G TEL LSNGM+VCYK TDFLD
Sbjct: 460 LEEEKMIAPWDEENIPEEIVSEKPTPGDITHQLEYPEVGVTELTLSNGMQVCYKSTDFLD 519
Query: 577 DQVIFTGYSYGGLSELPENEYFSCSMGPTIAGEIGVFGYRPSVLMDMLAGKRAEVGTKIG 636
DQV+FTG+SYGGLSELPE++Y SCSMG TIAGEIG+FGY+PSVLMDMLA
Sbjct: 520 DQVLFTGFSYGGLSELPESDYISCSMGSTIAGEIGMFGYKPSVLMDMLA----------- 568
Query: 637 AYMRTFYGDCSPSDLETALQLVYQLFTTNLTPGEESVKIVMQMAEEAVCAQDRDPYTAFT 696
DLETALQLVYQLFTTN+ P EE V IVMQMAEE+V A++RDPYT F
Sbjct: 569 -------------DLETALQLVYQLFTTNVMPQEEEVGIVMQMAEESVRARERDPYTVFA 615
Query: 697 NRVKELNYGNSYFFRPVKKADLQKVDPVKACEYFSKCFRDPSTFTVVIVGNIDSTIALPL 756
NRVKELNYGNSYFFRP++ ++L+KVDP+KACEYF+ CFRDPSTFTVVIVGN+D TIALPL
Sbjct: 616 NRVKELNYGNSYFFRPIRISELRKVDPLKACEYFNSCFRDPSTFTVVIVGNLDPTIALPL 675
Query: 757 ILQYLGGILKPPEPIMHFNRDELKGLPFTFPTTIHREVVRSPMVEAQCLVQICFPVELKN 816
ILQYLGGI KPP+P+++FNRD+LKGLPFTFPT I +E VRSPMVEAQC VQ+CFPV+L N
Sbjct: 676 ILQYLGGIPKPPQPVLNFNRDDLKGLPFTFPTKITKEFVRSPMVEAQCSVQLCFPVQLTN 735
Query: 817 GTMVEEIHFVGFLSKLLETKIMQVLRFKHGQIYSVGVSVFLGGNKPSRTGDARGDISINF 876
GTM+EEIH +GFL KLLETKI+Q LRF+HGQIYS VSVFLGGNKPSRT D RGDIS+NF
Sbjct: 736 GTMIEEIHCIGFLGKLLETKIIQFLRFEHGQIYSAEVSVFLGGNKPSRTADLRGDISVNF 795
Query: 877 SCDPEISSKLVDLALDEMLRLQEEGPSEQDVSTVLEIEQRAHENGLQENYYWLD 930
SCDPEISSKLVDLAL+E++RLQ+EGPS++D+S +LEIEQRAHENG+QENYYWLD
Sbjct: 796 SCDPEISSKLVDLALEEIVRLQKEGPSQEDISAILEIEQRAHENGMQENYYWLD 849
>AT5G42390.1 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr5:16945308-16952647 FORWARD LENGTH=1265
Length = 1265
Score = 94.4 bits (233), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 103/226 (45%), Gaps = 15/226 (6%)
Query: 62 LPHQPVGVDYGTLDNGLRYYVRCNSKPRMXXXXXXXXXXGSVLEEEDERGVAHIVEHLAF 121
LP P + G L NGLRY + N P GS+ EEEDE+G+AH++EH+AF
Sbjct: 189 LPSHP-KLHRGQLKNGLRYLILPNKVPPNRFEAHMEVHVGSIDEEEDEQGIAHMIEHVAF 247
Query: 122 SATTKYTNHDIVKFLESIGAEFGACQNAVTSSDDTVYELLVPV----DKPELLSQAISIL 177
+ K K L + GA NA T TV+ + P + +L + L
Sbjct: 248 LGSKKRE-----KLLGT-----GARSNAYTDFHHTVFHIHSPTHTKDSEDDLFPSVLDAL 297
Query: 178 AEFSSEIRVSKDDLEKERGAVMEEYRGSRNATGRLQDAHWILLMEGSKYAERLPIGLEKV 237
E + + +EKER A++ E + R+ L +K R PIGLE+
Sbjct: 298 NEIAFHPKFLSSRVEKERRAILSELQMMNTIEYRVDCQLLQHLHSENKLGRRFPIGLEEQ 357
Query: 238 IRTVPHETVKNFYKKWYHLCNMAVIAVGDFPDTQSVVELIKTHFGQ 283
I+ + ++ F+++WY N + VGD + +V I+ FG+
Sbjct: 358 IKKWDVDKIRKFHERWYFPANATLYIVGDIDNIPRIVHNIEAVFGK 403
>AT3G02090.2 | Symbols: MPPBETA | Insulinase (Peptidase family M16)
protein | chr3:365624-368534 FORWARD LENGTH=535
Length = 535
Score = 58.5 bits (140), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 91/211 (43%), Gaps = 13/211 (6%)
Query: 73 TLDNGLRYYVRCNSKPRMXXXXXXXXXXGSVLEEEDERGVAHIVEHLAFSATTKYTNHDI 132
TL NGLR N + GS E ++ G AH +EH+ F T + T
Sbjct: 101 TLPNGLRVATESNLSAK-TATVGVWIDAGSRFESDETNGTAHFLEHMIFKGTDRRT---- 155
Query: 133 VKFLESIGAEFGACQNAVTSSDDTVYELLVPVDKPELLSQAISILAEFSSEIRVSKDDLE 192
V+ LE + G NA TS + T Y V +D ++QA+ +LA+ + + +
Sbjct: 156 VRALEEEIEDIGGHLNAYTSREQTTYYAKV-LDSN--VNQALDVLADILQNSKFEEQRIN 212
Query: 193 KERGAVMEEYRGSRNATGRLQDAHWILLMEGSKYAE--RLPIGLEKVIRTVPHETVKNFY 250
+ER ++ E + T + H L +Y R +G + ++++ E ++N+
Sbjct: 213 RERDVILREMQEVEGQTDEVVLDH--LHATAFQYTPLGRTILGPAQNVKSITREDLQNYI 270
Query: 251 KKWYHLCNMAVIAVGDFPDTQSVVELIKTHF 281
K Y M + A G + VVE +K F
Sbjct: 271 KTHYTASRMVIAAAGAVKH-EEVVEQVKKLF 300
>AT3G02090.1 | Symbols: MPPBETA | Insulinase (Peptidase family M16)
protein | chr3:365624-368526 FORWARD LENGTH=531
Length = 531
Score = 58.2 bits (139), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 92/213 (43%), Gaps = 13/213 (6%)
Query: 73 TLDNGLRYYVRCNSKPRMXXXXXXXXXXGSVLEEEDERGVAHIVEHLAFSATTKYTNHDI 132
TL NGLR N + GS E ++ G AH +EH+ F T + T
Sbjct: 101 TLPNGLRVATESNLSAK-TATVGVWIDAGSRFESDETNGTAHFLEHMIFKGTDRRT---- 155
Query: 133 VKFLESIGAEFGACQNAVTSSDDTVYELLVPVDKPELLSQAISILAEFSSEIRVSKDDLE 192
V+ LE + G NA TS + T Y V +D ++QA+ +LA+ + + +
Sbjct: 156 VRALEEEIEDIGGHLNAYTSREQTTYYAKV-LDSN--VNQALDVLADILQNSKFEEQRIN 212
Query: 193 KERGAVMEEYRGSRNATGRLQDAHWILLMEGSKYAE--RLPIGLEKVIRTVPHETVKNFY 250
+ER ++ E + T + H L +Y R +G + ++++ E ++N+
Sbjct: 213 RERDVILREMQEVEGQTDEVVLDH--LHATAFQYTPLGRTILGPAQNVKSITREDLQNYI 270
Query: 251 KKWYHLCNMAVIAVGDFPDTQSVVELIKTHFGQ 283
K Y M + A G + VVE +K F +
Sbjct: 271 KTHYTASRMVIAAAGAVKH-EEVVEQVKKLFTK 302
>AT1G06900.1 | Symbols: | Insulinase (Peptidase family M16) family
protein | chr1:2115155-2120635 REVERSE LENGTH=1024
Length = 1024
Score = 49.7 bits (117), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 78/172 (45%), Gaps = 14/172 (8%)
Query: 101 GSVLEEEDERGVAHIVEHLAFSATTKYTNHDIVKFLESIGAEFGACQNAVTSSDDTVYEL 160
GS L+ + +G+AH +EH+ F +T++ + + +S ++ G NA T + T Y
Sbjct: 116 GSFLDPPEAQGLAHFLEHMLFMGSTEFPDENEY---DSYLSKHGGSSNAYTEMEHTCYHF 172
Query: 161 LVPVDKPELLSQAISILAEFSSEIRVSKDDLEKERGAVMEEYRGS-RNATGRLQDAHWIL 219
V K E L A+ ++F + + +E+E AV E+ + +N RLQ
Sbjct: 173 EV---KREFLQGALKRFSQFFVAPLMKTEAMEREVLAVDSEFNQALQNDACRLQQLQCYT 229
Query: 220 LMEGSKYAERLPIGLEKVIRTVP------HETVKNFYKKWYHLCNMAVIAVG 265
+G + R G +K + E + YK++YH M ++ +G
Sbjct: 230 SAKGHPF-NRFAWGNKKSLSGAMENGVDLRECIVKLYKEYYHGGLMKLVVIG 280