Miyakogusa Predicted Gene
- Lj0g3v0350409.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0350409.1 tr|A7S6W7|A7S6W7_NEMVE Predicted protein
(Fragment) OS=Nematostella vectensis GN=v1g106861 PE=4
SV=1,32.89,0.00000000005,Found in ATP-dependent protease La
(LON),Peptidase S16, lon N-terminal; seg,NULL; ATP-DEPENDENT
PROT,CUFF.24079.1
(543 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr3g110340.1 | ATP-dependent protease LA (lon) domain protein... 736 0.0
Medtr1g007160.1 | ATP-dependent protease LA (lon) domain protein... 676 0.0
Medtr1g007160.2 | ATP-dependent protease LA (lon) domain protein... 676 0.0
Medtr3g110340.2 | ATP-dependent protease LA (lon) domain protein... 649 0.0
>Medtr3g110340.1 | ATP-dependent protease LA (lon) domain protein |
HC | chr3:51505663-51501214 | 20130731
Length = 531
Score = 736 bits (1900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/485 (74%), Positives = 403/485 (83%), Gaps = 15/485 (3%)
Query: 59 AVDSSGVFTFNTSVASLHTYLGDVEDTRHRAAFFDGGTVLNVPLFCLPGVVLFPGATLPL 118
+ SS FTFNTS+ASLHTYLGDVEDTRHR AF D G N+PLF L GVVLFPGATLPL
Sbjct: 62 GIGSSDEFTFNTSIASLHTYLGDVEDTRHRTAFLDAGATFNLPLFSLQGVVLFPGATLPL 121
Query: 119 RVIEAHFVAAIDRALSQVDVPLTIGVIRIHRDTPNRRMKSATIGTTAEIRQYGRLENGSL 178
RV FVAAI+RALSQVDVP TIGV+R+HRDT + M++A+ GTTA IRQYGRLE+GSL
Sbjct: 122 RVTVPRFVAAIERALSQVDVPYTIGVVRVHRDTESFTMQAASTGTTAVIRQYGRLEDGSL 181
Query: 179 NVVTRGQQRFRLRRCWVDVEGVPYGEVQIIEEDVPLRTPRDAFDQVPSSSNMSCSHAVLH 238
NVVTRGQQRF LRR W DV+GVPYGE+QIIEED+PLRTPR F + SSSNM CSH
Sbjct: 182 NVVTRGQQRFHLRRSWNDVDGVPYGEIQIIEEDLPLRTPRGIFGKSASSSNMPCSH---- 237
Query: 239 TPSSKHSHVKMEELKNGESDSEANSDGSFERELSQMERKIHLSVIGSSHVRDMMDESASS 298
VKM LKNG+++S+AN D FE ELS ERK HLS IGSS M D SA+S
Sbjct: 238 --------VKMHGLKNGQNNSDANPDEDFESELSPTERKAHLSAIGSSSASGMADVSANS 289
Query: 299 SDVKLMNKSDQEIRSNQDLSIGNCSTSGKQSSKEELNRCYKNVYNRPSHKISKTFLPHWV 358
S V M+ SDQEIRSN DLSI CSTSGKQSSKEELNRCYKN+ S+KISK F PHWV
Sbjct: 290 SGVNFMHNSDQEIRSNLDLSIEKCSTSGKQSSKEELNRCYKNIQ---SYKISKAFWPHWV 346
Query: 359 YRMYDSYWLAQRAADMWKRIVGVPSMDSLIKTPDILSFHIASKIPMSESTRQELLEIDGI 418
YRMYDSYWLAQRAADMWK+IVGVPSMDSL+K PD+LSFHIASKIP+SESTRQ+LL+IDGI
Sbjct: 347 YRMYDSYWLAQRAADMWKQIVGVPSMDSLVKKPDVLSFHIASKIPVSESTRQDLLDIDGI 406
Query: 419 AYRLRREIELLESIDVIRCKSCLTVIAKRSDMLVMSSEGPLGAYVNPGGYVHEIMTLYKA 478
YRLRREIELL+SID++RC+ C T+IAKRSDMLVMS+EGPLGAYVNPGGYVHEIMTLYKA
Sbjct: 407 TYRLRREIELLDSIDLVRCRICQTIIAKRSDMLVMSNEGPLGAYVNPGGYVHEIMTLYKA 466
Query: 479 NGLALLGPPVTEYSWFPGYAWTIATCATCKIQMGWLFTATNKKMRPNSFWGIRSCQVAEQ 538
NGL+L+G VTEYSWFPGYAWTIA CATC+ QMGWLFT TNKK+RP+SFWGIRSCQVAE+
Sbjct: 467 NGLSLVGHAVTEYSWFPGYAWTIAKCATCRTQMGWLFTTTNKKLRPDSFWGIRSCQVAEE 526
Query: 539 KRQNL 543
R+NL
Sbjct: 527 IRRNL 531
>Medtr1g007160.1 | ATP-dependent protease LA (lon) domain protein |
HC | chr1:213875-206900 | 20130731
Length = 556
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/496 (68%), Positives = 393/496 (79%), Gaps = 31/496 (6%)
Query: 64 GVFTFNTSVASLHTYLGDVEDTRHRAAFFDGGTVLNVPLFCLPGVVLFPGATLPLRVIEA 123
G FT+NT +ASLHTYLGDVEDT HR+ F DGGTVL +P+FCL GVVLFPGATLPLRVIE+
Sbjct: 69 GEFTYNTCIASLHTYLGDVEDTHHRSTFLDGGTVLTLPIFCLQGVVLFPGATLPLRVIES 128
Query: 124 HFVAAIDRALSQVDVPLTIGVIRIHRDTPNRRMKSATIGTTAEIRQYGRLENGSLNVVTR 183
+FVAA++++LS+VDVP TIGVIR+ DT NRRMK+A+IGTTAEIRQYGRLE+GSLNVVTR
Sbjct: 129 NFVAAVEKSLSRVDVPYTIGVIRVFSDTANRRMKTASIGTTAEIRQYGRLEDGSLNVVTR 188
Query: 184 GQQRFRLRRCWVDVEGVPYGEVQIIEEDVPLRTPRDAFDQVPSSSNMSCSHAVLHTPSSK 243
GQQRFRLRRCW+DVEGVPYGE+QIIEED+P RTPRDAF ++ SN+ C+ A + SK
Sbjct: 189 GQQRFRLRRCWIDVEGVPYGEIQIIEEDIPSRTPRDAFGKLTPLSNLPCNRASVL--PSK 246
Query: 244 HSHVKMEELKNGESDSEANSDGSFERELSQMERKIHLSVIGSSHVRDMMDESASSSDVKL 303
+S V + N ESD+E SFE ELS ER+IH S+I SS DESASS D K
Sbjct: 247 YS-VDGQGSLNEESDTEE----SFENELSSTERRIHQSLIRSSF---EYDESASSGDDKF 298
Query: 304 MNKSDQEIRSNQ--------------------DLSIGNCSTSGKQSS-KEELNRCYKNVY 342
+SDQEIRSN D IG+CSTSGKQSS +E LN C KN
Sbjct: 299 TYESDQEIRSNLNTPDTLTPLLPDHEKDAENLDSRIGSCSTSGKQSSIREGLNWCSKNRD 358
Query: 343 NRPSHKISKTFLPHWVYRMYDSYWLAQRAADMWKRIVGVPSMDSLIKTPDILSFHIASKI 402
S + S+ FLP WVYRM+DSY LAQ+AADMWK+IVG PSMD+L+K PD+LSF IASKI
Sbjct: 359 LYSSRRTSRAFLPGWVYRMFDSYLLAQKAADMWKQIVGAPSMDALVKKPDVLSFSIASKI 418
Query: 403 PMSESTRQELLEIDGIAYRLRREIELLESIDVIRCKSCLTVIAKRSDMLVMSSEGPLGAY 462
P+SESTRQELL+IDGI+YRLRREIELLESID+IRCK C +IAKRSDMLVMSSEGP+GAY
Sbjct: 419 PVSESTRQELLDIDGISYRLRREIELLESIDLIRCKICQIIIAKRSDMLVMSSEGPVGAY 478
Query: 463 VNPGGYVHEIMTLYKANGLALLGPPVTEYSWFPGYAWTIATCATCKIQMGWLFTATNKKM 522
VN GYVHEI TLYKANGLAL GP +T+YSWFPGYAWTIA CATC+ MGWLFTATN+K+
Sbjct: 479 VNATGYVHEITTLYKANGLALTGPALTKYSWFPGYAWTIANCATCETHMGWLFTATNRKL 538
Query: 523 RPNSFWGIRSCQVAEQ 538
+P SFWGIR+CQVA++
Sbjct: 539 KPKSFWGIRNCQVADE 554
>Medtr1g007160.2 | ATP-dependent protease LA (lon) domain protein |
HC | chr1:213875-206900 | 20130731
Length = 556
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/496 (68%), Positives = 393/496 (79%), Gaps = 31/496 (6%)
Query: 64 GVFTFNTSVASLHTYLGDVEDTRHRAAFFDGGTVLNVPLFCLPGVVLFPGATLPLRVIEA 123
G FT+NT +ASLHTYLGDVEDT HR+ F DGGTVL +P+FCL GVVLFPGATLPLRVIE+
Sbjct: 69 GEFTYNTCIASLHTYLGDVEDTHHRSTFLDGGTVLTLPIFCLQGVVLFPGATLPLRVIES 128
Query: 124 HFVAAIDRALSQVDVPLTIGVIRIHRDTPNRRMKSATIGTTAEIRQYGRLENGSLNVVTR 183
+FVAA++++LS+VDVP TIGVIR+ DT NRRMK+A+IGTTAEIRQYGRLE+GSLNVVTR
Sbjct: 129 NFVAAVEKSLSRVDVPYTIGVIRVFSDTANRRMKTASIGTTAEIRQYGRLEDGSLNVVTR 188
Query: 184 GQQRFRLRRCWVDVEGVPYGEVQIIEEDVPLRTPRDAFDQVPSSSNMSCSHAVLHTPSSK 243
GQQRFRLRRCW+DVEGVPYGE+QIIEED+P RTPRDAF ++ SN+ C+ A + SK
Sbjct: 189 GQQRFRLRRCWIDVEGVPYGEIQIIEEDIPSRTPRDAFGKLTPLSNLPCNRASVL--PSK 246
Query: 244 HSHVKMEELKNGESDSEANSDGSFERELSQMERKIHLSVIGSSHVRDMMDESASSSDVKL 303
+S V + N ESD+E SFE ELS ER+IH S+I SS DESASS D K
Sbjct: 247 YS-VDGQGSLNEESDTEE----SFENELSSTERRIHQSLIRSSF---EYDESASSGDDKF 298
Query: 304 MNKSDQEIRSNQ--------------------DLSIGNCSTSGKQSS-KEELNRCYKNVY 342
+SDQEIRSN D IG+CSTSGKQSS +E LN C KN
Sbjct: 299 TYESDQEIRSNLNTPDTLTPLLPDHEKDAENLDSRIGSCSTSGKQSSIREGLNWCSKNRD 358
Query: 343 NRPSHKISKTFLPHWVYRMYDSYWLAQRAADMWKRIVGVPSMDSLIKTPDILSFHIASKI 402
S + S+ FLP WVYRM+DSY LAQ+AADMWK+IVG PSMD+L+K PD+LSF IASKI
Sbjct: 359 LYSSRRTSRAFLPGWVYRMFDSYLLAQKAADMWKQIVGAPSMDALVKKPDVLSFSIASKI 418
Query: 403 PMSESTRQELLEIDGIAYRLRREIELLESIDVIRCKSCLTVIAKRSDMLVMSSEGPLGAY 462
P+SESTRQELL+IDGI+YRLRREIELLESID+IRCK C +IAKRSDMLVMSSEGP+GAY
Sbjct: 419 PVSESTRQELLDIDGISYRLRREIELLESIDLIRCKICQIIIAKRSDMLVMSSEGPVGAY 478
Query: 463 VNPGGYVHEIMTLYKANGLALLGPPVTEYSWFPGYAWTIATCATCKIQMGWLFTATNKKM 522
VN GYVHEI TLYKANGLAL GP +T+YSWFPGYAWTIA CATC+ MGWLFTATN+K+
Sbjct: 479 VNATGYVHEITTLYKANGLALTGPALTKYSWFPGYAWTIANCATCETHMGWLFTATNRKL 538
Query: 523 RPNSFWGIRSCQVAEQ 538
+P SFWGIR+CQVA++
Sbjct: 539 KPKSFWGIRNCQVADE 554
>Medtr3g110340.2 | ATP-dependent protease LA (lon) domain protein |
HC | chr3:51505663-51501214 | 20130731
Length = 484
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/438 (73%), Positives = 360/438 (82%), Gaps = 15/438 (3%)
Query: 59 AVDSSGVFTFNTSVASLHTYLGDVEDTRHRAAFFDGGTVLNVPLFCLPGVVLFPGATLPL 118
+ SS FTFNTS+ASLHTYLGDVEDTRHR AF D G N+PLF L GVVLFPGATLPL
Sbjct: 62 GIGSSDEFTFNTSIASLHTYLGDVEDTRHRTAFLDAGATFNLPLFSLQGVVLFPGATLPL 121
Query: 119 RVIEAHFVAAIDRALSQVDVPLTIGVIRIHRDTPNRRMKSATIGTTAEIRQYGRLENGSL 178
RV FVAAI+RALSQVDVP TIGV+R+HRDT + M++A+ GTTA IRQYGRLE+GSL
Sbjct: 122 RVTVPRFVAAIERALSQVDVPYTIGVVRVHRDTESFTMQAASTGTTAVIRQYGRLEDGSL 181
Query: 179 NVVTRGQQRFRLRRCWVDVEGVPYGEVQIIEEDVPLRTPRDAFDQVPSSSNMSCSHAVLH 238
NVVTRGQQRF LRR W DV+GVPYGE+QIIEED+PLRTPR F + SSSNM CSH
Sbjct: 182 NVVTRGQQRFHLRRSWNDVDGVPYGEIQIIEEDLPLRTPRGIFGKSASSSNMPCSH---- 237
Query: 239 TPSSKHSHVKMEELKNGESDSEANSDGSFERELSQMERKIHLSVIGSSHVRDMMDESASS 298
VKM LKNG+++S+AN D FE ELS ERK HLS IGSS M D SA+S
Sbjct: 238 --------VKMHGLKNGQNNSDANPDEDFESELSPTERKAHLSAIGSSSASGMADVSANS 289
Query: 299 SDVKLMNKSDQEIRSNQDLSIGNCSTSGKQSSKEELNRCYKNVYNRPSHKISKTFLPHWV 358
S V M+ SDQEIRSN DLSI CSTSGKQSSKEELNRCYKN+ S+KISK F PHWV
Sbjct: 290 SGVNFMHNSDQEIRSNLDLSIEKCSTSGKQSSKEELNRCYKNIQ---SYKISKAFWPHWV 346
Query: 359 YRMYDSYWLAQRAADMWKRIVGVPSMDSLIKTPDILSFHIASKIPMSESTRQELLEIDGI 418
YRMYDSYWLAQRAADMWK+IVGVPSMDSL+K PD+LSFHIASKIP+SESTRQ+LL+IDGI
Sbjct: 347 YRMYDSYWLAQRAADMWKQIVGVPSMDSLVKKPDVLSFHIASKIPVSESTRQDLLDIDGI 406
Query: 419 AYRLRREIELLESIDVIRCKSCLTVIAKRSDMLVMSSEGPLGAYVNPGGYVHEIMTLYKA 478
YRLRREIELL+SID++RC+ C T+IAKRSDMLVMS+EGPLGAYVNPGGYVHEIMTLYKA
Sbjct: 407 TYRLRREIELLDSIDLVRCRICQTIIAKRSDMLVMSNEGPLGAYVNPGGYVHEIMTLYKA 466
Query: 479 NGLALLGPPVTEYSWFPG 496
NGL+L+G VTEYSWFPG
Sbjct: 467 NGLSLVGHAVTEYSWFPG 484