Miyakogusa Predicted Gene
- Lj1g3v2624870.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2624870.2 Non Characterized Hit- tr|I1N3L2|I1N3L2_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,79.42,0,SET
domain,NULL; seg,NULL; no description,NULL; UNCHARACTERIZED PUTATIVE
METHYLTRANSFERASE,NULL,CUFF.29312.2
(555 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr7g076590.1 | SET domain protein | HC | chr7:28843776-288518... 878 0.0
Medtr7g076590.2 | SET domain protein | HC | chr7:28844695-288518... 768 0.0
Medtr4g082305.1 | rubisco large subunit N-methyltransferase | HC... 59 9e-09
Medtr4g082305.2 | rubisco large subunit N-methyltransferase | HC... 58 2e-08
Medtr1g115980.1 | SET domain protein | HC | chr1:52408798-524174... 50 7e-06
Medtr1g115980.2 | SET domain protein | HC | chr1:52408798-524174... 50 7e-06
Medtr1g115980.3 | SET domain protein | HC | chr1:52408798-524174... 50 9e-06
Medtr1g115980.5 | SET domain protein | HC | chr1:52408798-524174... 49 9e-06
>Medtr7g076590.1 | SET domain protein | HC | chr7:28843776-28851809
| 20130731
Length = 547
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/555 (78%), Positives = 466/555 (83%), Gaps = 8/555 (1%)
Query: 1 MENPEAKLETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDNDVSDGVLVVVPLDLAITP 60
M NPEAKLETF+QWLQ NGVELRGC IKYCDSRKGFGIF D DVSDG+L+VVPL+L+ITP
Sbjct: 1 MANPEAKLETFLQWLQANGVELRGCNIKYCDSRKGFGIFCDKDVSDGILLVVPLELSITP 60
Query: 61 MRVLQDPLLGPECRAMFEEGDVDDRXXXXXXXTVERLRKNSPWKPYLAMLPTTFQNTLWF 120
MRVLQDP LGPECRA+FEEGDVDDR TVERLRK+S WKPYL MLPTTF NTLWF
Sbjct: 61 MRVLQDPFLGPECRALFEEGDVDDRLLMMLLLTVERLRKDSLWKPYLDMLPTTFGNTLWF 120
Query: 121 SDDELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANS 180
S++ELQELRGTTLYRATELQKKSLL+LYET SE E CFEDFLWANS
Sbjct: 121 SEEELQELRGTTLYRATELQKKSLLNLYETKVKDIVKKLLTLDSDSEKEVCFEDFLWANS 180
Query: 181 VFWSRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTVQGDTVWV 240
VFWSRALN PLPRSYVFPEMQ+V +SC +ADE SQ K D+LTKE T +TVQGDTVWV
Sbjct: 181 VFWSRALNTPLPRSYVFPEMQDVHQSCTPKADENGSQVTKSDDLTKETTHSTVQGDTVWV 240
Query: 241 EGLVPGIDFCNHDLKPKATWEVDGTGLTTGVPFSMYLISAAQSPLQTDQEISISYGTKGN 300
EGLVPGIDFCNHDLKP ATWEVDGTGLTTGVP SMYL+SAAQSPLQ D+EISISYG KGN
Sbjct: 241 EGLVPGIDFCNHDLKPIATWEVDGTGLTTGVPVSMYLLSAAQSPLQIDREISISYGNKGN 300
Query: 301 EELLYLYGFVIDGNTDDYLMVHYPTEAISTISFSEAKSQLLEVQNAEMRCLLPKALLEHG 360
EELLYLYGFVIDGNTDDYLMVHYP+EAI+TISFSE+KSQLLEVQ AEMRCLLPK LL++G
Sbjct: 301 EELLYLYGFVIDGNTDDYLMVHYPSEAINTISFSESKSQLLEVQKAEMRCLLPKTLLDNG 360
Query: 361 FFPLGTRNSGENNKSNQVCNYSWSGQRKMPSYVNKLVFPEKFIATLRTIAMQEDELFKVS 420
FFP GT NSGENNKSN+VC YSWSGQRK PSYV+KLVFPEKF+ATLRTIAMQEDELFKVS
Sbjct: 361 FFPSGTENSGENNKSNKVCKYSWSGQRKTPSYVDKLVFPEKFMATLRTIAMQEDELFKVS 420
Query: 421 SMLEELAGPERERQPSDTDVQSAIWEVCGDSGAXXXXXXXXXXXXXXXEESSGTEESDHD 480
SMLEEL GPE ERQ SD DVQSAIWEVCGDSGA EE SGTEE+D +
Sbjct: 421 SMLEELVGPEGERQLSDIDVQSAIWEVCGDSGALQLLVDLLHVKLMDLEEHSGTEENDFE 480
Query: 481 LLKKALIIDSQDDSKQCTSNLSTFANDSEESKLMMTRNKWSAIVYRQGQKELTRLFLKEA 540
LLKKALIIDSQ+DSKQ ANDSEE+ L MTRNKWSAIVYR+GQK+LTRLFLKEA
Sbjct: 481 LLKKALIIDSQEDSKQ-------MANDSEETTL-MTRNKWSAIVYRRGQKQLTRLFLKEA 532
Query: 541 EHALELSQNEEVNAP 555
EHAL LS +EE+ P
Sbjct: 533 EHALHLSMSEEMAVP 547
>Medtr7g076590.2 | SET domain protein | HC | chr7:28844695-28851809
| 20130731
Length = 487
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/495 (77%), Positives = 411/495 (83%), Gaps = 8/495 (1%)
Query: 61 MRVLQDPLLGPECRAMFEEGDVDDRXXXXXXXTVERLRKNSPWKPYLAMLPTTFQNTLWF 120
MRVLQDP LGPECRA+FEEGDVDDR TVERLRK+S WKPYL MLPTTF NTLWF
Sbjct: 1 MRVLQDPFLGPECRALFEEGDVDDRLLMMLLLTVERLRKDSLWKPYLDMLPTTFGNTLWF 60
Query: 121 SDDELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANS 180
S++ELQELRGTTLYRATELQKKSLL+LYET SE E CFEDFLWANS
Sbjct: 61 SEEELQELRGTTLYRATELQKKSLLNLYETKVKDIVKKLLTLDSDSEKEVCFEDFLWANS 120
Query: 181 VFWSRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTVQGDTVWV 240
VFWSRALN PLPRSYVFPEMQ+V +SC +ADE SQ K D+LTKE T +TVQGDTVWV
Sbjct: 121 VFWSRALNTPLPRSYVFPEMQDVHQSCTPKADENGSQVTKSDDLTKETTHSTVQGDTVWV 180
Query: 241 EGLVPGIDFCNHDLKPKATWEVDGTGLTTGVPFSMYLISAAQSPLQTDQEISISYGTKGN 300
EGLVPGIDFCNHDLKP ATWEVDGTGLTTGVP SMYL+SAAQSPLQ D+EISISYG KGN
Sbjct: 181 EGLVPGIDFCNHDLKPIATWEVDGTGLTTGVPVSMYLLSAAQSPLQIDREISISYGNKGN 240
Query: 301 EELLYLYGFVIDGNTDDYLMVHYPTEAISTISFSEAKSQLLEVQNAEMRCLLPKALLEHG 360
EELLYLYGFVIDGNTDDYLMVHYP+EAI+TISFSE+KSQLLEVQ AEMRCLLPK LL++G
Sbjct: 241 EELLYLYGFVIDGNTDDYLMVHYPSEAINTISFSESKSQLLEVQKAEMRCLLPKTLLDNG 300
Query: 361 FFPLGTRNSGENNKSNQVCNYSWSGQRKMPSYVNKLVFPEKFIATLRTIAMQEDELFKVS 420
FFP GT NSGENNKSN+VC YSWSGQRK PSYV+KLVFPEKF+ATLRTIAMQEDELFKVS
Sbjct: 301 FFPSGTENSGENNKSNKVCKYSWSGQRKTPSYVDKLVFPEKFMATLRTIAMQEDELFKVS 360
Query: 421 SMLEELAGPERERQPSDTDVQSAIWEVCGDSGAXXXXXXXXXXXXXXXEESSGTEESDHD 480
SMLEEL GPE ERQ SD DVQSAIWEVCGDSGA EE SGTEE+D +
Sbjct: 361 SMLEELVGPEGERQLSDIDVQSAIWEVCGDSGALQLLVDLLHVKLMDLEEHSGTEENDFE 420
Query: 481 LLKKALIIDSQDDSKQCTSNLSTFANDSEESKLMMTRNKWSAIVYRQGQKELTRLFLKEA 540
LLKKALIIDSQ+DSKQ ANDSEE+ L MTRNKWSAIVYR+GQK+LTRLFLKEA
Sbjct: 421 LLKKALIIDSQEDSKQ-------MANDSEETTL-MTRNKWSAIVYRRGQKQLTRLFLKEA 472
Query: 541 EHALELSQNEEVNAP 555
EHAL LS +EE+ P
Sbjct: 473 EHALHLSMSEEMAVP 487
>Medtr4g082305.1 | rubisco large subunit N-methyltransferase | HC |
chr4:32029066-32031775 | 20130731
Length = 490
Score = 59.3 bits (142), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 82/323 (25%), Positives = 123/323 (38%), Gaps = 78/323 (24%)
Query: 8 LETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDNDVS-DGVLVVVPLDLAITPMRVLQD 66
++TF +WLQ GV +K +G G+ + D+S + V++ VP L I P V
Sbjct: 55 VQTFWKWLQEEGVITAKTPVKASVVPEGLGLVALKDISRNDVILQVPKRLWINPDAVASS 114
Query: 67 PLLGPECRAMFEEGDVDDRXXXXXXXTVERLRKNSPWKPYLAMLPTTFQNTLWFSDDELQ 126
+ G C + V ER R++S WK Y +LP +T+++S++ELQ
Sbjct: 115 EI-GRVCSKLTPWLSV------MLFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQ 167
Query: 127 ELRGTTLYRAT----ELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANSVF 182
EL+GT L T E K L L + +DF WA +
Sbjct: 168 ELQGTQLLNTTLSVKEYVKNECLKLEKEIILPNKKLFPSPV-------TLDDFFWAFGIL 220
Query: 183 WSRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTVQGDTVWVEG 242
SRA + + +NL
Sbjct: 221 RSRAFS-----------------------------RLRNENLV----------------- 234
Query: 243 LVPGIDFCNHDLKPKA---TWEVDG-TGLTTGVPFSMYLISAAQSPL--QTDQEISISYG 296
++P D NH + +EV G GL FS + + +SPL + +++ I Y
Sbjct: 235 VIPLADLINHSARVTTDDHAYEVKGPAGL-----FSWDYLFSLRSPLSVKAGEQVYIQYD 289
Query: 297 -TKGNEELLYLYGFVIDGNTDDY 318
K N EL YGF I+ N D Y
Sbjct: 290 LNKSNAELALDYGF-IEPNADRY 311
>Medtr4g082305.2 | rubisco large subunit N-methyltransferase | HC |
chr4:32029066-32030776 | 20130731
Length = 351
Score = 58.2 bits (139), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/186 (28%), Positives = 80/186 (43%), Gaps = 19/186 (10%)
Query: 8 LETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDNDVS-DGVLVVVPLDLAITPMRVLQD 66
++TF +WLQ GV +K +G G+ + D+S + V++ VP L I P V
Sbjct: 55 VQTFWKWLQEEGVITAKTPVKASVVPEGLGLVALKDISRNDVILQVPKRLWINPDAVASS 114
Query: 67 PLLGPECRAMFEEGDVDDRXXXXXXXTVERLRKNSPWKPYLAMLPTTFQNTLWFSDDELQ 126
+ G C + V ER R++S WK Y +LP +T+++S++ELQ
Sbjct: 115 EI-GRVCSKLTPWLSV------MLFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQ 167
Query: 127 ELRGTTLYRAT----ELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANSVF 182
EL+GT L T E K L L + +DF WA +
Sbjct: 168 ELQGTQLLNTTLSVKEYVKNECLKLEKEIILPNKKLFPSPV-------TLDDFFWAFGIL 220
Query: 183 WSRALN 188
SRA +
Sbjct: 221 RSRAFS 226
>Medtr1g115980.1 | SET domain protein | HC | chr1:52408798-52417465
| 20130731
Length = 451
Score = 50.1 bits (118), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 70/324 (21%), Positives = 118/324 (36%), Gaps = 80/324 (24%)
Query: 8 LETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDNDVSDG-VLVVVPLDLAITPMRVLQD 66
L F+ WL+ + +S G +F+ + G ++ VP L +TP
Sbjct: 34 LHNFLPWLENKASSTISSSLSISNSSYGNSLFASKSIQTGDCILQVPYSLQLTPDN---- 89
Query: 67 PLLGPECRAMFEEGDVDDRXXXXXXXTVER-LRKNSPWKPYLAMLP--TTFQNTLWFSDD 123
L PE + E DV + + + L ++S W PY++ LP NT+++++
Sbjct: 90 --LPPEIKPFISE-DVGNIAKLATVLLIHKNLGQDSEWHPYISCLPPQAEMHNTIFWNES 146
Query: 124 ELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANSVFW 183
EL+ +R +++Y+ T QK S+IE +DFL VF
Sbjct: 147 ELEMIRQSSVYQETIYQK------------------------SQIE---KDFLEIKPVFQ 179
Query: 184 SRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTVQGDTVWVE-- 241
P +S+ ++ +C T+ G W
Sbjct: 180 ------PFCQSFGDFTWKDFMHAC------------------------TLVGSRAWGSTK 209
Query: 242 --GLVPGIDFCNHDLKPKATWEVDGTGLTTGVPFSMYLISAAQSPLQTDQEISISYGTKG 299
L+P DF NHD +A D + V + +++ I YG
Sbjct: 210 GLSLIPFADFLNHDGISEAIVMSDDDNKCSEV--------FSDRDYVPGEQVLIRYGKFS 261
Query: 300 NEELLYLYGFVIDGNTDDYLMVHY 323
N L+ +GF I N D + + Y
Sbjct: 262 NATLMLDFGFTIPYNIYDQVQIQY 285
>Medtr1g115980.2 | SET domain protein | HC | chr1:52408798-52417465
| 20130731
Length = 442
Score = 49.7 bits (117), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 72/324 (22%), Positives = 119/324 (36%), Gaps = 80/324 (24%)
Query: 8 LETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDNDVSDG-VLVVVPLDLAITPMRVLQD 66
L F+ WL+ + +S G +F+ + G ++ VP L +TP
Sbjct: 34 LHNFLPWLENKASSTISSSLSISNSSYGNSLFASKSIQTGDCILQVPYSLQLTPDN---- 89
Query: 67 PLLGPECRAMFEEGDVDDRXXXXXXXTVER-LRKNSPWKPYLAMLP--TTFQNTLWFSDD 123
L PE + E DV + + + L ++S W PY++ LP NT+++++
Sbjct: 90 --LPPEIKPFISE-DVGNIAKLATVLLIHKNLGQDSEWHPYISCLPPQAEMHNTIFWNES 146
Query: 124 ELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANSVFW 183
EL+ +R +++Y+ T QK S+IE +DFL VF
Sbjct: 147 ELEMIRQSSVYQETIYQK------------------------SQIE---KDFLEIKPVFQ 179
Query: 184 SRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTVQGDTVWVE-- 241
P +S+ ++ +C T+ G W
Sbjct: 180 ------PFCQSFGDFTWKDFMHAC------------------------TLVGSRAWGSTK 209
Query: 242 --GLVPGIDFCNHDLKPKATWEVDGTGLTTGVPFSMYLISAAQSPLQTDQEISISYGTKG 299
L+P DF NHD +A D + V FS +++ I YG
Sbjct: 210 GLSLIPFADFLNHDGISEAIVMSDDDNKCSEV-FS-------DRDYVPGEQVLIRYGKFS 261
Query: 300 NEELLYLYGFVIDGNTDDYLMVHY 323
N L+ +GF I N D + + Y
Sbjct: 262 NATLMLDFGFTIPYNIYDQVQIQY 285
>Medtr1g115980.3 | SET domain protein | HC | chr1:52408798-52417465
| 20130731
Length = 412
Score = 49.7 bits (117), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 71/320 (22%), Positives = 117/320 (36%), Gaps = 72/320 (22%)
Query: 8 LETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDNDVSDG-VLVVVPLDLAITPMRVLQD 66
L F+ WL+ + +S G +F+ + G ++ VP L +TP
Sbjct: 34 LHNFLPWLENKASSTISSSLSISNSSYGNSLFASKSIQTGDCILQVPYSLQLTPDN---- 89
Query: 67 PLLGPECRAMFEEGDVDDRXXXXXXXTVER-LRKNSPWKPYLAMLP--TTFQNTLWFSDD 123
L PE + E DV + + + L ++S W PY++ LP NT+++++
Sbjct: 90 --LPPEIKPFISE-DVGNIAKLATVLLIHKNLGQDSEWHPYISCLPPQAEMHNTIFWNES 146
Query: 124 ELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANSVFW 183
EL+ +R +++Y+ T QK S+IE +DFL VF
Sbjct: 147 ELEMIRQSSVYQETIYQK------------------------SQIE---KDFLEIKPVFQ 179
Query: 184 SRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTVQGDTVWVEGL 243
P +S+ ++ +C L K L
Sbjct: 180 ------PFCQSFGDFTWKDFMHACTLVGSRAWGSTKGL--------------------SL 213
Query: 244 VPGIDFCNHDLKPKATWEVDGTGLTTGVPFSMYLISAAQSPLQTDQEISISYGTKGNEEL 303
+P DF NHD +A D + V FS +++ I YG N L
Sbjct: 214 IPFADFLNHDGISEAIVMSDDDNKCSEV-FS-------DRDYVPGEQVLIRYGKFSNATL 265
Query: 304 LYLYGFVIDGNTDDYLMVHY 323
+ +GF I N D + + Y
Sbjct: 266 MLDFGFTIPYNIYDQVQIQY 285
>Medtr1g115980.5 | SET domain protein | HC | chr1:52408798-52417465
| 20130731
Length = 421
Score = 49.3 bits (116), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 69/320 (21%), Positives = 116/320 (36%), Gaps = 72/320 (22%)
Query: 8 LETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDNDVSDG-VLVVVPLDLAITPMRVLQD 66
L F+ WL+ + +S G +F+ + G ++ VP L +TP
Sbjct: 34 LHNFLPWLENKASSTISSSLSISNSSYGNSLFASKSIQTGDCILQVPYSLQLTPDN---- 89
Query: 67 PLLGPECRAMFEEGDVDDRXXXXXXXTVER-LRKNSPWKPYLAMLP--TTFQNTLWFSDD 123
L PE + E DV + + + L ++S W PY++ LP NT+++++
Sbjct: 90 --LPPEIKPFISE-DVGNIAKLATVLLIHKNLGQDSEWHPYISCLPPQAEMHNTIFWNES 146
Query: 124 ELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANSVFW 183
EL+ +R +++Y+ T QK S+IE +DFL VF
Sbjct: 147 ELEMIRQSSVYQETIYQK------------------------SQIE---KDFLEIKPVFQ 179
Query: 184 SRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTVQGDTVWVEGL 243
P +S+ ++ +C L K L
Sbjct: 180 ------PFCQSFGDFTWKDFMHACTLVGSRAWGSTKGL--------------------SL 213
Query: 244 VPGIDFCNHDLKPKATWEVDGTGLTTGVPFSMYLISAAQSPLQTDQEISISYGTKGNEEL 303
+P DF NHD +A D + V + +++ I YG N L
Sbjct: 214 IPFADFLNHDGISEAIVMSDDDNKCSEV--------FSDRDYVPGEQVLIRYGKFSNATL 265
Query: 304 LYLYGFVIDGNTDDYLMVHY 323
+ +GF I N D + + Y
Sbjct: 266 MLDFGFTIPYNIYDQVQIQY 285