Miyakogusa Predicted Gene
- Lj1g3v2624870.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2624870.2 Non Chatacterized Hit- tr|I1N3L2|I1N3L2_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,79.42,0,SET
domain,NULL; seg,NULL; no description,NULL; UNCHARACTERIZED PUTATIVE
METHYLTRANSFERASE,NULL,CUFF.29312.2
(555 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G01920.2 | Symbols: | SET domain-containing protein | chr1:3... 640 0.0
AT1G01920.1 | Symbols: | SET domain-containing protein | chr1:3... 630 0.0
AT1G14030.1 | Symbols: | Rubisco methyltransferase family prote... 56 7e-08
>AT1G01920.2 | Symbols: | SET domain-containing protein |
chr1:316204-319507 FORWARD LENGTH=547
Length = 547
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/551 (58%), Positives = 383/551 (69%), Gaps = 23/551 (4%)
Query: 5 EAKLETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDND--VSDGVLVVVPLDLAITPMR 62
EAKLE F+ WLQ NG ELRGC IKY DS KGFGIF+ SD VL+VVPLDLAITPMR
Sbjct: 7 EAKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMR 66
Query: 63 VLQDPLLGPECRAMFEEGDVDDRXXXXXXXTVERLRKNSPWKPYLAMLPTTFQNTLWFSD 122
VLQDPLLGPEC+ MFE+G VDDR T+ERLR NS WKPYL MLPT F N LWFSD
Sbjct: 67 VLQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFSD 126
Query: 123 DELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANSVF 182
D++ EL+GT LY ATELQKK LLSLY SE + FE FLWANSVF
Sbjct: 127 DDILELKGTNLYHATELQKKKLLSLYHDKVEVLVTKLLILDGDSESKVSFEHFLWANSVF 186
Query: 183 WSRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTV----QGDTV 238
WSRALN+PLP S+VFP+ Q+ C E N KEI GDT+
Sbjct: 187 WSRALNIPLPHSFVFPQSQDDTGECT-STSESPETAPVNSNEEKEIQAQPAPSVGSGDTI 245
Query: 239 WVEGLVPGIDFCNHDLKPKATWEVDGTGLTTGVPFSMYLISAAQSPLQTDQEISISYGTK 298
WVEGLVPGIDFCNHDLKP ATWEVDG G + VPFSMYL+S AQ P+ +EISISYG K
Sbjct: 246 WVEGLVPGIDFCNHDLKPVATWEVDGIGSVSRVPFSMYLLSVAQRPI-PKKEISISYGNK 304
Query: 299 GNEELLYLYGFVIDGNTDDYLMVHYPTEAISTISFSEAKSQLLEVQNAEMRCLLPKALLE 358
GNEELLYLYGFVID N DDYLMVHYP EAI +I FS++K QLLE QNA++RCLLPK++L
Sbjct: 305 GNEELLYLYGFVIDNNPDDYLMVHYPVEAIPSIPFSDSKGQLLEAQNAQLRCLLPKSVLN 364
Query: 359 HGFFPLGTRNSGENNKSNQV--CNYSWSGQRKMPSYVNKLVFPEKFIATLRTIAMQEDEL 416
HGFFP T E+++ V CN+SWSG+RKMP+Y+NKLVFPE F+ LRTIAMQE+E+
Sbjct: 365 HGFFPRTTSVIRESDEKETVRSCNFSWSGKRKMPTYMNKLVFPEDFMTGLRTIAMQEEEI 424
Query: 417 FKVSSMLEELAGPERERQPSDTDVQSAIWEVCGDSGAXXXXXXXXXXXXXXXEESSGTEE 476
+KVS+MLEEL + QPS+T+V+ A+WE CGDSGA EE+SGTEE
Sbjct: 425 YKVSAMLEELVESRQGEQPSETEVRMAVWEACGDSGALQLLVDLLNSKMMKLEENSGTEE 484
Query: 477 SDHDLLKKALIIDSQDDSKQCTSNLSTFANDSEESKLMMTRNKWSAIVYRQGQKELTRLF 536
D LL++A +++S ++S+ M+RNKWS++VYR+GQK+LTRL
Sbjct: 485 QDARLLEEACVLESHEESRDLDGR-------------RMSRNKWSSVVYRRGQKQLTRLL 531
Query: 537 LKEAEHALELS 547
LKEAEHAL L+
Sbjct: 532 LKEAEHALHLA 542
>AT1G01920.1 | Symbols: | SET domain-containing protein |
chr1:316204-319507 FORWARD LENGTH=572
Length = 572
Score = 630 bits (1624), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/575 (56%), Positives = 388/575 (67%), Gaps = 46/575 (8%)
Query: 5 EAKLETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDND--VSDGVLVVVPLDLAITPMR 62
EAKLE F+ WLQ NG ELRGC IKY DS KGFGIF+ SD VL+VVPLDLAITPMR
Sbjct: 7 EAKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMR 66
Query: 63 VLQDPLLGPECRAMFEEGDVDDRXXXXXXXTVERLRKNSPWKPYLAMLPTTFQNTLWFSD 122
VLQDPLLGPEC+ MFE+G VDDR T+ERLR NS WKPYL MLPT F N LWFSD
Sbjct: 67 VLQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFSD 126
Query: 123 DELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANSVF 182
D++ EL+GT LY ATELQKK LLSLY SE + FE FLWANSVF
Sbjct: 127 DDILELKGTNLYHATELQKKKLLSLYHDKVEVLVTKLLILDGDSESKVSFEHFLWANSVF 186
Query: 183 WSRALNLPLPRSYVFPEMQEVQESCIL-----EADEKISQDKKQDNLTKEITCNTV-QGD 236
WSRALN+PLP S+VFP+ Q+ C E S ++K +LT +V GD
Sbjct: 187 WSRALNIPLPHSFVFPQSQDDTGECTSTSESPETAPVNSNEEKGKSLTSAQPAPSVGSGD 246
Query: 237 TVWVEGLVPGIDFCNHDLKPKATWEVDGTGLTTGVPFSMYLISAAQSPLQTDQEISISYG 296
T+WVEGLVPGIDFCNHDLKP ATWEVDG G + VPFSMYL+S AQ P+ +EISISYG
Sbjct: 247 TIWVEGLVPGIDFCNHDLKPVATWEVDGIGSVSRVPFSMYLLSVAQRPI-PKKEISISYG 305
Query: 297 TKGNEELLYLYGFVIDGNTDDYLM----------------------VHYPTEAISTISFS 334
KGNEELLYLYGFVID N DDYLM VHYP EAI +I FS
Sbjct: 306 NKGNEELLYLYGFVIDNNPDDYLMIKEMLVNFVLTSVVTFNNGFIQVHYPVEAIPSIPFS 365
Query: 335 EAKSQLLEVQNAEMRCLLPKALLEHGFFPLGTRNSGENNKSNQV--CNYSWSGQRKMPSY 392
++K QLLE QNA++RCLLPK++L HGFFP T E+++ V CN+SWSG+RKMP+Y
Sbjct: 366 DSKGQLLEAQNAQLRCLLPKSVLNHGFFPRTTSVIRESDEKETVRSCNFSWSGKRKMPTY 425
Query: 393 VNKLVFPEKFIATLRTIAMQEDELFKVSSMLEELAGPERERQPSDTDVQSAIWEVCGDSG 452
+NKLVFPE F+ LRTIAMQE+E++KVS+MLEEL + QPS+T+V+ A+WE CGDSG
Sbjct: 426 MNKLVFPEDFMTGLRTIAMQEEEIYKVSAMLEELVESRQGEQPSETEVRMAVWEACGDSG 485
Query: 453 AXXXXXXXXXXXXXXXEESSGTEESDHDLLKKALIIDSQDDSKQCTSNLSTFANDSEESK 512
A EE+SGTEE D LL++A +++S ++S+
Sbjct: 486 ALQLLVDLLNSKMMKLEENSGTEEQDARLLEEACVLESHEESRDLDGR------------ 533
Query: 513 LMMTRNKWSAIVYRQGQKELTRLFLKEAEHALELS 547
M+RNKWS++VYR+GQK+LTRL LKEAEHAL L+
Sbjct: 534 -RMSRNKWSSVVYRRGQKQLTRLLLKEAEHALHLA 567
>AT1G14030.1 | Symbols: | Rubisco methyltransferase family protein
| chr1:4805493-4807440 REVERSE LENGTH=482
Length = 482
Score = 55.8 bits (133), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 121/316 (38%), Gaps = 69/316 (21%)
Query: 2 ENPEAKLETFMQWLQTNGVELRGCKIKYCDSRKGFGIFSDNDVS-DGVLVVVPLDLAITP 60
E PE + F +WL+ GV + +G G+ + D+ + V++ +P L I P
Sbjct: 44 ELPE-NVRNFWKWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRLWINP 102
Query: 61 MRVLQDPLLGPECRAMFEEGDVDDRXXXXXXXTVERLRKNSPWKPYLAMLPTTFQNTLWF 120
V + GP C G + E+ + S W+ YL MLP + +T+++
Sbjct: 103 ETVTASKI-GPLC------GGLKPWVSVALFLIREKYEEESSWRVYLDMLPQSTDSTVFW 155
Query: 121 SDDELQELRGTTLYRATELQKKSLLSLYETXXXXXXXXXXXXXXXSEIESCFEDFLWANS 180
S++EL EL+GT L +T L K + S I +DF+WA
Sbjct: 156 SEEELAELKGTQLL-STTLGVKEYVENEFLKLEQEILLPNKDLFSSRI--TLDDFIWAFG 212
Query: 181 VFWSRALNLPLPRSYVFPEMQEVQESCILEADEKISQDKKQDNLTKEITCNTVQGDTVWV 240
+ SRA + + NL
Sbjct: 213 ILKSRAFS-----------------------------RLRGQNLV--------------- 228
Query: 241 EGLVPGIDFCNHDLKPKA---TWEVDGTGLTTGVPFSMYLISAAQSP--LQTDQEISISY 295
L+P D NH+ K +E+ G GL FS L+ + +SP ++ +++ I Y
Sbjct: 229 --LIPLADLINHNPAIKTEDYAYEIKGAGL-----FSRDLLFSLKSPVYVKAGEQVYIQY 281
Query: 296 G-TKGNEELLYLYGFV 310
K N EL YGFV
Sbjct: 282 DLNKSNAELALDYGFV 297