Miyakogusa Predicted Gene
- Lj6g3v0727750.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0727750.2 Non Chatacterized Hit- tr|I1N087|I1N087_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.19659 PE,85.45,0,no
description,NULL; RCMTFAMILY,RNA (C5-cytosine) methyltransferase;
seg,NULL; Nol1_Nop2_Fmu,Bacteri,CUFF.58216.2
(276 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score  E
Sequences producing significant alignments:                        (bits)  Value
AT4G26600.1 | Symbols: | S-adenosyl-L-methionine-dependent meth...    427  e-120
AT5G55920.1 | Symbols: OLI2 | S-adenosyl-L-methionine-dependent ...   427  e-120
AT3G13180.1 | Symbols: | NOL1/NOP2/sun family protein / antiter...    112  3e-25
AT5G26180.2 | Symbols: | S-adenosyl-L-methionine-dependent meth...     91  8e-19
AT5G26180.1 | Symbols: | S-adenosyl-L-methionine-dependent meth...     91  8e-19
AT4G17590.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Bacterial ...     79  3e-15
AT4G17590.2 | Symbols: | FUNCTIONS IN: molecular_function unkno...     79  4e-15
AT2G22400.1 | Symbols: | S-adenosyl-L-methionine-dependent meth...     78  6e-15
AT4G40000.1 | Symbols: | S-adenosyl-L-methionine-dependent meth...     78  8e-15
AT1G06560.1 | Symbols: | NOL1/NOP2/sun family protein | chr1:20...     63  2e-10
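The summary lines above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output itself; the function names `parse_evalue` and `parse_hit_line` are hypothetical helpers introduced for illustration. It assumes each summary line ends with a bit score and an E-value separated by whitespace, and that the sequence identifier is the text before the first `|`. Note that this BLAST version prints very small E-values without a mantissa (`e-120`), which Python's `float()` does not accept directly.

```python
def parse_evalue(text):
    """Convert a BLAST E-value string to float.

    Very small E-values appear in this report without a leading
    mantissa (e.g. "e-120"), so "1" is prepended before conversion.
    """
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_line(line):
    """Split one summary line into (sequence id, bit score, E-value).

    Assumption: the last two whitespace-separated fields are the
    bit score and the E-value; everything before them is description.
    """
    description, score, evalue = line.rsplit(None, 2)
    seq_id = description.split("|", 1)[0].strip()
    return seq_id, float(score), parse_evalue(evalue)


# Two lines copied from the summary table above.
lines = [
    "AT4G26600.1 | Symbols: | S-adenosyl-L-methionine-dependent meth... 427 e-120",
    "AT3G13180.1 | Symbols: | NOL1/NOP2/sun family protein / antiter... 112 3e-25",
]
hits = [parse_hit_line(line) for line in lines]
for seq_id, score, evalue in hits:
    print(seq_id, score, evalue)
```

Splitting from the right keeps the parser independent of how many `|` separators or spaces the truncated description contains.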
>AT4G26600.1 | Symbols: | S-adenosyl-L-methionine-dependent
methyltransferases superfamily protein |
chr4:13419629-13423418 FORWARD LENGTH=671
Length = 671
Score = 427 bits (1099), Expect = e-120, Method: Compositional matrix adjust.
Identities = 196/233 (84%), Positives = 219/233 (93%)
Query: 1 MAGFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPR 60
+AGFYMLQSASSFLPVMALAP+EKERVVDMAAAPGGKTTY+AALMKNTGII+ANEMKVPR
Sbjct: 320 LAGFYMLQSASSFLPVMALAPREKERVVDMAAAPGGKTTYVAALMKNTGIIYANEMKVPR 379
Query: 61 LKSLTANLHRMGVSNTVVCNYDGKELPKVLGLNTVDRVLLDAPCSGTGVISKDESVKTSK 120
LKSL+ANLHRMGV+NT+VCNYDG+EL KVLG ++VDRVLLDAPCSGTGVISKDESVKTSK
Sbjct: 380 LKSLSANLHRMGVTNTIVCNYDGRELTKVLGQSSVDRVLLDAPCSGTGVISKDESVKTSK 439
Query: 121 DLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYALKRRDVKL 180
+DI+K A LQK+LIL AID+VDANSK+GGY+VYSTCS+M+ ENEAVIDYALK RDVKL
Sbjct: 440 SADDIKKFAHLQKQLILGAIDLVDANSKTGGYIVYSTCSVMIPENEAVIDYALKNRDVKL 499
Query: 181 VACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVAKLKKLSNSKQ 233
V CGL+FGRPGF+ FRE RFH SL+KTRRFYPHVHNMDGFFVAKLKK+SN+ Q
Sbjct: 500 VPCGLDFGRPGFSSFREHRFHPSLEKTRRFYPHVHNMDGFFVAKLKKMSNAMQ 552
>AT5G55920.1 | Symbols: OLI2 | S-adenosyl-L-methionine-dependent
methyltransferases superfamily protein |
chr5:22645742-22649383 REVERSE LENGTH=682
Length = 682
Score = 427 bits (1098), Expect = e-120, Method: Compositional matrix adjust.
Identities = 195/236 (82%), Positives = 221/236 (93%)
Query: 1 MAGFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPR 60
+AG+YMLQ ASSFLPVMALAP+E ER+VD+AAAPGGKTTYIAALMKNTG+I+ANEMKVPR
Sbjct: 337 LAGYYMLQGASSFLPVMALAPRENERIVDVAAAPGGKTTYIAALMKNTGLIYANEMKVPR 396
Query: 61 LKSLTANLHRMGVSNTVVCNYDGKELPKVLGLNTVDRVLLDAPCSGTGVISKDESVKTSK 120
LKSLTANLHRMGV+NT+VCNYDG+ELPKVLG NTVDRVLLDAPCSGTG+ISKDESVK +K
Sbjct: 397 LKSLTANLHRMGVTNTIVCNYDGRELPKVLGQNTVDRVLLDAPCSGTGIISKDESVKITK 456
Query: 121 DLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYALKRRDVKL 180
+++I+K A LQK+L+LAAIDMVDANSK+GGY+VYSTCSIMV ENEAVIDYALK+RDVKL
Sbjct: 457 TMDEIKKFAHLQKQLLLAAIDMVDANSKTGGYIVYSTCSIMVTENEAVIDYALKKRDVKL 516
Query: 181 VACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVAKLKKLSNSKQGAK 236
V CGL+FGR GFT+FRE RF SLDKTRRFYPHVHNMDGFFVAKLKK+SN KQ ++
Sbjct: 517 VTCGLDFGRKGFTRFREHRFQPSLDKTRRFYPHVHNMDGFFVAKLKKMSNVKQSSE 572
>AT3G13180.1 | Symbols: | NOL1/NOP2/sun family protein /
antitermination NusB domain-containing protein |
chr3:4236326-4239966 REVERSE LENGTH=523
Length = 523
Score = 112 bits (279), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 122/228 (53%), Gaps = 23/228 (10%)
Query: 3 GFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPRLK 62
G +Q S+ L V + PQ ER++D AAPGGKT ++A+ +K G+I+A ++ RL+
Sbjct: 310 GICSVQDESAGLIVSVVKPQPGERIMDACAAPGGKTLFMASCLKGQGMIYAMDVNEGRLR 369
Query: 63 SL--TANLHRM-GVSNTVVCNYDGKELPKVLGLNTV--DRVLLDAPCSGTGVISKDESVK 117
L TA H++ G+ T+ +L N V D+VLLDAPCSG GV+SK ++
Sbjct: 370 ILGETAKSHQVDGLITTI-----HSDLRVFAETNEVQYDKVLLDAPCSGLGVLSKRADLR 424
Query: 118 TSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYALKRRD 177
++ LED+ + +LQ EL+ +A +V K GG LVYSTCSI ENE ++ L R
Sbjct: 425 WNRKLEDMLELTKLQDELLDSASKLV----KHGGVLVYSTCSIDPEENEGRVEAFLLRHP 480
Query: 178 VKLVACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVAKL 225
+ +F F Q F S P H++DG F A+L
Sbjct: 481 EFTIDPVTSFVPSSFVT--SQGFFLS-------NPVKHSLDGAFAARL 519
>AT5G26180.2 | Symbols: | S-adenosyl-L-methionine-dependent
methyltransferases superfamily protein |
chr5:9149253-9152595 FORWARD LENGTH=567
Length = 567
Score = 90.9 bits (224), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 79/251 (31%), Positives = 106/251 (42%), Gaps = 28/251 (11%)
Query: 3 GFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPRLK 62
G LQ +S + AL PQ V+D +APG KT ++AALM+ G I A E+ R+K
Sbjct: 284 GRIFLQGKASSMVAAALQPQAGWEVLDACSAPGNKTIHLAALMEGQGKIIACELNEERVK 343
Query: 63 SLTANLHRMGVSNTVVCNYDGKEL-PKVLGLNTVDRVLLDAPCSGTGVI---------SK 112
L + G SN VC+ D L PK + +LLD CSG+G I S
Sbjct: 344 RLEHTIKLSGASNIEVCHGDFLGLNPKDPSFAKIRAILLDPSCSGSGTITDRLDHLLPSH 403
Query: 113 DESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYA 172
E + D + K A QK+ + A+ +VYSTCSI ENE V+
Sbjct: 404 SEDNNMNYDSMRLHKLAVFQKKALAHALSFPKVER-----VVYSTCSIYQIENEDVVSSV 458
Query: 173 LKRRD---VKLVACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVA---KLK 226
L KL + R G F +D V + +GFF+A +
Sbjct: 459 LPLASSLGFKLATPFPQWQRRGLPVFAGSEHLLRMDP-------VEDKEGFFIALFVRAN 511
Query: 227 KLSNSKQGAKP 237
KL N K P
Sbjct: 512 KLDNPKSSELP 522
>AT5G26180.1 | Symbols: | S-adenosyl-L-methionine-dependent
methyltransferases superfamily protein |
chr5:9149253-9152595 FORWARD LENGTH=567
Length = 567
Score = 90.9 bits (224), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 79/251 (31%), Positives = 106/251 (42%), Gaps = 28/251 (11%)
Query: 3 GFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPRLK 62
G LQ +S + AL PQ V+D +APG KT ++AALM+ G I A E+ R+K
Sbjct: 284 GRIFLQGKASSMVAAALQPQAGWEVLDACSAPGNKTIHLAALMEGQGKIIACELNEERVK 343
Query: 63 SLTANLHRMGVSNTVVCNYDGKEL-PKVLGLNTVDRVLLDAPCSGTGVI---------SK 112
L + G SN VC+ D L PK + +LLD CSG+G I S
Sbjct: 344 RLEHTIKLSGASNIEVCHGDFLGLNPKDPSFAKIRAILLDPSCSGSGTITDRLDHLLPSH 403
Query: 113 DESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYA 172
E + D + K A QK+ + A+ +VYSTCSI ENE V+
Sbjct: 404 SEDNNMNYDSMRLHKLAVFQKKALAHALSFPKVER-----VVYSTCSIYQIENEDVVSSV 458
Query: 173 LKRRD---VKLVACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVA---KLK 226
L KL + R G F +D V + +GFF+A +
Sbjct: 459 LPLASSLGFKLATPFPQWQRRGLPVFAGSEHLLRMDP-------VEDKEGFFIALFVRAN 511
Query: 227 KLSNSKQGAKP 237
KL N K P
Sbjct: 512 KLDNPKSSELP 522
>AT4G17590.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Bacterial Fmu
(Sun)/eukaryotic nucleolar NOL1/Nop2p
(InterPro:IPR001678); BEST Arabidopsis thaliana protein
match is: S-adenosyl-L-methionine-dependent
methyltransferases superfamily protein
(TAIR:AT4G26600.1); Has 459 Blast hits to 368 proteins
in 186 species: Archae - 2; Bacteria - 0; Metazoa - 137;
Fungi - 144; Plants - 100; Viruses - 0; Other Eukaryotes
- 76 (source: NCBI BLink). | chr4:9800843-9802591
REVERSE LENGTH=201
Length = 201
Score = 79.0 bits (193), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 64/98 (65%), Gaps = 4/98 (4%)
Query: 49 GIIFANEMKVPRLKSLTANLHRMGVSNTVVCNYD-GKELPKVLGLNTVDRVLLDAPCSGT 107
GIIFAN L SL ANLHRMG++NTVV NY+ +L +V +N+ D VL++AP + T
Sbjct: 55 GIIFANASTEHLLGSLYANLHRMGITNTVVSNYNINTKLSRVFHINSKDMVLVNAPSTRT 114
Query: 108 GVISKDESVKTSKDLE-DIQKCAQLQKELILAAIDMVD 144
G+IS+ S+K S + E DIQ+ LQK I+ + M D
Sbjct: 115 GLISEFGSIKMSINEEADIQRFGVLQK--IVEGVHMSD 150
>AT4G17590.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; CONTAINS InterPro DOMAIN/s:
Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
(InterPro:IPR001678); BEST Arabidopsis thaliana protein
match is: S-adenosyl-L-methionine-dependent
methyltransferases superfamily protein
(TAIR:AT4G26600.1); Has 362 Blast hits to 362 proteins
in 185 species: Archae - 2; Bacteria - 0; Metazoa - 115;
Fungi - 130; Plants - 51; Viruses - 0; Other Eukaryotes
- 64 (source: NCBI BLink). | chr4:9800843-9802591
REVERSE LENGTH=187
Length = 187
Score = 78.6 bits (192), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 64/98 (65%), Gaps = 4/98 (4%)
Query: 49 GIIFANEMKVPRLKSLTANLHRMGVSNTVVCNYD-GKELPKVLGLNTVDRVLLDAPCSGT 107
GIIFAN L SL ANLHRMG++NTVV NY+ +L +V +N+ D VL++AP + T
Sbjct: 55 GIIFANASTEHLLGSLYANLHRMGITNTVVSNYNINTKLSRVFHINSKDMVLVNAPSTRT 114
Query: 108 GVISKDESVKTSKDLE-DIQKCAQLQKELILAAIDMVD 144
G+IS+ S+K S + E DIQ+ LQK I+ + M D
Sbjct: 115 GLISEFGSIKMSINEEADIQRFGVLQK--IVEGVHMSD 150
>AT2G22400.1 | Symbols: | S-adenosyl-L-methionine-dependent
methyltransferases superfamily protein |
chr2:9504823-9508788 REVERSE LENGTH=808
Length = 808
Score = 78.2 bits (191), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 88/193 (45%), Gaps = 29/193 (15%)
Query: 8 QSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKN--------TGIIFANEMKVP 59
Q A S +P + L V+DM AAPG KT + ++ G++ AN++
Sbjct: 170 QEAVSMVPPLFLDVHPDHFVLDMCAAPGSKTFQLLEIIHEASEPGSLPNGLVVANDVDFK 229
Query: 60 RLKSLTANLHRMGVSNTVVCNYDGKELP---------------KVLGLNTV--DRVLLDA 102
R L RM SN +V N++G++ P + + +N + DRVL D
Sbjct: 230 RSNLLIHQTKRMCTSNLIVTNHEGQQFPGCRLNKSRASEKGISENMPINQLAFDRVLCDV 289
Query: 103 PCSGTGVISKDESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMV 162
PCSG G + K + + LQ L + + ++ K GG ++YSTCS+
Sbjct: 290 PCSGDGTLRKAPDIWRKWNSGMGNGLHSLQIILAMRGLSLL----KVGGKMIYSTCSMNP 345
Query: 163 AENEAVIDYALKR 175
E+EAV+ L+R
Sbjct: 346 VEDEAVVAEILRR 358
>AT4G40000.1 | Symbols: | S-adenosyl-L-methionine-dependent
methyltransferases superfamily protein |
chr4:18543989-18547443 REVERSE LENGTH=783
Length = 783
Score = 77.8 bits (190), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 96/225 (42%), Gaps = 34/225 (15%)
Query: 2 AGFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNT--------GIIFA 53
AG Q + S +P + L V+DM AAPG KT + ++ + G++ A
Sbjct: 157 AGNMTRQESVSMVPPLFLDVHPDHFVLDMCAAPGSKTFQLLEIIHESSEPGSLPNGMVVA 216
Query: 54 NEMKVPRLKSLTANLHRMGVSNTVVCNYDGKELPKVLGLNTV----------------DR 97
N++ R L R +N +V N +G+ P T+ DR
Sbjct: 217 NDVDYKRSNLLIHQTKRTCTTNLMVTNNEGQHFPSCNTKRTLSVASETNPHPIDQLLFDR 276
Query: 98 VLLDAPCSGTGVISKDESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYST 157
VL D PCSG G + K + + LQ L + + ++ K GG +VYST
Sbjct: 277 VLCDVPCSGDGTLRKAPDIWRRWNSGSGNGLHSLQVVLAMRGLSLL----KVGGRMVYST 332
Query: 158 CSIMVAENEAVIDYALKR--RDVKLVACGLNFG----RPGFTKFR 196
CS+ E+EAV+ L+R V+LV RPG TK++
Sbjct: 333 CSMNPIEDEAVVAEILRRCGCSVELVDVSDKLPELIRRPGLTKWK 377
>AT1G06560.1 | Symbols: | NOL1/NOP2/sun family protein |
chr1:2007660-2011824 FORWARD LENGTH=599
Length = 599
Score = 62.8 bits (151), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 72/143 (50%), Gaps = 16/143 (11%)
Query: 93 NTVDRVLLDAPCSGTGVISKDESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGY 152
N+ DRVLLDAPCS G+ + + + ++ Q++++ A+ +V + GG
Sbjct: 457 NSFDRVLLDAPCSALGL--RPRLFAGLETVVSLRNHGWYQRKMLDQAVQLV----RVGGI 510
Query: 153 LVYSTCSIMVAENEAVIDYAL-KRRDVKLVACGLNFGRPGFT---KFREQRFHRSL---- 204
LVYSTC+I +ENEAV+ YAL K R + L G PG +F + L
Sbjct: 511 LVYSTCTINPSENEAVVRYALDKYRFLSLAPQHPRIGGPGLVGRCEFPDGYIEEWLKPGE 570
Query: 205 -DKTRRFYPHVH-NMDGFFVAKL 225
+ ++F P + GFF+AK
Sbjct: 571 EELVQKFDPSSELDTIGFFIAKF 593
Score = 58.2 bits (139), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 55/101 (54%), Gaps = 9/101 (8%)
Query: 1 MAGFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEM---K 57
+ G LQ+ S + AL PQ+ ER++DM AAPGGKTT IA LM + G I A + K
Sbjct: 274 LEGEIFLQNLPSIIVAHALDPQKGERILDMCAAPGGKTTAIAILMNDEGEIVAADRSHNK 333
Query: 58 VPRLKSLTANLHRMGVSNTVVCNYDGKE---LPKVLGLNTV 95
V +++L+A MG + C D + LP L +T+
Sbjct: 334 VLVVQNLSA---EMGFTCITTCKLDALKSVCLPTTLNESTI 371