Miyakogusa Predicted Gene

Lj6g3v0727750.2

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v0727750.2 Non Characterized Hit- tr|I1N087|I1N087_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.19659 PE,85.45,0,no
description,NULL; RCMTFAMILY,RNA (C5-cytosine) methyltransferase;
seg,NULL; Nol1_Nop2_Fmu,Bacteri,CUFF.58216.2
         (276 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G26600.1 | Symbols:  | S-adenosyl-L-methionine-dependent meth...   427   e-120
AT5G55920.1 | Symbols: OLI2 | S-adenosyl-L-methionine-dependent ...   427   e-120
AT3G13180.1 | Symbols:  | NOL1/NOP2/sun family protein / antiter...   112   3e-25
AT5G26180.2 | Symbols:  | S-adenosyl-L-methionine-dependent meth...    91   8e-19
AT5G26180.1 | Symbols:  | S-adenosyl-L-methionine-dependent meth...    91   8e-19
AT4G17590.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Bacterial ...    79   3e-15
AT4G17590.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...    79   4e-15
AT2G22400.1 | Symbols:  | S-adenosyl-L-methionine-dependent meth...    78   6e-15
AT4G40000.1 | Symbols:  | S-adenosyl-L-methionine-dependent meth...    78   8e-15
AT1G06560.1 | Symbols:  | NOL1/NOP2/sun family protein | chr1:20...    63   2e-10

>AT4G26600.1 | Symbols:  | S-adenosyl-L-methionine-dependent
           methyltransferases superfamily protein |
           chr4:13419629-13423418 FORWARD LENGTH=671
          Length = 671

 Score =  427 bits (1099), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 196/233 (84%), Positives = 219/233 (93%)

Query: 1   MAGFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPR 60
           +AGFYMLQSASSFLPVMALAP+EKERVVDMAAAPGGKTTY+AALMKNTGII+ANEMKVPR
Sbjct: 320 LAGFYMLQSASSFLPVMALAPREKERVVDMAAAPGGKTTYVAALMKNTGIIYANEMKVPR 379

Query: 61  LKSLTANLHRMGVSNTVVCNYDGKELPKVLGLNTVDRVLLDAPCSGTGVISKDESVKTSK 120
           LKSL+ANLHRMGV+NT+VCNYDG+EL KVLG ++VDRVLLDAPCSGTGVISKDESVKTSK
Sbjct: 380 LKSLSANLHRMGVTNTIVCNYDGRELTKVLGQSSVDRVLLDAPCSGTGVISKDESVKTSK 439

Query: 121 DLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYALKRRDVKL 180
             +DI+K A LQK+LIL AID+VDANSK+GGY+VYSTCS+M+ ENEAVIDYALK RDVKL
Sbjct: 440 SADDIKKFAHLQKQLILGAIDLVDANSKTGGYIVYSTCSVMIPENEAVIDYALKNRDVKL 499

Query: 181 VACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVAKLKKLSNSKQ 233
           V CGL+FGRPGF+ FRE RFH SL+KTRRFYPHVHNMDGFFVAKLKK+SN+ Q
Sbjct: 500 VPCGLDFGRPGFSSFREHRFHPSLEKTRRFYPHVHNMDGFFVAKLKKMSNAMQ 552


>AT5G55920.1 | Symbols: OLI2 | S-adenosyl-L-methionine-dependent
           methyltransferases superfamily protein |
           chr5:22645742-22649383 REVERSE LENGTH=682
          Length = 682

 Score =  427 bits (1098), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 195/236 (82%), Positives = 221/236 (93%)

Query: 1   MAGFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPR 60
           +AG+YMLQ ASSFLPVMALAP+E ER+VD+AAAPGGKTTYIAALMKNTG+I+ANEMKVPR
Sbjct: 337 LAGYYMLQGASSFLPVMALAPRENERIVDVAAAPGGKTTYIAALMKNTGLIYANEMKVPR 396

Query: 61  LKSLTANLHRMGVSNTVVCNYDGKELPKVLGLNTVDRVLLDAPCSGTGVISKDESVKTSK 120
           LKSLTANLHRMGV+NT+VCNYDG+ELPKVLG NTVDRVLLDAPCSGTG+ISKDESVK +K
Sbjct: 397 LKSLTANLHRMGVTNTIVCNYDGRELPKVLGQNTVDRVLLDAPCSGTGIISKDESVKITK 456

Query: 121 DLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYALKRRDVKL 180
            +++I+K A LQK+L+LAAIDMVDANSK+GGY+VYSTCSIMV ENEAVIDYALK+RDVKL
Sbjct: 457 TMDEIKKFAHLQKQLLLAAIDMVDANSKTGGYIVYSTCSIMVTENEAVIDYALKKRDVKL 516

Query: 181 VACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVAKLKKLSNSKQGAK 236
           V CGL+FGR GFT+FRE RF  SLDKTRRFYPHVHNMDGFFVAKLKK+SN KQ ++
Sbjct: 517 VTCGLDFGRKGFTRFREHRFQPSLDKTRRFYPHVHNMDGFFVAKLKKMSNVKQSSE 572


>AT3G13180.1 | Symbols:  | NOL1/NOP2/sun family protein /
           antitermination NusB domain-containing protein |
           chr3:4236326-4239966 REVERSE LENGTH=523
          Length = 523

 Score =  112 bits (279), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/228 (36%), Positives = 122/228 (53%), Gaps = 23/228 (10%)

Query: 3   GFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPRLK 62
           G   +Q  S+ L V  + PQ  ER++D  AAPGGKT ++A+ +K  G+I+A ++   RL+
Sbjct: 310 GICSVQDESAGLIVSVVKPQPGERIMDACAAPGGKTLFMASCLKGQGMIYAMDVNEGRLR 369

Query: 63  SL--TANLHRM-GVSNTVVCNYDGKELPKVLGLNTV--DRVLLDAPCSGTGVISKDESVK 117
            L  TA  H++ G+  T+       +L      N V  D+VLLDAPCSG GV+SK   ++
Sbjct: 370 ILGETAKSHQVDGLITTI-----HSDLRVFAETNEVQYDKVLLDAPCSGLGVLSKRADLR 424

Query: 118 TSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYALKRRD 177
            ++ LED+ +  +LQ EL+ +A  +V    K GG LVYSTCSI   ENE  ++  L R  
Sbjct: 425 WNRKLEDMLELTKLQDELLDSASKLV----KHGGVLVYSTCSIDPEENEGRVEAFLLRHP 480

Query: 178 VKLVACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVAKL 225
              +    +F    F     Q F  S        P  H++DG F A+L
Sbjct: 481 EFTIDPVTSFVPSSFVT--SQGFFLS-------NPVKHSLDGAFAARL 519


>AT5G26180.2 | Symbols:  | S-adenosyl-L-methionine-dependent
           methyltransferases superfamily protein |
           chr5:9149253-9152595 FORWARD LENGTH=567
          Length = 567

 Score = 90.9 bits (224), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 79/251 (31%), Positives = 106/251 (42%), Gaps = 28/251 (11%)

Query: 3   GFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPRLK 62
           G   LQ  +S +   AL PQ    V+D  +APG KT ++AALM+  G I A E+   R+K
Sbjct: 284 GRIFLQGKASSMVAAALQPQAGWEVLDACSAPGNKTIHLAALMEGQGKIIACELNEERVK 343

Query: 63  SLTANLHRMGVSNTVVCNYDGKEL-PKVLGLNTVDRVLLDAPCSGTGVI---------SK 112
            L   +   G SN  VC+ D   L PK      +  +LLD  CSG+G I         S 
Sbjct: 344 RLEHTIKLSGASNIEVCHGDFLGLNPKDPSFAKIRAILLDPSCSGSGTITDRLDHLLPSH 403

Query: 113 DESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYA 172
            E    + D   + K A  QK+ +  A+            +VYSTCSI   ENE V+   
Sbjct: 404 SEDNNMNYDSMRLHKLAVFQKKALAHALSFPKVER-----VVYSTCSIYQIENEDVVSSV 458

Query: 173 LKRRD---VKLVACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVA---KLK 226
           L        KL      + R G   F        +D        V + +GFF+A   +  
Sbjct: 459 LPLASSLGFKLATPFPQWQRRGLPVFAGSEHLLRMDP-------VEDKEGFFIALFVRAN 511

Query: 227 KLSNSKQGAKP 237
           KL N K    P
Sbjct: 512 KLDNPKSSELP 522


>AT5G26180.1 | Symbols:  | S-adenosyl-L-methionine-dependent
           methyltransferases superfamily protein |
           chr5:9149253-9152595 FORWARD LENGTH=567
          Length = 567

 Score = 90.9 bits (224), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 79/251 (31%), Positives = 106/251 (42%), Gaps = 28/251 (11%)

Query: 3   GFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEMKVPRLK 62
           G   LQ  +S +   AL PQ    V+D  +APG KT ++AALM+  G I A E+   R+K
Sbjct: 284 GRIFLQGKASSMVAAALQPQAGWEVLDACSAPGNKTIHLAALMEGQGKIIACELNEERVK 343

Query: 63  SLTANLHRMGVSNTVVCNYDGKEL-PKVLGLNTVDRVLLDAPCSGTGVI---------SK 112
            L   +   G SN  VC+ D   L PK      +  +LLD  CSG+G I         S 
Sbjct: 344 RLEHTIKLSGASNIEVCHGDFLGLNPKDPSFAKIRAILLDPSCSGSGTITDRLDHLLPSH 403

Query: 113 DESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMVAENEAVIDYA 172
            E    + D   + K A  QK+ +  A+            +VYSTCSI   ENE V+   
Sbjct: 404 SEDNNMNYDSMRLHKLAVFQKKALAHALSFPKVER-----VVYSTCSIYQIENEDVVSSV 458

Query: 173 LKRRD---VKLVACGLNFGRPGFTKFREQRFHRSLDKTRRFYPHVHNMDGFFVA---KLK 226
           L        KL      + R G   F        +D        V + +GFF+A   +  
Sbjct: 459 LPLASSLGFKLATPFPQWQRRGLPVFAGSEHLLRMDP-------VEDKEGFFIALFVRAN 511

Query: 227 KLSNSKQGAKP 237
           KL N K    P
Sbjct: 512 KLDNPKSSELP 522


>AT4G17590.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Bacterial Fmu
           (Sun)/eukaryotic nucleolar NOL1/Nop2p
           (InterPro:IPR001678); BEST Arabidopsis thaliana protein
           match is: S-adenosyl-L-methionine-dependent
           methyltransferases superfamily protein
           (TAIR:AT4G26600.1); Has 459 Blast hits to 368 proteins
           in 186 species: Archae - 2; Bacteria - 0; Metazoa - 137;
           Fungi - 144; Plants - 100; Viruses - 0; Other Eukaryotes
           - 76 (source: NCBI BLink). | chr4:9800843-9802591
           REVERSE LENGTH=201
          Length = 201

 Score = 79.0 bits (193), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 64/98 (65%), Gaps = 4/98 (4%)

Query: 49  GIIFANEMKVPRLKSLTANLHRMGVSNTVVCNYD-GKELPKVLGLNTVDRVLLDAPCSGT 107
           GIIFAN      L SL ANLHRMG++NTVV NY+   +L +V  +N+ D VL++AP + T
Sbjct: 55  GIIFANASTEHLLGSLYANLHRMGITNTVVSNYNINTKLSRVFHINSKDMVLVNAPSTRT 114

Query: 108 GVISKDESVKTSKDLE-DIQKCAQLQKELILAAIDMVD 144
           G+IS+  S+K S + E DIQ+   LQK  I+  + M D
Sbjct: 115 GLISEFGSIKMSINEEADIQRFGVLQK--IVEGVHMSD 150


>AT4G17590.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           cellular_component unknown; CONTAINS InterPro DOMAIN/s:
           Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
           (InterPro:IPR001678); BEST Arabidopsis thaliana protein
           match is: S-adenosyl-L-methionine-dependent
           methyltransferases superfamily protein
           (TAIR:AT4G26600.1); Has 362 Blast hits to 362 proteins
           in 185 species: Archae - 2; Bacteria - 0; Metazoa - 115;
           Fungi - 130; Plants - 51; Viruses - 0; Other Eukaryotes
           - 64 (source: NCBI BLink). | chr4:9800843-9802591
           REVERSE LENGTH=187
          Length = 187

 Score = 78.6 bits (192), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 64/98 (65%), Gaps = 4/98 (4%)

Query: 49  GIIFANEMKVPRLKSLTANLHRMGVSNTVVCNYD-GKELPKVLGLNTVDRVLLDAPCSGT 107
           GIIFAN      L SL ANLHRMG++NTVV NY+   +L +V  +N+ D VL++AP + T
Sbjct: 55  GIIFANASTEHLLGSLYANLHRMGITNTVVSNYNINTKLSRVFHINSKDMVLVNAPSTRT 114

Query: 108 GVISKDESVKTSKDLE-DIQKCAQLQKELILAAIDMVD 144
           G+IS+  S+K S + E DIQ+   LQK  I+  + M D
Sbjct: 115 GLISEFGSIKMSINEEADIQRFGVLQK--IVEGVHMSD 150


>AT2G22400.1 | Symbols:  | S-adenosyl-L-methionine-dependent
           methyltransferases superfamily protein |
           chr2:9504823-9508788 REVERSE LENGTH=808
          Length = 808

 Score = 78.2 bits (191), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 88/193 (45%), Gaps = 29/193 (15%)

Query: 8   QSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKN--------TGIIFANEMKVP 59
           Q A S +P + L       V+DM AAPG KT  +  ++           G++ AN++   
Sbjct: 170 QEAVSMVPPLFLDVHPDHFVLDMCAAPGSKTFQLLEIIHEASEPGSLPNGLVVANDVDFK 229

Query: 60  RLKSLTANLHRMGVSNTVVCNYDGKELP---------------KVLGLNTV--DRVLLDA 102
           R   L     RM  SN +V N++G++ P               + + +N +  DRVL D 
Sbjct: 230 RSNLLIHQTKRMCTSNLIVTNHEGQQFPGCRLNKSRASEKGISENMPINQLAFDRVLCDV 289

Query: 103 PCSGTGVISKDESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYSTCSIMV 162
           PCSG G + K   +    +         LQ  L +  + ++    K GG ++YSTCS+  
Sbjct: 290 PCSGDGTLRKAPDIWRKWNSGMGNGLHSLQIILAMRGLSLL----KVGGKMIYSTCSMNP 345

Query: 163 AENEAVIDYALKR 175
            E+EAV+   L+R
Sbjct: 346 VEDEAVVAEILRR 358


>AT4G40000.1 | Symbols:  | S-adenosyl-L-methionine-dependent
           methyltransferases superfamily protein |
           chr4:18543989-18547443 REVERSE LENGTH=783
          Length = 783

 Score = 77.8 bits (190), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 64/225 (28%), Positives = 96/225 (42%), Gaps = 34/225 (15%)

Query: 2   AGFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNT--------GIIFA 53
           AG    Q + S +P + L       V+DM AAPG KT  +  ++  +        G++ A
Sbjct: 157 AGNMTRQESVSMVPPLFLDVHPDHFVLDMCAAPGSKTFQLLEIIHESSEPGSLPNGMVVA 216

Query: 54  NEMKVPRLKSLTANLHRMGVSNTVVCNYDGKELPKVLGLNTV----------------DR 97
           N++   R   L     R   +N +V N +G+  P      T+                DR
Sbjct: 217 NDVDYKRSNLLIHQTKRTCTTNLMVTNNEGQHFPSCNTKRTLSVASETNPHPIDQLLFDR 276

Query: 98  VLLDAPCSGTGVISKDESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGYLVYST 157
           VL D PCSG G + K   +    +         LQ  L +  + ++    K GG +VYST
Sbjct: 277 VLCDVPCSGDGTLRKAPDIWRRWNSGSGNGLHSLQVVLAMRGLSLL----KVGGRMVYST 332

Query: 158 CSIMVAENEAVIDYALKR--RDVKLVACGLNFG----RPGFTKFR 196
           CS+   E+EAV+   L+R    V+LV           RPG TK++
Sbjct: 333 CSMNPIEDEAVVAEILRRCGCSVELVDVSDKLPELIRRPGLTKWK 377


>AT1G06560.1 | Symbols:  | NOL1/NOP2/sun family protein |
           chr1:2007660-2011824 FORWARD LENGTH=599
          Length = 599

 Score = 62.8 bits (151), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 72/143 (50%), Gaps = 16/143 (11%)

Query: 93  NTVDRVLLDAPCSGTGVISKDESVKTSKDLEDIQKCAQLQKELILAAIDMVDANSKSGGY 152
           N+ DRVLLDAPCS  G+  +       + +  ++     Q++++  A+ +V    + GG 
Sbjct: 457 NSFDRVLLDAPCSALGL--RPRLFAGLETVVSLRNHGWYQRKMLDQAVQLV----RVGGI 510

Query: 153 LVYSTCSIMVAENEAVIDYAL-KRRDVKLVACGLNFGRPGFT---KFREQRFHRSL---- 204
           LVYSTC+I  +ENEAV+ YAL K R + L       G PG     +F +      L    
Sbjct: 511 LVYSTCTINPSENEAVVRYALDKYRFLSLAPQHPRIGGPGLVGRCEFPDGYIEEWLKPGE 570

Query: 205 -DKTRRFYPHVH-NMDGFFVAKL 225
            +  ++F P    +  GFF+AK 
Sbjct: 571 EELVQKFDPSSELDTIGFFIAKF 593



 Score = 58.2 bits (139), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 55/101 (54%), Gaps = 9/101 (8%)

Query: 1   MAGFYMLQSASSFLPVMALAPQEKERVVDMAAAPGGKTTYIAALMKNTGIIFANEM---K 57
           + G   LQ+  S +   AL PQ+ ER++DM AAPGGKTT IA LM + G I A +    K
Sbjct: 274 LEGEIFLQNLPSIIVAHALDPQKGERILDMCAAPGGKTTAIAILMNDEGEIVAADRSHNK 333

Query: 58  VPRLKSLTANLHRMGVSNTVVCNYDGKE---LPKVLGLNTV 95
           V  +++L+A    MG +    C  D  +   LP  L  +T+
Sbjct: 334 VLVVQNLSA---EMGFTCITTCKLDALKSVCLPTTLNESTI 371
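

Note on reading the hit table: the rows under "Sequences producing significant alignments" end in two numeric columns, the bit score and the E-value. This report prints very small E-values without a mantissa (for example "e-120"), which Python's float() does not accept as written. The following is a minimal sketch, not part of the BLAST output, of one way to pull hit identifier, score and E-value out of those rows. The function names parse_evalue and parse_summary are hypothetical, and the sketch assumes the row layout shown in this report (identifier first, "|"-separated description, then score and E-value).

```python
def parse_evalue(text):
    # Assumption: a bare exponent such as "e-120" means 1e-120,
    # so a leading "1" is added before conversion.
    return float("1" + text if text.startswith("e") else text)


def parse_summary(lines):
    """Return (hit_id, bit_score, e_value) tuples from summary-table rows."""
    hits = []
    for line in lines:
        # Split off the last two whitespace-separated fields.
        parts = line.rsplit(None, 2)
        if len(parts) != 3 or "|" not in parts[0]:
            continue  # not a summary row
        try:
            score = float(parts[1])
            e_value = parse_evalue(parts[2])
        except ValueError:
            continue  # e.g. alignment header lines starting with ">"
        hits.append((parts[0].split()[0], score, e_value))
    return hits


rows = [
    "AT4G26600.1 | Symbols:  | S-adenosyl-L-methionine-dependent meth...   427   e-120",
    "AT3G13180.1 | Symbols:  | NOL1/NOP2/sun family protein / antiter...   112   3e-25",
]
hits = parse_summary(rows)
```

Lines that are not table rows, such as the ">" alignment headers or the "chr4:... LENGTH=671" lines, are skipped because their last two fields are not both numeric or because the leading part has no "|" separator.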