Miyakogusa Predicted Gene

Lj0g3v0206149.2
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0206149.2 tr|H2BPZ4|H2BPZ4_LOTJA Sterol glucosyltransferase
1 OS=Lotus japonicus GN=SGT1 PE=4 SV=1,98.38,0,no description,NULL;
seg,NULL; GLYCOSYLTRANSFERASE FAMILY PROTEIN,NULL;
GLUCOSYL/GLUCURONOSYL TRANSF,CUFF.13202.2
         (309 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G07020.2 | Symbols:  | UDP-Glycosyltransferase superfamily pr...   494   e-140
AT3G07020.1 | Symbols:  | UDP-Glycosyltransferase superfamily pr...   490   e-139
AT1G43620.3 | Symbols: UGT80B1 | UDP-Glycosyltransferase superfa...   344   4e-95
AT1G43620.2 | Symbols: UGT80B1 | UDP-Glycosyltransferase superfa...   344   4e-95
AT1G43620.1 | Symbols: TT15, UGT80B1 | UDP-Glycosyltransferase s...   344   4e-95
AT5G24750.1 | Symbols:  | UDP-Glycosyltransferase superfamily pr...    63   3e-10

>AT3G07020.2 | Symbols:  | UDP-Glycosyltransferase superfamily
           protein | chr3:2218120-2221590 REVERSE LENGTH=637
          Length = 637

 Score =  494 bits (1273), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 236/309 (76%), Positives = 258/309 (83%), Gaps = 1/309 (0%)

Query: 1   MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
           MPWTPT+EFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDM+NDLRKK+LKLRPVTYLSG+
Sbjct: 330 MPWTPTSEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGT 389

Query: 61  QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
           QGS ++IPH Y+WSPHLVPKPKDWGP+IDVVGFC+LDLASN+EPP  LV+WLE GDKPIY
Sbjct: 390 QGSGSNIPHGYMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIY 449

Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLAEPKDNIYLLDNIPHDWLF 180
           IGFGSLPVQEP+KMTEIIVEAL+ T QRGIINKGWGGLGNL EPKD +YLLDN+PHDWLF
Sbjct: 450 IGFGSLPVQEPEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLF 509

Query: 181 LHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLPK 240
             CK              LKA+CPTTIVPFFGDQPFWG+RVH RGVGP PIPVDEFSL K
Sbjct: 510 PRCKAVVHHGGAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHK 569

Query: 241 LVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLPQTRNKTEPDQQPLPSSVF 300
           L +AINFMLD KVK  A  LAKAM++EDGV GAVKAFFK LP  +     D  P PS   
Sbjct: 570 LEDAINFMLDDKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNIS-DPIPEPSGFL 628

Query: 301 SISRCFGCS 309
           S  +CFGCS
Sbjct: 629 SFRKCFGCS 637


>AT3G07020.1 | Symbols:  | UDP-Glycosyltransferase superfamily
           protein | chr3:2217841-2221590 REVERSE LENGTH=637
          Length = 637

 Score =  490 bits (1262), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 235/309 (76%), Positives = 257/309 (83%), Gaps = 1/309 (0%)

Query: 1   MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
           MPWTPT+EFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDM+NDLRKK+LKLRPVTYLSG+
Sbjct: 330 MPWTPTSEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGT 389

Query: 61  QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
           QGS ++IPH Y+WSPHLVPKPKDWGP+IDVVGFC+LDLASN+EPP  LV+WLE GDKPIY
Sbjct: 390 QGSGSNIPHGYMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIY 449

Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLAEPKDNIYLLDNIPHDWLF 180
           IGFGSLPVQEP+KMTEIIVEAL+ T QRGIINKGWGGLGNL EPKD +YLLDN+PHDWLF
Sbjct: 450 IGFGSLPVQEPEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLF 509

Query: 181 LHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLPK 240
             CK              LKA+CPTTIVPFFGDQPFWG+RVH RGVGP PIPVDEFSL K
Sbjct: 510 PRCKAVVHHGGAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHK 569

Query: 241 LVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLPQTRNKTEPDQQPLPSSVF 300
           L +AINFMLD KVK  A  LAKAM++EDGV GAVKAFFK LP  +     D  P PS   
Sbjct: 570 LEDAINFMLDDKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNIS-DPIPEPSGFL 628

Query: 301 SISRCFGCS 309
           S  +CFG S
Sbjct: 629 SFRKCFGFS 637


>AT1G43620.3 | Symbols: UGT80B1 | UDP-Glycosyltransferase
           superfamily protein | chr1:16425654-16429500 REVERSE
           LENGTH=615
          Length = 615

 Score =  344 bits (883), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 154/283 (54%), Positives = 204/283 (72%), Gaps = 1/283 (0%)

Query: 1   MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
           MPWTPT EFPHPL+RV Q A Y LSY +VD ++W  IR  IND RK++L L P+ Y S  
Sbjct: 293 MPWTPTNEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTY 352

Query: 61  QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
            GS + +P  Y+WSPH+VPKP DWGP +DVVG+CFL+L S ++P E  + W+E G  P+Y
Sbjct: 353 HGSISHLPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVY 412

Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLA-EPKDNIYLLDNIPHDWL 179
           IGFGS+P+ +PK+  +II+E L+ T QRGI+++GWGGLGNLA E  +N++L+++ PHDWL
Sbjct: 413 IGFGSMPLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWL 472

Query: 180 FLHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLP 239
           F  C               LKA CPTTIVPFFGDQ FWGDR++++G+GP PIP+ + S+ 
Sbjct: 473 FPQCSAVVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVE 532

Query: 240 KLVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLP 282
            L ++I FML P+VK + +ELAK +ENEDGV  AV AF + LP
Sbjct: 533 NLSSSIRFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHLP 575


>AT1G43620.2 | Symbols: UGT80B1 | UDP-Glycosyltransferase
           superfamily protein | chr1:16425654-16429500 REVERSE
           LENGTH=615
          Length = 615

 Score =  344 bits (883), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 154/283 (54%), Positives = 204/283 (72%), Gaps = 1/283 (0%)

Query: 1   MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
           MPWTPT EFPHPL+RV Q A Y LSY +VD ++W  IR  IND RK++L L P+ Y S  
Sbjct: 293 MPWTPTNEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTY 352

Query: 61  QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
            GS + +P  Y+WSPH+VPKP DWGP +DVVG+CFL+L S ++P E  + W+E G  P+Y
Sbjct: 353 HGSISHLPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVY 412

Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLA-EPKDNIYLLDNIPHDWL 179
           IGFGS+P+ +PK+  +II+E L+ T QRGI+++GWGGLGNLA E  +N++L+++ PHDWL
Sbjct: 413 IGFGSMPLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWL 472

Query: 180 FLHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLP 239
           F  C               LKA CPTTIVPFFGDQ FWGDR++++G+GP PIP+ + S+ 
Sbjct: 473 FPQCSAVVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVE 532

Query: 240 KLVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLP 282
            L ++I FML P+VK + +ELAK +ENEDGV  AV AF + LP
Sbjct: 533 NLSSSIRFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHLP 575


>AT1G43620.1 | Symbols: TT15, UGT80B1 | UDP-Glycosyltransferase
           superfamily protein | chr1:16425654-16429500 REVERSE
           LENGTH=615
          Length = 615

 Score =  344 bits (883), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 154/283 (54%), Positives = 204/283 (72%), Gaps = 1/283 (0%)

Query: 1   MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
           MPWTPT EFPHPL+RV Q A Y LSY +VD ++W  IR  IND RK++L L P+ Y S  
Sbjct: 293 MPWTPTNEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTY 352

Query: 61  QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
            GS + +P  Y+WSPH+VPKP DWGP +DVVG+CFL+L S ++P E  + W+E G  P+Y
Sbjct: 353 HGSISHLPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVY 412

Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLA-EPKDNIYLLDNIPHDWL 179
           IGFGS+P+ +PK+  +II+E L+ T QRGI+++GWGGLGNLA E  +N++L+++ PHDWL
Sbjct: 413 IGFGSMPLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWL 472

Query: 180 FLHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLP 239
           F  C               LKA CPTTIVPFFGDQ FWGDR++++G+GP PIP+ + S+ 
Sbjct: 473 FPQCSAVVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVE 532

Query: 240 KLVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLP 282
            L ++I FML P+VK + +ELAK +ENEDGV  AV AF + LP
Sbjct: 533 NLSSSIRFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHLP 575


>AT5G24750.1 | Symbols:  | UDP-Glycosyltransferase superfamily
           protein | chr5:8490821-8494536 REVERSE LENGTH=520
          Length = 520

 Score = 62.8 bits (151), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/275 (22%), Positives = 101/275 (36%), Gaps = 65/275 (23%)

Query: 71  YIWSPHLVPKPKDWGPKIDVVGFCFL-------------------------DLASNFEPP 105
           Y +S  +V  P  W   + V GF FL                            SN    
Sbjct: 237 YGFSKEIVECPDYWPLSVRVCGFWFLPNEWQFSCNECGDNPFAGRLGTDDSHTCSNHTEL 296

Query: 106 ETLVKWLEDGDKPIYIGFGSLP----VQEPKKMTEIIVEALETTGQRGIINKGWGG---- 157
            T +   E    PI++G  S+     V++P     ++   ++ TG R II     G    
Sbjct: 297 YTFISSCEPA-LPIFVGLSSVGSMGFVRDPIAFLRVLQSVIQITGYRFIIFTASYGPLDA 355

Query: 158 -LGNLAEPKDN---------IYLLDN--------IPHDWLFLHCKXXXXXXXXXXXXXXL 199
            +  +A   D+         I + +         +P++W+F  C               L
Sbjct: 356 AIRTIANGSDSSEKQPLHAGISIFNGKLFCFSGMVPYNWMFRTCAAAIHHGGSGSVAAAL 415

Query: 200 KAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLPK-------------LVNAIN 246
           +A  P  I PF  DQ +W +++   GV P P+  +   L               +  AI 
Sbjct: 416 QAGIPQIICPFMLDQFYWAEKMSWLGVAPQPLKRNHLLLEDSNDEKNITEAAQVVAKAIY 475

Query: 247 FMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQL 281
             L  K + RA+E+A+ +  EDGVT AV+   +++
Sbjct: 476 DALSAKTRARAMEIAEILSLEDGVTEAVRVLREEV 510