Miyakogusa Predicted Gene
- Lj0g3v0206149.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0206149.2 tr|H2BPZ4|H2BPZ4_LOTJA Sterol glucosyltransferase
1 OS=Lotus japonicus GN=SGT1 PE=4 SV=1,98.38,0,no description,NULL;
seg,NULL; GLYCOSYLTRANSFERASE FAMILY PROTEIN,NULL;
GLUCOSYL/GLUCURONOSYL TRANSF,CUFF.13202.2
(309 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G07020.2 | Symbols: | UDP-Glycosyltransferase superfamily pr... 494 e-140
AT3G07020.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 490 e-139
AT1G43620.3 | Symbols: UGT80B1 | UDP-Glycosyltransferase superfa... 344 4e-95
AT1G43620.2 | Symbols: UGT80B1 | UDP-Glycosyltransferase superfa... 344 4e-95
AT1G43620.1 | Symbols: TT15, UGT80B1 | UDP-Glycosyltransferase s... 344 4e-95
AT5G24750.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 63 3e-10
>AT3G07020.2 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr3:2218120-2221590 REVERSE LENGTH=637
Length = 637
Score = 494 bits (1273), Expect = e-140, Method: Compositional matrix adjust.
Identities = 236/309 (76%), Positives = 258/309 (83%), Gaps = 1/309 (0%)
Query: 1 MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
MPWTPT+EFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDM+NDLRKK+LKLRPVTYLSG+
Sbjct: 330 MPWTPTSEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGT 389
Query: 61 QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
QGS ++IPH Y+WSPHLVPKPKDWGP+IDVVGFC+LDLASN+EPP LV+WLE GDKPIY
Sbjct: 390 QGSGSNIPHGYMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIY 449
Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLAEPKDNIYLLDNIPHDWLF 180
IGFGSLPVQEP+KMTEIIVEAL+ T QRGIINKGWGGLGNL EPKD +YLLDN+PHDWLF
Sbjct: 450 IGFGSLPVQEPEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLF 509
Query: 181 LHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLPK 240
CK LKA+CPTTIVPFFGDQPFWG+RVH RGVGP PIPVDEFSL K
Sbjct: 510 PRCKAVVHHGGAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHK 569
Query: 241 LVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLPQTRNKTEPDQQPLPSSVF 300
L +AINFMLD KVK A LAKAM++EDGV GAVKAFFK LP + D P PS
Sbjct: 570 LEDAINFMLDDKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNIS-DPIPEPSGFL 628
Query: 301 SISRCFGCS 309
S +CFGCS
Sbjct: 629 SFRKCFGCS 637
>AT3G07020.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr3:2217841-2221590 REVERSE LENGTH=637
Length = 637
Score = 490 bits (1262), Expect = e-139, Method: Compositional matrix adjust.
Identities = 235/309 (76%), Positives = 257/309 (83%), Gaps = 1/309 (0%)
Query: 1 MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
MPWTPT+EFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDM+NDLRKK+LKLRPVTYLSG+
Sbjct: 330 MPWTPTSEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGT 389
Query: 61 QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
QGS ++IPH Y+WSPHLVPKPKDWGP+IDVVGFC+LDLASN+EPP LV+WLE GDKPIY
Sbjct: 390 QGSGSNIPHGYMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIY 449
Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLAEPKDNIYLLDNIPHDWLF 180
IGFGSLPVQEP+KMTEIIVEAL+ T QRGIINKGWGGLGNL EPKD +YLLDN+PHDWLF
Sbjct: 450 IGFGSLPVQEPEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLF 509
Query: 181 LHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLPK 240
CK LKA+CPTTIVPFFGDQPFWG+RVH RGVGP PIPVDEFSL K
Sbjct: 510 PRCKAVVHHGGAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHK 569
Query: 241 LVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLPQTRNKTEPDQQPLPSSVF 300
L +AINFMLD KVK A LAKAM++EDGV GAVKAFFK LP + D P PS
Sbjct: 570 LEDAINFMLDDKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNIS-DPIPEPSGFL 628
Query: 301 SISRCFGCS 309
S +CFG S
Sbjct: 629 SFRKCFGFS 637
>AT1G43620.3 | Symbols: UGT80B1 | UDP-Glycosyltransferase
superfamily protein | chr1:16425654-16429500 REVERSE
LENGTH=615
Length = 615
Score = 344 bits (883), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 154/283 (54%), Positives = 204/283 (72%), Gaps = 1/283 (0%)
Query: 1 MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
MPWTPT EFPHPL+RV Q A Y LSY +VD ++W IR IND RK++L L P+ Y S
Sbjct: 293 MPWTPTNEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTY 352
Query: 61 QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
GS + +P Y+WSPH+VPKP DWGP +DVVG+CFL+L S ++P E + W+E G P+Y
Sbjct: 353 HGSISHLPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVY 412
Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLA-EPKDNIYLLDNIPHDWL 179
IGFGS+P+ +PK+ +II+E L+ T QRGI+++GWGGLGNLA E +N++L+++ PHDWL
Sbjct: 413 IGFGSMPLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWL 472
Query: 180 FLHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLP 239
F C LKA CPTTIVPFFGDQ FWGDR++++G+GP PIP+ + S+
Sbjct: 473 FPQCSAVVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVE 532
Query: 240 KLVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLP 282
L ++I FML P+VK + +ELAK +ENEDGV AV AF + LP
Sbjct: 533 NLSSSIRFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHLP 575
>AT1G43620.2 | Symbols: UGT80B1 | UDP-Glycosyltransferase
superfamily protein | chr1:16425654-16429500 REVERSE
LENGTH=615
Length = 615
Score = 344 bits (883), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 154/283 (54%), Positives = 204/283 (72%), Gaps = 1/283 (0%)
Query: 1 MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
MPWTPT EFPHPL+RV Q A Y LSY +VD ++W IR IND RK++L L P+ Y S
Sbjct: 293 MPWTPTNEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTY 352
Query: 61 QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
GS + +P Y+WSPH+VPKP DWGP +DVVG+CFL+L S ++P E + W+E G P+Y
Sbjct: 353 HGSISHLPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVY 412
Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLA-EPKDNIYLLDNIPHDWL 179
IGFGS+P+ +PK+ +II+E L+ T QRGI+++GWGGLGNLA E +N++L+++ PHDWL
Sbjct: 413 IGFGSMPLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWL 472
Query: 180 FLHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLP 239
F C LKA CPTTIVPFFGDQ FWGDR++++G+GP PIP+ + S+
Sbjct: 473 FPQCSAVVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVE 532
Query: 240 KLVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLP 282
L ++I FML P+VK + +ELAK +ENEDGV AV AF + LP
Sbjct: 533 NLSSSIRFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHLP 575
>AT1G43620.1 | Symbols: TT15, UGT80B1 | UDP-Glycosyltransferase
superfamily protein | chr1:16425654-16429500 REVERSE
LENGTH=615
Length = 615
Score = 344 bits (883), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 154/283 (54%), Positives = 204/283 (72%), Gaps = 1/283 (0%)
Query: 1 MPWTPTAEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGS 60
MPWTPT EFPHPL+RV Q A Y LSY +VD ++W IR IND RK++L L P+ Y S
Sbjct: 293 MPWTPTNEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTY 352
Query: 61 QGSDTDIPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNFEPPETLVKWLEDGDKPIY 120
GS + +P Y+WSPH+VPKP DWGP +DVVG+CFL+L S ++P E + W+E G P+Y
Sbjct: 353 HGSISHLPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVY 412
Query: 121 IGFGSLPVQEPKKMTEIIVEALETTGQRGIINKGWGGLGNLA-EPKDNIYLLDNIPHDWL 179
IGFGS+P+ +PK+ +II+E L+ T QRGI+++GWGGLGNLA E +N++L+++ PHDWL
Sbjct: 413 IGFGSMPLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWL 472
Query: 180 FLHCKXXXXXXXXXXXXXXLKAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLP 239
F C LKA CPTTIVPFFGDQ FWGDR++++G+GP PIP+ + S+
Sbjct: 473 FPQCSAVVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVE 532
Query: 240 KLVNAINFMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQLP 282
L ++I FML P+VK + +ELAK +ENEDGV AV AF + LP
Sbjct: 533 NLSSSIRFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHLP 575
>AT5G24750.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr5:8490821-8494536 REVERSE LENGTH=520
Length = 520
Score = 62.8 bits (151), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/275 (22%), Positives = 101/275 (36%), Gaps = 65/275 (23%)
Query: 71 YIWSPHLVPKPKDWGPKIDVVGFCFL-------------------------DLASNFEPP 105
Y +S +V P W + V GF FL SN
Sbjct: 237 YGFSKEIVECPDYWPLSVRVCGFWFLPNEWQFSCNECGDNPFAGRLGTDDSHTCSNHTEL 296
Query: 106 ETLVKWLEDGDKPIYIGFGSLP----VQEPKKMTEIIVEALETTGQRGIINKGWGG---- 157
T + E PI++G S+ V++P ++ ++ TG R II G
Sbjct: 297 YTFISSCEPA-LPIFVGLSSVGSMGFVRDPIAFLRVLQSVIQITGYRFIIFTASYGPLDA 355
Query: 158 -LGNLAEPKDN---------IYLLDN--------IPHDWLFLHCKXXXXXXXXXXXXXXL 199
+ +A D+ I + + +P++W+F C L
Sbjct: 356 AIRTIANGSDSSEKQPLHAGISIFNGKLFCFSGMVPYNWMFRTCAAAIHHGGSGSVAAAL 415
Query: 200 KAACPTTIVPFFGDQPFWGDRVHDRGVGPPPIPVDEFSLPK-------------LVNAIN 246
+A P I PF DQ +W +++ GV P P+ + L + AI
Sbjct: 416 QAGIPQIICPFMLDQFYWAEKMSWLGVAPQPLKRNHLLLEDSNDEKNITEAAQVVAKAIY 475
Query: 247 FMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQL 281
L K + RA+E+A+ + EDGVT AV+ +++
Sbjct: 476 DALSAKTRARAMEIAEILSLEDGVTEAVRVLREEV 510