Miyakogusa Predicted Gene
- Lj3g3v2248290.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2248290.1 Non Chatacterized Hit- tr|B9R7Z9|B9R7Z9_RICCO
Glycosyltransferase, putative OS=Ricinus communis
GN=R,45.74,2e-18,Glycos_transf_1,Glycosyl transferase, family 1;
UDP-Glycosyltransferase/glycogen phosphorylase,NULL;,CUFF.43750.1
(671 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G15940.2 | Symbols: | UDP-Glycosyltransferase superfamily pr... 615 e-176
AT3G15940.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 615 e-176
AT1G52420.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 576 e-164
AT1G19710.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 100 4e-21
AT1G75420.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 96 7e-20
AT5G01220.1 | Symbols: SQD2 | sulfoquinovosyldiacylglycerol 2 | ... 54 4e-07
>AT3G15940.2 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr3:5393632-5396187 REVERSE LENGTH=697
Length = 697
Score = 615 bits (1585), Expect = e-176, Method: Compositional matrix adjust.
Identities = 342/664 (51%), Positives = 430/664 (64%), Gaps = 41/664 (6%)
Query: 38 TPQYSPSFQRLQSNFSPRREGRG---GVKWFGRKRNXXXXXXXXITFWAYLVFFVQSKWA 94
TP+ SP+ +++ S +PRREG+G V+WF + N IT W YL F+VQS+WA
Sbjct: 23 TPRGSPTLRKVHSGRTPRREGKGSGGAVQWF--RSNRLLYWLLLITLWTYLGFYVQSRWA 80
Query: 95 HGDKEE-EFSGFGRKSSD----FEQNQHQDLVPQNTSLSFINNKTVESMVEIGDTVVVAL 149
H D + EF FG K + EQN+ +DLV +S + +++ + + + + V L
Sbjct: 81 HDDDNKVEFLRFGGKLREDVLHVEQNKRRDLVADESSHAVVDHTNIVHL-GVNKRMHVTL 139
Query: 150 AKKENSVPXXXXXXXXXXXXXXXXXXXXXXX-QKV----EISGIEES--EIPLRNTTYGF 202
AKKE+S QKV E ++E E+P N TYG
Sbjct: 140 AKKEDSTSRRSVSPRRRTRKASRSSRTRIRSTQKVRKVMETKELDEQDQELPNINVTYGK 199
Query: 203 LVGPFDSTEDRILQWGPEKRFGTCDRNGEFAXXXXXXXXXXXXHELSLTGAPXXXXXXXX 262
L GPF S EDRIL+W P+KR GTCDR +F HELS+TGAP
Sbjct: 200 LFGPFGSLEDRILEWSPQKRSGTCDRKSDFKRLVWSRRFVLLFHELSMTGAPISMMELAS 259
Query: 263 XXXXCGATVSAVVLSKKGGLMQELVRRQIKVIDDKANRSFRTAMKADLVIAGSAVCASWI 322
CGATV AVVLS++GGL+QEL RR+IKV++DK SF+TAMKADLVIAGSAVCASWI
Sbjct: 260 ELLSCGATVYAVVLSRRGGLLQELTRRRIKVVEDKGELSFKTAMKADLVIAGSAVCASWI 319
Query: 323 EQYIEHFSSGASQVVWWIMENRREYFDRSKDVLHRVKMLVFLSESQSKQWQNWCAEEIIE 382
+QY++H +G SQ+ WW+MENRREYFDR+K VL RVK+L+FLSE QSKQW WC E+ ++
Sbjct: 320 DQYMDHHPAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFLSEVQSKQWLTWCEEDHVK 379
Query: 383 LRSQITVVPLSVNEELAFVAGIHSTHNGSSFSAEKMDEKRKLLRDSVRREMGLNDNDMLV 442
LRSQ +VPLSVN+ELAFVAG+ S+ N + + E M EKR+ LR+SVR E GL D DMLV
Sbjct: 380 LRSQPVIVPLSVNDELAFVAGVSSSLNTPTLTQETMKEKRQKLRESVRTEFGLTDKDMLV 439
Query: 443 LSLSSINPGKGQLLLLESASSIVYHGPLPNDKKMQKSLEAEEHSSTLIRKHHNRKLLPLL 502
+SLSSINPGKGQLLLLES + L +++ + A+ + S +I+ + + +
Sbjct: 440 MSLSSINPGKGQLLLLESVA-------LALEREQTQEQVAKRNQSKIIKNLNGIRKEKIS 492
Query: 503 EDSRVALTNISRN---------------STNGTRKVVLSRNNGTMEHSLKILIGSVGSKS 547
+R L SR S G RK++LS N T + LK+L+GSVGSKS
Sbjct: 493 LSARHRLRGSSRKMKITSPAVDNHPSVLSATGRRKLLLS-GNVTQKQDLKLLLGSVGSKS 551
Query: 548 NKVDYVKGLLSFLEQHSNLSKEVLWTPATTRVASLYSAADVYIINSQGLGETFGRVTIEA 607
NKV YVK +LSFL + NLS VLWTPATTRVASLYSAADVY+ NSQG+GETFGRVTIEA
Sbjct: 552 NKVAYVKEMLSFLSNNGNLSNSVLWTPATTRVASLYSAADVYVTNSQGVGETFGRVTIEA 611
Query: 608 MAFGLPVLGTDAGGTLEIVEHNVTGLLHPIGRPGNRVLAQNLRFLLENRLAREQMGMNGR 667
MA+GLPVLGTDAGGT EIVEHNVTGLLHP+GR GN+VLAQNL FLL N R Q+G GR
Sbjct: 612 MAYGLPVLGTDAGGTKEIVEHNVTGLLHPVGRAGNKVLAQNLLFLLRNPSTRLQLGSQGR 671
Query: 668 KKVQ 671
+ V+
Sbjct: 672 EIVE 675
>AT3G15940.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr3:5393632-5396187 REVERSE LENGTH=697
Length = 697
Score = 615 bits (1585), Expect = e-176, Method: Compositional matrix adjust.
Identities = 342/664 (51%), Positives = 430/664 (64%), Gaps = 41/664 (6%)
Query: 38 TPQYSPSFQRLQSNFSPRREGRG---GVKWFGRKRNXXXXXXXXITFWAYLVFFVQSKWA 94
TP+ SP+ +++ S +PRREG+G V+WF + N IT W YL F+VQS+WA
Sbjct: 23 TPRGSPTLRKVHSGRTPRREGKGSGGAVQWF--RSNRLLYWLLLITLWTYLGFYVQSRWA 80
Query: 95 HGDKEE-EFSGFGRKSSD----FEQNQHQDLVPQNTSLSFINNKTVESMVEIGDTVVVAL 149
H D + EF FG K + EQN+ +DLV +S + +++ + + + + V L
Sbjct: 81 HDDDNKVEFLRFGGKLREDVLHVEQNKRRDLVADESSHAVVDHTNIVHL-GVNKRMHVTL 139
Query: 150 AKKENSVPXXXXXXXXXXXXXXXXXXXXXXX-QKV----EISGIEES--EIPLRNTTYGF 202
AKKE+S QKV E ++E E+P N TYG
Sbjct: 140 AKKEDSTSRRSVSPRRRTRKASRSSRTRIRSTQKVRKVMETKELDEQDQELPNINVTYGK 199
Query: 203 LVGPFDSTEDRILQWGPEKRFGTCDRNGEFAXXXXXXXXXXXXHELSLTGAPXXXXXXXX 262
L GPF S EDRIL+W P+KR GTCDR +F HELS+TGAP
Sbjct: 200 LFGPFGSLEDRILEWSPQKRSGTCDRKSDFKRLVWSRRFVLLFHELSMTGAPISMMELAS 259
Query: 263 XXXXCGATVSAVVLSKKGGLMQELVRRQIKVIDDKANRSFRTAMKADLVIAGSAVCASWI 322
CGATV AVVLS++GGL+QEL RR+IKV++DK SF+TAMKADLVIAGSAVCASWI
Sbjct: 260 ELLSCGATVYAVVLSRRGGLLQELTRRRIKVVEDKGELSFKTAMKADLVIAGSAVCASWI 319
Query: 323 EQYIEHFSSGASQVVWWIMENRREYFDRSKDVLHRVKMLVFLSESQSKQWQNWCAEEIIE 382
+QY++H +G SQ+ WW+MENRREYFDR+K VL RVK+L+FLSE QSKQW WC E+ ++
Sbjct: 320 DQYMDHHPAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFLSEVQSKQWLTWCEEDHVK 379
Query: 383 LRSQITVVPLSVNEELAFVAGIHSTHNGSSFSAEKMDEKRKLLRDSVRREMGLNDNDMLV 442
LRSQ +VPLSVN+ELAFVAG+ S+ N + + E M EKR+ LR+SVR E GL D DMLV
Sbjct: 380 LRSQPVIVPLSVNDELAFVAGVSSSLNTPTLTQETMKEKRQKLRESVRTEFGLTDKDMLV 439
Query: 443 LSLSSINPGKGQLLLLESASSIVYHGPLPNDKKMQKSLEAEEHSSTLIRKHHNRKLLPLL 502
+SLSSINPGKGQLLLLES + L +++ + A+ + S +I+ + + +
Sbjct: 440 MSLSSINPGKGQLLLLESVA-------LALEREQTQEQVAKRNQSKIIKNLNGIRKEKIS 492
Query: 503 EDSRVALTNISRN---------------STNGTRKVVLSRNNGTMEHSLKILIGSVGSKS 547
+R L SR S G RK++LS N T + LK+L+GSVGSKS
Sbjct: 493 LSARHRLRGSSRKMKITSPAVDNHPSVLSATGRRKLLLS-GNVTQKQDLKLLLGSVGSKS 551
Query: 548 NKVDYVKGLLSFLEQHSNLSKEVLWTPATTRVASLYSAADVYIINSQGLGETFGRVTIEA 607
NKV YVK +LSFL + NLS VLWTPATTRVASLYSAADVY+ NSQG+GETFGRVTIEA
Sbjct: 552 NKVAYVKEMLSFLSNNGNLSNSVLWTPATTRVASLYSAADVYVTNSQGVGETFGRVTIEA 611
Query: 608 MAFGLPVLGTDAGGTLEIVEHNVTGLLHPIGRPGNRVLAQNLRFLLENRLAREQMGMNGR 667
MA+GLPVLGTDAGGT EIVEHNVTGLLHP+GR GN+VLAQNL FLL N R Q+G GR
Sbjct: 612 MAYGLPVLGTDAGGTKEIVEHNVTGLLHPVGRAGNKVLAQNLLFLLRNPSTRLQLGSQGR 671
Query: 668 KKVQ 671
+ V+
Sbjct: 672 EIVE 675
>AT1G52420.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr1:19528667-19531035 FORWARD LENGTH=670
Length = 670
Score = 576 bits (1485), Expect = e-164, Method: Compositional matrix adjust.
Identities = 324/671 (48%), Positives = 416/671 (61%), Gaps = 48/671 (7%)
Query: 21 FSPXXXXXXXXXXXXXXTPQYSPSFQRLQSNFSPRRE--GRGGVKWFGRKRNXXXXXXXX 78
SP TP+ +P R+ S +PRR G G +WF R
Sbjct: 6 LSPLRQTSVKSSLSGRSTPRGTP---RVYSGRTPRRGHGGGGAFQWFRSSR--LVYWLLL 60
Query: 79 ITFWAYLVFFVQSKWAHGDKEE-EFSGF-GRKSSD---FEQNQHQDLVPQNTSLSFINNK 133
IT W YL F+VQS+WAH ++ + EF F GR D E+ + D+V S + + N
Sbjct: 61 ITLWTYLGFYVQSRWAHDNESKVEFLRFGGRPRKDELYMEKIKGLDVVANENSEALV-NI 119
Query: 134 TVESMVEIGDTVVVALAKKENSVPXXXXXXXXXXXXXXXXXXXXXXXQKVEISGIEES-- 191
T + V+L KK++ V ++ I + E+
Sbjct: 120 TGKDDAGSNKRTDVSLIKKDDGVSRRSLSSKQKTRKTVRTSRSKIRVKQKVIKEVLETKD 179
Query: 192 -------EIPLRNTTYGFLVGPFDSTEDRILQWGPEKRFGTCDRNGEFAXXXXXXXXXXX 244
++PL N TYG L+GPF S ED++L+W P +R GTCDR +F
Sbjct: 180 LDDEQDPQLPLTNATYGKLLGPFGSLEDKVLEWSPHRRSGTCDRKSDFKRLVWSRRFVLL 239
Query: 245 XHELSLTGAPXXXXXXXXXXXXCGATVSAVVLSKKGGLMQELVRRQIKVIDDKANRSFRT 304
HELS+TGAP CGATVSAVVLS++GGLMQEL RR+IKV++DK SF+T
Sbjct: 240 FHELSMTGAPISMMELASELLSCGATVSAVVLSRRGGLMQELSRRRIKVVEDKGELSFKT 299
Query: 305 AMKADLVIAGSAVCASWIEQYIEHFSSGASQVVWWIMENRREYFDRSKDVLHRVKMLVFL 364
AMKADL+IAGSAVC SWI+QY+ H +G SQ+ WWIMENRREYFDR+K VL RVKML+FL
Sbjct: 300 AMKADLIIAGSAVCTSWIDQYMNHHPAGGSQIAWWIMENRREYFDRAKPVLDRVKMLIFL 359
Query: 365 SESQSKQWQNWCAEEIIELRSQITVVPLSVNEELAFVAGIHSTHNGSSFSAEKMDEKRKL 424
SESQS+QW WC EE I+LRSQ +VPLSVN+ELAFVAGI S+ N + S EKM KR++
Sbjct: 360 SESQSRQWLTWCEEEHIKLRSQPVIVPLSVNDELAFVAGIPSSLNTPTLSPEKMRVKRQI 419
Query: 425 LRDSVRREMGLNDNDMLVLSLSSINPGKGQLLLLESASSIVYHGPLPNDKKMQKSLEAEE 484
LR+SVR E+G+ D+DMLV+SLSSINP KGQLLLLES + + ++ E++
Sbjct: 420 LRESVRTELGITDSDMLVMSLSSINPTKGQLLLLESIALALS----------ERGQESQR 469
Query: 485 HSSTLIRKHHNRKLLPLLEDSRVALTNIS--RNSTNGTRKVVLSRNNGTM--EHSLKILI 540
+ +IRK +V+L++ R S+ + V L+ +NG + LK+L+
Sbjct: 470 NHKGIIRKE------------KVSLSSKHRLRGSSRQMKSVSLTLDNGLRREKQELKVLL 517
Query: 541 GSVGSKSNKVDYVKGLLSFLEQHSNLSKEVLWTPATTRVASLYSAADVYIINSQGLGETF 600
GSVGSKSNKV YVK +LSFL NLSK V+WTPATTRVASLYSAADVY+ NSQG+GETF
Sbjct: 518 GSVGSKSNKVGYVKEMLSFLSNSGNLSKSVMWTPATTRVASLYSAADVYVTNSQGVGETF 577
Query: 601 GRVTIEAMAFGLPVLGTDAGGTLEIVEHNVTGLLHPIGRPGNRVLAQNLRFLLENRLARE 660
GRVTIEAMA+GL V+GTDAGGT E+V+HN+TGLLH +GR GN+ LA NL +LL N R
Sbjct: 578 GRVTIEAMAYGLAVVGTDAGGTKEMVQHNMTGLLHSMGRSGNKELAHNLLYLLRNPDERL 637
Query: 661 QMGMNGRKKVQ 671
++G GRK V+
Sbjct: 638 RLGSEGRKMVE 648
>AT1G19710.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr1:6814920-6816716 FORWARD LENGTH=479
Length = 479
Score = 100 bits (249), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 55/110 (50%), Positives = 68/110 (61%)
Query: 562 QHSNLSKEVLWTPATTRVASLYSAADVYIINSQGLGETFGRVTIEAMAFGLPVLGTDAGG 621
Q L K V + T +VA +A DV + NSQ GE FGR+TIEAMAF LPVLGT AGG
Sbjct: 336 QEMKLQKIVHFVNKTMKVAPYLAAIDVLVQNSQARGECFGRITIEAMAFKLPVLGTAAGG 395
Query: 622 TLEIVEHNVTGLLHPIGRPGNRVLAQNLRFLLENRLAREQMGMNGRKKVQ 671
T+EIV + TGLLH G+ G LA+N+ L N R MG G ++V+
Sbjct: 396 TMEIVVNRTTGLLHNTGKDGVLPLAKNIVKLATNVKMRNTMGKKGYERVK 445
Score = 68.2 bits (165), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 94/186 (50%), Gaps = 26/186 (13%)
Query: 282 LMQELVRRQIKVIDDKANRSFRTAMKADLVIAGSAVCASWIEQYI-EHFSSGASQVVWWI 340
L +++ R ++VI K+ ++ TA+K+DLV+ +AV W++ + ++ +V+WWI
Sbjct: 127 LEHKMLDRGVQVISAKSQKAIDTALKSDLVVLNTAVAGKWLDAVLKDNVPKVLPKVLWWI 186
Query: 341 MENRREYFDRSKDVLHRVKMLVFL------SESQSKQWQNWCAEEIIELRSQITVVPLSV 394
E R YF VK L F+ S + ++ W+N + + + VV L
Sbjct: 187 HEMRGHYFKPD-----LVKHLPFVAGAMIDSHATAEYWKNRTHDRLGIKMPKTYVVHLGN 241
Query: 395 NEELAFVAGIHSTHNGSSFSAEKMDEKRKLLRDSVRREMGLNDNDMLVLSLSSINPGKGQ 454
++EL VA SF+ + +LR+ VR +G+ + D+L ++S++ GKGQ
Sbjct: 242 SKELMEVA-------EDSFA-------KNVLREQVRESLGVRNEDILFGIINSVSRGKGQ 287
Query: 455 LLLLES 460
L L +
Sbjct: 288 DLFLRA 293
>AT1G75420.1 | Symbols: | UDP-Glycosyltransferase superfamily
protein | chr1:28305469-28307317 FORWARD LENGTH=463
Length = 463
Score = 96.3 bits (238), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 80/134 (59%), Gaps = 3/134 (2%)
Query: 538 ILIGSVGSKSNKVDYVKGLLSFLEQHSNLSKEVLWTPATTRVASLYSAADVYIINSQGLG 597
+++GS SK K + L +F+ + L V + T VA +A DV + NSQ G
Sbjct: 304 VVVGSDMSKQTKFE--TELRNFVRE-KKLENFVHFVNKTLTVAPYIAAIDVLVQNSQARG 360
Query: 598 ETFGRVTIEAMAFGLPVLGTDAGGTLEIVEHNVTGLLHPIGRPGNRVLAQNLRFLLENRL 657
E FGR+TIEAMAF LPVLGT AGGT+EIV + TGLLH G+ G LA+N+ L
Sbjct: 361 ECFGRITIEAMAFKLPVLGTAAGGTMEIVVNGTTGLLHSAGKEGVIPLAKNIVKLATQVE 420
Query: 658 AREQMGMNGRKKVQ 671
R +MG NG ++V+
Sbjct: 421 LRLRMGKNGYERVK 434
Score = 65.9 bits (159), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 99/205 (48%), Gaps = 33/205 (16%)
Query: 282 LMQELVRRQIKVIDDKANRSFRTAMKADLVIAGSAVCASWIEQYI-EHFSSGASQVVWWI 340
L +++ R ++VI K ++ T++KADL++ +AV W++ + E+ +++WWI
Sbjct: 117 LEHKMLDRGVQVISAKGQKAVDTSLKADLIVLNTAVAGKWLDAVLKENVVKVLPKILWWI 176
Query: 341 MENRREYFDRSKDVLHRVKMLVFL------SESQSKQWQNWCAEEIIELRSQITVVPLSV 394
E R YF+ VK L F+ S + + W+N + + VV L
Sbjct: 177 HEMRGHYFNAD-----LVKHLPFVAGAMIDSHATAGYWKNRTQARLGIKMPKTYVVHLGN 231
Query: 395 NEELAFVAGIHSTHNGSSFSAEKMDEKRKLLRDSVRREMGLNDNDMLVLSLSSINPGKGQ 454
++EL V AE KR +LR+ VR +G+ + D+L ++S++ GKGQ
Sbjct: 232 SKELMEV-------------AEDSVAKR-VLREHVRESLGVRNEDLLFGIINSVSRGKGQ 277
Query: 455 LLLLESASSIVYHGPLP--NDKKMQ 477
L L + +H L +KK+Q
Sbjct: 278 DLFLRA-----FHESLERIKEKKLQ 297
>AT5G01220.1 | Symbols: SQD2 | sulfoquinovosyldiacylglycerol 2 |
chr5:86907-89885 REVERSE LENGTH=510
Length = 510
Score = 53.5 bits (127), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 77/163 (47%), Gaps = 22/163 (13%)
Query: 528 NNGTMEHSLKILIGSVGSKSNKVDYVKGLLSFLEQ-----------HSNLSKEVLWTPAT 576
+NG E L I +G +G + + ++ +K ++ L + +L K PA
Sbjct: 302 SNGEPEKPLVIHVGRIGVEKS-LELLKSVMDKLPEARIAFIGDGPYKEDLEKLFTGMPAV 360
Query: 577 -------TRVASLYSAADVYIINSQGLGETFGRVTIEAMAFGLPVLGTDAGGTLEIVEHN 629
++ Y++ DV+++ S+ ET G V +EAM+ GLPV+ AGG +I+ +
Sbjct: 361 FTGTLQGDELSQAYASGDVFVMPSES--ETLGLVVLEAMSSGLPVVAARAGGIPDIIPED 418
Query: 630 VTGLLHPIGRPGN-RVLAQNLRFLLENRLAREQMGMNGRKKVQ 671
G + PG+ LR LL +R RE +G R++ +
Sbjct: 419 QEGKTGFLFNPGDVEDCVTKLRTLLHDRETREIIGKAAREETE 461