Miyakogusa Predicted Gene
- Lj5g3v1699450.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1699450.1 Non Chatacterized Hit- tr|I1NHQ8|I1NHQ8_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,82.63,0,UDP-Glycosyltransferase/glycogen phosphorylase,NULL;
seg,NULL; coiled-coil,NULL; Glycos_transf_1,Gly,CUFF.55822.1
(1058 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G04480.1 | Symbols: | UDP-Glycosyltransferase superfamily pr... 1248 0.0
AT5G04480.2 | Symbols: | UDP-Glycosyltransferase superfamily pr... 1214 0.0
AT4G01210.1 | Symbols: | glycosyl transferase family 1 protein ... 631 0.0
>AT5G04480.1 | Symbols: | UDP-Glycosyltransferase superfamily protein
| chr5:1271886-1277793 REVERSE LENGTH=1050
Length = 1050
Score = 1248 bits (3230), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 609/1033 (58%), Positives = 746/1033 (72%), Gaps = 17/1033 (1%)
Query: 26 VGFHSIRGRFPFKRNPSHNHHRDRQKSSSDRQLPXXXXXXXXXXXXXHLHNRFTRXXXXX 85
FHSIR R KRN S RDR S DR H R
Sbjct: 33 TSFHSIRDRLRLKRNSSD--RRDRSHSGLDRP---------SLRTRPHHIGRSLNRKGLL 81
Query: 86 XXXXXXXXXXXYAMIISVVFLFALASMVMQSSITSVFRQRTERGRHLREGLKLGSALKFV 145
Y ++ V F ++S+++Q+SIT Q +G +R + LGS LK+V
Sbjct: 82 SLLKPRGTCLLYFLVAFTVCAFVMSSLLLQNSITW---QGNVKGGQVRSQIGLGSTLKYV 138
Query: 146 PGRVSQRFLSGDGLDRVRSQPRIGVRAPRIALILGHMMIDPQSLMLVTLIQNLQKLGYVF 205
PG +++ + G GLD +RS RIGVR PR+AL+LG+M DP++LMLVT+++NLQKLGYVF
Sbjct: 139 PGGIARTLIEGKGLDPLRSAVRIGVRPPRLALVLGNMKKDPRTLMLVTVMKNLQKLGYVF 198
Query: 206 RIFAVGHGNARSIWENIDGGLTPLSTEQQGRIDWSTFEGIIADSLEAKEAISSLMQEPFC 265
++FAV +G ARS+WE + G + L +EQ G DW+ FEG+IADSLEAKEAISSLMQEPF
Sbjct: 199 KVFAVENGEARSLWEQLAGHVKVLVSEQLGHADWTIFEGVIADSLEAKEAISSLMQEPFR 258
Query: 266 SVPLIWIIQEDSLSSRLPVYEQMGWQHLVSHWRSAFSRASVVVFPDFTYPMLYSELDSGN 325
SVPLIWI+ ED L++RLPVY++MG L+SHWRSAF+RA VVVFP FT PML+S LD GN
Sbjct: 259 SVPLIWIVHEDILANRLPVYQRMGQNSLISHWRSAFARADVVVFPQFTLPMLHSVLDDGN 318
Query: 326 FFVIPGSPIDVWAAESYSKTHTKDQLRELSGFGKNDMLVLVVGSSIFYDDLSWDYAVAMH 385
F VIP S +DVWAAESYS+THTK LRE++ FG++D+++LV+GSS FYD+ SWD AVAMH
Sbjct: 319 FVVIPESVVDVWAAESYSETHTKQNLREINEFGEDDVIILVLGSSFFYDEFSWDNAVAMH 378
Query: 386 SLGPLLTKYARRNDDAESFKFVFLCGNSTDGYDDALQEVASRLGLPHGSIRHYGLNGDVN 445
LGPLLT+Y RR D + SFKFVFL GNST G DA+QEVASRLGL G++RH+GLN DVN
Sbjct: 379 MLGPLLTRYGRRKDTSGSFKFVFLYGNSTKGQSDAVQEVASRLGLTEGTVRHFGLNEDVN 438
Query: 446 SVLLMADIILYGSAQDVQGFPPLLIRAMTFEIPVIAPDFPVLKKYIVDGVHGIFFSKHNP 505
VL MADI++Y S+Q+ Q FPPL++RAM+F IP+I PDFP++KKY+ D VHGIFF +++P
Sbjct: 439 RVLRMADILVYASSQEEQNFPPLIVRAMSFGIPIITPDFPIMKKYMADEVHGIFFRRNDP 498
Query: 506 EALMNAFSLLLSSGRLSKFAQAIGLSGRQFAKNMLALDCITGYARLLENVLSFPSDALLP 565
+AL+ AFS L+S GRLSKFAQ I SGR KN++A +CITGYARLLEN+L FPSD LP
Sbjct: 499 DALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFPSDTFLP 558
Query: 566 GPVSQIQQGGWEWNFFQNEIELDIKLRTMDDEISY-GKGTIIHAVEQDLASLNYSTSFLE 624
G +SQ+Q WEWNFF++E+E K +D ++ GK I+ VE+ + ST+ ++
Sbjct: 559 GSISQLQVAAWEWNFFRSELE-QPKSFILDSAYAFIGKSGIVFQVEEKFMGVIESTNPVD 617
Query: 625 NETEVPMQDKLTKLDWDVLRXXXXXXXXXXXXXXXXXXXXXKYVGVWDDIYRNARKSEKL 684
N T + +KLDWDVL + V W++IYRNARKSEKL
Sbjct: 618 NNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDVEDWEEIYRNARKSEKL 677
Query: 685 KFEANERDEGELERTGQPVCIYEIYSGTGAWPFLHHGSLYRGXXXXXXXXXXXXXXVDAV 744
KFE NERDEGELERTG+P+CIYEIY+G GAWPFLHHGSLYRG VDA
Sbjct: 678 KFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRRLSSDDVDAA 737
Query: 745 GRLPLLNDTYYRDILCEMGGLFAIANRVDSIHKRPWIGFQSWHAAGRKVALSAEAERVLE 804
RLPLLNDTYYRDILCE+GG+F++AN+VDSIH RPWIGFQSW AAGRKV+LS++AE LE
Sbjct: 738 DRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLE 797
Query: 805 ETMHENLRGDVIYFWGRLDMDGSIIGSNNALTFWSMCDILNGGNCRTVFQDGFRQMYALP 864
+ + +G++IYFW RLD+DG GS NALTFWSMCDILN GNCRT F+D FR MY LP
Sbjct: 798 NIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLP 857
Query: 865 PHAEALPPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFVDSIDAVHTDSSSKYSMCLLG 924
H EALPPMPEDG +WS+LH+WVMPTPSFLEF+MFSRMF +S+DA+H + + S C L
Sbjct: 858 EHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKS-CSLA 916
Query: 925 SSEIEKKHCYCRVLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNFS 984
SS +E+KHCYCRVLELL+NVWAYHS RKMVYINP GS+EEQHP++QRKG MWAKYFNF+
Sbjct: 917 SSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFT 976
Query: 985 LLKSMXXXXXXXXXXXXHPREVWLWPMTGEVHWQGVXXXXXXXXXXLKMDKKRKTKEKLY 1044
LLKSM HPRE WLWP+TGEVHW+GV LKMDKKRKTKEKLY
Sbjct: 977 LLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLY 1036
Query: 1045 ERMKYGYKQKSLG 1057
+R+K GYKQKSLG
Sbjct: 1037 DRIKNGYKQKSLG 1049
>AT5G04480.2 | Symbols: | UDP-Glycosyltransferase superfamily protein
| chr5:1271886-1277793 REVERSE LENGTH=1035
Length = 1035
Score = 1214 bits (3141), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 599/1033 (57%), Positives = 731/1033 (70%), Gaps = 32/1033 (3%)
Query: 26 VGFHSIRGRFPFKRNPSHNHHRDRQKSSSDRQLPXXXXXXXXXXXXXHLHNRFTRXXXXX 85
FHSIR R KRN S RDR S DR H R
Sbjct: 33 TSFHSIRDRLRLKRNSSD--RRDRSHSGLDRP---------SLRTRPHHIGRSLNRKGLL 81
Query: 86 XXXXXXXXXXXYAMIISVVFLFALASMVMQSSITSVFRQRTERGRHLREGLKLGSALKFV 145
Y ++ V F ++S+++Q+SIT Q +G +R + LGS LK+V
Sbjct: 82 SLLKPRGTCLLYFLVAFTVCAFVMSSLLLQNSITW---QGNVKGGQVRSQIGLGSTLKYV 138
Query: 146 PGRVSQRFLSGDGLDRVRSQPRIGVRAPRIALILGHMMIDPQSLMLVTLIQNLQKLGYVF 205
PG +++ + G GLD +RS RIGVR PR+AL+LG+M DP++LMLV
Sbjct: 139 PGGIARTLIEGKGLDPLRSAVRIGVRPPRLALVLGNMKKDPRTLMLV------------- 185
Query: 206 RIFAVGHGNARSIWENIDGGLTPLSTEQQGRIDWSTFEGIIADSLEAKEAISSLMQEPFC 265
FAV +G ARS+WE + G + L +EQ G DW+ FEG+IADSLEAKEAISSLMQEPF
Sbjct: 186 --FAVENGEARSLWEQLAGHVKVLVSEQLGHADWTIFEGVIADSLEAKEAISSLMQEPFR 243
Query: 266 SVPLIWIIQEDSLSSRLPVYEQMGWQHLVSHWRSAFSRASVVVFPDFTYPMLYSELDSGN 325
SVPLIWI+ ED L++RLPVY++MG L+SHWRSAF+RA VVVFP FT PML+S LD GN
Sbjct: 244 SVPLIWIVHEDILANRLPVYQRMGQNSLISHWRSAFARADVVVFPQFTLPMLHSVLDDGN 303
Query: 326 FFVIPGSPIDVWAAESYSKTHTKDQLRELSGFGKNDMLVLVVGSSIFYDDLSWDYAVAMH 385
F VIP S +DVWAAESYS+THTK LRE++ FG++D+++LV+GSS FYD+ SWD AVAMH
Sbjct: 304 FVVIPESVVDVWAAESYSETHTKQNLREINEFGEDDVIILVLGSSFFYDEFSWDNAVAMH 363
Query: 386 SLGPLLTKYARRNDDAESFKFVFLCGNSTDGYDDALQEVASRLGLPHGSIRHYGLNGDVN 445
LGPLLT+Y RR D + SFKFVFL GNST G DA+QEVASRLGL G++RH+GLN DVN
Sbjct: 364 MLGPLLTRYGRRKDTSGSFKFVFLYGNSTKGQSDAVQEVASRLGLTEGTVRHFGLNEDVN 423
Query: 446 SVLLMADIILYGSAQDVQGFPPLLIRAMTFEIPVIAPDFPVLKKYIVDGVHGIFFSKHNP 505
VL MADI++Y S+Q+ Q FPPL++RAM+F IP+I PDFP++KKY+ D VHGIFF +++P
Sbjct: 424 RVLRMADILVYASSQEEQNFPPLIVRAMSFGIPIITPDFPIMKKYMADEVHGIFFRRNDP 483
Query: 506 EALMNAFSLLLSSGRLSKFAQAIGLSGRQFAKNMLALDCITGYARLLENVLSFPSDALLP 565
+AL+ AFS L+S GRLSKFAQ I SGR KN++A +CITGYARLLEN+L FPSD LP
Sbjct: 484 DALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECITGYARLLENMLHFPSDTFLP 543
Query: 566 GPVSQIQQGGWEWNFFQNEIELDIKLRTMDDEISY-GKGTIIHAVEQDLASLNYSTSFLE 624
G +SQ+Q WEWNFF++E+E K +D ++ GK I+ VE+ + ST+ ++
Sbjct: 544 GSISQLQVAAWEWNFFRSELE-QPKSFILDSAYAFIGKSGIVFQVEEKFMGVIESTNPVD 602
Query: 625 NETEVPMQDKLTKLDWDVLRXXXXXXXXXXXXXXXXXXXXXKYVGVWDDIYRNARKSEKL 684
N T + +KLDWDVL + V W++IYRNARKSEKL
Sbjct: 603 NNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDVEDWEEIYRNARKSEKL 662
Query: 685 KFEANERDEGELERTGQPVCIYEIYSGTGAWPFLHHGSLYRGXXXXXXXXXXXXXXVDAV 744
KFE NERDEGELERTG+P+CIYEIY+G GAWPFLHHGSLYRG VDA
Sbjct: 663 KFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLSLSSKDRRLSSDDVDAA 722
Query: 745 GRLPLLNDTYYRDILCEMGGLFAIANRVDSIHKRPWIGFQSWHAAGRKVALSAEAERVLE 804
RLPLLNDTYYRDILCE+GG+F++AN+VDSIH RPWIGFQSW AAGRKV+LS++AE LE
Sbjct: 723 DRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRAAGRKVSLSSKAEESLE 782
Query: 805 ETMHENLRGDVIYFWGRLDMDGSIIGSNNALTFWSMCDILNGGNCRTVFQDGFRQMYALP 864
+ + +G++IYFW RLD+DG GS NALTFWSMCDILN GNCRT F+D FR MY LP
Sbjct: 783 NIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGNCRTTFEDAFRHMYGLP 842
Query: 865 PHAEALPPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFVDSIDAVHTDSSSKYSMCLLG 924
H EALPPMPEDG +WS+LH+WVMPTPSFLEF+MFSRMF +S+DA+H + + S C L
Sbjct: 843 EHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLDALHNNLNDSKS-CSLA 901
Query: 925 SSEIEKKHCYCRVLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNFS 984
SS +E+KHCYCRVLELL+NVWAYHS RKMVYINP GS+EEQHP++QRKG MWAKYFNF+
Sbjct: 902 SSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPLQQRKGLMWAKYFNFT 961
Query: 985 LLKSMXXXXXXXXXXXXHPREVWLWPMTGEVHWQGVXXXXXXXXXXLKMDKKRKTKEKLY 1044
LLKSM HPRE WLWP+TGEVHW+GV LKMDKKRKTKEKLY
Sbjct: 962 LLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERYRLKMDKKRKTKEKLY 1021
Query: 1045 ERMKYGYKQKSLG 1057
+R+K GYKQKSLG
Sbjct: 1022 DRIKNGYKQKSLG 1034
>AT4G01210.1 | Symbols: | glycosyl transferase family 1 protein |
chr4:507738-512362 REVERSE LENGTH=1031
Length = 1031
Score = 631 bits (1627), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/911 (37%), Positives = 517/911 (56%), Gaps = 64/911 (7%)
Query: 167 RIGVRAPRIALILGHMMIDPQSLMLVTLIQNLQKLGYVFRIFAVGHGNARSIWENIDGGL 226
R G R P++AL+ G ++ DP+ +++V+L + LQ++GY ++++ G SIW+ + +
Sbjct: 141 RFGFRKPKLALVFGDLLADPEQVLMVSLSKALQEVGYAIEVYSLEDGPVNSIWQKMGVPV 200
Query: 227 TPLSTEQQGR--IDWSTFEGIIADSLEAKEAISSLMQEPFCSVPLIWIIQEDSLSSRLPV 284
T L Q+ IDW +++GII +SL A+ + MQEPF S+PLIW+I E++L+ R
Sbjct: 201 TILKPNQESSCVIDWLSYDGIIVNSLRARSMFTCFMQEPFKSLPLIWVINEETLAVRSRQ 260
Query: 285 YEQMGWQHLVSHWRSAFSRASVVVFPDFTYPMLYSELDSGNFFVIPGSPIDVWAAESYSK 344
Y G L++ W+ FSRASVVVF ++ P+LY+E D+GNF+VIPGSP +V A++
Sbjct: 261 YNSTGQTELLTDWKKIFSRASVVVFHNYLLPILYTEFDAGNFYVIPGSPEEVCKAKNLEF 320
Query: 345 THTKDQLRELSGFGKNDMLVLVVGSSIFYDDLSWDYAVAMHSLGPLLTKYARRNDDAESF 404
KD D+++ +VGS Y ++A+ + +L PL + +D++
Sbjct: 321 PPQKD-----------DVVISIVGSQFLYKGQWLEHALLLQALRPLFSGNYLESDNSH-L 368
Query: 405 KFVFLCGNSTDGYDDALQEVASRLGLPHGSIRHYGLNGDVNSVLLMADIILYGSAQDVQG 464
K + L G + Y A++ ++ L P +++H + G+V+ +L +D+++YGS + Q
Sbjct: 369 KIIVLGGETASNYSVAIETISQNLTYPKEAVKHVRVAGNVDKILESSDLVIYGSFLEEQS 428
Query: 465 FPPLLIRAMTFEIPVIAPDFPVLKKYIVDGVHGIFFSKHNPEALMNAFSLLLSSGRLSKF 524
FP +L++AM+ P++APD ++KY+ D V G F K N + L +++ G++S
Sbjct: 429 FPEILMKAMSLGKPIVAPDLFNIRKYVDDRVTGYLFPKQNLKVLSQVVLEVITEGKISPL 488
Query: 525 AQAIGLSGRQFAKNMLALDCITGYARLLENVLSFPSDALLPGPVSQIQ---QGGWEWNFF 581
AQ I + G+ KNM+A + I GYA LLEN+L F S+ P V ++ + W W+ F
Sbjct: 489 AQKIAMMGKTTVKNMMARETIEGYAALLENMLKFSSEVASPKDVQKVPPELREEWSWHPF 548
Query: 582 Q--------NEIELDIK-LRTMDDEISYGKGTIIH--AVEQDLASLNYSTSFLENETEVP 630
+ N I + L ++ +Y G + AV D ++ E E +
Sbjct: 549 EAFMDTSPNNRIARSYEFLAKVEGHWNYTPGEAMKFGAVNDD----SFVYEIWEEERYLQ 604
Query: 631 MQDKLTKLDWDVLRXXXXXXXXXXXXXXXXXXXXXKYVGVWDDIYRNARKSEKLKFEANE 690
M + + + + L+ +Y G W+D+Y++A+++++ K + +E
Sbjct: 605 MMNSKKRREDEELKSRVL-----------------QYRGTWEDVYKSAKRADRSKNDLHE 647
Query: 691 RDEGELERTGQPVCIYEIYSGTGAWPFLHHGSLYRGXXXXXXXXXXXXXXVDAVGRLPLL 750
RDEGEL RTGQP+CIYE Y G G W FLH LYRG VDA RLPL
Sbjct: 648 RDEGELLRTGQPLCIYEPYFGEGTWSFLHQDPLYRGVGLSVKGRRPRMDDVDASSRLPLF 707
Query: 751 NDTYYRDILCEMGGLFAIANRVDSIHKRPWIGFQSWHAAGRKVALSAEAERVLEETMHEN 810
N+ YYRD L + G FAI+N++D +HK WIGFQSW A RK +LS AE L +
Sbjct: 708 NNPYYRDALGDFGAFFAISNKIDRLHKNSWIGFQSWRATARKESLSKIAEDALLNAIQTR 767
Query: 811 LRGDVIYFWGRLDMDGSIIGSNNALT--FWSMCDILNGGNCRTVFQDGFRQMYALPPHAE 868
GD +YFW R+D D N L FWS CD +N GNCR + + ++MY++ + +
Sbjct: 768 KHGDALYFWVRMDKD-----PRNPLQKPFWSFCDAINAGNCRFAYNETLKKMYSI-KNLD 821
Query: 869 ALPPMPEDGGYWSALHSWVMPTPSFLEFIMFSRMFVDSIDAVHTDSSSKYSMCLLGSSEI 928
+LPPMPEDG WS + SW +PT SFLEF+MFSRMFVDS+DA + + + C L S
Sbjct: 822 SLPPMPEDGDTWSVMQSWALPTRSFLEFVMFSRMFVDSLDAQIYEEHHRTNRCYL--SLT 879
Query: 929 EKKHCYCRVLELLINVWAYHSARKMVYINPNTGSMEEQHPIEQRKGFMWAKYFNFSLLKS 988
+ KHCY RVLELL+NVWAYHSAR++VYI+P TG M+EQH + R+G MW K+F+++ LK+
Sbjct: 880 KDKHCYSRVLELLVNVWAYHSARRIVYIDPETGLMQEQHKQKNRRGKMWVKWFDYTTLKT 939
Query: 989 MXXXXXXXXXXXXHPREV--WLWPMTGEVHWQGVXXXXXXXXXXLKMDKKRKTKEKLYER 1046
M R V WLWP TGE+ W+G K +KK+K+++KL
Sbjct: 940 MDEDLAEEADSD---RRVGHWLWPWTGEIVWRGTLEKEKQKKNLEKEEKKKKSRDKLSRM 996
Query: 1047 MKYGYKQKSLG 1057
+QK +G
Sbjct: 997 RSRSGRQKVIG 1007