Miyakogusa Predicted Gene
- Lj1g3v2311890.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2311890.1 tr|G7KRB0|G7KRB0_MEDTR Lactosylceramide
4-alpha-galactosyltransferase OS=Medicago truncatula
GN=MTR_,66.28,0,LACTOSYLCERAMIDE 4-ALPHA-GALACTOSYLTRANSFERASE (ALPHA-
1,4-GALACTOSYLTRANSFERASE),NULL; Gb3_synth,Al,CUFF.28844.1
(437 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G38150.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 401 e-112
AT5G01250.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 378 e-105
AT3G09020.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 377 e-104
AT2G38152.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 356 1e-98
AT1G61050.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 337 9e-93
AT4G19900.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 149 5e-36
>AT2G38150.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr2:15981700-15982917 REVERSE LENGTH=405
Length = 405
Score = 401 bits (1031), Expect = e-112, Method: Compositional matrix adjust.
Identities = 183/330 (55%), Positives = 237/330 (71%), Gaps = 3/330 (0%)
Query: 109 LIAPLNVTEEERIAWFRGNLHEFKILKSNNLTRQFHARVQGFFNHDHQCESQFFMTWISP 168
L+ P ++ +RI WFR L E +ILKS ++ FH RV +N + C +QFFM W+SP
Sbjct: 73 LLPPRKASKNQRIDWFRRKLPELEILKSTTKSKSFHTRVLDLYNKN--CSAQFFMIWLSP 130
Query: 169 ASSFGAREFFCLETLFKVHPGACLVILSRTLDSTHGHRILKPLLDRGFRVQAVSPDFSFL 228
A+SFG RE ++TLF +PGACL ILS +LDS +G+ ILKPL D+GF + AV+ D FL
Sbjct: 131 ANSFGPREMLAIDTLFTTNPGACLAILSNSLDSPNGYTILKPLFDQGFNLIAVTIDIPFL 190
Query: 229 LKGTPAEAWFHQLRKGKKDPGEIPLFQNLSNLIRLAVLYKYGGVYLDTDFLVLKPLTGLR 288
+K TPAEAW +L+ G DPG IPLF NLS+L RLAVLYKYGGVYLDTD + L +TGLR
Sbjct: 191 VKNTPAEAWLKRLKSGNMDPGSIPLFMNLSDLTRLAVLYKYGGVYLDTDIIFLNDMTGLR 250
Query: 289 NCIGAQSMDLGSKQWTRLNNAILIFDMNHPLLLRFIHEFALTFDGNKWGHNGPYMVSRVV 348
N IGAQS D +K+WTRLNNA+++FD+ HPL+ F+ E+A TFDGNKWG+N PY+VSRV+
Sbjct: 251 NAIGAQSSDPATKRWTRLNNAVMVFDIYHPLMREFLQEYATTFDGNKWGYNSPYLVSRVI 310
Query: 349 AKLGKSPGF-KFTILPPVAFYPVDWLKIGGFFRKPKTQGEAKWVDAKLIQLSGESYGIHL 407
+LG PG+ TI P AFYPV+W+KI F+KP T EAKWV+ + ++ SY IHL
Sbjct: 311 KRLGNKPGYNNLTIFSPDAFYPVNWIKIQKLFKKPATTREAKWVEKTVQDMNKGSYMIHL 370
Query: 408 WNKQSSRFLIEEGSVIARLVSEHCVICNSL 437
WNK + + IEEGSV+ LVS HC +C ++
Sbjct: 371 WNKVTRKIKIEEGSVMHTLVSTHCTVCGNI 400
>AT5G01250.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr5:102370-103593 REVERSE LENGTH=407
Length = 407
Score = 378 bits (970), Expect = e-105, Method: Compositional matrix adjust.
Identities = 194/423 (45%), Positives = 265/423 (62%), Gaps = 31/423 (7%)
Query: 16 DTERRTSKRMFHRRVLKRGAKLYISIFSLITIYAIIFLIHDDGVIYHDSLEVLQG---QS 72
D E+R + + +RR+ + G+ SL T +AI F+ +I + ++ Q
Sbjct: 5 DIEKRFTVVIDNRRLNQSGSS------SLFTAFAISFVT----LIVVTTFTLISNFSMQP 54
Query: 73 HRNEEEIKSTLA-LATHIALRSMQEQRDGVDKGSQRVLIAPLNVTEEERIAWFRGNLHEF 131
HR+ +K + + H+ L S +E G + L+ E+ L
Sbjct: 55 HRDFSGVKIEIKRVIPHLPLSSERE-------GERSDLLKQQTQVNEK--------LQVI 99
Query: 132 KILKSNNLTRQFHARVQGFFNHDHQCESQFFMTWISPASSFGAREFFCLETLFKVHPGAC 191
++ +NL+ +F RV F CE F MTWISPA FG RE +E++FK HP C
Sbjct: 100 EVFSGDNLSDKFQKRVNEFVGDG--CEVNFVMTWISPADFFGNREVLAIESVFKSHPYGC 157
Query: 192 LVILSRTLDSTHGHRILKPLLDRGFRVQAVSPDFSFLLKGTPAEAWFHQLRKGKKDPGEI 251
L+ILS T+DS G+ LKP +DRG++V AV+PD FLLKGT E W +++ GK+DPG+I
Sbjct: 158 LMILSATMDSPQGYATLKPFIDRGYKVLAVTPDLPFLLKGTAGELWLDEIKSGKRDPGKI 217
Query: 252 PLFQNLSNLIRLAVLYKYGGVYLDTDFLVLKPLTGLRNCIGAQSMDLGSKQWTRLNNAIL 311
L QNLSNL+RLA LYKYGGVYLDTD +VLK GLRN IGAQ++D S WTRLNNA+L
Sbjct: 218 SLAQNLSNLMRLAYLYKYGGVYLDTDMIVLKSFKGLRNVIGAQTLDPSSTNWTRLNNAVL 277
Query: 312 IFDMNHPLLLRFIHEFALTFDGNKWGHNGPYMVSRVVAKLGKSPGFKFTILPPVAFYPVD 371
IFD NHPLLL+F+ EFA TF+GN WG+NGPY+VSRV + S G+ FT++ P FY V+
Sbjct: 278 IFDKNHPLLLKFMEEFAKTFNGNIWGYNGPYLVSRVARAVEGSSGYNFTVMRPSVFYSVN 337
Query: 372 WLKIGGFFRKPKTQGEAKWVDAKLIQLSGESYGIHLWNKQSSRFLIEEGSVIARLVSEHC 431
WL+I F+ PKT+ ++KWV KL+ + YG+HLWNK S ++ IE+GS + +LVSEHC
Sbjct: 338 WLEIKKLFKVPKTEKDSKWVKTKLLHMQRNGYGLHLWNKFSRKYEIEQGSAMWKLVSEHC 397
Query: 432 VIC 434
+IC
Sbjct: 398 IIC 400
>AT3G09020.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr3:2753307-2754542 FORWARD LENGTH=411
Length = 411
Score = 377 bits (967), Expect = e-104, Method: Compositional matrix adjust.
Identities = 192/417 (46%), Positives = 268/417 (64%), Gaps = 21/417 (5%)
Query: 23 KRMF-HRRVLKRGAKLYISIFSLITIYAIIFLIHDDGVIYHDSLEVLQGQSHRNEE-EIK 80
+ MF HRR+ + G+ L+ + F+ I ++F I + +L V + S + EIK
Sbjct: 10 RAMFDHRRLNRSGSSLFTA-FASTVIALVVFTI-----VLVSNLSVREDFSAKVVTIEIK 63
Query: 81 STLALATHIALRSMQEQRDGVDKGSQRVLIAPLNVTEEERIAWFRGNLHEFKILKSNNLT 140
+ + ++ L S +E D V+ +T +E I L ++ +++
Sbjct: 64 T---IVPYLPLSSEKEVSDQVNNNYS----IKQQITVKEEI----NKLQVLEVFGGKDVS 112
Query: 141 RQFHARVQGFFNHDHQCESQFFMTWISPASSFGAREFFCLETLFKVHPGACLVILSRTLD 200
+F R F D CE +F MTWISPA FG RE +E++FK H CL+ILS T+D
Sbjct: 113 EKFQQRATEFLRDD--CEVKFMMTWISPAELFGKREILSVESVFKSHARGCLMILSSTMD 170
Query: 201 STHGHRILKPLLDRGFRVQAVSPDFSFLLKGTPAEAWFHQLRKGKKDPGEIPLFQNLSNL 260
S G RILKP LDRG+RV AV+PD FLLK T E+W +++ GK+DPG+I L QNLSNL
Sbjct: 171 SLQGFRILKPFLDRGYRVMAVTPDLPFLLKDTAGESWLEEIQTGKRDPGKISLAQNLSNL 230
Query: 261 IRLAVLYKYGGVYLDTDFLVLKPLTGLRNCIGAQSMDLGSKQWTRLNNAILIFDMNHPLL 320
+RLA L+K+GGVYLDTD +VLK LRN IGAQ+++ S+ WTRLNNA+LIFD NHP L
Sbjct: 231 MRLAYLFKFGGVYLDTDMIVLKSFKTLRNVIGAQTLEPVSRNWTRLNNAVLIFDKNHPFL 290
Query: 321 LRFIHEFALTFDGNKWGHNGPYMVSRVVAKLGKSPGFKFTILPPVAFYPVDWLKIGGFFR 380
L+ I EFALTF+GN WGHNGPY+VSRV + + G+ FTIL P AFYPV+W++I F+
Sbjct: 291 LKSIEEFALTFNGNVWGHNGPYLVSRVARAVEGTDGYNFTILTPPAFYPVNWVEIEKLFK 350
Query: 381 KPKTQGEAKWVDAKLIQLSGESYGIHLWNKQSSRFLIEEGSVIARLVSEHCVICNSL 437
P+T+ ++K V K++++ SYG+HLWNK S +F IE+GS + +LVS C+IC+S+
Sbjct: 351 VPRTEKDSKRVQVKVLEMQKRSYGLHLWNKFSRKFEIEQGSAMDKLVSNQCIICDSV 407
>AT2G38152.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr2:15984062-15985278 REVERSE LENGTH=380
Length = 380
Score = 356 bits (914), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 171/360 (47%), Positives = 233/360 (64%), Gaps = 27/360 (7%)
Query: 79 IKSTLALATHIALRSMQEQRDGVDKGSQRVLIAPLNVTEEERIAWFRGNLHEFKILKSNN 138
I S ++L + S + ++ ++ L P N T +RIAW +L EF++
Sbjct: 45 ILSNMSLKSTFFWSSPTSEVIQTNRMERKSLAPPKNTTSRDRIAWLHSHLTEFEV----- 99
Query: 139 LTRQFHARVQGFFNHDHQCESQFFMTWISPASSFGAREFFCLETLFKVHPGACLVILSRT 198
+FFMTW SPA FG RE +E++FK HP CL+I+S +
Sbjct: 100 ---------------------RFFMTWFSPAEYFGKREMLAVESVFKAHPQGCLMIVSGS 138
Query: 199 LDSTHGHRILKPLLDRGFRVQAVSPDFSFLLKGTPAEAWFHQLRKGKKDPGEIPLFQNLS 258
LDS G ILKPL DRG++V A +PD S LL+ TPA++WF +++ K+DPG IPL QNLS
Sbjct: 139 LDSLQGDSILKPLNDRGYKVFAATPDMSLLLENTPAKSWFQEMKSCKRDPGRIPLHQNLS 198
Query: 259 NLIRLAVLYKYGGVYLDTDFLVLKPLTGLRNCIGAQSMDLG-SKQWTRLNNAILIFDMNH 317
NL RLA LYKYGGVYLDTDF+V + GL+N IGAQ++ G SK WTRLNNA+LIF+ +H
Sbjct: 199 NLARLAFLYKYGGVYLDTDFIVTRSFKGLKNSIGAQTVVEGDSKNWTRLNNAVLIFEKDH 258
Query: 318 PLLLRFIHEFALTFDGNKWGHNGPYMVSRVVAKLGKSPGFKFTILPPVAFYPVDWLKIGG 377
PL+ FI EFA TFDGNKWGHNGPY+V+RV + ++ G FT+LPPVAFYP +WL I
Sbjct: 259 PLVYSFIEEFASTFDGNKWGHNGPYLVTRVAQRARETIGDNFTVLPPVAFYPFNWLDIPR 318
Query: 378 FFRKPKTQGEAKWVDAKLIQLSGESYGIHLWNKQSSRFLIEEGSVIARLVSEHCVICNSL 437
F+ P+ ++ + L++L+ ESYG+HLWNK + + I +GSVI ++S+HCV+C +
Sbjct: 319 LFQTPRGSNDSTLLKTDLVKLNRESYGLHLWNKITRKLKIGKGSVIDIIISDHCVVCRGI 378
>AT1G61050.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr1:22486736-22488043 FORWARD LENGTH=435
Length = 435
Score = 337 bits (864), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 159/298 (53%), Positives = 212/298 (71%), Gaps = 4/298 (1%)
Query: 141 RQFHARVQGFFNHDHQCESQFFMTWISPASSFGAREFFCLETLFKVHPGACLVILSRTLD 200
++F RV+ + CES FFMTWIS SFG RE F +E+LFK HP CL+++S + D
Sbjct: 134 QRFQTRVKSLLSKS-SCESLFFMTWISSIESFGDRERFTIESLFKFHPNGCLILVSNSFD 192
Query: 201 STHGHRILKPLLDRGFRVQAVSPDFSFLLKGTPAEAWFHQLRKGKKDPGEIPLFQNLSNL 260
G ILKP D+G +V + PDF+++ K T AE WF +L+KG PG IPL QNLSNL
Sbjct: 193 CDRGTLILKPFTDKGLKVLPIKPDFAYIFKDTSAEKWFERLKKGTLSPGVIPLEQNLSNL 252
Query: 261 IRLAVLYKYGGVYLDTDFLVLKPLTGLRNCIGAQSMDLGSKQWTRLNNAILIFDMNHPLL 320
+RL +LYKYGG+YLDTD ++LK L+ L N IGAQ++D +K+W+RLNNA+LIFD NHPLL
Sbjct: 253 LRLVLLYKYGGIYLDTDVIILKSLSNLHNVIGAQTVDPVTKKWSRLNNAVLIFDKNHPLL 312
Query: 321 LRFIHEFALTFDGNKWGHNGPYMVSRVVA--KLGKSPGFKFTILPPVAFYPVDWLKIGGF 378
RFI EF+ TF+GNKWGHNGPY+VSRV+ K+ S F++LPP AFYPVDW +I GF
Sbjct: 313 KRFIDEFSRTFNGNKWGHNGPYLVSRVITRIKISSSSDLGFSVLPPSAFYPVDWTRIKGF 372
Query: 379 FRKPKTQGEAKWVDAKLIQLSGESYGIHLWNKQSSRFLIEEGSVIARLVSEHCVICNS 436
+R P + +A W+ +L L ++ +HLWN++S + IEEGS+I +L+S C+ CNS
Sbjct: 373 YRAPTNESDA-WLRKRLTHLRKNTFAVHLWNRESKKLRIEEGSIIHQLMSHSCIFCNS 429
>AT4G19900.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr4:10789144-10791433 REVERSE LENGTH=644
Length = 644
Score = 149 bits (375), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 145/304 (47%), Gaps = 24/304 (7%)
Query: 143 FHARVQGFFNHDHQCESQFFMTWISPASSFGAREFFCLETLFKVHPGACLVILSRTLDST 202
F + FF + +C + FM W SP F R LE+L H AC+V+ S T++
Sbjct: 355 FSDFMDSFFRKE-KCSMRVFMVWNSPGWMFSVRHQRGLESLLSQHRDACVVVFSETVELD 413
Query: 203 HGHRILKPLLDRGFRVQAVSPDFSFLLKGTP----AEAWFHQLRKGKKDPGEIPLFQNLS 258
+ ++V P+ LL+ TP A WF RK K P + S
Sbjct: 414 F---FRNSFVKDSYKVAVAMPNLDELLQDTPTHVFASVWF-DWRKTKFYP------THYS 463
Query: 259 NLIRLAVLYKYGGVYLDTDFLVLKPLTGLRNCIGAQSMDLGSKQWTRLNNAILIFDMNHP 318
L+RLA LYKYGGVYLD+D +VL L+ LRN IG + G LN A++ F+ P
Sbjct: 464 ELVRLAALYKYGGVYLDSDVIVLGSLSSLRNTIGMEDQVAGES----LNGAVMSFEKKSP 519
Query: 319 LLLRFIHEFALTFDGNKWGHNGPYMVSRVVAKL--GKSPGF---KFTILPPVAFYPVDWL 373
LL ++E+ LT+D NG +++RV + GK+ + I P F+P++
Sbjct: 520 FLLECLNEYYLTYDDKCLRCNGADLLTRVAKRFLNGKNRRMNQQELNIRPSSVFFPINSQ 579
Query: 374 KIGGFFRKPKTQGEAKWVDAKLIQLSGESYGIHLWNKQSSRFLIEEGSVIARLVSEHCVI 433
+I +F P + E D ++ ES H WN +S + E S++A+ + C+
Sbjct: 580 QITNYFAYPAIEDERSQQDESFKKILNESLTFHFWNSVTSSLIPEPESLVAKFLDHSCIR 639
Query: 434 CNSL 437
C+ +
Sbjct: 640 CSDV 643