Miyakogusa Predicted Gene
- Lj4g3v0654570.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0654570.1 Non Chatacterized Hit- tr|H2ZRQ7|H2ZRQ7_LATCH
Uncharacterized protein (Fragment) OS=Latimeria
chalum,28.41,1e-16,seg,NULL; LACTOSYLCERAMIDE
4-ALPHA-GALACTOSYLTRANSFERASE (ALPHA- 1,4-GALACTOSYLTRANSFERASE),NULL;
Gl,CUFF.47849.1
(590 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G19900.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 610 e-175
AT3G09020.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 134 2e-31
AT2G38152.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 132 6e-31
AT5G01250.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 124 2e-28
AT1G61050.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 120 2e-27
AT2G38150.1 | Symbols: | alpha 1,4-glycosyltransferase family p... 119 5e-27
>AT4G19900.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr4:10789144-10791433 REVERSE LENGTH=644
Length = 644
Score = 610 bits (1574), Expect = e-175, Method: Compositional matrix adjust.
Identities = 313/569 (55%), Positives = 402/569 (70%), Gaps = 47/569 (8%)
Query: 61 DPIDELDIIV---------DENQ--QEDRPLHLN----TPSSAYFFDPISAAIRRAFLSP 105
D IDE D + DENQ ++++ + LN SS ++FD ++ IRRAF
Sbjct: 84 DRIDEHDDAIEDDGVSNEEDENQDAEQEQEVDLNRNKAASSSGFYFDHVNGVIRRAF--N 141
Query: 106 PSSIHQWHSFDNDDNKFSAPVDRSI-----TAFGSDDVHLHDYLRSKTTPVTSIEDALLL 160
SI +W D D FS D S AFGSDDV L + +R K VTS+EDALLL
Sbjct: 142 KRSIDEW---DYDYTGFSIDSDSSGDKSSRAAFGSDDVPLDESIRRKIVEVTSVEDALLL 198
Query: 161 KS----SPLRHGWGHWFEKKGLFLRKDRMFRSSFDALNPVNNPLLQDPDGAGVTGFTRGD 216
KS SPLR GWG WF+KKG FLR+DRMF+S+ + LNP+NNP+LQDPD G TG TRGD
Sbjct: 199 KSGKKVSPLRQGWGDWFDKKGDFLRRDRMFKSNIETLNPLNNPMLQDPDSVGNTGLTRGD 258
Query: 217 RILQKWWLNEFKRVPFPGNK---------NPNKLPIVTK----KLGTERKTLNDDEN-NK 262
+++QKW LN+ KR PF K PN+ +++ K G ERKTL++DE +
Sbjct: 259 KVVQKWRLNQIKRNPFMAKKPLSVVSEKKEPNEFRLLSSVGEIKRG-ERKTLDNDEKIER 317
Query: 263 GSLGDIIDDRHH-EFRNHIYADGNTWGYFPGLPLRLSFDDFMEAFFRRGKCVMRVFMVWN 321
++ +R H E H+YADG WGY+PG+ LSF DFM++FFR+ KC MRVFMVWN
Sbjct: 318 EEQKNVESERKHDEVTEHMYADGTKWGYYPGIEPSLSFSDFMDSFFRKEKCSMRVFMVWN 377
Query: 322 SPPWMYTVRYQRGLESLLFHHPNACVVVFSETIELDFFKDSFVKDGYKVAVVMPNLDQLL 381
SP WM++VR+QRGLESLL H +ACVVVFSET+ELDFF++SFVKD YKVAV MPNLD+LL
Sbjct: 378 SPGWMFSVRHQRGLESLLSQHRDACVVVFSETVELDFFRNSFVKDSYKVAVAMPNLDELL 437
Query: 382 KDTPAHIFSSVWFEWRKTKFYPTHFSELIRLAALYKYGGIYLDSDIIVWKPISFLNNSVG 441
+DTP H+F+SVWF+WRKTKFYPTH+SEL+RLAALYKYGG+YLDSD+IV +S L N++G
Sbjct: 438 QDTPTHVFASVWFDWRKTKFYPTHYSELVRLAALYKYGGVYLDSDVIVLGSLSSLRNTIG 497
Query: 442 VEEHAPGAGSALNGAVMAFAKHSLFIKECMEEFYTTYDDTNLRWNGADLLTRVARKFMGD 501
+E+ AG +LNGAVM+F K S F+ EC+ E+Y TYDD LR NGADLLTRVA++F+
Sbjct: 498 MEDQV--AGESLNGAVMSFEKKSPFLLECLNEYYLTYDDKCLRCNGADLLTRVAKRFLNG 555
Query: 502 DNKSIKQLDLKVEPSHIFFPITSQNITRYFIAPATETEKAQEDVLLKKIMQESLTFHFWN 561
N+ + Q +L + PS +FFPI SQ IT YF PA E E++Q+D KKI+ ESLTFHFWN
Sbjct: 556 KNRRMNQQELNIRPSSVFFPINSQQITNYFAYPAIEDERSQQDESFKKILNESLTFHFWN 615
Query: 562 SLTSALIPEPDSLVTRLMNYACIRCLELL 590
S+TS+LIPEP+SLV + ++++CIRC ++L
Sbjct: 616 SVTSSLIPEPESLVAKFLDHSCIRCSDVL 644
>AT3G09020.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr3:2753307-2754542 FORWARD LENGTH=411
Length = 411
Score = 134 bits (336), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 143/292 (48%), Gaps = 21/292 (7%)
Query: 307 FRRGKCVMRVFMVWNSPPWMYTVRYQRGLESLLFHHPNACVVVFSETIE-LDFFK--DSF 363
F R C ++ M W SP ++ R +ES+ H C+++ S T++ L F+ F
Sbjct: 122 FLRDDCEVKFMMTWISPAELFGKREILSVESVFKSHARGCLMILSSTMDSLQGFRILKPF 181
Query: 364 VKDGYKVAVVMPNLDQLLKDTPAHIFSSVWFEWRKTK-------FYPTHFSELIRLAALY 416
+ GY+V V P+L LLKDT W E +T + S L+RLA L+
Sbjct: 182 LDRGYRVMAVTPDLPFLLKDTAGE----SWLEEIQTGKRDPGKISLAQNLSNLMRLAYLF 237
Query: 417 KYGGIYLDSDIIVWKPISFLNNSVGVEEHAPGAG--SALNGAVMAFAKHSLFIKECMEEF 474
K+GG+YLD+D+IV K L N +G + P + + LN AV+ F K+ F+ + +EEF
Sbjct: 238 KFGGVYLDTDMIVLKSFKTLRNVIGAQTLEPVSRNWTRLNNAVLIFDKNHPFLLKSIEEF 297
Query: 475 YTTYDDTNLRWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSHIFFPITSQNITRYFIAP 534
T++ NG L++RVAR G D + L P+ F+P+ I + F P
Sbjct: 298 ALTFNGNVWGHNGPYLVSRVARAVEGTDGYNFTIL---TPPA--FYPVNWVEIEKLFKVP 352
Query: 535 ATETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYACIRC 586
TE + + V + ++ + S H WN + E S + +L++ CI C
Sbjct: 353 RTEKDSKRVQVKVLEMQKRSYGLHLWNKFSRKFEIEQGSAMDKLVSNQCIIC 404
>AT2G38152.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr2:15984062-15985278 REVERSE LENGTH=380
Length = 380
Score = 132 bits (333), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 94/290 (32%), Positives = 138/290 (47%), Gaps = 30/290 (10%)
Query: 314 MRVFMVWNSPPWMYTVRYQRGLESLLFHHPNACVVVFSETIELDFFKDSFVK----DGYK 369
+R FM W SP + R +ES+ HP C+++ S +++ DS +K GYK
Sbjct: 99 VRFFMTWFSPAEYFGKREMLAVESVFKAHPQGCLMIVSGSLD-SLQGDSILKPLNDRGYK 157
Query: 370 VAVVMPNLDQLLKDTPAHIFSSVWFEWRKT-----KFYPTH--FSELIRLAALYKYGGIY 422
V P++ LL++TPA WF+ K+ P H S L RLA LYKYGG+Y
Sbjct: 158 VFAATPDMSLLLENTPA----KSWFQEMKSCKRDPGRIPLHQNLSNLARLAFLYKYGGVY 213
Query: 423 LDSDIIVWKPISFLNNSVGVEEHAPGAG---SALNGAVMAFAKHSLFIKECMEEFYTTYD 479
LD+D IV + L NS+G + G + LN AV+ F K + +EEF +T+D
Sbjct: 214 LDTDFIVTRSFKGLKNSIGAQTVVEGDSKNWTRLNNAVLIFEKDHPLVYSFIEEFASTFD 273
Query: 480 DTNLRWNGADLLTRV---ARKFMGDDNKSIKQLDLKVEPSHIFFPITSQNITRYFIAPAT 536
NG L+TRV AR+ +GD + V P F+P +I R F P
Sbjct: 274 GNKWGHNGPYLVTRVAQRARETIGD--------NFTVLPPVAFYPFNWLDIPRLFQTPRG 325
Query: 537 ETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYACIRC 586
+ L K+ +ES H WN +T L S++ +++ C+ C
Sbjct: 326 SNDSTLLKTDLVKLNRESYGLHLWNKITRKLKIGKGSVIDIIISDHCVVC 375
>AT5G01250.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr5:102370-103593 REVERSE LENGTH=407
Length = 407
Score = 124 bits (311), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 139/309 (44%), Gaps = 22/309 (7%)
Query: 290 FPGLPLRLSFDDFMEAFFRRGKCVMRVFMVWNSPPWMYTVRYQRGLESLLFHHPNACVVV 349
F G L F + F G C + M W SP + R +ES+ HP C+++
Sbjct: 102 FSGDNLSDKFQKRVNEFVGDG-CEVNFVMTWISPADFFGNREVLAIESVFKSHPYGCLMI 160
Query: 350 FSETIELDFFKDS---FVKDGYKVAVVMPNLDQLLKDTPAHIFSSVWF-EWRKTKFYP-- 403
S T++ + F+ GYKV V P+L LLK T + W E + K P
Sbjct: 161 LSATMDSPQGYATLKPFIDRGYKVLAVTPDLPFLLKGTAGEL----WLDEIKSGKRDPGK 216
Query: 404 ----THFSELIRLAALYKYGGIYLDSDIIVWKPISFLNNSVGVEEHAPGAG--SALNGAV 457
+ S L+RLA LYKYGG+YLD+D+IV K L N +G + P + + LN AV
Sbjct: 217 ISLAQNLSNLMRLAYLYKYGGVYLDTDMIVLKSFKGLRNVIGAQTLDPSSTNWTRLNNAV 276
Query: 458 MAFAKHSLFIKECMEEFYTTYDDTNLRWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSH 517
+ F K+ + + MEEF T++ +NG L++RVAR G + V
Sbjct: 277 LIFDKNHPLLLKFMEEFAKTFNGNIWGYNGPYLVSRVARAVEGSSG-----YNFTVMRPS 331
Query: 518 IFFPITSQNITRYFIAPATETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTR 577
+F+ + I + F P TE + L + + H WN + E S + +
Sbjct: 332 VFYSVNWLEIKKLFKVPKTEKDSKWVKTKLLHMQRNGYGLHLWNKFSRKYEIEQGSAMWK 391
Query: 578 LMNYACIRC 586
L++ CI C
Sbjct: 392 LVSEHCIIC 400
>AT1G61050.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr1:22486736-22488043 FORWARD LENGTH=435
Length = 435
Score = 120 bits (302), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/303 (30%), Positives = 143/303 (47%), Gaps = 20/303 (6%)
Query: 296 RLSFDDFMEAFFRRGKCVMRVFMVWNSPPWMYTVRYQRGLESLLFHHPNACVVVFSETIE 355
R F +++ + C FM W S + R + +ESL HPN C+++ S + +
Sbjct: 133 RQRFQTRVKSLLSKSSCESLFFMTWISSIESFGDRERFTIESLFKFHPNGCLILVSNSFD 192
Query: 356 LD---FFKDSFVKDGYKVAVVMPNLDQLLKDTPAHIFSSVWFE-WRKTKFYP------TH 405
D F G KV + P+ + KDT A WFE +K P +
Sbjct: 193 CDRGTLILKPFTDKGLKVLPIKPDFAYIFKDTSAE----KWFERLKKGTLSPGVIPLEQN 248
Query: 406 FSELIRLAALYKYGGIYLDSDIIVWKPISFLNNSVGVEEHAPGAG--SALNGAVMAFAKH 463
S L+RL LYKYGGIYLD+D+I+ K +S L+N +G + P S LN AV+ F K+
Sbjct: 249 LSNLLRLVLLYKYGGIYLDTDVIILKSLSNLHNVIGAQTVDPVTKKWSRLNNAVLIFDKN 308
Query: 464 SLFIKECMEEFYTTYDDTNLRWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSHIFFPIT 523
+K ++EF T++ NG L++RV + S L V P F+P+
Sbjct: 309 HPLLKRFIDEFSRTFNGNKWGHNGPYLVSRVITRI---KISSSSDLGFSVLPPSAFYPVD 365
Query: 524 SQNITRYFIAPATETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYAC 583
I ++ AP E++ A L + + + H WN + L E S++ +LM+++C
Sbjct: 366 WTRIKGFYRAPTNESD-AWLRKRLTHLRKNTFAVHLWNRESKKLRIEEGSIIHQLMSHSC 424
Query: 584 IRC 586
I C
Sbjct: 425 IFC 427
>AT2G38150.1 | Symbols: | alpha 1,4-glycosyltransferase family
protein | chr2:15981700-15982917 REVERSE LENGTH=405
Length = 405
Score = 119 bits (298), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 137/288 (47%), Gaps = 22/288 (7%)
Query: 312 CVMRVFMVWNSPPWMYTVRYQRGLESLLFHHPNACVVVFSETIE----LDFFKDSFVKDG 367
C + FM+W SP + R +++L +P AC+ + S +++ K F G
Sbjct: 119 CSAQFFMIWLSPANSFGPREMLAIDTLFTTNPGACLAILSNSLDSPNGYTILKPLF-DQG 177
Query: 368 YKVAVVMPNLDQLLKDTPAHIFSSVWFEWRKTKFYP-------THFSELIRLAALYKYGG 420
+ + V ++ L+K+TPA W + K+ + S+L RLA LYKYGG
Sbjct: 178 FNLIAVTIDIPFLVKNTPAE----AWLKRLKSGNMDPGSIPLFMNLSDLTRLAVLYKYGG 233
Query: 421 IYLDSDIIVWKPISFLNNSVGVEEHAPGAG--SALNGAVMAFAKHSLFIKECMEEFYTTY 478
+YLD+DII ++ L N++G + P + LN AVM F + ++E ++E+ TT+
Sbjct: 234 VYLDTDIIFLNDMTGLRNAIGAQSSDPATKRWTRLNNAVMVFDIYHPLMREFLQEYATTF 293
Query: 479 DDTNLRWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSHIFFPITSQNITRYFIAPATET 538
D +N L++RV ++ + +L + F+P+ I + F PAT
Sbjct: 294 DGNKWGYNSPYLVSRVIKRLGNKPGYN----NLTIFSPDAFYPVNWIKIQKLFKKPATTR 349
Query: 539 EKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYACIRC 586
E + ++ + + S H WN +T + E S++ L++ C C
Sbjct: 350 EAKWVEKTVQDMNKGSYMIHLWNKVTRKIKIEEGSVMHTLVSTHCTVC 397