Miyakogusa Predicted Gene

Lj4g3v0959990.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v0959990.1 tr|D0ABH5|D0ABH5_9ORYZ
OO_Ba0013J05-OO_Ba0033A15.32 protein OS=Oryza officinalis
GN=OO_Ba0013J05-OO_,32.16,4e-18,seg,NULL; Gb3_synth,Alpha
1,4-glycosyltransferase domain; LACTOSYLCERAMIDE
4-ALPHA-GALACTOSYLTRANSFE,gene.g53578.t1.1
         (512 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G19900.1 | Symbols:  | alpha 1,4-glycosyltransferase family p...   433   e-121
AT1G61050.1 | Symbols:  | alpha 1,4-glycosyltransferase family p...    93   5e-19
AT3G09020.1 | Symbols:  | alpha 1,4-glycosyltransferase family p...    86   8e-17
AT2G38150.1 | Symbols:  | alpha 1,4-glycosyltransferase family p...    84   2e-16
AT2G38152.1 | Symbols:  | alpha 1,4-glycosyltransferase family p...    83   4e-16
AT5G01250.1 | Symbols:  | alpha 1,4-glycosyltransferase family p...    81   2e-15

>AT4G19900.1 | Symbols:  | alpha 1,4-glycosyltransferase family
           protein | chr4:10789144-10791433 REVERSE LENGTH=644
          Length = 644

 Score =  433 bits (1113), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 247/550 (44%), Positives = 329/550 (59%), Gaps = 96/550 (17%)

Query: 51  DENQ--QEDRPLHLN----TPSSAYFFDPISAAIRRAFLSPPSSIHQWHSFDNDDNKFSA 104
           DENQ  ++++ + LN      SS ++FD ++  IRRAF     SI +W   D D   FS 
Sbjct: 103 DENQDAEQEQEVDLNRNKAASSSGFYFDHVNGVIRRAF--NKRSIDEW---DYDYTGFSI 157

Query: 105 PVDRSI-----TAFGSDDVHLHDYLRSKTTPVTSIGMAGVTGSRRKAS---------FSE 150
             D S       AFGSDDV L + +R K   VTS+  A +  S +K S         F +
Sbjct: 158 DSDSSGDKSSRAAFGSDDVPLDESIRRKIVEVTSVEDALLLKSGKKVSPLRQGWGDWFDK 217

Query: 151 K---IECSGSLKSSFDALNPVNNPLLQDPDGAGVTGFTRGDRILQKWWLNEFKRVPFPGN 207
           K   +      KS+ + LNP+NNP+LQDPD  G TG TRGD+++QKW LN+ KR PF   
Sbjct: 218 KGDFLRRDRMFKSNIETLNPLNNPMLQDPDSVGNTGLTRGDKVVQKWRLNQIKRNPFMAK 277

Query: 208 K---------NPNKLPIVTK----KLGTERKTLNDDEN-NKGSLGDIIDDRHH-EFRNHI 252
           K          PN+  +++     K G ERKTL++DE   +    ++  +R H E   H+
Sbjct: 278 KPLSVVSEKKEPNEFRLLSSVGEIKRG-ERKTLDNDEKIEREEQKNVESERKHDEVTEHM 336

Query: 253 YADGNTWGYFPGLPLRLSFDDFMEAFFRRGKCVMRVFMVWNSPPWMYTVRYQRGLESLLF 312
           YADG  WGY+PG+   LSF DFM++FFR+ KC MRVFMVWNSP WM++VR+QRGLESLL 
Sbjct: 337 YADGTKWGYYPGIEPSLSFSDFMDSFFRKEKCSMRVFMVWNSPGWMFSVRHQRGLESLLS 396

Query: 313 HHPNACVVVFSETIELDFFKDSFVKDG--------------------------------- 339
            H +ACVVVFSET+ELDFF++SFVKD                                  
Sbjct: 397 QHRDACVVVFSETVELDFFRNSFVKDSYKVAVAMPNLDELLQDTPTHVFASVWFDWRKTK 456

Query: 340 -----------------YGGIYLDSDIIVWKPISFLNNSVGVEEHAPGAGSALNGAVMAF 382
                            YGG+YLDSD+IV   +S L N++G+E+    AG +LNGAVM+F
Sbjct: 457 FYPTHYSELVRLAALYKYGGVYLDSDVIVLGSLSSLRNTIGMEDQV--AGESLNGAVMSF 514

Query: 383 AKHSLFIKECMEEFYTTYDDTNLRWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSHIFF 442
            K S F+ EC+ E+Y TYDD  LR NGADLLTRVA++F+   N+ + Q +L + PS +FF
Sbjct: 515 EKKSPFLLECLNEYYLTYDDKCLRCNGADLLTRVAKRFLNGKNRRMNQQELNIRPSSVFF 574

Query: 443 PITSQNITRYFIAPATETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMN 502
           PI SQ IT YF  PA E E++Q+D   KKI+ ESLTFHFWNS+TS+LIPEP+SLV + ++
Sbjct: 575 PINSQQITNYFAYPAIEDERSQQDESFKKILNESLTFHFWNSVTSSLIPEPESLVAKFLD 634

Query: 503 YACIRCLELL 512
           ++CIRC ++L
Sbjct: 635 HSCIRCSDVL 644


>AT1G61050.1 | Symbols:  | alpha 1,4-glycosyltransferase family
           protein | chr1:22486736-22488043 FORWARD LENGTH=435
          Length = 435

 Score = 92.8 bits (229), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 76/299 (25%), Positives = 123/299 (41%), Gaps = 62/299 (20%)

Query: 268 RLSFDDFMEAFFRRGKCVMRVFMVWNSPPWMYTVRYQRGLESLLFHHPNACVVVFSETIE 327
           R  F   +++   +  C    FM W S    +  R +  +ESL   HPN C+++ S + +
Sbjct: 133 RQRFQTRVKSLLSKSSCESLFFMTWISSIESFGDRERFTIESLFKFHPNGCLILVSNSFD 192

Query: 328 LD-------------------------FFKDSF-------VKDG---------------- 339
            D                          FKD+        +K G                
Sbjct: 193 CDRGTLILKPFTDKGLKVLPIKPDFAYIFKDTSAEKWFERLKKGTLSPGVIPLEQNLSNL 252

Query: 340 --------YGGIYLDSDIIVWKPISFLNNSVGVEEHAPGAG--SALNGAVMAFAKHSLFI 389
                   YGGIYLD+D+I+ K +S L+N +G +   P     S LN AV+ F K+   +
Sbjct: 253 LRLVLLYKYGGIYLDTDVIILKSLSNLHNVIGAQTVDPVTKKWSRLNNAVLIFDKNHPLL 312

Query: 390 KECMEEFYTTYDDTNLRWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSHIFFPITSQNI 449
           K  ++EF  T++      NG  L++RV  +       S   L   V P   F+P+    I
Sbjct: 313 KRFIDEFSRTFNGNKWGHNGPYLVSRVITRI---KISSSSDLGFSVLPPSAFYPVDWTRI 369

Query: 450 TRYFIAPATETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYACIRC 508
             ++ AP  E++ A     L  + + +   H WN  +  L  E  S++ +LM+++CI C
Sbjct: 370 KGFYRAPTNESD-AWLRKRLTHLRKNTFAVHLWNRESKKLRIEEGSIIHQLMSHSCIFC 427


>AT3G09020.1 | Symbols:  | alpha 1,4-glycosyltransferase family
           protein | chr3:2753307-2754542 FORWARD LENGTH=411
          Length = 411

 Score = 85.5 bits (210), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 53/171 (30%), Positives = 86/171 (50%), Gaps = 7/171 (4%)

Query: 340 YGGIYLDSDIIVWKPISFLNNSVGVEEHAPGAG--SALNGAVMAFAKHSLFIKECMEEFY 397
           +GG+YLD+D+IV K    L N +G +   P +   + LN AV+ F K+  F+ + +EEF 
Sbjct: 239 FGGVYLDTDMIVLKSFKTLRNVIGAQTLEPVSRNWTRLNNAVLIFDKNHPFLLKSIEEFA 298

Query: 398 TTYDDTNLRWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSHIFFPITSQNITRYFIAPA 457
            T++      NG  L++RVAR   G D  +   L     P+  F+P+    I + F  P 
Sbjct: 299 LTFNGNVWGHNGPYLVSRVARAVEGTDGYNFTIL---TPPA--FYPVNWVEIEKLFKVPR 353

Query: 458 TETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYACIRC 508
           TE +  +  V + ++ + S   H WN  +     E  S + +L++  CI C
Sbjct: 354 TEKDSKRVQVKVLEMQKRSYGLHLWNKFSRKFEIEQGSAMDKLVSNQCIIC 404


>AT2G38150.1 | Symbols:  | alpha 1,4-glycosyltransferase family
           protein | chr2:15981700-15982917 REVERSE LENGTH=405
          Length = 405

 Score = 84.0 bits (206), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/283 (22%), Positives = 112/283 (39%), Gaps = 62/283 (21%)

Query: 284 CVMRVFMVWNSPPWMYTVRYQRGLESLLFHHPNACVVVFSE--------TIELDFFKDSF 335
           C  + FM+W SP   +  R    +++L   +P AC+ + S         TI    F   F
Sbjct: 119 CSAQFFMIWLSPANSFGPREMLAIDTLFTTNPGACLAILSNSLDSPNGYTILKPLFDQGF 178

Query: 336 ------------------------VKDG------------------------YGGIYLDS 347
                                   +K G                        YGG+YLD+
Sbjct: 179 NLIAVTIDIPFLVKNTPAEAWLKRLKSGNMDPGSIPLFMNLSDLTRLAVLYKYGGVYLDT 238

Query: 348 DIIVWKPISFLNNSVGVEEHAPGAG--SALNGAVMAFAKHSLFIKECMEEFYTTYDDTNL 405
           DII    ++ L N++G +   P     + LN AVM F  +   ++E ++E+ TT+D    
Sbjct: 239 DIIFLNDMTGLRNAIGAQSSDPATKRWTRLNNAVMVFDIYHPLMREFLQEYATTFDGNKW 298

Query: 406 RWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSHIFFPITSQNITRYFIAPATETEKAQE 465
            +N   L++RV ++       +    +L +     F+P+    I + F  PAT  E    
Sbjct: 299 GYNSPYLVSRVIKRLGNKPGYN----NLTIFSPDAFYPVNWIKIQKLFKKPATTREAKWV 354

Query: 466 DVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYACIRC 508
           +  ++ + + S   H WN +T  +  E  S++  L++  C  C
Sbjct: 355 EKTVQDMNKGSYMIHLWNKVTRKIKIEEGSVMHTLVSTHCTVC 397


>AT2G38152.1 | Symbols:  | alpha 1,4-glycosyltransferase family
           protein | chr2:15984062-15985278 REVERSE LENGTH=380
          Length = 380

 Score = 83.2 bits (204), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 82/175 (46%), Gaps = 14/175 (8%)

Query: 340 YGGIYLDSDIIVWKPISFLNNSVGVEEHAPGAG---SALNGAVMAFAKHSLFIKECMEEF 396
           YGG+YLD+D IV +    L NS+G +    G     + LN AV+ F K    +   +EEF
Sbjct: 209 YGGVYLDTDFIVTRSFKGLKNSIGAQTVVEGDSKNWTRLNNAVLIFEKDHPLVYSFIEEF 268

Query: 397 YTTYDDTNLRWNGADLLTRV---ARKFMGDDNKSIKQLDLKVEPSHIFFPITSQNITRYF 453
            +T+D      NG  L+TRV   AR+ +GD        +  V P   F+P    +I R F
Sbjct: 269 ASTFDGNKWGHNGPYLVTRVAQRARETIGD--------NFTVLPPVAFYPFNWLDIPRLF 320

Query: 454 IAPATETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYACIRC 508
             P    +       L K+ +ES   H WN +T  L     S++  +++  C+ C
Sbjct: 321 QTPRGSNDSTLLKTDLVKLNRESYGLHLWNKITRKLKIGKGSVIDIIISDHCVVC 375


>AT5G01250.1 | Symbols:  | alpha 1,4-glycosyltransferase family
           protein | chr5:102370-103593 REVERSE LENGTH=407
          Length = 407

 Score = 81.3 bits (199), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 79/171 (46%), Gaps = 7/171 (4%)

Query: 340 YGGIYLDSDIIVWKPISFLNNSVGVEEHAPGAG--SALNGAVMAFAKHSLFIKECMEEFY 397
           YGG+YLD+D+IV K    L N +G +   P +   + LN AV+ F K+   + + MEEF 
Sbjct: 235 YGGVYLDTDMIVLKSFKGLRNVIGAQTLDPSSTNWTRLNNAVLIFDKNHPLLLKFMEEFA 294

Query: 398 TTYDDTNLRWNGADLLTRVARKFMGDDNKSIKQLDLKVEPSHIFFPITSQNITRYFIAPA 457
            T++     +NG  L++RVAR   G         +  V    +F+ +    I + F  P 
Sbjct: 295 KTFNGNIWGYNGPYLVSRVARAVEGSSG-----YNFTVMRPSVFYSVNWLEIKKLFKVPK 349

Query: 458 TETEKAQEDVLLKKIMQESLTFHFWNSLTSALIPEPDSLVTRLMNYACIRC 508
           TE +       L  + +     H WN  +     E  S + +L++  CI C
Sbjct: 350 TEKDSKWVKTKLLHMQRNGYGLHLWNKFSRKYEIEQGSAMWKLVSEHCIIC 400
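

The per-hit statistics above ("Score = ...", "Identities = ...") can be pulled out of the report programmatically. The following is a minimal Python sketch, assuming the legacy BLASTP 2.2.x text layout shown in this report; the function name `parse_hsp_stats` and the dictionary keys are hypothetical and not part of BLAST. Note that the percentages printed in parentheses appear to be truncated rather than rounded (247/550 is about 44.9%, reported as 44%), so the sketch recomputes the fraction from the raw counts.

```python
import re

# Regexes written against the legacy BLASTP 2.2.x text layout in this report.
# They are an assumption about the format, not an official grammar.
SCORE_RE = re.compile(r"Score =\s*([\d.]+) bits \((\d+)\), Expect = (\S+?),")
IDENT_RE = re.compile(r"Identities = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(report):
    """Return one dict per alignment header found in the report text."""
    hsps = []
    for line in report.splitlines():
        m = SCORE_RE.search(line)
        if m:
            evalue = m.group(3)
            # Legacy BLAST prints very small values as "e-121" (no leading digit).
            if evalue.startswith("e"):
                evalue = "1" + evalue
            hsps.append({
                "bits": float(m.group(1)),
                "raw": int(m.group(2)),
                "evalue": float(evalue),
            })
            continue
        m = IDENT_RE.search(line)
        if m and hsps:
            ident, length = int(m.group(1)), int(m.group(2))
            # Attach identity counts to the most recent Score line.
            hsps[-1].update(
                identities=ident,
                align_len=length,
                pct_identity=100.0 * ident / length,
            )
    return hsps


# Two header pairs copied from the report above.
sample = """\
 Score =  433 bits (1113), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 247/550 (44%), Positives = 329/550 (59%), Gaps = 96/550 (17%)
 Score = 92.8 bits (229), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 76/299 (25%), Positives = 123/299 (41%), Gaps = 62/299 (20%)
"""

for hsp in parse_hsp_stats(sample):
    print(hsp["bits"], hsp["evalue"], round(hsp["pct_identity"], 1))
```

For anything beyond a quick check, a maintained parser or a tabular BLAST output format is likely a safer choice than hand-written regexes.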