Miyakogusa Predicted Gene

Lj3g3v2906200.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v2906200.1 Non Characterized Hit- tr|I1KFD4|I1KFD4_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,75.2,0,coiled-coil,NULL; Galactosyl_T,Glycosyl transferase,
family 31; DUF4094,Domain of unknown function D,gene.g49923.t1.1
         (343 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G05170.2 | Symbols:  | Galactosyltransferase family protein |...   476   e-135
AT1G05170.1 | Symbols:  | Galactosyltransferase family protein |...   469   e-132
AT2G32430.1 | Symbols:  | Galactosyltransferase family protein |...   459   e-129
AT4G26940.1 | Symbols:  | Galactosyltransferase family protein |...   422   e-118
AT1G32930.1 | Symbols:  | Galactosyltransferase family protein |...   364   e-101
AT1G11730.1 | Symbols:  | Galactosyltransferase family protein |...   351   4e-97
AT1G77810.2 | Symbols:  | Galactosyltransferase family protein |...   347   5e-96
AT1G33430.1 | Symbols:  | Galactosyltransferase family protein |...   343   9e-95
AT1G77810.1 | Symbols:  | Galactosyltransferase family protein |...   338   3e-93
AT1G33430.2 | Symbols:  | Galactosyltransferase family protein |...   336   2e-92
AT1G22015.1 | Symbols: DD46 | Galactosyltransferase family prote...   317   6e-87
AT4G26940.2 | Symbols:  | Galactosyltransferase family protein |...   186   2e-47
AT5G53340.1 | Symbols:  | Galactosyltransferase family protein |...   156   2e-38
AT5G53340.2 | Symbols:  | Galactosyltransferase family protein |...   156   2e-38
AT4G32120.1 | Symbols:  | Galactosyltransferase family protein |...   155   3e-38
AT2G25300.1 | Symbols:  | Galactosyltransferase family protein |...   148   6e-36
AT2G26100.1 | Symbols:  | Galactosyltransferase family protein |...   117   1e-26
AT3G14960.1 | Symbols:  | Galactosyltransferase family protein |...   101   8e-22
AT1G53290.1 | Symbols:  | Galactosyltransferase family protein |...   100   3e-21

>AT1G05170.2 | Symbols:  | Galactosyltransferase family protein |
           chr1:1491460-1493931 REVERSE LENGTH=407
          Length = 407

 Score =  476 bits (1225), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 235/370 (63%), Positives = 276/370 (74%), Gaps = 54/370 (14%)

Query: 28  IWSVPESKGLARPTATEADQLNVVSEGCNSRVLQEMEMKRE----YSEDFKSHNSIQNLD 83
           +W++PESKG++ P+ TEA++L +VSEGCN + L + E+KR+    + E   +H ++Q LD
Sbjct: 38  MWNIPESKGMSHPSVTEAERLKLVSEGCNPKALYQKEVKRDPQALFGEVANTHIALQTLD 97

Query: 84  KTISNLEMELAAARATQESVRSGAPVPED------------------------------I 113
           KTIS+LEMELAAAR+ QES+++GAP+ +D                              I
Sbjct: 98  KTISSLEMELAAARSVQESLQNGAPLSDDMGKKQPQEQRRFLMVVGINTAFSSRKRRDSI 157

Query: 114 RISDQSPXXXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGYL 173
           R +               GII RFVIGHSAT+GGILDRAIEAEDRKHGDFLRL+HVEGYL
Sbjct: 158 RATWMPQGEKRKRLEEEKGIIIRFVIGHSATTGGILDRAIEAEDRKHGDFLRLDHVEGYL 217

Query: 174 ELSAKTKTYFATAVNLWDADFYVKVDDDVHVNIDS--------------YM------PTL 213
           ELS KTKTYF+TA ++WDADFYVKVDDDVHVNI +              Y+      P L
Sbjct: 218 ELSGKTKTYFSTAFSMWDADFYVKVDDDVHVNIATLGETLVRHRKKPRVYIGCMKSGPVL 277

Query: 214 FHRGVRYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGS 273
             +GVRYHEPEYWKFGE+GNKYFRHATGQLYAIS DLA+YIS+NQ+VLHKYANEDVSLG+
Sbjct: 278 SQKGVRYHEPEYWKFGENGNKYFRHATGQLYAISRDLASYISINQHVLHKYANEDVSLGA 337

Query: 274 WFIGLDVEHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGE 333
           WFIG+DV+HIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGE
Sbjct: 338 WFIGIDVKHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGE 397

Query: 334 GENALWSASF 343
           GE ALWSA+F
Sbjct: 398 GEKALWSATF 407


>AT1G05170.1 | Symbols:  | Galactosyltransferase family protein |
           chr1:1491460-1493931 REVERSE LENGTH=404
          Length = 404

 Score =  469 bits (1207), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 234/370 (63%), Positives = 274/370 (74%), Gaps = 57/370 (15%)

Query: 28  IWSVPESKGLARPTATEADQLNVVSEGCNSRVLQEMEMKRE----YSEDFKSHNSIQNLD 83
           +W++PESKG++ P+ TEA++L +VSEGCN +     E+KR+    + E   +H ++Q LD
Sbjct: 38  MWNIPESKGMSHPSVTEAERLKLVSEGCNPKA---KEVKRDPQALFGEVANTHIALQTLD 94

Query: 84  KTISNLEMELAAARATQESVRSGAPVPED------------------------------I 113
           KTIS+LEMELAAAR+ QES+++GAP+ +D                              I
Sbjct: 95  KTISSLEMELAAARSVQESLQNGAPLSDDMGKKQPQEQRRFLMVVGINTAFSSRKRRDSI 154

Query: 114 RISDQSPXXXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGYL 173
           R +               GII RFVIGHSAT+GGILDRAIEAEDRKHGDFLRL+HVEGYL
Sbjct: 155 RATWMPQGEKRKRLEEEKGIIIRFVIGHSATTGGILDRAIEAEDRKHGDFLRLDHVEGYL 214

Query: 174 ELSAKTKTYFATAVNLWDADFYVKVDDDVHVNIDS--------------YM------PTL 213
           ELS KTKTYF+TA ++WDADFYVKVDDDVHVNI +              Y+      P L
Sbjct: 215 ELSGKTKTYFSTAFSMWDADFYVKVDDDVHVNIATLGETLVRHRKKPRVYIGCMKSGPVL 274

Query: 214 FHRGVRYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGS 273
             +GVRYHEPEYWKFGE+GNKYFRHATGQLYAIS DLA+YIS+NQ+VLHKYANEDVSLG+
Sbjct: 275 SQKGVRYHEPEYWKFGENGNKYFRHATGQLYAISRDLASYISINQHVLHKYANEDVSLGA 334

Query: 274 WFIGLDVEHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGE 333
           WFIG+DV+HIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGE
Sbjct: 335 WFIGIDVKHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGE 394

Query: 334 GENALWSASF 343
           GE ALWSA+F
Sbjct: 395 GEKALWSATF 404


>AT2G32430.1 | Symbols:  | Galactosyltransferase family protein |
           chr2:13771626-13774102 FORWARD LENGTH=409
          Length = 409

 Score =  459 bits (1180), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 228/371 (61%), Positives = 271/371 (73%), Gaps = 55/371 (14%)

Query: 28  IWSVPESKGLARPT-ATEADQLNVVSEGCNSRVLQEMEMKRE----YSEDFKSHNSIQNL 82
           +W +PESK + RP+ +TEA++L ++SEGC+ + L + E+ R+    + E  K+HN+IQ L
Sbjct: 39  MWIIPESKDMPRPSVSTEAERLKLISEGCDPKTLYQKEVNRDPQALFGEVSKTHNAIQTL 98

Query: 83  DKTISNLEMELAAARATQESVRSGAPVPED------------------------------ 112
           DKTIS+LEMELAAAR+ QES+ +GAP+  D                              
Sbjct: 99  DKTISSLEMELAAARSAQESLVNGAPISNDMEKKQLPGKRRYLMVVGINTAFSSRKRRDS 158

Query: 113 IRISDQSPXXXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGY 172
           +R +               GII RFVIGHSAT+GGILDR+IEAED+KHGDFLRL+HVEGY
Sbjct: 159 VRTTWMPSGEKRKKLEEEKGIIIRFVIGHSATAGGILDRSIEAEDKKHGDFLRLDHVEGY 218

Query: 173 LELSAKTKTYFATAVNLWDADFYVKVDDDVHVNIDS--------------YM------PT 212
           LELS KTKTYF+TAV+ WDA+FYVKVDDDVHVNI +              Y+      P 
Sbjct: 219 LELSGKTKTYFSTAVSKWDAEFYVKVDDDVHVNIATLGETLVRHRKKHRVYLGCMKSGPV 278

Query: 213 LFHRGVRYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLG 272
           L  +GVRYHEPEYWKFGE+GNKYFRHATGQLYAIS DLA+YIS+NQ+VLHKYANEDV+LG
Sbjct: 279 LSQKGVRYHEPEYWKFGENGNKYFRHATGQLYAISRDLASYISLNQHVLHKYANEDVTLG 338

Query: 273 SWFIGLDVEHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCG 332
           +WFIGLDV HIDDRRLCCGTPPDCEWKAQAGNICVASFDW+CSGICRSADRIKEVH+RCG
Sbjct: 339 AWFIGLDVTHIDDRRLCCGTPPDCEWKAQAGNICVASFDWTCSGICRSADRIKEVHKRCG 398

Query: 333 EGENALWSASF 343
           E ENA+W A F
Sbjct: 399 EPENAIWKARF 409


>AT4G26940.1 | Symbols:  | Galactosyltransferase family protein |
           chr4:13529911-13532387 REVERSE LENGTH=407
          Length = 407

 Score =  422 bits (1084), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 218/370 (58%), Positives = 255/370 (68%), Gaps = 58/370 (15%)

Query: 28  IWSVPESKGLARPTATEADQLNVVSEGCNSRVLQEMEMKRE----YSEDFKSHNSIQNLD 83
           +W  PES  ++R T    ++L + SE C+S    +  +KRE      + +KS ++IQ LD
Sbjct: 42  MWPEPESNVVSRDTVASDERLRLESEDCDS---SKKGLKRESKDILGDVYKSPDAIQTLD 98

Query: 84  KTISNLEMELAAARATQESVRSGAPVPEDIRISD-------------------------- 117
           KTIS LE ELA ARA QES+ +G+PV +D ++ +                          
Sbjct: 99  KTISKLETELADARAAQESIMNGSPVSDDFKLPETVTKRKYLMVVGVNTAFSSRKRRDSV 158

Query: 118 ----QSPXXXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGYL 173
                 P           GI+ RFVIGHS+T GGILDRAI+AE+ KHGDFLRL+HVEGYL
Sbjct: 159 RATWMPPGEERKKLEEEKGIVMRFVIGHSSTPGGILDRAIQAEESKHGDFLRLDHVEGYL 218

Query: 174 ELSAKTKTYFATAVNLWDADFYVKVDDDVHVNIDS--------------YM------PTL 213
           ELSAKTKTYF TA  +WDADFYVKVDDDVHVNI +              Y+      P L
Sbjct: 219 ELSAKTKTYFTTAFAMWDADFYVKVDDDVHVNIATLGAELARYRMKPRVYIGCMKSGPVL 278

Query: 214 FHRGVRYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGS 273
             +GVRYHEPEYWKFGE GNKYFRHATGQLYAIS +LA+YIS+NQNVLHKY NEDVSLGS
Sbjct: 279 AQKGVRYHEPEYWKFGEEGNKYFRHATGQLYAISRELASYISINQNVLHKYVNEDVSLGS 338

Query: 274 WFIGLDVEHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGE 333
           WF+GLDVEH+DDRRLCCGT  DCEWKAQAGNICVASFDWSCSGICRSADR+K+VHRRCGE
Sbjct: 339 WFLGLDVEHVDDRRLCCGT-TDCEWKAQAGNICVASFDWSCSGICRSADRMKDVHRRCGE 397

Query: 334 GENALWSASF 343
           GE AL +ASF
Sbjct: 398 GEKALLAASF 407


>AT1G32930.1 | Symbols:  | Galactosyltransferase family protein |
           chr1:11931980-11934399 REVERSE LENGTH=399
          Length = 399

 Score =  364 bits (935), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 190/359 (52%), Positives = 228/359 (63%), Gaps = 56/359 (15%)

Query: 36  GLARPTATEADQLNVVSEGCNSRVLQEMEMKREYSEDFKSHNSIQNLDKTISNLEMELAA 95
           G+ R +  + DQ    S   N  V  E +     S    +H+ I+ LDKTIS+LE+ELA 
Sbjct: 46  GIERASPEQNDQ----SRSLNPLVDCESKEGDILSRVSHTHDVIKTLDKTISSLEVELAT 101

Query: 96  ARATQESVRSGAPVPEDIRISDQSPXX-------------------------------XX 124
           ARA +   R G+P      ++DQS                                    
Sbjct: 102 ARAARSDGRDGSPAVAKT-VADQSKIRPRMFFVMGIMTAFSSRKRRDSIRGTWLPKGDEL 160

Query: 125 XXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGYLELSAKTKTYFA 184
                  GII RFVIGHS++ GG+LD  IEAE+ +H DF RLNH+EGY ELS+KT+ YF+
Sbjct: 161 KRLETEKGIIMRFVIGHSSSPGGVLDHTIEAEEEQHKDFFRLNHIEGYHELSSKTQIYFS 220

Query: 185 TAVNLWDADFYVKVDDDVHVNIDS--------------YM------PTLFHRGVRYHEPE 224
           +AV  WDADFY+KVDDDVHVN+                Y+      P L  +GV+YHEPE
Sbjct: 221 SAVAKWDADFYIKVDDDVHVNLGMLGSTLARHRSKPRVYIGCMKSGPVLAQKGVKYHEPE 280

Query: 225 YWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGSWFIGLDVEHID 284
           YWKFGE GNKYFRHATGQ+YAIS DLATYIS+N+ +LHKYANEDVSLGSWFIGLDVEHID
Sbjct: 281 YWKFGEEGNKYFRHATGQIYAISKDLATYISVNRQLLHKYANEDVSLGSWFIGLDVEHID 340

Query: 285 DRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGEGENALWSASF 343
           DR LCCGTP DCEWK QAGN C ASFDWSCSGIC+S DR+ EVH+RCGEG+ A+W +SF
Sbjct: 341 DRSLCCGTPLDCEWKGQAGNPCAASFDWSCSGICKSVDRMLEVHQRCGEGDGAIWHSSF 399


>AT1G11730.1 | Symbols:  | Galactosyltransferase family protein |
           chr1:3957473-3960113 FORWARD LENGTH=384
          Length = 384

 Score =  351 bits (901), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 173/311 (55%), Positives = 211/311 (67%), Gaps = 44/311 (14%)

Query: 76  HNSIQNLDKTISNLEMELAAARATQESVRSGAPVPED----------------------- 112
           +N+I  LDK+ISNLEM+L AARA +ES+     +  +                       
Sbjct: 73  NNTIGILDKSISNLEMKLVAARAERESLSGKFNISNEAKKRKYFMVIGINTAFSSRKRRD 132

Query: 113 -IRISDQSPXXXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEG 171
            +R +               GII RFVIGHS  S GILD+AIEAE++ HGDFLRL H EG
Sbjct: 133 SVRSTWMPQGENLKKLEEEKGIIVRFVIGHSVLSHGILDKAIEAEEKTHGDFLRLEHTEG 192

Query: 172 YLELSAKTKTYFATAVNLWDADFYVKVDDDVHVNIDSYM--------------------P 211
           Y++LSAKTKT+FATAV+LWDA+FY+KVDDDVHVN+ S                      P
Sbjct: 193 YMKLSAKTKTFFATAVSLWDAEFYIKVDDDVHVNLASLKKALSAHQNKPRVYVGCMKSGP 252

Query: 212 TLFHRGVRYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSL 271
            L  + V+YHEPEYWKFGE GNKYFRHATGQ YAIS DLATYI +NQ++LHKYANEDVSL
Sbjct: 253 VLARKSVKYHEPEYWKFGEVGNKYFRHATGQFYAISKDLATYILINQDLLHKYANEDVSL 312

Query: 272 GSWFIGLDVEHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRC 331
           GSWFIGL+VEH+D++RLCC T  DCE KA  G++C ASFDW CSGICRSA+R+ +VH RC
Sbjct: 313 GSWFIGLNVEHVDEKRLCCSTSQDCELKAMMGHVCAASFDWKCSGICRSAERMADVHERC 372

Query: 332 GEGENALWSAS 342
           GE +NALW+++
Sbjct: 373 GEPQNALWTSN 383


>AT1G77810.2 | Symbols:  | Galactosyltransferase family protein |
           chr1:29260899-29263001 REVERSE LENGTH=384
          Length = 384

 Score =  347 bits (891), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 174/337 (51%), Positives = 219/337 (64%), Gaps = 49/337 (14%)

Query: 47  QLNVVSEGC--NSRVLQEMEMKREYSEDFKSHNSIQNLDKTISNLEMELAAARATQESVR 104
           +L +VS+ C  N +  QE ++  E     ++H +IQ+LDK++S L    ++ R++QE V 
Sbjct: 53  ELQIVSDDCAHNKKATQEKDVTGEV---LRTHEAIQSLDKSVSTL----SSTRSSQEMVD 105

Query: 105 SGAPVP--------------------EDIRISDQSPXXXXXXXXXXXGIIFRFVIGHSAT 144
                P                    + +R +               GI+ +F+IGHSAT
Sbjct: 106 GSETNPRKKVFMVMGINTAFSSRKRRDSVRETWMPQGEKLERLEQEKGIVIKFMIGHSAT 165

Query: 145 SGGILDRAIEAEDRKHGDFLRLNHVEGYLELSAKTKTYFATAVNLWDADFYVKVDDDVHV 204
           S  ILDRAI++ED +H DFLRL HVEGY ELSAKTK +F+TAV  WDA+FY+KVDDDVHV
Sbjct: 166 SNSILDRAIDSEDAQHKDFLRLEHVEGYHELSAKTKIFFSTAVAKWDAEFYIKVDDDVHV 225

Query: 205 NIDSYM--------------------PTLFHRGVRYHEPEYWKFGESGNKYFRHATGQLY 244
           N+                        P L  + V+YHEPEYWKFGE GNKYFRHATGQ+Y
Sbjct: 226 NLGMLASTLARHRSKPRVYIGCMKSGPVLAQKTVKYHEPEYWKFGEDGNKYFRHATGQIY 285

Query: 245 AISNDLATYISMNQNVLHKYANEDVSLGSWFIGLDVEHIDDRRLCCGTPPDCEWKAQAGN 304
           AIS DLA YIS+NQ +LHKYANEDVSLGSWFIGL+VEHIDDR  CCGTPPDC WKA+AG+
Sbjct: 286 AISKDLANYISINQPILHKYANEDVSLGSWFIGLEVEHIDDRNFCCGTPPDCRWKAEAGD 345

Query: 305 ICVASFDWSCSGICRSADRIKEVHRRCGEGENALWSA 341
           +CVASF+WSCSGIC+S +R+K VH  C EGE A+W+ 
Sbjct: 346 VCVASFEWSCSGICKSVERMKIVHEVCSEGEGAVWNT 382


>AT1G33430.1 | Symbols:  | Galactosyltransferase family protein |
           chr1:12124438-12126052 REVERSE LENGTH=395
          Length = 395

 Score =  343 bits (880), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 166/334 (49%), Positives = 214/334 (64%), Gaps = 46/334 (13%)

Query: 56  NSRVLQEMEMKREYSEDFKSHNSIQNLDKTISNLEMELAAARATQESVR----------- 104
           + R L E + +    E  ++H ++++L++T+S LEMELAAAR +  S             
Sbjct: 60  HKRKLIESKSRDIIGEVSRTHQAVKSLERTMSTLEMELAAARTSDRSSEFWSERSAKNQS 119

Query: 105 ---------------SGAPVPEDIRISDQSPXXXXXXXXXXXGIIFRFVIGHSATSGGIL 149
                          S     + +R +               GI+ RFVIGHSAT GG+L
Sbjct: 120 RLQKVFAVIGINTAFSSKKRRDSVRQTWMPTGEKLKKIEKEKGIVVRFVIGHSATPGGVL 179

Query: 150 DRAIEAEDRKHGDFLRLNHVEGYLELSAKTKTYFATAVNLWDADFYVKVDDDVHVNIDSY 209
           D+AI+ ED +H DFLRL H+EGY +LS KT+ YF+TA  ++DA+FYVKVDDDVHVN+   
Sbjct: 180 DKAIDEEDSEHKDFLRLKHIEGYHQLSTKTRLYFSTATAMYDAEFYVKVDDDVHVNLGML 239

Query: 210 M--------------------PTLFHRGVRYHEPEYWKFGESGNKYFRHATGQLYAISND 249
           +                    P L  +GV+YHEPE+WKFGE GNKYFRHATGQ+YAIS D
Sbjct: 240 VTTLARYQSRPRIYIGCMKSGPVLSQKGVKYHEPEFWKFGEEGNKYFRHATGQIYAISKD 299

Query: 250 LATYISMNQNVLHKYANEDVSLGSWFIGLDVEHIDDRRLCCGTPPDCEWKAQAGNICVAS 309
           LATYIS NQ +LH+YANEDVSLG+W +GL+VEH+D+R +CCGTPPDC+WKAQAGN+C AS
Sbjct: 300 LATYISTNQGILHRYANEDVSLGAWMLGLEVEHVDERSMCCGTPPDCQWKAQAGNVCAAS 359

Query: 310 FDWSCSGICRSADRIKEVHRRCGEGENALWSASF 343
           FDWSCSGIC+S DR+  VHR C EG+  L +  F
Sbjct: 360 FDWSCSGICKSVDRMARVHRACAEGDTPLANFRF 393


>AT1G77810.1 | Symbols:  | Galactosyltransferase family protein |
           chr1:29260899-29263001 REVERSE LENGTH=393
          Length = 393

 Score =  338 bits (867), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 175/346 (50%), Positives = 219/346 (63%), Gaps = 58/346 (16%)

Query: 47  QLNVVSEGC--NSRVLQEMEMKREYSEDFKSHNSIQN---LDKTISNLEMELAAARATQE 101
           +L +VS+ C  N +  QE ++  E     ++H +IQ+   LDK++S L    ++ R++QE
Sbjct: 53  ELQIVSDDCAHNKKATQEKDVTGEV---LRTHEAIQDDRSLDKSVSTL----SSTRSSQE 105

Query: 102 SVRSGAPVP--------------------EDIRISDQSPXXXXXXXXXXXGIIFRFVIGH 141
            V      P                    + +R +               GI+ +F+IGH
Sbjct: 106 MVDGSETNPRKKVFMVMGINTAFSSRKRRDSVRETWMPQGEKLERLEQEKGIVIKFMIGH 165

Query: 142 SATSGGILDRAIEAEDRKHGDFLRLNHVEGYLELSAKTKTYFATAVNLWDADFYVKVDDD 201
           SATS  ILDRAI++ED +H DFLRL HVEGY ELSAKTK +F+TAV  WDA+FY+KVDDD
Sbjct: 166 SATSNSILDRAIDSEDAQHKDFLRLEHVEGYHELSAKTKIFFSTAVAKWDAEFYIKVDDD 225

Query: 202 VHVNIDSYMPTLFH--------------------------RGVRYHEPEYWKFGESGNKY 235
           VHVN+     TL                            R V+YHEPEYWKFGE GNKY
Sbjct: 226 VHVNLGMLASTLARHRSKPRVYIGCMKSGPVLAQNLLNCFRTVKYHEPEYWKFGEDGNKY 285

Query: 236 FRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGSWFIGLDVEHIDDRRLCCGTPPD 295
           FRHATGQ+YAIS DLA YIS+NQ +LHKYANEDVSLGSWFIGL+VEHIDDR  CCGTPPD
Sbjct: 286 FRHATGQIYAISKDLANYISINQPILHKYANEDVSLGSWFIGLEVEHIDDRNFCCGTPPD 345

Query: 296 CEWKAQAGNICVASFDWSCSGICRSADRIKEVHRRCGEGENALWSA 341
           C WKA+AG++CVASF+WSCSGIC+S +R+K VH  C EGE A+W+ 
Sbjct: 346 CRWKAEAGDVCVASFEWSCSGICKSVERMKIVHEVCSEGEGAVWNT 391


>AT1G33430.2 | Symbols:  | Galactosyltransferase family protein |
           chr1:12124438-12126052 REVERSE LENGTH=403
          Length = 403

 Score =  336 bits (861), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 166/342 (48%), Positives = 214/342 (62%), Gaps = 54/342 (15%)

Query: 56  NSRVLQEMEMKREYSEDFKSHNSIQNLDKTISNLEMELAAARATQESVR----------- 104
           + R L E + +    E  ++H ++++L++T+S LEMELAAAR +  S             
Sbjct: 60  HKRKLIESKSRDIIGEVSRTHQAVKSLERTMSTLEMELAAARTSDRSSEFWSERSAKNQS 119

Query: 105 ---------------SGAPVPEDIRISDQSPXXXXXXXXXXXGIIFR--------FVIGH 141
                          S     + +R +               GI+ R        FVIGH
Sbjct: 120 RLQKVFAVIGINTAFSSKKRRDSVRQTWMPTGEKLKKIEKEKGIVVRKFGFLFDRFVIGH 179

Query: 142 SATSGGILDRAIEAEDRKHGDFLRLNHVEGYLELSAKTKTYFATAVNLWDADFYVKVDDD 201
           SAT GG+LD+AI+ ED +H DFLRL H+EGY +LS KT+ YF+TA  ++DA+FYVKVDDD
Sbjct: 180 SATPGGVLDKAIDEEDSEHKDFLRLKHIEGYHQLSTKTRLYFSTATAMYDAEFYVKVDDD 239

Query: 202 VHVNIDSYM--------------------PTLFHRGVRYHEPEYWKFGESGNKYFRHATG 241
           VHVN+   +                    P L  +GV+YHEPE+WKFGE GNKYFRHATG
Sbjct: 240 VHVNLGMLVTTLARYQSRPRIYIGCMKSGPVLSQKGVKYHEPEFWKFGEEGNKYFRHATG 299

Query: 242 QLYAISNDLATYISMNQNVLHKYANEDVSLGSWFIGLDVEHIDDRRLCCGTPPDCEWKAQ 301
           Q+YAIS DLATYIS NQ +LH+YANEDVSLG+W +GL+VEH+D+R +CCGTPPDC+WKAQ
Sbjct: 300 QIYAISKDLATYISTNQGILHRYANEDVSLGAWMLGLEVEHVDERSMCCGTPPDCQWKAQ 359

Query: 302 AGNICVASFDWSCSGICRSADRIKEVHRRCGEGENALWSASF 343
           AGN+C ASFDWSCSGIC+S DR+  VHR C EG+  L +  F
Sbjct: 360 AGNVCAASFDWSCSGICKSVDRMARVHRACAEGDTPLANFRF 401


>AT1G22015.1 | Symbols: DD46 | Galactosyltransferase family protein
           | chr1:7751225-7753425 REVERSE LENGTH=398
          Length = 398

 Score =  317 bits (813), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 153/319 (47%), Positives = 201/319 (63%), Gaps = 46/319 (14%)

Query: 71  EDFKSHNSIQNLDKTISNLEMELAAARATQESVRSGAPVP-------------------- 110
           E  K+H +I++LDK++S L+ +L+A  + Q+ V   A                       
Sbjct: 77  EVLKTHKAIESLDKSVSMLQKQLSATHSPQQIVNVSATNSSTEGNQKNKVFMVIGINTAF 136

Query: 111 ------EDIRISDQSPXXXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFL 164
                 + +R +               GI+ +F+IGHS+T   +LD+ I++ED ++ DF 
Sbjct: 137 SSRKRRDSLRETWMPQGEKLEKLEKEKGIVVKFMIGHSSTPNSMLDKEIDSEDAQYNDFF 196

Query: 165 RLNHVEGYLELSAKTKTYFATAVNLWDADFYVKVDDDVHVNIDSYM-------------- 210
           RL+HVEGY  LSAKTK++F++AV  WDA+FYVK+DDDVHVN+ +                
Sbjct: 197 RLDHVEGYYNLSAKTKSFFSSAVAKWDAEFYVKIDDDVHVNLGTLASTLASHRSKPRVYI 256

Query: 211 ------PTLFHRGVRYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKY 264
                 P L  +  +Y EPE+WKFGE GNKYFRHATGQ+YAIS DLATYIS NQ +LHKY
Sbjct: 257 GCMKSGPVLTKKTAKYREPEFWKFGEEGNKYFRHATGQIYAISKDLATYISNNQPILHKY 316

Query: 265 ANEDVSLGSWFIGLDVEHIDDRRLCCGTPPDCEWKAQAGNICVASFDWSCSGICRSADRI 324
           ANEDV+LGSWFIGL+VE IDDR  CCGTPPDCE +A+AG +CVA+FDW CSG+CRS DR+
Sbjct: 317 ANEDVTLGSWFIGLEVEQIDDRNFCCGTPPDCEMRAEAGEMCVATFDWKCSGVCRSVDRM 376

Query: 325 KEVHRRCGEGENALWSASF 343
             VH  CGEG  A+W A+ 
Sbjct: 377 WMVHVMCGEGSKAVWDANL 395


>AT4G26940.2 | Symbols:  | Galactosyltransferase family protein |
           chr4:13530223-13532387 REVERSE LENGTH=306
          Length = 306

 Score =  186 bits (472), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 103/213 (48%), Positives = 129/213 (60%), Gaps = 37/213 (17%)

Query: 28  IWSVPESKGLARPTATEADQLNVVSEGCNSRVLQEMEMKRE----YSEDFKSHNSIQNLD 83
           +W  PES  ++R T    ++L + SE C+S    +  +KRE      + +KS ++IQ LD
Sbjct: 42  MWPEPESNVVSRDTVASDERLRLESEDCDS---SKKGLKRESKDILGDVYKSPDAIQTLD 98

Query: 84  KTISNLEMELAAARATQESVRSGAPVPEDIRISD-------------------------- 117
           KTIS LE ELA ARA QES+ +G+PV +D ++ +                          
Sbjct: 99  KTISKLETELADARAAQESIMNGSPVSDDFKLPETVTKRKYLMVVGVNTAFSSRKRRDSV 158

Query: 118 ----QSPXXXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGYL 173
                 P           GI+ RFVIGHS+T GGILDRAI+AE+ KHGDFLRL+HVEGYL
Sbjct: 159 RATWMPPGEERKKLEEEKGIVMRFVIGHSSTPGGILDRAIQAEESKHGDFLRLDHVEGYL 218

Query: 174 ELSAKTKTYFATAVNLWDADFYVKVDDDVHVNI 206
           ELSAKTKTYF TA  +WDADFYVKVDDDVHVNI
Sbjct: 219 ELSAKTKTYFTTAFAMWDADFYVKVDDDVHVNI 251


>AT5G53340.1 | Symbols:  | Galactosyltransferase family protein |
           chr5:21641045-21643195 REVERSE LENGTH=338
          Length = 338

 Score =  156 bits (394), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 96/270 (35%), Positives = 143/270 (52%), Gaps = 57/270 (21%)

Query: 84  KTISNLEMELAAARATQESVRSGAPVPEDIRISDQSPX---------------------- 121
           KT++ LEMEL++AR  QE   S +P   D   + + P                       
Sbjct: 76  KTLAALEMELSSAR--QEGFVSKSPKLADGTETKKRPLVVIGIMTSLGNKKKRDAVRQAW 133

Query: 122 ----XXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHV-EGYLELS 176
                         G+I RFVIG SA  G  +D++I+ E+ +  DF+ L+ V E   E S
Sbjct: 134 MGTGASLKKLESEKGVIARFVIGRSANKGDSMDKSIDTENSQTDDFIILDDVVEAPEEAS 193

Query: 177 AKTKTYFATAVNLWDADFYVKVDDDVHVNIDSYMPTLF--------HRGV---------- 218
            K K +FA A + WDA FY K  D+++VNID+   TL         + G           
Sbjct: 194 KKVKLFFAYAADRWDAQFYAKAIDNIYVNIDALGTTLAAHLENPRAYIGCMKSGEVFSEP 253

Query: 219 --RYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGSWFI 276
             +++EPE+WKFG+    YFRHA G++Y I++ LA ++S+N+++LH YA++DVS GSWF+
Sbjct: 254 NHKWYEPEWWKFGDK-KAYFRHAYGEMYVITHALARFVSINRDILHSYAHDDVSTGSWFV 312

Query: 277 GLDVEHIDDRRLCCGTPPDCEWKAQAGNIC 306
           GLDV+H+D+ + CC       W ++A  IC
Sbjct: 313 GLDVKHVDEGKFCCSA-----WSSEA--IC 335


>AT5G53340.2 | Symbols:  | Galactosyltransferase family protein |
           chr5:21641045-21643195 REVERSE LENGTH=337
          Length = 337

 Score =  156 bits (394), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 92/254 (36%), Positives = 137/254 (53%), Gaps = 50/254 (19%)

Query: 84  KTISNLEMELAAARATQESVRSGAPVPEDIRISDQSPX---------------------- 121
           KT++ LEMEL++AR  QE   S +P   D   + + P                       
Sbjct: 76  KTLAALEMELSSAR--QEGFVSKSPKLADGTETKKRPLVVIGIMTSLGNKKKRDAVRQAW 133

Query: 122 ----XXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHV-EGYLELS 176
                         G+I RFVIG SA  G  +D++I+ E+ +  DF+ L+ V E   E S
Sbjct: 134 MGTGASLKKLESEKGVIARFVIGRSANKGDSMDKSIDTENSQTDDFIILDDVVEAPEEAS 193

Query: 177 AKTKTYFATAVNLWDADFYVKVDDDVHVNIDSYMPTLF--------HRGV---------- 218
            K K +FA A + WDA FY K  D+++VNID+   TL         + G           
Sbjct: 194 KKVKLFFAYAADRWDAQFYAKAIDNIYVNIDALGTTLAAHLENPRAYIGCMKSGEVFSEP 253

Query: 219 --RYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGSWFI 276
             +++EPE+WKFG+    YFRHA G++Y I++ LA ++S+N+++LH YA++DVS GSWF+
Sbjct: 254 NHKWYEPEWWKFGDK-KAYFRHAYGEMYVITHALARFVSINRDILHSYAHDDVSTGSWFV 312

Query: 277 GLDVEHIDDRRLCC 290
           GLDV+H+D+ + CC
Sbjct: 313 GLDVKHVDEGKFCC 326


>AT4G32120.1 | Symbols:  | Galactosyltransferase family protein |
           chr4:15517230-15519687 REVERSE LENGTH=345
          Length = 345

 Score =  155 bits (393), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 92/254 (36%), Positives = 133/254 (52%), Gaps = 42/254 (16%)

Query: 80  QNLDKTISNLEMELAAARA-----TQESVRS---------------GAPVPEDIRISDQS 119
           ++L++ I   EMELA A++      Q+SV S               G+ +  +       
Sbjct: 83  KDLERRIVETEMELAQAKSQGYLKKQKSVSSSGKKMLAVIGVYTGFGSHLKRNKFRGSWM 142

Query: 120 PXXXXXXXXXXXGIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRL-NHVEGYLELSAK 178
           P           G++ RFVIG SA  G  LDR I+ E+R   DFL L NH E   EL  K
Sbjct: 143 PRDDALKKLEERGVVIRFVIGRSANRGDSLDRKIDEENRATKDFLILENHEEAQEELPKK 202

Query: 179 TKTYFATAVNLWDADFYVKVDDDVHVNIDSYMPTLFHR--------------------GV 218
            K +++ AV  WDA+FYVKVDD+V ++++  +  L  R                    G 
Sbjct: 203 VKFFYSAAVQNWDAEFYVKVDDNVDLDLEGMIALLESRRSQDGAYIGCMKSGDVITEEGS 262

Query: 219 RYHEPEYWKFGESGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGSWFIGL 278
           +++EPE+WKFG+    YFRHATG L  +S +LA Y+++N  +L  YA +D ++GSW IG+
Sbjct: 263 QWYEPEWWKFGDD-KSYFRHATGSLVILSKNLAQYVNINSGLLKTYAFDDTTIGSWMIGV 321

Query: 279 DVEHIDDRRLCCGT 292
              +IDD RLCC +
Sbjct: 322 QATYIDDNRLCCSS 335


>AT2G25300.1 | Symbols:  | Galactosyltransferase family protein |
           chr2:10771922-10774156 REVERSE LENGTH=346
          Length = 346

 Score =  148 bits (373), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 76/182 (41%), Positives = 106/182 (58%), Gaps = 22/182 (12%)

Query: 132 GIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRL-NHVEGYLELSAKTKTYFATAVNLW 190
           GI+ RFVIG S   G  LDR I+ E++   DFL L NH E   EL+ K K +F+ AV  W
Sbjct: 156 GIVIRFVIGRSPNRGDSLDRKIDEENQARKDFLILENHEEAQEELAKKVKFFFSAAVQNW 215

Query: 191 DADFYVKVDDDVHVNIDSYMPTLFHR--------------------GVRYHEPEYWKFGE 230
           DA+FY+KVDD++ ++++  +  L  R                    G +++EPE+WKFG+
Sbjct: 216 DAEFYIKVDDNIDLDLEGLIGLLESRRGQDAAYIGCMKSGEVVAEEGGKWYEPEWWKFGD 275

Query: 231 SGNKYFRHATGQLYAISNDLATYISMNQNVLHKYANEDVSLGSWFIGLDVEHIDDRRLCC 290
               YFRHA G L  +S  LA Y+++N   L  YA +D S+GSW IG+   +IDD RLCC
Sbjct: 276 E-KSYFRHAAGSLLILSKTLAQYVNINSGSLKTYAFDDTSIGSWMIGVQATYIDDNRLCC 334

Query: 291 GT 292
            +
Sbjct: 335 SS 336


>AT2G26100.1 | Symbols:  | Galactosyltransferase family protein |
           chr2:11116212-11118129 REVERSE LENGTH=371
          Length = 371

 Score =  117 bits (292), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 116/217 (53%), Gaps = 30/217 (13%)

Query: 132 GIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGYLELSAKTKTYFATAVNLWD 191
           G+ FRFVIG S  +  + +  +E E +++ DF+ L+  E Y+ L  KT  +F  A  L++
Sbjct: 149 GLAFRFVIGKSKDAKKMAE--LEKEIKEYRDFVLLDTEEEYIRLPYKTLAFFKAAFKLFE 206

Query: 192 ADFYVKVDDDVHVNIDSYMPTL-------------FHRGVRYHEPE---YWKFGES-GNK 234
           AD+YVK DDD+++  D     L               +G    +P+   Y K G   GN+
Sbjct: 207 ADYYVKADDDIYLRPDRLATLLANERLHSQTYIGCMKKGPVITDPKLKWYEKQGNLIGNE 266

Query: 235 YFRHATGQLYAISNDLATYISMNQN-VLHKYANEDVSLGSWFIGLDVEHIDDRRLCCGTP 293
           YF HA G +Y +S ++   ++  +N  L  + NEDV++GSW + +DV H D+R LC    
Sbjct: 267 YFLHAYGPIYVLSAEIVASLAAARNGSLRMFNNEDVTIGSWMLAMDVHHEDNRALC---D 323

Query: 294 PDCEWKAQAGNICVASFDWS-CSGICRSADRIKEVHR 329
           P C  K+      +A +D   CSG+C    R+KE+H+
Sbjct: 324 PHCSPKS------IAVWDIPKCSGLCDPESRLKELHK 354


>AT3G14960.1 | Symbols:  | Galactosyltransferase family protein |
           chr3:5036252-5037951 REVERSE LENGTH=343
          Length = 343

 Score =  101 bits (251), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 63/219 (28%), Positives = 111/219 (50%), Gaps = 36/219 (16%)

Query: 132 GIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGYLELSAKTKTYFATAVNLWD 191
           G+  RF+IG +     +++  + +E   + DF+ L+  E Y +L  KT  +F  A  L+D
Sbjct: 123 GLAIRFIIGKTKDEAKMVE--LRSEVAMYDDFILLDIEEEYSKLPYKTLAFFKAAYALYD 180

Query: 192 ADFYVKVDDDVHVNID--------------SYM------PTLFHRGVRYHEPEYWKFGES 231
           ++FYVK DDD+++  D              +Y+      P      ++++EP     G+ 
Sbjct: 181 SEFYVKADDDIYLRPDRLSLLLAKERGHSQTYLGCMKKGPVFTDPKLKWYEPLADLLGK- 239

Query: 232 GNKYFRHATGQLYAISNDLAT-YISMNQNVLHKYANEDVSLGSWFIGLDVEHIDDRRLCC 290
             +YF HA G +YA+S D+ T  +++  N    ++NEDV++G+W + ++V H +   LC 
Sbjct: 240 --EYFLHAYGPIYALSADVVTSLVALKNNSFRMFSNEDVTIGAWMLAMNVNHENLHTLC- 296

Query: 291 GTPPDCEWKAQAGNICVASFDWS-CSGICRSADRIKEVH 328
              P+C          +A +D   CSG+C    R+ E+H
Sbjct: 297 --EPEC------SPYSIAVWDIPKCSGLCNPEKRMLELH 327


>AT1G53290.1 | Symbols:  | Galactosyltransferase family protein |
           chr1:19871353-19873251 FORWARD LENGTH=345
          Length = 345

 Score = 99.8 bits (247), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/221 (28%), Positives = 113/221 (51%), Gaps = 36/221 (16%)

Query: 132 GIIFRFVIGHSATSGGILDRAIEAEDRKHGDFLRLNHVEGYLELSAKTKTYFATAVNLWD 191
           G+  RF+IG + +   +    +  E  ++ DF+ L+  E Y +L  KT  +F  A  L+D
Sbjct: 125 GLAIRFMIGKTKSEEKMAQ--LRREIAEYDDFVLLDIEEEYSKLPYKTLAFFKAAYALYD 182

Query: 192 ADFYVKVDDDVHVNID--------------SYM------PTLFHRGVRYHEPEYWKFGES 231
           ++FYVK DDD+++  D              +Y+      P      ++++EP     G+ 
Sbjct: 183 SEFYVKADDDIYLRPDRLSLLLAKERSHSQTYLGCLKKGPVFTDPKLKWYEPLSHLLGK- 241

Query: 232 GNKYFRHATGQLYAISND-LATYISMNQNVLHKYANEDVSLGSWFIGLDVEHIDDRRLCC 290
             +YF HA G +YA+S D +A+ +++  N    + NEDV++G+W + ++V H +   LC 
Sbjct: 242 --EYFLHAYGPIYALSADVVASLVALKNNSFRMFNNEDVTIGAWMLAMNVNHENHHILC- 298

Query: 291 GTPPDCEWKAQAGNICVASFDWS-CSGICRSADRIKEVHRR 330
              P+C   +      VA +D   CSG+C    R+ E+H++
Sbjct: 299 --EPECSPSS------VAVWDIPKCSGLCNPEKRMLELHKQ 331