
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC126019.1 + phase: 0
(253 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At3g07020 UDP-glucose:sterol glucosyltransferase 352 8e-98
At1g43620 unknown protein 273 8e-74
At5g24750 sterol glucosyltransferase - like protein 71 5e-13
At2g31790 putative glucosyltransferase 45 5e-05
At1g07260 unknown protein 33 0.19
At4g15490 indole-3-acetate beta-glucosyltransferase like protein 31 0.56
At4g09500 unknown protein 30 1.2
At1g48760 putative protein 29 2.8
At2g36760 putative glucosyl transferase 28 3.6
At2g30140 putative glucosyltransferase 28 3.6
At2g26890 unknown protein 28 3.6
At5g59590 glucuronosyl transferase - like protein 28 4.7
At5g13800 unknown protein 28 4.7
At3g47000 beta-D-glucan exohydrolase - like protein 28 4.7
At5g17050 UDP glucose:flavonoid 3-o-glucosyltransferase -like pr... 28 6.2
At2g36780 putative glucosyl transferase 28 6.2
At2g36770 putative glucosyl transferase 28 6.2
At2g30150 putative glucosyltransferase 28 6.2
At2g22930 putative flavonol 3-O-glucosyltransferase 28 6.2
At1g24370 hypothetical protein 28 6.2
>At3g07020 UDP-glucose:sterol glucosyltransferase
Length = 637
Score = 352 bits (904), Expect = 8e-98
Identities = 172/229 (75%), Positives = 189/229 (82%), Gaps = 6/229 (2%)
Query: 28 DWGPKIDVVGFCFLDLASNYEPPESLVKWLEDGDKPIYVGFGSLPVQDPKTMTEIIVEAL 87
DWGP+IDVVGFC+LDLASNYEPP LV+WLE GDKPIY+GFGSLPVQ+P+ MTEIIVEAL
Sbjct: 412 DWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIYIGFGSLPVQEPEKMTEIIVEAL 471
Query: 88 ETTGQRGIISKGWGGLGDLTEPKDSIYLLDNVPHDWLFLHCKAVVHHGGAGTTAAGLKAA 147
+ T QRGII+KGWGGLG+L EPKD +YLLDNVPHDWLF CKAVVHHGGAGTTAAGLKA+
Sbjct: 472 QRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLFPRCKAVVHHGGAGTTAAGLKAS 531
Query: 148 CPTTIVPFFGDQPFWGERVHDRGVGPPPIPIDVFSLPKLIAAINFMLDPKVKEHAIELAK 207
CPTTIVPFFGDQPFWGERVH RGVGP PIP+D FSL KL AINFMLD KVK A LAK
Sbjct: 532 CPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHKLEDAINFMLDDKVKSSAETLAK 591
Query: 208 AMENEDGVTGAVKVFFKQLP---KNKPEPNTEPSSSSCFSYIARCFGYS 253
AM++EDGV GAVK FFK LP +N +P EPS F +CFG S
Sbjct: 592 AMKDEDGVAGAVKAFFKHLPSAKQNISDPIPEPSG---FLSFRKCFGCS 637
>At1g43620 unknown protein
Length = 615
Score = 273 bits (697), Expect = 8e-74
Identities = 120/210 (57%), Positives = 158/210 (75%), Gaps = 1/210 (0%)
Query: 27 ADWGPKIDVVGFCFLDLASNYEPPESLVKWLEDGDKPIYVGFGSLPVQDPKTMTEIIVEA 86
+DWGP +DVVG+CFL+L S Y+P E + W+E G P+Y+GFGS+P+ DPK +II+E
Sbjct: 374 SDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVYIGFGSMPLDDPKQTMDIILET 433
Query: 87 LETTGQRGIISKGWGGLGDL-TEPKDSIYLLDNVPHDWLFLHCKAVVHHGGAGTTAAGLK 145
L+ T QRGI+ +GWGGLG+L TE ++++L+++ PHDWLF C AVVHHGGAGTTA GLK
Sbjct: 434 LKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWLFPQCSAVVHHGGAGTTATGLK 493
Query: 146 AACPTTIVPFFGDQPFWGERVHDRGVGPPPIPIDVFSLPKLIAAINFMLDPKVKEHAIEL 205
A CPTTIVPFFGDQ FWG+R++++G+GP PIPI S+ L ++I FML P+VK +EL
Sbjct: 494 AGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVENLSSSIRFMLQPEVKSQVMEL 553
Query: 206 AKAMENEDGVTGAVKVFFKQLPKNKPEPNT 235
AK +ENEDGV AV F + LP P P +
Sbjct: 554 AKVLENEDGVAAAVDAFHRHLPPELPLPES 583
>At5g24750 sterol glucosyltransferase - like protein
Length = 517
Score = 71.2 bits (173), Expect = 5e-13
Identities = 53/189 (28%), Positives = 86/189 (45%), Gaps = 39/189 (20%)
Query: 63 PIYVGFGSLP----VQDPKTMTEIIVEALETTGQRGII-SKGWGGL---------GDLTE 108
PI+VG S+ V+DP ++ ++ TG R II + +G L G +
Sbjct: 282 PIFVGLSSVGSMGFVRDPIAFLRVLQSVIQITGYRFIIFTASYGPLDAAIRTIANGSDSS 341
Query: 109 PKDSIY------------LLDNVPHDWLFLHCKAVVHHGGAGTTAAGLKAACPTTIVPFF 156
K ++ VP++W+F C A +HHGG+G+ AA L+A P I PF
Sbjct: 342 EKQPLHAGISIFNGKLFCFSGMVPYNWMFRTCAAAIHHGGSGSVAAALQAGIPQIICPFM 401
Query: 157 GDQPFWGERVHDRGVGPPPI------------PIDVFSLPKLIA-AINFMLDPKVKEHAI 203
DQ +W E++ GV P P+ ++ +++A AI L K + A+
Sbjct: 402 LDQFYWAEKMSWLGVAPQPLKRNHLLLEDSNDEKNITEAAQVVAKAIYDALSAKTRARAM 461
Query: 204 ELAKAMENE 212
E+A+ + E
Sbjct: 462 EIAEILSLE 470
>At2g31790 putative glucosyltransferase
Length = 457
Score = 44.7 bits (104), Expect = 5e-05
Identities = 44/137 (32%), Positives = 60/137 (43%), Gaps = 16/137 (11%)
Query: 45 SNYEPPESLVKWLEDGDKP----IYVGFGSLPVQDPKTMTEIIVEALETTG-------QR 93
S EP ES++KWL G++P +YV FG+L K M EI + A+ TG +
Sbjct: 254 SKTEPDESVLKWL--GNRPAKSVVYVAFGTLVALSEKQMKEIAM-AISQTGYHFLWSVRE 310
Query: 94 GIISKGWGGLGDLTEPKDSIYLLDNVPHDWLFLH--CKAVVHHGGAGTTAAGLKAACPTT 151
SK G + E KDS + VP + H V H G +T L P
Sbjct: 311 SERSKLPSGFIEEAEEKDSGLVAKWVPQLEVLAHESIGCFVSHCGWNSTLEALCLGVPMV 370
Query: 152 IVPFFGDQPFWGERVHD 168
VP + DQP + + D
Sbjct: 371 GVPQWTDQPTNAKFIED 387
>At1g07260 unknown protein
Length = 476
Score = 32.7 bits (73), Expect = 0.19
Identities = 19/45 (42%), Positives = 27/45 (59%), Gaps = 3/45 (6%)
Query: 51 ESLVKWLEDGDKP--IYVGFGSLPVQDPKTMTEIIVEALETTGQR 93
+ +++WLED + +Y+ FGSL + K E I EALE TG R
Sbjct: 269 DRIMRWLEDQPESSIVYICFGSLGIIG-KLQIEEIAEALELTGHR 312
>At4g15490 indole-3-acetate beta-glucosyltransferase like protein
Length = 479
Score = 31.2 bits (69), Expect = 0.56
Identities = 28/131 (21%), Positives = 49/131 (37%), Gaps = 28/131 (21%)
Query: 48 EPPESLVKWLEDGDKP--IYVGFGSLPVQDPKTMTEIIVEALETTGQRGIISKGWGGLGD 105
EP ++WL+ + +Y+ FG++ + M EI G++S G L
Sbjct: 265 EPASDCMEWLDSREPSSVVYISFGTIANLKQEQMEEIA---------HGVLSSGLSVLWV 315
Query: 106 LTEPKDSIYLLDNV---------------PHDWLFLH--CKAVVHHGGAGTTAAGLKAAC 148
+ P + ++ +V P + + H + H G +T L A
Sbjct: 316 VRPPMEGTFVEPHVLPRELEEKGKIVEWCPQERVLAHPAIACFLSHCGWNSTMEALTAGV 375
Query: 149 PTTIVPFFGDQ 159
P P +GDQ
Sbjct: 376 PVVCFPQWGDQ 386
>At4g09500 unknown protein
Length = 417
Score = 30.0 bits (66), Expect = 1.2
Identities = 27/107 (25%), Positives = 45/107 (41%), Gaps = 9/107 (8%)
Query: 132 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHDR---GVGPPPIPIDVFSLPKLIA 188
V+H G GT L + C ++PF DQ + + + V P FS L
Sbjct: 309 VNHCGPGTIWESLVSDCQMVLIPFLSDQVLFTRLMTEEFEVSVEVPREKTGWFSKESLSN 368
Query: 189 AINFMLDPK------VKEHAIELAKAMENEDGVTGAVKVFFKQLPKN 229
AI ++D V+ + +L + + + +TG V F + L +N
Sbjct: 369 AIKSVMDKDSDIGKLVRSNHTKLKEILVSPGLLTGYVDHFVEGLQEN 415
>At1g48760 putative protein
Length = 869
Score = 28.9 bits (63), Expect = 2.8
Identities = 14/37 (37%), Positives = 19/37 (50%)
Query: 195 DPKVKEHAIELAKAMENEDGVTGAVKVFFKQLPKNKP 231
DP VK A+ L AM NED V+ ++ K+ P
Sbjct: 346 DPNVKLEALHLLMAMVNEDNVSEISRILMNYALKSDP 382
>At2g36760 putative glucosyl transferase
Length = 496
Score = 28.5 bits (62), Expect = 3.6
Identities = 33/125 (26%), Positives = 50/125 (39%), Gaps = 17/125 (13%)
Query: 51 ESLVKWLE--DGDKPIYVGFGSLPVQDPKTMTEIIVEALETTGQRGI-ISKGWGGLGDLT 107
+ +KWL+ D + +YV GS+ P + LE T + I + +G G +L
Sbjct: 276 DECIKWLDSKDVESVLYVCLGSI-CNLPLAQLRELGLGLEATKRPFIWVIRGGGKYHELA 334
Query: 108 E----------PKDSIYLLDNVPHDWLFLHCKAV---VHHGGAGTTAAGLKAACPTTIVP 154
E K+ L+ L L AV + H G +T G+ + P P
Sbjct: 335 EWILESGFEERTKERSLLIKGWSPQMLILSHPAVGGFLTHCGWNSTLEGITSGVPLITWP 394
Query: 155 FFGDQ 159
FGDQ
Sbjct: 395 LFGDQ 399
>At2g30140 putative glucosyltransferase
Length = 455
Score = 28.5 bits (62), Expect = 3.6
Identities = 32/126 (25%), Positives = 53/126 (41%), Gaps = 22/126 (17%)
Query: 46 NYEPPESLVKWLEDGDKP--IYVGFGSLPVQDPKTMTEIIVEALETTGQRGI-ISKGW-- 100
N EP + ++WLE+ + +Y+ GS M EI V+ L +G R + +++G
Sbjct: 254 NKEP--NYIQWLEEQPEGSVLYISQGSFLSVSEAQMEEI-VKGLRESGVRFLWVARGGEL 310
Query: 101 -------GGLGDLTEPKDSIYLLDNVPHDWLFLHCKAVVHHGGAGTTAAGLKAACPTTIV 153
G LG + D + +L + + HC G +T G+ + P
Sbjct: 311 KLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHC-------GFNSTLEGIYSGVPMLAF 363
Query: 154 PFFGDQ 159
P F DQ
Sbjct: 364 PLFWDQ 369
>At2g26890 unknown protein
Length = 2535
Score = 28.5 bits (62), Expect = 3.6
Identities = 20/72 (27%), Positives = 27/72 (36%)
Query: 76 PKTMTEIIVEALETTGQRGIISKGWGGLGDLTEPKDSIYLLDNVPHDWLFLHCKAVVHHG 135
PK + + E T G + G + E D L P + + L C V+H
Sbjct: 2287 PKLVAAVAYEGRRETMSSGEVKAEEIGSDGVNESTDPSSLPGQTPQERVRLSCLRVLHQL 2346
Query: 136 GAGTTAAGLKAA 147
A TT A AA
Sbjct: 2347 AASTTCAEAMAA 2358
>At5g59590 glucuronosyl transferase - like protein
Length = 449
Score = 28.1 bits (61), Expect = 4.7
Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 2/41 (4%)
Query: 44 ASNYEPPESLVKWL--EDGDKPIYVGFGSLPVQDPKTMTEI 82
+S E S V+WL + + IY+ GSL + D K M E+
Sbjct: 246 SSLLEEDRSCVEWLNKQKSNSVIYISLGSLALMDTKDMLEM 286
>At5g13800 unknown protein
Length = 484
Score = 28.1 bits (61), Expect = 4.7
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 9/46 (19%)
Query: 65 YVGFG-SLPVQDPKTMTEIIVEALETTGQRGIISKGWGGLGDLTEP 109
++G G SLP +DP TMTE ET+ + + G GD TEP
Sbjct: 150 FLGQGLSLPTEDPTTMTE------ETSSSED--KEPFWGFGDKTEP 187
>At3g47000 beta-D-glucan exohydrolase - like protein
Length = 608
Score = 28.1 bits (61), Expect = 4.7
Identities = 15/36 (41%), Positives = 21/36 (57%), Gaps = 3/36 (8%)
Query: 79 MTEIIVEALETTGQRGIISKGWGGLGDLTEPKDSIY 114
+TEI+ E L G +G + W GL L+EP+ S Y
Sbjct: 273 LTEILKEKL---GFKGFLVSDWEGLDRLSEPQGSNY 305
>At5g17050 UDP glucose:flavonoid 3-o-glucosyltransferase -like
protein
Length = 460
Score = 27.7 bits (60), Expect = 6.2
Identities = 33/143 (23%), Positives = 50/143 (34%), Gaps = 10/143 (6%)
Query: 33 IDVVGFCFLDLASNYEPPESLVKWLE--DGDKPIYVGFGSLPVQDPKTMTEIIVEALETT 90
I +G L + P + W+E Y+ FG++ P + I E LE++
Sbjct: 247 IGPLGLLSSTLQQLVQDPHGCLAWMEKRSSGSVAYISFGTVMTPPPGELAAI-AEGLESS 305
Query: 91 GQRGIISKGWGGL-----GDLTEPKDSIYLLDNVPHDWLFLH--CKAVVHHGGAGTTAAG 143
+ S L G L ++ ++ P L H V H G +
Sbjct: 306 KVPFVWSLKEKSLVQLPKGFLDRTREQGIVVPWAPQVELLKHEATGVFVTHCGWNSVLES 365
Query: 144 LKAACPTTIVPFFGDQPFWGERV 166
+ P PFFGDQ G V
Sbjct: 366 VSGGVPMICRPFFGDQRLNGRAV 388
>At2g36780 putative glucosyl transferase
Length = 496
Score = 27.7 bits (60), Expect = 6.2
Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 16/72 (22%)
Query: 88 ETTGQRGIISKGWGGLGDLTEPKDSIYLLDNVPHDWLFLHCKAVVHHGGAGTTAAGLKAA 147
E +RG++ KGW + +L + HC G +T G+ +
Sbjct: 344 ERIKERGLLIKGWA---------PQVLILSHPSVGGFLTHC-------GWNSTLEGITSG 387
Query: 148 CPTTIVPFFGDQ 159
P P FGDQ
Sbjct: 388 IPLITWPLFGDQ 399
>At2g36770 putative glucosyl transferase
Length = 496
Score = 27.7 bits (60), Expect = 6.2
Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 16/72 (22%)
Query: 88 ETTGQRGIISKGWGGLGDLTEPKDSIYLLDNVPHDWLFLHCKAVVHHGGAGTTAAGLKAA 147
E +RG++ KGW + +L + HC G +T G+ +
Sbjct: 344 ERIKERGLLIKGWS---------PQVLILSHPSVGGFLTHC-------GWNSTLEGITSG 387
Query: 148 CPTTIVPFFGDQ 159
P P FGDQ
Sbjct: 388 IPLITWPLFGDQ 399
>At2g30150 putative glucosyltransferase
Length = 440
Score = 27.7 bits (60), Expect = 6.2
Identities = 27/116 (23%), Positives = 45/116 (38%), Gaps = 18/116 (15%)
Query: 55 KWLEDGDKP--IYVGFGSLPVQDPKTMTEIIVEALETTGQRGIISKGW---------GGL 103
KWL++ + +Y+ GS M EI+V E + +++G G L
Sbjct: 246 KWLDEQPESSVLYISQGSFLSVSEAQMEEIVVGVREAGVKFFWVARGGELKLKEALEGSL 305
Query: 104 GDLTEPKDSIYLLDNVPHDWLFLHCKAVVHHGGAGTTAAGLKAACPTTIVPFFGDQ 159
G + D + +L + + HC G +T G+ + P P F DQ
Sbjct: 306 GVVVSWCDQLRVLCHAAIGGFWTHC-------GYNSTLEGICSGVPLLTFPVFWDQ 354
>At2g22930 putative flavonol 3-O-glucosyltransferase
Length = 442
Score = 27.7 bits (60), Expect = 6.2
Identities = 26/104 (25%), Positives = 43/104 (41%), Gaps = 9/104 (8%)
Query: 132 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHDR---GVGPPPIPIDVFSLPKLIA 188
V+H G GT L C ++PF GDQ + + + V FS L
Sbjct: 334 VNHCGPGTIWECLMTDCQMVLLPFLGDQVLFTRLMTEEFKVSVEVSREKTGWFSKESLSD 393
Query: 189 AINFMLDPK------VKEHAIELAKAMENEDGVTGAVKVFFKQL 226
AI ++D V+ + +L + + + +TG V F ++L
Sbjct: 394 AIKSVMDKDSDLGKLVRSNHAKLKETLGSHGLLTGYVDKFVEEL 437
>At1g24370 hypothetical protein
Length = 413
Score = 27.7 bits (60), Expect = 6.2
Identities = 21/90 (23%), Positives = 34/90 (37%), Gaps = 4/90 (4%)
Query: 33 IDVVGFCFLDLASNYEPPESLVKWLEDGDKPIYVGFGSLPVQDPKTMTEIIVEALETTGQ 92
IDV+ C L N+ P ++++ W + +Q+ + +E T
Sbjct: 270 IDVIKRCSSSLYKNFSPIDTMLDWPPENPFEHSKLLNYYELQEDNDWIRLYLELAVATTN 329
Query: 93 RGIISKGWGGLGDLTEPKDSIYLLDNVPHD 122
RG I L DL+ K +D P D
Sbjct: 330 RGTIR----DLDDLSNLKIIQVAIDTTPQD 355
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.322 0.142 0.453
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,489,011
Number of Sequences: 26719
Number of extensions: 296323
Number of successful extensions: 720
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 710
Number of HSP's gapped (non-prelim): 23
length of query: 253
length of database: 11,318,596
effective HSP length: 97
effective length of query: 156
effective length of database: 8,726,853
effective search space: 1361389068
effective search space used: 1361389068
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (27.3 bits)
Medicago: description of AC126019.1