KMC001557A_c01

Fasta Sequence
>KMC001557A_C01 KMC001557A_c01
agaaaaaaaaaaaggacctataatatatagttgaaggcagcaagcaaaacatgaaattca
atCACTTGAAACAGAAGTGCTAATTTTATTCTGGTTTTTTCAGGACTGTGAAGTTTCATC
AATAGTCCATTACAAGGATTCAAGTCCTTGGGCAAAAACTTCAGCAAAAGCCTCCATTTG
AGACTTGTTCAATGCCAAACCAATCTCAATTCCTCCATGAATGTCCCTACTATCTGAAAG
AGAAAATGCTCCAGTTCTATCAACAGAAGTAACATCCACCTTCTTAGGCCTTCCCCATCC
AAAATCAACACCAAAAACCTCAAACCGAGGTGACCCTGCAGTGGAAAACATCTTATCACC
CGCCACAGATTGTATCTTTGAAATCCAATCCTCTGCTCCATTCATCACCCCACTCTCCAA
TTCACTCAACTCATCAGAAATCACCACAAGAGCATTGATGAACCCATCATCACCCAAAAC
CTCTCCTGTTTCAGCCACAGCCAAGTGAGGCATAATGCAATTCCCAAAATAAGTTGGCTT
AATTGGAGGCTCCAAACGGGTTCTACAATCCACACTGAACACAAAAGCTACTCTCTTCG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001557A_C01 KMC001557A_c01
         (599 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAA74428.1| Anthocyanin 5-aromatic acyltransferase [Gentiana...   131  8e-30
gb|AAK96528.1| AT5g39050/MXF12_60 [Arabidopsis thaliana] gi|2509...   120  1e-26
ref|NP_189605.1| hypothetical protein; protein id: At3g29635.1 [...   120  1e-26
gb|AAL50566.1|AF405707_1 malonyl CoA:anthocyanin 5-O-glucoside-6...   119  3e-26
ref|NP_198725.1| acyltransferase -like  protein; protein id: At5...   117  1e-25

>dbj|BAA74428.1| Anthocyanin 5-aromatic acyltransferase [Gentiana triflora]
          Length = 469

 Score =  131 bits (329), Expect = 8e-30
 Identities = 63/154 (40%), Positives = 92/154 (58%), Gaps = 3/154 (1%)
 Frame = -3

Query: 585 FVFSVDCRTRLEPPIKPTYFGNCIMPHLAVAETGEVLGDDGFINALVVISDELSEL---E 415
           F F+ DCR  L PP  P YFGNC+   +A A   E++GD G + A+  I + + +    E
Sbjct: 315 FSFTADCRGLLTPPCPPNYFGNCLASCVAKATHKELVGDKGLLVAVAAIGEAIEKRLHNE 374

Query: 414 SGVMNGAEDWISKIQSVAGDKMFSTAGSPRFEVFGVDFGWGRPKKVDVTSVDRTGAFSLS 235
            GV+  A+ W+S+   +   +     GSP+F+ +GVDFGWG+P K D+TSVD      + 
Sbjct: 375 KGVLADAKTWLSESNGIPSKRFLGITGSPKFDSYGVDFGWGKPAKFDITSVDYAELIYVI 434

Query: 234 DSRDIHGGIEIGLALNKSQMEAFAEVFAQGLESL 133
            SRD   G+EIG++L K  M+AFA++F +G  SL
Sbjct: 435 QSRDFEKGVEIGVSLPKIHMDAFAKIFEEGFCSL 468
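
The "Frame = -3" line and the descending query coordinates (585 down to 133) mean the aligned protein was read from the reverse complement of the query, starting at the third base from its 3' end. The sketch below illustrates that reading under the standard genetic code. The function names are hypothetical and this is not the BLASTX implementation. The short test fragment is query bases 133-150 copied from the Fasta Sequence section above.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (case-insensitive input)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def translate_minus_frame(seq, frame):
    """Translate a negative reading frame (-1, -2 or -3) of seq.

    Frame -1 starts at the last base of seq, -2 at the second to last,
    and -3 at the third to last. Codons with unexpected letters give 'X'.
    """
    if frame not in (-1, -2, -3):
        raise ValueError("frame must be -1, -2 or -3")
    rc = reverse_complement(seq)
    start = -frame - 1
    codons = (rc[i:i + 3] for i in range(start, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Query bases 133-150, which the first alignment reports as ...QGLESL.
fragment = "CAAGGATTCAAGTCCTTG"
print(translate_minus_frame(fragment, -1))
```

Passing the full 599-letter query with frame=-3 should give the protein that the Query lines above are drawn from, since 599 - 2 = 597 and 597, 585 and 135 all fall on the same codon phase.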

>gb|AAK96528.1| AT5g39050/MXF12_60 [Arabidopsis thaliana]
           gi|25090341|gb|AAN72280.1| At5g39050/MXF12_60
           [Arabidopsis thaliana]
          Length = 469

 Score =  120 bits (301), Expect = 1e-26
 Identities = 58/154 (37%), Positives = 98/154 (62%), Gaps = 2/154 (1%)
 Frame = -3

Query: 591 VAFVFSVDCRTRLEPPIKPTYFGNCIMPHLAVAETGEV-LGDDGFINALVVISDELSELE 415
           V + F+VDCR+ + PP+  +YFGNC+     ++ T E  + ++GF+ A  ++SD +  L+
Sbjct: 316 VGYGFAVDCRSLMVPPVPSSYFGNCVSACFKMSLTAETFMSEEGFLAAARMVSDSVEALD 375

Query: 414 SGVMNGAEDWISKIQSVA-GDKMFSTAGSPRFEVFGVDFGWGRPKKVDVTSVDRTGAFSL 238
             V     + +    +++ G ++ S AGS RF V+G+DFGWGRP+KV V S+D+  A S 
Sbjct: 376 ENVALKIPEILEGFTTLSPGTQVLSVAGSTRFGVYGLDFGWGRPEKVVVVSIDQGEAISF 435

Query: 237 SDSRDIHGGIEIGLALNKSQMEAFAEVFAQGLES 136
           ++SRD  GG+E+G +L K +M+   ++  +GLE+
Sbjct: 436 AESRDGSGGVELGFSLKKHEMDVLVDLLHKGLEN 469

>ref|NP_189605.1| hypothetical protein; protein id: At3g29635.1 [Arabidopsis
           thaliana]
          Length = 458

 Score =  120 bits (301), Expect = 1e-26
 Identities = 63/154 (40%), Positives = 95/154 (60%), Gaps = 3/154 (1%)
 Frame = -3

Query: 591 VAFVFSVDCRTRLEPPIKPTYFGNCIMPHLAVAETGEV-LGDDGFINALVVISDELSELE 415
           V F+++ D R RL+PP+   YFGNC+ P         V LG+DGF+N + ++SD +  + 
Sbjct: 302 VRFMYAADFRNRLDPPVPEMYFGNCVFPIGCFGYKANVFLGEDGFVNMVEILSDSVRSIG 361

Query: 414 SGVMNG-AEDWISKIQSVA-GDKMFSTAGSPRFEVFGVDFGWGRPKKVDVTSVDRTGAFS 241
              +    E +I+  +SV  G ++ S AGS +F ++G DFGWG+P   ++ S+DR  AFS
Sbjct: 362 LRKLETICELYINGTKSVKPGTQIGSIAGSNQFGLYGSDFGWGKPCNSEIASIDRNEAFS 421

Query: 240 LSDSRDIHGGIEIGLALNKSQMEAFAEVFAQGLE 139
           +S+ RD  GG+EIGL L K +M+ F  +F  GLE
Sbjct: 422 MSERRDEPGGVEIGLCLKKCEMDIFIYLFQNGLE 455

>gb|AAL50566.1|AF405707_1 malonyl CoA:anthocyanin 5-O-glucoside-6'''-O-malonyltransferase
           [Salvia splendens]
          Length = 462

 Score =  119 bits (298), Expect = 3e-26
 Identities = 58/148 (39%), Positives = 89/148 (59%), Gaps = 4/148 (2%)
 Frame = -3

Query: 585 FVFSVDCRTRLEPPIKPTYFGNCIMPHLAVAETGEVLGDDGFINALVVISDELS---ELE 415
           F+   D R R++PPI   YFGNCI+  +A  E G++  +DGF  A   I  E+    +  
Sbjct: 294 FIIPADARGRVDPPIPENYFGNCIVSSVAQVERGKLAAEDGFAVAAEAIGGEIEGKLKNR 353

Query: 414 SGVMNGAEDWISKIQSVAGDKMFSTAGSPRFEVFGVDFGWGRPKKVDVTSVD-RTGAFSL 238
             ++ GAE+W+S I    G  +   +GSP+F++   DFGWG+ +K++V S+D    + SL
Sbjct: 354 DEILRGAENWMSDIFKCFGMSVLGVSGSPKFDLLKADFGWGKARKLEVLSIDGENHSMSL 413

Query: 237 SDSRDIHGGIEIGLALNKSQMEAFAEVF 154
             S D +GG+E+GL+L + +M AF EVF
Sbjct: 414 CSSSDFNGGLEVGLSLPRERMAAFEEVF 441

>ref|NP_198725.1| acyltransferase -like  protein; protein id: At5g39090.1, supported
           by cDNA: gi_20466677 [Arabidopsis thaliana]
           gi|10177552|dbj|BAB10831.1| anthocyanin
           acyltransferase-like protein [Arabidopsis thaliana]
           gi|20466678|gb|AAM20656.1| acyltransferase-like protein
           [Arabidopsis thaliana]
          Length = 448

 Score =  117 bits (293), Expect = 1e-25
 Identities = 60/154 (38%), Positives = 94/154 (60%), Gaps = 3/154 (1%)
 Frame = -3

Query: 591 VAFVFSVDCRTRLE-PPIKPTYFGNCIMPHLAVA-ETGEVLGDDGFINALVVISDELSEL 418
           V + F+VDCR  ++ PPI  TYFGNC+   + +  + G  LG+ GF+ A  +ISD + EL
Sbjct: 295 VGYRFAVDCRRLIDDPPIPLTYFGNCVYSAVKIPLDAGMFLGEQGFVVAARLISDSVEEL 354

Query: 417 ESGVMNGAEDWISKIQSVAGDKMF-STAGSPRFEVFGVDFGWGRPKKVDVTSVDRTGAFS 241
           +S V     + +   +    D  F S AGS RF ++G+DFGWG+P K  + S+D+ G  S
Sbjct: 355 DSNVAWKIPELLETYEKAPVDSQFVSVAGSTRFGIYGLDFGWGKPFKSLLVSIDQRGKIS 414

Query: 240 LSDSRDIHGGIEIGLALNKSQMEAFAEVFAQGLE 139
           +++SRD  GG+EIG +L K +M    ++  +G++
Sbjct: 415 IAESRDGSGGVEIGFSLKKQEMNVLIDLLHKGIK 448

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 545,931,064
Number of Sequences: 1393205
Number of extensions: 12602428
Number of successful extensions: 37283
Number of sequences better than 10.0: 177
Number of HSP's better than 10.0 without gapping: 34869
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 36979
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 23426109484
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
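
The parameters above tie the raw scores, bit scores, and Expect values in the hit list together through the Karlin-Altschul relations: bit score S' = (Lambda * S - ln K) / ln 2, and E = (effective search space) * 2^(-S'). The following is a minimal sketch of that arithmetic using the gapped Lambda and K printed in this report. The function names are hypothetical. Because the printed Lambda and K are rounded, the results should be expected to match the reported figures only approximately.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score: (lam*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# Values copied from this report (gapped statistics, first hit BAA74428.1).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 23426109484  # "effective search space used"
RAW_SCORE = 329             # the number in parentheses after "131 bits"

bits = bit_score(RAW_SCORE, GAPPED_LAMBDA, GAPPED_K)
e_value = expect_value(bits, SEARCH_SPACE)
print(f"bit score ~ {bits:.1f}, E ~ {e_value:.1e}")
```

For the first hit this lands close to the reported 131 bits and on the same order of magnitude as the reported Expect of 8e-30.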


EST assemble image


No.  Clone        Accession  Start  End
1    MR029e07_f   BP078246       1  349
2    MPD027h07_f  AV771877      30  603
3    MFB086g12_f  BP040312      63  562
4    GENf004d04   BP058476      68  482
5    MR080e03_f   BP082159      71  463
6    MR039c09_f   BP079009      71  592




Lotus japonicus
Kazusa DNA Research Institute