KMC001787A_c01

Fasta Sequence
>KMC001787A_C01 KMC001787A_c01
atggtataaggggataaaTTTTTTGAAAAGTAAAAAGAACAATTTTATTTCATAGATAAA
GTCATAGGACATTGAAGAGACACTTCTAGAATCACAGTGAACAACAAGCTTGATTAAACA
ACTTTTTGAAGATAAGAATTAAGAATCCCTCCGTGCAAAGTACAAGATGGAATTTGAGAA
GAGAACATTAATTAGTATGTTATCTCACGGCTTGGATTGCAAAGTGGCATATTGAAGAAG
CTCCACATCGCGCTCAAACCTGACGATGTCTTCCTCTTCCATATTCACAATCACTTCGAC
TCCTCGCCCATCTCTCGTATCGGTCAAAAGCATACCATTACTCACGAACCACCCACAAGT
GGTTGCCCAGAGGCGGTTTGCCCCACCCAAAATCCGCATTATACAAGGGGAACCCGCACC
AACTAGTGTAAATAAACAAATTGTGAACGTCCTTGATCTCATCATAACCACGAGGCTTGT
TCAAAGGTGAAGGACCTTGGTTCATGCACTTGGAGATGAACGCCACATCCTTATCTTCCC
CTCCAAATTTCTTTGGATACACTTCACAAAAATC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001787A_C01 KMC001787A_c01
         (574 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAF34801.1|AF227981_1 F21J9.20-like protein [Euphorbia esula]       47  3e-10
ref|NP_173851.1| acyltransferase family; protein id: At1g24420.1...    43  8e-09
ref|NP_193274.1| acyltransferase family; protein id: At4g15390.1...    41  5e-08
ref|NP_189647.1| acyltransferase family; protein id: At3g30280.1...    39  8e-08
dbj|BAB01067.1| acetyltranferase-like protein [Arabidopsis thali...    40  3e-07

>gb|AAF34801.1|AF227981_1 F21J9.20-like protein [Euphorbia esula]
          Length = 219

 Score = 46.6 bits (109), Expect(2) = 3e-10
 Identities = 26/67 (38%), Positives = 31/67 (45%)
 Frame = -1

Query: 574 DFCEVYPKKFGGEDKDVAFISKCMNQGPSPLNKPRGYDEIKDVHNLFIYTSWCGFPLYNA 395
           DF E Y KK  GED   A              K      + D  + F+ + WC F LY+A
Sbjct: 105 DFVENYVKKVQGEDGVGAICE---------FGKDFAEKALSDKIDFFMCSGWCRFGLYDA 155

Query: 394 DFGWGKP 374
           DFGWGKP
Sbjct: 156 DFGWGKP 162

 Score = 39.7 bits (91), Expect(2) = 3e-10
 Identities = 18/54 (33%), Positives = 28/54 (51%)
 Frame = -2

Query: 387 GGANRLWATTCGWFVSNGMLLTDTRDGRGVEVIVNMEEEDIVRFERDVELLQYA 226
           G     W +     + N  +L DT+DG G E  + + EED+  FE D  +L++A
Sbjct: 158 GWGKPTWLSIVSTNIRNVCILLDTKDGEGFEAWITLSEEDMSWFESDERVLEFA 211
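
The "Frame = -1" query line in the first alignment for AAF34801.1 above is the conceptual translation of the reverse complement of the nucleotide query, read from the last base (position 574) backwards. As a rough illustration only, the sketch below reproduces the first eleven residues of that query line from the final 33 bases of the FASTA sequence. The function names are hypothetical, and the standard genetic code (NCBI translation table 1) is assumed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons from the first base; unknown codons become X."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


# Last 33 bases of the query (positions 542-574 of the FASTA sequence).
tail = "TCCAAATTTCTTTGGATACACTTCACAAAAATC"

protein = translate(reverse_complement(tail))
print(protein)  # DFCEVYPKKFG
```

The printed residues match the start of the query line "Query: 574 DFCEVYPKKFGG..." in that alignment.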

>ref|NP_173851.1| acyltransferase family; protein id: At1g24420.1 [Arabidopsis
           thaliana] gi|25403243|pir||D86378 protein F21J9.8
           [imported] - Arabidopsis thaliana
           gi|9743331|gb|AAF97955.1|AC000103_5 F21J9.8 [Arabidopsis
           thaliana]
          Length = 436

 Score = 42.7 bits (99), Expect(2) = 8e-09
 Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
 Frame = -2

Query: 387 GGANRLWATTCGW-FVSNGMLLTDTRDGRGVEVIVNMEEEDIVRFERDVELLQYATL 220
           G    +W T  G     N MLL DT+DG G+E  + + EE +  FE D ELL+ A+L
Sbjct: 374 GWGKPVWVTGRGTSHFKNLMLLIDTKDGEGIEAWITLTEEQMSLFECDQELLESASL 430

 Score = 38.5 bits (88), Expect(2) = 8e-09
 Identities = 14/23 (60%), Positives = 18/23 (77%)
 Frame = -1

Query: 442 NLFIYTSWCGFPLYNADFGWGKP 374
           +L++  SWC   LY+ADFGWGKP
Sbjct: 356 DLWMSNSWCKLGLYDADFGWGKP 378

>ref|NP_193274.1| acyltransferase family; protein id: At4g15390.1, supported by cDNA:
           gi_20466571 [Arabidopsis thaliana]
           gi|7485094|pir||D71418 hypothetical protein -
           Arabidopsis thaliana gi|2244896|emb|CAB10318.1| HSR201
           like protein [Arabidopsis thaliana]
           gi|7268286|emb|CAB78581.1| HSR201 like protein
           [Arabidopsis thaliana] gi|20466572|gb|AAM20603.1| HSR201
           like protein [Arabidopsis thaliana]
           gi|22136378|gb|AAM91267.1| HSR201-like protein
           [Arabidopsis thaliana]
          Length = 446

 Score = 40.8 bits (94), Expect(2) = 5e-08
 Identities = 17/42 (40%), Positives = 30/42 (70%)
 Frame = -2

Query: 345 VSNGMLLTDTRDGRGVEVIVNMEEEDIVRFERDVELLQYATL 220
           + N  +L D++DG+G+E  V + EE++  FE++ ELL +AT+
Sbjct: 399 LGNLAMLIDSKDGQGIEAFVTLPEENMSSFEQNPELLAFATM 440

 Score = 37.7 bits (86), Expect(2) = 5e-08
 Identities = 15/35 (42%), Positives = 19/35 (53%)
 Frame = -1

Query: 451 DVHNLFIYTSWCGFPLYNADFGWGKPPLGNHLWVV 347
           + H  +  +SWC  PLY A FGW  P     +WVV
Sbjct: 363 ETHEPYTVSSWCKLPLYEASFGWDSP-----VWVV 392

>ref|NP_189647.1| acyltransferase family; protein id: At3g30280.1 [Arabidopsis
           thaliana] gi|9294332|dbj|BAB02229.1|
           acetyl-CoA:benzylalcohol acetyltranferase-like protein
           [Arabidopsis thaliana]
          Length = 443

 Score = 38.9 bits (89), Expect(2) = 8e-08
 Identities = 15/42 (35%), Positives = 30/42 (70%)
 Frame = -2

Query: 345 VSNGMLLTDTRDGRGVEVIVNMEEEDIVRFERDVELLQYATL 220
           + N  +L D++DG+G+E  V + EE+++  E++ ELL +A++
Sbjct: 396 LENVTMLIDSKDGQGIEAFVTLPEENMLSLEQNTELLAFASV 437

 Score = 38.9 bits (89), Expect(2) = 8e-08
 Identities = 13/26 (50%), Positives = 16/26 (61%)
 Frame = -1

Query: 451 DVHNLFIYTSWCGFPLYNADFGWGKP 374
           + H  +  +SWC  PLY A FGWG P
Sbjct: 360 ETHEPYTVSSWCKLPLYEASFGWGSP 385

>dbj|BAB01067.1| acetyltranferase-like protein [Arabidopsis thaliana]
          Length = 455

 Score = 39.7 bits (91), Expect(2) = 3e-07
 Identities = 18/40 (45%), Positives = 29/40 (72%)
 Frame = -2

Query: 339 NGMLLTDTRDGRGVEVIVNMEEEDIVRFERDVELLQYATL 220
           N + L DT++  G+E  VN+ E+++  FE+D ELLQ+A+L
Sbjct: 404 NIVTLLDTKEAGGIEAWVNLNEQEMNLFEQDRELLQFASL 443

 Score = 36.2 bits (82), Expect(2) = 3e-07
 Identities = 13/23 (56%), Positives = 17/23 (73%)
 Frame = -1

Query: 442 NLFIYTSWCGFPLYNADFGWGKP 374
           + +I++S C F LY  DFGWGKP
Sbjct: 370 DFYIFSSACRFGLYETDFGWGKP 392

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 512,781,270
Number of Sequences: 1393205
Number of extensions: 11514893
Number of successful extensions: 29026
Number of sequences better than 10.0: 56
Number of HSP's better than 10.0 without gapping: 27989
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 29003
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21243732558
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
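
The length figures in the statistics above are related by simple arithmetic: the effective database length is the database length minus the number of sequences times the effective HSP length. The sketch below checks this with the reported values. The variable names are hypothetical, and the effective query length of 74 is not stated in the report; it is only inferred here by dividing the search space by the effective database length.

```python
# Values copied from the BLASTX statistics block.
db_letters = 448_689_247      # length of database
db_sequences = 1_393_205      # number of sequences in database
eff_hsp_len = 116             # effective HSP length
search_space = 21_243_732_558 # effective search space used

# Each database sequence is shortened by the effective HSP length.
eff_db_len = db_letters - db_sequences * eff_hsp_len
print(eff_db_len)  # 287077467, the reported effective length of database

# Inferred (not reported): the factor that the search space implies
# for the effective query length.
implied_query_len, remainder = divmod(search_space, eff_db_len)
print(implied_query_len, remainder)  # 74 0
```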


EST assemble image


no. clone accession start end
1 MR022f05_f BP077692 1 429
2 MR010f11_f BP076726 10 394
3 MR055e08_f BP080246 33 396
4 MR049c03_f BP079775 33 479
5 GNf032a12 BP069668 34 414
6 MR010d04_f BP076704 34 547
7 MR036b03_f BP078761 34 532
8 MR010f02_f BP076718 34 522
9 MRL002h05_f BP083823 34 420
10 MR085c08_f BP082531 34 418
11 MR028f05_f BP078175 34 412
12 MR082c02_f BP082292 34 408
13 MR060d07_f BP080600 34 376
14 MR014e07_f BP077038 34 299
15 MR019f12_f BP077456 34 225
16 MR019h02_f BP077470 34 163
17 GENf057a10 BP060765 34 408
18 GENf027b12 BP059472 34 380
19 GENf015e02 BP058980 35 420
20 GNf061e01 BP071905 36 415
21 GNf084e07 BP073569 36 444
22 MR033g09_f BP078571 37 451
23 MR083h12_f BP082430 37 512
24 GNf091d02 BP074079 37 445
25 MR099g08_f BP083609 38 262
26 MR002a11_f BP076050 41 393
27 GNf055h03 BP071499 41 592
28 MR058f09_f BP080473 104 543
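
Since the assembly image itself is not available here, the clone table above can be summarized numerically. The following is a minimal sketch that computes how many ESTs cover a given position. It assumes that the last two numbers in each row are 1-based, inclusive start and end coordinates within the assembly; the function and variable names are hypothetical.

```python
# Rows copied from the clone table: index, clone, accession, start, end.
ROWS = """\
1 MR022f05_f BP077692 1 429
2 MR010f11_f BP076726 10 394
3 MR055e08_f BP080246 33 396
4 MR049c03_f BP079775 33 479
5 GNf032a12 BP069668 34 414
6 MR010d04_f BP076704 34 547
7 MR036b03_f BP078761 34 532
8 MR010f02_f BP076718 34 522
9 MRL002h05_f BP083823 34 420
10 MR085c08_f BP082531 34 418
11 MR028f05_f BP078175 34 412
12 MR082c02_f BP082292 34 408
13 MR060d07_f BP080600 34 376
14 MR014e07_f BP077038 34 299
15 MR019f12_f BP077456 34 225
16 MR019h02_f BP077470 34 163
17 GENf057a10 BP060765 34 408
18 GENf027b12 BP059472 34 380
19 GENf015e02 BP058980 35 420
20 GNf061e01 BP071905 36 415
21 GNf084e07 BP073569 36 444
22 MR033g09_f BP078571 37 451
23 MR083h12_f BP082430 37 512
24 GNf091d02 BP074079 37 445
25 MR099g08_f BP083609 38 262
26 MR002a11_f BP076050 41 393
27 GNf055h03 BP071499 41 592
28 MR058f09_f BP080473 104 543
"""

# Parse each row into (clone, accession, start, end).
clones = []
for line in ROWS.splitlines():
    _, clone, accession, start, end = line.split()
    clones.append((clone, accession, int(start), int(end)))


def depth(position):
    """Count ESTs whose assumed [start, end] interval contains the position."""
    return sum(1 for _, _, start, end in clones if start <= position <= end)


print(len(clones))  # 28
print(depth(34))    # 18
print(depth(100))   # 27
print(depth(550))   # 1
```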




Lotus japonicus
Kazusa DNA Research Institute