KMC002750A_c01

Fasta Sequence
>KMC002750A_C01 KMC002750A_c01
tAATAAATTATGATCTCATAATGGTATTTTGTTGTATTTGACATATTTTATCGCTTAACT
TGGCAATGTTTATTCATCTTGACATTTCATGTACATTACATATCAAGTTGGGGTTGCTAT
TCTTGTCGACCCAAAATGACCACTCTTCCTCCTACTCAGCACGACCAATGTCATTCAGGC
TGACCAAGTACATTATTACAAATAAGTCACTAATTGAGTTTTAGGAAAATCAAAGGGCGC
AAGATAGACATGAGAAAATATCCATGTTATCTAGCAGATGCAGAGAAGACTCTGTTGTAC
CATATAGAATGAAGAAGTCATGTGAAATATATCATATTTGACAGGGTTCTTATGCCAAAG
GAAATGCTGCTTCACCTTTTTGACCCTCATCAGCAATCTAAGATACCTTTTCTCTGGGAT
GGAAAGCAGTATATTTTTCAAATTTGGAATATCCTTCTCCAAAACAATAACAGCAAAGGA
TTCCCAATTCAAAACCTCAAAAAATGGTGGGACAAAATTGTCAGATATAATAACCGGAAC
ACATTCATAGGAAATGGCTTCC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002750A_C01 KMC002750A_c01
         (562 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_197954.1| putative protein; protein id: At5g25820.1 [Arab...   137  3e-34
ref|NP_195005.1| putative protein; protein id: At4g32790.1 [Arab...   135  2e-33
dbj|BAC42936.1| unknown protein [Arabidopsis thaliana]                135  2e-33
ref|NP_197468.1| putative protein; protein id: At5g19670.1 [Arab...   129  2e-32
dbj|BAB08970.1| contains similarity to limonene cyclase~gene_id:...   133  1e-31

>ref|NP_197954.1| putative protein; protein id: At5g25820.1 [Arabidopsis thaliana]
          Length = 654

 Score =  137 bits (346), Expect(2) = 3e-34
 Identities = 62/81 (76%), Positives = 70/81 (85%)
 Frame = -2

Query: 561 EAISYECVPVIISDNFVPPFFEVLNWESFAVIVLEKDIPNLKNILLSIPEKRYLRLLMRV 382
           EAI Y+CVPVIISDNFVPPFFEVLNWESFA+ + EKDIPNLK IL+SIPE RY  + MRV
Sbjct: 559 EAIFYDCVPVIISDNFVPPFFEVLNWESFAIFIPEKDIPNLKKILMSIPESRYRSMQMRV 618

Query: 381 KKVKQHFLWHKNPVKYDIFHM 319
           KKV++HFLWH  P KYD+FHM
Sbjct: 619 KKVQKHFLWHAKPEKYDMFHM 639

 Score = 29.3 bits (64), Expect(2) = 3e-34
 Identities = 11/14 (78%), Positives = 12/14 (85%)
 Frame = -3

Query: 317 LLHSIWYNRVFSAS 276
           +LHSIWYNRVF  S
Sbjct: 640 ILHSIWYNRVFQIS 653
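
The negative frame and the descending query coordinates in the alignment above mean the match lies on the reverse-complement strand of the contig. The sketch below is an illustration, not part of the BLAST output. It reverse-complements the last 22 bases of the contig (positions 541-562) and translates them in frame -2, which should reproduce the start of the first aligned segment (EAISYEC...). The function names are hypothetical, and the standard genetic code is assumed.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def translate_minus_frame(seq, frame):
    """Translate the minus strand of seq in frame 1, 2 or 3.

    Frame 1 starts at the last base of seq, frame 2 at the
    second-to-last base, and so on (BLAST reports these as -1, -2, -3).
    """
    rc = reverse_complement(seq)
    start = frame - 1
    codons = (rc[i:i + 3] for i in range(start, len(rc) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Last line of the contig sequence (positions 541-562).
contig_tail = "ACATTCATAGGAAATGGCTTCC"
print(translate_minus_frame(contig_tail, 2))
```

Because a minus-strand frame is counted from the 3' end of the query, only the tail of the contig is needed to check the first residues of the alignment.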

>ref|NP_195005.1| putative protein; protein id: At4g32790.1 [Arabidopsis thaliana]
           gi|7486402|pir||T04446 hypothetical protein F4D11.10 -
           Arabidopsis thaliana gi|3063691|emb|CAA18582.1| putative
           protein [Arabidopsis thaliana]
           gi|7270226|emb|CAB79996.1| putative protein [Arabidopsis
           thaliana]
          Length = 593

 Score =  135 bits (341), Expect(2) = 2e-33
 Identities = 62/81 (76%), Positives = 71/81 (87%)
 Frame = -2

Query: 561 EAISYECVPVIISDNFVPPFFEVLNWESFAVIVLEKDIPNLKNILLSIPEKRYLRLLMRV 382
           EA+ YECVPVIISDNFVPPFFEVLNWESFAV VLEKDIP+LKNIL+SI E+RY  + MRV
Sbjct: 500 EALFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPDLKNILVSITEERYREMQMRV 559

Query: 381 KKVKQHFLWHKNPVKYDIFHM 319
           K V++HFLWH  P ++DIFHM
Sbjct: 560 KMVQKHFLWHSKPERFDIFHM 580

 Score = 28.1 bits (61), Expect(2) = 2e-33
 Identities = 10/11 (90%), Positives = 11/11 (99%)
 Frame = -3

Query: 317 LLHSIWYNRVF 285
           +LHSIWYNRVF
Sbjct: 581 ILHSIWYNRVF 591

>dbj|BAC42936.1| unknown protein [Arabidopsis thaliana]
          Length = 270

 Score =  135 bits (341), Expect(2) = 2e-33
 Identities = 62/81 (76%), Positives = 71/81 (87%)
 Frame = -2

Query: 561 EAISYECVPVIISDNFVPPFFEVLNWESFAVIVLEKDIPNLKNILLSIPEKRYLRLLMRV 382
           EA+ YECVPVIISDNFVPPFFEVLNWESFAV VLEKDIP+LKNIL+SI E+RY  + MRV
Sbjct: 177 EALFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPDLKNILVSITEERYREMQMRV 236

Query: 381 KKVKQHFLWHKNPVKYDIFHM 319
           K V++HFLWH  P ++DIFHM
Sbjct: 237 KMVQKHFLWHSKPERFDIFHM 257

 Score = 28.1 bits (61), Expect(2) = 2e-33
 Identities = 10/11 (90%), Positives = 11/11 (99%)
 Frame = -3

Query: 317 LLHSIWYNRVF 285
           +LHSIWYNRVF
Sbjct: 258 ILHSIWYNRVF 268

>ref|NP_197468.1| putative protein; protein id: At5g19670.1 [Arabidopsis thaliana]
          Length = 600

 Score =  129 bits (325), Expect(2) = 2e-32
 Identities = 57/81 (70%), Positives = 70/81 (86%)
 Frame = -2

Query: 561 EAISYECVPVIISDNFVPPFFEVLNWESFAVIVLEKDIPNLKNILLSIPEKRYLRLLMRV 382
           E+I YECVPVIISDNFVPPFFEVL+W +F+VIV EKDIP LK+ILLSIPE +Y+++ M V
Sbjct: 504 ESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIPRLKDILLSIPEDKYVKMQMAV 563

Query: 381 KKVKQHFLWHKNPVKYDIFHM 319
           +K ++HFLWH  P KYD+FHM
Sbjct: 564 RKAQRHFLWHAKPEKYDLFHM 584

 Score = 30.8 bits (68), Expect(2) = 2e-32
 Identities = 12/16 (75%), Positives = 13/16 (81%)
 Frame = -3

Query: 317 LLHSIWYNRVFSASAR 270
           +LHSIWYNRVF A  R
Sbjct: 585 VLHSIWYNRVFQAKRR 600

>dbj|BAB08970.1| contains similarity to limonene cyclase~gene_id:K15O15.2
           [Arabidopsis thaliana]
          Length = 559

 Score =  133 bits (335), Expect(2) = 1e-31
 Identities = 62/81 (76%), Positives = 69/81 (84%)
 Frame = -2

Query: 561 EAISYECVPVIISDNFVPPFFEVLNWESFAVIVLEKDIPNLKNILLSIPEKRYLRLLMRV 382
           EAI  ECVPVII+DN+VPPFFEVLNWE FAV V EKDIPNL+NILLSIPE RY+ +  RV
Sbjct: 463 EAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRNILLSIPEDRYIGMQARV 522

Query: 381 KKVKQHFLWHKNPVKYDIFHM 319
           K V+QHFLWHK PVK+D FHM
Sbjct: 523 KAVQQHFLWHKKPVKFDQFHM 543

 Score = 24.6 bits (52), Expect(2) = 1e-31
 Identities = 9/16 (56%), Positives = 11/16 (68%)
 Frame = -3

Query: 317 LLHSIWYNRVFSASAR 270
           +LHSIWY+RV     R
Sbjct: 544 ILHSIWYSRVHRIKTR 559

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
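
The gapped Lambda and K values above relate the raw scores shown in parentheses in the alignments to the bit scores, through the usual Karlin-Altschul normalisation S' = (Lambda * S - ln K) / ln 2. The following is a minimal sketch of that arithmetic. It uses the rounded parameters as printed, so small differences from the reported values are expected. The function name is hypothetical.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report (rounded).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the first hit (NP_197954.1).
for raw in (346, 64):
    print(raw, round(bit_score(raw), 1))
```

For the short segment (raw score 64) the result agrees with the reported 29.3 bits. For raw score 346 the formula gives a value just under 138, while the report shows 137, which suggests that this BLAST version prints only the integer part of large bit scores.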

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 481,148,867
Number of Sequences: 1393205
Number of extensions: 10279573
Number of successful extensions: 30329
Number of sequences better than 10.0: 105
Number of HSP's better than 10.0 without gapping: 29568
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30323
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20095422690
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
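
The length figures in the statistics above are internally consistent. The effective database length equals the total letters minus the effective HSP length for each database sequence, and the effective search space is an exact multiple of that length. The sketch below checks this arithmetic. The variable names are hypothetical, and the implied effective query length is a derived value that the report itself does not print.

```python
# Values copied from the BLASTX statistics footer.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 116
EFFECTIVE_SEARCH_SPACE = 20_095_422_690


def effective_db_length(letters, sequences, hsp_length):
    """Subtract the effective HSP length once per database sequence."""
    return letters - sequences * hsp_length


eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)

# The search space divided by the effective database length gives the
# effective query length implied by the report (in amino acids).
implied_query_length, remainder = divmod(EFFECTIVE_SEARCH_SPACE, eff_db)

print(eff_db, implied_query_length, remainder)
```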


EST assemble image


No.  Clone        Accession  Start  End
1    GNLf008b05   BP075261       1  446
2    MFB073h11_f  BP039350       2  484
3    GNf080a07    BP073242      50  444
4    MRL001d12_f  BP083730      50  567




Lotus japonicus
Kazusa DNA Research Institute