KMC003917A_c01

Fasta Sequence
>KMC003917A_C01 KMC003917A_c01
gctctgtGTACTATAATGAAAGAATGGTTCATAAAATAGCACATTAATAACATCCATAGA
GATGTAAAACCAATACAAAATATTGACCCTATTACATTATGAAATCTGCCCTTAGTATAC
AGTTCTCATGGCATTACATCTAATTAAAAAGAAATGCTATCAAACATTATATACATTAAT
ACTCATTTTTTAAGAGATAAAAAATAACTTATTAATTGATACTTGATACAACTAAGATCT
CTCTACATTCGCATGTATTCACCTTTTCCCAGAAAACATTGTGCCGCAGTTCACTAGATG
AAAGGAAATAATGCATACTCAGCATAGAATAGATTCCATATGAATTGATAGAATCCTGAT
ATTGCTTCCTTTGTATGATTTGCTTGTTCTAATATCTTAACCTGGTAAATCAAACTTGAT
GCAAGAATCATATGAGCTGGTATCATTAACCAACGCCTGAAAGCCTGTGGCATGTAAATT
GCTGCCAATATGGAAACAATATAGTTCATCAGCAAAATTCCAGAACCAAGGAAAGCAATG
TTCCGAACTCCTAATTTTGTTGCAAAGGTAGATATCTGATACTTGCGATCTCCTTCAACA
TCTGGAAGATCTTTTGTTATAG
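
The BLASTX hits reported below all align in frame -3, meaning the protein is read from the reverse-complement strand starting at the third base from the 3' end of the query. The following is a minimal Python sketch of that translation, assuming the standard genetic code (NCBI translation table 1). The function names are illustrative and are not part of BLAST or of any Kazusa tool. For brevity it uses only the last 22 bases of the sequence above (positions 601-622).

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq, frame):
    """Translate in a BLAST-style frame (+1..+3 or -1..-3).

    Negative frames are read from the reverse complement.
    Codons containing unexpected characters are rendered as 'X'.
    """
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    start = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(start, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)


tail = "TCTGGAAGATCTTTTGTTATAG"  # last 22 bases of the contig
print(translate_frame(tail, -3))
```

The printed peptide, ITKDLP, is the start of the query line that begins at position 620 in the alignments below.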


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003917A_C01 KMC003917A_c01
         (622 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_187801.1| hypothetical protein; protein id: At3g11950.1 [...   162  3e-39
dbj|BAB03104.1| dbj|BAA17774.1~gene_id:MEC18.5~similar to unknow...   117  1e-25
ref|NP_179485.2| hypothetical protein; protein id: At2g18950.1, ...    77  1e-13
ref|ZP_00070982.1| hypothetical protein [Trichodesmium erythraeu...    76  4e-13
gb|ZP_00107768.1| hypothetical protein [Nostoc punctiforme]            74  2e-12

>ref|NP_187801.1| hypothetical protein; protein id: At3g11950.1 [Arabidopsis thaliana]
          Length = 970

 Score =  162 bits (411), Expect = 3e-39
 Identities = 83/124 (66%), Positives = 96/124 (76%), Gaps = 16/124 (12%)
 Frame = -3

Query: 620  ITKDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMPQ-------- 465
            ITKDLPDVEGDRK+QIST ATKLGVRNIAFLGSG+LL+NY+ +I  A YMPQ        
Sbjct: 847  ITKDLPDVEGDRKFQISTLATKLGVRNIAFLGSGLLLVNYVSAISLAFYMPQYAALKRPT 906

Query: 464  --------AFRRWLMIPAHMILASSLIYQVKILEQANHTKEAISGFYQFIWNLFYAEYAL 309
                     FR  LMIPAH+ILAS LI+Q  +LE+AN+TKEAISG+Y+FIWNLFYAEY L
Sbjct: 907  LLSFNNEQVFRGSLMIPAHVILASGLIFQTWVLEKANYTKEAISGYYRFIWNLFYAEYLL 966

Query: 308  FPFI 297
            FPF+
Sbjct: 967  FPFL 970

>dbj|BAB03104.1| dbj|BAA17774.1~gene_id:MEC18.5~similar to unknown protein
           [Arabidopsis thaliana]
          Length = 441

 Score =  117 bits (293), Expect = 1e-25
 Identities = 63/100 (63%), Positives = 73/100 (73%), Gaps = 16/100 (16%)
 Frame = -3

Query: 620 ITKDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMP--------- 468
           ITKDLPDVEGDRK+QIST ATKLGVRNIAFLGSG+LL+NY+ +I  A YMP         
Sbjct: 295 ITKDLPDVEGDRKFQISTLATKLGVRNIAFLGSGLLLVNYVSAISLAFYMPQYAALKRPT 354

Query: 467 -------QAFRRWLMIPAHMILASSLIYQVKILEQANHTK 369
                  Q FR  LMIPAH+ILAS LI+Q  +LE+AN+TK
Sbjct: 355 LLSFNNEQVFRGSLMIPAHVILASGLIFQTWVLEKANYTK 394

>ref|NP_179485.2| hypothetical protein; protein id: At2g18950.1, supported by cDNA:
           gi_17104827, supported by cDNA: gi_17380873, supported
           by cDNA: gi_20384918, supported by cDNA: gi_21281071
           [Arabidopsis thaliana]
           gi|17104828|gb|AAL35412.1|AF324344_1 tocopherol
           polyprenyltransferase [Arabidopsis thaliana]
           gi|17380874|gb|AAL36249.1| unknown protein [Arabidopsis
           thaliana] gi|20384919|gb|AAM10489.1| homogentisate
           phytylprenyltransferase [Arabidopsis thaliana]
           gi|21281072|gb|AAM45041.1| unknown protein [Arabidopsis
           thaliana]
          Length = 393

 Score = 77.4 bits (189), Expect = 1e-13
 Identities = 39/106 (36%), Positives = 65/106 (60%)
 Frame = -3

Query: 614 KDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMPQAFRRWLMIPA 435
           KD+PD+EGD+ + I +F+  LG + + +    +L M Y V+IL     P  + + + +  
Sbjct: 289 KDIPDIEGDKIFGIRSFSVTLGQKRVFWTCVTLLQMAYAVAILVGATSPFIWSKVISVVG 348

Query: 434 HMILASSLIYQVKILEQANHTKEAISGFYQFIWNLFYAEYALFPFI 297
           H+ILA++L  + K ++ ++ T+  I+  Y FIW LFYAEY L PF+
Sbjct: 349 HVILATTLWARAKSVDLSSKTE--ITSCYMFIWKLFYAEYLLLPFL 392

>ref|ZP_00070982.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 349

 Score = 75.9 bits (185), Expect = 4e-13
 Identities = 36/106 (33%), Positives = 62/106 (57%)
 Frame = -3

Query: 620 ITKDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMPQAFRRWLMI 441
           I KD+PD+EGDR+Y I+TF  KLG   +  L   +L   Y+  ++  +    +   + ++
Sbjct: 241 IFKDIPDIEGDRQYNINTFTIKLGAFAVFNLARWVLTFCYLGMVMVGVVWLASVNLFFLV 300

Query: 440 PAHMILASSLIYQVKILEQANHTKEAISGFYQFIWNLFYAEYALFP 303
            +H++    + +  + ++   H K+AI+ FYQFIW LF+ EY +FP
Sbjct: 301 ISHLLALGIMWWFSQRVDL--HDKKAIADFYQFIWKLFFLEYLIFP 344

>gb|ZP_00107768.1| hypothetical protein [Nostoc punctiforme]
          Length = 322

 Score = 73.9 bits (180), Expect = 2e-12
 Identities = 42/117 (35%), Positives = 64/117 (53%)
 Frame = -3

Query: 620 ITKDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMPQAFRRWLMI 441
           I KD+PD+EGDR Y I+TF  KLG + +  L   ++ + Y+  IL  +    +     +I
Sbjct: 213 IFKDIPDIEGDRLYNITTFTIKLGSQAVFNLALWVITVCYLGIILVGVLRIASVNPIFLI 272

Query: 440 PAHMILASSLIYQVKILEQANHTKEAISGFYQFIWNLFYAEYALFPFI**TAAQCFL 270
            AH+ L   + ++   ++  +  K AI+ FYQFIW LF+ EY +FP        CFL
Sbjct: 273 TAHLALLVWMWWRSLAVDLQD--KSAIAQFYQFIWKLFFIEYLIFPI------ACFL 321

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 483,835,859
Number of Sequences: 1393205
Number of extensions: 9611271
Number of successful extensions: 19573
Number of sequences better than 10.0: 32
Number of HSP's better than 10.0 without gapping: 19154
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 19565
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25017613016
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
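
The bit scores and E-values in this report can be approximately reproduced from the statistics listed above using the Karlin-Altschul relations: bits = (Lambda * raw - ln K) / ln 2 and E = (effective search space) * 2^(-bits). The sketch below is in Python and takes the gapped Lambda and K and the effective search space from the report. Because those values and the printed bit scores are rounded, and because it is an assumption here that the single reported search space applies to every hit, the results should match the report only approximately. The function names are illustrative.

```python
import math

# Values copied from the report (gapped statistics).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 25017613016


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(bits):
    """Expected number of chance hits scoring at least `bits`."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bits)


# Raw scores of the five hits, as printed in parentheses in the report.
for raw in (411, 293, 189, 185, 180):
    bits = bit_score(raw)
    print(f"raw={raw:4d}  bits={bits:6.1f}  E={expect_value(bits):.1e}")
```

For example, the raw score 189 of the At2g18950.1 hit converts to roughly 77.4 bits, in line with the reported value.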


EST assembly


No.  Clone         Accession  Start  End
  1  SPD002g06_f   BP044188       1  508
  2  GNf086b10     BP073692       8  126
  3  MPD087b09_f   AV775698      10  474
  4  MF065b02_f    BP031736      15  431
  5  MF057e09_f    BP031304      30  480
  6  SPD048e11_f   BP047838      31  537
  7  MPDL062c10_f  AV779648      32  606
  8  SPD023b08_f   BP045792      51  630
  9  SPD023c04_f   BP045797      52  631
 10  MPD074d05_f   AV774850     106  522
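
The clone table above can be summarized as a per-position coverage depth. The Python sketch below assumes that the two position values of each clone are inclusive start and end coordinates in the assembly; that interpretation is not stated in the record. The function name is illustrative.

```python
# (clone, start, end) copied from the table above; coordinates are
# assumed to be 1-based and inclusive.
ESTS = [
    ("SPD002g06_f", 1, 508),
    ("GNf086b10", 8, 126),
    ("MPD087b09_f", 10, 474),
    ("MF065b02_f", 15, 431),
    ("MF057e09_f", 30, 480),
    ("SPD048e11_f", 31, 537),
    ("MPDL062c10_f", 32, 606),
    ("SPD023b08_f", 51, 630),
    ("SPD023c04_f", 52, 631),
    ("MPD074d05_f", 106, 522),
]


def coverage_depth(ests):
    """Return (first, last, depth), where depth[i] is the number of
    clones covering assembly position first + i."""
    first = min(start for _, start, _ in ests)
    last = max(end for _, _, end in ests)
    depth = [0] * (last - first + 1)
    for _, start, end in ests:
        for pos in range(start, end + 1):
            depth[pos - first] += 1
    return first, last, depth


first, last, depth = coverage_depth(ESTS)
print(f"span: {first}-{last}")
print(f"max depth: {max(depth)}, min depth: {min(depth)}")
```

Under that assumption, all ten clones overlap between positions 106 and 126, and the ends of the assembly are supported by a single clone each.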




Lotus japonicus
Kazusa DNA Research Institute