KMC003977A_c01

FASTA sequence
>KMC003977A_C01 KMC003977A_c01
tcgcatttctcttcttctctcgtctcgtcacgtgaaggggttgttttctgcaaaaggagg
caatggacttctttttcaagGGAATGAGTGGTGATGGATCCGAGTGTCCCTTTGACGCGA
GCGACATCCAAAGATGCCCCTTTTTGAGAAACATTAATGAGCCTGCAAATTTCTGTTTCT
CTTTGGGAAAAGTCTCCATGCCTGTGGCACGGGGCTAAGGGTCCGATATTTGAGGATGGT
CCTAGTTTTGATACATGCATTTAAGCTATTTCATGGGAAAGATGGAGTTGTTCCTCTCTC
TGAGAGATCTGACTTTTATGATGGAAGTGCAGCAGCTGATTCTGTGCCTGTTTTCAATTC
CTTTACCTGGTAAAGTGCTGCAACCATAAGTCTGTCAGCCCTTGGAGTAGGTGGCCCATT
TGGCTTTTGGAATTTTTCCGAGAAATGGAAGAAGCAGAAGAATTCAGAATCATCAAGTAA
AAAAGAACACTCCTCTCAGAAAGGAGATGTATCAAAGCATGAAGCACTTGGAAATGAATG
GTTGAAAAGTGGGACTTGCCCAATGGCCAAGTCTTATAGAGCTGTAAGCCGTGTCCTCCC
TCTTGTTGCAATAGCTTTTAAGCCAGCAGCTGGGGTGAATCTTAGATGTCCTCCTGC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003977A_C01 KMC003977A_c01
         (657 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM66071.1| unknown [Arabidopsis thaliana]                         139  5e-43
ref|NP_680396.1| putative protein; protein id: At5g45410.2, supp...   136  3e-42
gb|AAK93585.2| unknown protein [Arabidopsis thaliana]                 136  3e-40
ref|NP_194233.1| putative protein; protein id: At4g25030.1, supp...   124  2e-35
gb|AAG46152.1|AC018727_4 unknown protein [Oryza sativa]               112  8e-34

>gb|AAM66071.1| unknown [Arabidopsis thaliana]
          Length = 343

 Score =  139 bits (350), Expect(3) = 5e-43
 Identities = 72/132 (54%), Positives = 89/132 (66%)
 Frame = +2

Query: 257 AFKLFHGKDGVVPLSERSDFYDGSAAADSVPVFNSFTW*SAATISLSALGVGGPFGFWNF 436
           AFKLFHGKDG+VPLS  +D  +  A   +   FN       ATISLSA G GGPFGF  F
Sbjct: 56  AFKLFHGKDGIVPLSGFADDSEDEAGRRAPLQFNPLAG-KVATISLSAFGPGGPFGFGPF 114

Query: 437 SEKWKKQKNSESSSKKEHSSQKGDVSKHEALGNEWLKSGTCPMAKSYRAVSRVLPLVAIA 616
           SEKWKKQ+     SK +   Q GD SKHEA+G+EWLK+G CP+AKS+RA S+V+PL++ A
Sbjct: 115 SEKWKKQQKKPKPSKNQ---QSGDSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKA 171

Query: 617 FKPAAGVNLRCP 652
               +G+  RCP
Sbjct: 172 LTLPSGMKYRCP 183

 Score = 46.2 bits (108), Expect(3) = 5e-43
 Identities = 24/41 (58%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
 Frame = +3

Query: 102 ECPFDA-SDIQRCPFLRNINEPANFCFSLGKVSMPVARG*G 221
           ECPF A S IQ+CPFLRNIN+P N  FS     +PV  G G
Sbjct: 4   ECPFAAESIIQKCPFLRNINKPTNLSFSSLSFPIPVQGGKG 44

 Score = 32.0 bits (71), Expect(3) = 5e-43
 Identities = 12/15 (80%), Positives = 13/15 (86%)
 Frame = +1

Query: 211 GAKGPIFEDGPSFDT 255
           G KGPIFEDGP FD+
Sbjct: 41  GGKGPIFEDGPGFDS 55

>ref|NP_680396.1| putative protein; protein id: At5g45410.2, supported by cDNA: 767.
           [Arabidopsis thaliana] gi|9758731|dbj|BAB09169.1|
           gene_id:MFC19.8~pir||T05524~similar to unknown protein
           [Arabidopsis thaliana] gi|27754261|gb|AAO22584.1|
           unknown protein [Arabidopsis thaliana]
          Length = 342

 Score =  136 bits (343), Expect(3) = 3e-42
 Identities = 72/132 (54%), Positives = 89/132 (66%)
 Frame = +2

Query: 257 AFKLFHGKDGVVPLSERSDFYDGSAAADSVPVFNSFTW*SAATISLSALGVGGPFGFWNF 436
           AFKLFHGKDG+VPLS  +D  +  A   ++  FN       ATISLSA G GGPFGF  F
Sbjct: 56  AFKLFHGKDGIVPLSGFADDSEDEAGRRALQ-FNPLAG-KVATISLSAFGPGGPFGFGPF 113

Query: 437 SEKWKKQKNSESSSKKEHSSQKGDVSKHEALGNEWLKSGTCPMAKSYRAVSRVLPLVAIA 616
           SEKWKKQ+     SK +   Q GD SKHEA+G+EWLK+G CP+AKS+RA S+V+PL++ A
Sbjct: 114 SEKWKKQQKKPKPSKNQ---QSGDSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKA 170

Query: 617 FKPAAGVNLRCP 652
                G+  RCP
Sbjct: 171 LTLPPGMKYRCP 182

 Score = 46.2 bits (108), Expect(3) = 3e-42
 Identities = 24/41 (58%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
 Frame = +3

Query: 102 ECPFDA-SDIQRCPFLRNINEPANFCFSLGKVSMPVARG*G 221
           ECPF A S IQ+CPFLRNIN+P N  FS     +PV  G G
Sbjct: 4   ECPFAAESIIQKCPFLRNINKPTNLSFSSLSFPIPVQGGKG 44

 Score = 32.0 bits (71), Expect(3) = 3e-42
 Identities = 12/15 (80%), Positives = 13/15 (86%)
 Frame = +1

Query: 211 GAKGPIFEDGPSFDT 255
           G KGPIFEDGP FD+
Sbjct: 41  GGKGPIFEDGPGFDS 55

>gb|AAK93585.2| unknown protein [Arabidopsis thaliana]
          Length = 331

 Score =  136 bits (343), Expect(3) = 3e-40
 Identities = 72/132 (54%), Positives = 89/132 (66%)
 Frame = +2

Query: 257 AFKLFHGKDGVVPLSERSDFYDGSAAADSVPVFNSFTW*SAATISLSALGVGGPFGFWNF 436
           AFKLFHGKDG+VPLS  +D  +  A   ++  FN       ATISLSA G GGPFGF  F
Sbjct: 45  AFKLFHGKDGIVPLSGFADDSEDEAGRRALQ-FNPLAG-KVATISLSAFGPGGPFGFGPF 102

Query: 437 SEKWKKQKNSESSSKKEHSSQKGDVSKHEALGNEWLKSGTCPMAKSYRAVSRVLPLVAIA 616
           SEKWKKQ+     SK +   Q GD SKHEA+G+EWLK+G CP+AKS+RA S+V+PL++ A
Sbjct: 103 SEKWKKQQKKPKPSKNQ---QSGDSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKA 159

Query: 617 FKPAAGVNLRCP 652
                G+  RCP
Sbjct: 160 LTLPPGMKYRCP 171

 Score = 39.3 bits (90), Expect(3) = 3e-40
 Identities = 18/32 (56%), Positives = 21/32 (65%)
 Frame = +3

Query: 126 IQRCPFLRNINEPANFCFSLGKVSMPVARG*G 221
           IQ+CPFLRNIN+P N  FS     +PV  G G
Sbjct: 2   IQKCPFLRNINKPTNLSFSSLSFPIPVQGGKG 33

 Score = 32.0 bits (71), Expect(3) = 3e-40
 Identities = 12/15 (80%), Positives = 13/15 (86%)
 Frame = +1

Query: 211 GAKGPIFEDGPSFDT 255
           G KGPIFEDGP FD+
Sbjct: 30  GGKGPIFEDGPGFDS 44

>ref|NP_194233.1| putative protein; protein id: At4g25030.1, supported by cDNA:
           16463. [Arabidopsis thaliana] gi|7485448|pir||T05524
           hypothetical protein F13M23.170 - Arabidopsis thaliana
           gi|4455246|emb|CAB36745.1| putative protein [Arabidopsis
           thaliana] gi|7269353|emb|CAB79412.1| putative protein
           [Arabidopsis thaliana] gi|21553767|gb|AAM62860.1|
           unknown [Arabidopsis thaliana]
          Length = 344

 Score =  124 bits (310), Expect(3) = 2e-35
 Identities = 68/132 (51%), Positives = 86/132 (64%)
 Frame = +2

Query: 257 AFKLFHGKDGVVPLSERSDFYDGSAAADSVPVFNSFTW*SAATISLSALGVGGPFGFWNF 436
           AF+LFHG+DGVVPLS+ +     + A   VPVF+      AATISLS+ G GGPFGF  F
Sbjct: 60  AFRLFHGQDGVVPLSDTAR----TEAQKPVPVFHPLAA-KAATISLSSFGSGGPFGFDAF 114

Query: 437 SEKWKKQKNSESSSKKEHSSQKGDVSKHEALGNEWLKSGTCPMAKSYRAVSRVLPLVAIA 616
           S+ +K QK    SSK +  +       HEA+G+EWLK+G CP+AKSYRAVS V PLVA  
Sbjct: 115 SDMFKNQKKKSDSSKNKGGN-------HEAMGDEWLKTGNCPIAKSYRAVSGVAPLVAKI 167

Query: 617 FKPAAGVNLRCP 652
            +P  G+  +CP
Sbjct: 168 LQPPPGMKFKCP 179

 Score = 36.2 bits (82), Expect(3) = 2e-35
 Identities = 15/20 (75%), Positives = 16/20 (80%)
 Frame = +3

Query: 123 DIQRCPFLRNINEPANFCFS 182
           +I RCPFLRNINEP N  FS
Sbjct: 15  NILRCPFLRNINEPTNLSFS 34

 Score = 31.6 bits (70), Expect(3) = 2e-35
 Identities = 12/13 (92%), Positives = 13/13 (99%)
 Frame = +1

Query: 217 KGPIFEDGPSFDT 255
           KGPIFEDGP+FDT
Sbjct: 47  KGPIFEDGPNFDT 59

>gb|AAG46152.1|AC018727_4 unknown protein [Oryza sativa]
          Length = 352

 Score =  112 bits (279), Expect(3) = 8e-34
 Identities = 66/133 (49%), Positives = 78/133 (58%)
 Frame = +2

Query: 257 AFKLFHGKDGVVPLSERSDFYDGSAAADSVPVFNSFTW*SAATISLSALGVGGPFGFWNF 436
           AF++FHG+DGVVPLS  S            P FN      AATISLSA G  G F F +F
Sbjct: 67  AFRVFHGQDGVVPLSHGSFERFEKPMPKPNPEFNPLAA-KAATISLSAFG--GFFSFGDF 123

Query: 437 SEKWKKQKNSESSSKKEHSSQKGDVSKHEALGNEWLKSGTCPMAKSYRAVSRVLPLVAIA 616
           S K + +KNS            G  + HEAL NEWL+ G CP+AKSYRA+S V+PLVA  
Sbjct: 124 SNK-RNKKNSNQKKPNNLPQNGGQPNNHEALSNEWLEMGQCPLAKSYRALSGVVPLVAKM 182

Query: 617 FKPAAGVNLRCPP 655
             P AG+ LRCPP
Sbjct: 183 MTPPAGMKLRCPP 195

 Score = 43.1 bits (100), Expect(3) = 8e-34
 Identities = 30/55 (54%), Positives = 34/55 (61%), Gaps = 4/55 (7%)
 Frame = +3

Query: 63  MDFFFKGMSGDGSECPFDAS---DIQRCPFLRNINEPANFCFSLGKVSMPV-ARG 215
           MD FF+  S D   C  D S    I+RCPFLRNINEP +F FS   V+ PV ARG
Sbjct: 1   MDPFFRRASSDPL-CLEDNSVQHGIERCPFLRNINEPTSFSFS--SVNFPVPARG 52

 Score = 31.2 bits (69), Expect(3) = 8e-34
 Identities = 12/14 (85%), Positives = 13/14 (92%)
 Frame = +1

Query: 211 GAKGPIFEDGPSFD 252
           G KGPIFEDGP+FD
Sbjct: 52  GDKGPIFEDGPNFD 65

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 599,313,885
Number of Sequences: 1393205
Number of extensions: 13287994
Number of successful extensions: 35896
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 34242
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 35822
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 28006887348
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
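
The summary figures above can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST report; it assumes the usual Karlin-Altschul conversion from raw score to bits, S' = (lambda * S - ln K) / ln 2, and the usual correction of the database length by the effective HSP length. The function names are hypothetical. Because lambda and K are printed to only three figures, the recomputed bit scores agree with the report to about a tenth of a bit for small scores and drift by a few tenths for the largest ones (for example, raw score 350 gives about 139.4 against the printed 139).

```python
import math

# Values copied from the search summary above.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 119
REPORTED_SEARCH_SPACE = 28_006_887_348

# Karlin-Altschul parameters as printed in the report (rounded): (lambda, K).
UNGAPPED = (0.318, 0.135)
GAPPED = (0.267, 0.0410)


def effective_db_length(letters: int, sequences: int, hsp_length: int) -> int:
    """Database length after removing one effective HSP length per sequence."""
    return letters - sequences * hsp_length


def bit_score(raw_score: int, lam: float, k: float) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff: int, lam: float) -> float:
    """Convert an X-dropoff value to bits: lambda * X / ln 2 (no K term)."""
    return lam * raw_dropoff / math.log(2)


if __name__ == "__main__":
    eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
    print("effective length of database:", eff_db)
    # The quotient is the effective query length implied by the report.
    print("implied effective query length:", REPORTED_SEARCH_SPACE // eff_db)
    for raw in (350, 108, 71):
        print(f"raw {raw} -> {bit_score(raw, *GAPPED):.1f} bits (gapped)")
    print(f"S1 41 -> {bit_score(41, *UNGAPPED):.1f} bits (ungapped)")
    print(f"X1 16 -> {dropoff_bits(16, UNGAPPED[0]):.1f} bits (ungapped)")
    print(f"X2 38 -> {dropoff_bits(38, GAPPED[0]):.1f} bits (gapped)")
```

With the printed values, the recomputed effective database length equals the reported 282,897,852, and the reported search space divides evenly by it. S1 and X1 reproduce the printed bit values with the ungapped parameters, while the HSP scores and X2 do so with the gapped ones.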


EST assembly


no.  clone        accession  start  end
1    GNf092g02    BP074188       1  386
2    MPD079b08_f  AV775164     152  657
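
To see how the two ESTs listed above tile the 657-letter contig, the following sketch computes their overlap and the total number of contig positions they cover. It assumes the two numbers on each row are 1-based, inclusive start and end coordinates on the contig; the listing does not state this explicitly. The `Est` type and the function names are hypothetical.

```python
from typing import List, NamedTuple, Optional, Tuple


class Est(NamedTuple):
    clone: str
    accession: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed inclusive


# Rows copied from the EST listing above.
ESTS = [
    Est("GNf092g02", "BP074188", 1, 386),
    Est("MPD079b08_f", "AV775164", 152, 657),
]
CONTIG_LENGTH = 657  # from "(657 letters)" in the BLASTX report


def overlap(a: Est, b: Est) -> Optional[Tuple[int, int]]:
    """Return the shared (start, end) interval, or None if the ESTs are disjoint."""
    lo, hi = max(a.start, b.start), min(a.end, b.end)
    return (lo, hi) if lo <= hi else None


def covered_positions(ests: List[Est]) -> int:
    """Count distinct contig positions covered by at least one EST."""
    covered = set()
    for est in ests:
        covered.update(range(est.start, est.end + 1))
    return len(covered)


if __name__ == "__main__":
    shared = overlap(ESTS[0], ESTS[1])
    if shared is not None:
        print("overlap:", shared, "length", shared[1] - shared[0] + 1)
    print("covered:", covered_positions(ESTS), "of", CONTIG_LENGTH)
```

Under that assumption the two reads share positions 152 to 386 (235 letters) and together cover the whole contig.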




Lotus japonicus
Kazusa DNA Research Institute