KMC010165A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC010165A_C01 KMC010165A_c01
catgaagagaaattagatttcatttacattacagtcggctggtatcgaaacaagtactgt
tgaaatcaatgtttgcagaaCAAAAGAATGGAGGATTGAATTGACATGGCCACGCCATTA
ATTGCTAAAAGAATTAACAAAATAGAAACCATATGCAAACAACAAAAGGCAAAGCTAATA
GGAAAACAGAAGCTACTAGCATATTAACAAGAAGCCACAGCAATATCATTTAATCTATTA
GGCCAAAATGGTATGATGACAACATACAAAAATTCACTCAAAAACAGTGAAATATCATAT
TCTCAAAATGATCCAGACAATCATTTCCATCAGGGTTCAAACAGTCGTAAGGTAGCAGCA
TCGAAGTTGCAAAGTGAACCACTTTGTCCCATGTAACTATAATTCATGAATCCTAGTTAA
GATTTTATGAGCAGTTTAAAGCAAAGAAACTCATTATATGACAATAGGCAACACCAACTA
CAATTCAAACTAGACAGAAAGTAAGAAACATGCCAAATTGCAAAGTAGCAGGTGCCTAAA
GTATAAAAGAAGCAGGAACATTTCAGTTCACAACATCCAAAGTCAATCTATTAATCTTGG
TTTCCATATAAATATCCAAATCCACAAAACAATTACATGATAAATATCAAATTCCAAAAC
CAAAAACTGCACGTATCCTTTGGAAAATGAGATCCTTCCACCCCCTTTCCGTGTTCTCCA
CTACAGTGGCATTCCCATACCTATCTTCTTCAGGAACATCACTATGTGTCAGCTTAACAA
CAGTAACACCAGGTTCAGGTTCCTCAAACACAAGCCTCACCGTGGAAACAACCCCATCAT
TCCAGCTCCCAAATCTCCATCTCTGCACAATCAACTTCGCTTCCTTCAACTCCAAATTAG
TCCCTGTCACCGACCCATCAAAAATGCTGAAATCCCCACCAACCTCCTTATTAATCCTCG
CATTGCTCTGCGTGAAACCCTTCCATCTATTCTCATCCATCAAAATCTCATACAAATCCC
TCGCCCTGCAATTGAACCTCTCCGTCAATCTAATACTCTTCACCCCCTTCTTCACCTCCT
TCGTCGGCGCAGCA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC010165A_C01 KMC010165A_c01
         (1094 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_566410.1| expressed protein; protein id: At3g12050.1, sup...   246  6e-64
gb|AAM53279.1| unknown protein [Arabidopsis thaliana] gi|2319767...   246  6e-64
ref|NP_473204.2| hypothetical protein, conserved [Plasmodium fal...    79  1e-13
ref|NP_666148.1| cDNA sequence BC023857; hypothetical protein MG...    75  2e-12
ref|NP_610121.2| CG1416-PA [Drosophila melanogaster] gi|24585722...    74  5e-12

>ref|NP_566410.1| expressed protein; protein id: At3g12050.1, supported by cDNA: 38478.
            [Arabidopsis thaliana] gi|10998146|dbj|BAB03117.1|
            gene_id:MEC18.18~unknown protein [Arabidopsis thaliana]
            gi|12322021|gb|AAG51059.1|AC069473_21 unknown protein;
            42843-40829 [Arabidopsis thaliana]
            gi|21593383|gb|AAM65332.1| unknown [Arabidopsis thaliana]
          Length = 360

 Score =  246 bits (627), Expect = 6e-64
 Identities = 113/143 (79%), Positives = 131/143 (91%)
 Frame = -2

Query: 1078 EVKKGVKSIRLTERFNCRARDLYEILMDENRWKGFTQSNARINKEVGGDFSIFDGSVTGT 899
            + K+G K+I +TE+FNCRARDLYEILMDENRWKGFTQSNA+I+K+V G  S+FDGSVTG 
Sbjct: 218  KTKEGFKTITMTEKFNCRARDLYEILMDENRWKGFTQSNAKISKDVNGPISVFDGSVTGM 277

Query: 898  NLELKEAKLIVQRWRFGSWNDGVVSTVRLVFEEPEPGVTVVKLTHSDVPEEDRYGNATVV 719
            NLEL+E KLIVQ+WRFGSW DG+ STV++VFEEP+PGVT+V LTH+DVPEEDRYGNATVV
Sbjct: 278  NLELEEGKLIVQKWRFGSWPDGLDSTVKIVFEEPQPGVTIVNLTHTDVPEEDRYGNATVV 337

Query: 718  ENTERGWKDLIFQRIRAVFGFGI 650
            ENTERGW+DLIF RIRAVFGFGI
Sbjct: 338  ENTERGWRDLIFHRIRAVFGFGI 360

>gb|AAM53279.1| unknown protein [Arabidopsis thaliana] gi|23197674|gb|AAN15364.1|
            unknown protein [Arabidopsis thaliana]
          Length = 360

 Score =  246 bits (627), Expect = 6e-64
 Identities = 113/143 (79%), Positives = 131/143 (91%)
 Frame = -2

Query: 1078 EVKKGVKSIRLTERFNCRARDLYEILMDENRWKGFTQSNARINKEVGGDFSIFDGSVTGT 899
            + K+G K+I +TE+FNCRARDLYEILMDENRWKGFTQSNA+I+K+V G  S+FDGSVTG 
Sbjct: 218  KTKEGFKTITMTEKFNCRARDLYEILMDENRWKGFTQSNAKISKDVNGPISVFDGSVTGI 277

Query: 898  NLELKEAKLIVQRWRFGSWNDGVVSTVRLVFEEPEPGVTVVKLTHSDVPEEDRYGNATVV 719
            NLEL+E KLIVQ+WRFGSW DG+ STV++VFEEP+PGVT+V LTH+DVPEEDRYGNATVV
Sbjct: 278  NLELEEGKLIVQKWRFGSWPDGLDSTVKIVFEEPQPGVTIVNLTHTDVPEEDRYGNATVV 337

Query: 718  ENTERGWKDLIFQRIRAVFGFGI 650
            ENTERGW+DLIF RIRAVFGFGI
Sbjct: 338  ENTERGWRDLIFHRIRAVFGFGI 360

>ref|NP_473204.2| hypothetical protein, conserved [Plasmodium falciparum 3D7]
            gi|15383895|emb|CAB39022.2| hypothetical protein,
            conserved [Plasmodium falciparum 3D7]
          Length = 140

 Score = 79.0 bits (193), Expect = 1e-13
 Identities = 39/135 (28%), Positives = 68/135 (49%), Gaps = 1/135 (0%)
 Frame = -2

Query: 1057 SIRLTERFNCRARDLYEILMDENRWKGFTQSN-ARINKEVGGDFSIFDGSVTGTNLELKE 881
            S  +TE +      L+    D       ++ + A ++ +VGG FS+F GS+ G   E+ +
Sbjct: 2    SFEITEEYYVPPEVLFNAFTDAYTLTRLSRGSLAEVDLKVGGKFSLFSGSILGEFTEITK 61

Query: 880  AKLIVQRWRFGSWNDGVVSTVRLVFEEPEPGVTVVKLTHSDVPEEDRYGNATVVENTERG 701
               IV++W+F  WN+   STV + F   +   T +KLTH+++P  ++Y    V+E  + G
Sbjct: 62   PHKIVEKWKFRDWNECDYSTVTVEFISVKENHTKLKLTHNNIPASNKYNEGGVLERCKNG 121

Query: 700  WKDLIFQRIRAVFGF 656
            W       I  + G+
Sbjct: 122  WTQNFLHNIEVILGY 136

>ref|NP_666148.1| cDNA sequence BC023857; hypothetical protein MGC36589 [Mus musculus]
            gi|19344046|gb|AAH25552.1| Similar to chromosome 14 open
            reading frame 3 [Mus musculus] gi|23272235|gb|AAH23857.1|
            hypothetical protein MGC36589 [Mus musculus]
          Length = 338

 Score = 75.1 bits (183), Expect = 2e-12
 Identities = 48/153 (31%), Positives = 75/153 (48%), Gaps = 6/153 (3%)
 Frame = -2

Query: 1093 AAPTKEVKK--GVK----SIRLTERFNCRARDLYEILMDENRWKGFTQSNARINKEVGGD 932
            +AP+K   K  GVK     I L E F     +LY +   +   + FT + A +  + GG 
Sbjct: 190  SAPSKSQAKPVGVKIPTCKITLKETFLTSPEELYRVFTTQELVQAFTHAPAALEADRGGK 249

Query: 931  FSIFDGSVTGTNLELKEAKLIVQRWRFGSWNDGVVSTVRLVFEEPEPGVTVVKLTHSDVP 752
            F + DG+VTG   +L   K I  +WRF SW +G  +T+ L F + + G T + +    +P
Sbjct: 250  FHMVDGNVTGEFTDLVPEKHIAMKWRFKSWPEGHFATITLTFID-KNGETELCMEGRGIP 308

Query: 751  EEDRYGNATVVENTERGWKDLIFQRIRAVFGFG 653
              +        E T +GW+   F+ I+  FG+G
Sbjct: 309  APEE-------ERTRQGWQRYYFEGIKQTFGYG 334

>ref|NP_610121.2| CG1416-PA [Drosophila melanogaster] gi|24585722|ref|NP_724361.1|
            CG1416-PB [Drosophila melanogaster]
            gi|24585724|ref|NP_724362.1| CG1416-PC [Drosophila
            melanogaster] gi|21464348|gb|AAM51977.1| LD43819p
            [Drosophila melanogaster] gi|22947044|gb|AAF57232.2|
            CG1416-PA [Drosophila melanogaster]
            gi|22947045|gb|AAN11136.1| CG1416-PB [Drosophila
            melanogaster] gi|22947046|gb|AAN11137.1| CG1416-PC
            [Drosophila melanogaster]
          Length = 354

 Score = 73.6 bits (179), Expect = 5e-12
 Identities = 43/140 (30%), Positives = 68/140 (47%)
 Frame = -2

Query: 1072 KKGVKSIRLTERFNCRARDLYEILMDENRWKGFTQSNARINKEVGGDFSIFDGSVTGTNL 893
            K  V+++ +TE F+C A DLY  L        FT++ A+++   GG+F ++ G+V G   
Sbjct: 216  KLDVRTLSMTEEFHCSANDLYNALTKPEMVTAFTRAPAKVDAVRGGEFILYGGNVLGKFE 275

Query: 892  ELKEAKLIVQRWRFGSWNDGVVSTVRLVFEEPEPGVTVVKLTHSDVPEEDRYGNATVVEN 713
            EL   K I Q WR  +W  G  S V +  EE     T++ L  + +P       A+  + 
Sbjct: 276  ELVPEKKIQQSWRLKNWTSGHYSNVVIELEETSSS-TMMSLKQTGIP-------ASEFDA 327

Query: 712  TERGWKDLIFQRIRAVFGFG 653
             +  W    +  I+  FGFG
Sbjct: 328  MKTNWYRYYWHSIKQTFGFG 347

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 951,972,745
Number of Sequences: 1393205
Number of extensions: 22398477
Number of successful extensions: 64373
Number of sequences better than 10.0: 97
Number of HSP's better than 10.0 without gapping: 58302
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 63960
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 65614730658
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MR058b12_f BP080437 1 529
2 MF012e01_f BP028871 456 850
3 MF030h12_f BP029883 564 796
4 MR076f04_f BP081863 564 1094




Lotus japonicus
Kazusa DNA Research Institute