KMC001669A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC001669A_C01 KMC001669A_c01
atgttcaatCGCGTTCTAATATTCTGGAACTGGAGAAGCAAAGATAAATGGGAAGCCAAG
TAACTGTAGAAAACAACATAACCAAGGAAGAGAAAGTAACCTTTGAATATATTTGTATTA
CAAAGGTATATTTGAGCTACATCACTCTTAACAACCATAACAGATTCTGAAGCATGGCTG
AACAGAGCAATATTCAGATTTCAGACACATGCAATTGCGACACCCACTGGAATGTCACAG
GTCACAGTTAAACCATTAGAGGGGGAAAATTATTCATAAATATACTACCTCACCAAAGAT
AGAGCCTCTACCAAGATGACTTTTCTGAAGAAAGCCATTCTTGTGTTGCCTCCAAATCGG
GAAGAACAACTGCCCCAGATTTCCTCCACTCATCAGCATCTGGTGTCTTGAATCTGTCCA
ACAATAGTGCATGCATCCCTATGCTCTTAGCTGGCTCATAGTCTTTCCGCATGCTATCGC
CAATATGCAAAGTTTCTTCAGGTTTAATATTTCCAGCCCTCTCAAGTGCAATCTCATAAA
TTTTTGGGTTTGGCTTCTCAACGCCTTCAAGACCAGAAAACACGCCAAAGTCCCACTCAG
ATCCCTCATTTATACCCATAGCTGGAAGAATCACATCTGGATATCGATACTCTGCATTGC
TCACGAGGCCAACTTTGACACCCTTTCCACGAAGCCATCTAAGAAATGGTCGGGAGTCCG
GAAAGACAGTATAAGGAGCAGATGACCCAAAAGATGCATATATGCGTCTGAAAATTTTCT
CAAATGTTTCCTCATCATATTCATATCCAGCCCTGACAAAAGAATCACGCACGCAGGTTT
TCCACCACACAATGTTCGGCATTTTGGCTCCAAACCCGAAACAGGGATACTTCTTTGCCA
TGTCCTTATATGCAAGCTTAAATCCCTCATGTACACGTTTGTAGTCTGGGCACGGATGGC
CAGCAGCTTTAGCGGCCATGCAATAATAGTCACCAAGCTCCCCTTTGTAAGCCATTAGGG
TGCCGGTAACATCTACAGTGACACAGCGCAGTCTAGCCAATATTGACATGGTAGCACAAA
TTATAGAGAGGCCCTTCGTCAGTTAGGTTGTTCAGAAGAATTTCCAGTGCCGCAATGCCA
ATATAAATTTCAAGGAAGCAACGAATAGGGTAAGGTATTAAACTCCTTCAAAATTGATAT
TAAAGCCAGAACAGTGAAAGGTTGATGATGCTTTGTGTCCGAAATCGGAAATGCTCCAGA
AGAAGAaggttgagctagaatttcattgtatagaaaaatagatggtgtgggcagtgatgt
tgttgttgttgttgttgtttcatgcc


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001669A_C01 KMC001669A_c01
         (1346 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_199286.1| Dreg-2 like protein; protein id: At5g44730.1 [A...   413  e-114
pir||S56690 hypothetical protein (clone AFD1) - wild oat (fragme...   108  3e-22
ref|NP_077219.1| RIKEN cDNA 2810435D12 [Mus musculus] gi|1285049...   107  6e-22
ref|XP_216417.1| similar to RIKEN cDNA 2810435D12 [Mus musculus]...   104  4e-21
ref|NP_112496.1| hypothetical protein MGC12904 [Homo sapiens] gi...   100  9e-20

>ref|NP_199286.1| Dreg-2 like protein; protein id: At5g44730.1 [Arabidopsis thaliana]
            gi|2660676|gb|AAC79147.1| Dreg-2 like protein
            [Arabidopsis thaliana] gi|9758377|dbj|BAB08826.1| Dreg-2
            like protein [Arabidopsis thaliana]
          Length = 255

 Score =  413 bits (1061), Expect = e-114
 Identities = 194/251 (77%), Positives = 227/251 (90%), Gaps = 1/251 (0%)
 Frame = -2

Query: 1069 MSILARLRCVTVDVTGTLMAYKGELGDYYCMAAKAAGHPCPDYKRVHEGFKLAYKDMAKK 890
            +S+L++LRC+TVDVTGTL+AYKGELGDYYCMAAKA G PCPDYKRVHEGFKLAY DMA+K
Sbjct: 3    VSLLSKLRCITVDVTGTLIAYKGELGDYYCMAAKAIGLPCPDYKRVHEGFKLAYTDMAQK 62

Query: 889  YPCFGFGAKMPNIVWWKTCVRDSFVRAGYEYDEETFEKIFRRIYASFGSSAPYTVFPDSR 710
            YPCFGF AKMPNIVWWKTCVRDSFV+AGYEYDEETFEKIFRRIY++FGS+APY+VF DS+
Sbjct: 63   YPCFGFHAKMPNIVWWKTCVRDSFVKAGYEYDEETFEKIFRRIYSTFGSAAPYSVFQDSQ 122

Query: 709  PFLRWLRGKGVKVGLVSNAEYRYPDVILPAMGINEGSEWDFGVFSGLEGVEKPNPKIYEI 530
            PFLRW R KG+ VGLVSNAEYRY +VILP+ G+++ +EWDFGVFSG+EG+EKP+P+I+ +
Sbjct: 123  PFLRWARRKGLIVGLVSNAEYRYQEVILPSFGLSK-AEWDFGVFSGIEGIEKPDPRIFTL 181

Query: 529  ALERAG-NIKPEETLHIGDSMRKDYEPAKSIGMHALLLDRFKTPDADEWRKSGAVVLPDL 353
            ALERAG NI PEE LHIGDSMRKDY PAKSIGMHALL+DRFKT  A +W ++GA+VLPDL
Sbjct: 182  ALERAGNNIAPEEVLHIGDSMRKDYVPAKSIGMHALLVDRFKTEAAKDWIEAGAIVLPDL 241

Query: 352  EATQEWLSSEK 320
             A Q+ L S+K
Sbjct: 242  VAVQQLLESDK 252

>pir||S56690 hypothetical protein (clone AFD1) - wild oat (fragment)
           gi|726471|gb|AAA76739.1| putative ORF1
          Length = 89

 Score =  108 bits (269), Expect = 3e-22
 Identities = 50/65 (76%), Positives = 56/65 (85%)
 Frame = -2

Query: 523 ERAGNIKPEETLHIGDSMRKDYEPAKSIGMHALLLDRFKTPDADEWRKSGAVVLPDLEAT 344
           E AGNI PEE LHIGD+MRKDY PA+SIGMHALLLDRFKT DA+ W++SGA VLPDLEA 
Sbjct: 1   EMAGNIAPEEALHIGDTMRKDYVPARSIGMHALLLDRFKTADAESWKQSGAPVLPDLEAA 60

Query: 343 QEWLS 329
           Q WL+
Sbjct: 61  QAWLT 65

>ref|NP_077219.1| RIKEN cDNA 2810435D12 [Mus musculus] gi|12850490|dbj|BAB28741.1|
            unnamed protein product [Mus musculus]
            gi|13097531|gb|AAH03491.1| RIKEN cDNA 2810435D12 gene
            [Mus musculus] gi|29290603|emb|CAD83061.1| bN189G18.4.1
            (novel protein (2810435D12Rik), variant 1) [Mus musculus]
          Length = 251

 Score =  107 bits (266), Expect = 6e-22
 Identities = 66/212 (31%), Positives = 113/212 (53%)
 Frame = -2

Query: 1054 RLRCVTVDVTGTLMAYKGELGDYYCMAAKAAGHPCPDYKRVHEGFKLAYKDMAKKYPCFG 875
            ++R +T DV  TL+  +  +G+ Y   A+A G    D   V + F+ AY+  +  +P +G
Sbjct: 6    QMRLLTWDVKDTLIKLRRPVGEEYASKARAHGVVVEDIT-VEQAFRQAYRAQSHNFPNYG 64

Query: 874  FGAKMPNIVWWKTCVRDSFVRAGYEYDEETFEKIFRRIYASFGSSAPYTVFPDSRPFLRW 695
                + +  WWK  V  +F  AG   D +    +  ++Y  F S   + V   +   L+ 
Sbjct: 65   LSRGLTSRQWWKDVVLHTFRLAGVP-DAQAMTPVADQLYEDFSSPFTWQVLEGAEMTLKG 123

Query: 694  LRGKGVKVGLVSNAEYRYPDVILPAMGINEGSEWDFGVFSGLEGVEKPNPKIYEIALERA 515
             R +G+K+ +VSN + R  D IL  +G+ E   +DF + S   G  KP+P+I+  AL+RA
Sbjct: 124  CRKRGLKLAVVSNFDRRLED-ILTGLGLRE--HFDFVLTSEAVGCPKPDPRIFREALQRA 180

Query: 514  GNIKPEETLHIGDSMRKDYEPAKSIGMHALLL 419
              ++P    H+GDS   DY+ ++++GMH+ L+
Sbjct: 181  C-VEPAVAAHVGDSYLCDYQGSQAVGMHSFLV 211

>ref|XP_216417.1| similar to RIKEN cDNA 2810435D12 [Mus musculus] [Rattus norvegicus]
          Length = 251

 Score =  104 bits (259), Expect = 4e-21
 Identities = 64/212 (30%), Positives = 112/212 (52%)
 Frame = -2

Query: 1054 RLRCVTVDVTGTLMAYKGELGDYYCMAAKAAGHPCPDYKRVHEGFKLAYKDMAKKYPCFG 875
            ++R +T DV  TL+  +  +G+ Y   A+A G    +   V + F+ A++  +  +P +G
Sbjct: 6    QMRLLTWDVKDTLIKVRRPVGEEYASKARAHG-VLVEATAVEQAFRQAFRAQSHSFPNYG 64

Query: 874  FGAKMPNIVWWKTCVRDSFVRAGYEYDEETFEKIFRRIYASFGSSAPYTVFPDSRPFLRW 695
                + +  WW   V  +F  AG   D +    +  ++Y  F S   + V   +   L+ 
Sbjct: 65   LSLGLTSRQWWMDVVLHTFRLAGVP-DAQAMAPVADQLYEDFSSPFAWRVLEGAETTLKG 123

Query: 694  LRGKGVKVGLVSNAEYRYPDVILPAMGINEGSEWDFGVFSGLEGVEKPNPKIYEIALERA 515
             R +G+K+ +VSN + R  D IL  +G+ E   +DF + S   G  KP+P+I+  AL+ A
Sbjct: 124  CRKRGMKLAVVSNFDRRLED-ILTGLGLRE--HFDFVLTSEAVGCPKPDPRIFREALQLA 180

Query: 514  GNIKPEETLHIGDSMRKDYEPAKSIGMHALLL 419
              ++P    H+GDS R DY+ A+++GMH+ L+
Sbjct: 181  C-VEPSAAAHVGDSYRCDYQGARAVGMHSFLV 211

>ref|NP_112496.1| hypothetical protein MGC12904 [Homo sapiens]
            gi|13477173|gb|AAH05048.1|AAH05048 Similar to RIKEN cDNA
            2810435D12 gene [Homo sapiens] gi|22749572|gb|AAH31878.1|
            similar to hypothetical protein MGC12904 [Homo sapiens]
          Length = 251

 Score = 99.8 bits (247), Expect = 9e-20
 Identities = 61/212 (28%), Positives = 112/212 (52%)
 Frame = -2

Query: 1054 RLRCVTVDVTGTLMAYKGELGDYYCMAAKAAGHPCPDYKRVHEGFKLAYKDMAKKYPCFG 875
            ++R +T DV  TL+  +  LG+ Y   A+A G    +   + +GF+ AY+  +  +P +G
Sbjct: 6    QIRLLTWDVKDTLLRLRHPLGEAYATKARAHGLEV-EPSALEQGFRQAYRAQSHSFPNYG 64

Query: 874  FGAKMPNIVWWKTCVRDSFVRAGYEYDEETFEKIFRRIYASFGSSAPYTVFPDSRPFLRW 695
                + +  WW   V  +F  AG + D +    I  ++Y  F     + V   +   LR 
Sbjct: 65   LSHGLTSRQWWLDVVLQTFHLAGVQ-DAQAVAPIAEQLYKDFSHPCTWQVLDGAEDTLRE 123

Query: 694  LRGKGVKVGLVSNAEYRYPDVILPAMGINEGSEWDFGVFSGLEGVEKPNPKIYEIALERA 515
             R +G+++ ++SN + R  + IL  +G+ E   +DF + S   G  KP+P+I++ AL R 
Sbjct: 124  CRTRGLRLAVISNFDRRL-EGILGGLGLRE--HFDFVLTSEAAGWPKPDPRIFQEAL-RL 179

Query: 514  GNIKPEETLHIGDSMRKDYEPAKSIGMHALLL 419
             +++P    H+GD+   DY+  +++GMH+ L+
Sbjct: 180  AHMEPVVAAHVGDNYLCDYQGPRAVGMHSFLV 211

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,224,636,989
Number of Sequences: 1393205
Number of extensions: 29160443
Number of successful extensions: 138349
Number of sequences better than 10.0: 227
Number of HSP's better than 10.0 without gapping: 86845
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 131571
length of database: 448,689,247
effective HSP length: 127
effective length of database: 271,752,212
effective search space used: 87232460052
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPD006b02_f AV770376 1 105
2 MWM202e07_f AV767836 10 298
3 GENf008h12 BP058699 47 425
4 MPD042a12_f AV772838 51 558
5 GENf071g11 BP061419 59 401
6 GNf067e05 BP072346 80 517
7 GNf054f09 BP071413 87 513
8 GNf053d01 BP071306 92 306
9 GNf088a05 BP073829 93 620
10 SPD081f09_f BP050484 438 989
11 MFB031c08_f BP036261 811 1365




Lotus japonicus
Kazusa DNA Research Institute