KMC001869A_c01

Fasta Sequence
>KMC001869A_C01 KMC001869A_c01
ctttcagaATAAAAAAAATCAGACGTAAAATAAATATAACTATCCCATAAATAAAATAAG
ATGTTTATAATAGACTCAACCTATCTGCTTTCCTATCCCCGAATTTAACATATATTGATA
GTACATGAGCCTGATAACCAACCAAACATCATAAAAGGCACTGATGAAATAACTTTCTTT
TTAGTTCTTTTTAAATTTTCAACCGCCATGTGATCGAGTTTTGTCTACCAGATCTCACCA
GCTGCTTAACTGTTTCTGTACTTTAGAACTGTGAGATGACCTGCCACCTTTGTCTTTGTA
AGAATTCCTGGATGCAATAGTCTTCCGTCCCCGACGGGCGGATGCTATCGCATGTCGTCT
CTGCTTATTCAAGCTCTTATTCAGTTCAGCGTCATTTTCAATTTCATTTTGGTCATGCTC
TTCCTCATCACTAGCATCTCCCTTGTCAGATGCATAGCTTTGACCAGCTTCACAGCTCTC
CTCTATTCCTTTACTTTCATGTTCATCATCCTGCTCCGACAAATATGAAAAATCATCATC
TTTAATGATTGCTTCATTCAGATCTTCCACTGAGTCAACCCCTTCACTATCTGAATTTGA
ATCACTCTCTGGCGTTCCTTCAATGAACCTCTGAATATCCTCTGCATCCTTTTTAGTAAA
GCCACTGGCAGCAAGTTCCCTATCTAGAGAACCAGTGGCTCTGTCTATTTCCGATAAACA
GGGGCTTCCATCTTCATCTTTCCCTTCATCAGAATCATCAATATCATCTATGCTTTCTTG
AAAAGTAAGATTGAACCTCTTTCTGAAAAACTTGAAGATACATTCAATATCACGGTCGAA
GTACATCTGTGCATTACGGTGTGACACGGATACCATTTGTGGAAAATCAATCACGGTAAT
CTTTTCATCATCATTGATCATGATGTTAAACTCATTAAAATCGCAATGAATGAGACC
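Every BLASTX hit in the Nr search section of this record is reported with Frame = -1, so the aligned protein comes from translating the reverse complement of the contig, starting at its last base. The following is a minimal Python sketch of that translation, assuming the standard genetic code. The helper names `reverse_complement` and `translate` are illustrative only and are not part of BLAST or of this database. Only the last 21 nt of the contig are used to keep the example short.

```python
# Standard genetic code (an assumption of this sketch), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T, N only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq.upper()))

def translate(seq):
    """Translate a DNA string from its first base; unknown codons become X."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )

# Last 21 nt of KMC001869A_c01 (contig positions 937-957).
contig_tail = "AAAATCGCAATGAATGAGACC"

# Frame -1: reverse-complement the contig, then translate from base 1.
peptide = translate(reverse_complement(contig_tail))
print(peptide)
```

The resulting peptide corresponds to the first seven residues of the Query lines that begin at position 957 in the alignments.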


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001869A_C01 KMC001869A_c01
         (957 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_190695.1| putative protein; protein id: At3g51270.1 [Arab...   217  2e-55
dbj|BAB92640.1| B1099D03.17 [Oryza sativa (japonica cultivar-gro...   204  2e-51
gb|AAH47169.1| Similar to hypothetical protein FLJ11159 [Danio r...   113  5e-24
ref|NP_080210.1| RIKEN cDNA 2010110K24 [Mus musculus] gi|1284253...   102  1e-20
ref|XP_217975.1| similar to RIKEN cDNA 2010110K24 [Mus musculus]...   102  1e-20

>ref|NP_190695.1| putative protein; protein id: At3g51270.1 [Arabidopsis thaliana]
           gi|11357630|pir||T45758 hypothetical protein F24M12.310
           - Arabidopsis thaliana gi|6562279|emb|CAB62649.1|
           putative protein [Arabidopsis thaliana]
          Length = 472

 Score =  217 bits (552), Expect = 2e-55
 Identities = 119/247 (48%), Positives = 161/247 (65%), Gaps = 7/247 (2%)
 Frame = -1

Query: 957 GLIHCDFNEFNIMINDDEKITVIDFPQMVSVSHRNAQMYFDRDIECIFKFFRKRFNLTFQ 778
           GLIHCDFNEFNIMI+D+EKIT+IDFPQMVSVSHRNAQMYFDRDIECIFKFFRKRFN++F 
Sbjct: 232 GLIHCDFNEFNIMIDDEEKITMIDFPQMVSVSHRNAQMYFDRDIECIFKFFRKRFNMSFH 291

Query: 777 ESIDDIDDSDEGKDEDGSPCLSEIDRATGSLDRELAASGFTKKDAEDIQRFIEGTPESDS 598
           E   + ++++   DE+  P   +I +   +LD++L ASGFT+K+  D+ +FIEG  E   
Sbjct: 292 EDKGESEETE--VDENSRPSFFDITKDANALDKDLEASGFTRKEQTDLDKFIEGGVEKSE 349

Query: 597 NSDSEGVDSVEDLNEAIIKDDDFSYLSE------QDDEHESK-GIEESCEAGQSYASDKG 439
           +SD    D   D  E   + ++   L+E      QD E +S  G+E   E   +   +  
Sbjct: 350 DSDE---DEESDDEEQTCESNEEGNLNEIKSLQLQDKEQKSSDGVEAEVELDNTENGESN 406

Query: 438 DASDEEEHDQNEIENDAELNKSLNKQRRHAIASARRGRKTIASRNSYKDKGGRSSHSSKV 259
              DE   ++ E E +AEL K+L K RR A+A+AR  RK+ +SRN+YKDK GR S +SK+
Sbjct: 407 GDEDEVGSNEEEEEKEAELEKNLGKVRRRAMAAARGRRKSQSSRNTYKDK-GRGSQNSKI 465

Query: 258 QKQLSSW 238
              +S +
Sbjct: 466 HSNMSGF 472

>dbj|BAB92640.1| B1099D03.17 [Oryza sativa (japonica cultivar-group)]
          Length = 646

 Score =  204 bits (519), Expect = 2e-51
 Identities = 120/274 (43%), Positives = 164/274 (59%), Gaps = 34/274 (12%)
 Frame = -1

Query: 957  GLIHCDFNEFNIMINDDEKITVIDFPQMVSVSHRNAQMYFDRDIECIFKFFRKRFNLTFQ 778
            GLIHCDFNEFNIMI+DDEK+T+IDFPQMVSV HRNAQM+FDRDIECI+KFFRKRF+L+  
Sbjct: 376  GLIHCDFNEFNIMIDDDEKVTMIDFPQMVSVKHRNAQMFFDRDIECIYKFFRKRFHLS-S 434

Query: 777  ESIDDIDDSDEGKDEDGSPCLSEIDRATGSLDRELAASGFTKKDAEDIQRFI-EGTPESD 601
            E  ++ D SD   DE+  P    I +A GSLD+ELAASGFT+K+  ++ ++I +   E  
Sbjct: 435  EKCEEQDGSDIDDDENSRPSFLSIQKAAGSLDKELAASGFTRKEQVEMDKYIDQNAEEES 494

Query: 600  SNSDSEGVDSVEDLNEAIIKDDDFSYLSEQD------------DEHESKGIEESCEAGQS 457
            S+ DS      ED ++  +K      ++EQD            D +E +   +  E   S
Sbjct: 495  SDDDSTSEQDNEDGDDVAVKIGSLK-IAEQDSAEVPDCTLASKDSNEPETFAKENETSTS 553

Query: 456  YA----------SDKGDA-----------SDEEEHDQNEIENDAELNKSLNKQRRHAIAS 340
             +          S  GDA           SD++  D  + E+D  L K LNKQR+ AIA+
Sbjct: 554  CSGENNSINPSPSSNGDAKEPTESQDNDDSDDDSSDDPDGEDDDALAKQLNKQRKRAIAA 613

Query: 339  ARRGRKTIASRNSYKDKGGRSSHSSKVQKQLSSW 238
            A   R+ I+SRN+YK K G+ + +SK+++Q   W
Sbjct: 614  AHGRRRPISSRNAYKYK-GKGTMNSKIERQACKW 646

>gb|AAH47169.1| Similar to hypothetical protein FLJ11159 [Danio rerio]
          Length = 512

 Score =  113 bits (282), Expect = 5e-24
 Identities = 62/140 (44%), Positives = 85/140 (60%)
 Frame = -1

Query: 957 GLIHCDFNEFNIMINDDEKITVIDFPQMVSVSHRNAQMYFDRDIECIFKFFRKRFNLTFQ 778
           GLIH DFNEFN+M++D++ +T+IDFPQMVS SH NA+ YFDRD++CI  FF KRFN    
Sbjct: 223 GLIHGDFNEFNLMLDDNDHVTMIDFPQMVSTSHINAEWYFDRDVKCIRDFFIKRFNY--- 279

Query: 777 ESIDDIDDSDEGKDEDGSPCLSEIDRATGSLDRELAASGFTKKDAEDIQRFIEGTPESDS 598
                        + +  P   +I RA  SLD E++ASG+TK+  +D        PESD 
Sbjct: 280 -------------ESELYPTFKDIRRAC-SLDVEISASGYTKELQQDDSLLHPEGPESDG 325

Query: 597 NSDSEGVDSVEDLNEAIIKD 538
           + D E  +S +D ++A   D
Sbjct: 326 DEDDESPESTDDTHQASAGD 345

>ref|NP_080210.1| RIKEN cDNA 2010110K24 [Mus musculus] gi|12842538|dbj|BAB25639.1|
           unnamed protein product [Mus musculus]
           gi|12842641|dbj|BAB25676.1| unnamed protein product [Mus
           musculus] gi|26339924|dbj|BAC33625.1| unnamed protein
           product [Mus musculus]
          Length = 547

 Score =  102 bits (253), Expect = 1e-20
 Identities = 66/182 (36%), Positives = 92/182 (50%), Gaps = 1/182 (0%)
 Frame = -1

Query: 957 GLIHCDFNEFNIMINDDEKITVIDFPQMVSVSHRNAQMYFDRDIECIFKFFRKRFNLTFQ 778
           GLIH DFNEFN+M++ D+ IT+IDFPQMVS SH NA+ YFDRD++CI +FF KRF+    
Sbjct: 223 GLIHGDFNEFNLMLDKDDHITMIDFPQMVSTSHPNAEWYFDRDVKCIREFFMKRFSY--- 279

Query: 777 ESIDDIDDSDEGKDEDGSPCLSEIDRATGSLDRELAASGFTKKDAEDIQRFIEGTPESDS 598
                        + +  P  S+I R   SLD E++ASG+TK+   D +      P+   
Sbjct: 280 -------------ESELYPTFSDI-RKEDSLDVEVSASGYTKEMQADDELLHPVGPDDKI 325

Query: 597 NSDSEGVDSVEDLNEAIIKDDDF-SYLSEQDDEHESKGIEESCEAGQSYASDKGDASDEE 421
               E  D      E + K   + S L ++ D  +  G    C +  S     G   +E 
Sbjct: 326 TETEEDSDFTFSDEEMLEKAKVWRSELEKEADPADESGGSWCCSSTDSKQIKDGGLPEES 385

Query: 420 EH 415
            H
Sbjct: 386 AH 387

>ref|XP_217975.1| similar to RIKEN cDNA 2010110K24 [Mus musculus] [Rattus norvegicus]
          Length = 551

 Score =  102 bits (253), Expect = 1e-20
 Identities = 71/191 (37%), Positives = 99/191 (51%), Gaps = 4/191 (2%)
 Frame = -1

Query: 957 GLIHCDFNEFNIMINDDEKITVIDFPQMVSVSHRNAQMYFDRDIECIFKFFRKRFNLTFQ 778
           GLIH DFNEFN+M++ D+ IT+IDFPQMVS SH NA+ YFDRD++CI +FF KRFN    
Sbjct: 223 GLIHGDFNEFNLMLDKDDHITMIDFPQMVSTSHPNAEWYFDRDVKCIREFFLKRFNY--- 279

Query: 777 ESIDDIDDSDEGKDEDGSPCLSEIDRATGSLDRELAASGFTKK-DAEDIQRFIEGTPESD 601
                        + +  P  S+I R   SLD E++ASG+TK+  A+D      G  +  
Sbjct: 280 -------------ESELYPTFSDI-RREDSLDVEVSASGYTKEMQADDELLHPVGPDDKI 325

Query: 600 SNSDSEGVDSVED---LNEAIIKDDDFSYLSEQDDEHESKGIEESCEAGQSYASDKGDAS 430
           + ++ E   S+ D   L  A ++  +        DE  S     S ++ Q     K D  
Sbjct: 326 TETEEESDLSLSDEEMLERAKVQRSELENEPNPADESGSSCCFSSADSKQM----KEDGL 381

Query: 429 DEEEHDQNEIE 397
            EE  D +  E
Sbjct: 382 PEESADVSSFE 392

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 746,778,027
Number of Sequences: 1393205
Number of extensions: 16149377
Number of successful extensions: 86695
Number of sequences better than 10.0: 1609
Number of HSP's better than 10.0 without gapping: 59877
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 73586
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 54078381240
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
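
The search statistics above can be cross-checked against the hit list. The sketch below assumes the usual relations: effective database length = database length minus (number of sequences x effective HSP length), and E is approximately the effective search space x 2^(-bit score). The variable and function names are illustrative. Because the bit scores in the hit list are rounded to integers, the computed E-values should only be expected to agree with the reported ones to about the same order of magnitude.

```python
# Values copied from the BLASTX report for KMC001869A_c01.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 123
search_space = 54_078_381_240  # "effective search space used"

# Each database sequence is shortened by the effective HSP length.
effective_db_length = db_letters - db_sequences * effective_hsp_length

def expect_value(bit_score, space=search_space):
    """Approximate E-value from a bit score: E = space * 2**(-bit_score)."""
    return space * 2.0 ** (-bit_score)

print("effective length of database:", effective_db_length)
for bits in (217, 204, 113, 102):  # rounded bit scores from the hit list
    print(bits, f"{expect_value(bits):.1e}")
```

The first printed value can be compared with the "effective length of database" line, and the remaining lines with the E Value column of the hit list.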


EST assembly


no.  clone          accession  start  end
1    GENf021b12     BP059247       1  407
2    MFBL049a07_f   BP043746       9  509
3    MF031h08_f     BP029941      10  571
4    MFBL003e02_f   BP041422      27  513
5    MR093h01_f     BP083184      33  556
6    MF008c06_f     BP028632     505  960
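
The EST positions listed above can be checked for contiguous coverage of the contig. The sketch below assumes that the two numbers given for each EST are inclusive start and end coordinates in the assembly. The names `merge_intervals` and `depth_at` are illustrative. Note that the largest end coordinate (960) is slightly greater than the 957 letters reported for the BLASTX query; this sketch assumes the difference comes from gaps in the assembly alignment, which the record itself does not state.

```python
# (clone, accession, start, end) as listed in the EST assembly table.
ests = [
    ("GENf021b12", "BP059247", 1, 407),
    ("MFBL049a07_f", "BP043746", 9, 509),
    ("MF031h08_f", "BP029941", 10, 571),
    ("MFBL003e02_f", "BP041422", 27, 513),
    ("MR093h01_f", "BP083184", 33, 556),
    ("MF008c06_f", "BP028632", 505, 960),
]

def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

def depth_at(position):
    """Number of ESTs whose interval covers the given position."""
    return sum(start <= position <= end for _, _, start, end in ests)

covered = merge_intervals([(start, end) for _, _, start, end in ests])
print("covered regions:", covered)
print("depth at 100:", depth_at(100), "depth at 600:", depth_at(600))
```

A single merged region indicates that the six ESTs tile the contig without a gap; the depth values show that the region downstream of position 571 is supported by one EST only.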




Lotus japonicus
Kazusa DNA Research Institute