KMC001276A_c01

Fasta Sequence
>KMC001276A_C01 KMC001276A_c01
AAAAGCTATAAAACTATTGAAACTTTTAATTTGTAAGCAGCACTACAGAAAGCATCCTGT
TTTCTTACAATTTAGCTGCCTCAAGGCAAATTTTAGTGTTCCCATATGCAAACTGGAATC
CAAAAGGTGTATAAAAAACTCATCCAATTGCATGTTGACGATCAACTAAACTTTCAACTA
ACAGAGTCATACAATATACATCTATAACAAGTGCTTAGCTGAGTTTGGGAGTATGTAGAT
CATACCTTGAACACTTCAAAGCAGTGCAGAACTTTATGCGTACACATCTTCCTTGACCAC
CTACGGCATTTAATTTCCAACAATTTCAGGACTGTCAATGTCCACAATCTTCACACCGCG
GCGCTTTCTTATCAATACACTCAACCTCCTCTTACGAACGCCTAAAAGCTTAATCCTCGA
GGTCCCACTTCCGTATCTATGCAACTTAACATCATCTCCAAGTGAAGTTTCAGGGCTTAC
TGTACCTACCCTTTTACCACTGTCTTTCACTTCAGACCTGTTATGCAAATCTTGTTTTGG
TTGGTTAACACGACAAAGCCG
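
A minimal sketch of how a FASTA record like the one above could be read and summarised. The function names `read_fasta` and `base_composition` are illustrative, not part of any tool used to build this entry, and the demo record holds only the first 20 bases of the contig.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict of {header: sequence}."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; keep everything after '>' as the header.
            header = line[1:]
            records[header] = []
        elif header is not None:
            # Sequence lines are concatenated in the order they appear.
            records[header].append(line.upper())
    return {h: "".join(parts) for h, parts in records.items()}


def base_composition(seq):
    """Count A, C, G, T and anything else (e.g. N) in a nucleotide string."""
    counts = {base: seq.count(base) for base in "ACGT"}
    counts["other"] = len(seq) - sum(counts.values())
    return counts


# Demo record (assumption: shortened to the first 20 bases for illustration).
example = ">demo example\nAAAAGCTATA\nAAACTATTGA\n"
records = read_fasta(example)
demo_seq = records["demo example"]
print(len(demo_seq), base_composition(demo_seq))
```

Run on the full record, the parsed length can be checked against the 561 letters that the BLASTX report gives for this query.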


Nr Search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001276A_C01 KMC001276A_c01
         (561 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|ZP_00059845.1| hypothetical protein [Clostridium thermocellu...    32  7.0
dbj|BAB25636.1| unnamed protein product [Mus musculus]                 32  7.0
ref|NP_010592.1| Hypothetical ORF; Ydr306cp [Saccharomyces cerev...    32  7.0
gb|EAA30019.1| predicted protein [Neurospora crassa]                   32  7.0
ref|NP_615455.1| bacterial extracellular solute-binding family 3...    31  9.1

>ref|ZP_00059845.1| hypothetical protein [Clostridium thermocellum ATCC 27405]
          Length = 413

 Score = 31.6 bits (70), Expect = 7.0
 Identities = 27/73 (36%), Positives = 34/73 (45%), Gaps = 6/73 (8%)
 Frame = -1

Query: 513 EVKDSGKRVGTVSPETSLGDDVKLHRYGSGTSRIKLLGVRKRRLSVLIRKR------RGV 352
           E++D GK   TV     LG    L  YG G   +  LGVRK RL     K+       G+
Sbjct: 309 ELQDQGK--DTVEANVLLGFPPDLREYGIGAQILYDLGVRKIRLLTNNPKKLIGLGGHGL 366

Query: 351 KIVDIDSPEIVGN 313
           +IV+    EI GN
Sbjct: 367 EIVERVPIEIKGN 379

>dbj|BAB25636.1| unnamed protein product [Mus musculus]
          Length = 131

 Score = 31.6 bits (70), Expect = 7.0
 Identities = 16/34 (47%), Positives = 19/34 (55%)
 Frame = -1

Query: 552 RVNQPKQDLHNRSEVKDSGKRVGTVSPETSLGDD 451
           R  QP+Q L  R +  DS   V  VS  TSLG+D
Sbjct: 9   RRQQPQQGLRRRRQTSDSSVGVNHVSSTTSLGED 42

>ref|NP_010592.1| Hypothetical ORF; Ydr306cp [Saccharomyces cerevisiae]
           gi|2131422|pir||S61192 hypothetical protein YDR306c -
           yeast (Saccharomyces cerevisiae)
           gi|849223|gb|AAB64742.1| Ydr306cp [Saccharomyces
           cerevisiae]
          Length = 478

 Score = 31.6 bits (70), Expect = 7.0
 Identities = 16/35 (45%), Positives = 23/35 (65%), Gaps = 1/35 (2%)
 Frame = +3

Query: 249 EHFKAVQNFMR-THLP*PPTAFNFQQFQDCQCPQS 350
           E F A++NF + THL  P ++ + Q FQD Q PQ+
Sbjct: 266 ELFSAIKNFTKLTHLSFPRSSIDCQGFQDIQWPQN 300

>gb|EAA30019.1| predicted protein [Neurospora crassa]
          Length = 1220

 Score = 31.6 bits (70), Expect = 7.0
 Identities = 19/63 (30%), Positives = 29/63 (45%), Gaps = 12/63 (19%)
 Frame = -3

Query: 382  ECIDKKAPRCEDCGH*QS*NCWKLNAVGGQ------------GRCVRIKFCTALKCSRYD 239
            E + KK  RC  CG  +     +  A GG+             RC+++K+C+A +C R D
Sbjct: 1146 EALRKKEERCRSCGKGEEELVEERKANGGETAGKTAEVLRKCARCLKVKYCSA-ECQRRD 1204

Query: 238  LHT 230
              T
Sbjct: 1205 WKT 1207

>ref|NP_615455.1| bacterial extracellular solute-binding family 3 protein
           [Methanosarcina acetivorans str. C2A]
           gi|19914275|gb|AAM03935.1| bacterial extracellular
           solute-binding family 3 protein [Methanosarcina
           acetivorans str. C2A]
          Length = 738

 Score = 31.2 bits (69), Expect = 9.1
 Identities = 30/116 (25%), Positives = 48/116 (40%), Gaps = 17/116 (14%)
 Frame = +2

Query: 254 LQSSAELYAYTSSLTTYGI*FPTISGLSMSTIFTPRRFLINTLNLLLRTPKSLILEVPLP 433
           L+   E+ +YT  L   G  FP +        F P  F++  +  L + P SL  ++ L 
Sbjct: 280 LKDGNEIESYTRVLVPLGYTFPLMGA------FVPFLFIL-FIAWLYKNPLSLFEQLQLA 332

Query: 434 ----------------YLCNLTSSPSEVSGLTVPT-LLPLSFTSDLLCKSCFGWLT 550
                           +L NL   P++   L + T +L  SFT+ + C S F + T
Sbjct: 333 AVGILSFFGPAKIAVGWLLNLMRLPADAFNLYISTSILRQSFTASMACMSIFSFAT 388
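
The Frame line of each hit can be recovered from the query coordinates and the query length (561). The sketch below assumes the usual BLASTX convention: coordinates are 1-based positions on the original strand, and minus-strand frames are counted from the 3' end. The helper names are hypothetical.

```python
def blastx_frame(query_from, query_to, query_length):
    """Return the reading frame (+1..+3 or -1..-3) of a translated HSP.

    query_from and query_to are the nucleotide coordinates printed at the
    start and end of the aligned query segment.
    """
    if query_from <= query_to:
        # Plus strand: the frame is set by the offset from the first base.
        return (query_from - 1) % 3 + 1
    # Minus strand: the frame is set by the offset from the last base.
    return -((query_length - query_from) % 3 + 1)


def aligned_codons(query_from, query_to):
    """Number of query codons spanned by the HSP (gap columns excluded)."""
    return (abs(query_to - query_from) + 1) // 3


QUERY_LENGTH = 561
# (query_from, query_to) for the five hits, in the order listed above.
hits = [(513, 313), (552, 451), (249, 350), (382, 230), (254, 550)]
for start, end in hits:
    print(start, end, blastx_frame(start, end, QUERY_LENGTH), aligned_codons(start, end))
```

For the first hit, the query runs from 513 down to 313, which is 201 nucleotides or 67 codons; together with the 6 gap columns this gives the 73 alignment columns reported in its Identities line.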

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 448,843,685
Number of Sequences: 1393205
Number of extensions: 9203374
Number of successful extensions: 28713
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 28183
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28692
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20095422690
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
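
The bit values in this report follow from the raw scores and the Lambda and K parameters printed above. The sketch below applies the standard Karlin-Altschul conversion, S' = (Lambda * S - ln K) / ln 2, and the matching conversion for X-dropoff values; the function names are illustrative, and the numbers are copied from this report.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff value to bits: lambda * X / ln 2."""
    return lam * raw_dropoff / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length after removing the effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


# Parameters as printed in the report.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.318, 0.135
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410

print(round(bit_score(70, GAPPED_LAMBDA, GAPPED_K), 1))      # hits with raw score 70
print(round(bit_score(69, GAPPED_LAMBDA, GAPPED_K), 1))      # hit with raw score 69
print(round(bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K), 1))  # S1
print(round(dropoff_bits(16, UNGAPPED_LAMBDA), 1))           # X1
print(round(dropoff_bits(38, GAPPED_LAMBDA), 1))             # X2
print(round(dropoff_bits(64, GAPPED_LAMBDA), 1))             # X3
print(effective_db_length(448689247, 1393205, 116))
```

The effective database length computed this way is 287,077,467, as reported. The effective search space, 20095422690, is exactly 70 times that figure, which would correspond to an effective query length of 70 residues; the report does not show how that value is derived.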


EST assembly


No.  Clone        Accession  Start  End
  1  GENf039g03   BP060036       1  367
  2  GNf045d04    BP070681       1  436
  3  GENf007h08   BP058652       1  390
  4  GENf045c07   BP060248       2  466
  5  GENLf071d09  BP066178       7  510
  6  MF045b07_f   BP030650       8  569
  7  GENf005a06   BP058512       8  383
  8  GENf037h05   BP059954       8  382
  9  GNf012b03    BP068217      16  475
 10  GNf100g12    BP074813      16  411
 11  GENf054e02   BP060660      18  376
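
A rough sketch of how the clone table could be used to estimate how many ESTs cover a given position of the assembly. It assumes that the two numbers in each row are the start and end of the clone within the assembly; the helper names are hypothetical and only three of the eleven rows are included in the demo.

```python
def parse_clone_rows(lines):
    """Parse rows of the form 'index clone accession start end'."""
    clones = []
    for line in lines:
        fields = line.split()
        # Skip headers and anything that is not a five-column data row.
        if len(fields) != 5 or not fields[0].isdigit():
            continue
        _, clone, accession, start, end = fields
        clones.append({
            "clone": clone,
            "accession": accession,
            "start": int(start),
            "end": int(end),
        })
    return clones


def coverage_depth(clones, position):
    """Count the clones whose start..end interval contains the position."""
    return sum(1 for c in clones if c["start"] <= position <= c["end"])


# Three rows copied from the table, for illustration only.
rows = [
    "clone accession position",
    "1 GENf039g03 BP060036 1 367",
    "6 MF045b07_f BP030650 8 569",
    "11 GENf054e02 BP060660 18 376",
]
clones = parse_clone_rows(rows)
print([coverage_depth(clones, p) for p in (5, 100, 400)])
```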




Lotus japonicus
Kazusa DNA Research Institute