KMC005356A_c01

Fasta Sequence
>KMC005356A_C01 KMC005356A_c01
CATTAAACGCTTAAGCGATTAATATTCCTTCTATAAAATTATCTTTTAATTGTTGCTAAG
GTTGAGGATGGGAAAAAAAACAGTGAGAAACAATTATTTCAAATTCCCAAGACAATCACA
CGCACTCTCTGGGGGCAAATCCGACCCGAGATCCAGCCAAGTCATACGCGATGCGGATCC
CTGGTGGTGGGATGTTACCGATTATGGAAAAGGGACCACCCCATTGAGCGAATGCGAAGC
ATGAAGGTCCGGTCTTGGTTGGTTTTCAACGTAGCAAGAACGTTCTTCGCCGGGAAGGAC
ACGTCAACATCCCGAAAATGGAGCACCAGAGTGGGAACCTTTGGCTCTCTGGAGCCCTGA
TAGGTCATAACAAGTATCGAAGGACTCGGTCTCGGGCCCTCGGTTCAAATGCGCAGCCGC
AAACCGAAAAGCGTGTCTGAAAGCCAAGTAAGCCGGTATAATGAGTGGTGTCACGGTTGT
GCCTGAATCGATGATGACGCCTCCA
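
The BLASTX hits for this contig are all on reverse-strand frames (Frame = -1, -2, -3). As a rough illustration of how such a frame is read, the sketch below reverse-complements a nucleotide string and translates it with the standard genetic code. The function names and the `TAIL` constant are hypothetical helpers introduced here, not part of the Kazusa pipeline or of BLAST. Only the last 25 bases of the contig are used, which is enough because negative frames are counted from the 3' end of the query.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Last 25 bases of KMC005356A_c01 (the final line of the FASTA record).
TAIL = "GCCTGAATCGATGATGACGCCTCCA"


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_reverse_frame(seq, frame):
    """Translate a reverse-strand frame (-1, -2 or -3) of seq.

    Assumes the usual convention that frame -1 starts at the last base
    of the query, frame -2 one base earlier, and frame -3 two bases earlier.
    Codons containing anything other than A, C, G, T are rendered as 'X'.
    """
    if frame not in (-1, -2, -3):
        raise ValueError("frame must be -1, -2 or -3")
    rc = reverse_complement(seq)
    offset = -frame - 1
    codons = (rc[i:i + 3] for i in range(offset, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    print(translate_reverse_frame(TAIL, -2))
```

The frame -2 translation of the tail begins with the same residues as the Query line starting at position 504 in the alignments.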


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005356A_C01 KMC005356A_c01
         (505 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_191741.1| putative protein; protein id: At3g61820.1, supp...    68  1e-24
ref|NP_171637.1| chloroplast nucleoid DNA binding protein, putat...    68  3e-21
gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putativ...    67  5e-21
dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like protein ...    65  4e-18
gb|AAO41867.1| unknown protein [Arabidopsis thaliana]                  57  1e-14

>ref|NP_191741.1| putative protein; protein id: At3g61820.1, supported by cDNA:
           gi_14532549 [Arabidopsis thaliana]
           gi|11357465|pir||T47974 hypothetical protein F15G16.210
           - Arabidopsis thaliana gi|6850873|emb|CAB71112.1|
           putative protein [Arabidopsis thaliana]
          Length = 483

 Score = 67.8 bits (164), Expect(3) = 1e-24
 Identities = 33/55 (60%), Positives = 37/55 (67%)
 Frame = -2

Query: 504 GGVIIDSGTTVTPLIIPAYLAFRHAFRFAAAHLNRGPETESFDTCYDLSGLQRAK 340
           GGVIIDSGT+VT L  PAY+A R AFR  A  L R P    FDTC+DLSG+   K
Sbjct: 359 GGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVK 413

 Score = 55.1 bits (131), Expect(3) = 1e-24
 Identities = 27/43 (62%), Positives = 28/43 (64%)
 Frame = -1

Query: 250 GPSCFAFAQWGGPFSIIGNIPPPGIRIAYDLAGSRVGFAPREC 122
           G  CFAFA   G  SIIGNI   G R+AYDL GSRVGF  R C
Sbjct: 441 GRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

 Score = 31.2 bits (69), Expect(3) = 1e-24
 Identities = 15/32 (46%), Positives = 18/32 (55%)
 Frame = -3

Query: 356 GSREPKVPTLVLHFRDVDVSFPAKNVLATLKT 261
           G    KVPT+V HF   +VS PA N L  + T
Sbjct: 408 GMTTVKVPTVVFHFGGGEVSLPASNYLIPVNT 439

>ref|NP_171637.1| chloroplast nucleoid DNA binding protein, putative; protein id:
           At1g01300.1, supported by cDNA: 7567. [Arabidopsis
           thaliana] gi|25518405|pir||C86143 hypothetical protein
           F6F3.10 - Arabidopsis thaliana
           gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein
           [Arabidopsis thaliana] gi|22135930|gb|AAM91547.1|
           chloroplast nucleoid DNA binding protein, putative
           [Arabidopsis thaliana]
          Length = 485

 Score = 68.2 bits (165), Expect(2) = 3e-21
 Identities = 33/55 (60%), Positives = 38/55 (69%)
 Frame = -2

Query: 504 GGVIIDSGTTVTPLIIPAYLAFRHAFRFAAAHLNRGPETESFDTCYDLSGLQRAK 340
           GGVIIDSGT+VT LI PAY+A R AFR  A  L R P+   FDTC+DLS +   K
Sbjct: 360 GGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVK 414

 Score = 54.7 bits (130), Expect(2) = 3e-21
 Identities = 27/43 (62%), Positives = 28/43 (64%)
 Frame = -1

Query: 250 GPSCFAFAQWGGPFSIIGNIPPPGIRIAYDLAGSRVGFAPREC 122
           G  CFAFA   G  SIIGNI   G R+ YDLA SRVGFAP  C
Sbjct: 442 GKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

 Score = 41.2 bits (95), Expect = 0.007
 Identities = 23/49 (46%), Positives = 26/49 (52%)
 Frame = -3

Query: 347 EPKVPTLVLHFRDVDVSFPAKNVLATLKTNQDRTFMLRIRSMGWSLFHN 201
           E KVPT+VLHFR  DVS PA N L  + TN    F       G S+  N
Sbjct: 412 EVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGN 460

>gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score = 67.4 bits (163), Expect(2) = 5e-21
 Identities = 33/55 (60%), Positives = 37/55 (67%)
 Frame = -2

Query: 504 GGVIIDSGTTVTPLIIPAYLAFRHAFRFAAAHLNRGPETESFDTCYDLSGLQRAK 340
           GGVIIDSGT+VT LI PAY+A R AFR  A  L R P    FDTC+DLS +   K
Sbjct: 360 GGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVK 414

 Score = 54.7 bits (130), Expect(2) = 5e-21
 Identities = 27/43 (62%), Positives = 28/43 (64%)
 Frame = -1

Query: 250 GPSCFAFAQWGGPFSIIGNIPPPGIRIAYDLAGSRVGFAPREC 122
           G  CFAFA   G  SIIGNI   G R+ YDLA SRVGFAP  C
Sbjct: 442 GKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

 Score = 40.8 bits (94), Expect = 0.009
 Identities = 23/49 (46%), Positives = 26/49 (52%)
 Frame = -3

Query: 347 EPKVPTLVLHFRDVDVSFPAKNVLATLKTNQDRTFMLRIRSMGWSLFHN 201
           E KVPT+VLHFR  DVS PA N L  + TN    F       G S+  N
Sbjct: 412 EVKVPTVVLHFRRADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGN 460

>dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like protein [Oryza sativa
           (japonica cultivar-group)]
          Length = 500

 Score = 64.7 bits (156), Expect(2) = 4e-18
 Identities = 38/66 (57%), Positives = 42/66 (63%), Gaps = 1/66 (1%)
 Frame = -2

Query: 504 GGVIIDSGTTVTPLIIPAYLAFRHAFRFAAAHLNRGPETES-FDTCYDLSGLQRAKGSHS 328
           GGVI+DSGT+VT L  PAY A R AFR AAA L   P   S FDTCYDLSGL+  K    
Sbjct: 374 GGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTV 433

Query: 327 GAPFSG 310
              F+G
Sbjct: 434 SMHFAG 439

 Score = 47.8 bits (112), Expect(2) = 4e-18
 Identities = 22/43 (51%), Positives = 26/43 (60%)
 Frame = -1

Query: 250 GPSCFAFAQWGGPFSIIGNIPPPGIRIAYDLAGSRVGFAPREC 122
           G  CFAFA   G  SIIGNI   G R+ +D  G R+GF P+ C
Sbjct: 458 GTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 500

>gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
          Length = 470

 Score = 56.6 bits (135), Expect(2) = 1e-14
 Identities = 27/50 (54%), Positives = 33/50 (66%)
 Frame = -2

Query: 504 GGVIIDSGTTVTPLIIPAYLAFRHAFRFAAAHLNRGPETESFDTCYDLSG 355
           GGV++D+GT VT L   AY+AFR  F+   A+L R      FDTCYDLSG
Sbjct: 345 GGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSG 394

 Score = 43.9 bits (102), Expect(2) = 1e-14
 Identities = 27/63 (42%), Positives = 35/63 (54%)
 Frame = -1

Query: 310 MLTCPSRRRTFLLR*KPTKTGPSCFAFAQWGGPFSIIGNIPPPGIRIAYDLAGSRVGFAP 131
           +LT P+R   FL+      +G  CFAFA      SIIGNI   GI++++D A   VGF P
Sbjct: 412 VLTLPARN--FLM--PVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 467

Query: 130 REC 122
             C
Sbjct: 468 NVC 470

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
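
The gapped Lambda and K values above relate the raw scores shown in parentheses in each alignment to the reported bit scores through the usual normalisation S' = (Lambda * S - ln K) / ln 2. The sketch below is a minimal check of that relation against three alignments in this report. The function name is a hypothetical helper, and because Lambda and K are printed with limited precision the results agree with the report only to about one decimal place.

```python
import math

# Gapped statistical parameters as printed in this report (rounded values).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # (raw score, bit score reported for the NP_191741.1 alignments)
    for raw, reported in [(164, 67.8), (131, 55.1), (69, 31.2)]:
        print(raw, round(bit_score(raw), 1), reported)
```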

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 479,341,398
Number of Sequences: 1393205
Number of extensions: 11419295
Number of successful extensions: 27128
Number of sequences better than 10.0: 67
Number of HSP's better than 10.0 without gapping: 26326
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 27117
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 15362785481
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
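
The length figures in the statistics above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the common BLAST length adjustment in which the effective database length is the total letter count minus the effective HSP length for every sequence. The effective query length is not printed in the report; the value computed here is only inferred from the ratio of the search space to the effective database length. All names are hypothetical.

```python
# Values copied from the statistics block of this report.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 114
REPORTED_EFFECTIVE_DB_LENGTH = 289_863_877
REPORTED_SEARCH_SPACE = 15_362_785_481


def effective_db_length(letters, sequences, hsp_length):
    """Total letters minus one effective HSP length per database sequence."""
    return letters - sequences * hsp_length


def implied_query_length(search_space, db_length):
    """Effective query length implied by search space = query * database."""
    quotient, remainder = divmod(search_space, db_length)
    if remainder:
        raise ValueError("search space is not an exact multiple of db length")
    return quotient


if __name__ == "__main__":
    db_len = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
    print(db_len == REPORTED_EFFECTIVE_DB_LENGTH)
    print(implied_query_length(REPORTED_SEARCH_SPACE, db_len))
```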


EST assemble image


    clone        accession  position
1   MPD066c05_f  AV774387     1  523
2   MPD068h04_f  AV774539    43  408
3   SPD040e07_f  BP047188    43  551




Lotus japonicus
Kazusa DNA Research Institute