KMC004200A_c01

Fasta Sequence
>KMC004200A_C01 KMC004200A_c01
gggccccccttttttttttttatccaaataaagagacccactaaatattaatcacTCAGA
TCCATGTGAAATTTAATATCATATCAATGTCACACTGTTAGCCATATGGCCTACATGCAA
CCCTTTACAATAATACTACCATAAATTACAAGAATATATAACTTCAATCTTCAGAACTAA
AAATAGCCCCCAAGGGTTGAACAAATTTGCAAGCCATGCCATGGAGTTTATAGTTACAAC
TTACAATCCATTTTTAGAAATTAAAATTAACCCATGCAGGGACTAATAATTTATGAACCC
ACTCCACACAATCAAGGGAAAAGGAAAATCCTTCTCTACCATATGTGGGCTGTTCTTGAC
TTTTATCACCTAAACACTTTAATGCTAAGCATGGTCTTTTCCCTTTTTTTTGTTTCTCTC
TAGAATTAAACCCCAAGACCATAATAATTAAAAACCTAAACCTATACTAAAATTCTTTTT
TCTCCCCACATCATAAATTATAAATTTTATAATGTTATTATATAGAGCAATGCTAGGATG
AGGGTTAAATAGGTCAATTACAAATCTAGCACTTGTTGGGTGAGAATCCAACTTCATTTT
TTGCAAGATCGAAGCTGACACGTGTCCCCTGTTGCTGCACGTTCCCTATAATGGACAAAG
ACGACGTCGTAGGCGCGAACGCGAAACAGAAAGTACCGGCGGAGTCCACCGGAATCAAGT
AATTCTTCGCCGGCAACGACCACGTTTTGCCGCCGCTGAAGTGGAGCGACACGGTGGGGA
CACGCACAGACTGCAGAGTCGACAAATCATAGCACGTGTCGAACAGCGCAACCCCCTCGG
CGGAACGCAGGTTCTGAGTCAGCCTCCGGAACGCGTCACGGACCGAGTTGTAAGCCTGAG
TTCGCAACCGAGTTATGGCGGTGCCGGAGTCCACGATAATCCCGCCGGCGCCGGTTTGGT
CCATCTCGAATGTCTCCGGCGGAATGCCGACCATCTGGCCTCCGACGCTGACGCCGGtga
gctccacgtagtagaaggttccgatcttctggttcttcagtagtggtgcagtgactgagt
cgccgggtcgagctgag


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004200A_C01 KMC004200A_c01
         (1097 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_188478.1| putative chloroplast nucleoid DNA-binding prote...   241  2e-62
gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein...   241  2e-62
ref|NP_173922.1| hypothetical protein; protein id: At1g25510.1, ...   240  3e-62
dbj|BAB21205.1| nucleoid DNA-binding protein cnd41-like protein ...   232  7e-60
gb|AAO41867.1| unknown protein [Arabidopsis thaliana]                 193  5e-48

>ref|NP_188478.1| putative chloroplast nucleoid DNA-binding protein; protein id:
            At3g18490.1 [Arabidopsis thaliana]
            gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid
            DNA binding protein-like [Arabidopsis thaliana]
            gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
          Length = 500

 Score =  241 bits (614), Expect = 2e-62
 Identities = 113/173 (65%), Positives = 145/173 (83%), Gaps = 1/173 (0%)
 Frame = -2

Query: 1084 GDSVTAPLLKNQKIGTFYYVELTGVSVGGQMVGIPPETFEMDQTGAGGIIVDSGTAITRL 905
            G   TAPLL+N+KI TFYYV L+G SVGG+ V +P   F++D +G+GG+I+D GTA+TRL
Sbjct: 328  GGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRL 387

Query: 904  RTQAYNSVRDAFRRLTQNLRS-AEGVALFDTCYDLSTLQSVRVPTVSLHFSGGKTWSLPA 728
            +TQAYNS+RDAF +LT NL+  +  ++LFDTCYD S+L +V+VPTV+ HF+GGK+  LPA
Sbjct: 388  QTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPA 447

Query: 727  KNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVSFDLAKNEVGFSPNKC 569
            KNYLIPVD +GTFCFAFAPT+SSLSIIGNVQQQGTR+++DL+KN +G S NKC
Sbjct: 448  KNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

>gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
            thaliana]
          Length = 500

 Score =  241 bits (614), Expect = 2e-62
 Identities = 113/173 (65%), Positives = 145/173 (83%), Gaps = 1/173 (0%)
 Frame = -2

Query: 1084 GDSVTAPLLKNQKIGTFYYVELTGVSVGGQMVGIPPETFEMDQTGAGGIIVDSGTAITRL 905
            G   TAPLL+N+KI TFYYV L+G SVGG+ V +P   F++D +G+GG+I+D GTA+TRL
Sbjct: 328  GGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRL 387

Query: 904  RTQAYNSVRDAFRRLTQNLRS-AEGVALFDTCYDLSTLQSVRVPTVSLHFSGGKTWSLPA 728
            +TQAYNS+RDAF +LT NL+  +  ++LFDTCYD S+L +V+VPTV+ HF+GGK+  LPA
Sbjct: 388  QTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPA 447

Query: 727  KNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVSFDLAKNEVGFSPNKC 569
            KNYLIPVD +GTFCFAFAPT+SSLSIIGNVQQQGTR+++DL+KN +G S NKC
Sbjct: 448  KNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

>ref|NP_173922.1| hypothetical protein; protein id: At1g25510.1, supported by cDNA:
            gi_20466515 [Arabidopsis thaliana]
            gi|25518510|pir||D86385 hypothetical protein F2J7.6 -
            Arabidopsis thaliana
            gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical
            protein [Arabidopsis thaliana] gi|20466516|gb|AAM20575.1|
            unknown protein [Arabidopsis thaliana]
            gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis
            thaliana]
          Length = 483

 Score =  240 bits (612), Expect = 3e-62
 Identities = 112/171 (65%), Positives = 143/171 (83%)
 Frame = -2

Query: 1081 DSVTAPLLKNQKIGTFYYVELTGVSVGGQMVGIPPETFEMDQTGAGGIIVDSGTAITRLR 902
            D+V APLL+N ++ TFYY+ LTG+SVGG+++ IP  +FEMD++G+GGII+DSGTA+TRL+
Sbjct: 313  DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQ 372

Query: 901  TQAYNSVRDAFRRLTQNLRSAEGVALFDTCYDLSTLQSVRVPTVSLHFSGGKTWSLPAKN 722
            T+ YNS+RD+F + T +L  A GVA+FDTCY+LS   +V VPTV+ HF GGK  +LPAKN
Sbjct: 373  TEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKN 432

Query: 721  YLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVSFDLAKNEVGFSPNKC 569
            Y+IPVDS GTFC AFAPT SSL+IIGNVQQQGTRV+FDLA + +GFS NKC
Sbjct: 433  YMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

>dbj|BAB21205.1| nucleoid DNA-binding protein cnd41-like protein [Oryza sativa
            (japonica cultivar-group)]
          Length = 504

 Score =  232 bits (592), Expect = 7e-60
 Identities = 113/169 (66%), Positives = 136/169 (79%)
 Frame = -2

Query: 1075 VTAPLLKNQKIGTFYYVELTGVSVGGQMVGIPPETFEMDQTGAGGIIVDSGTAITRLRTQ 896
            VTAPL+++ +  TFYYV L+G+SVGGQ++ IPP  F MD TGAGG+IVDSGTA+TRL++ 
Sbjct: 336  VTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQSS 395

Query: 895  AYNSVRDAFRRLTQNLRSAEGVALFDTCYDLSTLQSVRVPTVSLHFSGGKTWSLPAKNYL 716
            AY ++RDAF R TQ+L    GV+LFDTCYDLS   SV VP VSL F+GG    LPAKNYL
Sbjct: 396  AYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKNYL 455

Query: 715  IPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVSFDLAKNEVGFSPNKC 569
            IPVD AGT+C AFAPT +++SIIGNVQQQGTRVSFD AK+ VGF+ NKC
Sbjct: 456  IPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504

>gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
          Length = 470

 Score =  193 bits (490), Expect = 5e-48
 Identities = 92/175 (52%), Positives = 125/175 (70%)
 Frame = -2

Query: 1093 ARPGDSVTAPLLKNQKIGTFYYVELTGVSVGGQMVGIPPETFEMDQTGAGGIIVDSGTAI 914
            A P  +   PL++N +  +FYYV L G+ VGG  + +P   F++ +TG GG+++D+GTA+
Sbjct: 296  ALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAV 355

Query: 913  TRLRTQAYNSVRDAFRRLTQNLRSAEGVALFDTCYDLSTLQSVRVPTVSLHFSGGKTWSL 734
            TRL T AY + RD F+  T NL  A GV++FDTCYDLS   SVRVPTVS +F+ G   +L
Sbjct: 356  TRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTL 415

Query: 733  PAKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVSFDLAKNEVGFSPNKC 569
            PA+N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +VSFD A   VGF PN C
Sbjct: 416  PARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
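
Note on the Query coordinates: every alignment above is reported with Frame = -2, so the Query positions count down along the reverse strand of the 1097-letter contig. The following is a minimal sketch, not part of the BLAST output, that checks the reported coordinates against that frame. The function names are hypothetical, and it assumes the usual BLASTX convention that frame -1 begins at the last base of the query, frame -2 one base earlier, and so on.

```python
QUERY_LEN = 1097  # "(1097 letters)" from the report header


def first_codon_start(query_len: int, frame: int) -> int:
    """1-based query coordinate of the first base read in the given frame.

    Assumes the BLASTX convention: frames +1..+3 start at bases 1..3,
    frames -1..-3 start at the last, second-to-last, third-to-last base.
    """
    if frame == 0 or abs(frame) > 3:
        raise ValueError("frame must be one of -3..-1 or 1..3")
    return frame if frame > 0 else query_len + frame + 1


def is_in_frame(coord: int, query_len: int, frame: int) -> bool:
    """True if a codon of this frame can start at the given coordinate."""
    return (first_codon_start(query_len, frame) - coord) % 3 == 0


def aligned_residues(q_from: int, q_to: int) -> int:
    """Number of translated query residues spanned by a coordinate pair."""
    span = abs(q_from - q_to) + 1
    if span % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return span // 3


if __name__ == "__main__":
    # Query start coordinates of the five hits listed above.
    for start in (1084, 1084, 1081, 1075, 1093):
        print(start, is_in_frame(start, QUERY_LEN, -2),
              aligned_residues(start, 569))
```

For the first hit, the span 1084..569 covers 172 query residues, which together with the single gap accounts for the 173 alignment columns in the Identities line.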

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 974,393,111
Number of Sequences: 1393205
Number of extensions: 22902843
Number of successful extensions: 67738
Number of sequences better than 10.0: 214
Number of HSP's better than 10.0 without gapping: 62948
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 67380
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 65889269280
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
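
Note on the statistics: the bit scores and E values in the hit list can be approximately reproduced from the gapped Lambda and K and the effective search space printed above, using the standard Karlin-Altschul relations S' = (Lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'). The sketch below is an illustration only, with hypothetical function names. Because the report prints Lambda and K rounded to three significant digits, the results agree with the reported values only to within rounding.

```python
import math

# Values copied from the report footer (gapped statistics).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 65889269280  # "effective search space used"


def bit_score(raw_score: int,
              lam: float = GAPPED_LAMBDA,
              k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score: int,
                 search_space: int = SEARCH_SPACE) -> float:
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores are the parenthesized values on the "Score =" lines.
    for raw in (614, 612, 592, 490):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

For the top hit (raw score 614) this gives roughly 241 bits and an E value on the order of 1e-62, in line with the reported 241 bits and 2e-62.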


EST assembly


No.  Clone          Accession  Start   End
1    MFB006c01_f    BP034320       1   623
2    SPDL097f07_f   BP058108      56   381
3    MPDL002b03_f   AV776607      69   381
4    MPD007f10_f    AV770478     172   693
5    MFBL043h09_f   BP043470     179   697
6    MFB077g09_f    BP039649     215   651
7    MF006f07_f     BP028557     279   820
8    MR015c11_f     BP077100     500   880
9    MFBL008h10_f   BP039649     610  1098
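
Note on the assembly: the clone positions listed above are enough to work out how many ESTs support each position of the contig. The following is a minimal sketch under the assumption that each pair of numbers is an inclusive, 1-based start and end on the contig. The function names are hypothetical.

```python
# (clone, accession, start, end) copied from the assembly table.
CLONES = [
    ("MFB006c01_f", "BP034320", 1, 623),
    ("SPDL097f07_f", "BP058108", 56, 381),
    ("MPDL002b03_f", "AV776607", 69, 381),
    ("MPD007f10_f", "AV770478", 172, 693),
    ("MFBL043h09_f", "BP043470", 179, 697),
    ("MFB077g09_f", "BP039649", 215, 651),
    ("MF006f07_f", "BP028557", 279, 820),
    ("MR015c11_f", "BP077100", 500, 880),
    ("MFBL008h10_f", "BP041697", 610, 1098),
]


def depth_at(pos: int, clones=CLONES) -> int:
    """Number of ESTs whose interval contains the given position."""
    return sum(1 for _, _, start, end in clones if start <= pos <= end)


def covered_span(clones=CLONES) -> tuple:
    """Smallest start and largest end over all ESTs."""
    return (min(c[2] for c in clones), max(c[3] for c in clones))


def is_contiguous(clones=CLONES) -> bool:
    """True if the ESTs tile the span with no uncovered position."""
    reach = None
    for _, _, start, end in sorted(clones, key=lambda c: c[2]):
        if reach is not None and start > reach + 1:
            return False
        reach = end if reach is None else max(reach, end)
    return True


if __name__ == "__main__":
    print("span:", covered_span())
    print("contiguous:", is_contiguous())
    for pos in (1, 300, 620, 1000):
        print(pos, depth_at(pos))
```

With these intervals the ESTs overlap without a break, with the deepest support in the middle of the contig and a single EST covering each end.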




Lotus japonicus
Kazusa DNA Research Institute