KMC000156A_c01

Fasta Sequence
>KMC000156A_C01 KMC000156A_c01
caaacatgCAGGAACTTGATAATCGTTTTCGAACCATACAAGCTAAAATCTACCAGGAAA
ACTTTTTACAAGACAACGGTGAATACAGGAGAAAATCAAATTTAAATGGTAAACTATCGG
GGGCTATAAGTAGTCAGGTGTACAACACAGTGGGTATCCGACTTGAGGAAACTTTTCAAT
TTAGTTGATAAGCATAAGATAATAATAAACTCAACAACAAAATTTAAAAAAAAAAAAAAT
ATCACCGGCTAGGTACTTCATTTATCGTTATTCAGCTGGAGCATCAGAATGAATTCTTTC
ATACAAGCGATGTACGAGTGGTGATTTAGACATCTGGGGTTCCCTCTCTAAGACTGCAAT
TACATCCTTCACAGAGATGCTTCGAGCTACCCTAGCTTGAGATGCAACAGCATGATTTCT
CCCATGTTTTCCAGCAGAACCTGAAGCTGCAAGAAAAGCAGTAGAACCTTTTCTCTCCCC
TTCTGGGTCATCCTTGGTACCTCTTCCAGATGTGGATAAGGATTTCGGGTTCACATCATT
GGATGGCTGCGAACCGGATGACGTGTCCATCGCCTCTTCACGTTTCTG
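
All BLASTX hits reported for this contig are in frame -1, meaning the protein alignments come from the reverse complement of the sequence above, read from its last base. The following is a minimal sketch of that translation, not the BLASTX implementation. It assumes the standard genetic code (the report does not state the translation table), and the function names are hypothetical. Only the final line of the FASTA record is used as input, which is enough to reproduce the first residues of the query line that starts at position 588.

```python
# Standard genetic code with codons enumerated in T, C, A, G order.
# Assumption: the report does not say which translation table was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame_minus1(seq):
    """Translate frame -1: codons read on the reverse strand from the last base."""
    rc = reverse_complement(seq)
    codons = (rc[i:i + 3] for i in range(0, len(rc) - 2, 3))
    # Unknown codons (for example, ones containing N) become "X".
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record above (bases 541-588 of the contig).
tail = "GGATGGCTGCGAACCGGATGACGTGTCCATCGCCTCTTCACGTTTCTG"
print(translate_frame_minus1(tail))  # QKREEAMDTSSGSQPS
```

The printed residues match the start of the query line "Query: 588 QKREEAMDTSSGSQPSNDVNPKS..." in the Oryza sativa (CAC39055.1) alignment.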


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000156A_C01 KMC000156A_c01
         (588 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_199127.1| putative protein; protein id: At5g43130.1 [Arab...    69  3e-11
emb|CAC39055.1| putative protein [Oryza sativa]                        63  3e-09
gb|AAO23083.1| unknown protein [Oryza sativa (japonica cultivar-...    60  2e-08
pir||A86402 protein T22C5.17 [imported] - Arabidopsis thaliana g...    54  1e-06
ref|NP_174093.1| hypothetical protein; protein id: At1g27720.1 [...    54  1e-06

>ref|NP_199127.1| putative protein; protein id: At5g43130.1 [Arabidopsis thaliana]
           gi|9757840|dbj|BAB08277.1|
           gb|AAF24960.1~gene_id:MMG4.16~strong similarity to
           unknown protein [Arabidopsis thaliana]
          Length = 689

 Score = 69.3 bits (168), Expect = 3e-11
 Identities = 40/88 (45%), Positives = 56/88 (63%), Gaps = 1/88 (1%)
 Frame = -1

Query: 558 SGSQPSNDVNPKSLSTSGRGTKDDPEGERKGSTAFLAASGSAGKH-GRNHAVASQARVAR 382
           S S+   D N K+ S  G+ +KD  +G R+        SG+ G+  G+N   + Q +V R
Sbjct: 609 SVSEAGKDGNQKTTSGGGKNSKDRQDGGRR-------FSGTGGRRVGKNQGSSLQPKVVR 661

Query: 381 SISVKDVIAVLEREPQMSKSPLVHRLYE 298
           +ISVKDV+AVLEREPQMSKS L++RL +
Sbjct: 662 TISVKDVVAVLEREPQMSKSTLMYRLIQ 689

>emb|CAC39055.1| putative protein [Oryza sativa]
          Length = 691

 Score = 62.8 bits (151), Expect = 3e-09
 Identities = 38/104 (36%), Positives = 59/104 (56%), Gaps = 1/104 (0%)
 Frame = -1

Query: 588 QKREEAMDTSSGSQPSNDVNPKSLSTSGRGTKDDPEGERKGSTAFLAASGSAGKHGRNHA 409
           +++ E +D ++ SQ            +G+G  D  E  ++  +A    +G   + GR   
Sbjct: 589 RQKREGLDLAASSQRGT---ASRSHMAGKGPTDHHEASKRTHSAAFG-TGGMNRQGRGPF 644

Query: 408 VASQAR-VARSISVKDVIAVLEREPQMSKSPLVHRLYERIHSDA 280
            AS  +   R+IS+KDVI VLEREPQM+KS L++RLYER+  D+
Sbjct: 645 AASHPKGPQRTISMKDVICVLEREPQMTKSRLIYRLYERLPGDS 688

>gb|AAO23083.1| unknown protein [Oryza sativa (japonica cultivar-group)]
          Length = 755

 Score = 60.1 bits (144), Expect = 2e-08
 Identities = 38/99 (38%), Positives = 53/99 (53%)
 Frame = -1

Query: 567 DTSSGSQPSNDVNPKSLSTSGRGTKDDPEGERKGSTAFLAASGSAGKHGRNHAVASQARV 388
           D SSGS P N +   S    G+G+++  E E+ G                    +S  +V
Sbjct: 673 DGSSGSMPGNMLPRTSSPKPGKGSREQQEIEKTGGVRR----------------SSHVKV 716

Query: 387 ARSISVKDVIAVLEREPQMSKSPLVHRLYERIHSDAPAE 271
            RSI+VKDVIA LEREPQM KS L+ +LY R  +++ A+
Sbjct: 717 TRSITVKDVIAALEREPQMLKSSLLFQLYGRSPAESSAK 755

>pir||A86402 protein T22C5.17 [imported] - Arabidopsis thaliana
           gi|6693034|gb|AAF24960.1|AC012375_23 T22C5.17
           [Arabidopsis thaliana]
          Length = 697

 Score = 54.3 bits (129), Expect = 1e-06
 Identities = 26/37 (70%), Positives = 33/37 (88%)
 Frame = -1

Query: 393 RVARSISVKDVIAVLEREPQMSKSPLVHRLYERIHSD 283
           +V RSISVKDVIAV+E+EPQMS+S L++R+Y RI SD
Sbjct: 660 KVVRSISVKDVIAVVEKEPQMSRSTLLYRVYNRICSD 696

>ref|NP_174093.1| hypothetical protein; protein id: At1g27720.1 [Arabidopsis
           thaliana]
          Length = 617

 Score = 54.3 bits (129), Expect = 1e-06
 Identities = 26/37 (70%), Positives = 33/37 (88%)
 Frame = -1

Query: 393 RVARSISVKDVIAVLEREPQMSKSPLVHRLYERIHSD 283
           +V RSISVKDVIAV+E+EPQMS+S L++R+Y RI SD
Sbjct: 580 KVVRSISVKDVIAVVEKEPQMSRSTLLYRVYNRICSD 616

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 504,516,859
Number of Sequences: 1393205
Number of extensions: 10719180
Number of successful extensions: 30221
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 28800
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30197
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 22283372436
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
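
The Expect values in the hit list can be checked against the statistics above. The sketch below uses the standard relation E = (effective search space) x 2^(-bit score). The function and constant names are hypothetical. Bit scores are printed to one decimal place in the report, so the recomputed values are approximate. The effective database length is compared with letters minus sequences times effective HSP length, which reproduces the reported figure for this run.

```python
def expect_from_bits(bit_score, search_space):
    """Approximate E-value from a bit score and the effective search space."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from the statistics block of the report above.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 117
SEARCH_SPACE = 22_283_372_436

# Observation for this run: each database sequence is shortened by the
# effective HSP length.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
print(effective_db_length)  # 285684262

# Bit scores of the five reported hits (two hits share 54.3 bits).
for bits in (69.3, 62.8, 60.1, 54.3):
    print(f"{bits:5.1f} bits -> E ~ {expect_from_bits(bits, SEARCH_SPACE):.0e}")
```

Rounded to one significant digit, the recomputed values agree with the reported 3e-11, 3e-09, 2e-08 and 1e-06.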


EST assemble image


No.  Clone          Accession  Position start  Position end
1    SPDL095f05_f   BP057978                1           549
2    SPDL070d11_f   BP056340                9           480
3    GENLf019g05    BP063389               19           141
4    SPDL027f09_f   BP053686               30           289
5    MRL036d07_f    BP085484               46           261
6    SPDL047c11_f   BP054937               50           227
7    GENLf022g03    BP063526               85           571
8    GENLf006a06    BP062638               89           586
9    MPDL086d06_f   AV780985               94           596




Lotus japonicus
Kazusa DNA Research Institute