KMC002047A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC002047A_C01 KMC002047A_c01
catGCAAAAACCAGAGTCAGGGACACCTTATTACATGACGGGCCCCGTGGTGGCTGTTAT
GAAATTAATAAAGAAATGGTATGCAGCTAGCAATAGAGTACATGGGGACCAACCCTCAGA
ATACAAATAGTTGGCAATAAAACAACAATGATCAATGCCTTTTTGCTGGTCTCCATTGAA
CACTAAAAAGGAGAGAATCACTAGTTACCGAGTGATGGATAATCAGTACATGTGTGTCTG
AACGCTGAAGACAAACAATTGGCAAACAATGCTTGTACTTTTTCGGAGCACCTAGAACCA
TAGCTAAGTAGGACAAATTGAGAGTACAGCGTGCCGTCCTTGAGTAACAAACTACTTCAT
ACCATGTCAGTGAGATCTAACTAGACCAAAATTTCATGCCATACTACAATGTATCAAACT
GACAACTTATTTCATATATTTGTTTGCATATACACTCACACCTTTTGTTTCCATCATGTA
CTGTTGTTAGACTGATTAACTACTCCATTTGTCCACTTAATGCTGCTTCCTCACCCTCAG
GATCCAATTCTTGTTCCACACCTTTGACCTCAGGTACATAATGCATCAGCATATTTTCGA
TGCCGGATTTTAGAGTAACTGATGAACTCGGGCAGCCACTACATGCTCCTTGCATTTGAA
GTTTCACTATTCCGGTATCTGGGTCAGAGCCTCTATATACAATGTCCCCACCATCATCTT
GCACAGCTGGTCGAATACGGGTTTCCAGCAACTCCTTGATCATCGCAACTATTTCAGAAT
CATCATCTTTGATAGCAGTATCCATGGCAGCAGAAGTCTGAGAGTCCAAAAAGAGGGGCT
GGCCGGAAGAGTAGAAGTCCATGATAGCGGCGAAGATTTCAGGCTTAAGGAATTCCCAGG
AAGCATCTTCGGATTTGGTAACCGTGACGAAATCCGATCCAAAGAAGATACGAGTAATCC
CGTCGATGGCGAAGAGTGATTTAGCGAGGGGagaattcatggcggaacgagggttgggga
agtcggcgcttccaacttccataacaggcttgccagggtgaaacatgagag


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002047A_C01 KMC002047A_c01
         (1071 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAK70910.1|AC087551_9 hypothetical protein [Oryza sativa]          323  3e-87
ref|NP_566673.1| expressed protein; protein id: At3g20970.1, sup...   315  6e-85
pir||A96552 unknown protein, 90320-88994 [imported] - Arabidopsi...   303  3e-81
ref|NP_175550.1| unknown protein; protein id: At1g51390.1 [Arabi...   288  1e-76
dbj|BAC24985.1| unnamed protein product [Mus musculus]                209  6e-53

>gb|AAK70910.1|AC087551_9 hypothetical protein [Oryza sativa]
          Length = 272

 Score =  323 bits (828), Expect = 3e-87
 Identities = 159/189 (84%), Positives = 177/189 (93%)
 Frame = -3

Query: 1069 LMFHPGKPVMEVGSADFPNPRSAMNSPLAKSLFAIDGITRIFFGSDFVTVTKSEDASWEF 890
            LMF+PGKPVMEVGS+DFPN R+AM SPLAK+LFAIDG+TR+FFGSDFVTVTKSE+ SW++
Sbjct: 85   LMFYPGKPVMEVGSSDFPNARTAMTSPLAKALFAIDGVTRVFFGSDFVTVTKSEETSWDY 144

Query: 889  LKPEIFAAIMDFYSSGQPLFLDSQTSAAMDTAIKDDDSEIVAMIKELLETRIRPAVQDDG 710
            LKPE+FA IMDFYSSGQ LFLDS T+A+MDTAI +DDSEIVAMIKELLETRIRPAVQDDG
Sbjct: 145  LKPEVFAVIMDFYSSGQSLFLDSSTAASMDTAIHEDDSEIVAMIKELLETRIRPAVQDDG 204

Query: 709  GDIVYRGSDPDTGIVKLQMQGACSGCPSSSVTLKSGIENMLMHYVPEVKGVEQELDPEGE 530
            GDI YRG DP+TGIVKL+MQGACSGCPSSSVTLKSGIENMLMHYVPEVKGVEQELD + E
Sbjct: 205  GDIEYRGFDPETGIVKLKMQGACSGCPSSSVTLKSGIENMLMHYVPEVKGVEQELDGD-E 263

Query: 529  EAALSGQME 503
            EA L+GQ+E
Sbjct: 264  EAELTGQLE 272

>ref|NP_566673.1| expressed protein; protein id: At3g20970.1, supported by cDNA:
            gi_13899084, supported by cDNA: gi_18377515 [Arabidopsis
            thaliana] gi|9294004|dbj|BAB01907.1|
            gb|AAD30650.1~gene_id:MFD22.10~similar to unknown protein
            [Arabidopsis thaliana]
            gi|13899085|gb|AAK48964.1|AF370537_1 Unknown protein
            [Arabidopsis thaliana] gi|18377516|gb|AAL66924.1| unknown
            protein [Arabidopsis thaliana]
            gi|28207822|emb|CAD55561.1| NFU4 protein [Arabidopsis
            thaliana]
          Length = 283

 Score =  315 bits (808), Expect = 6e-85
 Identities = 154/188 (81%), Positives = 170/188 (89%)
 Frame = -3

Query: 1069 LMFHPGKPVMEVGSADFPNPRSAMNSPLAKSLFAIDGITRIFFGSDFVTVTKSEDASWEF 890
            LMF+PGKPVMEVGSADFPN RSA+ SPLAKS+++IDG+ R+FFGSDFVTVTKS+D SW+ 
Sbjct: 93   LMFYPGKPVMEVGSADFPNVRSALGSPLAKSIYSIDGVVRVFFGSDFVTVTKSDDVSWDI 152

Query: 889  LKPEIFAAIMDFYSSGQPLFLDSQTSAAMDTAIKDDDSEIVAMIKELLETRIRPAVQDDG 710
            LKPEIFAA+MDFYSSGQPLFLDSQ +AA DTAI +DDSE VAMIKELLETRIRPAVQDDG
Sbjct: 153  LKPEIFAAVMDFYSSGQPLFLDSQAAAAKDTAISEDDSETVAMIKELLETRIRPAVQDDG 212

Query: 709  GDIVYRGSDPDTGIVKLQMQGACSGCPSSSVTLKSGIENMLMHYVPEVKGVEQELDPEGE 530
            GDI Y G DP++GIVKL+MQGACSGCPSSSVTLKSGIENMLMHYV EVKGVEQE D E E
Sbjct: 213  GDIEYCGFDPESGIVKLRMQGACSGCPSSSVTLKSGIENMLMHYVSEVKGVEQEFDGEDE 272

Query: 529  EAALSGQM 506
            E  LSG+M
Sbjct: 273  EGTLSGEM 280

>pir||A96552 unknown protein, 90320-88994 [imported] - Arabidopsis thaliana
            gi|12325368|gb|AAG52627.1|AC024261_14 unknown protein;
            90320-88994 [Arabidopsis thaliana]
            gi|28207824|emb|CAD55562.1| NFU5 protein [Arabidopsis
            thaliana]
          Length = 275

 Score =  303 bits (776), Expect = 3e-81
 Identities = 150/189 (79%), Positives = 167/189 (87%)
 Frame = -3

Query: 1069 LMFHPGKPVMEVGSADFPNPRSAMNSPLAKSLFAIDGITRIFFGSDFVTVTKSEDASWEF 890
            LMF PGKPVME+GSADFPN RSAM+SPLAK++FAIDG+ R+F+GSDFVTVTKS+D +W+ 
Sbjct: 88   LMFSPGKPVMEIGSADFPNSRSAMSSPLAKAIFAIDGVVRVFYGSDFVTVTKSDDVTWDI 147

Query: 889  LKPEIFAAIMDFYSSGQPLFLDSQTSAAMDTAIKDDDSEIVAMIKELLETRIRPAVQDDG 710
            LKP+IFA +MDFYSSGQPLFLDSQ +AA DTAI +DDSE VAMIKELLETRIRP+VQDDG
Sbjct: 148  LKPDIFAVVMDFYSSGQPLFLDSQATAAKDTAIHEDDSETVAMIKELLETRIRPSVQDDG 207

Query: 709  GDIVYRGSDPDTGIVKLQMQGACSGCPSSSVTLKSGIENMLMHYVPEVKGVEQELDPEGE 530
            GDI Y G D +TGIVKL+MQGACSGCPSSSVTLKSGIENMLMHYV EVKGVEQE D E E
Sbjct: 208  GDIEYCGFDTETGIVKLRMQGACSGCPSSSVTLKSGIENMLMHYVSEVKGVEQEFDGE-E 266

Query: 529  EAALSGQME 503
            E   SG ME
Sbjct: 267  EGTSSGPME 275

>ref|NP_175550.1| unknown protein; protein id: At1g51390.1 [Arabidopsis thaliana]
            gi|4836948|gb|AAD30650.1|AC006085_23 Similar to human
            CGI-33 protein [Arabidopsis thaliana]
          Length = 304

 Score =  288 bits (737), Expect = 1e-76
 Identities = 151/218 (69%), Positives = 167/218 (76%), Gaps = 29/218 (13%)
 Frame = -3

Query: 1069 LMFHPGKPVMEVGSADFPNPRSAMNSPLAKSLFAIDGI---------------------- 956
            LMF PGKPVME+GSADFPN RSAM+SPLAK++FAIDGI                      
Sbjct: 88   LMFSPGKPVMEIGSADFPNSRSAMSSPLAKAIFAIDGIPRLLLQHTIVSSSYNPCFVTKI 147

Query: 955  -------TRIFFGSDFVTVTKSEDASWEFLKPEIFAAIMDFYSSGQPLFLDSQTSAAMDT 797
                    R+F+GSDFVTVTKS+D +W+ LKP+IFA +MDFYSSGQPLFLDSQ +AA DT
Sbjct: 148  VSVDAGVVRVFYGSDFVTVTKSDDVTWDILKPDIFAVVMDFYSSGQPLFLDSQATAAKDT 207

Query: 796  AIKDDDSEIVAMIKELLETRIRPAVQDDGGDIVYRGSDPDTGIVKLQMQGACSGCPSSSV 617
            AI +DDSE VAMIKELLETRIRP+VQDDGGDI Y G D +TGIVKL+MQGACSGCPSSSV
Sbjct: 208  AIHEDDSETVAMIKELLETRIRPSVQDDGGDIEYCGFDTETGIVKLRMQGACSGCPSSSV 267

Query: 616  TLKSGIENMLMHYVPEVKGVEQELDPEGEEAALSGQME 503
            TLKSGIENMLMHYV EVKGVEQE D E EE   SG ME
Sbjct: 268  TLKSGIENMLMHYVSEVKGVEQEFDGE-EEGTSSGPME 304

>dbj|BAC24985.1| unnamed protein product [Mus musculus]
          Length = 200

 Score =  209 bits (532), Expect = 6e-53
 Identities = 98/179 (54%), Positives = 133/179 (73%), Gaps = 1/179 (0%)
 Frame = -3

Query: 1069 LMFHPGKPVMEVGSADFPNPRSAMNSPLAKSLFAIDGITRIFFGSDFVTVTK-SEDASWE 893
            L F PGKPV+E  + DFP P +A  SPLA+ LF I+G+  +FFG DF+TVTK +E+  W 
Sbjct: 14   LKFIPGKPVLETRTMDFPTPAAAFRSPLARQLFRIEGVKSVFFGPDFITVTKENEELDWN 73

Query: 892  FLKPEIFAAIMDFYSSGQPLFLDSQTSAAMDTAIKDDDSEIVAMIKELLETRIRPAVQDD 713
             LKP+I+A IMDF++SG PL  +       +    ++D E+VAMIKELL+TRIRP VQ+D
Sbjct: 74   LLKPDIYATIMDFFASGLPLVTEETPPPPGEAGSSEEDDEVVAMIKELLDTRIRPTVQED 133

Query: 712  GGDIVYRGSDPDTGIVKLQMQGACSGCPSSSVTLKSGIENMLMHYVPEVKGVEQELDPE 536
            GGD++YRG   + GIV+L++QG+C+ CPSS +TLKSGI+NML  Y+PEV+GVEQ +D +
Sbjct: 134  GGDVIYRGF--EDGIVRLKLQGSCTSCPSSIITLKSGIQNMLQFYIPEVEGVEQVMDDD 190

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 958,254,957
Number of Sequences: 1393205
Number of extensions: 22222659
Number of successful extensions: 58812
Number of sequences better than 10.0: 169
Number of HSP's better than 10.0 without gapping: 55220
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 58632
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 64016183864
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MWL066h10_f AV769760 1 548
2 GENf033e07 BP059753 4 134
3 MFB092h09_f BP040749 180 625
4 MWM095e05_f AV766284 505 1071




Lotus japonicus
Kazusa DNA Research Institute