KMC017994A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC017994A_C01 KMC017994A_c01
gaaccaccgcgcgcaaccgcagctagcggcagctgaacgcgcttttgcgacggaggcagc
gaaatcggtcgctccttcggCGGATCCGGTGAAGTGGGACTACAGAGGGCAGAGGAAGAT
AATCCCTCTGGGGCAGTGGCTTCCTAAGATTGCCGTCGATGCTTACGTGGCACCTAACGT
GGTTCTCGCCGGTCAAGTCACCGTCTGGGATGGGGCATCCGTGTGGCCGGGTTGCGTTCT
CCGGGGCGATCTCAACAAGATCAGCATCGGCTTCTGCTCCAATGTTCAGGAACGGTGTGT
TCTTCACGCCGCTTGGTCTTCTCCCACAGGCCTTCCAGCTGAGACCACAGTAGAGAGGTA
TGTGACTGTTGGGGCATACAGCCTGTTGAGGTCCTGCACTATTGAGCCAGAGTGCATTAT
TGGGCAGCACTCCATCCTCATGGAAGGTTCATTGGTGGAGACACAATCAATCCTTGAAGC
TGGGTCAGTCGTTCCACCTGGAAGGCGAATTCCATCAGGTGAACTCTGGGCAGGAAATCC
AGCCAGGTTTGTGAGGACTTTGACCCATGAAGAAATCCTAGAAATCCCCAAACTTGCAGT
TGCAATTAATGATCTGAGTAGAGATCATTTCGATGAGTTCCTTCCCTATTCTACAGTATA
TTTGGAGGTTGAGAAGTTCAAGAAATCCTTGGGTATTGCTGTTTAATGCTCTTTGTTGCT
AGTCTTCAATGTATCTGTTAGTTGGCGTTTTGCTTGTAAGGAAATAAGAATTAAAACAGA
GAGCAAGCTTTTGTGTTTTCTTTGTATTCCTTGCTGAAAGTATTGGCTTAaaacatcatg
tcctgtataattgactggagtaatatgtgatcagatgcattatctgagtcttggacatta
gtttttccct


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC017994A_C01 KMC017994A_c01
         (910 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_201156.1| putative protein; protein id: At5g63510.1, supp...   392  e-108
gb|AAM64682.1| unknown [Arabidopsis thaliana]                         391  e-108
gb|AAK96778.1| Unknown protein [Arabidopsis thaliana] gi|1797879...   390  e-107
ref|NP_190437.1| putative protein; protein id: At3g48680.1, supp...   389  e-107
gb|AAM61583.1| unknown [Arabidopsis thaliana]                         140  3e-32

>ref|NP_201156.1| putative protein; protein id: At5g63510.1, supported by cDNA:
           31971., supported by cDNA: gi_15451013, supported by
           cDNA: gi_17978794 [Arabidopsis thaliana]
           gi|25091501|sp|Q9FMV1|UMP7_ARATH Unknown mitochondrial
           protein At5g63510 gi|9758292|dbj|BAB08816.1| contains
           similarity to acetyltransferase~gene_id:MLE2.14
           [Arabidopsis thaliana]
          Length = 252

 Score =  392 bits (1008), Expect = e-108
 Identities = 196/230 (85%), Positives = 210/230 (91%), Gaps = 5/230 (2%)
 Frame = +2

Query: 29  AAERAFA--TEAAK---SVAPSADPVKWDYRGQRKIIPLGQWLPKIAVDAYVAPNVVLAG 193
           AAE A A  TE  K   +V+PS D VKWDYRGQR+IIPLGQWLPK+AVDAYVAPNVVLAG
Sbjct: 23  AAEAALARKTELPKPQFTVSPSTDRVKWDYRGQRQIIPLGQWLPKVAVDAYVAPNVVLAG 82

Query: 194 QVTVWDGASVWPGCVLRGDLNKISIGFCSNVQERCVLHAAWSSPTGLPAETTVERYVTVG 373
           QVTVWDG+SVW G VLRGDLNKI++GFCSNVQERCV+HAAWSSPTGLPA T ++RYVTVG
Sbjct: 83  QVTVWDGSSVWNGAVLRGDLNKITVGFCSNVQERCVVHAAWSSPTGLPAATIIDRYVTVG 142

Query: 374 AYSLLRSCTIEPECIIGQHSILMEGSLVETQSILEAGSVVPPGRRIPSGELWAGNPARFV 553
           AYSLLRSCTIEPECIIGQHSILMEGSLVET+SILEAGSVVPPGRRIPSGELW GNPARF+
Sbjct: 143 AYSLLRSCTIEPECIIGQHSILMEGSLVETRSILEAGSVVPPGRRIPSGELWGGNPARFI 202

Query: 554 RTLTHEEILEIPKLAVAINDLSRDHFDEFLPYSTVYLEVEKFKKSLGIAV 703
           RTLT+EE LEIPKLAVAIN LS D+F EFLPYSTVYLEVEKFKKSLGIAV
Sbjct: 203 RTLTNEETLEIPKLAVAINHLSGDYFSEFLPYSTVYLEVEKFKKSLGIAV 252

>gb|AAM64682.1| unknown [Arabidopsis thaliana]
          Length = 252

 Score =  391 bits (1004), Expect = e-108
 Identities = 195/230 (84%), Positives = 210/230 (90%), Gaps = 5/230 (2%)
 Frame = +2

Query: 29  AAERAFA--TEAAK---SVAPSADPVKWDYRGQRKIIPLGQWLPKIAVDAYVAPNVVLAG 193
           AAE A A  TE  K   +V+PS D VKWDYRGQR+IIPLGQWLPK+AVDAYVAPNVVLAG
Sbjct: 23  AAEAALARKTELPKPQFTVSPSTDLVKWDYRGQRQIIPLGQWLPKVAVDAYVAPNVVLAG 82

Query: 194 QVTVWDGASVWPGCVLRGDLNKISIGFCSNVQERCVLHAAWSSPTGLPAETTVERYVTVG 373
           QVTVWDG+SVW G VLRGDLNKI++GFCSNVQERCV+HAAWSSPTGLPA T ++RYVTVG
Sbjct: 83  QVTVWDGSSVWNGAVLRGDLNKITVGFCSNVQERCVVHAAWSSPTGLPAATIIDRYVTVG 142

Query: 374 AYSLLRSCTIEPECIIGQHSILMEGSLVETQSILEAGSVVPPGRRIPSGELWAGNPARFV 553
           AYSLLRSCTIEPECIIGQHSILMEGSLVET+SILEAGSVVPPGRRIPSGELW GNPARF+
Sbjct: 143 AYSLLRSCTIEPECIIGQHSILMEGSLVETRSILEAGSVVPPGRRIPSGELWGGNPARFI 202

Query: 554 RTLTHEEILEIPKLAVAINDLSRDHFDEFLPYSTVYLEVEKFKKSLGIAV 703
           RTLT+EE LEIPKLA+AIN LS D+F EFLPYSTVYLEVEKFKKSLGIAV
Sbjct: 203 RTLTNEETLEIPKLALAINHLSGDYFSEFLPYSTVYLEVEKFKKSLGIAV 252

>gb|AAK96778.1| Unknown protein [Arabidopsis thaliana] gi|17978795|gb|AAL47391.1|
           unknown protein [Arabidopsis thaliana]
          Length = 252

 Score =  390 bits (1001), Expect = e-107
 Identities = 195/230 (84%), Positives = 209/230 (90%), Gaps = 5/230 (2%)
 Frame = +2

Query: 29  AAERAFA--TEAAK---SVAPSADPVKWDYRGQRKIIPLGQWLPKIAVDAYVAPNVVLAG 193
           AAE A A  TE  K   +V+PS D VKWDYRGQR+IIPLGQWLPK+AVDAYVAPNVVLAG
Sbjct: 23  AAEAALARKTELPKPQFTVSPSTDRVKWDYRGQRQIIPLGQWLPKVAVDAYVAPNVVLAG 82

Query: 194 QVTVWDGASVWPGCVLRGDLNKISIGFCSNVQERCVLHAAWSSPTGLPAETTVERYVTVG 373
           QVTVWDG+SVW G VLRGDLNKI++GFCSNVQ RCV+HAAWSSPTGLPA T ++RYVTVG
Sbjct: 83  QVTVWDGSSVWNGAVLRGDLNKITVGFCSNVQGRCVVHAAWSSPTGLPAATIIDRYVTVG 142

Query: 374 AYSLLRSCTIEPECIIGQHSILMEGSLVETQSILEAGSVVPPGRRIPSGELWAGNPARFV 553
           AYSLLRSCTIEPECIIGQHSILMEGSLVET+SILEAGSVVPPGRRIPSGELW GNPARF+
Sbjct: 143 AYSLLRSCTIEPECIIGQHSILMEGSLVETRSILEAGSVVPPGRRIPSGELWGGNPARFI 202

Query: 554 RTLTHEEILEIPKLAVAINDLSRDHFDEFLPYSTVYLEVEKFKKSLGIAV 703
           RTLT+EE LEIPKLAVAIN LS D+F EFLPYSTVYLEVEKFKKSLGIAV
Sbjct: 203 RTLTNEETLEIPKLAVAINHLSGDYFSEFLPYSTVYLEVEKFKKSLGIAV 252

>ref|NP_190437.1| putative protein; protein id: At3g48680.1, supported by cDNA:
           gi_13430603, supported by cDNA: gi_15293166 [Arabidopsis
           thaliana] gi|25091504|sp|Q9SMN1|UMP8_ARATH Unknown
           mitochondrial protein At3g48680 gi|11358444|pir||T46212
           hypothetical protein T8P19.190 - Arabidopsis thaliana
           gi|6523099|emb|CAB62357.1| putative protein [Arabidopsis
           thaliana] gi|13430604|gb|AAK25924.1|AF360214_1 unknown
           protein [Arabidopsis thaliana]
           gi|15293167|gb|AAK93694.1| unknown protein [Arabidopsis
           thaliana]
          Length = 256

 Score =  389 bits (999), Expect = e-107
 Identities = 190/229 (82%), Positives = 206/229 (88%), Gaps = 3/229 (1%)
 Frame = +2

Query: 26  AAAERAFATEAAK---SVAPSADPVKWDYRGQRKIIPLGQWLPKIAVDAYVAPNVVLAGQ 196
           A A     TE  K    V PS D VKWDYRGQR+IIPLGQWLPK+AVDAYVAPNVVLAGQ
Sbjct: 28  AEAVAVATTETPKPKSQVTPSPDRVKWDYRGQRQIIPLGQWLPKVAVDAYVAPNVVLAGQ 87

Query: 197 VTVWDGASVWPGCVLRGDLNKISIGFCSNVQERCVLHAAWSSPTGLPAETTVERYVTVGA 376
           VTVWDG+SVW G VLRGDLNKI++GFCSNVQERCV+HAAWSSPTGLPA+T ++RYVTVGA
Sbjct: 88  VTVWDGSSVWNGAVLRGDLNKITVGFCSNVQERCVVHAAWSSPTGLPAQTLIDRYVTVGA 147

Query: 377 YSLLRSCTIEPECIIGQHSILMEGSLVETQSILEAGSVVPPGRRIPSGELWAGNPARFVR 556
           YSLLRSCTIEPECIIGQHSILMEGSLVET+SILEAGSV+PPGRRIPSGELW GNPARF+R
Sbjct: 148 YSLLRSCTIEPECIIGQHSILMEGSLVETRSILEAGSVLPPGRRIPSGELWGGNPARFIR 207

Query: 557 TLTHEEILEIPKLAVAINDLSRDHFDEFLPYSTVYLEVEKFKKSLGIAV 703
           TLT+EE LEIPKLAVAIN LS D+F EFLPYST+YLEVEKFKKSLGIA+
Sbjct: 208 TLTNEETLEIPKLAVAINHLSGDYFSEFLPYSTIYLEVEKFKKSLGIAI 256

>gb|AAM61583.1| unknown [Arabidopsis thaliana]
          Length = 275

 Score =  140 bits (352), Expect = 3e-32
 Identities = 74/183 (40%), Positives = 114/183 (61%)
 Frame = +2

Query: 143 PKIAVDAYVAPNVVLAGQVTVWDGASVWPGCVLRGDLNKISIGFCSNVQERCVLHAAWSS 322
           P +  DA+VAP+  + G V +  G+S+W GCVLRGD+N +S+G  +N+Q+  ++H A S+
Sbjct: 53  PIVDKDAFVAPSASVIGDVHIGRGSSIWYGCVLRGDVNTVSVGSGTNIQDNSLVHVAKSN 112

Query: 323 PTGLPAETTVERYVTVGAYSLLRSCTIEPECIIGQHSILMEGSLVETQSILEAGSVVPPG 502
            +G    T +   VT+G  ++L  CT+E E  IG  + L++G +VE   ++ AG++V   
Sbjct: 113 LSGKVHPTIIGDNVTIGHSAVLHGCTVEDETFIGMGATLLDGVVVEKHGMVAAGALVRQN 172

Query: 503 RRIPSGELWAGNPARFVRTLTHEEILEIPKLAVAINDLSRDHFDEFLPYSTVYLEVEKFK 682
            RIPSGE+W GNPARF+R LT EEI  I + A   ++L++ H  E    +   L V +F+
Sbjct: 173 TRIPSGEVWGGNPARFLRKLTDEEIAFISQSATNYSNLAQAHAAE----NAKPLNVIEFE 228

Query: 683 KSL 691
           K L
Sbjct: 229 KVL 231

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 841,639,337
Number of Sequences: 1393205
Number of extensions: 20736124
Number of successful extensions: 69769
Number of sequences better than 10.0: 295
Number of HSP's better than 10.0 without gapping: 64076
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 69333
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 49641180728
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB041f11_f BP037006 1 531
2 SPD012e06_f BP044960 418 910




Lotus japonicus
Kazusa DNA Research Institute