KMC000712A_c02

Fasta Sequence
>KMC000712A_C02 KMC000712A_c02
atccaaacaggatctatatatatcaacaatagtatagtccactaatgattgatatcgcat
aggatattcttccagctactTCCCAACCGTGAACAATTGGGAAAACCGACATGCTGAATC
CGCTACCCAGGAGAATTAGTACATATCATGCATAGTTCCTGCAGAAGCATAAAAAGGAAA
GTGCACTCTCCCTACGCAGGCATCACCCTGATCAGGTTCAAAGGGACTGTCTCAGCCTGA
TCTGCTAGCTTTCCAGCTTATTAGCCAAAGCTCAAGCAGATTCAGGCACCCCTGGTGCAT
CTTGGGAGTTAATAATGGCTTAAAGGGGAAAAATGGTACTAAACATGATACTAGATCAAT
TTATGGCTGAGAATGGGATAACACAACAAAAGCAAAAGCAGAATATAAATGGAGTTCTTA
AATGTTGTCCACAAACTGAAGTTTTATCTCCTATAGCTTGAAAATAGTTCACCATCCTGA
TCCTGATAGGCAAGTGGTACAGCTGCAGCTAGCTCCAAGTTCTCTCAAGTGATTTGCACC
ATTTATATAGTCCTTATTTAAAAAGTAATACAATATATTTAGCCTTGCTTGAGTTGCTTC
TCCTTGGAACTCAAGTAAGATGCTGGCAAGTAAATCTCTCCACCAGCTTCACCATGTTGT
TCCCCGCTGACGGTTTTTTGGGCAGATGGAAATTCAGCACTCATAGCATCCATTCCACGC
AGCAACGCTTCCTCAGCAGTCCCATAGCCATGTTTCTCAGCATACTTCCTCACATCTTCA
GTTATTTTCATAGAGCAGAATTTAGGACCACACATTGAGCAGAAATGTGCTACCTTTGCA
CCTTCTGATGGCAACGATTCATCATGGAAGGACATGGCTGTCATTGGATCCAATGACAAA
GCAAACTGGTCCATCCATCGAAATTCAAACCTTGCCTTGCTTAACGCATCATCCCATTCT
TGAGCATGTGGATGACCTTTGGCTAAATCAGCAGC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000712A_C02 KMC000712A_c02
         (995 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAK26130.1|AC084406_13 putative thiamin biosynthesis protein ...   238  8e-62
ref|NP_180524.1| putative thiamin biosynthesis protein; protein ...   236  3e-61
gb|AAG49550.1|AF264021_1 putative thiamine biosythesis protein T...   233  3e-60
ref|NP_301331.1| putative thiamine biosynthesis protein [Mycobac...   155  7e-37
ref|NP_214937.1| thiC [Mycobacterium tuberculosis H37Rv] gi|3024...   152  6e-36

>gb|AAK26130.1|AC084406_13 putative thiamin biosynthesis protein [Oryza sativa]
          Length = 628

 Score =  238 bits (608), Expect = 8e-62
 Identities = 108/131 (82%), Positives = 124/131 (94%)
 Frame = -1

Query: 995 AADLAKGHPHAQEWDDALSKARFEFRWMDQFALSLDPMTAMSFHDESLPSEGAKVAHFCS 816
           AADLAKGHP+AQ WDD LSKARFEFRW+DQFALSLDP+TAMSFHDE+LPSEGAKVAHFCS
Sbjct: 498 AADLAKGHPYAQAWDDTLSKARFEFRWLDQFALSLDPVTAMSFHDETLPSEGAKVAHFCS 557

Query: 815 MCGPKFCSMKITEDVRKYAEKHGYGTAEEALLRGMDAMSAEFPSAQKTVSGEQHGEAGGE 636
           MCGPKFCSMKITED+RKYA++HGYGT EEA+++GM+AMSAEF +A+KT+SGEQHGEAGGE
Sbjct: 558 MCGPKFCSMKITEDIRKYADEHGYGTVEEAVIQGMNAMSAEFSAARKTISGEQHGEAGGE 617

Query: 635 IYLPASYLSSK 603
           IY+P SY + K
Sbjct: 618 IYVPESYTARK 628
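
"Frame = -1" means the query was translated from its reverse-complement strand, which is why the Query coordinates run downward from 995. The sketch below is a minimal illustration, not part of the BLAST output: it reverse-complements the last 35 nucleotides of the FASTA record above (positions 961-995) and translates them with the standard genetic code. The helper names are hypothetical.

```python
# Standard genetic code (NCBI translation table 1), bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from the first base; unknown codons become X."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


# Last line of the FASTA record (contig positions 961-995).
TAIL = "TGAGCATGTGGATGACCTTTGGCTAAATCAGCAGC"

# Frame -1 starts at the last base of the query and reads backwards.
peptide = translate(reverse_complement(TAIL))
print(peptide)
```

The translated residues agree with the start of the first Query line (AADLAKGHPHA...), which begins at position 995.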

>ref|NP_180524.1| putative thiamin biosynthesis protein; protein id: At2g29630.1,
           supported by cDNA: gi_20260179 [Arabidopsis thaliana]
           gi|25319657|pir||F84698 probable thiamin biosynthesis
           protein [imported] - Arabidopsis thaliana
           gi|3582335|gb|AAC35232.1| putative thiamin biosynthesis
           protein [Arabidopsis thaliana]
           gi|20260180|gb|AAM12988.1| putative thiamin biosynthesis
           protein [Arabidopsis thaliana]
           gi|22136156|gb|AAM91156.1| putative thiamin biosynthesis
           protein [Arabidopsis thaliana]
          Length = 644

 Score =  236 bits (603), Expect = 3e-61
 Identities = 109/133 (81%), Positives = 123/133 (91%)
 Frame = -1

Query: 995 AADLAKGHPHAQEWDDALSKARFEFRWMDQFALSLDPMTAMSFHDESLPSEGAKVAHFCS 816
           AADLAK HPHAQ WDDALSKARFEFRWMDQFALSLDPMTAMSFHDE+LP++GAKVAHFCS
Sbjct: 512 AADLAKQHPHAQAWDDALSKARFEFRWMDQFALSLDPMTAMSFHDETLPADGAKVAHFCS 571

Query: 815 MCGPKFCSMKITEDVRKYAEKHGYGTAEEALLRGMDAMSAEFPSAQKTVSGEQHGEAGGE 636
           MCGPKFCSMKITED+RKYAE++GYG+AEEA+ +GMDAMS EF  A+KT+SGEQHGE GGE
Sbjct: 572 MCGPKFCSMKITEDIRKYAEENGYGSAEEAIRQGMDAMSEEFNIAKKTISGEQHGEVGGE 631

Query: 635 IYLPASYLSSKEK 597
           IYLP SY+ + +K
Sbjct: 632 IYLPESYVKAAQK 644

>gb|AAG49550.1|AF264021_1 putative thiamine biosythesis protein ThiC [Poa secunda]
          Length = 643

 Score =  233 bits (595), Expect = 3e-60
 Identities = 107/131 (81%), Positives = 122/131 (92%)
 Frame = -1

Query: 995 AADLAKGHPHAQEWDDALSKARFEFRWMDQFALSLDPMTAMSFHDESLPSEGAKVAHFCS 816
           AADLAK HP+AQ WDDALSKARFEFRW+DQFALSLDP+TAMSFHDE+LPS+GAKVAHFCS
Sbjct: 513 AADLAKCHPYAQAWDDALSKARFEFRWLDQFALSLDPVTAMSFHDETLPSDGAKVAHFCS 572

Query: 815 MCGPKFCSMKITEDVRKYAEKHGYGTAEEALLRGMDAMSAEFPSAQKTVSGEQHGEAGGE 636
           MCGPKFCSMKITED+RKYA++HGYGT EEA+ +GM  MSAEF +A+KT+SGEQHGEAGGE
Sbjct: 573 MCGPKFCSMKITEDIRKYADEHGYGTVEEAVRQGMSDMSAEFTAARKTISGEQHGEAGGE 632

Query: 635 IYLPASYLSSK 603
           IY+P SYL+ K
Sbjct: 633 IYVPESYLAKK 643

>ref|NP_301331.1| putative thiamine biosynthesis protein [Mycobacterium leprae]
           gi|7388313|sp|Q9ZBL0|THIC_MYCLE Thiamine biosynthesis
           protein thiC gi|11279601|pir||T44743 probable thiamin
           biosythesis protein thiC [imported] - Mycobacterium
           leprae gi|4154058|emb|CAA22712.1| putative thiamine
           biosythesis protein ThiC [Mycobacterium leprae]
           gi|13092616|emb|CAC29802.1| putative thiamine
           biosynthesis protein [Mycobacterium leprae]
          Length = 547

 Score =  155 bits (393), Expect = 7e-37
 Identities = 81/124 (65%), Positives = 88/124 (70%)
 Frame = -1

Query: 995 AADLAKGHPHAQEWDDALSKARFEFRWMDQFALSLDPMTAMSFHDESLPSEGAKVAHFCS 816
           AADLAKG+P AQE DDALS ARFEFRW DQFALSLDP TA  FHDE+LP+E AK AHFCS
Sbjct: 434 AADLAKGYPRAQERDDALSTARFEFRWNDQFALSLDPPTAREFHDETLPAEPAKTAHFCS 493

Query: 815 MCGPKFCSMKITEDVRKYAEKHGYGTAEEALLRGMDAMSAEFPSAQKTVSGEQHGEAGGE 636
           MCGPKFCSM+IT D+R YA KHG  T EEA+  GM   SAEF             E G  
Sbjct: 494 MCGPKFCSMRITADIRVYAAKHGLDT-EEAIEMGMTEKSAEF------------AEHGNR 540

Query: 635 IYLP 624
           +YLP
Sbjct: 541 VYLP 544

>ref|NP_214937.1| thiC [Mycobacterium tuberculosis H37Rv]
           gi|3024726|sp|P96269|THIC_MYCTU Thiamine biosynthesis
           protein thiC gi|7448741|pir||E70630 thiamin biosynthesis
           protein thiC - Mycobacterium tuberculosis (strain H37RV)
           gi|1817689|emb|CAB06563.1| thiC [Mycobacterium
           tuberculosis H37Rv]
          Length = 547

 Score =  152 bits (385), Expect = 6e-36
 Identities = 80/127 (62%), Positives = 87/127 (67%), Gaps = 3/127 (2%)
 Frame = -1

Query: 995 AADLAKGHPHAQEWDDALSKARFEFRWMDQFALSLDPMTAMSFHDESLPSEGAKVAHFCS 816
           AADLAKGHP AQE DDALS ARFEFRW DQFALSLDP TA  FHDE+LP+E AK AHFCS
Sbjct: 430 AADLAKGHPRAQERDDALSTARFEFRWNDQFALSLDPDTAREFHDETLPAEPAKTAHFCS 489

Query: 815 MCGPKFCSMKITEDVRKYAEKHGYGT---AEEALLRGMDAMSAEFPSAQKTVSGEQHGEA 645
           MCGPKFCSM+IT+DVR+YA +HG  T    E  L  GM   S EF             E 
Sbjct: 490 MCGPKFCSMRITQDVREYAAEHGLETEADIEAVLAAGMAEKSREF------------AEH 537

Query: 644 GGEIYLP 624
           G  +YLP
Sbjct: 538 GNRVYLP 544

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 871,103,703
Number of Sequences: 1393205
Number of extensions: 19114149
Number of successful extensions: 45081
Number of sequences better than 10.0: 118
Number of HSP's better than 10.0 without gapping: 42996
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 45027
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 57117888189
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
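
The statistics above are linked by the Karlin-Altschul relations. As a rough sketch (not part of the BLAST output), the code below recomputes the effective database length, the effective search space, and approximate bit scores and E-values from the gapped Lambda and K. Two assumptions: the query length in amino acids is taken as 995 // 3, which happens to reproduce the reported search space, and Lambda and K are used as printed (rounded), so bit scores and E-values only approximate the reported ones.

```python
import math

# Values copied from the report above.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 124
QUERY_NT = 995  # nucleotides; BLASTX compares the translated query


def effective_db_length() -> int:
    """Database letters minus one effective HSP length per sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


def effective_search_space() -> int:
    """Effective query length (in residues, assumed 995 // 3) times effective DB length."""
    effective_query = QUERY_NT // 3 - EFFECTIVE_HSP_LENGTH
    return effective_query * effective_db_length()


def bit_score(raw_score: int) -> float:
    """Normalized score S' = (Lambda * S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score: int) -> float:
    """E = search space * 2 ** (-S')."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print(effective_db_length())     # reported: 275,931,827
    print(effective_search_space())  # reported: 57117888189
    for raw in (608, 603, 595, 393, 385):
        print(raw, round(bit_score(raw), 1), f"{expect(raw):.1e}")
```

For the top hit (raw score 608) this yields a bit score a little under 239 and an E-value close to the reported 8e-62; the small differences come from the rounded Lambda and K.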


EST assemble image


No.  Clone         Accession  Start  End
1    MFBL034c01_f  BP042957       1  487
2    GENLf033h11   BP064105     334  785
3    GENLf032a03   BP064002     346  836
4    MPDL090g12_f  AV781215     377  483
5    SPD061b05_f   BP048826     378  962
6    GENf020c02    BP059206     487  991
7    GNf100f06     BP074797     490  999
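
Since the assembly image is not reproduced here, the EST positions listed above can be summarized as a per-position depth profile. This is a minimal sketch that assumes the two numbers in each row are 1-based inclusive start and end coordinates on the assembled contig; the function name is hypothetical.

```python
# (clone, start, end) as listed in the table; coordinates assumed 1-based, inclusive.
ESTS = [
    ("MFBL034c01_f", 1, 487),
    ("GENLf033h11", 334, 785),
    ("GENLf032a03", 346, 836),
    ("MPDL090g12_f", 377, 483),
    ("SPD061b05_f", 378, 962),
    ("GENf020c02", 487, 991),
    ("GNf100f06", 490, 999),
]


def depth_profile(spans):
    """Return a list where entry i is the number of ESTs covering position i + 1."""
    last = max(end for _, _, end in spans)
    depth = [0] * (last + 1)  # index 0 is unused so positions map directly
    for _, start, end in spans:
        for pos in range(start, end + 1):
            depth[pos] += 1
    return depth[1:]


depth = depth_profile(ESTS)
print("positions covered:", sum(1 for d in depth if d > 0))
print("min depth:", min(depth), "max depth:", max(depth))
```

Under that assumption, every position from 1 to the last listed end coordinate is covered by at least one EST, with the deepest overlap in the middle of the contig.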




Lotus japonicus
Kazusa DNA Research Institute