KMC000415A_c01

Fasta Sequence
>KMC000415A_C01 KMC000415A_c01
ttttaaaatcccaattttggttcaaatttaaaatccattatttttctaggtggaaacaaa
tggcccctaagttatccaccATGTTCTCCACGGCATATGAGAAGCAACTAGATCAATGTG
AGATCAACAACCACTTCTCTTGGTTCTATACAACTCATTCAAGGTTCAATATCCGCAATT
ATTCATACACAGACCTCTAATACATTCGCATAATAAAGAACTAGTCATTTAAATATACAA
CTCATTCAAGGTTCAACATCCTCAATTTATGGACTTGTGAACACTATGGTCTTTGCAAGT
TCTAGTGCTAACTTGCGTCCAAAGAGCTTCTCCACAATAACTAAAGCAAACTCGATGGAA
GTTCCTGGACCTCTGCTGGTAATAAGGTTGCCATCAACTACAACTCGGTTTTCAACTTCA
CTCTGATCTGATAATTTATTGGCACATGGCAGGAAAAGCAGTGGCCTTTTTACCCTTTAA
CAAGCCATGGGGCTCCAAGACTAAAGCTGGGGATGCACAAATTGCTCCATAATATCTGTT
TGATTCTCTTTGATTTTTCAGCAAACTCACCAAGGTTTCTGACTTTGCAAAAGCTTGGGC
ACCACCTAGTCCACCTGGCAACACAATAAGGTCATATGAAAGTTTAGCTGCTTCGTCAAG
GAGAATGTCTGCCTCCAGTTTAACTTTACGTGATGCCACAATCTCC
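
The BLASTX alignments reported for this contig lie on negative reading frames (Frame = -2 and -3), that is, on the reverse-complement strand. The following is a minimal sketch of how the frame -2 translation can be reproduced from the last 46 bases of the sequence above. It assumes the standard genetic code and the usual BLAST convention that frame -1 starts at the last base of the query, frame -2 one base further in, and so on. The function names are illustrative and are not part of any Kazusa or NCBI tool.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq.upper()))


def translate_negative_frame(seq, frame):
    """Translate seq on the reverse strand; frame must be -1, -2 or -3."""
    rc = reverse_complement(seq)
    offset = -frame - 1  # frame -1 starts at the last base of seq
    codons = (rc[i:i + 3] for i in range(offset, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record above (query positions 661-706).
TAIL = "GAGAATGTCTGCCTCCAGTTTAACTTTACGTGATGCCACAATCTCC"
print(translate_negative_frame(TAIL, -2))
```

Because the fragment ends at the 3' end of the contig, its negative frames coincide with those of the full 706-letter query. The printed peptide, EIVASRKVKLEADIL, is the start of the query line shown in the frame -2 alignments (query positions 705 down to 661).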


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000415A_C01 KMC000415A_c01
         (706 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB16486.1| P0665D10.11 [Oryza sativa (japonica cultivar-gro...   112  2e-40
ref|NP_188117.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophospha...   112  2e-39
dbj|BAA97062.1| emb|CAA17570.1~gene_id:K15M2.13~similar to unkno...   112  2e-39
ref|NP_564626.1| expressed protein; protein id: At1g53280.1, sup...   146  3e-34
gb|AAM60860.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate...   144  2e-33

>dbj|BAB16486.1| P0665D10.11 [Oryza sativa (japonica cultivar-group)]
          Length = 517

 Score =  112 bits (279), Expect(2) = 2e-40
 Identities = 55/86 (63%), Positives = 66/86 (75%)
 Frame = -2

Query: 702 IVASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYGA 523
           +    K  L ADI+++EAAK  +DLIV+PGGL GAQ  + ++ LV LLK Q ESN+ YGA
Sbjct: 376 VTRRHKFNLIADIMVEEAAKREFDLIVMPGGLPGAQKLSSTKVLVDLLKKQAESNKPYGA 435

Query: 522 ICASPALVLEPHGLLKGKKATAFPAM 445
           ICASPA VLEPHGLLKGKKAT+FP M
Sbjct: 436 ICASPAYVLEPHGLLKGKKATSFPPM 461

 Score = 76.6 bits (187), Expect(2) = 2e-40
 Identities = 37/64 (57%), Positives = 50/64 (77%)
 Frame = -3

Query: 470 KRPLLFLPCANKLSDQSEVENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAK 291
           K+   F P A+ L+DQS  ++RVVVDGNLITS+ PG++ EFAL IVEKLFGR+ A+ +AK
Sbjct: 453 KKATSFPPMAHLLTDQSACDSRVVVDGNLITSKAPGSATEFALAIVEKLFGREKAVSIAK 512

Query: 290 TIVF 279
            ++F
Sbjct: 513 ELIF 516

 Score = 65.5 bits (158), Expect(2) = 1e-18
 Identities = 35/88 (39%), Positives = 51/88 (57%)
 Frame = -2

Query: 702 IVASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYGA 523
           + A+  VKL AD  + +    ++DLI LPGG+ G+      + L  ++K Q E    Y A
Sbjct: 170 VEAAFGVKLVADGRVADLEGEAFDLIALPGGMPGSANLRDCKVLEKMVKKQAEQGGLYAA 229

Query: 522 ICASPALVLEPHGLLKGKKATAFPAMCQ 439
           ICA+PA+ L   GLLKG KAT +P+  +
Sbjct: 230 ICATPAVTLAHWGLLKGLKATCYPSFME 257

 Score = 49.7 bits (117), Expect(2) = 1e-18
 Identities = 22/41 (53%), Positives = 34/41 (82%)
 Frame = -3

Query: 416 VENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELA 294
           V +RVVVD N +TS+GP T+IE+AL +VE+L+G++ + E+A
Sbjct: 266 VNSRVVVDRNAVTSQGPATAIEYALALVEQLYGKEKSEEVA 306

>ref|NP_188117.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
           protein, putative; protein id: At3g14990.1, supported by
           cDNA: gi_11908017, supported by cDNA: gi_13194799,
           supported by cDNA: gi_14517477 [Arabidopsis thaliana]
           gi|11908018|gb|AAG41438.1|AF326856_1 putative
           4-methyl-5(b-hydroxyethyl)-thiazole monophosphate
           biosynthesis protein [Arabidopsis thaliana]
           gi|13194800|gb|AAK15562.1|AF349515_1 putative
           4-methyl-5(b-hydroxyethyl)-thiazole monophosphate
           biosynthesis protein [Arabidopsis thaliana]
           gi|14517478|gb|AAK62629.1| AT3g14990/K15M2_13
           [Arabidopsis thaliana] gi|22136580|gb|AAM91076.1|
           AT3g14990/K15M2_13 [Arabidopsis thaliana]
          Length = 392

 Score =  112 bits (281), Expect(2) = 2e-39
 Identities = 55/87 (63%), Positives = 67/87 (76%)
 Frame = -2

Query: 705 EIVASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYG 526
           E+  SRK KL A++LLDE A+ S+DLIVLPGGL GAQ FA  E LV++L+ Q E+N+ YG
Sbjct: 251 EVEGSRKAKLVAEVLLDEVAEKSFDLIVLPGGLNGAQRFASCEKLVNMLRKQAEANKPYG 310

Query: 525 AICASPALVLEPHGLLKGKKATAFPAM 445
            ICASPA V EP+GLLKGKKAT  P +
Sbjct: 311 GICASPAYVFEPNGLLKGKKATTHPVV 337

 Score = 72.0 bits (175), Expect(2) = 3e-22
 Identities = 36/86 (41%), Positives = 50/86 (57%)
 Frame = -2

Query: 696 ASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYGAIC 517
           A   +K+ AD LL +     +DLIVLPGGL G +     ++L +++K Q    R   AIC
Sbjct: 48  ACHGIKMVADTLLSDITDSVFDLIVLPGGLPGGETLKNCKSLENMVKKQDSDGRLNAAIC 107

Query: 516 ASPALVLEPHGLLKGKKATAFPAMCQ 439
            +PAL L   GLL+GKKAT +P   +
Sbjct: 108 CAPALALGTWGLLEGKKATGYPVFME 133

 Score = 72.0 bits (175), Expect(2) = 2e-39
 Identities = 33/51 (64%), Positives = 45/51 (87%)
 Frame = -3

Query: 443 ANKLSDQSEVENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAK 291
           ++KLSD+S +E+RVVVDGN+ITSR PGT++EF+L IVEK +GR+ AL+L K
Sbjct: 338 SDKLSDKSHIEHRVVVDGNVITSRAPGTAMEFSLAIVEKFYGREKALQLGK 388

 Score = 55.1 bits (131), Expect(2) = 3e-22
 Identities = 22/45 (48%), Positives = 38/45 (83%)
 Frame = -3

Query: 416 VENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAKTIV 282
           VE+RV +DG ++TSRGPGT+IEF++ ++E+LFG++ A E++  ++
Sbjct: 143 VESRVQIDGRIVTSRGPGTTIEFSITLIEQLFGKEKADEVSSILL 187

>dbj|BAA97062.1| emb|CAA17570.1~gene_id:K15M2.13~similar to unknown protein
           [Arabidopsis thaliana]
          Length = 369

 Score =  112 bits (281), Expect(2) = 2e-39
 Identities = 55/87 (63%), Positives = 67/87 (76%)
 Frame = -2

Query: 705 EIVASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYG 526
           E+  SRK KL A++LLDE A+ S+DLIVLPGGL GAQ FA  E LV++L+ Q E+N+ YG
Sbjct: 228 EVEGSRKAKLVAEVLLDEVAEKSFDLIVLPGGLNGAQRFASCEKLVNMLRKQAEANKPYG 287

Query: 525 AICASPALVLEPHGLLKGKKATAFPAM 445
            ICASPA V EP+GLLKGKKAT  P +
Sbjct: 288 GICASPAYVFEPNGLLKGKKATTHPVV 314

 Score = 72.0 bits (175), Expect(2) = 3e-22
 Identities = 36/86 (41%), Positives = 50/86 (57%)
 Frame = -2

Query: 696 ASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYGAIC 517
           A   +K+ AD LL +     +DLIVLPGGL G +     ++L +++K Q    R   AIC
Sbjct: 25  ACHGIKMVADTLLSDITDSVFDLIVLPGGLPGGETLKNCKSLENMVKKQDSDGRLNAAIC 84

Query: 516 ASPALVLEPHGLLKGKKATAFPAMCQ 439
            +PAL L   GLL+GKKAT +P   +
Sbjct: 85  CAPALALGTWGLLEGKKATGYPVFME 110

 Score = 72.0 bits (175), Expect(2) = 2e-39
 Identities = 33/51 (64%), Positives = 45/51 (87%)
 Frame = -3

Query: 443 ANKLSDQSEVENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAK 291
           ++KLSD+S +E+RVVVDGN+ITSR PGT++EF+L IVEK +GR+ AL+L K
Sbjct: 315 SDKLSDKSHIEHRVVVDGNVITSRAPGTAMEFSLAIVEKFYGREKALQLGK 365

 Score = 55.1 bits (131), Expect(2) = 3e-22
 Identities = 22/45 (48%), Positives = 38/45 (83%)
 Frame = -3

Query: 416 VENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAKTIV 282
           VE+RV +DG ++TSRGPGT+IEF++ ++E+LFG++ A E++  ++
Sbjct: 120 VESRVQIDGRIVTSRGPGTTIEFSITLIEQLFGKEKADEVSSILL 164

>ref|NP_564626.1| expressed protein; protein id: At1g53280.1, supported by cDNA:
           101735., supported by cDNA: gi_20259560 [Arabidopsis
           thaliana] gi|7769869|gb|AAF69547.1|AC008007_22 F12M16.18
           [Arabidopsis thaliana] gi|15810459|gb|AAL07117.1|
           unknown protein [Arabidopsis thaliana]
           gi|20259561|gb|AAM14123.1| unknown protein [Arabidopsis
           thaliana]
          Length = 438

 Score =  146 bits (368), Expect = 3e-34
 Identities = 73/88 (82%), Positives = 79/88 (88%)
 Frame = -2

Query: 705 EIVASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYG 526
           E+VASRKVKL AD+LLDEA K SYDLIVLPGGLGGA+AFA SE LV++LK Q ESN+ YG
Sbjct: 297 EVVASRKVKLVADVLLDEAEKNSYDLIVLPGGLGGAEAFASSEKLVNMLKKQAESNKPYG 356

Query: 525 AICASPALVLEPHGLLKGKKATAFPAMC 442
           AICASPALV EPHGLLKGKKATAFPAMC
Sbjct: 357 AICASPALVFEPHGLLKGKKATAFPAMC 384

 Score = 77.8 bits (190), Expect = 1e-13
 Identities = 37/60 (61%), Positives = 48/60 (79%)
 Frame = -3

Query: 470 KRPLLFLPCANKLSDQSEVENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAK 291
           K+   F    +KL+DQS +E+RV+VDGNLITSRGPGTS+EFAL IVEK +GR+  L+L+K
Sbjct: 375 KKATAFPAMCSKLTDQSHIEHRVLVDGNLITSRGPGTSLEFALAIVEKFYGREKGLQLSK 434

 Score = 67.8 bits (164), Expect(2) = 1e-20
 Identities = 34/86 (39%), Positives = 47/86 (54%)
 Frame = -2

Query: 696 ASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYGAIC 517
           A   +K+ AD LL +     +DLI+LPGGL G +     + L  ++K Q    R   AIC
Sbjct: 95  ACHGIKMVADTLLSDITDSVFDLIMLPGGLPGGETLKNCKPLEKMVKKQDTDGRLNAAIC 154

Query: 516 ASPALVLEPHGLLKGKKATAFPAMCQ 439
            +PAL     GLL+GKKAT +P   +
Sbjct: 155 CAPALAFGTWGLLEGKKATCYPVFME 180

 Score = 53.9 bits (128), Expect(2) = 1e-20
 Identities = 22/45 (48%), Positives = 38/45 (83%)
 Frame = -3

Query: 416 VENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAKTIV 282
           VE+RV +DG ++TSRGPGT++EF++ +VE+L G++ A+E++  +V
Sbjct: 189 VESRVEIDGKIVTSRGPGTTMEFSVTLVEQLLGKEKAVEVSGPLV 233

>gb|AAM60860.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
           protein, putative [Arabidopsis thaliana]
          Length = 438

 Score =  144 bits (362), Expect = 2e-33
 Identities = 72/88 (81%), Positives = 78/88 (87%)
 Frame = -2

Query: 705 EIVASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYG 526
           E+VASRKVKL AD+LLDEA K  YDLIVLPGGLGGA+AFA SE LV++LK Q ESN+ YG
Sbjct: 297 EVVASRKVKLVADVLLDEAEKNLYDLIVLPGGLGGAEAFASSEKLVNMLKKQAESNKPYG 356

Query: 525 AICASPALVLEPHGLLKGKKATAFPAMC 442
           AICASPALV EPHGLLKGKKATAFPAMC
Sbjct: 357 AICASPALVFEPHGLLKGKKATAFPAMC 384

 Score = 77.8 bits (190), Expect = 1e-13
 Identities = 37/60 (61%), Positives = 48/60 (79%)
 Frame = -3

Query: 470 KRPLLFLPCANKLSDQSEVENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAK 291
           K+   F    +KL+DQS +E+RV+VDGNLITSRGPGTS+EFAL IVEK +GR+  L+L+K
Sbjct: 375 KKATAFPAMCSKLTDQSHIEHRVLVDGNLITSRGPGTSLEFALAIVEKFYGREKGLQLSK 434

 Score = 67.4 bits (163), Expect(2) = 2e-20
 Identities = 34/86 (39%), Positives = 47/86 (54%)
 Frame = -2

Query: 696 ASRKVKLEADILLDEAAKLSYDLIVLPGGLGGAQAFAKSETLVSLLKNQRESNRYYGAIC 517
           A   +K+ AD LL +     +DLI+LPGGL G +     + L  ++K Q    R   AIC
Sbjct: 95  ACHGIKMVADTLLSDITDSVFDLIMLPGGLPGGETLKNCKPLERMVKKQDTDGRLNAAIC 154

Query: 516 ASPALVLEPHGLLKGKKATAFPAMCQ 439
            +PAL     GLL+GKKAT +P   +
Sbjct: 155 CAPALAFGTWGLLEGKKATCYPVFME 180

 Score = 53.9 bits (128), Expect(2) = 2e-20
 Identities = 22/45 (48%), Positives = 38/45 (83%)
 Frame = -3

Query: 416 VENRVVVDGNLITSRGPGTSIEFALVIVEKLFGRKLALELAKTIV 282
           VE+RV +DG ++TSRGPGT++EF++ +VE+L G++ A+E++  +V
Sbjct: 189 VESRVEIDGKIVTSRGPGTTMEFSVTLVEQLLGKEKAVEVSGPLV 233

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 598,716,221
Number of Sequences: 1393205
Number of extensions: 12390218
Number of successful extensions: 30618
Number of sequences better than 10.0: 171
Number of HSP's better than 10.0 without gapping: 29548
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30580
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 32091529758
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
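
The statistics block above is enough to check how the raw scores in parentheses relate to the reported bit scores and E-values. The sketch below assumes the standard Karlin-Altschul relations, bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), and uses the gapped Lambda and K as printed. Since those parameters are rounded in the report, the results agree only approximately. The formula applies to hits reported with a plain "Expect" value; the "Expect(2)" values combine two HSPs with sum statistics and are not reproduced here. Function names are illustrative.

```python
import math

# Values copied from the report footer above.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 32091529758        # "effective search space used"
DB_LETTERS = 448689247            # "length of database"
DB_SEQUENCES = 1393205            # "Number of Sequences"
EFFECTIVE_HSP_LENGTH = 120        # "effective HSP length"


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped parameters."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score):
    """Approximate E-value of a single HSP with the given raw score."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


def effective_db_length():
    """Database length minus one effective HSP length per sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


for raw in (368, 190):
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.1e}")
print(effective_db_length())
```

For raw scores 368 and 190 this gives roughly 146.4 and 77.8 bits, with E-values near 3e-34 and 1e-13, in line with the NP_564626.1 hit. The last line reproduces the reported effective database length of 281,504,647; dividing the effective search space by it gives 114, which is presumably the effective query length used by the program.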


EST assembly


No.  Clone          Accession  Start  End
1    GENf013a12     BP058881   1      434
2    MR045f04_f     BP079495   110    477
3    GNf095c11      BP074397   119    346
4    GENf085f04     BP061965   120    499
5    MF013g05_f     BP028939   120    611
6    MFB023b11_f    BP035641   120    593
7    SPD062f05_f    BP048958   120    590
8    MF062g12_f     BP031603   120    611
9    MF014b01_f     BP028961   122    614
10   MFBL017f04_f   BP042119   123    616
11   GENLf015e10    BP063160   123    639
12   MFB093a05_f    BP040756   134    724
13   SPD004e04_f    BP044323   149    715
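
The clone table above can be summarised as a coverage profile. The sketch below assumes that the two numbers in each row are 1-based, inclusive start and end coordinates of the EST in the assembly layout; the page does not state this explicitly. Note that the largest end coordinate (724) exceeds the 706-letter consensus, so the coordinates may include alignment gaps. Function and variable names are illustrative.

```python
# Rows copied from the clone table: number, clone, accession, start, end.
EST_TABLE = """\
1 GENf013a12 BP058881 1 434
2 MR045f04_f BP079495 110 477
3 GNf095c11 BP074397 119 346
4 GENf085f04 BP061965 120 499
5 MF013g05_f BP028939 120 611
6 MFB023b11_f BP035641 120 593
7 SPD062f05_f BP048958 120 590
8 MF062g12_f BP031603 120 611
9 MF014b01_f BP028961 122 614
10 MFBL017f04_f BP042119 123 616
11 GENLf015e10 BP063160 123 639
12 MFB093a05_f BP040756 134 724
13 SPD004e04_f BP044323 149 715
"""


def parse_est_table(text):
    """Return a list of (clone, accession, start, end) tuples."""
    clones = []
    for line in text.splitlines():
        _number, clone, accession, start, end = line.split()
        clones.append((clone, accession, int(start), int(end)))
    return clones


def depth_at(clones, position):
    """Count the ESTs whose interval covers the given position."""
    return sum(1 for _, _, start, end in clones if start <= position <= end)


clones = parse_est_table(EST_TABLE)
first = min(start for _, _, start, _ in clones)
last = max(end for _, _, _, end in clones)
depths = {pos: depth_at(clones, pos) for pos in range(first, last + 1)}
max_depth = max(depths.values())
print(first, last, max_depth)
```

Under these assumptions the layout spans positions 1 to 724, and all 13 ESTs overlap between positions 149 and 346, while the two ends are supported by only one or two clones.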




Lotus japonicus
Kazusa DNA Research Institute