KCC001101A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC001101A_C01 KCC001101A_c01
accgctccaacttcaagGCCAACCCGGACTACAAGGGCGTGTGCCACGTGGCGCTGGCGC
AGGAGGGCCACTGCAAGCCCGGCGAGGTGCTGTTCGGCACTGACAGCCACACCTGCAACG
CCGGCGCGTTCGGCCAGTTCGCCACGGGTGTGGGCAACACCGACGCCGGCTTCATCCTGG
GCACCGGCAAGCTGCTGATCAAGGTTCCCCCGACCATGCGCTTCATCATGGACGGCGAGA
TGCCCGAGTACCTGCTGGCCAAGGACCTCATCCTGCAGATCATTGGCGAGATCTCGGTGG
CGGGCGCCACCTACAAGGCCATGGAGTTCGCCGGCAGCGCCATTGAGGGCATGAACATGG
ACGAGCGCATGACCATCTGCAACATGGTGGTGGAGGCGGGAGGCAAGAACGGCGTGATCG
CGCCTGACCAGACCACCTTCGACTACGTCAAGGCGCGCACGCAGGAGGCTTTCGAGCCCG
TGTACAGCGACGGCGCCGCCAGCTACATTGCTGACTACAAGTGGGACGTGTCCAAGCTGG
AGCCGCTGGTGGCCGCGCCCCACTCGCCCGACAACCGCAAGACCGCGCGCGAGTGCTCCG
ACGTCAAGATCGACCGCGTGTACATCGGCAGCTGCACGGGCGGCAAGACCGAGGACTTCA
TCTCGGCCGCCAAGCTGTTTTACCGCGCCAAGCGCCAGGTCAAGGTGCCCACCTACCTGG
TGCCAGCCACCCAGAAGGTGTGGGCGGACGTGTACACCATGCCCGTGCCCGGCTGCGACG
GCAAGACCGCCGCCGAGATCTTCGAGGAGAGCGGCTGCATCACCCCGCCGCGCCCTCCTG
CGCGnCTGCCTGGCGGGCGGCGACACTCGGCGCTG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001101A_C01 KCC001101A_c01
         (875 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_567405.1| aconitase family [Arabidopsis thaliana] gi|1502...   469  e-131
pir||T06300 hypothetical protein T9E8.170 - Arabidopsis thaliana...   463  e-129
ref|ZP_00074911.1| COG0065: 3-isopropylmalate dehydratase large ...   410  e-113
ref|NP_661514.1| 3-isopropylmalate dehydratase, large subunit, p...   294  1e-78
ref|NP_632433.1| 3-isopropylmalate dehydratase [Methanosarcina m...   226  5e-58

>ref|NP_567405.1| aconitase family [Arabidopsis thaliana] gi|15027971|gb|AAK76516.1|
           unknown protein [Arabidopsis thaliana]
           gi|21436051|gb|AAM51226.1| unknown protein [Arabidopsis
           thaliana]
          Length = 509

 Score =  469 bits (1206), Expect = e-131
 Identities = 220/276 (79%), Positives = 247/276 (88%)
 Frame = +3

Query: 9   NFKANPDYKGVCHVALAQEGHCKPGEVLFGTDSHTCNAGAFGQFATGVGNTDAGFILGTG 188
           NFKANPDYKGVCHVALAQEGHC+PGEVL GTDSHTC AGAFGQFATG+GNTDAGF+LGTG
Sbjct: 168 NFKANPDYKGVCHVALAQEGHCRPGEVLLGTDSHTCTAGAFGQFATGIGNTDAGFVLGTG 227

Query: 189 KLLIKVPPTMRFIMDGEMPEYLLAKDLILQIIGEISVAGATYKAMEFAGSAIEGMNMDER 368
           K+L+KVPPTMRFI+DGEMP YL AKDLILQIIGEISVAGATYK MEF+G+ IE ++M+ER
Sbjct: 228 KILLKVPPTMRFILDGEMPSYLQAKDLILQIIGEISVAGATYKTMEFSGTTIESLSMEER 287

Query: 369 MTICNMVVEAGGKNGVIAPDQTTFDYVKARTQEAFEPVYSDGAASYIADYKWDVSKLEPL 548
           MT+CNMVVEAGGKNGVI PD TT +YV+ RT   FEPVYSDG AS++ADY++DVSKLEP+
Sbjct: 288 MTLCNMVVEAGGKNGVIPPDATTLNYVENRTSVPFEPVYSDGNASFVADYRFDVSKLEPV 347

Query: 549 VAAPHSPDNRKTARECSDVKIDRVYIGSCTGGKTEDFISAAKLFYRAKRQVKVPTYLVPA 728
           VA PHSPDNR  AREC DVKIDRVYIGSCTGGKTEDF++AAKLF+ A R+VKVPT+LVPA
Sbjct: 348 VAKPHSPDNRALARECKDVKIDRVYIGSCTGGKTEDFMAAAKLFHAAGRKVKVPTFLVPA 407

Query: 729 TQKVWADVYTMPVPGCDGKTAAEIFEESGCITPPRP 836
           TQKVW DVY +PVPG  GKT A+IFEE+GC TP  P
Sbjct: 408 TQKVWMDVYALPVPGAGGKTCAQIFEEAGCDTPASP 443

>pir||T06300 hypothetical protein T9E8.170 - Arabidopsis thaliana
           gi|4584548|emb|CAB40778.1| putative protein [Arabidopsis
           thaliana] gi|7268046|emb|CAB78385.1| putative protein
           [Arabidopsis thaliana]
          Length = 509

 Score =  463 bits (1191), Expect = e-129
 Identities = 218/276 (78%), Positives = 245/276 (87%)
 Frame = +3

Query: 9   NFKANPDYKGVCHVALAQEGHCKPGEVLFGTDSHTCNAGAFGQFATGVGNTDAGFILGTG 188
           NFKANPDYKGVCHVALAQEGHC+PGEVL GTDSHTC AGAFGQFATG+GNTDAGF+LGTG
Sbjct: 168 NFKANPDYKGVCHVALAQEGHCRPGEVLLGTDSHTCTAGAFGQFATGIGNTDAGFVLGTG 227

Query: 189 KLLIKVPPTMRFIMDGEMPEYLLAKDLILQIIGEISVAGATYKAMEFAGSAIEGMNMDER 368
           K+L+KVPPTMRFI+DGEMP YL AKDLILQIIGEISVAGATYK MEF+G+ IE ++M+ER
Sbjct: 228 KILLKVPPTMRFILDGEMPSYLQAKDLILQIIGEISVAGATYKTMEFSGTTIESLSMEER 287

Query: 369 MTICNMVVEAGGKNGVIAPDQTTFDYVKARTQEAFEPVYSDGAASYIADYKWDVSKLEPL 548
           MT+CNMVVEAGGKNGVI PD TT +YV+A     F PVYSDG AS++ADY++DVSKLEP+
Sbjct: 288 MTLCNMVVEAGGKNGVIPPDATTLNYVEACILSCFLPVYSDGNASFVADYRFDVSKLEPV 347

Query: 549 VAAPHSPDNRKTARECSDVKIDRVYIGSCTGGKTEDFISAAKLFYRAKRQVKVPTYLVPA 728
           VA PHSPDNR  AREC DVKIDRVYIGSCTGGKTEDF++AAKLF+ A R+VKVPT+LVPA
Sbjct: 348 VAKPHSPDNRALARECKDVKIDRVYIGSCTGGKTEDFMAAAKLFHAAGRKVKVPTFLVPA 407

Query: 729 TQKVWADVYTMPVPGCDGKTAAEIFEESGCITPPRP 836
           TQKVW DVY +PVPG  GKT A+IFEE+GC TP  P
Sbjct: 408 TQKVWMDVYALPVPGAGGKTCAQIFEEAGCDTPASP 443

>ref|ZP_00074911.1| COG0065: 3-isopropylmalate dehydratase large subunit [Trichodesmium
           erythraeum IMS101]
          Length = 444

 Score =  410 bits (1054), Expect = e-113
 Identities = 198/278 (71%), Positives = 231/278 (82%)
 Frame = +3

Query: 3   RSNFKANPDYKGVCHVALAQEGHCKPGEVLFGTDSHTCNAGAFGQFATGVGNTDAGFILG 182
           RSNFKANPDYKGVCHVALAQEGH +PGEVLFGTDSHTCNAGAFG+FATG+GNTDA FI G
Sbjct: 107 RSNFKANPDYKGVCHVALAQEGHTRPGEVLFGTDSHTCNAGAFGEFATGIGNTDAAFIAG 166

Query: 183 TGKLLIKVPPTMRFIMDGEMPEYLLAKDLILQIIGEISVAGATYKAMEFAGSAIEGMNMD 362
           TGKLL+KVP TMRFI++GEMP YLLAKDLILQIIG+ISV+GATY+ MEFAG  +  M M+
Sbjct: 167 TGKLLVKVPATMRFILNGEMPNYLLAKDLILQIIGDISVSGATYRTMEFAGETVGQMTME 226

Query: 363 ERMTICNMVVEAGGKNGVIAPDQTTFDYVKARTQEAFEPVYSDGAASYIADYKWDVSKLE 542
           ERMT+CNMV+EAGGKNGVIAPD+ TF+Y++ RT + FE  YSD  A + +D  +DV+KLE
Sbjct: 227 ERMTLCNMVIEAGGKNGVIAPDEMTFEYLRGRTDKPFESFYSDKNAEFYSDRSYDVTKLE 286

Query: 543 PLVAAPHSPDNRKTARECSDVKIDRVYIGSCTGGKTEDFISAAKLFYRAKRQVKVPTYLV 722
           P+VA PHSPDN++ AR C DVKIDRVYIGSCTGGKT DF+ AAKL      QVKVPTYLV
Sbjct: 287 PVVAKPHSPDNKELARNCQDVKIDRVYIGSCTGGKTSDFLHAAKLI--KDHQVKVPTYLV 344

Query: 723 PATQKVWADVYTMPVPGCDGKTAAEIFEESGCITPPRP 836
           PATQKV+ D++ +     DGKT +EIF ++GCI P  P
Sbjct: 345 PATQKVYEDLFIIK---HDGKTLSEIFLDAGCIEPAAP 379

>ref|NP_661514.1| 3-isopropylmalate dehydratase, large subunit, putative [Chlorobium
           tepidum TLS] gi|21646552|gb|AAM71856.1|
           3-isopropylmalate dehydratase, large subunit, putative
           [Chlorobium tepidum TLS]
          Length = 431

 Score =  294 bits (752), Expect = 1e-78
 Identities = 149/263 (56%), Positives = 187/263 (70%)
 Frame = +3

Query: 30  YKGVCHVALAQEGHCKPGEVLFGTDSHTCNAGAFGQFATGVGNTDAGFILGTGKLLIKVP 209
           Y+GVCHVALA+EG   PG VLFGTDSHTC +GAFG F +G+GNTDA FILGTGKL  KVP
Sbjct: 104 YRGVCHVALAEEGFNLPGTVLFGTDSHTCTSGAFGMFGSGIGNTDAAFILGTGKLWEKVP 163

Query: 210 PTMRFIMDGEMPEYLLAKDLILQIIGEISVAGATYKAMEFAGSAIEGMNMDERMTICNMV 389
            +M+F  +G+MPEYL AKDLILQI+G+I+  GATY+AMEF G A+  + +DERMT+CNM 
Sbjct: 164 DSMKFTFEGQMPEYLTAKDLILQILGDITTDGATYRAMEFDGEAVYSLPIDERMTLCNMA 223

Query: 390 VEAGGKNGVIAPDQTTFDYVKARTQEAFEPVYSDGAASYIADYKWDVSKLEPLVAAPHSP 569
           +EAGG NG+IA D  T  +VKART + +E   SD  A Y + Y+++V K+EP+VA PHSP
Sbjct: 224 IEAGGMNGIIAADAVTEAFVKARTSKPYEIFTSDPDAQYHSMYRYNVEKMEPIVAKPHSP 283

Query: 570 DNRKTARECSDVKIDRVYIGSCTGGKTEDFISAAKLFYRAKRQVKVPTYLVPATQKVWAD 749
           DNR T    +   I + YIGSCTGGK  DF  AAK+     ++V V T +VPAT  V + 
Sbjct: 284 DNRATVHSVAGTPITKSYIGSCTGGKLTDFKLAAKIL--KGKKVAVTTNIVPATVLVASQ 341

Query: 750 VYTMPVPGCDGKTAAEIFEESGC 818
           + T      DG+T   IFEE+GC
Sbjct: 342 LETEMY---DGQTLRHIFEEAGC 361

>ref|NP_632433.1| 3-isopropylmalate dehydratase [Methanosarcina mazei Goe1]
           gi|31563183|sp|Q8PZT3|LE21_METMA 3-isopropylmalate
           dehydratase large subunit 1 (Isopropylmalate isomerase
           1) (Alpha-IPM isomerase 1) (IPMI 1)
           gi|20904779|gb|AAM30105.1| 3-isopropylmalate dehydratase
           [Methanosarcina mazei Goe1]
          Length = 391

 Score =  226 bits (575), Expect = 5e-58
 Identities = 107/234 (45%), Positives = 152/234 (64%)
 Frame = +3

Query: 33  KGVCHVALAQEGHCKPGEVLFGTDSHTCNAGAFGQFATGVGNTDAGFILGTGKLLIKVPP 212
           +G+CH  L + G   PG++L G DSH+C  GAFG FATGVG TD   I  TGKL  KVP 
Sbjct: 75  EGICHQVLPENGFALPGKLLVGADSHSCTYGAFGAFATGVGATDMAEIFATGKLWFKVPE 134

Query: 213 TMRFIMDGEMPEYLLAKDLILQIIGEISVAGATYKAMEFAGSAIEGMNMDERMTICNMVV 392
           + R  ++G + +++ AKDL L +IG+  +AGATYKA+EF G AI  +++  RMT+CNM +
Sbjct: 135 SFRMTVEGSLDKHVYAKDLTLYLIGKTGIAGATYKAVEFYGQAISELSVAGRMTLCNMAI 194

Query: 393 EAGGKNGVIAPDQTTFDYVKARTQEAFEPVYSDGAASYIADYKWDVSKLEPLVAAPHSPD 572
           E G K G++ PD+ TFD++K R    +EPVYSD  ASY+ ++ +D   +EP VA PH  D
Sbjct: 195 EMGAKTGIVPPDEKTFDFLKNRAVAPYEPVYSDPDASYLKEFVYDAGDIEPQVACPHQVD 254

Query: 573 NRKTARECSDVKIDRVYIGSCTGGKTEDFISAAKLFYRAKRQVKVPTYLVPATQ 734
           N K   E     +D+V+IG+CT G+ ED   AA +     ++V V T ++PA++
Sbjct: 255 NVKPVGEVEGTHVDQVFIGTCTNGRLEDLEVAASVL--KGKKVTVRTIIIPASR 306



EST assemble image


clone accession position
1 MX249b07_r BP092054 1 510
2 LCL075b07_r AV630194 18 485
3 CM017g11_r AV388204 275 877
4 LC095e01_r AV625613 295 770
5 HC044c05_r AV635275 373 844




Chlamydomonas reinhardtii
Kazusa DNA Research Institute