KCC000213A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000213A_C01 KCC000213A_c01
gcttactacagcttaccagtTAGACTTCTCGCAGTGCTAGCCCGCTGTAACAACAACCCT
TGCTTCCCTTTCCGCTGTGCGACCCTTCTTTTCCTGAACCGTTTGCCGGTAGCGCCTGTG
GCTAGTAGCAGCAGTATGATGCAACAGACCATTTCCTCCAGGGGTGCTGGGCGGTGCACC
ACTCGTCTAGCTGGGCGGAGGGTCTCAGCACCCTCGTTGGCAATTCGCCGGGCACGCTCG
CTCGTGGTTCGCGCACAGCAGGTCCCTGAGGCTGAGGACGAGGCAGAGCCCACAACCAGC
TACTCGCACGCAGCGAGGCAGGCTTACGCTCGGCTCTCGGCCCCGCCCAAAGTGGGCGCC
GAGTATGGCGAGGGCTTCATGCAGTTCCGGCTGGGTGGTGAGCCGCGGCGGCTGGACGTG
GCGGCCCTGAACGAGTCGTTGAAGGCCGGCGGCGCGTTGCGGCTGCGCTTCCACAACCGG
CCTGACGAGGCGTACGGCTGCGTGTTTGACTTTGACTCCATCATCGCCAACACTCACGGC
GCCTACGTGAGCGCCTGGCGGAAGCTGGCGGAAGCTAGGGGGCTGCCGCTGCCCAGGCAC
GCCCGCCTGTCCATGCACGCCACCGCGCCGGAGCGGATCATCATGGACGTCCTGGGCTGG
ACCAGCAGCATGAAGGAGGCGCGCGCGCTGGCCTTTGAGCTGGCGGAGACGTACGCACAG
GAGCTGGCGGCGGGGCCGGCCATGGCCACACCCCTGCCCGGCGTGCGCGAGTGGCTCGAC
GCTCTGACCGCCTTCAACGTGCCTGTGGCGGCCGTGAGTGTGCTGGACCGCGGCACGGTG
CGGCGCGCGCTGGAGCGCATGCACCTGCACGACCACTTCCAAGTGCTGGTGACCGCCGAG
GACGAGCTGGAGAGCACCGCGCAGCGCTACCTGTCGGCCTGCCTGCAGCTGAACCGGCCG
CCCAACATGTGCGCCGTGTTCGGGGGCAGCCCCGAGGCCGTCACGGCCGCGCACAACTGC
ACCATGAAGGCGGTGGCTGTGCCTGTGTCGCCCGACTACCCTGCCTACAAGCTGCGGTCG
GCGGACGTTACGGTGGCGCGGCTGGACCACCTCACCGTGTACAACCTGAGGCGCCTGTTC
GCAAACAGCGGCGAGGAGTTCATGGACCTGCGCACGCAGCGCTCCGACGACAAGCCCGCG
AACCGGCGCCGTGTGGCCAGTGCCCTGCTGTGAnGGGCGCGTGTGGCAGTGCCTGCTGTG
AGGGCGCGTGTGCAGTGCTGTTTTAGGGCGCGGTTGC


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000213A_C01 KCC000213A_c01
         (1297 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM64820.1| unknown [Arabidopsis thaliana]                         212  1e-53
ref|NP_566385.1| haloacid dehalogenase-like hydrolase family [Ar...   212  1e-53
gb|AAK63981.1| AT3g10970/F9F8_21 [Arabidopsis thaliana] gi|22137...   211  2e-53
gb|AAF01524.1|AC009991_20 unknown protein [Arabidopsis thaliana]      146  7e-34
ref|NP_192894.1| haloacid dehalogenase-like hydrolase family [Ar...    85  3e-15

>gb|AAM64820.1| unknown [Arabidopsis thaliana]
          Length = 365

 Score =  212 bits (540), Expect = 1e-53
 Identities = 121/289 (41%), Positives = 168/289 (57%), Gaps = 3/289 (1%)
 Frame = +1

Query: 352  VGAEYGEGFMQFRLGGEPRRLDVAALNESLKAGGALRLRFHNRPDEAYGCVFDFDSIIAN 531
            +GAEYGEGF  FR  G P ++DV   NE L+ G   R+R+  +PDEAYG +F +D+I+A+
Sbjct: 77   IGAEYGEGFETFRQDG-PLKVDVDFWNEKLQDGFLQRIRYAMKPDEAYGLIFSWDNIVAD 135

Query: 532  THGAYVSAWRKLAEARGLPLPRHA---RLSMHATAPERIIMDVLGWTSSMKEARALAFEL 702
            T    + AW++LA   G  +       RL ++A A + ++  VL W  +  +   L   L
Sbjct: 136  TRSLKLEAWKQLAAEEGKEITEEVDIQRLMLYAGA-DHVLRKVLFWEKTQSKIDRLKLRL 194

Query: 703  AETYAQELAAGPAMATPLPGVREWLDALTAFNVPVAAVSVLDRGTVRRALERMHLHDHFQ 882
            +E Y   L     +  P  G+R+WLDA+T   +P A VS LDR  +  ALERM L  +FQ
Sbjct: 195  SEIYYDSLLK---LTEPKEGLRDWLDAVTTARIPCAVVSNLDRKNMINALERMGLQKYFQ 251

Query: 883  VLVTAEDELESTAQRYLSACLQLNRPPNMCAVFGGSPEAVTAAHNCTMKAVAVPVSPDYP 1062
             +V+ ED +ES A R+LSA ++L+R P+ C VF   P  +TAAHNCTM AV +  +  + 
Sbjct: 252  AVVSEEDGMESIAHRFLSAAVKLDRKPSKCVVFEDDPRGITAAHNCTMMAVGLIGA--HR 309

Query: 1063 AYKLRSADVTVARLDHLTVYNLRRLFANSGEEFMDLRTQRSDDKPANRR 1209
            AY L  AD+ V     L+V NLRRLFAN G  FMD   Q  +  P  R+
Sbjct: 310  AYDLVQADLAVGNFYELSVINLRRLFANKGSTFMDHEKQIIEKSPPKRK 358

>ref|NP_566385.1| haloacid dehalogenase-like hydrolase family [Arabidopsis thaliana]
          Length = 365

 Score =  212 bits (539), Expect = 1e-53
 Identities = 120/289 (41%), Positives = 168/289 (57%), Gaps = 3/289 (1%)
 Frame = +1

Query: 352  VGAEYGEGFMQFRLGGEPRRLDVAALNESLKAGGALRLRFHNRPDEAYGCVFDFDSIIAN 531
            +GAEYGEGF  FR  G P ++DV   NE L+ G   R+R+  +PDEAYG +F +D+++A+
Sbjct: 77   IGAEYGEGFETFRQDG-PLKVDVDFWNEKLQDGFLQRIRYAMKPDEAYGLIFSWDNVVAD 135

Query: 532  THGAYVSAWRKLAEARGLPLPRHA---RLSMHATAPERIIMDVLGWTSSMKEARALAFEL 702
            T    + AW++LA   G  +       RL ++A A + ++  VL W  +  +   L   L
Sbjct: 136  TRSLKLEAWKQLAAEEGKEITEEVDIQRLMLYAGA-DHVLRKVLFWEKTQSKIDRLKLRL 194

Query: 703  AETYAQELAAGPAMATPLPGVREWLDALTAFNVPVAAVSVLDRGTVRRALERMHLHDHFQ 882
            +E Y   L     +  P  G+R+WLDA+T   +P A VS LDR  +  ALERM L  +FQ
Sbjct: 195  SEIYYDSLLK---LTEPKEGLRDWLDAVTTARIPCAVVSNLDRKNMINALERMGLQKYFQ 251

Query: 883  VLVTAEDELESTAQRYLSACLQLNRPPNMCAVFGGSPEAVTAAHNCTMKAVAVPVSPDYP 1062
             +V+ ED +ES A R+LSA ++L+R P+ C VF   P  +TAAHNCTM AV +  +  + 
Sbjct: 252  AVVSEEDGMESIAHRFLSAAVKLDRKPSKCVVFEDDPRGITAAHNCTMMAVGLIGA--HR 309

Query: 1063 AYKLRSADVTVARLDHLTVYNLRRLFANSGEEFMDLRTQRSDDKPANRR 1209
            AY L  AD+ V     L+V NLRRLFAN G  FMD   Q  +  P  R+
Sbjct: 310  AYDLVQADLAVGNFYELSVINLRRLFANKGSTFMDHEKQIIEKSPPKRK 358

>gb|AAK63981.1| AT3g10970/F9F8_21 [Arabidopsis thaliana] gi|22137294|gb|AAM91492.1|
            AT3g10970/F9F8_21 [Arabidopsis thaliana]
          Length = 365

 Score =  211 bits (538), Expect = 2e-53
 Identities = 120/289 (41%), Positives = 168/289 (57%), Gaps = 3/289 (1%)
 Frame = +1

Query: 352  VGAEYGEGFMQFRLGGEPRRLDVAALNESLKAGGALRLRFHNRPDEAYGCVFDFDSIIAN 531
            +GAEYGEGF  FR  G P ++DV   NE L+ G   R+R+  +PDEAYG +F +D+++A+
Sbjct: 77   IGAEYGEGFETFRQDG-PLKVDVDFWNEKLQDGFLQRIRYAMKPDEAYGLIFSWDNVVAD 135

Query: 532  THGAYVSAWRKLAEARGLPLPRHA---RLSMHATAPERIIMDVLGWTSSMKEARALAFEL 702
            T G  + AW++LA   G  +       RL ++A A + ++  VL W  +  +   L   L
Sbjct: 136  TRGLKLEAWKQLAAEEGKEITEEVDIQRLMLYAGA-DHVLRKVLFWEKTQSKIDRLKLRL 194

Query: 703  AETYAQELAAGPAMATPLPGVREWLDALTAFNVPVAAVSVLDRGTVRRALERMHLHDHFQ 882
            +E Y   L     +  P  G+R+WLDA+T   +P A VS LDR  +  ALERM L  +FQ
Sbjct: 195  SEIYYDSLLK---LTEPKEGLRDWLDAVTTARIPCAVVSNLDRKNMINALERMGLQKYFQ 251

Query: 883  VLVTAEDELESTAQRYLSACLQLNRPPNMCAVFGGSPEAVTAAHNCTMKAVAVPVSPDYP 1062
             +V+  D +ES A R+LSA ++L+R P+ C VF   P  +TAAHNCTM AV +  +  + 
Sbjct: 252  AVVSEGDGMESIAHRFLSAAVKLDRKPSKCVVFEDDPRGITAAHNCTMMAVGLIGA--HR 309

Query: 1063 AYKLRSADVTVARLDHLTVYNLRRLFANSGEEFMDLRTQRSDDKPANRR 1209
            AY L  AD+ V     L+V NLRRLFAN G  FMD   Q  +  P  R+
Sbjct: 310  AYDLVQADLAVGNFYELSVINLRRLFANKGSTFMDHEKQIIEKSPPKRK 358

>gb|AAF01524.1|AC009991_20 unknown protein [Arabidopsis thaliana]
          Length = 201

 Score =  146 bits (369), Expect = 7e-34
 Identities = 81/193 (41%), Positives = 110/193 (56%)
 Frame = +1

Query: 631  ERIIMDVLGWTSSMKEARALAFELAETYAQELAAGPAMATPLPGVREWLDALTAFNVPVA 810
            + ++  VL W  +  +   L   L+E Y   L     +  P  G+R+WLDA+T   +P A
Sbjct: 7    DHVLRKVLFWEKTQSKIDRLKLRLSEIYYDSLLK---LTEPKEGLRDWLDAVTTARIPCA 63

Query: 811  AVSVLDRGTVRRALERMHLHDHFQVLVTAEDELESTAQRYLSACLQLNRPPNMCAVFGGS 990
             VS LDR  +  ALERM L  +FQ +V+ ED +ES A R+LSA ++L+R P+ C VF   
Sbjct: 64   VVSNLDRKNMINALERMGLQKYFQAMVSEEDGMESIAHRFLSAAVKLDRKPSKCVVFEDD 123

Query: 991  PEAVTAAHNCTMKAVAVPVSPDYPAYKLRSADVTVARLDHLTVYNLRRLFANSGEEFMDL 1170
            P  +TAAHNCTM AV +  +  + AY L  AD+ V     L+V NLRRLFAN G  FMD 
Sbjct: 124  PRGITAAHNCTMMAVGLIGA--HRAYDLVQADLAVGNFYELSVINLRRLFANKGSTFMDH 181

Query: 1171 RTQRSDDKPANRR 1209
              Q  +  P  R+
Sbjct: 182  EKQIIEKSPPKRK 194

>ref|NP_192894.1| haloacid dehalogenase-like hydrolase family [Arabidopsis thaliana]
            gi|30681816|ref|NP_849359.1| haloacid dehalogenase-like
            hydrolase family [Arabidopsis thaliana]
            gi|7486116|pir||T10577 hypothetical protein F25E4.190 -
            Arabidopsis thaliana gi|7267857|emb|CAB78200.1| putative
            protein [Arabidopsis thaliana] gi|7321054|emb|CAB82162.1|
            putative protein [Arabidopsis thaliana]
            gi|16648789|gb|AAL25585.1| AT4g11570/F25E4_190
            [Arabidopsis thaliana] gi|20466125|gb|AAM19984.1|
            AT4g11570/F25E4_190 [Arabidopsis thaliana]
            gi|24030331|gb|AAN41332.1| unknown protein [Arabidopsis
            thaliana]
          Length = 373

 Score = 84.7 bits (208), Expect = 3e-15
 Identities = 65/246 (26%), Positives = 118/246 (47%), Gaps = 5/246 (2%)
 Frame = +1

Query: 496  GCVFDFDSIIANTHGAYVS-AWRKLAEARGL-PLPRHARLSMHATAPERIIMDVLGWTSS 669
            G +F+++ ++   +    + +W  LA+  G  P P      +     E+ I +VL W+  
Sbjct: 129  GAIFEWEGVLIEDNPDLDNQSWLTLAQEEGKSPPPAFMLRRVEGMKNEQAISEVLCWSRD 188

Query: 670  MKEARALAFELAETYAQELAAGPAMATPLPGVREWLDALTAFNVPVAAVSVLDRGTVRRA 849
              + R +A    E +    A    +     G +E+++ L    +P+A VS   R T+  A
Sbjct: 189  PVQVRRMAKRKEEIFK---ALHGGVYRLRDGSQEFVNVLMNNKIPMALVSTRPRETLENA 245

Query: 850  LERMHLHDHFQVLVTAEDELESTA--QRYLSACLQLNRPPNMCAVFGGSPEAVTAAHNCT 1023
            +  + +   F V+V +ED        + ++ A   L+  P  C VFG S + + AAH+  
Sbjct: 246  VGSIGIRKFFSVIVASEDVYRGKPDPEMFIYAAQLLDFIPERCIVFGNSNQTIEAAHDGR 305

Query: 1024 MKAVAVPVSPDYPAYKLRSADVTVARLDHLTVYNLRRLFANSGEEF-MDLRTQRSDDKPA 1200
            MK VA  V+  +P Y+L +A++ V RLD L++ +L++L      EF  +L  ++ D++  
Sbjct: 306  MKCVA--VASKHPIYELGAAELVVRRLDELSIIDLKKLADTDLTEFEPELEMEKEDEREL 363

Query: 1201 NRRRVA 1218
                VA
Sbjct: 364  PSSAVA 369



EST assemble image


clone accession position
1 LCL055c11_r AV629254 1 464
2 MXL003e05_r BP093110 21 366
3 CL21h07_r AV394349 24 555
4 CL12e07_r AV393808 26 556
5 CL10h02_r AV393662 31 527
6 LCL025h07_r AV627380 155 358
7 CL44a03_r AV395572 224 655
8 LCL060f09_r AV629544 259 673
9 CL47e03_r AV395687 360 862
10 CM028h07_r AV387696 729 1307




Chlamydomonas reinhardtii
Kazusa DNA Research Institute