KCC002198A_c01

Fasta Sequence
>KCC002198A_C01 KCC002198A_c01
GACCACACAGCGGCGTTGCATCAAATATATAACTTCAATACTGCGTGCTTCGACCACGGT
CCAAGGCACTGCGAATGGCGCGCGCTATGCTTATAGTGATTACATGATATTTTCTAGCGC
GTCGGTGTTACGTCGTGACGCGGAGACACTAACGTGTCGACTGACTACTAATACACTATG
GTGGCGGTGCAGCTGCTTGCCTCACTGGCGCTTCTTGTAATAACGCTGCCAAGCGAAATT
GACGCCCAACTGTCGCACAATTATGCGAGTGTGTTGGGACTGAGCTATAGGTTCTATGAG
GCTCAGATGTCTGGCAACGTGCCTTCCTGGTCGCGTGCCTCCCAGGCGGCCGGCGGCTGG
CGCAACAAGAGCCACGCCCTCGATGGCACTGGCCCGGGCGGAGTGAACCTGGACCTGTCA
GGTGGCTGGTATGATGCGGGAGATCACCTGAAGCTACATCTGCCTCTGGGTGTGTCGGTG
AGCCTTCTGTCGTACGGCGCCCTCACCTTCGAGGCCGCCTACCGCGCCGCGGGCCAGTGG
GACATCGCCGTGCGCAACCTAGACTGGGCGGCCTCCTACATCGCCAAGTGCCACACACAG
GCCAGCGACACTCCCGCCTACAACAAGTTCGTGGCCCAGATTGGCGATGTGGCTACGGAC
CACAACACCTGGTGGGGCCGGCCGGAGCAGCAGCCCGAGGGCGGTGCCCAG
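
A quick consistency check on the record above: the sketch below parses the FASTA text and counts its letters, which should agree with the query length that the Nr search section reports (711 letters). The names FASTA_TEXT and read_fasta are illustrative assumptions, not part of any Kazusa or BLAST tooling.

```python
# Minimal sketch: parse a single-record FASTA string and report its length.
# FASTA_TEXT is copied from the record above; read_fasta is a hypothetical helper.
FASTA_TEXT = """\
>KCC002198A_C01 KCC002198A_c01
GACCACACAGCGGCGTTGCATCAAATATATAACTTCAATACTGCGTGCTTCGACCACGGT
CCAAGGCACTGCGAATGGCGCGCGCTATGCTTATAGTGATTACATGATATTTTCTAGCGC
GTCGGTGTTACGTCGTGACGCGGAGACACTAACGTGTCGACTGACTACTAATACACTATG
GTGGCGGTGCAGCTGCTTGCCTCACTGGCGCTTCTTGTAATAACGCTGCCAAGCGAAATT
GACGCCCAACTGTCGCACAATTATGCGAGTGTGTTGGGACTGAGCTATAGGTTCTATGAG
GCTCAGATGTCTGGCAACGTGCCTTCCTGGTCGCGTGCCTCCCAGGCGGCCGGCGGCTGG
CGCAACAAGAGCCACGCCCTCGATGGCACTGGCCCGGGCGGAGTGAACCTGGACCTGTCA
GGTGGCTGGTATGATGCGGGAGATCACCTGAAGCTACATCTGCCTCTGGGTGTGTCGGTG
AGCCTTCTGTCGTACGGCGCCCTCACCTTCGAGGCCGCCTACCGCGCCGCGGGCCAGTGG
GACATCGCCGTGCGCAACCTAGACTGGGCGGCCTCCTACATCGCCAAGTGCCACACACAG
GCCAGCGACACTCCCGCCTACAACAAGTTCGTGGCCCAGATTGGCGATGTGGCTACGGAC
CACAACACCTGGTGGGGCCGGCCGGAGCAGCAGCCCGAGGGCGGTGCCCAG
"""


def read_fasta(text):
    """Return (header, sequence) for a FASTA string holding one record."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    header = lines[0][1:]                    # drop the leading '>'
    sequence = "".join(lines[1:]).upper()    # join the wrapped sequence lines
    return header, sequence


header, sequence = read_fasta(FASTA_TEXT)
print(header.split()[0], len(sequence))
```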


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC002198A_C01 KCC002198A_c01
         (711 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAF19168.1|AF180561_1 thermophilic extracellular endocellulas...   117  2e-25
gb|AAC38572.2| endoglucanase H [Clostridium cellulovorans]            114  1e-24
gb|AAK06394.1| CelE [Caldicellulosiruptor sp. Tok7B.1]                109  5e-23
sp|P26224|GUNF_CLOTM Endoglucanase F precursor (EGF) (Endo-1,4-b...   107  2e-22
sp|P22534|GUNA_CALSA Endoglucanase A precursor (Endo-1,4-beta-gl...   106  4e-22

>gb|AAF19168.1|AF180561_1 thermophilic extracellular endocellulase [Myxobacter sp. AL-1]
          Length = 651

 Score =  117 bits (292), Expect = 2e-25
 Identities = 70/171 (40%), Positives = 90/171 (51%), Gaps = 3/171 (1%)
 Frame = +1

Query: 193 LLASLALLVITLPSEID---AQLSHNYASVLGLSYRFYEAQMSGNVPSWSRASQAAGGWR 363
           LL + A+L +    + D   A+ S NYA +L  S  FYEAQ SG +P  SR +     WR
Sbjct: 8   LLLTFAMLFMPASGKADIASAKESQNYAELLQKSILFYEAQRSGKLPESSRLN-----WR 62

Query: 364 NKSHALDGTGPGGVNLDLSGGWYDAGDHLKLHLPLGVSVSLLSYGALTFEAAYRAAGQWD 543
             S   DG   G    DL+GGWYDAGDH+K  LP+  S ++LS+    +  AY AAGQ D
Sbjct: 63  GDSALEDGKDVGH---DLTGGWYDAGDHVKFGLPMAYSAAVLSWSVYEYRDAYEAAGQLD 119

Query: 544 IAVRNLDWAASYIAKCHTQASDTPAYNKFVAQIGDVATDHNTWWGRPEQQP 696
             + N+ WA  Y  K HT   +      F  Q+G+ A DH  WWG  E  P
Sbjct: 120 AILDNIRWATDYFIKAHTDRYE------FWGQVGNGAQDH-AWWGPAEVMP 163

>gb|AAC38572.2| endoglucanase H [Clostridium cellulovorans]
          Length = 715

 Score =  114 bits (286), Expect = 1e-24
 Identities = 61/156 (39%), Positives = 87/156 (55%)
 Frame = +1

Query: 220 ITLPSEIDAQLSHNYASVLGLSYRFYEAQMSGNVPSWSRASQAAGGWRNKSHALDGTGPG 399
           + +  E  A  + NY   L  S  FYE Q SG +P+  R++     WR  S   DG+  G
Sbjct: 27  VLVKGETTATPTFNYGEALQKSIMFYEFQRSGKLPTDIRSN-----WRGDSGTKDGSDVG 81

Query: 400 GVNLDLSGGWYDAGDHLKLHLPLGVSVSLLSYGALTFEAAYRAAGQWDIAVRNLDWAASY 579
              +DL+GGWYDAGDH+K +LP+  +V++L++     +AAY  +GQ D  V+ + WA  Y
Sbjct: 82  ---VDLTGGWYDAGDHVKFNLPMSYTVAMLAWSLSEDKAAYEKSGQLDYLVKEIKWATDY 138

Query: 580 IAKCHTQASDTPAYNKFVAQIGDVATDHNTWWGRPE 687
           + KCHT      A N++  Q+GD   DH  WWG  E
Sbjct: 139 LMKCHT------APNEYYYQVGDGGADHK-WWGPAE 167

>gb|AAK06394.1| CelE [Caldicellulosiruptor sp. Tok7B.1]
          Length = 1751

 Score =  109 bits (272), Expect = 5e-23
 Identities = 64/174 (36%), Positives = 92/174 (52%), Gaps = 9/174 (5%)
 Frame = +1

Query: 178 MVAVQLLASL-ALLVITL--------PSEIDAQLSHNYASVLGLSYRFYEAQMSGNVPSW 330
           M A++ + S+ ALLV+TL        P +  A  ++NY   L  +  FYE QMSG +PSW
Sbjct: 4   MKAIKRVVSITALLVLTLSLCFPGIMPVKAYAGGTYNYGEALQKTIMFYEFQMSGKLPSW 63

Query: 331 SRASQAAGGWRNKSHALDGTGPGGVNLDLSGGWYDAGDHLKLHLPLGVSVSLLSYGALTF 510
            R       WR  S   DG   G   LDL+GGW+DAGDH+K +LP+  S S+L +    +
Sbjct: 64  VR-----NNWRGDSGLDDGKDVG---LDLTGGWHDAGDHVKFNLPMSYSASMLGWAVYEY 115

Query: 511 EAAYRAAGQWDIAVRNLDWAASYIAKCHTQASDTPAYNKFVAQIGDVATDHNTW 672
           + A+  + Q +  +  ++WA  Y  KCH      P+   +  Q+GD   DHN W
Sbjct: 116 KDAFVKSKQLEHILNQIEWANDYFVKCH------PSKYVYYYQVGDPTVDHNFW 163

>sp|P26224|GUNF_CLOTM Endoglucanase F precursor (EGF) (Endo-1,4-beta-glucanase)
           (Cellulase F) gi|98643|pir||S15727 cellulase (EC
           3.2.1.4) F precursor - Clostridium thermocellum
           gi|581006|emb|CAA43035.1| cellulase [Clostridium
           thermocellum]
          Length = 739

 Score =  107 bits (266), Expect = 2e-22
 Identities = 61/170 (35%), Positives = 85/170 (49%)
 Frame = +1

Query: 178 MVAVQLLASLALLVITLPSEIDAQLSHNYASVLGLSYRFYEAQMSGNVPSWSRASQAAGG 357
           ++A  L  +L  +V    + +      NY   L  +  FYE Q SG +P   R       
Sbjct: 4   ILAFLLTVALVAVVAIPQAVVSFAADFNYGEALQKAIMFYEFQRSGKLPENKR-----NN 58

Query: 358 WRNKSHALDGTGPGGVNLDLSGGWYDAGDHLKLHLPLGVSVSLLSYGALTFEAAYRAAGQ 537
           WR  S   DG   G   LDL+GGWYDAGDH+K +LP+  +V++L++       AY  +GQ
Sbjct: 59  WRGDSALNDGADNG---LDLTGGWYDAGDHVKFNLPMAYAVTMLAWSVYESRDAYVQSGQ 115

Query: 538 WDIAVRNLDWAASYIAKCHTQASDTPAYNKFVAQIGDVATDHNTWWGRPE 687
               + N+ WA  Y  KCH      P+ N +  Q+GD A DH +WWG  E
Sbjct: 116 LPYILDNIKWATDYFIKCH------PSPNVYYYQVGDGALDH-SWWGPAE 158

>sp|P22534|GUNA_CALSA Endoglucanase A precursor (Endo-1,4-beta-glucanase A) (Cellulase A)
           gi|7462025|pir||T17120 cellulase (EC 3.2.1.-) precursor,
           thermoactive - Caldocellum saccharolyticum
           gi|537500|gb|AAA91086.1| cellulase
          Length = 1742

 Score =  106 bits (264), Expect = 4e-22
 Identities = 55/145 (37%), Positives = 78/145 (52%)
 Frame = +1

Query: 253 SHNYASVLGLSYRFYEAQMSGNVPSWSRASQAAGGWRNKSHALDGTGPGGVNLDLSGGWY 432
           S NY   L  +  FYE QMSG +P+W R       WR  S   DG   G   LDL+GGW+
Sbjct: 25  SFNYGEALQKAIMFYEFQMSGKLPNWVR-----NNWRGDSALKDGQDNG---LDLTGGWF 76

Query: 433 DAGDHLKLHLPLGVSVSLLSYGALTFEAAYRAAGQWDIAVRNLDWAASYIAKCHTQASDT 612
           DAGDH+K +LP+  + ++LS+ A  ++ A+  +GQ +  +  ++W   Y  KCH      
Sbjct: 77  DAGDHVKFNLPMSYTGTMLSWAAYEYKDAFVKSGQLEHILNQIEWVNDYFVKCH------ 130

Query: 613 PAYNKFVAQIGDVATDHNTWWGRPE 687
           P+   +  Q+GD   DH  WWG  E
Sbjct: 131 PSKYVYYYQVGDGGKDH-AWWGPAE 154
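
The query coordinates in the alignments above are nucleotide positions, and "Frame = +1" means the protein query is read in codons starting at nucleotide 1. As a hedged illustration, the sketch below translates nucleotides 178 to 240 of the contig (copied from the FASTA record) with the standard genetic code and reproduces the start of the query line that begins at position 178. The names CODON_TABLE, translate and REGION_178_240 are assumptions made for this example; BLASTX itself is not being reimplemented here.

```python
# Minimal sketch: translate a contig slice in the forward frame.
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3          # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def forward_frame(start):
    """Forward reading frame (1, 2 or 3) of a 1-based nucleotide position."""
    return (start - 1) % 3 + 1


# Nucleotides 178-240 of the contig, copied from the FASTA record.
REGION_178_240 = (
    "ATG"
    "GTGGCGGTGCAGCTGCTTGCCTCACTGGCGCTTCTTGTAATAACGCTGCCAAGCGAAATT"
)

peptide = translate(REGION_178_240)
print(forward_frame(178), peptide)
```

The printed peptide can be compared by eye with the ungapped start of the "Query: 178" lines in the alignments.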



EST assemble image


no.  clone        accession  start  end
1    MXL022h05_r  BP094426       1  484
2    MXL040h05_r  BP095377       1  335
3    HCL002a08_r  AV639623       3  484
4    MXL044a04_r  BP095569       9  412
5    HCL047d07_r  AV642205     146  322
6    MXL018f06_r  BP094137     243  491
7    MXL100h08_r  BP098890     288  711
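
Since the assembly image itself is not reproduced here, the clone positions listed above can stand in for it. The sketch below, under the assumption that each pair of numbers is a 1-based inclusive start and end on the contig, merges the clone intervals and reports how many clones cover a given position. CLONES, merged_spans and depth are illustrative names only.

```python
# Minimal sketch: summarize EST coverage of the contig from the clone listing.
# Assumes positions are 1-based, inclusive contig coordinates.
CLONES = [
    ("MXL022h05_r", "BP094426", 1, 484),
    ("MXL040h05_r", "BP095377", 1, 335),
    ("HCL002a08_r", "AV639623", 3, 484),
    ("MXL044a04_r", "BP095569", 9, 412),
    ("HCL047d07_r", "AV642205", 146, 322),
    ("MXL018f06_r", "BP094137", 243, 491),
    ("MXL100h08_r", "BP098890", 288, 711),
]


def merged_spans(clones):
    """Merge overlapping or adjacent (start, end) intervals into contiguous spans."""
    spans = []
    for start, end in sorted((c[2], c[3]) for c in clones):
        if spans and start <= spans[-1][1] + 1:
            spans[-1][1] = max(spans[-1][1], end)   # extend the current span
        else:
            spans.append([start, end])              # start a new span
    return [tuple(s) for s in spans]


def depth(clones, position):
    """Number of clones whose interval contains the given contig position."""
    return sum(1 for c in clones if c[2] <= position <= c[3])


print(merged_spans(CLONES))
print(depth(CLONES, 300), depth(CLONES, 600))
```

A single merged span running from 1 to 711 would indicate that the seven clones together cover the whole contig without gaps.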




Chlamydomonas reinhardtii
Kazusa DNA Research Institute