KCC000350A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000350A_C01 KCC000350A_c01
CCTTAGTCCCGCTCCTATACCCTTGTTAAGGTTTATGTCAAACAGCAAGCCCGCCAAGAA
CGGCCAGAGGAATGATACCACGTCGACCAAAACAACATCAAATGCAGTCGAGCTCCCTTC
TGGCAAGCGGATAGTCACTGGCACTTCTGACGTCGTAGCGAAGCGGGACAACGGTGGGCA
ACCTTCGGCTTCAAGCCATGGCGCGGCGAGTGGACGCGCGGAGAGCGGGCGCGGCCCCCA
GGCTGCTAGCGGCGTAGCGCCAGCCAACCCCAATCAGAGTGGCAACAGCGGCCGTGCTAA
CGGCCATGTCGATGGCGGTGCCAGGCACCGCGCTGCTGCTGCTGGGCCCCAGCCCAGTGG
CACCGGCCCCAACCACGCTGCCGGCAGGCCCGCCAGCGGCAAGCCCGGCCCCGCAGTCGC
TCCGGGTGCCAACTTCAACGGAGGCGTGCTTGGTCCACGTGTCGCAGCAGGCGCA


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000350A_C01 KCC000350A_c01
         (475 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_498814.1| COLlagen structural gene (col-91) [Caenorhabdit...    56  2e-07
ref|XP_325094.1| hypothetical protein [Neurospora crassa] gi|289...    54  7e-07
ref|NP_499982.1| COLlagen structural gene (33.8 kD) (col-103) [C...    54  1e-06
ref|NP_505486.1| COLlagen structural gene (27.2 kD) (col-147) [C...    53  2e-06
ref|NP_505484.1| COLlagen structural gene (27.4 kD) (col-146) [C...    53  2e-06

>ref|NP_498814.1| COLlagen structural gene (col-91) [Caenorhabditis elegans]
           gi|465855|sp|P34391|YLS6_CAEEL Putative cuticle collagen
           F09G8.6 gi|630599|pir||S44796 F09G8.6 protein -
           Caenorhabditis elegans gi|156286|gb|AAA28008.1|
           Hypothetical protein F09G8.6 [Caenorhabditis elegans]
          Length = 278

 Score = 55.8 bits (133), Expect = 2e-07
 Identities = 43/122 (35%), Positives = 52/122 (42%), Gaps = 6/122 (4%)
 Frame = +2

Query: 107 VELPSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAESGRGPQAASGVAPANPNQSGN 286
           ++ P+G+    G           GQP A   G A G A    GP   +G AP  P   GN
Sbjct: 132 IQCPAGEAGPAGAPGAPGPAGPDGQPGADGQGGAPGPA-GPEGPAGDAG-APGAPGAPGN 189

Query: 287 SGRA--NGHVDGGARHRAAAAGPQPSGTGPNHAAGRPAS----GKPGPAVAPGANFNGGV 448
            G+   NG    G    A A GPQ    GP  + G+P S    G PGPA APG +   G 
Sbjct: 190 DGQPGQNGQRSTGTPGAAGAPGPQ----GPVGSDGQPGSAGAPGAPGPAGAPGVDGQPGA 245

Query: 449 LG 454
            G
Sbjct: 246 NG 247

 Score = 38.1 bits (87), Expect = 0.053
 Identities = 30/78 (38%), Positives = 35/78 (44%), Gaps = 3/78 (3%)
 Frame = +2

Query: 233 GPQAASGVAPANPNQSGNSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPASGKPGP 412
           GP  A G A   P   G++G A   +DG A   A+AAG          A    A G PGP
Sbjct: 94  GPPGAPGAA-GEPGVDGDAGAAG--IDGVAIQFASAAGGACIQCPAGEAGPAGAPGAPGP 150

Query: 413 A---VAPGANFNGGVLGP 457
           A     PGA+  GG  GP
Sbjct: 151 AGPDGQPGADGQGGAPGP 168

 Score = 36.6 bits (83), Expect = 0.15
 Identities = 32/107 (29%), Positives = 42/107 (38%), Gaps = 11/107 (10%)
 Frame = +2

Query: 188 ASSHGAASGRAESGRGPQAASGVAPANPNQSGNSGRANGHVDGGARHRAAAAGP------ 349
           AS+ G A  +  +G   +A    AP  P  +G  G+      GGA   A   GP      
Sbjct: 124 ASAAGGACIQCPAG---EAGPAGAPGAPGPAGPDGQPGADGQGGAPGPAGPEGPAGDAGA 180

Query: 350 -----QPSGTGPNHAAGRPASGKPGPAVAPGANFNGGVLGPRVAAGA 475
                 P   G     G+ ++G PG A APG     G  G   +AGA
Sbjct: 181 PGAPGAPGNDGQPGQNGQRSTGTPGAAGAPGPQGPVGSDGQPGSAGA 227

>ref|XP_325094.1| hypothetical protein [Neurospora crassa] gi|28926533|gb|EAA35504.1|
            hypothetical protein [Neurospora crassa]
          Length = 1168

 Score = 54.3 bits (129), Expect = 7e-07
 Identities = 39/147 (26%), Positives = 61/147 (40%), Gaps = 4/147 (2%)
 Frame = +2

Query: 47   KPAKNGQRNDTTSTKTTSNAVELPS----GKRIVTGTSDVVAKRDNGGQPSASSHGAASG 214
            +P+ N  RN   S     ++   P+     K++  G S    K     +P  SS    S 
Sbjct: 707  QPSSNANRNRNESQAPVESSDSRPTTSAKDKQLSQGPS--APKDAYNNKPPVSSRRPPSS 764

Query: 215  RAESGRGPQAASGVAPANPNQSGNSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPA 394
            +   G  P   + VAP  P++ G     N ++ G     AA A  Q   TG +  +GRP+
Sbjct: 765  QRNGGNAPTTGNAVAPPRPSRDGRPTADNQYLSG-----AAGAPTQRPTTGGSMQSGRPS 819

Query: 395  SGKPGPAVAPGANFNGGVLGPRVAAGA 475
              +P P     AN +G +  P   +G+
Sbjct: 820  YAQPAPPEVADANVHGRIQQPSKGSGS 846

>ref|NP_499982.1| COLlagen structural gene (33.8 kD) (col-103) [Caenorhabditis
           elegans] gi|25385141|pir||E88633 protein F56B3.1
           [imported] - Caenorhabditis elegans
           gi|13559616|gb|AAK29827.1| Collagen protein 103
           [Caenorhabditis elegans]
          Length = 371

 Score = 53.5 bits (127), Expect = 1e-06
 Identities = 46/158 (29%), Positives = 62/158 (39%), Gaps = 18/158 (11%)
 Frame = +2

Query: 47  KPAKNGQRNDTTSTKTTSNAVEL-PSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAE 223
           +P  NG      +++ ++   +  P+G     G +    +  N GQP A S G   G A 
Sbjct: 162 QPGSNGGAGSNGASEGSAGGCKTCPAGPPGPPGPAGQAGRPGNDGQPGAPSFGGGVG-AP 220

Query: 224 SGRGPQAASGVAPANPNQSGNSGRANGHVDGGARHRAA---AAGPQPSGT---------- 364
              GP   +G +P  P   G  GR   +  GG+        A  P P G           
Sbjct: 221 GAPGPAGDAG-SPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAGPPGPPGNNGAPGGGYGV 279

Query: 365 ---GPNHAAGRP-ASGKPGPAVAPGANFNGGVLGPRVA 466
              GP   +GRP A G+PGP   PGA  N G  G   A
Sbjct: 280 GPPGPPGPSGRPGAPGQPGPDGQPGAPGNDGTPGTDAA 317

 Score = 43.5 bits (101), Expect = 0.001
 Identities = 34/116 (29%), Positives = 44/116 (37%), Gaps = 2/116 (1%)
 Frame = +2

Query: 116 PSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAE-SGRGPQAASGVAPANPNQSGNSG 292
           P G R   G + +       GQP ++    ++G +E S  G +      P  P  +G +G
Sbjct: 141 PPGPRGPPGQAGLDGLPGAPGQPGSNGGAGSNGASEGSAGGCKTCPAGPPGPPGPAGQAG 200

Query: 293 RANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPAS-GKPGPAVAPGANFNGGVLGP 457
           R       GA       G  P   GP   AG P   G PG    PG N  GG   P
Sbjct: 201 RPGNDGQPGAPSFGGGVG-APGAPGPAGDAGSPGQPGAPGQPGRPGKNAQGGSSRP 255

 Score = 40.8 bits (94), Expect = 0.008
 Identities = 39/102 (38%), Positives = 50/102 (48%), Gaps = 7/102 (6%)
 Frame = +2

Query: 173 GGQPSASSHGAASGRAESGRGPQAASGV-----APANPNQSGNSGRANGHVDGGARH-RA 334
           G Q S SS+    G     RGP   +G+     AP  P  +G +G +NG  +G A   + 
Sbjct: 130 GCQCSPSSNTCPPGP----RGPPGQAGLDGLPGAPGQPGSNGGAG-SNGASEGSAGGCKT 184

Query: 335 AAAGPQPSGTGPNHAAGRPAS-GKPGPAVAPGANFNGGVLGP 457
             AGP P   GP   AGRP + G+PG   AP  +F GGV  P
Sbjct: 185 CPAGP-PGPPGPAGQAGRPGNDGQPG---AP--SFGGGVGAP 220

 Score = 37.0 bits (84), Expect = 0.12
 Identities = 35/116 (30%), Positives = 39/116 (33%), Gaps = 15/116 (12%)
 Frame = +2

Query: 170 NGGQPS-ASSHGAASGRAESGRGPQAASGVAPANPNQSGNSGRANGHVDGGARHRAAAAG 346
           NGG  S  +S G+A G      GP    G A     + GN G+      GG      A G
Sbjct: 166 NGGAGSNGASEGSAGGCKTCPAGPPGPPGPA-GQAGRPGNDGQPGAPSFGGGVGAPGAPG 224

Query: 347 P-----QPSGTGPNHAAGRPASG---------KPGPAVAPGANFNGGVLGPRVAAG 472
           P      P   G     GRP             PGPA  PG   N G  G     G
Sbjct: 225 PAGDAGSPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAGPPGPPGNNGAPGGGYGVG 280

>ref|NP_505486.1| COLlagen structural gene (27.2 kD) (col-147) [Caenorhabditis
           elegans] gi|7507298|pir||T24586 hypothetical protein
           T06E4.4 - Caenorhabditis elegans
           gi|3879547|emb|CAA94788.1| C. elegans COL-147 protein
           (corresponding sequence T06E4.4) [Caenorhabditis
           elegans]
          Length = 290

 Score = 53.1 bits (126), Expect = 2e-06
 Identities = 46/148 (31%), Positives = 54/148 (36%), Gaps = 5/148 (3%)
 Frame = +2

Query: 47  KPAKNGQRNDTTSTKTTSNAVELPSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAES 226
           KP  NG     T        +  P+G     G       +   G P   + G   G A  
Sbjct: 111 KPGANGVTIGLTGGN--GPCITCPAGAPGPAGAPGAPGPQGPSGAPGQDAVGGGPGPA-- 166

Query: 227 GRGPQAASGVAPANPNQSGNSGRANGHVDGGARHR-----AAAAGPQPSGTGPNHAAGRP 391
             GPQ  +G A A P Q+G  G       GG R R     + A GPQ    GP       
Sbjct: 167 --GPQGPAGDAGA-PGQAGAPGHPGAPGQGGQRSRGTPGPSGAPGPQGPAGGPGQPGQSG 223

Query: 392 ASGKPGPAVAPGANFNGGVLGPRVAAGA 475
            +G PGPA APGA    G  G     GA
Sbjct: 224 GAGAPGPAGAPGAPGGPGNAGTPGTPGA 251

 Score = 40.0 bits (92), Expect = 0.014
 Identities = 35/110 (31%), Positives = 43/110 (38%), Gaps = 4/110 (3%)
 Frame = +2

Query: 116 PSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAESGR----GPQAASGVAPANPNQSG 283
           P+G +   G +    +    G P A   G    R   G     GPQ  +G  P  P QSG
Sbjct: 165 PAGPQGPAGDAGAPGQAGAPGHPGAPGQGGQRSRGTPGPSGAPGPQGPAG-GPGQPGQSG 223

Query: 284 NSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPASGKPGPAVAPGAN 433
            +G       G A    A  GP  +GT      G P  G PG A APG +
Sbjct: 224 GAG-----APGPAGAPGAPGGPGNAGT-----PGTP--GAPGNAGAPGGD 261

 Score = 33.5 bits (75), Expect = 1.3
 Identities = 28/71 (39%), Positives = 30/71 (41%)
 Frame = +2

Query: 260 PANPNQSGNSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPASGKPGPAVAPGANFN 439
           P  P Q G  G A GH   G   +  A G     TG N       +G PGPA APGA   
Sbjct: 91  PGPPGQPGAQGEA-GHA--GEAGKPGANGVTIGLTGGNGPCITCPAGAPGPAGAPGA--- 144

Query: 440 GGVLGPRVAAG 472
            G  GP  A G
Sbjct: 145 PGPQGPSGAPG 155

>ref|NP_505484.1| COLlagen structural gene (27.4 kD) (col-146) [Caenorhabditis
           elegans] gi|7507300|pir||T24590 hypothetical protein
           T06E4.6 - Caenorhabditis elegans
           gi|3879551|emb|CAA94792.1| C. elegans COL-146 protein
           (corresponding sequence T06E4.6) [Caenorhabditis
           elegans]
          Length = 290

 Score = 53.1 bits (126), Expect = 2e-06
 Identities = 47/148 (31%), Positives = 54/148 (35%), Gaps = 5/148 (3%)
 Frame = +2

Query: 47  KPAKNGQRNDTTSTKTTSNAVELPSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAES 226
           KP  NG     T        +  P+G     G       +   G P   + G   G A  
Sbjct: 111 KPGANGVTIGLTGGN--GPCITCPAGAPGPAGAPGAPGPQGPSGAPGQDAVGEGPGPA-- 166

Query: 227 GRGPQAASGVAPANPNQSGNSGRANGHVDGGARHR-----AAAAGPQPSGTGPNHAAGRP 391
             GPQ  +G A A P Q+G  G       GG R R     A A GPQ    GP       
Sbjct: 167 --GPQGPAGDAGA-PGQAGAPGHPGAPGQGGQRSRGTPGPAGAPGPQGPAGGPGQPGQSG 223

Query: 392 ASGKPGPAVAPGANFNGGVLGPRVAAGA 475
            +G PGPA APGA    G  G     GA
Sbjct: 224 GAGAPGPAGAPGAPGGPGQPGQDGQPGA 251

 Score = 33.5 bits (75), Expect = 1.3
 Identities = 28/71 (39%), Positives = 30/71 (41%)
 Frame = +2

Query: 260 PANPNQSGNSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPASGKPGPAVAPGANFN 439
           P  P Q G  G A GH   G   +  A G     TG N       +G PGPA APGA   
Sbjct: 91  PGPPGQPGAQGEA-GHA--GEAGKPGANGVTIGLTGGNGPCITCPAGAPGPAGAPGA--- 144

Query: 440 GGVLGPRVAAG 472
            G  GP  A G
Sbjct: 145 PGPQGPSGAPG 155



EST assemble image


clone accession position
1 CL21c05_r AV394378 1 208
2 HCL071d01_r AV643538 1 479
3 CL41c11_r AV395393 9 190
4 CL59c12_r AV397917 12 294
5 CL48h03_r AV395814 14 159
6 MXL005g02_r BP093255 20 438




Chlamydomonas reinhardtii
Kazusa DNA Research Institute