KCC000834A_c01

Fasta Sequence
>KCC000834A_C01 KCC000834A_c01
gctctattggcttgcaatgtagccaccgagcagttttCAGTGGTCTATTACTTATTGCTG
CACATACTATCATCCAGATAGCTAGCACGCGCTTAATATCGCAAGTGTCGACAGCAGTGC
GTTAACTTCCTCTGCAAACAGTCGCTCTATATAACCGCCGTCCGCTTTACCTCCTAGCAG
AGCTATCACGGCACACCACTTACAACTGGCTTTGAGCTGCCGGAGCCGAACCGAAAATGT
CGCGACTTGCGCACACCCAGGTTATTGCTGCCCCGGCACGACAAAGCTGCGGCTCAAGTC
AAGTTATGACTTCATCGAAACGCCCTAGCCTCGTTTCGGCACGGCCACGAAAACGTGTTC
AGTGTGCTGACGCGCTTGCGAGTCCGCCGAGCCTCGGTGGCCGCGTCATCGCGTGCGAGA
GCGTAAGCGCTGCAGCAAGTGGATTCACCTCTACTGGCATCTCAGCTGCCCGCACAGCTT
TGCGGCGTACCAACATCCCGCCGAGCATGGCATCGCCAAGATGCCCATTGAGCTCCGGCG
CTGGATTGCCGAGGAGTGGTGGTACCAGGGCCTGCAGCGCCTGGAGAGCGTCCTGCACGA
GCCGGGCGAGCAGCGAAGCCTGGATAACTGAGGACCACGAGCACCACGGCCACGACCACG
ACCGGTTCGCCTCTACCCTACCCTGGCCGCCCGCCGACGCCGCCAGCTACCTCCCCCCCG
CCGTCCTGGCCGCATGCGTGCCCAGCTCCTTTGACGACTTCTGCCGCCGCATGTCTGCAA
GCGCCGCCGCCGGCCGCCTGCCCGCCGTGCTGCTGGTATCAGCGGCCAGCTGGGCCCACC
TGCGCCCGCAGATCGAGCCC
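
The record above is plain FASTA, so it can be read with a few lines of code. The following is a minimal sketch, not part of the original record: the function names `parse_fasta` and `gc_fraction` and the toy record are hypothetical, and folding the mixed-case sequence to upper case before counting is an assumption. Applied to the entry above, the parsed sequence should contain 860 residues, matching the "(860 letters)" reported in the BLASTX query header.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict mapping header -> sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between or after records
        if line.startswith(">"):
            header = line[1:]  # drop the leading '>'
            records[header] = []
        elif header is not None:
            records[header].append(line)  # sequence lines are concatenated later
    return {h: "".join(parts) for h, parts in records.items()}


def gc_fraction(seq):
    """Fraction of G and C residues; case is ignored (an assumption)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy record for illustration only; it is not taken from the entry above.
example = ">toy_record hypothetical\nacgtGGCC\nAATT\n"
records = parse_fasta(example)
for header, seq in records.items():
    print(header, len(seq), gc_fraction(seq))
```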


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000834A_C01 KCC000834A_c01
         (860 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

sp|P08124|CC01_CAEEL Cuticle collagen 1 precursor (Squat protein...    56  9e-07
ref|NP_506095.1| COLlagen structural gene, SQuaT SQT-3, DumPY : ...    56  9e-07
ref|NP_767161.1| blr0521 [Bradyrhizobium japonicum] gi|27348769|...    53  6e-06
ref|NP_862423.1| proline-rich extensin-like protein [Micrococcus...    50  5e-05
ref|NP_493977.1| putative secreted or extracellular protein fami...    50  6e-05

>sp|P08124|CC01_CAEEL Cuticle collagen 1 precursor (Squat protein 3) gi|84425|pir||A31219
           collagen 1 - Caenorhabditis elegans
           gi|6678|emb|CAA23463.1| unnamed protein product
           [Caenorhabditis elegans] gi|156258|gb|AAA27988.1|
           collagen
          Length = 296

 Score = 55.8 bits (133), Expect = 9e-07
 Identities = 29/86 (33%), Positives = 39/86 (44%)
 Frame = +1

Query: 493 TSRRAWHRQDAH*APALDCRGVVVPGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSP 672
           T+R+A+   + + AP L C G  +PGP  P   P + G+  +PG     +     T G P
Sbjct: 75  TTRQAYGGPEVNPAPNLQCEGCCLPGPPGPAGAPGKPGKPGRPG-----APGTPGTPGKP 129

Query: 673 LPYPGRPPTPPATSPPPSWPHACPAP 750
              P  P TPP   P P  P   P P
Sbjct: 130 PVAPCEPTTPPPCKPCPQGPPGPPGP 155

>ref|NP_506095.1| COLlagen structural gene, SQuaT SQT-3, DumPY : shorter than
           wild-type DPY-15 (29.2 kD) (sqt-3) [Caenorhabditis
           elegans] gi|7499772|pir||T21314 hypothetical protein
           F23H12.4 - Caenorhabditis elegans
           gi|3876307|emb|CAA98942.1| C. elegans SQT-3 protein
           (corresponding sequence F23H12.4) [Caenorhabditis
           elegans]
          Length = 301

 Score = 55.8 bits (133), Expect = 9e-07
 Identities = 29/86 (33%), Positives = 39/86 (44%)
 Frame = +1

Query: 493 TSRRAWHRQDAH*APALDCRGVVVPGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSP 672
           T+R+A+   + + AP L C G  +PGP  P   P + G+  +PG     +     T G P
Sbjct: 80  TTRQAYGGPEVNPAPNLQCEGCCLPGPPGPAGAPGKPGKPGRPG-----APGTPGTPGKP 134

Query: 673 LPYPGRPPTPPATSPPPSWPHACPAP 750
              P  P TPP   P P  P   P P
Sbjct: 135 PVAPCEPTTPPPCKPCPQGPPGPPGP 160

>ref|NP_767161.1| blr0521 [Bradyrhizobium japonicum] gi|27348769|dbj|BAC45786.1|
           blr0521 [Bradyrhizobium japonicum USDA 110]
          Length = 745

 Score = 53.1 bits (126), Expect = 6e-06
 Identities = 40/104 (38%), Positives = 45/104 (42%), Gaps = 22/104 (21%)
 Frame = +1

Query: 559 VVPGPAAPGERPARAGRAAKPG*LRT---TSTTATTTTGSPLPYPGRPP----------- 696
           V P PAAP  RP     AA P    T   T+T A T T +P   PG PP           
Sbjct: 200 VAPPPAAPTARPGSPAPAATPAPTPTPAPTATPAPTATPAPGSTPGAPPAGRPGAPPPGV 259

Query: 697 ---TPPATSPPPSWPHACPAPLTTSAAACLQAPP-----PAACP 804
              +PPA   PP+ P A PAP TT A      PP     PA+ P
Sbjct: 260 RPGSPPAAGSPPA-PGATPAPTTTPAPGGTATPPSGRPGPASTP 302

 Score = 48.5 bits (114), Expect = 1e-04
 Identities = 33/94 (35%), Positives = 35/94 (37%)
 Frame = +1

Query: 565 PGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPATSPPPSWPHACP 744
           P PA P  RPA    A  P         A      P P P   PTP    PPP+ P A P
Sbjct: 137 PPPAPPAARPAPTPPAPPPA-------AAPQHAPPPPPPPAARPTPTPPPPPPAGPAARP 189

Query: 745 APLTTSAAACLQAPPPAACPPCCWYQRPAGPTCA 846
            P  T+       P P A PP     RP  P  A
Sbjct: 190 TPAPTAT------PTPVAPPPAAPTARPGSPAPA 217

 Score = 42.4 bits (98), Expect = 0.010
 Identities = 27/92 (29%), Positives = 33/92 (35%), Gaps = 1/92 (1%)
 Frame = +1

Query: 565 PGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPL-PYPGRPPTPPATSPPPSWPHAC 741
           P P  P  RP        P       T A T T +P+ P P  P   P +  P + P   
Sbjct: 164 PPPPPPAARPTPTPPPPPPAGPAARPTPAPTATPTPVAPPPAAPTARPGSPAPAATPAPT 223

Query: 742 PAPLTTSAAACLQAPPPAACPPCCWYQRPAGP 837
           P P  T+  A    P P + P      RP  P
Sbjct: 224 PTPAPTATPAPTATPAPGSTPGAPPAGRPGAP 255

 Score = 42.4 bits (98), Expect = 0.010
 Identities = 40/100 (40%), Positives = 42/100 (42%), Gaps = 8/100 (8%)
 Frame = +1

Query: 565 PGPAAPGERPAR---AGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPATSP-PPSWP 732
           PG   PG RP     AG    PG     +T A TTT    P PG   TPP+  P P S P
Sbjct: 252 PGAPPPGVRPGSPPAAGSPPAPG-----ATPAPTTT----PAPGGTATPPSGRPGPASTP 302

Query: 733 ---HACPAPLTTSAAACLQAPPPAACPPCCWYQRP-AGPT 840
               A PAP  T A      PPP          RP AGPT
Sbjct: 303 APGAATPAPTATPAPGGALTPPPG---------RPGAGPT 333

 Score = 42.4 bits (98), Expect = 0.010
 Identities = 30/78 (38%), Positives = 32/78 (40%)
 Frame = +1

Query: 571 PAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPATSPPPSWPHACPAP 750
           PAAP  RPA    AA P                P   P RP  PP   PPP+   A P P
Sbjct: 61  PAAPA-RPAAPPPAAAP--------PHPPAAPPPAAAPPRPAAPPPPPPPPAARPAPPPP 111

Query: 751 LTTSAAACLQAPPPAACP 804
               AA    +PPPAA P
Sbjct: 112 PPPPAAPKQPSPPPAAAP 129

 Score = 42.0 bits (97), Expect = 0.013
 Identities = 44/127 (34%), Positives = 48/127 (37%), Gaps = 26/127 (20%)
 Frame = +1

Query: 535 PALDCRGVVVPGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPA-- 708
           PA   R    P  AAP   PA    AA P   R  +        +  P P  PP PPA  
Sbjct: 61  PAAPARPAAPPPAAAPPHPPAAPPPAAAPP--RPAAPPPPPPPPAARPAPPPPPPPPAAP 118

Query: 709 ----------------TSPPPSWPHACPAPLTTS---AAACLQAPP----PAACP-PCCW 816
                           T PPP+ P A PAP   +   AAA   APP    PAA P P   
Sbjct: 119 KQPSPPPAAAPQQHAPTPPPPAPPAARPAPTPPAPPPAAAPQHAPPPPPPPAARPTPTPP 178

Query: 817 YQRPAGP 837
              PAGP
Sbjct: 179 PPPPAGP 185

 Score = 37.0 bits (84), Expect = 0.42
 Identities = 21/51 (41%), Positives = 23/51 (44%)
 Frame = +1

Query: 685 GRPPTPPATSPPPSWPHACPAPLTTSAAACLQAPPPAACPPCCWYQRPAGP 837
           G+P  PP   PP + P A PA       A     PPAA PP     RPA P
Sbjct: 46  GKPKQPPK-GPPGAAPPAAPARPAAPPPAAAPPHPPAAPPPAAAPPRPAAP 95

 Score = 35.4 bits (80), Expect = 1.2
 Identities = 31/96 (32%), Positives = 35/96 (36%), Gaps = 3/96 (3%)
 Frame = +1

Query: 568 GPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPATSPP---PSWPHA 738
           GP    ++P +    A P         A     +P P    PP PPA  PP   P  P A
Sbjct: 43  GPDGKPKQPPKGPPGAAP-------PAAPARPAAPPP-AAAPPHPPAAPPPAAAPPRPAA 94

Query: 739 CPAPLTTSAAACLQAPPPAACPPCCWYQRPAGPTCA 846
            P P    AA    APPP   PP    Q    P  A
Sbjct: 95  PPPPPPPPAAR--PAPPPPPPPPAAPKQPSPPPAAA 128

 Score = 34.3 bits (77), Expect = 2.7
 Identities = 39/106 (36%), Positives = 41/106 (37%), Gaps = 8/106 (7%)
 Frame = +1

Query: 565 PGPAA--PGERPARAGRAAKPG*LRTTSTTATTTTGSPL-PYPGRP-----PTPPATSPP 720
           PG  A  P  RP  A   A PG   T + TAT   G  L P PGRP     P P   +PP
Sbjct: 285 PGGTATPPSGRPGPASTPA-PG-AATPAPTATPAPGGALTPPPGRPGAGPTPGPQGGTPP 342

Query: 721 PSWPHACPAPLTTSAAACLQAPPPAACPPCCWYQRPAGPTCARRSS 858
              P          AA    APP A   P     RPA P  A   S
Sbjct: 343 AGAP----------AAGTPAAPPQAGGLPA----RPAAPAGAAAPS 374

>ref|NP_862423.1| proline-rich extensin-like protein [Micrococcus sp. 28]
           gi|18025411|gb|AAK62519.1| proline-rich extensin-like
           protein [Micrococcus sp. 28]
          Length = 406

 Score = 50.1 bits (118), Expect = 5e-05
 Identities = 44/142 (30%), Positives = 57/142 (39%), Gaps = 4/142 (2%)
 Frame = +1

Query: 445 SPLLASQLPAQLCGVPTSRRAWHRQDAH*APALDCRGVVVPGPAAPGERPARAGRAAKPG 624
           +P+ A+++P     VP S  +        APA     ++   PAAP   P  A   A PG
Sbjct: 201 APVPAAEVPPA-APVPWSENSAEDAPLPAAPAAPLSALLPAAPAAPLSAPPLAEPPAAPG 259

Query: 625 *LRTTSTTATTTTGSPLPYPGRPPTPPATSPPPSWPHACPAPLTTSAAACLQAPPPAACP 804
            L      A +        P  PP   A +PPP+ P   PA     AAA    PPP A P
Sbjct: 260 SLPAPFAPAVSPEAVAPAVPLAPPAVSAAAPPPARPPPGPAAPPPPAAA---LPPPPAAP 316

Query: 805 PC----CWYQRPAGPTCARRSS 858
           P          PA P  AR ++
Sbjct: 317 PARAAPAAAPAPAPPVPARMAA 338

 Score = 33.9 bits (76), Expect = 3.5
 Identities = 35/107 (32%), Positives = 42/107 (38%), Gaps = 16/107 (14%)
 Frame = +1

Query: 565 PGPAAPGERPARA---GRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTP-PATSPPPSWP 732
           P  A P   PARA   G A     L     ++ +   +    P  PP P PA   PP+ P
Sbjct: 154 PAAARPAAVPARAPLIGDAPPAAGLTPAPESSPSLASTRSTVPSTPPAPVPAAEVPPAAP 213

Query: 733 ------HACPAPLTTSAAACLQAPPPAA------CPPCCWYQRPAGP 837
                  A  APL  + AA L A  PAA       PP    + PA P
Sbjct: 214 VPWSENSAEDAPLPAAPAAPLSALLPAAPAAPLSAPPLA--EPPAAP 258

>ref|NP_493977.1| putative secreted or extracellular protein family member precursor
           (46.3 kD) (2B757) [Caenorhabditis elegans]
           gi|13775528|gb|AAK39336.1| Hypothetical protein Y51H7C.1
           [Caenorhabditis elegans]
          Length = 448

 Score = 49.7 bits (117), Expect = 6e-05
 Identities = 27/73 (36%), Positives = 34/73 (45%), Gaps = 11/73 (15%)
 Frame = +1

Query: 631 RTTSTTATTTTGSPLPYPGRPPTPPATSPPPSWPHACPAPLTTSAAACLQ---------- 780
           RTT+TT TTTT + LP     P PPAT PP ++P   P P+ T+     Q          
Sbjct: 214 RTTTTTTTTTTTTTLPPCVYTPAPPATQPPRTYPTTTPRPIPTTKTTVPQQWTTLPPGPE 273

Query: 781 -APPPAACPPCCW 816
             P P     CC+
Sbjct: 274 PTPGPGPSGECCY 286



EST assembly


No.  Clone        Accession  Start  End
 1   MXL017d07_r  BP094045       1  376
 2   HCL056e08_r  AV642711      38  545
 3   HCL091b12_r  AV644609      49  438
 4   HCL046c09_r  AV642141      58  533
 5   CL73e07_r    AV396989      67  540
 6   HCL010c06_r  AV640095      67  310
 7   HCL057e08_r  AV642766      71  601
 8   MXL043g11_r  BP095557      73  263
 9   HCL024a01_r  AV640876      74  339
10   HCL009b04_r  AV640031      80  324
11   LCL100c10_r  AV631818      82  637
12   HCL064a10_r  AV643117     121  616
13   HCL001e09_r  AV639593     344  646
14   HCL067f11_r  AV643330     382  860
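
The clone positions above determine how many ESTs support each base of the assembled sequence. The following is a minimal sketch of that calculation, not part of the original record: it assumes the two position columns are 1-based, inclusive start and end coordinates on the 860-letter sequence, and the names `CLONES`, `CONTIG_LENGTH`, and `coverage_depth` are hypothetical.

```python
# Clone coordinates copied from the table above.
# Assumption: positions are 1-based and inclusive.
CLONES = [
    ("MXL017d07_r", 1, 376),
    ("HCL056e08_r", 38, 545),
    ("HCL091b12_r", 49, 438),
    ("HCL046c09_r", 58, 533),
    ("CL73e07_r", 67, 540),
    ("HCL010c06_r", 67, 310),
    ("HCL057e08_r", 71, 601),
    ("MXL043g11_r", 73, 263),
    ("HCL024a01_r", 74, 339),
    ("HCL009b04_r", 80, 324),
    ("LCL100c10_r", 82, 637),
    ("HCL064a10_r", 121, 616),
    ("HCL001e09_r", 344, 646),
    ("HCL067f11_r", 382, 860),
]
CONTIG_LENGTH = 860  # from "(860 letters)" in the BLASTX query header


def coverage_depth(clones, length):
    """Return a list whose element i is the number of clones covering position i + 1."""
    # Difference array: +1 where a clone starts, -1 just past where it ends.
    diff = [0] * (length + 2)
    for _name, start, end in clones:
        diff[start] += 1
        diff[end + 1] -= 1
    depth = []
    running = 0
    for pos in range(1, length + 1):
        running += diff[pos]  # running sum gives the depth at this position
        depth.append(running)
    return depth


depth = coverage_depth(CLONES, CONTIG_LENGTH)
print("min depth:", min(depth))
print("max depth:", max(depth))
```

Under these assumptions every position is covered by at least one clone, and the deepest coverage falls in the region where the first twelve clones overlap.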




Chlamydomonas reinhardtii
Kazusa DNA Research Institute