KCC000834A_c01
Fasta Sequence
>KCC000834A_C01 KCC000834A_c01
gctctattggcttgcaatgtagccaccgagcagttttCAGTGGTCTATTACTTATTGCTG
CACATACTATCATCCAGATAGCTAGCACGCGCTTAATATCGCAAGTGTCGACAGCAGTGC
GTTAACTTCCTCTGCAAACAGTCGCTCTATATAACCGCCGTCCGCTTTACCTCCTAGCAG
AGCTATCACGGCACACCACTTACAACTGGCTTTGAGCTGCCGGAGCCGAACCGAAAATGT
CGCGACTTGCGCACACCCAGGTTATTGCTGCCCCGGCACGACAAAGCTGCGGCTCAAGTC
AAGTTATGACTTCATCGAAACGCCCTAGCCTCGTTTCGGCACGGCCACGAAAACGTGTTC
AGTGTGCTGACGCGCTTGCGAGTCCGCCGAGCCTCGGTGGCCGCGTCATCGCGTGCGAGA
GCGTAAGCGCTGCAGCAAGTGGATTCACCTCTACTGGCATCTCAGCTGCCCGCACAGCTT
TGCGGCGTACCAACATCCCGCCGAGCATGGCATCGCCAAGATGCCCATTGAGCTCCGGCG
CTGGATTGCCGAGGAGTGGTGGTACCAGGGCCTGCAGCGCCTGGAGAGCGTCCTGCACGA
GCCGGGCGAGCAGCGAAGCCTGGATAACTGAGGACCACGAGCACCACGGCCACGACCACG
ACCGGTTCGCCTCTACCCTACCCTGGCCGCCCGCCGACGCCGCCAGCTACCTCCCCCCCG
CCGTCCTGGCCGCATGCGTGCCCAGCTCCTTTGACGACTTCTGCCGCCGCATGTCTGCAA
GCGCCGCCGCCGGCCGCCTGCCCGCCGTGCTGCTGGTATCAGCGGCCAGCTGGGCCCACC
TGCGCCCGCAGATCGAGCCC
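A record like the one above can be sanity-checked by joining the sequence lines and counting letters. The following is a minimal sketch, not part of the original record: `parse_fasta` is a hypothetical helper, and only the header plus the first two sequence lines are used as input. Run over the full record, the length should agree with the 860 letters that BLASTX reports for the query. Lower-case letters are counted separately; what the lower-case prefix signifies is not stated in the record, so no meaning is assumed here.

```python
from typing import Dict, Optional


def parse_fasta(text: str) -> Dict[str, str]:
    """Return {record_id: sequence} for FASTA-formatted text.

    Hypothetical helper for illustration; the record id is taken to be
    the first whitespace-delimited token after '>'.
    """
    records: Dict[str, str] = {}
    current: Optional[str] = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            current = line[1:].split()[0]
            records[current] = ""
        elif current is not None:
            # Sequence lines are wrapped; join them without separators.
            records[current] += line
    return records


# Header and first two sequence lines copied from the record above.
EXCERPT = """>KCC000834A_C01 KCC000834A_c01
gctctattggcttgcaatgtagccaccgagcagttttCAGTGGTCTATTACTTATTGCTG
CACATACTATCATCCAGATAGCTAGCACGCGCTTAATATCGCAAGTGTCGACAGCAGTGC
"""

records = parse_fasta(EXCERPT)
seq = records["KCC000834A_C01"]
lowercase = sum(1 for base in seq if base.islower())
print(f"length={len(seq)} lowercase={lowercase}")
```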
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC000834A_C01 KCC000834A_c01
(860 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value
sp|P08124|CC01_CAEEL Cuticle collagen 1 precursor (Squat protein...      56  9e-07
ref|NP_506095.1| COLlagen structural gene, SQuaT SQT-3, DumPY : ...      56  9e-07
ref|NP_767161.1| blr0521 [Bradyrhizobium japonicum] gi|27348769|...      53  6e-06
ref|NP_862423.1| proline-rich extensin-like protein [Micrococcus...      50  5e-05
ref|NP_493977.1| putative secreted or extracellular protein fami...      50  6e-05
>sp|P08124|CC01_CAEEL Cuticle collagen 1 precursor (Squat protein 3) gi|84425|pir||A31219
collagen 1 - Caenorhabditis elegans
gi|6678|emb|CAA23463.1| unnamed protein product
[Caenorhabditis elegans] gi|156258|gb|AAA27988.1|
collagen
Length = 296
Score = 55.8 bits (133), Expect = 9e-07
Identities = 29/86 (33%), Positives = 39/86 (44%)
Frame = +1
Query: 493 TSRRAWHRQDAH*APALDCRGVVVPGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSP 672
T+R+A+ + + AP L C G +PGP P P + G+ +PG + T G P
Sbjct: 75 TTRQAYGGPEVNPAPNLQCEGCCLPGPPGPAGAPGKPGKPGRPG-----APGTPGTPGKP 129
Query: 673 LPYPGRPPTPPATSPPPSWPHACPAP 750
P P TPP P P P P P
Sbjct: 130 PVAPCEPTTPPPCKPCPQGPPGPPGP 155
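The "Frame = +1" and "Query: 493" values in the alignment above can be checked directly against the nucleotide sequence. The sketch below is an illustration under stated assumptions rather than a reproduction of what BLASTX does internally: it assumes the standard genetic code, and `translate` and `frame_of` are hypothetical helper names. `SEGMENT` is nucleotides 493-540 of the contig, copied from the Fasta section.

```python
from itertools import product

# Standard genetic code (an assumption; the report does not name the table),
# laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


def frame_of(start: int) -> int:
    """Forward reading frame (1, 2 or 3) for a 1-based start position."""
    return (start - 1) % 3 + 1


# Nucleotides 493-540 of the query, copied from the Fasta section.
SEGMENT = "ACATCCCGCCGAGCATGGCATCGCCAAGATGCCCATTGAGCTCCGGCG"

peptide = translate(SEGMENT)
print(frame_of(493), peptide)
```

The translated segment begins with the same residues as the Query line of the alignment, including the in-frame stop shown as `*`.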
>ref|NP_506095.1| COLlagen structural gene, SQuaT SQT-3, DumPY : shorter than
wild-type DPY-15 (29.2 kD) (sqt-3) [Caenorhabditis
elegans] gi|7499772|pir||T21314 hypothetical protein
F23H12.4 - Caenorhabditis elegans
gi|3876307|emb|CAA98942.1| C. elegans SQT-3 protein
(corresponding sequence F23H12.4) [Caenorhabditis
elegans]
Length = 301
Score = 55.8 bits (133), Expect = 9e-07
Identities = 29/86 (33%), Positives = 39/86 (44%)
Frame = +1
Query: 493 TSRRAWHRQDAH*APALDCRGVVVPGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSP 672
T+R+A+ + + AP L C G +PGP P P + G+ +PG + T G P
Sbjct: 80 TTRQAYGGPEVNPAPNLQCEGCCLPGPPGPAGAPGKPGKPGRPG-----APGTPGTPGKP 134
Query: 673 LPYPGRPPTPPATSPPPSWPHACPAP 750
P P TPP P P P P P
Sbjct: 135 PVAPCEPTTPPPCKPCPQGPPGPPGP 160
>ref|NP_767161.1| blr0521 [Bradyrhizobium japonicum] gi|27348769|dbj|BAC45786.1|
blr0521 [Bradyrhizobium japonicum USDA 110]
Length = 745
Score = 53.1 bits (126), Expect = 6e-06
Identities = 40/104 (38%), Positives = 45/104 (42%), Gaps = 22/104 (21%)
Frame = +1
Query: 559 VVPGPAAPGERPARAGRAAKPG*LRT---TSTTATTTTGSPLPYPGRPP----------- 696
V P PAAP RP AA P T T+T A T T +P PG PP
Sbjct: 200 VAPPPAAPTARPGSPAPAATPAPTPTPAPTATPAPTATPAPGSTPGAPPAGRPGAPPPGV 259
Query: 697 ---TPPATSPPPSWPHACPAPLTTSAAACLQAPP-----PAACP 804
+PPA PP+ P A PAP TT A PP PA+ P
Sbjct: 260 RPGSPPAAGSPPA-PGATPAPTTTPAPGGTATPPSGRPGPASTP 302
Score = 48.5 bits (114), Expect = 1e-04
Identities = 33/94 (35%), Positives = 35/94 (37%)
Frame = +1
Query: 565 PGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPATSPPPSWPHACP 744
P PA P RPA A P A P P P PTP PPP+ P A P
Sbjct: 137 PPPAPPAARPAPTPPAPPPA-------AAPQHAPPPPPPPAARPTPTPPPPPPAGPAARP 189
Query: 745 APLTTSAAACLQAPPPAACPPCCWYQRPAGPTCA 846
P T+ P P A PP RP P A
Sbjct: 190 TPAPTAT------PTPVAPPPAAPTARPGSPAPA 217
Score = 42.4 bits (98), Expect = 0.010
Identities = 27/92 (29%), Positives = 33/92 (35%), Gaps = 1/92 (1%)
Frame = +1
Query: 565 PGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPL-PYPGRPPTPPATSPPPSWPHAC 741
P P P RP P T A T T +P+ P P P P + P + P
Sbjct: 164 PPPPPPAARPTPTPPPPPPAGPAARPTPAPTATPTPVAPPPAAPTARPGSPAPAATPAPT 223
Query: 742 PAPLTTSAAACLQAPPPAACPPCCWYQRPAGP 837
P P T+ A P P + P RP P
Sbjct: 224 PTPAPTATPAPTATPAPGSTPGAPPAGRPGAP 255
Score = 42.4 bits (98), Expect = 0.010
Identities = 40/100 (40%), Positives = 42/100 (42%), Gaps = 8/100 (8%)
Frame = +1
Query: 565 PGPAAPGERPAR---AGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPATSP-PPSWP 732
PG PG RP AG PG +T A TTT P PG TPP+ P P S P
Sbjct: 252 PGAPPPGVRPGSPPAAGSPPAPG-----ATPAPTTT----PAPGGTATPPSGRPGPASTP 302
Query: 733 ---HACPAPLTTSAAACLQAPPPAACPPCCWYQRP-AGPT 840
A PAP T A PPP RP AGPT
Sbjct: 303 APGAATPAPTATPAPGGALTPPPG---------RPGAGPT 333
Score = 42.4 bits (98), Expect = 0.010
Identities = 30/78 (38%), Positives = 32/78 (40%)
Frame = +1
Query: 571 PAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPATSPPPSWPHACPAP 750
PAAP RPA AA P P P RP PP PPP+ A P P
Sbjct: 61 PAAPA-RPAAPPPAAAP--------PHPPAAPPPAAAPPRPAAPPPPPPPPAARPAPPPP 111
Query: 751 LTTSAAACLQAPPPAACP 804
AA +PPPAA P
Sbjct: 112 PPPPAAPKQPSPPPAAAP 129
Score = 42.0 bits (97), Expect = 0.013
Identities = 44/127 (34%), Positives = 48/127 (37%), Gaps = 26/127 (20%)
Frame = +1
Query: 535 PALDCRGVVVPGPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPA-- 708
PA R P AAP PA AA P R + + P P PP PPA
Sbjct: 61 PAAPARPAAPPPAAAPPHPPAAPPPAAAPP--RPAAPPPPPPPPAARPAPPPPPPPPAAP 118
Query: 709 ----------------TSPPPSWPHACPAPLTTS---AAACLQAPP----PAACP-PCCW 816
T PPP+ P A PAP + AAA APP PAA P P
Sbjct: 119 KQPSPPPAAAPQQHAPTPPPPAPPAARPAPTPPAPPPAAAPQHAPPPPPPPAARPTPTPP 178
Query: 817 YQRPAGP 837
PAGP
Sbjct: 179 PPPPAGP 185
Score = 37.0 bits (84), Expect = 0.42
Identities = 21/51 (41%), Positives = 23/51 (44%)
Frame = +1
Query: 685 GRPPTPPATSPPPSWPHACPAPLTTSAAACLQAPPPAACPPCCWYQRPAGP 837
G+P PP PP + P A PA A PPAA PP RPA P
Sbjct: 46 GKPKQPPK-GPPGAAPPAAPARPAAPPPAAAPPHPPAAPPPAAAPPRPAAP 95
Score = 35.4 bits (80), Expect = 1.2
Identities = 31/96 (32%), Positives = 35/96 (36%), Gaps = 3/96 (3%)
Frame = +1
Query: 568 GPAAPGERPARAGRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTPPATSPP---PSWPHA 738
GP ++P + A P A +P P PP PPA PP P P A
Sbjct: 43 GPDGKPKQPPKGPPGAAP-------PAAPARPAAPPP-AAAPPHPPAAPPPAAAPPRPAA 94
Query: 739 CPAPLTTSAAACLQAPPPAACPPCCWYQRPAGPTCA 846
P P AA APPP PP Q P A
Sbjct: 95 PPPPPPPPAAR--PAPPPPPPPPAAPKQPSPPPAAA 128
Score = 34.3 bits (77), Expect = 2.7
Identities = 39/106 (36%), Positives = 41/106 (37%), Gaps = 8/106 (7%)
Frame = +1
Query: 565 PGPAA--PGERPARAGRAAKPG*LRTTSTTATTTTGSPL-PYPGRP-----PTPPATSPP 720
PG A P RP A A PG T + TAT G L P PGRP P P +PP
Sbjct: 285 PGGTATPPSGRPGPASTPA-PG-AATPAPTATPAPGGALTPPPGRPGAGPTPGPQGGTPP 342
Query: 721 PSWPHACPAPLTTSAAACLQAPPPAACPPCCWYQRPAGPTCARRSS 858
P AA APP A P RPA P A S
Sbjct: 343 AGAP----------AAGTPAAPPQAGGLPA----RPAAPAGAAAPS 374
>ref|NP_862423.1| proline-rich extensin-like protein [Micrococcus sp. 28]
gi|18025411|gb|AAK62519.1| proline-rich extensin-like
protein [Micrococcus sp. 28]
Length = 406
Score = 50.1 bits (118), Expect = 5e-05
Identities = 44/142 (30%), Positives = 57/142 (39%), Gaps = 4/142 (2%)
Frame = +1
Query: 445 SPLLASQLPAQLCGVPTSRRAWHRQDAH*APALDCRGVVVPGPAAPGERPARAGRAAKPG 624
+P+ A+++P VP S + APA ++ PAAP P A A PG
Sbjct: 201 APVPAAEVPPA-APVPWSENSAEDAPLPAAPAAPLSALLPAAPAAPLSAPPLAEPPAAPG 259
Query: 625 *LRTTSTTATTTTGSPLPYPGRPPTPPATSPPPSWPHACPAPLTTSAAACLQAPPPAACP 804
L A + P PP A +PPP+ P PA AAA PPP A P
Sbjct: 260 SLPAPFAPAVSPEAVAPAVPLAPPAVSAAAPPPARPPPGPAAPPPPAAA---LPPPPAAP 316
Query: 805 PC----CWYQRPAGPTCARRSS 858
P PA P AR ++
Sbjct: 317 PARAAPAAAPAPAPPVPARMAA 338
Score = 33.9 bits (76), Expect = 3.5
Identities = 35/107 (32%), Positives = 42/107 (38%), Gaps = 16/107 (14%)
Frame = +1
Query: 565 PGPAAPGERPARA---GRAAKPG*LRTTSTTATTTTGSPLPYPGRPPTP-PATSPPPSWP 732
P A P PARA G A L ++ + + P PP P PA PP+ P
Sbjct: 154 PAAARPAAVPARAPLIGDAPPAAGLTPAPESSPSLASTRSTVPSTPPAPVPAAEVPPAAP 213
Query: 733 ------HACPAPLTTSAAACLQAPPPAA------CPPCCWYQRPAGP 837
A APL + AA L A PAA PP + PA P
Sbjct: 214 VPWSENSAEDAPLPAAPAAPLSALLPAAPAAPLSAPPLA--EPPAAP 258
>ref|NP_493977.1| putative secreted or extracellular protein family member precursor
(46.3 kD) (2B757) [Caenorhabditis elegans]
gi|13775528|gb|AAK39336.1| Hypothetical protein Y51H7C.1
[Caenorhabditis elegans]
Length = 448
Score = 49.7 bits (117), Expect = 6e-05
Identities = 27/73 (36%), Positives = 34/73 (45%), Gaps = 11/73 (15%)
Frame = +1
Query: 631 RTTSTTATTTTGSPLPYPGRPPTPPATSPPPSWPHACPAPLTTSAAACLQ---------- 780
RTT+TT TTTT + LP P PPAT PP ++P P P+ T+ Q
Sbjct: 214 RTTTTTTTTTTTTTLPPCVYTPAPPATQPPRTYPTTTPRPIPTTKTTVPQQWTTLPPGPE 273
Query: 781 -APPPAACPPCCW 816
P P CC+
Sbjct: 274 PTPGPGPSGECCY 286
EST assembly
   | clone       | accession | position
 1 | MXL017d07_r | BP094045  | 1-376
 2 | HCL056e08_r | AV642711  | 38-545
 3 | HCL091b12_r | AV644609  | 49-438
 4 | HCL046c09_r | AV642141  | 58-533
 5 | CL73e07_r   | AV396989  | 67-540
 6 | HCL010c06_r | AV640095  | 67-310
 7 | HCL057e08_r | AV642766  | 71-601
 8 | MXL043g11_r | BP095557  | 73-263
 9 | HCL024a01_r | AV640876  | 74-339
10 | HCL009b04_r | AV640031  | 80-324
11 | LCL100c10_r | AV631818  | 82-637
12 | HCL064a10_r | AV643117  | 121-616
13 | HCL001e09_r | AV639593  | 344-646
14 | HCL067f11_r | AV643330  | 382-860
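The position pairs listed above can be used to check how the 14 ESTs tile the contig. The sketch below is a hedged illustration only: it assumes each pair is a 1-based, inclusive start and end on the 860-letter contig (the record does not say so explicitly), and `merge_spans` and `max_depth` are hypothetical helper names.

```python
from typing import List, Tuple

Span = Tuple[int, int]

# (clone, start, end) copied from the table above. Reading the two
# position values as 1-based inclusive start/end is an assumption.
ESTS = [
    ("MXL017d07_r", 1, 376), ("HCL056e08_r", 38, 545),
    ("HCL091b12_r", 49, 438), ("HCL046c09_r", 58, 533),
    ("CL73e07_r", 67, 540), ("HCL010c06_r", 67, 310),
    ("HCL057e08_r", 71, 601), ("MXL043g11_r", 73, 263),
    ("HCL024a01_r", 74, 339), ("HCL009b04_r", 80, 324),
    ("LCL100c10_r", 82, 637), ("HCL064a10_r", 121, 616),
    ("HCL001e09_r", 344, 646), ("HCL067f11_r", 382, 860),
]
SPANS: List[Span] = [(start, end) for _, start, end in ESTS]


def merge_spans(spans: List[Span]) -> List[Span]:
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged: List[Span] = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def max_depth(spans: List[Span]) -> int:
    """Largest number of intervals covering any single position."""
    events = []
    for start, end in spans:
        events.append((start, 1))      # interval opens at start
        events.append((end + 1, -1))   # and closes just after end
    depth = best = 0
    # Sorting puts closes (-1) before opens (+1) at the same position.
    for _, delta in sorted(events):
        depth += delta
        best = max(best, depth)
    return best


print(merge_spans(SPANS), max_depth(SPANS))
```

Under that reading of the positions, the ESTs merge into one uninterrupted span over the whole contig, with the deepest coverage in the 5' half and a single EST supporting the last stretch.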
Chlamydomonas reinhardtii
Kazusa DNA Research Institute