KCC000371A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000371A_C01 KCC000371A_c01
CTACTTGAACGACCTTTCATTTAGCGATTTCAGAGTTAAAGACCCGTCAGAGCAGTCACG
TTACAGACTGCGAACGCTTGTAATATACCGACGGAGCACGTATTGTCCGTCACCCGCCTG
TTCTGCTGGTCCGCGGAGCAAGGGACCAGAGCAGACCACAGCTCAAGCAAAGACCGCCTG
CAAAGTGGGCGCGGCCCACAAAGTGGGCGAGGTCTTGCTCGAACGCCGAGTGCGTCTCGC
CGACCGATACCATGAGCGGGCGCGGCGACCGGCGTGACCCGCAAAGCGGTAGCCGATTAG
CGTCCGTTCGCCAGCAGGAAGAGGGCTCCCGTCCCGGAGCCAAAGGCTCGGCTGCCGCGT
CTCAAAGAGACAGCAGGGGCGACACGCCCGACCCGGGAGCTCTAGCTTCGCGCAAGCGTG
AGGCAGGCAGCGTGCAGCAGGCTGCGCAGCAGCCCAGCGCAAAGCGCACACGCGTGCCAG
ATCCCCCGCCGCACTACGCCGGCAAGCCCGACGAACTCGGGCGCACTTGTGATCCCTCGC
CCAAAGGGCAGGACAAGCAGGCGAAGCCCCCGGGTTGAGCACGTGAAGACAGCGGCCAGG
CCCGCAGCTTGCGAGGCTTCGGTCCGCAACGCCTGAGCACGCGCG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000371A_C01 KCC000371A_c01
         (645 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_293801.3| similar to RE32881p [Homo sapiens]                    50  2e-05
gb|AAH44157.1| Similar to hepatoma-derived growth factor, relate...    47  2e-04
emb|CAA67261.1| collagen type I alpha 1 [Homo sapiens]                 46  4e-04
gb|AAB94054.2| pro alpha 1(I) collagen [Homo sapiens]                  46  4e-04
ref|NP_000079.1| alpha 1 type I collagen preproprotein; Collagen...    46  4e-04

>ref|XP_293801.3| similar to RE32881p [Homo sapiens]
          Length = 439

 Score = 50.4 bits (119), Expect = 2e-05
 Identities = 40/116 (34%), Positives = 52/116 (44%), Gaps = 5/116 (4%)
 Frame = +3

Query: 237 SPTDTMSGRGDRRDPQSGSRLASVRQQEEGSRPGAKGSAAASQRDSRGDTPDPGALASRK 416
           S T     + D R P  GS+L S R    GS+  ++  A  +Q+DSR  TP PG+    +
Sbjct: 40  SRTPAPGSQQDSRTPAPGSQLDS-RTPAPGSQQDSRTPAPGTQQDSR--TPAPGSQQDSR 96

Query: 417 REAGSVQQAAQQPSAKRTRVPDPPPHY-AGKPDELGRTC----DPSPKGQDKQAKP 569
             A   QQ +Q         PDP P Y AG+PD   R      DP P+    Q  P
Sbjct: 97  TPAPGTQQDSQTRPRLSAGQPDPCPRYSAGQPDPCPRLSAGQPDPCPRYSAGQPDP 152

>gb|AAH44157.1| Similar to hepatoma-derived growth factor, related protein 2 [Danio
           rerio]
          Length = 417

 Score = 47.4 bits (111), Expect = 2e-04
 Identities = 42/163 (25%), Positives = 62/163 (37%), Gaps = 18/163 (11%)
 Frame = +3

Query: 135 GARDQSRPQLKQRPPAKWARPTKWARSCSN----AECVSPTDTMSGRGDRRDPQSGSRLA 302
           G+ DQ +P +K++ PA    P K AR+ S+     E  SP++         D    S   
Sbjct: 136 GSEDQKKPAVKRKAPAPKRPPVKKARASSSDRDGEESGSPSEPEPSPSSDSDSGKNSDQD 195

Query: 303 SVRQQEEGSRPGAK--------------GSAAASQRDSRGDTPDPGALASRKREAGSVQQ 440
              Q+E G R G K               S + SQ D +    D      R   +GS  Q
Sbjct: 196 FTPQKESGGRGGKKPAGRGRRKKASSGSDSDSGSQSDQKAARSDSEDEKPRPAASGSESQ 255

Query: 441 AAQQPSAKRTRVPDPPPHYAGKPDELGRTCDPSPKGQDKQAKP 569
           +  +  +     P PPP     P    +   P PK + ++ KP
Sbjct: 256 SGSKSDSDSEPPPPPPPTRKA-PQGRKKAEKPPPKPRARKPKP 297

>emb|CAA67261.1| collagen type I alpha 1 [Homo sapiens]
          Length = 1069

 Score = 46.2 bits (108), Expect = 4e-04
 Identities = 34/120 (28%), Positives = 41/120 (33%)
 Frame = +3

Query: 240  PTDTMSGRGDRRDPQSGSRLASVRQQEEGSRPGAKGSAAASQRDSRGDTPDPGALASRKR 419
            PT      GDR +P                +PGAKG    +        P P   A    
Sbjct: 792  PTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPG 851

Query: 420  EAGSVQQAAQQPSAKRTRVPDPPPHYAGKPDELGRTCDPSPKGQDKQAKPPG*AREDSGQ 599
              G+V      P AK  R    PP   G P   GR   P P G      PPG A ++ G+
Sbjct: 852  PIGNVGA----PGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGK 907

 Score = 35.8 bits (81), Expect = 0.55
 Identities = 28/86 (32%), Positives = 33/86 (37%), Gaps = 2/86 (2%)
 Frame = +3

Query: 324 GSR--PGAKGSAAASQRDSRGDTPDPGALASRKREAGSVQQAAQQPSAKRTRVPDPPPHY 497
           GSR  PGA G A          +P P        EAG   +A    +   T  P      
Sbjct: 491 GSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSP------ 544

Query: 498 AGKPDELGRTCDPSPKGQDKQAKPPG 575
            G P   G+T  P P GQD +  PPG
Sbjct: 545 -GSPGPDGKTGPPGPAGQDGRPGPPG 569

 Score = 35.0 bits (79), Expect = 0.94
 Identities = 36/140 (25%), Positives = 46/140 (32%), Gaps = 20/140 (14%)
 Frame = +3

Query: 240  PTDTMSGRGDRRDPQSGSRLASVRQQEEGSRPGAKGSAAASQRDSRGDTPDPGALASRKR 419
            P     G+G R +     R   V         G KGS  A        TP P  +A ++ 
Sbjct: 900  PAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRG 959

Query: 420  EAG----------------SVQQAAQQPSAKRTRVPDP----PPHYAGKPDELGRTCDPS 539
              G                S +   Q PS        P    PP  AG P E GR   P 
Sbjct: 960  VVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPG 1019

Query: 540  PKGQDKQAKPPG*AREDSGQ 599
             +G   +   PG A+ D G+
Sbjct: 1020 AEGSPGRDGSPG-AKGDRGE 1038

 Score = 32.7 bits (73), Expect = 4.7
 Identities = 27/84 (32%), Positives = 30/84 (35%), Gaps = 3/84 (3%)
 Frame = +3

Query: 333 PGAKGSAAASQRDSRGDTPDPGALASRKREAGSVQQAAQQPSAKRTRVPDPPPHYAGKPD 512
           PGA G A           P P   A  + E G     A  P  +    P  PP  AGKP 
Sbjct: 604 PGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGP----AGSPGFQGLPGPAGPPGEAGKPG 659

Query: 513 ELGRTCD---PSPKGQDKQAKPPG 575
           E G   D   P P G   +   PG
Sbjct: 660 EQGVPGDLGAPGPSGARGERGFPG 683

>gb|AAB94054.2| pro alpha 1(I) collagen [Homo sapiens]
          Length = 1461

 Score = 46.2 bits (108), Expect = 4e-04
 Identities = 34/120 (28%), Positives = 41/120 (33%)
 Frame = +3

Query: 240  PTDTMSGRGDRRDPQSGSRLASVRQQEEGSRPGAKGSAAASQRDSRGDTPDPGALASRKR 419
            PT      GDR +P                +PGAKG    +        P P   A    
Sbjct: 789  PTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPG 848

Query: 420  EAGSVQQAAQQPSAKRTRVPDPPPHYAGKPDELGRTCDPSPKGQDKQAKPPG*AREDSGQ 599
              G+V      P AK  R    PP   G P   GR   P P G      PPG A ++ G+
Sbjct: 849  PIGNVGA----PGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGK 904

 Score = 35.8 bits (81), Expect = 0.55
 Identities = 28/86 (32%), Positives = 33/86 (37%), Gaps = 2/86 (2%)
 Frame = +3

Query: 324 GSR--PGAKGSAAASQRDSRGDTPDPGALASRKREAGSVQQAAQQPSAKRTRVPDPPPHY 497
           GSR  PGA G A          +P P        EAG   +A    +   T  P      
Sbjct: 488 GSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSP------ 541

Query: 498 AGKPDELGRTCDPSPKGQDKQAKPPG 575
            G P   G+T  P P GQD +  PPG
Sbjct: 542 -GSPGPDGKTGPPGPAGQDGRPGPPG 566

 Score = 35.0 bits (79), Expect = 0.94
 Identities = 36/140 (25%), Positives = 46/140 (32%), Gaps = 20/140 (14%)
 Frame = +3

Query: 240  PTDTMSGRGDRRDPQSGSRLASVRQQEEGSRPGAKGSAAASQRDSRGDTPDPGALASRKR 419
            P     G+G R +     R   V         G KGS  A        TP P  +A ++ 
Sbjct: 897  PAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRG 956

Query: 420  EAG----------------SVQQAAQQPSAKRTRVPDP----PPHYAGKPDELGRTCDPS 539
              G                S +   Q PS        P    PP  AG P E GR   P 
Sbjct: 957  VVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPG 1016

Query: 540  PKGQDKQAKPPG*AREDSGQ 599
             +G   +   PG A+ D G+
Sbjct: 1017 AEGSPGRDGSPG-AKGDRGE 1035

 Score = 32.7 bits (73), Expect = 4.7
 Identities = 27/84 (32%), Positives = 30/84 (35%), Gaps = 3/84 (3%)
 Frame = +3

Query: 333 PGAKGSAAASQRDSRGDTPDPGALASRKREAGSVQQAAQQPSAKRTRVPDPPPHYAGKPD 512
           PGA G A           P P   A  + E G     A  P  +    P  PP  AGKP 
Sbjct: 601 PGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGP----AGSPGFQGLPGPAGPPGEAGKPG 656

Query: 513 ELGRTCD---PSPKGQDKQAKPPG 575
           E G   D   P P G   +   PG
Sbjct: 657 EQGVPGDLGAPGPSGARGERGFPG 680

>ref|NP_000079.1| alpha 1 type I collagen preproprotein; Collagen I, alpha-1
            polypeptide; osteogenesis imperfecta type IV; collagen of
            skin, tendon and bone, alpha-1 chain [Homo sapiens]
            gi|1418928|emb|CAA98968.1| prepro-alpha1(I) collagen
            [Homo sapiens]
          Length = 1464

 Score = 46.2 bits (108), Expect = 4e-04
 Identities = 34/120 (28%), Positives = 41/120 (33%)
 Frame = +3

Query: 240  PTDTMSGRGDRRDPQSGSRLASVRQQEEGSRPGAKGSAAASQRDSRGDTPDPGALASRKR 419
            PT      GDR +P                +PGAKG    +        P P   A    
Sbjct: 792  PTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPG 851

Query: 420  EAGSVQQAAQQPSAKRTRVPDPPPHYAGKPDELGRTCDPSPKGQDKQAKPPG*AREDSGQ 599
              G+V      P AK  R    PP   G P   GR   P P G      PPG A ++ G+
Sbjct: 852  PIGNVGA----PGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGK 907

 Score = 35.8 bits (81), Expect = 0.55
 Identities = 28/86 (32%), Positives = 33/86 (37%), Gaps = 2/86 (2%)
 Frame = +3

Query: 324 GSR--PGAKGSAAASQRDSRGDTPDPGALASRKREAGSVQQAAQQPSAKRTRVPDPPPHY 497
           GSR  PGA G A          +P P        EAG   +A    +   T  P      
Sbjct: 491 GSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSP------ 544

Query: 498 AGKPDELGRTCDPSPKGQDKQAKPPG 575
            G P   G+T  P P GQD +  PPG
Sbjct: 545 -GSPGPDGKTGPPGPAGQDGRPGPPG 569

 Score = 35.4 bits (80), Expect = 0.72
 Identities = 36/140 (25%), Positives = 47/140 (32%), Gaps = 20/140 (14%)
 Frame = +3

Query: 240  PTDTMSGRGDRRDPQSGSRLASVRQQEEGSRPGAKGSAAASQRDSRGDTPDPGALASRKR 419
            P     G+G R +     R   V         G KGS  A        TP P  +A ++ 
Sbjct: 900  PAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRG 959

Query: 420  EAG----------------SVQQAAQQPSAKRTRVPDP----PPHYAGKPDELGRTCDPS 539
              G                S +   Q PS        P    PP  AG P E GR   P+
Sbjct: 960  VVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPA 1019

Query: 540  PKGQDKQAKPPG*AREDSGQ 599
             +G   +   PG A+ D G+
Sbjct: 1020 AEGSPGRDGSPG-AKGDRGE 1038

 Score = 32.7 bits (73), Expect = 4.7
 Identities = 27/84 (32%), Positives = 30/84 (35%), Gaps = 3/84 (3%)
 Frame = +3

Query: 333 PGAKGSAAASQRDSRGDTPDPGALASRKREAGSVQQAAQQPSAKRTRVPDPPPHYAGKPD 512
           PGA G A           P P   A  + E G     A  P  +    P  PP  AGKP 
Sbjct: 604 PGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGP----AGSPGFQGLPGPAGPPGEAGKPG 659

Query: 513 ELGRTCD---PSPKGQDKQAKPPG 575
           E G   D   P P G   +   PG
Sbjct: 660 EQGVPGDLGAPGPSGARGERGFPG 683



EST assemble image


clone accession position
1 LCL032h10_r AV627852 1 472
2 LCL035c08_r AV628010 1 528
3 CL22h07_r AV394252 4 395
4 LCL089d07_r AV631137 4 403
5 MXL063c07_r BP096696 9 410
6 HCL017h08_r AV640533 18 528
7 CL68f02_r AV396944 107 647




Chlamydomonas reinhardtii
Kazusa DNA Research Institute