KCC003249A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC003249A_C01 KCC003249A_c01
cttggtggatgtggccgagcctcttggcccgtacgggccccagcagcccacaactgccac
ctcgactgctgcagctgctgTAGCTGCGGGTGAGGCGGGTGGCGCAAGCGGCAGGCAGCT
CCAGCCGGCTCCGGCACCTCCAACTGCGGCCAGCCGCTACCCGTGGAACATGGGCTGGGG
TGCGGGGGCAAGCGGTGCAGCGGGCAGGGAGAACACACCGCTGCGCCAGCTTCCGCCCTT
CCCGCCGCCGCAGCCTGAGCTGGCAGCCCTGCCGCTGCACCGTGCGCAAGCGACGGGCAC
AGTGGCGACAGGCACCGCTGGACCTGGAGAGCAGGGGCCGCAAGGGTACCAGCAACGTAG
TGGCGGCGGCGACACGACGGAGGCCCATGTGCCACCACCGCTG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC003249A_C01 KCC003249A_c01
         (403 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAK30597.1|AF350268_1 major ampullate spidroin 2-like protein...    51  3e-06
ref|NP_506284.2| COLlagen structural gene (30.3 kD) (col-160) [C...    50  8e-06
sp|P33479|IE18_PRVKA IMMEDIATE-EARLY PROTEIN IE180 gi|418707|pir...    50  8e-06
pir||T22827 hypothetical protein F57B1.4 - Caenorhabditis elegans      50  8e-06
gb|AAC04503.1| spidroin 2 [Araneus bicentenarius]                      50  8e-06

>gb|AAK30597.1|AF350268_1 major ampullate spidroin 2-like protein [Argiope trifasciata]
          Length = 120

 Score = 51.2 bits (121), Expect = 3e-06
 Identities = 43/117 (36%), Positives = 52/117 (43%), Gaps = 4/117 (3%)
 Frame = +2

Query: 26  GPYGPQQPTTATSTAAAAVAAGEAG-GASGRQLQPAPAPPTAASRYPWNMGWGAGASGAA 202
           G YG   P  A++ AAAA A G  G G SG      P PP      P   G  A A+ AA
Sbjct: 8   GGYGTSGPGGASAAAAAAAAGGPGGQGPSG------PGPPGPGGYGPSGPGAAAAAAAAA 61

Query: 203 GRENT--PLRQLPP-FPPPQPELAALPLHRAQATGTVATGTAGPGEQGPQGYQQRSG 364
           G   +  P +Q P  + P  P  A+     A A G    G+ GPG+QGP   Q   G
Sbjct: 62  GGPGSQGPGQQGPGGYGPSGPGGASAAAAAAAAGGPGGQGSYGPGQQGPGAGQYGPG 118

 Score = 32.0 bits (71), Expect = 2.1
 Identities = 37/114 (32%), Positives = 42/114 (36%), Gaps = 9/114 (7%)
 Frame = -2

Query: 339 GPCSPGPAVPVATVPVACARCSGRAASSGCGG---------GKGGSWRSGVFSLPAAPLA 187
           GP   GP     + P   +  +  AA+ G GG         G GG   SG     AA  A
Sbjct: 1   GPGQQGPGGYGTSGPGGASAAAAAAAAGGPGGQGPSGPGPPGPGGYGPSGP-GAAAAAAA 59

Query: 186 PAPQPMFHG*RLAAVGGAGAGWSCLPLAPPASPAATAAAAVEVAVVGCWGPYGP 25
            A  P   G      GG G      P  P  + AA AAAA      G  G YGP
Sbjct: 60  AAGGPGSQGPGQQGPGGYG------PSGPGGASAAAAAAAA--GGPGGQGSYGP 105

>ref|NP_506284.2| COLlagen structural gene (30.3 kD) (col-160) [Caenorhabditis
           elegans] gi|24817333|emb|CAB01508.2| C. elegans COL-160
           protein (corresponding sequence F57B1.4) [Caenorhabditis
           elegans]
          Length = 317

 Score = 50.1 bits (118), Expect = 8e-06
 Identities = 41/110 (37%), Positives = 49/110 (44%), Gaps = 1/110 (0%)
 Frame = +2

Query: 20  PLGPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGAGASGA 199
           P GP GP         + A  A G++GGAS     P PA P   S  P + G  AGA GA
Sbjct: 175 PAGPPGPSGAPGQKGPSGAPGAPGQSGGAS-LPGPPGPAGPPGPSGQPGSNG-NAGAPGA 232

Query: 200 AGR-ENTPLRQLPPFPPPQPELAALPLHRAQATGTVATGTAGPGEQGPQG 346
            G+  + P    P  PP  P  A  P    Q      +G+A PG  GPQG
Sbjct: 233 PGQVVDVPGTPGPAGPPGPPGPAGAPGQPGQ------SGSAQPGGPGPQG 276

>sp|P33479|IE18_PRVKA IMMEDIATE-EARLY PROTEIN IE180 gi|418707|pir||A45344 immediate-early
           protein - suid herpesvirus 1 (strain Kaplan)
           gi|334071|gb|AAA47470.1| immediate-early protein
          Length = 1446

 Score = 50.1 bits (118), Expect = 8e-06
 Identities = 45/124 (36%), Positives = 51/124 (40%), Gaps = 6/124 (4%)
 Frame = +2

Query: 35  GPQQPTTATSTAAAAVAAGEAGGASGRQLQPA-----PAPPTAASRYPWNMGWGAGASGA 199
           GP+ PT A   AA A A G  G +S     PA     P P  A  R+    G   G  G 
Sbjct: 144 GPRPPTPAALAAAEAGAPGGPGRSSPSAASPASSSGSPGPSAAPRRWSPARGDPVGEPGP 203

Query: 200 AGRENTPLRQLPPFPPPQP-ELAALPLHRAQATGTVATGTAGPGEQGPQGYQQRSGGGDT 376
           A R  TP       PP QP  +AA P  R  A  + A+  AGP    P G    S GGD 
Sbjct: 204 AARPRTPA------PPAQPAAVAAAPARRGPA--SPASPAAGP-VSAPGGGGAPSAGGDR 254

Query: 377 TEAH 388
              H
Sbjct: 255 GRHH 258

 Score = 32.7 bits (73), Expect = 1.2
 Identities = 31/111 (27%), Positives = 45/111 (39%), Gaps = 3/111 (2%)
 Frame = +2

Query: 14   AEP---LGPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGA 184
            AEP   L P  P+QP       A A AAG   G  G     +PA   ++S    +    +
Sbjct: 804  AEPAPGLPPLWPEQPGLVVPAPAPA-AAGAPSGLPGSG-PSSPASTKSSSSTKSSSSTKS 861

Query: 185  GASGAAGRENTPLRQLPPFPPPQPELAALPLHRAQATGTVATGTAGPGEQG 337
            G SG++G  ++P     P P  + +    P  R    G    G +G   +G
Sbjct: 862  GLSGSSGYASSPAAGPDPAPERRKKKRRAPGARRPGDGEEDEGLSGAALRG 912

 Score = 30.4 bits (67), Expect = 6.2
 Identities = 24/68 (35%), Positives = 29/68 (42%)
 Frame = -2

Query: 378  VVSPPPLRCWYPCGPCSPGPAVPVATVPVACARCSGRAASSGCGGGKGGSWRSGVFSLPA 199
            V +P P     P G    GP+ P +T        S   +SS    G  GS  SG  S PA
Sbjct: 822  VPAPAPAAAGAPSGLPGSGPSSPAST-----KSSSSTKSSSSTKSGLSGS--SGYASSPA 874

Query: 198  APLAPAPQ 175
            A   PAP+
Sbjct: 875  AGPDPAPE 882

 Score = 30.0 bits (66), Expect = 8.0
 Identities = 32/92 (34%), Positives = 36/92 (38%), Gaps = 2/92 (2%)
 Frame = -2

Query: 339 GPCSPGPAVPVATVPVACARCSGRAASSGCGGGKGGSWRSGVF--SLPAAPLAPAPQPMF 166
           GP S   + P    P A A     AA +G  GG G S  S     S   +P   A    +
Sbjct: 136 GPRSRAGSGPRPPTPAALA-----AAEAGAPGGPGRSSPSAASPASSSGSPGPSAAPRRW 190

Query: 165 HG*RLAAVGGAGAGWSCLPLAPPASPAATAAA 70
              R   VG  G        APPA PAA AAA
Sbjct: 191 SPARGDPVGEPGPAARPRTPAPPAQPAAVAAA 222

 Score = 30.0 bits (66), Expect = 8.0
 Identities = 37/133 (27%), Positives = 46/133 (33%), Gaps = 13/133 (9%)
 Frame = +2

Query: 11  VAEPLGPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGAGA 190
           V EP     P+ P      AA A A    G AS       PA P A        G    A
Sbjct: 198 VGEPGPAARPRTPAPPAQPAAVAAAPARRGPAS-------PASPAAGPVSAPGGGGAPSA 250

Query: 191 SGAAGR-----------ENTPLRQLPPFP-PPQPELAALPLHRAQATGTVATGTAGPG-E 331
            G  GR           E    R+L P P   +  +++ P   + +T TVA  T   G E
Sbjct: 251 GGDRGRHHHQHREPLLDEPAAARRLDPRPLGARSPVSSNPNSNSNSTTTVAVETVARGPE 310

Query: 332 QGPQGYQQRSGGG 370
           +   G      GG
Sbjct: 311 KDEDGLGLAGDGG 323

>pir||T22827 hypothetical protein F57B1.4 - Caenorhabditis elegans
          Length = 356

 Score = 50.1 bits (118), Expect = 8e-06
 Identities = 41/110 (37%), Positives = 49/110 (44%), Gaps = 1/110 (0%)
 Frame = +2

Query: 20  PLGPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGAGASGA 199
           P GP GP         + A  A G++GGAS     P PA P   S  P + G  AGA GA
Sbjct: 214 PAGPPGPSGAPGQKGPSGAPGAPGQSGGAS-LPGPPGPAGPPGPSGQPGSNG-NAGAPGA 271

Query: 200 AGR-ENTPLRQLPPFPPPQPELAALPLHRAQATGTVATGTAGPGEQGPQG 346
            G+  + P    P  PP  P  A  P    Q      +G+A PG  GPQG
Sbjct: 272 PGQVVDVPGTPGPAGPPGPPGPAGAPGQPGQ------SGSAQPGGPGPQG 315

>gb|AAC04503.1| spidroin 2 [Araneus bicentenarius]
          Length = 236

 Score = 50.1 bits (118), Expect = 8e-06
 Identities = 43/129 (33%), Positives = 54/129 (41%), Gaps = 9/129 (6%)
 Frame = +2

Query: 26  GPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGAGASGAAG 205
           GPYGP         AAAA AAG  G  SG+Q    P       + P+  G  A A+ A G
Sbjct: 2   GPYGP-------GAAAAAAAAGGYGPGSGQQ---GPGQQGPGQQGPYGPGAAAAAAAAGG 51

Query: 206 R-----ENTPLRQLP----PFPPPQPELAALPLHRAQATGTVATGTAGPGEQGPQGYQQR 358
                 +  P++Q P    P+ P     AA        +G    G  GPG+QGP G Q  
Sbjct: 52  YGPGSGQQGPVQQGPGQQGPYGPGASAAAAAAGGYGPGSGQQGPGQQGPGQQGP-GQQGP 110

Query: 359 SGGGDTTEA 385
            G G +  A
Sbjct: 111 YGAGASAAA 119

 Score = 32.0 bits (71), Expect = 2.1
 Identities = 32/109 (29%), Positives = 43/109 (39%), Gaps = 4/109 (3%)
 Frame = +2

Query: 26  GPYGPQQPTTATSTAAAAVAAGEAGGASGRQ--LQPAPAPPTAASRYPWNMGWGAGASGA 199
           G  GP  P      +AAA AAG  G  SG+Q   Q  P       + P+  G  A A+ A
Sbjct: 67  GQQGPYGP----GASAAAAAAGGYGPGSGQQGPGQQGPGQQGPGQQGPYGAGASAAAAAA 122

Query: 200 AGRENTPLRQLP--PFPPPQPELAALPLHRAQATGTVATGTAGPGEQGP 340
            G      +Q P      P    AA  L  + A+  V++  +     GP
Sbjct: 123 GGYGPGSGQQGPGVRVAAPVASAAASRLSSSAASSRVSSAVSSLVSSGP 171



EST assemble image


clone accession position
1 MXL020b02_r BP094249 1 399
2 LCL057c09_r AV629356 211 403




Chlamydomonas reinhardtii
Kazusa DNA Research Institute