KCC001604A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC001604A_C01 KCC001604A_c01
tgcatagaaatcgtccaaaagggtcaCGACTTACTGCGAACATCGACTCAGTTGAGTGCC
TGACGCCGCAAACTTTCTTGTCGAGTTCGGCGCCGTCTGAATCACGCTTGATCTTTGCGT
GTTGCGCGTATTTCCTGTTGCAACTAGGCCATTTTCGCCGCTGCGCCCTGCATTATTGAA
CATGGACTCTGGGGCTTTGACGCGACGAACCGCCCGCCTCCAAGCAAGCCCTCGGCTGCC
GCGGGCGCCAGTCTCGCTGCCGGCTTGGCGCATCCAGCCGCGCACCTCGCGCTACGTGGC
TGCGCTCGCCTCGGGCGGCGGTGACGGCGGTGCGGGCAAGGGCCCCAGCGGCGGTGGTGG
CGGTGGCGGTGGCGGTGATGGCGACGGCAAGGGCAAGGGCGATGACGGCCACAACTCGGG
CGACAAGGGCCCCCAGAAGGGCGGCCTGTTCCGAGGCTGGGAGGAGCGCGTGGCTTACGA
TTCCGAGTTTCCCATTAAGGTGCTGATGGAGCAGGTGATCGGTGTGGGCGCGTGCGTCAT
CGGCGACATGAGCGCGCGACCCAACTGGGGCCTGAACGAAAA


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001604A_C01 KCC001604A_c01
         (582 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_187476.1| glycine-rich protein [Arabidopsis thaliana] gi|...   101  9e-21
dbj|BAB85250.1| P0425G02.25 [Oryza sativa (japonica cultivar-gro...    91  9e-18
ref|NP_187475.1| expressed protein [Arabidopsis thaliana] gi|123...    91  1e-17
prf||1604369A sulfated surface glycoprotein SSG185                     64  1e-09
sp|P21997|SSGP_VOLCA SULFATED SURFACE GLYCOPROTEIN 185 (SSG 185)...    64  1e-09

>ref|NP_187476.1| glycine-rich protein [Arabidopsis thaliana]
           gi|12322723|gb|AAG51347.1|AC012562_8 unknown protein;
           33915-34928 [Arabidopsis thaliana]
           gi|19698961|gb|AAL91216.1| unknown protein [Arabidopsis
           thaliana] gi|22136296|gb|AAM91226.1| unknown protein
           [Arabidopsis thaliana]
          Length = 337

 Score =  101 bits (251), Expect = 9e-21
 Identities = 61/136 (44%), Positives = 81/136 (58%), Gaps = 16/136 (11%)
 Frame = +2

Query: 221 QASPRLPRAPVSL---PAWRIQ-PR-----------TSRYVAALASGGGDGGAGKGPSGG 355
           Q+S RLPR  ++L   P  R+  PR           +S ++A   +GGG GG+     GG
Sbjct: 16  QSSVRLPRV-INLSRDPTTRVSFPRNGSVCSLHTNFSSPHLAKPCAGGGGGGSTGNNGGG 74

Query: 356 GGGGGGGDGDGKGKGDDGHNSGDKGPQKGGLF-RGWEERVAYDSEFPIKVLMEQVIGVGA 532
            G GGGG G G   G+    S   GP   GLF +GW  RVA D +FP KVLME+++G+ A
Sbjct: 75  SGSGGGGGGFGGSGGEASEESSPWGPI--GLFIQGWRSRVAADPQFPFKVLMEEIVGLSA 132

Query: 533 CVIGDMSARPNWGLNE 580
           CV+GDM++RPN+GLNE
Sbjct: 133 CVLGDMASRPNFGLNE 148

>dbj|BAB85250.1| P0425G02.25 [Oryza sativa (japonica cultivar-group)]
           gi|20161467|dbj|BAB90391.1| P0432B10.9 [Oryza sativa
           (japonica cultivar-group)]
          Length = 401

 Score = 91.3 bits (225), Expect = 9e-18
 Identities = 63/155 (40%), Positives = 79/155 (50%), Gaps = 8/155 (5%)
 Frame = +2

Query: 140 ATRPFSPLRPALLNMDSGALTRRTA--RLQASPRL--PRAPVSLPAWRIQPRTSRYVAAL 307
           A RP  P+  A +   +      TA   L +SPRL  PRA  SL    +   +S  +  L
Sbjct: 45  ARRPPLPIAMASMAFTAAKFLPATAPTHLDSSPRLSPPRAG-SLSFSPLSSSSSALLLRL 103

Query: 308 ASGGGDGGAGKG----PSGGGGGGGGGDGDGKGKGDDGHNSGDKGPQKGGLFRGWEERVA 475
            S    G +G G    P     GGGGG GD    G      G  G   G    GW  RVA
Sbjct: 104 RSPSPSGPSGPGGRLPPPPRSYGGGGGSGDAADSG------GSSGGILGIFLAGWAARVA 157

Query: 476 YDSEFPIKVLMEQVIGVGACVIGDMSARPNWGLNE 580
            D +FP KVLME+++GV ACV+GDM++RPN+GLNE
Sbjct: 158 ADPQFPFKVLMEELVGVSACVLGDMASRPNFGLNE 192

 Score = 33.5 bits (75), Expect = 2.2
 Identities = 31/86 (36%), Positives = 32/86 (37%), Gaps = 6/86 (6%)
 Frame = -1

Query: 564 LGRALMSPMTHAPTPITCSISTLMGNSES*ATR----SSQPRNRPPFWGPLS--PELWPS 403
           L  A   P    P PI  +           AT      S PR  PP  G LS  P    S
Sbjct: 37  LPAAAAKPARRPPLPIAMASMAFTAAKFLPATAPTHLDSSPRLSPPRAGSLSFSPLSSSS 96

Query: 402 SPLPLPSPSPPPPPPPPPLGPLPAPP 325
           S L L   SP P  P  P G LP PP
Sbjct: 97  SALLLRLRSPSPSGPSGPGGRLPPPP 122

>ref|NP_187475.1| expressed protein [Arabidopsis thaliana]
           gi|12322720|gb|AAG51344.1|AC012562_5 unknown protein;
           31866-32885 [Arabidopsis thaliana]
          Length = 339

 Score = 90.9 bits (224), Expect = 1e-17
 Identities = 49/91 (53%), Positives = 60/91 (65%), Gaps = 1/91 (1%)
 Frame = +2

Query: 311 SGGGDGGAGKGPSGGGGGGGGGDGDGKGKGDDGHNSGDKGPQKGGLF-RGWEERVAYDSE 487
           +GGG GG+     GG G GGGG G G   G +   S   GP   GLF +GW  RVA DS+
Sbjct: 60  AGGGGGGSIGNHGGGSGSGGGGGGYG---GSEEEESSPWGPL--GLFIQGWRSRVAADSQ 114

Query: 488 FPIKVLMEQVIGVGACVIGDMSARPNWGLNE 580
           FP KVLME ++GV A V+GDM++RPN+GLNE
Sbjct: 115 FPFKVLMEMLVGVSANVLGDMASRPNFGLNE 145

>prf||1604369A sulfated surface glycoprotein SSG185
          Length = 453

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 30/53 (56%), Positives = 32/53 (59%)
 Frame = -1

Query: 471 TRSSQPRNRPPFWGPLSPELWPSSPLPLPSPSPPPPPPPPPLGPLPAPPSPPP 313
           T SS+P + PP   P SP     SP P P P PPPPPPPPP  P P PP PPP
Sbjct: 236 TASSRPPSPPPSPRPPSPPPPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPP 288

 Score = 61.2 bits (147), Expect = 1e-08
 Identities = 30/60 (50%), Positives = 32/60 (53%)
 Frame = -1

Query: 465 SSQPRNRPPFWGPLSPELWPSSPLPLPSPSPPPPPPPPPLGPLPAPPSPPPEASAAT*RE 286
           S  P  RPP   P SP   P  P P P P PPPP PPPP  P P PP PPP  S +  R+
Sbjct: 243 SPPPSPRPPSPPPPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPPRK 302

 Score = 56.2 bits (134), Expect = 3e-07
 Identities = 31/81 (38%), Positives = 36/81 (44%)
 Frame = -1

Query: 546 SPMTHAPTPITCSISTLMGNSES*ATRSSQPRNRPPFWGPLSPELWPSSPLPLPSPSPPP 367
           SP+  +P P   S       S    +      + PP   P  P   P  P P P P PPP
Sbjct: 227 SPLPPSPQPTASSRPPSPPPSPRPPSPPPPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPP 286

Query: 366 PPPPPPLGPLPAPPSPPPEAS 304
           PPPPPP  P P+PP  PP  S
Sbjct: 287 PPPPPPPPPSPSPPRKPPSPS 307

 Score = 52.8 bits (125), Expect = 4e-06
 Identities = 31/76 (40%), Positives = 35/76 (45%), Gaps = 8/76 (10%)
 Frame = -1

Query: 516 TCSISTLMGNSES*ATRSSQPRNRP----PFWGPLSPELWPSS----PLPLPSPSPPPPP 361
           T S S    + +   T  S P   P    P   PL P   P++    P P PSP PP PP
Sbjct: 195 TASYSVFNSDKDCCPTGLSGPNVNPIGPAPNNSPLPPSPQPTASSRPPSPPPSPRPPSPP 254

Query: 360 PPPPLGPLPAPPSPPP 313
           PP P  P P PP PPP
Sbjct: 255 PPSPSPPPPPPPPPPP 270

 Score = 51.2 bits (121), Expect = 1e-05
 Identities = 25/53 (47%), Positives = 28/53 (52%)
 Frame = -1

Query: 456 PRNRPPFWGPLSPELWPSSPLPLPSPSPPPPPPPPPLGPLPAPPSPPPEASAA 298
           P   PP   P SP   P  P P P P PPP P PP   P P+PP PPP + +A
Sbjct: 266 PPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPPRKPPSPSPPVPPPPSPSA 318

 Score = 48.9 bits (115), Expect = 5e-05
 Identities = 30/64 (46%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
 Frame = -1

Query: 465 SSQPRNRPPFWGPLSPELWPSSPLPLPSPSPP-PPPPPPPLGPLPAPPS------PPPEA 307
           S  P   PP   P  P   PS P P P P PP PPPPPP   P   PPS      PPP  
Sbjct: 257 SPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPPRKPPSPSPPVPPPPSP 316

Query: 306 SAAT 295
           SA T
Sbjct: 317 SAVT 320

 Score = 37.7 bits (86), Expect = 0.12
 Identities = 22/49 (44%), Positives = 23/49 (46%), Gaps = 6/49 (12%)
 Frame = -1

Query: 432 GPLSPELWPSSPLPLPSPSPPPPPP-----PPPLGPLPAPPSP-PPEAS 304
           G   P + P  P P  SP PP P P     PP   P P PPSP PP  S
Sbjct: 211 GLSGPNVNPIGPAPNNSPLPPSPQPTASSRPPSPPPSPRPPSPPPPSPS 259

>sp|P21997|SSGP_VOLCA SULFATED SURFACE GLYCOPROTEIN 185 (SSG 185) gi|99441|pir||A33647
           sulfated surface glycoprotein 185 - Volvox carteri
           gi|1405821|emb|CAA35953.1| SULFATED SURFACE GLYCOPROTEIN
           185 [Volvox carteri]
          Length = 485

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 30/53 (56%), Positives = 32/53 (59%)
 Frame = -1

Query: 471 TRSSQPRNRPPFWGPLSPELWPSSPLPLPSPSPPPPPPPPPLGPLPAPPSPPP 313
           T SS+P + PP   P SP     SP P P P PPPPPPPPP  P P PP PPP
Sbjct: 236 TASSRPPSPPPSPRPPSPPPPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPP 288

 Score = 61.2 bits (147), Expect = 1e-08
 Identities = 30/60 (50%), Positives = 32/60 (53%)
 Frame = -1

Query: 465 SSQPRNRPPFWGPLSPELWPSSPLPLPSPSPPPPPPPPPLGPLPAPPSPPPEASAAT*RE 286
           S  P  RPP   P SP   P  P P P P PPPP PPPP  P P PP PPP  S +  R+
Sbjct: 243 SPPPSPRPPSPPPPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPPRK 302

 Score = 56.2 bits (134), Expect = 3e-07
 Identities = 31/81 (38%), Positives = 36/81 (44%)
 Frame = -1

Query: 546 SPMTHAPTPITCSISTLMGNSES*ATRSSQPRNRPPFWGPLSPELWPSSPLPLPSPSPPP 367
           SP+  +P P   S       S    +      + PP   P  P   P  P P P P PPP
Sbjct: 227 SPLPPSPQPTASSRPPSPPPSPRPPSPPPPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPP 286

Query: 366 PPPPPPLGPLPAPPSPPPEAS 304
           PPPPPP  P P+PP  PP  S
Sbjct: 287 PPPPPPPPPSPSPPRKPPSPS 307

 Score = 52.8 bits (125), Expect = 4e-06
 Identities = 31/76 (40%), Positives = 35/76 (45%), Gaps = 8/76 (10%)
 Frame = -1

Query: 516 TCSISTLMGNSES*ATRSSQPRNRP----PFWGPLSPELWPSS----PLPLPSPSPPPPP 361
           T S S    + +   T  S P   P    P   PL P   P++    P P PSP PP PP
Sbjct: 195 TASYSVFNSDKDCCPTGLSGPNVNPIGPAPNNSPLPPSPQPTASSRPPSPPPSPRPPSPP 254

Query: 360 PPPPLGPLPAPPSPPP 313
           PP P  P P PP PPP
Sbjct: 255 PPSPSPPPPPPPPPPP 270

 Score = 52.0 bits (123), Expect = 6e-06
 Identities = 32/88 (36%), Positives = 37/88 (41%)
 Frame = -1

Query: 456 PRNRPPFWGPLSPELWPSSPLPLPSPSPPPPPPPPPLGPLPAPPSPPPEASAAT*REVRG 277
           P   PP   P SP   P  P P P P PPP P PP   P P+PP PPP +  +      G
Sbjct: 266 PPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPPRKPPSPSPPVPPPPSPPSVLPAATG 325

Query: 276 WMRQAGSETGARGSRGLAWRRAVRRVKA 193
           +      E  +R      WR  V  V A
Sbjct: 326 F---PFCECVSRSPSSYPWRVTVANVSA 350

 Score = 50.8 bits (120), Expect = 1e-05
 Identities = 31/68 (45%), Positives = 32/68 (46%), Gaps = 11/68 (16%)
 Frame = -1

Query: 465 SSQPRNRPPFWGPLSPELWPSSPLPLPSPSPP-PPPPPP----------PLGPLPAPPSP 319
           S  P   PP   P  P   PS P P P P PP PPPPPP          P  P+P PPSP
Sbjct: 257 SPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPPRKPPSPSPPVPPPPSP 316

Query: 318 PPEASAAT 295
           P    AAT
Sbjct: 317 PSVLPAAT 324

 Score = 37.7 bits (86), Expect = 0.12
 Identities = 22/49 (44%), Positives = 23/49 (46%), Gaps = 6/49 (12%)
 Frame = -1

Query: 432 GPLSPELWPSSPLPLPSPSPPPPPP-----PPPLGPLPAPPSP-PPEAS 304
           G   P + P  P P  SP PP P P     PP   P P PPSP PP  S
Sbjct: 211 GLSGPNVNPIGPAPNNSPLPPSPQPTASSRPPSPPPSPRPPSPPPPSPS 259



EST assemble image


clone accession position
1 LC034b06_r AV621305 1 497
2 CM068g06_r AV391095 27 642




Chlamydomonas reinhardtii
Kazusa DNA Research Institute