KCC003326A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC003326A_C01 KCC003326A_c01
gggcgttccggcacgcatcacctaccgcgtcaccacgtctgttgcggcggatgccagcat
ctccgcggatgtggcagcacAGACCGCATCGAGCCCCAGCAAGCTGCAGTTCGCCATGAC
GGAGCAGCCGCTCAGCTCCGCGCAACGGGCCAGCTTCAGCGGCAATGCGGCGGTGGTGGG
CAGCTCCGTCCACTTCCCGGTGTACGTCAACTCGTACGGCCTGTTCTACAACGTGCCCGG
CAACCGAGTGACGCCGCTTAACATCACCGCTTGTCTGGCGGCGCGCATTCTGACGGGGGA
CATCCAAGAGTGGACACACCCAGACTTTCTGGACAAGAACCCGGGCTTCGTCAAGACCGG
CACGGACTCATACAAGATCACTGTGCTGATGGACACCGCGACCTCCGGCTCCACACTGGC
GGTGCTGGGGTGGCTGGCCAAGGCCTGCCCCTCGGTTCCCGTGACCTACACCGGCGTGCG
CGCTGACGCGGTTGCTGTCGGCTCCAACATGATCGCTTCCCTGCAATCCACCGGCTACTC
TTTCGGCTTCGCGATGGGCAGCGTGGGCCGCGCCGCGTCCGTGAGCGAGTTCGCTGTTGA
GACCGCCACGGCAGGACAGTACCTGCAGACTGGATCCACCAACACCACCAACATGCAGGT
TGCGCTCGCGGCCGCCGTGCCTACCTCCCTGTCCGGTGACTACTCCACCGCCACCGTCGT
CTCGCTGTCCGCCGACCGCGTGGCGCCCATCCTCACGGTGGGCTACCTGGTCACCAAGAG
CGACTGGTCCACAGAGGGCCTCGAGGAGGGGCAGCGCGGCGAGCTGGTGCGGGCG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC003326A_C01 KCC003326A_c01
         (835 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_342020.1| Phosphate binding periplasmic protein precursor...    63  7e-09
pir||C34768 ORF2 protein - Orf virus (strain NZ2) gi|332565|gb|A...    58  2e-07
ref|XP_302837.1| hypothetical protein XP_302837 [Homo sapiens]         54  4e-06
gb|AAF43497.1|AF134579_1 arabinogalactan protein [Zea mays]            53  5e-06
gb|AAA59875.1| mucin                                                   52  9e-06

>ref|NP_342020.1| Phosphate binding periplasmic protein precursor (pstS) [Sulfolobus
           solfataricus] gi|25296900|pir||C90194 hypothetical
           protein pstS [imported] - Sulfolobus solfataricus
           gi|13813646|gb|AAK40810.1| Phosphate binding periplasmic
           protein precursor (pstS) [Sulfolobus solfataricus]
          Length = 405

 Score = 62.8 bits (151), Expect = 7e-09
 Identities = 68/262 (25%), Positives = 109/262 (40%), Gaps = 15/262 (5%)
 Frame = +2

Query: 68  DVAAQTASSPSKLQFAMTEQPLSSAQRASFSGNAAVVGSSVHFPVYVNSYGLFYNVPGNR 247
           D+ A     PS L   +T++                 G  +  P  V +  + YN+P   
Sbjct: 113 DIGATDVPPPSNLYQQLTQK----------------YGEVLTIPDVVGAVDIIYNIPSFS 156

Query: 248 VTPLNITACLAARILTGDIQEWTHPDFLDKNPGFVKTGTDSYKITVLMDTATSGSTLAVL 427
            T LN+TA + A+I  G IQ W  P     NP F  T     KI  +  +  SG+T    
Sbjct: 157 GT-LNLTADVLAKIYLGQIQYWDDPAIKALNPHFNFT---HQKIIAVHRSDGSGTTFIFT 212

Query: 428 GWLAKACPS-----VPVTYT-GVRADAVAVG------SNMIASLQSTGYSFGFAMGSVGR 571
            WL  +  S     V   YT     D +  G        + A +    YS G+       
Sbjct: 213 YWLYTSSQSWRSSNVSYGYTVNWPVDKLGNGLGGKGSDGVTAYVSQNPYSIGYVEAQYAI 272

Query: 572 AASVSEFAVETATAGQY-LQTGSTNTTNMQVALAAAVPTSLSGDYS--TATVVSLSADRV 742
           A +++  AV   + G Y L T ++  T +Q A  +++P+SL+ D S   +  +++ A   
Sbjct: 273 AKNLTPAAVLNPSTGDYVLPTQASIETAIQNANLSSLPSSLTRDLSQYLSVFLNVKAHNA 332

Query: 743 APILTVGYLVTKSDWSTEGLEE 808
            PI+T  +LV K +++ +   E
Sbjct: 333 YPIVTFSWLVIKVNYTDKSKAE 354

>pir||C34768 ORF2 protein - Orf virus (strain NZ2) gi|332565|gb|AAA46789.1| ORF2
          Length = 221

 Score = 58.2 bits (139), Expect = 2e-07
 Identities = 67/217 (30%), Positives = 84/217 (37%), Gaps = 30/217 (13%)
 Frame = -3

Query: 680 ARRPRAQPACWWCWWIQSAGTVLPWRSQQRTRSRTRRGPR-----CPSRSRKSSRWIAGK 516
           ARRPR +    WC           WR  +  R+RTRRG       C  R   ++RW  G 
Sbjct: 21  ARRPRRR----WC--------ARCWRRARARRARTRRGGCRWRRCCARRCAAAARWTRGC 68

Query: 515 RSCWSRQQPRQR--------ARR---CRSREPRGRPWPATPAPPVWSRRSRCPS-AQ*SC 372
           R CW    PR R        ARR   C  R   G  WP    P    R S  P+ A  + 
Sbjct: 69  RCCWRGAGPRPRRAPAAARGARRRGCCARRARGGGGWPRRWRPRAARRGSGPPARAPAAA 128

Query: 371 MSPCRS*RSPGSCPESLGVSTL---------GCPPSECAPPDKR*C*AASLGCRARCRTG 219
           ++P ++ RS     + L V TL           PP+      ++ C     GC AR R  
Sbjct: 129 LAPDQAPRSKVK-KDRLAVETLPPQPRTPHTRLPPARRQHRSQQACTPRRAGCSARSRWW 187

Query: 218 RTS*RTPGSG----RSCPPPPHCR*SWPVARS*AAAP 120
           R +  + GS     R C PP     SW  A S   AP
Sbjct: 188 RWASSSAGSSAPRRRRCRPP-----SWRRAPSARTAP 219

>ref|XP_302837.1| hypothetical protein XP_302837 [Homo sapiens]
          Length = 250

 Score = 53.5 bits (127), Expect = 4e-06
 Identities = 71/234 (30%), Positives = 90/234 (38%), Gaps = 11/234 (4%)
 Frame = +3

Query: 21  PTASPRLLRRMP--ASPRMWQHR--PHRAPASCSSP*RSSRSAPRNGPASAAMRRWWAAP 188
           PTASP    R P  ASP     R  P R P   S      R++PR  P  A+ RR     
Sbjct: 32  PTASPA---RTPPRASPARTPPRASPGRTPPRASPRRTPPRASPRRAPPRASPRRTPPTA 88

Query: 189 STSRCTSTRTACSTTCPATE*RRLTSPLVWRRAF*RGTSKSGHTQTFWTRTRASSRPART 368
           S +R   T +   T  P     R  S    RR   R +          TRT   + P RT
Sbjct: 89  SPTRTPPTESPARTP-PTASPARTPSRASPRRTPPRASP---------TRTPPRASPRRT 138

Query: 369 HTRSLC*WTPRPPAPHWRCWGGWPRPAPRFP*PTPACALTRLLSAPT*SLPCNPPA---- 536
             R+    +PR   P        PR +P+   P  +   T   ++PT + P   PA    
Sbjct: 139 PPRA----SPRRTPPRASPTRAPPRASPKRTPPRASPTRTPPRASPTRTPPTESPARTPP 194

Query: 537 TLSASRWAAWAAPR---P*ASSLLRPPRQDSTCRLDPPTPPTCRLRSRPPCLPP 689
           T S +R  + A+PR   P AS    PPR   T      TPP    +  PP   P
Sbjct: 195 TASPARTPSRASPRRMPPRASPTRTPPRASPT-----RTPPRASPKRTPPTASP 243

 Score = 44.3 bits (103), Expect = 0.003
 Identities = 53/177 (29%), Positives = 69/177 (38%), Gaps = 19/177 (10%)
 Frame = +3

Query: 336 RTRASSRPARTHTRSLC*WTPRPPAPHWRCWGGWPRPAPRFP*PTPACALTRLLSAPT*S 515
           RT   + P RT  R+    +PR   P        PR +PR   PT +   T    +P  +
Sbjct: 47  RTPPRASPGRTPPRA----SPRRTPPRASPRRAPPRASPRRTPPTASPTRTPPTESPART 102

Query: 516 LPCNPPATL-------------SASRWAAWAAPR---P*ASSLLRPPRQDSTCRLDPPTP 647
            P   PA               S +R    A+PR   P AS    PPR   T      +P
Sbjct: 103 PPTASPARTPSRASPRRTPPRASPTRTPPRASPRRTPPRASPRRTPPRASPTRAPPRASP 162

Query: 648 PTCRLRSRPPCLPP--CPVTTPPPPS-SRCPPTAWRPSSRWATWSPRATGPQRASRR 809
                R+ P   PP   P  TPP  S +R PPTA  P+   +  SPR   P+ +  R
Sbjct: 163 KRTPPRASPTRTPPRASPTRTPPTESPARTPPTA-SPARTPSRASPRRMPPRASPTR 218

 Score = 42.0 bits (97), Expect = 0.012
 Identities = 42/123 (34%), Positives = 51/123 (41%), Gaps = 15/123 (12%)
 Frame = +3

Query: 486 TRLLSAPT*SLPCNPPATLSASRWAAWAAPR---P*ASSLLRPPRQD---STCRLDPP-T 644
           T L   P  + P   P T S +R    A+P    P AS    PPR     +  R  P  T
Sbjct: 7   TSLTRTPPTASPARTPPTESPARTPPTASPARTPPRASPARTPPRASPGRTPPRASPRRT 66

Query: 645 PPTCRLRSRPPCLPP-------CPVTTPPPPS-SRCPPTAWRPSSRWATWSPRATGPQRA 800
           PP    R  PP   P        P  TPP  S +R PPTA  P+   +  SPR T P+ +
Sbjct: 67  PPRASPRRAPPRASPRRTPPTASPTRTPPTESPARTPPTA-SPARTPSRASPRRTPPRAS 125

Query: 801 SRR 809
             R
Sbjct: 126 PTR 128

 Score = 33.9 bits (76), Expect = 3.4
 Identities = 30/83 (36%), Positives = 33/83 (39%), Gaps = 6/83 (7%)
 Frame = +3

Query: 579 P*ASSLLRPPRQDSTCRLDPP-----TPPTCRLRSRPPCLPPCPVTTPPPPS-SRCPPTA 740
           P  +SL R P   S  R  P      TPPT      PP     P  TPP  S  R PP A
Sbjct: 4   PYTTSLTRTPPTASPARTPPTESPARTPPTASPARTPP--RASPARTPPRASPGRTPPRA 61

Query: 741 WRPSSRWATWSPRATGPQRASRR 809
                     SPR T P+ + RR
Sbjct: 62  ----------SPRRTPPRASPRR 74

>gb|AAF43497.1|AF134579_1 arabinogalactan protein [Zea mays]
          Length = 274

 Score = 53.1 bits (126), Expect = 5e-06
 Identities = 70/237 (29%), Positives = 90/237 (37%), Gaps = 15/237 (6%)
 Frame = +3

Query: 99  ASCSSP*RSSRSAPRNGPASAAMRRWWAAPSTSRCTSTRTACSTTCPATE*RRLTSPLVW 278
           AS ++    S S     P SA       A STS  TS+ +   T CP T           
Sbjct: 8   ASSAATATPSTSTAAGTPTSACSP---TATSTSTRTSSASTAPTACPGTS---------- 54

Query: 279 RRAF*RGTSKSGHTQTFWTRTRASSRPARTHTRSLC*WTPRPPAPHWRCWGGWPR--PAP 452
                RG+  S  + T  + T A  R  R  T S    +P   +P   C    PR  PAP
Sbjct: 55  -----RGSRPSPCSSTATSSTSAPGRRPRGTTTSTAWSSPWTASPCASCREPTPRGRPAP 109

Query: 453 --RFP*PTPA----CALTRLLSAPT*SLPC-----NPPATLSASRWAAWAAPRP*ASSLL 599
             R P P PA    C+      +P+   PC     +P  T +ASR    +      SS  
Sbjct: 110 CRRCPSPAPARPTACSSRSTAGSPSGPTPCPSQRRSPGCTGTASRPTTASRTLTWPSSSA 169

Query: 600 RPPRQDS--TCRLDPPTPPTCRLRSRPPCLPPCPVTTPPPPSSRCPPTAWRPSSRWA 764
           R P   +  + R   PT  T  + SRP C P     T P P+S      WR + R A
Sbjct: 170 RSPPTCTAWSARRTAPTTSTGSM-SRPQCPPWEGTATTPRPASSPLTAPWRVTRRLA 225

 Score = 35.4 bits (80), Expect = 1.2
 Identities = 52/185 (28%), Positives = 63/185 (33%), Gaps = 21/185 (11%)
 Frame = -3

Query: 593 RTRSRTRRGPRCPSRSR-----------KSSRWIAGKR-------SCWSRQQPRQRARRC 468
           RT S +     CP  SR            SS    G+R       + WS          C
Sbjct: 39  RTSSASTAPTACPGTSRGSRPSPCSSTATSSTSAPGRRPRGTTTSTAWSSPWTASPCASC 98

Query: 467 RSREPRGRPWPATPAPPVWSRRSRCPSAQ*SCMSPCRS*RSPGSCPESLGVSTLGCPPSE 288
           R   PRGRP P            RCPS      +P R    P +C      ST G P   
Sbjct: 99  REPTPRGRPAPC----------RRCPSP-----APAR----PTACSSR---STAGSPSGP 136

Query: 287 CAPPDKR*C*AASLGCRARCRTGRTS*RT---PGSGRSCPPPPHCR*SWPVARS*AAAPS 117
              P +R     S GC        T+ RT   P S  S   PP C  +W   R+   AP+
Sbjct: 137 TPCPSQR----RSPGCTGTASRPTTASRTLTWPSS--SARSPPTCT-AWSARRT---APT 186

Query: 116 WRTAA 102
             T +
Sbjct: 187 TSTGS 191

>gb|AAA59875.1| mucin
          Length = 573

 Score = 52.4 bits (124), Expect = 9e-06
 Identities = 61/229 (26%), Positives = 78/229 (33%), Gaps = 10/229 (4%)
 Frame = +3

Query: 96  PASCSSP*RSSRSAPRNGPASAAMRRWWAAPSTSRCTSTRT-----ACSTTCPATE*RRL 260
           P +  SP  ++ S P   P+          P+T+    T T     A +TT P T     
Sbjct: 124 PTTTPSPPTTTPSPPTTTPSPPTTTTTTPPPTTTPSPPTTTPITPPASTTTLPPT----- 178

Query: 261 TSPLVWRRAF*RGTSKSGHTQTFWTRTRASSRPARTHTRSLC*WTPRPPAPHWRCWGGWP 440
           T+P           S    T T    T   S P  T        T  PP          P
Sbjct: 179 TTP-----------SPPTTTTTTPPPTTTPSPPTTTPITPPTSTTTLPPTTT-------P 220

Query: 441 RPAPRF-----P*PTPACALTRLLSAPT*SLPCNPPATLSASRWAAWAAPRP*ASSLLRP 605
            P P       P  TP+   T   S PT +    PP T  +        P P  +     
Sbjct: 221 SPPPTTTTTPPPTTTPSPPTTTTPSPPTITTTTPPPTTTPSPPTTTTTTPPPTTT----- 275

Query: 606 PRQDSTCRLDPPTPPTCRLRSRPPCLPPCPVTTPPPPSSRCPPTAWRPS 752
           P   +T  + PPT  T    +  P  PP   TTPPP ++  PPT   PS
Sbjct: 276 PSPPTTTPITPPTSTTTLPPTTTPSPPPTTTTTPPPTTTPSPPTTTTPS 324

 Score = 45.4 bits (106), Expect = 0.001
 Identities = 38/140 (27%), Positives = 50/140 (35%), Gaps = 7/140 (5%)
 Frame = +3

Query: 339 TRASSRPARTHTRSLC*WTPRPPAPHWRCWGGWPRPAPRF-------P*PTPACALTRLL 497
           T   S P  T T      TP PP            P+P         P  TP+  +T   
Sbjct: 296 TTTPSPPPTTTTTPPPTTTPSPPTT--------TTPSPPITTTTTPPPTTTPSSPITTTP 347

Query: 498 SAPT*SLPCNPPATLSASRWAAWAAPRP*ASSLLRPPRQDSTCRLDPPTPPTCRLRSRPP 677
           S PT ++    P T  +S       P    +    P    +      P+PPT  + + PP
Sbjct: 348 SPPTTTMTTPSPTTTPSSPITTTTTPSSTTTPSPPPTTMTTPSPTTTPSPPTTTMTTLPP 407

Query: 678 CLPPCPVTTPPPPSSRCPPT 737
                P+TT P P S  PPT
Sbjct: 408 TTTSSPLTTTPLPPSITPPT 427

 Score = 40.4 bits (93), Expect = 0.036
 Identities = 43/157 (27%), Positives = 52/157 (32%), Gaps = 19/157 (12%)
 Frame = +3

Query: 339 TRASSRPARTHTRSLC*WTPRPPAPHWRCWGGWPRPAPRFP*PTPACALTRLLSAPT*SL 518
           T   S P  T T      TP PP            P+P    P+P    T   S PT + 
Sbjct: 93  TTTPSPPITTTTTPPPTTTPSPPISTTTTPPPTTTPSPPTTTPSPP---TTTPSPPTTTT 149

Query: 519 PCNPPATLSASRWAAWAAPRP*ASSLLRPPRQDSTCRLDPPTPPTCRLRSRP-------- 674
              PP T  +        P P +++ L P    S       TPP     S P        
Sbjct: 150 TTPPPTTTPSPPTTTPITP-PASTTTLPPTTTPSPPTTTTTTPPPTTTPSPPTTTPITPP 208

Query: 675 -----------PCLPPCPVTTPPPPSSRCPPTAWRPS 752
                      P  PP   TTPPP ++  PPT   PS
Sbjct: 209 TSTTTLPPTTTPSPPPTTTTTPPPTTTPSPPTTTTPS 245

 Score = 38.9 bits (89), Expect = 0.11
 Identities = 65/264 (24%), Positives = 83/264 (30%), Gaps = 9/264 (3%)
 Frame = +3

Query: 21  PTASPRLLRRMPASPRMWQHRPHRAPASCSSP*RSSRSAPRNGPASAAMRRWWAAPSTSR 200
           PT +P      P+ P      P   P +  SP     + P   PAS         PS   
Sbjct: 131 PTTTPSPPTTTPSPPTTTTTTP--PPTTTPSP---PTTTPITPPASTTTLPPTTTPSPPT 185

Query: 201 CTSTRTACSTTCPATE*RRLTSPLVWRRAF*RGTSKSGHTQTFWTRTRASSRPARTHTRS 380
            T+T    +TT        +T P                + T    T   S P  T T  
Sbjct: 186 TTTTTPPPTTTPSPPTTTPITPPT---------------STTTLPPTTTPSPPPTTTTTP 230

Query: 381 LC*WTPRPPA------PHWRCWGGWPRPAPRFP*PTPACALTRLLSAPT*SLPCNPPATL 542
               TP PP       P        P   P  P  T          +P  + P  PP   
Sbjct: 231 PPTTTPSPPTTTTPSPPTITTTTPPPTTTPSPPTTTTTTPPPTTTPSPPTTTPITPPT-- 288

Query: 543 SASRWAAWAAPRP*ASSLLRPPRQDSTCRLDPPTPPTCRLRSRPPCLPPCPVTTPPP--- 713
           S +       P P  ++   PP   +      P+PPT    +  P  P    TTPPP   
Sbjct: 289 STTTLPPTTTPSPPPTTTTTPPPTTT------PSPPT----TTTPSPPITTTTTPPPTTT 338

Query: 714 PSSRCPPTAWRPSSRWATWSPRAT 785
           PSS    T   P++   T SP  T
Sbjct: 339 PSSPITTTPSPPTTTMTTPSPTTT 362

 Score = 38.1 bits (87), Expect = 0.18
 Identities = 38/136 (27%), Positives = 45/136 (32%)
 Frame = +3

Query: 384 C*WTPRPPAPHWRCWGGWPRPAPRFP*PTPACALTRLLSAPT*SLPCNPPATLSASRWAA 563
           C  TP PP          P P P     T     T   S PT +    PP T        
Sbjct: 53  CITTPSPPTTT-------PSPPPT---STTTLPPTTTPSPPTTTTTTPPPTT-------- 94

Query: 564 WAAPRP*ASSLLRPPRQDSTCRLDPPTPPTCRLRSRPPCLPPCPVTTPPPPSSRCPPTAW 743
              P P  ++   PP   +      P+PP     + PP   P P TT P P    P T  
Sbjct: 95  --TPSPPITTTTTPPPTTT------PSPPISTTTTPPPTTTPSPPTTTPSP----PTTTP 142

Query: 744 RPSSRWATWSPRATGP 791
            P +   T  P  T P
Sbjct: 143 SPPTTTTTTPPPTTTP 158

 Score = 37.4 bits (85), Expect = 0.31
 Identities = 55/209 (26%), Positives = 71/209 (33%), Gaps = 6/209 (2%)
 Frame = +3

Query: 183 APSTSRCTSTRTACSTTCPATE*RRLTSPLVWRRAF*RGTSKSGHTQTFWTRTRASSRPA 362
           +P T+   S  T  +TT P T     T+P           S    T T    T   S P 
Sbjct: 237 SPPTTTTPSPPTITTTTPPPT-----TTP-----------SPPTTTTTTPPPTTTPSPPT 280

Query: 363 RTHTRSLC*WTPRPPAPHWRCWGGWPRPAPRF-----P*PTPACALTRLLSAPT*SLPCN 527
            T        T  PP          P P P       P  TP+   T   S P  +    
Sbjct: 281 TTPITPPTSTTTLPPTTT-------PSPPPTTTTTPPPTTTPSPPTTTTPSPPITTTTTP 333

Query: 528 PPATLSASRWAAWAAPRP*ASSLLRPPRQDSTCRLDPPTPPTCRLRSRPPCLPPCPVTTP 707
           PP T  +S       P P  +++  P    +T    P T  T    +  P  PP  +TTP
Sbjct: 334 PPTTTPSS--PITTTPSPPTTTMTTP--SPTTTPSSPITTTTTPSSTTTPSPPPTTMTTP 389

Query: 708 PPPSSRCPPTAWRPS-SRWATWSPRATGP 791
            P ++  PPT    +     T SP  T P
Sbjct: 390 SPTTTPSPPTTTMTTLPPTTTSSPLTTTP 418



EST assemble image


clone accession position
1 MXL008d05_r BP093436 1 502
2 LCL089f04_r AV631157 337 835




Chlamydomonas reinhardtii
Kazusa DNA Research Institute