KCC002415A_c01

Fasta Sequence
>KCC002415A_C01 KCC002415A_c01
ctgagcagccgcagcgctacaggactccctgtagcgcggcggcaccaacggtccggctgg
gtggggtgcgaagcggcatgCGGATGCGCAACTCCAATAGCGACGGCAACCTGTGCCGCT
GGGGTAGCACAGCGAGTGGAGTCAGCGCTGTCAGCGCCGTCAGCGCGGCCAGCAGCGGCG
CGGCATGCGGTGACGGTCTTGACAACGGCGCTGCTAGCGGCCGTGCAGATCGGCAGGCGT
CAGTGCTGCAGCAGTCACAGCTGCCTACTTTGGCTGGGCTTGACGCGGTGGAGCACGGCG
GGGCTTCTGGCCCCGGGGCTGCAGCAGGAGAGGTCATGGCGGCCCCCGTGGCCTCATCCG
CCAGCCTGTGTCGGGCTGGCAGCAGCCTGCTGACGCCCATGCAGAGCTGTGAGAGTGGGG
AGGTCGGTGCCGGTGAGCCGGGACAGGCGGCAGACCAGCATGAAGGTGCATTGGAAGCCC
CACCGACGCCCACACCTCCTGCCATCCACCGCAACGCCGTGACCCTGACCGTGTCGTCCG
CATGTGACGGCGAGGACGCCAGCAGCTTGTCAGGAGCCTCAGACGTTGCCGGGCGGCTGT
TAGCGTCAGAGGCGCACCTGGCCTCGGGCTTTGACCACACTTCTGCCACGAAGCGCCCCG
CTACTTTCACTCTGGAGGCGAAGACGCGCACGGACTTCAGGCAGCCGCCACAGGATGCAT
TGCTGCCGACACTTAGCACT
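
The record above is in FASTA format: a header line starting with ">" followed by wrapped sequence lines. As a minimal sketch (the function names `read_fasta` and `case_counts` are illustrative and not part of the source), the following reads such text and reports the length and the lower/upper-case letter counts. The record mixes lower- and upper-case letters; the source does not say what the case means, so the sketch only counts it. The toy input is made up for illustration.

```python
def read_fasta(text):
    """Parse FASTA text into a dict mapping the first header word to its sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited word of the header as the key.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {n: "".join(parts) for n, parts in records.items()}


def case_counts(seq):
    """Return (lower-case count, upper-case count) for a sequence string."""
    lower = sum(1 for c in seq if c.islower())
    return lower, len(seq) - lower


# Toy input (hypothetical, not the record above).
example = ">toy example\nacgtAC\nGT\n"
seqs = read_fasta(example)
for name, seq in seqs.items():
    print(name, len(seq), case_counts(seq))
```

Applied to the record above, the length returned should agree with the "(740 letters)" line in the BLASTX report.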


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC002415A_C01 KCC002415A_c01
         (740 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAC44888.1| unknown [Nannocystis exedens]                           52  7e-06
ref|NP_014487.1| ORF; Yol155cp [Saccharomyces cerevisiae] gi|213...    50  5e-05
dbj|BAB85832.1| cell wall protein Awa1p [Saccharomyces cerevisiae]     49  1e-04
ref|NP_459903.1| Fels-1 prophage; putative minor tail protein [S...    48  2e-04
ref|NP_189520.1| glycine-rich protein [Arabidopsis thaliana] gi|...    48  2e-04

>gb|AAC44888.1| unknown [Nannocystis exedens]
          Length = 290

 Score = 52.4 bits (124), Expect = 7e-06
 Identities = 49/166 (29%), Positives = 71/166 (42%), Gaps = 8/166 (4%)
 Frame = -2

Query: 637 WSKPEAR-CASDANSRPATSEAPDKLLASSPSHADD-TVRVTALRWMAGGVGVGGASNAP 464
           W+ P  R    +A + P    +P    ++SPS A   +   +A+RW          S  P
Sbjct: 5   WAPPHRRGLRENAPAAPYAWRSP----STSPSSASRRSSPSSAVRWPC--------SRTP 52

Query: 463 SCWSAACPGSPAPTSPLSQLCMGVSRLLPARHRLADEATGAAMTSPAA-APGP-----EA 302
             WSA  P S +P SP S  C+   R+ P         TG+    P+A +P P      A
Sbjct: 53  PTWSATSPPSRSPCSPPSSRCVRPCRVAP---------TGSPAPRPSARSPTPCSSRSRA 103

Query: 301 PPCSTASSPAKVGSCDCCSTDACRSARPLAAPLSRPSPHAAPLLAA 164
           P  STA SPA   +    + + C S+ P A+  + P P  +   AA
Sbjct: 104 PSSSTARSPASAAATPTSNPNPCWSSAPPASSSTSPRPGTSRAPAA 149

>ref|NP_014487.1| ORF; Yol155cp [Saccharomyces cerevisiae] gi|2132025|pir||S66852
           hypothetical protein YOL155c - yeast (Saccharomyces
           cerevisiae) gi|1420064|emb|CAA99177.1| unnamed protein
           product [Saccharomyces cerevisiae]
          Length = 967

 Score = 49.7 bits (117), Expect = 5e-05
 Identities = 47/221 (21%), Positives = 94/221 (42%), Gaps = 22/221 (9%)
 Frame = +3

Query: 90  NSNSDGNLCRWGSTASGVSAV-SAVSAASSGAACGDGLDNGAASGRADRQASVLQQSQLP 266
           +S    ++   GS+ SG S++ S+ S+ SS ++  +   + + S  A    S +  S   
Sbjct: 87  SSEVSSSITSSGSSVSGSSSITSSGSSVSSSSSATESGSSASGSSSATESGSSVSGSSTS 146

Query: 267 TLAGLDAVEHGGASGPGAAAGEVMAAPVASSASLCRAGSSLLTPMQSCESG--------- 419
             +G  +    G+S  G+ +     +  + S+S   +GSS      + ESG         
Sbjct: 147 ITSGSSSATESGSSVSGSTSATESGSSASGSSSATESGSSASGSSSATESGSSVSGSSSA 206

Query: 420 -EVGAGEPGQAADQHEGALEAPP------TPTPPAIHRNAVTLTVSSACDGEDASSLSGA 578
            E G+   G ++    G+  + P      T +  +   +  ++T S    G  ASS SG+
Sbjct: 207 TESGSSVSGSSSATESGSASSVPSSSGSVTESGSSSSASESSITQSGTASGSSASSTSGS 266

Query: 579 -----SDVAGRLLASEAHLASGFDHTSATKRPATFTLEAKT 686
                S V+G   +S   ++S    ++++   A+ ++ + T
Sbjct: 267 VTQSGSSVSGSSASSAPGISSSIPQSTSSASTASGSITSGT 307

 Score = 41.2 bits (95), Expect = 0.017
 Identities = 45/192 (23%), Positives = 79/192 (40%), Gaps = 4/192 (2%)
 Frame = +3

Query: 66  VRSGMRMRNSNSD--GNLCRWGSTASGVSAVSAVSAASSGAACGDGLDNGAASGRAD--R 233
           + S + + +S SD   +L    S+++ VS+  A S +SS  +        + SG +    
Sbjct: 50  ISSSIELTSSTSDVSSSLTELTSSSTEVSSSIAPSTSSSEVSSSITSSGSSVSGSSSITS 109

Query: 234 QASVLQQSQLPTLAGLDAVEHGGASGPGAAAGEVMAAPVASSASLCRAGSSLLTPMQSCE 413
             S +  S   T +G  A     A+  G++      +  + S+S   +GSS+        
Sbjct: 110 SGSSVSSSSSATESGSSASGSSSATESGSSVSGSSTSITSGSSSATESGSSV-------- 161

Query: 414 SGEVGAGEPGQAADQHEGALEAPPTPTPPAIHRNAVTLTVSSACDGEDASSLSGASDVAG 593
           SG   A E G +A     A E+  + +  +    + +    S+   E  SS+SG+S    
Sbjct: 162 SGSTSATESGSSASGSSSATESGSSASGSSSATESGSSVSGSSSATESGSSVSGSSSATE 221

Query: 594 RLLASEAHLASG 629
              AS    +SG
Sbjct: 222 SGSASSVPSSSG 233

>dbj|BAB85832.1| cell wall protein Awa1p [Saccharomyces cerevisiae]
          Length = 1713

 Score = 48.5 bits (114), Expect = 1e-04
 Identities = 60/261 (22%), Positives = 106/261 (39%), Gaps = 41/261 (15%)
 Frame = +3

Query: 33   SAAAPTVRLGGVRSGMRMRNSNSDGNLCRWGSTASGVSAVSAV--------SAASSGAAC 188
            SA+  ++   G  SG  +  S++ G++ + GS+ SG SA SA         S +S+  A 
Sbjct: 496  SASESSITQSGTASGSSV--SSTSGSVTQSGSSVSGSSASSAPGISSSIPQSTSSASTAS 553

Query: 189  GD-----------GLDNGAASGRADRQASVLQQSQLPTLAGLDAVEHG------------ 299
            G            G  +   SG +   +S   +S         A E G            
Sbjct: 554  GSITSGTLTSITSGSSSATESGSSVSGSSSATESGSSVSGSTSATESGSSVSGSTSATES 613

Query: 300  GASGPGAAAGEVMAAPVASSASLCRAGSSLLTPMQSCESGEVGAGEPGQAADQHEGALEA 479
            G+S  G+++     + V+ S S   +GSS+     + ESG   +G    +A +   A   
Sbjct: 614  GSSASGSSSATESGSSVSGSTSATESGSSVSGSTSATESGSSASG--SSSATESGSASSV 671

Query: 480  PP-----TPTPPAIHRNAVTLTVSSACDGEDASSLSGA-----SDVAGRLLASEAHLASG 629
            P      T +  +   +  ++T S    G  ASS SG+     S V+G   +S   ++S 
Sbjct: 672  PSSSGSVTESGSSSSASESSITQSGTASGSSASSTSGSVTQSGSSVSGSSASSAPGISSS 731

Query: 630  FDHTSATKRPATFTLEAKTRT 692
               ++++   A+ ++ + T T
Sbjct: 732  IPQSTSSASTASGSITSGTLT 752

 Score = 47.4 bits (111), Expect = 2e-04
 Identities = 51/245 (20%), Positives = 100/245 (40%), Gaps = 14/245 (5%)
 Frame = +3

Query: 27   PCSAAAPTVRLGGVRSGMRMRNSNSDGNLCRWGSTASGVSAVSAVSAASSGAACGDGLDN 206
            P S ++ +   G + SG     ++   +    GS+ASG S  SA  + SS +      ++
Sbjct: 733  PQSTSSASTASGSITSGTLTSITSGSSSATESGSSASGSS--SATESGSSVSGSTSATES 790

Query: 207  GAASGRADRQASVLQQSQLPTLAGLDAVEHGGASGPGAAAGEVMAAPVASSASLCRAGSS 386
            G++                  ++G  +    G+S  G+++     + V+ S S   +GSS
Sbjct: 791  GSS------------------VSGSTSATESGSSASGSSSATESGSSVSGSTSATESGSS 832

Query: 387  LLTPMQSCESGEVGA--------GEPGQAADQHEGALEAPPTPTPPAIHRNAVTLTVS-S 539
                  + ESG   +         E G ++   E ++    T +  +    + ++T S S
Sbjct: 833  ASGSSSATESGSASSVPSSSGSVTESGSSSSASESSITQSGTASGSSASSTSGSVTQSGS 892

Query: 540  ACDGEDASSLSGA-----SDVAGRLLASEAHLASGFDHTSATKRPATFTLEAKTRTDFRQ 704
            +  G  ASS SG+     S V+G   +S   ++S    ++++   A+ ++ + T T    
Sbjct: 893  SVSGSSASSTSGSVTQSGSSVSGSSASSAPGISSSIPQSTSSASTASGSITSGTLTSITS 952

Query: 705  PPQDA 719
                A
Sbjct: 953  SASSA 957

 Score = 35.8 bits (81), Expect = 0.72
 Identities = 39/198 (19%), Positives = 80/198 (39%), Gaps = 9/198 (4%)
 Frame = +3

Query: 126 STASGVSAVSAVSAASSGAACGDGLDNGAASGRADR----QASVLQQSQLPTLAGLDAVE 293
           S+   +S+  ++SA S+  A    +   +    +      + S   +S     +G  +  
Sbjct: 375 SSDESISSTESLSATSTPLAVSSTVVTSSTDSVSPNIPFSEISSSPESSTAITSGSSSAT 434

Query: 294 HGGASGPGAAAGEVMAAPVASSASLCRAGSSLLTPMQSCESGEVGAGEPGQAADQHEGAL 473
             G+S  G+ +     +  + S+S   +GSS+        SG   A E G A+     + 
Sbjct: 435 ESGSSVSGSTSATESGSSASGSSSATESGSSV--------SGSTSATESGSASSVPSSS- 485

Query: 474 EAPPTPTPPAIHRNAVTLTVSSACDGEDASSLSGA-----SDVAGRLLASEAHLASGFDH 638
               T +  +   +  ++T S    G   SS SG+     S V+G   +S   ++S    
Sbjct: 486 -GSVTESGSSSSASESSITQSGTASGSSVSSTSGSVTQSGSSVSGSSASSAPGISSSIPQ 544

Query: 639 TSATKRPATFTLEAKTRT 692
           ++++   A+ ++ + T T
Sbjct: 545 STSSASTASGSITSGTLT 562

>ref|NP_459903.1| Fels-1 prophage; putative minor tail protein [Salmonella
           typhimurium LT2] gi|16419437|gb|AAL19862.1| putative
           Fels-1 prophage minor tail protein [phage Fels-1]
          Length = 790

 Score = 47.8 bits (112), Expect = 2e-04
 Identities = 52/205 (25%), Positives = 84/205 (40%), Gaps = 2/205 (0%)
 Frame = +3

Query: 87  RNSNSDGNLCRWGSTASGVSAVSAVSAASSGAACGDGLDNGAASGRADRQASVLQQSQLP 266
           RN+ + G       T++G +A SA +AAS+ A   D     AAS  A  ++S        
Sbjct: 122 RNATAAGQASEQAQTSAGQAAESATAAASA-AGAADASATQAASSAASAESS-------- 172

Query: 267 TLAGLDAVEHGGASGPGAAAGEVMAAPVASSASLCRAGSSLLTPMQSCESGEVGAGEPGQ 446
             AG    + G AS   A+A     A  AS A       +  T   + ++    AG+   
Sbjct: 173 --AGTATTKAGEASASAASADTARTAAAASEA-------AAKTSEANADASRTAAGDSAA 223

Query: 447 AADQHEGALE--APPTPTPPAIHRNAVTLTVSSACDGEDASSLSGASDVAGRLLASEAHL 620
           AA     A +  A          + + T   SSA D   +++ + AS+ A    A+EA  
Sbjct: 224 AAAASATAAQTSAERAGASETAAKTSETQAASSAGDAGASATAAAASEKAAAASAAEAKT 283

Query: 621 ASGFDHTSATKRPATFTLEAKTRTD 695
           +     TSA+   A+ T  + + ++
Sbjct: 284 SETNAATSASTAAASATAASSSASE 308

 Score = 43.1 bits (100), Expect = 0.005
 Identities = 48/192 (25%), Positives = 72/192 (37%), Gaps = 2/192 (1%)
 Frame = +3

Query: 126 STASGVSAVSAVSAASSGAACGDGLDNGAASGRADRQASVLQQSQLPTL-AGLDAVEHGG 302
           +T +  SA SA S+A +           AAS    R A+   ++   T  A  DA     
Sbjct: 159 ATQAASSAASAESSAGTATTKAGEASASAASADTARTAAAASEAAAKTSEANADASRTAA 218

Query: 303 ASGPGAAAGEVMAAPVASSASLCRAGSSLLTPMQSCES-GEVGAGEPGQAADQHEGALEA 479
                AAA    AA  ++  +     ++  +  Q+  S G+ GA     AA +   A  A
Sbjct: 219 GDSAAAAAASATAAQTSAERAGASETAAKTSETQAASSAGDAGASATAAAASEKAAAASA 278

Query: 480 PPTPTPPAIHRNAVTLTVSSACDGEDASSLSGASDVAGRLLASEAHLASGFDHTSATKRP 659
               T      NA T   ++A     AS  S AS+ +    AS+   +      +A    
Sbjct: 279 AEAKTS---ETNAATSASTAAASATAAS--SSASEASTHAAASDTSASLAAQSRAAAGES 333

Query: 660 ATFTLEAKTRTD 695
           AT   EA  R +
Sbjct: 334 ATRAEEAAKRAE 345

>ref|NP_189520.1| glycine-rich protein [Arabidopsis thaliana]
           gi|11994785|dbj|BAB03175.1| gene_id:T19N8.7~unknown
           protein [Arabidopsis thaliana]
          Length = 614

 Score = 47.8 bits (112), Expect = 2e-04
 Identities = 55/211 (26%), Positives = 76/211 (35%), Gaps = 14/211 (6%)
 Frame = +3

Query: 60  GGVRSGMRMRNSNSDGNLCRWGSTASGVSAVSAVSAASSGAACGDGLDNGAASGRADRQA 239
           GG  SG    +  S       G  ASG SA +   +AS+GAA G   + G  SG     +
Sbjct: 284 GGAASGAGAASGASAKTGGESGEAASGGSAETGGESASAGAASGGSAETGGESGSGGAAS 343

Query: 240 SVLQQSQLPTLAGLD----AVEHGG--ASGPGAAAGEVMAAPVASSASL--------CRA 377
                S   T  G      + E GG  ASG  A+ GE  +   ASS S+          +
Sbjct: 344 GGESASGGATSGGSPETGGSAETGGESASGGAASGGESASGGAASSGSVESGGESTGATS 403

Query: 378 GSSLLTPMQSCESGEVGAGEPGQAADQHEGALEAPPTPTPPAIHRNAVTLTVSSACDGED 557
           G S  T  +S   G    GE         G+ E     T   +         S+  +   
Sbjct: 404 GGSAETSDESASGGAASGGESASGGAASGGSAETGGESTSSGVASGG-----STGSESAS 458

Query: 558 ASSLSGASDVAGRLLASEAHLASGFDHTSAT 650
           A + SG S  A    A+     +G   ++ T
Sbjct: 459 AGAASGGSTEANGGAAAGGSTEAGSGTSTET 489

 Score = 47.8 bits (112), Expect = 2e-04
 Identities = 58/210 (27%), Positives = 79/210 (37%), Gaps = 12/210 (5%)
 Frame = +3

Query: 36  AAAPTVRLGGVRSGMRMRNSNSDGNLCRWGSTASGVSAVSAVSAASSG----AACGDGLD 203
           AAA     GG  +      S   G     G  ASG  A S  SA + G    AA G   +
Sbjct: 255 AAAGETASGGAAAADTSGGSAETGGESASGGAASGAGAASGASAKTGGESGEAASGGSAE 314

Query: 204 NGAASGRADRQASVLQQSQLPTLAGLDAVEHGGASGP-GAAAGEVMAAPVASSASLCRAG 380
            G  S  A               A   + E GG SG  GAA+G   A+  A+S      G
Sbjct: 315 TGGESASAG-------------AASGGSAETGGESGSGGAASGGESASGGATSGGSPETG 361

Query: 381 SSLLTPMQSCESGEVGAGEPGQAADQHEGALEAPPTPTPPAIHRNAVTLTVS----SACD 548
            S  T  +S   G    GE         G++E+    T      +A T   S    +A  
Sbjct: 362 GSAETGGESASGGAASGGESASGGAASSGSVESGGESTGATSGGSAETSDESASGGAASG 421

Query: 549 GEDAS---SLSGASDVAGRLLASEAHLASG 629
           GE AS   +  G+++  G   ++ + +ASG
Sbjct: 422 GESASGGAASGGSAETGGE--STSSGVASG 449

 Score = 45.8 bits (107), Expect = 7e-04
 Identities = 49/176 (27%), Positives = 71/176 (39%), Gaps = 3/176 (1%)
 Frame = +3

Query: 132 ASGVSAVSAVSAASSGAACGDGLDNGAASGRADRQASVLQQSQLPTLAGLDAVEHGGASG 311
           A+G  +  A SA S GAA G+    GAA+  AD      +        G ++   G ASG
Sbjct: 239 AAGADSGGAASADSGGAAAGETASGGAAA--ADTSGGSAE-------TGGESASGGAASG 289

Query: 312 PGAAAGEVMAAPVASSASLCRAGSSLLTPMQSCESGEVGAGEPGQAADQHE-GALEAPPT 488
            GAA+G   +A     +    +G S  T  +S  +G    G      +    GA     +
Sbjct: 290 AGAASG--ASAKTGGESGEAASGGSAETGGESASAGAASGGSAETGGESGSGGAASGGES 347

Query: 489 PTPPAIHRNAVTLTVSSACDGEDAS--SLSGASDVAGRLLASEAHLASGFDHTSAT 650
            +  A    +     S+   GE AS  + SG    +G   AS   + SG + T AT
Sbjct: 348 ASGGATSGGSPETGGSAETGGESASGGAASGGESASGG-AASSGSVESGGESTGAT 402
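
The "Frame" values in the alignments above can be checked against the query coordinates. The sketch below assumes the usual BLASTX convention: a plus-strand frame is counted from the first base of the query, and a minus-strand frame is counted from the last base of the query on the reverse complement. The function name `blastx_frame` is illustrative; the coordinates are taken from the report above, with a query length of 740.

```python
def blastx_frame(q_start, q_end, query_len):
    """Return the frame (+1..+3 or -1..-3) implied by a query span.

    q_start and q_end are the 1-based query coordinates as printed in the
    alignment; q_start > q_end means the hit is on the minus strand.
    Assumes the conventional BLASTX frame numbering.
    """
    if q_start <= q_end:
        return (q_start - 1) % 3 + 1
    # Minus strand: the offset is measured from the 3' end of the query.
    return -((query_len - q_start) % 3 + 1)


QUERY_LEN = 740

# First hit (Nannocystis exedens): Query 637 ... 164, reported Frame = -2.
print(blastx_frame(637, 164, QUERY_LEN))

# Second hit (Yol155cp), first alignment: Query 90 ... 686, reported Frame = +3.
print(blastx_frame(90, 686, QUERY_LEN))
```

Under this convention every plus-strand alignment in the report starts at a coordinate of the form 3k, which is consistent with the reported Frame = +3.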



EST assemble image


     clone        accession  position
1    MXL019h01_r  BP094228     1  498
2    HCL030h07_r  AV641276   361  740
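
The positions listed for the two clones can be used to see how they tile the contig. The sketch below assumes the two numbers in each row are 1-based inclusive start and end coordinates on the contig (the source does not state this explicitly); the helper names are illustrative.

```python
# Clone positions as listed in the table (assumed 1-based, inclusive).
clones = {
    "MXL019h01_r": (1, 498),
    "HCL030h07_r": (361, 740),
}


def overlap_length(a, b):
    """Number of positions shared by two inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


def covered_length(intervals):
    """Total number of positions covered by a set of inclusive intervals."""
    total = 0
    current_end = 0
    for start, end in sorted(intervals):
        if end <= current_end:
            continue  # fully contained in what is already counted
        total += end - max(start, current_end + 1) + 1
        current_end = end
    return total


shared = overlap_length(clones["MXL019h01_r"], clones["HCL030h07_r"])
covered = covered_length(clones.values())
print(shared, covered)  # 138 740
```

Under that assumption the two ESTs overlap over positions 361 to 498 and together span all 740 positions of the contig.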




Chlamydomonas reinhardtii
Kazusa DNA Research Institute