KCC000972A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000972A_C01 KCC000972A_c01
CCCATTCACAGCTACCAACTGAAAGCCAAGATGGGTCATGAGAAGTACCCTGACACGCAG
GAGGGATGTGTCGACTTCATGATTCCCAAGCCCGCCCAGAAAGCGGTCAAGGATGCTTAC
CAATTCGCCATCACATTGCAGTGCATCAACGCCGCGGAGGACGGCAGCAAGTCGTGGGAC
CACGGTATCTTCGACTGTATGGACAACATCCCGCTGTGTCTGGCCATTATGTTCTGCAAC
GGCTGGGGCTTGTGCATTTCCTACAGGAACATGCAGTACATGACCGGTGACAGCTGCGAG
GTGGCCTTCGTGAATGGCATGGTGGCCGGCTCGGTGTGTCTGGGCCCCTGCCACTATGCG
GTGGTGCGCGGCAACTTTCGCAAGAAGTACGGCCTCAAGGGTAGCCCGTGCCAGGACTGC
ATGTGCGGCTGCTGCCTCGGACCCTGCGTGCTGTGCAGTGACACCAACCAGCTGATGGTG
TCGGAGGGAATTGCCGTGCCCTTCCTCAATGGTCTCAACGCCAGCGGCGCGGGCGGCACC
AAGGTCAGCCCCGCCAAGTATTGTGGCGGCGGCGGCGGTGGTGCACGGGGCGACGGGTGA
TTGCATCGGTGTTGCTGGCAAAACTGTTGCCAGCTGCCTCATGCATTCTCGAGTCCTGAC
GCTGTCAAGGGTGTGGTGCCTCTG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000972A_C01 KCC000972A_c01
         (684 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAO39872.1| hypothetical protein [Oryza sativa (japonica cult...    63  5e-09
gb|AAO39879.1| hypothetical protein [Oryza sativa (japonica cult...    62  8e-09
gb|AAM08438.1| hypothetical protein [Dictyostelium discoideum]         52  6e-06
ref|NP_172940.1| expressed protein [Arabidopsis thaliana] gi|877...    50  3e-05
pir||T44768 antifreeze glycopeptide AFGP polyprotein precursor [...    50  4e-05

>gb|AAO39872.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
           gi|31249732|gb|AAP46224.1| hypothetical protein [Oryza
           sativa (japonica cultivar-group)]
          Length = 148

 Score = 62.8 bits (151), Expect = 5e-09
 Identities = 37/112 (33%), Positives = 52/112 (46%), Gaps = 7/112 (6%)
 Frame = +1

Query: 163 GSKSWDHGIFDCMDNIPLCLAIMFCNGWGLCISYRNMQYMT-------GDSCEVAFVNGM 321
           GS +W  G+FDC D+  LC   M C  W  CI++  +  +        G S  +  +  M
Sbjct: 15  GSAAWSSGLFDCFDDCGLCC--MTC--WCPCITFGRVAEIVDRGSTSCGASGALYALLAM 70

Query: 322 VAGSVCLGPCHYAVVRGNFRKKYGLKGSPCQDCMCGCCLGPCVLCSDTNQLM 477
           V G  C+  C Y   RG  R +YGL  + C DC   C    C LC +  +L+
Sbjct: 71  VTGCQCIYSCTY---RGKMRAQYGLADAACGDCCVHCWCESCALCQEYRELV 119

>gb|AAO39879.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
           gi|31249761|gb|AAP46253.1| hypothetical protein [Oryza
           sativa (japonica cultivar-group)]
          Length = 150

 Score = 62.0 bits (149), Expect = 8e-09
 Identities = 43/143 (30%), Positives = 60/143 (41%), Gaps = 8/143 (5%)
 Frame = +1

Query: 163 GSKSWDHGIFDCMDNIPLCLAIMFCNGWGLCISYRNMQYMTGDSCEVAFVNGMVAGSVC- 339
           GS +W  G+ DC D+  LC    +C     CI++  +  M           G + G +C 
Sbjct: 15  GSAAWSTGLCDCFDDCGLCCLTCWCP----CITFGRVAEMVDRGSTSCGTGGALYGLLCA 70

Query: 340 ------LGPCHYAVVRGNFRKKYGLKGSPCQDCMCGCCLGPCVLCSDTNQLMVSEGIAVP 501
                 +  C Y   RG  R +YGL  + C DC    C  PC LC +  +L V+ G    
Sbjct: 71  FTGCQWIYSCTY---RGKMRTQYGLAEAGCADCCVHFCCEPCALCQEYREL-VARGYDPK 126

Query: 502 FLNGLNASGAGGTKVSPA-KYCG 567
               LNA  A     +PA +Y G
Sbjct: 127 LGWHLNADRAAAAGAAPAVQYMG 149

>gb|AAM08438.1| hypothetical protein [Dictyostelium discoideum]
          Length = 109

 Score = 52.4 bits (124), Expect = 6e-06
 Identities = 34/114 (29%), Positives = 53/114 (45%), Gaps = 8/114 (7%)
 Frame = +1

Query: 172 SWDHGIFDCMDNIPLCLAIMFCNGWGLCISYR--NMQYMT------GDSCEVAFVNGMVA 327
           +W+HG+ DC  +I +C           CISY    +Q M       G  CE+      + 
Sbjct: 3   NWEHGLCDCTSDIRVC-----------CISYLWPQLQIMQQRATVEGRQCEIT---DCIF 48

Query: 328 GSVCLGPCHYAVVRGNFRKKYGLKGSPCQDCMCGCCLGPCVLCSDTNQLMVSEG 489
            ++C  PC   + R   R+K+G++GS   DC+  C    C LC+   Q+M  +G
Sbjct: 49  TALCF-PCVTCLTRSQIREKHGIEGSGVMDCLTVCY---CTLCTIHQQIMQLQG 98

>ref|NP_172940.1| expressed protein [Arabidopsis thaliana]
           gi|8778226|gb|AAF79235.1|AC006917_20 F10B6.27
           [Arabidopsis thaliana] gi|18252925|gb|AAL62389.1|
           unknown protein [Arabidopsis thaliana]
           gi|21389643|gb|AAM48020.1| unknown protein [Arabidopsis
           thaliana]
          Length = 152

 Score = 50.1 bits (118), Expect = 3e-05
 Identities = 34/118 (28%), Positives = 48/118 (39%), Gaps = 7/118 (5%)
 Frame = +1

Query: 139 QCINAAEDGSKSWDHGIFDCMDNIPLCLAIMFCNGWGLCISYRNMQYMT---GDSCEVA- 306
           Q ++A       W  G  DC  +   C    +C     CI++  +  +      SC  A 
Sbjct: 4   QHLHAKPHAEGEWSTGFCDCFSDCKNCCITFWCP----CITFGQVAEIVDRGSTSCGTAG 59

Query: 307 ---FVNGMVAGSVCLGPCHYAVVRGNFRKKYGLKGSPCQDCMCGCCLGPCVLCSDTNQ 471
               +  +V G  C+  C Y   RG  R +Y +KG  C DC+   C   C LCS T Q
Sbjct: 60  ALYALIAVVTGCACIYSCFY---RGKMRAQYNIKGDDCTDCLKHFC---CELCSLTQQ 111

>pir||T44768 antifreeze glycopeptide AFGP polyprotein precursor [imported] -
           Boreogadus saida gi|2078483|gb|AAC60129.1| antifreeze
           glycopeptide AFGP polyprotein precursor
          Length = 507

 Score = 49.7 bits (117), Expect = 4e-05
 Identities = 55/188 (29%), Positives = 73/188 (38%), Gaps = 14/188 (7%)
 Frame = +2

Query: 140 SASTPRRTAASRGTTVSSTVWTTSRCVWPLCSATAGACAFPTGTCST--*PVTAARWPS* 313
           +A+TP   A +     ++T  T +    P  +ATA   A    T +T   P  AAR    
Sbjct: 45  TAATPATAATAATEATAATAATPATAATPATAATAATTAATAATAATAATPARAAR---- 100

Query: 314 MAWWPARCVWAPATMRWCAATFARSTASRVARARTACAAA----ASDPACCAVTPTS*W- 478
            A  PA     PAT    A     +TA   ARA T   AA    A+ PA  A   T+   
Sbjct: 101 -AATPATAA-TPATAATAATAATAATAETPARAATPATAATPATAATPATAATAATAATS 158

Query: 479 ------CRRELPCPSSMVSTPAARA-APRSAPPSIVAAAAVVHGATGDCIGVAGKTVASC 637
                  R   P  ++  +TPA  A A R+A P+  A AA    A          T A+ 
Sbjct: 159 ATAATAARAATPATAATPATPATAARAARAATPATAATAATAATAATAATAATAATAATP 218

Query: 638 LMHSRVLT 661
              +R  T
Sbjct: 219 ARAARAAT 226

 Score = 49.3 bits (116), Expect = 5e-05
 Identities = 50/179 (27%), Positives = 74/179 (40%), Gaps = 10/179 (5%)
 Frame = +2

Query: 71  STS*FPSPPRKRSRMLTNSPSHCSASTPRRTAASRGTTVSSTVWTTSRCVWPLCSATAGA 250
           +T+  P+ P   +   T + +  +A+TP R A +     ++T  T +       +ATA  
Sbjct: 306 ATAATPATPATAATAATAATA-ATAATPARAARAATPATAATPATAATAATAATAATAAT 364

Query: 251 CAFPT--------GTCST*PVTAARWPS*MAWWPARCVWA--PATMRWCAATFARSTASR 400
            A P          T +T    A    +  A  PAR   A  PAT    A     +TA+ 
Sbjct: 365 AATPARAARAATPATAATAATAATAATAATAATPARAARAATPATPATPATPATPATAAT 424

Query: 401 VARARTACAAAASDPACCAVTPTS*WCRRELPCPSSMVSTPAARAAPRSAPPSIVAAAA 577
            A A TA  AA +  A  A T  +       P  ++  +TPA  A P +AP +  AA A
Sbjct: 425 AATAATAATAATAATAATAATAPT-------PARAARAATPATGATPATAPTAGTAATA 476

 Score = 47.8 bits (112), Expect = 2e-04
 Identities = 51/191 (26%), Positives = 76/191 (39%), Gaps = 2/191 (1%)
 Frame = +2

Query: 68  VSTS*FPSPPRKRSRMLTNSPSHCSASTPRRTAASRGTTVSSTVWTTSRCVWPLCSATAG 247
           ++T+  P+ P   +   T++ +  +A+TP R A       ++T  T +       +ATA 
Sbjct: 269 LATAATPATPATPATAATDATA-ATAATPARAATPATPATAATPATPATAATAATAATAA 327

Query: 248 ACAFPTGTC-ST*PVTAARWPS*MAWWPARCVWAPATMRWCAATFARSTASRVARARTAC 424
             A P     +  P TAA         PA    A AT    A     +T +R ARA T  
Sbjct: 328 TAATPARAARAATPATAAT--------PATAATA-ATAATAATAATAATPARAARAATPA 378

Query: 425 AAAASDPACCAVTPTS*WCRRELPCPSSMVSTPAARAAPRS-APPSIVAAAAVVHGATGD 601
            AA +  A  A T  +       P  ++  +TPA  A P + A P+  A AA    A   
Sbjct: 379 TAATAATAATAATAAT----AATPARAARAATPATPATPATPATPATAATAATAATAATA 434

Query: 602 CIGVAGKTVAS 634
                  T A+
Sbjct: 435 ATAATAATAAT 445

 Score = 41.2 bits (95), Expect = 0.015
 Identities = 50/192 (26%), Positives = 70/192 (36%), Gaps = 29/192 (15%)
 Frame = +2

Query: 89  SPPRKRSRMLTNSPSHCSASTPRRTAASRGTTVSSTVWTTSRCVWPLCSATAG------- 247
           +P R  +     +P+  +A+TP   A +     S+T  T +R   P  +AT         
Sbjct: 127 TPARAATPATAATPA--TAATPATAATAATAATSATAATAARAATPATAATPATPATAAR 184

Query: 248 -------ACAFPTGTCST*PVTAARWPS*MAWWPARCVWA--------------PATMRW 364
                  A A    T +T    A    +  A  PAR   A              PAT   
Sbjct: 185 AARAATPATAATAATAATAATAATAATAATAATPARAARAATPATAPTPATAATPATAAT 244

Query: 365 CAATFARSTASRVARARTACAAAASDPACCAVTPTS*WCRRELPCPSSMVSTPAARAAP- 541
            A     +T +R ARA T   AA    A    TP +       P  ++  +T A  A P 
Sbjct: 245 AATAPTAATPARAARAATPATAATLATAATPATPAT-------PATAATDATAATAATPA 297

Query: 542 RSAPPSIVAAAA 577
           R+A P+  A AA
Sbjct: 298 RAATPATPATAA 309



EST assemble image


clone accession position
1 MX059d12_r BP088399 1 339
2 LC023e08_r AV620546 1 435
3 CM007f08_r AV386837 4 351
4 MX016d09_r BP086772 4 457
5 CM088a09_r AV393043 9 507
6 CM012d02_r AV387181 159 684




Chlamydomonas reinhardtii
Kazusa DNA Research Institute