KCC001303A_c01

Fasta Sequence
>KCC001303A_C01 KCC001303A_c01
atcactccATCCTTTCATATTGTCGCGACTCCTCTGCACCTTTCCCGGTTGCACACCATG
GCCCTCACCATGCGTCGCGCGACTGTCGCCCGCCCGGCTGTGTCAAGCCGCACTCGCACC
GTCACAGTCCAGGCTTCTGCTTCCAAGCACATGGGCGCTGGCGTAGCTGCTGTGGCTCTG
GCCGCGACCATGTCACTTGCGGGCCCCGCCCTGGCCGATCTGAACGCTTACGAGGCCGCC
ACAGGTGGCGAGTTCGGCATCGGCTCTGCCATGCAGTACGGCGAGGCGGACATCCAGGGC
AGGGACTTCTCCAACCAGGACCTGCGCCGCTCCAACTTCACCTCCGCCGACTGCCGCAAC
GCCACCTTCAAGGGCTCCAACCTGCAGGGCGCCTACTTCATCAAGGCCGTTACCTACCGC
ACCAACTTTGAGGATGCCAACCTGTCTGACGTGCTGATGGACCGCGCCACAATGGTGGAG
GCCAACCTGAAGAACGCCATCCTTCAGCGCACCGTGTTCACGCGCTCCGACCTCAAGGAC
GCCGTCATCGAGGG
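
A minimal sketch of how a record like the one above could be read programmatically, using only plain Python. The function name `read_fasta` is hypothetical, and the example input is abbreviated to the first and last sequence lines of this entry, so the length it yields is that of the abbreviated text, not of the full 554-letter contig.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence lines are joined without separators
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Abbreviated example: header plus the first and last sequence lines only.
example = """>KCC001303A_C01 KCC001303A_c01
atcactccATCCTTTCATATTGTCGCGACTCCTCTGCACCTTTCCCGGTTGCACACCATG
GCCGTCATCGAGGG
"""

records = read_fasta(example)
header, sequence = records[0]
print(header, len(sequence))
```

Run on the complete entry, the same function should report the sequence length that the BLASTX report gives for the query.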


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001303A_C01 KCC001303A_c01
         (554 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAK59627.1| unknown protein [Arabidopsis thaliana]                 141  7e-33
ref|NP_563902.1| chloroplast lumen pentapeptide protein, putativ...   141  7e-33
pir||F86257 Hypothetical protein [imported] - Arabidopsis thalia...   139  3e-32
ref|NP_484230.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...    82  6e-15
ref|ZP_00110510.1| COG1357: Uncharacterized low-complexity prote...    80  2e-14

>gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
          Length = 280

 Score =  141 bits (355), Expect = 7e-33
 Identities = 83/185 (44%), Positives = 108/185 (57%), Gaps = 4/185 (2%)
 Frame = +1

Query: 10  SFHIVATPLHLSRLHTMALTMRRATVARPAVSSRTRTVTVQASASKH----MGAGVAAVA 177
           S  +  +P H  R     L +   +      SS TR     ++ S      + A +AA  
Sbjct: 20  SSSVSRSPYHFQRYLLRRLQLSSRSNLEIKDSSNTREGCCSSAESNKWKRILSAAMAAAV 79

Query: 178 LAATMSLAGPALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCR 357
           +A++  +  PA+A+LN +EA T GEFGIGSA QYG AD+     SN++ RR+NFTSAD R
Sbjct: 80  IASSSGV--PAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTSADMR 137

Query: 358 NATFKGSNLQGAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLK 537
            + F GS   GAY  KAV Y+ NF  A+LSD LMDR  + EANL NA+L R+V TRSDL 
Sbjct: 138 ESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLG 197

Query: 538 DAVIE 552
            A IE
Sbjct: 198 GAKIE 202
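
The Query lines in the alignment above are a conceptual translation of the nucleotide sequence in frame +1, beginning at nucleotide 10. The sketch below shows one way to reproduce the start of that translation. It assumes the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical, and only the first line of the FASTA sequence is used as input.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, start=1):
    """Translate DNA from a 1-based start position; unknown codons become 'X'."""
    seq = dna.upper()[start - 1:]
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 60 nucleotides of KCC001303A_c01, as listed in the FASTA section.
first_line = "atcactccATCCTTTCATATTGTCGCGACTCCTCTGCACCTTTCCCGGTTGCACACCATG"

peptide = translate(first_line, start=10)
print(peptide)  # SFHIVATPLHLSRLHTM
```

The printed peptide matches the first 17 residues of the Query line that starts at position 10.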

>ref|NP_563902.1| chloroplast lumen pentapeptide protein, putative [Arabidopsis
           thaliana] gi|23297125|gb|AAN13098.1| unknown protein
           [Arabidopsis thaliana]
          Length = 280

 Score =  141 bits (355), Expect = 7e-33
 Identities = 83/185 (44%), Positives = 109/185 (58%), Gaps = 4/185 (2%)
 Frame = +1

Query: 10  SFHIVATPLHLSRLHTMALTMRRATVARPAVSSRTRTVTVQASAS----KHMGAGVAAVA 177
           S  +  +P H  R     L +   +      SS TR     ++ S    + + A +AA  
Sbjct: 20  SSSVSRSPYHFQRYLLRRLQLSSRSNLEIKDSSNTREGCCSSAESNTWKRILSAAMAAAV 79

Query: 178 LAATMSLAGPALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCR 357
           +A++  +  PA+A+LN +EA T GEFGIGSA QYG AD+     SN++ RR+NFTSAD R
Sbjct: 80  IASSSGV--PAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTSADMR 137

Query: 358 NATFKGSNLQGAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLK 537
            + F GS   GAY  KAV Y+ NF  A+LSD LMDR  + EANL NA+L R+V TRSDL 
Sbjct: 138 ESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLG 197

Query: 538 DAVIE 552
            A IE
Sbjct: 198 GAKIE 202

>pir||F86257 Hypothetical protein [imported] - Arabidopsis thaliana
           gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical
           protein [Arabidopsis thaliana]
          Length = 293

 Score =  139 bits (350), Expect = 3e-32
 Identities = 81/183 (44%), Positives = 109/183 (59%), Gaps = 2/183 (1%)
 Frame = +1

Query: 10  SFHIVATPLHLSR--LHTMALTMRRATVARPAVSSRTRTVTVQASASKHMGAGVAAVALA 183
           S  +  +P H  R  L  + L+ R    +   +   + T     +  + + A +AA  +A
Sbjct: 10  SSSVSRSPYHFQRYLLRRLQLSSR----SNLEIKDSSNTSAESNTWKRILSAAMAAAVIA 65

Query: 184 ATMSLAGPALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNA 363
           ++  +  PA+A+LN +EA T GEFGIGSA QYG AD+     SN++ RR+NFTSAD R +
Sbjct: 66  SSSGV--PAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTSADMRES 123

Query: 364 TFKGSNLQGAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLKDA 543
            F GS   GAY  KAV Y+ NF  A+LSD LMDR  + EANL NA+L R+V TRSDL  A
Sbjct: 124 DFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGA 183

Query: 544 VIE 552
            IE
Sbjct: 184 KIE 186

>ref|NP_484230.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25354788|pir||AB1830
           hypothetical protein all0186 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17135164|dbj|BAB77710.1|
           ORF_ID:all0186~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 168

 Score = 81.6 bits (200), Expect = 6e-15
 Identities = 41/95 (43%), Positives = 56/95 (58%)
 Frame = +1

Query: 265 SAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQGAYFIKAVTYRTNFEDANL 444
           + + Y  A+++ RDF+N DL   NF +A+ R   F+G+NL  A   K V  + N  +ANL
Sbjct: 34  NTINYNNANLENRDFANADLVGVNFVAAEMRGTNFQGANLTNAILTKGVLLKANLSEANL 93

Query: 445 SDVLMDRATMVEANLKNAILQRTVFTRSDLKDAVI 549
           +  L+DRAT+  ANLKNAI      TRS   DA I
Sbjct: 94  TGALVDRATLDNANLKNAIFTEATLTRSRFYDADI 128

>ref|ZP_00110510.1| COG1357: Uncharacterized low-complexity proteins [Nostoc
           punctiforme]
          Length = 168

 Score = 80.1 bits (196), Expect = 2e-14
 Identities = 42/95 (44%), Positives = 53/95 (55%)
 Frame = +1

Query: 265 SAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQGAYFIKAVTYRTNFEDANL 444
           + + Y   +++ RDFSN DL    F +A+ R   F+G+NL  A   K V  + N E ANL
Sbjct: 34  NTINYNNINLENRDFSNADLAGVTFVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANL 93

Query: 445 SDVLMDRATMVEANLKNAILQRTVFTRSDLKDAVI 549
           S  L+DR TM  ANLKNAI      TRS   DA I
Sbjct: 94  SGALVDRVTMDGANLKNAIFTEATLTRSRFFDAEI 128



EST assembly


no.  clone       accession  start  end
1    MX064b09_r  BP088581   1      307
2    CM035h08_r  AV388756   2      553
3    HC020f08_r  AV633426   16     486
4    HC035b10_r  AV634578   16     466
5    HC024g04_r  AV633746   58     554
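
The positions listed for each clone can be turned into a per-base coverage profile of the assembled contig. The following is a sketch under two assumptions: the two numbers given for each clone are 1-based, inclusive start and end coordinates on the contig, and the contig length equals the largest end coordinate. The function name `coverage_depth` is hypothetical.

```python
# (clone, accession, start, end) as listed in the table above.
ests = [
    ("MX064b09_r", "BP088581", 1, 307),
    ("CM035h08_r", "AV388756", 2, 553),
    ("HC020f08_r", "AV633426", 16, 486),
    ("HC035b10_r", "AV634578", 16, 466),
    ("HC024g04_r", "AV633746", 58, 554),
]


def coverage_depth(spans, length):
    """Return a list with the number of spans covering each 1-based position."""
    depth = [0] * length
    for start, end in spans:
        for pos in range(start, end + 1):
            depth[pos - 1] += 1
    return depth


# Assumed contig length: the largest end coordinate among the clones.
contig_length = max(end for _, _, _, end in ests)
depth = coverage_depth([(s, e) for _, _, s, e in ests], contig_length)

print("contig length:", contig_length)
print("minimum depth:", min(depth))
print("maximum depth:", max(depth))
print("positions covered by all clones:", depth.count(len(ests)))
```

Under these assumptions, every position of the contig is covered by at least one EST, and positions 58 to 307 are covered by all five.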




Chlamydomonas reinhardtii
Kazusa DNA Research Institute