KCC001266A_c01

Fasta Sequence
>KCC001266A_C01 KCC001266A_c01
GTTCTTTACCCCCGGTCATACAAACCACCAGCCAGCAAAGATGGCCGCCCTTATGCAGAA
GTCCGCGCTGTCGCGCCCGGCATGCTCCACCCGCAGCTCTCGCCGCGCTGTGGTCGTCCG
CGCCGCGGCTGACCGCAAGCTGTGGGCCCCCGGCGTTGTTGCCCCTGAGTACCTGAAGGG
CGACCTGGCCGGCGACTACGGCTGGGACCCTCTCGGCCTGGGCGCTGACCCCACCGCTCT
GAAGTGGTACCGCCAGTCGGAGCTGCAGCACGCTCGCTGGGCGATGCTGGGTGTTGCTGG
CGTGCTGGTTCAGGAGATCGTGAAGCCCGATGTGTACTTCTACGAGGCTGGCCTGCCCCA
GAACCTGCCCGAGCCCTTCACCAACATCAACATGGGTGGCCTGCTGGCTTGGGAGTTCAT
CCTGATGCACTGAGTTGAGGTCCGCCGCTGGCAGGACTACAAGAACTTCGGCTCGGTCAA
CGAGGACCCCATCTTCAAGGGCAACAAGGTGCCCAACCCGGAGATGGGCTACCCCGGCGG
CATCTTCGACCCCTTTGGCTTCTCCAAGGGTAACCTGAAGGAGCTGCAGACCAAGGAGAT
CAAGAACGGCCGCCTGGCCATGATCGCCTACATGGCCTTCATCCTGCAGGCCCAGGCCAC
CGGCAAGGGCCCCCTGGCCGCCCTGTCGGCCCACCTGTCGAACCCCTTCGGCAACAACAT
CCTGAAGAACATCGGCACTTGCACCGTGCCCCACTCCGTGGATGTCCAGGGCCTGACCAT
CCCCCTGACCTGCCTGTGGCCCGGCAGCCAGTAAGCCGCTTCCCCAGAGCAGCTTGCGCA
GCCGCCTCGGGCAGCCGCCGCTGCCCGCAGCAGCGGTGGCGACCGAGGCGGAGACACACG
CGCGCGCTTGGCGTGTTGCTGCACAGCGCATGCCGACTAGCCGCGCGCGCGTGGGCTTGG
TGCTAGATCGGCGCGGCGCCATTGTTTGAGTGCATCCCGAGCGGTCGCCGTCTGGATTTG
GGCTGCTGCTGTTGCACGGCAGGGGCAGTGTGCACCGGCGGCGGCGTAGCGGGGTGTCGG
CGTTTGCCCTAGGCGCCCAAGCGCTGACTGGAACTTTGACTTCCCGTGTACAATGGTGAC
TTGAGATGCTGCTACTTGGTGTATAGGTTCGGCTTTTGTGGGTGCTTGAGTTGGTGGTCT
TCAGAGTTTAGGACACGAGTTTTGGACAAGAGAGTGTGCATGGTGAGCGAAGCGCGCCTG
GAGACAGGGCTGCGCGTCCGCAGGTTGGTGTTGTGGCGTGGCGTGCGTGTGCGCCTTCAA
CCGACAGACTGAAGGCAGCACCACGTGTGATGAggcggctggtcgggggcgaaccagggg
gagggatgaggcccgaggcttactggcccagtgccgagaacgggtgtgtaacc


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001266A_C01 KCC001266A_c01
         (1433 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_190331.3| light-harvesting chlorophyll a/b binding protei...   196  7e-49
pir||S14305 chlorophyll a/b-binding protein (cab-11) - tomato         191  4e-47
pir||S31863 chlorophyll a/b-binding protein type 4, photosystem ...   190  5e-47
ref|NP_084540.1| cDNA sequence BC002118 [Mus musculus] gi|128053...   190  6e-47
pir||S31864 chlorophyll a/b-binding protein type 4, photosystem ...   189  1e-46

>ref|NP_190331.3| light-harvesting chlorophyll a/b binding protein [Arabidopsis
           thaliana] gi|115385|sp|P27521|CB24_ARATH Chlorophyll A-B
           binding protein 4, chloroplast precursor (LHCI type III
           CAB-4) (LHCP) gi|11277628|pir||T45707 CHLOROPHYLL A-B
           BINDING PROTEIN 4 PRECURSOR homolog - Arabidopsis
           thaliana gi|166646|gb|AAA32760.1| light-harvesting
           chlorophyll a/b binding protein
           gi|6522530|emb|CAB61973.1| CHLOROPHYLL A-B BINDING
           PROTEIN 4 PRECURSOR homolog [Arabidopsis thaliana]
           gi|20260362|gb|AAM13079.1| chlorophyll A-B binding
           protein 4 precursor homolog [Arabidopsis thaliana]
           gi|21554365|gb|AAM63472.1| chlorophyll a-b binding
           protein 4 precursor homolog [Arabidopsis thaliana]
           gi|23197770|gb|AAN15412.1| chlorophyll A-B binding
           protein 4 precursor homolog [Arabidopsis thaliana]
          Length = 251

 Score =  196 bits (499), Expect = 7e-49
 Identities = 105/221 (47%), Positives = 139/221 (62%), Gaps = 7/221 (3%)
 Frame = +2

Query: 86  STRSSRRAVVVRAAADRKLWAPGVVAPEYLKGDLAGDYGWDPLGLGADPTALKWYRQSEL 265
           S  SS +    +  A +  W PG+ +P+YL G LAGD G+DPLGL  DP  LKW+ Q+EL
Sbjct: 38  SIGSSAKTSSFKVEAKKGEWLPGLASPDYLTGSLAGDNGFDPLGLAEDPENLKWFVQAEL 97

Query: 266 QHARWAMLGVAGVLVQEIVK-------PDVYFYEAGLPQNLPEPFTNINMGGLLAWEFIL 424
            + RWAMLGVAG+L+ E+         P+  +Y+AG  Q      T      L   EFIL
Sbjct: 98  VNGRWAMLGVAGMLLPEVFTKIGIINVPE--WYDAGKEQYFASSST------LFVIEFIL 149

Query: 425 MH*VEVRRWQDYKNFGSVNEDPIFKGNKVPNPEMGYPGGIFDPFGFSKGNLKELQTKEIK 604
            H VE+RRWQD KN GSVN+DPIFK   +P  E+GYPGGIF+P  F+    +E + KE+ 
Sbjct: 150 FHYVEIRRWQDIKNPGSVNQDPIFKQYSLPKGEVGYPGGIFNPLNFAP--TQEAKEKELA 207

Query: 605 NGRLAMIAYMAFILQAQATGKGPLAALSAHLSNPFGNNILK 727
           NGRLAM+A++ F++Q   TGKGP   L  HLS+P+ N I++
Sbjct: 208 NGRLAMLAFLGFVVQHNVTGKGPFENLLQHLSDPWHNTIVQ 248
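
The query coordinates and "Frame = +2" reported in the alignment above can be checked by translating the contig directly. The following is a minimal Python sketch and not part of the original record. It assumes the standard genetic code and 1-based inclusive nucleotide coordinates on the forward strand. The names translate, reading_frame and CONTIG_PREFIX are illustrative, and CONTIG_PREFIX holds only the first 120 nucleotides of the sequence listed under "Fasta Sequence".

```python
# Sketch only: standard genetic code, forward strand, 1-based inclusive coordinates.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon table from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First 120 nucleotides of contig KCC001266A_c01 (first two sequence lines).
CONTIG_PREFIX = (
    "GTTCTTTACCCCCGGTCATACAAACCACCAGCCAGCAAAGATGGCCGCCCTTATGCAGAA"
    "GTCCGCGCTGTCGCGCCCGGCATGCTCCACCCGCAGCTCTCGCCGCGCTGTGGTCGTCCG"
)


def reading_frame(start: int) -> int:
    """Return the forward reading frame (1, 2 or 3) of a 1-based start position."""
    return (start - 1) % 3 + 1


def translate(dna: str, start: int, end: int) -> str:
    """Translate the 1-based inclusive range start..end; unknown codons become 'X'."""
    segment = dna[start - 1:end].upper()
    usable = len(segment) - len(segment) % 3  # drop a trailing partial codon
    return "".join(
        CODON_TABLE.get(segment[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    print(reading_frame(86))
    print(translate(CONTIG_PREFIX, 86, 118))
```

With these assumptions, position 86 falls in frame 2 and positions 86 to 118 translate to STRSSRRAVVV, the first residues of the query line in the alignment above. A '*' in the output marks a stop codon, as in the "MH*VEV" stretch of the query.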

>pir||S14305 chlorophyll a/b-binding protein (cab-11) - tomato
          Length = 251

 Score =  191 bits (484), Expect = 4e-47
 Identities = 104/221 (47%), Positives = 138/221 (62%), Gaps = 5/221 (2%)
 Frame = +2

Query: 86  STRSSRRAVVVRAAADRKLWAPGVVAPEYLKGDLAGDYGWDPLGLGADPTALKWYRQSEL 265
           ST SS  +  V A   +  W PG+ +P+YL G L GD G+DPLGL  DP  LKW+ Q+EL
Sbjct: 39  STSSSYNSFKVEAKKGQ--WLPGLASPDYLDGSLPGDNGFDPLGLVEDPENLKWFIQAEL 96

Query: 266 QHARWAMLGVAGVLVQEI-----VKPDVYFYEAGLPQNLPEPFTNINMGGLLAWEFILMH 430
            + RWAMLGVAG+L+ E+     +     +Y+AG  +      T      L   EFIL H
Sbjct: 97  VNGRWAMLGVAGMLLPEVFTSIGILNVPKWYDAGKSEYFASSST------LFVIEFILFH 150

Query: 431 *VEVRRWQDYKNFGSVNEDPIFKGNKVPNPEMGYPGGIFDPFGFSKGNLKELQTKEIKNG 610
            VE+RRWQD KN GSVN+DPIFK   +P  + GYPGGIF+P  F+    +E + KE+ NG
Sbjct: 151 YVEIRRWQDIKNPGSVNQDPIFKNYSLPPNKCGYPGGIFNPLNFAP--TEEAKEKELANG 208

Query: 611 RLAMIAYMAFILQAQATGKGPLAALSAHLSNPFGNNILKNI 733
           RLAM+A++ FI+Q   TGKGP   L  HLS+P+ N I++ +
Sbjct: 209 RLAMLAFLGFIVQHNVTGKGPFDNLLQHLSDPWHNTIIQTL 249

>pir||S31863 chlorophyll a/b-binding protein type 4, photosystem I - Scotch pine
           gi|22753|emb|CAA78932.1| Lhca4 protein,Type 4 protein of
           light-harvesting complex of photosystem I [Pinus
           sylvestris]
          Length = 251

 Score =  190 bits (483), Expect = 5e-47
 Identities = 105/237 (44%), Positives = 142/237 (59%), Gaps = 7/237 (2%)
 Frame = +2

Query: 44  AALMQKSALSRPACSTRSSRRAVVVRA--AADRKLWAPGVVAPEYLKGDLAGDYGWDPLG 217
           +A +  S++SR A   R        R+     +  W PG+ +P YL G L GD G+DPLG
Sbjct: 20  SAFLTGSSVSRLARRVRPGSNPAPARSFKVEAKGEWLPGLSSPSYLNGSLPGDNGFDPLG 79

Query: 218 LGADPTALKWYRQSELQHARWAMLGVAGVLVQEIVKP-----DVYFYEAGLPQNLPEPFT 382
           L  DP +LKWY Q+EL + RWAMLGVAG+L+ E++          +Y+AG  +      T
Sbjct: 80  LAEDPESLKWYVQAELVNGRWAMLGVAGMLIPEVLTSIGLINVPKWYDAGKVEYFASSST 139

Query: 383 NINMGGLLAWEFILMH*VEVRRWQDYKNFGSVNEDPIFKGNKVPNPEMGYPGGIFDPFGF 562
                 L   EFIL H VE+RRWQD K  GSVN+DP+FK   +P  E+GYPGGIF+P  F
Sbjct: 140 ------LFVIEFILFHYVELRRWQDIKYPGSVNQDPLFKQYSLPPNEVGYPGGIFNPLNF 193

Query: 563 SKGNLKELQTKEIKNGRLAMIAYMAFILQAQATGKGPLAALSAHLSNPFGNNILKNI 733
           S     E + KE+ NGRLAM+A++ F++Q   TGKGP   L  HLS+P+ N I++ +
Sbjct: 194 SPS--MEAKEKELANGRLAMLAFLGFVVQHNVTGKGPFDNLLQHLSDPWHNTIIQTL 248

>ref|NP_084540.1| cDNA sequence BC002118 [Mus musculus] gi|12805303|gb|AAH02118.1|
           CDNA sequence BC002118 [Mus musculus]
           gi|33438474|emb|CAE30280.1| chlorophyll a /b binding
           protein [Beta vulgaris]
          Length = 252

 Score =  190 bits (482), Expect = 6e-47
 Identities = 104/234 (44%), Positives = 142/234 (60%), Gaps = 5/234 (2%)
 Frame = +2

Query: 50  LMQKSALSRPACSTRSSRRAVVVRAAADRKLWAPGVVAPEYLKGDLAGDYGWDPLGLGAD 229
           L ++  +  P+ S+ +S     ++  A +  W PG+ +P YL G L GD G+DPL L  D
Sbjct: 30  LNKECGVRLPSTSSTNS-----LKVEAKKGEWLPGLASPAYLNGSLPGDNGFDPLALAED 84

Query: 230 PTALKWYRQSELQHARWAMLGVAGVLVQEI-----VKPDVYFYEAGLPQNLPEPFTNINM 394
           P  LKW+ Q+EL + RWAMLGVAG+L+ E+     +     +Y+AG  +      T    
Sbjct: 85  PENLKWFVQAELVNGRWAMLGVAGMLLPEVFTQIGIINVPKWYDAGKQEYFASSST---- 140

Query: 395 GGLLAWEFILMH*VEVRRWQDYKNFGSVNEDPIFKGNKVPNPEMGYPGGIFDPFGFSKGN 574
             L   EFIL H VE+RRWQD KN GSVN+DPIFK   +P  E+GYPGGIF+P  F+   
Sbjct: 141 --LFVIEFILFHYVEIRRWQDIKNPGSVNQDPIFKQYSLPPNEVGYPGGIFNPLNFAP-- 196

Query: 575 LKELQTKEIKNGRLAMIAYMAFILQAQATGKGPLAALSAHLSNPFGNNILKNIG 736
             E + KE+ NGRLAM+A++ FI+Q   TGKGP   L  HLS+P+ N I++  G
Sbjct: 197 TAEAKEKELANGRLAMLAFLGFIVQHNVTGKGPFDNLLQHLSDPWHNTIIQTFG 250

>pir||S31864 chlorophyll a/b-binding protein type 4, photosystem I - Scotch pine
           (fragment) gi|829287|emb|CAA78901.1| Lhca4 protein,Type
           4 protein of light-harvesting complex of photosystem I
           [Pinus sylvestris]
          Length = 244

 Score =  189 bits (480), Expect = 1e-46
 Identities = 98/202 (48%), Positives = 129/202 (63%), Gaps = 5/202 (2%)
 Frame = +2

Query: 143 WAPGVVAPEYLKGDLAGDYGWDPLGLGADPTALKWYRQSELQHARWAMLGVAGVLVQEIV 322
           W PG+ +P YL G L GD G+DPLGL  DP +LKWY Q+EL + RWAMLGVAG+L+ E++
Sbjct: 48  WLPGLSSPSYLNGSLPGDNGFDPLGLAEDPESLKWYVQAELVNGRWAMLGVAGMLIPEVL 107

Query: 323 KP-----DVYFYEAGLPQNLPEPFTNINMGGLLAWEFILMH*VEVRRWQDYKNFGSVNED 487
                     +Y+AG  +      T      L   EFIL H VE+RRWQD K  GSVN+D
Sbjct: 108 TSIGLINVPKWYDAGKVEYFASSST------LFVIEFILFHYVELRRWQDIKYPGSVNQD 161

Query: 488 PIFKGNKVPNPEMGYPGGIFDPFGFSKGNLKELQTKEIKNGRLAMIAYMAFILQAQATGK 667
           P+FK   +P  E+GYPGGIF+P  FS     E + KE+ NGRLAM+A++ F++Q   TGK
Sbjct: 162 PLFKQYSLPPNEVGYPGGIFNPLNFSPS--MEAKEKELANGRLAMLAFLGFVVQHNVTGK 219

Query: 668 GPLAALSAHLSNPFGNNILKNI 733
           GP   L  HLS+P+ N I++ +
Sbjct: 220 GPFDNLLQHLSDPWHNTIIQTL 241



EST assembly


no. clone accession start end
1 MX260h01_r BP092919 1 585
2 HC081c07_r AV638056 3 543
3 MX223g01_r BP090421 25 438
4 CM044f01_r AV389522 99 397
5 CM063h05_r AV390070 99 706
6 MX044f01_r BP087833 102 471
7 MX253b08_r BP092361 103 619
8 MX221a09_r BP090250 106 256
9 CM097d05_r AV393202 107 603
10 HC018b12_r AV633220 113 525
11 MX214d07_r BP089820 116 462
12 MX250h11_r BP092196 123 584
13 CM032d02_r AV388478 123 521
14 CM031h03_r AV388304 130 276
15 HC042b02_r AV635118 131 630
16 CM033c06_r AV388432 141 607
17 HC013e08_r AV632855 141 520
18 CM098e12_r AV392923 163 617
19 CM070a03_r AV391275 170 762
20 HC072d02_r AV637376 379 886
21 LC026f06_r AV620773 405 910
22 MXL062e08_r BP096658 426 691
23 HC077d08_r AV637781 431 936
24 HC066e01_r AV636945 431 866
25 CM091b04_r AV397535 445 733
26 LC056b02_r AV622887 449 942
27 MXL004f04_r BP093177 456 838
28 MX219c11_r BP090135 508 774
29 HC003b10_r AV632042 527 1033
30 HC056g10_r AV636233 694 1247
31 MXL080f12_r BP097749 901 1279
32 MXL058a03_r BP096387 948 1296
33 HC019e08_r AV633328 1032 1554
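
The table above can be used to estimate how many ESTs support a given contig position. The following is a minimal Python sketch and not part of the original record. It assumes each row is "number, clone, accession, start, end" separated by whitespace, and that start and end are 1-based inclusive positions. The names EstPlacement, parse_placements and depth_at are illustrative, and SAMPLE contains only the first three rows of the table.

```python
from typing import List, NamedTuple


class EstPlacement(NamedTuple):
    clone: str
    accession: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def parse_placements(text: str) -> List[EstPlacement]:
    """Parse rows of the form 'number clone accession start end'."""
    placements = []
    for line in text.strip().splitlines():
        fields = line.split()
        # Skip header lines and anything that is not a five-column data row.
        if len(fields) != 5 or not fields[0].isdigit():
            continue
        _, clone, accession, start, end = fields
        placements.append(EstPlacement(clone, accession, int(start), int(end)))
    return placements


def depth_at(placements: List[EstPlacement], position: int) -> int:
    """Count the ESTs whose placement covers the given position."""
    return sum(1 for p in placements if p.start <= position <= p.end)


# First three rows of the table, copied verbatim.
SAMPLE = """
1 MX260h01_r BP092919 1 585
2 HC081c07_r AV638056 3 543
3 MX223g01_r BP090421 25 438
"""

if __name__ == "__main__":
    rows = parse_placements(SAMPLE)
    for pos in (1, 30, 500):
        print(pos, depth_at(rows, pos))
```

Pasting the full table into SAMPLE gives the depth over all 33 clones. A simple per-position count is enough at this scale; an interval tree would only be worthwhile for much larger assemblies.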




Chlamydomonas reinhardtii
Kazusa DNA Research Institute