KCC000343A_c01

Fasta Sequence
>KCC000343A_C01 KCC000343A_c01
aggaccacaaggtgttctcggagaagtttggggaccgctacgtgcggttgattcaggtgt
cgcgcaaggagatgcaggccACGCTGGCGCTGCGCTTCGGTGGGGAGGGTGTGCTCAAAA
TGAAGGGCATCCCCTTCAAGGCGACCGCCATGGACGTGCGCAAGTTCTTTGCGAACTACA
AGATCAAACCGGAGGGCGTCAGCTTCATCATGCACGCGGACGGGCGGCCCACCGGCATGG
CGTTCATCGAGTTTGAGACGCCGCAGGAGGCGGTGCGTGCAATGGAGAAGGACCGCGCCA
AGTTCGGGCCGGAGTACGGCGACCGCTTCTGCATGCTGCAGCTGGTCGGCCGGCACGAGA
TGGAGAAGGTGACTCTGCAGCGCGAGAACGAGAACAACGACAACAAGCTGCTGAACGGCA
TCAACGTGCTGCAAGCGGCCGCGCTAGCCACGCAGGCGGCGAACAGCAACCCCGC
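
The record above can be checked against the BLASTX report that follows. The sketch below is illustrative only and is not part of the original record: it reads the FASTA text, confirms the 475-letter length reported for the query, and translates forward frame +3 with the standard genetic code. The helper names (read_single_fasta, translate) are hypothetical, and using the standard code for this nuclear sequence is an assumption.

```python
# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

FASTA = """>KCC000343A_C01 KCC000343A_c01
aggaccacaaggtgttctcggagaagtttggggaccgctacgtgcggttgattcaggtgt
cgcgcaaggagatgcaggccACGCTGGCGCTGCGCTTCGGTGGGGAGGGTGTGCTCAAAA
TGAAGGGCATCCCCTTCAAGGCGACCGCCATGGACGTGCGCAAGTTCTTTGCGAACTACA
AGATCAAACCGGAGGGCGTCAGCTTCATCATGCACGCGGACGGGCGGCCCACCGGCATGG
CGTTCATCGAGTTTGAGACGCCGCAGGAGGCGGTGCGTGCAATGGAGAAGGACCGCGCCA
AGTTCGGGCCGGAGTACGGCGACCGCTTCTGCATGCTGCAGCTGGTCGGCCGGCACGAGA
TGGAGAAGGTGACTCTGCAGCGCGAGAACGAGAACAACGACAACAAGCTGCTGAACGGCA
TCAACGTGCTGCAAGCGGCCGCGCTAGCCACGCAGGCGGCGAACAGCAACCCCGC
"""


def read_single_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    lines = [ln.strip() for ln in text.strip().splitlines()]
    header = lines[0][1:]              # drop the leading '>'
    seq = "".join(lines[1:]).upper()   # mixed case in the record is ignored
    return header, seq


def translate(seq, frame):
    """Translate a forward frame (1, 2 or 3); unknown codons become 'X'."""
    start = frame - 1
    end = len(seq) - (len(seq) - start) % 3   # trim the incomplete last codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(start, end, 3)
    )


header, sequence = read_single_fasta(FASTA)
protein = translate(sequence, 3)

# In frame +3 the codon starting at 1-based nucleotide n has index (n - 3) // 3.
first = (99 - 3) // 3
print(len(sequence))
print(protein[first:first + 14])
```

Nucleotide 99 is where several of the alignments below begin, so the printed peptide can be compared directly with the Query lines of the report.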


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000343A_C01 KCC000343A_c01
         (475 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_201402.2| heterogeneous nuclear ribonucleoprotein (hnRNP)...    67  1e-10
dbj|BAB10406.1| contains similarity to ribonucleoprotein F~gene_...    67  1e-10
ref|NP_650120.1| CG6946-PA [Drosophila melanogaster] gi|24646107...    66  2e-10
dbj|BAC40188.1| unnamed protein product [Mus musculus]                 65  3e-10
ref|NP_005511.1| heterogeneous nuclear ribonucleoprotein H1 [Hom...    65  3e-10
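
The one-line hit summaries above have a fixed shape: identifier, truncated description, bit score, E value. A minimal parsing sketch follows, assuming that the last two whitespace-separated fields are always the score and the E value and that the E value is written in a form Python's float() accepts. The names HIT_LINE and parse_hits are hypothetical.

```python
import re

SUMMARY = """\
ref|NP_201402.2| heterogeneous nuclear ribonucleoprotein (hnRNP)...    67  1e-10
dbj|BAB10406.1| contains similarity to ribonucleoprotein F~gene_...    67  1e-10
ref|NP_650120.1| CG6946-PA [Drosophila melanogaster] gi|24646107...    66  2e-10
dbj|BAC40188.1| unnamed protein product [Mus musculus]                 65  3e-10
ref|NP_005511.1| heterogeneous nuclear ribonucleoprotein H1 [Hom...    65  3e-10
"""

# Identifier, free-text description, then bit score and E value at line end.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+)\s+(?P<evalue>\S+)$"
)


def parse_hits(text):
    """Return a list of (identifier, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = HIT_LINE.match(line.rstrip())
        if match:
            hits.append(
                (match["id"], float(match["bits"]), float(match["evalue"]))
            )
    return hits


hits = parse_hits(SUMMARY)
for identifier, bits, evalue in hits:
    print(identifier, bits, evalue)
```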

>ref|NP_201402.2| heterogeneous nuclear ribonucleoprotein (hnRNP), putative
           [Arabidopsis thaliana]
          Length = 289

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 31/73 (42%), Positives = 44/73 (59%)
 Frame = +3

Query: 111 VLKMKGIPFKATAMDVRKFFANYKIKPEGVSFIMHADGRPTGMAFIEFETPQEAVRAMEK 290
           VLKM+G+P+      + +FF+ YK+    V  +   DG+ TG AF+EFET +EA RAM K
Sbjct: 201 VLKMRGLPYSVNKPQIIEFFSGYKVIQGRVQVVCRPDGKATGEAFVEFETGEEARRAMAK 260

Query: 291 DRAKFGPEYGDRF 329
           D+   G  Y + F
Sbjct: 261 DKMSIGSRYVELF 273

 Score = 45.4 bits (106), Expect = 3e-04
 Identities = 23/77 (29%), Positives = 42/77 (53%)
 Frame = +3

Query: 99  GGEGVLKMKGIPFKATAMDVRKFFANYKIKPEGVSFIMHADGRPTGMAFIEFETPQEAVR 278
           GG  V++++G+PF    +D+ +FFA   I       ++  +G+ +G AF+ F  P +   
Sbjct: 80  GGFPVVRLRGLPFNCADIDIFEFFAGLNIVD---VLLVSKNGKFSGEAFVVFAGPMQVEI 136

Query: 279 AMEKDRAKFGPEYGDRF 329
           A+++DR   G  Y + F
Sbjct: 137 ALQRDRHNMGRRYVEVF 153

>dbj|BAB10406.1| contains similarity to ribonucleoprotein F~gene_id:K2A18.8
           [Arabidopsis thaliana]
          Length = 248

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 31/73 (42%), Positives = 44/73 (59%)
 Frame = +3

Query: 111 VLKMKGIPFKATAMDVRKFFANYKIKPEGVSFIMHADGRPTGMAFIEFETPQEAVRAMEK 290
           VLKM+G+P+      + +FF+ YK+    V  +   DG+ TG AF+EFET +EA RAM K
Sbjct: 160 VLKMRGLPYSVNKPQIIEFFSGYKVIQGRVQVVCRPDGKATGEAFVEFETGEEARRAMAK 219

Query: 291 DRAKFGPEYGDRF 329
           D+   G  Y + F
Sbjct: 220 DKMSIGSRYVELF 232

 Score = 45.4 bits (106), Expect = 3e-04
 Identities = 23/77 (29%), Positives = 42/77 (53%)
 Frame = +3

Query: 99  GGEGVLKMKGIPFKATAMDVRKFFANYKIKPEGVSFIMHADGRPTGMAFIEFETPQEAVR 278
           GG  V++++G+PF    +D+ +FFA   I       ++  +G+ +G AF+ F  P +   
Sbjct: 39  GGFPVVRLRGLPFNCADIDIFEFFAGLNIVD---VLLVSKNGKFSGEAFVVFAGPMQVEI 95

Query: 279 AMEKDRAKFGPEYGDRF 329
           A+++DR   G  Y + F
Sbjct: 96  ALQRDRHNMGRRYVEVF 112

>ref|NP_650120.1| CG6946-PA [Drosophila melanogaster] gi|24646107|ref|NP_731639.1|
           CG6946-PB [Drosophila melanogaster]
           gi|7299517|gb|AAF54704.1| CG6946-PA [Drosophila
           melanogaster] gi|7299518|gb|AAF54705.1| CG6946-PB
           [Drosophila melanogaster] gi|19528177|gb|AAL90203.1|
           AT27789p [Drosophila melanogaster]
          Length = 586

 Score = 66.2 bits (160), Expect = 2e-10
 Identities = 35/112 (31%), Positives = 61/112 (54%), Gaps = 5/112 (4%)
 Frame = +3

Query: 9   KVFSEKFGDRYVRLIQVSRKEMQATLALRFGGEG---VLKMKGIPFKATAMDVRKFFANY 179
           K+     G RY+ +   + KE +  +  +  G G   V+K++G+P+  T   + +FF+  
Sbjct: 111 KLNKASMGHRYIEVFTATPKEAKEAMR-KISGHGTAFVVKLRGLPYAVTEQQIEEFFSGL 169

Query: 180 KIKP--EGVSFIMHADGRPTGMAFIEFETPQEAVRAMEKDRAKFGPEYGDRF 329
            IK   EG+ F+M   GR TG AF++FE+  +  +A+ ++R K G  Y + F
Sbjct: 170 DIKTDREGILFVMDRRGRATGEAFVQFESQDDTEQALGRNREKIGHRYIEIF 221

 Score = 38.1 bits (87), Expect = 0.053
 Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 6/83 (7%)
 Frame = +3

Query: 99  GGEG------VLKMKGIPFKATAMDVRKFFANYKIKPEGVSFIMHADGRPTGMAFIEFET 260
           GG G       + M+G+P+ +   DV KFF    I+P  V    +  G  +G A   F+T
Sbjct: 472 GGRGNDIEYYTIHMRGLPYTSFENDVFKFFE--PIRPANVRINYNKKGLHSGTADAYFDT 529

Query: 261 PQEAVRAMEKDRAKFGPEYGDRF 329
            +++  AM++ R + G  Y + F
Sbjct: 530 YEDSQVAMKRHREQMGSRYIELF 552

>dbj|BAC40188.1| unnamed protein product [Mus musculus]
          Length = 472

 Score = 65.5 bits (158), Expect = 3e-10
 Identities = 33/80 (41%), Positives = 54/80 (67%), Gaps = 3/80 (3%)
 Frame = +3

Query: 99  GGEG-VLKMKGIPFKATAMDVRKFFANYKIK--PEGVSFIMHADGRPTGMAFIEFETPQE 269
           GGEG V+K++G+P+  +A +V++FF++ KI+   +G+ FI   +GRP+G AF+E E+  E
Sbjct: 7   GGEGFVVKVRGLPWSCSADEVQRFFSDCKIQNGAQGIRFIYTREGRPSGEAFVELESEDE 66

Query: 270 AVRAMEKDRAKFGPEYGDRF 329
              A++KDR   G  Y + F
Sbjct: 67  VKLALKKDRETMGHRYVEVF 86

 Score = 58.9 bits (141), Expect = 3e-08
 Identities = 30/109 (27%), Positives = 59/109 (53%), Gaps = 6/109 (5%)
 Frame = +3

Query: 21  EKFGDRYVRLIQVSRKEMQATL------ALRFGGEGVLKMKGIPFKATAMDVRKFFANYK 182
           E  G RYV + + +  EM   L      +     +G ++++G+PF  +  ++ +FF+  +
Sbjct: 76  ETMGHRYVEVFKSNNVEMDWVLKHTGPNSPDTANDGFVRLRGLPFGCSKEEIVQFFSGLE 135

Query: 183 IKPEGVSFIMHADGRPTGMAFIEFETPQEAVRAMEKDRAKFGPEYGDRF 329
           I P G++  +   GR TG AF++F + + A +A++K + + G  Y + F
Sbjct: 136 IVPNGITLPVDFQGRSTGEAFVQFASQEIAEKALKKHKERIGHRYIEIF 184

 Score = 53.1 bits (126), Expect = 2e-06
 Identities = 28/70 (40%), Positives = 41/70 (58%)
 Frame = +3

Query: 120 MKGIPFKATAMDVRKFFANYKIKPEGVSFIMHADGRPTGMAFIEFETPQEAVRAMEKDRA 299
           M+G+P++AT  D+  FF+   + P  V   +  DGR TG A +EF T ++AV AM KD+A
Sbjct: 293 MRGLPYRATENDIYNFFS--PLNPVRVHIEIGPDGRVTGEADVEFATHEDAVAAMSKDKA 350

Query: 300 KFGPEYGDRF 329
                Y + F
Sbjct: 351 NMQHRYVELF 360

>ref|NP_005511.1| heterogeneous nuclear ribonucleoprotein H1 [Homo sapiens]
           gi|1710632|sp|P31943|ROH1_HUMAN Heterogeneous nuclear
           ribonucleoprotein H (hnRNP H) gi|2134669|pir||I39358
           heterogeneous nuclear ribonucleoprotein H - human
           gi|347314|gb|AAA91346.1| hnRNP H
           gi|12655001|gb|AAH01348.1| HNRPH1 protein [Homo sapiens]
          Length = 449

 Score = 65.5 bits (158), Expect = 3e-10
 Identities = 33/80 (41%), Positives = 54/80 (67%), Gaps = 3/80 (3%)
 Frame = +3

Query: 99  GGEG-VLKMKGIPFKATAMDVRKFFANYKIK--PEGVSFIMHADGRPTGMAFIEFETPQE 269
           GGEG V+K++G+P+  +A +V++FF++ KI+   +G+ FI   +GRP+G AF+E E+  E
Sbjct: 7   GGEGFVVKVRGLPWSCSADEVQRFFSDCKIQNGAQGIRFIYTREGRPSGEAFVELESEDE 66

Query: 270 AVRAMEKDRAKFGPEYGDRF 329
              A++KDR   G  Y + F
Sbjct: 67  VKLALKKDRETMGHRYVEVF 86

 Score = 58.9 bits (141), Expect = 3e-08
 Identities = 30/109 (27%), Positives = 59/109 (53%), Gaps = 6/109 (5%)
 Frame = +3

Query: 21  EKFGDRYVRLIQVSRKEMQATL------ALRFGGEGVLKMKGIPFKATAMDVRKFFANYK 182
           E  G RYV + + +  EM   L      +     +G ++++G+PF  +  ++ +FF+  +
Sbjct: 76  ETMGHRYVEVFKSNNVEMDWVLKHTGPNSPDTANDGFVRLRGLPFGCSKEEIVQFFSGLE 135

Query: 183 IKPEGVSFIMHADGRPTGMAFIEFETPQEAVRAMEKDRAKFGPEYGDRF 329
           I P G++  +   GR TG AF++F + + A +A++K + + G  Y + F
Sbjct: 136 IVPNGITLPVDFQGRSTGEAFVQFASQEIAEKALKKHKERIGHRYIEIF 184

 Score = 53.1 bits (126), Expect = 2e-06
 Identities = 28/70 (40%), Positives = 41/70 (58%)
 Frame = +3

Query: 120 MKGIPFKATAMDVRKFFANYKIKPEGVSFIMHADGRPTGMAFIEFETPQEAVRAMEKDRA 299
           M+G+P++AT  D+  FF+   + P  V   +  DGR TG A +EF T ++AV AM KD+A
Sbjct: 293 MRGLPYRATENDIYNFFS--PLNPVRVHIEIGPDGRVTGEADVEFATHEDAVAAMSKDKA 350

Query: 300 KFGPEYGDRF 329
                Y + F
Sbjct: 351 NMQHRYVELF 360



EST assemble image


No.  Clone      Accession  Start  End
1    CL20g10_r  AV394327   1      422
2    CL75d07_r  AV397043   114    475
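
As a rough illustration of what the assembly table implies, the sketch below computes how much of the 475-letter contig each position range covers. It assumes the two position values are 1-based, inclusive start and end coordinates on the contig, which the record does not state explicitly. The function name coverage_depth is hypothetical.

```python
# (clone, accession, start, end); coordinates assumed 1-based and inclusive.
ESTS = [
    ("CL20g10_r", "AV394327", 1, 422),
    ("CL75d07_r", "AV397043", 114, 475),
]
CONTIG_LENGTH = 475  # query length given in the BLASTX report


def coverage_depth(ests, length):
    """Return a list with the number of ESTs covering each contig position."""
    depth = [0] * length
    for _clone, _accession, start, end in ests:
        for pos in range(start, end + 1):
            depth[pos - 1] += 1
    return depth


depth = coverage_depth(ESTS, CONTIG_LENGTH)
covered = sum(1 for d in depth if d > 0)   # positions supported by any EST
overlap = sum(1 for d in depth if d > 1)   # positions supported by both ESTs
print(covered, overlap)
```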




Chlamydomonas reinhardtii
Kazusa DNA Research Institute