KCC000901A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000901A_C01 KCC000901A_c01
gcaccaggtCAACTCCTTTAATTCTCATAGCCGCTCCTCTCAATCCATATCTGAGTGTCG
GCTGCAGCTATGTGTGACTCGGCCCGTAAATAAAACTGTGCATTTCAGAGGGCTCTGCGC
TATGTAAATTCTTTTGTAGCACAAACTGTAATAATCTATAGCCAATTTGGGTTGGCTTGG
ATCACAGGCACGAGCTGGTATTCCATGCCGAGTCCCGCAACCCTTATTAAACACCAAGCA
GGCCCACGCGGCGATTTGATAAGCTACTCTATGCACTTAGCGACGTCACGAAGTGCCCTG
CTCGGCGGGTTGCACAGCTGTCGAGCCGCACGCTTTGTCGGCGCACAGGTTAAGGCGGGG
CACACTCAGGCCGCGCCGCGTGTAGGGGGCGCCGGCTGCCCCGCAAGAGGCGTCGCTACT
CGCGCGGGCGCTGGGTCTGAAGCGGAGACCGAGAGTGGGTACTGGACGCTGCAAAGCTAT
GTGCTGTACGAGAGCGAGGTGAAGAAATCAAAGTTCATCGTTCATGCCTGGCCGGTCAGC
TCGCCTGCCGAGGCGATGGACCTGATCAAGGGGGCGTCAGACCCCAGTGCCTCCCACAAC
TGCTTCGCCTACCGGATAGGCGACGAGTTCCGCTCCAGCGATGATGGCGAGCCAGGCGGC
ACAGCGGGGAAGCCCATCCAGACCGCAATTGATGGCGAGGGGCTTGAACGAGTGGCGGTG
CTGGTGACGCGTnTCTTCGGGGGCGTAAGCTGGGCGCGGCGGGCTGTCCGGGCGTA


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000901A_C01 KCC000901A_c01
         (776 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_885242.1| conserved hypothetical protein [Bordetella para...   118  1e-25
ref|NP_880428.1| conserved hypothetical protein [Bordetella pert...   118  1e-25
ref|NP_568034.1| expressed protein [Arabidopsis thaliana]             114  2e-24
ref|NP_849515.1| expressed protein [Arabidopsis thaliana] gi|278...   114  2e-24
gb|AAM63017.1| unknown [Arabidopsis thaliana]                         114  2e-24

>ref|NP_885242.1| conserved hypothetical protein [Bordetella parapertussis]
           gi|33602002|ref|NP_889562.1| conserved hypothetical
           protein [Bordetella bronchiseptica]
           gi|33574027|emb|CAE38350.1| conserved hypothetical
           protein [Bordetella parapertussis]
           gi|33576440|emb|CAE33518.1| conserved hypothetical
           protein [Bordetella bronchiseptica]
          Length = 195

 Score =  118 bits (295), Expect = 1e-25
 Identities = 56/94 (59%), Positives = 73/94 (77%)
 Frame = +1

Query: 466 TLQSYVLYESEVKKSKFIVHAWPVSSPAEAMDLIKGASDPSASHNCFAYRIGDEFRSSDD 645
           TL +   ++ E+KKS+F  +A PV + AEAM      SDP+A+HNC+AYRIG+E+R +DD
Sbjct: 4   TLAAACRHQEEIKKSRFAAYAAPVGTIAEAMAHFAAHSDPAATHNCWAYRIGNEYRFNDD 63

Query: 646 GEPGGTAGKPIQTAIDGEGLERVAVLVTRXFGGV 747
           GEPGGTAG+PI  AIDG+GL+RVAVLV R FGG+
Sbjct: 64  GEPGGTAGRPILQAIDGQGLDRVAVLVVRWFGGI 97

>ref|NP_880428.1| conserved hypothetical protein [Bordetella pertussis]
           gi|33572432|emb|CAE41999.1| conserved hypothetical
           protein [Bordetella pertussis]
          Length = 195

 Score =  118 bits (295), Expect = 1e-25
 Identities = 56/94 (59%), Positives = 73/94 (77%)
 Frame = +1

Query: 466 TLQSYVLYESEVKKSKFIVHAWPVSSPAEAMDLIKGASDPSASHNCFAYRIGDEFRSSDD 645
           TL +   ++ E+KKS+F  +A PV + AEAM      SDP+A+HNC+AYRIG+E+R +DD
Sbjct: 4   TLAAACRHQEEIKKSRFAAYAAPVGTIAEAMAHFAAHSDPAATHNCWAYRIGNEYRFNDD 63

Query: 646 GEPGGTAGKPIQTAIDGEGLERVAVLVTRXFGGV 747
           GEPGGTAG+PI  AIDG+GL+RVAVLV R FGG+
Sbjct: 64  GEPGGTAGRPILQAIDGQGLDRVAVLVVRWFGGI 97

>ref|NP_568034.1| expressed protein [Arabidopsis thaliana]
          Length = 234

 Score =  114 bits (285), Expect = 2e-24
 Identities = 67/139 (48%), Positives = 81/139 (58%), Gaps = 1/139 (0%)
 Frame = +1

Query: 334 FVGAQVKAGHTQAAPRVGGAGCPARGVATRAGAGSEAETESGYWTLQSYVLYESEVKKSK 513
           +VG  V A    A   VGG   P   +   +G  S A     + TL+  V  E E+KKSK
Sbjct: 2   WVGVPVAA----AVTTVGGRRIPV--LVAASGKRSMASNSGSFTTLKETVSVEKEIKKSK 55

Query: 514 FIVHAWPVSSPAEAMDLIKGASDPSASHNCFAYRIGD-EFRSSDDGEPGGTAGKPIQTAI 690
           FI  A P+SS   A   +    DP ASHNC+AY+IGD   R SDDGEP GTAGKPIQ+AI
Sbjct: 56  FIAIAGPISSEQSAQMFLSQVRDPRASHNCWAYKIGDHHHRCSDDGEPSGTAGKPIQSAI 115

Query: 691 DGEGLERVAVLVTRXFGGV 747
              GL+RV V+V R FGG+
Sbjct: 116 LSSGLDRVMVVVIRYFGGI 134

>ref|NP_849515.1| expressed protein [Arabidopsis thaliana] gi|27808512|gb|AAO24536.1|
           At4g38090 [Arabidopsis thaliana]
          Length = 189

 Score =  114 bits (285), Expect = 2e-24
 Identities = 67/139 (48%), Positives = 81/139 (58%), Gaps = 1/139 (0%)
 Frame = +1

Query: 334 FVGAQVKAGHTQAAPRVGGAGCPARGVATRAGAGSEAETESGYWTLQSYVLYESEVKKSK 513
           +VG  V A    A   VGG   P   +   +G  S A     + TL+  V  E E+KKSK
Sbjct: 2   WVGVPVAA----AVTTVGGRRIPV--LVAASGKRSMASNSGSFTTLKETVSVEKEIKKSK 55

Query: 514 FIVHAWPVSSPAEAMDLIKGASDPSASHNCFAYRIGD-EFRSSDDGEPGGTAGKPIQTAI 690
           FI  A P+SS   A   +    DP ASHNC+AY+IGD   R SDDGEP GTAGKPIQ+AI
Sbjct: 56  FIAIAGPISSEQSAQMFLSQVRDPRASHNCWAYKIGDHHHRCSDDGEPSGTAGKPIQSAI 115

Query: 691 DGEGLERVAVLVTRXFGGV 747
              GL+RV V+V R FGG+
Sbjct: 116 LSSGLDRVMVVVIRYFGGI 134

>gb|AAM63017.1| unknown [Arabidopsis thaliana]
          Length = 234

 Score =  114 bits (284), Expect = 2e-24
 Identities = 67/139 (48%), Positives = 81/139 (58%), Gaps = 1/139 (0%)
 Frame = +1

Query: 334 FVGAQVKAGHTQAAPRVGGAGCPARGVATRAGAGSEAETESGYWTLQSYVLYESEVKKSK 513
           +VG  V A    A   VGG   P   +   +G  S A     + TL+  V  E E+KKSK
Sbjct: 2   WVGVPVAA----AVTTVGGHRIPV--LVAASGKRSMASNSGSFTTLKETVSVEKEIKKSK 55

Query: 514 FIVHAWPVSSPAEAMDLIKGASDPSASHNCFAYRIGD-EFRSSDDGEPGGTAGKPIQTAI 690
           FI  A P+SS   A   +    DP ASHNC+AY+IGD   R SDDGEP GTAGKPIQ+AI
Sbjct: 56  FIAIAGPISSEQSAQMFLSQVRDPRASHNCWAYKIGDHHHRCSDDGEPSGTAGKPIQSAI 115

Query: 691 DGEGLERVAVLVTRXFGGV 747
              GL+RV V+V R FGG+
Sbjct: 116 LSSGLDRVMVVVIRYFGGI 134



EST assemble image


clone accession position
1 CM035c09_r AV388708 1 476
2 CM002c03_r AV397830 8 363
3 MX248e09_r BP092024 252 776




Chlamydomonas reinhardtii
Kazusa DNA Research Institute