KCC002068A_c02

Fasta Sequence
>KCC002068A_C02 KCC002068A_c02
gtgcCTCTGCGTTGTTAGAAGGACAACCTTTCGACAAGTAGAGTAGCTTAGAAGTACCGA
AAGGAGTAGCCCTGGCGCAGACAGTGGACATTTCTCCTCCCGCGTTGCCCACAAAGTTGG
GACACGCGGCGGGCTGTTTGAACATGCACACTCAACTGCTGCAACGCCGCGCCGCTGCTG
CTTCGCCTGTCGGGACTGTGCCCAAATCAGCGGCGATGTTTTCTCTCTTTAAGTTTGGCA
AGAAGCGCTCGAACCAATATGTCGCGGCTTCAGCGCCTGCCCAAAGCACCTCCACCGAGC
TCATAGAAAAGACCAGCCAGCCCACGGTCGCGTCTGGCAGCATCAGCCACCAGGACCTGT
ACTCGGGGTCTACGCCTCTGCCTGCTGGCAAGCAGCTCAAGCTGCAGCTGGCCGTTTGTT
ATCTTCCGCACCCGGAGAAGGTGCACTATGGTGGCGAGGATGCGCACTTCATCTCCGACT
ACGGGGGCGGCATGATGGGCGTGGCCGATGGAGTTGGCGGGTGGCAGGAGTCTGGCGTGA
ACCCTGCCGACTACTCGCGGACTCTTATGCTGATGTCGCGCGCCTATCTTGAGGGCAACG
ACATCTTCCAGGAGCAAGCGGCTTCCCGGCATGGTGTGCTCATAGACCCGCGGGGCGCGC
TGGAGGCTGCGCACATGAACACCAAGGTGCCCGGCTCGGCCACGGCGTGCGTGATGCAGC
TGGACCAGGCCAACGGCGTGCTCGCAGCAGCGAACCTGGGCGACAGCGGCTTCCTGGTGA
TCCGTGACGGCAAGGAGCTGATCCGCTCCAAGCCGCTGCAGCACTATTTTGACTGCCCGC
TGCAGTTCGGCGCGTTCCCGGAGTTTGTGGAGGCGACAGACACGGCAGACATGGCCGACC
TGTACAGCATCACGCTGCGGCCCGGGGACGTCATCGTGGCGGGCACGGACGGGCTATGGG
ACAACTGCTACCTCAGTGAGATCA


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC002068A_C02 KCC002068A_c02
         (984 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_201473.1| expressed protein [Arabidopsis thaliana] gi|884...   151  1e-35
ref|NP_193391.2| expressed protein [Arabidopsis thaliana] gi|264...   146  6e-34
pir||E85184 hypothetical protein dl4315c [imported] - Arabidopsi...   146  6e-34
pir||T00581 hypothetical protein At2g30170 [imported] - Arabidop...   137  4e-31
ref|NP_565696.1| expressed protein [Arabidopsis thaliana] gi|138...   134  3e-30
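
The summary table above can be read programmatically. The following is a minimal sketch, assuming each hit line ends with the bit score and the E value as its last two whitespace-separated fields; the names `Hit` and `parse_hit_table` are illustrative and not part of BLAST.

```python
from typing import List, NamedTuple

# Hit lines copied verbatim from the summary table (descriptions are truncated by BLAST).
HIT_TABLE = """\
ref|NP_201473.1| expressed protein [Arabidopsis thaliana] gi|884...   151  1e-35
ref|NP_193391.2| expressed protein [Arabidopsis thaliana] gi|264...   146  6e-34
pir||E85184 hypothetical protein dl4315c [imported] - Arabidopsi...   146  6e-34
pir||T00581 hypothetical protein At2g30170 [imported] - Arabidop...   137  4e-31
ref|NP_565696.1| expressed protein [Arabidopsis thaliana] gi|138...   134  3e-30
"""


class Hit(NamedTuple):
    seq_id: str
    description: str
    score: int      # bit score
    evalue: float   # expectation value


def parse_hit_table(text: str) -> List[Hit]:
    """Parse one-line hit summaries; assumes score and E value are the last two fields."""
    hits = []
    for line in text.splitlines():
        if not line.strip():
            continue
        head, score, evalue = line.rsplit(None, 2)
        seq_id, _, description = head.partition(" ")
        # Some BLAST versions print E values such as "e-100" without a leading digit.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append(Hit(seq_id, description.strip(), int(score), float(evalue)))
    return hits


for hit in parse_hit_table(HIT_TABLE):
    print(hit.seq_id, hit.score, hit.evalue)
```

All five hits are Arabidopsis thaliana proteins, and the list is ordered by decreasing bit score, which here is also increasing E value.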

>ref|NP_201473.1| expressed protein [Arabidopsis thaliana] gi|8843730|dbj|BAA97278.1|
           contains similarity to unknown
           protein~emb|CAB46038.1~gene_id:MSN2.11 [Arabidopsis
           thaliana] gi|22531134|gb|AAM97071.1| putative protein
           [Arabidopsis thaliana] gi|23198040|gb|AAN15547.1|
           putative protein [Arabidopsis thaliana]
           gi|26449356|dbj|BAC41805.1| unknown protein [Arabidopsis
           thaliana]
          Length = 414

 Score =  151 bits (382), Expect = 1e-35
 Identities = 113/314 (35%), Positives = 146/314 (45%)
 Frame = +3

Query: 42  SSLEVPKGVALAQTVDISPPALPTKLGHAAGCLNMHTQLLQRRAAAASPVGTVPKSAAMF 221
           + L+  K  +   +  I+ P    +LG   G +        R     S V  + KS A+F
Sbjct: 71  NGLDFTKKRSSGGSFTINCPVASMRLGKRGGMMK------NRLVCHYSVVDPLEKSRALF 124

Query: 222 SLFKFGKKRSNQYVAASAPAQSTSTELIEKTSQPTVASGSISHQDLYSGSTPLPAGKQLK 401
                    S     +  PA   S+        P   + S+    L SGS          
Sbjct: 125 GTLSKSVHTSPMACFSVGPAHELSSLNGGSQESPPTTTTSLKSLRLVSGS---------- 174

Query: 402 LQLAVCYLPHPEKVHYGGEDAHFISDYGGGMMGVADGVGGWQESGVNPADYSRTLMLMSR 581
                CYLPHPEK   GGEDAHFI D     +GVADGVGGW E GVN   +SR LM  S 
Sbjct: 175 -----CYLPHPEKEATGGEDAHFICDEEQA-IGVADGVGGWAEVGVNAGLFSRELMSYSV 228

Query: 582 AYLEGNDIFQEQAASRHGVLIDPRGALEAAHMNTKVPGSATACVMQLDQANGVLAAANLG 761
           + +      QEQ     G  IDP   LE AH  TK  GS+TAC++ L      L A NLG
Sbjct: 229 SAI------QEQ---HKGSSIDPLVVLEKAHSQTKAKGSSTACIIVLKDKG--LHAINLG 277

Query: 762 DSGFLVIRDGKELIRSKPLQHYFDCPLQFGAFPEFVEATDTADMADLYSITLRPGDVIVA 941
           DSGF V+R+G  + +S   QH F+   Q     E   + D      +++I ++ GDVIVA
Sbjct: 278 DSGFTVVREGTTVFQSPVQQHGFNFTYQL----ESGNSADVPSSGQVFTIDVQSGDVIVA 333

Query: 942 GTDGLWDNCYLSEI 983
           GTDG++DN Y  EI
Sbjct: 334 GTDGVYDNLYNEEI 347
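
As a cross-check of the Frame = +3 alignment above, the sketch below translates query nucleotides 42 to 119 with the standard genetic code. It assumes 1-based, inclusive query coordinates as printed by BLASTX, and it embeds only the first 120 nt of the FASTA record. The names `frame_of` and `translate_range` are illustrative, not part of any BLAST tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 120 nt of KCC002068A_c02, copied from the FASTA record.
QUERY_PREFIX = (
    "gtgcCTCTGCGTTGTTAGAAGGACAACCTTTCGACAAGTAGAGTAGCTTAGAAGTACCGA"
    "AAGGAGTAGCCCTGGCGCAGACAGTGGACATTTCTCCTCCCGCGTTGCCCACAAAGTTGG"
)


def frame_of(start: int) -> int:
    """Forward reading frame (1, 2 or 3) of a 1-based start coordinate."""
    return (start - 1) % 3 + 1


def translate_range(seq: str, start: int, end: int) -> str:
    """Translate the 1-based inclusive range start..end; unknown codons become 'X'."""
    region = seq.upper()[start - 1:end]
    return "".join(
        CODON_TABLE.get(region[i:i + 3], "X")
        for i in range(0, len(region) - 2, 3)
    )


print(frame_of(42))                            # frame of the first aligned codon
print(translate_range(QUERY_PREFIX, 42, 119))  # first 26 aligned residues
```

The translated residues should agree with the start of the first Query line (SSLEVPKGVALAQTVDISPPALPTKL), and position 42 falls in frame 3, consistent with the reported frame.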

>ref|NP_193391.2| expressed protein [Arabidopsis thaliana]
           gi|26450942|dbj|BAC42578.1| unknown protein [Arabidopsis
           thaliana] gi|28950865|gb|AAO63356.1| At4g16580
           [Arabidopsis thaliana]
          Length = 300

 Score =  146 bits (368), Expect = 6e-34
 Identities = 92/198 (46%), Positives = 112/198 (56%)
 Frame = +3

Query: 390 KQLKLQLAVCYLPHPEKVHYGGEDAHFISDYGGGMMGVADGVGGWQESGVNPADYSRTLM 569
           K LKL    CYLPHP+K   GGEDAHFI       +GVADGVGGW E G++   YSR LM
Sbjct: 47  KPLKLVSGSCYLPHPDKEATGGEDAHFICAEEQA-LGVADGVGGWAELGIDAGYYSRELM 105

Query: 570 LMSRAYLEGNDIFQEQAASRHGVLIDPRGALEAAHMNTKVPGSATACVMQLDQANGVLAA 749
             S      N I  E   S     IDP   LE AH  TK  GS+TAC++ L   N  L A
Sbjct: 106 SNS-----VNAIQDEPKGS-----IDPARVLEKAHTCTKSQGSSTACIIAL--TNQGLHA 153

Query: 750 ANLGDSGFLVIRDGKELIRSKPLQHYFDCPLQFGAFPEFVEATDTADMADLYSITLRPGD 929
            NLGDSGF+V+R+G  + RS   QH F+   Q     E     D      ++++ + PGD
Sbjct: 154 INLGDSGFMVVREGHTVFRSPVQQHDFNFTYQL----ESGRNGDLPSSGQVFTVAVAPGD 209

Query: 930 VIVAGTDGLWDNCYLSEI 983
           VI+AGTDGL+DN Y +EI
Sbjct: 210 VIIAGTDGLFDNLYNNEI 227

>pir||E85184 hypothetical protein dl4315c [imported] - Arabidopsis thaliana
           gi|5302796|emb|CAB46038.1| hypothetical protein
           [Arabidopsis thaliana] gi|7268408|emb|CAB78700.1|
           hypothetical protein [Arabidopsis thaliana]
          Length = 335

 Score =  146 bits (368), Expect = 6e-34
 Identities = 92/198 (46%), Positives = 112/198 (56%)
 Frame = +3

Query: 390 KQLKLQLAVCYLPHPEKVHYGGEDAHFISDYGGGMMGVADGVGGWQESGVNPADYSRTLM 569
           K LKL    CYLPHP+K   GGEDAHFI       +GVADGVGGW E G++   YSR LM
Sbjct: 82  KPLKLVSGSCYLPHPDKEATGGEDAHFICAEEQA-LGVADGVGGWAELGIDAGYYSRELM 140

Query: 570 LMSRAYLEGNDIFQEQAASRHGVLIDPRGALEAAHMNTKVPGSATACVMQLDQANGVLAA 749
             S      N I  E   S     IDP   LE AH  TK  GS+TAC++ L   N  L A
Sbjct: 141 SNS-----VNAIQDEPKGS-----IDPARVLEKAHTCTKSQGSSTACIIAL--TNQGLHA 188

Query: 750 ANLGDSGFLVIRDGKELIRSKPLQHYFDCPLQFGAFPEFVEATDTADMADLYSITLRPGD 929
            NLGDSGF+V+R+G  + RS   QH F+   Q     E     D      ++++ + PGD
Sbjct: 189 INLGDSGFMVVREGHTVFRSPVQQHDFNFTYQL----ESGRNGDLPSSGQVFTVAVAPGD 244

Query: 930 VIVAGTDGLWDNCYLSEI 983
           VI+AGTDGL+DN Y +EI
Sbjct: 245 VIIAGTDGLFDNLYNNEI 262

>pir||T00581 hypothetical protein At2g30170 [imported] - Arabidopsis thaliana
          Length = 283

 Score =  137 bits (344), Expect = 4e-31
 Identities = 73/201 (36%), Positives = 111/201 (54%)
 Frame = +3

Query: 381 PAGKQLKLQLAVCYLPHPEKVHYGGEDAHFISDYGGGMMGVADGVGGWQESGVNPADYSR 560
           P   +L L + +  +PHP+KV  GGEDA F+S Y GG+M VADGV GW E  V+P+ +S+
Sbjct: 40  PLRPELSLSVGIHAIPHPDKVEKGGEDAFFVSSYRGGVMAVADGVSGWAEQDVDPSLFSK 99

Query: 561 TLMLMSRAYLEGNDIFQEQAASRHGVLIDPRGALEAAHMNTKVPGSATACVMQLDQANGV 740
            LM  +   ++  +           V  DP   ++ AH  T   GSAT  ++ + +  G+
Sbjct: 100 ELMANASRLVDDQE-----------VRYDPGFLIDKAHTATTSRGSATISILAMLEEVGI 148

Query: 741 LAAANLGDSGFLVIRDGKELIRSKPLQHYFDCPLQFGAFPEFVEATDTADMADLYSITLR 920
           L   N+GD G  ++R+G+ +  + P +HYFDCP Q  +      A    D +    + ++
Sbjct: 149 LKIGNVGDCGLKLLREGQIIFATAPQEHYFDCPYQLSSEG---SAQTYLDASQFSIVEVQ 205

Query: 921 PGDVIVAGTDGLWDNCYLSEI 983
            GDVIV G+DGL+DN +  EI
Sbjct: 206 KGDVIVMGSDGLFDNVFDHEI 226

>ref|NP_565696.1| expressed protein [Arabidopsis thaliana]
           gi|13878071|gb|AAK44113.1|AF370298_1 unknown protein
           [Arabidopsis thaliana] gi|17104663|gb|AAL34220.1|
           unknown protein [Arabidopsis thaliana]
           gi|20197099|gb|AAC16955.2| expressed protein
           [Arabidopsis thaliana]
          Length = 298

 Score =  134 bits (336), Expect = 3e-30
 Identities = 74/201 (36%), Positives = 111/201 (54%)
 Frame = +3

Query: 381 PAGKQLKLQLAVCYLPHPEKVHYGGEDAHFISDYGGGMMGVADGVGGWQESGVNPADYSR 560
           P   +L L + +  +PHP+KV  GGEDA F+S Y GG+M VADGV GW E  V+P+ +S+
Sbjct: 40  PLRPELSLSVGIHAIPHPDKVEKGGEDAFFVSSYRGGVMAVADGVSGWAEQDVDPSLFSK 99

Query: 561 TLMLMSRAYLEGNDIFQEQAASRHGVLIDPRGALEAAHMNTKVPGSATACVMQLDQANGV 740
            LM  +   ++  +           V  DP   ++ AH  T   GSAT  +  L++  G+
Sbjct: 100 ELMANASRLVDDQE-----------VRYDPGFLIDKAHTATTSRGSATIILAMLEEV-GI 147

Query: 741 LAAANLGDSGFLVIRDGKELIRSKPLQHYFDCPLQFGAFPEFVEATDTADMADLYSITLR 920
           L   N+GD G  ++R+G+ +  + P +HYFDCP Q  +      +  T   A    + ++
Sbjct: 148 LKIGNVGDCGLKLLREGQIIFATAPQEHYFDCPYQLSS----EGSAQTYLDASFSIVEVQ 203

Query: 921 PGDVIVAGTDGLWDNCYLSEI 983
            GDVIV G+DGL+DN +  EI
Sbjct: 204 KGDVIVMGSDGLFDNVFDHEI 224



EST assemble image


No.  Clone         Accession  Start  End
  1  HCL057e12_r   AV642768       1  499
  2  HC086e03_r    AV638445       5  423
  3  LCL051c06_r   AV629067       5  540
  4  LCL027e06_r   AV627489       7  386
  5  MXL037a08_r   BP095170      13  220
  6  MX220g02_r    BP090224      14  325
  7  LC092a12_r    AV625403      20  462
  8  LC001a01_r    AV625403      55  212
  9  HC060c09_r    AV636498     316  767
 10  MXL012e03_r   BP093715     515  994
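
The clone positions above can be summarised as a coverage check. The sketch below assumes that the two position values are 1-based, inclusive start and end coordinates on the assembly; the record itself does not state the coordinate convention. The function names are illustrative.

```python
# Rows copied from the clone table: number, clone, accession, start, end.
EST_ROWS = """\
1 HCL057e12_r AV642768 1 499
2 HC086e03_r AV638445 5 423
3 LCL051c06_r AV629067 5 540
4 LCL027e06_r AV627489 7 386
5 MXL037a08_r BP095170 13 220
6 MX220g02_r BP090224 14 325
7 LC092a12_r AV625403 20 462
8 LC001a01_r AV625403 55 212
9 HC060c09_r AV636498 316 767
10 MXL012e03_r BP093715 515 994
"""


def parse_rows(text):
    """Return (clone, start, end) tuples; lines without five fields are skipped."""
    spans = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) == 5:
            spans.append((fields[1], int(fields[3]), int(fields[4])))
    return spans


def merge_spans(spans):
    """Merge overlapping or directly adjacent intervals into contiguous blocks."""
    merged = []
    for start, end in sorted((s, e) for _, s, e in spans):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def max_depth(spans):
    """Largest number of ESTs covering any single position (sweep over start/end events)."""
    events = []
    for _, start, end in spans:
        events.append((start, 1))      # EST begins covering at start
        events.append((end + 1, -1))   # EST stops covering after end
    depth = best = 0
    for _, delta in sorted(events):
        depth += delta
        best = max(best, depth)
    return best


spans = parse_rows(EST_ROWS)
print(merge_spans(spans))
print(max_depth(spans))
```

Under that assumption the ten ESTs form one contiguous block, with the deepest coverage near the 5' end where eight clones overlap.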




Chlamydomonas reinhardtii
Kazusa DNA Research Institute