KCC000704A_c01

Fasta Sequence
>KCC000704A_C01 KCC000704A_c01
cgggacgaggacgacgacatgggccgcggccggccgcagctggctaaggagctGTCATAC
TTTGAGCGCGCCAAGCAGCGCCTGCGCAACAAGGACGCGTACAACGACATGCTCAAGTGC
CTCAGCATGTACACGCAGGAGATTATCAGCCGCCAGGAGCTGCTGCAGCTGCTCAACGAC
ATTATCGGCCGCTTCCCGGACCTCATGTCCGGGTTCCATGAGTTCATGGCCAAGTGCGAC
CTCATGGAGATCGACGAGAACAAGGTGCGCGGCCTGGCGGCGCAGAACCTGCAGCGCCAG
CGCCAGCGCGAGGAGGAGCTGCAGCGGCAGAAGTACCTCACCAAGCCGCTCAGCGAGATC
GTTTCGGGCGACACGGAGCGCGTGACGCCGTCGTACGTGAAGATGCCGCCCGGCTACCCG
CACCTGGTGTCCACCGGCCGCACGCCGCTGGCAGACAGCGTGCTGAACTCGGAGTTTGTG
AACGTCATCACGGGCAGCGAGGACTACTCATTTAAGCTGATG
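
The alignments in the Nr search section all report Frame = +1, and their Query coordinates count nucleotides rather than amino acids. The following is a minimal sketch for checking that against the sequence above; it is not part of the original record. It translates the first 120 nucleotides in frame +1 using the standard genetic code. The helper name `translate` and the hard-coded codon table are assumptions made for illustration.

```python
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, frame=1):
    """Translate a DNA string in forward frame 1, 2 or 3 (hypothetical helper)."""
    seq = dna.upper()[frame - 1:]
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    # Codons with ambiguous bases are reported as X.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 120 nucleotides of KCC000704A_c01, copied from the record above.
HEAD = (
    "cgggacgaggacgacgacatgggccgcggccggccgcagctggctaaggagctGTCATAC"
    "TTTGAGCGCGCCAAGCAGCGCCTGCGCAACAAGGACGCGTACAACGACATGCTCAAGTGC"
)

protein = translate(HEAD, frame=1)
# Nucleotide 58 is the first base of codon 20, i.e. protein index 19.
print(protein[(58 - 1) // 3:])
```

The printed peptide starts with YFERAKQRLRNK, the same residues that open the Query line beginning at position 58 in the first alignment.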


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000704A_C01 KCC000704A_c01
         (522 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAC43533.1| putative transcriptional regulatory protein [Ara...   115  3e-25
ref|NP_186781.2| expressed protein [Arabidopsis thaliana]             107  7e-23
gb|AAF03494.1|AC010676_4 unknown protein [Arabidopsis thaliana]       104  6e-22
ref|NP_173829.1| expressed protein [Arabidopsis thaliana] gi|748...    97  1e-19
pir||I61713 co-repressor protein - mouse gi|642619|gb|AAA69773.1...    81  7e-15

>dbj|BAC43533.1| putative transcriptional regulatory protein [Arabidopsis thaliana]
          Length = 543

 Score =  115 bits (289), Expect = 3e-25
 Identities = 63/176 (35%), Positives = 99/176 (55%), Gaps = 21/176 (11%)
 Frame = +1

Query: 58  YFERAKQRLRNKDAYNDMLKCLSMYTQEIISRQELLQLLNDIIGRFPDLMSGFHEFMAKC 237
           + E+ K+RL ++D Y   LKCL+M++  II R++L  L++D++G+FPDLM  F++F  +C
Sbjct: 337 FCEKVKERLCSQDDYQAFLKCLNMFSNGIIQRKDLQNLVSDVLGKFPDLMDEFNQFFERC 396

Query: 238 DLME---------------IDENKVRGLAAQNLQRQRQR------EEELQRQKYLTKPLS 354
           + ++                +EN  R +  +   R+ +R      E+E  + KY+ K + 
Sbjct: 397 ESIDGFQHLAGVMSKKSLGSEENLSRSVKGEEKDREHKRDVEAAKEKERSKDKYMGKSIQ 456

Query: 355 EIVSGDTERVTPSYVKMPPGYPHLVSTGRTPLADSVLNSEFVNVITGSEDYSFKLM 522
           E+   D ER TPSY  +PP YP      R     +VLN  +V+V +GSEDYSFK M
Sbjct: 457 ELDLSDCERCTPSYRLLPPDYPIPSVRHRQKSGAAVLNDHWVSVTSGSEDYSFKHM 512

>ref|NP_186781.2| expressed protein [Arabidopsis thaliana]
          Length = 1039

 Score =  107 bits (268), Expect = 7e-23
 Identities = 57/177 (32%), Positives = 102/177 (57%), Gaps = 22/177 (12%)
 Frame = +1

Query: 58  YFERAKQRLRNKDAYNDMLKCLSMYTQEIISRQELLQLLNDIIGRFPDLMSGFHEFMAKC 237
           + E+ K RL ++D Y   LKCL++++  II R++L  L++D++G+FPDLM  F++F  +C
Sbjct: 8   FCEKVKDRLCSQDDYQTFLKCLNIFSNGIIQRKDLQNLVSDLLGKFPDLMDEFNQFFERC 67

Query: 238 DLMEIDENKVRGLAAQNL--------------QRQRQREEELQ--------RQKYLTKPL 351
           + +     ++ G+ ++ L              +++ + + EL+        +++Y+ K +
Sbjct: 68  ESITDGFQRLAGVMSKKLFSSEEQLSRPMKVEEKESEHKPELEAVKETEQCKKEYMGKSI 127

Query: 352 SEIVSGDTERVTPSYVKMPPGYPHLVSTGRTPLADSVLNSEFVNVITGSEDYSFKLM 522
            E+   D E  TPSY  +P  YP  +++ R+ L   VLN  +V+V +GSEDYSFK M
Sbjct: 128 QELDLSDCECCTPSYRLLPADYPIPIASQRSELGAEVLNDHWVSVTSGSEDYSFKHM 184

>gb|AAF03494.1|AC010676_4 unknown protein [Arabidopsis thaliana]
          Length = 1324

 Score =  104 bits (260), Expect = 6e-22
 Identities = 58/181 (32%), Positives = 98/181 (54%), Gaps = 26/181 (14%)
 Frame = +1

Query: 58  YFERAKQRLRNKDAYNDMLKCLSMYTQEIISRQELLQLLNDIIGRFPDLMSGFHEFMAKC 237
           + E+ K RL ++D Y   LKCL++++  II R++L  L++D++G+FPDLM  F++F  +C
Sbjct: 341 FCEKVKDRLCSQDDYQTFLKCLNIFSNGIIQRKDLQNLVSDLLGKFPDLMDEFNQFFERC 400

Query: 238 D--------------------LMEIDENKVRGLAAQNLQRQRQ------REEELQRQKYL 339
           +                    L   +E   R +  +  + + +      +E E  +++Y+
Sbjct: 401 ESITGTEIHGFQRLAGVMSKKLFSSEEQLSRPMKVEEKESEHKPELEAVKETEQCKKEYM 460

Query: 340 TKPLSEIVSGDTERVTPSYVKMPPGYPHLVSTGRTPLADSVLNSEFVNVITGSEDYSFKL 519
            K + E+   D E  TPSY  +P  YP  +++ R+ L   VLN  +V+V +GSEDYSFK 
Sbjct: 461 GKSIQELDLSDCECCTPSYRLLPADYPIPIASQRSELGAEVLNDHWVSVTSGSEDYSFKH 520

Query: 520 M 522
           M
Sbjct: 521 M 521

>ref|NP_173829.1| expressed protein [Arabidopsis thaliana] gi|7486361|pir||T00649
           hypothetical protein F3I6.12 - Arabidopsis thaliana
           gi|2829870|gb|AAC00578.1| Hypothetical protein
           [Arabidopsis thaliana]
          Length = 1263

 Score = 97.1 bits (240), Expect = 1e-19
 Identities = 58/199 (29%), Positives = 99/199 (49%), Gaps = 39/199 (19%)
 Frame = +1

Query: 43  AKELSYFERAKQRLRNKDAYNDMLKCLSMYTQEIISRQELLQLLNDIIGRFPDLMSGFHE 222
           +++L+  +R K++L N   Y + L+CL+++++EIISR EL  L+ ++IG +PDLM  F E
Sbjct: 288 SQDLAIVDRVKEKL-NASEYQEFLRCLNLFSKEIISRPELQSLVGNLIGVYPDLMDSFIE 346

Query: 223 FMAKCDLME---------------------------------------IDENKVRGLAAQ 285
           F+ +C+  E                                        D+   R    +
Sbjct: 347 FLVQCEKNEKRQICNLLNLLAEGLLSGILTKKSLWSEGKYPQPSLDNDRDQEHKRDDGLR 406

Query: 286 NLQRQRQREEELQRQKYLTKPLSEIVSGDTERVTPSYVKMPPGYPHLVSTGRTPLADSVL 465
           +   +++R E+        KP+SE+   + E+ TPSY  +P  YP  +++ +T +   VL
Sbjct: 407 DRDHEKERLEKAAANLKWAKPISELDLSNCEQCTPSYRLLPKNYPISIASQKTEIGKLVL 466

Query: 466 NSEFVNVITGSEDYSFKLM 522
           N  +V+V +GSEDYSF  M
Sbjct: 467 NDHWVSVTSGSEDYSFSHM 485

>pir||I61713 co-repressor protein - mouse gi|642619|gb|AAA69773.1| mSin3A gene
           product
          Length = 1219

 Score = 81.3 bits (199), Expect = 7e-15
 Identities = 46/155 (29%), Positives = 79/155 (50%)
 Frame = +1

Query: 49  ELSYFERAKQRLRNKDAYNDMLKCLSMYTQEIISRQELLQLLNDIIGRFPDLMSGFHEFM 228
           E  +F++ ++ LR+ +AY + L+CL ++ QE+ISR EL+QL++  +G+FP+L + F  F+
Sbjct: 464 ESLFFDKVRKALRSAEAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFL 523

Query: 229 AKCDLMEIDENKVRGLAAQNLQRQRQREEELQRQKYLTKPLSEIVSGDTERVTPSYVKMP 408
              + + +                    E   +++       EI     +R+  SY  +P
Sbjct: 524 GYKESVHL--------------------ESFPKERATEGIAMEIDYASCKRLGSSYRALP 563

Query: 409 PGYPHLVSTGRTPLADSVLNSEFVNVITGSEDYSF 513
             Y     TGRTPL   VLN  +V+  + SED +F
Sbjct: 564 KSYQQPKCTGRTPLCKEVLNDTWVSFPSWSEDSTF 598

 Score = 33.5 bits (75), Expect = 1.7
 Identities = 18/78 (23%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
 Frame = +1

Query: 31  RPQLAKELSYFERAKQRLRNK-DAYNDMLKCLSMYTQEIISRQELLQLLNDIIGRFPDLM 207
           R ++   LSY ++ K +  ++   YND L  +  +  + I    ++  ++ +    PDL+
Sbjct: 120 RLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISRVSQLFKGHPDLI 179

Query: 208 SGFHEFMAKCDLMEIDEN 261
            GF+ F+     +E+  N
Sbjct: 180 MGFNTFLPPGYKIEVQTN 197



EST assemble image


No.  Clone        Accession  Start  End
1    MXL063a08_r  BP096684       1  359
2    MXL084d04_r  BP097966      54  322
3    MXL083a05_r  BP097890     117  476
4    MXL095b01_r  BP098541     117  524
5    HCL031b08_r  AV641292     171  278
6    CL56a12_r    AV396123     311  489
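
The assembly image itself is not reproduced here, but the clone table above carries enough information to recompute what it would show. The following is a rough sketch, assuming the two position values are 1-based, inclusive start and end coordinates on the assembled contig (the record does not state this explicitly). The function names `contig_span` and `depth_at` are hypothetical.

```python
# Rows copied from the clone table above: (clone, accession, start, end).
# Assumption: start and end are 1-based, inclusive contig coordinates.
CLONES = [
    ("MXL063a08_r", "BP096684", 1, 359),
    ("MXL084d04_r", "BP097966", 54, 322),
    ("MXL083a05_r", "BP097890", 117, 476),
    ("MXL095b01_r", "BP098541", 117, 524),
    ("HCL031b08_r", "AV641292", 171, 278),
    ("CL56a12_r", "AV396123", 311, 489),
]


def contig_span(clones):
    """Return the smallest start and largest end over all clones."""
    return min(c[2] for c in clones), max(c[3] for c in clones)


def depth_at(clones, position):
    """Count how many clones cover a given contig position."""
    return sum(1 for _, _, start, end in clones if start <= position <= end)


first, last = contig_span(CLONES)
print("span:", first, last)
for pos in (1, 200, 500):
    print("depth at", pos, "=", depth_at(CLONES, pos))
```

Under that coordinate assumption, the middle of the contig is covered by most of the clones, while the two ends rest on a single read each.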




Chlamydomonas reinhardtii
Kazusa DNA Research Institute