KCC001886A_c01

Fasta Sequence
>KCC001886A_C01 KCC001886A_c01
acatggaccgcgtcaacatgtacagctccttctacatggcccgccggaagggcgccaaga
aggacaactaagcagctggcGGCGAAGGCACGGCGGGCAAGTGGCCGGCTAGCAACAGCC
AATGGCGCTGACATCGCAGGAGCAGTGCGTTGGTTGCTAGCCGGCGGCGTGCCGTGCAGG
GAGCACCGCTGTGGTCAAGCTGCGGGGACTGGGCTGGCAGCGGAAGCAGTACAGGCAGCG
GCGCAATGGGCCCGTTACTCGTGCGGGCCCTGCAGTCCAGCTCGAATCATTGCAGCTTGC
TAGCCCGGCTGTCACAGCAGCGCTTCGGTAGGTGCGGCCGTCCACGTTTAGCGTGCTGGA
CTCCATAGTAGTGGTGCCAGTAGGAGTTAGGACAATGACCAGTAGCGCAGTTGAGGACGG
GCATTTTCCCGGTAGCCCAGGTTCCGCTGTCTGTGGTTGGTGTGTGGCAATCCACGCCGG
ACGCAGTTGTAGGGGACACCGGGTCGCCGCACAGGCTTCCTGGCCACTGCTGTGTGGGGT
GTGGGGGACCGATAGCGCCCGAGGCAGCACCCTGGGGATCATTAAGTAAACGAATGAAGT
GCGTGACAGCGACAGAATCGCGCAGTTGGATGTGTGCTGTGTTATGTCTGGTTTCCATCT
CATGTGTTGCTGGTGCCGCGGAGGGGAGCGGAAGGCACATGGCGGCCGGTCTGGCGTCCT
CGAGTGCCTAGTATCGGGGCTGCTGCACGCTCTCAGCAGTGCAGTTGCTTCGGGAGGCGT
GCTTTGGGATTCCGACGGACTGTGAG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001886A_C01 KCC001886A_c01
         (806 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_859456.1| hypothetical protein Chr3_0240 [Leishmania majo...    41  0.020
gb|AAH58068.1| Unknown (protein for MGC:63802) [Danio rerio]           37  0.49
ref|NP_075993.1| NK2 transcription factor related, locus 4 [Mus ...    36  0.64
ref|NP_031522.1| AT motif binding factor 1 [Mus musculus] gi|182...    34  0.71
ref|NP_081083.1| RIKEN cDNA 1110033F04 [Mus musculus] gi|1283508...    36  0.84

>ref|NP_859456.1| hypothetical protein Chr3_0240 [Leishmania major]
            gi|21629320|gb|AAM68997.1|AC125735_27 hypothetical
            protein Chr3_0240 [Leishmania major]
          Length = 2087

 Score = 41.2 bits (95), Expect = 0.020
 Identities = 41/154 (26%), Positives = 63/154 (40%), Gaps = 7/154 (4%)
 Frame = -3

Query: 702  PCAFRSPPRHQQHMRWKPDITQHTSNCAILS------LSRTSFVYLMIPRVLPRALSVPH 541
            PC     P H       P++ Q  S  A +       LSR++   +     LPR ++ P 
Sbjct: 1671 PCTPPPTPPHGH----PPNLAQQQSRDAAVQWQHAVVLSRSATALMQTTNTLPRVVASPP 1726

Query: 540  TPHSSGQEA-CAATRCPLQLRPAWIATHQPQTAEPGLPGKCPSSTALLVIVLTPTGTTTM 364
            +P    + A  +A+R    L PA  A    Q A  G   K  +  A     + PTG T  
Sbjct: 1727 SPSRLPRVAPLSASRRTKPLAPAAAAPASSQPARSGYEAKTLARAA-----MAPTGKTQR 1781

Query: 363  ESSTLNVDGRTYRSAAVTAGLASCNDSSWTAGPA 262
             S++  V G+  + A  ++ LA C  ++   G A
Sbjct: 1782 PSNSAKVRGKA-KQAPPSSPLAGCGSAAVARGAA 1814

>gb|AAH58068.1| Unknown (protein for MGC:63802) [Danio rerio]
          Length = 896

 Score = 36.6 bits (83), Expect = 0.49
 Identities = 39/154 (25%), Positives = 54/154 (34%), Gaps = 22/154 (14%)
 Frame = -3

Query: 801  SPSESQSTPPEATALLRACSSPDTRHSRTPDRPPCAFR------SPPRHQQHM------- 661
            SP  S+ T P       + +SP  RH R    PP A R      SPP H   +       
Sbjct: 679  SPKRSRGTSPGKRRTPPSSASPPPRHRRNSPNPPAAQRGRDTRSSPPAHTTRVSSSPPGR 738

Query: 660  ---------RWKPDITQHTSNCAILSLSRTSFVYLMIPRVLPRALSVPHTPHSSGQEACA 508
                     R +   +   S   I  +SRT       PR   R    P         + +
Sbjct: 739  YGASGSSPQRQRRQTSPSHSTRPIRRVSRTP-----EPRKSQRGSQSPPPERRQVSRSPS 793

Query: 507  ATRCPLQLRPAWIATHQPQTAEPGLPGKCPSSTA 406
            A+  P Q RPA ++  +  +  P  P K  SS +
Sbjct: 794  ASPPPAQKRPASVSPSRSTSRSPPPPAKKNSSVS 827

>ref|NP_075993.1| NK2 transcription factor related, locus 4 [Mus musculus]
           gi|27923816|sp|Q9EQM3|NK24_MOUSE Homeobox protein
           Nkx-2.4 (Homeobox protein NKX2.4) (Homeobox protein NK-2
           homolog D) gi|11493714|gb|AAG35618.1|AF202038_1 homeobox
           transcription factor [Mus musculus]
          Length = 354

 Score = 36.2 bits (82), Expect = 0.64
 Identities = 23/61 (37%), Positives = 31/61 (50%)
 Frame = -1

Query: 308 PG*QAAMIRAGLQGPHE*RAHCAAACTASAASPVPAA*PQRCSLHGTPPASNQRTAPAMS 129
           P  QAA + AG+Q PH    H AAA  A+AA+   AA     + +  PP  +Q    AM 
Sbjct: 54  PSSQAAAVAAGMQPPHAMAGHNAAAAAAAAAAAAAAA-----ATYHMPPGVSQFPHSAMG 108

Query: 128 A 126
           +
Sbjct: 109 S 109

>ref|NP_031522.1| AT motif binding factor 1 [Mus musculus]
           gi|18202592|sp|Q61329|ABF1_MOUSE Alpha-fetoprotein
           enhancer binding protein (AT motif-binding factor)
           (AT-binding transcription factor 1)
           gi|1345408|dbj|BAA05046.1| AT motif-binding factor [Mus
           musculus] gi|1587706|prf||2207230A transcription factor
           ATBF1
          Length = 3726

 Score = 33.9 bits (76), Expect(2) = 0.71
 Identities = 27/80 (33%), Positives = 37/80 (45%), Gaps = 8/80 (10%)
 Frame = -2

Query: 580 DPQGAASGAIGPPHPTQQWPGSLC--GDPVSPTTASGVDCHTPTTDSGTWAT--GKMPVL 413
           +P  +  G  G   P  Q  GSLC  G   SP+  SGV+C    T  G+  +  G M ++
Sbjct: 605 EPNESTEGDDGGFVPHHQHAGSLCELGVGESPS-GSGVECPKCDTVLGSSRSLGGHMTMM 663

Query: 412 N----CATGHCPNSYWHHYY 365
           +    C T  CP   WH+ Y
Sbjct: 664 HSRNSCKTLKCPKCNWHYKY 683

 Score = 20.8 bits (42), Expect(2) = 0.71
 Identities = 7/12 (58%), Positives = 8/12 (66%)
 Frame = -3

Query: 237 CLYCFRCQPSPR 202
           C+YC   QP PR
Sbjct: 702 CVYCKSGQPHPR 713

>ref|NP_081083.1| RIKEN cDNA 1110033F04 [Mus musculus] gi|12835082|dbj|BAB23145.1|
           unnamed protein product [Mus musculus]
          Length = 167

 Score = 35.8 bits (81), Expect = 0.84
 Identities = 24/68 (35%), Positives = 27/68 (39%), Gaps = 16/68 (23%)
 Frame = -3

Query: 156 QPTHC--SCDVSAI--------GCC*PATCPPCLRRQLLSC------PSWRPSGGPCRRS 25
           QPT C  SC +S+          CC  + C PC R     C      P  RP   PC R 
Sbjct: 70  QPTCCRPSCCISSCCQPSCGSSSCCGSSCCRPCCRPCCSPCCSPCCRPCCRPCCRPCCRP 129

Query: 24  CTC*RGPC 1
           C C R  C
Sbjct: 130 CCCLRPVC 137



EST assemble image


     clone       accession  position (start-end)
1    LC082g03_r  AV624788   1-504
2    HC017c06_r  AV633142   299-806




Chlamydomonas reinhardtii
Kazusa DNA Research Institute