KCC000247A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000247A_C01 KCC000247A_c01
gcttaattgcagcttcatacattgcatgtggcctCGAGACTTATCAATGGATACCCTGTA
GGCTATCACCATCCATTGAGCGGTGCTGATACACAAGCATGACCCGCTAGTTCAACACAA
GTCCTCCAAAATGCGCGGACCGCTTGGTGGCATGCATGTGCAAATTCTTCTAGCTCTCAT
TATATATGCCATAGTTGTAGCAGCTCGGTTTGAGGACCAGGCTGGTGCGTATGACTGGTA
TAAACAGCACATTGGCATAGCCACGTCTGCACAATTTCATCCCAGCAAGCCGCGTGTTTG
CGTGGCGACGGAGCAGTCGGTGGTCGGCTGCTTAAACCTGCGCGATGGCAGCATCGCGTG
GCGGAAGTCACTTCAGACAGCTCATGCAGCGCCATCAGTAGCATACGTTGAAAGCTCGAG
CTCACTGGTGACGGCGTCAGGTGGCTTGGTTCGCGCGTTCGATCTGGAGGGCGGCCTCAA
GTGGCAGCGCAAGCTGCCTGTGCAGTCAGGAGCATTTGTTTCGGAAGTTAAGGGCAAAGG
CAGTGACAACTCCAGTGGCGCGATCCTGGCTGTCCAGGCAGGCGCGGTCCAGGTCCTGGA
CGCTGCGGACGCGTCCCAGCTGTCGAAGCCGCATCAGCTGAAGGGCCTGGCCAAGGACAA
CATCGTTGCCGCCGACGGCTACCTCGTAGCGTACAACACCGGGTCCAAGGCCGTGCTGCT
TGTGTTGACGTCCAGCTTTGTTGCCGGCGACGCGGCCGCTGAGGTGGTGGTGGAGGCACC
ACAGAATCTGTCGGCAGCGGCTGCAGGCGGGCCGGCTGGCTTTGCGGCGCTGTCCGCTTG
CGGTAGCTCGCTCTGTGTCCTGCGACTTGGCGGCTCCGGGGATGCGGCGTTCAGTTGCCT
GCGCCTGGACGCGCTTGTTCCGGAGCTGACCGCGGCCCCCGCGGTTAGCCGCAAGCTGAT
TGCCACCTCGGCAGGCTTCGTGCTTTTGAGCAGCGATGCGGGCGCTGCTGTGCTCTCCGT
CGTTGATGGCGCACCCAAGCTGCTGCGCTTCAATGCTGCCGCCATGGGCGCCTCGGC


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000247A_C01 KCC000247A_c01
         (1077 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_196717.2| expressed protein [Arabidopsis thaliana]              72  1e-11
dbj|BAC97862.1| mKIAA0090 protein [Mus musculus]                       70  8e-11
ref|NP_666269.2| RIKEN cDNA C230096C10; KIAA0090 protein-like [M...    68  2e-10
ref|NP_055862.1| KIAA0090 protein [Homo sapiens] gi|21961663|gb|...    67  7e-10
pir||T46707 proteophosphoglycan, membrane-associated [imported] ...    56  9e-07

>ref|NP_196717.2| expressed protein [Arabidopsis thaliana]
          Length = 978

 Score = 72.4 bits (176), Expect = 1e-11
 Identities = 74/281 (26%), Positives = 123/281 (43%), Gaps = 15/281 (5%)
 Frame = +2

Query: 152 MHVQILLALIIY---AIVVAARFEDQAGAYDWYKQHIGIATSAQFHPSK---PRVCVATE 313
           M +++ L L+++   AI+  + +EDQAG  DW++++IG    A FH  K    RV V+TE
Sbjct: 1   MAIRVFLTLLLFLSSAILSFSLYEDQAGLTDWHQRYIGKVKHAVFHTQKTGRKRVIVSTE 60

Query: 314 QSVVGCLNLRDGSIAWRKSLQTAHAAPSVAYVESSSSLVTAS-GGLVRAFDL-EGGLKWQ 487
           ++VV  L+LR G I WR  L T  A   V        +  +S G  +RA++L +G + W+
Sbjct: 61  ENVVASLDLRHGEIFWRHVLGTKDAIDGVGIALGKYVITLSSEGSTLRAWNLPDGQMVWE 120

Query: 488 RKLPV--QSGAFVSEVKGKGSDNSSGAILAVQAGAVQVLDAADASQL-SKPHQLKGLAKD 658
             L     S + +S    K        I     G +  + A D   L  K    +G    
Sbjct: 121 TSLHTAQHSKSLLSVPVDK-----DYPITVFGGGYLHAVSAIDGEVLWKKDFTAEGFEVQ 175

Query: 659 NIVAADG----YLVAYNTGSKAVLLVLTSSFVAGDAAAEVVVEAPQNLSAAAAGGPAGFA 826
            ++ A G    Y++ +   S+AV+  + S   +G+  A+     P   S   +   +   
Sbjct: 176 RVLQAPGSSIIYVLGFLHSSEAVVYQIDSK--SGEVVAQKSTVFPGGFSGEISSVSSDKV 233

Query: 827 ALSACGSSLCVLRLGGSGDAAFSCLRLDALVPELTAAPAVS 949
            +     S+ V      GD +F    +  LV +   A  +S
Sbjct: 234 VVLDSTRSILVTIGFIDGDISFQKTPISDLVEDSGTAEILS 274

>dbj|BAC97862.1| mKIAA0090 protein [Mus musculus]
          Length = 992

 Score = 69.7 bits (169), Expect = 8e-11
 Identities = 40/117 (34%), Positives = 66/117 (56%), Gaps = 5/117 (4%)
 Frame = +2

Query: 182 IYAIVVAARFEDQAGAYDWYKQHIGIA--TSAQFHPSKPRVCVATEQSVVGCLNLRDGSI 355
           +  + VAA +EDQ G +DW +Q++G     S +F P   ++ VATE++V+  LN R G I
Sbjct: 10  VLLVPVAAVYEDQVGKFDWRQQYVGKIKFASLEFSPGSKKLVVATEKNVIAALNSRTGEI 69

Query: 356 AWRK-SLQTAHAAPSVAYVESSSSLVTASGG-LVRAFDLE-GGLKWQRKLPVQSGAF 517
            WR     TA  A     V    ++  ++GG L+R+++   GGL W+  + + +G+F
Sbjct: 70  LWRHVDKGTAEGAVDAMLVHGQDAITVSNGGRLMRSWETNIGGLNWE--ITLDTGSF 124

>ref|NP_666269.2| RIKEN cDNA C230096C10; KIAA0090 protein-like [Mus musculus]
           gi|26339734|dbj|BAC33530.1| unnamed protein product [Mus
           musculus]
          Length = 997

 Score = 68.2 bits (165), Expect = 2e-10
 Identities = 39/117 (33%), Positives = 65/117 (55%), Gaps = 5/117 (4%)
 Frame = +2

Query: 182 IYAIVVAARFEDQAGAYDWYKQHIGIA--TSAQFHPSKPRVCVATEQSVVGCLNLRDGSI 355
           +  +  AA +EDQ G +DW +Q++G     S +F P   ++ VATE++V+  LN R G I
Sbjct: 15  VLLVPAAAVYEDQVGKFDWRQQYVGKIKFASLEFSPGSKKLVVATEKNVIAALNSRTGEI 74

Query: 356 AWRK-SLQTAHAAPSVAYVESSSSLVTASGG-LVRAFDLE-GGLKWQRKLPVQSGAF 517
            WR     TA  A     V    ++  ++GG L+R+++   GGL W+  + + +G+F
Sbjct: 75  LWRHVDKGTAEGAVDAMLVHGQDAITVSNGGRLMRSWETNIGGLNWE--ITLDTGSF 129

>ref|NP_055862.1| KIAA0090 protein [Homo sapiens] gi|21961663|gb|AAH34589.1| KIAA0090
           protein [Homo sapiens]
          Length = 993

 Score = 66.6 bits (161), Expect = 7e-10
 Identities = 39/114 (34%), Positives = 63/114 (55%), Gaps = 5/114 (4%)
 Frame = +2

Query: 191 IVVAARFEDQAGAYDWYKQHIGIA--TSAQFHPSKPRVCVATEQSVVGCLNLRDGSIAWR 364
           I  AA +EDQ G +DW +Q++G     S +F P   ++ VATE++V+  LN R G I WR
Sbjct: 17  IPAAAVYEDQVGKFDWRQQYVGKVKFASLEFSPGSKKLVVATEKNVIAALNSRTGEILWR 76

Query: 365 K-SLQTAHAAPSVAYVESSSSLVTASGG-LVRAFDLE-GGLKWQRKLPVQSGAF 517
                TA  A     +     +  ++GG ++R+++   GGL W+  + + SG+F
Sbjct: 77  HVDKGTAEGAVDAMLLHGQDVITVSNGGRIMRSWETNIGGLNWE--ITLDSGSF 128

>pir||T46707 proteophosphoglycan, membrane-associated [imported] - Leishmania
            major (fragment) gi|5420389|emb|CAB46680.1|
            proteophosphoglycan [Leishmania major]
          Length = 383

 Score = 56.2 bits (134), Expect = 9e-07
 Identities = 55/208 (26%), Positives = 100/208 (47%)
 Frame = -1

Query: 1062 AAALKRSSLGAPSTTESTAAPASLLKSTKPAEVAISLRLTAGAAVSSGTSASRRRQLNAA 883
            ++A   SS  APS++ S+A  AS   S+ P+  + +   ++ +A SS +SA      +A+
Sbjct: 54   SSAPSASSSSAPSSSSSSAPSAS--SSSAPSSSSSAPSASSSSAPSSSSSAP-----SAS 106

Query: 882  SPEPPSRRTQSELPQADSAAKPAGPPAAAADRFCGASTTTSAAASPATKLDVNTSSTALD 703
            S   PS  + S  P A S++ P+   +A +     A +++S++A  A+     +SS++  
Sbjct: 107  SSSAPS--SSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSSSA 164

Query: 702  PVLYATR*PSAATMLSLARPFS*CGFDSWDASAASRTWTAPAWTARIAPLELSLPLPLTS 523
            P   ++  PS+++  + +   S         SA S + +AP+ ++  AP   S   P  S
Sbjct: 165  PSASSSSAPSSSSSSAPSASSS---------SAPSSSSSAPSASSSSAPSSSSSSAPSAS 215

Query: 522  ETNAPDCTGSLRCHLRPPSRSNARTKPP 439
             ++AP  + S      P S S   T  P
Sbjct: 216  SSSAPSSSSS-----APSSSSTTTTMDP 238

 Score = 53.1 bits (126), Expect = 8e-06
 Identities = 54/220 (24%), Positives = 98/220 (44%), Gaps = 7/220 (3%)
 Frame = -1

Query: 1044 SSLGAPSTTESTAAPASLLKSTKPAEVAISLRLTAGAAVSSGTSASRRRQLNAASPEPPS 865
            SS  APS + S+A  +S    +  +  A S   +A +A SS   +S      +AS     
Sbjct: 6    SSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASSSSAP 65

Query: 864  RRTQSELPQADSAAKPAGPPAAAADRFCGASTTTSAAASPATKLDVNTSSTALDPVLYAT 685
              + S  P A S++ P+   +A +     A +++S+A S ++    ++SS+A  P   ++
Sbjct: 66   SSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSA--PSASSS 123

Query: 684  R*PSAATMLSLARPFS*CGFDSWDASAASRTW-------TAPAWTARIAPLELSLPLPLT 526
              PS+++    A   S     S  A +AS +        +AP+ ++  AP   S   P  
Sbjct: 124  SAPSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSSSAPSA 183

Query: 525  SETNAPDCTGSLRCHLRPPSRSNARTKPPDAVTSELELST 406
            S ++AP  + S        + S++ +  P A +S    S+
Sbjct: 184  SSSSAPSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSS 223

 Score = 48.1 bits (113), Expect = 3e-04
 Identities = 51/218 (23%), Positives = 97/218 (44%), Gaps = 2/218 (0%)
 Frame = -1

Query: 1032 APSTTESTAAPASLLKSTKPAEVAISLRLTAGAAVSSGTSASRRRQLNAASPEPPSRRTQ 853
            APS++ S  + +S    +  +    +   +A ++ SS  SAS     +++S   PS  + 
Sbjct: 3    APSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASSS 62

Query: 852  SELPQADSAAKPAGPPAAAADRFCGASTTTSAAASPATKLDVNTSSTALDPVLYATR*PS 673
            S    + S+A  A   +A +      S ++S+A S ++     +SS+A       +   S
Sbjct: 63   SAPSSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSA------PSSSSS 116

Query: 672  AATMLSLARPFS*CGFDSWDASAA--SRTWTAPAWTARIAPLELSLPLPLTSETNAPDCT 499
            A +  S + P S     S  +S+A  S + +AP+ ++  AP   S   P  S ++AP  +
Sbjct: 117  APSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSS 176

Query: 498  GSLRCHLRPPSRSNARTKPPDAVTSELELSTYATDGAA 385
             S        S  ++ +  P A +S    S+ ++  +A
Sbjct: 177  SSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSA 214



EST assemble image


clone accession position
1 MXL087d06_r BP098116 1 376
2 LCL028f12_r AV627570 35 557
3 MXL087f09_r BP098124 48 409
4 LCL066a08_r AV629780 56 509
5 LCL085b12_r AV630852 61 192
6 LCL067h02_r AV629862 66 466
7 MXL051f09_r BP096047 69 568
8 LCL074d11_r AV630156 83 421
9 LCL002b04_r AV626082 310 705
10 LCL051g09_r AV629087 349 806
11 CL13c07_r AV393901 584 1078




Chlamydomonas reinhardtii
Kazusa DNA Research Institute