KCC001521A_c01

Fasta Sequence
>KCC001521A_C01 KCC001521A_c01
cagTTCTTGAACGCCCAGTCGCTTGCTTCATTTTCTAGGTCACTTCTTGTTTCCACCCAC
ATCCAAGAATGCTGGCCCTCTCCTCGTCTCGCAGCGCTGCGTTCAGCGCCACCGTGCGCG
TCCAGCCGGCTCGCCAGGTTGCCCTGGCTCCCAGCCGCTCGTCTCTGATTGTTGAGGCCG
CTGGTGCCCAAGAAGAAGACCAGCAAGTCGAAGACGGCCATCCGCAAGGCGGCCTGGAAG
AAGGAGGTGCTGCCCTACGTGGAGCAGGCCCTGTTCAAGGCTAAGCTGGCTCTGAAGGAG
GGCAGCCGCGACTCTGCCGATAAGGACATGGTCGTGAGCACCCAGACTGAGGAGAAGTCG
GAGTAAGCGGCTCGCTGGACCGCAAGCGGACTTTGTGCATCGCGGAGCAGAATGGCGAGG
CACTGGCTGAAGTGCTGGTCCCAAGTGCTGGGGCTGGCCTGGCATGGTGCTTGACGGGAA
GGGGGCTGTAGGCGCAGGCGCGGGGCGTATCCCCGGCGATTTTGTTCTGCAGCCGGGGTG
GGCATGGGTCGCTGCCCTCCCGGCAGCCCCGGGCTGGGCGGTGGGGCCCGCGGCGTGGCC
AGG
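
The record above can be checked programmatically. What follows is a minimal sketch, assuming the FASTA text is available as a Python string; `parse_fasta` and `gc_fraction` are hypothetical helper names, not part of any library. For this contig, the parsed sequence length should agree with the 603 letters that BLASTX reports for the query.

```python
def parse_fasta(text):
    """Return a list of (header, sequence) tuples from FASTA-formatted text."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence lines are concatenated as-is
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def gc_fraction(seq):
    """Fraction of G and C bases in seq (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)
```

Calling `len(sequence)` on the parsed record gives a quick consistency check against the query length in the search report. The case of each base is preserved, so the lowercase `cag` at the start of the sequence survives parsing.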


Nr Search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001521A_C01 KCC001521A_c01
         (603 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_311129.1| ENSANGP00000004749 [Anopheles gambiae] gi|30178...    44  1e-05
dbj|BAC98254.1| mKIAA1802 protein [Mus musculus]                       51  1e-05
ref|XP_134016.1| expressed sequence AI116001 [Mus musculus] gi|3...    50  2e-05
ref|XP_323435.1| hypothetical protein [Neurospora crassa] gi|289...    47  3e-04
ref|XP_352232.1| similar to amino acid feature: Rod protein doma...    47  3e-04

>ref|XP_311129.1| ENSANGP00000004749 [Anopheles gambiae] gi|30178092|gb|EAA06612.2|
           ENSANGP00000004749 [Anopheles gambiae str. PEST]
          Length = 633

 Score = 44.3 bits (103), Expect(2) = 1e-05
 Identities = 39/119 (32%), Positives = 53/119 (43%), Gaps = 8/119 (6%)
 Frame = -3

Query: 502 RACAYSP--LPVKHHARP--APALGTSTSASASPFCSAMHKVRLRSSEPLTPTSPQSGCS 335
           RA   SP  LP +  ARP  A A G    A+A+   +A   V +R + P TP+   + C 
Sbjct: 471 RAARASPPALPFRAPARPPSAQAAGPGVVAAAAAAAAAPAAVAVRPASPCTPSPCAAACP 530

Query: 334 RPCPYRQSRGCPPSEPA*P*TGPAPRRAAPPSS----RPPCGWPSSTCWSSSWAPAAST 170
                 +  GCP   P  P   PA R A PP++     PP    S+    SS  PA ++
Sbjct: 531 TSPSAARPSGCPGRSP--PSAPPAARPAGPPAACTARAPPGRSRSAPPSRSSAGPAVAS 587

 Score = 35.0 bits (79), Expect = 0.82
 Identities = 40/160 (25%), Positives = 52/160 (32%), Gaps = 29/160 (18%)
 Frame = -3

Query: 601 WPRRGPHRPARGCREGSDPCPPRLQNKIAGDTPRACAYSPLPVKHHARPAPALGTSTSAS 422
           WP R            +   PP  Q    G    A A +  P     RPA     S  A+
Sbjct: 468 WPTRAARASPPALPFRAPARPPSAQAAGPGVVAAAAAAAAAPAAVAVRPASPCTPSPCAA 527

Query: 421 ASPFCSAMHKV-----RLRSSEP--LTPTSPQSGCSRPCPYRQSRGCPPSEP-------- 287
           A P   +  +      R   S P    P  P + C+   P  +SR  PPS          
Sbjct: 528 ACPTSPSAARPSGCPGRSPPSAPPAARPAGPPAACTARAPPGRSRSAPPSRSSAGPAVAS 587

Query: 286 -------------A*P*TGPA-PRRAAPPSSRPPCGWPSS 209
                        A P   PA PR +APP +  P   P++
Sbjct: 588 GCGSPPDCSASCAARPPASPAAPRSSAPPGASRPAPRPAA 627

 Score = 33.9 bits (76), Expect = 1.8
 Identities = 40/127 (31%), Positives = 46/127 (35%), Gaps = 10/127 (7%)
 Frame = -3

Query: 580 RPARGCREGSDPCPPRLQNKIAGDTPRAC-AYSPLPVKHHARPAPALGTSTSASASPFCS 404
           RPA  C     PC        +   P  C   SP      ARPA      T A A P  S
Sbjct: 515 RPASPCTPS--PCAAACPTSPSAARPSGCPGRSPPSAPPAARPAGPPAACT-ARAPPGRS 571

Query: 403 AMHKVRLRSSEPLTPTSPQS--GCSRPCPYRQSRGCPPSEPA*P*TG-------PAPRRA 251
                   S+ P   +   S   CS  C  R     PP+ PA P +        PAPR A
Sbjct: 572 RSAPPSRSSAGPAVASGCGSPPDCSASCAAR-----PPASPAAPRSSAPPGASRPAPRPA 626

Query: 250 APPSSRP 230
           AP + RP
Sbjct: 627 APGAPRP 633

 Score = 26.2 bits (56), Expect(2) = 1e-05
 Identities = 11/21 (52%), Positives = 14/21 (66%)
 Frame = -2

Query: 593 PRAPPPSPGLPGGQRPMPTPA 531
           P APPP+P  P G  P+P P+
Sbjct: 430 PPAPPPTP--PFGPMPLPMPS 448

>dbj|BAC98254.1| mKIAA1802 protein [Mus musculus]
          Length = 804

 Score = 50.8 bits (120), Expect = 1e-05
 Identities = 39/130 (30%), Positives = 54/130 (41%), Gaps = 5/130 (3%)
 Frame = -3

Query: 601 WPRRGPHRPARGCREGSDPCPPRLQNKIAGDTPRACAYSPLPVKHHARPAPALGTSTSAS 422
           W  +   +P++    G  P PP  QN  A D   A     LP++   + +P+L   + AS
Sbjct: 93  WSEQPKEQPSKDTESGKSPSPPERQNP-AFDPAEARPTPALPMEAQ-KTSPSLCPESQAS 150

Query: 421 ASPFCSAMHKVRLRSSEPLTPTSPQSGCSR---PCPYRQSRGCPPSEPA*P*TGPAPR-- 257
             P         L S EP  P+ P         PCP R    C   E   P  GP+P   
Sbjct: 151 GPPVLEPQGAGPLISPEPQAPSLPAEASKAAPVPCPERVDPPCELPELEKPERGPSPESV 210

Query: 256 RAAPPSSRPP 227
           ++A  SS+PP
Sbjct: 211 KSALVSSKPP 220

>ref|XP_134016.1| expressed sequence AI116001 [Mus musculus]
           gi|32469497|ref|NP_862902.1| DNA segment, Chr 8, ERATO
           Doi 457, expressed [Mus musculus]
           gi|22137493|gb|AAH28991.1| DNA segment, Chr 8, ERATO Doi
           457, expressed [Mus musculus]
          Length = 802

 Score = 50.1 bits (118), Expect = 2e-05
 Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 5/130 (3%)
 Frame = -3

Query: 601 WPRRGPHRPARGCREGSDPCPPRLQNKIAGDTPRACAYSPLPVKHHARPAPALGTSTSAS 422
           W  +   +P++    G  P PP  QN  A D   A     LP++   + +P+L   + AS
Sbjct: 91  WSEQPKEQPSKDTESGKSPSPPERQNP-AFDPAEARPTPALPMEAQ-KTSPSLCPESQAS 148

Query: 421 ASPFCSAMHKVRLRSSEPLTPTSPQSGCSR---PCPYRQSRGCPPSEPA*P*TGPAPR-- 257
             P         L S EP  P  P         PCP R    C   E   P  GP+P   
Sbjct: 149 GPPVLEPQGAGPLISPEPQAPCLPAEASKAAPVPCPERVDPPCELPELEKPERGPSPESV 208

Query: 256 RAAPPSSRPP 227
           ++A  SS+PP
Sbjct: 209 KSALVSSKPP 218

>ref|XP_323435.1| hypothetical protein [Neurospora crassa] gi|28922386|gb|EAA31621.1|
           hypothetical protein [Neurospora crassa]
          Length = 1461

 Score = 46.6 bits (109), Expect = 3e-04
 Identities = 47/150 (31%), Positives = 58/150 (38%), Gaps = 26/150 (17%)
 Frame = -3

Query: 601 WPRRGPHRPARGCREGSDPCPPRLQNKIAGDTPRACAYS-PLPVKHHARPAPALGTSTSA 425
           +P  GPH P+ G R  S P P R  +      P    Y  P+P      PAP  G   +A
Sbjct: 296 YPHGGPHPPSHGGRAPS-PAPFRAASPNPYRAPSPAPYPVPIPPPLAGSPAPT-GPFRAA 353

Query: 424 SASPFCSAM---HKVRLRSSEPLTPTS----PQSGCSRPCPYRQS--------------- 311
           S SP+  A    ++ R  S  P  P+     P S    P P R +               
Sbjct: 354 SPSPYREASPAPYRPRASSPAPYVPSPAPYHPHSHSPSPAPLRNTSPSRSRSPYKHQLDY 413

Query: 310 RGCPPSEPA*P*TGPAPRRA---APPSSRP 230
           RG PP   A P    +P RA   APPS  P
Sbjct: 414 RGTPPIRDASPARAYSPMRAHSPAPPSPAP 443

>ref|XP_352232.1| similar to amino acid feature: Rod protein domain, aa 266 .. 468;
           amino acid feature: globular protein domain, aa 32 ..
           265 [Homo sapiens]
          Length = 317

 Score = 46.6 bits (109), Expect = 3e-04
 Identities = 35/124 (28%), Positives = 45/124 (36%)
 Frame = -3

Query: 601 WPRRGPHRPARGCREGSDPCPPRLQNKIAGDTPRACAYSPLPVKHHARPAPALGTSTSAS 422
           WPR  PH PA+       P P  +     G T  A    PLP  H ++         S  
Sbjct: 154 WPRSPPHTPAKPRTHTHGPAPHLIPQPSLGPTLMA----PLPTSHPSQAQDPHSWPCSPP 209

Query: 421 ASPFCSAMHKVRLRSSEPLTPTSPQSGCSRPCPYRQSRGCPPSEPA*P*TGPAPRRAAPP 242
            +P   A        S P TPT P++    P P+   +  P      P     P +A  P
Sbjct: 210 HTPTNQAQDPHSRFRSPPHTPTKPRTHTQDPSPHLTPQPSPGPTLTVPLPTSNPDQAQDP 269

Query: 241 SSRP 230
            SRP
Sbjct: 270 RSRP 273
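
The `Frame = -3` and `Frame = -2` labels in the alignments above mean that BLASTX translated the reverse complement of the query, starting at the third or second base from its 3' end. The sketch below illustrates that convention, assuming the standard genetic code; `translate_frame` and `reverse_complement` are hypothetical helper names, and this is not the BLAST implementation itself.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Complement each base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate_frame(seq, frame):
    """Translate seq in one of the six frames (+1..+3, -1..-3).

    Negative frames read the reverse complement; frame -3 therefore
    starts at the third base from the 3' end of the original sequence.
    Stop codons are written as '*', unknown codons as 'X'.
    """
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +1, +2, +3, -1, -2, -3")
    strand = seq if frame > 0 else reverse_complement(seq)
    strand = strand.upper()
    offset = abs(frame) - 1
    protein = []
    for pos in range(offset, len(strand) - 2, 3):
        protein.append(CODON_TABLE.get(strand[pos:pos + 3], "X"))
    return "".join(protein)
```

As a spot check, the last 15 bases of the contig are `CGCGGCGTGGCCAGG`; `translate_frame("CGCGGCGTGGCCAGG", -3)` yields `WPRR`, which matches the first four residues of the Query lines that begin at position 601.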



EST assembly image


no.  clone        accession  start  end
1    HCL001a10_r  AV639564       1  297
2    LC071c04_r   AV623975       4  492
3    CM065f10_r   AV391170      18  617
4    MX201b03_r   BP089121      18  495
5    HC091e07_r   AV638840      35  391
6    CM058h03_r   AV390366      35  652
7    CM058h02_r   AV390381      35  651
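
The clone table can be summarized with a short script. This is a sketch only: it assumes the two position columns are 1-based, inclusive start and end coordinates on the assembly, which the page does not state explicitly. `parse_est_rows` and `coverage_depth` are hypothetical helper names.

```python
# Rows copied from the table above: index, clone, accession, start, end.
EST_ROWS = """\
1 HCL001a10_r AV639564 1 297
2 LC071c04_r AV623975 4 492
3 CM065f10_r AV391170 18 617
4 MX201b03_r BP089121 18 495
5 HC091e07_r AV638840 35 391
6 CM058h03_r AV390366 35 652
7 CM058h02_r AV390381 35 651
"""


def parse_est_rows(text):
    """Parse whitespace-separated rows into a list of dicts."""
    ests = []
    for line in text.splitlines():
        fields = line.split()
        # Skip headers or malformed lines; data rows have five fields.
        if len(fields) != 5 or not fields[0].isdigit():
            continue
        _, clone, accession, start, end = fields
        ests.append({
            "clone": clone,
            "accession": accession,
            "start": int(start),  # assumed 1-based, inclusive
            "end": int(end),      # assumed inclusive
        })
    return ests


def coverage_depth(ests, position):
    """Number of ESTs whose interval contains the given position."""
    return sum(1 for e in ests if e["start"] <= position <= e["end"])


ests = parse_est_rows(EST_ROWS)
# Overall extent of the assembly spanned by the listed ESTs.
extent = (min(e["start"] for e in ests), max(e["end"] for e in ests))
```

`coverage_depth(ests, p)` then reports how many of the listed clones support a given assembly position `p` under the stated coordinate assumption.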




Chlamydomonas reinhardtii
Kazusa DNA Research Institute