KCC002207A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC002207A_C01 KCC002207A_c01
gcgaggCACATAGCGTTGCCGCTGTAGCAAAATTTCTACCTTGAATAGTCCGGAACACCG
CAAAACACAGCTGAAAAGCCCAAACAGGGTCAAGCTCAATAATGGCGCTCAGGCACGCCC
AAAAGCTGCACTCAGCAGCAGGGCGCTGCTCATTTCATGCCAAATGCCCGCGTGTGCGAT
ACAGCTTATCGCGTGTGAGTGCATCGTCTACTCTGGCAGTGCCAGCAAACAGCGACCATG
GCGTTGTGCACAGCCAAAGCCACCAGCCAGCTCTGCTGGCCCAGGCCCTGGCGACGTCAG
CGGGCATCGAGGATATTCTTACCTCCAAATCATGGACGGAAGTCAGGTCTCTGTGGACGC
ACCACAAAGACCACTTGCGGACCGAGGACCTGGCTGCGACCTGGGTCCGTCTTGCCAAAG
TTAGCAAGGAGCCGACGGTCCGGGCATCGCCAGAGCTGCAGAAGTTCGTGGATATCCTCG
CGTGCGCAACCATCGACCGGATACAGCAATTTTCCATCTCCTCGTTGTGCTCGATCATGT
GGGCCTCTTCGAAGCTGAAGAAGGGCCTGGGCCACTCGGGAATGTTCAAATCCTTCCTCA
AGGCCTGGGCGGCGGAGATGGAGGTGCACCTGGACCACTTAGATCTGGAGCAGTCGCGCA
AGGTGATGGTCGCCATAACGCGCGTCAACTATCAGCCCTCGGCGCCCTGGAAGATGAAGA
TGGAGGCCACGCTTACAGCCAACCTGGCCACCTGCGCCTGCCCCAAGACGCTCGCC


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC002207A_C01 KCC002207A_c01
         (776 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_627333.1| hypothetical protein SCE41.24c [Streptomyces co...    46  8e-04
ref|XP_296812.2| similar to hypothetical protein [Homo sapiens]        42  0.008
ref|NP_194514.1| proline-rich protein family [Arabidopsis thalia...    42  0.008
pir||A45974 collagen alpha 1(XIV) chain precursor, short form 2 ...    42  0.008
ref|ZP_00058564.1| COG0558: Phosphatidylglycerophosphate synthas...    42  0.011

>ref|NP_627333.1| hypothetical protein SCE41.24c [Streptomyces coelicolor A3(2)]
           gi|10241798|emb|CAC09556.1| hypothetical protein
           SCE41.24c [Streptomyces coelicolor A3(2)]
          Length = 889

 Score = 45.8 bits (107), Expect = 8e-04
 Identities = 54/172 (31%), Positives = 63/172 (36%), Gaps = 1/172 (0%)
 Frame = -3

Query: 726 PPSSSSRAPRADS*RALWRPSPCATAPDLSGPGAPPSPPPRP*GRI*TFPSGPGPSSASK 547
           PP S+   P          P+P AT P    PG PP PPP       + P+ PG +    
Sbjct: 395 PPPSAPATP----------PAPGATPPP-GTPGTPPPPPP-------SAPNAPGGT---- 432

Query: 546 RPT*SSTTRRWKIAVSGRWLRTRGYPRTSAALAMPGPSAPC*LWQDGPRSQPGPRSASGL 367
            P          +A  G   RT G      A   PGP  P       P   PGP  A   
Sbjct: 433 -PPGGMHHAATMLADPG---RTGG-----GAPQPPGPPGP-----PNPPGPPGPPGAPAA 478

Query: 366 CGASTET*LPSMIWR*EYPRCPLTSPG-PGPAELAGGFGCAQRHGRCLLALP 214
            GA     +PS       P  P   PG PG    AGG G A  H + +LA P
Sbjct: 479 PGAPGAPGVPS------GPGVPPPPPGAPGAQGSAGGAGGAVHHAQTVLAAP 524

>ref|XP_296812.2| similar to hypothetical protein [Homo sapiens]
          Length = 398

 Score = 42.4 bits (98), Expect = 0.008
 Identities = 23/54 (42%), Positives = 28/54 (51%)
 Frame = -3

Query: 762 GRRRWPGWL*AWPPSSSSRAPRADS*RALWRPSPCATAPDLSGPGAPPSPPPRP 601
           GR  +PG L    P S+ R PR+ +       + CATAP L GPG  P  PP P
Sbjct: 85  GRPHYPGHLGPPKPGSAPRPPRSKA------AAECATAPGLQGPGRDPLGPPAP 132

>ref|NP_194514.1| proline-rich protein family [Arabidopsis thaliana]
           gi|7488233|pir||T09024 proline-rich protein T27E11.90 -
           Arabidopsis thaliana gi|4972116|emb|CAB43973.1| putative
           proline-rich protein [Arabidopsis thaliana]
           gi|7269638|emb|CAB81434.1| putative proline-rich protein
           [Arabidopsis thaliana]
          Length = 577

 Score = 42.4 bits (98), Expect = 0.008
 Identities = 50/148 (33%), Positives = 56/148 (37%)
 Frame = -3

Query: 726 PPSSSSRAPRADS*RALWRPSPCATAPDLSGPGAPPSPPPRP*GRI*TFPSGPGPSSASK 547
           PP S S  P  DS      PSP   +P L  PG PPSP P P G     PS PGP S   
Sbjct: 176 PPPSPSPTPGPDSPL----PSPGPDSP-LPLPGPPPSPSPTP-GPDSPLPS-PGPDSPLP 228

Query: 546 RPT*SSTTRRWKIAVSGRWLRTRGYPRTSAALAMPGPSAPC*LWQDGPRSQPGPRSASGL 367
            P                     G P +S+    PGP +P       P   P P   S L
Sbjct: 229 LP---------------------GPPPSSS--PTPGPDSPLPSPGPPPSPSPTPGPDSPL 265

Query: 366 CGASTET*LPSMIWR*EYPRCPLTSPGP 283
                ++ LPS       P  PL SPGP
Sbjct: 266 PSPGPDSPLPS-----PGPDPPLPSPGP 288

 Score = 34.7 bits (78), Expect = 1.8
 Identities = 40/129 (31%), Positives = 47/129 (36%), Gaps = 8/129 (6%)
 Frame = -3

Query: 639 SGPGAPPSPPPRP*GRI*TFPS---GPGPSSASKRPT*SSTTRRWKIAVSGRWLRTRGYP 469
           S P  PP PPP P   +   PS    PGP S    P   S      + + G        P
Sbjct: 159 SDPPLPPPPPPYP-SPLPPPPSPSPTPGPDSPLPSPGPDS-----PLPLPGPPPSPSPTP 212

Query: 468 RTSAALAMPGPSAPC*LWQDGPRSQPGPRSASGLCG-----ASTET*LPSMIWR*EYPRC 304
              + L  PGP +P  L    P S P P   S L       + + T  P        P  
Sbjct: 213 GPDSPLPSPGPDSPLPLPGPPPSSSPTPGPDSPLPSPGPPPSPSPTPGPDSPLPSPGPDS 272

Query: 303 PLTSPGPGP 277
           PL SPGP P
Sbjct: 273 PLPSPGPDP 281

>pir||A45974 collagen alpha 1(XIV) chain precursor, short form 2 - chicken
          Length = 1747

 Score = 42.4 bits (98), Expect = 0.008
 Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 2/85 (2%)
 Frame = +2

Query: 194  CECIVYSGSASKQRPWRCAQPKPPASSAGPGPGDVSGHRGYSYLQIMDG--SQVSVDAPQ 367
            C  + ++ S S+        P PP    GPG     GHRG    +  DG   ++ V  PQ
Sbjct: 1338 CPALPHACSCSEANKGPLGPPGPPG---GPGVRGAKGHRGDPGPKGPDGPRGEIGVPGPQ 1394

Query: 368  RPLADRGPGCDLGPSCQS*QGADGP 442
             P   +GP    GPS  S QG  GP
Sbjct: 1395 GPPGPQGPPGPQGPSGLSIQGLPGP 1419

>ref|ZP_00058564.1| COG0558: Phosphatidylglycerophosphate synthase [Thermobifida fusca]
          Length = 455

 Score = 42.0 bits (97), Expect = 0.011
 Identities = 44/142 (30%), Positives = 55/142 (37%), Gaps = 4/142 (2%)
 Frame = -3

Query: 768 SWGRRRWPGWL*AWPPSSSSRAPRADS*RALWRPSPCATAPDLSGPGAPPSPPPRP---* 598
           SW  RRW      WP  S             W PSPC TA     P A     PRP    
Sbjct: 31  SWPARRWTNRSTGWPSPSPG-----------WPPSPCCTAATPWNPRASSPRSPRPWCCR 79

Query: 597 GRI*TFPSGPGPSSA-SKRPT*SSTTRRWKIAVSGRWLRTRGYPRTSAALAMPGPSAPC* 421
           G    +P   GP +A + R +  ST+     A +  WL   G+ R S     P  SAP  
Sbjct: 80  GWPCRWPGRAGPLAALTTRSSTPSTSSTRGPATTENWL---GWARCS-----PSSSAPA- 130

Query: 420 LWQDGPRSQPGPRSASGLCGAS 355
            W  G  S    R+++  C A+
Sbjct: 131 TWATGNSSS---RTSAPACSAT 149



EST assemble image


clone accession position
1 LCL066a09_r AV629781 1 325
2 MXL055e07_r BP096256 7 455
3 LCL093h01_r AV631433 12 448
4 LCL064a07_r AV629701 36 379
5 LCL074h06_r AV630178 48 388
6 HCL051e11_r AV642427 280 610
7 LCL077e08_r AV630341 335 611
8 HCL002h12_r AV639678 340 778




Chlamydomonas reinhardtii
Kazusa DNA Research Institute