KCC002745A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC002745A_C01 KCC002745A_c01
gaaggtgcgtggcttgtgggtgggcagtgtgaaggcgatgcggcggctggctcgagttct
cagctcaggaacccgtgtgaGCAGTTGGTGGATGGGCACGCAGTGTGTGCGCGGGCACAT
ATGGGTTGCGGCTGCAGTTGTTGTTACCACCCACGCAAGCAAGCGTGCTTGTGCTCAAGC
CCCGCGGCCTGCATGCCTCAGCACTTGGGCTGTACGTTTGGTCCTAATCTTTTTTGTTTG
TGTATTTCTTTCTGACGAGTGCCGATGCCATGACAGTTGTGCAGGGCGCAACCGAAGTTA
GGAAAACACAGTACACATACGTGTGTGGCGTCCTCACTAAGCCTTGCGTGTGGGCACAAA
GCGACCAGCCGAGCGGGCTGCGCGGAAGTGGGCAGGAGCAGGATAGAGGCGTGTACGTGC
AAGTCTTGTACAGGTGTACGTGTGTGTTCGTGTGTGTGGAAGCGAGTGCCTGAAGTTGCG
AGACGTGTGTGGCATGCGTCTGTCCCCCCTTCCTAAATTTCGTAATCATTGTGCATTTCG
AAAGCACCCATGCGCGTGGGCCTAAAGAGCACTGTTGTGGGCGTTTGAGTGGGCACGGAC
TGGCGCACGGCCCGGCAGGGAACCAAAAAAGCCCACCACACCACCCGGTGCAAGCACCTT
TGCCAGTGCGGCTGCGTGTGTCGAGCTAAGTGGCTCTTGAGGCAACGGCGCTGGGACAAA
GCGCACCGCGGGATGCTGATGCTCGGCGCACAGGCTGTCGGTATCTGACCGTGCGCAGGA
TGTGCCTATGTTGTGAATGGTATAAATAGGTGTGCAGCAGTACCGTATACTAAGTATGAT
GGCGCATGAAAGCTCTCGGGGGAACCCAATTCCTAAAGAAGCGTGCTCGCACCCTTGCCT
AATCAATTACTCATGCTTGCGTGAGAGTGCGTGAAACTT


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC002745A_C01 KCC002745A_c01
         (939 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAC36106.1| dJ447F3.3.1 (novel protein with WAP-type (Whey A...    38  0.22
ref|XP_173052.2| similar to dJ447F3.3.1 (novel protein with WAP-...    38  0.22
gb|AAA99804.1| 220 kDa silk protein                                    38  0.28
ref|NP_079268.1| hypothetical protein FLJ12547 [Homo sapiens] gi...    37  0.30
emb|CAC32269.1| hypothetical protein L6071.10 [Leishmania major]       37  0.37

>emb|CAC36106.1| dJ447F3.3.1 (novel protein with WAP-type (Whey Acidic Protein)
           'four-disulfide core' domains (isoform 1)) [Homo
           sapiens] gi|14270396|emb|CAC39444.1| hypothetical
           protein [Homo sapiens]
          Length = 224

 Score = 38.1 bits (87), Expect = 0.22
 Identities = 45/168 (26%), Positives = 58/168 (33%), Gaps = 18/168 (10%)
 Frame = +1

Query: 7   AWLVGGQCEGDAAAGSSSQLRNPCEQLVDGHAVCARAH----MGCGCSCCYHP------- 153
           +W+  G+    A  G     +NPC++L  G  +C         GCG  C   P       
Sbjct: 13  SWITAGE---HAKEGECPPHKNPCKELCQGDELCPAEQKCCTTGCGRICRDIPKGRKRDC 69

Query: 154 ----RKQACL--CSSPAACMPQHLGCTFGPNLFCLC-ISF*RVPMP*QLCRAQPKLGKHS 312
               RKQ+CL  C +   C      CT G N  C+  IS  ++      C A P      
Sbjct: 70  PRVIRKQSCLKRCITDETCPGVKKCCTLGCNKSCVVPISKQKLAEFGGECPADPL----P 125

Query: 313 THTCVASSLSLACGHKATSRAGCAEVGRSRIEACTCKSCTGVRVCSCV 456
                    S   GHK  S  GC       IE      C  V V  C+
Sbjct: 126 CEELCDGDASCPQGHKCCS-TGCGRTCLGDIEGGRGGDCPKVLVGLCI 172

>ref|XP_173052.2| similar to dJ447F3.3.1 (novel protein with WAP-type (Whey Acidic
           Protein) four-disulfide core domains (isoform 1)) [Homo
           sapiens] gi|32307109|ref|NP_542181.1| WAP four-disulfide
           core domain 3 isoform 1 precursor; whey acidic protein
           14; protease inhibitor WAP14 [Homo sapiens]
           gi|32363334|sp|Q8IUB2|WFD3_HUMAN WAP four-disulfide core
           domain protein 3 precursor (Putative protease inhibitor
           WAP14) gi|25005079|gb|AAN70993.1|AF488306_1 probable
           protease inhibitor WAP14 precursor [Homo sapiens]
          Length = 231

 Score = 38.1 bits (87), Expect = 0.22
 Identities = 45/168 (26%), Positives = 58/168 (33%), Gaps = 18/168 (10%)
 Frame = +1

Query: 7   AWLVGGQCEGDAAAGSSSQLRNPCEQLVDGHAVCARAH----MGCGCSCCYHP------- 153
           +W+  G+    A  G     +NPC++L  G  +C         GCG  C   P       
Sbjct: 20  SWITAGE---HAKEGECPPHKNPCKELCQGDELCPAEQKCCTTGCGRICRDIPKGRKRDC 76

Query: 154 ----RKQACL--CSSPAACMPQHLGCTFGPNLFCLC-ISF*RVPMP*QLCRAQPKLGKHS 312
               RKQ+CL  C +   C      CT G N  C+  IS  ++      C A P      
Sbjct: 77  PRVIRKQSCLKRCITDETCPGVKKCCTLGCNKSCVVPISKQKLAEFGGECPADPL----P 132

Query: 313 THTCVASSLSLACGHKATSRAGCAEVGRSRIEACTCKSCTGVRVCSCV 456
                    S   GHK  S  GC       IE      C  V V  C+
Sbjct: 133 CEELCDGDASCPQGHKCCS-TGCGRTCLGDIEGGRGGDCPKVLVGLCI 179

>gb|AAA99804.1| 220 kDa silk protein
          Length = 1704

 Score = 37.7 bits (86), Expect = 0.28
 Identities = 48/228 (21%), Positives = 74/228 (32%), Gaps = 8/228 (3%)
 Frame = +1

Query: 25  QCEGDAAA-GSSSQLRNPCEQLVDGHAVCARAHMGCGCSCCYHPRKQACLCSSPAACMPQ 201
           +C+GD    G  +  +N C        +C  A    GC+        +C C+ PA    +
Sbjct: 206 ECKGDGQCQGPKTWCKNNCR------CICPTAEPTGGCAAPLRWDDDSCSCACPANMEKK 259

Query: 202 HLGCT-----FGPNLF-CLCISF*RVPMP*QLCRAQPKLGKHSTHTCVASSLSLACGHK- 360
              CT     + PN   C C            C A  +  K +      + L    G   
Sbjct: 260 KEKCTESGRIWNPNTCECGCAKLD--------CPAGKEANKETCECNCKNELKCEGGQVF 311

Query: 361 ATSRAGCAEVGRSRIEACTCKSCTGVRVCSCVWKRVPEVARRVWHASVPPS*IS*SLCIS 540
                 C   G  + +ACT      V+ CSC   + P   ++       P       CI 
Sbjct: 312 CKDSCSCVCPGSDKDKACTAPHFYDVQSCSC---QCPVNMQKPSGGCPKPQNWDKDNCIC 368

Query: 541 KAPMRVGLKSTVVGV*VGTDWRTARQGTKKAHHTTRCKHLCQCGCVCR 684
           + P++   K+  V                 A+   RC   C+C C+ R
Sbjct: 369 ECPVKHECKNGKVWDSTQCQCICPTDAPPCANGKERCDETCECACINR 416

>ref|NP_079268.1| hypothetical protein FLJ12547 [Homo sapiens]
           gi|10434098|dbj|BAB14128.1| unnamed protein product
           [Homo sapiens]
          Length = 307

 Score = 37.0 bits (84), Expect = 0.48
 Identities = 40/136 (29%), Positives = 52/136 (37%), Gaps = 5/136 (3%)
 Frame = +1

Query: 76  CEQLVDGHAVCARAHMGCGCSCCYHPRKQACLCSSPAACMPQHLGCTFGPNL---FCLCI 246
           C +L  G  +C  A   C C C +      C+C+    C+     C  G +L    CLC+
Sbjct: 89  CVRLCVGVHICVCA---CVCGCTF-----VCVCACVCGCL-----CVCGAHLCVCVCLCV 135

Query: 247 SF*RVPMP*QLCRAQPKLGKHSTHTCVASSLSLACGHKATSRAGCAEV-GRSRIEACTCK 423
                 +   LC      G   T  CV      AC    T    CA V G + +  C C 
Sbjct: 136 G---AHLCVCLCVCACVWG--CTFVCVC-----ACVWGCTFVCVCACVCGCTFVCVCACV 185

Query: 424 -SCTGVRVCSCVWKRV 468
             CT V VC CVW  +
Sbjct: 186 CGCTFVCVCLCVWVHI 201

 Score = 30.0 bits (66), Expect(2) = 0.30
 Identities = 33/128 (25%), Positives = 41/128 (31%)
 Frame = +1

Query: 76  CEQLVDGHAVCARAHMGCGCSCCYHPRKQACLCSSPAACMPQHLGCTFGPNLFCLCISF* 255
           C  +  G  +C R  +G     C      AC+C     C+     C  G    CLC+   
Sbjct: 79  CVPVCGGAHLCVRLCVGVHICVC------ACVCGCTFVCV---CACVCG----CLCV--- 122

Query: 256 RVPMP*QLCRAQPKLGKHSTHTCVASSLSLACGHKATSRAGCAEVGRSRIEACTCKSCTG 435
                   C A         H CV   L +   H       CA V            CT 
Sbjct: 123 --------CGA---------HLCVCVCLCVGA-HLCVCLCVCACVW----------GCTF 154

Query: 436 VRVCSCVW 459
           V VC+CVW
Sbjct: 155 VCVCACVW 162

 Score = 26.6 bits (57), Expect(2) = 0.30
 Identities = 10/23 (43%), Positives = 11/23 (47%)
 Frame = +3

Query: 435 CTCVFVCVEASA*SCETCVACVC 503
           C C FVCV A    C     C+C
Sbjct: 174 CGCTFVCVCACVCGCTFVCVCLC 196

>emb|CAC32269.1| hypothetical protein L6071.10 [Leishmania major]
          Length = 259

 Score = 37.4 bits (85), Expect = 0.37
 Identities = 22/55 (40%), Positives = 26/55 (47%), Gaps = 15/55 (27%)
 Frame = -3

Query: 466 LASTHTNTHVHLYKTCTYTPLS-------CSCPLPR--------SPLGWSLCAHT 347
           L  THT+TH H ++TCT+   S       C CP  R        SPL  SL  HT
Sbjct: 28  LPLTHTHTHTHTHRTCTHIIFSEPHIVFYCVCPFTRCHRRRPLPSPLSLSLPQHT 82



EST assemble image


clone accession position
1 LC090a02_r AV625252 1 534
2 MX228c07_r BP090734 252 690
3 LC080a01_r AV624598 269 777
4 LC002d03_r AV618994 374 939




Chlamydomonas reinhardtii
Kazusa DNA Research Institute