KCC001544A_c01

Fasta Sequence
>KCC001544A_C01 KCC001544A_c01
GATGGCATCCTGGGCATGGGCTTCCCCGCCATCAGCGTGCAGCACGTGCCGCCGCCCTTC
ACCCGCCTGGTGGAGGAGGGCGGGCTGGCAGCGCCCGTGTTCAGCTTCTGGCTGAACCGC
GACCCCAACGCGCCCAACGGCGGCGAGCTGGTGTTGGGCGGCATTGACCCTACCCACTTC
ACCGTGCGAGCACACCTGGGTTCCAGTCACTCGCCAGGGCTACTGGCAGTTCAACATGGA
GGGCCTGGACCTGGGGCCCGGCAGCCAGAAGATGTGCGCCAAGGGCTGCGCCGCCATTGC
CGACACCGGCACCTCCCTCATCGCCGGCCCCTCGGACGAGGTGGCCGCGCTCAACCACGC
CATCGGCGCCACCTCTGCGCTGTCGGCCCAGTGCCGCCAGCTGGTGCGCGACTACCTGCC
GCAGATCATCGCGCAGCTGCACGACCTGCCGCTGGACCAGGTCTGCGCCAGCATTGGTCT
GTGCCCCATGGCCGCCGCCTCCACCATCAAGCCCGCTCGCCGCCTGCTCGCCACTACTAC
TGCCGCCGGTACGCACAGCATTCGCACCAGCAGCGGCGCTGCCGCCGTTGCTGACGAGGC
CGCCGCCGGCGACGCCTCCGACGTCGACGCCGCCGTTGCCGCCGTCAAGGCCCAGCTCGC
GAACCTGCTCGGCCACGCCGCCGCCGGCGCAACCACCACCAACGGCCGCGGCGCCGCCGC
CAGCGATGGCGGCGTATCGGGCGTTATCTCCAAGCTGGTTGGCGAGGCCGCCGCCAAGGC
TCAGGGCTCCAAGGCTGAGTCGGCCGGCGACAGTGTGGTGTGCAGCTTCTGCCAGACGGC
TGTGGCCTACATCAAGATTGCGCTGCAGTCCAACTCCACCATCGAGCAGATCGCCGACGC
AGTGGGTCAGCTGTGCGACCAGGTGTCGTTCGGCGGCCCGAGTGTGGTGGATTGTGACAA
GATCTCCACCCTGCCCGTCATCAGCTTCAACATCGGCGGCCGCGTGTTCCCGCTGCGGCC
CGAGCAGTACGTACTGCAGTTGGACGCGGGCGGCGGCGAGATGCAGTGCATCAGCGGCTT
CATGGGCCTGGACGTGCCGGCCGGGCCCCTGGTGGATCCTGGGAGACATATTCCTGGGCG
CCTATCACACCGTGTTCGACTACGGCGCAGCGCGCCTGGGCTTCGCCAATGCGGCTTAGG
TAAAGGAGGAGAGGCCTGTGGGAGGAGAGGCCCGGAAGCGTCTGGGAGAGGAGGCGCCAA


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC001544A_C01 KCC001544A_c01
         (1260 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii]       603  0.0
ref|NP_172655.1| aspartic proteinase -related [Arabidopsis thali...   154  4e-61
gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]             154  4e-61
emb|CAC86004.1| aspartic proteinase [Theobroma cacao]                 159  4e-52
ref|NP_176419.2| aspartic protease -related [Arabidopsis thalian...   114  9e-50

>emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii]
          Length = 578

 Score =  603 bits (1554), Expect(3) = 0.0
 Identities = 307/308 (99%), Positives = 307/308 (99%)
 Frame = +2

Query: 188  EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
            EHTWVPVTRQGYWQF MEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG
Sbjct: 243  EHTWVPVTRQGYWQFTMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 302

Query: 368  ATSALSAQCRQLVRDYLPQIIAQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTAA 547
            ATSALSAQCRQLVRDYLPQIIAQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTAA
Sbjct: 303  ATSALSAQCRQLVRDYLPQIIAQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTAA 362

Query: 548  GTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAASD 727
            GTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAASD
Sbjct: 363  GTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAASD 422

Query: 728  GGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAVG 907
            GGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAVG
Sbjct: 423  GGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAVG 482

Query: 908  QLCDQVSFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGFMG 1087
            QLCDQVSFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGFMG
Sbjct: 483  QLCDQVSFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGFMG 542

Query: 1088 LDVPAGPL 1111
            LDVPAGPL
Sbjct: 543  LDVPAGPL 550

 Score =  132 bits (333), Expect(3) = 0.0
 Identities = 61/61 (100%), Positives = 61/61 (100%)
 Frame = +1

Query: 1   DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
           DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF
Sbjct: 181 DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 240

Query: 181 T 183
           T
Sbjct: 241 T 241

 Score = 67.0 bits (162), Expect(3) = 0.0
 Identities = 30/31 (96%), Positives = 30/31 (96%)
 Frame = +3

Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANAA 1196
            GP WILGDIFLGAYHTVFDYGAARLGFANAA
Sbjct: 548  GPLWILGDIFLGAYHTVFDYGAARLGFANAA 578

>ref|NP_172655.1| aspartic proteinase -related [Arabidopsis thaliana]
            gi|25290005|pir||F86253 hypothetical protein [imported] -
            Arabidopsis thaliana gi|3157937|gb|AAC17620.1| Identical
            to aspartic proteinase cDNA gb|U51036 from A. thaliana.
            ESTs gb|N96313, gb|T21893, gb|R30158, gb|T21482,
            gb|T43650, gb|R64749, gb|R65157, gb|T88269, gb|T44552,
            gb|T22542, gb|T76533, gb|T44350, gb|Z34591, gb|AA728734,
            gb|T46003, gb|R65157, gb|N38290, gb|AA395468, gb|T20815
            and gb|Z34173 come from this gene. [Arabidopsis thaliana]
            gi|15912219|gb|AAL08243.1| At1g11910/F12F1_24
            [Arabidopsis thaliana] gi|15912251|gb|AAL08259.1|
            At1g11910/F12F1_24 [Arabidopsis thaliana]
            gi|17381036|gb|AAL36330.1| putative aspartic proteinase
            [Arabidopsis thaliana] gi|21617929|gb|AAM66979.1|
            putative aspartic proteinase [Arabidopsis thaliana]
            gi|25055040|gb|AAN71979.1| putative aspartic proteinase
            [Arabidopsis thaliana]
          Length = 506

 Score =  154 bits (388), Expect(3) = 4e-61
 Identities = 106/312 (33%), Positives = 152/312 (47%), Gaps = 4/312 (1%)
 Frame = +2

Query: 188  EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
            +HT+VPVT++GYWQF+M  + +G      C  GC+AIAD+GTSL+AGP+  +  +NHAIG
Sbjct: 249  KHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIG 308

Query: 368  ATSALSAQCRQLVRDYLPQII-AQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTA 544
            A   +S QC+ +V  Y   I+   L +    ++C+ IGLC                  T 
Sbjct: 309  AAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLC------------------TF 350

Query: 545  AGTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAAS 724
             GT  +  S G  +V D+                 A+L+N +G AA  A           
Sbjct: 351  DGTRGV--SMGIESVVDKE---------------NAKLSNGVGDAACSA----------- 382

Query: 725  DGGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAV 904
                                               C+ AV +I+  L+ N T E+I + V
Sbjct: 383  -----------------------------------CEMAVVWIQSQLRQNMTQERILNYV 407

Query: 905  GQLCDQV-SFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGF 1081
             +LC+++ S  G S VDC ++ST+P +S  IGG+VF L PE+YVL++   G   QCISGF
Sbjct: 408  NELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKV-GEGPVAQCISGF 466

Query: 1082 MGLDV--PAGPL 1111
            + LDV  P GPL
Sbjct: 467  IALDVAPPRGPL 478

 Score = 72.8 bits (177), Expect(3) = 4e-61
 Identities = 32/60 (53%), Positives = 42/60 (69%)
 Frame = +1

Query: 1   DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
           DGILG+GF  ISV    P +  ++++G +  PVFSFWLNR+ +   GGELV GG+DP HF
Sbjct: 187 DGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHF 246

 Score = 53.1 bits (126), Expect(3) = 4e-61
 Identities = 21/31 (67%), Positives = 26/31 (83%)
 Frame = +3

Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANAA 1196
            GP WILGD+F+G YHTVFD+G  ++GFA AA
Sbjct: 476  GPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 506

>gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]
          Length = 486

 Score =  154 bits (388), Expect(3) = 4e-61
 Identities = 106/312 (33%), Positives = 152/312 (47%), Gaps = 4/312 (1%)
 Frame = +2

Query: 188  EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
            +HT+VPVT++GYWQF+M  + +G      C  GC+AIAD+GTSL+AGP+  +  +NHAIG
Sbjct: 229  KHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIG 288

Query: 368  ATSALSAQCRQLVRDYLPQII-AQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTA 544
            A   +S QC+ +V  Y   I+   L +    ++C+ IGLC                  T 
Sbjct: 289  AAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLC------------------TF 330

Query: 545  AGTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAAS 724
             GT  +  S G  +V D+                 A+L+N +G AA  A           
Sbjct: 331  DGTRGV--SMGIESVVDKE---------------NAKLSNGVGDAACSA----------- 362

Query: 725  DGGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAV 904
                                               C+ AV +I+  L+ N T E+I + V
Sbjct: 363  -----------------------------------CEMAVVWIQSQLRQNMTQERILNYV 387

Query: 905  GQLCDQV-SFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGF 1081
             +LC+++ S  G S VDC ++ST+P +S  IGG+VF L PE+YVL++   G   QCISGF
Sbjct: 388  NELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKV-GEGPVAQCISGF 446

Query: 1082 MGLDV--PAGPL 1111
            + LDV  P GPL
Sbjct: 447  IALDVAPPRGPL 458

 Score = 72.8 bits (177), Expect(3) = 4e-61
 Identities = 32/60 (53%), Positives = 42/60 (69%)
 Frame = +1

Query: 1   DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
           DGILG+GF  ISV    P +  ++++G +  PVFSFWLNR+ +   GGELV GG+DP HF
Sbjct: 167 DGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHF 226

 Score = 53.1 bits (126), Expect(3) = 4e-61
 Identities = 21/31 (67%), Positives = 26/31 (83%)
 Frame = +3

Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANAA 1196
            GP WILGD+F+G YHTVFD+G  ++GFA AA
Sbjct: 456  GPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 486

>emb|CAC86004.1| aspartic proteinase [Theobroma cacao]
          Length = 514

 Score =  159 bits (401), Expect(2) = 4e-52
 Identities = 109/322 (33%), Positives = 155/322 (47%), Gaps = 4/322 (1%)
 Frame = +2

Query: 188  EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
            +HT+VPVT++GYWQF+M  + +       CA  CAAIAD+GTSL+AGPS  +  +NHAIG
Sbjct: 257  KHTYVPVTQKGYWQFDMGDVLIADKPTGYCAGSCAAIADSGTSLLAGPSTVITMINHAIG 316

Query: 368  ATSALSAQCRQLVRDYLPQIIAQL-HDLPLDQVCASIGLCPMAAASTIKPARRLLATTTA 544
            AT  +S +C+ +V+ Y   II  L  +    ++C+ IGLC                  T 
Sbjct: 317  ATGVVSQECKAVVQQYGRTIIDLLIAEAQPQKICSQIGLC------------------TF 358

Query: 545  AGTHSIRTSSGAAAVADEAAAGDASDVDAAVAAVKAQLANLLGHAAAGATTTNGRGAAAS 724
             G H +  S+G  +V DE                                         S
Sbjct: 359  NGAHGV--STGIESVVDE-----------------------------------------S 375

Query: 725  DGGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAV 904
            +G  SGV+                       +C  C+ AV +++  ++ N T ++I   V
Sbjct: 376  NGKSSGVLR--------------------DAMCPACEMAVVWMQNQVRQNQTQDRILSYV 415

Query: 905  GQLCDQV-SFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGF 1081
             +LCD+V +  G S VDC  +S++P ISF IGG+VF L PE+Y+L++   G E QCISGF
Sbjct: 416  NELCDRVPNPMGESAVDCGSLSSMPTISFTIGGKVFDLTPEEYILKV-GEGSEAQCISGF 474

Query: 1082 MGLDV--PAGPLVDPGRHIPGR 1141
              LD+  P GPL   G    GR
Sbjct: 475  TALDIPPPRGPLWILGDIFMGR 496

 Score = 70.1 bits (170), Expect(2) = 4e-52
 Identities = 30/60 (50%), Positives = 42/60 (70%)
 Frame = +1

Query: 1   DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
           DGILG+GF  ISV    P +  ++++G +  PVFSFWLNR+ +   GGE+V GG+DP H+
Sbjct: 195 DGILGLGFKEISVGDAVPVWYNMIKQGLIKEPVFSFWLNRNVDEEAGGEIVFGGVDPNHY 254

 Score = 55.5 bits (132), Expect = 2e-06
 Identities = 23/31 (74%), Positives = 26/31 (83%)
 Frame = +3

Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANAA 1196
            GP WILGDIF+G YHTVFD+G  R+GFA AA
Sbjct: 484  GPLWILGDIFMGRYHTVFDFGKLRVGFAEAA 514

>ref|NP_176419.2| aspartic protease -related [Arabidopsis thaliana]
            gi|17979428|gb|AAL49856.1| putative aspartic protease
            [Arabidopsis thaliana] gi|23297031|gb|AAN13225.1|
            putative aspartic protease [Arabidopsis thaliana]
          Length = 513

 Score =  114 bits (285), Expect(3) = 9e-50
 Identities = 96/312 (30%), Positives = 134/312 (42%), Gaps = 4/312 (1%)
 Frame = +2

Query: 188  EHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAGPSDEVAALNHAIG 367
            EHT+VPVT++GYWQF+M  + +   S   C  GC+AIAD+GTSL+AGP+  VA +N AIG
Sbjct: 256  EHTFVPVTQRGYWQFDMGEVLIAGESTGYCGSGCSAIADSGTSLLAGPTAVVAMINKAIG 315

Query: 368  ATSALSAQCRQLVRDYLPQII-AQLHDLPLDQVCASIGLCPMAAASTIKPARRLLATTTA 544
            A+  +S QC+ +V  Y   I+   L +    ++C+ IGLC                    
Sbjct: 316  ASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLC------------------AY 357

Query: 545  AGTHSIRTSSGAAAVADEAAAGDASDV-DAAVAAVKAQLANLLGHAAAGATTTNGRGAAA 721
             GTH +  S G  +V D+     +S + DA   A +  +  +        T         
Sbjct: 358  DGTHGV--SMGIESVVDKENTRSSSGLRDAGCPACEMAVVWIQSQLRQNMTQER------ 409

Query: 722  SDGGVSGVISKLVGEAAAKAQGSKAESAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADA 901
                    I   + E   +      ESA D                              
Sbjct: 410  --------IVNYINEICERMPSPNGESAVD------------------------------ 431

Query: 902  VGQLCDQVSFGGPSVVDCDKISTLPVISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGF 1081
                C Q+S          K+ T+   SF IGG+VF L PE+YVL++   G   QCISGF
Sbjct: 432  ----CSQLS----------KMPTV---SFTIGGKVFDLAPEEYVLKI-GEGPVAQCISGF 473

Query: 1082 MGLDV--PAGPL 1111
              LD+  P GPL
Sbjct: 474  TALDIPPPRGPL 485

 Score = 75.9 bits (185), Expect(3) = 9e-50
 Identities = 31/60 (51%), Positives = 44/60 (72%)
 Frame = +1

Query: 1   DGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGELVLGGIDPTHF 180
           DG+LG+GF  I+V +  P +  ++++G +  PVFSFWLNRDP +  GGE+V GG+DP HF
Sbjct: 194 DGLLGLGFQEIAVGNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEEGGEIVFGGVDPKHF 253

 Score = 51.6 bits (122), Expect(3) = 9e-50
 Identities = 20/30 (66%), Positives = 25/30 (82%)
 Frame = +3

Query: 1104 GPWWILGDIFLGAYHTVFDYGAARLGFANA 1193
            GP WILGD+F+G YHTVFD+G  ++GFA A
Sbjct: 483  GPLWILGDVFMGKYHTVFDFGNEQVGFAEA 512
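
For forward-strand hits such as those above, the Frame value and the number of translated query residues in a segment follow from the Query coordinates by simple arithmetic. The following is a minimal Python sketch of that check, assuming 1-based inclusive nucleotide coordinates as printed in the report. The function names are hypothetical, and the calculation counts query codons only, without considering gap columns.

```python
def forward_frame(query_start: int) -> int:
    """Return the forward reading frame (1, 2 or 3) for a 1-based query start."""
    return (query_start - 1) % 3 + 1


def query_codons(query_start: int, query_end: int) -> int:
    """Return how many whole codons the nucleotide range start..end spans."""
    span = query_end - query_start + 1
    if span % 3 != 0:
        raise ValueError("range does not cover a whole number of codons")
    return span // 3


# Query start/end of the three segments reported for the top hit (CAE18153.1).
segments = [(188, 1111), (1, 183), (1104, 1196)]

for start, end in segments:
    print(f"Query {start}-{end}: frame +{forward_frame(start)}, "
          f"{query_codons(start, end)} codons")
```

For the three segments of the top hit this gives frames +2, +1 and +3 with 308, 61 and 31 codons, which agrees with the Frame lines and the denominators of the Identities lines in the report.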



EST assemble image


no.  clone        accession  position (start end)
 1   MX004g03_r   BP086236      1    475
 2   MX004h05_r   BP086243    114    396
 3   HC078a10_r   AV637828    134    629
 4   HCL025g03_r  AV640983    275    694
 5   CM061b10_r   AV390596    326    696
 6   HCL056a12_r  AV642687    338    855
 7   CM075a01_r   AV391584    412   1009
 8   CM094g01_r   AV393281    601   1151
 9   HC011h03_r   AV632726    949   1323




Chlamydomonas reinhardtii
Kazusa DNA Research Institute