KCC000543A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC000543A_C01 KCC000543A_c01
ggaggctgctgaacgtcaatgccaCGGACCCGCAACAGAGAACAAGCGACGTGGTAATTG
GCGTCACGGACAGTGGTGCCTTCGTGAATCACCCTGACCTCATTGGGTCGTTCTGGGAGA
ACCCGGCTGAAATTGACAATGATCTGCGGGACAACGACAACAACACCTACATTGATGACG
TCTATGGCGCCTGCTTCGCCAACTCGGTGTGCTCGCCCACTTCGCTTAACTCTAACCTGT
CGCGGTGCGGCATCGGCAAAAACACATTCGCTTGGAACAACGTCAACGACATCAACAGCC
ACGGCACGAAAATCGCTGGTGTCATCGGTGCGCGGCCCAACAATGGCATCGGCCTGGCGG
GTGTGGCGCCCAACTTGCGCCAGATGGTTCTCAAGGTGGTGGATGACACCTACTTTCAGT
ACGCATATAGCGACGTGGTGAGGGCCATCGACTACGGGTACGCCAAGGGTGCGCGCATCT
TCTCCATGTCGTTTGGCCAGGACGCCCGCACCAGTGCCACGCCCACCAACAAGCCTAGTC
TGGACGCGGCTGCCACTGCCTACCGCAACCTGTTTAACAAGTACTCCAACGCCCTGTTCG
TGGCAGCAGCAGGCAACGAGTGGACGAGCCTGGAGGGCTGGCGCTCAGGCAACTACACCT
ACTCGCCCTGCATGATCGCCACCGACAACACGCTCTGCGTGGGCGGCACCAACGTGAACG
ACACAATCTTTTACGTCTTTGCCTTCAACCAGCAGGCCGGCACGAACTTCGGCCCCACCA
CAGTGGACATGGGCGCGCCCGCACACAGCATTTACACAACCGACATTGCCGTCAATAGAA
ACTACAGCGCCCCTTCGGGTACCTCTTTCGCTACCCCCATGGTGGCGGCAGTGGCGGGTC
TGGTGCTGTCGGCGCTGGGCGGCACCGGGCGGGCCACTCCGCAGACACCCCTGCAGATTA
AGAACATCCTAATGAGCTCTGGCGATCTGCTGCCCAGTCTGAACAACCAGTTCAAGAGTG
CGCGCCGCCTGAACGCCGCCCAACGCTGTGGCGGCCGCCCTGACCCTGGCCCGGACCAAC
CGCACTACCGTTGTCCGGGAGCTGGATGGCAGCACGGCCGCCGCCGTCGGTGCTGTCACC
GCCATGCAGGCGTGGGAGTACATATGGTACACGGGTGTTTACACGGATGGTGCCTTCGAC
AACTTT


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC000543A_C01 KCC000543A_c01
         (1206 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|ZP_00074711.1| COG1404: Subtilisin-like serine proteases [Tr...   153  7e-36
ref|NP_485039.1| protease [Nostoc sp. PCC 7120] gi|25535083|pir|...   122  1e-26
ref|NP_716498.1| serine protease, subtilase family [Shewanella o...   121  3e-26
ref|ZP_00081441.1| COG1404: Subtilisin-like serine proteases [Ge...   112  1e-23
ref|ZP_00107910.1| COG1404: Subtilisin-like serine proteases [No...   110  5e-23

>ref|ZP_00074711.1| COG1404: Subtilisin-like serine proteases [Trichodesmium erythraeum
            IMS101]
          Length = 1349

 Score =  153 bits (386), Expect = 7e-36
 Identities = 116/356 (32%), Positives = 163/356 (45%)
 Frame = +3

Query: 15   VNATDPQQRTSDVVIGVTDSGAFVNHPDLIGSFWENPAEIDNDLRDNDNNTYIDDVYGAC 194
            +NA D Q  + D+V+GV D+G   +HPDL  + W N  E   +  D+DNN YIDD YG  
Sbjct: 452  LNAWDIQTGSKDIVVGVIDTGIDYSHPDLANNMWTNSGETPGNGIDDDNNGYIDDYYGYD 511

Query: 195  FANSVCSPTSLNSNLSRCGIGKNTFAWNNVNDINSHGTKIAGVIGARPNNGIGLAGVAPN 374
            FA     P                       D  SHGT +AG IGA  NNG+G+ GV   
Sbjct: 512  FAYDDGDPM----------------------DRQSHGTHVAGTIGAEGNNGVGVVGVNHQ 549

Query: 375  LRQMVLKVVDDTYFQYAYSDVVRAIDYGYAKGARIFSMSFGQDARTSATPTNKPSLDAAA 554
               M +K ++D      + D + A++Y    GA I + S+G    +          DA A
Sbjct: 550  TDLMAIKFLNDQGSGSTF-DAILAVEYATMMGADITNNSWGGGGFSQGL------YDAIA 602

Query: 555  TAYRNLFNKYSNALFVAAAGNEWTSLEGWRSGNYTYSPCMIATDNTLCVGGTNVNDTIFY 734
             A        +N+LFVAAAGN   + +     N    P     +N + V  T+ ND    
Sbjct: 603  AAGE------ANSLFVAAAGNSSRNTD-----NSPSYPASYDLENIIAVAATDKND---- 647

Query: 735  VFAFNQQAGTNFGPTTVDMGAPAHSIYTTDIAVNRNYSAPSGTSFATPMVAAVAGLVLSA 914
                N    +N+G TTVD+GAP   I +T       Y++ SGTS A+P VA VA LVL+ 
Sbjct: 648  ----NMSGFSNYGATTVDLGAPGSGILST--VPGERYASYSGTSMASPHVAGVAALVLAE 701

Query: 915  LGGTGRATPQTPLQIKNILMSSGDLLPSLNNQFKSARRLNAAQRCGGRPDPGPDQP 1082
                  A      ++K I++ + D + +LN +  +  RLNA         P P+ P
Sbjct: 702  NPDLSYA------EVKEIILDNVDSISALNGKTLTGGRLNADNALSAMGSP-PENP 750

>ref|NP_485039.1| protease [Nostoc sp. PCC 7120] gi|25535083|pir||AI1930 proteinase
            [imported] - Nostoc sp. (strain PCC 7120)
            gi|17130342|dbj|BAB72953.1| protease [Nostoc sp. PCC
            7120]
          Length = 488

 Score =  122 bits (306), Expect = 1e-26
 Identities = 100/326 (30%), Positives = 152/326 (45%), Gaps = 5/326 (1%)
 Frame = +3

Query: 51   VVIGVTDSGAFVNHPDLIGSFWENPAEIDNDLRDNDNNTYIDDVYGACFANSVCSPTSLN 230
            +++ V D+G   NH DL  + W N  EI  +  D+D N Y+DDV+G  F          N
Sbjct: 112  IIVAVIDTGVDTNHEDLRNNIWTNSKEIAGNGIDDDGNGYVDDVHGWNF----------N 161

Query: 231  SNLSRCGIGKNTFAWNNVNDINSHGTKIAGVIGARPNNGIGLAGVAPNLRQMVLKVVDDT 410
             N             NN  D N HGT ++G+I A  NNG G+ GVA N + M +KV+D++
Sbjct: 162  DNN------------NNTLDNNGHGTHVSGII-AGGNNGFGVTGVAYNSQIMAVKVLDES 208

Query: 411  YFQYAYSDVVRAIDYGYAKGARIFSMSFGQDARTSATPTNKPSLDAAATAYRNLFNKYSN 590
                +YS +   I Y    GA++ ++S G D   S++ T K +++ A++           
Sbjct: 209  -GSGSYSAIANGIYYAVDNGAKVINLSLGGD---SSSRTLKSAIEYASS---------KG 255

Query: 591  ALFVAAAGNEWTSLEGWRSGNYTYSPCMIATDNTLCVGGTNVNDTIFYVFAFNQQAGTNF 770
            A+ V AAGN+  S   +        P   A    + VG  + N  +     F+ ++G   
Sbjct: 256  AIVVMAAGNDGESAPDY--------PARYANQTGIAVGAVDANKNL---TDFSNRSGNT- 303

Query: 771  GPTTVDMGAPAHSIYTTDIAVNRNYSAPSGTSFATPMVAAVAGLVLSALGGTGRATPQTP 950
              T   + AP  S+Y++    N  Y+  SGTS ATP VA V  L+LSA          T 
Sbjct: 304  --TMAYVTAPGQSVYSS--VPNNQYANYSGTSMATPYVAGVVALMLSANPNL------TE 353

Query: 951  LQIKNILMS-----SGDLLPSLNNQF 1013
             Q+++I+ S     S D  P  ++ F
Sbjct: 354  AQVRDIITSTAGNTSNDTTPKTDSDF 379

>ref|NP_716498.1| serine protease, subtilase family [Shewanella oneidensis MR-1]
            gi|24346442|gb|AAN53943.1|AE015532_3 serine protease,
            subtilase family [Shewanella oneidensis MR-1]
          Length = 818

 Score =  121 bits (303), Expect = 3e-26
 Identities = 108/340 (31%), Positives = 160/340 (46%), Gaps = 7/340 (2%)
 Frame = +3

Query: 42   TSDVVIGVTDSGAFVNHPDLIGSFWENPAEIDNDLRDNDNNTYIDDVYGACFANSVCSPT 221
            +SDVVIGV D+G   NHPDL  + W N  EI  +  D+D N  IDD++G          +
Sbjct: 139  SSDVVIGVIDTGVDYNHPDLQANMWVNAGEIAGNGIDDDANGVIDDIHGY---------S 189

Query: 222  SLNSNLSRCGIGKNTFAWNNVNDINSHGTKIAGVIGARPNNGIGLAGVAPNLRQMVLKVV 401
            ++N+N              N  D N HGT ++G IGA+ NNG+G+ GV  +++    + +
Sbjct: 190  AVNNN-------------GNPMDGNGHGTHVSGTIGAKGNNGVGVVGVNWDVKIAGCQFL 236

Query: 402  D-DTYFQYAYSDVVRAIDY------GYAKGARIFSMSFGQDARTSATPTNKPSLDAAATA 560
            D D Y   A    +  IDY       +    +  + S+G    + A    K +++A   A
Sbjct: 237  DTDGYGSTA--GAIACIDYFTNLKVNHGVDIKATNNSWGGGGFSQAL---KDAIEAGGEA 291

Query: 561  YRNLFNKYSNALFVAAAGNEWTSLEGWRSGNYTYSPCMIATDNTLCVGGTNVNDTIFYVF 740
                       LFVAAAGN+  +++   S +Y   P    +D    +  T  ND    + 
Sbjct: 292  ---------GILFVAAAGND--AVDNDASPHY---PSSYNSDVVFSIASTTRNDR---MS 334

Query: 741  AFNQQAGTNFGPTTVDMGAPAHSIYTTDIAVNRNYSAPSGTSFATPMVAAVAGLVLSALG 920
             F+Q     +G T+VDMGAP  +I +T       Y+  SGTS ATP V   A LV +   
Sbjct: 335  DFSQ-----WGLTSVDMGAPGSAILST--VRGGGYATYSGTSMATPHVTGAAALVWAL-- 385

Query: 921  GTGRATPQTPLQIKNILMSSGDLLPSLNNQFKSARRLNAA 1040
                    TP+++K +LM+SGD    L  +  +  RLN A
Sbjct: 386  ----NPDLTPVEMKELLMASGDANADLTGKTVAGTRLNVA 421

>ref|ZP_00081441.1| COG1404: Subtilisin-like serine proteases [Geobacter metallireducens]
          Length = 519

 Score =  112 bits (280), Expect = 1e-23
 Identities = 100/340 (29%), Positives = 148/340 (43%), Gaps = 1/340 (0%)
 Frame = +3

Query: 21   ATDPQQRTSDVVIGVTDSGAFVNHPDLIGSFWENPAEIDNDLR-DNDNNTYIDDVYGACF 197
            A D    ++ VV+ V DSG   NHPDL  + W N AE++     D+D +  +DD+YG   
Sbjct: 134  AWDNTTGSAGVVVAVIDSGVDYNHPDLKANMWINQAELNGKPGIDDDGDGVVDDIYGY-- 191

Query: 198  ANSVCSPTSLNSNLSRCGIGKNTFAWNNVNDINSHGTKIAGVIGARPNNGIGLAGVAPNL 377
                     +N+N              N  D N HGT +AG IGA  NNGIG+AGV   +
Sbjct: 192  -------NGVNNN-------------GNPMDNNGHGTHVAGTIGAVGNNGIGVAGVNWTV 231

Query: 378  RQMVLKVVDDTYFQYAYSDVVRAIDYGYAKGARIFSMSFGQDARTSATPTNKPSLDAAAT 557
            + M  K +D     Y  SD +  + Y     +R  ++             N       + 
Sbjct: 232  KIMACKFLDANGSGYT-SDAIECLQYVKKMKSRGVNI---------VATNNSWGGGGYSR 281

Query: 558  AYRNLFNKYSNALFVAAAGNEWTSLEGWRSGNYTYSPCMIATDNTLCVGGTNVNDTIFYV 737
            A  +  N   + LF+ AAGN   + +   S    Y+       N + V  T   D +   
Sbjct: 282  ALYDTINSQRDILFITAAGNAAANNDTTPSYPADYN-----LPNIIAVAATTSTDGL--- 333

Query: 738  FAFNQQAGTNFGPTTVDMGAPAHSIYTTDIAVNRNYSAPSGTSFATPMVAAVAGLVLSAL 917
                  + +N+G  TV +GAP +SI +T    N  Y+  SGTS ATP V  +A L+    
Sbjct: 334  -----ASFSNYGRRTVMVGAPGYSILST--YPNNQYAYLSGTSMATPHVTGLAALI---- 382

Query: 918  GGTGRATPQTPLQIKNILMSSGDLLPSLNNQFKSARRLNA 1037
                +        IKN+++S GD   SL  +  + RR++A
Sbjct: 383  --KAKYPTMDWRGIKNLILSGGDRPSSLAAKTVTGRRIDA 420

>ref|ZP_00107910.1| COG1404: Subtilisin-like serine proteases [Nostoc punctiforme]
          Length = 659

 Score =  110 bits (275), Expect = 5e-23
 Identities = 92/288 (31%), Positives = 139/288 (47%)
 Frame = +3

Query: 51  VVIGVTDSGAFVNHPDLIGSFWENPAEIDNDLRDNDNNTYIDDVYGACFANSVCSPTSLN 230
           VV+ V D+G   NH DL  + W N  EI  +  D+D N YIDD YG  FA+         
Sbjct: 129 VVVAVVDTGVDYNHEDLKNNIWTNTKEIAGNGIDDDGNGYIDDNYGWNFAD--------- 179

Query: 231 SNLSRCGIGKNTFAWNNVNDINSHGTKIAGVIGARPNNGIGLAGVAPNLRQMVLKVVDDT 410
                    KN    NN  D N HGT ++G I A  NN  G+ G+A + + M +KV++++
Sbjct: 180 ---------KN----NNTLDNNGHGTHVSGTI-AGENNNYGVTGIAYDAKIMPVKVLNES 225

Query: 411 YFQYAYSDVVRAIDYGYAKGARIFSMSFGQDARTSATPTNKPSLDAAATAYRNLFNKYSN 590
               +YS + + I Y    GA + ++S G    TS+  T + +++ A++           
Sbjct: 226 -GSGSYSSIAKGIRYAVDNGANVINLSLG---GTSSNRTLESAINYASS---------KG 272

Query: 591 ALFVAAAGNEWTSLEGWRSGNYTYSPCMIATDNTLCVGGTNVNDTIFYVFAFNQQAGTNF 770
            + V AAGN     +G  S +Y   P   A+   + VG  + N+    +  F+ ++GTN 
Sbjct: 273 VIVVMAAGN-----DGESSPDY---PARYASKAGIAVGAVDKNNN---MADFSNRSGTN- 320

Query: 771 GPTTVDMGAPAHSIYTTDIAVNRNYSAPSGTSFATPMVAAVAGLVLSA 914
                 + AP   +Y++    N  Y+  SGTS ATP VA V  L+LSA
Sbjct: 321 --QISYVTAPGVKVYSS--VPNNQYATYSGTSMATPHVAGVVALMLSA 364



EST assemble image


clone accession position
1 MXL089e06_r BP098217 1 392
2 MXL025g05_r BP094617 25 454
3 CL40d01_r AV395337 25 536
4 MXL005d05_r BP093232 117 624
5 CL62d07_r AV396416 118 618
6 MXL073e10_r BP097315 186 556
7 HCL059a09_r AV642840 249 747
8 LCL077e05_r AV630339 260 809
9 HCL026e07_r AV641033 591 1107
10 CL50c03_r AV395745 663 1206
11 MXL062g10_r BP096671 677 1196




Chlamydomonas reinhardtii
Kazusa DNA Research Institute