KMC019872A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC019872A_C01 KMC019872A_c01
ATTTATAGATTTTATGAAAGCACGTTCCCACTCCAATTGCATTGAGAACTAGCCATGATG
ATAGCTACAAAATTAACCTACACTTTTTTCACTCTTCTCATGGTGAGCAATCTATGGATA
ATTAGTGCAAGTGAATGTCCTCCAATGCATAAGAAGAACTCATCCAACTTAGAAGCCAAG
AGGAAGAGGTTTGAAAGTTGGCAGAAACAACATGGCCGAAAATATGAGAATCCAGAAGAA
TGGCAAGTTCGTTTCGACATTTACCAAACAAATGTTGAGTTTATAGAATGCATAAACTCT
CAAAACCGCTCCTACCATCTCACAGACAACAAATTTGCAGATCTTACAAATGAAGAGTTC
AAAAGAATTTATATGGGTTATGGAAAAACTAGTTTGAGCTGTAATGCAGGAGCAGGATTA
TTAAGGTATAATGGACATGGGGATCTTCCGGAAAGCATCGATTGGAGGAAGAAAGGAGCT
GTGACTGACATCAAGGATCAAGGCACTTGTGGAAGCTGTTGGGCATTCTCTGCAGTGGCA
GCAGTGGAAGGGATACACCAAATAAAATCAGGAAACTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC019872A_C01 KMC019872A_c01
         (578 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_563764.1| cysteine proteinase; protein id: At1g06260.1 [A...   142  2e-33
ref|NP_567983.1| cysteine protease XCP1; protein id: At4g35350.1...   136  2e-31
gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease...   132  3e-30
gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus] gi...   131  7e-30
gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [...   130  9e-30

>ref|NP_563764.1| cysteine proteinase; protein id: At1g06260.1 [Arabidopsis thaliana]
           gi|25289993|pir||D86198 cysteine proteinase (EC
           3.4.22.-) [similarity] - Arabidopsis thaliana
           gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity
           to a cysteine endopeptidase 1 from Phaseolus vulgaris
           gb|U52970 and is a member of the papain cysteine
           protease family PF|00112. [Arabidopsis thaliana]
          Length = 343

 Score =  142 bits (359), Expect = 2e-33
 Identities = 72/161 (44%), Positives = 106/161 (65%)
 Frame = +1

Query: 94  LLMVSNLWIISASECPPMHKKNSSNLEAKRKRFESWQKQHGRKYENPEEWQVRFDIYQTN 273
           +L+ S L  + +S   P HK         ++RFE W K H + Y   +EW +RF IYQ+N
Sbjct: 19  VLIASKLCSVDSSVYDP-HK-------TLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70

Query: 274 VEFIECINSQNRSYHLTDNKFADLTNEEFKRIYMGYGKTSLSCNAGAGLLRYNGHGDLPE 453
           V+ I+ INS +  + LTDN+FAD+TN EFK  ++G   +SL  +     +  +  G++P+
Sbjct: 71  VQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPV-CDPAGNVPD 129

Query: 454 SIDWRKKGAVTDIKDQGTCGSCWAFSAVAAVEGIHQIKSGN 576
           ++DWR +GAVT I++QG CG CWAFSAVAA+EGI++IK+GN
Sbjct: 130 AVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGN 170

>ref|NP_567983.1| cysteine protease XCP1; protein id: At4g35350.1, supported by cDNA:
           gi_6708180 [Arabidopsis thaliana] gi|7435808|pir||T06122
           cysteine proteinase (EC 3.4.22.-) F23E12.90 -
           Arabidopsis thaliana gi|3080415|emb|CAA18734.1| cysteine
           proteinase-like protein [Arabidopsis thaliana]
           gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine
           endopeptidase XCP1 [Arabidopsis thaliana]
           gi|7270487|emb|CAB80252.1| cysteine proteinase-like
           protein [Arabidopsis thaliana]
           gi|26449881|dbj|BAC42063.1| putative cysteine proteinase
           [Arabidopsis thaliana] gi|28827736|gb|AAO50712.1|
           unknown protein [Arabidopsis thaliana]
          Length = 355

 Score =  136 bits (343), Expect = 2e-31
 Identities = 71/146 (48%), Positives = 93/146 (63%), Gaps = 1/146 (0%)
 Frame = +1

Query: 142 PMHKKNSSNLEAKRKRFESWQKQHGRKYENPEEWQVRFDIYQTNVEFIECINSQNRSYHL 321
           P H  N+  L    + FESW  +H + Y++ EE   RF++++ N+  I+  N++  SY L
Sbjct: 38  PEHLTNTDKL---LELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWL 94

Query: 322 TDNKFADLTNEEFKRIYMGYGKTSLSCNAGAGL-LRYNGHGDLPESIDWRKKGAVTDIKD 498
             N+FADLT+EEFK  Y+G  K   S         RY    DLP+S+DWRKKGAV  +KD
Sbjct: 95  GLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKD 154

Query: 499 QGTCGSCWAFSAVAAVEGIHQIKSGN 576
           QG CGSCWAFS VAAVEGI+QI +GN
Sbjct: 155 QGQCGSCWAFSTVAAVEGINQITTGN 180

>gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  132 bits (332), Expect = 3e-30
 Identities = 72/167 (43%), Positives = 106/167 (63%), Gaps = 3/167 (1%)
 Frame = +1

Query: 85  FFTLLMVSNLWIISASECPPMHKKNSSNLEAKRKR-FESWQKQHGRKYENPEEWQVRFDI 261
           F T+++VS+   +S       H   SS  +A+  R +E W  +HG+   +  E   RF+I
Sbjct: 12  FLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEI 71

Query: 262 YQTNVEFIECINSQNRSYHLTDNKFADLTNEEFKRIYMGYGKTSLSCNAGAGLLRYNGH- 438
           ++ N+ FI+  N +N SY L   KFADLTN+E++ +Y+G   + L   A    LRY    
Sbjct: 72  FKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLG---SRLKRKATKSSLRYEVRV 128

Query: 439 GD-LPESIDWRKKGAVTDIKDQGTCGSCWAFSAVAAVEGIHQIKSGN 576
           GD +PES+DWRK+GAV ++KDQG+CGSCWAFS + AVEGI++I +G+
Sbjct: 129 GDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGD 175

>gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
           gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase
           [Brassica napus]
          Length = 343

 Score =  131 bits (329), Expect = 7e-30
 Identities = 69/136 (50%), Positives = 88/136 (63%), Gaps = 5/136 (3%)
 Frame = +1

Query: 181 RKRFESWQKQHGRKYENPEEWQVRFDIYQTNVEFIECINS--QNRSYHLTDNKFADLTNE 354
           +KR  +W  +HGR Y +  E   R+ +++ NVE IE +N      ++ L  N+FADLTNE
Sbjct: 34  QKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNE 93

Query: 355 EFKRIYMGY-GKTSLSCNAGAGLLRYN--GHGDLPESIDWRKKGAVTDIKDQGTCGSCWA 525
           EF+ +Y GY G + LS        RY       LP S+DWRKKGAVT IKDQG+CGSCWA
Sbjct: 94  EFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWA 153

Query: 526 FSAVAAVEGIHQIKSG 573
           FSAVAA+EG+ QIK G
Sbjct: 154 FSAVAAIEGVAQIKKG 169

>gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
          Length = 344

 Score =  130 bits (328), Expect = 9e-30
 Identities = 69/138 (50%), Positives = 87/138 (63%), Gaps = 5/138 (3%)
 Frame = +1

Query: 175 AKRKRFESWQKQHGRKYENPEEWQVRFDIYQTNVEFIECINS--QNRSYHLTDNKFADLT 348
           A +KR   W  +HGR Y +  E   R+ +++ NVE IE +N      ++ L  N+FADLT
Sbjct: 33  AMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLT 92

Query: 349 NEEFKRIYMGY-GKTSLSCNAGAGLLRYNGHGD--LPESIDWRKKGAVTDIKDQGTCGSC 519
           NEEF+ +Y G+ G + LS        RY       LP S+DWRKKGAVT IKDQG CGSC
Sbjct: 93  NEEFRSMYTGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSC 152

Query: 520 WAFSAVAAVEGIHQIKSG 573
           WAFSAVAA+EG+ QIK G
Sbjct: 153 WAFSAVAAIEGVAQIKKG 170

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 494,276,798
Number of Sequences: 1393205
Number of extensions: 10450290
Number of successful extensions: 31852
Number of sequences better than 10.0: 961
Number of HSP's better than 10.0 without gapping: 30219
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31184
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21426319650
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB100d09_f BP041262 1 585
2 MFB073a07_f BP039284 8 555




Lotus japonicus
Kazusa DNA Research Institute