KMC004625A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004625A_C01 KMC004625A_c01
aagAAAAATCAAGTATCATCATTTAAGGATCATCATCATCAAAGCTGAATGGAGGACATG
AATTATTCCAAGTGCTCATGAGTAATTATGGCAGTTTCTCTCTAAAGCCAATAAACCTCA
AAAGCTTACCTTCTTTCAACATCATAATCACAATTTGGATAGTTTGGAGTACAGACAGTA
ATTGGTGCAGATTTTTTCAATACACACCATACAGAATCTAATTGGTAAGATAATCTATAT
TTATATTCAGAATGAAACATCAGCGTCAGCATCCATGTCCATCACCCATCTTCCCATATT
TTTGGTGGAAGGCAAACCTGCAGTAACATCCTCTTCAATCCCACATTCATTTGTTCCTCT
TCTGATCATGAAGTAACCATCATCTCCCCAGCTTCTGTTCCATTGATTTGCGATAAGCCA
ATAGTCCTCCCCCTCGTCAGTTGTTCCCCACCCAATCAGCTTTACTGCATGACCACCTAA
TTGAGAACCTGTGATGTGCTTGTAAACTCCCGATTTGTAGTGGGCAAAATCCTCATAAAC
A


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004625A_C01 KMC004625A_c01
         (541 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T02011 probable cathepsin B-like cysteine proteinase (EC 3....   164  8e-40
ref|NP_567215.1| cathepsin B-like cysteine protease, putative; p...   164  8e-40
gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis...   162  2e-39
ref|NP_563647.1| cathepsin B-like cysteine protease, putative; p...   162  2e-39
ref|NP_563648.1| Expressed protein; protein id: At1g02305.1, sup...   162  2e-39

>pir||T02011 probable cathepsin B-like cysteine proteinase (EC 3.4.22.-)
           T15B16.17a - Arabidopsis thaliana
           gi|3859606|gb|AAC72872.1| contains similarity to
           cysteine proteases (Pfam: PF00112, E=1.3e-79, N=1)
           [Arabidopsis thaliana] gi|7268205|emb|CAB77732.1|
           cathepsin B-like cysteine protease [Arabidopsis
           thaliana]
          Length = 359

 Score =  164 bits (414), Expect = 8e-40
 Identities = 71/84 (84%), Positives = 78/84 (92%)
 Frame = -2

Query: 540 VYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIR 361
           VYEDFAHYKSGVYKHITGS +GGHAVKLIGWGT+ EGEDYWL+ANQWNR WGDDGYFMIR
Sbjct: 263 VYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIR 322

Query: 360 RGTNECGIEEDVTAGLPSTKNMGR 289
           RGTNECGIE++  AGLPS+KN+ R
Sbjct: 323 RGTNECGIEDEPVAGLPSSKNVFR 346

>ref|NP_567215.1| cathepsin B-like cysteine protease, putative; protein id:
           At4g01610.1, supported by cDNA: 20761., supported by
           cDNA: gi_13877860, supported by cDNA: gi_17473833,
           supported by cDNA: gi_21281112 [Arabidopsis thaliana]
           gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin
           B cysteine protease [Arabidopsis thaliana]
           gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis
           thaliana] gi|21281113|gb|AAM45063.1| putative cathepsin
           B cysteine protease [Arabidopsis thaliana]
           gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine
           protease, putative [Arabidopsis thaliana]
           gi|24417490|gb|AAN60355.1| unknown [Arabidopsis
           thaliana] gi|24899725|gb|AAN65077.1| unknown protein
           [Arabidopsis thaliana]
          Length = 359

 Score =  164 bits (414), Expect = 8e-40
 Identities = 71/84 (84%), Positives = 78/84 (92%)
 Frame = -2

Query: 540 VYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIR 361
           VYEDFAHYKSGVYKHITGS +GGHAVKLIGWGT+ EGEDYWL+ANQWNR WGDDGYFMIR
Sbjct: 263 VYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIR 322

Query: 360 RGTNECGIEEDVTAGLPSTKNMGR 289
           RGTNECGIE++  AGLPS+KN+ R
Sbjct: 323 RGTNECGIEDEPVAGLPSSKNVFR 346

>gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score =  162 bits (411), Expect = 2e-39
 Identities = 70/90 (77%), Positives = 81/90 (89%)
 Frame = -2

Query: 540 VYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIR 361
           VYEDFAHYKSGVYK+ITG+++GGHAVKLIGWGT+D+GEDYWL+ANQWNRSWGDDGYF IR
Sbjct: 261 VYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIR 320

Query: 360 RGTNECGIEEDVTAGLPSTKNMGRWVMDMD 271
           RGTNECGIE+ V AGLPS KN+ + +   D
Sbjct: 321 RGTNECGIEQSVVAGLPSEKNVFKGITTSD 350

>ref|NP_563647.1| cathepsin B-like cysteine protease, putative; protein id:
           At1g02300.1 [Arabidopsis thaliana]
          Length = 379

 Score =  162 bits (411), Expect = 2e-39
 Identities = 70/90 (77%), Positives = 81/90 (89%)
 Frame = -2

Query: 540 VYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIR 361
           VYEDFAHYKSGVYK+ITG+++GGHAVKLIGWGT+D+GEDYWL+ANQWNRSWGDDGYF IR
Sbjct: 283 VYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIR 342

Query: 360 RGTNECGIEEDVTAGLPSTKNMGRWVMDMD 271
           RGTNECGIE+ V AGLPS KN+ + +   D
Sbjct: 343 RGTNECGIEQSVVAGLPSEKNVFKGITTSD 372

>ref|NP_563648.1| Expressed protein; protein id: At1g02305.1, supported by cDNA:
           gi_14532525, supported by cDNA: gi_16226807 [Arabidopsis
           thaliana] gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10
           [Arabidopsis thaliana]
           gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10
           [Arabidopsis thaliana] gi|25090140|gb|AAN72238.1|
           At1g02300/T6A9_10 [Arabidopsis thaliana]
          Length = 362

 Score =  162 bits (410), Expect = 2e-39
 Identities = 70/90 (77%), Positives = 79/90 (87%)
 Frame = -2

Query: 540 VYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIR 361
           VYEDFAHYKSGVYKHITG+ +GGHAVKLIGWGT+D+GEDYWL+ANQWNRSWGDDGYF IR
Sbjct: 266 VYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIR 325

Query: 360 RGTNECGIEEDVTAGLPSTKNMGRWVMDMD 271
           RGTNECGIE  V AGLPS +N+ + +   D
Sbjct: 326 RGTNECGIEHGVVAGLPSDRNVVKGITTSD 355

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 496,103,317
Number of Sequences: 1393205
Number of extensions: 11338532
Number of successful extensions: 27119
Number of sequences better than 10.0: 1037
Number of HSP's better than 10.0 without gapping: 25834
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26770
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18462123008
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPD014c05_f AV770944 1 492
2 MFB097b07_f BP041041 4 482
3 MPD053g05_f AV773584 34 460
4 MR069c03_f BP081286 40 410
5 SPD039h06_f BP047139 40 542
6 MF024c06_f BP029526 40 468
7 MF047g12_f BP030787 41 535
8 MWM012b03_f AV764765 41 454
9 MF052b05_f BP031016 103 542




Lotus japonicus
Kazusa DNA Research Institute