KMC012385A_c01

Fasta Sequence
>KMC012385A_C01 KMC012385A_c01
ccccctccgaatagattggcggggcacgatcaccaccgtcttggtggtagtcatcacggt
gtcatcggcgatggacatgtCGATCATTTCGTACAGCAGAGACAAGTCTGCACCGCCTCT
GAGGAGTGACGAGGAGGTAATGAGGATGTACGAGGAGTGGGCTATGAAGCAAGGGAAAGT
GTACAACGCTCTCGGCGAGAAGGAGAAGAGGTTCGAAATCTTCAAAGACAACCTCAAATT
CATCGACGAGCACAATGCAGAGAACCGGACTTACAAGGTGGGGTTGAACCGGTTCGCTGA
TCTTAGCAACGAGGAGTACAGGGCCAAGTTCCTGGGAATCAGAGTTGATTCCAACAGGAG
GAGGAGGAGGATGGCGAAGTCCACCACCAGCAACCACCGTTATGCTCCGCGTGTCGGTGA
CGGTGAGAAATTGCCTGAATCTGTTGATTGGAGGAAGGAAGGTGCTGTGGTAGGAGTCAA
AGATCAAGGAGAATGCGGGAGCGCCTGGGCATTTTCAGCGGTAGCTGCTGGTGAAGGAAT
CAACAAGAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC012385A_C01 KMC012385A_c01
         (549 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||S24602 cysteine proteinase tpp (EC 3.4.22.-) - garden pea g...   223  9e-58
pir||T12041 cysteine proteinase (EC 3.4.22.-) 3 precursor - kidn...   220  8e-57
gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease...   197  5e-50
gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea bat...   193  1e-48
emb|CAB53515.1| cysteine protease [Solanum tuberosum]                 190  8e-48

>pir||S24602 cysteine proteinase tpp (EC 3.4.22.-) - garden pea
           gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  223 bits (569), Expect = 9e-58
 Identities = 112/172 (65%), Positives = 138/172 (80%), Gaps = 3/172 (1%)
 Frame = +2

Query: 41  LVVVITVSSAMDMSIISYSR---DKSAPPLRSDEEVMRMYEEWAMKQGKVYNALGEKEKR 211
           + +  T+S A+DM IISY +   DKS P  R++++V+ MYEEW +K GK YNALGEKEKR
Sbjct: 10  ITLTFTLSLALDMCIISYDKTHPDKSTP--RTNDQVLTMYEEWLVKHGKNYNALGEKEKR 67

Query: 212 FEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRAKFLGIRVDSNRRRRRMAKSTTS 391
           FEIFKDNL FIDEHN++N ++++GLNRFADL+NEEYR +FLG R++ NRR R++   T  
Sbjct: 68  FEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQT-- 125

Query: 392 NHRYAPRVGDGEKLPESVDWRKEGAVVGVKDQGECGSAWAFSAVAAGEGINK 547
            +RYA RVGD  KLPESVDWRKEGAVVGVKDQG CGS WAFSA+AA EG+NK
Sbjct: 126 -NRYATRVGD--KLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNK 174
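
The "Frame = +2" line means the protein alignment was made against a translation of the query read from its second nucleotide onward, so the Query coordinates (41, 211, ...) count nucleotides, not residues. A minimal Python sketch, assuming the standard genetic code, that translates nucleotides 41 to 100 of the FASTA record above for comparison with the start of the first Query line. The names `translate`, `CODON_TABLE`, and `SEQ` are illustrative and not part of the BLAST output.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 120 nucleotides of KMC012385A_c01, copied from the FASTA record.
SEQ = (
    "ccccctccgaatagattggcggggcacgatcaccaccgtcttggtggtagtcatcacggt"
    "gtcatcggcgatggacatgtCGATCATTTCGTACAGCAGAGACAAGTCTGCACCGCCTCT"
)


def translate(dna, start, end):
    """Translate the 1-based inclusive nucleotide range [start, end].

    Codons not found in the table (for example, ones containing N)
    are reported as 'X'. A trailing partial codon is dropped.
    """
    region = dna[start - 1:end].upper()
    usable = len(region) - len(region) % 3
    codons = (region[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Position 41 lies in frame +2 because (41 - 2) is a multiple of 3.
peptide = translate(SEQ, 41, 100)
print(peptide)
```

The printed peptide can be compared with the first 20 residues of the "Query: 41" line in the alignment above.
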

>pir||T12041 cysteine proteinase (EC 3.4.22.-) 3 precursor - kidney bean
           gi|2511693|emb|CAB17076.1| cysteine proteinase precursor
           [Phaseolus vulgaris]
          Length = 455

 Score =  220 bits (561), Expect = 8e-57
 Identities = 113/173 (65%), Positives = 137/173 (78%), Gaps = 3/173 (1%)
 Frame = +2

Query: 38  VLVVVITVSSAMDMSIISYS---RDKSAPPLRSDEEVMRMYEEWAMKQGKVYNALGEKEK 208
           +L  +  +SSA+DMSIISY    +DK+    R+DEEV  +YEEW +K GK+YNALGEK+K
Sbjct: 2   LLFALFALSSALDMSIISYDNAHQDKAT--WRTDEEVNSLYEEWLVKHGKLYNALGEKDK 59

Query: 209 RFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRAKFLGIRVDSNRRRRRMAKSTT 388
           RF+IFKDNL+FID+ NAENRTYK+GLNRFADL+NEEYRA++LG ++D NRR  R     T
Sbjct: 60  RFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRLGR-----T 114

Query: 389 SNHRYAPRVGDGEKLPESVDWRKEGAVVGVKDQGECGSAWAFSAVAAGEGINK 547
            ++RYAPRV  GE LP+SVDWRKEGAVV VKDQ  CGS WAFSA+ A EGINK
Sbjct: 115 PSNRYAPRV--GETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINK 165

>gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  197 bits (502), Expect = 5e-50
 Identities = 99/170 (58%), Positives = 124/170 (72%)
 Frame = +2

Query: 38  VLVVVITVSSAMDMSIISYSRDKSAPPLRSDEEVMRMYEEWAMKQGKVYNALGEKEKRFE 217
           + + +I VSSAMDMSIISY ++      RSD EV R+YEEW +K GK  N+L EK++RFE
Sbjct: 11  LFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFE 70

Query: 218 IFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRAKFLGIRVDSNRRRRRMAKSTTSNH 397
           IFKDNL+FIDEHN +N +Y++GL +FADL+N+EYR+ +LG R+          K+T S+ 
Sbjct: 71  IFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRL--------KRKATKSSL 122

Query: 398 RYAPRVGDGEKLPESVDWRKEGAVVGVKDQGECGSAWAFSAVAAGEGINK 547
           RY  RVGD   +PESVDWRKEGAV  VKDQG CGS WAFS + A EGINK
Sbjct: 123 RYEVRVGDA--IPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINK 170

>gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score =  193 bits (490), Expect = 1e-48
 Identities = 102/173 (58%), Positives = 131/173 (74%), Gaps = 3/173 (1%)
 Frame = +2

Query: 38  VLVVVITVSSAMDMSIISYSRDKSAPPL-RSDEEVMRMYEEWAMKQGKVYNALG-EKEKR 211
           VL  V + S++ DMSII+Y  +  A  L RSDEEVM +YE W ++ GK YN LG EK+KR
Sbjct: 11  VLAAVSSASASADMSIITYDEEHPAKGLSRSDEEVMALYESWLVEHGKSYNGLGGEKDKR 70

Query: 212 FEIFKDNLKFIDEHNAE-NRTYKVGLNRFADLSNEEYRAKFLGIRVDSNRRRRRMAKSTT 388
           FEIFKDNL++IDE N+  +R+YK+GLNRFADL+NEEYR+ +LG + D+   RRR+AK T 
Sbjct: 71  FEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYRSTYLGAKTDA---RRRIAK-TK 126

Query: 389 SNHRYAPRVGDGEKLPESVDWRKEGAVVGVKDQGECGSAWAFSAVAAGEGINK 547
           S+ RYAP+ G    LP+S+DWR++GAV  VKDQG CGS WAFS +AA EGIN+
Sbjct: 127 SDRRYAPKAGGS--LPDSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQ 177

>emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  190 bits (483), Expect = 8e-48
 Identities = 102/175 (58%), Positives = 131/175 (74%), Gaps = 2/175 (1%)
 Frame = +2

Query: 26  TITTVLVVVI-TVSSAMDMSIISYSRDKSAPPLRSDEEVMRMYEEWAMKQGKVYNALGEK 202
           TI+ +L+++  T+SSA DMSIISY  D++    RSD+EV  +YE W ++ GK YNALGEK
Sbjct: 9   TISLLLMLIFSTLSSASDMSIISY--DETHIHHRSDDEVSALYESWLIEHGKSYNALGEK 66

Query: 203 EKRFEIFKDNLKFIDEHNA-ENRTYKVGLNRFADLSNEEYRAKFLGIRVDSNRRRRRMAK 379
           +KRF+IFKDNLK+IDE N+  N++YK+GL +FADL+NEEYR+ +LG +   +RR+    K
Sbjct: 67  DKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNK 126

Query: 380 STTSNHRYAPRVGDGEKLPESVDWRKEGAVVGVKDQGECGSAWAFSAVAAGEGIN 544
           S     RY P+VGD   LPESVDWR +G +VGVKDQG CGS WAFSAVAA E IN
Sbjct: 127 S----DRYLPKVGD--SLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESIN 175

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 534,102,698
Number of Sequences: 1393205
Number of extensions: 13102595
Number of successful extensions: 64444
Number of sequences better than 10.0: 995
Number of HSP's better than 10.0 without gapping: 57293
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 63127
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 18947112822
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
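
The length figures in the statistics above are related by simple arithmetic, which can be checked with a short Python sketch. It assumes that the effective database length is the total letter count minus one effective HSP length per database sequence; that relation reproduces the reported value for this search. The effective query length is not printed in the report, so the quotient computed at the end is an inference, and the variable names are illustrative.

```python
# Values copied from the search statistics above.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 116
reported_effective_db_length = 287_077_467
reported_search_space = 18_947_112_822

# Assumed relation: each database sequence loses one effective HSP length.
effective_db_length = db_letters - db_sequences * effective_hsp_length

# Inferred, not stated in the report: the search space divided by the
# effective database length gives the effective query length in residues.
effective_query_length, remainder = divmod(
    reported_search_space, effective_db_length
)

print("effective database length:", effective_db_length)
print("matches reported value:", effective_db_length == reported_effective_db_length)
print("inferred effective query length:", effective_query_length)
print("remainder:", remainder)
```
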


EST assemble image


No.  Clone        Accession  Start  End
1    SPD026h03_f  BP046092       1  526
2    MPD074d12_f  AV774857     253  550




Lotus japonicus
Kazusa DNA Research Institute