KMC002498A_c01

Fasta Sequence
>KMC002498A_C01 KMC002498A_c01
gctatgcccagttctgtctaccaaacattaccattgagatGGATAAGCACGAAAACTAAT
TTTTATGGCTTCATTCACACAAAGTACACCCGAAGGAAACTTGTCAACAAGCTGAAAATA
TAATGTTTAAGGATTTGTGTTTTTTATAAACTAAACAACAGGGTGATAGCAGCATAATAG
AATATGAAAAAGTTCCTCGACCATACACCATTACTTTCCTAAGCGTATTAGGAATGGCAG
AACAGTCAGCATAAACGACTCATAGGAGAAGTGTTGCAGATCCGGTATATGCAGTGTCCT
TCTCTTGTGAATTTGTCAGTTAGTCCCTTAACAGTAAGACAACACTCGATGAAGTTGTCA
TATTCTATAGGCCTTGCTTTTTCCACCGAGTTTGGTCAAACTGGGAGACGAGCAATTCCA
ATACCACAGGAGAAACAGCATAACCCAAACTCAGCAGCGCATCTCGTAACTCATTTGAAT
CGATTTTACCACTCCTGTCCTTATCAAATCTCTCAAATATGCCCCTCCAGTTCTGAAGAC
TGTAAAATAGAGATGTGAATTCCTTGGGTCCTATTTTCTTGACGTTGGTGTTGGTGAAGT
GAAACATGAGGAGGTGAACGGTTCTCAAGCTGAAGCTCTGGTTGTAGGAAGAGAGAGCTC
TCTGCAATTCCTTGTCATCGATCAAGCCGCTGCCGTCCTGGTCCGCCACCTGGAAGCATG
CGACTATGCTAGGGTCTGTGCCGGGAGGAAACACCGACGGCACCAGTGACGCGAACGGGC
TCCCGTACGCTGATGGAGGCGGTGGGTAACCGCTACCACCGCCGCCGCCGTGGGAATGGG
ATTCGTCCTTGGGGGGCTTTTGGTAGGGCGCGGCGTAAGGAGCGGACGGCTGGCCGTAGG
GGCTAGCGGAGTAAGGCTGAGAAGGCGGAGGTGGACCGCCGTAGGATTGGGATGGCGGTG
GAGCGCCGTAGGGTTGGGATGGCGGGGCTGCGCCGTAGGGTTGAGGTGGTGGCGGAGCGC
CGTAGGACTGGGATGGCGGGGCGGCGCCGTATggctggtatggcggcggagcgccgtaac
cgtaaccggaaggcttgttggggtagcctgacatgtggaatgagaagggatt


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002498A_C01 KMC002498A_c01
         (1132 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAB63845.1| putative cysteine protease [Pisum sativum]            394  e-117
ref|NP_187641.2| unknown protein; protein id: At3g10300.1, suppo...   308  3e-85
ref|NP_196037.2| EF - hand Calcium binding protein - like; prote...   302  5e-84
gb|AAF02826.1|AC009400_22 unknown protein [Arabidopsis thaliana]      301  4e-81
gb|AAH19191.1| RIKEN cDNA 2600002E23 gene [Mus musculus]              130  4e-29

>emb|CAB63845.1| putative cysteine protease [Pisum sativum]
          Length = 286

 Score =  394 bits (1013), Expect(2) = e-117
 Identities = 199/251 (79%), Positives = 215/251 (85%)
 Frame = -1

Query: 1120 FHMSGYPNKPSGYGYGAPPPYQPYGAAPPSQSYGAPPPPQPYGAAPPSQPYGAPPPSQSY 941
            F+MSGYPN+   YGYG       Y A PP+QSYGAPPP Q YGA PPSQ YGAPPPSQ  
Sbjct: 13   FNMSGYPNQSPNYGYG-------YNAPPPTQSYGAPPPSQSYGAPPPSQSYGAPPPSQY- 64

Query: 940  GGPPPPSQPYSASPYGQPSAPYAAPYQKPPKDESHSHGGGGGSGYPPPPSAYGSPFASLV 761
             G PPP Q YSASPYGQPSAPYAAP+QKPPK+ESHS GGG    YPPP  A+GSPFASL+
Sbjct: 65   -GAPPPGQSYSASPYGQPSAPYAAPHQKPPKEESHSSGGG---AYPPP--AHGSPFASLL 118

Query: 760  PSVFPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYNQSFSLRTVHLLMFHFTNTNV 581
            PS FPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYNQSFSLRTVHLLM+HFTNT+V
Sbjct: 119  PSTFPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYNQSFSLRTVHLLMYHFTNTSV 178

Query: 580  KKIGPKEFTSLFYSLQNWRGIFERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQ 401
             KIGPKEFTSLFYSLQ+WRGIFERFDKDRSG+IDSNELRDALLSLGYAVSP VL+LLVS+
Sbjct: 179  -KIGPKEFTSLFYSLQSWRGIFERFDKDRSGQIDSNELRDALLSLGYAVSPTVLDLLVSK 237

Query: 400  FDQTRWKKQGL 368
            FD+T  K + +
Sbjct: 238  FDKTGGKHKAV 248

 Score = 52.8 bits (125), Expect(2) = e-117
 Identities = 26/44 (59%), Positives = 30/44 (68%)
 Frame = -3

Query: 380 KARPIEYDNFIECCLTVKGLTDKFTREGHCIYRICNTSPMSRLC 249
           K + +EYDNFIECCLTVKGLTDKF  +   I  +    PM RLC
Sbjct: 244 KHKAVEYDNFIECCLTVKGLTDKFKEKDTGILAL-QHFPMRRLC 286

>ref|NP_187641.2| unknown protein; protein id: At3g10300.1, supported by cDNA:
            gi_17064843 [Arabidopsis thaliana]
            gi|17064844|gb|AAL32576.1| Unknown protein [Arabidopsis
            thaliana]
          Length = 335

 Score =  308 bits (789), Expect(2) = 3e-85
 Identities = 173/321 (53%), Positives = 206/321 (63%), Gaps = 41/321 (12%)
 Frame = -1

Query: 1114 MSGYPNKPSGYGYGA--PPPYQPYGAA----PPSQSYGAPPPPQ-----------PYGAA 986
            MSGYP    GYGYG   PPP  PYG+     PP  S G+ PPP            PYGA 
Sbjct: 1    MSGYPPSSQGYGYGGNPPPPQPPYGSTGNNPPPYGSSGSNPPPPYGSSASSPYAVPYGAQ 60

Query: 985  PPSQPYGAPPPSQSYGGPPPPSQPYSASP----YGQPS-APYAAPYQKPPKD-------- 845
            P   PYGAPP +     P   ++P+   P    YG PS   Y A     P D        
Sbjct: 61   PA--PYGAPPSAPYASLPGDHNKPHKEKPHGASYGSPSPGGYGAHPSSGPSDYGGYGGAP 118

Query: 844  ESHSHGGG-----------GGSGYPPPPSAYGSPFASLVPSVFPPGTDPSIVACFQVADQ 698
            +   HGGG           GG G PPP ++YGSPFASLVPS FPPGTDP+IVACFQ AD+
Sbjct: 119  QQSGHGGGYGGAPQQSGHGGGYGAPPPQASYGSPFASLVPSAFPPGTDPNIVACFQAADR 178

Query: 697  DGSGLIDDKELQRALSSYNQSFSLRTVHLLMFHFTNTNVKKIGPKEFTSLFYSLQNWRGI 518
            D SG IDDKELQ ALSSYNQSFS+RTVHLLM+ FTN+NV+KIGPKEFTSLF+SLQNWR I
Sbjct: 179  DNSGFIDDKELQGALSSYNQSFSIRTVHLLMYLFTNSNVRKIGPKEFTSLFFSLQNWRSI 238

Query: 517  FERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQFDQTRWKKQGL*NMTTSSSVV 338
            FERFDKDRSG+ID+NELRDAL+SLG++VSPV+L+LLVS+FD++  + + +          
Sbjct: 239  FERFDKDRSGRIDTNELRDALMSLGFSVSPVILDLLVSKFDKSGGRNRAI-EYDNFIECC 297

Query: 337  LLLRD*LTNSQEKDTAYTGSA 275
            L ++      +EKDTA +GSA
Sbjct: 298  LTVKGLTEKFKEKDTALSGSA 318

 Score = 30.8 bits (68), Expect(2) = 3e-85
 Identities = 12/15 (80%), Positives = 15/15 (100%)
 Frame = -2

Query: 270 FSYESFMLTVLPFLI 226
           F+YE+FMLTVLPFL+
Sbjct: 320 FNYENFMLTVLPFLV 334

>ref|NP_196037.2| EF - hand Calcium binding protein - like; protein id: At5g04170.1,
            supported by cDNA: gi_19698990 [Arabidopsis thaliana]
            gi|9955572|emb|CAC05499.1| EF-hand Calcium binding
            protein-like [Arabidopsis thaliana]
            gi|19698991|gb|AAL91231.1| EF-hand calcium binding
            protein-like [Arabidopsis thaliana]
          Length = 354

 Score =  302 bits (774), Expect(2) = 5e-84
 Identities = 182/342 (53%), Positives = 208/342 (60%), Gaps = 61/342 (17%)
 Frame = -1

Query: 1114 MSGYPNKPSGYGYG-----APPPYQP-----------------------YGAAPP----- 1034
            MSGYP    GYGYG      PPP QP                       YGA+ P     
Sbjct: 1    MSGYPPTSQGYGYGYGGGNQPPPPQPPYSSGGNNPPYGSSTTSSPYAVPYGASKPQSSSS 60

Query: 1033 ------SQSYGAPPPPQPYGAA------PPSQP-----YGAPPPS-----QSYGGPPPPS 920
                  S SYGAPPP  PY  +      PP +      YGAPPPS      SYG  P PS
Sbjct: 61   SAPTYGSSSYGAPPPSAPYAPSPGDYNKPPKEKPYGGGYGAPPPSGSSDYGSYGAGPRPS 120

Query: 919  QP------YSASPYGQPSAPYAAPYQKPPKDESHSHGGGGGSGYPPPPSAYGSPFASLVP 758
            QP      Y A+P     + Y +    PP+  S  HGGG G GYPP  S YGSPFASL+P
Sbjct: 121  QPSGHGGGYGATP-PHGVSDYGSYGGAPPRPASSGHGGGYG-GYPPQAS-YGSPFASLIP 177

Query: 757  SVFPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYNQSFSLRTVHLLMFHFTNTNVK 578
            S F PGTDP+IVACFQ ADQDGSG IDDKELQ ALSSY Q FS+RTVHLLM+ FTN+N  
Sbjct: 178  SGFAPGTDPNIVACFQAADQDGSGFIDDKELQGALSSYQQRFSMRTVHLLMYLFTNSNAM 237

Query: 577  KIGPKEFTSLFYSLQNWRGIFERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQF 398
            KIGPKEFT+LFYSLQNWR IFER DKDRSG+ID NELRDALLSLG++VSPVVL+LLVS+F
Sbjct: 238  KIGPKEFTALFYSLQNWRSIFERSDKDRSGRIDVNELRDALLSLGFSVSPVVLDLLVSKF 297

Query: 397  DQTRWKKQGL*NMTTSSSVVLLLRD*LTNSQEKDTAYTGSAT 272
            D++  K + +          L ++      +EKDTAY+GSAT
Sbjct: 298  DKSGGKNRAI-EYDNFIECCLTVKGLTEKFKEKDTAYSGSAT 338

 Score = 32.3 bits (72), Expect(2) = 5e-84
 Identities = 14/15 (93%), Positives = 15/15 (99%)
 Frame = -2

Query: 270 FSYESFMLTVLPFLI 226
           F+YESFMLTVLPFLI
Sbjct: 339 FNYESFMLTVLPFLI 353

>gb|AAF02826.1|AC009400_22 unknown protein [Arabidopsis thaliana]
          Length = 330

 Score =  301 bits (772), Expect(2) = 4e-81
 Identities = 164/290 (56%), Positives = 193/290 (66%), Gaps = 41/290 (14%)
 Frame = -1

Query: 1114 MSGYPNKPSGYGYGA--PPPYQPYGAA----PPSQSYGAPPPPQ-----------PYGAA 986
            MSGYP    GYGYG   PPP  PYG+     PP  S G+ PPP            PYGA 
Sbjct: 1    MSGYPPSSQGYGYGGNPPPPQPPYGSTGNNPPPYGSSGSNPPPPYGSSASSPYAVPYGAQ 60

Query: 985  PPSQPYGAPPPSQSYGGPPPPSQPYSASP----YGQPS-APYAAPYQKPPKD-------- 845
            P   PYGAPP +     P   ++P+   P    YG PS   Y A     P D        
Sbjct: 61   PA--PYGAPPSAPYASLPGDHNKPHKEKPHGASYGSPSPGGYGAHPSSGPSDYGGYGGAP 118

Query: 844  ESHSHGGG-----------GGSGYPPPPSAYGSPFASLVPSVFPPGTDPSIVACFQVADQ 698
            +   HGGG           GG G PPP ++YGSPFASLVPS FPPGTDP+IVACFQ AD+
Sbjct: 119  QQSGHGGGYGGAPQQSGHGGGYGAPPPQASYGSPFASLVPSAFPPGTDPNIVACFQAADR 178

Query: 697  DGSGLIDDKELQRALSSYNQSFSLRTVHLLMFHFTNTNVKKIGPKEFTSLFYSLQNWRGI 518
            D SG IDDKELQ ALSSYNQSFS+RTVHLLM+ FTN+NV+KIGPKEFTSLF+SLQNWR I
Sbjct: 179  DNSGFIDDKELQGALSSYNQSFSIRTVHLLMYLFTNSNVRKIGPKEFTSLFFSLQNWRSI 238

Query: 517  FERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQFDQTRWKKQGL 368
            FERFDKDRSG+ID+NELRDAL+SLG++VSPV+L+LLVS+FD++  + + +
Sbjct: 239  FERFDKDRSGRIDTNELRDALMSLGFSVSPVILDLLVSKFDKSGGRNRAI 288

 Score = 23.5 bits (49), Expect(2) = 4e-81
 Identities = 18/48 (37%), Positives = 23/48 (47%)
 Frame = -3

Query: 374 RPIEYDNFIECCLTVKGLTDKFTREGHCIYRICNTSPMSRLC*LFCHS 231
           R IEYDNFIE   + +    +  R    ++ I  TS     C LF HS
Sbjct: 286 RAIEYDNFIEG--SPRSSRRRIRRYQAQLFSITRTS-----CSLFYHS 326

>gb|AAH19191.1| RIKEN cDNA 2600002E23 gene [Mus musculus]
          Length = 275

 Score =  130 bits (327), Expect = 4e-29
 Identities = 88/241 (36%), Positives = 115/241 (47%), Gaps = 2/241 (0%)
 Frame = -1

Query: 1114 MSGYPNKPSGYGYGAPPPYQPYGAAPPSQSYGAPPPPQPYGAA-PPSQPYGAPPPSQSYG 938
            M+ YPN  S  G     P  P G   P   +G       YG+  PP   YGAP P   YG
Sbjct: 1    MASYPNGQSCPGAAGQVPGVPPGGYYPGPPHGGGQ----YGSGLPPGGGYGAPAPGGPYG 56

Query: 937  GPPPPSQPYSASPYGQPSAPYAAPYQKPPKDESHSHGGGGGSGYPPPPSAYGSPFASLVP 758
             P          P G PS PY      PP         GG  G  PP   YG+       
Sbjct: 57   YPSA-----GGVPSGTPSGPYGGI---PP---------GGPYGQLPPGGPYGTQPGHYGQ 99

Query: 757  SVFPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYN-QSFSLRTVHLLMFHFTNTNV 581
               PP  DP   + FQ  D D SG I  KEL++AL + N  SF+  T H+++  F  T  
Sbjct: 100  GGVPPNVDPEAYSWFQSVDADHSGYISLKELKQALVNSNWSSFNDETCHMMINMFDKTKS 159

Query: 580  KKIGPKEFTSLFYSLQNWRGIFERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQ 401
             +I    F++L+  LQ WR +F+++D+DRSG I S EL+ AL  +GY +SP   +LLVS+
Sbjct: 160  GRIDVAGFSALWKFLQQWRNLFQQYDRDRSGSISSTELQQALSQMGYNLSPQFTQLLVSR 219

Query: 400  F 398
            +
Sbjct: 220  Y 220

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,124,571,788
Number of Sequences: 1393205
Number of extensions: 33004616
Number of successful extensions: 413506
Number of sequences better than 10.0: 9782
Number of HSP's better than 10.0 without gapping: 127155
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 246662
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 68909194122
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
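
The bit scores reported in the alignments above appear to follow the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2, using the gapped Lambda and K listed in this footer. The following is a minimal sketch under that assumption, not BLAST's own code. The function names are hypothetical, and the E-value estimate ignores the sum statistics behind the Expect(2) values, so it is only meaningful as an order-of-magnitude check for the single-HSP Mus musculus hit.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
# "effective search space used" from the report footer.
SEARCH_SPACE = 68909194122


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (hypothetical helper)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def approx_evalue(raw_score, search_space=SEARCH_SPACE):
    """Rough single-HSP E-value; assumes E = search_space * 2**(-bits)."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores are the parenthesised values on the "Score =" lines.
    for raw in (125, 72, 68, 49):
        print(raw, round(bit_score(raw), 1))
    print("approx E for raw score 327:", approx_evalue(327))
```

Because Lambda and K are printed to only three significant figures, the computed values can differ from the report in the last digit.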


EST assembly


no. clone accession start end
1 GNf052a10 BP071203 1 451
2 MPD072d08_f AV774722 32 454
3 GNf075c09 BP072902 38 448
4 MR046h09_f BP079596 41 459
5 MPD065e05_f AV774337 46 465
6 MR034d08_f BP078623 46 455
7 MR061d06_f BP080675 46 473
8 MR023e10_f BP077762 48 178
9 GENf073b12 BP061467 52 414
10 MR010f08_f BP076723 52 464
11 MWM043a09_f AV765343 52 486
12 MFB014c05_f BP034937 53 589
13 GNf015a05 BP068425 54 561
14 MFB011f01_f BP034728 61 587
15 GNf062d05 BP071972 87 584
16 SPD060g06_f BP048797 87 606
17 GNf067h02 BP072367 88 186
18 GNf005a05 BP067706 97 530
19 SPD011h03_f BP044907 107 661
20 MR047h10_f BP079678 108 203
21 MFB095h10_f BP040961 108 663
22 MPD088f08_f AV775806 109 463
23 MFB085e09_f BP040219 531 1064
24 MWM073a06_f AV765874 793 1157
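
The clone positions above can be summarized as per-position coverage, which shows how many ESTs support each part of the assembly. This is a minimal sketch, assuming the two numbers in each row are 1-based inclusive start and end coordinates on the assembly (the page does not state this); `CLONE_SPANS` and `depth_at` are hypothetical names introduced for illustration.

```python
# (start, end) positions copied from the clone table, in table order.
CLONE_SPANS = [
    (1, 451), (32, 454), (38, 448), (41, 459), (46, 465), (46, 455),
    (46, 473), (48, 178), (52, 414), (52, 464), (52, 486), (53, 589),
    (54, 561), (61, 587), (87, 584), (87, 606), (88, 186), (97, 530),
    (107, 661), (108, 203), (108, 663), (109, 463), (531, 1064),
    (793, 1157),
]


def depth_at(position, spans=CLONE_SPANS):
    """Count clones whose inclusive [start, end] span covers position."""
    return sum(1 for start, end in spans if start <= position <= end)


if __name__ == "__main__":
    for pos in (100, 700, 900, 1100):
        print(pos, depth_at(pos))
```

Under that assumption, most clones cluster near the start of the assembly, while the later positions rest on only one or two clones.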




Lotus japonicus
Kazusa DNA Research Institute