Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002498A_C01 KMC002498A_c01
(1132 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB63845.1| putative cysteine protease [Pisum sativum] 394 e-117
ref|NP_187641.2| unknown protein; protein id: At3g10300.1, suppo... 308 3e-85
ref|NP_196037.2| EF - hand Calcium binding protein - like; prote... 302 5e-84
gb|AAF02826.1|AC009400_22 unknown protein [Arabidopsis thaliana] 301 4e-81
gb|AAH19191.1| RIKEN cDNA 2600002E23 gene [Mus musculus] 130 4e-29
>emb|CAB63845.1| putative cysteine protease [Pisum sativum]
Length = 286
Score = 394 bits (1013), Expect(2) = e-117
Identities = 199/251 (79%), Positives = 215/251 (85%)
Frame = -1
Query: 1120 FHMSGYPNKPSGYGYGAPPPYQPYGAAPPSQSYGAPPPPQPYGAAPPSQPYGAPPPSQSY 941
F+MSGYPN+ YGYG Y A PP+QSYGAPPP Q YGA PPSQ YGAPPPSQ
Sbjct: 13 FNMSGYPNQSPNYGYG-------YNAPPPTQSYGAPPPSQSYGAPPPSQSYGAPPPSQY- 64
Query: 940 GGPPPPSQPYSASPYGQPSAPYAAPYQKPPKDESHSHGGGGGSGYPPPPSAYGSPFASLV 761
G PPP Q YSASPYGQPSAPYAAP+QKPPK+ESHS GGG YPPP A+GSPFASL+
Sbjct: 65 -GAPPPGQSYSASPYGQPSAPYAAPHQKPPKEESHSSGGG---AYPPP--AHGSPFASLL 118
Query: 760 PSVFPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYNQSFSLRTVHLLMFHFTNTNV 581
PS FPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYNQSFSLRTVHLLM+HFTNT+V
Sbjct: 119 PSTFPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYNQSFSLRTVHLLMYHFTNTSV 178
Query: 580 KKIGPKEFTSLFYSLQNWRGIFERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQ 401
KIGPKEFTSLFYSLQ+WRGIFERFDKDRSG+IDSNELRDALLSLGYAVSP VL+LLVS+
Sbjct: 179 -KIGPKEFTSLFYSLQSWRGIFERFDKDRSGQIDSNELRDALLSLGYAVSPTVLDLLVSK 237
Query: 400 FDQTRWKKQGL 368
FD+T K + +
Sbjct: 238 FDKTGGKHKAV 248
Score = 52.8 bits (125), Expect(2) = e-117
Identities = 26/44 (59%), Positives = 30/44 (68%)
Frame = -3
Query: 380 KARPIEYDNFIECCLTVKGLTDKFTREGHCIYRICNTSPMSRLC 249
K + +EYDNFIECCLTVKGLTDKF + I + PM RLC
Sbjct: 244 KHKAVEYDNFIECCLTVKGLTDKFKEKDTGILAL-QHFPMRRLC 286
>ref|NP_187641.2| unknown protein; protein id: At3g10300.1, supported by cDNA:
gi_17064843 [Arabidopsis thaliana]
gi|17064844|gb|AAL32576.1| Unknown protein [Arabidopsis
thaliana]
Length = 335
Score = 308 bits (789), Expect(2) = 3e-85
Identities = 173/321 (53%), Positives = 206/321 (63%), Gaps = 41/321 (12%)
Frame = -1
Query: 1114 MSGYPNKPSGYGYGA--PPPYQPYGAA----PPSQSYGAPPPPQ-----------PYGAA 986
MSGYP GYGYG PPP PYG+ PP S G+ PPP PYGA
Sbjct: 1 MSGYPPSSQGYGYGGNPPPPQPPYGSTGNNPPPYGSSGSNPPPPYGSSASSPYAVPYGAQ 60
Query: 985 PPSQPYGAPPPSQSYGGPPPPSQPYSASP----YGQPS-APYAAPYQKPPKD-------- 845
P PYGAPP + P ++P+ P YG PS Y A P D
Sbjct: 61 PA--PYGAPPSAPYASLPGDHNKPHKEKPHGASYGSPSPGGYGAHPSSGPSDYGGYGGAP 118
Query: 844 ESHSHGGG-----------GGSGYPPPPSAYGSPFASLVPSVFPPGTDPSIVACFQVADQ 698
+ HGGG GG G PPP ++YGSPFASLVPS FPPGTDP+IVACFQ AD+
Sbjct: 119 QQSGHGGGYGGAPQQSGHGGGYGAPPPQASYGSPFASLVPSAFPPGTDPNIVACFQAADR 178
Query: 697 DGSGLIDDKELQRALSSYNQSFSLRTVHLLMFHFTNTNVKKIGPKEFTSLFYSLQNWRGI 518
D SG IDDKELQ ALSSYNQSFS+RTVHLLM+ FTN+NV+KIGPKEFTSLF+SLQNWR I
Sbjct: 179 DNSGFIDDKELQGALSSYNQSFSIRTVHLLMYLFTNSNVRKIGPKEFTSLFFSLQNWRSI 238
Query: 517 FERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQFDQTRWKKQGL*NMTTSSSVV 338
FERFDKDRSG+ID+NELRDAL+SLG++VSPV+L+LLVS+FD++ + + +
Sbjct: 239 FERFDKDRSGRIDTNELRDALMSLGFSVSPVILDLLVSKFDKSGGRNRAI-EYDNFIECC 297
Query: 337 LLLRD*LTNSQEKDTAYTGSA 275
L ++ +EKDTA +GSA
Sbjct: 298 LTVKGLTEKFKEKDTALSGSA 318
Score = 30.8 bits (68), Expect(2) = 3e-85
Identities = 12/15 (80%), Positives = 15/15 (100%)
Frame = -2
Query: 270 FSYESFMLTVLPFLI 226
F+YE+FMLTVLPFL+
Sbjct: 320 FNYENFMLTVLPFLV 334
>ref|NP_196037.2| EF - hand Calcium binding protein - like; protein id: At5g04170.1,
supported by cDNA: gi_19698990 [Arabidopsis thaliana]
gi|9955572|emb|CAC05499.1| EF-hand Calcium binding
protein-like [Arabidopsis thaliana]
gi|19698991|gb|AAL91231.1| EF-hand calcium binding
protein-like [Arabidopsis thaliana]
Length = 354
Score = 302 bits (774), Expect(2) = 5e-84
Identities = 182/342 (53%), Positives = 208/342 (60%), Gaps = 61/342 (17%)
Frame = -1
Query: 1114 MSGYPNKPSGYGYG-----APPPYQP-----------------------YGAAPP----- 1034
MSGYP GYGYG PPP QP YGA+ P
Sbjct: 1 MSGYPPTSQGYGYGYGGGNQPPPPQPPYSSGGNNPPYGSSTTSSPYAVPYGASKPQSSSS 60
Query: 1033 ------SQSYGAPPPPQPYGAA------PPSQP-----YGAPPPS-----QSYGGPPPPS 920
S SYGAPPP PY + PP + YGAPPPS SYG P PS
Sbjct: 61 SAPTYGSSSYGAPPPSAPYAPSPGDYNKPPKEKPYGGGYGAPPPSGSSDYGSYGAGPRPS 120
Query: 919 QP------YSASPYGQPSAPYAAPYQKPPKDESHSHGGGGGSGYPPPPSAYGSPFASLVP 758
QP Y A+P + Y + PP+ S HGGG G GYPP S YGSPFASL+P
Sbjct: 121 QPSGHGGGYGATP-PHGVSDYGSYGGAPPRPASSGHGGGYG-GYPPQAS-YGSPFASLIP 177
Query: 757 SVFPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYNQSFSLRTVHLLMFHFTNTNVK 578
S F PGTDP+IVACFQ ADQDGSG IDDKELQ ALSSY Q FS+RTVHLLM+ FTN+N
Sbjct: 178 SGFAPGTDPNIVACFQAADQDGSGFIDDKELQGALSSYQQRFSMRTVHLLMYLFTNSNAM 237
Query: 577 KIGPKEFTSLFYSLQNWRGIFERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQF 398
KIGPKEFT+LFYSLQNWR IFER DKDRSG+ID NELRDALLSLG++VSPVVL+LLVS+F
Sbjct: 238 KIGPKEFTALFYSLQNWRSIFERSDKDRSGRIDVNELRDALLSLGFSVSPVVLDLLVSKF 297
Query: 397 DQTRWKKQGL*NMTTSSSVVLLLRD*LTNSQEKDTAYTGSAT 272
D++ K + + L ++ +EKDTAY+GSAT
Sbjct: 298 DKSGGKNRAI-EYDNFIECCLTVKGLTEKFKEKDTAYSGSAT 338
Score = 32.3 bits (72), Expect(2) = 5e-84
Identities = 14/15 (93%), Positives = 15/15 (99%)
Frame = -2
Query: 270 FSYESFMLTVLPFLI 226
F+YESFMLTVLPFLI
Sbjct: 339 FNYESFMLTVLPFLI 353
>gb|AAF02826.1|AC009400_22 unknown protein [Arabidopsis thaliana]
Length = 330
Score = 301 bits (772), Expect(2) = 4e-81
Identities = 164/290 (56%), Positives = 193/290 (66%), Gaps = 41/290 (14%)
Frame = -1
Query: 1114 MSGYPNKPSGYGYGA--PPPYQPYGAA----PPSQSYGAPPPPQ-----------PYGAA 986
MSGYP GYGYG PPP PYG+ PP S G+ PPP PYGA
Sbjct: 1 MSGYPPSSQGYGYGGNPPPPQPPYGSTGNNPPPYGSSGSNPPPPYGSSASSPYAVPYGAQ 60
Query: 985 PPSQPYGAPPPSQSYGGPPPPSQPYSASP----YGQPS-APYAAPYQKPPKD-------- 845
P PYGAPP + P ++P+ P YG PS Y A P D
Sbjct: 61 PA--PYGAPPSAPYASLPGDHNKPHKEKPHGASYGSPSPGGYGAHPSSGPSDYGGYGGAP 118
Query: 844 ESHSHGGG-----------GGSGYPPPPSAYGSPFASLVPSVFPPGTDPSIVACFQVADQ 698
+ HGGG GG G PPP ++YGSPFASLVPS FPPGTDP+IVACFQ AD+
Sbjct: 119 QQSGHGGGYGGAPQQSGHGGGYGAPPPQASYGSPFASLVPSAFPPGTDPNIVACFQAADR 178
Query: 697 DGSGLIDDKELQRALSSYNQSFSLRTVHLLMFHFTNTNVKKIGPKEFTSLFYSLQNWRGI 518
D SG IDDKELQ ALSSYNQSFS+RTVHLLM+ FTN+NV+KIGPKEFTSLF+SLQNWR I
Sbjct: 179 DNSGFIDDKELQGALSSYNQSFSIRTVHLLMYLFTNSNVRKIGPKEFTSLFFSLQNWRSI 238
Query: 517 FERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQFDQTRWKKQGL 368
FERFDKDRSG+ID+NELRDAL+SLG++VSPV+L+LLVS+FD++ + + +
Sbjct: 239 FERFDKDRSGRIDTNELRDALMSLGFSVSPVILDLLVSKFDKSGGRNRAI 288
Score = 23.5 bits (49), Expect(2) = 4e-81
Identities = 18/48 (37%), Positives = 23/48 (47%)
Frame = -3
Query: 374 RPIEYDNFIECCLTVKGLTDKFTREGHCIYRICNTSPMSRLC*LFCHS 231
R IEYDNFIE + + + R ++ I TS C LF HS
Sbjct: 286 RAIEYDNFIEG--SPRSSRRRIRRYQAQLFSITRTS-----CSLFYHS 326
>gb|AAH19191.1| RIKEN cDNA 2600002E23 gene [Mus musculus]
Length = 275
Score = 130 bits (327), Expect = 4e-29
Identities = 88/241 (36%), Positives = 115/241 (47%), Gaps = 2/241 (0%)
Frame = -1
Query: 1114 MSGYPNKPSGYGYGAPPPYQPYGAAPPSQSYGAPPPPQPYGAA-PPSQPYGAPPPSQSYG 938
M+ YPN S G P P G P +G YG+ PP YGAP P YG
Sbjct: 1 MASYPNGQSCPGAAGQVPGVPPGGYYPGPPHGGGQ----YGSGLPPGGGYGAPAPGGPYG 56
Query: 937 GPPPPSQPYSASPYGQPSAPYAAPYQKPPKDESHSHGGGGGSGYPPPPSAYGSPFASLVP 758
P P G PS PY PP GG G PP YG+
Sbjct: 57 YPSA-----GGVPSGTPSGPYGGI---PP---------GGPYGQLPPGGPYGTQPGHYGQ 99
Query: 757 SVFPPGTDPSIVACFQVADQDGSGLIDDKELQRALSSYN-QSFSLRTVHLLMFHFTNTNV 581
PP DP + FQ D D SG I KEL++AL + N SF+ T H+++ F T
Sbjct: 100 GGVPPNVDPEAYSWFQSVDADHSGYISLKELKQALVNSNWSSFNDETCHMMINMFDKTKS 159
Query: 580 KKIGPKEFTSLFYSLQNWRGIFERFDKDRSGKIDSNELRDALLSLGYAVSPVVLELLVSQ 401
+I F++L+ LQ WR +F+++D+DRSG I S EL+ AL +GY +SP +LLVS+
Sbjct: 160 GRIDVAGFSALWKFLQQWRNLFQQYDRDRSGSISSTELQQALSQMGYNLSPQFTQLLVSR 219
Query: 400 F 398
+
Sbjct: 220 Y 220
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,124,571,788
Number of Sequences: 1393205
Number of extensions: 33004616
Number of successful extensions: 413506
Number of sequences better than 10.0: 9782
Number of HSP's better than 10.0 without gapping: 127155
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 246662
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 68909194122
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)