Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC011471A_C01 KMC011471A_c01
(563 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAB14533.1| unnamed protein product [Homo sapiens] 49 6e-05
gb|AAF21601.1|AF009222_1 kexin-like serine endoprotease [Pneumoc... 48 1e-04
gb|AAM44363.1| hypothetical protein [Dictyostelium discoideum] g... 48 1e-04
emb|CAC43457.1| protease 1 [Pneumocystis carinii] 48 1e-04
ref|NP_172666.1| unknown protein; protein id: At1g12020.1, suppo... 47 1e-04
>dbj|BAB14533.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 48.5 bits (114), Expect = 6e-05
Identities = 29/63 (46%), Positives = 34/63 (53%), Gaps = 9/63 (14%)
Frame = +3
Query: 387 PFSSSSSTPPPP----STTSPPPPPPPKPPLPLL-FTLVSVSPSTNSGSA----EPLHGE 539
P S + PPPP STT PPPPPPP PP PL +S PS G+A PL G+
Sbjct: 356 PGDSGTIIPPPPAPGDSTTPPPPPPPPPPPPPLPGGVCISSPPSLPGGTAISPPPPLSGD 415
Query: 540 LTL 548
T+
Sbjct: 416 ATI 418
>gb|AAF21601.1|AF009222_1 kexin-like serine endoprotease [Pneumocystis carinii]
Length = 493
Score = 47.8 bits (112), Expect = 1e-04
Identities = 20/34 (58%), Positives = 24/34 (69%)
Frame = +3
Query: 369 TNLLTDPFSSSSSTPPPPSTTSPPPPPPPKPPLP 470
T+L ++P S+SSS PPPPS PPPPPP P P
Sbjct: 329 TSLSSNPTSTSSSEPPPPSPPPPPPPPPAPAPAP 362
Score = 38.9 bits (89), Expect = 0.044
Identities = 16/29 (55%), Positives = 19/29 (65%)
Frame = +3
Query: 384 DPFSSSSSTPPPPSTTSPPPPPPPKPPLP 470
DP +S SS P S++ PPPP PP PP P
Sbjct: 326 DPDTSLSSNPTSTSSSEPPPPSPPPPPPP 354
Score = 32.3 bits (72), Expect = 4.2
Identities = 13/38 (34%), Positives = 20/38 (52%)
Frame = +3
Query: 411 PPPPSTTSPPPPPPPKPPLPLLFTLVSVSPSTNSGSAE 524
P PP+ P PPPP PP ++ S + +T+S +
Sbjct: 393 PEPPAXPPKPQPPPPSPPEQKPTSITSSTSTTSSSKTK 430
>gb|AAM44363.1| hypothetical protein [Dictyostelium discoideum]
gi|28828387|gb|AAM09303.2| similar to Plasmodium
lophurae. Histidine-rich glycoprotein precursor
[Dictyostelium discoideum]
Length = 233
Score = 47.8 bits (112), Expect = 1e-04
Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 1/70 (1%)
Frame = +1
Query: 322 FTTDPKHPKSFHQ*NSQTFSPTLSPHHLQ-LHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
+ D +P + + +P +PHH LHH HHHHHHH + H++ H H
Sbjct: 40 YQLDVNNPHNPNNNPHNPHNPNNNPHHPHHLHHHHHHHHHHHHHHHHHHHHHHHHHHPHH 99
Query: 499 PVLIPAQLNH 528
P P +H
Sbjct: 100 PHHHPHHHHH 109
Score = 44.3 bits (103), Expect = 0.001
Identities = 15/35 (42%), Positives = 18/35 (50%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSHP 501
HH HH HHHHHHH + H++ H HP
Sbjct: 121 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHP 155
Score = 44.3 bits (103), Expect = 0.001
Identities = 15/35 (42%), Positives = 18/35 (50%)
Frame = +1
Query: 394 PHHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
PHH HH HHHHHHH + H++ H H
Sbjct: 110 PHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH 144
Score = 44.3 bits (103), Expect = 0.001
Identities = 15/35 (42%), Positives = 18/35 (50%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSHP 501
HH HH HHHHHHH + H++ H HP
Sbjct: 125 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHPHHHP 159
Score = 43.5 bits (101), Expect = 0.002
Identities = 18/53 (33%), Positives = 21/53 (38%)
Frame = +1
Query: 340 HPKSFHQ*NSQTFSPTLSPHHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
HP H P HH HH HHHHHHH + H++ H H
Sbjct: 96 HPHHPHHHPHHHHHPHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH 148
Score = 43.1 bits (100), Expect = 0.002
Identities = 18/53 (33%), Positives = 22/53 (40%)
Frame = +1
Query: 340 HPKSFHQ*NSQTFSPTLSPHHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
H H + P PHH HH HHHHHHH + H++ H H
Sbjct: 86 HHHHHHHHHHHPHHPHHHPHH---HHHPHHHHHHHHHHHHHHHHHHHHHHHHH 135
Score = 42.4 bits (98), Expect = 0.004
Identities = 15/35 (42%), Positives = 17/35 (47%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSHP 501
HH HH HHHHHHH + H+ H HP
Sbjct: 129 HHHHHHHHHHHHHHHHHHHHHHHHHHPHHHPHPHP 163
Score = 41.6 bits (96), Expect = 0.007
Identities = 14/34 (41%), Positives = 17/34 (49%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
HH HH HHHHHHH + H++ H H
Sbjct: 117 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH 150
Score = 41.6 bits (96), Expect = 0.007
Identities = 14/34 (41%), Positives = 17/34 (49%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
HH HH HHHHHHH + H++ H H
Sbjct: 116 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH 149
Score = 41.6 bits (96), Expect = 0.007
Identities = 14/34 (41%), Positives = 17/34 (49%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
HH HH HHHHHHH + H++ H H
Sbjct: 123 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHPH 156
Score = 41.6 bits (96), Expect = 0.007
Identities = 14/34 (41%), Positives = 17/34 (49%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
HH HH HHHHHHH + H++ H H
Sbjct: 120 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH 153
Score = 41.6 bits (96), Expect = 0.007
Identities = 14/34 (41%), Positives = 17/34 (49%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
HH HH HHHHHHH + H++ H H
Sbjct: 119 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH 152
Score = 41.6 bits (96), Expect = 0.007
Identities = 14/34 (41%), Positives = 17/34 (49%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
HH HH HHHHHHH + H++ H H
Sbjct: 118 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH 151
Score = 40.8 bits (94), Expect = 0.012
Identities = 14/34 (41%), Positives = 17/34 (49%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
HH HH HHHHHHH + H++ H H
Sbjct: 124 HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHPHH 157
Score = 39.7 bits (91), Expect = 0.026
Identities = 18/56 (32%), Positives = 22/56 (39%)
Frame = +1
Query: 331 DPKHPKSFHQ*NSQTFSPTLSPHHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSH 498
+P HP H + HH HH HHHHHHH + H+ H H
Sbjct: 63 NPHHPHHLHHHHHHH-------HHHHHHHHHHHHHHHHHHHPHHPHHHPHHHHHPH 111
Score = 38.9 bits (89), Expect = 0.044
Identities = 17/39 (43%), Positives = 19/39 (48%)
Frame = +1
Query: 397 HHLQLHHRLQQHHHHHHHRRNLRFHYYLHLSRSHPVLIP 513
HH HH HHHHHHH + H + H HP L P
Sbjct: 136 HHHHHHHHHHHHHHHHHHHPHHHPHPHPH-PHPHPHLHP 173
>emb|CAC43457.1| protease 1 [Pneumocystis carinii]
Length = 938
Score = 47.8 bits (112), Expect = 1e-04
Identities = 20/34 (58%), Positives = 24/34 (69%)
Frame = +3
Query: 369 TNLLTDPFSSSSSTPPPPSTTSPPPPPPPKPPLP 470
T+L ++P S+SSS PPPPS PPPPPP P P
Sbjct: 768 TSLSSNPTSTSSSEPPPPSPPPPPPPPPAPAPAP 801
Score = 38.9 bits (89), Expect = 0.044
Identities = 16/29 (55%), Positives = 19/29 (65%)
Frame = +3
Query: 384 DPFSSSSSTPPPPSTTSPPPPPPPKPPLP 470
DP +S SS P S++ PPPP PP PP P
Sbjct: 765 DPDTSLSSNPTSTSSSEPPPPSPPPPPPP 793
Score = 38.5 bits (88), Expect = 0.058
Identities = 22/57 (38%), Positives = 26/57 (45%)
Frame = +3
Query: 357 PMKLTNLLTDPFSSSSSTPPPPSTTSPPPPPPPKPPLPLLFTLVSVSPSTNSGSAEP 527
P T L P S+SSS PPPP+ P P P P L + PS+ GS P
Sbjct: 653 PEPTTTLPPTPSSTSSSRPPPPAPQPQPQPQPQPDPGSLPSSDPESPPSSEPGSQPP 709
Score = 37.7 bits (86), Expect = 0.099
Identities = 23/62 (37%), Positives = 26/62 (41%)
Frame = +3
Query: 333 PQAPQILSPMKLTNLLTDPFSSSSSTPPPPSTTSPPPPPPPKPPLPLLFTLVSVSPSTNS 512
PQ PQ P P + PP P PPPPPPP P T ++ S ST S
Sbjct: 822 PQPPQPQPPQ--------PQPEPPAPPPKPQPPQPPPPPPPPEQKP---TSITSSTSTTS 870
Query: 513 GS 518
S
Sbjct: 871 SS 872
Score = 34.7 bits (78), Expect = 0.84
Identities = 14/37 (37%), Positives = 20/37 (53%)
Frame = +3
Query: 414 PPPSTTSPPPPPPPKPPLPLLFTLVSVSPSTNSGSAE 524
PPP P PPPPP PP ++ S + +T+S +
Sbjct: 839 PPPKPQPPQPPPPPPPPEQKPTSITSSTSTTSSSKTK 875
>ref|NP_172666.1| unknown protein; protein id: At1g12020.1, supported by cDNA:
gi_18252908 [Arabidopsis thaliana]
gi|25372783|pir||C86255 protein F12F1.11 [imported] -
Arabidopsis thaliana gi|3157952|gb|AAC17635.1| F12F1.11
[Arabidopsis thaliana] gi|18252909|gb|AAL62381.1|
unknown protein [Arabidopsis thaliana]
Length = 226
Score = 47.4 bits (111), Expect = 1e-04
Identities = 33/85 (38%), Positives = 43/85 (49%), Gaps = 9/85 (10%)
Frame = -2
Query: 559 SELVKVSSPCNGSAEPELVLGE--------TETSVNSSGSGGFGGGGGGGDVVEGGGGVE 404
S L+ SS +E LGE T +S + GG GG + G E
Sbjct: 142 SSLLASSSFSTDDSEIPSRLGESVVNSCPCTSSSELTQDGGGCSGGLEPMEFFCAGDACE 201
Query: 403 D-DEEKGSVRRFVSFIGERIWGAWG 332
+EEKG+VRRFVSFIGE+++G WG
Sbjct: 202 KVEEEKGTVRRFVSFIGEKVFGVWG 226
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 545,967,176
Number of Sequences: 1393205
Number of extensions: 15047917
Number of successful extensions: 460219
Number of sequences better than 10.0: 5596
Number of HSP's better than 10.0 without gapping: 102857
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 281839
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20382500157
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)