FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0215, 111 aa
1>>>pF1KE0215 111 - 111 aa - 111 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.1921+/-0.000272; mu= 16.1784+/- 0.017
mean_var=53.7500+/-10.363, 0's: 0 Z-trim(119.4): 16 B-trim: 346 in 1/49
Lambda= 0.174938
statistics sampled from 33330 (33346) to 33330 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.773), E-opt: 0.2 (0.391), width: 16
Scan time: 3.560
The best scores are: opt bits E(85289)
NP_004878 (OMIM: 604186) C-X-C motif chemokine 14 ( 111) 752 196.5 7.4e-51
NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 p ( 107) 167 48.8 2e-06
NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 p ( 107) 154 45.6 1.9e-05
NP_001502 (OMIM: 155730) growth-regulated alpha pr ( 107) 153 45.3 2.3e-05
NP_002407 (OMIM: 601704) C-X-C motif chemokine 9 p ( 125) 125 38.3 0.0035
>>NP_004878 (OMIM: 604186) C-X-C motif chemokine 14 prec (111 aa)
initn: 752 init1: 752 opt: 752 Z-score: 1034.0 bits: 196.5 E(85289): 7.4e-51
Smith-Waterman score: 752; 100.0% identity (100.0% similar) in 111 aa overlap (1-111:1-111)
10 20 30 40 50 60
pF1KE0 MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKY
10 20 30 40 50 60
70 80 90 100 110
pF1KE0 PHCEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE
:::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 PHCEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE
70 80 90 100 110
>>NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 precu (107 aa)
initn: 163 init1: 60 opt: 167 Z-score: 236.3 bits: 48.8 E(85289): 2e-06
Smith-Waterman score: 167; 33.3% identity (60.4% similar) in 96 aa overlap (8-97:8-98)
10 20 30 40 50
pF1KE0 MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGS------KCKCSRKGPKIRYSDVKKL
: : . ::: .:::::::. . :. :. .:.: . :. ......
NP_002 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 EMKPKYPHCEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE
..: ::: . :: : :. ::. ::.: .:..:.
NP_002 KVKSPGPHCAQTEVIATLKN-----GQKACLNPASPMVKKIIEKMLKNGKSN
70 80 90 100
>>NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 precu (107 aa)
initn: 153 init1: 60 opt: 154 Z-score: 218.6 bits: 45.6 E(85289): 1.9e-05
Smith-Waterman score: 154; 30.2% identity (60.4% similar) in 96 aa overlap (8-97:8-98)
10 20 30 40 50
pF1KE0 MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGS------KCKCSRKGPKIRYSDVKKL
: : . ::: .:::::::. . :. :. .:.: . :. ......
NP_002 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 EMKPKYPHCEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE
... ::: . :: : :. :.. ::.: ....:.
NP_002 NVRSPGPHCAQTEVIATLKN-----GKKACLNPASPMVQKIIEKILNKGSTN
70 80 90 100
>>NP_001502 (OMIM: 155730) growth-regulated alpha protei (107 aa)
initn: 135 init1: 60 opt: 153 Z-score: 217.2 bits: 45.3 E(85289): 2.3e-05
Smith-Waterman score: 153; 32.3% identity (59.4% similar) in 96 aa overlap (8-97:8-98)
10 20 30 40 50
pF1KE0 MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGS------KCKCSRKGPKIRYSDVKKL
: : . ::: .:::::::. :. :. .:.: . :. ......
NP_001 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 EMKPKYPHCEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE
..: ::: . :: : :. :.. ::.: .:..:.
NP_001 NVKSPGPHCAQTEVIATLKN-----GRKACLNPASPIVKKIIEKMLNSDKSN
70 80 90 100
>>NP_002407 (OMIM: 601704) C-X-C motif chemokine 9 precu (125 aa)
initn: 81 init1: 47 opt: 125 Z-score: 178.1 bits: 38.3 E(85289): 0.0035
Smith-Waterman score: 125; 29.5% identity (58.9% similar) in 95 aa overlap (15-107:9-98)
10 20 30 40 50
pF1KE0 MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGSKCKC-SRKGPKIRYSDVKKLEMKPK
::. ::.:. . : : ..:.: : . :. ...: :..
NP_002 MKKSGVLFLLGIILLVLIGVQGTPVVRKGRCSCISTNQGTIHLQSLKDLKQFAP
10 20 30 40 50
60 70 80 90 100 110
pF1KE0 YPHCEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIK-WYNAWNEKRRVYEE
: ::. .: : :. : . ::.: ..:..:: : . ..:..
NP_002 SPSCEKIEIIATLKN-----GVQTCLNPDSADVKELIKKWEKQVSQKKKQKNGKKHQKKK
60 70 80 90 100
NP_002 VLKVRKSQRSRQKKTT
110 120
111 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 20:58:35 2016 done: Thu Nov 3 20:58:36 2016
Total Scan time: 3.560 Total Display time: -0.040
Function used was FASTA [36.3.4 Apr, 2011]