FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3014, 104 aa 1>>>pF1KE3014 104 - 104 aa - 104 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9060+/-0.000283; mu= 10.7828+/- 0.018 mean_var=50.0982+/- 9.837, 0's: 0 Z-trim(117.3): 20 B-trim: 0 in 0/54 Lambda= 0.181202 statistics sampled from 29228 (29248) to 29228 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.75), E-opt: 0.2 (0.343), width: 16 Scan time: 4.640 The best scores are: opt bits E(85289) NP_002611 (OMIM: 173461) platelet factor 4 variant ( 104) 660 179.5 8.3e-46 NP_002610 (OMIM: 173460) platelet factor 4 precurs ( 101) 531 145.8 1.1e-35 XP_005265753 (OMIM: 173460) PREDICTED: platelet fa ( 110) 440 122.0 1.8e-28 NP_001502 (OMIM: 155730) growth-regulated alpha pr ( 107) 244 70.8 4.6e-13 NP_002695 (OMIM: 121010) platelet basic protein pr ( 128) 237 69.0 1.9e-12 NP_002985 (OMIM: 600324) C-X-C motif chemokine 5 p ( 114) 233 67.9 3.6e-12 NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 p ( 107) 225 65.8 1.5e-11 NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 p ( 114) 221 64.8 3.2e-11 NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 p ( 107) 207 61.1 3.8e-10 NP_000575 (OMIM: 146930) interleukin-8 precursor [ ( 99) 183 54.8 2.7e-08 NP_001556 (OMIM: 147310) C-X-C motif chemokine 10 ( 98) 148 45.7 1.5e-05 NP_002407 (OMIM: 601704) C-X-C motif chemokine 9 p ( 125) 127 40.2 0.00086 NP_006410 (OMIM: 605149) C-X-C motif chemokine 13 ( 109) 122 38.9 0.0019 XP_006714126 (OMIM: 605149) PREDICTED: C-X-C motif ( 109) 122 38.9 0.0019 NP_004878 (OMIM: 604186) C-X-C motif chemokine 14 ( 111) 117 37.6 0.0047 >>NP_002611 (OMIM: 173461) platelet factor 4 variant pre (104 aa) initn: 660 init1: 660 opt: 660 Z-score: 943.4 bits: 179.5 E(85289): 8.3e-46 Smith-Waterman score: 660; 100.0% identity (100.0% similar) in 104 aa overlap (1-104:1-104) 10 20 30 40 50 60 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS 10 20 30 40 50 60 70 80 90 100 pF1KE3 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :::::::::::::::::::::::::::::::::::::::::::: NP_002 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES 70 80 90 100 >>NP_002610 (OMIM: 173460) platelet factor 4 precursor [ (101 aa) initn: 542 init1: 529 opt: 531 Z-score: 761.3 bits: 145.8 E(85289): 1.1e-35 Smith-Waterman score: 531; 84.6% identity (89.4% similar) in 104 aa overlap (1-104:1-101) 10 20 30 40 50 60 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS ::::: :.: .:::.:::::.::::: ::::::::::::::::::::::::::: NP_002 MSSAAG---FCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQVRPRHITS 10 20 30 40 50 70 80 90 100 pF1KE3 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES ::::::::::::::::::::::::::::::: :::::::. ::: NP_002 LEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 60 70 80 90 100 >>XP_005265753 (OMIM: 173460) PREDICTED: platelet factor (110 aa) initn: 440 init1: 440 opt: 440 Z-score: 632.2 bits: 122.0 E(85289): 1.8e-28 Smith-Waterman score: 440; 95.8% identity (97.2% similar) in 71 aa overlap (34-104:40-110) 10 20 30 40 50 60 pF1KE3 AARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITSLEV :::::::::::::::::::::::::::::: XP_005 PAECLATVPGAAPAPPTWLEQLLSGGGVIYAEAEEDGDLQCLCVKTTSQVRPRHITSLEV 10 20 30 40 50 60 70 80 90 100 pF1KE3 IKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :::::::::::::::::::::::::::: :::::::. ::: XP_005 IKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 70 80 90 100 110 >>NP_001502 (OMIM: 155730) growth-regulated alpha protei (107 aa) initn: 182 init1: 141 opt: 244 Z-score: 355.5 bits: 70.8 E(85289): 4.6e-13 Smith-Waterman score: 244; 43.1% identity (71.6% similar) in 102 aa overlap (5-104:2-103) 10 20 30 40 50 pF1KE3 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI ::. :. : . ..: .::::: .:.: :: : .:.: :..: . ..:..: NP_001 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNI 10 20 30 40 50 60 70 80 90 100 pF1KE3 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :..: . :::: ...::::::::: ::. . . ::::.. :.: NP_001 QSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN 60 70 80 90 100 >>NP_002695 (OMIM: 121010) platelet basic protein prepro (128 aa) initn: 217 init1: 196 opt: 237 Z-score: 344.3 bits: 69.0 E(85289): 1.9e-12 Smith-Waterman score: 241; 43.6% identity (70.3% similar) in 101 aa overlap (15-102:21-121) 10 20 30 40 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVA--------FARAEAEE-DGDL--- : .:.:.::: .. . .:... : :.:: NP_002 MSLRLDTTPSCNSARPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAE 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 -QCLCVKTTSQVRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEH .:.:.:::: ..:..: ::::: : :: ...:::::.::::::: .: :::.... NP_002 LRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKK 70 80 90 100 110 120 pF1KE3 LES : NP_002 LAGDESAD >>NP_002985 (OMIM: 600324) C-X-C motif chemokine 5 precu (114 aa) initn: 206 init1: 174 opt: 233 Z-score: 339.5 bits: 67.9 E(85289): 3.6e-12 Smith-Waterman score: 233; 38.8% identity (68.0% similar) in 103 aa overlap (2-103:6-108) 10 20 30 40 50 pF1KE3 MSSAARSRLTRATRQEMLFLALLLL-PVVVAFARAEAEEDGDLQCLCVKTTSQVRP : ::: .. .: : ::: : .: : : .:.:.:..::. :.: NP_002 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 RHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES . :..:.:. ::.: ....:.::::..:::: .: . ::.:.. :. NP_002 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN 70 80 90 100 110 >>NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 precu (107 aa) initn: 167 init1: 133 opt: 225 Z-score: 328.6 bits: 65.8 E(85289): 1.5e-11 Smith-Waterman score: 225; 40.2% identity (70.6% similar) in 102 aa overlap (5-104:2-103) 10 20 30 40 50 pF1KE3 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI ::. :. : . ..: .::::: .:.: :: : .:.: :..: . .. ..: NP_002 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNI 10 20 30 40 50 60 70 80 90 100 pF1KE3 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :..: . :::: ...:::::::.: ::. . . ::::.. :.. NP_002 QSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN 60 70 80 90 100 >>NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 precu (114 aa) initn: 199 init1: 141 opt: 221 Z-score: 322.5 bits: 64.8 E(85289): 3.2e-11 Smith-Waterman score: 221; 37.5% identity (66.3% similar) in 104 aa overlap (2-104:6-109) 10 20 30 40 50 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPV-VVAFARAEAEEDGDLQCLCVKTTSQVRP : ::: .. .: : ::: : .: : . .:.: :...: .: : NP_002 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 RHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES . : .:.:. :::.: ....:.::::...::: .: . ::.:.. :.: NP_002 KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN 70 80 90 100 110 >>NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 precu (107 aa) initn: 182 init1: 126 opt: 207 Z-score: 303.2 bits: 61.1 E(85289): 3.8e-10 Smith-Waterman score: 207; 39.0% identity (70.0% similar) in 100 aa overlap (5-102:2-101) 10 20 30 40 50 pF1KE3 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI :.. :. : . ..: .::::: .:.: :: : .:.: :..: . .. ..: NP_002 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNI 10 20 30 40 50 60 70 80 90 100 pF1KE3 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :..: . :::: ...:::::::.: ::. . . .:::.. : NP_002 QSVNVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN 60 70 80 90 100 >>NP_000575 (OMIM: 146930) interleukin-8 precursor [Homo (99 aa) initn: 139 init1: 131 opt: 183 Z-score: 269.8 bits: 54.8 E(85289): 2.7e-08 Smith-Waterman score: 183; 28.3% identity (69.6% similar) in 92 aa overlap (13-103:3-94) 10 20 30 40 50 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQ-VRPRHIT .. . .:: .:. ... . . . .:.: :.:: :. .:. : NP_000 MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPKFIK 10 20 30 40 50 60 70 80 90 100 pF1KE3 SLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :.::..:::: ....:. :..::..::: . ...... :. NP_000 ELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS 60 70 80 90 104 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:28:29 2016 done: Sun Nov 6 04:28:30 2016 Total Scan time: 4.640 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]