FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1401, 114 aa
  1>>>pF1KE1401 114 - 114 aa - 114 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.3060+/-0.000254; mu= 16.3999+/- 0.016
 mean_var=61.4809+/-12.216, 0's: 0 Z-trim(121.3): 37 B-trim: 157 in 1/55
 Lambda= 0.163570
 statistics sampled from 37582 (37622) to 37582 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.8), E-opt: 0.2 (0.441), width: 16
 Scan time: 4.240

The best scores are:                                        opt  bits E(85289)
NP_002985 (OMIM: 600324) C-X-C motif chemokine 5 p ( 114)  741 181.9   2e-46
NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 p ( 114)  599 148.3 2.4e-36
NP_001502 (OMIM: 155730) growth-regulated alpha pr ( 107)  313  80.8 4.8e-16
NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 p ( 107)  297  77.1 6.6e-15
NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 p ( 107)  293  76.1 1.3e-14
NP_002695 (OMIM: 121010) platelet basic protein pr ( 128)  284  74.1 6.3e-14
NP_002610 (OMIM: 173460) platelet factor 4 precurs ( 101)  260  68.3 2.7e-12
XP_005265753 (OMIM: 173460) PREDICTED: platelet fa ( 110)  237  62.9 1.2e-10
NP_002611 (OMIM: 173461) platelet factor 4 variant ( 104)  233  61.9 2.3e-10
NP_000575 (OMIM: 146930) interleukin-8 precursor [ (  99)  220  58.9 1.8e-09
NP_002407 (OMIM: 601704) C-X-C motif chemokine 9 p ( 125)  207  55.9 1.8e-08
NP_001556 (OMIM: 147310) C-X-C motif chemokine 10  (  98)  159  44.5   4e-05
NP_006410 (OMIM: 605149) C-X-C motif chemokine 13  ( 109)  154  43.3 9.7e-05
XP_006714126 (OMIM: 605149) PREDICTED: C-X-C motif ( 109)  154  43.3 9.7e-05
NP_001029058 (OMIM: 600835,609423) stromal cell-de ( 119)  133  38.4  0.0032
NP_006265 (OMIM: 602227) C-C motif chemokine 19 pr (  98)  132  38.1  0.0033
XP_016879020 (OMIM: 602957) PREDICTED: C-C motif c (  93)  126  36.6  0.0084
NP_002981 (OMIM: 602957) C-C motif
chemokine 22 pr ( 93) 126 36.6 0.0084

>>NP_002985 (OMIM: 600324) C-X-C motif chemokine 5 precu (114 aa)
 initn: 741 init1: 741 opt: 741 Z-score: 954.5 bits: 181.9 E(85289): 2e-46
Smith-Waterman score: 741; 100.0% identity (100.0% similar) in 114 aa overlap (1-114:1-114)

       10 20 30 40 50 60
pF1KE1 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP
       10 20 30 40 50 60

       70 80 90 100 110
pF1KE1 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       70 80 90 100 110

>>NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 precu (114 aa)
 initn: 645 init1: 599 opt: 599 Z-score: 773.4 bits: 148.3 E(85289): 2.4e-36
Smith-Waterman score: 599; 79.8% identity (93.0% similar) in 114 aa overlap (1-114:1-114)

       10 20 30 40 50 60
pF1KE1 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP
       ::: :::::::::::.::::::.:::::: :::.:::::..::: ::::.::..: :.:
NP_002 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP
       10 20 30 40 50 60

       70 80 90 100 110
pF1KE1 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       : :..:::: :::::::::::::::::..::::::::::::::::::.:::.:
NP_002 KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN
       70 80 90 100 110

>>NP_001502 (OMIM: 155730) growth-regulated alpha protei (107 aa)
 initn: 354 init1: 283 opt: 313 Z-score: 409.0 bits: 80.8 E(85289): 4.8e-16
Smith-Waterman score: 313; 48.2% identity (76.4% similar) in 110 aa overlap (6-114:2-107)

       10 20 30 40 50
pF1KE1 MSLLSSRAARVPGPSSS-LCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVH
       .::: .::. : . .:::::. : : :: :.: :::: :::: ::.:
NP_001 MARAALSAAPSNPRLLRVALLLLLLVAAGRRA-AG--ASVATELRCQCLQTLQGIH
       10 20 30 40 50

       60 70 80 90 100 110
pF1KE1 PKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       :: :....: . ::.:...::.:.::::.. ::.: .:..::.:.:.:..
.: :
NP_001 PKNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNS-DKSN
       60 70 80 90 100

>>NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 precu (107 aa)
 initn: 315 init1: 283 opt: 297 Z-score: 388.6 bits: 77.1 E(85289): 6.6e-15
Smith-Waterman score: 297; 46.9% identity (76.5% similar) in 98 aa overlap (14-111:10-105)

       10 20 30 40 50 60
pF1KE1 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP
       ::. ..::::: . .:: :.:. :::: :::: ::.:
NP_002 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAG--ASVVTELRCQCLQTLQGIHL
       10 20 30 40 50

       70 80 90 100 110
pF1KE1 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       : :....: . ::.:...::.:.:::::. ::.: .:...:.:.:::. :.
NP_002 KNIQSVNVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN
       60 70 80 90 100

>>NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 precu (107 aa)
 initn: 313 init1: 270 opt: 293 Z-score: 383.5 bits: 76.1 E(85289): 1.3e-14
Smith-Waterman score: 293; 44.3% identity (76.4% similar) in 106 aa overlap (6-110:2-104)

       10 20 30 40 50
pF1KE1 MSLLSSRAARVPGPSSS-LCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVH
       .::. .::. : . .:::::. . :...: :. :::: :::: ::.:
NP_002 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLAT---ELRCQCLQTLQGIH
       10 20 30 40 50

       60 70 80 90 100 110
pF1KE1 PKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       : :....: . ::.:...::.:.::::.. ::.: .:..::.:.:.: .:
NP_002 LKNIQSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN
       60 70 80 90 100

>>NP_002695 (OMIM: 121010) platelet basic protein prepro (128 aa)
 initn: 327 init1: 284 opt: 284 Z-score: 371.0 bits: 74.1 E(85289): 6.3e-14
Smith-Waterman score: 284; 56.2% identity (90.6% similar) in 64 aa overlap (46-109:60-123)

       20 30 40 50 60 70
pF1KE1 SSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHPKMISNLQVFAIGPQC
       ::::.:..::.:.::: :..:.:..
: .:
NP_002 LTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHC
       30 40 50 60 70 80

       80 90 100 110
pF1KE1 SKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       ..:::.:.::.:..:::::.:: .::..:: : :
NP_002 NQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD
       90 100 110 120

>>NP_002610 (OMIM: 173460) platelet factor 4 precursor [ (101 aa)
 initn: 240 init1: 199 opt: 260 Z-score: 341.7 bits: 68.3 E(85289): 2.7e-12
Smith-Waterman score: 260; 40.2% identity (75.3% similar) in 97 aa overlap (15-108:4-100)

       10 20 30 40 50
pF1KE1 MSLLSSRAARVPGPSSSLCAL---LVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQG
       ....:: :..: :: : .: :. : .:.:.:..::.
NP_002 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQ
       10 20 30 40

       60 70 80 90 100 110
pF1KE1 VHPKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       :.:. :..:.:. ::.: ....:.::::..:::: .::. ::.:.:.:.
NP_002 VRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
       50 60 70 80 90 100

>>XP_005265753 (OMIM: 173460) PREDICTED: platelet factor (110 aa)
 initn: 244 init1: 199 opt: 237 Z-score: 311.9 bits: 62.9 E(85289): 1.2e-10
Smith-Waterman score: 237; 46.0% identity (87.3% similar) in 63 aa overlap (46-108:47-109)

       20 30 40 50 60 70
pF1KE1 SSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHPKMISNLQVFAIGPQC
       .:.:.:..::. :.:. :..:.:. ::.:
XP_005 VPGAAPAPPTWLEQLLSGGGVIYAEAEEDGDLQCLCVKTTSQVRPRHITSLEVIKAGPHC
       20 30 40 50 60 70

       80 90 100 110
pF1KE1 SKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       ....:.::::..:::: .::. ::.:.:.:.
XP_005 PTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
       80 90 100 110

>>NP_002611 (OMIM: 173461) platelet factor 4 variant pre (104 aa)
 initn: 206 init1: 174 opt: 233 Z-score: 307.1 bits: 61.9 E(85289): 2.3e-10
Smith-Waterman score: 233; 38.8% identity (68.0% similar) in 103 aa overlap (6-108:2-103)

       10 20 30 40 50 60
pF1KE1 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP
       : ::: .. .: : ::: : .: : : .:.:.:..::. :.:
NP_002 MSSAARSRLTRATRQEMLFLALLLL-PVVVAFARAEAEEDGDLQCLCVKTTSQVRP
       10 20 30 40 50

       70 80 90 100 110
pF1KE1 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       . :..:.:.
::.: ....:.::::..:::: .: . ::.:.. :.
NP_002 RHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
       60 70 80 90 100

>>NP_000575 (OMIM: 146930) interleukin-8 precursor [Homo (99 aa)
 initn: 242 init1: 174 opt: 220 Z-score: 290.8 bits: 58.9 E(85289): 1.8e-09
Smith-Waterman score: 220; 34.0% identity (77.7% similar) in 94 aa overlap (15-107:2-93)

       10 20 30 40 50
pF1KE1 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQT-TQGVH
       .:.: . :. .:.. ... : .: .:::: :..: .. :
NP_000 MTSKLAVALLAAFLISAALCEGAVLPRSA--KELRCQCIKTYSKPFH
       10 20 30 40

       60 70 80 90 100 110
pF1KE1 PKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
       ::.:..:.:. ::.:...:....:..:.:.::::. ....:..:.:
NP_000 PKFIKELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS
       50 60 70 80 90

114 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 02:09:54 2016 done: Mon Nov 7 02:09:55 2016
 Total Scan time: 4.240 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]