FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1416, 107 aa 1>>>pF1KE1416 107 - 107 aa - 107 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.5564+/-0.000273; mu= 14.0557+/- 0.017 mean_var=57.0291+/-11.327, 0's: 0 Z-trim(119.3): 37 B-trim: 137 in 1/52 Lambda= 0.169834 statistics sampled from 33180 (33221) to 33180 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.39), width: 16 Scan time: 4.790 The best scores are: opt bits E(85289) NP_001502 (OMIM: 155730) growth-regulated alpha pr ( 107) 683 174.4 3.2e-44 NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 p ( 107) 610 156.5 7.7e-39 NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 p ( 107) 602 154.5 3e-38 NP_002985 (OMIM: 600324) C-X-C motif chemokine 5 p ( 114) 313 83.7 6.5e-17 NP_002695 (OMIM: 121010) platelet basic protein pr ( 128) 299 80.3 7.7e-16 NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 p ( 114) 287 77.4 5.4e-15 NP_002610 (OMIM: 173460) platelet factor 4 precurs ( 101) 265 71.9 2.1e-13 XP_005265753 (OMIM: 173460) PREDICTED: platelet fa ( 110) 245 67.1 6.6e-12 NP_002611 (OMIM: 173461) platelet factor 4 variant ( 104) 244 66.8 7.4e-12 NP_000575 (OMIM: 146930) interleukin-8 precursor [ ( 99) 238 65.3 2e-11 NP_002407 (OMIM: 601704) C-X-C motif chemokine 9 p ( 125) 211 58.8 2.3e-09 NP_001556 (OMIM: 147310) C-X-C motif chemokine 10 ( 98) 161 46.4 9.4e-06 NP_004878 (OMIM: 604186) C-X-C motif chemokine 14 ( 111) 153 44.5 4e-05 NP_001029058 (OMIM: 600835,609423) stromal cell-de ( 119) 140 41.4 0.00039 NP_006410 (OMIM: 605149) C-X-C motif chemokine 13 ( 109) 139 41.1 0.00043 XP_006714126 (OMIM: 605149) PREDICTED: C-X-C motif ( 109) 139 41.1 0.00043 NP_954637 (OMIM: 600835,609423) stromal cell-deriv ( 89) 138 40.8 0.00043 NP_000600 (OMIM: 600835,609423) stromal cell-deriv ( 93) 138 40.8 0.00045 NP_001171605 (OMIM: 600835,609423) stromal cell-de ( 140) 139 41.2 0.00052 NP_005400 (OMIM: 604852) C-X-C motif chemokine 11 ( 94) 127 38.1 0.0029 >>NP_001502 (OMIM: 155730) growth-regulated alpha protei (107 aa) initn: 683 init1: 683 opt: 683 Z-score: 915.0 bits: 174.4 E(85289): 3.2e-44 Smith-Waterman score: 683; 100.0% identity (100.0% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN ::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN 70 80 90 100 >>NP_002080 (OMIM: 139110) C-X-C motif chemokine 2 precu (107 aa) initn: 610 init1: 610 opt: 610 Z-score: 818.3 bits: 156.5 E(85289): 7.7e-39 Smith-Waterman score: 610; 89.7% identity (97.2% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV ::::.:::::::::::::::::::::::.:::::: .:::::::::::::::: :::::: NP_002 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN .::::::::::::::::::::.:::::::::.:::::::::.. ::: NP_002 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN 70 80 90 100 >>NP_002081 (OMIM: 139111) C-X-C motif chemokine 3 precu (107 aa) initn: 658 init1: 602 opt: 602 Z-score: 807.7 bits: 154.5 E(85289): 3e-38 Smith-Waterman score: 602; 86.9% identity (98.1% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV ::.:.:::::::::::::::::::::::.::::::::.::::::::::::::: :::::: NP_002 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN ::.::::::::::::::::::.:::::::::.:.:::::.::. ..: NP_002 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN 70 80 90 100 >>NP_002985 (OMIM: 600324) C-X-C motif chemokine 5 precu (114 aa) initn: 354 init1: 283 opt: 313 Z-score: 424.6 bits: 83.7 E(85289): 6.5e-17 Smith-Waterman score: 313; 48.2% identity (76.4% similar) in 110 aa overlap (2-107:6-114) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRA-AG--ASVATELRCQCLQTLQGIH .::: .::. : . .:::::. : : :: :.: :::: :::: ::.: NP_002 MSLLSSRAARVPGPSSS-LCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVH 10 20 30 40 50 60 70 80 90 100 pF1KE1 PKNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNS-DKSN :: :....: . ::.:...::.:.::::.. ::.: .:..::.:.:.:.. .: : NP_002 PKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN 60 70 80 90 100 110 >>NP_002695 (OMIM: 121010) platelet basic protein prepro (128 aa) initn: 362 init1: 296 opt: 299 Z-score: 405.4 bits: 80.3 E(85289): 7.7e-16 Smith-Waterman score: 299; 52.4% identity (81.7% similar) in 82 aa overlap (26-106:45-126) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAA-GASVATELRCQCLQTLQGIHP .: :.. . ... .::::.:..: .:::: NP_002 RPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHP 20 30 40 50 60 70 60 70 80 90 100 pF1KE1 KNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN :::::..: . : :: :.:::::::.::: ::.: .: .:::..: : .:.: NP_002 KNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD 80 90 100 110 120 >>NP_002984 (OMIM: 138965) C-X-C motif chemokine 6 precu (114 aa) initn: 296 init1: 249 opt: 287 Z-score: 390.2 bits: 77.4 E(85289): 5.4e-15 Smith-Waterman score: 287; 43.1% identity (75.2% similar) in 109 aa overlap (2-107:6-114) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAG--ASVATELRCQCLQTLQGIHP .::: .::. .:::::: . .:: ..: ::::: ::.. ..: NP_002 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP 10 20 30 40 50 60 60 70 80 90 100 pF1KE1 KNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNS-DKSN :.: ...: ::.:...::.:.::::...::.: .:..::.:.:.:.: .:.: NP_002 KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN 70 80 90 100 110 >>NP_002610 (OMIM: 173460) platelet factor 4 precursor [ (101 aa) initn: 189 init1: 164 opt: 265 Z-score: 361.8 bits: 71.9 E(85289): 2.1e-13 Smith-Waterman score: 265; 46.6% identity (71.8% similar) in 103 aa overlap (1-103:1-101) 10 20 30 40 50 60 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV :. :: : : : :: ..:::: ::.: .: : .:.: :..: . ..:..: :. NP_002 MSSAAGFCA-SRPGLLFLGLLLLPLVVA-FASAEAEEDGDLQCLCVKTTSQVRPRHITSL 10 20 30 40 50 70 80 90 100 pF1KE1 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN .: . :::: ...::::::::: ::. .:. ::::.:.:.: NP_002 EVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 60 70 80 90 100 >>XP_005265753 (OMIM: 173460) PREDICTED: platelet factor (110 aa) initn: 201 init1: 164 opt: 245 Z-score: 334.8 bits: 67.1 E(85289): 6.6e-12 Smith-Waterman score: 245; 42.7% identity (68.8% similar) in 96 aa overlap (8-103:20-110) 10 20 30 40 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQT :::. : :. :: ..: : : .:.: :..: XP_005 MITATLNGEPAECLATVPGAAPAPPTWLE-----QLLSGGGVIYAEAEEDGDLQCLCVKT 10 20 30 40 50 50 60 70 80 90 100 pF1KE1 LQGIHPKNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN . ..:..: :..: . :::: ...::::::::: ::. .:. ::::.:.:.: XP_005 TSQVRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 60 70 80 90 100 110 >>NP_002611 (OMIM: 173461) platelet factor 4 variant pre (104 aa) initn: 182 init1: 141 opt: 244 Z-score: 333.8 bits: 66.8 E(85289): 7.4e-12 Smith-Waterman score: 244; 43.1% identity (71.6% similar) in 102 aa overlap (2-103:5-104) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNI ::. :. : . ..: .::::: .:.: :: : .:.: :..: . ..:..: NP_002 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI 10 20 30 40 50 60 70 80 90 100 pF1KE1 QSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN :..: . :::: ...::::::::: ::. . . ::::.. :.: NP_002 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES 60 70 80 90 100 >>NP_000575 (OMIM: 146930) interleukin-8 precursor [Homo (99 aa) initn: 236 init1: 172 opt: 238 Z-score: 326.2 bits: 65.3 E(85289): 2e-11 Smith-Waterman score: 238; 43.8% identity (70.8% similar) in 89 aa overlap (16-101:5-93) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAG--RRAAGASVATELRCQCLQTL-QGIHPKNI : :::: .:..:. . :. : ::::::..: . .::: : NP_000 MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPKFI 10 20 30 40 60 70 80 90 100 pF1KE1 QSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN . . : :::::.::.:. :..::. ::.: :....::.: NP_000 KELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS 50 60 70 80 90 107 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:45:44 2016 done: Mon Nov 7 01:45:44 2016 Total Scan time: 4.790 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]