FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1416, 107 aa 1>>>pF1KE1416 107 - 107 aa - 107 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.3115+/-0.000569; mu= 15.5248+/- 0.034 mean_var=58.5598+/-11.476, 0's: 0 Z-trim(113.0): 30 B-trim: 109 in 1/49 Lambda= 0.167600 statistics sampled from 13619 (13650) to 13619 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.791), E-opt: 0.2 (0.419), width: 16 Scan time: 1.690 The best scores are: opt bits E(32554) CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 ( 107) 683 172.1 5.8e-44 CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 ( 107) 610 154.4 1.2e-38 CCDS34007.1 CXCL3 gene_id:2921|Hs108|chr4 ( 107) 602 152.5 4.6e-38 CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 ( 114) 313 82.7 5.2e-17 CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 ( 128) 299 79.3 5.9e-16 CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 ( 114) 287 76.4 4.1e-15 CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 ( 101) 265 71.0 1.5e-13 CCDS3561.1 PF4V1 gene_id:5197|Hs108|chr4 ( 104) 244 65.9 5.1e-12 CCDS34005.1 CXCL8 gene_id:3576|Hs108|chr4 ( 99) 238 64.5 1.4e-11 >>CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 (107 aa) initn: 683 init1: 683 opt: 683 Z-score: 902.7 bits: 172.1 E(32554): 5.8e-44 Smith-Waterman score: 683; 100.0% identity (100.0% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN ::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN 70 80 90 100 >>CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 (107 aa) initn: 610 init1: 610 opt: 610 Z-score: 807.3 bits: 154.4 E(32554): 1.2e-38 Smith-Waterman score: 610; 89.7% identity (97.2% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV ::::.:::::::::::::::::::::::.:::::: .:::::::::::::::: :::::: CCDS34 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN .::::::::::::::::::::.:::::::::.:::::::::.. ::: CCDS34 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN 70 80 90 100 >>CCDS34007.1 CXCL3 gene_id:2921|Hs108|chr4 (107 aa) initn: 658 init1: 602 opt: 602 Z-score: 796.8 bits: 152.5 E(32554): 4.6e-38 Smith-Waterman score: 602; 86.9% identity (98.1% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV ::.:.:::::::::::::::::::::::.::::::::.::::::::::::::: :::::: CCDS34 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN ::.::::::::::::::::::.:::::::::.:.:::::.::. ..: CCDS34 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN 70 80 90 100 >>CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 (114 aa) initn: 354 init1: 283 opt: 313 Z-score: 418.8 bits: 82.7 E(32554): 5.2e-17 Smith-Waterman score: 313; 48.2% identity (76.4% similar) in 110 aa overlap (2-107:6-114) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRA-AG--ASVATELRCQCLQTLQGIH .::: .::. : . .:::::. : : :: :.: :::: :::: ::.: CCDS34 MSLLSSRAARVPGPSSS-LCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVH 10 20 30 40 50 60 70 80 90 100 pF1KE1 PKNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNS-DKSN :: :....: . ::.:...::.:.::::.. ::.: .:..::.:.:.:.. .: : CCDS34 PKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN 60 70 80 90 100 110 >>CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 (128 aa) initn: 362 init1: 296 opt: 299 Z-score: 399.9 bits: 79.3 E(32554): 5.9e-16 Smith-Waterman score: 299; 52.4% identity (81.7% similar) in 82 aa overlap (26-106:45-126) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAA-GASVATELRCQCLQTLQGIHP .: :.. . ... .::::.:..: .:::: CCDS35 RPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHP 20 30 40 50 60 70 60 70 80 90 100 pF1KE1 KNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN :::::..: . : :: :.:::::::.::: ::.: .: .:::..: : .:.: CCDS35 KNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD 80 90 100 110 120 >>CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 (114 aa) initn: 296 init1: 249 opt: 287 Z-score: 384.8 bits: 76.4 E(32554): 4.1e-15 Smith-Waterman score: 287; 43.1% identity (75.2% similar) in 109 aa overlap (2-107:6-114) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAG--ASVATELRCQCLQTLQGIHP .::: .::. .:::::: . .:: ..: ::::: ::.. ..: CCDS35 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP 10 20 30 40 50 60 60 70 80 90 100 pF1KE1 KNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNS-DKSN :.: ...: ::.:...::.:.::::...::.: .:..::.:.:.:.: .:.: CCDS35 KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN 70 80 90 100 110 >>CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 (101 aa) initn: 189 init1: 164 opt: 265 Z-score: 356.8 bits: 71.0 E(32554): 1.5e-13 Smith-Waterman score: 265; 46.6% identity (71.8% similar) in 103 aa overlap (1-103:1-101) 10 20 30 40 50 60 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV :. :: : : : :: ..:::: ::.: .: : .:.: :..: . ..:..: :. CCDS35 MSSAAGFCA-SRPGLLFLGLLLLPLVVA-FASAEAEEDGDLQCLCVKTTSQVRPRHITSL 10 20 30 40 50 70 80 90 100 pF1KE1 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN .: . :::: ...::::::::: ::. .:. ::::.:.:.: CCDS35 EVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 60 70 80 90 100 >>CCDS3561.1 PF4V1 gene_id:5197|Hs108|chr4 (104 aa) initn: 182 init1: 141 opt: 244 Z-score: 329.2 bits: 65.9 E(32554): 5.1e-12 Smith-Waterman score: 244; 43.1% identity (71.6% similar) in 102 aa overlap (2-103:5-104) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNI ::. :. : . ..: .::::: .:.: :: : .:.: :..: . ..:..: CCDS35 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI 10 20 30 40 50 60 70 80 90 100 pF1KE1 QSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN :..: . :::: ...::::::::: ::. . . ::::.. :.: CCDS35 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES 60 70 80 90 100 >>CCDS34005.1 CXCL8 gene_id:3576|Hs108|chr4 (99 aa) initn: 236 init1: 172 opt: 238 Z-score: 321.6 bits: 64.5 E(32554): 1.4e-11 Smith-Waterman score: 238; 43.8% identity (70.8% similar) in 89 aa overlap (16-101:5-93) 10 20 30 40 50 pF1KE1 MARAALSAAPSNPRLLRVALLLLLLVAAG--RRAAGASVATELRCQCLQTL-QGIHPKNI : :::: .:..:. . :. : ::::::..: . .::: : CCDS34 MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPKFI 10 20 30 40 60 70 80 90 100 pF1KE1 QSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN . . : :::::.::.:. :..::. ::.: :....::.: CCDS34 KELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS 50 60 70 80 90 107 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:45:43 2016 done: Mon Nov 7 01:45:43 2016 Total Scan time: 1.690 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]