FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1321, 107 aa 1>>>pF1KE1321 107 - 107 aa - 107 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.4032+/-0.000564; mu= 14.4858+/- 0.034 mean_var=55.4975+/-11.044, 0's: 0 Z-trim(112.0): 33 B-trim: 105 in 1/48 Lambda= 0.172162 statistics sampled from 12811 (12844) to 12811 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.775), E-opt: 0.2 (0.395), width: 16 Scan time: 1.660 The best scores are: opt bits E(32554) CCDS34007.1 CXCL3 gene_id:2921|Hs108|chr4 ( 107) 679 175.6 5.1e-45 CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 ( 107) 610 158.5 7.4e-40 CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 ( 107) 602 156.5 2.9e-39 CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 ( 114) 297 80.7 2e-16 CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 ( 114) 269 73.8 2.4e-14 CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 ( 128) 264 72.6 6.3e-14 CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 ( 101) 233 64.8 1.1e-11 CCDS34005.1 CXCL8 gene_id:3576|Hs108|chr4 ( 99) 225 62.8 4.3e-11 CCDS34014.1 CXCL9 gene_id:4283|Hs108|chr4 ( 125) 222 62.1 8.6e-11 >>CCDS34007.1 CXCL3 gene_id:2921|Hs108|chr4 (107 aa) initn: 679 init1: 679 opt: 679 Z-score: 921.6 bits: 175.6 E(32554): 5.1e-45 Smith-Waterman score: 679; 100.0% identity (100.0% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN ::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN 70 80 90 100 >>CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 (107 aa) initn: 696 init1: 610 opt: 610 Z-score: 829.0 bits: 158.5 E(32554): 7.4e-40 Smith-Waterman score: 610; 87.9% identity (99.1% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV ::.:::::::::::::::::::::::::::::::: ..:::::::::::::::::::::: CCDS34 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN .:.::::::::::::::::::.:::::::::::.:::::.:..:..: CCDS34 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN 70 80 90 100 >>CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 (107 aa) initn: 658 init1: 602 opt: 602 Z-score: 818.3 bits: 156.5 E(32554): 2.9e-39 Smith-Waterman score: 602; 86.9% identity (98.1% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV ::.:.:::::::::::::::::::::::.::::::::.::::::::::::::: :::::: CCDS47 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN ::.::::::::::::::::::.:::::::::.:.:::::.::. ..: CCDS47 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN 70 80 90 100 >>CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 (114 aa) initn: 315 init1: 283 opt: 297 Z-score: 408.5 bits: 80.7 E(32554): 2e-16 Smith-Waterman score: 297; 46.9% identity (76.5% similar) in 98 aa overlap (10-105:14-111) 10 20 30 40 50 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAG--ASVVTELRCQCLQTLQGIHL ::. ..::::: . .:: :.:. :::: :::: ::.: CCDS34 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP 10 20 30 40 50 60 60 70 80 90 100 pF1KE1 KNIQSVNVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN : :....: . ::.:...::.:.:::::. ::.: .:...:.:.:::. :. CCDS34 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN 70 80 90 100 110 >>CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 (114 aa) initn: 288 init1: 245 opt: 269 Z-score: 370.9 bits: 73.8 E(32554): 2.4e-14 Smith-Waterman score: 269; 42.9% identity (76.5% similar) in 98 aa overlap (10-105:14-111) 10 20 30 40 50 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAG--ASVVTELRCQCLQTLQGIHL ::. .:::::: . .:: ..:.::::: ::.. .. CCDS35 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP 10 20 30 40 50 60 60 70 80 90 100 pF1KE1 KNIQSVNVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN :.: ...: ::.:...::.:.:::::..::.: .:...:.:.:::..:. CCDS35 KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN 70 80 90 100 110 >>CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 (128 aa) initn: 336 init1: 261 opt: 264 Z-score: 363.5 bits: 72.6 E(32554): 6.3e-14 Smith-Waterman score: 264; 48.1% identity (81.8% similar) in 77 aa overlap (25-101:45-121) 10 20 30 40 50 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHL :. ..... ... .::::.:..: .::: CCDS35 RPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHP 20 30 40 50 60 70 60 70 80 90 100 pF1KE1 KNIQSVNVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN :::::..: . : :: :.:::::::.:.: ::.: .: ..::..: : CCDS35 KNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD 80 90 100 110 120 >>CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 (101 aa) initn: 176 init1: 151 opt: 233 Z-score: 323.3 bits: 64.8 E(32554): 1.1e-11 Smith-Waterman score: 233; 44.0% identity (72.5% similar) in 91 aa overlap (11-101:10-99) 10 20 30 40 50 60 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV : : :: ..:::: ::.: .: : .:.: :..: . .. ..: :. CCDS35 MSSAAGFCASRPGLLFLGLLLLPLVVAFA-SAEAEEDGDLQCLCVKTTSQVRPRHITSL 10 20 30 40 50 70 80 90 100 pF1KE1 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN .: . :::: ...:::::::.: ::. .:. .:::.:.: CCDS35 EVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 60 70 80 90 100 >>CCDS34005.1 CXCL8 gene_id:3576|Hs108|chr4 (99 aa) initn: 215 init1: 150 opt: 225 Z-score: 312.7 bits: 62.8 E(32554): 4.3e-11 Smith-Waterman score: 225; 40.7% identity (70.3% similar) in 91 aa overlap (16-103:5-95) 10 20 30 40 50 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAAS--RRAAGASVVTELRCQCLQTL-QGIHLKNI : :::: .:..:. . :. . ::::::..: . .: : : CCDS34 MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPKFI 10 20 30 40 60 70 80 90 100 pF1KE1 QSVNVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN . . : :::::.::.:. :..:.. ::.: ::...::.:.. CCDS34 KELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS 50 60 70 80 90 >>CCDS34014.1 CXCL9 gene_id:4283|Hs108|chr4 (125 aa) initn: 193 init1: 143 opt: 222 Z-score: 307.3 bits: 62.1 E(32554): 8.6e-11 Smith-Waterman score: 222; 40.7% identity (77.9% similar) in 86 aa overlap (15-99:7-88) 10 20 30 40 50 pF1KE1 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQG-IHLKNIQS :. ....::.:.... :. :: . ::.:..: :: :::..... CCDS34 MKKSGVLFLLGIILLVLIGVQ----GTPVVRKGRCSCISTNQGTIHLQSLKD 10 20 30 40 60 70 80 90 100 pF1KE1 VNVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN .. .:.: : . :.::::::: ..:::: : :...:.: CCDS34 LKQFAPSPSCEKIEIIATLKNGVQTCLNPDSADVKELIKKWEKQVSQKKKQKNGKKHQKK 50 60 70 80 90 100 CCDS34 KVLKVRKSQRSRQKKTT 110 120 107 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 22:48:51 2016 done: Sun Nov 6 22:48:51 2016 Total Scan time: 1.660 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]