FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1604, 107 aa 1>>>pF1KE1604 107 - 107 aa - 107 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.2818+/-0.000567; mu= 15.7069+/- 0.034 mean_var=59.1853+/-11.758, 0's: 0 Z-trim(112.8): 28 B-trim: 168 in 1/49 Lambda= 0.166712 statistics sampled from 13446 (13474) to 13446 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.414), width: 16 Scan time: 1.610 The best scores are: opt bits E(32554) CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 ( 107) 682 171.0 1.2e-43 CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 ( 107) 610 153.7 2e-38 CCDS34007.1 CXCL3 gene_id:2921|Hs108|chr4 ( 107) 610 153.7 2e-38 CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 ( 114) 293 77.5 1.9e-15 CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 ( 128) 275 73.2 4.1e-14 CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 ( 114) 263 70.3 2.8e-13 CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 ( 101) 242 65.2 8.4e-12 CCDS34005.1 CXCL8 gene_id:3576|Hs108|chr4 ( 99) 235 63.5 2.7e-11 CCDS34014.1 CXCL9 gene_id:4283|Hs108|chr4 ( 125) 235 63.6 3.2e-11 >>CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 (107 aa) initn: 682 init1: 682 opt: 682 Z-score: 897.0 bits: 171.0 E(32554): 1.2e-43 Smith-Waterman score: 682; 100.0% identity (100.0% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN ::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN 70 80 90 100 >>CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 (107 aa) initn: 610 init1: 610 opt: 610 Z-score: 803.4 bits: 153.7 E(32554): 2e-38 Smith-Waterman score: 610; 89.7% identity (97.2% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV ::::.:::::::::::::::::::::::.:::::: .:::::::::::::::: :::::: CCDS47 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN .::::::::::::::::::::.:::::::::.:::::::::.. ::: CCDS47 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN 70 80 90 100 >>CCDS34007.1 CXCL3 gene_id:2921|Hs108|chr4 (107 aa) initn: 696 init1: 610 opt: 610 Z-score: 803.4 bits: 153.7 E(32554): 2e-38 Smith-Waterman score: 610; 87.9% identity (99.1% similar) in 107 aa overlap (1-107:1-107) 10 20 30 40 50 60 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV ::.:::::::::::::::::::::::::::::::: ..:::::::::::::::::::::: CCDS34 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV 10 20 30 40 50 60 70 80 90 100 pF1KE1 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN .:.::::::::::::::::::.:::::::::::.:::::.:..:..: CCDS34 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN 70 80 90 100 >>CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 (114 aa) initn: 323 init1: 270 opt: 293 Z-score: 391.0 bits: 77.5 E(32554): 1.9e-15 Smith-Waterman score: 293; 44.3% identity (76.4% similar) in 106 aa overlap (2-104:6-110) 10 20 30 40 50 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLAT---ELRCQCLQTLQGIH .::. .::. : . .:::::. . :...: :. :::: :::: ::.: CCDS34 MSLLSSRAARVPGPSSS-LCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVH 10 20 30 40 50 60 70 80 90 100 pF1KE1 LKNIQSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN : :....: . ::.:...::.:.::::.. ::.: .:..::.:.:.: .: CCDS34 PKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN 60 70 80 90 100 110 >>CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 (128 aa) initn: 325 init1: 270 opt: 275 Z-score: 366.9 bits: 73.2 E(32554): 4.1e-14 Smith-Waterman score: 275; 48.8% identity (79.3% similar) in 82 aa overlap (25-106:45-126) 10 20 30 40 50 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHL :. ..... . : .::::.:..: .::: CCDS35 RPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHP 20 30 40 50 60 70 60 70 80 90 100 pF1KE1 KNIQSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN :::::..: . : :: :.:::::::.:.: ::.: .: .:::..: : . .: CCDS35 KNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD 80 90 100 110 120 >>CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 (114 aa) initn: 277 init1: 232 opt: 263 Z-score: 352.0 bits: 70.3 E(32554): 2.8e-13 Smith-Waterman score: 263; 40.4% identity (72.5% similar) in 109 aa overlap (2-107:6-114) 10 20 30 40 50 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAG--APLATELRCQCLQTLQGIHL .::. .::. .:::::: . .:: . . ::::: ::.. .. CCDS35 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP 10 20 30 40 50 60 60 70 80 90 100 pF1KE1 KNIQSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNG-KSN :.: ...: ::.:...::.:.::::...::.: .:..::.:.:.: .: :.: CCDS35 KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN 70 80 90 100 110 >>CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 (101 aa) initn: 192 init1: 156 opt: 242 Z-score: 325.3 bits: 65.2 E(32554): 8.4e-12 Smith-Waterman score: 242; 44.1% identity (72.0% similar) in 93 aa overlap (11-103:10-101) 10 20 30 40 50 60 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV : : :: ..:::: ::.: : : .:.: :..: . .. ..: :. CCDS35 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAE-AEEDGDLQCLCVKTTSQVRPRHITSL 10 20 30 40 50 70 80 90 100 pF1KE1 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN .: . :::: ...:::::::.: ::. .:. ::::.:.:.. CCDS35 EVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 60 70 80 90 100 >>CCDS34005.1 CXCL8 gene_id:3576|Hs108|chr4 (99 aa) initn: 213 init1: 148 opt: 235 Z-score: 316.4 bits: 63.5 E(32554): 2.7e-11 Smith-Waterman score: 235; 41.7% identity (70.8% similar) in 96 aa overlap (16-107:5-99) 10 20 30 40 50 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPL---ATELRCQCLQTL-QGIHLKN : :::: .:..:. :: : : ::::::..: . .: : CCDS34 MTSKLAVALLAAFLISAAL-CEGAVLPRSAKELRCQCIKTYSKPFHPKF 10 20 30 40 60 70 80 90 100 pF1KE1 IQSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN :. ..: :::::.::.:. :..:.. ::.: :....::.:: .... CCDS34 IKELRVIESGPHCANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKRAENS 50 60 70 80 90 >>CCDS34014.1 CXCL9 gene_id:4283|Hs108|chr4 (125 aa) initn: 247 init1: 171 opt: 235 Z-score: 315.1 bits: 63.6 E(32554): 3.2e-11 Smith-Waterman score: 235; 41.6% identity (77.5% similar) in 89 aa overlap (15-102:7-91) 10 20 30 40 50 pF1KE1 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQG-IHLKNIQS :. ....::.:.... :.:.. . ::.:..: :: :::..... CCDS34 MKKSGVLFLLGIILLVLIGVQ----GTPVVRKGRCSCISTNQGTIHLQSLKD 10 20 30 40 60 70 80 90 100 pF1KE1 VKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN .: .:.: : . :.::::::: ..:::: : ::..:.: : CCDS34 LKQFAPSPSCEKIEIIATLKNGVQTCLNPDSADVKELIKKWEKQVSQKKKQKNGKKHQKK 50 60 70 80 90 100 CCDS34 KVLKVRKSQRSRQKKTT 110 120 107 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 11:56:50 2016 done: Sun Nov 6 11:56:51 2016 Total Scan time: 1.610 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]