FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3014, 104 aa 1>>>pF1KE3014 104 - 104 aa - 104 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9302+/-0.000581; mu= 10.6804+/- 0.035 mean_var=49.6552+/- 9.821, 0's: 0 Z-trim(110.5): 19 B-trim: 202 in 2/49 Lambda= 0.182009 statistics sampled from 11672 (11688) to 11672 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.764), E-opt: 0.2 (0.359), width: 16 Scan time: 1.630 The best scores are: opt bits E(32554) CCDS3561.1 PF4V1 gene_id:5197|Hs108|chr4 ( 104) 660 180.2 2e-46 CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 ( 101) 531 146.3 3.1e-36 CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 ( 107) 244 71.0 1.6e-13 CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 ( 128) 237 69.2 6.6e-13 CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 ( 114) 233 68.1 1.2e-12 CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 ( 107) 225 66.0 5e-12 CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 ( 114) 221 64.9 1.1e-11 >>CCDS3561.1 PF4V1 gene_id:5197|Hs108|chr4 (104 aa) initn: 660 init1: 660 opt: 660 Z-score: 947.0 bits: 180.2 E(32554): 2e-46 Smith-Waterman score: 660; 100.0% identity (100.0% similar) in 104 aa overlap (1-104:1-104) 10 20 30 40 50 60 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS 10 20 30 40 50 60 70 80 90 100 pF1KE3 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :::::::::::::::::::::::::::::::::::::::::::: CCDS35 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES 70 80 90 100 >>CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 (101 aa) initn: 542 init1: 529 opt: 531 Z-score: 764.1 bits: 146.3 E(32554): 3.1e-36 Smith-Waterman score: 531; 84.6% identity (89.4% similar) in 104 aa overlap (1-104:1-101) 10 20 30 40 50 60 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS ::::: :.: .:::.:::::.::::: ::::::::::::::::::::::::::: CCDS35 MSSAAG---FCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQVRPRHITS 10 20 30 40 50 70 80 90 100 pF1KE3 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES ::::::::::::::::::::::::::::::: :::::::. ::: CCDS35 LEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES 60 70 80 90 100 >>CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 (107 aa) initn: 182 init1: 141 opt: 244 Z-score: 356.4 bits: 71.0 E(32554): 1.6e-13 Smith-Waterman score: 244; 43.1% identity (71.6% similar) in 102 aa overlap (5-104:2-103) 10 20 30 40 50 pF1KE3 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI ::. :. : . ..: .::::: .:.: :: : .:.: :..: . ..:..: CCDS47 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNI 10 20 30 40 50 60 70 80 90 100 pF1KE3 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :..: . :::: ...::::::::: ::. . . ::::.. :.: CCDS47 QSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN 60 70 80 90 100 >>CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 (128 aa) initn: 217 init1: 196 opt: 237 Z-score: 345.2 bits: 69.2 E(32554): 6.6e-13 Smith-Waterman score: 241; 43.6% identity (70.3% similar) in 101 aa overlap (15-102:21-121) 10 20 30 40 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVA--------FARAEAEE-DGDL--- : .:.:.::: .. . .:... : :.:: CCDS35 MSLRLDTTPSCNSARPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAE 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 -QCLCVKTTSQVRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEH .:.:.:::: ..:..: ::::: : :: ...:::::.::::::: .: :::.... CCDS35 LRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKK 70 80 90 100 110 120 pF1KE3 LES : CCDS35 LAGDESAD >>CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 (114 aa) initn: 206 init1: 174 opt: 233 Z-score: 340.3 bits: 68.1 E(32554): 1.2e-12 Smith-Waterman score: 233; 38.8% identity (68.0% similar) in 103 aa overlap (2-103:6-108) 10 20 30 40 50 pF1KE3 MSSAARSRLTRATRQEMLFLALLLL-PVVVAFARAEAEEDGDLQCLCVKTTSQVRP : ::: .. .: : ::: : .: : : .:.:.:..::. :.: CCDS34 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 RHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES . :..:.:. ::.: ....:.::::..:::: .: . ::.:.. :. CCDS34 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN 70 80 90 100 110 >>CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 (107 aa) initn: 167 init1: 133 opt: 225 Z-score: 329.4 bits: 66.0 E(32554): 5e-12 Smith-Waterman score: 225; 40.2% identity (70.6% similar) in 102 aa overlap (5-104:2-103) 10 20 30 40 50 pF1KE3 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI ::. :. : . ..: .::::: .:.: :: : .:.: :..: . .. ..: CCDS34 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNI 10 20 30 40 50 60 70 80 90 100 pF1KE3 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES :..: . :::: ...:::::::.: ::. . . ::::.. :.. CCDS34 QSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN 60 70 80 90 100 >>CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 (114 aa) initn: 199 init1: 141 opt: 221 Z-score: 323.3 bits: 64.9 E(32554): 1.1e-11 Smith-Waterman score: 221; 37.5% identity (66.3% similar) in 104 aa overlap (2-104:6-109) 10 20 30 40 50 pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPV-VVAFARAEAEEDGDLQCLCVKTTSQVRP : ::: .. .: : ::: : .: : . .:.: :...: .: : CCDS35 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP 10 20 30 40 50 60 60 70 80 90 100 pF1KE3 RHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES . : .:.:. :::.: ....:.::::...::: .: . ::.:.. :.: CCDS35 KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN 70 80 90 100 110 104 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:28:28 2016 done: Sun Nov 6 04:28:29 2016 Total Scan time: 1.630 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]