FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3014, 104 aa
1>>>pF1KE3014 104 - 104 aa - 104 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.9302+/-0.000581; mu= 10.6804+/- 0.035
mean_var=49.6552+/- 9.821, 0's: 0 Z-trim(110.5): 19 B-trim: 202 in 2/49
Lambda= 0.182009
statistics sampled from 11672 (11688) to 11672 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.764), E-opt: 0.2 (0.359), width: 16
Scan time: 1.630
The best scores are: opt bits E(32554)
CCDS3561.1 PF4V1 gene_id:5197|Hs108|chr4 ( 104) 660 180.2 2e-46
CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 ( 101) 531 146.3 3.1e-36
CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 ( 107) 244 71.0 1.6e-13
CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 ( 128) 237 69.2 6.6e-13
CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 ( 114) 233 68.1 1.2e-12
CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 ( 107) 225 66.0 5e-12
CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 ( 114) 221 64.9 1.1e-11
>>CCDS3561.1 PF4V1 gene_id:5197|Hs108|chr4 (104 aa)
initn: 660 init1: 660 opt: 660 Z-score: 947.0 bits: 180.2 E(32554): 2e-46
Smith-Waterman score: 660; 100.0% identity (100.0% similar) in 104 aa overlap (1-104:1-104)
10 20 30 40 50 60
pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS
10 20 30 40 50 60
70 80 90 100
pF1KE3 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
::::::::::::::::::::::::::::::::::::::::::::
CCDS35 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
70 80 90 100
>>CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 (101 aa)
initn: 542 init1: 529 opt: 531 Z-score: 764.1 bits: 146.3 E(32554): 3.1e-36
Smith-Waterman score: 531; 84.6% identity (89.4% similar) in 104 aa overlap (1-104:1-101)
10 20 30 40 50 60
pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS
::::: :.: .:::.:::::.::::: :::::::::::::::::::::::::::
CCDS35 MSSAAG---FCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQVRPRHITS
10 20 30 40 50
70 80 90 100
pF1KE3 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
::::::::::::::::::::::::::::::: :::::::. :::
CCDS35 LEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
60 70 80 90 100
>>CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 (107 aa)
initn: 182 init1: 141 opt: 244 Z-score: 356.4 bits: 71.0 E(32554): 1.6e-13
Smith-Waterman score: 244; 43.1% identity (71.6% similar) in 102 aa overlap (5-104:2-103)
10 20 30 40 50
pF1KE3 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI
::. :. : . ..: .::::: .:.: :: : .:.: :..: . ..:..:
CCDS47 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNI
10 20 30 40 50
60 70 80 90 100
pF1KE3 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
:..: . :::: ...::::::::: ::. . . ::::.. :.:
CCDS47 QSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN
60 70 80 90 100
>>CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 (128 aa)
initn: 217 init1: 196 opt: 237 Z-score: 345.2 bits: 69.2 E(32554): 6.6e-13
Smith-Waterman score: 241; 43.6% identity (70.3% similar) in 101 aa overlap (15-102:21-121)
10 20 30 40
pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPVVVA--------FARAEAEE-DGDL---
: .:.:.::: .. . .:... : :.::
CCDS35 MSLRLDTTPSCNSARPLHALQVLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAE
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE3 -QCLCVKTTSQVRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEH
.:.:.:::: ..:..: ::::: : :: ...:::::.::::::: .: :::....
CCDS35 LRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKK
70 80 90 100 110 120
pF1KE3 LES
:
CCDS35 LAGDESAD
>>CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 (114 aa)
initn: 206 init1: 174 opt: 233 Z-score: 340.3 bits: 68.1 E(32554): 1.2e-12
Smith-Waterman score: 233; 38.8% identity (68.0% similar) in 103 aa overlap (2-103:6-108)
10 20 30 40 50
pF1KE3 MSSAARSRLTRATRQEMLFLALLLL-PVVVAFARAEAEEDGDLQCLCVKTTSQVRP
: ::: .. .: : ::: : .: : : .:.:.:..::. :.:
CCDS34 MSLLSSRAARVPGPSSSLCALLVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQGVHP
10 20 30 40 50 60
60 70 80 90 100
pF1KE3 RHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
. :..:.:. ::.: ....:.::::..:::: .: . ::.:.. :.
CCDS34 KMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
70 80 90 100 110
>>CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 (107 aa)
initn: 167 init1: 133 opt: 225 Z-score: 329.4 bits: 66.0 E(32554): 5e-12
Smith-Waterman score: 225; 40.2% identity (70.6% similar) in 102 aa overlap (5-104:2-103)
10 20 30 40 50
pF1KE3 MSSAARSRLTRA-TRQEMLFLALLLLPVVVAFARAE-AEEDGDLQCLCVKTTSQVRPRHI
::. :. : . ..: .::::: .:.: :: : .:.: :..: . .. ..:
CCDS34 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNI
10 20 30 40 50
60 70 80 90 100
pF1KE3 TSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
:..: . :::: ...:::::::.: ::. . . ::::.. :..
CCDS34 QSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN
60 70 80 90 100
>>CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 (114 aa)
initn: 199 init1: 141 opt: 221 Z-score: 323.3 bits: 64.9 E(32554): 1.1e-11
Smith-Waterman score: 221; 37.5% identity (66.3% similar) in 104 aa overlap (2-104:6-109)
10 20 30 40 50
pF1KE3 MSSAARSRLTRATRQEMLFLALLLLPV-VVAFARAEAEEDGDLQCLCVKTTSQVRP
: ::: .. .: : ::: : .: : . .:.: :...: .: :
CCDS35 MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP
10 20 30 40 50 60
60 70 80 90 100
pF1KE3 RHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
. : .:.:. :::.: ....:.::::...::: .: . ::.:.. :.:
CCDS35 KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN
70 80 90 100 110
104 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 04:28:28 2016 done: Sun Nov 6 04:28:29 2016
Total Scan time: 1.630 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]