FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6745, 101 aa
1>>>pF1KB6745 101 - 101 aa - 101 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.5166+/-0.000521; mu= 13.2274+/- 0.031
mean_var=52.1533+/-10.246, 0's: 0 Z-trim(113.3): 21 B-trim: 92 in 2/50
Lambda= 0.177596
statistics sampled from 13913 (13934) to 13913 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.804), E-opt: 0.2 (0.428), width: 16
Scan time: 1.570
The best scores are: opt bits E(32554)
CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 ( 101) 657 175.0 7e-45
CCDS3561.1 PF4V1 gene_id:5197|Hs108|chr4 ( 104) 531 142.7 3.8e-35
CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 ( 107) 265 74.6 1.3e-14
CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 ( 114) 260 73.3 3.2e-14
CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 ( 128) 255 72.0 8.6e-14
CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 ( 114) 245 69.5 4.6e-13
CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 ( 107) 242 68.7 7.5e-13
CCDS34007.1 CXCL3 gene_id:2921|Hs108|chr4 ( 107) 233 66.4 3.7e-12
>>CCDS3562.1 PF4 gene_id:5196|Hs108|chr4 (101 aa)
initn: 657 init1: 657 opt: 657 Z-score: 919.2 bits: 175.0 E(32554): 7e-45
Smith-Waterman score: 657; 100.0% identity (100.0% similar) in 101 aa overlap (1-101:1-101)
10 20 30 40 50 60
pF1KB6 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQVRPRHITSLEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQVRPRHITSLEV
10 20 30 40 50 60
70 80 90 100
pF1KB6 IKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
:::::::::::::::::::::::::::::::::::::::::
CCDS35 IKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
70 80 90 100
>>CCDS3561.1 PF4V1 gene_id:5197|Hs108|chr4 (104 aa)
initn: 542 init1: 529 opt: 531 Z-score: 744.5 bits: 142.7 E(32554): 3.8e-35
Smith-Waterman score: 531; 84.6% identity (89.4% similar) in 104 aa overlap (1-101:1-104)
10 20 30 40 50
pF1KB6 MSSAAG---FCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQVRPRHITS
::::: :.: .:::.:::::.::::: :::::::::::::::::::::::::::
CCDS35 MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS
10 20 30 40 50 60
60 70 80 90 100
pF1KB6 LEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
::::::::::::::::::::::::::::::: :::::::. :::
CCDS35 LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES
70 80 90 100
>>CCDS47074.1 CXCL1 gene_id:2919|Hs108|chr4 (107 aa)
initn: 189 init1: 164 opt: 265 Z-score: 376.0 bits: 74.6 E(32554): 1.3e-14
Smith-Waterman score: 265; 46.6% identity (71.8% similar) in 103 aa overlap (1-101:1-103)
10 20 30 40 50
pF1KB6 MSSAAGFCA-SRPGLLFLGLLLLPLVVA-FASAEAEEDGDLQCLCVKTTSQVRPRHITSL
:. :: : : : :: ..:::: ::.: .: : .:.: :..: . ..:..: :.
CCDS47 MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV
10 20 30 40 50 60
60 70 80 90 100
pF1KB6 EVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
.: . :::: ...::::::::: ::. .:. ::::.:.:.:
CCDS47 NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSDKSN
70 80 90 100
>>CCDS34006.1 CXCL5 gene_id:6374|Hs108|chr4 (114 aa)
initn: 240 init1: 199 opt: 260 Z-score: 368.7 bits: 73.3 E(32554): 3.2e-14
Smith-Waterman score: 260; 40.2% identity (75.3% similar) in 97 aa overlap (4-100:15-108)
10 20 30 40
pF1KB6 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQ
....:: :..: :: : .: :. : .:.:.:..::.
CCDS34 MSLLSSRAARVPGPSSSLCAL---LVLLLLLTQPGPIASAGPAAAVLRELRCVCLQTTQG
10 20 30 40 50
50 60 70 80 90 100
pF1KB6 VRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
:.:. :..:.:. ::.: ....:.::::..:::: .::. ::.:.:.:.
CCDS34 VHPKMISNLQVFAIGPQCSKVEVVASLKNGKEICLDPEAPFLKKVIQKILDGGNKEN
60 70 80 90 100 110
>>CCDS3563.1 PPBP gene_id:5473|Hs108|chr4 (128 aa)
initn: 245 init1: 212 opt: 255 Z-score: 361.1 bits: 72.0 E(32554): 8.6e-14
Smith-Waterman score: 255; 52.9% identity (80.0% similar) in 70 aa overlap (30-99:52-121)
10 20 30 40 50
pF1KB6 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAEAEEDGDLQCLCVKTTSQVRPRHITSLE
: ... ..:.:.:.:::: ..:..: :::
CCDS35 VLLLLSLLLTALASSTKGQTKRNLAKGKEESLDSDLYAELRCMCIKTTSGIHPKNIQSLE
30 40 50 60 70 80
60 70 80 90 100
pF1KB6 VIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
:: : :: ...:::::.::::::: .:: :::..: :
CCDS35 VIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKKLAGDESAD
90 100 110 120
>>CCDS3560.1 CXCL6 gene_id:6372|Hs108|chr4 (114 aa)
initn: 238 init1: 166 opt: 245 Z-score: 347.9 bits: 69.5 E(32554): 4.6e-13
Smith-Waterman score: 245; 40.4% identity (72.7% similar) in 99 aa overlap (4-101:15-109)
10 20 30 40
pF1KB6 MSSAAGFCASRPGLLFLGLLLLPL-VVAFASAEAEEDGDLQCLCVKTTS
....:: :: : ::: : .: :. . .:.: :...:
CCDS35 MSLPSSRAARVPGPSGSLCA----LLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTL
10 20 30 40 50
50 60 70 80 90 100
pF1KB6 QVRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
.: :. : .:.:. :::.: ....:.::::...::: .::. ::.:.:.:.:
CCDS35 RVNPKTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSGNKKN
60 70 80 90 100 110
>>CCDS34008.1 CXCL2 gene_id:2920|Hs108|chr4 (107 aa)
initn: 181 init1: 156 opt: 242 Z-score: 344.2 bits: 68.7 E(32554): 7.5e-13
Smith-Waterman score: 242; 44.1% identity (72.0% similar) in 93 aa overlap (10-101:11-103)
10 20 30 40 50
pF1KB6 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAE-AEEDGDLQCLCVKTTSQVRPRHITSL
: : :: ..:::: ::.: : : .:.: :..: . .. ..: :.
CCDS34 MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQCLQTLQGIHLKNIQSV
10 20 30 40 50 60
60 70 80 90 100
pF1KB6 EVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
.: . :::: ...:::::::.: ::. .:. ::::.:.:..
CCDS34 KVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGKSN
70 80 90 100
>>CCDS34007.1 CXCL3 gene_id:2921|Hs108|chr4 (107 aa)
initn: 176 init1: 151 opt: 233 Z-score: 331.7 bits: 66.4 E(32554): 3.7e-12
Smith-Waterman score: 233; 44.0% identity (71.4% similar) in 91 aa overlap (10-99:11-101)
10 20 30 40 50
pF1KB6 MSSAAGFCASRPGLLFLGLLLLPLVVAFASAE-AEEDGDLQCLCVKTTSQVRPRHITSL
: : :: ..:::: ::.: : : .:.: :..: . .. ..: :.
CCDS34 MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV
10 20 30 40 50 60
60 70 80 90 100
pF1KB6 EVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKKIIKKLLES
.: . :::: ...:::::::.: ::. .:. .:::.:.:
CCDS34 NVRSPGPHCAQTEVIATLKNGKKACLNPASPMVQKIIEKILNKGSTN
70 80 90 100
101 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 18:17:02 2016 done: Sat Nov 5 18:17:02 2016
Total Scan time: 1.570 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]