FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4044, 267 aa
1>>>pF1KE4044 267 - 267 aa - 267 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4978+/-0.000605; mu= 14.9249+/- 0.037
mean_var=75.1570+/-15.140, 0's: 0 Z-trim(113.4): 9 B-trim: 224 in 2/49
Lambda= 0.147941
statistics sampled from 13994 (14000) to 13994 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.779), E-opt: 0.2 (0.43), width: 16
Scan time: 2.730
The best scores are:                                 opt bits E(32554)
CCDS8567.1 SPSB2 gene_id:84727|Hs108|chr12   ( 263) 1550 339.2 1.7e-93
CCDS3115.1 SPSB4 gene_id:92369|Hs108|chr3    ( 273)  783 175.6 3.4e-44
CCDS102.1 SPSB1 gene_id:80176|Hs108|chr1     ( 273)  754 169.4 2.5e-42
CCDS46985.1 FBXO45 gene_id:200933|Hs108|chr3 ( 286)  540 123.7 1.4e-28
>>CCDS8567.1 SPSB2 gene_id:84727|Hs108|chr12 (263 aa)
initn: 1543 init1: 1543 opt: 1550 Z-score: 1791.9 bits: 339.2 E(32554): 1.7e-93
Smith-Waterman score: 1550; 88.8% identity (91.9% similar) in 260 aa overlap (1-260:1-253)
               10        20        30        40        50        60
pF1KE4 MGQTALAGGSSSTPTPQALYPDLSCPEGLEELLSAPPPDLGAQRRHGWNPKDCSENIEVK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 MGQTALAGGSSSTPTPQALYPDLSCPEGLEELLSAPPPDLGAQRRHGWNPKDCSENIEVK
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE4 EGGLYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGTHAVVGVATALAPLQTDHY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 EGGLYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGTHAVVGVATALAPLQTDHY
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE4 AALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPAGTQGEQLEVPERLLVVLDMEEGTLGY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 AALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPAGTQGEQLEVPERLLVVLDMEEGTLGY
              130       140       150       160       170       180
190 200 210 220 230 240
pF1KE4 AIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLGERRGEAWGRRGENFLSLVAVVW
:::::::::::::::::::::::::::::::::::::::::.: ...: : .
CCDS85 AIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLGERRAEP-----HSLLHLSRLCV
190 200 210 220 230
250 260
pF1KE4 DGNSSDKSRGDGPSSSLPQPLFSAGKG
: .: :. :.:: :
CCDS85 RHNLGDTRLGQ--VSALPLPPAMKRYLLYQ
240 250 260
>>CCDS3115.1 SPSB4 gene_id:92369|Hs108|chr3 (273 aa)
initn: 774 init1: 431 opt: 783 Z-score: 907.0 bits: 175.6 E(32554): 3.4e-44
Smith-Waterman score: 783; 59.0% identity (81.0% similar) in 195 aa overlap (26-217:34-227)
10 20 30 40 50
pF1KE4 MGQTALAGGSSSTPTPQALYPDLSCPEGLEELLSAPPPDLGAQRRHGWNPKDCSE
: :..::. : :..: ::.:::.: :
CCDS31 KLSGSLKSVEVREPALRPAKRELRGAEPGRPARLDQLLDMPAAGLAVQLRHAWNPEDRSL
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE4 NIEVKEGG-LYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGTHAVVGVATALAP
:. ::. : :.:.:::::::: ::: :..::::::.:.:: .::::::::::::: ::
CCDS31 NVFVKDDDRLTFHRHPVAQSTDGIRGKVGHARGLHAWQINWPARQRGTHAVVGVATARAP
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE4 LQTDHYAALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPA--GTQGEQLEVPERLLVVLD
:.. :.::.::..::::::.::..:::..:. . ::: : . : . .:. ::::::
CCDS31 LHSVGYTALVGSDAESWGWDLGRSRLYHDGKNQPGVAYPAFLGPD-EAFALPDSLLVVLD
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE4 MEEGTLGYAIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLGERRGEAWGRRGENF
:.::::.. . : ::: :::::::. :::.::::::.:.: .::.
CCDS31 MDEGTLSFIVDGQYLGVAFRGLKGKKLYPVVSAVWGHCEVTMRYINGLDPEPLPLMDLCR
190 200 210 220 230 240
240 250 260
pF1KE4 LSLVAVVWDGNSSDKSRGDGPSSSLPQPLFSAGKG
CCDS31 RSIRSALGRQRLQDISSLPLPQSLKNYLQYQ
250 260 270
>>CCDS102.1 SPSB1 gene_id:80176|Hs108|chr1 (273 aa)
initn: 749 init1: 417 opt: 754 Z-score: 873.5 bits: 169.4 E(32554): 2.5e-42
Smith-Waterman score: 759; 51.5% identity (74.9% similar) in 227 aa overlap (1-217:1-227)
10 20 30 40 50
pF1KE4 MGQTALAGGSS---STPTPQALYPDLS----C-PEGLEELLSAPPPDLGAQRRHGWNPKD
::: . .: .. :: . : .:. : : :. ::. :: . .: :.:: .:
CCDS10 MGQKVTGGIKTVDMRDPTYRPLKQELQGLDYCKPTRLDLLLDMPPVSYDVQLLHSWNNND
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE4 CSENIEVKEGG-LYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGTHAVVGVATA
: :. ::: : :.:.:::::::. ::: ::.::::.:.:.: ..:::::::::::::
CCDS10 RSLNVFVKEDDKLIFHRHPVAQSTDAIRGKVGYTRGLHVWQITWAMRQRGTHAVVGVATA
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE4 LAPLQTDHYAALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPAGTQ-GEQLEVPERLLVV
:::.. :..:.:.: ::::::.::..:::..:. . ::: . : . ::. .::.
CCDS10 DAPLHSVGYTTLVGNNHESWGWDLGRNRLYHDGKNQPSKTYPAFLEPDETFIVPDSFLVA
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE4 LDMEEGTLGYAIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLGERRGEAWGRRGE
:::..:::.. . : :.: :::::::. :::.::::::.:..:.:::
CCDS10 LDMDDGTLSFIVDGQYMGVAFRGLKGKKLYPVVSAVWGHCEIRMRYLNGLDPEPLPLMDL
190 200 210 220 230 240
240 250 260
pF1KE4 NFLSLVAVVWDGNSSDKSRGDGPSSSLPQPLFSAGKG
CCDS10 CRRSVRLALGRERLGEIHTLPLPASLKAYLLYQ
250 260 270
>>CCDS46985.1 FBXO45 gene_id:200933|Hs108|chr3 (286 aa)
initn: 515 init1: 209 opt: 540 Z-score: 626.4 bits: 123.7 E(32554): 1.4e-28
Smith-Waterman score: 540; 46.3% identity (75.1% similar) in 177 aa overlap (45-219:111-282)
20 30 40 50 60 70
pF1KE4 TPQALYPDLSCPEGLEELLSAPPPDLGAQRRHGWNPKDCSENIEVKEGGLYFERRPVAQS
.:... .:::.:. .:..:. ..: :.:::
CCDS46 SLCARSLAEEALRTDILCNLPSYKAKIRAFQHAFSTNDCSRNVYIKKNGFTLHRNPIAQS
90 100 110 120 130 140
80 90 100 110 120 130
pF1KE4 TDGARGKRGYSRGLHAWEISW--PLEQRGTHAVVGVATALAPLQTDHYAALLGSNSESWG
::::: : :.:.: ::::. : :: :: ::.:.:: ::.: . :.:::::...:::
CCDS46 TDGARTKIGFSEGRHAWEVWWEGPL---GTVAVIGIATKRAPMQCQGYVALLGSDDQSWG
150 160 170 180 190
140 150 160 170 180 190
pF1KE4 WDIGRGKLYHQSKGPGAPQYPAGTQGEQLEVPERLLVVLDMEEGTLGYAIGGTYLGPAFR
:.. ..: :... :. .: ... . .. ::. :.::::. ::.. : .:: :::
CCDS46 WNLVDNNLLHNGEVNGS--FPQCNNAPKYQIGERIRVILDMEDKTLAFERGYEFLGVAFR
200 210 220 230 240 250
200 210 220 230 240 250
pF1KE4 GLKGRTLYPAVSAVWGQCQVRIRYLGERRGEAWGRRGENFLSLVAVVWDGNSSDKSRGDG
:: ::::::::.:. .: . :::.
CCDS46 GLPKVCLYPAVSAVYGNTEVTLVYLGKPLDG
260 270 280
267 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 04:17:14 2016 done: Sun Nov 6 04:17:15 2016
Total Scan time: 2.730 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]