FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4044, 267 aa 1>>>pF1KE4044 267 - 267 aa - 267 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4978+/-0.000605; mu= 14.9249+/- 0.037 mean_var=75.1570+/-15.140, 0's: 0 Z-trim(113.4): 9 B-trim: 224 in 2/49 Lambda= 0.147941 statistics sampled from 13994 (14000) to 13994 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.779), E-opt: 0.2 (0.43), width: 16 Scan time: 2.730 The best scores are: opt bits E(32554) CCDS8567.1 SPSB2 gene_id:84727|Hs108|chr12 ( 263) 1550 339.2 1.7e-93 CCDS3115.1 SPSB4 gene_id:92369|Hs108|chr3 ( 273) 783 175.6 3.4e-44 CCDS102.1 SPSB1 gene_id:80176|Hs108|chr1 ( 273) 754 169.4 2.5e-42 CCDS46985.1 FBXO45 gene_id:200933|Hs108|chr3 ( 286) 540 123.7 1.4e-28 >>CCDS8567.1 SPSB2 gene_id:84727|Hs108|chr12 (263 aa) initn: 1543 init1: 1543 opt: 1550 Z-score: 1791.9 bits: 339.2 E(32554): 1.7e-93 Smith-Waterman score: 1550; 88.8% identity (91.9% similar) in 260 aa overlap (1-260:1-253) 10 20 30 40 50 60 pF1KE4 MGQTALAGGSSSTPTPQALYPDLSCPEGLEELLSAPPPDLGAQRRHGWNPKDCSENIEVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 MGQTALAGGSSSTPTPQALYPDLSCPEGLEELLSAPPPDLGAQRRHGWNPKDCSENIEVK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 EGGLYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGTHAVVGVATALAPLQTDHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 EGGLYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGTHAVVGVATALAPLQTDHY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 AALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPAGTQGEQLEVPERLLVVLDMEEGTLGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 AALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPAGTQGEQLEVPERLLVVLDMEEGTLGY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 AIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLGERRGEAWGRRGENFLSLVAVVW :::::::::::::::::::::::::::::::::::::::::.: ...: : . CCDS85 AIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLGERRAEP-----HSLLHLSRLCV 190 200 210 220 230 250 260 pF1KE4 DGNSSDKSRGDGPSSSLPQPLFSAGKG : .: :. :.:: : CCDS85 RHNLGDTRLGQ--VSALPLPPAMKRYLLYQ 240 250 260 >>CCDS3115.1 SPSB4 gene_id:92369|Hs108|chr3 (273 aa) initn: 774 init1: 431 opt: 783 Z-score: 907.0 bits: 175.6 E(32554): 3.4e-44 Smith-Waterman score: 783; 59.0% identity (81.0% similar) in 195 aa overlap (26-217:34-227) 10 20 30 40 50 pF1KE4 MGQTALAGGSSSTPTPQALYPDLSCPEGLEELLSAPPPDLGAQRRHGWNPKDCSE : :..::. : :..: ::.:::.: : CCDS31 KLSGSLKSVEVREPALRPAKRELRGAEPGRPARLDQLLDMPAAGLAVQLRHAWNPEDRSL 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 NIEVKEGG-LYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGTHAVVGVATALAP :. ::. : :.:.:::::::: ::: :..::::::.:.:: .::::::::::::: :: CCDS31 NVFVKDDDRLTFHRHPVAQSTDGIRGKVGHARGLHAWQINWPARQRGTHAVVGVATARAP 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 LQTDHYAALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPA--GTQGEQLEVPERLLVVLD :.. :.::.::..::::::.::..:::..:. . ::: : . : . .:. :::::: CCDS31 LHSVGYTALVGSDAESWGWDLGRSRLYHDGKNQPGVAYPAFLGPD-EAFALPDSLLVVLD 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 MEEGTLGYAIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLGERRGEAWGRRGENF :.::::.. . : ::: :::::::. :::.::::::.:.: .::. CCDS31 MDEGTLSFIVDGQYLGVAFRGLKGKKLYPVVSAVWGHCEVTMRYINGLDPEPLPLMDLCR 190 200 210 220 230 240 240 250 260 pF1KE4 LSLVAVVWDGNSSDKSRGDGPSSSLPQPLFSAGKG CCDS31 RSIRSALGRQRLQDISSLPLPQSLKNYLQYQ 250 260 270 >>CCDS102.1 SPSB1 gene_id:80176|Hs108|chr1 (273 aa) initn: 749 init1: 417 opt: 754 Z-score: 873.5 bits: 169.4 E(32554): 2.5e-42 Smith-Waterman score: 759; 51.5% identity (74.9% similar) in 227 aa overlap (1-217:1-227) 10 20 30 40 50 pF1KE4 MGQTALAGGSS---STPTPQALYPDLS----C-PEGLEELLSAPPPDLGAQRRHGWNPKD ::: . .: .. :: . : .:. : : :. ::. :: . .: :.:: .: CCDS10 MGQKVTGGIKTVDMRDPTYRPLKQELQGLDYCKPTRLDLLLDMPPVSYDVQLLHSWNNND 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 CSENIEVKEGG-LYFERRPVAQSTDGARGKRGYSRGLHAWEISWPLEQRGTHAVVGVATA : :. ::: : :.:.:::::::. ::: ::.::::.:.:.: ..::::::::::::: CCDS10 RSLNVFVKEDDKLIFHRHPVAQSTDAIRGKVGYTRGLHVWQITWAMRQRGTHAVVGVATA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 LAPLQTDHYAALLGSNSESWGWDIGRGKLYHQSKGPGAPQYPAGTQ-GEQLEVPERLLVV :::.. :..:.:.: ::::::.::..:::..:. . ::: . : . ::. .::. CCDS10 DAPLHSVGYTTLVGNNHESWGWDLGRNRLYHDGKNQPSKTYPAFLEPDETFIVPDSFLVA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 LDMEEGTLGYAIGGTYLGPAFRGLKGRTLYPAVSAVWGQCQVRIRYLGERRGEAWGRRGE :::..:::.. . : :.: :::::::. :::.::::::.:..:.::: CCDS10 LDMDDGTLSFIVDGQYMGVAFRGLKGKKLYPVVSAVWGHCEIRMRYLNGLDPEPLPLMDL 190 200 210 220 230 240 240 250 260 pF1KE4 NFLSLVAVVWDGNSSDKSRGDGPSSSLPQPLFSAGKG CCDS10 CRRSVRLALGRERLGEIHTLPLPASLKAYLLYQ 250 260 270 >>CCDS46985.1 FBXO45 gene_id:200933|Hs108|chr3 (286 aa) initn: 515 init1: 209 opt: 540 Z-score: 626.4 bits: 123.7 E(32554): 1.4e-28 Smith-Waterman score: 540; 46.3% identity (75.1% similar) in 177 aa overlap (45-219:111-282) 20 30 40 50 60 70 pF1KE4 TPQALYPDLSCPEGLEELLSAPPPDLGAQRRHGWNPKDCSENIEVKEGGLYFERRPVAQS .:... .:::.:. .:..:. ..: :.::: CCDS46 SLCARSLAEEALRTDILCNLPSYKAKIRAFQHAFSTNDCSRNVYIKKNGFTLHRNPIAQS 90 100 110 120 130 140 80 90 100 110 120 130 pF1KE4 TDGARGKRGYSRGLHAWEISW--PLEQRGTHAVVGVATALAPLQTDHYAALLGSNSESWG ::::: : :.:.: ::::. : :: :: ::.:.:: ::.: . :.:::::...::: CCDS46 TDGARTKIGFSEGRHAWEVWWEGPL---GTVAVIGIATKRAPMQCQGYVALLGSDDQSWG 150 160 170 180 190 140 150 160 170 180 190 pF1KE4 WDIGRGKLYHQSKGPGAPQYPAGTQGEQLEVPERLLVVLDMEEGTLGYAIGGTYLGPAFR :.. ..: :... :. .: ... . .. ::. :.::::. ::.. : .:: ::: CCDS46 WNLVDNNLLHNGEVNGS--FPQCNNAPKYQIGERIRVILDMEDKTLAFERGYEFLGVAFR 200 210 220 230 240 250 200 210 220 230 240 250 pF1KE4 GLKGRTLYPAVSAVWGQCQVRIRYLGERRGEAWGRRGENFLSLVAVVWDGNSSDKSRGDG :: ::::::::.:. .: . :::. CCDS46 GLPKVCLYPAVSAVYGNTEVTLVYLGKPLDG 260 270 280 267 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:17:14 2016 done: Sun Nov 6 04:17:15 2016 Total Scan time: 2.730 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]