FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6230, 300 aa
  1>>>pF1KE6230 300 - 300 aa - 300 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.0736+/-0.000776; mu= 16.8007+/- 0.047
 mean_var=62.9287+/-12.389, 0's: 0 Z-trim(107.4): 8 B-trim: 0 in 0/50
 Lambda= 0.161678
 statistics sampled from 9523 (9525) to 9523 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.674), E-opt: 0.2 (0.293), width: 16
 Scan time: 1.770

The best scores are:                                   opt  bits E(32554)
CCDS3417.1 CD38 gene_id:952|Hs108|chr4        ( 300)  2100 498.2 3.1e-141
CCDS3416.1 BST1 gene_id:683|Hs108|chr4        ( 318)   659 162.1 4.9e-40

>>CCDS3417.1 CD38 gene_id:952|Hs108|chr4 (300 aa)
 initn: 2100 init1: 2100 opt: 2100 Z-score: 2649.2 bits: 498.2 E(32554): 3.1e-141
Smith-Waterman score: 2100; 100.0% identity (100.0% similar) in 300 aa overlap (1-300:1-300)

               10        20        30        40        50        60
pF1KE6 MANCEFSPVSGDKPCCRLSRRAQLCLGVSILVLILVVVLAVVVPRWRQQWSGPGTTKRFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MANCEFSPVSGDKPCCRLSRRAQLCLGVSILVLILVVVLAVVVPRWRQQWSGPGTTKRFP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 ETVLARCVKYTEIHPEMRHVDCQSVWDAFKGAFISKHPCNITEEDYQPLMKLGTQTVPCN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETVLARCVKYTEIHPEMRHVDCQSVWDAFKGAFISKHPCNITEEDYQPLMKLGTQTVPCN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 KILLWSRIKDLAHQFTQVQRDMFTLEDTLLGYLADDLTWCGEFNTSKINYQSCPDWRKDC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 KILLWSRIKDLAHQFTQVQRDMFTLEDTLLGYLADDLTWCGEFNTSKINYQSCPDWRKDC
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 SNNPVSVFWKTVSRRFAEAACDVVHVMLNGSRSKIFDKNSTFGSVEVHNLQPEKVQTLEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 SNNPVSVFWKTVSRRFAEAACDVVHVMLNGSRSKIFDKNSTFGSVEVHNLQPEKVQTLEA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 WVIHGGREDSRDLCQDPTIKELESIISKRNIQFSCKNIYRPDKFLQCVKNPEDSSCTSEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 WVIHGGREDSRDLCQDPTIKELESIISKRNIQFSCKNIYRPDKFLQCVKNPEDSSCTSEI
              250       260       270       280       290       300

>>CCDS3416.1 BST1 gene_id:683|Hs108|chr4 (318 aa)
 initn: 569 init1: 382 opt: 659 Z-score: 832.3 bits: 162.1 E(32554): 4.9e-40
Smith-Waterman score: 659; 36.8% identity (66.0% similar) in 285 aa overlap (16-296:6-280)

               10        20        30        40        50        60
pF1KE6 MANCEFSPVSGDKPCCRLSRRAQLCLGVSILVLILVVVLAVVVPRWRQQWSGPGTTKRFP
                      :  ::  :: : . .:.:.:..  :      : .: : ::. ..
CCDS34           MAAQGCAASRLLQLLLQLLLLLLLLAAGGA------RARWRGEGTSAHLR
                         10        20        30              40

               70         80        90       100       110
pF1KE6 ETVLARCVKYTEI-HPEMRHVDCQSVWDAFKGAFISKHPCNITEEDYQPLMKLGTQTVPC
       .  :.::..:  .  ::.:. .: ..:.::: : ..: ::..   ::. ...:. ...:
CCDS34 DIFLGRCAEYRALLSPEQRNKNCTAIWEAFKVA-LDKDPCSVLPSDYDLFINLSRHSIPR
           50        60        70         80        90       100

     120       130       140       150       160       170
pF1KE6 NKILLWSRIKDLAHQFTQVQRDMFTLEDTLLGYLADDLTWCGEFNTSKINYQSCPDWRKD
       .: :.:   . :...:..  : .. : :.: : .:: :.:: . : : ..:::::   .:
CCDS34 DKSLFWENSHLLVNSFADNTRRFMPLSDVLYGRVADFLSWCRQKNDSGLDYQSCPT-SED
           110       120       130       140       150        160

     180       190       200       210        220       230
pF1KE6 CSNNPVSVFWKTVSRRFAEAACDVVHVMLNGSR-SKIFDKNSTFGSVEVHNLQPEKVQTL
       : ::::. ::: .: .... .  :.:::::::. .  .  .. :.. :. ::: ::.  .
CCDS34 CENNPVDSFWKRASIQYSKDSSGVIHVMLNGSEPTGAYPIKGFFADYEIPNLQKEKITRI
            170       180       190       200       210       220

      240         250       260       270       280       290
pF1KE6 EAWVIH--GGREDSRDLCQDPTIKELESIISKRNIQFSCKNIYRPDKFLQCVKNPEDSSC
       : ::.:  ::   . . : . ..: ::. ..  ..:.:: : ::: :.:::: .    .:
CCDS34 EIWVMHEIGG--PNVESCGEGSMKVLEKRLKDMGFQYSCINDYRPVKLLQCVDHSTHPDC
            230         240       250       260       270       280

        300
pF1KE6 TSEI

CCDS34 ALKSAAAATQRKAPSLYTEQRAGLIIPLFLVLASRTQL
              290       300       310

300 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 11:17:37 2016 done: Tue Nov 8 11:17:37 2016
 Total Scan time: 1.770 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]