FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2181, 240 aa
  1>>>pF1KE2181 240 - 240 aa - 240 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4193+/-0.000687; mu= 14.1186+/- 0.041
 mean_var=65.9634+/-13.184, 0's: 0 Z-trim(110.7): 13  B-trim: 358 in 2/49
 Lambda= 0.157915
 statistics sampled from 11839 (11844) to 11839 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.742), E-opt: 0.2 (0.364), width:  16
 Scan time:  2.110

The best scores are:                                opt  bits E(32554)
CCDS1006.1 THEM4 gene_id:117145|Hs108|chr1  ( 240) 1636 380.8 4.3e-106
CCDS1005.1 THEM5 gene_id:284486|Hs108|chr1  ( 247)  547 132.7 2.1e-31

>>CCDS1006.1 THEM4 gene_id:117145|Hs108|chr1            (240 aa)
 initn: 1636 init1: 1636 opt: 1636  Z-score: 2018.3  bits: 380.8 E(32554): 4.3e-106
Smith-Waterman score: 1636; 99.6% identity (99.6% similar) in 240 aa overlap (1-240:1-240)

               10        20        30        40        50        60
pF1KE2 MLRSCAARLRTLGALCRPPVGRRLPGSEPRPELRSFSSEEVILKDCSVPNPSWNKDLRLL
       :::::::::::::::: :::::::::::::::::::::::::::::::::::::::::::
CCDS10 MLRSCAARLRTLGALCLPPVGRRLPGSEPRPELRSFSSEEVILKDCSVPNPSWNKDLRLL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 FDQFMKKCEDGSWKRLPSYKRTPTEWIQDFKTHFLDPKLMKEEQMSQAQLFTRSFDDGLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FDQFMKKCEDGSWKRLPSYKRTPTEWIQDFKTHFLDPKLMKEEQMSQAQLFTRSFDDGLG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 FEYVMFYNDIEKRMVCLFQGGPYLEGPPGFIHGGAIATMIDATVGMCAMMAGGIVMTANL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FEYVMFYNDIEKRMVCLFQGGPYLEGPPGFIHGGAIATMIDATVGMCAMMAGGIVMTANL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 NINYKRPIPLCSVVMINSQLDKVEGRKFFVSCNVQSVDEKTLYSEATSLFIKLNPAKSLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NINYKRPIPLCSVVMINSQLDKVEGRKFFVSCNVQSVDEKTLYSEATSLFIKLNPAKSLT
              190       200       210       220       230       240

>>CCDS1005.1 THEM5 gene_id:284486|Hs108|chr1            (247 aa)
 initn: 517 init1: 388 opt: 547  Z-score: 677.2  bits: 132.7 E(32554): 2.1e-31
Smith-Waterman score: 547; 38.7% identity (68.7% similar) in 243 aa overlap (1-233:1-239)

                   10        20             30        40        50
pF1KE2 MLRSC---AARL-RTLGALCRPPVGRRL-P----GSEPRPELRSFSSEEVILKDCSVPNP
       :.: :   :::: .  : :  : .  :: :    ::     .  :  :.. ::: ..::
CCDS10 MIRRCFQVAARLGHHRGLLEAPRILPRLNPASAFGSSTDSMFSRFLPEKTDLKDYALPNA
               10        20        30        40        50        60

              60        70        80        90       100       110
pF1KE2 SWNKDLRLLFDQFMKKCEDGSWKRLPSYKRTPTEWIQDFKTHFLDPKLMKEEQMSQAQLF
       :: .:.  :...:..: ....: .:::.: .  . :. .:   :   :    . .. ..:
CCDS10 SWCSDMLSLYQEFLEKTKSSGWIKLPSFK-SNRDHIRGLK---LPSGLAVSSDKGDCRIF
               70        80         90          100       110

              120       130       140       150       160       170
pF1KE2 TRSFD-DGLGFEYVMFYNDIEKRMVCLFQGGPYLEGPPGFIHGGAIATMIDATVGMCAMM
       :: .. .: :::::.:..  .:. ::::: : :::::::: :::..:.:.: : .  :..
CCDS10 TRCIQVEGQGFEYVIFFQPTQKKSVCLFQPGSYLEGPPGFAHGGSLAAMMDETFSKTAFL
        120       130       140       150       160       170

              180       190       200       210       220       230
pF1KE2 AGGIVMTANLNINYKRPIPLCSVVMINSQLDKVEGRKFFVSCNVQSVDEKTLYSEATSLF
       ::  ..: .::: .:  ::. :.:... .:::.: .:...:: ..: :..:.:......:
CCDS10 AGEGLFTLSLNIRFKNLIPVDSLVVMDVELDKIEDQKLYMSCIAHSRDQQTVYAKSSGVF
        180       190       200       210       220       230

              240
pF1KE2 IKLNPAKSLT
       ..:
CCDS10 LQLQLEEESPQ
        240

240 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov  7 15:48:07 2016 done: Mon Nov  7 15:48:07 2016
 Total Scan time:  2.110 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]