FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6609, 299 aa
  1>>>pF1KE6609 299 - 299 aa - 299 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3985+/-0.000937; mu= 14.4057+/- 0.057
 mean_var=58.4712+/-12.375, 0's: 0 Z-trim(104.6): 23 B-trim: 442 in 1/47
 Lambda= 0.167727
 statistics sampled from 7973 (7985) to 7973 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.617), E-opt: 0.2 (0.245), width: 16
 Scan time: 1.530

The best scores are:                                       opt  bits E(32554)
CCDS8435.1 SC5D gene_id:6309|Hs108|chr11           ( 299) 2086 513.1 9.8e-146
CCDS3809.1 MSMO1 gene_id:6307|Hs108|chr4           ( 293)  279  75.9 4.1e-14
CCDS43280.1 MSMO1 gene_id:6307|Hs108|chr4          ( 162)  273  74.4 6.5e-14


>>CCDS8435.1 SC5D gene_id:6309|Hs108|chr11 (299 aa)
 initn: 2086 init1: 2086 opt: 2086 Z-score: 2729.9 bits: 513.1 E(32554): 9.8e-146
Smith-Waterman score: 2086; 100.0% identity (100.0% similar) in 299 aa overlap (1-299:1-299)

               10        20        30        40        50        60
pF1KE6 MDLVLRVADYYFFTPYVYPATWPEDDIFRQAISLLIVTNVGAYILYFFCATLSYYFVFDH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS84 MDLVLRVADYYFFTPYVYPATWPEDDIFRQAISLLIVTNVGAYILYFFCATLSYYFVFDH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 ALMKHPQFLKNQVRREIKFTVQALPWISILTVALFLLEIRGYSKLHDDLGEFPYGLFELV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS84 ALMKHPQFLKNQVRREIKFTVQALPWISILTVALFLLEIRGYSKLHDDLGEFPYGLFELV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 VSIISFLFFTDMFIYWIHRGLHHRLVYKRLHKPHHIWKIPTPFASHAFHPIDGFLQSLPY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS84 VSIISFLFFTDMFIYWIHRGLHHRLVYKRLHKPHHIWKIPTPFASHAFHPIDGFLQSLPY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 HIYPFIFPLHKVVYLSLYILVNIWTISIHDGDFRVPQILQPFINGSAHHTDHHMFFDYNY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS84 HIYPFIFPLHKVVYLSLYILVNIWTISIHDGDFRVPQILQPFINGSAHHTDHHMFFDYNY
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE6 GQYFTLWDRIGGSFKNPSSFEGKGPLSYVKEMTEGKRSSHSGNGCKNEKLFNGEFTKTE
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS84 GQYFTLWDRIGGSFKNPSSFEGKGPLSYVKEMTEGKRSSHSGNGCKNEKLFNGEFTKTE
              250       260       270       280       290

>>CCDS3809.1 MSMO1 gene_id:6307|Hs108|chr4 (293 aa)
 initn: 268 init1: 152 opt: 279 Z-score: 366.9 bits: 75.9 E(32554): 4.1e-14
Smith-Waterman score: 279; 29.5% identity (55.1% similar) in 234 aa overlap (45-263:61-284)

           20        30        40         50        60
pF1KE6 PYVYPATWPEDDIFRQAISLLIVTNVGAYILYF-FCATLSYYFVFDHALMKH------PQ
                                     ::: ::   .. : :   . :.      :.
CCDS38 EPFKNAWNYMLNNYTKFQIATWGSLIVHEALYFLFCLP-GFLFQFIPYMKKYKIQKDKPE
               40        50        60         70        80

        70             80        90       100       110       120
pF1KE6 FLKNQ-----VRREIKFTVQALPWISILTVALFLLEIRGYSKLHDDLGEFPYGLFELVVS
         .::     :    .: .: ::   ..  . .. :   : ..  :  ..:   : :..
CCDS38 TWENQWKCFKVLLFNHFCIQ-LP---LICGTYYFTE---YFNIPYDWERMPRWYF-LLAR
      90       100        110          120          130        140

            130       140       150       160       170       180
pF1KE6 IISFLFFTDMFIYWIHRGLHHRLVYKRLHKPHHIWKIPTPFASHAFHPIDGFLQSLPYHI
        ..   . : . :..:: :::. .:: .:: :: .. :  . ..  ::.. .. .  . :
CCDS38 CFGCAVIEDTWHYFLHRLLHHKRIYKYIHKVHHEFQAPFGMEAEYAHPLETLILGTGFFI
             150       160       170       180       190       200

            190       200        210         220       230
pF1KE6 YPFIFPLHKVVYLSLYILVNIW-TISIHDG-DFRV-PQILQPFINGSAHHTDHHMFFDYN
          ..  : :. :  .. . .  ::..:.: :. . :  : ::  :: ::  ::: :  :
CCDS38 GIVLLCDH-VILLWAWVTIRLLETIDVHSGYDIPLNPLNLIPFYAGSRHHDFHHMNFIGN
              210       220       230       240       250       260

     240       250       260       270       280       290
pF1KE6 YGQYFTLWDRIGGSFKNPSSFEGKGPLSYVKEMTEGKRSSHSGNGCKNEKLFNGEFTKTE
       :.. :: :::: :. .. .... :
CCDS38 YASTFTWWDRIFGTDSQYNAYNEKRKKFEKKTE
              270       280       290

>>CCDS43280.1 MSMO1 gene_id:6307|Hs108|chr4 (162 aa)
 initn: 268 init1: 152 opt: 273 Z-score: 363.2 bits: 74.4 E(32554): 6.5e-14
Smith-Waterman score: 273; 34.6% identity (62.5% similar) in 136 aa overlap (131-263:19-153)

              110       120       130       140       150       160
pF1KE6 GYSKLHDDLGEFPYGLFELVVSIISFLFFTDMFIYWIHRGLHHRLVYKRLHKPHHIWKIP
                                     : . :..:: :::. .:: .:: :: .. :
CCDS43             MPRWYFLLARCFGCAVIEDTWHYFLHRLLHHKRIYKYIHKVHHEFQAP
                           10        20        30        40

              170       180       190       200        210
pF1KE6 TPFASHAFHPIDGFLQSLPYHIYPFIFPLHKVVYLSLYILVNIW-TISIHDG-DFRV-PQ
         . ..  ::.. .. .  . :   ..  : :. :  .. . .  ::..:.: :. . :
CCDS43 FGMEAEYAHPLETLILGTGFFIGIVLLCDH-VILLWAWVTIRLLETIDVHSGYDIPLNPL
       50        60        70         80        90       100

       220       230       240       250       260       270
pF1KE6 ILQPFINGSAHHTDHHMFFDYNYGQYFTLWDRIGGSFKNPSSFEGKGPLSYVKEMTEGKR
        : ::  :: ::  ::: :  ::.. :: :::: :. .. .... :
CCDS43 NLIPFYAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDSQYNAYNEKRKKFEKKTE
       110       120       130       140       150       160

       280       290
pF1KE6 SSHSGNGCKNEKLFNGEFTKTE


299 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 14:44:30 2016 done: Tue Nov 8 14:44:30 2016
 Total Scan time: 1.530 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]