FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6620, 327 aa 1>>>pF1KE6620 327 - 327 aa - 327 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5304+/-0.000774; mu= 14.2247+/- 0.046 mean_var=57.3722+/-11.541, 0's: 0 Z-trim(106.8): 12 B-trim: 2 in 1/50 Lambda= 0.169326 statistics sampled from 9204 (9211) to 9204 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.665), E-opt: 0.2 (0.283), width: 16 Scan time: 1.980 The best scores are: opt bits E(32554) CCDS7467.1 HOGA1 gene_id:112817|Hs108|chr10 ( 327) 2188 542.6 1.6e-154 CCDS44469.1 HOGA1 gene_id:112817|Hs108|chr10 ( 164) 671 171.9 3.1e-43 CCDS1350.1 NPL gene_id:80896|Hs108|chr1 ( 320) 311 84.0 1.7e-16 CCDS55667.1 NPL gene_id:80896|Hs108|chr1 ( 301) 257 70.8 1.5e-12 CCDS72990.1 NPL gene_id:80896|Hs108|chr1 ( 230) 247 68.4 6.4e-12 >>CCDS7467.1 HOGA1 gene_id:112817|Hs108|chr10 (327 aa) initn: 2188 init1: 2188 opt: 2188 Z-score: 2887.5 bits: 542.6 E(32554): 1.6e-154 Smith-Waterman score: 2188; 100.0% identity (100.0% similar) in 327 aa overlap (1-327:1-327) 10 20 30 40 50 60 pF1KE6 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG 250 260 270 280 290 300 310 320 pF1KE6 PCRAPLQELSPAEEEALRMDFTSNGWL ::::::::::::::::::::::::::: CCDS74 PCRAPLQELSPAEEEALRMDFTSNGWL 310 320 >>CCDS44469.1 HOGA1 gene_id:112817|Hs108|chr10 (164 aa) initn: 671 init1: 671 opt: 671 Z-score: 889.7 bits: 171.9 E(32554): 3.1e-43 Smith-Waterman score: 806; 50.2% identity (50.2% similar) in 327 aa overlap (1-327:1-164) 10 20 30 40 50 60 pF1KE6 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE ::::::::::: CCDS44 LHKLGTFPFRG------------------------------------------------- 70 130 140 150 160 170 180 pF1KE6 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV CCDS44 ------------------------------------------------------------ 190 200 210 220 230 240 pF1KE6 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC :::::: CCDS44 ------------------------------------------------------AVGGVC 250 260 270 280 290 300 pF1KE6 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG 80 90 100 110 120 130 310 320 pF1KE6 PCRAPLQELSPAEEEALRMDFTSNGWL ::::::::::::::::::::::::::: CCDS44 PCRAPLQELSPAEEEALRMDFTSNGWL 140 150 160 >>CCDS1350.1 NPL gene_id:80896|Hs108|chr1 (320 aa) initn: 117 init1: 67 opt: 311 Z-score: 409.5 bits: 84.0 E(32554): 1.7e-16 Smith-Waterman score: 320; 27.5% identity (55.6% similar) in 306 aa overlap (32-323:5-307) 10 20 30 40 50 60 pF1KE6 LGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENL : . :. . ::.: ..:.... . . . CCDS13 MAFPKKKLQGLVAATITPMTENGEINFSVIGQYV 10 20 30 70 80 90 100 110 pF1KE6 HKLGTFP-FRGFVVQGSNGEFPFLTSSERLEVVSR-VRQAMPKNRLLLAGSGCESTQATV : ... :.:..:: :. ::: .:. . : .. : .. : : . . CCDS13 DYLVKEQGVKNIFVNGTTGEGLSLSVSERRQVAEEWVTKGKDKLDQVIIHVGALSLKESQ 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE6 EMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDL :.. :..:::. :..: . . .. ::. .:: .: .: : .:: ::. . CCDS13 ELAQHAAEIGADGIAVIAPFFLKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKI 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 PVDAVVT--LSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAV .. .. :.. :.. :.: : :. .: : ..:.:.: : : :... ..::. CCDS13 RAEELLDGILDKIPTFQGLKFSDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGAT 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE6 GGVCALANVLGAQVCQLERLCCTGQWEDAQKLQ---HRLIEPNAAVTRRFGIPGLKKIMD :.: . : :: .. :. . .. : . : .:.: : .: ::. : :: CCDS13 GAVGSTYNYLGKKTNQMLEAFEQKDFSLALNYQFCIQRFI--NFVVKLGFGVSQTKAIMT 220 230 240 250 260 270 300 310 320 pF1KE6 WF-GYYGGPCRAPLQ----ELSPAEEEALR-MDFTSNGWL : :: : ::: :.. . : :. .:: : CCDS13 LVSGIPMGPPRLPLQKASREFTDSAEAKLKSLDFLSFTDLKDGNLEAGS 280 290 300 310 320 >>CCDS55667.1 NPL gene_id:80896|Hs108|chr1 (301 aa) initn: 117 init1: 67 opt: 257 Z-score: 338.7 bits: 70.8 E(32554): 1.5e-12 Smith-Waterman score: 266; 29.3% identity (55.6% similar) in 225 aa overlap (111-323:67-288) 90 100 110 120 130 140 pF1KE6 FPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVEMTVSMAQVGADAAMVVTPCY : : . . :.. :..:::. :..: . CCDS55 GNCFLPVYKASPLTVTRLWAERLDQVIIHVGALSLKESQELAQHAAEIGADGIAVIAPFF 40 50 60 70 80 90 150 160 170 180 190 pF1KE6 YRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDLPVDAVVT--LSQHPNIVGMKD . .. ::. .:: .: .: : .:: ::. . .. .. :.. :.. :.: CCDS55 LKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKIRAEELLDGILDKIPTFQGLKF 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE6 SGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVCALANVLGAQVCQLERLC : :. .: : ..:.:.: : : :... ..::.:.: . : :: .. :. . CCDS55 SDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGATGAVGSTYNYLGKKTNQMLEAF 160 170 180 190 200 210 260 270 280 290 300 pF1KE6 CTGQWEDAQKLQ---HRLIEPNAAVTRRFGIPGLKKIMDWF-GYYGGPCRAPLQ----EL .. : . : .:.: : .: ::. : :: : :: : ::: :. CCDS55 EQKDFSLALNYQFCIQRFI--NFVVKLGFGVSQTKAIMTLVSGIPMGPPRLPLQKASREF 220 230 240 250 260 270 310 320 pF1KE6 SPAEEEALR-MDFTSNGWL . . : :. .:: : CCDS55 TDSAEAKLKSLDFLSFTDLKDGNLEAGS 280 290 300 >>CCDS72990.1 NPL gene_id:80896|Hs108|chr1 (230 aa) initn: 67 init1: 67 opt: 247 Z-score: 327.5 bits: 68.4 E(32554): 6.4e-12 Smith-Waterman score: 247; 26.3% identity (58.7% similar) in 213 aa overlap (32-239:5-216) 10 20 30 40 50 60 pF1KE6 LGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENL : . :. . ::.: ..:.... . . . CCDS72 MAFPKKKLQGLVAATITPMTENGEINFSVIGQYV 10 20 30 70 80 90 100 110 pF1KE6 HKLGTFP-FRGFVVQGSNGEFPFLTSSERLEVVSR-VRQAMPKNRLLLAGSGCESTQATV : ... :.:..:: :. ::: .:. . : .. : .. : : . . CCDS72 DYLVKEQGVKNIFVNGTTGEGLSLSVSERRQVAEEWVTKGKDKLDQVIIHVGALSLKESQ 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE6 EMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDL :.. :..:::. :..: . . .. ::. .:: .: .: : .:: ::. . CCDS72 ELAQHAAEIGADGIAVIAPFFLKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKI 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 PVDAVVT--LSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAV .. .. :.. :.. :.: : :. .: : ..:.:.: : : :... ..::. CCDS72 RAEELLDGILDKIPTFQGLKFSDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGAT 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE6 GGVCALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFG :.: CCDS72 GAVGSFVSRDLSTLLSN 220 230 327 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:50:57 2016 done: Tue Nov 8 14:50:57 2016 Total Scan time: 1.980 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]