FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1149, 449 aa 1>>>pF1KE1149 449 - 449 aa - 449 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4560+/-0.000799; mu= 16.6296+/- 0.048 mean_var=64.1060+/-12.720, 0's: 0 Z-trim(106.6): 19 B-trim: 0 in 0/52 Lambda= 0.160186 statistics sampled from 9062 (9070) to 9062 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.65), E-opt: 0.2 (0.279), width: 16 Scan time: 2.210 The best scores are: opt bits E(32554) CCDS10798.1 SETD6 gene_id:79918|Hs108|chr16 ( 449) 2996 701.1 5.7e-202 CCDS54013.1 SETD6 gene_id:79918|Hs108|chr16 ( 473) 2723 638.0 5.9e-183 CCDS9951.1 SETD3 gene_id:84193|Hs108|chr14 ( 594) 275 72.4 1.4e-12 >>CCDS10798.1 SETD6 gene_id:79918|Hs108|chr16 (449 aa) initn: 2996 init1: 2996 opt: 2996 Z-score: 3739.5 bits: 701.1 E(32554): 5.7e-202 Smith-Waterman score: 2996; 100.0% identity (100.0% similar) in 449 aa overlap (1-449:1-449) 10 20 30 40 50 60 pF1KE1 MATQAKRPRVAGPVDGGDLDPVACFLSWCRRVGLELSPKVAVSRQGTVAGYGMVARESVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MATQAKRPRVAGPVDGGDLDPVACFLSWCRRVGLELSPKVAVSRQGTVAGYGMVARESVQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 AGELLFVVPRAALLSQHTCSIGGLLERERVALQSQSGWVPLLLALLHELQAPASRWRPYF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AGELLFVVPRAALLSQHTCSIGGLLERERVALQSQSGWVPLLLALLHELQAPASRWRPYF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 ALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIRSEYQSIVLPFMEAHPDLFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 ALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIRSEYQSIVLPFMEAHPDLFS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 LRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNHLANHNANLEYSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNHLANHNANLEYSA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 NCLRMVATQPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 NCLRMVATQPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 GTKTEAERHLVYERWDFLCKLEMVGEEGAFVIGREEVLTEEELTTTLKVLCMPAEEFREL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 GTKTEAERHLVYERWDFLCKLEMVGEEGAFVIGREEVLTEEELTTTLKVLCMPAEEFREL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 KDQDGGGDDKREEGSLTITNIPKLKASWRQLLQNSVLLTLQTYATDLKTDQGLLSNKEVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 KDQDGGGDDKREEGSLTITNIPKLKASWRQLLQNSVLLTLQTYATDLKTDQGLLSNKEVY 370 380 390 400 410 420 430 440 pF1KE1 AKLSWREQQALQVRYGQKMILHQLLELTS ::::::::::::::::::::::::::::: CCDS10 AKLSWREQQALQVRYGQKMILHQLLELTS 430 440 >>CCDS54013.1 SETD6 gene_id:79918|Hs108|chr16 (473 aa) initn: 2718 init1: 2718 opt: 2723 Z-score: 3398.2 bits: 638.0 E(32554): 5.9e-183 Smith-Waterman score: 2938; 94.9% identity (94.9% similar) in 473 aa overlap (1-449:1-473) 10 20 30 40 pF1KE1 MATQAKRPRVAGPVDGGDLDPVACFLSWCRRVGLELSPKV-------------------- :::::::::::::::::::::::::::::::::::::::: CCDS54 MATQAKRPRVAGPVDGGDLDPVACFLSWCRRVGLELSPKVSERAGGRRTRGGARAALTSP 10 20 30 40 50 60 50 60 70 80 90 pF1KE1 ----AVSRQGTVAGYGMVARESVQAGELLFVVPRAALLSQHTCSIGGLLERERVALQSQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALLSQHTCSIGGLLERERVALQSQS 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE1 GWVPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GWVPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKD 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE1 LANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNS 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE1 PVMVPAADILNHLANHNANLEYSANCLRMVATQPIPKGHEIFNTYGQMANWQLIHMYGFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PVMVPAADILNHLANHNANLEYSANCLRMVATQPIPKGHEIFNTYGQMANWQLIHMYGFV 250 260 270 280 290 300 280 290 300 310 320 330 pF1KE1 EPYPDNTDDTADIQMVTVREAALQGTKTEAERHLVYERWDFLCKLEMVGEEGAFVIGREE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 EPYPDNTDDTADIQMVTVREAALQGTKTEAERHLVYERWDFLCKLEMVGEEGAFVIGREE 310 320 330 340 350 360 340 350 360 370 380 390 pF1KE1 VLTEEELTTTLKVLCMPAEEFRELKDQDGGGDDKREEGSLTITNIPKLKASWRQLLQNSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VLTEEELTTTLKVLCMPAEEFRELKDQDGGGDDKREEGSLTITNIPKLKASWRQLLQNSV 370 380 390 400 410 420 400 410 420 430 440 pF1KE1 LLTLQTYATDLKTDQGLLSNKEVYAKLSWREQQALQVRYGQKMILHQLLELTS ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LLTLQTYATDLKTDQGLLSNKEVYAKLSWREQQALQVRYGQKMILHQLLELTS 430 440 450 460 470 >>CCDS9951.1 SETD3 gene_id:84193|Hs108|chr14 (594 aa) initn: 85 init1: 59 opt: 275 Z-score: 339.1 bits: 72.4 E(32554): 1.4e-12 Smith-Waterman score: 369; 25.9% identity (53.6% similar) in 440 aa overlap (15-443:72-477) 10 20 30 40 pF1KE1 MATQAKRPRVAGPVDGGDLDPVACFLSWCRRVGLELSPKVAVSR :: : ...: . : . :. CCDS99 GPGKEWEEYVQIRTLVEKIRKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNF 50 60 70 80 90 100 50 60 70 80 90 100 pF1KE1 QGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVALQSQSGWVPL . :.:. : ....: ::.. ::: :. : .. .: : ..:. ::.. : . : CCDS99 KEE--GFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRI-LQAM-GNIAL 110 120 130 140 150 110 120 130 140 150 160 pF1KE1 LLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIR . :: : .: : :.::. : .. . :... :.: : ::.: . . : .. : CCDS99 AFHLLCERASPNSFWQPYIQTLP--SEYDTPLYFEEDEVR-YLQSTQAIHDVFSQYKNTA 160 170 180 190 200 210 170 180 190 200 210 pF1KE1 SEYQSIVLPFMEAHPDLFSLRVR---SLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPV .: . ...:: .: .. . : :. :. ::. . : : :. . . CCDS99 RQY-AYFYKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSR----VTLA 220 230 240 250 260 220 230 240 250 260 270 pF1KE1 MVPAADILNH---LANHNANLEYSANCLRMVATQPIPKGHEIFNTYGQMANWQLIHMYGF ..: :. :: : . . ::: . : . :: : . :..:. :: .: ... :: CCDS99 LIPLWDMCNHTNGLITTGYNLE-DDRC-ECVALQDFRAGEQIYIFYGTRSNAEFVIHSGF 270 280 290 300 310 320 280 290 300 310 320 330 pF1KE1 VEPYPDNTDDTADIQMVTVREAALQGTKTEAERHLVYERWDFLCKLEMVGEEGAFVIGRE . .:. : . :.. . . : . :.: : : . ..:.. CCDS99 F--FDNNSHDRVKIKLGVSKSDRLYAMKAE-----VLAR-------AGIPTSSVFALHFT 330 340 350 360 370 340 350 360 370 380 390 pF1KE1 EVLTEEELTTTLKVLCMPAEEFRELKDQDGGGDDKREEGS--LTITNIPKLKASWRQLLQ : .: . :.:.:: ::..: :.. : :. . .. ..: : .:. CCDS99 EPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKL-WT-FLE 380 390 400 410 420 430 400 410 420 430 440 pF1KE1 NSVLLTLQTYATDLKTDQGLLSNKEVYAKLSWREQQALQVRYGQKMILHQLLELTS . . : :.:: : .. :...:.:.. :: : ..:...: :.: ::.. CCDS99 DRASLLLKTYKTTIEEDKSVLKNHD----LSVRAKMAIKLRLGEKEILEKAVKSAAVNRE 440 450 460 470 480 CCDS99 YYRQQMEEKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVQDALNIREAISKAKA 490 500 510 520 530 540 449 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:04:46 2016 done: Sun Nov 6 14:04:46 2016 Total Scan time: 2.210 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]