FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1165, 594 aa 1>>>pF1KE1165 594 - 594 aa - 594 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1287+/-0.00101; mu= 14.3178+/- 0.061 mean_var=73.4006+/-14.653, 0's: 0 Z-trim(104.5): 24 B-trim: 0 in 0/51 Lambda= 0.149701 statistics sampled from 7936 (7946) to 7936 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.606), E-opt: 0.2 (0.244), width: 16 Scan time: 3.540 The best scores are: opt bits E(32554) CCDS9951.1 SETD3 gene_id:84193|Hs108|chr14 ( 594) 3867 844.9 0 CCDS9952.1 SETD3 gene_id:84193|Hs108|chr14 ( 296) 1892 418.3 6.9e-117 CCDS10798.1 SETD6 gene_id:79918|Hs108|chr16 ( 449) 275 69.1 1.4e-11 CCDS54013.1 SETD6 gene_id:79918|Hs108|chr16 ( 473) 271 68.3 2.6e-11 >>CCDS9951.1 SETD3 gene_id:84193|Hs108|chr14 (594 aa) initn: 3867 init1: 3867 opt: 3867 Z-score: 4512.2 bits: 844.9 E(32554): 0 Smith-Waterman score: 3867; 100.0% identity (100.0% similar) in 594 aa overlap (1-594:1-594) 10 20 30 40 50 60 pF1KE1 MGKKSRVKTQKSGTGATATVSPKEILNLTSELLQKCSSPAPGPGKEWEEYVQIRTLVEKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 MGKKSRVKTQKSGTGATATVSPKEILNLTSELLQKCSSPAPGPGKEWEEYVQIRTLVEKI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 RKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 LWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 LWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 SEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 SEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVAL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 QDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 QDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 GIPTSSVFALHFTEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 GIPTSSVFALHFTEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSW 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 DNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 DNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVK 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE1 SAAVNREYYRQQMEEKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVQDALNIRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 SAAVNREYYRQQMEEKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVQDALNIRE 490 500 510 520 530 540 550 560 570 580 590 pF1KE1 AISKAKATENGLVNGENSIPNGTRSENESLNQESKRAVEDAKGSSSDSTAGVKE :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 AISKAKATENGLVNGENSIPNGTRSENESLNQESKRAVEDAKGSSSDSTAGVKE 550 560 570 580 590 >>CCDS9952.1 SETD3 gene_id:84193|Hs108|chr14 (296 aa) initn: 1892 init1: 1892 opt: 1892 Z-score: 2211.9 bits: 418.3 E(32554): 6.9e-117 Smith-Waterman score: 1892; 100.0% identity (100.0% similar) in 283 aa overlap (1-283:1-283) 10 20 30 40 50 60 pF1KE1 MGKKSRVKTQKSGTGATATVSPKEILNLTSELLQKCSSPAPGPGKEWEEYVQIRTLVEKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 MGKKSRVKTQKSGTGATATVSPKEILNLTSELLQKCSSPAPGPGKEWEEYVQIRTLVEKI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 RKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 LWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 LWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 SEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 SEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVAL ::::::::::::::::::::::::::::::::::::::::::: CCDS99 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLTPEDSFALAVASA 250 260 270 280 290 310 320 330 340 350 360 pF1KE1 QDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARA >>CCDS10798.1 SETD6 gene_id:79918|Hs108|chr16 (449 aa) initn: 135 init1: 59 opt: 275 Z-score: 321.6 bits: 69.1 E(32554): 1.4e-11 Smith-Waterman score: 369; 26.2% identity (53.2% similar) in 442 aa overlap (72-477:15-443) 50 60 70 80 90 100 pF1KE1 GPGKEWEEYVQIRTLVEKIRKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNF :: : ...: . : . :. CCDS10 MATQAKRPRVAGPVDGGDLDPVACFLSWCRRVGLELSPKVAVSR 10 20 30 40 110 120 130 140 150 pF1KE1 KEE--GFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRI-LQAM-GNIAL . :.:. : ....: ::.. ::: :. : .. .: : ..:. ::.. : . : CCDS10 QGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVALQSQSGWVPL 50 60 70 80 90 100 160 170 180 190 200 210 pF1KE1 AFHLLCERASPNSFWQPYIQTLP--SEYDTPLYFEEDEVR-YLQSTQAIHDVFSQYKNTA . :: : .: : :.::. : .. . :... :.: : ::.: . . : .. : CCDS10 LLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIR 110 120 130 140 150 160 220 230 240 250 260 pF1KE1 RQY-AYFYKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV----TLA .: . ...:: .: .. . : :. :. ::. . : : :. . . CCDS10 SEYQSIVLPFMEAHPDLFSLRVR---SLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPV 170 180 190 200 210 270 280 290 300 310 320 pF1KE1 LIPLWDMCNHTNGLITTGYNLE-DDRC-ECVALQDFRAGEQIYIFYGTRSNAEFVIHSGF ..: :. :: : . . ::: . : . :: : . :..:. :: .: ... :: CCDS10 MVPAADILNH---LANHNANLEYSANCLRMVATQPIPKGHEIFNTYGQMANWQLIHMYGF 220 230 240 250 260 270 330 340 350 360 370 pF1KE1 F--FDNNSHDRVKIKLGVSKSDRLYAMKAE-----VLARAG-------IPTSSVFALHFT . .:. : . :.. . . : . :.: : : . ..:.. CCDS10 VEPYPDNTDDTADIQMVTVREAALQGTKTEAERHLVYERWDFLCKLEMVGEEGAFVIGRE 280 290 300 310 320 330 380 390 400 410 420 pF1KE1 EPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKL---WT-F : .: . :.:.:: ::..: :.. : :. .. : :: : . CCDS10 EVLTEEELTTTLKVLCMPAEEFRELKDQDGGGDDKREEGS----LTITNIPKLKASWRQL 340 350 360 370 380 390 430 440 450 460 470 480 pF1KE1 LEDRASLLLKTYKTTIEEDKSVLKNHD----LSVRAKMAIKLRLGEKEILEKAVKSAAVN :.. . : :.:: : .. :...:.:.. :: : ..:...: :.: ::.. CCDS10 LQNSVLLTLQTYATDLKTDQGLLSNKEVYAKLSWREQQALQVRYGQKMILHQLLELTS 400 410 420 430 440 490 500 510 520 530 540 pF1KE1 REYYRQQMEEKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVQDALNIREAISKA >>CCDS54013.1 SETD6 gene_id:79918|Hs108|chr16 (473 aa) initn: 135 init1: 59 opt: 271 Z-score: 316.5 bits: 68.3 E(32554): 2.6e-11 Smith-Waterman score: 365; 27.0% identity (54.5% similar) in 407 aa overlap (105-477:74-467) 80 90 100 110 120 130 pF1KE1 REDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA :.:. : ....: ::.. ::: :. : CCDS54 AGGRRTRGGARAALTSPPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQ 50 60 70 80 90 100 140 150 160 170 180 190 pF1KE1 KNSVLGPLYSQDRI-LQAM-GNIALAFHLLCERASPNSFWQPYIQTLP--SEYDTPLYFE .. .: : ..:. ::.. : . : . :: : .: : :.::. : .. . :... CCDS54 HTCSIGGLLERERVALQSQSGWVPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWP 110 120 130 140 150 160 200 210 220 230 240 pF1KE1 EDEVR-YLQSTQAIHDVFSQYKNTARQY-AYFYKVIQTHPHANKLPLKDSFTYEDYRWAV :.: : ::.: . . : .. : .: . ...:: .: .. . : :. : CCDS54 EEERRCLLQGTGVPEAVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVR---SLELYHQLV 170 180 190 200 210 250 260 270 280 290 300 pF1KE1 SSVMTRQNQIPTEDGSRV----TLALIPLWDMCNHTNGLITTGYNLE-DDRC-ECVALQD . ::. . : : :. . ...: :. :: : . . ::: . : . :: : CCDS54 ALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNH---LANHNANLEYSANCLRMVATQP 220 230 240 250 260 270 310 320 330 340 350 pF1KE1 FRAGEQIYIFYGTRSNAEFVIHSGFF--FDNNSHDRVKIKLGVSKSDRLYAMKAE----- . :..:. :: .: ... :: . .:. : . :.. . . : . :.: CCDS54 IPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAERHL 280 290 300 310 320 330 360 370 380 390 400 pF1KE1 VLARAG-------IPTSSVFALHFTEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRI : : . ..:.. : .: . :.:.:: ::..: :.. : CCDS54 VYERWDFLCKLEMVGEEGAFVIGREEVLTEEELTTTLKVLCMPAEEFRELKDQDGGGDDK 340 350 360 370 380 390 410 420 430 440 450 460 pF1KE1 FTLGNSEFPVSWDNEVKL---WT-FLEDRASLLLKTYKTTIEEDKSVLKNHD----LSVR :. .. : :: : .:.. . : :.:: : .. :...:.:.. :: : CCDS54 REEGS----LTITNIPKLKASWRQLLQNSVLLTLQTYATDLKTDQGLLSNKEVYAKLSWR 400 410 420 430 440 450 470 480 490 500 510 520 pF1KE1 AKMAIKLRLGEKEILEKAVKSAAVNREYYRQQMEEKAPLPKYEESNLGLLESSVGDSRLP ..:...: :.: ::.. CCDS54 EQQALQVRYGQKMILHQLLELTS 460 470 594 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:23:20 2016 done: Sun Nov 6 14:23:21 2016 Total Scan time: 3.540 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]