FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3758, 431 aa 1>>>pF1KE3758 431 - 431 aa - 431 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.4045+/-0.00098; mu= -10.9697+/- 0.059 mean_var=294.4516+/-60.239, 0's: 0 Z-trim(114.2): 80 B-trim: 0 in 0/52 Lambda= 0.074742 statistics sampled from 14673 (14752) to 14673 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.77), E-opt: 0.2 (0.453), width: 16 Scan time: 2.810 The best scores are: opt bits E(32554) CCDS1456.1 ELK4 gene_id:2005|Hs108|chr1 ( 431) 2835 318.9 6.1e-87 CCDS1457.1 ELK4 gene_id:2005|Hs108|chr1 ( 405) 2367 268.4 8.9e-72 CCDS9060.1 ELK3 gene_id:2004|Hs108|chr12 ( 407) 753 94.4 2.2e-19 CCDS14283.1 ELK1 gene_id:2002|Hs108|chrX ( 428) 701 88.8 1.1e-17 CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX ( 95) 470 63.6 9.8e-11 >>CCDS1456.1 ELK4 gene_id:2005|Hs108|chr1 (431 aa) initn: 2835 init1: 2835 opt: 2835 Z-score: 1674.4 bits: 318.9 E(32554): 6.1e-87 Smith-Waterman score: 2835; 100.0% identity (100.0% similar) in 431 aa overlap (1-431:1-431) 10 20 30 40 50 60 pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPENLSLEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPENLSLEP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 KDQDSVLLEKDKVNNSSRSKKPKGLELAPTLVITSSDPSPLGILSPSLPTASLTPAFFSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 KDQDSVLLEKDKVNNSSRSKKPKGLELAPTLVITSSDPSPLGILSPSLPTASLTPAFFSQ 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 TPIILTPSPLLSSIHFWSTLSPVAPLSPARLQGANTLFQFPSVLNSHGPFTLSGLDGPST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 TPIILTPSPLLSSIHFWSTLSPVAPLSPARLQGANTLFQFPSVLNSHGPFTLSGLDGPST 370 380 390 400 410 420 430 pF1KE3 PGPFSPDLQKT ::::::::::: CCDS14 PGPFSPDLQKT 430 >>CCDS1457.1 ELK4 gene_id:2005|Hs108|chr1 (405 aa) initn: 2351 init1: 2351 opt: 2367 Z-score: 1402.1 bits: 268.4 E(32554): 8.9e-72 Smith-Waterman score: 2367; 97.6% identity (98.4% similar) in 375 aa overlap (1-374:1-375) 10 20 30 40 50 60 pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPENLSLEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPENLSLEP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 KDQDSVLLEKDKVNNSSRSKKPKGLELAPTLVITSSDPSPLGILSPSLPTASLTPAFFSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 KDQDSVLLEKDKVNNSSRSKKPKGLELAPTLVITSSDPSPLGILSPSLPTASLTPAFFSQ 310 320 330 340 350 360 370 380 390 400 410 pF1KE3 TPI-ILTPSPLLSSIHFWSTLSPVAPLSPARLQGANTLFQFPSVLNSHGPFTLSGLDGPS . .. ::::: : CCDS14 VACSLFMVSPLLSFICPFKQIQNLYTQVCFLLLRFVLERLCVTVM 370 380 390 400 >>CCDS9060.1 ELK3 gene_id:2004|Hs108|chr12 (407 aa) initn: 1139 init1: 533 opt: 753 Z-score: 461.5 bits: 94.4 E(32554): 2.2e-19 Smith-Waterman score: 1199; 50.9% identity (71.6% similar) in 436 aa overlap (1-431:1-407) 10 20 30 40 50 60 pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS :.::::::::::::: ...:.::::::::.::::.:::::.:::.:::: :::::::: CCDS90 MESAITLWQFLLQLLLDQKHEHLICWTSNDGEFKLLKAEEVAKLWGLRKNKTNMNYDKLS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV ::::::: ::::::: ::::::::::.::::.::: .: : . ::: ... :. . . CCDS90 RALRYYYDKNIIKKVIGQKFVYKFVSFPEILKMDPHAV---EISRESLLLQD-SDCKASP 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS :. : . ...:::.:::::::::::.:::.. :: ::::. :. : . CCDS90 EGREAHKHGLAALRSTSRNEYIHSGLYSSFTINSLQNPP-DAFKAIKTEK-LEEPPEDSP 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE3 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT : : . .::.:::. . : ..:: .:. :: : .. . : . :: :. :: :. CCDS90 PVEEVRTVIRFVTNKTDKHVTRPV---VSL-PSTSEAAAASA-FLASSVSAKISSLMLPN 180 190 200 210 220 250 260 270 280 290 pF1KE3 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSS-HPDIDTDIDSVASQPMELPENLSLE .:: .... :.:: . : .:. :: : : .. . :. .: : ::: CCDS90 AAS-----ISSASPFSS----RSPSLSPNSPLPSEHRSLFLEAACHDSDSLE-PLNLSSG 230 240 250 260 270 300 310 320 330 340 350 pF1KE3 PKDQDSVLLEKDKVNNSSRSKKPKGLEL-APTLVITSSDPSPLGILSPSLPTASLTPAFF : .. : : .:::::::. :: ::....: . ... ::.::..::::::: CCDS90 SKTKSPSLPPK--------AKKPKGLEISAPPLVLSGTDIGSIALNSPALPSGSLTPAFF 280 290 300 310 320 330 360 370 380 390 400 410 pF1KE3 S-QTP--IILTPSPLLSSIHFWSTLSPVAPLSPARLQGANTLFQFPSVLNSHGPFTLSGL . ::: ..::::::::::::::.:::::::::::::: .::::::..::.: : . .: CCDS90 TAQTPNGLLLTPSPLLSSIHFWSSLSPVAPLSPARLQGPSTLFQFPTLLNGHMPVPIPSL 340 350 360 370 380 390 420 430 pF1KE3 DGPSTPGPFSPDLQKT : ..: .: . ::. CCDS90 DRAASPVLLSSNSQKS 400 >>CCDS14283.1 ELK1 gene_id:2002|Hs108|chrX (428 aa) initn: 725 init1: 338 opt: 701 Z-score: 430.8 bits: 88.8 E(32554): 1.1e-17 Smith-Waterman score: 786; 37.4% identity (57.7% similar) in 463 aa overlap (1-430:1-427) 10 20 30 40 50 pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDG-QFKLLQAEEVARLWGIRKNKPNMNYDKL :: ..::::::::::.. : :.: ::: :: .:::..:::::::::.:::: ::::::: CCDS14 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 SRALRYYYVKNIIKKVNGQKFVYKFVSYPEIL-----NMDPMTVGRIEGDCESLNFSEVS :::::::: ::::.::.:::::::::::::. . :. . . .. . . CCDS14 SRALRYYYDKNIIRKVSGQKFVYKFVSYPEVAGCSTEDCPPQPEVSVTSTMPNVAPAAIH 70 80 90 100 110 120 120 130 140 150 160 pF1KE3 SSSKDVENGGKDKPPQ-----PGA-KTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKT .. :. .: : ::. ::::.:..:::::.::..:: CCDS14 AAPGDTVSGKPGTPKGAGMAGPGGLARSSRNEYMRSGLYSTFTIQSL------------- 130 140 150 160 170 180 190 200 210 220 pF1KE3 ENPAEKLAEKKSPQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSS-EETIQALET . . : .: :.:. :: : .:: : . : ::: : ..: :. CCDS14 --------QPQPPPHPRPAVV----LPSAAP--AGAAAPPSGSRSTSPSPLEACLEAEEA 170 180 190 200 210 230 240 250 260 270 pF1KE3 -------LVSPKLPSLEAPTSASNVMTAFATTPPISSIPPLQE----PPRTPSPPLS-SH :. :. :.:.. . . : : .. : .: : : . .. CCDS14 GLPLQVILTPPEAPNLKSEELNVEPGLGRALPPEVKVEGPKEELEVAGERGFVPETTKAE 220 230 240 250 260 270 280 290 300 310 320 330 pF1KE3 PDIDTDIDSVASQPMELPENLSLEPKDQDSVLLEKDKVNNSSRSKKPKGLEL--APTLVI :.. . ..: : .:: . . . . .... ....::. ::: .:.:. CCDS14 PEVPPQ-EGV---PARLPAVVMDTAGQAGGHAASSPEISQPQKGRKPRDLELPLSPSLLG 280 290 300 310 320 340 350 360 370 380 pF1KE3 TSSDPSPLGILSPS---LPTASLTPAFF---SQTPIILTPSPLLSSIHFWSTLSPVAPLS . : : : : .:::... . ::..:::: : ::::::::::.:: : CCDS14 GPGPERTPGSGSGSGLQAPGPALTPSLLPTHTLTPVLLTPSSLPPSIHFWSTLSPIAPRS 330 340 350 360 370 380 390 400 410 420 430 pF1KE3 PARLQGANTLFQFPSVLNSHGPFTLSGLDGPSTPGPFSPDLQKT ::.:. ::::: ... . ..:: ::: .:: :: CCDS14 PAKLS-----FQFPSSGSAQVHIPSISVDGLSTPVVLSPGPQKP 390 400 410 420 >>CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX (95 aa) initn: 346 init1: 346 opt: 470 Z-score: 306.2 bits: 63.6 E(32554): 9.8e-11 Smith-Waterman score: 470; 77.8% identity (90.0% similar) in 90 aa overlap (1-89:1-90) 10 20 30 40 50 pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDG-QFKLLQAEEVARLWGIRKNKPNMNYDKL :: ..::::::::::.. : :.: ::: :: .:::..:::::::::.:::: ::::::: CCDS59 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 SRALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKD :::::::: ::::.::.::::::::::::: CCDS59 SRALRYYYDKNIIRKVSGQKFVYKFVSYPESHCAP 70 80 90 431 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 07:51:21 2016 done: Sun Nov 6 07:51:21 2016 Total Scan time: 2.810 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]