FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3758, 431 aa
1>>>pF1KE3758 431 - 431 aa - 431 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.4045+/-0.00098; mu= -10.9697+/- 0.059
mean_var=294.4516+/-60.239, 0's: 0 Z-trim(114.2): 80 B-trim: 0 in 0/52
Lambda= 0.074742
statistics sampled from 14673 (14752) to 14673 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.77), E-opt: 0.2 (0.453), width: 16
Scan time: 2.810
The best scores are:                             opt  bits E(32554)
CCDS1456.1 ELK4 gene_id:2005|Hs108|chr1  ( 431) 2835 318.9 6.1e-87
CCDS1457.1 ELK4 gene_id:2005|Hs108|chr1  ( 405) 2367 268.4 8.9e-72
CCDS9060.1 ELK3 gene_id:2004|Hs108|chr12 ( 407)  753  94.4 2.2e-19
CCDS14283.1 ELK1 gene_id:2002|Hs108|chrX ( 428)  701  88.8 1.1e-17
CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX (  95)  470  63.6 9.8e-11
>>CCDS1456.1 ELK4 gene_id:2005|Hs108|chr1 (431 aa)
initn: 2835 init1: 2835 opt: 2835 Z-score: 1674.4 bits: 318.9 E(32554): 6.1e-87
Smith-Waterman score: 2835; 100.0% identity (100.0% similar) in 431 aa overlap (1-431:1-431)
               10        20        30        40        50        60
pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE3 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE3 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE3 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE3 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPENLSLEP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPENLSLEP
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KE3 KDQDSVLLEKDKVNNSSRSKKPKGLELAPTLVITSSDPSPLGILSPSLPTASLTPAFFSQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KDQDSVLLEKDKVNNSSRSKKPKGLELAPTLVITSSDPSPLGILSPSLPTASLTPAFFSQ
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KE3 TPIILTPSPLLSSIHFWSTLSPVAPLSPARLQGANTLFQFPSVLNSHGPFTLSGLDGPST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 TPIILTPSPLLSSIHFWSTLSPVAPLSPARLQGANTLFQFPSVLNSHGPFTLSGLDGPST
              370       380       390       400       410       420
              430
pF1KE3 PGPFSPDLQKT
       :::::::::::
CCDS14 PGPFSPDLQKT
              430
>>CCDS1457.1 ELK4 gene_id:2005|Hs108|chr1 (405 aa)
initn: 2351 init1: 2351 opt: 2367 Z-score: 1402.1 bits: 268.4 E(32554): 8.9e-72
Smith-Waterman score: 2367; 97.6% identity (98.4% similar) in 375 aa overlap (1-374:1-375)
               10        20        30        40        50        60
pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE3 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE3 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE3 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE3 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPENLSLEP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPENLSLEP
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KE3 KDQDSVLLEKDKVNNSSRSKKPKGLELAPTLVITSSDPSPLGILSPSLPTASLTPAFFSQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 KDQDSVLLEKDKVNNSSRSKKPKGLELAPTLVITSSDPSPLGILSPSLPTASLTPAFFSQ
              310       320       330       340       350       360
370 380 390 400 410
pF1KE3 TPI-ILTPSPLLSSIHFWSTLSPVAPLSPARLQGANTLFQFPSVLNSHGPFTLSGLDGPS
. .. ::::: :
CCDS14 VACSLFMVSPLLSFICPFKQIQNLYTQVCFLLLRFVLERLCVTVM
370 380 390 400
>>CCDS9060.1 ELK3 gene_id:2004|Hs108|chr12 (407 aa)
initn: 1139 init1: 533 opt: 753 Z-score: 461.5 bits: 94.4 E(32554): 2.2e-19
Smith-Waterman score: 1199; 50.9% identity (71.6% similar) in 436 aa overlap (1-431:1-407)
10 20 30 40 50 60
pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS
:.::::::::::::: ...:.::::::::.::::.:::::.:::.:::: ::::::::
CCDS90 MESAITLWQFLLQLLLDQKHEHLICWTSNDGEFKLLKAEEVAKLWGLRKNKTNMNYDKLS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKDV
::::::: ::::::: ::::::::::.::::.::: .: : . ::: ... :. . .
CCDS90 RALRYYYDKNIIKKVIGQKFVYKFVSFPEILKMDPHAV---EISRESLLLQD-SDCKASP
70 80 90 100 110
130 140 150 160 170 180
pF1KE3 ENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKLAEKKS
:. : . ...:::.:::::::::::.:::.. :: ::::. :. : .
CCDS90 EGREAHKHGLAALRSTSRNEYIHSGLYSSFTINSLQNPP-DAFKAIKTEK-LEEPPEDSP
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE3 PQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPSLEAPT
: : . .::.:::. . : ..:: .:. :: : .. . : . :: :. :: :.
CCDS90 PVEEVRTVIRFVTNKTDKHVTRPV---VSL-PSTSEAAAASA-FLASSVSAKISSLMLPN
180 190 200 210 220
250 260 270 280 290
pF1KE3 SASNVMTAFATTPPISSIPPLQEPPRTPSPPLSS-HPDIDTDIDSVASQPMELPENLSLE
.:: .... :.:: . : .:. :: : : .. . :. .: : :::
CCDS90 AAS-----ISSASPFSS----RSPSLSPNSPLPSEHRSLFLEAACHDSDSLE-PLNLSSG
230 240 250 260 270
300 310 320 330 340 350
pF1KE3 PKDQDSVLLEKDKVNNSSRSKKPKGLEL-APTLVITSSDPSPLGILSPSLPTASLTPAFF
: .. : : .:::::::. :: ::....: . ... ::.::..:::::::
CCDS90 SKTKSPSLPPK--------AKKPKGLEISAPPLVLSGTDIGSIALNSPALPSGSLTPAFF
280 290 300 310 320 330
360 370 380 390 400 410
pF1KE3 S-QTP--IILTPSPLLSSIHFWSTLSPVAPLSPARLQGANTLFQFPSVLNSHGPFTLSGL
. ::: ..::::::::::::::.:::::::::::::: .::::::..::.: : . .:
CCDS90 TAQTPNGLLLTPSPLLSSIHFWSSLSPVAPLSPARLQGPSTLFQFPTLLNGHMPVPIPSL
340 350 360 370 380 390
420 430
pF1KE3 DGPSTPGPFSPDLQKT
: ..: .: . ::.
CCDS90 DRAASPVLLSSNSQKS
400
>>CCDS14283.1 ELK1 gene_id:2002|Hs108|chrX (428 aa)
initn: 725 init1: 338 opt: 701 Z-score: 430.8 bits: 88.8 E(32554): 1.1e-17
Smith-Waterman score: 786; 37.4% identity (57.7% similar) in 463 aa overlap (1-430:1-427)
10 20 30 40 50
pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDG-QFKLLQAEEVARLWGIRKNKPNMNYDKL
:: ..::::::::::.. : :.: ::: :: .:::..:::::::::.:::: :::::::
CCDS14 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE3 SRALRYYYVKNIIKKVNGQKFVYKFVSYPEIL-----NMDPMTVGRIEGDCESLNFSEVS
:::::::: ::::.::.:::::::::::::. . :. . . .. . .
CCDS14 SRALRYYYDKNIIRKVSGQKFVYKFVSYPEVAGCSTEDCPPQPEVSVTSTMPNVAPAAIH
70 80 90 100 110 120
120 130 140 150 160
pF1KE3 SSSKDVENGGKDKPPQ-----PGA-KTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKT
.. :. .: : ::. ::::.:..:::::.::..::
CCDS14 AAPGDTVSGKPGTPKGAGMAGPGGLARSSRNEYMRSGLYSTFTIQSL-------------
130 140 150 160
170 180 190 200 210 220
pF1KE3 ENPAEKLAEKKSPQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSS-EETIQALET
. . : .: :.:. :: : .:: : . : ::: : ..: :.
CCDS14 --------QPQPPPHPRPAVV----LPSAAP--AGAAAPPSGSRSTSPSPLEACLEAEEA
170 180 190 200 210
230 240 250 260 270
pF1KE3 -------LVSPKLPSLEAPTSASNVMTAFATTPPISSIPPLQE----PPRTPSPPLS-SH
:. :. :.:.. . . : : .. : .: : : . ..
CCDS14 GLPLQVILTPPEAPNLKSEELNVEPGLGRALPPEVKVEGPKEELEVAGERGFVPETTKAE
220 230 240 250 260 270
280 290 300 310 320 330
pF1KE3 PDIDTDIDSVASQPMELPENLSLEPKDQDSVLLEKDKVNNSSRSKKPKGLEL--APTLVI
:.. . ..: : .:: . . . . .... ....::. ::: .:.:.
CCDS14 PEVPPQ-EGV---PARLPAVVMDTAGQAGGHAASSPEISQPQKGRKPRDLELPLSPSLLG
280 290 300 310 320
340 350 360 370 380
pF1KE3 TSSDPSPLGILSPS---LPTASLTPAFF---SQTPIILTPSPLLSSIHFWSTLSPVAPLS
. : : : : .:::... . ::..:::: : ::::::::::.:: :
CCDS14 GPGPERTPGSGSGSGLQAPGPALTPSLLPTHTLTPVLLTPSSLPPSIHFWSTLSPIAPRS
330 340 350 360 370 380
390 400 410 420 430
pF1KE3 PARLQGANTLFQFPSVLNSHGPFTLSGLDGPSTPGPFSPDLQKT
::.:. ::::: ... . ..:: ::: .:: ::
CCDS14 PAKLS-----FQFPSSGSAQVHIPSISVDGLSTPVVLSPGPQKP
390 400 410 420
>>CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX (95 aa)
initn: 346 init1: 346 opt: 470 Z-score: 306.2 bits: 63.6 E(32554): 9.8e-11
Smith-Waterman score: 470; 77.8% identity (90.0% similar) in 90 aa overlap (1-89:1-90)
10 20 30 40 50
pF1KE3 MDSAITLWQFLLQLLQKPQNKHMICWTSNDG-QFKLLQAEEVARLWGIRKNKPNMNYDKL
:: ..::::::::::.. : :.: ::: :: .:::..:::::::::.:::: :::::::
CCDS59 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE3 SRALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSSSSKD
:::::::: ::::.::.:::::::::::::
CCDS59 SRALRYYYDKNIIRKVSGQKFVYKFVSYPESHCAP
70 80 90
431 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 07:51:21 2016 done: Sun Nov 6 07:51:21 2016
Total Scan time: 2.810 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]