FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2471, 597 aa 1>>>pF1KE2471 597 - 597 aa - 597 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.0401+/-0.00101; mu= 7.7742+/- 0.061 mean_var=195.9202+/-39.037, 0's: 0 Z-trim(111.3): 59 B-trim: 264 in 1/51 Lambda= 0.091629 statistics sampled from 12254 (12309) to 12254 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.72), E-opt: 0.2 (0.378), width: 16 Scan time: 3.430 The best scores are: opt bits E(32554) CCDS34127.2 BRD9 gene_id:65980|Hs108|chr5 ( 597) 3944 534.2 1.8e-151 CCDS34128.2 BRD9 gene_id:65980|Hs108|chr5 ( 544) 2921 399.0 8.4e-111 CCDS10742.1 BRD7 gene_id:29117|Hs108|chr16 ( 651) 875 128.6 2.5e-29 CCDS54007.1 BRD7 gene_id:29117|Hs108|chr16 ( 652) 875 128.6 2.5e-29 >>CCDS34127.2 BRD9 gene_id:65980|Hs108|chr5 (597 aa) initn: 3944 init1: 3944 opt: 3944 Z-score: 2833.0 bits: 534.2 E(32554): 1.8e-151 Smith-Waterman score: 3944; 100.0% identity (100.0% similar) in 597 aa overlap (1-597:1-597) 10 20 30 40 50 60 pF1KE2 MGKKHKKHKAEWRSSYEDYADKPLEKPLKLVLKVGGSEVTELSGSGHDSSYYDDRSDHER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MGKKHKKHKAEWRSSYEDYADKPLEKPLKLVLKVGGSEVTELSGSGHDSSYYDDRSDHER 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 ERHKEKKKKKKKKSEKEKHLDDEERRKRKEEKKRKREREHCDTEGEADDFDPGKKVEVEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 ERHKEKKKKKKKKSEKEKHLDDEERRKRKEEKKRKREREHCDTEGEADDFDPGKKVEVEP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 PPDRPVRACRTQPAENESTPIQQLLEHFLRQLQRKDPHGFFAFPVTDAIAPGYSMIIKHP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 PPDRPVRACRTQPAENESTPIQQLLEHFLRQLQRKDPHGFFAFPVTDAIAPGYSMIIKHP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 MDFGTMKDKIVANEYKSVTEFKADFKLMCDNAMTYNRPDTVYYKLAKKILHAGFKMMSKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MDFGTMKDKIVANEYKSVTEFKADFKLMCDNAMTYNRPDTVYYKLAKKILHAGFKMMSKQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 AALLGNEDTAVEEPVPEVVPVQVETAKKSKKPSREVISCMFEPEGNACSLTDSTAEEHVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 AALLGNEDTAVEEPVPEVVPVQVETAKKSKKPSREVISCMFEPEGNACSLTDSTAEEHVL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 ALVEHAADEARDRINRFLPGGKMGYLKRNGDGSLLYSVVNTAEPDADEEETHPVDLSSLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 ALVEHAADEARDRINRFLPGGKMGYLKRNGDGSLLYSVVNTAEPDADEEETHPVDLSSLS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 SKLLPGFTTLGFKDERRNKVTFLSSATTALSMQNNSVFGDLKSDEMELLYSAYGDETGVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 SKLLPGFTTLGFKDERRNKVTFLSSATTALSMQNNSVFGDLKSDEMELLYSAYGDETGVQ 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 CALSLQEFVKDAGSYSKKVVDDLLDQITGGDHSRTLFQLKQRRNVPMKPPDEAKVGDTLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 CALSLQEFVKDAGSYSKKVVDDLLDQITGGDHSRTLFQLKQRRNVPMKPPDEAKVGDTLG 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE2 DSSSSVLEFMSMKSYPDVSVDISMLSSLGKVKKELDPDDSHLNLDETTKLLQDLHEAQAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 DSSSSVLEFMSMKSYPDVSVDISMLSSLGKVKKELDPDDSHLNLDETTKLLQDLHEAQAE 490 500 510 520 530 540 550 560 570 580 590 pF1KE2 RGGSRPSSNLSSLSNASERDQHHLGSPSRLSVGEQPDVTHDPYEFLQSPEPAASAKT ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 RGGSRPSSNLSSLSNASERDQHHLGSPSRLSVGEQPDVTHDPYEFLQSPEPAASAKT 550 560 570 580 590 >>CCDS34128.2 BRD9 gene_id:65980|Hs108|chr5 (544 aa) initn: 2923 init1: 2888 opt: 2921 Z-score: 2102.7 bits: 399.0 E(32554): 8.4e-111 Smith-Waterman score: 2921; 85.1% identity (90.6% similar) in 544 aa overlap (61-597:9-544) 40 50 60 70 80 90 pF1KE2 VLKVGGSEVTELSGSGHDSSYYDDRSDHERERHKEKKKKKKKKSEKEKHLDDEERRKRKE :: .:.:.....: ... . ::. CCDS34 MMTGQTMSERGTKKRKRRRRRSPRRRSI--WTMRKEGS 10 20 30 100 110 120 130 140 pF1KE2 EKKRKREREHCDTEGEADDFDP---GKKVE-VEPPPDRPVRACRTQP---AENESTPIQQ :.::. :. .: . . . :.. . .: . :: ..:: :. :. . CCDS34 ERKRRSGSERGSTVTRRERLTTLILGRRWRWSRPQIGQSERAGHSQPKMRAHLFSNSWNT 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE2 LLEHFLRQLQRKDPHGFFAFPVTDAIAPGYSMIIKHPMDFGTMKDKIVANEYKSVTEFKA : .:::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 SSASF------RDPHGFFAFPVTDAIAPGYSMIIKHPMDFGTMKDKIVANEYKSVTEFKA 100 110 120 130 140 150 210 220 230 240 250 260 pF1KE2 DFKLMCDNAMTYNRPDTVYYKLAKKILHAGFKMMSKQAALLGNEDTAVEEPVPEVVPVQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 DFKLMCDNAMTYNRPDTVYYKLAKKILHAGFKMMSKQAALLGNEDTAVEEPVPEVVPVQV 160 170 180 190 200 210 270 280 290 300 310 320 pF1KE2 ETAKKSKKPSREVISCMFEPEGNACSLTDSTAEEHVLALVEHAADEARDRINRFLPGGKM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 ETAKKSKKPSREVISCMFEPEGNACSLTDSTAEEHVLALVEHAADEARDRINRFLPGGKM 220 230 240 250 260 270 330 340 350 360 370 380 pF1KE2 GYLKRNGDGSLLYSVVNTAEPDADEEETHPVDLSSLSSKLLPGFTTLGFKDERRNKVTFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 GYLKRNGDGSLLYSVVNTAEPDADEEETHPVDLSSLSSKLLPGFTTLGFKDERRNKVTFL 280 290 300 310 320 330 390 400 410 420 430 440 pF1KE2 SSATTALSMQNNSVFGDLKSDEMELLYSAYGDETGVQCALSLQEFVKDAGSYSKKVVDDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 SSATTALSMQNNSVFGDLKSDEMELLYSAYGDETGVQCALSLQEFVKDAGSYSKKVVDDL 340 350 360 370 380 390 450 460 470 480 490 500 pF1KE2 LDQITGGDHSRTLFQLKQRRNVPMKPPDEAKVGDTLGDSSSSVLEFMSMKSYPDVSVDIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 LDQITGGDHSRTLFQLKQRRNVPMKPPDEAKVGDTLGDSSSSVLEFMSMKSYPDVSVDIS 400 410 420 430 440 450 510 520 530 540 550 560 pF1KE2 MLSSLGKVKKELDPDDSHLNLDETTKLLQDLHEAQAERGGSRPSSNLSSLSNASERDQHH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MLSSLGKVKKELDPDDSHLNLDETTKLLQDLHEAQAERGGSRPSSNLSSLSNASERDQHH 460 470 480 490 500 510 570 580 590 pF1KE2 LGSPSRLSVGEQPDVTHDPYEFLQSPEPAASAKT :::::::::::::::::::::::::::::::::: CCDS34 LGSPSRLSVGEQPDVTHDPYEFLQSPEPAASAKT 520 530 540 >>CCDS10742.1 BRD7 gene_id:29117|Hs108|chr16 (651 aa) initn: 1033 init1: 715 opt: 875 Z-score: 639.9 bits: 128.6 E(32554): 2.5e-29 Smith-Waterman score: 1174; 36.3% identity (68.0% similar) in 597 aa overlap (1-562:1-584) 10 20 30 40 50 pF1KE2 MGKKHKKHKAEWRSSYEDYADKPLEKPLKLVLKVGGSEVTELS--GSGHDSSYYDDRSDH :::::::::.. . ::.: .::::::::::::.:::::: .:::::: ..:..:: CCDS10 MGKKHKKHKSD-KHLYEEY----VEKPLKLVLKVGGNEVTELSTGSSGHDSSLFEDKNDH 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 ERERHKEKKKKKKKKSEKEKHLDDEERRKRK-EEKKRKREREHCDTEGEADDFDPGKKVE . .::..:.::.::.::. ... :..:. .: :.::.:.. ..:.: : .. :. CCDS10 D--KHKDRKRKKRKKGEKQIPGEEKGRKRRRVKEDKKKRDRDRVENEAEKD-LQCHAPVR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 VEPPPDRPVRACRTQPAENESTPIQQLLEHFLRQLQRKDPHGFFAFPVTDAIAPGYSMII .. ::..:. . .. : :.::.:. :....:::::::: .::.::::: ::::::::: CCDS10 LDLPPEKPLTSSLAKQEEVEQTPLQEALNQLMRQLQRKDPSAFFSFPVTDFIAPGYSMII 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE2 KHPMDFGTMKDKIVANEYKSVTEFKADFKLMCDNAMTYNRPDTVYYKLAKKILHAGFKMM ::::::.:::.:: :.:.:. :.: .::::: ::: ::.:.:.::: :::.::.:.:.. CCDS10 KHPMDFSTMKEKIKNNDYQSIEELKDNFKLMCTNAMIYNKPETIYYKAAKKLLHSGMKIL 180 190 200 210 220 230 240 250 260 270 280 pF1KE2 SKQ--AALLGNEDTAVEEPVPEVVPVQVETAK---------KSKKPSREVISCMFEPEGN :.. .: . : .. . ..:.. . .. : .. . :. .. CCDS10 SQERIQSLKQSIDFMADLQKTRKQKDGTDTSQSGEDGGCWQREREDSGDAEAHAFKSPSK 240 250 260 270 280 290 290 300 310 320 330 pF1KE2 ACSLTD---------STAEEHVLALVEHAADEARDRINRFLPGGKMGYLKRNGDGSLLYS . : :. :. ... . :. ...: : ... . .:. ::. . CCDS10 ENKKKDKDMLEDKFKSNNLEREQEQLDRIVKESGGKLTRRLVNSQCEFERRKPDGTTTLG 300 310 320 330 340 350 340 350 360 370 380 390 pF1KE2 VVNTAEPDADEEETHPVDLSSLSSKLLPGFTTL-GFKDERRNKVT---FLSSAT-TALSM ... ..: . : :: :. ...: : .:: :::...::::: .:. . .. . CCDS10 LLHPVDPIVGEPGYCPVRLGMTTGRLQSGVNTLQGFKEDKRNKVTPVLYLNYGPYSSYAP 360 370 380 390 400 410 400 410 420 430 440 450 pF1KE2 QNNSVFGDLKSDEMELLYSAYGDETGVQCALSLQEFVKDAGSYSKKVVDDLLDQITGGDH . .:.:.....:. .:.::.::... . .:..::. .: ..:.::: .: : : CCDS10 HYDSTFANISKDDSDLIYSTYGEDSDLPSDFSIHEFLATCQDYPYVMADSLLDVLTKGGH 420 430 440 450 460 470 460 470 480 490 500 pF1KE2 SRTLFQLKQRRNVPMKPPDEA--KVGDTLGDSSSSVLEFMSM--KSYPDVSVDISMLSSL :::: :. .. . : ::. .. :: . . .: . .: : . .. .... CCDS10 SRTL----QEMEMSL-PEDEGHTRTLDTAKEMEITEVEPPGRLDSSTQDRLIALKAVTNF 480 490 500 510 520 510 520 530 540 550 560 pF1KE2 GKVKKELDPDDSHL---NLDETTKLLQDLHEAQAERGGSRPSSNLSSLSNASERDQHHLG : . .: ..... .:::::.::..:.::: :: ..:: :. : . : :..: CCDS10 GVPVEVFDSEEAEIFQKKLDETTRLLRELQEAQNERLSTRPPPNMICLLGPSYREMHLAE 530 540 550 560 570 580 570 580 590 pF1KE2 SPSRLSVGEQPDVTHDPYEFLQSPEPAASAKT CCDS10 QVTNNLKELAQQVTPGDIVSTYGVRKAMGISIPSPVMENNFVDLTEDTEEPKKTDVAECG 590 600 610 620 630 640 >>CCDS54007.1 BRD7 gene_id:29117|Hs108|chr16 (652 aa) initn: 1033 init1: 715 opt: 875 Z-score: 639.9 bits: 128.6 E(32554): 2.5e-29 Smith-Waterman score: 1171; 36.2% identity (68.2% similar) in 600 aa overlap (1-562:1-585) 10 20 30 40 50 pF1KE2 MGKKHKKHKAEWRSSYEDYADKPLEKPLKLVLKVGGSEVTELS--GSGHDSSYYDDRSDH :::::::::.. . ::.: .::::::::::::.:::::: .:::::: ..:..:: CCDS54 MGKKHKKHKSD-KHLYEEY----VEKPLKLVLKVGGNEVTELSTGSSGHDSSLFEDKNDH 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 ERERHKEKKKKKKKKSEKEKHLDDEERRKRK-EEKKRKREREHCDTEGEADDFDPGKKVE . .::..:.::.::.::. ... :..:. .: :.::.:.. ..:.: : .. :. CCDS54 D--KHKDRKRKKRKKGEKQIPGEEKGRKRRRVKEDKKKRDRDRVENEAEKD-LQCHAPVR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 VEPPPDRPVRACRTQPAENESTPIQQLLEHFLRQLQRKDPHGFFAFPVTDAIAPGYSMII .. ::..:. . .. : :.::.:. :....:::::::: .::.::::: ::::::::: CCDS54 LDLPPEKPLTSSLAKQEEVEQTPLQEALNQLMRQLQRKDPSAFFSFPVTDFIAPGYSMII 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE2 KHPMDFGTMKDKIVANEYKSVTEFKADFKLMCDNAMTYNRPDTVYYKLAKKILHAGFKMM ::::::.:::.:: :.:.:. :.: .::::: ::: ::.:.:.::: :::.::.:.:.. CCDS54 KHPMDFSTMKEKIKNNDYQSIEELKDNFKLMCTNAMIYNKPETIYYKAAKKLLHSGMKIL 180 190 200 210 220 230 240 250 260 270 280 pF1KE2 SKQ--AALLGNEDTAVEEPVPEVVPVQVETAK---------KSKKPSREVISCMFEPEGN :.. .: . : .. . ..:.. . .. : .. . :. .. CCDS54 SQERIQSLKQSIDFMADLQKTRKQKDGTDTSQSGEDGGCWQREREDSGDAEAHAFKSPSK 240 250 260 270 280 290 290 300 310 320 330 pF1KE2 ACSLTD---------STAEEHVLALVEHAADEARDRINRFLPGGKMGYLKRNGDGSLLYS . : :. :. ... . :. ...: : ... . .:. ::. . CCDS54 ENKKKDKDMLEDKFKSNNLEREQEQLDRIVKESGGKLTRRLVNSQCEFERRKPDGTTTLG 300 310 320 330 340 350 340 350 360 370 380 390 pF1KE2 VVNTAEPDADEEETHPVDLSSLSSKLLPGFTTL-GFKDERRNKVT---FLSSAT-TALSM ... ..: . : :: :. ...: : .:: :::...::::: .:. . .. . CCDS54 LLHPVDPIVGEPGYCPVRLGMTTGRLQSGVNTLQGFKEDKRNKVTPVLYLNYGPYSSYAP 360 370 380 390 400 410 400 410 420 430 440 450 pF1KE2 QNNSVFGDLKSDEMELLYSAYGDETGVQCALSLQEFVKDAGSYSKKVVDDLLDQITGGDH . .:.:.....:. .:.::.::... . .:..::. .: ..:.::: .: : : CCDS54 HYDSTFANISKDDSDLIYSTYGEDSDLPSDFSIHEFLATCQDYPYVMADSLLDVLTKGGH 420 430 440 450 460 470 460 470 480 490 500 pF1KE2 SRTLFQLKQRRNVPMKPPDEAKVGDTLGDSSSSVLEFMSMK-------SYPDVSVDISML :::: :. .. . : ::... :: :... . .. .. : : . .. . CCDS54 SRTL----QEMEMSL-PEDEGHT-RTL-DTAKEMEQITEVEPPGRLDSSTQDRLIALKAV 480 490 500 510 520 510 520 530 540 550 560 pF1KE2 SSLGKVKKELDPDDSHL---NLDETTKLLQDLHEAQAERGGSRPSSNLSSLSNASERDQH ...: . .: ..... .:::::.::..:.::: :: ..:: :. : . : :..: CCDS54 TNFGVPVEVFDSEEAEIFQKKLDETTRLLRELQEAQNERLSTRPPPNMICLLGPSYREMH 530 540 550 560 570 580 570 580 590 pF1KE2 HLGSPSRLSVGEQPDVTHDPYEFLQSPEPAASAKT CCDS54 LAEQVTNNLKELAQQVTPGDIVSTYGVRKAMGISIPSPVMENNFVDLTEDTEEPKKTDVA 590 600 610 620 630 640 597 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 20:30:24 2016 done: Mon Nov 7 20:30:25 2016 Total Scan time: 3.430 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]