FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4120, 503 aa
  1>>>pF1KE4120 503 - 503 aa - 503 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.7606+/-0.00122; mu= 2.1771+/- 0.071
 mean_var=306.5992+/-68.738, 0's: 0 Z-trim(110.5): 794 B-trim: 541 in 1/49
 Lambda= 0.073247
 statistics sampled from 10683 (11666) to 10683 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.708), E-opt: 0.2 (0.358), width: 16
 Scan time: 3.380

The best scores are:                                  opt  bits E(32554)
CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1  ( 503)  3384 371.9 9.4e-103
CCDS1312.1 ZBTB37 gene_id:84614|Hs108|chr1   ( 361)  2244 251.2 1.4e-66
CCDS48023.1 ZBTB34 gene_id:403341|Hs108|chr9 ( 500)  1254 146.8 5.3e-35
CCDS6867.1 ZBTB43 gene_id:23099|Hs108|chr9   ( 467)   703  88.5 1.7e-17

>>CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1 (503 aa)
 initn: 3384 init1: 3384 opt: 3384 Z-score: 1958.2 bits: 371.9 E(32554): 9.4e-103
Smith-Waterman score: 3384; 100.0% identity (100.0% similar) in 503 aa overlap (1-503:1-503)

               10        20        30        40        50        60
pF1KE4 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRGQVSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRGQVSA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 VLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVADRGGRSDDEVRVL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVADRGGRSDDEVRVL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 GAVHIKTENLEEWLGPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQENYTLGSSGAKVAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 GAVHIKTENLEEWLGPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQENYTLGSSGAKVAR
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 PTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEESAMMGVSGYVEYLREQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEESAMMGVSGYVEYLREQ
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 EVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRMCGKKYTRKDQLEYHI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRMCGKKYTRKDQLEYHI
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE4 RKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHSISPETTVTSRGQAEEES
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHSISPETTVTSRGQAEEES
              430       440       450       460       470       480

              490       500
pF1KE4 PSQEETVAPGEAVQGSVSTTGPD
       :::::::::::::::::::::::
CCDS44 PSQEETVAPGEAVQGSVSTTGPD
              490       500

>>CCDS1312.1 ZBTB37 gene_id:84614|Hs108|chr1 (361 aa)
 initn: 2244 init1: 2244 opt: 2244 Z-score: 1308.9 bits: 251.2 E(32554): 1.4e-66
Smith-Waterman score: 2244; 100.0% identity (100.0% similar) in 342 aa overlap (1-342:1-342)

               10        20        30        40        50        60
pF1KE4 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRGQVSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRGQVSA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 VLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVADRGGRSDDEVRVL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVADRGGRSDDEVRVL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 GAVHIKTENLEEWLGPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQENYTLGSSGAKVAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GAVHIKTENLEEWLGPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQENYTLGSSGAKVAR
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 PTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEESAMMGVSGYVEYLREQ
       ::::::::::::::::::::::::::::::::::::::::::
CCDS13 PTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVWSCGFRTALVVGGIATVY
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 EVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRMCGKKYTRKDQLEYHI

CCDS13 E

>>CCDS48023.1 ZBTB34 gene_id:403341|Hs108|chr9 (500 aa)
 initn: 1225 init1: 648 opt: 1254 Z-score: 741.8 bits: 146.8 E(32554): 5.3e-35
Smith-Waterman score: 1292; 45.6% identity (69.7% similar) in 498 aa overlap (1-485:1-483)

       10 20 30 40 50 60
pF1KE4 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRDH
       :.... ::...:..:..:::.::.::.::.::::.:..::: :::::.::::::::::::
CCDS48 MDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPYFRDH
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE4 MSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCTQ
       .:. :: .::::::::.:::::::::::::. ::: :..:.:::::::::: .::::::
CCDS48 SALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVIDKCTQ
       70 80 90 100 110 120

       130 140 150 160 170
pF1KE4 ILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRG-QVS
       :::.:: ::.:..:. . : :. :::. . . . .: . .: .: : .
CCDS48 ILESIHSKISVGDVD-----SVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGRQPT
       130 140 150 160 170

       180 190 200 210 220 230
pF1KE4 AVLDIRELSPPEESTSPQIIEPS-SDVESREPILRINRAGQWYVETGVADRGGRSDDEVR
       : :.: . : .. .. : . :: : . . . .: : ..: : .
CCDS48 ASSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEY----EIQIE-GDHEQGDLLVRESQ
       180 190 200 210 220 230

       240 250 260 270 280 290
pF1KE4 VLGAVHIKTENLEEWLGPENQPSGEDGSSAEEVTA---MVIDTTGHGSVGQENYTLGSSG
       . :..: :. .. ... :.:: .: : . ..... ..::: :. :. ....
CCDS48 IT-EVKVKMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSYSQAA
       240 250 260 270 280

       300 310 320 330 340
pF1KE4 AKVARPTS-SE----VDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEES---AM
       .. ::. :: .. ::: :.. . :::.. . .. .. :. : ::
CCDS48 SQ---PTNVSEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSDSEAM
       290 300 310 320 330 340

       350 360 370 380 390 400
pF1KE4 MGVSGYVEYLREQEVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRMCG
       :. :: ::. . .:. :: :: ::::.::::::::::::::::::::::::..::
CCDS48 MNNPGYESSPRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVCKFCG
       350 360 370 380 390 400

       410 420 430 440 450 460
pF1KE4 KKYTRKDQLEYHIRKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHSISPE
       :::::::::::::: :: .:::.:..::: ::::. ::::.:::::: ... . :::
CCDS48 KKYTRKDQLEYHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRS-RIESPE
       410 420 430 440 450 460

       470 480 490 500
pF1KE4 TTVTSRGQAEEESPSQEETVAPGEAVQGSVSTTGPD
       : . : :.. : :
CCDS48 RTDVYVEQKLENDASASEMGLDSRMEIHTVSDAPD
       470 480 490 500

>>CCDS6867.1 ZBTB43 gene_id:23099|Hs108|chr9 (467 aa)
 initn: 821 init1: 451 opt: 703 Z-score: 427.5 bits: 88.5 E(32554): 1.7e-17
Smith-Waterman score: 703; 31.0% identity (60.6% similar) in 462 aa overlap (1-447:1-446)

       10 20 30 40 50
pF1KE4 MEKGGN-IQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPYFRD
       :: : : ...:.::::...:..::: :.::.:::. . :::. :::::.::::::::: :
CCDS68 MEPGTNSFRVEFPDFSSTILQKLNQQRQQGQLCDVSIVVQGHIFRAHKAVLAASSPYFCD
       10 20 30 40 50 60

       60 70 80 90 100 110
pF1KE4 HMSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIIDKCT
       .. :.. . . . :: :::..: ::::. . .:.::::::::::: :..::::
CCDS68 QVLLKNSRRIVLPDVMNPRVFENILLSSYTGRLVMPAPEIVSYLTAASFLQMWHVVDKCT
       70 80 90 100 110 120

       120 130 140 150 160 170
pF1KE4 QILEGIHFKINVAEVEAELSQTRTKHQERPPESHR-VTPNLNRSLSPRHNTPKGN--RRG
       ..::: : . . .:.. . :: :. .. ... . . . . ::.. : :
CCDS68 EVLEG-----NPTVLCQKLNHG-SDHQSPSSSSYNGLVESFELGSGGHTDFPKAQELRDG
       130 140 150 160 170 180

       180 190 200 210 220
pF1KE4 QVSAVLDIRELSPPEESTSPQIIEPSSDVESREPILRINRAGQWYVETGVAD-------R
       . ::: . : . . .:..: . : . :.: : :..: :
CCDS68 ENEEESTKDELSS--QLTEHEYLPSNSSTEHDR--LSTEMASQ-DGEEGASDSAEFHYTR
       180 190 200 210 220 230

       230 240 250 260 270 280
pF1KE4 GGRSDDEVRVLGA-VHIKTENLEEWL-GPENQPSGEDGSSAEEVTAMVIDTTGHGSVGQE
       : . . .:.: : ::. : . . . .. . .: .... . : . : .:
CCDS68 PMYSKPSIMAHKRWIHVKPERLEQACEGMDVHATYDEHQVTESINTVQTEHTVQPSGVEE
       230 240 250 260 270 280

       290 300 310 320 330 340
pF1KE4 NYTLGSSGAKVARPTSSEVDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEESAM
       .. .: . ... ... . .. . . . .. . .: : . .: : .
CCDS68 DFHIGEKKVEAEFDEQADESNYDEQVDFYGSSMEEFSGERSDGNLIGHRQ-----EAALA
       290 300 310 320 330 340

       350 360 370 380 390 400
pF1KE4 MGVSGYVEYLR--EQEVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVCRM
       : : .:.. ..:.:. : . .: :.:::..:.. :::: .:.:. :. : .
CCDS68 AGYSENIEMVTGIKEEASHLGFSATDKLYPCQCGKSFTHKSQRDRHMSMHLGLRPYGCGV
       350 360 370 380 390 400

       410 420 430 440 450 460
pF1KE4 CGKKYTRKDQLEYHIRKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHSIS
       ::::. : .: :.. ::: ::..:..:.: : .. ...:
CCDS68 CGKKFKMKHHLVGHMKIHTGIKPYECNICAKRFMWRDSFHRHVTSCTKSYEAAKAEQNTT
       410 420 430 440 450 460

       470 480 490 500
pF1KE4 PETTVTSRGQAEEESPSQEETVAPGEAVQGSVSTTGPD

CCDS68 EAN

503 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 15:17:38 2016 done: Mon Nov 7 15:17:39 2016
 Total Scan time: 3.380 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]