FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1710, 215 aa 1>>>pF1KE1710 215 - 215 aa - 215 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.9753+/-0.000871; mu= 4.2088+/- 0.053 mean_var=186.2358+/-36.021, 0's: 0 Z-trim(112.9): 28 B-trim: 37 in 2/53 Lambda= 0.093982 statistics sampled from 13530 (13554) to 13530 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.725), E-opt: 0.2 (0.416), width: 16 Scan time: 2.130 The best scores are: opt bits E(32554) CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6 ( 215) 1303 187.9 4.2e-48 CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6 ( 219) 877 130.1 1e-30 CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 ( 221) 827 123.4 1.1e-28 CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6 ( 226) 823 122.8 1.7e-28 CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6 ( 213) 799 119.5 1.5e-27 CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 ( 207) 558 86.9 1e-17 >>CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6 (215 aa) initn: 1303 init1: 1303 opt: 1303 Z-score: 977.2 bits: 187.9 E(32554): 4.2e-48 Smith-Waterman score: 1303; 100.0% identity (100.0% similar) in 215 aa overlap (1-215:1-215) 10 20 30 40 50 60 pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 PGASKVATKTKATGASKKLKKATGASKKSVKTPKKAKKPAATRKSSKNPKKPKTVKPKKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PGASKVATKTKATGASKKLKKATGASKKSVKTPKKAKKPAATRKSSKNPKKPKTVKPKKV 130 140 150 160 170 180 190 200 210 pF1KE1 AKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK ::::::::::::::::::::::::::::::::::: CCDS45 AKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK 190 200 210 >>CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6 (219 aa) initn: 732 init1: 540 opt: 877 Z-score: 664.9 bits: 130.1 E(32554): 1e-30 Smith-Waterman score: 877; 70.7% identity (86.0% similar) in 222 aa overlap (1-215:1-219) 10 20 30 40 50 60 pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV ::::.: :::: : :: . :::.: .:.:.:.: .:: :::::..:...::::.:: CCDS45 MSETAPAAPAAPAPAEKTPVKKKARK---SAGAAKRKASGPPVSELITKAVAASKERSGV 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK ::::::::::::::::::::::::::.:::::::::::::::::::::::::::.: :.: CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK 60 70 80 90 100 110 130 140 150 160 170 pF1KE1 PGASKV--ATKTKATGASKKLKKATGAS--KKSVK-TPKKAKKPAATR--KSSKNPKKPK : :.:. : : .::.:: ::::::. :::.: ::::::::::. :..:.::: : CCDS45 PKAKKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKKAKSPKKAK 120 130 140 150 160 170 180 190 200 210 pF1KE1 TVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK ..::::. ::::::::::::::: ...:::.::::::: ::: CCDS45 AAKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKKK 180 190 200 210 >>CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 (221 aa) initn: 501 init1: 501 opt: 827 Z-score: 628.2 bits: 123.4 E(32554): 1.1e-28 Smith-Waterman score: 827; 67.7% identity (83.0% similar) in 223 aa overlap (1-215:1-221) 10 20 30 40 50 60 pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV ::::.: ::. : :: . ::::: .:.:.:.: .:: :::::..:...::::.:: CCDS45 MSETAPLAPTIPAPAEKTPVKKKAKKA--GATAGKRKASGPPVSELITKAVAASKERSGV 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK ::::::::::::::::::::::::::.:::::::::::::::::::::::::::.: : : CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGK 60 70 80 90 100 110 130 140 150 160 170 pF1KE1 PGASKV--ATKTKATGASKKLKKATGAS--KKSVK-TPKKAKKPAA---TRKSSKNPKKP : :.:. : : .::.:: ::..::. :::.: ::::.::::. :.: .:. :: CCDS45 PKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKV 120 130 140 150 160 170 180 190 200 210 pF1KE1 KTVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK :: .:::.::::::::: :::::: . :::..: :::::::: CCDS45 KTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK 180 190 200 210 220 >>CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6 (226 aa) initn: 735 init1: 554 opt: 823 Z-score: 625.1 bits: 122.8 E(32554): 1.7e-28 Smith-Waterman score: 823; 66.8% identity (83.2% similar) in 220 aa overlap (1-214:1-220) 10 20 30 40 50 60 pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV ::::.: :. : :: : ::: : : .:.:.:.: .:: :::::..:...::::.:. CCDS46 MSETAPAETATPAPVEKSPAKKKATKKAAGAGAAKRKATGPPVSELITKAVAASKERNGL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK :::::::::::.::::::::::::::.:::::::::::::::::::::::::::.: :.: CCDS46 SLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 PGASKV--ATKTKATGAS-KKLKKATGASKKSVKTPKKAKKPAAT--RKSSKNPKKPKTV : :.:. : : .::. :: :::.::.: :::::::::::. .: .:.::: :.. CCDS46 PKAKKAGAAKAKKPAGATPKKAKKAAGAKKAVKKTPKKAKKPAAAGVKKVAKSPKKAKAA 130 140 150 160 170 180 180 190 200 210 pF1KE1 -KPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK ::::..::::: ::::::::: ...:::.:::: : :: CCDS46 AKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK 190 200 210 220 >>CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6 (213 aa) initn: 585 init1: 511 opt: 799 Z-score: 607.9 bits: 119.5 E(32554): 1.5e-27 Smith-Waterman score: 799; 67.7% identity (83.4% similar) in 223 aa overlap (1-215:1-213) 10 20 30 40 50 60 pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV ::::.: ::::. :: . ::: :: :... .: .:: :::::..:...::::.:: CCDS45 MSETAPAAPAAAPPAEKAPVKKKA---AKKAGGTPRKASGPPVSELITKAVAASKERSGV 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK ::::::::::::::::::::::::::.:::::::::::::::::::::::::::.: :.: CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK 60 70 80 90 100 110 130 140 150 160 170 pF1KE1 PGASKVA-TKTKA-TGASKKLKKATGAS--KKSVK-TPKKAKKPAA---TRKSSKNPKKP : ..:.. :: : .::.:: :::.:.. :::.: :::::::::: :.: .:.::: CCDS45 PKVKKAGGTKPKKPVGAAKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSPKKA 120 130 140 150 160 170 180 190 200 210 pF1KE1 KTVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK :..::::.::: :: :::::::: ::..:::::::::: CCDS45 KVAKPKKAAKSAAK--AVKPKAAK-----PKVVKPKKAAPKKK 180 190 200 210 >>CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 (207 aa) initn: 529 init1: 355 opt: 558 Z-score: 431.5 bits: 86.9 E(32554): 1e-17 Smith-Waterman score: 558; 53.1% identity (73.2% similar) in 213 aa overlap (1-209:1-207) 10 20 30 40 50 pF1KE1 MSETVPPAPAAS--AAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERG :::::: : :.. :: :: . :...::: .::.: : . :::.::..: : :.:: CCDS34 MSETVPAASASAGVAAMEKLPTKKRGRKPAGLISASRKVP-NLSVSKLITEALSVSQERV 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 GVSLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVE :.::.::::::::::::::::::::::..::::.:: ::::.::::::::::.::. CCDS34 GMSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 TKPGASK-VATKTKATGASKKLKKATGASKKSVKTPKKAKKPAATR-KSSKNPKKPKTVK :. :.: :..::: :. : : :..:: :.:::: :: :. .. .: : .: CCDS34 TRSKAKKSVSAKTKKLVLSRDSK-----SPKTAKTNKRAKKPRATTPKTVRSGRKAKGAK 120 130 140 150 160 170 180 190 200 210 pF1KE1 PKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK :. :::.::.: : : .. . .. . : :: CCDS34 GKQQQKSPVKARASKSKLTQHHEVNVRKATSKK 180 190 200 215 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 16:57:57 2016 done: Sun Nov 6 16:57:58 2016 Total Scan time: 2.130 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]