FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE9605, 221 aa
  1>>>pF1KE9605 221 - 221 aa - 221 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.6741+/-0.000978; mu= 0.9525+/- 0.059
 mean_var=217.3454+/-43.989, 0's: 0 Z-trim(112.6): 31 B-trim: 150 in 2/53
 Lambda= 0.086996
 statistics sampled from 13327 (13349) to 13327 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.41), width: 16
 Scan time: 2.310

The best scores are:                                       opt bits E(32554)
CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6        ( 221) 1362 182.6 1.7e-46
CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6        ( 219) 1142 155.0 3.4e-38
CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6        ( 213) 1053 143.8 7.6e-35
CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6        ( 226) 1020 139.7 1.4e-33
CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6        ( 215)  827 115.5 2.7e-26
CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6       ( 207)  508  75.4 2.9e-14

>>CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 (221 aa)
 initn: 1362 init1: 1362 opt: 1362 Z-score: 948.4 bits: 182.6 E(32554): 1.7e-46
Smith-Waterman score: 1362; 100.0% identity (100.0% similar) in 221 aa overlap (1-221:1-221)

               10        20        30        40        50        60
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSELITKAVAASKERSGVSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSELITKAVAASKERSGVSL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKPK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 AKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVKT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 AKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVKT
              130       140       150       160       170       180

              190       200       210       220
pF1KE9 PQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
       :::::::::::::::::::::::::::::::::::::::::
CCDS45 PQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
              190       200       210       220

>>CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6 (219 aa)
 initn: 814 init1: 814 opt: 1142 Z-score: 799.2 bits: 155.0 E(32554): 3.4e-38
Smith-Waterman score: 1142; 86.0% identity (94.1% similar) in 221 aa overlap (1-221:1-219)

               10        20        30        40        50        60
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSELITKAVAASKERSGVSL
       :::::: ::. :::::::::::::.:. : :.::::::::::::::::::::::::::::
CCDS45 MSETAPAAPAAPAPAEKTPVKKKARKS-AGAAKRKASGPPVSELITKAVAASKERSGVSL
               10        20         30        40        50

               70        80        90       100       110       120
pF1KE9 AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::.:::
CCDS45 AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAKPK
      60        70        80        90       100       110

              130       140       150       160       170       180
pF1KE9 AKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVKT
       :::::::: .:::::::::::..:::::::: ::::::.::::.:::.:: ::: ::.:.
CCDS45 AKKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKK-AKSPKKAKA
     120       130       140       150       160        170

              190       200       210       220
pF1KE9 PQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
        .:::: :::::::: ::::::::..:::..: :::: :::
CCDS45 AKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKKK
      180       190       200       210

>>CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6 (213 aa)
 initn: 1065 init1: 882 opt: 1053 Z-score: 739.0 bits: 143.8 E(32554): 7.6e-35
Smith-Waterman score: 1053; 81.1% identity (90.5% similar) in 222 aa overlap (1-221:1-213)

               10        20         30        40        50
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKA-KKAGATAGKRKASGPPVSELITKAVAASKERSGVS
       :::::: ::.   ::::.:::::: ::::.:   ::::::::::::::::::::::::::
CCDS45 MSETAPAAPAAAPPAEKAPVKKKAAKKAGGT--PRKASGPPVSELITKAVAASKERSGVS
               10        20        30          40        50

      60        70        80        90       100       110
pF1KE9 LAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::.::
CCDS45 LAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAKP
       60        70        80        90       100       110

     120       130       140       150       160       170
pF1KE9 KAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVK
       :.::::..::.::.::::::::.::.:::::: ::::::.::::.:. ::::::: ::.:
CCDS45 KVKKAGGTKPKKPVGAAKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSPKKAK
      120       130       140       150       160       170

     180       190       200       210       220
pF1KE9 TPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
       . .::::::: :::       .:::..::::.: ::::::::
CCDS45 VAKPKKAAKSAAKA-------VKPKAAKPKVVKPKKAAPKKK
      180       190              200       210

>>CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6 (226 aa)
 initn: 682 init1: 682 opt: 1020 Z-score: 716.3 bits: 139.7 E(32554): 1.4e-33
Smith-Waterman score: 1020; 78.3% identity (89.1% similar) in 230 aa overlap (1-221:1-226)

               10        20          30        40        50
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKAKK--AGATAGKRKASGPPVSELITKAVAASKERSGV
       ::::::   . :::.::.:.:::: :  ::: :.::::.::::::::::::::::::.:.
CCDS46 MSETAPAETATPAPVEKSPAKKKATKKAAGAGAAKRKATGPPVSELITKAVAASKERNGL
               10        20        30        40        50        60

       60        70        80        90       100       110
pF1KE9 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGK
       :::::::::::.::::::::::::::::::::::::::::::::::::::::::::::.:
CCDS46 SLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK
               70        80        90       100       110       120

      120       130        140       150       160       170
pF1KE9 PKAKKAGAAKPRKPAGAA-KKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKK
       :::::::::: .:::::. :: ::.:::   ::..::::::.:::: :::.:::::: ::
CCDS46 PKAKKAGAAKAKKPAGATPKKAKKAAGA---KKAVKKTPKKAKKPA-AAGVKKVAKSPKK
              130       140          150       160        170

       180        190            200       210       220
pF1KE9 VKTP-QPKKAAKSPAKAKA--PK---PKAAKPKSGKPKVTKAKKAAPKKK
       .:.  .::::.::::: ::  ::   :::::::..:::..:::::: :::
CCDS46 AKAAAKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK
        180       190       200       210       220

>>CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6 (215 aa)
 initn: 501 init1: 501 opt: 827 Z-score: 585.7 bits: 115.5 E(32554): 2.7e-26
Smith-Waterman score: 827; 67.7% identity (83.0% similar) in 223 aa overlap (1-221:1-215)

               10        20          30        40        50
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKAKKAG--ATAGKRKASGPPVSELITKAVAASKERSGV
       ::::.: ::.  :  ::  . ::::: .  :.:.:.: .:: :::::..:...::::.::
CCDS45 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
               10        20        30        40        50        60

       60        70        80        90       100       110
pF1KE9 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGK
       ::::::::::::::::::::::::::.:::::::::::::::::::::::::::.: : :
CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
               70        80        90       100       110       120

      120       130       140       150       160       170
pF1KE9 PKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKV
       : :.:.  :   : .::.:: ::..::.  :::.: ::::.:::   :.:.: .:. ::
CCDS45 PGASKV--ATKTKATGASKKLKKATGAS--KKSVK-TPKKAKKP---AATRKSSKNPKKP
                130       140         150           160       170

      180       190       200       210       220
pF1KE9 KTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
       :: .:::.::::::::: :::::: .  :::..: ::::::::
CCDS45 KTVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
            180       190       200       210

>>CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 (207 aa)
 initn: 518 init1: 405 opt: 508 Z-score: 369.5 bits: 75.4 E(32554): 2.9e-14
Smith-Waterman score: 535; 49.1% identity (73.4% similar) in 218 aa overlap (1-215:1-207)

               10          20         30        40        50
pF1KE9 MSETAPLAPTIP--APAEKTPVKKKAKK-AGATAGKRKASGPPVSELITKAVAASKERSG
       ::::.: : .    :  :: :.::...: ::  ...::. .  ::.:::.:...:.:: :
CCDS34 MSETVPAASASAGVAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
               10        20        30        40        50        60

        60        70        80        90       100       110
pF1KE9 VSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEG
       .::.::::::::::::::::::::::.:::::.:: ::::.::::::::::.::.
CCDS34 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
               70        80        90       100       110       120

       120       130       140       150       160       170
pF1KE9 KPKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKK
       . ::::. .:: .:   . .. .:     .:: .  :: :..:::   : : :...:..:
CCDS34 RSKAKKSVSAKTKKL--VLSRDSK-----SPKTA--KTNKRAKKP--RATTPKTVRSGRK
              130         140              150         160

        180       190       200       210       220
pF1KE9 KVKTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
        .:  . :.  :::.::.: : : .. .  . . . .::
CCDS34  AKGAKGKQQQKSPVKARASKSKLTQHHEVNVRKATSKK
      170       180       190       200

221 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 06:27:39 2016 done: Sun Nov 6 06:27:39 2016
 Total Scan time: 2.310 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
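Each hit in the report above carries a `>>` header, a score line, and a Smith-Waterman summary. The following is a minimal Python sketch, not part of the FASTA package, for pulling those per-hit statistics out of report text like this one. The names `HIT_RE` and `parse_hits` are hypothetical, and the pattern assumes each hit description is a single whitespace-free token (as in `gene_id:3007|Hs108|chr6`). Fields are separated with `\s+`, so the sketch should tolerate both the usual multi-line layout and whitespace-collapsed copies. The `identical` value is an estimate recovered by rounding the printed percentage against the overlap length.

```python
import re

# Hypothetical helper, not part of FASTA: one match per ">>" hit block.
# Assumes the description after the gene name is a single token.
HIT_RE = re.compile(
    r">>(?P<acc>\S+)\s+(?P<gene>\S+)\s+\S+\s+\((?P<length>\d+) aa\)\s+"
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<z>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\(\d+\):\s*(?P<evalue>\S+)\s+"
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*(?P<ident>[\d.]+)% identity\s+"
    r"\((?P<sim>[\d.]+)% similar\)\s+in\s+(?P<overlap>\d+) aa overlap"
)


def parse_hits(text):
    """Return one dict per '>>' hit block found in FASTA-style report text."""
    hits = []
    for m in HIT_RE.finditer(text):
        ident_pct = float(m["ident"])
        overlap = int(m["overlap"])
        hits.append({
            "accession": m["acc"],
            "gene": m["gene"],
            "length": int(m["length"]),
            "opt": int(m["opt"]),
            "bits": float(m["bits"]),
            "evalue": float(m["evalue"]),
            "sw_score": int(m["sw"]),
            "identity_pct": ident_pct,
            "similar_pct": float(m["sim"]),
            "overlap": overlap,
            # Percentages are printed to one decimal place, so rounding
            # recovers an estimated count of identical positions.
            "identical": round(ident_pct * overlap / 100),
        })
    return hits


# Two hit headers copied from the report above.
SAMPLE = (
    ">>CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 (221 aa) "
    "initn: 1362 init1: 1362 opt: 1362 Z-score: 948.4 bits: 182.6 "
    "E(32554): 1.7e-46 Smith-Waterman score: 1362; 100.0% identity "
    "(100.0% similar) in 221 aa overlap (1-221:1-221) "
    ">>CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 (207 aa) "
    "initn: 518 init1: 405 opt: 508 Z-score: 369.5 bits: 75.4 "
    "E(32554): 2.9e-14 Smith-Waterman score: 535; 49.1% identity "
    "(73.4% similar) in 218 aa overlap (1-215:1-207)"
)

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit["gene"], hit["opt"], hit["evalue"], hit["identical"])
```

A regular expression was chosen here only because the summary lines have a fixed field order; for reports with multi-word descriptions the description pattern would need loosening.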