FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE9603, 207 aa
  1>>>pF1KE9603 207 - 207 aa - 207 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.8743+/-0.000989; mu= 7.5142+/- 0.060
 mean_var=129.6508+/-24.878, 0's: 0 Z-trim(108.2): 22 B-trim: 0 in 0/52
 Lambda= 0.112638
 statistics sampled from 10055 (10072) to 10055 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.656), E-opt: 0.2 (0.309), width: 16
 Scan time: 2.210

The best scores are:                                      opt  bits E(32554)
CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6      ( 207) 1233 211.1    4e-55
CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6       ( 219)  567 102.9  1.6e-22
CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6       ( 215)  558 101.4  4.3e-22
CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6       ( 226)  529  96.7  1.2e-20
CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6       ( 213)  523  95.7  2.2e-20
CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6       ( 221)  512  94.0  7.7e-20

>>CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 (207 aa)
 initn: 1233 init1: 1233 opt: 1233 Z-score: 1103.2 bits: 211.1 E(32554): 4e-55
Smith-Waterman score: 1233; 99.0% identity (100.0% similar) in 207 aa overlap (1-207:1-207)

               10        20        30        40        50        60
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
       :::::::::::::.::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MSETVPAASASAGVAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 RSKAKKSVSAKTKKLVLSRDSKSPKTAKTNKRAKKPRATTPKTVRSGRKAKGAKGKQKQK
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::.::
CCDS34 RSKAKKSVSAKTKKLVLSRDSKSPKTAKTNKRAKKPRATTPKTVRSGRKAKGAKGKQQQK
              130       140       150       160       170       180

              190       200
pF1KE9 SPVKARASKSKLTQHHEVNVRKATSKK
       :::::::::::::::::::::::::::
CCDS34 SPVKARASKSKLTQHHEVNVRKATSKK
              190       200

>>CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6 (219 aa)
 initn: 670 init1: 396 opt: 567 Z-score: 518.0 bits: 102.9 E(32554): 1.6e-22
Smith-Waterman score: 594; 53.4% identity (74.9% similar) in 219 aa overlap (1-207:1-213)

       10 20 30 40 50 60
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
       ::::.::: :. : :: :.::..:: :: .:.::. . ::.:::.:...:.:: :
CCDS45 MSETAPAAPAAP--APAEKTPVKKKARKSAG--AAKRKASGPPVSELITKAVAASKERSG
       10 20 30 40 50

       70 80 90 100 110 120
pF1KE9 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
       .::.::::::::::::::::::::::.:::::.:: ::::.::::::::::.::. .
CCDS45 VSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEA
       60 70 80 90 100 110

       130 140 150 160
pF1KE9 RSKAKKSVSAKTKKLVLSRDSKSPKTA-----------KTNKRAKKPRATT-PKTVRSGR
       . ::::. .::.:: . . .:.:: : :: :.:::: :.. : ..: .
CCDS45 KPKAKKAGAAKAKKPAGA--AKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKKAKSPK
       120 130 140 150 160 170

       170 180 190 200
pF1KE9 KAKGAKGKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
       :::.:: :. :::.::.: : : .. . .. . : ::
CCDS45 KAKAAKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKKK
       180 190 200 210

>>CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6 (215 aa)
 initn: 489 init1: 353 opt: 558 Z-score: 510.2 bits: 101.4 E(32554): 4.3e-22
Smith-Waterman score: 558; 53.1% identity (73.7% similar) in 213 aa overlap (1-207:1-209)

       10 20 30 40 50
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVP-NLSVSKLITEALSVSQERV
       :::::: : :.. :: :: . :...::: .::.: : . :::.::..: : :.::
CCDS45 MSETVPPAPAAS--AAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERG
       10 20 30 40 50

       60 70 80 90 100 110
pF1KE9 GMSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKS
       :.::.::::::::::::::::::::::..::::.:: ::::.::::::::::.::.
CCDS45 GVSLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVE
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KE9 TRSKAKKSVSAKTKKLVLSRDSK-----SPKTAKTNKRAKKPRATTPKTVRSGRKAKGAK
       :. :.: :..::: :. : : :..:: :.:::: :.: :. .. .: : .:
CCDS45 TKPGASK-VATKTKATGASKKLKKATGASKKSVKTPKKAKKP-AATRKSSKNPKKPKTVK
       120 130 140 150 160 170

       180 190 200
pF1KE9 GKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
       :. :::.::.: : : .. . .. . : ::
CCDS45 PKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
       180 190 200 210

>>CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6 (226 aa)
 initn: 462 init1: 462 opt: 529 Z-score: 484.4 bits: 96.7 E(32554): 1.2e-20
Smith-Waterman score: 567; 51.9% identity (75.0% similar) in 216 aa overlap (1-206:1-214)

       10 20 30 40 50
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRG-RKPAGLISASRKVPNLSVSKLITEALSVSQERV
       ::::.:: .:. : .:: :.::.. .: :: .:.::. . ::.:::.:...:.::
CCDS46 MSETAPAETATP--APVEKSPAKKKATKKAAGAGAAKRKATGPPVSELITKAVAASKERN
       10 20 30 40 50

       60 70 80 90 100 110
pF1KE9 GMSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKS
       :.::.::::::::.:::::::::::::.:::::.:: ::::.::::::::::.::.
CCDS46 GLSLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGE
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KE9 TRSKAKKSVSAKTKKLVLSRDSKSPKTA-------KTNKRAKKPRATTPKTV-RSGRKAK
       .. ::::. .::.:: . . .:. :.: :: :.:::: :. : : .: .:::
CCDS46 AKPKAKKAGAAKAKKPAGATPKKAKKAAGAKKAVKKTPKKAKKPAAAGVKKVAKSPKKAK
       120 130 140 150 160 170

       180 190 200
pF1KE9 GA-KGKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
       .: : :. :::.: .: : : .. . .. . : :
CCDS46 AAAKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK
       180 190 200 210 220

>>CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6 (213 aa)
 initn: 531 init1: 379 opt: 523 Z-score: 479.5 bits: 95.7 E(32554): 2.2e-20
Smith-Waterman score: 552; 52.7% identity (73.2% similar) in 220 aa overlap (1-207:1-212)

       10 20 30 40 50 60
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
       ::::.::: :.: : :: :.::.. : :: .. ::. . ::.:::.:...:.:: :
CCDS45 MSETAPAAPAAAPPA--EKAPVKKKAAKKAG--GTPRKASGPPVSELITKAVAASKERSG
       10 20 30 40 50

       70 80 90 100 110 120
pF1KE9 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
       .::.::::::::::::::::::::::.:::::.:: ::::.::::::::::.::. .
CCDS45 VSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEA
       60 70 80 90 100 110

       130 140 150 160
pF1KE9 RSKAKKSVSAKTKKLVLSRDSKSPKTA-----------KTNKRAKKPRATT--PKTVRSG
       . :.::. ..: :: : . .:.:: : :: :.:::: :.: :...:
CCDS45 KPKVKKAGGTKPKKPVGA--AKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSP
       120 130 140 150 160 170

       170 180 190 200
pF1KE9 RKAKGAKGKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
       .::: :: :. :: :.: : : .. . :. .::. ::
CCDS45 KKAKVAKPKKAAKSA--AKAVKPKAAKPKVVKPKKAAPKKK
       180 190 200 210

>>CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 (221 aa)
 initn: 522 init1: 405 opt: 512 Z-score: 469.6 bits: 94.0 E(32554): 7.7e-20
Smith-Waterman score: 535; 49.1% identity (72.9% similar) in 218 aa overlap (1-207:1-215)

       10 20 30 40 50 60
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
       ::::.: : . : :: :.::...: :: ...::. . ::.:::.:...:.:: :
CCDS45 MSETAPLAPTIP--APAEKTPVKKKAKK-AGATAGKRKASGPPVSELITKAVAASKERSG
       10 20 30 40 50

       70 80 90 100 110 120
pF1KE9 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
       .::.::::::::::::::::::::::.:::::.:: ::::.::::::::::.::.
CCDS45 VSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEG
       60 70 80 90 100 110

       130 140 150 160
pF1KE9 RSKAKKSVSAKTKKLV-LSRDSK------SPKTA--KTNKRAKKPR--ATTPKTVRSGRK
       . ::::. .:: .: . .. : .:: . :: :..::: : : :...:..:
CCDS45 KPKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKK
       120 130 140 150 160 170

       170 180 190 200
pF1KE9 AKGAKGKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
       .: . :. :::.::.: : : .. . . . . .::
CCDS45 VKTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
       180 190 200 210 220

207 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 06:29:20 2016 done: Sun Nov 6 06:29:20 2016
 Total Scan time: 2.210 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]