FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1710, 215 aa
1>>>pF1KE1710 215 - 215 aa - 215 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.9753+/-0.000871; mu= 4.2088+/- 0.053
mean_var=186.2358+/-36.021, 0's: 0 Z-trim(112.9): 28 B-trim: 37 in 2/53
Lambda= 0.093982
statistics sampled from 13530 (13554) to 13530 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.725), E-opt: 0.2 (0.416), width: 16
Scan time: 2.130
The best scores are:                                  opt  bits E(32554)
CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6   ( 215) 1303 187.9 4.2e-48
CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6   ( 219)  877 130.1   1e-30
CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6   ( 221)  827 123.4 1.1e-28
CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6   ( 226)  823 122.8 1.7e-28
CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6   ( 213)  799 119.5 1.5e-27
CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6  ( 207)  558  86.9   1e-17
>>CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6 (215 aa)
initn: 1303 init1: 1303 opt: 1303 Z-score: 977.2 bits: 187.9 E(32554): 4.2e-48
Smith-Waterman score: 1303; 100.0% identity (100.0% similar) in 215 aa overlap (1-215:1-215)
               10        20        30        40        50        60
pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 PGASKVATKTKATGASKKLKKATGASKKSVKTPKKAKKPAATRKSSKNPKKPKTVKPKKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 PGASKVATKTKATGASKKLKKATGASKKSVKTPKKAKKPAATRKSSKNPKKPKTVKPKKV
              130       140       150       160       170       180
              190       200       210
pF1KE1 AKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
       :::::::::::::::::::::::::::::::::::
CCDS45 AKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
              190       200       210
>>CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6 (219 aa)
initn: 732 init1: 540 opt: 877 Z-score: 664.9 bits: 130.1 E(32554): 1e-30
Smith-Waterman score: 877; 70.7% identity (86.0% similar) in 222 aa overlap (1-215:1-219)
10 20 30 40 50 60
pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
::::.: :::: : :: . :::.: .:.:.:.: .:: :::::..:...::::.::
CCDS45 MSETAPAAPAAPAPAEKTPVKKKARK---SAGAAKRKASGPPVSELITKAVAASKERSGV
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
::::::::::::::::::::::::::.:::::::::::::::::::::::::::.: :.:
CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK
60 70 80 90 100 110
130 140 150 160 170
pF1KE1 PGASKV--ATKTKATGASKKLKKATGAS--KKSVK-TPKKAKKPAATR--KSSKNPKKPK
: :.:. : : .::.:: ::::::. :::.: ::::::::::. :..:.::: :
CCDS45 PKAKKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKKAKSPKKAK
120 130 140 150 160 170
180 190 200 210
pF1KE1 TVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
..::::. ::::::::::::::: ...:::.::::::: :::
CCDS45 AAKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKKK
180 190 200 210
>>CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 (221 aa)
initn: 501 init1: 501 opt: 827 Z-score: 628.2 bits: 123.4 E(32554): 1.1e-28
Smith-Waterman score: 827; 67.7% identity (83.0% similar) in 223 aa overlap (1-215:1-221)
10 20 30 40 50 60
pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
::::.: ::. : :: . ::::: .:.:.:.: .:: :::::..:...::::.::
CCDS45 MSETAPLAPTIPAPAEKTPVKKKAKKA--GATAGKRKASGPPVSELITKAVAASKERSGV
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
::::::::::::::::::::::::::.:::::::::::::::::::::::::::.: : :
CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGK
60 70 80 90 100 110
130 140 150 160 170
pF1KE1 PGASKV--ATKTKATGASKKLKKATGAS--KKSVK-TPKKAKKPAA---TRKSSKNPKKP
: :.:. : : .::.:: ::..::. :::.: ::::.::::. :.: .:. ::
CCDS45 PKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKV
120 130 140 150 160 170
180 190 200 210
pF1KE1 KTVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
:: .:::.::::::::: :::::: . :::..: ::::::::
CCDS45 KTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
180 190 200 210 220
>>CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6 (226 aa)
initn: 735 init1: 554 opt: 823 Z-score: 625.1 bits: 122.8 E(32554): 1.7e-28
Smith-Waterman score: 823; 66.8% identity (83.2% similar) in 220 aa overlap (1-214:1-220)
10 20 30 40 50 60
pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
::::.: :. : :: : ::: : : .:.:.:.: .:: :::::..:...::::.:.
CCDS46 MSETAPAETATPAPVEKSPAKKKATKKAAGAGAAKRKATGPPVSELITKAVAASKERNGL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
:::::::::::.::::::::::::::.:::::::::::::::::::::::::::.: :.:
CCDS46 SLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK
70 80 90 100 110 120
130 140 150 160 170
pF1KE1 PGASKV--ATKTKATGAS-KKLKKATGASKKSVKTPKKAKKPAAT--RKSSKNPKKPKTV
: :.:. : : .::. :: :::.::.: :::::::::::. .: .:.::: :..
CCDS46 PKAKKAGAAKAKKPAGATPKKAKKAAGAKKAVKKTPKKAKKPAAAGVKKVAKSPKKAKAA
130 140 150 160 170 180
180 190 200 210
pF1KE1 -KPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
::::..::::: ::::::::: ...:::.:::: : ::
CCDS46 AKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK
190 200 210 220
>>CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6 (213 aa)
initn: 585 init1: 511 opt: 799 Z-score: 607.9 bits: 119.5 E(32554): 1.5e-27
Smith-Waterman score: 799; 67.7% identity (83.4% similar) in 223 aa overlap (1-215:1-213)
10 20 30 40 50 60
pF1KE1 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
::::.: ::::. :: . ::: :: :... .: .:: :::::..:...::::.::
CCDS45 MSETAPAAPAAAPPAEKAPVKKKA---AKKAGGTPRKASGPPVSELITKAVAASKERSGV
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
::::::::::::::::::::::::::.:::::::::::::::::::::::::::.: :.:
CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK
60 70 80 90 100 110
130 140 150 160 170
pF1KE1 PGASKVA-TKTKA-TGASKKLKKATGAS--KKSVK-TPKKAKKPAA---TRKSSKNPKKP
: ..:.. :: : .::.:: :::.:.. :::.: :::::::::: :.: .:.:::
CCDS45 PKVKKAGGTKPKKPVGAAKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSPKKA
120 130 140 150 160 170
180 190 200 210
pF1KE1 KTVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
:..::::.::: :: :::::::: ::..::::::::::
CCDS45 KVAKPKKAAKSAAK--AVKPKAAK-----PKVVKPKKAAPKKK
180 190 200 210
>>CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 (207 aa)
initn: 529 init1: 355 opt: 558 Z-score: 431.5 bits: 86.9 E(32554): 1e-17
Smith-Waterman score: 558; 53.1% identity (73.2% similar) in 213 aa overlap (1-209:1-207)
10 20 30 40 50
pF1KE1 MSETVPPAPAAS--AAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERG
:::::: : :.. :: :: . :...::: .::.: : . :::.::..: : :.::
CCDS34 MSETVPAASASAGVAAMEKLPTKKRGRKPAGLISASRKVP-NLSVSKLITEALSVSQERV
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 GVSLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVE
:.::.::::::::::::::::::::::..::::.:: ::::.::::::::::.::.
CCDS34 GMSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKS
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 TKPGASK-VATKTKATGASKKLKKATGASKKSVKTPKKAKKPAATR-KSSKNPKKPKTVK
:. :.: :..::: :. : : :..:: :.:::: :: :. .. .: : .:
CCDS34 TRSKAKKSVSAKTKKLVLSRDSK-----SPKTAKTNKRAKKPRATTPKTVRSGRKAKGAK
120 130 140 150 160 170
180 190 200 210
pF1KE1 PKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
:. :::.::.: : : .. . .. . : ::
CCDS34 GKQQQKSPVKARASKSKLTQHHEVNVRKATSKK
180 190 200
215 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 16:57:57 2016 done: Sun Nov 6 16:57:58 2016
Total Scan time: 2.130 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]