FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE9605, 221 aa
1>>>pF1KE9605 221 - 221 aa - 221 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.6741+/-0.000978; mu= 0.9525+/- 0.059
mean_var=217.3454+/-43.989, 0's: 0 Z-trim(112.6): 31 B-trim: 150 in 2/53
Lambda= 0.086996
statistics sampled from 13327 (13349) to 13327 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.41), width: 16
Scan time: 2.310
The best scores are: opt bits E(32554)
CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 ( 221) 1362 182.6 1.7e-46
CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6 ( 219) 1142 155.0 3.4e-38
CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6 ( 213) 1053 143.8 7.6e-35
CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6 ( 226) 1020 139.7 1.4e-33
CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6 ( 215) 827 115.5 2.7e-26
CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 ( 207) 508 75.4 2.9e-14
>>CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 (221 aa)
initn: 1362 init1: 1362 opt: 1362 Z-score: 948.4 bits: 182.6 E(32554): 1.7e-46
Smith-Waterman score: 1362; 100.0% identity (100.0% similar) in 221 aa overlap (1-221:1-221)
10 20 30 40 50 60
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSELITKAVAASKERSGVSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSELITKAVAASKERSGVSL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE9 AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKPK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKPK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE9 AKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVKT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 AKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVKT
130 140 150 160 170 180
190 200 210 220
pF1KE9 PQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
:::::::::::::::::::::::::::::::::::::::::
CCDS45 PQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
190 200 210 220
>>CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6 (219 aa)
initn: 814 init1: 814 opt: 1142 Z-score: 799.2 bits: 155.0 E(32554): 3.4e-38
Smith-Waterman score: 1142; 86.0% identity (94.1% similar) in 221 aa overlap (1-221:1-219)
10 20 30 40 50 60
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSELITKAVAASKERSGVSL
:::::: ::. :::::::::::::.:. : :.::::::::::::::::::::::::::::
CCDS45 MSETAPAAPAAPAPAEKTPVKKKARKS-AGAAKRKASGPPVSELITKAVAASKERSGVSL
10 20 30 40 50
70 80 90 100 110 120
pF1KE9 AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKPK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::.:::
CCDS45 AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAKPK
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE9 AKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVKT
:::::::: .:::::::::::..:::::::: ::::::.::::.:::.:: ::: ::.:.
CCDS45 AKKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKK-AKSPKKAKA
120 130 140 150 160 170
190 200 210 220
pF1KE9 PQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
.:::: :::::::: ::::::::..:::..: :::: :::
CCDS45 AKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKKK
180 190 200 210
>>CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6 (213 aa)
initn: 1065 init1: 882 opt: 1053 Z-score: 739.0 bits: 143.8 E(32554): 7.6e-35
Smith-Waterman score: 1053; 81.1% identity (90.5% similar) in 222 aa overlap (1-221:1-213)
10 20 30 40 50
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKA-KKAGATAGKRKASGPPVSELITKAVAASKERSGVS
:::::: ::. ::::.:::::: ::::.: ::::::::::::::::::::::::::
CCDS45 MSETAPAAPAAAPPAEKAPVKKKAAKKAGGT--PRKASGPPVSELITKAVAASKERSGVS
10 20 30 40 50
60 70 80 90 100 110
pF1KE9 LAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKP
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::.::
CCDS45 LAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAKP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE9 KAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVK
:.::::..::.::.::::::::.::.:::::: ::::::.::::.:. ::::::: ::.:
CCDS45 KVKKAGGTKPKKPVGAAKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSPKKAK
120 130 140 150 160 170
180 190 200 210 220
pF1KE9 TPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
. .::::::: ::: .:::..::::.: ::::::::
CCDS45 VAKPKKAAKSAAKA-------VKPKAAKPKVVKPKKAAPKKK
180 190 200 210
>>CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6 (226 aa)
initn: 682 init1: 682 opt: 1020 Z-score: 716.3 bits: 139.7 E(32554): 1.4e-33
Smith-Waterman score: 1020; 78.3% identity (89.1% similar) in 230 aa overlap (1-221:1-226)
10 20 30 40 50
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKAKK--AGATAGKRKASGPPVSELITKAVAASKERSGV
:::::: . :::.::.:.:::: : ::: :.::::.::::::::::::::::::.:.
CCDS46 MSETAPAETATPAPVEKSPAKKKATKKAAGAGAAKRKATGPPVSELITKAVAASKERNGL
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE9 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGK
:::::::::::.::::::::::::::::::::::::::::::::::::::::::::::.:
CCDS46 SLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE9 PKAKKAGAAKPRKPAGAA-KKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKK
:::::::::: .:::::. :: ::.::: ::..::::::.:::: :::.:::::: ::
CCDS46 PKAKKAGAAKAKKPAGATPKKAKKAAGA---KKAVKKTPKKAKKPA-AAGVKKVAKSPKK
130 140 150 160 170
180 190 200 210 220
pF1KE9 VKTP-QPKKAAKSPAKAKA--PK---PKAAKPKSGKPKVTKAKKAAPKKK
.:. .::::.::::: :: :: :::::::..:::..:::::: :::
CCDS46 AKAAAKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK
180 190 200 210 220
>>CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6 (215 aa)
initn: 501 init1: 501 opt: 827 Z-score: 585.7 bits: 115.5 E(32554): 2.7e-26
Smith-Waterman score: 827; 67.7% identity (83.0% similar) in 223 aa overlap (1-221:1-215)
10 20 30 40 50
pF1KE9 MSETAPLAPTIPAPAEKTPVKKKAKKAG--ATAGKRKASGPPVSELITKAVAASKERSGV
::::.: ::. : :: . ::::: . :.:.:.: .:: :::::..:...::::.::
CCDS45 MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE9 SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGK
::::::::::::::::::::::::::.:::::::::::::::::::::::::::.: : :
CCDS45 SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE9 PKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKV
: :.:. : : .::.:: ::..::. :::.: ::::.::: :.:.: .:. ::
CCDS45 PGASKV--ATKTKATGASKKLKKATGAS--KKSVK-TPKKAKKP---AATRKSSKNPKKP
130 140 150 160 170
180 190 200 210 220
pF1KE9 KTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
:: .:::.::::::::: :::::: . :::..: ::::::::
CCDS45 KTVKPKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
180 190 200 210
>>CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 (207 aa)
initn: 518 init1: 405 opt: 508 Z-score: 369.5 bits: 75.4 E(32554): 2.9e-14
Smith-Waterman score: 535; 49.1% identity (73.4% similar) in 218 aa overlap (1-215:1-207)
10 20 30 40 50
pF1KE9 MSETAPLAPTIP--APAEKTPVKKKAKK-AGATAGKRKASGPPVSELITKAVAASKERSG
::::.: : . : :: :.::...: :: ...::. . ::.:::.:...:.:: :
CCDS34 MSETVPAASASAGVAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE9 VSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEG
.::.::::::::::::::::::::::.:::::.:: ::::.::::::::::.::.
CCDS34 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE9 KPKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKK
. ::::. .:: .: . .. .: .:: . :: :..::: : : :...:..:
CCDS34 RSKAKKSVSAKTKKL--VLSRDSK-----SPKTA--KTNKRAKKP--RATTPKTVRSGRK
130 140 150 160
180 190 200 210 220
pF1KE9 VKTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
.: . :. :::.::.: : : .. . . . . .::
CCDS34 AKGAKGKQQQKSPVKARASKSKLTQHHEVNVRKATSKK
170 180 190 200
221 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 06:27:39 2016 done: Sun Nov 6 06:27:39 2016
Total Scan time: 2.310 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]