FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE9603, 207 aa
1>>>pF1KE9603 207 - 207 aa - 207 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8743+/-0.000989; mu= 7.5142+/- 0.060
mean_var=129.6508+/-24.878, 0's: 0 Z-trim(108.2): 22 B-trim: 0 in 0/52
Lambda= 0.112638
statistics sampled from 10055 (10072) to 10055 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.656), E-opt: 0.2 (0.309), width: 16
Scan time: 2.210
The best scores are:                                  opt  bits E(32554)
CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6  ( 207) 1233 211.1 4e-55
CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6   ( 219)  567 102.9 1.6e-22
CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6   ( 215)  558 101.4 4.3e-22
CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6   ( 226)  529  96.7 1.2e-20
CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6   ( 213)  523  95.7 2.2e-20
CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6   ( 221)  512  94.0 7.7e-20
>>CCDS34349.1 HIST1H1T gene_id:3010|Hs108|chr6 (207 aa)
initn: 1233 init1: 1233 opt: 1233 Z-score: 1103.2 bits: 211.1 E(32554): 4e-55
Smith-Waterman score: 1233; 99.0% identity (100.0% similar) in 207 aa overlap (1-207:1-207)
10 20 30 40 50 60
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
:::::::::::::.::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MSETVPAASASAGVAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE9 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE9 RSKAKKSVSAKTKKLVLSRDSKSPKTAKTNKRAKKPRATTPKTVRSGRKAKGAKGKQKQK
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::.::
CCDS34 RSKAKKSVSAKTKKLVLSRDSKSPKTAKTNKRAKKPRATTPKTVRSGRKAKGAKGKQQQK
130 140 150 160 170 180
190 200
pF1KE9 SPVKARASKSKLTQHHEVNVRKATSKK
:::::::::::::::::::::::::::
CCDS34 SPVKARASKSKLTQHHEVNVRKATSKK
190 200
>>CCDS4586.1 HIST1H1E gene_id:3008|Hs108|chr6 (219 aa)
initn: 670 init1: 396 opt: 567 Z-score: 518.0 bits: 102.9 E(32554): 1.6e-22
Smith-Waterman score: 594; 53.4% identity (74.9% similar) in 219 aa overlap (1-207:1-213)
10 20 30 40 50 60
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
::::.::: :. : :: :.::..:: :: .:.::. . ::.:::.:...:.:: :
CCDS45 MSETAPAAPAAP--APAEKTPVKKKARKSAG--AAKRKASGPPVSELITKAVAASKERSG
10 20 30 40 50
70 80 90 100 110 120
pF1KE9 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
.::.::::::::::::::::::::::.:::::.:: ::::.::::::::::.::. .
CCDS45 VSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEA
60 70 80 90 100 110
130 140 150 160
pF1KE9 RSKAKKSVSAKTKKLVLSRDSKSPKTA-----------KTNKRAKKPRATT-PKTVRSGR
. ::::. .::.:: . . .:.:: : :: :.:::: :.. : ..: .
CCDS45 KPKAKKAGAAKAKKPAGA--AKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKKAKSPK
120 130 140 150 160 170
170 180 190 200
pF1KE9 KAKGAKGKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
:::.:: :. :::.::.: : : .. . .. . : ::
CCDS45 KAKAAKPKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKKK
180 190 200 210
>>CCDS4569.1 HIST1H1A gene_id:3024|Hs108|chr6 (215 aa)
initn: 489 init1: 353 opt: 558 Z-score: 510.2 bits: 101.4 E(32554): 4.3e-22
Smith-Waterman score: 558; 53.1% identity (73.7% similar) in 213 aa overlap (1-207:1-209)
10 20 30 40 50
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVP-NLSVSKLITEALSVSQERV
:::::: : :.. :: :: . :...::: .::.: : . :::.::..: : :.::
CCDS45 MSETVPPAPAAS--AAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERG
10 20 30 40 50
60 70 80 90 100 110
pF1KE9 GMSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKS
:.::.::::::::::::::::::::::..::::.:: ::::.::::::::::.::.
CCDS45 GVSLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVE
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE9 TRSKAKKSVSAKTKKLVLSRDSK-----SPKTAKTNKRAKKPRATTPKTVRSGRKAKGAK
:. :.: :..::: :. : : :..:: :.:::: :.: :. .. .: : .:
CCDS45 TKPGASK-VATKTKATGASKKLKKATGASKKSVKTPKKAKKP-AATRKSSKNPKKPKTVK
120 130 140 150 160 170
180 190 200
pF1KE9 GKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
:. :::.::.: : : .. . .. . : ::
CCDS45 PKKVAKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
180 190 200 210
>>CCDS4635.1 HIST1H1B gene_id:3009|Hs108|chr6 (226 aa)
initn: 462 init1: 462 opt: 529 Z-score: 484.4 bits: 96.7 E(32554): 1.2e-20
Smith-Waterman score: 567; 51.9% identity (75.0% similar) in 216 aa overlap (1-206:1-214)
10 20 30 40 50
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRG-RKPAGLISASRKVPNLSVSKLITEALSVSQERV
::::.:: .:. : .:: :.::.. .: :: .:.::. . ::.:::.:...:.::
CCDS46 MSETAPAETATP--APVEKSPAKKKATKKAAGAGAAKRKATGPPVSELITKAVAASKERN
10 20 30 40 50
60 70 80 90 100 110
pF1KE9 GMSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKS
:.::.::::::::.:::::::::::::.:::::.:: ::::.::::::::::.::.
CCDS46 GLSLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGE
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE9 TRSKAKKSVSAKTKKLVLSRDSKSPKTA-------KTNKRAKKPRATTPKTV-RSGRKAK
.. ::::. .::.:: . . .:. :.: :: :.:::: :. : : .: .:::
CCDS46 AKPKAKKAGAAKAKKPAGATPKKAKKAAGAKKAVKKTPKKAKKPAAAGVKKVAKSPKKAK
120 130 140 150 160 170
180 190 200
pF1KE9 GA-KGKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
.: : :. :::.: .: : : .. . .. . : :
CCDS46 AAAKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK
180 190 200 210 220
>>CCDS4577.1 HIST1H1C gene_id:3006|Hs108|chr6 (213 aa)
initn: 531 init1: 379 opt: 523 Z-score: 479.5 bits: 95.7 E(32554): 2.2e-20
Smith-Waterman score: 552; 52.7% identity (73.2% similar) in 220 aa overlap (1-207:1-212)
10 20 30 40 50 60
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
::::.::: :.: : :: :.::.. : :: .. ::. . ::.:::.:...:.:: :
CCDS45 MSETAPAAPAAAPPA--EKAPVKKKAAKKAG--GTPRKASGPPVSELITKAVAASKERSG
10 20 30 40 50
70 80 90 100 110 120
pF1KE9 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
.::.::::::::::::::::::::::.:::::.:: ::::.::::::::::.::. .
CCDS45 VSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEA
60 70 80 90 100 110
130 140 150 160
pF1KE9 RSKAKKSVSAKTKKLVLSRDSKSPKTA-----------KTNKRAKKPRATT--PKTVRSG
. :.::. ..: :: : . .:.:: : :: :.:::: :.: :...:
CCDS45 KPKVKKAGGTKPKKPVGA--AKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSP
120 130 140 150 160 170
170 180 190 200
pF1KE9 RKAKGAKGKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
.::: :: :. :: :.: : : .. . :. .::. ::
CCDS45 KKAKVAKPKKAAKSA--AKAVKPKAAKPKVVKPKKAAPKKK
180 190 200 210
>>CCDS4597.1 HIST1H1D gene_id:3007|Hs108|chr6 (221 aa)
initn: 522 init1: 405 opt: 512 Z-score: 469.6 bits: 94.0 E(32554): 7.7e-20
Smith-Waterman score: 535; 49.1% identity (72.9% similar) in 218 aa overlap (1-207:1-215)
10 20 30 40 50 60
pF1KE9 MSETVPAASASAGLAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
::::.: : . : :: :.::...: :: ...::. . ::.:::.:...:.:: :
CCDS45 MSETAPLAPTIP--APAEKTPVKKKAKK-AGATAGKRKASGPPVSELITKAVAASKERSG
10 20 30 40 50
70 80 90 100 110 120
pF1KE9 MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
.::.::::::::::::::::::::::.:::::.:: ::::.::::::::::.::.
CCDS45 VSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEG
60 70 80 90 100 110
130 140 150 160
pF1KE9 RSKAKKSVSAKTKKLV-LSRDSK------SPKTA--KTNKRAKKPR--ATTPKTVRSGRK
. ::::. .:: .: . .. : .:: . :: :..::: : : :...:..:
CCDS45 KPKAKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKK
120 130 140 150 160 170
170 180 190 200
pF1KE9 AKGAKGKQKQKSPVKARASKSKLTQHHEVNVRKATSKK
.: . :. :::.::.: : : .. . . . . .::
CCDS45 VKTPQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
180 190 200 210 220
207 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 06:29:20 2016 done: Sun Nov 6 06:29:20 2016
Total Scan time: 2.210 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]