FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6563, 175 aa
 1>>>pF1KE6563 175 - 175 aa - 175 aa
Library: human.CCDS.faa
 18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.7484+/-0.000736; mu= 10.7371+/- 0.045
 mean_var=72.7514+/-14.597, 0's: 0 Z-trim(109.8): 28 B-trim: 303 in 2/50
 Lambda= 0.150367
 statistics sampled from 11138 (11165) to 11138 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.722), E-opt: 0.2 (0.343), width: 16
 Scan time: 1.850

The best scores are:                                    opt  bits E(32554)
CCDS35362.1 H2BFWT gene_id:158983|Hs108|chrX    ( 175) 1128 253.2   6e-68
CCDS55468.1 H2BFM gene_id:286436|Hs108|chrX     ( 154)  636 146.5 7.2e-36
CCDS4625.1 HIST1H2BL gene_id:8340|Hs108|chr6    ( 126)  304  74.4 2.9e-14
CCDS4592.1 HIST1H2BF gene_id:8343|Hs108|chr6    ( 126)  302  74.0 3.9e-14
CCDS4588.1 HIST1H2BE gene_id:8344|Hs108|chr6    ( 126)  302  74.0 3.9e-14
CCDS4594.1 HIST1H2BG gene_id:8339|Hs108|chr6    ( 126)  302  74.0 3.9e-14
CCDS4584.1 HIST1H2BC gene_id:8347|Hs108|chr6    ( 126)  302  74.0 3.9e-14
CCDS4603.1 HIST1H2BI gene_id:8346|Hs108|chr6    ( 126)  302  74.0 3.9e-14
CCDS4621.1 HIST1H2BK gene_id:85236|Hs108|chr6   ( 126)  302  74.0 3.9e-14
CCDS936.1 HIST2H2BE gene_id:8349|Hs108|chr1     ( 126)  301  73.7 4.5e-14
CCDS4618.1 HIST1H2BJ gene_id:8970|Hs108|chr6    ( 126)  301  73.7 4.5e-14
CCDS4601.1 HIST1H2BH gene_id:8345|Hs108|chr6    ( 126)  300  73.5 5.3e-14
CCDS4640.1 HIST1H2BO gene_id:8348|Hs108|chr6    ( 126)  299  73.3 6.1e-14
CCDS30846.1 HIST2H2BF gene_id:440689|Hs108|chr1 ( 126)  299  73.3 6.1e-14
CCDS4563.1 HIST1H2BA gene_id:255626|Hs108|chr6  ( 127)  299  73.3 6.2e-14
CCDS53359.1 HIST2H2BF gene_id:440689|Hs108|chr1 ( 134)  299  73.3 6.5e-14
CCDS4633.1 HIST1H2BN gene_id:8341|Hs108|chr6    ( 126)  298  73.1 7.1e-14
CCDS4575.1 HIST1H2BB gene_id:3018|Hs108|chr6    ( 126)  297  72.9 8.3e-14
CCDS4587.1 HIST1H2BD gene_id:3017|Hs108|chr6    ( 126)  297  72.9 8.3e-14
CCDS1574.1 HIST3H2BB gene_id:128312|Hs108|chr1  ( 126)  296  72.7 9.6e-14
CCDS4629.1 HIST1H2BM gene_id:8342|Hs108|chr6    ( 126)  296  72.7 9.6e-14

>>CCDS35362.1 H2BFWT gene_id:158983|Hs108|chrX (175 aa)
 initn: 1128 init1: 1128 opt: 1128 Z-score: 1333.4 bits: 253.2 E(32554): 6e-68
Smith-Waterman score: 1128; 100.0% identity (100.0% similar) in 175 aa overlap (1-175:1-175)

10 20 30 40 50 60
pF1KE6 MLRTEVPRLPRSTTAIVWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQSK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MLRTEVPRLPRSTTAIVWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQSK
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE6 QRKRGRHGPRRCHSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 QRKRGRHGPRRCHSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATE
70 80 90 100 110 120

130 140 150 160 170
pF1KE6 AGHLARSTKRQTITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 AGHLARSTKRQTITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
130 140 150 160 170

>>CCDS55468.1 H2BFM gene_id:286436|Hs108|chrX (154 aa)
 initn: 614 init1: 468 opt: 636 Z-score: 757.5 bits: 146.5 E(32554): 7.2e-36
Smith-Waterman score: 636; 70.7% identity (83.4% similar) in 157 aa overlap (23-175:1-154)

10 20 30 40 50 60
pF1KE6 MLRTEVPRLPRSTTAIVWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQSK
::.::::: :::::::: ::::::::: .::
CCDS55 MAAASAMAEASSETTSEEGQSIQEPKEANSTKAQK---
10 20 30

70 80 90 100 110
pF1KE6 QRKRGRHGPRRCHSNCRGDSFAT----YFRRVLKQVHQGLSLSREAVSVMDSLVHDILDR
:..:: .: :: :.: :::::. :: :::::::::::::.::::::::..::::::
CCDS55 QKRRGCRGSRRRHANRRGDSFGDSFTPYFPRVLKQVHQGLSLSQEAVSVMDSMIHDILDR
40 50 60 70 80 90

120 130 140 150 160 170
pF1KE6 IATEAGHLARSTKRQTITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::::::.::. ::: :::. . .:::::::::.::::::..::.:.::::: :: ::::
CCDS55 IATEAGQLAHYTKRVTITSRDIQMAVRLLLPGKMGKLAEAQGTNAALRTSLCAIWQQRK
100 110 120 130 140 150

>>CCDS4625.1 HIST1H2BL gene_id:8340|Hs108|chr6 (126 aa)
 initn: 296 init1: 260 opt: 304 Z-score: 369.6 bits: 74.4 E(32554): 2.9e-14
Smith-Waterman score: 304; 45.1% identity (77.0% similar) in 122 aa overlap (47-164:2-121)

20 30 40 50 60 70
pF1KE6 VWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQ-SKQ---RKRGRHGPRRC
:. :.:. . :. ::. . . . : .:
CCDS46 MPELAKSAPAPKKGSKKAVTKAQKKDGKKRK
10 20 30

80 90 100 110 120 130
pF1KE6 HSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATEAGHLARSTKRQT
.: : .:...: .:::::: ..: .:...:.:.:.::..:::.::..::. .::.:
CCDS46 RS--RKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIASEASRLAHYNKRST
40 50 60 70 80

140 150 160 170
pF1KE6 ITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::. : . ::::::::...: : ::::::: .
CCDS46 ITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK
90 100 110 120

>>CCDS4592.1 HIST1H2BF gene_id:8343|Hs108|chr6 (126 aa)
 initn: 292 init1: 256 opt: 302 Z-score: 367.2 bits: 74.0 E(32554): 3.9e-14
Smith-Waterman score: 302; 45.1% identity (76.2% similar) in 122 aa overlap (47-164:2-121)

20 30 40 50 60 70
pF1KE6 VWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQ-SKQ---RKRGRHGPRRC
:. :.:. . :. ::. . . . : .:
CCDS45 MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRK
10 20 30

80 90 100 110 120 130
pF1KE6 HSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATEAGHLARSTKRQT
.: : .:...: .:::::: ..: .:...:.:.:.::..::: ::..::. .::.:
CCDS45 RS--RKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRST
40 50 60 70 80

140 150 160 170
pF1KE6 ITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::. : . ::::::::...: : ::::::: .
CCDS45 ITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK
90 100 110 120

>>CCDS4588.1 HIST1H2BE gene_id:8344|Hs108|chr6 (126 aa)
 initn: 292 init1: 256 opt: 302 Z-score: 367.2 bits: 74.0 E(32554): 3.9e-14
Smith-Waterman score: 302; 45.1% identity (76.2% similar) in 122 aa overlap (47-164:2-121)

20 30 40 50 60 70
pF1KE6 VWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQ-SKQ---RKRGRHGPRRC
:. :.:. . :. ::. . . . : .:
CCDS45 MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRK
10 20 30

80 90 100 110 120 130
pF1KE6 HSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATEAGHLARSTKRQT
.: : .:...: .:::::: ..: .:...:.:.:.::..::: ::..::. .::.:
CCDS45 RS--RKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRST
40 50 60 70 80

140 150 160 170
pF1KE6 ITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::. : . ::::::::...: : ::::::: .
CCDS45 ITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK
90 100 110 120

>>CCDS4594.1 HIST1H2BG gene_id:8339|Hs108|chr6 (126 aa)
 initn: 292 init1: 256 opt: 302 Z-score: 367.2 bits: 74.0 E(32554): 3.9e-14
Smith-Waterman score: 302; 45.1% identity (76.2% similar) in 122 aa overlap (47-164:2-121)

20 30 40 50 60 70
pF1KE6 VWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQ-SKQ---RKRGRHGPRRC
:. :.:. . :. ::. . . . : .:
CCDS45 MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRK
10 20 30

80 90 100 110 120 130
pF1KE6 HSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATEAGHLARSTKRQT
.: : .:...: .:::::: ..: .:...:.:.:.::..::: ::..::. .::.:
CCDS45 RS--RKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRST
40 50 60 70 80

140 150 160 170
pF1KE6 ITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::. : . ::::::::...: : ::::::: .
CCDS45 ITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK
90 100 110 120

>>CCDS4584.1 HIST1H2BC gene_id:8347|Hs108|chr6 (126 aa)
 initn: 292 init1: 256 opt: 302 Z-score: 367.2 bits: 74.0 E(32554): 3.9e-14
Smith-Waterman score: 302; 45.1% identity (76.2% similar) in 122 aa overlap (47-164:2-121)

20 30 40 50 60 70
pF1KE6 VWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQ-SKQ---RKRGRHGPRRC
:. :.:. . :. ::. . . . : .:
CCDS45 MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRK
10 20 30

80 90 100 110 120 130
pF1KE6 HSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATEAGHLARSTKRQT
.: : .:...: .:::::: ..: .:...:.:.:.::..::: ::..::. .::.:
CCDS45 RS--RKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRST
40 50 60 70 80

140 150 160 170
pF1KE6 ITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::. : . ::::::::...: : ::::::: .
CCDS45 ITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK
90 100 110 120

>>CCDS4603.1 HIST1H2BI gene_id:8346|Hs108|chr6 (126 aa)
 initn: 292 init1: 256 opt: 302 Z-score: 367.2 bits: 74.0 E(32554): 3.9e-14
Smith-Waterman score: 302; 45.1% identity (76.2% similar) in 122 aa overlap (47-164:2-121)

20 30 40 50 60 70
pF1KE6 VWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQ-SKQ---RKRGRHGPRRC
:. :.:. . :. ::. . . . : .:
CCDS46 MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRK
10 20 30

80 90 100 110 120 130
pF1KE6 HSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATEAGHLARSTKRQT
.: : .:...: .:::::: ..: .:...:.:.:.::..::: ::..::. .::.:
CCDS46 RS--RKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRST
40 50 60 70 80

140 150 160 170
pF1KE6 ITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::. : . ::::::::...: : ::::::: .
CCDS46 ITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK
90 100 110 120

>>CCDS4621.1 HIST1H2BK gene_id:85236|Hs108|chr6 (126 aa)
 initn: 292 init1: 256 opt: 302 Z-score: 367.2 bits: 74.0 E(32554): 3.9e-14
Smith-Waterman score: 302; 45.1% identity (76.2% similar) in 122 aa overlap (47-164:2-121)

20 30 40 50 60 70
pF1KE6 VWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQ-SKQ---RKRGRHGPRRC
:. :.:. . :. ::. . . . : .:
CCDS46 MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRK
10 20 30

80 90 100 110 120 130
pF1KE6 HSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATEAGHLARSTKRQT
.: : .:...: .:::::: ..: .:...:.:.:.::..::: ::..::. .::.:
CCDS46 RS--RKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRST
40 50 60 70 80

140 150 160 170
pF1KE6 ITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::. : . ::::::::...: : ::::::: .
CCDS46 ITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSAK
90 100 110 120

>>CCDS936.1 HIST2H2BE gene_id:8349|Hs108|chr1 (126 aa)
 initn: 292 init1: 256 opt: 301 Z-score: 366.1 bits: 73.7 E(32554): 4.5e-14
Smith-Waterman score: 301; 45.1% identity (75.4% similar) in 122 aa overlap (47-164:2-121)

20 30 40 50 60 70
pF1KE6 VWSCHLMATASAMAGPSSETTSEEQLITQEPKEANSTTSQKQ-SKQ---RKRGRHGPRRC
:. :.:. . :. ::. . . . : .:
CCDS93 MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRK
10 20 30

80 90 100 110 120 130
pF1KE6 HSNCRGDSFATYFRRVLKQVHQGLSLSREAVSVMDSLVHDILDRIATEAGHLARSTKRQT
.: : .:.. : .:::::: ..: .:...:.:.:.::..::: ::..::. .::.:
CCDS93 RS--RKESYSIYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRST
40 50 60 70 80

140 150 160 170
pF1KE6 ITAWETRMAVRLLLPGQMGKLAESEGTKAVLRTSLYAIQQQRK
::. : . ::::::::...: : ::::::: .
CCDS93 ITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK
90 100 110 120

175 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 14:26:27 2016 done: Tue Nov 8 14:26:27 2016
 Total Scan time: 1.850 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]