FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5054, 153 aa
  1>>>pF1KE5054 153 - 153 aa - 153 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.9867+/-0.000721; mu= 13.4032+/- 0.044
 mean_var=55.7033+/-10.914, 0's: 0 Z-trim(107.9): 10 B-trim: 0 in 0/51
 Lambda= 0.171844
 statistics sampled from 9835 (9840) to 9835 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.705), E-opt: 0.2 (0.302), width: 16
 Scan time: 1.730

The best scores are:                                      opt  bits E(32554)
CCDS694.1 DNASE2B gene_id:58511|Hs108|chr1        ( 153) 1073 273.5 3.4e-74
CCDS44167.1 DNASE2B gene_id:58511|Hs108|chr1      ( 361) 1073 273.7 7.2e-74
CCDS12284.1 DNASE2 gene_id:1777|Hs108|chr19       ( 360)  368  98.9 2.9e-21


>>CCDS694.1 DNASE2B gene_id:58511|Hs108|chr1 (153 aa)
 initn: 1073 init1: 1073 opt: 1073 Z-score: 1445.4 bits: 273.5 E(32554): 3.4e-74
Smith-Waterman score: 1073; 100.0% identity (100.0% similar) in 153 aa overlap (1-153:1-153)

               10        20        30        40        50        60
pF1KE5 MPQLCTRASSSEIPGRLLTTLQSAQGQKFLHFAKSDSFLDDIFAAWMAQRLKTHLLTETW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MPQLCTRASSSEIPGRLLTTLQSAQGQKFLHFAKSDSFLDDIFAAWMAQRLKTHLLTETW
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 QRKRQELPSNCSLPYHVYNIKAIKLSRHSYFSSYQDHAKWCISQKGTKNRWTCIGDLNRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 QRKRQELPSNCSLPYHVYNIKAIKLSRHSYFSSYQDHAKWCISQKGTKNRWTCIGDLNRS
               70        80        90       100       110       120

              130       140       150
pF1KE5 PHQAFRSGGFICTQNWQIYQAFQGLVLYYESCK
       :::::::::::::::::::::::::::::::::
CCDS69 PHQAFRSGGFICTQNWQIYQAFQGLVLYYESCK
              130       140       150

>>CCDS44167.1 DNASE2B gene_id:58511|Hs108|chr1 (361 aa)
 initn: 1073 init1: 1073 opt: 1073 Z-score: 1439.7 bits: 273.7 E(32554): 7.2e-74
Smith-Waterman score: 1073; 100.0% identity (100.0% similar) in 153 aa overlap (1-153:209-361)

                                             10        20        30
pF1KE5                               MPQLCTRASSSEIPGRLLTTLQSAQGQKFL
                                     ::::::::::::::::::::::::::::::
CCDS44 YEAIDSQLLVCNPNVYSCSIPATFHQELIHMPQLCTRASSSEIPGRLLTTLQSAQGQKFL
      180       190       200       210       220       230

               40        50        60        70        80        90
pF1KE5 HFAKSDSFLDDIFAAWMAQRLKTHLLTETWQRKRQELPSNCSLPYHVYNIKAIKLSRHSY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 HFAKSDSFLDDIFAAWMAQRLKTHLLTETWQRKRQELPSNCSLPYHVYNIKAIKLSRHSY
      240       250       260       270       280       290

              100       110       120       130       140       150
pF1KE5 FSSYQDHAKWCISQKGTKNRWTCIGDLNRSPHQAFRSGGFICTQNWQIYQAFQGLVLYYE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 FSSYQDHAKWCISQKGTKNRWTCIGDLNRSPHQAFRSGGFICTQNWQIYQAFQGLVLYYE
      300       310       320       330       340       350


pF1KE5 SCK
       :::
CCDS44 SCK
      360

>>CCDS12284.1 DNASE2 gene_id:1777|Hs108|chr19 (360 aa)
 initn: 272 init1: 150 opt: 368 Z-score: 495.1 bits: 98.9 E(32554): 2.9e-21
Smith-Waterman score: 368; 39.6% identity (63.2% similar) in 144 aa overlap (11-152:207-347)

                                   10        20        30        40
pF1KE5                     MPQLCTRASSSEIPGRLLTTLQSAQGQKFLHFAKSDSFLD
                                     :. :     :: :  :  :  ::: ..: :
CCDS12 TYPWVYNYQLEGIFAQEFPDLENVVKGHHVSQEPWNSSITLTSQAGAVFQSFAKFSKFGD
        180       190       200       210       220       230

               50        60        70        80          90
pF1KE5 DIFAAWMAQRLKTHLLTETWQRKRQELPSNCSLPYHVYNIKAIKLSRHS--YFSSYQDHA
       :....:.:  : :.: .. :..    ::::::  ..: :.. : .   .   :.: .::.
CCDS12 DLYSGWLAAALGTNLQVQFWHKTVGILPSNCSDIWQVLNVNQIAFPGPAGPSFNSTEDHS
        240       250       260       270       280       290

      100       110       120       130       140       150
pF1KE5 KWCISQKGTKNRWTCIGDLNRSPHQAFRSGGFICTQNWQIYQAFQGLVLYYESCK
       :::.: ::    :::.::.::.  .  :.:: .:.:   ...::: ::  :. :
CCDS12 KWCVSPKGP---WTCVGDMNRNQGEEQRGGGTLCAQLPALWKAFQPLVKNYQPCNGMARK
        300          310       320       330       340       350


CCDS12 PSRAYKI
           360


153 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 04:34:54 2016 done: Tue Nov 8 04:34:54 2016
 Total Scan time: 1.730 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]