FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB4810, 305 aa 1>>>pF1KB4810 305 - 305 aa - 305 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6832+/-0.00094; mu= 12.7038+/- 0.056 mean_var=56.6088+/-11.555, 0's: 0 Z-trim(102.6): 19 B-trim: 0 in 0/52 Lambda= 0.170464 statistics sampled from 7012 (7020) to 7012 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.571), E-opt: 0.2 (0.216), width: 16 Scan time: 2.340 The best scores are: opt bits E(32554) CCDS2886.1 DNASE1L3 gene_id:1776|Hs108|chr3 ( 305) 2001 500.5 6.4e-142 CCDS58836.1 DNASE1L3 gene_id:1776|Hs108|chr3 ( 275) 1324 334.0 7.6e-92 CCDS10507.1 DNASE1 gene_id:1773|Hs108|chr16 ( 282) 795 203.9 1.1e-52 CCDS14747.1 DNASE1L1 gene_id:1774|Hs108|chrX ( 302) 782 200.7 1.1e-51 CCDS42105.1 DNASE1L2 gene_id:1775|Hs108|chr16 ( 299) 466 123.0 2.7e-28 >>CCDS2886.1 DNASE1L3 gene_id:1776|Hs108|chr3 (305 aa) initn: 2001 init1: 2001 opt: 2001 Z-score: 2661.4 bits: 500.5 E(32554): 6.4e-142 Smith-Waterman score: 2001; 100.0% identity (100.0% similar) in 305 aa overlap (1-305:1-305) 10 20 30 40 50 60 pF1KB4 MSRELAPLLLLLLSIHSALAMRICSFNVRSFGESKQEDKNAMDVIVKVIKRCDIILVMEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS28 MSRELAPLLLLLLSIHSALAMRICSFNVRSFGESKQEDKNAMDVIVKVIKRCDIILVMEI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 KDSNNRICPILMEKLNRNSRRGITYNYVISSRLGRNTYKEQYAFLYKEKLVSVKRSYHYH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS28 KDSNNRICPILMEKLNRNSRRGITYNYVISSRLGRNTYKEQYAFLYKEKLVSVKRSYHYH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 DYQDGDADVFSREPFVVWFQSPHTAVKDFVIIPLHTTPETSVKEIDELVEVYTDVKHRWK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS28 DYQDGDADVFSREPFVVWFQSPHTAVKDFVIIPLHTTPETSVKEIDELVEVYTDVKHRWK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 AENFIFMGDFNAGCSYVPKKAWKNIRLRTDPRFVWLIGDQEDTTVKKSTNCAYDRIVLRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS28 AENFIFMGDFNAGCSYVPKKAWKNIRLRTDPRFVWLIGDQEDTTVKKSTNCAYDRIVLRG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 QEIVSSVVPKSNSVFDFQKAYKLTEEEALDVSDHFPVEFKLQSSRAFTNSKKSVTLRKKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS28 QEIVSSVVPKSNSVFDFQKAYKLTEEEALDVSDHFPVEFKLQSSRAFTNSKKSVTLRKKT 250 260 270 280 290 300 pF1KB4 KSKRS ::::: CCDS28 KSKRS >>CCDS58836.1 DNASE1L3 gene_id:1776|Hs108|chr3 (275 aa) initn: 1324 init1: 1324 opt: 1324 Z-score: 1762.4 bits: 334.0 E(32554): 7.6e-92 Smith-Waterman score: 1738; 90.2% identity (90.2% similar) in 305 aa overlap (1-305:1-275) 10 20 30 40 50 60 pF1KB4 MSRELAPLLLLLLSIHSALAMRICSFNVRSFGESKQEDKNAMDVIVKVIKRCDIILVMEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MSRELAPLLLLLLSIHSALAMRICSFNVRSFGESKQEDKNAMDVIVKVIKRCDIILVMEI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 KDSNNRICPILMEKLNRNSRRGITYNYVISSRLGRNTYKEQYAFLYKEKLVSVKRSYHYH ::::::::::::::::: ::::::::::::: CCDS58 KDSNNRICPILMEKLNR------------------------------EKLVSVKRSYHYH 70 80 90 130 140 150 160 170 180 pF1KB4 DYQDGDADVFSREPFVVWFQSPHTAVKDFVIIPLHTTPETSVKEIDELVEVYTDVKHRWK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 DYQDGDADVFSREPFVVWFQSPHTAVKDFVIIPLHTTPETSVKEIDELVEVYTDVKHRWK 100 110 120 130 140 150 190 200 210 220 230 240 pF1KB4 AENFIFMGDFNAGCSYVPKKAWKNIRLRTDPRFVWLIGDQEDTTVKKSTNCAYDRIVLRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 AENFIFMGDFNAGCSYVPKKAWKNIRLRTDPRFVWLIGDQEDTTVKKSTNCAYDRIVLRG 160 170 180 190 200 210 250 260 270 280 290 300 pF1KB4 QEIVSSVVPKSNSVFDFQKAYKLTEEEALDVSDHFPVEFKLQSSRAFTNSKKSVTLRKKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 QEIVSSVVPKSNSVFDFQKAYKLTEEEALDVSDHFPVEFKLQSSRAFTNSKKSVTLRKKT 220 230 240 250 260 270 pF1KB4 KSKRS ::::: CCDS58 KSKRS >>CCDS10507.1 DNASE1 gene_id:1773|Hs108|chr16 (282 aa) initn: 678 init1: 364 opt: 795 Z-score: 1059.1 bits: 203.9 E(32554): 1.1e-52 Smith-Waterman score: 795; 44.8% identity (72.4% similar) in 279 aa overlap (5-282:7-282) 10 20 30 40 50 pF1KB4 MSRELAPLLLLLLSIHSALAMRICSFNVRSFGESKQEDKNAMDVIVKVIKRCDIILVM :. :: : ...:....: .::...:::.:. . . .. ::....: :: ::. CCDS10 MRGMKLLGALLALAALLQGAVSLKIAAFNIQTFGETKMSNATLVSYIVQILSRYDIALVQ 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB4 EIKDSNNRICPILMEKLNRNSRRGITYNYVISSRLGRNTYKEQYAFLYKEKLVSVKRSYH :..::. :...::... ::.::.: ::::.:::.: :.:. ::. ::. CCDS10 EVRDSHLTAVGKLLDNLNQDAPD--TYHYVVSEPLGRNSYKERYLFVYRPDQVSAVDSYY 70 80 90 100 110 120 130 140 150 160 170 pF1KB4 YHDY-QDGDADVFSREPFVVWFQSPHTAVKDFVIIPLHTTPETSVKEIDELVEVYTDVKH : : . :.:.::: .: : : : :..:.:.:::..: .: ::: : .:: ::.. CCDS10 YDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPGDAVAEIDALYDVYLDVQE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB4 RWKAENFIFMGDFNAGCSYVPKKAWKNIRLRTDPRFVWLIGDQEDTTVKKSTNCAYDRIV .: :. ..::::::::::: . :..::: :.: : ::: :. :::. :.::::::: CCDS10 KWGLEDVMLMGDFNAGCSYVRPSQWSSIRLWTSPTFQWLIPDSADTTATP-THCAYDRIV 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB4 LRGQEIVSSVVPKSNSVFDFQKAYKLTEEEALDVSDHFPVEFKLQSSRAFTNSKKSVTLR . :. . ..::: : :.:: :: :... : .:::.::: :. CCDS10 VAGMLLRGAVVPDSALPFNFQAAYGLSDQLAQAISDHYPVEVMLK 240 250 260 270 280 300 pF1KB4 KKTKSKRS >>CCDS14747.1 DNASE1L1 gene_id:1774|Hs108|chrX (302 aa) initn: 701 init1: 406 opt: 782 Z-score: 1041.3 bits: 200.7 E(32554): 1.1e-51 Smith-Waterman score: 782; 41.0% identity (76.4% similar) in 288 aa overlap (9-296:7-287) 10 20 30 40 50 60 pF1KB4 MSRELAPLLLLLLSIHSALAMRICSFNVRSFGESKQEDKNAMDVIVKVIKRCDIILVMEI ::.:. ..: :.:::.::.. . .: ...::..:... ::::....:. CCDS14 MHYPTALLFLILANGAQAFRICAFNAQRLTLAKVAREQVMDTLVRILARCDIMVLQEV 10 20 30 40 50 70 80 90 100 110 120 pF1KB4 KDSNNRICPILMEKLNRNSRRGITYNYVISSRLGRNTYKEQYAFLYKEKLVSVKRSYHYH ::.. :.:...::: . : :. . : .:::.:: : :...:. . ..: :: :. CCDS14 VDSSGSAIPLLLRELNRFDGSG-PYSTLSSPQLGRSTYMETYVYFYRSHKTQVLSSYVYN 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB4 DYQDGDADVFSREPFVVWFQSPHTAVKDFVIIPLHTTPETSVKEIDELVEVYTDVKHRWK : . :::.:::::. :. : ... ..:..::::::.. ::.. : .:. .:...:. CCDS14 D----EDDVFAREPFVAQFSLPSNVLPSLVLVPLHTTPKAVEKELNALYDVFLEVSQHWQ 120 130 140 150 160 170 190 200 210 220 230 240 pF1KB4 AENFIFMGDFNAGCSYVPKKAWKNIRLRTDPRFVWLIGDQEDTTVKKSTNCAYDRIVLRG ... :..::::: :. . :: ...:::.: : :.:.: :::::. ::.:.:::.::.: CCDS14 SKDVILLGDFNADCASLTKKRLDKLELRTEPGFHWVIADGEDTTVRASTHCTYDRVVLHG 180 190 200 210 220 230 250 260 270 280 290 300 pF1KB4 QEIVSSVVPKSNSVFDFQKAYKLTEEEALDVSDHFPVEFKLQSSRAFTNSKKSVTLRKKT .. : . .. ..::: ...:::::::..:::.::: .:. :.: . . :.:. CCDS14 ERCRSLL--HTAAAFDFPTSFQLTEEEALNISDHYPVEVELKLSQAHSVQPLSLTVLLLL 240 250 260 270 280 290 pF1KB4 KSKRS CCDS14 SLLSPQLCPAA 300 >>CCDS42105.1 DNASE1L2 gene_id:1775|Hs108|chr16 (299 aa) initn: 643 init1: 280 opt: 466 Z-score: 621.4 bits: 123.0 E(32554): 2.7e-28 Smith-Waterman score: 729; 42.3% identity (65.3% similar) in 300 aa overlap (8-285:7-299) 10 20 30 40 50 pF1KB4 MSRELAPLLLLLLSIHSA--LAMRICSFNVRSFGESKQEDKNAMDVIVKVIKRCDIILVM :: : ....: :.:: .::..:::.:: : ..:.:.. :. ::. CCDS42 MGGPRALLAALWALEAAGTAALRIGAFNIQSFGDSKVSDPACGSIIAKILAGYDLALVQ 10 20 30 40 50 60 70 80 90 100 110 pF1KB4 EIKDSNNRICPILMEKLNRNSRRGITYNYVISSRLGRNTYKEQYAFLYKEKLVSVKRSYH :..: . :::..: :.. :..: :. :::. :::.: :.:.. ::: .: CCDS42 EVRDPDLSAVSALMEQINSVSEH--EYSFVSSQPLGRDQYKEMYLFVYRKDAVSVVDTYL 60 70 80 90 100 110 120 130 140 150 pF1KB4 YHDYQDGDADVFSREPFVVWFQSPHT--------------------AVKDFVIIPLHTTP : : .: ::::::::: :..: : :....:.::::..: CCDS42 YPDPED----VFSREPFVVKFSAPGTGERAPPLPSRRALTPPPLPAAAQNLVLIPLHAAP 120 130 140 150 160 170 160 170 180 190 200 210 pF1KB4 ETSVKEIDELVEVYTDVKHRWKAENFIFMGDFNAGCSYVPKKAWKNIRLRTDPRFVWLIG . .: ::: : .:: :: .: .....:.::::: :::: . : ::::.. : ::: CCDS42 HQAVAEIDALYDVYLDVIDKWGTDDMLFLGDFNADCSYVRAQDWAAIRLRSSEVFKWLIP 180 190 200 210 220 230 220 230 240 250 260 270 pF1KB4 DQEDTTVKKSTNCAYDRIVLRGQEIVSSVVPKSNSVFDFQKAYKLTEEEALDVSDHFPVE :. :::: .: .::::::: : .. :. :.: .: :::. . : . .:: .::::::: CCDS42 DSADTTVGNS-DCAYDRIVACGARLRRSLKPQSATVHDFQEEFGLDQTQALAISDHFPVE 240 250 260 270 280 290 280 290 300 pF1KB4 FKLQSSRAFTNSKKSVTLRKKTKSKRS :. : CCDS42 VTLKFHR 305 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:14:51 2016 done: Mon Nov 7 01:14:51 2016 Total Scan time: 2.340 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]