FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5364, 395 aa
  1>>>pF1KB5364 395 - 395 aa - 395 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1862+/-0.000843; mu= 17.4947+/- 0.051
 mean_var=62.5184+/-12.509, 0's: 0 Z-trim(106.2): 16 B-trim: 0 in 0/49
 Lambda= 0.162207
 statistics sampled from 8823 (8827) to 8823 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.652), E-opt: 0.2 (0.271), width: 16
 Scan time: 2.770

The best scores are:                                opt bits E(32554)
CCDS6006.1 ASAH1 gene_id:427|Hs108|chr8     ( 395) 2682 636.2 1.5e-182
CCDS6005.1 ASAH1 gene_id:427|Hs108|chr8     ( 411) 2507 595.3 3.3e-170
CCDS47813.1 ASAH1 gene_id:427|Hs108|chr8    ( 389) 1987 473.6 1.4e-133
CCDS43239.1 NAAA gene_id:27163|Hs108|chr4   ( 359)  460 116.2 4.7e-26

>>CCDS6006.1 ASAH1 gene_id:427|Hs108|chr8  (395 aa)
 initn: 2682 init1: 2682 opt: 2682 Z-score: 3390.8 bits: 636.2 E(32554): 1.5e-182
Smith-Waterman score: 2682; 99.2% identity (100.0% similar) in 395 aa overlap (1-395:1-395)

               10        20        30        40        50        60
pF1KB5 MPGRSCVALVLLAAAVSCAVAQHAPPWTEDCRKSTYPPSGPTYRGAVPWYTINLDLPPYK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 MPGRSCVALVLLAAAVSCAVAQHAPPWTEDCRKSTYPPSGPTYRGAVPWYTINLDLPPYK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 RWHELMLDKAPMLKVIVNSLKNMINTFVPSGKVMQVVDEKLPGLLGNFPGPFEEEMKGIA
       :::::::::::.::::::::::::::::::::.:::::::::::::::::::::::::::
CCDS60 RWHELMLDKAPVLKVIVNSLKNMINTFVPSGKIMQVVDEKLPGLLGNFPGPFEEEMKGIA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 AVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGWNINNDTWVITE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 AVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGWNINNDTWVITE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 QLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSINGGYLGILEWI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 QLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSINGGYLGILEWI
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 LGKKDAMWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCVITRDRKE
       :::::.::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 LGKKDVMWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCVITRDRKE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 SLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENISFETMYDVLST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 SLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENISFETMYDVLST
              310       320       330       340       350       360

              370       380       390
pF1KB5 KPVLNKLTVYTTLIDVTKGQFETYLRDCPDPCIGW
       :::::::::::::::::::::::::::::::::::
CCDS60 KPVLNKLTVYTTLIDVTKGQFETYLRDCPDPCIGW
              370       380       390

>>CCDS6005.1 ASAH1 gene_id:427|Hs108|chr8  (411 aa)
 initn: 2507 init1: 2507 opt: 2507 Z-score: 3169.2 bits: 595.3 E(32554): 3.3e-170
Smith-Waterman score: 2507; 99.2% identity (100.0% similar) in 369 aa overlap (27-395:43-411)

                   10        20        30        40        50
pF1KB5     MPGRSCVALVLLAAAVSCAVAQHAPPWTEDCRKSTYPPSGPTYRGAVPWYTINLDL
                                     ::::::::::::::::::::::::::::::
CCDS60 GSHRASYPSLSALFTEASILGFGSFAVKAQWTEDCRKSTYPPSGPTYRGAVPWYTINLDL
             20        30        40        50        60        70

         60        70        80        90       100       110
pF1KB5 PPYKRWHELMLDKAPMLKVIVNSLKNMINTFVPSGKVMQVVDEKLPGLLGNFPGPFEEEM
       :::::::::::::::.::::::::::::::::::::.:::::::::::::::::::::::
CCDS60 PPYKRWHELMLDKAPVLKVIVNSLKNMINTFVPSGKIMQVVDEKLPGLLGNFPGPFEEEM
             80        90       100       110       120       130

        120       130       140       150       160       170
pF1KB5 KGIAAVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGWNINNDTW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 KGIAAVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGWNINNDTW
            140       150       160       170       180       190

        180       190       200       210       220       230
pF1KB5 VITEQLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSINGGYLGI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 VITEQLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSINGGYLGI
            200       210       220       230       240       250

        240       250       260       270       280       290
pF1KB5 LEWILGKKDAMWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCVITR
       :::::::::.::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 LEWILGKKDVMWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCVITR
            260       270       280       290       300       310

        300       310       320       330       340       350
pF1KB5 DRKESLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENISFETMYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 DRKESLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENISFETMYD
            320       330       340       350       360       370

        360       370       380       390
pF1KB5 VLSTKPVLNKLTVYTTLIDVTKGQFETYLRDCPDPCIGW
       :::::::::::::::::::::::::::::::::::::::
CCDS60 VLSTKPVLNKLTVYTTLIDVTKGQFETYLRDCPDPCIGW
            380       390       400       410

>>CCDS47813.1 ASAH1 gene_id:427|Hs108|chr8  (389 aa)
 initn: 2311 init1: 1987 opt: 1987 Z-score: 2511.9 bits: 473.6 E(32554): 1.4e-133
Smith-Waterman score: 2240; 89.9% identity (90.4% similar) in 376 aa overlap (27-395:43-389)

                   10        20        30        40
pF1KB5     MPGRSCVALVLLAAAVSCAVAQHAPPWTEDCRKSTYPPSGPT-------YRGAVPW
                                     ::::::::::::::::       :::::::
CCDS47 GSHRASYPSLSALFTEASILGFGSFAVKAQWTEDCRKSTYPPSGPTVFPAVIRYRGAVPW
             20        30        40        50        60        70

      50        60        70        80        90       100
pF1KB5 YTINLDLPPYKRWHELMLDKAPMLKVIVNSLKNMINTFVPSGKVMQVVDEKLPGLLGNFP
       ::::::::::::::::::::::.                             ::::::::
CCDS47 YTINLDLPPYKRWHELMLDKAPV-----------------------------PGLLGNFP
             80        90                                    100

     110       120       130       140       150       160
pF1KB5 GPFEEEMKGIAAVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 GPFEEEMKGIAAVTDIPLGEIISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGW
           110       120       130       140       150       160

     170       180       190       200       210       220
pF1KB5 NINNDTWVITEQLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 NINNDTWVITEQLKPLTVNLDFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSI
           170       180       190       200       210       220

     230       240       250       260       270       280
pF1KB5 NGGYLGILEWILGKKDAMWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSG
       ::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::::
CCDS47 NGGYLGILEWILGKKDVMWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSG
           230       240       250       260       270       280

     290       300       310       320       330       340
pF1KB5 EGCVITRDRKESLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 EGCVITRDRKESLDVYELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENI
           290       300       310       320       330       340

     350       360       370       380       390
pF1KB5 SFETMYDVLSTKPVLNKLTVYTTLIDVTKGQFETYLRDCPDPCIGW
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SFETMYDVLSTKPVLNKLTVYTTLIDVTKGQFETYLRDCPDPCIGW
           350       360       370       380

>>CCDS43239.1 NAAA gene_id:27163|Hs108|chr4  (359 aa)
 initn: 648 init1: 316 opt: 460 Z-score: 581.2 bits: 116.2 E(32554): 4.7e-26
Smith-Waterman score: 642; 33.9% identity (64.1% similar) in 354 aa overlap (46-387:32-355)

          20        30        40        50        60
pF1KB5 VSCAVAQHAPPWTEDCRKSTYPPSGPTYRGAVPWYTINLDLPPYKRW------HELMLDK
                                     :.: ....::  :  ::      ..: : .
CCDS43 RTADREARPGLPSLLLLLLAGAGLSAASPPAAPRFNVSLDSVPELRWLPVLRHYDLDLVR
              10        20        30        40        50        60

      70        80        90       100       110       120
pF1KB5 APMLKVIVNSLKNMINTFVPSGKVMQVVDEKLPGLLGNFPGPFEEEMKGIAAVTDIPLGE
       : : .:: . . . .....  :::.  ... :       : ::  :..:.    .. :..
CCDS43 AAMAQVIGDRVPKWVHVLI--GKVVLELERFL-------PQPFTGEIRGMCDFMNLSLAD
              70        80          90              100       110

     130       140       150       160       170       180
pF1KB5 IISFNIFYELFTICTSIVAEDKKGHLIHGRNMDFGVFLGWNINNDTWVITEQLKPLTVNL
        .  :. ::  ..::::::.:..::. ::::.:.   .: :.          :. :::..
CCDS43 CLLVNLAYESSVFCTSIVAQDSRGHIYHGRNLDYP--FG-NV----------LRKLTVDV
            120       130       140          150

     190       200       210       220       230       240
pF1KB5 DFQRNNKTVFKASSFAGYVGMLTGFKPGLFSLTLNERFSINGGYLGILEWILGKKDAMW-
       .: .:.. .: ...: ::::. :: .:  :... .::   . :.     :  .   :..
CCDS43 QFLKNGQIAFTGTTFIGYVGLWTGQSPHKFTVSGDER---DKGW-----WWENAIAALFR
     160       170       180       190          200            210

          250       260       270       280       290       300
pF1KB5 ----IGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCVITRDRKESLDV
           ...: :..: .: ..: : . :.:: ..: .:.:.::..  :: ::::.:    :.
CCDS43 RHIPVSWLIRATLSESENFEAAVGKLAKTPLIADVYYIVGGTSPREGVVITRNRDGPADI
             220       230       240       250       260       270

          310       320       330       340       350       360
pF1KB5 YELDAKQGRWYVVQTNYDRWKHPFFLDDRRTPAKMCLNRTSQENISFETMYDVLSTKPVL
       . ::  .: :. :.::::.::     ::::: :   :: :.: :.:.:.....::. ::
CCDS43 WPLDPLNGAWFRVETNYDHWKPAPKEDDRRTSAIKALNATGQANLSLEALFQILSVVPVY
             280       290       300       310       320       330

          370        380       390
pF1KB5 NKLTVYTTLIDV-TKGQFETYLRDCPDPCIGW
       :..:.:::.... .  .. : .:.
CCDS43 NNFTIYTTVMSAGSPDKYMTRIRNPSRK
             340       350

395 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 12:31:11 2016 done: Sat Nov 5 12:31:11 2016
 Total Scan time: 2.770 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]