FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6168, 141 aa 1>>>pF1KE6168 141 - 141 aa - 141 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1778+/-0.000612; mu= 11.6694+/- 0.037 mean_var=56.4089+/-11.609, 0's: 0 Z-trim(110.7): 9 B-trim: 481 in 1/50 Lambda= 0.170766 statistics sampled from 11766 (11772) to 11766 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.758), E-opt: 0.2 (0.362), width: 16 Scan time: 1.210 The best scores are: opt bits E(32554) CCDS12241.1 ATG4D gene_id:84971|Hs108|chr19 ( 474) 974 247.6 6.6e-66 CCDS623.1 ATG4C gene_id:84938|Hs108|chr1 ( 458) 475 124.6 6.5e-29 CCDS46564.1 ATG4B gene_id:23192|Hs108|chr2 ( 393) 273 74.8 5.4e-14 CCDS46565.1 ATG4B gene_id:23192|Hs108|chr2 ( 380) 264 72.6 2.4e-13 CCDS14538.1 ATG4A gene_id:115201|Hs108|chrX ( 398) 241 66.9 1.3e-11 >>CCDS12241.1 ATG4D gene_id:84971|Hs108|chr19 (474 aa) initn: 974 init1: 974 opt: 974 Z-score: 1296.8 bits: 247.6 E(32554): 6.6e-66 Smith-Waterman score: 974; 100.0% identity (100.0% similar) in 141 aa overlap (1-141:334-474) 10 20 30 pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT :::::::::::::::::::::::::::::: CCDS12 VPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPT 310 320 330 340 350 360 40 50 60 70 80 90 pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE 370 380 390 400 410 420 100 110 120 130 140 pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL 430 440 450 460 470 >>CCDS623.1 ATG4C gene_id:84938|Hs108|chr1 (458 aa) initn: 495 init1: 400 opt: 475 Z-score: 632.6 bits: 124.6 E(32554): 6.5e-29 Smith-Waterman score: 475; 49.6% identity (72.3% similar) in 141 aa overlap (1-141:323-458) 10 20 30 pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT .::::..: :: :.::: :.:.:::::: CCDS62 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF 300 310 320 330 340 350 40 50 60 70 80 90 pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE :::: :::::.::: ::.::.: :::::::.::: . ..:. :.:..:. :: : CCDS62 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSK-E 360 370 380 390 400 410 100 110 120 130 140 pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL .::.::...::..:. : . . : . : :: :.:.::.: CCDS62 KYPLFTFVNGHSRDY--DFTSTTTNEEDLFSEDEKKQL--KRFSTEEFVLL 420 430 440 450 >>CCDS46564.1 ATG4B gene_id:23192|Hs108|chr2 (393 aa) initn: 224 init1: 129 opt: 273 Z-score: 364.7 bits: 74.8 E(32554): 5.4e-14 Smith-Waterman score: 273; 37.1% identity (60.8% similar) in 143 aa overlap (1-141:256-391) 10 20 30 pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT .:::: . ::::: . :.::::: ::. CCDS46 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA 230 240 250 260 270 280 40 50 60 70 80 pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA :. ... : : ::::: : .:..:..::: .:::. . .:. :... .. ..: CCDS46 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA 290 300 310 320 330 340 90 100 110 120 130 140 pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL ::: :.: : : : .. . .: . :: : .::: .: CCDS46 ---LPMFELVE--LQPSHL--ACPDVLNLSLDSSDVERLERFFDSEDEDFEILSL 350 360 370 380 390 >>CCDS46565.1 ATG4B gene_id:23192|Hs108|chr2 (380 aa) initn: 224 init1: 129 opt: 264 Z-score: 352.9 bits: 72.6 E(32554): 2.4e-13 Smith-Waterman score: 264; 41.6% identity (68.3% similar) in 101 aa overlap (1-99:256-353) 10 20 30 pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT .:::: . ::::: . :.::::: ::. CCDS46 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA 230 240 250 260 270 280 40 50 60 70 80 pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA :. ... : : ::::: : .:..:..::: .:::. . .:. :... .. ..: CCDS46 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA 290 300 310 320 330 340 90 100 110 120 130 140 pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL ::: :.: CCDS46 L---PMFELVELQPSHLACPDVLNLSLGESCQVQILLM 350 360 370 380 >>CCDS14538.1 ATG4A gene_id:115201|Hs108|chrX (398 aa) initn: 235 init1: 126 opt: 241 Z-score: 322.0 bits: 66.9 E(32554): 1.3e-11 Smith-Waterman score: 241; 35.9% identity (64.1% similar) in 103 aa overlap (1-101:257-355) 10 20 30 pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT .:::: .. ::::. : :..:::: : CCDS14 VPLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTF 230 240 250 260 270 280 40 50 60 70 80 pF1KE6 VDVSQ-ADFPLESFHCT-SPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA ::. . . ..::: ::..: . ..::: ..::. ..:.:.. :: . . . CCDS14 VDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEI----L 290 300 310 320 330 340 90 100 110 120 130 140 pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL : :: :.. : CCDS14 KENLRMFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV 350 360 370 380 390 141 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 10:01:38 2016 done: Tue Nov 8 10:01:38 2016 Total Scan time: 1.210 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]