FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6168, 141 aa
  1>>>pF1KE6168 141 - 141 aa - 141 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1103+/-0.000279; mu= 12.2638+/- 0.017
 mean_var=58.7165+/-11.871, 0's: 0 Z-trim(117.7): 28 B-trim: 0 in 0/54
 Lambda= 0.167376
 statistics sampled from 29916 (29944) to 29916 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.746), E-opt: 0.2 (0.351), width: 16
 Scan time: 4.470

The best scores are:                                       opt  bits E(85289)
XP_006722990 (OMIM: 611340) PREDICTED: cysteine pr ( 360)  974 242.9 3.2e-64
NP_001268433 (OMIM: 611340) cysteine protease ATG4 ( 411)  974 243.0 3.6e-64
NP_116274 (OMIM: 611340) cysteine protease ATG4D i ( 474)  974 243.0 4e-64
NP_835739 (OMIM: 611339) cysteine protease ATG4C [ ( 458)  475 122.5 7.3e-28
XP_005271345 (OMIM: 611339) PREDICTED: cysteine pr ( 458)  475 122.5 7.3e-28
NP_116241 (OMIM: 611339) cysteine protease ATG4C [ ( 458)  475 122.5 7.3e-28
XP_011540615 (OMIM: 611339) PREDICTED: cysteine pr ( 423)  387 101.2 1.7e-21
XP_005247053 (OMIM: 611338) PREDICTED: cysteine pr ( 319)  273  73.6 2.6e-13
XP_016859129 (OMIM: 611338) PREDICTED: cysteine pr ( 319)  273  73.6 2.6e-13
NP_037457 (OMIM: 611338) cysteine protease ATG4B i ( 393)  273  73.7 3.1e-13
XP_016859127 (OMIM: 611338) PREDICTED: cysteine pr ( 457)  273  73.7 3.5e-13
XP_016859128 (OMIM: 611338) PREDICTED: cysteine pr ( 341)  264  71.5 1.2e-12
XP_005247052 (OMIM: 611338) PREDICTED: cysteine pr ( 341)  264  71.5 1.2e-12
NP_847896 (OMIM: 611338) cysteine protease ATG4B i ( 380)  264  71.5 1.4e-12
XP_005247050 (OMIM: 611338) PREDICTED: cysteine pr ( 405)  264  71.5 1.4e-12
XP_005247049 (OMIM: 611338) PREDICTED: cysteine pr ( 415)  264  71.5 1.5e-12
XP_016859126 (OMIM: 611338) PREDICTED: cysteine pr ( 479)  264  71.6 1.7e-12
NP_001308219 (OMIM: 300663) cysteine protease ATG4 ( 226)  241  65.8 4.1e-11
XP_011529144 (OMIM: 300663) PREDICTED: cysteine pr ( 321)  241  65.9 5.5e-11
NP_001308217 (OMIM: 300663) cysteine protease ATG4 ( 321)  241  65.9 5.5e-11
NP_001308216 (OMIM: 300663) cysteine protease ATG4 ( 321)  241  65.9 5.5e-11
NP_840055 (OMIM: 300663) cysteine protease ATG4A i ( 321)  241  65.9 5.5e-11
NP_443168 (OMIM: 300663) cysteine protease ATG4A i ( 398)  241  66.0 6.6e-11
XP_016858089 (OMIM: 611339) PREDICTED: cysteine pr ( 418)  225  62.1 1e-09
XP_016858090 (OMIM: 611339) PREDICTED: cysteine pr ( 418)  225  62.1 1e-09
NP_001308218 (OMIM: 300663) cysteine protease ATG4 ( 259)  192  54.0 1.7e-07
NP_840054 (OMIM: 300663) cysteine protease ATG4A i ( 336)  192  54.1 2.1e-07
XP_011529143 (OMIM: 300663) PREDICTED: cysteine pr ( 340)  192  54.1 2.1e-07

>>XP_006722990 (OMIM: 611340) PREDICTED: cysteine protea  (360 aa)
 initn: 974 init1: 974 opt: 974  Z-score: 1274.0  bits: 242.9 E(85289): 3.2e-64
Smith-Waterman score: 974; 100.0% identity (100.0% similar) in 141 aa overlap (1-141:220-360)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     ::::::::::::::::::::::::::::::
XP_006 VPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
     190       200       210       220       230       240

               40        50        60        70        80        90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
     250       260       270       280       290       300

              100       110       120       130       140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
       :::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
     310       320       330       340       350       360

>>NP_001268433 (OMIM: 611340) cysteine protease ATG4D is  (411 aa)
 initn: 974 init1: 974 opt: 974  Z-score: 1273.1  bits: 243.0 E(85289): 3.6e-64
Smith-Waterman score: 974; 100.0% identity (100.0% similar) in 141 aa overlap (1-141:271-411)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     ::::::::::::::::::::::::::::::
NP_001 VPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
              250       260       270       280       290       300

               40        50        60        70        80        90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
              310       320       330       340       350       360

              100       110       120       130       140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
       :::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
              370       380       390       400       410

>>NP_116274 (OMIM: 611340) cysteine protease ATG4D isofo  (474 aa)
 initn: 974 init1: 974 opt: 974  Z-score: 1272.2  bits: 243.0 E(85289): 4e-64
Smith-Waterman score: 974; 100.0% identity (100.0% similar) in 141 aa overlap (1-141:334-474)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     ::::::::::::::::::::::::::::::
NP_116 VPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
           310       320       330       340       350       360

               40        50        60        70        80        90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_116 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
           370       380       390       400       410       420

              100       110       120       130       140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
       :::::::::::::::::::::::::::::::::::::::::::::::::::
NP_116 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
           430       440       450       460       470

>>NP_835739 (OMIM: 611339) cysteine protease ATG4C [Homo  (458 aa)
 initn: 495 init1: 400 opt: 475  Z-score: 621.2  bits: 122.5 E(85289): 7.3e-28
Smith-Waterman score: 475; 49.6% identity (72.3% similar) in 141 aa overlap (1-141:323-458)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     .::::..: :: :.::: :.:.::::::
NP_835 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
            300       310       320       330       340       350

               40        50        60        70        80        90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
       ::::  :::::.::: ::.::.: :::::::.:::  . ..:.    :.:..:. ::  :
NP_835 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSK-E
            360       370       380       390       400       410

              100       110       120       130       140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
       .::.::...::..:.  :   .   .  :      . :  :: :.:.::.:
NP_835 KYPLFTFVNGHSRDY--DFTSTTTNEEDLFSEDEKKQL--KRFSTEEFVLL
             420         430       440         450

>>XP_005271345 (OMIM: 611339) PREDICTED: cysteine protea  (458 aa)
 initn: 495 init1: 400 opt: 475  Z-score: 621.2  bits: 122.5 E(85289): 7.3e-28
Smith-Waterman score: 475; 49.6% identity (72.3% similar) in 141 aa overlap (1-141:323-458)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     .::::..: :: :.::: :.:.::::::
XP_005 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
            300       310       320       330       340       350

               40        50        60        70        80        90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
       ::::  :::::.::: ::.::.: :::::::.:::  . ..:.    :.:..:. ::  :
XP_005 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSK-E
            360       370       380       390       400       410

              100       110       120       130       140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
       .::.::...::..:.  :   .   .  :      . :  :: :.:.::.:
XP_005 KYPLFTFVNGHSRDY--DFTSTTTNEEDLFSEDEKKQL--KRFSTEEFVLL
             420         430       440         450

>>NP_116241 (OMIM: 611339) cysteine protease ATG4C [Homo  (458 aa)
 initn: 495 init1: 400 opt: 475  Z-score: 621.2  bits: 122.5 E(85289): 7.3e-28
Smith-Waterman score: 475; 49.6% identity (72.3% similar) in 141 aa overlap (1-141:323-458)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     .::::..: :: :.::: :.:.::::::
NP_116 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
            300       310       320       330       340       350

               40        50        60        70        80        90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
       ::::  :::::.::: ::.::.: :::::::.:::  . ..:.    :.:..:. ::  :
NP_116 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSK-E
            360       370       380       390       400       410

              100       110       120       130       140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
       .::.::...::..:.  :   .   .  :      . :  :: :.:.::.:
NP_116 KYPLFTFVNGHSRDY--DFTSTTTNEEDLFSEDEKKQL--KRFSTEEFVLL
             420         430       440         450

>>XP_011540615 (OMIM: 611339) PREDICTED: cysteine protea  (423 aa)
 initn: 368 init1: 368 opt: 387  Z-score: 506.9  bits: 101.2 E(85289): 1.7e-21
Smith-Waterman score: 387; 60.5% identity (80.2% similar) in 81 aa overlap (1-81:323-403)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     .::::..: :: :.::: :.:.::::::
XP_011 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
            300       310       320       330       340       350

               40        50        60        70        80        90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
       ::::  :::::.::: ::.::.: :::::::.:::  . ..:.    :.:.
XP_011 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKNQIMIKAAI
            360       370       380       390       400       410

              100       110       120       130       140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL

XP_011 INIHVQDFMWT
            420

>>XP_005247053 (OMIM: 611338) PREDICTED: cysteine protea  (319 aa)
 initn: 239 init1: 129 opt: 273  Z-score: 360.0  bits: 73.6 E(85289): 2.6e-13
Smith-Waterman score: 273; 37.1% identity (60.8% similar) in 143 aa overlap (1-141:182-317)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     .::::  . :::::  . :.:::::  ::.
XP_005 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA
             160       170       180       190       200       210

                40         50        60        70        80
pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
       :. ... : : :::::  :  .:..:..::: .:::.   . .:.  :... ..   ..:
XP_005 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA
             220       230       240       250       260       270

       90       100       110       120       130       140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
           ::: :.:   :   :   : .. . .:    . :: :     .::: .:
XP_005 ---LPMFELVE--LQPSHL--ACPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
                  280         290       300       310

>>XP_016859129 (OMIM: 611338) PREDICTED: cysteine protea  (319 aa)
 initn: 239 init1: 129 opt: 273  Z-score: 360.0  bits: 73.6 E(85289): 2.6e-13
Smith-Waterman score: 273; 37.1% identity (60.8% similar) in 143 aa overlap (1-141:182-317)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     .::::  . :::::  . :.:::::  ::.
XP_016 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA
             160       170       180       190       200       210

                40         50        60        70        80
pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
       :. ... : : :::::  :  .:..:..::: .:::.   . .:.  :... ..   ..:
XP_016 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA
             220       230       240       250       260       270

       90       100       110       120       130       140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
           ::: :.:   :   :   : .. . .:    . :: :     .::: .:
XP_016 ---LPMFELVE--LQPSHL--ACPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
                  280         290       300       310

>>NP_037457 (OMIM: 611338) cysteine protease ATG4B isofo  (393 aa)
 initn: 224 init1: 129 opt: 273  Z-score: 358.6  bits: 73.7 E(85289): 3.1e-13
Smith-Waterman score: 273; 37.1% identity (60.8% similar) in 143 aa overlap (1-141:256-391)

                                             10        20        30
pF1KE6                               MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
                                     .::::  . :::::  . :.:::::  ::.
NP_037 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA
         230       240       250       260       270       280

                40         50        60        70        80
pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
       :. ... : : :::::  :  .:..:..::: .:::.   . .:.  :... ..   ..:
NP_037 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA
         290       300       310       320       330       340

       90       100       110       120       130       140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
           ::: :.:   :   :   : .. . .:    . :: :     .::: .:
NP_037 ---LPMFELVE--LQPSHL--ACPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
            350           360       370       380       390

141 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 10:01:38 2016 done: Tue Nov 8 10:01:39 2016
 Total Scan time: 4.470 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]