FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2316, 398 aa
  1>>>pF1KE2316 398 - 398 aa - 398 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.9686+/-0.000297; mu= 19.0495+/- 0.019
 mean_var=64.3943+/-13.067, 0's: 0 Z-trim(117.2): 34 B-trim: 150 in 2/50
 Lambda= 0.159827
 statistics sampled from 28909 (28950) to 28909 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.709), E-opt: 0.2 (0.339), width: 16
 Scan time: 7.430

The best scores are:                                       opt  bits E(85289)
NP_443168 (OMIM: 300663) cysteine protease ATG4A i ( 398) 2751 642.7 4.4e-184
NP_001308216 (OMIM: 300663) cysteine protease ATG4 ( 321) 2227 521.9 8.8e-148
NP_840055 (OMIM: 300663) cysteine protease ATG4A i ( 321) 2227 521.9 8.8e-148
NP_001308217 (OMIM: 300663) cysteine protease ATG4 ( 321) 2227 521.9 8.8e-148
XP_011529144 (OMIM: 300663) PREDICTED: cysteine pr ( 321) 2227 521.9 8.8e-148
NP_001308219 (OMIM: 300663) cysteine protease ATG4 ( 226) 1546 364.7 1.2e-100
NP_037457 (OMIM: 611338) cysteine protease ATG4B i ( 393) 1502 354.7 2.2e-97
XP_016859127 (OMIM: 611338) PREDICTED: cysteine pr ( 457) 1502 354.8 2.4e-97
NP_847896 (OMIM: 611338) cysteine protease ATG4B i ( 380) 1462 345.5 1.3e-94
XP_005247050 (OMIM: 611338) PREDICTED: cysteine pr ( 405) 1462 345.5 1.3e-94
XP_005247049 (OMIM: 611338) PREDICTED: cysteine pr ( 415) 1462 345.5 1.4e-94
XP_016859126 (OMIM: 611338) PREDICTED: cysteine pr ( 479) 1462 345.6 1.5e-94
NP_840054 (OMIM: 300663) cysteine protease ATG4A i ( 336) 1457 344.3 2.5e-94
XP_011529143 (OMIM: 300663) PREDICTED: cysteine pr ( 340) 1444 341.3 2.1e-93
XP_016859129 (OMIM: 611338) PREDICTED: cysteine pr ( 319) 1218 289.2 9.5e-78
XP_005247053 (OMIM: 611338) PREDICTED: cysteine pr ( 319) 1218 289.2 9.5e-78
XP_016859128 (OMIM: 611338) PREDICTED: cysteine pr ( 341) 1178 280.0 6e-75
XP_005247052 (OMIM: 611338) PREDICTED: cysteine pr ( 341) 1178 280.0 6e-75
NP_001308218 (OMIM: 300663) cysteine protease ATG4 ( 259)  933 223.4 4.9e-58
XP_006722990 (OMIM: 611340) PREDICTED: cysteine pr ( 360)  352  89.5 1.3e-17
XP_005271345 (OMIM: 611339) PREDICTED: cysteine pr ( 458)  349  88.9 2.6e-17
NP_835739 (OMIM: 611339) cysteine protease ATG4C [ ( 458)  349  88.9 2.6e-17
NP_116241 (OMIM: 611339) cysteine protease ATG4C [ ( 458)  349  88.9 2.6e-17
XP_011540615 (OMIM: 611339) PREDICTED: cysteine pr ( 423)  326  83.6 9.8e-16
NP_001268433 (OMIM: 611340) cysteine protease ATG4 ( 411)  256  67.5 6.9e-11
NP_116274 (OMIM: 611340) cysteine protease ATG4D i ( 474)  256  67.5 7.7e-11
XP_016858090 (OMIM: 611339) PREDICTED: cysteine pr ( 418)  218  58.7 3e-08
XP_016858089 (OMIM: 611339) PREDICTED: cysteine pr ( 418)  218  58.7 3e-08

>>NP_443168 (OMIM: 300663) cysteine protease ATG4A isofo (398 aa)
 initn: 2751 init1: 2751 opt: 2751 Z-score: 3425.8 bits: 642.7 E(85289): 4.4e-184
Smith-Waterman score: 2751; 100.0% identity (100.0% similar) in 398 aa overlap (1-398:1-398)

               10        20        30        40        50        60
pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_443 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_443 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_443 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 DIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_443
DIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_443 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTF 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLRMFELVQKHPSHWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_443 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLRMFELVQKHPSHWP 310 320 330 340 350 360 370 380 390 pF1KE2 PFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV :::::::::::::::::::::::::::::::::::::: NP_443 PFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV 370 380 390 >>NP_001308216 (OMIM: 300663) cysteine protease ATG4A is (321 aa) initn: 2227 init1: 2227 opt: 2227 Z-score: 2774.2 bits: 521.9 E(85289): 8.8e-148 Smith-Waterman score: 2227; 100.0% identity (100.0% similar) in 321 aa overlap (78-398:1-321) 50 60 70 80 90 100 pF1KE2 DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQ :::::::::::::::::::::::::::::: NP_001 MLRCGQMMLAQALICRHLGRDWSWEKQKEQ 10 20 30 110 120 130 140 150 160 pF1KE2 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE2 AVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIV 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE2 PLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFV 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE2 DTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLR 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLR 220 230 240 250 260 270 350 360 370 380 390 pF1KE2 MFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV ::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV 280 290 300 310 320 >>NP_840055 (OMIM: 300663) cysteine protease ATG4A isofo (321 aa) initn: 2227 init1: 2227 opt: 2227 Z-score: 2774.2 bits: 521.9 E(85289): 8.8e-148 Smith-Waterman score: 2227; 100.0% identity (100.0% similar) in 321 aa overlap (78-398:1-321) 50 60 70 80 90 100 pF1KE2 DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQ :::::::::::::::::::::::::::::: NP_840 MLRCGQMMLAQALICRHLGRDWSWEKQKEQ 10 20 30 110 120 130 140 150 160 pF1KE2 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_840 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE2 AVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_840 AVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIV 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE2 PLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_840 PLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFV 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE2 DTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_840 DTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLR 220 230 240 250 260 270 350 360 370 380 390 pF1KE2 MFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV ::::::::::::::::::::::::::::::::::::::::::::::::::: NP_840 MFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV 280 290 300 310 320 >>NP_001308217 (OMIM: 300663) cysteine 
protease ATG4A is (321 aa) initn: 2227 init1: 2227 opt: 2227 Z-score: 2774.2 bits: 521.9 E(85289): 8.8e-148 Smith-Waterman score: 2227; 100.0% identity (100.0% similar) in 321 aa overlap (78-398:1-321) 50 60 70 80 90 100 pF1KE2 DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQ :::::::::::::::::::::::::::::: NP_001 MLRCGQMMLAQALICRHLGRDWSWEKQKEQ 10 20 30 110 120 130 140 150 160 pF1KE2 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE2 AVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIV 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE2 PLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFV 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE2 DTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLR 220 230 240 250 260 270 350 360 370 380 390 pF1KE2 MFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV ::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV 280 290 300 310 320 >>XP_011529144 (OMIM: 300663) PREDICTED: cysteine protea (321 aa) initn: 2227 init1: 2227 opt: 2227 Z-score: 2774.2 bits: 521.9 E(85289): 8.8e-148 Smith-Waterman score: 2227; 100.0% identity (100.0% similar) in 321 aa overlap (78-398:1-321) 50 60 70 80 90 100 pF1KE2 DISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQ :::::::::::::::::::::::::::::: XP_011 MLRCGQMMLAQALICRHLGRDWSWEKQKEQ 10 20 30 110 120 130 140 150 160 pF1KE2 
PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSL 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE2 AVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 AVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIV 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE2 PLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFV 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE2 DTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 DTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLR 220 230 240 250 260 270 350 360 370 380 390 pF1KE2 MFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV ::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV 280 290 300 310 320 >>NP_001308219 (OMIM: 300663) cysteine protease ATG4A is (226 aa) initn: 1546 init1: 1546 opt: 1546 Z-score: 1927.7 bits: 364.7 E(85289): 1.2e-100 Smith-Waterman score: 1546; 100.0% identity (100.0% similar) in 226 aa overlap (173-398:1-226) 150 160 170 180 190 200 pF1KE2 GEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPPD :::::::::::::::::::::::::::::: NP_001 MDNTVVIEDIKKMCRVLPLSADTAGDRPPD 10 20 30 210 220 230 240 250 260 pF1KE2 SLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLGALGGKPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLGALGGKPN 40 50 60 70 80 90 270 280 290 300 310 320 pF1KE2 NAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 
NAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGF 100 110 120 130 140 150 330 340 350 360 370 380 pF1KE2 FCKEEKDFDNWCSLVQKEILKENLRMFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FCKEEKDFDNWCSLVQKEILKENLRMFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTE 160 170 180 190 200 210 390 pF1KE2 QLEEFDLEEDFEILSV :::::::::::::::: NP_001 QLEEFDLEEDFEILSV 220 >>NP_037457 (OMIM: 611338) cysteine protease ATG4B isofo (393 aa) initn: 1512 init1: 877 opt: 1502 Z-score: 1869.4 bits: 354.7 E(85289): 2.2e-97 Smith-Waterman score: 1502; 55.8% identity (80.3% similar) in 391 aa overlap (15-398:13-393) 10 20 30 40 50 60 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK :... :..:.:.: :::::... . :::...:::...:::::::.. NP_037 MDAATLTYDTLRFAEF-EDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKN 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD : ::::::.::.:::::::::::..::::.:::::::: : ..:.:: : .:. :.: NP_037 FPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFID 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE ::: :::::.::::::::::::.:.:::::::::::::.:: :.::::...::::::.: NP_037 RKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVME 120 130 140 150 160 170 190 200 210 220 230 pF1KE2 DIKKMCRV-LPLSADTAGDRPPDSLTASN---QSKGTSAYCSAWKPLLLIVPLRLGINQI .:...::. .: .. :: : :: : . .. : :.::.:..:::::...: NP_037 EIRRLCRTSVPCAGATA--FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDI 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE2 NPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVN : .::...:.:: ::::::..:::::.:.::::..:.:::.::::::: :. .. . NP_037 NEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQK-EILKENLRMFELVQKH :..::: . : ::.: .::::.:.::::: : ::..::. :.: .: : :::::. . 
NP_037 DESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQ 300 310 320 330 340 350 360 370 380 390 pF1KE2 PSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEF-DLE-EDFEILSV ::: : :.: . . . : .:.::.: : : :::::::. NP_037 PSHL------ACPDVLNLSLDSSD-VERLERFFDSEDEDFEILSL 360 370 380 390 >>XP_016859127 (OMIM: 611338) PREDICTED: cysteine protea (457 aa) initn: 1512 init1: 877 opt: 1502 Z-score: 1868.5 bits: 354.8 E(85289): 2.4e-97 Smith-Waterman score: 1502; 55.8% identity (80.3% similar) in 391 aa overlap (15-398:77-457) 10 20 30 40 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSK :... :..:.:.: :::::... . :::.. XP_016 YGTDTMATMTVCSAPGGSGCVATLTYDTLRFAEF-EDFPETSEPVWILGRKYSIFTEKDE 50 60 70 80 90 100 50 60 70 80 90 100 pF1KE2 LLSDISARLWFTYRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQ .:::...:::::::..: ::::::.::.:::::::::::..::::.:::::::: : .. XP_016 ILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQR 110 120 130 140 150 160 110 120 130 140 150 160 pF1KE2 KEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEW :.:: : .:. :.:::: :::::.::::::::::::.:.:::::::::::::.:: : XP_016 KRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTW 170 180 190 200 210 220 170 180 190 200 210 220 pF1KE2 NSLAVYVSMDNTVVIEDIKKMCRV-LPLSADTAGDRPPDSLTASN---QSKGTSAYCSAW .::::...::::::.:.:...::. .: .. :: : :: : . .. : : XP_016 SSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATA--FPADSDRHCNGFPAGAEVTNRPSPW 230 240 250 260 270 280 230 240 250 260 270 280 pF1KE2 KPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDP .::.:..:::::...:: .::...:.:: ::::::..:::::.:.::::..:.:::.::: XP_016 RPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDP 290 300 310 320 330 340 290 300 310 320 330 pF1KE2 HTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQK- :::: :. .. . :..::: . : ::.: .::::.:.::::: : ::..::. :.: XP_016 HTTQPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKL 350 360 370 380 390 400 340 350 360 370 380 390 pF1KE2 EILKENLRMFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEF-DLE-EDFEILS .: : :::::. 
.::: : :.: . . . : .:.::.: : : ::::::: XP_016 SLLGGALPMFELVELQPSHL------ACPDVLNLSLDSSD-VERLERFFDSEDEDFEILS 410 420 430 440 450 pF1KE2 V . XP_016 L >>NP_847896 (OMIM: 611338) cysteine protease ATG4B isofo (380 aa) initn: 1482 init1: 877 opt: 1462 Z-score: 1819.8 bits: 345.5 E(85289): 1.3e-94 Smith-Waterman score: 1462; 57.6% identity (82.8% similar) in 349 aa overlap (15-358:13-358) 10 20 30 40 50 60 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK :... :..:.:.: :::::... . :::...:::...:::::::.. NP_847 MDAATLTYDTLRFAEF-EDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKN 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD : ::::::.::.:::::::::::..::::.:::::::: : ..:.:: : .:. :.: NP_847 FPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFID 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE ::: :::::.::::::::::::.:.:::::::::::::.:: :.::::...::::::.: NP_847 RKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVME 120 130 140 150 160 170 190 200 210 220 230 pF1KE2 DIKKMCRV-LPLSADTAGDRPPDSLTASN---QSKGTSAYCSAWKPLLLIVPLRLGINQI .:...::. .: .. :: : :: : . .. : :.::.:..:::::...: NP_847 EIRRLCRTSVPCAGATA--FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDI 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE2 NPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVN : .::...:.:: ::::::..:::::.:.::::..:.:::.::::::: :. .. . NP_847 NEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQK-EILKENLRMFELVQKH :..::: . : ::.: .::::.:.::::: : ::..::. :.: .: : :::::. . 
NP_847 DESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQ 300 310 320 330 340 350 360 370 380 390 pF1KE2 PSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV ::: NP_847 PSHLACPDVLNLSLGESCQVQILLM 360 370 380 >>XP_005247050 (OMIM: 611338) PREDICTED: cysteine protea (405 aa) initn: 1512 init1: 877 opt: 1462 Z-score: 1819.4 bits: 345.5 E(85289): 1.3e-94 Smith-Waterman score: 1487; 55.1% identity (79.5% similar) in 396 aa overlap (15-398:13-405) 10 20 30 40 50 60 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK :... :..:.:.: :::::... . :::...:::...:::::::.. XP_005 MDAATLTYDTLRFAEF-EDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKN 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD : ::::::.::.:::::::::::..::::.:::::::: : ..:.:: : .:. :.: XP_005 FPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFID 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE ::: :::::.::::::::::::.:.:::::::::::::.:: :.::::...::::::.: XP_005 RKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVME 120 130 140 150 160 170 190 200 210 220 230 pF1KE2 DIKKMCRV-LPLSADTAGDRPPDSLTASN---QSKGTSAYCSAWKPLLLIVPLRLGINQI .:...::. .: .. :: : :: : . .. : :.::.:..:::::...: XP_005 EIRRLCRTSVPCAGATA--FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDI 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE2 NPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVN : .::...:.:: ::::::..:::::.:.::::..:.:::.::::::: :. .. . XP_005 NEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQK-EILKENLRMFELVQKH :..::: . : ::.: .::::.:.::::: : ::..::. :.: .: : :::::. . XP_005 DESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQ 300 310 320 330 340 350 360 370 380 390 pF1KE2 PSHW--PPFVPPAKPEVTTTGAEFI-DST--EQLEEF-DLE-EDFEILSV ::: : . . : . . . ::. :.::.: : : :::::::. 
XP_005 PSHLACPDVLNLSLGESCQVQVGSLGDSSDVERLERFFDSEDEDFEILSL
         360       370       380       390       400

398 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 12:45:24 2016 done: Sun Nov 6 12:45:25 2016
 Total Scan time: 7.430 Total Display time: 0.040

Function used was FASTA [36.3.4 Apr, 2011]