FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2316, 398 aa 1>>>pF1KE2316 398 - 398 aa - 398 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2012+/-0.000653; mu= 17.8632+/- 0.040 mean_var=62.1028+/-12.315, 0's: 0 Z-trim(110.1): 19 B-trim: 7 in 1/50 Lambda= 0.162749 statistics sampled from 11315 (11328) to 11315 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.721), E-opt: 0.2 (0.348), width: 16 Scan time: 2.480 The best scores are: opt bits E(32554) CCDS14538.1 ATG4A gene_id:115201|Hs108|chrX ( 398) 2751 654.1 6.7e-188 CCDS46564.1 ATG4B gene_id:23192|Hs108|chr2 ( 393) 1502 360.8 1.3e-99 CCDS46565.1 ATG4B gene_id:23192|Hs108|chr2 ( 380) 1462 351.4 8.2e-97 CCDS14539.1 ATG4A gene_id:115201|Hs108|chrX ( 336) 1457 350.2 1.7e-96 CCDS623.1 ATG4C gene_id:84938|Hs108|chr1 ( 458) 349 90.1 4.5e-18 CCDS12241.1 ATG4D gene_id:84971|Hs108|chr19 ( 474) 256 68.3 1.7e-11 >>CCDS14538.1 ATG4A gene_id:115201|Hs108|chrX (398 aa) initn: 2751 init1: 2751 opt: 2751 Z-score: 3486.9 bits: 654.1 E(32554): 6.7e-188 Smith-Waterman score: 2751; 100.0% identity (100.0% similar) in 398 aa overlap (1-398:1-398) 10 20 30 40 50 60 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 DIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 DIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTF 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLRMFELVQKHPSHWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLRMFELVQKHPSHWP 310 320 330 340 350 360 370 380 390 pF1KE2 PFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV :::::::::::::::::::::::::::::::::::::: CCDS14 PFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV 370 380 390 >>CCDS46564.1 ATG4B gene_id:23192|Hs108|chr2 (393 aa) initn: 1512 init1: 877 opt: 1502 Z-score: 1902.1 bits: 360.8 E(32554): 1.3e-99 Smith-Waterman score: 1502; 55.8% identity (80.3% similar) in 391 aa overlap (15-398:13-393) 10 20 30 40 50 60 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK :... :..:.:.: :::::... . :::...:::...:::::::.. CCDS46 MDAATLTYDTLRFAEF-EDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKN 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD : ::::::.::.:::::::::::..::::.:::::::: : ..:.:: : .:. :.: CCDS46 FPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFID 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE ::: :::::.::::::::::::.:.:::::::::::::.:: :.::::...::::::.: CCDS46 RKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVME 120 130 140 150 160 170 190 200 210 220 230 pF1KE2 DIKKMCRV-LPLSADTAGDRPPDSLTASN---QSKGTSAYCSAWKPLLLIVPLRLGINQI .:...::. .: .. :: : :: : . .. : :.::.:..:::::...: CCDS46 EIRRLCRTSVPCAGATA--FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDI 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE2 NPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVN : .::...:.:: ::::::..:::::.:.::::..:.:::.::::::: :. .. . CCDS46 NEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQK-EILKENLRMFELVQKH :..::: . : ::.: .::::.:.::::: : ::..::. :.: .: : :::::. . CCDS46 DESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQ 300 310 320 330 340 350 360 370 380 390 pF1KE2 PSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEF-DLE-EDFEILSV ::: : :.: . . . : .:.::.: : : :::::::. CCDS46 PSHL------ACPDVLNLSLDSSD-VERLERFFDSEDEDFEILSL 360 370 380 390 >>CCDS46565.1 ATG4B gene_id:23192|Hs108|chr2 (380 aa) initn: 1482 init1: 877 opt: 1462 Z-score: 1851.6 bits: 351.4 E(32554): 8.2e-97 Smith-Waterman score: 1462; 57.6% identity (82.8% similar) in 349 aa overlap (15-358:13-358) 10 20 30 40 50 60 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK :... :..:.:.: :::::... . :::...:::...:::::::.. CCDS46 MDAATLTYDTLRFAEF-EDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKN 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD : ::::::.::.:::::::::::..::::.:::::::: : ..:.:: : .:. :.: CCDS46 FPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFID 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE ::: :::::.::::::::::::.:.:::::::::::::.:: :.::::...::::::.: CCDS46 RKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVME 120 130 140 150 160 170 190 200 210 220 230 pF1KE2 DIKKMCRV-LPLSADTAGDRPPDSLTASN---QSKGTSAYCSAWKPLLLIVPLRLGINQI .:...::. .: .. :: : :: : . .. : :.::.:..:::::...: CCDS46 EIRRLCRTSVPCAGATA--FPADSDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDI 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE2 NPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVN : .::...:.:: ::::::..:::::.:.::::..:.:::.::::::: :. .. . CCDS46 NEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIP 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE2 DQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQK-EILKENLRMFELVQKH :..::: . : ::.: .::::.:.::::: : ::..::. :.: .: : :::::. . CCDS46 DESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQ 300 310 320 330 340 350 360 370 380 390 pF1KE2 PSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV ::: CCDS46 PSHLACPDVLNLSLGESCQVQILLM 360 370 380 >>CCDS14539.1 ATG4A gene_id:115201|Hs108|chrX (336 aa) initn: 2306 init1: 1453 opt: 1457 Z-score: 1846.0 bits: 350.2 E(32554): 1.7e-96 Smith-Waterman score: 2186; 84.4% identity (84.4% similar) in 398 aa overlap (1-398:1-336) 10 20 30 40 50 60 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 FSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEKQKEQPKEYQRILQCFLD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RKDCCYSIHQMAQMGVGEGKSIGEWFGPNTVAQVLKKLALFDEWNSLAVYVSMDNTVVIE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 DIKKMCRVLPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVY :::::::::::::::::::::::::::::: CCDS14 DIKKMCRVLPLSADTAGDRPPDSLTASNQS------------------------------ 190 200 210 250 260 270 280 290 300 pF1KE2 VDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTF :::::::::::::::::::::::::::: CCDS14 --------------------------------DELIFLDPHTTQTFVDTEENGTVNDQTF 220 230 310 320 330 340 350 360 pF1KE2 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLRMFELVQKHPSHWP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 HCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEILKENLRMFELVQKHPSHWP 240 250 260 270 280 290 370 380 390 pF1KE2 PFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV :::::::::::::::::::::::::::::::::::::: CCDS14 PFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV 300 310 320 330 >>CCDS623.1 ATG4C gene_id:84938|Hs108|chr1 (458 aa) initn: 632 init1: 215 opt: 349 Z-score: 438.0 bits: 90.1 E(32554): 4.5e-18 Smith-Waterman score: 585; 29.6% identity (54.7% similar) in 422 aa overlap (34-391:68-454) 10 20 30 40 50 60 pF1KE2 VLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKSKLLSDISARLWFTYRRKFSP ..:.. . .. .:. .:.:.:::..: CCDS62 NSPVLLLGKCYHFKYEDEDKTLPAESGCTIEDHVIAGNVEEFRKDFISRIWLTYREEFPQ 40 50 60 70 80 90 70 80 90 100 pF1KE2 IGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSW---------------------- : :.. ..: :::: :: :::.:::.:: . ::: :.: CCDS62 IEGSALTTDCGWGCTLRTGQMLLAQGLILHFLGRAWTWPDALNIENSDSESWTSHTVKKF 100 110 120 130 140 150 110 120 130 pF1KE2 ----------EKQKEQP----KE---------------YQR-ILQCFLDRKDCCYSIHQM :.. . : :: :.: :.. : : ...::. CCDS62 TASFEASLSGEREFKTPTISLKETIGKYSDDHEMRNEVYHRKIISWFGDSPLALFGLHQL 160 170 180 190 200 210 140 150 160 170 180 pF1KE2 AQMGVGEGKSIGEWFGPNTVAQVLKKL---ALFDEWNSLAVYVSMDNTVVIEDIKKMCRV ..: ::. :.:.:: .::..:.: : . .....::..: :: :. CCDS62 IEYGKKSGKKAGDWYGPAVVAHILRKAVEEARHPDLQGITIYVAQDCTVYNSDVI----- 220 230 240 250 260 270 190 200 210 220 230 240 pF1KE2 LPLSADTAGDRPPDSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVYVDAFKECF :. :.:..: . : ....::.::: .. : :.. : . CCDS62 ---------DKQSASMTSDNADD---------KAVIILVPVRLGGERTNTDYLEFVKGIL 280 290 300 310 250 260 270 280 290 300 pF1KE2 KMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVND---QTFHCLQS .. .: .::::...::: :: : ::..::: :.:::. ...: .:::: : CCDS62 SLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSFVDV----SIKDFPLETFHC-PS 320 330 340 350 360 310 320 330 340 350 360 pF1KE2 PQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEIL---KENLRMFELVQKHPSHWPPF :..:.. ..::: ..::.:.. .:: . : . ::. .: .:. : . : CCDS62 PKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSKEKYPLFTFVNGHSRDYD-F 370 380 390 400 410 420 370 380 390 pF1KE2 VPPAKPEVTTTGAEFI---DSTEQLEEFDLEEDFEILSV . :::. : . : .::..:. :: CCDS62 TS------TTTNEEDLFSEDEKKQLKRFSTEEFVLL 430 440 450 >>CCDS12241.1 ATG4D gene_id:84971|Hs108|chr19 (474 aa) initn: 673 init1: 254 opt: 256 Z-score: 319.7 bits: 68.3 E(32554): 1.7e-11 Smith-Waterman score: 654; 31.5% identity (57.3% similar) in 368 aa overlap (29-355:94-434) 10 20 30 40 50 pF1KE2 MESVLSKYEDQITIFTDYLEEYPDTDELVWILGKQHLLKTEKS--KLLSDISARLWFT . . :... .. : . .. :. .:::.: CCDS12 KFKAKFLTAWNNVKYGWVVKSRTSFSKISSIHLCGRRYRFEGEGDIQRFQRDFVSRLWLT 70 80 90 100 110 120 60 70 80 90 100 pF1KE2 YRRKFSPIGGTGPSSDAGWGCMLRCGQMMLAQALICRHLGRDWSWEK------------- ::: : :. : .:: ::::::: :::::::.:. . : :::.: . CCDS12 YRRDFPPLPGGCLTSDCGWGCMLRSGQMMLAQGLLLHFLPRDWTWAEGMGLGPPELSGSA 130 140 150 160 170 180 110 120 130 140 pF1KE2 ---------------------QKEQPKEYQRILQCFLDRKDCCYSIHQMAQMGVGEGKSI . :: .....:.. : :. ...:.....: . ::. CCDS12 SPSRYHGPARWMPPRWAQGAPELEQERRHRQIVSWFADHPRAPFGLHRLVELGQSSGKKA 190 200 210 220 230 240 150 160 170 180 190 200 pF1KE2 GEWFGPNTVAQVLKK-LALFDEWNSLAVYVSMDNTVVIEDIKKMCRVLPLSADTAGDRPP :.:.::. ::..:.: . .. . :.::::.: :: :. .. :: CCDS12 GDWYGPSLVAHILRKAVESCSDVTRLVVYVSQDCTVYKADVARLVA-----------RPD 250 260 270 280 290 210 220 230 240 250 260 pF1KE2 DSLTASNQSKGTSAYCSAWKPLLLIVPLRLGINQINPVYVDAFKECFKMPQSLGALGGKP . . :: ....::.::: . .::::: :: .. :: .:::: CCDS12 PT--------------AEWKSVVILVPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKP 300 310 320 330 270 280 290 300 310 320 pF1KE2 NNAYYFIGFLGDELIFLDPHTTQTFVDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALG .. ::::. : :..:::: : ::. . . ..::: ::..: . ..::: ..: CCDS12 RHSLYFIGYQDDFLLYLDPHYCQPTVDVSQ-ADFPLESFHCT-SPRKMAFAKMDPSCTVG 340 350 360 370 380 390 330 340 350 360 370 pF1KE2 FFCKEEKDFDNWCSLVQKEILK----ENLRMFELVQKHPSHWPPFVPPAKPEVTTTGAEF :. ..:.:.. :: . . . . : :: :.. : CCDS12 FYAGDRKEFETLCSELTRVLSSSSATERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPR 400 410 420 430 440 450 380 390 pF1KE2 IDSTEQLEEFDLEEDFEILSV CCDS12 TGRLLRAKRPSSEDFVFL 460 470 398 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:45:24 2016 done: Sun Nov 6 12:45:24 2016 Total Scan time: 2.480 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]