FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4306, 306 aa
  1>>>pF1KE4306 306 - 306 aa - 306 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.7209+/-0.000321; mu= 19.1969+/- 0.020
 mean_var=62.2644+/-12.705, 0's: 0 Z-trim(116.4): 15  B-trim: 1205 in 1/51
 Lambda= 0.162538
 statistics sampled from 27560 (27567) to 27560 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.703), E-opt: 0.2 (0.323), width: 16
 Scan time: 7.100

The best scores are:                                       opt  bits E(85289)
NP_000301 (OMIM: 256730,600722) palmitoyl-protein  ( 306) 2100 500.6 1.6e-141
XP_005271065 (OMIM: 256730,600722) PREDICTED: palm ( 282) 1663 398.1 1.1e-110
NP_001136076 (OMIM: 256730,600722) palmitoyl-prote ( 203) 1108 267.8  1.3e-71
NP_005146 (OMIM: 603298) lysosomal thioesterase PP ( 302)  257  68.4    2e-11
NP_001191032 (OMIM: 603298) lysosomal thioesterase ( 302)  257  68.4    2e-11
NP_619731 (OMIM: 603298) lysosomal thioesterase PP ( 308)  257  68.4  2.1e-11

>>NP_000301 (OMIM: 256730,600722) palmitoyl-protein thio  (306 aa)
 initn: 2100 init1: 2100 opt: 2100  Z-score: 2661.6  bits: 500.6 E(85289): 1.6e-141
Smith-Waterman score: 2100; 100.0% identity (100.0% similar) in 306 aa overlap (1-306:1-306)

               10        20        30        40        50        60
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
              250       260       270       280       290       300

pF1KE4 IIPFLG
       ::::::
NP_000 IIPFLG

>>XP_005271065 (OMIM: 256730,600722) PREDICTED: palmitoy  (282 aa)
 initn: 1663 init1: 1663 opt: 1663  Z-score: 2108.2  bits: 398.1 E(85289): 1.1e-110
Smith-Waterman score: 1878; 92.2% identity (92.2% similar) in 306 aa overlap (1-306:1-282)

               10        20        30        40        50        60
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       ::                        ::::::::::::::::::::::::::::::::::
XP_005 SE------------------------DRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
                                      250       260       270

pF1KE4 IIPFLG
       ::::::
XP_005 IIPFLG
        280

>>NP_001136076 (OMIM: 256730,600722) palmitoyl-protein t  (203 aa)
 initn: 1098 init1: 1098 opt: 1108  Z-score: 1406.9  bits: 267.8 E(85289): 1.3e-71
Smith-Waterman score: 1190; 66.3% identity (66.3% similar) in 306 aa overlap (1-306:1-203)

               10        20        30        40        50        60
pF1KE4 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEK
       ::::::::::::::::::::::::::::::::::::::::::
NP_001 MASPGCLWLLAVALLPWTCASRALQHLDPPAPLPLVIWHGMG------------------
               10        20        30        40

               70        80        90       100       110       120
pF1KE4 KIPGIYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQGYNAMGFSQGGQF

NP_001 ------------------------------------------------------------


              130       140       150       160       170       180
pF1KE4 LRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
                                :::::::::::::::::::::::::::::::::::
NP_001 -------------------------VFGLPRCPGESSHICDFIRKTLNAGAYSKVVQERL
                                      50        60        70

              190       200       210       220       230       240
pF1KE4 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VQAEYWHDPIKEDVYRNHSIFLADINQERGINESYKKNLMALKKFVMVKFLNDSIVDPVD
        80        90       100       110       120       130

              250       260       270       280       290       300
pF1KE4 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SEWFGFYRSGQAKETIPLQETSLYTQDRLGLKEMDNAGQLVFLATEGDHLQLSEEWFYAH
       140       150       160       170       180       190

pF1KE4 IIPFLG
       ::::::
NP_001 IIPFLG
       200

>>NP_005146 (OMIM: 603298) lysosomal thioesterase PPT2 i  (302 aa)
 initn: 245 init1: 107 opt: 257  Z-score: 326.0  bits: 68.4 E(85289): 2e-11
Smith-Waterman score: 296; 28.5% identity (56.6% similar) in 288 aa overlap (8-281:13-274)

                    10        20        30             40        50
pF1KE4      MASPGCLWLLAVALLPWTCASRALQHLDPPAPL-----PLVIWHGMGDSCCNPLS
                   :.:   :::.      :  :  :::      :... ::. ::     :
NP_005 MLGLCGQRLPAAWVLL--LLPFL----PLLLLAAPAPHRASYKPVIVVHGLFDSS---YS
               10          20            30        40           50

               60          70        80        90       100
pF1KE4 MGAIKKMVEKKIPG--IYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQG
       .  . .....  ::  . ::.:  :.  ..     .. .:..   .:   .:: :   ::
NP_005 FRHLLEYINETHPGTVVTVLDLFDGRESLRP----LWEQVQGFREAVVPIMAKAP---QG
              60        70        80            90       100

      110       120       130       140       150       160
pF1KE4 YNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDF----IR
        . . .::::   ::. .   .  . ..::... ..: .:      .....  .    .:
NP_005 VHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYG------DTDYLKWLFPTSMR
          110       120       130       140             150

          170       180       190       200       210         220
pF1KE4 KTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINES--YKKNLMAL
       ..:    ::   ::  .  .::::: ..:.: : : ::: :: ::   ..  ..::.. .
NP_005 SNLYRICYSPWGQEFSI-CNYWHDPHHDDLYLNASSFLALINGERDHPNATVWRKNFLRV
      160       170        180       190       200       210

            230       240       250        260       270       280
pF1KE4 KKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETI-PLQETSLYTQDRLGLKEMDNAGQLV
        ..:..   .:... : .: .::::   .:.::.  ..:  .: .: .::: .   : .:
NP_005 GHLVLIGGPDDGVITPWQSSFFGFY---DANETVLEMEEQLVYLRDSFGLKTLLARGAIV
       220       230       240          250       260       270

             290       300
pF1KE4 FLATEGDHLQLSEEWFYAHIIPFLG

NP_005 RCPMAGISHTAWHSNRTLYETCIEPWLS
          280       290       300

>>NP_001191032 (OMIM: 603298) lysosomal thioesterase PPT  (302 aa)
 initn: 245 init1: 107 opt: 257  Z-score: 326.0  bits: 68.4 E(85289): 2e-11
Smith-Waterman score: 296; 28.5% identity (56.6% similar) in 288 aa overlap (8-281:13-274)

                    10        20        30             40        50
pF1KE4      MASPGCLWLLAVALLPWTCASRALQHLDPPAPL-----PLVIWHGMGDSCCNPLS
                   :.:   :::.      :  :  :::      :... ::. ::     :
NP_001 MLGLCGQRLPAAWVLL--LLPFL----PLLLLAAPAPHRASYKPVIVVHGLFDSS---YS
               10          20            30        40           50

               60          70        80        90       100
pF1KE4 MGAIKKMVEKKIPG--IYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKDPKLQQG
       .  . .....  ::  . ::.:  :.  ..     .. .:..   .:   .:: :   ::
NP_001 FRHLLEYINETHPGTVVTVLDLFDGRESLRP----LWEQVQGFREAVVPIMAKAP---QG
              60        70        80            90       100

      110       120       130       140       150       160
pF1KE4 YNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDF----IR
        . . .::::   ::. .   .  . ..::... ..: .:      .....  .    .:
NP_001 VHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYG------DTDYLKWLFPTSMR
          110       120       130       140             150

          170       180       190       200       210         220
pF1KE4 KTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINES--YKKNLMAL
       ..:    ::   ::  .  .::::: ..:.: : : ::: :: ::   ..  ..::.. .
NP_001 SNLYRICYSPWGQEFSI-CNYWHDPHHDDLYLNASSFLALINGERDHPNATVWRKNFLRV
      160       170        180       190       200       210

            230       240       250        260       270       280
pF1KE4 KKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETI-PLQETSLYTQDRLGLKEMDNAGQLV
        ..:..   .:... : .: .::::   .:.::.  ..:  .: .: .::: .   : .:
NP_001 GHLVLIGGPDDGVITPWQSSFFGFY---DANETVLEMEEQLVYLRDSFGLKTLLARGAIV
       220       230       240          250       260       270

             290       300
pF1KE4 FLATEGDHLQLSEEWFYAHIIPFLG

NP_001 RCPMAGISHTAWHSNRTLYETCIEPWLS
          280       290       300

>>NP_619731 (OMIM: 603298) lysosomal thioesterase PPT2 i  (308 aa)
 initn: 245 init1: 107 opt: 257  Z-score: 325.9  bits: 68.4 E(85289): 2.1e-11
Smith-Waterman score: 296; 28.5% identity (56.6% similar) in 288 aa overlap (8-281:19-280)

                          10        20        30             40
pF1KE4            MASPGCLWLLAVALLPWTCASRALQHLDPPAPL-----PLVIWHGMGDS
                         :.:   :::.      :  :  :::      :... ::. ::
NP_619 MKSCGSMLGLCGQRLPAAWVLL--LLPFL----PLLLLAAPAPHRASYKPVIVVHGLFDS
               10        20              30        40        50

           50        60          70        80        90       100
pF1KE4 CCNPLSMGAIKKMVEKKIPG--IYVLSLEIGKTLMEDVENSFFLNVNSQVTTVCQALAKD
            :.  . .....  ::  . ::.:  :.  ..     .. .:..   .:   .::
NP_619 S---YSFRHLLEYINETHPGTVVTVLDLFDGRESLRP----LWEQVQGFREAVVPIMAKA
              60        70        80            90       100

            110       120       130       140       150       160
pF1KE4 PKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMINLISVGGQHQGVFGLPRCPGESSHICDF
       :   :: . . .::::   ::. .   .  . ..::... ..: .:      .....  .
NP_619 P---QGVHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYG------DTDYLKWL
          110       120       130       140       150

                170       180       190       200       210
pF1KE4 ----IRKTLNAGAYSKVVQERLVQAEYWHDPIKEDVYRNHSIFLADINQERGINES--YK
           .:..:    ::   ::  .  .::::: ..:.: : : ::: :: ::   ..  ..
NP_619 FPTSMRSNLYRICYSPWGQEFSI-CNYWHDPHHDDLYLNASSFLALINGERDHPNATVWR
      160       170       180        190       200       210

        220       230       240       250        260       270
pF1KE4 KNLMALKKFVMVKFLNDSIVDPVDSEWFGFYRSGQAKETI-PLQETSLYTQDRLGLKEMD
       ::.. . ..:..   .:... : .: .::::   .:.::.  ..:  .: .: .::: .
NP_619 KNFLRVGHLVLIGGPDDGVITPWQSSFFGFY---DANETVLEMEEQLVYLRDSFGLKTLL
       220       230       240          250       260       270

         280       290       300
pF1KE4 NAGQLVFLATEGDHLQLSEEWFYAHIIPFLG
         : .:
NP_619 ARGAIVRCPMAGISHTAWHSNRTLYETCIEPWLS
          280       290       300


306 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 23:28:04 2016 done: Sat Nov 5 23:28:05 2016
 Total Scan time: 7.100 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]