FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4389, 408 aa 1>>>pF1KE4389 408 - 408 aa - 408 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0826+/-0.000396; mu= 17.9817+/- 0.024 mean_var=66.4280+/-13.826, 0's: 0 Z-trim(112.1): 16 B-trim: 1269 in 1/51 Lambda= 0.157362 statistics sampled from 20854 (20866) to 20854 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.621), E-opt: 0.2 (0.245), width: 16 Scan time: 8.420 The best scores are: opt bits E(85289) NP_000657 (OMIM: 104620,609924) aminoacylase-1 iso ( 408) 2781 640.5 2.1e-183 NP_001185824 (OMIM: 104620,609924) aminoacylase-1 ( 408) 2781 640.5 2.1e-183 NP_001185827 (OMIM: 104620,609924) aminoacylase-1 ( 373) 2336 539.5 5.1e-153 NP_001185825 (OMIM: 104620,609924) aminoacylase-1 ( 336) 1595 371.2 2.1e-102 NP_001185826 (OMIM: 104620,609924) aminoacylase-1 ( 343) 1498 349.2 8.9e-96 XP_016855886 (OMIM: 617124) PREDICTED: probable ca ( 425) 187 51.7 4.2e-06 NP_116038 (OMIM: 609064) beta-Ala-His dipeptidase ( 507) 163 46.3 0.00021 >>NP_000657 (OMIM: 104620,609924) aminoacylase-1 isoform (408 aa) initn: 2781 init1: 2781 opt: 2781 Z-score: 3413.6 bits: 640.5 E(85289): 2.1e-183 Smith-Waterman score: 2781; 100.0% identity (100.0% similar) in 408 aa overlap (1-408:1-408) 10 20 30 40 50 60 pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL 310 320 330 340 350 360 370 380 390 400 pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS :::::::::::::::::::::::::::::::::::::::::::::::: NP_000 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS 370 380 390 400 >>NP_001185824 (OMIM: 104620,609924) aminoacylase-1 isof (408 aa) initn: 2781 init1: 2781 opt: 2781 Z-score: 3413.6 bits: 640.5 E(85289): 2.1e-183 Smith-Waterman score: 2781; 100.0% identity (100.0% similar) in 408 aa overlap (1-408:1-408) 10 20 30 40 50 60 pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL 310 320 330 340 350 360 370 380 390 400 pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS :::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS 370 380 390 400 >>NP_001185827 (OMIM: 104620,609924) aminoacylase-1 isof (373 aa) initn: 2335 init1: 2335 opt: 2336 Z-score: 2868.1 bits: 539.5 E(85289): 5.1e-153 Smith-Waterman score: 2467; 91.4% identity (91.4% similar) in 408 aa overlap (1-408:1-373) 10 20 30 40 50 60 pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV :::::::::::::::::::::::::::::::: NP_001 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYG---------------------------- 10 20 30 70 80 90 100 110 120 pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ ::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 -------TNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ 40 50 60 70 80 130 140 150 160 170 180 pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP 90 100 110 120 130 140 190 200 210 220 230 240 pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN 150 160 170 180 190 200 250 260 270 280 290 300 pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL 270 280 290 300 310 320 370 380 390 400 pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS :::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS 330 340 350 360 370 >>NP_001185825 (OMIM: 104620,609924) aminoacylase-1 isof (336 aa) initn: 1592 init1: 1592 opt: 1595 Z-score: 1959.6 bits: 371.2 E(85289): 2.1e-102 Smith-Waterman score: 2133; 82.4% identity (82.4% similar) in 408 aa overlap (1-408:1-336) 10 20 30 40 50 60 pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ :::::::::::::::::::::::::::::::::::::::::::: NP_001 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEG---------------- 70 80 90 100 130 140 150 160 170 180 pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP :::: NP_001 --------------------------------------------------------IANP 190 200 210 220 230 240 pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN 110 120 130 140 150 160 250 260 270 280 290 300 pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG 170 180 190 200 210 220 310 320 330 340 350 360 pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL 230 240 250 260 270 280 370 380 390 400 pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS :::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS 290 300 310 320 330 >>NP_001185826 (OMIM: 104620,609924) aminoacylase-1 isof (343 aa) initn: 2345 init1: 1498 opt: 1498 Z-score: 1840.5 bits: 349.2 E(85289): 8.9e-96 Smith-Waterman score: 2219; 84.1% identity (84.1% similar) in 408 aa overlap (1-408:1-343) 10 20 30 40 50 60 pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN ::::::::::::::::::::::::::::::::::::::: NP_001 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKL--------------------- 190 200 210 250 260 270 280 290 300 pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG :::::::::::::::: NP_001 --------------------------------------------AFEEQLQSWCQAAGEG 220 230 310 320 330 340 350 360 pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL 240 250 260 270 280 290 370 380 390 400 pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS :::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS 300 310 320 330 340 >>XP_016855886 (OMIM: 617124) PREDICTED: probable carbox (425 aa) initn: 151 init1: 74 opt: 187 Z-score: 230.6 bits: 51.7 E(85289): 4.2e-06 Smith-Waterman score: 187; 27.3% identity (57.6% similar) in 172 aa overlap (62-226:107-277) 40 50 60 70 80 90 pF1KE4 GAAVAFFEETARQLGLGCQKVEVAPGYVVTVLTWPGTNPTLSSILLNSHTDVVPVFKEHW ..: :..:.:. :: .: ::::. .: : XP_016 AEFGKYIHKVFPTVVSTSFIQHEVVEEYSHLFTIQGSDPSLQPYLLMAHFDVVPAPEEGW 80 90 100 110 120 130 100 110 120 130 140 150 pF1KE4 SHDPFEAFKDSEGYIYARGAQDMKCVSIQYLEAVRRLKVEGHRFPRTIHMTFVPDEEVGG :: .. . .: ::.::. : : . :.:.. : .. . :.. ... ::: .: XP_016 EVPPFSGL-ERDGIIYGRGTLDDKNSVMALLQALELLLIRKYIPRRSFFISLGHDEESSG 140 150 160 170 180 190 160 170 180 190 200 pF1KE4 H--QGMELFVQ----RPEFHALRAGFALDEGIANPTDAFTVF-YSERSPWWVRVTSTGRP : . ..: . : . ..:: ::. : : .... ::.. . . . XP_016 TGAQRISALLQSRGVQLAFIVDEGGFILDDFIPNFKKPIALIAVSEKGSMNLMLQVNMTS 200 210 220 230 240 250 210 220 230 240 250 260 pF1KE4 GHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSNPHLKEGSVTSVNLTKLEGGVAYNV ::.: ..:. : .:. . XP_016 GHSSAPPKETSIGILAAAVSRLEQTPMPIIFGSGTVVTVLQQLANEVYGEKSLNQCNNQD 260 270 280 290 300 310 >>NP_116038 (OMIM: 609064) beta-Ala-His dipeptidase prec (507 aa) initn: 129 init1: 105 opt: 163 Z-score: 200.1 bits: 46.3 E(85289): 0.00021 Smith-Waterman score: 163; 27.5% identity (58.2% similar) in 153 aa overlap (13-161:67-212) 10 20 30 pF1KE4 MTSKGPEEEHPSVTLFRQYL-RIRTVQPKP--DYGAAVAFFE : ::: : :. .: :: :: . NP_116 KVFQYIDLHQDEFVQTLKEWVAIESDSVQPVPRFRQELFRMMAVAADTLQRLGARVASVD 40 50 60 70 80 90 40 50 60 70 80 90 pF1KE4 ETARQLGLGCQKVEVAPGYVVTVLTWPGTNPTLSSILLNSHTDVVPVFK-EHWSHDPFEA .:: : :.. . : ..:. :..:: ... . .: :: :. . . : ::. . NP_116 MGPQQLPDG-QSLPIPP----VILAELGSDPTKGTVCFYGHLDVQPADRGDGWLTDPY-V 100 110 120 130 140 150 100 110 120 130 140 150 pF1KE4 FKDSEGYIYARGAQDMKCVSIQYLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELF . . .: .:.::: : : . ...:: ... . .: .:.. .. : .: ..: . NP_116 LTEVDGKLYGRGATDNKGPVLAWINAVSAFRALEQDLPVNIKF-IIEGMEEAGSVALEEL 160 170 180 190 200 160 170 180 190 200 210 pF1KE4 VQRPEFHALRAGFALDEGIANPTDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEK :.. NP_116 VEKEKDRFFSGVDYIVISDNLWISQRKPAITYGTRGNSYFMVEVKCRDQDFHSGTFGGIL 210 220 230 240 250 260 408 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 22:55:54 2016 done: Sat Nov 5 22:55:55 2016 Total Scan time: 8.420 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]