FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4389, 408 aa
1>>>pF1KE4389 408 - 408 aa - 408 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.0826+/-0.000396; mu= 17.9817+/- 0.024
mean_var=66.4280+/-13.826, 0's: 0 Z-trim(112.1): 16 B-trim: 1269 in 1/51
Lambda= 0.157362
statistics sampled from 20854 (20866) to 20854 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.621), E-opt: 0.2 (0.245), width: 16
Scan time: 8.420
The best scores are: opt bits E(85289)
NP_000657 (OMIM: 104620,609924) aminoacylase-1 iso ( 408) 2781 640.5 2.1e-183
NP_001185824 (OMIM: 104620,609924) aminoacylase-1 ( 408) 2781 640.5 2.1e-183
NP_001185827 (OMIM: 104620,609924) aminoacylase-1 ( 373) 2336 539.5 5.1e-153
NP_001185825 (OMIM: 104620,609924) aminoacylase-1 ( 336) 1595 371.2 2.1e-102
NP_001185826 (OMIM: 104620,609924) aminoacylase-1 ( 343) 1498 349.2 8.9e-96
XP_016855886 (OMIM: 617124) PREDICTED: probable ca ( 425) 187 51.7 4.2e-06
NP_116038 (OMIM: 609064) beta-Ala-His dipeptidase ( 507) 163 46.3 0.00021
>>NP_000657 (OMIM: 104620,609924) aminoacylase-1 isoform (408 aa)
initn: 2781 init1: 2781 opt: 2781 Z-score: 3413.6 bits: 640.5 E(85289): 2.1e-183
Smith-Waterman score: 2781; 100.0% identity (100.0% similar) in 408 aa overlap (1-408:1-408)
10 20 30 40 50 60
pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
310 320 330 340 350 360
370 380 390 400
pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
370 380 390 400
>>NP_001185824 (OMIM: 104620,609924) aminoacylase-1 isof (408 aa)
initn: 2781 init1: 2781 opt: 2781 Z-score: 3413.6 bits: 640.5 E(85289): 2.1e-183
Smith-Waterman score: 2781; 100.0% identity (100.0% similar) in 408 aa overlap (1-408:1-408)
10 20 30 40 50 60
pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
310 320 330 340 350 360
370 380 390 400
pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
370 380 390 400
>>NP_001185827 (OMIM: 104620,609924) aminoacylase-1 isof (373 aa)
initn: 2335 init1: 2335 opt: 2336 Z-score: 2868.1 bits: 539.5 E(85289): 5.1e-153
Smith-Waterman score: 2467; 91.4% identity (91.4% similar) in 408 aa overlap (1-408:1-373)
10 20 30 40 50 60
pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
::::::::::::::::::::::::::::::::
NP_001 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYG----------------------------
10 20 30
70 80 90 100 110 120
pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
:::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 -------TNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
40 50 60 70 80
130 140 150 160 170 180
pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
90 100 110 120 130 140
190 200 210 220 230 240
pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
150 160 170 180 190 200
250 260 270 280 290 300
pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
210 220 230 240 250 260
310 320 330 340 350 360
pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
270 280 290 300 310 320
370 380 390 400
pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
330 340 350 360 370
>>NP_001185825 (OMIM: 104620,609924) aminoacylase-1 isof (336 aa)
initn: 1592 init1: 1592 opt: 1595 Z-score: 1959.6 bits: 371.2 E(85289): 2.1e-102
Smith-Waterman score: 2133; 82.4% identity (82.4% similar) in 408 aa overlap (1-408:1-336)
10 20 30 40 50 60
pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
::::::::::::::::::::::::::::::::::::::::::::
NP_001 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEG----------------
70 80 90 100
130 140 150 160 170 180
pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
::::
NP_001 --------------------------------------------------------IANP
190 200 210 220 230 240
pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
110 120 130 140 150 160
250 260 270 280 290 300
pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
170 180 190 200 210 220
310 320 330 340 350 360
pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
230 240 250 260 270 280
370 380 390 400
pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
290 300 310 320 330
>>NP_001185826 (OMIM: 104620,609924) aminoacylase-1 isof (343 aa)
initn: 2345 init1: 1498 opt: 1498 Z-score: 1840.5 bits: 349.2 E(85289): 8.9e-96
Smith-Waterman score: 2219; 84.1% identity (84.1% similar) in 408 aa overlap (1-408:1-343)
10 20 30 40 50 60
pF1KE4 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MTSKGPEEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TVLTWPGTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 YLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSN
:::::::::::::::::::::::::::::::::::::::
NP_001 TDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEKL---------------------
190 200 210
250 260 270 280 290 300
pF1KE4 PHLKEGSVTSVNLTKLEGGVAYNVIPATMSASFDFRVAPDVDFKAFEEQLQSWCQAAGEG
::::::::::::::::
NP_001 --------------------------------------------AFEEQLQSWCQAAGEG
220 230
310 320 330 340 350 360
pF1KE4 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VTLEFAQKWMHPQVTPTDDSNPWWAAFSRVCKDMNLTLEPEIMPAATDNRYIRAVGVPAL
240 250 260 270 280 290
370 380 390 400
pF1KE4 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GFSPMNRTPVLLHDHDERLHEAVFLRGVDIYTRLLPALASVPALPSDS
300 310 320 330 340
>>XP_016855886 (OMIM: 617124) PREDICTED: probable carbox (425 aa)
initn: 151 init1: 74 opt: 187 Z-score: 230.6 bits: 51.7 E(85289): 4.2e-06
Smith-Waterman score: 187; 27.3% identity (57.6% similar) in 172 aa overlap (62-226:107-277)
40 50 60 70 80 90
pF1KE4 GAAVAFFEETARQLGLGCQKVEVAPGYVVTVLTWPGTNPTLSSILLNSHTDVVPVFKEHW
..: :..:.:. :: .: ::::. .: :
XP_016 AEFGKYIHKVFPTVVSTSFIQHEVVEEYSHLFTIQGSDPSLQPYLLMAHFDVVPAPEEGW
80 90 100 110 120 130
100 110 120 130 140 150
pF1KE4 SHDPFEAFKDSEGYIYARGAQDMKCVSIQYLEAVRRLKVEGHRFPRTIHMTFVPDEEVGG
:: .. . .: ::.::. : : . :.:.. : .. . :.. ... ::: .:
XP_016 EVPPFSGL-ERDGIIYGRGTLDDKNSVMALLQALELLLIRKYIPRRSFFISLGHDEESSG
140 150 160 170 180 190
160 170 180 190 200
pF1KE4 H--QGMELFVQ----RPEFHALRAGFALDEGIANPTDAFTVF-YSERSPWWVRVTSTGRP
: . ..: . : . ..:: ::. : : .... ::.. . . .
XP_016 TGAQRISALLQSRGVQLAFIVDEGGFILDDFIPNFKKPIALIAVSEKGSMNLMLQVNMTS
200 210 220 230 240 250
210 220 230 240 250 260
pF1KE4 GHASRFMEDTAAEKLHKVVNSILAFREKEWQRLQSNPHLKEGSVTSVNLTKLEGGVAYNV
::.: ..:. : .:. .
XP_016 GHSSAPPKETSIGILAAAVSRLEQTPMPIIFGSGTVVTVLQQLANEVYGEKSLNQCNNQD
260 270 280 290 300 310
>>NP_116038 (OMIM: 609064) beta-Ala-His dipeptidase prec (507 aa)
initn: 129 init1: 105 opt: 163 Z-score: 200.1 bits: 46.3 E(85289): 0.00021
Smith-Waterman score: 163; 27.5% identity (58.2% similar) in 153 aa overlap (13-161:67-212)
10 20 30
pF1KE4 MTSKGPEEEHPSVTLFRQYL-RIRTVQPKP--DYGAAVAFFE
: ::: : :. .: :: :: .
NP_116 KVFQYIDLHQDEFVQTLKEWVAIESDSVQPVPRFRQELFRMMAVAADTLQRLGARVASVD
40 50 60 70 80 90
40 50 60 70 80 90
pF1KE4 ETARQLGLGCQKVEVAPGYVVTVLTWPGTNPTLSSILLNSHTDVVPVFK-EHWSHDPFEA
.:: : :.. . : ..:. :..:: ... . .: :: :. . . : ::. .
NP_116 MGPQQLPDG-QSLPIPP----VILAELGSDPTKGTVCFYGHLDVQPADRGDGWLTDPY-V
100 110 120 130 140 150
100 110 120 130 140 150
pF1KE4 FKDSEGYIYARGAQDMKCVSIQYLEAVRRLKVEGHRFPRTIHMTFVPDEEVGGHQGMELF
. . .: .:.::: : : . ...:: ... . .: .:.. .. : .: ..: .
NP_116 LTEVDGKLYGRGATDNKGPVLAWINAVSAFRALEQDLPVNIKF-IIEGMEEAGSVALEEL
160 170 180 190 200
160 170 180 190 200 210
pF1KE4 VQRPEFHALRAGFALDEGIANPTDAFTVFYSERSPWWVRVTSTGRPGHASRFMEDTAAEK
:..
NP_116 VEKEKDRFFSGVDYIVISDNLWISQRKPAITYGTRGNSYFMVEVKCRDQDFHSGTFGGIL
210 220 230 240 250 260
408 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 22:55:54 2016 done: Sat Nov 5 22:55:55 2016
Total Scan time: 8.420 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]