FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4473, 493 aa
1>>>pF1KE4473 493 - 493 aa - 493 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1083+/-0.000422; mu= 19.2846+/- 0.026
mean_var=69.4096+/-14.432, 0's: 0 Z-trim(109.8): 20 B-trim: 1051 in 1/54
Lambda= 0.153945
statistics sampled from 17984 (17991) to 17984 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.578), E-opt: 0.2 (0.211), width: 16
Scan time: 8.900
The best scores are: opt bits E(85289)
NP_000276 (OMIM: 170100,613230) xaa-Pro dipeptidas ( 493) 3352 754.1 2e-217
NP_001159529 (OMIM: 170100,613230) xaa-Pro dipepti ( 429) 2466 557.3 3.1e-158
NP_001159528 (OMIM: 170100,613230) xaa-Pro dipepti ( 452) 1881 427.4 4.2e-119
NP_071381 (OMIM: 613159,613553) probable Xaa-Pro a ( 507) 464 112.7 2.5e-24
NP_003390 (OMIM: 300145,300909) xaa-Pro aminopepti ( 674) 149 42.8 0.0036
>>NP_000276 (OMIM: 170100,613230) xaa-Pro dipeptidase is (493 aa)
initn: 3352 init1: 3352 opt: 3352 Z-score: 4024.4 bits: 754.1 E(85289): 2e-217
Smith-Waterman score: 3352; 100.0% identity (100.0% similar) in 493 aa overlap (1-493:1-493)
10 20 30 40 50 60
pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
430 440 450 460 470 480
490
pF1KE4 GCDKAFTPFSGPK
:::::::::::::
NP_000 GCDKAFTPFSGPK
490
>>NP_001159529 (OMIM: 170100,613230) xaa-Pro dipeptidase (429 aa)
initn: 2456 init1: 2456 opt: 2466 Z-score: 2961.8 bits: 557.3 E(85289): 3.1e-158
Smith-Waterman score: 2761; 87.0% identity (87.0% similar) in 493 aa overlap (1-493:1-429)
10 20 30 40 50 60
pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
:::::::
NP_001 TGVLFRQ-----------------------------------------------------
130 140 150 160 170 180
pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
:::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 -----------IASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
70 80 90 100 110
190 200 210 220 230 240
pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
120 130 140 150 160 170
250 260 270 280 290 300
pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
180 190 200 210 220 230
310 320 330 340 350 360
pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
240 250 260 270 280 290
370 380 390 400 410 420
pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
300 310 320 330 340 350
430 440 450 460 470 480
pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
360 370 380 390 400 410
490
pF1KE4 GCDKAFTPFSGPK
:::::::::::::
NP_001 GCDKAFTPFSGPK
420
>>NP_001159528 (OMIM: 170100,613230) xaa-Pro dipeptidase (452 aa)
initn: 1879 init1: 1879 opt: 1881 Z-score: 2259.3 bits: 427.4 E(85289): 4.2e-119
Smith-Waterman score: 3009; 91.7% identity (91.7% similar) in 493 aa overlap (1-493:1-452)
10 20 30 40 50 60
pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
::: ::::::::::::::::
NP_001 VEC-----------------------------------------LFEHYCYSRGGMRHSS
190
250 260 270 280 290 300
pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
200 210 220 230 240 250
310 320 330 340 350 360
pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
260 270 280 290 300 310
370 380 390 400 410 420
pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
320 330 340 350 360 370
430 440 450 460 470 480
pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
380 390 400 410 420 430
490
pF1KE4 GCDKAFTPFSGPK
:::::::::::::
NP_001 GCDKAFTPFSGPK
440 450
>>NP_071381 (OMIM: 613159,613553) probable Xaa-Pro amino (507 aa)
initn: 383 init1: 130 opt: 464 Z-score: 557.7 bits: 112.7 E(85289): 2.5e-24
Smith-Waterman score: 573; 29.1% identity (56.6% similar) in 477 aa overlap (23-485:72-506)
10 20 30 40
pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGS---IVVLQ
.:: :..: ..:. :.:. .:::.
NP_071 PNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLS
50 60 70 80 90 100
50 60 70 80 90 100
pF1KE4 GGEETQRYCTDTGVLFRQESFFHWAFGVTEPGCYGVIDVDTGK------STLFVPRLPAS
. : . .: :.:.. : . : :: :.. :: . ::::: :
NP_071 N--PTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPS
110 120 130 140 150
110 120 130 140 150 160
pF1KE4 HATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASF
. : : . . .::.. ..:. .: ..: . .. : . . .
NP_071 RELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYM
160 170 180 190 200 210
170 180 190 200 210
pF1KE4 DGIS--KFEVNNTI--LHPEIVECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKE
. .. : . .: . .. : . :..:. :.: .. ..:..:.: :.: . :. ..:
NP_071 QPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEE
220 230 240 250 260 270
220 230 240 250 260 270
pF1KE4 YELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGG
: . :: : .::. .: . ..:. : .::: . :.. :..:.: :.: :
NP_071 AFLYAKFEFECRARGA-DILAYPPVVAGGNRSNTLHY----VKNNQLIKDGEMVLLDGGC
280 290 300 310 320 330
280 290 300 310 320 330
pF1KE4 EYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLE
: :..:::: ..:.::.::: : .:::::. .: .. ::. ... . . .
NP_071 ESSCYVSDITRTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQ
340 350 360 370 380 390
340 350 360 370 380 390
pF1KE4 ELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRT
.: .::.. .. . . . :: .::.::.::::. .: :::
NP_071 KLKDLGIMK-NIKENNAFKAARKYCPHHVGHYLGMDVHDTPDMP-----------RSL--
400 410 420 430 440
400 410 420 430 440 450
pF1KE4 ARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVT-
::::::.:.:::::. . .... ..:::.: ::::.:::::
NP_071 --PLQPGMVITIEPGIYIPED---------------DKDAPEKFRGLG-VRIEDDVVVTQ
450 460 470 480
460 470 480 490
pF1KE4 DSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK
:: . : . :. ...:: :..:
NP_071 DSPLILSADCPKEMNDIEQI---CSQAS
490 500
>>NP_003390 (OMIM: 300145,300909) xaa-Pro aminopeptidase (674 aa)
initn: 138 init1: 68 opt: 149 Z-score: 177.9 bits: 42.8 E(85289): 0.0036
Smith-Waterman score: 149; 27.5% identity (52.1% similar) in 211 aa overlap (240-437:415-616)
210 220 230 240 250 260
pF1KE4 MKAVKVGMKEYELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQN
:. : .:: :.:. ::. . : : ...
NP_003 KNVPKGTVDEFSGAEIVDKFRGEEQFSSGPSFETISASGLNAALAHYSPTKELN-RKLSS
390 400 410 420 430 440
270 280 290 300 310 320
pF1KE4 GDMCLFDMGGEYYCFASDITCSFPANGKFTADQKAVYEAVLRS----SRAVMGAMKPGVW
.: :.: ::.:. ..::: . : .: :: .: :: . :: .. : :
NP_003 DEMYLLDSGGQYWDGTTDITRTVHW-GTPSAFQKEAYTRVLIGNIDLSRLIFPAATSGRM
450 460 470 480 490 500
330 340 350 360 370
pF1KE4 WPDMHRLADRIHLEELAHMGILSG-SVDAMVQAHLGAV-FMPHGL----GHFLGIDVHDV
.. .: : . ..: .: .. .. .: : :. ... : : .: .
NP_003 ---VEAFARRALWDAGLNYGHGTGHGIGNFLCVHEWPVGFQSNNIAMAKGMFTSI---EP
510 520 530 540 550
380 390 400 410 420 430
pF1KE4 GGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH---LLDEALADPARASFLN
: : .: : . . :. :: :: : . :. . :.: .: .: . ..::
NP_003 GYYKDGEFGIRLEDVALVVEAKTKYPGSYLTFEV-VSFVPYDRNLIDVSLLSPEHLQYLN
560 570 580 590 600 610
440 450 460 470 480 490
pF1KE4 REVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK
:
NP_003 RYYQTIREKVGPELQRRQLLEEFEWLQQHTEPLAARAPDTASWASVLVVSTLAILGWSV
620 630 640 650 660 670
493 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 00:53:33 2016 done: Sun Nov 6 00:53:34 2016
Total Scan time: 8.900 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]