FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4473, 493 aa
1>>>pF1KE4473 493 - 493 aa - 493 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.8066+/-0.000975; mu= 14.8865+/- 0.058
mean_var=64.8926+/-12.880, 0's: 0 Z-trim(103.7): 26 B-trim: 0 in 0/48
Lambda= 0.159212
statistics sampled from 7534 (7544) to 7534 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.232), width: 16
Scan time: 2.930
The best scores are:                                  opt  bits E(32554)
CCDS42544.1 PEPD gene_id:5184|Hs108|chr19     ( 493) 3352 779.1        0
CCDS54244.1 PEPD gene_id:5184|Hs108|chr19     ( 429) 2466 575.6 3.8e-164
CCDS54245.1 PEPD gene_id:5184|Hs108|chr19     ( 452) 1881 441.2 1.1e-123
CCDS14007.1 XPNPEP3 gene_id:63929|Hs108|chr22 ( 507)  464 115.7  1.2e-25
>>CCDS42544.1 PEPD gene_id:5184|Hs108|chr19 (493 aa)
initn: 3352 init1: 3352 opt: 3352 Z-score: 4159.3 bits: 779.1 E(32554): 0
Smith-Waterman score: 3352; 100.0% identity (100.0% similar) in 493 aa overlap (1-493:1-493)
10 20 30 40 50 60
pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
430 440 450 460 470 480
490
pF1KE4 GCDKAFTPFSGPK
:::::::::::::
CCDS42 GCDKAFTPFSGPK
490
>>CCDS54244.1 PEPD gene_id:5184|Hs108|chr19 (429 aa)
initn: 2456 init1: 2456 opt: 2466 Z-score: 3060.5 bits: 575.6 E(32554): 3.8e-164
Smith-Waterman score: 2761; 87.0% identity (87.0% similar) in 493 aa overlap (1-493:1-429)
10 20 30 40 50 60
pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
:::::::
CCDS54 TGVLFRQ-----------------------------------------------------
130 140 150 160 170 180
pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 -----------IASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
70 80 90 100 110
190 200 210 220 230 240
pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
120 130 140 150 160 170
250 260 270 280 290 300
pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
180 190 200 210 220 230
310 320 330 340 350 360
pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
240 250 260 270 280 290
370 380 390 400 410 420
pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
300 310 320 330 340 350
430 440 450 460 470 480
pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
360 370 380 390 400 410
490
pF1KE4 GCDKAFTPFSGPK
:::::::::::::
CCDS54 GCDKAFTPFSGPK
420
>>CCDS54245.1 PEPD gene_id:5184|Hs108|chr19 (452 aa)
initn: 1879 init1: 1879 opt: 1881 Z-score: 2333.9 bits: 441.2 E(32554): 1.1e-123
Smith-Waterman score: 3009; 91.7% identity (91.7% similar) in 493 aa overlap (1-493:1-452)
10 20 30 40 50 60
pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS
::: ::::::::::::::::
CCDS54 VEC-----------------------------------------LFEHYCYSRGGMRHSS
190
250 260 270 280 290 300
pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA
200 210 220 230 240 250
310 320 330 340 350 360
pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG
260 270 280 290 300 310
370 380 390 400 410 420
pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH
320 330 340 350 360 370
430 440 450 460 470 480
pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA
380 390 400 410 420 430
490
pF1KE4 GCDKAFTPFSGPK
:::::::::::::
CCDS54 GCDKAFTPFSGPK
440 450
>>CCDS14007.1 XPNPEP3 gene_id:63929|Hs108|chr22 (507 aa)
initn: 383 init1: 130 opt: 464 Z-score: 574.0 bits: 115.7 E(32554): 1.2e-25
Smith-Waterman score: 573; 29.1% identity (56.6% similar) in 477 aa overlap (23-485:72-506)
10 20 30 40
pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGS---IVVLQ
.:: :..: ..:. :.:. .:::.
CCDS14 PNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLS
50 60 70 80 90 100
50 60 70 80 90 100
pF1KE4 GGEETQRYCTDTGVLFRQESFFHWAFGVTEPGCYGVIDVDTGK------STLFVPRLPAS
. : . .: :.:.. : . : :: :.. :: . ::::: :
CCDS14 N--PTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPS
110 120 130 140 150
110 120 130 140 150 160
pF1KE4 HATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASF
. : : . . .::.. ..:. .: ..: . .. : . . .
CCDS14 RELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYM
160 170 180 190 200 210
170 180 190 200 210
pF1KE4 DGIS--KFEVNNTI--LHPEIVECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKE
. .. : . .: . .. : . :..:. :.: .. ..:..:.: :.: . :. ..:
CCDS14 QPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEE
220 230 240 250 260 270
220 230 240 250 260 270
pF1KE4 YELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGG
: . :: : .::. .: . ..:. : .::: . :.. :..:.: :.: :
CCDS14 AFLYAKFEFECRARGA-DILAYPPVVAGGNRSNTLHY----VKNNQLIKDGEMVLLDGGC
280 290 300 310 320 330
280 290 300 310 320 330
pF1KE4 EYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLE
: :..:::: ..:.::.::: : .:::::. .: .. ::. ... . . .
CCDS14 ESSCYVSDITRTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQ
340 350 360 370 380 390
340 350 360 370 380 390
pF1KE4 ELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRT
.: .::.. .. . . . :: .::.::.::::. .: :::
CCDS14 KLKDLGIMK-NIKENNAFKAARKYCPHHVGHYLGMDVHDTPDMP-----------RSL--
400 410 420 430 440
400 410 420 430 440 450
pF1KE4 ARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVT-
::::::.:.:::::. . .... ..:::.: ::::.:::::
CCDS14 --PLQPGMVITIEPGIYIPED---------------DKDAPEKFRGLG-VRIEDDVVVTQ
450 460 470 480
460 470 480 490
pF1KE4 DSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK
:: . : . :. ...:: :..:
CCDS14 DSPLILSADCPKEMNDIEQI---CSQAS
490 500
493 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 00:53:32 2016 done: Sun Nov 6 00:53:33 2016
Total Scan time: 2.930 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]