FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4473, 493 aa 1>>>pF1KE4473 493 - 493 aa - 493 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8066+/-0.000975; mu= 14.8865+/- 0.058 mean_var=64.8926+/-12.880, 0's: 0 Z-trim(103.7): 26 B-trim: 0 in 0/48 Lambda= 0.159212 statistics sampled from 7534 (7544) to 7534 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.232), width: 16 Scan time: 2.930 The best scores are: opt bits E(32554) CCDS42544.1 PEPD gene_id:5184|Hs108|chr19 ( 493) 3352 779.1 0 CCDS54244.1 PEPD gene_id:5184|Hs108|chr19 ( 429) 2466 575.6 3.8e-164 CCDS54245.1 PEPD gene_id:5184|Hs108|chr19 ( 452) 1881 441.2 1.1e-123 CCDS14007.1 XPNPEP3 gene_id:63929|Hs108|chr22 ( 507) 464 115.7 1.2e-25 >>CCDS42544.1 PEPD gene_id:5184|Hs108|chr19 (493 aa) initn: 3352 init1: 3352 opt: 3352 Z-score: 4159.3 bits: 779.1 E(32554): 0 Smith-Waterman score: 3352; 100.0% identity (100.0% similar) in 493 aa overlap (1-493:1-493) 10 20 30 40 50 60 pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA 430 440 450 460 470 480 490 pF1KE4 GCDKAFTPFSGPK ::::::::::::: CCDS42 GCDKAFTPFSGPK 490 >>CCDS54244.1 PEPD gene_id:5184|Hs108|chr19 (429 aa) initn: 2456 init1: 2456 opt: 2466 Z-score: 3060.5 bits: 575.6 E(32554): 3.8e-164 Smith-Waterman score: 2761; 87.0% identity (87.0% similar) in 493 aa overlap (1-493:1-429) 10 20 30 40 50 60 pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK ::::::: CCDS54 TGVLFRQ----------------------------------------------------- 130 140 150 160 170 180 pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 -----------IASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI 70 80 90 100 110 190 200 210 220 230 240 pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS 120 130 140 150 160 170 250 260 270 280 290 300 pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA 180 190 200 210 220 230 310 320 330 340 350 360 pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG 240 250 260 270 280 290 370 380 390 400 410 420 pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH 300 310 320 330 340 350 430 440 450 460 470 480 pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA 360 370 380 390 400 410 490 pF1KE4 GCDKAFTPFSGPK ::::::::::::: CCDS54 GCDKAFTPFSGPK 420 >>CCDS54245.1 PEPD gene_id:5184|Hs108|chr19 (452 aa) initn: 1879 init1: 1879 opt: 1881 Z-score: 2333.9 bits: 441.2 E(32554): 1.1e-123 Smith-Waterman score: 3009; 91.7% identity (91.7% similar) in 493 aa overlap (1-493:1-452) 10 20 30 40 50 60 pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TGVLFRQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 YAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 VECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSS ::: :::::::::::::::: CCDS54 VEC-----------------------------------------LFEHYCYSRGGMRHSS 190 250 260 270 280 290 300 pF1KE4 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 YTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTA 200 210 220 230 240 250 310 320 330 340 350 360 pF1KE4 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 DQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLG 260 270 280 290 300 310 370 380 390 400 410 420 pF1KE4 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 AVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDH 320 330 340 350 360 370 430 440 450 460 470 480 pF1KE4 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMA 380 390 400 410 420 430 490 pF1KE4 GCDKAFTPFSGPK ::::::::::::: CCDS54 GCDKAFTPFSGPK 440 450 >>CCDS14007.1 XPNPEP3 gene_id:63929|Hs108|chr22 (507 aa) initn: 383 init1: 130 opt: 464 Z-score: 574.0 bits: 115.7 E(32554): 1.2e-25 Smith-Waterman score: 573; 29.1% identity (56.6% similar) in 477 aa overlap (23-485:72-506) 10 20 30 40 pF1KE4 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGS---IVVLQ .:: :..: ..:. :.:. .:::. CCDS14 PNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLS 50 60 70 80 90 100 50 60 70 80 90 100 pF1KE4 GGEETQRYCTDTGVLFRQESFFHWAFGVTEPGCYGVIDVDTGK------STLFVPRLPAS . : . .: :.:.. : . : :: :.. :: . ::::: : CCDS14 N--PTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPS 110 120 130 140 150 110 120 130 140 150 160 pF1KE4 HATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASF . : : . . .::.. ..:. .: ..: . .. : . . . CCDS14 RELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYM 160 170 180 190 200 210 170 180 190 200 210 pF1KE4 DGIS--KFEVNNTI--LHPEIVECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKE . .. : . .: . .. : . :..:. :.: .. ..:..:.: :.: . :. ..: CCDS14 QPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEE 220 230 240 250 260 270 220 230 240 250 260 270 pF1KE4 YELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGG : . :: : .::. .: . ..:. : .::: . :.. :..:.: :.: : CCDS14 AFLYAKFEFECRARGA-DILAYPPVVAGGNRSNTLHY----VKNNQLIKDGEMVLLDGGC 280 290 300 310 320 330 280 290 300 310 320 330 pF1KE4 EYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLE : :..:::: ..:.::.::: : .:::::. .: .. ::. ... . . . CCDS14 ESSCYVSDITRTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQ 340 350 360 370 380 390 340 350 360 370 380 390 pF1KE4 ELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRT .: .::.. .. . . . :: .::.::.::::. .: ::: CCDS14 KLKDLGIMK-NIKENNAFKAARKYCPHHVGHYLGMDVHDTPDMP-----------RSL-- 400 410 420 430 440 400 410 420 430 440 450 pF1KE4 ARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVT- ::::::.:.:::::. . .... ..:::.: ::::.::::: CCDS14 --PLQPGMVITIEPGIYIPED---------------DKDAPEKFRGLG-VRIEDDVVVTQ 450 460 470 480 460 470 480 490 pF1KE4 DSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK :: . : . :. ...:: :..: CCDS14 DSPLILSADCPKEMNDIEQI---CSQAS 490 500 493 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:53:32 2016 done: Sun Nov 6 00:53:33 2016 Total Scan time: 2.930 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]