FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6419, 496 aa 1>>>pF1KE6419 496 - 496 aa - 496 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5073+/-0.000932; mu= 16.6176+/- 0.056 mean_var=62.4056+/-12.334, 0's: 0 Z-trim(105.1): 16 B-trim: 0 in 0/48 Lambda= 0.162354 statistics sampled from 8222 (8230) to 8222 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.618), E-opt: 0.2 (0.253), width: 16 Scan time: 2.370 The best scores are: opt bits E(32554) CCDS8262.1 PRCP gene_id:5547|Hs108|chr11 ( 496) 3420 809.9 0 CCDS41695.1 PRCP gene_id:5547|Hs108|chr11 ( 517) 3062 726.0 2.3e-209 CCDS7030.1 DPP7 gene_id:29952|Hs108|chr9 ( 492) 1259 303.7 3e-82 CCDS4623.1 PRSS16 gene_id:10279|Hs108|chr6 ( 514) 410 104.9 2.2e-22 >>CCDS8262.1 PRCP gene_id:5547|Hs108|chr11 (496 aa) initn: 3420 init1: 3420 opt: 3420 Z-score: 4325.7 bits: 809.9 E(32554): 0 Smith-Waterman score: 3420; 100.0% identity (100.0% similar) in 496 aa overlap (1-496:1-496) 10 20 30 40 50 60 pF1KE6 MGRRALLLLLLSFLAPWATIALRPALRALGSLHLPTNPTSLPAVAKNYSVLYFQQKVDHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 MGRRALLLLLLSFLAPWATIALRPALRALGSLHLPTNPTSLPAVAKNYSVLYFQQKVDHF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GFNTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 GFNTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 EHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 EHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 GGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIHRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 GGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIHRS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 WDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 WDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 LPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 LPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 YQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRPSWITTMYGGKNISSHT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 YQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRPSWITTMYGGKNISSHT 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 NIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDLRTKNALDPMSVLLARSLEVRH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 NIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDLRTKNALDPMSVLLARSLEVRH 430 440 450 460 470 480 490 pF1KE6 MKNWIRDFYDSAGKQH :::::::::::::::: CCDS82 MKNWIRDFYDSAGKQH 490 >>CCDS41695.1 PRCP gene_id:5547|Hs108|chr11 (517 aa) initn: 3406 init1: 3062 opt: 3062 Z-score: 3872.2 bits: 726.0 E(32554): 2.3e-209 Smith-Waterman score: 3368; 95.9% identity (95.9% similar) in 517 aa overlap (1-496:1-517) 10 20 30 40 50 pF1KE6 MGRRALLLLLLSFLAPWATIALRPALRALGSLHLPTNPTSLPAVAKNYSVLYFQQK---- :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MGRRALLLLLLSFLAPWATIALRPALRALGSLHLPTNPTSLPAVAKNYSVLYFQQKALAA 10 20 30 40 50 60 60 70 80 90 pF1KE6 -----------------VDHFGFNTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWF ::::::::::::::::::::::::::::::::::::::::::: CCDS41 GQLHICIIQLNHYKTPLVDHFGFNTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWF 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 CNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 CNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIK 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE6 HLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 HLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFM 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE6 KIVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 KIVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWI 250 260 270 280 290 300 280 290 300 310 320 330 pF1KE6 SETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 SETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSG 310 320 330 340 350 360 340 350 360 370 380 390 pF1KE6 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG 370 380 390 400 410 420 400 410 420 430 440 450 pF1KE6 VRPRPSWITTMYGGKNISSHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 VRPRPSWITTMYGGKNISSHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL 430 440 450 460 470 480 460 470 480 490 pF1KE6 RTKNALDPMSVLLARSLEVRHMKNWIRDFYDSAGKQH ::::::::::::::::::::::::::::::::::::: CCDS41 RTKNALDPMSVLLARSLEVRHMKNWIRDFYDSAGKQH 490 500 510 >>CCDS7030.1 DPP7 gene_id:29952|Hs108|chr9 (492 aa) initn: 1202 init1: 425 opt: 1259 Z-score: 1590.2 bits: 303.7 E(32554): 3e-82 Smith-Waterman score: 1286; 42.4% identity (69.9% similar) in 488 aa overlap (15-486:4-474) 10 20 30 40 50 60 pF1KE6 MGRRALLLLLLSFLAPWATIALRPALRALGSLHLPTNPTSLPAVAKNYSVLYFQQKVDHF :::: . : ::: : .. : ... .:::..::: CCDS70 MGSAPWAPVLLL----ALGLRGLQAGARRAPD--PGFQERFFQQRLDHF 10 20 30 40 70 80 90 100 110 pF1KE6 GFNTV--KTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLV .:. ::: ::.::.:..: .. : :.::::::::. : ::..:. ..: : :.:: CCDS70 NFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWAFANNSAFVAELAAERGALLV 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE6 FAEHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGG ::::::::.::::: .: . . : ..:: ::::::::::.. :.: . ::.. :.::.:: CCDS70 FAEHRYYGKSLPFGAQSTQRG-HTELLTVEQALADFAELLRALRRDL-GAQDAPAIAFGG 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE6 SYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIH ::::::.:..::::::.:.::::::::. : . :.. ::.::. ..:.:..... CCDS70 SYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVR 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE6 RSWDAINRLSNTGS--GLQWLTGALHLCSPLTSQ-DIQHLKDWISETWVNLAMVDYPYAS ... :. : :. ..: :. :.::... :. .: . .... :::.:::: . CCDS70 EAFRQIKDLFLQGAYDTVRWEFGT---CQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPT 230 240 250 260 270 300 310 320 330 340 350 pF1KE6 NFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLG .:: :::: :.:: :. : .:.. . .. .. :: ::. .: .: . : CCDS70 DFLGPLPANPVKVGCDRL----LSEAQRITGLRALAGLVYNASGSEHCYDIYRLYHSCAD 280 290 300 310 320 330 360 370 380 390 400 pF1KE6 TLG---------WSYQACTEVVMPFCTNGVDDMFE--PHSWNLKELSDDCFQQWGVRPRP : :.::::::. . : .:.: ::: : . .:.. :.. ::: ::: CCDS70 PTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQRY--CLDTWGVWPRP 340 350 360 370 380 390 410 420 430 440 450 460 pF1KE6 SWITTMYGGKNISSHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDLRTKNA .:. : . : .. . .::.::::.::::.:::. .... ...::::. ::::::::... CCDS70 DWLLTSFWGGDLRAASNIIFSNGNLDPWAGGGIRRNLSASVIAVTIQGGAHHLDLRASHP 400 410 420 430 440 450 470 480 490 pF1KE6 LDPMSVLLARSLEVRHMKNWIRDFYDSAGKQH :: ::. ::.::. . .:.. CCDS70 EDPASVVEARKLEATIIGEWVKAARREQQPALRGGPRLSL 460 470 480 490 >>CCDS4623.1 PRSS16 gene_id:10279|Hs108|chr6 (514 aa) initn: 416 init1: 191 opt: 410 Z-score: 515.2 bits: 104.9 E(32554): 2.2e-22 Smith-Waterman score: 521; 27.0% identity (53.9% similar) in 514 aa overlap (8-486:12-503) 10 20 30 40 pF1KE6 MGRRALLLLLLSFLAPWATIALRPALRALGSLHLP----TNPTSL-----PAVAKN :::.:. :. .: :: :: :. .. .: :..: CCDS46 MAVWLAQWLGPLLLVSL---WGLLAPASLLRRLGE-HIQQFQESSAQGLGLSLGPGAAAL 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 YSVLYFQQKVDHFGFNTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMW .: ...: .: :. . ..: ::: : :..: . : :... :.::.. : CCDS46 PKVGWLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPA 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE6 DVAEELKAMLVFAEHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIKHLKRTIPG .: :... :::.:: :.: : . .: ::.:. :::: . :.: . CCDS46 ALAPAWGALVISLEHRFYGLSIPAGG---LEMAQLRFLSSRLALADVVSARLALSRLFNI 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE6 AENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFR . ..: : .::::.: :::: :.:.::.. ...:.:::. :. . . .:. .. CCDS46 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDF---SEYNDVVSRSLM 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE6 KSGP----HCSESIHRSWDAINR-LSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISET ... .: .. .. ..: : . :.. : : :.:: . : .. CCDS46 STAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELSACGPLGRAENQAELLGALQA 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE6 WVNLAMVDYPYASNFLQPLPAWPIKVVCQYL--KNPNVSDSLLLQNIFQALNVYYNYSGQ :. ..:.: .. :: .. .: : . : : : .. .:... . :: CCDS46 LVG-GVVQYDGQTG--APLS---VRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQ 300 310 320 330 340 350 360 370 380 pF1KE6 VKCLNISETAT-----------SSLGTLGWSYQACTEV-VMPFCTNGVDDMFEPHSW--N :::..:.. : :..: : ::.::: . : : : : CCDS46 -KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPR----CPFSQLPA 350 360 370 380 390 390 400 410 420 430 440 pF1KE6 LKELSDDCFQQWG-----VRPRPSWITTMYGGKNISSHTNIVFSNGELDPWSGGGVTKDI : : : : .: : . ...:::.. ... ...: ::. ::: .::. . CCDS46 LPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQTPGAN-KVLFVNGDTDPWHVLSVTQAL 400 410 420 430 440 450 450 460 470 480 490 pF1KE6 TDTLVAVTISEGAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFYDSAGKQH .. .. : :.: ::. . : :. :.:. .....:.. CCDS46 GSSESTLLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWLKLAKESQIKGEV 460 470 480 490 500 510 496 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 13:06:35 2016 done: Tue Nov 8 13:06:35 2016 Total Scan time: 2.370 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]