FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3381, 1151 aa 1>>>pF1KE3381 1151 - 1151 aa - 1151 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.4620+/-0.0012; mu= -2.4658+/- 0.071 mean_var=337.4112+/-85.607, 0's: 0 Z-trim(109.1): 47 B-trim: 0 in 0/52 Lambda= 0.069822 statistics sampled from 10661 (10683) to 10661 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.668), E-opt: 0.2 (0.328), width: 16 Scan time: 3.590 The best scores are: opt bits E(32554) CCDS47908.1 ZFPM2 gene_id:23414|Hs108|chr8 (1151) 7862 807.6 0 CCDS32502.1 ZFPM1 gene_id:161882|Hs108|chr16 (1006) 894 105.6 6.1e-22 >>CCDS47908.1 ZFPM2 gene_id:23414|Hs108|chr8 (1151 aa) initn: 7862 init1: 7862 opt: 7862 Z-score: 4300.0 bits: 807.6 E(32554): 0 Smith-Waterman score: 7862; 100.0% identity (100.0% similar) in 1151 aa overlap (1-1151:1-1151) 10 20 30 40 50 60 pF1KE3 MSRRKQSKPRQIKRPLEDAIEDEEEECPSEETDIISKGDFPLEESFSTEFGPENLSCEEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MSRRKQSKPRQIKRPLEDAIEDEEEECPSEETDIISKGDFPLEESFSTEFGPENLSCEEV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 EYFCNKGDDEGIQETAESDGDTQSEKPGQPGVETDDWDGPGELEVFQKDGERKIQSRQQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 EYFCNKGDDEGIQETAESDGDTQSEKPGQPGVETDDWDGPGELEVFQKDGERKIQSRQQL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 PVGTTWGPFPGKMDLNNNSLKTKAQVPMVLTAGPKWLLDVTWQGVEDNKNNCIVYSKGGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 PVGTTWGPFPGKMDLNNNSLKTKAQVPMVLTAGPKWLLDVTWQGVEDNKNNCIVYSKGGQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 LWCTTTKAISEGEELIAFVVDFDSRLQAASQMTLTEGMYPARLLDSIQLLPQQAAMASIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LWCTTTKAISEGEELIAFVVDFDSRLQAASQMTLTEGMYPARLLDSIQLLPQQAAMASIL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 PTAIVNKDIFPCKSCGIWYRSERNLQAHLMYYCSGRQREAAPVSEENEDSAHQISSLCPF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 PTAIVNKDIFPCKSCGIWYRSERNLQAHLMYYCSGRQREAAPVSEENEDSAHQISSLCPF 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 PQCTKSFSNARALEMHLNSHSGVKMEEFLPPGASLKCTVCSYTADSVINFHQHLFSHLTQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 PQCTKSFSNARALEMHLNSHSGVKMEEFLPPGASLKCTVCSYTADSVINFHQHLFSHLTQ 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 AAFRCNHCHFGFQTQRELLQHQELHVPSGKLPRESDMEHSPSATEDSLQPATDLLTRSEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 AAFRCNHCHFGFQTQRELLQHQELHVPSGKLPRESDMEHSPSATEDSLQPATDLLTRSEL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE3 PQSQKAMQTKDASSDTELDKCEKKTQLFLTNQRPEIQPTTNKQSFSYTKIKSEPSSPRLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 PQSQKAMQTKDASSDTELDKCEKKTQLFLTNQRPEIQPTTNKQSFSYTKIKSEPSSPRLA 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE3 SSPVQPNIGPSFPVGPFLSQFSFPQDITMVPQASEILAKMSELVHRRLRHGSSSYPPVIY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 SSPVQPNIGPSFPVGPFLSQFSFPQDITMVPQASEILAKMSELVHRRLRHGSSSYPPVIY 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE3 SPLMPKGATCFECNITFNNLDNYLVHKKHYCSSRWQQMAKSPEFPSVSEKMPEALSPNTG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 SPLMPKGATCFECNITFNNLDNYLVHKKHYCSSRWQQMAKSPEFPSVSEKMPEALSPNTG 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE3 QTSINLLNPAAHSADPENPLLQTSCINSSTVLDLIGPNGKGHDKDFSTQTKKLSTSSNND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 QTSINLLNPAAHSADPENPLLQTSCINSSTVLDLIGPNGKGHDKDFSTQTKKLSTSSNND 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE3 DKINGKPVDVKNPSVPLVDGESDPNKTTCEACNITFSRHETYMVHKQYYCATRHDPPLKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 DKINGKPVDVKNPSVPLVDGESDPNKTTCEACNITFSRHETYMVHKQYYCATRHDPPLKR 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE3 SASNKVPAMQRTMRTRKRRKMYEMCLPEQEQRPPLVQQRFLDVANLNNPCTSTQEPTEGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 SASNKVPAMQRTMRTRKRRKMYEMCLPEQEQRPPLVQQRFLDVANLNNPCTSTQEPTEGL 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE3 GECYHPRCDIFPGIVSKHLETSLTINKCVPVSKCDTTHSSVSCLEMDVPIDLSKKCLSQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 GECYHPRCDIFPGIVSKHLETSLTINKCVPVSKCDTTHSSVSCLEMDVPIDLSKKCLSQS 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE3 ERTTTSPKRLLDYHECTVCKISFNKVENYLAHKQNFCPVTAHQRNDLGQLDGKVFPNPES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ERTTTSPKRLLDYHECTVCKISFNKVENYLAHKQNFCPVTAHQRNDLGQLDGKVFPNPES 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE3 ERNSPDVSYERSIIKCEKNGNLKQPSPNGNLFSSHLATLQGLKVFSEAAQLIATKEENRH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ERNSPDVSYERSIIKCEKNGNLKQPSPNGNLFSSHLATLQGLKVFSEAAQLIATKEENRH 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE3 LFLPQCLYPGAIKKAKGADQLSPYYGIKPSDYISGSLVIHNTDIEQSRNAENESPKGQAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LFLPQCLYPGAIKKAKGADQLSPYYGIKPSDYISGSLVIHNTDIEQSRNAENESPKGQAS 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE3 SNGCAALKKDSLPLLPKNRGMVIVNGGLKQDERPAANPQQENISQNPQHEDDHKSPSWIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 SNGCAALKKDSLPLLPKNRGMVIVNGGLKQDERPAANPQQENISQNPQHEDDHKSPSWIS 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE3 ENPLAANENVSPGIPSAEEQLSSIAKGVNGSSQAPTSGKYCRLCDIQFNNLSNFITHKKF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ENPLAANENVSPGIPSAEEQLSSIAKGVNGSSQAPTSGKYCRLCDIQFNNLSNFITHKKF 1090 1100 1110 1120 1130 1140 1150 pF1KE3 YCSSHAAEHVK ::::::::::: CCDS47 YCSSHAAEHVK 1150 >>CCDS32502.1 ZFPM1 gene_id:161882|Hs108|chr16 (1006 aa) initn: 1318 init1: 337 opt: 894 Z-score: 507.3 bits: 105.6 E(32554): 6.1e-22 Smith-Waterman score: 1449; 31.7% identity (51.0% similar) in 1223 aa overlap (1-1151:1-1006) 10 20 30 40 50 60 pF1KE3 MSRRKQSKPRQIKRPLEDAIEDEEEECPSEETDIISKGDFPLEESFSTEFGPENLSCEEV :::::::.:::::: : : .: .:: . . .:. : :. : . : : . CCDS32 MSRRKQSNPRQIKRSLGD-MEAREEVQLVGASHMEQKATAP--EAPSPPSADVN-SPPPL 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 EYFCNKGDDEGIQETAESDGDTQSEKPGQPGVETDDWDGPGELEVFQKDGERKIQSRQQL . : . . : : . :.::.: :.:: ::: .::.:.:..: .: CCDS32 PPPTSPGGPKEL-EGQEPEPRPTEEEPGSP------WSGPDELEPVVQDGQRRIRARLSL 60 70 80 90 100 130 140 150 160 170 pF1KE3 PVGTTWGPFPGKMDLNNNSLKTKAQVP---MVLTAGPKWLLDVTWQGVEDNKNNCIVYSK .: .:::: :... .: . : ..:. :: . :.. . . : .. : CCDS32 ATGLSWGPFHGSVQTRASSPRQAEPSPALTLLLVDEACWLRTLP-QALTEAEANTEIHRK 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 GGQLWCTTTKAISEGEEL-IAFVVDFDSRLQAASQMTLTEGMYPARLLDSIQLLPQQAAM ::: .:: . : : . .... : . .: :: : .:::::::.: CCDS32 DDALWCRVTKPVPAGGLLSVLLTAEPHSTPGHPVKKEPAEPTCPAPAHD-LQLLPQQAGM 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE3 ASILPTAIVNKDIFPCKSCGIWYRSERNLQAHLMYYCSGRQREAAPV---SEENEDSAHQ :::: ::..:::.::::.:::::::::::::::.:::..:: ..:. ..:. .. CCDS32 ASILATAVINKDVFPCKDCGIWYRSERNLQAHLLYYCASRQGTGSPAAAATDEKPKETYP 230 240 250 260 270 280 300 310 320 330 340 350 pF1KE3 ISSLCPFPQCTKSFSNARALEMHLNSHSGVKMEEFLPPGASLKCTVCSYTADSVINFHQH .:::::: :: .: .::.:. :::: . : . : .: . . : ..: CCDS32 NERVCPFPQCRKSCPSASSLEIHMRSHSGER------P---FVCLICLSAFTTKANCERH 290 300 310 320 330 360 370 380 390 pF1KE3 LFSHLTQAAFRCNHCHFGFQTQRELL------QHQ--------ELHVPSG-----KLPRE : : . :. : : ..: :..: .:. :.. :.. ::: . CCDS32 LKVHTDTLSGVCHSCGF-ISTTRDILYSHLVTNHMVCQPGSKGEIYSPGAGHPATKLPPD 340 350 360 370 380 390 400 410 420 430 pF1KE3 S----DMEHS----PSATED-SLQP-------------ATDLLTRSE-LPQSQKAMQTKD : ...:. : :. : .: : ::. .:.: : :. . . CCDS32 SLGSFQQQHTALQGPLASADLGLAPTPSPGLDRKALAEATNGEARAEPLAQNGGSSEPPA 400 410 420 430 440 450 440 450 460 470 480 pF1KE3 ASSDTELDKCEK-KTQLFLTNQRPEIQ-P--TTNKQSFSYTKIKSEPSSPRLASSPVQPN : . ... :. .. .: .: : : : . .: . ...:.: ::: .:::: . CCDS32 APRSIKVEAVEEPEAAPILGPGEPGPQAPSRTPSPRSPAPARVKAELSSPTPGSSPVPGE 460 470 480 490 500 510 490 500 510 520 530 540 pF1KE3 IGPSFPVGPFLSQFSFPQDITMVPQASEILAKMSELVHRRLRHGSSSYPPVIYS---PLM .: . . :: :. : : . : ::::::::::::: ::..:... . : CCDS32 LGLAGAL--FLPQYVFGPDAA--PPASEILAKMSELVHSRLQQGAGAGAGGAQTGLFPGA 520 530 540 550 560 570 550 560 570 580 590 600 pF1KE3 PKGATCFECNITFNNLDNYLVHKKHYCSSRWQQMAKSPEFPSVSEKMPEALSPNTGQTSI :::::::::.:::.:..:: :::. :::.: ..:: . . . :.: :. ... CCDS32 PKGATCFECEITFSNVNNYYVHKRLYCSGR-----RAPE-DAPAARRPKA-PPGPARA-- 580 590 600 610 620 610 620 630 640 650 660 pF1KE3 NLLNPAAHSADPENPLLQTSCINSSTVLDLIGPNGKGHDKDFSTQTKKLSTSSNNDDKIN : .. :.:. : :: ::... . . ... .: . CCDS32 ----PPGQPAEPDAP-------RSSP-----GPGAR---------EEGAGGAATPEDGAG 630 640 650 670 680 690 700 710 720 pF1KE3 GKPVD-VKNPSVPLVDGESDPNKTTCEACNITFSRHETYMVHKQYYCATRHDPPLKRSAS :. . ..:. . :.:.::..: :::::: ::::::: :::.::::.::::: .: :. CCDS32 GRGSEGSQSPGSSVDDAEDDPSRTLCEACNIRFSRHETYTVHKRYYCASRHDPPPRRPAA 660 670 680 690 700 710 730 740 750 760 770 pF1KE3 NK---------VPAMQRTMRTRKRRKMYEMCLPEQEQRPPLVQQRFLDVANLNNPCTSTQ .:. .:::.:::.::. :: . : . CCDS32 PPGPPGPAAPPAPSPAAPVRTRRRRKLYELHAAGAPPPPP----------PGHAPAPESP 720 730 740 750 760 780 790 800 810 820 830 pF1KE3 EPTEGLGECYHPRCDIFPGIVSKHLETSLTINKCVPVSKCDTTHSSVSCLEMDVPIDLSK .: : : ::.. . :.. : :::::: CCDS32 RPGSGSGSG--------PGLAPARSPG--------PAA--------------DGPIDLSK 770 780 790 840 850 860 870 880 890 pF1KE3 KCLSQSERTTTSPKRLLDYHECTVCKISFNKVENYLAHKQNFCPVTAHQRNDLGQLDGKV : . . : ::::::.:..::...: :::::. ::. : . :: : . . CCDS32 K--PRRPLPGAPAPALADYHECTACRVSFHSLEAYLAHKKYSCPA-APPPGALG-LPAAA 800 810 820 830 840 850 900 910 920 930 940 950 pF1KE3 FPNPESERNSPDVSYERSIIKCEKNGNLKQPSPNGNLFSSHLATLQGLKVFSEAAQLIAT : : :: .. :.:. :. .:: . .: : . CCDS32 CPY------------------CPPNGPVR-----GDLLE-HFRLAHGLLL---GAPLAG- 860 870 880 960 970 980 990 1000 1010 pF1KE3 KEENRHLFLPQCLYPGAIKKAKGADQLSPYYGIKPSDYISGSLVIHNTDIEQSRNAENES :: .. ::. : .:. : : CCDS32 --------------PG-VEARTPADR-----GPSPAP------------------APAAS 890 900 1020 1030 1040 1050 1060 1070 pF1KE3 PKGQASSNGCAALKKDSLPLLPKNRGMVIVNGGLKQDERPAANPQQENISQNPQHEDDHK :. : .:: : . .:. ::. CCDS32 PQ-------------------PGSRG-----------PRDGLGPE-------PQEPPPGP 910 920 930 1080 1090 1100 1110 1120 pF1KE3 SPSWISENPLAANENVSPGIPSAEEQLSSIAKGVNGSSQ---APTSG---KYCRLCDIQF :: : :: : : : : : . :. :::. :. :: . .:::::.:.: CCDS32 PPS-----PAAAPEAVPP--PPAPPSYSD--KGVQTPSKGTPAPLPNGNHRYCRLCNIKF 940 950 960 970 980 1130 1140 1150 pF1KE3 NNLSNFITHKKFYCSSHAAEHVK ..::.::.:::.::::::::::: CCDS32 SSLSTFIAHKKYYCSSHAAEHVK 990 1000 1151 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 15:19:45 2016 done: Mon Nov 7 15:19:46 2016 Total Scan time: 3.590 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]