FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5770, 761 aa 1>>>pF1KB5770 761 - 761 aa - 761 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.4314+/-0.00115; mu= 15.0714+/- 0.069 mean_var=98.3304+/-19.169, 0's: 0 Z-trim(103.3): 37 B-trim: 0 in 0/50 Lambda= 0.129339 statistics sampled from 7326 (7348) to 7326 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.575), E-opt: 0.2 (0.226), width: 16 Scan time: 3.580 The best scores are: opt bits E(32554) CCDS8243.1 THAP12 gene_id:5612|Hs108|chr11 ( 761) 5046 952.9 0 CCDS41302.1 ZMYM1 gene_id:79830|Hs108|chr1 (1142) 522 108.9 4.8e-23 >>CCDS8243.1 THAP12 gene_id:5612|Hs108|chr11 (761 aa) initn: 5046 init1: 5046 opt: 5046 Z-score: 5092.0 bits: 952.9 E(32554): 0 Smith-Waterman score: 5046; 100.0% identity (100.0% similar) in 761 aa overlap (1-761:1-761) 10 20 30 40 50 60 pF1KB5 MPNFCAAPNCTRKSTQSDLAFFRFPRDPARCQKWVENCRRADLEDKTPDQLNKHYRLCAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 MPNFCAAPNCTRKSTQSDLAFFRFPRDPARCQKWVENCRRADLEDKTPDQLNKHYRLCAK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 HFETSMICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIRTLKQKKID :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 HFETSMICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIRTLKQKKID 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 ETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEILILMGKQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 ETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEILILMGKQN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 IPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKRFETTAVNTLFCSKTQQRQMLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 IPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKRFETTAVNTLFCSKTQQRQMLE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 ICESCIREETLREVRDSHFFSIITDDVVDIAGEEHLPVLVRFVDESHNLREEFIGFLPYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 ICESCIREETLREVRDSHFFSIITDDVVDIAGEEHLPVLVRFVDESHNLREEFIGFLPYE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 ADAEILAVKFHTMITEKWGLNMEYCRGQAYIVSSGFSSKMKVVASRLLEKYPQAIYTLCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 ADAEILAVKFHTMITEKWGLNMEYCRGQAYIVSSGFSSKMKVVASRLLEKYPQAIYTLCS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 SCALNMWLAKSVPVMGVSVALGTIEEVCSFFHRSPQLLLELDNVISVLFQNSKERGKELK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 SCALNMWLAKSVPVMGVSVALGTIEEVCSFFHRSPQLLLELDNVISVLFQNSKERGKELK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB5 EICHSQWTGRHDAFEILVELLQALVLCLDGINSDTNIRWNNYIAGRAFVLCSAVSDFDFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 EICHSQWTGRHDAFEILVELLQALVLCLDGINSDTNIRWNNYIAGRAFVLCSAVSDFDFI 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB5 VTIVVLKNVLSFTRAFGKNLQGQTSDVFFAAGSLTAVLHSLNEVMENIEVYHEFWFEEAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 VTIVVLKNVLSFTRAFGKNLQGQTSDVFFAAGSLTAVLHSLNEVMENIEVYHEFWFEEAT 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB5 NLATKLDIQMKLPGKFRRAHQGNLESQLTSESYYKETLSVPTVEHIIQELKDIFSEQHLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 NLATKLDIQMKLPGKFRRAHQGNLESQLTSESYYKETLSVPTVEHIIQELKDIFSEQHLK 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB5 ALKCLSLVPSVMGQLKFNTSEEHHADMYRSDLPNPDTLSAELHCWRIKWKHRGKDIELPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 ALKCLSLVPSVMGQLKFNTSEEHHADMYRSDLPNPDTLSAELHCWRIKWKHRGKDIELPS 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB5 TIYEALHLPDIKFFPNVYALLKVLCILPVMKVENERYENGRKRLKAYLRNTLTDQRSSNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 TIYEALHLPDIKFFPNVYALLKVLCILPVMKVENERYENGRKRLKAYLRNTLTDQRSSNL 670 680 690 700 710 720 730 740 750 760 pF1KB5 ALLNINFDIKHDLDLMVDTYIKLYTSKSELPTDNSETVENT ::::::::::::::::::::::::::::::::::::::::: CCDS82 ALLNINFDIKHDLDLMVDTYIKLYTSKSELPTDNSETVENT 730 740 750 760 >>CCDS41302.1 ZMYM1 gene_id:79830|Hs108|chr1 (1142 aa) initn: 266 init1: 109 opt: 522 Z-score: 527.2 bits: 108.9 E(32554): 4.8e-23 Smith-Waterman score: 528; 26.0% identity (57.6% similar) in 616 aa overlap (146-733:533-1122) 120 130 140 150 160 170 pF1KB5 QKKIDETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKE-NKEYLKSLFEILI .: : : . .. : ::.::: ..: .. CCDS41 WKKTLEKFRKHEKSEMHLKSLEFWREYQFCDGAVSDDLSIHSKQIEGNKKYLKLIIENIL 510 520 530 540 550 560 180 190 200 210 220 230 pF1KB5 LMGKQNIPLDGHE--ADEIPEGLFTPDNFQALLECRI-NSGEEVLRKRFETTAVNTLFCS ..::: .:: :.. .. . .: :: ::: : ..:::.. :. .. :. : . CCDS41 FLGKQCLPLRGNDQSVSSVNKG-----NFLELLEMRAKDKGEETF--RLMNSQVD--FYN 570 580 590 600 610 240 250 260 270 280 pF1KB5 KTQ-QRQMLEICESCIREETLREVRDSHFFSIITDDVVDIAGEEHLPVLVRFVDESHN-- .:: : ...:: .. . .. . :. :: :::: :.... : .:.: . ::. ..: . CCDS41 STQIQSDIIEIIKTEMLQDIVNEINDSSAFSIICDETINSAMKEQLSICVRYPQKSSKAI 620 630 640 650 660 670 290 300 310 320 330 340 pF1KB5 -LREEFIGFLPYEADAEILAVKFHTMIT---EKWGLNMEYCRGQAYIVSSGFSSKMKVVA ..:.:.::. : :. ....: : .. :..:. .:::: ..... :.. .: CCDS41 LIKERFLGFVDTE---EMTGTHLHRTIKTYLQQIGVDMDKIHGQAYDSTTNLKIKFNKIA 680 690 700 710 720 730 350 360 370 380 390 400 pF1KB5 SRLLEKYPQAIYTLCSSCALNMWLAKSVP-VMGVSVALGTIEEVCSFFHRSPQLLLELDN ... .. :.:.: : . :.. . . : . :: :. . . . : ..: .. : CCDS41 AEFKKEEPRALYIHCYAHFLDLSIIRFCKEVKELRSALKTLSSLFNTICMSGEMLANFRN 740 750 760 770 780 790 410 420 430 440 450 460 pF1KB5 VISVLFQNSKERGKELKEICHSQWTGRHDAFEILVELLQALVLCLDGINSDTNIRWNNYI . : ::. . :.: .: :: . .. ... : .. :. : : .. :. . CCDS41 IYR-LSQNKTCK----KHISQSCWTVHDRTLLSVIDSLPEIIETLEVIASHSS---NTSF 800 810 820 830 840 470 480 490 500 510 520 pF1KB5 AGRAFVLCSAVSDFDFIVTIVVLKNVLSFTRAFGKNLQGQTSDVFFAAGSLTAVLHSLNE : . : . :: :.:. . : ::: : ..:.::..: :.: .... :.:. :. CCDS41 ADELSHLLTLVSKFEFVFCLKFLYRVLSVTGILSKELQNKTIDIFSLSSKIEAILECLSS 850 860 870 880 890 900 530 540 550 560 570 pF1KB5 VMENIEVYHE-FW--FEEATNLATKLDIQMKLPG--KFRRAHQ----GNLESQL---TSE : .:: . .: :: . : .... :. : :. .. :: .... ..: CCDS41 --ERNDVYFKTIWDGTEEICQKITCKGFKVEKPSLQKRRKIQKSVDLGNSDNMFFPTSTE 910 920 930 940 950 960 580 590 600 610 620 630 pF1KB5 SYYKETLSVPTVEHIIQELKDIFSEQHLKALKCLSLVPSVMGQLKFNTSEEHHADMYRSD :: .. .. :.:.:: ::: .: .: . .. .:. .: ..:. : CCDS41 EQYKINIYYQGLDTILQNLKLCFSEFDYCKIKQISELLFKWNEPLNETTAKHVQEFYKLD 970 980 990 1000 1010 1020 640 650 660 670 680 pF1KB5 LPNPDTLSAELHCWRIKWKHRGKDIELPST----IYEALHLPDIKFFPNVYALLKVLCIL : : .... . :.. : : ..:: .: : . :: . CCDS41 EDIIPELRFYRHYAKLNFVIDDSCINFVSLGCLFIQHGLH-SNI---PCLSKLLYIALSW 1030 1040 1050 1060 1070 690 700 710 720 730 740 pF1KB5 PVMKVENERYENGRKRLKAYLRNTLTDQRSSNLALLNINFDIKHDLDLMVDTYIKLYTSK :. .. .: . :::.:: ::. ... .. ::. .. .. . : CCDS41 PITSASTENSFSTLPRLKTYLCNTMGQEKLTGPALMAVEQELVNKLMEPERLNEIVEKFI 1080 1090 1100 1110 1120 1130 750 760 pF1KB5 SELPTDNSETVENT CCDS41 SQMKEI 1140 761 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 15:05:13 2016 done: Sun Nov 6 15:05:14 2016 Total Scan time: 3.580 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]