FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5770, 761 aa 1>>>pF1KB5770 761 - 761 aa - 761 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.6175+/-0.000494; mu= 14.1782+/- 0.031 mean_var=108.4439+/-22.535, 0's: 0 Z-trim(110.7): 54 B-trim: 943 in 2/48 Lambda= 0.123161 statistics sampled from 19112 (19158) to 19112 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.561), E-opt: 0.2 (0.225), width: 16 Scan time: 8.690 The best scores are: opt bits E(85289) NP_004696 (OMIM: 607374) 52 kDa repressor of the i ( 761) 5046 908.4 0 NP_113623 (OMIM: 612531) THAP domain-containing pr ( 228) 229 52.2 2.9e-06 NP_001123947 (OMIM: 612534) THAP domain-containing ( 395) 185 44.5 0.001 XP_011540703 (OMIM: 612532) PREDICTED: THAP domain ( 148) 172 41.9 0.0023 XP_016858250 (OMIM: 612532) PREDICTED: THAP domain ( 168) 172 42.0 0.0025 NP_085050 (OMIM: 609518) THAP domain-containing pr ( 309) 174 42.5 0.0033 NP_001008695 (OMIM: 609518) THAP domain-containing ( 309) 174 42.5 0.0033 NP_001182682 (OMIM: 612532) THAP domain-containing ( 239) 172 42.1 0.0034 NP_612359 (OMIM: 612532) THAP domain-containing pr ( 175) 162 40.2 0.009 >>NP_004696 (OMIM: 607374) 52 kDa repressor of the inhib (761 aa) initn: 5046 init1: 5046 opt: 5046 Z-score: 4851.4 bits: 908.4 E(85289): 0 Smith-Waterman score: 5046; 100.0% identity (100.0% similar) in 761 aa overlap (1-761:1-761) 10 20 30 40 50 60 pF1KB5 MPNFCAAPNCTRKSTQSDLAFFRFPRDPARCQKWVENCRRADLEDKTPDQLNKHYRLCAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 MPNFCAAPNCTRKSTQSDLAFFRFPRDPARCQKWVENCRRADLEDKTPDQLNKHYRLCAK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 HFETSMICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIRTLKQKKID :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 
HFETSMICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIRTLKQKKID 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 ETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEILILMGKQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 ETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEILILMGKQN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 IPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKRFETTAVNTLFCSKTQQRQMLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 IPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKRFETTAVNTLFCSKTQQRQMLE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 ICESCIREETLREVRDSHFFSIITDDVVDIAGEEHLPVLVRFVDESHNLREEFIGFLPYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 ICESCIREETLREVRDSHFFSIITDDVVDIAGEEHLPVLVRFVDESHNLREEFIGFLPYE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 ADAEILAVKFHTMITEKWGLNMEYCRGQAYIVSSGFSSKMKVVASRLLEKYPQAIYTLCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 ADAEILAVKFHTMITEKWGLNMEYCRGQAYIVSSGFSSKMKVVASRLLEKYPQAIYTLCS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 SCALNMWLAKSVPVMGVSVALGTIEEVCSFFHRSPQLLLELDNVISVLFQNSKERGKELK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 SCALNMWLAKSVPVMGVSVALGTIEEVCSFFHRSPQLLLELDNVISVLFQNSKERGKELK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB5 EICHSQWTGRHDAFEILVELLQALVLCLDGINSDTNIRWNNYIAGRAFVLCSAVSDFDFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 EICHSQWTGRHDAFEILVELLQALVLCLDGINSDTNIRWNNYIAGRAFVLCSAVSDFDFI 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB5 VTIVVLKNVLSFTRAFGKNLQGQTSDVFFAAGSLTAVLHSLNEVMENIEVYHEFWFEEAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 VTIVVLKNVLSFTRAFGKNLQGQTSDVFFAAGSLTAVLHSLNEVMENIEVYHEFWFEEAT 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB5 NLATKLDIQMKLPGKFRRAHQGNLESQLTSESYYKETLSVPTVEHIIQELKDIFSEQHLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 
NLATKLDIQMKLPGKFRRAHQGNLESQLTSESYYKETLSVPTVEHIIQELKDIFSEQHLK 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB5 ALKCLSLVPSVMGQLKFNTSEEHHADMYRSDLPNPDTLSAELHCWRIKWKHRGKDIELPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 ALKCLSLVPSVMGQLKFNTSEEHHADMYRSDLPNPDTLSAELHCWRIKWKHRGKDIELPS 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB5 TIYEALHLPDIKFFPNVYALLKVLCILPVMKVENERYENGRKRLKAYLRNTLTDQRSSNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 TIYEALHLPDIKFFPNVYALLKVLCILPVMKVENERYENGRKRLKAYLRNTLTDQRSSNL 670 680 690 700 710 720 730 740 750 760 pF1KB5 ALLNINFDIKHDLDLMVDTYIKLYTSKSELPTDNSETVENT ::::::::::::::::::::::::::::::::::::::::: NP_004 ALLNINFDIKHDLDLMVDTYIKLYTSKSELPTDNSETVENT 730 740 750 760 >>NP_113623 (OMIM: 612531) THAP domain-containing protei (228 aa) initn: 218 init1: 93 opt: 229 Z-score: 233.4 bits: 52.2 E(85289): 2.9e-06 Smith-Waterman score: 229; 37.7% identity (67.0% similar) in 106 aa overlap (1-105:1-99) 10 20 30 40 50 pF1KB5 MPNFCAAPNC-TRKSTQSDLAFFRFPRDPARCQKWVENCRRADLEDKTPDQLNKHYRLCA ::. ::: .: : . . ...: ::: :: : ..::. :: .. .: .:: ::. NP_113 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNF---VP---GKHTFLCS 10 20 30 40 50 60 70 80 90 100 110 pF1KB5 KHFETSMICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIRTLKQKKI ::::.: . :. : :. .:.:::::. .:... . . :. .:. NP_113 KHFEASCFDLTGQTRR-LKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB5 DETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEILILMGKQ NP_113 NISSQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLV 120 130 140 150 160 170 >>NP_001123947 (OMIM: 612534) THAP domain-containing pro (395 aa) initn: 120 init1: 64 opt: 185 Z-score: 187.7 bits: 44.5 E(85289): 0.001 Smith-Waterman score: 185; 28.6% identity (57.1% similar) in 147 aa overlap (1-140:1-140) 10 20 30 40 50 pF1KB5 MPNFCAAPNCT----RKSTQSDLAFFRFP-RDPARCQKWVENCRRADLEDKTPDQLNKHY :: .::: : :.. . :.:. :: .: : .::..: .: .. .: .:. 
NP_001 MPRYCAAICCKNRRGRNNKDRKLSFYPFPLHDKERLEKWLKNMKR---DSWVP---SKYQ 10 20 30 40 50 60 70 80 90 100 110 pF1KB5 RLCAKHFETSMICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIRTLK ::. :: . . : :...:.::::.: .. ..: :. ::: .. NP_001 FLCSDHFTPDSLDIRWGIR-YLKQTAVPTIFSLPEDNQGKDPSKKKSQKKNLEDEKEVCP 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB5 QKKIDETSEQEQKHKETNNSNA--QNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEIL . : .:. .. .:. :... :.: NP_001 KAKSEESFVLNETKKNIVNTDVPHQHPELLHSSSLVKPPAPKTGSIQNNMLTLNLVKQHT 120 130 140 150 160 170 >>XP_011540703 (OMIM: 612532) PREDICTED: THAP domain-con (148 aa) initn: 105 init1: 48 opt: 172 Z-score: 181.4 bits: 41.9 E(85289): 0.0023 Smith-Waterman score: 172; 27.9% identity (55.2% similar) in 154 aa overlap (1-145:1-142) 10 20 30 40 50 pF1KB5 MPNFCAAPNCTRK--STQSDLAFFRFPRD-PARCQKWVENCRRADLEDKTPDQLNKHYRL ::. ::: .: . : ...:.: ::: . : ..:: : :.... : .: . XP_011 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPK------QHTVI 10 20 30 40 50 60 70 80 90 100 110 pF1KB5 CAKHFETSMICRTS-PYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSE--DEIRTL :..::. : .. : :. ::.::.: ...: .. :. :: . . XP_011 CSEHFRPE--CFSAFGNRKNLKHNAVPTVF----AFQDPTQQVRENTDPASERGNASSSQ 60 70 80 90 100 120 130 140 150 160 170 pF1KB5 KQKKIDETSEQEQ---KHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFE :.: . :.. :. .. .: . : : . :: XP_011 KEKVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQIP 110 120 130 140 180 190 200 210 220 230 pF1KB5 ILILMGKQNIPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKRFETTAVNTLFCS >>XP_016858250 (OMIM: 612532) PREDICTED: THAP domain-con (168 aa) initn: 105 init1: 48 opt: 172 Z-score: 180.6 bits: 42.0 E(85289): 0.0025 Smith-Waterman score: 172; 27.9% identity (55.2% similar) in 154 aa overlap (1-145:1-142) 10 20 30 40 50 pF1KB5 MPNFCAAPNCTRK--STQSDLAFFRFPRD-PARCQKWVENCRRADLEDKTPDQLNKHYRL ::. ::: .: . : ...:.: ::: . : ..:: : :.... : .: . XP_016 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPK------QHTVI 10 20 30 40 50 60 70 80 90 100 110 pF1KB5 CAKHFETSMICRTS-PYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSE--DEIRTL :..::. : .. : :. 
::.::.: ...: .. :. :: . . XP_016 CSEHFRPE--CFSAFGNRKNLKHNAVPTVF----AFQDPTQQVRENTDPASERGNASSSQ 60 70 80 90 100 120 130 140 150 160 170 pF1KB5 KQKKIDETSEQEQ---KHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFE :.: . :.. :. .. .: . : : . :: XP_016 KEKVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB5 ILILMGKQNIPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKRFETTAVNTLFCS >>NP_085050 (OMIM: 609518) THAP domain-containing protei (309 aa) initn: 52 init1: 52 opt: 174 Z-score: 178.7 bits: 42.5 E(85289): 0.0033 Smith-Waterman score: 174; 33.0% identity (59.0% similar) in 100 aa overlap (1-92:1-99) 10 20 30 40 50 pF1KB5 MPNFCAAPNC----TRKSTQSDLAFFRFPR-DPARCQKWVENCRRADLEDK-TPDQLNKH :: :.: .: ::.. . ..: :.:. : : :. ::.: : . : ... NP_085 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB5 YRLCAKHFETSM--ICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIR .:.:::: . . : :. :...:.::::. :.: NP_085 IYFCSKHFEEDCFELVGISGYHR-LKEGAVPTIFESFSKLRRTTKTKGHSYPPGPAEVSR 70 80 90 100 110 120 130 140 150 160 170 pF1KB5 TLKQKKIDETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEI NP_085 LRRCRKRCSEGRGPTTPFSPPPPADVTCFPVEEASAPATLPASPAGRLEPGLSSPFSDLL 120 130 140 150 160 170 >>NP_001008695 (OMIM: 609518) THAP domain-containing pro (309 aa) initn: 52 init1: 52 opt: 174 Z-score: 178.7 bits: 42.5 E(85289): 0.0033 Smith-Waterman score: 174; 33.0% identity (59.0% similar) in 100 aa overlap (1-92:1-99) 10 20 30 40 50 pF1KB5 MPNFCAAPNC----TRKSTQSDLAFFRFPR-DPARCQKWVENCRRADLEDK-TPDQLNKH :: :.: .: ::.. . ..: :.:. : : :. ::.: : . : ... NP_001 MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEY 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB5 YRLCAKHFETSM--ICRTSPYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIR .:.:::: . . : :. :...:.::::. 
:.: NP_001 IYFCSKHFEEDCFELVGISGYHR-LKEGAVPTIFESFSKLRRTTKTKGHSYPPGPAEVSR 70 80 90 100 110 120 130 140 150 160 170 pF1KB5 TLKQKKIDETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEI NP_001 LRRCRKRCSEGRGPTTPFSPPPPADVTCFPVEEASAPATLPASPAGRLEPGLSSPFSDLL 120 130 140 150 160 170 >>NP_001182682 (OMIM: 612532) THAP domain-containing pro (239 aa) initn: 133 init1: 48 opt: 172 Z-score: 178.4 bits: 42.1 E(85289): 0.0034 Smith-Waterman score: 172; 27.9% identity (55.2% similar) in 154 aa overlap (1-145:1-142) 10 20 30 40 50 pF1KB5 MPNFCAAPNCTRK--STQSDLAFFRFPRD-PARCQKWVENCRRADLEDKTPDQLNKHYRL ::. ::: .: . : ...:.: ::: . : ..:: : :.... : .: . NP_001 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPK------QHTVI 10 20 30 40 50 60 70 80 90 100 110 pF1KB5 CAKHFETSMICRTS-PYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSE--DEIRTL :..::. : .. : :. ::.::.: ...: .. :. :: . . NP_001 CSEHFRPE--CFSAFGNRKNLKHNAVPTVF----AFQDPTQQVRENTDPASERGNASSSQ 60 70 80 90 100 120 130 140 150 160 170 pF1KB5 KQKKIDETSEQEQ---KHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFE :.: . :.. :. .. .: . : : . :: NP_001 KEKVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLR 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB5 ILILMGKQNIPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKRFETTAVNTLFCS NP_001 RTPNKQPSDHSYALLDLDSLKKKLFLTLKENEKLRKRLQAQRLVMRRMSSRLRACKGHQG 170 180 190 200 210 220 >>NP_612359 (OMIM: 612532) THAP domain-containing protei (175 aa) initn: 125 init1: 48 opt: 162 Z-score: 170.8 bits: 40.2 E(85289): 0.009 Smith-Waterman score: 168; 26.6% identity (56.3% similar) in 158 aa overlap (1-154:1-144) 10 20 30 40 50 pF1KB5 MPNFCAAPNCTRK--STQSDLAFFRFPRD-PARCQKWVENCRRADLEDKTPDQLNKHYRL ::. ::: .: . : ...:.: ::: . : ..:: : :.... : .: . NP_612 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPK------QHTVI 10 20 30 40 50 60 70 80 90 100 110 pF1KB5 CAKHFETSMICRTS-PYRTVLRDNAIPTIFDLTSHLNNPHSRHRKRIKELSEDEIRTLKQ :..::. : .. : :. ::.::.: ...: .. :. :: . 
.: NP_612 CSEHFRPE--CFSAFGNRKNLKHNAVPTVFA----FQDPTQQVRENTDPASERGNASSSQ 60 70 80 90 100 120 130 140 150 160 170 pF1KB5 KKIDETSEQEQKHKETNNSNAQNPSEEEGEGQDEDILPLTLEEKENKEYLKSLFEILILM : ..:: ... ... ..:... . .: :: NP_612 K--EKTSPCRSQVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPAS 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB5 GKQNIPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKRFETTAVNTLFCSKTQQR NP_612 REALWLSEE 170 761 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 15:05:14 2016 done: Sun Nov 6 15:05:15 2016 Total Scan time: 8.690 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]
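Note on the statistics columns: the bit scores and E(85289) values in the report above appear to follow the usual relation between a bit score and an expectation value, E ≈ D × m × n × 2^(−bits), where m is the query length (761 aa), n is the length of the library sequence, and D is the number of library sequences (85289). The sketch below is a sanity check under that assumption, not FASTA's internal computation. The function name `expected_evalue` is hypothetical, and the hit lengths, bit scores, and reported E-values are copied from the "best scores" table. Because the report rounds bit scores to one decimal place, the estimates should agree with the reported values only to within a few percent.

```python
# Sanity check: recompute E() from the bit score, assuming
# E ~= D * m * n * 2**(-bits). This is an approximation used for
# illustration; it is not taken from the FASTA source code.

def expected_evalue(bits: float, query_len: int, subject_len: int,
                    db_sequences: int) -> float:
    """Estimate the expectation value for one hit from its bit score."""
    return db_sequences * query_len * subject_len * 2.0 ** (-bits)


QUERY_LEN = 761        # "Query: pF1KB5770, 761 aa"
DB_SEQUENCES = 85289   # "60827320 residues in 85289 sequences"

# (accession, subject length in aa, bit score, E-value reported by FASTA)
HITS = [
    ("NP_113623", 228, 52.2, 2.9e-06),
    ("NP_001123947", 395, 44.5, 0.001),
    ("NP_085050", 309, 42.5, 0.0033),
]

if __name__ == "__main__":
    for accession, subject_len, bits, reported in HITS:
        estimate = expected_evalue(bits, QUERY_LEN, subject_len, DB_SEQUENCES)
        print(f"{accession}: estimated E = {estimate:.2g}, reported E = {reported:.2g}")
```

The self-hit (NP_004696, 908.4 bits) is left out because its E-value underflows to 0 in the report, so there is nothing meaningful to compare against.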