FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0115, 228 aa
1>>>pF1KE0115 228 - 228 aa - 228 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6906+/-0.00036; mu= 12.7359+/- 0.022
mean_var=77.8623+/-16.105, 0's: 0 Z-trim(115.1): 62 B-trim: 1433 in 2/51
Lambda= 0.145349
statistics sampled from 25277 (25341) to 25277 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.679), E-opt: 0.2 (0.297), width: 16
Scan time: 6.600
The best scores are: opt bits E(85289)
NP_113623 (OMIM: 612531) THAP domain-containing pr ( 228) 1515 326.8 1.9e-89
NP_612359 (OMIM: 612532) THAP domain-containing pr ( 175) 280 67.7 1.4e-11
XP_011540703 (OMIM: 612532) PREDICTED: THAP domain ( 148) 275 66.7 2.5e-11
NP_060575 (OMIM: 602629,609520) THAP domain-contai ( 213) 277 67.2 2.5e-11
XP_016858250 (OMIM: 612532) PREDICTED: THAP domain ( 168) 275 66.7 2.7e-11
NP_001182682 (OMIM: 612532) THAP domain-containing ( 239) 275 66.8 3.7e-11
XP_005263589 (OMIM: 612532) PREDICTED: THAP domain ( 238) 270 65.7 7.6e-11
NP_001182681 (OMIM: 612532) THAP domain-containing ( 238) 270 65.7 7.6e-11
NP_057047 (OMIM: 612533) THAP domain-containing pr ( 577) 247 61.1 4.5e-09
XP_005247073 (OMIM: 612533) PREDICTED: THAP domain ( 711) 247 61.2 5.3e-09
NP_001123947 (OMIM: 612534) THAP domain-containing ( 395) 230 57.5 3.9e-08
NP_001318031 (OMIM: 612536) THAP domain-containing ( 233) 225 56.3 5.2e-08
NP_689871 (OMIM: 612536) THAP domain-containing pr ( 274) 225 56.3 5.9e-08
NP_004696 (OMIM: 607374) 52 kDa repressor of the i ( 761) 229 57.4 7.7e-08
XP_011529968 (OMIM: 612535) PREDICTED: THAP domain ( 222) 220 55.2 1e-07
XP_011529969 (OMIM: 612535) PREDICTED: THAP domain ( 222) 220 55.2 1e-07
NP_653322 (OMIM: 612535) THAP domain-containing pr ( 222) 220 55.2 1e-07
XP_016859745 (OMIM: 612533) PREDICTED: THAP domain ( 601) 197 50.7 6.6e-06
XP_011509593 (OMIM: 612533) PREDICTED: THAP domain ( 628) 197 50.7 6.9e-06
NP_078948 (OMIM: 612537) DNA transposase THAP9 iso ( 903) 199 51.2 6.9e-06
XP_005262831 (OMIM: 612535) PREDICTED: THAP domain ( 160) 185 47.8 1.3e-05
NP_085050 (OMIM: 609518) THAP domain-containing pr ( 309) 179 46.7 5.2e-05
NP_001008695 (OMIM: 609518) THAP domain-containing ( 309) 179 46.7 5.2e-05
XP_016863290 (OMIM: 612535) PREDICTED: THAP domain ( 147) 169 44.4 0.00012
XP_006714172 (OMIM: 612535) PREDICTED: THAP domain ( 172) 169 44.5 0.00014
NP_001304720 (OMIM: 612535) THAP domain-containing ( 180) 169 44.5 0.00014
XP_016863289 (OMIM: 612535) PREDICTED: THAP domain ( 180) 169 44.5 0.00014
XP_005262829 (OMIM: 612535) PREDICTED: THAP domain ( 181) 169 44.5 0.00014
XP_016864092 (OMIM: 612537) PREDICTED: DNA transpo ( 916) 178 46.8 0.00015
NP_064532 (OMIM: 612538) THAP domain-containing pr ( 257) 153 41.2 0.002
NP_001318032 (OMIM: 612536) THAP domain-containing ( 231) 148 40.1 0.0037
NP_001318033 (OMIM: 612536) THAP domain-containing ( 231) 148 40.1 0.0037
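
The rows of the best-scores table above share one layout: accession, description, library sequence length in parentheses, then the opt score, bit score, and E() value. A minimal parsing sketch follows, assuming that layout holds for every row. The names `Hit` and `parse_hit_line` are hypothetical helpers for illustration and are not part of the FASTA package.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    accession: str
    description: str
    length: int      # library sequence length (aa)
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # E() value for this library size


# Assumed row layout: ACCESSION DESCRIPTION ( LEN) OPT BITS EVALUE
_HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>\S+)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a best-scores row, or None if the line does not match."""
    m = _HIT_RE.match(line.strip())
    if m is None:
        return None
    try:
        evalue = float(m["e"])
    except ValueError:
        return None
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=evalue,
    )


if __name__ == "__main__":
    row = "NP_612359 (OMIM: 612532) THAP domain-containing pr ( 175) 280 67.7 1.4e-11"
    print(parse_hit_line(row))
```

Lines that are not table rows, such as the "The best scores are:" header, return None, so the function can be applied to every line of the report and the None results discarded.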
>>NP_113623 (OMIM: 612531) THAP domain-containing protei (228 aa)
initn: 1515 init1: 1515 opt: 1515 Z-score: 1726.9 bits: 326.8 E(85289): 1.9e-89
Smith-Waterman score: 1515; 100.0% identity (100.0% similar) in 228 aa overlap (1-228:1-228)
               10        20        30        40        50        60
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_113 MPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEAS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_113 CFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_113 LLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEANS
              130       140       150       160       170       180

              190       200       210       220
pF1KE0 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
       ::::::::::::::::::::::::::::::::::::::::::::::::
NP_113 VLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTKSTFI
              190       200       210       220
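
The Z-score and E() value in each alignment header appear to be linked by an extreme-value distribution. The sketch below assumes Z-scores are scaled to mean 50 and standard deviation 10, and that E() is the tail probability multiplied by the number of library sequences (85289 here). Under those assumptions it reproduces the E() values reported for NP_113623 (Z-score 1726.9) and NP_612359 (Z-score 329.0). The function name `evalue_from_zscore` is hypothetical. This is an illustration of the relationship, not the FASTA source code.

```python
import math

EULER_GAMMA = 0.5772156649          # Euler-Mascheroni constant
EVD_SCALE = math.pi / math.sqrt(6)  # about 1.2825


def evalue_from_zscore(z_score: float, n_sequences: int) -> float:
    """Estimate E() from a Z-score, assuming mean 50 / sd 10 scaling."""
    z = (z_score - 50.0) / 10.0
    x = math.exp(-EVD_SCALE * z - EULER_GAMMA)
    # P(Z > z) = 1 - exp(-x); expm1 keeps precision when x is tiny.
    p = -math.expm1(-x)
    return p * n_sequences


if __name__ == "__main__":
    for z_score in (1726.9, 329.0):
        print(z_score, f"{evalue_from_zscore(z_score, 85289):.1e}")
```

Using `math.expm1` instead of `1 - math.exp(-x)` matters here: for probabilities near 1e-16 and below, the naive subtraction loses all significant digits.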
>>NP_612359 (OMIM: 612532) THAP domain-containing protei (175 aa)
initn: 257 init1: 185 opt: 280 Z-score: 329.0 bits: 67.7 E(85289): 1.4e-11
Smith-Waterman score: 280; 34.6% identity (63.9% similar) in 133 aa overlap (1-131:1-130)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
NP_612 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: : .... .. ...:. : .. :.. .
NP_612 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSS---QKEKTSPCRS
70 80 90 100 110
120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
::: : . . .:
NP_612 QVLPEAGAGEDSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE
120 130 140 150 160 170
>>XP_011540703 (OMIM: 612532) PREDICTED: THAP domain-con (148 aa)
initn: 257 init1: 185 opt: 275 Z-score: 324.5 bits: 66.7 E(85289): 2.5e-11
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
XP_011 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: : .... .. ...:. :
XP_011 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
XP_011 DSPGRNMDTALEELQLPPNAEGHVKQIP
130 140
>>NP_060575 (OMIM: 602629,609520) THAP domain-containing (213 aa)
initn: 339 init1: 171 opt: 277 Z-score: 324.4 bits: 67.2 E(85289): 2.5e-11
Smith-Waterman score: 366; 35.3% identity (63.2% similar) in 201 aa overlap (1-185:1-197)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYNKHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFEA
: .:.: :: . :.: .:::.::: :. ::: :::::: : :.. .::.::
NP_060 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 SCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAG-P---SNLKSNI
.:: ... :: .:::::: .::. .. :...::. ... : : :.. . :
NP_060 DCFKRECNNKLLKENAVPTIF-LCTEPHD---KKEDLLEPQEQLPPPPLPPPVSQVDAAI
70 80 90 100 110
120 130 140 150 160
pF1KE0 S----------SQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRR
. . .:. .:.:. .. :. .::: .::... .::.:.:: :. :: :.
NP_060 GLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQRCRRQERQ
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE0 WIKATCLVK-NLEANSVLPKGTSEHMLPTALSSLPLEDFKILEQDQQDKTLLSLNLKQTK
: .:. . : ..: .:
NP_060 LEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEVPA
180 190 200 210
>>XP_016858250 (OMIM: 612532) PREDICTED: THAP domain-con (168 aa)
initn: 257 init1: 185 opt: 275 Z-score: 323.6 bits: 66.7 E(85289): 2.7e-11
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
XP_016 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: : .... .. ...:. :
XP_016 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
XP_016 DSPGRNMDTALEELQLPPNAEGHVKQAMLFNVENGTPASREALWLSEE
130 140 150 160
>>NP_001182682 (OMIM: 612532) THAP domain-containing pro (239 aa)
initn: 309 init1: 185 opt: 275 Z-score: 321.4 bits: 66.8 E(85289): 3.7e-11
Smith-Waterman score: 275; 37.7% identity (67.0% similar) in 106 aa overlap (1-104:1-106)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
NP_001 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: : .... .. ...:. :
NP_001 PECFSAFGNRKNLKHNAVPTVFAFQDPTQQVRENTDPASERGNASSSQKEKVLPEAGAGE
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE0 QVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKNLEA
NP_001 DSPGRNMDTALEELQLPPNAEGHVKQVSPRRPQATEAVGRPTGPAGLRRTPNKQPSDHSY
130 140 150 160 170 180
>>XP_005263589 (OMIM: 612532) PREDICTED: THAP domain-con (238 aa)
initn: 309 init1: 185 opt: 270 Z-score: 315.7 bits: 65.7 E(85289): 7.6e-11
Smith-Waterman score: 270; 45.2% identity (70.2% similar) in 84 aa overlap (1-82:1-84)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
XP_005 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: :
XP_005 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
70 80 90 100 110 120
>>NP_001182681 (OMIM: 612532) THAP domain-containing pro (238 aa)
initn: 309 init1: 185 opt: 270 Z-score: 315.7 bits: 65.7 E(85289): 7.6e-11
Smith-Waterman score: 270; 45.2% identity (70.2% similar) in 84 aa overlap (1-82:1-84)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYN-KHINISFHRFPLD-PKRRKEWVRLVRRKNFVPGKHTFLCSKHFE
:: .::: : . :. .. ...:::::.. :. :::: . : :: : .:: .::.::.
NP_001 MPKSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFR
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 ASCFDLTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQ
::. :. . :: .::::.: :
NP_001 PECFSAFGNRKNLKHNAVPTVFAFQDPTQVRENTDPASERGNASSSQKEKVLPEAGAGED
70 80 90 100 110 120
>>NP_057047 (OMIM: 612533) THAP domain-containing protei (577 aa)
initn: 186 init1: 126 opt: 247 Z-score: 283.9 bits: 61.1 E(85289): 4.5e-09
Smith-Waterman score: 247; 34.5% identity (61.5% similar) in 148 aa overlap (1-143:1-146)
10 20 30 40 50
pF1KE0 MPTNCAAAGCATTYNK--HINISFHRFPL-DPKRRKEWVRLVRRKNFVPGKHTFLCSKHF
: :::..:.. .: . .::::::: : :: .:.. :.: :..: :..::::.::
NP_057 MVICCAAVNCSNRQGKGEKRAVSFHRFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHF
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 EASCFD--LTGQTRRLKMDAVPTIFDFCTHIKSMKLKSRNLLKKNNSCSPAGPSNLKSNI
. :. : : : :: :::.:: . . .. ..:. .:. : . .: . .:
NP_057 TKDSFSKRLEDQHRLLKPTAVPSIFHLTEKKRGAGGHGRTR-RKDASKATGGVRGHSSAA
70 80 90 100 110
120 130 140 150 160 170
pF1KE0 SSQQVLLEHSYAFRNPMEAKKRIIKLEKEIASLRRKMKTCLQKERRATRRWIKATCLVKN
.:. . . ::: :: . .:..
NP_057 TSRGAAGWSPSSSGNPM-AKPESRRLKQAALQGEATPRAAQEAASQEQAQQALERTPGDG
120 130 140 150 160 170
>>XP_005247073 (OMIM: 612533) PREDICTED: THAP domain-con (711 aa)
initn: 166 init1: 126 opt: 247 Z-score: 282.6 bits: 61.2 E(85289): 5.3e-09
Smith-Waterman score: 247; 34.5% identity (61.5% similar) in 148 aa overlap (1-143:108-253)
10 20
pF1KE0 MPTNCAAAGCATTYNK--HINISFHRFPL-
: :::..:.. .: . .:::::::
XP_005 SPPRSLPRGGPRAGGRLGPGPGCAAGPRPAMVICCAAVNCSNRQGKGEKRAVSFHRFPLK
80 90 100 110 120 130
30 40 50 60 70 80
pF1KE0 DPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEASCFD--LTGQTRRLKMDAVPTIFDFCTH
: :: .:.. :.: :..: :..::::.:: . :. : : : :: :::.:: . .
XP_005 DSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDSFSKRLEDQHRLLKPTAVPSIFHLTEK
140 150 160 170 180 190
90 100 110 120 130 140
pF1KE0 IKSMKLKSRNLLKKNNSCSPAGPSNLKSNISSQQVLLEHSYAFRNPMEAKKRIIKLEKEI
.. ..:. .:. : . .: . .: .:. . . ::: :: . .:..
XP_005 KRGAGGHGRTR-RKDASKATGGVRGHSSAATSRGAAGWSPSSSGNPM-AKPESRRLKQAA
200 210 220 230 240 250
150 160 170 180 190 200
pF1KE0 ASLRRKMKTCLQKERRATRRWIKATCLVKNLEANSVLPKGTSEHMLPTALSSLPLEDFKI
XP_005 LQGEATPRAAQEAASQEQAQQALERTPGDGLATMVAGSQGKAEASATDAGDESATSSIEG
260 270 280 290 300 310
228 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 02:28:17 2016 done: Fri Nov 4 02:28:18 2016
Total Scan time: 6.600 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]