FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0464, 222 aa
1>>>pF1KE0464 222 - 222 aa - 222 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.2543+/-0.000325; mu= 9.3700+/- 0.020
mean_var=83.8383+/-16.548, 0's: 0 Z-trim(116.3): 32 B-trim: 211 in 2/51
Lambda= 0.140073
statistics sampled from 27336 (27368) to 27336 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.696), E-opt: 0.2 (0.321), width: 16
Scan time: 6.300
The best scores are: opt bits E(85289)
XP_011529968 (OMIM: 612535) PREDICTED: THAP domain ( 222) 1533 319.1 3.6e-87
XP_011529969 (OMIM: 612535) PREDICTED: THAP domain ( 222) 1533 319.1 3.6e-87
NP_653322 (OMIM: 612535) THAP domain-containing pr ( 222) 1533 319.1 3.6e-87
XP_005262829 (OMIM: 612535) PREDICTED: THAP domain ( 181) 1240 259.9 2e-69
XP_005262831 (OMIM: 612535) PREDICTED: THAP domain ( 160) 982 207.7 8.9e-54
XP_016863290 (OMIM: 612535) PREDICTED: THAP domain ( 147) 970 205.3 4.4e-53
XP_006714172 (OMIM: 612535) PREDICTED: THAP domain ( 172) 967 204.7 7.8e-53
NP_001304720 (OMIM: 612535) THAP domain-containing ( 180) 679 146.5 2.7e-35
XP_016863289 (OMIM: 612535) PREDICTED: THAP domain ( 180) 679 146.5 2.7e-35
NP_078948 (OMIM: 612537) DNA transposase THAP9 iso ( 903) 310 72.2 3.1e-12
NP_113623 (OMIM: 612531) THAP domain-containing pr ( 228) 220 53.8 2.7e-07
NP_085050 (OMIM: 609518) THAP domain-containing pr ( 309) 220 53.9 3.6e-07
NP_001008695 (OMIM: 609518) THAP domain-containing ( 309) 220 53.9 3.6e-07
NP_060575 (OMIM: 602629,609520) THAP domain-contai ( 213) 211 52.0 9.1e-07
XP_016864092 (OMIM: 612537) PREDICTED: DNA transpo ( 916) 217 53.4 1.4e-06
NP_057047 (OMIM: 612533) THAP domain-containing pr ( 577) 200 49.9 1e-05
XP_005247073 (OMIM: 612533) PREDICTED: THAP domain ( 711) 200 50.0 1.2e-05
NP_001123947 (OMIM: 612534) THAP domain-containing ( 395) 188 47.4 3.9e-05
XP_011540703 (OMIM: 612532) PREDICTED: THAP domain ( 148) 176 44.8 8.9e-05
XP_016858250 (OMIM: 612532) PREDICTED: THAP domain ( 168) 176 44.9 0.0001
NP_612359 (OMIM: 612532) THAP domain-containing pr ( 175) 176 44.9 0.0001
NP_001182682 (OMIM: 612532) THAP domain-containing ( 239) 176 44.9 0.00014
NP_001182681 (OMIM: 612532) THAP domain-containing ( 238) 174 44.5 0.00018
XP_005263589 (OMIM: 612532) PREDICTED: THAP domain ( 238) 174 44.5 0.00018
NP_004696 (OMIM: 607374) 52 kDa repressor of the i ( 761) 155 40.9 0.0071
>>XP_011529968 (OMIM: 612535) PREDICTED: THAP domain-con (222 aa)
initn: 1533 init1: 1533 opt: 1533 Z-score: 1686.0 bits: 319.1 E(85289): 3.6e-87
Smith-Waterman score: 1533; 100.0% identity (100.0% similar) in 222 aa overlap (1-222:1-222)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
130 140 150 160 170 180
190 200 210 220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
::::::::::::::::::::::::::::::::::::::::::
XP_011 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
190 200 210 220
>>XP_011529969 (OMIM: 612535) PREDICTED: THAP domain-con (222 aa)
initn: 1533 init1: 1533 opt: 1533 Z-score: 1686.0 bits: 319.1 E(85289): 3.6e-87
Smith-Waterman score: 1533; 100.0% identity (100.0% similar) in 222 aa overlap (1-222:1-222)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
130 140 150 160 170 180
190 200 210 220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
::::::::::::::::::::::::::::::::::::::::::
XP_011 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
190 200 210 220
>>NP_653322 (OMIM: 612535) THAP domain-containing protei (222 aa)
initn: 1533 init1: 1533 opt: 1533 Z-score: 1686.0 bits: 319.1 E(85289): 3.6e-87
Smith-Waterman score: 1533; 100.0% identity (100.0% similar) in 222 aa overlap (1-222:1-222)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_653 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_653 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_653 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
130 140 150 160 170 180
190 200 210 220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
::::::::::::::::::::::::::::::::::::::::::
NP_653 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
190 200 210 220
>>XP_005262829 (OMIM: 612535) PREDICTED: THAP domain-con (181 aa)
initn: 1240 init1: 1240 opt: 1240 Z-score: 1367.4 bits: 259.9 E(85289): 2e-69
Smith-Waterman score: 1240; 100.0% identity (100.0% similar) in 181 aa overlap (42-222:1-181)
20 30 40 50 60 70
pF1KE0 SRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDVLCSRHFKKTDF
::::::::::::::::::::::::::::::
XP_005 MKRLDVNAAGIWEPKKGDVLCSRHFKKTDF
10 20 30
80 90 100 110 120 130
pF1KE0 DRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNHHLVGASSCIEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 DRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNHHLVGASSCIEE
40 50 60 70 80 90
140 150 160 170 180 190
pF1KE0 FQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQKSLRKTIRELK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 FQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQKSLRKTIRELK
100 110 120 130 140 150
200 210 220
pF1KE0 DECLISQETANRLDTFCWDCCQESIEQDYIS
:::::::::::::::::::::::::::::::
XP_005 DECLISQETANRLDTFCWDCCQESIEQDYIS
160 170 180
>>XP_005262831 (OMIM: 612535) PREDICTED: THAP domain-con (160 aa)
initn: 999 init1: 978 opt: 982 Z-score: 1086.4 bits: 207.7 E(85289): 8.9e-54
Smith-Waterman score: 982; 88.7% identity (94.3% similar) in 159 aa overlap (1-159:1-159)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
::::::::::::::::::.: ... .. : . .. :
XP_005 HLVGASSCIEEFQSQFIFKHRKRKQEQEEEQKPRREKCIS
130 140 150 160
190 200 210 220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
>>XP_016863290 (OMIM: 612535) PREDICTED: THAP domain-con (147 aa)
initn: 991 init1: 970 opt: 970 Z-score: 1073.9 bits: 205.3 E(85289): 4.4e-53
Smith-Waterman score: 970; 97.9% identity (99.3% similar) in 141 aa overlap (1-141:1-141)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
:::::::::::::::::: ..
XP_016 HLVGASSCIEEFQSQFIFTYTSARSLL
130 140
>>XP_006714172 (OMIM: 612535) PREDICTED: THAP domain-con (172 aa)
initn: 967 init1: 967 opt: 967 Z-score: 1069.6 bits: 204.7 E(85289): 7.8e-53
Smith-Waterman score: 967; 100.0% identity (100.0% similar) in 138 aa overlap (1-138:1-138)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
::::::::::::::::::
XP_006 HLVGASSCIEEFQSQFIFISMFKRKCLFKAKNYFSVTIIANIYKVPIFIQST
130 140 150 160 170
>>NP_001304720 (OMIM: 612535) THAP domain-containing pro (180 aa)
initn: 1231 init1: 679 opt: 679 Z-score: 754.7 bits: 146.5 E(85289): 2.7e-35
Smith-Waterman score: 1151; 81.1% identity (81.1% similar) in 222 aa overlap (1-222:1-180)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
::::::::::::::::::::::::::::::::::::
NP_001 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQ------------------------
70 80 90
130 140 150 160 170 180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
::::::::::::::::::::::::::::::::::::::::::
NP_001 ------------------EHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
100 110 120 130
190 200 210 220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
::::::::::::::::::::::::::::::::::::::::::
NP_001 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
140 150 160 170 180
>>XP_016863289 (OMIM: 612535) PREDICTED: THAP domain-con (180 aa)
initn: 1231 init1: 679 opt: 679 Z-score: 754.7 bits: 146.5 E(85289): 2.7e-35
Smith-Waterman score: 1151; 81.1% identity (81.1% similar) in 222 aa overlap (1-222:1-180)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNH
::::::::::::::::::::::::::::::::::::
XP_016 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQ------------------------
70 80 90
130 140 150 160 170 180
pF1KE0 HLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
::::::::::::::::::::::::::::::::::::::::::
XP_016 ------------------EHSYSVMDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQ
100 110 120 130
190 200 210 220
pF1KE0 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
::::::::::::::::::::::::::::::::::::::::::
XP_016 KSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQDYIS
140 150 160 170 180
>>NP_078948 (OMIM: 612537) DNA transposase THAP9 isoform (903 aa)
initn: 284 init1: 284 opt: 310 Z-score: 340.7 bits: 72.2 E(85289): 3.1e-12
Smith-Waterman score: 333; 32.3% identity (59.9% similar) in 217 aa overlap (1-210:1-198)
10 20 30 40 50 60
pF1KE0 MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDV
:.. :::.::..: :. .::.:: :::: . ::. :..:.: . :: : : .
NP_078 MTRSCSAVGCSTRDTVLSRERGLSFHQFPTDTIQRSKWIRAVNRVDPRSKKIWIPGPGAI
10 20 30 40 50 60
70 80 90 100 110
pF1KE0 LCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHL-QGKREKLHCRKNFTLKTVPATNYN
:::.::...::. . ::: :..::. : :.. :: . : . :... . .: ..
NP_078 LCSKHFQESDFESYGIRRKLKKGAVPSV--SLYKIPQGVHLKGKARQKILKQPLPDNS--
70 80 90 100 110
120 130 140 150 160 170
pF1KE0 HHLVGASSCIEEFQSQFIFEHSYSVMDSPKKL-KHKLDHVIGELEDTKESLRNVLD-REK
.: .. .:.:: . .: . .:: .: :. .:. : .: . :
NP_078 ----------QEVATE---DHNYS-LKTPLTIGAEKLAEVQQMLQVSKKRLISVKNYRMI
120 130 140 150 160
180 190 200 210 220
pF1KE0 RFQKSLRKTIRELKDECLISQETA----NRLDTFCWDCCQESIEQDYIS
. .:.:: : : .: :.:.:: ... : :.
NP_078 KKRKGLR-LIDALVEEKLLSEETECLLRAQFSDFKWELYNWRETDEYSAEMKQFACTLYL
170 180 190 200 210 220
NP_078 CSSKVYDYVRKILKLPHSSILRTWLSKCQPSPGFNSNIFSFLQRRVENGDQLYQYCSLLI
230 240 250 260 270 280
222 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 06:53:20 2016 done: Thu Nov 3 06:53:21 2016
Total Scan time: 6.300 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]