FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1723, 234 aa
1>>>pF1KE1723 234 - 234 aa - 234 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.0292+/-0.000302; mu= 16.2871+/- 0.019
mean_var=65.1472+/-13.090, 0's: 0 Z-trim(117.0): 10 B-trim: 213 in 2/48
Lambda= 0.158901
statistics sampled from 28701 (28707) to 28701 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.714), E-opt: 0.2 (0.337), width: 16
Scan time: 6.060
The best scores are: opt bits E(85289)
NP_003249 (OMIM: 188300) thymidine kinase, cytosol ( 234) 1573 368.7 4.7e-102
XP_016880481 (OMIM: 188300) PREDICTED: thymidine k ( 309) 1128 266.8 3e-71
>>NP_003249 (OMIM: 188300) thymidine kinase, cytosolic [ (234 aa)
initn: 1573 init1: 1573 opt: 1573 Z-score: 1953.2 bits: 368.7 E(85289): 4.7e-102
Smith-Waterman score: 1573; 100.0% identity (100.0% similar) in 234 aa overlap (1-234:1-234)
10 20 30 40 50 60
pF1KE1 MSCINLPTVLPGSPSKTRGQIQVILGPMFSGKSTELMRRVRRFQIAQYKCLVIKYAKDTR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MSCINLPTVLPGSPSKTRGQIQVILGPMFSGKSTELMRRVRRFQIAQYKCLVIKYAKDTR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 YSSSFCTHDRNTMEALPACLLRDVAQEALGVAVIGIDEGQFFPDIVEFCEAMANAGKTVI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 YSSSFCTHDRNTMEALPACLLRDVAQEALGVAVIGIDEGQFFPDIVEFCEAMANAGKTVI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 VAALDGTFQRKPFGAILNLVPLAESVVKLTAVCMECFREAAYTKRLGTEKEVEVIGGADK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 VAALDGTFQRKPFGAILNLVPLAESVVKLTAVCMECFREAAYTKRLGTEKEVEVIGGADK
130 140 150 160 170 180
190 200 210 220 230
pF1KE1 YHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAARKLFAPQQILQCSPAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 YHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAARKLFAPQQILQCSPAN
190 200 210 220 230
>>XP_016880481 (OMIM: 188300) PREDICTED: thymidine kinas (309 aa)
initn: 1128 init1: 1128 opt: 1128 Z-score: 1400.1 bits: 266.8 E(85289): 3e-71
Smith-Waterman score: 1490; 87.6% identity (87.6% similar) in 266 aa overlap (1-233:43-308)
10 20 30
pF1KE1 MSCINLPTVLPGSPSKTRGQIQVILGPMFS
::::::::::::::::::::::::::::::
XP_016 RGLKRSAREPGAYCGTALESTRVRELPGGAMSCINLPTVLPGSPSKTRGQIQVILGPMFS
20 30 40 50 60 70
40 50 60 70 80 90
pF1KE1 GKSTELMRRVRRFQIAQYKCLVIKYAKDTRYSSSFCTHDRNTMEALPACLLRDVAQEALG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GKSTELMRRVRRFQIAQYKCLVIKYAKDTRYSSSFCTHDRNTMEALPACLLRDVAQEALG
80 90 100 110 120 130
100 110 120 130 140 150
pF1KE1 VAVIGIDEGQFFPDIVEFCEAMANAGKTVIVAALDGTFQRKPFGAILNLVPLAESVVKLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 VAVIGIDEGQFFPDIVEFCEAMANAGKTVIVAALDGTFQRKPFGAILNLVPLAESVVKLT
140 150 160 170 180 190
160 170
pF1KE1 AVCMECFREAAYTKRLGTEKEV---------------------------------EVIGG
:::::::::::::::::::::: :::::
XP_016 AVCMECFREAAYTKRLGTEKEVAPPAFPAGRRGGGMALPPSCPGPSPIPCPCGQVEVIGG
200 210 220 230 240 250
180 190 200 210 220 230
pF1KE1 ADKYHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAARKLFAPQQILQCSPAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 ADKYHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAARKLFAPQQILQCSPAN
260 270 280 290 300
234 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 09:47:23 2016 done: Sun Nov 6 09:47:24 2016
Total Scan time: 6.060 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]