FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1723, 234 aa
  1>>>pF1KE1723 234 - 234 aa - 234 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.0292+/-0.000302; mu= 16.2871+/- 0.019
 mean_var=65.1472+/-13.090, 0's: 0 Z-trim(117.0): 10  B-trim: 213 in 2/48
 Lambda= 0.158901
 statistics sampled from 28701 (28707) to 28701 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.714), E-opt: 0.2 (0.337), width:  16
 Scan time:  6.060

The best scores are:                                       opt  bits E(85289)
NP_003249 (OMIM: 188300) thymidine kinase, cytosol ( 234) 1573 368.7 4.7e-102
XP_016880481 (OMIM: 188300) PREDICTED: thymidine k ( 309) 1128 266.8    3e-71

>>NP_003249 (OMIM: 188300) thymidine kinase, cytosolic [  (234 aa)
 initn: 1573 init1: 1573 opt: 1573  Z-score: 1953.2  bits: 368.7 E(85289): 4.7e-102
Smith-Waterman score: 1573; 100.0% identity (100.0% similar) in 234 aa overlap (1-234:1-234)

               10        20        30        40        50        60
pF1KE1 MSCINLPTVLPGSPSKTRGQIQVILGPMFSGKSTELMRRVRRFQIAQYKCLVIKYAKDTR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MSCINLPTVLPGSPSKTRGQIQVILGPMFSGKSTELMRRVRRFQIAQYKCLVIKYAKDTR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 YSSSFCTHDRNTMEALPACLLRDVAQEALGVAVIGIDEGQFFPDIVEFCEAMANAGKTVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 YSSSFCTHDRNTMEALPACLLRDVAQEALGVAVIGIDEGQFFPDIVEFCEAMANAGKTVI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 VAALDGTFQRKPFGAILNLVPLAESVVKLTAVCMECFREAAYTKRLGTEKEVEVIGGADK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 VAALDGTFQRKPFGAILNLVPLAESVVKLTAVCMECFREAAYTKRLGTEKEVEVIGGADK
              130       140       150       160       170       180

              190       200       210       220       230
pF1KE1 YHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAARKLFAPQQILQCSPAN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 YHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAARKLFAPQQILQCSPAN
              190       200       210       220       230

>>XP_016880481 (OMIM: 188300) PREDICTED: thymidine kinas  (309 aa)
 initn: 1128 init1: 1128 opt: 1128  Z-score: 1400.1  bits: 266.8 E(85289): 3e-71
Smith-Waterman score: 1490; 87.6% identity (87.6% similar) in 266 aa overlap (1-233:43-308)

                                             10        20        30
pF1KE1                               MSCINLPTVLPGSPSKTRGQIQVILGPMFS
                                     ::::::::::::::::::::::::::::::
XP_016 RGLKRSAREPGAYCGTALESTRVRELPGGAMSCINLPTVLPGSPSKTRGQIQVILGPMFS
             20        30        40        50        60        70

               40        50        60        70        80        90
pF1KE1 GKSTELMRRVRRFQIAQYKCLVIKYAKDTRYSSSFCTHDRNTMEALPACLLRDVAQEALG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GKSTELMRRVRRFQIAQYKCLVIKYAKDTRYSSSFCTHDRNTMEALPACLLRDVAQEALG
             80        90       100       110       120       130

              100       110       120       130       140       150
pF1KE1 VAVIGIDEGQFFPDIVEFCEAMANAGKTVIVAALDGTFQRKPFGAILNLVPLAESVVKLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 VAVIGIDEGQFFPDIVEFCEAMANAGKTVIVAALDGTFQRKPFGAILNLVPLAESVVKLT
            140       150       160       170       180       190

              160       170
pF1KE1 AVCMECFREAAYTKRLGTEKEV---------------------------------EVIGG
       ::::::::::::::::::::::                                 :::::
XP_016 AVCMECFREAAYTKRLGTEKEVAPPAFPAGRRGGGMALPPSCPGPSPIPCPCGQVEVIGG
            200       210       220       230       240       250

       180       190       200       210       220       230
pF1KE1 ADKYHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAARKLFAPQQILQCSPAN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 ADKYHSVCRLCYFKKASGQPAGPDNKENCPVPGKPGEAVAARKLFAPQQILQCSPAN
            260       270       280       290       300


234 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov  6 09:47:23 2016 done: Sun Nov  6 09:47:24 2016
 Total Scan time:  6.060 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]