FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3057, 212 aa 1>>>pF1KE3057 212 - 212 aa - 212 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4579+/-0.000372; mu= 12.2852+/- 0.023 mean_var=61.7221+/-12.614, 0's: 0 Z-trim(112.5): 12 B-trim: 916 in 1/50 Lambda= 0.163250 statistics sampled from 21498 (21507) to 21498 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.646), E-opt: 0.2 (0.252), width: 16 Scan time: 5.920 The best scores are: opt bits E(85289) NP_036277 (OMIM: 188345) thymidylate kinase isofor ( 212) 1390 335.9 3e-92 NP_001307834 (OMIM: 188345) thymidylate kinase iso ( 251) 881 216.0 4.3e-56 NP_001307833 (OMIM: 188345) thymidylate kinase iso ( 193) 698 172.9 3.2e-43 NP_001307831 (OMIM: 188345) thymidylate kinase iso ( 169) 696 172.4 3.9e-43 NP_001158503 (OMIM: 188345) thymidylate kinase iso ( 188) 695 172.2 5.1e-43 NP_001307832 (OMIM: 188345) thymidylate kinase iso ( 113) 509 128.3 5e-30 >>NP_036277 (OMIM: 188345) thymidylate kinase isoform 1 (212 aa) initn: 1390 init1: 1390 opt: 1390 Z-score: 1777.2 bits: 335.9 E(85289): 3e-92 Smith-Waterman score: 1390; 100.0% identity (100.0% similar) in 212 aa overlap (1-212:1-212) 10 20 30 40 50 60 pF1KE3 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_036 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SDVEDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_036 SDVEDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 DVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_036 DVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDA 130 140 150 160 170 180 190 200 210 pF1KE3 SKSIEAVHEDIRVLSEDAIRTATEKPLGELWK :::::::::::::::::::::::::::::::: NP_036 SKSIEAVHEDIRVLSEDAIRTATEKPLGELWK 190 200 210 >>NP_001307834 (OMIM: 188345) thymidylate kinase isoform (251 aa) initn: 1376 init1: 881 opt: 881 Z-score: 1128.1 bits: 216.0 E(85289): 4.3e-56 Smith-Waterman score: 1258; 84.0% identity (84.0% similar) in 244 aa overlap (8-212:8-251) 10 20 30 40 50 60 pF1KE3 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK ::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK 10 20 30 40 50 60 70 80 pF1KE3 SDVEDHSVHLLFSANRWEQV---------------------------------------P :::::::::::::::::::: : NP_001 SDVEDHSVHLLFSANRWEQVFILVAQTGVQWGDLGSLQPTPRRFKRFSCLSLSSSCDHRP 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE3 LIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQPDVGLPKPDLVLFLQLQLADAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQPDVGLPKPDLVLFLQLQLADAA 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE3 KRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDASKSIEAVHEDIRVLSEDAIRT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDASKSIEAVHEDIRVLSEDAIRT 190 200 210 220 230 240 210 pF1KE3 ATEKPLGELWK ::::::::::: NP_001 ATEKPLGELWK 250 >>NP_001307833 (OMIM: 188345) thymidylate kinase isoform (193 aa) initn: 1193 init1: 698 opt: 698 Z-score: 897.0 bits: 172.9 E(85289): 3.2e-43 Smith-Waterman score: 1164; 87.7% identity (88.7% similar) in 212 aa overlap (1-212:1-193) 10 20 30 40 50 60 pF1KE3 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SDVEDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQP :::::::::::::::::::: .. :: :: :::::::::: NP_001 SDVEDHSVHLLFSANRWEQV-----RFPLHSTLNVD--------------NFSLDWCKQP 70 80 90 100 130 140 150 160 170 180 pF1KE3 DVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDA 110 120 130 140 150 160 190 200 210 pF1KE3 SKSIEAVHEDIRVLSEDAIRTATEKPLGELWK :::::::::::::::::::::::::::::::: NP_001 SKSIEAVHEDIRVLSEDAIRTATEKPLGELWK 170 180 190 >>NP_001307831 (OMIM: 188345) thymidylate kinase isoform (169 aa) initn: 1101 init1: 696 opt: 696 Z-score: 895.4 bits: 172.4 E(85289): 3.9e-43 Smith-Waterman score: 1019; 79.7% identity (79.7% similar) in 212 aa overlap (1-212:1-169) 10 20 30 40 50 60 pF1KE3 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SDVEDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQP ::::::: :::::::::: NP_001 SDVEDHS-------------------------------------------NFSLDWCKQP 70 130 140 150 160 170 180 pF1KE3 DVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDA 80 90 100 110 120 130 190 200 210 pF1KE3 SKSIEAVHEDIRVLSEDAIRTATEKPLGELWK :::::::::::::::::::::::::::::::: NP_001 SKSIEAVHEDIRVLSEDAIRTATEKPLGELWK 140 150 160 >>NP_001158503 (OMIM: 188345) thymidylate kinase isoform (188 aa) initn: 711 init1: 694 opt: 695 Z-score: 893.4 bits: 172.2 E(85289): 5.1e-43 Smith-Waterman score: 1153; 88.7% identity (88.7% similar) in 212 aa overlap (1-212:1-188) 10 20 30 40 50 60 pF1KE3 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SDVEDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQP :::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SDVEDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGVAFTGAKE---------- 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 DVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDA :::::::::::::::::::::::::::::::::::::::::::::: NP_001 --------------LQLADAAKRGAFGHERYENGAFQERALRCFHQLMKDTTLNWKMVDA 120 130 140 150 190 200 210 pF1KE3 SKSIEAVHEDIRVLSEDAIRTATEKPLGELWK :::::::::::::::::::::::::::::::: NP_001 SKSIEAVHEDIRVLSEDAIRTATEKPLGELWK 160 170 180 >>NP_001307832 (OMIM: 188345) thymidylate kinase isoform (113 aa) initn: 509 init1: 509 opt: 509 Z-score: 660.1 bits: 128.3 E(85289): 5e-30 Smith-Waterman score: 509; 100.0% identity (100.0% similar) in 80 aa overlap (1-80:1-80) 10 20 30 40 50 60 pF1KE3 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SDVEDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQP :::::::::::::::::::: NP_001 SDVEDHSVHLLFSANRWEQVYSWRMLPSGERLAMSAMRTGLSRSGRSGVSTSS 70 80 90 100 110 212 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 06:05:25 2016 done: Sun Nov 6 06:05:26 2016 Total Scan time: 5.920 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]