FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3090, 260 aa
  1>>>pF1KE3090 260 - 260 aa - 260 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.7261+/-0.00037; mu= 18.2662+/- 0.023
 mean_var=66.3702+/-13.359, 0's: 0 Z-trim(113.4): 23 B-trim: 28 in 1/49
 Lambda= 0.157430
 statistics sampled from 22641 (22663) to 22641 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.644), E-opt: 0.2 (0.266), width: 16
 Scan time: 6.420

The best scores are:                                       opt  bits E(85289)
NP_000779 (OMIM: 125450) deoxycytidine kinase [Hom ( 260) 1761 408.6 5.6e-114
NP_550438 (OMIM: 251880,601465,617068,617070) deox ( 277)  846 200.8 2.1e-51
NP_001305789 (OMIM: 251880,601465,617068,617070) d ( 180)  669 160.5 2e-39
NP_001305790 (OMIM: 251880,601465,617068,617070) d ( 180)  669 160.5 2e-39
XP_011530949 (OMIM: 251880,601465,617068,617070) P ( 271)  633 152.5 7.7e-37
NP_001305792 (OMIM: 251880,601465,617068,617070) d ( 174)  609 146.8 2.4e-35
NP_001305791 (OMIM: 251880,601465,617068,617070) d ( 174)  609 146.8 2.4e-35
NP_550440 (OMIM: 251880,601465,617068,617070) deox ( 189)  390  97.1 2.4e-20
NP_001305788 (OMIM: 251880,601465,617068,617070) d ( 183)  349  87.8 1.5e-17
NP_001258979 (OMIM: 188250,609560,617069) thymidin ( 168)  321  81.4 1.2e-15
NP_001258863 (OMIM: 188250,609560,617069) thymidin ( 216)  321  81.5 1.4e-15
NP_001166114 (OMIM: 188250,609560,617069) thymidin ( 234)  321  81.5 1.5e-15
NP_001166115 (OMIM: 188250,609560,617069) thymidin ( 240)  321  81.5 1.5e-15
NP_001166116 (OMIM: 188250,609560,617069) thymidin ( 247)  321  81.6 1.5e-15
NP_004605 (OMIM: 188250,609560,617069) thymidine k ( 265)  321  81.6 1.6e-15
NP_001258864 (OMIM: 188250,609560,617069) thymidin ( 146)  140  40.3 0.0025

>>NP_000779 (OMIM: 125450) deoxycytidine kinase [Homo sa (260 aa)
 initn: 1761 init1: 1761 opt: 1761
Z-score: 2167.2 bits: 408.6 E(85289): 5.6e-114 Smith-Waterman score: 1761; 100.0% identity (100.0% similar) in 260 aa overlap (1-260:1-260) 10 20 30 40 50 60 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQLCEDWEVVPEPVARWCN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQLCEDWEVVPEPVARWCN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLNGKLKDAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 VQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLNGKLKDAE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 KPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELDGIIYLQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 KPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELDGIIYLQA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 TPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPILTLDVNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 TPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPILTLDVNE 190 200 210 220 230 240 250 260 pF1KE3 DFKDKYESLVEKVKEFLSTL :::::::::::::::::::: NP_000 DFKDKYESLVEKVKEFLSTL 250 260 >>NP_550438 (OMIM: 251880,601465,617068,617070) deoxygua (277 aa) initn: 835 init1: 671 opt: 846 Z-score: 1043.7 bits: 200.8 E(85289): 2.1e-51 Smith-Waterman score: 846; 47.5% identity (77.9% similar) in 263 aa overlap (1-260:18-277) 10 20 30 40 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQ :: : .. : . : ...:::::::.::::::..: . NP_550 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 LCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLS .:.:. :::: : :.:.. . . . .. ::.:.:::..: :::.::::.. :: NP_550 TYPEWHVATEPVATWQNIQAAGT---QKACTAQSLGNLLDMMYREPARWSYTFQTFSFLS 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 RIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMN :...:: . :: .:.::: .:::::::::::::.::.:. ... :: ::::::... 
NP_550 RLKVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLL 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 NQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKT .:.. . : :.:::::.:..::.:.: :.:.::.:: : :::.:: .::.::.:.: : NP_550 WEFASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKL 180 190 200 210 220 230 230 240 250 260 pF1KE3 NFDYLQEVPILTLDVNEDFKD---KYESLVEKVKEFLSTL .:. :...:.:.::::.::.. : :.:...:. :...: NP_550 HFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL 240 250 260 270 >>NP_001305789 (OMIM: 251880,601465,617068,617070) deoxy (180 aa) initn: 650 init1: 650 opt: 669 Z-score: 829.0 bits: 160.5 E(85289): 2e-39 Smith-Waterman score: 669; 52.2% identity (83.9% similar) in 180 aa overlap (84-260:1-180) 60 70 80 90 100 110 pF1KE3 PVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLN :::..: :::.::::.. :::...:: . NP_001 MMYREPARWSYTFQTFSFLSRLKVQLEPFP 10 20 30 120 130 140 150 160 170 pF1KE3 GKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELD :: .:.::: .:::::::::::::.::.:. ... :: ::::::... .:.. . : NP_001 EKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLH 40 50 60 70 80 90 180 190 200 210 220 230 pF1KE3 GIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPI :.:::::.:..::.:.: :.:.::.:: : :::.:: .::.::.:.: : .:. :...:. NP_001 GFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPV 100 110 120 130 140 150 240 250 260 pF1KE3 LTLDVNEDFKD---KYESLVEKVKEFLSTL :.::::.::.. : :.:...:. :...: NP_001 LVLDVNDDFSEEVTKQEDLMREVNTFVKNL 160 170 180 >>NP_001305790 (OMIM: 251880,601465,617068,617070) deoxy (180 aa) initn: 650 init1: 650 opt: 669 Z-score: 829.0 bits: 160.5 E(85289): 2e-39 Smith-Waterman score: 669; 52.2% identity (83.9% similar) in 180 aa overlap (84-260:1-180) 60 70 80 90 100 110 pF1KE3 PVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLN :::..: :::.::::.. :::...:: . 
NP_001 MMYREPARWSYTFQTFSFLSRLKVQLEPFP 10 20 30 120 130 140 150 160 170 pF1KE3 GKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELD :: .:.::: .:::::::::::::.::.:. ... :: ::::::... .:.. . : NP_001 EKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLH 40 50 60 70 80 90 180 190 200 210 220 230 pF1KE3 GIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPI :.:::::.:..::.:.: :.:.::.:: : :::.:: .::.::.:.: : .:. :...:. NP_001 GFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPV 100 110 120 130 140 150 240 250 260 pF1KE3 LTLDVNEDFKD---KYESLVEKVKEFLSTL :.::::.::.. : :.:...:. :...: NP_001 LVLDVNDDFSEEVTKQEDLMREVNTFVKNL 160 170 180 >>XP_011530949 (OMIM: 251880,601465,617068,617070) PREDI (271 aa) initn: 814 init1: 449 opt: 633 Z-score: 782.4 bits: 152.5 E(85289): 7.7e-37 Smith-Waterman score: 786; 45.2% identity (75.7% similar) in 263 aa overlap (1-260:18-271) 10 20 30 40 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQ :: : .. : . : ...:::::::.::::::..: . XP_011 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 LCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLS .:.:. :::: : :.:.. . . . .. ::.:.:::..: :::.::::.. :: XP_011 TYPEWHVATEPVATWQNIQAAG---TQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLS 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 RIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMN :...:: . :: .:.::: .::: ::::.::.:. ... :: ::::::... XP_011 RLKVQLEPFPEKLLQARKPVQIFER------YIFAKNLFENGSLSDIEWHIYQDWHSFLL 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 NQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKT .:.. . : :.:::::.:..::.:.: :.:.::.:: : :::.:: .::.::.:.: : XP_011 WEFASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKL 180 190 200 210 220 230 230 240 250 260 pF1KE3 NFDYLQEVPILTLDVNEDFKD---KYESLVEKVKEFLSTL .:. :...:.:.::::.::.. : :.:...:. 
:...: XP_011 HFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL 240 250 260 270 >>NP_001305792 (OMIM: 251880,601465,617068,617070) deoxy (174 aa) initn: 629 init1: 449 opt: 609 Z-score: 755.5 bits: 146.8 E(85289): 2.4e-35 Smith-Waterman score: 609; 48.9% identity (80.6% similar) in 180 aa overlap (84-260:1-174) 60 70 80 90 100 110 pF1KE3 PVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLN :::..: :::.::::.. :::...:: . NP_001 MMYREPARWSYTFQTFSFLSRLKVQLEPFP 10 20 30 120 130 140 150 160 170 pF1KE3 GKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELD :: .:.::: .::: ::::.::.:. ... :: ::::::... .:.. . : NP_001 EKLLQARKPVQIFER------YIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLH 40 50 60 70 80 180 190 200 210 220 230 pF1KE3 GIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPI :.:::::.:..::.:.: :.:.::.:: : :::.:: .::.::.:.: : .:. :...:. NP_001 GFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPV 90 100 110 120 130 140 240 250 260 pF1KE3 LTLDVNEDFKD---KYESLVEKVKEFLSTL :.::::.::.. : :.:...:. :...: NP_001 LVLDVNDDFSEEVTKQEDLMREVNTFVKNL 150 160 170 >>NP_001305791 (OMIM: 251880,601465,617068,617070) deoxy (174 aa) initn: 629 init1: 449 opt: 609 Z-score: 755.5 bits: 146.8 E(85289): 2.4e-35 Smith-Waterman score: 609; 48.9% identity (80.6% similar) in 180 aa overlap (84-260:1-174) 60 70 80 90 100 110 pF1KE3 PVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLN :::..: :::.::::.. :::...:: . NP_001 MMYREPARWSYTFQTFSFLSRLKVQLEPFP 10 20 30 120 130 140 150 160 170 pF1KE3 GKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELD :: .:.::: .::: ::::.::.:. ... :: ::::::... .:.. . : NP_001 EKLLQARKPVQIFER------YIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLH 40 50 60 70 80 180 190 200 210 220 230 pF1KE3 GIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPI :.:::::.:..::.:.: :.:.::.:: : :::.:: .::.::.:.: : .:. :...:. NP_001 GFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPV 90 100 110 120 130 140 240 250 260 pF1KE3 LTLDVNEDFKD---KYESLVEKVKEFLSTL :.::::.::.. 
: :.:...:. :...: NP_001 LVLDVNDDFSEEVTKQEDLMREVNTFVKNL 150 160 170 >>NP_550440 (OMIM: 251880,601465,617068,617070) deoxygua (189 aa) initn: 427 init1: 231 opt: 390 Z-score: 486.2 bits: 97.1 E(85289): 2.4e-20 Smith-Waterman score: 390; 46.0% identity (71.5% similar) in 137 aa overlap (1-137:18-151) 10 20 30 40 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQ :: : .. : . : ...:::::::.::::::..: . NP_550 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 LCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLS .:.:. :::: : :.:.. . . . .. ::.:.:::..: :::.::::.. :: NP_550 TYPEWHVATEPVATWQNIQAAG---TQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLS 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 RIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMN :...:: . :: .:.::: .::::::::: : NP_550 RLKVQLEPFPEKLLQARKPVQIFERSVYSDRLHFEALMNIPVLVLDVNDDFSEEVTKQED 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 NQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKT NP_550 LMREVNTFVKNL 180 >>NP_001305788 (OMIM: 251880,601465,617068,617070) deoxy (183 aa) initn: 389 init1: 193 opt: 349 Z-score: 436.1 bits: 87.8 E(85289): 1.5e-17 Smith-Waterman score: 349; 43.8% identity (71.1% similar) in 128 aa overlap (1-128:18-142) 10 20 30 40 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQ :: : .. : . : ...:::::::.::::::..: . NP_001 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 LCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLS .:.:. :::: : :.:.. . . . .. ::.:.:::..: :::.::::.. :: NP_001 TYPEWHVATEPVATWQNIQAAG---TQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLS 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 RIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMN :...:: . 
:: .:.::: .::: NP_001 RLKVQLEPFPEKLLQARKPVQIFERLHFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVN 120 130 140 150 160 170 >>NP_001258979 (OMIM: 188250,609560,617069) thymidine ki (168 aa) initn: 351 init1: 169 opt: 321 Z-score: 402.2 bits: 81.4 E(85289): 1.2e-15 Smith-Waterman score: 358; 35.4% identity (68.0% similar) in 175 aa overlap (85-259:1-160) 60 70 80 90 100 110 pF1KE3 VARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLNG ::. ::..:.:::. ::. :. NP_001 MYHDASRWGLTLQTYV-------QLTMLDR 10 20 120 130 140 150 160 170 pF1KE3 KLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELDG . . . : ..:::..: ::::. :::.: : :..... ..: ::. .. :..: NP_001 HTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDL-- 30 40 50 60 70 80 180 190 200 210 220 230 pF1KE3 IIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPIL :.::...:::: .:. : :.::. ::::::: .:. :: ::.. .: . . .:.: NP_001 IVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL-----FPMAAPVL 90 100 110 120 130 240 250 260 pF1KE3 TLDVNEDFKDKYESLVEKVKEFLSTL ...... .. : : :. .. . : NP_001 VIEADHHMERMLE-LFEQNRDRILTPENRKHCP 140 150 160

260 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 05:14:03 2016 done: Sun Nov 6 05:14:04 2016
 Total Scan time: 6.420 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]