FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2524, 457 aa 1>>>pF1KE2524 457 - 457 aa - 457 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.1948+/-0.000408; mu= -7.8084+/- 0.026 mean_var=376.6241+/-76.482, 0's: 0 Z-trim(123.2): 14 B-trim: 0 in 0/57 Lambda= 0.066088 statistics sampled from 42529 (42543) to 42529 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.783), E-opt: 0.2 (0.499), width: 16 Scan time: 11.220 The best scores are: opt bits E(85289) NP_001268 (OMIM: 118491) choline kinase alpha isof ( 457) 3124 311.6 2.9e-84 NP_997634 (OMIM: 118491) choline kinase alpha isof ( 439) 1912 196.0 1.7e-49 XP_016872637 (OMIM: 118491) PREDICTED: choline kin ( 281) 1878 192.6 1.2e-48 XP_016872636 (OMIM: 118491) PREDICTED: choline kin ( 419) 1666 172.5 1.9e-42 NP_005189 (OMIM: 602541,612395) choline/ethanolami ( 395) 1227 130.6 7.2e-30 XP_016875069 (OMIM: 609858) PREDICTED: ethanolamin ( 425) 295 41.8 0.0043 >>NP_001268 (OMIM: 118491) choline kinase alpha isoform (457 aa) initn: 3124 init1: 3124 opt: 3124 Z-score: 1633.9 bits: 311.6 E(85289): 2.9e-84 Smith-Waterman score: 3124; 100.0% identity (100.0% similar) in 457 aa overlap (1-457:1-457) 10 20 30 40 50 60 pF1KE2 MKTKFCTGGEAEPSPLGLLLSCGSGSAAPAPGVGQQRDAASDLESKQLGGQQPPLALPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MKTKFCTGGEAEPSPLGLLLSCGSGSAAPAPGVGQQRDAASDLESKQLGGQQPPLALPPP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 PPLPLPLPLPQPPPPQPPADEQPEPRTRRRAYLWCKEFLPGAWRGLREDEFHISVIRGGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PPLPLPLPLPQPPPPQPPADEQPEPRTRRRAYLWCKEFLPGAWRGLREDEFHISVIRGGL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 SNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKEGSEQAQKENEFQGAEAMVLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKEGSEQAQKENEFQGAEAMVLE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 SVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELSLPDISAEIAEKMATFHGMKM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELSLPDISAEIAEKMATFHGMKM 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 PFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSLLESTPSPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSLLESTPSPV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 VFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 FFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSIIKEEMLLEVNRFALASHFLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSIIKEEMLLEVNRFALASHFLW 370 380 390 400 410 420 430 440 450 pF1KE2 GLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV ::::::::::::::::::::::::::::::::::::: NP_001 GLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV 430 440 450 >>NP_997634 (OMIM: 118491) choline kinase alpha isoform (439 aa) initn: 2988 init1: 1902 opt: 1912 Z-score: 1009.6 bits: 196.0 E(85289): 1.7e-49 Smith-Waterman score: 2956; 96.1% identity (96.1% similar) in 457 aa overlap (1-457:1-439) 10 20 30 40 50 60 pF1KE2 MKTKFCTGGEAEPSPLGLLLSCGSGSAAPAPGVGQQRDAASDLESKQLGGQQPPLALPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_997 MKTKFCTGGEAEPSPLGLLLSCGSGSAAPAPGVGQQRDAASDLESKQLGGQQPPLALPPP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 PPLPLPLPLPQPPPPQPPADEQPEPRTRRRAYLWCKEFLPGAWRGLREDEFHISVIRGGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_997 PPLPLPLPLPQPPPPQPPADEQPEPRTRRRAYLWCKEFLPGAWRGLREDEFHISVIRGGL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 SNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKEGSEQAQKENEFQGAEAMVLE :::::::::::::::::::::::::::::::::: :::::::: NP_997 SNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQM------------------GAEAMVLE 130 140 150 160 190 200 210 220 230 240 pF1KE2 SVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELSLPDISAEIAEKMATFHGMKM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_997 SVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELSLPDISAEIAEKMATFHGMKM 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE2 PFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSLLESTPSPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_997 PFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSLLESTPSPV 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE2 VFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_997 VFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYP 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE2 FFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSIIKEEMLLEVNRFALASHFLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_997 FFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSIIKEEMLLEVNRFALASHFLW 350 360 370 380 390 400 430 440 450 pF1KE2 GLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV ::::::::::::::::::::::::::::::::::::: NP_997 GLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV 410 420 430 >>XP_016872637 (OMIM: 118491) PREDICTED: choline kinase (281 aa) initn: 1878 init1: 1878 opt: 1878 Z-score: 994.6 bits: 192.6 E(85289): 1.2e-48 Smith-Waterman score: 1878; 100.0% identity (100.0% similar) in 281 aa overlap (177-457:1-281) 150 160 170 180 190 200 pF1KE2 LYGAILQMRSCNKEGSEQAQKENEFQGAEAMVLESVMFAILAERSLGPKLYGIFPQGRLE :::::::::::::::::::::::::::::: XP_016 MVLESVMFAILAERSLGPKLYGIFPQGRLE 10 20 30 210 220 230 240 250 260 pF1KE2 QFIPSRRLDTEELSLPDISAEIAEKMATFHGMKMPFNKEPKWLFGTMEKYLKEVLRIKFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 QFIPSRRLDTEELSLPDISAEIAEKMATFHGMKMPFNKEPKWLFGTMEKYLKEVLRIKFT 40 50 60 70 80 90 270 280 290 300 310 320 pF1KE2 EESRIKKLHKLLSYNLPLELENLRSLLESTPSPVVFCHNDCQEGNILLLEGRENSEKQKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 EESRIKKLHKLLSYNLPLELENLRSLLESTPSPVVFCHNDCQEGNILLLEGRENSEKQKL 100 110 120 130 140 150 330 340 350 360 370 380 pF1KE2 MLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYPFFRANIRKYPTKKQQLHFISSYLPAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 MLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYPFFRANIRKYPTKKQQLHFISSYLPAF 160 170 180 190 200 210 390 400 410 420 430 440 pF1KE2 QNDFENLSTEEKSIIKEEMLLEVNRFALASHFLWGLWSIVQAKISSIEFGYMDYAQARFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 QNDFENLSTEEKSIIKEEMLLEVNRFALASHFLWGLWSIVQAKISSIEFGYMDYAQARFD 220 230 240 250 260 270 450 pF1KE2 AYFHQKRKLGV ::::::::::: XP_016 AYFHQKRKLGV 280 >>XP_016872636 (OMIM: 118491) PREDICTED: choline kinase (419 aa) initn: 1663 init1: 1663 opt: 1666 Z-score: 883.1 bits: 172.5 E(85289): 1.9e-42 Smith-Waterman score: 2792; 91.7% identity (91.7% similar) in 457 aa overlap (1-457:1-419) 10 20 30 40 50 60 pF1KE2 MKTKFCTGGEAEPSPLGLLLSCGSGSAAPAPGVGQQRDAASDLESKQLGGQQPPLALPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 MKTKFCTGGEAEPSPLGLLLSCGSGSAAPAPGVGQQRDAASDLESKQLGGQQPPLALPPP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 PPLPLPLPLPQPPPPQPPADEQPEPRTRRRAYLWCKEFLPGAWRGLREDEFHISVIRGGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PPLPLPLPLPQPPPPQPPADEQPEPRTRRRAYLWCKEFLPGAWRGLREDEFHISVIRGGL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 SNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKEGSEQAQKENEFQGAEAMVLE :::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 SNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKEGSEQAQKENEFQ-------- 130 140 150 160 170 190 200 210 220 230 240 pF1KE2 SVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELSLPDISAEIAEKMATFHGMKM :::::::::::::::::::::::::::::: XP_016 ------------------------------SRRLDTEELSLPDISAEIAEKMATFHGMKM 180 190 200 250 260 270 280 290 300 pF1KE2 PFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSLLESTPSPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSLLESTPSPV 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE2 VFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 VFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMYDYSYEKYP 270 280 290 300 310 320 370 380 390 400 410 420 pF1KE2 FFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSIIKEEMLLEVNRFALASHFLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 FFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSIIKEEMLLEVNRFALASHFLW 330 340 350 360 370 380 430 440 450 pF1KE2 GLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV ::::::::::::::::::::::::::::::::::::: XP_016 GLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV 390 400 410 >>NP_005189 (OMIM: 602541,612395) choline/ethanolamine k (395 aa) initn: 1047 init1: 456 opt: 1227 Z-score: 657.2 bits: 130.6 E(85289): 7.2e-30 Smith-Waterman score: 1475; 57.9% identity (77.7% similar) in 385 aa overlap (74-455:29-388) 50 60 70 80 90 100 pF1KE2 ESKQLGGQQPPLALPPPPPLPLPLPLPQPPPPQPPADEQPEPRTR---RRAYLWCKEFLP : : .. .: :::: ::.:.: NP_005 MAAEATAVAGSGAVGGCLAKDGLQQSKCPDTTPKRRRASSLSRDAERRAYQWCREYLG 10 20 30 40 50 110 120 130 140 150 160 pF1KE2 GAWRGLREDEFHISVIRGGLSNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQMRSCNKE :::: .. .:... . :::::.::.::::: ..:.:::.::::::::::: NP_005 GAWRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVLLRLYGAILQ------- 60 70 80 90 100 110 170 180 190 200 210 220 pF1KE2 GSEQAQKENEFQGAEAMVLESVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELS :....:::::::::::::::::.:::.::.:::::.:::: : :.:: NP_005 ------------GVDSLVLESVMFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELR 120 130 140 150 230 240 250 260 270 280 pF1KE2 LPDISAEIAEKMATFHGMKMPFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSY : .:: :: ::: ::::.:::.:::.:::::::.:::.. . : . ... : : NP_005 EPVLSAAIATKMAQFHGMEMPFTKEPHWLFGTMERYLKQIQDLPPT---GLPEMNLLEMY 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE2 NLPLELENLRSLLESTPSPVVFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRG .: :. :::.::::::::::::::: :::::::: ::... :::.::::::::::: NP_005 SLKDEMGNLRKLLESTPSPVVFCHNDIQEGNILLLSEPENADS--LMLVDFEYSSYNYRG 220 230 240 250 260 270 350 360 370 380 390 400 pF1KE2 FDIGNHFCEWMYDYSYEKYPFFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSI ::::::::::.:::..:..::..: :::..:::::: :: .. :.:: ::. NP_005 FDIGNHFCEWVYDYTHEEWPFYKARPTDYPTQEQQLHFIRHYLAEAKKG-ETLSQEEQRK 280 290 300 310 320 330 410 420 430 440 450 pF1KE2 IKEEMLLEVNRFALASHFLWGLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLGV ..:..:.::.:.::::::.::::::.::..:.:::::.::::.::. ::.:: .: NP_005 LEEDLLVEVSRYALASHFFWGLWSILQASMSTIEFGYLDYAQSRFQFYFQQKGQLTSVHS 340 350 360 370 380 390 NP_005 SS >>XP_016875069 (OMIM: 609858) PREDICTED: ethanolamine ki (425 aa) initn: 322 init1: 111 opt: 295 Z-score: 176.5 bits: 41.8 E(85289): 0.0043 Smith-Waterman score: 358; 26.9% identity (55.7% similar) in 368 aa overlap (65-425:96-415) 40 50 60 70 80 90 pF1KE2 QQRDAASDLESKQLGGQQPPLALPPPPPLPLPLPLPQPPPPQPPADEQPEPRTRRRAYLW .: :. : . ...: : : :. : XP_016 LSAPAVLVVAVAVVVVVVSAVAWAMANYIHVPPGSPEVPKLNVTVQDQEEHRCREGALSL 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE2 CKEFLPGAWRGLREDEFHISVIRGGLSNMLFQCSLPDTTATLGDEPRKVLLRLYGAILQM ... : : .: .... :..: :. : . .: . ::.:.:: .. XP_016 LQHLRPH-W---DPQEVTLQLFTDGITNKLIGCYVGNTMEDV------VLVRIYGNKTEL 130 140 150 160 170 160 170 180 190 200 210 pF1KE2 RSCNKEGSEQAQKENEFQGAEAMVLESVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRL .....: .. : .: .. .:.:: : .: .:: .. : XP_016 ---------LVDRDEEVKS----------FRVLQAHGCAPQLYCTFNNGLCYEFIQGEAL 180 190 200 210 220 230 240 250 260 270 pF1KE2 DTEELSLPDISAEIAEKMATFHGMKMPFNKEPK---WLFGTMEKYLKEVLRIKFTEESRI : ... : : ::...: .:... . :: :: : ::.. .. :..:. XP_016 DPKHVCNPAIFRLIARQLAKIHAIHAHNGWIPKSNLWL--KMGKYFS-LIPTGFADEDIN 220 230 240 250 260 270 280 290 300 310 320 pF1KE2 KKLHKLL--SYNLPLELENLRSLLESTPSPVVFCHNDCQEGNILLLEGRENSEKQKLMLI :.. . . : : :. .. .: . ::::.:::: ::. : .. ...: XP_016 KRFLSDIPSSQILQEEMTWMKEILSNLGSPVVLCHNDLLCKNIIY-----NEKQGDVQFI 280 290 300 310 320 330 340 350 360 370 380 pF1KE2 DFEYSSYNYRGFDIGNHFCEWMYDYSYEKYPFFRANIRKYPTKKQQLHFISSYLPAFQND :.:::.::: ..:::::: :. : : . :: .. : ... .:: :.. . XP_016 DYEYSGYNYLAYDIGNHFNEFA-GVSDVDYSL-------YPDRELQSQWLRAYLEAYK-E 330 340 350 360 370 390 400 410 420 430 440 pF1KE2 FENLSTE--EKSIIKEEMLLEVNRFALASHFLWGLWSIVQAKISSIEFGYMDYAQARFDA :....:: :: . : ....::.:::. . ..: .. XP_016 FKGFGTEVTEKEV--EILFIQVNQFALTLQSAFSLTAVKRKSGKAYQS 380 390 400 410 420 450 pF1KE2 YFHQKRKLGV 457 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 03:26:38 2016 done: Tue Nov 8 03:26:39 2016 Total Scan time: 11.220 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]