FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3107, 277 aa 1>>>pF1KE3107 277 - 277 aa - 277 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2633+/-0.000862; mu= 15.1535+/- 0.052 mean_var=62.8509+/-12.566, 0's: 0 Z-trim(105.7): 15 B-trim: 0 in 0/49 Lambda= 0.161778 statistics sampled from 8552 (8562) to 8552 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.649), E-opt: 0.2 (0.263), width: 16 Scan time: 2.030 The best scores are: opt bits E(32554) CCDS1931.1 DGUOK gene_id:1716|Hs108|chr2 ( 277) 1842 438.5 2.6e-123 CCDS1932.1 DGUOK gene_id:1716|Hs108|chr2 ( 189) 977 236.5 1.1e-62 CCDS3548.1 DCK gene_id:1633|Hs108|chr4 ( 260) 846 206.0 2.3e-53 CCDS61955.1 TK2 gene_id:7084|Hs108|chr16 ( 168) 289 75.9 2.2e-14 CCDS54018.1 TK2 gene_id:7084|Hs108|chr16 ( 234) 289 76.0 2.9e-14 CCDS54016.1 TK2 gene_id:7084|Hs108|chr16 ( 240) 289 76.0 2.9e-14 CCDS54017.1 TK2 gene_id:7084|Hs108|chr16 ( 247) 289 76.0 3e-14 CCDS10805.2 TK2 gene_id:7084|Hs108|chr16 ( 265) 289 76.0 3.2e-14 >>CCDS1931.1 DGUOK gene_id:1716|Hs108|chr2 (277 aa) initn: 1842 init1: 1842 opt: 1842 Z-score: 2327.4 bits: 438.5 E(32554): 2.6e-123 Smith-Waterman score: 1842; 100.0% identity (100.0% similar) in 277 aa overlap (1-277:1-277) 10 20 30 40 50 60 pF1KE3 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 TYPEWHVATEPVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 TYPEWHVATEPVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 VQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 VQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 ASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 ASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFE 190 200 210 220 230 240 250 260 270 pF1KE3 ALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL ::::::::::::::::::::::::::::::::::::: CCDS19 ALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL 250 260 270 >>CCDS1932.1 DGUOK gene_id:1716|Hs108|chr2 (189 aa) initn: 974 init1: 974 opt: 977 Z-score: 1238.9 bits: 236.5 E(32554): 1.1e-62 Smith-Waterman score: 1047; 68.2% identity (68.2% similar) in 277 aa overlap (1-277:1-189) 10 20 30 40 50 60 pF1KE3 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 TYPEWHVATEPVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 TYPEWHVATEPVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 VQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEF :::::::::::::::::::::::::::: CCDS19 VQLEPFPEKLLQARKPVQIFERSVYSDR-------------------------------- 130 140 190 200 210 220 230 240 pF1KE3 ASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFE :::: CCDS19 --------------------------------------------------------LHFE 150 250 260 270 pF1KE3 ALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL ::::::::::::::::::::::::::::::::::::: CCDS19 ALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL 160 170 180 >>CCDS3548.1 DCK gene_id:1633|Hs108|chr4 (260 aa) initn: 835 init1: 671 opt: 846 Z-score: 1071.5 bits: 206.0 E(32554): 2.3e-53 Smith-Waterman score: 846; 47.5% identity (77.9% similar) in 263 aa overlap (18-277:1-260) 10 20 30 40 50 60 pF1KE3 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK :: : .. : . : ...:::::::.::::::..: . CCDS35 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQ 10 20 30 40 70 80 90 100 110 pF1KE3 TYPEWHVATEPVATWQNIQAAGTQ---KACTAQSLGNLLDMMYREPARWSYTFQTFSFLS .:.:. :::: : :.:.. . . . .. ::.:.:::..: :::.::::.. :: CCDS35 LCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLS 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE3 RLKVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLL :...:: . :: .:.::: .:::::::::::::.::.:. ... :: ::::::... CCDS35 RIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMN 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 WEFASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKL .:.. . : :.:::::.:..::.:.: :.:.::.:: : :::.:: .::.::.:.: : CCDS35 NQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKT 170 180 190 200 210 220 240 250 260 270 pF1KE3 HFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL .:. :...:.:.::::.::.. : :.:...:. :...: CCDS35 NFDYLQEVPILTLDVNEDFKD---KYESLVEKVKEFLSTL 230 240 250 260 >>CCDS61955.1 TK2 gene_id:7084|Hs108|chr16 (168 aa) initn: 340 init1: 164 opt: 289 Z-score: 371.8 bits: 75.9 E(32554): 2.2e-14 Smith-Waterman score: 328; 35.5% identity (69.0% similar) in 155 aa overlap (99-253:1-141) 70 80 90 100 110 120 pF1KE3 TEPVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLKVQLEPFPE ::.. .::. :.::. :. : . .: CCDS61 MYHDASRWGLTLQTYVQLTMLDRHTRP--- 10 20 130 140 150 160 170 180 pF1KE3 KLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLHG :. . :...:::..: ::::..::...:.. .... . ..: ...: .. . : CCDS61 ---QV-SSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDL-- 30 40 50 60 70 80 190 200 210 220 230 240 pF1KE3 FIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPVL ..::...:..: .:: .: ::::: : : ::: .: :: :::. . : : ::: CCDS61 IVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL---FP--MAAPVL 90 100 110 120 130 250 260 270 pF1KE3 VLDVNDDFSEEVTKQEDLMREVNTFVKNL :.... CCDS61 VIEADHHMERMLELFEQNRDRILTPENRKHCP 140 150 160 >>CCDS54018.1 TK2 gene_id:7084|Hs108|chr16 (234 aa) initn: 425 init1: 164 opt: 289 Z-score: 369.6 bits: 76.0 E(32554): 2.9e-14 Smith-Waterman score: 404; 33.8% identity (66.7% similar) in 213 aa overlap (41-253:22-207) 20 30 40 50 60 70 pF1KE3 LRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTKTYPEWHVATE . .::::: ::.: ....... . .: :: CCDS54 MGAFCQRPSSDKEQEKEKKSVICVEGNIASGKTTCLEFFSNA-TDVEVLTE 10 20 30 40 50 80 90 100 110 120 130 pF1KE3 PVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLKVQLEPFPEKL ::. :.:... : : .::.. .::. :.::. :. : . .: CCDS54 PVSKWRNVRGH------------NPLGLMYHDASRWGLTLQTYVQLTMLDRHTRP----- 60 70 80 90 140 150 160 170 180 190 pF1KE3 LQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLHGFI :. . :...:::..: ::::..::...:.. .... . ..: ...: .. . : .. CCDS54 -QVSS-VRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDL--IV 100 110 120 130 140 200 210 220 230 240 250 pF1KE3 YLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPVLVL ::...:..: .:: .: ::::: : : ::: .: :: :::. . : : ::::. CCDS54 YLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL---FP--MAAPVLVI 150 160 170 180 190 200 260 270 pF1KE3 DVNDDFSEEVTKQEDLMREVNTFVKNL ... CCDS54 EADHHMERMLELFEQNRDRILTPENRKHCP 210 220 230 >>CCDS54016.1 TK2 gene_id:7084|Hs108|chr16 (240 aa) initn: 399 init1: 164 opt: 289 Z-score: 369.5 bits: 76.0 E(32554): 2.9e-14 Smith-Waterman score: 360; 33.2% identity (61.5% similar) in 226 aa overlap (29-253:21-213) 10 20 30 40 50 pF1KE3 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHA-GRGPRRLSIEGNIAVGKSTFVKLLT ::: : : ::::.. . : . . CCDS54 MLLWPLRGWAARALRCFGPGSRGSPASGPGPRRVQRR---AWPPDKEQEKEK 10 20 30 40 60 70 80 90 100 110 pF1KE3 KTYPEWHVATEPVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRL :. : ::::. :.:... : : .::.. .::. :.::. :. : CCDS54 KSV----VLTEPVSKWRNVRGH------------NPLGLMYHDASRWGLTLQTYVQLTML 50 60 70 80 90 120 130 140 150 160 170 pF1KE3 KVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWE . .: :. . :...:::..: ::::..::...:.. .... . ..: ...: . CCDS54 DRHTRP------QV-SSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRN 100 110 120 130 140 180 190 200 210 220 230 pF1KE3 FASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHF . . : ..::...:..: .:: .: ::::: : : ::: .: :: :::. . : CCDS54 MDVSVDL--IVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL---F 150 160 170 180 190 200 240 250 260 270 pF1KE3 EALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL : ::::.... CCDS54 P--MAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP 210 220 230 240 >>CCDS54017.1 TK2 gene_id:7084|Hs108|chr16 (247 aa) initn: 402 init1: 164 opt: 289 Z-score: 369.3 bits: 76.0 E(32554): 3e-14 Smith-Waterman score: 334; 31.0% identity (62.3% similar) in 239 aa overlap (22-253:5-220) 10 20 30 40 50 pF1KE3 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHA-GRGPRRLSIEGNIAVGKSTFVKLLT ::.: .. :.:. : : : :. : : . .. CCDS54 MLLWPLRGWAA-RALRCFGPGSR-----GSPASGPGPR-RVQR 10 20 30 60 70 80 90 100 110 pF1KE3 KTYP---EWHVATEPVATWQNIQAAGTQKACTAQSLGNLLD---MMYREPARWSYTFQTF ...: : . . : .. :.: . .: . ..: : .::.. .::. :.::. CCDS54 RAWPPDKEQEKEKKSVICVEGNIASG-KTTCL-EFFSNATDVEGLMYHDASRWGLTLQTY 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE3 SFLSRLKVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWH :. : . .: :. . :...:::..: ::::..::...:.. .... . ..: CCDS54 VQLTMLDRHTRP------QVSS-VRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWF 100 110 120 130 140 180 190 200 210 220 230 pF1KE3 SFLLWEFASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHK ...: .. . : ..::...:..: .:: .: ::::: : : ::: .: :: :::. CCDS54 DWILRNMDVSVDL--IVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKG 150 160 170 180 190 200 240 250 260 270 pF1KE3 TTKLHFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL . : : ::::.... CCDS54 SL---FP--MAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP 210 220 230 240 >>CCDS10805.2 TK2 gene_id:7084|Hs108|chr16 (265 aa) initn: 422 init1: 164 opt: 289 Z-score: 368.8 bits: 76.0 E(32554): 3.2e-14 Smith-Waterman score: 404; 33.8% identity (66.7% similar) in 213 aa overlap (41-253:53-238) 20 30 40 50 60 70 pF1KE3 LRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTKTYPEWHVATE . .::::: ::.: ....... . .: :: CCDS10 GSPASGPGPRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNA-TDVEVLTE 30 40 50 60 70 80 80 90 100 110 120 130 pF1KE3 PVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLKVQLEPFPEKL ::. :.:... : : .::.. .::. :.::. :. : . .: CCDS10 PVSKWRNVRGH------------NPLGLMYHDASRWGLTLQTYVQLTMLDRHTRP----- 90 100 110 120 140 150 160 170 180 190 pF1KE3 LQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLHGFI :. . :...:::..: ::::..::...:.. .... . ..: ...: .. . : .. CCDS10 -QVSS-VRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDL--IV 130 140 150 160 170 180 200 210 220 230 240 250 pF1KE3 YLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPVLVL ::...:..: .:: .: ::::: : : ::: .: :: :::. . : : ::::. CCDS10 YLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL---FP--MAAPVLVI 190 200 210 220 230 260 270 pF1KE3 DVNDDFSEEVTKQEDLMREVNTFVKNL ... CCDS10 EADHHMERMLELFEQNRDRILTPENRKHCP 240 250 260 277 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 16:19:37 2016 done: Mon Nov 7 16:19:37 2016 Total Scan time: 2.030 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]