FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3121, 307 aa
  1>>>pF1KE3121 307 - 307 aa - 307 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.9484+/-0.000713; mu= 13.9800+/- 0.043
 mean_var=92.6245+/-18.550, 0's: 0 Z-trim(112.0): 11 B-trim: 283 in 1/51
 Lambda= 0.133264
 statistics sampled from 12811 (12820) to 12811 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.751), E-opt: 0.2 (0.394), width: 16
 Scan time: 2.420

The best scores are:                              opt  bits E(32554)
CCDS10805.2 TK2 gene_id:7084|Hs108|chr16  ( 265) 1840 363.2 1.2e-100
CCDS54018.1 TK2 gene_id:7084|Hs108|chr16  ( 234) 1521 301.8 3.3e-82
CCDS54016.1 TK2 gene_id:7084|Hs108|chr16  ( 240) 1290 257.4 7.8e-69
CCDS54017.1 TK2 gene_id:7084|Hs108|chr16  ( 247) 1162 232.8 2e-61
CCDS61955.1 TK2 gene_id:7084|Hs108|chr16  ( 168) 1149 230.2 8.5e-61
CCDS3548.1 DCK gene_id:1633|Hs108|chr4    ( 260)  321  71.1 1e-12
CCDS1931.1 DGUOK gene_id:1716|Hs108|chr2  ( 277)  289  65.0 7.5e-11

>>CCDS10805.2 TK2 gene_id:7084|Hs108|chr16 (265 aa)
 initn: 1840 init1: 1840 opt: 1840 Z-score: 1920.1 bits: 363.2 E(32554): 1.2e-100
Smith-Waterman score: 1840; 100.0% identity (100.0% similar) in 265 aa overlap (43-307:1-265)

             20        30        40        50        60        70
pF1KE3 SRRRPTAGLAVVRADSHKKEPRASGSARPAMLLWPLRGWAARALRCFGPGSRGSPASGPG
                                     ::::::::::::::::::::::::::::::
CCDS10                               MLLWPLRGWAARALRCFGPGSRGSPASGPG
                                             10        20        30

             80        90       100       110       120       130
pF1KE3 PRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 PRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVR
               40        50        60        70        80        90

            140       150       160       170       180       190
pF1KE3 GHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 GHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRS
              100       110       120       130       140       150

            200       210       220       230       240       250
pF1KE3 GKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 GKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLE
              160       170       180       190       200       210

            260       270       280       290       300
pF1KE3 AIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP
              220       230       240       250       260

>>CCDS54018.1 TK2 gene_id:7084|Hs108|chr16 (234 aa)
 initn: 1521 init1: 1521 opt: 1521 Z-score: 1589.4 bits: 301.8 E(32554): 3.3e-82
Smith-Waterman score: 1521; 100.0% identity (100.0% similar) in 224 aa overlap (84-307:11-234)

            60        70        80        90       100       110
pF1KE3 RALRCFGPGSRGSPASGPGPRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFS
                                     ::::::::::::::::::::::::::::::
CCDS54                     MGAFCQRPSSDKEQEKEKKSVICVEGNIASGKTTCLEFFS
                                   10        20        30        40

           120       130       140       150       160       170
pF1KE3 NATDVEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NATDVEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRL
               50        60        70        80        90       100

           180       190       200       210       220       230
pF1KE3 MERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQ
              110       120       130       140       150       160

           240       250       260       270       280       290
pF1KE3 RLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 RLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQN
              170       180       190       200       210       220

           300
pF1KE3 RDRILTPENRKHCP
       ::::::::::::::
CCDS54 RDRILTPENRKHCP
              230

>>CCDS54016.1 TK2 gene_id:7084|Hs108|chr16 (240 aa)
 initn: 1339 init1: 1290 opt: 1290 Z-score: 1349.2 bits: 257.4 E(32554): 7.8e-69
Smith-Waterman score: 1616; 90.6% identity (90.6% similar) in 265 aa overlap (43-307:1-240)

             20        30        40        50        60        70
pF1KE3 SRRRPTAGLAVVRADSHKKEPRASGSARPAMLLWPLRGWAARALRCFGPGSRGSPASGPG
                                     ::::::::::::::::::::::::::::::
CCDS54                               MLLWPLRGWAARALRCFGPGSRGSPASGPG
                                             10        20        30

             80        90       100       110       120       130
pF1KE3 PRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVR
       ::::::::::::::::::::::                         :::::::::::::
CCDS54 PRRVQRRAWPPDKEQEKEKKSV-------------------------VLTEPVSKWRNVR
               40        50                                 60

            140       150       160       170       180       190
pF1KE3 GHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRS
          70        80        90       100       110       120

            200       210       220       230       240       250
pF1KE3 GKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLE
         130       140       150       160       170       180

            260       270       280       290       300
pF1KE3 AIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 AIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP
         190       200       210       220       230       240

>>CCDS54017.1 TK2 gene_id:7084|Hs108|chr16 (247 aa)
 initn: 1162 init1: 1162 opt: 1162 Z-score: 1216.1 bits: 232.8 E(32554): 2e-61
Smith-Waterman score: 1666; 93.2% identity (93.2% similar) in 265 aa overlap (43-307:1-247)

             20        30        40        50        60        70
pF1KE3 SRRRPTAGLAVVRADSHKKEPRASGSARPAMLLWPLRGWAARALRCFGPGSRGSPASGPG
                                     ::::::::::::::::::::::::::::::
CCDS54                               MLLWPLRGWAARALRCFGPGSRGSPASGPG
                                             10        20        30

             80        90       100       110       120       130
pF1KE3 PRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVR
       :::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNATDVE-------------
               40        50        60        70

            140       150       160       170       180       190
pF1KE3 GHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRS
            :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 -----GLMYHDASRWGLTLQTYVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRS
             80        90       100       110       120       130

            200       210       220       230       240       250
pF1KE3 GKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLE
            140       150       160       170       180       190

            260       270       280       290       300
pF1KE3 AIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 AIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLELFEQNRDRILTPENRKHCP
            200       210       220       230       240

>>CCDS61955.1 TK2 gene_id:7084|Hs108|chr16 (168 aa)
 initn: 1149 init1: 1149 opt: 1149 Z-score: 1204.9 bits: 230.2 E(32554): 8.5e-61
Smith-Waterman score: 1149; 100.0% identity (100.0% similar) in 168 aa overlap (140-307:1-168)

     110       120       130       140       150       160
pF1KE3 EFFSNATDVEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQTYVQLTMLDRHTRPQVS
                                     ::::::::::::::::::::::::::::::
CCDS61                               MYHDASRWGLTLQTYVQLTMLDRHTRPQVS
                                             10        20        30

     170       180       190       200       210       220
pF1KE3 SVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 SVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDLIVYLRTNPE
               40        50        60        70        80        90

     230       240       250       260       270       280
pF1KE3 TCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 TCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAPVLVIEADHHMERMLEL
              100       110       120       130       140       150

     290       300
pF1KE3 FEQNRDRILTPENRKHCP
       ::::::::::::::::::
CCDS61 FEQNRDRILTPENRKHCP
              160

>>CCDS3548.1 DCK gene_id:1633|Hs108|chr4 (260 aa)
 initn: 436 init1: 169 opt: 321 Z-score: 341.9 bits: 71.1 E(32554): 1e-12
Smith-Waterman score: 447; 33.9% identity (64.2% similar) in 254 aa overlap (78-299:6-259)

50 60 70 80 90 100
pF1KE3 LRGWAARALRCFGPGSRGSPASGPGPRRVQRRAWPP-DKEQEKEKKSVICVEGNIASGKT
.:. : . .: . . : .:::::.::.
CCDS35 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKS
10 20 30

110 120 130 140 150
pF1KE3 TCLEFFSN-ATDVEVLTEPVSKWRNVR---------------GHNPLGLMYHDASRWGLT
: ...... : ::. :::..: ::. : : : .::. ::..:
CCDS35 TFVNILKQLCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFT
40 50 60 70 80 90

160 170 180 190 200
pF1KE3 LQTYV-------QLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVL
.:::. ::. :. . . . : ..:::..: ::::. :::.: : :.....
CCDS35 FQTYACLSRIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIY
100 110 120 130 140 150

210 220 230 240 250 260
pF1KE3 SEWFDWILRNMDVSVDL--IVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEW
..: ::. .. :..: :.::...:::: .:. : :.::. ::::::: .:. :: :
CCDS35 QDWHDWMNNQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESW
160 170 180 190 200 210

270 280 290 300
pF1KE3 LIKGSL-----FPMAAPVLVIEADHHMERMLE-LFEQNRDRILTPENRKHCP
:.. .: . . .:.:...... .. : : :. .. . :
CCDS35 LLHRTLKTNFDYLQEVPILTLDVNEDFKDKYESLVEKVKEFLSTL
220 230 240 250 260

>>CCDS1931.1 DGUOK gene_id:1716|Hs108|chr2 (277 aa)
 initn: 427 init1: 164 opt: 289 Z-score: 308.3 bits: 65.0 E(32554): 7.5e-11
Smith-Waterman score: 404; 33.8% identity (66.7% similar) in 213 aa overlap (95-280:41-253)

70 80 90 100 110 120
pF1KE3 GSPASGPGPRRVQRRAWPPDKEQEKEKKSVICVEGNIASGKTTCLEFFSNA-TDVEVLTE
. .::::: ::.: ....... . .: ::
CCDS19 LRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTKTYPEWHVATE
20 30 40 50 60 70

130 140 150 160
pF1KE3 PVSKWRNVRGH------------NPLGLMYHDASRWGLTLQTYVQLTMLDRHTRP-----
::. :.:... : : .::.. .::. :.::. :. : . .:
CCDS19 PVATWQNIQAAGTQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLSRLKVQLEPFPEKL
80 90 100 110 120 130

170 180 190 200 210 220
pF1KE3 -QVSS-VRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDL--IV
:. . :...:::..: ::::..::...:.. .... . ..: ...: .. . : ..
CCDS19 LQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLLWEFASRITLHGFI
140 150 160 170 180 190

230 240 250 260 270
pF1KE3 YLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL---FP--MAAPVLVI
::...:..: .:: .: ::::: : : ::: .: :: :::. . : : ::::.
CCDS19 YLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKLHFEALMNIPVLVL
200 210 220 230 240 250

280 290 300
pF1KE3 EADHHMERMLELFEQNRDRILTPENRKHCP
...
CCDS19 DVNDDFSEEVTKQEDLMREVNTFVKNL
260 270

307 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 03:27:55 2016 done: Sun Nov 6 03:27:56 2016
 Total Scan time: 2.420 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]