FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3090, 260 aa 1>>>pF1KE3090 260 - 260 aa - 260 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1999+/-0.000838; mu= 15.3488+/- 0.051 mean_var=65.4458+/-13.016, 0's: 0 Z-trim(106.4): 19 B-trim: 3 in 1/51 Lambda= 0.158538 statistics sampled from 8977 (8988) to 8977 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.655), E-opt: 0.2 (0.276), width: 16 Scan time: 2.130 The best scores are: opt bits E(32554) CCDS3548.1 DCK gene_id:1633|Hs108|chr4 ( 260) 1761 411.4 3.2e-115 CCDS1931.1 DGUOK gene_id:1716|Hs108|chr2 ( 277) 846 202.1 3.3e-52 CCDS1932.1 DGUOK gene_id:1716|Hs108|chr2 ( 189) 390 97.7 6.1e-21 CCDS61955.1 TK2 gene_id:7084|Hs108|chr16 ( 168) 321 81.9 3.1e-16 CCDS54018.1 TK2 gene_id:7084|Hs108|chr16 ( 234) 321 82.0 4.1e-16 CCDS54016.1 TK2 gene_id:7084|Hs108|chr16 ( 240) 321 82.0 4.2e-16 CCDS54017.1 TK2 gene_id:7084|Hs108|chr16 ( 247) 321 82.0 4.3e-16 CCDS10805.2 TK2 gene_id:7084|Hs108|chr16 ( 265) 321 82.0 4.5e-16 >>CCDS3548.1 DCK gene_id:1633|Hs108|chr4 (260 aa) initn: 1761 init1: 1761 opt: 1761 Z-score: 2182.1 bits: 411.4 E(32554): 3.2e-115 Smith-Waterman score: 1761; 100.0% identity (100.0% similar) in 260 aa overlap (1-260:1-260) 10 20 30 40 50 60 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQLCEDWEVVPEPVARWCN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQLCEDWEVVPEPVARWCN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLNGKLKDAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 VQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLNGKLKDAE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 KPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELDGIIYLQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 KPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELDGIIYLQA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 TPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPILTLDVNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 TPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPILTLDVNE 190 200 210 220 230 240 250 260 pF1KE3 DFKDKYESLVEKVKEFLSTL :::::::::::::::::::: CCDS35 DFKDKYESLVEKVKEFLSTL 250 260 >>CCDS1931.1 DGUOK gene_id:1716|Hs108|chr2 (277 aa) initn: 835 init1: 671 opt: 846 Z-score: 1050.7 bits: 202.1 E(32554): 3.3e-52 Smith-Waterman score: 846; 47.5% identity (77.9% similar) in 263 aa overlap (1-260:18-277) 10 20 30 40 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQ :: : .. : . : ...:::::::.::::::..: . CCDS19 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 LCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLS .:.:. :::: : :.:.. . . . .. ::.:.:::..: :::.::::.. :: CCDS19 TYPEWHVATEPVATWQNIQAAGT---QKACTAQSLGNLLDMMYREPARWSYTFQTFSFLS 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 RIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMN :...:: . :: .:.::: .:::::::::::::.::.:. ... :: ::::::... CCDS19 RLKVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHIYQDWHSFLL 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 NQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKT .:.. . : :.:::::.:..::.:.: :.:.::.:: : :::.:: .::.::.:.: : CCDS19 WEFASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEAWLIHKTTKL 180 190 200 210 220 230 230 240 250 260 pF1KE3 NFDYLQEVPILTLDVNEDFKD---KYESLVEKVKEFLSTL .:. :...:.:.::::.::.. : :.:...:. :...: CCDS19 HFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL 240 250 260 270 >>CCDS1932.1 DGUOK gene_id:1716|Hs108|chr2 (189 aa) initn: 427 init1: 231 opt: 390 Z-score: 489.5 bits: 97.7 E(32554): 6.1e-21 Smith-Waterman score: 390; 46.0% identity (71.5% similar) in 137 aa overlap (1-137:18-151) 10 20 30 40 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKSTFVNILKQ :: : .. : . : ...:::::::.::::::..: . CCDS19 MAAGRLFLSRLRAPFSSMAKSPLEGVSSSRGLHAGRGPRRLSIEGNIAVGKSTFVKLLTK 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 LCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLS .:.:. :::: : :.:.. . . . .. ::.:.:::..: :::.::::.. :: CCDS19 TYPEWHVATEPVATWQNIQAAG---TQKACTAQSLGNLLDMMYREPARWSYTFQTFSFLS 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 RIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMN :...:: . :: .:.::: .::::::::: : CCDS19 RLKVQLEPFPEKLLQARKPVQIFERSVYSDRLHFEALMNIPVLVLDVNDDFSEEVTKQED 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 NQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKT CCDS19 LMREVNTFVKNL 180 >>CCDS61955.1 TK2 gene_id:7084|Hs108|chr16 (168 aa) initn: 351 init1: 169 opt: 321 Z-score: 404.9 bits: 81.9 E(32554): 3.1e-16 Smith-Waterman score: 358; 35.4% identity (68.0% similar) in 175 aa overlap (85-259:1-160) 60 70 80 90 100 110 pF1KE3 VARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLNG ::. ::..:.:::. ::. :. CCDS61 MYHDASRWGLTLQTYV-------QLTMLDR 10 20 120 130 140 150 160 170 pF1KE3 KLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELDG . . . : ..:::..: ::::. :::.: : :..... ..: ::. .. :..: CCDS61 HTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDL-- 30 40 50 60 70 80 180 190 200 210 220 230 pF1KE3 IIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPIL :.::...:::: .:. : :.::. ::::::: .:. :: ::.. .: . . .:.: CCDS61 IVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL-----FPMAAPVL 90 100 110 120 130 240 250 260 pF1KE3 TLDVNEDFKDKYESLVEKVKEFLSTL ...... .. : : :. .. . : CCDS61 VIEADHHMERMLE-LFEQNRDRILTPENRKHCP 140 150 160 >>CCDS54018.1 TK2 gene_id:7084|Hs108|chr16 (234 aa) initn: 454 init1: 169 opt: 321 Z-score: 402.8 bits: 82.0 E(32554): 4.1e-16 Smith-Waterman score: 459; 34.8% identity (64.4% similar) in 253 aa overlap (9-259:5-226) 10 20 30 40 50 pF1KE3 MATPPKRSCPSFSASSEGTRIKK--ISIEGNIAAGKSTFVNILKQLCEDWEVVPEPVARW : :...: . :: : .:::::.::.: ...... : ::. :::..: CCDS54 MGAFCQRPSSDKEQEKEKKSVICVEGNIASGKTTCLEFFSN-ATDVEVLTEPVSKW 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 CNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIRAQLASLNGKLKD ::. : : : .::. ::..:.:::. ::. :. . . CCDS54 RNVR---------------GHNPLGLMYHDASRWGLTLQTYV-------QLTMLDRHTRP 60 70 80 90 120 130 140 150 160 170 pF1KE3 AEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQFGQSLELDGIIYL . : ..:::..: ::::. :::.: : :..... ..: ::. .. :..: :.:: CCDS54 QVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDL--IVYL 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE3 QATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPILTLDV ...:::: .:. : :.::. ::::::: .:. :: ::.. .: . . .:.:.... CCDS54 RTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL-----FPMAAPVLVIEA 160 170 180 190 200 240 250 260 pF1KE3 NEDFKDKYESLVEKVKEFLSTL .. .. : : :. .. . : CCDS54 DHHMERMLE-LFEQNRDRILTPENRKHCP 210 220 230 >>CCDS54016.1 TK2 gene_id:7084|Hs108|chr16 (240 aa) initn: 386 init1: 169 opt: 321 Z-score: 402.6 bits: 82.0 E(32554): 4.2e-16 Smith-Waterman score: 388; 34.3% identity (63.8% similar) in 210 aa overlap (50-259:53-232) 20 30 40 50 60 70 pF1KE3 RIKKISIEGNIAAGKSTFVNILKQLCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGG :. :::..: ::. : CCDS54 GSPASGPGPRRVQRRAWPPDKEQEKEKKSVVLTEPVSKWRNVR---------------GH 30 40 50 60 80 90 100 110 120 130 pF1KE3 NVLQMMYEKPERWSFTFQTYACLSRIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFAS : : .::. ::..:.:::. ::. :. . . . : ..:::..: ::::. CCDS54 NPLGLMYHDASRWGLTLQTYV-------QLTMLDRHTRPQVSSVRLMERSIHSARYIFVE 70 80 90 100 110 120 140 150 160 170 180 190 pF1KE3 NLYESECMNETEWTIYQDWHDWMNNQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQG :::.: : :..... ..: ::. .. :..: :.::...:::: .:. : :.::. CCDS54 NLYRSGKMPEVDYVVLSEWFDWILRNMDVSVDL--IVYLRTNPETCYQRLKKRCREEEKV 130 140 150 160 170 200 210 220 230 240 250 pF1KE3 IPLEYLEKLHYKHESWLLHRTLKTNFDYLQEVPILTLDVNEDFKDKYESLVEKVKEFLST ::::::: .:. :: ::.. .: . . .:.:...... .. : : :. .. . : CCDS54 IPLEYLEAIHHLHEEWLIKGSL-----FPMAAPVLVIEADHHMERMLE-LFEQNRDRILT 180 190 200 210 220 230 260 pF1KE3 L CCDS54 PENRKHCP 240 >>CCDS54017.1 TK2 gene_id:7084|Hs108|chr16 (247 aa) initn: 408 init1: 169 opt: 321 Z-score: 402.4 bits: 82.0 E(32554): 4.3e-16 Smith-Waterman score: 368; 35.0% identity (67.8% similar) in 183 aa overlap (77-259:72-239) 50 60 70 80 90 100 pF1KE3 DWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFTFQTYACLSRIR :. .: .::. ::..:.:::. CCDS54 DKEQEKEKKSVICVEGNIASGKTTCLEFFSNATDVEGLMYHDASRWGLTLQTYV------ 50 60 70 80 90 110 120 130 140 150 160 pF1KE3 AQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIYQDWHDWMNNQF ::. :. . . . : ..:::..: ::::. :::.: : :..... ..: ::. .. CCDS54 -QLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRNM 100 110 120 130 140 150 170 180 190 200 210 220 pF1KE3 GQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESWLLHRTLKTNFD :..: :.::...:::: .:. : :.::. ::::::: .:. :: ::.. .: CCDS54 DVSVDL--IVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSL----- 160 170 180 190 200 230 240 250 260 pF1KE3 YLQEVPILTLDVNEDFKDKYESLVEKVKEFLSTL . . .:.:...... .. : : :. .. . : CCDS54 FPMAAPVLVIEADHHMERMLE-LFEQNRDRILTPENRKHCP 210 220 230 240 >>CCDS10805.2 TK2 gene_id:7084|Hs108|chr16 (265 aa) initn: 436 init1: 169 opt: 321 Z-score: 402.0 bits: 82.0 E(32554): 4.5e-16 Smith-Waterman score: 447; 33.9% identity (64.2% similar) in 254 aa overlap (6-259:36-257) 10 20 30 pF1KE3 MATPPKRSCPSFSASSEGTRIKKISIEGNIAAGKS .:. : . .: . . : .:::::.::. CCDS10 LRGWAARALRCFGPGSRGSPASGPGPRRVQRRAWPP-DKEQEKEKKSVICVEGNIASGKT 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE3 TFVNILKQLCEDWEVVPEPVARWCNVQSTQDEFEELTMSQKNGGNVLQMMYEKPERWSFT : ...... : ::. :::..: ::. : : : .::. ::..: CCDS10 TCLEFFSN-ATDVEVLTEPVSKWRNVR---------------GHNPLGLMYHDASRWGLT 70 80 90 100 100 110 120 130 140 150 pF1KE3 FQTYACLSRIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETEWTIY .:::. ::. :. . . . : ..:::..: ::::. :::.: : :..... CCDS10 LQTYV-------QLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVL 110 120 130 140 150 160 160 170 180 190 200 210 pF1KE3 QDWHDWMNNQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYKHESW ..: ::. .. :..: :.::...:::: .:. : :.::. ::::::: .:. :: : CCDS10 SEWFDWILRNMDVSVDL--IVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEW 170 180 190 200 210 220 230 240 250 260 pF1KE3 LLHRTLKTNFDYLQEVPILTLDVNEDFKDKYESLVEKVKEFLSTL :.. .: . . .:.:...... .. : : :. .. . : CCDS10 LIKGSL-----FPMAAPVLVIEADHHMERMLE-LFEQNRDRILTPENRKHCP 220 230 240 250 260 260 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 05:14:02 2016 done: Sun Nov 6 05:14:03 2016 Total Scan time: 2.130 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]