FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3605, 458 aa
  1>>>pF1KE3605 458 - 458 aa - 458 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.5599+/-0.000883; mu= 15.9481+/- 0.053
 mean_var=63.7803+/-12.723, 0's: 0 Z-trim(105.2): 19 B-trim: 0 in 0/50
 Lambda= 0.160595
 statistics sampled from 8306 (8312) to 8306 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.641), E-opt: 0.2 (0.255), width: 16
 Scan time: 3.120

The best scores are:                                opt  bits E(32554)
CCDS42034.1 GALK2 gene_id:2585|Hs108|chr15  ( 458) 3023 709.3 2.1e-204
CCDS32236.1 GALK2 gene_id:2585|Hs108|chr15  ( 447) 2914 684.0 8.1e-197
CCDS73724.1 GALK2 gene_id:2585|Hs108|chr15  ( 434) 2873 674.5 5.7e-194
CCDS11728.1 GALK1 gene_id:2584|Hs108|chr17  ( 392)  382  97.4 2.8e-20

>>CCDS42034.1 GALK2 gene_id:2585|Hs108|chr15 (458 aa)
 initn: 3023 init1: 3023 opt: 3023 Z-score: 3783.3 bits: 709.3 E(32554): 2.1e-204
Smith-Waterman score: 3023; 99.8% identity (99.8% similar) in 458 aa overlap (1-458:1-458)

               10        20        30        40        50        60
pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL
       :::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::
CCDS42 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
              370       380       390       400       410       420

              430       440       450
pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
       ::::::::::::::::::::::::::::::::::::::
CCDS42 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
              430       440       450

>>CCDS32236.1 GALK2 gene_id:2585|Hs108|chr15 (447 aa)
 initn: 2914 init1: 2914 opt: 2914 Z-score: 3647.0 bits: 684.0 E(32554): 8.1e-197
Smith-Waterman score: 2914; 99.3% identity (99.5% similar) in 443 aa overlap (16-458:5-447)

               10        20        30        40        50        60
pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
                      . :::::::::::::::::::::::::::::::::::::::::::
CCDS32            MPVLYDRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
                          10        20        30        40

               70        80        90       100       110       120
pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
      50        60        70        80        90       100

              130       140       150       160       170       180
pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
     110       120       130       140       150       160

              190       200       210       220       230       240
pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
     170       180       190       200       210       220

              250       260       270       280       290       300
pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
     230       240       250       260       270       280

              310       320       330       340       350       360
pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL
       :::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::
CCDS32 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL
     290       300       310       320       330       340

              370       380       390       400       410       420
pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
     350       360       370       380       390       400

              430       440       450
pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
       ::::::::::::::::::::::::::::::::::::::
CCDS32 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
     410       420       430       440

>>CCDS73724.1 GALK2 gene_id:2585|Hs108|chr15 (434 aa)
 initn: 2873 init1: 2873 opt: 2873 Z-score: 3595.8 bits: 674.5 E(32554): 5.7e-194
Smith-Waterman score: 2873; 99.8% identity (99.8% similar) in 434 aa overlap (25-458:1-434)

               10        20        30        40        50        60
pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
                               ::::::::::::::::::::::::::::::::::::
CCDS73                         MFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
                                       10        20        30

               70        80        90       100       110       120
pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
         40        50        60        70        80        90

              130       140       150       160       170       180
pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
        100       110       120       130       140       150

              190       200       210       220       230       240
pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
        160       170       180       190       200       210

              250       260       270       280       290       300
pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
        220       230       240       250       260       270

              310       320       330       340       350       360
pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL
       :::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::
CCDS73 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL
        280       290       300       310       320       330

              370       380       390       400       410       420
pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
        340       350       360       370       380       390

              430       440       450
pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
       ::::::::::::::::::::::::::::::::::::::
CCDS73 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
        400       410       420       430

>>CCDS11728.1 GALK1 gene_id:2584|Hs108|chr17 (392 aa)
 initn: 478 init1: 164 opt: 382 Z-score: 477.4 bits: 97.4 E(32554): 2.8e-20
Smith-Waterman score: 527; 31.0% identity (55.2% similar) in 458 aa overlap (7-455:3-390)

       10 20 30 40 50 60
pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
       : :. :::: : . .. : .::. :.. : ::::::.:::: :: :::
CCDS11 MAALRQPQVAEL--LAEARRAFREEFGAEPELAVSAPGRVNLIGEHTDYNQGLVLP
       10 20 30 40 50

       70 80 90 100 110
pF1KE3 MAVEQDVLIAVEPVKTYALQLANTN-----PLYPDFSTSANNIQIDKTKPLWHNYFLCGL
       ::.: .... : : ..: .:. : .: . . ... : : :: .
CCDS11 MALELMTVLVGSPRKDGLVSLLTTSEGADEPQRLQFPLPTAQRSLEPGTPRWANY----V
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KE3 KGIQEHFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEIC
       ::. ... . : :.. .: ...: ..:::::..: . . . . . :..:
CCDS11 KGVIQYYPAAPLPGFSAVVVSSVPLGGGLSSSASLEVATYTFLQQLCPDSGTIAARAQVC
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KE3 AKSER-YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKL--PSGAVFVIANSCVEM
       ..:. . : : ::: ::.....: : ::. :... : : :. ::. :.:: :.
CCDS11 QQAEHSFAGMPCGIMDQFISLMGQKGHALLIDCRSLETSLVPLSDPKLAVL-ITNSNVR-
       180 190 200 210 220

       240 250 260 270 280 290
pF1KE3 NKAATSHFNIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHP
       .. :.:.. .: .:. .:. :.: . :.::: :::.
CCDS11 HSLASSEYPVRRRQCEEVARALGKES---------LREVQ------LEEL----------
       230 240 250 260

       300 310 320 330 340 350
pF1KE3 EPYNPEEICRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEA
       : : : .: : .: :.:: .: :. : ...
CCDS11 ------EAARDL-VSKEGFRR------------------ARHVVGEIRRTAQAAAALRRG
       270 280 290

       360 370 380 390 400 410
pF1KE3 PENMVQLLGELMNQSHMSCRDMYECSCPELDQLVDICRKF-GAQGSRLTGAGWGGCTVSM
       . .:.:: .:: : :: :: :::::::::. :. :::.::.:.:::::..
CCDS11 D---YRAFGRLMVESHRSLRDDYEVSCPELDQLVEAALAVPGVYGSRMTGGGFGGCTVTL
       300 310 320 330 340 350

       420 430 440 450
pF1KE3 VPADKLPSFLANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
       . :. : . .... : :. : ... .. . :: ::
CCDS11 LEASAAPHAMRHIQEHY-----GGTA----TFYLSQAADGAKVLCL
       360 370 380 390

458 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 20:25:52 2016 done: Sun Nov 6 20:25:52 2016
 Total Scan time: 3.120 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]