FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3605, 458 aa
1>>>pF1KE3605 458 - 458 aa - 458 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5599+/-0.000883; mu= 15.9481+/- 0.053
mean_var=63.7803+/-12.723, 0's: 0 Z-trim(105.2): 19 B-trim: 0 in 0/50
Lambda= 0.160595
statistics sampled from 8306 (8312) to 8306 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.641), E-opt: 0.2 (0.255), width: 16
Scan time: 3.120
The best scores are:                                opt  bits E(32554)
CCDS42034.1 GALK2 gene_id:2585|Hs108|chr15 ( 458)  3023 709.3 2.1e-204
CCDS32236.1 GALK2 gene_id:2585|Hs108|chr15 ( 447)  2914 684.0 8.1e-197
CCDS73724.1 GALK2 gene_id:2585|Hs108|chr15 ( 434)  2873 674.5 5.7e-194
CCDS11728.1 GALK1 gene_id:2584|Hs108|chr17 ( 392)   382  97.4 2.8e-20
>>CCDS42034.1 GALK2 gene_id:2585|Hs108|chr15 (458 aa)
initn: 3023 init1: 3023 opt: 3023 Z-score: 3783.3 bits: 709.3 E(32554): 2.1e-204
Smith-Waterman score: 3023; 99.8% identity (99.8% similar) in 458 aa overlap (1-458:1-458)
10 20 30 40 50 60
pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL
:::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::
CCDS42 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
370 380 390 400 410 420
430 440 450
pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
::::::::::::::::::::::::::::::::::::::
CCDS42 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
430 440 450
>>CCDS32236.1 GALK2 gene_id:2585|Hs108|chr15 (447 aa)
initn: 2914 init1: 2914 opt: 2914 Z-score: 3647.0 bits: 684.0 E(32554): 8.1e-197
Smith-Waterman score: 2914; 99.3% identity (99.5% similar) in 443 aa overlap (16-458:5-447)
10 20 30 40 50 60
pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
. :::::::::::::::::::::::::::::::::::::::::::
CCDS32 MPVLYDRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
10 20 30 40
70 80 90 100 110 120
pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
110 120 130 140 150 160
190 200 210 220 230 240
pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
170 180 190 200 210 220
250 260 270 280 290 300
pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
230 240 250 260 270 280
310 320 330 340 350 360
pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL
:::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::
CCDS32 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL
290 300 310 320 330 340
370 380 390 400 410 420
pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
350 360 370 380 390 400
430 440 450
pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
::::::::::::::::::::::::::::::::::::::
CCDS32 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
410 420 430 440
>>CCDS73724.1 GALK2 gene_id:2585|Hs108|chr15 (434 aa)
initn: 2873 init1: 2873 opt: 2873 Z-score: 3595.8 bits: 674.5 E(32554): 5.7e-194
Smith-Waterman score: 2873; 99.8% identity (99.8% similar) in 434 aa overlap (25-458:1-434)
10 20 30 40 50 60
pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
::::::::::::::::::::::::::::::::::::
CCDS73 MFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
10 20 30
70 80 90 100 110 120
pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE
40 50 60 70 80 90
130 140 150 160 170 180
pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER
100 110 120 130 140 150
190 200 210 220 230 240
pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF
160 170 180 190 200 210
250 260 270 280 290 300
pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI
220 230 240 250 260 270
310 320 330 340 350 360
pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL
:::::::::::::::::::::::::::::: :::::::::::::::::::::::::::::
CCDS73 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL
280 290 300 310 320 330
370 380 390 400 410 420
pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF
340 350 360 370 380 390
430 440 450
pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
::::::::::::::::::::::::::::::::::::::
CCDS73 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
400 410 420 430
>>CCDS11728.1 GALK1 gene_id:2584|Hs108|chr17 (392 aa)
initn: 478 init1: 164 opt: 382 Z-score: 477.4 bits: 97.4 E(32554): 2.8e-20
Smith-Waterman score: 527; 31.0% identity (55.2% similar) in 458 aa overlap (7-455:3-390)
10 20 30 40 50 60
pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP
: :. :::: : . .. : .::. :.. : ::::::.:::: :: :::
CCDS11 MAALRQPQVAEL--LAEARRAFREEFGAEPELAVSAPGRVNLIGEHTDYNQGLVLP
10 20 30 40 50
70 80 90 100 110
pF1KE3 MAVEQDVLIAVEPVKTYALQLANTN-----PLYPDFSTSANNIQIDKTKPLWHNYFLCGL
::.: .... : : ..: .:. : .: . . ... : : :: .
CCDS11 MALELMTVLVGSPRKDGLVSLLTTSEGADEPQRLQFPLPTAQRSLEPGTPRWANY----V
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE3 KGIQEHFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEIC
::. ... . : :.. .: ...: ..:::::..: . . . . . :..:
CCDS11 KGVIQYYPAAPLPGFSAVVVSSVPLGGGLSSSASLEVATYTFLQQLCPDSGTIAARAQVC
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE3 AKSER-YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKL--PSGAVFVIANSCVEM
..:. . : : ::: ::.....: : ::. :... : : :. ::. :.:: :.
CCDS11 QQAEHSFAGMPCGIMDQFISLMGQKGHALLIDCRSLETSLVPLSDPKLAVL-ITNSNVR-
180 190 200 210 220
240 250 260 270 280 290
pF1KE3 NKAATSHFNIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHP
.. :.:.. .: .:. .:. :.: . :.::: :::.
CCDS11 HSLASSEYPVRRRQCEEVARALGKES---------LREVQ------LEEL----------
230 240 250 260
300 310 320 330 340 350
pF1KE3 EPYNPEEICRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEA
: : : .: : .: :.:: .: :. : ...
CCDS11 ------EAARDL-VSKEGFRR------------------ARHVVGEIRRTAQAAAALRRG
270 280 290
360 370 380 390 400 410
pF1KE3 PENMVQLLGELMNQSHMSCRDMYECSCPELDQLVDICRKF-GAQGSRLTGAGWGGCTVSM
. .:.:: .:: : :: :: :::::::::. :. :::.::.:.:::::..
CCDS11 D---YRAFGRLMVESHRSLRDDYEVSCPELDQLVEAALAVPGVYGSRMTGGGFGGCTVTL
300 310 320 330 340 350
420 430 440 450
pF1KE3 VPADKLPSFLANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA
. :. : . .... : :. : ... .. . :: ::
CCDS11 LEASAAPHAMRHIQEHY-----GGTA----TFYLSQAADGAKVLCL
360 370 380 390
458 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 20:25:52 2016 done: Sun Nov 6 20:25:52 2016
Total Scan time: 3.120 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]