FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6513, 139 aa 1>>>pF1KE6513 139 - 139 aa - 139 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9071+/-0.000781; mu= 12.6477+/- 0.047 mean_var=52.2924+/-10.346, 0's: 0 Z-trim(106.1): 25 B-trim: 0 in 0/49 Lambda= 0.177360 statistics sampled from 8749 (8773) to 8749 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.67), E-opt: 0.2 (0.269), width: 16 Scan time: 1.550 The best scores are: opt bits E(32554) CCDS33024.1 LGALS13 gene_id:29124|Hs108|chr19 ( 139) 958 252.7 5.5e-68 CCDS54267.1 LGALS16 gene_id:148003|Hs108|chr19 ( 142) 687 183.3 4.2e-47 CCDS46073.1 LGALS14 gene_id:56891|Hs108|chr19 ( 139) 651 174.1 2.4e-44 CCDS12542.1 LGALS14 gene_id:56891|Hs108|chr19 ( 168) 619 166.0 8.4e-42 CCDS33025.1 CLC gene_id:1178|Hs108|chr19 ( 142) 485 131.6 1.5e-31 CCDS12521.1 LGALS4 gene_id:3960|Hs108|chr19 ( 323) 241 69.4 1.9e-12 CCDS11222.1 LGALS9 gene_id:3965|Hs108|chr17 ( 355) 235 67.8 6.1e-12 CCDS82093.1 LGALS9 gene_id:3965|Hs108|chr17 ( 246) 233 67.3 6.3e-12 CCDS32592.1 LGALS9 gene_id:3965|Hs108|chr17 ( 323) 233 67.3 8e-12 CCDS42283.1 LGALS9B gene_id:284194|Hs108|chr17 ( 355) 224 65.0 4.3e-11 CCDS32587.1 LGALS9C gene_id:654346|Hs108|chr17 ( 356) 221 64.3 7.3e-11 >>CCDS33024.1 LGALS13 gene_id:29124|Hs108|chr19 (139 aa) initn: 958 init1: 958 opt: 958 Z-score: 1334.0 bits: 252.7 E(32554): 5.5e-68 Smith-Waterman score: 958; 100.0% identity (100.0% similar) in 139 aa overlap (1-139:1-139) 10 20 30 40 50 60 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF 70 80 90 100 110 120 130 pF1KE6 VKMVQVSRDISLTSVCVCN ::::::::::::::::::: CCDS33 VKMVQVSRDISLTSVCVCN 130 >>CCDS54267.1 LGALS16 gene_id:148003|Hs108|chr19 (142 aa) initn: 687 init1: 687 opt: 687 Z-score: 959.1 bits: 183.3 E(32554): 4.2e-47 Smith-Waterman score: 687; 75.5% identity (87.1% similar) in 139 aa overlap (1-139:1-139) 10 20 30 40 50 60 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN :: : :::::::::::::::::::: : : ::.:::::::::.:.:::.:::..:::.: CCDS54 MSFLTVPYKLPVSLSVGSCVIIKGTLIDSSINEPQLQVDFYTEMNEDSEIAFHLRVHLGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF .:::: ::::::::::. :::::::: :.: ::: .::::.:::: ::.::::::::. CCDS54 RVVMNSREFGIWMLEENLHYVPFEDGKPFDLRIYVCHNEYEVKVNGEYIYAFVHRIPPSY 70 80 90 100 110 120 130 pF1KE6 VKMVQVSRDISLTSVCVCN :::.:: ::.:: :: : : CCDS54 VKMIQVWRDVSLDSVLVNNGRR 130 140 >>CCDS46073.1 LGALS14 gene_id:56891|Hs108|chr19 (139 aa) initn: 662 init1: 651 opt: 651 Z-score: 909.5 bits: 174.1 E(32554): 2.4e-44 Smith-Waterman score: 651; 67.6% identity (84.2% similar) in 139 aa overlap (1-139:1-139) 10 20 30 40 50 60 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN :::::::: ::::: ::::::: :::: .:..::::.:.::: :::::::::.::.:::. CCDS46 MSSLPVPYTLPVSLPVGSCVIITGTPILTFVKDPQLEVNFYTGMDEDSDIAFQFRLHFGH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF ..:: :::: :: :.:::::: :::::::...::.. ::: :::.:.::.::. CCDS46 PAIMNSCVFGIWRYEEKCYYLPFEDGKPFELCIYVRHKEYKVMVNGQRIYNFAHRFPPAS 70 80 90 100 110 120 130 pF1KE6 VKMVQVSRDISLTSVCVCN :::.:: :::::: : . . CCDS46 VKMLQVFRDISLTRVLISD 130 >>CCDS12542.1 LGALS14 gene_id:56891|Hs108|chr19 (168 aa) initn: 619 init1: 619 opt: 619 Z-score: 864.0 bits: 166.0 E(32554): 8.4e-42 Smith-Waterman score: 619; 66.4% identity (83.6% similar) in 134 aa overlap (6-139:35-168) 10 20 30 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQ ::: ::::: ::::::: :::: .:..::: CCDS12 HRLHLCKYWGCAVSNVCRFWEGRPLPLMIVVPYTLPVSLPVGSCVIITGTPILTFVKDPQ 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 LQVDFYTDMDEDSDIAFRFRVHFGNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYV :.:.::: :::::::::.::.:::. ..:: :::: :: :.:::::: ::::::: CCDS12 LEVNFYTGMDEDSDIAFQFRLHFGHPAIMNSCVFGIWRYEEKCYYLPFEDGKPFELCIYV 70 80 90 100 110 120 100 110 120 130 pF1KE6 HYNEYEIKVNGIRIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN ...::.. ::: :::.:.::.::. :::.:: :::::: : . . CCDS12 RHKEYKVMVNGQRIYNFAHRFPPASVKMLQVFRDISLTRVLISD 130 140 150 160 >>CCDS33025.1 CLC gene_id:1178|Hs108|chr19 (142 aa) initn: 485 init1: 485 opt: 485 Z-score: 679.8 bits: 131.6 E(32554): 1.5e-31 Smith-Waterman score: 485; 55.5% identity (73.0% similar) in 137 aa overlap (1-137:1-137) 10 20 30 40 50 60 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN :: ::::: .:::.:: : ::: :. :.:.: :::::.:.: :.:::.:.:.: :: CCDS33 MSLLPVPYTEAASLSTGSTVTIKGRPLACFLNEPYLQVDFHTEMKEESDIVFHFQVCFGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF .:::: ::.: : . . .::.::..::: : : ..:.. ::: : : ::: : CCDS33 RVVMNSREYGAWKQQVESKNMPFQDGQEFELSISVLPDKYQVMVNGQSSYTFDHRIKPEA 70 80 90 100 110 120 130 pF1KE6 VKMVQVSRDISLTSVCVCN :::::: ::::::. : CCDS33 VKMVQVWRDISLTKFNVSYLKR 130 140 >>CCDS12521.1 LGALS4 gene_id:3960|Hs108|chr19 (323 aa) initn: 191 init1: 163 opt: 241 Z-score: 336.8 bits: 69.4 E(32554): 1.9e-12 Smith-Waterman score: 241; 34.1% identity (61.5% similar) in 135 aa overlap (3-135:16-147) 10 20 30 40 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDED .:: .: .:.:: : :.:. . . .. :.: . .: CCDS12 MAYVPAPGYQPTYNPTLPYYQPIPGGLNVGMSVYIQGVASEHM---KRFFVNFVVGQDPG 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 SDIAFRFRVHFG--NHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVN ::.::.: .: ..::.: . : : :: .::. : ::: . : ..:.. :: CCDS12 SDVAFHFNPRFDGWDKVVFNTLQGGKWGSEERKRSMPFKKGAAFELVFIVLAEHYKVVVN 60 70 80 90 100 110 110 120 130 pF1KE6 GIRIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN : .: . ::.: ..: .::. :..: :. CCDS12 GNPFYEYGHRLPLQMVTHLQVDGDLQLQSINFIGGQPLRPQGPPMMPPYPGPGHCHQQLN 120 130 140 150 160 170 >>CCDS11222.1 LGALS9 gene_id:3965|Hs108|chr17 (355 aa) initn: 202 init1: 162 opt: 235 Z-score: 327.9 bits: 67.8 E(32554): 6.1e-12 Smith-Waterman score: 235; 31.2% identity (62.3% similar) in 138 aa overlap (6-139:15-149) 10 20 30 40 pF1KE6 MSSLPVPYKLPVS--LSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSD ::.. .. :. : . ..:: . : . .. :.: : .. .: CCDS11 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSS--SGTRFAVNFQTGFS-GND 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 IAFRFRVHF--GNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGI :::.: .: :..:: : :. : : :: ..::. : :.::. :. ..... :::: CCDS11 IAFHFNPRFEDGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKVMVNGI 60 70 80 90 100 110 110 120 130 pF1KE6 RIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN . . ::.: : ..:. ...:. . : CCDS11 LFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFSTVPFSQPVCFPPRPRGR 120 130 140 150 160 170 CCDS11 RQKPPGVWPANPAPITQTVIHTVQSAPGQMFSTPAIPPMMYPHPAYPMPFITTILGGLYP 180 190 200 210 220 230 >>CCDS82093.1 LGALS9 gene_id:3965|Hs108|chr17 (246 aa) initn: 202 init1: 162 opt: 233 Z-score: 327.6 bits: 67.3 E(32554): 6.3e-12 Smith-Waterman score: 233; 31.3% identity (63.4% similar) in 134 aa overlap (6-135:15-145) 10 20 30 40 pF1KE6 MSSLPVPYKLPVS--LSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSD ::.. .. :. : . ..:: . : . .. :.: : .. .: CCDS82 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSS--SGTRFAVNFQTGFS-GND 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 IAFRFRVHF--GNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGI :::.: .: :..:: : :. : : :: ..::. : :.::. :. ..... :::: CCDS82 IAFHFNPRFEDGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKVMVNGI 60 70 80 90 100 110 110 120 130 pF1KE6 RIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN . . ::.: : ..:. ...:. . CCDS82 LFVQYFHRVPFHRVDTISVNGSVQLSYISFQPPGVWPANPAPITQTVIHTVQSAPGQMFS 120 130 140 150 160 170 >>CCDS32592.1 LGALS9 gene_id:3965|Hs108|chr17 (323 aa) initn: 202 init1: 162 opt: 233 Z-score: 325.7 bits: 67.3 E(32554): 8e-12 Smith-Waterman score: 233; 31.3% identity (63.4% similar) in 134 aa overlap (6-135:15-145) 10 20 30 40 pF1KE6 MSSLPVPYKLPVS--LSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSD ::.. .. :. : . ..:: . : . .. :.: : .. .: CCDS32 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSS--SGTRFAVNFQTGFS-GND 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 IAFRFRVHF--GNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGI :::.: .: :..:: : :. : : :: ..::. : :.::. :. ..... :::: CCDS32 IAFHFNPRFEDGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKVMVNGI 60 70 80 90 100 110 110 120 130 pF1KE6 RIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN . . ::.: : ..:. ...:. . CCDS32 LFVQYFHRVPFHRVDTISVNGSVQLSYISFQPPGVWPANPAPITQTVIHTVQSAPGQMFS 120 130 140 150 160 170 >>CCDS42283.1 LGALS9B gene_id:284194|Hs108|chr17 (355 aa) initn: 190 init1: 150 opt: 224 Z-score: 312.6 bits: 65.0 E(32554): 4.3e-11 Smith-Waterman score: 224; 30.4% identity (61.6% similar) in 138 aa overlap (6-139:15-149) 10 20 30 40 pF1KE6 MSSLPVPYKLPVS--LSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSD ::.. .. :. : . ..:. . : . .. ::: : .. .: CCDS42 MAFSGSQAPYLSPAVPFSGTIQGGLQDGFQITVNGAVLSS--SGTRFAVDFQTGFS-GND 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 IAFRFRVHF--GNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGI :::.: .: :..:: : :. : : :: ..::. : :.::. :. ..... ::: CCDS42 IAFHFNPRFEDGGYVVCNTRQKGRWGPEERKMHMPFQKGMPFDLCFLVQSSDFKVMVNGS 60 70 80 90 100 110 110 120 130 pF1KE6 RIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN . . ::.: : ..:. ...:. . : CCDS42 LFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFSTVPFSQPVCFPPRPRGR 120 130 140 150 160 170 CCDS42 RQKPPSVRPANPAPITQTVIHTVQSASGQMFSTPAIPPMMYPHPAYPMPFITTIPGGLYP 180 190 200 210 220 230 139 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 13:59:20 2016 done: Tue Nov 8 13:59:20 2016 Total Scan time: 1.550 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]