FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6513, 139 aa 1>>>pF1KE6513 139 - 139 aa - 139 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.6763+/-0.00035; mu= 14.0890+/- 0.022 mean_var=50.5748+/-10.106, 0's: 0 Z-trim(112.8): 38 B-trim: 616 in 1/51 Lambda= 0.180346 statistics sampled from 21893 (21931) to 21893 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.661), E-opt: 0.2 (0.257), width: 16 Scan time: 3.680 The best scores are: opt bits E(85289) NP_037400 (OMIM: 608717) galactoside-binding solub ( 139) 958 256.6 9.3e-69 XP_011525176 (OMIM: 608717) PREDICTED: galactoside ( 132) 722 195.2 2.7e-50 XP_016882204 (OMIM: 608717) PREDICTED: galactoside ( 170) 722 195.3 3.4e-50 NP_064514 (OMIM: 607260) placental protein 13-like ( 139) 651 176.7 1e-44 NP_982297 (OMIM: 607260) placental protein 13-like ( 168) 619 168.5 3.9e-42 NP_001819 (OMIM: 153310) galectin-10 [Homo sapiens ( 142) 485 133.6 1.1e-31 XP_011525275 (OMIM: 602518) PREDICTED: galectin-4 ( 310) 241 70.2 2.6e-12 NP_006140 (OMIM: 602518) galectin-4 [Homo sapiens] ( 323) 241 70.3 2.7e-12 XP_011523098 (OMIM: 601879) PREDICTED: galectin-9 ( 221) 235 68.6 5.8e-12 XP_006721956 (OMIM: 601879) PREDICTED: galectin-9 ( 278) 235 68.7 7.1e-12 XP_016880112 (OMIM: 601879) PREDICTED: galectin-9 ( 343) 235 68.7 8.4e-12 NP_033665 (OMIM: 601879) galectin-9 isoform long [ ( 355) 235 68.7 8.7e-12 XP_006721958 (OMIM: 601879) PREDICTED: galectin-9 ( 234) 233 68.1 8.8e-12 NP_001317092 (OMIM: 601879) galectin-9 isoform 3 [ ( 246) 233 68.1 9.1e-12 XP_006721955 (OMIM: 601879) PREDICTED: galectin-9 ( 311) 233 68.2 1.1e-11 NP_002299 (OMIM: 601879) galectin-9 isoform short ( 323) 233 68.2 1.1e-11 NP_001035972 (OMIM: 617139) galectin-7 [Homo sapie ( 136) 212 62.5 2.4e-10 NP_002298 (OMIM: 600615) galectin-7 [Homo sapiens] ( 136) 212 62.5 2.4e-10 XP_016856764 (OMIM: 606099) PREDICTED: galectin-8 ( 226) 180 54.3 1.2e-07 XP_016856763 (OMIM: 606099) PREDICTED: galectin-8 ( 287) 180 54.4 1.5e-07 NP_963838 (OMIM: 606099) galectin-8 isoform b [Hom ( 317) 180 54.4 1.6e-07 NP_963837 (OMIM: 606099) galectin-8 isoform b [Hom ( 317) 180 54.4 1.6e-07 XP_011542490 (OMIM: 606099) PREDICTED: galectin-8 ( 329) 180 54.4 1.7e-07 NP_963839 (OMIM: 606099) galectin-8 isoform a [Hom ( 359) 180 54.4 1.8e-07 NP_006490 (OMIM: 606099) galectin-8 isoform a [Hom ( 359) 180 54.4 1.8e-07 NP_002297 (OMIM: 153619) galectin-3 isoform 1 [Hom ( 250) 171 52.0 6.7e-07 XP_011535061 (OMIM: 153619) PREDICTED: galectin-3 ( 264) 171 52.0 7e-07 XP_011525276 (OMIM: 602518) PREDICTED: galectin-4 ( 192) 157 48.3 6.6e-06 XP_016873947 (OMIM: 606096) PREDICTED: galectin-12 ( 252) 155 47.8 1.2e-05 NP_001136007 (OMIM: 606096) galectin-12 isoform 1 ( 337) 155 47.9 1.5e-05 NP_001136008 (OMIM: 606096) galectin-12 isoform 3 ( 327) 154 47.6 1.8e-05 NP_149092 (OMIM: 606096) galectin-12 isoform 2 [Ho ( 336) 154 47.6 1.8e-05 NP_001136010 (OMIM: 606096) galectin-12 isoform 5 ( 266) 136 42.9 0.00039 NP_001136009 (OMIM: 606096) galectin-12 isoform 4 ( 275) 136 42.9 0.0004 NP_002296 (OMIM: 150570) galectin-1 [Homo sapiens] ( 135) 116 37.5 0.008 >>NP_037400 (OMIM: 608717) galactoside-binding soluble l (139 aa) initn: 958 init1: 958 opt: 958 Z-score: 1355.4 bits: 256.6 E(85289): 9.3e-69 Smith-Waterman score: 958; 100.0% identity (100.0% similar) in 139 aa overlap (1-139:1-139) 10 20 30 40 50 60 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_037 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_037 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF 70 80 90 100 110 120 130 pF1KE6 VKMVQVSRDISLTSVCVCN ::::::::::::::::::: NP_037 VKMVQVSRDISLTSVCVCN 130 >>XP_011525176 (OMIM: 608717) PREDICTED: galactoside-bin (132 aa) initn: 722 init1: 722 opt: 722 Z-score: 1023.9 bits: 195.2 E(85289): 2.7e-50 Smith-Waterman score: 722; 100.0% identity (100.0% similar) in 104 aa overlap (36-139:29-132) 10 20 30 40 50 60 pF1KE6 VPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGNHVVMN :::::::::::::::::::::::::::::: XP_011 MLQLRDSLRRLDSILKVAKKERTMSSLPLQVDFYTDMDEDSDIAFRFRVHFGNHVVMN 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 RREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSFVKMVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 RREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSFVKMVQ 60 70 80 90 100 110 130 pF1KE6 VSRDISLTSVCVCN :::::::::::::: XP_011 VSRDISLTSVCVCN 120 130 >>XP_016882204 (OMIM: 608717) PREDICTED: galactoside-bin (170 aa) initn: 722 init1: 722 opt: 722 Z-score: 1022.3 bits: 195.3 E(85289): 3.4e-50 Smith-Waterman score: 722; 100.0% identity (100.0% similar) in 104 aa overlap (36-139:67-170) 10 20 30 40 50 60 pF1KE6 VPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGNHVVMN :::::::::::::::::::::::::::::: XP_016 VFFTRAIQTACVFVCWFLRDNQRDTNPLFYLQVDFYTDMDEDSDIAFRFRVHFGNHVVMN 40 50 60 70 80 90 70 80 90 100 110 120 pF1KE6 RREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSFVKMVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 RREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSFVKMVQ 100 110 120 130 140 150 130 pF1KE6 VSRDISLTSVCVCN :::::::::::::: XP_016 VSRDISLTSVCVCN 160 170 >>NP_064514 (OMIM: 607260) placental protein 13-like iso (139 aa) initn: 662 init1: 651 opt: 651 Z-score: 923.7 bits: 176.7 E(85289): 1e-44 Smith-Waterman score: 651; 67.6% identity (84.2% similar) in 139 aa overlap (1-139:1-139) 10 20 30 40 50 60 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN :::::::: ::::: ::::::: :::: .:..::::.:.::: :::::::::.::.:::. NP_064 MSSLPVPYTLPVSLPVGSCVIITGTPILTFVKDPQLEVNFYTGMDEDSDIAFQFRLHFGH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF ..:: :::: :: :.:::::: :::::::...::.. ::: :::.:.::.::. NP_064 PAIMNSCVFGIWRYEEKCYYLPFEDGKPFELCIYVRHKEYKVMVNGQRIYNFAHRFPPAS 70 80 90 100 110 120 130 pF1KE6 VKMVQVSRDISLTSVCVCN :::.:: :::::: : . . NP_064 VKMLQVFRDISLTRVLISD 130 >>NP_982297 (OMIM: 607260) placental protein 13-like iso (168 aa) initn: 619 init1: 619 opt: 619 Z-score: 877.5 bits: 168.5 E(85289): 3.9e-42 Smith-Waterman score: 619; 66.4% identity (83.6% similar) in 134 aa overlap (6-139:35-168) 10 20 30 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQ ::: ::::: ::::::: :::: .:..::: NP_982 HRLHLCKYWGCAVSNVCRFWEGRPLPLMIVVPYTLPVSLPVGSCVIITGTPILTFVKDPQ 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 LQVDFYTDMDEDSDIAFRFRVHFGNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYV :.:.::: :::::::::.::.:::. ..:: :::: :: :.:::::: ::::::: NP_982 LEVNFYTGMDEDSDIAFQFRLHFGHPAIMNSCVFGIWRYEEKCYYLPFEDGKPFELCIYV 70 80 90 100 110 120 100 110 120 130 pF1KE6 HYNEYEIKVNGIRIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN ...::.. ::: :::.:.::.::. :::.:: :::::: : . . NP_982 RHKEYKVMVNGQRIYNFAHRFPPASVKMLQVFRDISLTRVLISD 130 140 150 160 >>NP_001819 (OMIM: 153310) galectin-10 [Homo sapiens] (142 aa) initn: 485 init1: 485 opt: 485 Z-score: 690.2 bits: 133.6 E(85289): 1.1e-31 Smith-Waterman score: 485; 55.5% identity (73.0% similar) in 137 aa overlap (1-137:1-137) 10 20 30 40 50 60 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN :: ::::: .:::.:: : ::: :. :.:.: :::::.:.: :.:::.:.:.: :: NP_001 MSLLPVPYTEAASLSTGSTVTIKGRPLACFLNEPYLQVDFHTEMKEESDIVFHFQVCFGR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF .:::: ::.: : . . .::.::..::: : : ..:.. ::: : : ::: : NP_001 RVVMNSREYGAWKQQVESKNMPFQDGQEFELSISVLPDKYQVMVNGQSSYTFDHRIKPEA 70 80 90 100 110 120 130 pF1KE6 VKMVQVSRDISLTSVCVCN :::::: ::::::. : NP_001 VKMVQVWRDISLTKFNVSYLKR 130 140 >>XP_011525275 (OMIM: 602518) PREDICTED: galectin-4 isof (310 aa) initn: 191 init1: 163 opt: 241 Z-score: 341.9 bits: 70.2 E(85289): 2.6e-12 Smith-Waterman score: 241; 34.1% identity (61.5% similar) in 135 aa overlap (3-135:16-147) 10 20 30 40 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDED .:: .: .:.:: : :.:. . . .. :.: . .: XP_011 MAYVPAPGYQPTYNPTLPYYQPIPGGLNVGMSVYIQGVASEHM---KRFFVNFVVGQDPG 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 SDIAFRFRVHFG--NHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVN ::.::.: .: ..::.: . : : :: .::. : ::: . : ..:.. :: XP_011 SDVAFHFNPRFDGWDKVVFNTLQGGKWGSEERKRSMPFKKGAAFELVFIVLAEHYKVVVN 60 70 80 90 100 110 110 120 130 pF1KE6 GIRIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN : .: . ::.: ..: .::. :..: :. XP_011 GNPFYEYGHRLPLQMVTHLQVDGDLQLQSINFIGGQPLRPQGPPMMPPYPTMEGPPTFNP 120 130 140 150 160 170 >-- initn: 140 init1: 87 opt: 157 Z-score: 223.8 bits: 48.4 E(85289): 1e-05 Smith-Waterman score: 157; 27.5% identity (58.0% similar) in 138 aa overlap (5-137:178-310) 10 20 30 pF1KE6 MSSLPVPY--KLPVSLSVGSCVIIKGTPIHSFIN :::: .: .:.. .:::: . . XP_011 NFIGGQPLRPQGPPMMPPYPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKG---YVPPT 150 160 170 180 190 200 40 50 60 70 80 90 pF1KE6 DPQLQVDFYTDMDEDSDIAFRFRVHFGNH-VVMNRREFGIWMLEETT-DYVPFEDGKQFE .. ..: . ..:::... ..:: :: : : : :: . :: :. :. XP_011 GKSFAINF--KVGSSGDIALHINPRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFD 210 220 230 240 250 260 100 110 120 130 pF1KE6 LCIYVHYNEYEIKVNGIRIYGFVHRIPP-SFVKMVQVSRDISLTSVCVCN : : ..... .:: ... :.::. . : .... :..:. : . XP_011 LSIRCGLDRFKVYANGQHLFDFAHRLSAFQRVDTLEIQGDVTLSYVQI 270 280 290 300 310 >>NP_006140 (OMIM: 602518) galectin-4 [Homo sapiens] (323 aa) initn: 191 init1: 163 opt: 241 Z-score: 341.7 bits: 70.3 E(85289): 2.7e-12 Smith-Waterman score: 241; 34.1% identity (61.5% similar) in 135 aa overlap (3-135:16-147) 10 20 30 40 pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDED .:: .: .:.:: : :.:. . . .. :.: . .: NP_006 MAYVPAPGYQPTYNPTLPYYQPIPGGLNVGMSVYIQGVASEHM---KRFFVNFVVGQDPG 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 SDIAFRFRVHFG--NHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVN ::.::.: .: ..::.: . : : :: .::. : ::: . : ..:.. :: NP_006 SDVAFHFNPRFDGWDKVVFNTLQGGKWGSEERKRSMPFKKGAAFELVFIVLAEHYKVVVN 60 70 80 90 100 110 110 120 130 pF1KE6 GIRIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN : .: . ::.: ..: .::. :..: :. NP_006 GNPFYEYGHRLPLQMVTHLQVDGDLQLQSINFIGGQPLRPQGPPMMPPYPGPGHCHQQLN 120 130 140 150 160 170 >-- initn: 140 init1: 87 opt: 157 Z-score: 223.6 bits: 48.4 E(85289): 1e-05 Smith-Waterman score: 157; 27.5% identity (58.0% similar) in 138 aa overlap (5-137:191-323) 10 20 30 pF1KE6 MSSLPVPY--KLPVSLSVGSCVIIKGTPIHSFIN :::: .: .:.. .:::: . . NP_006 PMMPPYPGPGHCHQQLNSLPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKG---YVPPT 170 180 190 200 210 40 50 60 70 80 90 pF1KE6 DPQLQVDFYTDMDEDSDIAFRFRVHFGNH-VVMNRREFGIWMLEETT-DYVPFEDGKQFE .. ..: . ..:::... ..:: :: : : : :: . :: :. :. NP_006 GKSFAINF--KVGSSGDIALHINPRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFD 220 230 240 250 260 270 100 110 120 130 pF1KE6 LCIYVHYNEYEIKVNGIRIYGFVHRIPP-SFVKMVQVSRDISLTSVCVCN : : ..... .:: ... :.::. . : .... :..:. : . NP_006 LSIRCGLDRFKVYANGQHLFDFAHRLSAFQRVDTLEIQGDVTLSYVQI 280 290 300 310 320 >>XP_011523098 (OMIM: 601879) PREDICTED: galectin-9 isof (221 aa) initn: 221 init1: 162 opt: 235 Z-score: 335.7 bits: 68.6 E(85289): 5.8e-12 Smith-Waterman score: 235; 31.2% identity (62.3% similar) in 138 aa overlap (6-139:15-149) 10 20 30 40 pF1KE6 MSSLPVPYKLPVS--LSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSD ::.. .. :. : . ..:: . : . .. :.: : .. .: XP_011 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSS--SGTRFAVNFQTGFS-GND 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 IAFRFRVHF--GNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGI :::.: .: :..:: : :. : : :: ..::. : :.::. :. ..... :::: XP_011 IAFHFNPRFEDGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKVMVNGI 60 70 80 90 100 110 110 120 130 pF1KE6 RIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN . . ::.: : ..:. ...:. . : XP_011 LFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFSTVPFSQPVCFPPRPRGR 120 130 140 150 160 170 XP_011 RQKARTCLPPEATDVVQASGIPHHDDRGPVPHPLHPTTATVTFL 180 190 200 210 220 >>XP_006721956 (OMIM: 601879) PREDICTED: galectin-9 isof (278 aa) initn: 186 init1: 162 opt: 235 Z-score: 334.2 bits: 68.7 E(85289): 7.1e-12 Smith-Waterman score: 235; 31.2% identity (62.3% similar) in 138 aa overlap (6-139:15-149) 10 20 30 40 pF1KE6 MSSLPVPYKLPVS--LSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSD ::.. .. :. : . ..:: . : . .. :.: : .. .: XP_006 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSS--SGTRFAVNFQTGFS-GND 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 IAFRFRVHF--GNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGI :::.: .: :..:: : :. : : :: ..::. : :.::. :. ..... :::: XP_006 IAFHFNPRFEDGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKVMVNGI 60 70 80 90 100 110 110 120 130 pF1KE6 RIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN . . ::.: : ..:. ...:. . : XP_006 LFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFSTVPFSQPVCFPPRPRGR 120 130 140 150 160 170 XP_006 RQKPPGVWPANPAPITQTVIHTVQSAPGQMFSTPAIPPMMYPHPAYPMPFITTILGGLYP 180 190 200 210 220 230 139 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 13:59:20 2016 done: Tue Nov 8 13:59:21 2016 Total Scan time: 3.680 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]