FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6513, 139 aa
1>>>pF1KE6513 139 - 139 aa - 139 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.6763+/-0.00035; mu= 14.0890+/- 0.022
mean_var=50.5748+/-10.106, 0's: 0 Z-trim(112.8): 38 B-trim: 616 in 1/51
Lambda= 0.180346
statistics sampled from 21893 (21931) to 21893 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.661), E-opt: 0.2 (0.257), width: 16
Scan time: 3.680
The best scores are: opt bits E(85289)
NP_037400 (OMIM: 608717) galactoside-binding solub ( 139) 958 256.6 9.3e-69
XP_011525176 (OMIM: 608717) PREDICTED: galactoside ( 132) 722 195.2 2.7e-50
XP_016882204 (OMIM: 608717) PREDICTED: galactoside ( 170) 722 195.3 3.4e-50
NP_064514 (OMIM: 607260) placental protein 13-like ( 139) 651 176.7 1e-44
NP_982297 (OMIM: 607260) placental protein 13-like ( 168) 619 168.5 3.9e-42
NP_001819 (OMIM: 153310) galectin-10 [Homo sapiens ( 142) 485 133.6 1.1e-31
XP_011525275 (OMIM: 602518) PREDICTED: galectin-4 ( 310) 241 70.2 2.6e-12
NP_006140 (OMIM: 602518) galectin-4 [Homo sapiens] ( 323) 241 70.3 2.7e-12
XP_011523098 (OMIM: 601879) PREDICTED: galectin-9 ( 221) 235 68.6 5.8e-12
XP_006721956 (OMIM: 601879) PREDICTED: galectin-9 ( 278) 235 68.7 7.1e-12
XP_016880112 (OMIM: 601879) PREDICTED: galectin-9 ( 343) 235 68.7 8.4e-12
NP_033665 (OMIM: 601879) galectin-9 isoform long [ ( 355) 235 68.7 8.7e-12
XP_006721958 (OMIM: 601879) PREDICTED: galectin-9 ( 234) 233 68.1 8.8e-12
NP_001317092 (OMIM: 601879) galectin-9 isoform 3 [ ( 246) 233 68.1 9.1e-12
XP_006721955 (OMIM: 601879) PREDICTED: galectin-9 ( 311) 233 68.2 1.1e-11
NP_002299 (OMIM: 601879) galectin-9 isoform short ( 323) 233 68.2 1.1e-11
NP_001035972 (OMIM: 617139) galectin-7 [Homo sapie ( 136) 212 62.5 2.4e-10
NP_002298 (OMIM: 600615) galectin-7 [Homo sapiens] ( 136) 212 62.5 2.4e-10
XP_016856764 (OMIM: 606099) PREDICTED: galectin-8 ( 226) 180 54.3 1.2e-07
XP_016856763 (OMIM: 606099) PREDICTED: galectin-8 ( 287) 180 54.4 1.5e-07
NP_963838 (OMIM: 606099) galectin-8 isoform b [Hom ( 317) 180 54.4 1.6e-07
NP_963837 (OMIM: 606099) galectin-8 isoform b [Hom ( 317) 180 54.4 1.6e-07
XP_011542490 (OMIM: 606099) PREDICTED: galectin-8 ( 329) 180 54.4 1.7e-07
NP_963839 (OMIM: 606099) galectin-8 isoform a [Hom ( 359) 180 54.4 1.8e-07
NP_006490 (OMIM: 606099) galectin-8 isoform a [Hom ( 359) 180 54.4 1.8e-07
NP_002297 (OMIM: 153619) galectin-3 isoform 1 [Hom ( 250) 171 52.0 6.7e-07
XP_011535061 (OMIM: 153619) PREDICTED: galectin-3 ( 264) 171 52.0 7e-07
XP_011525276 (OMIM: 602518) PREDICTED: galectin-4 ( 192) 157 48.3 6.6e-06
XP_016873947 (OMIM: 606096) PREDICTED: galectin-12 ( 252) 155 47.8 1.2e-05
NP_001136007 (OMIM: 606096) galectin-12 isoform 1 ( 337) 155 47.9 1.5e-05
NP_001136008 (OMIM: 606096) galectin-12 isoform 3 ( 327) 154 47.6 1.8e-05
NP_149092 (OMIM: 606096) galectin-12 isoform 2 [Ho ( 336) 154 47.6 1.8e-05
NP_001136010 (OMIM: 606096) galectin-12 isoform 5 ( 266) 136 42.9 0.00039
NP_001136009 (OMIM: 606096) galectin-12 isoform 4 ( 275) 136 42.9 0.0004
NP_002296 (OMIM: 150570) galectin-1 [Homo sapiens] ( 135) 116 37.5 0.008
>>NP_037400 (OMIM: 608717) galactoside-binding soluble l (139 aa)
initn: 958 init1: 958 opt: 958 Z-score: 1355.4 bits: 256.6 E(85289): 9.3e-69
Smith-Waterman score: 958; 100.0% identity (100.0% similar) in 139 aa overlap (1-139:1-139)
10 20 30 40 50 60
pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_037 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_037 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF
70 80 90 100 110 120
130
pF1KE6 VKMVQVSRDISLTSVCVCN
:::::::::::::::::::
NP_037 VKMVQVSRDISLTSVCVCN
130
>>XP_011525176 (OMIM: 608717) PREDICTED: galactoside-bin (132 aa)
initn: 722 init1: 722 opt: 722 Z-score: 1023.9 bits: 195.2 E(85289): 2.7e-50
Smith-Waterman score: 722; 100.0% identity (100.0% similar) in 104 aa overlap (36-139:29-132)
10 20 30 40 50 60
pF1KE6 VPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGNHVVMN
::::::::::::::::::::::::::::::
XP_011 MLQLRDSLRRLDSILKVAKKERTMSSLPLQVDFYTDMDEDSDIAFRFRVHFGNHVVMN
10 20 30 40 50
70 80 90 100 110 120
pF1KE6 RREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSFVKMVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 RREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSFVKMVQ
60 70 80 90 100 110
130
pF1KE6 VSRDISLTSVCVCN
::::::::::::::
XP_011 VSRDISLTSVCVCN
120 130
>>XP_016882204 (OMIM: 608717) PREDICTED: galactoside-bin (170 aa)
initn: 722 init1: 722 opt: 722 Z-score: 1022.3 bits: 195.3 E(85289): 3.4e-50
Smith-Waterman score: 722; 100.0% identity (100.0% similar) in 104 aa overlap (36-139:67-170)
10 20 30 40 50 60
pF1KE6 VPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGNHVVMN
::::::::::::::::::::::::::::::
XP_016 VFFTRAIQTACVFVCWFLRDNQRDTNPLFYLQVDFYTDMDEDSDIAFRFRVHFGNHVVMN
40 50 60 70 80 90
70 80 90 100 110 120
pF1KE6 RREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSFVKMVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSFVKMVQ
100 110 120 130 140 150
130
pF1KE6 VSRDISLTSVCVCN
::::::::::::::
XP_016 VSRDISLTSVCVCN
160 170
>>NP_064514 (OMIM: 607260) placental protein 13-like iso (139 aa)
initn: 662 init1: 651 opt: 651 Z-score: 923.7 bits: 176.7 E(85289): 1e-44
Smith-Waterman score: 651; 67.6% identity (84.2% similar) in 139 aa overlap (1-139:1-139)
10 20 30 40 50 60
pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN
:::::::: ::::: ::::::: :::: .:..::::.:.::: :::::::::.::.:::.
NP_064 MSSLPVPYTLPVSLPVGSCVIITGTPILTFVKDPQLEVNFYTGMDEDSDIAFQFRLHFGH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF
..:: :::: :: :.:::::: :::::::...::.. ::: :::.:.::.::.
NP_064 PAIMNSCVFGIWRYEEKCYYLPFEDGKPFELCIYVRHKEYKVMVNGQRIYNFAHRFPPAS
70 80 90 100 110 120
130
pF1KE6 VKMVQVSRDISLTSVCVCN
:::.:: :::::: : . .
NP_064 VKMLQVFRDISLTRVLISD
130
>>NP_982297 (OMIM: 607260) placental protein 13-like iso (168 aa)
initn: 619 init1: 619 opt: 619 Z-score: 877.5 bits: 168.5 E(85289): 3.9e-42
Smith-Waterman score: 619; 66.4% identity (83.6% similar) in 134 aa overlap (6-139:35-168)
10 20 30
pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQ
::: ::::: ::::::: :::: .:..:::
NP_982 HRLHLCKYWGCAVSNVCRFWEGRPLPLMIVVPYTLPVSLPVGSCVIITGTPILTFVKDPQ
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE6 LQVDFYTDMDEDSDIAFRFRVHFGNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYV
:.:.::: :::::::::.::.:::. ..:: :::: :: :.:::::: :::::::
NP_982 LEVNFYTGMDEDSDIAFQFRLHFGHPAIMNSCVFGIWRYEEKCYYLPFEDGKPFELCIYV
70 80 90 100 110 120
100 110 120 130
pF1KE6 HYNEYEIKVNGIRIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN
...::.. ::: :::.:.::.::. :::.:: :::::: : . .
NP_982 RHKEYKVMVNGQRIYNFAHRFPPASVKMLQVFRDISLTRVLISD
130 140 150 160
>>NP_001819 (OMIM: 153310) galectin-10 [Homo sapiens] (142 aa)
initn: 485 init1: 485 opt: 485 Z-score: 690.2 bits: 133.6 E(85289): 1.1e-31
Smith-Waterman score: 485; 55.5% identity (73.0% similar) in 137 aa overlap (1-137:1-137)
10 20 30 40 50 60
pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFGN
:: ::::: .:::.:: : ::: :. :.:.: :::::.:.: :.:::.:.:.: ::
NP_001 MSLLPVPYTEAASLSTGSTVTIKGRPLACFLNEPYLQVDFHTEMKEESDIVFHFQVCFGR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 HVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGIRIYGFVHRIPPSF
.:::: ::.: : . . .::.::..::: : : ..:.. ::: : : ::: :
NP_001 RVVMNSREYGAWKQQVESKNMPFQDGQEFELSISVLPDKYQVMVNGQSSYTFDHRIKPEA
70 80 90 100 110 120
130
pF1KE6 VKMVQVSRDISLTSVCVCN
:::::: ::::::. :
NP_001 VKMVQVWRDISLTKFNVSYLKR
130 140
>>XP_011525275 (OMIM: 602518) PREDICTED: galectin-4 isof (310 aa)
initn: 191 init1: 163 opt: 241 Z-score: 341.9 bits: 70.2 E(85289): 2.6e-12
Smith-Waterman score: 241; 34.1% identity (61.5% similar) in 135 aa overlap (3-135:16-147)
10 20 30 40
pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDED
.:: .: .:.:: : :.:. . . .. :.: . .:
XP_011 MAYVPAPGYQPTYNPTLPYYQPIPGGLNVGMSVYIQGVASEHM---KRFFVNFVVGQDPG
10 20 30 40 50
50 60 70 80 90 100
pF1KE6 SDIAFRFRVHFG--NHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVN
::.::.: .: ..::.: . : : :: .::. : ::: . : ..:.. ::
XP_011 SDVAFHFNPRFDGWDKVVFNTLQGGKWGSEERKRSMPFKKGAAFELVFIVLAEHYKVVVN
60 70 80 90 100 110
110 120 130
pF1KE6 GIRIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN
: .: . ::.: ..: .::. :..: :.
XP_011 GNPFYEYGHRLPLQMVTHLQVDGDLQLQSINFIGGQPLRPQGPPMMPPYPTMEGPPTFNP
120 130 140 150 160 170
>--
initn: 140 init1: 87 opt: 157 Z-score: 223.8 bits: 48.4 E(85289): 1e-05
Smith-Waterman score: 157; 27.5% identity (58.0% similar) in 138 aa overlap (5-137:178-310)
10 20 30
pF1KE6 MSSLPVPY--KLPVSLSVGSCVIIKGTPIHSFIN
:::: .: .:.. .:::: . .
XP_011 NFIGGQPLRPQGPPMMPPYPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKG---YVPPT
150 160 170 180 190 200
40 50 60 70 80 90
pF1KE6 DPQLQVDFYTDMDEDSDIAFRFRVHFGNH-VVMNRREFGIWMLEETT-DYVPFEDGKQFE
.. ..: . ..:::... ..:: :: : : : :: . :: :. :.
XP_011 GKSFAINF--KVGSSGDIALHINPRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFD
210 220 230 240 250 260
100 110 120 130
pF1KE6 LCIYVHYNEYEIKVNGIRIYGFVHRIPP-SFVKMVQVSRDISLTSVCVCN
: : ..... .:: ... :.::. . : .... :..:. : .
XP_011 LSIRCGLDRFKVYANGQHLFDFAHRLSAFQRVDTLEIQGDVTLSYVQI
270 280 290 300 310
>>NP_006140 (OMIM: 602518) galectin-4 [Homo sapiens] (323 aa)
initn: 191 init1: 163 opt: 241 Z-score: 341.7 bits: 70.3 E(85289): 2.7e-12
Smith-Waterman score: 241; 34.1% identity (61.5% similar) in 135 aa overlap (3-135:16-147)
10 20 30 40
pF1KE6 MSSLPVPYKLPVSLSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDED
.:: .: .:.:: : :.:. . . .. :.: . .:
NP_006 MAYVPAPGYQPTYNPTLPYYQPIPGGLNVGMSVYIQGVASEHM---KRFFVNFVVGQDPG
10 20 30 40 50
50 60 70 80 90 100
pF1KE6 SDIAFRFRVHFG--NHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVN
::.::.: .: ..::.: . : : :: .::. : ::: . : ..:.. ::
NP_006 SDVAFHFNPRFDGWDKVVFNTLQGGKWGSEERKRSMPFKKGAAFELVFIVLAEHYKVVVN
60 70 80 90 100 110
110 120 130
pF1KE6 GIRIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN
: .: . ::.: ..: .::. :..: :.
NP_006 GNPFYEYGHRLPLQMVTHLQVDGDLQLQSINFIGGQPLRPQGPPMMPPYPGPGHCHQQLN
120 130 140 150 160 170
>--
initn: 140 init1: 87 opt: 157 Z-score: 223.6 bits: 48.4 E(85289): 1e-05
Smith-Waterman score: 157; 27.5% identity (58.0% similar) in 138 aa overlap (5-137:191-323)
10 20 30
pF1KE6 MSSLPVPY--KLPVSLSVGSCVIIKGTPIHSFIN
:::: .: .:.. .:::: . .
NP_006 PMMPPYPGPGHCHQQLNSLPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKG---YVPPT
170 180 190 200 210
40 50 60 70 80 90
pF1KE6 DPQLQVDFYTDMDEDSDIAFRFRVHFGNH-VVMNRREFGIWMLEETT-DYVPFEDGKQFE
.. ..: . ..:::... ..:: :: : : : :: . :: :. :.
NP_006 GKSFAINF--KVGSSGDIALHINPRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFD
220 230 240 250 260 270
100 110 120 130
pF1KE6 LCIYVHYNEYEIKVNGIRIYGFVHRIPP-SFVKMVQVSRDISLTSVCVCN
: : ..... .:: ... :.::. . : .... :..:. : .
NP_006 LSIRCGLDRFKVYANGQHLFDFAHRLSAFQRVDTLEIQGDVTLSYVQI
280 290 300 310 320
>>XP_011523098 (OMIM: 601879) PREDICTED: galectin-9 isof (221 aa)
initn: 221 init1: 162 opt: 235 Z-score: 335.7 bits: 68.6 E(85289): 5.8e-12
Smith-Waterman score: 235; 31.2% identity (62.3% similar) in 138 aa overlap (6-139:15-149)
10 20 30 40
pF1KE6 MSSLPVPYKLPVS--LSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSD
::.. .. :. : . ..:: . : . .. :.: : .. .:
XP_011 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSS--SGTRFAVNFQTGFS-GND
10 20 30 40 50
50 60 70 80 90 100
pF1KE6 IAFRFRVHF--GNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGI
:::.: .: :..:: : :. : : :: ..::. : :.::. :. ..... ::::
XP_011 IAFHFNPRFEDGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKVMVNGI
60 70 80 90 100 110
110 120 130
pF1KE6 RIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN
. . ::.: : ..:. ...:. . :
XP_011 LFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFSTVPFSQPVCFPPRPRGR
120 130 140 150 160 170
XP_011 RQKARTCLPPEATDVVQASGIPHHDDRGPVPHPLHPTTATVTFL
180 190 200 210 220
>>XP_006721956 (OMIM: 601879) PREDICTED: galectin-9 isof (278 aa)
initn: 186 init1: 162 opt: 235 Z-score: 334.2 bits: 68.7 E(85289): 7.1e-12
Smith-Waterman score: 235; 31.2% identity (62.3% similar) in 138 aa overlap (6-139:15-149)
10 20 30 40
pF1KE6 MSSLPVPYKLPVS--LSVGSCVIIKGTPIHSFINDPQLQVDFYTDMDEDSD
::.. .. :. : . ..:: . : . .. :.: : .. .:
XP_006 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSS--SGTRFAVNFQTGFS-GND
10 20 30 40 50
50 60 70 80 90 100
pF1KE6 IAFRFRVHF--GNHVVMNRREFGIWMLEETTDYVPFEDGKQFELCIYVHYNEYEIKVNGI
:::.: .: :..:: : :. : : :: ..::. : :.::. :. ..... ::::
XP_006 IAFHFNPRFEDGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKVMVNGI
60 70 80 90 100 110
110 120 130
pF1KE6 RIYGFVHRIPPSFVKMVQVSRDISLTSVCVCN
. . ::.: : ..:. ...:. . :
XP_006 LFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFSTVPFSQPVCFPPRPRGR
120 130 140 150 160 170
XP_006 RQKPPGVWPANPAPITQTVIHTVQSAPGQMFSTPAIPPMMYPHPAYPMPFITTILGGLYP
180 190 200 210 220 230
139 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 13:59:20 2016 done: Tue Nov 8 13:59:21 2016
Total Scan time: 3.680 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]