FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6156, 132 aa 1>>>pF1KE6156 132 - 132 aa - 132 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3377+/-0.000332; mu= 10.1498+/- 0.021 mean_var=57.6328+/-11.391, 0's: 0 Z-trim(114.9): 31 B-trim: 248 in 1/56 Lambda= 0.168943 statistics sampled from 24908 (24939) to 24908 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.292), width: 16 Scan time: 4.750 The best scores are: opt bits E(85289) NP_006489 (OMIM: 150571,608446) galectin-2 [Homo s ( 132) 892 225.3 2.3e-59 NP_002296 (OMIM: 150570) galectin-1 [Homo sapiens] ( 135) 391 103.2 1.3e-22 NP_002297 (OMIM: 153619) galectin-3 isoform 1 [Hom ( 250) 236 65.5 5.5e-11 XP_011535061 (OMIM: 153619) PREDICTED: galectin-3 ( 264) 236 65.5 5.7e-11 NP_001035972 (OMIM: 617139) galectin-7 [Homo sapie ( 136) 210 59.1 2.6e-09 NP_002298 (OMIM: 600615) galectin-7 [Homo sapiens] ( 136) 210 59.1 2.6e-09 XP_016856764 (OMIM: 606099) PREDICTED: galectin-8 ( 226) 210 59.1 4e-09 NP_963838 (OMIM: 606099) galectin-8 isoform b [Hom ( 317) 192 54.8 1.1e-07 NP_963837 (OMIM: 606099) galectin-8 isoform b [Hom ( 317) 192 54.8 1.1e-07 XP_011525276 (OMIM: 602518) PREDICTED: galectin-4 ( 192) 189 54.0 1.2e-07 NP_963839 (OMIM: 606099) galectin-8 isoform a [Hom ( 359) 192 54.8 1.3e-07 NP_006490 (OMIM: 606099) galectin-8 isoform a [Hom ( 359) 192 54.8 1.3e-07 XP_011525275 (OMIM: 602518) PREDICTED: galectin-4 ( 310) 189 54.1 1.9e-07 NP_006140 (OMIM: 602518) galectin-4 [Homo sapiens] ( 323) 189 54.1 1.9e-07 XP_006721958 (OMIM: 601879) PREDICTED: galectin-9 ( 234) 187 53.5 2e-07 XP_011523098 (OMIM: 601879) PREDICTED: galectin-9 ( 221) 186 53.3 2.3e-07 NP_001317092 (OMIM: 601879) galectin-9 isoform 3 [ ( 246) 186 53.3 2.5e-07 XP_006721955 (OMIM: 601879) PREDICTED: galectin-9 ( 311) 187 53.6 2.6e-07 XP_006721956 (OMIM: 601879) PREDICTED: galectin-9 ( 278) 186 53.3 2.8e-07 NP_002299 (OMIM: 601879) galectin-9 isoform short ( 323) 186 53.3 3.2e-07 XP_016880112 (OMIM: 601879) PREDICTED: galectin-9 ( 343) 186 53.3 3.4e-07 NP_033665 (OMIM: 601879) galectin-9 isoform long [ ( 355) 186 53.3 3.5e-07 XP_016873947 (OMIM: 606096) PREDICTED: galectin-12 ( 252) 182 52.3 5e-07 NP_001136008 (OMIM: 606096) galectin-12 isoform 3 ( 327) 182 52.4 6.4e-07 NP_149092 (OMIM: 606096) galectin-12 isoform 2 [Ho ( 336) 182 52.4 6.5e-07 NP_001136007 (OMIM: 606096) galectin-12 isoform 1 ( 337) 182 52.4 6.6e-07 NP_001136010 (OMIM: 606096) galectin-12 isoform 5 ( 266) 180 51.8 7.4e-07 NP_001136009 (OMIM: 606096) galectin-12 isoform 4 ( 275) 180 51.8 7.7e-07 XP_016856763 (OMIM: 606099) PREDICTED: galectin-8 ( 287) 153 45.3 7.6e-05 XP_011542490 (OMIM: 606099) PREDICTED: galectin-8 ( 329) 153 45.3 8.6e-05 >>NP_006489 (OMIM: 150571,608446) galectin-2 [Homo sapie (132 aa) initn: 892 init1: 892 opt: 892 Z-score: 1186.9 bits: 225.3 E(85289): 2.3e-59 Smith-Waterman score: 892; 100.0% identity (100.0% similar) in 132 aa overlap (1-132:1-132) 10 20 30 40 50 60 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTDKLNLHFNPRFSESTIVCNSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTDKLNLHFNPRFSESTIVCNSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 DGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDGHELTFPNRLGHSHLSYLSVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 DGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDGHELTFPNRLGHSHLSYLSVR 70 80 90 100 110 120 130 pF1KE6 GGFNMSSFKLKE :::::::::::: NP_006 GGFNMSSFKLKE 130 >>NP_002296 (OMIM: 150570) galectin-1 [Homo sapiens] (135 aa) initn: 236 init1: 236 opt: 391 Z-score: 526.8 bits: 103.2 E(85289): 1.3e-22 Smith-Waterman score: 391; 44.2% identity (73.6% similar) in 129 aa overlap (1-125:1-129) 10 20 30 40 50 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTDKLNLHFNPRFSE----STIV :. : ..:...::: :.. : .: . .::.:::. ...: :::::::. .::: NP_002 MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCLHFNPRFNAHGDANTIV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 CNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDGHELTFPNRLGHSHLSY ::: ::. :: :::: . :.::: .. .::.. .. ::::::.:. :::::. ..: NP_002 CNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNLEAINY 70 80 90 100 110 120 120 130 pF1KE6 LSVRGGFNMSSFKLKE ... : :.. NP_002 MAADGDFKIKCVAFD 130 >>NP_002297 (OMIM: 153619) galectin-3 isoform 1 [Homo sa (250 aa) initn: 192 init1: 146 opt: 236 Z-score: 318.3 bits: 65.5 E(85289): 5.5e-11 Smith-Waterman score: 236; 32.2% identity (68.6% similar) in 118 aa overlap (14-127:128-244) 10 20 30 40 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTDKLN : . : :.. ... ..... .:.: . NP_002 TGAYPATGPYGAPAGPLIVPYNLPLPGGVVPRMLITILGTVKPNANRIALDFQRGND-VA 100 110 120 130 140 150 50 60 70 80 90 100 pF1KE6 LHFNPRFSEST---IVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDG .::::::.:.. ::::. .:::.:.:.. . : :. :. : : :.::: . :. NP_002 FHFNPRFNENNRRVIVCNTKLDNNWGREERQSVFPFESGKPFKIQVLVEPDHFKVAVNDA 160 170 180 190 200 210 110 120 130 pF1KE6 HELTFPNRLGH-SHLSYLSVRGGFNMSSFKLKE : : . .:. . ...: :.. : ....: NP_002 HLLQYNHRVKKLNEISKLGISGDIDLTSASYTMI 220 230 240 250 >>XP_011535061 (OMIM: 153619) PREDICTED: galectin-3 isof (264 aa) initn: 192 init1: 146 opt: 236 Z-score: 317.9 bits: 65.5 E(85289): 5.7e-11 Smith-Waterman score: 236; 32.2% identity (68.6% similar) in 118 aa overlap (14-127:142-258) 10 20 30 40 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTDKLN : . : :.. ... ..... .:.: . XP_011 TGAYPATGPYGAPAGPLIVPYNLPLPGGVVPRMLITILGTVKPNANRIALDFQRGND-VA 120 130 140 150 160 170 50 60 70 80 90 100 pF1KE6 LHFNPRFSEST---IVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDG .::::::.:.. ::::. .:::.:.:.. . : :. :. : : :.::: . :. XP_011 FHFNPRFNENNRRVIVCNTKLDNNWGREERQSVFPFESGKPFKIQVLVEPDHFKVAVNDA 180 190 200 210 220 230 110 120 130 pF1KE6 HELTFPNRLGH-SHLSYLSVRGGFNMSSFKLKE : : . .:. . ...: :.. : ....: XP_011 HLLQYNHRVKKLNEISKLGISGDIDLTSASYTMI 240 250 260 >>NP_001035972 (OMIM: 617139) galectin-7 [Homo sapiens] (136 aa) initn: 194 init1: 143 opt: 210 Z-score: 288.3 bits: 59.1 E(85289): 2.6e-09 Smith-Waterman score: 210; 30.9% identity (62.6% similar) in 123 aa overlap (12-130:14-135) 10 20 30 40 50 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINL--G--QGTDKLNLHFNPRFSEST ..::..:.: : . ... : .:: : ::.: ::::::.. : NP_001 MSNVPHKSSLPEGIRPGTVLRIRGLVPPNASRFHVNLLCGEEQGSDA-ALHFNPRLDTSE 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 IVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDGHELTFPNRLGHSHL .: :: . ..::.:.: . :. :. . . .: ::. . :.. : .:: ... NP_001 VVFNSKEQGSWGREERGPGVPFQRGQPFEVLIIASDDGFKAVVGDAQYHHFRHRLPLARV 60 70 80 90 100 110 120 130 pF1KE6 SYLSVRGGFNMSSFKLKE . : : ...: .. NP_001 RLVEVGGDVQLDSVRIF 120 130 >>NP_002298 (OMIM: 600615) galectin-7 [Homo sapiens] (136 aa) initn: 194 init1: 143 opt: 210 Z-score: 288.3 bits: 59.1 E(85289): 2.6e-09 Smith-Waterman score: 210; 30.9% identity (62.6% similar) in 123 aa overlap (12-130:14-135) 10 20 30 40 50 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINL--G--QGTDKLNLHFNPRFSEST ..::..:.: : . ... : .:: : ::.: ::::::.. : NP_002 MSNVPHKSSLPEGIRPGTVLRIRGLVPPNASRFHVNLLCGEEQGSDA-ALHFNPRLDTSE 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 IVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDGHELTFPNRLGHSHL .: :: . ..::.:.: . :. :. . . .: ::. . :.. : .:: ... NP_002 VVFNSKEQGSWGREERGPGVPFQRGQPFEVLIIASDDGFKAVVGDAQYHHFRHRLPLARV 60 70 80 90 100 110 120 130 pF1KE6 SYLSVRGGFNMSSFKLKE . : : ...: .. NP_002 RLVEVGGDVQLDSVRIF 120 130 >>XP_016856764 (OMIM: 606099) PREDICTED: galectin-8 isof (226 aa) initn: 129 init1: 79 opt: 210 Z-score: 284.7 bits: 59.1 E(85289): 4e-09 Smith-Waterman score: 210; 28.7% identity (60.7% similar) in 122 aa overlap (12-127:27-148) 10 20 30 40 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTD----- . ::. . : : . . .: : ..: .:.. XP_016 MMLSLNNLQNIIYNPVIPFVGTIPDQLDPGTLIVIRGHVPSDADRFQVDLQNGSSMKPRA 10 20 30 40 50 60 50 60 70 80 90 pF1KE6 KLNLHFNPRFSES-TIVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPD . .::::::... ::::.: . .::.:. :. . .... .:::.: . XP_016 DVAFHFNPRFKRAGCIVCNTLINEKWGREEITYDTPFKREKSFEIVIMVLKDKFQVAVNG 70 80 90 100 110 120 100 110 120 130 pF1KE6 GHELTFPNRLGHSHLSYLSVRGGFNMSSFKLKE : : . .:.: ... :.. : :. : XP_016 KHTLLYGHRIGPEKIDTLGIYGKVNIHSIGFSFSSDLQSTQASSLELTEISRENVPKSGT 130 140 150 160 170 180 >>NP_963838 (OMIM: 606099) galectin-8 isoform b [Homo sa (317 aa) initn: 153 init1: 86 opt: 192 Z-score: 258.6 bits: 54.8 E(85289): 1.1e-07 Smith-Waterman score: 210; 28.7% identity (60.7% similar) in 122 aa overlap (12-127:27-148) 10 20 30 40 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTD----- . ::. . : : . . .: : ..: .:.. NP_963 MMLSLNNLQNIIYNPVIPFVGTIPDQLDPGTLIVIRGHVPSDADRFQVDLQNGSSMKPRA 10 20 30 40 50 60 50 60 70 80 90 pF1KE6 KLNLHFNPRFSES-TIVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPD . .::::::... ::::.: . .::.:. :. . .... .:::.: . NP_963 DVAFHFNPRFKRAGCIVCNTLINEKWGREEITYDTPFKREKSFEIVIMVLKDKFQVAVNG 70 80 90 100 110 120 100 110 120 130 pF1KE6 GHELTFPNRLGHSHLSYLSVRGGFNMSSFKLKE : : . .:.: ... :.. : :. : NP_963 KHTLLYGHRIGPEKIDTLGIYGKVNIHSIGFSFSSDLQSTQASSLELTEISRENVPKSGT 130 140 150 160 170 180 >-- initn: 153 init1: 86 opt: 192 Z-score: 258.6 bits: 54.8 E(85289): 1.1e-07 Smith-Waterman score: 192; 29.2% identity (60.8% similar) in 120 aa overlap (9-125:192-311) 10 20 30 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQG : : :: :. . : . .. .: ..: : NP_963 ASSLELTEISRENVPKSGTPQLRLPFAARLNTPMGPGRTVVVKGEVNANAKSFNVDLLAG 170 180 190 200 210 220 40 50 60 70 80 90 pF1KE6 TDK-LNLHFNPRFSESTIVCNSLDGSNWGQEQRE-DHLCFSPGSEVKFTVTFESDKFKVK .: . ::.:::.. ...: ::. .::.:.:. . :::: .. . . .::: NP_963 KSKDIALHLNPRLNIKAFVRNSFLQESWGEEERNITSFPFSPGMYFEMIIYCDVREFKVA 230 240 250 260 270 280 100 110 120 130 pF1KE6 LPDGHELTFPNRLGH-SHLSYLSVRGGFNMSSFKLKE . : : . .:. . : .. : . : ... NP_963 VNGVHSLEYKHRFKELSSIDTLEINGDIHLLEVRSW 290 300 310 >>NP_963837 (OMIM: 606099) galectin-8 isoform b [Homo sa (317 aa) initn: 153 init1: 86 opt: 192 Z-score: 258.6 bits: 54.8 E(85289): 1.1e-07 Smith-Waterman score: 210; 28.7% identity (60.7% similar) in 122 aa overlap (12-127:27-148) 10 20 30 40 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTD----- . ::. . : : . . .: : ..: .:.. NP_963 MMLSLNNLQNIIYNPVIPFVGTIPDQLDPGTLIVIRGHVPSDADRFQVDLQNGSSMKPRA 10 20 30 40 50 60 50 60 70 80 90 pF1KE6 KLNLHFNPRFSES-TIVCNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPD . .::::::... ::::.: . .::.:. :. . .... .:::.: . NP_963 DVAFHFNPRFKRAGCIVCNTLINEKWGREEITYDTPFKREKSFEIVIMVLKDKFQVAVNG 70 80 90 100 110 120 100 110 120 130 pF1KE6 GHELTFPNRLGHSHLSYLSVRGGFNMSSFKLKE : : . .:.: ... :.. : :. : NP_963 KHTLLYGHRIGPEKIDTLGIYGKVNIHSIGFSFSSDLQSTQASSLELTEISRENVPKSGT 130 140 150 160 170 180 >-- initn: 153 init1: 86 opt: 192 Z-score: 258.6 bits: 54.8 E(85289): 1.1e-07 Smith-Waterman score: 192; 29.2% identity (60.8% similar) in 120 aa overlap (9-125:192-311) 10 20 30 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQG : : :: :. . : . .. .: ..: : NP_963 ASSLELTEISRENVPKSGTPQLRLPFAARLNTPMGPGRTVVVKGEVNANAKSFNVDLLAG 170 180 190 200 210 220 40 50 60 70 80 90 pF1KE6 TDK-LNLHFNPRFSESTIVCNSLDGSNWGQEQRE-DHLCFSPGSEVKFTVTFESDKFKVK .: . ::.:::.. ...: ::. .::.:.:. . :::: .. . . .::: NP_963 KSKDIALHLNPRLNIKAFVRNSFLQESWGEEERNITSFPFSPGMYFEMIIYCDVREFKVA 230 240 250 260 270 280 100 110 120 130 pF1KE6 LPDGHELTFPNRLGH-SHLSYLSVRGGFNMSSFKLKE . : : . .:. . : .. : . : ... NP_963 VNGVHSLEYKHRFKELSSIDTLEINGDIHLLEVRSW 290 300 310 >>XP_011525276 (OMIM: 602518) PREDICTED: galectin-4 isof (192 aa) initn: 98 init1: 98 opt: 189 Z-score: 258.2 bits: 54.0 E(85289): 1.2e-07 Smith-Waterman score: 189; 29.9% identity (64.1% similar) in 117 aa overlap (17-130:76-192) 10 20 30 40 pF1KE6 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTD-KLNLH :. : : . .:.::. :.. . :: XP_011 NSLPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKGYVPPTGKSFAINFKVGSSGDIALH 50 60 70 80 90 100 50 60 70 80 90 100 pF1KE6 FNPRFSESTIVCNSLDGSNWGQEQRE-DHLCFSPGSEVKFTVTFESDKFKVKLPDGHELT .:::....:.: ::: ...::.:... : :.::. ... :.::: : . XP_011 INPRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFDLSIRCGLDRFKVYANGQHLFD 110 120 130 140 150 160 110 120 130 pF1KE6 FPNRLG-HSHLSYLSVRGGFNMSSFKLKE : .::. .... : ..: ..: .. XP_011 FAHRLSAFQRVDTLEIQGDVTLSYVQI 170 180 190 132 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 09:56:23 2016 done: Tue Nov 8 09:56:24 2016 Total Scan time: 4.750 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]