FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6346, 432 aa 1>>>pF1KE6346 432 - 432 aa - 432 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9654+/-0.000773; mu= 18.8344+/- 0.047 mean_var=55.9743+/-11.468, 0's: 0 Z-trim(107.0): 32 B-trim: 0 in 0/48 Lambda= 0.171427 statistics sampled from 9311 (9328) to 9311 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.646), E-opt: 0.2 (0.287), width: 16 Scan time: 2.230 The best scores are: opt bits E(32554) CCDS34462.1 SLC35B2 gene_id:347734|Hs108|chr6 ( 432) 2859 715.1 3.3e-206 CCDS75463.1 SLC35B2 gene_id:347734|Hs108|chr6 ( 383) 2508 628.3 4e-180 CCDS69127.1 SLC35B2 gene_id:347734|Hs108|chr6 ( 339) 2019 507.3 9.1e-144 CCDS75462.1 SLC35B2 gene_id:347734|Hs108|chr6 ( 299) 1933 486.0 2.1e-137 CCDS11552.2 SLC35B1 gene_id:10237|Hs108|chr17 ( 359) 465 123.0 4.8e-28 CCDS4508.1 SLC35B3 gene_id:51000|Hs108|chr6 ( 401) 253 70.6 3.2e-12 >>CCDS34462.1 SLC35B2 gene_id:347734|Hs108|chr6 (432 aa) initn: 2859 init1: 2859 opt: 2859 Z-score: 3815.7 bits: 715.1 E(32554): 3.3e-206 Smith-Waterman score: 2859; 100.0% identity (100.0% similar) in 432 aa overlap (1-432:1-432) 10 20 30 40 50 60 pF1KE6 MDARWWAVVVLAAFPSLGAGGETPEAPPESWTQLWFFRFVVNAAGYASFMVPGYLLVQYF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MDARWWAVVVLAAFPSLGAGGETPEAPPESWTQLWFFRFVVNAAGYASFMVPGYLLVQYF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 RRKNYLETGRGLCFPLVKACVFGNEPKASDEVPLAPRTEAAETTPMWQALKLLFCATGLQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 RRKNYLETGRGLCFPLVKACVFGNEPKASDEVPLAPRTEAAETTPMWQALKLLFCATGLQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 VSYLTWGVLQERVMTRSYGATATSPGERFTDSQFLVLMNRVLALIVAGLSCVLCKQPRHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 VSYLTWGVLQERVMTRSYGATATSPGERFTDSQFLVLMNRVLALIVAGLSCVLCKQPRHG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 APMYRYSFASLSNVLSSWCQYEALKFVSFPTQVLAKASKVIPVMLMGKLVSRRSYEHWEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 APMYRYSFASLSNVLSSWCQYEALKFVSFPTQVLAKASKVIPVMLMGKLVSRRSYEHWEY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 LTATLISIGVSMFLLSSGPEPRSSPATTLSGLILLAGYIAFDSFTSNWQDALFAYKMSSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 LTATLISIGVSMFLLSSGPEPRSSPATTLSGLILLAGYIAFDSFTSNWQDALFAYKMSSV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 QMMFGVNFFSCLFTVGSLLEQGALLEGTRFMGRHSEFAAHALLLSICSACGQLFIFYTIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 QMMFGVNFFSCLFTVGSLLEQGALLEGTRFMGRHSEFAAHALLLSICSACGQLFIFYTIG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 QFGAAVFTIIMTLRQAFAILLSCLLYGHTVTVVGGLGVAVVFAALLLRVYARGRLKQRGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 QFGAAVFTIIMTLRQAFAILLSCLLYGHTVTVVGGLGVAVVFAALLLRVYARGRLKQRGK 370 380 390 400 410 420 430 pF1KE6 KAVPVESPVQKV :::::::::::: CCDS34 KAVPVESPVQKV 430 >>CCDS75463.1 SLC35B2 gene_id:347734|Hs108|chr6 (383 aa) initn: 2508 init1: 2508 opt: 2508 Z-score: 3347.3 bits: 628.3 E(32554): 4e-180 Smith-Waterman score: 2508; 100.0% identity (100.0% similar) in 383 aa overlap (50-432:1-383) 20 30 40 50 60 70 pF1KE6 GGETPEAPPESWTQLWFFRFVVNAAGYASFMVPGYLLVQYFRRKNYLETGRGLCFPLVKA :::::::::::::::::::::::::::::: CCDS75 MVPGYLLVQYFRRKNYLETGRGLCFPLVKA 10 20 30 80 90 100 110 120 130 pF1KE6 CVFGNEPKASDEVPLAPRTEAAETTPMWQALKLLFCATGLQVSYLTWGVLQERVMTRSYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 CVFGNEPKASDEVPLAPRTEAAETTPMWQALKLLFCATGLQVSYLTWGVLQERVMTRSYG 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE6 ATATSPGERFTDSQFLVLMNRVLALIVAGLSCVLCKQPRHGAPMYRYSFASLSNVLSSWC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ATATSPGERFTDSQFLVLMNRVLALIVAGLSCVLCKQPRHGAPMYRYSFASLSNVLSSWC 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE6 QYEALKFVSFPTQVLAKASKVIPVMLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 QYEALKFVSFPTQVLAKASKVIPVMLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGP 160 170 180 190 200 210 260 270 280 290 300 310 pF1KE6 EPRSSPATTLSGLILLAGYIAFDSFTSNWQDALFAYKMSSVQMMFGVNFFSCLFTVGSLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 EPRSSPATTLSGLILLAGYIAFDSFTSNWQDALFAYKMSSVQMMFGVNFFSCLFTVGSLL 220 230 240 250 260 270 320 330 340 350 360 370 pF1KE6 EQGALLEGTRFMGRHSEFAAHALLLSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 EQGALLEGTRFMGRHSEFAAHALLLSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFAI 280 290 300 310 320 330 380 390 400 410 420 430 pF1KE6 LLSCLLYGHTVTVVGGLGVAVVFAALLLRVYARGRLKQRGKKAVPVESPVQKV ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 LLSCLLYGHTVTVVGGLGVAVVFAALLLRVYARGRLKQRGKKAVPVESPVQKV 340 350 360 370 380 >>CCDS69127.1 SLC35B2 gene_id:347734|Hs108|chr6 (339 aa) initn: 2019 init1: 2019 opt: 2019 Z-score: 2694.5 bits: 507.3 E(32554): 9.1e-144 Smith-Waterman score: 2019; 100.0% identity (100.0% similar) in 312 aa overlap (121-432:28-339) 100 110 120 130 140 150 pF1KE6 EVPLAPRTEAAETTPMWQALKLLFCATGLQVSYLTWGVLQERVMTRSYGATATSPGERFT :::::::::::::::::::::::::::::: CCDS69 MLLAMPALWYLATSWCSTSGGRTTWRPVSYLTWGVLQERVMTRSYGATATSPGERFT 10 20 30 40 50 160 170 180 190 200 210 pF1KE6 DSQFLVLMNRVLALIVAGLSCVLCKQPRHGAPMYRYSFASLSNVLSSWCQYEALKFVSFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 DSQFLVLMNRVLALIVAGLSCVLCKQPRHGAPMYRYSFASLSNVLSSWCQYEALKFVSFP 60 70 80 90 100 110 220 230 240 250 260 270 pF1KE6 TQVLAKASKVIPVMLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGPEPRSSPATTLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 TQVLAKASKVIPVMLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGPEPRSSPATTLS 120 130 140 150 160 170 280 290 300 310 320 330 pF1KE6 GLILLAGYIAFDSFTSNWQDALFAYKMSSVQMMFGVNFFSCLFTVGSLLEQGALLEGTRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 GLILLAGYIAFDSFTSNWQDALFAYKMSSVQMMFGVNFFSCLFTVGSLLEQGALLEGTRF 180 190 200 210 220 230 340 350 360 370 380 390 pF1KE6 MGRHSEFAAHALLLSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFAILLSCLLYGHTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 MGRHSEFAAHALLLSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFAILLSCLLYGHTV 240 250 260 270 280 290 400 410 420 430 pF1KE6 TVVGGLGVAVVFAALLLRVYARGRLKQRGKKAVPVESPVQKV :::::::::::::::::::::::::::::::::::::::::: CCDS69 TVVGGLGVAVVFAALLLRVYARGRLKQRGKKAVPVESPVQKV 300 310 320 330 >>CCDS75462.1 SLC35B2 gene_id:347734|Hs108|chr6 (299 aa) initn: 1933 init1: 1933 opt: 1933 Z-score: 2580.4 bits: 486.0 E(32554): 2.1e-137 Smith-Waterman score: 1933; 100.0% identity (100.0% similar) in 299 aa overlap (134-432:1-299) 110 120 130 140 150 160 pF1KE6 TPMWQALKLLFCATGLQVSYLTWGVLQERVMTRSYGATATSPGERFTDSQFLVLMNRVLA :::::::::::::::::::::::::::::: CCDS75 MTRSYGATATSPGERFTDSQFLVLMNRVLA 10 20 30 170 180 190 200 210 220 pF1KE6 LIVAGLSCVLCKQPRHGAPMYRYSFASLSNVLSSWCQYEALKFVSFPTQVLAKASKVIPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 LIVAGLSCVLCKQPRHGAPMYRYSFASLSNVLSSWCQYEALKFVSFPTQVLAKASKVIPV 40 50 60 70 80 90 230 240 250 260 270 280 pF1KE6 MLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGPEPRSSPATTLSGLILLAGYIAFDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGPEPRSSPATTLSGLILLAGYIAFDS 100 110 120 130 140 150 290 300 310 320 330 340 pF1KE6 FTSNWQDALFAYKMSSVQMMFGVNFFSCLFTVGSLLEQGALLEGTRFMGRHSEFAAHALL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 FTSNWQDALFAYKMSSVQMMFGVNFFSCLFTVGSLLEQGALLEGTRFMGRHSEFAAHALL 160 170 180 190 200 210 350 360 370 380 390 400 pF1KE6 LSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFAILLSCLLYGHTVTVVGGLGVAVVFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 LSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFAILLSCLLYGHTVTVVGGLGVAVVFA 220 230 240 250 260 270 410 420 430 pF1KE6 ALLLRVYARGRLKQRGKKAVPVESPVQKV ::::::::::::::::::::::::::::: CCDS75 ALLLRVYARGRLKQRGKKAVPVESPVQKV 280 290 >>CCDS11552.2 SLC35B1 gene_id:10237|Hs108|chr17 (359 aa) initn: 339 init1: 183 opt: 465 Z-score: 617.0 bits: 123.0 E(32554): 4.8e-28 Smith-Waterman score: 465; 30.7% identity (59.7% similar) in 345 aa overlap (75-416:19-356) 50 60 70 80 90 100 pF1KE6 GYASFMVPGYLLVQYFRRKNYLETGRGLCFPLVKACVFGNEPKASDEVPLAPRTEAAETT ::. . : .. : .: . : :. .. CCDS11 MRPLPPVGDVRLELSPPPPLLPVPVVSGSPVGS-----SGRLMASSSS 10 20 30 40 110 120 130 140 150 160 pF1KE6 PMWQALKLLFCATGLQVSYLTWGVLQERVMTRSYGATATSPGERFTDSQFLVLMNRVLAL . . :.: .: :. : :. .:.:::.. .:: : . : :: . ::... :. CCDS11 LVPDRLRLPLCFLGVFVCYFYYGILQEKITRGKYGEGAKQ--ETFTFALTLVFIQCVINA 50 60 70 80 90 100 170 180 190 200 210 220 pF1KE6 IVAGLSCVLCKQPR-HGAPMYRYSFASLSNVLSSWCQYEALKFVSFPTQVLAKASKVIPV . : . . : . . :. :.: . . . ::.::..:::::.:. : ::: CCDS11 VFAKILIQFFDTARVDRTRSWLYAACSISYLGAMVSSNSALQFVNYPTQVLGKSCKPIPV 110 120 130 140 150 160 230 240 250 260 270 280 pF1KE6 MLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGPEPRSSPATTLSGLILLAGYIAFDS ::.: . ...: .:: . :: ::..:. . :. : .:: ...:. CCDS11 MLLGVTLLKKKYPLAKYLCVLLIVAGVALFMYKPKKVVGIEEHTVGYGELLLLLSLTLDG 170 180 190 200 210 220 290 300 310 320 330 340 pF1KE6 FTSNWQDALFA-YKMSSVQMMFGVNFFSCLFTVGSLLEQGALLEGTRFMGRHSEFAAHAL .:. :: . : :. .: .::...:..: :. ..: : : : : :. . . : CCDS11 LTGVSQDHMRAHYQTGSNHMMLNINLWSTLLLGMGILFTGELWEFLSFAERYPAIIYNIL 230 240 250 260 270 280 350 360 370 380 390 400 pF1KE6 LLSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFAILLSCLLYGHTVTVVGGLGVAVVF :... :: :: :::.:. :: . .:: : :. :.:: : .:... .. . .:...:: CCDS11 LFGLTSALGQSFIFMTVVYFGPLTCSIITTTRKFFTILASVILFANPISPMQWVGTVLVF 290 300 310 320 330 340 410 420 430 pF1KE6 AALLLRV-YARGRLKQRGKKAVPVESPVQKV .: : . ...: : CCDS11 LGLGLDAKFGKGAKKTSH 350 >>CCDS4508.1 SLC35B3 gene_id:51000|Hs108|chr6 (401 aa) initn: 161 init1: 83 opt: 253 Z-score: 333.0 bits: 70.6 E(32554): 3.2e-12 Smith-Waterman score: 283; 24.0% identity (59.4% similar) in 313 aa overlap (111-418:80-377) 90 100 110 120 130 140 pF1KE6 VFGNEPKASDEVPLAPRTEAAETTPMWQALKLLFCATGLQVSYLTWGVLQERVMTRSYGA ....:..:. : :: .: ::: ... CCDS45 SKTQTMSPHIKSVDDVVVLGMNLSKFNKLTQFFICVAGVFVFYLIYGYLQELIFS----- 50 60 70 80 90 100 150 160 170 180 190 200 pF1KE6 TATSPGERFTDSQFLVLMNRVLALIVAGLSCVLCKQPRHGAPMYRYSFASLSNVLSSWCQ : . . . .:.:.. .. : . . : .. :. : : . .. .: . . CCDS45 ---VEGFK-SCGWYLTLVQFAFYSIFGLIELQLIQDKRRRIPGKTYMIIAFLTVGTMGLS 110 120 130 140 150 160 210 220 230 240 250 260 pF1KE6 YEALKFVSFPTQVLAKASKVIPVMLMGKLVSRRSYEHWEYLTATLISIGVSMFLLSSGPE .: ....::::. : :.::::: : ... . :. . .: .:.:. : :. . CCDS45 NTSLGYLNYPTQVIFKCCKLIPVMLGGVFIQGKRYNVADVSAAICMSLGLIWFTLA---D 170 180 190 200 210 270 280 290 300 310 pF1KE6 PRSSPATTLSGLILLAGYIAFDSFTSNWQD-ALFAYKMSSVQMMFGVNFFSCLFTVGSLL ..: .:.:..:.. . :. .: :. :. .. :. .:.. .. .. . .: CCDS45 STTAPNFNLTGVVLISLALCADAVIGNVQEKAMKLHNASNSEMVLYSYSIGFVYILLGLT 220 230 240 250 260 270 320 330 340 350 360 370 pF1KE6 EQGALLEGTRFMGRHS-EFAAHALLLSICSACGQLFIFYTIGQFGAAVFTIIMTLRQAFA ..: .. : ... . ..:.:.:. . : :.. : ::: . . . : :.:.. CCDS45 CTSGLGPAVTFCAKNPVRTYGYAFLFSLTGYFGISFVLALIKIFGALIAVTVTTGRKAMT 280 290 300 310 320 330 380 390 400 410 420 430 pF1KE6 ILLSCLLYGHTVT---VVGGLGVAVVFAALLLRVYARGRLKQRGKKAVPVESPVQKV :.:: ..... : : .:: .: ...: ::... : : CCDS45 IVLSFIFFAKPFTFQYVWSGL---LVVLGIFLNVYSKNMDKIRLPSLYDLINKSVEARKS 340 350 360 370 380 390 CCDS45 RTLAQTV 400 432 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:19:42 2016 done: Tue Nov 8 12:19:43 2016 Total Scan time: 2.230 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]