FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3884, 331 aa
  1>>>pF1KE3884 331 - 331 aa - 331 aa
Library: human.CCDS.faa
  18921897 residues in 33420 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2997+/-0.000983; mu= 15.4879+/- 0.059
 mean_var=60.4009+/-12.002, 0's: 0 Z-trim(103.4): 55 B-trim: 429 in 1/47
 Lambda= 0.165026
 statistics sampled from 7438 (7494) to 7438 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.224), width: 16
 Scan time: 1.160

The best scores are:                                  opt  bits E(33420)
CCDS2933.1 ST3GAL6 gene_id:10402|Hs109|chr3   ( 331) 2256 545.8 1.8e-155
CCDS74968.1 ST3GAL6 gene_id:10402|Hs109|chr3  ( 384) 2256 545.9 2e-155
CCDS59452.1 ST3GAL6 gene_id:10402|Hs109|chr3  ( 213) 1278 312.9 1.5e-85
CCDS8474.1 ST3GAL4 gene_id:6484|Hs109|chr11   ( 329)  706 176.8 2.2e-44
CCDS58194.1 ST3GAL4 gene_id:6484|Hs109|chr11  ( 332)  700 175.4 5.9e-44
CCDS58193.1 ST3GAL4 gene_id:6484|Hs109|chr11  ( 333)  700 175.4 5.9e-44
CCDS497.1 ST3GAL3 gene_id:6487|Hs109|chr1     ( 359)  667 167.5 1.5e-41
CCDS492.1 ST3GAL3 gene_id:6487|Hs109|chr1     ( 375)  667 167.5 1.5e-41
CCDS494.1 ST3GAL3 gene_id:6487|Hs109|chr1     ( 390)  667 167.6 1.6e-41
CCDS498.1 ST3GAL3 gene_id:6487|Hs109|chr1     ( 413)  667 167.6 1.7e-41
CCDS496.1 ST3GAL3 gene_id:6487|Hs109|chr1     ( 429)  667 167.6 1.7e-41
CCDS493.1 ST3GAL3 gene_id:6487|Hs109|chr1     ( 444)  667 167.6 1.8e-41
CCDS57989.1 ST3GAL3 gene_id:6487|Hs109|chr1   ( 344)  655 164.7 1e-40
CCDS85962.1 ST3GAL3 gene_id:6487|Hs109|chr1   ( 434)  619 156.1 4.8e-38
CCDS85965.1 ST3GAL3 gene_id:6487|Hs109|chr1   ( 449)  619 156.2 4.9e-38
CCDS86856.1 ST3GAL5 gene_id:8869|Hs109|chr2   ( 390)  585 148.0 1.2e-35
CCDS42705.1 ST3GAL5 gene_id:8869|Hs109|chr2   ( 395)  585 148.0 1.2e-35
CCDS1986.2 ST3GAL5 gene_id:8869|Hs109|chr2    ( 418)  585 148.0 1.3e-35
CCDS57990.1 ST3GAL3 gene_id:6487|Hs109|chr1   ( 261)  389 101.3 9.3e-22
CCDS495.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 277) 389 101.3 9.8e-22 CCDS85961.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 331) 389 101.3 1.1e-21 CCDS57988.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 345) 388 101.1 1.4e-21 CCDS85964.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 230) 371 97.0 1.6e-20 CCDS86857.1 ST3GAL5 gene_id:8869|Hs109|chr2 ( 365) 316 84.0 2.1e-16 CCDS2073.1 ST6GAL2 gene_id:84620|Hs109|chr2 ( 529) 290 77.9 2.1e-14 CCDS6373.1 ST3GAL1 gene_id:6482|Hs109|chr8 ( 340) 261 70.9 1.7e-12 CCDS77183.1 ST8SIA5 gene_id:29906|Hs109|chr18 ( 345) 254 69.2 5.6e-12 CCDS10890.1 ST3GAL2 gene_id:6483|Hs109|chr16 ( 350) 245 67.1 2.5e-11 >>CCDS2933.1 ST3GAL6 gene_id:10402|Hs109|chr3 (331 aa) initn: 2256 init1: 2256 opt: 2256 Z-score: 2905.0 bits: 545.8 E(33420): 1.8e-155 Smith-Waterman score: 2256; 100.0% identity (100.0% similar) in 331 aa overlap (1-331:1-331) 10 20 30 40 50 60 pF1KE3 MRGYLVAIFLSAVFLYYVLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAFASLLRFHQF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 MRGYLVAIFLSAVFLYYVLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAFASLLRFHQF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 HPFLCAADFRKIASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 HPFLCAADFRKIASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 VGNGGVLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDPIHNDP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 VGNGGVLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDPIHNDP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 NTTVILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILDPFIIRTAAYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 NTTVILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILDPFIIRTAAYE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 LLHFPKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKYNFSDLKSPLHYYGNATMSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 
LLHFPKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKYNFSDLKSPLHYYGNATMSL 250 260 270 280 290 300 310 320 330 pF1KE3 MNKNAYHNVTAEQLFLKDIIEKNLVINLTQD ::::::::::::::::::::::::::::::: CCDS29 MNKNAYHNVTAEQLFLKDIIEKNLVINLTQD 310 320 330 >>CCDS74968.1 ST3GAL6 gene_id:10402|Hs109|chr3 (384 aa) initn: 2256 init1: 2256 opt: 2256 Z-score: 2904.0 bits: 545.9 E(33420): 2e-155 Smith-Waterman score: 2256; 100.0% identity (100.0% similar) in 331 aa overlap (1-331:54-384) 10 20 30 pF1KE3 MRGYLVAIFLSAVFLYYVLHCILWGTNVYW :::::::::::::::::::::::::::::: CCDS74 APLRSSLLGLGGSLLPAGFAAGLHCPGEPAMRGYLVAIFLSAVFLYYVLHCILWGTNVYW 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE3 VAPVEMKRRNKIQPCLSKPAFASLLRFHQFHPFLCAADFRKIASLYGSDKFDLPYGMRTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 VAPVEMKRRNKIQPCLSKPAFASLLRFHQFHPFLCAADFRKIASLYGSDKFDLPYGMRTS 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE3 AEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGGVLKNKTLGEKIDSYDVIIRMNNGPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 AEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGGVLKNKTLGEKIDSYDVIIRMNNGPV 150 160 170 180 190 200 160 170 180 190 200 210 pF1KE3 LGHEEEVGRRTTFRLFYPESVFSDPIHNDPNTTVILTAFKPHDLRWLLELLMGDKINTNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 LGHEEEVGRRTTFRLFYPESVFSDPIHNDPNTTVILTAFKPHDLRWLLELLMGDKINTNG 210 220 230 240 250 260 220 230 240 250 260 270 pF1KE3 FWKKPALNLIYKPYQIRILDPFIIRTAAYELLHFPKVFPKNQKPKHPTTGIIAITLAFYI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 FWKKPALNLIYKPYQIRILDPFIIRTAAYELLHFPKVFPKNQKPKHPTTGIIAITLAFYI 270 280 290 300 310 320 280 290 300 310 320 330 pF1KE3 CHEVHLAGFKYNFSDLKSPLHYYGNATMSLMNKNAYHNVTAEQLFLKDIIEKNLVINLTQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 CHEVHLAGFKYNFSDLKSPLHYYGNATMSLMNKNAYHNVTAEQLFLKDIIEKNLVINLTQ 330 340 350 360 370 380 pF1KE3 D : CCDS74 D >>CCDS59452.1 ST3GAL6 gene_id:10402|Hs109|chr3 (213 aa) initn: 1268 init1: 1268 opt: 1278 Z-score: 1649.6 bits: 312.9 E(33420): 1.5e-85 
Smith-Waterman score: 1359; 86.5% identity (86.9% similar) in 245 aa overlap (87-331:1-213) 60 70 80 90 100 110 pF1KE3 FHQFHPFLCAADFRKIASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCK ::::::::::::::::::::::::: CCDS59 MRTSAEYFRLALSKLQSCDLFDEFD----- 10 20 120 130 140 150 160 170 pF1KE3 KCVVVGNGGVLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDPI .:::::::::::::::::::::::::::::::: CCDS59 ---------------------------KMNNGPVLGHEEEVGRRTTFRLFYPESVFSDPI 30 40 50 180 190 200 210 220 230 pF1KE3 HNDPNTTVILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILDPFIIRT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 HNDPNTTVILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILDPFIIRT 60 70 80 90 100 110 240 250 260 270 280 290 pF1KE3 AAYELLHFPKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKYNFSDLKSPLHYYGNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 AAYELLHFPKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKYNFSDLKSPLHYYGNA 120 130 140 150 160 170 300 310 320 330 pF1KE3 TMSLMNKNAYHNVTAEQLFLKDIIEKNLVINLTQD ::::::::::::::::::::::::::::::::::: CCDS59 TMSLMNKNAYHNVTAEQLFLKDIIEKNLVINLTQD 180 190 200 210 >>CCDS8474.1 ST3GAL4 gene_id:6484|Hs109|chr11 (329 aa) initn: 364 init1: 364 opt: 706 Z-score: 910.6 bits: 176.8 E(33420): 2.2e-44 Smith-Waterman score: 706; 39.1% identity (66.6% similar) in 335 aa overlap (2-329:6-327) 10 20 30 40 50 pF1KE3 MRGYLVAIFLSAVFLYYVLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAF--ASL : :.:. :. :.. .: . : . :. :. :. .:::. : :: CCDS84 MVSKSRWKLLAM-LALVLVVMVWYSISREDSFYF--PIPEKK----EPCLQGEAESKASK 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 L--RFHQFHP-FLCAADFRKIASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFD : . . .: :: :. . . . ..:::: . : : . : . . : .. ... CCDS84 LFGNYSRDQPIFLRLEDYFWVKT---PSAYELPYGTKGS-EDLLLRVLAITSSSIPKNIQ 60 70 80 90 100 120 130 140 150 160 170 pF1KE3 NIPCKKCVVVGNGGVLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESV .. :..::::::: :.:..::. :..:::.::.::.:: :.: .:: .::.:::::::. 
CCDS84 SLRCRRCVVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESA 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 FSDP-IHNDPNTTVILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILD :: ..:.:.: ..:.::: :..:. .: : .::::.: : .: :::::. CCDS84 HFDPKVENNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILN 170 180 190 200 210 220 240 250 260 270 280 pF1KE3 PFIIRTAAYELLHFPKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKY-NFSDLKSP ::... :: .:: .: :.. : : ::::..:::::...: ::.::: : . . :. CCDS84 PFFMEIAADKLLSLPMQQPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQT 230 240 250 260 270 280 290 300 310 320 330 pF1KE3 LHYYGNATMSLMNKNAYHNVTAEQLFLKDIIEKNLVINLTQD .::: . :.. : .. :::. : : .: ..: . . ::: CCDS84 IHYYEQITLKSMAGSG-HNVSQEALAIKRMLEMGAIKNLTSF 290 300 310 320 >>CCDS58194.1 ST3GAL4 gene_id:6484|Hs109|chr11 (332 aa) initn: 364 init1: 364 opt: 700 Z-score: 902.8 bits: 175.4 E(33420): 5.9e-44 Smith-Waterman score: 700; 40.9% identity (68.6% similar) in 296 aa overlap (41-329:41-330) 20 30 40 50 60 pF1KE3 SAVFLYYVLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAF--ASLL--RFHQFHP-FLC : .:::. : :: : . . .: :: CCDS58 MLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLR 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE3 AADFRKIASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGG :. . . . ..:::: . : . . : . . : .. ..... :..::::::: CCDS58 LEDYFWVKT---PSAYELPYGTKGSEDLL-LRVLAITSSSIPKNIQSLRCRRCVVVGNGH 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 VLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDP-IHNDPNTTV :.:..::. :..:::.::.::.:: :.: .:: .::.:::::::. :: ..:.:.: . CCDS58 RLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 ILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILDPFIIRTAAYELLHF .:.::: :..:. .: : .::::.: : .: :::::.::... :: .:: . CCDS58 VLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 PKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKY-NFSDLKSPLHYYGNATMSLMNK : :.. : : ::::..:::::...: ::.::: : . . :. 
.::: . :.. : CCDS58 PMQQPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAG 250 260 270 280 290 300 310 320 330 pF1KE3 NAYHNVTAEQLFLKDIIEKNLVINLTQD .. :::. : : .: ..: . . ::: CCDS58 SG-HNVSQEALAIKRMLEMGAIKNLTSF 310 320 330 >>CCDS58193.1 ST3GAL4 gene_id:6484|Hs109|chr11 (333 aa) initn: 364 init1: 364 opt: 700 Z-score: 902.8 bits: 175.4 E(33420): 5.9e-44 Smith-Waterman score: 700; 40.9% identity (68.6% similar) in 296 aa overlap (41-329:42-331) 20 30 40 50 60 pF1KE3 SAVFLYYVLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAF--ASLL--RFHQFHP-FLC : .:::. : :: : . . .: :: CCDS58 MLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLR 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE3 AADFRKIASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGG :. . . . ..:::: . : . . : . . : .. ..... :..::::::: CCDS58 LEDYFWVKT---PSAYELPYGTKGSEDLL-LRVLAITSSSIPKNIQSLRCRRCVVVGNGH 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 VLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDP-IHNDPNTTV :.:..::. :..:::.::.::.:: :.: .:: .::.:::::::. :: ..:.:.: . CCDS58 RLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 ILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILDPFIIRTAAYELLHF .:.::: :..:. .: : .::::.: : .: :::::.::... :: .:: . CCDS58 VLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 PKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKY-NFSDLKSPLHYYGNATMSLMNK : :.. : : ::::..:::::...: ::.::: : . . :. .::: . :.. : CCDS58 PMQQPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAG 250 260 270 280 290 300 310 320 330 pF1KE3 NAYHNVTAEQLFLKDIIEKNLVINLTQD .. :::. : : .: ..: . . 
::: CCDS58 SG-HNVSQEALAIKRMLEMGAIKNLTSF 310 320 330 >>CCDS497.1 ST3GAL3 gene_id:6487|Hs109|chr1 (359 aa) initn: 671 init1: 332 opt: 667 Z-score: 859.8 bits: 167.5 E(33420): 1.5e-41 Smith-Waterman score: 678; 32.6% identity (64.3% similar) in 353 aa overlap (1-329:7-356) 10 20 30 pF1KE3 MRGYLVAI--FLSAVFLYYV---LHCILW---------G--TNVYWVAPVEMKR .:. :.:. :: :::: :: . : : :. :.:. CCDS49 MGLLVFVRNLLLALCLFLVLGFLYYSAWKLHLLQWEEDSKYDRLGFLLNLDSKLPAELAT 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE3 R-NKIQPCLSKPAFASLL------RFHQFHPFLCAADFRKIASLYGSDKFDLPYGMRTSA . ... ::..:: : :: . :.. .::: : . .: :.:.. . CCDS49 KYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRKWARIR---EFVPPFGIKGQD 70 80 90 100 110 100 110 120 130 140 150 pF1KE3 EYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGGVLKNKTLGEKIDSYDVIIRMNNGPVL . .. :: . : .:.. :..:..::::::: ::.:: .::.::...:.:..:: CCDS49 NLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANKSLGSRIDDYDIVVRLNSAPVK 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE3 GHEEEVGRRTTFRLFYPESVFSDPIHNDPNTTVILTAFKPHDLRWLLELLMGDKIN-TNG : :..:: .::.:. :::.... : . . .. .:..:: .:..:: ... .... ..: CCDS49 GFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKWQDFKWLKYIVYKERVSASDG 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE3 FWKKPALNLIYKPYQIRILDPFIIRTAAYELLHFPKVFPKNQKPKHPTTGIIAITLAFYI :::. : . .: .::::.:..:. ::. :. .: . . :: : .:.:.:.. CCDS49 FWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGLMGRGNIPTLGSVAVTMALHG 240 250 260 270 280 290 280 290 300 310 320 330 pF1KE3 CHEVHLAGFKYNFSDLKSPLHYYGNATMSLMNKNAYHNVTAEQLFLKDIIEKNLVINLTQ : :: .::: :..: ..::::: .. :. .... ::. :. ::. ... .. .:. CCDS49 CDEVAVAGFGYDMSTPNAPLHYYETVRMAAIKESWTHNIQREKEFLRKLVKARVITDLSS 300 310 320 330 340 350 pF1KE3 D CCDS49 GI >>CCDS492.1 ST3GAL3 gene_id:6487|Hs109|chr1 (375 aa) initn: 671 init1: 332 opt: 667 Z-score: 859.5 bits: 167.5 E(33420): 1.5e-41 Smith-Waterman score: 667; 34.3% identity (68.9% similar) in 289 aa overlap (48-329:87-372) 20 30 40 50 60 70 pF1KE3 VLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAFASLL------RFHQFHPFLCAADFRK ::..:: : :: . :.. 
.::: CCDS49 YDRLGFLLNLDSKLPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRK 60 70 80 90 100 110 80 90 100 110 120 130 pF1KE3 IASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGGVLKNKT : . .: :.:.. . . .. :: . : .:.. :..:..::::::: ::. CCDS49 WARIR---EFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANKS 120 130 140 150 160 170 140 150 160 170 180 190 pF1KE3 LGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDPIHNDPNTTVILTAFKP :: .::.::...:.:..:: : :..:: .::.:. :::.... : . . .. .:..:: CCDS49 LGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKW 180 190 200 210 220 230 200 210 220 230 240 250 pF1KE3 HDLRWLLELLMGDKIN-TNGFWKKPALNLIYKPYQIRILDPFIIRTAAYELLHFPKVFPK .:..:: ... .... ..::::. : . .: .::::.:..:. ::. :. .: CCDS49 QDFKWLKYIVYKERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGL 240 250 260 270 280 290 260 270 280 290 300 310 pF1KE3 NQKPKHPTTGIIAITLAFYICHEVHLAGFKYNFSDLKSPLHYYGNATMSLMNKNAYHNVT . . :: : .:.:.:.. : :: .::: :..: ..::::: .. :. .... ::. CCDS49 MGRGNIPTLGSVAVTMALHGCDEVAVAGFGYDMSTPNAPLHYYETVRMAAIKESWTHNIQ 300 310 320 330 340 350 320 330 pF1KE3 AEQLFLKDIIEKNLVINLTQD :. ::. ... .. .:. CCDS49 REKEFLRKLVKARVITDLSSGI 360 370 >>CCDS494.1 ST3GAL3 gene_id:6487|Hs109|chr1 (390 aa) initn: 671 init1: 332 opt: 667 Z-score: 859.3 bits: 167.6 E(33420): 1.6e-41 Smith-Waterman score: 667; 34.3% identity (68.9% similar) in 289 aa overlap (48-329:102-387) 20 30 40 50 60 70 pF1KE3 VLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAFASLL------RFHQFHPFLCAADFRK ::..:: : :: . :.. .::: CCDS49 YDRLGFLLNLDSKLPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRK 80 90 100 110 120 130 80 90 100 110 120 130 pF1KE3 IASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGGVLKNKT : . .: :.:.. . . .. :: . : .:.. :..:..::::::: ::. CCDS49 WARIR---EFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANKS 140 150 160 170 180 140 150 160 170 180 190 pF1KE3 LGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDPIHNDPNTTVILTAFKP :: .::.::...:.:..:: : :..:: .::.:. :::.... : . . .. 
.:..:: CCDS49 LGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKW 190 200 210 220 230 240 200 210 220 230 240 250 pF1KE3 HDLRWLLELLMGDKIN-TNGFWKKPALNLIYKPYQIRILDPFIIRTAAYELLHFPKVFPK .:..:: ... .... ..::::. : . .: .::::.:..:. ::. :. .: CCDS49 QDFKWLKYIVYKERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGL 250 260 270 280 290 300 260 270 280 290 300 310 pF1KE3 NQKPKHPTTGIIAITLAFYICHEVHLAGFKYNFSDLKSPLHYYGNATMSLMNKNAYHNVT . . :: : .:.:.:.. : :: .::: :..: ..::::: .. :. .... ::. CCDS49 MGRGNIPTLGSVAVTMALHGCDEVAVAGFGYDMSTPNAPLHYYETVRMAAIKESWTHNIQ 310 320 330 340 350 360 320 330 pF1KE3 AEQLFLKDIIEKNLVINLTQD :. ::. ... .. .:. CCDS49 REKEFLRKLVKARVITDLSSGI 370 380 390 >>CCDS498.1 ST3GAL3 gene_id:6487|Hs109|chr1 (413 aa) initn: 651 init1: 332 opt: 667 Z-score: 858.9 bits: 167.6 E(33420): 1.7e-41 Smith-Waterman score: 667; 34.3% identity (68.9% similar) in 289 aa overlap (48-329:125-410) 20 30 40 50 60 70 pF1KE3 VLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAFASLL------RFHQFHPFLCAADFRK ::..:: : :: . :.. .::: CCDS49 EDTAFALGFLKLPRPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRK 100 110 120 130 140 150 80 90 100 110 120 130 pF1KE3 IASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGGVLKNKT : . .: :.:.. . . .. :: . : .:.. :..:..::::::: ::. CCDS49 WARIR---EFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANKS 160 170 180 190 200 210 140 150 160 170 180 190 pF1KE3 LGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDPIHNDPNTTVILTAFKP :: .::.::...:.:..:: : :..:: .::.:. :::.... : . . .. .:..:: CCDS49 LGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKW 220 230 240 250 260 270 200 210 220 230 240 250 pF1KE3 HDLRWLLELLMGDKIN-TNGFWKKPALNLIYKPYQIRILDPFIIRTAAYELLHFPKVFPK .:..:: ... .... ..::::. : . .: .::::.:..:. ::. :. .: CCDS49 QDFKWLKYIVYKERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGL 280 290 300 310 320 330 260 270 280 290 300 310 pF1KE3 NQKPKHPTTGIIAITLAFYICHEVHLAGFKYNFSDLKSPLHYYGNATMSLMNKNAYHNVT . . :: : .:.:.:.. : :: .::: :..: ..::::: .. :. .... ::. 
CCDS49 MGRGNIPTLGSVAVTMALHGCDEVAVAGFGYDMSTPNAPLHYYETVRMAAIKESWTHNIQ 340 350 360 370 380 390 320 330 pF1KE3 AEQLFLKDIIEKNLVINLTQD :. ::. ... .. .:. CCDS49 REKEFLRKLVKARVITDLSSGI 400 410

331 residues in 1 query sequences
18921897 residues in 33420 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Aug 4 20:39:21 2021 done: Wed Aug 4 20:39:21 2021
 Total Scan time: 1.160 Total Display time: 0.070

Function used was FASTA [36.3.4 Apr, 2011]