FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2671, 333 aa 1>>>pF1KE2671 333 - 333 aa - 333 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4264+/-0.000902; mu= 14.7647+/- 0.054 mean_var=56.6122+/-11.399, 0's: 0 Z-trim(104.2): 47 B-trim: 139 in 2/47 Lambda= 0.170459 statistics sampled from 7729 (7776) to 7729 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.618), E-opt: 0.2 (0.239), width: 16 Scan time: 2.540 The best scores are: opt bits E(32554) CCDS58193.1 ST3GAL4 gene_id:6484|Hs108|chr11 ( 333) 2217 553.5 8.8e-158 CCDS58194.1 ST3GAL4 gene_id:6484|Hs108|chr11 ( 332) 2182 544.9 3.4e-155 CCDS8474.1 ST3GAL4 gene_id:6484|Hs108|chr11 ( 329) 2167 541.2 4.4e-154 CCDS2933.1 ST3GAL6 gene_id:10402|Hs108|chr3 ( 331) 700 180.4 1.8e-45 CCDS497.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 359) 700 180.4 1.9e-45 CCDS492.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 375) 700 180.4 2e-45 CCDS74968.1 ST3GAL6 gene_id:10402|Hs108|chr3 ( 384) 700 180.4 2e-45 CCDS494.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 390) 700 180.4 2e-45 CCDS498.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 413) 700 180.4 2.1e-45 CCDS496.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 429) 700 180.4 2.2e-45 CCDS493.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 444) 700 180.5 2.3e-45 CCDS57989.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 344) 680 175.5 5.5e-44 CCDS42705.1 ST3GAL5 gene_id:8869|Hs108|chr2 ( 395) 562 146.5 3.4e-35 CCDS1986.2 ST3GAL5 gene_id:8869|Hs108|chr2 ( 418) 562 146.5 3.6e-35 CCDS59452.1 ST3GAL6 gene_id:10402|Hs108|chr3 ( 213) 524 137.1 1.2e-32 CCDS57988.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 345) 455 120.2 2.5e-27 CCDS57990.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 261) 448 118.4 6.4e-27 CCDS495.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 277) 448 118.4 6.7e-27 CCDS6373.1 ST3GAL1 gene_id:6482|Hs108|chr8 ( 340) 309 84.3 1.6e-16 CCDS10890.1 ST3GAL2 gene_id:6483|Hs108|chr16 ( 350) 299 81.8 8.9e-16 CCDS2073.1 ST6GAL2 gene_id:84620|Hs108|chr2 ( 529) 268 74.2 2.6e-13 CCDS57994.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 185) 250 69.7 2.1e-12 CCDS4091.1 ST8SIA4 gene_id:7903|Hs108|chr5 ( 359) 251 70.0 3.3e-12 CCDS57991.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 186) 243 68.0 7e-12 CCDS57993.1 ST3GAL3 gene_id:6487|Hs108|chr1 ( 201) 243 68.0 7.5e-12 CCDS69668.1 ST6GALNAC6 gene_id:30815|Hs108|chr9 ( 299) 234 65.8 5e-11 CCDS6882.1 ST6GALNAC6 gene_id:30815|Hs108|chr9 ( 333) 234 65.8 5.5e-11 CCDS81919.1 ST8SIA2 gene_id:8128|Hs108|chr15 ( 354) 233 65.6 7e-11 CCDS10372.1 ST8SIA2 gene_id:8128|Hs108|chr15 ( 375) 233 65.6 7.3e-11 CCDS3285.1 ST6GAL1 gene_id:6480|Hs108|chr3 ( 406) 233 65.6 7.9e-11 >>CCDS58193.1 ST3GAL4 gene_id:6484|Hs108|chr11 (333 aa) initn: 2217 init1: 2217 opt: 2217 Z-score: 2946.2 bits: 553.5 E(32554): 8.8e-158 Smith-Waterman score: 2217; 100.0% identity (100.0% similar) in 333 aa overlap (1-333:1-333) 10 20 30 40 50 60 pF1KE2 MVSKSRWKLLAMLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MVSKSRWKLLAMLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 NYSRDQPIFLRLEDYFWVKTPSAYELPYGTKGSEDLLLRVLAITSSSIPKNIQSLRCRRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 NYSRDQPIFLRLEDYFWVKTPSAYELPYGTKGSEDLLLRVLAITSSSIPKNIQSLRCRRC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 VVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 VVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 NNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 NNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 ADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 ADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQIT 250 260 270 280 290 300 310 320 330 pF1KE2 LKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF ::::::::::::::::::::::::::::::::: CCDS58 LKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF 310 320 330 >>CCDS58194.1 ST3GAL4 gene_id:6484|Hs108|chr11 (332 aa) initn: 2182 init1: 2182 opt: 2182 Z-score: 2899.7 bits: 544.9 E(32554): 3.4e-155 Smith-Waterman score: 2182; 100.0% identity (100.0% similar) in 327 aa overlap (7-333:6-332) 10 20 30 40 50 60 pF1KE2 MVSKSRWKLLAMLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFG :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MCPAGWKLLAMLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFG 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 NYSRDQPIFLRLEDYFWVKTPSAYELPYGTKGSEDLLLRVLAITSSSIPKNIQSLRCRRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 NYSRDQPIFLRLEDYFWVKTPSAYELPYGTKGSEDLLLRVLAITSSSIPKNIQSLRCRRC 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 VVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 VVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVE 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE2 NNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 NNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIA 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE2 ADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 ADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQIT 240 250 260 270 280 290 310 320 330 pF1KE2 LKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF ::::::::::::::::::::::::::::::::: CCDS58 LKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF 300 310 320 330 >>CCDS8474.1 ST3GAL4 gene_id:6484|Hs108|chr11 (329 aa) initn: 2002 init1: 2002 opt: 2167 Z-score: 2879.8 bits: 541.2 E(32554): 4.4e-154 Smith-Waterman score: 2167; 98.5% identity (98.5% similar) in 333 aa overlap (1-333:1-329) 10 20 30 40 50 60 pF1KE2 MVSKSRWKLLAMLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFG ::::::::::::::::::::::::::::: :::::::::::::::::::::::::: CCDS84 MVSKSRWKLLAMLALVLVVMVWYSISREDS----FYFPIPEKKEPCLQGEAESKASKLFG 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 NYSRDQPIFLRLEDYFWVKTPSAYELPYGTKGSEDLLLRVLAITSSSIPKNIQSLRCRRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS84 NYSRDQPIFLRLEDYFWVKTPSAYELPYGTKGSEDLLLRVLAITSSSIPKNIQSLRCRRC 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE2 VVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS84 VVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVE 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE2 NNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS84 NNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIA 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE2 ADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS84 ADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQIT 240 250 260 270 280 290 310 320 330 pF1KE2 LKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF ::::::::::::::::::::::::::::::::: CCDS84 LKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF 300 310 320 >>CCDS2933.1 ST3GAL6 gene_id:10402|Hs108|chr3 (331 aa) initn: 364 init1: 364 opt: 700 Z-score: 930.0 bits: 180.4 E(32554): 1.8e-45 Smith-Waterman score: 700; 40.9% identity (68.9% similar) in 296 aa overlap (42-331:41-329) 20 30 40 50 60 70 pF1KE2 MLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLR : .:::. : :: : . . .: :: CCDS29 SAVFLYYVLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAF--ASLL--RFHQFHP-FLC 20 30 40 50 60 80 90 100 110 120 pF1KE2 LEDYFWVKT---PSAYELPYGTKGSEDLLLRVLA-ITSSSIPKNIQSLRCRRCVVVGNGH :. . . . ..:::: . : . . .:. . : .. ..... :..::::::: CCDS29 AADFRKIASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 RLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLL :.:..::. :..:::.::.::.:: :.: .:: .::.:::::::. :: ..:.:.: . CCDS29 VLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDP-IHNDPNTTV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 VLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSL .:.::: :..:. .: : .::::.: : .: :::::.::... :: .:: . CCDS29 ILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILDPFIIRTAAYELLHF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 PMQQPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAG : :.. : : ::::..:::::...: ::.::: : . . :. .::: . :.. : CCDS29 PKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKY-NFSDLKSPLHYYGNATMSLMNK 250 260 270 280 290 300 310 320 330 pF1KE2 SG-HNVSQEALAIKRMLEMGAIKNLTSF .. :::. : : .: ..: . . ::: CCDS29 NAYHNVTAEQLFLKDIIEKNLVINLTQD 310 320 330 >>CCDS497.1 ST3GAL3 gene_id:6487|Hs108|chr1 (359 aa) initn: 457 init1: 356 opt: 700 Z-score: 929.5 bits: 180.4 E(32554): 1.9e-45 Smith-Waterman score: 700; 39.8% identity (68.0% similar) in 294 aa overlap (46-332:70-357) 20 30 40 50 60 70 pF1KE2 VLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLRLEDY : : : . . .: .:. :.: :.: CCDS49 KYDRLGFLLNLDSKLPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMF--LDDS 40 50 60 70 80 90 80 90 100 110 120 130 pF1KE2 F--WVKTPSAYELPYGTKGSEDLLLRVLAITSS-SIPKNIQSLRCRRCVVVGNGHRLRNS : :.. . :.: ::...:. .:..:. . ..:::::::..:::: : :. CCDS49 FRKWARI-REFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANK 100 110 120 130 140 150 140 150 160 170 180 190 pF1KE2 SLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLLVLVAF :::. :. ::.:.:::.::: :.: :::::::.:. :::.: :. . . :.:.::..: CCDS49 SLGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPE-QYERDSLFVLAGF 160 170 180 190 200 210 200 210 220 230 240 250 pF1KE2 KAMDFHWIETILSDKKRVRK--GFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSLPMQ : .::.:.. :. :.:: ::::. .: .::::::.:.. :: :..::.. CCDS49 KWQDFKWLKYIVY-KERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFN 220 230 240 250 260 270 260 270 280 290 300 pF1KE2 QPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAGS-G . . . :: : .:.:.::: :: : .::::: : . . .:::: . . .. : CCDS49 NGLMGRGNIPTLGSVAVTMALHGCDEVAVAGFGY-DMSTPNAPLHYYETVRMAAIKESWT 280 290 300 310 320 330 310 320 330 pF1KE2 HNVSQEALAIKRMLEMGAIKNLTSF ::...: ...... .: .:.: CCDS49 HNIQREKEFLRKLVKARVITDLSSGI 340 350 >>CCDS492.1 ST3GAL3 gene_id:6487|Hs108|chr1 (375 aa) initn: 457 init1: 356 opt: 700 Z-score: 929.1 bits: 180.4 E(32554): 2e-45 Smith-Waterman score: 700; 39.8% identity (68.0% similar) in 294 aa overlap (46-332:86-373) 20 30 40 50 60 70 pF1KE2 VLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLRLEDY : : : . . .: .:. :.: :.: CCDS49 EYDRLGFLLNLDSKLPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMF--LDDS 60 70 80 90 100 110 80 90 100 110 120 130 pF1KE2 F--WVKTPSAYELPYGTKGSEDLLLRVLAITSS-SIPKNIQSLRCRRCVVVGNGHRLRNS : :.. . :.: ::...:. .:..:. . ..:::::::..:::: : :. CCDS49 FRKWARI-REFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANK 120 130 140 150 160 170 140 150 160 170 180 190 pF1KE2 SLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLLVLVAF :::. :. ::.:.:::.::: :.: :::::::.:. :::.: :. . . :.:.::..: CCDS49 SLGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPE-QYERDSLFVLAGF 180 190 200 210 220 230 200 210 220 230 240 250 pF1KE2 KAMDFHWIETILSDKKRVRK--GFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSLPMQ : .::.:.. :. :.:: ::::. .: .::::::.:.. :: :..::.. CCDS49 KWQDFKWLKYIVY-KERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFN 240 250 260 270 280 290 260 270 280 290 300 pF1KE2 QPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAGS-G . . . :: : .:.:.::: :: : .::::: : . . .:::: . . .. : CCDS49 NGLMGRGNIPTLGSVAVTMALHGCDEVAVAGFGY-DMSTPNAPLHYYETVRMAAIKESWT 300 310 320 330 340 310 320 330 pF1KE2 HNVSQEALAIKRMLEMGAIKNLTSF ::...: ...... .: .:.: CCDS49 HNIQREKEFLRKLVKARVITDLSSGI 350 360 370 >>CCDS74968.1 ST3GAL6 gene_id:10402|Hs108|chr3 (384 aa) initn: 364 init1: 364 opt: 700 Z-score: 929.0 bits: 180.4 E(32554): 2e-45 Smith-Waterman score: 700; 40.9% identity (68.9% similar) in 296 aa overlap (42-331:94-382) 20 30 40 50 60 70 pF1KE2 MLALVLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLR : .:::. : :: : . . .: :: CCDS74 SAVFLYYVLHCILWGTNVYWVAPVEMKRRNKIQPCLSKPAF--ASLL--RFHQFHP-FLC 70 80 90 100 110 80 90 100 110 120 pF1KE2 LEDYFWVKT---PSAYELPYGTKGSEDLLLRVLA-ITSSSIPKNIQSLRCRRCVVVGNGH :. . . . ..:::: . : . . .:. . : .. ..... :..::::::: CCDS74 AADFRKIASLYGSDKFDLPYGMRTSAEYFRLALSKLQSCDLFDEFDNIPCKKCVVVGNGG 120 130 140 150 160 170 130 140 150 160 170 180 pF1KE2 RLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLL :.:..::. :..:::.::.::.:: :.: .:: .::.:::::::. :: ..:.:.: . CCDS74 VLKNKTLGEKIDSYDVIIRMNNGPVLGHEEEVGRRTTFRLFYPESVFSDP-IHNDPNTTV 180 190 200 210 220 230 190 200 210 220 230 240 pF1KE2 VLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSL .:.::: :..:. .: : .::::.: : .: :::::.::... :: .:: . CCDS74 ILTAFKPHDLRWLLELLMGDKINTNGFWKKPALNLIYKPYQIRILDPFIIRTAAYELLHF 240 250 260 270 280 290 250 260 270 280 290 300 pF1KE2 PMQQPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAG : :.. : : ::::..:::::...: ::.::: : . . :. .::: . :.. : CCDS74 PKVFPKNQKPKHPTTGIIAITLAFYICHEVHLAGFKY-NFSDLKSPLHYYGNATMSLMNK 300 310 320 330 340 350 310 320 330 pF1KE2 SG-HNVSQEALAIKRMLEMGAIKNLTSF .. :::. : : .: ..: . . ::: CCDS74 NAYHNVTAEQLFLKDIIEKNLVINLTQD 360 370 380 >>CCDS494.1 ST3GAL3 gene_id:6487|Hs108|chr1 (390 aa) initn: 457 init1: 356 opt: 700 Z-score: 928.9 bits: 180.4 E(32554): 2e-45 Smith-Waterman score: 700; 39.8% identity (68.0% similar) in 294 aa overlap (46-332:101-388) 20 30 40 50 60 70 pF1KE2 VLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLRLEDY : : : . . .: .:. :.: :.: CCDS49 EYDRLGFLLNLDSKLPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMF--LDDS 80 90 100 110 120 80 90 100 110 120 130 pF1KE2 F--WVKTPSAYELPYGTKGSEDLLLRVLAITSS-SIPKNIQSLRCRRCVVVGNGHRLRNS : :.. . :.: ::...:. .:..:. . ..:::::::..:::: : :. CCDS49 FRKWARI-REFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANK 130 140 150 160 170 180 140 150 160 170 180 190 pF1KE2 SLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLLVLVAF :::. :. ::.:.:::.::: :.: :::::::.:. :::.: :. . . :.:.::..: CCDS49 SLGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPE-QYERDSLFVLAGF 190 200 210 220 230 240 200 210 220 230 240 250 pF1KE2 KAMDFHWIETILSDKKRVRK--GFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSLPMQ : .::.:.. :. :.:: ::::. .: .::::::.:.. :: :..::.. CCDS49 KWQDFKWLKYIVY-KERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFN 250 260 270 280 290 300 260 270 280 290 300 pF1KE2 QPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAGS-G . . . :: : .:.:.::: :: : .::::: : . . .:::: . . .. : CCDS49 NGLMGRGNIPTLGSVAVTMALHGCDEVAVAGFGY-DMSTPNAPLHYYETVRMAAIKESWT 310 320 330 340 350 360 310 320 330 pF1KE2 HNVSQEALAIKRMLEMGAIKNLTSF ::...: ...... .: .:.: CCDS49 HNIQREKEFLRKLVKARVITDLSSGI 370 380 390 >>CCDS498.1 ST3GAL3 gene_id:6487|Hs108|chr1 (413 aa) initn: 457 init1: 356 opt: 700 Z-score: 928.4 bits: 180.4 E(32554): 2.1e-45 Smith-Waterman score: 700; 39.8% identity (68.0% similar) in 294 aa overlap (46-332:124-411) 20 30 40 50 60 70 pF1KE2 VLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLRLEDY : : : . . .: .:. :.: :.: CCDS49 SEDTAFALGFLKLPRPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMF--LDDS 100 110 120 130 140 150 80 90 100 110 120 130 pF1KE2 F--WVKTPSAYELPYGTKGSEDLLLRVLAITSS-SIPKNIQSLRCRRCVVVGNGHRLRNS : :.. . :.: ::...:. .:..:. . ..:::::::..:::: : :. CCDS49 FRKWARI-REFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANK 160 170 180 190 200 210 140 150 160 170 180 190 pF1KE2 SLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLLVLVAF :::. :. ::.:.:::.::: :.: :::::::.:. :::.: :. . . :.:.::..: CCDS49 SLGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPE-QYERDSLFVLAGF 220 230 240 250 260 200 210 220 230 240 250 pF1KE2 KAMDFHWIETILSDKKRVRK--GFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSLPMQ : .::.:.. :. :.:: ::::. .: .::::::.:.. :: :..::.. CCDS49 KWQDFKWLKYIVY-KERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFN 270 280 290 300 310 320 260 270 280 290 300 pF1KE2 QPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAGS-G . . . :: : .:.:.::: :: : .::::: : . . .:::: . . .. : CCDS49 NGLMGRGNIPTLGSVAVTMALHGCDEVAVAGFGY-DMSTPNAPLHYYETVRMAAIKESWT 330 340 350 360 370 380 310 320 330 pF1KE2 HNVSQEALAIKRMLEMGAIKNLTSF ::...: ...... .: .:.: CCDS49 HNIQREKEFLRKLVKARVITDLSSGI 390 400 410 >>CCDS496.1 ST3GAL3 gene_id:6487|Hs108|chr1 (429 aa) initn: 457 init1: 356 opt: 700 Z-score: 928.2 bits: 180.4 E(32554): 2.2e-45 Smith-Waterman score: 700; 39.8% identity (68.0% similar) in 294 aa overlap (46-332:140-427) 20 30 40 50 60 70 pF1KE2 VLVVMVWYSISREDRYIELFYFPIPEKKEPCLQGEAESKASKLFGNYSRDQPIFLRLEDY : : : . . .: .:. :.: :.: CCDS49 SEDTAFALGFLKLPRPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMF--LDDS 110 120 130 140 150 160 80 90 100 110 120 130 pF1KE2 F--WVKTPSAYELPYGTKGSEDLLLRVLAITSS-SIPKNIQSLRCRRCVVVGNGHRLRNS : :.. . :.: ::...:. .:..:. . ..:::::::..:::: : :. CCDS49 FRKWARI-REFVPPFGIKGQDNLIKAILSVTKEYRLTPALDSLRCRRCIIVGNGGVLANK 170 180 190 200 210 220 140 150 160 170 180 190 pF1KE2 SLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVENNPDTLLVLVAF :::. :. ::.:.:::.::: :.: :::::::.:. :::.: :. . . :.:.::..: CCDS49 SLGSRIDDYDIVVRLNSAPVKGFEKDVGSKTTLRITYPEGAMQRPE-QYERDSLFVLAGF 230 240 250 260 270 280 200 210 220 230 240 250 pF1KE2 KAMDFHWIETILSDKKRVRK--GFWKQPPLIWDVNPKQIRILNPFFMEIAADKLLSLPMQ : .::.:.. :. :.:: ::::. .: .::::::.:.. :: :..::.. CCDS49 KWQDFKWLKYIVY-KERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFN 290 300 310 320 330 340 260 270 280 290 300 pF1KE2 QPRKIKQK-PTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITLKSMAGS-G . . . :: : .:.:.::: :: : .::::: : . . .:::: . . .. : CCDS49 NGLMGRGNIPTLGSVAVTMALHGCDEVAVAGFGY-DMSTPNAPLHYYETVRMAAIKESWT 350 360 370 380 390 400 310 320 330 pF1KE2 HNVSQEALAIKRMLEMGAIKNLTSF ::...: ...... .: .:.: CCDS49 HNIQREKEFLRKLVKARVITDLSSGI 410 420 333 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Feb 2 14:37:25 2017 done: Thu Feb 2 14:37:25 2017 Total Scan time: 2.540 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]