FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2765, 340 aa 1>>>pF1KE2765 340 - 340 aa - 340 aa Library: human.CCDS.faa 18921897 residues in 33420 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2631+/-0.000922; mu= 16.0938+/- 0.055 mean_var=60.9997+/-12.181, 0's: 0 Z-trim(104.0): 53 B-trim: 4 in 1/50 Lambda= 0.164214 statistics sampled from 7716 (7769) to 7716 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.602), E-opt: 0.2 (0.232), width: 16 Scan time: 1.110 The best scores are: opt bits E(33420) CCDS6373.1 ST3GAL1 gene_id:6482|Hs109|chr8 ( 340) 2313 556.7 1e-158 CCDS10890.1 ST3GAL2 gene_id:6483|Hs109|chr16 ( 350) 1178 287.8 9.4e-78 CCDS58194.1 ST3GAL4 gene_id:6484|Hs109|chr11 ( 332) 309 81.9 8.5e-16 CCDS58193.1 ST3GAL4 gene_id:6484|Hs109|chr11 ( 333) 309 81.9 8.5e-16 CCDS8474.1 ST3GAL4 gene_id:6484|Hs109|chr11 ( 329) 306 81.2 1.4e-15 CCDS497.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 359) 306 81.2 1.5e-15 CCDS492.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 375) 306 81.2 1.5e-15 CCDS494.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 390) 306 81.2 1.6e-15 CCDS498.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 413) 306 81.2 1.7e-15 CCDS496.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 429) 306 81.2 1.7e-15 CCDS85962.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 434) 306 81.2 1.8e-15 CCDS493.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 444) 306 81.2 1.8e-15 CCDS85965.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 449) 306 81.2 1.8e-15 CCDS57989.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 344) 304 80.7 2e-15 CCDS57990.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 261) 293 78.0 9.6e-15 CCDS495.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 277) 293 78.1 1e-14 CCDS85961.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 331) 293 78.1 1.2e-14 CCDS85964.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 230) 291 77.5 1.2e-14 CCDS57988.1 ST3GAL3 gene_id:6487|Hs109|chr1 ( 345) 289 77.1 2.3e-14 CCDS2933.1 ST3GAL6 gene_id:10402|Hs109|chr3 ( 331) 261 70.5 2.2e-12 CCDS74968.1 ST3GAL6 gene_id:10402|Hs109|chr3 ( 384) 261 70.5 2.6e-12 >>CCDS6373.1 ST3GAL1 gene_id:6482|Hs109|chr8 (340 aa) initn: 2313 init1: 2313 opt: 2313 Z-score: 2963.0 bits: 556.7 E(33420): 1e-158 Smith-Waterman score: 2313; 100.0% identity (100.0% similar) in 340 aa overlap (1-340:1-340) 10 20 30 40 50 60 pF1KE2 MVTLRKRTLKVLTFLVLFIFLTSFFLNYSHTMVATTWFPKQMVLELSENLKRLIKHRPCT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 MVTLRKRTLKVLTFLVLFIFLTSFFLNYSHTMVATTWFPKQMVLELSENLKRLIKHRPCT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 CTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNLNDTIKEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 CTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNLNDTIKEL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 FRVVPGNVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRMNKAPTAGFEAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 FRVVPGNVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRMNKAPTAGFEAD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 VGTKTTHHLVYPESFRELGDNVSMILVPFKTIDLEWVVSAITTGTISHTYIPVPAKIRVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 VGTKTTHHLVYPESFRELGDNVSMILVPFKTIDLEWVVSAITTGTISHTYIPVPAKIRVK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 QDKILIYHPAFIKYVFDNWLQGHGRYPSTGILSVIFSMHVCDEVDLYGFGADSKGNWHHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 QDKILIYHPAFIKYVFDNWLQGHGRYPSTGILSVIFSMHVCDEVDLYGFGADSKGNWHHY 250 260 270 280 290 300 310 320 330 340 pF1KE2 WENNPSAGAFRKTGVHDADFESNVTATLASINKIRIFKGR :::::::::::::::::::::::::::::::::::::::: CCDS63 WENNPSAGAFRKTGVHDADFESNVTATLASINKIRIFKGR 310 320 330 340 >>CCDS10890.1 ST3GAL2 gene_id:6483|Hs109|chr16 (350 aa) initn: 1170 init1: 958 opt: 1178 Z-score: 1509.6 bits: 287.8 E(33420): 9.4e-78 Smith-Waterman score: 1209; 49.0% identity (76.2% similar) in 349 aa overlap (6-339:2-349) 10 20 30 40 pF1KE2 MVTLRKRTLKV--LTFLVLFIFLTSFFLNYSHTMVATTWFPKQMVLELSE---------N : .:.: :. :..:. :....::: .:: . . .:. .. . CCDS10 MKCSLRVWFLSVAFLLVFIMSLLFTYSHHSMATLPYLDSGALDGTHRVKLVPGYAG 10 20 30 40 50 50 60 70 80 90 100 pF1KE2 LKRLIKHR----PCTCTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQ :.:: :.: :.: .:.:. : ::: .:. ...:. : .: : :. :::. :: CCDS10 LQRLSKERLSGKSCACRRCMGDAGASDWFDSHFDGNISPVWTRENMDLPPDVQRWWMMLQ 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 REKKPNNLNDTIKELFRVVPGNVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEIDSHDF . : .: :.....::..:::. .:. . :::::::::::::: :.:: ..:.:.: CCDS10 PQFKSHNTNEVLEKLFQIVPGE-NPYRFRDPHQCRRCAVVGNSGNLRGSGYGQDVDGHNF 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 VLRMNKAPTAGFEADVGTKTTHHLVYPESFRELGDNVSMILVPFKTIDLEWVVSAITTGT ..:::.:::.::: :::..::::..:::: ..: :::..:::::..:: :..::..:: CCDS10 IMRMNQAPTVGFEQDVGSRTTHHFMYPESAKNLPANVSFVLVPFKVLDLLWIASALSTGQ 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE2 ISHTYIPVPAKIRVKQDKILIYHPAFIKYVFDNWLQGHGRYPSTGILSVIFSMHVCDEVD : :: :: . .:: ..:. ::.:::.::. : : . ::::::::.: ..:..::::::. CCDS10 IRFTYAPVKSFLRVDKEKVQIYNPAFFKYIHDRWTEHHGRYPSTGMLVLFFALHVCDEVN 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE2 LYGFGADSKGNWHHYWENNPSAGAFRKTGVHDADFESNVTATLASINKIRIFKGR .:::::::.:::::::::: :: ::::::::::::... ::. .::....: CCDS10 VYGFGADSRGNWHHYWENNRYAGEFRKTGVHDADFEAHIIDMLAKASKIEVYRGN 300 310 320 330 340 350 >>CCDS58194.1 ST3GAL4 gene_id:6484|Hs109|chr11 (332 aa) initn: 326 init1: 242 opt: 309 Z-score: 397.3 bits: 81.9 E(33420): 8.5e-16 Smith-Waterman score: 384; 30.1% identity (59.2% similar) in 289 aa overlap (32-302:16-296) 10 20 30 40 50 60 pF1KE2 VTLRKRTLKVLTFLVLFIFLTSFFLNYSHTMVATTWFPKQMVLELSENLKRLIKHRPCTC .:. .:. . . : . : .. : CCDS58 MCPAGWKLLAMLALVLVVMVWYSISREDRYIELFYFPIPEKKEPC 10 20 30 40 70 80 90 100 110 120 pF1KE2 THCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNLNDTIKELF . .. : : : . ... ::.. ::: : : . . : . . . :. CCDS58 LQGEAESKASKLFGN-YSRD-QPIFLR----LED--YFWVKTPSAYELPYGTKGSEDLLL 50 60 70 80 90 130 140 150 160 170 180 pF1KE2 RVVPGNVDPMLEK-RSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRMNKAPTAGFEAD ::. . . . .. .:. ::::.::::. ::.:: : :...: :.:.:.::.::.:.: CCDS58 RVLAITSSSIPKNIQSLRCRRCVVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGD 100 110 120 130 140 150 190 200 210 220 230 pF1KE2 VGTKTTHHLVYPESFR-----ELGDNVSMILVPFKTIDLEWVVSAITTGT-ISHTYIPVP ::.::: .: :::: . : . .. ..:: ::..:..:. . .. . . . : CCDS58 VGSKTTMRLFYPESAHFDPKVENNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQP 160 170 180 190 200 210 240 250 260 270 280 pF1KE2 AKI-RVKQDKILIYHPAFIKYVFDNWLQ---GHGRY----PSTGILSVIFSMHVCDEVDL : :. .: : .: :.. . :. :. . : :.::.:.. ...:.:: : . CCDS58 PLIWDVNPKQIRILNPFFMEIAADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHI 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE2 YGFG---ADSKGNWHHYWENNPSAGAFRKTGVHDADFESNVTATLASINKIRIFKGR ::: : .: . ::.: CCDS58 AGFGYPDAYNKKQTIHYYEQITLKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF 280 290 300 310 320 330 >>CCDS58193.1 ST3GAL4 gene_id:6484|Hs109|chr11 (333 aa) initn: 326 init1: 242 opt: 309 Z-score: 397.3 bits: 81.9 E(33420): 8.5e-16 Smith-Waterman score: 384; 30.1% identity (59.2% similar) in 289 aa overlap (32-302:17-297) 10 20 30 40 50 60 pF1KE2 VTLRKRTLKVLTFLVLFIFLTSFFLNYSHTMVATTWFPKQMVLELSENLKRLIKHRPCTC .:. .:. . . : . : .. : CCDS58 MVSKSRWKLLAMLALVLVVMVWYSISREDRYIELFYFPIPEKKEPC 10 20 30 40 70 80 90 100 110 120 pF1KE2 THCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNLNDTIKELF . .. : : : . ... ::.. ::: : : . . : . . . :. CCDS58 LQGEAESKASKLFGN-YSRD-QPIFLR----LED--YFWVKTPSAYELPYGTKGSEDLLL 50 60 70 80 90 130 140 150 160 170 180 pF1KE2 RVVPGNVDPMLEK-RSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRMNKAPTAGFEAD ::. . . . .. .:. ::::.::::. ::.:: : :...: :.:.:.::.::.:.: CCDS58 RVLAITSSSIPKNIQSLRCRRCVVVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGD 100 110 120 130 140 150 190 200 210 220 230 pF1KE2 VGTKTTHHLVYPESFR-----ELGDNVSMILVPFKTIDLEWVVSAITTGT-ISHTYIPVP ::.::: .: :::: . : . .. ..:: ::..:..:. . .. . . . : CCDS58 VGSKTTMRLFYPESAHFDPKVENNPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQP 160 170 180 190 200 210 240 250 260 270 280 pF1KE2 AKI-RVKQDKILIYHPAFIKYVFDNWLQ---GHGRY----PSTGILSVIFSMHVCDEVDL : :. .: : .: :.. . :. :. . : :.::.:.. ...:.:: : . CCDS58 PLIWDVNPKQIRILNPFFMEIAADKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHI 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE2 YGFG---ADSKGNWHHYWENNPSAGAFRKTGVHDADFESNVTATLASINKIRIFKGR ::: : .: . ::.: CCDS58 AGFGYPDAYNKKQTIHYYEQITLKSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF 280 290 300 310 320 330 >>CCDS8474.1 ST3GAL4 gene_id:6484|Hs109|chr11 (329 aa) initn: 326 init1: 242 opt: 306 Z-score: 393.5 bits: 81.2 E(33420): 1.4e-15 Smith-Waterman score: 382; 32.0% identity (60.9% similar) in 266 aa overlap (55-302:38-293) 30 40 50 60 70 80 pF1KE2 FLNYSHTMVATTWFPKQMVLELSENLKRLIKHRPCTCTHCIGQRKLSAWFDERFNQTMQP :..:: . .. : : : . ... :: CCDS84 KLLAMLALVLVVMVWYSISREDSFYFPIPEKKEPCLQGE--AESKASKLFGN-YSRD-QP 10 20 30 40 50 60 90 100 110 120 130 140 pF1KE2 LLTAQNALLEDDTYRWWLRLQREKKPNNLNDTIKELFRVVPGNVDPMLEK-RSVGCRRCA .. ::: : : . . : . . . :.::. . . . .. .:. ::::. CCDS84 IFLR----LED--YFWVKTPSAYELPYGTKGSEDLLLRVLAITSSSIPKNIQSLRCRRCV 70 80 90 100 110 150 160 170 180 190 pF1KE2 VVGNSGNLRESSYGPEIDSHDFVLRMNKAPTAGFEADVGTKTTHHLVYPESFR-----EL ::::. ::.:: : :...: :.:.:.::.::.:.:::.::: .: :::: . : CCDS84 VVGNGHRLRNSSLGDAINKYDVVIRLNNAPVAGYEGDVGSKTTMRLFYPESAHFDPKVEN 120 130 140 150 160 170 200 210 220 230 240 250 pF1KE2 GDNVSMILVPFKTIDLEWVVSAITTGT-ISHTYIPVPAKI-RVKQDKILIYHPAFIKYVF . .. ..:: ::..:..:. . .. . . . : : :. .: : .: :.. . CCDS84 NPDTLLVLVAFKAMDFHWIETILSDKKRVRKGFWKQPPLIWDVNPKQIRILNPFFMEIAA 180 190 200 210 220 230 260 270 280 290 300 pF1KE2 DNWLQ---GHGRY----PSTGILSVIFSMHVCDEVDLYGFG---ADSKGNWHHYWENNPS :. :. . : :.::.:.. ...:.:: : . ::: : .: . ::.: CCDS84 DKLLSLPMQQPRKIKQKPTTGLLAITLALHLCDLVHIAGFGYPDAYNKKQTIHYYEQITL 240 250 260 270 280 290 310 320 330 340 pF1KE2 AGAFRKTGVHDADFESNVTATLASINKIRIFKGR CCDS84 KSMAGSGHNVSQEALAIKRMLEMGAIKNLTSF 300 310 320 >>CCDS497.1 ST3GAL3 gene_id:6487|Hs109|chr1 (359 aa) initn: 300 init1: 228 opt: 306 Z-score: 392.9 bits: 81.2 E(33420): 1.5e-15 Smith-Waterman score: 416; 31.0% identity (60.6% similar) in 274 aa overlap (84-336:84-351) 60 70 80 90 100 110 pF1KE2 IKHRPCTCTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNL : .. .. ::..: : :... : .. CCDS49 LPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRKWARIREFVPPFGI 60 70 80 90 100 110 120 130 140 150 160 pF1KE2 ---NDTIKELFRVVPG-NVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRM .. :: .. :. . : :. :. :::: .:::.: : ..: : .::..:.:.:. CCDS49 KGQDNLIKAILSVTKEYRLTPALD--SLRCRRCIIVGNGGVLANKSLGSRIDDYDIVVRL 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 NKAPTAGFEADVGTKTTHHLVYPESFRELGDNVS----MILVPFKTIDLEWVVSAITTGT :.::. ::: :::.::: ...:::. . .. ..:. :: :..:. . CCDS49 NSAPVKGFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKWQDFKWLKYIVYKER 180 190 200 210 220 230 230 240 250 260 270 pF1KE2 ISHT---YIPVPAKIRVKQDKILIYHPAFIKYV--------FDNWLQGHGRYPSTGILSV .: . . : ... . .: : .: ::. . :.: :.:.: :. : ..: CCDS49 VSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGLMGRGNIPTLGSVAV 240 250 260 270 280 290 280 290 300 310 320 330 pF1KE2 IFSMHVCDEVDLYGFGAD-SKGNWH-HYWENNPSAGAFRKTGVHDADFESNVTATLASIN ...: :::: . ::: : : : ::.:. : :.... .:. . :.. : .. CCDS49 TMALHGCDEVAVAGFGYDMSTPNAPLHYYETVRMA-AIKESWTHNIQREKEF---LRKLV 300 310 320 330 340 340 pF1KE2 KIRIFKGR : :. CCDS49 KARVITDLSSGI 350 >>CCDS492.1 ST3GAL3 gene_id:6487|Hs109|chr1 (375 aa) initn: 300 init1: 228 opt: 306 Z-score: 392.6 bits: 81.2 E(33420): 1.5e-15 Smith-Waterman score: 416; 31.0% identity (60.6% similar) in 274 aa overlap (84-336:100-367) 60 70 80 90 100 110 pF1KE2 IKHRPCTCTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNL : .. .. ::..: : :... : .. CCDS49 LPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRKWARIREFVPPFGI 70 80 90 100 110 120 120 130 140 150 160 pF1KE2 ---NDTIKELFRVVPG-NVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRM .. :: .. :. . : :. :. :::: .:::.: : ..: : .::..:.:.:. CCDS49 KGQDNLIKAILSVTKEYRLTPALD--SLRCRRCIIVGNGGVLANKSLGSRIDDYDIVVRL 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE2 NKAPTAGFEADVGTKTTHHLVYPESFRELGDNVS----MILVPFKTIDLEWVVSAITTGT :.::. ::: :::.::: ...:::. . .. ..:. :: :..:. . CCDS49 NSAPVKGFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKWQDFKWLKYIVYKER 190 200 210 220 230 240 230 240 250 260 270 pF1KE2 ISHT---YIPVPAKIRVKQDKILIYHPAFIKYV--------FDNWLQGHGRYPSTGILSV .: . . : ... . .: : .: ::. . :.: :.:.: :. : ..: CCDS49 VSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGLMGRGNIPTLGSVAV 250 260 270 280 290 300 280 290 300 310 320 330 pF1KE2 IFSMHVCDEVDLYGFGAD-SKGNWH-HYWENNPSAGAFRKTGVHDADFESNVTATLASIN ...: :::: . ::: : : : ::.:. : :.... .:. . :.. : .. CCDS49 TMALHGCDEVAVAGFGYDMSTPNAPLHYYETVRMA-AIKESWTHNIQREKEF---LRKLV 310 320 330 340 350 360 340 pF1KE2 KIRIFKGR : :. CCDS49 KARVITDLSSGI 370 >>CCDS494.1 ST3GAL3 gene_id:6487|Hs109|chr1 (390 aa) initn: 300 init1: 228 opt: 306 Z-score: 392.4 bits: 81.2 E(33420): 1.6e-15 Smith-Waterman score: 416; 31.0% identity (60.6% similar) in 274 aa overlap (84-336:115-382) 60 70 80 90 100 110 pF1KE2 IKHRPCTCTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNL : .. .. ::..: : :... : .. CCDS49 LPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRKWARIREFVPPFGI 90 100 110 120 130 140 120 130 140 150 160 pF1KE2 ---NDTIKELFRVVPG-NVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRM .. :: .. :. . : :. :. :::: .:::.: : ..: : .::..:.:.:. CCDS49 KGQDNLIKAILSVTKEYRLTPALD--SLRCRRCIIVGNGGVLANKSLGSRIDDYDIVVRL 150 160 170 180 190 200 170 180 190 200 210 220 pF1KE2 NKAPTAGFEADVGTKTTHHLVYPESFRELGDNVS----MILVPFKTIDLEWVVSAITTGT :.::. ::: :::.::: ...:::. . .. ..:. :: :..:. . CCDS49 NSAPVKGFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKWQDFKWLKYIVYKER 210 220 230 240 250 260 230 240 250 260 270 pF1KE2 ISHT---YIPVPAKIRVKQDKILIYHPAFIKYV--------FDNWLQGHGRYPSTGILSV .: . . : ... . .: : .: ::. . :.: :.:.: :. : ..: CCDS49 VSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGLMGRGNIPTLGSVAV 270 280 290 300 310 320 280 290 300 310 320 330 pF1KE2 IFSMHVCDEVDLYGFGAD-SKGNWH-HYWENNPSAGAFRKTGVHDADFESNVTATLASIN ...: :::: . ::: : : : ::.:. : :.... .:. . :.. : .. CCDS49 TMALHGCDEVAVAGFGYDMSTPNAPLHYYETVRMA-AIKESWTHNIQREKEF---LRKLV 330 340 350 360 370 340 pF1KE2 KIRIFKGR : :. CCDS49 KARVITDLSSGI 380 390 >>CCDS498.1 ST3GAL3 gene_id:6487|Hs109|chr1 (413 aa) initn: 300 init1: 228 opt: 306 Z-score: 392.0 bits: 81.2 E(33420): 1.7e-15 Smith-Waterman score: 416; 31.0% identity (60.6% similar) in 274 aa overlap (84-336:138-405) 60 70 80 90 100 110 pF1KE2 IKHRPCTCTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNL : .. .. ::..: : :... : .. CCDS49 RPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRKWARIREFVPPFGI 110 120 130 140 150 160 120 130 140 150 160 pF1KE2 ---NDTIKELFRVVPG-NVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRM .. :: .. :. . : :. :. :::: .:::.: : ..: : .::..:.:.:. CCDS49 KGQDNLIKAILSVTKEYRLTPALD--SLRCRRCIIVGNGGVLANKSLGSRIDDYDIVVRL 170 180 190 200 210 220 170 180 190 200 210 220 pF1KE2 NKAPTAGFEADVGTKTTHHLVYPESFRELGDNVS----MILVPFKTIDLEWVVSAITTGT :.::. ::: :::.::: ...:::. . .. ..:. :: :..:. . CCDS49 NSAPVKGFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKWQDFKWLKYIVYKER 230 240 250 260 270 280 230 240 250 260 270 pF1KE2 ISHT---YIPVPAKIRVKQDKILIYHPAFIKYV--------FDNWLQGHGRYPSTGILSV .: . . : ... . .: : .: ::. . :.: :.:.: :. : ..: CCDS49 VSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGLMGRGNIPTLGSVAV 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE2 IFSMHVCDEVDLYGFGAD-SKGNWH-HYWENNPSAGAFRKTGVHDADFESNVTATLASIN ...: :::: . ::: : : : ::.:. : :.... .:. . :.. : .. CCDS49 TMALHGCDEVAVAGFGYDMSTPNAPLHYYETVRMA-AIKESWTHNIQREKEF---LRKLV 350 360 370 380 390 400 340 pF1KE2 KIRIFKGR : :. CCDS49 KARVITDLSSGI 410 >>CCDS496.1 ST3GAL3 gene_id:6487|Hs109|chr1 (429 aa) initn: 300 init1: 228 opt: 306 Z-score: 391.7 bits: 81.2 E(33420): 1.7e-15 Smith-Waterman score: 416; 31.0% identity (60.6% similar) in 274 aa overlap (84-336:154-421) 60 70 80 90 100 110 pF1KE2 IKHRPCTCTHCIGQRKLSAWFDERFNQTMQPLLTAQNALLEDDTYRWWLRLQREKKPNNL : .. .. ::..: : :... : .. CCDS49 RPAELATKYANFSEGACKPGYASALMTAIFPRFSKPAPMFLDDSFRKWARIREFVPPFGI 130 140 150 160 170 180 120 130 140 150 160 pF1KE2 ---NDTIKELFRVVPG-NVDPMLEKRSVGCRRCAVVGNSGNLRESSYGPEIDSHDFVLRM .. :: .. :. . : :. :. :::: .:::.: : ..: : .::..:.:.:. CCDS49 KGQDNLIKAILSVTKEYRLTPALD--SLRCRRCIIVGNGGVLANKSLGSRIDDYDIVVRL 190 200 210 220 230 240 170 180 190 200 210 220 pF1KE2 NKAPTAGFEADVGTKTTHHLVYPESFRELGDNVS----MILVPFKTIDLEWVVSAITTGT :.::. ::: :::.::: ...:::. . .. ..:. :: :..:. . CCDS49 NSAPVKGFEKDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKWQDFKWLKYIVYKER 250 260 270 280 290 300 230 240 250 260 270 pF1KE2 ISHT---YIPVPAKIRVKQDKILIYHPAFIKYV--------FDNWLQGHGRYPSTGILSV .: . . : ... . .: : .: ::. . :.: :.:.: :. : ..: CCDS49 VSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNNGLMGRGNIPTLGSVAV 310 320 330 340 350 360 280 290 300 310 320 330 pF1KE2 IFSMHVCDEVDLYGFGAD-SKGNWH-HYWENNPSAGAFRKTGVHDADFESNVTATLASIN ...: :::: . ::: : : : ::.:. : :.... .:. . :.. : .. CCDS49 TMALHGCDEVAVAGFGYDMSTPNAPLHYYETVRMA-AIKESWTHNIQREKEF---LRKLV 370 380 390 400 410 340 pF1KE2 KIRIFKGR : :. CCDS49 KARVITDLSSGI 420 340 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Jul 3 20:34:51 2020 done: Fri Jul 3 20:34:52 2020 Total Scan time: 1.110 Total Display time: 0.060 Function used was FASTA [36.3.4 Apr, 2011]