FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0997, 175 aa 1>>>pF1KE0997 175 - 175 aa - 175 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2656+/-0.000656; mu= 12.8013+/- 0.040 mean_var=63.6419+/-12.814, 0's: 0 Z-trim(110.4): 15 B-trim: 0 in 0/49 Lambda= 0.160769 statistics sampled from 11570 (11580) to 11570 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.75), E-opt: 0.2 (0.356), width: 16 Scan time: 1.870 The best scores are: opt bits E(32554) CCDS46973.1 ST6GAL1 gene_id:6480|Hs108|chr3 ( 175) 1241 295.8 9.1e-81 CCDS3285.1 ST6GAL1 gene_id:6480|Hs108|chr3 ( 406) 1241 296.0 1.8e-80 CCDS2073.1 ST6GAL2 gene_id:84620|Hs108|chr2 ( 529) 735 178.7 4.9e-45 CCDS46380.1 ST6GAL2 gene_id:84620|Hs108|chr2 ( 466) 388 98.2 7.5e-21 >>CCDS46973.1 ST6GAL1 gene_id:6480|Hs108|chr3 (175 aa) initn: 1241 init1: 1241 opt: 1241 Z-score: 1563.5 bits: 295.8 E(32554): 9.1e-81 Smith-Waterman score: 1241; 100.0% identity (100.0% similar) in 175 aa overlap (1-175:1-175) 10 20 30 40 50 60 pF1KE0 MNSQLVTTEKRFLKDSLYNEGILIVWDPSVYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MNSQLVTTEKRFLKDSLYNEGILIVWDPSVYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 PFYILKPQMPWELWDILQEISPEEIQPNPPSSGMLGIIIMMTLCDQVDIYEFLPSKRKTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 PFYILKPQMPWELWDILQEISPEEIQPNPPSSGMLGIIIMMTLCDQVDIYEFLPSKRKTD 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 VCYYYQKFFDSACTMGAYHPLLYEKNLVKHLNQGTDEDIYLLGKATLPGFRTIHC ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VCYYYQKFFDSACTMGAYHPLLYEKNLVKHLNQGTDEDIYLLGKATLPGFRTIHC 130 140 150 160 170 >>CCDS3285.1 ST6GAL1 gene_id:6480|Hs108|chr3 (406 aa) initn: 1241 init1: 1241 opt: 1241 Z-score: 1558.0 bits: 296.0 E(32554): 1.8e-80 Smith-Waterman score: 1241; 100.0% identity (100.0% similar) in 175 aa overlap (1-175:232-406) 10 20 30 pF1KE0 MNSQLVTTEKRFLKDSLYNEGILIVWDPSV :::::::::::::::::::::::::::::: CCDS32 IDDHDAVLRFNGAPTANFQQDVGTKTTIRLMNSQLVTTEKRFLKDSLYNEGILIVWDPSV 210 220 230 240 250 260 40 50 60 70 80 90 pF1KE0 YHSDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 YHSDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNPP 270 280 290 300 310 320 100 110 120 130 140 150 pF1KE0 SSGMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVKH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 SSGMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVKH 330 340 350 360 370 380 160 170 pF1KE0 LNQGTDEDIYLLGKATLPGFRTIHC ::::::::::::::::::::::::: CCDS32 LNQGTDEDIYLLGKATLPGFRTIHC 390 400 >>CCDS2073.1 ST6GAL2 gene_id:84620|Hs108|chr2 (529 aa) initn: 722 init1: 722 opt: 735 Z-score: 922.0 bits: 178.7 E(32554): 4.9e-45 Smith-Waterman score: 735; 52.8% identity (86.9% similar) in 176 aa overlap (1-175:344-519) 10 20 pF1KE0 MNSQLVTTEKR-FLKDSLYNEGILIVWDPS .:::..:. .. :. .:::.. ::..:::. CCDS20 IDSHDAVLRFNSAPTRGYEKDVGNKTTIRIINSQILTNPSHHFIDSSLYKDVILVAWDPA 320 330 340 350 360 370 30 40 50 60 70 80 pF1KE0 VYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNP : ... ::..::::.:. : .:. .::::::::.:.. :.::::.:: . :.::::: CCDS20 PYSANLNLWYKKPDYNLFTPYIQHRQRNPNQPFYILHPKFIWQLWDIIQENTKEKIQPNP 380 390 400 410 420 430 90 100 110 120 130 140 pF1KE0 PSSGMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVK ::::..::.:::..: .: .::..:: :.:..:.:.. ..:.:::.:::::::::: ::. CCDS20 PSSGFIGILIMMSMCREVHVYEYIPSVRQTELCHYHELYYDAACTLGAYHPLLYEKLLVQ 440 450 460 470 480 490 150 160 170 pF1KE0 HLNQGTDEDIYLLGKATLPGFRTIHC .::.::. :.. ::..::::...:: CCDS20 RLNMGTQGDLHRKGKVVLPGFQAVHCPAPSPVIPHS 500 510 520 >>CCDS46380.1 ST6GAL2 gene_id:84620|Hs108|chr2 (466 aa) initn: 387 init1: 363 opt: 388 Z-score: 487.8 bits: 98.2 E(32554): 7.5e-21 Smith-Waterman score: 388; 50.0% identity (84.0% similar) in 100 aa overlap (1-99:344-443) 10 20 pF1KE0 MNSQLVTTEKR-FLKDSLYNEGILIVWDPS .:::..:. .. :. .:::.. ::..:::. CCDS46 IDSHDAVLRFNSAPTRGYEKDVGNKTTIRIINSQILTNPSHHFIDSSLYKDVILVAWDPA 320 330 340 350 360 370 30 40 50 60 70 80 pF1KE0 VYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNP : ... ::..::::.:. : .:. .::::::::.:.. :.::::.:: . :.::::: CCDS46 PYSANLNLWYKKPDYNLFTPYIQHRQRNPNQPFYILHPKFIWQLWDIIQENTKEKIQPNP 380 390 400 410 420 430 90 100 110 120 130 140 pF1KE0 PSSGMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVK ::::..: .. CCDS46 PSSGFIGSFVKIGHIRACSEPRSRDCTPAWTTE 440 450 460 175 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 16:01:34 2016 done: Mon Nov 7 16:01:34 2016 Total Scan time: 1.870 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]