FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1786, 294 aa
  1>>>pF1KE1786 294 - 294 aa - 294 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 11.2608+/- 0.001; mu= -8.7549+/- 0.061
 mean_var=446.7419+/-89.873, 0's: 0 Z-trim(117.4): 29 B-trim: 72 in 1/52
 Lambda= 0.060680
 statistics sampled from 18090 (18114) to 18090 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.832), E-opt: 0.2 (0.556), width: 16
 Scan time: 3.010

The best scores are:                                      opt bits E(32554)
CCDS4007.1 SMN2 gene_id:6607|Hs108|chr5            ( 294) 2106 197.6 9.2e-51
CCDS34181.1 SMN1 gene_id:6606|Hs108|chr5           ( 294) 2106 197.6 9.2e-51
CCDS54867.1 SMN2 gene_id:6607|Hs108|chr5           ( 282) 1990 187.4   1e-47
CCDS75256.1 SMN1 gene_id:6606|Hs108|chr5           ( 282) 1990 187.4   1e-47
CCDS4008.1 SMN2 gene_id:6607|Hs108|chr5            ( 262) 1460 141.0   9e-34
CCDS34182.1 SMN1 gene_id:6606|Hs108|chr5           ( 262) 1460 141.0   9e-34

>>CCDS4007.1 SMN2 gene_id:6607|Hs108|chr5 (294 aa)
 initn: 2106 init1: 2106 opt: 2106 Z-score: 1024.8 bits: 197.6 E(32554): 9.2e-51
Smith-Waterman score: 2106; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294)

               10        20        30        40        50        60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
              250       260       270       280       290

>>CCDS34181.1 SMN1 gene_id:6606|Hs108|chr5 (294 aa)
 initn: 2106 init1: 2106 opt: 2106 Z-score: 1024.8 bits: 197.6 E(32554): 9.2e-51
Smith-Waterman score: 2106; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294)

               10        20        30        40        50        60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
              250       260       270       280       290

>>CCDS54867.1 SMN2 gene_id:6607|Hs108|chr5 (282 aa)
 initn: 1990 init1: 1990 opt: 1990 Z-score: 970.2 bits: 187.4 E(32554): 1e-47
Smith-Waterman score: 1990; 100.0% identity (100.0% similar) in 278 aa overlap (1-278:1-278)

               10        20        30        40        50        60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
       ::::::::::::::::::::::::::::::::::::::
CCDS54 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMEMLA
              250       260       270       280

>>CCDS75256.1 SMN1 gene_id:6606|Hs108|chr5 (282 aa)
 initn: 1990 init1: 1990 opt: 1990 Z-score: 970.2 bits: 187.4 E(32554): 1e-47
Smith-Waterman score: 1990; 100.0% identity (100.0% similar) in 278 aa overlap (1-278:1-278)

               10        20        30        40        50        60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
       ::::::::::::::::::::::::::::::::::::::
CCDS75 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMEMLA
              250       260       270       280

>>CCDS4008.1 SMN2 gene_id:6607|Hs108|chr5 (262 aa)
 initn: 1811 init1: 1432 opt: 1460 Z-score: 719.8 bits: 141.0 E(32554): 9e-34
Smith-Waterman score: 1751; 89.1% identity (89.1% similar) in 294 aa overlap (1-294:1-262)

               10        20        30        40        50        60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
       :::::::::::::::::::::::::::::
CCDS40 DNIKPKSAPWNSFLPPPPPMPGPRLGPGK-------------------------------
              190       200

              250       260       270       280       290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
        :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 -IIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
      210       220       230       240       250       260

>>CCDS34182.1 SMN1 gene_id:6606|Hs108|chr5 (262 aa)
 initn: 1811 init1: 1432 opt: 1460 Z-score: 719.8 bits: 141.0 E(32554): 9e-34
Smith-Waterman score: 1751; 89.1% identity (89.1% similar) in 294 aa overlap (1-294:1-262)

               10        20        30        40        50        60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
       :::::::::::::::::::::::::::::
CCDS34 DNIKPKSAPWNSFLPPPPPMPGPRLGPGK-------------------------------
              190       200

              250       260       270       280       290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
        :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 -IIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
      210       220       230       240       250       260

294 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 11:21:24 2016 done: Sun Nov 6 11:21:25 2016
 Total Scan time: 3.010 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]