FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1786, 294 aa
1>>>pF1KE1786 294 - 294 aa - 294 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.2608+/- 0.001; mu= -8.7549+/- 0.061
mean_var=446.7419+/-89.873, 0's: 0 Z-trim(117.4): 29 B-trim: 72 in 1/52
Lambda= 0.060680
statistics sampled from 18090 (18114) to 18090 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.832), E-opt: 0.2 (0.556), width: 16
Scan time: 3.010
The best scores are: opt bits E(32554)
CCDS4007.1 SMN2 gene_id:6607|Hs108|chr5 ( 294) 2106 197.6 9.2e-51
CCDS34181.1 SMN1 gene_id:6606|Hs108|chr5 ( 294) 2106 197.6 9.2e-51
CCDS54867.1 SMN2 gene_id:6607|Hs108|chr5 ( 282) 1990 187.4 1e-47
CCDS75256.1 SMN1 gene_id:6606|Hs108|chr5 ( 282) 1990 187.4 1e-47
CCDS4008.1 SMN2 gene_id:6607|Hs108|chr5 ( 262) 1460 141.0 9e-34
CCDS34182.1 SMN1 gene_id:6606|Hs108|chr5 ( 262) 1460 141.0 9e-34
>>CCDS4007.1 SMN2 gene_id:6607|Hs108|chr5 (294 aa)
initn: 2106 init1: 2106 opt: 2106 Z-score: 1024.8 bits: 197.6 E(32554): 9.2e-51
Smith-Waterman score: 2106; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294)
10 20 30 40 50 60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
190 200 210 220 230 240
250 260 270 280 290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
250 260 270 280 290
>>CCDS34181.1 SMN1 gene_id:6606|Hs108|chr5 (294 aa)
initn: 2106 init1: 2106 opt: 2106 Z-score: 1024.8 bits: 197.6 E(32554): 9.2e-51
Smith-Waterman score: 2106; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294)
10 20 30 40 50 60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
190 200 210 220 230 240
250 260 270 280 290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
250 260 270 280 290
>>CCDS54867.1 SMN2 gene_id:6607|Hs108|chr5 (282 aa)
initn: 1990 init1: 1990 opt: 1990 Z-score: 970.2 bits: 187.4 E(32554): 1e-47
Smith-Waterman score: 1990; 100.0% identity (100.0% similar) in 278 aa overlap (1-278:1-278)
10 20 30 40 50 60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
190 200 210 220 230 240
250 260 270 280 290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
::::::::::::::::::::::::::::::::::::::
CCDS54 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMEMLA
250 260 270 280
>>CCDS75256.1 SMN1 gene_id:6606|Hs108|chr5 (282 aa)
initn: 1990 init1: 1990 opt: 1990 Z-score: 970.2 bits: 187.4 E(32554): 1e-47
Smith-Waterman score: 1990; 100.0% identity (100.0% similar) in 278 aa overlap (1-278:1-278)
10 20 30 40 50 60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS75 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
190 200 210 220 230 240
250 260 270 280 290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
::::::::::::::::::::::::::::::::::::::
CCDS75 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMEMLA
250 260 270 280
>>CCDS4008.1 SMN2 gene_id:6607|Hs108|chr5 (262 aa)
initn: 1811 init1: 1432 opt: 1460 Z-score: 719.8 bits: 141.0 E(32554): 9e-34
Smith-Waterman score: 1751; 89.1% identity (89.1% similar) in 294 aa overlap (1-294:1-262)
10 20 30 40 50 60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
:::::::::::::::::::::::::::::
CCDS40 DNIKPKSAPWNSFLPPPPPMPGPRLGPGK-------------------------------
190 200
250 260 270 280 290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 -IIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
210 220 230 240 250 260
>>CCDS34182.1 SMN1 gene_id:6606|Hs108|chr5 (262 aa)
initn: 1811 init1: 1432 opt: 1460 Z-score: 719.8 bits: 141.0 E(32554): 9e-34
Smith-Waterman score: 1751; 89.1% identity (89.1% similar) in 294 aa overlap (1-294:1-262)
10 20 30 40 50 60
pF1KE1 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDIC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETSGKPKTTPKRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ETCVVVYTGYGNREEQNLSDLLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 DNIKPKSAPWNSFLPPPPPMPGPRLGPGKPGLKFNGPPPPPPPPPPHLLSCWLPPFPSGP
:::::::::::::::::::::::::::::
CCDS34 DNIKPKSAPWNSFLPPPPPMPGPRLGPGK-------------------------------
190 200
250 260 270 280 290
pF1KE1 PIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 -IIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN
210 220 230 240 250 260
294 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 11:21:24 2016 done: Sun Nov 6 11:21:25 2016
Total Scan time: 3.010 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]