FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2306, 297 aa
  1>>>pF1KE2306 297 - 297 aa - 297 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.1981+/-0.000786; mu= 7.3478+/- 0.047
 mean_var=122.4737+/-24.340, 0's: 0 Z-trim(112.2): 12 B-trim: 35 in 1/51
 Lambda= 0.115892
 statistics sampled from 12981 (12988) to 12981 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.759), E-opt: 0.2 (0.399), width: 16
 Scan time: 2.390

The best scores are:                                       opt bits E(32554)
CCDS12662.1 ERCC1 gene_id:2067|Hs108|chr19         ( 297) 1987 342.7 1.9e-94
CCDS12663.1 ERCC1 gene_id:2067|Hs108|chr19         ( 323) 1887 326.0 2.2e-89
CCDS54279.1 ERCC1 gene_id:2067|Hs108|chr19         ( 273) 1587 275.8 2.5e-74

>>CCDS12662.1 ERCC1 gene_id:2067|Hs108|chr19 (297 aa)
 initn: 1987 init1: 1987 opt: 1987 Z-score: 1809.0 bits: 342.7 E(32554): 1.9e-94
Smith-Waterman score: 1987; 100.0% identity (100.0% similar) in 297 aa overlap (1-297:1-297)

               10        20        30        40        50        60
pF1KE2 MDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQSLPTVDTSAQAAPQTY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQSLPTVDTSAQAAPQTY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 AEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSIIVSPRQRGNPVLKFVRNVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 AEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSIIVSPRQRGNPVLKFVRNVP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 WEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSLGKNFALRVLLVQVDVKDPQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 WEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSLGKNFALRVLLVQVDVKDPQQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 ALKELAKMCILADCTLILAWSPEEAGRYLETYKAYEQKPADLLMEKLEQDFVSRVTECLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 ALKELAKMCILADCTLILAWSPEEAGRYLETYKAYEQKPADLLMEKLEQDFVSRVTECLT
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE2 TVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPGLGPQKARRLFDVLHEPFLKVP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 TVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPGLGPQKARRLFDVLHEPFLKVP
              250       260       270       280       290

>>CCDS12663.1 ERCC1 gene_id:2067|Hs108|chr19 (323 aa)
 initn: 1912 init1: 1887 opt: 1887 Z-score: 1718.0 bits: 326.0 E(32554): 2.2e-89
Smith-Waterman score: 1887; 99.3% identity (99.6% similar) in 285 aa overlap (1-285:1-285)

               10        20        30        40        50        60
pF1KE2 MDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQSLPTVDTSAQAAPQTY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQSLPTVDTSAQAAPQTY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 AEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSIIVSPRQRGNPVLKFVRNVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 AEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSIIVSPRQRGNPVLKFVRNVP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 WEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSLGKNFALRVLLVQVDVKDPQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 WEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSLGKNFALRVLLVQVDVKDPQQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 ALKELAKMCILADCTLILAWSPEEAGRYLETYKAYEQKPADLLMEKLEQDFVSRVTECLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 ALKELAKMCILADCTLILAWSPEEAGRYLETYKAYEQKPADLLMEKLEQDFVSRVTECLT
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE2 TVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPGLGPQKARRLFDVLHEPFLKVP
       :::::::::::::::::::::::::::::::::::::::::.: :
CCDS12 TVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPGLGPQKVRALGKNPRSWGKERAPNK
              250       260       270       280       290       300

CCDS12 HNLRPQSFKVKKEPKTRHSGFRL
              310       320

>>CCDS54279.1 ERCC1 gene_id:2067|Hs108|chr19 (273 aa)
 initn: 1612 init1: 1587 opt: 1587 Z-score: 1448.1 bits: 275.8 E(32554): 2.5e-74
Smith-Waterman score: 1785; 91.9% identity (91.9% similar) in 297 aa overlap (1-297:1-273)

               10        20        30        40        50        60
pF1KE2 MDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQSLPTVDTSAQAAPQTY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MDPGKDKEGVPQPSGPPARKKFVIPLDEDEVPPGVAKPLFRSTQSLPTVDTSAQAAPQTY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 AEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSIIVSPRQRGNPVLKFVRNVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 AEYAISQPLEGAGATCPTGSEPLAGETPNQALKPGAKSNSIIVSPRQRGNPVLKFVRNVP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 WEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSLGKNFALRVLLVQVDVKDPQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 WEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQSLGKNFALRVLLVQVDVKDPQQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 ALKELAKMCILADCTLILAWSPEEAGRYLETYKAYEQKPADLLMEKLEQDFVSRVTECLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ALKELAKMCILADCTLILAWSPEEAGRYLETYKAYEQKPADLLMEKLEQDFVSR------
              190       200       210       220       230

              250       260       270       280       290
pF1KE2 TVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPGLGPQKARRLFDVLHEPFLKVP
                         :::::::::::::::::::::::::::::::::::::::
CCDS54 ------------------SLEQLIAASREDLALCPGLGPQKARRLFDVLHEPFLKVP
                            240       250       260       270

297 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 02:05:15 2016 done: Mon Nov 7 02:05:16 2016
 Total Scan time: 2.390 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]