FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5225, 250 aa 1>>>pF1KE5225 250 - 250 aa - 250 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.3458+/-0.00084; mu= 8.8764+/- 0.051 mean_var=175.1442+/-34.067, 0's: 0 Z-trim(114.2): 4 B-trim: 0 in 0/53 Lambda= 0.096912 statistics sampled from 14754 (14757) to 14754 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.777), E-opt: 0.2 (0.453), width: 16 Scan time: 2.420 The best scores are: opt bits E(32554) CCDS9720.1 GCH1 gene_id:2643|Hs108|chr14 ( 250) 1673 244.9 3.8e-65 CCDS41954.1 GCH1 gene_id:2643|Hs108|chr14 ( 233) 1413 208.5 3.2e-54 CCDS45110.1 GCH1 gene_id:2643|Hs108|chr14 ( 213) 1407 207.6 5.4e-54 >>CCDS9720.1 GCH1 gene_id:2643|Hs108|chr14 (250 aa) initn: 1673 init1: 1673 opt: 1673 Z-score: 1283.0 bits: 244.9 E(32554): 3.8e-65 Smith-Waterman score: 1673; 100.0% identity (100.0% similar) in 250 aa overlap (1-250:1-250) 10 20 30 40 50 60 pF1KE5 MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEAKSAQPADGWKGERPRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEAKSAQPADGWKGERPRS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAMQFFTKGYQETISDVLNDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 EEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAMQFFTKGYQETISDVLNDA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 IFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNKQVLGLSKLARIVEIYSRRLQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 IFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNKQVLGLSKLARIVEIYSRRLQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 VQERLTKQIAVAITEALRPAGVGVVVEATHMCMVMRGVQKMNSKTVTSTMLGVFREDPKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 VQERLTKQIAVAITEALRPAGVGVVVEATHMCMVMRGVQKMNSKTVTSTMLGVFREDPKT 190 200 210 220 230 240 250 pF1KE5 REEFLTLIRS :::::::::: CCDS97 REEFLTLIRS 250 >>CCDS41954.1 GCH1 gene_id:2643|Hs108|chr14 (233 aa) initn: 1407 init1: 1407 opt: 1413 Z-score: 1086.9 bits: 208.5 E(32554): 3.2e-54 Smith-Waterman score: 1413; 94.6% identity (96.9% similar) in 223 aa overlap (1-223:1-223) 10 20 30 40 50 60 pF1KE5 MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEAKSAQPADGWKGERPRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEAKSAQPADGWKGERPRS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAMQFFTKGYQETISDVLNDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 EEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAMQFFTKGYQETISDVLNDA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 IFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNKQVLGLSKLARIVEIYSRRLQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 IFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNKQVLGLSKLARIVEIYSRRLQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 VQERLTKQIAVAITEALRPAGVGVVVEATHMCMVMRGVQKMNSKTVTSTMLGVFREDPKT :::::::::::::::::::::::::::::. .:.. . : CCDS41 VQERLTKQIAVAITEALRPAGVGVVVEATKSNKYNKGLSPLLSSCHLFVAILK 190 200 210 220 230 250 pF1KE5 REEFLTLIRS >>CCDS45110.1 GCH1 gene_id:2643|Hs108|chr14 (213 aa) initn: 1407 init1: 1407 opt: 1407 Z-score: 1082.9 bits: 207.6 E(32554): 5.4e-54 Smith-Waterman score: 1407; 100.0% identity (100.0% similar) in 209 aa overlap (1-209:1-209) 10 20 30 40 50 60 pF1KE5 MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEAKSAQPADGWKGERPRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MEKGPVRAPAEKPRGARCSNGFPERDPPRPGPSRPAEKPPRPEAKSAQPADGWKGERPRS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAMQFFTKGYQETISDVLNDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 EEDNELNLPNLAAAYSSILSSLGENPQRQGLLKTPWRAASAMQFFTKGYQETISDVLNDA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 IFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNKQVLGLSKLARIVEIYSRRLQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 IFDEDHDEMVIVKDIDMFSMCEHHLVPFVGKVHIGYLPNKQVLGLSKLARIVEIYSRRLQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 VQERLTKQIAVAITEALRPAGVGVVVEATHMCMVMRGVQKMNSKTVTSTMLGVFREDPKT ::::::::::::::::::::::::::::: CCDS45 VQERLTKQIAVAITEALRPAGVGVVVEATSAEP 190 200 210 250 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:40:31 2016 done: Mon Nov 7 22:40:32 2016 Total Scan time: 2.420 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]