FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1646, 161 aa 1>>>pF1KE1646 161 - 161 aa - 161 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4100+/-0.000985; mu= 10.8633+/- 0.058 mean_var=52.1974+/-10.920, 0's: 0 Z-trim(102.4): 16 B-trim: 41 in 1/47 Lambda= 0.177521 statistics sampled from 6907 (6915) to 6907 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.588), E-opt: 0.2 (0.212), width: 16 Scan time: 1.640 The best scores are: opt bits E(32554) CCDS9337.1 ALOX5AP gene_id:241|Hs108|chr13 ( 161) 1046 275.8 8e-75 CCDS73558.1 ALOX5AP gene_id:241|Hs108|chr13 ( 218) 1046 275.8 1.1e-74 CCDS3749.1 MGST2 gene_id:4258|Hs108|chr4 ( 147) 276 78.6 1.7e-15 CCDS34316.1 LTC4S gene_id:4056|Hs108|chr5 ( 150) 255 73.2 7.2e-14 >>CCDS9337.1 ALOX5AP gene_id:241|Hs108|chr13 (161 aa) initn: 1046 init1: 1046 opt: 1046 Z-score: 1456.8 bits: 275.8 E(32554): 8e-75 Smith-Waterman score: 1046; 100.0% identity (100.0% similar) in 161 aa overlap (1-161:1-161) 10 20 30 40 50 60 pF1KE1 MDQETVGNVVLLAIVTLISVVQNGFFAHKVEHESRTQNGRSFQRTGTLAFERVYTANQNC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 MDQETVGNVVLLAIVTLISVVQNGFFAHKVEHESRTQNGRSFQRTGTLAFERVYTANQNC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 VDAYPTFLAVLWSAGLLCSQVPAAFAGLMYLFVRQKYFVGYLGERTQSTPGYIFGKRIIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 VDAYPTFLAVLWSAGLLCSQVPAAFAGLMYLFVRQKYFVGYLGERTQSTPGYIFGKRIIL 70 80 90 100 110 120 130 140 150 160 pF1KE1 FLFLMSVAGIFNYYLIFFFGSDFENYIKTISTTISPLLLIP ::::::::::::::::::::::::::::::::::::::::: CCDS93 FLFLMSVAGIFNYYLIFFFGSDFENYIKTISTTISPLLLIP 130 140 150 160 >>CCDS73558.1 ALOX5AP gene_id:241|Hs108|chr13 (218 aa) initn: 1046 init1: 1046 opt: 1046 Z-score: 1454.5 bits: 275.8 E(32554): 1.1e-74 Smith-Waterman score: 1046; 100.0% identity (100.0% similar) in 161 aa overlap (1-161:58-218) 10 20 30 pF1KE1 MDQETVGNVVLLAIVTLISVVQNGFFAHKV :::::::::::::::::::::::::::::: CCDS73 LGHIGNISHQCWAGCAAGGRAVLSGEPEANMDQETVGNVVLLAIVTLISVVQNGFFAHKV 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE1 EHESRTQNGRSFQRTGTLAFERVYTANQNCVDAYPTFLAVLWSAGLLCSQVPAAFAGLMY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 EHESRTQNGRSFQRTGTLAFERVYTANQNCVDAYPTFLAVLWSAGLLCSQVPAAFAGLMY 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE1 LFVRQKYFVGYLGERTQSTPGYIFGKRIILFLFLMSVAGIFNYYLIFFFGSDFENYIKTI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 LFVRQKYFVGYLGERTQSTPGYIFGKRIILFLFLMSVAGIFNYYLIFFFGSDFENYIKTI 150 160 170 180 190 200 160 pF1KE1 STTISPLLLIP ::::::::::: CCDS73 STTISPLLLIP 210 >>CCDS3749.1 MGST2 gene_id:4258|Hs108|chr4 (147 aa) initn: 288 init1: 276 opt: 276 Z-score: 391.7 bits: 78.6 E(32554): 1.7e-15 Smith-Waterman score: 276; 37.2% identity (62.8% similar) in 129 aa overlap (7-135:3-131) 10 20 30 40 50 60 pF1KE1 MDQETVGNVVLLAIVTLISVVQNGFFAHKVEHESRTQNGRSFQRTGTLAFERVYTANQNC :: .::: :...:. :...:: .: . . ::. ::::. :.::: CCDS37 MAGNSILLAAVSILSACQQSYFALQVGKARLKYKVTPPAVTGSPEFERVFRAQQNC 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 VDAYPTFLAVLWSAGLLCSQVPAAFAGLMYLFVRQKYFVGYLGERTQSTPGYIFGKRIIL :. :: :. .:: :: .:: :. ::.:.. :. :: :: . :. .. :. CCDS37 VEFYPIFIITLWMAGWYFNQVFATCLGLVYIYGRHLYFWGYSEAAKKRITGFRLSLGILA 60 70 80 90 100 110 130 140 150 160 pF1KE1 FLFLMSVAGIFNYYLIFFFGSDFENYIKTISTTISPLLLIP .: :... :: : .: CCDS37 LLTLLGALGIANSFLDEYLDLNIAKKLRRQF 120 130 140 >>CCDS34316.1 LTC4S gene_id:4056|Hs108|chr5 (150 aa) initn: 240 init1: 240 opt: 255 Z-score: 362.5 bits: 73.2 E(32554): 7.2e-14 Smith-Waterman score: 255; 36.7% identity (64.1% similar) in 128 aa overlap (9-135:5-131) 10 20 30 40 50 60 pF1KE1 MDQETVGNVVLLAIVTLISVVQNGFFAHKVEHESRTQNGRSFQRTGTLAFERVYTANQNC :.::: :::..:. ...:. .: :. :: ::::: :. :: CCDS34 MKDEVALLAAVTLLGVLLQAYFSLQVISARRAFRVSPPLTTGPPEFERVYRAQVNC 10 20 30 40 50 70 80 90 100 110 pF1KE1 VDAYPTFLAVLWSAGLLCSQVPAAFAGLMYLFVRQKYFVGYL-GERTQSTPGYIFGKRII . .: :::.:: ::.. . ::. ::.:::.: .:: :: . . . .: : . : . CCDS34 SEYFPLFLATLWVAGIFFHEGAAALCGLVYLFARLRYFQGYARSAQLRLAPLYA-SARAL 60 70 80 90 100 110 120 130 140 150 160 pF1KE1 LFLFLMSVAGIFNYYLIFFFGSDFENYIKTISTTISPLLLIP .: ... :.. ..: CCDS34 WLLVALAALGLLAHFLPAALRAALLGRLRTLLPWA 120 130 140 150 161 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:38:35 2016 done: Sun Nov 6 14:38:35 2016 Total Scan time: 1.640 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]