FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5180, 152 aa 1>>>pF1KE5180 152 - 152 aa - 152 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.5634+/-0.000269; mu= 16.0635+/- 0.017 mean_var=60.7941+/-12.571, 0's: 0 Z-trim(119.6): 8 B-trim: 1716 in 1/54 Lambda= 0.164491 statistics sampled from 33716 (33725) to 33716 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.783), E-opt: 0.2 (0.395), width: 16 Scan time: 3.700 The best scores are: opt bits E(85289) XP_005259194 (OMIM: 603675) PREDICTED: 40S ribosom ( 152) 1075 262.5 1.8e-70 NP_001011 (OMIM: 603675) 40S ribosomal protein S16 ( 146) 543 136.3 1.8e-32 NP_001308040 (OMIM: 603675) 40S ribosomal protein ( 129) 221 59.8 1.7e-09 >>XP_005259194 (OMIM: 603675) PREDICTED: 40S ribosomal p (152 aa) initn: 1075 init1: 1075 opt: 1075 Z-score: 1386.0 bits: 262.5 E(85289): 1.8e-70 Smith-Waterman score: 1075; 100.0% identity (100.0% similar) in 152 aa overlap (1-152:1-152) 10 20 30 40 50 60 pF1KE5 MPSKGPLQSVQVFGRKKTATAVAHCKRGNGLIKVNGRPLEMIEPRTLQYKLLEPVLLLGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 MPSKGPLQSVQVFGRKKTATAVAHCKRGNGLIKVNGRPLEMIEPRTLQYKLLEPVLLLGK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ERFAGVDIRVRVKGGGHVAQIYGESQELGAWRRWLWEGGLHSAPVPFNCVSFSQLSVSPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 ERFAGVDIRVRVKGGGHVAQIYGESQELGAWRRWLWEGGLHSAPVPFNCVSFSQLSVSPS 70 80 90 100 110 120 130 140 150 pF1KE5 PKPWWPITRNMWMRLPRRRSKTSSSSMTGPCW :::::::::::::::::::::::::::::::: XP_005 PKPWWPITRNMWMRLPRRRSKTSSSSMTGPCW 130 140 150 >>NP_001011 (OMIM: 603675) 40S ribosomal protein S16 iso (146 aa) initn: 540 init1: 540 opt: 543 Z-score: 703.9 bits: 136.3 E(85289): 1.8e-32 Smith-Waterman score: 543; 94.3% identity (96.6% similar) in 88 aa overlap (1-88:1-88) 10 20 30 40 50 60 pF1KE5 MPSKGPLQSVQVFGRKKTATAVAHCKRGNGLIKVNGRPLEMIEPRTLQYKLLEPVLLLGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MPSKGPLQSVQVFGRKKTATAVAHCKRGNGLIKVNGRPLEMIEPRTLQYKLLEPVLLLGK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ERFAGVDIRVRVKGGGHVAQIYGESQELGAWRRWLWEGGLHSAPVPFNCVSFSQLSVSPS ::::::::::::::::::::::. : . NP_001 ERFAGVDIRVRVKGGGHVAQIYAIRQSISKALVAYYQKYVDEASKKEIKDILIQYDRTLL 70 80 90 100 110 120 >>NP_001308040 (OMIM: 603675) 40S ribosomal protein S16 (129 aa) initn: 221 init1: 221 opt: 221 Z-score: 291.6 bits: 59.8 E(85289): 1.7e-09 Smith-Waterman score: 385; 75.0% identity (77.3% similar) in 88 aa overlap (1-88:1-71) 10 20 30 40 50 60 pF1KE5 MPSKGPLQSVQVFGRKKTATAVAHCKRGNGLIKVNGRPLEMIEPRTLQYKLLEPVLLLGK ::::::::::::::::::::::::::::::::: :::::::::: NP_001 MPSKGPLQSVQVFGRKKTATAVAHCKRGNGLIK-----------------LLEPVLLLGK 10 20 30 40 70 80 90 100 110 120 pF1KE5 ERFAGVDIRVRVKGGGHVAQIYGESQELGAWRRWLWEGGLHSAPVPFNCVSFSQLSVSPS ::::::::::::::::::::::. : . NP_001 ERFAGVDIRVRVKGGGHVAQIYAIRQSISKALVAYYQKYVDEASKKEIKDILIQYDRTLL 50 60 70 80 90 100 152 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:20:44 2016 done: Mon Nov 7 22:20:44 2016 Total Scan time: 3.700 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]