FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3692, 182 aa
  1>>>pF1KE3692 182 - 182 aa - 182 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.9292+/-0.000588; mu= 11.7533+/- 0.036
 mean_var=96.2611+/-19.339, 0's: 0 Z-trim(115.5): 4 B-trim: 2 in 1/50
 Lambda= 0.130722
 statistics sampled from 16100 (16104) to 16100 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.818), E-opt: 0.2 (0.495), width: 16
 Scan time: 2.260

The best scores are:                                  opt  bits E(32554)
CCDS41495.1 MSRB2 gene_id:22921|Hs108|chr10   ( 182) 1286 251.4 2.3e-67
CCDS8973.1 MSRB3 gene_id:253827|Hs108|chr12   ( 192)  468  97.1 6.7e-21
CCDS31853.1 MSRB3 gene_id:253827|Hs108|chr12  ( 185)  466  96.7 8.4e-21

>>CCDS41495.1 MSRB2 gene_id:22921|Hs108|chr10 (182 aa)
 initn: 1286 init1: 1286 opt: 1286 Z-score: 1322.8 bits: 251.4 E(32554): 2.3e-67
Smith-Waterman score: 1286; 100.0% identity (100.0% similar) in 182 aa overlap (1-182:1-182)

               10        20        30        40        50        60
pF1KE3 MARLLWLLRGLTLGTAPRRAVRGQAGGGGPGTGPGLGEAGSLATCELPLAKSEWQKKLTP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MARLLWLLRGLTLGTAPRRAVRGQAGGGGPGTGPGLGEAGSLATCELPLAKSEWQKKLTP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 EQFYVTREKGTEPPFSGIYLNNKEAGMYHCVCCDSPLFSSEKKYCSGTGWPSFSEAHGTS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EQFYVTREKGTEPPFSGIYLNNKEAGMYHCVCCDSPLFSSEKKYCSGTGWPSFSEAHGTS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 GSDESHTGILRRLDTSLGSARTEVVCKQCEAHLGHVFPDGPGPNGQRFCINSVALKFKPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 GSDESHTGILRRLDTSLGSARTEVVCKQCEAHLGHVFPDGPGPNGQRFCINSVALKFKPR
              130       140       150       160       170       180

pF1KE3 KH
       ::
CCDS41 KH

>>CCDS8973.1 MSRB3 gene_id:253827|Hs108|chr12 (192 aa)
 initn: 434 init1: 247 opt: 468 Z-score: 488.7 bits: 97.1 E(32554): 6.7e-21
Smith-Waterman score: 468; 47.3% identity (72.3% similar) in 148 aa overlap (35-179:28-168)

           10        20        30        40           50        60
pF1KE3 LWLLRGLTLGTAPRRAVRGQAGGGGPGTGPGLGEAGSL---ATCELPLAKSEWQKKLTPE
                                     : ...::     .:.. ....: .:.:::
CCDS89    MSPRRTLPRPLSLCLSLCLCLCLAAALGSAQSGSCRDKKNCKVVFSQQELRKRLTPL
                  10        20        30        40        50

              70        80        90       100       110       120
pF1KE3 QFYVTREKGTEPPFSGIYLNNKEAGMYHCVCCDSPLFSSEKKYCSGTGWPSFSEAHGTSG
       :..::.:::::  : : : ..:. :.:.:: : .:::.:: :. ::.:::::   : . .
CCDS89 QYHVTQEKGTESAFEGEYTHHKDPGIYKCVVCGTPLFKSETKFDSGSGWPSF---HDVIN
        60        70        80        90       100          110

             130       140       150       160       170       180
pF1KE3 SDESHTGILRRLDTSLGSARTEVVCKQCEAHLGHVFPDGPGPNGQRFCINSVALKFKPRK
       :.    .:    : : :  :.:. :.:: :::::.: ::: :.:.:.::::.::.: :
CCDS89 SE----AITFTDDFSYGMHRVETSCSQCGAHLGHIFDDGPRPTGKRYCINSAALSFTPAD
              120       130       140       150       160       170

pF1KE3 H

CCDS89 SSGTAEGGSGVASPAQADKAEL
              180       190

>>CCDS31853.1 MSRB3 gene_id:253827|Hs108|chr12 (185 aa)
 initn: 434 init1: 247 opt: 466 Z-score: 486.9 bits: 96.7 E(32554): 8.4e-21
Smith-Waterman score: 466; 49.6% identity (74.1% similar) in 135 aa overlap (45-179:34-161)

           20        30        40        50        60        70
pF1KE3 TAPRRAVRGQAGGGGPGTGPGLGEAGSLATCELPLAKSEWQKKLTPEQFYVTREKGTEPP
                                     :.. ....: .:.::: :..::.:::::
CCDS31 FNLLHLVTKSQPVALRACGLPSGSCRDKKNCKVVFSQQELRKRLTPLQYHVTQEKGTESA
            10        20        30        40        50        60

           80        90       100       110       120       130
pF1KE3 FSGIYLNNKEAGMYHCVCCDSPLFSSEKKYCSGTGWPSFSEAHGTSGSDESHTGILRRLD
       : : : ..:. :.:.:: : .:::.:: :. ::.:::::   : . .:.    .:    :
CCDS31 FEGEYTHHKDPGIYKCVVCGTPLFKSETKFDSGSGWPSF---HDVINSE----AITFTDD
            70        80        90       100              110

          140       150       160       170       180
pF1KE3 TSLGSARTEVVCKQCEAHLGHVFPDGPGPNGQRFCINSVALKFKPRKH
        : :  :.:. :.:: :::::.: ::: :.:.:.::::.::.: :
CCDS31 FSYGMHRVETSCSQCGAHLGHIFDDGPRPTGKRYCINSAALSFTPADSSGTAEGGSGVAS
        120       130       140       150       160       170

CCDS31 PAQADKAEL
        180

182 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 22:50:41 2016 done: Sat Nov 5 22:50:41 2016
 Total Scan time: 2.260 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]