FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3677, 268 aa 1>>>pF1KE3677 268 - 268 aa - 268 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4756+/-0.000705; mu= 8.8601+/- 0.043 mean_var=193.4639+/-38.625, 0's: 0 Z-trim(118.2): 29 B-trim: 171 in 1/51 Lambda= 0.092209 statistics sampled from 19053 (19082) to 19053 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.857), E-opt: 0.2 (0.586), width: 16 Scan time: 3.050 The best scores are: opt bits E(32554) CCDS10355.1 MESP1 gene_id:55897|Hs108|chr15 ( 268) 1889 262.3 2.6e-70 CCDS42078.1 MESP2 gene_id:145873|Hs108|chr15 ( 397) 702 104.6 1.2e-22 >>CCDS10355.1 MESP1 gene_id:55897|Hs108|chr15 (268 aa) initn: 1889 init1: 1889 opt: 1889 Z-score: 1375.8 bits: 262.3 E(32554): 2.6e-70 Smith-Waterman score: 1889; 100.0% identity (100.0% similar) in 268 aa overlap (1-268:1-268) 10 20 30 40 50 60 pF1KE3 MAQPLCPPLSESWMLSAAWGPTRRPPPSDKDCGRSLVSSPDSWGSTPADSPVASPARPGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MAQPLCPPLSESWMLSAAWGPTRRPPPSDKDCGRSLVSSPDSWGSTPADSPVASPARPGT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 LRDPRAPSVGRRGARSSRLGSGQRQSASEREKLRMRTLARALHELRRFLPPSVAPAGQSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LRDPRAPSVGRRGARSSRLGSGQRQSASEREKLRMRTLARALHELRRFLPPSVAPAGQSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 TKIETLRLAIRYIGHLSAVLGLSEESLQRRCRQRGDAGSPRGCPLCPDDCPAQMQTRTQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 TKIETLRLAIRYIGHLSAVLGLSEESLQRRCRQRGDAGSPRGCPLCPDDCPAQMQTRTQA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 EGQGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EGQGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPS 190 200 210 220 230 240 250 260 pF1KE3 PLLPGDVLALLETWMPLSPLEWLPEEPK :::::::::::::::::::::::::::: CCDS10 PLLPGDVLALLETWMPLSPLEWLPEEPK 250 260 >>CCDS42078.1 MESP2 gene_id:145873|Hs108|chr15 (397 aa) initn: 720 init1: 620 opt: 702 Z-score: 520.3 bits: 104.6 E(32554): 1.2e-22 Smith-Waterman score: 794; 52.1% identity (62.7% similar) in 303 aa overlap (1-268:1-286) 10 20 30 40 50 pF1KE3 MAQ-PLCPPLS----ESWMLSAAWGPTRRPPPSDKDCGRSLVSSPDSWGSTPADSP--VA ::: : :: : . :... .:: . . : . : .:: :: :: : :. . CCDS42 MAQSP--PPQSLLGHDHWIFAQGWGWA-----GHWD-STSPASSSDSSGSCPCDGARGLP 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 SPARPG-TLRDPRAPSVGRRGARSSRLGSGQRQSASEREKLRMRTLARALHELRRFLPPS .: :. . : .: .. : ::.. : ::::::::::::::::::::::::::::::: CCDS42 QPQPPSCSSRAAEAAATTPRRARTGPAG-GQRQSASEREKLRMRTLARALHELRRFLPPS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 VAPAGQSLTKIETLRLAIRYIGHLSAVLGLSEESLQRRCRQRGDAGSPRGCPLCPDDCPA .::::::::::::::::::::::::::::::::::: : ::::::::: ::::::: :: CCDS42 LAPAGQSLTKIETLRLAIRYIGHLSAVLGLSEESLQCRRRQRGDAGSPWGCPLCPDRGPA 120 130 140 150 160 170 180 190 200 210 pF1KE3 QMQTRTQAEGQGQGRGLG--------------------LVSAVRAGASWGSPPACPGARA . ::.....:::::.: : ::::: : :::::: :::::.: CCDS42 EAQTQAEGQGQGQGQGQGQGQGQGQGQGQGQGQGRRPGLVSAVLAEASWGSPSACPGAQA 180 190 200 210 220 230 220 230 240 250 260 pF1KE3 APE-------PRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPE ::: :: : . ::. : ::: : : : : . : CCDS42 APERLGRGVHDTDPWA--TPPYCPKIQ----SPPYSSQGTTSDASL--WTPPQGCPWTQS 240 250 260 270 280 pF1KE3 EPK :. CCDS42 SPEPRNPPVPWTAAPATLELAAVYQGLSVSPEPCLSLGAPSLLPHPSCQRLQPQTPGRCW 290 300 310 320 330 340 268 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:10:00 2016 done: Sun Nov 6 14:10:01 2016 Total Scan time: 3.050 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]