FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1790, 297 aa
  1>>>pF1KE1790 297 - 297 aa - 297 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.0578+/-0.000735; mu= 13.8556+/- 0.045
 mean_var=95.2097+/-19.183, 0's: 0 Z-trim(111.7): 11  B-trim: 98 in 1/50
 Lambda= 0.131442
 statistics sampled from 12618 (12627) to 12618 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.735), E-opt: 0.2 (0.388), width:  16
 Scan time:  2.790

The best scores are:                             opt  bits E(32554)
CCDS6912.1 ENDOG gene_id:2021|Hs108|chr9 ( 297) 1990 386.8   1e-107
CCDS2680.1 EXOG gene_id:9941|Hs108|chr3  ( 368)  603 123.9  1.8e-28
CCDS46795.1 EXOG gene_id:9941|Hs108|chr3 ( 318)  544 112.6  3.8e-25

>>CCDS6912.1 ENDOG gene_id:2021|Hs108|chr9 (297 aa)
 initn: 1990 init1: 1990 opt: 1990  Z-score: 2047.3  bits: 386.8 E(32554): 1e-107
Smith-Waterman score: 1990; 99.7% identity (99.7% similar) in 297 aa overlap (1-297:1-297)

               10        20        30        40        50        60
pF1KE1 MRALRAGLTLALGAGLGAVVEGWRRRREDARAAPGLLGRLPVLPVAAAAELPPVPGGPRG
       ::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MRALRAGLTLASGAGLGAVVEGWRRRREDARAAPGLLGRLPVLPVAAAAELPPVPGGPRG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 PGELAKYGLPGLAQLKSRESYVLCYDPRTRGALWVVEQLRPERLRGDGDRRECDFREDDS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 PGELAKYGLPGLAQLKSRESYVLCYDPRTRGALWVVEQLRPERLRGDGDRRECDFREDDS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 VHAYHRATNADYRGSGFDRGHLAAAANHRWSQKAMDDTFYLSNVAPQVPHLNQNAWNNLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 VHAYHRATNADYRGSGFDRGHLAAAANHRWSQKAMDDTFYLSNVAPQVPHLNQNAWNNLE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 KYSRSLTRSYQNVYVCTGPLFLPRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEAAGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KYSRSLTRSYQNVYVCTGPLFLPRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEAAGG
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE1 QIELRTYVMPNAPVDEAIPLERFLVPIESIERASGLLFVPNILARAGSLKAITAGSK
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 QIELRTYVMPNAPVDEAIPLERFLVPIESIERASGLLFVPNILARAGSLKAITAGSK
              250       260       270       280       290

>>CCDS2680.1 EXOG gene_id:9941|Hs108|chr3 (368 aa)
 initn: 501 init1: 501 opt: 603  Z-score: 624.5  bits: 123.9 E(32554): 1.8e-28
Smith-Waterman score: 603; 39.1% identity (70.0% similar) in 243 aa overlap (55-292:53-293)

           30        40        50        60        70          80
pF1KE1 RRREDARAAPGLLGRLPVLPVAAAAELPPVPGGPRGPGELAKYGLPGLAQLKSR--ESYV
                                     : :    . : ..:.: :.  ..:   ...
CCDS26 GAVVGAAGAGLAALQFFRSQGAEGALTGKQPDGSAEKAVLEQFGFP-LTGTEARCYTNHA
             30        40        50        60         70        80

             90       100       110       120       130       140
pF1KE1 LCYDPRTRGALWVVEQLRPERLRGDGDRRECDFREDDSVHAYHRATNADYRGSGFDRGHL
       : ::   :   ::.:..   .. ::.::..: :. : ..     : : :: :::..:::.
CCDS26 LSYDQAKRVPRWVLEHISKSKIMGDADRKHCKFKPDPNIPPTFSAFNEDYVGSGWSRGHM
              90       100       110       120       130       140

            150       160       170       180       190       200
pF1KE1 AAAANHRWSQKAMDDTFYLSNVAPQVPHLNQNAWNNLEKYSRSLTRSYQNVYVCTGPLFL
       : :.:...:.::: .::::::..::    :.. :: .: : : ::. ...:.: .::: :
CCDS26 APAGNNKFSSKAMAETFYLSNIVPQDFDNNSGYWNRIEMYCRELTERFEDVWVVSGPLTL
             150       160       170       180       190       200

            210       220       230       240          250
pF1KE1 PRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEAAGGQIE---LRTYVMPNAPVDEAIP
       :.:..:::. :.:::::...::::.:..::.. . .. . :   : ..:.::  .
CCDS26 PQTRGDGKKIVSYQVIGEDNVAVPSHLYKVILARRSSVSTEPLALGAFVVPNEAIGFQPQ
             210       220       230       240       250       260

     260       270       280       290
pF1KE1 LERFLVPIESIERASGLLFVPNILARAGSLKAITAGSK
       : .: : ....:. :::.: :. : :..... :
CCDS26 LTEFQVSLQDLEKLSGLVFFPH-LDRTSDIRNICSVDTCKLLDFQEFTLYLSTRKIEGAR
             270       280        290       300       310       320

CCDS26 SVLRLEKIMENLKNAEIEPDDYFMSRYEKKLEELKAKEQSGTQIRKPS
              330       340       350       360

>>CCDS46795.1 EXOG gene_id:9941|Hs108|chr3 (318 aa)
 initn: 479 init1: 460 opt: 544  Z-score: 564.9  bits: 112.6 E(32554): 3.8e-25
Smith-Waterman score: 544; 40.9% identity (72.6% similar) in 208 aa overlap (88-292:40-243)

        60        70        80        90       100       110
pF1KE1 PRGPGELAKYGLPGLAQLKSRESYVLCYDPRTRGALWVVEQLRPERLRGDGDRRECDFRE
                                     :..::  ..   .:.   ::.::..: :.
CCDS46 LRGSRRFLSGFVAGAVVGAAGAGLAALQFFRSQGAEGALTGKQPD---GDADRKHCKFKP
      10        20        30        40        50           60

       120       130       140       150       160       170
pF1KE1 DDSVHAYHRATNADYRGSGFDRGHLAAAANHRWSQKAMDDTFYLSNVAPQVPHLNQNAWN
       : ..     : : :: :::..:::.: :.:...:.::: .::::::..::    :.. ::
CCDS46 DPNIPPTFSAFNEDYVGSGWSRGHMAPAGNNKFSSKAMAETFYLSNIVPQDFDNNSGYWN
         70        80        90       100       110       120

       180       190       200       210       220       230
pF1KE1 NLEKYSRSLTRSYQNVYVCTGPLFLPRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEA
        .: : : ::. ...:.: .::: ::.:..:::. :.:::::...::::.:..::.. .
CCDS46 RIEMYCRELTERFEDVWVVSGPLTLPQTRGDGKKIVSYQVIGEDNVAVPSHLYKVILARR
        130       140       150       160       170       180

       240          250       260       270       280       290
pF1KE1 AGGQIE---LRTYVMPNAPVDEAIPLERFLVPIESIERASGLLFVPNILARAGSLKAITA
       .. . :   : ..:.::  .     : .: : ....:. :::.: :. : :..... :
CCDS46 SSVSTEPLALGAFVVPNEAIGFQPQLTEFQVSLQDLEKLSGLVFFPH-LDRTSDIRNICS
        190       200       210       220       230        240

pF1KE1 GSK

CCDS46 VDTCKLLDFQEFTLYLSTRKIEGARSVLRLEKIMENLKNAEIEPDDYFMSRYEKKLEELK
         250       260       270       280       290       300

297 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov  6 15:41:08 2016 done: Sun Nov  6 15:41:08 2016
 Total Scan time:  2.790 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]