FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5243, 204 aa 1>>>pF1KE5243 204 - 204 aa - 204 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5338+/-0.000859; mu= 12.5460+/- 0.052 mean_var=69.7793+/-13.491, 0's: 0 Z-trim(107.4): 18 B-trim: 2 in 1/50 Lambda= 0.153536 statistics sampled from 9561 (9569) to 9561 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.685), E-opt: 0.2 (0.294), width: 16 Scan time: 1.870 The best scores are: opt bits E(32554) CCDS33352.1 NABP1 gene_id:64859|Hs108|chr2 ( 204) 1378 313.9 4.4e-86 CCDS58745.1 NABP1 gene_id:64859|Hs108|chr2 ( 124) 856 198.1 1.9e-51 CCDS8911.1 NABP2 gene_id:79035|Hs108|chr12 ( 211) 718 167.7 4.6e-42 >>CCDS33352.1 NABP1 gene_id:64859|Hs108|chr2 (204 aa) initn: 1378 init1: 1378 opt: 1378 Z-score: 1658.9 bits: 313.9 E(32554): 4.4e-86 Smith-Waterman score: 1378; 99.5% identity (100.0% similar) in 204 aa overlap (1-204:1-204) 10 20 30 40 50 60 pF1KE5 MNRVNDPLIFIRDIKPGLKNLNVVFIVLEIGRVTKTKDGHEVRSCKVADKTGSITISVWD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MNRVNDPLIFIRDIKPGLKNLNVVFIVLEIGRVTKTKDGHEVRSCKVADKTGSITISVWD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EIGGLIQPGDIIRLTRGYASMWKGCLTLYTGRGGELQKIGEFCMVYSEVPNFSEPNPDYR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 EIGGLIQPGDIIRLTRGYASMWKGCLTLYTGRGGELQKIGEFCMVYSEVPNFSEPNPDYR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GQQNKGAQSEQKNNSMNSNMGTGTFGPVGNGVHTGPESREHQFSHAGRSNGRGLINPQLQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GQQNKGAQSEQKNNSMNSNMGTGTFGPVGNGVHTGPESREHQFSHAGRSNGRGLINPQLQ 130 140 150 160 170 180 190 200 pF1KE5 GTASNQTVMTTISNGRDPRRAFNR ::::::::::::::::::::::.: CCDS33 GTASNQTVMTTISNGRDPRRAFKR 190 200 >>CCDS58745.1 NABP1 gene_id:64859|Hs108|chr2 (124 aa) initn: 869 init1: 856 opt: 856 Z-score: 1037.3 bits: 198.1 E(32554): 1.9e-51 Smith-Waterman score: 856; 99.2% identity (100.0% similar) in 124 aa overlap (81-204:1-124) 60 70 80 90 100 110 pF1KE5 TGSITISVWDEIGGLIQPGDIIRLTRGYASMWKGCLTLYTGRGGELQKIGEFCMVYSEVP :::::::::::::::::::::::::::::: CCDS58 MWKGCLTLYTGRGGELQKIGEFCMVYSEVP 10 20 30 120 130 140 150 160 170 pF1KE5 NFSEPNPDYRGQQNKGAQSEQKNNSMNSNMGTGTFGPVGNGVHTGPESREHQFSHAGRSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 NFSEPNPDYRGQQNKGAQSEQKNNSMNSNMGTGTFGPVGNGVHTGPESREHQFSHAGRSN 40 50 60 70 80 90 180 190 200 pF1KE5 GRGLINPQLQGTASNQTVMTTISNGRDPRRAFNR ::::::::::::::::::::::::::::::::.: CCDS58 GRGLINPQLQGTASNQTVMTTISNGRDPRRAFKR 100 110 120 >>CCDS8911.1 NABP2 gene_id:79035|Hs108|chr12 (211 aa) initn: 773 init1: 700 opt: 718 Z-score: 868.6 bits: 167.7 E(32554): 4.6e-42 Smith-Waterman score: 734; 56.7% identity (76.4% similar) in 208 aa overlap (10-204:6-211) 10 20 30 40 50 60 pF1KE5 MNRVNDPLIFIRDIKPGLKNLNVVFIVLEIGRVTKTKDGHEVRSCKVADKTGSITISVWD :..::::::::::..::::: :::::::::::::.::::::::::.::::: CCDS89 MTTETFVKDIKPGLKNLNLIFIVLETGRVTKTKDGHEVRTCKVADKTGSINISVWD 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 EIGGLIQPGDIIRLTRGYASMWKGCLTLYTGRGGELQKIGEFCMVYSEVPNFSEPNPDYR ..:.:::::::::::.::::..::::::::::::.::::::::::::::::::::::.: CCDS89 DVGNLIQPGDIIRLTKGYASVFKGCLTLYTGRGGDLQKIGEFCMVYSEVPNFSEPNPEYS 60 70 80 90 100 110 130 140 150 160 pF1KE5 GQQ--NKGAQSEQKNNSMNSNMGTGTFGPV-----GNGVHTGP-----ESREHQFSHAGR :: ::..:.... .. . . : .. .:. :::. . : : :: CCDS89 TQQAPNKAVQNDSNPSASQPTTGPSAASPASENQNGNGLSAPPGPGGGPHPPHTPSHPPS 120 130 140 150 160 170 170 180 190 200 pF1KE5 SN-GRGLINPQLQGTASNQTVMTTISNGRDPRRAFNR . :. : : . .. . .:::.. ::. .: CCDS89 TRITRSQPNHTPAGPPGPSS--NPVSNGKETRRSSKR 180 190 200 210 204 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 07:23:16 2016 done: Tue Nov 8 07:23:16 2016 Total Scan time: 1.870 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]