FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3003, 70 aa
  1>>>pF1KE3003 70 - 70 aa - 70 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.9458+/-0.000504; mu= 8.0557+/- 0.030
 mean_var=46.0528+/- 9.040, 0's: 0 Z-trim(112.8): 21 B-trim: 2 in 1/52
 Lambda= 0.188993
 statistics sampled from 13484 (13504) to 13484 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.814), E-opt: 0.2 (0.415), width: 16
 Scan time: 1.120

The best scores are:                                opt  bits E(32554)
CCDS12687.1 GNG8 gene_id:94235|Hs108|chr19  (  70)  450 129.1 2.1e-31
CCDS32082.1 GNG2 gene_id:54331|Hs108|chr14  (  71)  351 102.1 2.9e-23
CCDS1607.1 GNG4 gene_id:2786|Hs108|chr1     (  75)  321  94.0 8.8e-21
CCDS8032.1 GNG3 gene_id:2785|Hs108|chr11    (  75)  290  85.5 3.1e-18
CCDS12091.1 GNG7 gene_id:2788|Hs108|chr19   (  68)  263  78.1 4.6e-16
CCDS30749.1 GNG12 gene_id:55970|Hs108|chr1  (  72)  261  77.6 7.2e-16
CCDS696.1 GNG5 gene_id:2787|Hs108|chr1      (  68)  202  61.5 4.7e-11
CCDS35107.1 GNG10 gene_id:2790|Hs108|chr9   (  68)  201  61.2 5.7e-11

>>CCDS12687.1 GNG8 gene_id:94235|Hs108|chr19 (70 aa)
 initn: 450 init1: 450 opt: 450 Z-score: 677.1 bits: 129.1 E(32554): 2.1e-31
Smith-Waterman score: 450; 100.0% identity (100.0% similar) in 70 aa overlap (1-70:1-70)

               10        20        30        40        50        60
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
               10        20        30        40        50        60

               70
pF1KE3 RDKRLFCVLL
       ::::::::::
CCDS12 RDKRLFCVLL
               70

>>CCDS32082.1 GNG2 gene_id:54331|Hs108|chr14 (71 aa)
 initn: 358 init1: 351 opt: 351 Z-score: 531.1 bits: 102.1 E(32554): 2.9e-23
Smith-Waterman score: 351; 71.0% identity (97.1% similar) in 69 aa overlap (2-70:3-71)

                10        20        30        40        50
pF1KE3  MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENP
         ::: :.::.::: :::::.:.::::.:::.:::.:.:.::.:::.:::.:::::.:::
CCDS32 MASNNTASIAQARKLVEQLKMEANIDRIKVSKAAADLMAYCEAHAKEDPLLTPVPASENP
               10        20        30        40        50        60

      60        70
pF1KE3 FRDKRLFCVLL
       ::.:..::..:
CCDS32 FREKKFFCAIL
               70

>>CCDS1607.1 GNG4 gene_id:2786|Hs108|chr1 (75 aa)
 initn: 326 init1: 300 opt: 321 Z-score: 486.5 bits: 94.0 E(32554): 8.8e-21
Smith-Waterman score: 321; 63.4% identity (94.4% similar) in 71 aa overlap (1-70:5-75)

                    10        20        30        40        50
pF1KE3     MSNN-MAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPA
           ::::  ..:..:::.:::::.:. .::.:::::::.:::.::.:...:::. ::::
CCDS16 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
               10        20        30        40        50        60

          60        70
pF1KE3 AENPFRDKRLFCVLL
       .:::::.:..::..:
CCDS16 SENPFREKKFFCTIL
               70

>>CCDS8032.1 GNG3 gene_id:2785|Hs108|chr11 (75 aa)
 initn: 290 init1: 290 opt: 290 Z-score: 440.9 bits: 85.5 E(32554): 3.1e-18
Smith-Waterman score: 290; 57.4% identity (92.6% similar) in 68 aa overlap (3-70:8-75)

                    10        20        30        40        50
pF1KE3      MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPA
              :.  .:..::: :::::.:... :.:::.:::.:...:..:: .:::.::::.
CCDS80 MKGETPVNSTMSIGQARKMVEQLKIEASLCRIKVSKAAADLMTYCDAHACEDPLITPVPT
               10        20        30        40        50        60

          60        70
pF1KE3 AENPFRDKRLFCVLL
       .:::::.:..::.::
CCDS80 SENPFREKKFFCALL
               70

>>CCDS12091.1 GNG7 gene_id:2788|Hs108|chr19 (68 aa)
 initn: 264 init1: 255 opt: 263 Z-score: 401.8 bits: 78.1 E(32554): 4.6e-16
Smith-Waterman score: 263; 57.1% identity (92.1% similar) in 63 aa overlap (8-70:7-68)

               10        20        30        40        50        60
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
              ::.::: ::::..:..:.:.:::.::..:...:: ::..:::.. :::.::::
CCDS12  MSATNNIAQARKLVEQLRIEAGIERIKVSKAASDLMSYCEQHARNDPLLVGVPASENPF
                10        20        30        40        50

               70
pF1KE3 RDKRLFCVLL
       .::.  :..:
CCDS12 KDKKP-CIIL
      60

>>CCDS30749.1 GNG12 gene_id:55970|Hs108|chr1 (72 aa)
 initn: 258 init1: 249 opt: 261 Z-score: 398.4 bits: 77.6 E(32554): 7.2e-16
Smith-Waterman score: 261; 50.7% identity (87.7% similar) in 73 aa overlap (1-70:1-72)

                  10        20        30        40        50
pF1KE3 MSNNMAK---IAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAE
       ::.. :.   ::.::.::.::.::..:.:.:::.:.:.:...:: ::..:::.  .:..:
CCDS30 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTSE
               10        20        30        40        50        60

        60        70
pF1KE3 NPFRDKRLFCVLL
       :::.::.  :..:
CCDS30 NPFKDKKT-CIIL
                70

>>CCDS696.1 GNG5 gene_id:2787|Hs108|chr1 (68 aa)
 initn: 181 init1: 173 opt: 202 Z-score: 311.9 bits: 61.5 E(32554): 4.7e-11
Smith-Waterman score: 202; 45.7% identity (80.0% similar) in 70 aa overlap (1-70:1-68)

               10        20        30        40        50        60
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
       ::.. ...:  .:.:.::.::....:.:::::::.:  ::  .:. :::.: : .. :::
CCDS69 MSGS-SSVAAMKKVVQQLRLEAGLNRVKVSQAAADLKQFCLQNAQHDPLLTGVSSSTNPF
                10        20        30        40        50

               70
pF1KE3 RDKRLFCVLL
       : ... : .:
CCDS69 RPQKV-CSFL
      60

>>CCDS35107.1 GNG10 gene_id:2790|Hs108|chr9 (68 aa)
 initn: 207 init1: 187 opt: 201 Z-score: 310.4 bits: 61.2 E(32554): 5.7e-11
Smith-Waterman score: 201; 50.0% identity (75.7% similar) in 70 aa overlap (1-70:1-68)

               10        20        30        40        50        60
pF1KE3 MSNNMAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAENPF
       ::.. :. .  .. :::::::....:.:::::::::  .:  .:  : :.. :::. :::
CCDS35 MSSG-ASASALQRLVEQLKLEAGVERIKVSQAAAELQQYCMQNACKDALLVGVPAGSNPF
                10        20        30        40        50

               70
pF1KE3 RDKRLFCVLL
       :. :  :.::
CCDS35 REPRS-CALL
      60

70 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 13:20:15 2016 done: Sun Nov 6 13:20:16 2016
 Total Scan time: 1.120 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]