FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6586, 166 aa
  1>>>pF1KE6586 166 - 166 aa - 166 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.3203+/-0.000791; mu= 4.7188+/- 0.048
 mean_var=176.5119+/-37.380, 0's: 0 Z-trim(114.6): 26 B-trim: 0 in 0/51
 Lambda= 0.096536
 statistics sampled from 15139 (15161) to 15139 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.793), E-opt: 0.2 (0.466), width: 16
 Scan time: 1.080

The best scores are:                                       opt  bits E(32554)
CCDS14425.1 ZCCHC13 gene_id:389874|Hs108|chrX      ( 166) 1258 186.0   9e-48
CCDS54637.1 CNBP gene_id:7555|Hs108|chr3           ( 170)  757 116.3 9.3e-27
CCDS46908.1 CNBP gene_id:7555|Hs108|chr3           ( 172)  748 115.0 2.2e-26
CCDS3056.1 CNBP gene_id:7555|Hs108|chr3            ( 177)  744 114.5 3.3e-26
CCDS46907.1 CNBP gene_id:7555|Hs108|chr3           ( 178)  732 112.8 1.1e-25
CCDS46906.1 CNBP gene_id:7555|Hs108|chr3           ( 179)  626  98.0   3e-21

>>CCDS14425.1 ZCCHC13 gene_id:389874|Hs108|chrX (166 aa)
 initn: 1258 init1: 1258 opt: 1258 Z-score: 971.1 bits: 186.0 E(32554): 9e-48
Smith-Waterman score: 1258; 100.0% identity (100.0% similar) in 166 aa overlap (1-166:1-166)

               10        20        30        40        50        60
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGRRGGGHGRGSQCGSTTLSYTCYCCGESGRNAKNC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MSSKDFFACGHSGHWARGCPRGGAGGRRGGGHGRGSQCGSTTLSYTCYCCGESGRNAKNC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 VLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYSCGKLGH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 VLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYSCGKLGH
               70        80        90       100       110       120

              130       140       150       160
pF1KE6 IQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 IQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
              130       140       150       160

>>CCDS54637.1 CNBP gene_id:7555|Hs108|chr3 (170 aa)
 initn: 757 init1: 757 opt: 757 Z-score: 593.9 bits: 116.3 E(32554): 9.3e-27
Smith-Waterman score: 757; 68.3% identity (82.8% similar) in 145 aa overlap (1-145:1-145)

               10        20        30        40        50        60
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGRRGGGHGRGSQCGSTTLSYTCYCCGESGRNAKNC
       :::.. : ::.:::::: :: ::. ::   ..::: :  :..:   :: :::::. ::.:
CCDS54 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 VLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYSCGKLGH
        :  . ::::::.::::::::.:::::.: ::.::. ::::::::.  ::::::::..::
CCDS54 DLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGEFGH
               70        80        90       100       110       120

              130       140       150       160
pF1KE6 IQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       :::::..:::::::: :::::::::
CCDS54 IQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>CCDS46908.1 CNBP gene_id:7555|Hs108|chr3 (172 aa)
 initn: 747 init1: 518 opt: 748 Z-score: 587.1 bits: 115.0 E(32554): 2.2e-26
Smith-Waterman score: 748; 67.3% identity (82.3% similar) in 147 aa overlap (1-145:1-147)

               10        20        30        40        50        60
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGRRGGGHGRGSQCGSTTLSYTCYCCGESGRNAKNC
       :::.. : ::.:::::: :: ::. ::   ..::: :  :..:   :: :::::. ::.:
CCDS46 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDC
               10        20        30        40        50        60

                 70        80        90       100       110
pF1KE6 VLLGNI--CYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYSCGKL
        :  ..  ::::::.::::::::.:::::.: ::.::. ::::::::.  ::::::::..
CCDS46 DLQEDVEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGEF
               70        80        90       100       110       120

      120       130       140       150       160
pF1KE6 GHIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       :::::::..:::::::: :::::::::
CCDS46 GHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>CCDS3056.1 CNBP gene_id:7555|Hs108|chr3 (177 aa)
 initn: 661 init1: 628 opt: 744 Z-score: 583.9 bits: 114.5 E(32554): 3.3e-26
Smith-Waterman score: 744; 66.4% identity (80.3% similar) in 152 aa overlap (1-145:1-152)

               10        20          30             40        50
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGR--RGGGHG-----RGSQCGSTTLSYTCYCCGES
       :::.. : ::.:::::: :: ::. ::  :. :.:     :: :  :..:   :: ::::
CCDS30 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGES
               10        20        30        40        50        60

            60        70        80        90       100       110
pF1KE6 GRNAKNCVLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCY
       :. ::.: :  . ::::::.::::::::.:::::.: ::.::. ::::::::.  :::::
CCDS30 GHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCY
               70        80        90       100       110       120

           120       130       140       150       160
pF1KE6 SCGKLGHIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       :::..:::::::..:::::::: :::::::::
CCDS30 SCGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>CCDS46907.1 CNBP gene_id:7555|Hs108|chr3 (178 aa)
 initn: 760 init1: 518 opt: 732 Z-score: 574.8 bits: 112.8 E(32554): 1.1e-25
Smith-Waterman score: 732; 66.0% identity (79.7% similar) in 153 aa overlap (1-145:1-153)

               10        20          30             40        50
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGR--RGGGHG-----RGSQCGSTTLSYTCYCCGES
       :::.. : ::.:::::: :: ::. ::  :. :.:     :: :  :..:   :: ::::
CCDS46 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGES
               10        20        30        40        50        60

            60         70        80        90       100       110
pF1KE6 GRNAKNCVLLGN-ICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKC
       :. ::.: :  .  ::::::.::::::::.:::::.: ::.::. ::::::::.  ::::
CCDS46 GHLAKDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKC
               70        80        90       100       110       120

            120       130       140       150       160
pF1KE6 YSCGKLGHIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       ::::..:::::::..:::::::: :::::::::
CCDS46 YSCGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>CCDS46906.1 CNBP gene_id:7555|Hs108|chr3 (179 aa)
 initn: 760 init1: 518 opt: 626 Z-score: 495.0 bits: 98.0 E(32554): 3e-21
Smith-Waterman score: 735; 65.6% identity (79.9% similar) in 154 aa overlap (1-145:1-154)

               10        20          30             40        50
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGR--RGGGHG-----RGSQCGSTTLSYTCYCCGES
       :::.. : ::.:::::: :: ::. ::  :. :.:     :: :  :..:   :: ::::
CCDS46 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGES
               10        20        30        40        50        60

            60          70        80        90       100       110
pF1KE6 GRNAKNCVLLGNI--CYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQK
       :. ::.: :  ..  ::::::.::::::::.:::::.: ::.::. ::::::::.  :::
CCDS46 GHLAKDCDLQEDVEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQK
               70        80        90       100       110       120

             120       130       140       150       160
pF1KE6 CYSCGKLGHIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       :::::..:::::::..:::::::: :::::::::
CCDS46 CYSCGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

166 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 14:36:05 2016 done: Tue Nov 8 14:36:05 2016
 Total Scan time: 1.080 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]