FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6586, 166 aa
  1>>>pF1KE6586 166 - 166 aa - 166 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.2720+/-0.000319; mu= 10.6301+/- 0.020
 mean_var=173.0404+/-36.614, 0's: 0 Z-trim(121.3): 13 B-trim: 0 in 0/54
 Lambda= 0.097499
 statistics sampled from 37665 (37680) to 37665 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.773), E-opt: 0.2 (0.442), width: 16
 Scan time: 4.690

The best scores are:                                       opt bits E(85289)
NP_001120668 (OMIM: 116955,602668) cellular nuclei ( 170)  757 117.3 1.2e-26
NP_001120666 (OMIM: 116955,602668) cellular nuclei ( 172)  748 116.0 2.9e-26
NP_001120667 (OMIM: 116955,602668) cellular nuclei ( 171)  745 115.6 3.9e-26
NP_003409 (OMIM: 116955,602668) cellular nucleic a ( 177)  744 115.4 4.4e-26
NP_001120665 (OMIM: 116955,602668) cellular nuclei ( 178)  732 113.8 1.4e-25
NP_001120664 (OMIM: 116955,602668) cellular nuclei ( 179)  626  98.9 4.4e-21

>>NP_001120668 (OMIM: 116955,602668) cellular nucleic ac (170 aa)
 initn: 757 init1: 757 opt: 757 Z-score: 599.3 bits: 117.3 E(85289): 1.2e-26
Smith-Waterman score: 757; 68.3% identity (82.8% similar) in 145 aa overlap (1-145:1-145)

               10        20        30        40        50        60
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGRRGGGHGRGSQCGSTTLSYTCYCCGESGRNAKNC
       :::.. : ::.:::::: :: ::. ::   ..::: :  :..:   :: :::::. ::.:
NP_001 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 VLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYSCGKLGH
        :  . ::::::.::::::::.:::::.: ::.::. ::::::::.  ::::::::..::
NP_001 DLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGEFGH
               70        80        90       100       110       120

              130       140       150       160
pF1KE6 IQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       :::::..:::::::: :::::::::
NP_001 IQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>NP_001120666 (OMIM: 116955,602668) cellular nucleic ac (172 aa)
 initn: 747 init1: 518 opt: 748 Z-score: 592.4 bits: 116.0 E(85289): 2.9e-26
Smith-Waterman score: 748; 67.3% identity (82.3% similar) in 147 aa overlap (1-145:1-147)

               10        20        30        40        50        60
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGRRGGGHGRGSQCGSTTLSYTCYCCGESGRNAKNC
       :::.. : ::.:::::: :: ::. ::   ..::: :  :..:   :: :::::. ::.:
NP_001 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDC
               10        20        30        40        50        60

                 70        80        90       100       110
pF1KE6 VLLGNI--CYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYSCGKL
        :  ..  ::::::.::::::::.:::::.: ::.::. ::::::::.  ::::::::..
NP_001 DLQEDVEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGEF
               70        80        90       100       110       120

      120       130       140       150       160
pF1KE6 GHIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       :::::::..:::::::: :::::::::
NP_001 GHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>NP_001120667 (OMIM: 116955,602668) cellular nucleic ac (171 aa)
 initn: 747 init1: 518 opt: 745 Z-score: 590.1 bits: 115.6 E(85289): 3.9e-26
Smith-Waterman score: 745; 67.8% identity (82.2% similar) in 146 aa overlap (1-145:1-146)

               10        20        30        40        50        60
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGRRGGGHGRGSQCGSTTLSYTCYCCGESGRNAKNC
       :::.. : ::.:::::: :: ::. ::   ..::: :  :..:   :: :::::. ::.:
NP_001 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGFQFVSSSLPDICYRCGESGHLAKDC
               10        20        30        40        50        60

                70        80        90       100       110
pF1KE6 VLLGN-ICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYSCGKLG
        :  .  ::::::.::::::::.:::::.: ::.::. ::::::::.  ::::::::..:
NP_001 DLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCYSCGEFG
               70        80        90       100       110       120

     120       130       140       150       160
pF1KE6 HIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       ::::::..:::::::: :::::::::
NP_001 HIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>NP_003409 (OMIM: 116955,602668) cellular nucleic acid- (177 aa)
 initn: 661 init1: 628 opt: 744 Z-score: 589.2 bits: 115.4 E(85289): 4.4e-26
Smith-Waterman score: 744; 66.4% identity (80.3% similar) in 152 aa overlap (1-145:1-152)

               10        20          30             40        50
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGR--RGGGHG-----RGSQCGSTTLSYTCYCCGES
       :::.. : ::.:::::: :: ::. ::  :. :.:     :: :  :..:   :: ::::
NP_003 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGES
               10        20        30        40        50        60

            60        70        80        90       100       110
pF1KE6 GRNAKNCVLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCY
       :. ::.: :  . ::::::.::::::::.:::::.: ::.::. ::::::::.  :::::
NP_003 GHLAKDCDLQEDACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKCY
               70        80        90       100       110       120

           120       130       140       150       160
pF1KE6 SCGKLGHIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       :::..:::::::..:::::::: :::::::::
NP_003 SCGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>NP_001120665 (OMIM: 116955,602668) cellular nucleic ac (178 aa)
 initn: 760 init1: 518 opt: 732 Z-score: 580.0 bits: 113.8 E(85289): 1.4e-25
Smith-Waterman score: 732; 66.0% identity (79.7% similar) in 153 aa overlap (1-145:1-153)

               10        20          30             40        50
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGR--RGGGHG-----RGSQCGSTTLSYTCYCCGES
       :::.. : ::.:::::: :: ::. ::  :. :.:     :: :  :..:   :: ::::
NP_001 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGES
               10        20        30        40        50        60

            60         70        80        90       100       110
pF1KE6 GRNAKNCVLLGN-ICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKC
       :. ::.: :  .  ::::::.::::::::.:::::.: ::.::. ::::::::.  ::::
NP_001 GHLAKDCDLQEDEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQKC
               70        80        90       100       110       120

            120       130       140       150       160
pF1KE6 YSCGKLGHIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       ::::..:::::::..:::::::: :::::::::
NP_001 YSCGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

>>NP_001120664 (OMIM: 116955,602668) cellular nucleic ac (179 aa)
 initn: 760 init1: 518 opt: 626 Z-score: 499.4 bits: 98.9 E(85289): 4.4e-21
Smith-Waterman score: 735; 65.6% identity (79.9% similar) in 154 aa overlap (1-145:1-154)

               10        20          30             40        50
pF1KE6 MSSKDFFACGHSGHWARGCPRGGAGGR--RGGGHG-----RGSQCGSTTLSYTCYCCGES
       :::.. : ::.:::::: :: ::. ::  :. :.:     :: :  :..:   :: ::::
NP_001 MSSNECFKCGRSGHWARECPTGGGRGRGMRSRGRGGFTSDRGFQFVSSSLPDICYRCGES
               10        20        30        40        50        60

            60          70        80        90       100       110
pF1KE6 GRNAKNCVLLGNI--CYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQK
       :. ::.: :  ..  ::::::.::::::::.:::::.: ::.::. ::::::::.  :::
NP_001 GHLAKDCDLQEDVEACYNCGRGGHIAKDCKEPKREREQCCYNCGKPGHLARDCDHADEQK
               70        80        90       100       110       120

             120       130       140       150       160
pF1KE6 CYSCGKLGHIQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQ
       :::::..:::::::..:::::::: :::::::::
NP_001 CYSCGEFGHIQKDCTKVKCYRCGETGHVAINCSKTSEVNCYRCGESGHLARECTIEATA
              130       140       150       160       170

166 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 14:36:05 2016 done: Tue Nov 8 14:36:06 2016
 Total Scan time: 4.690 Total Display time: -0.040

Function used was FASTA [36.3.4 Apr, 2011]