FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9278, 63 aa 1>>>pF1KE9278 63 - 63 aa - 63 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.5808+/-0.000234; mu= 9.7846+/- 0.015 mean_var=47.3597+/- 9.533, 0's: 0 Z-trim(121.4): 8 B-trim: 50 in 2/53 Lambda= 0.186367 statistics sampled from 37852 (37861) to 37852 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.825), E-opt: 0.2 (0.444), width: 16 Scan time: 3.380 The best scores are: opt bits E(85289) NP_659435 (OMIM: 615128) centromere protein X isof ( 63) 397 113.0 3.3e-26 XP_016879818 (OMIM: 615128) PREDICTED: centromere ( 76) 372 106.3 4.1e-24 NP_001257936 (OMIM: 615128) centromere protein X i ( 58) 345 99.0 5e-22 NP_001257935 (OMIM: 615128) centromere protein X i ( 81) 206 61.7 1.2e-10 XP_016879817 (OMIM: 615128) PREDICTED: centromere ( 94) 191 57.7 2.2e-09 NP_001317465 (OMIM: 615128) centromere protein X i ( 131) 191 57.7 2.9e-09 XP_016879816 (OMIM: 615128) PREDICTED: centromere ( 149) 191 57.8 3.3e-09 XP_016879815 (OMIM: 615128) PREDICTED: centromere ( 165) 191 57.8 3.6e-09 >>NP_659435 (OMIM: 615128) centromere protein X isoform (63 aa) initn: 397 init1: 397 opt: 397 Z-score: 591.4 bits: 113.0 E(85289): 3.3e-26 Smith-Waterman score: 397; 98.4% identity (100.0% similar) in 63 aa overlap (1-63:1-63) 10 20 30 40 50 60 pF1KE9 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRADVDQLEKVLPQLL ::::::::::::::::::::::::::::::::::::::::::::::.::::::::::::: NP_659 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRVDVDQLEKVLPQLL 10 20 30 40 50 60 pF1KE9 LDF ::: NP_659 LDF >>XP_016879818 (OMIM: 615128) PREDICTED: centromere prot (76 aa) initn: 388 init1: 372 opt: 372 Z-score: 553.8 bits: 106.3 E(85289): 4.1e-24 Smith-Waterman score: 372; 96.7% identity (100.0% similar) in 60 aa overlap (1-60:1-60) 10 20 30 40 50 60 pF1KE9 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRADVDQLEKVLPQLL ::::::::::::::::::::::::::::::::::::::::::::::.::::::::::::. XP_016 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRVDVDQLEKVLPQLV 10 20 30 40 50 60 pF1KE9 LDF XP_016 RERGSGRKWGCPAGWP 70 >>NP_001257936 (OMIM: 615128) centromere protein X isofo (58 aa) initn: 361 init1: 345 opt: 345 Z-score: 516.4 bits: 99.0 E(85289): 5e-22 Smith-Waterman score: 345; 96.4% identity (100.0% similar) in 56 aa overlap (1-56:1-56) 10 20 30 40 50 60 pF1KE9 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRADVDQLEKVLPQLL ::::::::::::::::::::::::::::::::::::::::::::::.:::::::.: NP_001 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRVDVDQLEKLLDF 10 20 30 40 50 pF1KE9 LDF >>NP_001257935 (OMIM: 615128) centromere protein X isofo (81 aa) initn: 383 init1: 206 opt: 206 Z-score: 312.2 bits: 61.7 E(85289): 1.2e-10 Smith-Waterman score: 351; 76.5% identity (77.8% similar) in 81 aa overlap (1-63:1-81) 10 20 30 40 pF1KE9 MEGAGAGSGFRKELVSRLLHLHFKDDKTK------------------EAAVRGVRQAQAE ::::::::::::::::::::::::::::: ::::::::::::: NP_001 MEGAGAGSGFRKELVSRLLHLHFKDDKTKVSGDALQLMVELLKVFVVEAAVRGVRQAQAE 10 20 30 40 50 60 50 60 pF1KE9 DALRADVDQLEKVLPQLLLDF ::::.:::::::::::::::: NP_001 DALRVDVDQLEKVLPQLLLDF 70 80 >>XP_016879817 (OMIM: 615128) PREDICTED: centromere prot (94 aa) initn: 363 init1: 191 opt: 191 Z-score: 289.4 bits: 57.7 E(85289): 2.2e-09 Smith-Waterman score: 326; 74.4% identity (76.9% similar) in 78 aa overlap (1-60:1-78) 10 20 30 40 pF1KE9 MEGAGAGSGFRKELVSRLLHLHFKDDKTK------------------EAAVRGVRQAQAE ::::::::::::::::::::::::::::: ::::::::::::: XP_016 MEGAGAGSGFRKELVSRLLHLHFKDDKTKVSGDALQLMVELLKVFVVEAAVRGVRQAQAE 10 20 30 40 50 60 50 60 pF1KE9 DALRADVDQLEKVLPQLLLDF ::::.::::::::::::. XP_016 DALRVDVDQLEKVLPQLVRERGSGRKWGCPAGWP 70 80 90 >>NP_001317465 (OMIM: 615128) centromere protein X isofo (131 aa) initn: 191 init1: 191 opt: 191 Z-score: 287.2 bits: 57.7 E(85289): 2.9e-09 Smith-Waterman score: 191; 100.0% identity (100.0% similar) in 29 aa overlap (1-29:1-29) 10 20 30 40 50 60 pF1KE9 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRADVDQLEKVLPQLL ::::::::::::::::::::::::::::: NP_001 MEGAGAGSGFRKELVSRLLHLHFKDDKTKVSGDALQLMVELLKVFVVGEPRAHAPSGHSI 10 20 30 40 50 60 >>XP_016879816 (OMIM: 615128) PREDICTED: centromere prot (149 aa) initn: 204 init1: 191 opt: 191 Z-score: 286.3 bits: 57.8 E(85289): 3.3e-09 Smith-Waterman score: 191; 100.0% identity (100.0% similar) in 29 aa overlap (1-29:1-29) 10 20 30 40 50 60 pF1KE9 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRADVDQLEKVLPQLL ::::::::::::::::::::::::::::: XP_016 MEGAGAGSGFRKELVSRLLHLHFKDDKTKVSGDALQLMVELLKVFVVGEPRAHAPSGHSI 10 20 30 40 50 60 >>XP_016879815 (OMIM: 615128) PREDICTED: centromere prot (165 aa) initn: 200 init1: 191 opt: 191 Z-score: 285.7 bits: 57.8 E(85289): 3.6e-09 Smith-Waterman score: 191; 100.0% identity (100.0% similar) in 29 aa overlap (1-29:1-29) 10 20 30 40 50 60 pF1KE9 MEGAGAGSGFRKELVSRLLHLHFKDDKTKEAAVRGVRQAQAEDALRADVDQLEKVLPQLL ::::::::::::::::::::::::::::: XP_016 MEGAGAGSGFRKELVSRLLHLHFKDDKTKVSGDALQLMVELLKVFVVGEPRAHAPSGHSI 10 20 30 40 50 60 63 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 10:16:34 2016 done: Tue Nov 8 10:16:35 2016 Total Scan time: 3.380 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]