FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1520, 118 aa
  1>>>pF1KE1520 118 - 118 aa - 118 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.9905+/-0.000231; mu= 3.9389+/- 0.015
 mean_var=112.9988+/-22.745, 0's: 0 Z-trim(125.4): 3 B-trim: 360 in 1/59
 Lambda= 0.120653
 statistics sampled from 48948 (48951) to 48948 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.868), E-opt: 0.2 (0.574), width: 16
 Scan time: 5.460

The best scores are:                                       opt   bits E(85289)
NP_004086 (OMIM: 602223) eukaryotic translation in ( 118)  806  149.2 1.4e-36
NP_004087 (OMIM: 602224) eukaryotic translation in ( 120)  415   81.1 4.5e-16
NP_003723 (OMIM: 603483) eukaryotic translation in ( 100)  328   66.0 1.4e-11

>>NP_004086 (OMIM: 602223) eukaryotic translation initia (118 aa)
 initn: 806 init1: 806 opt: 806 Z-score: 777.5 bits: 149.2 E(85289): 1.4e-36
Smith-Waterman score: 806; 100.0% identity (100.0% similar) in 118 aa overlap (1-118:1-118)

               10        20        30        40        50        60
pF1KE1 MSGGSSCSQTPSRAIPATRRVVLGDGVQLPPGDYSTTPGGTLFSTTPGGTRIIYDRKFLM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MSGGSSCSQTPSRAIPATRRVVLGDGVQLPPGDYSTTPGGTLFSTTPGGTRIIYDRKFLM
               10        20        30        40        50        60

               70        80        90       100       110
pF1KE1 ECRNSPVTKTPPRDLPTIPGVTSPSSDEPPMEASQSHLRNSPEDKRAGGEESQFEMDI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 ECRNSPVTKTPPRDLPTIPGVTSPSSDEPPMEASQSHLRNSPEDKRAGGEESQFEMDI
               70        80        90       100       110

>>NP_004087 (OMIM: 602224) eukaryotic translation initia (120 aa)
 initn: 475 init1: 319 opt: 415 Z-score: 409.5 bits: 81.1 E(85289): 4.5e-16
Smith-Waterman score: 415; 57.1% identity (82.4% similar) in 119 aa overlap (2-118:4-120)

                 10        20        30        40        50
pF1KE1   MSGGSSCSQTPSRAIPATRRVVLGDGVQLPPGDYSTTPGGTLFSTTPGGTRIIYDRKF
          :.::. . . ::::: :: :...:..:::  :: :::::::::::::::::::::::
NP_004 MSSSAGSGHQPSQSRAIP-TRTVAISDAAQLPH-DYCTTPGGTLFSTTPGGTRIIYDRKF
               10         20        30         40        50

       60        70        80          90       100       110
pF1KE1 LMECRNSPVTKTPPRDLPTIPGVTSPSS--DEPPMEASQSHLRNSPEDKRAGGEESQFEM
       :.. ::::...:::  ::.:::::::..  ..  .:... .  :. . :.: :...::::
NP_004 LLDRRNSPMAQTPPCHLPNIPGVTSPGTLIEDSKVEVNNLNNLNNHDRKHAVGDDAQFEM
       60        70        80        90       100       110

pF1KE1 DI
       ::
NP_004 DI
      120

>>NP_003723 (OMIM: 603483) eukaryotic translation initia (100 aa)
 initn: 379 init1: 295 opt: 328 Z-score: 328.9 bits: 66.0 E(85289): 1.4e-11
Smith-Waterman score: 330; 50.8% identity (66.9% similar) in 118 aa overlap (1-118:1-100)

               10        20        30        40        50        60
pF1KE1 MSGGSSCSQTPSRAIPATRRVVLGDGVQLPPGDYSTTPGGTLFSTTPGGTRIIYDRKFLM
       :: ..::   :   ::. :        :::   :::::::::..:::::::::::::::.
NP_003 MSTSTSC---P---IPGGRD-------QLPDC-YSTTPGGTLYATTPGGTRIIYDRKFLL
                     10                20        30        40

               70        80        90       100       110
pF1KE1 ECRNSPVTKTPPRDLPTIPGVTSPSSDEPPMEASQSHLRNSPEDKRAGGEESQFEMDI
       ::.:::...:::  :: :::::.:    :    :. .  .  : ..   ...::::::
NP_003 ECKNSPIARTPPCCLPQIPGVTTP----PTAPLSKLEELKEQETEEEIPDDAQFEMDI
         50        60        70            80        90       100

118 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 20:13:27 2016 done: Sun Nov 6 20:13:28 2016
 Total Scan time: 5.460 Total Display time: -0.040

Function used was FASTA [36.3.4 Apr, 2011]