FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6617, 336 aa
  1>>>pF1KE6617 336 - 336 aa - 336 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.8602+/-0.000333; mu= 13.2660+/- 0.021
 mean_var=75.9178+/-14.640, 0's: 0 Z-trim(116.6): 39 B-trim: 66 in 1/54
 Lambda= 0.147198
 statistics sampled from 27953 (27983) to 27953 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.7), E-opt: 0.2 (0.328), width: 16
 Scan time: 5.950

The best scores are:                                      opt bits E(85289)
NP_004804 (OMIM: 603360,614876,614877) peroxisomal ( 336) 2234 483.6 2.6e-136
NP_476515 (OMIM: 603360,614876,614877) peroxisomal ( 346) 2084 451.7 1e-126
XP_011518776 (OMIM: 603360,614876,614877) PREDICTE ( 295) 1925 417.9 1.3e-116

>>NP_004804 (OMIM: 603360,614876,614877) peroxisomal bio (336 aa)
 initn: 2234 init1: 2234 opt: 2234 Z-score: 2568.3 bits: 483.6 E(85289): 2.6e-136
Smith-Waterman score: 2234; 100.0% identity (100.0% similar) in 336 aa overlap (1-336:1-336)

               10        20        30        40        50        60
pF1KE6 MEKLRLLGLRYQEYVTRHPAATAQLETAVRGFSYLLAGRFADSHELSELVYSASNLLVLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MEKLRLLGLRYQEYVTRHPAATAQLETAVRGFSYLLAGRFADSHELSELVYSASNLLVLL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 NDGILRKELRKKLPVSLSQQKLLTWLSVLECVEVFMEMGAAKVWGEVGRWLVIALIQLAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 NDGILRKELRKKLPVSLSQQKLLTWLSVLECVEVFMEMGAAKVWGEVGRWLVIALIQLAK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 AVLRMLLLLWFKAGLQTSPPIVPLDRETQAQPPDGDHSPGNHEQSYVGKRSNRVVRTLQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 AVLRMLLLLWFKAGLQTSPPIVPLDRETQAQPPDGDHSPGNHEQSYVGKRSNRVVRTLQN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 TPSLHSRHWGAPQQREGRQQQHHEELSATPTPLGLQETIAEFLYIARPLLHLLSLGLWGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 TPSLHSRHWGAPQQREGRQQQHHEELSATPTPLGLQETIAEFLYIARPLLHLLSLGLWGQ
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 RSWKPWLLAGVVDVTSLSLLSDRKGLTRRERRELRRRTILLLYYLLRSPFYDRFSEARIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 RSWKPWLLAGVVDVTSLSLLSDRKGLTRRERRELRRRTILLLYYLLRSPFYDRFSEARIL
              250       260       270       280       290       300

              310       320       330
pF1KE6 FLLQLLADHVPGVGLVTRPLMDYLPTWQKIYFYSWG
       ::::::::::::::::::::::::::::::::::::
NP_004 FLLQLLADHVPGVGLVTRPLMDYLPTWQKIYFYSWG
              310       320       330

>>NP_476515 (OMIM: 603360,614876,614877) peroxisomal bio (346 aa)
 initn: 2084 init1: 2084 opt: 2084 Z-score: 2395.9 bits: 451.7 E(85289): 1e-126
Smith-Waterman score: 2084; 100.0% identity (100.0% similar) in 317 aa overlap (1-317:1-317)

               10        20        30        40        50        60
pF1KE6 MEKLRLLGLRYQEYVTRHPAATAQLETAVRGFSYLLAGRFADSHELSELVYSASNLLVLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_476 MEKLRLLGLRYQEYVTRHPAATAQLETAVRGFSYLLAGRFADSHELSELVYSASNLLVLL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 NDGILRKELRKKLPVSLSQQKLLTWLSVLECVEVFMEMGAAKVWGEVGRWLVIALIQLAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_476 NDGILRKELRKKLPVSLSQQKLLTWLSVLECVEVFMEMGAAKVWGEVGRWLVIALIQLAK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 AVLRMLLLLWFKAGLQTSPPIVPLDRETQAQPPDGDHSPGNHEQSYVGKRSNRVVRTLQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_476 AVLRMLLLLWFKAGLQTSPPIVPLDRETQAQPPDGDHSPGNHEQSYVGKRSNRVVRTLQN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 TPSLHSRHWGAPQQREGRQQQHHEELSATPTPLGLQETIAEFLYIARPLLHLLSLGLWGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_476 TPSLHSRHWGAPQQREGRQQQHHEELSATPTPLGLQETIAEFLYIARPLLHLLSLGLWGQ
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 RSWKPWLLAGVVDVTSLSLLSDRKGLTRRERRELRRRTILLLYYLLRSPFYDRFSEARIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_476 RSWKPWLLAGVVDVTSLSLLSDRKGLTRRERRELRRRTILLLYYLLRSPFYDRFSEARIL
              250       260       270       280       290       300

              310       320       330
pF1KE6 FLLQLLADHVPGVGLVTRPLMDYLPTWQKIYFYSWG
       :::::::::::::::::
NP_476 FLLQLLADHVPGVGLVTTSQRAASPCLPARPHTQPWSPPAFLPGHP
              310       320       330       340

>>XP_011518776 (OMIM: 603360,614876,614877) PREDICTED: p (295 aa)
 initn: 1925 init1: 1925 opt: 1925 Z-score: 2214.5 bits: 417.9 E(85289): 1.3e-116
Smith-Waterman score: 1925; 98.6% identity (99.3% similar) in 291 aa overlap (46-336:5-295)

          20        30        40        50        60        70
pF1KE6 TRHPAATAQLETAVRGFSYLLAGRFADSHELSELVYSASNLLVLLNDGILRKELRKKLPV
                                     :.  ::::::::::::::::::::::::::
XP_011                           MHWVLKSRVYSASNLLVLLNDGILRKELRKKLPV
                                         10        20        30

          80        90       100       110       120       130
pF1KE6 SLSQQKLLTWLSVLECVEVFMEMGAAKVWGEVGRWLVIALIQLAKAVLRMLLLLWFKAGL
       ::::::::::::::::::::::::::::::::::::::::.:::::::::::::::::::
XP_011 SLSQQKLLTWLSVLECVEVFMEMGAAKVWGEVGRWLVIALVQLAKAVLRMLLLLWFKAGL
           40        50        60        70        80        90

         140       150       160       170       180       190
pF1KE6 QTSPPIVPLDRETQAQPPDGDHSPGNHEQSYVGKRSNRVVRTLQNTPSLHSRHWGAPQQR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 QTSPPIVPLDRETQAQPPDGDHSPGNHEQSYVGKRSNRVVRTLQNTPSLHSRHWGAPQQR
          100       110       120       130       140       150

         200       210       220       230       240       250
pF1KE6 EGRQQQHHEELSATPTPLGLQETIAEFLYIARPLLHLLSLGLWGQRSWKPWLLAGVVDVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 EGRQQQHHEELSATPTPLGLQETIAEFLYIARPLLHLLSLGLWGQRSWKPWLLAGVVDVT
          160       170       180       190       200       210

         260       270       280       290       300       310
pF1KE6 SLSLLSDRKGLTRRERRELRRRTILLLYYLLRSPFYDRFSEARILFLLQLLADHVPGVGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 SLSLLSDRKGLTRRERRELRRRTILLLYYLLRSPFYDRFSEARILFLLQLLADHVPGVGL
          220       230       240       250       260       270

         320       330
pF1KE6 VTRPLMDYLPTWQKIYFYSWG
       :::::::::::::::::::::
XP_011 VTRPLMDYLPTWQKIYFYSWG
          280       290

336 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 14:49:09 2016 done: Tue Nov 8 14:49:10 2016
 Total Scan time: 5.950 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]