FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5453, 352 aa
  1>>>pF1KE5453 352 - 352 aa - 352 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.0523+/-0.000347; mu= 9.2072+/- 0.022
 mean_var=122.0486+/-24.742, 0's: 0 Z-trim(118.9): 15 B-trim: 616 in 1/51
 Lambda= 0.116093
 statistics sampled from 32418 (32433) to 32418 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.724), E-opt: 0.2 (0.38), width: 16
 Scan time: 8.670

The best scores are:                                       opt bits E(85289)
NP_003747 (OMIM: 603912) eukaryotic translation in ( 352) 2305 396.7 3.9e-110
NP_005796 (OMIM: 607173) 26S proteasome non-ATPase ( 310)  225  48.3 2.6e-05
NP_003745 (OMIM: 603914) eukaryotic translation in ( 357)  185  41.7 0.0031

>>NP_003747 (OMIM: 603912) eukaryotic translation initia (352 aa)
 initn: 2305 init1: 2305 opt: 2305 Z-score: 2098.2 bits: 396.7 E(85289): 3.9e-110
Smith-Waterman score: 2305; 100.0% identity (100.0% similar) in 352 aa overlap (1-352:1-352)

               10        20        30        40        50        60
pF1KE5 MASRKEGTGSTATSSSSTAGAAGKGKGKGGSGDSAVKQVQIDGLVVLKIIKHYQEEGQGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MASRKEGTGSTATSSSSTAGAAGKGKGKGGSGDSAVKQVQIDGLVVLKIIKHYQEEGQGT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 EVVQGVLLGLVVEDRLEITNCFPFPQHTEDDADFDEVQYQMEMMRSLRHVNIDHLHVGWY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 EVVQGVLLGLVVEDRLEITNCFPFPQHTEDDADFDEVQYQMEMMRSLRHVNIDHLHVGWY
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 QSTYYGSFVTRALLDSQFSYQHAIEESVVLIYDPIKTAQGSLSLKAYRLTPKLMEVCKEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 QSTYYGSFVTRALLDSQFSYQHAIEESVVLIYDPIKTAQGSLSLKAYRLTPKLMEVCKEK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 DFSPEALKKANITFEYMFEEVPIVIKNSHLINVLMWELEKKSAVADKHELLSLASSNHLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 DFSPEALKKANITFEYMFEEVPIVIKNSHLINVLMWELEKKSAVADKHELLSLASSNHLG
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 KNLQLLMDRVDEMSQDIVKYNTYMRNTSKQQQQKHQYQQRRQQENMQRQSRGEPPLPEED
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 KNLQLLMDRVDEMSQDIVKYNTYMRNTSKQQQQKHQYQQRRQQENMQRQSRGEPPLPEED
              250       260       270       280       290       300

              310       320       330       340       350
pF1KE5 LSKLFKPPQPPARMDSLLIAGQINTYCQNIKEFTAQNLGKLFMAQALQEYNN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 LSKLFKPPQPPARMDSLLIAGQINTYCQNIKEFTAQNLGKLFMAQALQEYNN
              310       320       330       340       350

>>NP_005796 (OMIM: 607173) 26S proteasome non-ATPase reg (310 aa)
 initn: 122 init1: 74 opt: 225 Z-score: 216.2 bits: 48.3 E(85289): 2.6e-05
Smith-Waterman score: 244; 24.7% identity (59.7% similar) in 283 aa overlap (20-275:8-283)

               10        20        30            40        50
pF1KE5 MASRKEGTGSTATSSSSTAGAAGKGKGKGGSGDS-AV---KQVQIDGLVVLKIIKHYQEE
                          :..  : :.:   :. ::   .:: :..:..::..:: .
NP_005             MDRLLRLGGGMPGLGQGPPTDAPAVDTAEQVYISSLALLKMLKHGRA-
                           10        20        30        40

         60        70         80        90         100       110
pF1KE5 GQGTEVVQGVLLGLVVEDR-LEITNCFPFPQHTEDDADFDEVQ--YQMEMMRSLRHVNID
       :   ::. :..::  :.:  ... . : .:: .   .. . :.  .: .:.  :....
NP_005 GVPMEVM-GLMLGEFVDDYTVRVIDVFAMPQ-SGTGVSVEAVDPVFQAKMLDMLKQTGRP
        50         60        70         80        90       100

           120        130       140       150       160       170
pF1KE5 HLHVGWYQS-TYYGSFVTRALLDSQFSYQHAIEESVVLIYDPIKTAQGSLSLKAYRLTPK
       .. ::::.:   .: ... . ...: :..   :..:... :::....:.. . :.::
NP_005 EMVVGWYHSHPGFGCWLSGVDINTQQSFEALSERAVAVVVDPIQSVKGKVVIDAFRLINA
         110       120       130       140       150       160

            180           190            200       210       220
pF1KE5 LMEVCKEKDFSPEA----LKKANIT-----FEYMFEEVPIVIKNSHLINVLMWELEKKS-
        : :  ..  .  .    :.: .:      ..  .  . :  ....: . .. .:.:::
NP_005 NMMVLGHEPRQTTSNLGHLNKPSIQALIHGLNRHYYSITINYRKNELEQKMLLNLHKKSW
         170       180       190       200       210       220

                     230       240       250       260       270
pF1KE5 ----AVAD-----KHELLSLASSNHLGKNLQLLMDRVDEMSQDIVKYNTYMRNTSKQQQQ
           .. :     ::.   .    .:.:: .  ... :.:. . .     ..:..::. .
NP_005 MEGLTLQDYSEHCKHNESVVKEMLELAKNYNKAVEEEDKMTPEQLA----IKNVGKQDPK
         230       240       250       260       270           280

           280       290       300       310       320       330
pF1KE5 KHQYQQRRQQENMQRQSRGEPPLPEEDLSKLFKPPQPPARMDSLLIAGQINTYCQNIKEF
       .:
NP_005 RHLEEHVDVLMTSNIVQCLAAMLDTVVFK
             290       300       310

>>NP_003745 (OMIM: 603914) eukaryotic translation initia (357 aa)
 initn: 163 init1: 116 opt: 185 Z-score: 179.1 bits: 41.7 E(85289): 0.0031
Smith-Waterman score: 185; 26.9% identity (56.7% similar) in 201 aa overlap (13-206:72-254)

                                 10        20        30        40
pF1KE5                   MASRKEGTGSTATSSSSTAGAAGKGKGKGGSGDSAVKQVQID
                                     : . .  : :  :   ::      . :..
NP_003 APASSSDPAAAAAATAAPGQTPASAQAPAQTPAPALPGPALPGPFPGG------RVVRLH
              50        60        70        80              90

             50        60        70        80        90       100
pF1KE5 GLVVLKIIKHYQEEGQGTEVVQGVLLGLVVEDRLEITNCFPFPQHTEDDADFDEVQYQME
        ... .:.  :.....:.  : :.::: : .  .:.::::  : :.:..   :::  .::
NP_003 PVILASIVDSYERRNEGAARVIGTLLGTVDKHSVEVTNCFSVP-HNESE---DEVAVDME
         100       110       120       130        140          150

                110       120       130       140       150
pF1KE5 MMRSL----RHVNIDHLHVGWYQSTYYGSFVTRALLDSQFSYQHAIEESVVLIYDPIKTA
       . ...    ..:. ..: .::: . .  ... ...:  .. :..   . . :  :  .
NP_003 FAKNMYELHKKVSPNELILGWYATGH--DITEHSVLIHEY-YSREAPNPIHLTVDT-SLQ
             160       170         180        190       200

      160       170       180          190       200       210
pF1KE5 QGSLSLKAYRLTPKLMEVCKEKD---FSPEALKKANITFEYMFEEVPIVIKNSHLINVLM
       .: .:.:::  :  :: :  .     :.: ..: :    : .   : ...:
NP_003 NGRMSIKAYVST--LMGVPGRTMGVMFTPLTVKYAYYDTERI--GVDLIMKTCFSPNRVI
       210         220       230       240         250       260

         220       230       240       250       260       270
pF1KE5 WELEKKSAVADKHELLSLASSNHLGKNLQLLMDRVDEMSQDIVKYNTYMRNTSKQQQQKH

NP_003 GLSSDLQQVGGASARIQDALSTVLQYAEDVLSGKVSADNTVGRFLMSLVNQVPKIVPDDF
           270       280       290       300       310       320


352 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 00:55:09 2016 done: Tue Nov 8 00:55:10 2016
 Total Scan time: 8.670 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]