FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3036, 169 aa 1>>>pF1KE3036 169 - 169 aa - 169 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2545+/-0.000363; mu= 12.8223+/- 0.022 mean_var=86.2530+/-16.498, 0's: 0 Z-trim(115.4): 92 B-trim: 57 in 1/53 Lambda= 0.138098 statistics sampled from 25692 (25809) to 25692 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.688), E-opt: 0.2 (0.303), width: 16 Scan time: 4.730 The best scores are: opt bits E(85289) NP_001423 (OMIM: 602061) proepiregulin preproprote ( 169) 1160 240.5 9.5e-64 NP_001720 (OMIM: 600345) probetacellulin isoform 1 ( 178) 213 51.9 6.1e-07 XP_011530513 (OMIM: 600345) PREDICTED: probetacell ( 178) 213 51.9 6.1e-07 XP_016863341 (OMIM: 131530,611718) PREDICTED: pro- (1010) 189 47.8 6e-05 NP_001093161 (OMIM: 190170) protransforming growth ( 159) 178 44.9 7.1e-05 NP_003227 (OMIM: 190170) protransforming growth fa ( 160) 178 44.9 7.1e-05 NP_001295088 (OMIM: 190170) protransforming growth ( 165) 178 44.9 7.3e-05 NP_001295087 (OMIM: 190170) protransforming growth ( 166) 178 44.9 7.3e-05 NP_003683 (OMIM: 603421) tomoregulin-1 precursor [ ( 380) 182 46.0 7.7e-05 XP_011509192 (OMIM: 605734) PREDICTED: tomoregulin ( 365) 177 45.0 0.00015 NP_057276 (OMIM: 605734) tomoregulin-2 isoform 1 p ( 374) 177 45.0 0.00015 XP_016859228 (OMIM: 605734) PREDICTED: tomoregulin ( 337) 175 44.6 0.00019 NP_001292063 (OMIM: 605734) tomoregulin-2 isoform ( 346) 175 44.6 0.00019 NP_053584 (OMIM: 603818) pro-neuregulin-2, membran ( 844) 177 45.3 0.00027 XP_016877436 (OMIM: 610894) PREDICTED: pro-neuregu ( 115) 166 42.4 0.00029 NP_612640 (OMIM: 610894) pro-neuregulin-4, membran ( 115) 166 42.4 0.00029 XP_016877434 (OMIM: 610894) PREDICTED: pro-neuregu ( 115) 166 42.4 0.00029 XP_016877435 (OMIM: 610894) PREDICTED: pro-neuregu ( 115) 166 42.4 0.00029 XP_016877437 (OMIM: 610894) PREDICTED: pro-neuregu ( 115) 166 42.4 0.00029 XP_016877433 (OMIM: 610894) PREDICTED: pro-neuregu ( 123) 166 42.4 0.00031 XP_016877428 (OMIM: 610894) PREDICTED: pro-neuregu ( 129) 166 42.4 0.00032 XP_016877429 (OMIM: 610894) PREDICTED: pro-neuregu ( 129) 166 42.4 0.00032 XP_016877432 (OMIM: 610894) PREDICTED: pro-neuregu ( 129) 166 42.4 0.00032 XP_016877430 (OMIM: 610894) PREDICTED: pro-neuregu ( 129) 166 42.4 0.00032 XP_016877431 (OMIM: 610894) PREDICTED: pro-neuregu ( 129) 166 42.4 0.00032 XP_016877427 (OMIM: 610894) PREDICTED: pro-neuregu ( 129) 166 42.4 0.00032 XP_016877426 (OMIM: 610894) PREDICTED: pro-neuregu ( 129) 166 42.4 0.00032 NP_001936 (OMIM: 126150) proheparin-binding EGF-li ( 208) 144 38.2 0.0095 >>NP_001423 (OMIM: 602061) proepiregulin preproprotein [ (169 aa) initn: 1160 init1: 1160 opt: 1160 Z-score: 1265.5 bits: 240.5 E(85289): 9.5e-64 Smith-Waterman score: 1160; 100.0% identity (100.0% similar) in 169 aa overlap (1-169:1-169) 10 20 30 40 50 60 pF1KE3 MTAGRRMEMLCAGRVPALLLCLGFHLLQAVLSTTVIPSCIPGESSDNCTALVQTEDNPRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MTAGRRMEMLCAGRVPALLLCLGFHLLQAVLSTTVIPSCIPGESSDNCTALVQTEDNPRV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 AQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEHFFLTVHQPLSKEYVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEHFFLTVHQPLSKEYVA 70 80 90 100 110 120 130 140 150 160 pF1KE3 LTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV ::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV 130 140 150 160 >>NP_001720 (OMIM: 600345) probetacellulin isoform 1 pre (178 aa) initn: 161 init1: 128 opt: 213 Z-score: 245.5 bits: 51.9 E(85289): 6.1e-07 Smith-Waterman score: 213; 29.3% identity (57.5% similar) in 167 aa overlap (11-167:7-167) 10 20 30 40 50 pF1KE3 MTAGRRMEMLCAG--RVPALL-LCLGFHLLQAVLS---TTVIPSC---IPGESSDNCTAL :.: .: :: : ::. .:. :.. .: : . :. .::.: NP_001 MDRAARCSGASSLPLLLALALGLVILHCVVADGNSTRSPETNGLLCGDPEENCAAT 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 VQTEDNPRVAQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEHF-FLTV : .. : .. : .: .... ::..:.: ..: . : :. :: :.:::. .. . NP_001 --TTQSKRKGHFS--RCPKQYKHYCIKGRCRFVVAEQTPSCVCDEGYIGARCERVDLFYL 60 70 80 90 100 110 120 130 140 150 160 pF1KE3 HQPLSKEYVALTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV . .. : . ....:.: :.: :. :.:. :. ::: : : : : NP_001 RGDRGQILVICLIAVMVVFIILVIGVCT-CCHPLRKRR-KRKKKEEEMETLGKDITPINE 120 130 140 150 160 170 NP_001 DIEETNIA >>XP_011530513 (OMIM: 600345) PREDICTED: probetacellulin (178 aa) initn: 161 init1: 128 opt: 213 Z-score: 245.5 bits: 51.9 E(85289): 6.1e-07 Smith-Waterman score: 213; 29.3% identity (57.5% similar) in 167 aa overlap (11-167:7-167) 10 20 30 40 50 pF1KE3 MTAGRRMEMLCAG--RVPALL-LCLGFHLLQAVLS---TTVIPSC---IPGESSDNCTAL :.: .: :: : ::. .:. :.. .: : . :. .::.: XP_011 MDRAARCSGASSLPLLLALALGLVILHCVVADGNSTRSPETNGLLCGDPEENCAAT 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 VQTEDNPRVAQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEHF-FLTV : .. : .. : .: .... ::..:.: ..: . : :. :: :.:::. .. . XP_011 --TTQSKRKGHFS--RCPKQYKHYCIKGRCRFVVAEQTPSCVCDEGYIGARCERVDLFYL 60 70 80 90 100 110 120 130 140 150 160 pF1KE3 HQPLSKEYVALTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV . .. : . ....:.: :.: :. :.:. :. ::: : : : : XP_011 RGDRGQILVICLIAVMVVFIILVIGVCT-CCHPLRKRR-KRKKKEEEMETLGKDITPINE 120 130 140 150 160 170 XP_011 DIEETNIA >>XP_016863341 (OMIM: 131530,611718) PREDICTED: pro-epid (1010 aa) initn: 110 init1: 49 opt: 189 Z-score: 209.9 bits: 47.8 E(85289): 6e-05 Smith-Waterman score: 189; 27.4% identity (61.9% similar) in 113 aa overlap (52-157:837-949) 30 40 50 60 70 80 pF1KE3 LGFHLLQAVLSTTVIPSCIPGESSDNCTALVQTEDNPRVAQVSITKCSSDMNGYCLH-GQ .. .:. .. : ..: . .::::: : XP_016 TNTEGGYTCMCAGRLSEPGLICPDSTPPPHLREDDHHYSVRNSDSECPLSHDGYCLHDGV 810 820 830 840 850 860 90 100 110 120 130 pF1KE3 CIYLVDMSQNYCRCEVGYTGVRCEH----FFLTVHQPLSKEYVALTVILIILFLITVVGS :.:. ... : : ::: : ::.. .. : ... ...: . .. :. .. XP_016 CMYIEALDKYACNCVVGYIGERCQYRDLKWWELRHAGHGQQQKVIVVAVCVVVLVMLLLL 870 880 890 900 910 920 140 150 160 pF1KE3 TYYFCRWYRNRK--SKEPKKEYERVTSGDPELPQV . . ..::..: ::.::. :: XP_016 SLWGAHYYRTQKLLSKNPKNPYEESSRDVRSRRPADTEDGMSSCPQPWGQCNQLHGGRSP 930 940 950 960 970 980 >>NP_001093161 (OMIM: 190170) protransforming growth fac (159 aa) initn: 165 init1: 144 opt: 178 Z-score: 208.5 bits: 44.9 E(85289): 7.1e-05 Smith-Waterman score: 178; 28.0% identity (59.8% similar) in 107 aa overlap (48-145:20-126) 20 30 40 50 60 70 pF1KE3 LLLCLGFHLLQAVLSTTVIPSCIPGESSDNCTALVQTED---NPRVAQVSITK---CSSD : :: .. . .: :: . ... : .. NP_001 MVPSAGQLALFALGIVLAACQALENSTSPLSDPPVAAAVVSHFNDCPDS 10 20 30 40 80 90 100 110 120 pF1KE3 MNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEH--FFLTVHQPLSKEYV-ALTVILIIL . .:.:: : .::. .. : :. ::.:.:::: .. .: .:. . ::.:. :. NP_001 HTQFCFHGTCRFLVQEDKPACVCHSGYVGARCEHADLLAVVAASQKKQAITALVVVSIVA 50 60 70 80 90 100 130 140 150 160 pF1KE3 FLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV . . .. . : : NP_001 LAVLIITCVLIHCCQVRKHCEWCRALICRHEKPSALLKGRTACCHSETVV 110 120 130 140 150 >>NP_003227 (OMIM: 190170) protransforming growth factor (160 aa) initn: 165 init1: 144 opt: 178 Z-score: 208.5 bits: 44.9 E(85289): 7.1e-05 Smith-Waterman score: 178; 27.6% identity (59.5% similar) in 116 aa overlap (35-145:16-127) 10 20 30 40 50 60 pF1KE3 RRMEMLCAGRVPALLLCLGFHLLQAVLSTTVIPSCIPGESSDNCTALVQTEDNPRVAQV- :. .: .. .: :. . . : : .: : NP_003 MVPSAGQLALFALGIVLAAC---QALENSTSPL-SADPPVAAAVV 10 20 30 40 70 80 90 100 110 pF1KE3 -SITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEH--FFLTVHQPLSKEYV- .. : .. . .:.:: : .::. .. : :. ::.:.:::: .. .: .:. . NP_003 SHFNDCPDSHTQFCFHGTCRFLVQEDKPACVCHSGYVGARCEHADLLAVVAASQKKQAIT 50 60 70 80 90 100 120 130 140 150 160 pF1KE3 ALTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV ::.:. :. . . .. . : : NP_003 ALVVVSIVALAVLIITCVLIHCCQVRKHCEWCRALICRHEKPSALLKGRTACCHSETVV 110 120 130 140 150 160 >>NP_001295088 (OMIM: 190170) protransforming growth fac (165 aa) initn: 165 init1: 144 opt: 178 Z-score: 208.3 bits: 44.9 E(85289): 7.3e-05 Smith-Waterman score: 178; 28.0% identity (59.8% similar) in 107 aa overlap (48-145:26-132) 20 30 40 50 60 70 pF1KE3 LLLCLGFHLLQAVLSTTVIPSCIPGESSDNCTALVQTED---NPRVAQVSITK---CSSD : :: .. . .: :: . ... : .. NP_001 MFKIGRGALDLFSELLSFGGIVLAACQALENSTSPLSDPPVAAAVVSHFNDCPDS 10 20 30 40 50 80 90 100 110 120 pF1KE3 MNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEH--FFLTVHQPLSKEYV-ALTVILIIL . .:.:: : .::. .. : :. ::.:.:::: .. .: .:. . ::.:. :. NP_001 HTQFCFHGTCRFLVQEDKPACVCHSGYVGARCEHADLLAVVAASQKKQAITALVVVSIVA 60 70 80 90 100 110 130 140 150 160 pF1KE3 FLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV . . .. . : : NP_001 LAVLIITCVLIHCCQVRKHCEWCRALICRHEKPSALLKGRTACCHSETVV 120 130 140 150 160 >>NP_001295087 (OMIM: 190170) protransforming growth fac (166 aa) initn: 165 init1: 144 opt: 178 Z-score: 208.3 bits: 44.9 E(85289): 7.3e-05 Smith-Waterman score: 178; 27.6% identity (59.5% similar) in 116 aa overlap (35-145:22-133) 10 20 30 40 50 60 pF1KE3 RRMEMLCAGRVPALLLCLGFHLLQAVLSTTVIPSCIPGESSDNCTALVQTEDNPRVAQV- :. .: .. .: :. . . : : .: : NP_001 MFKIGRGALDLFSELLSFGGIVLAAC---QALENSTSPL-SADPPVAAAVV 10 20 30 40 70 80 90 100 110 pF1KE3 -SITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVGYTGVRCEH--FFLTVHQPLSKEYV- .. : .. . .:.:: : .::. .. : :. ::.:.:::: .. .: .:. . NP_001 SHFNDCPDSHTQFCFHGTCRFLVQEDKPACVCHSGYVGARCEHADLLAVVAASQKKQAIT 50 60 70 80 90 100 120 130 140 150 160 pF1KE3 ALTVILIILFLITVVGSTYYFCRWYRNRKSKEPKKEYERVTSGDPELPQV ::.:. :. . . .. . : : NP_001 ALVVVSIVALAVLIITCVLIHCCQVRKHCEWCRALICRHEKPSALLKGRTACCHSETVV 110 120 130 140 150 160 >>NP_003683 (OMIM: 603421) tomoregulin-1 precursor [Homo (380 aa) initn: 194 init1: 168 opt: 182 Z-score: 207.9 bits: 46.0 E(85289): 7.7e-05 Smith-Waterman score: 182; 28.4% identity (58.8% similar) in 102 aa overlap (68-161:275-376) 40 50 60 70 80 90 pF1KE3 SCIPGESSDNCTALVQTEDNPRVAQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVG : ..::::.::.: .. . .. :::: : NP_003 GKKDDGLQYRPDVKDASDQREDVYIGNHMPCPENLNGYCIHGKCEFIYSTQKASCRCESG 250 260 270 280 290 300 100 110 120 130 140 pF1KE3 YTGVRCEH------FFLTVHQPLSKEYVALTVILI-ILFLITVVGSTYYFC-RWYRNRKS ::: .::. . . .: :.. .: . . : .....: : . :.:.. NP_003 YTGQHCEKTDFSILYVVPSRQKLTHVLIAAIIGAVQIAIIVAIVMCITRKCPKNNRGRRQ 310 320 330 340 350 360 150 160 pF1KE3 KEPKKEYERVTSGDPELPQV :. .. :: NP_003 KQNLGHFTSDTSSRMV 370 380 >>XP_011509192 (OMIM: 605734) PREDICTED: tomoregulin-2 i (365 aa) initn: 210 init1: 164 opt: 177 Z-score: 202.7 bits: 45.0 E(85289): 0.00015 Smith-Waterman score: 177; 28.9% identity (61.9% similar) in 97 aa overlap (68-156:256-352) 40 50 60 70 80 90 pF1KE3 SCIPGESSDNCTALVQTEDNPRVAQVSITKCSSDMNGYCLHGQCIYLVDMSQNYCRCEVG : .::.:.::.: . ..:.. :::..: XP_011 TKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRCDAG 230 240 250 260 270 280 100 110 120 130 140 pF1KE3 YTGVRCEH----FFLTVHQPLSKEYVALTVIL----IILFLITVVGSTYYFCRWYRNRKS ::: .::. . .: :. .:: ..... : .. ..:. : : : ... XP_011 YTGQHCEKKDYSVLYVVPGPVRFQYVLIAAVIGTIQIAVICVVVLCITRKCPRSNRIHRQ 290 300 310 320 330 340 150 160 pF1KE3 KEPKKEYERVTSGDPELPQV :. .: XP_011 KQNTGHYSSDNTTRASTRLI 350 360 169 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 22:52:56 2016 done: Sat Nov 5 22:52:57 2016 Total Scan time: 4.730 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]