FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6990, 252 aa
  1>>>pF1KB6990 252 - 252 aa - 252 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.0269+/-0.000318; mu= 7.0623+/- 0.020
 mean_var=115.0004+/-22.645, 0's: 0 Z-trim(119.6): 71 B-trim: 741 in 1/55
 Lambda= 0.119598
 statistics sampled from 33798 (33870) to 33798 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.761), E-opt: 0.2 (0.397), width: 16
 Scan time: 7.520

The best scores are:                                       opt  bits E(85289)
NP_001648 (OMIM: 104640) amphiregulin preproprotei ( 252) 1667 297.8 1.2e-80
NP_001936 (OMIM: 126150) proheparin-binding EGF-li ( 208)  281  58.6   1e-08
NP_003683 (OMIM: 603421) tomoregulin-1 precursor [ ( 380)  181  41.5  0.0026
XP_011509192 (OMIM: 605734) PREDICTED: tomoregulin ( 365)  180  41.3  0.0028
NP_057276 (OMIM: 605734) tomoregulin-2 isoform 1 p ( 374)  180  41.3  0.0029
XP_016859228 (OMIM: 605734) PREDICTED: tomoregulin ( 337)  177  40.8  0.0038
XP_005273544 (OMIM: 142445,603013) PREDICTED: pro- ( 692)  182  41.8  0.0038
NP_001292063 (OMIM: 605734) tomoregulin-2 isoform  ( 346)  177  40.8  0.0039
XP_005273543 (OMIM: 142445,603013) PREDICTED: pro- ( 695)  177  41.0  0.0069

>>NP_001648 (OMIM: 104640) amphiregulin preproprotein [H (252 aa)
 initn: 1667 init1: 1667 opt: 1667 Z-score: 1568.9 bits: 297.8 E(85289): 1.2e-80
Smith-Waterman score: 1667; 100.0% identity (100.0% similar) in 252 aa overlap (1-252:1-252)

               10        20        30        40        50        60
pF1KB6 MRAPLLPPAPVVLSLLILGSGHYAAGLDLNDTYSGKREPFSGDHSADGFEVTSRSEMSSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MRAPLLPPAPVVLSLLILGSGHYAAGLDLNDTYSGKREPFSGDHSADGFEVTSRSEMSSG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 SEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 
SEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 SDKPKRKKKGGKNGKNRRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKCQQEYFGER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SDKPKRKKKGGKNGKNRRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKCQQEYFGER 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB6 CGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 CGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKK 190 200 210 220 230 240 250 pF1KB6 LRQENGNVHAIA :::::::::::: NP_001 LRQENGNVHAIA 250 >>NP_001936 (OMIM: 126150) proheparin-binding EGF-like g (208 aa) initn: 259 init1: 195 opt: 281 Z-score: 277.7 bits: 58.6 E(85289): 1e-08 Smith-Waterman score: 281; 34.5% identity (60.8% similar) in 148 aa overlap (100-247:70-208) 70 80 90 100 110 120 pF1KB6 PSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENTSDKPKRKKK : .:: :: . . : .. ::::: NP_001 PDPPTVSTDQLLPLGGGRDRKVRDLQEADLDLLRVTLSSKP--QALATPNKEEHGKRKKK 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB6 GGKNGKNRRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKCQQEYFGERCGEKSMKTH : :: :..:: ....:::::::::...:.: .: :. : :::: :. .. NP_001 GKGLGK------KRDPCLRKYKDFCIHGECKYVKELRAPSCICHPGYHGERCHGLSLPVE 100 110 120 130 140 150 190 200 210 220 230 240 pF1KB6 SMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKKLRQENGNVH . . . ::..:. .:.: : .. : ...: . :. : ::. :: . :.. NP_001 NRLYTYDHTTILAVVAVVLSSVCLLVI-VGLLMFRYHRRGGYDVENEEKVKLGMTNSH 160 170 180 190 200 250 pF1KB6 AIA >>NP_003683 (OMIM: 603421) tomoregulin-1 precursor [Homo (380 aa) initn: 141 init1: 141 opt: 181 Z-score: 180.5 bits: 41.5 E(85289): 0.0026 Smith-Waterman score: 189; 22.6% identity (65.2% similar) in 155 aa overlap (94-238:213-366) 70 80 90 100 110 120 pF1KB6 SPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDS-VRVEQV-VKPPQNKTESENTS : .. . : .. ::. .. . 
:....:: NP_003 AENVGCVCNIDCSGYSFNPVCASDGSSYNNPCFVREASCIKQEQIDIRHLGHCTDTDDTS 190 200 210 220 230 240 130 140 150 160 170 pF1KB6 -----DKPKRKKKGGKNGKNRRNR---KKKNPCNAEFQNFCIHGECKYIEHLEAVTCKCQ : . . :.....:. .. :: .....::::.:..: . ..:.:. NP_003 LLGKKDDGLQYRPDVKDASDQREDVYIGNHMPCPENLNGYCIHGKCEFIYSTQKASCRCE 250 260 270 280 290 300 180 190 200 210 220 230 pF1KB6 QEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEG . : :..: ::. . .. : .:.. . :::...:: .. ...:.. . :. .. .: NP_003 SGYTGQHC-EKTDFSILYVVPSRQKLTHVLIAAIIGAVQIAIIVAIVMCITRKCPKNNRG 310 320 330 340 350 360 240 250 pF1KB6 EAEERKKLRQENGNVHAIA . ... NP_003 RRQKQNLGHFTSDTSSRMV 370 380 >>XP_011509192 (OMIM: 605734) PREDICTED: tomoregulin-2 i (365 aa) initn: 82 init1: 82 opt: 180 Z-score: 179.8 bits: 41.3 E(85289): 0.0028 Smith-Waterman score: 180; 20.4% identity (62.6% similar) in 147 aa overlap (87-229:193-338) 60 70 80 90 100 110 pF1KB6 MSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVE--QVVKPPQNK ::: :: . . ..: .. . .: XP_011 DEDAEDVWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNT 170 180 190 200 210 220 120 130 140 150 160 170 pF1KB6 TESENTSDKPKRKKKGGKNGKN--RRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKC : . .. : . ..:... . :... :: ....::.::.:.. ... .:.: XP_011 TTTTKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRC 230 240 250 260 270 280 180 190 200 210 220 230 pF1KB6 QQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYE . : :..: .:.... .. . . .. . ::: .... .... :... . :. : XP_011 DAGYTGQHCEKKDYSVLYVVPGPV-RFQYVLIAAVIGTIQIAVICVVVLCITRKCPRSNR 290 300 310 320 330 340 240 250 pF1KB6 GEAEERKKLRQENGNVHAIA XP_011 IHRQKQNTGHYSSDNTTRASTRLI 350 360 >>NP_057276 (OMIM: 605734) tomoregulin-2 isoform 1 precu (374 aa) initn: 82 init1: 82 opt: 180 Z-score: 179.6 bits: 41.3 E(85289): 0.0029 Smith-Waterman score: 180; 20.4% identity (62.6% similar) in 147 aa overlap (87-229:202-347) 60 70 80 90 100 110 pF1KB6 MSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVE--QVVKPPQNK ::: :: . . ..: .. . 
.: NP_057 DEDAEDVWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNT 180 190 200 210 220 230 120 130 140 150 160 170 pF1KB6 TESENTSDKPKRKKKGGKNGKN--RRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKC : . .. : . ..:... . :... :: ....::.::.:.. ... .:.: NP_057 TTTTKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRC 240 250 260 270 280 290 180 190 200 210 220 230 pF1KB6 QQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYE . : :..: .:.... .. . . .. . ::: .... .... :... . :. : NP_057 DAGYTGQHCEKKDYSVLYVVPGPV-RFQYVLIAAVIGTIQIAVICVVVLCITRKCPRSNR 300 310 320 330 340 350 240 250 pF1KB6 GEAEERKKLRQENGNVHAIA NP_057 IHRQKQNTGHYSSDNTTRASTRLI 360 370 >>XP_016859228 (OMIM: 605734) PREDICTED: tomoregulin-2 i (337 aa) initn: 82 init1: 82 opt: 177 Z-score: 177.5 bits: 40.8 E(85289): 0.0038 Smith-Waterman score: 177; 20.3% identity (62.9% similar) in 143 aa overlap (87-225:193-334) 60 70 80 90 100 110 pF1KB6 MSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVE--QVVKPPQNK ::: :: . . ..: .. . .: XP_016 DEDAEDVWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNT 170 180 190 200 210 220 120 130 140 150 160 170 pF1KB6 TESENTSDKPKRKKKGGKNGKN--RRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKC : . .. : . ..:... . :... :: ....::.::.:.. ... .:.: XP_016 TTTTKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRC 230 240 250 260 270 280 180 190 200 210 220 230 pF1KB6 QQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYE . : :..: .:.... .. . . .. . ::: .... .... :... . : XP_016 DAGYTGQHCEKKDYSVLYVVPGPV-RFQYVLIAAVIGTIQIAVICVVVLCITRAKL 290 300 310 320 330 240 250 pF1KB6 GEAEERKKLRQENGNVHAIA >>XP_005273544 (OMIM: 142445,603013) PREDICTED: pro-neur (692 aa) initn: 132 init1: 69 opt: 182 Z-score: 177.5 bits: 41.8 E(85289): 0.0038 Smith-Waterman score: 182; 22.5% identity (57.8% similar) in 204 aa overlap (53-252:151-345) 30 40 50 60 70 80 pF1KB6 YAAGLDLNDTYSGKREPFSGDHSADGFEVTSRSEMSSGSEISPVSEMPSSSEPSSGADYD :. ... .. . :: ::.. .. . 
XP_005 PIISLDATAASAVWVSSEAYTSPVSRAQSESEVQVTVQGDKAVVSFEPSAAPTPKNRIFA 130 140 150 160 170 180 90 100 110 120 130 140 pF1KB6 YSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENTSDKPKRKKKGGKNGKNRRNRKK .: .. :..:. . ::. . . ::. ::. : . :: . . . .: .. . XP_005 FSFLPSTAPSFPSPTRNPEVRTPKSATQPQT-TET-NLQTAPKLSTSTSTTGTSHLVK-- 190 200 210 220 230 150 160 170 180 190 pF1KB6 KNPCNAEFQNFCIHG-ECKYIEHLEAVT---CKCQQEYFGERCGEKSMKTHSMIDSSLSK : . ..::..: :: ... : . ::: .:. :.:: . : . . .: XP_005 ---CAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFYKAEELYQK 240 250 260 270 280 290 200 210 220 230 240 250 pF1KB6 IALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKKLRQENGNVHAIA .:. :... :...... ... . . :: . . . :..::.: .:. :: XP_005 RVLT-ITGICIALLVVGIMCVVAYCKTKKQRK-KLHDRLRQSLRSERNNMMNIANGPHHP 300 310 320 330 340 350 XP_005 NPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSNGH 360 370 380 390 400 410 >>NP_001292063 (OMIM: 605734) tomoregulin-2 isoform 2 pr (346 aa) initn: 82 init1: 82 opt: 177 Z-score: 177.4 bits: 40.8 E(85289): 0.0039 Smith-Waterman score: 177; 20.3% identity (62.9% similar) in 143 aa overlap (87-225:202-343) 60 70 80 90 100 110 pF1KB6 MSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVE--QVVKPPQNK ::: :: . . ..: .. . .: NP_001 DEDAEDVWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNT 180 190 200 210 220 230 120 130 140 150 160 170 pF1KB6 TESENTSDKPKRKKKGGKNGKN--RRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKC : . .. : . ..:... . :... :: ....::.::.:.. ... .:.: NP_001 TTTTKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRC 240 250 260 270 280 290 180 190 200 210 220 230 pF1KB6 QQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYE . : :..: .:.... .. . . .. . ::: .... .... :... . 
: NP_001 DAGYTGQHCEKKDYSVLYVVPGPV-RFQYVLIAAVIGTIQIAVICVVVLCITRAKL 300 310 320 330 340 240 250 pF1KB6 GEAEERKKLRQENGNVHAIA >>XP_005273543 (OMIM: 142445,603013) PREDICTED: pro-neur (695 aa) initn: 131 init1: 68 opt: 177 Z-score: 172.8 bits: 41.0 E(85289): 0.0069 Smith-Waterman score: 177; 22.8% identity (58.3% similar) in 206 aa overlap (53-252:151-348) 30 40 50 60 70 80 pF1KB6 YAAGLDLNDTYSGKREPFSGDHSADGFEVTSRSEMSSGSEISPVSEMPSSSEPSSGADYD :. ... .. . :: ::.. .. . XP_005 PIISLDATAASAVWVSSEAYTSPVSRAQSESEVQVTVQGDKAVVSFEPSAAPTPKNRIFA 130 140 150 160 170 180 90 100 110 120 130 140 pF1KB6 YSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENTSDKPKRKKKGGKNGKNRRNRKK .: .. :..:. . ::. . . ::. ::. : . :: . . . .: .. . XP_005 FSFLPSTAPSFPSPTRNPEVRTPKSATQPQT-TET-NLQTAPKLSTSTSTTGTSHLVK-- 190 200 210 220 230 150 160 170 180 190 pF1KB6 KNPCNAEFQNFCIHG-ECKYIEHLEAVT---CKCQQEYFGERCGEK-SMKTHSMIDSS-L : . ..::..: :: ... : . :::: . : :: :. ::.... . : XP_005 ---CAEKEKTFCVNGGECFMVKDLSNPSRYLCKCQPGFTGARCTENVPMKVQNQEKAEEL 240 250 260 270 280 290 200 210 220 230 240 250 pF1KB6 SKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKKLRQENGNVHAIA . . .:... :...... ... . . :: . . . :..::.: .:. :: XP_005 YQKRVLTITGICIALLVVGIMCVVAYCKTKKQRK-KLHDRLRQSLRSERNNMMNIANGPH 300 310 320 330 340 350 XP_005 HPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSN 360 370 380 390 400 410

252 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 02:01:33 2016 done: Mon Nov 7 02:01:34 2016
 Total Scan time: 7.520 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]