FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6990, 252 aa
1>>>pF1KB6990 252 - 252 aa - 252 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0269+/-0.000318; mu= 7.0623+/- 0.020
mean_var=115.0004+/-22.645, 0's: 0 Z-trim(119.6): 71 B-trim: 741 in 1/55
Lambda= 0.119598
statistics sampled from 33798 (33870) to 33798 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.761), E-opt: 0.2 (0.397), width: 16
Scan time: 7.520
The best scores are:                                       opt bits E(85289)
NP_001648 (OMIM: 104640) amphiregulin preproprotei ( 252) 1667 297.8 1.2e-80
NP_001936 (OMIM: 126150) proheparin-binding EGF-li ( 208)  281  58.6   1e-08
NP_003683 (OMIM: 603421) tomoregulin-1 precursor [ ( 380)  181  41.5  0.0026
XP_011509192 (OMIM: 605734) PREDICTED: tomoregulin ( 365)  180  41.3  0.0028
NP_057276 (OMIM: 605734) tomoregulin-2 isoform 1 p ( 374)  180  41.3  0.0029
XP_016859228 (OMIM: 605734) PREDICTED: tomoregulin ( 337)  177  40.8  0.0038
XP_005273544 (OMIM: 142445,603013) PREDICTED: pro- ( 692)  182  41.8  0.0038
NP_001292063 (OMIM: 605734) tomoregulin-2 isoform  ( 346)  177  40.8  0.0039
XP_005273543 (OMIM: 142445,603013) PREDICTED: pro- ( 695)  177  41.0  0.0069
>>NP_001648 (OMIM: 104640) amphiregulin preproprotein [H (252 aa)
initn: 1667 init1: 1667 opt: 1667 Z-score: 1568.9 bits: 297.8 E(85289): 1.2e-80
Smith-Waterman score: 1667; 100.0% identity (100.0% similar) in 252 aa overlap (1-252:1-252)
               10        20        30        40        50        60
pF1KB6 MRAPLLPPAPVVLSLLILGSGHYAAGLDLNDTYSGKREPFSGDHSADGFEVTSRSEMSSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MRAPLLPPAPVVLSLLILGSGHYAAGLDLNDTYSGKREPFSGDHSADGFEVTSRSEMSSG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 SEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 SDKPKRKKKGGKNGKNRRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKCQQEYFGER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SDKPKRKKKGGKNGKNRRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKCQQEYFGER
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB6 CGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKK
              190       200       210       220       230       240

              250
pF1KB6 LRQENGNVHAIA
       ::::::::::::
NP_001 LRQENGNVHAIA
              250
>>NP_001936 (OMIM: 126150) proheparin-binding EGF-like g (208 aa)
initn: 259 init1: 195 opt: 281 Z-score: 277.7 bits: 58.6 E(85289): 1e-08
Smith-Waterman score: 281; 34.5% identity (60.8% similar) in 148 aa overlap (100-247:70-208)
70 80 90 100 110 120
pF1KB6 PSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENTSDKPKRKKK
: .:: :: . . : .. :::::
NP_001 PDPPTVSTDQLLPLGGGRDRKVRDLQEADLDLLRVTLSSKP--QALATPNKEEHGKRKKK
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB6 GGKNGKNRRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKCQQEYFGERCGEKSMKTH
: :: :..:: ....:::::::::...:.: .: :. : :::: :. ..
NP_001 GKGLGK------KRDPCLRKYKDFCIHGECKYVKELRAPSCICHPGYHGERCHGLSLPVE
100 110 120 130 140 150
190 200 210 220 230 240
pF1KB6 SMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKKLRQENGNVH
. . . ::..:. .:.: : .. : ...: . :. : ::. :: . :..
NP_001 NRLYTYDHTTILAVVAVVLSSVCLLVI-VGLLMFRYHRRGGYDVENEEKVKLGMTNSH
160 170 180 190 200
250
pF1KB6 AIA
>>NP_003683 (OMIM: 603421) tomoregulin-1 precursor [Homo (380 aa)
initn: 141 init1: 141 opt: 181 Z-score: 180.5 bits: 41.5 E(85289): 0.0026
Smith-Waterman score: 189; 22.6% identity (65.2% similar) in 155 aa overlap (94-238:213-366)
70 80 90 100 110 120
pF1KB6 SPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDS-VRVEQV-VKPPQNKTESENTS
: .. . : .. ::. .. . :....::
NP_003 AENVGCVCNIDCSGYSFNPVCASDGSSYNNPCFVREASCIKQEQIDIRHLGHCTDTDDTS
190 200 210 220 230 240
130 140 150 160 170
pF1KB6 -----DKPKRKKKGGKNGKNRRNR---KKKNPCNAEFQNFCIHGECKYIEHLEAVTCKCQ
: . . :.....:. .. :: .....::::.:..: . ..:.:.
NP_003 LLGKKDDGLQYRPDVKDASDQREDVYIGNHMPCPENLNGYCIHGKCEFIYSTQKASCRCE
250 260 270 280 290 300
180 190 200 210 220 230
pF1KB6 QEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEG
. : :..: ::. . .. : .:.. . :::...:: .. ...:.. . :. .. .:
NP_003 SGYTGQHC-EKTDFSILYVVPSRQKLTHVLIAAIIGAVQIAIIVAIVMCITRKCPKNNRG
310 320 330 340 350 360
240 250
pF1KB6 EAEERKKLRQENGNVHAIA
. ...
NP_003 RRQKQNLGHFTSDTSSRMV
370 380
>>XP_011509192 (OMIM: 605734) PREDICTED: tomoregulin-2 i (365 aa)
initn: 82 init1: 82 opt: 180 Z-score: 179.8 bits: 41.3 E(85289): 0.0028
Smith-Waterman score: 180; 20.4% identity (62.6% similar) in 147 aa overlap (87-229:193-338)
60 70 80 90 100 110
pF1KB6 MSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVE--QVVKPPQNK
::: :: . . ..: .. . .:
XP_011 DEDAEDVWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNT
170 180 190 200 210 220
120 130 140 150 160 170
pF1KB6 TESENTSDKPKRKKKGGKNGKN--RRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKC
: . .. : . ..:... . :... :: ....::.::.:.. ... .:.:
XP_011 TTTTKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRC
230 240 250 260 270 280
180 190 200 210 220 230
pF1KB6 QQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYE
. : :..: .:.... .. . . .. . ::: .... .... :... . :. :
XP_011 DAGYTGQHCEKKDYSVLYVVPGPV-RFQYVLIAAVIGTIQIAVICVVVLCITRKCPRSNR
290 300 310 320 330 340
240 250
pF1KB6 GEAEERKKLRQENGNVHAIA
XP_011 IHRQKQNTGHYSSDNTTRASTRLI
350 360
>>NP_057276 (OMIM: 605734) tomoregulin-2 isoform 1 precu (374 aa)
initn: 82 init1: 82 opt: 180 Z-score: 179.6 bits: 41.3 E(85289): 0.0029
Smith-Waterman score: 180; 20.4% identity (62.6% similar) in 147 aa overlap (87-229:202-347)
60 70 80 90 100 110
pF1KB6 MSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVE--QVVKPPQNK
::: :: . . ..: .. . .:
NP_057 DEDAEDVWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNT
180 190 200 210 220 230
120 130 140 150 160 170
pF1KB6 TESENTSDKPKRKKKGGKNGKN--RRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKC
: . .. : . ..:... . :... :: ....::.::.:.. ... .:.:
NP_057 TTTTKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRC
240 250 260 270 280 290
180 190 200 210 220 230
pF1KB6 QQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYE
. : :..: .:.... .. . . .. . ::: .... .... :... . :. :
NP_057 DAGYTGQHCEKKDYSVLYVVPGPV-RFQYVLIAAVIGTIQIAVICVVVLCITRKCPRSNR
300 310 320 330 340 350
240 250
pF1KB6 GEAEERKKLRQENGNVHAIA
NP_057 IHRQKQNTGHYSSDNTTRASTRLI
360 370
>>XP_016859228 (OMIM: 605734) PREDICTED: tomoregulin-2 i (337 aa)
initn: 82 init1: 82 opt: 177 Z-score: 177.5 bits: 40.8 E(85289): 0.0038
Smith-Waterman score: 177; 20.3% identity (62.9% similar) in 143 aa overlap (87-225:193-334)
60 70 80 90 100 110
pF1KB6 MSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVE--QVVKPPQNK
::: :: . . ..: .. . .:
XP_016 DEDAEDVWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNT
170 180 190 200 210 220
120 130 140 150 160 170
pF1KB6 TESENTSDKPKRKKKGGKNGKN--RRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKC
: . .. : . ..:... . :... :: ....::.::.:.. ... .:.:
XP_016 TTTTKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRC
230 240 250 260 270 280
180 190 200 210 220 230
pF1KB6 QQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYE
. : :..: .:.... .. . . .. . ::: .... .... :... . :
XP_016 DAGYTGQHCEKKDYSVLYVVPGPV-RFQYVLIAAVIGTIQIAVICVVVLCITRAKL
290 300 310 320 330
240 250
pF1KB6 GEAEERKKLRQENGNVHAIA
>>XP_005273544 (OMIM: 142445,603013) PREDICTED: pro-neur (692 aa)
initn: 132 init1: 69 opt: 182 Z-score: 177.5 bits: 41.8 E(85289): 0.0038
Smith-Waterman score: 182; 22.5% identity (57.8% similar) in 204 aa overlap (53-252:151-345)
30 40 50 60 70 80
pF1KB6 YAAGLDLNDTYSGKREPFSGDHSADGFEVTSRSEMSSGSEISPVSEMPSSSEPSSGADYD
:. ... .. . :: ::.. .. .
XP_005 PIISLDATAASAVWVSSEAYTSPVSRAQSESEVQVTVQGDKAVVSFEPSAAPTPKNRIFA
130 140 150 160 170 180
90 100 110 120 130 140
pF1KB6 YSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENTSDKPKRKKKGGKNGKNRRNRKK
.: .. :..:. . ::. . . ::. ::. : . :: . . . .: .. .
XP_005 FSFLPSTAPSFPSPTRNPEVRTPKSATQPQT-TET-NLQTAPKLSTSTSTTGTSHLVK--
190 200 210 220 230
150 160 170 180 190
pF1KB6 KNPCNAEFQNFCIHG-ECKYIEHLEAVT---CKCQQEYFGERCGEKSMKTHSMIDSSLSK
: . ..::..: :: ... : . ::: .:. :.:: . : . . .:
XP_005 ---CAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFYKAEELYQK
240 250 260 270 280 290
200 210 220 230 240 250
pF1KB6 IALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKKLRQENGNVHAIA
.:. :... :...... ... . . :: . . . :..::.: .:. ::
XP_005 RVLT-ITGICIALLVVGIMCVVAYCKTKKQRK-KLHDRLRQSLRSERNNMMNIANGPHHP
300 310 320 330 340 350
XP_005 NPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSNGH
360 370 380 390 400 410
>>NP_001292063 (OMIM: 605734) tomoregulin-2 isoform 2 pr (346 aa)
initn: 82 init1: 82 opt: 177 Z-score: 177.4 bits: 40.8 E(85289): 0.0039
Smith-Waterman score: 177; 20.3% identity (62.9% similar) in 143 aa overlap (87-225:202-343)
60 70 80 90 100 110
pF1KB6 MSSGSEISPVSEMPSSSEPSSGADYDYSEEYDNEPQIPGYIVDDSVRVE--QVVKPPQNK
::: :: . . ..: .. . .:
NP_001 DEDAEDVWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNT
180 190 200 210 220 230
120 130 140 150 160 170
pF1KB6 TESENTSDKPKRKKKGGKNGKN--RRNRKKKNPCNAEFQNFCIHGECKYIEHLEAVTCKC
: . .. : . ..:... . :... :: ....::.::.:.. ... .:.:
NP_001 TTTTKSEDGHYARTDYAENANKLEESAREHHIPCPEHYNGFCMHGKCEHSINMQEPSCRC
240 250 260 270 280 290
180 190 200 210 220 230
pF1KB6 QQEYFGERCGEKSMKTHSMIDSSLSKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYE
. : :..: .:.... .. . . .. . ::: .... .... :... . :
NP_001 DAGYTGQHCEKKDYSVLYVVPGPV-RFQYVLIAAVIGTIQIAVICVVVLCITRAKL
300 310 320 330 340
240 250
pF1KB6 GEAEERKKLRQENGNVHAIA
>>XP_005273543 (OMIM: 142445,603013) PREDICTED: pro-neur (695 aa)
initn: 131 init1: 68 opt: 177 Z-score: 172.8 bits: 41.0 E(85289): 0.0069
Smith-Waterman score: 177; 22.8% identity (58.3% similar) in 206 aa overlap (53-252:151-348)
30 40 50 60 70 80
pF1KB6 YAAGLDLNDTYSGKREPFSGDHSADGFEVTSRSEMSSGSEISPVSEMPSSSEPSSGADYD
:. ... .. . :: ::.. .. .
XP_005 PIISLDATAASAVWVSSEAYTSPVSRAQSESEVQVTVQGDKAVVSFEPSAAPTPKNRIFA
130 140 150 160 170 180
90 100 110 120 130 140
pF1KB6 YSEEYDNEPQIPGYIVDDSVRVEQVVKPPQNKTESENTSDKPKRKKKGGKNGKNRRNRKK
.: .. :..:. . ::. . . ::. ::. : . :: . . . .: .. .
XP_005 FSFLPSTAPSFPSPTRNPEVRTPKSATQPQT-TET-NLQTAPKLSTSTSTTGTSHLVK--
190 200 210 220 230
150 160 170 180 190
pF1KB6 KNPCNAEFQNFCIHG-ECKYIEHLEAVT---CKCQQEYFGERCGEK-SMKTHSMIDSS-L
: . ..::..: :: ... : . :::: . : :: :. ::.... . :
XP_005 ---CAEKEKTFCVNGGECFMVKDLSNPSRYLCKCQPGFTGARCTENVPMKVQNQEKAEEL
240 250 260 270 280 290
200 210 220 230 240 250
pF1KB6 SKIALAAIAAFMSAVILTAVAVITVQLRRQYVRKYEGEAEERKKLRQENGNVHAIA
. . .:... :...... ... . . :: . . . :..::.: .:. ::
XP_005 YQKRVLTITGICIALLVVGIMCVVAYCKTKKQRK-KLHDRLRQSLRSERNNMMNIANGPH
300 310 320 330 340 350
XP_005 HPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSN
360 370 380 390 400 410
252 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 02:01:33 2016 done: Mon Nov 7 02:01:34 2016
Total Scan time: 7.520 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]