FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7040, 462 aa
1>>>pF1KB7040 462 - 462 aa - 462 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.7258+/-0.000486; mu= 1.2363+/- 0.030
mean_var=180.4626+/-37.131, 0's: 0 Z-trim(114.2): 21 B-trim: 211 in 1/53
Lambda= 0.095473
statistics sampled from 23948 (23960) to 23948 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.636), E-opt: 0.2 (0.281), width: 16
Scan time: 9.540
The best scores are:                                       opt bits E(85289)
NP_002998 (OMIM: 182140) semenogelin-1 preproprote ( 462) 3078 436.8  6e-122
NP_002999 (OMIM: 182141) semenogelin-2 precursor [ ( 582) 2133 306.7 1.1e-82
NP_001116437 (OMIM: 613259) repetin [Homo sapiens] ( 784)  254  47.9 0.00011
XP_011507833 (OMIM: 616284) PREDICTED: filaggrin-2 (2239)  238  46.0  0.0013
NP_001014364 (OMIM: 616284) filaggrin-2 [Homo sapi (2391)  238  46.0  0.0013
>>NP_002998 (OMIM: 182140) semenogelin-1 preproprotein [ (462 aa)
initn: 3078 init1: 3078 opt: 3078 Z-score: 2310.4 bits: 436.8 E(85289): 6e-122
Smith-Waterman score: 3078; 100.0% identity (100.0% similar) in 462 aa overlap (1-462:1-462)
               10        20        30        40        50        60
pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB7 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB7 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KB7 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH
              370       380       390       400       410       420
              430       440       450       460
pF1KB7 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT
       ::::::::::::::::::::::::::::::::::::::::::
NP_002 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT
              430       440       450       460
>>NP_002999 (OMIM: 182141) semenogelin-2 precursor [Homo (582 aa)
initn: 3619 init1: 2115 opt: 2133 Z-score: 1605.4 bits: 306.7 E(85289): 1.1e-82
Smith-Waterman score: 2133; 75.0% identity (89.2% similar) in 436 aa overlap (1-436:1-436)
10 20 30 40 50 60
pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK
:: :.::::::::::::::::::::::::.::: :::::::::::: ::: .:.:.::
NP_002 MKSIILFVLSLLLILEKQAAVMGQKGGSKGQLPSGSSQFPHGQKGQHYFGQKDQQHTKSK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK
::::::.::::: :::: .:::::::::::::.:::..:::::::::. :::::::::::
NP_002 GSFSIQHTYHVDINDHDWTRKSQQYDLNALHKATKSKQHLGGSQQLLNYKQEGRDHDKSK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG
:::: .:::::::.::.:::::::::::::::::.::: ::::.:::::::::::.:.::
NP_002 GHFHMIVIHHKGGQAHHGTQNPSQDQGNSPSGKGLSSQCSNTEKRLWVHGLSKEQASASG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP
::::: ::::::::::::::::.:::::::::::::::::::::.::::::::.:::: :
NP_002 AQKGRTQGGSQSSYVLQTEELVVNKQQRETKNSHQNKGHYQNVVDVREEHSSKLQTSLHP
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY
::::.:::: ::::.::::::::::::::::::.:::.:::::.:::: :: ::::.::.
NP_002 AHQDRLQHGPKDIFTTQDELLVYNKNQHQTKNLSQDQEHGRKAHKISYPSSRTEERQLHH
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB7 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY
::..::::::..:: ::::: .::::.:.:: ::.:::..: :::::::::::::.:.
NP_002 GEKSVQKDVSKGSISIQTEEKIHGKSQNQVTIHSQDQEHGHKENKISYQSSSTEERHLNC
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB7 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH
::.:.:: ::. :: :::. . :::: :. :.: .:.. . : ::.. :. :.
NP_002 GEKGIQKGVSKGSISIQTEEQIHGKSQNQVRIPSQAQEYGHKENKISYQSSSTEERRLNS
370 380 390 400 410 420
430 440 450 460
pF1KB7 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT
.: .. :.:...:
NP_002 GEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKMSYQSSSTEERRLNY
430 440 450 460 470 480
>--
initn: 962 init1: 618 opt: 618 Z-score: 477.7 bits: 98.0 E(85289): 7.2e-20
Smith-Waterman score: 618; 65.1% identity (85.6% similar) in 146 aa overlap (317-462:437-582)
290 300 310 320 330 340
pF1KB7 SYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKI
::::: .::::.:.:::::.:::..: ::.
NP_002 SYQSSSTEERRLNSGEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKM
410 420 430 440 450 460
350 360 370 380 390 400
pF1KB7 SYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGE
::::::::::::.:: ...:::::: :: : :::: ::::::.:::.:. : :.::::.
NP_002 SYQSSSTEERRLNYGGKSTQKDVSQSSISFQIEKLVEGKSQIQTPNPNQDQWSGQNAKGK
470 480 490 500 510 520
410 420 430 440 450 460
pF1KB7 SGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT
::::.. .:::::::::::... : . .::: :.: .: ::.:. :.::::. :
NP_002 SGQSADSKQDLLSHEQKGRYKQESSESHNIVITEHEVAQDDHLTQQYNEDRNPIST
530 540 550 560 570 580
>>NP_001116437 (OMIM: 613259) repetin [Homo sapiens] (784 aa)
initn: 158 init1: 84 opt: 254 Z-score: 204.8 bits: 47.9 E(85289): 0.00011
Smith-Waterman score: 257; 22.9% identity (55.1% similar) in 428 aa overlap (31-431:129-534)
10 20 30 40 50 60
pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK
: :. :: :. : :. . :: .:. .:.
NP_001 SQQERGQEGAQDCKFPGNTGRQHRQRHEEERQNSHHSQ-PERQDGDSHHGQPERQDRDSH
100 110 120 130 140 150
70 80 90 100 110
pF1KB7 GSFSIQY---TYHV-----DANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQE
. : . ..: : ..: .. . :. :.. .. .:: .: ... :..
NP_001 HGQSEKQDRDSHHSQPERQDRDSHHNQSERQDKDFSFDQSERQSQDSSSG-KKVSHKSTS
160 170 180 190 200 210
120 130 140 150 160 170
pF1KB7 GRDHDKSKGHFHRVVIHHKGGK-AHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGL
:. : .::. . .: . .: : :. . : . :.. . .: .. :
NP_001 GQA--KWQGHIFALNRCEKPIQDSHYG-QSERHTQQSETLGQASHFNQTNQQKSGSYCGQ
220 230 240 250 260 270
180 190 200 210 220 230
pF1KB7 SKEQTSVSGAQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHS
:.. . : . .:: :::. ::.. . . .: . :.. . :. . . :
NP_001 SERLGQELGCGQTDRQG--QSSHYGQTDRQDQSYHYGQTDRQGQSSHYSQTDRQGQSSHY
280 290 300 310 320 330
240 250 260 270 280 290
pF1KB7 SKVQTSLCPAHQDKLQH-GSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQS
:. : .: . .: :. : . . :.....: .. .. .: .:.... : .
NP_001 SQ------PDRQGQSSHYGQMD---RKGQCYHYDQTNRQGQG-SHYSQPNRQGQSSHYGQ
340 350 360 370 380
300 310 320 330 340
pF1KB7 SSTEERRLHYGENGVQKDVS---------QSSIYSQTEEKAQGKSQKQITIPSQEQEHSQ
.:... :::.. : . : ::: ::: ....::. : .: ....:
NP_001 PDTQDQSSHYGQTDRQDQSSHYGQTERQGQSSHYSQMDRQGQGSHYGQTDRQGQSSHYGQ
390 400 410 420 430 440
350 360 370 380 390
pF1KB7 ---KANKISYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEP-
.... : ... . . :::.. : .: : ::: .: :.:. . .:.
NP_001 PDRQGQNSHYGQTDRQGQSSHYGQTDRQ---GQSSHYSQPDK--QGQSSHYGKIDRQDQS
450 460 470 480 490
400 410 420 430 440 450
pF1KB7 WH-GE-NAKGESGQ--STNREQDLLSHEQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHL
.: :. ...:.:.. .:.:. . . . : :. ..::
NP_001 YHYGQPDGQGQSSHYGQTDRQGQSFHYGQPDRQGQSSHYSQMDRQGQSSHYGQTDRQGQS
500 510 520 530 540 550
460
pF1KB7 NNDRNPLFT
NP_001 SHYGQTDRQGQSYHYGQTDRQGQSSHYIQSQTGEIQGQNKYFQGTEGTRKASYVEQSGRS
560 570 580 590 600 610
>>XP_011507833 (OMIM: 616284) PREDICTED: filaggrin-2 iso (2239 aa)
initn: 61 init1: 61 opt: 238 Z-score: 186.0 bits: 46.0 E(85289): 0.0013
Smith-Waterman score: 241; 23.1% identity (53.4% similar) in 459 aa overlap (18-452:895-1330)
10 20 30 40
pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQK---
:.. .::.: ..:. : :.:. ..
XP_011 SSSGQTTGFGQHRSSSGQYSGFGQHGSGSDQSSGFGQHGTGSGQ-SSGFGQYESRSRQSS
870 880 890 900 910 920
50 60 70 80 90 100
pF1KB7 -GQHYSGQKGKQQTESKGSFSIQYT-YHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGG
::: ::.. .. ..:: : : . . :: :: .. ... .:. :
XP_011 YGQHGSGSSQSSGYGQHGSNSGQTSGFGQHRPGSGQSSGFGQYGSGSGQSSGFGQHGSGT
930 940 950 960 970 980
110 120 130 140 150 160
pF1KB7 SQQLLHNKQEGRDHDKSKGHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNT
... ..: :. ..: :. . .: :.. ..:. . :. :: :...:..
XP_011 GKSSGFAQHEYRSGQSSYGQHGTGSSQSSGCGQHESGSGPTTSFGQHVSG---SDNFSSS
990 1000 1010 1020 1030 1040
170 180 190 200 210
pF1KB7 EERLWVHGLSK---EQTSVSGAQKGRKQGGSQS----SYVLQTEELVANKQQRETKNSHQ
... : : . : :: . : :: ::. : : .: . .. :..:..
XP_011 GQHISDSGQSTGFGQYGSGSGQSTGLGQGESQQVESGSTVHGRQETTHGQTINTTRHSQS
1050 1060 1070 1080 1090 1100
220 230 240 250 260
pF1KB7 NKGHY-QNVVEV-REEHSSKVQTSLCPAHQDKLQHGSKDIFSTQ------DELLVYNKNQ
..:. :. .: :...::. ..: .:. :..: .. . :: . . :
XP_011 GQGQSTQTGSRVTRRRRSSQSENSDSEVHS-KVSHRHSEHIHTQAGSHYPKSGSTVRRRQ
1110 1120 1130 1140 1150
270 280 290 300 310 320
pF1KB7 HQTKNLNQDQ-QHGRKANKISYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKS
:.. : .::.... : :..: : ..... :...: ..: : :
XP_011 GTTHGQRGDTTRHGHSGHGQSTQTGSRTSGRQRFSHS----DATDSEVHS-------GVS
1160 1170 1180 1190 1200
330 340 350 360 370 380
pF1KB7 QKQITIPSQEQEHSQKANKISYQSSSTEERR-LHYGENG--VQKDVSQRSIYSQTEKLVA
.. :::: ::: ... . . :...::. ::..: . . : .. .: . ..
XP_011 HRP---HSQEQTHSQAGSQHGESESTVHERHETTYGQTGEATGHGHSGHGQSTQRGSRTT
1210 1220 1230 1240 1250 1260
390 400 410 420 430 440
pF1KB7 GKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQED
:. . .. :. ... ..: .. : .: ..: :: :: :
XP_011 GRRGSGHSESSDSEVHSGGSHRPQSQEQTHGQAGSQHGESGSTVHGRHGTTH----GQTG
1270 1280 1290 1300 1310 1320
450 460
pF1KB7 DSDRHLAQHLNNDRNPLFT
:. :: :
XP_011 DTTRHAHYHHGKSTQRGSSTTGRRGSGHSESSDSEVHSGGSHTHSGHTHGQSGSQHGESE
1330 1340 1350 1360 1370 1380
>>NP_001014364 (OMIM: 616284) filaggrin-2 [Homo sapiens] (2391 aa)
initn: 61 init1: 61 opt: 238 Z-score: 185.6 bits: 46.0 E(85289): 0.0013
Smith-Waterman score: 241; 23.1% identity (53.4% similar) in 459 aa overlap (18-452:1047-1482)
10 20 30 40
pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQK---
:.. .::.: ..:. : :.:. ..
NP_001 SSSGQTTGFGQHRSSSGQYSGFGQHGSGSDQSSGFGQHGTGSGQ-SSGFGQYESRSRQSS
1020 1030 1040 1050 1060 1070
50 60 70 80 90 100
pF1KB7 -GQHYSGQKGKQQTESKGSFSIQYT-YHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGG
::: ::.. .. ..:: : : . . :: :: .. ... .:. :
NP_001 YGQHGSGSSQSSGYGQHGSNSGQTSGFGQHRPGSGQSSGFGQYGSGSGQSSGFGQHGSGT
1080 1090 1100 1110 1120 1130
110 120 130 140 150 160
pF1KB7 SQQLLHNKQEGRDHDKSKGHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNT
... ..: :. ..: :. . .: :.. ..:. . :. :: :...:..
NP_001 GKSSGFAQHEYRSGQSSYGQHGTGSSQSSGCGQHESGSGPTTSFGQHVSG---SDNFSSS
1140 1150 1160 1170 1180 1190
170 180 190 200 210
pF1KB7 EERLWVHGLSK---EQTSVSGAQKGRKQGGSQS----SYVLQTEELVANKQQRETKNSHQ
... : : . : :: . : :: ::. : : .: . .. :..:..
NP_001 GQHISDSGQSTGFGQYGSGSGQSTGLGQGESQQVESGSTVHGRQETTHGQTINTTRHSQS
1200 1210 1220 1230 1240 1250
220 230 240 250 260
pF1KB7 NKGHY-QNVVEV-REEHSSKVQTSLCPAHQDKLQHGSKDIFSTQ------DELLVYNKNQ
..:. :. .: :...::. ..: .:. :..: .. . :: . . :
NP_001 GQGQSTQTGSRVTRRRRSSQSENSDSEVHS-KVSHRHSEHIHTQAGSHYPKSGSTVRRRQ
1260 1270 1280 1290 1300 1310
270 280 290 300 310 320
pF1KB7 HQTKNLNQDQ-QHGRKANKISYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKS
:.. : .::.... : :..: : ..... :...: ..: : :
NP_001 GTTHGQRGDTTRHGHSGHGQSTQTGSRTSGRQRFSHS----DATDSEVHS-------GVS
1320 1330 1340 1350 1360
330 340 350 360 370 380
pF1KB7 QKQITIPSQEQEHSQKANKISYQSSSTEERR-LHYGENG--VQKDVSQRSIYSQTEKLVA
.. :::: ::: ... . . :...::. ::..: . . : .. .: . ..
NP_001 HRP---HSQEQTHSQAGSQHGESESTVHERHETTYGQTGEATGHGHSGHGQSTQRGSRTT
1370 1380 1390 1400 1410
390 400 410 420 430 440
pF1KB7 GKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQED
:. . .. :. ... ..: .. : .: ..: :: :: :
NP_001 GRRGSGHSESSDSEVHSGGSHRPQSQEQTHGQAGSQHGESGSTVHGRHGTTH----GQTG
1420 1430 1440 1450 1460 1470
450 460
pF1KB7 DSDRHLAQHLNNDRNPLFT
:. :: :
NP_001 DTTRHAHYHHGKSTQRGSSTTGRRGSGHSESSDSEVHSGGSHTHSGHTHGQSGSQHGESE
1480 1490 1500 1510 1520 1530
462 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 13:03:26 2016 done: Sun Nov 6 13:03:27 2016
Total Scan time: 9.540 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]