FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7040, 462 aa
  1>>>pF1KB7040 462 - 462 aa - 462 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.7258+/-0.000486; mu= 1.2363+/- 0.030
 mean_var=180.4626+/-37.131, 0's: 0 Z-trim(114.2): 21 B-trim: 211 in 1/53
 Lambda= 0.095473
 statistics sampled from 23948 (23960) to 23948 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.636), E-opt: 0.2 (0.281), width: 16
 Scan time: 9.540

The best scores are:                                       opt  bits E(85289)
NP_002998 (OMIM: 182140) semenogelin-1 preproprote ( 462) 3078 436.8 6e-122
NP_002999 (OMIM: 182141) semenogelin-2 precursor [ ( 582) 2133 306.7 1.1e-82
NP_001116437 (OMIM: 613259) repetin [Homo sapiens] ( 784)  254  47.9 0.00011
XP_011507833 (OMIM: 616284) PREDICTED: filaggrin-2 (2239)  238  46.0 0.0013
NP_001014364 (OMIM: 616284) filaggrin-2 [Homo sapi (2391)  238  46.0 0.0013

>>NP_002998 (OMIM: 182140) semenogelin-1 preproprotein [ (462 aa)
 initn: 3078 init1: 3078 opt: 3078 Z-score: 2310.4 bits: 436.8 E(85289): 6e-122
Smith-Waterman score: 3078; 100.0% identity (100.0% similar) in 462 aa overlap (1-462:1-462)

               10        20        30        40        50        60
pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 
GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH 370 380 390 400 410 420 430 440 450 460 pF1KB7 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT :::::::::::::::::::::::::::::::::::::::::: NP_002 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT 430 440 450 460 >>NP_002999 (OMIM: 182141) semenogelin-2 precursor [Homo (582 aa) initn: 3619 init1: 2115 opt: 2133 Z-score: 1605.4 bits: 306.7 E(85289): 1.1e-82 Smith-Waterman score: 2133; 75.0% identity (89.2% similar) in 436 aa overlap (1-436:1-436) 10 20 30 40 50 60 pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK :: :.::::::::::::::::::::::::.::: :::::::::::: ::: .:.:.:: NP_002 MKSIILFVLSLLLILEKQAAVMGQKGGSKGQLPSGSSQFPHGQKGQHYFGQKDQQHTKSK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK ::::::.::::: :::: .:::::::::::::.:::..:::::::::. 
::::::::::: NP_002 GSFSIQHTYHVDINDHDWTRKSQQYDLNALHKATKSKQHLGGSQQLLNYKQEGRDHDKSK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG :::: .:::::::.::.:::::::::::::::::.::: ::::.:::::::::::.:.:: NP_002 GHFHMIVIHHKGGQAHHGTQNPSQDQGNSPSGKGLSSQCSNTEKRLWVHGLSKEQASASG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP ::::: ::::::::::::::::.:::::::::::::::::::::.::::::::.:::: : NP_002 AQKGRTQGGSQSSYVLQTEELVVNKQQRETKNSHQNKGHYQNVVDVREEHSSKLQTSLHP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY ::::.:::: ::::.::::::::::::::::::.:::.:::::.:::: :: ::::.::. NP_002 AHQDRLQHGPKDIFTTQDELLVYNKNQHQTKNLSQDQEHGRKAHKISYPSSRTEERQLHH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY ::..::::::..:: ::::: .::::.:.:: ::.:::..: :::::::::::::.:. NP_002 GEKSVQKDVSKGSISIQTEEKIHGKSQNQVTIHSQDQEHGHKENKISYQSSSTEERHLNC 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH ::.:.:: ::. :: :::. . :::: :. :.: .:.. . : ::.. :. :. NP_002 GEKGIQKGVSKGSISIQTEEQIHGKSQNQVRIPSQAQEYGHKENKISYQSSSTEERRLNS 370 380 390 400 410 420 430 440 450 460 pF1KB7 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT .: .. :.:...: NP_002 GEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKMSYQSSSTEERRLNY 430 440 450 460 470 480 >-- initn: 962 init1: 618 opt: 618 Z-score: 477.7 bits: 98.0 E(85289): 7.2e-20 Smith-Waterman score: 618; 65.1% identity (85.6% similar) in 146 aa overlap (317-462:437-582) 290 300 310 320 330 340 pF1KB7 SYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKI ::::: .::::.:.:::::.:::..: ::. 
NP_002 SYQSSSTEERRLNSGEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKM 410 420 430 440 450 460 350 360 370 380 390 400 pF1KB7 SYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGE ::::::::::::.:: ...:::::: :: : :::: ::::::.:::.:. : :.::::. NP_002 SYQSSSTEERRLNYGGKSTQKDVSQSSISFQIEKLVEGKSQIQTPNPNQDQWSGQNAKGK 470 480 490 500 510 520 410 420 430 440 450 460 pF1KB7 SGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT ::::.. .:::::::::::... : . .::: :.: .: ::.:. :.::::. : NP_002 SGQSADSKQDLLSHEQKGRYKQESSESHNIVITEHEVAQDDHLTQQYNEDRNPIST 530 540 550 560 570 580 >>NP_001116437 (OMIM: 613259) repetin [Homo sapiens] (784 aa) initn: 158 init1: 84 opt: 254 Z-score: 204.8 bits: 47.9 E(85289): 0.00011 Smith-Waterman score: 257; 22.9% identity (55.1% similar) in 428 aa overlap (31-431:129-534) 10 20 30 40 50 60 pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK : :. :: :. : :. . :: .:. .:. NP_001 SQQERGQEGAQDCKFPGNTGRQHRQRHEEERQNSHHSQ-PERQDGDSHHGQPERQDRDSH 100 110 120 130 140 150 70 80 90 100 110 pF1KB7 GSFSIQY---TYHV-----DANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQE . : . ..: : ..: .. . :. :.. .. .:: .: ... :.. NP_001 HGQSEKQDRDSHHSQPERQDRDSHHNQSERQDKDFSFDQSERQSQDSSSG-KKVSHKSTS 160 170 180 190 200 210 120 130 140 150 160 170 pF1KB7 GRDHDKSKGHFHRVVIHHKGGK-AHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGL :. : .::. . .: . .: : :. . : . :.. . .: .. : NP_001 GQA--KWQGHIFALNRCEKPIQDSHYG-QSERHTQQSETLGQASHFNQTNQQKSGSYCGQ 220 230 240 250 260 270 180 190 200 210 220 230 pF1KB7 SKEQTSVSGAQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHS :.. . : . .:: :::. ::.. . . .: . :.. . :. . . : NP_001 SERLGQELGCGQTDRQG--QSSHYGQTDRQDQSYHYGQTDRQGQSSHYSQTDRQGQSSHY 280 290 300 310 320 330 240 250 260 270 280 290 pF1KB7 SKVQTSLCPAHQDKLQH-GSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQS :. : .: . .: :. : . . :.....: .. .. .: .:.... : . 
NP_001 SQ------PDRQGQSSHYGQMD---RKGQCYHYDQTNRQGQG-SHYSQPNRQGQSSHYGQ 340 350 360 370 380 300 310 320 330 340 pF1KB7 SSTEERRLHYGENGVQKDVS---------QSSIYSQTEEKAQGKSQKQITIPSQEQEHSQ .:... :::.. : . : ::: ::: ....::. : .: ....: NP_001 PDTQDQSSHYGQTDRQDQSSHYGQTERQGQSSHYSQMDRQGQGSHYGQTDRQGQSSHYGQ 390 400 410 420 430 440 350 360 370 380 390 pF1KB7 ---KANKISYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEP- .... : ... . . :::.. : .: : ::: .: :.:. . .:. NP_001 PDRQGQNSHYGQTDRQGQSSHYGQTDRQ---GQSSHYSQPDK--QGQSSHYGKIDRQDQS 450 460 470 480 490 400 410 420 430 440 450 pF1KB7 WH-GE-NAKGESGQ--STNREQDLLSHEQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHL .: :. ...:.:.. .:.:. . . . : :. ..:: NP_001 YHYGQPDGQGQSSHYGQTDRQGQSFHYGQPDRQGQSSHYSQMDRQGQSSHYGQTDRQGQS 500 510 520 530 540 550 460 pF1KB7 NNDRNPLFT NP_001 SHYGQTDRQGQSYHYGQTDRQGQSSHYIQSQTGEIQGQNKYFQGTEGTRKASYVEQSGRS 560 570 580 590 600 610 >>XP_011507833 (OMIM: 616284) PREDICTED: filaggrin-2 iso (2239 aa) initn: 61 init1: 61 opt: 238 Z-score: 186.0 bits: 46.0 E(85289): 0.0013 Smith-Waterman score: 241; 23.1% identity (53.4% similar) in 459 aa overlap (18-452:895-1330) 10 20 30 40 pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQK--- :.. .::.: ..:. : :.:. .. XP_011 SSSGQTTGFGQHRSSSGQYSGFGQHGSGSDQSSGFGQHGTGSGQ-SSGFGQYESRSRQSS 870 880 890 900 910 920 50 60 70 80 90 100 pF1KB7 -GQHYSGQKGKQQTESKGSFSIQYT-YHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGG ::: ::.. .. ..:: : : . . :: :: .. ... .:. : XP_011 YGQHGSGSSQSSGYGQHGSNSGQTSGFGQHRPGSGQSSGFGQYGSGSGQSSGFGQHGSGT 930 940 950 960 970 980 110 120 130 140 150 160 pF1KB7 SQQLLHNKQEGRDHDKSKGHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNT ... ..: :. ..: :. . .: :.. ..:. . :. :: :...:.. XP_011 GKSSGFAQHEYRSGQSSYGQHGTGSSQSSGCGQHESGSGPTTSFGQHVSG---SDNFSSS 990 1000 1010 1020 1030 1040 170 180 190 200 210 pF1KB7 EERLWVHGLSK---EQTSVSGAQKGRKQGGSQS----SYVLQTEELVANKQQRETKNSHQ ... : : . : :: . : :: ::. : : .: . .. :..:.. 
XP_011 GQHISDSGQSTGFGQYGSGSGQSTGLGQGESQQVESGSTVHGRQETTHGQTINTTRHSQS 1050 1060 1070 1080 1090 1100 220 230 240 250 260 pF1KB7 NKGHY-QNVVEV-REEHSSKVQTSLCPAHQDKLQHGSKDIFSTQ------DELLVYNKNQ ..:. :. .: :...::. ..: .:. :..: .. . :: . . : XP_011 GQGQSTQTGSRVTRRRRSSQSENSDSEVHS-KVSHRHSEHIHTQAGSHYPKSGSTVRRRQ 1110 1120 1130 1140 1150 270 280 290 300 310 320 pF1KB7 HQTKNLNQDQ-QHGRKANKISYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKS :.. : .::.... : :..: : ..... :...: ..: : : XP_011 GTTHGQRGDTTRHGHSGHGQSTQTGSRTSGRQRFSHS----DATDSEVHS-------GVS 1160 1170 1180 1190 1200 330 340 350 360 370 380 pF1KB7 QKQITIPSQEQEHSQKANKISYQSSSTEERR-LHYGENG--VQKDVSQRSIYSQTEKLVA .. :::: ::: ... . . :...::. ::..: . . : .. .: . .. XP_011 HRP---HSQEQTHSQAGSQHGESESTVHERHETTYGQTGEATGHGHSGHGQSTQRGSRTT 1210 1220 1230 1240 1250 1260 390 400 410 420 430 440 pF1KB7 GKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQED :. . .. :. ... ..: .. : .: ..: :: :: : XP_011 GRRGSGHSESSDSEVHSGGSHRPQSQEQTHGQAGSQHGESGSTVHGRHGTTH----GQTG 1270 1280 1290 1300 1310 1320 450 460 pF1KB7 DSDRHLAQHLNNDRNPLFT :. :: : XP_011 DTTRHAHYHHGKSTQRGSSTTGRRGSGHSESSDSEVHSGGSHTHSGHTHGQSGSQHGESE 1330 1340 1350 1360 1370 1380 >>NP_001014364 (OMIM: 616284) filaggrin-2 [Homo sapiens] (2391 aa) initn: 61 init1: 61 opt: 238 Z-score: 185.6 bits: 46.0 E(85289): 0.0013 Smith-Waterman score: 241; 23.1% identity (53.4% similar) in 459 aa overlap (18-452:1047-1482) 10 20 30 40 pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQK--- :.. .::.: ..:. : :.:. .. NP_001 SSSGQTTGFGQHRSSSGQYSGFGQHGSGSDQSSGFGQHGTGSGQ-SSGFGQYESRSRQSS 1020 1030 1040 1050 1060 1070 50 60 70 80 90 100 pF1KB7 -GQHYSGQKGKQQTESKGSFSIQYT-YHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGG ::: ::.. .. ..:: : : . . :: :: .. ... .:. : NP_001 YGQHGSGSSQSSGYGQHGSNSGQTSGFGQHRPGSGQSSGFGQYGSGSGQSSGFGQHGSGT 1080 1090 1100 1110 1120 1130 110 120 130 140 150 160 pF1KB7 SQQLLHNKQEGRDHDKSKGHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNT ... ..: :. ..: :. . .: :.. ..:. . :. :: :...:.. 
NP_001 GKSSGFAQHEYRSGQSSYGQHGTGSSQSSGCGQHESGSGPTTSFGQHVSG---SDNFSSS 1140 1150 1160 1170 1180 1190 170 180 190 200 210 pF1KB7 EERLWVHGLSK---EQTSVSGAQKGRKQGGSQS----SYVLQTEELVANKQQRETKNSHQ ... : : . : :: . : :: ::. : : .: . .. :..:.. NP_001 GQHISDSGQSTGFGQYGSGSGQSTGLGQGESQQVESGSTVHGRQETTHGQTINTTRHSQS 1200 1210 1220 1230 1240 1250 220 230 240 250 260 pF1KB7 NKGHY-QNVVEV-REEHSSKVQTSLCPAHQDKLQHGSKDIFSTQ------DELLVYNKNQ ..:. :. .: :...::. ..: .:. :..: .. . :: . . : NP_001 GQGQSTQTGSRVTRRRRSSQSENSDSEVHS-KVSHRHSEHIHTQAGSHYPKSGSTVRRRQ 1260 1270 1280 1290 1300 1310 270 280 290 300 310 320 pF1KB7 HQTKNLNQDQ-QHGRKANKISYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKS :.. : .::.... : :..: : ..... :...: ..: : : NP_001 GTTHGQRGDTTRHGHSGHGQSTQTGSRTSGRQRFSHS----DATDSEVHS-------GVS 1320 1330 1340 1350 1360 330 340 350 360 370 380 pF1KB7 QKQITIPSQEQEHSQKANKISYQSSSTEERR-LHYGENG--VQKDVSQRSIYSQTEKLVA .. :::: ::: ... . . :...::. ::..: . . : .. .: . .. NP_001 HRP---HSQEQTHSQAGSQHGESESTVHERHETTYGQTGEATGHGHSGHGQSTQRGSRTT 1370 1380 1390 1400 1410 390 400 410 420 430 440 pF1KB7 GKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQED :. . .. :. ... ..: .. : .: ..: :: :: : NP_001 GRRGSGHSESSDSEVHSGGSHRPQSQEQTHGQAGSQHGESGSTVHGRHGTTH----GQTG 1420 1430 1440 1450 1460 1470 450 460 pF1KB7 DSDRHLAQHLNNDRNPLFT :. :: : NP_001 DTTRHAHYHHGKSTQRGSSTTGRRGSGHSESSDSEVHSGGSHTHSGHTHGQSGSQHGESE 1480 1490 1500 1510 1520 1530 462 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 13:03:26 2016 done: Sun Nov 6 13:03:27 2016 Total Scan time: 9.540 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]