FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4401, 417 aa
1>>>pF1KE4401 417 - 417 aa - 417 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1015+/-0.00034; mu= 18.1818+/- 0.021
mean_var=69.9551+/-14.422, 0's: 0 Z-trim(115.0): 312 B-trim: 304 in 1/53
Lambda= 0.153343
statistics sampled from 24834 (25170) to 24834 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.66), E-opt: 0.2 (0.295), width: 16
Scan time: 9.080
The best scores are: opt bits E(85289)
XP_016882220 (OMIM: 142440) PREDICTED: serine prot ( 417) 2903 651.4 1.2e-186
XP_006723244 (OMIM: 142440) PREDICTED: serine prot ( 417) 2903 651.4 1.2e-186
XP_005258895 (OMIM: 142440) PREDICTED: serine prot ( 417) 2903 651.4 1.2e-186
NP_892028 (OMIM: 142440) serine protease hepsin pr ( 417) 2903 651.4 1.2e-186
NP_002142 (OMIM: 142440) serine protease hepsin pr ( 417) 2903 651.4 1.2e-186
XP_016882221 (OMIM: 142440) PREDICTED: serine prot ( 393) 2483 558.5 1.1e-158
NP_001243246 (OMIM: 605511) transmembrane protease ( 453) 909 210.3 8e-54
NP_076927 (OMIM: 605511) transmembrane protease se ( 454) 898 207.9 4.3e-53
NP_115780 (OMIM: 605511) transmembrane protease se ( 327) 856 198.5 2.1e-50
NP_001275679 (OMIM: 606751) transmembrane protease ( 413) 792 184.4 4.6e-46
NP_001275680 (OMIM: 606751) transmembrane protease ( 448) 792 184.4 4.9e-46
NP_110397 (OMIM: 606751) transmembrane protease se ( 457) 792 184.4 5e-46
XP_011528035 (OMIM: 602060) PREDICTED: transmembra ( 384) 747 174.4 4.3e-43
XP_005261100 (OMIM: 602060) PREDICTED: transmembra ( 452) 747 174.5 4.9e-43
NP_005647 (OMIM: 602060) transmembrane protease se ( 492) 747 174.5 5.2e-43
XP_011528033 (OMIM: 602060) PREDICTED: transmembra ( 492) 747 174.5 5.2e-43
NP_001128571 (OMIM: 602060) transmembrane protease ( 529) 747 174.5 5.5e-43
NP_002763 (OMIM: 226200,606635) enteropeptidase pr (1019) 745 174.3 1.3e-42
XP_011527961 (OMIM: 226200,606635) PREDICTED: ente (1034) 743 173.8 1.7e-42
XP_011527960 (OMIM: 226200,606635) PREDICTED: ente (1037) 743 173.8 1.7e-42
XP_011527959 (OMIM: 226200,606635) PREDICTED: ente (1049) 743 173.8 1.8e-42
XP_011527957 (OMIM: 226200,606635) PREDICTED: ente (1064) 743 173.8 1.8e-42
XP_011527956 (OMIM: 226200,606635) PREDICTED: ente (1064) 743 173.8 1.8e-42
XP_011527958 (OMIM: 226200,606635) PREDICTED: ente (1064) 743 173.8 1.8e-42
XP_011520753 (OMIM: 613797) PREDICTED: serine prot ( 280) 730 170.5 4.6e-42
XP_011520754 (OMIM: 613797) PREDICTED: serine prot ( 280) 730 170.5 4.6e-42
NP_690851 (OMIM: 613797) serine protease 33 precur ( 280) 730 170.5 4.6e-42
NP_001305325 (OMIM: 229000,612423) plasma kallikre ( 436) 710 166.3 1.4e-40
XP_016863673 (OMIM: 229000,612423) PREDICTED: plas ( 436) 710 166.3 1.4e-40
NP_000883 (OMIM: 229000,612423) plasma kallikrein ( 638) 710 166.4 1.9e-40
XP_016863670 (OMIM: 229000,612423) PREDICTED: plas ( 649) 710 166.4 1.9e-40
XP_011530232 (OMIM: 229000,612423) PREDICTED: plas ( 649) 710 166.4 1.9e-40
NP_001275930 (OMIM: 206200,609862) transmembrane p ( 802) 708 166.0 3e-40
NP_705837 (OMIM: 206200,609862) transmembrane prot ( 811) 708 166.0 3.1e-40
NP_114154 (OMIM: 608018) serine protease 27 isofor ( 290) 698 163.5 6.4e-40
NP_002764 (OMIM: 600823) prostasin preproprotein [ ( 343) 672 157.8 3.9e-38
NP_068813 (OMIM: 606797) suppressor of tumorigenic ( 855) 674 158.5 5.9e-38
XP_005262880 (OMIM: 264900,612416) PREDICTED: coag ( 536) 671 157.7 6.5e-38
XP_006714200 (OMIM: 264900,612416) PREDICTED: coag ( 610) 671 157.7 7.1e-38
NP_000119 (OMIM: 264900,612416) coagulation factor ( 625) 671 157.7 7.3e-38
XP_005262878 (OMIM: 264900,612416) PREDICTED: coag ( 626) 671 157.7 7.3e-38
XP_016878460 (OMIM: 610560) PREDICTED: polyserase- ( 621) 667 156.9 1.3e-37
XP_016878459 (OMIM: 610560) PREDICTED: polyserase- ( 634) 667 156.9 1.4e-37
XP_016878458 (OMIM: 610560) PREDICTED: polyserase- ( 637) 667 156.9 1.4e-37
NP_001245220 (OMIM: 610560) polyserase-2 isoform 3 ( 752) 667 156.9 1.6e-37
XP_016878457 (OMIM: 610560) PREDICTED: polyserase- ( 761) 667 156.9 1.6e-37
NP_001245219 (OMIM: 610560) polyserase-2 isoform 2 ( 850) 667 157.0 1.7e-37
NP_775773 (OMIM: 610560) polyserase-2 isoform 1 pr ( 855) 667 157.0 1.7e-37
NP_001088 (OMIM: 102480) acrosin precursor [Homo s ( 421) 659 155.0 3.4e-37
XP_011526282 (OMIM: 610477) PREDICTED: transmembra ( 690) 656 154.5 7.8e-37
>>XP_016882220 (OMIM: 142440) PREDICTED: serine protease (417 aa)
initn: 2903 init1: 2903 opt: 2903 Z-score: 3471.8 bits: 651.4 E(85289): 1.2e-186
Smith-Waterman score: 2903; 100.0% identity (100.0% similar) in 417 aa overlap (1-417:1-417)
10 20 30 40 50 60
pF1KE4 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
310 320 330 340 350 360
370 380 390 400 410
pF1KE4 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
370 380 390 400 410
>>XP_006723244 (OMIM: 142440) PREDICTED: serine protease (417 aa)
initn: 2903 init1: 2903 opt: 2903 Z-score: 3471.8 bits: 651.4 E(85289): 1.2e-186
Smith-Waterman score: 2903; 100.0% identity (100.0% similar) in 417 aa overlap (1-417:1-417)
10 20 30 40 50 60
pF1KE4 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
310 320 330 340 350 360
370 380 390 400 410
pF1KE4 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
370 380 390 400 410
>>XP_005258895 (OMIM: 142440) PREDICTED: serine protease (417 aa)
initn: 2903 init1: 2903 opt: 2903 Z-score: 3471.8 bits: 651.4 E(85289): 1.2e-186
Smith-Waterman score: 2903; 100.0% identity (100.0% similar) in 417 aa overlap (1-417:1-417)
10 20 30 40 50 60
pF1KE4 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
310 320 330 340 350 360
370 380 390 400 410
pF1KE4 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
370 380 390 400 410
>>NP_892028 (OMIM: 142440) serine protease hepsin precur (417 aa)
initn: 2903 init1: 2903 opt: 2903 Z-score: 3471.8 bits: 651.4 E(85289): 1.2e-186
Smith-Waterman score: 2903; 100.0% identity (100.0% similar) in 417 aa overlap (1-417:1-417)
10 20 30 40 50 60
pF1KE4 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_892 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_892 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_892 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_892 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_892 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_892 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
310 320 330 340 350 360
370 380 390 400 410
pF1KE4 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_892 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
370 380 390 400 410
>>NP_002142 (OMIM: 142440) serine protease hepsin precur (417 aa)
initn: 2903 init1: 2903 opt: 2903 Z-score: 3471.8 bits: 651.4 E(85289): 1.2e-186
Smith-Waterman score: 2903; 100.0% identity (100.0% similar) in 417 aa overlap (1-417:1-417)
10 20 30 40 50 60
pF1KE4 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSAD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 ARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 DEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 RYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 GGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCE
310 320 330 340 350 360
370 380 390 400 410
pF1KE4 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 DSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
370 380 390 400 410
>>XP_016882221 (OMIM: 142440) PREDICTED: serine protease (393 aa)
initn: 2483 init1: 2483 opt: 2483 Z-score: 2970.0 bits: 558.5 E(85289): 1.1e-158
Smith-Waterman score: 2483; 99.2% identity (99.4% similar) in 355 aa overlap (51-405:23-377)
30 40 50 60 70 80
pF1KE4 LTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYPVQVSSADARLMVFDKTEGTWRLLCSSR
: .::::::::::::::::::::::::::
XP_016 MCKMRTREDMQLGNEGPGTALRLIHLQVSSADARLMVFDKTEGTWRLLCSSR
10 20 30 40 50
90 100 110 120 130 140
pF1KE4 SNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCVDEGRLPHTQRLLEVISVCDC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 SNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCVDEGRLPHTQRLLEVISVCDC
60 70 80 90 100 110
150 160 170 180 190 200
pF1KE4 PRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 PRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLT
120 130 140 150 160 170
210 220 230 240 250 260
pF1KE4 AAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIAL
180 190 200 210 220 230
270 280 290 300 310 320
pF1KE4 VHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQYYGQQAGVLQEARVPIISND
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 VHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQYYGQQAGVLQEARVPIISND
240 250 260 270 280 290
330 340 350 360 370 380
pF1KE4 VCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCEDSISRTPRWRLCGIVSWGTG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 VCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCEDSISRTPRWRLCGIVSWGTG
300 310 320 330 340 350
390 400 410
pF1KE4 CALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
:::::::::::::::::::::::::
XP_016 CALAQKPGVYTKVSDFREWIFQAIKVKVGSRWEPGWGRLGV
360 370 380 390
>>NP_001243246 (OMIM: 605511) transmembrane protease ser (453 aa)
initn: 758 init1: 441 opt: 909 Z-score: 1087.3 bits: 210.3 E(85289): 8e-54
Smith-Waterman score: 909; 39.8% identity (67.9% similar) in 364 aa overlap (47-405:100-448)
20 30 40 50 60 70
pF1KE4 KVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYP-VQVSSADARLMVFDKTEGTWRL
: : : :.:.. .: :.:: : ..:.
NP_001 HFDCSGKYRCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVF--TAASWKT
70 80 90 100 110 120
80 90 100 110 120 130
pF1KE4 LCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCVDEGRLPHTQ--RLLE
.::. ... :...: ..:: .. ..: : . .. : .:. :: . : .
NP_001 MCSDDWKGHYANVACAQLGFPSYVSSDNLRVSSLEGQFREEFVSIDH-LLPDDKVTALHH
130 140 150 160 170 180
140 150 160 170 180 190
pF1KE4 VISVCD-CPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSL
. : . : :. .. : ::.:. .::::: . :..::::.::...: ::::::.
NP_001 SVYVREGCASGHVVTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQFQGYHLCGGSV
190 200 210 220 230 240
200 210 220 230 240 250
pF1KE4 LSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQA-SPHGLQLGVQAVVYHGGYLPFRDPNS
.. :..::::: . . . : . .: :. .: .: :. .:::. : : :
NP_001 ITPLWIITAAHCVYDL-YLPKSWTIQVGLVSLLDNPAPSHL-VEKIVYHSKYKPKRL---
250 260 270 280 290 300
260 270 280 290 300 310
pF1KE4 EENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQYYGQQAGVLQE
.:::::..:..:: ..:.::::::: . . . :::.: ..::: :. :. . ::..
NP_001 ---GNDIALMKLAGPLTFNEMIQPVCLPNSEENFPDGKVCWTSGWGATEDGGDASPVLNH
310 320 330 340 350
320 330 340 350 360 370
pF1KE4 ARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCEDSISRTPRWRL
: ::.::: .:: : ::. :.:.:.:::: ::.:.::::::::.::.. : :.:
NP_001 AAVPLISNKICNHRDVYGGIISPSMLCAGYLTGGVDSCQGDSGGPLVCQER--RL--WKL
360 370 380 390 400 410
380 390 400 410
pF1KE4 CGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
: .:.: ::: ..::::::.:..: .:: . ..
NP_001 VGATSFGIGCAEVNKPGVYTRVTSFLDWIHEQMERDLKT
420 430 440 450
>>NP_076927 (OMIM: 605511) transmembrane protease serine (454 aa)
initn: 777 init1: 263 opt: 898 Z-score: 1074.1 bits: 207.9 E(85289): 4.3e-53
Smith-Waterman score: 898; 39.7% identity (67.7% similar) in 365 aa overlap (47-405:100-449)
20 30 40 50 60 70
pF1KE4 KVAALTAGTLLLLTAIGAASWAIVAVLLRSDQEPLYP-VQVSSADARLMVFDKTEGTWRL
: : : :.:.. .: :.:: : ..:.
NP_076 HFDCSGKYRCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVF--TAASWKT
70 80 90 100 110 120
80 90 100 110 120 130
pF1KE4 LCSSRSNARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCVDEGRLPHTQ--RLLE
.::. ... :...: ..:: .. ..: : . .. : .:. :: . : .
NP_076 MCSDDWKGHYANVACAQLGFPSYVSSDNLRVSSLEGQFREEFVSIDH-LLPDDKVTALHH
130 140 150 160 170 180
140 150 160 170 180 190
pF1KE4 VISVCD-CPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSL
. : . : :. .. : ::.:. .::::: . :..::::.::...: ::::::.
NP_076 SVYVREGCASGHVVTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQFQGYHLCGGSV
190 200 210 220 230 240
200 210 220 230 240 250
pF1KE4 LSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQA-SPHGLQLGVQAVVYHGGYLPFRDPNS
.. :..::::: . . . : . .: :. .: .: :. .:::. : : :
NP_076 ITPLWIITAAHCVYDL-YLPKSWTIQVGLVSLLDNPAPSHL-VEKIVYHSKYKPKRL---
250 260 270 280 290 300
260 270 280 290 300 310
pF1KE4 EENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQY-YGQQAGVLQ
.:::::..:..:: ..:.::::::: . . . :::.: ..::: :. :. . ::.
NP_076 ---GNDIALMKLAGPLTFNEMIQPVCLPNSEENFPDGKVCWTSGWGATEDGAGDASPVLN
310 320 330 340 350
320 330 340 350 360 370
pF1KE4 EARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCEDSISRTPRWR
.: ::.::: .:: : ::. :.:.:.:::: ::.:.::::::::.::.. : :.
NP_076 HAAVPLISNKICNHRDVYGGIISPSMLCAGYLTGGVDSCQGDSGGPLVCQER--RL--WK
360 370 380 390 400 410
380 390 400 410
pF1KE4 LCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL
: : .:.: ::: ..::::::.:..: .:: . ..
NP_076 LVGATSFGIGCAEVNKPGVYTRVTSFLDWIHEQMERDLKT
420 430 440 450
>>NP_115780 (OMIM: 605511) transmembrane protease serine (327 aa)
initn: 777 init1: 263 opt: 856 Z-score: 1025.9 bits: 198.5 E(85289): 2.1e-50
Smith-Waterman score: 856; 40.0% identity (68.1% similar) in 335 aa overlap (76-405:1-322)
50 60 70 80 90 100
pF1KE4 SDQEPLYPVQVSSADARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELD
.::. ... :...: ..:: .. ..:
NP_115 MCSDDWKGHYANVACAQLGFPSYVSSDNLR
10 20 30
110 120 130 140 150 160
pF1KE4 VRTAGANGTSGFFCVDEGRLPHTQ--RLLEVISVCD-CPRGRFLAAICQDCGRRKLPVDR
: . .. : .:. :: . : . . : . : :. .. : ::.:. .:
NP_115 VSSLEGQFREEFVSIDH-LLPDDKVTALHHSVYVREGCASGHVVTLQCTACGHRRGYSSR
40 50 60 70 80
170 180 190 200 210 220
pF1KE4 IVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAV
:::: . :..::::.::...: ::::::... :..::::: . . . : . .: :
NP_115 IVGGNMSLLSQWPWQASLQFQGYHLCGGSVITPLWIITAAHCVYDL-YLPKSWTIQVGLV
90 100 110 120 130 140
230 240 250 260 270 280
pF1KE4 AQA-SPHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAA
. .: .: :. .:::. : : : .:::::..:..:: ..:.::::::: .
NP_115 SLLDNPAPSHL-VEKIVYHSKYKPKRL------GNDIALMKLAGPLTFNEMIQPVCLPNS
150 160 170 180 190 200
290 300 310 320 330 340
pF1KE4 GQALVDGKICTVTGWGNTQY-YGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAG
. . :::.: ..::: :. :. . ::..: ::.::: .:: : ::. :.:.:.:::
NP_115 EENFPDGKVCWTSGWGATEDGAGDASPVLNHAAVPLISNKICNHRDVYGGIISPSMLCAG
210 220 230 240 250 260
350 360 370 380 390 400
pF1KE4 YPEGGIDACQGDSGGPFVCEDSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWI
: ::.:.::::::::.::.. : :.: : .:.: ::: ..::::::.:..: .::
NP_115 YLTGGVDSCQGDSGGPLVCQE--RRL--WKLVGATSFGIGCAEVNKPGVYTRVTSFLDWI
270 280 290 300 310
410
pF1KE4 FQAIKTHSEASGMVTQL
. ..
NP_115 HEQMERDLKT
320
>>NP_001275679 (OMIM: 606751) transmembrane protease ser (413 aa)
initn: 697 init1: 221 opt: 792 Z-score: 948.0 bits: 184.4 E(85289): 4.6e-46
Smith-Waterman score: 800; 34.6% identity (61.9% similar) in 396 aa overlap (11-400:30-404)
10 20 30 40
pF1KE4 MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVA
: :.: ..: . : . ..: :..
NP_001 MRRGCAVLGALGLLAGAGVGSWLLVLYLCPAASQPISGTLQDEEITLSCSEASAEEALLP
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE4 VLLRSDQEPLYPVQVSSADARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTH
.: .. ...: : : . . . : :.: . .. : .: :: :
NP_001 ALPKT-----VSFRINSEDFLLEAQVRDQPRWLLVCHEGWSPALGLQICWSLGHLRLTHH
70 80 90 100 110
110 120 130 140 150
pF1KE4 SELDVRTAGANGTSGFFCVDEGRLPHTQRLLEVI--SVCDCPRGRFLAAICQDCGRRKLP
. ... :... : .. :. .:: .: :. .. :..:: : :
NP_001 KGVNLTDIKLNSSQEFAQLS----PRLGGFLEEAWQPRNNCTSGQVVSLRCSECGARPL-
120 130 140 150 160 170
160 170 180 190 200 210
pF1KE4 VDRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPE-RNRVLSRWRVF
..:::::.... ::::::.:. : ::::.:. ::.:::::. : :: :::
NP_001 ASRIVGGQSVAPGRWPWQASVALGFRHTCGGSVLAPRWVVTAAHCMHSFRLARLSSWRVH
180 190 200 210 220 230
220 230 240 250 260 270
pF1KE4 AGAVAQAS--PHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPV
:: :.... :: : :. .. : : ...... :.::..:.. : ... . :
NP_001 AGLVSHSAVRPHQGAL-VERIIPHPLY------SAQNHDYDVALLRLQTALNFSDTVGAV
240 250 260 270 280
280 290 300 310 320 330
pF1KE4 CLPAAGQALVDGKICTVTGWGNTQ-YYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPK
:::: : . :. : :.:::.:. . .. .::.. ::..:...::.. :.. . :.
NP_001 CLPAKEQHFPKGSRCWVSGWGHTHPSHTYSSDMLQDTVVPLFSTQLCNSSCVYSGALTPR
290 300 310 320 330 340
340 350 360 370 380 390
pF1KE4 MFCAGYPEGGIDACQGDSGGPFVCEDSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSD
:.:::: .: ::::::::::.:: :. . ::: :.:::: ::: ..::::.::..
NP_001 MLCAGYLDGRADACQGDSGGPLVCPDGDT----WRLVGVVSWGRGCAEPNHPGVYAKVAE
350 360 370 380 390
400 410
pF1KE4 FREWIFQAIKTHSEASGMVTQL
: .::
NP_001 FLDWIHDTAQDSLL
400 410
417 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 01:21:46 2016 done: Sun Nov 6 01:21:47 2016
Total Scan time: 9.080 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]