FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4333, 343 aa
1>>>pF1KE4333 343 - 343 aa - 343 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4094+/-0.000335; mu= 16.1485+/- 0.021
mean_var=76.9502+/-15.225, 0's: 0 Z-trim(116.4): 318 B-trim: 1093 in 2/50
Lambda= 0.146207
statistics sampled from 27204 (27524) to 27204 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.706), E-opt: 0.2 (0.323), width: 16
Scan time: 8.100
The best scores are: opt bits E(85289)
NP_002764 (OMIM: 600823) prostasin preproprotein [ ( 343) 2374 510.0 3e-144
XP_011520753 (OMIM: 613797) PREDICTED: serine prot ( 280) 897 198.4 1.6e-50
XP_011520754 (OMIM: 613797) PREDICTED: serine prot ( 280) 897 198.4 1.6e-50
NP_690851 (OMIM: 613797) serine protease 33 precur ( 280) 897 198.4 1.6e-50
NP_114154 (OMIM: 608018) serine protease 27 isofor ( 290) 833 184.9 1.9e-46
XP_005255530 (OMIM: 609343) PREDICTED: brain-speci ( 317) 823 182.8 8.6e-46
NP_071402 (OMIM: 609343) brain-specific serine pro ( 317) 823 182.8 8.6e-46
XP_016878460 (OMIM: 610560) PREDICTED: polyserase- ( 621) 773 172.5 2.2e-42
XP_016878459 (OMIM: 610560) PREDICTED: polyserase- ( 634) 773 172.5 2.2e-42
XP_016878458 (OMIM: 610560) PREDICTED: polyserase- ( 637) 773 172.5 2.2e-42
NP_001245220 (OMIM: 610560) polyserase-2 isoform 3 ( 752) 773 172.5 2.5e-42
XP_016878457 (OMIM: 610560) PREDICTED: polyserase- ( 761) 773 172.5 2.6e-42
NP_001245219 (OMIM: 610560) polyserase-2 isoform 2 ( 850) 773 172.6 2.8e-42
NP_775773 (OMIM: 610560) polyserase-2 isoform 1 pr ( 855) 773 172.6 2.8e-42
NP_659205 (OMIM: 608159) testisin isoform 2 prepro ( 312) 738 164.9 2.1e-40
NP_006790 (OMIM: 608159) testisin isoform 1 prepro ( 314) 730 163.2 6.9e-40
NP_003285 (OMIM: 191080) tryptase alpha/beta-1 pre ( 275) 721 161.3 2.3e-39
NP_077078 (OMIM: 191081) tryptase beta-2 prepropro ( 275) 712 159.4 8.6e-39
NP_001275930 (OMIM: 206200,609862) transmembrane p ( 802) 709 159.0 3.1e-38
NP_705837 (OMIM: 206200,609862) transmembrane prot ( 811) 709 159.1 3.1e-38
NP_001275679 (OMIM: 606751) transmembrane protease ( 413) 688 154.4 4e-37
NP_001275680 (OMIM: 606751) transmembrane protease ( 448) 688 154.4 4.2e-37
NP_110397 (OMIM: 606751) transmembrane protease se ( 457) 688 154.4 4.3e-37
XP_016882221 (OMIM: 142440) PREDICTED: serine prot ( 393) 672 151.0 3.9e-36
NP_002142 (OMIM: 142440) serine protease hepsin pr ( 417) 672 151.0 4.1e-36
XP_006723244 (OMIM: 142440) PREDICTED: serine prot ( 417) 672 151.0 4.1e-36
NP_892028 (OMIM: 142440) serine protease hepsin pr ( 417) 672 151.0 4.1e-36
XP_005258895 (OMIM: 142440) PREDICTED: serine prot ( 417) 672 151.0 4.1e-36
XP_016882220 (OMIM: 142440) PREDICTED: serine prot ( 417) 672 151.0 4.1e-36
NP_001193718 (OMIM: 610050) transmembrane protease ( 532) 667 150.1 1e-35
NP_001070731 (OMIM: 610050) transmembrane protease ( 567) 667 150.1 1.1e-35
NP_001231924 (OMIM: 610050) transmembrane protease ( 563) 666 149.9 1.3e-35
NP_001519 (OMIM: 604552) hepatocyte growth factor ( 655) 648 146.1 2e-34
NP_001284368 (OMIM: 604552) hepatocyte growth fact ( 662) 648 146.1 2e-34
XP_011528035 (OMIM: 602060) PREDICTED: transmembra ( 384) 622 140.5 5.8e-33
XP_005261100 (OMIM: 602060) PREDICTED: transmembra ( 452) 622 140.5 6.6e-33
XP_011528033 (OMIM: 602060) PREDICTED: transmembra ( 492) 622 140.6 7.1e-33
NP_005647 (OMIM: 602060) transmembrane protease se ( 492) 622 140.6 7.1e-33
NP_001128571 (OMIM: 602060) transmembrane protease ( 529) 622 140.6 7.5e-33
NP_001034592 (OMIM: 610561) serine protease 53 pre ( 553) 613 138.7 2.9e-32
XP_011544122 (OMIM: 610561) PREDICTED: serine prot ( 567) 613 138.7 2.9e-32
XP_011544121 (OMIM: 610561) PREDICTED: serine prot ( 576) 613 138.7 3e-32
NP_001088 (OMIM: 102480) acrosin precursor [Homo s ( 421) 598 135.4 2.1e-31
XP_011535778 (OMIM: 227500,608446,613878) PREDICTE ( 364) 594 134.6 3.3e-31
NP_001254483 (OMIM: 227500,608446,613878) coagulat ( 382) 594 134.6 3.5e-31
XP_006720026 (OMIM: 227500,608446,613878) PREDICTE ( 412) 594 134.6 3.7e-31
XP_011535777 (OMIM: 227500,608446,613878) PREDICTE ( 433) 594 134.6 3.8e-31
NP_062562 (OMIM: 227500,608446,613878) coagulation ( 444) 594 134.6 3.9e-31
NP_000122 (OMIM: 227500,608446,613878) coagulation ( 466) 594 134.6 4.1e-31
XP_011535776 (OMIM: 227500,608446,613878) PREDICTE ( 495) 594 134.6 4.2e-31
>>NP_002764 (OMIM: 600823) prostasin preproprotein [Homo (343 aa)
initn: 2374 init1: 2374 opt: 2374 Z-score: 2710.7 bits: 510.0 E(85289): 3e-144
Smith-Waterman score: 2374; 100.0% identity (100.0% similar) in 343 aa overlap (1-343:1-343)
10 20 30 40 50 60
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVAPQARITGGSSAVAGQWPWQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVAPQARITGGSSAVAGQWPWQV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 SITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTLKDI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 SITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTLKDI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 IPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVAPSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 IPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVAPSV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 SLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQGDSGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 SLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQGDSGG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 PLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQTQESQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 PLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQTQESQ
250 260 270 280 290 300
310 320 330 340
pF1KE4 PDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
:::::::::::::::::::::::::::::::::::::::::::
NP_002 PDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
310 320 330 340
>>XP_011520753 (OMIM: 613797) PREDICTED: serine protease (280 aa)
initn: 880 init1: 575 opt: 897 Z-score: 1028.2 bits: 198.4 E(85289): 1.6e-50
Smith-Waterman score: 897; 46.7% identity (71.7% similar) in 276 aa overlap (15-286:9-279)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAE-APCGVAPQ--ARITGGSSAVAGQWP
: .:: :: ..:..: . : :: :. .::.:: .. :.::
XP_011 MRGVSCLQVLLLLVLG----AAGTQGRKSAACG-QPRMSSRIVGGRDGRDGEWP
10 20 30 40
60 70 80 90 100 110
pF1KE4 WQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTL
::.:: ..:.:::::::.. ::::.:::::: . :.:.::: .: : : . .
XP_011 WQASIQHRGAHVCGGSLIAPQWVLTAAHCFPRRALPAEYRVRLGALRLGSTSPRTLSVPV
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA
. .. :.: ..:..::.::::: ::. .: ..:.:::. .: : : : ::::: .
XP_011 RRVLLPPDYSEDGARGDLALLQLRRPVPLSARVQPVCLPVPGARPPPGTPCRVTGWGSLR
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNIDAK-PEEPHFVQEDMVCAGYVEGGKDACQG
:.: : .::: ..:::.. .::. ::.. : :. ..: .:::: .: ::::::
XP_011 PGVPLPEWRPLQGVRVPLLDSRTCDGLYHVGADVPQAERIVLPGSLCAGYPQGHKDACQG
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE4 DSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQT
::::::.: : : :.:.:::: .:. ::::::: ...:. :::..:.
XP_011 DSGGPLTCLQSGSWVLVGVVSWGKGCALPNRPGVYTSVATYSPWIQARVSF
230 240 250 260 270 280
300 310 320 330 340
pF1KE4 QESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
>>XP_011520754 (OMIM: 613797) PREDICTED: serine protease (280 aa)
initn: 880 init1: 575 opt: 897 Z-score: 1028.2 bits: 198.4 E(85289): 1.6e-50
Smith-Waterman score: 897; 46.7% identity (71.7% similar) in 276 aa overlap (15-286:9-279)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAE-APCGVAPQ--ARITGGSSAVAGQWP
: .:: :: ..:..: . : :: :. .::.:: .. :.::
XP_011 MRGVSCLQVLLLLVLG----AAGTQGRKSAACG-QPRMSSRIVGGRDGRDGEWP
10 20 30 40
60 70 80 90 100 110
pF1KE4 WQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTL
::.:: ..:.:::::::.. ::::.:::::: . :.:.::: .: : : . .
XP_011 WQASIQHRGAHVCGGSLIAPQWVLTAAHCFPRRALPAEYRVRLGALRLGSTSPRTLSVPV
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA
. .. :.: ..:..::.::::: ::. .: ..:.:::. .: : : : ::::: .
XP_011 RRVLLPPDYSEDGARGDLALLQLRRPVPLSARVQPVCLPVPGARPPPGTPCRVTGWGSLR
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNIDAK-PEEPHFVQEDMVCAGYVEGGKDACQG
:.: : .::: ..:::.. .::. ::.. : :. ..: .:::: .: ::::::
XP_011 PGVPLPEWRPLQGVRVPLLDSRTCDGLYHVGADVPQAERIVLPGSLCAGYPQGHKDACQG
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE4 DSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQT
::::::.: : : :.:.:::: .:. ::::::: ...:. :::..:.
XP_011 DSGGPLTCLQSGSWVLVGVVSWGKGCALPNRPGVYTSVATYSPWIQARVSF
230 240 250 260 270 280
300 310 320 330 340
pF1KE4 QESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
>>NP_690851 (OMIM: 613797) serine protease 33 precursor (280 aa)
initn: 880 init1: 575 opt: 897 Z-score: 1028.2 bits: 198.4 E(85289): 1.6e-50
Smith-Waterman score: 897; 46.7% identity (71.7% similar) in 276 aa overlap (15-286:9-279)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAE-APCGVAPQ--ARITGGSSAVAGQWP
: .:: :: ..:..: . : :: :. .::.:: .. :.::
NP_690 MRGVSCLQVLLLLVLG----AAGTQGRKSAACG-QPRMSSRIVGGRDGRDGEWP
10 20 30 40
60 70 80 90 100 110
pF1KE4 WQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTL
::.:: ..:.:::::::.. ::::.:::::: . :.:.::: .: : : . .
NP_690 WQASIQHRGAHVCGGSLIAPQWVLTAAHCFPRRALPAEYRVRLGALRLGSTSPRTLSVPV
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA
. .. :.: ..:..::.::::: ::. .: ..:.:::. .: : : : ::::: .
NP_690 RRVLLPPDYSEDGARGDLALLQLRRPVPLSARVQPVCLPVPGARPPPGTPCRVTGWGSLR
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNIDAK-PEEPHFVQEDMVCAGYVEGGKDACQG
:.: : .::: ..:::.. .::. ::.. : :. ..: .:::: .: ::::::
NP_690 PGVPLPEWRPLQGVRVPLLDSRTCDGLYHVGADVPQAERIVLPGSLCAGYPQGHKDACQG
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE4 DSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQT
::::::.: : : :.:.:::: .:. ::::::: ...:. :::..:.
NP_690 DSGGPLTCLQSGSWVLVGVVSWGKGCALPNRPGVYTSVATYSPWIQARVSF
230 240 250 260 270 280
300 310 320 330 340
pF1KE4 QESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
>>NP_114154 (OMIM: 608018) serine protease 27 isoform 1 (290 aa)
initn: 811 init1: 516 opt: 833 Z-score: 955.0 bits: 184.9 E(85289): 1.9e-46
Smith-Waterman score: 833; 43.8% identity (69.0% similar) in 281 aa overlap (14-289:6-280)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVAPQA--RITGGSSAVAGQWPW
:: .:: :: :. : . :: :. :..::... :.:::
NP_114 MRRPAAVPLLL---LLCFGSQRAKAATACG-RPRMLNRMVGGQDTQEGEWPW
10 20 30 40
60 70 80 90 100 110
pF1KE4 QVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTLK
:::: .: : :::::..:::::.::::: . . :.: :::.:: . . : . ..
NP_114 QVSIQRNGSHFCGGSLIAEQWVLTAAHCFRNTSETSLYQVLLGARQLVQPGPHAMYARVR
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE4 DIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVAP
.. .: : .:..:.::..: :. :. :: :.::: .. : .:..: ::::: .:
NP_114 QVESNPLYQGTASSADVALVELEAPVPFTNYILPVCLPDPSVIFETGMNCWVTGWG--SP
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE4 SVSLLTPKP--LQQLEVPLISRETCNCLYNIDAK-PEEPHFVQEDMVCAGYVEGGKDACQ
: : :.: ::.: ::.:. :: ::. :.. .:. ...::.:::. :: ::::.
NP_114 SEEDLLPEPRILQKLAVPIIDTPKCNLLYSKDTEFGYQPKTIKNDMLCAGFEEGKKDACK
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE4 GDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQ
::::::: : : : .:..:::..:. .:::::: .... .::. . .::
NP_114 GDSGGPLVCLVGQSWLQAGVISWGEGCARQNRPGVYIRVTAHHNWIHRIIPKLQFQPARL
230 240 250 260 270 280
300 310 320 330 340
pF1KE4 TQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
NP_114 GGQK
290
>>XP_005255530 (OMIM: 609343) PREDICTED: brain-specific (317 aa)
initn: 841 init1: 335 opt: 823 Z-score: 943.1 bits: 182.8 E(85289): 8.6e-46
Smith-Waterman score: 823; 44.4% identity (67.9% similar) in 293 aa overlap (7-291:10-295)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAP----CGVAPQA--RITGGSSA
:: : ::. . :: :: : . ..:. : :: :: :..:: ..
XP_005 MVVSGAPPALGGGCLGTFTSLL---LLASTAILNAARIPVPPACG-KPQQLNRVVGGEDS
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 VAGQWPWQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA-YEVKLGAHQLDSYSE
. ..::: ::: .:.: :.:::.. .::..::::: .. .: . : ::: :: . .
XP_005 TDSEWPWIVSIQKNGTHHCAGSLLTSRWVITAAHCFKDNLNKPYLFSVLLGAWQLGNPGS
60 70 80 90 100 110
120 130 140 150 160
pF1KE4 DAKVSTLKDIIPHPSY-LQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCT
.. . . ::: : .::. .::::..: : : ::. . ::::: :. .: . ::
XP_005 RSQKVGVAWVEPHPVYSWKEGACADIALVRLERSIQFSERVLPICLPDASIHLPPNTHCW
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE4 VTGWGHVAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEG
..::: . .: : :. ::.:.::.:. :.:. :: : . : . :::.::::.::
XP_005 ISGWGSIQDGVPLPHPQTLQKLKVPIIDSEVCSHLYWRGAG-QGP--ITEDMLCAGYLEG
180 190 200 210 220 230
230 240 250 260 270 280
pF1KE4 GKDACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQ
.::: ::::::: : :.: : :.::.:::..:. ::::::: :.. ::... : .:
XP_005 ERDACLGDSGGPLMCQVDGAWLLAGIISWGEGCAERNRPGVYISLSAHRSWVEKIVQGVQ
240 250 260 270 280 290
290 300 310 320 330 340
pF1KE4 PRVVPQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
:
XP_005 LRGRAQGGGALRAPSQGSGAAARS
300 310
>>NP_071402 (OMIM: 609343) brain-specific serine proteas (317 aa)
initn: 841 init1: 335 opt: 823 Z-score: 943.1 bits: 182.8 E(85289): 8.6e-46
Smith-Waterman score: 823; 44.4% identity (67.9% similar) in 293 aa overlap (7-291:10-295)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAP----CGVAPQA--RITGGSSA
:: : ::. . :: :: : . ..:. : :: :: :..:: ..
NP_071 MVVSGAPPALGGGCLGTFTSLL---LLASTAILNAARIPVPPACG-KPQQLNRVVGGEDS
10 20 30 40 50
60 70 80 90 100 110
pF1KE4 VAGQWPWQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA-YEVKLGAHQLDSYSE
. ..::: ::: .:.: :.:::.. .::..::::: .. .: . : ::: :: . .
NP_071 TDSEWPWIVSIQKNGTHHCAGSLLTSRWVITAAHCFKDNLNKPYLFSVLLGAWQLGNPGS
60 70 80 90 100 110
120 130 140 150 160
pF1KE4 DAKVSTLKDIIPHPSY-LQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCT
.. . . ::: : .::. .::::..: : : ::. . ::::: :. .: . ::
NP_071 RSQKVGVAWVEPHPVYSWKEGACADIALVRLERSIQFSERVLPICLPDASIHLPPNTHCW
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE4 VTGWGHVAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEG
..::: . .: : :. ::.:.::.:. :.:. :: : . : . :::.::::.::
NP_071 ISGWGSIQDGVPLPHPQTLQKLKVPIIDSEVCSHLYWRGAG-QGP--ITEDMLCAGYLEG
180 190 200 210 220 230
230 240 250 260 270 280
pF1KE4 GKDACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQ
.::: ::::::: : :.: : :.::.:::..:. ::::::: :.. ::... : .:
NP_071 ERDACLGDSGGPLMCQVDGAWLLAGIISWGEGCAERNRPGVYISLSAHRSWVEKIVQGVQ
240 250 260 270 280 290
290 300 310 320 330 340
pF1KE4 PRVVPQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
:
NP_071 LRGRAQGGGALRAPSQGSGAAARS
300 310
>>XP_016878460 (OMIM: 610560) PREDICTED: polyserase-2 is (621 aa)
initn: 746 init1: 483 opt: 773 Z-score: 882.0 bits: 172.5 E(85289): 2.2e-42
Smith-Waterman score: 773; 42.0% identity (65.0% similar) in 283 aa overlap (26-302:27-309)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPW
: : : . :: :.:::.:::.: : :::
XP_016 MARHLLLPLVMLVISPIPGAFQDSALSPTQEEPEDLDCGRPEPSARIVGGSNAQPGTWPW
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE4 QVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA---YEVKLGAHQLDSYSEDAKVS
:::. . : :.:::::.. .::::::::: .. : . : ::.:. :. . :..
XP_016 QVSLHHGGGHICGGSLIAPSWVLSAAHCFMTNGTLEPAAEWSVLLGVHSQDGPLDGAHTR
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE4 TLKDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGH
.. :. .: : .:.:::.:. : ... . :.::: :. : .: : .::::
XP_016 AVAAIVVPANYSQVELGADLALLRLASPASLGPAVWPVCLPRASHRFVHGTACWATGWGD
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE4 VAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQ
: . : : ::..:. :... ::.:::. . . . :.:::: :: .:.::
XP_016 VQEADPLPLPWVLQEVELRLLGEATCQCLYSQPGPFNLTLQILPGMLCAGYPEGRRDTCQ
190 200 210 220 230 240
240 250 260 270 280 290
pF1KE4 GDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKV--TELQPRVV
::::::: : : :. .::.:.: .:: ::::::.: ...: .::. .: .: :
XP_016 GDSGGPLVCEEGGRWFQAGITSFGFGCGRRNRPGVFTAVATYEAWIREQVMGSEPGPAFP
250 260 270 280 290 300
300 310 320 330 340
pF1KE4 PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
: :..: :
XP_016 TQPQKTQSDPQEPREENCTIALPECGKAPRPGAWPWEAQVMVPGSRPCHGALVSESWVLA
310 320 330 340 350 360
>--
initn: 402 init1: 144 opt: 399 Z-score: 455.6 bits: 93.6 E(85289): 1.2e-18
Smith-Waterman score: 399; 30.4% identity (55.2% similar) in 306 aa overlap (31-316:312-593)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPWQ
: : : .: :. :.. : :::.
XP_016 YEAWIREQVMGSEPGPAFPTQPQKTQSDPQEPREENCTIALPEC----GKAPRPGAWPWE
290 300 310 320 330
60 70 80 90 100 110
pF1KE4 VSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQ--LDSYSEDAKVSTL
... : . : :.::::.:::. : :: . . ... : : . : : . .:. :
XP_016 AQVMVPGSRPCHGALVSESWVLAPASCFLDPNSSDSPPRDLDAWRVLLPSRPRAERVARL
340 350 360 370 380 390
120 130 140 150 160 170
pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA
. : . .... :.::::: :...: ::.::: . : : .: .. ::.
XP_016 ---VQHENASWDNAS-DLALLQLRTPVNLSAASRPVCLPHPEHYFLPGSRCRLARWGRGE
400 410 420 430 440 450
180 190 200 210 220 230
pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNID--AKP---EEPHFVQEDMVCAGYVEGGK-
:. : : : ::. :.. :.:::. . : : . :: .: .: : .
XP_016 PA---LGPGAL--LEAELLGGWWCHCLYGRQGAAVPLPGDPPH-----ALCPAYQEKEEV
460 470 480 490 500
240 250 260 270 280
pF1KE4 DACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTE---L
.: .:: : : :: :.:.:: .. ..: :: .. .... :: :.::. :
XP_016 GSCWNDSRWSLLCQEEGTWFLAGIRDFPSGCL---RPRAFFPLQTHGPWI-SHVTRGAYL
510 520 530 540 550
290 300 310 320 330 340
pF1KE4 QPRVV----PQTQESQPDSNLCGSHLA----FSSAPAQGLLRPILFLPLGLALGLLSPWL
. ... :. .:.. .. : : .:.::
XP_016 EDQLAWDWGPDGEETE--TQTCPPHTEHGDDLSTAPPVPDDGRVLDPRGHGCSREPGAVC
560 570 580 590 600 610
pF1KE4 SEH
XP_016 CHWS
620
>>XP_016878459 (OMIM: 610560) PREDICTED: polyserase-2 is (634 aa)
initn: 746 init1: 483 opt: 773 Z-score: 881.8 bits: 172.5 E(85289): 2.2e-42
Smith-Waterman score: 773; 42.0% identity (65.0% similar) in 283 aa overlap (26-302:27-309)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPW
: : : . :: :.:::.:::.: : :::
XP_016 MARHLLLPLVMLVISPIPGAFQDSALSPTQEEPEDLDCGRPEPSARIVGGSNAQPGTWPW
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE4 QVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA---YEVKLGAHQLDSYSEDAKVS
:::. . : :.:::::.. .::::::::: .. : . : ::.:. :. . :..
XP_016 QVSLHHGGGHICGGSLIAPSWVLSAAHCFMTNGTLEPAAEWSVLLGVHSQDGPLDGAHTR
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE4 TLKDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGH
.. :. .: : .:.:::.:. : ... . :.::: :. : .: : .::::
XP_016 AVAAIVVPANYSQVELGADLALLRLASPASLGPAVWPVCLPRASHRFVHGTACWATGWGD
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE4 VAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQ
: . : : ::..:. :... ::.:::. . . . :.:::: :: .:.::
XP_016 VQEADPLPLPWVLQEVELRLLGEATCQCLYSQPGPFNLTLQILPGMLCAGYPEGRRDTCQ
190 200 210 220 230 240
240 250 260 270 280 290
pF1KE4 GDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKV--TELQPRVV
::::::: : : :. .::.:.: .:: ::::::.: ...: .::. .: .: :
XP_016 GDSGGPLVCEEGGRWFQAGITSFGFGCGRRNRPGVFTAVATYEAWIREQVMGSEPGPAFP
250 260 270 280 290 300
300 310 320 330 340
pF1KE4 PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
: :..: :
XP_016 TQPQKTQSDPQEPREENCTIALPECGKAPRPGAWPWEAQVMVPGSRPCHGALVSESWVLA
310 320 330 340 350 360
>--
initn: 383 init1: 144 opt: 412 Z-score: 470.3 bits: 96.3 E(85289): 1.9e-19
Smith-Waterman score: 412; 29.6% identity (54.0% similar) in 328 aa overlap (31-342:312-606)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPWQ
: : : .: :. :.. : :::.
XP_016 YEAWIREQVMGSEPGPAFPTQPQKTQSDPQEPREENCTIALPEC----GKAPRPGAWPWE
290 300 310 320 330
60 70 80 90 100 110
pF1KE4 VSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQ--LDSYSEDAKVSTL
... : . : :.::::.:::. : :: . . ... : : . : : . .:. :
XP_016 AQVMVPGSRPCHGALVSESWVLAPASCFLDPNSSDSPPRDLDAWRVLLPSRPRAERVARL
340 350 360 370 380 390
120 130 140 150 160 170
pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA
. : . .... :.::::: :...: ::.::: . : : .: .. ::.
XP_016 ---VQHENASWDNAS-DLALLQLRTPVNLSAASRPVCLPHPEHYFLPGSRCRLARWGRGE
400 410 420 430 440 450
180 190 200 210 220 230
pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNID--AKP---EEPHFVQEDMVCAGYVEGGK-
:. : : : ::. :.. :.:::. . : : . :: .: .: : .
XP_016 PA---LGPGAL--LEAELLGGWWCHCLYGRQGAAVPLPGDPPH-----ALCPAYQEKEEV
460 470 480 490 500
240 250 260 270 280
pF1KE4 DACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTE---L
.: .:: : : :: :.:.:: .. ..: :: .. .... :: :.::. :
XP_016 GSCWNDSRWSLLCQEEGTWFLAGIRDFPSGCL---RPRAFFPLQTHGPWI-SHVTRGAYL
510 520 530 540 550
290 300 310 320 330 340
pF1KE4 QPRVV----PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
. ... :. .:.. .. : : .. . :. :.: :::.:
XP_016 EDQLAWDWGPDGEETETQT--CPPHTEHGACGLRLEAAPV---------GVLWPWLAEVH
560 570 580 590 600
XP_016 VAGDRVCTGILLAPGWVLAATHCVLR
610 620 630
>>XP_016878458 (OMIM: 610560) PREDICTED: polyserase-2 is (637 aa)
initn: 746 init1: 483 opt: 773 Z-score: 881.8 bits: 172.5 E(85289): 2.2e-42
Smith-Waterman score: 773; 42.0% identity (65.0% similar) in 283 aa overlap (26-302:27-309)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPW
: : : . :: :.:::.:::.: : :::
XP_016 MARHLLLPLVMLVISPIPGAFQDSALSPTQEEPEDLDCGRPEPSARIVGGSNAQPGTWPW
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE4 QVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA---YEVKLGAHQLDSYSEDAKVS
:::. . : :.:::::.. .::::::::: .. : . : ::.:. :. . :..
XP_016 QVSLHHGGGHICGGSLIAPSWVLSAAHCFMTNGTLEPAAEWSVLLGVHSQDGPLDGAHTR
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE4 TLKDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGH
.. :. .: : .:.:::.:. : ... . :.::: :. : .: : .::::
XP_016 AVAAIVVPANYSQVELGADLALLRLASPASLGPAVWPVCLPRASHRFVHGTACWATGWGD
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE4 VAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQ
: . : : ::..:. :... ::.:::. . . . :.:::: :: .:.::
XP_016 VQEADPLPLPWVLQEVELRLLGEATCQCLYSQPGPFNLTLQILPGMLCAGYPEGRRDTCQ
190 200 210 220 230 240
240 250 260 270 280 290
pF1KE4 GDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKV--TELQPRVV
::::::: : : :. .::.:.: .:: ::::::.: ...: .::. .: .: :
XP_016 GDSGGPLVCEEGGRWFQAGITSFGFGCGRRNRPGVFTAVATYEAWIREQVMGSEPGPAFP
250 260 270 280 290 300
300 310 320 330 340
pF1KE4 PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
: :..: :
XP_016 TQPQKTQSDPQEPREENCTIALPECGKAPRPGAWPWEAQVMVPGSRPCHGALVSESWVLA
310 320 330 340 350 360
>--
initn: 402 init1: 144 opt: 397 Z-score: 453.2 bits: 93.2 E(85289): 1.7e-18
Smith-Waterman score: 397; 30.0% identity (54.7% similar) in 307 aa overlap (31-321:312-594)
10 20 30 40 50
pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPWQ
: : : .: :. :.. : :::.
XP_016 YEAWIREQVMGSEPGPAFPTQPQKTQSDPQEPREENCTIALPEC----GKAPRPGAWPWE
290 300 310 320 330
60 70 80 90 100 110
pF1KE4 VSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQ--LDSYSEDAKVSTL
... : . : :.::::.:::. : :: . . ... : : . : : . .:. :
XP_016 AQVMVPGSRPCHGALVSESWVLAPASCFLDPNSSDSPPRDLDAWRVLLPSRPRAERVARL
340 350 360 370 380 390
120 130 140 150 160 170
pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA
. : . .... :.::::: :...: ::.::: . : : .: .. ::.
XP_016 ---VQHENASWDNAS-DLALLQLRTPVNLSAASRPVCLPHPEHYFLPGSRCRLARWGRGE
400 410 420 430 440 450
180 190 200 210 220 230
pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNID--AKP---EEPHFVQEDMVCAGYVEGGK-
:. : : : ::. :.. :.:::. . : : . :: .: .: : .
XP_016 PA---LGPGAL--LEAELLGGWWCHCLYGRQGAAVPLPGDPPH-----ALCPAYQEKEEV
460 470 480 490 500
240 250 260 270 280
pF1KE4 DACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTE---L
.: .:: : : :: :.:.:: .. ..: :: .. .... :: :.::. :
XP_016 GSCWNDSRWSLLCQEEGTWFLAGIRDFPSGCL---RPRAFFPLQTHGPWI-SHVTRGAYL
510 520 530 540 550
290 300 310 320 330 340
pF1KE4 QPRVV----PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH
. ... :. .:.. .. : : .. : :
XP_016 EDQLAWDWGPDGEETE--TQTCPPHTEHGGQALQQCLTLKCIWAGQGPAPSHRATRYPAW
560 570 580 590 600 610
XP_016 SSASGCPSTWDSGPPWPSWS
620 630
343 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 02:15:03 2016 done: Sun Nov 6 02:15:04 2016
Total Scan time: 8.100 Total Display time: 0.050
Function used was FASTA [36.3.4 Apr, 2011]