FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4333, 343 aa 1>>>pF1KE4333 343 - 343 aa - 343 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4094+/-0.000335; mu= 16.1485+/- 0.021 mean_var=76.9502+/-15.225, 0's: 0 Z-trim(116.4): 318 B-trim: 1093 in 2/50 Lambda= 0.146207 statistics sampled from 27204 (27524) to 27204 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.706), E-opt: 0.2 (0.323), width: 16 Scan time: 8.100 The best scores are: opt bits E(85289) NP_002764 (OMIM: 600823) prostasin preproprotein [ ( 343) 2374 510.0 3e-144 XP_011520753 (OMIM: 613797) PREDICTED: serine prot ( 280) 897 198.4 1.6e-50 XP_011520754 (OMIM: 613797) PREDICTED: serine prot ( 280) 897 198.4 1.6e-50 NP_690851 (OMIM: 613797) serine protease 33 precur ( 280) 897 198.4 1.6e-50 NP_114154 (OMIM: 608018) serine protease 27 isofor ( 290) 833 184.9 1.9e-46 XP_005255530 (OMIM: 609343) PREDICTED: brain-speci ( 317) 823 182.8 8.6e-46 NP_071402 (OMIM: 609343) brain-specific serine pro ( 317) 823 182.8 8.6e-46 XP_016878460 (OMIM: 610560) PREDICTED: polyserase- ( 621) 773 172.5 2.2e-42 XP_016878459 (OMIM: 610560) PREDICTED: polyserase- ( 634) 773 172.5 2.2e-42 XP_016878458 (OMIM: 610560) PREDICTED: polyserase- ( 637) 773 172.5 2.2e-42 NP_001245220 (OMIM: 610560) polyserase-2 isoform 3 ( 752) 773 172.5 2.5e-42 XP_016878457 (OMIM: 610560) PREDICTED: polyserase- ( 761) 773 172.5 2.6e-42 NP_001245219 (OMIM: 610560) polyserase-2 isoform 2 ( 850) 773 172.6 2.8e-42 NP_775773 (OMIM: 610560) polyserase-2 isoform 1 pr ( 855) 773 172.6 2.8e-42 NP_659205 (OMIM: 608159) testisin isoform 2 prepro ( 312) 738 164.9 2.1e-40 NP_006790 (OMIM: 608159) testisin isoform 1 prepro ( 314) 730 163.2 6.9e-40 NP_003285 (OMIM: 191080) tryptase alpha/beta-1 pre ( 275) 721 161.3 2.3e-39 NP_077078 (OMIM: 191081) tryptase beta-2 prepropro ( 275) 712 159.4 8.6e-39 NP_001275930 (OMIM: 206200,609862) transmembrane p ( 802) 709 159.0 3.1e-38 NP_705837 (OMIM: 206200,609862) transmembrane prot ( 811) 709 159.1 3.1e-38 NP_001275679 (OMIM: 606751) transmembrane protease ( 413) 688 154.4 4e-37 NP_001275680 (OMIM: 606751) transmembrane protease ( 448) 688 154.4 4.2e-37 NP_110397 (OMIM: 606751) transmembrane protease se ( 457) 688 154.4 4.3e-37 XP_016882221 (OMIM: 142440) PREDICTED: serine prot ( 393) 672 151.0 3.9e-36 NP_002142 (OMIM: 142440) serine protease hepsin pr ( 417) 672 151.0 4.1e-36 XP_006723244 (OMIM: 142440) PREDICTED: serine prot ( 417) 672 151.0 4.1e-36 NP_892028 (OMIM: 142440) serine protease hepsin pr ( 417) 672 151.0 4.1e-36 XP_005258895 (OMIM: 142440) PREDICTED: serine prot ( 417) 672 151.0 4.1e-36 XP_016882220 (OMIM: 142440) PREDICTED: serine prot ( 417) 672 151.0 4.1e-36 NP_001193718 (OMIM: 610050) transmembrane protease ( 532) 667 150.1 1e-35 NP_001070731 (OMIM: 610050) transmembrane protease ( 567) 667 150.1 1.1e-35 NP_001231924 (OMIM: 610050) transmembrane protease ( 563) 666 149.9 1.3e-35 NP_001519 (OMIM: 604552) hepatocyte growth factor ( 655) 648 146.1 2e-34 NP_001284368 (OMIM: 604552) hepatocyte growth fact ( 662) 648 146.1 2e-34 XP_011528035 (OMIM: 602060) PREDICTED: transmembra ( 384) 622 140.5 5.8e-33 XP_005261100 (OMIM: 602060) PREDICTED: transmembra ( 452) 622 140.5 6.6e-33 XP_011528033 (OMIM: 602060) PREDICTED: transmembra ( 492) 622 140.6 7.1e-33 NP_005647 (OMIM: 602060) transmembrane protease se ( 492) 622 140.6 7.1e-33 NP_001128571 (OMIM: 602060) transmembrane protease ( 529) 622 140.6 7.5e-33 NP_001034592 (OMIM: 610561) serine protease 53 pre ( 553) 613 138.7 2.9e-32 XP_011544122 (OMIM: 610561) PREDICTED: serine prot ( 567) 613 138.7 2.9e-32 XP_011544121 (OMIM: 610561) PREDICTED: serine prot ( 576) 613 138.7 3e-32 NP_001088 (OMIM: 102480) acrosin precursor [Homo s ( 421) 598 135.4 2.1e-31 XP_011535778 (OMIM: 227500,608446,613878) PREDICTE ( 364) 594 134.6 3.3e-31 NP_001254483 (OMIM: 227500,608446,613878) coagulat ( 382) 594 134.6 3.5e-31 XP_006720026 (OMIM: 227500,608446,613878) PREDICTE ( 412) 594 134.6 3.7e-31 XP_011535777 (OMIM: 227500,608446,613878) PREDICTE ( 433) 594 134.6 3.8e-31 NP_062562 (OMIM: 227500,608446,613878) coagulation ( 444) 594 134.6 3.9e-31 NP_000122 (OMIM: 227500,608446,613878) coagulation ( 466) 594 134.6 4.1e-31 XP_011535776 (OMIM: 227500,608446,613878) PREDICTE ( 495) 594 134.6 4.2e-31 >>NP_002764 (OMIM: 600823) prostasin preproprotein [Homo (343 aa) initn: 2374 init1: 2374 opt: 2374 Z-score: 2710.7 bits: 510.0 E(85289): 3e-144 Smith-Waterman score: 2374; 100.0% identity (100.0% similar) in 343 aa overlap (1-343:1-343) 10 20 30 40 50 60 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVAPQARITGGSSAVAGQWPWQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVAPQARITGGSSAVAGQWPWQV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 SITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTLKDI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 SITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTLKDI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 IPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVAPSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 IPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVAPSV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 SLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQGDSGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 SLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQGDSGG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 PLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQTQESQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 PLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQTQESQ 250 260 270 280 290 300 310 320 330 340 pF1KE4 PDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH ::::::::::::::::::::::::::::::::::::::::::: NP_002 PDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH 310 320 330 340 >>XP_011520753 (OMIM: 613797) PREDICTED: serine protease (280 aa) initn: 880 init1: 575 opt: 897 Z-score: 1028.2 bits: 198.4 E(85289): 1.6e-50 Smith-Waterman score: 897; 46.7% identity (71.7% similar) in 276 aa overlap (15-286:9-279) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAE-APCGVAPQ--ARITGGSSAVAGQWP : .:: :: ..:..: . : :: :. .::.:: .. :.:: XP_011 MRGVSCLQVLLLLVLG----AAGTQGRKSAACG-QPRMSSRIVGGRDGRDGEWP 10 20 30 40 60 70 80 90 100 110 pF1KE4 WQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTL ::.:: ..:.:::::::.. ::::.:::::: . :.:.::: .: : : . . XP_011 WQASIQHRGAHVCGGSLIAPQWVLTAAHCFPRRALPAEYRVRLGALRLGSTSPRTLSVPV 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA . .. :.: ..:..::.::::: ::. .: ..:.:::. .: : : : ::::: . XP_011 RRVLLPPDYSEDGARGDLALLQLRRPVPLSARVQPVCLPVPGARPPPGTPCRVTGWGSLR 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNIDAK-PEEPHFVQEDMVCAGYVEGGKDACQG :.: : .::: ..:::.. .::. ::.. : :. ..: .:::: .: :::::: XP_011 PGVPLPEWRPLQGVRVPLLDSRTCDGLYHVGADVPQAERIVLPGSLCAGYPQGHKDACQG 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE4 DSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQT ::::::.: : : :.:.:::: .:. ::::::: ...:. :::..:. XP_011 DSGGPLTCLQSGSWVLVGVVSWGKGCALPNRPGVYTSVATYSPWIQARVSF 230 240 250 260 270 280 300 310 320 330 340 pF1KE4 QESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH >>XP_011520754 (OMIM: 613797) PREDICTED: serine protease (280 aa) initn: 880 init1: 575 opt: 897 Z-score: 1028.2 bits: 198.4 E(85289): 1.6e-50 Smith-Waterman score: 897; 46.7% identity (71.7% similar) in 276 aa overlap (15-286:9-279) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAE-APCGVAPQ--ARITGGSSAVAGQWP : .:: :: ..:..: . : :: :. .::.:: .. :.:: XP_011 MRGVSCLQVLLLLVLG----AAGTQGRKSAACG-QPRMSSRIVGGRDGRDGEWP 10 20 30 40 60 70 80 90 100 110 pF1KE4 WQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTL ::.:: ..:.:::::::.. ::::.:::::: . :.:.::: .: : : . . XP_011 WQASIQHRGAHVCGGSLIAPQWVLTAAHCFPRRALPAEYRVRLGALRLGSTSPRTLSVPV 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA . .. :.: ..:..::.::::: ::. .: ..:.:::. .: : : : ::::: . XP_011 RRVLLPPDYSEDGARGDLALLQLRRPVPLSARVQPVCLPVPGARPPPGTPCRVTGWGSLR 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNIDAK-PEEPHFVQEDMVCAGYVEGGKDACQG :.: : .::: ..:::.. .::. ::.. : :. ..: .:::: .: :::::: XP_011 PGVPLPEWRPLQGVRVPLLDSRTCDGLYHVGADVPQAERIVLPGSLCAGYPQGHKDACQG 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE4 DSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQT ::::::.: : : :.:.:::: .:. ::::::: ...:. :::..:. XP_011 DSGGPLTCLQSGSWVLVGVVSWGKGCALPNRPGVYTSVATYSPWIQARVSF 230 240 250 260 270 280 300 310 320 330 340 pF1KE4 QESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH >>NP_690851 (OMIM: 613797) serine protease 33 precursor (280 aa) initn: 880 init1: 575 opt: 897 Z-score: 1028.2 bits: 198.4 E(85289): 1.6e-50 Smith-Waterman score: 897; 46.7% identity (71.7% similar) in 276 aa overlap (15-286:9-279) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAE-APCGVAPQ--ARITGGSSAVAGQWP : .:: :: ..:..: . : :: :. .::.:: .. :.:: NP_690 MRGVSCLQVLLLLVLG----AAGTQGRKSAACG-QPRMSSRIVGGRDGRDGEWP 10 20 30 40 60 70 80 90 100 110 pF1KE4 WQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTL ::.:: ..:.:::::::.. ::::.:::::: . :.:.::: .: : : . . NP_690 WQASIQHRGAHVCGGSLIAPQWVLTAAHCFPRRALPAEYRVRLGALRLGSTSPRTLSVPV 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA . .. :.: ..:..::.::::: ::. .: ..:.:::. .: : : : ::::: . NP_690 RRVLLPPDYSEDGARGDLALLQLRRPVPLSARVQPVCLPVPGARPPPGTPCRVTGWGSLR 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNIDAK-PEEPHFVQEDMVCAGYVEGGKDACQG :.: : .::: ..:::.. .::. ::.. : :. ..: .:::: .: :::::: NP_690 PGVPLPEWRPLQGVRVPLLDSRTCDGLYHVGADVPQAERIVLPGSLCAGYPQGHKDACQG 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE4 DSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQT ::::::.: : : :.:.:::: .:. ::::::: ...:. :::..:. NP_690 DSGGPLTCLQSGSWVLVGVVSWGKGCALPNRPGVYTSVATYSPWIQARVSF 230 240 250 260 270 280 300 310 320 330 340 pF1KE4 QESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH >>NP_114154 (OMIM: 608018) serine protease 27 isoform 1 (290 aa) initn: 811 init1: 516 opt: 833 Z-score: 955.0 bits: 184.9 E(85289): 1.9e-46 Smith-Waterman score: 833; 43.8% identity (69.0% similar) in 281 aa overlap (14-289:6-280) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVAPQA--RITGGSSAVAGQWPW :: .:: :: :. : . :: :. :..::... :.::: NP_114 MRRPAAVPLLL---LLCFGSQRAKAATACG-RPRMLNRMVGGQDTQEGEWPW 10 20 30 40 60 70 80 90 100 110 pF1KE4 QVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQLDSYSEDAKVSTLK :::: .: : :::::..:::::.::::: . . :.: :::.:: . . : . .. NP_114 QVSIQRNGSHFCGGSLIAEQWVLTAAHCFRNTSETSLYQVLLGARQLVQPGPHAMYARVR 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 DIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVAP .. .: : .:..:.::..: :. :. :: :.::: .. : .:..: ::::: .: NP_114 QVESNPLYQGTASSADVALVELEAPVPFTNYILPVCLPDPSVIFETGMNCWVTGWG--SP 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 SVSLLTPKP--LQQLEVPLISRETCNCLYNIDAK-PEEPHFVQEDMVCAGYVEGGKDACQ : : :.: ::.: ::.:. :: ::. :.. .:. ...::.:::. :: ::::. NP_114 SEEDLLPEPRILQKLAVPIIDTPKCNLLYSKDTEFGYQPKTIKNDMLCAGFEEGKKDACK 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE4 GDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQPRVVPQ ::::::: : : : .:..:::..:. .:::::: .... .::. . .:: NP_114 GDSGGPLVCLVGQSWLQAGVISWGEGCARQNRPGVYIRVTAHHNWIHRIIPKLQFQPARL 230 240 250 260 270 280 300 310 320 330 340 pF1KE4 TQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH NP_114 GGQK 290 >>XP_005255530 (OMIM: 609343) PREDICTED: brain-specific (317 aa) initn: 841 init1: 335 opt: 823 Z-score: 943.1 bits: 182.8 E(85289): 8.6e-46 Smith-Waterman score: 823; 44.4% identity (67.9% similar) in 293 aa overlap (7-291:10-295) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAP----CGVAPQA--RITGGSSA :: : ::. . :: :: : . ..:. : :: :: :..:: .. XP_005 MVVSGAPPALGGGCLGTFTSLL---LLASTAILNAARIPVPPACG-KPQQLNRVVGGEDS 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 VAGQWPWQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA-YEVKLGAHQLDSYSE . ..::: ::: .:.: :.:::.. .::..::::: .. .: . : ::: :: . . XP_005 TDSEWPWIVSIQKNGTHHCAGSLLTSRWVITAAHCFKDNLNKPYLFSVLLGAWQLGNPGS 60 70 80 90 100 110 120 130 140 150 160 pF1KE4 DAKVSTLKDIIPHPSY-LQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCT .. . . ::: : .::. .::::..: : : ::. . ::::: :. .: . :: XP_005 RSQKVGVAWVEPHPVYSWKEGACADIALVRLERSIQFSERVLPICLPDASIHLPPNTHCW 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 VTGWGHVAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEG ..::: . .: : :. ::.:.::.:. :.:. :: : . : . :::.::::.:: XP_005 ISGWGSIQDGVPLPHPQTLQKLKVPIIDSEVCSHLYWRGAG-QGP--ITEDMLCAGYLEG 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE4 GKDACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQ .::: ::::::: : :.: : :.::.:::..:. ::::::: :.. ::... : .: XP_005 ERDACLGDSGGPLMCQVDGAWLLAGIISWGEGCAERNRPGVYISLSAHRSWVEKIVQGVQ 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE4 PRVVPQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH : XP_005 LRGRAQGGGALRAPSQGSGAAARS 300 310 >>NP_071402 (OMIM: 609343) brain-specific serine proteas (317 aa) initn: 841 init1: 335 opt: 823 Z-score: 943.1 bits: 182.8 E(85289): 8.6e-46 Smith-Waterman score: 823; 44.4% identity (67.9% similar) in 293 aa overlap (7-291:10-295) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAP----CGVAPQA--RITGGSSA :: : ::. . :: :: : . ..:. : :: :: :..:: .. NP_071 MVVSGAPPALGGGCLGTFTSLL---LLASTAILNAARIPVPPACG-KPQQLNRVVGGEDS 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 VAGQWPWQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA-YEVKLGAHQLDSYSE . ..::: ::: .:.: :.:::.. .::..::::: .. .: . : ::: :: . . NP_071 TDSEWPWIVSIQKNGTHHCAGSLLTSRWVITAAHCFKDNLNKPYLFSVLLGAWQLGNPGS 60 70 80 90 100 110 120 130 140 150 160 pF1KE4 DAKVSTLKDIIPHPSY-LQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCT .. . . ::: : .::. .::::..: : : ::. . ::::: :. .: . :: NP_071 RSQKVGVAWVEPHPVYSWKEGACADIALVRLERSIQFSERVLPICLPDASIHLPPNTHCW 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 VTGWGHVAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEG ..::: . .: : :. ::.:.::.:. :.:. :: : . : . :::.::::.:: NP_071 ISGWGSIQDGVPLPHPQTLQKLKVPIIDSEVCSHLYWRGAG-QGP--ITEDMLCAGYLEG 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE4 GKDACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTELQ .::: ::::::: : :.: : :.::.:::..:. ::::::: :.. ::... : .: NP_071 ERDACLGDSGGPLMCQVDGAWLLAGIISWGEGCAERNRPGVYISLSAHRSWVEKIVQGVQ 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE4 PRVVPQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH : NP_071 LRGRAQGGGALRAPSQGSGAAARS 300 310 >>XP_016878460 (OMIM: 610560) PREDICTED: polyserase-2 is (621 aa) initn: 746 init1: 483 opt: 773 Z-score: 882.0 bits: 172.5 E(85289): 2.2e-42 Smith-Waterman score: 773; 42.0% identity (65.0% similar) in 283 aa overlap (26-302:27-309) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPW : : : . :: :.:::.:::.: : ::: XP_016 MARHLLLPLVMLVISPIPGAFQDSALSPTQEEPEDLDCGRPEPSARIVGGSNAQPGTWPW 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 QVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA---YEVKLGAHQLDSYSEDAKVS :::. . : :.:::::.. .::::::::: .. : . : ::.:. :. . :.. XP_016 QVSLHHGGGHICGGSLIAPSWVLSAAHCFMTNGTLEPAAEWSVLLGVHSQDGPLDGAHTR 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 TLKDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGH .. :. .: : .:.:::.:. : ... . :.::: :. : .: : .:::: XP_016 AVAAIVVPANYSQVELGADLALLRLASPASLGPAVWPVCLPRASHRFVHGTACWATGWGD 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 VAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQ : . : : ::..:. :... ::.:::. . . . :.:::: :: .:.:: XP_016 VQEADPLPLPWVLQEVELRLLGEATCQCLYSQPGPFNLTLQILPGMLCAGYPEGRRDTCQ 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE4 GDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKV--TELQPRVV ::::::: : : :. .::.:.: .:: ::::::.: ...: .::. .: .: : XP_016 GDSGGPLVCEEGGRWFQAGITSFGFGCGRRNRPGVFTAVATYEAWIREQVMGSEPGPAFP 250 260 270 280 290 300 300 310 320 330 340 pF1KE4 PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH : :..: : XP_016 TQPQKTQSDPQEPREENCTIALPECGKAPRPGAWPWEAQVMVPGSRPCHGALVSESWVLA 310 320 330 340 350 360 >-- initn: 402 init1: 144 opt: 399 Z-score: 455.6 bits: 93.6 E(85289): 1.2e-18 Smith-Waterman score: 399; 30.4% identity (55.2% similar) in 306 aa overlap (31-316:312-593) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPWQ : : : .: :. :.. : :::. XP_016 YEAWIREQVMGSEPGPAFPTQPQKTQSDPQEPREENCTIALPEC----GKAPRPGAWPWE 290 300 310 320 330 60 70 80 90 100 110 pF1KE4 VSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQ--LDSYSEDAKVSTL ... : . : :.::::.:::. : :: . . ... : : . : : . .:. : XP_016 AQVMVPGSRPCHGALVSESWVLAPASCFLDPNSSDSPPRDLDAWRVLLPSRPRAERVARL 340 350 360 370 380 390 120 130 140 150 160 170 pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA . : . .... :.::::: :...: ::.::: . : : .: .. ::. XP_016 ---VQHENASWDNAS-DLALLQLRTPVNLSAASRPVCLPHPEHYFLPGSRCRLARWGRGE 400 410 420 430 440 450 180 190 200 210 220 230 pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNID--AKP---EEPHFVQEDMVCAGYVEGGK- :. : : : ::. :.. :.:::. . : : . :: .: .: : . XP_016 PA---LGPGAL--LEAELLGGWWCHCLYGRQGAAVPLPGDPPH-----ALCPAYQEKEEV 460 470 480 490 500 240 250 260 270 280 pF1KE4 DACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTE---L .: .:: : : :: :.:.:: .. ..: :: .. .... :: :.::. : XP_016 GSCWNDSRWSLLCQEEGTWFLAGIRDFPSGCL---RPRAFFPLQTHGPWI-SHVTRGAYL 510 520 530 540 550 290 300 310 320 330 340 pF1KE4 QPRVV----PQTQESQPDSNLCGSHLA----FSSAPAQGLLRPILFLPLGLALGLLSPWL . ... :. .:.. .. : : .:.:: XP_016 EDQLAWDWGPDGEETE--TQTCPPHTEHGDDLSTAPPVPDDGRVLDPRGHGCSREPGAVC 560 570 580 590 600 610 pF1KE4 SEH XP_016 CHWS 620 >>XP_016878459 (OMIM: 610560) PREDICTED: polyserase-2 is (634 aa) initn: 746 init1: 483 opt: 773 Z-score: 881.8 bits: 172.5 E(85289): 2.2e-42 Smith-Waterman score: 773; 42.0% identity (65.0% similar) in 283 aa overlap (26-302:27-309) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPW : : : . :: :.:::.:::.: : ::: XP_016 MARHLLLPLVMLVISPIPGAFQDSALSPTQEEPEDLDCGRPEPSARIVGGSNAQPGTWPW 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 QVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA---YEVKLGAHQLDSYSEDAKVS :::. . : :.:::::.. .::::::::: .. : . : ::.:. :. . :.. XP_016 QVSLHHGGGHICGGSLIAPSWVLSAAHCFMTNGTLEPAAEWSVLLGVHSQDGPLDGAHTR 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 TLKDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGH .. :. .: : .:.:::.:. : ... . :.::: :. : .: : .:::: XP_016 AVAAIVVPANYSQVELGADLALLRLASPASLGPAVWPVCLPRASHRFVHGTACWATGWGD 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 VAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQ : . : : ::..:. :... ::.:::. . . . :.:::: :: .:.:: XP_016 VQEADPLPLPWVLQEVELRLLGEATCQCLYSQPGPFNLTLQILPGMLCAGYPEGRRDTCQ 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE4 GDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKV--TELQPRVV ::::::: : : :. .::.:.: .:: ::::::.: ...: .::. .: .: : XP_016 GDSGGPLVCEEGGRWFQAGITSFGFGCGRRNRPGVFTAVATYEAWIREQVMGSEPGPAFP 250 260 270 280 290 300 300 310 320 330 340 pF1KE4 PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH : :..: : XP_016 TQPQKTQSDPQEPREENCTIALPECGKAPRPGAWPWEAQVMVPGSRPCHGALVSESWVLA 310 320 330 340 350 360 >-- initn: 383 init1: 144 opt: 412 Z-score: 470.3 bits: 96.3 E(85289): 1.9e-19 Smith-Waterman score: 412; 29.6% identity (54.0% similar) in 328 aa overlap (31-342:312-606) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPWQ : : : .: :. :.. : :::. XP_016 YEAWIREQVMGSEPGPAFPTQPQKTQSDPQEPREENCTIALPEC----GKAPRPGAWPWE 290 300 310 320 330 60 70 80 90 100 110 pF1KE4 VSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQ--LDSYSEDAKVSTL ... : . : :.::::.:::. : :: . . ... : : . : : . .:. : XP_016 AQVMVPGSRPCHGALVSESWVLAPASCFLDPNSSDSPPRDLDAWRVLLPSRPRAERVARL 340 350 360 370 380 390 120 130 140 150 160 170 pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA . : . .... :.::::: :...: ::.::: . : : .: .. ::. XP_016 ---VQHENASWDNAS-DLALLQLRTPVNLSAASRPVCLPHPEHYFLPGSRCRLARWGRGE 400 410 420 430 440 450 180 190 200 210 220 230 pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNID--AKP---EEPHFVQEDMVCAGYVEGGK- :. : : : ::. :.. :.:::. . : : . :: .: .: : . XP_016 PA---LGPGAL--LEAELLGGWWCHCLYGRQGAAVPLPGDPPH-----ALCPAYQEKEEV 460 470 480 490 500 240 250 260 270 280 pF1KE4 DACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTE---L .: .:: : : :: :.:.:: .. ..: :: .. .... :: :.::. : XP_016 GSCWNDSRWSLLCQEEGTWFLAGIRDFPSGCL---RPRAFFPLQTHGPWI-SHVTRGAYL 510 520 530 540 550 290 300 310 320 330 340 pF1KE4 QPRVV----PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH . ... :. .:.. .. : : .. . :. :.: :::.: XP_016 EDQLAWDWGPDGEETETQT--CPPHTEHGACGLRLEAAPV---------GVLWPWLAEVH 560 570 580 590 600 XP_016 VAGDRVCTGILLAPGWVLAATHCVLR 610 620 630 >>XP_016878458 (OMIM: 610560) PREDICTED: polyserase-2 is (637 aa) initn: 746 init1: 483 opt: 773 Z-score: 881.8 bits: 172.5 E(85289): 2.2e-42 Smith-Waterman score: 773; 42.0% identity (65.0% similar) in 283 aa overlap (26-302:27-309) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPW : : : . :: :.:::.:::.: : ::: XP_016 MARHLLLPLVMLVISPIPGAFQDSALSPTQEEPEDLDCGRPEPSARIVGGSNAQPGTWPW 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 QVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEA---YEVKLGAHQLDSYSEDAKVS :::. . : :.:::::.. .::::::::: .. : . : ::.:. :. . :.. XP_016 QVSLHHGGGHICGGSLIAPSWVLSAAHCFMTNGTLEPAAEWSVLLGVHSQDGPLDGAHTR 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 TLKDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGH .. :. .: : .:.:::.:. : ... . :.::: :. : .: : .:::: XP_016 AVAAIVVPANYSQVELGADLALLRLASPASLGPAVWPVCLPRASHRFVHGTACWATGWGD 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 VAPSVSLLTPKPLQQLEVPLISRETCNCLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQ : . : : ::..:. :... ::.:::. . . . :.:::: :: .:.:: XP_016 VQEADPLPLPWVLQEVELRLLGEATCQCLYSQPGPFNLTLQILPGMLCAGYPEGRRDTCQ 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE4 GDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKV--TELQPRVV ::::::: : : :. .::.:.: .:: ::::::.: ...: .::. .: .: : XP_016 GDSGGPLVCEEGGRWFQAGITSFGFGCGRRNRPGVFTAVATYEAWIREQVMGSEPGPAFP 250 260 270 280 290 300 300 310 320 330 340 pF1KE4 PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH : :..: : XP_016 TQPQKTQSDPQEPREENCTIALPECGKAPRPGAWPWEAQVMVPGSRPCHGALVSESWVLA 310 320 330 340 350 360 >-- initn: 402 init1: 144 opt: 397 Z-score: 453.2 bits: 93.2 E(85289): 1.7e-18 Smith-Waterman score: 397; 30.0% identity (54.7% similar) in 307 aa overlap (31-321:312-594) 10 20 30 40 50 pF1KE4 MAQKGVLGPGQLGAVAILLYLGLLRSGTGAEGAEAPCGVA-PQARITGGSSAVAGQWPWQ : : : .: :. :.. : :::. XP_016 YEAWIREQVMGSEPGPAFPTQPQKTQSDPQEPREENCTIALPEC----GKAPRPGAWPWE 290 300 310 320 330 60 70 80 90 100 110 pF1KE4 VSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYEVKLGAHQ--LDSYSEDAKVSTL ... : . : :.::::.:::. : :: . . ... : : . : : . .:. : XP_016 AQVMVPGSRPCHGALVSESWVLAPASCFLDPNSSDSPPRDLDAWRVLLPSRPRAERVARL 340 350 360 370 380 390 120 130 140 150 160 170 pF1KE4 KDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVA . : . .... :.::::: :...: ::.::: . : : .: .. ::. XP_016 ---VQHENASWDNAS-DLALLQLRTPVNLSAASRPVCLPHPEHYFLPGSRCRLARWGRGE 400 410 420 430 440 450 180 190 200 210 220 230 pF1KE4 PSVSLLTPKPLQQLEVPLISRETCNCLYNID--AKP---EEPHFVQEDMVCAGYVEGGK- :. : : : ::. :.. :.:::. . : : . :: .: .: : . XP_016 PA---LGPGAL--LEAELLGGWWCHCLYGRQGAAVPLPGDPPH-----ALCPAYQEKEEV 460 470 480 490 500 240 250 260 270 280 pF1KE4 DACQGDSGGPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKVTE---L .: .:: : : :: :.:.:: .. ..: :: .. .... :: :.::. : XP_016 GSCWNDSRWSLLCQEEGTWFLAGIRDFPSGCL---RPRAFFPLQTHGPWI-SHVTRGAYL 510 520 530 540 550 290 300 310 320 330 340 pF1KE4 QPRVV----PQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGLALGLLSPWLSEH . ... :. .:.. .. : : .. : : XP_016 EDQLAWDWGPDGEETE--TQTCPPHTEHGGQALQQCLTLKCIWAGQGPAPSHRATRYPAW 560 570 580 590 600 610 XP_016 SSASGCPSTWDSGPPWPSWS 620 630 343 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 02:15:03 2016 done: Sun Nov 6 02:15:04 2016 Total Scan time: 8.100 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]