FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6356, 321 aa 1>>>pF1KE6356 321 - 321 aa - 321 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3643+/-0.000614; mu= 16.7037+/- 0.037 mean_var=81.8906+/-16.201, 0's: 0 Z-trim(114.1): 172 B-trim: 2 in 1/51 Lambda= 0.141729 statistics sampled from 14495 (14671) to 14495 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.796), E-opt: 0.2 (0.451), width: 16 Scan time: 2.300 The best scores are: opt bits E(32554) CCDS10430.1 TPSG1 gene_id:25823|Hs108|chr16 ( 321) 2278 474.7 4.3e-134 CCDS10431.1 TPSAB1 gene_id:7177|Hs108|chr16 ( 275) 817 175.9 3.2e-44 CCDS10481.1 PRSS22 gene_id:64063|Hs108|chr16 ( 317) 778 168.0 9.1e-42 CCDS58185.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 563) 717 155.7 8e-38 CCDS55788.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 532) 704 153.0 4.8e-37 CCDS41721.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 567) 704 153.0 5.1e-37 CCDS74856.1 TMPRSS6 gene_id:164656|Hs108|chr22 ( 802) 699 152.1 1.3e-36 CCDS13941.1 TMPRSS6 gene_id:164656|Hs108|chr22 ( 811) 699 152.1 1.4e-36 CCDS34120.1 KLKB1 gene_id:3818|Hs108|chr4 ( 638) 687 149.6 6.2e-36 CCDS5279.1 PLG gene_id:5340|Hs108|chr6 ( 810) 672 146.6 6.2e-35 CCDS74669.1 PRSS56 gene_id:646960|Hs108|chr2 ( 603) 669 145.9 7.6e-35 CCDS47145.1 PRSS48 gene_id:345062|Hs108|chr4 ( 328) 660 143.8 1.7e-34 CCDS73391.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 413) 651 142.1 7.3e-34 CCDS73392.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 448) 651 142.1 7.8e-34 CCDS44735.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 457) 651 142.1 7.9e-34 CCDS1563.1 PRSS38 gene_id:339501|Hs108|chr1 ( 326) 648 141.4 9.3e-34 CCDS13686.1 TMPRSS3 gene_id:64699|Hs108|chr21 ( 454) 647 141.3 1.4e-33 CCDS58790.1 TMPRSS3 gene_id:64699|Hs108|chr21 ( 453) 646 141.1 1.6e-33 CCDS3521.1 TMPRSS11B gene_id:132724|Hs108|chr4 ( 416) 645 140.9 1.7e-33 CCDS76482.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 290) 642 140.1 2e-33 CCDS43129.2 TMPRSS7 gene_id:344805|Hs108|chr3 ( 717) 646 141.3 2.3e-33 CCDS53717.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 397) 642 140.2 2.5e-33 CCDS3518.1 TMPRSS11D gene_id:9407|Hs108|chr4 ( 418) 642 140.3 2.6e-33 CCDS44743.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 432) 642 140.3 2.7e-33 CCDS53716.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 435) 642 140.3 2.7e-33 CCDS31684.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 437) 642 140.3 2.7e-33 CCDS33564.1 TMPRSS2 gene_id:7113|Hs108|chr21 ( 492) 639 139.7 4.6e-33 CCDS54486.1 TMPRSS2 gene_id:7113|Hs108|chr21 ( 529) 639 139.7 4.8e-33 CCDS12088.1 TMPRSS9 gene_id:360200|Hs108|chr19 (1059) 637 139.6 1.1e-32 CCDS10432.1 TPSD1 gene_id:23430|Hs108|chr16 ( 242) 622 136.0 3e-32 CCDS47065.1 TMPRSS11A gene_id:339967|Hs108|chr4 ( 418) 617 135.1 9.1e-32 CCDS3519.1 TMPRSS11A gene_id:339967|Hs108|chr4 ( 421) 617 135.1 9.2e-32 CCDS33993.1 TMPRSS11E gene_id:28983|Hs108|chr4 ( 423) 613 134.3 1.6e-31 CCDS3847.1 F11 gene_id:2160|Hs108|chr4 ( 625) 615 134.9 1.6e-31 CCDS157.1 CELA2A gene_id:63036|Hs108|chr1 ( 269) 576 126.6 2.2e-29 CCDS10852.1 CTRL gene_id:1506|Hs108|chr16 ( 264) 569 125.2 5.8e-29 CCDS3520.1 TMPRSS11F gene_id:389208|Hs108|chr4 ( 438) 570 125.5 7.4e-29 CCDS32490.1 CTRB1 gene_id:1504|Hs108|chr16 ( 263) 567 124.8 7.7e-29 CCDS75122.1 CORIN gene_id:10699|Hs108|chr4 ( 938) 568 125.4 1.7e-28 CCDS3477.1 CORIN gene_id:10699|Hs108|chr4 (1042) 568 125.4 1.9e-28 CCDS10476.1 PRSS27 gene_id:83886|Hs108|chr16 ( 290) 560 123.4 2.2e-28 CCDS46816.1 PRSS42 gene_id:339906|Hs108|chr3 ( 293) 558 123.0 3e-28 CCDS32489.1 CTRB2 gene_id:440387|Hs108|chr16 ( 263) 556 122.5 3.6e-28 CCDS10478.1 PRSS21 gene_id:10942|Hs108|chr16 ( 314) 556 122.6 4.2e-28 CCDS5976.1 PRSS55 gene_id:203074|Hs108|chr8 ( 352) 550 121.4 1.1e-27 CCDS58452.1 PRSS36 gene_id:146547|Hs108|chr16 ( 752) 553 122.3 1.2e-27 CCDS58453.1 PRSS36 gene_id:146547|Hs108|chr16 ( 850) 553 122.3 1.4e-27 CCDS32436.1 PRSS36 gene_id:146547|Hs108|chr16 ( 855) 553 122.3 1.4e-27 CCDS30605.1 CELA2B gene_id:51032|Hs108|chr1 ( 269) 542 119.7 2.7e-27 CCDS13571.1 TMPRSS15 gene_id:5651|Hs108|chr21 (1019) 547 121.1 3.6e-27 >>CCDS10430.1 TPSG1 gene_id:25823|Hs108|chr16 (321 aa) initn: 2278 init1: 2278 opt: 2278 Z-score: 2520.9 bits: 474.7 E(32554): 4.3e-134 Smith-Waterman score: 2278; 99.7% identity (99.7% similar) in 321 aa overlap (1-321:1-321) 10 20 30 40 50 60 pF1KE6 MALGACGLLLLLAVPGVSLRTLQPGCGRPQVSDAGGRIVGGHAAPAGAWPWQASLRLRRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MALGACGLLLLLAVPGVSLRTLQPGCGRPQVSDAGGRIVGGHAAPAGAWPWQASLRLRRV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 HVCGGSLLSPQWVLTAAHCFSGSLNSSDYQVHLGELEITLSPHFSTVRQIILHSSPSGQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 HVCGGSLLSPQWVLTAAHCFSGSLNSSDYQVHLGELEITLSPHFSTVRQIILHSSPSGQP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 GTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRCWVTGWGYTREGEPLPPPYSLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 GTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRCWVTGWGYTREGEPLPPPYSLR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 EVKVSVVDTETCRRDYPGPGGSILQPDMLCARGPGDACQDDSGGPLVCQVNGAWVQAGIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: : CCDS10 EVKVSVVDTETCRRDYPGPGGSILQPDMLCARGPGDACQDDSGGPLVCQVNGAWVQAGTV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 SWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGSESGYPRLPLLAGFFLPGLFLLLVSC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 SWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGSESGYPRLPLLAGFFLPGLFLLLVSC 250 260 270 280 290 300 310 320 pF1KE6 VLLAKCLLHPSADGTPFPAPD ::::::::::::::::::::: CCDS10 VLLAKCLLHPSADGTPFPAPD 310 320 >>CCDS10431.1 TPSAB1 gene_id:7177|Hs108|chr16 (275 aa) initn: 725 init1: 311 opt: 817 Z-score: 907.3 bits: 175.9 E(32554): 3.2e-44 Smith-Waterman score: 817; 46.9% identity (72.2% similar) in 273 aa overlap (9-269:4-271) 10 20 30 40 50 pF1KE6 MALGACGLLLLLAVPGVSLRTLQ-PGCGRPQVSDAGGRIVGGHAAPAGAWPWQASLRLRR :::::.: .. :. :. :. .. .: ::::. :: . ::::.:::.. CCDS10 MLNLLLLALPVLASRAYAAPAPGQA-LQRVG--IVGGQEAPRSKWPWQVSLRVHG 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 ---VHVCGGSLLSPQWVLTAAHCFSGSLNS-SDYQVHLGELEITLSPHFSTVRQIILHSS .: :::::. :::::::::: . .... . .:.: : .. . .. : .::.: . CCDS10 PYWMHFCGGSLIHPQWVLTAAHCVGPDVKDLAALRVQLREQHLYYQDQLLPVSRIIVHPQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 -PSGQPGTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRCWVTGWGYTREGEPLP ..: : .::::.:: ::..::.. : :: ::. : ::. ::::::: . . : :: CCDS10 FYTAQIG--ADIALLELEEPVNVSSHVHTVTLPPASETFPPGMPCWVTGWGDVDNDERLP 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 PPYSLREVKVSVVDTETCRRDY-----PGPGGSILQPDMLCARGPG-DACQDDSGGPLVC ::. :..::: ..... : : : :.. ::::: . :.:: :::::::: CCDS10 PPFPLKQVKVPIMENHICDAKYHLGAYTGDDVRIVRDDMLCAGNTRRDSCQGDSGGPLVC 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE6 QVNGAWVQAGIVSWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGSESGYPRLPLLAGF .:::.:.:::.:::::::..:::::.:::: :..::.... CCDS10 KVNGTWLQAGVVSWGEGCAQPNRPGIYTRVTYYLDWIHHYVPKKP 240 250 260 270 290 300 310 320 pF1KE6 FLPGLFLLLVSCVLLAKCLLHPSADGTPFPAPD >>CCDS10481.1 PRSS22 gene_id:64063|Hs108|chr16 (317 aa) initn: 717 init1: 266 opt: 778 Z-score: 863.3 bits: 168.0 E(32554): 9.1e-42 Smith-Waterman score: 778; 42.6% identity (67.9% similar) in 277 aa overlap (3-267:15-287) 10 20 30 40 pF1KE6 MALGACGLLLLLAVPGV--SLRT-LQPGCGRPQVSDAGGRIVGGHAAP ::. ::::: .. . : . :.::.:: . :.:::. . CCDS10 MVVSGAPPALGGGCLGTFTSLLLLASTAILNAARIPVPPACGKPQQLN---RVVGGEDST 10 20 30 40 50 50 60 70 80 90 100 pF1KE6 AGAWPWQASLRLRRVHVCGGSLLSPQWVLTAAHCFSGSLNSSD-YQVHLGELEITLSPHF . ::: .:.. .: :.::::. .::.::::::. .::. ..: :: .. .: CCDS10 DSEWPWIVSIQKNGTHHCAGSLLTSRWVITAAHCFKDNLNKPYLFSVLLGAWQLG-NPGS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE6 ST----VRQIILHSSPSGQPGTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRCW . : . : : . :. .:::::.: . .: :.::.:::.:: . :. .:: CCDS10 RSQKVGVAWVEPHPVYSWKEGACADIALVRLERSIQFSERVLPICLPDASIHLPPNTHCW 120 130 140 150 160 170 170 180 190 200 210 pF1KE6 VTGWGYTREGEPLPPPYSLREVKVSVVDTETCRRDY-PGPGGSILQPDMLCA---RGPGD ..::: ..: ::: : .:...:: ..:.:.: . : : : . . ::::: .: : CCDS10 ISGWGSIQDGVPLPHPQTLQKLKVPIIDSEVCSHLYWRGAGQGPITEDMLCAGYLEGERD 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE6 ACQDDSGGPLVCQVNGAWVQAGIVSWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGSE :: ::::::.:::.:::. :::.::::::.. :::::: . :. .:... CCDS10 ACLGDSGGPLMCQVDGAWLLAGIISWGEGCAERNRPGVYISLSAHRSWVEKIVQGVQLRG 240 250 260 270 280 290 280 290 300 310 320 pF1KE6 SGYPRLPLLAGFFLPGLFLLLVSCVLLAKCLLHPSADGTPFPAPD CCDS10 RAQGGGALRAPSQGSGAAARS 300 310 >>CCDS58185.1 TMPRSS13 gene_id:84000|Hs108|chr11 (563 aa) initn: 575 init1: 368 opt: 717 Z-score: 792.5 bits: 155.7 E(32554): 8e-38 Smith-Waterman score: 717; 42.4% identity (66.8% similar) in 262 aa overlap (18-274:306-563) 10 20 30 40 pF1KE6 MALGACGLLLLLAVPGVSLRTLQPGCGRPQVSDAGGRIVGGHAAPAG : : .. :.. . :::::: : . CCDS58 EVAHRDFANSFSILRYNSTIQESLHRSECPSQRYISLQCSHCGLRAMTGRIVGGALASDS 280 290 300 310 320 330 50 60 70 80 90 100 pF1KE6 AWPWQASLRLRRVHVCGGSLLSPQWVLTAAHCFSGSLNS--SDYQVHLGELEITLSPHFS ::::.::.. .:.:::.:.. :::::::::: . .. ..:. : .. :. . CCDS58 KWPWQVSLHFGTTHICGGTLIDAQWVLTAAHCFFVTREKVLEGWKVYAGTSNLHQLPEAA 340 350 360 370 380 390 110 120 130 140 150 160 pF1KE6 TVRQIILHSSPSGQPGTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRCWVTGWG .. .::..:. . . ::::..:: :.:::..: :.::: .. : . ::.::.: CCDS58 SIAEIIINSNYTDEEDDY-DIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFG 400 410 420 430 440 450 170 180 190 200 210 220 pF1KE6 YTREGEPLPPPYSLREVKVSVVDTETCRRDYPGPGGSILQPDMLCA---RGPGDACQDDS ::: . :. ::::.:...: . : :: : : : :.:: :: :.:: :: CCDS58 KTRETDDKTSPF-LREVQVNLIDFKKCN-DYL-VYDSYLTPRMMCAGDLRGGRDSCQGDS 460 470 480 490 500 510 230 240 250 260 270 280 pF1KE6 GGPLVCQVNGAWVQAGIVSWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGSESGYPRL ::::::. :. : ::..::: :::. :.:::::.: . :: .. .:.: CCDS58 GGPLVCEQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSKMESSAG 520 530 540 550 560 290 300 310 320 pF1KE6 PLLAGFFLPGLFLLLVSCVLLAKCLLHPSADGTPFPAPD >>CCDS55788.1 TMPRSS13 gene_id:84000|Hs108|chr11 (532 aa) initn: 562 init1: 355 opt: 704 Z-score: 778.5 bits: 153.0 E(32554): 4.8e-37 Smith-Waterman score: 704; 43.1% identity (66.8% similar) in 253 aa overlap (18-265:271-519) 10 20 30 40 pF1KE6 MALGACGLLLLLAVPGVSLRTLQPGCGRPQVSDAGGRIVGGHAAPAG : : .. :.. . :::::: : . CCDS55 EVAHRDFANSFSILRYNSTIQESLHRSECPSQRYISLQCSHCGLRAMTGRIVGGALASDS 250 260 270 280 290 300 50 60 70 80 90 100 pF1KE6 AWPWQASLRLRRVHVCGGSLLSPQWVLTAAHCFSGSLNS--SDYQVHLGELEITLSPHFS ::::.::.. .:.:::.:.. :::::::::: . .. ..:. : .. :. . CCDS55 KWPWQVSLHFGTTHICGGTLIDAQWVLTAAHCFFVTREKVLEGWKVYAGTSNLHQLPEAA 310 320 330 340 350 360 110 120 130 140 150 160 pF1KE6 TVRQIILHSSPSGQPGTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRCWVTGWG .. .::..:. . . ::::..:: :.:::..: :.::: .. : . ::.::.: CCDS55 SIAEIIINSNYTDEEDDY-DIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFG 370 380 390 400 410 170 180 190 200 210 220 pF1KE6 YTREGEPLPPPYSLREVKVSVVDTETCRRDYPGPGGSILQPDMLCA---RGPGDACQDDS ::: . :. ::::.:...: . : :: : : : :.:: :: :.:: :: CCDS55 KTRETDDKTSPF-LREVQVNLIDFKKCN-DYL-VYDSYLTPRMMCAGDLRGGRDSCQGDS 420 430 440 450 460 470 230 240 250 260 270 280 pF1KE6 GGPLVCQVNGAWVQAGIVSWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGSESGYPRL ::::::. :. : ::..::: :::. :.:::::.: . :: CCDS55 GGPLVCEQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSKMESEVRFRKS 480 490 500 510 520 530 290 300 310 320 pF1KE6 PLLAGFFLPGLFLLLVSCVLLAKCLLHPSADGTPFPAPD >>CCDS41721.1 TMPRSS13 gene_id:84000|Hs108|chr11 (567 aa) initn: 562 init1: 355 opt: 704 Z-score: 778.1 bits: 153.0 E(32554): 5.1e-37 Smith-Waterman score: 704; 43.1% identity (66.8% similar) in 253 aa overlap (18-265:306-554) 10 20 30 40 pF1KE6 MALGACGLLLLLAVPGVSLRTLQPGCGRPQVSDAGGRIVGGHAAPAG : : .. :.. . :::::: : . CCDS41 EVAHRDFANSFSILRYNSTIQESLHRSECPSQRYISLQCSHCGLRAMTGRIVGGALASDS 280 290 300 310 320 330 50 60 70 80 90 100 pF1KE6 AWPWQASLRLRRVHVCGGSLLSPQWVLTAAHCFSGSLNS--SDYQVHLGELEITLSPHFS ::::.::.. .:.:::.:.. :::::::::: . .. ..:. : .. :. . CCDS41 KWPWQVSLHFGTTHICGGTLIDAQWVLTAAHCFFVTREKVLEGWKVYAGTSNLHQLPEAA 340 350 360 370 380 390 110 120 130 140 150 160 pF1KE6 TVRQIILHSSPSGQPGTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRCWVTGWG .. .::..:. . . ::::..:: :.:::..: :.::: .. : . ::.::.: CCDS41 SIAEIIINSNYTDEEDDY-DIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFG 400 410 420 430 440 450 170 180 190 200 210 220 pF1KE6 YTREGEPLPPPYSLREVKVSVVDTETCRRDYPGPGGSILQPDMLCA---RGPGDACQDDS ::: . :. ::::.:...: . : :: : : : :.:: :: :.:: :: CCDS41 KTRETDDKTSPF-LREVQVNLIDFKKCN-DYL-VYDSYLTPRMMCAGDLRGGRDSCQGDS 460 470 480 490 500 510 230 240 250 260 270 280 pF1KE6 GGPLVCQVNGAWVQAGIVSWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGSESGYPRL ::::::. :. : ::..::: :::. :.:::::.: . :: CCDS41 GGPLVCEQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSKMESEVRFRKS 520 530 540 550 560 290 300 310 320 pF1KE6 PLLAGFFLPGLFLLLVSCVLLAKCLLHPSADGTPFPAPD >>CCDS74856.1 TMPRSS6 gene_id:164656|Hs108|chr22 (802 aa) initn: 626 init1: 222 opt: 699 Z-score: 770.5 bits: 152.1 E(32554): 1.3e-36 Smith-Waterman score: 706; 40.1% identity (62.6% similar) in 289 aa overlap (6-270:522-802) 10 20 30 pF1KE6 MALGACGLLLLLAVPGVSLRTLQPGC-GRPQVSDA :: . . .. .: : :::. :. CCDS74 STCISLPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDG 500 510 520 530 540 550 40 50 60 70 80 pF1KE6 G-------------GRIVGGHAAPAGAWPWQASLRLRRVHVCGGSLLSPQWVLTAAHCFS . .::::: .. : :::::::..: :.:::.:.. .::.::::::. CCDS74 SDEEHCDCGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHICGGALIADRWVITAAHCFQ 560 570 580 590 600 610 90 100 110 120 130 pF1KE6 -GSLNSSD-YQVHLGEL-EITLSPHFST--VRQIILHSSPSGQPGTSG-DIALVELSVPV :. :. . : ::.. . . : . : ...:: : . . :.::..:. :: CCDS74 EDSMASTVLWTVFLGKVWQNSRWPGEVSFKVSRLLLH--PYHEEDSHDYDVALLQLDHPV 620 630 640 650 660 140 150 160 170 180 190 pF1KE6 TLSSRILPVCLPEASDDFCPGIRCWVTGWGYTREGEPLPPPYSLREVKVSVVDTETCRRD . :. . ::::: : : ::..::.:::: ::: :. .:..: :... . : . CCDS74 VRSAAVRPVCLPARSHFFEPGLHCWITGWGALREGGPIS--NALQKVDVQLIPQDLCSEV 670 680 690 700 710 720 200 210 220 230 240 250 pF1KE6 YPGPGGSILQPDMLCA---RGPGDACQDDSGGPLVCQ-VNGAWVQAGIVSWGEGCGRPNR : . : :::: .: :::: ::::::::. ..: : ::.:::: :::::: CCDS74 YRYQ----VTPRMLCAGYRKGKKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNY 730 740 750 760 770 780 260 270 280 290 300 310 pF1KE6 PGVYTRVPAYVNWIRRHITASGGSESGYPRLPLLAGFFLPGLFLLLVSCVLLAKCLLHPS :::::. . ..::.. .: CCDS74 FGVYTRITGVISWIQQVVT 790 800 >>CCDS13941.1 TMPRSS6 gene_id:164656|Hs108|chr22 (811 aa) initn: 626 init1: 222 opt: 699 Z-score: 770.5 bits: 152.1 E(32554): 1.4e-36 Smith-Waterman score: 706; 40.1% identity (62.6% similar) in 289 aa overlap (6-270:531-811) 10 20 30 pF1KE6 MALGACGLLLLLAVPGVSLRTLQPGC-GRPQVSDA :: . . .. .: : :::. :. CCDS13 STCISLPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDG 510 520 530 540 550 560 40 50 60 70 80 pF1KE6 G-------------GRIVGGHAAPAGAWPWQASLRLRRVHVCGGSLLSPQWVLTAAHCFS . .::::: .. : :::::::..: :.:::.:.. .::.::::::. CCDS13 SDEEHCDCGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHICGGALIADRWVITAAHCFQ 570 580 590 600 610 620 90 100 110 120 130 pF1KE6 -GSLNSSD-YQVHLGEL-EITLSPHFST--VRQIILHSSPSGQPGTSG-DIALVELSVPV :. :. . : ::.. . . : . : ...:: : . . :.::..:. :: CCDS13 EDSMASTVLWTVFLGKVWQNSRWPGEVSFKVSRLLLH--PYHEEDSHDYDVALLQLDHPV 630 640 650 660 670 140 150 160 170 180 190 pF1KE6 TLSSRILPVCLPEASDDFCPGIRCWVTGWGYTREGEPLPPPYSLREVKVSVVDTETCRRD . :. . ::::: : : ::..::.:::: ::: :. .:..: :... . : . CCDS13 VRSAAVRPVCLPARSHFFEPGLHCWITGWGALREGGPIS--NALQKVDVQLIPQDLCSEV 680 690 700 710 720 730 200 210 220 230 240 250 pF1KE6 YPGPGGSILQPDMLCA---RGPGDACQDDSGGPLVCQ-VNGAWVQAGIVSWGEGCGRPNR : . : :::: .: :::: ::::::::. ..: : ::.:::: :::::: CCDS13 YRYQ----VTPRMLCAGYRKGKKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNY 740 750 760 770 780 790 260 270 280 290 300 310 pF1KE6 PGVYTRVPAYVNWIRRHITASGGSESGYPRLPLLAGFFLPGLFLLLVSCVLLAKCLLHPS :::::. . ..::.. .: CCDS13 FGVYTRITGVISWIQQVVT 800 810 >>CCDS34120.1 KLKB1 gene_id:3818|Hs108|chr4 (638 aa) initn: 456 init1: 358 opt: 687 Z-score: 758.6 bits: 149.6 E(32554): 6.2e-36 Smith-Waterman score: 687; 41.3% identity (66.9% similar) in 269 aa overlap (16-274:369-630) 10 20 30 40 pF1KE6 MALGACGLLLLLAVPGVSLRTLQPGCGRPQVSDAGGRIVGGHAAP : ::: . : . .. .. ::::: . CCDS34 DCKEEKCKCFLRLSMDGSPTRIAYGTQGSSGYSLRLCNTGDNSVCTTKTSTRIVGGTNSS 340 350 360 370 380 390 50 60 70 80 90 pF1KE6 AGAWPWQASLRLR---RVHVCGGSLLSPQWVLTAAHCFSGSLNSSDYQVHLGEL---EIT : ::::.::... . :.:::::.. ::::::::::.: .. .... : : .:: CCDS34 WGEWPWQVSLQVKLTAQRHLCGGSLIGHQWVLTAAHCFDGLPLQDVWRIYSGILNLSDIT 400 410 420 430 440 450 100 110 120 130 140 150 pF1KE6 LSPHFSTVRQIILHSSPSGQPGTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRC . :: ...::.:.. . . :. ::::..:..:.. . :.::: .: : CCDS34 KDTPFSQIKEIIIHQNYKVSEGNH-DIALIKLQAPLNYTEFQKPICLPSKGDTSTIYTNC 460 470 480 490 500 510 160 170 180 190 200 210 pF1KE6 WVTGWGYTRE-GEPLPPPYSLREVKVSVVDTETCRRDYPGPGGSILQPDMLCA---RGPG ::::::...: :: :..:.. .: .: :.. : .: : :.:: .: CCDS34 WVTGWGFSKEKGEI---QNILQKVNIPLVTNEECQKRYQD--YKITQ-RMVCAGYKEGGK 520 530 540 550 560 570 220 230 240 250 260 270 pF1KE6 DACQDDSGGPLVCQVNGAWVQAGIVSWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGS :::. ::::::::. :: : .::.::::::.: ..:::::.: :..:: .. .: : CCDS34 DACKGDSGGPLVCKHNGMWRLVGITSWGEGCARREQPGVYTKVAEYMDWILEKTQSSDGK 580 590 600 610 620 630 280 290 300 310 320 pF1KE6 ESGYPRLPLLAGFFLPGLFLLLVSCVLLAKCLLHPSADGTPFPAPD CCDS34 AQMQSPA >>CCDS5279.1 PLG gene_id:5340|Hs108|chr6 (810 aa) initn: 602 init1: 242 opt: 672 Z-score: 740.7 bits: 146.6 E(32554): 6.2e-35 Smith-Waterman score: 672; 43.0% identity (65.9% similar) in 249 aa overlap (26-265:567-803) 10 20 30 40 50 pF1KE6 MALGACGLLLLLAVPGVSLRTLQPGCGRPQV--SDAGGRIVGGHAAPAGAWPWQA ::.::: . ::.::: .: .::::. CCDS52 DVGGPWCYTTNPRKLYDYCDVPQCAAPSFDCGKPQVEPKKCPGRVVGGCVAHPHSWPWQV 540 550 560 570 580 590 60 70 80 90 100 110 pF1KE6 SLRLRR-VHVCGGSLLSPQWVLTAAHCFSGSLNSSDYQVHLG-ELEITLSPHFSTVRQII ::: : .: :::.:.::.::::::::. : :.:.: :: . :..: :: . .. CCDS52 SLRTRFGMHFCGGTLISPEWVLTAAHCLEKSPRPSSYKVILGAHQEVNLEPHVQEIEVSR 600 610 620 630 640 650 120 130 140 150 160 pF1KE6 LHSSPSGQPGTSGDIALVELSVPVTLSSRILPVCLPEASDDFCPGIRCWVTGWGYTRE-- : : : ::::..:: :........:.::: . .:..:::: :. CCDS52 LFLEP-----TRKDIALLKLSSPAVITDKVIPACLPSPNYVVADRTECFITGWGETQGTF 660 670 680 690 700 710 170 180 190 200 210 220 pF1KE6 GEPLPPPYSLREVKVSVVDTETCRRDYPGPGGSILQPDMLCA---RGPGDACQDDSGGPL : : :.:... :.....: : : .: . : ::: : :.:: :::::: CCDS52 GAGL-----LKEAQLPVIENKVCNR-YEFLNGRV-QSTELCAGHLAGGTDSCQGDSGGPL 720 730 740 750 760 230 240 250 260 270 280 pF1KE6 VCQVNGAWVQAGIVSWGEGCGRPNRPGVYTRVPAYVNWIRRHITASGGSESGYPRLPLLA :: . .. :..::: ::.:::.::::.:: .:.:: CCDS52 VCFEKDKYILQGVTSWGLGCARPNKPGVYVRVSRFVTWIEGVMRNN 770 780 790 800 810 290 300 310 320 pF1KE6 GFFLPGLFLLLVSCVLLAKCLLHPSADGTPFPAPD 321 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:24:59 2016 done: Tue Nov 8 12:24:59 2016 Total Scan time: 2.300 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]