FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4470, 492 aa
  1>>>pF1KE4470 492 - 492 aa - 492 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.9393+/-0.00103; mu= 10.8790+/- 0.062
 mean_var=130.6575+/-25.943, 0's: 0 Z-trim(109.4): 179 B-trim: 49 in 2/49
 Lambda= 0.112204
 statistics sampled from 10649 (10834) to 10649 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.692), E-opt: 0.2 (0.333), width: 16
 Scan time: 2.650

The best scores are:                                      opt  bits E(32554)
CCDS33564.1 TMPRSS2 gene_id:7113|Hs108|chr21      ( 492) 3494 577.2 1.4e-164
CCDS54486.1 TMPRSS2 gene_id:7113|Hs108|chr21      ( 529) 3494 577.2 1.5e-164
CCDS58790.1 TMPRSS3 gene_id:64699|Hs108|chr21     ( 453) 1191 204.3 2.2e-52
CCDS13686.1 TMPRSS3 gene_id:64699|Hs108|chr21     ( 454) 1180 202.6 7.6e-52
CCDS55788.1 TMPRSS13 gene_id:84000|Hs108|chr11    ( 532) 1000 173.5 5.1e-43
CCDS41721.1 TMPRSS13 gene_id:84000|Hs108|chr11    ( 567)  995 172.7 9.4e-43
CCDS58185.1 TMPRSS13 gene_id:84000|Hs108|chr11    ( 563)  993 172.4 1.2e-42
CCDS44735.1 TMPRSS5 gene_id:80975|Hs108|chr11     ( 457)  907 158.4 1.5e-38
CCDS73392.1 TMPRSS5 gene_id:80975|Hs108|chr11     ( 448)  905 158.0 1.9e-38
CCDS73391.1 TMPRSS5 gene_id:80975|Hs108|chr11     ( 413)  893 156.1 6.8e-38
CCDS13571.1 TMPRSS15 gene_id:5651|Hs108|chr21     (1019)  899 157.3 7.1e-38
CCDS75122.1 CORIN gene_id:10699|Hs108|chr4        ( 938)  824 145.2 3e-34
CCDS3477.1 CORIN gene_id:10699|Hs108|chr4         (1042)  824 145.2 3.2e-34
CCDS53717.1 TMPRSS4 gene_id:56649|Hs108|chr11     ( 397)  810 142.6 7.4e-34
CCDS53716.1 TMPRSS4 gene_id:56649|Hs108|chr11     ( 435)  810 142.7 7.9e-34
CCDS31684.1 TMPRSS4 gene_id:56649|Hs108|chr11     ( 437)  810 142.7 7.9e-34
CCDS44743.1 TMPRSS4 gene_id:56649|Hs108|chr11     ( 432)  806 142.0 1.2e-33
CCDS34120.1 KLKB1 gene_id:3818|Hs108|chr4         ( 638)  767 135.8 1.3e-31
CCDS76482.1 TMPRSS4 gene_id:56649|Hs108|chr11     ( 290)  752 133.1 3.9e-31
CCDS32993.1 HPN gene_id:3249|Hs108|chr19          ( 417)  747 132.4 9e-31
CCDS3520.1 TMPRSS11F gene_id:389208|Hs108|chr4    ( 438)  722 128.4 1.5e-29
CCDS8487.1 ST14 gene_id:6768|Hs108|chr11          ( 855)  717 127.8 4.6e-29
CCDS3518.1 TMPRSS11D gene_id:9407|Hs108|chr4      ( 418)  711 126.6 5.1e-29
CCDS3847.1 F11 gene_id:2160|Hs108|chr4            ( 625)  712 126.9 6.2e-29
CCDS74856.1 TMPRSS6 gene_id:164656|Hs108|chr22    ( 802)  686 122.8 1.4e-27
CCDS13941.1 TMPRSS6 gene_id:164656|Hs108|chr22    ( 811)  686 122.8 1.4e-27
CCDS33993.1 TMPRSS11E gene_id:28983|Hs108|chr4    ( 423)  677 121.1 2.3e-27
CCDS3521.1 TMPRSS11B gene_id:132724|Hs108|chr4    ( 416)  676 121.0 2.6e-27
CCDS42939.1 TMPRSS3 gene_id:64699|Hs108|chr21     ( 344)  670 119.9 4.4e-27
CCDS12088.1 TMPRSS9 gene_id:360200|Hs108|chr19    (1059)  677 121.4 4.8e-27
CCDS43129.2 TMPRSS7 gene_id:344805|Hs108|chr3     ( 717)  655 117.7 4.2e-26
CCDS10430.1 TPSG1 gene_id:25823|Hs108|chr16       ( 321)  642 115.4 9.6e-26
CCDS47065.1 TMPRSS11A gene_id:339967|Hs108|chr4   ( 418)  642 115.5 1.2e-25
CCDS3519.1 TMPRSS11A gene_id:339967|Hs108|chr4    ( 421)  642 115.5 1.2e-25
CCDS74669.1 PRSS56 gene_id:646960|Hs108|chr2      ( 603)  638 114.9 2.5e-25
CCDS44881.1 TMPRSS12 gene_id:283471|Hs108|chr12   ( 348)  623 112.3 8.6e-25
CCDS45469.1 PRSS8 gene_id:5652|Hs108|chr16        ( 343)  622 112.2 9.5e-25
CCDS10481.1 PRSS22 gene_id:64063|Hs108|chr16      ( 317)  617 111.3 1.6e-24
CCDS1563.1 PRSS38 gene_id:339501|Hs108|chr1       ( 326)  614 110.8 2.2e-24
CCDS47145.1 PRSS48 gene_id:345062|Hs108|chr4      ( 328)  614 110.8 2.3e-24
CCDS5279.1 PLG gene_id:5340|Hs108|chr6            ( 810)  613 111.0 5.1e-24
CCDS10431.1 TPSAB1 gene_id:7177|Hs108|chr16       ( 275)  590 106.9 2.9e-23
CCDS157.1 CELA2A gene_id:63036|Hs108|chr1         ( 269)  583 105.8 6.3e-23
CCDS32490.1 CTRB1 gene_id:1504|Hs108|chr16        ( 263)  580 105.3 8.6e-23
CCDS10852.1 CTRL gene_id:1506|Hs108|chr16         ( 264)  575 104.5 1.5e-22
CCDS14101.1 ACR gene_id:49|Hs108|chr22            ( 421)  574 104.4 2.4e-22
CCDS32489.1 CTRB2 gene_id:440387|Hs108|chr16      ( 263)  564 102.7 5.2e-22
CCDS73390.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 344) 563 102.6 7.2e-22 CCDS73393.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 388) 563 102.6 7.9e-22 CCDS3369.1 HGFAC gene_id:3083|Hs108|chr4 ( 655) 564 103.0 1.1e-21 >>CCDS33564.1 TMPRSS2 gene_id:7113|Hs108|chr21 (492 aa) initn: 3494 init1: 3494 opt: 3494 Z-score: 3068.2 bits: 577.2 E(32554): 1.4e-164 Smith-Waterman score: 3494; 100.0% identity (100.0% similar) in 492 aa overlap (1-492:1-492) 10 20 30 40 50 60 pF1KE4 MALNSGSPPAIGPYYENHGYQPENPYPAQPTVVPTVYEVHPAQYYPSPVPQYAPRVLTQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MALNSGSPPAIGPYYENHGYQPENPYPAQPTVVPTVYEVHPAQYYPSPVPQYAPRVLTQA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 SNPVVCTQPKSPSGTVCTSKTKKALCITLTLGTFLVGAALAAGLLWKFMGSKCSNSGIEC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SNPVVCTQPKSPSGTVCTSKTKKALCITLTLGTFLVGAALAAGLLWKFMGSKCSNSGIEC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 DSSGTCINPSNWCDGVSHCPGGEDENRCVRLYGPNFILQVYSSQRKSWHPVCQDDWNENY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DSSGTCINPSNWCDGVSHCPGGEDENRCVRLYGPNFILQVYSSQRKSWHPVCQDDWNENY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 GRAACRDMGYKNNFYSSQGIVDDSGSTSFMKLNTSAGNVDIYKKLYHSDACSSKAVVSLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GRAACRDMGYKNNFYSSQGIVDDSGSTSFMKLNTSAGNVDIYKKLYHSDACSSKAVVSLR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 CIACGVNLNSSRQSRIVGGESALPGAWPWQVSLHVQNVHVCGGSIITPEWIVTAAHCVEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 CIACGVNLNSSRQSRIVGGESALPGAWPWQVSLHVQNVHVCGGSIITPEWIVTAAHCVEK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 PLNNPWHWTAFAGILRQSFMFYGAGYQVEKVISHPNYDSKTKNNDIALMKLQKPLTFNDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PLNNPWHWTAFAGILRQSFMFYGAGYQVEKVISHPNYDSKTKNNDIALMKLQKPLTFNDL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 VKPVCLPNPGMMLQPEQLCWISGWGATEEKGKTSEVLNAAKVLLIETQRCNSRYVYDNLI 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 VKPVCLPNPGMMLQPEQLCWISGWGATEEKGKTSEVLNAAKVLLIETQRCNSRYVYDNLI 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 TPAMICAGFLQGNVDSCQGDSGGPLVTSKNNIWWLIGDTSWGSGCAKAYRPGVYGNVMVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 TPAMICAGFLQGNVDSCQGDSGGPLVTSKNNIWWLIGDTSWGSGCAKAYRPGVYGNVMVF 430 440 450 460 470 480 490 pF1KE4 TDWIYRQMRADG :::::::::::: CCDS33 TDWIYRQMRADG 490 >>CCDS54486.1 TMPRSS2 gene_id:7113|Hs108|chr21 (529 aa) initn: 3494 init1: 3494 opt: 3494 Z-score: 3067.7 bits: 577.2 E(32554): 1.5e-164 Smith-Waterman score: 3494; 100.0% identity (100.0% similar) in 492 aa overlap (1-492:38-529) 10 20 30 pF1KE4 MALNSGSPPAIGPYYENHGYQPENPYPAQP :::::::::::::::::::::::::::::: CCDS54 GESGCEERGAAGHIEHSRYLSLLDAVDNSKMALNSGSPPAIGPYYENHGYQPENPYPAQP 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE4 TVVPTVYEVHPAQYYPSPVPQYAPRVLTQASNPVVCTQPKSPSGTVCTSKTKKALCITLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TVVPTVYEVHPAQYYPSPVPQYAPRVLTQASNPVVCTQPKSPSGTVCTSKTKKALCITLT 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE4 LGTFLVGAALAAGLLWKFMGSKCSNSGIECDSSGTCINPSNWCDGVSHCPGGEDENRCVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LGTFLVGAALAAGLLWKFMGSKCSNSGIECDSSGTCINPSNWCDGVSHCPGGEDENRCVR 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE4 LYGPNFILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGYKNNFYSSQGIVDDSGSTSFM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LYGPNFILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGYKNNFYSSQGIVDDSGSTSFM 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE4 KLNTSAGNVDIYKKLYHSDACSSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KLNTSAGNVDIYKKLYHSDACSSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQ 250 260 270 280 290 300 280 290 300 310 320 330 pF1KE4 VSLHVQNVHVCGGSIITPEWIVTAAHCVEKPLNNPWHWTAFAGILRQSFMFYGAGYQVEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 
VSLHVQNVHVCGGSIITPEWIVTAAHCVEKPLNNPWHWTAFAGILRQSFMFYGAGYQVEK 310 320 330 340 350 360 340 350 360 370 380 390 pF1KE4 VISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEK 370 380 390 400 410 420 400 410 420 430 440 450 pF1KE4 GKTSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSKN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GKTSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSKN 430 440 450 460 470 480 460 470 480 490 pF1KE4 NIWWLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMRADG :::::::::::::::::::::::::::::::::::::::::: CCDS54 NIWWLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMRADG 490 500 510 520 >>CCDS58790.1 TMPRSS3 gene_id:64699|Hs108|chr21 (453 aa) initn: 1037 init1: 648 opt: 1191 Z-score: 1053.9 bits: 204.3 E(32554): 2.2e-52 Smith-Waterman score: 1191; 46.0% identity (68.4% similar) in 411 aa overlap (89-491:52-450) 60 70 80 90 100 110 pF1KE4 QASNPVVCTQPKSPSGTVCTSKTKKALCITLTLGTFLVGAALAAGLLWKFMGSKCSNSGI ...: . . ::: :: .: ::.. CCDS58 DDLKISPVAPDADAVAAQILSLLPLKFFPIIVIGIIALILALAIGLGIHF---DCSGK-Y 30 40 50 60 70 120 130 140 150 160 170 pF1KE4 ECDSSGTCINPSNWCDGVSHCPGGEDENRCVRLYGPNFILQVYSSQRKSWHPVCQDDWNE .: :: ::. ::::: : :::: ::::. : : .:::... ::. .:.:::. CCDS58 RCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVFTAA--SWKTMCSDDWKG 80 90 100 110 120 130 180 190 200 210 220 230 pF1KE4 NYGRAACRDMGYKNNFYSSQGIVDDSGSTSFMKLNTSAGNV---DIYKKLYHS----DAC .:. .:: ..:. . . ::... .: .: . .: .. : :.:: ..: CCDS58 HYANVACAQLGFPS-YVSSDNLRVSSLEGQFREEFVSIDHLLPDDKVTALHHSVYVREGC 140 150 160 170 180 190 240 250 260 270 280 290 pF1KE4 SSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQVSLHVQNVHVCGGSIITPEWI .: ::.:.: ::: . : ::::::. .: . ::::.::. :. :.::::.::: :: CCDS58 ASGHVVTLQCTACGHRRGYS--SRIVGGNMSLLSQWPWQASLQFQGYHLCGGSVITPLWI 200 210 220 230 240 250 300 310 320 330 340 350 pF1KE4 VTAAHCVEKPLNNPWHWTAFAGILRQSFMFYGA-GYQVEKVISHPNYDSKTKNNDIALMK .:::::: : : :: .:.. :.. 
: .. :::.. : .: : .::::::: CCDS58 ITAAHCVYD-LYLPKSWTIQVGLV--SLLDNPAPSHLVEKIVYHSKYKPKRLGNDIALMK 260 270 280 290 300 360 370 380 390 400 410 pF1KE4 LQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEKGKTSEVLNAAKVLLIETQRC : :::::....:::::: . ..:: :::::::. : .: ::: : : :: .. : CCDS58 LAGPLTFNEMIQPVCLPNSEENFPDGKVCWTSGWGATEDGGDASPVLNHAAVPLISNKIC 310 320 330 340 350 360 420 430 440 450 460 470 pF1KE4 NSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSKNNIWWLIGDTSWGSGCAKAYR : : :: ..:.:.:.:::.: :.::::::::::::: .. .: :.: ::.: :::.. . CCDS58 NHRDVYGGIISPSMLCAGYLTGGVDSCQGDSGGPLVCQERRLWKLVGATSFGIGCAEVNK 370 380 390 400 410 420 480 490 pF1KE4 PGVYGNVMVFTDWIYRQMRADG :::: : : :::..::. : CCDS58 PGVYTRVTSFLDWIHEQMERDLKT 430 440 450 >>CCDS13686.1 TMPRSS3 gene_id:64699|Hs108|chr21 (454 aa) initn: 1014 init1: 386 opt: 1180 Z-score: 1044.2 bits: 202.6 E(32554): 7.6e-52 Smith-Waterman score: 1180; 45.9% identity (68.2% similar) in 412 aa overlap (89-491:52-451) 60 70 80 90 100 110 pF1KE4 QASNPVVCTQPKSPSGTVCTSKTKKALCITLTLGTFLVGAALAAGLLWKFMGSKCSNSGI ...: . . ::: :: .: ::.. CCDS13 DDLKISPVAPDADAVAAQILSLLPLKFFPIIVIGIIALILALAIGLGIHF---DCSGK-Y 30 40 50 60 70 120 130 140 150 160 170 pF1KE4 ECDSSGTCINPSNWCDGVSHCPGGEDENRCVRLYGPNFILQVYSSQRKSWHPVCQDDWNE .: :: ::. ::::: : :::: ::::. : : .:::... ::. .:.:::. CCDS13 RCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVFTAA--SWKTMCSDDWKG 80 90 100 110 120 130 180 190 200 210 220 230 pF1KE4 NYGRAACRDMGYKNNFYSSQGIVDDSGSTSFMKLNTSAGNV---DIYKKLYHS----DAC .:. .:: ..:. . . ::... .: .: . .: .. : :.:: ..: CCDS13 HYANVACAQLGFPS-YVSSDNLRVSSLEGQFREEFVSIDHLLPDDKVTALHHSVYVREGC 140 150 160 170 180 190 240 250 260 270 280 290 pF1KE4 SSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQVSLHVQNVHVCGGSIITPEWI .: ::.:.: ::: . : ::::::. .: . ::::.::. :. :.::::.::: :: CCDS13 ASGHVVTLQCTACGHRRGYS--SRIVGGNMSLLSQWPWQASLQFQGYHLCGGSVITPLWI 200 210 220 230 240 250 300 310 320 330 340 350 pF1KE4 VTAAHCVEKPLNNPWHWTAFAGILRQSFMFYGA-GYQVEKVISHPNYDSKTKNNDIALMK .:::::: : : :: .:.. :.. : .. :::.. 
: .: : .::::::: CCDS13 ITAAHCVYD-LYLPKSWTIQVGLV--SLLDNPAPSHLVEKIVYHSKYKPKRLGNDIALMK 260 270 280 290 300 360 370 380 390 400 pF1KE4 LQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEE-KGKTSEVLNAAKVLLIETQR : :::::....:::::: . ..:: :::::::. : .: ::: : : :: .. CCDS13 LAGPLTFNEMIQPVCLPNSEENFPDGKVCWTSGWGATEDGAGDASPVLNHAAVPLISNKI 310 320 330 340 350 360 410 420 430 440 450 460 pF1KE4 CNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSKNNIWWLIGDTSWGSGCAKAY :: : :: ..:.:.:.:::.: :.::::::::::::: .. .: :.: ::.: :::.. CCDS13 CNHRDVYGGIISPSMLCAGYLTGGVDSCQGDSGGPLVCQERRLWKLVGATSFGIGCAEVN 370 380 390 400 410 420 470 480 490 pF1KE4 RPGVYGNVMVFTDWIYRQMRADG .:::: : : :::..::. : CCDS13 KPGVYTRVTSFLDWIHEQMERDLKT 430 440 450 >>CCDS55788.1 TMPRSS13 gene_id:84000|Hs108|chr11 (532 aa) initn: 866 init1: 497 opt: 1000 Z-score: 885.8 bits: 173.5 E(32554): 5.1e-43 Smith-Waterman score: 1006; 34.4% identity (62.1% similar) in 506 aa overlap (7-491:35-526) 10 20 30 pF1KE4 MALNSGSPPAIGPYYENH-GYQPENPYPAQ--PTVV :: .: . : : ::: :. . CCDS55 SHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGT 10 20 30 40 50 60 40 50 60 70 80 pF1KE4 P----TVYEVHPAQYYPSPV-PQYAPRVLTQASNPVVCTQPKSPSGTVCTSKTKKALCIT : . .. ::: :. . : : .:.. : . .. :..: :: :. : . CCDS55 PPGRASPGRASPAQASPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRA 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE4 LTLGTFLV--GAALAAGLLWKFMGSKCS----NSGIEC-DSSGTCINPSNWCDGVSHCPG .:. . . : .: : . ..::. .. .: . . :::: : CCDS55 TPVGAVPIRSSPARSAPATRATRESPVQFWQGHTGIRYKEQRESCPKHAVRCDGVVDCKL 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE4 GEDENRCVRLYGPNFILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGYKNNFYSSQGIV :: :::. . .:..::.. ..: :.:...::..:.. .:...:... ... CCDS55 KSDELGCVRFDWDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAH 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE4 DD-SGSTSFMKLNTSAGNVDIYKKLYHSDACSSKAVVSLRCIACGVNLNSSRQSRIVGGE : ..: :... :.. : ..:..:. : :. .::.: ::. 
..: :::: CCDS55 RDFANSFSILRYNST-----IQESLHRSE-CPSQRYISLQCSHCGLRAMTGR---IVGGA 250 260 270 280 290 270 280 290 300 310 pF1KE4 SALPGAWPWQVSLHVQNVHVCGGSIITPEWIVTAAHCV----EKPLNNPWHWTAFAGILR : . :::::::: ..:.:::..: .:..::::: :: :.. : ..:: CCDS55 LASDSKWPWQVSLHFGTTHICGGTLIDAQWVLTAAHCFFVTREKVLEG---WKVYAG--T 300 310 320 330 340 350 320 330 340 350 360 370 pF1KE4 QSFMFYGAGYQVEKVISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPE ... . .. ..: . :: .. . :::::.:.::::.. ..:.::: :. .. . CCDS55 SNLHQLPEAASIAEIIINSNYTDEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLN 360 370 380 390 400 410 380 390 400 410 420 430 pF1KE4 QLCWISGWGATEEKG-KTSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVD . :::.:.: :.: ::: : ..: ::. ..::. :::. .:: :.::: :.:. : CCDS55 ETCWITGFGKTRETDDKTSPFLREVQVNLIDFKKCNDYLVYDSYLTPRMMCAGDLRGGRD 420 430 440 450 460 470 440 450 460 470 480 490 pF1KE4 SCQGDSGGPLVTSKNNIWWLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMRADG ::::::::::: .:: :.: : ::::.::.. .:::: .: ::: .:... CCDS55 SCQGDSGGPLVCEQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSKMESEVRFR 480 490 500 510 520 530 CCDS55 KS >>CCDS41721.1 TMPRSS13 gene_id:84000|Hs108|chr11 (567 aa) initn: 882 init1: 497 opt: 995 Z-score: 881.0 bits: 172.7 E(32554): 9.4e-43 Smith-Waterman score: 1009; 35.4% identity (63.4% similar) in 492 aa overlap (9-491:91-561) 10 20 30 pF1KE4 MALNSGSPPAIGPYYENHGYQPENPYPAQPTVVPT-VY ::.. .. . . . :. :. :: :: CCDS41 PAGTPPGRASPGRASPAQASPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVY 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE4 EVHPAQYYPSPVPQYAPRVLTQASNPVVCTQPKSPSGTVCTSKTKKALCITLTL-GTFLV :. .:: : : :.. . .:: :: . : . : : : :. CCDS41 LVR-----ATPVGAVPIRSSPARSAPATRATRESP-GTSLPKFTWREGQKQLPLIGCVLL 130 140 150 160 170 100 110 120 130 140 150 pF1KE4 GAALAAGLLWKFMGSKCSNSGIEC-DSSGTCINPSNWCDGVSHCPGGEDENRCVRLYGPN ::...:. :. . ...::. .. .: . . :::: : :: :::. . CCDS41 LIALVVSLIILFQFWQ-GHTGIRYKEQRESCPKHAVRCDGVVDCKLKSDELGCVRFDWDK 180 190 200 210 220 230 160 170 180 190 200 210 pF1KE4 FILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGYKNNFYSSQGIVDD-SGSTSFMKLNT .:..::.. 
..: :.:...::..:.. .:...:... ... : ..: :... :. CCDS41 SLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHRDFANSFSILRYNS 240 250 260 270 280 290 220 230 240 250 260 270 pF1KE4 SAGNVDIYKKLYHSDACSSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQVSLH . : ..:..:. : :. .::.: ::. . .::::: : . :::::::: CCDS41 T-----IQESLHRSE-CPSQRYISLQCSHCGL---RAMTGRIVGGALASDSKWPWQVSLH 300 310 320 330 340 280 290 300 310 320 330 pF1KE4 VQNVHVCGGSIITPEWIVTAAHCV----EKPLNNPWHWTAFAGILRQSFMFYGAGYQVEK ..:.:::..: .:..::::: :: :.. : ..:: . .:. . . CCDS41 FGTTHICGGTLIDAQWVLTAAHCFFVTREKVLEG---WKVYAGTSNLHQLPEAAS--IAE 350 360 370 380 390 340 350 360 370 380 390 pF1KE4 VISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEK .: . :: .. . :::::.:.::::.. ..:.::: :. .. .. :::.:.: :.: CCDS41 IIINSNYTDEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFGKTRET 400 410 420 430 440 450 400 410 420 430 440 pF1KE4 G-KTSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSK ::: : ..: ::. ..::. :::. .:: :.::: :.:. :::::::::::: . CCDS41 DDKTSPFLREVQVNLIDFKKCNDYLVYDSYLTPRMMCAGDLRGGRDSCQGDSGGPLVCEQ 460 470 480 490 500 510 450 460 470 480 490 pF1KE4 NNIWWLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMRADG :: :.: : ::::.::.. .:::: .: ::: .:... CCDS41 NNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSKMESEVRFRKS 520 530 540 550 560 >>CCDS58185.1 TMPRSS13 gene_id:84000|Hs108|chr11 (563 aa) initn: 880 init1: 495 opt: 993 Z-score: 879.3 bits: 172.4 E(32554): 1.2e-42 Smith-Waterman score: 1007; 35.4% identity (63.3% similar) in 491 aa overlap (9-490:91-560) 10 20 30 pF1KE4 MALNSGSPPAIGPYYENHGYQPENPYPAQPTVVPT-VY ::.. .. . . . :. :. :: :: CCDS58 PAGTPPGRASPGRASPAQASPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVY 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE4 EVHPAQYYPSPVPQYAPRVLTQASNPVVCTQPKSPSGTVCTSKTKKALCITLTL-GTFLV :. .:: : : :.. . .:: :: . : . : : : :. CCDS58 LVR-----ATPVGAVPIRSSPARSAPATRATRESP-GTSLPKFTWREGQKQLPLIGCVLL 130 140 150 160 170 100 110 120 130 140 150 pF1KE4 GAALAAGLLWKFMGSKCSNSGIEC-DSSGTCINPSNWCDGVSHCPGGEDENRCVRLYGPN ::...:. :. . ...::. .. .: . . :::: : :: :::. 
. CCDS58 LIALVVSLIILFQFWQ-GHTGIRYKEQRESCPKHAVRCDGVVDCKLKSDELGCVRFDWDK 180 190 200 210 220 230 160 170 180 190 200 210 pF1KE4 FILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGYKNNFYSSQGIVDD-SGSTSFMKLNT .:..::.. ..: :.:...::..:.. .:...:... ... : ..: :... :. CCDS58 SLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHRDFANSFSILRYNS 240 250 260 270 280 290 220 230 240 250 260 270 pF1KE4 SAGNVDIYKKLYHSDACSSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQVSLH . : ..:..:. : :. .::.: ::. . .::::: : . :::::::: CCDS58 T-----IQESLHRSE-CPSQRYISLQCSHCGL---RAMTGRIVGGALASDSKWPWQVSLH 300 310 320 330 340 280 290 300 310 320 330 pF1KE4 VQNVHVCGGSIITPEWIVTAAHCV----EKPLNNPWHWTAFAGILRQSFMFYGAGYQVEK ..:.:::..: .:..::::: :: :.. : ..:: . .:. . . CCDS58 FGTTHICGGTLIDAQWVLTAAHCFFVTREKVLEG---WKVYAGTSNLHQLPEAAS--IAE 350 360 370 380 390 340 350 360 370 380 390 pF1KE4 VISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEK .: . :: .. . :::::.:.::::.. ..:.::: :. .. .. :::.:.: :.: CCDS58 IIINSNYTDEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFGKTRET 400 410 420 430 440 450 400 410 420 430 440 pF1KE4 G-KTSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSK ::: : ..: ::. ..::. :::. .:: :.::: :.:. :::::::::::: . CCDS58 DDKTSPFLREVQVNLIDFKKCNDYLVYDSYLTPRMMCAGDLRGGRDSCQGDSGGPLVCEQ 460 470 480 490 500 510 450 460 470 480 490 pF1KE4 NNIWWLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMRADG :: :.: : ::::.::.. .:::: .: ::: .:.. CCDS58 NNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSKMESSAG 520 530 540 550 560 >>CCDS44735.1 TMPRSS5 gene_id:80975|Hs108|chr11 (457 aa) initn: 813 init1: 336 opt: 907 Z-score: 805.4 bits: 158.4 E(32554): 1.5e-38 Smith-Waterman score: 907; 35.4% identity (62.3% similar) in 427 aa overlap (68-485:32-449) 40 50 60 70 80 90 pF1KE4 EVHPAQYYPSPVPQYAPRVLTQASNPVVCTQPKSPSGTVCTSKTKKALCITLTLGTFLVG : . : .:: . ... . .:: .:.: CCDS44 SLMLDDQPPMEAQYAEEGPGPGIFRAEPGDQQHPISQAVCWRSMRRGCAVLGALG-LLAG 10 20 30 40 50 60 100 110 120 130 140 150 pF1KE4 AALAAGLLWKFMGSKCSN--SGIECDSSGT--CINPSNWCDGVSHCPGGEDENRCVRLYG :.... :: .. :. :: : : : . : . : .. :. . 
CCDS44 AGVGSWLLVLYLCPAASQPISGTLQDEEITLSCSEASAEEALLPALP----KTVSFRINS 70 80 90 100 110 160 170 180 190 200 210 pF1KE4 PNFILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGY-KNNFYSSQGIVDDSGSTS--FM .:.:.. .. : ::.. :. : : ..:. . . ... ...: . ..: : CCDS44 EDFLLEAQVRDQPRWLLVCHEGWSPALGLQICWSLGHLRLTHHKGVNLTDIKLNSSQEFA 120 130 140 150 160 170 220 230 240 250 260 270 pF1KE4 KLNTSAGNVDIYKKLYHSDACSSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQ .:. :. . . . :.: :::::: ::. ::::::.:. :: :::: CCDS44 QLSPRLGGF-LEEAWQPRNNCTSGQVVSLRCSECGA---RPLASRIVGGQSVAPGRWPWQ 180 190 200 210 220 230 280 290 300 310 320 pF1KE4 VSLHVQNVHVCGGSIITPEWIVTAAHCVEK-PLNNPWHWTAFAGILRQSFMFYGAGYQVE .:. . :.::::...:.:.::::::... : : . ::.. .: . : :: CCDS44 ASVALGFRHTCGGSVLAPRWVVTAAHCMHSFRLARLSSWRVHAGLVSHSAVRPHQGALVE 240 250 260 270 280 290 330 340 350 360 370 380 pF1KE4 KVISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEE ..: :: :...... :.::..:: :.:.: : :::: . . . ::.:::: :. CCDS44 RIIPHPLYSAQNHDYDVALLRLQTALNFSDTVGAVCLPAKEQHFPKGSRCWVSGWGHTHP 300 310 320 330 340 350 390 400 410 420 430 440 pF1KE4 KGK-TSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTS . .:..:. . : :. :: ::: ::.. .:: :.:::.:.: .:.:::::::::: CCDS44 SHTYSSDMLQDTVVPLFSTQLCNSSCVYSGALTPRMLCAGYLDGRADACQGDSGGPLVCP 360 370 380 390 400 410 450 460 470 480 490 pF1KE4 KNNIWWLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMRADG .. : :.: .::: :::. .::::..: : :::. CCDS44 DGDTWRLVGVVSWGRGCAEPNHPGVYAKVAEFLDWIHDTAQDSLL 420 430 440 450 >>CCDS73392.1 TMPRSS5 gene_id:80975|Hs108|chr11 (448 aa) initn: 813 init1: 336 opt: 905 Z-score: 803.7 bits: 158.0 E(32554): 1.9e-38 Smith-Waterman score: 905; 35.5% identity (62.6% similar) in 422 aa overlap (73-485:28-440) 50 60 70 80 90 100 pF1KE4 QYYPSPVPQYAPRVLTQASNPVVCTQPKSPSGTVCTSKTKKALCITLTLGTFLVGAALAA : .:: . ... . .:: .:.::.... CCDS73 MTGWGQWRAIILHSPDPPWGQPHMIDVSQAVCWRSMRRGCAVLGALG-LLAGAGVGS 10 20 30 40 50 110 120 130 140 150 pF1KE4 GLLWKFMGSKCSN--SGIECDSSGT--CINPSNWCDGVSHCPGGEDENRCVRLYGPNFIL :: .. :. :: : : : . : . : .. :. . 
.:.: CCDS73 WLLVLYLCPAASQPISGTLQDEEITLSCSEASAEEALLPALP----KTVSFRINSEDFLL 60 70 80 90 100 110 160 170 180 190 200 210 pF1KE4 QVYSSQRKSWHPVCQDDWNENYGRAACRDMGY-KNNFYSSQGIVDDSGSTS--FMKLNTS .. .. : ::.. :. : : ..:. . . ... ...: . ..: : .:. CCDS73 EAQVRDQPRWLLVCHEGWSPALGLQICWSLGHLRLTHHKGVNLTDIKLNSSQEFAQLSPR 120 130 140 150 160 170 220 230 240 250 260 270 pF1KE4 AGNVDIYKKLYHSDACSSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQVSLHV :. . . . :.: :::::: ::. ::::::.:. :: ::::.:. . CCDS73 LGGF-LEEAWQPRNNCTSGQVVSLRCSECGAR---PLASRIVGGQSVAPGRWPWQASVAL 180 190 200 210 220 280 290 300 310 320 330 pF1KE4 QNVHVCGGSIITPEWIVTAAHCVEK-PLNNPWHWTAFAGILRQSFMFYGAGYQVEKVISH :.::::...:.:.::::::... : : . ::.. .: . : ::..: : CCDS73 GFRHTCGGSVLAPRWVVTAAHCMHSFRLARLSSWRVHAGLVSHSAVRPHQGALVERIIPH 230 240 250 260 270 280 340 350 360 370 380 390 pF1KE4 PNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEKGK-T : :...... :.::..:: :.:.: : :::: . . . ::.:::: :. . . CCDS73 PLYSAQNHDYDVALLRLQTALNFSDTVGAVCLPAKEQHFPKGSRCWVSGWGHTHPSHTYS 290 300 310 320 330 340 400 410 420 430 440 450 pF1KE4 SEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSKNNIW :..:. . : :. :: ::: ::.. .:: :.:::.:.: .:.:::::::::: .. : CCDS73 SDMLQDTVVPLFSTQLCNSSCVYSGALTPRMLCAGYLDGRADACQGDSGGPLVCPDGDTW 350 360 370 380 390 400 460 470 480 490 pF1KE4 WLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMRADG :.: .::: :::. .::::..: : :::. CCDS73 RLVGVVSWGRGCAEPNHPGVYAKVAEFLDWIHDTAQDSLL 410 420 430 440 >>CCDS73391.1 TMPRSS5 gene_id:80975|Hs108|chr11 (413 aa) initn: 813 init1: 336 opt: 893 Z-score: 793.7 bits: 156.1 E(32554): 6.8e-38 Smith-Waterman score: 893; 35.9% identity (62.3% similar) in 409 aa overlap (86-485:5-405) 60 70 80 90 100 110 pF1KE4 VLTQASNPVVCTQPKSPSGTVCTSKTKKALCITLTLGTFLVGAALAAGLLWKFMGSKCSN : .: .:.::.... :: .. :. CCDS73 MRRGCAVLGALGLLAGAGVGSWLLVLYLCPAASQ 10 20 30 120 130 140 150 160 170 pF1KE4 --SGIECDSSGT--CINPSNWCDGVSHCPGGEDENRCVRLYGPNFILQVYSSQRKSWHPV :: : : : . : . : .. :. . .:.:.. .. 
: : CCDS73 PISGTLQDEEITLSCSEASAEEALLPALP----KTVSFRINSEDFLLEAQVRDQPRWLLV 40 50 60 70 80 90 180 190 200 210 220 pF1KE4 CQDDWNENYGRAACRDMGY-KNNFYSSQGIVDDSGSTS--FMKLNTSAGNVDIYKKLYHS :.. :. : : ..:. . . ... ...: . ..: : .:. :. . . CCDS73 CHEGWSPALGLQICWSLGHLRLTHHKGVNLTDIKLNSSQEFAQLSPRLGGF-LEEAWQPR 100 110 120 130 140 230 240 250 260 270 280 pF1KE4 DACSSKAVVSLRCIACGVNLNSSRQSRIVGGESALPGAWPWQVSLHVQNVHVCGGSIITP . :.: :::::: ::. ::::::.:. :: ::::.:. . :.::::...: CCDS73 NNCTSGQVVSLRCSECGAR---PLASRIVGGQSVAPGRWPWQASVALGFRHTCGGSVLAP 150 160 170 180 190 200 290 300 310 320 330 340 pF1KE4 EWIVTAAHCVEK-PLNNPWHWTAFAGILRQSFMFYGAGYQVEKVISHPNYDSKTKNNDIA .:.::::::... : : . ::.. .: . : ::..: :: :...... :.: CCDS73 RWVVTAAHCMHSFRLARLSSWRVHAGLVSHSAVRPHQGALVERIIPHPLYSAQNHDYDVA 210 220 230 240 250 260 350 360 370 380 390 400 pF1KE4 LMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEKGK-TSEVLNAAKVLLIE :..:: :.:.: : :::: . . . ::.:::: :. . .:..:. . : :. CCDS73 LLRLQTALNFSDTVGAVCLPAKEQHFPKGSRCWVSGWGHTHPSHTYSSDMLQDTVVPLFS 270 280 290 300 310 320 410 420 430 440 450 460 pF1KE4 TQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSKNNIWWLIGDTSWGSGCA :: ::: ::.. .:: :.:::.:.: .:.:::::::::: .. : :.: .::: ::: CCDS73 TQLCNSSCVYSGALTPRMLCAGYLDGRADACQGDSGGPLVCPDGDTWRLVGVVSWGRGCA 330 340 350 360 370 380 470 480 490 pF1KE4 KAYRPGVYGNVMVFTDWIYRQMRADG . .::::..: : :::. CCDS73 EPNHPGVYAKVAEFLDWIHDTAQDSLL 390 400 410 492 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:54:41 2016 done: Sun Nov 6 00:54:41 2016 Total Scan time: 2.650 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]