FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1552, 560 aa
  1>>>pF1KE1552 560 - 560 aa - 560 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6579+/-0.00132; mu= 19.1863+/- 0.080
 mean_var=207.6393+/-37.490, 0's: 0 Z-trim(109.2): 248 B-trim: 29 in 1/50
 Lambda= 0.089006
 statistics sampled from 10445 (10717) to 10445 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.696), E-opt: 0.2 (0.329), width: 16
 Scan time: 2.390

The best scores are:                                    opt  bits E(32554)
CCDS7577.1 HABP2 gene_id:3026|Hs108|chr10       ( 560) 4060 535.0 9.3e-152
CCDS53579.1 HABP2 gene_id:3026|Hs108|chr10      ( 534) 3893 513.5 2.6e-145
CCDS3369.1 HGFAC gene_id:3083|Hs108|chr4        ( 655) 1148 161.2 3.7e-39
CCDS75098.1 HGFAC gene_id:3083|Hs108|chr4       ( 662) 1100 155.0 2.6e-37
CCDS34120.1 KLKB1 gene_id:3818|Hs108|chr4       ( 638)  538  82.8 1.4e-15
CCDS58790.1 TMPRSS3 gene_id:64699|Hs108|chr21   ( 453)  536  82.3 1.4e-15
CCDS13686.1 TMPRSS3 gene_id:64699|Hs108|chr21   ( 454)  536  82.3 1.4e-15
CCDS33564.1 TMPRSS2 gene_id:7113|Hs108|chr21    ( 492)  530  81.6 2.5e-15
CCDS54486.1 TMPRSS2 gene_id:7113|Hs108|chr21    ( 529)  530  81.6 2.5e-15
CCDS74856.1 TMPRSS6 gene_id:164656|Hs108|chr22  ( 802)  532  82.2 2.6e-15
CCDS13941.1 TMPRSS6 gene_id:164656|Hs108|chr22  ( 811)  532  82.2 2.6e-15
CCDS75122.1 CORIN gene_id:10699|Hs108|chr4      ( 938)  529  81.9 3.7e-15
CCDS3477.1 CORIN gene_id:10699|Hs108|chr4       (1042)  529  82.0 3.9e-15
CCDS73391.1 TMPRSS5 gene_id:80975|Hs108|chr11   ( 413)  516  79.7 7.8e-15
CCDS73392.1 TMPRSS5 gene_id:80975|Hs108|chr11   ( 448)  516  79.7 8.1e-15
CCDS44735.1 TMPRSS5 gene_id:80975|Hs108|chr11   ( 457)  516  79.7 8.2e-15
CCDS8487.1 ST14 gene_id:6768|Hs108|chr11        ( 855)  516  80.2 1.1e-14
CCDS10431.1 TPSAB1 gene_id:7177|Hs108|chr16     ( 275)  508  78.4 1.3e-14
CCDS3847.1 F11 gene_id:2160|Hs108|chr4          ( 625)  511  79.3 1.5e-14
CCDS55788.1 TMPRSS13 gene_id:84000|Hs108|chr11  ( 532)  510  79.1 1.5e-14
CCDS41721.1 TMPRSS13 gene_id:84000|Hs108|chr11  ( 567)  510  79.1 1.6e-14
CCDS44881.1 TMPRSS12 gene_id:283471|Hs108|chr12 ( 348)  507  78.4 1.6e-14
CCDS58185.1 TMPRSS13 gene_id:84000|Hs108|chr11  ( 563)  509  79.0 1.7e-14
CCDS8812.1 CELA1 gene_id:1990|Hs108|chr12       ( 258)  504  77.8 1.8e-14
CCDS34302.1 F12 gene_id:2161|Hs108|chr5         ( 615)  505  78.5 2.5e-14
CCDS157.1 CELA2A gene_id:63036|Hs108|chr1       ( 269)  500  77.3 2.6e-14
CCDS14666.1 F9 gene_id:2158|Hs108|chrX          ( 461)  494  76.9 5.8e-14
CCDS3518.1 TMPRSS11D gene_id:9407|Hs108|chr4    ( 418)  492  76.6 6.7e-14
CCDS12813.1 KLK8 gene_id:11202|Hs108|chr19      ( 260)  489  75.9 6.8e-14
CCDS42600.1 KLK8 gene_id:11202|Hs108|chr19      ( 305)  489  76.0 7.4e-14
CCDS45469.1 PRSS8 gene_id:5652|Hs108|chr16      ( 343)  489  76.1 7.9e-14
CCDS12088.1 TMPRSS9 gene_id:360200|Hs108|chr19  (1059)  495  77.7 8.1e-14
CCDS3964.1 GZMK gene_id:3003|Hs108|chr5         ( 264)  484  75.2 1.1e-13
CCDS73602.1 F7 gene_id:2155|Hs108|chr13         ( 382)  483  75.4 1.4e-13
CCDS9529.1 F7 gene_id:2155|Hs108|chr13          ( 444)  483  75.5 1.5e-13
CCDS9528.1 F7 gene_id:2155|Hs108|chr13          ( 466)  483  75.5 1.6e-13
CCDS76482.1 TMPRSS4 gene_id:56649|Hs108|chr11   ( 290)  480  74.8 1.6e-13
CCDS53717.1 TMPRSS4 gene_id:56649|Hs108|chr11   ( 397)  480  75.0 1.9e-13
CCDS74669.1 PRSS56 gene_id:646960|Hs108|chr2    ( 603)  482  75.6 1.9e-13
CCDS44743.1 TMPRSS4 gene_id:56649|Hs108|chr11   ( 432)  480  75.1 2e-13
CCDS53716.1 TMPRSS4 gene_id:56649|Hs108|chr11   ( 435)  480  75.1 2e-13
CCDS31684.1 TMPRSS4 gene_id:56649|Hs108|chr11   ( 437)  480  75.1 2e-13
CCDS83495.1 F9 gene_id:2158|Hs108|chrX          ( 423)  478  74.8 2.3e-13
CCDS83291.1 PLAT gene_id:5327|Hs108|chr8        ( 473)  477  74.8 2.7e-13
CCDS12822.1 KLK13 gene_id:26085|Hs108|chr19     ( 277)  473  73.9 2.9e-13
CCDS14101.1 ACR gene_id:49|Hs108|chr22          ( 421)  473  74.2 3.6e-13
CCDS10852.1 CTRL gene_id:1506|Hs108|chr16       ( 264)  466  72.9 5.3e-13
CCDS5976.1 PRSS55 gene_id:203074|Hs108|chr8     ( 352)  467  73.3 5.6e-13
CCDS2145.1 PROC gene_id:5624|Hs108|chr2         ( 461)  468
73.6 5.9e-13 CCDS10481.1 PRSS22 gene_id:64063|Hs108|chr16 ( 317) 465 72.9 6.4e-13 >>CCDS7577.1 HABP2 gene_id:3026|Hs108|chr10 (560 aa) initn: 4060 init1: 4060 opt: 4060 Z-score: 2838.0 bits: 535.0 E(32554): 9.3e-152 Smith-Waterman score: 4060; 100.0% identity (100.0% similar) in 560 aa overlap (1-560:1-560) 10 20 30 40 50 60 pF1KE1 MFARMSDLHVLLLMALVGKTACGFSLMSLLESLDPDWTPDQYDYSYEDYNQEENTSSTLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MFARMSDLHVLLLMALVGKTACGFSLMSLLESLDPDWTPDQYDYSYEDYNQEENTSSTLT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HAENPDWYYTEDQADPCQPNPCEHGGDCLVHGSTFTCSCLAPFSGNKCQKVQNTCKDNPC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 HAENPDWYYTEDQADPCQPNPCEHGGDCLVHGSTFTCSCLAPFSGNKCQKVQNTCKDNPC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GRGQCLITQSPPYYRCVCKHPYTGPSCSQVVPVCRPNPCQNGATCSRHKRRSKFTCACPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 GRGQCLITQSPPYYRCVCKHPYTGPSCSQVVPVCRPNPCQNGATCSRHKRRSKFTCACPD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 QFKGKFCEIGSDDCYVGDGYSYRGKMNRTVNQHACLYWNSHLLLQENYNMFMEDAETHGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 QFKGKFCEIGSDDCYVGDGYSYRGKMNRTVNQHACLYWNSHLLLQENYNMFMEDAETHGI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 GEHNFCRNPDADEKPWCFIKVTNDKVKWEYCDVSACSAQDVAYPEESPTEPSTKLPGFDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 GEHNFCRNPDADEKPWCFIKVTNDKVKWEYCDVSACSAQDVAYPEESPTEPSTKLPGFDS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 CGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMPQGHFCGGALIHPCWVLTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 CGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMPQGHFCGGALIHPCWVLTA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 AHCTDIKTRHLKVVLGDQDLKKEEFHEQSFRVEKIFKYSHYNERDEIPHNDIALLKLKPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 AHCTDIKTRHLKVVLGDQDLKKEEFHEQSFRVEKIFKYSHYNERDEIPHNDIALLKLKPV 370 380 390 400 
410 420 430 440 450 460 470 480 pF1KE1 DGHCALESKYVKTVCLPDGSFPSGSECHISGWGVTETGKGSRQLLDAKVKLIANTLCNSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 DGHCALESKYVKTVCLPDGSFPSGSECHISGWGVTETGKGSRQLLDAKVKLIANTLCNSR 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE1 QLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPLTCEKDGTYYVYGIVSWGLECGKRPGVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 QLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPLTCEKDGTYYVYGIVSWGLECGKRPGVY 490 500 510 520 530 540 550 560 pF1KE1 TQVTKFLNWIKATIKSESGF :::::::::::::::::::: CCDS75 TQVTKFLNWIKATIKSESGF 550 560 >>CCDS53579.1 HABP2 gene_id:3026|Hs108|chr10 (534 aa) initn: 3893 init1: 3893 opt: 3893 Z-score: 2722.3 bits: 513.5 E(32554): 2.6e-145 Smith-Waterman score: 3893; 100.0% identity (100.0% similar) in 534 aa overlap (27-560:1-534) 10 20 30 40 50 60 pF1KE1 MFARMSDLHVLLLMALVGKTACGFSLMSLLESLDPDWTPDQYDYSYEDYNQEENTSSTLT :::::::::::::::::::::::::::::::::: CCDS53 MSLLESLDPDWTPDQYDYSYEDYNQEENTSSTLT 10 20 30 70 80 90 100 110 120 pF1KE1 HAENPDWYYTEDQADPCQPNPCEHGGDCLVHGSTFTCSCLAPFSGNKCQKVQNTCKDNPC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 HAENPDWYYTEDQADPCQPNPCEHGGDCLVHGSTFTCSCLAPFSGNKCQKVQNTCKDNPC 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE1 GRGQCLITQSPPYYRCVCKHPYTGPSCSQVVPVCRPNPCQNGATCSRHKRRSKFTCACPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 GRGQCLITQSPPYYRCVCKHPYTGPSCSQVVPVCRPNPCQNGATCSRHKRRSKFTCACPD 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE1 QFKGKFCEIGSDDCYVGDGYSYRGKMNRTVNQHACLYWNSHLLLQENYNMFMEDAETHGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 QFKGKFCEIGSDDCYVGDGYSYRGKMNRTVNQHACLYWNSHLLLQENYNMFMEDAETHGI 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE1 GEHNFCRNPDADEKPWCFIKVTNDKVKWEYCDVSACSAQDVAYPEESPTEPSTKLPGFDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 GEHNFCRNPDADEKPWCFIKVTNDKVKWEYCDVSACSAQDVAYPEESPTEPSTKLPGFDS 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE1 
CGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMPQGHFCGGALIHPCWVLTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 CGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMPQGHFCGGALIHPCWVLTA 280 290 300 310 320 330 370 380 390 400 410 420 pF1KE1 AHCTDIKTRHLKVVLGDQDLKKEEFHEQSFRVEKIFKYSHYNERDEIPHNDIALLKLKPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 AHCTDIKTRHLKVVLGDQDLKKEEFHEQSFRVEKIFKYSHYNERDEIPHNDIALLKLKPV 340 350 360 370 380 390 430 440 450 460 470 480 pF1KE1 DGHCALESKYVKTVCLPDGSFPSGSECHISGWGVTETGKGSRQLLDAKVKLIANTLCNSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 DGHCALESKYVKTVCLPDGSFPSGSECHISGWGVTETGKGSRQLLDAKVKLIANTLCNSR 400 410 420 430 440 450 490 500 510 520 530 540 pF1KE1 QLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPLTCEKDGTYYVYGIVSWGLECGKRPGVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 QLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPLTCEKDGTYYVYGIVSWGLECGKRPGVY 460 470 480 490 500 510 550 560 pF1KE1 TQVTKFLNWIKATIKSESGF :::::::::::::::::::: CCDS53 TQVTKFLNWIKATIKSESGF 520 530 >>CCDS3369.1 HGFAC gene_id:3083|Hs108|chr4 (655 aa) initn: 787 init1: 203 opt: 1148 Z-score: 816.6 bits: 161.2 E(32554): 3.7e-39 Smith-Waterman score: 1148; 36.6% identity (61.8% similar) in 503 aa overlap (75-555:162-646) 50 60 70 80 90 100 pF1KE1 SYEDYNQEENTSSTLTHAENPDWYYTEDQADPCQPNPCEHGGDC--LVHGSTFTCSCLAP ::: .:: .::.: ... ::: CCDS33 WCATTHNYDRDRAWGYCVEATPPPGGPAALDPCASGPCLNGGSCSNTQDPQSYHCSCPRA 140 150 160 170 180 190 110 120 130 140 150 pF1KE1 FSGNKCQKVQNTCKDNP------CGRGQCLITQSPPYYRCVCKHPYTGPS-CSQVV-PVC :.:. : . : :. : . :. .: : . : . : . .: CCDS33 FTGKDCG--TEKCFDETRYEYLEGGDRWARVRQGH-VEQCEC---FGGRTWCEGTRHTAC 200 210 220 230 240 160 170 180 190 200 210 pF1KE1 RPNPCQNGATCSRHKRRSKFTCACPDQFKGKFCEIGSDD-CYVGDGYSYRGKMNRTVNQH .:: ::.:: . .:::: : :..:.: :. :..:.: .::: . ... 
CCDS33 LSSPCLNGGTCHLIVATGTTVCACPPGFAGRLCNIEPDERCFLGNGTGYRGVASTSASGL 250 260 270 280 290 300 220 230 240 250 260 270 pF1KE1 ACLYWNSHLLLQENYNMFMEDAETHGIGEHNFCRNPDADEKPWCFIKVTNDKVKWEYCDV .:: ::: :: :: . . : :.: : .::::: ::.:::.. : .. ..:::: . CCDS33 SCLAWNSDLLYQELHVDSVGAAALLGLGPHAYCRNPDNDERPWCYV-VKDSALSWEYCRL 310 320 330 340 350 360 280 290 300 310 320 330 pF1KE1 SACSAQDVAY--PEESPTEPSTKLPGFDSCGKTEIAERKIK-RIYGGFKSTAGKHPWQAS :: . . :. : : :: ..::. . . .. :: :: .: :.::: :. CCDS33 EACESLTRVQLSPDLLATLPEPASPGRQACGRRHKKRTFLRPRIIGGSSSLPGSHPWLAA 370 380 390 400 410 420 340 350 360 370 380 pF1KE1 LQSSLPLTISMPQGHFCGGALIHPCWVLTAAHC-TDIKTRH-LKVVLGDQDLKKEEFHEQ . . ::.:.:.: :::..:::: . : ..::::.. ... : CCDS33 IYIG---------DSFCAGSLVHTCWVVSAAHCFSHSPPRDSVSVVLGQHFFNRTTDVTQ 430 440 450 460 470 390 400 410 420 430 440 pF1KE1 SFRVEKIFKYSHYNERDEIPHNDIALLKLKPVDGHCALESKYVKTVCLPD-GS-FPSGSE .: .:: . :. :. . : :..:..:: .:: .:..:. .:::. :: ::.: . CCDS33 TFGIEKYIPYTLYSVFNPSDH-DLVLIRLKKKGDRCATRSQFVQPICLPEPGSTFPAGHK 480 490 500 510 520 530 450 460 470 480 490 500 pF1KE1 CHISGWG-VTETGKG-SRQLLDAKVKLIANTLCNSRQLYDHMIDDSMICAGNLQKPGQDT :.:.::: . :. .: : .: .: : :.:. :.: ..: :. .:.::: .. . :. CCDS33 CQIAGWGHLDENVSGYSSSLREALVPLVADHKCSSPEVYGADISPNMLCAGYFDCKS-DA 540 550 560 570 580 590 510 520 530 540 550 560 pF1KE1 CQGDSGGPLTCEKDGTYYVYGIVSWGLECGK--RPGVYTQVTKFLNWIKATIKSESGF :::::::::.:::.:. :.:::.::: ::. .:::::.:.....::. :. CCDS33 CQGDSGGPLACEKNGVAYLYGIISWGDGCGRLHKPGVYTRVANYVDWINDRIRPPRRLVA 600 610 620 630 640 650 CCDS33 PS >>CCDS75098.1 HGFAC gene_id:3083|Hs108|chr4 (662 aa) initn: 718 init1: 237 opt: 1100 Z-score: 783.2 bits: 155.0 E(32554): 2.6e-37 Smith-Waterman score: 1137; 36.3% identity (61.8% similar) in 510 aa overlap (75-555:162-653) 50 60 70 80 90 100 pF1KE1 SYEDYNQEENTSSTLTHAENPDWYYTEDQADPCQPNPCEHGGDC--LVHGSTFTCSCLAP ::: .:: .::.: ... 
::: CCDS75 WCATTHNYDRDRAWGYCVEATPPPGGPAALDPCASGPCLNGGSCSNTQDPQSYHCSCPRA 140 150 160 170 180 190 110 120 130 140 150 pF1KE1 FSGNKCQKVQNTCKDNP------CGRGQCLITQSPPYYRCVCKHPYTGPS-CSQVV-PVC :.:. : . : :. : . :. .: : . : . : . .: CCDS75 FTGKDCG--TEKCFDETRYEYLEGGDRWARVRQGH-VEQCEC---FGGRTWCEGTRHTAC 200 210 220 230 240 160 170 180 190 200 210 pF1KE1 RPNPCQNGATCSRHKRRSKFTCACPDQFKGKFCEIGSDD-CYVGDGYSYRGKMNRTVNQH .:: ::.:: . .:::: : :..:.: :. :..:.: .::: . ... CCDS75 LSSPCLNGGTCHLIVATGTTVCACPPGFAGRLCNIEPDERCFLGNGTGYRGVASTSASGL 250 260 270 280 290 300 220 230 240 250 260 270 pF1KE1 ACLYWNSHLLLQENYNMFMEDAETHGIGEHNFCRNPDADEKPWCFIKVTNDKVKWEYCDV .:: ::: :: :: . . : :.: : .::::: ::.:::.. : .. ..:::: . CCDS75 SCLAWNSDLLYQELHVDSVGAAALLGLGPHAYCRNPDNDERPWCYV-VKDSALSWEYCRL 310 320 330 340 350 360 280 290 300 310 320 pF1KE1 SACSAQ-----DVAYPEESP----TEPSTKLPGFDSCGKTEIAERKIK-RIYGGFKSTAG ::. . ... . :: : : :: ..::. . . .. :: :: .: : CCDS75 EACDLETEGRESLTRVQLSPDLLATLPEPASPGRQACGRRHKKRTFLRPRIIGGSSSLPG 370 380 390 400 410 420 330 340 350 360 370 380 pF1KE1 KHPWQASLQSSLPLTISMPQGHFCGGALIHPCWVLTAAHC-TDIKTRH-LKVVLGDQDLK .::: :.. . ::.:.:.: :::..:::: . : ..::::.. .. CCDS75 SHPWLAAIYIG---------DSFCAGSLVHTCWVVSAAHCFSHSPPRDSVSVVLGQHFFN 430 440 450 460 470 390 400 410 420 430 440 pF1KE1 KEEFHEQSFRVEKIFKYSHYNERDEIPHNDIALLKLKPVDGHCALESKYVKTVCLPD-GS . :.: .:: . :. :. . : :..:..:: .:: .:..:. .:::. :: CCDS75 RTTDVTQTFGIEKYIPYTLYSVFNPSDH-DLVLIRLKKKGDRCATRSQFVQPICLPEPGS 480 490 500 510 520 530 450 460 470 480 490 pF1KE1 -FPSGSECHISGWG-VTETGKG-SRQLLDAKVKLIANTLCNSRQLYDHMIDDSMICAGNL ::.: .:.:.::: . :. .: : .: .: : :.:. :.: ..: :. .:.::: . CCDS75 TFPAGHKCQIAGWGHLDENVSGYSSSLREALVPLVADHKCSSPEVYGADISPNMLCAGYF 540 550 560 570 580 590 500 510 520 530 540 550 pF1KE1 QKPGQDTCQGDSGGPLTCEKDGTYYVYGIVSWGLECGK--RPGVYTQVTKFLNWIKATIK . . :.:::::::::.:::.:. :.:::.::: ::. .:::::.:.....::. :. 
CCDS75 DCKS-DACQGDSGGPLACEKNGVAYLYGIISWGDGCGRLHKPGVYTRVANYVDWINDRIR 600 610 620 630 640 650 560 pF1KE1 SESGF CCDS75 PPRRLVAPS 660 >>CCDS34120.1 KLKB1 gene_id:3818|Hs108|chr4 (638 aa) initn: 395 init1: 158 opt: 538 Z-score: 393.3 bits: 82.8 E(32554): 1.4e-15 Smith-Waterman score: 550; 33.1% identity (60.9% similar) in 320 aa overlap (252-559:342-630) 230 240 250 260 270 280 pF1KE1 LLLQENYNMFMEDAETHGIGEHNFCRNPDADEKPWCFIKVTNDKVKWEYCDVSACSAQDV .:: ::.... : : . CCDS34 VKGVNVCQETCTKMIRCQFFTYSLLPEDCKEEKCKCFLRLSMDG-----------SPTRI 320 330 340 350 360 290 300 310 320 330 340 pF1KE1 AYPEESPTEPSTKLPGFDSCGKTEIAERKIK-RIYGGFKSTAGKHPWQASLQSSLPLTIS :: .. . : .: . : . . : . :: :: .:. :. :::.::: .: CCDS34 AYGTQGSSGYSLRL---CNTGDNSVCTTKTSTRIVGGTNSSWGEWPWQVSLQVKLT---- 370 380 390 400 410 350 360 370 380 390 pF1KE1 MPQGHFCGGALIHPCWVLTAAHCTD---IKT--RHLKVVLGDQDLKKEEFHEQSFRVEKI : :.:::.:: :::::::: : .. : . .:. .:. :. : ...: CCDS34 -AQRHLCGGSLIGHQWVLTAAHCFDGLPLQDVWRIYSGILNLSDITKDTPFSQ---IKEI 420 430 440 450 460 400 410 420 430 440 450 pF1KE1 FKYSHYNERDEIPHNDIALLKLK-PVDGHCALESKYVKTVCLPDGSFPSG--SECHISGW . ...:. . ..::::.::. :.. ... : .:::. . : ..: ..:: CCDS34 IIHQNYKVSE--GNHDIALIKLQAPLN-----YTEFQKPICLPSKGDTSTIYTNCWVTGW 470 480 490 500 510 520 460 470 480 490 500 510 pF1KE1 GVT-ETGKGSRQLLDAKVKLIANTLCNSRQLYDHMIDDSMICAGNLQKPGQDTCQGDSGG : . : :. . : ... :..: :..: :. : . :.::: .. :.:.:.::::: CCDS34 GFSKEKGEIQNILQKVNIPLVTNEECQKRY-QDYKITQRMVCAG-YKEGGKDACKGDSGG 530 540 550 560 570 580 520 530 540 550 560 pF1KE1 PLTCEKDGTYYVYGIVSWGLECGKR--PGVYTQVTKFLNWIKATIKSESGF ::.:...: . . 
::.::: :..: :::::.:.....:: .: .: CCDS34 PLVCKHNGMWRLVGITSWGEGCARREQPGVYTKVAEYMDWILEKTQSSDGKAQMQSPA 590 600 610 620 630 >>CCDS58790.1 TMPRSS3 gene_id:64699|Hs108|chr21 (453 aa) initn: 504 init1: 158 opt: 536 Z-score: 393.3 bits: 82.3 E(32554): 1.4e-15 Smith-Waterman score: 569; 40.7% identity (64.8% similar) in 253 aa overlap (313-557:216-450) 290 300 310 320 330 340 pF1KE1 YPEESPTEPSTKLPGFDSCGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMP :: :: : .. ::::::: CCDS58 HSVYVREGCASGHVVTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQF--------- 190 200 210 220 230 350 360 370 380 390 pF1KE1 QG-HFCGGALIHPCWVLTAAHCT-DIKT-RHLKVVLGDQDLKKEEFHEQSFRVEKIFKYS :: :.:::..: : :..:::::. :. . . .: .: . : :::: .: CCDS58 QGYHLCGGSVITPLWIITAAHCVYDLYLPKSWTIQVGLVSLLDNP--APSHLVEKIVYHS 240 250 260 270 280 290 400 410 420 430 440 450 pF1KE1 HYNERDEIPHNDIALLKLKPVDGHCALESKYVKTVCLPDG--SFPSGSECHISGWGVTET .:. . :::::.:: : .. ..... ::::.. .::.:. : ::::.:: CCDS58 KYKPKRL--GNDIALMKLA---GPLTF-NEMIQPVCLPNSEENFPDGKVCWTSGWGATED 300 310 320 330 340 460 470 480 490 500 510 pF1KE1 GKGSRQLLD-AKVKLIANTLCNSRQLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPLTCE : . .:. : : ::.: .:: :..: .:. ::.::: : : :.:::::::::.:. CCDS58 GGDASPVLNHAAVPLISNKICNHRDVYGGIISPSMLCAGYLTG-GVDSCQGDSGGPLVCQ 350 360 370 380 390 400 520 530 540 550 560 pF1KE1 KDGTYYVYGIVSWGLECGK--RPGVYTQVTKFLNWIKATIKSESGF . . . : .:.:. :.. .:::::.::.::.::. .. . CCDS58 ERRLWKLVGATSFGIGCAEVNKPGVYTRVTSFLDWIHEQMERDLKT 410 420 430 440 450 >>CCDS13686.1 TMPRSS3 gene_id:64699|Hs108|chr21 (454 aa) initn: 497 init1: 114 opt: 536 Z-score: 393.3 bits: 82.3 E(32554): 1.4e-15 Smith-Waterman score: 569; 41.3% identity (64.2% similar) in 254 aa overlap (313-557:216-451) 290 300 310 320 330 340 pF1KE1 YPEESPTEPSTKLPGFDSCGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMP :: :: : .. ::::::: CCDS13 HSVYVREGCASGHVVTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQF--------- 190 200 210 220 230 350 360 370 380 390 pF1KE1 QG-HFCGGALIHPCWVLTAAHCT-DIKT-RHLKVVLGDQDLKKEEFHEQSFRVEKIFKYS :: :.:::..: : :..:::::. :. . . .: .: . 
: :::: .: CCDS13 QGYHLCGGSVITPLWIITAAHCVYDLYLPKSWTIQVGLVSLLDNP--APSHLVEKIVYHS 240 250 260 270 280 290 400 410 420 430 440 450 pF1KE1 HYNERDEIPHNDIALLKLKPVDGHCALESKYVKTVCLPDG--SFPSGSECHISGWGVTET .:. . :::::.:: : .. ..... ::::.. .::.:. : ::::.:: CCDS13 KYKPKRL--GNDIALMKLA---GPLTF-NEMIQPVCLPNSEENFPDGKVCWTSGWGATED 300 310 320 330 340 460 470 480 490 500 510 pF1KE1 GKG--SRQLLDAKVKLIANTLCNSRQLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPLTC : : : : : : ::.: .:: :..: .:. ::.::: : : :.:::::::::.: CCDS13 GAGDASPVLNHAAVPLISNKICNHRDVYGGIISPSMLCAGYLTG-GVDSCQGDSGGPLVC 350 360 370 380 390 400 520 530 540 550 560 pF1KE1 EKDGTYYVYGIVSWGLECGK--RPGVYTQVTKFLNWIKATIKSESGF .. . . : .:.:. :.. .:::::.::.::.::. .. . CCDS13 QERRLWKLVGATSFGIGCAEVNKPGVYTRVTSFLDWIHEQMERDLKT 410 420 430 440 450 >>CCDS33564.1 TMPRSS2 gene_id:7113|Hs108|chr21 (492 aa) initn: 377 init1: 173 opt: 530 Z-score: 388.8 bits: 81.6 E(32554): 2.5e-15 Smith-Waterman score: 572; 32.6% identity (55.2% similar) in 420 aa overlap (159-557:113-491) 130 140 150 160 170 180 pF1KE1 QSPPYYRCVCKHPYTGPSCSQVVPVCRPNPCQN-GATCSRHKRRSKFTCACPDQFKGKFC :.: : :. :. :: :... : CCDS33 KALCITLTLGTFLVGAALAAGLLWKFMGSKCSNSGIECD-----SSGTCINPSNW----C 90 100 110 120 130 190 200 210 220 230 240 pF1KE1 EIGSDDCYVGDGYSYRGKM---NRTVNQHACLYWNSHLLLQENYNMFMEDAETHGIG-EH . : . : :. . .. : .. .. . : . :...: . : . .: .. CCDS33 D-GVSHCPGGEDENRCVRLYGPNFILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGYKN 140 150 160 170 180 190 250 260 270 280 290 pF1KE1 NFCRNPDA--DEKPWCFIKVT----NDKVKWEYCDVSACSAQDVAYPEESPTEPSTKLPG :: . : :.:.. : . . .:::.. :. : . CCDS33 NFYSSQGIVDDSGSTSFMKLNTSAGNVDIYKKLYHSDACSSKAVV---------SLRCI- 200 210 220 230 240 300 310 320 330 340 350 pF1KE1 FDSCGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMPQGHFCGGALIHPCWV .:: . . :. .:: :: .. : :::.::. . . : :::..: : :. CCDS33 --ACGVNLNSSRQ-SRIVGGESALPGAWPWQVSLH--------VQNVHVCGGSIITPEWI 250 260 270 280 290 360 370 380 390 400 410 pF1KE1 LTAAHCTDIKTR---HLKVVLGDQDLKKE-EFHEQSFRVEKIFKYSHYNERDEIPHNDIA .:::::.. : . : :.. :. ...:::.. :: : .. 
.:::: CCDS33 VTAAHCVEKPLNNPWHWTAFAG--ILRQSFMFYGAGYQVEKVI--SHPNYDSKTKNNDIA 300 310 320 330 340 420 430 440 450 460 pF1KE1 LLKL-KPVDGHCALESKYVKTVCLPDGSFPSGSE--CHISGWGVTET-GKGSRQLLDAKV :.:: ::. . :: ::::. .. : : :::::.:: :: :. : ::: CCDS33 LMKLQKPL-----TFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEKGKTSEVLNAAKV 350 360 370 380 390 400 470 480 490 500 510 520 pF1KE1 KLIANTLCNSRQLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPLTCEKDGTYYVYGIVSW :: . :::: .::..: .::::: :: . :.:::::::::. :.. ... : .:: CCDS33 LLIETQRCNSRYVYDNLITPAMICAGFLQG-NVDSCQGDSGGPLVTSKNNIWWLIGDTSW 410 420 430 440 450 460 530 540 550 560 pF1KE1 GLECGK--RPGVYTQVTKFLNWIKATIKSESGF : :.: ::::: .: : .:: .... CCDS33 GSGCAKAYRPGVYGNVMVFTDWIYRQMRADG 470 480 490 >>CCDS54486.1 TMPRSS2 gene_id:7113|Hs108|chr21 (529 aa) initn: 377 init1: 173 opt: 530 Z-score: 388.5 bits: 81.6 E(32554): 2.5e-15 Smith-Waterman score: 572; 32.6% identity (55.2% similar) in 420 aa overlap (159-557:150-528) 130 140 150 160 170 180 pF1KE1 QSPPYYRCVCKHPYTGPSCSQVVPVCRPNPCQN-GATCSRHKRRSKFTCACPDQFKGKFC :.: : :. :. :: :... : CCDS54 KALCITLTLGTFLVGAALAAGLLWKFMGSKCSNSGIECD-----SSGTCINPSNW----C 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE1 EIGSDDCYVGDGYSYRGKM---NRTVNQHACLYWNSHLLLQENYNMFMEDAETHGIG-EH . : . : :. . .. : .. .. . : . :...: . : . .: .. CCDS54 D-GVSHCPGGEDENRCVRLYGPNFILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGYKN 180 190 200 210 220 250 260 270 280 290 pF1KE1 NFCRNPDA--DEKPWCFIKVT----NDKVKWEYCDVSACSAQDVAYPEESPTEPSTKLPG :: . : :.:.. : . . .:::.. :. : . CCDS54 NFYSSQGIVDDSGSTSFMKLNTSAGNVDIYKKLYHSDACSSKAVV---------SLRCI- 230 240 250 260 270 300 310 320 330 340 350 pF1KE1 FDSCGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMPQGHFCGGALIHPCWV .:: . . :. .:: :: .. : :::.::. . . : :::..: : :. CCDS54 --ACGVNLNSSRQ-SRIVGGESALPGAWPWQVSLH--------VQNVHVCGGSIITPEWI 280 290 300 310 320 360 370 380 390 400 410 pF1KE1 LTAAHCTDIKTR---HLKVVLGDQDLKKE-EFHEQSFRVEKIFKYSHYNERDEIPHNDIA .:::::.. : . : :.. :. ...:::.. :: : .. 
.:::: CCDS54 VTAAHCVEKPLNNPWHWTAFAG--ILRQSFMFYGAGYQVEKVI--SHPNYDSKTKNNDIA 330 340 350 360 370 380 420 430 440 450 460 pF1KE1 LLKL-KPVDGHCALESKYVKTVCLPDGSFPSGSE--CHISGWGVTET-GKGSRQLLDAKV :.:: ::. . :: ::::. .. : : :::::.:: :: :. : ::: CCDS54 LMKLQKPL-----TFNDLVKPVCLPNPGMMLQPEQLCWISGWGATEEKGKTSEVLNAAKV 390 400 410 420 430 470 480 490 500 510 520 pF1KE1 KLIANTLCNSRQLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPLTCEKDGTYYVYGIVSW :: . :::: .::..: .::::: :: . :.:::::::::. :.. ... : .:: CCDS54 LLIETQRCNSRYVYDNLITPAMICAGFLQG-NVDSCQGDSGGPLVTSKNNIWWLIGDTSW 440 450 460 470 480 490 530 540 550 560 pF1KE1 GLECGK--RPGVYTQVTKFLNWIKATIKSESGF : :.: ::::: .: : .:: .... CCDS54 GSGCAKAYRPGVYGNVMVFTDWIYRQMRADG 500 510 520 >>CCDS74856.1 TMPRSS6 gene_id:164656|Hs108|chr22 (802 aa) initn: 383 init1: 107 opt: 532 Z-score: 388.3 bits: 82.2 E(32554): 2.6e-15 Smith-Waterman score: 532; 40.8% identity (62.0% similar) in 255 aa overlap (313-554:567-801) 290 300 310 320 330 340 pF1KE1 YPEESPTEPSTKLPGFDSCGKTEIAERKIKRIYGGFKSTAGKHPWQASLQSSLPLTISMP :: :: :. :. ::::::: CCDS74 KPNPQCDGRPDCRDGSDEEHCDCGLQGPSSRIVGGAVSSEGEWPWQASLQV--------- 540 550 560 570 580 350 360 370 380 390 pF1KE1 QG-HFCGGALIHPCWVLTAAHCTD----IKTRHLKVVLGDQDLKKEEFHEQSFRVEKIFK .: :.:::::: ::.::::: . .: : :: ... : ::.: ... CCDS74 RGRHICGGALIADRWVITAAHCFQEDSMASTVLWTVFLGKVWQNSRWPGEVSFKVSRLLL 590 600 610 620 630 640 400 410 420 430 440 450 pF1KE1 YSHYNERDEIPHN-DIALLKLKPVDGHCALESKYVKTVCLPDGS--FPSGSECHISGWG- . :.:.: :. :.:::.: : ...: :. :::: : : : .: :.::: CCDS74 HP-YHEED--SHDYDVALLQL----DHPVVRSAAVRPVCLPARSHFFEPGLHCWITGWGA 650 660 670 680 690 700 460 470 480 490 500 510 pF1KE1 VTETGKGSRQLLDAKVKLIANTLCNSRQLYDHMIDDSMICAGNLQKPGQDTCQGDSGGPL . : : : : . :.:: . ::. ..: ... :.::: .: .:.::::::::: CCDS74 LREGGPISNALQKVDVQLIPQDLCS--EVYRYQVTPRMLCAG-YRKGKKDACQGDSGGPL 710 720 730 740 750 520 530 540 550 560 pF1KE1 TCEK-DGTYYVYGIVSWGLECGKRP---GVYTQVTKFLNWIKATIKSESGF .:. .: ... :.::::: :: :: ::::..: ..::. .. 
CCDS74 VCKALSGRWFLAGLVSWGLGCG-RPNYFGVYTRITGVISWIQQVVT
       760       770        780       790       800


560 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 22:27:00 2016 done: Sun Nov 6 22:27:01 2016
 Total Scan time: 2.390 Total Display time: 0.080

Function used was FASTA [36.3.4 Apr, 2011]