FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2007, 583 aa 1>>>pF1KE2007 583 - 583 aa - 583 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2293+/-0.00124; mu= 15.2771+/- 0.074 mean_var=133.1308+/-26.957, 0's: 0 Z-trim(105.9): 201 B-trim: 2 in 1/50 Lambda= 0.111157 statistics sampled from 8484 (8703) to 8484 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.621), E-opt: 0.2 (0.267), width: 16 Scan time: 3.070 The best scores are: opt bits E(32554) CCDS34049.1 CFI gene_id:3426|Hs108|chr4 ( 583) 4168 680.7 1.4e-195 CCDS82946.1 CFI gene_id:3426|Hs108|chr4 ( 591) 4142 676.5 2.5e-194 CCDS82945.1 CFI gene_id:3426|Hs108|chr4 ( 576) 4106 670.8 1.3e-192 CCDS43129.2 TMPRSS7 gene_id:344805|Hs108|chr3 ( 717) 698 124.3 5.1e-28 CCDS74856.1 TMPRSS6 gene_id:164656|Hs108|chr22 ( 802) 660 118.3 3.8e-26 CCDS13941.1 TMPRSS6 gene_id:164656|Hs108|chr22 ( 811) 660 118.3 3.8e-26 CCDS3518.1 TMPRSS11D gene_id:9407|Hs108|chr4 ( 418) 605 109.2 1.1e-23 CCDS32993.1 HPN gene_id:3249|Hs108|chr19 ( 417) 536 98.1 2.3e-20 CCDS34120.1 KLKB1 gene_id:3818|Hs108|chr4 ( 638) 537 98.5 2.8e-20 CCDS33564.1 TMPRSS2 gene_id:7113|Hs108|chr21 ( 492) 533 97.7 3.6e-20 CCDS54486.1 TMPRSS2 gene_id:7113|Hs108|chr21 ( 529) 533 97.7 3.8e-20 CCDS3520.1 TMPRSS11F gene_id:389208|Hs108|chr4 ( 438) 526 96.5 7.3e-20 CCDS3847.1 F11 gene_id:2160|Hs108|chr4 ( 625) 528 97.0 7.5e-20 CCDS73391.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 413) 524 96.2 8.7e-20 CCDS33993.1 TMPRSS11E gene_id:28983|Hs108|chr4 ( 423) 524 96.2 8.9e-20 CCDS73392.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 448) 524 96.2 9.3e-20 CCDS44735.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 457) 524 96.2 9.4e-20 CCDS3521.1 TMPRSS11B gene_id:132724|Hs108|chr4 ( 416) 520 95.5 1.4e-19 CCDS5976.1 PRSS55 gene_id:203074|Hs108|chr8 ( 352) 517 95.0 1.7e-19 CCDS44881.1 TMPRSS12 gene_id:283471|Hs108|chr12 ( 348) 516 94.8 1.9e-19 CCDS3369.1 HGFAC gene_id:3083|Hs108|chr4 ( 655) 517 95.3 2.6e-19 CCDS75098.1 HGFAC gene_id:3083|Hs108|chr4 ( 662) 517 95.3 2.6e-19 CCDS12088.1 TMPRSS9 gene_id:360200|Hs108|chr19 (1059) 513 94.8 5.7e-19 CCDS8487.1 ST14 gene_id:6768|Hs108|chr11 ( 855) 509 94.1 7.7e-19 CCDS47065.1 TMPRSS11A gene_id:339967|Hs108|chr4 ( 418) 503 92.8 9.1e-19 CCDS3519.1 TMPRSS11A gene_id:339967|Hs108|chr4 ( 421) 503 92.8 9.2e-19 CCDS74669.1 PRSS56 gene_id:646960|Hs108|chr2 ( 603) 495 91.7 2.9e-18 CCDS14666.1 F9 gene_id:2158|Hs108|chrX ( 461) 489 90.6 4.6e-18 CCDS55788.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 532) 488 90.5 5.7e-18 CCDS58185.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 563) 488 90.5 5.9e-18 CCDS41721.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 567) 488 90.5 6e-18 CCDS10430.1 TPSG1 gene_id:25823|Hs108|chr16 ( 321) 484 89.7 6.3e-18 CCDS156.1 CTRC gene_id:11330|Hs108|chr1 ( 268) 482 89.3 6.9e-18 CCDS13571.1 TMPRSS15 gene_id:5651|Hs108|chr21 (1019) 488 90.8 8.9e-18 CCDS34302.1 F12 gene_id:2161|Hs108|chr5 ( 615) 479 89.1 1.7e-17 CCDS2145.1 PROC gene_id:5624|Hs108|chr2 ( 461) 475 88.4 2.2e-17 CCDS45469.1 PRSS8 gene_id:5652|Hs108|chr16 ( 343) 472 87.8 2.5e-17 CCDS5279.1 PLG gene_id:5340|Hs108|chr6 ( 810) 474 88.5 3.6e-17 CCDS83495.1 F9 gene_id:2158|Hs108|chrX ( 423) 469 87.4 4e-17 CCDS58790.1 TMPRSS3 gene_id:64699|Hs108|chr21 ( 453) 465 86.8 6.6e-17 CCDS13686.1 TMPRSS3 gene_id:64699|Hs108|chr21 ( 454) 465 86.8 6.6e-17 CCDS157.1 CELA2A gene_id:63036|Hs108|chr1 ( 269) 461 85.9 7.1e-17 CCDS53579.1 HABP2 gene_id:3026|Hs108|chr10 ( 534) 463 86.5 9.2e-17 CCDS7577.1 HABP2 gene_id:3026|Hs108|chr10 ( 560) 463 86.5 9.5e-17 CCDS47145.1 PRSS48 gene_id:345062|Hs108|chr4 ( 328) 452 84.5 2.2e-16 CCDS10852.1 CTRL gene_id:1506|Hs108|chr16 ( 264) 449 84.0 2.7e-16 CCDS14101.1 ACR gene_id:49|Hs108|chr22 ( 421) 442 83.0 8.1e-16 CCDS3709.1 PRSS12 gene_id:8492|Hs108|chr4 ( 875) 446 84.0 8.6e-16 CCDS12808.1 KLK2 gene_id:3817|Hs108|chr19 ( 261) 437 82.0 1e-15 CCDS9530.1 F10 gene_id:2159|Hs108|chr13 ( 488) 439 82.6 1.2e-15 >>CCDS34049.1 CFI gene_id:3426|Hs108|chr4 (583 aa) initn: 4168 init1: 4168 opt: 4168 Z-score: 3625.0 bits: 680.7 E(32554): 1.4e-195 Smith-Waterman score: 4168; 99.8% identity (100.0% similar) in 583 aa overlap (1-583:1-583) 10 20 30 40 50 60 pF1KE2 MKLLHVFLLFLCFHLRFCKVTYTSQEDLVEKKCLAKKYTHLSCDKVFCQPWQRCIEGTCV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MKLLHVFLLFLCFHLRFCKVTYTSQEDLVEKKCLAKKYTHLSCDKVFCQPWQRCIEGTCV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 CKLPYQCPKNGTAVCATNRRSFPTYCQQKSLECLHPGTKFLNNGTCTAEGKFSVSLKHGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 CKLPYQCPKNGTAVCATNRRSFPTYCQQKSLECLHPGTKFLNNGTCTAEGKFSVSLKHGN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 TDSEGIVEVKLVDQDKTMFICKSSWSMREANVACLDLGFQQGADTQRRFKLSDLSINSTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 TDSEGIVEVKLVDQDKTMFICKSSWSMREANVACLDLGFQQGADTQRRFKLSDLSINSTE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 CLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 CLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 CDGINDCGDQSDELCCKACQGKGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCAGFASVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::. CCDS34 CDGINDCGDQSDELCCKACQGKGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCAGFASVT 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 QEETEILTADMDAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQVAIKDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 QEETEILTADMDAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQVAIKDA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 SGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEYVDRIIFHEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 SGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEYVDRIIFHEN 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 YNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVSGWGREKDNER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 YNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVSGWGREKDNER 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE2 VFSLQWGEVKLISNCSKFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCMDANNVTYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 VFSLQWGEVKLISNCSKFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCMDANNVTYV 490 500 510 520 530 540 550 560 570 580 pF1KE2 WGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV ::::::::::::::::::::::::::::::::::::::::::: CCDS34 WGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV 550 560 570 580 >>CCDS82946.1 CFI gene_id:3426|Hs108|chr4 (591 aa) initn: 4154 init1: 2121 opt: 4142 Z-score: 3602.4 bits: 676.5 E(32554): 2.5e-194 Smith-Waterman score: 4142; 98.5% identity (98.6% similar) in 591 aa overlap (1-583:1-591) 10 20 30 40 50 60 pF1KE2 MKLLHVFLLFLCFHLRFCKVTYTSQEDLVEKKCLAKKYTHLSCDKVFCQPWQRCIEGTCV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 MKLLHVFLLFLCFHLRFCKVTYTSQEDLVEKKCLAKKYTHLSCDKVFCQPWQRCIEGTCV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 CKLPYQCPKNGTAVCATNRRSFPTYCQQKSLECLHPGTKFLNNGTCTAEGKFSVSLKHGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 CKLPYQCPKNGTAVCATNRRSFPTYCQQKSLECLHPGTKFLNNGTCTAEGKFSVSLKHGN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 TDSEGIVEVKLVDQDKTMFICKSSWSMREANVACLDLGFQQGADTQRRFKLSDLSINSTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 TDSEGIVEVKLVDQDKTMFICKSSWSMREANVACLDLGFQQGADTQRRFKLSDLSINSTE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 CLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 CLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKA 190 200 210 220 230 240 250 260 270 280 290 pF1KE2 CDGINDCGDQSDELCCKACQGKGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCA------ :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 CDGINDCGDQSDELCCKACQGKGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCAAARHPT 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE2 --GFASVAQEETEILTADMDAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLP :::::.:::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 IQGFASVTQEETEILTADMDAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLP 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE2 WQVAIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 WQVAIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEYV 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE2 DRIIFHENYNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVSGW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 DRIIFHENYNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVSGW 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE2 GREKDNERVFSLQWGEVKLISNCSKFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 GREKDNERVFSLQWGEVKLISNCSKFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCM 490 500 510 520 530 540 540 550 560 570 580 pF1KE2 DANNVTYVWGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 DANNVTYVWGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV 550 560 570 580 590 >>CCDS82945.1 CFI gene_id:3426|Hs108|chr4 (576 aa) initn: 4116 init1: 2121 opt: 4106 Z-score: 3571.3 bits: 670.8 E(32554): 1.3e-192 Smith-Waterman score: 4106; 98.8% identity (98.8% similar) in 583 aa overlap (1-583:1-576) 10 20 30 40 50 60 pF1KE2 MKLLHVFLLFLCFHLRFCKVTYTSQEDLVEKKCLAKKYTHLSCDKVFCQPWQRCIEGTCV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 MKLLHVFLLFLCFHLRFCKVTYTSQEDLVEKKCLAKKYTHLSCDKVFCQPWQRCIEGTCV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 CKLPYQCPKNGTAVCATNRRSFPTYCQQKSLECLHPGTKFLNNGTCTAEGKFSVSLKHGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 CKLPYQCPKNGTAVCATNRRSFPTYCQQKSLECLHPGTKFLNNGTCTAEGKFSVSLKHGN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 TDSEGIVEVKLVDQDKTMFICKSSWSMREANVACLDLGFQQGADTQRRFKLSDLSINSTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 TDSEGIVEVKLVDQDKTMFICKSSWSMREANVACLDLGFQQGADTQRRFKLSDLSINSTE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 CLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 CLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 CDGINDCGDQSDELCCKACQGKGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCAGFASVA :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 CDGINDCGDQSDELCCKACQGKGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCA------ 250 260 270 280 290 310 320 330 340 350 360 pF1KE2 QEETEILTADMDAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQVAIKDA ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 -EETEILTADMDAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQVAIKDA 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE2 SGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEYVDRIIFHEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 SGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEYVDRIIFHEN 360 370 380 390 400 410 430 440 450 460 470 480 pF1KE2 YNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVSGWGREKDNER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 YNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVSGWGREKDNER 420 430 440 450 460 470 490 500 510 520 530 540 pF1KE2 VFSLQWGEVKLISNCSKFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCMDANNVTYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 VFSLQWGEVKLISNCSKFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCMDANNVTYV 480 490 500 510 520 530 550 560 570 580 pF1KE2 WGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV ::::::::::::::::::::::::::::::::::::::::::: CCDS82 WGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV 540 550 560 570 >>CCDS43129.2 TMPRSS7 gene_id:344805|Hs108|chr3 (717 aa) initn: 579 init1: 185 opt: 698 Z-score: 616.5 bits: 124.3 E(32554): 5.1e-28 Smith-Waterman score: 698; 33.2% identity (59.8% similar) in 383 aa overlap (209-573:346-713) 180 190 200 210 220 230 pF1KE2 TECLHVHCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQM .:. :. . :. .: .: .: . : CCDS43 YMDHQTIFRVPSPLVHIQLQCSSRLSDKPLLAEYGSYNISQPCPVGSF-RCSSGLCVPQA 320 330 340 350 360 370 240 250 260 270 280 290 pF1KE2 KACDGINDCGDQSDELCC----KACQGKGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCA . :::.::: :.:::: : ::. ..:. . : : :.: :: .:.:: .:. CCDS43 QRCDGVNDCFDESDELFCVSPQPACNTSSFR-QHGPLI-----CDGFRDCENGRDEQNCT 380 390 400 410 420 300 310 320 330 340 pF1KE2 GFASVAQEETEILTADMDAERRRIKSLLPKLSC-------GVK-NRMHIRRKRIVGGKRA :. .. . .. :.. . ..: : .: .::.:: . CCDS43 --QSIPCNNRTFKCGNDICFRKQNAKCDGTVDCPDGSDEEGCTCSRSSSALHRIIGGTDT 430 440 450 460 470 480 350 360 370 380 390 400 pF1KE2 QLGDLPWQVAIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVD-WIHPDLK : ::::... ... ::. :. :.:.::::..... ::. . ... . : CCDS43 LEGGWPWQVSLHFVGSAYCGASVISREWLLSAAHCFHGNRLSDPTPWTAHLGMYVQGNAK 490 500 510 520 530 540 410 420 430 440 450 460 pF1KE2 RIVIEYVDRIIFHENYNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPND . : ::. :: ::. :.. ::::.... . .: . : :.: . . .. CCDS43 --FVSPVRRIVVHEYYNSQTFDYDIALLQLSIAWPETLKQLIQ--PICIPPTGQRVRSGE 550 560 570 580 590 600 470 480 490 500 510 520 pF1KE2 TCIVSGWGR--EKDNERVFSLQWGEVKLISN--CSKFYGNRFYEKEMECAGTYDGSIDAC : :.:::: : ::. . :: .::.::.. : . :: . ..: ::: ..:. ::: CCDS43 KCWVTGWGRRHEADNKGSLVLQQAEVELIDQTLCVSTYG--IITSRMLCAGIMSGKRDAC 610 620 630 640 650 660 530 540 550 560 570 580 pF1KE2 KGDSGGPLVCMDANNVTYVW-GVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQ :::::::: : .. .. :.::::.. :.:.::::::.:.:. :: .: CCDS43 KGDSGGPLSCRRKSDGKWILTGIVSWGHGSGRPNFPGVYTRVSNFVPWIHKYVPSLL 670 680 690 700 710 pF1KE2 YNV >>CCDS74856.1 TMPRSS6 gene_id:164656|Hs108|chr22 (802 aa) initn: 395 init1: 272 opt: 660 Z-score: 583.0 bits: 118.3 E(32554): 3.8e-26 Smith-Waterman score: 660; 31.4% identity (59.2% similar) in 373 aa overlap (215-569:443-797) 190 200 210 220 230 240 pF1KE2 HCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKACDGI :.:. : .:. ::: . ::::. CCDS74 IPVVATAGITINFTSQISLTGPGVRVHYGLYNQSDPCP-GEFLCSVNGLCVP---ACDGV 420 430 440 450 460 250 260 270 280 290 pF1KE2 NDCGDQSDELCCKACQGKGFHCK-SGVCIPSQYQCNGEVDCITGEDE------VGCAGFA .:: . :: : .:.. :.:: ...:: :.:. ::..: :: : :. :. CCDS74 KDCPNGLDERNC-VCRAT-FQCKEDSTCISLPKVCDGQPDCLNGSDEEQCQEGVPCGTFT 470 480 490 500 510 520 300 310 320 330 340 350 pF1KE2 SVAQEETEILTADMDAERR---RIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQ .... . . . . : : : . .::... .::::: .. :. ::: CCDS74 FQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCDCGLQG----PSSRIVGGAVSSEGEWPWQ 530 540 550 560 570 580 360 370 380 390 400 410 pF1KE2 VAIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVD--WIHPDLKRIVIEYV .... . ::: :. :..:::::.. .. .::. . : . : : CCDS74 ASLQVRGRHICGGALIADRWVITAAHCFQEDSMASTVLWTVFLGKVWQNSRWPGEVSFKV 590 600 610 620 630 640 420 430 440 450 460 pF1KE2 DRIIFHENYNAGTYQNDIALIEMKKDGNKKDCELPRSI---PACVPWSPYLFQPNDTCIV .:...: .. ... :.::... : . :: :.:.: ..:.:. : . CCDS74 SRLLLHPYHEEDSHDYDVALLQL-------DHPVVRSAAVRPVCLPARSHFFEPGLHCWI 650 660 670 680 690 470 480 490 500 510 520 pF1KE2 SGWGREKDNERVFS-LQWGEVKLISN--CSKFYGNRFYEKEMECAGTYDGSIDACKGDSG .::: ... . . :: .:.:: . ::. : . . : ::: :. :::.:::: CCDS74 TGWGALREGGPISNALQKVDVQLIPQDLCSEVYRYQVTPR-MLCAGYRKGKKDACQGDSG 700 710 720 730 740 750 530 540 550 560 570 580 pF1KE2 GPLVCMDANNVTYVWGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV ::::: .. .. :.:::: .::.:.. ::::.... ..:: CCDS74 GPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQVVT 760 770 780 790 800 >>CCDS13941.1 TMPRSS6 gene_id:164656|Hs108|chr22 (811 aa) initn: 395 init1: 272 opt: 660 Z-score: 582.9 bits: 118.3 E(32554): 3.8e-26 Smith-Waterman score: 660; 31.4% identity (59.2% similar) in 373 aa overlap (215-569:452-806) 190 200 210 220 230 240 pF1KE2 HCRGLETSLAECTFTKRRTMGYQDFADVVCYTQKADSPMDDFFQCVNGKYISQMKACDGI :.:. : .:. ::: . ::::. CCDS13 IPVVATAGITINFTSQISLTGPGVRVHYGLYNQSDPCP-GEFLCSVNGLCVP---ACDGV 430 440 450 460 470 250 260 270 280 290 pF1KE2 NDCGDQSDELCCKACQGKGFHCK-SGVCIPSQYQCNGEVDCITGEDE------VGCAGFA .:: . :: : .:.. :.:: ...:: :.:. ::..: :: : :. :. CCDS13 KDCPNGLDERNC-VCRAT-FQCKEDSTCISLPKVCDGQPDCLNGSDEEQCQEGVPCGTFT 480 490 500 510 520 530 300 310 320 330 340 350 pF1KE2 SVAQEETEILTADMDAERR---RIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQ .... . . . . : : : . .::... .::::: .. :. ::: CCDS13 FQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCDCGLQG----PSSRIVGGAVSSEGEWPWQ 540 550 560 570 580 590 360 370 380 390 400 410 pF1KE2 VAIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVD--WIHPDLKRIVIEYV .... . ::: :. :..:::::.. .. .::. . : . : : CCDS13 ASLQVRGRHICGGALIADRWVITAAHCFQEDSMASTVLWTVFLGKVWQNSRWPGEVSFKV 600 610 620 630 640 650 420 430 440 450 460 pF1KE2 DRIIFHENYNAGTYQNDIALIEMKKDGNKKDCELPRSI---PACVPWSPYLFQPNDTCIV .:...: .. ... :.::... : . :: :.:.: ..:.:. : . CCDS13 SRLLLHPYHEEDSHDYDVALLQL-------DHPVVRSAAVRPVCLPARSHFFEPGLHCWI 660 670 680 690 700 470 480 490 500 510 520 pF1KE2 SGWGREKDNERVFS-LQWGEVKLISN--CSKFYGNRFYEKEMECAGTYDGSIDACKGDSG .::: ... . . :: .:.:: . ::. : . . : ::: :. :::.:::: CCDS13 TGWGALREGGPISNALQKVDVQLIPQDLCSEVYRYQVTPR-MLCAGYRKGKKDACQGDSG 710 720 730 740 750 760 530 540 550 560 570 580 pF1KE2 GPLVCMDANNVTYVWGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV ::::: .. .. :.:::: .::.:.. ::::.... ..:: CCDS13 GPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQVVT 770 780 790 800 810 >>CCDS3518.1 TMPRSS11D gene_id:9407|Hs108|chr4 (418 aa) initn: 472 init1: 289 opt: 605 Z-score: 538.8 bits: 109.2 E(32554): 1.1e-23 Smith-Waterman score: 605; 38.6% identity (69.3% similar) in 254 aa overlap (327-574:173-417) 300 310 320 330 340 350 pF1KE2 ASVAQEETEILTADMDAERRRIKSLLPKLSCGV-KNRMHIRRKRIVGGKRAQLGDLPWQV ::. . . . ..::.:: .:. :. :::: CCDS35 LNNSGNLEINPSTEITSLTDQAAANWLINECGAGPDLITLSEQRILGGTEAEEGSWPWQV 150 160 170 180 190 200 360 370 380 390 400 410 pF1KE2 AIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEYVDRI ... .. ::: :.. ::::::::.:.... : : :. .. : :. : : CCDS35 SLRLNNAHHCGGSLINNMWILTAAHCFRSNSNPRDWIATSGISTTFPKLRM----RVRNI 210 220 230 240 250 420 430 440 450 460 470 pF1KE2 IFHENYNAGTYQNDIALIEMKKDGN-KKDCELPRSIPACVPWSPYLFQPNDTCIVSGWG- ..:.::...:..:::::...... . :: . :. :.: . . :..: :.::: CCDS35 LIHNNYKSATHENDIALVRLENSVTFTKDIH---SV--CLPAATQNIPPGSTAYVTGWGA 260 270 280 290 300 310 480 490 500 510 520 530 pF1KE2 REKDNERVFSLQWGEVKLISN--CSKFYG-NRFYEKEMECAGTYDGSIDACKGDSGGPLV .: .. : :. :.:..::: :. .. : . : :::. .:..:::.:::::::: CCDS35 QEYAGHTVPELRQGQVRIISNDVCNAPHSYNGAILSGMLCAGVPQGGVDACQGDSGGPLV 320 330 340 350 360 370 540 550 560 570 580 pF1KE2 CMDANNVTYVWGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV :. . .. :.::::..:: :. :::::.:. :.::: ..: CCDS35 QEDSRRLWFIVGIVSWGDQCGLPDKPGVYTRVTAYLDWIRQQTGI 380 390 400 410 >>CCDS32993.1 HPN gene_id:3249|Hs108|chr19 (417 aa) initn: 444 init1: 170 opt: 536 Z-score: 479.0 bits: 98.1 E(32554): 2.3e-20 Smith-Waterman score: 536; 34.4% identity (60.3% similar) in 302 aa overlap (292-569:113-400) 270 280 290 300 310 pF1KE2 KGFHCKSGVCIPSQYQCNGEVDCITGEDEVGCAGFASV-------AQEETEILTADMDAE : .:: : .:. :.... : CCDS32 ARVAGLSCEEMGFLRALTHSELDVRTAGANGTSGFFCVDEGRLPHTQRLLEVISV-CDCP 90 100 110 120 130 140 320 330 340 350 360 370 pF1KE2 RRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQVAIKDASGITCGGIYIGGCW : :. . . . .:: : .. :::::. ..:: ::::... .. ::: ..: : CCDS32 RGRFLAAICQ-DCG---RRKLPVDRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDW 150 160 170 180 190 380 390 400 410 420 pF1KE2 ILTAAHCL--RASKTHRYQIWTTVVDWIHPDLKRIVIEYVDRIIFHENY------NAGTY .::::::. : :..... .: : .. .. : ..: .: :. CCDS32 VLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQLGVQAV---VYHGGYLPFRDPNSEEN 200 210 220 230 240 250 430 440 450 460 470 480 pF1KE2 QNDIALIEMKKDGNKKDCELPRSI-PACVPWSPYLFQPNDTCIVSGWGREKD-NERVFSL .:::::..... : . : :.:.: . . . : :.::: . .... : CCDS32 SNDIALVHLSSP-----LPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQYYGQQAGVL 260 270 280 290 300 490 500 510 520 530 540 pF1KE2 QWGEVKLISN--CS--KFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCMDANNVTYV : ..: .::: :. ::::.. : : ::: .:.::::.::::::.:: :. . : CCDS32 QEARVPIISNDVCNGADFYGNQIKPK-MFCAGYPEGGIDACQGDSGGPFVCEDSISRTPR 310 320 330 340 350 360 550 560 570 580 pF1KE2 W---GVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV : :.:::: .:. . :::::::... .:: CCDS32 WRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQL 370 380 390 400 410 >>CCDS34120.1 KLKB1 gene_id:3818|Hs108|chr4 (638 aa) initn: 459 init1: 149 opt: 537 Z-score: 477.6 bits: 98.5 E(32554): 2.8e-20 Smith-Waterman score: 537; 37.8% identity (64.3% similar) in 238 aa overlap (339-569:390-621) 310 320 330 340 350 360 pF1KE2 ADMDAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDLPWQVAIK---DASGITC ::::: .. :. ::::... :. : CCDS34 IAYGTQGSSGYSLRLCNTGDNSVCTTKTSTRIVGGTNSSWGEWPWQVSLQVKLTAQRHLC 360 370 380 390 400 410 370 380 390 400 410 420 pF1KE2 GGIYIGGCWILTAAHCLRASKTHR-YQIWTTVVDWIHPDLKRIVIEYVDRIIFHENYNAG :: :: :.::::::. . . ..:.. ... . : . . .::.:.::... CCDS34 GGSLIGHQWVLTAAHCFDGLPLQDVWRIYSGILN-LSDITKDTPFSQIKEIIIHQNYKVS 420 430 440 450 460 470 430 440 450 460 470 480 pF1KE2 TYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVSGWGREKDNERVFS- ..:::::... : . . : :.: . .: :.::: :.. .. . CCDS34 EGNHDIALIKLQAPLNYTEFQ----KPICLPSKGDTSTIYTNCWVTGWGFSKEKGEIQNI 480 490 500 510 520 530 490 500 510 520 530 540 pF1KE2 LQWGEVKLISN--CSKFYGNRFYEKEMECAGTYDGSIDACKGDSGGPLVCMDANNVTYVW :: .. :..: :.: : . ..: ::: .:. ::::::::::::: :.. . CCDS34 LQKVNIPLVTNEECQKRYQDYKITQRMVCAGYKEGGKDACKGDSGGPLVC-KHNGMWRLV 540 550 560 570 580 590 550 560 570 580 pF1KE2 GVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV :..::::.:.. : ::::::::.:.::: CCDS34 GITSWGEGCARREQPGVYTKVAEYMDWILEKTQSSDGKAQMQSPA 600 610 620 630 >>CCDS33564.1 TMPRSS2 gene_id:7113|Hs108|chr21 (492 aa) initn: 440 init1: 271 opt: 533 Z-score: 475.5 bits: 97.7 E(32554): 3.6e-20 Smith-Waterman score: 564; 31.6% identity (56.3% similar) in 373 aa overlap (228-569:119-484) 200 210 220 230 240 250 pF1KE2 FTKRRTMGYQDFADVVCYTQKADSPMDDFFQC-VNGKYISQMKACDGINDCGDQSDELCC .: .: :. . :::.. : :: : CCDS33 LTLGTFLVGAALAAGLLWKFMGSKCSNSGIECDSSGTCINPSNWCDGVSHCPGGEDENRC 90 100 110 120 130 140 260 270 280 290 pF1KE2 KACQGKGF-------HCKS--GVCIPSQYQCNGEVDC---------ITGEDEVGCAGFAS : .: . :: :: . . :.. : ... : .: .: CCDS33 VRLYGPNFILQVYSSQRKSWHPVCQDDWNENYGRAACRDMGYKNNFYSSQGIVDDSGSTS 150 160 170 180 190 200 300 310 320 330 340 350 pF1KE2 VAQEETEILTADM-------DAERRRIKSLLPKLSCGVKNRMHIRRKRIVGGKRAQLGDL . .: ..:. :: . : ..::: : :..:::::. : : CCDS33 FMKLNTSAGNVDIYKKLYHSDACSSKAVVSLRCIACGV-NLNSSRQSRIVGGESALPGAW 210 220 230 240 250 260 360 370 380 390 400 410 pF1KE2 PWQVAIKDASGITCGGIYIGGCWILTAAHCLRASKTHRYQIWTTVVDWIHPDLKRIVIEY ::::... . .::: : ::.:::::.. .. .. ::. . .. .. : CCDS33 PWQVSLHVQNVHVCGGSIITPEWIVTAAHCVEKPLNNPWH-WTAFAGILRQSFMFYGAGY 270 280 290 300 310 320 420 430 440 450 460 470 pF1KE2 -VDRIIFHENYNAGTYQNDIALIEMKKDGNKKDCELPRSIPACVPWSPYLFQPNDTCIVS :...: : ::.. : .:::::....: . .: : . :.:.: ...::.. : .: CCDS33 QVEKVISHPNYDSKTKNNDIALMKLQKPLTFND--LVK--PVCLPNPGMMLQPEQLCWIS 330 340 350 360 370 380 480 490 500 510 520 pF1KE2 GWG-REKDNERVFSLQWGEVKLISN--C-SKFYGNRFYEKEMECAGTYDGSIDACKGDSG ::: :. .. :. ..: :: . : :.. . . : ::: .:..:.:.:::: CCDS33 GWGATEEKGKTSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSG 390 400 410 420 430 440 530 540 550 560 570 580 pF1KE2 GPLVCMDANNVTYVWGVVSWGENCGKPEFPGVYTKVANYFDWISYHVGRPFISQYNV :::: . ::. .. : .::: .:.: :::: .: . ::: CCDS33 GPLVT-SKNNIWWLIGDTSWGSGCAKAYRPGVYGNVMVFTDWIYRQMRADG 450 460 470 480 490 583 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 11:23:25 2016 done: Sun Nov 6 11:23:26 2016 Total Scan time: 3.070 Total Display time: 0.090 Function used was FASTA [36.3.4 Apr, 2011]