FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5343, 270 aa 1>>>pF1KE5343 270 - 270 aa - 270 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1839+/-0.000636; mu= 15.5764+/- 0.038 mean_var=68.3961+/-13.603, 0's: 0 Z-trim(111.6): 169 B-trim: 0 in 0/50 Lambda= 0.155081 statistics sampled from 12336 (12511) to 12336 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.755), E-opt: 0.2 (0.384), width: 16 Scan time: 2.590 The best scores are: opt bits E(32554) CCDS219.1 CELA3B gene_id:23436|Hs108|chr1 ( 270) 1896 432.6 1.4e-121 CCDS220.1 CELA3A gene_id:10136|Hs108|chr1 ( 270) 1765 403.3 9.5e-113 CCDS157.1 CELA2A gene_id:63036|Hs108|chr1 ( 269) 1067 247.1 9.7e-66 CCDS30605.1 CELA2B gene_id:51032|Hs108|chr1 ( 269) 1025 237.7 6.5e-63 CCDS156.1 CTRC gene_id:11330|Hs108|chr1 ( 268) 1014 235.3 3.6e-62 CCDS8812.1 CELA1 gene_id:1990|Hs108|chr12 ( 258) 936 217.8 6.2e-57 CCDS10852.1 CTRL gene_id:1506|Hs108|chr16 ( 264) 710 167.2 1.1e-41 CCDS10481.1 PRSS22 gene_id:64063|Hs108|chr16 ( 317) 577 137.5 1.1e-32 CCDS32993.1 HPN gene_id:3249|Hs108|chr19 ( 417) 542 129.8 3.1e-30 CCDS58790.1 TMPRSS3 gene_id:64699|Hs108|chr21 ( 453) 536 128.5 8.5e-30 CCDS13686.1 TMPRSS3 gene_id:64699|Hs108|chr21 ( 454) 524 125.8 5.5e-29 CCDS73391.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 413) 512 123.1 3.3e-28 CCDS73392.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 448) 512 123.1 3.5e-28 CCDS44735.1 TMPRSS5 gene_id:80975|Hs108|chr11 ( 457) 512 123.1 3.6e-28 CCDS34120.1 KLKB1 gene_id:3818|Hs108|chr4 ( 638) 513 123.4 4e-28 CCDS33564.1 TMPRSS2 gene_id:7113|Hs108|chr21 ( 492) 505 121.5 1.1e-27 CCDS54486.1 TMPRSS2 gene_id:7113|Hs108|chr21 ( 529) 505 121.6 1.2e-27 CCDS32490.1 CTRB1 gene_id:1504|Hs108|chr16 ( 263) 496 119.4 2.7e-27 CCDS13571.1 TMPRSS15 gene_id:5651|Hs108|chr21 (1019) 500 120.6 4.4e-27 CCDS10430.1 TPSG1 gene_id:25823|Hs108|chr16 ( 321) 492 118.5 5.9e-27 CCDS75122.1 CORIN gene_id:10699|Hs108|chr4 ( 938) 496 119.7 7.6e-27 CCDS3477.1 CORIN gene_id:10699|Hs108|chr4 (1042) 496 119.7 8.2e-27 CCDS3847.1 F11 gene_id:2160|Hs108|chr4 ( 625) 490 118.3 1.4e-26 CCDS32489.1 CTRB2 gene_id:440387|Hs108|chr16 ( 263) 485 116.9 1.5e-26 CCDS3520.1 TMPRSS11F gene_id:389208|Hs108|chr4 ( 438) 482 116.4 3.6e-26 CCDS74669.1 PRSS56 gene_id:646960|Hs108|chr2 ( 603) 478 115.6 8.7e-26 CCDS47145.1 PRSS48 gene_id:345062|Hs108|chr4 ( 328) 469 113.4 2.1e-25 CCDS12805.1 KLK15 gene_id:55554|Hs108|chr19 ( 256) 467 112.9 2.4e-25 CCDS62766.1 KLK15 gene_id:55554|Hs108|chr19 ( 255) 465 112.4 3.3e-25 CCDS12046.1 CFD gene_id:1675|Hs108|chr19 ( 253) 460 111.3 7e-25 CCDS14101.1 ACR gene_id:49|Hs108|chr22 ( 421) 461 111.7 9.1e-25 CCDS82261.1 CFD gene_id:1675|Hs108|chr19 ( 260) 458 110.8 9.8e-25 CCDS12041.1 PRSS57 gene_id:400668|Hs108|chr19 ( 283) 458 110.9 1e-24 CCDS76482.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 290) 458 110.9 1.1e-24 CCDS74856.1 TMPRSS6 gene_id:164656|Hs108|chr22 ( 802) 462 112.1 1.3e-24 CCDS13941.1 TMPRSS6 gene_id:164656|Hs108|chr22 ( 811) 462 112.1 1.3e-24 CCDS53717.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 397) 458 111.0 1.4e-24 CCDS44743.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 432) 458 111.0 1.5e-24 CCDS53716.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 435) 458 111.0 1.5e-24 CCDS31684.1 TMPRSS4 gene_id:56649|Hs108|chr11 ( 437) 458 111.0 1.5e-24 CCDS8487.1 ST14 gene_id:6768|Hs108|chr11 ( 855) 459 111.4 2.2e-24 CCDS77203.1 PRSS57 gene_id:400668|Hs108|chr19 ( 282) 453 109.8 2.3e-24 CCDS12808.1 KLK2 gene_id:3817|Hs108|chr19 ( 261) 452 109.5 2.5e-24 CCDS44881.1 TMPRSS12 gene_id:283471|Hs108|chr12 ( 348) 452 109.6 3.1e-24 CCDS55788.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 532) 450 109.3 6e-24 CCDS58185.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 563) 450 109.3 6.3e-24 CCDS41721.1 TMPRSS13 gene_id:84000|Hs108|chr11 ( 567) 450 109.3 6.3e-24 CCDS3369.1 HGFAC gene_id:3083|Hs108|chr4 ( 655) 450 109.3 7.1e-24 CCDS75098.1 HGFAC gene_id:3083|Hs108|chr4 ( 662) 450 109.3 7.2e-24 CCDS73251.1 OVCH2 gene_id:341277|Hs108|chr11 ( 565) 446 108.4 1.2e-23 >>CCDS219.1 CELA3B gene_id:23436|Hs108|chr1 (270 aa) initn: 1896 init1: 1896 opt: 1896 Z-score: 2296.1 bits: 432.6 E(32554): 1.4e-121 Smith-Waterman score: 1896; 99.6% identity (99.6% similar) in 270 aa overlap (1-270:1-270) 10 20 30 40 50 60 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRPSSRVVNGEDAVPYSWPWQVSLQYEKSGSFYHTCGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 MMLRLLSSLLLVAVASGYGPPSSRPSSRVVNGEDAVPYSWPWQVSLQYEKSGSFYHTCGG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 SLIAPDWVVTAGHCISSSWTYQVVLGEYDRAVKEGPEQVIPINSGDLFVHPLWNRSCVAC :::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::: CCDS21 SLIAPDWVVTAGHCISSSRTYQVVLGEYDRAVKEGPEQVIPINSGDLFVHPLWNRSCVAC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGPLPDKLQEALL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 GNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGPLPDKLQEALL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 PVVDYEHCSRWNWWGSSVKKTMVCAGGDIRSGCNGDSGGPLNCPTEDGGWQVHGVTSFVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 PVVDYEHCSRWNWWGSSVKKTMVCAGGDIRSGCNGDSGGPLNCPTEDGGWQVHGVTSFVS 190 200 210 220 230 240 250 260 270 pF1KE5 AFGCNTRRKPTVFTRVSAFIDWIEETIASH :::::::::::::::::::::::::::::: CCDS21 AFGCNTRRKPTVFTRVSAFIDWIEETIASH 250 260 270 >>CCDS220.1 CELA3A gene_id:10136|Hs108|chr1 (270 aa) initn: 1765 init1: 1765 opt: 1765 Z-score: 2137.7 bits: 403.3 E(32554): 9.5e-113 Smith-Waterman score: 1765; 93.0% identity (96.3% similar) in 270 aa overlap (1-270:1-270) 10 20 30 40 50 60 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRPSSRVVNGEDAVPYSWPWQVSLQYEKSGSFYHTCGG :::::::::::::::::::::::. :::::.::::::::::::::::::::::::::::: CCDS22 MMLRLLSSLLLVAVASGYGPPSSHSSSRVVHGEDAVPYSWPWQVSLQYEKSGSFYHTCGG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 SLIAPDWVVTAGHCISSSWTYQVVLGEYDRAVKEGPEQVIPINSGDLFVHPLWNRSCVAC :::::::::::::::: . :::::::::. :::::::::::::: .:::::::::::::: CCDS22 SLIAPDWVVTAGHCISRDLTYQVVLGEYNLAVKEGPEQVIPINSEELFVHPLWNRSCVAC 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGPLPDKLQEALL :::::::::::::::::::::::::::::::::.::::::::::::::::::::::.: : CCDS22 GNDIALIKLSRSAQLGDAVQLASLPPAGDILPNKTPCYITGWGRLYTNGPLPDKLQQARL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 PVVDYEHCSRWNWWGSSVKKTMVCAGGDIRSGCNGDSGGPLNCPTEDGGWQVHGVTSFVS :::::.::::::::::.:::::::::: :::::::::::::::::::::::::::::::: CCDS22 PVVDYKHCSRWNWWGSTVKKTMVCAGGYIRSGCNGDSGGPLNCPTEDGGWQVHGVTSFVS 190 200 210 220 230 240 250 260 270 pF1KE5 AFGCNTRRKPTVFTRVSAFIDWIEETIASH ::::: :::::::::::::::::::::: CCDS22 AFGCNFIWKPTVFTRVSAFIDWIEETIASH 250 260 270 >>CCDS157.1 CELA2A gene_id:63036|Hs108|chr1 (269 aa) initn: 1064 init1: 454 opt: 1067 Z-score: 1293.7 bits: 247.1 E(32554): 9.7e-66 Smith-Waterman score: 1067; 57.6% identity (76.8% similar) in 271 aa overlap (2-270:1-269) 10 20 30 40 50 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRPS-SRVVNGEDAVPYSWPWQVSLQYEKSGSFYHTCG :.: : ::: : . : :. : .:::.::.: : :::::::::: ..:..::::: CCDS15 MIRTLLLSTLVAGALSCGDPTYPPYVTRVVGGEEARPNSWPWQVSLQYSSNGKWYHTCG 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 GSLIAPDWVVTAGHCISSSWTYQVVLGEYDRAVKEGPEQVIPINSGDLFVHPLWNRSCVA ::::: .::.::.:::::: ::.: ::... : :. .. ... . :: :: . .. CCDS15 GSLIANSWVLTAAHCISSSRTYRVGLGRHNLYVAESGSLAVSVSK--IVVHKDWNSNQIS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 CGNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGPLPDKLQEAL ::::::.::. ..: : .::: ::::: ::::. :::.:::::: ::: .:: ::.. CCDS15 KGNDIALLKLANPVSLTDKIQLACLPPAGTILPNNYPCYVTGWGRLQTNGAVPDVLQQGR 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 LPVVDYEHCSRWNWWGSSVKKTMVCAGGD-IRSGCNGDSGGPLNCPTEDGGWQVHGVTSF : :::: :: ::::::: .:.::::: . :.::::::::::: . :: :::::..:: CCDS15 LLVVDYATCSSSAWWGSSVKTSMICAGGDGVISSCNGDSGGPLNCQASDGRWQVHGIVSF 180 190 200 210 220 230 240 250 260 270 pF1KE5 VSAFGCNTRRKPTVFTRVSAFIDWIEETIASH : .::: .::.:::::: .::::. .::.. CCDS15 GSRLGCNYYHKPSVFTRVSNYIDWINSVIANN 240 250 260 >>CCDS30605.1 CELA2B gene_id:51032|Hs108|chr1 (269 aa) initn: 1056 init1: 452 opt: 1025 Z-score: 1242.9 bits: 237.7 E(32554): 6.5e-63 Smith-Waterman score: 1025; 54.6% identity (76.0% similar) in 271 aa overlap (2-270:1-269) 10 20 30 40 50 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRPS-SRVVNGEDAVPYSWPWQVSLQYEKSGSFYHTCG :.: : ::: : . : . :. ::...::.: : :::::::::: ..:..::::: CCDS30 MIRTLLLSTLVAGALSCGVSTYAPDMSRMLGGEEARPNSWPWQVSLQYSSNGQWYHTCG 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 GSLIAPDWVVTAGHCISSSWTYQVVLGEYDRAVKEGPEQVIPINSGDLFVHPLWNRSCVA ::::: .::.::.:::::: :.:.::... : :. .. ... . :: :: . :. CCDS30 GSLIANSWVLTAAHCISSSGIYRVMLGQHNLYVAESGSLAVSVSK--IVVHKDWNSDQVS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 CGNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGPLPDKLQEAL ::::::.::. ..: : .::: ::::: ::::. :::.:::::: ::: ::: :... CCDS30 KGNDIALLKLANPVSLTDKIQLACLPPAGTILPNNYPCYVTGWGRLQTNGALPDDLKQGQ 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 LPVVDYEHCSRWNWWGSSVKKTMVCAGGD-IRSGCNGDSGGPLNCPTEDGGWQVHGVTSF : :::: :: .::::.:: .:.::::: . ::::::::::: . :: :.:::. :. CCDS30 LLVVDYATCSSSGWWGSTVKTNMICAGGDGVICTCNGDSGGPLNCQASDGRWEVHGIGSL 180 190 200 210 220 230 240 250 260 270 pF1KE5 VSAFGCNTRRKPTVFTRVSAFIDWIEETIASH .:..::: ::..::::: . :::. .::.. CCDS30 TSVLGCNYYYKPSIFTRVSNYNDWINSVIANN 240 250 260 >>CCDS156.1 CTRC gene_id:11330|Hs108|chr1 (268 aa) initn: 1030 init1: 414 opt: 1014 Z-score: 1229.7 bits: 235.3 E(32554): 3.6e-62 Smith-Waterman score: 1014; 55.0% identity (77.0% similar) in 269 aa overlap (2-267:1-266) 10 20 30 40 50 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRP--SSRVVNGEDAVPYSWPWQVSLQYEKSGSFYHTC :: . :.: ::. : :: : :.:::.:::: :.:::::.:::: :. .. ::: CCDS15 MLGITVLAALLACASSCGVPSFPPNLSARVVGGEDARPHSWPWQISLQYLKNDTWRHTC 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 GGSLIAPDWVVTAGHCISSSWTYQVVLGEYDRAVKEGPEQVIPINSGDLFVHPLWNRSCV ::.::: ..:.::.::::.. ::.:..:. . : : : . .. . :: :: . CCDS15 GGTLIASNFVLTAAHCISNTRTYRVAVGKNNLEV-EDEEGSLFVGVDTIHVHKRWN--AL 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 ACGNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGPLPDKLQEA ::::::::.. ..:.:..:.: :: ..::.. :::.::::::.::::. ::::.. CCDS15 LLRNDIALIKLAEHVELSDTIQVACLPEKDSLLPKDYPCYVTGWGRLWTNGPIADKLQQG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 LLPVVDYEHCSRWNWWGSSVKKTMVCAGGD-IRSGCNGDSGGPLNCPTEDGGWQVHGVTS : ::::. ::: .::: ::::::::::: . :.::::::::::: :.:.:.: :..: CCDS15 LQPVVDHATCSRIDWWGFRVKKTMVCAGGDGVISACNGDSGGPLNCQLENGSWEVFGIVS 180 190 200 210 220 230 240 250 260 270 pF1KE5 FVSAFGCNTRRKPTVFTRVSAFIDWIEETIASH : : :::::.::.:.:::::.::::.: . CCDS15 FGSRRGCNTRKKPVVYTRVSAYIDWINEKMQL 240 250 260 >>CCDS8812.1 CELA1 gene_id:1990|Hs108|chr12 (258 aa) initn: 959 init1: 578 opt: 936 Z-score: 1135.6 bits: 217.8 E(32554): 6.2e-57 Smith-Waterman score: 936; 55.3% identity (80.5% similar) in 246 aa overlap (26-270:16-258) 10 20 30 40 50 60 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRPSSRVVNGEDAVPYSWPWQVSLQYEKSGSFYHTCGG ..:::.: .: ::: :.::::...:: :::::: CCDS88 MLVLYGHSTQDLPETNARVVGGTEAGRNSWPSQISLQYRSGGSRYHTCGG 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 SLIAPDWVVTAGHCISSSWTYQVVLGEYDRAVKEGPEQVIPINSGDLFVHPLWNRSCVAC .:: .::.::.::.. . :..:: :... . ..: :: . ... . ::: :: . :: CCDS88 TLIRQNWVMTAAHCVDYQKTFRVVAGDHNLSQNDGTEQYVSVQK--IVVHPYWNSDNVAA 60 70 80 90 100 130 140 150 160 170 180 pF1KE5 GNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGPLPDKLQEALL : ::::..:..:. :.. :::. :: : :: :..::::::::. ::: : . ::.: : CCDS88 GYDIALLRLAQSVTLNSYVQLGVLPQEGAILANNSPCYITGWGKTKTNGQLAQTLQQAYL 110 120 130 140 150 160 190 200 210 220 230 pF1KE5 PVVDYEHCSRWNWWGSSVKKTMVCAGGD-IRSGCNGDSGGPLNCPTEDGGWQVHGVTSFV : ::: :: ..:::.::.:::::::: .::::.:::::::.: . .: ..:::::::: CCDS88 PSVDYAICSSSSYWGSTVKNTMVCAGGDGVRSGCQGDSGGPLHCLV-NGKYSVHGVTSFV 170 180 190 200 210 220 240 250 260 270 pF1KE5 SAFGCNTRRKPTVFTRVSAFIDWIEETIASH :. :::. :::::::.:::.:.::...:::. CCDS88 SSRGCNVSRKPTVFTQVSAYISWINNVIASN 230 240 250 >>CCDS10852.1 CTRL gene_id:1506|Hs108|chr16 (264 aa) initn: 506 init1: 182 opt: 710 Z-score: 862.2 bits: 167.2 E(32554): 1.1e-41 Smith-Waterman score: 710; 43.2% identity (68.1% similar) in 273 aa overlap (1-268:2-262) 10 20 30 40 50 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRP----SSRVVNGEDAVPYSWPWQVSLQYEKSGSFY ..: : ::.:.. . : : :. .: :.:.::::.:: ::::::::: ..:: . CCDS10 MLLLSLTLSLVLLGSSWGCGIPAIKPALSFSQRIVNGENAVLGSWPWQVSLQ-DSSG--F 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 HTCGGSLIAPDWVVTAGHCISSSWTYQVVLGEYDRAVKEGPEQVIPINSGDLFVHPLWNR : ::::::. .:::::.:: : . ::::::::. . : ::. .. . ..:: :: CCDS10 HFCGGSLISQSWVVTAAHCNVSPGRHFVVLGEYDRSSNAEPLQVLSVSRA--ITHPSWNS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 SCVACGNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGPL-PDK . . .::..:.::. :: .. . : ... : . : :::::: : . : . CCDS10 TTM--NNDVTLLKLASPAQYTTRISPVCLASSNEALTEGLTCVTTGWGRLSGVGNVTPAH 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 LQEALLPVVDYEHCSRWNWWGSSVKKTMVCAGGDIRSGCNGDSGGPLNCPTEDGGWQVHG ::.. ::.: ..: . .::::. .:.:::: :.:.::::::: : . . : . : CCDS10 LQQVALPLVTVNQCRQ--YWGSSITDSMICAGGAGASSCQGDSGGPLVCQ-KGNTWVLIG 180 190 200 210 220 230 240 250 260 270 pF1KE5 VTSFVSAFGCNTRRKPTVFTRVSAFIDWIEETIASH ..:. . .::.: :.:.:::: : ::...:: CCDS10 IVSWGTK-NCNVR-APAVYTRVSKFSTWINQVIAYN 240 250 260 >>CCDS10481.1 PRSS22 gene_id:64063|Hs108|chr16 (317 aa) initn: 340 init1: 150 opt: 577 Z-score: 700.2 bits: 137.5 E(32554): 1.1e-32 Smith-Waterman score: 579; 36.7% identity (61.9% similar) in 286 aa overlap (2-267:21-289) 10 20 30 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRPS--SRVVNGEDAVPYS .: : :. .: :. : ..:. .:::.:::.. CCDS10 MVVSGAPPALGGGCLGTFTSLLLLASTAILNAARIPVPPACGKPQQLNRVVGGEDSTDSE 10 20 30 40 50 60 40 50 60 70 80 pF1KE5 WPWQVSLQYEKSGSFYHTCGGSLIAPDWVVTAGHC----ISSSWTYQVVLGEYD------ ::: ::.: :.:. : :.:::.. ::.::.:: ... . ..:.:: .. CCDS10 WPWIVSIQ--KNGT--HHCAGSLLTSRWVITAAHCFKDNLNKPYLFSVLLGAWQLGNPGS 70 80 90 100 110 90 100 110 120 130 140 pF1KE5 RAVKEGPEQVIPINSGDLFVHPLWNRSCVACGNDIALIKLSRSAQLGDAVQLASLPPAGD :. : : : : ::... . ::. ::::..: :: :... : :: :. CCDS10 RSQKVGVAWVEP--------HPVYSWKEGACA-DIALVRLERSIQFSERVLPICLPDASI 120 130 140 150 160 150 160 170 180 190 200 pF1KE5 ILPNETPCYITGWGRLYTNGPLP--DKLQEALLPVVDYEHCSRWNWWGSS---VKKTMVC :: .: :.:.::: . . ::: . ::. .:..: : ::. : :.. . . :.: CCDS10 HLPPNTHCWISGWGSIQDGVPLPHPQTLQKLKVPIIDSEVCSHLYWRGAGQGPITEDMLC 170 180 190 200 210 220 210 220 230 240 250 260 pF1KE5 AG---GDIRSGCNGDSGGPLNCPTEDGGWQVHGVTSFVSAFGCNTRRKPTVFTRVSAFID :: :. :..: ::::::: : . ::.: . :. :. . :: : .: :. .:: . CCDS10 AGYLEGE-RDACLGDSGGPLMCQV-DGAWLLAGIISW--GEGCAERNRPGVYISLSAHRS 230 240 250 260 270 280 270 pF1KE5 WIEETIASH :.:. . CCDS10 WVEKIVQGVQLRGRAQGGGALRAPSQGSGAAARS 290 300 310 >>CCDS32993.1 HPN gene_id:3249|Hs108|chr19 (417 aa) initn: 360 init1: 219 opt: 542 Z-score: 656.2 bits: 129.8 E(32554): 3.1e-30 Smith-Waterman score: 542; 36.0% identity (60.5% similar) in 261 aa overlap (25-270:159-407) 10 20 30 40 50 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRPSSRVVNGEDAVPYSWPWQVSLQYEKSGSF : .:.:.:.:. :::::::.:. . CCDS32 QRLLEVISVCDCPRGRFLAAICQDCGRRKLPVDRIVGGRDTSLGRWPWQVSLRYDGA--- 130 140 150 160 170 180 60 70 80 90 100 pF1KE5 YHTCGGSLIAPDWVVTAGHC------ISSSWTYQVVLGEYDRAVKEGPE---QVIPINSG : :::::.. :::.::.:: . : : .: : .: .: . :.. ..: CCDS32 -HLCGGSLLSGDWVLTAAHCFPERNRVLSRW--RVFAGAVAQASPHGLQLGVQAVVYHGG 190 200 210 220 230 240 110 120 130 140 150 160 pF1KE5 DLFVHPLWNRSCVACGNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRL : :. . . .:::::..:: : . .: . :: ::. : . : .:::: CCDS32 YL---PFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNT 250 260 270 280 290 170 180 190 200 210 220 pF1KE5 YTNGPLPDKLQEALLPVVDYEHCSRWNWWGSSVKKTMVCAG---GDIRSGCNGDSGGPLN : :::: .:... . :. ...:...: : ::: : : ..:.::::::. CCDS32 QYYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGI-DACQGDSGGPFV 300 310 320 330 340 350 230 240 250 260 270 pF1KE5 CP---TEDGGWQVHGVTSFVSAFGCNTRRKPTVFTRVSAFIDWIEETIASH : .. :.. :..:. . :: .:: :.:.:: : .:: ..: .: CCDS32 CEDSISRTPRWRLCGIVSWGT--GCALAQKPGVYTKVSDFREWIFQAIKTHSEASGMVTQ 360 370 380 390 400 410 CCDS32 L >>CCDS58790.1 TMPRSS3 gene_id:64699|Hs108|chr21 (453 aa) initn: 536 init1: 185 opt: 536 Z-score: 648.4 bits: 128.5 E(32554): 8.5e-30 Smith-Waterman score: 536; 36.0% identity (64.0% similar) in 250 aa overlap (26-267:214-447) 10 20 30 40 50 pF1KE5 MMLRLLSSLLLVAVASGYGPPSSRPSSRVVNGEDAVPYSWPWQVSLQYEKSGSFY :::.:.:. .. .::::.:::.. : : CCDS58 LHHSVYVREGCASGHVVTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQFQ--G--Y 190 200 210 220 230 60 70 80 90 100 110 pF1KE5 HTCGGSLIAPDWVVTAGHCI-----SSSWTYQVVLGEYDRAVKEGPEQVIPINSGDLFVH : ::::.:.: :..::.::. .::: :: : . . .: ... . : CCDS58 HLCGGSVITPLWIITAAHCVYDLYLPKSWTIQV--GLVSLLDNPAPSHLVE----KIVYH 240 250 260 270 280 290 120 130 140 150 160 170 pF1KE5 PLWNRSCVACGNDIALIKLSRSAQLGDAVQLASLPPAGDILPNETPCYITGWGRLYTNGP .. . . ::::::.::. ... .: . :: . . .:. :. .::: .: CCDS58 SKYKPKRL--GNDIALMKLAGPLTFNEMIQPVCLPNSEENFPDGKVCWTSGWGATEDGGD 300 310 320 330 340 350 180 190 200 210 220 pF1KE5 LPDKLQEALLPVVDYEHCSRWNWWGSSVKKTMVCAG---GDIRSGCNGDSGGPLNCPTED :..: .:... . :.. . .:. .. .:.::: : . : :.::::::: : : CCDS58 ASPVLNHAAVPLISNKICNHRDVYGGIISPSMLCAGYLTGGVDS-CQGDSGGPLVCQ-ER 360 370 380 390 400 230 240 250 260 270 pF1KE5 GGWQVHGVTSFVSAFGCNTRRKPTVFTRVSAFIDWIEETIASH :.. :.::: ..:: :: :.:::..:.:::.: . CCDS58 RLWKLVGATSF--GIGCAEVNKPGVYTRVTSFLDWIHEQMERDLKT 410 420 430 440 450 270 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 23:57:19 2016 done: Mon Nov 7 23:57:19 2016 Total Scan time: 2.590 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]