FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4462, 479 aa 1>>>pF1KE4462 479 - 479 aa - 479 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2205+/-0.00082; mu= 18.4493+/- 0.049 mean_var=68.3732+/-13.432, 0's: 0 Z-trim(106.9): 13 B-trim: 4 in 1/49 Lambda= 0.155107 statistics sampled from 9222 (9229) to 9222 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.657), E-opt: 0.2 (0.283), width: 16 Scan time: 2.390 The best scores are: opt bits E(32554) CCDS46609.1 CTSA gene_id:5476|Hs108|chr20 ( 480) 3278 742.6 2.1e-214 CCDS13385.2 CTSA gene_id:5476|Hs108|chr20 ( 498) 3278 742.7 2.1e-214 CCDS54467.1 CTSA gene_id:5476|Hs108|chr20 ( 481) 2494 567.2 1.3e-161 CCDS5419.1 CPVL gene_id:54504|Hs108|chr7 ( 476) 555 133.3 5.5e-31 >>CCDS46609.1 CTSA gene_id:5476|Hs108|chr20 (480 aa) initn: 3240 init1: 3240 opt: 3278 Z-score: 3962.8 bits: 742.6 E(32554): 2.1e-214 Smith-Waterman score: 3278; 99.8% identity (99.8% similar) in 480 aa overlap (1-479:1-480) 10 20 30 40 50 pF1KE4 MIRAAPPPLFLLLLLLLL-VSWASRGEAAPDQDEIQRLPGLAKQPSFRQYSGYLKGSGSK :::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::: CCDS46 MIRAAPPPLFLLLLLLLLLVSWASRGEAAPDQDEIQRLPGLAKQPSFRQYSGYLKGSGSK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 HLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFLVQPDGVTLEYNPYSWNLI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 HLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFLVQPDGVTLEYNPYSWNLI 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 ANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDFFRLFPEYKNNKLFLTGESYA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 ANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDFFRLFPEYKNNKLFLTGESYA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 GIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSLVYFAYYHGLLGNRLWSSLQTHC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 GIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSLVYFAYYHGLLGNRLWSSLQTHC 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE4 CSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCAGGVPSHFRYEKDTVVVQD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 CSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCAGGVPSHFRYEKDTVVVQD 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE4 LGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLNNPYVRKALNIPEQLPQWD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLNNPYVRKALNIPEQLPQWD 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE4 MCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMACNFMGDEWFVDSLNQKME :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMACNFMGDEWFVDSLNQKME 370 380 390 400 410 420 420 430 440 450 460 470 pF1KE4 VQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTDKPLAAFTMFSRFLNKQPY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTDKPLAAFTMFSRFLNKQPY 430 440 450 460 470 480 >>CCDS13385.2 CTSA gene_id:5476|Hs108|chr20 (498 aa) initn: 3240 init1: 3240 opt: 3278 Z-score: 3962.6 bits: 742.7 E(32554): 2.1e-214 Smith-Waterman score: 3278; 99.8% identity (99.8% similar) in 480 aa overlap (1-479:19-498) 10 20 30 40 pF1KE4 MIRAAPPPLFLLLLLLLL-VSWASRGEAAPDQDEIQRLPGLA :::::::::::::::::: ::::::::::::::::::::::: CCDS13 MTSSPRAPPGEQGRGGAEMIRAAPPPLFLLLLLLLLLVSWASRGEAAPDQDEIQRLPGLA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 KQPSFRQYSGYLKGSGSKHLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 KQPSFRQYSGYLKGSGSKHLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFL 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 VQPDGVTLEYNPYSWNLIANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDFFR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VQPDGVTLEYNPYSWNLIANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDFFR 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 LFPEYKNNKLFLTGESYAGIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSLVYFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LFPEYKNNKLFLTGESYAGIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSLVYFA 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE4 YYHGLLGNRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 YYHGLLGNRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCA 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE4 GGVPSHFRYEKDTVVVQDLGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GGVPSHFRYEKDTVVVQDLGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLN 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE4 NPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 NPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMA 370 380 390 400 410 420 410 420 430 440 450 460 pF1KE4 CNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 CNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTD 430 440 450 460 470 480 470 pF1KE4 KPLAAFTMFSRFLNKQPY :::::::::::::::::: CCDS13 KPLAAFTMFSRFLNKQPY 490 >>CCDS54467.1 CTSA gene_id:5476|Hs108|chr20 (481 aa) initn: 2595 init1: 2467 opt: 2494 Z-score: 3014.7 bits: 567.2 E(32554): 1.3e-161 Smith-Waterman score: 3110; 96.2% identity (96.2% similar) in 480 aa overlap (1-479:19-481) 10 20 30 40 pF1KE4 MIRAAPPPLFLLLLLLLL-VSWASRGEAAPDQDEIQRLPGLA :::::::::::::::::: ::::::::::::::::::::::: CCDS54 MTSSPRAPPGEQGRGGAEMIRAAPPPLFLLLLLLLLLVSWASRGEAAPDQDEIQRLPGLA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 KQPSFRQYSGYLKGSGSKHLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KQPSFRQYSGYLKGSGSKHLHYWFVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFL 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 VQPDGVTLEYNPYSWNLIANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDFFR ::::::::::::::::::::::::::::::::::::::::::: CCDS54 -----------------IANVLYLESPAGVGFSYSDDKFYATNDTEVAQSNFEALQDFFR 130 140 150 160 170 180 190 200 210 220 pF1KE4 LFPEYKNNKLFLTGESYAGIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSLVYFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LFPEYKNNKLFLTGESYAGIYIPTLAVLVMQDPSMNLQGLAVGNGLSSYEQNDNSLVYFA 170 180 190 200 210 220 230 240 250 260 270 280 pF1KE4 YYHGLLGNRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 YYHGLLGNRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCA 230 240 250 260 270 280 290 300 310 320 330 340 pF1KE4 GGVPSHFRYEKDTVVVQDLGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GGVPSHFRYEKDTVVVQDLGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLN 290 300 310 320 330 340 350 360 370 380 390 400 pF1KE4 NPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 NPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQKYQILLYNGDVDMA 350 360 370 380 390 400 410 420 430 440 450 460 pF1KE4 CNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 CNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTD 410 420 430 440 450 460 470 pF1KE4 KPLAAFTMFSRFLNKQPY :::::::::::::::::: CCDS54 KPLAAFTMFSRFLNKQPY 470 480 >>CCDS5419.1 CPVL gene_id:54504|Hs108|chr7 (476 aa) initn: 532 init1: 337 opt: 555 Z-score: 669.8 bits: 133.3 E(32554): 5.5e-31 Smith-Waterman score: 667; 29.9% identity (60.4% similar) in 455 aa overlap (37-474:66-466) 10 20 30 40 50 60 pF1KE4 PPLFLLLLLLLLVSWASRGEAAPDQDEIQRLPGLAKQPSFRQYSGYL--KGSGSKHLHYW .::: ....:.:.: . . ...: .: CCDS54 KGDSGQPLFLTPYIEAGKIQKGRELSLVGPFPGL----NMKSYAGFLTVNKTYNSNLFFW 40 50 60 70 80 90 70 80 90 100 110 120 pF1KE4 FVESQKDPENSPVVLWLNGGPGCSSLDGLLTEHGPFLVQPDGVTLEYNPYSWNLIANVLY : .: .::..::::::.:::: ::. ::..::::..: . .::. . :. ..:: CCDS54 FFPAQIQPEDAPVVLWLQGGPGGSSMFGLFVEHGPYVVTSN-MTLRDRDFPWTTTLSMLY 100 110 120 130 140 150 130 140 150 160 170 180 pF1KE4 LESPAGVGFSYSDDKF-YATNDTEVAQSNFEALQDFFRLFPEYKNNKLFLTGESYAGIYI ...:.:.:::..:: ::.:. .::.. . :: .::..::::::: ...::::::: :. CCDS54 IDNPVGTGFSFTDDTHGYAVNEDDVARDLYSALIQFFQIFPEYKNNDFYVTGESYAGKYV 160 170 180 190 200 210 190 200 210 220 230 pF1KE4 PTLAVLVMQ-DP----SMNLQGLAVGNGLSSYEQNDNSLVYFAYYHGLLGNRLWSSLQTH :..: :. . .: ..::.:.:.:.: :. :. .. . : : ::: .. . .: . CCDS54 PAIAHLIHSLNPVREVKINLNGIAIGDGYSDPESIIGGYAEFLYQIGLLDEKQKKYFQKQ 220 230 240 250 260 270 240 250 260 270 280 290 pF1KE4 CCSQNKCNFYDNKDLECVTNLQEVARIVGNSGLNIYNLYAPCAGGVPSHFRYEKDTVVVQ : ::. .... . . :. .: . ::.:. : CCDS54 CH-------------ECIEHIRKQNWFEAFEILD--KLLDGDLTSDPSYFQN------VT 280 290 300 300 310 320 330 340 350 pF1KE4 DLGNIFTRLPLKRMWHQALLRSGDKVRMDPPCTNTTAASTYLNNPYVRKALNIPEQLPQW .: .. : : : . .:. : ::.:... .: CCDS54 GCSNYYNFL-----------------RCTEP-EDQLYYVKFLSLPEVRQAIHVGNQT--- 310 320 330 340 360 370 380 390 400 pF1KE4 DMCNFLVNLQYRR--LYRSMNSQYLKLLSSQKYQILLYNGDVDMAC-------NFMGDEW . . . .: : .:.. ..... :..:.:::..:. ..:: .: CCDS54 -FNDGTIVEKYLREDTVQSVKPWLTEIMNN--YKVLIYNGQLDIIVAAALTERSLMGMDW 350 360 370 380 390 400 410 420 430 440 450 460 pF1KE4 FVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKGAGHMVPTDKPLAAFTM .. .: : .. : . .:: ..::.... . . . :.:.::..: :.:: :: : CCDS54 KGSQEYKKAE--KKVWKIFKSDS--EVAGYIRQAGDFHQVIIRGGGHILPYDQPLRAFDM 410 420 430 440 450 460 470 pF1KE4 FSRFLNKQPY ..::. CCDS54 INRFIYGKGWDPYVG 470 479 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:30:58 2016 done: Sun Nov 6 00:30:59 2016 Total Scan time: 2.390 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]