FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5142, 104 aa
  1>>>pF1KE5142 104 - 104 aa - 104 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.9383+/-0.000542; mu= 11.0854+/- 0.033
 mean_var=50.5723+/-10.169, 0's: 0 Z-trim(111.6): 17 B-trim: 0 in 0/52
 Lambda= 0.180351
 statistics sampled from 12486 (12503) to 12486 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.794), E-opt: 0.2 (0.384), width: 16
 Scan time: 1.310

The best scores are:                                 opt  bits E(32554)
CCDS55000.1 PGC gene_id:5225|Hs108|chr6      ( 315)  636 172.4 1.3e-43
CCDS4859.1 PGC gene_id:5225|Hs108|chr6       ( 388)  636 172.5 1.6e-43
CCDS31574.1 PGA3 gene_id:643834|Hs108|chr11  ( 388)  269  77.0 8.9e-15
CCDS31575.1 PGA4 gene_id:643847|Hs108|chr11  ( 388)  268  76.7 1.1e-14
CCDS8001.1 PGA5 gene_id:5222|Hs108|chr11     ( 388)  268  76.7 1.1e-14
CCDS73012.1 CTSE gene_id:1510|Hs108|chr1     ( 363)  231  67.1   8e-12
CCDS73013.1 CTSE gene_id:1510|Hs108|chr1     ( 396)  231  67.1 8.6e-12
CCDS7725.1 CTSD gene_id:1509|Hs108|chr11     ( 412)  229  66.6 1.3e-11

>>CCDS55000.1 PGC gene_id:5225|Hs108|chr6 (315 aa)
 initn: 636 init1: 636 opt: 636 Z-score: 896.3 bits: 172.4 E(32554): 1.3e-43
Smith-Waterman score: 636; 99.0% identity (99.0% similar) in 97 aa overlap (1-97:1-97)

               10        20        30        40        50        60
pF1KE5 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
               10        20        30        40        50        60

               70        80        90       100
pF1KE5 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ::::::::::::::::::::::::::::::::::: :
CCDS55 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSES
               70        80        90       100       110       120

CCDS55 STYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIM
              130       140       150       160       170       180

>>CCDS4859.1 PGC gene_id:5225|Hs108|chr6 (388 aa)
 initn: 651 init1: 636 opt: 636 Z-score: 894.8 bits: 172.5 E(32554): 1.6e-43
Smith-Waterman score: 636; 99.0% identity (99.0% similar) in 97 aa overlap (1-97:1-97)

               10        20        30        40        50        60
pF1KE5 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
               10        20        30        40        50        60

               70        80        90       100
pF1KE5 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ::::::::::::::::::::::::::::::::::: :
CCDS48 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSES
               70        80        90       100       110       120

CCDS48 STYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIM
              130       140       150       160       170       180

>>CCDS31574.1 PGA3 gene_id:643834|Hs108|chr11 (388 aa)
 initn: 275 init1: 135 opt: 269 Z-score: 378.7 bits: 77.0 E(32554): 8.9e-15
Smith-Waterman score: 269; 47.6% identity (70.9% similar) in 103 aa overlap (1-97:1-100)

                10        20        30        40        50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
       :::.... :: :.  :  . :::: . ::.:.:..:.::: .::. :. .:: :: : .
CCDS31 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
               10          20        30        40        50

      60             70        80        90       100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       .    :  .:.  :.:  ::: :.:::: :.: :.:::::: :
CCDS31 KAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
        60        70        80        90       100       110

CCDS31 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
       120       130       140       150       160       170

>>CCDS31575.1 PGA4 gene_id:643847|Hs108|chr11 (388 aa)
 initn: 275 init1: 135 opt: 268 Z-score: 377.3 bits: 76.7 E(32554): 1.1e-14
Smith-Waterman score: 268; 47.6% identity (69.9% similar) in 103 aa overlap (1-97:1-100)

                10        20        30        40        50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
       :::.... :: :.  :  . :::: . ::.:.:..:.::: .::. :. .:: :: : .
CCDS31 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
               10          20        30        40        50

      60             70        80        90       100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
            :  .:.  :.:  ::: :.:::: :.: :.:::::: :
CCDS31 EAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
        60        70        80        90       100       110

CCDS31 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
       120       130       140       150       160       170

>>CCDS8001.1 PGA5 gene_id:5222|Hs108|chr11 (388 aa)
 initn: 275 init1: 135 opt: 268 Z-score: 377.3 bits: 76.7 E(32554): 1.1e-14
Smith-Waterman score: 268; 47.6% identity (69.9% similar) in 103 aa overlap (1-97:1-100)

                10        20        30        40        50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
       :::.... :: :.  :  . :::: . ::.:.:..:.::: .::. :. .:: :: : .
CCDS80 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
               10          20        30        40        50

      60             70        80        90       100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
            :  .:.  :.:  ::: :.:::: :.: :.:::::: :
CCDS80 EAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
        60        70        80        90       100       110

CCDS80 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
       120       130       140       150       160       170

>>CCDS73012.1 CTSE gene_id:1510|Hs108|chr1 (363 aa)
 initn: 234 init1: 138 opt: 231 Z-score: 325.8 bits: 67.1 E(32554): 8e-12
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)

                10          20        30        40        50
pF1KE5  MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
           ....:: :.: ::  .. .:::..  :... .. .. :.:: ..:. :     .
CCDS73 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
               10        20        30        40        50        60

        60          70        80        90       100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ... .  ::.  :.:  ::: ::::.::::: :.:::::: :
CCDS73 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
               70        80        90       100       110       120

CCDS73 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAE
              130       140       150       160       170       180

>>CCDS73013.1 CTSE gene_id:1510|Hs108|chr1 (396 aa)
 initn: 212 init1: 138 opt: 231 Z-score: 325.2 bits: 67.1 E(32554): 8.6e-12
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)

                10          20        30        40        50
pF1KE5  MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
           ....:: :.: ::  .. .:::..  :... .. .. :.:: ..:. :     .
CCDS73 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
               10        20        30        40        50        60

        60          70        80        90       100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ... .  ::.  :.:  ::: ::::.::::: :.:::::: :
CCDS73 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
               70        80        90       100       110       120

CCDS73 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAE
              130       140       150       160       170       180

>>CCDS7725.1 CTSD gene_id:1509|Hs108|chr11 (412 aa)
 initn: 229 init1: 156 opt: 229 Z-score: 322.1 bits: 66.6 E(32554): 1.3e-11
Smith-Waterman score: 229; 48.4% identity (63.2% similar) in 95 aa overlap (9-97:11-103)

                 10         20        30        40        50
pF1KE5   MKWMVVVLVCLQLLEA-AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
                 .::    : :.:..::.:: :::.::.: :  :         :. ::  .
CCDS77 MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVG--GSVEDLIAKGPVSKYSQA
               10        20        30        40          50

        60             70        80        90       100
pF1KE5 DLSVTYEPMA-----YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
         .::  :.      :::: :.:::.:::::: : :.:::::: :
CCDS77 VPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACW
       60        70        80        90       100       110

CCDS77 IHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQV
      120       130       140       150       160       170

104 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 21:47:30 2016 done: Mon Nov 7 21:47:31 2016
 Total Scan time: 1.310 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]