FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1626, 142 aa
  1>>>pF1KE1626 142 - 142 aa - 142 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2940+/-0.000752; mu= 10.8863+/- 0.045
 mean_var=53.6476+/-11.105, 0's: 0 Z-trim(106.7): 18 B-trim: 564 in 1/50
 Lambda= 0.175105
 statistics sampled from 9107 (9120) to 9107 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.668), E-opt: 0.2 (0.28), width: 16
 Scan time: 1.590

The best scores are:                                  opt  bits E(32554)
CCDS8765.1 LALBA gene_id:3906|Hs108|chr12     ( 142)  979 254.9 1.2e-68
CCDS8989.1 LYZ gene_id:4069|Hs108|chr12       ( 148)  337  92.7 8.3e-20
CCDS31174.1 LYZL1 gene_id:84569|Hs108|chr10   ( 194)  323  89.2 1.2e-18
CCDS7167.2 LYZL2 gene_id:119180|Hs108|chr10   ( 194)  323  89.2 1.2e-18
CCDS11275.1 SPACA3 gene_id:124912|Hs108|chr17 ( 215)  321  88.7 1.9e-18
CCDS14286.1 SPACA5 gene_id:389852|Hs108|chrX  ( 159)  294  81.9 1.7e-16
CCDS35238.1 SPACA5B gene_id:729201|Hs108|chrX ( 159)  294  81.9 1.7e-16
CCDS11302.1 LYZL6 gene_id:57151|Hs108|chr17   ( 148)  284  79.3 8.9e-16
CCDS82105.1 SPACA3 gene_id:124912|Hs108|chr17 ( 112)  231  65.9 7.4e-12
CCDS2697.1 LYZL4 gene_id:131375|Hs108|chr3    ( 146)  227  64.9 1.9e-11

>>CCDS8765.1 LALBA gene_id:3906|Hs108|chr12 (142 aa)
 initn: 979 init1: 979 opt: 979 Z-score: 1345.9 bits: 254.9 E(32554): 1.2e-68
Smith-Waterman score: 979; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142)

10 20 30 40 50 60
pF1KE1 MRFFVPLFLVGILFPAILAKQFTKCELSQLLKDIDGYGGIALPELICTMFHTSGYDTQAI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS87 MRFFVPLFLVGILFPAILAKQFTKCELSQLLKDIDGYGGIALPELICTMFHTSGYDTQAI
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE1 VENNESTEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKILDIKGI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS87 VENNESTEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKILDIKGI
70 80 90 100 110 120

130 140
pF1KE1 DYWLAHKALCTEKLEQWLCEKL
::::::::::::::::::::::
CCDS87 DYWLAHKALCTEKLEQWLCEKL
130 140

>>CCDS8989.1 LYZ gene_id:4069|Hs108|chr12 (148 aa)
 initn: 288 init1: 189 opt: 337 Z-score: 469.0 bits: 92.7 E(32554): 8.3e-20
Smith-Waterman score: 337; 37.7% identity (69.6% similar) in 138 aa overlap (1-133:1-137)

10 20 30 40 50
pF1KE1 MRFFVPLFLVGILFPAILAKQFTKCELSQLLK--DIDGYGGIALPELICTMFHTSGYDTQ
:. .. : :: .: .. .: : .:::.. :: .::: ::.: . .: :::.:.
CCDS89 MKALIVLGLV-LLSVTVQGKVFERCELARTLKRLGMDGYRGISLANWMCLAKWESGYNTR
10 20 30 40 50

60 70 80 90 100 110
pF1KE1 AIVEN--NESTEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKIL-
: : ..::.::.:::... ::.....: . : : .::. .:.:.:.: . :::...
CCDS89 ATNYNAGDRSTDYGIFQINSRYWCNDGKTPGAVNACHLSCSALLQDNIADAVACAKRVVR
60 70 80 90 100 110

120 130 140
pF1KE1 DIKGIDYWLAHKALCTEKLEQWLCEKL
: .:: :.: . : ..
CCDS89 DPQGIRAWVAWRNRCQNRDVRQYVQGCGV
120 130 140

>>CCDS31174.1 LYZL1 gene_id:84569|Hs108|chr10 (194 aa)
 initn: 114 init1: 114 opt: 323 Z-score: 448.0 bits: 89.2 E(32554): 1.2e-18
Smith-Waterman score: 323; 34.8% identity (68.1% similar) in 141 aa overlap (7-140:53-192)

10 20 30
pF1KE1                         MRFFVPLFLVGILFPAILAKQFTKCELSQLLK--DI
: :.: : . .: .:.:.:..... .
CCDS31 TEKSASGAGTRNLPFQFCLRQALRMKAAGILTLIGCLVTGAESKIYTRCKLAKIFSRAGL
30 40 50 60 70 80

40 50 60 70 80 90
pF1KE1 DGYGGIALPELICTMFHTSGYDTQA-IVENNESTEYGLFQISNKLWCKSSQVPQSRNICD
:.: :..: . :: .. :::.: : : .. : .::.:::.. ::. ... .. : :
CCDS31 DNYWGFSLGNWICMAYYESGYNTTAQTVLDDGSIDYGIFQINSFAWCRRGKLKEN-NHCH
90 100 110 120 130 140

100 110 120 130 140
pF1KE1 ISCDKFLDDDITDDIMCAKKIL-DIKGIDYWLAHKALCTEK-LEQWL--CEKL
..:. .. ::.:: :.::.::. . .:..:: . : : . : .: ::
CCDS31 VACSALITDDLTDAIICARKIVKETQGMNYWQGWKKHCEGRDLSEWKKGCEVS
150 160 170 180 190

>>CCDS7167.2 LYZL2 gene_id:119180|Hs108|chr10 (194 aa)
 initn: 129 init1: 116 opt: 323 Z-score: 448.0 bits: 89.2 E(32554): 1.2e-18
Smith-Waterman score: 323; 35.5% identity (68.1% similar) in 141 aa overlap (7-140:53-192)

10 20 30
pF1KE1                         MRFFVPLFLVGILFPAILAKQFTKCELSQLLK--DI
: :.: : . .: .:.:.:..... .
CCDS71 TEKSASAAGTRNLPFQFCLRQALRMKAAGILTLIGCLVTGAESKIYTRCKLAKIFSRAGL
30 40 50 60 70 80

40 50 60 70 80 90
pF1KE1 DGYGGIALPELICTMFHTSGYDTQA-IVENNESTEYGLFQISNKLWCKSSQVPQSRNICD
:.: :..: . :: .. :::.: : : .. : .::.:::.. ::. ... .. : :
CCDS71 DNYWGFSLGNWICMAYYESGYNTTAQTVLDDGSIDYGIFQINSFAWCRRGKLKEN-NHCH
90 100 110 120 130 140

100 110 120 130 140
pF1KE1 ISCDKFLDDDITDDIMCAKKIL-DIKGIDYWLAHKALCTEK-LEQWL--CEKL
..:. .. ::.:: :.:::::. . .:..:: . : : . : .: ::
CCDS71 VACSALVTDDLTDAIICAKKIVKETQGMNYWQGWKKHCEGRDLSDWKKDCEVS
150 160 170 180 190

>>CCDS11275.1 SPACA3 gene_id:124912|Hs108|chr17 (215 aa)
 initn: 189 init1: 149 opt: 321 Z-score: 444.5 bits: 88.7 E(32554): 1.9e-18
Smith-Waterman score: 321; 35.3% identity (66.9% similar) in 139 aa overlap (9-140:77-214)

10 20 30
pF1KE1                       MRFFVPLFLVGILFPAILAKQFTKCELSQLLKD--IDG
:.. :.:. :: . .:::...:.: .::
CCDS11 STSAAGIEARSRALRRRWCPAGIMLLALVCLLSCLLPSSEAKLYGRCELARVLHDFGLDG
50 60 70 80 90 100

40 50 60 70 80 90
pF1KE1 YGGIALPELICTMFHTSGYDTQAI-VENNESTEYGLFQISNKLWCKSSQVPQSRNICDIS
: : .: . .: . :::... :. : . ::. :.:::... :: :. .:. :.: .
CCDS11 YRGYSLADWVCLAYFTSGFNAAALDYEADGSTNNGIFQINSRRWC-SNLTPNVPNVCRMY
110 120 130 140 150 160

100 110 120 130 140
pF1KE1 CDKFLDDDITDDIMCAKKIL-DIKGIDYWLAHKALCTEK-LEQWL--CEKL
:. .:. .. : ..:: :: . .:. :: : . : : : .:. :.
CCDS11 CSDLLNPNLKDTVICAMKITQEPQGLGYWEAWRHHCQGKDLTEWVDGCDF
170 180 190 200 210

>>CCDS14286.1 SPACA5 gene_id:389852|Hs108|chrX (159 aa)
 initn: 260 init1: 160 opt: 294 Z-score: 409.8 bits: 81.9 E(32554): 1.7e-16
Smith-Waterman score: 294; 32.6% identity (66.0% similar) in 144 aa overlap (5-140:7-148)

10 20 30 40 50
pF1KE1   MRFFVPLFLVGILFPAILAKQFTKCELSQLLK--DIDGYGGIALPELICTMFHTSGYD
: . :. .. .. :: . .:::. :. ..:: : .. . .: . ::.:
CCDS14 MKAWGTVVVTLATLMVVTVDAKIYERCELAARLERAGLNGYKGYGVGDWLCMAHYESGFD
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 TQAIVENNE--STEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKI
: :.:..: :.:::.::... :: .. .: ..:.: ..: .:. : ::: :::.:
CCDS14 T-AFVDHNPDGSSEYGIFQLNSAWWCDNGITP-TKNLCHMDCHDLLNRHILDDIRCAKQI
70 80 90 100 110

120 130 140
pF1KE1 LDIK-GIDYWLAHKALCT-EKLEQWL--CEKL
.. . :.. : . . :. . : .:: :.
CCDS14 VSSQNGLSAWTSWRLHCSGHDLSEWLKGCDMHVKIDPKIHP
120 130 140 150

>>CCDS35238.1 SPACA5B gene_id:729201|Hs108|chrX (159 aa)
 initn: 260 init1: 160 opt: 294 Z-score: 409.8 bits: 81.9 E(32554): 1.7e-16
Smith-Waterman score: 294; 32.6% identity (66.0% similar) in 144 aa overlap (5-140:7-148)

10 20 30 40 50
pF1KE1   MRFFVPLFLVGILFPAILAKQFTKCELSQLLK--DIDGYGGIALPELICTMFHTSGYD
: . :. .. .. :: . .:::. :. ..:: : .. . .: . ::.:
CCDS35 MKAWGTVVVTLATLMVVTVDAKIYERCELAARLERAGLNGYKGYGVGDWLCMAHYESGFD
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 TQAIVENNE--STEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKI
: :.:..: :.:::.::... :: .. .: ..:.: ..: .:. : ::: :::.:
CCDS35 T-AFVDHNPDGSSEYGIFQLNSAWWCDNGITP-TKNLCHMDCHDLLNRHILDDIRCAKQI
70 80 90 100 110

120 130 140
pF1KE1 LDIK-GIDYWLAHKALCT-EKLEQWL--CEKL
.. . :.. : . . :. . : .:: :.
CCDS35 VSSQNGLSAWTSWRLHCSGHDLSEWLKGCDMHVKIDPKIHP
120 130 140 150

>>CCDS11302.1 LYZL6 gene_id:57151|Hs108|chr17 (148 aa)
 initn: 108 init1: 75 opt: 284 Z-score: 396.7 bits: 79.3 E(32554): 8.9e-16
Smith-Waterman score: 284; 31.4% identity (66.4% similar) in 137 aa overlap (7-138:7-142)

10 20 30 40 50
pF1KE1 MRFFVPLFLVGILFPAILAKQFTKCELSQLLK--DIDGYGGIALPELICTMFHTSGYDTQ
..::. .. :. ...:.:.:.:. :.::. : .: . .: : : .. .
CCDS11 MTKALLIYLVSSFLALNQASLISRCDLAQVLQLEDLDGFEGYSLSDWLCLAFVESKFNIS
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 AIVENNE-STEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKILD-
: :: . : .::::::... ::.. . :.:.: ..:. .:. .. : :::.:..
CCDS11 KINENADGSFDYGLFQINSHYWCNDYK-SYSENLCHVDCQDLLNPNLLAGIHCAKRIVSG
70 80 90 100 110

120 130 140
pF1KE1 IKGIDYWLAHKALCTEK-LEQWLCEKL
.:.. :. . :. . : ::
CCDS11 ARGMNNWVEWRLHCSGRPLFYWLTGCRLR
120 130 140

>>CCDS82105.1 SPACA3 gene_id:124912|Hs108|chr17 (112 aa)
 initn: 159 init1: 84 opt: 231 Z-score: 326.3 bits: 65.9 E(32554): 7.4e-12
Smith-Waterman score: 231; 34.0% identity (65.0% similar) in 100 aa overlap (46-140:13-111)

20 30 40 50 60 70
pF1KE1 AILAKQFTKCELSQLLKDIDGYGGIALPELICTMFHTSGYDTQAI-VENNESTEYGLFQI
.: . :::... :. : . ::. :.:::
CCDS82                   MVSALRGAPLIRVCLAYFTSGFNAAALDYEADGSTNNGIFQI
10 20 30 40

80 90 100 110 120 130
pF1KE1 SNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKIL-DIKGIDYWLAHKALCTEK
... :: :. .:. :.: . :. .:. .. : ..:: :: . .:. :: : . : :
CCDS82 NSRRWC-SNLTPNVPNVCRMYCSDLLNPNLKDTVICAMKITQEPQGLGYWEAWRHHCQGK
50 60 70 80 90 100

140
pF1KE1 -LEQWL--CEKL
: .:. :.
CCDS82 DLTEWVDGCDF
110

>>CCDS2697.1 LYZL4 gene_id:131375|Hs108|chr3 (146 aa)
 initn: 191 init1: 81 opt: 227 Z-score: 319.0 bits: 64.9 E(32554): 1.9e-11
Smith-Waterman score: 227; 30.3% identity (57.2% similar) in 145 aa overlap (1-138:1-141)

10 20 30 40 50
pF1KE1 MRFFVPLFLVGILFPAILAKQFTKCELSQLLKD--IDGYGGIALPELICTMFHTSGYDTQ
:. : : :.: : : . .: ... :.: .: . : .: . .: . : .. .
CCDS26 MKASVVLSLLGYLVVPSGAYILGRCTVAKKLHDGGLDYFEGYSLENWVCLAYFESKFNPM
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 AIVENNES--TEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKILD
:: ::.. : .::::. .. :: . ..:: : .::. .:. .. : ::: :.
CCDS26 AIYENTREGYTGFGLFQMRGSDWCGD----HGRNRCHMSCSALLNPNLEKTIKCAKTIVK
70 80 90 100 110

120 130 140
pF1KE1 IK-GIDYWLAHKALC--TEKLEQWLCEKL
: :. : . . : .. : .::
CCDS26 GKEGMGAWPTWSRYCQYSDTLARWLDGCKL
120 130 140

142 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 14:52:31 2016 done: Sun Nov 6 14:52:32 2016
 Total Scan time: 1.590 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]