FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8859, 95 aa 1>>>pF1KB8859 95 - 95 aa - 95 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1993+/-0.000703; mu= 9.1865+/- 0.043 mean_var=58.1741+/-11.817, 0's: 0 Z-trim(109.1): 44 B-trim: 342 in 1/49 Lambda= 0.168155 statistics sampled from 10597 (10643) to 10597 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.734), E-opt: 0.2 (0.327), width: 16 Scan time: 1.380 The best scores are: opt bits E(32554) CCDS3391.1 S100P gene_id:6286|Hs108|chr4 ( 95) 601 153.4 1.9e-38 CCDS1047.1 S100A1 gene_id:6271|Hs108|chr1 ( 94) 325 86.5 2.7e-18 CCDS13736.1 S100B gene_id:6285|Hs108|chr21 ( 92) 315 84.0 1.4e-17 CCDS43333.1 S100Z gene_id:170591|Hs108|chr5 ( 99) 305 81.6 8.3e-17 CCDS1044.1 S100A2 gene_id:6273|Hs108|chr1 ( 97) 259 70.4 1.9e-13 CCDS1042.1 S100A4 gene_id:6275|Hs108|chr1 ( 101) 258 70.2 2.3e-13 CCDS1008.1 S100A10 gene_id:6281|Hs108|chr1 ( 97) 257 70.0 2.6e-13 CCDS1037.1 S100A12 gene_id:6283|Hs108|chr1 ( 92) 253 69.0 4.9e-13 CCDS1009.1 S100A11 gene_id:6282|Hs108|chr1 ( 105) 250 68.3 9.1e-13 CCDS30861.1 FLG2 gene_id:388698|Hs108|chr1 (2391) 243 67.1 4.5e-11 >>CCDS3391.1 S100P gene_id:6286|Hs108|chr4 (95 aa) initn: 601 init1: 601 opt: 601 Z-score: 803.6 bits: 153.4 E(32554): 1.9e-38 Smith-Waterman score: 601; 100.0% identity (100.0% similar) in 95 aa overlap (1-95:1-95) 10 20 30 40 50 60 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLKD 10 20 30 40 50 60 70 80 90 pF1KB8 LDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK ::::::::::::::::::::::::::::::::::: CCDS33 LDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK 70 80 90 >>CCDS1047.1 S100A1 gene_id:6271|Hs108|chr1 (94 aa) initn: 367 init1: 323 opt: 325 Z-score: 441.8 bits: 86.5 E(32554): 2.7e-18 Smith-Waterman score: 325; 55.7% identity (83.0% similar) in 88 aa overlap (2-89:3-90) 10 20 30 40 50 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLK .:::::: .:.:: .::.::. :.: ::: :.. :: :::.. :: :::::..: CCDS10 MGSELETAMETLINVFHAHSGKEGDKYKLSKKELKELLQTELSGFLDAQKDVDAVDKVMK 10 20 30 40 50 60 60 70 80 90 pF1KB8 DLDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK .:: :::..:::.:..:.:::.: ::...: CCDS10 ELDENGDGEVDFQEYVVLVAALTVACNNFFWENS 70 80 90 >>CCDS13736.1 S100B gene_id:6285|Hs108|chr21 (92 aa) initn: 315 init1: 315 opt: 315 Z-score: 428.8 bits: 84.0 E(32554): 1.4e-17 Smith-Waterman score: 315; 51.1% identity (81.1% similar) in 90 aa overlap (1-90:1-90) 10 20 30 40 50 60 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLKD :.::: :: .:::: .::: ::. . : :.::: :...:: ::. :....:::... CCDS13 MSELEKAMVALIDVFHQYSGREGDKHKLKKSELKELINNELSHFLEEIKEQEVVDKVMET 10 20 30 40 50 60 70 80 90 pF1KB8 LDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK :: .::.. ::.::..::: .:.:::..:: CCDS13 LDNDGDGECDFQEFMAFVAMVTTACHEFFEHE 70 80 90 >>CCDS43333.1 S100Z gene_id:170591|Hs108|chr5 (99 aa) initn: 305 init1: 305 opt: 305 Z-score: 415.2 bits: 81.6 E(32554): 8.3e-17 Smith-Waterman score: 305; 53.4% identity (78.4% similar) in 88 aa overlap (2-89:3-90) 10 20 30 40 50 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLK :.:: :: .: .: ::::.: . :.:::::.:...:: ::. :. . :::... CCDS43 MPTQLEMAMDTMIRIFHRYSGKERKRFKLSKGELKLLLQRELTEFLSCQKETQLVDKIVQ 10 20 30 40 50 60 60 70 80 90 pF1KB8 DLDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK ::::: : .:::.::.:.:::.: ::. :: CCDS43 DLDANKDNEVDFNEFVVMVAALTVACNDYFVEQLKKKGK 70 80 90 >>CCDS1044.1 S100A2 gene_id:6273|Hs108|chr1 (97 aa) initn: 276 init1: 258 opt: 259 Z-score: 355.1 bits: 70.4 E(32554): 1.9e-13 Smith-Waterman score: 259; 42.7% identity (75.3% similar) in 89 aa overlap (2-90:3-91) 10 20 30 40 50 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLK . :: :..... .: .:: .::. :.:::.: :..::::.:. :.... ::. CCDS10 MCSSLEQALAVLVTTFHKYSCQEGDKFKLSKGEMKELLHKELPSFVGEKVDEEGLKKLMG 10 20 30 40 50 60 60 70 80 90 pF1KB8 DLDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK .:: :.: ::::.:. ::.: :: :. .:. CCDS10 SLDENSDQQVDFQEYAVFLALITVMCNDFFQGCPDRP 70 80 90 >>CCDS1042.1 S100A4 gene_id:6275|Hs108|chr1 (101 aa) initn: 258 init1: 258 opt: 258 Z-score: 353.5 bits: 70.2 E(32554): 2.3e-13 Smith-Waterman score: 258; 43.7% identity (78.2% similar) in 87 aa overlap (4-90:5-91) 10 20 30 40 50 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLK :: :. .....: .:::.::. :.:.::: :. .:::.:: . :. : .::.. CCDS10 MACPLEKALDVMVSTFHKYSGKEGDKFKLNKSELKELLTRELPSFLGKRTDEAAFQKLMS 10 20 30 40 50 60 60 70 80 90 pF1KB8 DLDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK .::.: : .:::.:. ::.. :. :...:: CCDS10 NLDSNRDNEVDFQEYCVFLSCIAMMCNEFFEGFPDKQPRKK 70 80 90 100 >>CCDS1008.1 S100A10 gene_id:6281|Hs108|chr1 (97 aa) initn: 225 init1: 225 opt: 257 Z-score: 352.4 bits: 70.0 E(32554): 2.6e-13 Smith-Waterman score: 257; 45.5% identity (76.1% similar) in 88 aa overlap (2-89:3-87) 10 20 30 40 50 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLK ...: :: .. .: ...:..: ::: .:.::::::.::::.. :: ::::..: CCDS10 MPSQMEHAMETMMFTFHKFAGDKGY---LTKEDLRVLMEKEFPGFLENQKDPLAVDKIMK 10 20 30 40 50 60 70 80 90 pF1KB8 DLDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK ::: :..: :. :. ..:..: ::. :: CCDS10 DLDQCRDGKVGFQSFFSLIAGLTIACNDYFVVHMKQKGKK 60 70 80 90 >>CCDS1037.1 S100A12 gene_id:6283|Hs108|chr1 (92 aa) initn: 269 init1: 253 opt: 253 Z-score: 347.6 bits: 69.0 E(32554): 4.9e-13 Smith-Waterman score: 253; 45.1% identity (72.5% similar) in 91 aa overlap (1-91:1-91) 10 20 30 40 50 60 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLKD ::.:: . :...: .:: .: .::.::::: :. ::: . ... ::: ..:.... CCDS10 MTKLEEHLEGIVNIFHQYSVRKGHFDTLSKGELKQLLTKELANTIKNIKDKAVIDEIFQG 10 20 30 40 50 60 70 80 90 pF1KB8 LDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK :::: : ::::.::: .:: .: : . .: CCDS10 LDANQDEQVDFQEFISLVAIALKAAHYHTHKE 70 80 90 >>CCDS1009.1 S100A11 gene_id:6282|Hs108|chr1 (105 aa) initn: 250 init1: 250 opt: 250 Z-score: 342.7 bits: 68.3 E(32554): 9.1e-13 Smith-Waterman score: 250; 40.7% identity (74.7% similar) in 91 aa overlap (2-92:8-98) 10 20 30 40 50 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAV :: : . .: ::..:.:..: . ::.: :. .:. :: .: .. :: .. CCDS10 MAKISSPTETERCIESLIAVFQKYAGKDGYNYTLSKTEFLSFMNTELAAFTKNQKDPGVL 10 20 30 40 50 60 60 70 80 90 pF1KB8 DKLLKDLDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK :...: ::.:.:.:.:::::. ...... ::: : :: CCDS10 DRMMKKLDTNSDGQLDFSEFLNLIGGLAMACHDSFLKAVPSQKRT 70 80 90 100 >>CCDS30861.1 FLG2 gene_id:388698|Hs108|chr1 (2391 aa) initn: 243 init1: 243 opt: 243 Z-score: 312.2 bits: 67.1 E(32554): 4.5e-11 Smith-Waterman score: 243; 41.8% identity (71.4% similar) in 91 aa overlap (1-91:1-91) 10 20 30 40 50 60 pF1KB8 MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLQSGKDKDAVDKLLKD ::.: .. .:::: .:. ..: ::.::::: :.:::: :.. : :.:: ... CCDS30 MTDLLRSVVTVIDVFYKYTKQDGECGTLSKGELKELLEKELHPVLKNPDDPDTVDVIMHM 10 20 30 40 50 60 70 80 90 pF1KB8 LDANGDAQVDFSEFIVFVAAITSACHKYFEKAGLK :: . : ..::.::.... .: ::.: . : CCDS30 LDRDHDRRLDFTEFLLMIFKLTMACNKVLSKEYCKASGSKKHRRGHRHQEEESETEEDEE 70 80 90 100 110 120 CCDS30 DTPGHKSGYRHSSWSEGEEHGYSSGHSRGTVKCRHGSNSRRLGRQGNLSSSGNQEGSQKR 130 140 150 160 170 180 95 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:14:57 2016 done: Mon Nov 7 02:14:57 2016 Total Scan time: 1.380 Total Display time: -0.040 Function used was FASTA [36.3.4 Apr, 2011]