FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6648, 223 aa
  1>>>pF1KE6648 223 - 223 aa - 223 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2717+/-0.000683; mu= 16.0386+/- 0.041
 mean_var=110.0446+/-20.882, 0's: 0 Z-trim(114.1): 61 B-trim: 17 in 1/51
 Lambda= 0.122262
 statistics sampled from 14608 (14672) to 14608 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.805), E-opt: 0.2 (0.451), width: 16
 Scan time: 1.600

The best scores are:                                  opt  bits E(32554)
CCDS2162.1 CFC1 gene_id:55997|Hs108|chr2      ( 223) 1631 297.4 4.8e-81
CCDS33286.1 CFC1B gene_id:653275|Hs108|chr2   ( 223) 1621 295.6 1.6e-80
CCDS74574.1 CFC1 gene_id:55997|Hs108|chr2     ( 191)  585 112.8 1.5e-25
CCDS74573.1 CFC1 gene_id:55997|Hs108|chr2     ( 148)  565 109.2 1.5e-24
CCDS54575.1 TDGF1 gene_id:6997|Hs108|chr3     ( 172)  319  65.9 1.9e-11
CCDS2742.1 TDGF1 gene_id:6997|Hs108|chr3      ( 188)  319  65.9   2e-11

>>CCDS2162.1 CFC1 gene_id:55997|Hs108|chr2 (223 aa)
 initn: 1631 init1: 1631 opt: 1631 Z-score: 1568.5 bits: 297.4 E(32554): 4.8e-81
Smith-Waterman score: 1631; 100.0% identity (100.0% similar) in 223 aa overlap (1-223:1-223)

               10        20        30        40        50        60
pF1KE6 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 SECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGAPSLLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 
SECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGAPSLLL 130 140 150 160 170 180 190 200 210 220 pF1KE6 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL ::::::::::::::::::::::::::::::::::::::::::: CCDS21 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL 190 200 210 220 >>CCDS33286.1 CFC1B gene_id:653275|Hs108|chr2 (223 aa) initn: 1621 init1: 1621 opt: 1621 Z-score: 1559.0 bits: 295.6 E(32554): 1.6e-80 Smith-Waterman score: 1621; 99.6% identity (99.6% similar) in 223 aa overlap (1-223:1-223) 10 20 30 40 50 60 pF1KE6 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR ::::::::::::::::: :::::::::::::::::::::::::::::::::::::::::: CCDS33 VTGSAEGWGPEEPLPYSWAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 SECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGAPSLLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGAPSLLL 130 140 150 160 170 180 190 200 210 220 pF1KE6 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL ::::::::::::::::::::::::::::::::::::::::::: CCDS33 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL 190 200 210 220 >>CCDS74574.1 CFC1 gene_id:55997|Hs108|chr2 (191 aa) initn: 623 init1: 565 opt: 585 Z-score: 572.2 bits: 112.8 E(32554): 1.5e-25 Smith-Waterman score: 585; 58.2% identity (68.1% similar) in 182 aa overlap (1-177:1-176) 10 20 30 40 50 60 pF1KE6 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNG----GTCVLGSFCVCPAHFTGRYCEH :::::::::::::::::::::: ..: : . : .: . .: : : . 
: CCDS74 VTGSAEGWGPEEPLPYSRAFGE-VNAAPWSTEPGPSAPATSAGASSGPCTASPSRRLTAV 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 DQRRSECGALEHGAWTLR-ACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGA .. : .: : : ::. : : : : . : :. . . :..:: CCDS74 TRKTSWPPTLTGRAPGARPACYSC-C--PAHSCT--ASCARMRPRTLGPWSLPSSSGSGA 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 PSLLLLLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL :. CCDS74 PAEGRDLGIAFNFLCCK 180 190 >>CCDS74573.1 CFC1 gene_id:55997|Hs108|chr2 (148 aa) initn: 565 init1: 565 opt: 565 Z-score: 554.4 bits: 109.2 E(32554): 1.5e-24 Smith-Waterman score: 876; 66.4% identity (66.4% similar) in 223 aa overlap (1-223:1-148) 10 20 30 40 50 60 pF1KE6 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 MTWRHHVRLLFTVSLALQIINLGNSYQREKHNGGREEVTKVATQKHRQSPLNWTSSHFGE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VTGSAEGWGPEEPLPYSRAFGEGASARPRCCRNGGTCVLGSFCVCPAHFTGRYCEHDQRR :::::::::::::::::::::: CCDS74 VTGSAEGWGPEEPLPYSRAFGE-------------------------------------- 70 80 130 140 150 160 170 180 pF1KE6 SECGALEHGAWTLRACHLCRCIFGALHCLPLQTPDRCDPKDFLASHAHGPSAGGAPSLLL ::::::::::::::::::::::: CCDS74 -------------------------------------DPKDFLASHAHGPSAGGAPSLLL 90 100 190 200 210 220 pF1KE6 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL ::::::::::::::::::::::::::::::::::::::::::: CCDS74 LLPCALLHRLLRPDAPAHPRSLVPSVLQRERRPCGRPGLGHRL 110 120 130 140 >>CCDS54575.1 TDGF1 gene_id:6997|Hs108|chr3 (172 aa) initn: 336 init1: 316 opt: 319 Z-score: 319.1 bits: 65.9 E(32554): 1.9e-11 Smith-Waterman score: 319; 37.7% identity (58.5% similar) in 130 aa overlap (68-184:36-165) 40 50 60 70 80 pF1KE6 VTKVATQKHRQSPLNWTSSHFGEVTGSAEGWGPEEPL--PYS--RAFGEGASARPR---- : ::: : : :. : . . CCDS54 VFELGLVAGLGHQEFARPSRGYLAFRDDSIWPQEEPAIRPRSSQRVPPMGIQHSKELNRT 10 20 30 40 50 60 90 100 110 120 130 140 pF1KE6 CCRNGGTCVLGSFCVCPAHFTGRYCEHDQRRSECGALEHGAWTLRACHLCRCIFGALHCL :: :::::.:::::.:: : :: :::: :. .::.. : .: . : ::.: : :.:. 
CCDS54 CCLNGGTCMLGSFCACPPSFYGRNCEHDVRKENCGSVPHDTWLPKKCSLCKCWHGQLRCF 70 80 90 100 110 120 150 160 170 180 190 200 pF1KE6 PLQTPDRCD----PKDFLASHA-HGPSAGGAPSLLLLLPCALLHRLLRPDAPAHPRSLVP : :: . ..::.. . : .. . ...:. : CCDS54 PQAFLPGCDGLVMDEHLVASRTPELPPSARTTTFMLVGICLSIQSYY 130 140 150 160 170 210 220 pF1KE6 SVLQRERRPCGRPGLGHRL >>CCDS2742.1 TDGF1 gene_id:6997|Hs108|chr3 (188 aa) initn: 316 init1: 316 opt: 319 Z-score: 318.7 bits: 65.9 E(32554): 2e-11 Smith-Waterman score: 319; 37.7% identity (58.5% similar) in 130 aa overlap (68-184:52-181) 40 50 60 70 80 pF1KE6 VTKVATQKHRQSPLNWTSSHFGEVTGSAEGWGPEEPL--PYS--RAFGEGASARPR---- : ::: : : :. : . . CCDS27 VFELGLVAGLGHQEFARPSRGYLAFRDDSIWPQEEPAIRPRSSQRVPPMGIQHSKELNRT 30 40 50 60 70 80 90 100 110 120 130 140 pF1KE6 CCRNGGTCVLGSFCVCPAHFTGRYCEHDQRRSECGALEHGAWTLRACHLCRCIFGALHCL :: :::::.:::::.:: : :: :::: :. .::.. : .: . : ::.: : :.:. CCDS27 CCLNGGTCMLGSFCACPPSFYGRNCEHDVRKENCGSVPHDTWLPKKCSLCKCWHGQLRCF 90 100 110 120 130 140 150 160 170 180 190 200 pF1KE6 PLQTPDRCD----PKDFLASHA-HGPSAGGAPSLLLLLPCALLHRLLRPDAPAHPRSLVP : :: . ..::.. . : .. . ...:. : CCDS27 PQAFLPGCDGLVMDEHLVASRTPELPPSARTTTFMLVGICLSIQSYY 150 160 170 180 210 220 pF1KE6 SVLQRERRPCGRPGLGHRL 223 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 15:06:17 2016 done: Tue Nov 8 15:06:18 2016 Total Scan time: 1.600 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]