FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6218, 290 aa 1>>>pF1KE6218 290 - 290 aa - 290 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3253+/-0.000914; mu= 15.1161+/- 0.055 mean_var=63.2177+/-12.479, 0's: 0 Z-trim(104.6): 17 B-trim: 0 in 0/49 Lambda= 0.161308 statistics sampled from 7987 (7995) to 7987 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.618), E-opt: 0.2 (0.246), width: 16 Scan time: 2.000 The best scores are: opt bits E(32554) CCDS47325.1 SGCD gene_id:6444|Hs108|chr5 ( 290) 1891 448.7 2.3e-126 CCDS47327.1 SGCD gene_id:6444|Hs108|chr5 ( 289) 1884 447.1 6.9e-126 CCDS47326.1 SGCD gene_id:6444|Hs108|chr5 ( 256) 1512 360.5 7.2e-100 CCDS5992.2 SGCZ gene_id:137868|Hs108|chr8 ( 312) 1144 274.9 5.1e-74 CCDS9299.1 SGCG gene_id:6445|Hs108|chr13 ( 291) 1029 248.1 5.5e-66 >>CCDS47325.1 SGCD gene_id:6444|Hs108|chr5 (290 aa) initn: 1891 init1: 1891 opt: 1891 Z-score: 2382.3 bits: 448.7 E(32554): 2.3e-126 Smith-Waterman score: 1891; 100.0% identity (100.0% similar) in 290 aa overlap (1-290:1-290) 10 20 30 40 50 60 pF1KE6 MMPQEQYTHHRSTMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMILILVNLAMTIWILKVM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MMPQEQYTHHRSTMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMILILVNLAMTIWILKVM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 NFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSRPGNALYFKSARNVTVNILNDQTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 NFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSRPGNALYFKSARNVTVNILNDQTK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 VLTQLITGPKAVEAYGKKFEVKTVSGKLLFSADNNEVVVGAERLRVLGAEGTVFPKSIET :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VLTQLITGPKAVEAYGKKFEVKTVSGKLLFSADNNEVVVGAERLRVLGAEGTVFPKSIET 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 PNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNMEATCRTELRLESKDGEIKLDAAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 PNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNMEATCRTELRLESKDGEIKLDAAK 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 IRLPRLPHGSYTPTGTRQKVFEICVCANGRLFLSQAGAGSTCQINTSVCL :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 IRLPRLPHGSYTPTGTRQKVFEICVCANGRLFLSQAGAGSTCQINTSVCL 250 260 270 280 290 >>CCDS47327.1 SGCD gene_id:6444|Hs108|chr5 (289 aa) initn: 1884 init1: 1884 opt: 1884 Z-score: 2373.5 bits: 447.1 E(32554): 6.9e-126 Smith-Waterman score: 1884; 100.0% identity (100.0% similar) in 289 aa overlap (2-290:1-289) 10 20 30 40 50 60 pF1KE6 MMPQEQYTHHRSTMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMILILVNLAMTIWILKVM ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MPQEQYTHHRSTMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMILILVNLAMTIWILKVM 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 NFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSRPGNALYFKSARNVTVNILNDQTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 NFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSRPGNALYFKSARNVTVNILNDQTK 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 VLTQLITGPKAVEAYGKKFEVKTVSGKLLFSADNNEVVVGAERLRVLGAEGTVFPKSIET :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VLTQLITGPKAVEAYGKKFEVKTVSGKLLFSADNNEVVVGAERLRVLGAEGTVFPKSIET 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE6 PNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNMEATCRTELRLESKDGEIKLDAAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 PNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNMEATCRTELRLESKDGEIKLDAAK 180 190 200 210 220 230 250 260 270 280 290 pF1KE6 IRLPRLPHGSYTPTGTRQKVFEICVCANGRLFLSQAGAGSTCQINTSVCL :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 IRLPRLPHGSYTPTGTRQKVFEICVCANGRLFLSQAGAGSTCQINTSVCL 240 250 260 270 280 >>CCDS47326.1 SGCD gene_id:6444|Hs108|chr5 (256 aa) initn: 1532 init1: 1512 opt: 1512 Z-score: 1906.5 bits: 360.5 E(32554): 7.2e-100 Smith-Waterman score: 1512; 99.1% identity (100.0% similar) in 235 aa overlap (1-235:1-235) 10 20 30 40 50 60 pF1KE6 MMPQEQYTHHRSTMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMILILVNLAMTIWILKVM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MMPQEQYTHHRSTMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMILILVNLAMTIWILKVM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 NFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSRPGNALYFKSARNVTVNILNDQTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 NFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSRPGNALYFKSARNVTVNILNDQTK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 VLTQLITGPKAVEAYGKKFEVKTVSGKLLFSADNNEVVVGAERLRVLGAEGTVFPKSIET :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VLTQLITGPKAVEAYGKKFEVKTVSGKLLFSADNNEVVVGAERLRVLGAEGTVFPKSIET 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 PNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNMEATCRTELRLESKDGEIKLDAAK :::::::::::::::::::::::::::::::::::::::::::::::::::::.. CCDS47 PNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNMEATCRTELRLESKDGEVRDEKDR 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 IRLPRLPHGSYTPTGTRQKVFEICVCANGRLFLSQAGAGSTCQINTSVCL CCDS47 SSKSYSFNRPTLPITG 250 >>CCDS5992.2 SGCZ gene_id:137868|Hs108|chr8 (312 aa) initn: 940 init1: 524 opt: 1144 Z-score: 1442.3 bits: 274.9 E(32554): 5.1e-74 Smith-Waterman score: 1144; 57.2% identity (82.8% similar) in 297 aa overlap (2-290:14-310) 10 20 30 40 pF1KE6 MMPQEQY--THHRSTMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMILI : .::: . .....: . . :.: ::::::::::::::::::.. . CCDS59 MDRSTNLDIEELKMTREQYILATQQNNLPRTENAQLYPVGIYGWRKRCLYFFVLLLLVTM 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 LVNLAMTIWILKVMNFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSRPGNALYFKS .::::::::::::::::.:::::::.:.::..::: :::: :::.:::.:: . : ..: CCDS59 IVNLAMTIWILKVMNFTVDGMGNLRVTKKGIRLEGISEFLLPLYVKEIHSRKDSPLVLQS 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 ARNVTVNILNDQTKVLTQLITGPKAVEAYGKKFEVK-TVSGKLLFSADNNEVVVGAERLR :::::: : . .. :: : :::: :.:::. . .:..:::::..:...:::.:. CCDS59 DRNVTVNARNHMGQLTGQLTIGADAVEAQCKRFEVRASEDGRVLFSADEDEITIGAEKLK 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE6 VLGAEGTVFPKSIETPNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNMEATCRTEL : :.::.:: .:.:::..::.: ..::::::::::.::::.::...: ::...:::: :: CCDS59 VTGTEGAVFGHSVETPHIRAEPSQDLRLESPTRSLIMEAPRGVQVSAAAGDFKATCRKEL 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE6 RLESKDGEIKLDAAKIRLPRLPHGSYT---PTGT--RQKVFEICVCANGRLFLSQAGAGS .:.: .::: :.: :.: :: ::.. :... :: :.:.::: ::.:.:: ::.:: CCDS59 HLQSTEGEIFLNAETIKLGNLPTGSFSSSSPSSSSSRQTVYELCVCPNGKLYLSPAGVGS 250 260 270 280 290 300 290 pF1KE6 TCQINTSVCL ::: ....:: CCDS59 TCQSSSNICLWS 310 >>CCDS9299.1 SGCG gene_id:6445|Hs108|chr13 (291 aa) initn: 1027 init1: 1027 opt: 1029 Z-score: 1298.1 bits: 248.1 E(32554): 5.5e-66 Smith-Waterman score: 1029; 52.9% identity (81.4% similar) in 291 aa overlap (2-290:1-291) 10 20 30 40 50 pF1KE6 MMPQEQYTHHRS--TMPGSVGPQVYKVGIYGWRKRCLYFFVLLLMILILVNLAMTIWILK : .:::: . . :::.:::::::::::.:::::.:...::::.:::::: CCDS92 MVREQYTTATEGICIERPENQYVYKIGIYGWRKRCLYLFVLLLLIILVVNLALTIWILK 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 VMNFTIDGMGNLRITEKGLKLEGDSEFLQPLYAKEIQSRPGNALYFKSARNVTVNILNDQ :: :. :::.: .:. ::.:::.:::: :::::::.:: ..: ..:..::::: :.. CCDS92 VMWFSPAGMGHLCVTKDGLRLEGESEFLFPLYAKEIHSRVDSSLLLQSTQNVTVNARNSE 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 TKVLTQLITGPKAVEAYGKKFEVKTVSGKLLFSADNNEVVVGAERLRVLGAEGTVFPKSI .: .: .::: ::. ...:.... .:: ::..:..:::::...::: : ::..: .:. CCDS92 GEVTGRLKVGPKMVEVQNQQFQINSNDGKPLFTVDEKEVVVGTDKLRVTGPEGALFEHSV 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 ETPNVRADPFKELRLESPTRSLVMEAPKGVEINAEAGNMEATCRTELRLESKDGEIKLDA ::: ::::::..:::::::::: :.::.::.:.:.::..:: . .. ..:.:: . ::: CCDS92 ETPLVRADPFQDLRLESPTRSLSMDAPRGVHIQAHAGKIEALSQMDILFHSSDGMLVLDA 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 AKIRLPRLPHGSYTPTGTRQKVFEICVCANGRLFLSQAGAGSTCQINTSVCL . ::.: .:.. :.:. :...::::: .:.:.:: ::...::: .. .:: CCDS92 ETVCLPKLVQGTWGPSGSSQSLYEICVCPDGKLYLSVAGVSTTCQEHNHICL 240 250 260 270 280 290 290 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:11:03 2016 done: Tue Nov 8 11:11:03 2016 Total Scan time: 2.000 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]