FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8944, 304 aa
  1>>>pF1KB8944 304 - 304 aa - 304 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.2466+/-0.000811; mu= 9.5825+/- 0.050
 mean_var=199.1455+/-41.567, 0's: 0 Z-trim(114.7): 138 B-trim: 11 in 1/51
 Lambda= 0.090884
 statistics sampled from 15135 (15288) to 15135 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.795), E-opt: 0.2 (0.47), width: 16
 Scan time: 2.830

The best scores are:                                      opt bits E(32554)
CCDS8867.1 HOXC11 gene_id:3227|Hs108|chr12         ( 304) 2095 286.4 1.8e-77
CCDS2265.1 HOXD11 gene_id:3237|Hs108|chr2          ( 338)  643  96.1   4e-20
CCDS5411.1 HOXA11 gene_id:3207|Hs108|chr7          ( 313)  518  79.7 3.2e-15

>>CCDS8867.1 HOXC11 gene_id:3227|Hs108|chr12 (304 aa)
 initn: 2095 init1: 2095 opt: 2095 Z-score: 1504.3 bits: 286.4 E(32554): 1.8e-77
Smith-Waterman score: 2095; 99.7% identity (99.7% similar) in 304 aa overlap (1-304:1-304)

               10        20        30        40        50        60
pF1KB8 MFNSVNLGNFCSPSRKERGADFGERGSCASNLYLPSCTYYMPEFSTVFSFLPQAPSRQIS
       ::::::::::::::::::::::::::::::::::::::::::::::: ::::::::::::
CCDS88 MFNSVNLGNFCSPSRKERGADFGERGSCASNLYLPSCTYYMPEFSTVSSFLPQAPSRQIS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 YPYSAQVPPVREVSYGLEPSGKWHHRNSYSSCYAAADELMHRECLPPSTVTEILMKNEGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 YPYSAQVPPVREVSYGLEPSGKWHHRNSYSSCYAAADELMHRECLPPSTVTEILMKNEGS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 YGGHHHPSAPHATPAGFYSSVNKNSVLPQAFDRFFDNAYCGGGDPPAEPPCSGKGEAKGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 YGGHHHPSAPHATPAGFYSSVNKNSVLPQAFDRFFDNAYCGGGDPPAEPPCSGKGEAKGE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 PEAPPASGLASRAEAGAEAEAEEENTNPSSSGSAHSVAKEPAKGAAPNAPRTRKKRCPYS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 PEAPPASGLASRAEAGAEAEAEEENTNPSSSGSAHSVAKEPAKGAAPNAPRTRKKRCPYS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 KFQIRELEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLSRDRLQYFSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 KFQIRELEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLSRDRLQYFSG
              250       260       270       280       290       300


pF1KB8 NPLL
       ::::
CCDS88 NPLL


>>CCDS2265.1 HOXD11 gene_id:3237|Hs108|chr2 (338 aa)
 initn: 833 init1: 466 opt: 643 Z-score: 474.8 bits: 96.1 E(32554): 4e-20
Smith-Waterman score: 792; 44.5% identity (66.1% similar) in 339 aa overlap (21-304:3-338)

               10        20        30        40          50
pF1KB8 MFNSVNLGNFCSPSRKERGADFGERGSCASNLYLPSCTYYM-P-EFSTVFSFLPQAPSRQ
                           :: : :. :...:::.:.::. : .:..  ::: :  : :
CCDS22                   MNDFDECGQSAASMYLPGCAYYVAPSDFASKPSFLSQPSSCQ
                                 10        20        30        40

       60            70           80
pF1KB8 ISYPYSA----QVPPVREVS---YGLEPSGKWHHRNS-----------------------
       ...:::.    .: :::::.   ::::  .:: .:..
CCDS22 MTFPYSSNLAPHVQPVREVAFRDYGLE-RAKWPYRGGGGGGSAGGGSSGGGPGGGGGGAG
             50        60         70        80        90       100

        90                  100        110        120       130
pF1KB8 -YSSCYAAA-----------DELMHRECLPPS-TVTEILMKN-EGSYGGHHHPSAPHATP
        :.  ::::           .  :.:: :::.    ..:.:  :   ..   : .: ..
CCDS22 GYAPYYAAAAAAAAAAAAAEEAAMQRELLPPAGRRPDVLFKAPEPVCAAPGPPHGPAGAA
             110       120       130       140       150       160

          140       150           160       170        180
pF1KB8 AGFYSSVNKNSVLPQAFDRFFDNA----YCGGGDPPAEPPCSGKGEA-KGEPEAPPASGL
       ..:::.:..:..:::.::.:.. :    . :   ::   : . .: : ::.:..  ..:
CCDS22 SNFYSAVGRNGILPQGFDQFYEAAPGPPFAGPQPPPPPAPPQPEGAADKGDPRTGAGGGG
             170       180       190       200       210       220

     190          200       210       220        230       240
pF1KB8 AS---RAEAGAEAEAEEENTNPSSSGSAHSVAKEPAKGA-APNAPRTRKKRCPYSKFQIR
       .:   .:  :.: ..  :... .. :    .. : ...: ::.  :.:::::::.:.:::
CCDS22 GSPCTKATPGSEPKGAAEGSGGDGEGPPGEAGAEKSSSAVAPQ--RSRKKRCPYTKYQIR
             230       240       250       260         270

         250       260       270       280       290       300
pF1KB8 ELEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLSRDRLQYFSGNPLL
       :::::::::::::::::::::::::::::::::::::::::::::.:::::::.::::.
CCDS22 ELEREFFFNVYINKEKRLQLSRMLNLTDRQVKIWFQNRRMKEKKLNRDRLQYFTGNPLF
     280       290       300       310       320       330

>>CCDS5411.1 HOXA11 gene_id:3207|Hs108|chr7 (313 aa)
 initn: 766 init1: 480 opt: 518 Z-score: 386.7 bits: 79.7 E(32554): 3.2e-15
Smith-Waterman score: 956; 52.2% identity (73.0% similar) in 318 aa overlap (21-304:2-313)

               10        20        30        40          50
pF1KB8 MFNSVNLGNFCSPSRKERGADFGERGSCASNLYLPSCTYYM--PEFSTVFSFLPQAPS-R
                           :: ::: :.::.::::::::.  :.::.. :::::.:: :
CCDS54                    MDFDERGPCSSNMYLPSCTYYVSGPDFSSLPSFLPQTPSSR
                                  10        20        30        40

        60           70           80        90       100        110
pF1KB8 QISYPYSAQVP---PVREVS---YGLEPSGKWHHRNSYSSCYAAADELMHRECLP-PSTV
        ..: ::...:   :::::.   :..::. ::: :.. . ::.: .::.::.::  ::..
CCDS54 PMTYSYSSNLPQVQPVREVTFREYAIEPATKWHPRGNLAHCYSA-EELVHRDCLQAPSAA
              50        60        70        80         90       100

                 120       130       140       150       160
pF1KB8 T---EILMKNEGSYGGHHHPSAPHATPAGFYSSVNKNSVLPQAFDRFFDNAYCGGGDPPA
           ..: :. ..   .:::. : :. ..:::.:..:.:::::::.::..:: :  .  :
CCDS54 GVPGDVLAKSSANV--YHHPT-P-AVSSNFYSTVGRNGVLPQAFDQFFETAY-GTPENLA
              110          120        130       140        150

       170        180       190                          200
pF1KB8 EPPCSG-KGEAKGEPEAPPASGLASRAEAGAEA-------------------EAEEENTN
            : :.  :: : :  .:. :. : .:: :                   : .:.
CCDS54 SSDYPGDKSAEKGPPAATATSAAAAAAATGAPATSSSDSGGGGGCRETAAAAEEKERRRR
         160       170       180       190       200       210

       210       220        230       240       250       260
pF1KB8 PSSSGSAHSVAKEPA-KGAAPNAPRTRKKRCPYSKFQIRELEREFFFNVYINKEKRLQLS
       : ::.: .: . .   :... .. :::::::::.:.:::::::::::.::::::::::::
CCDS54 PESSSSPESSSGHTEDKAGGSSGQRTRKKRCPYTKYQIRELEREFFFSVYINKEKRLQLS
         220       230       240       250       260       270

        270       280       290       300
pF1KB8 RMLNLTDRQVKIWFQNRRMKEKKLSRDRLQYFSGNPLL
       :::::::::::::::::::::::..::::::.:.::::
CCDS54 RMLNLTDRQVKIWFQNRRMKEKKINRDRLQYYSANPLL
         280       290       300       310


304 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 04:31:44 2016 done: Sun Nov 6 04:31:44 2016
 Total Scan time: 2.830 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]