FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8912, 251 aa 1>>>pF1KB8912 251 - 251 aa - 251 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.9614+/-0.00109; mu= -9.9643+/- 0.067 mean_var=614.6302+/-124.764, 0's: 0 Z-trim(118.6): 72 B-trim: 74 in 1/54 Lambda= 0.051733 statistics sampled from 19462 (19535) to 19462 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.845), E-opt: 0.2 (0.6), width: 16 Scan time: 2.980 The best scores are: opt bits E(32554) CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 ( 251) 1845 151.0 7.1e-37 CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 ( 264) 925 82.4 3.4e-16 CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 ( 255) 856 77.2 1.2e-14 CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 ( 320) 743 68.9 4.7e-12 >>CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 (251 aa) initn: 1845 init1: 1845 opt: 1845 Z-score: 775.5 bits: 151.0 E(32554): 7.1e-37 Smith-Waterman score: 1845; 100.0% identity (100.0% similar) in 251 aa overlap (1-251:1-251) 10 20 30 40 50 60 pF1KB8 MAMSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRRESSFQPEAGFGRRAAC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MAMSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRRESSFQPEAGFGRRAAC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 TVQRYAACRDPGPPPPPPPPPPPPPPPGLSPRAPAPPPAGALLPEPGQRCEAVSSSPPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TVQRYAACRDPGPPPPPPPPPPPPPPPGLSPRAPAPPPAGALLPEPGQRCEAVSSSPPPP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 PCAQNPLHPSPSHSACKEPVVYPWMRKVHVSTVNPNYAGGEPKRSRTAYTRQQVLELEKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PCAQNPLHPSPSHSACKEPVVYPWMRKVHVSTVNPNYAGGEPKRSRTAYTRQQVLELEKE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 FHYNRYLTRRRRVEIAHALCLSERQIKIWFQNRRMKWKKDHKLPNTKIRSGGAAGSAGGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 FHYNRYLTRRRRVEIAHALCLSERQIKIWFQNRRMKWKKDHKLPNTKIRSGGAAGSAGGP 190 200 210 220 230 240 250 pF1KB8 PGRPNGGPRAL ::::::::::: CCDS11 PGRPNGGPRAL 250 >>CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 (264 aa) initn: 1096 init1: 704 opt: 925 Z-score: 404.2 bits: 82.4 E(32554): 3.4e-16 Smith-Waterman score: 953; 59.9% identity (71.8% similar) in 252 aa overlap (1-237:1-231) 10 20 30 40 50 60 pF1KB8 MAMSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRRESSFQPEAGFGRRAAC : :::.:..:::.::::::::::::..:.: .::: :: :. :::.:: . CCDS88 MIMSSYLMDSNYIDPKFPPCEEYSQNSYIP-EHSPEYY--GRTRESGFQHHH-------- 10 20 30 40 70 80 90 100 110 pF1KB8 TVQRYAACRDPGPPPPPPPPPPPPP-------PPGLSPRAPAPPPAGALLPEPGQR-CE- .. ::::: : : :: : :. .: :: :: .: :: CCDS88 --------QELYPPPPPRPSYPERQYSCTSLQGPGNS-RGHGPAQAGHHHPEKSQSLCEP 50 60 70 80 90 100 120 130 140 150 160 pF1KB8 ------AVSSSPPPPPCAQNPLHPSPSHSACKEPVVYPWMRKVHVSTVNPNYAGGEPKRS ..: :: :: :.: : :: .: :.:.:::::.:.::::::::: ::::::: CCDS88 APLSGASASPSPAPPACSQ-PAPDHPSSAASKQPIVYPWMKKIHVSTVNPNYNGGEPKRS 110 120 130 140 150 170 180 190 200 210 220 pF1KB8 RTAYTRQQVLELEKEFHYNRYLTRRRRVEIAHALCLSERQIKIWFQNRRMKWKKDHKLPN :::::::::::::::::::::::::::.::::.:::::::::::::::::::::::.::: CCDS88 RTAYTRQQVLELEKEFHYNRYLTRRRRIEIAHSLCLSERQIKIWFQNRRMKWKKDHRLPN 160 170 180 190 200 210 230 240 250 pF1KB8 TKIRSGGAAGSAGGPPGRPNGGPRAL ::.::. ::.: CCDS88 TKVRSAPPAGAAPSTLSAATPGTSEDHSQSATPPEQQRAEDITRL 220 230 240 250 260 >>CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 (255 aa) initn: 965 init1: 599 opt: 856 Z-score: 376.5 bits: 77.2 E(32554): 1.2e-14 Smith-Waterman score: 862; 58.2% identity (72.5% similar) in 244 aa overlap (1-237:1-229) 10 20 30 40 50 60 pF1KB8 MAMSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRRESSFQPEAGFGRRAAC :.:::...::.:::::::::::: :. :: .... ::.:: . ..::: :. : CCDS22 MVMSSYMVNSKYVDPKFPPCEEYLQGGYL-GEQGADYYGGGAQG-ADFQPP-GLYPRPDF 10 20 30 40 50 70 80 90 100 110 pF1KB8 TVQRYAACRDPGPPPPPPP--PPPPPPPPG---LSPRAPAP-PPAGALLPEPGQRCEAVS : ... ::: : : :: .: : : ::: : :: : : : CCDS22 GEQPFGGS-GPGPGSALPARGHGQEPGGPGGHYAAPGEPCPAPPAPPPAPLPGAR--AYS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 SSPPPPPCAQNPLHPSPSHSACKEP-VVYPWMRKVHVSTVNPNYAGGEPKRSRTAYTRQQ .: : : :: .: :.: ::::::.::::..:::::.::::::::::::::: CCDS22 QSDPKQP---------PSGTALKQPAVVYPWMKKVHVNSVNPNYTGGEPKRSRTAYTRQQ 120 130 140 150 160 180 190 200 210 220 230 pF1KB8 VLELEKEFHYNRYLTRRRRVEIAHALCLSERQIKIWFQNRRMKWKKDHKLPNTKIRSGGA :::::::::.:::::::::.::::.::::::::::::::::::::::::::::: ::... CCDS22 VLELEKEFHFNRYLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNTKGRSSSS 170 180 190 200 210 220 240 250 pF1KB8 AGSAGGPPGRPNGGPRAL ..:. CCDS22 SSSSSCSSSVAPSQHLQPMAKDHHTDLTTL 230 240 250 >>CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 (320 aa) initn: 881 init1: 686 opt: 743 Z-score: 329.9 bits: 68.9 E(32554): 4.7e-12 Smith-Waterman score: 803; 50.2% identity (66.7% similar) in 255 aa overlap (12-243:42-296) 10 20 30 pF1KB8 MAMSSFLINSNYVDPKFPPCEEYS-QSDYLPS-----DHSP : .: :: .. :. :: . . CCDS54 YIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQLPHAGGGREPTA 20 30 40 50 60 70 40 50 60 70 80 pF1KB8 GYYAGGQRRESSFQPEAGFGRRAACTVQR---YAACRDPGPPPPPPPPPPPPPPPG---- .::: :: .. : . ..: . : . .:: :: : :: :. CCDS54 SYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPPAQAKGPAHGLH 80 90 100 110 120 130 90 100 110 120 130 pF1KB8 ----LSPRAPAPPPAGALLPEPGQRCEAVSSSPPPPPCAQNPLHP------SPSHSACKE :.:. : : :. : .::::. ..: : .. : : :: :: CCDS54 ASHVLQPQLPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLADKSPLGLKGKE 140 150 160 170 180 190 140 150 160 170 180 190 pF1KB8 PVVYPWMRKVHVSTVNPNYAGGEPKRSRTAYTRQQVLELEKEFHYNRYLTRRRRVEIAHA :::::::.:.:::.:::.: ::::::::::::::::::::::::.:::::::::.::::. CCDS54 PVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYLTRRRRIEIAHT 200 210 220 230 240 250 200 210 220 230 240 250 pF1KB8 LCLSERQIKIWFQNRRMKWKKDHKLPNTKIRSGGAAGSAGGPPGRPNGGPRAL :::::::.:::::::::::::::::::::.::...:....::::. CCDS54 LCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQTQSPHLHPHPHPS 260 270 280 290 300 310 CCDS54 TSTPVPSSI 320 251 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 02:25:06 2016 done: Sun Nov 6 02:25:06 2016 Total Scan time: 2.980 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]