FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9693, 252 aa 1>>>pF1KB9693 252 - 252 aa - 252 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.7563+/-0.000945; mu= -9.7550+/- 0.057 mean_var=530.9813+/-117.846, 0's: 0 Z-trim(119.6): 532 B-trim: 1053 in 1/54 Lambda= 0.055659 statistics sampled from 20211 (20919) to 20211 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.859), E-opt: 0.2 (0.643), width: 16 Scan time: 3.110 The best scores are: opt bits E(32554) CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 ( 252) 1774 155.4 3.3e-38 CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 ( 323) 719 70.8 1.2e-12 CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 ( 288) 671 66.9 1.7e-11 >>CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 (252 aa) initn: 1774 init1: 1774 opt: 1774 Z-score: 799.3 bits: 155.4 E(32554): 3.3e-38 Smith-Waterman score: 1774; 100.0% identity (100.0% similar) in 252 aa overlap (1-252:1-252) 10 20 30 40 50 60 pF1KB9 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPEGAGPAAGLDVRAARREAASPGTPGPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPEGAGPAAGLDVRAARREAASPGTPGPPP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 PPPAASGPGPGAAAAPHLLAASILADLRGGPGAAPGGASPASSSSAASSPSSGRAPGAAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PPPAASGPGPGAAAAPHLLAASILADLRGGPGAAPGGASPASSSSAASSPSSGRAPGAAP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 SAAAKSHRCPFPDCAKAYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHHRT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 SAAAKSHRCPFPDCAKAYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHHRT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 HTGEKRFSCPLCSKRFTRSDHLAKHARRHPGFHPDLLRRPGARSTSPSDSLPCSLAGSPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 HTGEKRFSCPLCSKRFTRSDHLAKHARRHPGFHPDLLRRPGARSTSPSDSLPCSLAGSPA 190 200 210 220 230 240 250 pF1KB9 PSPAPSPAPAGL :::::::::::: CCDS12 PSPAPSPAPAGL 250 >>CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 (323 aa) initn: 1049 init1: 661 opt: 719 Z-score: 340.2 bits: 70.8 E(32554): 1.2e-12 Smith-Waterman score: 777; 50.4% identity (61.3% similar) in 282 aa overlap (38-250:38-318) 10 20 30 40 50 60 pF1KB9 VDYFAADVLMAISSGAVVHRGRPGPEGAGPAAGLDVRAARREAASPGTPGPPPP---P-- ::: .: :: :.: :: :::: : : CCDS58 LDYFAAECLVSMSAGAVVHRRPPDPEGAGGAAGSEVGAAPPESALPG-PGPPGPASVPQL 10 20 30 40 50 60 70 80 90 pF1KB9 PAASGPGPGAA-AAPHLLAASILADLRGGPGA---------------------------- : . .:.:::. :::::::::. :::::. : CCDS58 PQVPAPSPGAGGAAPHLLAASVWADLRGSSGEGSWENSGEAPRASSGFSDPIPCSVQTPC 70 80 90 100 110 120 100 110 120 pF1KB9 ---APG-GAS----PASSSSAASSPSSGRAPGA----------------APSA------- ::. ::. : :::.: . ::. :::: ::.: CCDS58 SELAPASGAAAVCAPESSSDAPAVPSAPAAPGAPAASGGFSGGALGAGPAPAADQAPRRR 130 140 150 160 170 180 130 140 150 160 170 pF1KB9 ----AAKSHRCPFPDCAKAYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHH ::: :.:::: :.:::::::::::: :::::::::.::: :::::.::::::::. CCDS58 SVTPAAKRHQCPFPGCTKAYYKSSHLKSHQRTHTGERPFSCDWLDCDKKFTRSDELARHY 190 200 210 220 230 240 180 190 200 210 220 230 pF1KB9 RTHTGEKRFSCPLCSKRFTRSDHLAKHARRHPGFHPDLLRRPGARSTSPSDSLPCSLAGS :::::::::::::: :.:.:::::.::::::: .:::... : : : : : . : CCDS58 RTHTGEKRFSCPLCPKQFSRSDHLTKHARRHPTYHPDMIEYRGRRRTPRIDPPLTSEVES 250 260 270 280 290 300 240 250 pF1KB9 PAPSPAPSPAPAGL : . .:.:::. CCDS58 SASGSGPGPAPSFTTCL 310 320 >>CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 (288 aa) initn: 846 init1: 576 opt: 671 Z-score: 320.0 bits: 66.9 E(32554): 1.7e-11 Smith-Waterman score: 678; 47.5% identity (60.2% similar) in 284 aa overlap (10-247:9-288) 10 20 30 40 pF1KB9 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPE----GAGPAA------------GLD-- .:::. :...:: :::: : ::: ::. :: : : CCDS10 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSA 10 20 30 40 50 50 60 70 pF1KB9 -----VR--------------AARREAASPG---TPGP-PPPPPAASGPGP-GAAAAPHL .: : :::.:. :: ::: : ..:: :::::: CCDS10 SLFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPS 60 70 80 90 100 110 80 90 100 110 120 130 pF1KB9 LAASILADLRG-GPGAAPGGASPASSSSAASSPSSGRAPGAAP-SAAAKSHRCPFPDCAK : : : : :: ::.:. . :. . : . ..:.: . : : CCDS10 PAWSEPEPEAGLEPEREPG---PAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEK 120 130 140 150 160 170 140 150 160 170 180 190 pF1KB9 AYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHHRTHTGEKRFSCPLCSKRF .: ::::::.:::::::::::::.:: :.::::::::::::.:::::::.::::.: ::: CCDS10 VYGKSSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRF 180 190 200 210 220 230 200 210 220 230 240 250 pF1KB9 TRSDHLAKHARRHPGFHPDLLRR--PGARSTSPSDSLPCSLAGSPAPSPAPSPAPAGL :::::.:::::: .::: .:.: :.:. : :: : :.::. ::: :: CCDS10 MRSDHLTKHARRHANFHPGMLQRRGGGSRTGSLSD-YSRSDASSPTISPASSP 240 250 260 270 280 252 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 13:35:46 2016 done: Sun Nov 6 13:35:46 2016 Total Scan time: 3.110 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]