FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8937, 288 aa 1>>>pF1KB8937 288 - 288 aa - 288 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.7468+/-0.00096; mu= -5.2121+/- 0.058 mean_var=447.7617+/-96.501, 0's: 0 Z-trim(117.7): 644 B-trim: 0 in 0/53 Lambda= 0.060611 statistics sampled from 17731 (18524) to 17731 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.569), width: 16 Scan time: 2.760 The best scores are: opt bits E(32554) CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 ( 288) 2004 188.4 5.4e-48 CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 ( 252) 671 71.7 6e-13 CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 ( 323) 635 68.7 6.3e-12 CCDS6633.1 KLF9 gene_id:687|Hs108|chr9 ( 244) 608 66.2 2.7e-11 >>CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 (288 aa) initn: 2004 init1: 2004 opt: 2004 Z-score: 975.2 bits: 188.4 E(32554): 5.4e-48 Smith-Waterman score: 2004; 100.0% identity (100.0% similar) in 288 aa overlap (1-288:1-288) 10 20 30 40 50 60 pF1KB8 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSAS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 LFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPSP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 AWSEPEPEAGLEPEREPGPAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEKVYGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AWSEPEPEAGLEPEREPGPAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEKVYGK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 SSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRFMRSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 SSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRFMRSD 190 200 210 220 230 240 250 260 270 280 pF1KB8 HLTKHARRHANFHPGMLQRRGGGSRTGSLSDYSRSDASSPTISPASSP :::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 HLTKHARRHANFHPGMLQRRGGGSRTGSLSDYSRSDASSPTISPASSP 250 260 270 280 >>CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 (252 aa) initn: 846 init1: 576 opt: 671 Z-score: 345.9 bits: 71.7 E(32554): 6e-13 Smith-Waterman score: 703; 48.1% identity (60.8% similar) in 291 aa overlap (2-288:3-247) 10 20 30 40 50 pF1KB8 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSA ::.: ::.:::. :...:: :::: : ::: ::. :: : : CCDS12 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPE----GAGPAA------------GLD-- 10 20 30 40 60 70 80 90 100 110 pF1KB8 SLFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPS .: : :::.:. :: ::: : ..:: :::::: CCDS12 -----VR--------------AARREAASPG---TPGP-PPPPPAASGPGP-GAAAAPHL 50 60 70 120 130 140 150 160 170 pF1KB8 PAWSEPEPEAGLEPEREPG---PAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEK : : : : :: ::.:. . :. . : . ..:.: . : : CCDS12 LAASILADLRG-GPGAAPGGASPASSSSAASSPSSGRAPGAAP-SAAAKSHRCPFPDCAK 80 90 100 110 120 130 180 190 200 210 220 230 pF1KB8 VYGKSSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRF .: ::::::.:::::::::::::.:: :.::::::::::::.:::::::.::::.: ::: CCDS12 AYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHHRTHTGEKRFSCPLCSKRF 140 150 160 170 180 190 240 250 260 270 280 pF1KB8 MRSDHLTKHARRHANFHPGMLQRRGGGSRTGSLSD-YSRSDASSPTISPASSP :::::.:::::: .::: .:.: :.:. : :: : :.::. ::: :: CCDS12 TRSDHLAKHARRHPGFHPDLLRR--PGARSTSPSDSLPCSLAGSPAPSPAPSPAPAGL 200 210 220 230 240 250 >>CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 (323 aa) initn: 687 init1: 553 opt: 635 Z-score: 327.6 bits: 68.7 E(32554): 6.3e-12 Smith-Waterman score: 703; 42.9% identity (60.2% similar) in 324 aa overlap (2-287:3-318) 10 20 30 40 pF1KB8 MAAAAYVDHFAAECLVSMSSRAVVH---------GPREGPE---SRPEGA-------AV ::.: .:.::::::::::. :::: : : : . ::.: . CCDS58 MSAAVACLDYFAAECLVSMSAGAVVHRRPPDPEGAGGAAGSEVGAAPPESALPGPGPPGP 10 20 30 40 50 60 50 60 70 80 90 pF1KB8 AATPTLPRVEERRDGKDSAS-LFVVARILADLNQQAPAPAPAERREGAAARKART---PC :..: ::.: : .:. ...: . ::: .. . . :. : .. . :: CCDS58 ASVPQLPQVPAPSPGAGGAAPHLLAASVWADLRGSSGEGSWENSGEAPRASSGFSDPIPC 70 80 90 100 110 120 100 110 120 130 140 pF1KB8 RLPPPAPEPTSPGAEGAAAAPPSPAWSEPEPEAGLEPEREPGPAGSG-----------EP . : : .: : ::::. .: : : . : .::.:: : CCDS58 SVQTPCSE-LAP-ASGAAAVC-APESSSDAPAVPSAPAAPGAPAASGGFSGGALGAGPAP 130 140 150 160 170 150 160 170 180 190 200 pF1KB8 GLRQRVRRGRSRADLESPQRKHKCHYAGCEKVYGKSSHLKAHLRTHTGERPFACSWQDCN . : :: :: . ..:.: . :: :.: ::::::.: :::::::::.:.: ::. CCDS58 AADQAPRR-RS---VTPAAKRHQCPFPGCTKAYYKSSHLKSHQRTHTGERPFSCDWLDCD 180 190 200 210 220 230 210 220 230 240 250 260 pF1KB8 KKFARSDELARHYRTHTGEKKFSCPICEKRFMRSDHLTKHARRHANFHPGMLQRRGGGSR :::.::::::::::::::::.::::.: :.: :::::::::::: ..:: :.. :: : CCDS58 KKFTRSDELARHYRTHTGEKRFSCPLCPKQFSRSDHLTKHARRHPTYHPDMIEYRGR-RR 240 250 260 270 280 290 270 280 pF1KB8 TGS----LSDYSRSDASSPTISPASSP : :.. .:.::. .:: : CCDS58 TPRIDPPLTSEVESSASGSGPGPAPSFTTCL 300 310 320 >>CCDS6633.1 KLF9 gene_id:687|Hs108|chr9 (244 aa) initn: 717 init1: 574 opt: 608 Z-score: 316.3 bits: 66.2 E(32554): 2.7e-11 Smith-Waterman score: 714; 45.8% identity (65.5% similar) in 264 aa overlap (1-259:1-235) 10 20 30 40 50 pF1KB8 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDG----K :.::::.: ::.::::.:.::.: :..: :. : : ..:. : : CCDS66 MSAAAYMDFVAAQCLVSISNRAAV--PEHGVA--PD-AERLRLPEREVTKEHGDPGDTWK 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 DSASLFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAP-EPTSPGAEGAAA : .: ..:. : :::. : .: . : .: : . .. .. CCDS66 DYCTLVTIAKSLLDLNKYRPIQTP-------------SVCSDSLESPDEDMGSDSDVTTE 60 70 80 90 100 120 130 140 150 160 170 pF1KB8 APPSPAWSEPEPEAGLEPEREPGPAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCE . ::. : :: .: :.: . .::. . .:. .. ...::: :.:: CCDS66 SGSSPSHS---PEERQDPGSAPSPLSLLHPGV---AAKGK-----HASEKRHKCPYSGCG 110 120 130 140 150 180 190 200 210 220 230 pF1KB8 KVYGKSSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKR :::::::::::: :.::::::: :.: :: :::.:::::.::::::::::.: ::.:::: CCDS66 KVYGKSSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKR 160 170 180 190 200 210 240 250 260 270 280 pF1KB8 FMRSDHLTKHARRHANFHPGMLQRRGGGSRTGSLSDYSRSDASSPTISPASSP ::::::::::::::..:::.:..: CCDS66 FMRSDHLTKHARRHTEFHPSMIKRSKKALANAL 220 230 240 288 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:21:35 2016 done: Tue Nov 8 04:21:35 2016 Total Scan time: 2.760 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]