FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7783, 457 aa
  1>>>pF1KB7783 457 - 457 aa - 457 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 11.2416+/-0.00103; mu= -7.2910+/- 0.061
 mean_var=437.8477+/-101.223, 0's: 0 Z-trim(116.2): 574 B-trim: 823 in 1/54
 Lambda= 0.061293
 statistics sampled from 16016 (16775) to 16016 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.797), E-opt: 0.2 (0.515), width: 16
 Scan time: 3.650

The best scores are:                                      opt  bits E(32554)
CCDS9448.1 KLF5 gene_id:688|Hs108|chr13           ( 457) 3180 295.2 9.2e-80
CCDS66562.1 KLF5 gene_id:688|Hs108|chr13          ( 366) 2537 238.3   1e-62
CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4          ( 345)  624  69.1 8.3e-12
CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13        ( 402)  608  67.7 2.5e-11
CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX         ( 359)  591  66.2 6.4e-11
CCDS12343.1 KLF2 gene_id:10365|Hs108|chr19        ( 355)  588  65.9 7.7e-11

>>CCDS9448.1 KLF5 gene_id:688|Hs108|chr13 (457 aa)
 initn: 3180 init1: 3180 opt: 3180 Z-score: 1545.5 bits: 295.2 E(32554): 9.2e-80
Smith-Waterman score: 3180; 100.0% identity (100.0% similar) in 457 aa overlap (1-457:1-457)

               10        20        30        40        50        60
pF1KB7 MATRVLSMSARLGPVPQPPAPQDEPVFAQLKPVLGAANPARDAALFPGEELKHAHHRPQA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 MATRVLSMSARLGPVPQPPAPQDEPVFAQLKPVLGAANPARDAALFPGEELKHAHHRPQA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 QPAPAQAPQPAQPPATGPRLPPEDLVQTRCEMEKYLTPQLPPVPIIPEHKKYRRDSASVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 QPAPAQAPQPAQPPATGPRLPPEDLVQTRCEMEKYLTPQLPPVPIIPEHKKYRRDSASVV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 DQFFTDTEGLPYSINMNVFLPDITHLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 DQFFTDTEGLPYSINMNVFLPDITHLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 APTQALPEFTSIFSSHQTAAPEVNNIFIKQELPTPDLHLSVPTQQGHLYQLLNTPDLDMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 APTQALPEFTSIFSSHQTAAPEVNNIFIKQELPTPDLHLSVPTQQGHLYQLLNTPDLDMP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 SSTNQTAAMDTLNVSMSAAMAGLNTHTSAVPQTAVKQFQGMPPCTYTMPSQFLPQQATYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 SSTNQTAAMDTLNVSMSAAMAGLNTHTSAVPQTAVKQFQGMPPCTYTMPSQFLPQQATYF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 PPSPPSSEPGSPDRQAEMLQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQPVRYN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 PPSPPSSEPGSPDRQAEMLQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQPVRYN
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB7 RRSNPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARSDEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS94 RRSNPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARSDEL
              370       380       390       400       410       420

              430       440       450
pF1KB7 TRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
       :::::::::::::::::::::::::::::::::::::
CCDS94 TRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
              430       440       450

>>CCDS66562.1 KLF5 gene_id:688|Hs108|chr13 (366 aa)
 initn: 2537 init1: 2537 opt: 2537 Z-score: 1239.4 bits: 238.3 E(32554): 1e-62
Smith-Waterman score: 2537; 100.0% identity (100.0% similar) in 366 aa overlap (92-457:1-366)

              70        80        90       100       110       120
pF1KB7 PAPAQAPQPAQPPATGPRLPPEDLVQTRCEMEKYLTPQLPPVPIIPEHKKYRRDSASVVD
                                     ::::::::::::::::::::::::::::::
CCDS66                               MEKYLTPQLPPVPIIPEHKKYRRDSASVVD
                                             10        20        30

             130       140       150       160       170       180
pF1KB7 QFFTDTEGLPYSINMNVFLPDITHLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 QFFTDTEGLPYSINMNVFLPDITHLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPPA
               40        50        60        70        80        90

             190       200       210       220       230       240
pF1KB7 PTQALPEFTSIFSSHQTAAPEVNNIFIKQELPTPDLHLSVPTQQGHLYQLLNTPDLDMPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 PTQALPEFTSIFSSHQTAAPEVNNIFIKQELPTPDLHLSVPTQQGHLYQLLNTPDLDMPS
              100       110       120       130       140       150

             250       260       270       280       290       300
pF1KB7 STNQTAAMDTLNVSMSAAMAGLNTHTSAVPQTAVKQFQGMPPCTYTMPSQFLPQQATYFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 STNQTAAMDTLNVSMSAAMAGLNTHTSAVPQTAVKQFQGMPPCTYTMPSQFLPQQATYFP
              160       170       180       190       200       210

             310       320       330       340       350       360
pF1KB7 PSPPSSEPGSPDRQAEMLQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQPVRYNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 PSPPSSEPGSPDRQAEMLQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQPVRYNR
              220       230       240       250       260       270

             370       380       390       400       410       420
pF1KB7 RSNPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARSDELT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 RSNPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARSDELT
              280       290       300       310       320       330

             430       440       450
pF1KB7 RHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
       ::::::::::::::::::::::::::::::::::::
CCDS66 RHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
              340       350       360

>>CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4 (345 aa)
 initn: 615 init1: 563 opt: 624 Z-score: 325.5 bits: 69.1 E(32554): 8.3e-12
Smith-Waterman score: 641; 35.7% identity (59.7% similar) in 370 aa overlap (102-455:6-342)

       80 90 100 110 120
pF1KB7 QPPATGPRLPPEDLVQTRCEMEKYLTPQLPPVPIIPEHKKYRRDSASVV--DQFFTDTEG
       :::. :. : .:: .... . .
CCDS34 MLMFDPVPV----KQEAMDPVSVSYPSNYMESMKP
       10 20 30

       130 140 150 160 170 180
pF1KB7 LPYSINMNVFLPDITHLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPPAPTQALPEF
       :.. ... ::. .... . :. ::: . .. ..:: : .. :
CCDS34 NKYGVIYSTPLPE------KFFQTPEGLSHGIQMEPVDLTVNKR--SSPPSAGNS--PSS
       40 50 60 70 80

       190 200 210 220 230 240
pF1KB7 TSIFSSHQTAAPEVNNIFIKQELPT--PDLHLSVPTQQGHLYQLLNTPDLDMPSSTNQTA
       .. :::. :.: .. .:. : .. : . : : ...: :.:: ..:
CCDS34 LKFPSSHRRASPGLS-------MPSSSPPIKKYSPPSPG--VQPFGVP-LSMPPV--MAA
       90 100 110 120

       250 260 270 280 290 300
pF1KB7 AMDTLNVSMSAAMAGLN-THTSAVPQTAVKQFQG--MPPCTYTMP----SQFLPQQATYF
       :.. .. . . .. . .. :: ....: : . : :. .: .:
CCDS34 ALSRHGIRSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYE
       130 140 150 160 170 180

       310 320 330 340 350
pF1KB7 PPSPPSS---EPG-SPDRQAEMLQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQP
       : .. ::: :.: . ....:: ... :. . : :... .. :
CCDS34 KPISQKKIKIEPGIEPQRTDYYPEEMSPP--LMNSVSPPQALLQENHPSVIVQPGKRPLP
       190 200 210 220 230 240

       360 370 380 390 400 410
pF1KB7 VRYNRRSNPDLE-KRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFA
       :. .:: . ::::: ::: ::.:::::::::::: :::::::::::::::: :.::
CCDS34 VE-----SPDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFA
       250 260 270 280 290 300

       420 430 440 450
pF1KB7 RSDELTRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
       ::::::::.::::: ::::: :.:::::::::::: :::
CCDS34 RSDELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV
       310 320 330 340

>>CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13 (402 aa)
 initn: 618 init1: 584 opt: 608 Z-score: 317.0 bits: 67.7 E(32554): 2.5e-11
Smith-Waterman score: 627; 34.5% identity (59.5% similar) in 385 aa overlap (100-455:26-399)

       70 80 90 100 110 120
pF1KB7 PAQPPATGPRLPPEDLVQTRCEMEKYLTPQLPPVPIIPEHKKYRRDSASVVDQFFTDTEG
       .: : . : . .. : .: . . : :.
CCDS94 MNIHMKRKTIKNINTFENRMLMLDGMPAVRVKTELLESEQGSPNVHN--YPDMEA
       10 20 30 40 50

       130 140 150 160 170 180
pF1KB7 LPYSINMNVFLPDITHLRTGLYKSQ-RPC---VTHIKTEPVAIFSHQSETTAPPPAP--T
       .: .: : : . ...: .: ... .: :.:. : :: .: :
CCDS94 VPLLLNNVKGEPPEDSLSVDHFQTQTEPVDLSINKARTSPTAVSSSPVSMTASASSPSST
       60 70 80 90 100 110

       190 200 210 220 230
pF1KB7 QALPEFTSIFSSHQTAAPEVNNIFIKQELPTPDLHLSVPTQQGHLYQLLNT-----PDLD
       .. .: ..: :. :.. .. . :: .. . : :.:. :.
CCDS94 STSSSSSSRLASSPTVITSVSSASSSSTVLTPGPLVASASGVGG-QQFLHIIHPVPPSSP
       120 130 140 150 160 170

       240 250 260 270 280 290
pF1KB7 MPSSTNQTA-------AMDTLNVSMSAAMAGLNTHTS-AVPQTAVKQFQGMPPCTYTMPS
       : ..:. . ..... : ..:. . :.... .:: . .: :
CCDS94 MNLQSNKLSHVHRIPVVVQSVPVVYTAVRSPGNVNNTIVVPLLEDGRGHGKAQMD---PR
       180 190 200 210 220

       300 310 320 330 340
pF1KB7 QFLPQQATY------FPPSPPSS--EPGSPDRQ-AEMLQNLTPPPSYAATIASKLAIHNP
       . :.:. .: .: : :: . :. .:.. : : .. . .. ..:
CCDS94 GLSPRQSKSDSDDDDLPNVTLDSVNETGSTALSIARAVQEVHPSP--VSRVRGN-RMNNQ
       230 240 250 260 270 280

       350 360 370 380 390 400
pF1KB7 NLPTTLPVNSQNIQPVRYNRRS-NPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGE
       ..: . .. .:. .: .::: .:: .::::: ::. ::.:::::::::::: ::::::
CCDS94 KFPCS--ISPFSIESTRRQRRSESPDSRKRRIHRCDFEGCNKVYTKSSHLKAHRRTHTGE
       290 300 310 320 330 340

       410 420 430 440 450
pF1KB7 KPYKCTWEGCDWRFARSDELTRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
       :::::::::: :.::::::::::::::::.:::.:. :.:::::::::::: .::
CCDS94 KPYKCTWEGCTWKFARSDELTRHYRKHTGVKPFKCADCDRSFSRSDHLALHRRRHMLV
       350 360 370 380 390 400

>>CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX (359 aa)
 initn: 617 init1: 555 opt: 591 Z-score: 309.5 bits: 66.2 E(32554): 6.4e-11
Smith-Waterman score: 600; 35.3% identity (58.8% similar) in 320 aa overlap (151-455:42-356)

       130 140 150 160 170 180
pF1KB7 DQFFTDTEGLPYSINMNVFLPDITHLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPP
       :.:. : . ..:. . .. ::
CCDS14 EVQLNSEGGSMQVFKQVTASVRNRDPPEIEYRSNMTSPTLLDANPMENPALFNDIKIEPP
       20 30 40 50 60 70

       190 200 210 220 230
pF1KB7 APTQA----LPEFTSI-FSSHQTAAPEVNNIFIKQEL--PTPDLH---LSVPTQQGHLYQ
       : ::. . .: :. :: ... . : :. : : :. . .
CCDS14 EELLASDFSLPQVEPVDLSFHKPKAPLQPASMLQAPIRPPKPQSSPQTLVVSTSTSDMST
       80 90 100 110 120 130

       240 250 260 270 280
pF1KB7 LLNTPDLDMP----SSTNQTAAMDTLNVSMSAAMAGLNTHTSAVPQTAVKQFQGMPPCTY
       : : . : .:...:.... :.: . ..: .. ... : :..:
CCDS14 SANIPTVLTPGSVLTSSQSTGSQQILHVIHTIPSVSLPNKMGGLKTIPVV-VQSLPMVYT
       140 150 160 170 180 190

       290 300 310 320 330 340
pF1KB7 TMPSQFLPQQATYFPPSPPSSEPGSPDRQAEMLQNLT-PPPSYAATIASKLAIHNPNLPT
       :.:.. : : . ... :: . .. : : : .:: : . : .
CCDS14 TLPADGGPAAITVPLIGGDGKNAGSVKVDPTSMSPLEIPSDSEESTIESG----SSALQS
       200 210 220 230 240

       350 360 370 380 390 400
pF1KB7 TLPVNSQNIQPVRYNRRSNPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKC
       .... .... . . ::..:::: ::. ::.:::::::::::: : :::::::::
CCDS14 LQGLQQEPAAMAQMQGEESLDLKRRRIHQCDFAGCSKVYTKSSHLKAHRRIHTGEKPYKC
       250 260 270 280 290 300

       410 420 430 440 450
pF1KB7 TWEGCDWRFARSDELTRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
       ::.::.:.::::::::::.::::: :::.: :::::::::::.:: .::
CCDS14 TWDGCSWKFARSDELTRHFRKHTGIKPFRCTDCNRSFSRSDHLSLHRRRHDTM
       310 320 330 340 350

>>CCDS12343.1 KLF2 gene_id:10365|Hs108|chr19 (355 aa)
 initn: 640 init1: 525 opt: 588 Z-score: 308.1 bits: 65.9 E(32554): 7.7e-11
Smith-Waterman score: 607; 39.1% identity (59.9% similar) in 302 aa overlap (174-455:63-354)

       150 160 170 180 190 200
pF1KB7 THLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPPAPTQALPEFTSIFSSHQTAAPEV
       : ::: :. :: . :. :
CCDS12 ESGGTDDDLNSVLDFILSMGLDGLGAEAAPEPPPPPPPPAFYYPEPGAPPPYSAPAGGLV
       40 50 60 70 80 90

       210 220 230 240 250 260
pF1KB7 NNIFIKQELPTPDLHLSVPTQQGHLYQLLNTPDLDMPSSTNQTAAMDTLNVS--MSAAMA
       ... .. :: .: :. :. .:.. :: : . . .. . . . .. .
CCDS12 SEL-LRPELDAP---LG-PALHGRF--LLAPPGRLVKAEPPEADGGGGYGCAPGLTRGPR
       100 110 120 130 140

       270 280 290 300 310
pF1KB7 GLNTHTSAVPQTA-VKQFQGMPPCTYTMPSQFLPQQATYFP-PSPPSSEP---GSPDRQA
       ::. . . : .. .. : :: : . :. . .: :.: .: : :.: :
CCDS12 GLKREGAPGPAASCMRGPGGRPPPPPDTPP-LSPDGPARLPAPGPRASFPPPFGGPGFGA
       150 160 170 180 190 200

       320 330 340 350 360
pF1KB7 EM--LQNLTP-PPSY-----AATIASKLAIHNPN-----LPTTLPVNSQNIQPVRYNRRS
       :. : ::.. ::. :. :.. : : . :.. . .: : .:::
CCDS12 PGPGLHYAPPAPPAFGLFDDAAAAAAALGLAPPAARGLLTPPASPLELLEAKPKR-GRRS
       210 220 230 240 250 260

       370 380 390 400 410 420
pF1KB7 NPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARSDELTRH
       : .. : :.: :: :.::::::::::::::::::::.:.:.:: :.::::::::::
CCDS12 WPR-KRTATHTCSYAGCGKTYTKSSHLKAHLRTHTGEKPYHCNWDGCGWKFARSDELTRH
       270 280 290 300 310 320

       430 440 450
pF1KB7 YRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
       :::::: .:::: .:.:.::::::::::::::
CCDS12 YRKHTGHRPFQCHLCDRAFSRSDHLALHMKRHM
       330 340 350

457 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 23:12:26 2016 done: Sun Nov 6 23:12:26 2016
 Total Scan time: 3.650 Total Display time: 0.040

Function used was FASTA [36.3.4 Apr, 2011]