FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8937, 288 aa
1>>>pF1KB8937 288 - 288 aa - 288 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.7468+/-0.00096; mu= -5.2121+/- 0.058
mean_var=447.7617+/-96.501, 0's: 0 Z-trim(117.7): 644 B-trim: 0 in 0/53
Lambda= 0.060611
statistics sampled from 17731 (18524) to 17731 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.569), width: 16
Scan time: 2.760
The best scores are:                                       opt  bits E(32554)
CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15        ( 288) 2004 188.4 5.4e-48
CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19        ( 252)  671  71.7   6e-13
CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7         ( 323)  635  68.7 6.3e-12
CCDS6633.1 KLF9 gene_id:687|Hs108|chr9             ( 244)  608  66.2 2.7e-11
>>CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 (288 aa)
initn: 2004 init1: 2004 opt: 2004 Z-score: 975.2 bits: 188.4 E(32554): 5.4e-48
Smith-Waterman score: 2004; 100.0% identity (100.0% similar) in 288 aa overlap (1-288:1-288)
               10        20        30        40        50        60
pF1KB8 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSAS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 LFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPSP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 AWSEPEPEAGLEPEREPGPAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEKVYGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AWSEPEPEAGLEPEREPGPAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEKVYGK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 SSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRFMRSD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 SSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRFMRSD
              190       200       210       220       230       240

              250       260       270       280
pF1KB8 HLTKHARRHANFHPGMLQRRGGGSRTGSLSDYSRSDASSPTISPASSP
       ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 HLTKHARRHANFHPGMLQRRGGGSRTGSLSDYSRSDASSPTISPASSP
              250       260       270       280
>>CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 (252 aa)
initn: 846 init1: 576 opt: 671 Z-score: 345.9 bits: 71.7 E(32554): 6e-13
Smith-Waterman score: 703; 48.1% identity (60.8% similar) in 291 aa overlap (2-288:3-247)
10 20 30 40 50
pF1KB8 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSA
::.: ::.:::. :...:: :::: : ::: ::. :: : :
CCDS12 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPE----GAGPAA------------GLD--
10 20 30 40
60 70 80 90 100 110
pF1KB8 SLFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPS
.: : :::.:. :: ::: : ..:: ::::::
CCDS12 -----VR--------------AARREAASPG---TPGP-PPPPPAASGPGP-GAAAAPHL
50 60 70
120 130 140 150 160 170
pF1KB8 PAWSEPEPEAGLEPEREPG---PAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEK
: : : : :: ::.:. . :. . : . ..:.: . : :
CCDS12 LAASILADLRG-GPGAAPGGASPASSSSAASSPSSGRAPGAAP-SAAAKSHRCPFPDCAK
80 90 100 110 120 130
180 190 200 210 220 230
pF1KB8 VYGKSSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRF
.: ::::::.:::::::::::::.:: :.::::::::::::.:::::::.::::.: :::
CCDS12 AYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHHRTHTGEKRFSCPLCSKRF
140 150 160 170 180 190
240 250 260 270 280
pF1KB8 MRSDHLTKHARRHANFHPGMLQRRGGGSRTGSLSD-YSRSDASSPTISPASSP
:::::.:::::: .::: .:.: :.:. : :: : :.::. ::: ::
CCDS12 TRSDHLAKHARRHPGFHPDLLRR--PGARSTSPSDSLPCSLAGSPAPSPAPSPAPAGL
200 210 220 230 240 250
>>CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 (323 aa)
initn: 687 init1: 553 opt: 635 Z-score: 327.6 bits: 68.7 E(32554): 6.3e-12
Smith-Waterman score: 703; 42.9% identity (60.2% similar) in 324 aa overlap (2-287:3-318)
10 20 30 40
pF1KB8 MAAAAYVDHFAAECLVSMSSRAVVH---------GPREGPE---SRPEGA-------AV
::.: .:.::::::::::. :::: : : : . ::.: .
CCDS58 MSAAVACLDYFAAECLVSMSAGAVVHRRPPDPEGAGGAAGSEVGAAPPESALPGPGPPGP
10 20 30 40 50 60
50 60 70 80 90
pF1KB8 AATPTLPRVEERRDGKDSAS-LFVVARILADLNQQAPAPAPAERREGAAARKART---PC
:..: ::.: : .:. ...: . ::: .. . . :. : .. . ::
CCDS58 ASVPQLPQVPAPSPGAGGAAPHLLAASVWADLRGSSGEGSWENSGEAPRASSGFSDPIPC
70 80 90 100 110 120
100 110 120 130 140
pF1KB8 RLPPPAPEPTSPGAEGAAAAPPSPAWSEPEPEAGLEPEREPGPAGSG-----------EP
. : : .: : ::::. .: : : . : .::.:: :
CCDS58 SVQTPCSE-LAP-ASGAAAVC-APESSSDAPAVPSAPAAPGAPAASGGFSGGALGAGPAP
130 140 150 160 170
150 160 170 180 190 200
pF1KB8 GLRQRVRRGRSRADLESPQRKHKCHYAGCEKVYGKSSHLKAHLRTHTGERPFACSWQDCN
. : :: :: . ..:.: . :: :.: ::::::.: :::::::::.:.: ::.
CCDS58 AADQAPRR-RS---VTPAAKRHQCPFPGCTKAYYKSSHLKSHQRTHTGERPFSCDWLDCD
180 190 200 210 220 230
210 220 230 240 250 260
pF1KB8 KKFARSDELARHYRTHTGEKKFSCPICEKRFMRSDHLTKHARRHANFHPGMLQRRGGGSR
:::.::::::::::::::::.::::.: :.: :::::::::::: ..:: :.. :: :
CCDS58 KKFTRSDELARHYRTHTGEKRFSCPLCPKQFSRSDHLTKHARRHPTYHPDMIEYRGR-RR
240 250 260 270 280 290
270 280
pF1KB8 TGS----LSDYSRSDASSPTISPASSP
: :.. .:.::. .:: :
CCDS58 TPRIDPPLTSEVESSASGSGPGPAPSFTTCL
300 310 320
>>CCDS6633.1 KLF9 gene_id:687|Hs108|chr9 (244 aa)
initn: 717 init1: 574 opt: 608 Z-score: 316.3 bits: 66.2 E(32554): 2.7e-11
Smith-Waterman score: 714; 45.8% identity (65.5% similar) in 264 aa overlap (1-259:1-235)
10 20 30 40 50
pF1KB8 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDG----K
:.::::.: ::.::::.:.::.: :..: :. : : ..:. : :
CCDS66 MSAAAYMDFVAAQCLVSISNRAAV--PEHGVA--PD-AERLRLPEREVTKEHGDPGDTWK
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 DSASLFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAP-EPTSPGAEGAAA
: .: ..:. : :::. : .: . : .: : . .. ..
CCDS66 DYCTLVTIAKSLLDLNKYRPIQTP-------------SVCSDSLESPDEDMGSDSDVTTE
60 70 80 90 100
120 130 140 150 160 170
pF1KB8 APPSPAWSEPEPEAGLEPEREPGPAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCE
. ::. : :: .: :.: . .::. . .:. .. ...::: :.::
CCDS66 SGSSPSHS---PEERQDPGSAPSPLSLLHPGV---AAKGK-----HASEKRHKCPYSGCG
110 120 130 140 150
180 190 200 210 220 230
pF1KB8 KVYGKSSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKR
:::::::::::: :.::::::: :.: :: :::.:::::.::::::::::.: ::.::::
CCDS66 KVYGKSSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKR
160 170 180 190 200 210
240 250 260 270 280
pF1KB8 FMRSDHLTKHARRHANFHPGMLQRRGGGSRTGSLSDYSRSDASSPTISPASSP
::::::::::::::..:::.:..:
CCDS66 FMRSDHLTKHARRHTEFHPSMIKRSKKALANAL
220 230 240
288 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 04:21:35 2016 done: Tue Nov 8 04:21:35 2016
Total Scan time: 2.760 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]