FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9693, 252 aa
1>>>pF1KB9693 252 - 252 aa - 252 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.7563+/-0.000945; mu= -9.7550+/- 0.057
mean_var=530.9813+/-117.846, 0's: 0 Z-trim(119.6): 532 B-trim: 1053 in 1/54
Lambda= 0.055659
statistics sampled from 20211 (20919) to 20211 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.859), E-opt: 0.2 (0.643), width: 16
Scan time: 3.110
The best scores are: opt bits E(32554)
CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 ( 252) 1774 155.4 3.3e-38
CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 ( 323) 719 70.8 1.2e-12
CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 ( 288) 671 66.9 1.7e-11
>>CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 (252 aa)
initn: 1774 init1: 1774 opt: 1774 Z-score: 799.3 bits: 155.4 E(32554): 3.3e-38
Smith-Waterman score: 1774; 100.0% identity (100.0% similar) in 252 aa overlap (1-252:1-252)
10 20 30 40 50 60
pF1KB9 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPEGAGPAAGLDVRAARREAASPGTPGPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPEGAGPAAGLDVRAARREAASPGTPGPPP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 PPPAASGPGPGAAAAPHLLAASILADLRGGPGAAPGGASPASSSSAASSPSSGRAPGAAP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PPPAASGPGPGAAAAPHLLAASILADLRGGPGAAPGGASPASSSSAASSPSSGRAPGAAP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 SAAAKSHRCPFPDCAKAYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHHRT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SAAAKSHRCPFPDCAKAYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHHRT
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 HTGEKRFSCPLCSKRFTRSDHLAKHARRHPGFHPDLLRRPGARSTSPSDSLPCSLAGSPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 HTGEKRFSCPLCSKRFTRSDHLAKHARRHPGFHPDLLRRPGARSTSPSDSLPCSLAGSPA
190 200 210 220 230 240
250
pF1KB9 PSPAPSPAPAGL
::::::::::::
CCDS12 PSPAPSPAPAGL
250
>>CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 (323 aa)
initn: 1049 init1: 661 opt: 719 Z-score: 340.2 bits: 70.8 E(32554): 1.2e-12
Smith-Waterman score: 777; 50.4% identity (61.3% similar) in 282 aa overlap (38-250:38-318)
10 20 30 40 50 60
pF1KB9 VDYFAADVLMAISSGAVVHRGRPGPEGAGPAAGLDVRAARREAASPGTPGPPPP---P--
::: .: :: :.: :: :::: : :
CCDS58 LDYFAAECLVSMSAGAVVHRRPPDPEGAGGAAGSEVGAAPPESALPG-PGPPGPASVPQL
10 20 30 40 50 60
70 80 90
pF1KB9 PAASGPGPGAA-AAPHLLAASILADLRGGPGA----------------------------
: . .:.:::. :::::::::. :::::. :
CCDS58 PQVPAPSPGAGGAAPHLLAASVWADLRGSSGEGSWENSGEAPRASSGFSDPIPCSVQTPC
70 80 90 100 110 120
100 110 120
pF1KB9 ---APG-GAS----PASSSSAASSPSSGRAPGA----------------APSA-------
::. ::. : :::.: . ::. :::: ::.:
CCDS58 SELAPASGAAAVCAPESSSDAPAVPSAPAAPGAPAASGGFSGGALGAGPAPAADQAPRRR
130 140 150 160 170 180
130 140 150 160 170
pF1KB9 ----AAKSHRCPFPDCAKAYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHH
::: :.:::: :.:::::::::::: :::::::::.::: :::::.::::::::.
CCDS58 SVTPAAKRHQCPFPGCTKAYYKSSHLKSHQRTHTGERPFSCDWLDCDKKFTRSDELARHY
190 200 210 220 230 240
180 190 200 210 220 230
pF1KB9 RTHTGEKRFSCPLCSKRFTRSDHLAKHARRHPGFHPDLLRRPGARSTSPSDSLPCSLAGS
:::::::::::::: :.:.:::::.::::::: .:::... : : : : : . :
CCDS58 RTHTGEKRFSCPLCPKQFSRSDHLTKHARRHPTYHPDMIEYRGRRRTPRIDPPLTSEVES
250 260 270 280 290 300
240 250
pF1KB9 PAPSPAPSPAPAGL
: . .:.:::.
CCDS58 SASGSGPGPAPSFTTCL
310 320
>>CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 (288 aa)
initn: 846 init1: 576 opt: 671 Z-score: 320.0 bits: 66.9 E(32554): 1.7e-11
Smith-Waterman score: 678; 47.5% identity (60.2% similar) in 284 aa overlap (10-247:9-288)
10 20 30 40
pF1KB9 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPE----GAGPAA------------GLD--
.:::. :...:: :::: : ::: ::. :: : :
CCDS10 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSA
10 20 30 40 50
50 60 70
pF1KB9 -----VR--------------AARREAASPG---TPGP-PPPPPAASGPGP-GAAAAPHL
.: : :::.:. :: ::: : ..:: ::::::
CCDS10 SLFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPS
60 70 80 90 100 110
80 90 100 110 120 130
pF1KB9 LAASILADLRG-GPGAAPGGASPASSSSAASSPSSGRAPGAAP-SAAAKSHRCPFPDCAK
: : : : :: ::.:. . :. . : . ..:.: . : :
CCDS10 PAWSEPEPEAGLEPEREPG---PAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEK
120 130 140 150 160 170
140 150 160 170 180 190
pF1KB9 AYYKSSHLKSHLRTHTGERPFACDWQGCDKKFARSDELARHHRTHTGEKRFSCPLCSKRF
.: ::::::.:::::::::::::.:: :.::::::::::::.:::::::.::::.: :::
CCDS10 VYGKSSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRF
180 190 200 210 220 230
200 210 220 230 240 250
pF1KB9 TRSDHLAKHARRHPGFHPDLLRR--PGARSTSPSDSLPCSLAGSPAPSPAPSPAPAGL
:::::.:::::: .::: .:.: :.:. : :: : :.::. ::: ::
CCDS10 MRSDHLTKHARRHANFHPGMLQRRGGGSRTGSLSD-YSRSDASSPTISPASSP
240 250 260 270 280
252 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 13:35:46 2016 done: Sun Nov 6 13:35:46 2016
Total Scan time: 3.110 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]