FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7939, 342 aa
1>>>pF1KB7939 342 - 342 aa - 342 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.1433+/-0.000854; mu= 8.1549+/- 0.050
mean_var=160.5269+/-37.316, 0's: 0 Z-trim(112.0): 27 B-trim: 722 in 1/50
Lambda= 0.101228
statistics sampled from 12786 (12810) to 12786 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.752), E-opt: 0.2 (0.394), width: 16
Scan time: 2.740
The best scores are: opt bits E(32554)
CCDS2855.1 PHF7 gene_id:51533|Hs108|chr3 ( 342) 2502 377.1 1.1e-104
CCDS2854.1 PHF7 gene_id:51533|Hs108|chr3 ( 381) 1649 252.6 3.9e-67
CCDS9638.1 G2E3 gene_id:55632|Hs108|chr14 ( 706) 659 108.3 2e-23
CCDS76669.1 G2E3 gene_id:55632|Hs108|chr14 ( 660) 563 94.2 3.2e-19
>>CCDS2855.1 PHF7 gene_id:51533|Hs108|chr3 (342 aa)
initn: 2502 init1: 2502 opt: 2502 Z-score: 1992.7 bits: 377.1 E(32554): 1.1e-104
Smith-Waterman score: 2502; 100.0% identity (100.0% similar) in 342 aa overlap (1-342:1-342)
10 20 30 40 50 60
pF1KB7 MKTVKEKKECQRLRKSAKTRRVTQRKPSSGPVCWLCLREPGDPEKLGEFLQKDNISVHYF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MKTVKEKKECQRLRKSAKTRRVTQRKPSSGPVCWLCLREPGDPEKLGEFLQKDNISVHYF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 CLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCLRNFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 CLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCLRNFH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 LPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEESCILCCEDLSQQSVENIQSPCC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 LPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEESCILCCEDLSQQSVENIQSPCC
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 SQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDRRWCLILCATCGSH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDRRWCLILCATCGSH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 GTHRDCSSLRSNSKKWECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFCRDNTLEENP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 GTHRDCSSLRSNSKKWECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFCRDNTLEENP
250 260 270 280 290 300
310 320 330 340
pF1KB7 GLSWTDWPEPSLLEKPESSRGRRSYSWRSKGVRITNSCKKSK
::::::::::::::::::::::::::::::::::::::::::
CCDS28 GLSWTDWPEPSLLEKPESSRGRRSYSWRSKGVRITNSCKKSK
310 320 330 340
>>CCDS2854.1 PHF7 gene_id:51533|Hs108|chr3 (381 aa)
initn: 2488 init1: 1646 opt: 1649 Z-score: 1318.8 bits: 252.6 E(32554): 3.9e-67
Smith-Waterman score: 2366; 89.6% identity (89.6% similar) in 374 aa overlap (1-335:1-374)
10 20 30 40 50 60
pF1KB7 MKTVKEKKECQRLRKSAKTRRVTQRKPSSGPVCWLCLREPGDPEKLGEFLQKDNISVHYF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MKTVKEKKECQRLRKSAKTRRVTQRKPSSGPVCWLCLREPGDPEKLGEFLQKDNISVHYF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 CLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCLRNFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 CLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCLRNFH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 LPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEESCILCCEDLSQQSVENIQSPCC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 LPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEESCILCCEDLSQQSVENIQSPCC
130 140 150 160 170 180
190 200 210 220
pF1KB7 SQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDR-------------
:::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDRDAAWELEPGAFSD
190 200 210 220 230 240
230 240 250 260
pF1KB7 --------------------------RWCLILCATCGSHGTHRDCSSLRSNSKKWECEEC
::::::::::::::::::::::::::::::::::
CCDS28 LYQRYQHCDAPICLYEQGRDSFEDEGRWCLILCATCGSHGTHRDCSSLRSNSKKWECEEC
250 260 270 280 290 300
270 280 290 300 310 320
pF1KB7 SPAAATDYIPENSGDIPCCSSTFHPEEHFCRDNTLEENPGLSWTDWPEPSLLEKPESSRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SPAAATDYIPENSGDIPCCSSTFHPEEHFCRDNTLEENPGLSWTDWPEPSLLEKPESSRG
310 320 330 340 350 360
330 340
pF1KB7 RRSYSWRSKGVRITNSCKKSK
::::::::::::::
CCDS28 RRSYSWRSKGVRITNSCKKSK
370 380
>>CCDS9638.1 G2E3 gene_id:55632|Hs108|chr14 (706 aa)
initn: 580 init1: 382 opt: 659 Z-score: 534.0 bits: 108.3 E(32554): 2e-23
Smith-Waterman score: 716; 36.8% identity (55.5% similar) in 353 aa overlap (22-326:1-347)
10 20 30 40 50
pF1KB7 MKTVKEKKECQRLRKSAKTRRVTQRKP--SSGPVCWLCLREPGDPEKLGEFLQKD--NIS
... :: :.. .: .: .. :.: :: :. :..
CCDS96 MNESKPGDSQNLACVFCRKHDDCPNKYGEKKTKEKWNLT
10 20 30
60 70 80 90 100 110
pF1KB7 VHYFCLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCL
:::.::..:: . :::. ..: .::: :::.::. :::. : ::::.::.:.: .:
CCDS96 VHYYCLLMSSGIWQRGKEEEGVYGFLIEDIRKEVNRASKLKCCVCKKNGASIGCVAPRCK
40 50 60 70 80 90
120 130 140 150 160 170
pF1KB7 RNFHLPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEE-SCILCCEDLSQQSVENI
:..:.::: .: :. :: :.. ::: :::.: : .. : : .: : . ::
CCDS96 RSYHFPCGLQRECIFQFTGNFASFCWDHRPVQIITSNNYRESLPCTICLEFIEPIPSYNI
100 110 120 130 140 150
180 190 200 210 220
pF1KB7 -QSPCCSQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIP---------
.::::..: .:: :.: : ... ::.: ::: : .::::::::::
CCDS96 LRSPCCKNAWFHRDCLQVQAINAGVFFFRCTICNNSDIFQKEMLRMGIHIPEKDASWELE
160 170 180 190 200 210
230 240 250
pF1KB7 ------------------------------DRRWCLILCATCGSHGTHRDCSSLRSNSKK
: .: . : ::: ::: :::::: ..
CCDS96 ENAYQELLQHYERCDVRRCRCKEGRDYNAPDSKWEIKRCQCCGSSGTHLACSSLRSWEQN
220 230 240 250 260 270
260 270 280 290 300 310
pF1KB7 WECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFC--RDNTLEEN-PGLSWTDWPEPSL
::: :: : :::.. .. :. . : :::. : : . :
CCDS96 WECLECRG------IIYNSGEFQKAKKHVLPNSNNVGITDCLLEESSPKLPRQSPGSQSK
280 290 300 310 320 330
320 330 340
pF1KB7 LEKPESSRGRRSYSWRSKGVRITNSCKKSK
..:. ::. :
CCDS96 DLLRQGSKFRRNVSTLLIELGFQIKKKTKRLYINKANIWNSALDAFRNRNFNPSYAIEVA
340 350 360 370 380 390
>>CCDS76669.1 G2E3 gene_id:55632|Hs108|chr14 (660 aa)
initn: 530 init1: 332 opt: 563 Z-score: 458.6 bits: 94.2 E(32554): 3.2e-19
Smith-Waterman score: 620; 37.1% identity (53.7% similar) in 307 aa overlap (64-326:1-301)
40 50 60 70 80 90
pF1KB7 WLCLREPGDPEKLGEFLQKDNISVHYFCLILSSKLPQRGQSNRGFHGFLPEDIKKEAARA
.:: . :::. ..: .::: :::.::. ::
CCDS76 MSSGIWQRGKEEEGVYGFLIEDIRKEVNRA
10 20 30
100 110 120 130 140 150
pF1KB7 SRKICFVCKKKGAAINCQKDQCLRNFHLPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHG
:. : ::::.::.:.: .: :..:.::: .: :. :: :.. ::: :::.: : .
CCDS76 SKLKCCVCKKNGASIGCVAPRCKRSYHFPCGLQRECIFQFTGNFASFCWDHRPVQIITSN
40 50 60 70 80 90
160 170 180 190 200 210
pF1KB7 HVGEE-SCILCCEDLSQQSVENI-QSPCCSQAIYHRKCIQKYAHTSAKHFFKCPQCNNRK
. : : .: : . :: .::::..: .:: :.: : ... ::.: :::
CCDS76 NYRESLPCTICLEFIEPIPSYNILRSPCCKNAWFHRDCLQVQAINAGVFFFRCTICNNSD
100 110 120 130 140 150
220 230
pF1KB7 EFPQEMLRMGIHIP---------------------------------------DRRWCLI
: .:::::::::: : .: .
CCDS76 IFQKEMLRMGIHIPEKDASWELEENAYQELLQHYERCDVRRCRCKEGRDYNAPDSKWEIK
160 170 180 190 200 210
240 250 260 270 280 290
pF1KB7 LCATCGSHGTHRDCSSLRSNSKKWECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFC-
: ::: ::: :::::: ..::: :: : :::.. .. :. .
CCDS76 RCQCCGSSGTHLACSSLRSWEQNWECLECRG------IIYNSGEFQKAKKHVLPNSNNVG
220 230 240 250 260
300 310 320 330 340
pF1KB7 -RDNTLEEN-PGLSWTDWPEPSLLEKPESSRGRRSYSWRSKGVRITNSCKKSK
: :::. : : . : ..:. ::. :
CCDS76 ITDCLLEESSPKLPRQSPGSQSKDLLRQGSKFRRNVSTLLIELGFQIKKKTKRLYINKAN
270 280 290 300 310 320
CCDS76 IWNSALDAFRNRNFNPSYAIEVAYVIENDNFGSEHPGSKQEFLSLLMQHLENSSLFEGSL
330 340 350 360 370 380
342 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 14:41:18 2016 done: Sat Nov 5 14:41:19 2016
Total Scan time: 2.740 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]