FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3402, 149 aa 1>>>pF1KE3402 149 - 149 aa - 149 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.5474+/-0.000988; mu= 6.0021+/- 0.058 mean_var=206.9624+/-55.759, 0's: 0 Z-trim(111.3): 69 B-trim: 1113 in 2/48 Lambda= 0.089151 statistics sampled from 12170 (12247) to 12170 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.758), E-opt: 0.2 (0.376), width: 16 Scan time: 1.660 The best scores are: opt bits E(32554) CCDS32914.1 ZNF69 gene_id:7620|Hs108|chr19 ( 149) 1042 145.7 9.9e-36 CCDS42503.1 ZNF440 gene_id:126070|Hs108|chr19 ( 595) 838 120.3 1.8e-27 CCDS45982.1 ZNF763 gene_id:284390|Hs108|chr19 ( 397) 828 118.8 3.4e-27 CCDS32915.1 ZNF700 gene_id:90592|Hs108|chr19 ( 742) 806 116.3 3.5e-26 CCDS74289.1 ZNF700 gene_id:90592|Hs108|chr19 ( 745) 806 116.3 3.5e-26 CCDS12268.1 ZNF439 gene_id:90594|Hs108|chr19 ( 499) 754 109.4 2.8e-24 CCDS45985.1 ZNF844 gene_id:284391|Hs108|chr19 ( 666) 545 82.7 4.1e-16 CCDS45986.1 ZNF20 gene_id:7568|Hs108|chr19 ( 532) 538 81.7 6.8e-16 CCDS45983.1 ZNF433 gene_id:163059|Hs108|chr19 ( 673) 469 72.9 3.6e-13 CCDS45981.1 ZNF823 gene_id:55552|Hs108|chr19 ( 610) 450 70.4 1.9e-12 CCDS42502.1 ZNF627 gene_id:199692|Hs108|chr19 ( 461) 448 70.0 1.9e-12 CCDS42505.1 ZNF564 gene_id:163050|Hs108|chr19 ( 553) 426 67.3 1.5e-11 CCDS45984.2 ZNF878 gene_id:729747|Hs108|chr19 ( 531) 420 66.5 2.5e-11 CCDS32916.1 ZNF136 gene_id:7695|Hs108|chr19 ( 540) 409 65.1 6.7e-11 >>CCDS32914.1 ZNF69 gene_id:7620|Hs108|chr19 (149 aa) initn: 1042 init1: 1042 opt: 1042 Z-score: 755.0 bits: 145.7 E(32554): 9.9e-36 Smith-Waterman score: 1042; 100.0% identity (100.0% similar) in 149 aa overlap (1-149:1-149) 10 20 30 40 50 60 pF1KE3 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGKSWKDQNIEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGKSWKDQNIEY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 EYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKSCDSFVCGEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 EYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKSCDSFVCGEV 70 80 90 100 110 120 130 140 pF1KE3 GLGNSSFNMNIRGDIGHKAYEYQEYGPKP ::::::::::::::::::::::::::::: CCDS32 GLGNSSFNMNIRGDIGHKAYEYQEYGPKP 130 140 >>CCDS42503.1 ZNF440 gene_id:126070|Hs108|chr19 (595 aa) initn: 838 init1: 838 opt: 838 Z-score: 606.9 bits: 120.3 E(32554): 1.8e-27 Smith-Waterman score: 838; 93.8% identity (98.4% similar) in 128 aa overlap (22-149:16-143) 10 20 30 40 50 60 pF1KE3 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGKSWKDQNIEY ::::::::::::::.::::::::::::.:: :::::::: CCDS42 MDPVAFKDVAVNFTQEEWALLDISQRKLYREVMLETFRNLTSLGKRWKDQNIEY 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 EYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKSCDSFVCGEV :.::::::::::::.:::::::::::::::: ::::::::::::::::.:::.::::::: CCDS42 EHQNPRRNFRSLIEEKVNEIKDDSHCGETFTPVPDDRLNFQEKKASPEVKSCESFVCGEV 60 70 80 90 100 110 130 140 pF1KE3 GLGNSSFNMNIRGDIGHKAYEYQEYGPKP ::::::::::::::::::::::::::::: CCDS42 GLGNSSFNMNIRGDIGHKAYEYQEYGPKPCKCQQPKKAFRYRPSFRTQERDHTGEKPNAC 120 130 140 150 160 170 >>CCDS45982.1 ZNF763 gene_id:284390|Hs108|chr19 (397 aa) initn: 823 init1: 823 opt: 828 Z-score: 601.8 bits: 118.8 E(32554): 3.4e-27 Smith-Waterman score: 828; 85.8% identity (95.0% similar) in 141 aa overlap (9-149:9-146) 10 20 30 40 50 60 pF1KE3 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGKSWKDQNIEY : :: ... .:: :::::::::::::.::::::::::::.::.:::::::: CCDS45 MMFQDPVAC-EDVAVNFTQE--EWALLDISQRKLYREVMLETFRNLTSIGKKWKDQNIEY 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 EYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKSCDSFVCGEV :::::::::::::: .:::::.:::::::::::::::::::::::::: ::::.:::::: CCDS45 EYQNPRRNFRSLIEGNVNEIKEDSHCGETFTQVPDDRLNFQEKKASPEAKSCDNFVCGEV 60 70 80 90 100 110 130 140 pF1KE3 GLGNSSFNMNIRGDIGHKAYEYQEYGPKP :.:::::::::::::::::::::.:.::: CCDS45 GIGNSSFNMNIRGDIGHKAYEYQDYAPKPYKCQQPKKAFRYHPSFRTQERNHTGEKPYAC 120 130 140 150 160 170 >>CCDS32915.1 ZNF700 gene_id:90592|Hs108|chr19 (742 aa) initn: 953 init1: 802 opt: 806 Z-score: 583.6 bits: 116.3 E(32554): 3.5e-26 Smith-Waterman score: 915; 80.4% identity (90.2% similar) in 163 aa overlap (1-149:1-163) 10 20 30 40 pF1KE3 MPCCSHRRCREDPGTSESQEM--------------EEWALLDISQRKLYKEVMLETFRNL ::::::: ::::::::::.:: :::.::::::..:..:::::::::: CCDS32 MPCCSHRSCREDPGTSESREMDPVAFEDVAVNFTQEEWTLLDISQKNLFREVMLETFRNL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 TSVGKSWKDQNIEYEYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKAS ::.::.:.:::::::::::::.::::::.::::::.:::::::::::::::::::::::: CCDS32 TSIGKKWSDQNIEYEYQNPRRSFRSLIEEKVNEIKEDSHCGETFTQVPDDRLNFQEKKAS 70 80 90 100 110 120 110 120 130 140 pF1KE3 PEIKSCDSFVCGEVGLGNSSFNMNIRGDIGHKAYEYQEYGPKP ::.::::::::.:::.:::::::.:::: :::::::::::::: CCDS32 PEVKSCDSFVCAEVGIGNSSFNMSIRGDTGHKAYEYQEYGPKPYKCQQPKNKKAFRYRPS 130 140 150 160 170 180 CCDS32 IRTQERDHTGEKPYACKVCGKTFIFHSSIRRHMVMHSGDGTYKCKFCGKAFHSFSLYLIH 190 200 210 220 230 240 >>CCDS74289.1 ZNF700 gene_id:90592|Hs108|chr19 (745 aa) initn: 939 init1: 802 opt: 806 Z-score: 583.6 bits: 116.3 E(32554): 3.5e-26 Smith-Waterman score: 909; 78.9% identity (88.6% similar) in 166 aa overlap (1-149:1-166) 10 20 30 40 pF1KE3 MPCCSHRRCREDPGTSESQEM-----------------EEWALLDISQRKLYKEVMLETF ::::::: ::::::::::.:: :::.::::::..:..::::::: CCDS74 MPCCSHRSCREDPGTSESREMMFQDPVAFEDVAVNFTQEEWTLLDISQKNLFREVMLETF 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 RNLTSVGKSWKDQNIEYEYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEK :::::.::.:.:::::::::::::.::::::.::::::.::::::::::::::::::::: CCDS74 RNLTSIGKKWSDQNIEYEYQNPRRSFRSLIEEKVNEIKEDSHCGETFTQVPDDRLNFQEK 70 80 90 100 110 120 110 120 130 140 pF1KE3 KASPEIKSCDSFVCGEVGLGNSSFNMNIRGDIGHKAYEYQEYGPKP :::::.::::::::.:::.:::::::.:::: :::::::::::::: CCDS74 KASPEVKSCDSFVCAEVGIGNSSFNMSIRGDTGHKAYEYQEYGPKPYKCQQPKNKKAFRY 130 140 150 160 170 180 CCDS74 RPSIRTQERDHTGEKPYACKVCGKTFIFHSSIRRHMVMHSGDGTYKCKFCGKAFHSFSLY 190 200 210 220 230 240 >>CCDS12268.1 ZNF439 gene_id:90594|Hs108|chr19 (499 aa) initn: 752 init1: 588 opt: 754 Z-score: 549.3 bits: 109.4 E(32554): 2.8e-24 Smith-Waterman score: 754; 85.9% identity (93.8% similar) in 128 aa overlap (22-149:31-157) 10 20 30 40 50 pF1KE3 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGK ::::::::::..::.::::::: ::::.:: CCDS12 MLSLSPILLYTCEMFQDPVAFKDVAVNFTQEEWALLDISQKNLYREVMLETFWNLTSIGK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 SWKDQNIEYEYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKS .:::::::::::::::::::. :.::::::.::::::::: :::::::::.::::::.:: CCDS12 KWKDQNIEYEYQNPRRNFRSVTEEKVNEIKEDSHCGETFTPVPDDRLNFQKKKASPEVKS 70 80 90 100 110 120 120 130 140 pF1KE3 CDSFVCGEVGLGNSSFNMNIRGDIGHKAYEYQEYGPKP :::::: :::::::: ::::::: :::: : ::::::: CCDS12 CDSFVC-EVGLGNSSSNMNIRGDTGHKACECQEYGPKPWKSQQPKKAFRYHPSLRTQERD 130 140 150 160 170 CCDS12 HTGKKPYACKECGKNIIYHSSIQRHMVVHSGDGPYKCKFCGKAFHCLSLYLIHERTHTGE 180 190 200 210 220 230 >>CCDS45985.1 ZNF844 gene_id:284391|Hs108|chr19 (666 aa) initn: 543 init1: 356 opt: 545 Z-score: 402.7 bits: 82.7 E(32554): 4.1e-16 Smith-Waterman score: 545; 57.6% identity (82.7% similar) in 139 aa overlap (11-149:7-140) 10 20 30 40 50 60 pF1KE3 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGKSWKDQNIEY :: ... .:: ::.::: ::..::.::: ::.:::.:.:..::::::: CCDS45 MDLVAFEDVAVNFTQE--EWSLLDPSQKNLYREVMQETLRNLASIGEKWKDQNIED 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 EYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKSCDSFVCGEV .:.::: :.:::. ..:.: ...::::: .:.::: :: ::.:: .:::.: ::::: CCDS45 QYKNPRNNLRSLLGERVDENTEENHCGETSSQIPDDTLN---KKTSPGVKSCESSVCGEV 60 70 80 90 100 110 130 140 pF1KE3 GLGNSSFNMNIRGDIGHKAYEYQEYGPKP .:.::.: .::.: .:: :::::: .: CCDS45 FVGHSSLNRHIRADTAHKPSEYQEYGQEPYKCQQRKKAFRCHPSFQMQEKAHTGEKLYDC 120 130 140 150 160 170 >>CCDS45986.1 ZNF20 gene_id:7568|Hs108|chr19 (532 aa) initn: 407 init1: 256 opt: 538 Z-score: 398.9 bits: 81.7 E(32554): 6.8e-16 Smith-Waterman score: 538; 59.7% identity (80.6% similar) in 139 aa overlap (11-149:10-142) 10 20 30 40 50 60 pF1KE3 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGKSWKDQNIEY :: ..: .:: :::::: ::..::..:: :::.:::::::.:: :::: CCDS45 MMFQDSVAFEDVAVSFTQE--EWALLDPSQKNLYRDVMQETFKNLTSVGKTWKVQNIED 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 EYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKSCDSFVCGEV ::.:::::. ::...:. : :.. ::::.:.:. :: :: .:. : : :.: ::::: CCDS45 EYKNPRRNL-SLMREKLCESKESHHCGESFNQIADDMLN---RKTLPGITPCESSVCGEV 60 70 80 90 100 110 130 140 pF1KE3 GLGNSSFNMNIRGDIGHKAYEYQEYGPKP : :.::.: .::.: :::. :::::: .: CCDS45 GTGHSSLNTHIRADTGHKSSEYQEYGENPYRNKECKKAFSYLDSFQSHDKACTKEKPYDG 120 130 140 150 160 170 >>CCDS45983.1 ZNF433 gene_id:163059|Hs108|chr19 (673 aa) initn: 455 init1: 224 opt: 469 Z-score: 349.8 bits: 72.9 E(32554): 3.6e-13 Smith-Waterman score: 469; 55.4% identity (76.3% similar) in 139 aa overlap (11-149:10-141) 10 20 30 40 50 60 pF1KE3 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGKSWKDQNIEY :: ... .:: :::::: ::..: ..:: ::::::.:.::.:: ::: CCDS45 MMFQDSVAFEDVAVTFTQE--EWALLDPSQKNLCRDVMQETFRNLASIGKKWKPQNIYV 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 EYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKSCDSFVCGEV ::.: :::.: .. ... : :. . :: .:::::: : ::.. .:::.: : ::: CCDS45 EYENLRRNLR-IVGERLFESKEGHQHGEILTQVPDDML----KKTTTGVKSCESSVYGEV 60 70 80 90 100 110 130 140 pF1KE3 GLGNSSFNMNIRGDIGHKAYEYQEYGPKP : ..::.: .:: : ::::::::::: :: CCDS45 GSAHSSLNRHIRDDTGHKAYEYQEYGQKPYKCKYCKKPFNCLSSVQTHERAHSGRKLYVC 120 130 140 150 160 170 >>CCDS45981.1 ZNF823 gene_id:55552|Hs108|chr19 (610 aa) initn: 460 init1: 193 opt: 450 Z-score: 337.1 bits: 70.4 E(32554): 1.9e-12 Smith-Waterman score: 454; 53.2% identity (74.1% similar) in 139 aa overlap (11-149:7-135) 10 20 30 40 50 60 pF1KE3 MPCCSHRRCREDPGTSESQEMEEWALLDISQRKLYKEVMLETFRNLTSVGKSWKDQNIEY :: ... .:: ::::: ::..::..:: ::.::: . .:.:::: CCDS45 MDSVAFEDVAVNFTQE--EWALLGPSQKSLYRNVMQETIRNLDCIEMKWEDQNIGD 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 EYQNPRRNFRSLIEKKVNEIKDDSHCGETFTQVPDDRLNFQEKKASPEIKSCDSFVCGEV . :: .::.:: .. ::::::.::::: :.::. .: : .:... ::: :::: CCDS45 QCQNAKRNLRS----HTCEIKDDSQCGETFGQIPDSIVN----KNTPRVNPCDSGECGEV 60 70 80 90 100 130 140 pF1KE3 GLGNSSFNMNIRGDIGHKAYEYQEYGPKP ::.::.: ::: : :::. :.:::: :: CCDS45 VLGHSSLNCNIRVDTGHKSCEHQEYGEKPYTHKQRGKAISHQHSFQTHERPPTGKKPFDC 110 120 130 140 150 160 149 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 05:11:41 2016 done: Sun Nov 6 05:11:41 2016 Total Scan time: 1.660 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]