FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0759, 331 aa
1>>>pF1KE0759 331 - 331 aa - 331 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6808+/-0.00091; mu= 13.7842+/- 0.054
mean_var=63.9089+/-13.072, 0's: 0 Z-trim(105.7): 32 B-trim: 488 in 2/47
Lambda= 0.160433
statistics sampled from 8534 (8555) to 8534 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.633), E-opt: 0.2 (0.263), width: 16
Scan time: 2.310
The best scores are: opt bits E(32554)
CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 ( 331) 2226 523.9 6.9e-149
CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 ( 306) 1513 358.9 3.1e-99
CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 ( 326) 1422 337.8 7.2e-93
CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 ( 365) 1395 331.6 6e-91
CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 ( 198) 914 220.2 1.1e-57
>>CCDS32841.1 DOK6 gene_id:220164|Hs108|chr18 (331 aa)
initn: 2226 init1: 2226 opt: 2226 Z-score: 2786.4 bits: 523.9 E(32554): 6.9e-149
Smith-Waterman score: 2226; 100.0% identity (100.0% similar) in 331 aa overlap (1-331:1-331)
10 20 30 40 50 60
pF1KE0 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 MEQKARLQTSLTEPMTLSKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQTF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MEQKARLQTSLTEPMTLSKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQTF
250 260 270 280 290 300
310 320 330
pF1KE0 PSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ
:::::::::::::::::::::::::::::::
CCDS32 PSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ
310 320 330
>>CCDS13446.1 DOK5 gene_id:55816|Hs108|chr20 (306 aa)
initn: 1469 init1: 1469 opt: 1513 Z-score: 1895.1 bits: 358.9 E(32554): 3.1e-99
Smith-Waterman score: 1513; 70.1% identity (89.6% similar) in 308 aa overlap (1-307:1-306)
10 20 30 40 50 60
pF1KE0 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT
::::::::::::::.::::.:::..::::::::::::::.::::: ::.::::: .::::
CCDS13 MASNFNDIVKQGYVRIRSRRLGIYQRCWLVFKKASSKGPKRLEKFSDERAAYFRCYHKVT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS
::.:.::..:::. :::::..: :.:.:::::::::.:::.:::: : :::.:::.::::
CCDS13 ELNNVKNVARLPKSTKKHAIGIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP
::::::::.::.:::.:::::::::.::::..:::..:::.: : :::..: .:::. ::
CCDS13 LGEPDLLATGVEREQSERFNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE
::.:::::::.::::::.::::.:::::: ::::.:: ::::::::.:::::::::: :.
CCDS13 LSALRRYGRDTTWFTFEAGRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERL-LQ
190 200 210 220 230
250 260 270 280 290
pF1KE0 MEQKARLQTSLTE-PMTLSKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQT
... :: ...: .:: . :::::::.:::::.:.:..: :: . . :. :..:
CCDS13 SVKNSMLQMKMSERAASLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVS-SPLKLHRTET
240 250 260 270 280 290
300 310 320 330
pF1KE0 FPSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ
::.: :.
CCDS13 FPAYRSEH
300
>>CCDS10783.1 DOK4 gene_id:55715|Hs108|chr16 (326 aa)
initn: 1423 init1: 1308 opt: 1422 Z-score: 1780.8 bits: 337.8 E(32554): 7.2e-93
Smith-Waterman score: 1423; 63.9% identity (84.9% similar) in 324 aa overlap (1-314:1-324)
10 20 30 40 50 60
pF1KE0 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT
::.::.:::::::::..::::::.:::::::.:.:::::.::::.::::.. .:. :::
CCDS10 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS
:. :.: .::::.:::..:::::: :....::.:.:::::::: : : .::::.::::::
CCDS10 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP
:::::::: ::: ::..::::.:.: ::::.:::: .:::::::::::::: .:::: ::
CCDS10 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE
: ::::::::.: ::::.:::::.::::.::::.:::.:::.::::::::::::.:..::
CCDS10 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
190 200 210 220 230 240
250 260 270 280 290
pF1KE0 MEQKARLQTSLTEPMTL--SKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQ
::...:: .. :: .. . . ::::::::::: .....: : :.:.:... :
CCDS10 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGEGYGAAQASSET
250 260 270 280 290 300
300 310 320 330
pF1KE0 TF--------PSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ
. :. . .: ::. :
CCDS10 DLLNRFILLKPKPSQGDSSEAKTPSQ
310 320
>>CCDS81986.1 DOK4 gene_id:55715|Hs108|chr16 (365 aa)
initn: 1396 init1: 1308 opt: 1395 Z-score: 1746.2 bits: 331.6 E(32554): 6e-91
Smith-Waterman score: 1395; 68.4% identity (89.3% similar) in 291 aa overlap (1-289:1-291)
10 20 30 40 50 60
pF1KE0 MASNFNDIVKQGYVKIRSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT
::.::.:::::::::..::::::.:::::::.:.:::::.::::.::::.. .:. :::
CCDS81 MATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLEKYPDEKSVCLRGCPKVT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDIS
:. :.: .::::.:::..:::::: :....::.:.:::::::: : : .::::.::::::
CCDS81 EISNVKCVTRLPKETKRQAVAIIFTDDSARTFTCDSELEAEEWYKTLSVECLGSRLNDIS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 LGEPDLLAAGVQREQNERFNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWP
:::::::: ::: ::..::::.:.: ::::.:::: .:::::::::::::: .:::: ::
CCDS81 LGEPDLLAPGVQCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVKLVSWP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 LSSLRRYGRDSTWFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLE
: ::::::::.: ::::.:::::.::::.::::.:::.:::.::::::::::::.:..::
CCDS81 LCSLRRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQIYQRVHSATLAIAEQHKRVLLE
190 200 210 220 230 240
250 260 270 280 290
pF1KE0 MEQKARLQTSLTEPMTL--SKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQ
::...:: .. :: .. . . ::::::::::: .....: : :...
CCDS81 MEKNVRLLNKGTEHYSYPCTPTTMLPRSAYWHHITGSQNIAEASSYAGESLPCPTPTCQE
250 260 270 280 290 300
300 310 320 330
pF1KE0 TFPSYAPEQSEEAQQPLSRSSSYGFSYSSSLIQ
CCDS81 ALWRMRPIGQGSFDLALSSEPASVPTGEGYGAAQASSETDLLNRFILLKPKPSQGDSSEA
310 320 330 340 350 360
>>CCDS13447.1 DOK5 gene_id:55816|Hs108|chr20 (198 aa)
initn: 870 init1: 870 opt: 914 Z-score: 1148.9 bits: 220.2 E(32554): 1.1e-57
Smith-Waterman score: 914; 66.0% identity (87.0% similar) in 200 aa overlap (109-307:1-198)
80 90 100 110 120 130
pF1KE0 AVAIIFHDETSKTFACESELEAEEWCKHLCMECLGTRLNDISLGEPDLLAAGVQREQNER
:::.:::.::::::::::::.::.:::.::
CCDS13 MECVGTRINDISLGEPDLLATGVEREQSER
10 20 30
140 150 160 170 180 190
pF1KE0 FNVYLMPTPNLDIYGECTMQITHENIYLWDIHNAKVKLVMWPLSSLRRYGRDSTWFTFES
:::::::.::::..:::..:::.: : :::..: .:::. ::::.:::::::.::::::.
CCDS13 FNVYLMPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWPLSALRRYGRDTTWFTFEA
40 50 60 70 80 90
200 210 220 230 240 250
pF1KE0 GRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHERLMLEMEQKARLQTSLTE-PMTL
::::.:::::: ::::.:: ::::::::.:::::::::: :. ... :: ...: .:
CCDS13 GRMCETGEGLFIFQTRDGEAIYQKVHSAALAIAEQHERL-LQSVKNSMLQMKMSERAASL
100 110 120 130 140
260 270 280 290 300 310
pF1KE0 SKSISLPRSAYWHHITRQNSVGEIYSLQGHGFGSSKMSRAQTFPSYAPEQSEEAQQPLSR
: . :::::::.:::::.:.:..: :: . . :. :..:::.: :.
CCDS13 STMVPLPRSAYWQHITRQHSTGQLYRLQDVS-SPLKLHRTETFPAYRSEH
150 160 170 180 190
320 330
pF1KE0 SSSYGFSYSSSLIQ
331 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 03:04:49 2016 done: Sat Nov 5 03:04:49 2016
Total Scan time: 2.310 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]