FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4104, 473 aa
1>>>pF1KE4104 473 - 473 aa - 473 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.6720+/-0.000687; mu= 14.1569+/- 0.042
mean_var=124.8895+/-25.349, 0's: 0 Z-trim(114.1): 28 B-trim: 952 in 2/51
Lambda= 0.114765
statistics sampled from 14682 (14709) to 14682 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.769), E-opt: 0.2 (0.452), width: 16
Scan time: 2.860
The best scores are: opt bits E(32554)
CCDS3467.1 KCTD8 gene_id:386617|Hs108|chr4 ( 473) 3165 534.6 8.3e-152
CCDS34260.1 KCTD16 gene_id:57528|Hs108|chr5 ( 428) 1144 200.0 4.2e-51
CCDS9455.1 KCTD12 gene_id:115207|Hs108|chr13 ( 325) 630 114.8 1.4e-25
>>CCDS3467.1 KCTD8 gene_id:386617|Hs108|chr4 (473 aa)
initn: 3165 init1: 3165 opt: 3165 Z-score: 2838.9 bits: 534.6 E(32554): 8.3e-152
Smith-Waterman score: 3165; 100.0% identity (100.0% similar) in 473 aa overlap (1-473:1-473)
10 20 30 40 50 60
pF1KE4 MALKDTGSGGSTILPISEMVSSSSSPGASAAAAPGPCAPSPFPEVVELNVGGQVYVTKHS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MALKDTGSGGSTILPISEMVSSSSSPGASAAAAPGPCAPSPFPEVVELNVGGQVYVTKHS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 TLLSVPDSTLASMFSPSSPRGGARRRGELPRDSRARFFIDRDGFLFRYVLDYLRDKQLAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 TLLSVPDSTLASMFSPSSPRGGARRRGELPRDSRARFFIDRDGFLFRYVLDYLRDKQLAL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 PEHFPEKERLLREAEYFQLTDLVKLLSPKVTKQNSLNDEGCQSDLEDNVSQGSSDALLLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PEHFPEKERLLREAEYFQLTDLVKLLSPKVTKQNSLNDEGCQSDLEDNVSQGSSDALLLR
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 GAAAAVPSGPGAHGGGGGGGAQDKRSGFLTLGYRGSYTTVRDNQADAKFRRVARIMVCGR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 GAAAAVPSGPGAHGGGGGGGAQDKRSGFLTLGYRGSYTTVRDNQADAKFRRVARIMVCGR
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 IALAKEVFGDTLNESRDPDRQPEKYTSRFYLKFTYLEQAFDRLSEAGFHMVACNSSGTAA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 IALAKEVFGDTLNESRDPDRQPEKYTSRFYLKFTYLEQAFDRLSEAGFHMVACNSSGTAA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 FVNQYRDDKIWSSYTEYIFFRPPQKIVSPKQEHEDRKHDKVTDKGSESGTSCNELSTSSC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 FVNQYRDDKIWSSYTEYIFFRPPQKIVSPKQEHEDRKHDKVTDKGSESGTSCNELSTSSC
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 DSHSEASTPQDNPSSAQQATAHQPNTLTLDRPSKKAPVQWIPPPDKRRNSELFQTLISKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 DSHSEASTPQDNPSSAQQATAHQPNTLTLDRPSKKAPVQWIPPPDKRRNSELFQTLISKS
370 380 390 400 410 420
430 440 450 460 470
pF1KE4 RETNLSKKKVCEKLSVEEEMKKCIQDFKKIHIPDYFPERKRQWQSELLQKYGL
:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 RETNLSKKKVCEKLSVEEEMKKCIQDFKKIHIPDYFPERKRQWQSELLQKYGL
430 440 450 460 470
>>CCDS34260.1 KCTD16 gene_id:57528|Hs108|chr5 (428 aa)
initn: 1681 init1: 652 opt: 1144 Z-score: 1031.0 bits: 200.0 E(32554): 4.2e-51
Smith-Waterman score: 1669; 61.7% identity (78.2% similar) in 441 aa overlap (35-473:16-428)
10 20 30 40 50 60
pF1KE4 DTGSGGSTILPISEMVSSSSSPGASAAAAPGPCAPSPFPEVVELNVGGQVYVTKHSTLLS
: .:. :::::::::::::: :.::::.:
CCDS34 MALSGNCSRYYPREQGSAVPNSFPEVVELNVGGQVYFTRHSTLIS
10 20 30 40
70 80 90 100 110 120
pF1KE4 VPDSTLASMFSPSSPRGGARRRGELPRDSRARFFIDRDGFLFRYVLDYLRDKQLALPEHF
.: : : .::::. : : ..: .::..:::::::::::::.::::::.:..::.::
CCDS34 IPHSLLWKMFSPK--RDTA---NDLAKDSKGRFFIDRDGFLFRYILDYLRDRQVVLPDHF
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE4 PEKERLLREAEYFQLTDLVKLLSPKVTKQNSLNDEGCQSDLEDNVSQGSSDALLLRGAAA
::: :: :::::::: ::::::.: ::. :: :.::.:: .::::. . ..
CCDS34 PEKGRLKREAEYFQLPDLVKLLTPDEIKQSP--DEFCHSDFED-ASQGSDTRIC--PPSS
110 120 130 140 150
190 200 210 220 230 240
pF1KE4 AVPSGPGAHGGGGGGGAQDKRSGFLTLGYRGSYTTVRDNQADAKFRRVARIMVCGRIALA
.:. :.. ::.:.::::: : :..::::::::: ::.:::::.::
CCDS34 LLPA--------------DRKWGFITVGYRGSCTLGREGQADAKFRRVPRILVCGRISLA
160 170 180 190 200
250 260 270 280 290 300
pF1KE4 KEVFGDTLNESRDPDRQPEKYTSRFYLKFTYLEQAFDRLSEAGFHMVACNSSGTAAFVNQ
:::::.:::::::::: ::.::::::::: .::.::: ::: :::::::::: ::.:.::
CCDS34 KEVFGETLNESRDPDRAPERYTSRFYLKFKHLERAFDMLSECGFHMVACNSSVTASFINQ
210 220 230 240 250 260
310 320 330 340 350 360
pF1KE4 YRDDKIWSSYTEYIFFRPPQKIVSPKQEHEDRKHDKVTDKGSESGTSCNELSTSSCDSHS
: :::::::::::.:.: :.. ::.. :. : :: .:::::::.::::::::.:
CCDS34 YTDDKIWSSYTEYVFYREPSRW-SPSHCDCCCKNGK-GDKEGESGTSCNDLSTSSCDSQS
270 280 290 300 310
370 380 390 400 410 420
pF1KE4 EASTPQDNPSSAQQATAHQPNTLTLDRPSKKAPVQWIPPPDKRRNSELFQTLISKSRETN
:::.::. . ...: : ::::: ::.::: : . ::.:.:..:: : :::.:
CCDS34 EASSPQE--TVICGPVTRQTNIQTLDRPIKKGPVQLIQQSEMRRKSDLLRTLTSGSRESN
320 330 340 350 360 370
430 440 450 460 470
pF1KE4 LSKKK--VCEKLSVEEEMKKCIQDFKKIHIPDYFPERKRQWQSELLQKYGL
.:.:: : ::::.:::..:::::: ::.::: :::::. ::::::.:: :
CCDS34 MSSKKKAVKEKLSIEEELEKCIQDFLKIKIPDRFPERKHPWQSELLRKYHL
380 390 400 410 420
>>CCDS9455.1 KCTD12 gene_id:115207|Hs108|chr13 (325 aa)
initn: 894 init1: 540 opt: 630 Z-score: 572.7 bits: 114.8 E(32554): 1.4e-25
Smith-Waterman score: 1056; 51.9% identity (73.8% similar) in 343 aa overlap (1-321:1-324)
10 20 30 40 50 60
pF1KE4 MALKDTGSGGSTILPISEMVSSSSSPGASAAAAPGPCAPSPFPEVVELNVGGQVYVTKHS
::: :. : :: .... :...... . : ::..::::::::::::..
CCDS94 MALADSTRG----LP------NGGGGGGGSGSSSSSAEPPLFPDIVELNVGGQVYVTRRC
10 20 30 40 50
70 80 90 100 110 120
pF1KE4 TLLSVPDSTLASMFSPSSPRGGARRRGELPRDSRARFFIDRDGFLFRYVLDYLRDKQLAL
:..::::: : ::. ..:. :: :::..:::.:::::::::.:::::: ::.:
CCDS94 TVVSVPDSLLWRMFTQQQPQ-------ELARDSKGRFFLDRDGFLFRYILDYLRDLQLVL
60 70 80 90 100
130 140 150 160 170
pF1KE4 PEHFPEKERLLREAEYFQLTDLVKLL-SPKVT------KQNSLNDEGCQSDLEDNVSQGS
:..:::. :: ::::::.: .::. : .:. .. ... :: .: . . :
CCDS94 PDYFPERSRLQREAEYFELPELVRRLGAPQQPGPGPPPSRRGVHKEGSLGD--ELLPLGY
110 120 130 140 150 160
180 190 200 210 220
pF1KE4 SDALLLRGAAAAVPS-----GPGAHGGGGGGG----AQD----KRSGFLTLGYRGSYTTV
:. .::.:..:: . . .::..: .:. .:::..:.:::::::
CCDS94 SEPEQQEGASAGAPSPTLELASRSPSGGAAGPLLTPSQSLDGSRRSGYITIGYRGSYTIG
170 180 190 200 210 220
230 240 250 260 270 280
pF1KE4 RDNQADAKFRRVARIMVCGRIALAKEVFGDTLNESRDPDRQPEKYTSRFYLKFTYLEQAF
:: :::::::::::: :::. .:::::::::::::::::: ::.::::.::::..:::::
CCDS94 RDAQADAKFRRVARITVCGKTSLAKEVFGDTLNESRDPDRPPERYTSRYYLKFNFLEQAF
230 240 250 260 270 280
290 300 310 320 330
pF1KE4 DRLSEAGFHMVACNSSGTAAFVNQ--YRDDKIWSSYTEYIFFRPPQKIVSPKQEHEDRKH
:.:::.:::::::.:.:: ::... .::::.:::::.: :
CCDS94 DKLSESGFHMVACSSTGTCAFASSTDQSEDKIWTSYTEYVFCRE
290 300 310 320
340 350 360 370 380 390
pF1KE4 DKVTDKGSESGTSCNELSTSSCDSHSEASTPQDNPSSAQQATAHQPNTLTLDRPSKKAPV
473 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 01:44:07 2016 done: Sun Nov 6 01:44:07 2016
Total Scan time: 2.860 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]