FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1165, 594 aa
1>>>pF1KE1165 594 - 594 aa - 594 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.1287+/-0.00101; mu= 14.3178+/- 0.061
mean_var=73.4006+/-14.653, 0's: 0 Z-trim(104.5): 24 B-trim: 0 in 0/51
Lambda= 0.149701
statistics sampled from 7936 (7946) to 7936 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.606), E-opt: 0.2 (0.244), width: 16
Scan time: 3.540
The best scores are:                                       opt  bits E(32554)
CCDS9951.1 SETD3 gene_id:84193|Hs108|chr14         ( 594) 3867 844.9 0
CCDS9952.1 SETD3 gene_id:84193|Hs108|chr14         ( 296) 1892 418.3 6.9e-117
CCDS10798.1 SETD6 gene_id:79918|Hs108|chr16        ( 449)  275  69.1 1.4e-11
CCDS54013.1 SETD6 gene_id:79918|Hs108|chr16        ( 473)  271  68.3 2.6e-11
>>CCDS9951.1 SETD3 gene_id:84193|Hs108|chr14 (594 aa)
initn: 3867 init1: 3867 opt: 3867 Z-score: 4512.2 bits: 844.9 E(32554): 0
Smith-Waterman score: 3867; 100.0% identity (100.0% similar) in 594 aa overlap (1-594:1-594)
               10        20        30        40        50        60
pF1KE1 MGKKSRVKTQKSGTGATATVSPKEILNLTSELLQKCSSPAPGPGKEWEEYVQIRTLVEKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 MGKKSRVKTQKSGTGATATVSPKEILNLTSELLQKCSSPAPGPGKEWEEYVQIRTLVEKI
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 RKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 RKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELF
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 LWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 LWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLP
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE1 SEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 SEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFT
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE1 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVAL
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KE1 QDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 QDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARA
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KE1 GIPTSSVFALHFTEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 GIPTSSVFALHFTEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSW
              370       380       390       400       410       420
              430       440       450       460       470       480
pF1KE1 DNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 DNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVK
              430       440       450       460       470       480
              490       500       510       520       530       540
pF1KE1 SAAVNREYYRQQMEEKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVQDALNIRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 SAAVNREYYRQQMEEKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVQDALNIRE
              490       500       510       520       530       540
              550       560       570       580       590
pF1KE1 AISKAKATENGLVNGENSIPNGTRSENESLNQESKRAVEDAKGSSSDSTAGVKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 AISKAKATENGLVNGENSIPNGTRSENESLNQESKRAVEDAKGSSSDSTAGVKE
              550       560       570       580       590
>>CCDS9952.1 SETD3 gene_id:84193|Hs108|chr14 (296 aa)
initn: 1892 init1: 1892 opt: 1892 Z-score: 2211.9 bits: 418.3 E(32554): 6.9e-117
Smith-Waterman score: 1892; 100.0% identity (100.0% similar) in 283 aa overlap (1-283:1-283)
               10        20        30        40        50        60
pF1KE1 MGKKSRVKTQKSGTGATATVSPKEILNLTSELLQKCSSPAPGPGKEWEEYVQIRTLVEKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 MGKKSRVKTQKSGTGATATVSPKEILNLTSELLQKCSSPAPGPGKEWEEYVQIRTLVEKI
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 RKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 RKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELF
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 LWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 LWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSFWQPYIQTLP
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE1 SEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 SEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDSFT
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE1 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVAL
       :::::::::::::::::::::::::::::::::::::::::::
CCDS99 YEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLTPEDSFALAVASA
              250       260       270       280       290
              310       320       330       340       350       360
pF1KE1 QDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARA
>>CCDS10798.1 SETD6 gene_id:79918|Hs108|chr16 (449 aa)
initn: 135 init1: 59 opt: 275 Z-score: 321.6 bits: 69.1 E(32554): 1.4e-11
Smith-Waterman score: 369; 26.2% identity (53.2% similar) in 442 aa overlap (72-477:15-443)
50 60 70 80 90 100
pF1KE1 GPGKEWEEYVQIRTLVEKIRKKQKGLSVTFDGKREDYFPDLMKWASENGASVEGFEMVNF
:: : ...: . : . :.
CCDS10 MATQAKRPRVAGPVDGGDLDPVACFLSWCRRVGLELSPKVAVSR
10 20 30 40
110 120 130 140 150
pF1KE1 KEE--GFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRI-LQAM-GNIAL
. :.:. : ....: ::.. ::: :. : .. .: : ..:. ::.. : . :
CCDS10 QGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVALQSQSGWVPL
50 60 70 80 90 100
160 170 180 190 200 210
pF1KE1 AFHLLCERASPNSFWQPYIQTLP--SEYDTPLYFEEDEVR-YLQSTQAIHDVFSQYKNTA
. :: : .: : :.::. : .. . :... :.: : ::.: . . : .. :
CCDS10 LLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIR
110 120 130 140 150 160
220 230 240 250 260
pF1KE1 RQY-AYFYKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV----TLA
.: . ...:: .: .. . : :. :. ::. . : : :. . .
CCDS10 SEYQSIVLPFMEAHPDLFSLRVR---SLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPV
170 180 190 200 210
270 280 290 300 310 320
pF1KE1 LIPLWDMCNHTNGLITTGYNLE-DDRC-ECVALQDFRAGEQIYIFYGTRSNAEFVIHSGF
..: :. :: : . . ::: . : . :: : . :..:. :: .: ... ::
CCDS10 MVPAADILNH---LANHNANLEYSANCLRMVATQPIPKGHEIFNTYGQMANWQLIHMYGF
220 230 240 250 260 270
330 340 350 360 370
pF1KE1 F--FDNNSHDRVKIKLGVSKSDRLYAMKAE-----VLARAG-------IPTSSVFALHFT
. .:. : . :.. . . : . :.: : : . ..:..
CCDS10 VEPYPDNTDDTADIQMVTVREAALQGTKTEAERHLVYERWDFLCKLEMVGEEGAFVIGRE
280 290 300 310 320 330
380 390 400 410 420
pF1KE1 EPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKL---WT-F
: .: . :.:.:: ::..: :.. : :. .. : :: : .
CCDS10 EVLTEEELTTTLKVLCMPAEEFRELKDQDGGGDDKREEGS----LTITNIPKLKASWRQL
340 350 360 370 380 390
430 440 450 460 470 480
pF1KE1 LEDRASLLLKTYKTTIEEDKSVLKNHD----LSVRAKMAIKLRLGEKEILEKAVKSAAVN
:.. . : :.:: : .. :...:.:.. :: : ..:...: :.: ::..
CCDS10 LQNSVLLTLQTYATDLKTDQGLLSNKEVYAKLSWREQQALQVRYGQKMILHQLLELTS
400 410 420 430 440
490 500 510 520 530 540
pF1KE1 REYYRQQMEEKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVQDALNIREAISKA
>>CCDS54013.1 SETD6 gene_id:79918|Hs108|chr16 (473 aa)
initn: 135 init1: 59 opt: 271 Z-score: 316.5 bits: 68.3 E(32554): 2.6e-11
Smith-Waterman score: 365; 27.0% identity (54.5% similar) in 407 aa overlap (105-477:74-467)
80 90 100 110 120 130
pF1KE1 REDYFPDLMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA
:.:. : ....: ::.. ::: :. :
CCDS54 AGGRRTRGGARAALTSPPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQ
50 60 70 80 90 100
140 150 160 170 180 190
pF1KE1 KNSVLGPLYSQDRI-LQAM-GNIALAFHLLCERASPNSFWQPYIQTLP--SEYDTPLYFE
.. .: : ..:. ::.. : . : . :: : .: : :.::. : .. . :...
CCDS54 HTCSIGGLLERERVALQSQSGWVPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWP
110 120 130 140 150 160
200 210 220 230 240
pF1KE1 EDEVR-YLQSTQAIHDVFSQYKNTARQY-AYFYKVIQTHPHANKLPLKDSFTYEDYRWAV
:.: : ::.: . . : .. : .: . ...:: .: .. . : :. :
CCDS54 EEERRCLLQGTGVPEAVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVR---SLELYHQLV
170 180 190 200 210
250 260 270 280 290 300
pF1KE1 SSVMTRQNQIPTEDGSRV----TLALIPLWDMCNHTNGLITTGYNLE-DDRC-ECVALQD
. ::. . : : :. . ...: :. :: : . . ::: . : . :: :
CCDS54 ALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNH---LANHNANLEYSANCLRMVATQP
220 230 240 250 260 270
310 320 330 340 350
pF1KE1 FRAGEQIYIFYGTRSNAEFVIHSGFF--FDNNSHDRVKIKLGVSKSDRLYAMKAE-----
. :..:. :: .: ... :: . .:. : . :.. . . : . :.:
CCDS54 IPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAERHL
280 290 300 310 320 330
360 370 380 390 400
pF1KE1 VLARAG-------IPTSSVFALHFTEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRI
: : . ..:.. : .: . :.:.:: ::..: :.. :
CCDS54 VYERWDFLCKLEMVGEEGAFVIGREEVLTEEELTTTLKVLCMPAEEFRELKDQDGGGDDK
340 350 360 370 380 390
410 420 430 440 450 460
pF1KE1 FTLGNSEFPVSWDNEVKL---WT-FLEDRASLLLKTYKTTIEEDKSVLKNHD----LSVR
:. .. : :: : .:.. . : :.:: : .. :...:.:.. :: :
CCDS54 REEGS----LTITNIPKLKASWRQLLQNSVLLTLQTYATDLKTDQGLLSNKEVYAKLSWR
400 410 420 430 440 450
470 480 490 500 510 520
pF1KE1 AKMAIKLRLGEKEILEKAVKSAAVNREYYRQQMEEKAPLPKYEESNLGLLESSVGDSRLP
..:...: :.: ::..
CCDS54 EQQALQVRYGQKMILHQLLELTS
460 470
594 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 14:23:20 2016 done: Sun Nov 6 14:23:21 2016
Total Scan time: 3.540 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]