FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3964, 305 aa
1>>>pF1KE3964 305 - 305 aa - 305 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.0641+/-0.00146; mu= 6.3441+/- 0.085
mean_var=162.0436+/-35.027, 0's: 0 Z-trim(103.3): 226 B-trim: 44 in 1/50
Lambda= 0.100753
statistics sampled from 7066 (7334) to 7066 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.587), E-opt: 0.2 (0.225), width: 16
Scan time: 2.150
The best scores are:                                      opt  bits E(32554)
CCDS10300.1 WDR61 gene_id:80349|Hs108|chr15       ( 305) 2061 312.3   3e-85
CCDS76785.1 WDR61 gene_id:80349|Hs108|chr15       ( 212) 1343 207.8 6.1e-54
CCDS2470.1 DAW1 gene_id:164781|Hs108|chr2         ( 415)  446  77.6 1.7e-14
CCDS6981.1 WDR5 gene_id:11091|Hs108|chr9          ( 334)  393  69.9 3.1e-12
CCDS54592.1 POC1A gene_id:25886|Hs108|chr3        ( 359)  377  67.6 1.6e-11
CCDS2846.1 POC1A gene_id:25886|Hs108|chr3         ( 407)  377  67.6 1.8e-11
CCDS10788.1 KATNB1 gene_id:10300|Hs108|chr16      ( 655)  369  66.6 5.6e-11
CCDS3012.1 WDR5B gene_id:54554|Hs108|chr3         ( 330)  361  65.2 7.7e-11
CCDS31869.1 POC1B gene_id:282809|Hs108|chr12      ( 478)  363  65.6 8.2e-11
>>CCDS10300.1 WDR61 gene_id:80349|Hs108|chr15 (305 aa)
initn: 2061 init1: 2061 opt: 2061 Z-score: 1643.9 bits: 312.3 E(32554): 3e-85
Smith-Waterman score: 2061; 100.0% identity (100.0% similar) in 305 aa overlap (1-305:1-305)
10 20 30 40 50 60
pF1KE3 MTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETVVTGSLDDLVKVWKWRDERLDLQWSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETVVTGSLDDLVKVWKWRDERLDLQWSL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 EGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 EGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQY
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 LATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE3 KLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 KLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE3 CPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 CPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHI
250 260 270 280 290 300
pF1KE3 YDCPI
:::::
CCDS10 YDCPI
>>CCDS76785.1 WDR61 gene_id:80349|Hs108|chr15 (212 aa)
initn: 1415 init1: 1343 opt: 1343 Z-score: 1081.9 bits: 207.8 E(32554): 6.1e-54
Smith-Waterman score: 1343; 100.0% identity (100.0% similar) in 199 aa overlap (107-305:14-212)
80 90 100 110 120 130
pF1KE3 PIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQYLATGTHVGKVNIFGVE
::::::::::::::::::::::::::::::
CCDS76 MTNQYGILFKQEQVDAWTLAFSPDSQYLATGTHVGKVNIFGVE
10 20 30 40
140 150 160 170 180 190
pF1KE3 SGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPIRSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 SGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPIRSL
50 60 70 80 90 100
200 210 220 230 240 250
pF1KE3 TFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 TFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSV
110 120 130 140 150 160
260 270 280 290 300
pF1KE3 KVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 KVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI
170 180 190 200 210
>>CCDS2470.1 DAW1 gene_id:164781|Hs108|chr2 (415 aa)
initn: 919 init1: 398 opt: 446 Z-score: 373.5 bits: 77.6 E(32554): 1.7e-14
Smith-Waterman score: 446; 23.9% identity (64.7% similar) in 289 aa overlap (13-301:131-414)
10 20 30 40
pF1KE3 MTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETVVTGSLDD
..: ......:... .. ..:::.:
CCDS24 NKSGSCFITGSYDRTCKLWDTASGEELNTLEGHRNVVYAIAFNN---PYGDKIATGSFDK
110 120 130 140 150
50 60 70 80 90 100
pF1KE3 LVKVWKWRDERLDLQWSLEGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENGKQIKSI
:.:. : ...:: .: .... ..:..:.:. .:::..::... ..
CCDS24 TCKLWSV--ETGKCYHTFRGHTAEIVCLSFNPQSTLVATGSMDTTAKLWDIQNGEEVYTL
160 170 180 190 200 210
110 120 130 140 150 160
pF1KE3 DAGPVDAWTLAFSPDSQYLATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKY
. .. .:.:. ... . ::. : .. ...:.: : . : : ... : .
CCDS24 RGHSAEIISLSFNTSGDRIITGSFDHTVVVWDADTGRKVNILIGHCAEISSASFNWDCSL
220 230 240 250 260 270
170 180 190 200 210 220
pF1KE3 LASGAIDGIINIFDIATGKLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHA
. .:..: ...: ..:: . :: :: : . :. ..:..::: :: .:...
CCDS24 ILTGSMDKTCKLWDATNGKCVATLTGHDDEILDSCFDYTGKLIATASADGTARIFSAATR
280 290 300 310 320 330
230 240 250 260 270 280
pF1KE3 NLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKY
. . : :: . . ...: :. .:....::::....::. : :.... : :.... .
CCDS24 KCIAKLEGHEGEISKISFNPQGNHLLTGSSDKTARIWDAQTGQCLQVLEGHTDEIFSCAF
340 350 360 370 380 390
290 300
pF1KE3 NGNGSKIVSVGDDQEIHIYDCPI
: .:. ... . :. .:.
CCDS24 NYKGNIVITGSKDNTCRIWR
400 410
>>CCDS6981.1 WDR5 gene_id:11091|Hs108|chr9 (334 aa)
initn: 618 init1: 376 opt: 393 Z-score: 333.1 bits: 69.9 E(32554): 3.1e-12
Smith-Waterman score: 393; 39.5% identity (72.8% similar) in 162 aa overlap (141-302:38-199)
120 130 140 150 160 170
pF1KE3 TLAFSPDSQYLATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDG
...: . : . :. .::.:..:::.. :
CCDS69 PETEAARAQPTPSSSATQSKPTPVKPNYALKFTLAGHTKAVSSVKFSPNGEWLASSSADK
10 20 30 40 50 60
180 190 200 210 220 230
pF1KE3 IINIFDIATGKLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSG
.:.:. ::. .:. :: . : ....: ::.:::.:::: .::.::. .. ::.:
CCDS69 LIKIWGAYDGKFEKTISGHKLGISDVAWSSDSNLLVSASDDKTLKIWDVSSGKCLKTLKG
70 80 90 100 110 120
240 250 260 270 280 290
pF1KE3 HASWVLNVAFCPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIV
:...:. : :... .::.: :.::..::: : :..:. :.: : .:..: .:: ::
CCDS69 HSNYVFCCNFNPQSNLIVSGSFDESVRIWDVKTGKCLKTLPAHSDPVSAVHFNRDGSLIV
130 140 150 160 170 180
300
pF1KE3 SVGDDQEIHIYDCPI
: . : .:.:
CCDS69 SSSYDGLCRIWDTASGQCLKTLIDDDNPPVSFVKFSPNGKYILAATLDNTLKLWDYSKGK
190 200 210 220 230 240
>>CCDS54592.1 POC1A gene_id:25886|Hs108|chr3 (359 aa)
initn: 729 init1: 330 opt: 377 Z-score: 320.1 bits: 67.6 E(32554): 1.6e-11
Smith-Waterman score: 377; 30.7% identity (64.3% similar) in 199 aa overlap (105-302:17-215)
80 90 100 110 120 130
pF1KE3 TLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWT-LAFSPDSQYLATGTHVGKVNIF
: :: : . :: ... ::.:. . . ..
CCDS54 MAAPCAEDPSLERHFKGHRDAVTCVDFSINTKQLASGSMDSCLMVW
10 20 30 40
140 150 160 170 180 190
pF1KE3 GVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPI
.. .. : . . . . .::.:. ::::. : . :. . ....:. .
CCDS54 HMKPQSRAYRFTGHKDAVTCVNFSPSGHLLASGSRDKTVRIWVPNVKGESTVFRAHTATV
50 60 70 80 90 100
200 210 220 230 240 250
pF1KE3 RSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSD
::. : :.: .:::::: .:.. ... .. .:: : .:: . : :: .::.:.:
CCDS54 RSVHFCSDGQSFVTASDDKTVKVWATHRQKFLFSLSQHINWVRCAKFSPDGRLIVSASDD
110 120 130 140 150 160
260 270 280 290 300
pF1KE3 KSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI
:.::.:: ..: :::.. .: : : .. .:. :...: :. ....:
CCDS54 KTVKLWDKSSRECVHSYCEHGGFVTYVDFHPSGTCIAAAGMDNTVKVWDVRTHRLLQHYQ
170 180 190 200 210 220
CCDS54 LHSAAVNGLSFHPSGNYLITASSDSTLKILDLMEGRLLYTLHGHQGPATTVAFSRTGEYF
230 240 250 260 270 280
>>CCDS2846.1 POC1A gene_id:25886|Hs108|chr3 (407 aa)
initn: 712 init1: 330 opt: 377 Z-score: 319.5 bits: 67.6 E(32554): 1.8e-11
Smith-Waterman score: 377; 30.7% identity (64.3% similar) in 199 aa overlap (105-302:17-215)
80 90 100 110 120 130
pF1KE3 TLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWT-LAFSPDSQYLATGTHVGKVNIF
: :: : . :: ... ::.:. . . ..
CCDS28 MAAPCAEDPSLERHFKGHRDAVTCVDFSINTKQLASGSMDSCLMVW
10 20 30 40
140 150 160 170 180 190
pF1KE3 GVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPI
.. .. : . . . . .::.:. ::::. : . :. . ....:. .
CCDS28 HMKPQSRAYRFTGHKDAVTCVNFSPSGHLLASGSRDKTVRIWVPNVKGESTVFRAHTATV
50 60 70 80 90 100
200 210 220 230 240 250
pF1KE3 RSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSD
::. : :.: .:::::: .:.. ... .. .:: : .:: . : :: .::.:.:
CCDS28 RSVHFCSDGQSFVTASDDKTVKVWATHRQKFLFSLSQHINWVRCAKFSPDGRLIVSASDD
110 120 130 140 150 160
260 270 280 290 300
pF1KE3 KSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI
:.::.:: ..: :::.. .: : : .. .:. :...: :. ....:
CCDS28 KTVKLWDKSSRECVHSYCEHGGFVTYVDFHPSGTCIAAAGMDNTVKVWDVRTHRLLQHYQ
170 180 190 200 210 220
CCDS28 LHSAAVNGLSFHPSGNYLITASSDSTLKILDLMEGRLLYTLHGHQGPATTVAFSRTGEYF
230 240 250 260 270 280
>>CCDS10788.1 KATNB1 gene_id:10300|Hs108|chr16 (655 aa)
initn: 326 init1: 326 opt: 369 Z-score: 310.5 bits: 66.6 E(32554): 5.6e-11
Smith-Waterman score: 369; 26.3% identity (61.7% similar) in 266 aa overlap (14-279:18-273)
10 20 30 40 50
pF1KE3 MTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETVVTGSLDDLVKVWKWRDERLDL
:: . . :.. : : ... ..::. :: .: : .. .
CCDS10 MATPVVTKTAWKLQEIVAHASNVSSLVLG---KASGRLLATGG-DD-CRVNLWSINKPNC
10 20 30 40 50
60 70 80 90 100 110
pF1KE3 QWSLEGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSP
:: :: : :: .. . ...: .. ::.:::: .: .... . .. .: : :
CCDS10 IMSLTGHTSPVESVRLNTPEELIVAGSQSGSIRVWDLEAAKILRTLMGHKANICSLDFHP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE3 DSQYLATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFD
....:.:.. ..... .. . ... . . .:::::.:::.: : ....:
CCDS10 YGEFVASGSQDTNIKLWDIRRKGCVFRYRGHSQAVRCLRFSPDGKWLASAADDHTVKLWD
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE3 IATGKLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVL
...::.. . ::. :. . : :. ::...:.: :...:... .... . :. . :
CCDS10 LTAGKMMSEFPGHTGPVNVVEFHPNEYLLASGSSDRTIRFWDLEKFQVVSCIEGEPGPVR
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE3 NVAFCPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQ
.: : :: . :. .: :..:. . : :: ::
CCDS10 SVLFNPDGCCLYSGCQD-SLRVYGWEPERC----FDVVLVNWGKVADLAICNDQLIGVAF
240 250 260 270 280 290
300
pF1KE3 EIHIYDCPI
CCDS10 SQSNVSSYVVDLTRVTRTGTVARDPVQDHRPLAQPLPNPSAPLRRIYERPSTTCSKPQRV
300 310 320 330 340 350
>>CCDS3012.1 WDR5B gene_id:54554|Hs108|chr3 (330 aa)
initn: 563 init1: 352 opt: 361 Z-score: 308.0 bits: 65.2 E(32554): 7.7e-11
Smith-Waterman score: 361; 38.8% identity (71.1% similar) in 152 aa overlap (151-302:44-195)
130 140 150 160 170 180
pF1KE3 LATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATG
. :. .::.:..:::.. : .: :. :
CCDS30 ALSSSANQSKEVPENPNYALKCTLVGHTEAVSSVKFSPNGEWLASSSADRLIIIWGAYDG
20 30 40 50 60 70
190 200 210 220 230 240
pF1KE3 KLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAF
: .:: :: . : ....: ::. ::.:::: .:..::. .. ::.::...:. :
CCDS30 KYEKTLYGHNLEISDVAWSSDSSRLVSASDDKTLKLWDVRSGKCLKTLKGHSNYVFCCNF
80 90 100 110 120 130
250 260 270 280 290 300
pF1KE3 CPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHI
: .. ..:.: :..::.:.: : :..:. :.: : .:..: .:: ::: . : .:
CCDS30 NPPSNLIISGSFDETVKIWEVKTGKCLKTLSAHSDPVSAVHFNCSGSLIVSGSYDGLCRI
140 150 160 170 180 190
pF1KE3 YDCPI
.:
CCDS30 WDAASGQCLKTLVDDDNPPVSFVKFSPNGKYILTATLDNTLKLWDYSRGRCLKTYTGHKN
200 210 220 230 240 250
>>CCDS31869.1 POC1B gene_id:282809|Hs108|chr12 (478 aa)
initn: 356 init1: 356 opt: 363 Z-score: 307.6 bits: 65.6 E(32554): 8.2e-11
Smith-Waterman score: 363; 31.1% identity (63.7% similar) in 193 aa overlap (111-302:23-214)
90 100 110 120 130 140
pF1KE3 SSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQYLATGTHVGKVNIFGVESGKK
.: .::... :::.. . ... . .
CCDS31 MASATEDPVLERYFKGHKAAITSLDLSPNGKQLATASWDTFLMLWNFKPHAR
10 20 30 40 50
150 160 170 180 190
pF1KE3 EYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIF-DIATGKLLHTLEGHAMPIRSLTFS
: . . :. .:: :. :::.. : . .. ::. ...:. :.::. ::
CCDS31 AYRYVGHKDVVTSVQFSPHGNLLASASRDRTVRLWIPDKRGKF-SEFKAHTAPVRSVDFS
60 70 80 90 100 110
200 210 220 230 240 250
pF1KE3 PDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSVKVW
:.:.:.:::.: ::.... . . .: :. :: . : :: .:: : ::..:.:
CCDS31 ADGQFLATASEDKSIKVWSMYRQRFLYSLYRHTHWVRCAKFSPDGRLIVSCSEDKTIKIW
120 130 140 150 160 170
260 270 280 290 300
pF1KE3 DVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI
:. .. ::..: : . : .: .:. :.:.:.:: ....:
CCDS31 DTTNKQCVNNFSDSVGFANFVDFNPSGTCIASAGSDQTVKVWDVRVNKLLQHYQVHSGGV
180 190 200 210 220 230
CCDS31 NCISFHPSGNYLITASSDGTLKILDLLEGRLIYTLQGHTGPVFTVSFSKGGELFASGGAD
240 250 260 270 280 290
305 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 08:30:41 2016 done: Sun Nov 6 08:30:42 2016
Total Scan time: 2.150 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]