FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3964, 305 aa 1>>>pF1KE3964 305 - 305 aa - 305 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.0641+/-0.00146; mu= 6.3441+/- 0.085 mean_var=162.0436+/-35.027, 0's: 0 Z-trim(103.3): 226 B-trim: 44 in 1/50 Lambda= 0.100753 statistics sampled from 7066 (7334) to 7066 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.587), E-opt: 0.2 (0.225), width: 16 Scan time: 2.150 The best scores are: opt bits E(32554) CCDS10300.1 WDR61 gene_id:80349|Hs108|chr15 ( 305) 2061 312.3 3e-85 CCDS76785.1 WDR61 gene_id:80349|Hs108|chr15 ( 212) 1343 207.8 6.1e-54 CCDS2470.1 DAW1 gene_id:164781|Hs108|chr2 ( 415) 446 77.6 1.7e-14 CCDS6981.1 WDR5 gene_id:11091|Hs108|chr9 ( 334) 393 69.9 3.1e-12 CCDS54592.1 POC1A gene_id:25886|Hs108|chr3 ( 359) 377 67.6 1.6e-11 CCDS2846.1 POC1A gene_id:25886|Hs108|chr3 ( 407) 377 67.6 1.8e-11 CCDS10788.1 KATNB1 gene_id:10300|Hs108|chr16 ( 655) 369 66.6 5.6e-11 CCDS3012.1 WDR5B gene_id:54554|Hs108|chr3 ( 330) 361 65.2 7.7e-11 CCDS31869.1 POC1B gene_id:282809|Hs108|chr12 ( 478) 363 65.6 8.2e-11 >>CCDS10300.1 WDR61 gene_id:80349|Hs108|chr15 (305 aa) initn: 2061 init1: 2061 opt: 2061 Z-score: 1643.9 bits: 312.3 E(32554): 3e-85 Smith-Waterman score: 2061; 100.0% identity (100.0% similar) in 305 aa overlap (1-305:1-305) 10 20 30 40 50 60 pF1KE3 MTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETVVTGSLDDLVKVWKWRDERLDLQWSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETVVTGSLDDLVKVWKWRDERLDLQWSL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 EGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 LATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 KLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 KLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 CPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 CPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHI 250 260 270 280 290 300 pF1KE3 YDCPI ::::: CCDS10 YDCPI >>CCDS76785.1 WDR61 gene_id:80349|Hs108|chr15 (212 aa) initn: 1415 init1: 1343 opt: 1343 Z-score: 1081.9 bits: 207.8 E(32554): 6.1e-54 Smith-Waterman score: 1343; 100.0% identity (100.0% similar) in 199 aa overlap (107-305:14-212) 80 90 100 110 120 130 pF1KE3 PIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQYLATGTHVGKVNIFGVE :::::::::::::::::::::::::::::: CCDS76 MTNQYGILFKQEQVDAWTLAFSPDSQYLATGTHVGKVNIFGVE 10 20 30 40 140 150 160 170 180 190 pF1KE3 SGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPIRSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 SGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPIRSL 50 60 70 80 90 100 200 210 220 230 240 250 pF1KE3 TFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 TFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSV 110 120 130 140 150 160 260 270 280 290 300 pF1KE3 KVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 KVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI 170 180 190 200 210 >>CCDS2470.1 DAW1 gene_id:164781|Hs108|chr2 (415 aa) initn: 919 init1: 398 opt: 446 Z-score: 373.5 bits: 77.6 E(32554): 1.7e-14 Smith-Waterman score: 446; 23.9% identity (64.7% similar) in 289 aa overlap (13-301:131-414) 10 20 30 40 pF1KE3 MTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETVVTGSLDD ..: ......:... .. ..:::.: CCDS24 NKSGSCFITGSYDRTCKLWDTASGEELNTLEGHRNVVYAIAFNN---PYGDKIATGSFDK 110 120 130 140 150 50 60 70 80 90 100 pF1KE3 LVKVWKWRDERLDLQWSLEGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENGKQIKSI :.:. : ...:: .: .... ..:..:.:. .:::..::... .. CCDS24 TCKLWSV--ETGKCYHTFRGHTAEIVCLSFNPQSTLVATGSMDTTAKLWDIQNGEEVYTL 160 170 180 190 200 210 110 120 130 140 150 160 pF1KE3 DAGPVDAWTLAFSPDSQYLATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKY . .. .:.:. ... . ::. : .. ...:.: : . : : ... : . CCDS24 RGHSAEIISLSFNTSGDRIITGSFDHTVVVWDADTGRKVNILIGHCAEISSASFNWDCSL 220 230 240 250 260 270 170 180 190 200 210 220 pF1KE3 LASGAIDGIINIFDIATGKLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHA . .:..: ...: ..:: . :: :: : . :. ..:..::: :: .:... CCDS24 ILTGSMDKTCKLWDATNGKCVATLTGHDDEILDSCFDYTGKLIATASADGTARIFSAATR 280 290 300 310 320 330 230 240 250 260 270 280 pF1KE3 NLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKY . . : :: . . ...: :. .:....::::....::. : :.... : :.... . CCDS24 KCIAKLEGHEGEISKISFNPQGNHLLTGSSDKTARIWDAQTGQCLQVLEGHTDEIFSCAF 340 350 360 370 380 390 290 300 pF1KE3 NGNGSKIVSVGDDQEIHIYDCPI : .:. ... . :. .:. CCDS24 NYKGNIVITGSKDNTCRIWR 400 410 >>CCDS6981.1 WDR5 gene_id:11091|Hs108|chr9 (334 aa) initn: 618 init1: 376 opt: 393 Z-score: 333.1 bits: 69.9 E(32554): 3.1e-12 Smith-Waterman score: 393; 39.5% identity (72.8% similar) in 162 aa overlap (141-302:38-199) 120 130 140 150 160 170 pF1KE3 TLAFSPDSQYLATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDG ...: . : . :. .::.:..:::.. : CCDS69 PETEAARAQPTPSSSATQSKPTPVKPNYALKFTLAGHTKAVSSVKFSPNGEWLASSSADK 10 20 30 40 50 60 180 190 200 210 220 230 pF1KE3 IINIFDIATGKLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSG .:.:. ::. .:. :: . : ....: ::.:::.:::: .::.::. .. ::.: CCDS69 LIKIWGAYDGKFEKTISGHKLGISDVAWSSDSNLLVSASDDKTLKIWDVSSGKCLKTLKG 70 80 90 100 110 120 240 250 260 270 280 290 pF1KE3 HASWVLNVAFCPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIV :...:. : :... .::.: :.::..::: : :..:. :.: : .:..: .:: :: CCDS69 HSNYVFCCNFNPQSNLIVSGSFDESVRIWDVKTGKCLKTLPAHSDPVSAVHFNRDGSLIV 130 140 150 160 170 180 300 pF1KE3 SVGDDQEIHIYDCPI : . : .:.: CCDS69 SSSYDGLCRIWDTASGQCLKTLIDDDNPPVSFVKFSPNGKYILAATLDNTLKLWDYSKGK 190 200 210 220 230 240 >>CCDS54592.1 POC1A gene_id:25886|Hs108|chr3 (359 aa) initn: 729 init1: 330 opt: 377 Z-score: 320.1 bits: 67.6 E(32554): 1.6e-11 Smith-Waterman score: 377; 30.7% identity (64.3% similar) in 199 aa overlap (105-302:17-215) 80 90 100 110 120 130 pF1KE3 TLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWT-LAFSPDSQYLATGTHVGKVNIF : :: : . :: ... ::.:. . . .. CCDS54 MAAPCAEDPSLERHFKGHRDAVTCVDFSINTKQLASGSMDSCLMVW 10 20 30 40 140 150 160 170 180 190 pF1KE3 GVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPI .. .. : . . . . .::.:. ::::. : . :. . ....:. . CCDS54 HMKPQSRAYRFTGHKDAVTCVNFSPSGHLLASGSRDKTVRIWVPNVKGESTVFRAHTATV 50 60 70 80 90 100 200 210 220 230 240 250 pF1KE3 RSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSD ::. : :.: .:::::: .:.. ... .. .:: : .:: . : :: .::.:.: CCDS54 RSVHFCSDGQSFVTASDDKTVKVWATHRQKFLFSLSQHINWVRCAKFSPDGRLIVSASDD 110 120 130 140 150 160 260 270 280 290 300 pF1KE3 KSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI :.::.:: ..: :::.. .: : : .. .:. :...: :. ....: CCDS54 KTVKLWDKSSRECVHSYCEHGGFVTYVDFHPSGTCIAAAGMDNTVKVWDVRTHRLLQHYQ 170 180 190 200 210 220 CCDS54 LHSAAVNGLSFHPSGNYLITASSDSTLKILDLMEGRLLYTLHGHQGPATTVAFSRTGEYF 230 240 250 260 270 280 >>CCDS2846.1 POC1A gene_id:25886|Hs108|chr3 (407 aa) initn: 712 init1: 330 opt: 377 Z-score: 319.5 bits: 67.6 E(32554): 1.8e-11 Smith-Waterman score: 377; 30.7% identity (64.3% similar) in 199 aa overlap (105-302:17-215) 80 90 100 110 120 130 pF1KE3 TLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWT-LAFSPDSQYLATGTHVGKVNIF : :: : . :: ... ::.:. . . .. CCDS28 MAAPCAEDPSLERHFKGHRDAVTCVDFSINTKQLASGSMDSCLMVW 10 20 30 40 140 150 160 170 180 190 pF1KE3 GVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPI .. .. : . . . . .::.:. ::::. : . :. . ....:. . CCDS28 HMKPQSRAYRFTGHKDAVTCVNFSPSGHLLASGSRDKTVRIWVPNVKGESTVFRAHTATV 50 60 70 80 90 100 200 210 220 230 240 250 pF1KE3 RSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSD ::. : :.: .:::::: .:.. ... .. .:: : .:: . : :: .::.:.: CCDS28 RSVHFCSDGQSFVTASDDKTVKVWATHRQKFLFSLSQHINWVRCAKFSPDGRLIVSASDD 110 120 130 140 150 160 260 270 280 290 300 pF1KE3 KSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI :.::.:: ..: :::.. .: : : .. .:. :...: :. ....: CCDS28 KTVKLWDKSSRECVHSYCEHGGFVTYVDFHPSGTCIAAAGMDNTVKVWDVRTHRLLQHYQ 170 180 190 200 210 220 CCDS28 LHSAAVNGLSFHPSGNYLITASSDSTLKILDLMEGRLLYTLHGHQGPATTVAFSRTGEYF 230 240 250 260 270 280 >>CCDS10788.1 KATNB1 gene_id:10300|Hs108|chr16 (655 aa) initn: 326 init1: 326 opt: 369 Z-score: 310.5 bits: 66.6 E(32554): 5.6e-11 Smith-Waterman score: 369; 26.3% identity (61.7% similar) in 266 aa overlap (14-279:18-273) 10 20 30 40 50 pF1KE3 MTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETVVTGSLDDLVKVWKWRDERLDL :: . . :.. : : ... ..::. :: .: : .. . CCDS10 MATPVVTKTAWKLQEIVAHASNVSSLVLG---KASGRLLATGG-DD-CRVNLWSINKPNC 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 QWSLEGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSP :: :: : :: .. . ...: .. ::.:::: .: .... . .. .: : : CCDS10 IMSLTGHTSPVESVRLNTPEELIVAGSQSGSIRVWDLEAAKILRTLMGHKANICSLDFHP 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 DSQYLATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFD ....:.:.. ..... .. . ... . . .:::::.:::.: : ....: CCDS10 YGEFVASGSQDTNIKLWDIRRKGCVFRYRGHSQAVRCLRFSPDGKWLASAADDHTVKLWD 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE3 IATGKLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVL ...::.. . ::. :. . : :. ::...:.: :...:... .... . :. . : CCDS10 LTAGKMMSEFPGHTGPVNVVEFHPNEYLLASGSSDRTIRFWDLEKFQVVSCIEGEPGPVR 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE3 NVAFCPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQ .: : :: . :. .: :..:. . : :: :: CCDS10 SVLFNPDGCCLYSGCQD-SLRVYGWEPERC----FDVVLVNWGKVADLAICNDQLIGVAF 240 250 260 270 280 290 300 pF1KE3 EIHIYDCPI CCDS10 SQSNVSSYVVDLTRVTRTGTVARDPVQDHRPLAQPLPNPSAPLRRIYERPSTTCSKPQRV 300 310 320 330 340 350 >>CCDS3012.1 WDR5B gene_id:54554|Hs108|chr3 (330 aa) initn: 563 init1: 352 opt: 361 Z-score: 308.0 bits: 65.2 E(32554): 7.7e-11 Smith-Waterman score: 361; 38.8% identity (71.1% similar) in 152 aa overlap (151-302:44-195) 130 140 150 160 170 180 pF1KE3 LATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIFDIATG . :. .::.:..:::.. : .: :. : CCDS30 ALSSSANQSKEVPENPNYALKCTLVGHTEAVSSVKFSPNGEWLASSSADRLIIIWGAYDG 20 30 40 50 60 70 190 200 210 220 230 240 pF1KE3 KLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAF : .:: :: . : ....: ::. ::.:::: .:..::. .. ::.::...:. : CCDS30 KYEKTLYGHNLEISDVAWSSDSSRLVSASDDKTLKLWDVRSGKCLKTLKGHSNYVFCCNF 80 90 100 110 120 130 250 260 270 280 290 300 pF1KE3 CPDDTHFVSSSSDKSVKVWDVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHI : .. ..:.: :..::.:.: : :..:. :.: : .:..: .:: ::: . : .: CCDS30 NPPSNLIISGSFDETVKIWEVKTGKCLKTLSAHSDPVSAVHFNCSGSLIVSGSYDGLCRI 140 150 160 170 180 190 pF1KE3 YDCPI .: CCDS30 WDAASGQCLKTLVDDDNPPVSFVKFSPNGKYILTATLDNTLKLWDYSRGRCLKTYTGHKN 200 210 220 230 240 250 >>CCDS31869.1 POC1B gene_id:282809|Hs108|chr12 (478 aa) initn: 356 init1: 356 opt: 363 Z-score: 307.6 bits: 65.6 E(32554): 8.2e-11 Smith-Waterman score: 363; 31.1% identity (63.7% similar) in 193 aa overlap (111-302:23-214) 90 100 110 120 130 140 pF1KE3 SSSLDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQYLATGTHVGKVNIFGVESGKK .: .::... :::.. . ... . . CCDS31 MASATEDPVLERYFKGHKAAITSLDLSPNGKQLATASWDTFLMLWNFKPHAR 10 20 30 40 50 150 160 170 180 190 pF1KE3 EYSLDTRGKFILSIAYSPDGKYLASGAIDGIINIF-DIATGKLLHTLEGHAMPIRSLTFS : . . :. .:: :. :::.. : . .. ::. ...:. :.::. :: CCDS31 AYRYVGHKDVVTSVQFSPHGNLLASASRDRTVRLWIPDKRGKF-SEFKAHTAPVRSVDFS 60 70 80 90 100 110 200 210 220 230 240 250 pF1KE3 PDSQLLVTASDDGYIKIYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSVKVW :.:.:.:::.: ::.... . . .: :. :: . : :: .:: : ::..:.: CCDS31 ADGQFLATASEDKSIKVWSMYRQRFLYSLYRHTHWVRCAKFSPDGRLIVSCSEDKTIKIW 120 130 140 150 160 170 260 270 280 290 300 pF1KE3 DVGTRTCVHTFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDCPI :. .. ::..: : . : .: .:. :.:.:.:: ....: CCDS31 DTTNKQCVNNFSDSVGFANFVDFNPSGTCIASAGSDQTVKVWDVRVNKLLQHYQVHSGGV 180 190 200 210 220 230 CCDS31 NCISFHPSGNYLITASSDGTLKILDLLEGRLIYTLQGHTGPVFTVSFSKGGELFASGGAD 240 250 260 270 280 290 305 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 08:30:41 2016 done: Sun Nov 6 08:30:42 2016 Total Scan time: 2.150 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]