FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3978, 392 aa 1>>>pF1KE3978 392 - 392 aa - 392 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5163+/-0.000864; mu= 15.1514+/- 0.052 mean_var=66.9585+/-13.298, 0's: 0 Z-trim(106.2): 151 B-trim: 4 in 1/49 Lambda= 0.156737 statistics sampled from 8702 (8864) to 8702 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.659), E-opt: 0.2 (0.272), width: 16 Scan time: 2.610 The best scores are: opt bits E(32554) CCDS33298.1 SPOPL gene_id:339745|Hs108|chr2 ( 392) 2641 606.3 1.6e-173 CCDS11551.1 SPOP gene_id:8405|Hs108|chr17 ( 374) 1932 445.9 2.8e-125 CCDS34609.1 KLHL7 gene_id:55975|Hs108|chr7 ( 586) 262 68.4 1.9e-11 CCDS47418.1 BTBD9 gene_id:114781|Hs108|chr6 ( 612) 259 67.7 3.2e-11 >>CCDS33298.1 SPOPL gene_id:339745|Hs108|chr2 (392 aa) initn: 2641 init1: 2641 opt: 2641 Z-score: 3228.9 bits: 606.3 E(32554): 1.6e-173 Smith-Waterman score: 2641; 100.0% identity (100.0% similar) in 392 aa overlap (1-392:1-392) 10 20 30 40 50 60 pF1KE3 MSREPTPPLPGDMSTGPIAESWCYTQVKVVKFSYMWTINNFSFCREEMGEVLKSSTFSSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MSREPTPPLPGDMSTGPIAESWCYTQVKVVKFSYMWTINNFSFCREEMGEVLKSSTFSSG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 PSDKMKWCLRVNPKGLDDESKDYLSLYLLLVSCPKSEVRAKFKFSLLNAKREETKAMESQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PSDKMKWCLRVNPKGLDDESKDYLSLYLLLVSCPKSEVRAKFKFSLLNAKREETKAMESQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 RAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGHTNTNTLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGHTNTNTLK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VPECRLAEDLGNLWENTRFTDCSFFVRGQEFKAHKSVLAARSPVFNAMFEHEMEESKKNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 VPECRLAEDLGNLWENTRFTDCSFFVRGQEFKAHKSVLAARSPVFNAMFEHEMEESKKNR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 VEINDLDPEVFKEMMRFIYTGRAPNLDKMADNLLAAADKYALERLKVMCEEALCSNLSVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 VEINDLDPEVFKEMMRFIYTGRAPNLDKMADNLLAAADKYALERLKVMCEEALCSNLSVE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 NVADTLVLADLHSAEQLKAQAIDFINRCSVLRQLGCKDGKNWNSNQATDIMETSGWKSMI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 NVADTLVLADLHSAEQLKAQAIDFINRCSVLRQLGCKDGKNWNSNQATDIMETSGWKSMI 310 320 330 340 350 360 370 380 390 pF1KE3 QSHPHLVAEAFRALASAQCPQFGIPRKRLKQS :::::::::::::::::::::::::::::::: CCDS33 QSHPHLVAEAFRALASAQCPQFGIPRKRLKQS 370 380 390 >>CCDS11551.1 SPOP gene_id:8405|Hs108|chr17 (374 aa) initn: 2177 init1: 1932 opt: 1932 Z-score: 2362.7 bits: 445.9 E(32554): 2.8e-125 Smith-Waterman score: 2144; 80.6% identity (91.6% similar) in 392 aa overlap (1-392:1-374) 10 20 30 40 50 60 pF1KE3 MSREPTPPLPGDMSTGPIAESWCYTQVKVVKFSYMWTINNFSFCREEMGEVLKSSTFSSG ::: :.:: :..::.::.::::::::.::::::::::::::::::::::::.:::::::: CCDS11 MSRVPSPPPPAEMSSGPVAESWCYTQIKVVKFSYMWTINNFSFCREEMGEVIKSSTFSSG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 PSDKMKWCLRVNPKGLDDESKDYLSLYLLLVSCPKSEVRAKFKFSLLNAKREETKAMESQ .::.::::::::::::.:::::::::::::::::::::::::::.:::: ::::::::: CCDS11 ANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFSILNAKGEETKAMESQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 RAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGHTNTNTLK ::::::::::::::::::::::::::::::::::::::::::::::::::::... : .: CCDS11 RAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGQNTMNMVK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VPECRLAEDLGNLWENTRFTDCSFFVRGQEFKAHKSVLAARSPVFNAMFEHEMEESKKNR :::::::..::.::::.::::: . : ::::.:::..::::::::.:::::::::::::: CCDS11 VPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPVFSAMFEHEMEESKKNR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 VEINDLDPEVFKEMMRFIYTGRAPNLDKMADNLLAAADKYALERLKVMCEEALCSNLSVE :::::..:::::::: :::::.:::::::::.::::::::::::::::::.::::::::: CCDS11 VEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALERLKVMCEDALCSNLSVE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 NVADTLVLADLHSAEQLKAQAIDFINRCSVLRQLGCKDGKNWNSNQATDIMETSGWKSMI :.:. :.:::::::.:::.::.:::: .:.:..::::::::. CCDS11 NAAEILILADLHSADQLKTQAVDFINY------------------HASDVLETSGWKSMV 310 320 330 340 370 380 390 pF1KE3 QSHPHLVAEAFRALASAQCPQFGIPRKRLKQS :::::::::.:.::::::: .: :::::::: CCDS11 VSHPHLVAEAYRSLASAQCPFLGPPRKRLKQS 350 360 370 >>CCDS34609.1 KLHL7 gene_id:55975|Hs108|chr7 (586 aa) initn: 244 init1: 244 opt: 262 Z-score: 318.8 bits: 68.4 E(32554): 1.9e-11 Smith-Waterman score: 262; 34.1% identity (68.1% similar) in 138 aa overlap (190-327:34-171) 160 170 180 190 200 210 pF1KE3 EVSVVQDSVNISGHTNTNTLKVPECRLAEDLGNLWENTRFTDCSFFVRGQEFKAHKSVLA ..:. .. . : ..:. ... ::. ::: CCDS34 SGVEKSSKKKTEKKLAAREEAKLLAGFMGVMNNMRKQKTLCDVILMVQERKIPAHRVVLA 10 20 30 40 50 60 220 230 240 250 260 270 pF1KE3 ARSPVFNAMFEHEMEESKKNRVEINDLDPEVFKEMMRFIYTGRAPNLDKMADNLLAAADK : : :: :: .: :::. .::..: .:........: ::.: .. ...:: ::.. CCDS34 AASHFFNLMFTTNMLESKSFEVELKDAEPDIIEQLVEFAYTARISVNSNNVQSLLDAANQ 70 80 90 100 110 120 280 290 300 310 320 330 pF1KE3 YALERLKVMCEEALCSNLSVENVADTLVLADLHSAEQLKAQAIDFINRCSVLRQLGCKDG : .: .: :: . : .... : :::. . .::: : :::.. CCDS34 YQIEPVKKMCVDFLKEQVDASNCLGISVLAECLDCPELKATADDFIHQHFTEVYKTDEFL 130 140 150 160 170 180 340 350 360 370 380 390 pF1KE3 KNWNSNQATDIMETSGWKSMIQSHPHLVAEAFRALASAQCPQFGIPRKRLKQS CCDS34 QLDVKRVTHLLNQDTLTVRAEDQVYDAAVRWLKYDEPNRQPFMVDILAKVRFPLISKNFL 190 200 210 220 230 240 >>CCDS47418.1 BTBD9 gene_id:114781|Hs108|chr6 (612 aa) initn: 85 init1: 85 opt: 259 Z-score: 314.9 bits: 67.7 E(32554): 3.2e-11 Smith-Waterman score: 259; 32.2% identity (63.7% similar) in 146 aa overlap (186-327:22-167) 160 170 180 190 200 210 pF1KE3 TLFCEVSVVQDSVNISGHTNTNTLKVPECRLAEDLGNLWENTRFTDCSFFVRGQEFKAHK :.: .: : . .. : .: :. ..: ::. CCDS47 MSNSHPLRPFTAVGEIDHVHILSEHIGALLIGEEYGDVTFVVEKKRFPAHR 10 20 30 40 50 220 230 240 250 260 270 pF1KE3 SVLAARSPVFNAMFEHEMEESK-KNRVEINDLDPEVFKEMMRFIYTGRAPNLDKMAD--- .:::: : :.. :.::. . .. ..: :.: ....:::::: :. . CCDS47 VILAARCQYFRALLYGGMRESQPEAEIPLQDTTAEAFTMLLKYIYTGRATLTDEKEEVLL 60 70 80 90 100 110 280 290 300 310 320 330 pF1KE3 NLLAAADKYALERLKVMCEEALCSNLSVENVADTLVLADLHSAEQLKAQAIDFINRCSVL ..:. : ::.. .:. : ::. :...:: :. .:.:.: .: . :..: CCDS47 DFLSLAHKYGFPELEDSTSEYLCTILNIQNVCMTFDVASLYSLPKLTCMCCMFMDRNAQE 120 130 140 150 160 170 340 350 360 370 380 390 pF1KE3 RQLGCKDGKNWNSNQATDIMETSGWKSMIQSHPHLVAEAFRALASAQCPQFGIPRKRLKQ CCDS47 VLSSEGFLSLSKTALLNIVLRDSFAAPEKDIFLALLNWCKHNSKENHAEIMQAVRLPLMS 180 190 200 210 220 230 392 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 08:24:21 2016 done: Sun Nov 6 08:24:21 2016 Total Scan time: 2.610 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]