FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1307, 472 aa 1>>>pF1KE1307 472 - 472 aa - 472 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3831+/-0.000781; mu= 17.4396+/- 0.047 mean_var=68.4953+/-13.721, 0's: 0 Z-trim(108.6): 10 B-trim: 80 in 2/50 Lambda= 0.154969 statistics sampled from 10338 (10344) to 10338 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.697), E-opt: 0.2 (0.318), width: 16 Scan time: 2.100 The best scores are: opt bits E(32554) CCDS995.1 SELENBP1 gene_id:8991|Hs108|chr1 ( 472) 3284 743.2 1.4e-214 CCDS60266.1 SELENBP1 gene_id:8991|Hs108|chr1 ( 514) 3277 741.6 4.5e-214 CCDS58027.1 SELENBP1 gene_id:8991|Hs108|chr1 ( 410) 2475 562.3 3.5e-160 >>CCDS995.1 SELENBP1 gene_id:8991|Hs108|chr1 (472 aa) initn: 3284 init1: 3284 opt: 3284 Z-score: 3965.8 bits: 743.2 E(32554): 1.4e-214 Smith-Waterman score: 3284; 100.0% identity (100.0% similar) in 472 aa overlap (1-472:1-472) 10 20 30 40 50 60 pF1KE1 MATKCGNCGPGYSTPLEAMKGPREEIVYLPCIYRNTGTEAPDYLATVDVDPKSPQYCQVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 MATKCGNCGPGYSTPLEAMKGPREEIVYLPCIYRNTGTEAPDYLATVDVDPKSPQYCQVI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HRLPMPNLKDELHHSGWNTCSSCFGDSTKSRTKLVLPSLISSRIYVVDVGSEPRAPKLHK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 HRLPMPNLKDELHHSGWNTCSSCFGDSTKSRTKLVLPSLISSRIYVVDVGSEPRAPKLHK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 VIEPKDIHAKCELAFLHTSHCLASGEVMISSLGDVKGNGKGGFVLLDGETFEVKGTWERP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 VIEPKDIHAKCELAFLHTSHCLASGEVMISSLGDVKGNGKGGFVLLDGETFEVKGTWERP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 GGAAPLGYDFWYQPRHNVMISTEWAAPNVLRDGFNPADVEAGLYGSHLYVWDWQRHEIVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 GGAAPLGYDFWYQPRHNVMISTEWAAPNVLRDGFNPADVEAGLYGSHLYVWDWQRHEIVQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 TLSLKDGLIPLEIRFLHNPDAAQGFVGCALSSTIQRFYKNEGGTWSVEKVIQVPPKKVKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 TLSLKDGLIPLEIRFLHNPDAAQGFVGCALSSTIQRFYKNEGGTWSVEKVIQVPPKKVKG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 WLLPEMPGLITDILLSLDDRFLYFSNWLHGDLRQYDISDPQRPRLTGQLFLGGSIVKGGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 WLLPEMPGLITDILLSLDDRFLYFSNWLHGDLRQYDISDPQRPRLTGQLFLGGSIVKGGP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 VQVLEDEELKSQPEPLVVKGKRVAGGPQMIQLSLDGKRLYITTSLYSAWDKQFYPDLIRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 VQVLEDEELKSQPEPLVVKGKRVAGGPQMIQLSLDGKRLYITTSLYSAWDKQFYPDLIRE 370 380 390 400 410 420 430 440 450 460 470 pF1KE1 GSVMLQVDVDTVKGGLKLNPNFLVDFGKEPLGPALAHELRYPGGDCSSDIWI :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS99 GSVMLQVDVDTVKGGLKLNPNFLVDFGKEPLGPALAHELRYPGGDCSSDIWI 430 440 450 460 470 >>CCDS60266.1 SELENBP1 gene_id:8991|Hs108|chr1 (514 aa) initn: 3277 init1: 3277 opt: 3277 Z-score: 3956.8 bits: 741.6 E(32554): 4.5e-214 Smith-Waterman score: 3277; 100.0% identity (100.0% similar) in 471 aa overlap (2-472:44-514) 10 20 30 pF1KE1 MATKCGNCGPGYSTPLEAMKGPREEIVYLPC :::::::::::::::::::::::::::::: CCDS60 WPAGMCAAERAEGAFTLQSVAQPMRPIASTATKCGNCGPGYSTPLEAMKGPREEIVYLPC 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE1 IYRNTGTEAPDYLATVDVDPKSPQYCQVIHRLPMPNLKDELHHSGWNTCSSCFGDSTKSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 IYRNTGTEAPDYLATVDVDPKSPQYCQVIHRLPMPNLKDELHHSGWNTCSSCFGDSTKSR 80 90 100 110 120 130 100 110 120 130 140 150 pF1KE1 TKLVLPSLISSRIYVVDVGSEPRAPKLHKVIEPKDIHAKCELAFLHTSHCLASGEVMISS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 TKLVLPSLISSRIYVVDVGSEPRAPKLHKVIEPKDIHAKCELAFLHTSHCLASGEVMISS 140 150 160 170 180 190 160 170 180 190 200 210 pF1KE1 LGDVKGNGKGGFVLLDGETFEVKGTWERPGGAAPLGYDFWYQPRHNVMISTEWAAPNVLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 LGDVKGNGKGGFVLLDGETFEVKGTWERPGGAAPLGYDFWYQPRHNVMISTEWAAPNVLR 200 210 220 230 240 250 220 230 240 250 260 270 pF1KE1 DGFNPADVEAGLYGSHLYVWDWQRHEIVQTLSLKDGLIPLEIRFLHNPDAAQGFVGCALS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 DGFNPADVEAGLYGSHLYVWDWQRHEIVQTLSLKDGLIPLEIRFLHNPDAAQGFVGCALS 260 270 280 290 300 310 280 290 300 310 320 330 pF1KE1 STIQRFYKNEGGTWSVEKVIQVPPKKVKGWLLPEMPGLITDILLSLDDRFLYFSNWLHGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 STIQRFYKNEGGTWSVEKVIQVPPKKVKGWLLPEMPGLITDILLSLDDRFLYFSNWLHGD 320 330 340 350 360 370 340 350 360 370 380 390 pF1KE1 LRQYDISDPQRPRLTGQLFLGGSIVKGGPVQVLEDEELKSQPEPLVVKGKRVAGGPQMIQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 LRQYDISDPQRPRLTGQLFLGGSIVKGGPVQVLEDEELKSQPEPLVVKGKRVAGGPQMIQ 380 390 400 410 420 430 400 410 420 430 440 450 pF1KE1 LSLDGKRLYITTSLYSAWDKQFYPDLIREGSVMLQVDVDTVKGGLKLNPNFLVDFGKEPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 LSLDGKRLYITTSLYSAWDKQFYPDLIREGSVMLQVDVDTVKGGLKLNPNFLVDFGKEPL 440 450 460 470 480 490 460 470 pF1KE1 GPALAHELRYPGGDCSSDIWI ::::::::::::::::::::: CCDS60 GPALAHELRYPGGDCSSDIWI 500 510 >>CCDS58027.1 SELENBP1 gene_id:8991|Hs108|chr1 (410 aa) initn: 2475 init1: 2475 opt: 2475 Z-score: 2989.2 bits: 562.3 E(32554): 3.5e-160 Smith-Waterman score: 2724; 86.9% identity (86.9% similar) in 472 aa overlap (1-472:1-410) 10 20 30 40 50 60 pF1KE1 MATKCGNCGPGYSTPLEAMKGPREEIVYLPCIYRNTGTEAPDYLATVDVDPKSPQYCQVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MATKCGNCGPGYSTPLEAMKGPREEIVYLPCIYRNTGTEAPDYLATVDVDPKSPQYCQVI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HRLPMPNLKDELHHSGWNTCSSCFGDSTKSRTKLVLPSLISSRIYVVDVGSEPRAPKLHK CCDS58 ------------------------------------------------------------ 130 140 150 160 170 180 pF1KE1 VIEPKDIHAKCELAFLHTSHCLASGEVMISSLGDVKGNGKGGFVLLDGETFEVKGTWERP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 --EPKDIHAKCELAFLHTSHCLASGEVMISSLGDVKGNGKGGFVLLDGETFEVKGTWERP 70 80 90 100 110 190 200 210 220 230 240 pF1KE1 GGAAPLGYDFWYQPRHNVMISTEWAAPNVLRDGFNPADVEAGLYGSHLYVWDWQRHEIVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 GGAAPLGYDFWYQPRHNVMISTEWAAPNVLRDGFNPADVEAGLYGSHLYVWDWQRHEIVQ 120 130 140 150 160 170 250 260 270 280 290 300 pF1KE1 TLSLKDGLIPLEIRFLHNPDAAQGFVGCALSSTIQRFYKNEGGTWSVEKVIQVPPKKVKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 TLSLKDGLIPLEIRFLHNPDAAQGFVGCALSSTIQRFYKNEGGTWSVEKVIQVPPKKVKG 180 190 200 210 220 230 310 320 330 340 350 360 pF1KE1 WLLPEMPGLITDILLSLDDRFLYFSNWLHGDLRQYDISDPQRPRLTGQLFLGGSIVKGGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 WLLPEMPGLITDILLSLDDRFLYFSNWLHGDLRQYDISDPQRPRLTGQLFLGGSIVKGGP 240 250 260 270 280 290 370 380 390 400 410 420 pF1KE1 VQVLEDEELKSQPEPLVVKGKRVAGGPQMIQLSLDGKRLYITTSLYSAWDKQFYPDLIRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 VQVLEDEELKSQPEPLVVKGKRVAGGPQMIQLSLDGKRLYITTSLYSAWDKQFYPDLIRE 300 310 320 330 340 350 430 440 450 460 470 pF1KE1 GSVMLQVDVDTVKGGLKLNPNFLVDFGKEPLGPALAHELRYPGGDCSSDIWI :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 GSVMLQVDVDTVKGGLKLNPNFLVDFGKEPLGPALAHELRYPGGDCSSDIWI 360 370 380 390 400 410 472 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:04:47 2016 done: Mon Nov 7 02:04:47 2016 Total Scan time: 2.100 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]