FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5201, 218 aa 1>>>pF1KE5201 218 - 218 aa - 218 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8332+/-0.000964; mu= 10.2249+/- 0.057 mean_var=60.3840+/-12.432, 0's: 0 Z-trim(103.6): 18 B-trim: 226 in 1/49 Lambda= 0.165049 statistics sampled from 7468 (7475) to 7468 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.612), E-opt: 0.2 (0.23), width: 16 Scan time: 1.370 The best scores are: opt bits E(32554) CCDS14641.1 HPRT1 gene_id:3251|Hs108|chrX ( 218) 1448 353.4 6.5e-98 CCDS7145.1 PRTFDC1 gene_id:56952|Hs108|chr10 ( 225) 1009 248.8 2e-66 CCDS60506.1 PRTFDC1 gene_id:56952|Hs108|chr10 ( 190) 781 194.5 3.7e-50 >>CCDS14641.1 HPRT1 gene_id:3251|Hs108|chrX (218 aa) initn: 1448 init1: 1448 opt: 1448 Z-score: 1871.3 bits: 353.4 E(32554): 6.5e-98 Smith-Waterman score: 1448; 100.0% identity (100.0% similar) in 218 aa overlap (1-218:1-218) 10 20 30 40 50 60 pF1KE5 MATRSPGVVISDDEPGYDLDLFCIPNHYAEDLERVFIPHGLIMDRTERLARDVMKEMGGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MATRSPGVVISDDEPGYDLDLFCIPNHYAEDLERVFIPHGLIMDRTERLARDVMKEMGGH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 HIVALCVLKGGYKFFADLLDYIKALNRNSDRSIPMTVDFIRLKSYCNDQSTGDIKVIGGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 HIVALCVLKGGYKFFADLLDYIKALNRNSDRSIPMTVDFIRLKSYCNDQSTGDIKVIGGD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 DLSTLTGKNVLIVEDIIDTGKTMQTLLSLVRQYNPKMVKVASLLVKRTPRSVGYKPDFVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 DLSTLTGKNVLIVEDIIDTGKTMQTLLSLVRQYNPKMVKVASLLVKRTPRSVGYKPDFVG 130 140 150 160 170 180 190 200 210 pF1KE5 FEIPDKFVVGYALDYNEYFRDLNHVCVISETGKAKYKA :::::::::::::::::::::::::::::::::::::: CCDS14 FEIPDKFVVGYALDYNEYFRDLNHVCVISETGKAKYKA 190 200 210 >>CCDS7145.1 PRTFDC1 gene_id:56952|Hs108|chr10 (225 aa) initn: 1025 init1: 1009 opt: 1009 Z-score: 1306.1 bits: 248.8 E(32554): 2e-66 Smith-Waterman score: 1009; 68.2% identity (88.2% similar) in 211 aa overlap (7-217:14-224) 10 20 30 40 50 pF1KE5 MATRSPGVVISDDEPGYDLDLFCIPNHYAEDLERVFIPHGLIMDRTERLARDV :::: :: :::::.:: :.:: ::: :.::::.:.:: ::::.:. CCDS71 MAGSSEEAPDYGRGVVIMDDWPGYDLNLFTYPQHYYGDLEYVLIPHGIIVDRIERLAKDI 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 MKEMGGHHIVALCVLKGGYKFFADLLDYIKALNRNSDRSIPMTVDFIRLKSYCNDQSTGD ::..: :..:::::::::: :::....: ..::::: . : ::::::::: :::: :. CCDS71 MKDIGYSDIMVLCVLKGGYKFCADLVEHLKNISRNSDRFVSMKVDFIRLKSYRNDQSMGE 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 IKVIGGDDLSTLTGKNVLIVEDIIDTGKTMQTLLSLVRQYNPKMVKVASLLVKRTPRSVG ...:::::::::.:::::::::.. ::.::..::: ...:.:.:.:::::::::: :: : CCDS71 MQIIGGDDLSTLAGKNVLIVEDVVGTGRTMKALLSNIEKYKPNMIKVASLLVKRTSRSDG 130 140 150 160 170 180 180 190 200 210 pF1KE5 YKPDFVGFEIPDKFVVGYALDYNEYFRDLNHVCVISETGKAKYKA ..::..:::::. ::::::::::::::::::.:::.: :: ::. CCDS71 FRPDYAGFEIPNLFVVGYALDYNEYFRDLNHICVINEHGKEKYRV 190 200 210 220 >>CCDS60506.1 PRTFDC1 gene_id:56952|Hs108|chr10 (190 aa) initn: 781 init1: 781 opt: 781 Z-score: 1014.0 bits: 194.5 E(32554): 3.7e-50 Smith-Waterman score: 781; 66.1% identity (87.1% similar) in 171 aa overlap (7-177:14-184) 10 20 30 40 50 pF1KE5 MATRSPGVVISDDEPGYDLDLFCIPNHYAEDLERVFIPHGLIMDRTERLARDV :::: :: :::::.:: :.:: ::: :.::::.:.:: ::::.:. CCDS60 MAGSSEEAPDYGRGVVIMDDWPGYDLNLFTYPQHYYGDLEYVLIPHGIIVDRIERLAKDI 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 MKEMGGHHIVALCVLKGGYKFFADLLDYIKALNRNSDRSIPMTVDFIRLKSYCNDQSTGD ::..: :..:::::::::: :::....: ..::::: . : ::::::::: :::: :. CCDS60 MKDIGYSDIMVLCVLKGGYKFCADLVEHLKNISRNSDRFVSMKVDFIRLKSYRNDQSMGE 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 IKVIGGDDLSTLTGKNVLIVEDIIDTGKTMQTLLSLVRQYNPKMVKVASLLVKRTPRSVG ...:::::::::.:::::::::.. ::.::..::: ...:.:.:.:::::::::: :: : CCDS60 MQIIGGDDLSTLAGKNVLIVEDVVGTGRTMKALLSNIEKYKPNMIKVASLLVKRTSRSDG 130 140 150 160 170 180 180 190 200 210 pF1KE5 YKPDFVGFEIPDKFVVGYALDYNEYFRDLNHVCVISETGKAKYKA ..:: CCDS60 FRPDSHMRHQ 190 218 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:27:48 2016 done: Mon Nov 7 22:27:48 2016 Total Scan time: 1.370 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]