FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4546, 640 aa 1>>>pF1KE4546 640 - 640 aa - 640 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6207+/-0.000936; mu= 18.4055+/- 0.057 mean_var=75.5856+/-15.084, 0's: 0 Z-trim(105.7): 40 B-trim: 0 in 0/52 Lambda= 0.147521 statistics sampled from 8564 (8594) to 8564 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.633), E-opt: 0.2 (0.264), width: 16 Scan time: 2.980 The best scores are: opt bits E(32554) CCDS4942.1 EFHC1 gene_id:114327|Hs108|chr6 ( 640) 4298 924.5 0 CCDS55021.1 EFHC1 gene_id:114327|Hs108|chr6 ( 621) 4153 893.6 0 CCDS55405.1 EFHC2 gene_id:80258|Hs108|chrX ( 749) 701 159.0 2.2e-38 >>CCDS4942.1 EFHC1 gene_id:114327|Hs108|chr6 (640 aa) initn: 4298 init1: 4298 opt: 4298 Z-score: 4941.1 bits: 924.5 E(32554): 0 Smith-Waterman score: 4298; 99.7% identity (99.8% similar) in 640 aa overlap (1-640:1-640) 10 20 30 40 50 60 pF1KE4 MVSNPVHGLPFLPGTSFKDSTKTAFHRSQTLSYRNGYAIVRRPTVGIGGDRLQFNQLSQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 MVSNPVHGLPFLPGTSFKDSTKTAFHRSQTLSYRNGYAIVRRPTVGIGGDRLQFNQLSQA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 ELDELASKAPVLTYGQPKQAPPADFIPAHVAFDKKVLKFDAYFQEDVPMSTEEQYRIRQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 ELDELASKAPVLTYGQPKQAPPADFIPAHVAFDKKVLKFDAYFQEDVPMSTEEQYRIRQV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 NIYYYLEDDSMSVIEPVVENSGILQGKLIKRQRLAKNDRGDHYHWKDLNRGINITIYGKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 NIYYYLEDDSMSVIEPVVENSGILQGKLIKRQRLAKNDRGDHYHWKDLNRGINITIYGKT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 FRVVDCDQFTQVFLESQGIELNPPEKMALDPYTELRKQPLRKYVTPSDFDQLKQFLTFDK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 FRVVDCDQFTQVFLESQGIELNPPEKMALDPYTELRKQPLRKYVTPSDFDQLKQFLTFDK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 QVLRFYAIWDDTDSMYGECRTYIIHYYLMDDTVEIREVHERNDGIDPFPLLMNRQRVPKV :::::::::::::::::::::::::::::::::::::::::::: ::::::::::::::: CCDS49 QVLRFYAIWDDTDSMYGECRTYIIHYYLMDDTVEIREVHERNDGRDPFPLLMNRQRVPKV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 LVENAKNFPQCVLEISDQEVLEWYTAKDFIVGKSLTILGRTFFIYDCDPFTRRYYKEKFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 LVENAKNFPQCVLEISDQEVLEWYTAKDFIVGKSLTILGRTFFIYDCDPFTRRYYKEKFG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 ITDLPRIDVSKREPPPVKQELPPYNGFGLVEDSAQNCFALIPKAPKKDVIKMLVNDNKVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 ITDLPRIDVSKREPPPVKQELPPYNGFGLVEDSAQNCFALIPKAPKKDVIKMLVNDNKVL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 RYLAVLESPIPEDKDRRFVFSYFLATDMISIFEPPVRNSGIIGGKYLGRTKVVKPYSTVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 RYLAVLESPIPEDKDRRFVFSYFLATDMISIFEPPVRNSGIIGGKYLGRTKVVKPYSTVD 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE4 NPVYYGPSDFFIGAVIEVFGHRFIILDTDEYVLKYMESNAAQYSPEALASIQNHVRKREA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 NPVYYGPSDFFIGAVIEVFGHRFIILDTDEYVLKYMESNAAQYSPEALASIQNHVRKREA 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE4 PAPEAESKQTEKDPGVQELEALIDTIQKQLKDHSCKDNIREAFQIYDKEASGYVDRDMFF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 PAPEAESKQTEKDPGVQELEALIDTIQKQLKDHSCKDNIREAFQIYDKEASGYVDRDMFF 550 560 570 580 590 600 610 620 630 640 pF1KE4 KICESLNVPVDDSLVKELLRMCSHGEGKINYYNFVRAFSN ::::::::::::::::::.::::::::::::::::::::: CCDS49 KICESLNVPVDDSLVKELIRMCSHGEGKINYYNFVRAFSN 610 620 630 640 >>CCDS55021.1 EFHC1 gene_id:114327|Hs108|chr6 (621 aa) initn: 4153 init1: 4153 opt: 4153 Z-score: 4774.5 bits: 893.6 E(32554): 0 Smith-Waterman score: 4153; 99.7% identity (99.8% similar) in 619 aa overlap (22-640:3-621) 10 20 30 40 50 60 pF1KE4 MVSNPVHGLPFLPGTSFKDSTKTAFHRSQTLSYRNGYAIVRRPTVGIGGDRLQFNQLSQA ::::::::::::::::::::::::::::::::::::::: CCDS55 MLKTAFHRSQTLSYRNGYAIVRRPTVGIGGDRLQFNQLSQA 10 20 30 40 70 80 90 100 110 120 pF1KE4 ELDELASKAPVLTYGQPKQAPPADFIPAHVAFDKKVLKFDAYFQEDVPMSTEEQYRIRQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ELDELASKAPVLTYGQPKQAPPADFIPAHVAFDKKVLKFDAYFQEDVPMSTEEQYRIRQV 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE4 NIYYYLEDDSMSVIEPVVENSGILQGKLIKRQRLAKNDRGDHYHWKDLNRGINITIYGKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 NIYYYLEDDSMSVIEPVVENSGILQGKLIKRQRLAKNDRGDHYHWKDLNRGINITIYGKT 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE4 FRVVDCDQFTQVFLESQGIELNPPEKMALDPYTELRKQPLRKYVTPSDFDQLKQFLTFDK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 FRVVDCDQFTQVFLESQGIELNPPEKMALDPYTELRKQPLRKYVTPSDFDQLKQFLTFDK 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE4 QVLRFYAIWDDTDSMYGECRTYIIHYYLMDDTVEIREVHERNDGIDPFPLLMNRQRVPKV :::::::::::::::::::::::::::::::::::::::::::: ::::::::::::::: CCDS55 QVLRFYAIWDDTDSMYGECRTYIIHYYLMDDTVEIREVHERNDGRDPFPLLMNRQRVPKV 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE4 LVENAKNFPQCVLEISDQEVLEWYTAKDFIVGKSLTILGRTFFIYDCDPFTRRYYKEKFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 LVENAKNFPQCVLEISDQEVLEWYTAKDFIVGKSLTILGRTFFIYDCDPFTRRYYKEKFG 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE4 ITDLPRIDVSKREPPPVKQELPPYNGFGLVEDSAQNCFALIPKAPKKDVIKMLVNDNKVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ITDLPRIDVSKREPPPVKQELPPYNGFGLVEDSAQNCFALIPKAPKKDVIKMLVNDNKVL 350 360 370 380 390 400 430 440 450 460 470 480 pF1KE4 RYLAVLESPIPEDKDRRFVFSYFLATDMISIFEPPVRNSGIIGGKYLGRTKVVKPYSTVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 RYLAVLESPIPEDKDRRFVFSYFLATDMISIFEPPVRNSGIIGGKYLGRTKVVKPYSTVD 410 420 430 440 450 460 490 500 510 520 530 540 pF1KE4 NPVYYGPSDFFIGAVIEVFGHRFIILDTDEYVLKYMESNAAQYSPEALASIQNHVRKREA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 NPVYYGPSDFFIGAVIEVFGHRFIILDTDEYVLKYMESNAAQYSPEALASIQNHVRKREA 470 480 490 500 510 520 550 560 570 580 590 600 pF1KE4 PAPEAESKQTEKDPGVQELEALIDTIQKQLKDHSCKDNIREAFQIYDKEASGYVDRDMFF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 PAPEAESKQTEKDPGVQELEALIDTIQKQLKDHSCKDNIREAFQIYDKEASGYVDRDMFF 530 540 550 560 570 580 610 620 630 640 pF1KE4 KICESLNVPVDDSLVKELLRMCSHGEGKINYYNFVRAFSN ::::::::::::::::::.::::::::::::::::::::: CCDS55 KICESLNVPVDDSLVKELIRMCSHGEGKINYYNFVRAFSN 590 600 610 620 >>CCDS55405.1 EFHC2 gene_id:80258|Hs108|chrX (749 aa) initn: 954 init1: 240 opt: 701 Z-score: 802.7 bits: 159.0 E(32554): 2.2e-38 Smith-Waterman score: 1195; 36.6% identity (63.6% similar) in 580 aa overlap (9-544:3-562) 10 20 30 40 50 pF1KE4 MVSNPVHGLPFLPGTSF-KDSTKTAFHRSQTLSYRNG-YAIVRRPTVGIGGDRLQFNQLS ::.:::.:: .. : ::.:: .. :. . .: ::::. : :. CCDS55 MALPLLPGNSFNRNVGKEKFHKSQHWGFCNNVMMLVSDEKPGIGGEPL----LG 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 QAELDELASKAPVLTYGQPKQAPPADFIPAHVAFDKKVLKFDAYFQEDVPMSTEEQYRIR : .. : . :. .. .:. :::::.::.::::..:.: ... .:::: CCDS55 Q----KIKPKCSIYPKGDGSD------VPSWVAFDKQVLSFDAYLEEEVLDKSQTNYRIR 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 QVNIYYYLEDDSMSVIEPVVENSGILQGKLIKRQR--LAKNDRGDHYHWKDLNRGINITI .::.: :::...: :: :.:::.::: :.:.: : :. . : .: : .... CCDS55 YYKIYFYPEDDTIQVNEPEVKNSGLLQGTSIRRHRITLPPPDEDQFYTVYHFNVGTEVVF 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 YGKTFRVVDCDQFTQVFLESQGIELNPPEKMALDPYTELRKQ------PLRKYVTPSDFD ::.::.. ::: ::. ::.. :...::: . ::: ..:.. ::: : . .: CCDS55 YGRTFKIYDCDAFTRNFLRKIGVKVNPPVQCPEDPYMKIRREVVEHVEPLRPYES---LD 170 180 190 200 210 240 250 260 270 280 290 pF1KE4 QLKQFLTFDKQVLRFYAIWDDTDSMYGECRTYIIHYYLMDDTVEIREVHERNDGIDPFPL ::::: . ..: :. .:::. ::.:. : :.::.: :::.::.:. ...: : . . CCDS55 TLKQFLQYHGKILCFFCLWDDSVSMFGDRRELILHYFLCDDTIEIKELLPHSSGRDALKM 220 230 240 250 260 270 300 310 320 pF1KE4 LMNRQRVPK----------------VL------VENAKN---FPQCVLEISDQEVLEWYT .. :...:: :: ..: . : . : :: :.: CCDS55 FLRRSKLPKNCPPRVYQPGQITDRAVLNSYGDFIKNQADGYLFDRYKLGKVDQ---EFYK 280 290 300 310 320 330 330 340 350 360 370 380 pF1KE4 AKDFIVGKSLTILGRTFFIYDCDPFTRRYYKEKFGITDLPRIDVSKREPPP-VKQELPPY .:. .: .... :: ..:::: ::. ::: :.:: .. .. . ::: .....::: CCDS55 DSDLSLGVTINVWGRKVLLYDCDEFTKSYYKSKYGIENFTSVSCKPPSPPPKIERKFPPY 340 350 360 370 380 390 390 400 410 420 430 pF1KE4 NGFGLVEDSAQNCFALIPKAPKKDVIKMLVNDN-----KVLRYLAVLESPIPEDKDRRFV :::: ::: .::. : : ... :.. .:. ..::..: : . : :: :: CCDS55 NGFGSEEDSLRNCIDLKPTPHRRNFKKFMEKDSYGSKSNILRFFAKLVTDKCVDLDRMFV 400 410 420 430 440 450 440 450 460 470 480 490 pF1KE4 FSYFLATDMISIFEPPVRNSGIIGGKYLGRTKVVKPYSTV---DNPVYYGPSDFFIGAVI .::.:. : ::.::: ::::: :: .: :..: :: . : . : ...::... CCDS55 ISYYLGDDTISVFEPIERNSGIAGGMFLKRSRVKKPGQEVFKSELSEYIKAEELYIGVTV 460 470 480 490 500 510 500 510 520 530 540 550 pF1KE4 EVFGHRFIILDTDEYVLKYMESNAAQYSPEALASIQNHVRKREAPAPEAESKQTEKDPGV .: :. : .:..:::.:.:::.:. .: : ......:. . : CCDS55 NVNGYLFRLLNADEYTLNYMEQNTDKYPFSNLKLALQKLKQEEGKSRELKQVFKAADSKH 520 530 540 550 560 570 560 570 580 590 600 610 pF1KE4 QELEALIDTIQKQLKDHSCKDNIREAFQIYDKEASGYVDRDMFFKICESLNVPVDDSLVK CCDS55 TNMVDYNTFRDILMSLTVGNLAEQEFVTIARHYRVPEGTCSDMDFLIALAHEKFKKNMFE 580 590 600 610 620 630 640 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:04:27 2016 done: Sun Nov 6 00:04:28 2016 Total Scan time: 2.980 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]