FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3690, 1008 aa 1>>>pF1KE3690 1008 - 1008 aa - 1008 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.6534+/-0.00128; mu= 15.7092+/- 0.076 mean_var=156.6643+/-33.832, 0's: 0 Z-trim(104.5): 146 B-trim: 0 in 0/52 Lambda= 0.102468 statistics sampled from 7783 (7931) to 7783 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.604), E-opt: 0.2 (0.244), width: 16 Scan time: 3.130 The best scores are: opt bits E(32554) CCDS82856.1 HLTF gene_id:6596|Hs108|chr3 (1008) 6698 1003.9 0 CCDS33875.1 HLTF gene_id:6596|Hs108|chr3 (1009) 6686 1002.1 0 CCDS892.1 TTF2 gene_id:8458|Hs108|chr1 (1162) 430 77.3 2e-13 >>CCDS82856.1 HLTF gene_id:6596|Hs108|chr3 (1008 aa) initn: 6698 init1: 6698 opt: 6698 Z-score: 5363.0 bits: 1003.9 E(32554): 0 Smith-Waterman score: 6698; 99.9% identity (99.9% similar) in 1008 aa overlap (1-1008:1-1008) 10 20 30 40 50 60 pF1KE3 MSWMFKRDPVWKYLQTVQYGVHGNFPRLSYPTFFPRFEFQDVIPPDDFLTSDEEVDSVLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 MSWMFKRDPVWKYLQTVQYGVHGNFPRLSYPTFFPRFEFQDVIPPDDFLTSDEEVDSVLF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 GSLRGHVVGLRYYTGVVNNNEMVALQRDPNNPYDKNAIKVNNVNGNQVGHLKKELAGALA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 GSLRGHVVGLRYYTGVVNNNEMVALQRDPNNPYDKNAIKVNNVNGNQVGHLKKELAGALA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 YIMDNKLAQIEGVVPFGANNAFTMPLHMTFWGKEENRKAVSDQLKKHGFKLGPAPKTLGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 YIMDNKLAQIEGVVPFGANNAFTMPLHMTFWGKEENRKAVSDQLKKHGFKLGPAPKTLGF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 NLESGWGSGRAGPSYSMPVHAAVQMTTEQLKTEFDKLFEDLKEDDKTLEMEPAEAIETPL ::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::: CCDS82 NLESGWGSGRAGPSYSMPVHAAVQMTTEQLKTEFDKLFEDLKEDDKTHEMEPAEAIETPL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 LPHQKQALAWMVSRENSKELPPFWEQRNDLYYNTITNFSEKDRPENVHGGILADDMGLGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 LPHQKQALAWMVSRENSKELPPFWEQRNDLYYNTITNFSEKDRPENVHGGILADDMGLGK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 TLTAIAVILTNFHDGRPLPIERVKKNLLKKEYNVNDDSMKLGGNNTSEKADGLSKDASRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 TLTAIAVILTNFHDGRPLPIERVKKNLLKKEYNVNDDSMKLGGNNTSEKADGLSKDASRC 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 SEQPSISDIKEKSKFRMSELSSSRPKRRKTAVQYIESSDSEEIETSELPQKMKGKLKNVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 SEQPSISDIKEKSKFRMSELSSSRPKRRKTAVQYIESSDSEEIETSELPQKMKGKLKNVQ 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE3 SETKGRAKGSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTDVEERPRTTLII :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 SETKGRAKGSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTDVEERPRTTLII 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE3 CPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTYNILTHDYGTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 CPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTYNILTHDYGTK 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE3 GDSPLHSIRWLRVILDEGHAIRNPNAQQTKAVLDLESERRWVLTGTPIQNSLKDLWSLLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 GDSPLHSIRWLRVILDEGHAIRNPNAQQTKAVLDLESERRWVLTGTPIQNSLKDLWSLLS 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE3 FLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSLIKNITLRRTKTSKIKGKPVLELPER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 FLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSLIKNITLRRTKTSKIKGKPVLELPER 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE3 KVFIQHITLSDEERKIYQSVKNEGRATIGRYFNEGTVLAHYADVLGLLLRLRQICCHTYL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 KVFIQHITLSDEERKIYQSVKNEGRATIGRYFNEGTVLAHYADVLGLLLRLRQICCHTYL 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE3 LTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFCK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 LTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFCK 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE3 PCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMHA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 PCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMHA 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE3 LTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 LTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNTE 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE3 AGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 AGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKFI 910 920 930 940 950 960 970 980 990 1000 pF1KE3 VKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL :::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 VKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL 970 980 990 1000 >>CCDS33875.1 HLTF gene_id:6596|Hs108|chr3 (1009 aa) initn: 3846 init1: 3846 opt: 6686 Z-score: 5353.4 bits: 1002.1 E(32554): 0 Smith-Waterman score: 6686; 99.8% identity (99.8% similar) in 1009 aa overlap (1-1008:1-1009) 10 20 30 40 50 60 pF1KE3 MSWMFKRDPVWKYLQTVQYGVHGNFPRLSYPTFFPRFEFQDVIPPDDFLTSDEEVDSVLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MSWMFKRDPVWKYLQTVQYGVHGNFPRLSYPTFFPRFEFQDVIPPDDFLTSDEEVDSVLF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 GSLRGHVVGLRYYTGVVNNNEMVALQRDPNNPYDKNAIKVNNVNGNQVGHLKKELAGALA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GSLRGHVVGLRYYTGVVNNNEMVALQRDPNNPYDKNAIKVNNVNGNQVGHLKKELAGALA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 YIMDNKLAQIEGVVPFGANNAFTMPLHMTFWGKEENRKAVSDQLKKHGFKLGPAPKTLGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 YIMDNKLAQIEGVVPFGANNAFTMPLHMTFWGKEENRKAVSDQLKKHGFKLGPAPKTLGF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 NLESGWGSGRAGPSYSMPVHAAVQMTTEQLKTEFDKLFEDLKEDDKTLEMEPAEAIETPL ::::::::::::::::::::::::::::::::::::::::::::::: :::::::::::: CCDS33 NLESGWGSGRAGPSYSMPVHAAVQMTTEQLKTEFDKLFEDLKEDDKTHEMEPAEAIETPL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 LPHQKQALAWMVSRENSKELPPFWEQRNDLYYNTITNFSEKDRPENVHGGILADDMGLGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LPHQKQALAWMVSRENSKELPPFWEQRNDLYYNTITNFSEKDRPENVHGGILADDMGLGK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 TLTAIAVILTNFHDGRPLPIERVKKNLLKKEYNVNDDSMKLGGNNTSEKADGLSKDASRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 TLTAIAVILTNFHDGRPLPIERVKKNLLKKEYNVNDDSMKLGGNNTSEKADGLSKDASRC 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 SEQPSISDIKEKSKFRMSELSSSRPKRRKTAVQYIESSDSEEIETSELPQKMKGKLKNVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SEQPSISDIKEKSKFRMSELSSSRPKRRKTAVQYIESSDSEEIETSELPQKMKGKLKNVQ 370 380 390 400 410 420 430 440 450 460 470 pF1KE3 SETKGRAK-GSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTDVEERPRTTLI :::::::: ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SETKGRAKAGSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTDVEERPRTTLI 430 440 450 460 470 480 480 490 500 510 520 530 pF1KE3 ICPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTYNILTHDYGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 ICPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTYNILTHDYGT 490 500 510 520 530 540 540 550 560 570 580 590 pF1KE3 KGDSPLHSIRWLRVILDEGHAIRNPNAQQTKAVLDLESERRWVLTGTPIQNSLKDLWSLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 KGDSPLHSIRWLRVILDEGHAIRNPNAQQTKAVLDLESERRWVLTGTPIQNSLKDLWSLL 550 560 570 580 590 600 600 610 620 630 640 650 pF1KE3 SFLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSLIKNITLRRTKTSKIKGKPVLELPE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SFLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSLIKNITLRRTKTSKIKGKPVLELPE 610 620 630 640 650 660 660 670 680 690 700 710 pF1KE3 RKVFIQHITLSDEERKIYQSVKNEGRATIGRYFNEGTVLAHYADVLGLLLRLRQICCHTY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RKVFIQHITLSDEERKIYQSVKNEGRATIGRYFNEGTVLAHYADVLGLLLRLRQICCHTY 670 680 690 700 710 720 720 730 740 750 760 770 pF1KE3 LLTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LLTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFC 730 740 750 760 770 780 780 790 800 810 820 830 pF1KE3 KPCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 KPCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMH 790 800 810 820 830 840 840 850 860 870 880 890 pF1KE3 ALTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 ALTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNT 850 860 870 880 890 900 900 910 920 930 940 950 pF1KE3 EAGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 EAGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKF 910 920 930 940 950 960 960 970 980 990 1000 pF1KE3 IVKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 IVKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL 970 980 990 1000 >>CCDS892.1 TTF2 gene_id:8458|Hs108|chr1 (1162 aa) initn: 1064 init1: 291 opt: 430 Z-score: 354.5 bits: 77.3 E(32554): 2e-13 Smith-Waterman score: 852; 31.7% identity (60.3% similar) in 590 aa overlap (442-980:608-1137) 420 430 440 450 460 470 pF1KE3 MKGKLKNVQSETKGRAKGSSKVIEDVAFACALTSSVPTTKKKMLKKGACAVEGSKKTD-V :: . . .:: :. . :. .: : CCDS89 LAWLLWRESQKPQGGILADDMGLGKTLTMIALILTQKNQEKKEEKEKSTALTWLSKDDSC 580 590 600 610 620 630 480 490 500 510 520 530 pF1KE3 EERPRTTLIICPLSVLSNWIDQFGQHIKSDVHLNFYVYYGPDRIREPALLSKQDIVLTTY . . :::::: :.. .: .. ....:. .: :.:.::.: . .:: :::.::: CCDS89 DFTSHGTLIICPASLIHHWKNEVEKRVNSN-KLRVYLYHGPNRDSRARVLSTYDIVITTY 640 650 660 670 680 690 540 550 560 570 pF1KE3 NILTHDYGT--------------KGDS-PLHSIRWLRVILDEGHAIRNPNAQQTKAVLDL ...... : .: : :: : : :.::::.: ..:: .: . :: : CCDS89 SLVAKEIPTNKQEAEIPGANLNVEGTSTPLLRIAWARIILDEAHNVKNPRVQTSIAVCKL 700 710 720 730 740 750 580 590 600 610 620 630 pF1KE3 ESERRWVLTGTPIQNSLKDLWSLLSFLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSL .. ::..:::::::.: :..:::.::. .:: . . :. .. :.. : .::. : CCDS89 QACARWAVTGTPIQNNLLDMYSLLKFLRCSPFDEFNLWRSQVDN----GSKKGGERLSIL 760 770 780 790 800 810 640 650 660 670 680 pF1KE3 IKNITLRRTKTS-KIKGKPVLELPERKVFIQHITLSDEERKIY-----------QSV--K :.. ::::: . :.:.. ::.:: ..:. ::..:. .: :: . CCDS89 TKSLLLRRTKDQLDSTGRPLVILPQRKFQLHHLKLSEDEETVYNVFFARSRSALQSYLKR 820 830 840 850 860 870 690 700 710 720 pF1KE3 NEGRATI-GRYFNE---------GTVLAHYAD-----------VLGLLLRLRQICCHTYL .:.:.. :: :. :. .... .:. :::::: ::: : CCDS89 HESRGNQSGRSPNNPFSRVALEFGSEEPRHSEAADSPRSSTVHILSQLLRLRQCCCHLSL 880 890 900 910 920 930 730 740 750 760 770 780 pF1KE3 LTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITHCAHVFCK : .:.. : ::. . :.:: .:. :..::. CCDS89 LKSALD---------PMELKGE-----GLVLSL--EEQ----LSALTL------------ 940 950 960 790 800 810 820 830 840 pF1KE3 PCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSKINALMHA . ... .: . : . . . .:.: : :.::..:. CCDS89 ----SELRDSEPSSTVSL-NGTFFKMELFEGMRE----------------STKISSLLAE 970 980 990 850 860 870 880 890 900 pF1KE3 LTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESIQCFQNTE : ..... . ::..:::.:..:... . :: :.... .:::. :.:.. .. :.... CCDS89 LEAIQRNSASQKSVIVSQWTNMLKVVALHLKKHGLTYATIDGSVNPKQRMDLVEAFNHSR 1000 1010 1020 1030 1040 1050 910 920 930 940 950 960 pF1KE3 AGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQEVIITKFI . : .::.:: ::::::::......::.: :::. ::: :: .:.::...:.: .:. CCDS89 G--PQVMLISLLAGGVGLNLTGGNHLFLLDMHWNPSLEDQACDRIYRVGQQKDVVIHRFV 1060 1070 1080 1090 1100 1110 970 980 990 1000 pF1KE3 VKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL . .:::..:..:.::..:: CCDS89 CEGTVEEKILQLQEKKKDLAKQVLSGSGESVTKLTLADLRVLFGI 1120 1130 1140 1150 1160 1008 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:56:25 2016 done: Sun Nov 6 01:56:26 2016 Total Scan time: 3.130 Total Display time: 0.080 Function used was FASTA [36.3.4 Apr, 2011]