FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5532, 393 aa 1>>>pF1KE5532 393 - 393 aa - 393 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0405+/-0.000877; mu= 17.9526+/- 0.053 mean_var=60.4257+/-12.239, 0's: 0 Z-trim(105.2): 19 B-trim: 0 in 0/49 Lambda= 0.164992 statistics sampled from 8290 (8302) to 8290 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.636), E-opt: 0.2 (0.255), width: 16 Scan time: 2.430 The best scores are: opt bits E(32554) CCDS7722.1 CHID1 gene_id:66005|Hs108|chr11 ( 393) 2645 638.2 3.8e-183 CCDS44511.1 CHID1 gene_id:66005|Hs108|chr11 ( 418) 2425 585.8 2.3e-167 CCDS44510.1 CHID1 gene_id:66005|Hs108|chr11 ( 362) 1368 334.2 1.1e-91 >>CCDS7722.1 CHID1 gene_id:66005|Hs108|chr11 (393 aa) initn: 2645 init1: 2645 opt: 2645 Z-score: 3401.5 bits: 638.2 E(32554): 3.8e-183 Smith-Waterman score: 2645; 100.0% identity (100.0% similar) in 393 aa overlap (1-393:1-393) 10 20 30 40 50 60 pF1KE5 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPVQDRGLVVTDLKAESV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPVQDRGLVVTDLKAESV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 VLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 FEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTVVQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 FEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTVVQV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEALHQARLLALLVIPPAITPGTDQLGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 AKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEALHQARLLALLVIPPAITPGTDQLGM 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 FTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKSKWRSKILLGLN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 FTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKSKWRSKILLGLN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 FYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHVVFYPTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 FYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHVVFYPTL 310 320 330 340 350 360 370 380 390 pF1KE5 KSLQVRLELARELGVGVSIWELGQGLDYFYDLL ::::::::::::::::::::::::::::::::: CCDS77 KSLQVRLELARELGVGVSIWELGQGLDYFYDLL 370 380 390 >>CCDS44511.1 CHID1 gene_id:66005|Hs108|chr11 (418 aa) initn: 2416 init1: 2416 opt: 2425 Z-score: 3118.0 bits: 585.8 E(32554): 2.3e-167 Smith-Waterman score: 2585; 94.0% identity (94.0% similar) in 418 aa overlap (1-393:1-418) 10 20 30 pF1KE5 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEK----------------------- ::::::::::::::::::::::::::::::::::::: CCDS44 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKVKFCSCCPGWSAMARSWLTATSA 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE5 --SQFSDKPVQDRGLVVTDLKAESVVLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 TQSQFSDKPVQDRGLVVTDLKAESVVLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDV 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 TKVFGSKFTQISPVWLQLKRRGREMFEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 TKVFGSKFTQISPVWLQLKRRGREMFEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDW 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE5 TYDDFRNVLDSEDEIEELSKTVVQVAKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 TYDDFRNVLDSEDEIEELSKTVVQVAKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEA 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE5 LHQARLLALLVIPPAITPGTDQLGMFTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 LHQARLLALLVIPPAITPGTDQLGMFTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPL 250 260 270 280 290 300 280 290 300 310 320 330 pF1KE5 SWVRACVQVLDPKSKWRSKILLGLNFYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 SWVRACVQVLDPKSKWRSKILLGLNFYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWD 310 320 330 340 350 360 340 350 360 370 380 390 pF1KE5 SQASEHFFEYKKSRSGRHVVFYPTLKSLQVRLELARELGVGVSIWELGQGLDYFYDLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 SQASEHFFEYKKSRSGRHVVFYPTLKSLQVRLELARELGVGVSIWELGQGLDYFYDLL 370 380 390 400 410 >>CCDS44510.1 CHID1 gene_id:66005|Hs108|chr11 (362 aa) initn: 2433 init1: 1360 opt: 1368 Z-score: 1759.2 bits: 334.2 E(32554): 1.1e-91 Smith-Waterman score: 2375; 92.1% identity (92.1% similar) in 393 aa overlap (1-393:1-362) 10 20 30 40 50 60 pF1KE5 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPVQDRGLVVTDLKAESV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPVQDRGLVVTDLKAESV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 VLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 VLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREM 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 FEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTVVQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 FEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTVVQV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEALHQARLLALLVIPPAITPGTDQLGM ::::::::::::::::::::::: :::::: CCDS44 AKNQHFDGFVVEVWNQLLSQKRV-------------------------------TDQLGM 190 200 250 260 270 280 290 300 pF1KE5 FTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKSKWRSKILLGLN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 FTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKSKWRSKILLGLN 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE5 FYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHVVFYPTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 FYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHVVFYPTL 270 280 290 300 310 320 370 380 390 pF1KE5 KSLQVRLELARELGVGVSIWELGQGLDYFYDLL ::::::::::::::::::::::::::::::::: CCDS44 KSLQVRLELARELGVGVSIWELGQGLDYFYDLL 330 340 350 360 393 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 01:33:04 2016 done: Tue Nov 8 01:33:04 2016 Total Scan time: 2.430 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]