FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5532, 393 aa
1>>>pF1KE5532 393 - 393 aa - 393 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.0405+/-0.000877; mu= 17.9526+/- 0.053
mean_var=60.4257+/-12.239, 0's: 0 Z-trim(105.2): 19 B-trim: 0 in 0/49
Lambda= 0.164992
statistics sampled from 8290 (8302) to 8290 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.636), E-opt: 0.2 (0.255), width: 16
Scan time: 2.430
The best scores are: opt bits E(32554)
CCDS7722.1 CHID1 gene_id:66005|Hs108|chr11 ( 393) 2645 638.2 3.8e-183
CCDS44511.1 CHID1 gene_id:66005|Hs108|chr11 ( 418) 2425 585.8 2.3e-167
CCDS44510.1 CHID1 gene_id:66005|Hs108|chr11 ( 362) 1368 334.2 1.1e-91
>>CCDS7722.1 CHID1 gene_id:66005|Hs108|chr11 (393 aa)
initn: 2645 init1: 2645 opt: 2645 Z-score: 3401.5 bits: 638.2 E(32554): 3.8e-183
Smith-Waterman score: 2645; 100.0% identity (100.0% similar) in 393 aa overlap (1-393:1-393)
10 20 30 40 50 60
pF1KE5 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPVQDRGLVVTDLKAESV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPVQDRGLVVTDLKAESV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 VLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 VLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREM
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 FEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTVVQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 FEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTVVQV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 AKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEALHQARLLALLVIPPAITPGTDQLGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 AKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEALHQARLLALLVIPPAITPGTDQLGM
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 FTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKSKWRSKILLGLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 FTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKSKWRSKILLGLN
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE5 FYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHVVFYPTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS77 FYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHVVFYPTL
310 320 330 340 350 360
370 380 390
pF1KE5 KSLQVRLELARELGVGVSIWELGQGLDYFYDLL
:::::::::::::::::::::::::::::::::
CCDS77 KSLQVRLELARELGVGVSIWELGQGLDYFYDLL
370 380 390
>>CCDS44511.1 CHID1 gene_id:66005|Hs108|chr11 (418 aa)
initn: 2416 init1: 2416 opt: 2425 Z-score: 3118.0 bits: 585.8 E(32554): 2.3e-167
Smith-Waterman score: 2585; 94.0% identity (94.0% similar) in 418 aa overlap (1-393:1-418)
10 20 30
pF1KE5 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEK-----------------------
:::::::::::::::::::::::::::::::::::::
CCDS44 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKVKFCSCCPGWSAMARSWLTATSA
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE5 --SQFSDKPVQDRGLVVTDLKAESVVLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TQSQFSDKPVQDRGLVVTDLKAESVVLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDV
70 80 90 100 110 120
100 110 120 130 140 150
pF1KE5 TKVFGSKFTQISPVWLQLKRRGREMFEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TKVFGSKFTQISPVWLQLKRRGREMFEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDW
130 140 150 160 170 180
160 170 180 190 200 210
pF1KE5 TYDDFRNVLDSEDEIEELSKTVVQVAKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TYDDFRNVLDSEDEIEELSKTVVQVAKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEA
190 200 210 220 230 240
220 230 240 250 260 270
pF1KE5 LHQARLLALLVIPPAITPGTDQLGMFTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 LHQARLLALLVIPPAITPGTDQLGMFTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPL
250 260 270 280 290 300
280 290 300 310 320 330
pF1KE5 SWVRACVQVLDPKSKWRSKILLGLNFYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SWVRACVQVLDPKSKWRSKILLGLNFYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWD
310 320 330 340 350 360
340 350 360 370 380 390
pF1KE5 SQASEHFFEYKKSRSGRHVVFYPTLKSLQVRLELARELGVGVSIWELGQGLDYFYDLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SQASEHFFEYKKSRSGRHVVFYPTLKSLQVRLELARELGVGVSIWELGQGLDYFYDLL
370 380 390 400 410
>>CCDS44510.1 CHID1 gene_id:66005|Hs108|chr11 (362 aa)
initn: 2433 init1: 1360 opt: 1368 Z-score: 1759.2 bits: 334.2 E(32554): 1.1e-91
Smith-Waterman score: 2375; 92.1% identity (92.1% similar) in 393 aa overlap (1-393:1-362)
10 20 30 40 50 60
pF1KE5 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPVQDRGLVVTDLKAESV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MRTLFNLLWLALACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPVQDRGLVVTDLKAESV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 VLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREM
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 FEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTVVQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 FEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTVVQV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 AKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEALHQARLLALLVIPPAITPGTDQLGM
::::::::::::::::::::::: ::::::
CCDS44 AKNQHFDGFVVEVWNQLLSQKRV-------------------------------TDQLGM
190 200
250 260 270 280 290 300
pF1KE5 FTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKSKWRSKILLGLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 FTHKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQVLDPKSKWRSKILLGLN
210 220 230 240 250 260
310 320 330 340 350 360
pF1KE5 FYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHVVFYPTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 FYGMDYATSKDAREPVVGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHVVFYPTL
270 280 290 300 310 320
370 380 390
pF1KE5 KSLQVRLELARELGVGVSIWELGQGLDYFYDLL
:::::::::::::::::::::::::::::::::
CCDS44 KSLQVRLELARELGVGVSIWELGQGLDYFYDLL
330 340 350 360
393 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 01:33:04 2016 done: Tue Nov 8 01:33:04 2016
Total Scan time: 2.430 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]