FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6438, 466 aa
  1>>>pF1KE6438 466 - 466 aa - 466 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.3923+/-0.00105; mu= 13.9776+/- 0.063
 mean_var=106.4318+/-20.396, 0's: 0 Z-trim(107.1): 10  B-trim: 14 in 1/50
 Lambda= 0.124319
 statistics sampled from 9361 (9365) to 9361 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.657), E-opt: 0.2 (0.288), width: 16
 Scan time: 2.400

The best scores are:                                    opt  bits E(32554)
CCDS42405.1 CLUL1 gene_id:27098|Hs108|chr18     ( 466) 3152 576.1 2.6e-164
CCDS74187.1 CLUL1 gene_id:27098|Hs108|chr18     ( 518) 3152 576.2 2.8e-164
CCDS47832.1 CLU gene_id:1191|Hs108|chr8         ( 449)  332  70.3 4.6e-12

>>CCDS42405.1 CLUL1 gene_id:27098|Hs108|chr18           (466 aa)
 initn: 3152 init1: 3152 opt: 3152  Z-score: 3063.4  bits: 576.1 E(32554): 2.6e-164
Smith-Waterman score: 3152; 100.0% identity (100.0% similar) in 466 aa overlap (1-466:1-466)

               10        20        30        40        50        60
pF1KE6 MKPPLLVFIVCLLWLKDSHCAPTWKDKTAISENLKSFSEVGEIDADEEVKKALTGIKQMK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MKPPLLVFIVCLLWLKDSHCAPTWKDKTAISENLKSFSEVGEIDADEEVKKALTGIKQMK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 IMMERKEKEHTNLMSTLKKCREEKQEALKLLNEVQEHLEEEERLCRESLADSWGECRSCL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 IMMERKEKEHTNLMSTLKKCREEKQEALKLLNEVQEHLEEEERLCRESLADSWGECRSCL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 ENNCMRIYTTCQPSWSSVKNKIERFFRKIYQFLFPFHEDNEKDLPISEKLIEEDAQLTQM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ENNCMRIYTTCQPSWSSVKNKIERFFRKIYQFLFPFHEDNEKDLPISEKLIEEDAQLTQM
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 EDVFSQLTVDVNSLFNRSFNVFRQMQQEFDQTFQSHFISDTDLTEPYFFPAFSKEPMTKA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EDVFSQLTVDVNSLFNRSFNVFRQMQQEFDQTFQSHFISDTDLTEPYFFPAFSKEPMTKA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 DLEQCWDIPNFFQLFCNFSVSIYESVSETITKMLKAIEDLPKQDKAPDHGGLISKMLPGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DLEQCWDIPNFFQLFCNFSVSIYESVSETITKMLKAIEDLPKQDKAPDHGGLISKMLPGQ
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE6 DRGLCGELDQNLSRCFKFHEKCQKCQAHLSEDCPDVPALHTELDEAIRLVNVSNQQYGQI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DRGLCGELDQNLSRCFKFHEKCQKCQAHLSEDCPDVPALHTELDEAIRLVNVSNQQYGQI
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE6 LQMTRKHLEDTAYLVEKMRGQFGWVSELANQAPETEIIFNSIQVVPRIHEGNISKQDETM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LQMTRKHLEDTAYLVEKMRGQFGWVSELANQAPETEIIFNSIQVVPRIHEGNISKQDETM
              370       380       390       400       410       420

              430       440       450       460
pF1KE6 MTDLSILPSSNFTLKIPLEESAESSNFIGYVVAKALQHFKEHFKTW
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MTDLSILPSSNFTLKIPLEESAESSNFIGYVVAKALQHFKEHFKTW
              430       440       450       460

>>CCDS74187.1 CLUL1 gene_id:27098|Hs108|chr18           (518 aa)
 initn: 3152 init1: 3152 opt: 3152  Z-score: 3062.7  bits: 576.2 E(32554): 2.8e-164
Smith-Waterman score: 3152; 100.0% identity (100.0% similar) in 466 aa overlap (1-466:53-518)

                                             10        20        30
pF1KE6                               MKPPLLVFIVCLLWLKDSHCAPTWKDKTAI
                                     ::::::::::::::::::::::::::::::
CCDS74 LSSLQPLPPRFKRFSCLSLSSGWDYSNSGNMKPPLLVFIVCLLWLKDSHCAPTWKDKTAI
             30        40        50        60        70        80

               40        50        60        70        80        90
pF1KE6 SENLKSFSEVGEIDADEEVKKALTGIKQMKIMMERKEKEHTNLMSTLKKCREEKQEALKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 SENLKSFSEVGEIDADEEVKKALTGIKQMKIMMERKEKEHTNLMSTLKKCREEKQEALKL
             90       100       110       120       130       140

              100       110       120       130       140       150
pF1KE6 LNEVQEHLEEEERLCRESLADSWGECRSCLENNCMRIYTTCQPSWSSVKNKIERFFRKIY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 LNEVQEHLEEEERLCRESLADSWGECRSCLENNCMRIYTTCQPSWSSVKNKIERFFRKIY
            150       160       170       180       190       200

              160       170       180       190       200       210
pF1KE6 QFLFPFHEDNEKDLPISEKLIEEDAQLTQMEDVFSQLTVDVNSLFNRSFNVFRQMQQEFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 QFLFPFHEDNEKDLPISEKLIEEDAQLTQMEDVFSQLTVDVNSLFNRSFNVFRQMQQEFD
            210       220       230       240       250       260

              220       230       240       250       260       270
pF1KE6 QTFQSHFISDTDLTEPYFFPAFSKEPMTKADLEQCWDIPNFFQLFCNFSVSIYESVSETI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 QTFQSHFISDTDLTEPYFFPAFSKEPMTKADLEQCWDIPNFFQLFCNFSVSIYESVSETI
            270       280       290       300       310       320

              280       290       300       310       320       330
pF1KE6 TKMLKAIEDLPKQDKAPDHGGLISKMLPGQDRGLCGELDQNLSRCFKFHEKCQKCQAHLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 TKMLKAIEDLPKQDKAPDHGGLISKMLPGQDRGLCGELDQNLSRCFKFHEKCQKCQAHLS
            330       340       350       360       370       380

              340       350       360       370       380       390
pF1KE6 EDCPDVPALHTELDEAIRLVNVSNQQYGQILQMTRKHLEDTAYLVEKMRGQFGWVSELAN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 EDCPDVPALHTELDEAIRLVNVSNQQYGQILQMTRKHLEDTAYLVEKMRGQFGWVSELAN
            390       400       410       420       430       440

              400       410       420       430       440       450
pF1KE6 QAPETEIIFNSIQVVPRIHEGNISKQDETMMTDLSILPSSNFTLKIPLEESAESSNFIGY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 QAPETEIIFNSIQVVPRIHEGNISKQDETMMTDLSILPSSNFTLKIPLEESAESSNFIGY
            450       460       470       480       490       500

              460
pF1KE6 VVAKALQHFKEHFKTW
       ::::::::::::::::
CCDS74 VVAKALQHFKEHFKTW
            510

>>CCDS47832.1 CLU gene_id:1191|Hs108|chr8               (449 aa)
 initn: 274 init1: 120 opt: 332  Z-score: 330.1  bits: 70.3 E(32554): 4.6e-12
Smith-Waterman score: 566; 25.8% identity (60.3% similar) in 476 aa overlap (1-464:1-447)

               10         20        30        40        50
pF1KE6 MKPPLLVFIVCLL-WLKDSHCAPTWKDKTAISENLKSFSEVGEIDADEEVKKALTGIKQM
       :   ::.:.  :: :    . . .  :.:. ...:. .:. :   ...:...:..:.::.
CCDS47 MMKTLLLFVGLLLTW----ESGQVLGDQTVSDNELQEMSNQGSKYVNKEIQNAVNGVKQI
               10            20        30        40        50

      60        70        80        90       100       110
pF1KE6 KIMMERKEKEHTNLMSTLKKCREEKQEALKLLNEVQEHLEEEERLCRESLADSWGECRSC
       : ..:. ..:. .:.:.:.. ...:..::.   : . .:.:   .: :..   : ::. :
CCDS47 KTLIEKTNEERKTLLSNLEEAKKKKEDALNETRESETKLKELPGVCNETMMALWEECKPC
         60        70        80        90       100       110

     120        130       140       150       160       170
pF1KE6 LENNCMRIYT-TCQPSWSSVKNKIERFFRKIYQFLFPFHEDNEKDLPISEKLIEEDAQLT
       :...::..:. .:. . . :  ..:.:. .   : : .. :        ..:.:.: : :
CCDS47 LKQTCMKFYARVCRSGSGLVGRQLEEFLNQSSPFYFWMNGDR------IDSLLENDRQQT
        120       130       140       150             160       170

      180           190       200       210        220       230
pF1KE6 QMEDV----FSQLTVDVNSLFNRSFNVFRQMQQEFDQTFQSHFIS-DTDLTEPYFFPAFS
       .: ::    ::. .  .. ::.  :  : .  :.   :.  :..  .    .:.::  :
CCDS47 HMLDVMQDHFSRASSIIDELFQDRF--FTREPQD---TY--HYLPFSLPHRRPHFF--FP
              180       190         200            210         220

           240        250       260       270       280       290
pF1KE6 KEPMTKADLEQCWDIP-NFFQLFCNFSVSIYESVSETITKMLKAIEDLPKQDKAPDHGGL
       :  .... .      : ::  .:  :   :.:.         .   :.  .. : .:
CCDS47 KSRIVRSLMPFSPYEPLNFHAMFQPFLEMIHEA---------QQAMDIHFHSPAFQHPPT
             230       240       250                260       270

            300       310       320       330           340
pF1KE6 ISKMLPGQDRGLCGELDQNLSRCFKFHEKCQKCQAHLSEDC----PDVPALHTELDEAIR
              .:: .: :. .: . :......:.::.  :: ::    :.   :. ::::...
CCDS47 EFIREGDDDRTVCREIRHNSTGCLRMKDQCDKCREILSVDCSTNNPSQAKLRRELDESLQ
            280       290       300       310       320       330

      350       360       370       380       390       400
pF1KE6 LVNVSNQQYGQILQMTRKHLEDTAYLVEKMRGQFGWVSELANQAPETEIIFNSIQVVPRI
       ...  ...:...:.  . .. .:. :.:..  ::.:::.::: .   .  .  . .:
CCDS47 VAERLTRKYNELLKSYQWKMLNTSSLLEQLNEQFNWVSRLANLTQGEDQYYLRVTTVAS-
            340       350       360       370       380       390

      410       420       430       440       450       460
pF1KE6 HEGNISKQDETMMTDLSILPSSNFTLKIPLEESAESSNFIGYVVAKALQHFKEHFKTW
       : .. .  . .  . .... :. .:. .:.: : .. .:.  :. ::::..... .
CCDS47 HTSDSDVPSGVTEVVVKLFDSDPITVTVPVEVSRKNPKFMETVAEKALQEYRKKHREE
             400       410       420       430       440

466 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 13:17:51 2016 done: Tue Nov 8 13:17:51 2016
 Total Scan time: 2.400 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]