FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1555, 153 aa 1>>>pF1KE1555 153 - 153 aa - 153 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8382+/-0.000718; mu= 14.9587+/- 0.044 mean_var=63.9706+/-13.167, 0's: 0 Z-trim(108.9): 28 B-trim: 0 in 0/49 Lambda= 0.160356 statistics sampled from 10480 (10500) to 10480 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.686), E-opt: 0.2 (0.323), width: 16 Scan time: 1.520 The best scores are: opt bits E(32554) CCDS2085.1 MALL gene_id:7851|Hs108|chr2 ( 153) 1003 239.9 4.5e-64 CCDS2006.1 MAL gene_id:4118|Hs108|chr2 ( 153) 396 99.5 8.5e-22 CCDS75780.1 MAL2 gene_id:114569|Hs108|chr8 ( 176) 293 75.7 1.4e-14 CCDS2007.1 MAL gene_id:4118|Hs108|chr2 ( 111) 248 65.2 1.3e-11 >>CCDS2085.1 MALL gene_id:7851|Hs108|chr2 (153 aa) initn: 1003 init1: 1003 opt: 1003 Z-score: 1263.7 bits: 239.9 E(32554): 4.5e-64 Smith-Waterman score: 1003; 100.0% identity (100.0% similar) in 153 aa overlap (1-153:1-153) 10 20 30 40 50 60 pF1KE1 MASPDPPATSYAPSDVPSGVALFLTIPFAFFLPELIFGFLVWTMVAATHIVYPLLQGWVM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS20 MASPDPPATSYAPSDVPSGVALFLTIPFAFFLPELIFGFLVWTMVAATHIVYPLLQGWVM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 YVSLTSFLISLMFLLSYLFGFYKRFESWRVLDSLYHGTTGILYMSAAVLQVHATIVSEKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS20 YVSLTSFLISLMFLLSYLFGFYKRFESWRVLDSLYHGTTGILYMSAAVLQVHATIVSEKL 70 80 90 100 110 120 130 140 150 pF1KE1 LDPRIYYINSAASFFAFIATLLYILHAFSIYYH ::::::::::::::::::::::::::::::::: CCDS20 LDPRIYYINSAASFFAFIATLLYILHAFSIYYH 130 140 150 >>CCDS2006.1 MAL gene_id:4118|Hs108|chr2 (153 aa) initn: 393 init1: 393 opt: 396 Z-score: 504.8 bits: 99.5 E(32554): 8.5e-22 Smith-Waterman score: 396; 42.8% identity (73.8% similar) in 145 aa overlap (7-150:3-147) 10 20 30 40 50 60 pF1KE1 MASPDPPATSYAPSDVPSGVALFLTIPFAFFLPELIFGFLVWTMVAATHIVYPLLQGWVM ::.. . : .::: ..: :.: .:. :.::: ::: .::.. . .::.::::: CCDS20 MAPAAATGGSTLPSGFSVFTTLPDLLFIFEFIFGGLVWILVASSLVPWPLVQGWVM 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 YVSLTSFLISLMFLLSYLFGFYKRFESWRVLDSLYHGTTGILYMSAAVLQVHATIVSEKL .::. :. . ... :..: . :: .::. :: :....:.::.::.. :::. . CCDS20 FVSVFCFVATTTLIILYIIGAHGGETSWVTLDAAYHCTAALFYLSASVLEALATITMQDG 60 70 80 90 100 110 130 140 150 pF1KE1 LDPRIYYINSAASFFAFIATLLYILHA-FSIYYH . : :. : :: :..::::::..:: ::. CCDS20 FTYRHYHENIAAVVFSYIATLLYVVHAVFSLIRWKSS 120 130 140 150 >>CCDS75780.1 MAL2 gene_id:114569|Hs108|chr8 (176 aa) initn: 262 init1: 197 opt: 293 Z-score: 375.2 bits: 75.7 E(32554): 1.4e-14 Smith-Waterman score: 337; 40.5% identity (66.0% similar) in 153 aa overlap (4-143:11-163) 10 20 30 40 50 pF1KE1 MASPDPPATSYAPSDV--PSGVALFLTIPFAFFLPELIFGFLVWTMVAATHIV : ::.:. : : :.: .. : :: :..:: ::: .::.... CCDS75 MSAGGASVPPPPNPAVSFPPPRVTLPAGPDILRTYSGAFVCLEILFGGLVWILVASSNVP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 YPLLQGWVMYVSLTSFLISLMFLLSYLFGFYKRFES-WRVLDSLYHGTTGILYMSAAVLQ ::::::::.::.:.:..::.:: .: :. .... : :: :: :. ..:..: .:. CCDS75 LPLLQGWVMFVSVTAFFFSLLFLGMFLSGMVAQIDANWNFLDFAYHFTVFVFYFGAFLLE 70 80 90 100 110 120 120 130 140 150 pF1KE1 VHAT----------IVSEKLLDPRIYYINSAASFFAFIATLLYILHAFSIYYH . :: :... ::. : :: :::.:::..: : CCDS75 AAATSLHDLHCNTTITGQPLLSDNQYNINVAASIFAFMTTACYGCSLGLALRRWRP 130 140 150 160 170 >>CCDS2007.1 MAL gene_id:4118|Hs108|chr2 (111 aa) initn: 291 init1: 239 opt: 248 Z-score: 321.7 bits: 65.2 E(32554): 1.3e-11 Smith-Waterman score: 248; 38.1% identity (69.5% similar) in 105 aa overlap (7-111:3-105) 10 20 30 40 50 60 pF1KE1 MASPDPPATSYAPSDVPSGVALFLTIPFAFFLPELIFGFLVWTMVAATHIVYPLLQGWVM ::.. . : .::: ..: :.: .:. :.::: ::: .::.. . .::.::::: CCDS20 MAPAAATGGSTLPSGFSVFTTLPDLLFIFEFIFGGLVWILVASSLVPWPLVQGWVM 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 YVSLTSFLISLMFLLSYLFGFYKRFESWRVLDSLYHGTTGILYMSAAVLQVHATIVSEKL .::. :. . ... :..: . :: .: : .: .::. ::... CCDS20 FVSVFCFVATTTLIILYIIGAHGGETSWVTLVFSYIAT--LLYVVHAVFSLIRWKSS 60 70 80 90 100 110 130 140 150 pF1KE1 LDPRIYYINSAASFFAFIATLLYILHAFSIYYH 153 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:32:13 2016 done: Mon Nov 7 02:32:13 2016 Total Scan time: 1.520 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]