FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5320, 291 aa 1>>>pF1KE5320 291 - 291 aa - 291 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3037+/-0.000866; mu= 14.4805+/- 0.052 mean_var=56.4539+/-11.475, 0's: 0 Z-trim(105.0): 17 B-trim: 33 in 1/48 Lambda= 0.170698 statistics sampled from 8174 (8178) to 8174 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.627), E-opt: 0.2 (0.251), width: 16 Scan time: 1.840 The best scores are: opt bits E(32554) CCDS6768.1 TMEM38B gene_id:55151|Hs108|chr9 ( 291) 1975 494.6 3.6e-140 CCDS12349.1 TMEM38A gene_id:79041|Hs108|chr19 ( 299) 836 214.1 1e-55 >>CCDS6768.1 TMEM38B gene_id:55151|Hs108|chr9 (291 aa) initn: 1975 init1: 1975 opt: 1975 Z-score: 2630.0 bits: 494.6 E(32554): 3.6e-140 Smith-Waterman score: 1975; 100.0% identity (100.0% similar) in 291 aa overlap (1-291:1-291) 10 20 30 40 50 60 pF1KE5 MDSPWDELALAFSRTSMFPFFDIAHYLVSVMAVKRQPGAAALAWKNPISSWFTAMLHCFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 MDSPWDELALAFSRTSMFPFFDIAHYLVSVMAVKRQPGAAALAWKNPISSWFTAMLHCFG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 GGILSCLLLAEPPLKFLANHTNILLASSIWYITFFCPHDLVSQGYSYLPVQLLASGMKEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 GGILSCLLLAEPPLKFLANHTNILLASSIWYITFFCPHDLVSQGYSYLPVQLLASGMKEV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 TRTWKIVGGVTHANSYYKNGWIVMIAIGWARGAGGTIITNFERLVKGDWKPEGDEWLKMS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 TRTWKIVGGVTHANSYYKNGWIVMIAIGWARGAGGTIITNFERLVKGDWKPEGDEWLKMS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YPAKVTLLGSVIFTFQHTQHLAISKHNLMFLYTIFIVATKITMMTTQTSTMTFAPFEDTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 YPAKVTLLGSVIFTFQHTQHLAISKHNLMFLYTIFIVATKITMMTTQTSTMTFAPFEDTL 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 SWMLFGWQQPFSSCEKKSEAKSPSNGVGSLASKPVDVASDNVKKKHTKKNE ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 SWMLFGWQQPFSSCEKKSEAKSPSNGVGSLASKPVDVASDNVKKKHTKKNE 250 260 270 280 290 >>CCDS12349.1 TMEM38A gene_id:79041|Hs108|chr19 (299 aa) initn: 829 init1: 829 opt: 836 Z-score: 1113.9 bits: 214.1 E(32554): 1e-55 Smith-Waterman score: 836; 41.2% identity (76.2% similar) in 294 aa overlap (7-291:11-299) 10 20 30 40 50 pF1KE5 MDSPWDELALAFSRTSMFPFFDIAHYLVSVMAVKRQPGAAALAWKNPISSWFTAML ::::.:::. .:: ::.....::.. .: .:::. :. ..::.::. ::: CCDS12 MELLSALSLGELALSFSRVPLFPVFDLSYFIVSILYLKYEPGAVELSRRHPIASWLCAML 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 HCFGGGILSCLLLAEPPLKFLANHTNILLASSIWYITFFCPHDLVSQGYSYLPVQLLASG ::::. ::. :::.:: . ...:...:::::..::. :::: :: . .:::.:. . CCDS12 HCFGSYILADLLLGEPLIDYFSNNSSILLASAVWYLIFFCPLDLFYKCVCFLPVKLIFVA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 MKEVTRTWKIVGGVTHANSYYKNGWIVMIAIGWARGAGGTIITNFERLVKGDWKPEGDEW ::::.:. ::. :. ::. .:..::.:::: ::..:.: ....:::.:..: :::: .: CCDS12 MKEVVRVRKIAVGIHHAHHHYHHGWFVMIATGWVKGSGVALMSNFEQLLRGVWKPETNEI 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE5 LKMSYPAKVTLLGSVIFTFQHTQHLAISKHNLMFLYTIFIVATKITMMTTQTSTMTFAPF :.::.:.:..: :...::.:.:. : .:: .:.:..:.:.:. :. . .:.. . : . CCDS12 LHMSFPTKASLYGAILFTLQQTRWLPVSKASLIFIFTLFMVSCKVFLTATHSHSSPFDAL 190 200 210 220 230 240 240 250 260 270 280 pF1KE5 EDTLSWMLFGWQQPFSSC--EKKSEAKSPSNGVG-------SLASKPVDVASDNVKKKHT : . .::: :.: ... . .. :.. : .. .: . :.. .::.. CCDS12 EGYICPVLFG-----SACGGDHHHDNHGGSHSGGGPGAQHSAMPAKSKEELSEGSRKKKA 250 260 270 280 290 290 pF1KE5 KKNE :: . CCDS12 KKAD 291 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 07:31:47 2016 done: Tue Nov 8 07:31:47 2016 Total Scan time: 1.840 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]