FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2707, 237 aa 1>>>pF1KE2707 237 - 237 aa - 237 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1556+/-0.00075; mu= 10.9910+/- 0.045 mean_var=88.7032+/-17.564, 0's: 0 Z-trim(110.4): 15 B-trim: 2 in 1/50 Lambda= 0.136177 statistics sampled from 11596 (11606) to 11596 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.73), E-opt: 0.2 (0.357), width: 16 Scan time: 1.090 The best scores are: opt bits E(32554) CCDS11848.2 VAPA gene_id:9218|Hs108|chr18 ( 249) 1590 321.7 2.7e-88 CCDS33498.1 VAPB gene_id:9217|Hs108|chr20 ( 243) 1001 206.0 1.8e-53 CCDS11847.2 VAPA gene_id:9218|Hs108|chr18 ( 294) 918 189.7 1.7e-48 CCDS56198.1 VAPB gene_id:9217|Hs108|chr20 ( 99) 376 83.0 7.9e-17 >>CCDS11848.2 VAPA gene_id:9218|Hs108|chr18 (249 aa) initn: 1590 init1: 1590 opt: 1590 Z-score: 1698.5 bits: 321.7 E(32554): 2.7e-88 Smith-Waterman score: 1590; 100.0% identity (100.0% similar) in 237 aa overlap (1-237:8-244) 10 20 30 40 50 pF1KE2 MAKHEQILVLDPPTDLKFKGPFTDVVTTNLKLRNPSDRKVCFKVKTTAPRRYC ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MASASGAMAKHEQILVLDPPTDLKFKGPFTDVVTTNLKLRNPSDRKVCFKVKTTAPRRYC 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 VRPNSGIIDPGSTVTVSVMLQPFDYDPNEKSKHKFMVQTIFAPPNTSDMEAVWKEAKPDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VRPNSGIIDPGSTVTVSVMLQPFDYDPNEKSKHKFMVQTIFAPPNTSDMEAVWKEAKPDE 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE2 LMDSKLRCVFEMPNENDKLNDMEPSKAVPLNASKQDGPMPKPHSVSLNDTETRKLMEECK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LMDSKLRCVFEMPNENDKLNDMEPSKAVPLNASKQDGPMPKPHSVSLNDTETRKLMEECK 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE2 RLQGEMMKLSEENRHLRDEGLRLRKVAHSDKPGSTSTASFRDNVTSPLPSLLVVIAAIFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 RLQGEMMKLSEENRHLRDEGLRLRKVAHSDKPGSTSTASFRDNVTSPLPSLLVVIAAIFI 190 200 210 220 230 240 pF1KE2 GFFL :::: CCDS11 GFFLGKFIL >>CCDS33498.1 VAPB gene_id:9217|Hs108|chr20 (243 aa) initn: 985 init1: 800 opt: 1001 Z-score: 1073.3 bits: 206.0 E(32554): 1.8e-53 Smith-Waterman score: 1001; 63.2% identity (87.0% similar) in 239 aa overlap (1-234:1-239) 10 20 30 40 50 60 pF1KE2 MAKHEQILVLDPPTDLKFKGPFTDVVTTNLKLRNPSDRKVCFKVKTTAPRRYCVRPNSGI ::: ::.: :.: .:::.::::::::::::: ::.::.::::::::::::::::::::: CCDS33 MAKVEQVLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 IDPGSTVTVSVMLQPFDYDPNEKSKHKFMVQTIFAPPNTSDMEAVWKEAKPDELMDSKLR :: :....:::::::::::::::::::::::..::: .:::::::::::::..::::::: CCDS33 IDAGASINVSVMLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAVWKEAKPEDLMDSKLR 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 CVFEMPNENDKLNDMEPSKAVPLNASKQDGPM-PKPHSVSLNDTETRKLMEECKRLQGEM ::::.: :::: .:.: .: . .::: . :. : : ::.:::..:.::::::::::. CCDS33 CVFELPAENDKPHDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEV 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE2 MKLSEENRHLRDE-GLRLRKVAHSDKPGSTSTASFRDN-VTSPLPSLLVV--IAAIFIGF ..: :::.....: :::.::...:..: :. . . ... ... : .:.:. :....:: CCDS33 QRLREENKQFKEEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALVVLFFIVGVIIGK 190 200 210 220 230 240 pF1KE2 FL CCDS33 IAL >>CCDS11847.2 VAPA gene_id:9218|Hs108|chr18 (294 aa) initn: 912 init1: 912 opt: 918 Z-score: 983.9 bits: 189.7 E(32554): 1.7e-48 Smith-Waterman score: 1413; 83.3% identity (83.3% similar) in 269 aa overlap (1-224:8-276) 10 20 30 40 50 pF1KE2 MAKHEQILVLDPPTDLKFKGPFTDVVTTNLKLRNPSDRKVCFKVKTTAPRRYC ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MASASGAMAKHEQILVLDPPTDLKFKGPFTDVVTTNLKLRNPSDRKVCFKVKTTAPRRYC 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 VRPNSGIIDPGSTVTVSVMLQPFDYDPNEKSKHKFMVQTIFAPPNTSDMEAVWKEAKPDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VRPNSGIIDPGSTVTVSVMLQPFDYDPNEKSKHKFMVQTIFAPPNTSDMEAVWKEAKPDE 70 80 90 100 110 120 120 130 pF1KE2 LMDSKLRCVFEMPNENDKL----------------------------------------- ::::::::::::::::::: CCDS11 LMDSKLRCVFEMPNENDKLGITPPGNAPTVTSMSSINNTVATPASYHTKDDPRGLSVLKQ 130 140 150 160 170 180 140 150 160 170 180 pF1KE2 ----NDMEPSKAVPLNASKQDGPMPKPHSVSLNDTETRKLMEECKRLQGEMMKLSEENRH :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 EKQKNDMEPSKAVPLNASKQDGPMPKPHSVSLNDTETRKLMEECKRLQGEMMKLSEENRH 190 200 210 220 230 240 190 200 210 220 230 pF1KE2 LRDEGLRLRKVAHSDKPGSTSTASFRDNVTSPLPSLLVVIAAIFIGFFL :::::::::::::::::::::::::::::::::::: CCDS11 LRDEGLRLRKVAHSDKPGSTSTASFRDNVTSPLPSLLVVIAAIFIGFFLGKFIL 250 260 270 280 290 >>CCDS56198.1 VAPB gene_id:9217|Hs108|chr20 (99 aa) initn: 392 init1: 372 opt: 376 Z-score: 415.6 bits: 83.0 E(32554): 7.9e-17 Smith-Waterman score: 376; 62.2% identity (81.1% similar) in 90 aa overlap (1-90:1-90) 10 20 30 40 50 60 pF1KE2 MAKHEQILVLDPPTDLKFKGPFTDVVTTNLKLRNPSDRKVCFKVKTTAPRRYCVRPNSGI ::: ::.: :.: .:::.::::::::::::: ::.::.::::::::::::::::::::: CCDS56 MAKVEQVLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 IDPGSTVTVSVMLQPFDYDPNEKSKHKFMVQTIFAPPNTSDMEAVWKEAKPDELMDSKLR :: :....:: : . . ... .: . CCDS56 IDAGASINVSGRRWTADEEDSAEQQPHFSISPNWEGRRP 70 80 90 237 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Dec 15 09:59:30 2017 done: Fri Dec 15 09:59:30 2017 Total Scan time: 1.090 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]