FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7941, 343 aa 1>>>pF1KB7941 343 - 343 aa - 343 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.0008+/-0.00075; mu= 2.0674+/- 0.046 mean_var=224.6836+/-45.481, 0's: 0 Z-trim(117.6): 152 B-trim: 13 in 1/51 Lambda= 0.085564 statistics sampled from 18227 (18400) to 18227 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.833), E-opt: 0.2 (0.565), width: 16 Scan time: 3.550 The best scores are: opt bits E(32554) CCDS31781.1 DBX2 gene_id:440097|Hs108|chr12 ( 339) 494 72.7 4.8e-13 >>CCDS31781.1 DBX2 gene_id:440097|Hs108|chr12 (339 aa) initn: 454 init1: 391 opt: 494 Z-score: 347.7 bits: 72.7 E(32554): 4.8e-13 Smith-Waterman score: 529; 38.4% identity (59.1% similar) in 323 aa overlap (2-297:1-310) 10 20 30 40 50 pF1KB7 MMFPGLLAPPAG-YPSLLRPTPTLTLPQSLQSAFSGHSSFLVEDLIRISRPPAYLPRSVP :.:. .: :: : ... . :.:: . . :.: ::.:.:.:.. :. :: : CCDS31 MLPSAVAAHAGAYWDVVASSALLNLPAAPGFGNLGKS-FLIENLLRVGGAPT--PRLQP 10 20 30 40 50 60 70 80 90 100 pF1KB7 TASMSP------------PRQGAPTALTDTGASDLGSPGPGSRRGGSPP-TAFSPASETT : .: : ..:. : :.. ::. :. : ..::..... CCDS31 PAPHDPATALATAGAQLRPLPASPVPLKLCPAAEQVSPA-GAPYGTRWAFQVLSPSADSA 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB7 FLKFGVNAILSSGPRTETSPA----LLQSVPPKTFAFPYFEGSFQPFIRSSYFPASSSVV : : . ..:: .: :.:: : :: . :. :: :.. CCDS31 RLP-GRAPGDRDCTFQPSAPAPSKPFLLSTPP--FYSACCGGSCRRPASSTAFPREESML 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB7 PIPGTFSWPLAARGKPRRGMLRRAVFSDVQRKALEKMFQKQKYISKPDRKKLAAKLGLKD :. . .: :::.:::::::. ::::::::::::::::: :::::: .::::. CCDS31 PL-----LTQDSNSKARRGILRRAVFSEDQRKALEKMFQKQKYISKTDRKKLAINLGLKE 180 190 200 210 220 230 240 250 260 270 pF1KB7 SQVKIWFQNRRMKWRNSKERELLSSG-----GCREQTLPTKL----NPHPDLSDVGQKGP :::::::::::::::::::.:.::. : .:. : . .: :.. :: :. CCDS31 SQVKIWFQNRRMKWRNSKEKEVLSNRCIQEVGLQEDPLSRSALGFPSPCPSIWDVPQQHS 230 240 250 260 270 280 280 290 300 310 320 330 pF1KB7 GNEEEEEGPGSPSHRLAYHASSDPQHLRDPRLPGPLPPSPAHSSSPGKPSDFSDSEEEEE . . .:..: ::.:: ..:. : CCDS31 SPRWRENSP-EPSERLIQESSGAPPPEANSLQGALYLCSEEEAGSKGVLTGAV 290 300 310 320 330 343 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:39:07 2016 done: Sun Nov 6 04:39:07 2016 Total Scan time: 3.550 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]