FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7941, 343 aa
1>>>pF1KB7941 343 - 343 aa - 343 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.0008+/-0.00075; mu= 2.0674+/- 0.046
mean_var=224.6836+/-45.481, 0's: 0 Z-trim(117.6): 152 B-trim: 13 in 1/51
Lambda= 0.085564
statistics sampled from 18227 (18400) to 18227 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.833), E-opt: 0.2 (0.565), width: 16
Scan time: 3.550
The best scores are: opt bits E(32554)
CCDS31781.1 DBX2 gene_id:440097|Hs108|chr12 ( 339) 494 72.7 4.8e-13
>>CCDS31781.1 DBX2 gene_id:440097|Hs108|chr12 (339 aa)
initn: 454 init1: 391 opt: 494 Z-score: 347.7 bits: 72.7 E(32554): 4.8e-13
Smith-Waterman score: 529; 38.4% identity (59.1% similar) in 323 aa overlap (2-297:1-310)
10 20 30 40 50
pF1KB7 MMFPGLLAPPAG-YPSLLRPTPTLTLPQSLQSAFSGHSSFLVEDLIRISRPPAYLPRSVP
:.:. .: :: : ... . :.:: . . :.: ::.:.:.:.. :. :: :
CCDS31 MLPSAVAAHAGAYWDVVASSALLNLPAAPGFGNLGKS-FLIENLLRVGGAPT--PRLQP
10 20 30 40 50
60 70 80 90 100
pF1KB7 TASMSP------------PRQGAPTALTDTGASDLGSPGPGSRRGGSPP-TAFSPASETT
: .: : ..:. : :.. ::. :. : ..::.....
CCDS31 PAPHDPATALATAGAQLRPLPASPVPLKLCPAAEQVSPA-GAPYGTRWAFQVLSPSADSA
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB7 FLKFGVNAILSSGPRTETSPA----LLQSVPPKTFAFPYFEGSFQPFIRSSYFPASSSVV
: : . ..:: .: :.:: : :: . :. :: :..
CCDS31 RLP-GRAPGDRDCTFQPSAPAPSKPFLLSTPP--FYSACCGGSCRRPASSTAFPREESML
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB7 PIPGTFSWPLAARGKPRRGMLRRAVFSDVQRKALEKMFQKQKYISKPDRKKLAAKLGLKD
:. . .: :::.:::::::. ::::::::::::::::: :::::: .::::.
CCDS31 PL-----LTQDSNSKARRGILRRAVFSEDQRKALEKMFQKQKYISKTDRKKLAINLGLKE
180 190 200 210 220
230 240 250 260 270
pF1KB7 SQVKIWFQNRRMKWRNSKERELLSSG-----GCREQTLPTKL----NPHPDLSDVGQKGP
:::::::::::::::::::.:.::. : .:. : . .: :.. :: :.
CCDS31 SQVKIWFQNRRMKWRNSKEKEVLSNRCIQEVGLQEDPLSRSALGFPSPCPSIWDVPQQHS
230 240 250 260 270 280
280 290 300 310 320 330
pF1KB7 GNEEEEEGPGSPSHRLAYHASSDPQHLRDPRLPGPLPPSPAHSSSPGKPSDFSDSEEEEE
. . .:..: ::.:: ..:. :
CCDS31 SPRWRENSP-EPSERLIQESSGAPPPEANSLQGALYLCSEEEAGSKGVLTGAV
290 300 310 320 330
343 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 04:39:07 2016 done: Sun Nov 6 04:39:07 2016
Total Scan time: 3.550 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
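
The per-hit summary lines in the report above (the "initn: ... E(32554): ..." line and the "Smith-Waterman score: ..." line) can be pulled into structured values with a short script. The following is a minimal sketch in Python, assuming those two lines keep exactly the layout shown in this report. The names SCORE_RE, OVERLAP_RE, and parse_hit_summary are hypothetical helpers introduced for illustration and are not part of the FASTA package.

```python
import re

# Assumed layout of the score line, copied from this report:
# "initn: 454 init1: 391 opt: 494 Z-score: 347.7 bits: 72.7 E(32554): 4.8e-13"
SCORE_RE = re.compile(
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<zscore>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\((?P<db_size>\d+)\):\s*(?P<evalue>[\d.eE+-]+)"
)

# Assumed layout of the overlap line, copied from this report:
# "Smith-Waterman score: 529; 38.4% identity (59.1% similar) in 323 aa overlap (2-297:1-310)"
OVERLAP_RE = re.compile(
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*(?P<ident>[\d.]+)% identity\s+"
    r"\((?P<sim>[\d.]+)% similar\)\s+in\s+(?P<overlap>\d+) aa overlap\s+"
    r"\((?P<q_start>\d+)-(?P<q_end>\d+):(?P<s_start>\d+)-(?P<s_end>\d+)\)"
)


def parse_hit_summary(score_line, overlap_line):
    """Parse the two summary lines of one hit into a dictionary (hypothetical helper)."""
    s = SCORE_RE.search(score_line)
    o = OVERLAP_RE.search(overlap_line)
    if s is None or o is None:
        raise ValueError("line does not match the expected summary layout")
    overlap = int(o.group("overlap"))
    pct_identity = float(o.group("ident"))
    return {
        "initn": int(s.group("initn")),
        "init1": int(s.group("init1")),
        "opt": int(s.group("opt")),
        "zscore": float(s.group("zscore")),
        "bits": float(s.group("bits")),
        "db_size": int(s.group("db_size")),
        "evalue": float(s.group("evalue")),
        "sw_score": int(o.group("sw")),
        "pct_identity": pct_identity,
        "pct_similar": float(o.group("sim")),
        "overlap": overlap,
        "query_range": (int(o.group("q_start")), int(o.group("q_end"))),
        "library_range": (int(o.group("s_start")), int(o.group("s_end"))),
        # Approximate count of identical positions, recovered from the rounded percentage.
        "approx_identical": round(overlap * pct_identity / 100),
    }


hit = parse_hit_summary(
    "initn: 454 init1: 391 opt: 494 Z-score: 347.7 bits: 72.7 E(32554): 4.8e-13",
    "Smith-Waterman score: 529; 38.4% identity (59.1% similar) in 323 aa overlap (2-297:1-310)",
)
print(hit["opt"], hit["bits"], hit["evalue"], hit["query_range"], hit["library_range"])
```

The overlap length (323) is larger than either coordinate span (2-297 in the query, 1-310 in the library sequence) because it counts alignment columns, including gap positions. The identical-position count is only an estimate, since the report gives the percentage rounded to one decimal place.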