FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1603, 104 aa
1>>>pF1KE1603 104 - 104 aa - 104 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.0307+/-0.000597; mu= 11.5249+/- 0.036
mean_var=62.2356+/-12.332, 0's: 0 Z-trim(111.8): 4 B-trim: 0 in 0/51
Lambda= 0.162575
statistics sampled from 12672 (12675) to 12672 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.389), width: 16
Scan time: 1.670
The best scores are: opt bits E(32554)
CCDS4456.1 SCGB3A1 gene_id:92304|Hs108|chr5 ( 104) 642 158.0 9.7e-40
>>CCDS4456.1 SCGB3A1 gene_id:92304|Hs108|chr5 (104 aa)
initn: 642 init1: 642 opt: 642 Z-score: 826.9 bits: 158.0 E(32554): 9.7e-40
Smith-Waterman score: 642; 100.0% identity (100.0% similar) in 104 aa overlap (1-104:1-104)
10 20 30 40 50 60
pF1KE1 MKLAALLGLCVALSCSSAAAFLVGSAKPVAQPVAALESAAEAGAGTLANPLGTLNPLKLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MKLAALLGLCVALSCSSAAAFLVGSAKPVAQPVAALESAAEAGAGTLANPLGTLNPLKLL
10 20 30 40 50 60
70 80 90 100
pF1KE1 LSSLGIPVNHLIEGSQKCVAELGPQAVGAVKALKALLGALTVFG
::::::::::::::::::::::::::::::::::::::::::::
CCDS44 LSSLGIPVNHLIEGSQKCVAELGPQAVGAVKALKALLGALTVFG
70 80 90 100
104 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 17:01:31 2016 done: Sun Nov 6 17:01:31 2016
Total Scan time: 1.670 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]