FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7903, 107 aa
1>>>pF1KB7903 107 - 107 aa - 107 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.4966+/-0.000619; mu= 3.2101+/- 0.038
mean_var=174.5326+/-34.515, 0's: 0 Z-trim(118.6): 8 B-trim: 10 in 1/55
Lambda= 0.097081
statistics sampled from 19600 (19607) to 19600 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.87), E-opt: 0.2 (0.602), width: 16
Scan time: 1.640
The best scores are:                                opt  bits E(32554)
CCDS4789.1 HMGA1 gene_id:3159|Hs108|chr6    ( 107)  704 108.3 9.4e-25
CCDS4788.1 HMGA1 gene_id:3159|Hs108|chr6    (  96)  417  68.1 1.1e-12
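The summary rows above follow a regular layout: accession, description, library sequence length in parentheses, then the opt score, bit score, and E-value. A minimal sketch of pulling those fields out in Python follows. The names `HIT_RE` and `parse_hit` are hypothetical helpers, not part of FASTA, and the pattern is an assumption based only on the two rows shown here.

```python
import re

# Hypothetical pattern for one "best scores" row, inferred from the two rows
# in this report: accession, free-text description, "( length)", opt, bits, E.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>[\d.eE+-]+)\s*$"
)

def parse_hit(line):
    """Return a dict of fields for a summary row, or None if the line does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "accession": m.group("acc"),
        "description": m.group("desc"),
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }

# The two rows reported above, copied from this output.
rows = [
    "CCDS4789.1 HMGA1 gene_id:3159|Hs108|chr6 ( 107) 704 108.3 9.4e-25",
    "CCDS4788.1 HMGA1 gene_id:3159|Hs108|chr6 ( 96) 417 68.1 1.1e-12",
]
hits = [parse_hit(row) for row in rows]
header = parse_hit("The best scores are: opt bits E(32554)")  # not a hit row
```

The pattern tolerates any run of whitespace between columns, so it should not depend on the exact column padding of the report.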
>>CCDS4789.1 HMGA1 gene_id:3159|Hs108|chr6 (107 aa)
initn: 704 init1: 704 opt: 704 Z-score: 557.9 bits: 108.3 E(32554): 9.4e-25
Smith-Waterman score: 704; 100.0% identity (100.0% similar) in 107 aa overlap (1-107:1-107)
               10        20        30        40        50        60
pF1KB7 MSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGR
               10        20        30        40        50        60
               70        80        90       100
pF1KB7 PKGSKNKGAAKTRKTTTTPGRKPRGRPKKLEKEEEEGISQESSEEEQ
       :::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 PKGSKNKGAAKTRKTTTTPGRKPRGRPKKLEKEEEEGISQESSEEEQ
               70        80        90       100
>>CCDS4788.1 HMGA1 gene_id:3159|Hs108|chr6 (96 aa)
initn: 410 init1: 410 opt: 417 Z-score: 341.3 bits: 68.1 E(32554): 1.1e-12
Smith-Waterman score: 604; 89.7% identity (89.7% similar) in 107 aa overlap (1-107:1-96)
               10        20        30        40        50        60
pF1KB7 MSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGR
       ::::::::::::::::::::::::::::::::::           :::::::::::::::
CCDS47 MSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPP-----------KEPSEVPTPKRPRGR
               10        20        30                   40
               70        80        90       100
pF1KB7 PKGSKNKGAAKTRKTTTTPGRKPRGRPKKLEKEEEEGISQESSEEEQ
       :::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 PKGSKNKGAAKTRKTTTTPGRKPRGRPKKLEKEEEEGISQESSEEEQ
      50        60        70        80        90
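The "89.7% identity ... in 107 aa overlap" figure reported for CCDS4788.1 can be checked from the aligned rows themselves. The sketch below assumes identity is counted as identical, non-gap columns divided by all alignment columns (gap columns included), which agrees with the reported value here; `percent_identity` is a hypothetical helper, not a FASTA function.

```python
def percent_identity(query_aln, subject_aln):
    """Count identical non-gap columns over all alignment columns.

    Assumes both strings are rows of the same alignment, with '-' for gaps.
    Returns (identical, columns, percent rounded to one decimal).
    """
    if len(query_aln) != len(subject_aln):
        raise ValueError("aligned rows must have equal length")
    columns = len(query_aln)
    identical = sum(
        1 for q, s in zip(query_aln, subject_aln) if q == s and q != "-"
    )
    return identical, columns, round(100.0 * identical / columns, 1)

# Both alignment blocks for pF1KB7903 vs CCDS4788.1, joined end to end.
query = (
    "MSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPPVSPGTALVGSQKEPSEVPTPKRPRGR"
    "PKGSKNKGAAKTRKTTTTPGRKPRGRPKKLEKEEEEGISQESSEEEQ"
)
subject = (
    "MSESSSKSSQPLASKQEKDGTEKRGRGRPRKQPP-----------KEPSEVPTPKRPRGR"
    "PKGSKNKGAAKTRKTTTTPGRKPRGRPKKLEKEEEEGISQESSEEEQ"
)
identical, columns, pct = percent_identity(query, subject)
```

The 11 gap columns account for the difference between the 107-residue query and the 96-residue library sequence.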
107 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 02:06:30 2016 done: Mon Nov 7 02:06:31 2016
Total Scan time: 1.640 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]