FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9678, 108 aa
1>>>pF1KB9678 108 - 108 aa - 108 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.9262+/-0.000507; mu= 12.7138+/- 0.031
mean_var=69.5757+/-13.981, 0's: 0 Z-trim(115.2): 51 B-trim: 0 in 0/54
Lambda= 0.153761
statistics sampled from 15684 (15740) to 15684 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.843), E-opt: 0.2 (0.484), width: 16
Scan time: 1.470
The best scores are: opt bits E(32554)
CCDS6767.1 TAL2 gene_id:6887|Hs108|chr9 ( 108) 727 168.7 6.1e-43
CCDS547.1 TAL1 gene_id:6886|Hs108|chr1 ( 331) 346 84.6 4e-17
CCDS12292.1 LYL1 gene_id:4066|Hs108|chr19 ( 280) 330 81.0 4.1e-16
>>CCDS6767.1 TAL2 gene_id:6887|Hs108|chr9 (108 aa)
initn: 727 init1: 727 opt: 727 Z-score: 884.3 bits: 168.7 E(32554): 6.1e-43
Smith-Waterman score: 727; 100.0% identity (100.0% similar) in 108 aa overlap (1-108:1-108)
10 20 30 40 50 60
pF1KB9 MTRKIFTNTRERWRQQNVNSAFAKLRKLIPTHPPDKKLSKNETLRLAMRYINFLVKVLGE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 MTRKIFTNTRERWRQQNVNSAFAKLRKLIPTHPPDKKLSKNETLRLAMRYINFLVKVLGE
10 20 30 40 50 60
70 80 90 100
pF1KB9 QSLQQTGVAAQGNILGLFPQGPHLPGLEDRTLLENYQVPSPGPSHHIP
::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 QSLQQTGVAAQGNILGLFPQGPHLPGLEDRTLLENYQVPSPGPSHHIP
70 80 90 100
>>CCDS547.1 TAL1 gene_id:6886|Hs108|chr1 (331 aa)
initn: 329 init1: 329 opt: 346 Z-score: 421.0 bits: 84.6 E(32554): 4e-17
Smith-Waterman score: 346; 72.2% identity (88.9% similar) in 72 aa overlap (1-72:186-257)
10 20 30
pF1KB9 MTRKIFTNTRERWRQQNVNSAFAKLRKLIP
..:.::::.::::::::::.:::.::::::
CCDS54 DAFPMFTTNNRVKRRPSPYEMEITDGPHTKVVRRIFTNSRERWRQQNVNGAFAELRKLIP
160 170 180 190 200 210
40 50 60 70 80 90
pF1KB9 THPPDKKLSKNETLRLAMRYINFLVKVLGEQSLQQTGVAAQGNILGLFPQGPHLPGLEDR
:::::::::::: :::::.:::::.:.:..: . : : :
CCDS54 THPPDKKLSKNEILRLAMKYINFLAKLLNDQEEEGTQRAKTGKDPVVGAGGGGGGGGGGA
220 230 240 250 260 270
100
pF1KB9 TLLENYQVPSPGPSHHIP
CCDS54 PPDDLLQDVLSPNSSCGSSLDGAASPDSYTEEPAPKHTARSLHPAMLPAADGAGPR
280 290 300 310 320 330
>>CCDS12292.1 LYL1 gene_id:4066|Hs108|chr19 (280 aa)
initn: 327 init1: 311 opt: 330 Z-score: 402.8 bits: 81.0 E(32554): 4.1e-16
Smith-Waterman score: 332; 53.9% identity (77.5% similar) in 102 aa overlap (1-102:149-234)
10 20 30
pF1KB9 MTRKIFTNTRERWRQQNVNSAFAKLRKLIP
..:..:::.::::::::::.:::.::::.:
CCDS12 GPFSIFPSSRLKRRPSHCELDLAEGHQPQKVARRVFTNSRERWRQQNVNGAFAELRKLLP
120 130 140 150 160 170
40 50 60 70 80 90
pF1KB9 THPPDKKLSKNETLRLAMRYINFLVKVLGEQSLQQTGVAAQGNILGLFPQGPHLPGLEDR
:::::.::::::.:::::.::.:::..: .:. ...:: :: :: . :
CCDS12 THPPDRKLSKNEVLRLAMKYIGFLVRLLRDQA---AALAA----------GPTPPGPRKR
180 190 200 210 220
100
pF1KB9 TLLENYQVPSPGPSHHIP
. ..::. :
CCDS12 PV---HRVPDDGARRGSGRRAEAAARSQPAPPADPDGSPGGAARPIKMEQTALSPEVR
230 240 250 260 270 280
108 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 18:15:31 2016 done: Fri Nov 4 18:15:31 2016
Total Scan time: 1.470 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]