FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1016, 440 aa
1>>>pF1KE1016 440 - 440 aa - 440 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.8857+/-0.000713; mu= 7.7383+/- 0.043
mean_var=165.8263+/-33.459, 0's: 0 Z-trim(115.8): 17 B-trim: 0 in 0/53
Lambda= 0.099597
statistics sampled from 16331 (16346) to 16331 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.814), E-opt: 0.2 (0.502), width: 16
Scan time: 3.610
The best scores are:                                 opt  bits E(32554)
CCDS34317.1 SQSTM1 gene_id:8878|Hs108|chr5   ( 440) 3083 454.4 1e-127
CCDS47355.1 SQSTM1 gene_id:8878|Hs108|chr5   ( 356) 2520 373.4 2e-103
>>CCDS34317.1 SQSTM1 gene_id:8878|Hs108|chr5 (440 aa)
initn: 3083 init1: 3083 opt: 3083 Z-score: 2406.2 bits: 454.4 E(32554): 1e-127
Smith-Waterman score: 3083; 100.0% identity (100.0% similar) in 440 aa overlap (1-440:1-440)
10 20 30 40 50 60
pF1KE1 MASLTVKAYLLGKEDAAREIRRFSFCCSPEPEAEAEAAAGPGPCERLLSRVAALFPALRP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MASLTVKAYLLGKEDAAREIRRFSFCCSPEPEAEAEAAAGPGPCERLLSRVAALFPALRP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 GGFQAHYRDEDGDLVAFSSDEELTMAMSYVKDDIFRIYIKEKKECRRDHRPPCAQEAPRN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 GGFQAHYRDEDGDLVAFSSDEELTMAMSYVKDDIFRIYIKEKKECRRDHRPPCAQEAPRN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 MVHPNVICDGCNGPVVGTRYKCSVCPDYDLCSVCEGKGLHRGHTKLAFPSPFGHLSEGFS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MVHPNVICDGCNGPVVGTRYKCSVCPDYDLCSVCEGKGLHRGHTKLAFPSPFGHLSEGFS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 HSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARPGPTAESASGPSEDPSVNFLKNV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 HSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARPGPTAESASGPSEDPSVNFLKNV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 GESVAAALSPLGIEVDIDVEHGGKRSRLTPVSPESSSTEEKSSSQPSSCCSDPSKPGGNV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 GESVAAALSPLGIEVDIDVEHGGKRSRLTPVSPESSSTEEKSSSQPSSCCSDPSKPGGNV
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE1 EGATQSLAEQMRKIALESEGRPEEQMESDNCSGGDDDWTHLSSKEVDPSTGELQSLQMPE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 EGATQSLAEQMRKIALESEGRPEEQMESDNCSGGDDDWTHLSSKEVDPSTGELQSLQMPE
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE1 SEGPSSLDPSQEGPTGLKEAALYPHLPPEADPRLIESLSQMLSMGFSDEGGWLTRLLQTK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 SEGPSSLDPSQEGPTGLKEAALYPHLPPEADPRLIESLSQMLSMGFSDEGGWLTRLLQTK
370 380 390 400 410 420
430 440
pF1KE1 NYDIGAALDTIQYSKHPPPL
::::::::::::::::::::
CCDS34 NYDIGAALDTIQYSKHPPPL
430 440
>>CCDS47355.1 SQSTM1 gene_id:8878|Hs108|chr5 (356 aa)
initn: 2520 init1: 2520 opt: 2520 Z-score: 1970.3 bits: 373.4 E(32554): 2e-103
Smith-Waterman score: 2520; 100.0% identity (100.0% similar) in 356 aa overlap (85-440:1-356)
60 70 80 90 100 110
pF1KE1 FPALRPGGFQAHYRDEDGDLVAFSSDEELTMAMSYVKDDIFRIYIKEKKECRRDHRPPCA
                                     ::::::::::::::::::::::::::::::
CCDS47                               MAMSYVKDDIFRIYIKEKKECRRDHRPPCA
10 20 30
120 130 140 150 160 170
pF1KE1 QEAPRNMVHPNVICDGCNGPVVGTRYKCSVCPDYDLCSVCEGKGLHRGHTKLAFPSPFGH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 QEAPRNMVHPNVICDGCNGPVVGTRYKCSVCPDYDLCSVCEGKGLHRGHTKLAFPSPFGH
40 50 60 70 80 90
180 190 200 210 220 230
pF1KE1 LSEGFSHSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARPGPTAESASGPSEDPSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LSEGFSHSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARPGPTAESASGPSEDPSV
100 110 120 130 140 150
240 250 260 270 280 290
pF1KE1 NFLKNVGESVAAALSPLGIEVDIDVEHGGKRSRLTPVSPESSSTEEKSSSQPSSCCSDPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 NFLKNVGESVAAALSPLGIEVDIDVEHGGKRSRLTPVSPESSSTEEKSSSQPSSCCSDPS
160 170 180 190 200 210
300 310 320 330 340 350
pF1KE1 KPGGNVEGATQSLAEQMRKIALESEGRPEEQMESDNCSGGDDDWTHLSSKEVDPSTGELQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KPGGNVEGATQSLAEQMRKIALESEGRPEEQMESDNCSGGDDDWTHLSSKEVDPSTGELQ
220 230 240 250 260 270
360 370 380 390 400 410
pF1KE1 SLQMPESEGPSSLDPSQEGPTGLKEAALYPHLPPEADPRLIESLSQMLSMGFSDEGGWLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SLQMPESEGPSSLDPSQEGPTGLKEAALYPHLPPEADPRLIESLSQMLSMGFSDEGGWLT
280 290 300 310 320 330
420 430 440
pF1KE1 RLLQTKNYDIGAALDTIQYSKHPPPL
::::::::::::::::::::::::::
CCDS47 RLLQTKNYDIGAALDTIQYSKHPPPL
340 350
440 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 13:26:01 2016 done: Sat Nov 5 13:26:01 2016
Total Scan time: 3.610 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]