FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6126, 105 aa
1>>>pF1KE6126 105 - 105 aa - 105 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.8443+/-0.000576; mu= 12.6252+/- 0.035
mean_var=68.2159+/-13.026, 0's: 0 Z-trim(113.7): 23 B-trim: 18 in 1/49
Lambda= 0.155286
statistics sampled from 14313 (14333) to 14313 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.807), E-opt: 0.2 (0.44), width: 16
Scan time: 1.020
The best scores are:                                opt  bits E(32554)
CCDS825.1 PROK1 gene_id:84432|Hs108|chr1    ( 105)  772 180.5 1.6e-46
CCDS2916.1 PROK2 gene_id:60675|Hs108|chr3   ( 108)  407  98.8 6.9e-22
CCDS46868.1 PROK2 gene_id:60675|Hs108|chr3  ( 129)  269  67.9 1.6e-12
>>CCDS825.1 PROK1 gene_id:84432|Hs108|chr1 (105 aa)
initn: 772 init1: 772 opt: 772 Z-score: 948.5 bits: 180.5 E(32554): 1.6e-46
Smith-Waterman score: 772; 99.0% identity (100.0% similar) in 105 aa overlap (1-105:1-105)
               10        20        30        40        50        60
pF1KE6 MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC
               10        20        30        40        50        60

               70        80        90       100
pF1KE6 HPGSHKIPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF
       ::::::.::::::::::::::::::::::::::::::::::::::
CCDS82 HPGSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF
               70        80        90       100
>>CCDS2916.1 PROK2 gene_id:60675|Hs108|chr3 (108 aa)
initn: 418 init1: 392 opt: 407 Z-score: 506.4 bits: 98.8 E(32554): 6.9e-22
Smith-Waterman score: 407; 54.0% identity (80.5% similar) in 87 aa overlap (10-96:18-104)
                       10        20        30        40        50
pF1KE6         MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTP
                        .::   ..: :::::::..: :::.: :::.:.:....:.:::
CCDS29 MRSLCCAPLLLLLLLPPLLLTPRAGDAAVITGACDKDSQCGGGMCCAVSIWVKSIRICTP
               10        20        30        40        50        60

             60        70        80        90       100
pF1KE6 LGREGEECHPGSHKIPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF
       .:. :. ::: ..:.::: .: ::::::::.: : :   .:. :
CCDS29 MGKLGDSCHPLTRKVPFFGRRMHHTCPCLPGLACLRTSFNRFICLAQK
               70        80        90       100
>>CCDS46868.1 PROK2 gene_id:60675|Hs108|chr3 (129 aa)
initn: 382 init1: 269 opt: 269 Z-score: 338.3 bits: 67.9 E(32554): 1.6e-12
Smith-Waterman score: 355; 43.5% identity (64.8% similar) in 108 aa overlap (10-96:18-125)
                       10        20        30        40        50
pF1KE6         MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTP
                        .::   ..: :::::::..: :::.: :::.:.:....:.:::
CCDS46 MRSLCCAPLLLLLLLPPLLLTPRAGDAAVITGACDKDSQCGGGMCCAVSIWVKSIRICTP
               10        20        30        40        50        60

             60                             70        80        90
pF1KE6 LGREGEECHPGSHK---------------------IPFFRKRKHHTCPCLPNLLCSRFPD
       .:. :. ::: ..:                     .::: .: ::::::::.: : :
CCDS46 MGKLGDSCHPLTRKNNFGNGRQERRKRKRSKRKKEVPFFGRRMHHTCPCLPGLACLRTSF
               70        80        90       100       110       120

             100
pF1KE6 GRYRCSMDLKNINF
       .:. :
CCDS46 NRFICLAQK
105 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 09:40:41 2016 done: Tue Nov 8 09:40:41 2016
Total Scan time: 1.020 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]