FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1790, 297 aa
1>>>pF1KE1790 297 - 297 aa - 297 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0578+/-0.000735; mu= 13.8556+/- 0.045
mean_var=95.2097+/-19.183, 0's: 0 Z-trim(111.7): 11 B-trim: 98 in 1/50
Lambda= 0.131442
statistics sampled from 12618 (12627) to 12618 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.735), E-opt: 0.2 (0.388), width: 16
Scan time: 2.790
The best scores are: opt bits E(32554)
CCDS6912.1 ENDOG gene_id:2021|Hs108|chr9 ( 297) 1990 386.8 1e-107
CCDS2680.1 EXOG gene_id:9941|Hs108|chr3 ( 368) 603 123.9 1.8e-28
CCDS46795.1 EXOG gene_id:9941|Hs108|chr3 ( 318) 544 112.6 3.8e-25
>>CCDS6912.1 ENDOG gene_id:2021|Hs108|chr9 (297 aa)
initn: 1990 init1: 1990 opt: 1990 Z-score: 2047.3 bits: 386.8 E(32554): 1e-107
Smith-Waterman score: 1990; 99.7% identity (99.7% similar) in 297 aa overlap (1-297:1-297)
10 20 30 40 50 60
pF1KE1 MRALRAGLTLALGAGLGAVVEGWRRRREDARAAPGLLGRLPVLPVAAAAELPPVPGGPRG
::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MRALRAGLTLASGAGLGAVVEGWRRRREDARAAPGLLGRLPVLPVAAAAELPPVPGGPRG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 PGELAKYGLPGLAQLKSRESYVLCYDPRTRGALWVVEQLRPERLRGDGDRRECDFREDDS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 PGELAKYGLPGLAQLKSRESYVLCYDPRTRGALWVVEQLRPERLRGDGDRRECDFREDDS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 VHAYHRATNADYRGSGFDRGHLAAAANHRWSQKAMDDTFYLSNVAPQVPHLNQNAWNNLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 VHAYHRATNADYRGSGFDRGHLAAAANHRWSQKAMDDTFYLSNVAPQVPHLNQNAWNNLE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 KYSRSLTRSYQNVYVCTGPLFLPRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEAAGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KYSRSLTRSYQNVYVCTGPLFLPRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEAAGG
190 200 210 220 230 240
250 260 270 280 290
pF1KE1 QIELRTYVMPNAPVDEAIPLERFLVPIESIERASGLLFVPNILARAGSLKAITAGSK
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 QIELRTYVMPNAPVDEAIPLERFLVPIESIERASGLLFVPNILARAGSLKAITAGSK
250 260 270 280 290
>>CCDS2680.1 EXOG gene_id:9941|Hs108|chr3 (368 aa)
initn: 501 init1: 501 opt: 603 Z-score: 624.5 bits: 123.9 E(32554): 1.8e-28
Smith-Waterman score: 603; 39.1% identity (70.0% similar) in 243 aa overlap (55-292:53-293)
30 40 50 60 70 80
pF1KE1 RRREDARAAPGLLGRLPVLPVAAAAELPPVPGGPRGPGELAKYGLPGLAQLKSR--ESYV
: : . : ..:.: :. ..: ...
CCDS26 GAVVGAAGAGLAALQFFRSQGAEGALTGKQPDGSAEKAVLEQFGFP-LTGTEARCYTNHA
30 40 50 60 70 80
90 100 110 120 130 140
pF1KE1 LCYDPRTRGALWVVEQLRPERLRGDGDRRECDFREDDSVHAYHRATNADYRGSGFDRGHL
: :: : ::.:.. .. ::.::..: :. : .. : : :: :::..:::.
CCDS26 LSYDQAKRVPRWVLEHISKSKIMGDADRKHCKFKPDPNIPPTFSAFNEDYVGSGWSRGHM
90 100 110 120 130 140
150 160 170 180 190 200
pF1KE1 AAAANHRWSQKAMDDTFYLSNVAPQVPHLNQNAWNNLEKYSRSLTRSYQNVYVCTGPLFL
: :.:...:.::: .::::::..:: :.. :: .: : : ::. ...:.: .::: :
CCDS26 APAGNNKFSSKAMAETFYLSNIVPQDFDNNSGYWNRIEMYCRELTERFEDVWVVSGPLTL
150 160 170 180 190 200
210 220 230 240 250
pF1KE1 PRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEAAGGQIE---LRTYVMPNAPVDEAIP
:.:..:::. :.:::::...::::.:..::.. . .. . : : ..:.:: .
CCDS26 PQTRGDGKKIVSYQVIGEDNVAVPSHLYKVILARRSSVSTEPLALGAFVVPNEAIGFQPQ
210 220 230 240 250 260
260 270 280 290
pF1KE1 LERFLVPIESIERASGLLFVPNILARAGSLKAITAGSK
: .: : ....:. :::.: :. : :..... :
CCDS26 LTEFQVSLQDLEKLSGLVFFPH-LDRTSDIRNICSVDTCKLLDFQEFTLYLSTRKIEGAR
270 280 290 300 310 320
CCDS26 SVLRLEKIMENLKNAEIEPDDYFMSRYEKKLEELKAKEQSGTQIRKPS
330 340 350 360
>>CCDS46795.1 EXOG gene_id:9941|Hs108|chr3 (318 aa)
initn: 479 init1: 460 opt: 544 Z-score: 564.9 bits: 112.6 E(32554): 3.8e-25
Smith-Waterman score: 544; 40.9% identity (72.6% similar) in 208 aa overlap (88-292:40-243)
60 70 80 90 100 110
pF1KE1 PRGPGELAKYGLPGLAQLKSRESYVLCYDPRTRGALWVVEQLRPERLRGDGDRRECDFRE
:..:: .. .:. ::.::..: :.
CCDS46 LRGSRRFLSGFVAGAVVGAAGAGLAALQFFRSQGAEGALTGKQPD---GDADRKHCKFKP
10 20 30 40 50 60
120 130 140 150 160 170
pF1KE1 DDSVHAYHRATNADYRGSGFDRGHLAAAANHRWSQKAMDDTFYLSNVAPQVPHLNQNAWN
: .. : : :: :::..:::.: :.:...:.::: .::::::..:: :.. ::
CCDS46 DPNIPPTFSAFNEDYVGSGWSRGHMAPAGNNKFSSKAMAETFYLSNIVPQDFDNNSGYWN
70 80 90 100 110 120
180 190 200 210 220 230
pF1KE1 NLEKYSRSLTRSYQNVYVCTGPLFLPRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEA
.: : : ::. ...:.: .::: ::.:..:::. :.:::::...::::.:..::.. .
CCDS46 RIEMYCRELTERFEDVWVVSGPLTLPQTRGDGKKIVSYQVIGEDNVAVPSHLYKVILARR
130 140 150 160 170 180
240 250 260 270 280 290
pF1KE1 AGGQIE---LRTYVMPNAPVDEAIPLERFLVPIESIERASGLLFVPNILARAGSLKAITA
.. . : : ..:.:: . : .: : ....:. :::.: :. : :..... :
CCDS46 SSVSTEPLALGAFVVPNEAIGFQPQLTEFQVSLQDLEKLSGLVFFPH-LDRTSDIRNICS
190 200 210 220 230 240
pF1KE1 GSK
CCDS46 VDTCKLLDFQEFTLYLSTRKIEGARSVLRLEKIMENLKNAEIEPDDYFMSRYEKKLEELK
250 260 270 280 290 300
297 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 15:41:08 2016 done: Sun Nov 6 15:41:08 2016
Total Scan time: 2.790 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]