FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6620, 327 aa
1>>>pF1KE6620 327 - 327 aa - 327 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5304+/-0.000774; mu= 14.2247+/- 0.046
mean_var=57.3722+/-11.541, 0's: 0 Z-trim(106.8): 12 B-trim: 2 in 1/50
Lambda= 0.169326
statistics sampled from 9204 (9211) to 9204 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.665), E-opt: 0.2 (0.283), width: 16
Scan time: 1.980
The best scores are: opt bits E(32554)
CCDS7467.1 HOGA1 gene_id:112817|Hs108|chr10 ( 327) 2188 542.6 1.6e-154
CCDS44469.1 HOGA1 gene_id:112817|Hs108|chr10 ( 164) 671 171.9 3.1e-43
CCDS1350.1 NPL gene_id:80896|Hs108|chr1 ( 320) 311 84.0 1.7e-16
CCDS55667.1 NPL gene_id:80896|Hs108|chr1 ( 301) 257 70.8 1.5e-12
CCDS72990.1 NPL gene_id:80896|Hs108|chr1 ( 230) 247 68.4 6.4e-12
>>CCDS7467.1 HOGA1 gene_id:112817|Hs108|chr10 (327 aa)
initn: 2188 init1: 2188 opt: 2188 Z-score: 2887.5 bits: 542.6 E(32554): 1.6e-154
Smith-Waterman score: 2188; 100.0% identity (100.0% similar) in 327 aa overlap (1-327:1-327)
10 20 30 40 50 60
pF1KE6 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE6 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG
250 260 270 280 290 300
310 320
pF1KE6 PCRAPLQELSPAEEEALRMDFTSNGWL
:::::::::::::::::::::::::::
CCDS74 PCRAPLQELSPAEEEALRMDFTSNGWL
310 320
>>CCDS44469.1 HOGA1 gene_id:112817|Hs108|chr10 (164 aa)
initn: 671 init1: 671 opt: 671 Z-score: 889.7 bits: 171.9 E(32554): 3.1e-43
Smith-Waterman score: 806; 50.2% identity (50.2% similar) in 327 aa overlap (1-327:1-164)
10 20 30 40 50 60
pF1KE6 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE
:::::::::::
CCDS44 LHKLGTFPFRG-------------------------------------------------
70
130 140 150 160 170 180
pF1KE6 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV
CCDS44 ------------------------------------------------------------
190 200 210 220 230 240
pF1KE6 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC
::::::
CCDS44 ------------------------------------------------------AVGGVC
250 260 270 280 290 300
pF1KE6 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG
80 90 100 110 120 130
310 320
pF1KE6 PCRAPLQELSPAEEEALRMDFTSNGWL
:::::::::::::::::::::::::::
CCDS44 PCRAPLQELSPAEEEALRMDFTSNGWL
140 150 160
>>CCDS1350.1 NPL gene_id:80896|Hs108|chr1 (320 aa)
initn: 117 init1: 67 opt: 311 Z-score: 409.5 bits: 84.0 E(32554): 1.7e-16
Smith-Waterman score: 320; 27.5% identity (55.6% similar) in 306 aa overlap (32-323:5-307)
10 20 30 40 50 60
pF1KE6 LGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENL
: . :. . ::.: ..:.... . . .
CCDS13 MAFPKKKLQGLVAATITPMTENGEINFSVIGQYV
10 20 30
70 80 90 100 110
pF1KE6 HKLGTFP-FRGFVVQGSNGEFPFLTSSERLEVVSR-VRQAMPKNRLLLAGSGCESTQATV
: ... :.:..:: :. ::: .:. . : .. : .. : : . .
CCDS13 DYLVKEQGVKNIFVNGTTGEGLSLSVSERRQVAEEWVTKGKDKLDQVIIHVGALSLKESQ
40 50 60 70 80 90
120 130 140 150 160 170
pF1KE6 EMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDL
:.. :..:::. :..: . . .. ::. .:: .: .: : .:: ::. .
CCDS13 ELAQHAAEIGADGIAVIAPFFLKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKI
100 110 120 130 140 150
180 190 200 210 220 230
pF1KE6 PVDAVVT--LSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAV
.. .. :.. :.. :.: : :. .: : ..:.:.: : : :... ..::.
CCDS13 RAEELLDGILDKIPTFQGLKFSDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGAT
160 170 180 190 200 210
240 250 260 270 280 290
pF1KE6 GGVCALANVLGAQVCQLERLCCTGQWEDAQKLQ---HRLIEPNAAVTRRFGIPGLKKIMD
:.: . : :: .. :. . .. : . : .:.: : .: ::. : ::
CCDS13 GAVGSTYNYLGKKTNQMLEAFEQKDFSLALNYQFCIQRFI--NFVVKLGFGVSQTKAIMT
220 230 240 250 260 270
300 310 320
pF1KE6 WF-GYYGGPCRAPLQ----ELSPAEEEALR-MDFTSNGWL
: :: : ::: :.. . : :. .:: :
CCDS13 LVSGIPMGPPRLPLQKASREFTDSAEAKLKSLDFLSFTDLKDGNLEAGS
280 290 300 310 320
>>CCDS55667.1 NPL gene_id:80896|Hs108|chr1 (301 aa)
initn: 117 init1: 67 opt: 257 Z-score: 338.7 bits: 70.8 E(32554): 1.5e-12
Smith-Waterman score: 266; 29.3% identity (55.6% similar) in 225 aa overlap (111-323:67-288)
90 100 110 120 130 140
pF1KE6 FPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVEMTVSMAQVGADAAMVVTPCY
: : . . :.. :..:::. :..: .
CCDS55 GNCFLPVYKASPLTVTRLWAERLDQVIIHVGALSLKESQELAQHAAEIGADGIAVIAPFF
40 50 60 70 80 90
150 160 170 180 190
pF1KE6 YRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDLPVDAVVT--LSQHPNIVGMKD
. .. ::. .:: .: .: : .:: ::. . .. .. :.. :.. :.:
CCDS55 LKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKIRAEELLDGILDKIPTFQGLKF
100 110 120 130 140 150
200 210 220 230 240 250
pF1KE6 SGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVCALANVLGAQVCQLERLC
: :. .: : ..:.:.: : : :... ..::.:.: . : :: .. :. .
CCDS55 SDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGATGAVGSTYNYLGKKTNQMLEAF
160 170 180 190 200 210
260 270 280 290 300
pF1KE6 CTGQWEDAQKLQ---HRLIEPNAAVTRRFGIPGLKKIMDWF-GYYGGPCRAPLQ----EL
.. : . : .:.: : .: ::. : :: : :: : ::: :.
CCDS55 EQKDFSLALNYQFCIQRFI--NFVVKLGFGVSQTKAIMTLVSGIPMGPPRLPLQKASREF
220 230 240 250 260 270
310 320
pF1KE6 SPAEEEALR-MDFTSNGWL
. . : :. .:: :
CCDS55 TDSAEAKLKSLDFLSFTDLKDGNLEAGS
280 290 300
>>CCDS72990.1 NPL gene_id:80896|Hs108|chr1 (230 aa)
initn: 67 init1: 67 opt: 247 Z-score: 327.5 bits: 68.4 E(32554): 6.4e-12
Smith-Waterman score: 247; 26.3% identity (58.7% similar) in 213 aa overlap (32-239:5-216)
10 20 30 40 50 60
pF1KE6 LGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENL
: . :. . ::.: ..:.... . . .
CCDS72 MAFPKKKLQGLVAATITPMTENGEINFSVIGQYV
10 20 30
70 80 90 100 110
pF1KE6 HKLGTFP-FRGFVVQGSNGEFPFLTSSERLEVVSR-VRQAMPKNRLLLAGSGCESTQATV
: ... :.:..:: :. ::: .:. . : .. : .. : : . .
CCDS72 DYLVKEQGVKNIFVNGTTGEGLSLSVSERRQVAEEWVTKGKDKLDQVIIHVGALSLKESQ
40 50 60 70 80 90
120 130 140 150 160 170
pF1KE6 EMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDL
:.. :..:::. :..: . . .. ::. .:: .: .: : .:: ::. .
CCDS72 ELAQHAAEIGADGIAVIAPFFLKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKI
100 110 120 130 140 150
180 190 200 210 220 230
pF1KE6 PVDAVVT--LSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAV
.. .. :.. :.. :.: : :. .: : ..:.:.: : : :... ..::.
CCDS72 RAEELLDGILDKIPTFQGLKFSDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGAT
160 170 180 190 200 210
240 250 260 270 280 290
pF1KE6 GGVCALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFG
:.:
CCDS72 GAVGSFVSRDLSTLLSN
220 230
327 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 14:50:57 2016 done: Tue Nov 8 14:50:57 2016
Total Scan time: 1.980 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]