FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5352, 315 aa
1>>>pF1KE5352 315 - 315 aa - 315 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.9435+/-0.000917; mu= 17.8962+/- 0.055
mean_var=66.0079+/-13.078, 0's: 0 Z-trim(105.1): 16 B-trim: 17 in 1/48
Lambda= 0.157862
statistics sampled from 8260 (8268) to 8260 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.638), E-opt: 0.2 (0.254), width: 16
Scan time: 2.230
The best scores are:                                      opt  bits E(32554)
CCDS42795.1 TYW5 gene_id:129450|Hs108|chr2        ( 315) 2118 491.2 4.3e-139
CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3       ( 488)  281  73.0 5.3e-13
CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16        ( 416)  256  67.2 2.4e-11
CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16        ( 454)  256  67.3 2.6e-11
>>CCDS42795.1 TYW5 gene_id:129450|Hs108|chr2 (315 aa)
initn: 2118 init1: 2118 opt: 2118 Z-score: 2610.6 bits: 491.2 E(32554): 4.3e-139
Smith-Waterman score: 2118; 100.0% identity (100.0% similar) in 315 aa overlap (1-315:1-315)
10 20 30 40 50 60
pF1KE5 MAGQHLPVPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKKEVKI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MAGQHLPVPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKKEVKI
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 HVAAVAQMDFISKNFVYRTLPFDQLVQRAAEEKHKEFFVSEDEKYYLRSLGEDPRKDVAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 HVAAVAQMDFISKNFVYRTLPFDQLVQRAAEEKHKEFFVSEDEKYYLRSLGEDPRKDVAD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 IRKQFPLLKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDVMDNLLIQVTGKKRVVLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 IRKQFPLLKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDVMDNLLIQVTGKKRVVLF
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 SPRDAQYLYLKGTKSEVLNIDNPDLAKYPLFSKARRYECSLEAGDVLFIPALWFHNVISE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 SPRDAQYLYLKGTKSEVLNIDNPDLAKYPLFSKARRYECSLEAGDVLFIPALWFHNVISE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 EFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRAAQILDRALKTLAELPEEYRDFYARR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRAAQILDRALKTLAELPEEYRDFYARR
250 260 270 280 290 300
310
pF1KE5 MVLHIQDKAYSKNSE
:::::::::::::::
CCDS42 MVLHIQDKAYSKNSE
310
>>CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3 (488 aa)
initn: 218 init1: 115 opt: 281 Z-score: 346.9 bits: 73.0 E(32554): 5.3e-13
Smith-Waterman score: 284; 30.1% identity (54.7% similar) in 289 aa overlap (27-294:41-305)
10 20 30 40 50
pF1KE5 MAGQHLPVPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKK
..: .. .. . . .:.. ::::: :
CCDS30 VIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNAKYLSQVLHGK
20 30 40 50 60 70
60 70 80 90 100
pF1KE5 EVKIHV-----AAVAQMDFISKNFVYRTLP-F-----DQLVQRAA--EEKHKEFFVSEDE
...... ..: :.. . :.: :: : :: . . :..:.. :
CCDS30 QIRFRMGMKSMSTVPQFE-TTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSKFWAYADY
80 90 100 110 120
110 120 130 140 150 160
pF1KE5 KYYLRSLGEDPRKDVADIRKQFPLLKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDV
::.. :: :: :.. . :. :: .: :.. :.: : . : :
CCDS30 KYFV-SLFEDKTDLFQDVKWS------DFGFPGRNGQE----STLWIGSLGAHTPCHLDS
130 140 150 160 170
170 180 190 200 210
pF1KE5 MD-NLLIQVTGKKRVVLFSPRDAQYLY---LKGTKSEV---LNIDNPDLAKYPLFSKARR
. ::..:: :.:: :: :.:. .:: . .: : .:. :::: ..: : ::.:
CCDS30 YGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQR
180 190 200 210 220 230
220 230 240 250 260 270
pF1KE5 YECSLEAGDVLFIPALWFHNVIS-EEFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRA
. .: :.:::.: :.: : : . :..: : .: . ... : .:
CCDS30 HAVTLSPGQVLFVPRHWWHYVESIDPVTVSINS-WIELEEDHLARVEE-------AITR-
240 250 260 270 280
280 290 300 310
pF1KE5 AQILDRALKTLAELPEEYRDFYARRMVLHIQDKAYSKNSE
.: :::: :: :.. :
CCDS30 --MLVCALKT-AENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQA
290 300 310 320 330 340
>>CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16 (416 aa)
initn: 279 init1: 126 opt: 256 Z-score: 317.1 bits: 67.2 E(32554): 2.4e-11
Smith-Waterman score: 359; 28.1% identity (61.7% similar) in 253 aa overlap (8-250:184-415)
10 20 30
pF1KE5 MAGQHLPVPRLEGVSREQFM-QHLYPQRKPLVLEGI-
::::. : ..: : : : : :..:.:.
CCDS10 PARGSLPEQPCTKKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGR-PVILKGVA
160 170 180 190 200 210
40 50 60 70 80 90
pF1KE5 DLGPCTSKWTVDYLSQVGGKKEVKIHVAAVAQMDFISKNFVYRTLPFDQLVQRAAEEKHK
: :: .::...:.....: . : ..:.. . .... . ......
CCDS10 DHWPCMQKWSLEYIQEIAGCRTVPVEVGS----RYTDEEWSQTLMTVNEFISK-------
220 230 240 250 260
100 110 120 130 140 150
pF1KE5 EFFVSEDEKYYLRSLGEDPRKDVADIRKQFPLLKGDIKFPEFFK----EEQFFSSVFRIS
..:.: :..: .... : :.: :: ::..:.. . ::. .. ..
CCDS10 -YIVNEP-----RDVGYLAQHQLFD---QIPELKQDISIPDYCSLGDGEEEEITINAWFG
270 280 290 300 310
160 170 180 190 200
pF1KE5 SPGLQLWTHYDVMDNLLIQVTGKKRVVLFSPRDAQYLYLKGTK----SEVLNIDNPDLAK
: : : ..:.:.:: :.: . :.::... :: . :. . ....:::: :
CCDS10 PQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNTSQVDVENPDLEK
320 330 340 350 360 370
210 220 230 240 250 260
pF1KE5 YPLFSKARRYECSLEAGDVLFIPALWFHNVISEEFGVGVNIFWKHLPSECYDKTDTYGNK
.: :.:: : : :..::::. ..: : . ... .:...:
CCDS10 FPKFAKAPFLSCILSPGEILFIPVKYWHYVRALDLSFSVSFWWS
380 390 400 410
270 280 290 300 310
pF1KE5 DPTAASRAAQILDRALKTLAELPEEYRDFYARRMVLHIQDKAYSKNSE
>>CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16 (454 aa)
initn: 279 init1: 126 opt: 256 Z-score: 316.5 bits: 67.3 E(32554): 2.6e-11
Smith-Waterman score: 359; 28.1% identity (61.7% similar) in 253 aa overlap (8-250:222-453)
10 20 30
pF1KE5 MAGQHLPVPRLEGVSREQFM-QHLYPQRKPLVLEGI-
::::. : ..: : : : : :..:.:.
CCDS45 PARGSLPEQPCTKKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGR-PVILKGVA
200 210 220 230 240 250
40 50 60 70 80 90
pF1KE5 DLGPCTSKWTVDYLSQVGGKKEVKIHVAAVAQMDFISKNFVYRTLPFDQLVQRAAEEKHK
: :: .::...:.....: . : ..:.. . .... . ......
CCDS45 DHWPCMQKWSLEYIQEIAGCRTVPVEVGS----RYTDEEWSQTLMTVNEFISK-------
260 270 280 290
100 110 120 130 140 150
pF1KE5 EFFVSEDEKYYLRSLGEDPRKDVADIRKQFPLLKGDIKFPEFFK----EEQFFSSVFRIS
..:.: :..: .... : :.: :: ::..:.. . ::. .. ..
CCDS45 -YIVNEP-----RDVGYLAQHQLFD---QIPELKQDISIPDYCSLGDGEEEEITINAWFG
300 310 320 330 340 350
160 170 180 190 200
pF1KE5 SPGLQLWTHYDVMDNLLIQVTGKKRVVLFSPRDAQYLYLKGTK----SEVLNIDNPDLAK
: : : ..:.:.:: :.: . :.::... :: . :. . ....:::: :
CCDS45 PQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNTSQVDVENPDLEK
360 370 380 390 400 410
210 220 230 240 250 260
pF1KE5 YPLFSKARRYECSLEAGDVLFIPALWFHNVISEEFGVGVNIFWKHLPSECYDKTDTYGNK
.: :.:: : : :..::::. ..: : . ... .:...:
CCDS45 FPKFAKAPFLSCILSPGEILFIPVKYWHYVRALDLSFSVSFWWS
420 430 440 450
270 280 290 300 310
pF1KE5 DPTAASRAAQILDRALKTLAELPEEYRDFYARRMVLHIQDKAYSKNSE
315 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 07:33:11 2016 done: Tue Nov 8 07:33:11 2016
Total Scan time: 2.230 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
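
The "best scores" lines in this report can be read programmatically. The sketch below is not part of FASTA; the names `HIT_RE`, `parse_hit` and `evalue_from_bits` are hypothetical helpers introduced here for illustration. It assumes each hit line has the layout "accession description ( length) opt bits E-value" as printed above, and it assumes the relation E ~ m * n * D * 2^(-bits) (m = query length, n = library sequence length, D = number of library sequences), which agrees with the values listed in this report to within rounding.

```python
import re

# Assumed layout of one hit line in the "best scores" table:
#   <accession> <description> ( <length>) <opt> <bits> <E-value>
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>[\d.eE+-]+)\s*$"
)


def parse_hit(line):
    """Return a dict for one best-scores line, or None if it does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "acc": m.group("acc"),
        "desc": m.group("desc"),
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }


def evalue_from_bits(bits, query_len, subject_len, db_seqs):
    """Approximate E() from a bit score (assumed relation, see lead-in)."""
    return query_len * subject_len * db_seqs * 2.0 ** (-bits)


if __name__ == "__main__":
    hit = parse_hit(
        "CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3 ( 488) 281 73.0 5.3e-13"
    )
    approx = evalue_from_bits(hit["bits"], 315, hit["length"], 32554)
    print(hit["acc"], hit["evalue"], approx)
```

Because the bit score is printed to one decimal place, the recomputed value only approximates the reported E(32554); a comparison within a small relative tolerance is more appropriate than an exact match. Alignment header lines such as ">>CCDS3017.1 ... (488 aa)" do not match the pattern and yield None.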