FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1152, 488 aa
1>>>pF1KE1152 488 - 488 aa - 488 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.1204+/-0.00088; mu= 13.7778+/- 0.053
mean_var=74.7915+/-14.708, 0's: 0 Z-trim(106.8): 18 B-trim: 0 in 0/51
Lambda= 0.148302
statistics sampled from 9205 (9219) to 9205 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.66), E-opt: 0.2 (0.283), width: 16
Scan time: 2.620
The best scores are: opt bits E(32554)
CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3 ( 488) 3364 729.2 2.4e-210
CCDS7498.1 HIF1AN gene_id:55662|Hs108|chr10 ( 349) 325 79.0 9.5e-15
CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16 ( 416) 317 77.3 3.6e-14
CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16 ( 454) 317 77.3 3.9e-14
CCDS42795.1 TYW5 gene_id:129450|Hs108|chr2 ( 315) 281 69.5 5.9e-12
>>CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3 (488 aa)
initn: 3364 init1: 3364 opt: 3364 Z-score: 3889.9 bits: 729.2 E(32554): 2.4e-210
Smith-Waterman score: 3364; 100.0% identity (100.0% similar) in 488 aa overlap (1-488:1-488)
10 20 30 40 50 60
pF1KE1 MAAGSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 MAAGSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 KYLSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 KYLSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDH
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 SKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTLWIGSLGAHTPCHLDSYG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 SKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTLWIGSLGAHTPCHLDSYG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 CNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQRHA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 CNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQRHA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 VTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTAEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 VTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTAEN
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE1 PQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEELN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 PQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEELN
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE1 VCNHMEVGQTGSQNLTTGTDKPEAASPFGPDLVPVAQRSEEPPSERGGIFGSDGKDFVDK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 VCNHMEVGQTGSQNLTTGTDKPEAASPFGPDLVPVAQRSEEPPSERGGIFGSDGKDFVDK
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE1 DGEHFGKLHCAKRQQIMSNSENAIEEQIASNTTTTPQTFISTDDLLDCLVNPQVTRIVAQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS30 DGEHFGKLHCAKRQQIMSNSENAIEEQIASNTTTTPQTFISTDDLLDCLVNPQVTRIVAQ
430 440 450 460 470 480
pF1KE1 LLIQGRSL
::::::::
CCDS30 LLIQGRSL
>>CCDS7498.1 HIF1AN gene_id:55662|Hs108|chr10 (349 aa)
initn: 311 init1: 160 opt: 325 Z-score: 378.3 bits: 79.0 E(32554): 9.5e-15
Smith-Waterman score: 329; 27.0% identity (55.3% similar) in 300 aa overlap (32-309:51-341)
10 20 30 40 50 60
pF1KE1 AAGSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNAK
.:.:.: . ..:... . . .:: .:. .
CCDS74 GALGPAWDESQLRSYSFPTRPIPRLSQSDPRAEELIEN-EEPVVLTDTNLVYPALKWDLE
30 40 50 60 70
70 80 90 100 110
pF1KE1 YLSQ--------VLHGKQIRF-RMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSIS
::.. : .. .: . :.:.. .:. : : ..::. : .. .
CCDS74 YLQENIGNGDFSVYSASTHKFLYYDEKKMANFQNFKPRSNREEMKFHEFVEKLQDIQQRG
80 90 100 110 120 130
120 130 140 150 160
pF1KE1 GPFRDYDHSKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRN-GQEST--LWIGSLG
: : : .. . . : ... : . .: . :. :: .. : :: :
CCDS74 GEERLYLQQTLNDTVGRKIVMDF------LGFNWNWINKQQGKRGWGQLTSNLLLIGMEG
140 150 160 170 180 190
170 180 190 200 210 220
pF1KE1 AHTPCHLDSYGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKR
:: : : :. :..: :: ::::.. ::: . .. . :... ::: .:
CCDS74 NVTPAHYDEQQ-NFFAQIKGYKRCILFPPDQFECLYPYPV-HHPCDRQSQVDFDNPDYER
200 210 220 230 240 250
230 240 250 260 270 280
pF1KE1 FPQFRKAQRHAVTLSPGQVLFVPRHWWHYVESI--DPVTVSINSWIE-------LEEDHL
::.:... . ....::.::..: .:::..::. .:...: : . .:
CCDS74 FPNFQNVVGYETVVGPGDVLYIPMYWWHHIESLLNGGITITVNFWYKGAPTPKRIEYPLK
260 270 280 290 300 310
290 300 310 320 330
pF1KE1 ARVEEAITRMLVCALKTAE-NPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTS
:. . :: : . : : :::.. ::
CCDS74 AHQKVAIMRNIEKMLGEALGNPQEVGPLLNTMIKGRYN
320 330 340
>>CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16 (416 aa)
initn: 237 init1: 172 opt: 317 Z-score: 367.8 bits: 77.3 E(32554): 3.6e-14
Smith-Waterman score: 335; 28.3% identity (59.4% similar) in 244 aa overlap (34-272:196-414)
10 20 30 40 50 60
pF1KE1 GSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPA-RHWNAKY
.: .. .:.:. ... :: ..:. .:
CCDS10 KKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGRPVILKGVADHWPCMQKWSLEY
170 180 190 200 210 220
70 80 90 100 110 120
pF1KE1 LSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSK
.... . . ..: : : .. : :..::.. . .. : :: .
CCDS10 IQEIAGCRTVPVEVG--SRYTDEEWSQTL----MTVNEFIS----KYIVNEP-RDVGY--
230 240 250 260 270
130 140 150 160 170
pF1KE1 FWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTL----WIGSLGAHTPCHLDS
:... ::.. .: ::.. :. : .:.: . :.: :. .: : :
CCDS10 ---LAQHQ----LFDQIPELKQDISIPDYCSLG-DGEEEEITINAWFGPQGTISPLHQDP
280 290 300 310 320
180 190 200 210 220 230
pF1KE1 YGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQR
:.. ::.::: .:. :... ::: ... :...: ::::..::.: ::
CCDS10 QQ-NFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNT--SQVDVENPDLEKFPKFAKAPF
330 340 350 360 370 380
240 250 260 270 280 290
pF1KE1 HAVTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTA
. ::::..::.: ..::::...: .. :.. :
CCDS10 LSCILSPGEILFIPVKYWHYVRALD-LSFSVSFWWS
390 400 410
300 310 320 330 340 350
pF1KE1 ENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEE
>>CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16 (454 aa)
initn: 237 init1: 172 opt: 317 Z-score: 367.2 bits: 77.3 E(32554): 3.9e-14
Smith-Waterman score: 335; 28.3% identity (59.4% similar) in 244 aa overlap (34-272:234-452)
10 20 30 40 50 60
pF1KE1 GSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPA-RHWNAKY
.: .. .:.:. ... :: ..:. .:
CCDS45 KKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGRPVILKGVADHWPCMQKWSLEY
210 220 230 240 250 260
70 80 90 100 110 120
pF1KE1 LSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSK
.... . . ..: : : .. : :..::.. . .. : :: .
CCDS45 IQEIAGCRTVPVEVG--SRYTDEEWSQTL----MTVNEFIS----KYIVNEP-RDVGY--
270 280 290 300 310
130 140 150 160 170
pF1KE1 FWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTL----WIGSLGAHTPCHLDS
:... ::.. .: ::.. :. : .:.: . :.: :. .: : :
CCDS45 ---LAQHQ----LFDQIPELKQDISIPDYCSLG-DGEEEEITINAWFGPQGTISPLHQDP
320 330 340 350 360
180 190 200 210 220 230
pF1KE1 YGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQR
:.. ::.::: .:. :... ::: ... :...: ::::..::.: ::
CCDS45 QQ-NFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNT--SQVDVENPDLEKFPKFAKAPF
370 380 390 400 410
240 250 260 270 280 290
pF1KE1 HAVTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTA
. ::::..::.: ..::::...: .. :.. :
CCDS45 LSCILSPGEILFIPVKYWHYVRALD-LSFSVSFWWS
420 430 440 450
300 310 320 330 340 350
pF1KE1 ENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEE
>>CCDS42795.1 TYW5 gene_id:129450|Hs108|chr2 (315 aa)
initn: 218 init1: 115 opt: 281 Z-score: 328.1 bits: 69.5 E(32554): 5.9e-12
Smith-Waterman score: 284; 30.1% identity (54.7% similar) in 289 aa overlap (41-305:27-294)
20 30 40 50 60 70
pF1KE1 VIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNAKYLSQVLHGK
..: .. .. . . .:.. ::::: :
CCDS42 MAGQHLPVPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKK
10 20 30 40 50
80 90 100 110 120
pF1KE1 QIRFRMGMKSMSTVPQFE-TTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSKFWAYADY
...... ..: :.. . :.: :: : :: . . :..:.. :
CCDS42 EVKIHV-----AAVAQMDFISKNFVYRTLP-F-----DQLVQRAA--EEKHKEFFVSEDE
60 70 80 90 100
130 140 150 160 170
pF1KE1 KYFV-SLFEDKTDLFQDVKWS------DFGFPGRNGQE----STLWIGSLGAHTPCHLDS
::.. :: :: :.. . :. :: .: :.. :.: : . : :
CCDS42 KYYLRSLGEDPRKDVADIRKQFPLLKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDV
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE1 YGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQR
. ::..:: :.:: :: :.:. .:: . .: : .:. :::: ..: : ::.:
CCDS42 MD-NLLIQVTGKKRVVLFSPRDAQYLY---LKGTKSEV---LNIDNPDLAKYPLFSKARR
170 180 190 200 210
240 250 260 270 280
pF1KE1 HAVTLSPGQVLFVPRHWWHYVESIDPVTVSINS-WIELEEDHLARVEE-------AITR-
. .: :.:::.: :.: : : . :..: : .: . ... : .:
CCDS42 YECSLEAGDVLFIPALWFHNVIS-EEFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRA
220 230 240 250 260 270
290 300 310 320 330 340
pF1KE1 --MLVCALKT-AENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQA
.: :::: :: :.. :
CCDS42 AQILDRALKTLAELPEEYRDFYARRMVLHIQDKAYSKNSE
280 290 300 310
488 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 01:47:30 2016 done: Mon Nov 7 01:47:30 2016
Total Scan time: 2.620 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]