FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1152, 488 aa 1>>>pF1KE1152 488 - 488 aa - 488 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1204+/-0.00088; mu= 13.7778+/- 0.053 mean_var=74.7915+/-14.708, 0's: 0 Z-trim(106.8): 18 B-trim: 0 in 0/51 Lambda= 0.148302 statistics sampled from 9205 (9219) to 9205 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.66), E-opt: 0.2 (0.283), width: 16 Scan time: 2.620 The best scores are: opt bits E(32554) CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3 ( 488) 3364 729.2 2.4e-210 CCDS7498.1 HIF1AN gene_id:55662|Hs108|chr10 ( 349) 325 79.0 9.5e-15 CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16 ( 416) 317 77.3 3.6e-14 CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16 ( 454) 317 77.3 3.9e-14 CCDS42795.1 TYW5 gene_id:129450|Hs108|chr2 ( 315) 281 69.5 5.9e-12 >>CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3 (488 aa) initn: 3364 init1: 3364 opt: 3364 Z-score: 3889.9 bits: 729.2 E(32554): 2.4e-210 Smith-Waterman score: 3364; 100.0% identity (100.0% similar) in 488 aa overlap (1-488:1-488) 10 20 30 40 50 60 pF1KE1 MAAGSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MAAGSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 KYLSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 KYLSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 SKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTLWIGSLGAHTPCHLDSYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 SKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTLWIGSLGAHTPCHLDSYG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 CNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQRHA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 CNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQRHA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 VTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTAEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 VTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTAEN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 PQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEELN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 PQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEELN 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 VCNHMEVGQTGSQNLTTGTDKPEAASPFGPDLVPVAQRSEEPPSERGGIFGSDGKDFVDK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 VCNHMEVGQTGSQNLTTGTDKPEAASPFGPDLVPVAQRSEEPPSERGGIFGSDGKDFVDK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 DGEHFGKLHCAKRQQIMSNSENAIEEQIASNTTTTPQTFISTDDLLDCLVNPQVTRIVAQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 DGEHFGKLHCAKRQQIMSNSENAIEEQIASNTTTTPQTFISTDDLLDCLVNPQVTRIVAQ 430 440 450 460 470 480 pF1KE1 LLIQGRSL :::::::: CCDS30 LLIQGRSL >>CCDS7498.1 HIF1AN gene_id:55662|Hs108|chr10 (349 aa) initn: 311 init1: 160 opt: 325 Z-score: 378.3 bits: 79.0 E(32554): 9.5e-15 Smith-Waterman score: 329; 27.0% identity (55.3% similar) in 300 aa overlap (32-309:51-341) 10 20 30 40 50 60 pF1KE1 AAGSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNAK .:.:.: . ..:... . . .:: .:. . CCDS74 GALGPAWDESQLRSYSFPTRPIPRLSQSDPRAEELIEN-EEPVVLTDTNLVYPALKWDLE 30 40 50 60 70 70 80 90 100 110 pF1KE1 YLSQ--------VLHGKQIRF-RMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSIS ::.. : .. .: . :.:.. .:. : : ..::. : .. . CCDS74 YLQENIGNGDFSVYSASTHKFLYYDEKKMANFQNFKPRSNREEMKFHEFVEKLQDIQQRG 80 90 100 110 120 130 120 130 140 150 160 pF1KE1 GPFRDYDHSKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRN-GQEST--LWIGSLG : : : .. . . : ... : . .: . :. :: .. : :: : CCDS74 GEERLYLQQTLNDTVGRKIVMDF------LGFNWNWINKQQGKRGWGQLTSNLLLIGMEG 140 150 160 170 180 190 170 180 190 200 210 220 pF1KE1 AHTPCHLDSYGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKR :: : : :. :..: :: ::::.. ::: . .. . :... ::: .: CCDS74 NVTPAHYDEQQ-NFFAQIKGYKRCILFPPDQFECLYPYPV-HHPCDRQSQVDFDNPDYER 200 210 220 230 240 250 230 240 250 260 270 280 pF1KE1 FPQFRKAQRHAVTLSPGQVLFVPRHWWHYVESI--DPVTVSINSWIE-------LEEDHL ::.:... . ....::.::..: .:::..::. .:...: : . .: CCDS74 FPNFQNVVGYETVVGPGDVLYIPMYWWHHIESLLNGGITITVNFWYKGAPTPKRIEYPLK 260 270 280 290 300 310 290 300 310 320 330 pF1KE1 ARVEEAITRMLVCALKTAE-NPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTS :. . :: : . : : :::.. :: CCDS74 AHQKVAIMRNIEKMLGEALGNPQEVGPLLNTMIKGRYN 320 330 340 >>CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16 (416 aa) initn: 237 init1: 172 opt: 317 Z-score: 367.8 bits: 77.3 E(32554): 3.6e-14 Smith-Waterman score: 335; 28.3% identity (59.4% similar) in 244 aa overlap (34-272:196-414) 10 20 30 40 50 60 pF1KE1 GSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPA-RHWNAKY .: .. .:.:. ... :: ..:. .: CCDS10 KKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGRPVILKGVADHWPCMQKWSLEY 170 180 190 200 210 220 70 80 90 100 110 120 pF1KE1 LSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSK .... . . ..: : : .. : :..::.. . .. : :: . CCDS10 IQEIAGCRTVPVEVG--SRYTDEEWSQTL----MTVNEFIS----KYIVNEP-RDVGY-- 230 240 250 260 270 130 140 150 160 170 pF1KE1 FWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTL----WIGSLGAHTPCHLDS :... ::.. .: ::.. :. : .:.: . :.: :. .: : : CCDS10 ---LAQHQ----LFDQIPELKQDISIPDYCSLG-DGEEEEITINAWFGPQGTISPLHQDP 280 290 300 310 320 180 190 200 210 220 230 pF1KE1 YGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQR :.. ::.::: .:. :... ::: ... :...: ::::..::.: :: CCDS10 QQ-NFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNT--SQVDVENPDLEKFPKFAKAPF 330 340 350 360 370 380 240 250 260 270 280 290 pF1KE1 HAVTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTA . ::::..::.: ..::::...: .. :.. : CCDS10 LSCILSPGEILFIPVKYWHYVRALD-LSFSVSFWWS 390 400 410 300 310 320 330 340 350 pF1KE1 ENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEE >>CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16 (454 aa) initn: 237 init1: 172 opt: 317 Z-score: 367.2 bits: 77.3 E(32554): 3.9e-14 Smith-Waterman score: 335; 28.3% identity (59.4% similar) in 244 aa overlap (34-272:234-452) 10 20 30 40 50 60 pF1KE1 GSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPA-RHWNAKY .: .. .:.:. ... :: ..:. .: CCDS45 KKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGRPVILKGVADHWPCMQKWSLEY 210 220 230 240 250 260 70 80 90 100 110 120 pF1KE1 LSQVLHGKQIRFRMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSK .... . . ..: : : .. : :..::.. . .. : :: . CCDS45 IQEIAGCRTVPVEVG--SRYTDEEWSQTL----MTVNEFIS----KYIVNEP-RDVGY-- 270 280 290 300 310 130 140 150 160 170 pF1KE1 FWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRNGQESTL----WIGSLGAHTPCHLDS :... ::.. .: ::.. :. : .:.: . :.: :. .: : : CCDS45 ---LAQHQ----LFDQIPELKQDISIPDYCSLG-DGEEEEITINAWFGPQGTISPLHQDP 320 330 340 350 360 180 190 200 210 220 230 pF1KE1 YGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQR :.. ::.::: .:. :... ::: ... :...: ::::..::.: :: CCDS45 QQ-NFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNT--SQVDVENPDLEKFPKFAKAPF 370 380 390 400 410 240 250 260 270 280 290 pF1KE1 HAVTLSPGQVLFVPRHWWHYVESIDPVTVSINSWIELEEDHLARVEEAITRMLVCALKTA . ::::..::.: ..::::...: .. :.. : CCDS45 LSCILSPGEILFIPVKYWHYVRALD-LSFSVSFWWS 420 430 440 450 300 310 320 330 340 350 pF1KE1 ENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQALRTDGEHMKKEE >>CCDS42795.1 TYW5 gene_id:129450|Hs108|chr2 (315 aa) initn: 218 init1: 115 opt: 281 Z-score: 328.1 bits: 69.5 E(32554): 5.9e-12 Smith-Waterman score: 284; 30.1% identity (54.7% similar) in 289 aa overlap (41-305:27-294) 20 30 40 50 60 70 pF1KE1 VIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNAKYLSQVLHGK ..: .. .. . . .:.. ::::: : CCDS42 MAGQHLPVPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKK 10 20 30 40 50 80 90 100 110 120 pF1KE1 QIRFRMGMKSMSTVPQFE-TTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSKFWAYADY ...... ..: :.. . :.: :: : :: . . :..:.. : CCDS42 EVKIHV-----AAVAQMDFISKNFVYRTLP-F-----DQLVQRAA--EEKHKEFFVSEDE 60 70 80 90 100 130 140 150 160 170 pF1KE1 KYFV-SLFEDKTDLFQDVKWS------DFGFPGRNGQE----STLWIGSLGAHTPCHLDS ::.. :: :: :.. . :. :: .: :.. :.: : . : : CCDS42 KYYLRSLGEDPRKDVADIRKQFPLLKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDV 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE1 YGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQR . ::..:: :.:: :: :.:. .:: . .: : .:. :::: ..: : ::.: CCDS42 MD-NLLIQVTGKKRVVLFSPRDAQYLY---LKGTKSEV---LNIDNPDLAKYPLFSKARR 170 180 190 200 210 240 250 260 270 280 pF1KE1 HAVTLSPGQVLFVPRHWWHYVESIDPVTVSINS-WIELEEDHLARVEE-------AITR- . .: :.:::.: :.: : : . :..: : .: . ... : .: CCDS42 YECSLEAGDVLFIPALWFHNVIS-EEFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRA 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE1 --MLVCALKT-AENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQA .: :::: :: :.. : CCDS42 AQILDRALKTLAELPEEYRDFYARRMVLHIQDKAYSKNSE 280 290 300 310 488 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:47:30 2016 done: Mon Nov 7 01:47:30 2016 Total Scan time: 2.620 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]