FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5352, 315 aa
  1>>>pF1KE5352 315 - 315 aa - 315 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 4.9435+/-0.000917; mu= 17.8962+/- 0.055
 mean_var=66.0079+/-13.078, 0's: 0 Z-trim(105.1): 16  B-trim: 17 in 1/48
 Lambda= 0.157862
 statistics sampled from 8260 (8268) to 8260 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.638), E-opt: 0.2 (0.254), width:  16
 Scan time:  2.230

The best scores are:                                      opt  bits E(32554)
CCDS42795.1 TYW5 gene_id:129450|Hs108|chr2        ( 315) 2118 491.2 4.3e-139
CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3       ( 488)  281  73.0 5.3e-13
CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16        ( 416)  256  67.2 2.4e-11
CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16        ( 454)  256  67.3 2.6e-11

>>CCDS42795.1 TYW5 gene_id:129450|Hs108|chr2            (315 aa)
 initn: 2118 init1: 2118 opt: 2118  Z-score: 2610.6  bits: 491.2 E(32554): 4.3e-139
Smith-Waterman score: 2118; 100.0% identity (100.0% similar) in 315 aa overlap (1-315:1-315)

               10        20        30        40        50        60
pF1KE5 MAGQHLPVPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKKEVKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MAGQHLPVPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKKEVKI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 HVAAVAQMDFISKNFVYRTLPFDQLVQRAAEEKHKEFFVSEDEKYYLRSLGEDPRKDVAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 HVAAVAQMDFISKNFVYRTLPFDQLVQRAAEEKHKEFFVSEDEKYYLRSLGEDPRKDVAD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 IRKQFPLLKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDVMDNLLIQVTGKKRVVLF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 IRKQFPLLKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDVMDNLLIQVTGKKRVVLF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 SPRDAQYLYLKGTKSEVLNIDNPDLAKYPLFSKARRYECSLEAGDVLFIPALWFHNVISE
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SPRDAQYLYLKGTKSEVLNIDNPDLAKYPLFSKARRYECSLEAGDVLFIPALWFHNVISE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 EFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRAAQILDRALKTLAELPEEYRDFYARR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 EFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRAAQILDRALKTLAELPEEYRDFYARR 250 260 270 280 290 300 310 pF1KE5 MVLHIQDKAYSKNSE ::::::::::::::: CCDS42 MVLHIQDKAYSKNSE 310 >>CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3 (488 aa) initn: 218 init1: 115 opt: 281 Z-score: 346.9 bits: 73.0 E(32554): 5.3e-13 Smith-Waterman score: 284; 30.1% identity (54.7% similar) in 289 aa overlap (27-294:41-305) 10 20 30 40 50 pF1KE5 MAGQHLPVPRLEGVSREQFMQHLYPQRKPLVLEGIDLGPCTSKWTVDYLSQVGGKK ..: .. .. . . .:.. ::::: : CCDS30 VIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNAKYLSQVLHGK 20 30 40 50 60 70 60 70 80 90 100 pF1KE5 EVKIHV-----AAVAQMDFISKNFVYRTLP-F-----DQLVQRAA--EEKHKEFFVSEDE ...... ..: :.. . :.: :: : :: . . :..:.. : CCDS30 QIRFRMGMKSMSTVPQFE-TTCNYVEATLEEFLTWNCDQSSISGPFRDYDHSKFWAYADY 80 90 100 110 120 110 120 130 140 150 160 pF1KE5 KYYLRSLGEDPRKDVADIRKQFPLLKGDIKFPEFFKEEQFFSSVFRISSPGLQLWTHYDV ::.. :: :: :.. . :. :: .: :.. :.: : . : : CCDS30 KYFV-SLFEDKTDLFQDVKWS------DFGFPGRNGQE----STLWIGSLGAHTPCHLDS 130 140 150 160 170 170 180 190 200 210 pF1KE5 MD-NLLIQVTGKKRVVLFSPRDAQYLY---LKGTKSEV---LNIDNPDLAKYPLFSKARR . ::..:: :.:: :: :.:. .:: . .: : .:. :::: ..: : ::.: CCDS30 YGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKRFPQFRKAQR 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE5 YECSLEAGDVLFIPALWFHNVIS-EEFGVGVNIFWKHLPSECYDKTDTYGNKDPTAASRA . .: :.:::.: :.: : : . :..: : .: . ... : .: CCDS30 HAVTLSPGQVLFVPRHWWHYVESIDPVTVSINS-WIELEEDHLARVEE-------AITR- 240 250 260 270 280 280 290 300 310 pF1KE5 AQILDRALKTLAELPEEYRDFYARRMVLHIQDKAYSKNSE .: :::: :: :.. 
: CCDS30 --MLVCALKT-AENPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTSEVVEIQA 290 300 310 320 330 340 >>CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16 (416 aa) initn: 279 init1: 126 opt: 256 Z-score: 317.1 bits: 67.2 E(32554): 2.4e-11 Smith-Waterman score: 359; 28.1% identity (61.7% similar) in 253 aa overlap (8-250:184-415) 10 20 30 pF1KE5 MAGQHLPVPRLEGVSREQFM-QHLYPQRKPLVLEGI- ::::. : ..: : : : : :..:.:. CCDS10 PARGSLPEQPCTKKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGR-PVILKGVA 160 170 180 190 200 210 40 50 60 70 80 90 pF1KE5 DLGPCTSKWTVDYLSQVGGKKEVKIHVAAVAQMDFISKNFVYRTLPFDQLVQRAAEEKHK : :: .::...:.....: . : ..:.. . .... . ...... CCDS10 DHWPCMQKWSLEYIQEIAGCRTVPVEVGS----RYTDEEWSQTLMTVNEFISK------- 220 230 240 250 260 100 110 120 130 140 150 pF1KE5 EFFVSEDEKYYLRSLGEDPRKDVADIRKQFPLLKGDIKFPEFFK----EEQFFSSVFRIS ..:.: :..: .... : :.: :: ::..:.. . ::. .. .. CCDS10 -YIVNEP-----RDVGYLAQHQLFD---QIPELKQDISIPDYCSLGDGEEEEITINAWFG 270 280 290 300 310 160 170 180 190 200 pF1KE5 SPGLQLWTHYDVMDNLLIQVTGKKRVVLFSPRDAQYLYLKGTK----SEVLNIDNPDLAK : : : ..:.:.:: :.: . :.::... :: . :. . ....:::: : CCDS10 PQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNTSQVDVENPDLEK 320 330 340 350 360 370 210 220 230 240 250 260 pF1KE5 YPLFSKARRYECSLEAGDVLFIPALWFHNVISEEFGVGVNIFWKHLPSECYDKTDTYGNK .: :.:: : : :..::::. ..: : . ... .:...: CCDS10 FPKFAKAPFLSCILSPGEILFIPVKYWHYVRALDLSFSVSFWWS 380 390 400 410 270 280 290 300 310 pF1KE5 DPTAASRAAQILDRALKTLAELPEEYRDFYARRMVLHIQDKAYSKNSE >>CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16 (454 aa) initn: 279 init1: 126 opt: 256 Z-score: 316.5 bits: 67.3 E(32554): 2.6e-11 Smith-Waterman score: 359; 28.1% identity (61.7% similar) in 253 aa overlap (8-250:222-453) 10 20 30 pF1KE5 MAGQHLPVPRLEGVSREQFM-QHLYPQRKPLVLEGI- ::::. : ..: : : : : :..:.:. CCDS45 PARGSLPEQPCTKKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGR-PVILKGVA 200 210 220 230 240 250 40 50 60 70 80 90 pF1KE5 DLGPCTSKWTVDYLSQVGGKKEVKIHVAAVAQMDFISKNFVYRTLPFDQLVQRAAEEKHK : :: .::...:.....: . : ..:.. . .... . ...... 
CCDS45 DHWPCMQKWSLEYIQEIAGCRTVPVEVGS----RYTDEEWSQTLMTVNEFISK------- 260 270 280 290 100 110 120 130 140 150 pF1KE5 EFFVSEDEKYYLRSLGEDPRKDVADIRKQFPLLKGDIKFPEFFK----EEQFFSSVFRIS ..:.: :..: .... : :.: :: ::..:.. . ::. .. .. CCDS45 -YIVNEP-----RDVGYLAQHQLFD---QIPELKQDISIPDYCSLGDGEEEEITINAWFG 300 310 320 330 340 350 160 170 180 190 200 pF1KE5 SPGLQLWTHYDVMDNLLIQVTGKKRVVLFSPRDAQYLYLKGTK----SEVLNIDNPDLAK : : : ..:.:.:: :.: . :.::... :: . :. . ....:::: : CCDS45 PQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTHLLHNTSQVDVENPDLEK 360 370 380 390 400 410 210 220 230 240 250 260 pF1KE5 YPLFSKARRYECSLEAGDVLFIPALWFHNVISEEFGVGVNIFWKHLPSECYDKTDTYGNK .: :.:: : : :..::::. ..: : . ... .:...: CCDS45 FPKFAKAPFLSCILSPGEILFIPVKYWHYVRALDLSFSVSFWWS 420 430 440 450 270 280 290 300 310 pF1KE5 DPTAASRAAQILDRALKTLAELPEEYRDFYARRMVLHIQDKAYSKNSE 315 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 07:33:11 2016 done: Tue Nov 8 07:33:11 2016 Total Scan time: 2.230 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]