FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1148, 462 aa 1>>>pF1KE1148 462 - 462 aa - 462 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.5684+/-0.000716; mu= 14.8056+/- 0.044 mean_var=142.2813+/-27.897, 0's: 0 Z-trim(114.9): 18 B-trim: 3 in 1/53 Lambda= 0.107523 statistics sampled from 15403 (15415) to 15403 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.789), E-opt: 0.2 (0.474), width: 16 Scan time: 3.690 The best scores are: opt bits E(32554) CCDS12922.1 KMT5C gene_id:84787|Hs108|chr19 ( 462) 3289 521.3 8.4e-148 CCDS76444.1 KMT5B gene_id:51111|Hs108|chr11 ( 370) 1152 189.7 4.4e-48 CCDS44660.1 KMT5B gene_id:51111|Hs108|chr11 ( 393) 983 163.5 3.6e-40 CCDS31623.1 KMT5B gene_id:51111|Hs108|chr11 ( 885) 983 163.8 6.4e-40 >>CCDS12922.1 KMT5C gene_id:84787|Hs108|chr19 (462 aa) initn: 3289 init1: 3289 opt: 3289 Z-score: 2767.0 bits: 521.3 E(32554): 8.4e-148 Smith-Waterman score: 3289; 100.0% identity (100.0% similar) in 462 aa overlap (1-462:1-462) 10 20 30 40 50 60 pF1KE1 MGPDRVTARELCENDDLATSLVLDPYLGFRTHKMNVSPVPPLRRQQHLRSALETFLRQRD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MGPDRVTARELCENDDLATSLVLDPYLGFRTHKMNVSPVPPLRRQQHLRSALETFLRQRD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETNG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 AKIVSTRAWKKNEKLELLVGCIAELREADEGLLRAGENDFSIMYSTRKRSAQLWLGPAAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AKIVSTRAWKKNEKLELLVGCIAELREADEGLLRAGENDFSIMYSTRKRSAQLWLGPAAF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 INHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKGEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 INHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKGEG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 AFRTRPREPALPPRPLDKYQLRETKRRLQQGLDSGSRQGLLGPRACVHPSPLRRDPFCAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 AFRTRPREPALPPRPLDKYQLRETKRRLQQGLDSGSRQGLLGPRACVHPSPLRRDPFCAA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 CQPLRLPACSARPDTSPLWLQWLPQPQPRVRPRKRRRPRPRRAPVLSTHHAARVSLHRWG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 CQPLRLPACSARPDTSPLWLQWLPQPQPRVRPRKRRRPRPRRAPVLSTHHAARVSLHRWG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 GCGPHCRLRGEALVALGQPPHARWAPQQDWHWARRYGLPYVVRVDLRRLAPAPPATPAPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 GCGPHCRLRGEALVALGQPPHARWAPQQDWHWARRYGLPYVVRVDLRRLAPAPPATPAPA 370 380 390 400 410 420 430 440 450 460 pF1KE1 GTPGPILIPKQALAFAPFSPPKRLRLVVSHGSIDLDVGGEEL :::::::::::::::::::::::::::::::::::::::::: CCDS12 GTPGPILIPKQALAFAPFSPPKRLRLVVSHGSIDLDVGGEEL 430 440 450 460 >>CCDS76444.1 KMT5B gene_id:51111|Hs108|chr11 (370 aa) initn: 1147 init1: 598 opt: 1152 Z-score: 976.6 bits: 189.7 E(32554): 4.4e-48 Smith-Waterman score: 1152; 62.0% identity (81.2% similar) in 266 aa overlap (6-270:72-337) 10 20 30 pF1KE1 MGPDRVTARELCENDDLATSLVLDPYLGFRTHKMN ..:.::::::::::::::::::::.::::: CCDS76 AGKNAVERRSNRCNGNSGFEGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMN 50 60 70 80 90 100 40 50 60 70 80 90 pF1KE1 VSPVPPLRRQQHLRSALETFLRQRDLEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYR . : ::..:. ..: : ... :: :.. :: : :. .:: ... :: .: ::. CCDS76 TRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKEHVFI 110 120 130 140 150 160 100 110 120 130 140 150 pF1KE1 YLRAFLPESGFTILPCTRYSMETNGAKIVSTRAWKKNEKLELLVGCIAELREADEG-LLR ::: : .::: ::::.::: : ::::::.:. ::.:.:.:::::::::: : .:. ::: CCDS76 YLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLR 170 180 190 200 210 220 160 170 180 190 200 210 pF1KE1 AGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVT ::::::.:::::: :::::::::::::::.:::::: . ..::::.:::::::.:.. CCDS76 HGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEIS 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE1 CFYGEGFFGEKNEHCECHTCERKGEGAFRTRPREPALPPRPLDKYQLRETKRRLQQGLDS :.::.:::::.:: :::.::::.: :::..: :: : .:: :::: .::.. CCDS76 CYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNRLKKL 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE1 GSRQGLLGPRACVHPSPLRRDPFCAACQPLRLPACSARPDTSPLWLQWLPQPQPRVRPRK CCDS76 GDSSKNSDSQSVSSNTDADTTQEKNNASK 350 360 370 >>CCDS44660.1 KMT5B gene_id:51111|Hs108|chr11 (393 aa) initn: 1164 init1: 598 opt: 983 Z-score: 834.6 bits: 163.5 E(32554): 3.6e-40 Smith-Waterman score: 1124; 58.1% identity (76.1% similar) in 289 aa overlap (6-270:72-360) 10 20 30 pF1KE1 MGPDRVTARELCENDDLATSLVLDPYLGFRTHKMN ..:.::::::::::::::::::::.::::: CCDS44 AGKNAVERRSNRCNGNSGFEGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMN 50 60 70 80 90 100 40 50 60 70 pF1KE1 VSPVP-----------------PLR------RQQHLRSALETFLRQRDLEAAYRALTLGG .: : :.: ::..:. ..: : ... :: :.. :: : CCDS44 TSAFPSRSSRHFSKSDSFSHNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGE 110 120 130 140 150 160 80 90 100 110 120 130 pF1KE1 WTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETNGAKIVSTRAWKKN :. .:: ... :: .: ::. ::: : .::: ::::.::: : ::::::.:. ::.: CCDS44 WARHYFLNKNKMQEKLFKEHVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRN 170 180 190 200 210 220 140 150 160 170 180 190 pF1KE1 EKLELLVGCIAELREADEG-LLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKF .:.:::::::::: : .:. ::: ::::::.:::::: :::::::::::::::.::::: CCDS44 DKIELLVGCIAELSEIEENMLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKF 230 240 250 260 270 280 200 210 220 230 240 250 pF1KE1 VPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKGEGAFRTRPREPAL : . ..::::.:::::::.:..:.::.:::::.:: :::.::::.: :::..: :: CCDS44 VSTGRDTACVKALRDIEPGEEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAP 290 300 310 320 330 340 260 270 280 290 300 310 pF1KE1 PPRPLDKYQLRETKRRLQQGLDSGSRQGLLGPRACVHPSPLRRDPFCAACQPLRLPACSA : .:: :::: .::.. CCDS44 APVINSKYGLRETDKRLNRLKKLGDSSKNSDSQSVSSNTDADTTQEKNNASK 350 360 370 380 390 >>CCDS31623.1 KMT5B gene_id:51111|Hs108|chr11 (885 aa) initn: 835 init1: 598 opt: 983 Z-score: 830.2 bits: 163.8 E(32554): 6.4e-40 Smith-Waterman score: 1124; 58.1% identity (76.1% similar) in 289 aa overlap (6-270:72-360) 10 20 30 pF1KE1 MGPDRVTARELCENDDLATSLVLDPYLGFRTHKMN ..:.::::::::::::::::::::.::::: CCDS31 AGKNAVERRSNRCNGNSGFEGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMN 50 60 70 80 90 100 40 50 60 70 pF1KE1 VSPVP-----------------PLR------RQQHLRSALETFLRQRDLEAAYRALTLGG .: : :.: ::..:. ..: : ... :: :.. :: : CCDS31 TSAFPSRSSRHFSKSDSFSHNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGE 110 120 130 140 150 160 80 90 100 110 120 130 pF1KE1 WTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETNGAKIVSTRAWKKN :. .:: ... :: .: ::. ::: : .::: ::::.::: : ::::::.:. ::.: CCDS31 WARHYFLNKNKMQEKLFKEHVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRN 170 180 190 200 210 220 140 150 160 170 180 190 pF1KE1 EKLELLVGCIAELREADEG-LLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKF .:.:::::::::: : .:. ::: ::::::.:::::: :::::::::::::::.::::: CCDS31 DKIELLVGCIAELSEIEENMLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKF 230 240 250 260 270 280 200 210 220 230 240 250 pF1KE1 VPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKGEGAFRTRPREPAL : . ..::::.:::::::.:..:.::.:::::.:: :::.::::.: :::..: :: CCDS31 VSTGRDTACVKALRDIEPGEEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAP 290 300 310 320 330 340 260 270 280 290 300 310 pF1KE1 PPRPLDKYQLRETKRRLQQGLDSGSRQGLLGPRACVHPSPLRRDPFCAACQPLRLPACSA : .:: :::: .::.. CCDS31 APVINSKYGLRETDKRLNRLKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGV 350 360 370 380 390 400 462 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 13:13:09 2016 done: Sun Nov 6 13:13:10 2016 Total Scan time: 3.690 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]