FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1881, 406 aa 1>>>pF1KE1881 406 - 406 aa - 406 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3460+/-0.000759; mu= 17.0249+/- 0.046 mean_var=67.4078+/-13.510, 0's: 0 Z-trim(107.9): 9 B-trim: 26 in 1/50 Lambda= 0.156214 statistics sampled from 9886 (9893) to 9886 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.685), E-opt: 0.2 (0.304), width: 16 Scan time: 2.700 The best scores are: opt bits E(32554) CCDS4046.1 BHMT gene_id:635|Hs108|chr5 ( 406) 2754 629.5 1.7e-180 CCDS4045.1 BHMT2 gene_id:23743|Hs108|chr5 ( 363) 1545 357.0 1.7e-98 CCDS54871.1 BHMT2 gene_id:23743|Hs108|chr5 ( 299) 1292 299.9 2.1e-81 >>CCDS4046.1 BHMT gene_id:635|Hs108|chr5 (406 aa) initn: 2754 init1: 2754 opt: 2754 Z-score: 3353.8 bits: 629.5 E(32554): 1.7e-180 Smith-Waterman score: 2754; 100.0% identity (100.0% similar) in 406 aa overlap (1-406:1-406) 10 20 30 40 50 60 pF1KE1 MPPVGGKKAKKGILERLNAGEIVIGDGGFVFALEKRGYVKAGPWTPEAAVEHPEAVRQLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 MPPVGGKKAKKGILERLNAGEIVIGDGGFVFALEKRGYVKAGPWTPEAAVEHPEAVRQLH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 REFLRAGSNVMQTFTFYASEDKLENRGNYVLEKISGQEVNEAACDIARQVADEGDALVAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 REFLRAGSNVMQTFTFYASEDKLENRGNYVLEKISGQEVNEAACDIARQVADEGDALVAG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GVSQTPSYLSCKSETEVKKVFLQQLEVFMKKNVDFLIAEYFEHVEEAVWAVETLIASGKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 GVSQTPSYLSCKSETEVKKVFLQQLEVFMKKNVDFLIAEYFEHVEEAVWAVETLIASGKP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 VAATMCIGPEGDLHGVPPGECAVRLVKAGASIIGVNCHFDPTISLKTVKLMKEGLEAARL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 VAATMCIGPEGDLHGVPPGECAVRLVKAGASIIGVNCHFDPTISLKTVKLMKEGLEAARL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 KAHLMSQPLAYHTPDCNKQGFIDLPEFPFGLEPRVATRWDIQKYAREAYNLGVRYIGGCC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 KAHLMSQPLAYHTPDCNKQGFIDLPEFPFGLEPRVATRWDIQKYAREAYNLGVRYIGGCC 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 GFEPYHIRAIAEELAPERGFLPPASEKHGSWGSGLDMHTKPWVRARARKEYWENLRIASG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS40 GFEPYHIRAIAEELAPERGFLPPASEKHGSWGSGLDMHTKPWVRARARKEYWENLRIASG 310 320 330 340 350 360 370 380 390 400 pF1KE1 RPYNPSMSKPDGWGVTKGTAELMQQKEATTEQQLKELFEKQKFKSQ :::::::::::::::::::::::::::::::::::::::::::::: CCDS40 RPYNPSMSKPDGWGVTKGTAELMQQKEATTEQQLKELFEKQKFKSQ 370 380 390 400 >>CCDS4045.1 BHMT2 gene_id:23743|Hs108|chr5 (363 aa) initn: 1947 init1: 1545 opt: 1545 Z-score: 1882.0 bits: 357.0 E(32554): 1.7e-98 Smith-Waterman score: 1948; 76.0% identity (88.4% similar) in 371 aa overlap (1-371:1-362) 10 20 30 40 50 60 pF1KE1 MPPVGGKKAKKGILERLNAGEIVIGDGGFVFALEKRGYVKAGPWTPEAAVEHPEAVRQLH : :.: :::::::::..::.:::::.:...:::::::::: :::::..:::.:::::: CCDS40 MAPAGRPGAKKGILERLESGEVVIGDGSFLITLEKRGYVKAGLWTPEAVIEHPDAVRQLH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 REFLRAGSNVMQTFTFYASEDKLENRGNYVLEKISGQEVNEAACDIARQVADEGDALVAG ::::::::::::::: ::::..:.. ..:: ::::.::.:: .::::::: CCDS40 MEFLRAGSNVMQTFTFSASEDNMESKW---------EDVNAAACDLAREVAGKGDALVAG 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 GVSQTPSYLSCKSETEVKKVFLQQLEVFMKKNVDFLIAEYFEHVEEAVWAVETLIASGKP :. :: : :.:...::.: :::::: ::::::::::::::::::::::.: : .: CCDS40 GICQTSIYKYQKDEARIKKLFRQQLEVFAWKNVDFLIAEYFEHVEEAVWAVEVLKESDRP 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE1 VAATMCIGPEGDLHGVPPGECAVRLVKAGASIIGVNCHFDPTISLKTVKLMKEGLEAARL ::.:::::::::.: . :::::::::::::::.::::.: : ::::..::::::: : : CCDS40 VAVTMCIGPEGDMHDITPGECAVRLVKAGASIVGVNCRFGPDTSLKTMELMKEGLEWAGL 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE1 KAHLMSQPLAYHTPDCNKQGFIDLPEFPFGLEPRVATRWDIQKYAREAYNLGVRYIGGCC ::::: :::..:.:::.:.::.::::.::::: ::::::::::::::::::::::::::: CCDS40 KAHLMVQPLGFHAPDCGKEGFVDLPEYPFGLESRVATRWDIQKYAREAYNLGVRYIGGCC 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE1 GFEPYHIRAIAEELAPERGFLPPASEKHGSWGSGLDMHTKPWVRARARKEYWENLRIASG ::::::::::::::::::::::::::::::::::::::::::.:::::.:::::: ::: CCDS40 GFEPYHIRAIAEELAPERGFLPPASEKHGSWGSGLDMHTKPWIRARARREYWENLLPASG 300 310 320 330 340 350 370 380 390 400 pF1KE1 RPYNPSMSKPDGWGVTKGTAELMQQKEATTEQQLKELFEKQKFKSQ ::. ::.:::: CCDS40 RPFCPSLSKPDF 360 >>CCDS54871.1 BHMT2 gene_id:23743|Hs108|chr5 (299 aa) initn: 1289 init1: 1289 opt: 1292 Z-score: 1575.1 bits: 299.9 E(32554): 2.1e-81 Smith-Waterman score: 1565; 65.0% identity (74.4% similar) in 371 aa overlap (1-371:1-298) 10 20 30 40 50 60 pF1KE1 MPPVGGKKAKKGILERLNAGEIVIGDGGFVFALEKRGYVKAGPWTPEAAVEHPEAVRQLH : :.: :::::::::..::.:::::.:...:::::::::: :::::..:::.:::::: CCDS54 MAPAGRPGAKKGILERLESGEVVIGDGSFLITLEKRGYVKAGLWTPEAVIEHPDAVRQLH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 REFLRAGSNVMQTFTFYASEDKLENRGNYVLEKISGQEVNEAACDIARQVADEGDALVAG ::::::::::::::: ::::..:.. CCDS54 MEFLRAGSNVMQTFTFSASEDNMESK---------------------------------- 70 80 130 140 150 160 170 180 pF1KE1 GVSQTPSYLSCKSETEVKKVFLQQLEVFMKKNVDFLIAEYFEHVEEAVWAVETLIASGKP :::::::::::::.: : .: CCDS54 ---------------------------------------YFEHVEEAVWAVEVLKESDRP 90 100 190 200 210 220 230 240 pF1KE1 VAATMCIGPEGDLHGVPPGECAVRLVKAGASIIGVNCHFDPTISLKTVKLMKEGLEAARL ::.:::::::::.: . :::::::::::::::.::::.: : ::::..::::::: : : CCDS54 VAVTMCIGPEGDMHDITPGECAVRLVKAGASIVGVNCRFGPDTSLKTMELMKEGLEWAGL 110 120 130 140 150 160 250 260 270 280 290 300 pF1KE1 KAHLMSQPLAYHTPDCNKQGFIDLPEFPFGLEPRVATRWDIQKYAREAYNLGVRYIGGCC ::::: :::..:.:::.:.::.::::.::::: ::::::::::::::::::::::::::: CCDS54 KAHLMVQPLGFHAPDCGKEGFVDLPEYPFGLESRVATRWDIQKYAREAYNLGVRYIGGCC 170 180 190 200 210 220 310 320 330 340 350 360 pF1KE1 GFEPYHIRAIAEELAPERGFLPPASEKHGSWGSGLDMHTKPWVRARARKEYWENLRIASG ::::::::::::::::::::::::::::::::::::::::::.:::::.:::::: ::: CCDS54 GFEPYHIRAIAEELAPERGFLPPASEKHGSWGSGLDMHTKPWIRARARREYWENLLPASG 230 240 250 260 270 280 370 380 390 400 pF1KE1 RPYNPSMSKPDGWGVTKGTAELMQQKEATTEQQLKELFEKQKFKSQ ::. ::.:::: CCDS54 RPFCPSLSKPDF 290 406 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 17:13:07 2016 done: Sun Nov 6 17:13:08 2016 Total Scan time: 2.700 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]