FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4402, 417 aa 1>>>pF1KE4402 417 - 417 aa - 417 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6737+/-0.000814; mu= 15.1063+/- 0.049 mean_var=66.4815+/-13.426, 0's: 0 Z-trim(106.7): 8 B-trim: 0 in 0/50 Lambda= 0.157298 statistics sampled from 9128 (9133) to 9128 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.665), E-opt: 0.2 (0.281), width: 16 Scan time: 2.770 The best scores are: opt bits E(32554) CCDS10097.1 CKMT1B gene_id:1159|Hs108|chr15 ( 417) 2823 649.5 1.7e-186 CCDS32217.1 CKMT1A gene_id:548596|Hs108|chr15 ( 417) 2823 649.5 1.7e-186 CCDS4053.1 CKMT2 gene_id:1160|Hs108|chr5 ( 419) 2285 527.4 9.7e-150 CCDS9981.1 CKB gene_id:1152|Hs108|chr14 ( 381) 1735 402.6 3.3e-112 CCDS12659.1 CKM gene_id:1158|Hs108|chr19 ( 381) 1686 391.5 7.4e-109 >>CCDS10097.1 CKMT1B gene_id:1159|Hs108|chr15 (417 aa) initn: 2823 init1: 2823 opt: 2823 Z-score: 3461.7 bits: 649.5 E(32554): 1.7e-186 Smith-Waterman score: 2823; 100.0% identity (100.0% similar) in 417 aa overlap (1-417:1-417) 10 20 30 40 50 60 pF1KE4 MAGPFSRLLSARPGLRLLALAGAGSLAAGFLLRPEPVRAASERRRLYPPSAEYPDLRKHN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MAGPFSRLLSARPGLRLLALAGAGSLAAGFLLRPEPVRAASERRRLYPPSAEYPDLRKHN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 NCMASHLTPAVYARLCDKTTPTGWTLDQCIQTGVDNPGHPFIKTVGMVAGDEETYEVFAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 NCMASHLTPAVYARLCDKTTPTGWTLDQCIQTGVDNPGHPFIKTVGMVAGDEETYEVFAD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LFDPVIQERHNGYDPRTMKHTTDLDASKIRSGYFDERYVLSSRVRTGRSIRGLSLPPACT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LFDPVIQERHNGYDPRTMKHTTDLDASKIRSGYFDERYVLSSRVRTGRSIRGLSLPPACT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 RAERREVERVVVDALSGLKGDLAGRYYRLSEMTEAEQQQLIDDHFLFDKPVSPLLTAAGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 RAERREVERVVVDALSGLKGDLAGRYYRLSEMTEAEQQQLIDDHFLFDKPVSPLLTAAGM 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ARDWPDARGIWHNNEKSFLIWVNEEDHTRVISMEKGGNMKRVFERFCRGLKEVERLIQER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 ARDWPDARGIWHNNEKSFLIWVNEEDHTRVISMEKGGNMKRVFERFCRGLKEVERLIQER 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 GWEFMWNERLGYILTCPSNLGTGLRAGVHIKLPLLSKDSRFPKILENLRLQKRGTGGVDT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 GWEFMWNERLGYILTCPSNLGTGLRAGVHIKLPLLSKDSRFPKILENLRLQKRGTGGVDT 310 320 330 340 350 360 370 380 390 400 410 pF1KE4 AATGGVFDISNLDRLGKSEVELVQLVIDGVNYLIDCERRLERGQDIRIPTPVIHTKH ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AATGGVFDISNLDRLGKSEVELVQLVIDGVNYLIDCERRLERGQDIRIPTPVIHTKH 370 380 390 400 410 >>CCDS32217.1 CKMT1A gene_id:548596|Hs108|chr15 (417 aa) initn: 2823 init1: 2823 opt: 2823 Z-score: 3461.7 bits: 649.5 E(32554): 1.7e-186 Smith-Waterman score: 2823; 100.0% identity (100.0% similar) in 417 aa overlap (1-417:1-417) 10 20 30 40 50 60 pF1KE4 MAGPFSRLLSARPGLRLLALAGAGSLAAGFLLRPEPVRAASERRRLYPPSAEYPDLRKHN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MAGPFSRLLSARPGLRLLALAGAGSLAAGFLLRPEPVRAASERRRLYPPSAEYPDLRKHN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 NCMASHLTPAVYARLCDKTTPTGWTLDQCIQTGVDNPGHPFIKTVGMVAGDEETYEVFAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 NCMASHLTPAVYARLCDKTTPTGWTLDQCIQTGVDNPGHPFIKTVGMVAGDEETYEVFAD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LFDPVIQERHNGYDPRTMKHTTDLDASKIRSGYFDERYVLSSRVRTGRSIRGLSLPPACT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 LFDPVIQERHNGYDPRTMKHTTDLDASKIRSGYFDERYVLSSRVRTGRSIRGLSLPPACT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 RAERREVERVVVDALSGLKGDLAGRYYRLSEMTEAEQQQLIDDHFLFDKPVSPLLTAAGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 RAERREVERVVVDALSGLKGDLAGRYYRLSEMTEAEQQQLIDDHFLFDKPVSPLLTAAGM 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ARDWPDARGIWHNNEKSFLIWVNEEDHTRVISMEKGGNMKRVFERFCRGLKEVERLIQER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ARDWPDARGIWHNNEKSFLIWVNEEDHTRVISMEKGGNMKRVFERFCRGLKEVERLIQER 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 GWEFMWNERLGYILTCPSNLGTGLRAGVHIKLPLLSKDSRFPKILENLRLQKRGTGGVDT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 GWEFMWNERLGYILTCPSNLGTGLRAGVHIKLPLLSKDSRFPKILENLRLQKRGTGGVDT 310 320 330 340 350 360 370 380 390 400 410 pF1KE4 AATGGVFDISNLDRLGKSEVELVQLVIDGVNYLIDCERRLERGQDIRIPTPVIHTKH ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 AATGGVFDISNLDRLGKSEVELVQLVIDGVNYLIDCERRLERGQDIRIPTPVIHTKH 370 380 390 400 410 >>CCDS4053.1 CKMT2 gene_id:1160|Hs108|chr5 (419 aa) initn: 2280 init1: 2213 opt: 2285 Z-score: 2801.8 bits: 527.4 E(32554): 9.7e-150 Smith-Waterman score: 2285; 80.1% identity (92.5% similar) in 413 aa overlap (1-412:1-413) 10 20 30 40 50 pF1KE4 MAGPFSRLLSARPGLRLLALAGAGSLAAGFLLRPEPVRA-ASERRRLYPPSAEYPDLRKH ::. ::.::..: . :.: :.. :..:.:: . : : . :. ::.::::.::::::: CCDS40 MASIFSKLLTGRNASLLFATMGTSVLTTGYLLNRQKVCAEVREQPRLFPPSADYPDLRKH 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 NNCMASHLTPAVYARLCDKTTPTGWTLDQCIQTGVDNPGHPFIKTVGMVAGDEETYEVFA ::::: ::::.::.: .:.::.:.:::::::::::::::::::::::::::::.::::: CCDS40 NNCMAECLTPAIYAKLRNKVTPNGYTLDQCIQTGVDNPGHPFIKTVGMVAGDEESYEVFA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 DLFDPVIQERHNGYDPRTMKHTTDLDASKIRSGYFDERYVLSSRVRTGRSIRGLSLPPAC :::::::. ::::::::.:::::::::::: .: :::.:::::::::::::::::::::: CCDS40 DLFDPVIKLRHNGYDPRVMKHTTDLDASKITQGQFDEHYVLSSRVRTGRSIRGLSLPPAC 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 TRAERREVERVVVDALSGLKGDLAGRYYRLSEMTEAEQQQLIDDHFLFDKPVSPLLTAAG ::::::::: :.. :: :::::::::::.:::::: .::.::::::::::::::::: :: CCDS40 TRAERREVENVAITALEGLKGDLAGRYYKLSEMTEQDQQRLIDDHFLFDKPVSPLLTCAG 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE4 MARDWPDARGIWHNNEKSFLIWVNEEDHTRVISMEKGGNMKRVFERFCRGLKEVERLIQE :::::::::::::: .:.::::.::::::::::::::::::::::::::::::::::::: CCDS40 MARDWPDARGIWHNYDKTFLIWINEEDHTRVISMEKGGNMKRVFERFCRGLKEVERLIQE 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE4 RGWEFMWNERLGYILTCPSNLGTGLRAGVHIKLPLLSKDSRFPKILENLRLQKRGTGGVD ::::::::::::::::::::::::::::::...: :::: :: ::::::::::::::::: CCDS40 RGWEFMWNERLGYILTCPSNLGTGLRAGVHVRIPKLSKDPRFSKILENLRLQKRGTGGVD 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE4 TAATGGVFDISNLDRLGKSEVELVQLVIDGVNYLIDCERRLERGQDIRIPTPVIHTKH :::.. :.::::.::.:.:::::::.::::::::.:::..:::::::..: :. CCDS40 TAAVADVYDISNIDRIGRSEVELVQIVIDGVNYLVDCEKKLERGQDIKVPPPLPQFGKK 370 380 390 400 410 >>CCDS9981.1 CKB gene_id:1152|Hs108|chr14 (381 aa) initn: 1681 init1: 1681 opt: 1735 Z-score: 2127.9 bits: 402.6 E(32554): 3.3e-112 Smith-Waterman score: 1735; 69.8% identity (86.7% similar) in 361 aa overlap (47-406:14-373) 20 30 40 50 60 70 pF1KE4 LLALAGAGSLAAGFLLRPEPVRAASERRRLYPPSAEYPDLRKHNNCMASHLTPAVYARLC .: :.::: ::: ::. ::: .::.: CCDS99 MPFSNSHNALKLRFPAEDEFPDLSAHNNHMAKVLTPELYAELR 10 20 30 40 80 90 100 110 120 130 pF1KE4 DKTTPTGWTLDQCIQTGVDNPGHPFIKTVGMVAGDEETYEVFADLFDPVIQERHNGYDPR :.::.:.:::. :::::::::::.: ::: ::::::.:::: :::::.:..::.:: : CCDS99 AKSTPSGFTLDDVIQTGVDNPGHPYIMTVGCVAGDEESYEVFKDLFDPIIEDRHGGYKP- 50 60 70 80 90 100 140 150 160 170 180 190 pF1KE4 TMKHTTDLDASKIRSGY-FDERYVLSSRVRTGRSIRGLSLPPACTRAERREVERVVVDAL . .: :::. .....: .: :::::::::::::::. ::: :.:.::: .:...:.:: CCDS99 SDEHKTDLNPDNLQGGDDLDPNYVLSSRVRTGRSIRGFCLPPHCSRGERRAIEKLAVEAL 110 120 130 140 150 160 200 210 220 230 240 250 pF1KE4 SGLKGDLAGRYYRLSEMTEAEQQQLIDDHFLFDKPVSPLLTAAGMARDWPDARGIWHNNE :.: :::::::: :. :::::::::::::::::::::::: :.:::::::::::::::.. CCDS99 SSLDGDLAGRYYALKSMTEAEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGIWHNDN 170 180 190 200 210 220 260 270 280 290 300 310 pF1KE4 KSFLIWVNEEDHTRVISMEKGGNMKRVFERFCRGLKEVERLIQERGWEFMWNERLGYILT :.::.::::::: :::::.::::::.:: ::: :: ..: :.. . .::::: .:::::: CCDS99 KTFLVWVNEEDHLRVISMQKGGNMKEVFTRFCTGLTQIETLFKSKDYEFMWNPHLGYILT 230 240 250 260 270 280 320 330 340 350 360 370 pF1KE4 CPSNLGTGLRAGVHIKLPLLSKDSRFPKILENLRLQKRGTGGVDTAATGGVFDISNLDRL :::::::::::::::::: :.: .: ..:. :::::::::::::::.:::::.:: ::: CCDS99 CPSNLGTGLRAGVHIKLPNLGKHEKFSEVLKRLRLQKRGTGGVDTAAVGGVFDVSNADRL 290 300 310 320 330 340 380 390 400 410 pF1KE4 GKSEVELVQLVIDGVNYLIDCERRLERGQDIRIPTPVIHTKH : :::::::.:.:::. ::. :.:::.:: : CCDS99 GFSEVELVQMVVDGVKLLIEMEQRLEQGQAIDDLMPAQK 350 360 370 380 >>CCDS12659.1 CKM gene_id:1158|Hs108|chr19 (381 aa) initn: 1683 init1: 1630 opt: 1686 Z-score: 2067.8 bits: 391.5 E(32554): 7.4e-109 Smith-Waterman score: 1686; 67.6% identity (85.9% similar) in 361 aa overlap (47-406:14-373) 20 30 40 50 60 70 pF1KE4 LLALAGAGSLAAGFLLRPEPVRAASERRRLYPPSAEYPDLRKHNNCMASHLTPAVYARLC : : ::::: :::: ::. :: .: .: CCDS12 MPFGNTHNKFKLNYKPEEEYPDLSKHNNHMAKVLTLELYKKLR 10 20 30 40 80 90 100 110 120 130 pF1KE4 DKTTPTGWTLDQCIQTGVDNPGHPFIKTVGMVAGDEETYEVFADLFDPVIQERHNGYDPR :: ::.:.:.:. ::::::::::::: ::: ::::::.:::: .::::.:..::.:: : CCDS12 DKETPSGFTVDDVIQTGVDNPGHPFIMTVGCVAGDEESYEVFKELFDPIISDRHGGYKP- 50 60 70 80 90 100 140 150 160 170 180 190 pF1KE4 TMKHTTDLDASKIRSGY-FDERYVLSSRVRTGRSIRGLSLPPACTRAERREVERVVVDAL : :: :::. ....: .: :::::::::::::.: .::: :.:.::: ::.. :.:: CCDS12 TDKHKTDLNHENLKGGDDLDPNYVLSSRVRTGRSIKGYTLPPHCSRGERRAVEKLSVEAL 110 120 130 140 150 160 200 210 220 230 240 250 pF1KE4 SGLKGDLAGRYYRLSEMTEAEQQQLIDDHFLFDKPVSPLLTAAGMARDWPDARGIWHNNE ..: :.. :.:: :. ::: :::::::::::::::::::: :.:::::::::::::::.. CCDS12 NSLTGEFKGKYYPLKSMTEKEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGIWHNDN 170 180 190 200 210 220 260 270 280 290 300 310 pF1KE4 KSFLIWVNEEDHTRVISMEKGGNMKRVFERFCRGLKEVERLIQERGWEFMWNERLGYILT ::::.::::::: ::::::::::::.::.::: ::...:..... : ::::..:::.:: CCDS12 KSFLVWVNEEDHLRVISMEKGGNMKEVFRRFCVGLQKIEEIFKKAGHPFMWNQHLGYVLT 230 240 250 260 270 280 320 330 340 350 360 370 pF1KE4 CPSNLGTGLRAGVHIKLPLLSKDSRFPKILENLRLQKRGTGGVDTAATGGVFDISNLDRL ::::::::::.:::.:: ::: .: .:: :::::::::::::::.:.:::.:: ::: CCDS12 CPSNLGTGLRGGVHVKLAHLSKHPKFEEILTRLRLQKRGTGGVDTAAVGSVFDVSNADRL 290 300 310 320 330 340 380 390 400 410 pF1KE4 GKSEVELVQLVIDGVNYLIDCERRLERGQDIRIPTPVIHTKH :.:::: ::::.:::. ... :..::.::.: CCDS12 GSSEVEQVQLVVDGVKLMVEMEKKLEKGQSIDDMIPAQK 350 360 370 380 417 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:22:24 2016 done: Sun Nov 6 01:22:25 2016 Total Scan time: 2.770 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]