FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4372, 390 aa 1>>>pF1KE4372 390 - 390 aa - 390 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4151+/-0.000858; mu= 16.2895+/- 0.051 mean_var=66.1853+/-13.386, 0's: 0 Z-trim(105.9): 20 B-trim: 0 in 0/50 Lambda= 0.157650 statistics sampled from 8646 (8660) to 8646 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.644), E-opt: 0.2 (0.266), width: 16 Scan time: 2.700 The best scores are: opt bits E(32554) CCDS14192.1 PDHA1 gene_id:5160|Hs108|chrX ( 390) 2624 605.7 2.3e-173 CCDS55381.1 PDHA1 gene_id:5160|Hs108|chrX ( 397) 2600 600.3 1e-171 CCDS55380.1 PDHA1 gene_id:5160|Hs108|chrX ( 428) 2511 580.0 1.3e-165 CCDS3644.1 PDHA2 gene_id:5161|Hs108|chr4 ( 388) 2263 523.6 1.2e-148 CCDS55382.1 PDHA1 gene_id:5160|Hs108|chrX ( 359) 1285 301.1 1e-81 CCDS12581.1 BCKDHA gene_id:593|Hs108|chr19 ( 445) 424 105.4 1.1e-22 >>CCDS14192.1 PDHA1 gene_id:5160|Hs108|chrX (390 aa) initn: 2624 init1: 2624 opt: 2624 Z-score: 3226.0 bits: 605.7 E(32554): 2.3e-173 Smith-Waterman score: 2624; 100.0% identity (100.0% similar) in 390 aa overlap (1-390:1-390) 10 20 30 40 50 60 pF1KE4 MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTRED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTRED 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 GLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAGINPTDHLITAYRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAGINPTDHLITAYRA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 HGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQVPLGAGIALA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 HGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQVPLGAGIALA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 CKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRYGMGTSVERAAAST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 CKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRYGMGTSVERAAAST 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 DYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHSMSDPGVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 DYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHSMSDPGVS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 YRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFATADPEPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 YRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFATADPEPP 310 320 330 340 350 360 370 380 390 pF1KE4 LEELGYHIYSSDPPFEVRGANQWIKFKSVS :::::::::::::::::::::::::::::: CCDS14 LEELGYHIYSSDPPFEVRGANQWIKFKSVS 370 380 390 >>CCDS55381.1 PDHA1 gene_id:5160|Hs108|chrX (397 aa) initn: 1992 init1: 1992 opt: 2600 Z-score: 3196.4 bits: 600.3 E(32554): 1e-171 Smith-Waterman score: 2600; 98.2% identity (98.2% similar) in 397 aa overlap (1-390:1-397) 10 20 30 40 50 60 pF1KE4 MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTRED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTRED 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 GLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQ-------EACCVGLEAGINPTDH ::::::::::::::::::::::::::::::::::::: :::::::::::::::: CCDS55 GLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQFLLPLTQEACCVGLEAGINPTDH 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 LITAYRAHGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQVPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 LITAYRAHGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQVPL 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 GAGIALACKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRYGMGTSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GAGIALACKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRYGMGTSV 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE4 ERAAASTDYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ERAAASTDYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHS 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE4 MSDPGVSYRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MSDPGVSYRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFA 310 320 330 340 350 360 360 370 380 390 pF1KE4 TADPEPPLEELGYHIYSSDPPFEVRGANQWIKFKSVS ::::::::::::::::::::::::::::::::::::: CCDS55 TADPEPPLEELGYHIYSSDPPFEVRGANQWIKFKSVS 370 380 390 >>CCDS55380.1 PDHA1 gene_id:5160|Hs108|chrX (428 aa) initn: 2511 init1: 2511 opt: 2511 Z-score: 3086.5 bits: 580.0 E(32554): 1.3e-165 Smith-Waterman score: 2511; 99.5% identity (99.7% similar) in 373 aa overlap (18-390:56-428) 10 20 30 40 pF1KE4 MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLE . :::::::::::::::::::::::::::: CCDS55 LPSLVSISRLKQSSHLGLPKCWDYSHSLKTRQASRVLVASRNFANDATFEIKKCDLHRLE 30 40 50 60 70 80 50 60 70 80 90 100 pF1KE4 EGPPVTTVLTREDGLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 EGPPVTTVLTREDGLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAG 90 100 110 120 130 140 110 120 130 140 150 160 pF1KE4 INPTDHLITAYRAHGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 INPTDHLITAYRAHGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIV 150 160 170 180 190 200 170 180 190 200 210 220 pF1KE4 GAQVPLGAGIALACKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GAQVPLGAGIALACKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRY 210 220 230 240 250 260 230 240 250 260 270 280 pF1KE4 GMGTSVERAAASTDYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GMGTSVERAAASTDYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTY 270 280 290 300 310 320 290 300 310 320 330 340 pF1KE4 RYHGHSMSDPGVSYRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 RYHGHSMSDPGVSYRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIE 330 340 350 360 370 380 350 360 370 380 390 pF1KE4 DAAQFATADPEPPLEELGYHIYSSDPPFEVRGANQWIKFKSVS ::::::::::::::::::::::::::::::::::::::::::: CCDS55 DAAQFATADPEPPLEELGYHIYSSDPPFEVRGANQWIKFKSVS 390 400 410 420 >>CCDS3644.1 PDHA2 gene_id:5161|Hs108|chr4 (388 aa) initn: 2256 init1: 2256 opt: 2263 Z-score: 2782.3 bits: 523.6 E(32554): 1.2e-148 Smith-Waterman score: 2263; 85.8% identity (94.1% similar) in 388 aa overlap (4-390:1-388) 10 20 30 40 50 pF1KE4 MRKMLAA-VSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTRE :::: .:::: ..:: : :::::::: .::::::::::::. ::::::::::::: CCDS36 MLAAFISRVLRRVAQKSARRVLVASRNSSNDATFEIKKCDLYLLEEGPPVTTVLTRA 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 DGLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAGINPTDHLITAYR .:::::::: ::::::::::::::::.::::::::::::::::::::::::.::.::.:: CCDS36 EGLKYYRMMLTVRRMELKADQLYKQKFIRGFCHLCDGQEACCVGLEAGINPSDHVITSYR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 AHGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQVPLGAGIAL ::: .::::::: ::::::::.::::::::::::::.::::::::::::: :::::::: CCDS36 AHGVCYTRGLSVRSILAELTGRRGGCAKGKGGSMHMYTKNFYGGNGIVGAQGPLGAGIAL 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 ACKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRYGMGTSVERAAAS ::::.:.::.::::::::::::::: ::.::::::::::.:::::: ::::::.:::::: CCDS36 ACKYKGNDEICLTLYGDGAANQGQIAEAFNMAALWKLPCVFICENNLYGMGTSTERAAAS 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE4 TDYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHSMSDPGV ::::::.:::::.:::::.:::::::.::: :::::::::::::::::::::::::::: CCDS36 PDYYKRGNFIPGLKVDGMDVLCVREATKFAANYCRSGKGPILMELQTYRYHGHSMSDPGV 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE4 SYRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFATADPEP :::::::::::::: :::..:.::::::.::.::::::: .::::::.:::::::.:::: CCDS36 SYRTREEIQEVRSKRDPIIILQDRMVNSKLATVEELKEIGAEVRKEIDDAAQFATTDPEP 300 310 320 330 340 350 360 370 380 390 pF1KE4 PLEELGYHIYSSDPPFEVRGANQWIKFKSVS :::::.:::::: ::::::: :::::::: CCDS36 HLEELGHHIYSSDSSFEVRGANPWIKFKSVS 360 370 380 >>CCDS55382.1 PDHA1 gene_id:5160|Hs108|chrX (359 aa) initn: 1280 init1: 1280 opt: 1285 Z-score: 1580.7 bits: 301.1 E(32554): 1e-81 Smith-Waterman score: 2340; 92.1% identity (92.1% similar) in 390 aa overlap (1-390:1-359) 10 20 30 40 50 60 pF1KE4 MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTRED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MRKMLAAVSRVLSGASQKPASRVLVASRNFANDATFEIKKCDLHRLEEGPPVTTVLTRED 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 GLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAGINPTDHLITAYRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GLKYYRMMQTVRRMELKADQLYKQKIIRGFCHLCDGQEACCVGLEAGINPTDHLITAYRA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 HGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQVPLGAGIALA :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 HGFTFTRGLSVREILAELTGRKGGCAKGKGGSMHMYAKNFYGGNGIVGAQ---------- 130 140 150 160 170 190 200 210 220 230 240 pF1KE4 CKYNGKDEVCLTLYGDGAANQGQIFEAYNMAALWKLPCIFICENNRYGMGTSVERAAAST ::::::::::::::::::::::::::::::::::::::: CCDS55 ---------------------GQIFEAYNMAALWKLPCIFICENNRYGMGTSVERAAAST 180 190 200 250 260 270 280 290 300 pF1KE4 DYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHSMSDPGVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 DYYKRGDFIPGLRVDGMDILCVREATRFAAAYCRSGKGPILMELQTYRYHGHSMSDPGVS 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE4 YRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFATADPEPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 YRTREEIQEVRSKSDPIMLLKDRMVNSNLASVEELKEIDVEVRKEIEDAAQFATADPEPP 270 280 290 300 310 320 370 380 390 pF1KE4 LEELGYHIYSSDPPFEVRGANQWIKFKSVS :::::::::::::::::::::::::::::: CCDS55 LEELGYHIYSSDPPFEVRGANQWIKFKSVS 330 340 350 >>CCDS12581.1 BCKDHA gene_id:593|Hs108|chr19 (445 aa) initn: 215 init1: 163 opt: 424 Z-score: 520.9 bits: 105.4 E(32554): 1.1e-22 Smith-Waterman score: 424; 27.6% identity (56.5% similar) in 322 aa overlap (56-373:97-417) 30 40 50 60 70 80 pF1KE4 ASRNFANDATFEIKKCDLHRLEEGPPVTTVLTREDGLKYYRMMQTVRRMELKADQLYKQK : .: :: :. : . :. . .: CCDS12 FIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQG 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE4 IIRGFCHLCDGQEACCVGLEAGINPTDHLITAYRAHGFTFTRGLSVREILAELTGRKGGC : .: :.:. :: :... :: .. :: : . : .. ..:. : . CCDS12 RI-SFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDL 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE4 AKGKGGSMHMYAK--NFYGGNGIVGAQVPLGAGIALACKYNGKDEVCLTLYGDGAANQGQ .::. .:. : .: .. ...:.: ..: : : : . ..: . .:.:::..:. CCDS12 GKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGD 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE4 IFEAYNMAALWKLPCIFICENNRYGMGTSVERAAASTDYYKRGDF--IPGLRVDGMDILC ..:.:: . : ::.:.:: :...: . . . :: : ..:::: :.. CCDS12 AHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDGNDVFA 250 260 270 280 290 300 270 280 290 300 310 320 pF1KE4 VREATRFAAAYCRSGKGPILMELQTYRYHGHSMSDPGVSYRTREEIQEVRSKSDPIMLLK : .::. : . . :.:.: .::: :: :: . .::. .:.. ... :: :. CCDS12 VYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR 310 320 330 340 350 360 330 340 350 360 370 380 pF1KE4 DRMVNSNLASVEELKEIDVEVRKEIEDAAQFATADPEPPLEELGYHIYSSDPPFEVRGAN ..... . :. : . :... .: . : :.: . : .:. : CCDS12 HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQE 370 380 390 400 410 420 390 pF1KE4 QWIKFKSVS CCDS12 SLARHLQTYGEHYPLDHFDK 430 440 390 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 23:06:21 2016 done: Sat Nov 5 23:06:22 2016 Total Scan time: 2.700 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]