FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4322, 325 aa
  1>>>pF1KE4322 325 - 325 aa - 325 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6451+/-0.000957; mu= 13.7908+/- 0.057
 mean_var=61.2112+/-12.156, 0's: 0 Z-trim(103.8): 24 B-trim: 0 in 0/48
 Lambda= 0.163930
 statistics sampled from 7574 (7586) to 7574 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.61), E-opt: 0.2 (0.233), width: 16
 Scan time: 2.500

The best scores are:                                      opt bits E(32554)
CCDS243.1 HMGCL gene_id:3155|Hs108|chr1           ( 325) 2087 502.2 2.3e-142
CCDS43474.1 HMGCLL1 gene_id:54511|Hs108|chr6      ( 340) 1479 358.4 4.5e-99
CCDS43475.1 HMGCLL1 gene_id:54511|Hs108|chr6      ( 370) 1479 358.4 4.9e-99
CCDS75473.1 HMGCLL1 gene_id:54511|Hs108|chr6      ( 308) 1038 254.1 1e-67
CCDS53279.1 HMGCL gene_id:3155|Hs108|chr1         ( 254)  900 221.5 5.8e-58
CCDS75472.1 HMGCLL1 gene_id:54511|Hs108|chr6      ( 237)  714 177.5 9.5e-45

>>CCDS243.1 HMGCL gene_id:3155|Hs108|chr1 (325 aa)
 initn: 2087 init1: 2087 opt: 2087 Z-score: 2669.5 bits: 502.2 E(32554): 2.3e-142
Smith-Waterman score: 2087; 100.0% identity (100.0% similar) in 325 aa overlap (1-325:1-325)

               10        20        30        40        50        60
pF1KE4 MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGLQNEKNIVSTPVKIKLI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS24 MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGLQNEKNIVSTPVKIKLI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 DMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGINYPVLTPNLKGFEAAVAAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS24 DMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGINYPVLTPNLKGFEAAVAAG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 AKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSANISVRGYVSCALGCPYEGKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS24 
AKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSANISVRGYVSCALGCPYEGKI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 SPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAVMQEVPLAALAVHCHDTYGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 SPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAVMQEVPLAALAVHCHDTYGQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDLVYMLEGLGIHTGVNLQKLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 ALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDLVYMLEGLGIHTGVNLQKLL 250 260 270 280 290 300 310 320 pF1KE4 EAGNFICQALNRKTSSKVAQATCKL ::::::::::::::::::::::::: CCDS24 EAGNFICQALNRKTSSKVAQATCKL 310 320 >>CCDS43474.1 HMGCLL1 gene_id:54511|Hs108|chr6 (340 aa) initn: 1538 init1: 1478 opt: 1479 Z-score: 1892.1 bits: 358.4 E(32554): 4.5e-99 Smith-Waterman score: 1479; 71.0% identity (90.8% similar) in 303 aa overlap (20-322:35-337) 10 20 30 40 pF1KE4 MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGLQNEKN : ::... ::. :::::::::::::::: CCDS43 PSAVKHCLSYQQLLREHLWIGDSVAGALDPAQETSQLSGLPEFVKIVEVGPRDGLQNEKV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 IVSTPVKIKLIDMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGINYPVLTPN :: : .::..:. ::..::::::.::::: .:::::.:::::.:::...::. ::::::: CCDS43 IVPTDIKIEFINRLSQTGLSVIEVTSFVSSRWVPQMADHTEVMKGIHQYPGVRYPVLTPN 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 LKGFEAAVAAGAKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSANISVRGYVS :.::. :::::: :. .:::::: :.:::::::::::. .:. ..:.:. 
:: .::::: CCDS43 LQGFHHAVAAGATEISVFGAASESFSKKNINCSIEESMGKFEEVVKSARHMNIPARGYVS 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 CALGCPYEGKISPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAVMQEVPLAA :::::::::.:.: ::.::.:..:.:::::::::::::::::: :: :: .::.:.: .: CCDS43 CALGCPYEGSITPQKVTEVSKRLYGMGCYEISLGDTIGVGTPGSMKRMLESVMKEIPPGA 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE4 LAVHCHDTYGQALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDLVYMLEGLG ::::::::::::::: : :::::..::::.:.::::::::.:::::.:::::.:::.::: CCDS43 LAVHCHDTYGQALANILTALQMGINVVDSAVSGLGGCPYAKGASGNVATEDLIYMLNGLG 250 260 270 280 290 300 290 300 310 320 pF1KE4 IHTGVNLQKLLEAGNFICQALNRKTSSKVAQATCKL ..::::: :..:::.:::.:.:. :.::::::. CCDS43 LNTGVNLYKVMEAGDFICKAVNKTTNSKVAQASFNA 310 320 330 340 >>CCDS43475.1 HMGCLL1 gene_id:54511|Hs108|chr6 (370 aa) initn: 1538 init1: 1478 opt: 1479 Z-score: 1891.5 bits: 358.4 E(32554): 4.9e-99 Smith-Waterman score: 1479; 69.5% identity (89.7% similar) in 311 aa overlap (12-322:57-367) 10 20 30 40 pF1KE4 MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPR :.: .. ::... ::. ::::::::: CCDS43 SVAGALDPAQTSLLTNLHCFQPDVSGFSVSLAGTVACIHWETSQLSGLPEFVKIVEVGPR 30 40 50 60 70 80 50 60 70 80 90 100 pF1KE4 DGLQNEKNIVSTPVKIKLIDMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGI ::::::: :: : .::..:. ::..::::::.::::: .:::::.:::::.:::...::. CCDS43 DGLQNEKVIVPTDIKIEFINRLSQTGLSVIEVTSFVSSRWVPQMADHTEVMKGIHQYPGV 90 100 110 120 130 140 110 120 130 140 150 160 pF1KE4 NYPVLTPNLKGFEAAVAAGAKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSAN ::::::::.::. :::::: :. .:::::: :.:::::::::::. .:. ..:.:. 
: CCDS43 RYPVLTPNLQGFHHAVAAGATEISVFGAASESFSKKNINCSIEESMGKFEEVVKSARHMN 150 160 170 180 190 200 170 180 190 200 210 220 pF1KE4 ISVRGYVSCALGCPYEGKISPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAV : .::::::::::::::.:.: ::.::.:..:.:::::::::::::::::: :: :: .: CCDS43 IPARGYVSCALGCPYEGSITPQKVTEVSKRLYGMGCYEISLGDTIGVGTPGSMKRMLESV 210 220 230 240 250 260 230 240 250 260 270 280 pF1KE4 MQEVPLAALAVHCHDTYGQALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDL :.:.: .:::::::::::::::: : :::::..::::.:.::::::::.:::::.::::: CCDS43 MKEIPPGALAVHCHDTYGQALANILTALQMGINVVDSAVSGLGGCPYAKGASGNVATEDL 270 280 290 300 310 320 290 300 310 320 pF1KE4 VYMLEGLGIHTGVNLQKLLEAGNFICQALNRKTSSKVAQATCKL .:::.:::..::::: :..:::.:::.:.:. :.::::::. CCDS43 IYMLNGLGLNTGVNLYKVMEAGDFICKAVNKTTNSKVAQASFNA 330 340 350 360 370 >>CCDS75473.1 HMGCLL1 gene_id:54511|Hs108|chr6 (308 aa) initn: 1338 init1: 1038 opt: 1038 Z-score: 1329.1 bits: 254.1 E(32554): 1e-67 Smith-Waterman score: 1238; 63.7% identity (80.9% similar) in 303 aa overlap (20-322:35-305) 10 20 30 40 pF1KE4 MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGLQNEKN : ::... ::. :::::::::::::::: CCDS75 PSAVKHCLSYQQLLREHLWIGDSVAGALDPAQETSQLSGLPEFVKIVEVGPRDGLQNEKV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 IVSTPVKIKLIDMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGINYPVLTPN :: : .::..:. ::..::::::.::::: .:::: CCDS75 IVPTDIKIEFINRLSQTGLSVIEVTSFVSSRWVPQ------------------------- 70 80 90 110 120 130 140 150 160 pF1KE4 LKGFEAAVAAGAKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSANISVRGYVS ::::: :. .:::::: :.:::::::::::. .:. ..:.:. 
:: .::::: CCDS75 -------VAAGATEISVFGAASESFSKKNINCSIEESMGKFEEVVKSARHMNIPARGYVS 100 110 120 130 140 150 170 180 190 200 210 220 pF1KE4 CALGCPYEGKISPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAVMQEVPLAA :::::::::.:.: ::.::.:..:.:::::::::::::::::: :: :: .::.:.: .: CCDS75 CALGCPYEGSITPQKVTEVSKRLYGMGCYEISLGDTIGVGTPGSMKRMLESVMKEIPPGA 160 170 180 190 200 210 230 240 250 260 270 280 pF1KE4 LAVHCHDTYGQALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDLVYMLEGLG ::::::::::::::: : :::::..::::.:.::::::::.:::::.:::::.:::.::: CCDS75 LAVHCHDTYGQALANILTALQMGINVVDSAVSGLGGCPYAKGASGNVATEDLIYMLNGLG 220 230 240 250 260 270 290 300 310 320 pF1KE4 IHTGVNLQKLLEAGNFICQALNRKTSSKVAQATCKL ..::::: :..:::.:::.:.:. :.::::::. CCDS75 LNTGVNLYKVMEAGDFICKAVNKTTNSKVAQASFNA 280 290 300 >>CCDS53279.1 HMGCL gene_id:3155|Hs108|chr1 (254 aa) initn: 900 init1: 900 opt: 900 Z-score: 1154.1 bits: 221.5 E(32554): 5.8e-58 Smith-Waterman score: 1484; 78.2% identity (78.2% similar) in 325 aa overlap (1-325:1-254) 10 20 30 40 50 60 pF1KE4 MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGLQNEKNIVSTPVKIKLI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGLQNEKNIVSTPVKIKLI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 DMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGINYPVLTPNLKGFEAAVAAG ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 DMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGINYPVLTPNLKGFEAAV--- 70 80 90 100 110 130 140 150 160 170 180 pF1KE4 AKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSANISVRGYVSCALGCPYEGKI CCDS53 ------------------------------------------------------------ 190 200 210 220 230 240 pF1KE4 SPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAVMQEVPLAALAVHCHDTYGQ :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 --------TKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAVMQEVPLAALAVHCHDTYGQ 120 130 140 150 160 250 260 270 280 290 300 pF1KE4 ALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDLVYMLEGLGIHTGVNLQKLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 
ALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDLVYMLEGLGIHTGVNLQKLL 170 180 190 200 210 220 310 320 pF1KE4 EAGNFICQALNRKTSSKVAQATCKL ::::::::::::::::::::::::: CCDS53 EAGNFICQALNRKTSSKVAQATCKL 230 240 250 >>CCDS75472.1 HMGCLL1 gene_id:54511|Hs108|chr6 (237 aa) initn: 980 init1: 680 opt: 714 Z-score: 916.9 bits: 177.5 E(32554): 9.5e-45 Smith-Waterman score: 763; 47.9% identity (60.4% similar) in 303 aa overlap (20-322:35-234) 10 20 30 40 pF1KE4 MAAMRKALPRRLVGLASLRAVSTSSMGTLPKRVKIVEVGPRDGLQNEKN : ::... ::. :::::::::::::::: CCDS75 PSAVKHCLSYQQLLREHLWIGDSVAGALDPAQETSQLSGLPEFVKIVEVGPRDGLQNEKV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 IVSTPVKIKLIDMLSEAGLSVIETTSFVSPKWVPQMGDHTEVLKGIQKFPGINYPVLTPN :: : .::..:. ::..::::::.::::: .:::: CCDS75 IVPTDIKIEFINRLSQTGLSVIEVTSFVSSRWVPQ------------------------- 70 80 90 110 120 130 140 150 160 pF1KE4 LKGFEAAVAAGAKEVVIFGAASELFTKKNINCSIEESFQRFDAILKAAQSANISVRGYVS CCDS75 ------------------------------------------------------------ 170 180 190 200 210 220 pF1KE4 CALGCPYEGKISPAKVAEVTKKFYSMGCYEISLGDTIGVGTPGIMKDMLSAVMQEVPLAA :.:..:.:::::::::::::::::: :: :: .::.:.: .: CCDS75 ------------------VSKRLYGMGCYEISLGDTIGVGTPGSMKRMLESVMKEIPPGA 100 110 120 130 140 230 240 250 260 270 280 pF1KE4 LAVHCHDTYGQALANTLMALQMGVSVVDSSVAGLGGCPYAQGASGNLATEDLVYMLEGLG ::::::::::::::: : :::::..::::.:.::::::::.:::::.:::::.:::.::: CCDS75 LAVHCHDTYGQALANILTALQMGINVVDSAVSGLGGCPYAKGASGNVATEDLIYMLNGLG 150 160 170 180 190 200 290 300 310 320 pF1KE4 IHTGVNLQKLLEAGNFICQALNRKTSSKVAQATCKL ..::::: :..:::.:::.:.:. :.::::::. CCDS75 LNTGVNLYKVMEAGDFICKAVNKTTNSKVAQASFNA 210 220 230 325 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 23:11:19 2016 done: Sat Nov 5 23:11:19 2016 Total Scan time: 2.500 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]