FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5220, 226 aa 1>>>pF1KE5220 226 - 226 aa - 226 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3015+/-0.000829; mu= 14.0996+/- 0.050 mean_var=64.8182+/-13.043, 0's: 0 Z-trim(106.5): 61 B-trim: 68 in 1/49 Lambda= 0.159304 statistics sampled from 8984 (9046) to 8984 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.664), E-opt: 0.2 (0.278), width: 16 Scan time: 1.810 The best scores are: opt bits E(32554) CCDS627.1 UBE2U gene_id:148581|Hs108|chr1 ( 226) 1554 365.6 1.4e-101 CCDS43369.1 UBE2D2 gene_id:7322|Hs108|chr5 ( 147) 319 81.7 2.8e-16 CCDS3659.1 UBE2D3 gene_id:7323|Hs108|chr4 ( 149) 312 80.1 8.5e-16 CCDS3660.1 UBE2D3 gene_id:7323|Hs108|chr4 ( 147) 311 79.9 9.9e-16 CCDS3661.1 UBE2D3 gene_id:7323|Hs108|chr4 ( 148) 308 79.2 1.6e-15 CCDS14580.1 UBE2A gene_id:7319|Hs108|chrX ( 152) 305 78.5 2.6e-15 CCDS13370.1 UBE2C gene_id:11065|Hs108|chr20 ( 179) 303 78.1 4.2e-15 CCDS4174.1 UBE2B gene_id:7320|Hs108|chr5 ( 152) 298 76.9 8.1e-15 CCDS13374.1 UBE2C gene_id:11065|Hs108|chr20 ( 140) 297 76.6 8.8e-15 CCDS5474.1 UBE2D4 gene_id:51619|Hs108|chr7 ( 147) 294 75.9 1.5e-14 CCDS7252.1 UBE2D1 gene_id:7321|Hs108|chr10 ( 147) 293 75.7 1.7e-14 CCDS75172.1 UBE2D3 gene_id:7323|Hs108|chr4 ( 118) 269 70.2 6.6e-13 CCDS47275.1 UBE2D2 gene_id:7322|Hs108|chr5 ( 118) 269 70.2 6.6e-13 CCDS10433.1 UBE2I gene_id:7329|Hs108|chr16 ( 158) 264 69.1 1.9e-12 CCDS78500.1 UBE2A gene_id:7319|Hs108|chrX ( 119) 250 65.8 1.4e-11 >>CCDS627.1 UBE2U gene_id:148581|Hs108|chr1 (226 aa) initn: 1554 init1: 1554 opt: 1554 Z-score: 1937.0 bits: 365.6 E(32554): 1.4e-101 Smith-Waterman score: 1554; 100.0% identity (100.0% similar) in 226 aa overlap (1-226:1-226) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY 130 140 150 160 170 180 190 200 210 220 pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK :::::::::::::::::::::::::::::::::::::::::::::: CCDS62 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK 190 200 210 220 >>CCDS43369.1 UBE2D2 gene_id:7322|Hs108|chr5 (147 aa) initn: 226 init1: 198 opt: 319 Z-score: 405.8 bits: 81.7 E(32554): 2.8e-16 Smith-Waterman score: 319; 35.2% identity (64.8% similar) in 145 aa overlap (9-153:6-147) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT .:... :: .. .: ::..::..:.. : : ..: .:: :: ::::: CCDS43 MALKRIHKELNDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFP 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV ..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .: CCDS43 TDYPFKPPKVAFTTRIYHPNINSN-GSICLDILRS--QWSPALTISKVLLSICSLLCDPN 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY ..:. : ::: :. : : : ... : CCDS43 PDDPLVPEIARIYKTDREKYNRIAREWTQKYAM 120 130 140 190 200 210 220 pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK >>CCDS3659.1 UBE2D3 gene_id:7323|Hs108|chr4 (149 aa) initn: 212 init1: 198 opt: 312 Z-score: 397.1 bits: 80.1 E(32554): 8.5e-16 Smith-Waterman score: 312; 35.2% identity (64.8% similar) in 145 aa overlap (9-153:8-149) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT : ... :: .. .: ::..::..:.. : : ..: .:: :: ::::: CCDS36 MLSNRKCLSKELSDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFP 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV ..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .: CCDS36 TDYPFKPPKVAFTTRIYHPNINSN-GSICLDILRS--QWSPALTISKVLLSICSLLCDPN 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY ..:. : ::: :.. : : : ... : CCDS36 PDDPLVPEIARIYKTDRDKYNRISREWTQKYAM 120 130 140 190 200 210 220 pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK >>CCDS3660.1 UBE2D3 gene_id:7323|Hs108|chr4 (147 aa) initn: 226 init1: 198 opt: 311 Z-score: 395.9 bits: 79.9 E(32554): 9.9e-16 Smith-Waterman score: 311; 34.5% identity (65.5% similar) in 145 aa overlap (9-153:6-147) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT ..... :: .. .: ::..::..:.. : : ..: .:: :: ::::: CCDS36 MALKRINKELSDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFP 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV ..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .: CCDS36 TDYPFKPPKVAFTTRIYHPNINSN-GSICLDILRS--QWSPALTISKVLLSICSLLCDPN 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY ..:. : ::: :.. : : : ... : CCDS36 PDDPLVPEIARIYKTDRDKYNRISREWTQKYAM 120 130 140 190 200 210 220 pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK >>CCDS3661.1 UBE2D3 gene_id:7323|Hs108|chr4 (148 aa) initn: 212 init1: 198 opt: 308 Z-score: 392.1 bits: 79.2 E(32554): 1.6e-15 Smith-Waterman score: 308; 33.8% identity (65.5% similar) in 145 aa overlap (9-153:6-147) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT ..... :: .. .: ::..::..:.. : : ..: .:: :: ::::: CCDS36 MALKRINKELSDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFP 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV ..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .: CCDS36 TDYPFKPPKVAFTTRIYHPNINSN-GSICLDILRS--QWSPALTISKVLLSICSLLCDPN 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY ..:. : ::: :.. : . : ... : CCDS36 PDDPLVPEIARIYKTDRDKYNRLAREWTEKYAML 120 130 140 190 200 210 220 pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK >>CCDS14580.1 UBE2A gene_id:7319|Hs108|chrX (152 aa) initn: 256 init1: 198 opt: 305 Z-score: 388.2 bits: 78.5 E(32554): 2.6e-15 Smith-Waterman score: 305; 36.4% identity (69.7% similar) in 132 aa overlap (9-140:9-137) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT : ::: :.:. :... : ...: :.. : : ... .. .:.:::.:: CCDS14 MSTPARRRLMRDFKRLQEDPPAGVSGAPSENNIMVWNAVIFGPEGTPFEDGTFKLTIEFT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV :: ::.:.:.. ::::: :. :.:.:.: .:. .: .:::: ..: .:..: CCDS14 EEYPNKPPTVRFVSKMFHPNVYAD-GSICLDILQN--RWSPTYDVSSILTSIQSLLDEPN 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY ..:.: .::.. ... : CCDS14 PNSPANSQAAQLYQENKREYEKRVSAIVEQSWRDC 120 130 140 150 >>CCDS13370.1 UBE2C gene_id:11065|Hs108|chr20 (179 aa) initn: 222 init1: 131 opt: 303 Z-score: 384.7 bits: 78.1 E(32554): 4.2e-15 Smith-Waterman score: 303; 35.8% identity (71.5% similar) in 137 aa overlap (9-145:35-167) 10 20 30 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEV :.... : .. :::.: : :.....: CCDS13 NRDPAATSVAAARKGAEPSGGAARGPVGKRLQQELMTLMMSGDKGISAFPESDNLFKWVG 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE5 EIEGLQNSVWQGLVFQLTIHFTSEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEK :.: ..:.. : ..:...: : : : :.:::.: .::::: . :. :.:.: :: CCDS13 TIHGAAGTVYEDLRYKLSLEFPSGYPYNAPTVKFLTPCYHPNVDTQ-GNICLDIL--KEK 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 WNTNYTLSSILLALQVMLSNPVLENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQ :.. : . .:::..: .:..: ...:.: .::. : :. . .. :. CCDS13 WSALYDVRTILLSIQSLLGEPNIDSPLNTHAAE-LWKNPTAFKKYLQETYSKQVTSQEP 130 140 150 160 170 160 170 180 190 200 210 pF1KE5 ELPKDPRKCIRPIKTTSFSDYYQTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQ >>CCDS4174.1 UBE2B gene_id:7320|Hs108|chr5 (152 aa) initn: 112 init1: 112 opt: 298 Z-score: 379.5 bits: 76.9 E(32554): 8.1e-15 Smith-Waterman score: 298; 34.8% identity (70.5% similar) in 132 aa overlap (9-140:9-137) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT : ::: :.:. :... : ...:.:.. : : ... .. .:.:.:.:. CCDS41 MSTPARRRLMRDFKRLQEDPPVGVSGAPSENNIMQWNAVIFGPEGTPFEDGTFKLVIEFS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV :: ::.:.:.. ::::: :. :.:.:.: .:. .: .:::: ..: .:..: CCDS41 EEYPNKPPTVRFLSKMFHPNVYAD-GSICLDILQN--RWSPTYDVSSILTSIQSLLDEPN 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY ..:.: .::.. ... : CCDS41 PNSPANSQAAQLYQENKREYEKRVSAIVEQSWNDS 120 130 140 150 >>CCDS13374.1 UBE2C gene_id:11065|Hs108|chr20 (140 aa) initn: 222 init1: 131 opt: 297 Z-score: 378.8 bits: 76.6 E(32554): 8.8e-15 Smith-Waterman score: 297; 37.9% identity (72.6% similar) in 124 aa overlap (22-145:9-128) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT :::.: : :.....: :.: ..:.. : ..:...: CCDS13 MTLMMSGDKGISAFPESDNLFKWVGTIHGAAGTVYEDLRYKLSLEFP 10 20 30 40 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV : : : :.:::.: .::::: . :. :.:.: :::.. : . .:::..: .:..: CCDS13 SGYPYNAPTVKFLTPCYHPNVDTQ-GNICLDILK--EKWSALYDVRTILLSIQSLLGEPN 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY ...:.: .::. : :. . .. :. CCDS13 IDSPLNTHAAE-LWKNPTAFKKYLQETYSKQVTSQEP 110 120 130 140 >>CCDS5474.1 UBE2D4 gene_id:51619|Hs108|chr7 (147 aa) initn: 229 init1: 198 opt: 294 Z-score: 374.8 bits: 75.9 E(32554): 1.5e-14 Smith-Waterman score: 294; 31.7% identity (64.8% similar) in 145 aa overlap (9-153:6-147) 10 20 30 40 50 60 pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT ..... ::... .: ::..:...:.. : : ..: .:: :: ::::: CCDS54 MALKRIQKELTDLQRDPPAQCSAGPVGDDLFHWQATIMGPNDSPYQGGVFFLTIHFP 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV ..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .: CCDS54 TDYPFKPPKVAFTTKIYHPNINSN-GSICLDILRS--QWSPALTVSKVLLSICSLLCDPN 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY ..:. : :. :. : . : ... : CCDS54 PDDPLVPEIAHTYKADREKYNRLAREWTQKYAM 120 130 140 190 200 210 220 pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK 226 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 22:37:48 2016 done: Mon Nov 7 22:37:49 2016 Total Scan time: 1.810 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]