FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1471, 236 aa 1>>>pF1KE1471 236 - 236 aa - 236 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8665+/-0.000679; mu= 11.2583+/- 0.041 mean_var=72.3825+/-14.268, 0's: 0 Z-trim(110.6): 60 B-trim: 634 in 1/54 Lambda= 0.150750 statistics sampled from 11696 (11758) to 11696 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.745), E-opt: 0.2 (0.361), width: 16 Scan time: 2.340 The best scores are: opt bits E(32554) CCDS12030.1 CDC34 gene_id:997|Hs108|chr19 ( 236) 1618 360.5 5.3e-100 CCDS6546.1 UBE2R2 gene_id:54926|Hs108|chr9 ( 238) 1319 295.5 2e-80 CCDS32532.1 UBE2G1 gene_id:7326|Hs108|chr17 ( 170) 579 134.5 4.2e-32 CCDS13714.1 UBE2G2 gene_id:7327|Hs108|chr21 ( 165) 504 118.2 3.3e-27 CCDS33586.1 UBE2G2 gene_id:7327|Hs108|chr21 ( 137) 470 110.8 4.7e-25 CCDS7252.1 UBE2D1 gene_id:7321|Hs108|chr10 ( 147) 263 65.8 1.8e-11 CCDS4174.1 UBE2B gene_id:7320|Hs108|chr5 ( 152) 260 65.1 2.9e-11 CCDS14580.1 UBE2A gene_id:7319|Hs108|chrX ( 152) 257 64.5 4.6e-11 >>CCDS12030.1 CDC34 gene_id:997|Hs108|chr19 (236 aa) initn: 1618 init1: 1618 opt: 1618 Z-score: 1908.8 bits: 360.5 E(32554): 5.3e-100 Smith-Waterman score: 1618; 100.0% identity (100.0% similar) in 236 aa overlap (1-236:1-236) 10 20 30 40 50 60 pF1KE1 MARPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MARPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 RTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDG 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 VKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDGEVEEEADSCFGDDEDDSGTEES :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 VKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDGEVEEEADSCFGDDEDDSGTEES 190 200 210 220 230 >>CCDS6546.1 UBE2R2 gene_id:54926|Hs108|chr9 (238 aa) initn: 1390 init1: 1273 opt: 1319 Z-score: 1557.3 bits: 295.5 E(32554): 2e-80 Smith-Waterman score: 1319; 80.4% identity (93.3% similar) in 240 aa overlap (1-236:1-238) 10 20 30 40 50 60 pF1KE1 MARPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKA ::. . ::::::.::::.::::::::::.:::::.::::::::::::::: :::::::: CCDS65 MAQQQMTSSQKALMLELKSLQEEPVEGFRITLVDESDLYNWEVAIFGPPNTLYEGGYFKA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 RLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNV ..:::::::::::.::::::::::::::.::::::::::::::::::::::::::::::: CCDS65 HIKFPIDYPYSPPTFRFLTKMWHPNIYENGDVCISILHPPVDDPQSGELPSERWNPTQNV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 RTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDG :::::::::::::::::::::::::::.:::..:::::.::..:::::: .::..::.:: CCDS65 RTILLSVISLLNEPNTFSPANVDASVMFRKWRDSKGKDKEYAEIIRKQVSATKAEAEKDG 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 VKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDG---EVEEEADS-CFGDDEDDSGTEES :::::::::::.:::.:. :..:::.::: :.: : ::: :. :. :.::::.::: CCDS65 VKVPTTLAEYCIKTKVPSNDNSSDLLYDDLYDDDIDDEDEEEEDADCY--DDDDSGNEES 190 200 210 220 230 >>CCDS32532.1 UBE2G1 gene_id:7326|Hs108|chr17 (170 aa) initn: 576 init1: 536 opt: 579 Z-score: 689.8 bits: 134.5 E(32554): 4.2e-32 Smith-Waterman score: 579; 52.5% identity (77.2% similar) in 162 aa overlap (10-167:5-163) 10 20 30 40 50 pF1KE1 MARPLVPSSQKALLL--ELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYF :.:::: .: :...::::: . :.:..::: ::: :.:::.: :::: : CCDS32 MTELQSALLLRRQLAELNKNPVEGFSAGLIDDNDLYRWEVLIIGPPDTLYEGGVF 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 KARLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQ ::.: :: ::: :: ..:.:..::::. ..::::::::: : .: . : : ::: : . CCDS32 KAHLTFPKDYPLRPPKMKFITEIWHPNVDKNGDVCISILHEPGEDKYGYEKPEERWLPIH 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 NVRTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKD--REYTDIIRKQVLGTKVDA .:.::..::::.: .:: :::::::. ..:.:... . :. . .:: CCDS32 TVETIMISVISMLADPNGDSPANVDAA---KEWREDRNGEFKRKVARCVRKSQETAFE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 ERDGVKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDGEVEEEADSCFGDDEDDSGTEES >>CCDS13714.1 UBE2G2 gene_id:7327|Hs108|chr21 (165 aa) initn: 484 init1: 465 opt: 504 Z-score: 601.9 bits: 118.2 E(32554): 3.3e-27 Smith-Waterman score: 504; 48.2% identity (69.9% similar) in 166 aa overlap (5-170:1-159) 10 20 30 40 50 60 pF1KE1 MARPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKA .. .. : :. : : : .: ::. . ..: ....::. :.:: .: .: : : : CCDS13 MAGTALKRLMAEYKQLTLNPPEGIVAGPMNEENFFEWEALIMGPEDTCFEFGVFPA 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 RLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNV :.::.::: ::: .:: .:.::::: : ::::::: : :::.. : .:::.:.:.: CCDS13 ILSFPLDYPLSPPKMRFTCEMFHPNIYPDGRVCISILHAPGDDPMGYESSAERWSPVQSV 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 RTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDG . :::::.:.: ::: : :::::: : :.. ::: : ::.. CCDS13 EKILLSVVSMLAEPNDESGANVDASKM---WRD----DREQFYKIAKQIVQKSLGL 120 130 140 150 160 190 200 210 220 230 pF1KE1 VKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDGEVEEEADSCFGDDEDDSGTEES >>CCDS33586.1 UBE2G2 gene_id:7327|Hs108|chr21 (137 aa) initn: 466 init1: 447 opt: 470 Z-score: 563.2 bits: 110.8 E(32554): 4.7e-25 Smith-Waterman score: 470; 52.2% identity (72.5% similar) in 138 aa overlap (33-170:1-131) 10 20 30 40 50 60 pF1KE1 RPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKARL ..: ....::. :.:: .: .: : : : : CCDS33 MNEENFFEWEALIMGPEDTCFEFGVFPAIL 10 20 30 70 80 90 100 110 120 pF1KE1 KFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNVRT .::.::: ::: .:: .:.::::: : ::::::: : :::.. : .:::.:.:.:. CCDS33 SFPLDYPLSPPKMRFTCEMFHPNIYPDGRVCISILHAPGDDPMGYESSAERWSPVQSVEK 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE1 ILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDGVK :::::.:.: ::: : :::::: : :.. ::: : ::.. CCDS33 ILLSVVSMLAEPNDESGANVDASKM---WRD----DREQFYKIAKQIVQKSLGL 100 110 120 130 190 200 210 220 230 pF1KE1 VPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDGEVEEEADSCFGDDEDDSGTEES >>CCDS7252.1 UBE2D1 gene_id:7321|Hs108|chr10 (147 aa) initn: 326 init1: 249 opt: 263 Z-score: 319.4 bits: 65.8 E(32554): 1.8e-11 Smith-Waterman score: 336; 35.9% identity (64.7% similar) in 153 aa overlap (11-162:4-142) 10 20 30 40 50 60 pF1KE1 MARPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKA : . ::. ::..: . : . ::..:...:.:::.. :.:: : CCDS72 MALKRIQKELSDLQRDPPAHCSAGPVGD-DLFHWQATIMGPPDSAYQGGVFFL 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 RLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNV ..:: :::..:: . : ::..:::: .:..:..::. .:.:. .: CCDS72 TVHFPTDYPFKPPKIAFTTKIYHPNINSNGSICLDILRS-------------QWSPALTV 60 70 80 90 130 140 150 160 170 pF1KE1 RTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKD-REYTDIIRKQVLGTKVDAERD .:::. ::: .:: .: : . .:.. ::. .. ::.: CCDS72 SKVLLSICSLLCDPNPDDPLVPDIAQIYKSDKEKYNRHAREWTQKYAM 100 110 120 130 140 180 190 200 210 220 230 pF1KE1 GVKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDGEVEEEADSCFGDDEDDSGTEES >>CCDS4174.1 UBE2B gene_id:7320|Hs108|chr5 (152 aa) initn: 334 init1: 210 opt: 260 Z-score: 315.7 bits: 65.1 E(32554): 2.9e-11 Smith-Waterman score: 362; 39.6% identity (68.8% similar) in 144 aa overlap (9-152:5-134) 10 20 30 40 50 60 pF1KE1 MARPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKA ... :. ..: :::.: : .:.....:...:::: .: .: : :: CCDS41 MSTPARRRLMRDFKRLQEDPPVGVS-GAPSENNIMQWNAVIFGPEGTPFEDGTFKL 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 RLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNV ..: .:: .::. :::.::.:::.: :..:..::. .::.:: .: CCDS41 VIEFSEEYPNKPPTVRFLSKMFHPNVYADGSICLDILQ-------------NRWSPTYDV 60 70 80 90 100 130 140 150 160 170 180 pF1KE1 RTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDG .:: :. :::.::: :::: .:. .:.. : CCDS41 SSILTSIQSLLDEPNPNSPANSQAAQLYQENKREYEKRVSAIVEQSWNDS 110 120 130 140 150 190 200 210 220 230 pF1KE1 VKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDGEVEEEADSCFGDDEDDSGTEES >>CCDS14580.1 UBE2A gene_id:7319|Hs108|chrX (152 aa) initn: 332 init1: 208 opt: 257 Z-score: 312.1 bits: 64.5 E(32554): 4.6e-11 Smith-Waterman score: 359; 38.9% identity (68.1% similar) in 144 aa overlap (9-152:5-134) 10 20 30 40 50 60 pF1KE1 MARPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKA ... :. ..: :::.: : .:.... :...:::: .: .: : :: CCDS14 MSTPARRRLMRDFKRLQEDPPAGVS-GAPSENNIMVWNAVIFGPEGTPFEDGTFKL 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 RLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNV ..: .:: .::. ::..::.:::.: :..:..::. .::.:: .: CCDS14 TIEFTEEYPNKPPTVRFVSKMFHPNVYADGSICLDILQ-------------NRWSPTYDV 60 70 80 90 100 130 140 150 160 170 180 pF1KE1 RTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDG .:: :. :::.::: :::: .:. .:.. : CCDS14 SSILTSIQSLLDEPNPNSPANSQAAQLYQENKREYEKRVSAIVEQSWRDC 110 120 130 140 150 190 200 210 220 230 pF1KE1 VKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYEDGEVEEEADSCFGDDEDDSGTEES 236 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 00:56:44 2016 done: Mon Nov 7 00:56:45 2016 Total Scan time: 2.340 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]