FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3077, 238 aa 1>>>pF1KE3077 238 - 238 aa - 238 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0452+/-0.000816; mu= 11.5813+/- 0.050 mean_var=105.6342+/-20.606, 0's: 0 Z-trim(110.5): 59 B-trim: 0 in 0/52 Lambda= 0.124788 statistics sampled from 11623 (11683) to 11623 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.733), E-opt: 0.2 (0.359), width: 16 Scan time: 2.070 The best scores are: opt bits E(32554) CCDS6546.1 UBE2R2 gene_id:54926|Hs108|chr9 ( 238) 1618 301.3 3.6e-82 CCDS12030.1 CDC34 gene_id:997|Hs108|chr19 ( 236) 1319 247.5 5.6e-66 CCDS32532.1 UBE2G1 gene_id:7326|Hs108|chr17 ( 170) 610 119.8 1.2e-27 CCDS13714.1 UBE2G2 gene_id:7327|Hs108|chr21 ( 165) 516 102.8 1.4e-22 CCDS33586.1 UBE2G2 gene_id:7327|Hs108|chr21 ( 137) 478 95.9 1.4e-20 >>CCDS6546.1 UBE2R2 gene_id:54926|Hs108|chr9 (238 aa) initn: 1618 init1: 1618 opt: 1618 Z-score: 1588.8 bits: 301.3 E(32554): 3.6e-82 Smith-Waterman score: 1618; 100.0% identity (100.0% similar) in 238 aa overlap (1-238:1-238) 10 20 30 40 50 60 pF1KE3 MAQQQMTSSQKALMLELKSLQEEPVEGFRITLVDESDLYNWEVAIFGPPNTLYEGGYFKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 MAQQQMTSSQKALMLELKSLQEEPVEGFRITLVDESDLYNWEVAIFGPPNTLYEGGYFKA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 HIKFPIDYPYSPPTFRFLTKMWHPNIYENGDVCISILHPPVDDPQSGELPSERWNPTQNV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 HIKFPIDYPYSPPTFRFLTKMWHPNIYENGDVCISILHPPVDDPQSGELPSERWNPTQNV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 RTILLSVISLLNEPNTFSPANVDASVMFRKWRDSKGKDKEYAEIIRKQVSATKAEAEKDG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 RTILLSVISLLNEPNTFSPANVDASVMFRKWRDSKGKDKEYAEIIRKQVSATKAEAEKDG 130 140 150 160 170 180 190 200 210 220 230 pF1KE3 VKVPTTLAEYCIKTKVPSNDNSSDLLYDDLYDDDIDDEDEEEEDADCYDDDDSGNEES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 VKVPTTLAEYCIKTKVPSNDNSSDLLYDDLYDDDIDDEDEEEEDADCYDDDDSGNEES 190 200 210 220 230 >>CCDS12030.1 CDC34 gene_id:997|Hs108|chr19 (236 aa) initn: 1390 init1: 1273 opt: 1319 Z-score: 1297.9 bits: 247.5 E(32554): 5.6e-66 Smith-Waterman score: 1319; 80.4% identity (93.3% similar) in 240 aa overlap (1-238:1-236) 10 20 30 40 50 60 pF1KE3 MAQQQMTSSQKALMLELKSLQEEPVEGFRITLVDESDLYNWEVAIFGPPNTLYEGGYFKA ::. . ::::::.::::.::::::::::.:::::.::::::::::::::: :::::::: CCDS12 MARPLVPSSQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 HIKFPIDYPYSPPTFRFLTKMWHPNIYENGDVCISILHPPVDDPQSGELPSERWNPTQNV ..:::::::::::.::::::::::::::.::::::::::::::::::::::::::::::: CCDS12 RLKFPIDYPYSPPAFRFLTKMWHPNIYETGDVCISILHPPVDDPQSGELPSERWNPTQNV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 RTILLSVISLLNEPNTFSPANVDASVMFRKWRDSKGKDKEYAEIIRKQVSATKAEAEKDG :::::::::::::::::::::::::::.:::..:::::.::..:::::: .::..::.:: CCDS12 RTILLSVISLLNEPNTFSPANVDASVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDG 130 140 150 160 170 180 190 200 210 220 230 pF1KE3 VKVPTTLAEYCIKTKVPSNDNSSDLLYDDLYDDDIDDEDEEEEDADCY--DDDDSGNEES :::::::::::.:::.:. :..:::.::: :. : : ::: :. :. :.::::.::: CCDS12 VKVPTTLAEYCVKTKAPAPDEGSDLFYDDYYE---DGEVEEEADS-CFGDDEDDSGTEES 190 200 210 220 230 >>CCDS32532.1 UBE2G1 gene_id:7326|Hs108|chr17 (170 aa) initn: 596 init1: 556 opt: 610 Z-score: 610.0 bits: 119.8 E(32554): 1.2e-27 Smith-Waterman score: 610; 53.0% identity (77.1% similar) in 166 aa overlap (6-167:1-163) 10 20 30 40 50 pF1KE3 MAQQQMTSSQKALML--ELKSLQEEPVEGFRITLVDESDLYNWEVAIFGPPNTLYEGGYF :: :.::.: .: :...::::: :.:..::: ::: :.:::.:::::: : CCDS32 MTELQSALLLRRQLAELNKNPVEGFSAGLIDDNDLYRWEVLIIGPPDTLYEGGVF 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 KAHIKFPIDYPYSPPTFRFLTKMWHPNIYENGDVCISILHPPVDDPQSGELPSERWNPTQ :::. :: ::: :: ..:.:..::::. .:::::::::: : .: . : : ::: : . CCDS32 KAHLTFPKDYPLRPPKMKFITEIWHPNVDKNGDVCISILHEPGEDKYGYEKPEERWLPIH 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 NVRTILLSVISLLNEPNTFSPANVDASVMFRKWRDSKGKD--KEYAEIIRKQVSATKAEA .:.::..::::.: .:: :::::::. ..::.... . .. :. .:: CCDS32 TVETIMISVISMLADPNGDSPANVDAA---KEWREDRNGEFKRKVARCVRKSQETAFE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE3 EKDGVKVPTTLAEYCIKTKVPSNDNSSDLLYDDLYDDDIDDEDEEEEDADCYDDDDSGNE >>CCDS13714.1 UBE2G2 gene_id:7327|Hs108|chr21 (165 aa) initn: 514 init1: 495 opt: 516 Z-score: 518.7 bits: 102.8 E(32554): 1.4e-22 Smith-Waterman score: 516; 50.0% identity (70.4% similar) in 162 aa overlap (8-169:4-158) 10 20 30 40 50 60 pF1KE3 MAQQQMTSSQKALMLELKSLQEEPVEGFRITLVDESDLYNWEVAIFGPPNTLYEGGYFKA .. : :: : :.: .: ::. ..: ....::. :.:: .: .: : : : CCDS13 MAGTALKRLMAEYKQLTLNPPEGIVAGPMNEENFFEWEALIMGPEDTCFEFGVFPA 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 HIKFPIDYPYSPPTFRFLTKMWHPNIYENGDVCISILHPPVDDPQSGELPSERWNPTQNV ..::.::: ::: .:: .:.::::: .: ::::::: : :::.. : .:::.:.:.: CCDS13 ILSFPLDYPLSPPKMRFTCEMFHPNIYPDGRVCISILHAPGDDPMGYESSAERWSPVQSV 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 RTILLSVISLLNEPNTFSPANVDASVMFRKWRDSKGKDKEYAEIIRKQVSATKAEAEKDG . :::::.:.: ::: : :::::: : ::: :.: : ::. CCDS13 EKILLSVVSMLAEPNDESGANVDASKM---WRD----DREQFYKIAKQIVQKSLGL 120 130 140 150 160 190 200 210 220 230 pF1KE3 VKVPTTLAEYCIKTKVPSNDNSSDLLYDDLYDDDIDDEDEEEEDADCYDDDDSGNEES >>CCDS33586.1 UBE2G2 gene_id:7327|Hs108|chr21 (137 aa) initn: 466 init1: 447 opt: 478 Z-score: 482.9 bits: 95.9 E(32554): 1.4e-20 Smith-Waterman score: 478; 52.6% identity (73.0% similar) in 137 aa overlap (33-169:1-130) 10 20 30 40 50 60 pF1KE3 QQQMTSSQKALMLELKSLQEEPVEGFRITLVDESDLYNWEVAIFGPPNTLYEGGYFKAHI ..: ....::. :.:: .: .: : : : . CCDS33 MNEENFFEWEALIMGPEDTCFEFGVFPAIL 10 20 30 70 80 90 100 110 120 pF1KE3 KFPIDYPYSPPTFRFLTKMWHPNIYENGDVCISILHPPVDDPQSGELPSERWNPTQNVRT .::.::: ::: .:: .:.::::: .: ::::::: : :::.. : .:::.:.:.:. CCDS33 SFPLDYPLSPPKMRFTCEMFHPNIYPDGRVCISILHAPGDDPMGYESSAERWSPVQSVEK 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE3 ILLSVISLLNEPNTFSPANVDASVMFRKWRDSKGKDKEYAEIIRKQVSATKAEAEKDGVK :::::.:.: ::: : :::::: : ::: :.: : ::. CCDS33 ILLSVVSMLAEPNDESGANVDASKM---WRD----DREQFYKIAKQIVQKSLGL 100 110 120 130 190 200 210 220 230 pF1KE3 VPTTLAEYCIKTKVPSNDNSSDLLYDDLYDDDIDDEDEEEEDADCYDDDDSGNEES 238 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 05:17:28 2016 done: Sun Nov 6 05:17:28 2016 Total Scan time: 2.070 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]