FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5220, 226 aa
1>>>pF1KE5220 226 - 226 aa - 226 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3015+/-0.000829; mu= 14.0996+/- 0.050
mean_var=64.8182+/-13.043, 0's: 0 Z-trim(106.5): 61 B-trim: 68 in 1/49
Lambda= 0.159304
statistics sampled from 8984 (9046) to 8984 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.664), E-opt: 0.2 (0.278), width: 16
Scan time: 1.810
The best scores are:                                opt  bits E(32554)
CCDS627.1 UBE2U gene_id:148581|Hs108|chr1   ( 226) 1554 365.6 1.4e-101
CCDS43369.1 UBE2D2 gene_id:7322|Hs108|chr5  ( 147)  319  81.7 2.8e-16
CCDS3659.1 UBE2D3 gene_id:7323|Hs108|chr4   ( 149)  312  80.1 8.5e-16
CCDS3660.1 UBE2D3 gene_id:7323|Hs108|chr4   ( 147)  311  79.9 9.9e-16
CCDS3661.1 UBE2D3 gene_id:7323|Hs108|chr4   ( 148)  308  79.2 1.6e-15
CCDS14580.1 UBE2A gene_id:7319|Hs108|chrX   ( 152)  305  78.5 2.6e-15
CCDS13370.1 UBE2C gene_id:11065|Hs108|chr20 ( 179)  303  78.1 4.2e-15
CCDS4174.1 UBE2B gene_id:7320|Hs108|chr5    ( 152)  298  76.9 8.1e-15
CCDS13374.1 UBE2C gene_id:11065|Hs108|chr20 ( 140)  297  76.6 8.8e-15
CCDS5474.1 UBE2D4 gene_id:51619|Hs108|chr7  ( 147)  294  75.9 1.5e-14
CCDS7252.1 UBE2D1 gene_id:7321|Hs108|chr10  ( 147)  293  75.7 1.7e-14
CCDS75172.1 UBE2D3 gene_id:7323|Hs108|chr4  ( 118)  269  70.2 6.6e-13
CCDS47275.1 UBE2D2 gene_id:7322|Hs108|chr5  ( 118)  269  70.2 6.6e-13
CCDS10433.1 UBE2I gene_id:7329|Hs108|chr16  ( 158)  264  69.1 1.9e-12
CCDS78500.1 UBE2A gene_id:7319|Hs108|chrX   ( 119)  250  65.8 1.4e-11
>>CCDS627.1 UBE2U gene_id:148581|Hs108|chr1 (226 aa)
initn: 1554 init1: 1554 opt: 1554 Z-score: 1937.0 bits: 365.6 E(32554): 1.4e-101
Smith-Waterman score: 1554; 100.0% identity (100.0% similar) in 226 aa overlap (1-226:1-226)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS62 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS62 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS62 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
130 140 150 160 170 180
190 200 210 220
pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK
::::::::::::::::::::::::::::::::::::::::::::::
CCDS62 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK
190 200 210 220
>>CCDS43369.1 UBE2D2 gene_id:7322|Hs108|chr5 (147 aa)
initn: 226 init1: 198 opt: 319 Z-score: 405.8 bits: 81.7 E(32554): 2.8e-16
Smith-Waterman score: 319; 35.2% identity (64.8% similar) in 145 aa overlap (9-153:6-147)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
.:... :: .. .: ::..::..:.. : : ..: .:: :: :::::
CCDS43 MALKRIHKELNDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFP
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .:
CCDS43 TDYPFKPPKVAFTTRIYHPNINSN-GSICLDILRS--QWSPALTISKVLLSICSLLCDPN
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
..:. : ::: :. : : : ... :
CCDS43 PDDPLVPEIARIYKTDREKYNRIAREWTQKYAM
120 130 140
190 200 210 220
pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK
>>CCDS3659.1 UBE2D3 gene_id:7323|Hs108|chr4 (149 aa)
initn: 212 init1: 198 opt: 312 Z-score: 397.1 bits: 80.1 E(32554): 8.5e-16
Smith-Waterman score: 312; 35.2% identity (64.8% similar) in 145 aa overlap (9-153:8-149)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
: ... :: .. .: ::..::..:.. : : ..: .:: :: :::::
CCDS36 MLSNRKCLSKELSDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFP
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .:
CCDS36 TDYPFKPPKVAFTTRIYHPNINSN-GSICLDILRS--QWSPALTISKVLLSICSLLCDPN
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
..:. : ::: :.. : : : ... :
CCDS36 PDDPLVPEIARIYKTDRDKYNRISREWTQKYAM
120 130 140
190 200 210 220
pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK
>>CCDS3660.1 UBE2D3 gene_id:7323|Hs108|chr4 (147 aa)
initn: 226 init1: 198 opt: 311 Z-score: 395.9 bits: 79.9 E(32554): 9.9e-16
Smith-Waterman score: 311; 34.5% identity (65.5% similar) in 145 aa overlap (9-153:6-147)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
..... :: .. .: ::..::..:.. : : ..: .:: :: :::::
CCDS36 MALKRINKELSDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFP
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .:
CCDS36 TDYPFKPPKVAFTTRIYHPNINSN-GSICLDILRS--QWSPALTISKVLLSICSLLCDPN
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
..:. : ::: :.. : : : ... :
CCDS36 PDDPLVPEIARIYKTDRDKYNRISREWTQKYAM
120 130 140
190 200 210 220
pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK
>>CCDS3661.1 UBE2D3 gene_id:7323|Hs108|chr4 (148 aa)
initn: 212 init1: 198 opt: 308 Z-score: 392.1 bits: 79.2 E(32554): 1.6e-15
Smith-Waterman score: 308; 33.8% identity (65.5% similar) in 145 aa overlap (9-153:6-147)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
..... :: .. .: ::..::..:.. : : ..: .:: :: :::::
CCDS36 MALKRINKELSDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFP
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .:
CCDS36 TDYPFKPPKVAFTTRIYHPNINSN-GSICLDILRS--QWSPALTISKVLLSICSLLCDPN
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
..:. : ::: :.. : . : ... :
CCDS36 PDDPLVPEIARIYKTDRDKYNRLAREWTEKYAML
120 130 140
190 200 210 220
pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK
>>CCDS14580.1 UBE2A gene_id:7319|Hs108|chrX (152 aa)
initn: 256 init1: 198 opt: 305 Z-score: 388.2 bits: 78.5 E(32554): 2.6e-15
Smith-Waterman score: 305; 36.4% identity (69.7% similar) in 132 aa overlap (9-140:9-137)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
: ::: :.:. :... : ...: :.. : : ... .. .:.:::.::
CCDS14 MSTPARRRLMRDFKRLQEDPPAGVSGAPSENNIMVWNAVIFGPEGTPFEDGTFKLTIEFT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
:: ::.:.:.. ::::: :. :.:.:.: .:. .: .:::: ..: .:..:
CCDS14 EEYPNKPPTVRFVSKMFHPNVYAD-GSICLDILQN--RWSPTYDVSSILTSIQSLLDEPN
70 80 90 100 110
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
..:.: .::.. ... :
CCDS14 PNSPANSQAAQLYQENKREYEKRVSAIVEQSWRDC
120 130 140 150
>>CCDS13370.1 UBE2C gene_id:11065|Hs108|chr20 (179 aa)
initn: 222 init1: 131 opt: 303 Z-score: 384.7 bits: 78.1 E(32554): 4.2e-15
Smith-Waterman score: 303; 35.8% identity (71.5% similar) in 137 aa overlap (9-145:35-167)
10 20 30
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEV
:.... : .. :::.: : :.....:
CCDS13 NRDPAATSVAAARKGAEPSGGAARGPVGKRLQQELMTLMMSGDKGISAFPESDNLFKWVG
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE5 EIEGLQNSVWQGLVFQLTIHFTSEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEK
:.: ..:.. : ..:...: : : : :.:::.: .::::: . :. :.:.: ::
CCDS13 TIHGAAGTVYEDLRYKLSLEFPSGYPYNAPTVKFLTPCYHPNVDTQ-GNICLDIL--KEK
70 80 90 100 110 120
100 110 120 130 140 150
pF1KE5 WNTNYTLSSILLALQVMLSNPVLENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQ
:.. : . .:::..: .:..: ...:.: .::. : :. . .. :.
CCDS13 WSALYDVRTILLSIQSLLGEPNIDSPLNTHAAE-LWKNPTAFKKYLQETYSKQVTSQEP
130 140 150 160 170
160 170 180 190 200 210
pF1KE5 ELPKDPRKCIRPIKTTSFSDYYQTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQ
>>CCDS4174.1 UBE2B gene_id:7320|Hs108|chr5 (152 aa)
initn: 112 init1: 112 opt: 298 Z-score: 379.5 bits: 76.9 E(32554): 8.1e-15
Smith-Waterman score: 298; 34.8% identity (70.5% similar) in 132 aa overlap (9-140:9-137)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
: ::: :.:. :... : ...:.:.. : : ... .. .:.:.:.:.
CCDS41 MSTPARRRLMRDFKRLQEDPPVGVSGAPSENNIMQWNAVIFGPEGTPFEDGTFKLVIEFS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
:: ::.:.:.. ::::: :. :.:.:.: .:. .: .:::: ..: .:..:
CCDS41 EEYPNKPPTVRFLSKMFHPNVYAD-GSICLDILQN--RWSPTYDVSSILTSIQSLLDEPN
70 80 90 100 110
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
..:.: .::.. ... :
CCDS41 PNSPANSQAAQLYQENKREYEKRVSAIVEQSWNDS
120 130 140 150
>>CCDS13374.1 UBE2C gene_id:11065|Hs108|chr20 (140 aa)
initn: 222 init1: 131 opt: 297 Z-score: 378.8 bits: 76.6 E(32554): 8.8e-15
Smith-Waterman score: 297; 37.9% identity (72.6% similar) in 124 aa overlap (22-145:9-128)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
:::.: : :.....: :.: ..:.. : ..:...:
CCDS13 MTLMMSGDKGISAFPESDNLFKWVGTIHGAAGTVYEDLRYKLSLEFP
10 20 30 40
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
: : : :.:::.: .::::: . :. :.:.: :::.. : . .:::..: .:..:
CCDS13 SGYPYNAPTVKFLTPCYHPNVDTQ-GNICLDILK--EKWSALYDVRTILLSIQSLLGEPN
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
...:.: .::. : :. . .. :.
CCDS13 IDSPLNTHAAE-LWKNPTAFKKYLQETYSKQVTSQEP
110 120 130 140
>>CCDS5474.1 UBE2D4 gene_id:51619|Hs108|chr7 (147 aa)
initn: 229 init1: 198 opt: 294 Z-score: 374.8 bits: 75.9 E(32554): 1.5e-14
Smith-Waterman score: 294; 31.7% identity (64.8% similar) in 145 aa overlap (9-153:6-147)
10 20 30 40 50 60
pF1KE5 MHGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT
..... ::... .: ::..:...:.. : : ..: .:: :: :::::
CCDS54 MALKRIQKELTDLQRDPPAQCSAGPVGDDLFHWQATIMGPNDSPYQGGVFFLTIHFP
10 20 30 40 50
70 80 90 100 110 120
pF1KE5 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV
..: . :: : : : .:::.. . :. :.:.: . .:. :.:..::.. .: .:
CCDS54 TDYPFKPPKVAFTTKIYHPNINSN-GSICLDILRS--QWSPALTVSKVLLSICSLLCDPN
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE5 LENPVNLEAARILVKDESLYRTILRLFNRPLQMKDDSQELPKDPRKCIRPIKTTSFSDYY
..:. : :. :. : . : ... :
CCDS54 PDDPLVPEIAHTYKADREKYNRLAREWTQKYAM
120 130 140
190 200 210 220
pF1KE5 QTWSRIATSKATEYYRTPLLKVPNFIGQYYKWKKMDLQHQKEWNLK
226 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 22:37:48 2016 done: Mon Nov 7 22:37:49 2016
Total Scan time: 1.810 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]