FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3183, 422 aa
1>>>pF1KE3183 422 - 422 aa - 422 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.3143+/-0.000847; mu= 12.5010+/- 0.052
mean_var=200.5954+/-40.076, 0's: 0 Z-trim(115.6): 7 B-trim: 167 in 1/52
Lambda= 0.090555
statistics sampled from 16182 (16187) to 16182 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.796), E-opt: 0.2 (0.497), width: 16
Scan time: 3.100
The best scores are: opt bits E(32554)
CCDS1069.1 UBE2Q1 gene_id:55585|Hs108|chr1 ( 422) 2879 388.0 9.2e-108
CCDS10286.1 UBE2Q2 gene_id:92912|Hs108|chr15 ( 375) 1711 235.3 7.3e-62
CCDS45309.1 UBE2Q2 gene_id:92912|Hs108|chr15 ( 359) 1652 227.6 1.5e-59
CCDS66839.1 UBE2Q2 gene_id:92912|Hs108|chr15 ( 340) 1365 190.1 2.8e-48
CCDS47189.1 UBE2QL1 gene_id:134111|Hs108|chr5 ( 161) 605 90.4 1.3e-18
>>CCDS1069.1 UBE2Q1 gene_id:55585|Hs108|chr1 (422 aa)
initn: 2879 init1: 2879 opt: 2879 Z-score: 2048.1 bits: 388.0 E(32554): 9.2e-108
Smith-Waterman score: 2879; 100.0% identity (100.0% similar) in 422 aa overlap (1-422:1-422)
10 20 30 40 50 60
pF1KE3 MQQPQPQGQQQPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MQQPQPQGQQQPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 IASACLDELSCEFLLAGAGGAGAGAAPGPHLPPRGSVPGDPVRIHCNITESYPAVPPIWS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 IASACLDELSCEFLLAGAGGAGAGAAPGPHLPPRGSVPGDPVRIHCNITESYPAVPPIWS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 VESDDPNLAAVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 VESDDPNLAAVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQC
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE3 TQEDVSSEDEDEEMPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 TQEDVSSEDEDEEMPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQD
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE3 YLNGAVSGSVQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YLNGAVSGSVQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALH
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE3 NDLQILKEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NDLQILKEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGW
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE3 SSAYSIESVIMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 SSAYSIESVIMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKE
370 380 390 400 410 420
pF1KE3 DG
::
CCDS10 DG
>>CCDS10286.1 UBE2Q2 gene_id:92912|Hs108|chr15 (375 aa)
initn: 1543 init1: 1355 opt: 1711 Z-score: 1224.1 bits: 235.3 E(32554): 7.3e-62
Smith-Waterman score: 1831; 74.1% identity (86.0% similar) in 386 aa overlap (41-422:6-375)
20 30 40 50 60 70
pF1KE3 QPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFRIASACLDELS
:. :::.: ::: ..::::::.: ::::
CCDS10 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELH
10 20 30
80 90 100 110 120
pF1KE3 CEFLLAGAGGAGAGAAPGPH-LPPRGSVPGDPVRIHCNITESYPAVPPIWSVESDDPNLA
:.::. : .:: ::: :. .:::::::::. ::: :.:.::::.
CCDS10 CQFLVPQQG--------SPHSLPP-------PLTLHCNITESYPSSSPIWFVDSEDPNLT
40 50 60 70 80
130 140 150 160 170 180
pF1KE3 AVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQC-TQEDVSSE
.::::: : :. :.:: :.:: .: .::.:::::.: ::::::::::. : : :.:.::
CCDS10 SVLERLEDTKN-NNLLRQQLKWLICELCSLYNLPKHLDVEMLDQPLPTGQNGTTEEVTSE
90 100 110 120 130
190 200 210 220 230 240
pF1KE3 DEDEE--MPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQDYLNGAV
.:.:: : :: ::::::::::::: ::::::.:: ::::::::::.:.::::.:::::
CCDS10 EEEEEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENLAILEKIRKTQRQDHLNGAV
140 150 160 170 180 190
250 260 270 280 290 300
pF1KE3 SGSVQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALHNDLQIL
::::::.:::::::::::::::.: : :.:::.:::::::.::: ::: :: ::.:::::
CCDS10 SGSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWHVKLQKVDPDSPLHSDLQIL
200 210 220 230 240 250
310 320 330 340 350 360
pF1KE3 KEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGWSSAYSI
::::: ..::::::::::::::::::::: :::::::::::::.::::::::::::::::
CCDS10 KEKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGGGALCMELLTKQGWSSAYSI
260 270 280 290 300 310
370 380 390 400 410 420
pF1KE3 ESVIMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKEDG
:::::::.::::::::::::::::.::.:.::::::.:.:::::::::::::::::
CCDS10 ESVIMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQIHEKNGWYTPPKEDG
320 330 340 350 360 370
>>CCDS45309.1 UBE2Q2 gene_id:92912|Hs108|chr15 (359 aa)
initn: 1491 init1: 1355 opt: 1652 Z-score: 1182.6 bits: 227.6 E(32554): 1.5e-59
Smith-Waterman score: 1652; 71.6% identity (84.4% similar) in 366 aa overlap (60-422:2-359)
30 40 50 60 70 80
pF1KE3 GPGGGPGPGPCLRRELKLLESIFHRGHERFRIASACLDELSCEFLLAGAGGAGAGAAPGP
:. : ..: :.. . . : :
CCDS45 MRMDSLTEEKLECRLWCCLSDPSPPGLAARC
10 20 30
90 100 110 120 130 140
pF1KE3 HLPPRGSVPGDPVRIHCNITESYPAVPPIWSVESDDPNLAAVLERLVDIKKGNTLLLQHL
. :. ::. .: ::::. ::: :.:.::::..::::: : :. :.:: :.:
CCDS45 CVLERSIVPS--LR-----QESYPSSSPIWFVDSEDPNLTSVLERLEDTKN-NNLLRQQL
40 50 60 70 80
150 160 170 180 190 200
pF1KE3 KRIISDLCKLYNLPQHPDVEMLDQPLPAEQC-TQEDVSSEDEDEE--MPEDTEDLDHYEM
: .: .::.:::::.: ::::::::::. : : :.:.::.:.:: : :: ::::::::
CCDS45 KWLICELCSLYNLPKHLDVEMLDQPLPTGQNGTTEEVTSEEEEEEEEMAEDIEDLDHYEM
90 100 110 120 130 140
210 220 230 240 250 260
pF1KE3 KEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQDYLNGAVSGSVQATDRLMKELRDIYRS
::::: ::::::.:: ::::::::::.:.::::.:::::::::::.:::::::::::::
CCDS45 KEEEPISGKKSEDEGIEKENLAILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRS
150 160 170 180 190 200
270 280 290 300 310 320
pF1KE3 QSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALHNDLQILKEKEGADFILLNFSFKDNFP
::.: : :.:::.:::::::.::: ::: :: ::.:::::::::: ..::::::::::::
CCDS45 QSYKTGIYSVELINDSLYDWHVKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFP
210 220 230 240 250 260
330 340 350 360 370 380
pF1KE3 FDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGWSSAYSIESVIMQISATLVKGKARVQF
::::::::: :::::::::::::.:::::::::::::::::::::::.::::::::::::
CCDS45 FDPPFVRVVLPVLSGGYVLGGGALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQF
270 280 290 300 310 320
390 400 410 420
pF1KE3 GANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKEDG
::::.::.:.::::::.:.:::::::::::::::::
CCDS45 GANKNQYNLARAQQSYNSIVQIHEKNGWYTPPKEDG
330 340 350
>>CCDS66839.1 UBE2Q2 gene_id:92912|Hs108|chr15 (340 aa)
initn: 1651 init1: 1355 opt: 1365 Z-score: 980.3 bits: 190.1 E(32554): 2.8e-48
Smith-Waterman score: 1608; 68.1% identity (79.4% similar) in 383 aa overlap (41-422:6-340)
20 30 40 50 60 70
pF1KE3 QPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFRIASACLDELS
:. :::.: ::: ..::::::.: ::::
CCDS66 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELH
10 20 30
80 90 100 110 120
pF1KE3 CEFLLAGAGGAGAGAAPGPH-LPPRGSVPGDPVRIHCNITESYPAVPPIWSVESDDPNLA
:.::. : .:: ::: :. .:::::::::. ::: :.:.::::.
CCDS66 CQFLVPQQG--------SPHSLPP-------PLTLHCNITESYPSSSPIWFVDSEDPNLT
40 50 60 70 80
130 140 150 160 170 180
pF1KE3 AVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQCTQEDVSSED
.::::: : :..: :. .:. :.:. :.
CCDS66 SVLERLEDTKNNN----------------------------LNGT--TEEVTSEE---EE
90 100
190 200 210 220 230 240
pF1KE3 EDEEMPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQDYLNGAVSGS
:.::: :: ::::::::::::: ::::::.:: ::::::::::.:.::::.::::::::
CCDS66 EEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENLAILEKIRKTQRQDHLNGAVSGS
110 120 130 140 150 160
250 260 270 280 290 300
pF1KE3 VQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALHNDLQILKEK
:::.:::::::::::::::.: : :.:::.:::::::.::: ::: :: ::.::::::::
CCDS66 VQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWHVKLQKVDPDSPLHSDLQILKEK
170 180 190 200 210 220
310 320 330 340 350 360
pF1KE3 EGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGWSSAYSIESV
:: ..::::::::::::::::::::: :::::::::::::.:::::::::::::::::::
CCDS66 EGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGGGALCMELLTKQGWSSAYSIESV
230 240 250 260 270 280
370 380 390 400 410 420
pF1KE3 IMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKEDG
::::.::::::::::::::::.::.:.::::::.:.:::::::::::::::::
CCDS66 IMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQIHEKNGWYTPPKEDG
290 300 310 320 330 340
>>CCDS47189.1 UBE2QL1 gene_id:134111|Hs108|chr5 (161 aa)
initn: 514 init1: 371 opt: 605 Z-score: 447.5 bits: 90.4 E(32554): 1.3e-18
Smith-Waterman score: 605; 56.2% identity (80.5% similar) in 169 aa overlap (257-422:1-161)
230 240 250 260 270 280
pF1KE3 LAILEKIKKNQRQDYLNGAVSGSVQATDRLMKELRDIYR-SQSFKGGNYAVELVNDSLYD
::::.:: : :. : .::::..::.:
CCDS47 MKELQDIARLSDRF----ISVELVDESLFD
10 20
290 300 310 320 330 340
pF1KE3 WNVKLLKVDQDSALHNDLQILKEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVL
::::: .::.::.: .:.. . ...:::::..: :::::.:::.::.:: : .::::
CCDS47 WNVKLHQVDKDSVLWQDMK----ETNTEFILLNLTFPDNFPFSPPFMRVLSPRLENGYVL
30 40 50 60 70 80
350 360 370 380 390 400
pF1KE3 GGGAICMELLTKQGWSSAYSIESVIMQISATLVKGKARVQFGANKSQYSLTR--AQQSYK
:::::::::: .::::::..:.:. :..:.::::..:. :.::. :..: :. ..:
CCDS47 DGGAICMELLTPRGWSSAYTVEAVMRQFAASLVKGQGRICRKAGKSKKSFSRKEAEATFK
90 100 110 120 130 140
410 420
pF1KE3 SLVQIHEKNGWYTPPKEDG
:::. ::: :: ::: ::
CCDS47 SLVKTHEKYGWVTPPVSDG
150 160
422 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 18:53:33 2016 done: Sun Nov 6 18:53:34 2016
Total Scan time: 3.100 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]