FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5640, 565 aa
1>>>pF1KE5640 565 - 565 aa - 565 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.8304+/-0.00113; mu= 6.3079+/- 0.068
mean_var=127.7999+/-26.085, 0's: 0 Z-trim(106.9): 61 B-trim: 5 in 1/51
Lambda= 0.113451
statistics sampled from 9207 (9245) to 9207 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.648), E-opt: 0.2 (0.284), width: 16
Scan time: 2.690
The best scores are: opt bits E(32554)
CCDS33234.1 USP39 gene_id:10713|Hs108|chr2 ( 565) 3760 627.2 1.6e-179
CCDS74534.1 USP39 gene_id:10713|Hs108|chr2 ( 487) 3193 534.4 1.2e-151
CCDS58717.1 USP39 gene_id:10713|Hs108|chr2 ( 488) 3164 529.6 3.3e-150
CCDS58716.1 USP39 gene_id:10713|Hs108|chr2 ( 462) 2805 470.9 1.5e-132
>>CCDS33234.1 USP39 gene_id:10713|Hs108|chr2 (565 aa)
initn: 3760 init1: 3760 opt: 3760 Z-score: 3336.3 bits: 627.2 E(32554): 1.6e-179
Smith-Waterman score: 3760; 100.0% identity (100.0% similar) in 565 aa overlap (1-565:1-565)
10 20 30 40 50 60
pF1KE5 MSGRSKRESRGSTRGKRESESRGSSGRVKRERDREREPEAASSRGSPVRVKREFEPASAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MSGRSKRESRGSTRGKRESESRGSSGRVKRERDREREPEAASSRGSPVRVKREFEPASAR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 EAPASVVPFVRVKREREVDEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINRSVLDFDF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 EAPASVVPFVRVKREREVDEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINRSVLDFDF
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 EKLCSISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 EKLCSISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 IIDSSLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 IIDSSLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 QALSNVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEML
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 QALSNVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEML
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE5 QAVVLCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSMRIFTKKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 QAVVLCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSMRIFTKKL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE5 PHPDLPAEEKEQLLHNDEYQETMVESTFMYLTLDLPTAPLYKDEKEQLIIPQVPLFNILA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PHPDLPAEEKEQLLHNDEYQETMVESTFMYLTLDLPTAPLYKDEKEQLIIPQVPLFNILA
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE5 KFNGITEKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITNVDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 KFNGITEKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITNVDL
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE5 REYLSEEVQAVHKNTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVTDILPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 REYLSEEVQAVHKNTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVTDILPQ
490 500 510 520 530 540
550 560
pF1KE5 MITLSEAYIQIWKRRDNDETNQQGA
:::::::::::::::::::::::::
CCDS33 MITLSEAYIQIWKRRDNDETNQQGA
550 560
>>CCDS74534.1 USP39 gene_id:10713|Hs108|chr2 (487 aa)
initn: 3189 init1: 3189 opt: 3193 Z-score: 2835.8 bits: 534.4 E(32554): 1.2e-151
Smith-Waterman score: 3193; 99.0% identity (99.2% similar) in 482 aa overlap (84-565:6-487)
60 70 80 90 100 110
pF1KE5 FEPASAREAPASVVPFVRVKREREVDEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINR
: . ::::::::::::::::::::::::
CCDS74 MLTAVPSHLFSAKNGRVDSEDRRSRHCPYLDTINR
10 20 30
120 130 140 150 160 170
pF1KE5 SVLDFDFEKLCSISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 SVLDFDFEKLCSISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFY
40 50 60 70 80 90
180 190 200 210 220 230
pF1KE5 CLPDNYEIIDSSLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 CLPDNYEIIDSSLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKAN
100 110 120 130 140 150
240 250 260 270 280 290
pF1KE5 DYANAVLQALSNVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 DYANAVLQALSNVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAH
160 170 180 190 200 210
300 310 320 330 340 350
pF1KE5 VSPHEMLQAVVLCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 VSPHEMLQAVVLCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSM
220 230 240 250 260 270
360 370 380 390 400 410
pF1KE5 RIFTKKLPHPDLPAEEKEQLLHNDEYQETMVESTFMYLTLDLPTAPLYKDEKEQLIIPQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 RIFTKKLPHPDLPAEEKEQLLHNDEYQETMVESTFMYLTLDLPTAPLYKDEKEQLIIPQV
280 290 300 310 320 330
420 430 440 450 460 470
pF1KE5 PLFNILAKFNGITEKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 PLFNILAKFNGITEKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNF
340 350 360 370 380 390
480 490 500 510 520 530
pF1KE5 PITNVDLREYLSEEVQAVHKNTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 PITNVDLREYLSEEVQAVHKNTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQ
400 410 420 430 440 450
540 550 560
pF1KE5 VTDILPQMITLSEAYIQIWKRRDNDETNQQGA
::::::::::::::::::::::::::::::::
CCDS74 VTDILPQMITLSEAYIQIWKRRDNDETNQQGA
460 470 480
>>CCDS58717.1 USP39 gene_id:10713|Hs108|chr2 (488 aa)
initn: 3164 init1: 3164 opt: 3164 Z-score: 2810.2 bits: 529.6 E(32554): 3.3e-150
Smith-Waterman score: 3164; 100.0% identity (100.0% similar) in 476 aa overlap (1-476:1-476)
10 20 30 40 50 60
pF1KE5 MSGRSKRESRGSTRGKRESESRGSSGRVKRERDREREPEAASSRGSPVRVKREFEPASAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MSGRSKRESRGSTRGKRESESRGSSGRVKRERDREREPEAASSRGSPVRVKREFEPASAR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 EAPASVVPFVRVKREREVDEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINRSVLDFDF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 EAPASVVPFVRVKREREVDEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINRSVLDFDF
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 EKLCSISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 EKLCSISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 IIDSSLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 IIDSSLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 QALSNVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEML
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 QALSNVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEML
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE5 QAVVLCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSMRIFTKKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 QAVVLCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSMRIFTKKL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE5 PHPDLPAEEKEQLLHNDEYQETMVESTFMYLTLDLPTAPLYKDEKEQLIIPQVPLFNILA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PHPDLPAEEKEQLLHNDEYQETMVESTFMYLTLDLPTAPLYKDEKEQLIIPQVPLFNILA
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE5 KFNGITEKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITNVDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 KFNGITEKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITGQAN
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE5 REYLSEEVQAVHKNTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVTDILPQ
CCDS58 GMNYKTSR
>>CCDS58716.1 USP39 gene_id:10713|Hs108|chr2 (462 aa)
initn: 2953 init1: 2805 opt: 2805 Z-score: 2493.0 bits: 470.9 E(32554): 1.5e-132
Smith-Waterman score: 2910; 92.7% identity (93.4% similar) in 482 aa overlap (84-565:6-462)
60 70 80 90 100 110
pF1KE5 FEPASAREAPASVVPFVRVKREREVDEDSEPEREVRAKNGRVDSEDRRSRHCPYLDTINR
: . :::::::::::::::::::::::
CCDS58 MLTAVPSHLFSAKNGRVDSEDRRSRHCPYLDTIN-
10 20 30
120 130 140 150 160 170
pF1KE5 SVLDFDFEKLCSISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFY
:.: . :::::::::::::::::::::::::::::
CCDS58 -------------SFS-----------PFPTGRGLKSHAYIHSVQFSHHVFLNLHTLKFY
40 50 60 70
180 190 200 210 220 230
pF1KE5 CLPDNYEIIDSSLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKAN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 CLPDNYEIIDSSLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKAN
80 90 100 110 120 130
240 250 260 270 280 290
pF1KE5 DYANAVLQALSNVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 DYANAVLQALSNVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAH
140 150 160 170 180 190
300 310 320 330 340 350
pF1KE5 VSPHEMLQAVVLCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 VSPHEMLQAVVLCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSM
200 210 220 230 240 250
360 370 380 390 400 410
pF1KE5 RIFTKKLPHPDLPAEEKEQLLHNDEYQETMVESTFMYLTLDLPTAPLYKDEKEQLIIPQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 RIFTKKLPHPDLPAEEKEQLLHNDEYQETMVESTFMYLTLDLPTAPLYKDEKEQLIIPQV
260 270 280 290 300 310
420 430 440 450 460 470
pF1KE5 PLFNILAKFNGITEKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PLFNILAKFNGITEKEYKTYKENFLKRFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNF
320 330 340 350 360 370
480 490 500 510 520 530
pF1KE5 PITNVDLREYLSEEVQAVHKNTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PITNVDLREYLSEEVQAVHKNTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQ
380 390 400 410 420 430
540 550 560
pF1KE5 VTDILPQMITLSEAYIQIWKRRDNDETNQQGA
::::::::::::::::::::::::::::::::
CCDS58 VTDILPQMITLSEAYIQIWKRRDNDETNQQGA
440 450 460
565 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 05:27:07 2016 done: Tue Nov 8 05:27:08 2016
Total Scan time: 2.690 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]