FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5726, 711 aa
1>>>pF1KE5726 711 - 711 aa - 711 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 12.4116+/-0.00116; mu= -8.4876+/- 0.070
mean_var=527.7698+/-108.029, 0's: 0 Z-trim(116.2): 34 B-trim: 952 in 1/53
Lambda= 0.055828
statistics sampled from 16824 (16854) to 16824 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.792), E-opt: 0.2 (0.518), width: 16
Scan time: 3.900
The best scores are: opt bits E(32554)
CCDS14370.1 USP51 gene_id:158880|Hs108|chrX ( 711) 5006 418.0 2.4e-116
CCDS65260.1 USP27X gene_id:389856|Hs108|chrX ( 438) 2170 189.4 9.7e-48
CCDS42285.1 USP22 gene_id:23326|Hs108|chr17 ( 525) 1944 171.3 3.3e-42
>>CCDS14370.1 USP51 gene_id:158880|Hs108|chrX (711 aa)
initn: 5006 init1: 5006 opt: 5006 Z-score: 2202.4 bits: 418.0 E(32554): 2.4e-116
Smith-Waterman score: 5006; 100.0% identity (100.0% similar) in 711 aa overlap (1-711:1-711)
10 20 30 40 50 60
pF1KE5 MAQVRETSLPSGSGVRWISGGGGGASPEEAVEKAGKMEEAAAGATKASSRREAEEMKLEP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MAQVRETSLPSGSGVRWISGGGGGASPEEAVEKAGKMEEAAAGATKASSRREAEEMKLEP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 LQEREPAPEENLTWSSSGGDEKVLPSIPLRCHSSSSPVCPRRKPRPRPQPRARSRSQPGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 LQEREPAPEENLTWSSSGGDEKVLPSIPLRCHSSSSPVCPRRKPRPRPQPRARSRSQPGL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 SAPPPPPARPPPPPPPPPPPAPRPRAWRGSRRRSRPGSRPQTRRSCSGDLDGSGDPGGLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 SAPPPPPARPPPPPPPPPPPAPRPRAWRGSRRRSRPGSRPQTRRSCSGDLDGSGDPGGLG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 DWLLEVEFGQGPTGCSHVESFKVGKNWQKNLRLIYQRFVWSGTPETRKRKAKSCICHVCS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 DWLLEVEFGQGPTGCSHVESFKVGKNWQKNLRLIYQRFVWSGTPETRKRKAKSCICHVCS
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 THMNRLHSCLSCVFFGCFTEKHIHKHAETKQHHLAVDLYHGVIYCFMCKDYVYDKDIEQI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 THMNRLHSCLSCVFFGCFTEKHIHKHAETKQHHLAVDLYHGVIYCFMCKDYVYDKDIEQI
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE5 AKETKEKILRLLTSTSTDVSHQQFMTSGFEDKQSTCETKEQEPKLVKPKKKRRKKSVYTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 AKETKEKILRLLTSTSTDVSHQQFMTSGFEDKQSTCETKEQEPKLVKPKKKRRKKSVYTV
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE5 GLRGLINLGNTCFMNCIVQALTHIPLLKDFFLSDKHKCIMTSPSLCLVCEMSSLFHAMYS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GLRGLINLGNTCFMNCIVQALTHIPLLKDFFLSDKHKCIMTSPSLCLVCEMSSLFHAMYS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE5 GSRTPHIPYKLLHLIWIHAEHLAGYRQQDAHEFLIAILDVLHRHSKDDSGGQEANNPNCC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GSRTPHIPYKLLHLIWIHAEHLAGYRQQDAHEFLIAILDVLHRHSKDDSGGQEANNPNCC
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE5 NCIIDQIFTGGLQSDVTCQACHSVSTTIDPCWDISLDLPGSCATFDSQNPERADSTVSRD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 NCIIDQIFTGGLQSDVTCQACHSVSTTIDPCWDISLDLPGSCATFDSQNPERADSTVSRD
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE5 DHIPGIPSLTDCLQWFTRPEHLGSSAKIKCNSCQSYQESTKQLTMKKLPIVACFHLKRFE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 DHIPGIPSLTDCLQWFTRPEHLGSSAKIKCNSCQSYQESTKQLTMKKLPIVACFHLKRFE
550 560 570 580 590 600
610 620 630 640 650 660
pF1KE5 HVGKQRRKINTFISFPLELDMTPFLASTKESRMKEGQPPTDCVPNENKYSLFAVINHHGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 HVGKQRRKINTFISFPLELDMTPFLASTKESRMKEGQPPTDCVPNENKYSLFAVINHHGT
610 620 630 640 650 660
670 680 690 700 710
pF1KE5 LESGHYTSFIRQQKDQWFSCDDAIITKATIEDLLYSEGYLLFYHKQGLEKD
:::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 LESGHYTSFIRQQKDQWFSCDDAIITKATIEDLLYSEGYLLFYHKQGLEKD
670 680 690 700 710
>>CCDS65260.1 USP27X gene_id:389856|Hs108|chrX (438 aa)
initn: 1891 init1: 1346 opt: 2170 Z-score: 970.5 bits: 189.4 E(32554): 9.7e-48
Smith-Waterman score: 2170; 73.8% identity (88.8% similar) in 427 aa overlap (287-711:1-426)
260 270 280 290 300 310
pF1KE5 CFTEKHIHKHAETKQHHLAVDLYHGVIYCFMCKDYVYDKDIEQIAKETKEKILRLLTSTS
::::::::::::::::: . . :.: .:::
CCDS65 MCKDYVYDKDIEQIAKEEQGEALKLQASTS
10 20 30
320 330 340 350 360 370
pF1KE5 TDVSHQQFMTSGFEDKQSTCETKEQEPKLVKPKKKRRK-KSVYTVGLRGLINLGNTCFMN
:.::::: . :. .: : :: . : .:. . .::. : .:.:::::::::::::::
CCDS65 TEVSHQQCSVPGLGEKFPTWETTKPELELLGHNPRRRRITSSFTIGLRGLINLGNTCFMN
40 50 60 70 80 90
380 390 400 410 420 430
pF1KE5 CIVQALTHIPLLKDFFLSDKHKCIMTSPSLCLVCEMSSLFHAMYSGSRTPHIPYKLLHLI
:::::::: :.:.::::::.:.: : :: :::::::::::. .:::. .::.:::::::.
CCDS65 CIVQALTHTPILRDFFLSDRHRCEMPSPELCLVCEMSSLFRELYSGNPSPHVPYKLLHLV
100 110 120 130 140 150
440 450 460 470 480 490
pF1KE5 WIHAEHLAGYRQQDAHEFLIAILDVLHRHSKDDSGGQEANNPNCCNCIIDQIFTGGLQSD
::::.:::::::::::::::: ::::::: : :. :. ::::: ::::::::::::::::
CCDS65 WIHARHLAGYRQQDAHEFLIAALDVLHRHCKGDDVGKAANNPNHCNCIIDQIFTGGLQSD
160 170 180 190 200 210
500 510 520 530 540 550
pF1KE5 VTCQACHSVSTTIDPCWDISLDLPGSCATFDSQNPERADSTVSRDDHIPGIPSLTDCLQW
:::::::.:::::::::::::::::::..: ..: : .:.:. ..::::: .:::::.
CCDS65 VTCQACHGVSTTIDPCWDISLDLPGSCTSFWPMSPGR-ESSVNGESHIPGITTLTDCLRR
220 230 240 250 260
560 570 580 590 600 610
pF1KE5 FTRPEHLGSSAKIKCNSCQSYQESTKQLTMKKLPIVACFHLKRFEHVGKQRRKINTFISF
:::::::::::::::.::::::::::::::.:::.:::::.::::: .::::::.:.:::
CCDS65 FTRPEHLGSSAKIKCGSCQSYQESTKQLTMNKLPVVACFHFKRFEHSAKQRRKITTYISF
270 280 290 300 310 320
620 630 640 650 660 670
pF1KE5 PLELDMTPFLASTKESRMK-EGQPPTDCVPNENKYSLFAVINHHGTLESGHYTSFIRQQK
:::::::::.::.:::::. . : ::. ::::::::::.::.:::::::::::::..:
CCDS65 PLELDMTPFMASSKESRMNGQLQLPTNSGNNENKYSLFAVVNHQGTLESGHYTSFIRHHK
330 340 350 360 370 380
680 690 700 710
pF1KE5 DQWFSCDDAIITKATIEDLLYSEGYLLFYHKQGLEKD
::::.::::.::::.:.:.: ::::::::::: ::..
CCDS65 DQWFKCDDAVITKASIKDVLDSEGYLLFYHKQVLEHESEKVKEMNTQAY
390 400 410 420 430
>>CCDS42285.1 USP22 gene_id:23326|Hs108|chr17 (525 aa)
initn: 2548 init1: 1527 opt: 1944 Z-score: 871.2 bits: 171.3 E(32554): 3.3e-42
Smith-Waterman score: 2548; 69.7% identity (85.4% similar) in 528 aa overlap (184-709:12-523)
160 170 180 190 200 210
pF1KE5 SRPGSRPQTRRSCSGDLDGSGDPGGLGDWLLEVEFGQGPTGCSHVESFKVGKNWQKNLRL
...:.. .: ::::. :::: ::..:::
CCDS42 MVSRPEPEGEAMDAELAVAPPGCSHLGSFKVD-NWKQNLRA
10 20 30 40
220 230 240 250 260 270
pF1KE5 IYQRFVWSGTPETRKRKAKSCICHVCSTHMNRLHSCLSCVFFGCFTEKHIHKHAETKQHH
::: :::::: :.:::::::::::::..:.::::::: ::::::::.::::.::..:.:.
CCDS42 IYQCFVWSGTAEARKRKAKSCICHVCGVHLNRLHSCLYCVFFGCFTKKHIHEHAKAKRHN
50 60 70 80 90 100
280 290 300 310 320 330
pF1KE5 LAVDLYHGVIYCFMCKDYVYDKDIEQIAKETKEKILRLLTSTSTDVSHQQFMTSGFEDKQ
::.::..: ::::.:.::.::::.: :::: ..: .. .: .:
CCDS42 LAIDLMYGGIYCFLCQDYIYDKDMEIIAKEEQRKAWKM---------------QGVGEKF
110 120 130 140
340 350 360 370 380 390
pF1KE5 STCETKEQEPKLVKPKKKRRK-KSVYTVGLRGLINLGNTCFMNCIVQALTHIPLLKDFFL
:: : ..: .:.: . :::: : :.::::::::::::::::::::::: :::.::::
CCDS42 STWEPTKRELELLKHNPKRRKITSNCTIGLRGLINLGNTCFMNCIVQALTHTPLLRDFFL
150 160 170 180 190 200
400 410 420 430 440 450
pF1KE5 SDKHKCIMTSPSLCLVCEMSSLFHAMYSGSRTPHIPYKLLHLIWIHAEHLAGYRQQDAHE
::.:.: : ::: ::::::::::. .::: :.::::::::::.: ::.:::::.::::::
CCDS42 SDRHRCEMQSPSSCLVCEMSSLFQEFYSGHRSPHIPYKLLHLVWTHARHLAGYEQQDAHE
210 220 230 240 250 260
460 470 480 490 500 510
pF1KE5 FLIAILDVLHRHSKDDSGGQEANNPNCCNCIIDQIFTGGLQSDVTCQACHSVSTTIDPCW
:::: ::::::: : :..:..::::: ::::::::::::::::::::.::.::::::: :
CCDS42 FLIAALDVLHRHCKGDDNGKKANNPNHCNCIIDQIFTGGLQSDVTCQVCHGVSTTIDPFW
270 280 290 300 310 320
520 530 540 550 560 570
pF1KE5 DISLDLPGSCATFDSQNPERADSTVSRDDHIPGIPSLTDCLQWFTRPEHLGSSAKIKCNS
::::::::: . : .: ..:. ..:. : .:::::. :::::::::::::::..
CCDS42 DISLDLPGSSTPFWPLSPGSEGNVVNGESHVSGTTTLTDCLRRFTRPEHLGSSAKIKCSG
330 340 350 360 370 380
580 590 600 610 620 630
pF1KE5 CQSYQESTKQLTMKKLPIVACFHLKRFEHVGKQRRKINTFISFPLELDMTPFLASTKESR
:.::::::::::::::::::::::::::: .: ::::.:..:::::::::::.::.::::
CCDS42 CHSYQESTKQLTMKKLPIVACFHLKRFEHSAKLRRKITTYVSFPLELDMTPFMASSKESR
390 400 410 420 430 440
640 650 660 670 680 690
pF1KE5 MK-EGQPPTDCVPNENKYSLFAVINHHGTLESGHYTSFIRQQKDQWFSCDDAIITKATIE
:. . : ::: . :.::::::::.::.::::::::::::::.:::::.:::::::::.:.
CCDS42 MNGQYQQPTDSLNNDNKYSLFAVVNHQGTLESGHYTSFIRQHKDQWFKCDDAIITKASIK
450 460 470 480 490 500
700 710
pF1KE5 DLLYSEGYLLFYHKQGLEKD
:.: ::::::::::: ::
CCDS42 DVLDSEGYLLFYHKQFLEYE
510 520
711 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 06:06:50 2016 done: Tue Nov 8 06:06:51 2016
Total Scan time: 3.900 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]
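
The "best scores" summary lines in the report above (accession, description, library sequence length in parentheses, then opt, bits and E() columns) can be pulled into structured records. The following is a minimal sketch, assuming that column layout holds for every summary line; the names Hit, HIT_LINE and parse_hit_line are hypothetical helpers, not part of the FASTA package.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'The best scores are:' summary table (hypothetical type)."""
    accession: str
    description: str
    length: int      # library sequence length, from the "( 711)" field
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # E() value for the searched library


# Assumed layout: <accession> <description> (<length>) <opt> <bits> <E()>
HIT_LINE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>\S+)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    try:
        evalue = float(m.group("e"))
    except ValueError:
        # Last column was not a number, so this is not a summary row.
        return None
    return Hit(
        accession=m.group("acc"),
        description=m.group("desc"),
        length=int(m.group("len")),
        opt=int(m.group("opt")),
        bits=float(m.group("bits")),
        evalue=evalue,
    )


if __name__ == "__main__":
    # The three summary lines copied from the report above.
    lines = [
        "CCDS14370.1 USP51 gene_id:158880|Hs108|chrX ( 711) 5006 418.0 2.4e-116",
        "CCDS65260.1 USP27X gene_id:389856|Hs108|chrX ( 438) 2170 189.4 9.7e-48",
        "CCDS42285.1 USP22 gene_id:23326|Hs108|chr17 ( 525) 1944 171.3 3.3e-42",
    ]
    for line in lines:
        print(parse_hit_line(line))
```

Lines that do not fit the assumed layout, such as the table header or the ">>" alignment headers, return None, so the function can be applied to every line of a report and the None results filtered out.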