FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5745, 640 aa
  1>>>pF1KE5745 640 - 640 aa - 640 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.6735+/-0.000926; mu= 8.8533+/- 0.055
 mean_var=140.2877+/-27.948, 0's: 0 Z-trim(110.3): 47 B-trim: 655 in 2/49
 Lambda= 0.108284
 statistics sampled from 11478 (11519) to 11478 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.705), E-opt: 0.2 (0.354), width: 16
 Scan time: 3.980

The best scores are:                                opt  bits E(32554)
CCDS4861.1 USP49 gene_id:25862|Hs108|chr6   ( 640) 4375 695.3 6.5e-200
CCDS69111.1 USP49 gene_id:25862|Hs108|chr6  ( 688) 4274 679.6 3.9e-195
CCDS9053.1 USP44 gene_id:84101|Hs108|chr12  ( 712) 1796 292.4 1.4e-78
CCDS58370.1 USP3 gene_id:9960|Hs108|chr15   ( 476)  535  95.4 2e-19

>>CCDS4861.1 USP49 gene_id:25862|Hs108|chr6 (640 aa)
 initn: 4375 init1: 4375 opt: 4375 Z-score: 3702.5 bits: 695.3 E(32554): 6.5e-200
Smith-Waterman score: 4375; 100.0% identity (100.0% similar) in 640 aa overlap (1-640:1-640)

               10        20        30        40        50        60
pF1KE5 MDRCKHVGRLRLAQDHSILNPQKWCCLECATTESVWACLKCSHVACGRYIEDHALKHFEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MDRCKHVGRLRLAQDHSILNPQKWCCLECATTESVWACLKCSHVACGRYIEDHALKHFEE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 TGHPLAMEVRDLYVFCYLCKDYVLNDNPEGDLKLLRSSLLAVRGQKQDTPVRRGRTLRSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 TGHPLAMEVRDLYVFCYLCKDYVLNDNPEGDLKLLRSSLLAVRGQKQDTPVRRGRTLRSM
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 ASGEDVVLPQRAPQGQPQMLTALWYRRQRLLARTLRLWFEKSSRGQAKLEQRRQEEALER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 ASGEDVVLPQRAPQGQPQMLTALWYRRQRLLARTLRLWFEKSSRGQAKLEQRRQEEALER
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 KKEEARRRRREVKRRLLEELASTPPRKSARLLLHTPRDAGPAASRPAALPTSRRVPAATL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 KKEEARRRRREVKRRLLEELASTPPRKSARLLLHTPRDAGPAASRPAALPTSRRVPAATL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 KLRRQPAMAPGVTGLRNLGNTCYMNSILQVLSHLQKFRECFLNLDPSKTEHLFPKATNGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 KLRRQPAMAPGVTGLRNLGNTCYMNSILQVLSHLQKFRECFLNLDPSKTEHLFPKATNGK
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE5 TQLSGKPTNSSATELSLRNDRAEACEREGFCWNGRASISRSLELIQNKEPSSKHISLCRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 TQLSGKPTNSSATELSLRNDRAEACEREGFCWNGRASISRSLELIQNKEPSSKHISLCRE
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE5 LHTLFRVMWSGKWALVSPFAMLHSVWSLIPAFRGYDQQDAQEFLCELLHKVQQELESEGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 LHTLFRVMWSGKWALVSPFAMLHSVWSLIPAFRGYDQQDAQEFLCELLHKVQQELESEGT
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE5 TRRILIPFSQRKLTKQVLKVVNTIFHGQLLSQVTCISCNYKSNTIEPFWDLSLEFPERYH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 TRRILIPFSQRKLTKQVLKVVNTIFHGQLLSQVTCISCNYKSNTIEPFWDLSLEFPERYH
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KE5 CIEKGFVPLNQTECLLTEMLAKFTETEALEGRIYACDQCNSKRRKSNPKPLVLSEARKQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 CIEKGFVPLNQTECLLTEMLAKFTETEALEGRIYACDQCNSKRRKSNPKPLVLSEARKQL
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KE5 MIYRLPQVLRLHLKRFRWSGRNHREKIGVHVVFDQVLTMEPYCCRDMLSSLDKETFAYDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MIYRLPQVLRLHLKRFRWSGRNHREKIGVHVVFDQVLTMEPYCCRDMLSSLDKETFAYDL
              550       560       570       580       590       600

              610       620       630       640
pF1KE5 SAVVMHHGKGFGSGHYTAYCYNTEGGACALLCGVGDTERG
       ::::::::::::::::::::::::::::::::::::::::
CCDS48 SAVVMHHGKGFGSGHYTAYCYNTEGGACALLCGVGDTERG
              610       620       630       640

>>CCDS69111.1 USP49 gene_id:25862|Hs108|chr6 (688 aa)
 initn: 4274 init1: 4274 opt: 4274 Z-score: 3616.8 bits: 679.6 E(32554): 3.9e-195
Smith-Waterman score: 4274; 100.0% identity (100.0% similar) in 626 aa overlap (1-626:1-626)

               10        20        30        40        50        60
pF1KE5 MDRCKHVGRLRLAQDHSILNPQKWCCLECATTESVWACLKCSHVACGRYIEDHALKHFEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MDRCKHVGRLRLAQDHSILNPQKWCCLECATTESVWACLKCSHVACGRYIEDHALKHFEE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 TGHPLAMEVRDLYVFCYLCKDYVLNDNPEGDLKLLRSSLLAVRGQKQDTPVRRGRTLRSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 TGHPLAMEVRDLYVFCYLCKDYVLNDNPEGDLKLLRSSLLAVRGQKQDTPVRRGRTLRSM
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 ASGEDVVLPQRAPQGQPQMLTALWYRRQRLLARTLRLWFEKSSRGQAKLEQRRQEEALER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 ASGEDVVLPQRAPQGQPQMLTALWYRRQRLLARTLRLWFEKSSRGQAKLEQRRQEEALER
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 KKEEARRRRREVKRRLLEELASTPPRKSARLLLHTPRDAGPAASRPAALPTSRRVPAATL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KKEEARRRRREVKRRLLEELASTPPRKSARLLLHTPRDAGPAASRPAALPTSRRVPAATL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 KLRRQPAMAPGVTGLRNLGNTCYMNSILQVLSHLQKFRECFLNLDPSKTEHLFPKATNGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KLRRQPAMAPGVTGLRNLGNTCYMNSILQVLSHLQKFRECFLNLDPSKTEHLFPKATNGK
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE5 TQLSGKPTNSSATELSLRNDRAEACEREGFCWNGRASISRSLELIQNKEPSSKHISLCRE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 TQLSGKPTNSSATELSLRNDRAEACEREGFCWNGRASISRSLELIQNKEPSSKHISLCRE
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE5 LHTLFRVMWSGKWALVSPFAMLHSVWSLIPAFRGYDQQDAQEFLCELLHKVQQELESEGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LHTLFRVMWSGKWALVSPFAMLHSVWSLIPAFRGYDQQDAQEFLCELLHKVQQELESEGT
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE5 TRRILIPFSQRKLTKQVLKVVNTIFHGQLLSQVTCISCNYKSNTIEPFWDLSLEFPERYH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 TRRILIPFSQRKLTKQVLKVVNTIFHGQLLSQVTCISCNYKSNTIEPFWDLSLEFPERYH
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KE5 CIEKGFVPLNQTECLLTEMLAKFTETEALEGRIYACDQCNSKRRKSNPKPLVLSEARKQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 CIEKGFVPLNQTECLLTEMLAKFTETEALEGRIYACDQCNSKRRKSNPKPLVLSEARKQL
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KE5 MIYRLPQVLRLHLKRFRWSGRNHREKIGVHVVFDQVLTMEPYCCRDMLSSLDKETFAYDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MIYRLPQVLRLHLKRFRWSGRNHREKIGVHVVFDQVLTMEPYCCRDMLSSLDKETFAYDL
              550       560       570       580       590       600

              610       620       630       640
pF1KE5 SAVVMHHGKGFGSGHYTAYCYNTEGGACALLCGVGDTERG
       ::::::::::::::::::::::::::
CCDS69 SAVVMHHGKGFGSGHYTAYCYNTEGGFWVHCNDSKLNVCSVEEVCKTQAYILFYTQRTVQ
              610       620       630       640       650       660

>>CCDS9053.1 USP44 gene_id:84101|Hs108|chr12 (712 aa)
 initn: 2554 init1: 803 opt: 1796 Z-score: 1524.4 bits: 292.4 E(32554): 1.4e-78
Smith-Waterman score: 2582; 59.9% identity (79.1% similar) in 654 aa overlap (1-626:4-647)

10 20 30 40 50
pF1KE5 MDRCKHVGRLRLAQDHSILNPQKWCCLECATTESVWACLKCSHVACGRYIEDHALKH
:: :::::.:.:::::: :::::: :..: ::::.::::.:::::::::::.:::::
CCDS90 MLAMDTCKHVGQLQLAQDHSSLNPQKWHCVDCNTTESIWACLSCSHVACGRYIEEHALKH
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE5 FEETGHPLAMEVRDLYVFCYLCKDYVLNDNPEGDLKLLRSSLLAVRGQKQDTPVRRGRTL
:.:..::.:.:: ..::::::: ::::::: ::::::: .: :...:. .: :: :
CCDS90 FQESSHPVALEVNEMYVFCYLCDDYVLNDNTTGDLKLLRRTLSAIKSQNYHCTTRSGRFL
70 80 90 100 110 120

120 130 140 150 160 170
pF1KE5 RSMASGEDVVL----PQRAPQGQPQMLTALWYRRQRLLARTLRLWFEKSSRGQAKLEQRR
:::..:.: . : :.. :. ::::.::. :... .: :::.: :. : :.
CCDS90 RSMGTGDDSYFLHDGAQSLLQSEDQLYTALWHRRRILMGKIFRTWFEQSPIGRKKQEEPF
130 140 150 160 170 180

180 190 200 210 220
pF1KE5 QEEALERKKEEARRRRREVKRRLLEELASTPPRKSARLL------------LHTPRDAGP
::. . :.:...::.:.. .. :: : ::::: :: ...: ..
CCDS90 QEKIV--VKREVKKRRQELEYQVKAELESMPPRKSLRLQGLAQSTIIEIVSVQVPAQTPA
190 200 210 220 230

230 240 250 260 270
pF1KE5 AASRPAALPTS-----RRVPAATLKLRRQPAMAPGVTGLRNLGNTCYMNSILQVLSHLQK
. .. .: :: ..: ...: :.: ..:::::::::::::::::.:::::::
CCDS90 SPAKDKVLSTSENEISQKVSDSSVK--RRPIVTPGVTGLRNLGNTCYMNSVLQVLSHLLI
240 250 260 270 280 290

280 290 300 310 320 330
pF1KE5 FRECFLNLDPSKTEHLFPKATNGKTQLSGKP--TNSSATELS--LRNDRAEACEREGFC-
::.:::.:: .. . ... ::. .: :.. . ... ..: . .: :..
CCDS90 FRQCFLKLD---LNQWLAMTASEKTRSCKHPPVTDTVVYQMNECQEKDTGFVCSRQSSLS
300 310 320 330 340 350

340 350 360 370 380
pF1KE5 --WNGRASISRSLELIQNKEPSSKHISLCRELHTLFRVMWSGKWALVSPFAMLHSVWSLI
.: :: .:..:::: :::.:..::::.::::::.:::::::::::::::::::: ::
CCDS90 SGLSGGASKGRKMELIQPKEPTSQYISLCHELHTLFQVMWSGKWALVSPFAMLHSVWRLI
360 370 380 390 400 410

390 400 410 420 430 440
pF1KE5 PAFRGYDQQDAQEFLCELLHKVQQELESEGTTRRILIPFSQRKLTKQVLKVVNTIFHGQL
:::::: :::::::::::: :.:.:::. ::. ::: ::::: ::::.:::.::::::
CCDS90 PAFRGYAQQDAQEFLCELLDKIQRELETTGTSLPALIPTSQRKLIKQVLNVVNNIFHGQL
420 430 440 450 460 470

450 460 470 480 490 500
pF1KE5 LSQVTCISCNYKSNTIEPFWDLSLEFPERYHCIEKGFVPLNQTECLLTEMLAKFTETEAL
::::::..:. :::::::::::::::::::.: : . . ::.:::::::::::::
CCDS90 LSQVTCLACDNKSNTIEPFWDLSLEFPERYQCSGKD---IASQPCLVTEMLAKFTETEAL
480 490 500 510 520 530

510 520 530 540 550 560
pF1KE5 EGRIYACDQCNSKRRKSNPKPLVLSEARKQLMIYRLPQVLRLHLKRFRWSGRNHREKIGV
::.::.:::::::::. . ::.::.::.::::: .::::::::::::::::::.::::::
CCDS90 EGKIYVCDQCNSKRRRFSSKPVVLTEAQKQLMICHLPQVLRLHLKRFRWSGRNNREKIGV
540 550 560 570 580 590

570 580 590 600 610 620
pF1KE5 HVVFDQVLTMEPYCCRDMLSSLDKETFAYDLSAVVMHHGKGFGSGHYTAYCYNTEGGACA
:: :...:.:::::::. :.:: : : :::::::::::::::::::::::::.:::
CCDS90 HVGFEEILNMEPYCCRETLKSLRPECFIYDLSAVVMHHGKGFGSGHYTAYCYNSEGGFWV
600 610 620 630 640 650

630 640
pF1KE5 LLCGVGDTERG
CCDS90 HCNDSKLSMCTMDEVCKAQAYILFYTQRVTENGHSKLLPPELLLGSQHPNEDADTSSNEI
660 670 680 690 700 710

>>CCDS58370.1 USP3 gene_id:9960|Hs108|chr15 (476 aa)
 initn: 713 init1: 175 opt: 535 Z-score: 462.4 bits: 95.4 E(32554): 2e-19
Smith-Waterman score: 583; 32.6% identity (60.2% similar) in 377 aa overlap (253-619:115-431)

230 240 250 260 270 280
pF1KE5 ASRPAALPTSRRVPAATLKLRRQPAMAPGVTGLRNLGNTCYMNSILQVLSHLQKFRECFL
::::::::::.::.::: ::....: :..
CCDS58 DRHKKRKLLENSTLNSKLLKVNGSTTAICATGLRNLGNTCFMNAILQSLSNIEQF-CCYF
90 100 110 120 130 140

290 300 310 320 330 340
pF1KE5 NLDPSKTEHLFPKATNGKTQLSGKPTNSSATELSLRNDRAEACEREGFCWNGRASISRSL
. : :.:: ::... :.
CCDS58 KELP-------------------------AVELR----------------NGKTAGRRTY
150 160

350 360 370 380 390 400
pF1KE5 ELIQNKEPSSKHISLCRELHTLFRVMWSGKWALVSPFAMLHSVWSLIPAFRGYDQQDAQE
... .....:: .:.. . ..:.:. . :: .... ::...: ::::.::::.:
CCDS58 ---HTRSQGDNNVSLVEEFRKTLCALWQGSQTAFSPESLFYVVWKIMPNFRGYQQQDAHE
170 180 190 200 210

410 420 430 440 450
pF1KE5 FLCELLHKVQQELES--EGTTRRILIPF------SQRKLTKQVLKVVNTIFHGQLLSQVT
:. :: ... ::.. .:..: .. :.. . . ::..:: : : ..:.
CCDS58 FMRYLLDHLHLELQGGFNGVSRSAILQENSTLSASNKCCINGASTVVTAIFGGILQNEVN
220 230 240 250 260 270

460 470 480 490 500 510
pF1KE5 CISCNYKSNTIEPFWDLSLEFPERYHCIEKGFVPLNQTECLLTEMLAKFTETEAL-EGRI
:. :. .: ..:: ::::..: ... ... : : : . : .::. : : : ..
CCDS58 CLICGTESRKFDPFLDLSLDIPSQFRS-KRSKNQENGPVCSLRDCLRSFTDLEELDETEL
280 290 300 310 320 330

520 530 540 550 560 570
pF1KE5 YACDQCNSKRRKSNPKPLVLSEARKQLMIYRLPQVLRLHLKRFRWSGRNHREKIGVHVVF
: : .:..:.... :.. : .::.:: ::::::.:.. :.:. ..: :
CCDS58 YMCHKCKKKQKST-----------KKFWIQKLPKVLCLHLKRFHWTAY-LRNKVDTYVEF
340 350 360 370 380

580 590 600 610 620 630
pF1KE5 D-QVLTMEPYCCRDMLSSLDKETFAYDLSAVVMHHGKGFGSGHYTAYCYNTEGGACALLC
. : :. : . :. :. :::.:::.:::.: ::::::::
CCDS58 PLRGLDMKCYLLEPENSG--PESCLYDLAAVVVHHGSGVGSGHYTAYATHEGRWFHFNDS
390 400 410 420 430 440

640
pF1KE5 GVGDTERG
CCDS58 TVTLTDEETVVKAKAYILFYVEHQAKAGSDKL
450 460 470

640 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 06:19:01 2016 done: Tue Nov 8 06:19:02 2016
 Total Scan time: 3.980 Total Display time: 0.040

Function used was FASTA [36.3.4 Apr, 2011]