FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3673, 370 aa
  1>>>pF1KE3673 370 - 370 aa - 370 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3844+/-0.00104; mu= 10.6759+/- 0.062
 mean_var=79.0130+/-16.003, 0's: 0 Z-trim(105.3): 102 B-trim: 290 in 1/48
 Lambda= 0.144286
 statistics sampled from 8230 (8344) to 8230 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.626), E-opt: 0.2 (0.256), width: 16
 Scan time: 2.620

The best scores are:                                      opt  bits E(32554)
CCDS31952.1 USP12 gene_id:219333|Hs108|chr13      ( 370) 2478 525.6 2.7e-149
CCDS47053.1 USP46 gene_id:64854|Hs108|chr4        ( 366) 2190 465.6 2.9e-131
CCDS47054.1 USP46 gene_id:64854|Hs108|chr4        ( 359) 2150 457.3 9.3e-129
CCDS61632.1 USP8 gene_id:9101|Hs108|chr15         (1012)  421  97.5 5.3e-20
CCDS10137.1 USP8 gene_id:9101|Hs108|chr15         (1118)  421  97.5 5.8e-20
CCDS53944.1 USP50 gene_id:373509|Hs108|chr15      ( 334)  331  78.7 8.4e-15
CCDS2418.1 USP37 gene_id:57695|Hs108|chr2         ( 979)  328  78.2 3.5e-14
CCDS58370.1 USP3 gene_id:9960|Hs108|chr15         ( 476)  311  74.5 2.1e-13
CCDS32265.1 USP3 gene_id:9960|Hs108|chr15         ( 520)  311  74.6 2.3e-13
CCDS44084.1 USP48 gene_id:84196|Hs108|chr1        ( 485)  310  74.3 2.4e-13
CCDS81277.1 USP48 gene_id:84196|Hs108|chr1        ( 983)  310  74.4 4.7e-13
CCDS30623.1 USP48 gene_id:84196|Hs108|chr1        (1035)  310  74.4 4.9e-13
CCDS32755.1 USP36 gene_id:57602|Hs108|chr17       (1123)  309  74.2 6.1e-13
CCDS47535.1 USP42 gene_id:84132|Hs108|chr7        (1316)  296  71.5 4.6e-12
CCDS66941.1 USP7 gene_id:7874|Hs108|chr16         (1086)  292  70.7 6.9e-12
CCDS32385.1 USP7 gene_id:7874|Hs108|chr16         (1102)  292  70.7 7e-12
CCDS58189.1 USP2 gene_id:9099|Hs108|chr11         ( 362)  283  68.7 9.2e-12
CCDS8423.1 USP2 gene_id:9099|Hs108|chr11          ( 396)  283  68.7 1e-11
CCDS77901.1 USP17L15 gene_id:100288520|Hs108|chr4 ( 559)  283  68.7 1.4e-11
CCDS8422.1 USP2 gene_id:9099|Hs108|chr11          ( 605)  283  68.7 1.5e-11
CCDS43713.1 USP17L2 gene_id:377630|Hs108|chr8     ( 530)  280  68.1 2e-11
CCDS59467.1 USP17L5 gene_id:728386|Hs108|chr4     ( 530)  279  67.9 2.3e-11
CCDS59457.1 USP17L13 gene_id:100287238|Hs108|chr4 ( 530)  277  67.5 3.1e-11
CCDS59463.1 USP17L22 gene_id:100287513|Hs108|chr4 ( 530)  277  67.5 3.1e-11
CCDS59458.1 USP17L17 gene_id:100287327|Hs108|chr4 ( 530)  277  67.5 3.1e-11
CCDS78298.1 USP17L1 gene_id:401447|Hs108|chr8     ( 530)  277  67.5 3.1e-11
CCDS59461.1 USP17L20 gene_id:100287441|Hs108|chr4 ( 530)  276  67.3 3.6e-11
CCDS59470.1 USP17L29 gene_id:728405|Hs108|chr4    ( 530)  276  67.3 3.6e-11
CCDS59471.1 USP17L30 gene_id:728419|Hs108|chr4    ( 530)  276  67.3 3.6e-11
CCDS59460.1 USP17L19 gene_id:100287404|Hs108|chr4 ( 530)  276  67.3 3.6e-11
CCDS59466.1 USP17L26 gene_id:728379|Hs108|chr4    ( 530)  276  67.3 3.6e-11
CCDS59459.1 USP17L18 gene_id:100287364|Hs108|chr4 ( 530)  276  67.3 3.6e-11
CCDS59465.1 USP17L25 gene_id:728373|Hs108|chr4    ( 530)  276  67.3 3.6e-11
CCDS59455.1 USP17L11 gene_id:100287178|Hs108|chr4 ( 530)  276  67.3 3.6e-11
CCDS59469.1 USP17L28 gene_id:728400|Hs108|chr4    ( 530)  276  67.3 3.6e-11
CCDS59464.1 USP17L24 gene_id:728369|Hs108|chr4    ( 530)  276  67.3 3.6e-11
CCDS59468.1 USP17L27 gene_id:728393|Hs108|chr4    ( 530)  276  67.3 3.6e-11
CCDS30920.1 USP21 gene_id:27005|Hs108|chr1        ( 565)  275  67.1 4.4e-11
CCDS78301.1 USP17L3 gene_id:645836|Hs108|chr8     ( 530)  273  66.6 5.5e-11
CCDS59454.1 USP17L10 gene_id:100287144|Hs108|chr4 ( 530)  272  66.4 6.4e-11
CCDS59462.1 USP17L21 gene_id:100287478|Hs108|chr4 ( 530)  272  66.4 6.4e-11
CCDS59456.1 USP17L12 gene_id:100287205|Hs108|chr4 ( 530)  272  66.4 6.4e-11
CCDS14277.1 USP11 gene_id:8237|Hs108|chrX         ( 963)  273  66.7 9.6e-11

>>CCDS31952.1 USP12 gene_id:219333|Hs108|chr13 (370 aa)
 initn: 2478 init1: 2478 opt: 2478 Z-score: 2793.9 bits: 525.6 E(32554): 2.7e-149
Smith-Waterman score: 2478; 99.7% identity (99.7% similar) in 370 aa overlap (1-370:1-370)

10 20 30 40 50 60
pF1KE3
MEILMTVSKFASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQALYFC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MEILMTVSKFASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQALYFC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 RPFREKVLAYKSQPRKKESLLTCLADLFHSIATQKKKVGVIPPKKFITRLRKENELFDNY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 RPFREKVLAYKSQPRKKESLLTCLADLFHSIATQKKKVGVIPPKKFITRLRKENELFDNY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 MQQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPDPTWVDEIFQGTL :::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::: CCDS31 MQQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPDPTWVHEIFQGTL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 TNETRCLTCETISSKDEDFLDLSVDVEQNTSITHCLRGFSNTETLCSEYKYYCEECRSKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 TNETRCLTCETISSKDEDFLDLSVDVEQNTSITHCLRGFSNTETLCSEYKYYCEECRSKQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 EAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPLELRLFNTSGDATNPDRMY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 EAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPLELRLFNTSGDATNPDRMY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 DLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDDDIVEKIDAQAIEEFYGLTSDISKNSES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 DLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDDDIVEKIDAQAIEEFYGLTSDISKNSES 310 320 330 340 350 360 370 pF1KE3 GYILFYQSRD :::::::::: CCDS31 GYILFYQSRD 370 >>CCDS47053.1 USP46 gene_id:64854|Hs108|chr4 (366 aa) initn: 2242 init1: 2190 opt: 2190 Z-score: 2469.9 bits: 465.6 E(32554): 2.9e-131 Smith-Waterman score: 2190; 88.3% identity (96.2% similar) in 366 aa overlap (5-370:1-366) 10 20 30 40 50 60 pF1KE3 MEILMTVSKFASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQALYFC ::: ..::::.::.:::::::.:::::::.:::::::::::::::::::::::::: CCDS47 MTVRNIASICNMGTNASALEKDIGPEQFPINEHYFGLVNFGNTCYCNSVLQALYFC 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 
RPFREKVLAYKSQPRKKESLLTCLADLFHSIATQKKKVGVIPPKKFITRLRKENELFDNY :::::.:::::.: .:::.::::::::::::::::::::::::::::.::::::.::::: CCDS47 RPFRENVLAYKAQQKKKENLLTCLADLFHSIATQKKKVGVIPPKKFISRLRKENDLFDNY 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 MQQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPDPTWVDEIFQGTL :::::::::::::::::::::::.:::::::.: :::... .:. :. ::: ::::::: CCDS47 MQQDAHEFLNYLLNTIADILQEEKKQEKQNGKLKNGNMNEPAENNKPELTWVHEIFQGTL 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE3 TNETRCLTCETISSKDEDFLDLSVDVEQNTSITHCLRGFSNTETLCSEYKYYCEECRSKQ :::::::.:::.::::::::::::::::::::::::: :::::::::: ::::: : ::: CCDS47 TNETRCLNCETVSSKDEDFLDLSVDVEQNTSITHCLRDFSNTETLCSEQKYYCETCCSKQ 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE3 EAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPLELRLFNTSGDATNPDRMY ::.:::.::::::::::::::::::.::::::::::::::::::::::::.::.: :::: CCDS47 EAQKRMRVKKLPMILALHLKRFKYMEQLHRYTKLSYRVVFPLELRLFNTSSDAVNLDRMY 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE3 DLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDDDIVEKIDAQAIEEFYGLTSDISKNSES :::::::::::::::::::.::::: :::::::::::::::::::::::::::::::::: CCDS47 DLVAVVVHCGSGPNRGHYITIVKSHGFWLLFDDDIVEKIDAQAIEEFYGLTSDISKNSES 300 310 320 330 340 350 370 pF1KE3 GYILFYQSRD :::::::::. 
CCDS47 GYILFYQSRE 360 >>CCDS47054.1 USP46 gene_id:64854|Hs108|chr4 (359 aa) initn: 2183 init1: 2131 opt: 2150 Z-score: 2425.1 bits: 457.3 E(32554): 9.3e-129 Smith-Waterman score: 2150; 88.5% identity (96.1% similar) in 357 aa overlap (14-370:3-359) 10 20 30 40 50 60 pF1KE3 MEILMTVSKFASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQALYFC : .:.:::::::.:::::::.:::::::::::::::::::::::::: CCDS47 MNCFQGTNASALEKDIGPEQFPINEHYFGLVNFGNTCYCNSVLQALYFC 10 20 30 40 70 80 90 100 110 120 pF1KE3 RPFREKVLAYKSQPRKKESLLTCLADLFHSIATQKKKVGVIPPKKFITRLRKENELFDNY :::::.:::::.: .:::.::::::::::::::::::::::::::::.::::::.::::: CCDS47 RPFRENVLAYKAQQKKKENLLTCLADLFHSIATQKKKVGVIPPKKFISRLRKENDLFDNY 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE3 MQQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPDPTWVDEIFQGTL :::::::::::::::::::::::.:::::::.: :::... .:. :. ::: ::::::: CCDS47 MQQDAHEFLNYLLNTIADILQEEKKQEKQNGKLKNGNMNEPAENNKPELTWVHEIFQGTL 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE3 TNETRCLTCETISSKDEDFLDLSVDVEQNTSITHCLRGFSNTETLCSEYKYYCEECRSKQ :::::::.:::.::::::::::::::::::::::::: :::::::::: ::::: : ::: CCDS47 TNETRCLNCETVSSKDEDFLDLSVDVEQNTSITHCLRDFSNTETLCSEQKYYCETCCSKQ 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE3 EAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPLELRLFNTSGDATNPDRMY ::.:::.::::::::::::::::::.::::::::::::::::::::::::.::.: :::: CCDS47 EAQKRMRVKKLPMILALHLKRFKYMEQLHRYTKLSYRVVFPLELRLFNTSSDAVNLDRMY 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE3 DLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDDDIVEKIDAQAIEEFYGLTSDISKNSES :::::::::::::::::::.::::: :::::::::::::::::::::::::::::::::: CCDS47 DLVAVVVHCGSGPNRGHYITIVKSHGFWLLFDDDIVEKIDAQAIEEFYGLTSDISKNSES 290 300 310 320 330 340 370 pF1KE3 GYILFYQSRD :::::::::. 
CCDS47 GYILFYQSRE 350 >>CCDS61632.1 USP8 gene_id:9101|Hs108|chr15 (1012 aa) initn: 252 init1: 177 opt: 421 Z-score: 472.5 bits: 97.5 E(32554): 5.3e-20 Smith-Waterman score: 469; 31.0% identity (57.1% similar) in 352 aa overlap (40-368:672-1002) 10 20 30 40 50 60 pF1KE3 FASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQAL--------YFCR :: :.::::: ::.:: : :: : CCDS61 CYPKAEISRLSASQIRNLNPVFGGSGPALTGLRNLGNTCYMNSILQCLCNAPHLADYFNR 650 660 670 680 690 700 70 80 90 100 110 120 pF1KE3 PFREKVLAYKSQPRKKESLLTCLADLFHSIATQKKKVGVIPPKKFITRLRKENELFDNYM . . .. .: . .. ..... : . . : :: : . : :. : .: CCDS61 NCYQDDINRSNLLGHKGEVAEEFGIIMKALWTGQYRY--ISPKDFKITIGKINDQFAGYS 710 720 730 740 750 130 140 150 160 170 pF1KE3 QQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPDPTW-----VDE-- :::..:.: .:. : :.:. .. . : . : :. .. .. . .: ..: CCDS61 QQDSQELLLFLM----DGLHEDLNKADNRKRYKEENNDHLDDFKAAEHAWQKHKQLNESI 760 770 780 790 800 810 180 190 200 210 220 pF1KE3 ---IFQGTLTNETRCLTCETISSKDEDFLDLSVDVEQNTSIT--HCLRGFSNTETLCSEY .::: . . ..::::. : : :. ::. . .... : ::: ::. : : .. CCDS61 IVALFQGQFKSTVQCLTCHKKSRTFEAFMYLSLPLASTSKCTLQDCLRLFSKEEKLTDNN 820 830 840 850 860 870 230 240 250 260 270 280 pF1KE3 KYYCEECRSKQEAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPLE-LRLFN ..:: .::..... :.... ::: .: .:::::.: . .. ::. : :::: : : . CCDS61 RFYCSHCRARRDSLKKIEIWKLPPVLLVHLKRFSYDGRWKQ--KLQTSVDFPLENLDLSQ 880 890 900 910 920 930 290 300 310 320 330 340 pF1KE3 TSGDATNPDRMYDLVAVVVHCGSGPNRGHYIAIVKS--HDFWLLFDDDIVEKIDAQAIEE : . :.: .: : : : . ::: : :. .. :. ::: : :...... 
CCDS61 YVIGPKNNLKKYNLFSVSNHYG-GLDGGHYTAYCKNAARQRWFKFDDHEVSDISVSSVK- 940 950 960 970 980 990 350 360 370 pF1KE3 FYGLTSDISKNSESGYILFYQSRD : ..::::: : CCDS61 -----------SSAAYILFYTSLGPRVTDVAT 1000 1010 >>CCDS10137.1 USP8 gene_id:9101|Hs108|chr15 (1118 aa) initn: 252 init1: 177 opt: 421 Z-score: 471.8 bits: 97.5 E(32554): 5.8e-20 Smith-Waterman score: 469; 31.0% identity (57.1% similar) in 352 aa overlap (40-368:778-1108) 10 20 30 40 50 60 pF1KE3 FASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQAL--------YFCR :: :.::::: ::.:: : :: : CCDS10 CYPKAEISRLSASQIRNLNPVFGGSGPALTGLRNLGNTCYMNSILQCLCNAPHLADYFNR 750 760 770 780 790 800 70 80 90 100 110 120 pF1KE3 PFREKVLAYKSQPRKKESLLTCLADLFHSIATQKKKVGVIPPKKFITRLRKENELFDNYM . . .. .: . .. ..... : . . : :: : . : :. : .: CCDS10 NCYQDDINRSNLLGHKGEVAEEFGIIMKALWTGQYRY--ISPKDFKITIGKINDQFAGYS 810 820 830 840 850 860 130 140 150 160 170 pF1KE3 QQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPDPTW-----VDE-- :::..:.: .:. : :.:. .. . : . : :. .. .. . .: ..: CCDS10 QQDSQELLLFLM----DGLHEDLNKADNRKRYKEENNDHLDDFKAAEHAWQKHKQLNESI 870 880 890 900 910 920 180 190 200 210 220 pF1KE3 ---IFQGTLTNETRCLTCETISSKDEDFLDLSVDVEQNTSIT--HCLRGFSNTETLCSEY .::: . . ..::::. : : :. ::. . .... : ::: ::. : : .. CCDS10 IVALFQGQFKSTVQCLTCHKKSRTFEAFMYLSLPLASTSKCTLQDCLRLFSKEEKLTDNN 930 940 950 960 970 980 230 240 250 260 270 280 pF1KE3 KYYCEECRSKQEAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPLE-LRLFN ..:: .::..... :.... ::: .: .:::::.: . .. ::. : :::: : : . CCDS10 RFYCSHCRARRDSLKKIEIWKLPPVLLVHLKRFSYDGRWKQ--KLQTSVDFPLENLDLSQ 990 1000 1010 1020 1030 290 300 310 320 330 340 pF1KE3 TSGDATNPDRMYDLVAVVVHCGSGPNRGHYIAIVKS--HDFWLLFDDDIVEKIDAQAIEE : . :.: .: : : : . ::: : :. .. :. ::: : :...... 
CCDS10 YVIGPKNNLKKYNLFSVSNHYG-GLDGGHYTAYCKNAARQRWFKFDDHEVSDISVSSVK- 1040 1050 1060 1070 1080 1090 350 360 370 pF1KE3 FYGLTSDISKNSESGYILFYQSRD : ..::::: : CCDS10 -----------SSAAYILFYTSLGPRVTDVAT 1100 1110 >>CCDS53944.1 USP50 gene_id:373509|Hs108|chr15 (334 aa) initn: 303 init1: 135 opt: 331 Z-score: 379.2 bits: 78.7 E(32554): 8.4e-15 Smith-Waterman score: 331; 28.2% identity (52.9% similar) in 308 aa overlap (25-324:31-329) 10 20 30 40 50 pF1KE3 MEILMTVSKFASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVL :: .: : . :: :.:::: :.. CCDS53 MTSQPSLPADDFDIYHVLAECTDYYDTLPVKEADGNQ-PHFQGVTGLWNLGNTCCVNAIS 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 QALYFCRPFREKVLAYKSQPRKKESLLTCLADLFHSIATQK--KKVGVIPPKKFITRLRK : : :. : :. : ... . .: : . :. . :. : . : . CCDS53 QCLCSILPLVEYFLTGKYITALQNDC-SEVATAFAYLMTDMWLGDSDCVSPEIFWSALGN 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 ENELFDNYMQQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNN-STPDPTW : . :::::.::: .:: . . :.. . ..... .:. . . : . . CCDS53 LYPAFTKKMQQDAQEFLICVLNELHEALKKYHYSRRRS--YEKGSTQRCCRKWITTETSI 120 130 140 150 160 170 180 190 200 210 220 pF1KE3 VDEIFQGTLTNETRCLTCETISSKDEDF--LDLSVDVEQNTSITHCLRGFSNTETLCSEY . ..:. :. :: :: . :.: : ..: . . . :. ::. : . ..: . CCDS53 ITQLFEEQLNYSIVCLKCEKCTYKNEVFTVFSLPIPSKYECSLRDCLQCFFQQDALTWNN 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE3 KYYCEECRSKQEAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPL---ELRL . .: :..:::. : ...: : :. .::::: . .: :: . .:: .: CCDS53 EIHCSFCETKQETAVRASISKAPKIIIFHLKRFDIQGTTKR--KLRTDIHYPLTNLDLTP 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE3 FNTSGDATNPDRMYDLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDDDIVEKIDAQAIEE . : : :.: ::: : :. . ::: :. :. 
CCDS53 YICSIFRKYPK--YNLCAVVNHFGDLDG-GHYTAFCKNSVTQA 300 310 320 330 350 360 370 pF1KE3 FYGLTSDISKNSESGYILFYQSRD >>CCDS2418.1 USP37 gene_id:57695|Hs108|chr2 (979 aa) initn: 229 init1: 92 opt: 328 Z-score: 368.1 bits: 78.2 E(32554): 3.5e-14 Smith-Waterman score: 328; 32.0% identity (56.7% similar) in 275 aa overlap (40-296:342-600) 10 20 30 40 50 60 pF1KE3 FASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQALYFCRPFREKVLA :. :.::::: :..::.:. . : . .: CCDS24 SVKKLRCNQDYTGWNKPRVPLSSHQQQQLQGFSNLGNTCYMNAILQSLFSLQSFANDLLK 320 330 340 350 360 370 70 80 90 100 110 120 pF1KE3 YKSQPRKK---ESLLTCLADLF--HSIATQKKKVGVIPPKKFITRLRKENELFDNYMQQD .. : :: ..:. .: :. ..: ... : .. :: . . : :..:::.: CCDS24 -QGIPWKKIPLNALIRRFAHLLVKKDICNSETKKDLL--KKVKNAISATAERFSGYMQND 380 390 400 410 420 130 140 150 160 170 180 pF1KE3 AHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPDPTWVDEIFQGTLTN-- :::::. : : :.:. .:: : . ...:.: .:: . . ..:: CCDS24 AHEFLSQCL----DQLKED--MEKLNKTWKTEPVSGEEN--SPDISATRAYTCPVITNLE 430 440 450 460 470 480 190 200 210 220 230 pF1KE3 -ETR----CLTCETISSKDEDFLDLSVDVEQNT------SITHCLRGFSNTETLCSEYKY :.. : .: : : :.: :::.:. . :: : : .: : .: CCDS24 FEVQHSIICKACGEIIPKREQFNDLSIDLPRRKKPLPPRSIQDSLDLFFRAE----ELEY 490 500 510 520 530 240 250 260 270 280 290 pF1KE3 YCEECRSKQEAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPLELRLFNTSG ::.: .: : : : ..:: .: :::::... : .:.. .:..: : : . CCDS24 SCEKCGGKC-ALVRHKFNRLPRVLILHLKRYSFNVALSLNNKIGQQVIIPRYLTLSSHCT 540 550 560 570 580 590 300 310 320 330 340 350 pF1KE3 DATNPDRMYDLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDDDIVEKIDAQAIEEFYGLT . :.: CCDS24 ENTKPPFTLGWSAHMAISRPLKASQMVNSCITSPSTPSKKFTFKSKSSLALCLDSDSEDE 600 610 620 630 640 650 >>CCDS58370.1 USP3 gene_id:9960|Hs108|chr15 (476 aa) initn: 515 init1: 169 opt: 311 Z-score: 354.2 bits: 74.5 E(32554): 2.1e-13 Smith-Waterman score: 507; 32.0% identity (58.9% similar) in 341 aa overlap (40-344:116-454) 10 20 30 40 50 60 pF1KE3 FASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQAL----YFCRPFRE :: :.::::. 
:..::.: :: :.: CCDS58 RHKKRKLLENSTLNSKLLKVNGSTTAICATGLRNLGNTCFMNAILQSLSNIEQFCCYFKE 90 100 110 120 130 140 70 80 90 100 110 pF1KE3 ---------KVLAYKS-QPRKKESLLTCLADLFHSI--ATQKKKVGVIPPKKFITRLRKE :. . .. . :.. . . :.. :.. : . . .. :.... . : CCDS58 LPAVELRNGKTAGRRTYHTRSQGDNNVSLVEEFRKTLCALWQGSQTAFSPESLFYVVWKI 150 160 170 180 190 200 120 130 140 150 160 170 pF1KE3 NELFDNYMQQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPD--PTW : .:.:::::::. :::. . :: . .... : ... . .:. . : CCDS58 MPNFRGYQQQDAHEFMRYLLDHLHLELQGGFNGVSRSAILQENSTLSASNKCCINGASTV 210 220 230 240 250 260 180 190 200 210 pF1KE3 VDEIFQGTLTNETRCLTCETISSKDEDFLDLSVDV-----------EQN---TSITHCLR : :: : : ::. :: : : : : . :::::.:. ..: :. ::: CCDS58 VTAIFGGILQNEVNCLICGTESRKFDPFLDLSLDIPSQFRSKRSKNQENGPVCSLRDCLR 270 280 290 300 310 320 220 230 240 250 260 270 pF1KE3 GFSNTETLCSEYKYYCEECRSKQEAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYR .:.. : : :.:..:..::.. :.. ..::: .: ::::::.. :. .:.. CCDS58 SFTDLEELDETELYMCHKCKKKQKSTKKFWIQKLPKVLCLHLKRFHWTAYLR--NKVDTY 330 340 350 360 370 380 280 290 300 310 320 330 pF1KE3 VVFPL---ELRLFNTSGDATNPDR-MYDLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDD : ::: ... . . ..:. .:::.::::: ::: . ::: : . . :. :.: CCDS58 VEFPLRGLDMKCYLLEPENSGPESCLYDLAAVVVHHGSGVGSGHYTAYATHEGRWFHFND 390 400 410 420 430 440 340 350 360 370 pF1KE3 DIVEKIDAQAIEEFYGLTSDISKNSESGYILFYQSRD . : : ... CCDS58 STVTLTDEETVVKAKAYILFYVEHQAKAGSDKL 450 460 470 >>CCDS32265.1 USP3 gene_id:9960|Hs108|chr15 (520 aa) initn: 515 init1: 169 opt: 311 Z-score: 353.6 bits: 74.6 E(32554): 2.3e-13 Smith-Waterman score: 507; 32.0% identity (58.9% similar) in 341 aa overlap (40-344:160-498) 10 20 30 40 50 60 pF1KE3 FASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQAL----YFCRPFRE :: :.::::. :..::.: :: :.: CCDS32 RHKKRKLLENSTLNSKLLKVNGSTTAICATGLRNLGNTCFMNAILQSLSNIEQFCCYFKE 130 140 150 160 170 180 70 80 90 100 110 pF1KE3 ---------KVLAYKS-QPRKKESLLTCLADLFHSI--ATQKKKVGVIPPKKFITRLRKE :. . .. . :.. . . :.. :.. : . . .. :.... . 
: CCDS32 LPAVELRNGKTAGRRTYHTRSQGDNNVSLVEEFRKTLCALWQGSQTAFSPESLFYVVWKI 190 200 210 220 230 240 120 130 140 150 160 170 pF1KE3 NELFDNYMQQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPD--PTW : .:.:::::::. :::. . :: . .... : ... . .:. . : CCDS32 MPNFRGYQQQDAHEFMRYLLDHLHLELQGGFNGVSRSAILQENSTLSASNKCCINGASTV 250 260 270 280 290 300 180 190 200 210 pF1KE3 VDEIFQGTLTNETRCLTCETISSKDEDFLDLSVDV-----------EQN---TSITHCLR : :: : : ::. :: : : : : . :::::.:. ..: :. ::: CCDS32 VTAIFGGILQNEVNCLICGTESRKFDPFLDLSLDIPSQFRSKRSKNQENGPVCSLRDCLR 310 320 330 340 350 360 220 230 240 250 260 270 pF1KE3 GFSNTETLCSEYKYYCEECRSKQEAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYR .:.. : : :.:..:..::.. :.. ..::: .: ::::::.. :. .:.. CCDS32 SFTDLEELDETELYMCHKCKKKQKSTKKFWIQKLPKVLCLHLKRFHWTAYLR--NKVDTY 370 380 390 400 410 420 280 290 300 310 320 330 pF1KE3 VVFPL---ELRLFNTSGDATNPDR-MYDLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDD : ::: ... . . ..:. .:::.::::: ::: . ::: : . . :. :.: CCDS32 VEFPLRGLDMKCYLLEPENSGPESCLYDLAAVVVHHGSGVGSGHYTAYATHEGRWFHFND 430 440 450 460 470 480 340 350 360 370 pF1KE3 DIVEKIDAQAIEEFYGLTSDISKNSESGYILFYQSRD . : : ... CCDS32 STVTLTDEETVVKAKAYILFYVEHQAKAGSDKL 490 500 510 520 >>CCDS44084.1 USP48 gene_id:84196|Hs108|chr1 (485 aa) initn: 297 init1: 116 opt: 310 Z-score: 352.9 bits: 74.3 E(32554): 2.4e-13 Smith-Waterman score: 387; 28.3% identity (54.6% similar) in 339 aa overlap (38-360:88-396) 10 20 30 40 50 pF1KE3 SKFASICTMGANASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSVLQ----------AL . ::.:.: ::: :. :: :: CCDS44 IGEHIWLGEIDENSFHNIDDPNCERRKKNSFVGLTNLGATCYVNTFLQVWFLNLELRQAL 60 70 80 90 100 110 60 70 80 90 100 110 pF1KE3 YFC-RPFREKVLAYKSQPRKK-ESLLTC--LADLFHSIATQKKKVGVIPPKKFITRLRKE :.: . .:. : .: : : : :: . ..... : :. :. : CCDS44 YLCPSTCSDYMLGDGIQEEKDYEPQTICEHLQYLFALLQNSNRR--YIDPSGFVKALG-- 120 130 140 150 160 170 120 130 140 150 160 170 pF1KE3 NELFDNYMQQDAHEFLNYLLNTIADILQEERKQEKQNGRLPNGNIDNENNNSTPDPTWVD .:. .::::.:: . ... . : :. ::.. . : :: :. 
CCDS44 ---LDTGQQQDAQEFSKLFMSLLEDTLS---KQKNPDVR----NI-------------VQ 180 190 200 210 180 190 200 210 220 230 pF1KE3 EIFQGTLTNETRCLTCETISSKDEDFLDLSVDVEQNTSITHCLRGFSNTETLCSEYKYYC . : : . : : : :. : .: .... . ..: :. : . : : .. .:.: CCDS44 QQFCGEYAYVTVCNQCGRESKLLSKFYELELNIQGHKQLTDCISEFLKEEKLEGDNRYFC 220 230 240 250 260 270 240 250 260 270 280 290 pF1KE3 EECRSKQEAHKRMKVKKLPMILALHLKRFKYMDQLHRYTKLSYRVVFPLELRLFNTSGDA :.:.:::.: ..... .:: : :.: :: . : . ::. . : :. .. . CCDS44 ENCQSKQNATRKIRLLSLPCTLNLQLMRFVFDRQTGHKKKLNTYIGFS-EILDMEPYVEH 280 290 300 310 320 300 310 320 330 340 350 pF1KE3 TNPDRMYDLVAVVVHCGSGPNRGHYIAIVKSHDFWLLFDDDIVEKIDAQAIEEFYGLT . . .:.: ::..: : . ::::: ::. . : :.:. .::.... .. :. CCDS44 KGGSYVYELSAVLIHRGVSAYSGHYIAHVKDPQSGEWYKFNDEDIEKMEGKKLQ--LGIE 330 340 350 360 370 380 360 370 pF1KE3 SDISKNSESGYILFYQSRD :... :.: CCDS44 EDLAEPSKSQTRKPKCGKGTHCSRNAYMLVYRLQTQEKPNTTVQVPAFLQELVDRDNSKF 390 400 410 420 430 440

370 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 22:37:20 2016 done: Sun Nov 6 22:37:21 2016
 Total Scan time: 2.620 Total Display time: 0.040

Function used was FASTA [36.3.4 Apr, 2011]
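
Each row of the "The best scores are:" table in this report has the same shape: accession, free-text description, library sequence length in parentheses, then the opt, bits and E(32554) values. A minimal Python sketch of pulling those fields out of one row follows. The regular expression and the name `parse_hit` are illustrative assumptions based only on the rows shown in this report; they are not part of the FASTA package and may not cover other output modes.

```python
import re

# Hypothetical pattern for one "best scores" row, inferred from this report:
#   <accession> <description> (<length>) <opt> <bits> <E-value>
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit(line):
    """Return a dict of fields for one hit row, or None if the line is not a hit row."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    return {
        "accession": m["acc"],
        "description": m["desc"],
        "length": int(m["length"]),    # library sequence length in residues
        "opt": int(m["opt"]),          # optimized alignment score
        "bits": float(m["bits"]),      # bit score
        "evalue": float(m["evalue"]),  # expectation value, E(32554) here
    }


# Top hit of this report, copied from the table.
row = parse_hit(
    "CCDS31952.1 USP12 gene_id:219333|Hs108|chr13 ( 370) 2478 525.6 2.7e-149"
)
# The table's column header is not a hit row, so it yields None.
header = parse_hit("The best scores are: opt bits E(32554)")
```

Lines that do not match, such as the column header or the alignment blocks, come back as `None`, so the function can be applied to every line of a report and the results filtered.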