FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5566, 439 aa 1>>>pF1KE5566 439 - 439 aa - 439 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.2383+/-0.000923; mu= 12.3006+/- 0.056 mean_var=74.4507+/-15.321, 0's: 0 Z-trim(106.2): 20 B-trim: 19 in 2/50 Lambda= 0.148641 statistics sampled from 8829 (8837) to 8829 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.649), E-opt: 0.2 (0.271), width: 16 Scan time: 2.870 The best scores are: opt bits E(32554) CCDS55001.1 UBR2 gene_id:23304|Hs108|chr6 ( 439) 3002 653.3 1.4e-187 CCDS4870.1 UBR2 gene_id:23304|Hs108|chr6 (1755) 2846 619.9 6e-177 CCDS10091.1 UBR1 gene_id:197131|Hs108|chr15 (1749) 1433 316.9 9.8e-86 CCDS2238.2 UBR3 gene_id:130507|Hs108|chr2 (1888) 306 75.3 5.9e-13 >>CCDS55001.1 UBR2 gene_id:23304|Hs108|chr6 (439 aa) initn: 3002 init1: 3002 opt: 3002 Z-score: 3481.1 bits: 653.3 E(32554): 1.4e-187 Smith-Waterman score: 3002; 100.0% identity (100.0% similar) in 439 aa overlap (1-439:1-439) 10 20 30 40 50 60 pF1KE5 MASELEPEVQAIDRSLLECSAEEIAGKWLQATDLTREVYQHLAHYVPKIYCRGPNPFPQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MASELEPEVQAIDRSLLECSAEEIAGKWLQATDLTREVYQHLAHYVPKIYCRGPNPFPQK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EDMLAQHVLLGPMEWYLCGEDPAFGFPKLEQANKPSHLCGRVFKVGEPTYSCRDCAVDPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 EDMLAQHVLLGPMEWYLCGEDPAFGFPKLEQANKPSHLCGRVFKVGEPTYSCRDCAVDPT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 CVLCMECFLGSIHRDHRYRMTTSGGGGFCDCGDTEAWKEGPYCQKHELNTSEIEEEEDPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 CVLCMECFLGSIHRDHRYRMTTSGGGGFCDCGDTEAWKEGPYCQKHELNTSEIEEEEDPL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 VHLSEDVIARTYNIFAITFRYAVEILTWEKESELPADLEMVEKSDTYYCMLFNDEVHTYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 VHLSEDVIARTYNIFAITFRYAVEILTWEKESELPADLEMVEKSDTYYCMLFNDEVHTYE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 QVIYTLQKAVNCTQKEAIGFATTVDRDGRRSVRYGDFQYCEQAKSVIVRNTSRQTKPLKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 QVIYTLQKAVNCTQKEAIGFATTVDRDGRRSVRYGDFQYCEQAKSVIVRNTSRQTKPLKV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 QVMHSSIVAHQNFGLKLLSWLGSIIGYSDGLRRILCQVGLQEGPDGENSSLVDRLMLSDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 QVMHSSIVAHQNFGLKLLSWLGSIIGYSDGLRRILCQVGLQEGPDGENSSLVDRLMLSDS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 KLWKGARSVYHQLFMSSLLMDLKYKKLFAVRFAKNYERLQSDYVTDDHDREFSVADLSVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 KLWKGARSVYHQLFMSSLLMDLKYKKLFAVRFAKNYERLQSDYVTDDHDREFSVADLSVQ 370 380 390 400 410 420 430 pF1KE5 IFTVPSLFSISAGRSGSPL ::::::::::::::::::: CCDS55 IFTVPSLFSISAGRSGSPL 430 >>CCDS4870.1 UBR2 gene_id:23304|Hs108|chr6 (1755 aa) initn: 2846 init1: 2846 opt: 2846 Z-score: 3290.3 bits: 619.9 E(32554): 6e-177 Smith-Waterman score: 2846; 96.7% identity (98.8% similar) in 427 aa overlap (1-427:1-427) 10 20 30 40 50 60 pF1KE5 MASELEPEVQAIDRSLLECSAEEIAGKWLQATDLTREVYQHLAHYVPKIYCRGPNPFPQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 MASELEPEVQAIDRSLLECSAEEIAGKWLQATDLTREVYQHLAHYVPKIYCRGPNPFPQK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 EDMLAQHVLLGPMEWYLCGEDPAFGFPKLEQANKPSHLCGRVFKVGEPTYSCRDCAVDPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 EDMLAQHVLLGPMEWYLCGEDPAFGFPKLEQANKPSHLCGRVFKVGEPTYSCRDCAVDPT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 CVLCMECFLGSIHRDHRYRMTTSGGGGFCDCGDTEAWKEGPYCQKHELNTSEIEEEEDPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 CVLCMECFLGSIHRDHRYRMTTSGGGGFCDCGDTEAWKEGPYCQKHELNTSEIEEEEDPL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 VHLSEDVIARTYNIFAITFRYAVEILTWEKESELPADLEMVEKSDTYYCMLFNDEVHTYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 VHLSEDVIARTYNIFAITFRYAVEILTWEKESELPADLEMVEKSDTYYCMLFNDEVHTYE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 QVIYTLQKAVNCTQKEAIGFATTVDRDGRRSVRYGDFQYCEQAKSVIVRNTSRQTKPLKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 QVIYTLQKAVNCTQKEAIGFATTVDRDGRRSVRYGDFQYCEQAKSVIVRNTSRQTKPLKV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 QVMHSSIVAHQNFGLKLLSWLGSIIGYSDGLRRILCQVGLQEGPDGENSSLVDRLMLSDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 QVMHSSIVAHQNFGLKLLSWLGSIIGYSDGLRRILCQVGLQEGPDGENSSLVDRLMLSDS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 KLWKGARSVYHQLFMSSLLMDLKYKKLFAVRFAKNYERLQSDYVTDDHDREFSVADLSVQ ::::::::::::::::::::::::::::::::::::..:: :.. :::.: ::. :::: CCDS48 KLWKGARSVYHQLFMSSLLMDLKYKKLFAVRFAKNYQQLQRDFMEDDHERAVSVTALSVQ 370 380 390 400 410 420 430 pF1KE5 IFTVPSLFSISAGRSGSPL .::.:.: CCDS48 FFTAPTLARMLITEENLMSIIIKTFMDHLRHRDAQGRFQFERYTALQAFKFRRVQSLILD 430 440 450 460 470 480 >>CCDS10091.1 UBR1 gene_id:197131|Hs108|chr15 (1749 aa) initn: 1332 init1: 870 opt: 1433 Z-score: 1652.7 bits: 316.9 E(32554): 9.8e-86 Smith-Waterman score: 1433; 48.8% identity (78.9% similar) in 412 aa overlap (17-427:18-427) 10 20 30 40 50 pF1KE5 MASELEPEVQAIDRSLLECSAEEIAGKWLQATDLTREVYQHLAHYVPKIYCRGPNPFPQ : . ...:. : : .:. .:::. ::.:: .: . CCDS10 MADEEAGGTERMEISAELPQTPQRLASWWDQQVDFYTAFLHHLAQLVPEIYFAEMDPDLE 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 KEDMLAQHVLLGPMEWYLCGEDPAFGFPKLEQANKPSHLCGRVFKVGEPTYSCRDCAVDP :.. .: .. :.:::: :::: . . ::.... .::::::: :: ::::::::.:: CCDS10 KQEESVQMSIFTPLEWYLFGEDPDICLEKLKHSGA-FQLCGRVFKSGETTYSCRDCAIDP 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 TCVLCMECFLGSIHRDHRYRMTTSGGGGFCDCGDTEAWKEGPYCQKHELNTSEIEEEEDP ::::::.:: :.:..:::.: :: :::::::::::::: ::.: .:: . . .: . CCDS10 TCVLCMDCFQDSVHKNHRYKMHTSTGGGFCDCGDTEAWKTGPFCVNHEPGRAGTIKE-NS 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 LVHLSEDVIARTYNIFAITFRYAVEILTWEKESELPADLEMVEKSDTYYCMLFNDEVHTY :.:.::... .:: ...:.::. ::.:.::: .:.. ::.. :::.::::: :.: CCDS10 RCPLNEEVIVQARKIFPSVIKYVVEMTIWEEEKELPPELQIREKNERYYCVLFNDEHHSY 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 EQVIYTLQKAVNCTQKEAIGFATTVDRDGRRSVRYGDFQYCEQAKSVIVRNTSRQTK-PL ..:::.::.:..: :: .:..:..:::.:. : . :..:: : .. .. :: CCDS10 DHVIYSLQRALDCELAEAQLHTTAIDKEGRRAVKAGAYAACQEAKEDIKSHSENVSQHPL 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE5 KVQVMHSSIVAHQNFGLKLLSWLGSIIGYSDGLRRILCQVGLQEGPDGENSSLVDRLMLS .:.:.:: :.:::.:.:.: ::...:..::. .:.:.::. :.: ::.:: :..:::: CCDS10 HVEVLHSEIMAHQKFALRLGSWMNKIMSYSSDFRQIFCQACLREEPDSENPCLISRLMLW 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE5 DSKLWKGARSVYHQLFMSSLLMDLKYKKLFAVRFAKNYERLQSDYVTDDHDREFSVADLS :.::.::::.. :.:..::..:...::::::..:.: :..::..:..::::: .:.. :: CCDS10 DAKLYKGARKILHELIFSSFFMEMEYKKLFAMEFVKYYKQLQKEYISDDHDRSISITALS 360 370 380 390 400 410 420 430 pF1KE5 VQIFTVPSLFSISAGRSGSPL ::.::::.: CCDS10 VQMFTVPTLARHLIEEQNVISVITETLLEVLPEYLDRNNKFNFQGYSQDKLGRVYAVICD 420 430 440 450 460 470 >>CCDS2238.2 UBR3 gene_id:130507|Hs108|chr2 (1888 aa) initn: 257 init1: 257 opt: 306 Z-score: 346.0 bits: 75.3 E(32554): 5.9e-13 Smith-Waterman score: 308; 24.7% identity (54.2% similar) in 373 aa overlap (73-423:91-452) 50 60 70 80 90 pF1KE5 AHYVPKIYCRGPNPFPQKEDMLAQHVLLGPMEWYLCGEDPAFGFPKLEQANK---PSHLC .:: : . :. .. : . :. :: CCDS22 AERPLAAAAGGEDAAAAGGGGGPGAAEEEALEWCKCLLAGGGGYDEFCAAVRAYDPAALC 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 GRVFKVGEPTYSCRDCAVDPTCVLCMECFLGSIHRDHRYRMTTSGGGGFCDCGDTEAWKE : :. .. .: :: :...: :: ::: . : : . : : .:: :::::... .: CCDS22 GLVWTANFVAYRCRTCGISPCMSLCAECFHQGDHTGHDFNMFRSQAGGACDCGDSNVMRE 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE5 GPYCQKHEL-NTSEIEEEEDPLVHLSEDVIARTYNIFAITFR--YAVEILTWEKESELPA . .:..:.. ..:.: :. .:: :. : . .: : .:..: CCDS22 SGFCKRHQIKSSSNIPCVPKDLLMMSEFVLPRFIFCLIQYLREGYNEPAADGPSEKDLNK 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE5 DLEMVEKSDTYYCML--FNDEVHTY-EQVIYTLQKAVNCTQKEAIGFATTVDRDGRR--- :...: . .. : .. ... ::. . :. . :. ..: . : .. .. CCDS22 VLQLLEPQISFLEDLTKMGGAMRSVLTQVLTNQQNYKDLTS--GLGENACVKKSHEKYLI 250 260 270 280 290 280 290 300 310 320 330 pF1KE5 SVRYGDFQYCEQAKSVIVRNTSRQTKPLKVQVMHSSIVAHQNFGLKLLSWLGSIIGYSDG ... . . : :. :.. : :. : :: . : : ..: : . : :.: CCDS22 ALKSSGLTYPEDKLVYGVQEPSAGTSSLAVQ---GFIGATGTLGQVDSSDEDDQDG-SQG 300 310 320 330 340 350 340 350 360 370 380 pF1KE5 LRRILCQVGLQEGPDGENSSLVDRL----MLSDSKLWKGARSVYHQL--FMSSLLMDLKY : . .: :. : ...:..: : .: . .: ... :. ..: : .: CCDS22 LGK-RKRVKLSSGT--KDQSIMDVLKHKSFLEELLFWTIKYEFPQKMVTFLLNMLPDQEY 360 370 380 390 400 410 390 400 410 420 430 pF1KE5 KKLFAVRFAKNY----ERLQSDYVTDDHDREFSVADLSVQIFTVPSLFSISAGRSGSPL : :. :...: . :.... .: . . .. .:::.:. CCDS22 KVAFTKTFVQHYAFIMKTLKKSHESDTMSNR--IVHISVQLFSNEELARQVTEECQLLDI 420 430 440 450 460 CCDS22 MVTVLLYMMESCLIKSELQDEENSLHVVVNCGEALLKNNTYWPLVSDFINILSHQSVAKR 470 480 490 500 510 520 439 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 01:53:48 2016 done: Tue Nov 8 01:53:49 2016 Total Scan time: 2.870 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]