FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3183, 422 aa 1>>>pF1KE3183 422 - 422 aa - 422 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.3143+/-0.000847; mu= 12.5010+/- 0.052 mean_var=200.5954+/-40.076, 0's: 0 Z-trim(115.6): 7 B-trim: 167 in 1/52 Lambda= 0.090555 statistics sampled from 16182 (16187) to 16182 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.796), E-opt: 0.2 (0.497), width: 16 Scan time: 3.100 The best scores are: opt bits E(32554) CCDS1069.1 UBE2Q1 gene_id:55585|Hs108|chr1 ( 422) 2879 388.0 9.2e-108 CCDS10286.1 UBE2Q2 gene_id:92912|Hs108|chr15 ( 375) 1711 235.3 7.3e-62 CCDS45309.1 UBE2Q2 gene_id:92912|Hs108|chr15 ( 359) 1652 227.6 1.5e-59 CCDS66839.1 UBE2Q2 gene_id:92912|Hs108|chr15 ( 340) 1365 190.1 2.8e-48 CCDS47189.1 UBE2QL1 gene_id:134111|Hs108|chr5 ( 161) 605 90.4 1.3e-18 >>CCDS1069.1 UBE2Q1 gene_id:55585|Hs108|chr1 (422 aa) initn: 2879 init1: 2879 opt: 2879 Z-score: 2048.1 bits: 388.0 E(32554): 9.2e-108 Smith-Waterman score: 2879; 100.0% identity (100.0% similar) in 422 aa overlap (1-422:1-422) 10 20 30 40 50 60 pF1KE3 MQQPQPQGQQQPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MQQPQPQGQQQPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 IASACLDELSCEFLLAGAGGAGAGAAPGPHLPPRGSVPGDPVRIHCNITESYPAVPPIWS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 IASACLDELSCEFLLAGAGGAGAGAAPGPHLPPRGSVPGDPVRIHCNITESYPAVPPIWS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 VESDDPNLAAVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VESDDPNLAAVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQC 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 TQEDVSSEDEDEEMPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 TQEDVSSEDEDEEMPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 YLNGAVSGSVQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 YLNGAVSGSVQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 NDLQILKEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 NDLQILKEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGW 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 SSAYSIESVIMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 SSAYSIESVIMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKE 370 380 390 400 410 420 pF1KE3 DG :: CCDS10 DG >>CCDS10286.1 UBE2Q2 gene_id:92912|Hs108|chr15 (375 aa) initn: 1543 init1: 1355 opt: 1711 Z-score: 1224.1 bits: 235.3 E(32554): 7.3e-62 Smith-Waterman score: 1831; 74.1% identity (86.0% similar) in 386 aa overlap (41-422:6-375) 20 30 40 50 60 70 pF1KE3 QPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFRIASACLDELS :. :::.: ::: ..::::::.: :::: CCDS10 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELH 10 20 30 80 90 100 110 120 pF1KE3 CEFLLAGAGGAGAGAAPGPH-LPPRGSVPGDPVRIHCNITESYPAVPPIWSVESDDPNLA :.::. : .:: ::: :. .:::::::::. ::: :.:.::::. CCDS10 CQFLVPQQG--------SPHSLPP-------PLTLHCNITESYPSSSPIWFVDSEDPNLT 40 50 60 70 80 130 140 150 160 170 180 pF1KE3 AVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQC-TQEDVSSE .::::: : :. :.:: :.:: .: .::.:::::.: ::::::::::. : : :.:.:: CCDS10 SVLERLEDTKN-NNLLRQQLKWLICELCSLYNLPKHLDVEMLDQPLPTGQNGTTEEVTSE 90 100 110 120 130 190 200 210 220 230 240 pF1KE3 DEDEE--MPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQDYLNGAV .:.:: : :: ::::::::::::: ::::::.:: ::::::::::.:.::::.::::: CCDS10 EEEEEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENLAILEKIRKTQRQDHLNGAV 140 150 160 170 180 190 250 260 270 280 290 300 pF1KE3 SGSVQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALHNDLQIL ::::::.:::::::::::::::.: : :.:::.:::::::.::: ::: :: ::.::::: CCDS10 SGSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWHVKLQKVDPDSPLHSDLQIL 200 210 220 230 240 250 310 320 330 340 350 360 pF1KE3 KEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGWSSAYSI ::::: ..::::::::::::::::::::: :::::::::::::.:::::::::::::::: CCDS10 KEKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGGGALCMELLTKQGWSSAYSI 260 270 280 290 300 310 370 380 390 400 410 420 pF1KE3 ESVIMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKEDG :::::::.::::::::::::::::.::.:.::::::.:.::::::::::::::::: CCDS10 ESVIMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQIHEKNGWYTPPKEDG 320 330 340 350 360 370 >>CCDS45309.1 UBE2Q2 gene_id:92912|Hs108|chr15 (359 aa) initn: 1491 init1: 1355 opt: 1652 Z-score: 1182.6 bits: 227.6 E(32554): 1.5e-59 Smith-Waterman score: 1652; 71.6% identity (84.4% similar) in 366 aa overlap (60-422:2-359) 30 40 50 60 70 80 pF1KE3 GPGGGPGPGPCLRRELKLLESIFHRGHERFRIASACLDELSCEFLLAGAGGAGAGAAPGP :. : ..: :.. . . : : CCDS45 MRMDSLTEEKLECRLWCCLSDPSPPGLAARC 10 20 30 90 100 110 120 130 140 pF1KE3 HLPPRGSVPGDPVRIHCNITESYPAVPPIWSVESDDPNLAAVLERLVDIKKGNTLLLQHL . :. ::. .: ::::. ::: :.:.::::..::::: : :. :.:: :.: CCDS45 CVLERSIVPS--LR-----QESYPSSSPIWFVDSEDPNLTSVLERLEDTKN-NNLLRQQL 40 50 60 70 80 150 160 170 180 190 200 pF1KE3 KRIISDLCKLYNLPQHPDVEMLDQPLPAEQC-TQEDVSSEDEDEE--MPEDTEDLDHYEM : .: .::.:::::.: ::::::::::. : : :.:.::.:.:: : :: :::::::: CCDS45 KWLICELCSLYNLPKHLDVEMLDQPLPTGQNGTTEEVTSEEEEEEEEMAEDIEDLDHYEM 90 100 110 120 130 140 210 220 230 240 250 260 pF1KE3 KEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQDYLNGAVSGSVQATDRLMKELRDIYRS ::::: ::::::.:: ::::::::::.:.::::.:::::::::::.::::::::::::: CCDS45 KEEEPISGKKSEDEGIEKENLAILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRS 150 160 170 180 190 200 270 280 290 300 310 320 pF1KE3 QSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALHNDLQILKEKEGADFILLNFSFKDNFP ::.: : :.:::.:::::::.::: ::: :: ::.:::::::::: ..:::::::::::: CCDS45 QSYKTGIYSVELINDSLYDWHVKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFP 210 220 230 240 250 260 330 340 350 360 370 380 pF1KE3 FDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGWSSAYSIESVIMQISATLVKGKARVQF ::::::::: :::::::::::::.:::::::::::::::::::::::.:::::::::::: CCDS45 FDPPFVRVVLPVLSGGYVLGGGALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQF 270 280 290 300 310 320 390 400 410 420 pF1KE3 GANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKEDG ::::.::.:.::::::.:.::::::::::::::::: CCDS45 GANKNQYNLARAQQSYNSIVQIHEKNGWYTPPKEDG 330 340 350 >>CCDS66839.1 UBE2Q2 gene_id:92912|Hs108|chr15 (340 aa) initn: 1651 init1: 1355 opt: 1365 Z-score: 980.3 bits: 190.1 E(32554): 2.8e-48 Smith-Waterman score: 1608; 68.1% identity (79.4% similar) in 383 aa overlap (41-422:6-340) 20 30 40 50 60 70 pF1KE3 QPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFRIASACLDELS :. :::.: ::: ..::::::.: :::: CCDS66 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELH 10 20 30 80 90 100 110 120 pF1KE3 CEFLLAGAGGAGAGAAPGPH-LPPRGSVPGDPVRIHCNITESYPAVPPIWSVESDDPNLA :.::. : .:: ::: :. .:::::::::. ::: :.:.::::. CCDS66 CQFLVPQQG--------SPHSLPP-------PLTLHCNITESYPSSSPIWFVDSEDPNLT 40 50 60 70 80 130 140 150 160 170 180 pF1KE3 AVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQCTQEDVSSED .::::: : :..: :. .:. :.:. :. CCDS66 SVLERLEDTKNNN----------------------------LNGT--TEEVTSEE---EE 90 100 190 200 210 220 230 240 pF1KE3 EDEEMPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQDYLNGAVSGS :.::: :: ::::::::::::: ::::::.:: ::::::::::.:.::::.:::::::: CCDS66 EEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENLAILEKIRKTQRQDHLNGAVSGS 110 120 130 140 150 160 250 260 270 280 290 300 pF1KE3 VQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALHNDLQILKEK :::.:::::::::::::::.: : :.:::.:::::::.::: ::: :: ::.:::::::: CCDS66 VQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWHVKLQKVDPDSPLHSDLQILKEK 170 180 190 200 210 220 310 320 330 340 350 360 pF1KE3 EGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGWSSAYSIESV :: ..::::::::::::::::::::: :::::::::::::.::::::::::::::::::: CCDS66 EGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGGGALCMELLTKQGWSSAYSIESV 230 240 250 260 270 280 370 380 390 400 410 420 pF1KE3 IMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKEDG ::::.::::::::::::::::.::.:.::::::.:.::::::::::::::::: CCDS66 IMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQIHEKNGWYTPPKEDG 290 300 310 320 330 340 >>CCDS47189.1 UBE2QL1 gene_id:134111|Hs108|chr5 (161 aa) initn: 514 init1: 371 opt: 605 Z-score: 447.5 bits: 90.4 E(32554): 1.3e-18 Smith-Waterman score: 605; 56.2% identity (80.5% similar) in 169 aa overlap (257-422:1-161) 230 240 250 260 270 280 pF1KE3 LAILEKIKKNQRQDYLNGAVSGSVQATDRLMKELRDIYR-SQSFKGGNYAVELVNDSLYD ::::.:: : :. : .::::..::.: CCDS47 MKELQDIARLSDRF----ISVELVDESLFD 10 20 290 300 310 320 330 340 pF1KE3 WNVKLLKVDQDSALHNDLQILKEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVL ::::: .::.::.: .:.. . ...:::::..: :::::.:::.::.:: : .:::: CCDS47 WNVKLHQVDKDSVLWQDMK----ETNTEFILLNLTFPDNFPFSPPFMRVLSPRLENGYVL 30 40 50 60 70 80 350 360 370 380 390 400 pF1KE3 GGGAICMELLTKQGWSSAYSIESVIMQISATLVKGKARVQFGANKSQYSLTR--AQQSYK :::::::::: .::::::..:.:. :..:.::::..:. :.::. :..: :. ..: CCDS47 DGGAICMELLTPRGWSSAYTVEAVMRQFAASLVKGQGRICRKAGKSKKSFSRKEAEATFK 90 100 110 120 130 140 410 420 pF1KE3 SLVQIHEKNGWYTPPKEDG :::. ::: :: ::: :: CCDS47 SLVKTHEKYGWVTPPVSDG 150 160 422 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 18:53:33 2016 done: Sun Nov 6 18:53:34 2016 Total Scan time: 3.100 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]