FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3156, 375 aa
  1>>>pF1KE3156 375 - 375 aa - 375 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.8926+/-0.000868; mu= 10.8422+/- 0.053
 mean_var=136.2301+/-26.963, 0's: 0 Z-trim(111.8): 24 B-trim: 302 in 1/52
 Lambda= 0.109885
 statistics sampled from 12628 (12642) to 12628 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.743), E-opt: 0.2 (0.388), width: 16
 Scan time: 2.800

The best scores are:                                       opt bits E(32554)
CCDS10286.1 UBE2Q2 gene_id:92912|Hs108|chr15       ( 375) 2504 408.1 6.6e-114
CCDS45309.1 UBE2Q2 gene_id:92912|Hs108|chr15       ( 359) 2086 341.8 5.7e-94
CCDS1069.1 UBE2Q1 gene_id:55585|Hs108|chr1         ( 422) 1711 282.4 5.1e-76
CCDS66839.1 UBE2Q2 gene_id:92912|Hs108|chr15       ( 340) 1614 266.9 1.8e-71
CCDS47189.1 UBE2QL1 gene_id:134111|Hs108|chr5      ( 161)  566 100.5 1.1e-21


>>CCDS10286.1 UBE2Q2 gene_id:92912|Hs108|chr15 (375 aa)
 initn: 2504 init1: 2504 opt: 2504 Z-score: 2158.4 bits: 408.1 E(32554): 6.6e-114
Smith-Waterman score: 2504; 100.0% identity (100.0% similar) in 375 aa overlap (1-375:1-375)

               10        20        30        40        50        60
pF1KE3 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELHCQFLVPQQGSPHSLPPPLTLHCNIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELHCQFLVPQQGSPHSLPPPLTLHCNIT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 ESYPSSSPIWFVDSEDPNLTSVLERLEDTKNNNLLRQQLKWLICELCSLYNLPKHLDVEM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 ESYPSSSPIWFVDSEDPNLTSVLERLEDTKNNNLLRQQLKWLICELCSLYNLPKHLDVEM
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 LDQPLPTGQNGTTEEVTSEEEEEEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LDQPLPTGQNGTTEEVTSEEEEEEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 AILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWH
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 VKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 VKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE3 GALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 GALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQ
              310       320       330       340       350       360

              370
pF1KE3 IHEKNGWYTPPKEDG
       :::::::::::::::
CCDS10 IHEKNGWYTPPKEDG
              370

>>CCDS45309.1 UBE2Q2 gene_id:92912|Hs108|chr15 (359 aa)
 initn: 2086 init1: 2086 opt: 2086 Z-score: 1800.6 bits: 341.8 E(32554): 5.7e-94
Smith-Waterman score: 2093; 90.0% identity (92.8% similar) in 361 aa overlap (25-375:2-359)

               10        20        30        40        50        60
pF1KE3 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELHCQFLVPQQGSPHSLPPPLTLHCNIT
                               :. :   ..:.:. :    ..:   :: :. .: .
CCDS45 MRMDSLTEEKLECR-LWCCLSDPS--PPGLAARCCVL 10 20 30 70 80 90 100 110 pF1KE3 E----------SYPSSSPIWFVDSEDPNLTSVLERLEDTKNNNLLRQQLKWLICELCSLY : ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 ERSIVPSLRQESYPSSSPIWFVDSEDPNLTSVLERLEDTKNNNLLRQQLKWLICELCSLY 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE3 NLPKHLDVEMLDQPLPTGQNGTTEEVTSEEEEEEEEMAEDIEDLDHYEMKEEEPISGKKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 NLPKHLDVEMLDQPLPTGQNGTTEEVTSEEEEEEEEMAEDIEDLDHYEMKEEEPISGKKS 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE3 EDEGIEKENLAILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRSQSYKTGIYSVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 EDEGIEKENLAILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRSQSYKTGIYSVE 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE3 LINDSLYDWHVKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFPFDPPFVRVVLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LINDSLYDWHVKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFPFDPPFVRVVLP 220 230 240 250 260 270 300 310 320 330 340 350 pF1KE3 VLSGGYVLGGGALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQFGANKNQYNLAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VLSGGYVLGGGALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQFGANKNQYNLAR 280 290 300 310 320 330 360 370 pF1KE3 AQQSYNSIVQIHEKNGWYTPPKEDG ::::::::::::::::::::::::: CCDS45 AQQSYNSIVQIHEKNGWYTPPKEDG 340 350 >>CCDS1069.1 UBE2Q1 gene_id:55585|Hs108|chr1 (422 aa) initn: 1543 init1: 1355 opt: 1711 Z-score: 1478.3 bits: 282.4 E(32554): 5.1e-76 Smith-Waterman score: 1831; 74.1% identity (86.0% similar) in 386 aa overlap (6-375:41-422) 10 20 30 pF1KE3 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELH :. :::.: ::: ..::::::.: :::: CCDS10 QPGPGQQLGGQGAAPGAGGGPGGGPGPGPCLRRELKLLESIFHRGHERFRIASACLDELS 20 30 40 50 60 70 40 50 60 70 80 pF1KE3 CQFLVPQQGS--------PHSLPP-------PLTLHCNITESYPSSSPIWFVDSEDPNLT :.::. :. :: ::: :. .:::::::::. ::: :.:.::::. 
CCDS10 CEFLLAGAGGAGAGAAPGPH-LPPRGSVPGDPVRIHCNITESYPAVPPIWSVESDDPNLA 80 90 100 110 120 90 100 110 120 130 pF1KE3 SVLERLEDTKNNN-LLRQQLKWLICELCSLYNLPKHLDVEMLDQPLPTGQNGTTEEVTSE .::::: : :..: :: :.:: .: .::.:::::.: ::::::::::. : : :.:.:: CCDS10 AVLERLVDIKKGNTLLLQHLKRIISDLCKLYNLPQHPDVEMLDQPLPAEQC-TQEDVSSE 130 140 150 160 170 180 140 150 160 170 180 190 pF1KE3 EEEEEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENLAILEKIRKTQRQDHLNGAV .:.: :: :: ::::::::::::: ::::::.:: ::::::::::.:.::::.::::: CCDS10 DEDE--EMPEDTEDLDHYEMKEEEPAEGKKSEDDGIGKENLAILEKIKKNQRQDYLNGAV 190 200 210 220 230 240 200 210 220 230 240 250 pF1KE3 SGSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWHVKLQKVDPDSPLHSDLQIL ::::::.:::::::::::::::.: : :.:::.:::::::.::: ::: :: ::.::::: CCDS10 SGSVQATDRLMKELRDIYRSQSFKGGNYAVELVNDSLYDWNVKLLKVDQDSALHNDLQIL 250 260 270 280 290 300 260 270 280 290 300 310 pF1KE3 KEKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGGGALCMELLTKQGWSSAYSI ::::: ..::::::::::::::::::::: :::::::::::::.:::::::::::::::: CCDS10 KEKEGADFILLNFSFKDNFPFDPPFVRVVSPVLSGGYVLGGGAICMELLTKQGWSSAYSI 310 320 330 340 350 360 320 330 340 350 360 370 pF1KE3 ESVIMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQIHEKNGWYTPPKEDG :::::::.::::::::::::::::.::.:.::::::.:.::::::::::::::::: CCDS10 ESVIMQISATLVKGKARVQFGANKSQYSLTRAQQSYKSLVQIHEKNGWYTPPKEDG 370 380 390 400 410 420 >>CCDS66839.1 UBE2Q2 gene_id:92912|Hs108|chr15 (340 aa) initn: 2239 init1: 1610 opt: 1614 Z-score: 1396.5 bits: 266.9 E(32554): 1.8e-71 Smith-Waterman score: 2173; 90.7% identity (90.7% similar) in 375 aa overlap (1-375:1-340) 10 20 30 40 50 60 pF1KE3 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELHCQFLVPQQGSPHSLPPPLTLHCNIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 MSVSGLKAELKFLASIFDKNHERFRIVSWKLDELHCQFLVPQQGSPHSLPPPLTLHCNIT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 ESYPSSSPIWFVDSEDPNLTSVLERLEDTKNNNLLRQQLKWLICELCSLYNLPKHLDVEM :::::::::::::::::::::::::::::::::: CCDS66 ESYPSSSPIWFVDSEDPNLTSVLERLEDTKNNNL-------------------------- 70 80 90 130 140 150 160 170 180 pF1KE3 
LDQPLPTGQNGTTEEVTSEEEEEEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENL ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 ---------NGTTEEVTSEEEEEEEEMAEDIEDLDHYEMKEEEPISGKKSEDEGIEKENL 100 110 120 130 140 190 200 210 220 230 240 pF1KE3 AILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 AILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWH 150 160 170 180 190 200 250 260 270 280 290 300 pF1KE3 VKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 VKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGG 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE3 GALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 GALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQ 270 280 290 300 310 320 370 pF1KE3 IHEKNGWYTPPKEDG ::::::::::::::: CCDS66 IHEKNGWYTPPKEDG 330 340 >>CCDS47189.1 UBE2QL1 gene_id:134111|Hs108|chr5 (161 aa) initn: 590 init1: 326 opt: 566 Z-score: 503.0 bits: 100.5 E(32554): 1.1e-21 Smith-Waterman score: 566; 51.2% identity (78.6% similar) in 168 aa overlap (210-375:1-161) 180 190 200 210 220 230 pF1KE3 LAILEKIRKTQRQDHLNGAVSGSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDW ::::.:: : .. . ::::...::.:: CCDS47 MKELQDIARLSDR---FISVELVDESLFDW 10 20 240 250 260 270 280 290 pF1KE3 HVKLQKVDPDSPLHSDLQILKEKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLG .:::..:: :: : .:.. . . :.::::..: :::::.:::.::. : : .:::: CCDS47 NVKLHQVDKDSVLWQDMK----ETNTEFILLNLTFPDNFPFSPPFMRVLSPRLENGYVLD 30 40 50 60 70 80 300 310 320 330 340 350 pF1KE3 GGALCMELLTKQGWSSAYSIESVIMQINATLVKGKARVQFGANKNQYNLAR--AQQSYNS :::.:::::: .::::::..:.:. :. :.::::..:. :.:.. ...: :. ...: CCDS47 GGAICMELLTPRGWSSAYTVEAVMRQFAASLVKGQGRICRKAGKSKKSFSRKEAEATFKS 90 100 110 120 130 140 360 370 pF1KE3 IVQIHEKNGWYTPPKEDG .:. 
::: :: ::: ::
CCDS47 LVKTHEKYGWVTPPVSDG
           150       160


375 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 14:11:14 2016 done: Sun Nov 6 14:11:14 2016
 Total Scan time: 2.800 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]