FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6525, 95 aa
  1>>>pF1KE6525 95 - 95 aa - 95 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2872+/-0.000602; mu= 8.5602+/- 0.036
 mean_var=53.1059+/-10.560, 0's: 0 Z-trim(110.4): 10 B-trim: 5 in 1/51
 Lambda= 0.175996
 statistics sampled from 11571 (11580) to 11571 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.761), E-opt: 0.2 (0.356), width: 16
 Scan time: 1.050

The best scores are:                                 opt  bits E(32554)
CCDS34549.1 SUMO4 gene_id:387082|Hs108|chr6  (  95)  632 167.6   1e-42
CCDS45774.1 SUMO2 gene_id:6613|Hs108|chr17   (  95)  533 142.5 3.7e-35
CCDS33587.1 SUMO3 gene_id:6612|Hs108|chr21   ( 103)  500 134.1 1.3e-32
CCDS45773.1 SUMO2 gene_id:6613|Hs108|chr17   (  71)  286  79.8 2.1e-16
CCDS68220.1 SUMO3 gene_id:6612|Hs108|chr21   ( 141)  251  70.9 1.9e-13
CCDS2352.1 SUMO1 gene_id:7341|Hs108|chr2     ( 101)  248  70.2 2.4e-13
CCDS46493.1 SUMO1 gene_id:7341|Hs108|chr2    (  76)  214  61.5 7.3e-11

>>CCDS34549.1 SUMO4 gene_id:387082|Hs108|chr6 (95 aa)
 initn: 632 init1: 632 opt: 632 Z-score: 880.5 bits: 167.6 E(32554): 1e-42
Smith-Waterman score: 632; 100.0% identity (100.0% similar) in 95 aa overlap (1-95:1-95)

               10        20        30        40        50        60
pF1KE6 MANEKPTEEVKTENNNHINLKVAGQDGSVVQFKIKRQTPLSKLMKAYCEPRGLSVKQIRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MANEKPTEEVKTENNNHINLKVAGQDGSVVQFKIKRQTPLSKLMKAYCEPRGLSVKQIRF
               10        20        30        40        50        60

               70        80        90
pF1KE6 RFGGQPISGTDKPAQLEMEDEDTIDVFQQPTGGVY
       :::::::::::::::::::::::::::::::::::
CCDS34 RFGGQPISGTDKPAQLEMEDEDTIDVFQQPTGGVY
               70        80        90

>>CCDS45774.1 SUMO2 gene_id:6613|Hs108|chr17 (95 aa)
 initn: 533 init1: 533 opt: 533 Z-score: 744.7 bits: 142.5 E(32554): 3.7e-35
Smith-Waterman score: 533; 85.3% identity (92.6% similar) in 95 aa overlap (1-95:1-95)

               10        20        30        40        50        60
pF1KE6 MANEKPTEEVKTENNNHINLKVAGQDGSVVQFKIKRQTPLSKLMKAYCEPRGLSVKQIRF
       ::.::: : ::::::.::::::::::::::::::::.:::::::::::: .:::..::::
CCDS45 MADEKPKEGVKTENNDHINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCERQGLSMRQIRF
               10        20        30        40        50        60

               70        80        90
pF1KE6 RFGGQPISGTDKPAQLEMEDEDTIDVFQQPTGGVY
       :: ::::. :: ::::::::::::::::: :::::
CCDS45 RFDGQPINETDTPAQLEMEDEDTIDVFQQQTGGVY
               70        80        90

>>CCDS33587.1 SUMO3 gene_id:6612|Hs108|chr21 (103 aa)
 initn: 458 init1: 458 opt: 500 Z-score: 698.8 bits: 134.1 E(32554): 1.3e-32
Smith-Waterman score: 500; 83.0% identity (91.5% similar) in 94 aa overlap (1-94:1-93)

               10        20        30        40        50        60
pF1KE6 MANEKPTEEVKTENNNHINLKVAGQDGSVVQFKIKRQTPLSKLMKAYCEPRGLSVKQIRF
       :..::: : :::::. ::::::::::::::::::::.:::::::::::: .:::..::::
CCDS33 MSEEKPKEGVKTEND-HINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCERQGLSMRQIRF
               10         20        30        40        50

               70        80        90
pF1KE6 RFGGQPISGTDKPAQLEMEDEDTIDVFQQPTGGVY
       :: ::::. :: ::::::::::::::::: ::::
CCDS33 RFDGQPINETDTPAQLEMEDEDTIDVFQQQTGGVPESSLAGHSF
      60        70        80        90       100

>>CCDS45773.1 SUMO2 gene_id:6613|Hs108|chr17 (71 aa)
 initn: 403 init1: 286 opt: 286 Z-score: 407.8 bits: 79.8 E(32554): 2.1e-16
Smith-Waterman score: 357; 67.4% identity (70.5% similar) in 95 aa overlap (1-95:1-71)

               10        20        30        40        50        60
pF1KE6 MANEKPTEEVKTENNNHINLKVAGQDGSVVQFKIKRQTPLSKLMKAYCEPRGLSVKQIRF
       ::.::: : ::::::.::::::::::::::::::::.::::::::::::
CCDS45 MADEKPKEGVKTENNDHINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCE-----------
               10        20        30        40

               70        80        90
pF1KE6 RFGGQPISGTDKPAQLEMEDEDTIDVFQQPTGGVY
                     ::::::::::::::: :::::
CCDS45 -------------RQLEMEDEDTIDVFQQQTGGVY
                   50        60        70

>>CCDS68220.1 SUMO3 gene_id:6612|Hs108|chr21 (141 aa)
 initn: 456 init1: 251 opt: 251 Z-score: 354.8 bits: 70.9 E(32554): 1.9e-13
Smith-Waterman score: 406; 59.8% identity (64.6% similar) in 127 aa overlap (6-94:6-131)

               10        20        30        40
pF1KE6 MANEKPTEEVKTENNNHINLKVAGQDGSVVQFKIKRQTPLSKLMKAYCE-----------
            : : :::::. ::::::::::::::::::::.::::::::::::
CCDS68 MSEEKPKEGVKTEND-HINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCERQVRHLAPPQS
               10         20        30        40        50

                                 50        60        70        80
pF1KE6 ---------------------------PRGLSVKQIRFRFGGQPISGTDKPAQLEMEDED
                                  :.:::..:::::: ::::. :: ::::::::::
CCDS68 LPVCALVLCVPGIPRARASRGWTQMQLPEGLSMRQIRFRFDGQPINETDTPAQLEMEDED
      60        70        80        90       100       110

             90
pF1KE6 TIDVFQQPTGGVY
       ::::::: ::::
CCDS68 TIDVFQQQTGGVPESSLAGHSF
     120       130       140

>>CCDS2352.1 SUMO1 gene_id:7341|Hs108|chr2 (101 aa)
 initn: 253 init1: 244 opt: 248 Z-score: 353.1 bits: 70.2 E(32554): 2.4e-13
Smith-Waterman score: 248; 44.0% identity (76.9% similar) in 91 aa overlap (5-93:7-97)

                  10         20        30        40        50
pF1KE6   MANEKP-TEEV-KTENNNHINLKVAGQDGSVVQFKIKRQTPLSKLMKAYCEPRGLSVK
             :: ::..   .....:.::: :::.: ..::.:  : :.:: ..::. .:. ..
CCDS23 MSDQEAKPSTEDLGDKKEGEYIKLKVIGQDSSEIHFKVKMTTHLKKLKESYCQRQGVPMN
               10        20        30        40        50        60

         60        70        80        90
pF1KE6 QIRFRFGGQPISGTDKPAQLEMEDEDTIDVFQQPTGGVY
       ..:: : :: :. .  : .: ::.::.:.:.:. :::
CCDS23 SLRFLFEGQRIADNHTPKELGMEEEDVIEVYQEQTGGHSTV
               70        80        90       100

>>CCDS46493.1 SUMO1 gene_id:7341|Hs108|chr2 (76 aa)
 initn: 214 init1: 214 opt: 214 Z-score: 308.6 bits: 61.5 E(32554): 7.3e-11
Smith-Waterman score: 214; 43.1% identity (76.4% similar) in 72 aa overlap (22-93:1-72)

               10        20        30        40        50        60
pF1KE6 MANEKPTEEVKTENNNHINLKVAGQDGSVVQFKIKRQTPLSKLMKAYCEPRGLSVKQIRF
                            .. ::.: ..::.:  : :.:: ..::. .:. ....::
CCDS46                      MSDQDSSEIHFKVKMTTHLKKLKESYCQRQGVPMNSLRF
                                    10        20        30

               70        80        90
pF1KE6 RFGGQPISGTDKPAQLEMEDEDTIDVFQQPTGGVY
        : :: :. .  : .: ::.::.:.:.:. :::
CCDS46 LFEGQRIADNHTPKELGMEEEDVIEVYQEQTGGHSTV
      40        50        60        70

95 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 14:06:02 2016 done: Tue Nov 8 14:06:02 2016
 Total Scan time: 1.050 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]