FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0798, 1616 aa 1>>>pF1KE0798 1616 - 1616 aa - 1616 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 14.7504+/-0.00131; mu= -13.4740+/- 0.080 mean_var=760.0724+/-155.725, 0's: 0 Z-trim(116.5): 157 B-trim: 0 in 0/55 Lambda= 0.046521 statistics sampled from 16987 (17126) to 16987 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.773), E-opt: 0.2 (0.526), width: 16 Scan time: 6.230 The best scores are: opt bits E(32554) CCDS45973.1 SMARCA4 gene_id:6597|Hs108|chr19 (1616) 10873 746.4 1.7e-214 CCDS45972.1 SMARCA4 gene_id:6597|Hs108|chr19 (1617) 10861 745.6 3e-214 CCDS54218.1 SMARCA4 gene_id:6597|Hs108|chr19 (1613) 10841 744.3 7.6e-214 CCDS54217.1 SMARCA4 gene_id:6597|Hs108|chr19 (1614) 10829 743.5 1.3e-213 CCDS12253.1 SMARCA4 gene_id:6597|Hs108|chr19 (1647) 8576 592.3 4.5e-168 CCDS34977.1 SMARCA2 gene_id:6595|Hs108|chr9 (1590) 7087 492.3 5.3e-138 CCDS34978.1 SMARCA2 gene_id:6595|Hs108|chr9 (1572) 6320 440.8 1.6e-122 CCDS83338.1 SMARCA2 gene_id:6595|Hs108|chr9 (1514) 2742 200.7 3.1e-50 CCDS47865.1 CHD7 gene_id:55636|Hs108|chr8 (2997) 1127 92.6 2.1e-17 CCDS76510.1 CHD4 gene_id:1108|Hs108|chr12 (1905) 1065 88.2 2.8e-16 CCDS8552.1 CHD4 gene_id:1108|Hs108|chr12 (1912) 1065 88.2 2.8e-16 CCDS75807.1 SMARCA2 gene_id:6595|Hs108|chr9 ( 248) 1031 85.0 3.3e-16 CCDS83339.1 SMARCA2 gene_id:6595|Hs108|chr9 ( 276) 1031 85.1 3.6e-16 CCDS75808.1 SMARCA2 gene_id:6595|Hs108|chr9 ( 278) 1031 85.1 3.6e-16 CCDS32555.1 CHD3 gene_id:1107|Hs108|chr17 (1966) 1058 87.8 3.9e-16 CCDS32554.1 CHD3 gene_id:1107|Hs108|chr17 (2000) 1058 87.8 4e-16 CCDS32553.2 CHD3 gene_id:1107|Hs108|chr17 (2059) 1058 87.8 4.1e-16 CCDS53885.1 CHD8 gene_id:57680|Hs108|chr14 (2581) 1043 86.9 9.5e-16 CCDS57.1 CHD5 gene_id:26038|Hs108|chr1 (1954) 1010 84.5 3.7e-15 CCDS13317.1 CHD6 gene_id:84181|Hs108|chr20 (2715) 1010 84.7 4.6e-15 CCDS45081.1 CHD8 gene_id:57680|Hs108|chr14 (2302) 1005 84.3 5.2e-15 CCDS45485.1 CHD9 gene_id:80205|Hs108|chr16 (2881) 989 83.3 1.3e-14 CCDS76865.1 CHD9 gene_id:80205|Hs108|chr16 (2897) 989 83.3 1.3e-14 CCDS10071.1 INO80 gene_id:54617|Hs108|chr15 (1556) 897 76.9 6e-13 CCDS7229.1 ERCC6 gene_id:2074|Hs108|chr10 (1493) 851 73.7 5e-12 CCDS10689.2 SRCAP gene_id:10847|Hs108|chr16 (3230) 822 72.1 3.2e-11 >>CCDS45973.1 SMARCA4 gene_id:6597|Hs108|chr19 (1616 aa) initn: 10873 init1: 10873 opt: 10873 Z-score: 3964.2 bits: 746.4 E(32554): 1.7e-214 Smith-Waterman score: 10873; 100.0% identity (100.0% similar) in 1616 aa overlap (1-1616:1-1616) 10 20 30 40 50 60 pF1KE0 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE0 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE0 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE0 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE0 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE0 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE0 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE0 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE0 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE0 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE0 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE0 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE0 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE0 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KE0 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 1260 pF1KE0 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEE 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 1320 pF1KE0 DEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 DEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAE 1270 1280 1290 1300 1310 1320 1330 1340 1350 1360 1370 1380 pF1KE0 VERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEIEEEVRQKKSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEIEEEVRQKKSS 1330 1340 1350 1360 1370 1380 1390 1400 1410 1420 1430 1440 pF1KE0 RKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVI 1390 1400 1410 1420 1430 1440 1450 1460 1470 1480 1490 1500 pF1KE0 KYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVML :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVML 1450 1460 1470 1480 1490 1500 1510 1520 1530 1540 1550 1560 pF1KE0 LCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESESR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESESR 1510 1520 1530 1540 1550 1560 1570 1580 1590 1600 1610 pF1KE0 SVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED 1570 1580 1590 1600 1610 >>CCDS45972.1 SMARCA4 gene_id:6597|Hs108|chr19 (1617 aa) initn: 9787 init1: 9787 opt: 10861 Z-score: 3959.9 bits: 745.6 E(32554): 3e-214 Smith-Waterman score: 10861; 99.9% identity (99.9% similar) in 1617 aa overlap (1-1616:1-1617) 10 20 30 40 50 60 pF1KE0 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE0 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE0 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE0 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE0 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE0 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE0 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE0 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE0 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE0 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE0 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE0 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE0 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE0 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KE0 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 1260 pF1KE0 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEE 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 1320 pF1KE0 DEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 DEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAE 1270 1280 1290 1300 1310 1320 1330 1340 1350 1360 1370 1380 pF1KE0 VERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEIEEEVRQKKSS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEIEEEVRQKKSS 1330 1340 1350 1360 1370 1380 1390 1400 1410 1420 1430 1440 pF1KE0 RKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVI 1390 1400 1410 1420 1430 1440 1450 1460 1470 1480 1490 pF1KE0 KYKDSS-GRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVM :::::: ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KYKDSSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVM 1450 1460 1470 1480 1490 1500 1500 1510 1520 1530 1540 1550 pF1KE0 LLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESES 1510 1520 1530 1540 1550 1560 1560 1570 1580 1590 1600 1610 pF1KE0 RSVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RSVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED 1570 1580 1590 1600 1610 >>CCDS54218.1 SMARCA4 gene_id:6597|Hs108|chr19 (1613 aa) initn: 9263 init1: 9263 opt: 10841 Z-score: 3952.6 bits: 744.3 E(32554): 7.6e-214 Smith-Waterman score: 10841; 99.8% identity (99.8% similar) in 1616 aa overlap (1-1616:1-1613) 10 20 30 40 50 60 pF1KE0 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE0 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE0 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE0 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE0 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE0 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE0 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE0 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE0 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE0 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE0 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE0 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE0 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE0 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KE0 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 1260 pF1KE0 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEE 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 1320 pF1KE0 DEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 DEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAE 1270 1280 1290 1300 1310 1320 1330 1340 1350 1360 1370 1380 pF1KE0 VERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEIEEEVRQKKSS :::::::::::::::::::::::::::::::::::: ::::::::::::::::::::: CCDS54 VERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWL---KAIEEGTLEEIEEEVRQKKSS 1330 1340 1350 1360 1370 1390 1400 1410 1420 1430 1440 pF1KE0 RKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVI 1380 1390 1400 1410 1420 1430 1450 1460 1470 1480 1490 1500 pF1KE0 KYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVML :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVML 1440 1450 1460 1470 1480 1490 1510 1520 1530 1540 1550 1560 pF1KE0 LCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESESR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESESR 1500 1510 1520 1530 1540 1550 1570 1580 1590 1600 1610 pF1KE0 SVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 SVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED 1560 1570 1580 1590 1600 1610 >>CCDS54217.1 SMARCA4 gene_id:6597|Hs108|chr19 (1614 aa) initn: 10345 init1: 9263 opt: 10829 Z-score: 3948.3 bits: 743.5 E(32554): 1.3e-213 Smith-Waterman score: 10829; 99.8% identity (99.8% similar) in 1617 aa overlap (1-1616:1-1614) 10 20 30 40 50 60 pF1KE0 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE0 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE0 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE0 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE0 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE0 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE0 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE0 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE0 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE0 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE0 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE0 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE0 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE0 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KE0 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 1260 pF1KE0 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEE 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 1320 pF1KE0 DEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 DEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDELPSWIIKDDAE 1270 1280 1290 1300 1310 1320 1330 1340 1350 1360 1370 1380 pF1KE0 VERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEIEEEVRQKKSS :::::::::::::::::::::::::::::::::::: ::::::::::::::::::::: CCDS54 VERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWL---KAIEEGTLEEIEEEVRQKKSS 1330 1340 1350 1360 1370 1390 1400 1410 1420 1430 1440 pF1KE0 RKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVI 1380 1390 1400 1410 1420 1430 1450 1460 1470 1480 1490 pF1KE0 KYKDSS-GRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVM :::::: ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KYKDSSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLNDLEKDVM 1440 1450 1460 1470 1480 1490 1500 1510 1520 1530 1540 1550 pF1KE0 LLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEGEEEGSESES 1500 1510 1520 1530 1540 1550 1560 1570 1580 1590 1600 1610 pF1KE0 RSVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RSVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRSGSGSEED 1560 1570 1580 1590 1600 1610 >>CCDS12253.1 SMARCA4 gene_id:6597|Hs108|chr19 (1647 aa) initn: 9635 init1: 8553 opt: 8576 Z-score: 3131.0 bits: 592.3 E(32554): 4.5e-168 Smith-Waterman score: 10753; 97.8% identity (97.8% similar) in 1650 aa overlap (1-1616:1-1647) 10 20 30 40 50 60 pF1KE0 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQNQLHQL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGPGPGPGP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 GPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAAPTSTPQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 KLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRITPIQKPRG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE0 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNFQRQLRQEV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE0 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 VVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKHQEYLNSIL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE0 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 QHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMAEDEEGYRK 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE0 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAENAEGQTPAIG 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE0 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYEVAPRSDSEES 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE0 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 GSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENAKQDVDDEYGV 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE0 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 SQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYNNNLNGILADE 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE0 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAA 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE0 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTH 850 860 870 880 890 900 910 920 930 940 950 960 pF1KE0 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 YVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEE 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KE0 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 TILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTD 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KE0 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 GSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASG 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KE0 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 KFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLK 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KE0 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 TFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVR 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 pF1KE0 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDE-- :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 VLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAILEHEEQDESR 1210 1220 1230 1240 1250 1260 1260 1270 1280 pF1KE0 -------------------------------EEDEVPDDETVNQMIARHEEEFDLFMRMD ::::::::::::::::::::::::::::: CCDS12 HCSTGSGSASFAHTAPPPAGVNPDLEEPPLKEEDEVPDDETVNQMIARHEEEFDLFMRMD 1270 1280 1290 1300 1310 1320 1290 1300 1310 1320 1330 1340 pF1KE0 LDRRREEARNPKRKPRLMEEDELPSWIIKDDAEVERLTCEEEEEKMFGRGSRHRKEVDYS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LDRRREEARNPKRKPRLMEEDELPSWIIKDDAEVERLTCEEEEEKMFGRGSRHRKEVDYS 1330 1340 1350 1360 1370 1380 1350 1360 1370 1380 1390 1400 pF1KE0 DSLTEKQWLKTLKAIEEGTLEEIEEEVRQKKSSRKRKRDSDAGSSTPTTSTRSRDKDDES ::::::::: :::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 DSLTEKQWL---KAIEEGTLEEIEEEVRQKKSSRKRKRDSDAGSSTPTTSTRSRDKDDES 1390 1400 1410 1420 1430 1410 1420 1430 1440 1450 1460 pF1KE0 KKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVIKYKDSS-GRQLSEVFIQLPSRKELPEY ::::::::::::::::::::::::::::::::::::::: :::::::::::::::::::: CCDS12 KKQKKRGRPPAEKLSPNPPNLTKKMKKIVDAVIKYKDSSSGRQLSEVFIQLPSRKELPEY 1440 1450 1460 1470 1480 1490 1470 1480 1490 1500 1510 1520 pF1KE0 YELIRKPVDFKKIKERIRNHKYRSLNDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 YELIRKPVDFKKIKERIRNHKYRSLNDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVF 1500 1510 1520 1530 1540 1550 1530 1540 1550 1560 1570 1580 pF1KE0 TSVRQKIEKEDDSEGEESEEEEEGEEEGSESESRSVKVKIKLGRKEKAQDRLKGGRRRPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 TSVRQKIEKEDDSEGEESEEEEEGEEEGSESESRSVKVKIKLGRKEKAQDRLKGGRRRPS 1560 1570 1580 1590 1600 1610 1590 1600 1610 pF1KE0 RGSRAKPVVSDDDSEEEQEEDRSGSGSEED :::::::::::::::::::::::::::::: CCDS12 RGSRAKPVVSDDDSEEEQEEDRSGSGSEED 1620 1630 1640 >>CCDS34977.1 SMARCA2 gene_id:6595|Hs108|chr9 (1590 aa) initn: 6962 init1: 3931 opt: 7087 Z-score: 2591.1 bits: 492.3 E(32554): 5.3e-138 Smith-Waterman score: 8079; 75.6% identity (87.8% similar) in 1645 aa overlap (1-1616:1-1588) 10 20 30 40 50 pF1KE0 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGP--SPGSAHSMMGPSPGPPSAGHPIPT :::: : :. :.:::::::::::: .::::::: ::::.::::::::::::..::.:: CCDS34 MSTPTDP-GAMPHPGPSPGPGPSPGPILGPSPGPGPSPGSVHSMMGPSPGPPSVSHPMPT 10 20 30 40 50 60 70 80 90 100 110 pF1KE0 QGPGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQ .: .::..:::::::....:.::. .: . ..::: ::: : ::::: :::::::: CCDS34 MGSTDFPQEGMHQMHKPIDGIHDKGIVEDIHCGSMKGTGMRPP-HPGMGPPQSPMDQHSQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 GY----PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQ :: :::::. ::.:::. ..::. ::: . :: . : ::::..: ::::.::. CCDS34 GYMSPHPSPLGAPEHVSSPMSGGGPTP-PQMPPSQPGALIPG-DPQAMSQPNRGPSPFSP 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE0 NQLHQLRAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGP :::::::::.::::::::::::. ::.:::::: .::.:::. . CCDS34 VQLHQLRAQILAYKMLARGQPLPETLQLAVQGKRTLPGLQQQQQQQQQQQ---------- 180 190 200 210 220 240 250 260 270 280 290 pF1KE0 GPGPGPGPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAA . .. . . :.. :: :. . : :. : :: . .. CCDS34 -----------QQQQQQQQQQQQPQQQPPQPQ----TQQQQQPALVNYNRPSGPGPELSG 230 240 250 260 270 300 310 320 330 340 350 pF1KE0 PTSTPQKLIPPQPTGRPSPAPPAV--PPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRI : :::::: : : :::::::::. ::::. : :.. .:. :.::.:.. :.:::::: CCDS34 P-STPQKLPVPAPGGRPSPAPPAAAQPPAAA-VPGPSVPQPA-PGQPSPVLQLQQKQSRI 280 290 300 310 320 360 370 380 390 400 410 pF1KE0 TPIQKPRGLDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNF .:::::.::::::::::::::::::::::::::::::::: :::::::.:::::::::: CCDS34 SPIQKPQGLDPVEILQEREYRLQARIAHRIQELENLPGSLPPDLRTKATVELKALRLLNF 330 340 350 360 370 380 420 430 440 450 460 470 pF1KE0 QRQLRQEVVVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKH :::::::::.::::::.::::::.:::::::::.:::::.:::::::::::::::::::: CCDS34 QRQLRQEVVACMRRDTTLETALNSKAYKRSKRQTLREARMTEKLEKQQKIEQERKRRQKH 390 400 410 420 430 440 480 490 500 510 520 530 pF1KE0 QEYLNSILQHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMA :::::::::::::::::::::.::::::.:::::.:::::::::::.::::::::::::: CCDS34 QEYLNSILQHAKDFKEYHRSVAGKIQKLSKAVATWHANTEREQKKETERIEKERMRRLMA 450 460 470 480 490 500 540 550 560 570 580 590 pF1KE0 EDEEGYRKLIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKA--E :::::::::::::::.:::::::::::::::::.:: .:: ::.::::::....:: : CCDS34 EDEEGYRKLIDQKKDRRLAYLLQQTDEYVANLTNLVWEHKQAQAAKEKKKRRRRKKKAEE 510 520 530 540 550 560 600 610 620 630 640 650 pF1KE0 NAEGQTPAIGPDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYE :::: :.::::::.::.:::::::::: :.:.::.: : .::::.::.:::::::::: CCDS34 NAEGGESALGPDGEPIDESSQMSDLPVKVTHTETGKVLFGPEAPKASQLDAWLEMNPGYE 570 580 590 600 610 620 660 670 680 690 700 710 pF1KE0 VAPRSDSEESGSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENA :::::::::: :. :::.::::. . :.: . ::.:..::: ::..:::.: CCDS34 VAPRSDSEESDSDYEEEDEEEESSRQET-------EEKILLDPNSEEVSEKDAKQIIETA 630 640 650 660 670 680 720 730 740 750 760 770 pF1KE0 KQDVDDEYGVSQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYN ::::::::.. : ::: ::::.::::..:::.:::::..::.::.::..::::.::::: CCDS34 KQDVDDEYSM-QYSARGSQSYYTVAHAISERVEKQSALLINGTLKHYQLQGLEWMVSLYN 690 700 710 720 730 740 780 790 800 810 820 830 pF1KE0 NNLNGILADEMGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVV :::::::::::::::::::::::::::::::.:::.::::::::::::.::::::::::: CCDS34 NNLNGILADEMGLGKTIQTIALITYLMEHKRLNGPYLIIVPLSTLSNWTYEFDKWAPSVV 750 760 770 780 790 800 840 850 860 870 880 890 pF1KE0 KVSYKGSPAARRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHH :.::::.:: ::..:::::::::::::::::::::::::::::::::::::::::::::: CCDS34 KISYKGTPAMRRSLVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHH 810 820 830 840 850 860 900 910 920 930 940 950 pF1KE0 CKLTQVLNTHYVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMT ::::::::::::::::.::::::::::::::::::::::::::::::::::::::::::: CCDS34 CKLTQVLNTHYVAPRRILLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMT 870 880 890 900 910 920 960 970 980 990 1000 1010 pF1KE0 GEKVDLNEEETILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRH ::.::::::::::::::::::::::::::::::::.::::::::::::::::::..:::: CCDS34 GERVDLNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPEKVEYVIKCDMSALQKILYRH 930 940 950 960 970 980 1020 1030 1040 1050 1060 1070 pF1KE0 MQAKGVLLTDGSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIV :::::.:::::::::::::::.:::::::::::::::::::::::::::.::::...:.. CCDS34 MQAKGILLTDGSEKDKKGKGGAKTLMNTIMQLRKICNHPYMFQHIEESFAEHLGYSNGVI 990 1000 1010 1020 1030 1040 1080 1090 1100 1110 1120 1130 pF1KE0 QGLDLYRASGKFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTT .: .:::::::::::::::::::::::.:::::::::::::::::::.:.: :::::::: CCDS34 NGAELYRASGKFELLDRILPKLRATNHRVLLFCQMTSLMTIMEDYFAFRNFLYLRLDGTT 1050 1060 1070 1080 1090 1100 1140 1150 1160 1170 1180 1190 pF1KE0 KAEDRGMLLKTFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRA :.:::. ::: ::::::.::::::::::::::::::.::::.:::::::::::::::::: CCDS34 KSEDRAALLKKFNEPGSQYFIFLLSTRAGGLGLNLQAADTVVIFDSDWNPHQDLQAQDRA 1110 1120 1130 1140 1150 1160 1200 1210 1220 1230 1240 1250 pF1KE0 HRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 HRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAI 1170 1180 1190 1200 1210 1220 1260 1270 1280 1290 1300 1310 pF1KE0 LEHEEQDEEEDEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDEL :::::..:::::::::::.::::::.:::::::::::.:::::.:::::::::::::::: CCDS34 LEHEEENEEEDEVPDDETLNQMIARREEEFDLFMRMDMDRRREDARNPKRKPRLMEEDEL 1230 1240 1250 1260 1270 1280 1320 1330 1340 1350 1360 1370 pF1KE0 PSWIIKDDAEVERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEI ::::::::::::::::::::::.::::::.:..:::::.::::::: .:::.:.:::. CCDS34 PSWIIKDDAEVERLTCEEEEEKIFGRGSRQRRDVDYSDALTEKQWL---RAIEDGNLEEM 1290 1300 1310 1320 1330 1380 1390 1400 1410 1420 1430 pF1KE0 EEEVRQKKSSRKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTK ::::: :: .:.:. :.: :. :.: : .:.::::::::::::::.::: CCDS34 EEEVRLKKRKRRRNVDKD-----PA-------KEDVEKAKKRRGRPPAEKLSPNPPKLTK 1340 1350 1360 1370 1380 1440 1450 1460 1470 pF1KE0 KMKKIVDAVIKYKD------------------SSGRQLSEVFIQLPSRKELPEYYELIRK .:. :.:.::.::: :::::::::::::::::::::::::::: CCDS34 QMNAIIDTVINYKDRCNVEKVPSNSQLEIEGNSSGRQLSEVFIQLPSRKELPEYYELIRK 1390 1400 1410 1420 1430 1440 1480 1490 1500 1510 1520 1530 pF1KE0 PVDFKKIKERIRNHKYRSLNDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQK :::::::::::::::::::.::::::::::.:::::::::: :::::::::::: :.::: CCDS34 PVDFKKIKERIRNHKYRSLGDLEKDVMLLCHNAQTFNLEGSQIYEDSIVLQSVFKSARQK 1450 1460 1470 1480 1490 1500 1540 1550 1560 1570 1580 1590 pF1KE0 IEKEDDSEGEESEEEEEGEEEGSESESRSVKVKIKLGRKE-KAQDRLKGGRRRPSRGSRA : ::..:: : .::::: .:: ::::..::::::::..:. :..:. :: ..::.:: .: CCDS34 IAKEEESEDESNEEEEEEDEEESESEAKSVKVKIKLNKKDDKGRDKGKG-KKRPNRG-KA 1510 1520 1530 1540 1550 1560 1600 1610 pF1KE0 KPVVSDDDSEEEQEEDRSGSGSEED :::::: ::.:::.: ... :: : CCDS34 KPVVSDFDSDEEQDEREQSEGSGTDDE 1570 1580 1590 >>CCDS34978.1 SMARCA2 gene_id:6595|Hs108|chr9 (1572 aa) initn: 6948 init1: 3931 opt: 6320 Z-score: 2312.9 bits: 440.8 E(32554): 1.6e-122 Smith-Waterman score: 8125; 76.5% identity (88.8% similar) in 1627 aa overlap (1-1616:1-1570) 10 20 30 40 50 pF1KE0 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGP--SPGSAHSMMGPSPGPPSAGHPIPT :::: : :. :.:::::::::::: .::::::: ::::.::::::::::::..::.:: CCDS34 MSTPTDP-GAMPHPGPSPGPGPSPGPILGPSPGPGPSPGSVHSMMGPSPGPPSVSHPMPT 10 20 30 40 50 60 70 80 90 100 110 pF1KE0 QGPGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQ .: .::..:::::::....:.::. .: . ..::: ::: : ::::: :::::::: CCDS34 MGSTDFPQEGMHQMHKPIDGIHDKGIVEDIHCGSMKGTGMRPP-HPGMGPPQSPMDQHSQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 GY----PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQ :: :::::. ::.:::. ..::. ::: . :: . : ::::..: ::::.::. CCDS34 GYMSPHPSPLGAPEHVSSPMSGGGPTP-PQMPPSQPGALIPG-DPQAMSQPNRGPSPFSP 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE0 NQLHQLRAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGP :::::::::.::::::::::::. ::.:::::: .::.:::. . CCDS34 VQLHQLRAQILAYKMLARGQPLPETLQLAVQGKRTLPGLQQQQQQQQQQQ---------- 180 190 200 210 220 240 250 260 270 280 290 pF1KE0 GPGPGPGPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAA . .. . . :.. :: :. . : :. : :: . .. CCDS34 -----------QQQQQQQQQQQQPQQQPPQPQ----TQQQQQPALVNYNRPSGPGPELSG 230 240 250 260 270 300 310 320 330 340 350 pF1KE0 PTSTPQKLIPPQPTGRPSPAPPAV--PPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRI : :::::: : : :::::::::. ::::. : :.. .:. :.::.:.. :.:::::: CCDS34 P-STPQKLPVPAPGGRPSPAPPAAAQPPAAA-VPGPSVPQPA-PGQPSPVLQLQQKQSRI 280 290 300 310 320 360 370 380 390 400 410 pF1KE0 TPIQKPRGLDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNF .:::::.::::::::::::::::::::::::::::::::: :::::::.:::::::::: CCDS34 SPIQKPQGLDPVEILQEREYRLQARIAHRIQELENLPGSLPPDLRTKATVELKALRLLNF 330 340 350 360 370 380 420 430 440 450 460 470 pF1KE0 QRQLRQEVVVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKH :::::::::.::::::.::::::.:::::::::.:::::.:::::::::::::::::::: CCDS34 QRQLRQEVVACMRRDTTLETALNSKAYKRSKRQTLREARMTEKLEKQQKIEQERKRRQKH 390 400 410 420 430 440 480 490 500 510 520 530 pF1KE0 QEYLNSILQHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMA :::::::::::::::::::::.::::::.:::::.:::::::::::.::::::::::::: CCDS34 QEYLNSILQHAKDFKEYHRSVAGKIQKLSKAVATWHANTEREQKKETERIEKERMRRLMA 450 460 470 480 490 500 540 550 560 570 580 590 pF1KE0 EDEEGYRKLIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKA--E :::::::::::::::.:::::::::::::::::.:: .:: ::.::::::....:: : CCDS34 EDEEGYRKLIDQKKDRRLAYLLQQTDEYVANLTNLVWEHKQAQAAKEKKKRRRRKKKAEE 510 520 530 540 550 560 600 610 620 630 640 650 pF1KE0 NAEGQTPAIGPDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYE :::: :.::::::.::.:::::::::: :.:.::.: : .::::.::.:::::::::: CCDS34 NAEGGESALGPDGEPIDESSQMSDLPVKVTHTETGKVLFGPEAPKASQLDAWLEMNPGYE 570 580 590 600 610 620 660 670 680 690 700 710 pF1KE0 VAPRSDSEESGSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENA :::::::::: :. :::.::::. . :.: . ::.:..::: ::..:::.: CCDS34 VAPRSDSEESDSDYEEEDEEEESSRQET-------EEKILLDPNSEEVSEKDAKQIIETA 630 640 650 660 670 680 720 730 740 750 760 770 pF1KE0 KQDVDDEYGVSQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYN ::::::::.. : ::: ::::.::::..:::.:::::..::.::.::..::::.::::: CCDS34 KQDVDDEYSM-QYSARGSQSYYTVAHAISERVEKQSALLINGTLKHYQLQGLEWMVSLYN 690 700 710 720 730 740 780 790 800 810 820 830 pF1KE0 NNLNGILADEMGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVV :::::::::::::::::::::::::::::::.:::.::::::::::::.::::::::::: CCDS34 NNLNGILADEMGLGKTIQTIALITYLMEHKRLNGPYLIIVPLSTLSNWTYEFDKWAPSVV 750 760 770 780 790 800 840 850 860 870 880 890 pF1KE0 KVSYKGSPAARRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHH :.::::.:: ::..:::::::::::::::::::::::::::::::::::::::::::::: CCDS34 KISYKGTPAMRRSLVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHH 810 820 830 840 850 860 900 910 920 930 940 950 pF1KE0 CKLTQVLNTHYVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMT ::::::::::::::::.::::::::::::::::::::::::::::::::::::::::::: CCDS34 CKLTQVLNTHYVAPRRILLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMT 870 880 890 900 910 920 960 970 980 990 1000 1010 pF1KE0 GEKVDLNEEETILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRH ::.::::::::::::::::::::::::::::::::.::::::::::::::::::..:::: CCDS34 GERVDLNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPEKVEYVIKCDMSALQKILYRH 930 940 950 960 970 980 1020 1030 1040 1050 1060 1070 pF1KE0 MQAKGVLLTDGSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIV :::::.:::::::::::::::.:::::::::::::::::::::::::::.::::...:.. CCDS34 MQAKGILLTDGSEKDKKGKGGAKTLMNTIMQLRKICNHPYMFQHIEESFAEHLGYSNGVI 990 1000 1010 1020 1030 1040 1080 1090 1100 1110 1120 1130 pF1KE0 QGLDLYRASGKFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTT .: .:::::::::::::::::::::::.:::::::::::::::::::.:.: :::::::: CCDS34 NGAELYRASGKFELLDRILPKLRATNHRVLLFCQMTSLMTIMEDYFAFRNFLYLRLDGTT 1050 1060 1070 1080 1090 1100 1140 1150 1160 1170 1180 1190 pF1KE0 KAEDRGMLLKTFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRA :.:::. ::: ::::::.::::::::::::::::::.::::.:::::::::::::::::: CCDS34 KSEDRAALLKKFNEPGSQYFIFLLSTRAGGLGLNLQAADTVVIFDSDWNPHQDLQAQDRA 1110 1120 1130 1140 1150 1160 1200 1210 1220 1230 1240 1250 pF1KE0 HRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 HRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAI 1170 1180 1190 1200 1210 1220 1260 1270 1280 1290 1300 1310 pF1KE0 LEHEEQDEEEDEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDEL :::::..:::::::::::.::::::.:::::::::::.:::::.:::::::::::::::: CCDS34 LEHEEENEEEDEVPDDETLNQMIARREEEFDLFMRMDMDRRREDARNPKRKPRLMEEDEL 1230 1240 1250 1260 1270 1280 1320 1330 1340 1350 1360 1370 pF1KE0 PSWIIKDDAEVERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEI ::::::::::::::::::::::.::::::.:..:::::.::::::: .:::.:.:::. CCDS34 PSWIIKDDAEVERLTCEEEEEKIFGRGSRQRRDVDYSDALTEKQWL---RAIEDGNLEEM 1290 1300 1310 1320 1330 1380 1390 1400 1410 1420 1430 pF1KE0 EEEVRQKKSSRKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTK ::::: :: .:.:. :.: :. :.: : .:.::::::::::::::.::: CCDS34 EEEVRLKKRKRRRNVDKD-----PA-------KEDVEKAKKRRGRPPAEKLSPNPPKLTK 1340 1350 1360 1370 1380 1440 1450 1460 1470 1480 1490 pF1KE0 KMKKIVDAVIKYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRS .:. :.:.::.::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 QMNAIIDTVINYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRS 1390 1400 1410 1420 1430 1440 1500 1510 1520 1530 1540 1550 pF1KE0 LNDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEG :.::::::::::.:::::::::: :::::::::::: :.:::: ::..:: : .::::: CCDS34 LGDLEKDVMLLCHNAQTFNLEGSQIYEDSIVLQSVFKSARQKIAKEEESEDESNEEEEEE 1450 1460 1470 1480 1490 1500 1560 1570 1580 1590 1600 pF1KE0 EEEGSESESRSVKVKIKLGRKE-KAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRS .:: ::::..::::::::..:. :..:. :: ..::.:: .::::::: ::.:::.: .. CCDS34 DEEESESEAKSVKVKIKLNKKDDKGRDKGKG-KKRPNRG-KAKPVVSDFDSDEEQDEREQ 1510 1520 1530 1540 1550 1560 1610 pF1KE0 GSGSEED . :: : CCDS34 SEGSGTDDE 1570 >>CCDS83338.1 SMARCA2 gene_id:6595|Hs108|chr9 (1514 aa) initn: 5513 init1: 2496 opt: 2742 Z-score: 1015.3 bits: 200.7 E(32554): 3.1e-50 Smith-Waterman score: 7609; 73.0% identity (85.2% similar) in 1627 aa overlap (1-1616:1-1512) 10 20 30 40 50 pF1KE0 MSTPDPPLGGTPRPGPSPGPGPSPGAMLGPSPGP--SPGSAHSMMGPSPGPPSAGHPIPT :::: : :. :.:::::::::::: .::::::: ::::.::::::::::::..::.:: CCDS83 MSTPTDP-GAMPHPGPSPGPGPSPGPILGPSPGPGPSPGSVHSMMGPSPGPPSVSHPMPT 10 20 30 40 50 60 70 80 90 100 110 pF1KE0 QGPGGYPQDNMHQMHKPMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQ .: .::..:::::::....:.::. .: . ..::: ::: : ::::: :::::::: CCDS83 MGSTDFPQEGMHQMHKPIDGIHDKGIVEDIHCGSMKGTGMRPP-HPGMGPPQSPMDQHSQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 GY----PSPLGGSEHASSPVPASGPSSGPQMSSGPGGAPLDGADPQALGQQNRGPTPFNQ :: :::::. ::.:::. ..::. ::: . :: . : ::::..: ::::.::. CCDS83 GYMSPHPSPLGAPEHVSSPMSGGGPTP-PQMPPSQPGALIPG-DPQAMSQPNRGPSPFSP 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE0 NQLHQLRAQIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPGPGP :::::::::.::::::::::::. ::.:::::: .::.:::. . CCDS83 VQLHQLRAQILAYKMLARGQPLPETLQLAVQGKRTLPGLQQQQQQQQQQQ---------- 180 190 200 210 220 240 250 260 270 280 290 pF1KE0 GPGPGPGPGPAPPNYSRPHGMGGPNMPPPGPSGVPPGMPGQPPGGPPKPWPEGPMANAAA . .. . . :.. :: :. . : :. : :: . .. CCDS83 -----------QQQQQQQQQQQQPQQQPPQPQ----TQQQQQPALVNYNRPSGPGPELSG 230 240 250 260 270 300 310 320 330 340 350 pF1KE0 PTSTPQKLIPPQPTGRPSPAPPAV--PPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRI : :::::: : : :::::::::. ::::. : :.. .:. :.::.:.. :.:::::: CCDS83 P-STPQKLPVPAPGGRPSPAPPAAAQPPAAA-VPGPSVPQPA-PGQPSPVLQLQQKQSRI 280 290 300 310 320 360 370 380 390 400 410 pF1KE0 TPIQKPRGLDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNF .:::::.::::::::::::::::::::::::::::::::: :::::::.:::::::::: CCDS83 SPIQKPQGLDPVEILQEREYRLQARIAHRIQELENLPGSLPPDLRTKATVELKALRLLNF 330 340 350 360 370 380 420 430 440 450 460 470 pF1KE0 QRQLRQEVVVCMRRDTALETALNAKAYKRSKRQSLREARITEKLEKQQKIEQERKRRQKH :::::::::.::::::.::::::.:::::::::.:::::.:::::::::::::::::::: CCDS83 QRQLRQEVVACMRRDTTLETALNSKAYKRSKRQTLREARMTEKLEKQQKIEQERKRRQKH 390 400 410 420 430 440 480 490 500 510 520 530 pF1KE0 QEYLNSILQHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRLMA :::::::::::::::::::::.::::::.:::::.:::::::::::.::::::::::::: CCDS83 QEYLNSILQHAKDFKEYHRSVAGKIQKLSKAVATWHANTEREQKKETERIEKERMRRLMA 450 460 470 480 490 500 540 550 560 570 580 590 pF1KE0 EDEEGYRKLIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKA--E :::::::::::::::.:::::::::::::::::.:: .:: ::.::::::....:: : CCDS83 EDEEGYRKLIDQKKDRRLAYLLQQTDEYVANLTNLVWEHKQAQAAKEKKKRRRRKKKAEE 510 520 530 540 550 560 600 610 620 630 640 650 pF1KE0 NAEGQTPAIGPDGEPLDETSQMSDLPVKVIHVESGKILTGTDAPKAGQLEAWLEMNPGYE :::: :.::::::.::.:::::::::: :.:.::.: : .::::.::.:::::::::: CCDS83 NAEGGESALGPDGEPIDESSQMSDLPVKVTHTETGKVLFGPEAPKASQLDAWLEMNPGYE 570 580 590 600 610 620 660 670 680 690 700 710 pF1KE0 VAPRSDSEESGSEEEEEEEEEEQPQAAQPPTLPVEEKKKIPDPDSDDVSEVDARHIIENA :::::::::: :. :::.::::. . :.: . ::.:..::: ::..:::.: CCDS83 VAPRSDSEESDSDYEEEDEEEESSRQET-------EEKILLDPNSEEVSEKDAKQIIETA 630 640 650 660 670 680 720 730 740 750 760 770 pF1KE0 KQDVDDEYGVSQALARGLQSYYAVAHAVTERVDKQSALMVNGVLKQYQIKGLEWLVSLYN ::::::::.. : ::: ::::.::::..:::.:::::..::.::.::..::::.::::: CCDS83 KQDVDDEYSM-QYSARGSQSYYTVAHAISERVEKQSALLINGTLKHYQLQGLEWMVSLYN 690 700 710 720 730 740 780 790 800 810 820 830 pF1KE0 NNLNGILADEMGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVV :::::::::::::::::::::::::::::::.:::.::::::::::::.::::::::::: CCDS83 NNLNGILADEMGLGKTIQTIALITYLMEHKRLNGPYLIIVPLSTLSNWTYEFDKWAPSVV 750 760 770 780 790 800 840 850 860 870 880 890 pF1KE0 KVSYKGSPAARRAFVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHH :.::::.:: ::..:::::::::::::::::::::::::::::::::::::::::::::: CCDS83 KISYKGTPAMRRSLVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHH 810 820 830 840 850 860 900 910 920 930 940 950 pF1KE0 CKLTQVLNTHYVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMT :::::: CCDS83 CKLTQV------------------------------------------------------ 960 970 980 990 1000 1010 pF1KE0 GEKVDLNEEETILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRH :::::::::::::::::::::::::::::::.::::::::::::::::::..:::: CCDS83 ----DLNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPEKVEYVIKCDMSALQKILYRH 870 880 890 900 910 920 1020 1030 1040 1050 1060 1070 pF1KE0 MQAKGVLLTDGSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGIV :::::.:::::::::::::::.:::::::::::::::::::::::::::.::::...:.. CCDS83 MQAKGILLTDGSEKDKKGKGGAKTLMNTIMQLRKICNHPYMFQHIEESFAEHLGYSNGVI 930 940 950 960 970 980 1080 1090 1100 1110 1120 1130 pF1KE0 QGLDLYRASGKFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGTT .: .:::::::::::::::::::::::.:::::::::::::::::::.:.: :::::::: CCDS83 NGAELYRASGKFELLDRILPKLRATNHRVLLFCQMTSLMTIMEDYFAFRNFLYLRLDGTT 990 1000 1010 1020 1030 1040 1140 1150 1160 1170 1180 1190 pF1KE0 KAEDRGMLLKTFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRA :.:::. ::: ::::::.::::::::::::::::::.::::.:::::::::::::::::: CCDS83 KSEDRAALLKKFNEPGSQYFIFLLSTRAGGLGLNLQAADTVVIFDSDWNPHQDLQAQDRA 1050 1060 1070 1080 1090 1100 1200 1210 1220 1230 1240 1250 pF1KE0 HRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 HRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQAGMFDQKSSSHERRAFLQAI 1110 1120 1130 1140 1150 1160 1260 1270 1280 1290 1300 1310 pF1KE0 LEHEEQDEEEDEVPDDETVNQMIARHEEEFDLFMRMDLDRRREEARNPKRKPRLMEEDEL :::::..:::::::::::.::::::.:::::::::::.:::::.:::::::::::::::: CCDS83 LEHEEENEEEDEVPDDETLNQMIARREEEFDLFMRMDMDRRREDARNPKRKPRLMEEDEL 1170 1180 1190 1200 1210 1220 1320 1330 1340 1350 1360 1370 pF1KE0 PSWIIKDDAEVERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLKAIEEGTLEEI ::::::::::::::::::::::.::::::.:..:::::.::::::: .:::.:.:::. CCDS83 PSWIIKDDAEVERLTCEEEEEKIFGRGSRQRRDVDYSDALTEKQWL---RAIEDGNLEEM 1230 1240 1250 1260 1270 1380 1390 1400 1410 1420 1430 pF1KE0 EEEVRQKKSSRKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTK ::::: :: .:.:. :.: :. :.: : .:.::::::::::::::.::: CCDS83 EEEVRLKKRKRRRNVDKD-----PA-------KEDVEKAKKRRGRPPAEKLSPNPPKLTK 1280 1290 1300 1310 1320 1440 1450 1460 1470 1480 1490 pF1KE0 KMKKIVDAVIKYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRS .:. :.:.::.::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 QMNAIIDTVINYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRS 1330 1340 1350 1360 1370 1380 1500 1510 1520 1530 1540 1550 pF1KE0 LNDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEGEESEEEEEG :.::::::::::.:::::::::: :::::::::::: :.:::: ::..:: : .::::: CCDS83 LGDLEKDVMLLCHNAQTFNLEGSQIYEDSIVLQSVFKSARQKIAKEEESEDESNEEEEEE 1390 1400 1410 1420 1430 1440 1560 1570 1580 1590 1600 pF1KE0 EEEGSESESRSVKVKIKLGRKE-KAQDRLKGGRRRPSRGSRAKPVVSDDDSEEEQEEDRS .:: ::::..::::::::..:. :..:. :: ..::.:: .::::::: ::.:::.: .. CCDS83 DEEESESEAKSVKVKIKLNKKDDKGRDKGKG-KKRPNRG-KAKPVVSDFDSDEEQDEREQ 1450 1460 1470 1480 1490 1500 1610 pF1KE0 GSGSEED . :: : CCDS83 SEGSGTDDE 1510 >>CCDS47865.1 CHD7 gene_id:55636|Hs108|chr8 (2997 aa) initn: 1125 init1: 637 opt: 1127 Z-score: 425.9 bits: 92.6 E(32554): 2.1e-17 Smith-Waterman score: 1462; 28.4% identity (56.3% similar) in 1325 aa overlap (46-1275:229-1493) 20 30 40 50 60 70 pF1KE0 PSPGPGPSPGAMLGPSPGPSPGSAHSMMGPSPGPPSAGHPIPTQGPGGYP--QDNMHQMH . :: .: .: :.:. : . ...:.: CCDS47 SMQQHGQPQQRMSQFSQGQEGLNQGNPFIATSGPGHLSH-VPQQSPSMAPSLRHSVQQFH 200 210 220 230 240 250 80 90 100 110 120 130 pF1KE0 K-PMESMHEKGMSDDPRYNQMKGMGMRSGGHAGMGPPPSPMDQHSQGYPSPLGGSEHASS . : ..: .... .::.. . .... : ....:: ::: .. : CCDS47 HHPSTALHGESVAHSPRFSPNPPQ------QGAVRPQTLNFSSRSQTVPSPTINNSGQYS 260 270 280 290 300 310 140 150 160 170 180 pF1KE0 PVPASGPSSGPQMSSG-------PGGAPLDGADPQ---ALG-QQNRGPTPFNQNQLHQLR : :. ..: ..: ...:.. . :. :.: .: : ..:. .: CCDS47 RYPYSNLNQGLVNNTGMNQNLGLTNNTPMNQSVPRYPNAVGFPSNSGQGLMHQQPIHPSG 320 330 340 350 360 370 190 200 210 220 230 pF1KE0 A-QIMAYKMLARGQPLPDHLQMAVQGKRPMPGMQQQMPTLPPPSVSATGPG-P---GPGP . . : . . .:: . . :: .:.. : :::.: . : : : : CCDS47 SLNQMNTQTMHPSQPQGTY--ASPPPMSPMKAMSNPAGT-PPPQVRPGSAGIPMEVGSYP 380 390 400 410 420 240 250 260 270 280 290 pF1KE0 G-PGPGPGPAPPNYSRPHGMGGPNMPPPGPSGVPP--GMPGQPPGGPPKPWPEG-PMANA . : : :. ::. :.: :: : . . : :: . : . :.: : .. CCDS47 NMPHPQPSHQPPG---AMGIGQRNMGPRNMQQSRPFIGMSSAPRELTGHMRPNGCPGVGL 430 440 450 460 470 480 300 310 320 330 340 350 pF1KE0 AAPTSTPQKLIPPQPTGRPSPAPPAVPPAASPVMPPQTQSPGQPAQPAPMVPLHQKQSRI . : . ..::: : .:. : :. :: :: : .: : :: ... CCDS47 GDPQAIQERLIPGQQ--HPGQQPSF---QQLPTCPPLQPHPGLHHQSSPPHPHHQPWAQL 490 500 510 520 530 540 360 370 380 390 400 410 pF1KE0 TPIQKPRGLDPVEILQEREYRLQARIAHRIQELENLPGSLAGDLRTKATIELKALRLLNF : .:.. : .. ... . . . . .. .. : : ... : .. CCDS47 HP--SPQNT-PQKVPVHQHSPSEPFLEKPVPDMTQVSGPNAQLVKSD-----DYLPSIEQ 550 560 570 580 590 420 430 440 450 460 470 pF1KE0 QRQLRQEVVVCMRRDTALETALNAKAYKRSKRQSLREARITEK--LEKQQKIEQERKRRQ : : ... .... . . .:.. .. . . . .. :. .:. :...:.:. CCDS47 QPQQKKK----KKKNNHIVAEDPSKGFGKDDFPGGVDNQELNRNSLDGSQE-EKKKKKRS 600 610 620 630 640 480 490 500 510 520 530 pF1KE0 KHQEYLNSILQHAKDFKEYHRSVTGKIQKLTKAVATYHANTEREQKKENERIEKERMRRL : .. . .. :. :: .. : : :. : .:.: . : .. : .. CCDS47 KAKKDPKEP-KEPKEKKEPKEPKTPKAPKIPKEPKEKKAKTATPKPKSSK---KSSNKKP 650 660 670 680 690 700 540 550 560 570 580 590 pF1KE0 MAEDEEGYRKLIDQKKDKRLAYLLQQTDEYVANLTELVRQHKAAQVAKEKKKKKKKKKAE .: .:. : . :..: : . ..: . ... :.:. .: CCDS47 DSEASALKKKVNKGKTEGSENSDLDKTPPPSPPPEE--DEDPGVQKRRSSRQVKRKRYTE 710 720 730 740 750 760 600 610 620 630 640 pF1KE0 NAE---GQTPAIGPDGEPLDETSQMSDLPVK-VIHVESG---KILTGTDAPK---AGQLE . : .. : :. : :. :. . . .:. ::... .. : .:. CCDS47 DLEFKISDEEADDADAAGRDSPSNTSQSEQQESVDAEGPVVEKIMSSRSVKKQKESGEEV 770 780 790 800 810 820 650 660 670 680 690 pF1KE0 AWLEMNPGYEVAPRSDSEESGSEEEEEEEEEEQP----QAAQPPTLPVEE-KKKIPDPD- :. :. . .. :. :.... .: .: : . . : . .. .:: CCDS47 EIEEFYVKYKNFSYLHCQWASIEDLEKDKRIQQKIKRFKAKQGQNKFLSEIEDELFNPDY 830 840 850 860 870 880 700 710 720 730 pF1KE0 ------------SDDVSEVDARHIIENAKQDVDD-----EYGVSQALARGLQSYYAVAHA .:: .: ...... . .: . ..:: . ... .. . CCDS47 VEVDRIMDFARSTDDRGEPVTHYLVKWCSLPYEDSTWERRQDIDQAKIEEFEKLMS-REP 890 900 910 920 930 940 740 750 760 770 780 pF1KE0 VTERVDKQSA-----------LMVNGVLKQYQIKGLEWLVSLYNNNLNGILADEMGLGKT ::::.. : :. :..::..:..::. . : : ::::::::::: CCDS47 ETERVERPPADDWKKSESSREYKNNNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKT 950 960 970 980 990 1000 790 800 810 820 830 840 pF1KE0 IQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWAPSVVKVSYKGSPAARRAFV- ::.:... : . : :.::::.:.::::. :: :: :. : : :.:: :.::.. CCDS47 IQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNV-VVYHGSQASRRTIQL 1010 1020 1030 1040 1050 850 860 870 880 890 pF1KE0 -------PQLR----SGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLT :: : : ::....::.:.:. : : .: :. ...::.::.::..::: CCDS47 YEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLL 1060 1070 1080 1090 1100 1110 900 910 920 930 940 950 pF1KE0 QVLNTHYVAPRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNAPFAMTGEKV . :. . ...::::::::: . ::..::.:: :. : : .:: : :. CCDS47 EGLKMMDLE-HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG---------- 1120 1130 1140 1150 1160 960 970 980 990 1000 1010 pF1KE0 DLNEEETILIIRRLHKVLRPFLLRRLKKEVEAQLPEKVEYVIKCDMSALQRVLYRHMQAK ::. :: . ..:. .:.:..:::::..:: .: : : .:. ... .:. :: . : CCDS47 DLKTEEQV---QKLQAILKPMMLRRLKEDVEKNLAPKEETIIEVELTNIQKKYYRAILEK 1170 1180 1190 1200 1210 1220 1020 1030 1040 1050 1060 pF1KE0 G-VLLTDGSEKDKKGKGGTKTLMNTIMQLRKICNHPYMFQHIEESFSEHLGFTGGI---- . ..:. :. :.... .:.::.:.::: :::::... ::.. :.. : . CCDS47 NFTFLSKGG-----GQANVPNLLNTMMELRKCCNHPYLINGAEEKILEEFKETHNAESPD 1230 1240 1250 1260 1270 1070 1080 1090 1100 1110 1120 pF1KE0 VQGLDLYRASGKFELLDRILPKLRATNHKVLLFCQMTSLMTIMEDYFAYRGFKYLRLDGT : . .:.::. :.:..::::.: .:.::.: ::. . :.:::. : . : :.:: CCDS47 FQLQAMIQAAGKLVLIDKLLPKLKAGGHRVLIFSQMVRCLDILEDYLIQRRYPYERIDGR 1280 1290 1300 1310 1320 1330 1130 1140 1150 1160 1170 1180 pF1KE0 TKAEDRGMLLKTFNEPGSEYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDR .... : . :..: :. :.::: :::::::.:: .::: :::::::::..::::: : CCDS47 VRGNLRQAAIDRFSKPDSDRFVFLLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQAR 1340 1350 1360 1370 1380 1390 1190 1200 1210 1220 1230 1240 pF1KE0 AHRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVDQKVIQA--GMFD-----QKSSSHE :::::.. :.. :: : :: :.... :. ::..:. :.:. : . :. :..: CCDS47 CHRIGQSKSVKIYRLITRNSYEREMFDKASLKLGLDKAVLQSMSGRENATNGVQQLSKKE 1400 1410 1420 1430 1440 1450 1250 1260 1270 1280 1290 1300 pF1KE0 RRAFLQAILEHEEQDEEEDEVPD--DETVNQMIARHEEEFDLFMRMDLDRRREEARNPKR . .:. .::: :: .: ..:.. : CCDS47 IEDLLRKGAYGALMDEE-DEGSKFCEEDIDQILLRRTHTITIESEGKGSTFAKASFVASG 1460 1470 1480 1490 1500 1510 1310 1320 1330 1340 1350 1360 pF1KE0 KPRLMEEDELPSWIIKDDAEVERLTCEEEEEKMFGRGSRHRKEVDYSDSLTEKQWLKTLK CCDS47 NRTDISLDDPNFWQKWAKKAELDIDALNGRNNLVIDTPRVRKQTRLYSAVKEDELMEFSD 1520 1530 1540 1550 1560 1570 >>CCDS76510.1 CHD4 gene_id:1108|Hs108|chr12 (1905 aa) initn: 1024 init1: 549 opt: 1065 Z-score: 405.8 bits: 88.2 E(32554): 2.8e-16 Smith-Waterman score: 1342; 32.2% identity (59.6% similar) in 891 aa overlap (739-1561:702-1528) 710 720 730 740 750 760 pF1KE0 NAKQDVDDEYGVSQALARGLQSYYAVAHAVTERVDKQSALM--VNGVLKQYQIKGLEWLV : . ..: . ..:.:. ::..::.:: CCDS76 RGEEGRPGKKLKKVKLRKLERPPETPTVDPTVKYERQPEYLDATGGTLHPYQMEGLNWLR 680 690 700 710 720 730 770 780 790 800 810 820 pF1KE0 SLYNNNLNGILADEMGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSNWAYEFDKWA . .. . :::::::::::.:: ... :... . .::::. .::::. :: ::. :: CCDS76 FSWAQGTDTILADEMGLGKTVQTAVFLYSLYKEGHSKGPFLVSAPLSTIINWEREFEMWA 740 750 760 770 780 790 830 840 850 860 pF1KE0 PSVVKVSYKGSPAARRAFVPQ---------LRSGK------------FNVLLTTYEYIIK :.. :.: :. .: :.. . .:.:: :.::::.:: : CCDS76 PDMYVVTYVGDKDSR-AIIRENEFSFEDNAIRGGKKASRMKKEASVKFHVLLTSYELITI 800 810 820 830 840 850 870 880 890 900 910 920 pF1KE0 DKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLNTHYVAPRRLLLTGTPLQNKLPELWALL : ::..: : .::::.::.::.. :. .::: : ..::::::::::.: ::. :: CCDS76 DMAILGSIDWACLIVDEAHRLKNNQSKFFRVLNG-YSLQHKLLLTGTPLQNNLEELFHLL 860 870 880 890 900 930 940 950 960 970 980 pF1KE0 NFLLPTIFKSCSTFEQWFNAPFAMTGEKVDLNEEETILIIRRLHKVLRPFLLRRLKKEVE ::: : :.. : . : .:. .:. :..:: .: : .::::: .: CCDS76 NFLTPERFHNLEGFLEEF----------ADIAKEDQ---IKKLHDMLGPHMLRRLKADVF 910 920 930 940 950 990 1000 1010 1020 1030 1040 pF1KE0 AQLPEKVEYVIKCDMSALQRVLYRHMQAKGVLLTDGSEKDKKGKGGTKTLMNTIMQLRKI ..: :.: ... ..: .:. :.. .: . . .: :. .:.:..:.:.: CCDS76 KNMPSKTELIVRVELSPMQKKYYKY-----ILTRNFEALNARGGGNQVSLLNVVMDLKKC 960 970 980 990 1000 1010 1050 1060 1070 1080 1090 1100 pF1KE0 CNHPYMFQHIEESFSEHLGFTGGIVQGLDLYRASGKFELLDRILPKLRATNHKVLLFCQM :::::.: . : . .:. .: : :::::. ::...: .:. .:.::.: :: CCDS76 CNHPYLF---PVAAMEAPKMPNGMYDGSALIRASGKLLLLQKMLKNLKEGGHRVLIFSQM 1020 1030 1040 1050 1060 1110 1120 1130 1140 1150 1160 pF1KE0 TSLMTIMEDYFAYRGFKYLRLDGTTKAEDRGMLLKTFNEPGSEYFIFLLSTRAGGLGLNL :... ..::.. ..:.:: :.:: .. : . :: ::.. : :::::::::::.:: CCDS76 TKMLDLLEDFLEHEGYKYERIDGGITGNMRQEAIDRFNAPGAQQFCFLLSTRAGGLGINL 1070 1080 1090 1100 1110 1120 1170 1180 1190 1200 1210 1220 pF1KE0 QSADTVIIFDSDWNPHQDLQAQDRAHRIGQQNEVRVLRLCTVNSVEEKILAAAKYKLNVD .::::::.:::::::.:.:: .:::::::...: . :. : ::::.: .:: :. . CCDS76 ATADTVIIYDSDWNPHNDIQAFSRAHRIGQNKKVMIYRFVTRASVEERITQVAKKKMMLT 1130 1140 1150 1160 1170 1180 1230 1240 1250 1260 pF1KE0 QKVIQAGMFDQKSSSHERRAFLQAILEHEEQDEEEDEVPD-----------------DET . :.. :. .:..: .. :. ::. .. .::. : :.. CCDS76 HLVVRPGL-GSKTGSMSKQE-LDDILKFGTEELFKDEATDGGGDNKEGEDSSVIHYDDKA 1190 1200 1210 1220 1230 1240 1270 1280 1290 1300 1310 1320 pF1KE0 VNQMIARHEEEFDLFMRMDLDRRREEARNPK---RKPRLMEEDELPSWIIKDDAEVE--- ..... :...: . . ... . . :. .. ::.:. :::.. :. CCDS76 IERLLDRNQDETEDTELQGMNEYLSSFKVAQYVVREEEMGEEEEVEREIIKQEESVDPDY 1250 1260 1270 1280 1290 1300 1330 1340 1350 1360 1370 pF1KE0 -----RLTCEEEEEKM---FGRGSRHRKEVDYSD-SLTEKQWLKTLKAIEEGTLEEIEEE : :...: . .:.:.: ::.:.:.: : ...: .. ... CCDS76 WEKLLRHHYEQQQEDLARNLGKGKRIRKQVNYNDGSQEDRDWQD----------DQSDNQ 1310 1320 1330 1340 1350 1380 1390 1400 1410 1420 1430 pF1KE0 VRQKKSSRKRKRDSDAGSSTPTTSTRSRDKDDESKKQKKRGRPPAEKLSPNPPNLTKKMK . .:.. .: : : .: .:. ..:..: : :: :.. CCDS76 SDYSVASEEGDEDFDERSEAPRRPSRKGLRNDKDK--------------PLPPLLAR--- 1360 1370 1380 1390 1440 1450 1460 1470 1480 1490 pF1KE0 KIVDAVIKYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLND : . :. ..:: ..:.. : .: : : . .:. . .: .. CCDS76 --VGGNIEVLGFNARQ-RKAFLNAIMRYGMP--------PQDAFTTQWLVRDLRGKSEKE 1400 1410 1420 1430 1440 1500 1510 1520 1530 1540 pF1KE0 LEKDVML----LCQ----NAQTFN----LEGSLIYEDSIVLQSVFTSVRQKIEKEDDSEG .. : : ::. .:.:: :: : . .. .:.. .:.:... . .: CCDS76 FKAYVSLFMRHLCEPGADGAETFADGVPREG-LSRQHVLTRIGVMSLIRKKVQEFEHVNG 1450 1460 1470 1480 1490 1500 1550 1560 1570 1580 1590 1600 pF1KE0 EESEEE-EEGEEEGSESESRSVKVKIKLGRKEKAQDRLKGGRRRPSRGSRAKPVVSDDDS . : : : ::. . :. : CCDS76 RWSMPELAEVEENKKMSQPGSPSPKTPTPSTPGDTQPNTPAPVPPAEDGIKIEENSLKEE 1510 1520 1530 1540 1550 1560 1616 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 19:55:46 2016 done: Sun Nov 6 19:55:47 2016 Total Scan time: 6.230 Total Display time: 0.980 Function used was FASTA [36.3.4 Apr, 2011]