GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:48:54 Sequence gi568815576r:21669207_21967410 : 298204 bp : 48.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 707 756 50 0 2 92 110 20 0.742 3.00 1.02 Intr + 1804 1853 50 0 2 95 74 88 0.853 5.58 1.03 Intr + 3124 3175 52 1 1 121 92 74 0.998 10.01 1.04 Intr + 5858 5909 52 1 1 77 96 35 0.937 1.68 1.05 Intr + 12093 12184 92 1 2 111 85 192 0.980 20.91 1.06 Intr + 13231 13320 90 0 0 60 109 200 0.999 19.49 1.07 Intr + 13976 14051 76 1 1 97 91 41 0.879 4.39 1.08 Intr + 15547 15707 161 2 2 72 68 278 0.918 23.91 1.09 Intr + 17277 17352 76 2 1 86 68 37 0.865 0.59 1.10 Intr + 17686 17879 194 2 2 19 84 410 0.912 32.91 1.11 Intr + 17897 17983 87 1 0 70 50 57 0.535 0.27 1.12 Intr + 18437 18526 90 1 0 30 89 145 0.946 8.99 1.13 Intr + 18867 18900 34 2 1 105 77 10 0.953 -0.60 1.14 Intr + 19526 19643 118 0 1 116 68 189 0.953 19.22 1.15 Intr + 24610 24666 57 1 0 86 103 86 0.993 7.90 1.16 Intr + 25387 25459 73 1 1 85 66 105 0.998 7.41 1.17 Intr + 25549 25611 63 0 0 66 94 170 0.999 14.31 1.18 Intr + 25731 25864 134 2 2 64 94 186 0.962 16.24 1.19 Term + 26188 26284 97 1 1 91 43 89 0.140 2.14 1.20 PlyA + 26386 26391 6 1.05 2.05 PlyA - 28358 28353 6 1.05 2.04 Term - 32012 31923 90 2 0 89 35 86 0.862 1.12 2.03 Intr - 34272 34164 109 2 1 110 83 169 0.997 18.89 2.02 Intr - 34772 34633 140 2 2 42 67 67 0.553 -0.74 2.01 Init - 40807 40685 123 1 0 86 45 82 0.421 3.87 2.00 Prom - 43581 43542 40 -6.96 3.00 Prom + 44049 44088 40 -4.66 3.01 Init + 67011 67091 81 2 0 81 38 132 0.369 6.47 3.02 Intr + 85797 85965 169 2 1 96 33 98 0.214 4.62 3.03 Intr + 88136 88339 204 0 0 55 91 55 0.390 1.67 3.04 Term + 90913 91043 131 2 2 112 43 97 0.923 5.94 3.05 PlyA + 91313 91318 6 1.05 4.07 PlyA - 93407 93402 6 1.05 4.06 Term - 100114 99998 117 1 0 88 32 124 0.963 5.34 4.05 Intr - 103776 103667 110 1 2 90 94 113 0.898 12.10 4.04 Intr - 109644 109477 168 1 0 79 56 42 0.420 0.02 4.03 Intr - 119602 119488 115 1 1 133 82 34 0.854 7.32 4.02 Intr - 129922 129806 117 1 0 87 116 54 0.958 8.76 4.01 Init - 136814 136644 171 2 0 53 82 91 0.284 4.44 4.00 Prom - 137674 137635 40 -8.56 5.03 PlyA - 138365 138360 6 -1.95 5.02 Term - 138640 138445 196 0 1 102 48 278 0.989 22.08 5.01 Init - 198234 198116 119 0 2 125 89 317 0.977 33.17 5.00 Prom - 220374 220335 40 -2.56 6.09 PlyA - 222357 222352 6 1.05 6.08 Term - 254265 253886 380 1 2 95 43 459 0.946 37.05 6.07 Intr - 256456 256363 94 1 1 116 89 102 0.973 12.64 6.06 Intr - 262085 261942 144 2 0 129 110 29 0.854 9.68 6.05 Intr - 264373 264185 189 1 0 76 78 287 0.987 26.28 6.04 Intr - 265020 264659 362 1 2 66 65 261 0.678 16.54 6.03 Intr - 270474 270326 149 2 2 140 76 209 0.999 24.78 6.02 Intr - 276902 276637 266 2 2 100 96 230 0.632 21.31 6.01 Init - 277905 277876 30 0 0 89 93 2 0.335 0.56 6.00 Prom - 279549 279510 40 -9.06 7.00 Prom + 280645 280684 40 -7.56 7.01 Init + 281987 282010 24 1 0 105 103 2 0.797 3.03 7.02 Intr + 283608 283787 180 2 0 74 54 252 0.858 20.46 7.03 Intr + 283859 283940 82 1 1 112 119 -18 0.903 2.91 7.04 Term + 284229 284254 26 1 2 70 52 14 0.395 -5.61 7.05 PlyA + 284979 284984 6 -0.45 8.11 PlyA - 285119 285114 6 1.05 8.10 Term - 288389 287908 482 0 2 73 45 609 0.585 50.06 8.09 Intr - 289487 289286 202 2 1 96 113 398 0.990 41.96 8.08 Intr - 290026 289926 101 2 2 100 96 144 0.999 16.23 8.07 Intr - 290530 290381 150 2 0 86 40 312 0.989 26.33 8.06 Intr - 291243 291115 129 1 0 44 97 277 0.988 24.87 8.05 Intr - 293396 293223 174 0 0 66 72 304 0.951 26.51 8.04 Intr - 293687 293541 147 0 0 98 42 315 0.985 28.11 8.03 Intr - 294822 294717 106 0 1 140 54 131 0.985 14.69 8.02 Intr - 295109 294955 155 0 2 105 113 301 0.998 34.09 8.01 Intr - 296169 296079 91 0 1 129 71 100 0.994 11.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:21669207_21967410|GENSCAN_predicted_peptide_1|548_aa XTLPVLNTLTFMVARSQPFVYPVCTPDGIVFDLLNIVPWLKKYGTNPSNGEKLDGRSLIK LNFSKNSEGKYHCPVLFTVFTNNTHIVAVRTTGNVYAYEAVEQLNIKAKNFRDLLTDEPF SRQDIITLQDPTNLDKFNVSNFYHVKNNMKIIDPDEEKAKQDPSYYLKNTNAETRETLQE LYKEFKGDEILAATMKAPEKKKVDKLNAAHYSTGKVSASFTSTAMVPETTHEAAAIDEDV LRYQFVKKKGYVRLHTNKGDLNLELHCDLVGVEASHSPCPKVISGSSDSHVLNIGLPLEI QPVTRRDPQQCPVSFLGGRVGSMACAQTPKTCENFIRLCKKHYYDGTIFHRSIRNFVIQG GDPTGTGTGGESYWGKPFKDEFRPNLSHTGRGILSMANSGPNSNRSQFFITFRSCAYLDK KHTIFGRVVGGFDVLTAMENVESDPKTDRPKEEIRIDATTVFVDPYEEADAQIAQERKTQ LKVAPETKVKSSQPQAGSQGPQTFRQGVGKYINPAATKRAAEEEPSTSATVPMSKKKPSR GFGDFSSW >gi568815576r:21669207_21967410|GENSCAN_predicted_CDS_1|1647_bp ngtacattacctgtgctgaatacactcacttttatggtggcaagaagccagccctttgtc tacccagtctgcactcccgatggcatcgtctttgacttactgaacattgttccatggctt aagaagtacgggaccaaccccagcaatggagagaagctggacgggaggtccctgatcaag ctgaacttttccaagaacagtgaggggaagtaccactgcccagtgctgtttaccgtgttc accaacaacacccacatcgtggctgtgaggacgaccggcaacgtctacgcctatgaggca gtggaacagctaaatatcaaggccaagaacttccgggacctgctgaccgacgagcccttc tcccggcaggacatcatcaccctccaggaccccaccaatttggacaagttcaatgtctct aacttctatcatgtgaagaataacatgaaaataatagacccagatgaagagaaggccaaa caggacccgtcttattatctgaaaaatacaaatgccgagacccgagagaccctgcaggag ctctacaaggagttcaaaggggacgagattctggcagccaccatgaaggccccggagaag aagaaagtggacaagctgaacgctgcccactattccacagggaaggtcagcgcttccttc acctccaccgcgatggtcccggagaccacacatgaagcagctgccatcgacgaggatgtg ctgcgctaccagtttgtgaagaagaagggctacgtgcggctgcacaccaacaagggcgac ctcaacctggagctgcactgcgacctggtgggtgtggaggccagccactccccatgcccc aaggtcatctctgggtcatctgacagccatgtcttaaatatagggctgccactggaaatt cagcctgtcactaggcgggacccgcagcagtgccctgtgtccttcctagggggccgcgtt ggcagcatggcctgcgcacagacaccaaaaacctgcgaaaacttcatcaggctttgcaag aagcattattacgatggcaccatcttccacagatccatccggaactttgtgatccaaggg ggcgaccccacaggcacaggcacgggtggggagtcatactgggggaagcccttcaaagac gagttccggcccaacctctcgcacacgggccgcggcatcctcagcatggccaactccggg cccaacagcaacaggtctcaattcttcatcacgtttcgctcctgtgcctacctggacaag aagcataccatctttggacgggttgttgggggctttgacgtactgacagccatggagaat gtggagagtgaccccaaaactgaccgccctaaggaggagatccgcattgatgccactaca gtgttcgtggacccctatgaggaggccgatgcccagattgcgcaggagcggaagacacag ctcaaggtagccccggagaccaaagtgaagagcagccagccccaggcagggagccagggc ccccagaccttccgccagggcgtgggcaagtacatcaacccagcagccacgaagcgagca gcagaggaagagccctcaaccagtgccactgtccccatgtccaagaagaagcccagtcgg ggttttggggacttcagctcctggtag >gi568815576r:21669207_21967410|GENSCAN_predicted_peptide_2|153_aa MEELMLSRNDSVLHPSSGHDIPPASGHELPASSYVMTTDPQLTAAGLDSPGSGQHRFVPE STRSQSHGTFFFQSFQGSQGRAYLFNSVVNVGCGPAEERVLLTGLHAVADIYCENCKTTL GWKYEHAFESSQKYKEGKFIIELAHMIKDNGWE >gi568815576r:21669207_21967410|GENSCAN_predicted_CDS_2|462_bp atggaggagctgatgctcagtcggaatgacagtgttctccacccatcctcaggtcatgac atccccccagcctcaggtcatgaactccccgcatcctcatacgtcatgaccaccgatcct cagctaacagccgcggggctggactcccctggttctgggcagcaccgcttcgttccagaa agcactcgcagccaatctcacgggaccttcttctttcagtcctttcaggggagccaggga cgcgcctacctcttcaattccgtggtgaacgtgggctgcggccctgcagaggagagggtc cttctcaccgggctgcatgcggttgccgacatctactgcgagaactgcaagaccacgctc gggtggaaatacgagcatgcctttgagagcagtcagaaatataaggaaggaaaattcatc attgagcttgctcatatgatcaaagacaatggctgggagtaa >gi568815576r:21669207_21967410|GENSCAN_predicted_peptide_3|194_aa MGRGRAGPAAPEAEASLRAASSSSAVHNRLRFSLFKALFPSEKSRTHCGWLPPRGLADGP RRGRSTGGTCVGHLLTLFSTFIASCVWSMVVLHRAETAGIPVLGTQPSTSQVTVPEHGTG VMFPVDSRESICRPDVWMWDIIIRKTQTLNFGEGLKNHLENPRSTDIPIQLEEQLTALTQ VTTCRANLEDIWGT >gi568815576r:21669207_21967410|GENSCAN_predicted_CDS_3|585_bp atgggccgagggcgtgcagggcccgcagctccagaggctgaggcgagcttgcgcgcagcc tcctcgtccagcgccgtgcacaaccgtcttcgcttcagtctattcaaggctctcttccct tcagagaagagccggacacactgtggctggctccctccccggggcctggctgatggtccc cggaggggccgcagcactggaggcacatgtgtgggccacctgctcaccctcttttccact ttcatcgcttcctgtgtctggagcatggtggttctgcatcgagctgaaactgctgggatt cctgtgcttgggacacagcccagcacatcacaggtgacagttccagagcatggaaccggt gtcatgtttccagtcgattcaagggagagcatctgtaggcccgatgtgtggatgtgggac attattatacgaaaaactcagactctaaactttggtgaaggtctgaagaaccacctggag aaccctaggagcactgacatccccattcaactcgaggaacagctcacagccctaacacaa gttaccacatgcagagcaaatctggaagacatctggggaacataa >gi568815576r:21669207_21967410|GENSCAN_predicted_peptide_4|265_aa METDLYKLLKTQHLSNDHICYFLYQILRGLKYIHSANVLHRDLKPSNLLLNTTCDLKICD FGLARVADPDHDHTGFLTEYVATRWYRAPEIMLNSKGYTKSIDIWSVGCILAEMLSNRPI FPGKHYLDQLNHILAWMPCLSLSSHCGQGDSGEYGPVSVTIPTAQPRSVGKPERSPIANT VGWIRGAVDAALDLLDKMLTFNPHKRIEVEQALAHPYLEQYYDPSDEPIAEAPFKFDMEL DDLPKEKLKELIFEETARFQPGYRS >gi568815576r:21669207_21967410|GENSCAN_predicted_CDS_4|798_bp atggaaacagatctttacaagctcttgaagacacaacacctcagcaatgaccatatctgc tattttctctaccagatcctcagagggttaaaatatatccattcagctaacgttctgcac cgtgacctcaagccttccaacctgctgctcaacaccacctgtgatctcaagatctgtgac tttggcctggcccgtgttgcagatccagaccatgatcacacagggttcctgacagaatat gtggccacacgttggtacagggctccagaaattatgttgaattccaagggctacaccaag tccattgatatttggtctgtaggctgcattctggcagaaatgctttctaacaggcccatc tttccagggaagcattatcttgaccagctgaaccacattttggcttggatgccatgccta tccctgagcagtcactgtggccaaggggactcaggtgaatatggcccagtgagtgtcacc attcccacagcacagcccaggagcgtgggcaagcctgagcgctcccctatagcaaacact gttggatggattagaggggcagtggatgctgctctggacttattggacaaaatgttgaca ttcaacccacacaagaggattgaagtagaacaggctctggcccacccatatctggagcag tattacgacccgagtgacgagcccatcgccgaagcaccattcaagttcgacatggaattg gatgacttgcctaaggaaaagctcaaagaactaatttttgaagagactgctagattccag ccaggatacagatcttaa >gi568815576r:21669207_21967410|GENSCAN_predicted_peptide_5|104_aa MAAAAAAGAGPEMVRGQVFDVGPRYTNLSYIGEGAYGMVCSAYDNVNKVRVAIKKISPFE HQTYCQRTLREIKILLRFRHENIIGINDIIRAPTIEQMKDVYPF >gi568815576r:21669207_21967410|GENSCAN_predicted_CDS_5|315_bp atggcggcggcggcggcggcgggcgcgggcccggagatggtccgcgggcaggtgttcgac gtggggccgcgctacaccaacctctcgtacatcggcgagggcgcctacggcatggtgtgc tctgcttatgataatgtcaacaaagttcgagtagctatcaagaaaatcagcccctttgag caccagacctactgccagagaaccctgagggagataaaaatcttactgcgcttcagacat gagaacatcattggaatcaatgacattattcgagcaccaaccatcgagcaaatgaaagat gtgtatcctttttag >gi568815576r:21669207_21967410|GENSCAN_predicted_peptide_6|537_aa MCLFPVREKPGEAWRPSCPGLAAGPRDALGMSSGAPQKSSPMASGAEETPGFLDTLLQDF PALLNPEDPLPWKAPGTVLSQEEVEGELAELAMGFLGSRKAPPPLAAALAHEAVSQLLQT DLSEFRKLPREEEEEEEDDDEEEKAPVTLLDAQSLAQSFFNRLWEVAGQWQKQVPLAARA SQRQWLVSIHAIRNTRRKMEDRHVSLPSFNQLFGLSVSAPVQTLGTASEDRGEGGGPPGT EKSCPPRLRQERHPSAPVLAQQETKKGNSDPVNRAYFAVFDGHGGVDAARYAAVHVHTNA ARQPELPTDPEGALREAFRRTDQMFLRKAKRERLQSGTTGVCALIAGATLHVAWLGDSQV ILVQQGQVVKLMEPHRPERQDEKARIEALGGFVSHMDCWRVNGTLAVSRAIGDVFQKPYV SGEADAASRALTGSEDYLLLACDGFFDVVPHQEVVGLVQSHLTRQQGSGLRVAEELVAAA RERGSHDNITVMVVFLRDPQELLEGGNQGEGDPQAEGRRQDLPSSLPEPETQAPPRS >gi568815576r:21669207_21967410|GENSCAN_predicted_CDS_6|1614_bp atgtgcctctttccagtgagagagaagccgggtgaagcctggagaccctcttgccctggc ctagctgcaggcccccgggatgctttgggcatgtcctctggagccccacagaagagcagc ccaatggccagtggagctgaggagaccccaggcttcctggacacgctcctgcaagacttc ccagccctgctgaacccagaggaccctctgccatggaaggccccagggacggtgctcagc caggaggaggtggagggcgagctggctgagctggccatgggctttctgggcagcaggaag gccccgccaccacttgctgctgctctggcccacgaagcagtttcacagctgctacagaca gacctttccgaattcaggaagttgcccagggaggaagaagaagaggaggaggacgatgac gaggaggaaaaggcccctgtgaccttgctggatgcccaaagcctggcacagagtttcttt aaccgcctttgggaagtcgccggccagtggcagaagcaggtgccattggctgcccgggcc tcacagcggcagtggctggtctccatccacgccatccggaacactcgccgcaagatggag gaccggcacgtgtccctcccttccttcaaccagctcttcggcttgtctgtgagtgctccc gtccagaccctggggacagcttcggaggaccggggcgagggtgggggccccccaggtacc gagaagagctgcccaccaaggctgagacaggagagacacccgtcagcccctgtacttgcc cagcaagagactaagaagggcaacagtgaccctgtgaaccgcgcctactttgctgtgttt gatggtcacggaggcgtggatgctgcgaggtacgccgctgtccacgtgcacaccaacgct gcccgccagccagagctgcccacagaccctgagggagccctcagagaagccttccggcgc accgaccagatgtttctcaggaaagccaagcgagagcggctgcagagcggcaccacaggt gtgtgtgcgctcattgcaggagcgaccctgcacgtcgcctggctcggggattcccaggtc attttggtacagcagggacaggtggtgaagctgatggagccacacagaccagaacggcag gatgagaaggcgcgcattgaagcattgggtggctttgtgtctcacatggactgctggaga gtcaacgggaccctggccgtctccagagccatcggggatgtcttccagaagccctacgtg tctggggaggccgatgcagcttcccgggcgctgacgggctccgaggactacctgctgctt gcctgtgatggcttctttgacgtcgtaccccaccaggaagttgttggcctggtccagagc cacctgaccaggcagcagggcagcgggctccgtgtcgccgaggagctggtggctgcggcc cgggagcggggctcccacgacaacatcacggtcatggtggtcttcctcagggacccccaa gagctgctggagggcgggaaccagggagaaggggacccccaggcagaagggaggaggcag gacttgccctccagccttccagaacctgagacccaggctccaccaagaagctag >gi568815576r:21669207_21967410|GENSCAN_predicted_peptide_7|103_aa MALCVQQVLSPRGSVSRKLLPAGPAHFPSLAAAGAAGAAQAQSGLQAGGAGPLLPPGGYV DGDCGPLVLWVGGWMGKAIPRSQRPEIKQLLPSIAASKPAMAS >gi568815576r:21669207_21967410|GENSCAN_predicted_CDS_7|312_bp atggccctttgtgtacagcaggtgctgtctcctcgcggctccgtgtcccgcaagctgctg ccggccggccccgcccacttcccgtccctggccgccgcgggcgccgcgggcgccgcgcag gcgcagtcgggcctccaggctggcggggccggacctctgctgccccctggcggctacgtg gacggtgactgcggccctttagtgctttgggtgggcggctggatggggaaagcaatccct aggtcacagcgcccagaaattaagcaacttctgccgtcaatagctgcctcgaagccagca atggccagttga >gi568815576r:21669207_21967410|GENSCAN_predicted_peptide_8|578_aa VEATSRKEKAKQRPLALNTVEMLRVASSSLGMGPQHAMQTAERLYTQGYISYPRTETTHY PENFDLKGSLRQQANHPYWADTVKRLLAEGINRPRKGHDAGDHPPITPMKSATEAELGGD AWRLYEYITRHFIATVSHDCKYLQSTISFRIGPELFTCSGKTVLSPGFTEVMPWQSVPLE ESLPTCQRGDAFPVGEVKMLEKQTNPPDYLTEAELITLMEKHGIGTDASIPVHINNICQR NYVTVESGRRLKPTNLGIVLVHGYYKIDAELVLPTIRSAVEKQLNLIAQGKADYRQVLGH TLDVFKRKFHYFVDSIAGMDELMEVSFSPLAATGKPLSRCGKCHRFMKYIQAKPSRLHCS HCDETYTLPQNGTIKLYKELRCPLDDFELVLWSSGSRGKSYPLCPYCYNHPPFRDMKKGM GCNECTHPSCQHSLSMLGIGQCVECESGVLVLDPTSGPKWKVACNKCNVVAHCFENAHRV RVSADTCSVCEAALLDVDFNKAKSPLPGDETQHMGCVFCDPVFQELVELKHAASCHPMHR GGPGRRQGRGRGRARRPPGKPNPRRPKDKMSALAAYFV >gi568815576r:21669207_21967410|GENSCAN_predicted_CDS_8|1737_bp gtggaggccacaagcaggaaagaaaaggccaagcagaggcccctggccctgaacactgtg gagatgctgcgtgtggccagctcttctctgggcatggggccgcagcacgccatgcagacg gctgagcggctctacacgcaaggctacatcagctacccacggacagagaccacccactac cctgagaactttgacctgaagggctctctgcggcagcaggccaaccacccctactgggcc gacacggtgaagcggttgttagcagaaggtatcaaccgcccgcggaaaggccatgacgcc ggcgaccatccccccatcacccccatgaagtctgccacagaggccgaattagggggtgac gcgtggcggctctatgagtacatcaccagacacttcatcgccacggtcagccatgactgc aagtacctgcagagcaccatctccttcagaattgggcccgagctcttcacctgctccggg aagaccgtcctctcaccaggcttcacggaggtcatgccctggcagagcgtgcccctggag gagagcctgcccacttgccagcggggtgatgccttccctgtgggcgaggtgaagatgctg gagaagcagacgaacccacccgactacctgacggaggccgagctcatcacgctcatggag aagcatggcatcggcacggatgccagcatccctgtgcatatcaacaacatctgccagcgc aactatgtcacggtggagagcgggcgccggctcaagcccaccaacctcggcatcgtcctg gtgcacggctactataagattgatgcagagctggtgctccccaccatccgcagtgcagtg gagaagcagctgaacctgatcgcccagggcaaggccgactaccgccaggtcctgggccac accctggacgtgttcaagaggaagttccactactttgtcgactccattgctggcatggat gagttgatggaggtgtctttctcgcccctggcggccacaggcaagcccctctcacgctgt gggaagtgccaccgcttcatgaagtacatccaggccaagccaagccgcctgcactgctcc cactgcgatgagacctacacgctcccccagaacggcaccatcaagctctacaaggagctc cgctgccctctggatgacttcgagctggtcctgtggtcatcaggctctcggggcaagagc tacccgctgtgcccctactgctacaaccacccacccttccgagacatgaagaaaggcatg ggctgcaacgagtgtacgcacccctcctgccagcactcgctgagcatgctgggcatcggc cagtgcgtggaatgtgagagcggggtgctggtgctggaccccacctcgggccccaagtgg aaggtggcctgcaacaagtgcaacgtggtagcgcactgcttcgagaacgcccaccgcgtg cgggtgtccgccgacacctgcagtgtctgtgaggccgccttgcttgatgtggacttcaac aaggccaagtccccactcccgggcgatgagacgcagcacatgggctgcgtcttttgtgac cccgtcttccaggagctggtggagctgaagcatgcggcctcctgccaccccatgcaccgc ggtggaccagggagaaggcagggtcgagggcggggccgggccaggaggccccctgggaag cccaaccccagacggcccaaggacaagatgtcagccctggccgcctactttgtatga