GENSCAN 1.0    Date run: 4-Nov-116    Time: 13:06:14

Sequence gi568815576f:36764043_36977820 : 213778 bp : 48.73% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -   3323   3264   60  2  0  114   89    73 0.161   8.93
 1.03 Intr -   3820   3741   80  2  2   89   93    86 0.225   8.47
 1.02 Intr -  11752  11632  121  1  1  113  109    66 0.790  11.27
 1.01 Init -  15912  15841   72  0  0   84   41    66 0.347   2.57
 1.00 Prom -  15961  15922   40                              -8.06

 2.14 PlyA -  16422  16417    6                               1.05
 2.13 Term -  18241  17930  312  1  0   -3   48   333 0.563  15.20
 2.12 Intr -  32393  32297   97  1  1   74   76    37 0.071   1.11
 2.11 Intr -  44071  43969  103  2  1   90   25    65 0.029  -0.37
 2.10 Intr -  48962  48945   18  0  0   91   94    29 0.381   0.38
 2.09 Intr -  49713  49604  110  2  2  139   76   165 0.938  20.33
 2.08 Intr -  51193  51061  133  2  1  134   86   268 0.999  30.90
 2.07 Intr -  51970  51932   39  2  0   99   75    23 0.550   0.30
 2.06 Intr -  52970  52903   68  1  2   50   89   114 0.332   6.25
 2.05 Intr -  58123  58026   98  1  2   92   63   102 0.325   6.91
 2.04 Intr -  63727  63678   50  2  2  112   92    11 0.821   2.30
 2.03 Intr -  65224  65069  156  2  0  -18   68   144 0.411   1.78
 2.02 Intr -  67801  67664  138  2  0  -19  -75   458 0.433  19.24
 2.01 Init -  67912  67855   58  1  1   24   70    46 0.554  -2.13
 2.00 Prom -  70560  70521   40                              -4.86

 3.00 Prom +  78560  78599   40                              -4.26
 3.01 Sngl +  78723  79148  426  2  0   63   42   217 0.774  10.89
 3.02 PlyA +  80710  80715    6                               1.05

 4.02 PlyA -  80942  80937    6                               1.05
 4.01 Sngl -  82831  82427  405  1  0   81   51   154 0.676   7.30
 4.00 Prom -  87965  87926   40                              -3.96

 5.00 Prom +  90500  90539   40                              -6.06
 5.01 Init +  97130  97161   32  1  2  111  105    77 0.998  10.99
 5.02 Intr + 100003 100087   85  1  1   67   41   157 0.997   8.72
 5.03 Intr + 100877 101030  154  1  1  124   96   321 0.999  36.15
 5.04 Intr + 103350 103420   71  1  2   75   76   176 0.829  14.00
 5.05 Intr + 105467 105604  138  1  0   72   77    29 0.639   0.86
 5.06 Intr + 106373 106500  128  1  2   71   84   123 0.988   9.68
 5.07 Intr + 107610 107667   58  0  1  109   76    65 0.991   6.29
 5.08 Intr + 108285 108383   99  2  0   92  105   134 0.183  15.81
 5.09 Intr + 111611 111897  287  1  2  119   -1   301 0.515  20.44
 5.10 Intr + 111987 112052   66  0  0   93  100    66 0.966   6.32
 5.11 Term + 113586 113781  196  0  1   89   49   310 0.996  24.08
 5.12 PlyA + 113949 113954    6                               1.05

 6.03 PlyA - 115645 115640    6                               1.05
 6.02 Term - 138553 138383  171  1  0  137   46    54 0.114   3.93
 6.01 Init - 141408 141352   57  0  0   87   69    -4 0.300  -0.94
 6.00 Prom - 146314 146275   40                              -2.26

 7.00 Prom + 152043 152082   40                              -4.16
 7.01 Init + 158166 158241   76  2  1   75  113   159 0.992  16.35
 7.02 Intr + 159202 159325  124  2  1   53   61   290 0.950  22.44
 7.03 Intr + 162026 162135  110  2  2   83   77    65 0.817   4.83
 7.04 Intr + 165360 165517  158  1  2   88   75   174 0.913  15.83
 7.05 Intr + 165597 165765  169  2  1   98   95   121 0.976  13.32
 7.06 Intr + 166333 166468  136  2  1   55   84   125 0.999   8.43
 7.07 Intr + 166613 166788  176  2  2  117   79    69 0.670   8.48
 7.08 Intr + 168723 168862  140  1  2   98  100   192 0.993  21.58
 7.09 Intr + 169790 169952  163  1  1   65   77   141 0.496  10.25
 7.10 Intr + 170640 170873  234  1  0   89   75    39 0.357   0.36
 7.11 Intr + 171273 171399  127  1  1   64   94   200 0.813  17.94
 7.12 Intr + 171588 171645   58  0  1  103   91    57 0.997   6.39
 7.13 Intr + 172507 172610  104  0  2  101   84    46 0.998   4.47
 7.14 Term + 173335 174460 1126  1  1  130   53   607 0.662  52.88
 7.15 PlyA + 176375 176380    6                               1.05

 8.06 PlyA - 177019 177014    6                              -0.45
 8.05 Term - 178470 178325  146  1  2   67   53    57 0.030  -1.83
 8.04 Intr - 186596 186487  110  1  2   77   77    60 0.395   3.73
 8.03 Intr - 186892 186760  133  2  1   55   44    88 0.275   0.80
 8.02 Intr - 188993 188841  153  0  0   69   53   240 0.767  18.64
 8.01 Init - 189681 189606   76  0  1   72  113   180 0.996  18.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 2302 2479 178 0 1 100 117 77 0.817 11.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:36764043_36977820|GENSCAN_predicted_peptide_1|111_aa MSLCRSHQEALWKQIVGSTSSVQLASPGPEAHPLASLAAALVLWVRLAGVLVTMVKLAAK CILAGDPAVGKTALAQIFRSDGAHFQKSYTLTTGMDLVVKTVPVPDTGDSV >gi568815576f:36764043_36977820|GENSCAN_predicted_CDS_1|333_bp atgagcttgtgtcggagtcaccaggaggccttatggaagcagattgtgggctccacctcc agtgttcagctggccagcccagggcccgaagcccacccactcgcgtctctagcagccgct cttgtcctctgggtacggctcgcgggagtgttggttaccatggtgaagctggcagccaaa tgcatcctggcaggagacccagcagtgggcaagaccgccctggcacagatcttccgcagt gatggagcccatttccagaaaagctacaccctgacaacaggaatggatttggtggtgaag acagtgccagttcctgacacgggagacagtgtg >gi568815576f:36764043_36977820|GENSCAN_predicted_peptide_2|459_aa MSCAGSWEYSDAYDTLVTLKEEEEEEEEEEEEEEEEEKKKKKKKKKKKKKKKKKKNHINK KKKNHCQCGTEINAVSFPPCCERAPELDKLIVMLGEDSSMDNCSTAMITVNTGQYESDMS CHPLRAQKSDPQAWDLDLKVCSIAGTAILPPSGVNQFERVAQGIAENCRMSMTDLLNAED IKKAVGAFSGVLKQDPKIQKIAATDSFDHKKFFQMVGLKKKSADDVKKVFHMLDKDKSGF IEEDELGFILKGFSPDARDLSAKETKMLMAAGDKDGDGKIGVDGFNVSTEKIRMPRVHTC LHAVLVTFFLKPIPGDGSTALCFKAIALRLDRAGCQICISLLLAGHFAYTICYGPKAAAN KSKGGSPSALLQQTISRLLLPPADRIRTSRPPCPAEATAQASTLHSGRTEGESVPFPLPF IEMATAKYCISGDINNRAHVNSGKKNSKGHYREKAVHQM >gi568815576f:36764043_36977820|GENSCAN_predicted_CDS_2|1380_bp atgtcctgtgctggatcctgggagtacagcgatgcatatgacacacttgtcaccctcaaa gaagaagaagaagaagaagaagaagaagaagaagaggaagaagaagaagagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaatcatattaacaag aagaagaagaatcattgtcaatgtggcaccgaaatcaacgcagtttcctttccaccctgc tgcgaaagggcccctgagctcgacaagctcatagtcatgttgggagaagacagttccatg gataattgctccacagcaatgatcacagtaaacacaggtcaatatgagagtgacatgtcc tgccatcctttgagggcccagaagtcagatccacaggcctgggacttggacctgaaagtt tgtagcattgcagggacagccatcttgccaccctcaggggtcaaccagtttgagagagtg gcacaaggtattgctgagaattgcaggatgtcgatgacagacttgctgaacgctgaggac atcaagaaggcggtgggagcctttagcggtgttctaaaacaggaccccaagatacagaaa 
attgcagctaccgactccttcgaccacaaaaagttcttccaaatggtcggcctgaagaaa aagagtgcggatgatgtgaagaaggtgtttcacatgctggacaaggacaaaagtggcttc atcgaggaggatgagctgggattcatcctaaaaggcttctccccagatgccagagacctg tctgctaaagaaaccaagatgctgatggctgctggagacaaagatggggacggcaaaatt ggggttgacggcttcaatgtcagcacagagaagatccgcatgccccgtgtgcacacgtgc ctgcacgcagtcctagttacattcttcctgaaacccattcccggcgatggatccactgcc ctctgctttaaggctatagctctgcggctggacagagctggatgccaaatctgcatctcc ctccttcttgctggacactttgcatataccatctgctatggtccgaaggcggcagcaaat aagtcaaaaggcggcagcccttctgccctcctgcagcagacaatttcccggctgctgctg ccccctgctgacagaatcaggacatcgcggcctccgtgccccgccgaggccacagctcag gcctccactttgcattccgggcgcacagaaggtgagtcagttccatttcctctgcccttt attgagatggccaccgccaagtactgcatcagcggagacataaacaaccgggcccacgtg aacagtggaaaaaagaacagcaaaggtcattaccgagagaaggcagtgcaccagatgtga >gi568815576f:36764043_36977820|GENSCAN_predicted_peptide_3|141_aa MEYKEINVVFMPANTTILQPMNQGVISTFKSYYLRNTFYKATASINSDSSDGGGQNKLKT FWKRLIVLNATQHIFDSWEEVTISTLSGVWKKLIPTLIDDFEEFKTSVEEGTADVVEIAR ELELEVESEDMTALLQSHDQT >gi568815576f:36764043_36977820|GENSCAN_predicted_CDS_3|426_bp atggagtacaaggagattaatgtcgttttcatgcctgctaacacaaccattctgcagccc atgaatcaaggagtaatttcaactttcaagtcttattatttaagaaacacattttataag gctacagcttccataaacagtgattcctctgatggaggtggacaaaataaattgaaaacc ttctggaaacgactcattgttctaaatgccactcaacatatttttgattcatgggaggag gtcacaatatcaacattatcaggagtttggaagaagttgattccaaccctcatagacgac tttgaggagttcaagacttcagtggaagaaggaactgcagatgtggtagaaatagcaaga gaactagaattagaagtagagtctgaagatatgactgcattgctgcaatctcatgatcaa acctga >gi568815576f:36764043_36977820|GENSCAN_predicted_peptide_4|134_aa MDTALDQTIILTMTPGKNHGMNCHGCSSGLQSIRFSASHTSQGLRSPGWLPGTVLGPGDS DTKQVQHKVRQGSETQELVRLAQDEAGGEKQERRGQEISSGQTKEGLISQAMEFELYQAG QERCGKDLNRGDTG >gi568815576f:36764043_36977820|GENSCAN_predicted_CDS_4|405_bp atggacactgctctggatcagacaatcattttgactatgacccctggaaaaaatcatggc atgaactgccacggatgcagcagtggcttacagagcattcgcttctcggcatcacacacc agccaaggtttacggagccccggatggttgccaggcacagtgctaggccctggggattca 
gacacaaagcaggtacagcacaaagtcaggcaaggctcagagacccaggagctggttcgt ttggctcaagatgaagcaggaggggagaaacaggagaggagaggccaggagatcagcagc ggacagaccaaggagggccttataagtcaagccatggaatttgaactttatcaggcaggc caggagaggtgtgggaaggatctgaacagaggagatactgggtga >gi568815576f:36764043_36977820|GENSCAN_predicted_peptide_5|437_aa MAVAQQLRAESDFEQLPDDVAISANIADIEEKRGFTSHFVFVIEVKTKGGSKYLIYRRYR QFHALQSKLEERFGPDSKSSALACTLPTLPAKVYVGVKQEIAEMRIPALNAYMKFLSDKV PRPHLILLPPWDPLQGSGCLWSLSTQSQPSVSPDLCAWSLSLLSLPVWVLMDEDVRIFFY QSPYDSEQVPQALRRLRPRTRKVKSVSPQGNSVDRMAAPRAEALFDFTGNSKLELNFKAG DVIFLLSRINKDWLEGTVRGATGIFPLSFVKILKDFPEEDDPTNWLRCYYYEDTISTIKS VAWEGGACPAFLPSLRPLPLTSPSHGSLSHSKAPSGSQMSHNAVTSHQRPGDIAVEEDLS STPLLKDLLELTRREFQREDIALNYRDAEGDLVRLLSDEDVALMVRQARGLPSQKRLFPW KLHITQKDNYRVYNTMP >gi568815576f:36764043_36977820|GENSCAN_predicted_CDS_5|1314_bp atggctgtggcccagcagctgcgggccgagagtgactttgaacagcttccggatgatgtt gccatctcggccaacattgctgacatcgaggagaagagaggcttcaccagccactttgtt ttcgtcatcgaggtgaagacaaaaggaggatccaagtacctcatctaccgccgctaccgc cagttccatgctttgcagagcaagctggaggagcgcttcgggccagacagcaagagcagt gccctggcctgtaccctgcccacactcccagccaaagtctacgtgggtgtgaaacaggag atcgccgagatgcggatacctgccctcaacgcctacatgaagttcctctcggacaaagtt ccccgaccccacctcatcctgctgcccccatgggatcctctgcagggctctgggtgcctc tggagcctcagcacccaaagccaaccctcagtgtccccagacctgtgcgcttggagtttg agcctgctcagcctgccggtctgggtgctgatggatgaggacgtccggatcttcttttac cagtcgccctatgactcagagcaggtgccccaggcactccgccggctccgcccgcgcacc cggaaagtcaagagcgtgtccccacagggcaacagcgttgaccgcatggcagctccgaga gcagaggctctatttgacttcactggaaacagcaaactggagctgaatttcaaagctgga gatgtgatcttcctcctcagtcggatcaacaaagactggctggagggcactgtccgggga gccacgggcatcttccctctctccttcgtgaagatcctcaaagacttccctgaggaggac gaccccaccaactggctgcgttgctactactacgaagacaccatcagcaccatcaagtct gtggcctgggagggaggggcctgtccagccttcctgccatccctacgaccactgcccctc acatcaccttctcatgggtccctctcccactccaaagcccccagtggctcccagatgagc cacaatgctgtaacaagccatcaacgtccaggggacatcgcggtggaggaagatctcagc agcactcccctattgaaagacctgctggagctcacaaggcgggagttccagagagaggac 
atagctctgaattaccgggacgctgagggggatctggttcggctgctgtcggatgaggac gtagcgctcatggtgcggcaggctcgtggcctcccctcccagaagcgcctcttcccctgg aagctgcacatcacgcagaaggacaactacagggtctacaacacgatgccatga >gi568815576f:36764043_36977820|GENSCAN_predicted_peptide_6|75_aa MQSREPPVGLAQDPLPGSQVAELTGNRHDNMDVQPRDLVQIPIWPVNGGAMKQLDPRCHS GTRFWLIVENTISFH >gi568815576f:36764043_36977820|GENSCAN_predicted_CDS_6|228_bp atgcagtcaagggaacctccagtaggactggcccaagacccacttccaggctcacaggtg gctgagctaacggggaacagacatgacaatatggacgtgcaaccacgtgacttagtacaa ataccaatttggccagtcaatggaggagcaatgaagcagctggatcccagatgtcactca ggcacaaggttctggttgatcgtagaaaacaccatttcttttcattga >gi568815576f:36764043_36977820|GENSCAN_predicted_peptide_7|966_aa MVLAQGLLSMALLALCWERSLAGAEETIPLQTLRCYNDYTSHITCRWADTQDAQRLVNVT LIRRVNERCVIPCQSFVVTDVDYFSFQPDRPLGTRLTVTLTQHVQPPEPRDLQISTDQDH FLLTWSVALGSPQSHWLSPGDLEFEVVYKRLQDSWEDAAILLSNTSQATLGPEHLMPSST YVARVRTRLAPGSRLSGRPSKWSPEVCWDSQPGDEAQPQNLECFFDGAAVLSCSWEVRKE VASSVSFGLFYKPSPDAGSAVLLREEECSPVLREGLGSLHTRHHCQIPVPDPATHGQYIV SVQPRRAEKHIKSSVNIQMAPPSLNVTKDGDSYSLRWETMKMRYEHIDHTFEIQYRKDTA TWKDSKTETLQNAHSMALPALEPSTRYWARVRVRTSRTGYNGIWSEWSEARSWDTESVTT RRLPQNKTSPRGGFRESKDTGDIGSNPSPQPGLVAAGAQGEGHNQVERRPDQWAQERRKR GVLEEPGLGAQGWAADADIPLSPRLLEVLPMWVLALIVIFLTIAVLLALRFCGIYGYRLR RKWEEKIPNPSKSHLFQNGSAELWPPGSMSAFTSGSPPHQGPWGSRFPELEGVFPVGFGD SEVSPLTIEDPKHVCDPPSGPDTTPAASDLPTEQPPSPQPGPPAASHTPEKQASSFDFNG PYLGPPHSRSLPDILGQPEPPQEGGSQKSPPPGSLEYLCLPAGGQVQLVPLAQAMGPGQA VEVERRPSQGAAGSPSLESGGGPAPPALGPRVGGQDQKDSPVAIPMSSGDTEDPGVASGY VSSADLVFTPNSGASSVSLVPSLGLPSDQTPSLCPGLASGPPGAPGPVKSGFEGYVELPP IEGRSPRSPRNNPVPPEAKSPVLNPGERPADVSPTSPQPEGLLVLQQVGDYCFLPGLGPG PLSLRSKPSSPGPGPEIKNLDQAFQVKKPPGQAVPQVPVIQLFKALKQQDYLSLPPWEVN KPGEVC >gi568815576f:36764043_36977820|GENSCAN_predicted_CDS_7|2901_bp atggtgctggcccaggggctgctctccatggccctgctggccctgtgctgggagcgcagc ctggcaggggcagaagaaaccatcccgctgcagaccctgcgctgctacaacgactacacc agccacatcacctgcaggtgggcagacacccaggatgcccagcggctcgtcaacgtgacc ctcattcgccgggtgaatgagagatgtgtcattccctgccagagttttgtcgtcactgac 
gttgactacttctcattccaaccagacaggcctctgggcacccggctcaccgtcactctg acccagcatgtccagcctcctgagcccagggacctgcagatcagcaccgaccaggaccac ttcctgctgacctggagtgtggcccttgggagtccccagagccactggttgtccccaggg gatctggagtttgaggtggtctacaagcggcttcaggactcttgggaggacgcagccatc ctcctctccaacacctcccaggccaccctggggccagagcacctcatgcccagcagcacc tacgtggcccgagtacggacccgcctggccccaggttctcggctctcaggacgtcccagc aagtggagcccagaggtttgctgggactcccagccaggggatgaggcccagccccagaac ctggagtgcttctttgacggggccgccgtgctcagctgctcctgggaggtgaggaaggag gtggccagctcggtctcctttggcctattctacaagcccagcccagatgcaggctctgct gtgctcctcagggaggaagagtgctccccagtgctgagggaggggctcggcagcctccac accaggcaccactgccagattcccgtgcccgaccccgcgacccacggccaatacatcgtc tctgttcagccaaggagggcagagaaacacataaagagctcagtgaacatccagatggcc cctccatccctcaacgtgaccaaggatggagacagctacagcctgcgctgggaaacaatg aaaatgcgatacgaacacatagaccacacatttgagatccagtacaggaaagacacggcc acgtggaaggacagcaagaccgagaccctccagaacgcccacagcatggccctgccagcc ctggagccctccaccaggtactgggccagggtgagggtcaggacctcccgcaccggctac aacgggatctggagcgagtggagtgaggcgcgctcctgggacaccgagtcggtgaccacc aggcgactcccgcagaacaagacaagtccaaggggtggtttcagagagagtaaggataca ggagatattgggagcaacccaagtccccagccaggcttggtagcggcaggggcccaggga gagggccacaaccaggtagaaaggaggccggaccaatgggcccaagagaggaggaagagg ggagtcctggaggaacctggcctgggagctcagggctgggcagcagatgctgacattcct ctttctccccggctgctggaagtgctgcctatgtgggtgctggccctcatcgtgatcttc ctcaccatcgctgtgctcctggccctccgcttctgtggcatctacgggtacaggctgcgc agaaagtgggaggagaagatccccaaccccagcaagagccacctgttccagaacgggagc gcagagctttggcccccaggcagcatgtcggccttcactagcgggagtcccccacaccag gggccgtggggcagccgcttccctgagctggagggggtgttccctgtaggattcggggac agcgaggtgtcacctctcaccatagaggaccccaagcatgtctgtgatccaccatctggg cctgacacgactccagctgcctcagatctacccacagagcagccccccagcccccagcca ggcccgcctgccgcctcccacacacctgagaaacaggcttccagctttgacttcaatggg ccctacctggggccgccccacagccgctccctacctgacatcctgggccagccggagccc ccacaggagggtgggagccagaagtccccacctccagggtccctggagtacctgtgtctg cctgctggggggcaggtgcaactggtccctctggcccaggcgatgggaccaggacaggcc 
gtggaagtggagagaaggccgagccagggggctgcagggagtccctccctggagtccggg ggaggccctgcccctcctgctcttgggccaagggtgggaggacaggaccaaaaggacagc cctgtggctatacccatgagctctggggacactgaggaccctggagtggcctctggttat gtctcctctgcagacctggtattcaccccaaactcaggggcctcgtctgtctccctagtt ccctctctgggcctcccctcagaccagacccccagcttatgtcctgggctggccagtgga ccccctggagccccaggccctgtgaagtcagggtttgagggctatgtggagctccctcca attgagggccggtcccccaggtcaccaaggaacaatcctgtcccccctgaggccaaaagc cctgtcctgaacccaggggaacgcccggcagatgtgtccccaacatccccacagcccgag ggcctccttgtcctgcagcaagtgggcgactattgcttcctccccggcctggggcccggc cctctctcgctccggagtaaaccttcttccccgggacccggtcctgagatcaagaaccta gaccaggcttttcaagtcaagaagcccccaggccaggctgtgccccaggtgcccgtcatt cagctcttcaaagccctgaagcagcaggactacctgtctctgcccccttgggaggtcaac aagcctggggaggtgtgttga >gi568815576f:36764043_36977820|GENSCAN_predicted_peptide_8|205_aa MVLAWELLLMALLALCWGLSLAGAEETVLLQTLRCYSDYTSHITCGWADTQDAQRLVNVT LIRRVNESVTLGAGATVCSQPRTLRNPFKAFIKICNAFAHDCFPLTSSEIQSRSKTRRLL QRCVIPYQSFVISDIDYFSFQPDGPLGTWLTVTLTQHDPESCYLRHWVDEVTPAQDTQHD PWPKSPHFNALGTLTLHPGCPASID >gi568815576f:36764043_36977820|GENSCAN_predicted_CDS_8|618_bp atggtgctggcctgggagctgctcctcatggccctgctggccctgtgctggggacttagc ctggcaggggcagaagaaaccgtcctgctgcagaccctgcgctgctacagtgactatacc agccacatcacctgcgggtgggcagacacccaggatgcccagcggctcgtcaacgtgacc ctcattcgccgggtgaatgagtctgtgacgttgggtgcaggggccacggtctgttcacaa ccgaggaccctcaggaatcctttcaaagcattcatcaaaatctgcaacgcctttgctcat gactgtttcccactcactagcagtgagatccaatcccgatccaagacccggaggcttctt cagagatgtgtcattccctaccagagttttgtcatcagtgacattgactacttctcattc cagccagacgggcctctgggcacctggctcactgtcactctgacccagcatgacccagag agttgctacctgcgtcactgggtagatgaagttacccctgctcaggacacccagcatgac ccatggcccaaatcaccgcatttcaatgctttgggcacactgaccctgcacccgggctgc ccagcttctattgattga