GENSCAN 1.0	Date run: 5-Nov-116	Time: 05:34:22

Sequence gi568815581r:34911971_35126553 : 214583 bp : 43.34% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 PlyA -    161    156    6                               1.05
 1.10 Term -   6366   6338   29  1  2  104   50    31 0.080  -0.86
 1.09 Intr -  19081  18979  103  1  1   67  110    87 0.681   8.55
 1.08 Intr -  20530  20397  134  2  2   85   82    96 0.866   9.06
 1.07 Intr -  27360  27213  148  0  1   61  103   128 0.999  11.41
 1.06 Intr -  30673  30514  160  0  1   74  100    96 0.953   9.39
 1.05 Intr -  40083  39980  104  0  2  135   33    60 0.680   4.17
 1.04 Intr -  42629  42456  174  2  0   64   87    91 0.989   6.74
 1.03 Intr -  46724  46590  135  2  0   90   52    64 0.900   3.76
 1.02 Intr -  47680  47617   64  0  1   68   89   110 0.956   7.82
 1.01 Init -  48123  48113   11  0  2   87   33    -6 0.011  -6.21
 1.00 Prom -  48744  48705   40                              -7.56

 2.00 Prom +  49347  49386   40                              -8.26
 2.01 Sngl +  49549  50715 1167  0  0   67   46  1159 0.453 103.71
 2.02 PlyA +  51362  51367    6                               1.05

 3.00 Prom +  66960  66999   40                              -2.16
 3.01 Init +  71297  71582  286  1  1   62  110   287 0.796  25.54
 3.02 Intr +  74018  74161  144  0  0   59  110    78 0.978   7.35
 3.03 Intr +  77457  77693  237  1  0  109   48   231 0.964  18.79
 3.04 Intr +  78993  79144  152  1  2   77   71   149 0.698  11.98
 3.05 Intr +  79785  79867   83  2  2   77   75   171 0.619  13.14
 3.06 Intr +  79988  80065   78  2  0   91   75    56 0.904   3.27
 3.07 Intr +  80554  80722  169  1  1   97   73   302 0.999  29.55
 3.08 Intr +  82306  82461  156  0  0   43   86   173 0.966  12.81
 3.09 Intr +  84094  84225  132  0  0  123   74    70 0.989   9.94
 3.10 Intr +  84604  84683   80  0  2  101  109    54 0.999   7.15
 3.11 Intr +  85768  85855   88  1  1  120   98    87 0.999  12.87
 3.12 Intr +  86249  86326   78  1  0   97   98    51 0.990   6.75
 3.13 Intr +  86634  86757  124  2  1  114   77   184 0.994  20.06
 3.14 Intr +  87337  87479  143  2  2   88  100   204 0.999  21.67
 3.15 Intr +  87812  87886   75  1  0   69   64    91 0.958   4.51
 3.16 Intr +  89287  89433  147  0  0   89   64   158 0.999  13.93
 3.17 Intr +  89939  90134  196  1  1  117   91   166 0.990  18.79
 3.18 Intr +  90698  90819  122  0  2  100  100   144 0.887  17.01
 3.19 Term +  92303  92536  234  1  0  134   39   201 0.682  16.12
 3.20 PlyA +  92816  92821    6                               1.05

 4.08 PlyA -  94163  94158    6                               1.05
 4.07 Term - 100179  99998  182  1  2  100   54   213 0.998  16.87
 4.06 Intr - 102793 102770   24  2  0  102   92     6 0.568   0.40
 4.05 Intr - 104610 104400  211  0  1   84   68   268 0.998  22.89
 4.04 Intr - 105636 105553   84  0  0  124   75   116 0.999  13.92
 4.03 Intr - 109814 109401  414  2  0   43  110   127 0.310   4.70
 4.02 Intr - 114591 114404  188  1  2   89  113    78 0.941   9.81
 4.01 Init - 118082 117950  133  2  1   78   47    73 0.698   2.50
 4.00 Prom - 118206 118167   40                              -2.46

 5.00 Prom + 131367 131406   40                              -2.06
 5.01 Init + 150536 150624   89  1  2   79  119    -5 0.439   1.81
 5.02 Intr + 156668 156854  187  2  1   26    0   147 0.048  -0.61
 5.03 Term + 164834 165037  204  1  0   15   48   176 0.663   3.67
 5.04 PlyA + 165060 165065    6                               1.05

 6.14 PlyA - 166191 166186    6                               1.05
 6.13 Term - 170290 170223   68  2  2   92   42    64 0.276   0.30
 6.12 Intr - 177162 177135   28  0  1  118   94   -13 0.491  -0.01
 6.11 Intr - 187723 187593  131  2  2   78   99    22 0.126   2.71
 6.10 Intr - 189066 188997   70  0  1  116   20    31 0.172  -2.15
 6.09 Intr - 189395 189231  165  2  0   64   99    99 0.575   8.76
 6.08 Intr - 191354 191284   71  0  2  151   65    97 0.983  12.60
 6.07 Intr - 191574 191484   91  0  1  105  113    58 0.912   9.57
 6.06 Intr - 194511 194416   96  0  0   59   99   179 0.999  16.31
 6.05 Intr - 195152 195018  135  2  0   75  110   113 0.994  12.96
 6.04 Intr - 195477 195396   82  2  1   76   88    71 0.930   5.54
 6.03 Intr - 206649 206531  119  0  2   50   72   153 0.872   9.16
 6.02 Intr - 207202 207141   62  2  2   82  105    77 0.937   7.25
 6.01 Init - 207643 207562   82  1  1   99   60    70 0.945   4.79

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  49423  49287  137  1  2   52   45   209 0.910  10.92

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:34911971_35126553|GENSCAN_predicted_peptide_1|353_aa
MERRLVSGAGDIKLTKDGNVLLDEMQIQHPTASLIAKVATAQDDVTGDGTTSNVLIIGEL
LKQADLYISEGLHPRIIAEGFEAAKIKALEVLEEVKVTKEMKRKILLDVARTSLQTKVHA
ELADVLTEVVVDSVLAVRRPGYPIDLFMVEIMEMKHKLGTDTKEVNSGFFYKTAEEKEKL
VKAERKFIEDRVQKIIDLKDKVCAQSNKGFVVINQKGEEKFTFIEECVNPCSVTLLVKGP
NKHTLTQVKDAIRDGLRAIKNAIEDGCMVPGAGAIEVAMAEALVTYKNSIKGRARLGVQA
FADALLIIPKVLAQNAGYDPQETLVKVQAEHVESKQLVGVDLNTAAYIIVKEI

>gi568815581r:34911971_35126553|GENSCAN_predicted_CDS_1|1062_bp
atggagagaaggcttgtttctggtgcaggtgacatcaaactcaccaaagatggcaatgtg
ctgctcgatgagatgcaaattcaacatccaacagcttccttgatagcaaaagtagcaaca
gctcaggatgacgtcacaggagatggtactacttcaaatgttctaattattggagagtta
ttaaaacaagctgacctgtacatttctgagggcctgcaccctagaataatagctgaagga
tttgaagctgcaaagataaaagcacttgaagttttggaggaagttaaagtgacaaaggag
atgaaaagaaaaatcctcttagatgtagctagaacatcattacaaactaaagttcatgct
gaactggctgatgtcttaacagaggttgtggtggattctgttttggctgttagaagacca
ggttaccctattgatctcttcatggtagaaataatggagatgaagcataaattaggaaca
gatacaaaagaggtgaactctggtttcttttataagactgcagaagagaaagagaaattg
gtaaaagctgaaagaaaatttattgaagatagagtacaaaaaataatagacctgaaggac
aaagtctgtgctcagtcaaataaaggatttgtcgtcattaatcaaaagggtgaagaaaag
ttcacttttattgaggagtgtgttaacccttgctctgttaccttgttggttaaaggacca
aataagcatactctcacacaagtcaaggatgccataagagatggacttcgtgctatcaaa
aatgccattgaagatggttgtatggttcctggagctggtgcaattgaagtggcaatggct
gaagctcttgttacatataagaacagtataaaaggaagagctcgtcttggagtccaagct
tttgctgatgccttactcattattcccaaggttcttgctcagaatgctggttatgaccca
caggaaacattagtaaaagttcaggctgagcatgtcgagtcaaaacaacttgtgggcgta
gatttgaatacagctgcctatataattgtcaaagagatctag

>gi568815581r:34911971_35126553|GENSCAN_predicted_peptide_2|388_aa
MPRPTETRIVLGLVAKMASSASARTPAGKRVINQEELRRLMKEKQRLSTSRKRIESPFAK
YNRLGQLSCALCNTPVKSELLWQTHVLGKQHREKVAELKGAKEASQGSSASSAPHSVKRK
APDADDQDVKRAKATLVPQVQPSTSAWTTNFDKIGKEFIRATPSKPSGLSLLPDYEDEEE
EEEEEEGDGERKRGDASKPLSDAQGKEHSVSSSREVTSSVLPNDFFSTNPPKAPIIPHSG
SIEKAEIHEKVVERRENTAEALPEGFFDDPEVDARVRKVDAPKDQMDKEWDEFQKAMRQV
NTISEAIVAEEDEEGRLDRQIGEIDEQIECYRRVEKLRNRQDEIKNKLKEILTIKELQKK
EEENADSDDEGELQDLLSQDWRVKGALL

>gi568815581r:34911971_35126553|GENSCAN_predicted_CDS_2|1167_bp
atgcctcgcccgacggaaaccagaatcgttttgggtctggtcgccaagatggcgtcctcc
gcctccgcccggactccggcagggaagcgagtgataaatcaggaagaattgcggcggtta
atgaaggagaagcagcgtctgagcaccagtcggaaacggatagaatctccattcgcgaag
tacaaccgtttggggcagctgagttgtgccctgtgtaacactccggttaagagcgagctc
ctgtggcagactcacgtcctgggaaagcagcaccgagagaaagtggccgagctgaaaggc
gcgaaggaagccagccagggttcgtccgccagttcagcgcctcattccgtcaagaggaaa
gcgccggacgcagacgaccaagatgtcaagagagcgaaggccaccttggtgcctcaggta
cagccctccacatctgcgtggaccaccaactttgacaaaataggaaaggagttcattaga
gcgactcccagtaagccttcaggactcagtttactccccgattatgaagatgaggaggag
gaggaagaggaggaggaaggagatggagaaagaaaaaggggggacgccagcaagccgctc
tccgacgcacagggcaaggagcactcagtttcctcttcacgggaggtaacaagtagtgtg
ctgccaaacgatttctttagtactaatcctcccaaggcccccataattcctcattcaggg
tcaattgagaaagcagaaatacatgaaaaagtggtggaaaggagagaaaacaccgcggaa
gcgttaccggaaggtttttttgacgaccctgaggtagatgcaagagtacgaaaggttgat
gctccaaaagatcagatggacaaagagtgggacgaattccaaaaagccatgaggcaggtc
aacactatttccgaagccatagttgccgaagaggatgaggagggacggttggaccgccag
attggggagatcgatgagcagatagagtgttaccgacgggtggaaaagctacggaatcgc
caggatgaaataaaaaataaacttaaagaaatcctgaccataaaagaactgcagaaaaag
gaagaagagaatgctgacagcgatgatgagggggaactacaggatttgttgtctcaggat
tggagggtgaaaggggcattgttatag

>gi568815581r:34911971_35126553|GENSCAN_predicted_peptide_3|907_aa
MAEQRFCVDYAKRGTAGCKKCKEKIVKGVCRIGKVVPNPFSESGGDMKEWYHIKCMFEKL
ERARATTKKIEDLTELEGWEELEDNEKEQITQHIADLSSKAAGTPKKKAVVQAKLTTTGQ
VTSPVKGASFVTSTNPRKFSGFSERQNEFISVVSPTAKPNNSGEAPSSPTPKRSLSSSKC
DPRHKDCLLREFRKLCAMVADNPSYNTKTQIIQDFLRKGSAGDGFHGDVYLTVKLLLPGV
IKTVYNLNDKQIVKLFSRIFNCNPDDMARDLEQEVDEFLLRLSKLTKEDEQQQALQDIAS
RCTANDLKCIIRLIKHDLKMNSGAKHVLDALDPNAYEAFKASRNLQDVVERVLHNAQEVE
KEPGQRRALSVQASLMTPVQPMLAEACKSVEYAMKKCPNGMFSEIKYDGERVQVHKNGDH
FSYFSRSLKPVLPHKVAHFKDYIPQAFPGGHSMILDSEVLLIDNKTGKPLPFGTLGVHKK
AAFQDANVCLFVFDCIYFNDVSLMDRPLCERRKFLHDNMVEIPNRIMFSEMKRVTKALDL
ADMITRVIQEGLEGLVLKDVKGTYEPGKRHWLKVKKDYLNEGAMADTADLVVLGAFYGQG
SKGGMMSIFLMGCYDPGSQKWCTVTKCAGGHDDATLARLQNELDMVKISKDPSKIPSWLK
VNKIYYPDFIVPDPKKAAVWEITGAEFSKSEAHTADGISIRFPRCTRIRDDKDWKSATNL
PQLKELYQLSKEKADFTVVAGDEGSSTTGGSSEENKGPSGSAVSRKAPSKPSASTKKAEG
KLSNSNSKDGNMQTAKPSAMKVGEKLATKSSPVKVGEKRKAADETLCQTKVLLDIFTGVR
LYLPPSTPDFSRLRRYFVAFDGDLVQEFDMTSATHVLGSRDKNPAAQQVSPEWIWACIRK
RRLVAPC

>gi568815581r:34911971_35126553|GENSCAN_predicted_CDS_3|2724_bp
atggctgagcaacggttctgtgtggactatgccaagcgtggcacagctggctgcaaaaaa
tgcaaggaaaagattgtgaagggcgtatgccgaattggcaaagtggtgcccaatcccttc
tcagagtctgggggtgatatgaaagagtggtaccacattaaatgcatgtttgagaaacta
gagcgggcccgggccaccacaaaaaaaatcgaggacctcacagagctggaaggctgggaa
gagctggaagataatgagaaggaacagataacccagcacattgcagatctgtcttctaag
gcagcaggtacaccaaagaagaaagctgttgtccaggctaagttgacaaccactggccag
gtgacttctccagtgaaaggcgcctcatttgtcaccagtaccaatccccggaaattttct
ggcttttcagaaaggcagaatgaattcatctctgttgtttctccaacagccaagcccaac
aactctggggaagccccctcgagccccacccctaagagaagtctgtcttcaagcaaatgt
gaccccaggcataaggactgtctgctacgggagtttcgaaagttatgcgccatggtggcc
gataatcctagctacaacacgaagacccagatcatccaggacttccttcggaaaggctca
gcaggagatggtttccacggtgatgtgtacctaacagtgaagctgctgctgccaggagtc
attaagactgtttacaacttgaacgataagcagattgtgaagcttttcagtcgcattttt
aactgcaacccagatgatatggcacgggacctagagcaggaagtggatgagttccttctg
cggctgtccaagctcaccaaggaggatgagcagcaacaggccctacaggacattgcctcc
aggtgtacagccaatgaccttaaatgcatcatcaggttgatcaaacatgatctgaagatg
aactcaggtgcaaaacatgtgttagacgcccttgaccccaatgcctatgaagccttcaaa
gcctcgcgcaacctgcaggatgtggtggagcgggtccttcacaacgcgcaggaggtggag
aaggagccgggccagagacgagctctgagcgtccaggcctcgctgatgacacctgtgcag
cccatgttggcggaggcctgcaagtccgttgagtatgcaatgaagaaatgtcccaatggc
atgttctctgagatcaagtacgatggagagcgagtccaggtgcataagaatggagaccac
ttcagctacttcagccgcagtctcaagcccgtccttcctcacaaggtggcccactttaag
gactacattccccaggcttttcctgggggccacagcatgatcttggattctgaagtgctt
ctgattgacaacaagacaggcaaaccactgccctttgggactctgggagtacacaagaaa
gcagccttccaggatgctaatgtctgcctgtttgtttttgattgtatctactttaatgat
gtcagcttgatggacagacctctgtgtgagcggcggaagtttcttcatgacaacatggtt
gaaattccaaaccggatcatgttctcagaaatgaagcgagtcacaaaagctttggacttg
gctgacatgataacccgggtgatccaggagggattggaggggctggtgctgaaggatgtg
aagggtacatatgagcctgggaagcggcactggctgaaagtgaagaaagactatttgaac
gagggggccatggccgacacagctgacctggtggtccttggagccttctatgggcaaggg
agcaaaggcggcatgatgtcaatcttcctcatgggctgctacgaccctggcagccagaag
tggtgcacagtcaccaagtgtgcaggaggccatgatgatgccacgcttgcccgcctgcag
aatgaactagacatggtgaagatcagcaaggaccccagcaaaatacccagctggttgaag
gtcaacaagatctactatcctgacttcatcgtcccagacccaaagaaagctgccgtgtgg
gagatcacaggggctgaattctccaaatcggaggctcatacagctgacgggatctccatc
cgattccctcgctgcacccgaatccgagatgataaggactggaaatctgccactaacctt
ccccaactcaaggaactgtaccagttgtccaaggagaaggcagacttcactgtagtggct
ggagatgaggggagctccactacagggggtagcagtgaagagaataagggtccctcaggg
tctgctgtgtcccgcaaggcccccagcaagccctcagccagtaccaagaaagcagaaggg
aagctgagtaactccaacagcaaagatggcaacatgcagactgcaaagccttccgctatg
aaggtgggggagaagctggccacaaagtcttctccagtgaaagtaggggagaagcggaaa
gctgctgatgagacgctgtgccaaacaaaggtattgctggacatcttcactggggtgcgg
ctttacttgccaccctccacaccagacttcagccgtctcagacgctactttgtggcattc
gacggggacctggtacaggaatttgatatgacttcagccacgcacgtgctgggtagcagg
gacaagaaccctgcggcccagcaggtctccccagagtggatttgggcatgtatccggaaa
cggagactggtagctccctgctag

>gi568815581r:34911971_35126553|GENSCAN_predicted_peptide_4|411_aa
MEYYAAIKNDEFMSFVGTWMKLEIIILSKLLQGQKTKHRMFSLIDFIMWATCCNWFCLDG
QPEEVPPPQGARMQAYSNPGYSSFPSPTGLEPSCKSCGAHFANTARKQQTCLDCKKNFCM
TCSSQVGNGPRLCLLCQRFRATAFQREELMKMKVKDLRDYLSLHDISTEMCREKEELVLL
VLGQQPVISQEDRTRASTLSPDFPEQQAFLTQPHSSMVPPTSPNLPSSSAQATSVPPAQV
QENQQANGHVSQDQEEPVYLESVARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEG
LTVRQLKEILARNFVNYKGCCEKWELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLE
ENLCKICMDSPIDCVLLECGHMVTCTKCGKRMNECPICRQYVIRAVHVFRS

>gi568815581r:34911971_35126553|GENSCAN_predicted_CDS_4|1236_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aaattggaaatcatcattctcagtaaactattgcaaggacaaaaaaccaaacaccgcatg
ttctcgctcatagattttatcatgtgggcaacctgctgcaactggttctgcctggatgga
cagcctgaggaggtcccaccaccccagggagccaggatgcaggcctattccaaccctggg
tacagctccttcccttccccaacaggcttggaaccaagctgcaagtcctgtggggctcac
tttgcaaacacggccaggaagcagcagacctgcttggactgtaagaaaaatttttgcatg
acctgttcgagccaagtagggaatgggccccgcctctgccttctctgccaacggtttcga
gctacagcctttcagcgagaggagctcatgaagatgaaggtgaaggacttgagggactat
ctcagcctccatgacatctctaccgaaatgtgccgggagaaagaagagctggtgctcttg
gtccttggccagcagcctgtaatctcccaggaggacaggactcgtgcctccaccttgtcc
ccagactttcctgagcagcaggccttcctgacccagcctcactccagcatggttccacct
acctcacccaacctcccctcttcatctgcacaagccacctctgttcccccagcccaggtt
caggagaatcagcaggccaatggccatgtgtctcaggatcaagaggaacccgtctacctg
gagagcgtggccagagtacctgctgaggatgagacccagtctattgactcagaggacagc
tttgtcccaggccgaagggcctctctgtctgacctgactgacctggaggacattgaaggc
ctgacagtgcggcagctgaaagagatcttggctcgcaactttgtcaactacaagggctgc
tgtgagaagtgggagctgatggagagagtgacccggctatacaaggatcagaaaggactc
cagcacctggtcagtggtgccgaagaccaaaacgggggagcagtaccatcaggcttggag
gagaacctgtgtaagatctgcatggactcacccattgactgtgttcttctggagtgtggc
cacatggtaacctgtaccaagtgtggcaagcgcatgaatgaatgtcccatctgccggcag
tatgtaatccgagctgtgcatgtcttccggtcctga

>gi568815581r:34911971_35126553|GENSCAN_predicted_peptide_5|159_aa
MDTILAKQTLILNTYYDWSKYQVLLSTQPCSEKVSNIKRLATAYLRHNEILGSCMSVIEK
VASMQEKKESRGVRTQPEDPMEEEEEKGSQQEKSTCGKCGYAAKRKRKYNWSAKAKRQTT
TGTGQMRHLKIVYSKFGHGFCERTPPKPKRAAAAASSSS

>gi568815581r:34911971_35126553|GENSCAN_predicted_CDS_5|480_bp
atggatacaattctagcaaagcagaccctgattctcaatacatactatgattggagtaaa
taccaggttctcctgagtactcaaccctgctcagaaaaggtatccaatatcaagaggcta
gcaactgcctacctcaggcacaatgaaatactaggttcctgtatgtcagtaattgaaaaa
gtagcaagcatgcaggaaaagaaagaatccagaggggtcaggacccagcctgaggacccg
atggaggaggaggaggagaaaggaagtcagcaagaaaagtcgacctgtggcaaatgtggc
tatgctgccaagcgcaagaggaagtataactggagtgccaaggctaaaagacaaactacc
actggaactggtcaaatgaggcacttaaaaattgtatacagcaaattcgggcatggattc
tgtgaaagaacaccacctaaacccaagagggcagctgctgcagcatccagttcatcttaa

>gi568815581r:34911971_35126553|GENSCAN_predicted_peptide_6|399_aa
MGVLRVGLCPGLTEEMIQLLRSHRIKTVVDLVSADLEEVAQKCGLSYKALVALRRVLLAQ
FSAFPVNGADLYEELKTSTAILSTGIGSLDKLLDAGLYTGEVTEIVGGPGSGKTQVCLCM
AANVAHGLQQNVLYVDSNGGLTASRLLQLLQAKTQDEEEQAEALRRIQVVHAFDIFQMLD
VLQELRGTVAQQVTGSSGTVKVVVVDSVTAVVSPLLGGQQREGLALMMQLARELKTLARD
LGMAVVVTNHITRDRDSGRLKPALGRSWSFVPSTRILLDTIEGAGASGGRRMACLAKSSR
QPTGFQEMVDIGTWGTSEQSATLQGKEGPSYTQKVMTFRGDECNRDRIMEDWEGVANWHQ
GVGFRTGLVPAGRAAAAAVLPLDLPIIIIDAGPVPGTRG

>gi568815581r:34911971_35126553|GENSCAN_predicted_CDS_6|1200_bp
atgggcgtgctcagggtcggactgtgccctggccttaccgaggagatgatccagcttctc
aggagccacaggatcaagacagtggtggacctggtttctgcagacctggaagaggtagct
cagaaatgtggcttgtcttacaaggccctggttgccctgaggcgggtgctgctggctcag
ttctcggctttccccgtgaatggcgctgatctctacgaggaactgaagacctccactgcc
atcctgtccactggcattggcagtcttgataaactgcttgatgctggtctctatactgga
gaagtgactgaaattgtaggaggcccaggtagcggcaaaactcaggtatgtctctgtatg
gcagcaaatgtggcccatggcctgcagcaaaacgtcctatatgtagattccaatggaggg
ctgacagcttcccgcctcctccagctgcttcaggctaaaacccaggatgaggaggaacag
gcagaagctctccggaggatccaggtggtgcatgcatttgacatcttccagatgctggat
gtgctgcaggagctccgaggcactgtggcccagcaggtgactggttcttcaggaactgtg
aaggtggtggttgtggactcggtcactgcggtggtttccccacttctgggaggtcagcag
agggaaggcttggccttgatgatgcagctggcccgagagctgaagaccctggcccgggac
cttggcatggcagtggtggtgaccaaccacataactcgagacagggacagcgggaggctc
aaacctgccctcggacgctcctggagctttgtgcccagcactcggattctcctggacacc
atcgagggagcaggagcatcaggcggccggcgcatggcgtgtctggccaaatcttcccga
cagccaacaggtttccaggagatggtagacattgggacctgggggacctcagagcagagt
gccacattacagggaaaggagggtccgagttacactcagaaagtaatgacatttagaggt
gatgaatgcaacagggatagaatcatggaggattgggaaggagtggcaaactggcaccag
ggtgttggattcaggacagggttggtgccggccgggagggcggccgcagcagctgtcctt
cctttagacttgccaatcatcatcatagatgcaggcccagtccctggaactcgaggctaa