GENSCAN 1.0    Date run: 7-Nov-116    Time: 02:00:46

Sequence gi568815597r:180876573_181105464 : 228892 bp : 46.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   3504   3750  247  0  1   36   54   277 0.676  14.36
 1.02 PlyA +   4223   4228    6                               1.05

 2.00 Prom +  11119  11158   40                              -2.96
 2.01 Init +  18517  18567   51  0  0   94   91    44 0.442   6.65
 2.02 Intr +  19081  19205  125  0  2   41   50   115 0.236   2.38
 2.03 Intr +  22908  22945   38  0  2   45   93    35 0.062  -2.39
 2.04 Intr +  30422  30555  134  0  2  105   79    36 0.247   4.76
 2.05 Intr +  36075  36222  148  2  1    0   27   143 0.193  -0.79
 2.06 Intr +  39703  39831  129  2  0   74   64    79 0.706   4.77
 2.07 Intr +  39941  40528  588  0  0   58   72   298 0.743  17.80
 2.08 Intr +  41279  41342   64  0  1   75  110    65 0.909   5.18
 2.09 Intr +  50053  50166  114  1  0   87   51   141 0.987   9.86
 2.10 Intr +  51858  52001  144  0  0   74  117   111 0.996  12.00
 2.11 Intr +  58543  60098 1556  1  2  113   91   821 0.738  73.83
 2.12 Intr +  61983  62139  157  1  1   81  113   104 0.945  11.27
 2.13 Intr +  64473  64713  241  0  1  126  117   105 0.998  14.75
 2.14 Intr +  67817  67944  128  1  2   91   80    72 0.920   6.18
 2.15 Intr +  68731  68915  185  1  2  123   85   242 0.995  26.83
 2.16 Intr +  73768  73865   98  2  2  128   33   196 0.077  17.83
 2.17 Intr +  83816  83993  178  1  1   75   29    66 0.007  -1.11
 2.18 Intr +  87944  88204  261  0  0  115   69   170 0.399  15.26
 2.19 Term +  89067  89191  125  1  2  103   42   126 0.942   8.05
 2.20 PlyA +  91045  91050    6                               1.05

 3.03 PlyA -  91297  91292    6                              -0.45
 3.02 Term -  93074  92768  307  1  1   96   38   141 0.244   4.49
 3.01 Init -  93569  93517   53  2  2   55   55    63 0.523   0.33
 3.00 Prom -  94144  94105   40                              -3.16

 4.03 PlyA -  96173  96168    6                               1.05
 4.02 Term - 100005  99877  129  0  0   69   38   137 0.860   4.98
 4.01 Init - 101993 101943   51  2  0   85   83    27 0.831   3.07
 4.00 Prom - 104548 104509   40                              -3.36

 5.05 PlyA - 104778 104773    6                               1.05
 5.04 Term - 108199 108088  112  0  1   96   42    70 0.339   1.23
 5.03 Intr - 111773 111667  107  2  2   44   65   216 0.920  13.91
 5.02 Intr - 113537 113412  126  2  0  125   80   152 0.999  18.98
 5.01 Init - 122673 122671    3  0  0   68  115     0 0.363   0.70
 5.00 Prom - 123898 123859   40                              -4.46

 6.03 PlyA - 124722 124717    6                               1.05
 6.02 Term - 128891 128681  211  1  1   82   42   189 0.968  10.47
 6.01 Init - 146101 146067   35  1  2   85   96    57 0.237   5.54
 6.00 Prom - 154973 154934   40                              -1.46

 7.00 Prom + 160367 160406   40                              -4.86
 7.01 Init + 169526 169727  202  1  1   97   44   102 0.701   3.83
 7.02 Intr + 172480 172740  261  2  0  115   69   138 0.969  12.06
 7.03 Intr + 173439 173714  276  1  0  107  103   246 0.998  25.49
 7.04 Intr + 175663 175938  276  2  0   82   86   159 0.754  12.59
 7.05 Intr + 177001 177105  105  2  0   81   78    34 0.780   1.89
 7.06 Term + 200864 201039  176  0  2   54   49   133 0.052   3.92
 7.07 PlyA + 203289 203294    6                               1.05

 8.00 Prom + 208459 208498   40                              -3.66
 8.01 Sngl + 212331 213314  984  2  0   46   52  1016 0.623  90.63
 8.02 PlyA + 216880 216885    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Term +  73768  73961  194  2  2  128   46   198 0.912  17.18

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:180876573_181105464|GENSCAN_predicted_peptide_1|82_aa
XRFVWNFFRLENEHLNNCGEFRAVRDISVAPLNADDQTLLEQMMDQDDGVRNRQKNRSWK
YNQSISLRRPRLASQYVWLLLL

>gi568815597r:180876573_181105464|GENSCAN_predicted_CDS_1|249_bp
nngcgatttgtgtggaacttcttccgcctggagaatgaacatctgaataactgtggtgaa
ttccgtgctgtgcgggacatctctgtggcccccctgaacgcagatgatcagactctccta
gaacagatgatggaccaggatgatggggtacgaaaccgccagaagaatcggtcatggaag
tacaaccagagcatatccctgcgccggcctcgcctcgcttctcagtatgtatggcttcta
cttctgtga

>gi568815597r:180876573_181105464|GENSCAN_predicted_peptide_2|1487_aa
MLIGLDEQEVASTLQTLRALVEEKECLTMVTMDNVARTVQKVLCPIKPNKSDGPSNNPFL
GLLEAIWPADAGILFPNFQAHLELKCKANASVGDGWHFGEEKRGKGATGKEAWASDGPPP
CEAGGRPVVPKLGAALDFHPAPLTAGDTRSLGRGSGEAAPRGSGRGKQNIQPDGPPASQG
MGSTAPGPLCAGIQGEGFEGEDDSGQTGSAVVAPRTQNLPDGQLDGSINEEQPARDGGPR
LPRPPAPGREYCNRGSPWPPEAEWTLPDHDRGPLLGPSSLQQSPIHGVTPGRPGGPGHCN
KIIHIPSPRTGRSYPFPDGVVTEADLDSTSLTSEEVFVPRTALLGERWRAGDLEALGAGS
SVLSLSDRVERNRLLLQEMLNVSGQSPRKVGTPAWTPSWDTAAPERPVGDVDWASGTSLQ
DSGQNRSCDKCPLTAVEPQLSPEGSECAGWQPEAHGCRGASPGWTVGPNPEPVLSPRHEE
ATHLLQRARMKARTRPLRASHDIVPTITQGSRDGHRSPARDPRTTPACRDSLQNGHTSDS
SSGESSGGHRPRRGPSPSHVRFEDESAREAEFRHLERLQQRQRQVLSTVLQAADQGPLRS
KPDLADYINGAPRLRDAGQGTFHRLVGSLDRRGHPAPPAPGSERRCQACGSCIDDPRPAQ
GKAPPVPRTLQELQAACGMERVLGGLSSPLRLLPAEPRLHMEWIRETHIGDTVCPAEVDS
ALDSTDNSDNCRTDSEEAGTSQAGWACGRTQGSSPRLRLRGSRPRGHRWSKKAEAELPWG
LQAQQHLPRADDVEVENEVKEGRGHTPEGTLFLREDAKPPDLELKRVSLGPQWQPGPGLG
SHQPHPLDSRTPCRTAYATTAPMTPESSGPGGQAQVTESHESLEIVSPSSLQQSHAEPSA
PHQAWQPTASLCPEGWAPTPPPSRKTTSPVSHRKAALAGLLRLGDQTEPVGIPRPPSRSA
VLRTCELPPSQTQPSRPQVRHPLLALSTNNCNNSAPRGLQEPYGGAVHEGRVERGPCSRE
PEPPLENSRDGGPQGFLGSADVATINSTGITLSLSSEESESSKESEGSLQRTGSGSGGHV
LSRASAGAGTGPGSPSAAPLDQNKKRSSSIASTLGLKKLFSALGQSSRPKLGKSRSYSVE
QLQPAPPGLTSQSRAPSLQSLHPVSPSHQRRKAASFQNLHSLLSSKGNRSSLYLVAGPGD
HSAAGRPAKTSPRRALSVEDVGAPSLARTVGRLVEVFPDGTSQLQLQRSPGGTFGFCVAS
GNGRPDSGIYVQEMADMSMAKLYSGLLGVGDEILEVNGAKFQSRFHIFGFLFSNAPLYWY
QFTILVRFYTADKDIPKTGKKKRFSWTYSSTWLGRIMAEGTHSLRYFRLGVSDPIHGVPE
FISVGYVDSHPITTYDSVTQQKEPRAPWMAENLVPDHWERYTQLLKGWQQMFRVELKRQQ
RHYNHSGSHTYQRMIGCELLEDGSTTGFLQYAYDGQNFLIFNKDTLS

>gi568815597r:180876573_181105464|GENSCAN_predicted_CDS_2|4464_bp
atgttgattgggctggatgagcaagaagtggctagcacactgcagaccttgcgagccctg
gtagaggagaaagagtgtctgaccatggtcaccatggacaatgtggctagaactgtccaa
aaagtactgtgtcctatcaagcccaataagtcagatggccccagcaacaatccattcctg
ggcttgttggaggccatctggcctgcagatgctggcattttatttccaaatttccaggct
cacttagaattgaagtgtaaggccaatgcttctgtcggtgatggatggcattttggggaa
gagaaaagagggaagggggccacgggcaaggaggcctgggcatcagacgggccaccgccc
tgcgaggctggcggccgccccgtcgtccccaagctcggggccgccctggacttccatccc
gcgcctctgacggccggggacactcgctccctgggccgcggctcaggagaggcggccccg
cggggctccgggcggggaaaacagaacatccagcctgatggccccccagcctcccagggt
atggggagtacagctccagggcccctctgtgctggaatccaaggtgagggctttgaagga
gaagatgacagtggccaaacagggagtgctgtggtggctcctcgtacccaaaacctgcct
gatgggcagctggacggcagcatcaatgaggagcaacccgccagggatggaggccccagg
cttcccaggccgcctgcccctggacgtgagtactgcaacagggggagcccgtggcctcca
gaagccgaatggacacttcctgaccatgacagaggtccgctgctggggcccagctctttg
caacagagcccgatccatggagttactcccggacggcctgggggtcctggtcattgtaac
aaaatcatccacattcccagcccaaggacaggaaggtcctacccttttccagatggcgtg
gtgacagaggcagatctggatagcacatccctgacctccgaggaggtctttgtccccagg
acggccctgctgggtgagcgctggagagctggagacctggaggctctgggcgctgggagc
agtgtcttgtccctgtctgatcgggtggagagaaaccgcctgttgctgcaggagatgctc
aacgtttctgggcagagcccccgcaaggtgggaacccctgcctggactccatcctgggac
acagctgcaccagagcgaccagtgggggatgtggactgggcctcgggcacctccttgcag
gactccggccagaacaggtcttgtgacaaatgccctctgacagccgtggaaccacagctg
tcaccggaaggctccgagtgtgcgggctggcagccagaggcccacggctgcaggggcgcc
tctccgggctggaccgttggtcccaacccggagcctgtgctgagccccaggcatgaggaa
gccacgcatctgctgcagcgtgcccgcatgaaggccaggacccggcccctccgtgccagc
catgacatcgtgcccaccattacccagggcagccgagatggccacagaagcccagcccgg
gaccccaggacgacccctgcctgcagagacagcctccagaacgggcacacgagcgattcc
tccagcggagagtccagcggtgggcacaggccgaggcggggcccctcgccgtcgcacgtg
cgctttgaggatgagtccgcccgcgaagccgagttccgtcacctggagcggctgcagcag
cgccagcgccaggtgctgagcaccgtgttgcaggccgcggaccagggccccctgcgctcc
aagcccgacctcgccgactacatcaacggggctccccggctccgggacgcggggcagggg
acattccacaggcttgtgggcagcctggaccgcaggggacacccggcaccgccggcaccg
ggcagcgagaggaggtgccaggcctgcggcagctgcatcgacgacccgcgccccgcccag
gggaaggcgccccccgtccccaggaccctccaggagctccaggctgcctgtgggatggag
agggtgctgggtggcctgagctccccactccggctccttcctgcagagccccggctccac
atggaatggatccgggaaacacacatcggagacaccgtgtgccctgcggaggtggactct
gccctggacagcacagacaactctgacaactgcaggaccgacagtgaggaggcggggacc
tctcaggctggctgggcgtgtgggcggacccaaggcagcagcccgcgactgcgactgcgg
ggctccaggcctcgaggccacaggtggtccaagaaggctgaggcggagctcccttggggc
cttcaggcccagcaacacctgcctagggctgatgatgtggaggtggaaaatgaggtgaaa
gagggcagaggacacacgcctgaaggaactctatttttgagagaagatgccaagcctcct
gacctggagttgaagcgggtgtccctgggaccccagtggcagcctggaccagggctggga
agtcaccagcctcaccctttggattcccggactccatgcaggacagcctatgccaccacc
gcccccatgacgcctgaatcatcggggccaggaggccaggcccaggttacagaaagccac
gagtccctggaaattgtctctccttcctccctgcaacagagccatgcagagccttctgcc
ccacaccaagcctggcagccaacagcttccttgtgtcctgaaggctgggcgccaacccct
cccccttcgaggaaaaccacctcgccagtgtctcacaggaaggcagccctggctggactg
ctcaggctgggtgaccagacagagcctgtgggtatccctcggcctccttcaagaagcgcg
gttctcaggacctgtgagctgcccccatcacagacccagcccagccgccctcaggtcagg
cacccactgctggccctgtccaccaacaactgcaacaacagcgcacctcgggggctgcag
gagccctacgggggagccgtccacgagggtagggtggagaggggcccctgcagccgggaa
ccggagccgcccctggagaacagcagagatggaggaccccagggctttcttggctcagca
gatgttgccaccatcaactccacgggcatcaccctctccctgtcctcagaggagtcagag
tccagcaaggaatcagagggaagcctgcagaggacagggtcaggatctggaggacatgtg
ctgtcaagagcatcagcaggagctggcacaggacccggctccccctcggctgcccctttg
gaccagaacaagaaaaggagcagcagcatagcctccaccctggggctgaaaaagctcttc
tcagccctgggccagagttcccggcccaagctgggcaagtcccgcagctacagtgtggag
cagttgcagcccgccccgcctggcctgacgtcacagtccagggccccatcgttacaatcc
ctgcacccggtgtcaccctctcaccagcgtcggaaagctgcctcttttcagaacctccat
tctctgctgagcagcaaggggaaccggtccagcctctacctggtagcagggccaggggac
cacagtgcagctggcaggccggccaagacttcaccacggcgtgccctcagtgtggaggac
gtgggtgctcccagcctggctcgcaccgtgggccgcctggtggaggtgttcccagacggc
accagccagctgcagctgcagcgctccccagggggcactttcggcttctgcgtggcctct
gggaatgggcgcccagactcagggatctacgtgcaggagatggctgacatgagcatggcc
aagctgtactcagggctgctgggggtgggcgatgagatcctcgaggtgaacggggccaag
ttccaaagtcgcttccacattttcgggtttcttttcagcaatgccccactctactggtac
caatttactatattagtccgtttttacactgctgacaaagacatacccaagactgggaag
aaaaagaggtttagttggacttacagttccacatggctggggaggatcatggcggagggg
acgcactctctgagatattttcgcctgggcgtttcggatcccatccatggggtccctgaa
tttatttcagttgggtatgtggactcgcaccctatcaccacatatgacagtgtcactcag
cagaaggagccacgggccccatggatggcagagaacctcgtgcctgatcactgggagagg
tacactcagctgctgaagggctggcagcagatgttcagggtggaactgaagcgccagcag
agacactacaatcactcagggtctcacacttaccagagaatgattggctgtgagctgctg
gaggatggaagcactacaggatttctgcagtatgcatatgatgggcagaatttcctgatc
ttcaataaagacaccctctcctga

>gi568815597r:180876573_181105464|GENSCAN_predicted_peptide_3|119_aa
MHDQATQRLPEAQCDMNCPFVSSEVWLLPLHLPQTQCHLARASASLAFQLPPSSAATQNL
LAQTFIPNSRPGQIQKPLLVQPTRSGEKGERKSSSFLCFPAQGIGKSGWNLAFCFYHFE

>gi568815597r:180876573_181105464|GENSCAN_predicted_CDS_3|360_bp
atgcatgaccaggccactcagcggcttcctgaggcacagtgtgacatgaactgcccattt
gtgtcctcagaggtctggctgctgccactgcacctgcctcagacccagtgtcacctcgcc
agggcttcagccagcctggccttccagctgccacccagcagcgcagcaactcagaacctt
ctggcccagaccttcatacccaactcccgcccagggcaaatacagaagcctctgctggtg
cagccaacacggagtggggaaaaaggggagaggaaaagcagcagcttcctgtgcttccca
gctcagggcattggtaagagtggatggaatttggcattctgcttctaccattttgaatga

>gi568815597r:180876573_181105464|GENSCAN_predicted_peptide_4|59_aa
MRGFFPIFAENLVKLLECCDGGASGCEFLLHMNRGEEEKLSTCDIAVYSHSYPGNILLH

>gi568815597r:180876573_181105464|GENSCAN_predicted_CDS_4|180_bp
atgaggggatttttcccaatattcgctgagaacctggtcaagctcctggagtgctgtgac
ggcggggcctctgggtgcgagttcctcctgcatatgaaccgaggggaggaggagaagctg
agcacgtgtgacattgccgtctactcacattcctatcctggaaacatactgctgcactga

>gi568815597r:180876573_181105464|GENSCAN_predicted_peptide_5|115_aa
MALLGDSGSQNWSTGTTDKYGRLDRELQRANSHFIEEQQAQQQLIVEQQDEQLELVSGSI
GVLKNMSQRIGGELEEQAVMLEDFSHELESTQSRLDNVMKKLAKVSHMTSGMYGV

>gi568815597r:180876573_181105464|GENSCAN_predicted_CDS_5|348_bp
atggcactgctgggagacagtggcagccagaactggagcactggaacaacagataaatat
gggcgtctggaccgagagctccagagagccaattctcatttcattgaggagcagcaggca
cagcagcagttgatcgtggaacagcaggatgagcagttggagctggtctctggcagcatc
ggggtgctgaagaacatgtcccagcgcatcggaggggagctggaggaacaggcagttatg
ttggaagatttctctcacgaattggagagcactcagtcccggctggacaatgtgatgaag
aaacttgcaaaagtatctcatatgaccagtggtatgtatggcgtttaa

>gi568815597r:180876573_181105464|GENSCAN_predicted_peptide_6|81_aa
MSMEDPFFVVKGEVQKAVNTAQGLFQRWTELLQDPSTATREEIDWTTNELRNNLRSIEWD
LEDLDETINILFCGVCAIRDK

>gi568815597r:180876573_181105464|GENSCAN_predicted_CDS_6|246_bp
atgtccatggaggaccccttctttgtggtgaaaggagaggtacagaaagcagtcaacact
gcccagggattgtttcagagatggacagagctcctccaggacccctccacagcaacaagg
gaagaaatcgactggaccaccaacgagctgagaaataacctccggagcatagagtgggat
ctagaggaccttgatgaaaccatcaatatccttttctgtggtgtctgtgccattcgggat
aaatag
>gi568815597r:180876573_181105464|GENSCAN_predicted_peptide_7|431_aa
MPEPPTPSMGLCAARASPMSATPAPRRQSHDHPRAEECGRTAREWQAAPPAAPVGDPLGE
ASWAPESGTHSLRYFRLGVSDPIHGVPEFISVGYVDSHPITTYDSVTRQKEPRAPWMAEN
LAPDHWERYTQLLRGWQQMFKVELKRLQRHYNHSGSHTYQRMIGCELLEDGSTTGFLQYA
YDGQDFLIFNKDTLSWLAVDNVAHTIKQAWEANQHELLYQKNWLEEECIAWLKRFLEYGK
DTLQRTEPPLVRVNRKETFPGVTALFCKAHGFYPPEIYMTWMKNGEEIVQEIDYGDILPS
GDGTYQAWASIELDPQSSNLYSCHVEHCGVHMVLQVPQESETIPLVMKAVSGSIVLVIVL
AGVGVLVWRRRPRVMIGDISGPTEPGLAVPVDKEECERESFIDNEEEEVKVAGEESRKRR
EGNREEGQSRQ

>gi568815597r:180876573_181105464|GENSCAN_predicted_CDS_7|1296_bp
atgcctgagcctcccaccccctccatgggcctctgtgcggcccgagcctccccgatgagc
gccacccctgctccacggcgccagtcccatgaccacccaagggctgaggagtgcgggcgc
acggcgcgggaatggcaggcagctccacctgcagccccagtgggggatccactgggtgaa
gccagctgggctcctgagtctgggacgcactctctgagatattttcgcctgggcgtttcg
gatcccatccatggggtccctgaatttatttcggttgggtacgtggactcgcaccctatc
accacatatgacagtgtcactcggcagaaggagccacgggccccatggatggcagagaac
ctcgcgcctgatcactgggagaggtacactcagctgctgaggggctggcagcagatgttc
aaggtggaactgaagcgcctacagaggcactacaatcactcagggtctcacacttaccag
agaatgattggctgtgagctgctggaggatggaagcaccacaggatttctgcagtatgca
tatgacgggcaggatttcctgatcttcaataaagacaccctctcctggctggctgtagat
aatgtggctcacaccatcaagcaggcatgggaggccaatcagcatgagttgctgtatcaa
aagaattggctggaagaagaatgtattgcctggctaaagagattcctggagtatgggaaa
gacaccctacaaagaacagagcccccactggtcagagtaaatcgcaaagaaacttttcca
ggggttacagctctcttctgcaaagctcatggcttttaccccccagaaatttacatgaca
tggatgaaaaacggggaagaaattgtccaagaaattgattatggagacattcttcccagt
ggggatggaacctatcaggcgtgggcatcaattgagcttgatcctcagagcagcaacctt
tactcctgtcatgtggagcactgcggtgtccacatggttcttcaggtcccccaggaatca
gaaactatccctcttgtgatgaaagctgtctctgggtccattgtccttgtcattgtgctg
gctggagttggtgttctagtctggagaagaaggccccgagtcatgataggagacatttca
ggacccacagagcctgggctggcagtaccagttgacaaggaggagtgtgagagggagtct
ttcatagacaatgaagaggaagaagtaaaagtggcaggcgaggagagcagaaaaaggaga
gaaggaaacagggaggaaggacaaagccgacaatga

>gi568815597r:180876573_181105464|GENSCAN_predicted_peptide_8|327_aa
MEFKLEAHRIVSISLGKIYNSRVQRGGIKLHKNLLVSLVLRSARQVYLSDPCPGLYLAGP
AGTPAPPPQQQPGEPAAGPPAGWGEPPPPAARASWPETEPQPERSSVSDAPRVGDEVPVA
TVTGVGDVFQGGEADATEAAWSRVEGPRQAAAREAEGTAGGWGVFPEVSRAARRPCGCPL
GGEDPPGTPAATPRAACCCAPQPAEDEPPAPPAVCPRKRCAAGVGGGPAGCPAPGSTPLK
KPRRNLEQPPSGGEDDDAEEMETGNVANLISIFGSSFSGLLRKSPGGGREEEEGEESGPE
AAEPGQICCDKPVLRDMNPWSTAIVAF

>gi568815597r:180876573_181105464|GENSCAN_predicted_CDS_8|984_bp
atggagttcaagctggaggctcatcgcatcgtcagcatctctctgggcaagatctacaac
tcgcgggtccagcgcggcggcatcaagctgcataagaacctcctggtctcgctggtgctg
cgcagcgcccgccaagtctacctgagcgacccgtgccccggcctctacctggccggtccc
gctgggaccccggcgccgccaccgcagcagcagcccggggagccggcggccgggccaccc
gccggctggggagagccgcccccgcccgccgctcgtgcctcttggccggagaccgagccg
cagccggagcgctcctccgtctcagacgcgccgcgggtaggggacgaggtgccggtggcc
acggtgactggagtcggggacgtttttcagggcggagaggcggacgcgacggaagctgcc
tggagccgcgtggaggggccgcgccaggcggcggccagagaagccgagggtaccgccgga
ggctggggcgtcttccccgaggtatctcgtgccgcgcgccgcccctgcggctgcccccta
ggcggggaggacccgccgggtacaccggccgcgaccccccgcgctgcctgctgctgcgcg
ccgcaaccagcggaggacgagccccccgcgccgcccgcggtgtgccccaggaagcgctgc
gcggcgggggtgggcggcggcccagcgggctgcccggcgcccggctcgaccccgctcaag
aagccccgccggaacttagagcagccgccgagtggaggagaggacgacgacgcggaggag
atggagaccgggaacgtggctaacctcatcagcatcttcggttccagtttctcgggactc
ctacggaaaagccccgggggcggcagagaggaagaggagggagaggagagcggtccggaa
gccgccgagcccgggcagatctgctgcgataagccggtgctgagagacatgaacccctgg
agcacagccatcgtggccttctga