GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:43:52

Sequence gi568815594r:75481836_75725988 : 244153 bp : 40.02% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.06  PlyA -     12      7    6                               1.05
1.05  Term -    831    703  129  0  0   93   49    80 0.038   1.80
1.04  Intr -   8866   8746  121  0  1   63   76    98 0.176   5.68
1.03  Intr -  31342  31267   76  2  1   88   47    83 0.214   1.85
1.02  Intr -  32155  32035  121  1  1   61   89    51 0.718   1.65
1.01  Init -  32451  32320  132  0  0   89   47   194 0.884  15.59
1.00  Prom -  33422  33383   40                             -13.11

2.00  Prom +  33735  33774   40                              -4.65
2.01  Init +  34185  34192    8  2  2   92   83    10 0.967   0.73
2.02  Intr +  34937  35144  208  2  1   94   72   112 0.967   8.36
2.03  Intr +  39901  40026  126  0  0   79   78    28 0.642   0.86
2.04  Term +  45125  45379  255  1  0  104   44   195 0.998  11.30
2.05  PlyA +  46540  46545    6                               1.05

3.04  PlyA -  46696  46691    6                               1.05
3.03  Term -  51278  50683  596  0  2  -35   54   275 0.448   5.50
3.02  Intr -  69466  69435   32  0  2   55  103     1 0.031  -4.84
3.01  Init -  70595  70486  110  2  2   70   36   202 0.545  12.94
3.00  Prom -  71221  71182   40                              -4.25

4.15  PlyA -  71932  71927    6                               1.05
4.14  Term -  72412  72345   68  2  2   71   49    78 0.012  -0.68
4.13  Intr -  73878  73732  147  1  0   57   78    75 0.015   2.69
4.12  Intr -  80376  80257  120  1  0   65   65    83 0.008   3.25
4.11  Intr - 101796 101622  175  0  1   30   30   172 0.012   4.19
4.10  Intr - 110090 109984  107  0  2   63   92    69 0.290   3.81
4.09  Intr - 114505 114408   98  0  2  109   71    32 0.544   2.33
4.08  Intr - 115401 115100  302  0  2   58   80   185 0.922   9.31
4.07  Intr - 116377 116242  136  0  1   63   99    86 0.980   6.85
4.06  Intr - 118534 118446   89  1  2   82   65    54 0.977   0.35
4.05  Intr - 122121 121982  140  1  2   92   84    88 0.982   8.06
4.04  Intr - 123799 123687  113  0  2   82   82    34 0.979   1.30
4.03  Intr - 125526 125348  179  0  2   74   71   106 0.923   5.30
4.02  Intr - 132614 132420  195  2  0   60  101    49 0.755   2.09
4.01  Init - 144153 143986  168  0  0   46  113   167 0.831  14.59
4.00  Prom - 144913 144874   40                              -7.65

5.04  PlyA - 146163 146158    6                               1.05
5.03  Term - 148418 148203  216  2  0   54   42   115 0.576  -0.34
5.02  Intr - 148549 148470   80  2  2  104   42    90 0.901   4.35
5.01  Init - 149069 148625  445  2  1   81   84   233 0.567  16.43
5.00  Prom - 162319 162280   40                              -3.35

6.14  PlyA - 162915 162910    6                               1.05
6.13  Term - 163867 163595  273  1  0   55   48   369 0.999  24.09
6.12  Intr - 164621 164503  119  0  2   96  106   128 0.999  14.66
6.11  Intr - 165322 165194  129  2  0   70   80   162 0.999  13.35
6.10  Intr - 166906 166804  103  1  1   73  109   146 0.999  14.03
6.09  Intr - 173411 173231  181  1  1   43   72   294 0.992  22.25
6.08  Intr - 174035 173933  103  0  1   54   79   154 0.995   9.41
6.07  Intr - 175179 175089   91  0  1   51  100   105 0.999   6.65
6.06  Intr - 175895 175722  174  2  0   34  100   143 0.997   9.31
6.05  Intr - 177089 177008   82  1  1   94  101    94 0.995  10.02
6.04  Intr - 180214 180096  119  1  2   94   99    45 0.252   4.54
6.03  Intr - 191548 191373  176  2  2   65   85   216 0.192  17.74
6.02  Intr - 192005 191691  315  0  0   21   53   263 0.746  11.21
6.01  Init - 193460 193400   61  2  1   65   96    78 0.993   7.76
6.00  Prom - 199635 199596   40                              -4.75

7.00  Prom + 204898 204937   40                              -6.55
7.01  Init + 205550 205619   70  1  1   84   91    25 0.577   3.69
7.02  Intr + 206855 206961  107  0  2   52  115    43 0.612   2.41
7.03  Intr + 210337 210365   29  0  2  115  111   -15 0.348  -0.50
7.04  Intr + 213534 213609   76  0  1   85  121    -9 0.422   0.70
7.05  Term + 214789 214905  117  0  0   96   35    88 0.761   1.86
7.06  PlyA + 215723 215728    6                               1.05

8.03  PlyA - 217782 217777    6                               1.05
8.02  Term - 242912 242712  201  2  0   66   39   222 0.942  11.51
8.01  Intr - 243540 243265  276  0  0   29   23   218 0.286   6.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Intr +  82279  82417  139  2  1  145   37   103 0.902   9.50
S.002 Init - 101826 101622  205  0  1   89   30   176 0.811  11.26
S.003 Init + 149300 149426  127  1  1   73   82   196 0.972  17.87
S.004 Term + 153179 153342  164  0  2   88   43    89 0.822   1.52

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:75481836_75725988|GENSCAN_predicted_peptide_1|192_aa
MAATAREDGASGQERGQRGCEHYDRGCLLKVTPSMDSSLTSFRQQNPSWEVIRLAQWSLP
VGLFVFCCRSRLRQPRACTGVFRSGHGLTNIAPLNWMKLLKNVGHPPSVVGYRCPLCMHS
ALDMTRYWRQLDDEVAQTPMPSEYQNMTVDILCNDCNGRSTVQFHILGMKCKICESYNTA
QAGGRRISLDQQ

>gi568815594r:75481836_75725988|GENSCAN_predicted_CDS_1|579_bp
atggcggcgacggcccgggaagatggcgccagcggtcaagagcgaggtcagcggggctgc
gagcactatgacagaggatgtctcctaaaggtgacgccttctatggactcttccctgaca
agcttccgtcagcaaaacccttcttgggaagtgattaggcttgcacagtggtctctgcca
gtgggcttattcgtcttttgctgccgaagtagactgaggcagccccgggcctgtactggg
gttttccgttcaggtcatggcttgacgaacattgctccactgaactggatgaagctcctt
aaaaatgttggtcatccaccaagtgttgtaggctacagatgtccattatgtatgcactct
gctttagatatgaccaggtattggagacagctggatgatgaagtagcacagactcctatg
ccatcagaatatcagaacatgactgtggatattctctgcaatgactgtaatggacgatcc
actgttcagtttcatatattaggcatgaaatgtaagatttgtgaatcctataatactgct
caagctggaggacgtagaatttcactggatcagcaatga

>gi568815594r:75481836_75725988|GENSCAN_predicted_peptide_2|198_aa
MKEFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDVLCSRHFKKTDFDRSAPNIKLKPGV
IPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNHHLVGASSCIEEFQSQFIFEHSYSV
MDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQKSLRKTIRELKDECLISQETANRL
DTFCWDCCQESIEQDYIS

>gi568815594r:75481836_75725988|GENSCAN_predicted_CDS_2|597_bp
atgaaggaattccccacagatgaaaacatcaaaaggaaatgggtattagcaatgaaaaga
cttgatgtgaatgcagccggcatttgggagcctaaaaaaggagatgtgttgtgttcgagg
cactttaagaagacagattttgacagaagtgctccaaatattaaactgaaacctggagtc
ataccttctatctttgattctccatatcacctacaggggaaaagagaaaaacttcattgt
agaaaaaacttcaccctcaaaaccgttccagccactaactacaatcaccatcttgttggt
gcttcctcatgtattgaagaattccaatcccagttcatttttgaacatagctacagtgta
atggacagtccaaagaaacttaagcataaattagatcatgtgatcggcgagctagaggat
acaaaggaaagtctacggaatgttttagaccgagaaaaacgttttcagaaatcattgagg
aagacaatcagggaattaaaggatgaatgtctgatcagccaagaaacagcaaatagactg
gacactttctgttgggactgttgtcaggagagcatagaacaggactatatttcatga

>gi568815594r:75481836_75725988|GENSCAN_predicted_peptide_3|245_aa
MVIIADGSSMHVIAPEDLPVEQDVEVEDSDSDDPDPVATSHPASSICGKESIKVQKICSP
TCNKKKIPFSEEKFKPAAETCVSNEELNVNPQDNGENVSRACQRSSRQPLPSQVWRPRRK
VWFCGPGPGSPCCVQPRDLVPCVPATPAMAERSQHTPWAVASEGASLKPWQLPYDVEPLS
AQKSRTEVWASPPKFQMYGNAWMPRQKFVVGPGSSWRTSARAVQKGNVELEPPHRVPTGH
CLVEL

>gi568815594r:75481836_75725988|GENSCAN_predicted_CDS_3|738_bp
atggttatcatagcagatggcagctctatgcatgttattgcccctgaagatcttccagtg
gaacaagatgtggaggtggaagacagtgacagtgatgatcctgaccccgtagcaacttcc
catcctgcaagctccatttgtggaaaggagagcataaaagttcagaaaatttgcagccca
acttgcaataaaaagaaaatcccattttctgaagagaaattcaagccggctgcagaaact
tgcgtaagtaatgaggagctgaatgttaatccccaagacaatggggaaaatgtctccagg
gcatgtcagcgatcttcaaggcagccccttccatcacaggtctggaggcctaggaggaaa
gtatggttttgtgggccaggcccagggtccccgtgttgtgtgcagcccagggacttggtg
ccctgtgtcccagccactccagccatggctgaaaggagccaacatacaccttgggccgtg
gcttcagagggtgcaagtctcaagccttggcagcttccatatgatgttgagcctttgagt
gcacagaagtcaagaactgaggtttgggcatctccgcctaaatttcagatgtatggaaat
gcctggatgcccaggcagaagtttgttgtagggccagggtcctcatggagaacctctgct
agggcagtgcagaagggaaatgtggagttggaacccccacacagagtccctactgggcac
tgcctagtggagctgtga

>gi568815594r:75481836_75725988|GENSCAN_predicted_peptide_4|678_aa
MEKYENLGLVGEGSYGMVMKCRNKDTGRIVAIKKFLESDDDKMVKKIAMREIKLLKQLRH
ENLVNLLEVCKKKKRWYLVFEFVDHTILDDLELFPNGLDYQVVQKYLFQIINGIGFCHSH
NIIHRDIKPENILVSQSGVVKLCDFGFARTLAAPGEVYTDYVATRWYRAPELLVGDVKYG
KAVDVWAIGCLVTEMFMGEPLFPGDSDIDQLYHIMMCLGNLIPRHQELFNKNPVFAGVRL
PEIKEREPLERRYPKLSEVVIDLAKKCLHIDPDKRPFCAELLHHDFFQMDGFAERFSQEL
QLKVQKDARNVSLSKKSQNRKKEKEKDDSLVEERKTLVVQDTNADPKIKDYKLFKIKGSK
IDGEKAEKGNRASNASCLHDSRTSHNKIVPSTSLKDCSNVSVDHTRNPSVAIPPLTHNLS
AVAPSINSGMGTETIPIQGYRVDEKTKKCSIPFVKPNRHSPSGIYNINVTTLVTRNSRLT
KKESKILSESRIPSLAAIDLHTPSITLHQMCGLLAKCLQFHIVGAFIVSLGVAAVCKIAV
AEPRKKTYADFYRNYDSVKDLEEMGKAVPHTYILFHRRTLEMSVVSPTHLLAMADWTTDR
HSTHSLSAYLLKKRNKRERALPGGFLTPVVCSGELGIILRPPTRVYAASLLCFCPGVSDT
GQGAGAKDKIPALMELIV

>gi568815594r:75481836_75725988|GENSCAN_predicted_CDS_4|2037_bp
atggaaaaatatgaaaacctgggtttggttggagaagggagttatggaatggtgatgaag
tgtaggaataaagatactggaagaattgtggccataaagaagttcttagaaagtgacgat
gacaaaatggttaaaaagattgcaatgcgagaaatcaagttactaaagcaacttaggcat
gaaaacttggtgaatctcttggaagtgtgtaagaaaaaaaaacgatggtacctagtcttt
gaatttgttgaccacacaattcttgatgacttggagctctttccaaatggactagactac
caagtagttcaaaagtatttgtttcagattattaatggaattggattttgtcacagtcac
aatatcatacacagagatataaagccagagaatatattagtctcccagtctggcgttgtc
aagctatgcgattttggatttgcgcgaacattggcagctcctggggaggtttatactgat
tatgtggcaacccgatggtacagagctccagaactattggttggtgatgtcaagtatggc
aaggctgttgatgtgtgggccattggttgtctggtaactgaaatgttcatgggggaaccc
ctatttcctggagattctgatattgatcagctatatcatattatgatgtgtttaggtaat
ctaattccaaggcatcaggagctttttaataaaaatcctgtgtttgctggagtaaggttg
cctgaaatcaaggaaagagaacctcttgaaagacgctatcctaagctctctgaagtggtg
atagatttagcaaagaaatgcttacatattgaccccgacaaaagacccttctgtgctgag
ctcctacaccatgatttctttcaaatggatggatttgctgagaggttttcccaagaacta
cagttaaaagtacagaaagatgccagaaatgtttctttatctaaaaaatcccaaaacaga
aagaaggaaaaagaaaaagatgattccttagttgaagaaagaaaaacacttgtggtacag
gataccaatgctgatcccaaaattaaggattataaactatttaaaataaaaggctcaaaa
attgatggagaaaaagctgaaaaaggcaatagagcttcaaatgccagctgtctccatgac
agtaggacaagccacaacaaaatagtgccttcaacaagcctcaaagactgcagcaatgtc
agcgtggaccacacaaggaatccaagcgtggcaattcccccacttacacacaatctttct
gcagttgctcccagcattaattctggaatggggactgagactataccaattcagggttac
agagtggatgagaaaactaagaagtgttctattccatttgttaaaccgaacagacattcc
ccatcaggcatttataacattaatgtgaccacattagtaactcgaaattccaggctaaca
aagaaagagagcaaaattctttcagaatctcgaattccttctctggctgctattgacctg
cacacccccagtattacattacatcagatgtgtggtcttctggccaaatgtctgcaattt
catattgttggagcctttattgtatccctgggggttgcagctgtctgtaagattgctgtg
gctgaaccaagaaagaagacatatgcagatttctacagaaattatgattccgtgaaagat
ttggaggagatggggaaggctgtccctcatacatatattctgtttcatcgacgaaccctg
gaaatgtcagttgtcagtcctacgcaccttctggctatggctgattggaccactgataga
cactcaactcattcactgtctgcttacctacttaaaaagcgaaacaagagggagagggca
ttacccggaggcttcctgaccccggtggtttgcagtggagagttggggatcattcttagg
cccccaaccagggtttatgctgcttctctgctctgcttttgtccaggagtgtcagacact
ggtcaaggtgctggggctaaagataagattcctgctctcatggaacttatagtttag

>gi568815594r:75481836_75725988|GENSCAN_predicted_peptide_5|246_aa
MESRSGRAYATQGARRPPLPQRRCYEGNRIAFCIVFAVLHNRHLYPSRGHHNRLLPLGKA
ERRLLTRTKACGDAGPLRVREQNLGRALGEGAGWLALSQSWLVTTRLGQSEEGRPGALGA
GEKPPPLAAPRPTPSPRGSWSGAGSQAPGVGACRERILTWAVSVSQDFVPMVGDRCELAA
DRGPPASLPASPWTRDCVRRRNHNSAGIASLLGSIFWTCEPYAFHLKFVSKLKTFGSFYA
NGLKKR

>gi568815594r:75481836_75725988|GENSCAN_predicted_CDS_5|741_bp
atggaatcccggtcaggccgcgcctacgcgactcagggcgcccggcgcccgcccctgccc
cagcggcgatgctatgagggaaaccgtatcgcattttgcatagtcttcgcagtcctacat
aaccgccacctttacccttcgcgtgggcatcacaatcgcctcctcccgctggggaaggca
gaaaggcgcctcctgacgagaaccaaggcgtgtggggacgcagggcctctgcgtgtcagg
gagcagaacctgggccgagccctaggtgaaggggcggggtggttggccctgagccaatca
tggctcgtgacgactcggctcggccaatcagaagaagggaggcctggcgctctcggggcg
ggtgagaaaccgcccccccttgcagctccgcggccaacgccttcgcccaggggtagttgg
agcggtgcaggttcccaggctccaggtgttggtgcctgccgtgaacgcattctgacctgg
gccgtatctgtctcccaagactttgtgcctatggttggggacagatgtgagcttgcggcg
gaccgaggcccacctgcctccctgcctgcttcgccctggactcgtgactgcgtccgcaga
agaaatcacaacagcgctggaattgctagtttgctaggcagcatcttttggacctgcgaa
ccatatgcatttcacctcaaatttgtttccaagttgaaaacctttgggtctttctatgcg
aacggattgaagaaacggtaa

>gi568815594r:75481836_75725988|GENSCAN_predicted_peptide_6|641_aa
MVKNIYNSIARKSSKPETLEARLQQQPADTYSERLLNSLLPINKKDQNISLQKTLRCCEP
ALACAIRKKGGKAGASGPEKGSLLVTSNQRTLILALGRFRFPGFPGASVCERGSVLRVLR
RGTREAPGAREEVVVRQTNSRLSGFRGFRVVGGRGRRAIRTRRTRSSATLLTSARRTRRR
RWLEHLTLCSKEMVMEKPSPLLVGREFVRQYYTLLNKAPEYLHRFYGRNSSYVHGGVDAS
GKPQEAVYGQNDIHHKVLSLNFSECHTKIRHVDAHATLSDGVVVQVMGLLSNSGQPERKF
MQTFVLAPEGSVPNKFYVHNDMFRYEDEVFGDSEPELDEESEDEVEEEQEERQPSPEPVQ
ENANSGYYEAHPVTNGIEEPLEESSHEPEPEPESETKTEELKPQVEEKNLEELEEKSTTP
PPAEPVSLPQEPPKPRVEAKPEVQSQPPRVREQRPRERPGFPPRGPRPGRGDMEQNDSDN
RRIIRYPDSHQLFVGNLPHDIDENELKEFFMSFGNVVELRINTKGVGGKLPNFGFVVFDD
SEPVQRILIAKPIMFRGEVRLNVEEKKTRAARERETRGGGDDRRDIRRNDRGPGGPRGIV
GGGMMRDRDGRGPPPRGGMAQKLGSGRGTGQMEGRFTGQRR

>gi568815594r:75481836_75725988|GENSCAN_predicted_CDS_6|1926_bp
atggtgaagaatatttataatagcattgcacggaagagctccaaaccggaaacacttgaa
gctcgtctacagcagcaacctgcagacacctattcagagagattactaaattccctcctt
cccattaataaaaaggaccagaatatcagtctacaaaagacactgcgttgctgcgaacca
gctctcgcttgcgcgatcaggaagaagggcggcaaggctggagcctcgggaccggagaaa
ggcagcctgcttgtgacgtcaaatcagcggactcttatcttggctttaggccggttccgg
ttccccggctttccgggcgcgagcgtgtgcgagcgcggcagcgtactgcgcgtgctccgc
agagggacacgggaagcgcctggcgcccgggaagaggtggttgtgaggcagacgaactcg
cggctctccggcttccgaggcttccgagttgtcggaggaagggggcggcgagcaataaga
acccgccgcacccggtcctcagcgactcttctgacctccgcgcgacgtacccgccgccgc
cgttggctggagcatttgacattgtgcagcaaagaaatggttatggagaagcccagtccg
ctgcttgtagggcgggagtttgtgaggcaatattatactttgctgaataaagctccggaa
tatttacacaggttttatggcaggaattcttcctatgttcatggtggagtagatgctagt
ggaaagccccaggaagctgtttatggccaaaatgatatacaccacaaagtattatctctg
aacttcagtgaatgtcatactaaaattcgtcatgtggatgctcatgcaaccttgagtgat
ggagtagttgtccaggtcatgggtttgctgtctaacagtggacaaccagaaagaaagttt
atgcaaacctttgttctggctcctgaaggatctgttccaaataaattttatgttcacaat
gatatgtttcgttatgaagatgaagtgtttggtgattctgagcctgaacttgatgaagaa
tcagaagatgaagtagaagaggaacaagaagaaagacaaccatctcctgaacctgtgcaa
gaaaatgctaacagtggttactatgaagctcaccctgtgactaatggcatagaggagcct
ttggaagaatcctctcatgaacctgaacctgagccagaatctgaaacaaagactgaagag
ctgaaaccacaagtggaggagaagaacttagaagaactagaggagaaatctactactcct
cctccggcagaacctgtttctctgccacaagaaccaccaaagccaagagtcgaagctaaa
ccagaagttcaatctcagccacctcgtgtgcgtgaacaacgacctagagaacgacctggt
tttcctcctagaggaccaagaccaggcagaggagatatggaacagaatgactctgacaac
cgtagaataattcgctatccagatagtcatcaactttttgttggtaacttgccacatgat
attgatgaaaatgagctaaaggaattcttcatgagttttggaaacgttgtggaacttcgc
atcaataccaagggtgttgggggaaagcttccaaattttggttttgtggtttttgatgac
tctgaaccagttcagagaatcttaattgcaaaaccgattatgtttcgaggggaagtacgt
ttaaatgtggaagagaaaaaaacaagagctgcaagagagcgagaaaccagaggtggtggt
gatgatcgcagggatattaggcgcaatgatcgaggtcccggtggtccacgtggaattgtg
ggtggtggaatgatgcgtgatcgtgatggaagaggacctcctccaaggggtggcatggca
cagaaacttggctctggaagaggaaccgggcaaatggagggccgcttcacaggacagcgt
cgctga

>gi568815594r:75481836_75725988|GENSCAN_predicted_peptide_7|132_aa
MIVRPPQPCGTVSPLNLFFIPVPGQKPWMIHKKKFHREQISVKTTLPCLARIPLARLRMN
PVPSACSLTSVPGGDRYRCLGSEGQDRSWAKKSKCPVALVKNIRCVNDVSMADNCQLSSS
CSQSSLSSWKED

>gi568815594r:75481836_75725988|GENSCAN_predicted_CDS_7|399_bp
atgattgtgagacctcctcagccatgtggaactgtaagtccgctgaacctctttttcatc
ccagtcccagggcaaaaaccttggatgatccataaaaagaaattccacagggaacagatt
tctgtaaaaactactttgccatgtttagcaagaattccactagctcgtttgagaatgaac
cctgtgcctagtgcttgctctctaacgtctgtgcctggaggtgacaggtacaggtgtcta
gggtctgagggtcaagacagaagctgggctaagaagtctaagtgcccagtggccttggta
aaaaacattagatgtgtaaatgatgtgagcatggcagacaactgccagctttctagcagt
tgttcccaatcatcattgtcgtcatggaaagaggactaa

>gi568815594r:75481836_75725988|GENSCAN_predicted_peptide_8|158_aa
PNALTNSWVLNLLAVFPNAPVSDCEGGSRTSGSSRPVQTSERERGRGPRPRGAWEAVRAS
SAAHPSLPVRYHLRTTLLARASGPTELRGAAAGLGPNPALYTLLLLPPPPPTRRPLQPSK
PGNSRQQRPRCACAGTWHYVAARRLKAKGERRREEPLS

>gi568815594r:75481836_75725988|GENSCAN_predicted_CDS_8|477_bp
ccaaatgctctcaccaacagctgggttttaaaccttctcgcagttttccccaacgcccca
gtttcagactgcgagggagggagcaggacttcaggctcctctcggccagtgcagacgagc
gaaagagaaagagggaggggcccacggccccgaggggcgtgggaggcagttcgggccagc
tcggccgcgcatccgtcccttcccgtgcggtaccacctgcggaccactctcctagcccga
gcttcagggcctacagagctgcggggcgcggccgccggcctgggccccaatcccgcactc
tacacactcctactgctgccaccaccgcctccaactcggcggccactccagccctctaag
cccggaaacagccgccagcagcggccaagatgcgcatgcgcggggacgtggcattacgtg
gcggctcgaaggttgaaggcaaaaggggagcggaggcgagaggaacctcttagctag