GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:14:05 Sequence gi568815589f:114326355_114433131 : 106777 bp : 51.38% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3551 3664 114 1 0 68 100 190 0.992 16.37 1.02 Intr + 4080 4222 143 2 2 106 97 164 0.999 18.76 1.03 Intr + 4438 4508 71 1 2 49 89 67 0.939 2.22 1.04 Intr + 5213 5320 108 0 0 99 66 67 0.947 6.36 1.05 Intr + 5472 5575 104 1 2 111 92 134 0.887 16.49 1.06 Term + 6715 6780 66 0 0 118 43 77 0.969 4.43 1.07 PlyA + 6874 6879 6 1.05 2.10 PlyA - 7800 7795 6 -0.45 2.09 Term - 10952 10700 253 1 1 118 38 263 0.979 19.74 2.08 Intr - 15371 15179 193 0 1 103 38 71 0.476 2.67 2.07 Intr - 15771 15655 117 1 0 147 58 -11 0.582 2.74 2.06 Intr - 17449 17354 96 2 0 118 90 17 0.832 5.38 2.05 Intr - 19655 19509 147 0 0 137 96 42 0.981 10.62 2.04 Intr - 20430 20315 116 2 2 71 80 2 0.871 -1.71 2.03 Intr - 21546 21370 177 2 0 64 94 166 0.997 14.35 2.02 Intr - 24667 24505 163 2 1 103 89 66 0.981 7.75 2.01 Init - 25072 25006 67 1 1 76 81 28 0.883 2.08 2.00 Prom - 25552 25513 40 -3.61 3.18 PlyA - 26906 26901 6 -0.45 3.17 Term - 28048 27918 131 2 2 8 44 160 0.772 2.25 3.16 Intr - 29782 29571 212 0 2 86 92 54 0.740 4.68 3.15 Intr - 30615 30509 107 0 2 61 68 64 0.863 1.21 3.14 Intr - 31813 31567 247 0 1 66 107 67 0.865 4.40 3.13 Intr - 33440 33240 201 1 0 122 83 146 0.947 16.42 3.12 Intr - 33708 33542 167 0 2 67 101 169 0.985 15.37 3.11 Intr - 33798 33775 24 0 0 87 107 10 0.728 1.50 3.10 Intr - 35557 35350 208 0 1 70 67 44 0.950 0.10 3.09 Intr - 36179 36052 128 2 2 80 82 160 0.908 14.58 3.08 Intr - 41343 41189 155 1 2 58 113 144 0.957 14.20 3.07 Intr - 42241 42085 157 1 1 144 109 61 0.997 13.90 3.06 Intr - 42431 42295 137 0 2 120 94 5 0.988 5.10 3.05 Intr - 43600 43258 343 1 1 47 3 440 0.060 26.96 3.04 Intr - 46812 46681 132 0 0 94 80 8 0.127 1.95 3.03 Intr - 47813 47739 75 2 0 112 76 113 0.761 12.71 3.02 Intr - 51178 50112 1067 2 2 32 92 413 0.929 26.82 3.01 Init - 54979 54706 274 1 1 80 100 225 0.996 20.03 3.00 Prom - 55154 55115 40 -4.31 4.00 Prom + 55602 55641 40 -7.40 4.01 Init + 55737 55794 58 2 1 36 94 53 0.647 2.12 4.02 Intr + 58162 58254 93 2 0 44 106 78 0.914 5.63 4.03 Term + 62091 62221 131 1 2 95 46 110 0.713 6.15 4.04 PlyA + 62988 62993 6 -0.45 5.05 PlyA - 63302 63297 6 1.05 5.04 Term - 65674 65558 117 1 0 83 49 83 0.200 2.74 5.03 Intr - 69716 69635 82 1 1 37 88 31 0.183 -1.86 5.02 Intr - 72314 72086 229 0 1 41 94 147 0.429 8.06 5.01 Init - 74949 74892 58 0 1 64 90 40 0.715 3.22 5.00 Prom - 75494 75455 40 -7.20 6.12 PlyA - 75749 75744 6 1.05 6.11 Term - 76582 76400 183 1 0 106 44 305 0.991 25.76 6.10 Intr - 76985 76863 123 2 0 68 91 185 0.998 18.09 6.09 Intr - 77723 77542 182 0 2 101 105 184 0.869 21.50 6.08 Intr - 80361 80001 361 0 1 100 73 247 0.650 19.75 6.07 Intr - 81664 81593 72 1 0 110 79 119 0.937 13.30 6.06 Intr - 91854 91804 51 0 0 98 56 49 0.383 2.29 6.05 Intr - 93715 93665 51 1 0 105 56 61 0.440 4.19 6.04 Intr - 97169 96960 210 2 0 99 84 218 0.975 22.13 6.03 Intr - 98192 97980 213 2 0 85 55 387 0.998 34.54 6.02 Intr - 98670 98634 37 2 1 98 97 -5 0.871 0.15 6.01 Intr - 100059 99857 203 0 2 78 -3 194 0.846 7.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:114326355_114433131|GENSCAN_predicted_peptide_1|201_aa MALSWVLTVLSLLPLLEAQIPLCANLVPVPITNATLDRITGKWFYIASAFRNEEYNKSVQ EIQATFFYFTPNKTEDTIFLREYQTRQNQCFYNSSYLNVQRENGTVSRYEGGREHVAHLL FLRDTKTLMFGSYLDDEKNWGLSFYADKPETTKEQLGEFYEALDCLCIPRSDVMYTDWKK DKCEPLEKQHEKERKQEEGES >gi568815589f:114326355_114433131|GENSCAN_predicted_CDS_1|606_bp atggcgctgtcctgggttcttacagtcctgagcctcctacctctgctggaagcccagatc ccattgtgtgccaacctagtaccggtgcccatcaccaacgccaccctggaccggatcact ggcaagtggttttatatcgcatcggcctttcgaaacgaggagtacaataagtcggttcag gagatccaagcaaccttcttttactttacccccaacaagacagaggacacgatctttctc agagagtaccagacccgccagaaccagtgcttctataactccagttacctgaatgtccag cgggagaatgggaccgtctccagatacgagggaggccgagaacatgttgctcacctgctg ttccttagggacaccaagaccttgatgtttggttcctacctggacgatgagaagaactgg gggctgtctttctatgctgacaagccagagacgaccaaggagcaactgggagagttctac gaagctctcgactgcttgtgcattcccaggtcagatgtcatgtacaccgactggaaaaag gataagtgtgagccactggagaagcagcacgagaaggagaggaaacaggaggagggggaa tcctag >gi568815589f:114326355_114433131|GENSCAN_predicted_peptide_2|442_aa MHTWKEVTIISEEALKPAPQKHAVPGSEFEGHKRISEQPLPNKTISPPPAPAPAAAPLPC GPTETIPSFLLTRAGRDQAICELQEEVSRLRLRLEDSLHQPLQGSPTRPASAFDRPARTR GRPADSPATWGSHYGSKSTERLPGEPRGEEQIVPPGRQRARSSSVPREVLRLSLSSESEL PSLPLFSEKSKTTKDSPQAARDGKRGVGSAGWPDRVTFRGQYTGHEYHVLSPKAVPKGNG TVSCPHCRPIRTQDAGGAVTGDPLGPPPADTLQCPLCGQVGSPPEADGPGSATSGAEKAT TRRKASSTPSPKQRSKQAGSSPRPPPGLWYLATAPPAPAPPAFAYISSVPIMPYPPAAVY YAPAGPTSAQPAAKWPPTASPPPARRHRHSIQLDLGDLEELNKALSRAVQAAESVRSTTR QMRSSLSADLRQAHSLRGSCLF >gi568815589f:114326355_114433131|GENSCAN_predicted_CDS_2|1329_bp atgcatacgtggaaagaggtgactataatttcagaggaggctctgaagccagccccacaa aaacatgcggttcctggctcagagtttgaggggcacaaacggatttctgaacagcccctt cccaacaagacaatcagcccacccccagcccccgcccctgccgctgcgcctctaccctgt ggaccaacagagaccatccccagcttcctgctcaccagggcagggcgagaccaggccatc tgtgagctgcaagaagaggtgtcccggcttcgtctgcggctggaagacagcctgcaccag ccactccagggcagcccgacacgcccagcatctgcctttgaccgccccgcccggacccgc ggccggccagcagactccccagccacctggggctcccattatggcagtaaatccacagag agattgcctggtgagcctagaggtgaagagcagattgtccctccaggaaggcagcgagcc aggtcttcctcagtgcctcgggaggtgctccgactgtccctgagttcagaatctgagctg ccctccctaccactgttctctgagaagagcaagaccaccaaggacagtccacaggcagct cgggatggaaagagaggggtgggcagtgctggatggccagacagggtcaccttccggggc caatacacaggccacgaataccatgttctgtcccctaaggcggtcccaaaaggcaatggc acagtctcctgtccccactgccggcccattaggacccaggatgcgggtggtgctgtcaca ggggacccactgggaccgcctcccgctgatacccttcagtgtcccctgtgtggtcaagtt gggtctcccccagaggcagatggtccaggctcagccacctctggggcagagaaggccacc acgaggagaaaagcatcttcaactcccagccccaagcagaggagcaagcaggcggggtcg tcgccacgcccaccccccggactgtggtatctggcaacagcgcccccagcaccagcccct ccagcctttgcctacatctcctcggttcccatcatgccttatccacctgccgctgtgtac tatgcgcctgcaggacctacctcagcccaaccagctgccaagtggccgcccacagcctct cccccaccagcccggagacaccggcactccatccagctcgacctgggcgacctagaggag ctcaacaaggccctgagccgggccgtgcaggctgccgagagcgtccgctctaccaccagg cagatgagaagctcgctgtcagccgacctgcgccaggctcacagcctgcggggctcctgc ctcttctga >gi568815589f:114326355_114433131|GENSCAN_predicted_peptide_3|1254_aa MASSETEIRWAEPGLGKGPQRRRWAWAEDKRDVDRSSSQSWEEERLFPNATSPELLEDFR LAQQHLPPLEWDPHPQPDGHQDSESGETSGEEAEAEDVDSPASSHEPLAWLPQQGRQLDM TEEEPDGTLGSLEVEEAGESSSRLGYEAGLSLEGHGNTSPMALGHGQARGWVASGEQASG DKLSEHSEVNPSVELSPARSWSSGTVSLDHPSDSLDSTWEGETDGPQPTALAETLPEGPS HHLLSPDGRTGGSVARATPMEFQDSSAPPAQSPQHATDRWRRETTRFFCPQPKEHIWKQT KTSPKPLPSRFIGSISPLNPQPRPTRQGRPLPRQGATLAGRSSSNAPKYGRGQLNYPLPD FSKVGPRVRFPKDESYRPPKSRSHNRKPQAPARPLIFKSPAEIVQEVLLSSGEAALAKDT PPAHPITRVPQEFQTPEQATELVHQLQEDYHRLLTKYAEAENTIDQLRLGAKAAVEEDVG AEVQRCRCPSASPCHAPGPCPCHSCCSQCREALPGQIMMVMTITVMVIMVMVTMMTVMVM MMTVMVKMVMMVVVIVMVLVIMVIVMVMVMMVNDGDGNSDCYGADGDDGGGNDSGGDDAD GDDADGDADGDANDGSGDDGGGEDNGDECQGLPLRVNEKACSSSPVFGLAKGCTERWYLL LQGMCVELLNEDEAFQVNLFSDPPQPNHSIHTGMVPQGTKVLSFTIPQPRSAEWWPGPAE DPQASAASGWPSARGDLSPSSLTSMPTLGWLPENRDISEDQSSAEQTQALASQASQFLAK GFQRLKAAHAALEEEYLKACREQHPAQPLAGSKGTPGRFDPRRELEAEIYRLGSCLEELK EHIDQTQQEPEPPGSDSALDSTPALPCLHQPTHLPAPSGQAPMPAIKTSCPEPTARAWQE PATTTAAASTGPCPLHVNVEVSSGNSEVEDRPQDPLARLRHKELQMEQVYHGLMERYLSV KSLPEAMRMEEEEEGEEEEEEEGGGDSLEVDGVAATPGKAEATRVLPRQCPVQAEKSHGA PLEEATEKMVSMKPPGFQASLARDGHMSGLGKAEAAPPGPGVPPHPPGTKSAASHQSSMT SLEGSGISERLPQKPLHRGGGPHLEETWMASPETDSGFVGSETSRVSPLTQTPEHRLSHI STAGTLAQPFAASVPRDGASYPKARGSLIPRRATEPSTPRSQAQRYLSSPSGPLRQRAPN FSLERTLAAEMGSPSRGIPEEGVVITDDSAIIAPEEQDVEVEDRDIDDPDPIQA >gi568815589f:114326355_114433131|GENSCAN_predicted_CDS_3|3765_bp atggccagctcggagactgagatccgctgggctgagcctggcctggggaagggcccccag cggcggcgctgggcctgggccgaggacaagagggatgtggatagaagtagttcacaaagc tgggaagaagagagactctttcccaatgccaccagccccgagctcctagaggacttccgc ctggcccagcagcacctgccgcccctggagtgggacccacacccgcagcccgatgggcat caggattccgagtcaggagagacttcgggagaagaggctgaagcagaggatgtggacagc ccagcaagttcccatgagcctcttgcctggctcccccagcagggccgtcagctggacatg actgaagaggagccagatgggaccctcggaagtctggaggttgaggaggctggagagagc tcctcaaggttggggtatgaggctggtctcagcttggaaggccatggaaacaccagcccc atggctcttgggcatggtcaggccaggggctgggtggcttctggcgaacaagccagtggg gacaaactttctgaacattccgaggtcaacccatccgttgaactcagcccggcaaggtcc tggagcagtgggacagtgagcctcgaccaccctagtgacagccttgattctacctgggaa ggagagaccgatggcccccagcccactgccctggcagaaaccttgccagagggccccagc caccacctcctaagcccagatggcagaactggaggcagtgttgctcgggcaacccccatg gaattccaggactcctcagctcccccagcccagagtccgcagcatgccacagatagatgg aggagagaaacgaccagattcttctgccctcagcccaaggaacacatctggaagcagaca aagacgtcacctaagccactcccttcccgattcattggctccatcagccccctgaatccc cagcccaggccaacgcggcagggcaggccgctgcccagacagggagccactctggctggc cgctcctcttctaatgcccccaagtatggccgggggcagttgaactacccactccctgat ttctccaaggtagggccccgggtgagattccccaaagatgagagctaccgtccccccaag tccagaagccacaacaggaagcctcaggcccctgccaggcccctcatcttcaagtctcca gctgagattgtgcaggaggtgctgttgagcagtggagaagcagccctggcaaaggacacg cctcctgcccaccctatcaccagggtaccccaagaatttcagacgcctgagcaagccact gagctggtccatcagctccaggaagactaccacaggctcctcaccaagtacgctgaggcc gagaacaccattgaccagctacgcctcggggccaaggcagcagtagaggaggatgtggga gctgaggtgcagagatgcagatgccccagtgcctccccctgtcatgctccaggaccctgc ccctgccacagctgctgctctcaatgtcgagaggcacttccggggcagataatgatggtg atgacgataacggtgatggtgataatggtgatggttacgatgatgacggtgatggttatg atgatgacggtaatggtaaaaatggtgatgatggtggtggtaatagtgatggtgttggtg ataatggtgatagtgatggtgatggtaatgatggtgaatgatggtgatggtaatagtgat tgttatggtgctgacggtgatgatggtggtggtaatgacagtggtggtgatgatgctgat ggtgatgatgctgatggtgatgctgatggtgatgctaatgatggtagtggtgatgacggt ggtggtgaagataatggtgatgaatgtcagggactgccattaagagtaaatgaaaaggca tgcagttcttcacctgtgtttggcctagccaagggctgcactgaaagatggtatttatta ctacagggcatgtgtgttgagctcctgaatgaagatgaggcgtttcaggtgaacctgttc tctgacccaccccagcccaaccacagcatccacacgggaatggtgccccaggggaccaag gtcttgtccttcaccatcccacagccccgctctgcagagtggtggccgggcccggccgag gacccccaggcctctgcggcctcagggtggccatcagctcgaggagacttgagcccctcc tcgcttaccagcatgcccaccctggggtggcttccggagaaccgggacatctctgaggac cagtcctcagcagagcagacccaggcactggcttctcaggccagccagttcctggccaag ggcttccagcggctgaaggctgcccacgcggccctagaggaggagtacctgaaggcttgt cgggagcaacaccctgcccagccgcttgccggctccaaggggacgcctggaagatttgat cctcgcagggagctggaggcagagatataccgtctgggaagctgcctggaagagctgaag gaacacatagaccagacccagcaagagcctgagccgcccgggtcagactcagctctggac agcaccccagccctgccctgcctccatcagccaacgcacctgcctgctccttctggacaa gcccccatgccagccatcaagacctcctgccctgagcccacagctcgagcctggcaggag cctgctaccaccactgccgccgccagcactggcccctgcccattgcacgtaaatgtggag gtgagctctggcaacagtgaggtggaggacaggccacaggaccccctggcccgactcagg cacaaggagctgcagatggagcaagtttaccatggcctcatggagcggtacctcagtgtg aagtctctcccagaagccatgagaatggaggaggaggaagaaggagaggaggaggaggag gaagaggggggaggtgactccctggaagttgatggggtggctgcaactccagggaaagca gaggccaccagggtcctcccaaggcagtgcccggtgcaggctgagaaaagtcatggggct cccctggaggaggccacggagaagatggtatctatgaagccaccaggtttccaggcatcc ctggctagagacgggcacatgtcaggcctgggcaaggctgaggcagcccctccaggccct ggcgtgccaccccaccctccaggcaccaagtccgcagcatcccaccaaagtagtatgacc agcctggagggaagcggcatctctgagcgccttccacagaagcctttgcaccgaggcggt gggccccacctggaggagacctggatggcgtccccagagacagacagtggctttgtgggc tcagaaacaagcagagtttcacccctcacccagactccagagcaccggctctcccacatc agcacagcaggaacattagcccagccctttgctgcatctgtgcccagggatggagcttcc taccccaaggccaggggttctctgattcccagaagagccacagagcccagcacaccccgg agccaagcacagaggtacctctccagcccaagtgggcctctccggcagagggcacccaac ttcagcctggagcggacactggcagccgagatgggtagtccttcgagaggtattccagaa gaaggtgttgttatcacagatgacagtgctattattgccccagaagaacaagatgtggag gtagaagacagggatattgatgaccctgaccctatacaggcctag >gi568815589f:114326355_114433131|GENSCAN_predicted_peptide_4|93_aa MGELMAQEEADDHQATTNPDSRSISPLTPPPQCEDDKDVGLYDDPLPLNETLTLPMCNLN LRETVVIPAISTHRPHPEVVLQPEWDVRGSMEV >gi568815589f:114326355_114433131|GENSCAN_predicted_CDS_4|282_bp atgggagaactgatggcccaggaggaagcagacgatcaccaggcaactacaaatccagac agtagatcaatctcccccttaactcctcctcctcaatgtgaagatgacaaggatgtaggc ctttatgatgatccacttccacttaatgaaactctgactctgccaatgtgcaacctgaat ctcagggagaccgtggtgattcctgccatctcaacccaccgtcctcaccccgaggtggtc ctgcagccagagtgggatgttcggggcagcatggaggtgtga >gi568815589f:114326355_114433131|GENSCAN_predicted_peptide_5|161_aa MEEFEKSCFVQFCVQSSARAGFALPMSEFLGAGRGGGLSVIGALPEAPPPPGRPLLSRAG LRDYAKEPPPRPAPGLEAFVCRSTPGNHPRSAAASRLYQLRDVVEVGVIVVPLVQIESLR FREVQVILVPQARGKGWQGSADTAMHMTVAGKRVKVIKPWQ >gi568815589f:114326355_114433131|GENSCAN_predicted_CDS_5|486_bp atggaagagtttgagaaatcctgctttgtccagttctgtgtccagagttctgccagagca gggtttgcgctccctatgtccgagttcctgggggcgggccgtggcggcggcctttcggtg attggtgcgctcccagaggccccgccccccccgggccgccccctgctgtcgcgggcgggc ctcagggattatgcaaaggagccaccgccccgccccgcgcccggattggaggcctttgtt tgccgctcaactccaggaaaccacccgcgctcggcggccgccagcagattatatcagtta cgtgatgttgtcgaagtgggtgttattgttgtccctcttgtacagatagaaagcctgagg ttcagagaggttcaagtgattcttgtgcctcaggctcgtgggaagggctggcagggttct gcagacacagccatgcacatgactgttgctggtaaaagggtgaaagtcatcaagccctgg cagtag >gi568815589f:114326355_114433131|GENSCAN_predicted_peptide_6|561_aa VGDQILEVNGRSFLNILHDEAVRLLKSSRHLILTVKDVGRLPHARTTVDETKWIASSRIR ETMANSAGFLGDLTTEGINKPGFYKGPAGSQVTLSSLGNQTRVLLEEQARHLLNEQEHAT MAYYLDEYRGGSVSVEALVMALFKLLNTHAKFSLLSEVRGTISPQDLERFDHLVLRREIE SMKARQPPGPGAGDTYSMVSYSDTGSSTGSHGTSTTVSSARERLLWLIDLMEVPSFLPER LLWFIDLMEVPPFLPNTLDLEETGEAVQGNINALPDVSVNRSPPAGTAPTPGTSSAQDLP SSPIYASVSPANPSSKRPLDAHLALVNQHPIGPFPRVQSPPHLKSPSAEATVAGGCLLPP SPSGHPDQTGTNQHFVMVEVHRPDSEPDVNEVRALPQTRTASTLSQLSDSGQTLSEDSGV DAGEAEASAPGRGRQSVSTKSRSSKELPRNERPTDGANKPPGLLEPTSTLVRVKKSAATL GIAIEGGANTRQPLPRIVTIQRGGSAHNCGQLKVGHVILEVNGLTLRGKEHREAARIIAE AFKTKDRDYIDFLVTEFNVML >gi568815589f:114326355_114433131|GENSCAN_predicted_CDS_6|1686_bp gttggggaccagattctagaagtgaatgggcggagctttctcaacatcctacacgacgag gctgtcaggctgcttaagtcatctcggcacctcatcctgacagtgaaggacgtcgggagg ctgccccatgcccgcaccactgtggacgagaccaagtggatcgccagttcccggatcagg gagaccatggcgaactcggcagggtttcttggcgatctcacaacagaaggaataaacaag ccaggattttacaagggcccagccggctcccaggtgaccctgagcagcctggggaaccag acacgagtgctgctggaggagcaggctcggcacctgctgaacgagcaggaacacgccacc atggcctactacctggatgagtaccgtggtggcagcgtctctgtggaggccctcgtcatg gccctgttcaagctgctcaacacccacgccaagttctcactcctctctgaggtgagaggc accatttccccgcaagacctagaacgcttcgaccacctggtgctgaggcgtgagattgag tccatgaaggcgcggcagcccccaggccccggggctggggacacctactccatggtctcc tacagtgacacgggttcatccacaggcagccacggcacctccaccaccgtcagctcggcc agggagcggctgctgtggcttatagacctgatggaggtaccgtccttcctgccggagcga ctgctgtggtttatagacctgatggaggtaccacccttcctgccgaacactctggacctg gaggaaactggcgaggctgtccagggcaatatcaacgccctcccagatgtgtccgtgaac cgcagcccgccagcgggcaccgcacccaccccagggacctcctctgcacaggacttgccc tcttcccccatctatgcctccgtctcccctgccaaccccagctccaagaggccgctggac gcccatctggccctggtcaaccaacaccccatcggccccttcccacgggtccagtcaccc ccgcacctgaaaagcccctctgcagaggccacagtggctgggggctgccttctgccccca tcaccctctggccacccagaccagacaggcacaaaccagcactttgtcatggtggaggtc caccgccccgacagcgagccagacgtcaatgaagtgagggcgctgccccagacgcgcaca gcctctacgctctcccagctctcggacagcgggcagactctaagcgaggacagtggtgtg gatgctggcgaggcagaggccagcgccccaggccgaggaaggcagtcggtgtccaccaag agcaggagtagcaaggagctgcctcggaacgagaggcccacagatggggccaacaaaccg cctggacttctggagcccacgtccactctggtccgtgtgaagaaaagtgcggccaccctg ggcatcgccatcgagggtggcgccaacacccgccagcccctgcctaggattgtcactatt cagagaggcggctcagctcacaactgtgggcagctcaaggtgggccacgtgattctggaa gtgaatgggctgacgcttcggggcaaggagcaccgggaggccgcccgcattatcgccgag gccttcaagactaaggaccgtgactacattgactttctggtcactgagttcaatgtgatg ctctag