GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:43:51

Sequence gi568815581f:43384038_43623844 : 239807 bp : 47.53% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +  15696  16301  606  2  0   74   46   750 0.772  65.60
 1.02 PlyA +  17678  17683    6                               1.05

 2.06 PlyA -  20447  20442    6                               1.05
 2.05 Term -  27876  27793   84  0  0  134   47     4 0.412  -1.45
 2.04 Intr -  30506  30411   96  2  0   52   72    50 0.312   0.01
 2.03 Intr -  31497  31354  144  0  0   84   51    62 0.549   2.58
 2.02 Intr -  33748  33579  170  2  2  119   37    64 0.522   4.07
 2.01 Init -  41502  41454   49  0  1   86   89    23 0.141   1.32
 2.00 Prom -  55426  55387   40                              -4.06

 3.00 Prom +  59810  59849   40                              -3.86
 3.01 Sngl +  92161  92628  468  0  0  110   43   440 0.989  37.83
 3.02 PlyA +  92711  92716    6                               1.05

 4.00 Prom +  95941  95980   40                              -6.96
 4.01 Init + 100001 100148  148  1  1   89   85   223 0.890  22.35
 4.02 Intr + 105412 105497   86  2  2  103   92    73 0.991   8.74
 4.03 Intr + 108146 108291  146  1  2   92   46   184 0.956  13.68
 4.04 Intr + 108644 109003  360  2  0   28   90   278 0.790  16.34
 4.05 Intr + 109408 109552  145  1  1   70  100   110 0.930  10.68
 4.06 Intr + 109646 109849  204  1  0   78   47   259 0.974  20.20
 4.07 Intr + 112144 112231   88  0  1   79   88   156 0.919  14.24
 4.08 Intr + 114825 114922   98  1  2   24  100    34 0.450  -2.07
 4.09 Intr + 115919 116066  148  1  1   64   58    80 0.341   2.41
 4.10 Intr + 120607 120788  182  2  2   95   94   165 0.651  17.39
 4.11 Intr + 122966 123160  195  1  0  119  107   123 0.992  16.91
 4.12 Intr + 123466 123651  186  0  0   91   88   213 0.999  21.49
 4.13 Intr + 123772 123982  211  0  1   72  110   154 0.999  14.49
 4.14 Intr + 124302 124483  182  1  2   60   87   125 0.998   9.19
 4.15 Intr + 129325 129465  141  0  0  127   80   167 0.994  20.35
 4.16 Intr + 133130 133285  156  1  0   70   70    77 0.918   4.31
 4.17 Intr + 136093 136230  138  0  0  129   90   206 0.953  25.56
 4.18 Intr + 136714 136842  129  0  0   91   97    89 0.999  10.99
 4.19 Intr + 137332 137528  197  0  2   72  115   254 0.995  24.71
 4.20 Intr + 138010 138189  180  1  0   62   78   277 0.999  23.08
 4.21 Term + 139591 139810  220  1  1   99   48   333 0.999  26.91
 4.22 PlyA + 143133 143138    6                              -0.45

 5.15 PlyA - 143831 143826    6                               1.05
 5.14 Term - 144706 144482  225  1  0  138   41   220 0.986  19.08
 5.13 Intr - 145199 145098  102  2  0   66   81   184 0.827  15.87
 5.12 Intr - 145639 145467  173  2  2  104   64   178 0.839  16.66
 5.11 Intr - 145945 145847   99  2  0   59   77   134 0.976   9.48
 5.10 Intr - 146144 146070   75  0  0  138   82     0 0.881   3.89
 5.09 Intr - 148902 148637  266  2  2   25  110   192 0.994  12.26
 5.08 Intr - 149311 149150  162  0  0  114   82   105 0.980  11.59
 5.07 Intr - 149948 149822  127  0  1   95   99    88 0.994  10.34
 5.06 Intr - 152442 152389   54  1  0  104   84    42 0.947   4.35
 5.05 Intr - 155638 155458  181  1  1   36   44   145 0.404   4.34
 5.04 Intr - 157839 157724  116  1  2   88   35    18 0.414  -3.33
 5.03 Intr - 160985 160938   48  0  0   97  113    50 0.992   7.05
 5.02 Intr - 161442 161237  206  2  2  113   98   214 0.654  23.74
 5.01 Init - 163187 163114   74  2  2   93   37    37 0.374  -0.35
 5.00 Prom - 163531 163492   40                              -7.66

 6.00 Prom + 166712 166751   40                              -6.66
 6.01 Init + 172393 172574  182  0  2   74   96   137 0.945   9.76
 6.02 Intr + 173796 173826   31  0  1   95  101    20 0.848   2.13
 6.03 Intr + 199455 199667  213  2  0  116   68   136 0.823  13.31
 6.04 Intr + 230657 230858  202  1  1   70   51    83 0.249   1.66
 6.05 Intr + 231193 231273   81  2  0   90   86    17 0.243   1.31
 6.06 Term + 237947 238104  158  0  2   83   43    85 0.266   1.60
 6.07 PlyA + 238959 238964    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +    622    688   67  0  1   71   71    74 0.863   5.24

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:43384038_43623844|GENSCAN_predicted_peptide_1|201_aa
MGNHLTEMAPTASSFLPHFQALHVVVIGLDSAGKTSLLYRLKFKEFVQSVPTKGFNTEKI
RVPLGGSRGITFQVWDVGGQEKLRPLWRSYTRRTDGLVFVVDAAEAERLEEAKVELHRIS
RASDNQGVPVLVLANKQDQPGALSAAEVEKRLAVRELAAATLTHVQGCSAVDGLGLQQGL
ERLYEMILKRKKAARGGKKRR
>gi568815581f:43384038_43623844|GENSCAN_predicted_CDS_1|606_bp
atggggaaccacttgactgagatggcgcccactgcctcctccttcttgccccacttccaa
gccctgcatgtcgtggtcattgggctggactctgctggaaagacctccctcctttaccgc
ctcaagttcaaggagtttgtccagagtgtccccaccaaaggcttcaacaccgagaagatc
cgggtgcccctcgggggatcgcgtggcatcaccttccaagtgtgggacgtcggggggcag
gagaagctgcgaccactgtggcgctcttatacccgccggacagacggtctagtgtttgtg
gtggacgctgcggaggctgagcggctggaggaagccaaggtggagttgcaccgaatcagc
cgggcctcggacaaccagggcgtgccagtgctggtgctggccaacaagcaggaccagccc
ggggcactgagcgctgctgaggtggagaagaggctggcagtccgagagctagcagccgcc
actctcactcatgtgcaaggctgcagcgctgtggacggtctgggcctgcagcagggcctt
gagcgcctctatgagatgatcctcaagaggaagaaggcagctcggggtggcaagaagaga
cggtga
>gi568815581f:43384038_43623844|GENSCAN_predicted_peptide_2|180_aa
MGFRHVDQAGLKLLTSDSPLCHTPTQPSAIRALNTQKGKTICAEREDRAREEGKGCSLPA
DVLQTELPVSVPRSSSPDRLARYLHQWLTLCPCLLHSPLSDIFKLFYFQTVFVIAISCLF
EGNKLSWFAQVPDISQDAGLSVPKLDKLSQLGLDSGTSTNRAVPEEKEVSALSPSTPPTP
>gi568815581f:43384038_43623844|GENSCAN_predicted_CDS_2|543_bp
atggggtttcgccatgtagaccaggctggtctcaaactcctgacctcagacagcccgctc
tgccacactcccacccagccctctgctatcagagctctgaacacccaaaaaggaaaaacg
atttgtgctgagcgcgaagacagagccagggaggaagggaagggctgctcgctgccagct
gacgttctgcagactgagctgcctgtgtctgtgccaaggtcctcaagtcctgatcgcctt
gctcgctatttgcaccagtggcttactctctgtccctgcctcctacactctccactgtca
gatattttcaagctgttttattttcagacagtcttcgtcattgccatcagctgcctattt
gagggcaacaaactgtcctggtttgcccaggttccagatatttctcaggatgcaggactt
tcagtgccaaaactagacaaattgagccagctgggactggactctggcacttctacaaat
agagctgtgccagaagaaaaggaagtttcagcactcagtccatccacaccccccaccccc
tga
>gi568815581f:43384038_43623844|GENSCAN_predicted_peptide_3|155_aa
MAKSKNHSTNNQSRKRHRNGIKKPRSRRYESLKGMDPKFPRNMCFAKKQNKKVLKKMQAN
SDKAMSARAEVIKALVKPKEVKLKIPKGVSCKLDRLAYIAHPKLGKRARARIAKGLRLCW
PKAKAKDQTKAQAAAPASVPAQAPKGAQAPTKASE
>gi568815581f:43384038_43623844|GENSCAN_predicted_CDS_3|468_bp
atggccaagtccaagaaccacagcacaaacaaccagtcccgaaaaaggcacagaaatggt
atcaagaaaccccgatcacgaagatatgaatctcttaaggggatggaccccaagttcccg
aggaacatgtgctttgccaagaagcaaaacaagaaggtcctaaagaagatgcaggccaac
agtgacaaggccatgagtgcacgtgctgaggttatcaaggccctcgtaaagcccaaggag
gttaagctcaagatcccaaagggtgtcagctgcaagctcgatcgacttgcctacattgcc
caccccaagcttgggaagcgggctcgtgcccgcattgccaaggggctcaggctgtgctgg
ccaaaggccaaggccaaggatcaaaccaaggcccaggctgcagctccagcttcagttcca
gctcaggctcccaaaggtgcccaggcccctacaaaggcttcagagtag
>gi568815581f:43384038_43623844|GENSCAN_predicted_peptide_4|1179_aa
MAVAVAMAGALIGSEPGPAEELAKLEYLSLVSKVCTELDNHLGINDKDLAEFVISLAEKN
TTFDTFKASLVKNGAEFTTMLDEDDVKVAVDVLKELEALMPSAAGQEKQRDAEHRFVLSV
LSFGSLGDRTKKKKRSRSRDRNRDRDRDRERNRDRDHKRRHRSRSRSRSRTRERNKVKSR
YRSRSRSQSPPKDRKDRDKYGERNLDRWRDKHVDRPPPEEPTIGDIYNGKVTSIMQFGCF
VQLEGLRKRWEGLVHISELRREGRVANVADVVSKGQRVKVKVLSFTGTKTSLSMKDVDQE
TGEDLNPNRRRNLVGETNEETSMRNPDRPTHLSLVSAPEVEDDSLERKRLTRISDPEKWE
IKQMIAANVLSKEEFPDFDEETGILPKVDDEEDEDLEIELVEEEPPFLRGHTKQSMDMSP
IKIVKNPDGSLSQAAMMQSALAKERRELKQAQREAEMDSIPMGLNKHWVDPLPDAEGRQI
AANMRGIGMMPNDIPEWKKHAFGGNKASYGKKTQMSILEQRESLPIYKLKEQLVQAVHDN
QILIVIGETGSGKTTQITQYLAEAGYTSRGKIGCTQPRRVAAMSVAKRVSEEFGCCLGQE
VGYTIRFEDCTSPETVIKYMTDGMLLRECLIDPDLTQYAIIMLDEAHERTIHTDVLFGLL
KKTVQKRQDMKLIVTSATLDAVKFSQYFYEAPIFTIPGRTYPVEILYTKEPETDYLDASL
ITVMQIHLTEPPGDILVFLTGQEEIDTACEILYERMKSLGPDVPELIILPVYSALPSEMQ
TRIFDPAPPGSRKVVIATNIAETSLTIDGIYYVVDPGFVKQKVYNSKTGIDQLVVTPISQ
AQAKQRAGRAGRTGPGKCYRLYTERAYRDEMLTTNVPEIQRTNLASTVLSLKAMGINDLL
SFDFMDAPPMETLITAMEQLYTLGALDDEGLLTRLGRRMAEFPLEPMLCKMLIMSVHLGC
SEEMLTIVSMLSVQNVFYRPKDKQALADQKKAKFHQTEGDHLTLLAVYNSWKNNKFSNPW
CYENFIQARSLRRAQDIRKQMLGIMDRHKLDVVSCGKSTVRVQKAICSGFFRNAAKKDPQ
EGYRTLIDQQVVYIHPSSALFNRQPEWVVYHELVLTTKEYMREVTTIDPRWLVEFAPAFF
KVSDPTKLSKQKKQQRLEPLYNRYEEPNAWRISRAFRRR
>gi568815581f:43384038_43623844|GENSCAN_predicted_CDS_4|3540_bp
atggctgtggctgtagccatggcgggagccttaatcgggtcggagccaggccccgcggaa
gaacttgccaaactcgagtacctgtctttggtgtcaaaggtttgcactgagctggacaat
cacttggggatcaacgacaaggaccttgctgaatttgtgatcagtcttgctgagaaaaat
accacctttgatacttttaaggcttctctcgtcaaaaatggtgcagaatttacgaccatg
ttggatgaagatgatgtgaaagttgctgtggatgtcctgaaagaactggaagctttaatg
cccagcgcagcaggccaggagaagcaaagagatgctgaacaccggtttgtccttagtgtc
ctgtcctttggaagtttaggggacaggacaaagaagaagaagcggagtcgaagccgagat
cgaaaccgagatcgagacagagatagggaacgaaaccgagatagagaccacaagcggaga
caccgatcccgctctcgatcacgttccaggacccgggagaggaataaagtgaagtctaga
tatcggtccaggagcaggagtcagagtccccccaaagaccggaaggaccgggacaaatat
ggagagcggaatctggatagatggcgggataagcatgtggaccgccctcctccagaagag
cccaccattggtgacatttataatggcaaagttaccagcatcatgcagtttggttgcttt
gtgcagctggaaggactaaggaagcggtgggaaggcctggtgcacatctctgagctccgg
cgggagggtcgtgtggccaatgtagctgatgtcgtgagcaaaggccagagggtcaaagtc
aaagtgctgtccttcactgggaccaagaccagcctgagcatgaaggatgtggatcaagag
actggagaagatctaaacccaaatagacggcgaaatcttgtcggggagaccaatgaggag
acctcaatgcggaatcctgatagacccactcacttgtcccttgtcagtgctcctgaagta
gaggacgactcactggaacgcaagcgcctcacccgaatctctgacccagagaagtgggag
atcaaacagatgattgctgccaatgtcctttccaaagaagaatttccagactttgatgaa
gagactggcattctccctaaggtggatgatgaagaagatgaggaccttgagattgaattg
gttgaggaagagcctccattcctgagagggcacactaagcaaagcatggacatgagcccc
attaaaattgtcaagaacccagacggctccctctcccaagcagcaatgatgcagagtgcc
ttggccaaagaaaggcgggaactcaaacaggcccagcgggaagctgagatggattctatt
cccatgggactcaacaaacactgggttgaccctctgcctgatgcggaaggcagacagatt
gctgccaacatgaggggtattgggatgatgcccaatgatattcctgagtggaagaagcat
gcctttgggggcaacaaagcctcttacggaaaaaagacccagatgtcaatccttgagcag
agggagagcctgcccatctacaaactgaaggagcaattggtccaggccgtccatgacaat
cagatcctgattgtcattggtgagacaggatctggaaagacaacacagatcacccagtac
ctggcggaggcaggctacacttccaggggcaagattgggtgtacccagcccagaagagtg
gcagctatgtcggtggccaaaagagtgtcagaggagtttggttgttgcttaggccaagag
gtgggctacaccattcgatttgaggactgcactagccctgaaacagtcatcaagtacatg
acagatgggatgttgcttagagagtgcttgattgaccctgacctcactcagtacgcgatc
atcatgttggacgaggcacatgagaggacaattcacactgatgtgctctttggattgttg
aaaaagacagttcagaaacggcaggacatgaagctgattgtcacctcagccaccttggat
gcagtgaagttttctcaatacttctatgaagctcccattttcaccatcccaggtcgaaca
tatccagtggaaatactgtacacaaaggaacctgagacagattatctggatgccagcctg
attactgttatgcagattcatttaacagaaccaccaggtgatatcctggtcttcctgact
ggtcaggaagaaattgatactgcttgtgagatcctgtatgaaagaatgaaatccctggga
cctgatgttccagagttaattatcctcccagtgtactctgctcttcccagtgagatgcag
acccgaatctttgacccagctccaccaggcagcagaaaggttgtgattgccaccaatatc
gcagagacatcgctgactattgatggtatctactatgtggtggacccaggattcgtgaaa
cagaaagtttacaattccaagacagggattgaccagctcgtggtgacgcctatttctcag
gctcaggcaaagcaacgagctggcagagctgggagaacaggcccagggaagtgttacagg
ttgtacacagaacgtgcctaccgagatgaaatgctgaccaccaacgtgccggaaatccag
agaaccaacttagcaagcacagtgctgtcactcaaggccatgggtatcaatgatctgctg
tcctttgatttcatggatgccccacctatggaaactttgatcacagccatggagcagctg
tacacactgggggccctggatgacgagggcctgctcactcgcttgggccgccggatggca
gagttccctctggagccaatgctatgcaaaatgctcatcatgtctgtgcatctgggctgc
agtgaggaaatgctgaccattgtatccatgctgtctgtgcagaacgtcttctataggccc
aaggataaacaagcccttgcagatcagaagaaggccaaattccaccagactgaaggggac
cacctcaccctgctagctgtgtacaactcctggaagaacaacaagttctccaacccatgg
tgctatgagaactttatccaggctcgttccctgcgccgggcccaggacattcgcaagcag
atgttaggcataatggacagacacaagctggatgttgtttcctgtggcaagtccacagtc
cgagtgcagaaggccatctgcagtgggttcttccgtaatgctgccaagaaagacccgcag
gagggttaccggacactgatcgaccagcaggtggtctatatccatccttccagtgccctc
ttcaacagacagccagaatgggtggtgtaccatgagctggtgctcaccaccaaggaatac
atgcgtgaagttaccaccatcgaccctcggtggcttgtggagtttgccccagccttcttc
aaggtctcagacccaactaagctaagcaaacagaagaagcaacagcgtcttgaacccttg
tacaaccgctatgaggaacccaatgcctggagaatatctcgagctttccgacggcgctga
>gi568815581f:43384038_43623844|GENSCAN_predicted_peptide_5|635_aa
MAERELHFRKSSTLGLLVSAAISRNPSGVLGCPAPESPEDPNLVPQTKRLRVTRGHSPRF
SQKSPGNGSLREALIGPLGKLMDPGSLPPLDSEDLFQDLSHFQETWLAEATMPPSPSFCL
FPFLKTLGDKYVPSTTSELVPSSTEAARAHSADFCQPEDKRLLHVYSCSQEITALSVYSN
PVLGAGASDDKKDKALALTELNFLVEEIAQVPDSDEQFVPDFHSENLAFHSPTTRIKKEP
QSPRTDPALSCSRKPPLPYHHGEQCLYSSAYDPPRQIAIKSPAPGALGQSPLQPFPRAEQ
RNFLRSSGTSQPHPGHGYLGEHSSVFQQPLDICHSFTSQGGGREPLPAPYQHQLSEPCPP
YPQQSFKQEYHDPLYEQAGQPAVDQGGVNGHRYPGAGVVIKQEQTDFAYDSDVTGCASMY
LHTEGFSGPSPGDGAMDLSNPQFPPQGYGYEKPLRPFPDDVCVVPEKFEGDIKQEGVGAF
REGPPYQRRGALQLWQFLVALLDDPTNAHFIAWTGRGMEFKLIEPEEVARLWGIQKNRPA
MNYDKLSRSLRYYYEKGIMQKVAGERYVYKFVCEPEALFSLAFPDNQRPALKAEFDRPVS
EEDTVPLSHLDESPAYLPELAGPAQPFGPKGGYSY
>gi568815581f:43384038_43623844|GENSCAN_predicted_CDS_5|1908_bp
atggctgagagggagctacactttcggaaatcatctaccctggggcttctggtttctgct
gcaatcagtagaaatcccagcggagtcctgggctgccccgcccctgagtcacccgaggac
cccaacctcgtcccccagactaagcgcctcagggtgactcgcgggcattctccccgcttc
tcgcagaaatcgcccggaaatgggagcttgcgcgaagcgctgatcggcccgctggggaag
ctcatggacccgggctccctgccgcccctcgactctgaagatctcttccaggatctaagt
cacttccaggagacgtggctcgctgaagctacaatgcctccttctccatctttctgtctc
ttccccttcctgaagactcttggggacaaatatgtcccctccacaactagtgaactggtt
ccttccagtacagaggctgccagggcccacagtgctgacttctgccagcctgaggacaaa
cgattattgcatgtttattcatgtagtcaagaaataacagcactgagcgtctactctaac
cctgtgctgggtgctggggcatcagatgacaaaaaagataaagctcttgccctcacagag
cttaacttcctggtggaggagatagctcaggtaccagacagtgatgagcagtttgttcct
gatttccattcagaaaacctagctttccacagccccaccaccaggatcaagaaggagccc
cagagtccccgcacagacccggccctgtcctgcagcaggaagccgccactcccctaccac
catggcgagcagtgcctttactccagtgcctatgacccccccagacaaatcgccatcaag
tcccctgcccctggtgcccttggacagtcgcccctacagccctttccccgggcagagcaa
cggaatttcctgagatcctctggcacctcccagccccaccctggccatgggtacctcggg
gaacatagctccgtcttccagcagcccctggacatttgccactccttcacatctcaggga
gggggccgggaacccctcccagccccctaccaacaccagctgtcggagccctgcccaccc
tatccccagcagagctttaagcaagaataccatgatcccctgtatgaacaggcgggccag
ccagccgtggaccagggtggggtcaatgggcacaggtacccaggggcgggggtggtgatc
aaacaggaacagacggacttcgcctacgactcagatgtcaccgggtgcgcatcaatgtac
ctccacacagagggcttctctgggccctctccaggtgacggggccatggatctgagcaac
ccccaatttcctccacaaggctatggctatgagaaacctctgcgaccattcccagatgat
gtctgcgttgtccctgagaaatttgaaggagacatcaagcaggaaggggtcggtgcattt
cgagaggggccgccctaccagcgccggggtgccctgcagctgtggcaatttctggtggcc
ttgctggatgacccaacaaatgcccatttcattgcctggacgggccggggaatggagttc
aagctcattgagcctgaggaggtcgccaggctctggggcatccagaagaaccggccagcc
atgaattacgacaagctgagccgctcgctccgatactattatgagaaaggcatcatgcag
aaggtggctggtgagcgttacgtgtacaagtttgtgtgtgagcccgaggccctcttctct
ttggccttcccggacaatcagcgtccagctctcaaggctgagtttgaccggcctgtcagt
gaggaggacacagtccctttgtcccacttggatgagagccccgcctacctcccagagctg
gctggccccgcccagccatttggccccaagggtggctactcttactag
>gi568815581f:43384038_43623844|GENSCAN_predicted_peptide_6|288_aa
MTFLIYEVVLAALIWVLWGGGFEFLGAQSAVAIGRKGPTFLGREVELPLKEAEGGCQKRE
GEPALSFSFVNNLSKELSTLTASTSSPPMDSSTRFPPFHGNDLWFNEVQTLVNFEIALAK
VINGQLVFKLRLGSCPHALLALGWKDSENSKDIKQIHNGLSSINMSQVQTEMPQAPTWFN
ENIHQLCLCSHFTNKQGGEMDSEIVPGEPARFASSVKRGNNPRPVSPHDRKGSLRQGSSA
LMLYLGLIARESGLDGSPEHIKHCPLPRCSDCSSCHGDKDIFFLFQSQ
>gi568815581f:43384038_43623844|GENSCAN_predicted_CDS_6|867_bp
atgacctttctgatttacgaggtggtgctggctgcactcatttgggttctgtggggaggt
ggctttgagttcctcggggcccagtctgcagtggctattgggagaaaaggccccaccttc
ctgggcagagaagtggagcttccacttaaagaggcagaaggaggctgtcagaagagggaa
ggggaaccagcgctctccttcagctttgtcaataacttatcgaaggagctgtccacactc
actgcctccacttcctcacctcccatggattcctcaactcgctttccacccttccatgga
aatgatctgtggtttaatgaggtgcagactttggtcaattttgaaattgctcttgctaaa
gtcatcaatggccaacttgtcttcaaactacggcttggttcttgtcctcatgctcttctc
gctctgggatggaaggactcggagaattcgaaggatattaaacagattcataatggactc
agcagtataaacatgtctcaagtgcagacagaaatgccacaagctcctacttggtttaat
gaaaacatacaccagttgtgtctgtgttcccattttacaaataaacaaggtggggagatg
gactcggaaattgtccctggggagccagccaggtttgcctcatctgtgaaacggggtaac
aatcccagacctgtgtcgccgcatgatcgcaagggctctctgagacaagggagctcggct
ctaatgctgtacctgggcctcattgccagagagtctggtttagatgggtccccagagcac
attaagcactgtcctctgccaaggtgctcagactgcagctcctgtcatggggacaaggat
atcttcttcctgttccagagtcagtga