GENSCAN 1.0    Date run: 6-Nov-116    Time: 21:33:36

Sequence gi568815587f:59988766_60195653 : 206888 bp : 38.54% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7054   7169  116  1  2  108   84    37 0.814   4.47
 1.02 Term +  16330  16460  131  2  2   79   47   168 0.838   9.16
 1.03 PlyA +  16665  16670    6                               1.05

 2.07 PlyA -  17267  17262    6                               1.05
 2.06 Term -  30957  30656  302  1  2   36   45   243 0.054   9.20
 2.05 Intr -  31251  31079  173  2  2   48   31   115 0.019   0.46
 2.04 Intr -  61972  60598 1375  2  1   67   53   430 0.150  24.35
 2.03 Intr -  62433  62334  100  0  1   60   89   -28 0.159  -6.74
 2.02 Intr -  66310  66009  302  2  2    1   78   232 0.263   9.03
 2.01 Init -  68337  68232  106  0  1   67   92    71 0.754   5.83
 2.00 Prom -  71388  71349   40                              -4.15

 3.00 Prom +  71923  71962   40                             -11.24
 3.01 Init +  72396  72551  156  2  0   69   98   105 0.995   9.69
 3.02 Intr +  73703  73840  138  1  0  136   59   108 0.883  12.44
 3.03 Intr +  75497  75553   57  1  0   75  106    41 0.803   2.76
 3.04 Intr +  78186  78347  162  2  0   95   48    85 0.238   4.45
 3.05 Intr +  80090  80172   83  1  2   69   72    39 0.009  -2.08
 3.06 Term +  89213  89654  442  2  1   24   41   376 0.006  20.24
 3.07 PlyA +  90054  90059    6                               1.05

 4.00 Prom +  99865  99904   40                              -6.25
 4.01 Init + 100001 100056   56  1  2   77   96    31 0.961   3.61
 4.02 Intr + 100927 101056  130  1  1   86   84    63 0.859   5.48
 4.03 Intr + 104635 104793  159  0  0   72   40    96 0.820   2.46
 4.04 Intr + 105199 105297   99  0  0   74   71    68 0.836   3.09
 4.05 Term + 106793 106891   99  1  0   98   48    47 0.845  -1.15
 4.06 PlyA + 107655 107660    6                               1.05

 5.03 PlyA - 109047 109042    6                               1.05
 5.02 Term - 114146 113736  411  2  0   65   48   193 0.796   7.26
 5.01 Init - 120491 120480   12  2  0   66  108     0 0.207  -0.04
 5.00 Prom - 124026 123987   40                              -5.75

 6.00 Prom + 126048 126087   40                              -3.55
 6.01 Init + 130228 130467  240  0  0   46   52   155 0.222   5.62
 6.02 Intr + 133268 133323   56  1  2   92  113    18 0.661   1.56
 6.03 Intr + 136355 136449   95  2  2  102   36    65 0.711   1.39
 6.04 Term + 136575 136729  155  1  2   58   55   100 0.389   0.80
 6.05 PlyA + 136919 136924    6                               1.05

 7.00 Prom + 140598 140637   40                              -3.35
 7.01 Sngl + 145500 146006  507  2  0   50   39   293 0.171  16.39
 7.02 PlyA + 146425 146430    6                               1.05

 8.13 PlyA - 146481 146476    6                               1.05
 8.12 Term - 155801 155713   89  0  2   94   47    73 0.432   0.64
 8.11 Intr - 156302 156235   68  1  2   56   86    54 0.237  -0.37
 8.10 Intr - 165799 164851  949  2  1   45   53   239 0.001   5.35
 8.09 Intr - 172741 172579  163  1  1   66   84   164 0.714  12.43
 8.08 Intr - 180431 180303  129  2  0   43   69    95 0.081   3.07
 8.07 Intr - 186846 186637  210  0  0   91   87    52 0.134   3.59
 8.06 Intr - 188650 188547  104  2  2   44   44    75 0.079  -2.33
 8.05 Intr - 189551 189495   57  0  0   92   97    15 0.411   0.74
 8.04 Intr - 191200 191066  135  2  0  121   59    25 0.661   2.52
 8.03 Intr - 192976 192734  243  2  0   57   72   161 0.027   7.95
 8.02 Intr - 195640 195555   86  0  2   36   80    79 0.002   0.44
 8.01 Intr - 197334 197230  105  2  0   91   85    57 0.002   4.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  31084  30656  429  1  0   58   45   269 0.910  15.63
S.002 Sngl +  89229  89654  426  2  0   70   41   380 0.991  27.54
S.003 Sngl -  97121  96768  354  2  0   44   41   221 0.951   8.80
S.004 Sngl + 166216 166896  681  0  0   42   28   301 0.918  15.43
S.005 Init - 192962 192734  229  2  1   82   72   145 0.846  10.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:59988766_60195653|GENSCAN_predicted_peptide_1|82_aa
XKPSSFEQNVIQNNTDKSKKIVQTQYYQAGNWGIEECYLGIPEEDIVITGDDNSLCVIAH
EDLPVGQYVEREDGEIEDPDPA

>gi568815587f:59988766_60195653|GENSCAN_predicted_CDS_1|249_bp
nnaaagccttcctcatttgaacaaaatgtaatacagaataatactgataaatctaagaaa
atagtccaaactcagtattatcaagcaggaaattggggtatcgaggaatgttacttaggt
attccagaagaagacattgttatcacaggagatgacaactccttgtgtgttattgcccac
gaagaccttccagtgggacaatatgtggagagggaagacggtgagattgaggatcctgac
cctgcatag

>gi568815587f:59988766_60195653|GENSCAN_predicted_peptide_2|785_aa
MHFADPILQAYRDHPSSSELADKLRCSQWDLMYLRGAKEGTDEREDRDRDRSQKEIAKRK
VVKIRSFPPRFIQNHCKWNTEEERDDKKMPSAKKRKKSEEMKGKSGCFQHALFEFGAVEK
AMKGSYLNSKTSVLPFHLLFPDFLMIAILTGVRWYLIVVLICISLMASDVLEVLARAIRQ
EKDIKGIQLGKEEVKLSLFADDMIVYLENPVDSAQNLLKLISNFSKVSGYKINVQKSQAF
LYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKN
IPCSWVGRINIVKMAILPKVIYRFNAIPIKLSMTFFTELEKTTLKFIWHQKRVCITKSIL
SQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITLHIYNYLIFDKPE
KNKQWGKDSLFNKWCWGKWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENL
GITIQDIGMGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETTVRVNRQPTKWEKIFT
TYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAI
REMQIKTTMRYHLTPVRMAIIKKSGNNSYSGGDLENLCVHTLYLANLVGMWRTFVSSSGI
VNAPISTLSKRTSQLSVKWTNQQDVAVWVHTAFMSCNTHAKVCSFAPEASQTTNPPGGTN
NSRRAALRALILTAKICSFTPEPARPRTHQKEENAEHIRTSEGTNSGHATFKNCNTHREG
PRLHS

>gi568815587f:59988766_60195653|GENSCAN_predicted_CDS_2|2358_bp
atgcactttgcagaccccatacttcaggcatatcgagaccacccctcatcatcagagctg
gcagacaaactcaggtgcagccagtgggatttaatgtacctaagaggagcgaaggaaggg
actgatgaaagagaagacagagacagagacagaagtcagaaggaaatagccaaaaggaag
gtggtaaagataagatccttccctccccgttttatccagaaccattgcaaatggaatact
gaagaagaaagagatgacaagaaaatgccttcagcgaagaagagaaagaaatctgaagaa
atgaaaggaaagagtggatgctttcaacatgctctttttgaatttggagctgtagaaaaa
gccatgaaaggaagttacttaaactctaagacttcggttctcccatttcacctgttgttt
cctgactttttaatgattgccattctaactggtgtgagatggtatctcattgtggttttg
atttgcatttctctgatggccagtgatgtgttggaagttctggccagggcaatcaggcag
gagaaggacataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgca
gatgacatgattgtatatctagaaaaccctgttgactcagcccaaaatctccttaagctg
ataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattc
ttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaatt
gcttcaaagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttc
aaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaac
attccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggta
atttatagattcaatgccatccccatcaagttatcaatgactttcttcacagaattggaa
aaaactactttaaagttcatatggcaccaaaaaagagtttgcatcaccaagtcaatccta
agccaaaagaacaaagctggaggcatcatgctacctgacttcaaactatactacaaggct
acagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacaga
acagagccctcagaaataacgctgcatatctacaactatctgatctttgacaaacctgag
aaaaacaagcaatggggaaaggattccctatttaataaatggtgctggggaaagtggcta
gccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattca
agatggattaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaaccta
ggcattaccattcaggacataggcatgggcaaggacttcatgactaaaacaccaaaagca
atggcaacaaaagccaaaattgacaaatgggatctcattaaactaaagagcttctgcaca
gcaaaagaaactaccgtcagagtgaacaggcaacctacaaaatgggagaaaattttcaca
acctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttac
aagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctca
aaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatcactggccatc
agagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatc
attaaaaagtcaggaaacaacagctactctggtggggacttggagaacctttgtgtccac
actctgtatctagctaatctagtgggcatgtggagaacttttgtgtctagctcagggatt
gtaaacgcaccaatcagcaccctgtcaaaacggaccagtcagctctctgtaaaatggacc
aatcagcaggatgtggctgtttgggtccacactgcctttatgagctgtaacactcatgcg
aaggtctgcagcttcgctcccgaagccagccagaccacgaacccaccaggaggaacaaac
aactccagacgcgctgccttaagagctctaatactcaccgcgaagatctgcagcttcact
cctgagccagcgagaccacgaacccaccagaaggaagaaaacgccgaacacatccgaaca
tcagaaggaacaaactccggacacgccacctttaagaactgtaacactcaccgcgagggt
ccgcggcttcattcttga

>gi568815587f:59988766_60195653|GENSCAN_predicted_peptide_3|345_aa
MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLGAIQILNAA
MILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAGIKPTRTWIQN
SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNPPTPRQALV
CDVPRLASKCSHCSVPTYGLGEGAMLHFNNNNNNNNNNNNNNNKETTIFTWPEAAAGYDT
KGILDQSQQAYQEACEITKKEMQPTDPIRLGMALNFSVFYYELLNSPEKSHSLVKAAFDE
ALAELDTLSEESYKDRLLMQLLRDNLTLWTLDTQGDETEARGRKS

>gi568815587f:59988766_60195653|GENSCAN_predicted_CDS_3|1038_bp
atggcctcccacgaagttgataatgcagagctggggtcagcctctgcccatggtacccca
ggcagtgaggcgggaccagaagagctgaatacttctgtctaccagcccatagatggatca
ccagattatcagaaagcaaaattacaagttcttggggccatccagatcctgaatgcagca
atgattctggctttgggtgtctttctgggttccttgcaatacccataccacttccaaaag
cacttctttttcttcaccttctacacaggctacccgatttggggtgctgtgtttttctgt
agttcaggaaccttgtctgttgtagcagggataaaacccacaagaacatggatacagaac
agttttggaatgaacattgccagtgctacaattgcactagtggggactgcttttctctca
ctaaatatagcagttaatatccagtcattaaggagttgtcactcttcatcagagtcaccg
gacctatgcaattacatgggctccatatcaaatccccccaccccacgacaggccctggtg
tgtgatgttccccgccttgcgtccaagtgttctcattgttcagttcccacctatggcctg
ggggagggagcaatgctccatttcaacaacaacaacaacaacaacaacaacaacaacaac
aacaacaacaaagagactactatttttacttggcctgaggctgctgctggttatgacaca
aaagggatcctagatcagtcacaacaagcataccaagaagcttgtgaaatcaccaaaaag
gaaatgcaaccaacagatcctatcagattgggtatggctctaaacttctctgtcttctat
tatgagcttctgaactccccagagaaatctcattcacttgtaaaggcagcttttgatgaa
gcccttgctgaacttgatacattaagtgaagagtcatacaaagatcgcctgctaatgcag
ttactgagagacaacttgacactgtggacattggatactcaaggagatgaaactgaagca
agaggaagaaaaagttaa

>gi568815587f:59988766_60195653|GENSCAN_predicted_peptide_4|180_aa
MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF
LGVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTEIVVM
MLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSPPIDL

>gi568815587f:59988766_60195653|GENSCAN_predicted_CDS_4|543_bp
atggacacagaaagtaataggagagcaaatcttgctctcccacaggagccttccagtgtg
cctgcatttgaagtcttggaaatatctccccaggaagtatcttcaggcagactattgaag
tcggcctcatccccaccactgcatacatggctgacagttttgaaaaaagagcaggagttc
ctgggggtgagaggaagcctgggagcaaacactgccagcagcatagctgggggaacggga
attaccatcctgatcatcaacctgaagaagagcttggcctatatccacatccacagttgc
cagaaattttttgagaccaagtgctttatggcttccttttccactgaaattgtagtgatg
atgctgtttctcaccattctgggacttggtagtgctgtgtcactcacaatctgtggagct
ggggaagaactcaaaggaaacaaggttccagaggatcgtgtttatgaagaattaaacata
tattcagctacttacagtgagttggaagacccaggggaaatgtctcctcccattgattta
taa

>gi568815587f:59988766_60195653|GENSCAN_predicted_peptide_5|140_aa
MEIQVIKASIIGQYCIARERKEFTHPVGRLSCLRQKLYNGTTETVTSWSSNHTERNPFSK
FPKLRTVWTHPESHRDWTAPTGLYWICGHRAFAKLPDESAGSCVIGTIKPSFFLLPIRTG
ELLSFPVYASREKKSIAIEN

>gi568815587f:59988766_60195653|GENSCAN_predicted_CDS_5|423_bp
atggaaattcaagtcataaaagcctcaattattggacaatattgcatagctagagaaaga
aaagaattcactcaccccgtaggacgacttagttgtctaagacagaaactgtataatggt
accacagaaacagtcacttcgtggagttcaaatcacacagagagaaatccatttagtaaa
ttcccaaagttgcgaaccgtttggacccatccagagtcccaccgggactggacagccccc
actggattatactggatatgtgggcatagagcttttgccaaattacctgacgagtcggca
ggtagttgtgttattggcactattaaaccatctttcttcttactgcccataaggacaggt
gaactcctgagcttccctgtctatgcttcccgcgaaaagaaaagcatagctatagaaaat
tga

>gi568815587f:59988766_60195653|GENSCAN_predicted_peptide_6|181_aa
MKENLLGELTYMITRAACKLRSQEASPGPKISEVGKLIMRFQSLAEGLRAPGKSLVLSLR
IQKLKNLESDVRGQEVSNKRMGPPSCRKTNMAPTDSTLRVFEQPSPLAMQPGKKPVSGIL
HSSVAQSGSEETQHIDFCDQTCGDFSPPKRQAIISGVNTSWLSFNSVETSSKLEIISDSM
V

>gi568815587f:59988766_60195653|GENSCAN_predicted_CDS_6|546_bp
atgaaggaaaatttattaggagaattgacttacatgatcacaagggcagcctgcaagctg
aggagccaggaagccagtccgggtcccaaaatctcagaagtagggaagctgataatgcga
tttcagtctttggccgaaggcctgcgagcccctggcaaatcactagtgttaagtctaaga
atccaaaaactgaagaacttggagtctgatgttcgagggcaggaagtatccaacaaaaga
atgggaccacctagttgcaggaaaacaaacatggctcccactgattctacattacgggtc
tttgagcagcccagtcccctagccatgcagccaggcaagaaacctgtgtctggcatactc
cattcatccgtcgctcagtcagggtctgaggagacacaacatatagacttctgcgaccaa
acgtgtggggatttctccccaccaaaaaggcaagcaatcatttctggagtgaacaccagc
tggctgtccttcaattctgttgagacatcatctaaactggagataatatcagattccatg
gtttga

>gi568815587f:59988766_60195653|GENSCAN_predicted_peptide_7|168_aa
MVLGLQVHRSQELRLYRNAWMSRQRGVAGLEPSWRTSARAMWKGNVGYEPPHRVPTGALP
NGAVRRRPPSSRLQNGRCTDSLHCMPGKAADTQRQPMKAARRVAIHCKATEAELLKAMGA
HFLHQHDLDVRHRVKGDHFGTLRFNVYAVGFQTCMRPAALLFWTVSPM

>gi568815587f:59988766_60195653|GENSCAN_predicted_CDS_7|507_bp
atggtgttgggtctgcaagtgcacagaagtcaagaattgagattgtatagaaatgcctgg
atgtccaggcagaggggtgttgcagggctggagccatcatggagaacctctgctagggca
atgtggaagggaaatgtggggtatgagcccccacacagagtccccactggagcactgcct
aatggagctgtgagaagacggccaccatcctccagactccagaatggtagatgcactgac
agcttgcactgtatgcctggaaaagctgcagacactcaacgccagcccatgaaagcagcc
aggagggtggccatacactgcaaagccacagaggcagagcttctcaaggccatgggagcc
cacttcttgcatcagcatgacctagatgtgagacatagagtcaaaggagatcattttgga
actttaaggtttaatgtctatgctgttggatttcagacttgcatgaggcctgcagccctt
ttgttttggacagtttctcccatgtag

>gi568815587f:59988766_60195653|GENSCAN_predicted_peptide_8|779_aa
XGLLCTALMRHSTGAIAYLGVLSGSASLKLAGVPLRCCEGDKDAGHPLETQTALCERGRG
ARSLVGNTIMTSQPVPNETIIVLPSNVINFSQAEKPEPTNQGQDSLKKHLHAEIKVIGVN
LIQNVLERGWGKCQEMIYVLGLDICHYPDLVWHDGIELGDHFGICFLLSKFYPSDFYTVE
LCLPIHRTLFFYHLWLSINRHREKVNQAFGVDVGDYVRECHTMQLHRDSIWKLWLQKSPK
ASKEVHSSLVGSILSALSALVGFIILSVKQATLNPASLQCELDKNNIPTRSYVSYFYHDS
LYTTDCYTAKASLAVSHVAPACVVVPDGSCAARMGLGGNMGGGGQVSCEHGEGTEVPQYL
DIRGRCSQGKDHKLSLERDHLTTKPVGGLDLALTATEGFEAFSEKAAPGPREKMAILPKV
IDRVDAIPIKLPMPFFTELEKTTLKFIWNQKRARIAKSILSQKNKPGGIMLPDFKLYYKA
TVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWL
AICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGITIQGIGMGKDFMSKTPKA
MATEAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRIYNELKQIC
KKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKKCSPSLAIREMQIKTTMRYHLTPVRMAI
IKKSGNNRKQEEKKEASTYFKCLIQKLSYEEYLLMDKHSFVFLFYKSCEYSDVDGNMFA

>gi568815587f:59988766_60195653|GENSCAN_predicted_CDS_8|2340_bp
nngggcctgctgtgcactgctctgatgaggcattccactggggcaattgcctacctggga
gtgctctcaggatctgcttcactcaagctggctggagtccccctcagatgctgtgagggt
gacaaagatgcagggcacccactggaaacacagacggcactctgcgaaagaggaaggggc
gccaggagcttggttggcaacaccatcatgacatcacaacctgttcccaatgagaccatc
atagtgctcccatcaaatgtcatcaacttctcccaagcagagaaacccgaacccaccaac
caggggcaggatagcctgaagaaacatctacacgcagaaatcaaagttattggggtaaat
ctaattcagaacgtgttggagaggggttgggggaagtgccaagagatgatatatgtcttg
ggactggacatctgtcactatccagatcttgtgtggcatgatggtattgagcttggggat
cattttggcatctgcttccttctctccaaattttacccaagtgacttctacactgttgaa
ctctgcttacccattcataggaccctttttttttatcatctctggctctctatcaatcgc
cacagagaaaaggttaaccaagcttttggggttgatgttggagattatgtgagagaatgt
cataccatgcaattgcaccgagactcaatttggaagctctggctacaaaaatctcccaaa
gccagcaaggaagtgcatagcagcctggttggaagcattctgagtgctctgtctgccctg
gtgggtttcattatcctgtctgtcaaacaggccaccttaaatcctgcctcactgcagtgt
gagttggacaaaaataatataccaacaagaagttatgtttcttacttttatcatgattca
ctttataccacggactgctatacagccaaagccagtctggctgtcagccatgtggcccca
gcatgtgttgtggtacctgatggtagctgtgctgctaggatgggccttggtggaaacatg
ggtggtggtggacaggtctcctgtgagcatggagaaggcactgaagtacctcaatacctg
gacatcagaggcagatgcagtcaagggaaggatcataaactttctttggaaagagaccat
cttactaccaagccagtcgggggcttggatcttgccctaacagcaactgaaggatttgag
gctttttctgaaaaggcagccccaggtcctagggagaaaatggccatactgcccaaggta
attgatagagtcgatgccatccccatcaagctaccaatgcctttcttcacagaactggaa
aaaactactttaaagttcatatggaaccaaaaaagagcccgaattgccaagtcaatccta
agccaaaagaacaaacctggaggcatcatgctacctgacttcaaactatactacaaggct
acagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacaga
acggagccctcagaaataacgccgcatatctacaactatctgatctttgacaaacctgag
aaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggcta
gccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattca
agatggattaaagacttaaacattagacctaaaaccataaaaaccttagaagaaaaccta
ggcattaccattcagggcataggcatgggcaaggacttcatgtctaaaacaccaaaagca
atggcaacagaagccaaaattgacaaatgggatctcattaaactaaagagcttctgcaca
gcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgggagaaaattttcacg
acctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttgc
aagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggacatgaacagacacttctca
aaagaagacatttatgcagccaaaagacacatgaaaaaatgctcaccatcactggccatc
agagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttagaatggcgatc
attaaaaagtcaggaaacaacaggaaacaggaagagaagaaagaagcatccacttatttt
aagtgtttaattcagaaattgtcctatgaagaatatctcctgatggataaacattctttt
gtcttccttttctacaaatcctgtgagtactcggatgtagatggaaacatgtttgcctga