GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:24:05

Sequence gi568815593f:140591649_140798549 : 206901 bp : 43.27% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -    104     22   83  2  2   40   70    90 0.353   0.84
 1.03 Intr -    453    376   78  0  0   84   94    39 0.577   3.85
 1.02 Intr -   2716   2597  120  1  0   99  109    38 0.687   7.69
 1.01 Init -  28454  28371   84  2  0   59  116    41 0.275   4.82
 1.00 Prom -  32431  32392   40                              -4.56

 2.02 PlyA -  32771  32766    6                               1.05
 2.01 Sngl -  41365  40208 1158  1  0   36   44  1302 0.936 114.82
 2.00 Prom -  41893  41854   40                              -7.26

 3.00 Prom +  45787  45826   40                              -4.96
 3.01 Init +  47880  47964   85  2  1   82   64   114 0.996   9.42
 3.02 Intr +  48091  48203  113  2  2   77   78   109 0.931   8.90
 3.03 Term +  48656  48712   57  1  0   91   39   116 0.965   4.69
 3.04 PlyA +  49617  49622    6                              -0.45

 4.00 Prom +  49735  49774   40                             -10.35
 4.01 Init +  50235  50405  171  2  0   91   66   241 0.974  19.64
 4.02 Intr +  50619  50771  153  2  0   75   62    88 0.705   5.17
 4.03 Intr +  51215  51393  179  1  2   49   92    99 0.441   5.12
 4.04 Intr +  51916  52027  112  1  1   46   82    75 0.694   3.08
 4.05 Intr +  52132  52318  187  0  1  130   80    83 0.998  10.96
 4.06 Intr +  52902  53092  191  1  2   99   65   104 0.642   8.50
 4.07 Term +  53337  53450  114  2  0  103   39    97 0.987   4.87
 4.08 PlyA +  53733  53738    6                               1.05

 5.04 PlyA -  53746  53741    6                               1.05
 5.03 Term -  54030  53939   92  1  2   91   48    50 0.819  -0.82
 5.02 Intr -  55714  55575  140  0  2   69   70   279 0.570  24.31
 5.01 Init -  55935  55835  101  0  2   86   94   128 0.997  12.96
 5.00 Prom -  58326  58287   40                              -4.76

 6.00 Prom +  59417  59456   40                              -7.66
 6.01 Init +  60441  60499   59  2  2   42   67    94 0.154   3.48
 6.02 Intr +  61329  61496  168  0  0   78   69   197 0.999  15.86
 6.03 Intr +  62290  62404  115  1  1   82  113    51 0.836   7.45
 6.04 Intr +  62868  62938   71  2  2   83  108   112 0.999  10.68
 6.05 Intr +  63033  63079   47  0  2   83   92    17 0.919  -0.35
 6.06 Intr +  64181  64344  164  0  2   92  105   223 0.994  24.09
 6.07 Intr +  65906  66014  109  1  1   97   77    99 0.965   9.56
 6.08 Intr +  67089  67128   40  1  1   90   47    36 0.963  -3.02
 6.09 Intr +  67291  67516  226  1  1   47   78   333 0.670  26.19
 6.10 Intr +  67667  67685   19  1  1  106  113    36 0.756   4.08
 6.11 Intr +  68108  68159   52  0  1   91    5    78 0.845  -2.23
 6.12 Intr +  69110  69167   58  2  1   81   91    98 0.983   8.29
 6.13 Intr +  69972  70060   89  2  2   74   89   106 0.999   8.07
 6.14 Intr +  70251  70359  109  0  1   98   92   130 0.990  14.69
 6.15 Intr +  70531  70565   35  0  2   87   80    63 0.887   2.42
 6.16 Intr +  73207  73455  249  1  0   51   30   316 0.507  18.65
 6.17 Intr +  76586  76686  101  2  2   95   70    59 0.957   4.55
 6.18 Intr +  76767  76854   88  1  1   65   82    79 0.985   4.03
 6.19 Intr +  76964  77143  180  2  0   58   87   235 0.979  19.38
 6.20 Intr +  77243  77357  115  2  1   98   44    91 0.907   6.15
 6.21 Intr +  77431  77600  170  0  2   88   94   217 0.999  21.04
 6.22 Term +  77685  78006  322  0  1   92   49   463 0.999  37.09
 6.23 PlyA +  78381  78386    6                              -0.45

 7.18 PlyA -  78399  78394    6                              -0.45
 7.17 Term -  79340  79336    5  0  2   96   39     0 0.452  -7.03
 7.16 Intr -  79625  79548   78  0  0  128   81    11 0.741   3.92
 7.15 Intr -  80102  79710  393  0  0  106   70   466 0.431  41.23
 7.14 Intr -  81258  80797  462  1  0  101  102   970 0.619  92.83
 7.13 Intr -  81740  81623  118  2  1  101   96   130 0.998  15.14
 7.12 Intr -  83177  83031  147  2  0  129   78   232 0.802  26.73
 7.11 Intr -  83485  83369  117  1  0   94   86   121 0.998  13.16
 7.10 Intr -  84594  84541   54  0  0   63   89    33 0.487   0.08
 7.09 Intr -  85248  85006  243  0  0  118   92   224 0.841  23.49
 7.08 Intr -  85468  85341  128  2  2  104   87   172 0.999  19.10
 7.07 Intr -  85772  85679   94  2  1   84   77   106 0.999   8.64
 7.06 Intr -  86105  86007   99  2  0  119   87   123 0.985  15.61
 7.05 Intr -  86367  86260  108  0  0  101   77   178 0.996  18.48
 7.04 Intr -  87479  87354  126  2  0   72   56   211 0.978  17.18
 7.03 Intr -  91571  91452  120  2  0   23  100   159 0.836  11.29
 7.02 Intr -  99296  99207   90  2  0  116   85    76 0.937  10.29
 7.01 Init -  99656  99567   90  2  0   87   85   154 0.999  15.57
 7.00 Prom - 100275 100236   40                              -7.16

 8.00 Prom + 101388 101427   40                              -9.85
 8.01 Init + 102266 102406  141  1  0   68  116    77 0.034   8.63
 8.02 Intr + 104090 104197  108  1  0  100  100    88 0.999  11.68
 8.03 Intr + 104455 104553   99  0  0  -18   87   138 0.913   3.41
 8.04 Intr + 104873 104966   94  1  1   39   95    88 0.939   4.14
 8.05 Intr + 105295 105422  128  2  2   71  121    58 0.956   7.80
 8.06 Intr + 105516 105758  243  2  0   96  100   240 0.997  23.69
 8.07 Intr + 105921 106037  117  2  0   77   86    79 0.993   7.26
 8.08 Intr + 106284 106430  147  2  0   95  105   139 0.771  16.73
 8.09 Intr + 109171 109264   94  0  1  109   64   157 0.995  14.94
 8.10 Intr + 110358 110481  124  1  1   74   96    96 0.999   8.64
 8.11 Intr + 112270 112343   74  1  2   77   83    92 0.999   6.65
 8.12 Intr + 112778 112923  146  0  2   71   94   219 0.999  20.80
 8.13 Term + 113965 114108  144  0  0   99   54   243 0.999  19.91
 8.14 PlyA + 114362 114367    6                               1.05

 9.04 PlyA - 115101 115096    6                               1.05
 9.03 Term - 119333 119271   63  2  0   97   39    66 0.489   0.49
 9.02 Intr - 161316 161151  166  2  1   52   52   106 0.378   3.36
 9.01 Init - 168482 167539  944  2  2  102   53   271 0.152  18.12
 9.00 Prom - 170384 170345   40                              -4.56

10.02 PlyA - 170415 170410    6                               1.05
10.01 Sngl - 184103 183429  675  2  0   39   48   206 0.660   7.89
10.00 Prom - 192442 192403   40                              -7.36

11.00 Prom + 193494 193533   40                              -5.26
11.01 Init + 194643 197036 2394  2  0   51  101  3274 0.205 310.55
11.02 Intr + 201930 201972   43  2  1   99   55    12 0.017  -3.19
11.03 Intr + 203384 205704 2321  0  2  121  113  3035 0.033 296.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 9640 9751 112 0 1 65 116 50 0.841 5.88 S.002 Term + 11536 11546 11 2 2 131 47 3 0.830 -1.14 S.003 Init + 100001 100108 108 1 0 86 99 126 0.946 11.72 S.004 Intr + 102287 102406 120 1 0 91 116 57 0.954 9.49 S.005 Term + 102537 102644 108 2 0 72 48 111 0.913 4.01 S.006 Init + 103884 103985 102 2 0 74 75 76 0.842 5.14 S.007 Init - 136160 136158 3 2 0 108 81 0 0.883 1.30 S.008 Init + 141854 141919 66 1 0 61 116 27 0.972 3.87 S.009 Init + 203317 205704 2388 0 0 63 113 3129 0.959 298.45 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_1|122_aa MNKAAVNTHEQVFNSFDKYQGMWLLDLKEAEQELLARVQSTLGSLGRGYSVALLLRGRET EAPRLVPQLLQMLFEEALPLSCSDPVLSTLSLVQFSPSGRTQDLLSPGVENLSVLDVSPL GF >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_1|366_bp atgaataaagctgctgtaaacacccatgagcaagttttcaactcctttgataaataccaa ggaatgtggttgctggatcttaaggaggcagagcaggagctgctggcacgtgtccaatcc acactgggctctctgggccgagggtacagtgtggccctgttgctccgggggagggaaaca gaggcaccccgacttgtgcctcagctgctgcagatgctgtttgaagaagctctgcccctc agctgctctgatcctgtgcttagcactcttagcctggtgcagttcagccccagtggaagg acccaggacctgctctctccaggggtggagaacctgtcggtgctggacgtgtcccctctg ggcttn >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_2|385_aa MGPPRPLLSPQERASCLLLLLLPLVHVSATTPEPCELDDEDFRCVCNFSEPQPDWSEAFQ CVSAVEVEIHAGGLNLEPFLKRVDADADPRQYADTVKALRVRRLTVGAAQVPAQLLVGAL RVLAYSRLKELTLEDLKITGTMPPLPLEATGLALSSLRLRNVSWATGRSWLAELQQWLKP GLKVLSIAQAHSPAFSCEQVRAFPALTSLDLSDNPGLGERGLMAALCPHKFPAIQNLALR NTGMETPTGVCAALAAAGVQPHSLDLSHNSLRATVNPSAPRCMWSSALNSLNLSFAGLEQ VPKGLPAKLRVLDLSCNRLNRAPQPDELPEVDNLTLDGNPFLVPGTALPHEGSMNSGVVP ACARSTLSVGVSGTLVLLQGARGFA >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_2|1158_bp atggggcctcctagacctctgctctctccccaggagcgcgcgtcctgcttgttgctgctg ctgctgccgctggtgcacgtctctgcgaccacgccagaaccttgtgagctggacgatgaa gatttccgctgcgtctgcaacttctccgaacctcagcccgactggtccgaagccttccag 
tgtgtgtctgcagtagaggtggagatccatgccggcggtctcaacctagagccgtttcta aagcgcgtcgatgcggacgccgacccgcggcagtatgctgacacggtcaaggctctccgc gtgcggcggctcacagtgggagccgcacaggttcctgctcagctactggtaggcgccctg cgtgtgctagcgtactcccgcctcaaggaactgacgctcgaggacctaaagataaccggc accatgcctccgctgcctctggaagccacaggacttgcactttccagcttgcgcctacgc aacgtgtcgtgggcgacagggcgttcttggctcgccgagctgcagcagtggctcaagcca ggcctcaaggtactgagcattgcccaagcacactcgcctgccttttcctgcgaacaggtt cgcgccttcccggcccttaccagcctagacctgtctgacaatcctggactgggcgaacgc ggactgatggcggctctctgtccccacaagttcccggccatccagaatctagcgctgcgc aacacaggaatggagacgcccacaggcgtgtgcgccgcactggcggcggcaggtgtgcag ccccacagcctagacctcagccacaactcgctgcgcgccaccgtaaaccctagcgctccg agatgcatgtggtccagcgccctgaactccctcaatctgtcgttcgctgggctggaacag gtgcctaaaggactgccagccaagctcagagtgctcgatctcagctgcaacagactgaac agggcgccgcagcctgacgagctgcccgaggtggataacctgacactggacgggaatccc ttcctggtccctggaactgccctcccccacgagggctcaatgaactccggcgtggtccca gcctgtgcacgttcgaccctgtcggtgggggtgtcgggaaccctggtgctgctccaaggg gcccggggctttgcctaa >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_3|84_aa MWSRRQGRLRPTVCGVEELRRRRREREAALRKARREQQLVSKRLLRNDAPEEAGEGCVAA ILGETEFLQRYIGFVSQALGIRYV >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_3|255_bp atgtggagccgacggcagggccgcctcaggcccacggtctgcggggtggaggagctacgg cgccgccggcgggagcgggaggcagcactgcggaaggcgcggagggagcagcagctggtc agcaagaggctgctgagaaacgacgccccagaggaagctggagagggatgtgtggctgcg atcctcggggaaaccgagttcctgcaacgatatatcggatttgtgtctcaagcccttggc attcgctacgtctaa >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_4|368_aa MRTLVGLLTSNQALLQLEAARCLHELSHSEQSTVAEACLPATSYLLTYLSSHSSDFIPGM KQAKPSAFVACSQELCLYTLGNLIVESEAVRRQLLPQGIVPALAACIQGLVRTTGIFVVP TYSPASASSSILASTLPQHMLQMLQPGPKLNPGVAVEFAWCLHYIICSQVSNPLLIGHGA LSTLGLLLLDLAGAVQKTEDAGLELLACPVLRCLSNLLTEAAVETVGGQMQLRDERVVAA LFILLQFFFQKQPSLLPEGLWLLNNLTDYICVYLQVLTVLCNVAEKGPAYCQRLWPGPLL PALLHTLAFSDTEVVGQSLELLHLLFLYQPEAVQVFLQQSGLQALERHQEEAQLQDRVYA LQQTALQG >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_4|1107_bp 
atgcggaccctggtcgggctcctgaccagcaaccaggccctgctgcagcttgaggcggct cggtgcctgcatgagctctctcactccgagcagtccactgttgctgaggcctgcctgcca gccacttcttacctcctcacctacctctccagtcacagctcagacttcatacctggcatg aaacaggccaagcccagtgcttttgttgcctgttctcaggagctgtgtctgtatacactg ggtaacctgatcgtggagagtgaggctgtgagaaggcagctcctgccacagggcattgtt ccagccttggctgcctgcatccagggtttggtaagaaccactggcatcttcgtggttcct acttacagccctgcttctgccagcagctccatcttggcctccactctccctcagcacatg ctacaaatgttgcaacctggcccaaagctcaaccctggggtcgctgtggagtttgcctgg tgccttcattacatcatctgcagccaggtcagcaatcctctgctcattggccatggggct ctgtctactctggggttgctgctgttggacttggctggggctgtccagaaaaccgaggat gcaggactggagctgctggcatgccccgtgcttcgatgtctaagcaacctgctaactgag gcagcagtggagactgtgggagggcaaatgcagctcagagatgagcgtgttgtggcagcc ttatttatccttctgcagttctttttccagaaacagcccagtctgctccctgagggcctt tggctcctcaacaacctcactgactatatatgtgtctatctgcaggtgctcacagttctg tgcaatgttgcagaaaagggtcctgcttactgccagcggctgtggccagggcccctgctt cccgccttgctgcacacactagccttttctgacactgaagtagtaggccagagtttggag ctgctgcatctgctgttcctgtatcagccagaggctgttcaggtcttcctgcagcagtca gggctgcaagccctggaaaggcatcaggaagaggcccagctccaggatcgtgtgtatgct ctccagcagacagctcttcaagggtga >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_5|110_aa MAAAAASRGVGAKLGLREIRIHLCQRSPGSQGVRDFIEKRYVELKKANPDLPILIRECSD VQPKLWARYGECGTPGVWGSAFGQETNVPLNNFSADQVTRALENVLSGKA >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_5|333_bp atggcggcggccgcagcaagtcgaggagtcggggcaaagctgggcctgcgtgagattcgc atccacttatgtcagcgctcgcccggcagccagggcgtcagggacttcattgagaaacgc tacgtggagctgaagaaggcgaatcccgacctacccatcctaatccgcgaatgctccgat gtgcagcccaagctctgggcccgctacggtgagtgcgggacgccaggggtctggggctcc gcatttggccaagagacgaatgtccctttgaacaacttcagtgctgatcaggtaaccaga gccctggagaacgttctaagtggtaaagcctga >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_6|861_aa MPREYNEDEDPAARRRKKKSYYAKLRQQEIERERELAEKYRDRAKERRDGVNKDYEETEL ISTTANYRAVGPTAEADKSAAEKRRQLIQESKFLGGDMEHTHLVKGLDFALLQKVRAEIA SKEKEEEELMEKPQKETKKDEDPENKIEFKTRLGRNVYRMLFKSKAYERNELFLPGRMAY 
VVDLDDEYADTDIPTTLIRSKADCPTMEAQTTLTTNDIVISKLTQILSYLRQGTRNKKLK KKDKGKLEEKKPPEADMNIFEDIGDYVPSTTKTPRDKERERYRERERDRERDRDRDRERE RERDRERERERDREREEEKKRHSYFEKPKVDDEPMDVDKGPGSTKELIKSINEKFAGMDD MAVDSDEEVDYSKMDQGNKKGPLGRWDFDTQEEYSEYMNNKEALPKAAFQYGIKMSEGRK TRRFKETNDKAELDRQWKKISAIIEKRKKMEADGVGGSEADYPAAARLAVLLSMDRTCEE RPAEDGSDEEDPDSMEAPTRIRDTPEDIVLEAPASGLAFHPARDLLAAGDVDGDVFVFSY SCQEGETKELWSSGHHLKACRAVAFSEDGQKLITVSKDKAIHVLDVEQGQLERRVSKAHG APINSLLLVDENVLATGDDTGGICLWDQRKEGPLMDMRQHEEYIADMALDPAKKLLLTAS GDGCLGIFNIKRRRFELLSEPQSGDLTSVTLMKVQLVMWGKKVACGSSEGTIYLFNWNGF GATSDRFALRAESIDCMVPVTESLLCTGSTDGVIRAVNILPNRVVGSVGQHTGEPVEELA LSHCGRFLASSGHDQRLKFWDMAQLRAVVVDDYRRRKKKGGPLRALSSKTWSTDDFFAGL REEGEDSMAQEEKEETGDDSD >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_6|2586_bp atgccaagggagtacaatgaggatgaagacccagctgcacgaaggaggaaaaagaaaagt tattatgccaagctacgccaacaagaaattgagagagagagagagctagcagagaagtac cgggatcgtgccaaggaacggagagatggagtgaacaaagattatgaagaaaccgagctt atcagcaccacagctaactatagggctgttggccccactgctgaggcggacaaatcagct gcagagaagagaagacagttgatccaggagtccaaattcttgggtggtgacatggaacac acccatttggtgaaaggcttggattttgctctgcttcaaaaggtacgagctgagattgcc agcaaagagaaagaggaagaggaactgatggaaaagccccagaaagaaaccaagaaagat gaggatcctgaaaataaaattgaatttaaaacacgtctgggccgcaatgtttaccgaatg ctttttaagagcaaagcatatgagcggaatgagttgttcctgccgggccgcatggcctat gtggtagacctggatgatgagtatgctgacacagatatccccaccactcttatccgcagc aaggctgattgccccaccatggaggcccagaccacactgaccacaaatgacattgtcatt agcaagctgacccagatcctttcatacctgaggcagggaacccgtaacaagaagcttaag aagaaggataaagggaagctggaagagaagaaacctcctgaggctgacatgaatattttt gaagacattggggattacgtaccctccacaaccaagacacctcgggacaaggagcgggag agatatcgggaacgggagcgtgatcgggaaagagacagagaccgtgaccgagagcgagag cgagaacgagatcgggaacgagagcgagagcgggaccgagagagagaagaggaaaagaag agacacagctactttgagaagccaaaagtagatgatgagcccatggacgttgacaaagga cctgggtctaccaaggagttgatcaagtccatcaatgaaaagtttgctgggatggatgac atggctgtggatagtgatgaggaggtggattatagcaaaatggaccagggtaacaagaag gggcccttaggccgttgggactttgatacccaggaagaatacagcgagtatatgaacaac 
aaagaagctttgcccaaggctgcattccagtatggtatcaaaatgtctgaagggcggaaa accaggcgcttcaaggaaaccaatgacaaagcagagcttgatcgccagtggaagaagatt agtgcaatcattgagaagaggaagaagatggaagctgatggtgtgggtggttccgaggct gactaccctgcggcggcgcggctcgcagtccttctcagcatggaccgcacttgtgaggag aggcccgctgaggatgggagcgacgaggaggacccagactccatggaagccccaacccgg atccgggacactccggaagacatcgtgctggaagctccggctagtgggctggcgttccat ccggcccgtgacctactggctgcaggggacgtggacggggacgtgttcgtcttttcctac tcttgccaagagggagaaaccaaggagctctggtcatcaggtcaccatctcaaggcctgc cgagctgtggccttctctgaagatgggcagaagctcattactgtctccaaggacaaagcc atccatgttctagatgtggagcagggccaactggaaagacgtgtttccaaggctcatggt gcccccatcaatagtcttctgctggtggatgagaatgttctggccactggggatgacaca ggtggtatctgtctctgggaccagcggaaggagggccccttaatggatatgaggcaacat gaagagtacatcgcagacatggctctggatccagccaaaaagctgctgctgacagccagc ggggatggctgccttggcatcttcaacattaagaggcgtcggtttgagctgctctcagaa cctcagtctggggacctgacctctgtcactctcatgaaagtacagctggttatgtggggg aagaaggtagcctgtggctccagtgaaggtaccatctacctcttcaattggaatggcttt ggggccacaagtgaccgctttgccctgagagctgaatctatcgactgcatggttccagtc accgagagtctgctgtgtactggctccactgatggagtcatcagggctgtgaacatccta ccgaaccgagtggtgggcagtgtgggccagcacactggggagcctgtggaggagctggcc ctctcccactgtggccgcttcctggccagtagtggccatgaccagcgcctcaagttttgg gacatggcccagctgcgagctgtggtggtggatgactaccgtcggcgcaaaaaaaaggga ggaccactgcgggctctgagcagcaagacttggagcaccgatgacttcttcgcaggactg agggaagagggagaagactccatggctcaggaagaaaaggaggagactggggatgacagt gactga >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_7|823_aa MAERAALEELVKLQGERVRGLKQQKASAELIEEEVAKLLKLKAQLGPDESKQKFVLKTPK GTRDYSPRQMAVREKVFDVIIRCFKRHGAEVIDTPVFELKVPFARYLAMNKLTNIKRYHI AKVYRRDNPAMTRGRYREFYQCDFDIAGNFDPMIPDAECLKIMCEILSSLQIGDFLVKVN DRRILDGMFAICGVSDSKFRTICSSVDKLDKVSWEEVKNEMVGEKGLAPEVADRIGDYVQ QHGGVSLVEQLLQDPKLSQNKQALEGLGDLKLLFEYLTLFGIDDKISFDLSLARGLDYYT GVIYEAVLLQTPAQAGEEPLGVGSVAAGGRYDGLVGMFDPKGRKVPCVGLSIGVERIFSI VEQRLEMRTLKHYRSNNLLKAMKLALEEKIRTTETQVLVASAQKKLLEERLKLVSELWDA GIKAELLYKKNPKLLNQLQYCEEAGIPLVAIIGEQELKDGVIKLRSVTSREELWCERVNP 
ENKAALEAWVRETGIRLVQVNGQRKYGGPPPGWVGSPPPAGSEVFIGRLPQDVYEHQLIP LFQRVGRLYEFRLMMTFSGLNRGFAYARYSSRRGAQAAIATLHNHPLRPSCPLLVCRSTE KCELSVDGLPPNLTRSALLLALQPLGPGLQEARLLPSPGPAPGQIALLKFSSHRAAAMAK KALVEGQSHLCGEQVAVEWLKPDLKQRLRQQLVGPFLRSPQPEGSQLALARDKLGFQGAR ATLQLLCQRMKLGSPVFLTKCLGIGPAGWHRFWYQVVIPGHPVPFSGLIWVVLTLDGRDG HEVAKDAVSVRLLQALSTLNGHAEPVSGPIPAGLGGHFLTPKA >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_7|2472_bp atggcagagcgtgcggcgctggaggagctggtgaaacttcagggagagcgcgtgcgaggc ctcaagcagcagaaggccagcgccgagctgatcgaggaggaggtggcgaaactcctgaaa ctgaaggcacagctgggtcctgatgaaagcaaacagaaatttgtgctcaaaacccccaag ggcacaagagactatagtccccggcagatggcagttcgcgagaaggtgtttgacgtaatc atccgttgcttcaagcgccacggtgcagaagtcattgatacacctgtatttgaactaaag gttccttttgctcggtatttggcaatgaataaactgaccaacattaaacgctaccacata gcaaaggtatatcggcgggataacccagccatgacccgtggccgataccgggaattctac cagtgtgattttgacattgctgggaactttgatcccatgatccctgatgcagagtgcctg aagatcatgtgcgagatcctgagttcacttcagataggcgacttcctggtcaaggtaaac gatcgacgcattctagatgggatgtttgctatctgtggtgtttctgacagcaagttccgt accatctgctcctcagtagacaagctggacaaggtgtcctgggaagaggtgaagaatgag atggtgggagagaagggccttgcacctgaggtggctgaccgcattggggactatgtccag caacatggtggggtatccctggtggaacagctgctccaggatcctaaactatcccaaaac aagcaggccttggagggcctgggagacctgaagttgctctttgagtacctgaccctattt ggcattgatgacaaaatctcctttgacctgagccttgctcgagggctggattactacact ggggtgatctatgaggcagtgctgctacagaccccagcccaggcaggggaagagcccctg ggtgtgggcagtgtggctgctggaggacgctatgatgggctagtgggcatgttcgacccc aaagggcgcaaggtgccatgtgtggggctcagcattggggtggagcggattttctccatc gtggaacagagactagagatgagaacactgaagcattatagatcaaataatttgcttaag gccatgaaacttgctttggaggagaagatacggaccacggagacacaggtgcttgtggca tctgcacagaagaagctgctagaggaaagactaaagcttgtctcagaactgtgggatgct gggatcaaggctgagctgctgtacaagaagaacccaaagctactgaaccagttacagtac tgtgaggaggcaggcatcccactggtggctatcatcggcgagcaggaactcaaggatggg gtcatcaagctccgttcagtgacgagcagggaagagctgtggtgtgagagggtgaatcca gagaacaaggcggcgctggaggcgtgggtcagggagacaggcatccgcctggtgcaggtg 
aacgggcagaggaagtatggcgggccacccccaggctgggtgggcagcccgccgccagct gggtcagaggtgttcatcgggcggctgcctcaggacgtgtacgagcaccagcttatcccg ctgttccagcgcgtgggccgcctctacgagttccgcctgatgatgaccttcagcggcctg aaccgcggcttcgcctatgcccgctacagctcgaggcgcggcgcgcaggccgccatcgcc acgctgcacaaccatccgctgcggccgtcctgcccgctgctcgtgtgccgcagcaccgag aagtgtgagctgagcgttgacggcctgccgccgaatctgacccgcagcgcgctgctgctc gcgctgcagccgctgggtcccggcttgcaggaggcgcggctgctgcccagccccggaccg gcgcccgggcagatcgctctgctcaaattcagctcgcaccgggccgctgccatggccaaa aaggccctggtggaagggcagtcacacctctgtggagagcaggtggctgtggagtggctc aagccagacctgaagcagcgacttcgccagcagcttgtgggtcccttcttgcggtcccca cagccagagggcagccagttggctttggcaagggacaagttagggttccaaggggctcgg gctaccctgcagttgctgtgccaacgaatgaagctgggcagccctgtgttcctcaccaag tgtttgggcataggacctgctggctggcaccgcttctggtaccaggtggtgattcctggg catccggtgcccttcagcggcctcatctgggttgtgctgaccctagatggccgggatggg catgaggtggccaaggatgctgtgtctgtacggctgctgcaggcactcagcaccctgaat gggcatgcagagcctgtgtcaggccccatcccagcaggcctgggtggccactttctgacc cccaaagcttag >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_8|552_aa MAAFGFQGTRDLSPQHMVVREKILDLVISCFKRHGAKGMDTPAFELKDFDIAGQFDPMIP DAECLKIMCEILSGLQLGDFLIKVNDRRIVDGMFAVCGVPESKFRAICSSIDKLDKMAWK DVRHEMVVKKGLAPEVADRIGDYVQCHGGVSLVEQMFQDPRLSQNKQALEGLGDLKLLFE YLTLFGIADKISFDLSLARGLDYYTGVIYEAVLLQTPTQAGEEPLNVGSVAAGGRYDGLV GMFDPKGHKVPCVGLSIGVERIFYIVEQRMKTKGEKVRTTETQVFVATPQKNFLQERLKL IAELWDSGIKAEMLYKNNPKLLTQLHYCESTGIPLVVIIGEQELKEGVIKIRSVASREET KNLDFRRKWDKDEYEKLAEKRLTEEREKKDGKPVQPVKRELLRHRDYKVDLESKLGKTIV ITKTTPQSEMGGYYCNVCDCVVKDSINFLDHINGKKHQRNLGMSMRVERSTLDQVKKRFE VNKKKMEEKQKDYDFEERMKELREEEEKAKAYKKEKQKEKKRRAEEDLTFEEDDEMAAVM GFSGFGSTKKSY >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_8|1659_bp atggctgcttttggtttccagggtaccagggatcttagtcctcagcatatggttgtgagg gagaaaattcttgatttggttatcagctgctttaaacgtcatggagcaaaggggatggac accccagcatttgagctgaaggattttgacattgctggtcagtttgaccctatgatcccc gatgcagagtgtttgaagatcatgtgtgaaatcctaagtggattgcagttgggagacttt ctcattaaggtaaatgaccggcggattgtggatgggatgtttgctgtctgtggtgttcct 
gaaagcaagttccgtgccatctgctcctccatagataaactagacaagatggcttggaaa gatgtgagacatgagatggtggtgaagaaaggcctggctcctgaggtggctgatcgaatt ggggactatgtccagtgtcatggtggggtatccctagtagagcaaatgtttcaggatccc agactatcccagaacaagcaggccctggagggcctgggagacctaaagctgctatttgaa tacctgactttatttggaattgctgataagatctcctttgacctcagcctggctcggggc ctagactactatacaggagtgatctatgaagcagtgctgctgcagaccccaactcaggct ggggaggagcccctgaatgtgggcagtgtggctgctggtgggcgctatgatgggctggtg ggcatgtttgaccccaagggccacaaggtgccatgtgtgggactcagcattggggttgag cgaatcttctacattgtggagcagaggatgaagaccaaaggtgagaaggtgcggactaca gagactcaagtgtttgtggccacaccacagaagaactttctccaagaacggttgaagctt attgcagagctttgggattctggaatcaaggcagagatgctatacaagaacaaccccaaa ctattaacccagctgcactattgtgagagcacaggcattccactggtggtcattattggt gagcaagaactgaaagaaggggtcatcaagatccgttcagtggccagcagagaggagaca aaaaacttggactttcgccgaaagtgggacaaagatgaatatgagaaactcgccgagaag aggctcacggaagagagagaaaagaaagatggaaaaccagtgcagcctgtcaagcgagag cttttacggcatagggactacaaggtggacttggaatccaagcttgggaagacaattgtc attaccaagacaacccctcaatctgagatgggaggatattactgcaatgtctgtgactgt gtggtgaaggactccatcaactttctggatcacattaatggaaagaaacatcagagaaac ctgggcatgtctatgcgtgtggaacgttccaccctggatcaggtgaagaaacgttttgag gtcaacaagaagaagatggaagagaagcagaaggattatgattttgaggaaaggatgaag gagctcagagaagaggaggaaaaggccaaagcgtacaagaaagagaaacagaaggagaag aaaaggagggctgaggaggacttgacatttgaggaggacgatgagatggcagctgtgatg ggcttctctggctttggttccaccaagaagagttactga >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_9|390_aa MATLPKVIYRFNAIPNKLPMTFFTELEKTTFKFIWNQERARIAKTILSQKNKAGGIMLPD FKLYYKATVTKTVWYWYKNRDIDQWNRTEPSEIIPCIYNHLIFDKPDKNKKRGKDSLFNT WCWENWLAICRKLKLDSFLTPYTKINSRWIKDLNVRPKTIKTLEENLGTTIQDIGMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRGNSQPTEWEKIFAIYSSDKGLISRIY KELKQIYKKKTNNPIKKWAKDMNRHFSREDIYAANRHMKKCSSSLAIREMQIKTTMRYHL TPVRMAIIKKSGNNRDKAGQLNPPYTTIKPLRSSNGIKEKTHQRTATSKIEETSAHKDEK EPAQDLTTQQTLRSQNNHTGSLRDQIDFYN >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_9|1173_bp atggccacactgcccaaggtaatttatagattcaatgccatccccaacaagctaccaatg 
actttcttcacagaattggaaaaaactacttttaagttcatatggaaccaagaaagagct cgcattgccaagacaatcctaagccaaaagaataaagctggaggcattatgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagtatggtactggtacaaaaacaga gatatagaccaatggaacagaacagagccctcagaaataataccatgcatctacaaccat ctgatctttgacaaacctgacaaaaacaagaaacggggaaaggattccctatttaataca tggtgctgggaaaactggctagccatatgtagaaagctgaaactggattccttccttaca ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaaaccctagaagaaaacctaggcactaccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagggaacagccaacctaca gaatgggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatctac aaagaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaag gatatgaacagacacttctcaagagaagacatttatgcagccaacagacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagttagaatggcgatcattaaaaagtcaggaaacaacagagacaaagccggtcaa ctgaacccaccttataccacaatcaaacccttgaggtcatcaaatgggataaaagaaaaa acccatcaaaggacagcaacttcaaagattgaagaaacatcagcccacaaagatgagaaa gaaccagcacaagatctgacaactcaacaaactctgcgctcccaaaacaaccacacaggg tcacttcgtgatcaaatcgacttttacaactag >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_10|224_aa MQGWFNIRKLINIIHHINGTNDKNHKIISKDAEKAFDKIQHPFMLKTINKLGIDGTYLKI VRAIYDKPIANIIQNGQNPEALPLKSSTRQRCPLSPLLFNIVLEVLARAIRQEKEITGIQ IGKEEVKLPLFADNMILYLEKPIVSAQKLLKLISNFSKVSGHKINVQKSQAFLYTNNRQA ESQIMNELPFTIATKRIKYLGIQLTKDVKDLVKENYKPLLKEIR >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_10|675_bp atgcaaggctggttcaacatacgcaaattaataaacataattcatcacataaacggaact aatgacaaaaaccacaagattatctcaaaagatgccgaaaaagcctttgataaaattcaa catcctttcatgttaaaaacaatcaataaactaggtattgatggaacatatctcaaaata gtaagagccatttacgacaaacccatagccaatatcatacagaatgggcaaaacccagaa gcacttcccttgaaaagcagcacaagacaaagatgccctctctcaccactcctattcaac atagtattggaagttctggccagggcaatcaggcaagagaaagaaataacgggtattcaa ataggaaaagaggaagtcaaattgcctctgtttgcagacaacatgatcctatatctagaa aaacccattgtctcagcccaaaagctccttaagctgataagcaacttcagcaaagtctca 
ggacacaaaatcaatgtgcaaaaatcacaagcattcctgtacaccaacaatagacaagca gagagccaaatcatgaatgaactcccattcacaattgctacaaagagaataaaataccta ggaatacagctaacaaaagatgtgaaggacctcgtcaaggagaactacaaaccactgctc aaggaaataagatag >gi568815593f:140591649_140798549|GENSCAN_predicted_peptide_11|1586_aa MVFSRRGGLGARDLLLWLLLLAAWEVGSGQLHYSIPEEAKHGTFVGRVAQDLGLELAELV PRLFRVASKTHRDLLEVNLQNGILFVNSRIDREELCQWSAECSIHLELIADRPLQVFHVE VKVKDINDNPPVFRGREQIIFIPESRLLNSRFPIEGAADADIGANALLTYTLSPSDYFSL DVEASDELSKSLWLELRKYLDREETPELHLLLTATDGGKPELQGTVELLITVLDVNDNAP LFDQAVYRVHLLETTANGTLVTTLNASDADEGVNGEVVFSFDSGISRDIQEKFKVDSSSG EIRLIDKLDYEETKSYEIQVKAVDKGSPPMSNHCKVLVKVLDVNDNAPELAVTSLYLPIR EDAPLSTVIALITVSDRDSGANGQVTCSLMPHVPFKLVSTFKNYYSLVLDSALDRESLSV YELVVTARDGGSPSLWATARVSVEVADVNDNAPAFAQPEYTVFVKENNPPGCHIFTVSAR DADAQENALVSYSLVERRVGERALSNYVSVHAESGKVYALQPLDHEELELLQFQVSARDA GVPPLGSNVTLQVFVLDENDNAPALLAPRVGGTIGAVSELVPRLVGAGHVVAKVRAVDAD SGYNAWLSYELQPAAGGARIPFRVGLYTGEISTTRVLDEADLSRYRLLVLVKDHGEPALT ATATVLVSLVESGQAPKASSRASVGVAGPEAALVDVNVYLIIAICAVSSLLVLTLLLYTA LRCSVPPTEGAYVPGKPTLVCSSALGSWSNSQQRRQRVCSSEGPPKTDLMAFSPGLSPSL NTSERNEQPEANLDLSGNEAENKIGDMVRVTEAWEVGSGQLRYSVPEEAKHGTFVGRIAQ DLGLELEELVPRLFRVASKRHGDLLEVNLQNGILFVNSRIDREELCGRSAECSIHVEVIV DRPLQVFHVEVEVKDINDNPPIFPMTVKTIRFPESRLLDSRFPLEGASDADIGVNALLSY KLSSSEFFFLDIQANDELSESLSLVLGKSLDREETAEVNLLLVATDGGKPELTGTVQILI KVLDVNDNEPTFAQSVYKVKLLENTANGTLVVKLNASDADEGPNSEIVYSLGSDVSSTIQ TKFTIDPISGEIRTKGKLDYEEAKSYEIQVTATDKGTPSMSGHCKISLKLVDINDNTPEV SITSLSLPISENASLGTVIALITVSDRDSGTNGHVTCSLTPHVPFKLVSTFKNYYSLVLD SALDRESVSAYELVVTARDGGSPSLWATTSVSIEVADVNDNAPAFAQPEYTVFVKENNPP GCHIFTVSAWDADAQENALVSYSLVERRVGERALSSYVSVHAESGKVYALQPLDHEEVEL LQFQVSARDAGVPPLGSNVTLQVFVLDENDNAPALLAPRAGTAAGAVSELVPWSVGAGHV VAKVRAVDADSGYNAWLSYELQLGTGSARIPFRVGLYTGEISTTRALDEADSPRHRLLVL VKDHGEPALTATATVLVSLVESGQAPKASSRAWVGAAGSEATLVDVNVYLIIAICAVSSL LVLTVLLYTALRCSVPPTEGARAPGKPTLVCSSAVGSWSYSQQRRQRVCSGEDPPKTDLM AFSPSLSQGPDSAEEKQLSESEYVGK >gi568815593f:140591649_140798549|GENSCAN_predicted_CDS_11|4758_bp 
atggtgttttctaggagagggggcctgggagcccgggatctgcttctttggcttctgctc ctcgcagcctgggaggtggggagcggccagctccactactcgatcccggaggaagccaaa cacggcaccttcgttggccgcgttgctcaggacctgggactggagctggcggagctggtg cctcgcctgttccgggtggcgtccaaaacacacagggaccttctggaggtaaatctgcag aatggcattttgtttgtgaattctcggatcgatcgcgaggagctgtgccagtggagcgcg gagtgcagcatccacctggagttgatcgccgacaggccgctgcaggttttccatgtggag gtgaaggtgaaagacattaacgataatccacccgtcttcaggggcagagaacaaataata tttattcctgaatctagactcctgaattcgcgttttccgatagaaggagctgctgatgca gacattggtgctaacgctcttctaacgtacacgctcagcccgagtgattatttctctttg gatgtagaggcaagtgatgaactgagtaaatctctttggcttgaattgagaaaatatttg gatagagaagaaacaccagaacttcacttattactgactgccactgatgggggcaaaccg gagctgcaaggtacagttgagctgctgatcaccgtcctcgacgttaatgataacgcccca ctgtttgaccaggccgtatacagagtccacttgttagagactacagcaaatggaacatta gtgaccacattaaatgcctctgatgctgacgaaggtgtaaatggtgaagtcgtcttttcc tttgacagtggtatttctcgtgacattcaagaaaaattcaaagttgattccagctcagga gaaattaggttaattgataaactggattatgaagaaacaaaatcctacgaaattcaagta aaggcagttgataaaggaagtcctccgatgtcaaatcactgtaaggttttggtgaaagtg ctggatgtaaatgataatgctccagaactggcggtcacttcattgtatttgcctatcaga gaggacgctccactcagcaccgtcatcgccctcatcaccgtgtctgaccgtgactcaggt gccaacgggcaggtgacttgctccttaatgccccacgtccccttcaagctggtgtccacc ttcaagaattactactcgttggtgttggacagcgccctggatcgcgagagcctgtcggtc tatgagctggtggtgaccgcgcgggacgggggctcgccttcgctgtgggccacggccagg gtgtccgtggaggtggccgacgtgaatgacaacgcgcctgcgttcgcgcagcccgagtac acagtattcgtgaaggagaacaacccgccgggctgccacatcttcacggtgtctgcgcgg gacgcggacgcgcaggagaacgcgctggtgtcctattcgctggtggaacggcgggtgggc gagcgcgcgctgtcgaactacgtgtcagtgcacgcggagagcggcaaggtgtacgcactg cagcccctggaccacgaggagctggagctgctgcagttccaggtgagcgcgcgggatgcg ggcgtgccgcctctgggcagcaacgtgacgctgcaggtgttcgtgctggacgagaacgac aacgcgccggcgctgctggcgcctcgagtgggtggcactattggtgcagtcagtgagctg gtgccgcgattggtgggtgcgggtcatgtggtggcgaaggtgcgcgcagtggacgccgac tcgggctacaacgcgtggctgtcctatgaactgcagccggcagcaggcggcgcgcgcatc ccgttccgcgtggggctgtacacgggcgagatcagcacgactcgtgtcctggacgaggct 
gacttgtcgcgctaccgccttctggtgctagtgaaggatcacggtgagccggcgctgaca gccacggccactgtgcttgtatctctggtggagagcggccaggcgccaaaggcgtcttcg cgggcgtcggtgggtgtcgcgggcccagaggcggcgctggtggatgtcaacgtgtacctg atcatcgccatctgcgcggtgtccagcctgctggtgctcacactgctgctgtacacggcg ctgcggtgctcagtgccgcccactgagggtgcgtatgtgccgggcaagcccactctggtg tgctccagcgcgttggggagctggtcgaactcacagcagaggcggcagagggtgtgctct agcgagggcccacccaagaccgacctcatggccttcagcccaggcctatctccaagtctt aacacgtcagaaagaaatgaacaaccagaagcaaatttggatctttctggtaatgaagct gaaaacaaaattggtgacatggttagagtaactgaagcctgggaggtggggagcggccag ctccgctactccgtccccgaggaggccaaacacggcaccttcgtgggccgcatcgcgcag gacctggggctggagctggaggagctggtgccgcgcctgttccgggtggcgtccaaaaga cacggggaccttctggaggtaaatctgcagaatggcattttgtttgtgaattctcggatc gaccgggaggagctgtgcgggcggagcgcggaatgtagcatccacgtggaggtgatcgtg gacaggccgctgcaggttttccatgtggaagtggaggtgaaggacattaacgacaacccg ccaatatttccaatgacagtaaagactatccggtttcccgaatcaaggctgcttgattct cggtttcctctagagggagcatctgatgcagatataggagtaaatgctcttctctcctac aagctcagctccagtgagtttttcttcctagatatacaggcaaatgatgaactaagcgaa tctttgtctctcgtgctggggaaatcgctggacagagaggaaactgctgaggttaatttg ttactggtggctactgatgggggcaaacctgagctcacgggcaccgttcaaatacttatt aaggtattagatgtaaatgacaatgaaccaacttttgcccaatcagtttacaaagtaaaa ttgttagagaatacggcaaatgggaccttagtggttaagttaaacgcttctgatgcagat gaaggaccgaacagcgagattgtgtattcactcggtagtgatgtgtcctccactatacag actaagtttaccatagatcccatctcaggggaaatcagaactaagggaaaattagattat gaagaagcaaagtcctacgagattcaggtcactgcaactgacaaaggaaccccttcaatg tcaggacattgtaaaatttcattaaaacttgtggacatcaatgataacacaccagaagtc tcaataacgtctctctcacttcccatctcagagaacgcttccctgggcactgtcattgct ctcatcacggtgtcggatcgcgactctggtacgaatggacatgtcacctgctccctgacg ccccacgtccctttcaagctggtgtccaccttcaagaattactactcgttggtgctggac agcgccctggaccgcgagagcgtgtcagcctatgagctggtggtgaccgcacgggacggg ggctcgccttcactgtgggccaccaccagcgtgtccatcgaggtggccgacgtgaacgac aacgcgccggcgttcgcacagcctgagtacacagtattcgtgaaggagaacaacccgccg ggctgccacatcttcacggtgtcagcgtgggatgcggacgcgcaggagaacgcgctggtg 
tcctactcgctggtggagcggcgggtgggcgagcgcgcgttgtcgagctacgtttcggtg cacgcggagagcggcaaggtgtacgcgctgcagccgctggaccacgaggaagtggagctg ctgcagttccaggtgagcgcgcgggatgcgggcgtgccgcctctgggcagcaacgtgacg ctgcaggtgttcgtgctggacgagaacgacaacgcgccggcactgttggcgcctagggct ggcaccgctgctggcgcagtgagtgagctggtgccgtggtcggtgggtgcagggcacgtg gtggcgaaggtgcgcgcagtggacgctgactcaggctacaacgcgtggctttcgtacgag cttcagctgggtactggcagcgctcgcatcccgttccgcgtggggctatacacgggtgag atcagcacgacacgtgccctagacgaggctgactcccctcgacaccgcctactcgtgctg gtgaaggaccacggcgaaccagcgttgacagccacggccaccgtgttagtgtcgttggtg gaaagtggccaggcacccaaggcctcgtcgcgggcgtgggtgggcgccgcgggctcagag gctacgctggtggatgtcaacgtgtacctgatcatcgccatctgcgcggtatccagcctg ttggtgctcacggtgctgctgtacactgcgctgcggtgctcggtgccacccaccgagggt gcgcgcgcgccaggaaagcccacgctggtgtgctccagcgccgtggggagctggtcttac tcgcagcagaggcggcagagggtgtgctctggggaggacccccccaagacggacctcatg gccttcagccctagcttatctcaaggtccagactccgcagaagagaaacagctctcagaa tcagaatacgtaggaaag