GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:00:31 Sequence gi568815584f:20355646_20557505 : 201860 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 252 386 135 2 0 85 69 40 0.848 2.56 1.02 Intr + 662 789 128 1 2 105 41 128 0.520 9.28 1.03 Intr + 969 1044 76 0 1 33 101 42 0.695 -0.48 1.04 Intr + 1406 1504 99 1 0 28 78 81 0.590 1.41 1.05 Intr + 1751 1875 125 1 2 65 95 68 0.965 4.58 1.06 Term + 1993 2152 160 1 1 104 42 106 0.698 5.01 1.07 PlyA + 2165 2170 6 1.05 2.48 PlyA - 2283 2278 6 1.05 2.47 Term - 12914 12792 123 2 0 106 49 51 0.984 1.38 2.46 Intr - 13257 13153 105 0 0 42 119 50 0.854 3.91 2.45 Intr - 13931 13699 233 0 2 56 113 67 0.789 3.49 2.44 Intr - 14134 14029 106 1 1 75 105 27 0.916 2.89 2.43 Intr - 15669 15573 97 2 1 76 111 62 0.993 7.31 2.42 Intr - 17212 17088 125 1 2 40 52 96 0.938 0.58 2.41 Intr - 17502 17366 137 1 2 115 109 43 0.994 9.39 2.40 Intr - 17757 17625 133 0 1 131 109 92 0.998 15.82 2.39 Intr - 17938 17862 77 2 2 84 97 13 0.984 1.03 2.38 Intr - 18165 18033 133 0 1 93 92 103 0.995 11.42 2.37 Intr - 18891 18784 108 0 0 87 69 58 0.921 4.28 2.36 Intr - 20619 20459 161 1 2 86 88 111 0.953 10.61 2.35 Intr - 21847 21635 213 2 0 110 79 28 0.808 2.79 2.34 Intr - 22108 21955 154 1 1 79 84 -21 0.199 -3.75 2.33 Intr - 22591 22379 213 1 0 64 82 110 0.370 6.91 2.32 Intr - 22890 22735 156 0 0 129 116 139 0.991 21.01 2.31 Intr - 23208 23109 100 2 1 43 116 55 0.997 3.91 2.30 Intr - 23460 23336 125 0 2 71 99 97 0.999 8.48 2.29 Intr - 24408 24285 124 2 1 94 80 126 0.938 12.99 2.28 Intr - 24830 24590 241 0 1 83 116 133 0.981 12.21 2.27 Intr - 26711 26579 133 2 1 80 49 155 0.879 11.02 2.26 Intr - 27070 26978 93 1 0 21 82 204 0.953 13.26 2.25 Intr - 27708 27529 180 0 0 46 47 230 0.997 14.76 2.24 Intr - 27999 27843 157 2 1 105 100 191 0.984 22.01 2.23 Intr - 28273 28098 176 1 2 106 80 5 0.991 0.24 2.22 Intr - 28587 28393 195 0 0 43 52 198 0.953 11.31 2.21 Intr - 28863 28746 118 2 1 113 76 138 0.996 15.57 2.20 Intr - 29068 28955 114 0 0 23 99 73 0.733 1.46 2.19 Intr - 29464 29340 125 1 2 157 84 158 0.999 21.68 2.18 Intr - 30550 30430 121 0 1 93 82 78 0.999 8.20 2.17 Intr - 30978 30802 177 2 0 81 64 195 0.943 15.43 2.16 Intr - 33652 33593 60 0 0 112 81 33 0.834 2.85 2.15 Intr - 34095 33965 131 0 2 115 67 14 0.962 1.49 2.14 Intr - 35113 35036 78 1 0 73 76 71 0.942 4.15 2.13 Intr - 35451 35293 159 0 0 38 71 161 0.996 9.58 2.12 Intr - 36122 35954 169 1 1 81 78 132 0.757 11.45 2.11 Intr - 39982 39805 178 2 1 37 91 47 0.750 -1.12 2.10 Intr - 40304 40214 91 2 1 144 61 82 0.995 10.67 2.09 Intr - 41085 40976 110 1 2 85 80 76 0.586 6.50 2.08 Intr - 45496 45339 158 0 2 94 8 14 0.361 -6.35 2.07 Intr - 45936 45812 125 0 2 39 94 131 0.686 8.18 2.06 Intr - 48239 48078 162 2 0 85 26 204 0.971 14.07 2.05 Intr - 49127 48966 162 2 0 67 80 150 0.903 12.27 2.04 Intr - 49940 49806 135 2 0 99 67 50 0.790 4.76 2.03 Intr - 50755 50588 168 1 0 70 93 157 0.997 14.54 2.02 Intr - 52757 52228 530 0 2 40 18 319 0.100 12.76 2.01 Init - 57949 57760 190 1 1 58 71 184 0.498 10.87 2.00 Prom - 58499 58460 40 -7.46 3.08 PlyA - 59114 59109 6 1.05 3.07 Term - 60338 60267 72 2 0 119 45 66 0.767 3.31 3.06 Intr - 69973 69852 122 2 2 87 91 56 0.811 6.01 3.05 Intr - 73756 73557 200 0 2 105 35 33 0.431 -1.31 3.04 Intr - 74024 73857 168 1 0 85 99 138 0.998 13.66 3.03 Intr - 75074 74150 925 0 1 115 75 489 0.624 40.59 3.02 Intr - 79176 78998 179 2 2 89 50 82 0.890 4.06 3.01 Init - 80166 79367 800 0 2 71 66 741 0.966 64.39 3.00 Prom - 81343 81304 40 -9.36 4.00 Prom + 81830 81869 40 -9.46 4.01 Init + 82202 82392 191 1 2 30 91 160 0.250 8.98 4.02 Intr + 83044 83102 59 1 2 71 105 110 0.530 9.53 4.03 Intr + 85007 85105 99 0 0 89 53 90 0.640 5.68 4.04 Intr + 85523 85581 59 0 2 84 44 53 0.560 -0.90 4.05 Term + 86087 86203 117 1 0 125 50 43 0.436 2.74 4.06 PlyA + 87013 87018 6 1.05 5.12 PlyA - 91442 91437 6 1.05 5.11 Term - 91634 91595 40 1 1 130 38 35 0.725 -0.64 5.10 Intr - 91875 91777 99 2 0 71 105 9 0.503 0.13 5.09 Intr - 92045 91970 76 0 1 121 88 59 0.985 7.77 5.08 Intr - 92349 92259 91 0 1 121 25 48 0.998 1.37 5.07 Intr - 92526 92461 66 0 0 92 95 77 0.996 7.90 5.06 Intr - 93166 93088 79 0 1 52 105 15 0.822 -0.85 5.05 Intr - 93368 93319 50 2 2 92 91 51 0.847 3.28 5.04 Intr - 93621 93526 96 0 0 96 121 126 0.999 16.91 5.03 Intr - 96504 96350 155 1 2 105 73 126 0.554 12.59 5.02 Intr - 96803 96684 120 0 0 98 51 84 0.756 6.17 5.01 Init - 99038 98924 115 2 1 99 77 155 0.997 15.87 5.00 Prom - 99277 99238 40 -12.01 6.00 Prom + 99935 99974 40 -11.82 6.01 Init + 100001 100058 58 1 1 65 110 38 0.981 5.19 6.02 Intr + 100269 100456 188 1 2 47 76 184 0.999 12.51 6.03 Intr + 101023 101215 193 0 1 106 66 158 0.999 14.47 6.04 Term + 101346 101863 518 1 2 68 47 507 0.984 39.08 6.05 PlyA + 102103 102108 6 1.05 7.04 PlyA - 102234 102229 6 1.05 7.03 Term - 103057 102914 144 1 0 53 52 49 0.123 -4.29 7.02 Intr - 105200 105010 191 0 2 95 84 113 0.888 10.90 7.01 Init - 105680 105239 442 2 1 117 78 144 0.695 12.54 7.00 Prom - 108595 108556 40 -5.66 8.00 Prom + 111321 111360 40 -6.26 8.01 Init + 114003 114130 128 2 2 52 67 92 0.546 1.13 8.02 Intr + 116663 116832 170 2 2 107 84 100 0.993 11.09 8.03 Intr + 118827 118930 104 1 2 88 115 46 0.987 7.19 8.04 Intr + 119128 119303 176 0 2 62 95 150 0.999 11.84 8.05 Intr + 119417 119607 191 2 2 40 77 185 0.999 11.83 8.06 Term + 120739 120956 218 2 2 60 43 280 0.999 18.01 8.07 PlyA + 121182 121187 6 1.05 9.00 Prom + 147397 147436 40 -5.26 9.01 Sngl + 154827 155477 651 2 0 89 48 464 0.941 36.68 9.02 PlyA + 159227 159232 6 1.05 10.04 PlyA - 163266 163261 6 1.05 10.03 Term - 167356 167230 127 0 1 49 50 292 0.903 19.26 10.02 Intr - 169478 169444 35 2 2 89 81 17 0.145 -1.88 10.01 Init - 173315 173133 183 2 0 95 47 121 0.646 7.82 10.00 Prom - 178336 178297 40 -3.16 11.00 Prom + 183899 183938 40 -1.96 11.01 Init + 183954 184068 115 2 1 77 48 75 0.493 2.77 11.02 Term + 186577 186641 65 2 2 82 48 80 0.234 1.45 11.03 PlyA + 187214 187219 6 1.05 12.02 PlyA - 189004 188999 6 1.05 12.01 Sngl - 201424 200807 618 1 0 75 43 375 0.854 25.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 52794 52228 567 0 0 81 18 335 0.879 20.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_1|240_aa ALGDIEIAIKLVKTELQSPEHPLDQHYRNLHCALRPLDHESYEFKVISQYLQSTHAPTHS DYTMTLLDLFEVEKDGEKEAFREDLHNRMSNWVGILSHGLRIAPPEAPITGYMFGKGIYF ADMSSKSANYCFASRLKNTGLLLLSEVALGQCNELLEANPKAEGLLQGKHSTKGLGKMAP SSAHFVTLNGSTVPLGPASDTGILNPDGYTLNYNEYIVYNPNQVRMRYLLKVQFNFLQLW >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_1|723_bp gctttgggagacattgaaattgctattaagctggtgaaaacagagctacaaagcccagaa cacccattggaccaacactatagaaacctacattgtgccttgcgcccccttgaccatgaa agttatgagttcaaagtgatttcccagtacctacaatctacccatgctcccacacacagc gactataccatgaccttgctggatttgtttgaagtggagaaggatggtgagaaagaagcc ttcagagaggaccttcataacaggatgagtaactgggtgggaatcttgagccatgggctt cgaattgccccacctgaagctcccatcacaggttacatgtttgggaaaggaatctacttt gctgacatgtcttccaagagtgccaattactgctttgcctctcgcctaaagaatacagga ctgctgctcttatcagaggtagctctaggtcagtgtaatgaactactagaggccaatcct aaggccgaaggattgcttcaaggtaaacatagcaccaaggggctgggcaagatggctccc agttctgcccacttcgtcaccctgaatgggagtacagtgccattaggaccagcaagtgac acaggaattctgaatccagatggttataccctcaactacaatgaatatattgtatataac cccaaccaggtccgtatgcggtaccttttaaaggttcagtttaatttccttcagctgtgg tga >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_2|2352_aa MSSCPALGSLPFQQLLRPARARESPPAPSLLRSASGSSPGCHPSHRQNPERSFCFLLRES DAPDILSLENRCLAMLPDLQPLEKLHQHVSTHSDILSLKNQCLATLPDLKTMEKPHGYVS AHPDILSLENQCLATLSDLKTMEKPHGHVSAHPDILSLENRCLATLSSLKSTVSASPLFQ SLQISHMTQADLYRVNNSNCLLSEPPSWRAQHFSKGLDLSTCPIALKSISATETAQEATL GRWFDSEEKKGAETQMPSYSLSLGEEEEVEDLAVKLTSGDSESHPEPTDHVLQEKKMALL SLLCSTLVSEVNMNNTSDPTLAAIFEICRELALLEPEFILKASLYARQQLNVRNVANNIL AIAAFLPACRPHLRRYFCAIVQLPSDWIQVAELYQSLAEGDKNKLVPLPACLRTAMTDKF AQFDEYQLAKYNPRKHRAKRHPRRPPRSPFEKAGDTVSEKKNPPRFTLKKLVQRLHIHKP AQHVQALLGYRYPSNLQLFSRSRLPGPWDSSRAGKRMKLSRPETWERELSLRGNKASVWE ELIENGKLPFMAMLRNLCNLLRVGISSRHHELILQRLQHAKSVIHSRQFPFRFLNAHDAI DALEAQLRNQALPFPSNITLMRRILTRNEKNRPRRRFLCHLSRQQLRMAMRIPVLYEQLK REKLRVHKARQWKYDGEMLNRYRQALETAVNLSVKHSLPLLPGRTVLVYLTDANADRLCP KSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQE FDENDGWSLNTFGKYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVG ILLRRVQYLSTDLNPNDVTLSGCTDAILKWRSIRLFISSTFRDMHGERDLLLRSVLPALQ ARAAPHRISLHGIDLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGYIPPSYNL PDHPHFHWAQQYPSGRSVTEMEVMQFLNRNQRLQPSAQALIYFRDSSFLSSVPDAWKSDF VSESEEAARRISELKSYLSRQKGITCRRYPCEWGGVAAGRPYVGGLEEFGQLVLQDVWNM IQKLYLQPGALLEQPVSIPDDDLVQATFQQLQKPPSPARPRLLQDTVQRLMLPHGRLSLV TGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLALTLLRRLCTYLRGQL KEPGALPSTYRSLVWELQQRLLPKSAESLHPGQTQVLIIDGADRLVDQNGQLISDWIPKK LPRCVHLVLSVSSDAGLGETLEQSQGAHVLALGPLEASARARLVREELALYGKRLEESPF NNQMRLLLVKRESGRPLYLRLVTDHLRLFTLYEQVSERLRTLPATVPLLLQHILSTLEKE HGPDVLPQALTALEVTRSASSVPKEEQKLPEADVAVFRTFLRQQASILSQYPRLLPQQAA NQPLDSPLCHQASLLSRRWHLQHTLRWLNKPRTMKNQQSSSLSLAVSSSPTAVAFSTNGQ RAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTAFDGLLELWDLQHG CRVLQTKAHQYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAFQHTYPKSLNCVAFH PEGQVIATGSWAGSISFFQVDGLKVTKDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELW AWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSLSP ALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWLSPKVLVSGAED GSLQGWALKECSLQSLWLLSRFQKPVLGLATSQELLASASEDFTVQLWPRQLLTRPHKAE DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRISCSSDGSVGLWDPESGQRLGQFLGH QSAVSAVAAVEEHVVSVSRDGTLKVWDHQGVELTSIPAHSGPISHCAAAMEPRAAGQPGS ELLVVTVGLDGATRLWHPLLVCQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQV PKEADDTCIPRSSAAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIGALIW SSAHTFFVLSADEKISEWQVKLRKGSAPGNLRSSYTENPMILSTHKEYGIFVLQPKDPGV LSFLRQKESGEFEERLNFDINLENPSRTLISITQAKPESESSFLCASSDGILWNLAKCSP EGEWTTGNMWQKKANTPETQTPGTDPSTCRESDASMDSDASMDSEPTPHLKTRQRRKIHS GSVTALHVLPELLVTASKDRDVKLWERPSMQLLGLFRCEGSVSCLEPWLGANSTLQLAVG DVQGNVYFLNWE >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_2|7059_bp atgtcgtcttgcccggccctcggctcgctgcccttccagcagctgctgcgccccgcccgg gcccgagagtctccgcccgcaccgtctcttctcagatccgcatccggctcctcccccggc tgccacccttcccaccggcagaatccagagcgaagtttctgcttcctgctgcgggaatcg gacgccccagacatcctctccttggagaaccggtgcctggctatgctccctgacttacag cccttggagaaactacatcagcatgtatctacccactcagatatcctctccttgaagaac cagtgcctagccacgcttcctgacctgaagaccatggaaaaaccacatggatatgtgtct gcccacccagacatcctctccttggagaaccagtgcctggccacactttctgacctgaag accatggagaaaccacatggacatgtttctgcccacccagacatcctctccttggagaac cggtgcctggccaccctctctagtctaaagagcactgtgtctgccagccccttgttccag agtctacagatatctcacatgacgcaagctgatttgtaccgtgtgaacaacagcaattgc ctgctctctgagcctccaagttggagggctcagcatttctctaagggactagacctttca acctgccctatagccctgaaatccatctctgccacagagacagctcaggaagcaactttg ggtcgttggtttgattcagaagagaagaaaggggcagagacccaaatgccttcttatagt ctgagcttgggagaggaggaggaggtggaggatctggccgtgaagctcacctctggagac tctgaatctcatccagagcctactgaccatgtccttcaggaaaagaagatggctctactg agcttgctgtgctctactctggtctcagaagtaaacatgaacaatacatctgaccccacc ctggctgccatttttgaaatctgtcgtgaacttgccctcctggagcctgagtttatcctc aaggcatctttgtatgccaggcagcagctgaacgtccggaatgtggccaataacatcttg gccattgctgctttcttgccggcgtgtcgcccccacctgcgacgatatttctgtgccatt gtccagctgccttctgactggatccaggtggctgagctttaccagagcctggctgaggga gataagaataagctggtgcccctgcccgcctgtctccgtactgccatgacggacaaattt gcccagtttgacgagtaccagctggctaagtacaaccctcggaagcaccgggccaagaga cacccccgccggccaccccgctctccatttgagaaggccggtgatacagtgtcagagaaa aagaatcctccaaggttcaccctgaagaagctggttcagcgactgcacatccacaagcct gcccagcacgttcaagccctgctgggttacagatacccctccaacctacagctcttttct cgaagtcgccttcctgggccttgggattctagcagagctgggaagaggatgaagctgtct aggccagagacctgggagcgggagctgagcctacgggggaacaaagcgtcggtctgggag gaactcattgaaaatgggaagcttcccttcatggccatgcttcggaacctgtgcaacctg ctgcgggttggaatcagttcccgccaccatgagctcattctccagagactccagcatgcg aagtcggtgatccacagtcggcagtttccattcagatttcttaacgcccatgatgccatt gatgccctcgaggctcaactcagaaatcaagcattgccctttccttcgaatataacactg atgaggcggatactaactagaaatgaaaagaaccgtcccaggcggaggtttctttgccac ctaagccgtcagcagcttcggatggcaatgaggatacctgtgttgtatgagcagctcaag agggagaagctgagagtacacaaggccagacagtggaaatatgatggtgagatgctgaac aggtaccgacaggccctagagacagctgtgaacctctctgtgaagcacagcctgcccctg ctgccaggccgcactgtcttggtctatctgacagatgctaatgcagacaggctctgtcca aagagcaacccacaagggcccccgctgaactatgcactgctgttgattgggatgatgatc acgagggcggagcaggtggacgtcgtgctgtgtggaggtgacactctgaagactgcagtg cttaaggcagaagaaggcatcctgaagactgccatcaagctccaggctcaagtccaggag tttgatgaaaatgatggatggtccctgaatacttttgggaaatacctgctgtctctggct ggccaaagggttcctgtggacagggtcatcctccttggccaaagcatggatgatggaatg ataaatgtggccaaacagctttactggcagcgtgtgaattccaagtgcctctttgttggt atcctcctaagaagggtacaatacctgtcaacagatttgaatcccaatgatgtgacactc tcaggctgtactgatgcgatactgaaatggcgcagcatccggcttttcatttcatccact ttccgagacatgcatggggagcgggacctgctgctgaggtctgtgctgccagcactgcag gcccgagcggcccctcaccgtatcagccttcacggaatcgacctccgctggggcgtcact gaggaggagacccgtaggaacagacaactggaagtgtgccttggggaggtggagaacgca cagctgtttgtggggattctgggctcccgttatggatacattccccccagctacaacctt cctgaccatccacacttccactgggcccagcagtacccttcagggcgctctgtgacagag atggaggtgatgcagttcctgaaccggaaccaacgtctgcagccctctgcccaagctctc atctacttccgggattccagcttcctcagctctgtgccagatgcctggaaatctgacttt gtttctgagtctgaagaggccgcacgtcggatctcagaactgaagagctacctaagcaga cagaaagggatcacctgccgcagatacccctgtgagtgggggggtgtggcagctggccgg ccctatgttggcgggctggaggagtttgggcagttggttctgcaggatgtatggaatatg atccagaagctctacctgcagcctggggccctgctggagcagccagtgtccatcccagac gatgacttggtccaggccaccttccagcagctgcagaagccaccgagtcctgcccggcca cgccttcttcaggacacagtgcaacggctgatgctgccccacggaaggctgagcctggtg acggggcagtcaggacagggcaagacagccttcctggcatctcttgtgtcagccctgcag gctcctgatggggccaaggtggcatcattagtcttcttccacttttctggggctcgtcct gaccagggtcttgccctcactctgctcagacgcctctgtacctatctgcgtggccaacta aaagagccaggtgccctccccagcacctaccgaagcctggtgtgggagctgcagcagagg ctgctgcccaagtctgctgagtccctgcatcctggccagacccaggtcctgatcatcgat ggggctgataggttagtggaccagaatgggcagctgatttcagactggatcccaaagaag cttccccggtgtgtacacctggtgctgagtgtgtctagtgatgcaggcctaggggagacc cttgagcagagccagggtgcccacgtgctggccttggggcctctggaggcctctgctcgg gcccggctggtgagagaggagctggccctgtacgggaagcggctggaggagtcaccattt aacaaccagatgcgactgctgctggtgaagcgggaatcaggccggccgctctacctgcgc ttggtcaccgatcacctgaggctcttcacgctgtatgagcaggtgtctgagagactccgg accctgcctgccactgtccccctgctgctgcagcacatcctgagcacactggagaaggag cacgggcctgatgtccttccccaggccttgactgccctagaagtcacacggagtgcttct tcagtccccaaagaggaacaaaagctccccgaggctgacgttgcagtgtttcgcaccttc ctgaggcagcaggcttcaatcctcagccagtacccccggctcctgccccagcaggcagcc aaccagcccctggactcacctctttgccaccaagcctcgctgctctcccggagatggcac ctccaacacacactacgatggcttaataaaccccggaccatgaaaaatcagcaaagctcc agcctgtctctggcagtttcctcatcccctactgctgtggccttctccaccaatgggcaa agagcagctgtgggcactgccaatgggacagtttacctgttggacctgagaacttggcag gaggagaagtctgtggtgagtggctgtgatggaatctctgcttgtttgttcctctccgat gatacactctttcttactgccttcgacgggctcctggagctctgggacctgcagcatggt tgtcgggtgctgcagactaaggctcaccagtaccaaatcactggctgctgcctgagccca gactgccggctgctagccaccgtgtgcttgggaggatgcctaaagctgtgggacacagtc cgtgggcagctggccttccagcacacctaccccaagtccctgaactgtgttgccttccac ccagaggggcaggtaatagccacaggcagctgggctggcagcatcagcttcttccaggtg gatgggctcaaagtcaccaaggacctgggggcacccggagcctctatccgtaccttggcc ttcaatgtgcctgggggggttgtggctgtgggccggctggacagtatggtggagctgtgg gcctggcgagaaggggcacggctggctgccttccctgcccaccatggctttgttgctgct gcgcttttcctgcatgcgggttgccagttactgacggctggagaggatggcaaggttcag gtgtggtcagggtctctgggtcggccccgtgggcacctgggttccctttctctctctcct gccctctctgtggcactcagcccagatggtgatcgggtggctgttggatatcgagcggat ggcattaggatctacaaaatctcttcaggttcccagggggctcagggtcaggcactggat gtggcagtgtccgccctggcctggctaagccccaaggtattggtgagtggtgcagaagat gggtccttgcagggctgggcactcaaggaatgctcccttcagtccctctggctcctgtcc agattccagaagcctgtgctaggactggccacttcccaggagctcttggcttctgcctca gaggatttcacagtgcagctgtggccaaggcagctgctgacgcggccacacaaggcagaa gactttccctgtggcactgagctgcggggacatgagggccctgtgagctgctgtagtttc agcactgatggaggcagcctggccaccgggggccgggatcggatatcctgctccagtgat ggctctgtggggctctgggacccagagtcaggacagcggcttggtcagttcctgggtcat cagagtgctgtgagcgctgtggcagctgtggaggagcacgtggtgtctgtgagccgggat gggaccttgaaagtgtgggaccatcaaggcgtggagctgaccagcatccctgctcactca ggacccattagccactgtgcagctgccatggagccccgtgcagctggacagcctgggtca gagcttctggtggtaaccgtcgggctagatggggccacacggttatggcatccactcttg gtgtgccaaacccacaccctcctgggacacagcggcccagtccgtgctgctgctgtttca gaaacctcaggcctcatgctgaccgcctctgaggatggttctgtacggctctggcaggtt cctaaggaagcagatgacacatgtataccaaggagttctgcagccgtcactgctgtggct tgggcaccagatggttccatggcagtatctggaaatcaagctggggaactaatcttgtgg caggaagctaaggctgtggccacagcacaggctccaggccacattggtgctctgatctgg tcctcggcacacaccttttttgtcctcagtgctgatgagaaaatcagcgagtggcaagtg aaactgcggaagggttcggcacccggaaatttgaggagcagctatacagaaaatcctatg atattgtccacccacaaggagtatggcatatttgtcctgcagcccaaggatcctggagtt ctttctttcttgaggcaaaaggaatcaggagagtttgaagagaggctgaactttgatata aacttagagaatcctagtaggaccctaatatcgataactcaagccaaacctgaatctgag tcctcatttttgtgtgccagctctgatgggatcctatggaacctggccaaatgcagccca gaaggagaatggaccacaggtaacatgtggcagaaaaaagcaaacactccagaaacccaa actccagggacagacccatctacctgcagggaatctgatgccagcatggatagtgatgcc agcatggatagtgagccaacaccacatctaaagacacggcagcgtagaaagattcactcg ggctctgtcacagccctccatgtgctacctgagttgctggtgacagcttcgaaggacaga gatgttaagctatgggagagacccagtatgcagctgctgggcctgttccgatgcgaaggg tcagtgagctgcctggaaccttggctgggcgctaactccaccctgcagcttgccgtggga gacgtgcagggcaatgtgtactttctgaattgggaatga >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_3|821_aa MSVSDTPHYPPELESVSHPVQRDCLGLLVHAQRTPSWPPSPDEDPRLPSFPLEEPGSRPL VPRNLPFPALSLEEEEEEEEDEDAAEPEWLRSEEHPSQFFAEAQRLREQRLLLDEEVSVA GRVYGVHRVILAAISSLFRDRLLGGGGPRPPFSLEVSPGGWEAVLTFAYEGVLGPASQGD VLAAAEALGAPRVKAAAQQTCERAGNAREDVKKPSQAEELRENLRGIELLYREGVGCDLK LEAGGCQLSGEGLWGIKLLQVHMLTECNKESPASWLELRVQSMAVEQAESGLTQAEINKR PFLASSIQDTGKPGEHGGKLNVCFHLVHRAALACGSEFFGAMLLSGMRESQGTEVSLRTI STQDLRLLVSFAYSGVVRARWPGLLRAAQAALQYQSSSCLDLCQKGLARGLSPARCLALF PMAEAPGLERLWSKARHYLLTHLPAVALCPAFPSLPAACLAELLDSDELHVQEEFEAFVA ARCWLAANPETQESEAKALLRCVRFGRMSTRELRRVRAAGLLPPLTPDLLHQLMVEADVP GQERRREPDRALVVIGGDGLRPDMALRQPSRAVWWARAFRCGVGLVRTVEWGQLPALPAP GRFRHGAASLAGSELYVCGGQDFYSHSNTLASTLRWEPSQEDWEEMAPLSQARSLFSLVA LDGKLYALGGRHNDVALDSVETYNPELNVWRPAPALPAPCFAHAAAILEGQLYVSGGCGG TGQYLASLMHYDPKLEKPGTFLSPMGVPRAGHVMAALEIAHSPGWTFSYFSNPKSSKPDQ SVQGGVASSLELIAKDPQTTNEGFRCEWYCPLGGVFGFHNN >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_3|2466_bp atgagcgtctcggacaccccacattatccccctgagctggagagtgtctctcatcccgtc cagagggactgccttggactccttgtgcacgcccagagaaccccttcctggcctccctcc cccgacgaggatcccagattgccttcctttcctctggaggagcctggttccaggcccctg gtcccaaggaaccttccctttccagccttgtccctagaggaagaagaggaagaggaagaa gatgaagatgcggcagagcccgagtggctgcgcagcgaggagcatccgagccagtttttc gccgaggcccagcggctgcgggagcagagactgttgctggacgaagaggtgtcagtcgcg gggcgggtatacggggtgcatcgggtgatcctggccgcaatcagcagcctcttccgagac aggctgctgggcggcggaggtccgcggccccccttcagcctcgaggtgtccccagggggc tgggaggccgtgctgacctttgcctatgagggggtgctgggccccgcctcgcagggggat gtgctggccgcagcagaggcgctgggagcgccccgggtgaaggctgctgcccagcagaca tgcgagagggctggaaatgccagggaagatgtaaagaagcccagccaggcagaggagctg agggagaacctgcgcggaatcgagctcctctaccgagagggcgtcgggtgtgacttgaag ctggaggcaggcggctgccagctgtcgggtgagggcctgtggggcattaaattactgcag gtgcatatgctaacggaatgtaacaaggaatctccagccagctggctagagctaagggta caatccatggcagtagaacaagcagaaagtggattgactcaagcagaaattaacaaaagg cccttcttggcatcctccattcaggacacaggaaagccaggggaacatggtggaaaactc aatgtgtgctttcaccttgtgcaccgagccgccctggcctgtggcagtgagttctttggg gccatgctcctgagcgggatgagggaatcccagggcacagaggtatctctgcggacgatc tccacccaggacctgcgactcctcgtctcttttgcttactccggagttgtgcgggcaagg tggccagggctactgagagctgcccaggctgctctgcagtaccagagctcttcctgcttg gatttgtgtcagaaaggcttggcacggggcctcagccctgcccgttgcctggccctgttc cccatggcggaagcccctgggttggagaggctctggagcaaagcccgtcactacctcctc acccacctgcctgctgtagccttgtgtcctgctttcccttctttaccagctgcctgcttg gctgagctcctggatagtgatgagctccatgtgcaggaggagtttgaggcctttgtggct gcacggtgttggctggctgccaaccccgagacccaggagtcagaggccaaggccctgctg cgatgtgtccgctttggccgcatgtccaccagggagttgcggagggtgcgggcagccggg ctacttccacccctgaccccagatctgttgcaccagctgatggtagaggctgatgttcca ggccaagagagacggagggagcctgaccgggcactggtagtgattggcggggatgggctc agaccagacatggccctaagacaaccatcccgagcagtgtggtgggcccgggccttccgc tgtggcgtgggactggtacgaactgttgagtgggggcagctgcctgccctgcctgccccc ggacgcttccggcatggggctgcaagcctggcaggaagtgaactctatgtgtgtggggga caagatttctacagtcactccaacaccctggcttcaactctcaggtgggagcccagtcaa gaggactgggaggagatggctcctttgtcccaggctcgaagccttttctccttggtggca ctggatggaaaactttatgccctgggtggaagacacaatgatgttgccctggactctgtg gagacctacaaccctgagctcaatgtctggaggccagcacctgcacttccagcaccatgt tttgcccacgcagctgcgattttggagggccagttgtacgtgagcggtggctgtggtggg actggccaatacctggcctcactgatgcactatgaccccaaacttgagaagccagggacg tttctgagccctatgggggtacctcgggctggccatgtcatggctgcattggagatagca cactctcctggatggaccttctcttatttctccaatccaaaatccagtaagccagatcaa agtgtccaggggggtgtggcttcatccttggagctgatagccaaggatccacagaccact aatgaaggttttcgatgtgagtggtactgccctctagggggtgtttttggctttcacaat aattag >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_4|174_aa MSSPEQDNVASILAPNQELEELQSGGDIPRASLCSPTAYKDEDTGPCYEFYANSVVLGWL HTACHTVTGLSISAISSNGKIKAAELSIIAPTIANFLLCSCALNFGCLYASLTWLSVVNW GSSAQADTYQMALSYSVSCLNQLKSNVLVLGYKQNWEKGPVSAMEYYIGILVRM >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_4|525_bp atgtcctccccagagcaagacaatgtggccagcatcttggcccccaaccaggagttggag gaactgcaatctggtggagacataccccgagccagcctctgttcaccaactgcttacaag gatgaggacactggaccctgctacgagttctatgccaactccgtagtcctgggatggctc catacagcttgtcatactgtcactggattgtctatctcagccatctccagcaacggcaaa atcaaggctgccgagctgagtatcattgcccccaccatcgccaacttcctcctctgttcc tgtgccctcaactttggctgtctctacgcctcccttacctggttgtctgttgtgaactgg ggctcttcagcccaggcagacacctaccaaatggcgctgtcctactcagtctcatgtttg aaccaactgaagtccaatgtgctggtcttgggatataagcaaaattgggagaaagggcct gtatctgccatggagtactacataggcattctagtgcgaatgtga >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_5|328_aa MPAVLGFEGSANKIGVGVVRDGKVLANPRRTYVTPPGTGFLPGDTARHHRAVILDLLQEA LTESGLTSQDIDCIAYTKGPGMGAPLVSVAVVARTVAQLWNKPLVGVNHCIGHIEMGRLI TGATSPTVLYVIAYSEHRYRIFGETIDIAVGNCLDRFARVLKISNDPSPGYNIEQMAKRG KKLVELPYTVKGMDVSFSGILSFIEDVAHRMLATGECTPEDLCFSLQETVFAMLVEITER AMAHCGSQEALIVGGVGCNVRLQEMMATMCQERGARLFATDERFCIDNGAMIAQAGWEMF RAGHRTPLSDSGVTQRYRTDEVEVTWRD >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_5|987_bp atgccggcggtgctgggttttgaaggcagcgccaataagattggcgtgggcgtggtgcgg gatggcaaggtgctggcgaacccgcggcggacttacgtcacgcctcctggcacaggattc cttccaggtgatacagccaggcatcaccgagctgttatcctagacctgctgcaggaggca ctaacagagtctggattaacctcccaggatatcgactgcattgcatacaccaagggccct ggcatgggtgccccactggtttctgtggctgttgtggcccgtactgtggcccaactgtgg aataagccattggtgggtgtgaaccactgtataggccacattgagatgggccgcctcatc actggagccaccagcccaaccgtgttgtatgtgattgcatactcggaacatcgttaccgt atctttggggaaaccatcgatattgcagtgggtaattgtctggatcgttttgctcgagtg ctgaagatttctaacgacccaagtccaggatacaacattgaacagatggcaaagcgaggc aagaagctagttgagctgccatacactgtaaaggggatggacgtctcattctcagggatc ctgtctttcattgaggatgtagcccatcggatgctggccacaggcgagtgtactcctgag gatctgtgtttctccctgcaggaaactgtgtttgcaatgctggtagagatcacagagcga gccatggcacattgtggctcccaggaggccctcattgtgggaggagtggggtgtaatgtg aggctacaggagatgatggcaacaatgtgccaggaacgtggagcccggctttttgctaca gatgagagattctgtattgacaatggagcgatgatagcccaggctggctgggagatgttt cgggctggacacaggaccccactcagtgattctggggttacacagaggtatcggacagat gaagtagaggtgacctggagggactaa >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_6|318_aa MPKRGKKGAVAEDGDELRTEPEAKKSKTAAKKNDKEAAGEGPALYEDPPDQKTSPSGKPA TLKICSWNVDGLRAWIKKKGLDWVKEEAPDILCLQETKCSENKLPAELQELPGLSHQYWS APSDKEGYSGVGLLSRQCPLKVSYGIGDEEHDQEGRVIVAEFDSFVLVTAYVPNAGRGLV RLEYRQRWDEAFRKFLKGLASRKPLVLCGDLNVAHEEIDLRNPKGNKKNAGFTPQERQGF GELLQAVPLADSFRHLYPNTPYAYTFWTYMMNARSKNVGWRLDYFLLSHSLLPALCDSKI RSKALGSDHCPITLYLAL >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_6|957_bp atgccgaagcgtgggaaaaagggagcggtggcggaagacggggatgagctcaggacagag ccagaggccaagaagagtaagacggccgcaaagaaaaatgacaaagaggcagcaggagag ggcccagccctgtatgaggaccccccagatcagaaaacctcacccagtggcaaacctgcc acactcaagatctgctcttggaatgtggatgggcttcgagcctggattaagaagaaagga ttagattgggtaaaggaagaagccccagatatactgtgccttcaagagaccaaatgttca gagaacaaactaccagctgaacttcaggagctgcctggactctctcatcaatactggtca gctccttcggacaaggaagggtacagtggcgtgggcctgctttcccgccagtgcccactc aaagtttcttacggcataggcgatgaggagcatgatcaggaaggccgggtgattgtggct gaatttgactcgtttgtgctggtaacagcatatgtacctaatgcaggccgaggtctggta cgactggagtaccggcagcgctgggatgaagcctttcgcaagttcctgaagggcctggct tcccgaaagccccttgtgctgtgtggagacctcaatgtggcacatgaagaaattgacctt cgcaaccccaaggggaacaaaaagaatgctggcttcacgccacaagagcgccaaggcttc ggggaattactgcaggctgtgccactggctgacagctttaggcacctctaccccaacaca ccctatgcctacaccttttggacttatatgatgaatgctcgatccaagaatgttggttgg cgccttgattactttttgttgtcccactctctgttacctgcattgtgtgacagcaagatc cgttccaaggccctcggcagtgatcactgtcctatcaccctatacctagcactgtga >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_7|258_aa MAADGERSPLLSEPIDGGAGGNGLVGPGGSGAGPGGGLTPSAPPYGAGKHAPPQGKPGRV RGAPRGTLKAGEGAGPRAEAGPSRQVRDCCTCDWARLPSLRNRDHSLGTEGGSEQPDRSA NYEKPSELGQRVEDQKDFPTTVEHQWGSFPPFPEGHPAVLPGEDPPPYSPLTSPDSGSAP MITCRVCQSLINVEGKMHQHVVKCGVCNEATFGTWKHARRYGGIYAAWAFVILLAVLCLG RALYWACMKVSHPVQNFS >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_7|777_bp atggcggcagatggagagcgttccccgctgctgtctgagcccatcgacggtggcgcgggc ggcaacggtttagtggggcccggcgggagtggggctgggcccgggggaggcctgaccccc tccgcaccaccgtacggagccggtaaacatgccccgccccagggtaagccggggcgggtc cgaggtgctccccggggtactctgaaagccggggagggggcgggaccgagggcggaggcg ggtcccagtcgccaggtgcgggactgctgcacctgtgactgggcgaggcttccttccctc cgtaatcgcgaccacagcctagggacggaagggggttctgagcaacctgatagaagtgcc aattatgagaagccctccgagcttggtcagagggttgaagatcagaaggacttccctacc accgtggagcatcagtgggggtcatttcccccgtttcccgaggggcatccagccgtgttg cctggggaggacccacccccctattcacccttaactagcccggacagtgggagtgcccct atgatcacctgccgagtctgccaatctctcatcaacgtggaaggcaagatgcatcagcat gtagtcaaatgtggtgtctgcaatgaagccacctttggcacatggaagcatgcacggcga tatggaggcatctatgcagcctgggcatttgtcatcctgttggctgtgctgtgtttgggc cgggctctttattgggcctgtatgaaggtcagccaccctgtccagaacttctcctga >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_8|328_aa MQRGLALSLVSGARRQSHAAASAGRDARGNRAGVAPPEQEGDRYTYEDYKNTAEWLLSHT KHRPQVAIICGSGLGGLTDKLTQAQIFDYGEIPNFPRSTVPGHAGRLVFGFLNGRACVMM QGRFHMYEGYPLWKVTFPVRVFHLLGVDTLVVTNAAGGLNPKFEVGDIMLIRDHINLPGF SGQNPLRGPNDERFGDRFPAMSDAYDRTMRQRALSTWKQMGEQRELQEGTYVMVAGPSFE TVAECRVLQKLGADAVGMSTVPEVIVARHCGLRVFGFSLITNKVIMDYESLEKANHEEVL AAGKQAAQKLEQFVSILMASIPLPDKAS >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_8|987_bp atgcagagaggcctggcactgagcctagtgtcgggagcaaggcggcagagtcatgcggca gccagcgcgggcagggacgcacgcgggaatcgagctggggtggcgccaccggagcaggaa ggggacagatacacctatgaagattataagaacactgcagaatggcttctgtctcacact aagcaccgacctcaagttgcaataatctgtggttctggattaggaggtctgactgataaa ttaactcaggcccagatctttgactacggtgaaatccccaactttccccgaagtacagtg ccaggtcatgctggccgactggtgtttgggttcctgaatggcagggcctgtgtgatgatg cagggcaggttccacatgtatgaagggtacccactctggaaggtgacattcccagtgagg gttttccaccttctgggtgtggacaccctggtagtcaccaatgcagcaggagggctgaac cccaagtttgaggttggagatatcatgctgatccgtgaccatatcaacctacctggtttc agtggtcagaaccctctcagagggcccaatgatgaaaggtttggagatcgtttccctgcc atgtctgatgcctacgaccggactatgaggcagagggctctcagtacctggaaacaaatg ggggagcaacgtgagctacaggaaggcacctatgtgatggtggcaggccccagctttgag actgtggcagaatgtcgtgtgctgcagaagctgggagcagacgctgttggcatgagtaca gtaccagaagttatcgttgcacggcactgtggacttcgagtctttggcttctcactcatc actaacaaggtcatcatggattatgaaagcctggagaaggccaaccatgaagaagtctta gcagctggcaaacaagctgcacagaaattggaacagtttgtctccattcttatggccagc attccactccctgacaaagccagttga >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_9|216_aa MKLNLVQIFFMLLMLLLGLGMGLGLGLHMATAVLEESDQPLNEFWSSDSQDKAEATEEGD GTQTTETLVLSNKEVVQPGWPEDPILGEDEVGGNKMLRASALFQSNKDYLRLDQTDRECN DMMAHKMKEPSQSCIAQYAFIHEDLNTVKAVCNSPVIACELKGGKCHKSSRPFDLTLCEL SQPDQVTPNCNYLTSVIKKHIIITCNDMKRQLPTGQ >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_9|651_bp atgaagctgaatctggtgcagatctttttcatgttgctgatgctgctgctgggcctgggg atgggcctggggttgggacttcatatggctacagcagtcttggaggagagtgatcaaccg ctcaatgaattttggtccagtgactcacaggacaaagctgaggccactgaggagggagac ggcacccaaaccacagaaacgctggtgcttagcaacaaagaagtggtgcaacctggctgg ccagaagatcccatcctcggtgaagatgaggttgggggtaacaagatgctcagagcctca gctctctttcagagcaacaaagactatcttaggcttgaccagacagatagagaatgcaat gatatgatggcacacaagatgaaggagcccagtcagagttgcatagcccagtatgcattc atccatgaggatctaaacacagtcaaagctgtctgtaacagtcctgtcattgcctgtgag ctcaaggggggaaaatgtcacaaaagctcccgaccttttgatttgacattgtgcgagctg tcccaaccagaccaggtcactcctaactgcaattacctaacttctgttataaaaaagcac attattataacctgtaatgacatgaagcgccagttaccaactggacaatga >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_10|114_aa MAPKCPSPLPPQKKKSRLPPALRLEETSVSVGLPKKGEKEQQEAIEHIDEVQKEIDLMNK LDHVELATNVIYSLDNRAKKRRRREKEEEKEEEEEEEEEEEEEEEEEEEEEEKE >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_10|345_bp atggcccccaaatgcccgtctccactcccacctcaaaagaagaaatcaagactacctcct gctctgagactggaggagacatcagtctctgtaggcttgccgaagaagggagaaaaagaa cagcaagaagcaattgaacatattgatgaagtacaaaaagaaatagacttaatgaacaag ctagaccatgtggagctggctacaaatgtcatttacagcttggacaacagagcaaagaaa agaagaagaagagagaaagaagaagaaaaagaggaggaagaggaagaagaagaggaagaa gaagaagaagaagaagaggaggaggaggaggaggagaaagaatag >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_11|59_aa MGDAGSSSSCGNRLKISSDRTVMDMTPGQGAAGRTQRTFLMVPFSKADTCGSLIDFMAP >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_11|180_bp atgggtgatgccggaagcagcagctcctgtgggaacaggctcaagatatcctctgacagg acagttatggacatgacacctggacaaggagcagcaggaagaactcaaaggaccttcttg atggtgccttttagtaaagcagatacctgtgggagcctgatcgacttcatggctccatga >gi568815584f:20355646_20557505|GENSCAN_predicted_peptide_12|205_aa MMRTLITTHPLPLLLLPQQLLQLVQFQEVDTDFDFPEEDKKEEFEECLEKFFSTGPARPP TKEKVKRRVLIEPGMPLNHIEYCNHEIMGKNVYYKHRWVAEHYFLLMQYDELQKICYNRF VPCKNGIRKCNRSKGLVEGVYCNLTEAFEIPACKYESLYRKGYVLITCSWQNEMQKRIPH TINDLVEPPEHRSFLSEDGVFVISP >gi568815584f:20355646_20557505|GENSCAN_predicted_CDS_12|618_bp atgatgagaactctcatcaccacacacccactgcccctgcttctattgccgcagcagctg ctgcagctggtgcagtttcaagaggtggatacagattttgatttcccagaagaagataaa aaagaagaatttgaagagtgtttggaaaaattttttagtacagggcccgccagaccacct accaaagaaaaagtcaaaagacgtgtccttattgaacctggaatgccactaaatcatata gagtactgtaaccatgaaatcatgggaaaaaatgtttactacaaacaccgttgggtggca gaacattacttccttcttatgcaatatgacgagctccaaaaaatctgttacaacagattt gtgccatgtaagaatggaattaggaaatgtaacaggagcaaaggtcttgtagaaggagtg tattgtaatttaacagaagcatttgaaataccagcgtgtaaatacgaatcactttatagg aagggctacgtccttatcacttgttcatggcaaaatgaaatgcaaaaacgtattcctcat actataaatgatctcgtggagccacctgaacacagaagtttcctcagtgaggatggtgtc tttgtcatatcgccctag