GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:48:05 Sequence gi568815582r:2753096_2958104 : 205009 bp : 52.73% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3239 3511 273 2 0 134 60 577 0.993 56.69 1.02 Intr + 4377 4484 108 0 0 91 75 118 0.998 10.70 1.03 Intr + 4686 4850 165 0 0 105 74 166 0.999 16.49 1.04 Intr + 5375 5452 78 2 0 84 127 71 0.991 9.76 1.05 Intr + 6045 6077 33 0 0 114 82 35 0.907 3.32 1.06 Intr + 6257 6307 51 2 0 111 101 39 0.982 6.01 1.07 Intr + 6474 6566 93 0 0 94 99 59 0.991 7.18 1.08 Term + 7206 7416 211 0 1 95 44 106 0.895 3.99 1.09 PlyA + 7498 7503 6 1.05 2.10 PlyA - 8084 8079 6 1.05 2.09 Term - 9513 9044 470 1 2 66 55 205 0.876 10.52 2.08 Intr - 10891 10714 178 1 1 37 32 156 0.528 4.91 2.07 Intr - 12237 12151 87 0 0 94 3 82 0.329 0.96 2.06 Intr - 12521 12398 124 1 1 28 9 133 0.342 0.49 2.05 Intr - 13710 13053 658 1 1 -29 117 873 0.816 70.02 2.04 Intr - 14313 14104 210 1 0 -7 54 149 0.324 1.41 2.03 Intr - 15147 14962 186 1 0 31 39 362 0.663 25.78 2.02 Intr - 16128 15949 180 1 0 -88 -31 420 0.412 12.86 2.01 Init - 17688 17478 211 0 1 91 0 128 0.703 3.23 2.00 Prom - 18249 18210 40 -6.20 3.14 PlyA - 18339 18334 6 1.05 3.13 Term - 19007 18895 113 0 2 114 55 183 0.980 16.63 3.12 Intr - 22461 22356 106 0 1 96 94 133 0.991 14.99 3.11 Intr - 24032 23898 135 2 0 78 98 322 0.999 33.47 3.10 Intr - 24525 24142 384 0 0 8 89 196 0.458 7.31 3.09 Intr - 24658 24554 105 1 0 90 36 78 0.394 3.71 3.08 Intr - 30181 30120 62 2 2 104 49 69 0.540 3.54 3.07 Intr - 30483 30346 138 1 0 103 92 52 0.869 8.04 3.06 Intr - 31051 31000 52 1 1 -28 100 123 0.880 0.87 3.05 Intr - 31707 31561 147 0 0 103 -4 100 0.699 3.24 3.04 Intr - 32064 31907 158 1 2 64 69 222 0.653 18.14 3.03 Intr - 32551 32280 272 0 2 76 94 340 0.999 31.12 3.02 Intr - 32866 32704 163 2 1 103 76 152 0.921 15.05 3.01 Init - 33452 33407 46 2 1 53 92 91 0.730 4.80 3.00 Prom - 40032 39993 40 -0.71 4.00 Prom + 42181 42220 40 -3.21 4.01 Init + 45390 45453 64 2 1 91 88 243 0.835 23.86 4.02 Intr + 45864 46029 166 1 1 79 87 128 0.999 11.33 4.03 Intr + 46191 46474 284 0 2 73 45 370 0.889 28.80 4.04 Intr + 51294 51451 158 1 2 68 87 85 0.983 6.64 4.05 Intr + 51819 51962 144 2 0 93 26 128 0.304 8.09 4.06 Term + 56273 56299 27 1 0 131 48 4 0.166 -0.94 4.07 PlyA + 60072 60077 6 1.05 5.00 Prom + 62948 62987 40 -6.30 5.01 Init + 64174 64237 64 0 1 91 88 227 0.999 22.26 5.02 Intr + 64685 64871 187 0 1 8 92 152 0.520 6.87 5.03 Intr + 65582 65874 293 2 2 110 72 337 0.956 31.62 5.04 Intr + 67860 68014 155 1 2 79 69 171 0.780 14.60 5.05 Term + 68271 68510 240 2 0 117 52 46 0.716 0.26 5.06 PlyA + 68581 68586 6 1.05 6.03 PlyA - 69377 69372 6 1.05 6.02 Term - 76385 76158 228 2 0 128 50 115 0.946 8.66 6.01 Init - 76723 76601 123 1 0 50 80 102 0.975 3.77 6.00 Prom - 77021 76982 40 -3.81 7.00 Prom + 77174 77213 40 -0.51 7.01 Init + 77347 77398 52 0 1 80 83 146 0.906 12.57 7.02 Intr + 77599 77701 103 2 1 89 113 139 0.997 16.23 7.03 Term + 78701 79064 364 2 1 50 46 341 0.842 20.50 7.04 PlyA + 79160 79165 6 1.05 8.09 PlyA - 81084 81079 6 1.05 8.08 Term - 81132 81121 12 0 0 104 48 1 0.188 -3.82 8.07 Intr - 83882 83773 110 0 2 68 113 56 0.728 6.60 8.06 Intr - 86836 86650 187 1 1 130 49 136 0.720 13.68 8.05 Intr - 87193 87053 141 1 0 33 62 123 0.964 5.26 8.04 Intr - 87984 87747 238 2 1 107 32 159 0.869 10.35 8.03 Intr - 88233 88068 166 1 1 95 105 -49 0.703 -2.97 8.02 Intr - 88504 88478 27 2 0 124 86 -15 0.573 0.47 8.01 Init - 88636 88585 52 1 1 105 110 159 0.999 19.09 8.00 Prom - 91422 91383 40 -5.91 9.09 PlyA - 91898 91893 6 1.05 9.08 Term - 99104 98996 109 1 1 76 47 82 0.176 1.18 9.07 Intr - 99425 99301 125 2 2 115 81 14 0.760 3.29 9.06 Intr - 100234 100079 156 1 0 106 13 250 0.539 20.02 9.05 Intr - 100927 100770 158 2 2 119 93 182 0.633 22.04 9.04 Intr - 102756 102479 278 2 2 125 64 110 0.512 10.00 9.03 Intr - 103149 102987 163 1 1 42 30 209 0.891 10.05 9.02 Intr - 103753 103727 27 2 0 124 105 26 0.988 6.47 9.01 Init - 105009 104928 82 0 1 96 98 128 0.999 13.69 9.00 Prom - 105112 105073 40 -4.11 10.00 Prom + 105946 105985 40 -3.01 10.01 Init + 107635 107785 151 0 1 94 42 82 0.561 4.38 10.02 Term + 108243 108331 89 1 2 2 47 133 0.426 -1.28 10.03 PlyA + 109637 109642 6 1.05 11.03 PlyA - 111079 111074 6 1.05 11.02 Term - 116791 116629 163 0 1 104 47 87 0.282 3.92 11.01 Init - 130152 130034 119 0 2 83 61 113 0.402 5.76 11.00 Prom - 133744 133705 40 -3.01 12.00 Prom + 139933 139972 40 1.69 12.01 Init + 143355 143676 322 2 1 77 94 287 0.968 25.71 12.02 Intr + 145095 145216 122 1 2 50 60 12 0.534 -4.68 12.03 Intr + 145681 145863 183 0 0 109 64 113 0.733 11.50 12.04 Term + 145954 145983 30 0 0 132 43 23 0.782 0.34 12.05 PlyA + 146240 146245 6 1.05 13.00 Prom + 150119 150158 40 -1.91 13.01 Init + 151853 152020 168 1 0 52 69 97 0.378 3.71 13.02 Intr + 158620 158712 93 0 0 44 75 115 0.342 6.46 13.03 Term + 160972 161037 66 0 0 85 28 87 0.449 0.63 13.04 PlyA + 163564 163569 6 1.05 14.03 PlyA - 166152 166147 6 1.05 14.02 Term - 169664 169563 102 2 0 14 40 100 0.730 -3.62 14.01 Init - 169771 169685 87 1 0 74 37 115 0.274 5.60 14.00 Prom - 169993 169954 40 -1.41 15.00 Prom + 170640 170679 40 -0.71 15.01 Sngl + 173872 174594 723 0 0 59 53 191 0.750 9.34 15.02 PlyA + 174710 174715 6 -3.24 16.00 Prom + 174724 174763 40 -2.31 16.01 Init + 176591 176915 325 1 1 77 94 362 0.996 33.20 16.02 Intr + 177315 177785 471 1 0 81 92 549 0.646 48.24 16.03 Intr + 180035 180487 453 0 0 122 88 693 0.999 66.51 16.04 Intr + 180621 180884 264 1 0 112 82 414 0.998 41.22 16.05 Intr + 181520 181572 53 0 2 94 84 37 0.826 3.02 16.06 Intr + 183578 183782 205 1 1 87 81 41 0.765 2.60 16.07 Intr + 184026 184289 264 1 0 88 96 519 0.968 50.72 16.08 Intr + 185089 185361 273 2 0 120 92 306 0.995 32.25 16.09 Intr + 195775 196132 358 2 1 74 53 154 0.476 5.37 16.10 Intr + 200608 200672 65 1 2 93 55 48 0.561 0.85 16.11 Term + 200785 200984 200 2 2 53 39 150 0.559 4.48 16.12 PlyA + 204815 204820 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 36004 35832 173 2 2 53 52 106 0.897 1.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_1|337_aa XSGGAPPGHGAMYNGIGLPTPRGSGTNGYVQRNLSLVRGRRGERPDYKGEEELRRLEAAL VKRPNPDILDHERKRRVELRCLELEEMMEEQGYEEQQIQEKVATFRLMLLEKDVNPGGKE ETPGQRPAVTETHQLAELNEKKNERLRAAFGISDSYVDGSSFDPQRRAREAKQPAPEPPK PYSLVRESSSSRSPTPKQKKKKKKKDRGRSESESKKRKHRSPTPKSKRKSKDKKRKRSRS TTPAPKSRRAHRSTSADSASSSDTSRSRSRSAAAKTHTTALAGRSPSPASGRRGEGDAPF SEPGTTSTQRPSSPETATKQPSSPYEDKDKDKKEVCS >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_1|1014_bp nngagcggtggtgccccccccgggcacggggccatgtacaacgggatcgggctgccgacg ccccggggcagcggcaccaacggctacgtccagcgcaacctgtccctggtgcggggccgc cggggtgagcggcctgactacaagggagaggaggaactgcggcgcctggaggctgccctg gtgaagcggcctaatcctgacatcctggaccacgagcgcaagcggcgcgtcgagctgcga tgcctcgagctggaggagatgatggaagagcaggggtacgaggaacagcaaattcaggaa aaagtggcgacctttcgactcatgttgctggagaaggatgtgaaccctgggggcaaggag gagaccccagggcagaggccagcggtcacggagactcaccagttggcagaattaaatgag aagaagaatgaaagactccgtgctgcctttggcatcagtgattcttacgtagatggcagc tcttttgatcctcagcgtcgtgcccgagaagctaaacaaccagctcctgagcctcccaaa ccttacagccttgttcgggagtctagcagttctcgctcaccaaccccaaagcagaagaag aagaaaaagaagaaagatagaggacggtcagaatctgagtccaagaaacgtaagcatagg tctcccactccaaagagcaaacgtaaatctaaggacaaaaagcgaaagcggtctcgaagt acaacaccagcccccaagagccgccgggcccaccgttcaacttctgctgactctgcttcc tcctccgatacttcccgcagtcggtctcgaagtgctgcagctaaaactcatacaactgcc ttggctgggcgaagtccttcccctgcttcagggcgacgcggggagggagatgcgcctttc agtgaaccaggtactaccagcacacaacggcctagtagcccggagactgctacgaaacag cctagcagcccttatgaagacaaagataaagacaagaaggaggtatgttcctga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_2|767_aa MGPPPTTTTGSRKGASAFQDTHLEERCLMGRGLGVSLLRLWLGERLSPRWPLREPRSLLL LQHRTDIRPQGLAGEGEEDDEEEEDEEEEEEEEEEEEEEEEEEEEEDEEEELELELLERL FRLAGVGSGGRCTGRLEPSESLELEPEEEEDDDDDDEEEEELEDDDDEEELRRSFAGCRA ANAEGCWAPVEAAGAVGVRGSARFTAEGATAVLGLAKFMAAAAAGVRESARFTAVGMAGV LALARFMAAAEAAGILLARLAAGVADRERSELLPDAERLRMAGGDLVKDRLPRAVRGVRD LERRRIASGERLRERLRGNSGVRDLERRRIAGGVRDLERRRVTGEVRERDLLRVTGDVRD RDLLRVTGDVLDLDLLRVIGEVLLRDRLLVTGGVLDLDLLRVTGGVLERERRRVVGVLDL ERRRVTGGVLDLDRLLLTGDALDRDLRRVTEVLDLDRLRLTGEVRDLDLRRLIRGVLDLD RRSGDDRELRRRGGVRDLVLGSGDDLEPLRAVLDLGGCSGEDSGRNQTYCAEGIVEICCQ PLEMIQTDSSAEREFESTVGYEEESDWNLLCSGDIPDFNIGLAENSPLSCLIGDLDASSV TGDEDLDRPPLDRDLPLLAGVLERERPLRLAGVLERERPRLAGVLERERPRLAGVLERDL PLLLAGDLLRDLRRTGDRVLDLRLAGVLDRDLPLRAGVLDRDLRLVGVLDRDLRLAGVLE RDLPRRAGVLERDLPLVAGDLEWDLPRLADLDLPLLWVFLLLDQPGR >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_2|2304_bp atggggccgccacccaccaccacaactggctcccggaagggggcatcagccttccaggac acgcacctggaggagcggtgtctcatgggccgagggctaggtgtctccctcctgcgcttg tggctgggggagcggctgtccccacgctggcctctccgggaaccccgctcactgctgctg ctgcaacacaggacagacatcaggccacagggcttagcaggggaaggggaggaagacgac gaggaggaggaagatgaggaggaagaagaagaagaggaagaggaggaggaagaagaggag gaggaggaggaggaggaagatgaagaggaggagctggaactggaactgctagagcgcctc ttccgtttggctggggttggctccggaggacgttgcacaggaaggctagagccctctgag tcactagaactggagccagaggaggaggaggacgacgacgacgatgatgaagaagaggag gagctagaggacgacgacgacgaggaggaactccgccgctcctttgctggctgcagggcg gctaatgcagaaggctgctgggccccagtagaggcagctggggctgtgggagtgcgaggg tcagccaggttcacagccgaaggtgccaccgctgttctggggctggccaagttcatggcc gctgctgcagctggcgttcgagagtcagccaggttcactgctgttggaatggcaggtgtc ctggcgctggctaggttcatggctgccgcagaggctgcaggaatcctgctggcaaggttg gctgctggagtagcagatcgtgaacgatcagaactacttccagatgcagaacgcctgcgg atggctggaggagatcttgttaaggaccgtttaccccgagctgttcgtggagtacgggat ctggagcggcggcggatagcaagtggtgagcgacttcgagaacgtttgcgtggtaacagt ggcgttcgagatctagagcggcgccgaatagctggaggtgtccgggacctagagcggcgg cgtgtcactggtgaggttcgagagcgggaccttcttcgagttactggagatgtgcgagat cgagatctccttcgggtgaccggagatgttctggatcttgatcttctgcgagtgataggc gaagttctgcttcgagatcgcctcctggttactggtggagtcctggatctggaccttctg cgagtcactggtggagttctagaacgggagcggcggcgtgttgttggcgttctagacctt gaacgacggcgggttactggtggcgttctggatctggatcgccttctgctcactggggat gctcttgaccgggatcttcgtcgagtcactgaagtcctggaccttgaccgtctccggctg actggtgaagttcgagatctggacctacgtcggcttatcaggggggttctggacctggat cgccgctccggagatgatcgagagctgcgacgtcgaggtggtgtacgagacttggtcttg ggctctggtgatgacctggaacctctgcgagcagttctggatttgggcggatgttcagga gaggactcggggaggaaccagacctactgcgccgaggggatagtcgagatttgctgtcaa cctctggagatgatccagaccgactcctctgccgaaagagaattcgagtccactgtagga tatgaagaagagtcagactggaacctgctctgctcaggagacattccagatttcaacata ggactcgctgagaactcacctctatcttgtcttattggagatctggatgccagctcagtg actggagatgaagacctggaccgacctcccttagaccgagacctccctctcctggctggt gttctggagcgtgagcgaccactgcgtctagctggggttctagagcgtgagcggccacgt ctggctggggttctggagcgtgagcggccacgtctagctggggttctagagcgtgacctg ccacttctcctggctggtgatctactacgagacctgcgtcgtactggtgatcgggtccta gatctgcgcctagcaggtgttctagaccgagacctgcccctccgggctggtgttctagac cgagacctacgcctggtgggagttctggatcgtgatctccgcctggcaggtgttctagag cgggacctgccccggcgggctggtgttctagaacgagatctacccctagtggctggggat ctagagtgggacctccctcgccttgctgacctagacctgcctcttctctgggtatttctg ctcctagaccagcctggtcgctga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_3|626_aa MRGVSCLQVLLLLVLACGQPRMSSRIVGGRDGRDGEWPWQASIQHRGAHVCGGSLIAPQW VLTAAHCFPRRALPAEYRVRLGALRLGSTSPRTLSVPVRRVLLPPDYSEDGARGDLALLQ LRRPVPLSARVQPVCLPVPGARPPPGTPCRVTGWGSLRPGEWRPLQGVRVPLLDSRTCDG LYHVGADVPQAERIVLPGSLCAGYPQGHKDACQGDSGGPLTCLQSGSWVLVGVVSWGKGC ALPNRPGVYTSVATYSPWIQARTHSRKALDSGSTLKELGKPGPPVPIDRGQFPQLQKEKK QNLENSKPTLELTGAISRREGKIREGGEQTMLQYSLATTGVVIRHQPKEFARAIPKPNSS PQGSSQVISPSERRQSSHPARHGGQKTPNAEPAQDGGHDYAATRRPRARCARAWSPPDRT NTGTSPLPVPRRRHRLQRPCPPSGWRPHAPRYRPLPDHLKWRRLRGGTGRGCAGGRDAAR GMLGHARAAGNGAERGCAGASRGEAAAAMDVFLMIRRHKTTIFTDAKESSTVFELKRIVE GILKRPPDEQRLYKDDQLLDDGKTLGECGFTSQTARPQAPATVGLAFRADDTFEALCIEP FSSPPELPDVMKPQDSGSSANEQAVQ >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_3|1881_bp atgagaggggtttcctgtctccaggtcctgctccttctggtgctggcctgcgggcagccc cgcatgtccagtcggatcgttgggggccgggatggccgggacggagagtggccgtggcag gcgagcatccagcatcgtggggcacacgtgtgcggggggtcgctcatcgccccccagtgg gtgctgacagcggcgcactgcttccccaggagggcactgccagctgagtaccgcgtgcgc ctgggggcgctgcgtctgggctccacctcgccccgcacgctctcggtgcccgtgcgacgg gtgctgctgcccccggactactccgaggacggggcccgcggcgacctggcactgctgcag ctgcgtcgcccggtgcccctgagcgctcgcgtccaacccgtctgcctgcccgtgcccggc gcccgcccgccgcccggcacaccatgccgggtcaccggctggggcagcctccgcccagga gagtggcgaccgctacaaggagtaagggtgccgctgctggactcgcgcacctgcgacggc ctctaccacgtgggcgcggacgtgccccaggctgagcgcattgtgctgcctgggagtctg tgtgccggctacccccagggccacaaggacgcctgccagggtgattctgggggacctctg acctgcctgcagtctgggagctgggtcctggtgggcgtggtgagctggggcaagggttgt gccctgcccaaccgtccaggggtctacaccagtgtggccacatatagcccctggattcag gctcgcacccacagccgcaaggcactggattctggcagcaccctgaaggagctgggaaag ccaggaccacccgttccaatagatagaggtcagtttccacagctgcagaaggaaaagaaa caaaacctggaaaattcaaaacctactctggagctgacaggggctatcagcagaagggaa gggaagataagggaaggtggagaacagaccatgctgcagtacagcttggccaccactggt gtggtcatcaggcaccagcccaaggagttcgcccgggccatccctaagcccaacagcagc ccccagggctcttcacaggtcataagcccctctgagcggcgacagtcctcgcatccagcc cggcacggcggccagaagaccccaaatgcggagcctgcccaagatggcggccacgactac gccgcgacaagacggccccgagctcggtgtgcccgagcttggagcccgccggaccgcacc aacacggggacctcccctctgccagtcccaagacggcgccaccgcctccaaaggccatgc ccgccgtcgggatggcggccgcacgcccctcgataccgcccgcttccagatcacttaaaa tggcggcggctgcggggcggcacggggcggggctgcgccgggggaagggacgcggcgcgg ggcatgctgggccacgcgcgggctgccgggaacggggcggagcgcggctgcgccggcgcg tcgaggggagaggcagcagccgcgatggacgtgttcctcatgatccggcgccacaagacc accatcttcacggacgccaaggagtccagcacggtgttcgaactgaagcgcatcgtcgag ggcatcctcaagcggcctcctgacgagcagcggctgtacaaggatgaccaactcttggat gatggcaagacactgggcgagtgtggcttcaccagtcaaacagcacggccacaggcccca gccacagtggggctggccttccgggcagatgacacctttgaggccctgtgcatcgagccg ttttccagcccgccagagctgcccgatgtgatgaagccccaggactcgggaagcagtgcc aatgaacaagccgtgcagtga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_4|280_aa MGARGALLLALLLARAGLGKPEACGHREIHALVAGGVESARGRWPWQASLRLRRRHRCGG SLLSRRWVLSAAHCFQKHYYPSEWTVQLGELTSRPTPWNLRAYSSRYKVQDIIVNPDALG VLRNDIALLRLASSVTYNAYIQPICIESSTFNFVHRPDCWVTGWGLISPSGTPLPPPYNL REAQVTILNNTRCNYLFEQPSSRSMIWDSMFCAGAEDGSVDTCKGDSGGPLVCDKDGLWY QVGIVSWGMDCGQPNRPGVYTNISVYFHWIRRNSHFVFEF >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_4|843_bp atgggcgcgcgcggggcgctgctgctggcgctgctgctggctcgggctggactcgggaag ccggaggcctgcggccaccgggaaattcacgcgctggtggcgggcggagtggagtccgcg cgcgggcgctggccatggcaggccagcctgcgcctgaggagacgccaccgatgtggaggg agcctgctcagccgccgctgggtgctctcggctgcgcactgcttccaaaagcactactat ccctccgagtggacggtccagctgggcgagctgacttccaggccaactccttggaacctg cgggcctacagcagtcgttacaaagtgcaggacatcattgtgaaccctgacgcacttggg gttttacgcaatgacattgccctgctgagactggcctcttctgtcacctacaatgcgtac atccagcccatttgcatcgagtcttccaccttcaacttcgtgcaccggccggactgctgg gtgaccggctgggggttaatcagccccagtggcacacctctgccacctccttacaacctc cgggaagcacaggtcaccatcttaaacaacaccaggtgtaattacctgtttgaacagccc tctagccgtagtatgatctgggattccatgttttgtgctggtgctgaggatggcagtgta gacacctgcaaaggtgactcaggtggacccttggtctgtgacaaggatggactgtggtat caggttggaatcgtgagctggggaatggactgcggtcaacccaatcggcctggtgtctac accaacatcagtgtgtacttccactggatccggaggaattctcactttgtctttgagttt tga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_5|312_aa MGARGALLLALLLARAGLRKPAVPLTIRGPCGRRVITSRIVGGEDAELGRWPWQGSLRLW DSHVCGVSLLSHRWALTAAHCFETYSDLSDPSGWMVQFGQLTSMPSFWSLQAYYTRYFVS NIYLSPRYLGNSPYDIALVKLSAPVTYTKHIQPICLQASTFEFENRTDCWVTGWGYIKED EALPSPHTLQEVQVAIINNSMCNHLFLKYSFRKDIFGDMVCAGNAQGGKDACFGDSGGPL ACNKNGLWYQIGVVSWGVGCGRPNRPGVYTNISHHFEWIQKLMAQSGMSQPDPSWPLLFF PLLWALPLLGPV >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_5|939_bp atgggcgcgcgcggggcgctgctgctggcgctgctgctggctcgggctggactcaggaag ccggcagttcctctgaccatccgaggaccatgcggccgacgggtcatcacgtcgcgcatc gtgggtggagaggacgccgaactcgggcgttggccgtggcaggggagcctgcgcctgtgg gattcccacgtatgcggagtgagcctgctcagccaccgctgggcactcacggcggcgcac tgctttgaaacctatagtgaccttagtgatccctccgggtggatggtccagtttggccag ctgacttccatgccatccttctggagcctgcaggcctactacacccgttacttcgtatcg aatatctatctgagccctcgctacctggggaattcaccctatgacattgccttggtgaag ctgtctgcacctgtcacctacactaaacacatccagcccatctgtctccaggcctccaca tttgagtttgagaaccggacagactgctgggtgactggctgggggtacatcaaagaggat gaggcactgccatctccccacaccctccaggaagttcaggtcgccatcataaacaactct atgtgcaaccacctcttcctcaagtacagtttccgcaaggacatctttggagacatggtt tgtgctggcaatgcccaaggcgggaaggatgcctgcttcggtgactcaggtggacccttg gcctgtaacaagaatggactgtggtatcagattggagtcgtgagctggggagtgggctgt ggtcggcccaatcggcccggtgtctacaccaatatcagccaccactttgagtggatccag aagctgatggcccagagtggcatgtcccagccagacccctcctggccgctactctttttc cctcttctctgggctctcccactcctggggccggtctga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_6|116_aa MDHPALCVLCPVRQVHQALALKALPLEAARVSPLCCIWVEQPLLSGRPVQADACGQNAIT LRRQLRKARVRRCLIPRTLKDPGHWEGTGVCRRVGVSDKDQGRRDKGRQMPKRGGN >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_6|351_bp atggatcacccagccctttgtgtcctgtgccctgtgcggcaggtgcaccaggcactggcc ctcaaggcactgcccttagaggctgcccgggtctctcctctgtgctgtatctgggtggag cagcccctcttgtcagggaggcctgtgcaggcggatgcctgtgggcagaacgccatcacc ctcagacgccagctgagaaaagctcgtgttcggagatgtctcatcccaaggaccctgaag gacccagggcactgggaaggaactggagtctgcagaagagttggggtgtcggataaggat caggggagaagagataaaggccgtcagatgcccaagcgaggcgggaactag >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_7|172_aa MLLLLTLALLGGPTWAGKMYGPGGGKYFSTTEDYDHEITGLRVSVGLLLVKSVQVKLGDS WDVKLGALGGNTQEVTLQPGEYITKVFVAFQAFLRGMVMYTSKDRYFYFGKLDGQISSAY PSQEGQVLVGIYGQYQLLGIKSIGFEWNYPLEEPTTEPPVNLTYSANSPVGR >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_7|519_bp atgctgctgctgctcacgcttgccctcctggggggccccacctgggcagggaagatgtat ggccctggaggaggcaagtatttcagcaccactgaagactacgaccatgaaatcacaggg ctgcgggtgtctgtaggtcttctcctggtgaaaagtgtccaggtgaaacttggagactcc tgggacgtgaaactgggagccttaggtgggaatacccaggaagtcaccctgcagccaggc gaatacatcacaaaagtctttgtcgccttccaagctttcctccggggtatggtcatgtac accagcaaggaccgctatttctattttgggaagcttgatggccagatctcctctgcctac cccagccaagaggggcaggtgctggtgggcatctatggccagtatcaactccttggcatc aagagcattggctttgaatggaattatccactagaggagccgaccactgagccaccagtt aatctcacatactcagcaaactcacccgtgggtcgctag >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_8|310_aa MGLRAGPILLLLLWLLPGAHWDVLPSECGHSKEAGRIVGGQDTQEGRWPWQVGLWLTSVG HVCGGSLIHPRWVLTAAHCFLRSEDPGLYHVKVGGLTPSLSEPHSALVAVRRLLVHSSYH GTTTSGDIALMELDSPLQASQFSPICLPGPQTPLAIGTVCWEVAVPLLDSNMCELMYHLG EPSLAGQRLIQDDMLCAGSVQGKKDSCQGDSGGPLVCPINDTWIQAGIVSWGFGCARPFR PGVYTQVLSYTDWIQRTLAESHSGMSGARPEPPTFPGSGQHERSLTPTGRGSLLSLFSLR GIRLLPTTPG >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_8|933_bp atggggcttcgggcaggccccatcctgcttctgctgctgtggctgctgccaggggcccat tgggatgtgctgccttcagaatgcggccactccaaggaggccgggaggattgtgggaggc caagacacccaggaaggacgctggccgtggcaggttggcctgtggttgacctcagtgggg catgtatgtgggggctccctcatccacccacgctgggtgctcacagccgcccactgcttc ctgaggtctgaggatcccgggctctaccatgttaaagtcggagggctgacaccctcactt tcagagccccactcggccttggtggctgtgaggaggctcctggtccactcctcataccat gggaccaccaccagcggggacattgccctgatggagctggactcccccttgcaggcctcc cagttcagccccatctgcctcccaggaccccagacccccctcgccattgggaccgtgtgc tgggaggtggctgtgcccctcctggactcgaacatgtgtgagctgatgtaccacctagga gagcccagcctggctggccagcgcctcatccaggacgacatgctctgtgctggctctgtc cagggcaagaaagactcctgccagggtgactccggggggccgctggtctgccccatcaat gatacgtggatccaggccggcattgtgagctggggattcggctgtgcccggcctttccgg cctggtgtctacacccaggtgctaagctacacagactggattcagagaaccctggctgaa tctcactcaggcatgtctggggcccgcccagaaccccccacatttcctggcagtggacag cacgaacgctccttgacccccacgggcaggggaagcctgctgagcctcttttctctccga ggcatccgacttctgccaacgaccccaggttaa >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_9|365_aa MVVSGAPPALGGGCLGTFTSLLLLASTAILNAARIPACGKPQQLNRVVGGEDSTDSEWPW IVSIQKNGTHHCAGSLLTSRWVITAAHCFKDNLNKPYLFSVLLGAWQLGNPGSRSQKVGV AWVEPHPVYSWKEGACADIALVRLERSIQFSERVLPICLPDASIHLPPNTHCWISGWGSI QDGVPLPHPQTLQKLKVPIIDSEVCSHLYWRGAGQGPITEDMLCAGYLEGERDACLGDSG GPLMCQVDGAWLLAGIISWGEGCAERNRPGVYISLSAHRSWVEKIVQGGRAADGEGAESR GLEDPALVLQEGEERGWDSETVRWIPKQGWSRFVERLTAKALNSPVWKKQRTVAGQASGS AGIRG >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_9|1098_bp atggtggtttctggagcgcccccagccctgggtgggggctgtctcggcaccttcacctcc ctgctgctgctggcgtcgacagccatcctcaatgcggccaggatacctgcctgtgggaag ccccagcagctgaaccgggttgtgggcggcgaggacagcactgacagcgagtggccctgg atcgtgagcatccagaagaatgggacccaccactgcgcaggttctctgctcaccagccgc tgggtgatcactgctgcccactgtttcaaggacaacctgaacaaaccatacctgttctct gtgctgctgggggcctggcagctggggaaccctggctctcggtcccagaaggtgggtgtt gcctgggtggagccccaccctgtgtattcctggaaggaaggtgcctgtgcagacattgcc ctggtgcgtctcgagcgctccatacagttctcagagcgggtcctgcccatctgcctacct gatgcctctatccacctccctccaaacacccactgctggatctcaggctgggggagcatc caagatggagttcccttgccccaccctcagaccctgcagaagctgaaggttcctatcatc gactcggaagtctgcagccatctgtactggcggggagcaggacagggacccatcactgag gacatgctgtgtgccggctacttggagggggagcgggatgcttgtctgggcgactccggg ggccccctcatgtgccaggtggacggcgcctggctgctggccggcatcatcagctggggc gagggctgtgccgagcgcaacaggcccggggtctacatcagcctctctgcgcaccgctcc tgggtggagaagatcgtgcaagggggcagggcagccgatggagaaggggcagagtctaga ggcctggaagacccagctctggtgcttcaggaaggggaggagaggggttgggactccgaa actgtgcgttggattccaaaacagggctggagccgatttgtcgagcgactcactgccaaa gctctaaactccccagtttggaagaagcagcggacagtggcagggcaggcctctggctct gcgggcatcaggggttga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_10|79_aa MPLSLRQGGQEESPEEKDLVGAGGSPEELGEDFLAQVLGKGPRQQFLNSQAFAEHRLYAG AVRSAEDTAMSKPKAMDIG >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_10|240_bp atgccgctatccctgagacagggtggccaggaagagagccccgaggagaaggacctggta ggagcagggggaagccctgaggagctgggtgaagatttcctggcacaagtcctggggaag ggccccagacagcagttcctcaattcccaggcatttgcagagcaccgcctgtatgccggc gctgtccgaagtgctgaagatacagcaatgagcaaaccaaaagccatggacatcggatga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_11|93_aa MRSKAAGALSSGWGRSSASCPCGVELASGQSEGGGGAKIQPNTRSVWNKVLVSLLLVVPL LNLFHKQMADSHPKAKLILSPPTQNPSLADFRF >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_11|282_bp atgcgcagcaaggccgcgggtgccctgtcttctggctggggccgctcctccgcgtcttgc ccgtgcggcgtagagctggcgtccggccaatcggagggcggaggcggggccaaaattcag cctaacacacggtcagtgtggaacaaagtcttggtctccctgctgctggttgttcctctc ctcaacctgttccacaaacagatggctgattctcatcctaaagcaaagttgattttatca cccccaactcaaaatccttccctggctgactttcgcttttga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_12|218_aa MPLPEPSEQEGESVKASQEPSPKPGTEVIPAAPRKPRKFSKLVLLTASKDSTKVAGAKRK GVHCVMSLGVPGPATLAKALLQTHPEAQRAIEAAPQEPEQKRSRQDPATGLELSRNLMDP SASIPLLESQAWCEQCSGTQRDPVLPEPVQQRRWLRLPSAQHKLGLKFPGNDFSSFCGPG LVALGAASLSGSHFSVRQRLLALEHPLVGAQTEQKTVD >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_12|657_bp atgcccctgcccgagcccagcgagcaggagggtgagagtgtgaaggccagccaggagcca tcccccaagccaggcacagaagtcatcccggcagcccccaggaagcccagaaagttctcc aaactggtcctgctcacagcctccaaagacagcaccaaggtggcgggggccaagcgcaag ggtgtgcactgtgtcatgtccctgggggtgcccggccccgccacccttgccaaggccctc ctccagacccaccccgaggcccagcgggccattgaggcagcccctcaggagcctgagcag aaacggagcaggcaggacccagccacaggcttggagctttcaaggaacctaatggacccc tcggcctccatccctctgctggagtctcaggcctggtgtgagcagtgctctgggactcag agggaccccgtcctcccagagccggtccagcagaggcgatggctgcggcttcccagtgct cagcacaaacttggcctcaagtttcctggaaatgactttagcagcttttgtggccctggc ctggtggccttgggagcagcgtccctgagtggcagtcacttctcagtcaggcagaggctt ctggccctggagcatccactggttggggcacagacagaacagaagacagtggattag >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_13|108_aa MWTDSEGRGPREDAGGCRTQEVRDAGGRGVVREPRAPRHGPARPVAPPAAALEREEPLYA KSVLFSSPILAPPLDAAVQSRDGPAFSDPYCTWMLNTLFLSNGRQFME >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_13|327_bp atgtggacggactcggaaggaaggggtcccagagaggacgcgggcggctgcaggacacaa gaggtgcgggacgcaggaggccgtggggtcgtgcgggagcccagggcacctcgacatgga cccgccagacctgtggctcctccagcggccgctttagagcgagaggagcccctttacgcc aagagcgtcctgttcagctcccccatcctagctccgcccctggacgccgccgtccaatct cgggatggccccgccttttccgacccctactgcacctggatgctgaacaccttgttcttg tctaatggtcgacagttcatggaataa >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_14|62_aa MKKTEDNTLVFTVDVKANKHQLKQAVKNSVNTLIGPDGEKKAHVPLAPDQDALGVANKME IM >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_14|189_bp atgaagaagacagaagacaacacacttgtcttcactgtggatgttaaagccaacaagcac cagctcaaacaggctgtgaagaactctgtcaataccctgattgggcctgatggagagaag aaggcacatgttccactggctcctgatcaggatgctttgggtgtggccaacaaaatggag atcatgtaa >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_15|240_aa MPIQIQYPQYQPVENKTQPPLAYQYWLPAELQYWLPPEVQYRPQVVCAMPNCTASYKQPM AVVFNTSAPQGAALCPQPPTMRLNPTAPPSGQGSTLHAIIDEARKQGDLEAWQFLVILQS VPAGEGAPAGAPAVANARYERFTMKMLKDMKEGVKQYGPNSPSMRTLLDSIAPAGVDVVT EYVKACNGIGGAMHKAILMAQAMTGVALGGQVRTFGGKCYNCGQIGRLKKNSPAGSVSRS >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_15|723_bp atgccaatccagatacagtatccacaatatcagccagtagaaaataaaacccaaccgcca ttagcttatcaatactggctgccagccgagcttcagtattggctgcctccagaggtccaa tacagacctcaagtggtgtgtgccatgccaaattgcacggcatcgtacaagcaacccatg gcggtggtgtttaatacgtcagcaccacagggcgcggcgctgtgtcctcagccgcccact atgagacttaatccaacagcaccacctagtggacaaggtagcacactgcacgcgatcatt gatgaagctagaaaacagggagatcttgaggcgtggcagttcctggtaattttacaatcg gtaccggccggggaaggggctccagcaggagcacctgcggtggctaatgctagatatgaa cgcttcaccatgaaaatgttaaaagacatgaaggaaggagttaaacaatatggacccaac tcaccttctatgagaacattattagattccattgctccagcaggagttgatgtagttaca gaatatgtgaaggcttgtaatgggattggaggagccatgcataaagctatcctaatggct caagcaatgactggggttgctttaggaggacaagttagaacatttggggggaaatgttat aattgtggccaaattggtcgtctaaaaaagaattccccggctggctcagtcagtagatca tga >gi568815582r:2753096_2958104|GENSCAN_predicted_peptide_16|976_aa MPLPEPSEQEGESVKAGQEPSPKPGTDVIPAAPRKPREFSKLVLLTASDQDEDGVGSKPQ EVHCVLSLEMAGPATLASTLQILPVEEQGGVVQPALEMPEQKCSKLDAAAPQSLEFLRTP FGGRLLVLESFLYKQEKAVGDKVYWKCRQHAELGCRGRAITRGLRATVMRGHCHAPDEQG LEARRQREKLPSLALPEGLGEPQGPEGPGGRVEEPLEGVGPWQCPEEPEPTPGLVLSKPA LEEEEAPRALSLLSLPPKKRSILGLGQARPLEFLRTCYGGSFLVHESFLYKREKAVGDKV YWTCRDHALHGCRSRAITQGQRVTVMRGHCHQPDMEGLEARRQQEKAVETLQAGQDGPGS QVDTLLRGVDSLLYRRGPGPLTLTRPRPRKRAKVEDQELPTQPEAPDEHQDMDADPGGPE FLKTPLGGSFLVYESFLYRREKAAGEKVYWTCRDQARMGCRSRAITQGRRVTVMRGHCHP PDLGGLEALRQREKRPNTAQRGSPGAGLSFQWLFRILQLLGHSPGVTPLPARLSGATPLS PIRLLSSFVPRGPRVIPLTNQARRTLHASLGGCWGRTSDAADTASLKAWQGGPEFLKTPL GGSFLVYESFLYRREKAAGEKVYWTCRDQARMGCRSRAITQGRRVMVMRRHCHPPDLGGL EALRQREHFPNLAQWDSPDPLRPLEFLRTSLGGRFLVHESFLYRKEKAAGEKVYWMCRDQ ARLGCRSRAITQGHRIMVMRSHCHQPDLAGLEALRQRERLPTTAQQEDPGGLPGGILGWC PHVGGEQCSELALADAAVVGQGGGAFLTFGRHDKAAWTRTPLLYGHSTPGFAEHAAFLGL STWRGPALLSTVQSECCCLGAWARLGKFLLQTELPRIGQVNPHQSLLDRCDYLVQGPEAR RTQGGSQETEEHIRLPRKEANHPLPHTEAPEEWAQLEALNRSRGHFVHRAYQAPTDEEGV GPERAITAAFGGPQPA >gi568815582r:2753096_2958104|GENSCAN_predicted_CDS_16|2931_bp atgcccctgcccgagcccagcgagcaggagggcgagagtgtgaaggccggccaggagcca tcccccaagccaggcacggacgtcatcccggcagcccccaggaagcccagggagttctcc aaactggtgctgctcacagcctccgaccaagatgaggatggggtgggatccaagccccag gaagtgcactgcgtcctgtccctggagatggctggccccgccaccctcgccagcaccttg cagatcctgccagttgaggagcagggaggggtggtccagccagccctagagatgcctgaa cagaagtgcagcaagctggatgcagcagcccctcagtccctggagttcctgaggacacca ttcgggggccgcctcctggtgctggagtccttcctgtacaagcaggagaaggcagtgggg gacaaggtgtactggaagtgccgccaacatgctgagctgggctgccggggccgggccatc acccgaggcctgcgggccacagtgatgcggggccactgccacgcgcccgatgagcaaggc ctggaggcccggcgccagagggagaaactgcccagcctggccctgccagagggcttggga gagccccagggtcctgagggccctggaggccgagtggaggagcccctggagggggtgggc ccgtggcagtgccctgaggagcccgagcccactcctgggctggtgctgagcaagccggcc ctggaggaggaggaggcaccccgagccctgtcactgctgagcctgccgcccaagaagcgc tcgatcctggggctgggacaggcccggcccctcgagttcctgaggacgtgctacgggggc agcttcctggtacacgagtcgttcctctacaagcgggagaaggctgtcggggacaaggtg tattggacctgccgggaccacgcgctgcacggctgccggagccgggccatcacccaggga cagcgggtgactgtgatgcgtgggcactgccaccagcccgatatggagggcctggaagcc cggcggcagcaggagaaggccgtggagacgctgcaggctgggcaggacggccctgggagc caagtggacacgctgctccgaggcgtggatagtttgctctaccgcaggggtccgggtccc ctgactctcaccaggcctcggcccagaaagcgagcaaaggtcgaagaccaggagctgcca acccagcccgaggccccagacgagcaccaggacatggacgcagacccgggaggccctgag ttcctgaagacgcccctggggggcagcttcctggtgtacgagtccttcctctaccggcgg gagaaggcggctggggagaaggtgtattggacctgccgggaccaggcccgcatgggctgc cgcagccgcgccatcacccagggccgacgggtgactgtcatgcgtggtcactgccacccg cccgacctgggaggcctggaggccctgaggcagcgggagaaacgccccaacacggcgcag cgggggagcccaggcgctggcctctctttccagtggctcttccggatcctgcagcttttg ggtcatagtcctggggtcacacctctacctgcgcggctctcgggggccacccctctcagc cccatccggctcctgagcagctttgtccccagagggccccgggtcattcctctgaccaac caggcacggcgcaccctgcatgcaagcctgggcgggtgctggggacggacgagtgacgca gcagacactgcctccctgaaggcatggcaaggaggccccgagttcctgaagacgcccctg gggggcagcttcctggtgtacgagtccttcctctaccggcgggagaaggcggccggggag aaggtgtattggacctgccgggaccaggcccgcatgggctgccgcagccgcgccatcacc cagggccggcgggtcatggtcatgcgcaggcactgccacccaccggacctgggcggcctg gaggccctgcggcagcgggagcacttccccaacctggcgcagtgggacagcccagatcct ctccggcccctggagttcctgaggacttccctggggggcaggttcctggtgcacgagtcc ttcctctacaggaaggagaaggcggctggggagaaggtgtactggatgtgccgggaccag gctcggctgggctgccgcagccgcgccataacccagggccaccgcatcatggtcatgcgc agccactgccatcagcctgacctggcaggcctggaggccttgaggcaacgggagcggctc cccaccacggcccagcaggaggacccaggaggtctcccaggaggaattcttggatggtgt cctcatgtcggcggagaacagtgctcagagctggcgcttgcagacgcagctgtcgtgggg cagggcggtggcgccttcctgacctttggaagacatgacaaagctgcctggacacggacg cccctgctgtacggccacagcacccctgggtttgcagagcacgcagccttcctagggctt tccacctggcgaggccccgctctgctcagcacggtgcaaagtgaatgctgctgtcttgga gcctgggcacgtttggggaagttcctgcttcaaactgagctgccccgcataggccaggtc aacccacaccaatctcttctggacaggtgcgactacctggttcaaggaccagaagccaga cgtacccaagggggttctcaggagacagaggaacacatccggctgccgaggaaggaggcc aatcacccccttccccatactgaagccccagaggaatgggcccagctggaggcacttaac cgtagccggggccattttgtgcacagggcctaccaggcccccactgacgaggagggtgtt ggacctgagcgggctatcactgcagcctttgggggtccacaacctgcctaa