GENSCAN 1.0    Date run: 3-Nov-116    Time: 22:57:04

Sequence gi568815585f:110207904_110612188 : 404285 bp : 47.06% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 Intr -    987    946   42  0  0   76  113    27 0.217   2.44
 1.13 Intr -   2139   2077   63  0  0   91  100    50 0.254   5.41
 1.12 Intr -   2309   2226   84  2  0  101   74    11 0.209   1.02
 1.11 Intr -   3300   3162  139  2  1   67   26    59 0.170  -1.93
 1.10 Intr -   4576   4514   63  0  0  101   94    57 0.703   5.43
 1.09 Intr -   4715   4671   45  1  0  142   52    38 0.649   3.12
 1.08 Intr -   5923   5879   45  0  0   90   70    58 0.743   1.72
 1.07 Intr -   6112   6023   90  0  0  114  121    66 0.972  11.61
 1.06 Intr -   8081   7902  180  1  0   51   54   124 0.601   4.28
 1.05 Intr -  19854  19823   32  0  2  111   47    30 0.040  -1.87
 1.04 Intr -  20291  20190  102  2  0   30   67    82 0.049   0.67
 1.03 Intr -  34831  34772   60  1  0  103  113    47 0.944   7.63
 1.02 Intr -  36834  36740   95  1  2    3   91   128 0.867   4.28
 1.01 Init -  36935  36866   70  2  1   31   50    90 0.462   0.71
 1.00 Prom -  45349  45310   40                              -5.26

 2.00 Prom +  45965  46004   40                              -6.16
 2.01 Sngl +  48406  48720  315  0  0   52   43   299 0.675  17.75
 2.02 PlyA +  49152  49157    6                               1.05

 3.00 Prom +  49170  49209   40                              -6.16
 3.01 Init +  49355  49457  103  1  1   74   56    48 0.024   0.61
 3.02 Intr +  54863  55037  175  0  1   73   54    68 0.023   0.90
 3.03 Intr +  57274  57437  164  1  2   67   40    77 0.014   0.42
 3.04 Intr +  58307  58572  266  0  2   28   78   136 0.020   3.83
 3.05 Intr +  59539  59657  119  0  2   63   74    67 0.014   2.06
 3.06 Term +  62762  62888  127  2  1   56   42   102 0.012   0.16
 3.07 PlyA +  63178  63183    6                               1.05

 4.00 Prom +  63645  63684   40                              -2.16
 4.01 Init +  76001  76151  151  1  1   65   43    93 0.263   2.75
 4.02 Intr +  81565  81635   71  2  2   90   57    58 0.421   1.80
 4.03 Intr +  81957  82040   84  2  0  105   94    98 0.890  12.12
 4.04 Intr +  90514  90634  121  0  1   80  111   -16 0.031  -0.03
 4.05 Term +  90725  90747   23  0  2   96   48    34 0.052  -1.33
 4.06 PlyA +  92058  92063    6                               1.05

 5.00 Prom +  96070  96109   40                              -3.26
 5.01 Init +  98864  99229  366  1  0   43   49   198 0.016   8.43
 5.02 Intr +  99480  99625  146  2  2   46   78   117 0.023   5.58
 5.03 Intr +  99957 100044   88  0  1  113  103    27 0.018   6.67
 5.04 Intr + 100166 100220   55  1  1  124  115    44 0.018   9.25
 5.05 Intr + 112083 112171   89  1  2   29   79    81 0.178   0.99
 5.06 Intr + 116334 116438  105  2  0   49   56   153 0.163   8.61
 5.07 Term + 125252 125383  132  1  0   94   50   111 0.169   5.99
 5.08 PlyA + 131719 131724    6                               1.05

 6.00 Prom + 132517 132556   40                              -6.06
 6.01 Init + 136647 136683   37  2  1   93  110    10 0.812   3.87
 6.02 Intr + 137115 137232  118  1  1   72  113    33 0.770   3.72
 6.03 Intr + 138030 138209  180  0  0   -8   53   149 0.162   0.78
 6.04 Intr + 148675 148828  154  1  1   72   37    54 0.028  -1.23
 6.05 Intr + 149569 149649   81  0  0   92   86    61 0.547   6.13
 6.06 Term + 155100 155336  237  2  0   69   54   146 0.379   5.47
 6.07 PlyA + 155870 155875    6                               1.05

 7.07 PlyA - 156632 156627    6                               1.05
 7.06 Term - 160030 159987   44  2  2  125   38     8 0.018  -3.18
 7.05 Intr - 171060 170896  165  1  0  101   43    70 0.545   3.73
 7.04 Intr - 171322 171171  152  0  2   48   56   127 0.425   5.31
 7.03 Intr - 179680 179643   38  1  2   99   64    31 0.124  -1.14
 7.02 Intr - 181249 181118  132  1  0   25   72    78 0.193   0.74
 7.01 Init - 183514 183455   60  1  0   70   82    70 0.656   5.85
 7.00 Prom - 185345 185306   40                              -6.36

 8.00 Prom + 194004 194043   40                              -3.66
 8.01 Init + 200660 200826  167  1  2   87   42   135 0.165   7.91
 8.02 Intr + 203100 203331  232  0  1  107   37    67 0.106   1.28
 8.03 Intr + 216831 216965  135  2  0   79   64   104 0.959   7.86
 8.04 Intr + 217050 217094   45  2  0   90   95    50 0.941   4.51
 8.05 Intr + 220564 220680  117  0  0  111   86    51 0.994   7.86
 8.06 Intr + 222498 222533   36  2  0   88  111     3 0.661   1.06
 8.07 Intr + 224422 224457   36  0  0   62  108    55 0.263   3.36
 8.08 Intr + 226498 226539   42  0  0   74   98    22 0.137   0.24
 8.09 Intr + 228366 228464   99  2  0   38   98   111 0.175   7.41
 8.10 Intr + 230715 230765   51  2  0  130   59    28 0.898   3.20
 8.11 Intr + 231886 231930   45  0  0  102  115     7 0.902   3.41
 8.12 Intr + 237926 237979   54  1  0   54  105    52 0.733   2.68
 8.13 Intr + 238895 238961   67  1  1  118   93    -6 0.468   1.38
 8.14 Intr + 241776 241886  111  1  0  113   80    23 0.241   4.35
 8.15 Intr + 242402 242551  150  0  0   66  101   189 0.584  18.13
 8.16 Term + 248181 248281  101  1  2   99   40    57 0.087   0.29
 8.17 PlyA + 248569 248574    6                              -0.45

 9.04 PlyA - 248637 248632    6                              -1.95
 9.03 Term - 249253 248871  383  2  2   63   54   537 0.634  42.80
 9.02 Intr - 249453 249265  189  1  0  103  -17   219 0.610  12.46
 9.01 Init - 250254 250188   67  0  1   66  103    29 0.815   3.46
 9.00 Prom - 251109 251070   40                              -5.56

10.00 Prom + 252183 252222   40                              -6.26
10.01 Init + 252333 252369   37  2  1   89   67    21 0.176   0.38
10.02 Intr + 254375 254481  107  0  2   91   86    82 0.883   8.23
10.03 Intr + 257502 257703  202  2  1   22   52   174 0.973   6.06
10.04 Intr + 258100 258159   60  2  0  124   78    50 0.873   6.31
10.05 Intr + 259137 259193   57  1  0  123   95     8 0.926   3.86
10.06 Intr + 260971 261020   50  2  2   92   54    11 0.439  -3.50
10.07 Intr + 261314 261421  108  1  0   55  109    82 0.619   7.48
10.08 Intr + 265035 265247  213  2  0   12  105   191 0.519  12.11
10.09 Intr + 267335 267458  124  1  1   -7   52    76 0.237  -5.34
10.10 Intr + 270100 270261  162  2  0   98  116    -7 0.225   3.05
10.11 Intr + 272317 272487  171  2  0  125   86    47 0.957   8.11
10.12 Intr + 277002 277124  123  1  0  100  100   112 0.986  14.16
10.13 Intr + 277752 277933  182  1  2   54  100    82 0.955   5.59
10.14 Intr + 281542 281605   64  0  1   70  109    36 0.986   2.19
10.15 Intr + 281808 281882   75  1  0   80  101    45 0.929   4.49
10.16 Intr + 283330 283437  108  2  0   62   98    57 0.914   4.36
10.17 Intr + 284167 284274  108  2  0  100  110    26 0.939   6.26
10.18 Intr + 285308 285379   72  0  0   57   78    62 0.485   1.48
10.19 Intr + 287439 287564  126  1  0   91   94    51 0.390   6.65
10.20 Intr + 293765 293881  117  0  0   62  121    30 0.941   4.14
10.21 Intr + 295218 295379  162  1  0   90   87    59 0.631   5.95
10.22 Intr + 295480 295578   99  2  0  111  119    57 0.999  11.18
10.23 Intr + 295944 296090  147  1  0  130   80    88 0.999  12.41
10.24 Intr + 296245 296361  117  2  0   92  113    71 0.998  10.44
10.25 Intr + 298512 298703  192  1  0  102   69   269 0.996  25.86
10.26 Intr + 298741 298803   63  2  0   86  119    41 0.981   5.69
10.27 Intr + 300032 300318  287  0  2  126   80   356 0.937  35.66
10.28 Term + 304031 304288  258  1  0  108   52   455 0.887  39.45
10.29 PlyA + 305096 305101    6                               1.05

11.06 PlyA - 305901 305896    6                               1.05
11.05 Term - 307394 307338   57  2  0  109   40    92 0.301   4.19
11.04 Intr - 315434 315327  108  2  0  100   74    15 0.249   1.78
11.03 Intr - 316294 315867  428  2  2  125    1   724 0.432  60.91
11.02 Intr - 317900 317719  182  1  2   79  -17    79 0.281  -3.99
11.01 Init - 318908 318760  149  2  2   75   54    97 0.307   4.58
11.00 Prom - 323841 323802   40                              -7.06

12.00 Prom + 326825 326864   40                              -6.06
12.01 Init + 327371 327441   71  1  2   85   92    49 0.926   4.57
12.02 Intr + 328159 328330  172  1  1   79   40   103 0.927   4.55
12.03 Intr + 329515 329676  162  0  0   53   39   155 0.452   7.27
12.04 Term + 330958 331080  123  0  0   69   48    38 0.257  -3.72
12.05 PlyA + 331375 331380    6                               1.05

13.00 Prom + 332283 332322   40                              -5.66
13.01 Init + 332982 333199  218  2  2   54  100   254 0.986  19.37
13.02 Term + 333697 333871  175  1  1  107   37    48 0.850  -1.17
13.03 PlyA + 336664 336669    6                               1.05

14.05 PlyA - 337238 337233    6                               1.05
14.04 Term - 347770 347552  219  1  0   77   52   173 0.939   9.64
14.03 Intr - 349857 349744  114  0  0   78   70    62 0.578   4.04
14.02 Intr - 350230 349983  248  2  2   58  -10   117 0.167  -3.92
14.01 Init - 353616 353445  172  0  1   84   94   405 0.992  40.30
14.00 Prom - 358520 358481   40                              -4.46

15.04 PlyA - 361106 361101    6                               1.05
15.03 Term - 363754 363662   93  1  0  126   42    32 0.216   0.23
15.02 Intr - 367204 367128   77  2  2   57   36    88 0.155  -0.27
15.01 Init - 374554 374428  127  1  1   81   49    85 0.475   4.47
15.00 Prom - 377028 376989   40                              -3.66

16.03 PlyA - 377262 377257    6                               1.05
16.02 Term - 396342 396156  187  2  1   40   45   141 0.371   1.96
16.01 Init - 396888 396806   83  0  2   69   89    91 0.620   7.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  73939  74105  167  2  2  120   49    84 0.938   5.78
S.002 Init -  99124  99041   84  1  0  121  113   186 0.970  23.22
S.003 Init + 100001 100044   44  1  2  107  103    89 0.975  10.20

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_1|370_aa
MGFTIGVVLDEVQLTSGSMLREPALRVYRGDTEVNNEKRDKPDNRDMQSAIRRSQGGCAG
SGCGKCDCHGVKGQKRGVSSGSMWVILLQKHVAWQVLRDGPAQLRFLSPNVLILPVENQG
LLTYPFNETQLYGGVAAESQVHNVLPAFVKMEQAVNSPVKCQDWLSGLERRTTLDVYVMW
VKEASRGYKVSLGFLECKDLRGHRDHQDKRVILENQDYLEQKGQEDLREHLATLETQDFP
EFLAKTARQAPQVFQDAMAQRAGTSGIWMLGSALSKGGLQVSAAAVEITAYLDEGESPCA
RSCDSEHGDPGEILGHVPGMLLKGERGFPGIPGTPGPPGLPGLQGPVGPPGFTGPPGQMG
LSFQGPKGDK
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_1|1110_bp
atgggttttacaattggtgtcgtcttagatgaagtacagctcacatcaggctcaatgcta
cgggagcctgcactcagggtgtaccggggagacacagaagtaaacaacgagaaacgtgat
aagcctgacaatagagacatgcagtcagcaattcgcaggagccagggtggctgtgctggc
tctggctgtggcaaatgtgactgccatggagtgaagggacaaaagcggggtgtcagctct
ggaagcatgtgggtgattttactgcagaagcacgtggcttggcaggtgctccgcgatggt
cccgcccagctgcgcttcctgagtcctaacgtgctgattctccctgtggagaaccaagga
ctcctgacatatcctttcaacgaaacacagctctatggaggtgtggcggctgagtcccag
gttcataacgtccttcctgcgtttgtaaaaatggagcaggcagttaattcaccagttaag
tgccaggactggctgagtggtttggaaagaaggacgactttggatgtttatgttatgtgg
gtgaaagaggcctcccggggttacaaggtgtcattgggtttcctggaatgcaaggacctg
aggggccacagggaccaccaggacaaaagggtgatactggagaaccaggactacctggaa
caaaagggacaagaggacctccgggagcatctggctaccctggaaacccaggacttcccg
gaattcctggccaagacggcccgccaggccccccaggtattccaggatgcaatggcacaa
agggcggggacctcgggcatctggatgttgggatcagccttgtcaaagggaggcttacag
gtctcagcagctgcagtagagataactgcttacctggatgaaggtgagagtccctgtgct
cgctcctgtgattcagaacacggtgatccaggtgagatacttggccatgtgcccgggatg
ctgttgaaaggtgaaagaggatttcccggaatcccagggactccaggcccaccaggactg
ccagggcttcaaggtcctgttgggcctccaggatttaccggaccaccaggacaaatgggc
ttaagttttcaaggaccaaaaggtgacaag
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_2|104_aa
MIATNLQDAGEGKMVLDEDDTREEEVKTAEEDGHLLSGAALSGTSRHCERASSREQVNWT
SEPRTTGRMSVPGKAKEEIPERESKGTKYGNENVHREFEEHWEV
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_2|315_bp
atgattgcaacaaacctacaagatgctggagaaggaaaaatggttttggacgaggatgat
actcgggaggaagaggtgaagacagcagaggaggatggccacttgctgagtggtgctgcc
ctgtcaggaaccagcaggcactgtgaacgtgcatcctcgagggagcaggtgaactggacc
tcagagcccagaaccacagggaggatgtcagtgcccgggaaagcaaaggaggaaattcca
gaaagagaaagcaagggaaccaaatatggaaatgagaacgtgcacagggagtttgaagaa
cattgggaagtttaa
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_3|317_aa
MWKRSCGKPSLHLTSGQRSIWETSMRHPSKAMSKSVLICFTSMCSPMGLSLLHDNLPVYG
CILKPVFHFKGCKESINTCALKAAASTDPPRTSQMKERIKKGREVENHRKRKEESADCNT
ILVYFPYADFLILRRQFEYITKPEKVWAVSAVRSHNGLTDGSIHTHQAPGQPKGRKGNWQ
ATVLHVGRRTCHNPPGVVPTLSPRILAQEKQEKAANPRSSSPGVTLAEMWPRSWMGHLLV
VVQAEGLEESLQSIPVLEHPENSKELSVSGVFLIHHLSTITTHVPWLGALAETVPLVEMS
ISLTATIYCVTGEYSLS
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_3|954_bp
atgtggaagagatcctgcggcaaaccttctctgcatttgacaagtggtcagagatccatc
tgggagacttccatgcgacatccttccaaggccatgagcaaatctgtcctgatctgcttc
acttcgatgtgttccccaatgggtctaagtctccttcatgataaccttccagtttacggc
tgtattttaaaaccagtatttcatttcaagggctgcaaggagtccatcaacacttgtgcc
ctaaaggcggctgctagcactgatcccccaaggacctcacagatgaaagaaaggataaag
aaaggtagggaggtagaaaatcacagaaagaggaaagaggaaagcgccgactgcaatacg
attttggtttattttccatatgcagatttcctaattctacgaaggcaatttgagtacatc
acaaagccagaaaaggtgtgggctgtttcagccgtgcggtctcacaatgggctcacagac
ggcagcatccacacgcaccaagctccgggccagccgaagggacgcaaggggaactggcaa
gccaccgttctccatgtggggagacggacgtgtcacaacccgcctggagtagttccaact
ctgtcgccacgaatcctagctcaggaaaagcaagagaaggctgccaacccccggtcctcc
agccctggagtcacgctagcagaaatgtggccacggagctggatggggcaccttctggtg
gttgtacaagcagaagggcttgaggaatctttgcaaagcatccctgtgctggagcaccca
gaaaactccaaagagctctcagtctctggagttttccttatccaccacctctccaccatc
accacacacgtgccatggctgggggcactggccgaaactgtgcccttagtggagatgtcc
atttctctaacagcgaccatttactgcgtcaccggagaatatagccttagttag
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_4|149_aa
MRWQKLSLQPKEQRCMTVVATGQEPSAKGLSGDGHRPGAFSKGSFSKGSVGSPVTIPVYL
LRKLFAGLEFTDFLKHVLPPVDRSQRRNEGQRFTHILMEDKQLILLILDRVCVCVRNKSD
AEAELTIPWMVFSPITDPFSCFETGFGGR
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_4|450_bp
atgaggtggcaaaagctctccctccagcccaaggaacagcgctgtatgacagtggtggcc
acaggccaggagccttcagcaaagggtctgtccggtgatggccacaggccaggagccttc
agcaaagggtccttcagcaaagggtctgtaggcagccctgttactattcctgtctacctt
ctcaggaagctctttgcaggccttgagttcaccgatttcttgaaacatgttctgccccca
gtggacagatcccagagacggaatgagggtcagcgcttcacccacatcctcatggaggac
aagcagttaatattattgattctggaccgggtttgtgtatgtgtccgaaataaatcggat
gctgaggctgaactaaccattccgtggatggtattttcccccatcactgacccattttcc
tgctttgaaaccggatttggcggtcgttaa
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_5|326_aa
MRDRSLQGPPTNPGPENAPGRATRDPRGAGGRVQARTKGPLGAPKLGAGRRERSWPGTHL
RSGPAVLLVEQKGGGQQQQPDAEPGPHGGAPEAARDGCPACGGRGGQLALGRPDFQRYAP
SRAEDRKGPPEPPGPAPREPSKAGRPAGVPPAGASGRANAFGQQPLKPSPRLSGTDRGPE
WTNRQHGERPARGGRPCPTAVAAAGDSDRGVPRPERLGVQTSGFVVVTHIKAPEVGKVNA
FLVKAHRTKLSFMGVKGKNVEGGLVAKITQSLDVVVTIHQCPLGCSDHLKATGPVTQKCT
SLNAACLEVYLSSSQKDHRLLFVNEI
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_5|981_bp
atgcgcgatcgtagcctacagggacccccaacgaaccccggcccagagaatgcacctggc
cgtgccacgcgcgacccccgaggggcaggcggacgggtccaggcgcggacaaaggggcct
ctcggggcgcccaagctcggggcgggacgccgggagcggagctggccgggaactcacctt
cgcagcggcccggctgtgctcctcgtggagcagaagggcggcgggcagcagcagcagcca
gacgctgagccggggccccatggtggcgcgcccgaggcggcgagggacggctgcccggcg
tgcgggggccgcggcggacagctagctctcggaaggccggacttccagcgctacgcaccg
tcccgggccgaggaccgaaaggggccgcccgagcccccggggccggcgcccagagagccc
agcaaggccggccgccctgccggtgtgccgccggcgggtgcttctggaagggccaatgcg
ttcgggcagcagcccctgaagccgagcccgaggctaagtgggactgaccggggcccagag
tggacgaaccgccagcatggggagagaccagcgcgcggtggccggccctgccctacggcg
gtggctgctgctggggacagtgaccgtggggttcctcgcccagagcgtcttggcgtgcag
acatcaggttttgtggtggttacccacatcaaggcgccagaggttggaaaagtcaacgca
tttctagttaaggcacaccgcacaaaactcagcttcatgggcgtcaagggtaagaatgtg
gaaggtggtctggtggccaaaatcacacagtccctggatgtggtggtcaccatccaccaa
tgtccactggggtgtagtgaccacctgaaggccactggcccggttactcagaaatgtacc
tctttgaatgctgcctgcctcgaggtgtacctctcatcatcacagaaggaccaccgctta
ctctttgttaatgaaatctga
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_6|268_aa
MVFVPLSNRIAEECENFSETVIYCVTAEKPVLKKNEWGLDVNQCDLTFFFPRIVPAVCPL
HDDIKKMGFGILPTPSSGALLMYLGNPSQDPGLPLSPSQPGGQEPAMKASESIFSRHMLH
MSRQKAGLLPMPRSFSCMPQACSFLIPHFTWNPDSRNVLLTLKGVKKFDVPCGGRDCSGG
CQCYPEKGGRGTSYVLRHLGIPETGKYEDWLDNSVLGISEMVKWPCHTTTGKREPTSSAP
ERVSGGLCITQHASRRYPQCPLWPGSCS
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_6|807_bp
atggtgtttgtaccactgagtaacaggatagctgaggagtgtgagaatttttcagaaaca
gtcatatactgtgttacagcagaaaagcctgttcttaaaaagaatgagtggggactagat
gtgaatcaatgtgatttgactttcttcttcccaagaatagtgccagctgtgtgtccactt
catgatgacatcaagaagatgggctttgggatcctgcccacacccagctccggggcgctg
ctgatgtatctgggcaaccccagccaggatcctgggctgcctctatcaccctcacaacct
gggggccaggagccagcaatgaaggccagtgagagcattttctccaggcacatgctccac
atgtccaggcagaaggcagggctgctgcccatgccccggagtttctcatgcatgccacag
gcctgctccttcctgattcctcacttcacatggaatccagattctcggaatgttctcctg
acgctgaagggtgtgaagaagtttgatgtgccgtgtggaggaagagattgcagtgggggc
tgccagtgctaccctgagaaaggtggacgtggcacctcctatgtcctaaggcacttgggt
attccagaaactggtaaatatgaagactggttggataactccgtgttgggaattagtgag
atggtcaaatggccctgtcacacgacaacagggaaaagagaacccacgtcatcagcccct
gaacgagtgagtggaggcctctgcatcactcagcatgcatccaggcggtacccacagtgc
cccctgtggcctggcagctgcagctga
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_7|196_aa
MLEPRLPDSSQMTKSGYSYWVSLKKLICEDVLHGLGEVSCLYKPNQRHQTQRKDGSECQL
LKERLGAPDCWAASSGRMCSSVPVPHQMPETPSPQYDNQKCLQILSDVPERKLPLFKKPW
CSHLRAEGAGECVRWDTEALPHMKQLPFMTVSTDTNVELSTMITLWRRKYTAGPNSLSKD
CLGCNGMAGWGCERVQ
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_7|591_bp
atgctggaaccccggctgccggactcctcacaaatgaccaagtctggctactcctattgg
gtctccttgaagaagctgatctgtgaggacgtcctgcatgggttgggagaagtgtcttgc
ctttataaacccaaccaaagacatcagacacaaaggaaggatgggtctgagtgtcagctg
cttaaagaaaggctcggggcccccgactgctgggctgcctcctcgggcaggatgtgtagc
agcgtccctgtgccccaccagatgccagaaacaccctctccacagtatgacaaccaaaag
tgccttcagattttgtctgatgtccctgagaggaagctgccccttttcaagaagccctgg
tgcagccacctaagagcagaaggagccggagagtgcgtgaggtgggacacggaggctctg
cctcatatgaagcagcttccattcatgactgtgtccaccgacaccaatgtggagctctcc
accatgatcacattgtggagaaggaaatacaccgcgggaccgaattccttgtctaaggac
tgtctaggttgcaatggtatggccggttggggctgtgaacgtgtacagtaa
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_8|495_aa
MQSRPLAIVGVMDPLGVRKRDALVWKAFHMVGRGHREKAPNILAALLLPGGTAHSHCLLL
DALKPSLTACGVTAGTSTTVSHVPSQLLSVVSTQASLQVFSATTAFSQGTWASAKRPPDS
PQLPWNTKLVFSKGQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGA
RGVSGFPGADGIPGHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGEPGEPGL
VGFQGPPGPPGPKGQQGNRGLGFYGVKGEKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPD
QYKGISLKGEEGIMGFPGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPG
PPGLPAYSPHPSLAKGARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPG
EMGPKGFIGDPGIPALYGGPPGPDGKRGPPGPPGLPGPPGPDVRWRALCWCEEHRVSCTV
IIRDAPAGIIEPCRS
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_8|1488_bp
atgcagagccgccctctggccatcgtcggggtcatggatcctttaggagtgaggaagcgg
gatgcgctggtgtggaaggcctttcatatggtcgggcgtggacacagagagaaagcgcca
aacatccttgcggcattgctcctgcctggtggaacagcacattcacactgccttcttctc
gatgctcttaagccctcactcacggcctgtggggttactgcagggacttctaccaccgtc
tcccacgtcccatctcagctcctatctgtggtcagcacccaggcctccctacaggtcttc
tcagccacgacagccttctcccaaggaacctgggcctctgcgaagcggcctcctgattcc
ccacaactcccctggaacacaaagctcgtcttctccaagggtcagcctgggccagtgggc
ccccaggggtacaatgggccaccaggattacaaggattcccgggactgcagggacgtaaa
ggagacaagggtgaaaggggagcccccggagtaacgggacccaagggcgacgtgggagca
agaggcgtttctggattccctggtgccgatggaattcctggacacccggggcaaggtggg
cccaggggaaggccgggctacgatggctgcaacggaacccagggagactcaggtccacag
gggccccccggctctgaggggttcaccgggcctcccggtgaacctggagagcctggattg
gtcggtttccagggaccacctggaccccctggaccaaaaggacagcaaggcaacagagga
cttggtttctacggagttaagggtgaaaagggtgacgtagggcagccgggacccaacggg
attccatcagacaccctccaccccatcatcgcgcccacaggagtcaccttccacccagat
cagtacaagggcatttccttgaagggagaagaaggaatcatgggctttcctggactgagg
ggttaccctggcttgagtggtgaaaaaggatcaccaggacagaagggaagccgaggcctg
gatggctatcaagggcctgatggaccccggggacccaagggagaagccggagacccaggg
ccccctggactacctgcctactcccctcacccttccctagcaaaaggtgccagaggtgac
ccgggattcccaggggcccaaggggagccaggaagccagggtgagccaggagacccgggc
ctcccaggtccccctggcctctccatcggagatggagatcagaggagaggcctgccgggt
gagatgggacccaagggcttcatcggagaccccggcatccctgcgctctacgggggccca
cctggacctgatggaaagcgagggcctccaggaccccccgggctccctggaccacctgga
cctgatgtgcgatggcgcgctctttgctggtgtgaagagcaccgtgtttcttgcaccgtc
atcatcagggatgcccccgccgggatcattgagccctgcaggagctag
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_9|212_aa
MPIHSGVLNALHILHTPKRPRRAQTGSLERQPQMQGMSPTDAWGPTQTQGISPTDAWGPT
QTQGISPTDAWGPTQTQGISPTDAWDARHQPHDAWGPTQMHGISPTDAWGPTQTQGISPT
DAWGPTQTQGISPTDAWGPTQTQGMSPTDAWGPTQTQGISPTDAWGPTQTHGISPTDAWG
PTQTQGISPTDAWGSTQTQGIGTMGARHPTQL
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_9|639_bp
atgcccatccacagcggggtgctgaatgcactgcacatcctacatacgccaaaacggcca
agacgtgcccaaacaggaagcctagagagacaaccacagatgcagggcatgagccccacg
gacgcctggggtcccacgcagacgcagggcatcagccccacggacgcctggggtcccacg
cagacgcagggcatcagccccacggacgcctggggtcccacgcagacgcagggcatcagc
cccacggacgcctgggacgcacggcatcagccccacgacgcctggggtcccacgcagatg
cacggcatcagccccacggacgcctggggtcccacgcagacgcagggcatcagccccacg
gacgcctggggtcccacgcagacgcagggcatcagccccacggacgcctggggtcccacg
cagacgcagggcatgagccccacggacgcctggggtcccacgcagacgcagggcatcagc
cccacggacgcctggggtcccacgcagacgcacggcatcagccccacggacgcctggggt
cccacgcagacgcagggcatcagccccacggacgcctggggttccacgcagacgcagggc
atcggcaccatgggtgcccggcatcccacccagttgtga
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_10|1196_aa
MTPNCFMARSCAGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPPGDPG
YPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQIDCDTD
VKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGEETEAQKVVHGHTDRMVPKASEESQ
ASQELMEDQGPGACQETQVVKGSQDPQDPEDPKVQWASLAQMDPQVPSACQGQMGPLGKG
ASLEKSWELSPGHGEMLVCLDSLGLKAFPETEAPLDSEIRCKSALSAETEDSALRGGALC
VSLPGRHLNEFTEGKVKDIRSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGLHGFPGAPGQ
EGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGHPGSPGFKGI
DGMPGTPGLKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIP
GLSGIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGR
AGLYGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITG
VTGVQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRG
LHGLPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERG
PPGSPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPG
APGTPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPG
FPGAPGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKA
GPQGRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQ
TDQEPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLARKWPRSKGHSETPKPSTAGLAGS
CLARFSTMPFLYCNPGDVCYYASRNDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAP
AIAIAVHSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFI
ECNGGRGTCHYYANKYSFWLTTIPEQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_10|3591_bp
atgactcccaactgcttcatggcacggagctgtgcaggtgagcggggacagcccggcgtc
ccaggtgtgcccgggatgaaaggtgacgatggcagcccaggccgcgatgggctcgatgga
ttccccggcctcccaggccctcccggtgatggcatcaagggccctccaggggacccaggc
tatccaggaatacctggaacgaagggtactccaggagaaatgggccccccaggactgggc
cttcccggcctcaaaggccaacgtggtttccctggagacgccggcttacctggaccacca
ggcttcctgggccctcctggccccgcagggaccccaggacaaatagattgtgacacagat
gtgaaaagggccgttggaggtgacagacaggaggccatccagccaggttgcataggaggg
cccaagggattgccaggcctgccaggacccccaggccccacaggtgaggaaactgaggca
cagaaagttgtccacggtcacacagatagaatggtgccaaaggcctccgaggaatcccag
gcttcgcaggagctgatggaggaccagggcccaggggcttgccaggagacgcaggtcgtg
aagggttcccaggacccccaggaccccgaggatccaaaggtgcagtgggcctccctggcc
cagatggatccccaggtcccatcggcctgccagggccagatgggccccctggggaaaggg
gcctccctggagaagtcctgggagctcagcccgggccacggggagatgctggtgtgcctg
gacagcctgggcttaaaggccttcccggagacagaggcccccctggattcagagataaga
tgcaaatcagcgctgtctgcagaaactgaggactcggcgctgagaggtggggctctgtgt
gtcagccttcctggaaggcacttaaatgagttcacagaaggcaaagtgaaggacatacga
agccaagggatgcctgggatgccagggctgaagggccagccaggcctcccaggaccttcc
ggccagccaggcctgtatgggcctccaggactgcatggattcccaggagctcctggccaa
gaggggcccttggggctgccaggaatcccaggccgtgaaggtctgcctggtgatagaggg
gaccctggggacacaggcgctcctggccctgtgggcatgaaaggtctctctggtgacaga
ggagatgctggcttcacaggggagcaaggccatccaggaagccctggatttaaaggaatt
gatggaatgcctgggacccccgggctaaaaggcagccgaggggaccctgggcccccagga
ccacctcctgtcatcctgccaggaatgaaagacattaaaggagagaaaggagatgaaggg
cctatggggctgaaaggatacctgggcgcaaaaggtatccaaggaatgccaggcatccca
gggctgtcaggaatccctgggctgcctgggaggcccggccacatcaaaggagtcaaggga
gacatcggagtccccggcatccccggtttgccaggattccctggggtggctggcccccct
ggaattacgggattcccaggattcataggaagccggggtgacaaaggtgccccagggaga
gcaggcctgtatggcgagattggcgcgactggtgatttcggtgacatcggggacactata
aatttaccaggaagaccaggcctgaagggggagcggggcaccactggaataccaggtctg
aagggattctttggagagaagggaacagaaggtgacatcggcttccctgggataacaggc
gtgactggagtccaaggccctcctggacttaaaggacaaacaggctttccagggctgact
gggcctccagggtcgcagggagagctggggcggattggactgcctggtggcaaaggagat
gatggctggccgggagctccgggcttaccaggttttccgggactccgtgggatccgcggc
ttacacggcttgccaggcaccaagggctttccaggatccccaggttctgacatccacgga
gacccaggcttcccaggccctcctggggaaagaggtgacccaggagaggccaacaccctt
ccaggccctgtgggagtcccaggacagaaaggagaccaaggagctccaggggaacgaggc
ccacctgggagcccaggacttcaggggttccctggtatcacacccccttccaacatctct
ggggcacctggtgacaaaggggcgccagggatatttggcctgaaaggttatcggggccca
ccagggccaccaggttctgctgctcttcctggaagcaaaggtgacacagggaacccagga
gctccaggaaccccagggaccaaaggatgggccggggactccgggccccagggcaggcct
ggtgtgtttggtctcccaggagaaaaagggcccaggggtgaacaaggcttcatggggaac
actggacccactggggcggtgggcgacagaggccccaagggacccaagggagacccagga
ttccctggtgcccccgggactgtgggagcccccgggattgcaggaatcccccagaagatt
gccgtccaaccagggacagtgggtccccaggggaggcgaggcccccctggggcaccgggg
gagatggggccccagggcccccccggagaaccaggtttccgtggggctccagggaaagct
gggccccaaggaagaggtggtgtgtctgctgttcccggcttccggggagatgaaggaccc
ataggccaccaggggccgattggccaagaaggtgcaccaggccgtccagggagcccgggc
ctgccgggtatgccaggccgcagcgtcagcatcggctacctcctggtgaagcacagccag
acggaccaggagcccatgtgcccagtgggcatgaacaaactctggagtggatacagcctg
ctgtacttcgagggccaggagaaggcgcacaaccaggacctggggctggcccggaagtgg
ccaagatcaaagggccacagcgagactcccaaaccctccacggctgggctggcgggctcc
tgcctggcgcggttcagcaccatgcccttcctgtactgcaaccctggtgatgtctgctac
tatgccagccggaacgacaagtcctactggctctctaccactgcgccgctgcccatgatg
cccgtggccgaggacgagatcaagccctacatcagccgctgttctgtgtgtgaggccccg
gccatcgccatcgcggtccacagtcaggatgtctccatcccacactgcccagctgggtgg
cggagtttgtggatcggatattccttcctcatgcacacggcggcgggagacgaaggcggt
ggccaatcactggtgtcaccgggcagctgtctagaggacttccgcgccacaccattcatc
gaatgcaatggaggccgcggcacctgccactactacgccaacaagtacagcttctggctg
accaccattcccgagcagagcttccagggctcgccctccgccgacacgctcaaggccggc
ctcatccgcacacacatcagccgctgccaggtgtgcatgaagaacctgtga
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_11|307_aa
MAGLKGRGHHAGVSEASGMRMEAPAGPVGDECPLLPFLQFAANICGERKCCFLRARPPSA
ADVSNDEKHAKAGRTYQQACSARPSLLGALTCCVHCTCPRNTCLGQICSPWREQFHGLGS
MYCRGAAAIILTYDVNHRQSLVELEDRFLGLTDTASKDCLFAIVGNKVDLTEEGALAGQE
KEECSPNMDAGDRVSPRAPKQVQLEDAVALYKKILKYKMLDEQDVPAAEQMCFETSAKTG
YNVDLLFETLFDLRQKPASAPAPLTVSAPWVVTTSYNSDISCSQAFLFTVLDTYECIHTC
GFGHVHV
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_11|924_bp
atggcaggtctgaagggtcgtgggcatcacgctggagtgtctgaggcatctggcatgcgg
atggaagcacctgctggccctgttggtgatgagtgccccttactgccatttctgcagttt
gctgccaatatttgtggtgaaaggaaatgctgcttcctgagagctcgtccccccagtgcg
gcagatgtgtccaacgatgaaaagcacgccaaggctgggaggacataccagcaggcgtgc
tcggctcggccctccctgctgggggctctcacctgctgcgtgcactgcacctgtcccagg
aacacctgccttggccagatctgctccccttggcgggagcagttccacggcctgggctcc
atgtactgccggggggcggccgccatcatcctcacctatgatgtgaatcaccggcagagc
ctggtggagctggaggaccggttcctgggcctgacagacacagccagcaaagactgcctc
tttgccatcgtggggaacaaagtggacctcactgaggagggggccttggcgggccaggag
aaggaagagtgcagtcccaatatggacgctggggaccgtgtctccccaagggcacctaag
caggtgcagctggaggatgcggtggccctttataaaaagatcctcaagtacaagatgctg
gatgagcaggatgtgccggccgctgagcaaatgtgctttgagaccagcgccaagaccggc
tacaatgtggacctcctgtttgagaccctctttgacctgaggcagaagcctgcgtctgca
cctgcacctctgaccgtttcagcaccctgggttgttaccacgtcctacaactctgacatt
tcttgttctcaagcgtttctcttcactgtcctggacacgtacgagtgcatacatacctgc
ggctttgggcacgttcacgtgtag
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_12|175_aa
MLDGTEQTLLRAGDPQPEPTLNTREDTRRHRGKDDVKADPRGGDHGETEAEMRATAPQAR
TASTSQELDSGGNSLSLEPPEICEKKEAVTTEKDIKEKRMICPAKEAINCPHLEEEAATK
LIKRKVSHHKQTNLRARAPQWSPVTIQETSHENSAQLYQENTLSQHPAAGYDCGS
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_12|528_bp
atgctggatggcacagagcagactctgctcagagccggagacccccaacccgaacccacc
ctgaacaccagagaagacacccggagacacaggggcaaagacgatgtgaaggcagacccc
cggggaggagaccatggggaaacagaggcagagatgagagccacagctccgcaggcaagg
acggccagcaccagtcaggagctggacagtggcgggaacagcctctccctggagcctcct
gagatatgtgagaagaaagaggcagtaaccactgaaaaagatatcaaagaaaaacgaatg
atttgtccagccaaagaagccatcaattgtccgcaccttgaagaggaagctgccactaaa
ctcataaagaggaaagtgtctcatcataaacagaccaacctcagggcaagagccccacaa
tggtcacctgtcaccatccaggaaacatctcatgaaaactctgctcaattgtaccaagaa
aatactctgagccaacatccagcagccggatatgactgtgggagttga
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_13|130_aa
MLSTAPTLTAGLCWAAASPAQLPMPLEWELPGCWVKVQNLAQEAWLGPGFCCCQPEDLAE
EETATLQPLINPRKSSQVSRELPALILVLVEFSGPEPFACLGLSFLSVPAPTGSDPFNQI
SVPSRDFPKG
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_13|393_bp
atgctctccactgccccgaccctcacggctggtctctgctgggctgcagcaagccccgcg
cagctgcccatgcccctggaatgggagttacctggctgttgggttaaagtacagaatctg
gctcaggaggcctggctgggtcctgggttctgctgctgccagcccgaggaccttgccgag
gaggagacggccacgcttcagcctctcatcaacccaaggaaaagcagccaagttagcagg
gaactgcctgccctaatcttggtattagtggaattctctgggccagagcccttcgcctgc
ctggggctcagcttcctcagtgtccctgcaccaactggatctgatccctttaaccaaatc
agtgttccttccagggacttccctaagggctag
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_14|250_aa
MRKPDSKIVLLGDMNVGKTSLLQRYMERRFPDTVSTVGGAFYLKQWRSYNISIWDTAGQP
KVVLPVVGKYKLRLPGAGHCLGPRSTAVSRPAGRCPAGPGFQRRGKGRQEVNGTASHLIV
ALKKIRQADEKAAGVPSTDLHLALEEPAWKAAGSPDLRHFPVPASSSSCVSGDWVLERKA
QADSRLRFRGVLCPSCEETVPLSVSTRSLSFGMSLADPNVLGARTQGRVDGKDVEERWAQ
PLASLEDADG
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_14|753_bp
atgaggaagcccgacagcaagatcgtgctcctgggggacatgaacgtggggaagacgtcg
ctgctgcagcggtatatggagcggcgcttcccggacacggtcagcacggtgggcggcgcc
ttctacctgaagcagtggcgctcctacaacatctccatctgggacaccgcaggacagcct
aaagtggtgttgcctgttgttggcaaatataaactgaggctccccggtgccgggcactgt
ttgggccctcgttctacagcagtgagcaggccggctggccgctgccccgcaggccctgga
ttccagaggagggggaaaggcaggcaagaagtgaatggtacagcttcacatctgatcgtt
gctctgaagaaaatacgacaggctgatgagaaagcggctggtgtgcccagcacagacctg
cacctggcccttgaagagccagcttggaaggcggcggggagcccagacttgcgccacttc
cctgtgccggcatccagctccagctgcgtctctggggactgggtgttggagagaaaggca
caggctgattcacgtttgcgcttccgtggcgtcctctgcccaagctgcgaggaaacagtt
cctctgtctgtctcaactagaagtctgagctttggcatgagccttgcagaccccaatgtt
cttggtgccagaacccaaggacgggtggacggcaaagatgtggaagagagatgggcccag
cctctggcctctttagaggatgctgatggctga
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_15|98_aa
MKWFHRVCLQAPSARHSTENQPEEALEAGEEKAVASTSGQVQGTGGQLGPVEPVGTVAQL
CGSCLVYLALSKLLPGGLASPSSKHHRVYSAFENIQKV
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_15|297_bp
atgaagtggttccaccgagtctgcctgcaagccccctctgcaaggcattccacagaaaac
cagcctgaggaggccctggaagctggagaggagaaggcagttgccagcacctcggggcag
gtccagggtactggtgggcagcttggacccgtggagcctgtgggaaccgtggctcagctc
tgtggctcctgcctggtgtacttggccctcagcaaacttcttccaggaggcctggcctcc
cccagttctaagcatcaccgtgtctacagtgcttttgaaaacattcagaaagtataa
>gi568815585f:110207904_110612188|GENSCAN_predicted_peptide_16|89_aa
MDSKAAEKTCNINNAFDPGTANEHTVQCFPNPSETITSEKYAQQIEEMHRKLQRLQPALV
NRKGPILLHRSAPTARHTTNTSKVEQIGL
>gi568815585f:110207904_110612188|GENSCAN_predicted_CDS_16|270_bp
atggatagtaaagcagcggagaaaacctgcaacatcaacaatgcatttgacccaggaact
gctaatgaacacacagtgcagtgctttccgaatcccagtgaaaccattacatctgagaag
tatgctcagcaaatcgaagagatgcatcgaaaactgcaacgcctgcagccggcattggtc
aacaggaagggcccaattcttctccaccgcagcgcccccaccgcacgtcacacaaccaac
acttcaaaagtggaacaaattgggctgtga