GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:56:42 Sequence gi568815595f:129428734_129633715 : 204982 bp : 51.95% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 3873 3770 104 1 2 102 94 113 0.995 13.69 1.08 Intr - 4514 4365 150 0 0 73 68 99 0.836 7.05 1.07 Intr - 5251 5117 135 2 0 45 36 89 0.486 0.45 1.06 Intr - 8575 7728 848 0 2 58 85 228 0.341 11.19 1.05 Intr - 9217 8987 231 0 0 65 115 76 0.523 5.42 1.04 Intr - 11114 10997 118 0 1 27 131 56 0.455 3.83 1.03 Intr - 11305 11188 118 1 1 -25 52 115 0.253 -2.86 1.02 Intr - 11737 11468 270 1 0 6 103 196 0.134 11.18 1.01 Init - 20455 20381 75 1 0 42 93 18 0.043 -1.26 1.00 Prom - 22710 22671 40 -0.31 2.03 PlyA - 22854 22849 6 1.05 2.02 Term - 27163 27095 69 1 0 124 36 107 0.587 7.33 2.01 Init - 30177 30136 42 0 0 59 100 32 0.405 1.92 2.00 Prom - 31847 31808 40 -2.81 3.00 Prom + 31856 31895 40 -3.31 3.01 Init + 32500 32571 72 0 0 87 84 15 0.290 2.03 3.02 Intr + 34847 34893 47 1 2 15 111 33 0.316 -4.00 3.03 Intr + 35902 36048 147 1 0 77 73 162 0.639 13.46 3.04 Intr + 38157 38333 177 0 0 60 70 193 0.988 14.25 3.05 Intr + 40609 40684 76 1 1 75 91 89 0.998 7.91 3.06 Intr + 47582 47773 192 1 0 95 83 102 0.916 10.61 3.07 Intr + 47930 48068 139 1 1 74 116 202 0.987 22.14 3.08 Intr + 49283 49485 203 0 2 35 80 311 0.964 24.53 3.09 Intr + 51052 51189 138 0 0 36 109 176 0.999 15.67 3.10 Intr + 52797 52961 165 2 0 49 72 398 0.999 34.97 3.11 Intr + 54752 54949 198 1 0 90 93 309 0.993 31.67 3.12 Intr + 59524 59664 141 0 0 37 121 135 0.597 12.76 3.13 Intr + 63408 63461 54 2 0 103 83 67 0.983 7.36 3.14 Intr + 66713 66874 162 1 0 83 86 225 0.996 22.49 3.15 Intr + 71169 71335 167 2 2 90 53 215 0.535 17.47 3.16 Intr + 73978 74149 172 1 1 135 78 397 0.999 43.86 3.17 Intr + 74873 74944 72 1 0 90 49 59 0.798 2.30 3.18 Intr + 75586 75688 103 0 1 106 102 142 0.999 17.65 3.19 Intr + 77676 77816 141 1 0 76 93 140 0.994 14.13 3.20 Intr + 78932 79029 98 0 2 46 103 101 0.765 7.63 3.21 Term + 79456 79470 15 0 0 112 47 -3 0.917 -3.48 3.22 PlyA + 79861 79866 6 -1.75 4.02 PlyA - 79922 79917 6 1.05 4.01 Sngl - 80797 80540 258 1 0 67 48 286 0.819 17.94 4.00 Prom - 82830 82791 40 -5.41 5.00 Prom + 83424 83463 40 -8.29 5.01 Init + 84201 84331 131 2 2 85 72 86 0.957 6.30 5.02 Intr + 85656 85821 166 0 1 90 78 193 0.995 18.98 5.03 Intr + 86755 86866 112 0 1 74 113 252 0.966 26.76 5.04 Intr + 88736 88890 155 0 2 111 -10 222 0.269 15.00 5.05 Intr + 89211 89279 69 2 0 71 49 79 0.338 2.17 5.06 Intr + 90364 90453 90 0 0 62 78 130 0.801 10.09 5.07 Intr + 90835 90999 165 0 0 79 96 118 0.998 12.37 5.08 Term + 99321 99530 210 2 0 83 49 93 0.236 2.52 5.09 PlyA + 99562 99567 6 1.05 6.00 Prom + 99873 99912 40 -2.61 6.01 Init + 100001 100361 361 1 1 84 60 660 0.862 60.06 6.02 Intr + 102143 102311 169 0 1 115 95 332 0.999 36.12 6.03 Intr + 103518 103683 166 0 1 106 80 387 0.999 40.18 6.04 Intr + 103800 104039 240 2 0 70 51 433 0.865 36.08 6.05 Term + 104875 104985 111 0 0 113 47 159 0.951 13.16 6.06 PlyA + 105349 105354 6 1.05 7.03 PlyA - 105671 105666 6 -1.75 7.02 Term - 105848 105763 86 0 2 96 28 31 0.418 -3.99 7.01 Init - 106230 106173 58 0 1 74 102 74 0.579 8.82 7.00 Prom - 110306 110267 40 0.99 8.00 Prom + 114406 114445 40 1.39 8.01 Init + 114486 114573 88 2 1 54 99 65 0.905 4.97 8.02 Intr + 118658 118947 290 0 2 99 80 417 0.968 39.51 8.03 Intr + 120268 120583 316 0 1 102 56 279 0.561 22.29 8.04 Intr + 122012 122076 65 0 2 70 100 42 0.952 2.53 8.05 Term + 122374 122607 234 0 0 88 49 128 0.916 5.55 8.06 PlyA + 122704 122709 6 1.05 9.42 PlyA - 125305 125300 6 1.05 9.41 Term - 127272 127142 131 1 2 45 43 113 0.474 1.15 9.40 Intr - 127695 127590 106 0 1 97 51 208 0.528 18.29 9.39 Intr - 127958 127884 75 2 0 103 109 148 0.999 18.61 9.38 Intr - 128490 128350 141 0 0 110 45 365 0.993 35.46 9.37 Intr - 129842 129695 148 1 1 95 82 345 0.998 35.35 9.36 Intr - 131050 130887 164 1 2 101 43 277 0.987 23.79 9.35 Intr - 131701 131597 105 1 0 84 113 186 0.999 21.61 9.34 Intr - 131990 131956 35 0 2 114 97 23 0.994 4.23 9.33 Intr - 132976 132913 64 1 1 107 57 57 0.821 3.28 9.32 Intr - 133170 133067 104 1 2 127 56 229 0.999 23.99 9.31 Intr - 134210 134054 157 2 1 136 60 322 0.943 34.30 9.30 Intr - 134507 134361 147 2 0 80 114 319 0.998 34.64 9.29 Intr - 136805 136607 199 1 1 105 76 489 0.998 49.28 9.28 Intr - 137284 137154 131 1 2 85 71 232 0.986 21.20 9.27 Intr - 137898 137794 105 0 0 68 113 52 0.987 6.61 9.26 Intr - 138916 138759 158 2 2 80 82 249 0.952 23.74 9.25 Intr - 139072 138965 108 2 0 110 67 244 0.980 25.26 9.24 Intr - 141224 141110 115 2 1 62 82 286 0.935 25.92 9.23 Intr - 141973 141857 117 1 0 50 78 85 0.750 4.87 9.22 Intr - 142202 142053 150 2 0 89 79 219 0.999 21.97 9.21 Intr - 142570 142286 285 1 0 55 49 428 0.669 33.68 9.20 Intr - 142914 142776 139 2 1 89 89 135 0.514 14.67 9.19 Intr - 143111 142944 168 1 0 96 94 241 0.999 25.08 9.18 Intr - 144015 143876 140 0 2 103 69 212 0.966 20.57 9.17 Intr - 144259 144109 151 0 1 -14 79 215 0.330 11.08 9.16 Intr - 145117 144861 257 1 2 62 64 267 0.569 18.48 9.15 Intr - 145757 145603 155 0 2 83 103 142 0.741 15.50 9.14 Intr - 146503 146423 81 2 0 110 25 49 0.434 0.91 9.13 Intr - 146829 146736 94 0 1 119 80 92 0.998 11.54 9.12 Intr - 147122 147033 90 2 0 133 89 112 0.977 16.49 9.11 Intr - 149700 149596 105 0 0 140 111 88 0.998 17.21 9.10 Intr - 154936 154834 103 0 1 65 115 41 0.932 5.18 9.09 Intr - 155500 155392 109 2 1 97 75 196 0.998 19.04 9.08 Intr - 155829 155652 178 0 1 70 98 173 0.989 16.51 9.07 Intr - 157348 157219 130 0 1 86 81 208 0.994 21.10 9.06 Intr - 157539 157439 101 0 2 76 77 99 0.721 7.01 9.05 Intr - 157986 157855 132 0 0 88 94 164 0.999 18.35 9.04 Intr - 160794 160618 177 0 0 96 100 331 0.853 35.73 9.03 Intr - 164854 164603 252 1 0 59 59 151 0.438 7.46 9.02 Intr - 168170 168095 76 1 1 68 -3 71 0.121 -3.99 9.01 Init - 169318 169200 119 1 2 90 93 102 0.337 8.55 9.00 Prom - 171331 171292 40 -6.60 10.02 PlyA - 171652 171647 6 1.05 10.01 Sngl - 177906 176512 1395 0 0 68 49 2842 0.917 272.92 10.00 Prom - 187914 187875 40 1.49 11.04 PlyA - 195964 195959 6 1.05 11.03 Term - 198964 198873 92 2 2 69 38 102 0.741 1.48 11.02 Intr - 200657 200558 100 2 1 94 65 46 0.842 3.08 11.01 Intr - 204046 203970 77 2 2 120 89 84 0.991 11.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 3243 3166 78 0 0 54 54 47 0.820 -4.04 S.002 Term + 91443 91532 90 2 0 55 47 110 0.927 1.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_1|683_aa MKTGFEGGRRSDTGARKKLFSQIVKAASPLPPSNTLQRLAHSPRGLDLLLFANRPAPHQC SALSLHVNTALITASLTARAPPTRLSVCLSNRLTNGRHSTSHFGYLRYPWQRTCANGFDW WSVEMAAVAEGAAGKQRRRLGRFRNIQTSVAARCPGPCSMGTTGLESLSLGDRGAAPTVT SSERLVPDPPNDLRKEDVAMELERVGEDEEQMMIKRSSECNPLLQEPIASAQFGATAGTE CRKSVPCGWERVVKQRLFGKTAGRFDVYFISPQGLKFRSKSSLANYLHKNGETSLKPEDF DFTVLSKRGIKSRYKDCSMAALTSHLQNQSNNSNWNLRTRSKCKKDVFMPPSSSSELQES RGLSNFTSTHLLLKEDEGVDDVNFRKVRKPKGKVTILKGIPIKKTKKGCRKSCSGFVQSD SKRESVCNKADAESEPVAQKSQLDRTVCISDAGACGETLSVTSEENSLVKKKERSLSSGS NFCSEQKTSGIINKFCSAKDSEHNEKYEDTFLESEEIGTKVEVVERKEHLHTDILKRGSE MDNNCSPTRKDFTALSPPRRKAFKKWTPPRSPFNLVQETLFHDPWKLLIATIFLNRTSGK MAIPVLWKFLEKYPSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPI ELHGIGKYGNDSYRIFCVNEWKQ >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_1|2049_bp atgaaaacaggattcgaaggaggaagacgcagtgatacaggagctagaaagaaattattt agccagatagtgaaggcagcaagcccgctgcccccctcaaacacgctgcagcggctcgcg cactccccgcgaggactcgacctcctgctcttcgcgaaccgccccgctcctcaccagtgc tcggctttatctctccacgtcaacacggccctcatcacggcttcccttacggctcgggct cctcctacccgcctcagcgtctgtctcagcaaccgcctcactaacggtcgccactccaca agccactttggctacctgcgttacccctggcaacgtacatgcgcgaatggatttgattgg tggagcgtggaaatggcggctgtagccgagggggcggccggaaagcagcggcggcgtctg gggcgctttcgcaacattcagacctcggttgcagcccggtgccccggaccctgctcgatg ggcacgactgggctggagagtctgagtctgggggaccgcggagctgcccccaccgtcacc tctagtgagcgcctagtcccagacccgccgaatgacctccgcaaagaagatgttgctatg gaattggaaagagtgggagaagatgaggaacaaatgatgataaaaagaagcagtgaatgt aatcccttgctacaagaacccatcgcttctgctcagtttggtgctactgcaggaacagaa tgccgtaagtctgtcccatgtggatgggaaagagttgtgaagcaaaggttatttgggaag acagcaggaagatttgatgtgtactttatcagcccacaaggactgaagttcagatccaaa agttcacttgctaattatcttcacaaaaatggagagacttctcttaagccagaagatttt gattttactgtactttctaaaaggggtatcaagtcaagatataaagactgcagcatggca gccctgacatcccatctacaaaaccaaagtaacaattcaaactggaacctcaggacccga agcaagtgcaaaaaggatgtgtttatgccgccaagtagtagttcagagttgcaggagagc agaggactctctaactttacttccactcatttgcttttgaaagaagatgagggtgttgat gatgttaacttcagaaaggttagaaagcccaaaggaaaggtgactattttgaaaggaatc ccaattaagaaaactaaaaaaggatgtaggaagagctgttcaggttttgttcaaagtgat agcaaaagagaatctgtgtgtaataaagcagatgctgaaagtgaacctgttgcacaaaaa agtcagcttgatagaactgtctgcatttctgatgctggagcatgtggtgagaccctcagt gtgaccagtgaagaaaacagccttgtaaaaaaaaaagaaagatcattgagttcaggatca aatttttgttctgaacaaaaaacttctggcatcataaacaaattttgttcagccaaagac tcagaacacaacgagaagtatgaggatacctttttagaatctgaagaaatcggaacaaaa gtagaagttgtggaaaggaaagaacatttgcatactgacattttaaaacgtggctctgaa atggacaacaactgctcaccaaccaggaaagacttcactgctcttagccccccacgacgt aaagcctttaagaaatggacacctcctcggtcaccttttaatctcgttcaagaaacactt tttcatgatccatggaagcttctcatcgctactatatttctcaatcggacctcaggcaaa atggcaatacctgtgctttggaagtttctggagaagtatccttcagctgaggtagcaaga accgcagactggagagatgtgtcagaacttcttaaacctcttggtctctacgatcttcgg gcaaaaaccattgtcaagttctcagatgaatacctgacaaagcagtggaagtatccaatt gagcttcatgggattggtaaatatggcaacgactcttaccgaattttttgtgtcaatgag tggaagcag >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_2|36_aa MSYIKAPSPVLATPPIQPEDNEDEDLYDDPLPFNEE >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_2|111_bp atgagctacataaaagcacctagcccagtgctggccacgccacctattcaacctgaagac aatgaagatgaagacctttatgatgatccacttccatttaatgaagagtaa >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_3|892_aa MMLYNVSPTILLLINWHLVPPVTLKSVSKHKSSSKIICCSWTNDGQYLALGMFNGIISIR NKNGEEKVKIERPGGSLSPIWSICWNPSSRWESFWMNRENEDAEDVIVNRYIQEIPSTLK SAVYSSQGSEAEEEEPEEEDDSPRDDNLEERNDILAVADWGQKVSFYQLSGKQIGKDRAL NFDPCCISYFTKGEYILLGGSDKQVSLFTKDGVRLGTVGEQNSWVWTCQAKPDSNYVVVG CQDGTISFYQLIFSTVHGLYKDRYAYRDSMTDVIVQHLITEQKVRIKCKELVKKIAIYRN RLAIQLPEKILIYELYSEDLSDMHYRVKEKIIKKFECNLLVVCANHIILCQEKRLQCLSF SGVKEREWQMESLIRYIKVIGGPPGREGLLVGLKNGQILKIFVDNLFAIVLLKQATAVRC LDMSASRKKLAVVDENDTCLVYDIDTKELLFQEPNANSVAWNTQCEDMLCFSGGGYLNIK ASTFPVHRQKLQGFVVGYNGSKIFCLHVFSISAVEVPQSAPMYQYLDRKLFKEAYQIACL GVTDTDWRELAMEALEGLDFETAKKAFIRVQDLRYLELISSIEERKKRGETNNDLFLADV FSYQGKFHEAAKLYKRSGHENLALEMYTDLCMFEYAKDFLGSGDPKETKMLITKQADWAR NIKEPKAAVEMYISAGEHVKAIEICGDHGWVDMLIDIARKLDKAEREPLLLCATYLKKLD SPGYAAETYLKMGDLKSLVQLHVETQRWDESESKAFTERSRLTVVCSLNSSIGEAFALGE KHPEFKDDIYMPYAQWLAENDRFEEAQKAFHKAGRQREAVQVLEQLTNNAVAESRFNDAA YYYWMLSMQCLDIAQADPAQKDTMLGKFYHFQRLAELYHGYHAIHRHTFPSL >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_3|2679_bp atgatgctatacaatgtgtctcctacaatcctattactcatcaactggcatcttgttcct ccagtgactttgaagtctgtctccaaacacaaatcaagcagcaagatcatctgctgcagc tggacaaatgatggtcagtacctggcgctggggatgttcaatgggatcatcagcatacgg aacaaaaatggcgaggagaaagtaaagatcgagcggccggggggctccctctcgccaata tggtccatctgctggaacccttcaagccgatgggagagtttctggatgaacagagagaat gaggatgccgaggatgtcattgtcaacagatatattcaggaaatcccttccactctgaag tcagcagtgtacagtagtcagggtagtgaggcagaggaggaagaaccagaggaagaggac gacagtcccagggacgacaacttagaggaacgtaatgacatcctggctgtggctgactgg ggacagaaagtttccttctaccagctgagtggaaaacagattggaaaggatcgggcactg aactttgacccctgctgcatcagctactttactaaaggcgagtacattttgctggggggt tcagacaagcaagtatctcttttcaccaaggatggagtgcggcttgggactgttggggag cagaactcctgggtgtggacgtgtcaagcgaaaccggattccaactatgtggtggtcggc tgccaggacggcaccatttccttctaccagcttattttcagcacagtccatgggctttac aaggaccgctatgcctacagggatagcatgactgacgtcattgtgcagcacctgatcact gagcagaaagttcggattaaatgcaaagagcttgtcaagaagattgccatctacagaaat cgattggctatccaactgccagagaaaatcctcatctatgagttgtattcagaggactta tcagacatgcattaccgggtaaaggagaagattatcaagaagtttgagtgcaacctcctg gtggtgtgtgccaatcacatcatcctgtgccaggagaaacggctgcagtgcctgtccttc agcggagtgaaggagcgggagtggcagatggagtctctcattcgttacatcaaggtgatc ggtggccctcctggaagagaaggcctcttagtggggctgaagaatggacagatcctgaag atcttcgtggacaatctctttgctatcgtcctgctgaagcaggccacagctgtgcgctgc ttggacatgagtgcctcccgtaagaagctggccgtggtagatgaaaatgacacttgcctg gtgtatgacatcgacaccaaggagctgctttttcaggaaccaaacgccaacagtgtagct tggaacacccagtgtgaggacatgctctgcttctcgggaggaggctacctcaacatcaaa gccagcaccttccctgtgcaccggcagaagctgcagggctttgtggtcggctacaatggc tccaagatcttctgcctccatgtcttctccatttctgccgtggaggtgccgcagtccgct cccatgtaccagtacctggataggaaactgttcaaggaagcctaccagattgcttgcttg ggtgtcacagacactgattggcgtgaactggccatggaagcgctagaaggtttagatttt gaaacagcaaagaaggccttcatcagagtacaagacctccgatatttagagctcatcagc agcattgaggagaggaagaagcggggagagaccaacaatgacctgtttctggcagatgtg ttttcctaccaggggaagttccatgaggccgccaaactgtacaagaggagtgggcacgag aacctcgcgcttgaaatgtacaccgacctctgcatgtttgagtatgccaaggatttcctt ggatctggagaccccaaagaaacaaagatgctaatcaccaaacaggctgactgggccaga aatatcaaggagcccaaagccgccgtggagatgtacatctcagcaggagagcacgtcaag gccatcgagatctgtggtgaccatggctgggttgacatgttgatcgacatcgcccgcaaa ctggacaaggctgagcgcgagcccctgctgctgtgcgctacctacctcaagaagctggac agccctggctatgctgctgagacctacctgaagatgggtgacctcaagtccctggtgcag ctgcacgtggagacccagcgctgggatgagagtgaaagcaaagccttcactgagaggagc cgtctgactgttgtctgctctctgaacagctccattggtgaggcctttgctttgggtgag aagcatcctgagtttaaggatgacatctacatgccgtatgctcagtggctagcagagaac gatcgctttgaggaagcccagaaagcgttccacaaggctgggcgacagagagaagcggtc caggtgctggagcagctcacaaacaatgccgtggcggagagcaggtttaatgatgctgcc tattattactggatgctgtccatgcagtgcctcgatatagctcaagcagatcctgcccag aaggacacaatgcttggcaagttctaccacttccagcgtttggcagagctgtaccatggt taccatgccatccatcgccacacgtttccttctctgtga >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_4|85_aa MGHKATETTRNIDNAFGPGTANKHTVQWWFKKFCKGDERLEDEEHSGRPLEVDNNREQSS KLILSQLQEKLPKNSMSTILRLFGI >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_4|258_bp atgggtcataaagcaacagagacaactcgtaacatcgacaacgcatttggcccaggaact gctaacaaacatacagtgcagtggtggttcaagaagttttgcaaaggagatgagaggctt gaagatgaggagcatagtggccggccactggaagttgacaacaaccgagagcaatcatcg aagctgatcctctcacaactacaggagaagttgccgaagaactcaatgtcaaccattcta cggttgtttggcatttga >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_5|365_aa MVREVYLEEEMSIMQAEAAREDGQVDVPGEDPAYGEASKACGRGKILFTLAKQSKALGAY RLARHAYDKLRGLYIPARFQKSIELGTLTIRAKPFHDSEELVPLCYRCSTNNPLLNNLGN VCINCRQPFIFSASSYDVLHLVEFYLEEGITDEEAISLIDLEVLRPKRDDRQLEIANNSI HALGQSLCLFHSSHFRMQVTESKTDKLNKQRIPPGSQILRLVETKDSIGDEDPFTAKLSF EQGGSEFVPVVVSRLVLRSMSRRDVLIKRWPPPLRWQYFRSLLPDASITMCPSCFQGPAN KCLMNEQESEFQFHATRIGLLGPRLCVWHQKRKLQVAAPALMELLLSEECGDWMTPEVTC GGTNR >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_5|1098_bp atggtcagggaagtttacctggaggaggagatgtccatcatgcaagctgaggcagccagg gaagatggccaagtggatgttccaggggaggacccagcatatggagaggcaagcaaggca tgtgggagagggaaaatactcttcaccttggccaagcagagcaaggccctcggtgcctac aggctggcccggcacgcctatgacaagctgcgtggcctgtacatccctgccagattccaa aagtccattgagctgggtaccctgaccatccgcgccaagcccttccacgacagtgaggag ttggtgcccttgtgctaccgctgctccaccaacaacccgctgctcaacaacctgggcaac gtctgcatcaactgccgccagcccttcatcttctccgcctcttcctacgacgtgctacac ctggttgagttctacctggaggaagggatcactgatgaagaagccatctccctcatcgac ctggaggtgctgagacccaagcgggatgacagacagctagagattgcaaacaacagtatc catgcccttggccagagcctgtgtctcttccactcaagtcactttcgcatgcaagtgaca gagtccaaaactgacaagctgaacaagcaaaggatccctccaggctcccagattctgcgg ctagtggagaccaaggactccatcggagatgaggacccgttcacagctaagctgagcttt gagcaaggtggctcagagttcgtgccagtggtggtgagccggctggtgctgcgctccatg agccgccgggatgtcctcatcaagcgatggcccccacccctgaggtggcaatacttccgc tcactgctgcctgacgcctccattaccatgtgcccctcctgcttccaggggcctgcaaat aaatgtttaatgaacgaacaagagagtgaattccaattccatgcaacaaggattgggctc ctgggccctaggctatgtgtctggcaccagaaacggaagctgcaggttgcagcccctgcc ctcatggagctcctcctgtcagaggagtgtggggactggatgactccagaggtaacttgt gggggaacgaacaggtaa >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_6|348_aa MNGTEGPNFYVPFSNATGVVRSPFEYPQYYLAEPWQFSMLAAYMFLLIVLGFPINFLTLY VTVQHKKLRTPLNYILLNLAVADLFMVLGGFTSTLYTSLHGYFVFGPTGCNLEGFFATLG GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLAGWSRYIP EGLQCSCGIDYYTLKPEVNNESFVIYMFVVHFTIPMIIIFFCYGQLVFTVKEAAAQQQES ATTQKAEKEVTRMVIIMVIAFLICWVPYASVAFYIFTHQGSNFGPIFMTIPAFFAKSAAI YNPVIYIMMNKQFRNCMLTTICCGKNPLGDDEASATVSKTETSQVAPA >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_6|1047_bp atgaatggcacagaaggccctaacttctacgtgcccttctccaatgcgacgggtgtggta cgcagccccttcgagtacccacagtactacctggctgagccatggcagttctccatgctg gccgcctacatgtttctgctgatcgtgctgggcttccccatcaacttcctcacgctctac gtcaccgtccagcacaagaagctgcgcacgcctctcaactacatcctgctcaacctagcc gtggctgacctcttcatggtcctaggtggcttcaccagcaccctctacacctctctgcat ggatacttcgtcttcgggcccacaggatgcaatttggagggcttctttgccaccctgggc ggtgaaattgccctgtggtccttggtggtcctggccatcgagcggtacgtggtggtgtgt aagcccatgagcaacttccgcttcggggagaaccatgccatcatgggcgttgccttcacc tgggtcatggcgctggcctgcgccgcacccccactcgccggctggtccaggtacatcccc gagggcctgcagtgctcgtgtggaatcgactactacacgctcaagccggaggtcaacaac gagtcttttgtcatctacatgttcgtggtccacttcaccatccccatgattatcatcttt ttctgctatgggcagctcgtcttcaccgtcaaggaggccgctgcccagcagcaggagtca gccaccacacagaaggcagagaaggaggtcacccgcatggtcatcatcatggtcatcgct ttcctgatctgctgggtgccctacgccagcgtggcattctacatcttcacccaccagggc tccaacttcggtcccatcttcatgaccatcccagcgttctttgccaagagcgccgccatc tacaaccctgtcatctatatcatgatgaacaagcagttccggaactgcatgctcaccacc atctgctgcggcaagaacccactgggtgacgatgaggcctctgctaccgtgtccaagacg gagacgagccaggtggccccggcctaa >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_7|47_aa MGLSGKGNWKLGPMNIPKEGLMTAFCIPVTVPAYSPTPSPGLALMPS >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_7|144_bp atgggcttgtctggcaagggaaactggaaactggggcccatgaacatccccaaggaaggt ctgatgactgcattctgcattcctgtgactgtccctgcctacagcccaacccccagccct ggtctggccttgatgcctagctaa >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_8|330_aa MAPGSVTSDISPSSTSTAGSSRSPESEKPGPSHGGVPPGGPSHSSLPVGRRHPPVLRMVL EALQAGEQRRGTSVAAIKLYILHKYPTVDVLRFKYLLKQALATGMRRGLLARPLNSKARG ATGSFKLVPKHKKKIQPRKMAPATAPRRAGEAKGKGPKKPSEAKEDPPNVGKVKKAAKRP AKVQKPPPKPGAATEKARKQGGAAKDTRAQSGEARKVPPKPDKAMRAPSSAGDAEAYRKT KAESKSSKPTASKVKNGAASPTKKKVVAKAKAPKAGQGPNTKAAAPAKGSGSKVVPAHLS RKTEAPKGPRKAGLPIKASSSKVSSQRAEA >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_8|993_bp atggctcctgggagcgtcaccagcgacatctcaccctcctcgacttccacagcaggatca tccaggtctcctgaatctgaaaagccaggcccgagccacggcggtgtcccaccaggaggc ccgagccacagcagcctcccggtgggacgccgccaccccccggtgctacgcatggtgctg gaggcgctgcaggctggggagcagcgccggggcacgtcggtggcagctatcaagctctac atcctgcacaagtacccaacagtggacgtcctccgcttcaagtacctgctgaagcaggcg ctggccactggcatgcgccgtggcctcctcgccaggcccctcaactccaaagccaggggg gccactggcagcttcaaattagttcccaagcacaagaagaaaatccagcccaggaagatg gcccccgcgacggctcccaggagagcgggtgaggccaaggggaagggccccaagaaacca agtgaggccaaggaggaccctcccaacgtgggcaaggtgaaaaaggcagccaagaggcca gcaaaggtgcagaagcctcctcccaagccaggcgcagccacagagaaggctcgcaagcaa ggcggcgcggccaaggacaccagggcacagtcgggagaggctaggaaggtgccccccaag ccagacaaggccatgcgggcaccttccagtgctggagatgctgaggcctacaggaaaacc aaagctgagagtaagagttcaaaacccacggccagcaaggtcaagaatggtgctgcttcc ccgaccaaaaagaaggtggtggccaaggccaaggcccctaaagctgggcaggggccaaac accaaggctgctgctcctgctaagggcagtgggtccaaggtggtacctgcacatttgtcc aggaagacagaggcccccaagggccctagaaaggctgggctgcccatcaaggcctcatca tccaaagtgtccagccagagggctgaagcttag >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_9|1833_aa MADLVPEGTLWLLVCIGPARCWQDPGAEALTRSPHCGWTSSWNPARRCLRKAPSSPVFLA VSTAVSTGSMLENVMHVLPWEMCDQHSAITGVGGLSHILKTCRLGMPGEGTAWVQTPGAL KCQGEYECLRGMGWALAAEGTFTMVQEPRLQPEQLDCGAAHLQHPLSILQPLKATPVFRA PGLTSVAVASVNNYTAVFLGTVNGRLLKINLNESMQVVSRRVVTVAYGEPVHHVMQFDPA DSGYLYLMTSHQMARVKVAACNVHSTCGDCVGAADAYCGWCALETRCTLQQDCTNSSQQH FWTSASEGPSRCPAMTVLPSEIDVRQEYPGMILQISGSLPSLSGMEMACDYGNNIRTVAR VPGPAFGHQIAYCNLLPRDQFPPFPPNQDHVTVEMSVRVNGRNIVKANFTIYDCSRTAQV YPHTACTSCLSAQWPCFWCSQQHSCVSNQSRCEASPNPTSPQDCPRTLLSPLAPVPTGGS QNILVPLANTAFFQGAALECSFGLEEIFEAVWVNESVVRCDQVVLHTTRKSQVFPLSLQL KGRPARFLDSPEPMTGNRTQSLCRACKDLMSQGSPNLERGIWVMVYNCAMGSPDCSQCLG REDLGHLCMWSDGCRLRGPLQPMAGTCPAPEIHAWVCTAVPCWVPVTVGRSLLACSLRSH SPLSRVTWQIEPLSGPLDGGTLLTIRGRNLGRRLSDVAHGVWIGGVACEPLPDRYTVSEE VAVGAKDRWLTTAPPLRIVCVTGPAPGPLSGVVTVNASKEGKSRDRFSYVLPLVHSLEPT MGPKAGGTRITIHGNDLHVGSELQVLVNDTDPCTELMRTDTSIACTMPEGALPAPVPVCV RFERRGCVHGNLTFWYMQNPVITAISPRRSPVRLSGAAGALIGFAVPRSGGRTITVAGER FHMVQNVSMAVHHIGREPTLCKVLNSTLITCPSPGALSNASAPVDFFINGRAYADEVAVA EELLDPEEAQRGSRFRLDYLPNPQFSTAKREKWIKHHPGEPLTLVIHVSTKGAGKEQDSL GLQSHEYRVKIGQVSCDIQIVSDRIIHCSVNESLGAAVGQLPITHLNSLSVFTLNSSVQQ SRALHPHLTYEETEAQELSDLLEIQVGNFNQTIATLQLGGSETAIIVSIVICSVLLLLSV VALFVFCTKSRRAERYWQKTLLQMEEMESQIREEIRKGDGFSGRSRLSVCPPGFAELQTD MTDLTKELNRSQGIPFLEYKHFVTRTFFPKCSSLYEERYVLPSQTLNSQGSSQAQETHPL LGEWKIPESCRPNMEEGISLFSSLLNNKHFLIVFVHALEQQKDFAVRDRCSLASLLTIAL HGKLEYYTSIMKELLVDLIDASAAKNPKLMLRRTESVVEKMLTNWMSICMYSCLRETVGE PFFLLLCAIKQQINKGSIDAITGKARYTLSEEWLLRENIEAKPRNLNVSFQGCGMDSLSV RAMDTDTLTQVKEKILEAFCKNVPYSQWPRAEDVDLEWFASSTQSYILRDLDDTSVVEDG RKKLNTLAHYKIPEGASLAMSLIDKKDNTLGRVKDLDTEKYFHLVLPTDELAEPKKSHRQ SHRKKVLPEIYLTRLLSTKGTLQKFLDDLFKAILSIREDKPPLAVKYFFDFLEEQAEKRG ISDPDTLHIWKTNSLPLRFWVNILKNPQFVFDIDKTDHIDACLSVIAQAFIDACSISDLQ LGKDSPTNKLLYAKEIPEYRKIVQRYYKQIQDMTPLSEQEMNAHLAEESRKYQNEFNTNV AMAEIYKYAKRYRPQIMAALEANPTARRTQLQHKFEQVVALMEDNIYECYIGPAAKSPVH PGGQAQTGGEEAASPEAAAQKLYLIHTGAGSPP >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_9|5502_bp atggctgatctggttccggaagggaccctgtggctgctggtctgtattggccctgccagg tgctggcaggatccaggggcggaggcgctcacccggagcccccactgtggctggaccagc tcatggaatccagcccggagatgtttgcgcaaggctccaagcagccctgtcttcttggct gtgtccacagcggtcagcacaggctccatgctggagaacgtcatgcatgtgctgccatgg gaaatgtgtgaccagcacagtgccatcacaggagttggggggctgagccacatcttgaaa acttgccgtctgggcatgcctggagagggaacagcatgggtccagacaccgggggcgctg aagtgccaaggggagtacgaatgcctcagaggcatgggatgggcattggctgctgaaggg acatttacaatggtacaagagccccggctccagccagagcagctggactgtggagctgct cacctgcagcacccgctgtccatcctgcagcccctgaaggccacgcccgtgttccgcgcc ccgggcctcacctccgtggccgtggccagcgtcaacaactacacagcggtcttcctgggc acggtcaacgggaggcttctcaagatcaacctgaacgagagcatgcaggtggtgagcagg cgggtggtgactgtggcctatggggagcccgtgcaccatgtcatgcagtttgacccagca gactccggttacctttacctgatgacgtcccaccagatggccagggtgaaggtcgccgcc tgcaacgtgcactccacctgtggggactgcgtgggtgcggcggacgcctactgcggctgg tgtgccctggagacgcggtgcaccttgcagcaggactgcaccaattccagccagcagcat ttctggaccagtgccagcgagggccccagccgctgtcctgccatgaccgtcctgccttcc gagatcgatgtgcgccaggagtacccaggcatgatcctgcagatctcgggcagcctgccc agcctcagtggcatggagatggcctgtgactatgggaacaacatccgcactgtggctcgg gtcccaggccctgcctttggtcaccagattgcctactgcaacctcctgccgagggaccag tttccgcccttcccccccaaccaggaccacgtgactgttgagatgtctgtgagggtcaat gggcggaacatcgtcaaggccaatttcaccatctacgactgcagccgcactgcacaagtg tacccccacacagcctgtaccagctgcctgtcggcacagtggccctgtttctggtgcagc cagcagcactcctgtgtttccaaccagtctcggtgcgaggcctcaccaaaccccacgagc cctcaggactgcccccggaccctgctctcacccctggcacccgtgcctacgggtggctcc cagaacatcctggtgcctctggccaacactgcctttttccagggtgcagccctggagtgt agttttgggctggaggagatcttcgaggctgtgtgggtgaatgagtctgttgtacgctgt gaccaggtggtgctgcacacgacccggaagagccaggtgttcccgctcagcctccaacta aaggggcggccagcccgattcctggacagccctgagcccatgacaggaaacaggacacag tccctctgcagagcttgtaaggacctgatgagccagggctcgcccaaccttgagcgtggc atctgggtcatggtctataactgtgccatgggcagccccgactgttcccagtgcctgggc cgcgaagacctgggtcacctgtgcatgtggagtgatggctgccgcctgcgggggcctctg cagcccatggctggcacctgccccgcccccgagatccacgcgtgggtctgcacggctgtg ccctgctgggtgcctgtgactgtgggcagaagcctgctggcctgcagcctcagatcccat tcgcccctctctcgggtcacctggcagattgagcccctgagtggcccgttggacggtggg accctgctgaccatccgaggaaggaacctgggccggcggctcagtgacgtggcccacggc gtgtggattggtggtgtggcctgtgagccactgcctgacagatacacggtgtcggaggag gtggccgtgggggccaaggaccgctggctgaccacagcacctcccctcaggatcgtgtgt gtcacagggccagccccaggaccactctcaggtgtggtgaccgtgaacgcctctaaggag ggcaagtcccgggaccgcttctcctacgtgctgcccctggtccactccctggagcctacc atgggccccaaggccgggggcaccaggatcaccatccatgggaatgacctccatgtaggc tccgagctccaggtcctggtgaacgacacagacccctgcacggagctgatgcgcacagat accagcatcgcctgcaccatgcctgagggggccctgccggctccggtgcctgtgtgtgtg cgcttcgagcgtcggggctgcgtgcacggcaacctcaccttctggtacatgcagaacccg gtcatcacggccatcagtccccgccgcagccctgtcaggctctctggggctgcaggtgca ctgataggttttgctgtcccccgcagtggcggcaggaccatcacagtggctggtgagcgt ttccacatggtgcagaatgtgtccatggccgtccaccacattggccgggagcccacgctc tgcaaggttctcaactccaccctcatcacctgcccgtcccccggggccctgagcaacgca tcagcgccagtggacttcttcatcaatgggcgggcctacgcagacgaggtggctgtggct gaggagctactggaccccgaggaggcacagcggggcagcaggttccgcctggactacctc cccaacccacagttctctacggccaagagggagaagtggatcaagcaccaccccggggag cctctcaccctcgttatccacgtgagcaccaaaggggccgggaaggagcaggacagcctg gggctccagagtcacgagtaccgggtcaagataggccaagtaagctgcgacatccagatt gtctctgacagaatcatccactgctcggtcaacgagtccctgggcgcggccgtggggcag ctgcccatcacacacctgaattcactgagcgttttcacgctcaacagctcagtccagcaa agcagagctttacatccccacttgacatatgaggaaactgaggcccaagaacttagtgac ttgcttgagatccaggtagggaacttcaaccagaccatcgccacactgcagctggggggc agcgagacggccatcatcgtgtccatcgtcatctgcagcgtcctgctgctgctctccgtg gtggccctgttcgtcttctgtaccaagagccgacgtgctgagcgttactggcagaagacg ctgctgcagatggaggagatggaatctcagatccgagaggaaatccgcaaaggggatggg ttctcgggccgctcacggctgtccgtgtgtccgccaggcttcgctgagctgcagacagac atgacagatctcaccaaggagctgaaccgcagccagggcatccccttcctggagtataag cacttcgtgacccgcaccttcttccccaagtgttcctccctttatgaagagcgttacgtg ctgccctcccagaccctcaactcccagggcagctcccaggcacaggaaacccacccactg ctgggagagtggaagattcctgagagctgccggcccaacatggaagagggaattagcttg ttctcctcactactcaacaacaagcacttcctcatcgtctttgtccacgcgctggagcag cagaaggactttgcggtgcgcgacaggtgcagcctggcctcgctgctgaccatcgcgctg cacggcaagctggagtactacaccagcatcatgaaggagctgctggtggacctcattgac gcctcggccgccaagaaccccaagctcatgctgcggcgcacagagtctgtggtggagaag atgctcaccaactggatgtccatctgcatgtacagctgtctgcgggagacggtgggggag ccattcttcctgctgctgtgtgccatcaagcagcaaatcaacaagggctccatcgacgcc atcacaggcaaggcccgctacacactcagtgaggagtggctgctgcgggagaacatcgag gccaagccccggaacctgaacgtgtccttccagggctgtggcatggactcgctgagcgtg cgggccatggacaccgacacgctgacacaggtcaaggagaagatcctggaggccttctgc aagaatgtgccctactcccagtggccgcgtgcagaggacgtcgaccttgagtggttcgcc tccagcacacagagctacatccttcgggacctggacgacacctcagtggtggaagacggc cgcaagaagcttaacacgctggcccattacaagatccctgaaggtgcctccctggccatg agtctcatagacaagaaggacaacacactgggccgagtgaaagacttggacacagagaag tatttccatttggtgctgcctacggacgagctggcggagcccaagaagtctcaccggcag agccatcgcaagaaggtgctcccggaaatctacctgacccgcctgctctccaccaagggc acgttgcagaagtttctggatgacctgttcaaggccattctgagtatccgtgaagacaag cccccactggctgtcaagtactttttcgacttcctggaggagcaggctgagaagagggga atctccgaccccgacaccctacacatctggaagaccaacagccttcctctccggttctgg gtgaacatcctgaagaacccccagtttgtctttgacatcgacaagacagaccacatcgac gcctgcctttcagtcatcgcgcaggccttcatcgacgcctgctccatctctgacctgcag ctgggcaaggattcgccaaccaacaagctcctctacgccaaggagattcctgagtaccgg aagatcgtgcagcgctactacaagcagatccaggacatgacgccgctcagcgagcaagag atgaatgcccatctggccgaggagtcgaggaaataccagaatgagttcaacaccaatgtg gccatggcagagatttataagtacgccaagaggtatcggccgcagatcatggccgcgctg gaggccaaccccacggcccggaggacacaactgcagcacaagtttgagcaggtggtggct ttgatggaggacaacatctacgagtgctacattggtcctgctgccaagagccccgtgcat ccaggcggccaagcacaaactgggggagaggaggccgccagcccggaggctgcagcccag aaactctacctcatccacactggtgcagggagccctccttga >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_10|464_aa MAPRAAGGAPLSARAAAASPPPFQTPPRCPVPLLLLLLLGAARAGALEIQRRFPSPTPTN NFALDGAAGTVYLAAVNRLYQLSGANLSLEAEAAVGPVPDSPLCHAPQLPQASCEHPRRL TDNYNKILQLDPGQGLVVVCGSIYQGFCQLRRRGNISAVAVRFPPAAPPAEPVTVFPSML NVAANHPNASTVGLVLPPAAGAGGSRLLVGATYTGYGSSFFPRNRSLEDHRFENTPEIAI RSLDTRGDLAKLFTFDLNPSDDNILKIKQGAKEQHKLGFVSAFLHPSDPPPGAQSYAYLA LNSEARAGDKESQARSLLARICLPHGAGGDAKKLTESYIQLGLQCAGGAGRGDLYSRLVS VFPARERLFAVFERPQGSPAARAAPAALCAFRFADVRAAIRAARTACFVEPAPDVVAVLD SVVQGTGPACERKLNIQVHPGGGGGGGGGGGGGGGGGGREPSGS >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_10|1395_bp atggctcctcgcgccgcgggcggcgcaccccttagcgcccgggccgccgccgccagcccc ccgccgttccagacgccgccgcggtgcccggtgccgctgctgttgctgctgctcctgggg gcggcgcgggccggcgccctggagatccagcgtcggttcccctcgcccacgcccaccaac aacttcgccctggacggcgcggcggggaccgtgtacctggcggccgtcaaccgcctctat cagctgtcgggcgccaacctgagcctggaggccgaggcggccgtgggcccggtgcccgac agcccgctgtgtcacgctccgcagctgccgcaggcctcgtgcgagcacccgcggcgcctc acggacaactacaacaagatcctgcagctggaccccggccagggcctggtagtcgtgtgc gggtccatctaccagggcttctgccagctgcggcgccggggcaacatctcggccgtggcc gtgcgcttcccgcccgccgcgccgcccgccgagcccgtcacggtgttccccagcatgctg aacgtggcggccaaccacccgaacgcgtccaccgtggggctagttctgcctcccgccgcg ggcgcggggggcagccgcctgctcgtgggcgccacgtacaccggttacggcagctccttc ttcccgcgcaaccgcagcctggaggaccaccgcttcgagaacacgcccgagatcgccatc cgctccctggacacgcgcggcgacctggccaagctcttcaccttcgacctcaacccctcc gacgacaacatcctcaagatcaagcagggcgccaaggagcagcacaagctgggcttcgtg agcgccttcctgcacccgtccgacccgccgccgggtgcacagtcctacgcgtacctggcg ctcaacagcgaggcgcgcgcgggcgacaaggagagccaggcgcggagcctgctggcgcgc atctgcctgccccacggcgccggcggcgacgccaagaagctcaccgagtcctacatccag ttgggcttgcagtgcgcgggcggcgcgggccgcggcgacctctacagccgcctggtgtcg gtcttcccagcccgggagcggctctttgctgtcttcgagcggccccaggggtcccccgcg gcccgcgctgctccggccgcactctgcgccttccgcttcgccgacgtgcgagccgccatc cgagctgcgcgcaccgcctgcttcgtggaaccggcgcccgacgtggtggcggtgctcgac agcgtggtgcagggcacgggaccggcctgtgagcgcaagctcaacatccaggtacacccg ggcggtggcggcggcggcggcggcggcggcggcgggggcgggggcgggggcgggcgggaa ccgagcgggtcgtga >gi568815595f:129428734_129633715|GENSCAN_predicted_peptide_11|89_aa XPLGRLRSPLHIHASVGYATLRTRSMDGEYVPQSCLLKELVPFGESWMPHSKTVVGESGA TGLKTTEHIHAALADSRRAASTPLLLAGA >gi568815595f:129428734_129633715|GENSCAN_predicted_CDS_11|270_bp ngccctctaggacgtcttcgctctcccctgcatatccacgccagtgtcggctatgctaca cttcgcacccgctccatggatggggagtatgttccccaaagctgccttctcaaggagttg gtgccttttggggagtcttggatgccccattcgaagactgtggtgggtgaatcaggagcc actggcctcaaaaccaccgagcacatccacgcggccctcgccgactcccgcagggctgcc tccaccccgctgcttcttgccggcgcatag