GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:22:14 Sequence gi568815591f:100274044_100499904 : 225861 bp : 50.20% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 605 710 106 2 1 67 105 12 0.042 1.02 1.02 Intr + 13388 13556 169 1 1 91 43 103 0.848 5.62 1.03 Intr + 13668 13809 142 1 1 89 51 24 0.047 -1.79 1.04 Intr + 33773 33948 176 2 2 42 94 137 0.552 9.28 1.05 Intr + 34931 35149 219 0 0 42 54 280 0.826 18.27 1.06 Intr + 36317 36535 219 0 0 42 54 280 0.883 18.27 1.07 Intr + 37707 37925 219 1 0 42 54 270 0.883 17.27 1.08 Intr + 39097 39315 219 2 0 42 54 280 0.883 18.27 1.09 Intr + 40489 40707 219 2 0 42 54 291 0.904 19.37 1.10 Intr + 41742 41800 59 1 2 87 116 22 0.924 3.50 1.11 Intr + 42786 42916 131 2 2 53 17 88 0.685 -2.31 1.12 Intr + 45522 45765 244 0 1 80 90 204 0.820 17.30 1.13 Term + 46848 46952 105 2 0 112 43 87 0.998 5.01 1.14 PlyA + 48131 48136 6 1.05 2.09 PlyA - 48144 48139 6 1.05 2.08 Term - 49838 49736 103 1 1 35 48 94 0.241 -2.15 2.07 Intr - 51736 51648 89 1 2 65 74 38 0.224 -1.13 2.06 Intr - 54976 54793 184 0 1 81 87 121 0.917 11.09 2.05 Intr - 56241 56139 103 1 1 75 44 27 0.981 -3.77 2.04 Intr - 56506 56420 87 2 0 62 97 121 0.999 10.34 2.03 Intr - 58538 58399 140 1 2 50 111 93 0.912 7.91 2.02 Intr - 62320 62035 286 2 1 37 70 199 0.032 9.40 2.01 Init - 68943 68826 118 0 1 111 92 37 0.160 6.77 2.00 Prom - 75101 75062 40 -4.46 3.00 Prom + 75445 75484 40 -5.26 3.01 Init + 78378 78435 58 2 1 104 109 168 0.976 19.97 3.02 Intr + 78545 78871 327 0 0 -99 75 342 0.838 9.87 3.03 Intr + 78954 79080 127 1 1 76 97 69 0.690 6.34 3.04 Intr + 82707 82840 134 0 2 91 96 24 0.207 3.79 3.05 Intr + 84177 84323 147 1 0 84 105 96 0.119 11.11 3.06 Intr + 84647 85036 390 0 0 128 96 279 0.965 27.40 3.07 Intr + 85294 85494 201 2 0 76 115 18 0.061 2.56 3.08 Intr + 98605 98632 28 2 1 86 89 8 0.076 -2.13 3.09 Intr + 99553 99677 125 1 2 104 75 117 0.899 12.23 3.10 Intr + 100001 100390 390 0 0 122 96 251 0.027 24.00 3.11 Intr + 115845 116063 219 1 0 93 115 27 0.361 4.17 3.12 Intr + 123836 123869 34 0 1 126 109 27 0.813 5.88 3.13 Intr + 125248 125297 50 1 2 104 113 31 0.836 5.52 3.14 Intr + 125538 125569 32 1 2 82 121 8 0.863 1.35 3.15 Term + 125742 125864 123 2 0 90 41 60 0.909 -0.12 3.16 PlyA + 126032 126037 6 1.05 4.14 PlyA - 126849 126844 6 1.05 4.13 Term - 127293 126971 323 1 2 66 37 315 0.980 19.18 4.12 Intr - 128533 128473 61 1 1 118 99 25 0.995 4.91 4.11 Intr - 129742 129651 92 2 2 112 61 59 0.990 5.31 4.10 Intr - 130201 130135 67 1 1 86 97 24 0.990 1.58 4.09 Intr - 131050 130970 81 1 0 74 75 74 0.892 4.53 4.08 Intr - 132755 132651 105 2 0 45 89 74 0.661 3.61 4.07 Intr - 133260 133185 76 2 1 98 117 -49 0.716 -1.48 4.06 Intr - 134619 134496 124 1 1 94 58 155 0.296 12.74 4.05 Intr - 135501 135385 117 1 0 122 75 19 0.948 4.44 4.04 Intr - 142054 141932 123 2 0 126 97 70 0.981 12.26 4.03 Intr - 145146 145068 79 0 1 112 47 84 0.600 5.82 4.02 Intr - 145754 145587 168 2 0 59 76 82 0.501 4.24 4.01 Init - 154331 154113 219 2 0 85 63 193 0.821 15.13 4.00 Prom - 154969 154930 40 -7.96 5.00 Prom + 155365 155404 40 -7.36 5.01 Init + 155985 157646 1662 2 0 91 107 1176 0.653 111.67 5.02 Intr + 158876 159094 219 1 0 116 79 263 0.998 26.70 5.03 Intr + 159220 159346 127 0 1 99 116 199 0.999 24.05 5.04 Term + 159459 159511 53 1 2 131 42 59 0.999 3.19 5.05 PlyA + 160044 160049 6 1.05 6.05 PlyA - 161244 161239 6 1.05 6.04 Term - 161490 161317 174 0 0 63 41 109 0.643 1.46 6.03 Intr - 161724 161588 137 1 2 98 59 219 0.997 20.29 6.02 Intr - 162021 161805 217 0 1 97 72 391 0.950 36.48 6.01 Init - 162331 162098 234 1 0 76 99 153 0.496 13.40 6.00 Prom - 164619 164580 40 -7.46 7.08 PlyA - 165617 165612 6 1.05 7.07 Term - 169506 169338 169 2 1 68 42 170 0.965 7.75 7.06 Intr - 172342 172170 173 1 2 31 105 95 0.474 4.24 7.05 Intr - 189643 189320 324 1 0 119 82 229 0.943 21.37 7.04 Intr - 193125 192961 165 0 0 66 -17 297 0.066 17.16 7.03 Intr - 193557 193509 49 2 1 115 95 72 0.931 9.28 7.02 Intr - 200397 200231 167 0 2 119 82 160 0.960 17.26 7.01 Init - 203995 203234 762 1 0 94 110 437 0.999 41.20 7.00 Prom - 207815 207776 40 -6.36 8.00 Prom + 208185 208224 40 -9.55 8.01 Init + 211269 211336 68 2 2 69 94 89 0.975 8.14 8.02 Intr + 212778 213139 362 0 2 116 94 303 0.998 28.56 8.03 Intr + 214109 215623 1515 0 0 107 79 971 0.996 87.71 8.04 Intr + 216471 216686 216 1 0 51 51 128 0.610 3.88 8.05 Intr + 216943 217052 110 2 2 132 94 90 0.999 14.00 8.06 Term + 219603 219860 258 2 0 50 54 386 0.753 26.95 8.07 PlyA + 221062 221067 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 85077 85132 56 1 2 110 54 38 0.895 0.32 S.002 Term + 100001 100500 500 0 2 122 38 333 0.960 26.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:100274044_100499904|GENSCAN_predicted_peptide_1|742_aa XLLLVHTSSDLVAFCAWRPPSIDFDDFPCHKPCLHKAQRLRQGLQMLMTECLTWARNGSI KHPLSLAQKSEFLFHQLQSRDNAICLIAEQKAALATGSCWCDVLVLKTDMSHLKQICSPL TAASKGPLPLCMASQGPGEHVQKKTKDRTETSFGEIGQILGKIMTSHQPQPQEEQSPQRS TSGYPLQEVVDDEVSGPSAPGVDPSPPRRSLGCKRKRECLDESDDEPEKELAPEPEETWV AETLCGLKMKAKRRRVSLVLPEYYEAFNRLLAPGVDPSPPRRSLGCKRKRECLDESDDEP EKELAPEPEETWVAETLCGLKMKAKRRRVSLVLPEYYEAFNRLLAPGVDPSPPRRSLGCK RKRECLDESDDEPEKELAPEPEETWVAETLCGLKMKAKRRRVSLVLPEYYEAFNRLLAPG VDPSPPRRSLGCKRKRECLDESDDEPEKELAPEPEETWVAETLCGLKMKAKRRRVSLVLP EYYEAFNRLLAPGVDPSPPRRSLGCKRKRECLDESDDEPEKELAPEPEETWVAETLCGLK MKAKRRRVSLVLPEYYEAFNRLLEDPVIKRFLAWDKDLRVSDKIPSEPTILGASPKTLPP ASRICIRPSNTPPPRNFHMSTVTPMLSYLANDMEEDDEAPKQKIFYFLYGKTHSHIPLRP KHWFQLCRPMNPRARKNCSQIALFQKRRFQFFCSMRCRAWVSPEELEENTGPRGDVDFQQ ELYSSANGRHQEGGEEPFVQII >gi568815591f:100274044_100499904|GENSCAN_predicted_CDS_1|2229_bp nngctactgttagtccacacttcgtctgacctagtggccttctgtgcatggcggccaccc agcatagattttgatgatttcccatgtcacaagccctgtcttcacaaggctcagagactc aggcaaggactacagatgctcatgaccgaatgtctcacctgggcccggaatggcagcatc aagcaccctctcagcctagcccagaagtcagagttcctatttcatcagttgcaaagcaga gacaatgccatctgcctgatagcagagcaaaaggcggctctggccaccggctcctgctgg tgtgatgttcttgttctaaaaacagatatgtcacacctcaagcagatttgcagtcccttg acagcagcctctaaaggacctctgcctctctgcatggcttcccagggccccggagaacat gtccagaagaagaccaaggacagaacagagactagcttcggtgagattggacagattttg ggaaagatcatgacgagccatcaaccgcagccccaggaagagcagagcccccagcggagc acctcagggtaccccctccaggaggtggtggatgatgaagtgtcgggaccatcagcccct ggggtagatcccagccccccacgtaggtcccttggctgcaaaaggaagagggagtgtttg gatgaatctgatgatgagccagagaaggagctcgcccctgagcctgaggagacctgggtg gcggagacgctgtgtggcctcaagatgaaggcgaagcgacggcgagtgtcgctcgtgctc cctgagtactacgaggccttcaacaggctgcttgcccctggggtagatcccagcccccca cgtaggtcccttggctgcaaaaggaagagggagtgtttggatgaatctgatgatgagcca gagaaggagctcgcccctgagcctgaggagacctgggtggcggagacgctgtgtggcctc aagatgaaggcgaagcgacggcgagtgtcgctcgtgctccctgagtactacgaggccttc aacaggctgcttgcccctggggtagatcccagccccccacgtaggtcccttggctgcaaa aggaagagggagtgtttggatgaatctgatgatgagccagagaaggagctcgcccctgag cctgaggagacctgggtggcagagacgctgtgtggcctcaagatgaaggcgaagcgacgg cgagtgtcgctcgtgctccctgagtactacgaggccttcaacaggctgcttgcccctggg gtagatcccagccccccacgtaggtcccttggctgcaaaaggaagagggagtgtttggat gaatctgatgatgagccagagaaggagctcgcccctgagcctgaggagacctgggtggcg gagacgctgtgtggcctcaagatgaaggcgaagcgacggcgagtgtcgctcgtgctccct gagtactacgaggccttcaacaggctgcttgcccctggggtagatcccagccccccgcgt aggtcccttggctgcaaaaggaagagggagtgtttggatgaatctgatgatgagccagag aaggagctcgcccctgagcctgaggagacctgggtggcggagacgctgtgtggcctcaag atgaaggcgaagcgacggcgagtgtcgctcgtgctccctgagtactacgaggccttcaac aggctgcttgaggatcctgtcattaaaagattcctggcctgggacaaagatctgagggtg tcagacaagatcccatcggagcccaccatcctgggagcatcacctaaaacccttcctccg gcttctcggatttgcatccgaccttcgaatacccctccaccccgcaatttccacatgagc acagtcaccccaatgctgagctatctggccaatgacatggaggaggacgacgaggccccc aaacaaaagatcttctacttcctgtacgggaagacccactctcacatacccttgcgccct aagcattggttccagttatgccgtcccatgaacccgagggccaggaagaactgctctcag atagccttgttccagaagcgtcggttccagttcttctgttccatgcgctgcagggcttgg gtttccccggaggagttggaggagaacaccggacccaggggagatgtggattttcagcag gaactttattccagtgctaatggcagacatcaggaaggaggagaggagccatttgtgcag atcatctag >gi568815591f:100274044_100499904|GENSCAN_predicted_peptide_2|369_aa MDVGWDLGEDCWQEHLHTAFSFDCWPCSQNGDRVPSVNPVGGGAPRSYLQVASAECWAAA PAVHVGEPVHAGGLHTERGADPVIGLYLVHRGGACQTPTVGNRQTPTLGIHARPRRRATT SLLTLLLAFGKNAVRTEPAKAIKPIDRKSVHQICSGPVVPSLSTAVKELVENSLDAGATN IDLKLKDYGVDLIEVSGNGCGVEEENFEGLTLKHHTSKIQEFADLPQVETFGFRGEALSS LCALSDVTISTCHVSAKVGTRLVFDHYGKIIQKTPYPHPRGMTVSVKQLFSTLPVHHKEF QRNIKKKRACFPFAFCRDCQFPEASPAMLPVQPAELELSNLRTWWEPPNLQPVGEKYMQS VDTQACSCI >gi568815591f:100274044_100499904|GENSCAN_predicted_CDS_2|1110_bp atggatgttggctgggacctcggtgaggactgctggcaagaacacctacacacggccttt tcctttgactgctggccttgctcacagaatggtgaccgggttcccagtgtgaacccagtg ggcgggggggccccgcgctcctacctgcaagtggccagtgccgagtgctgggccgccgct cctgccgtgcatgttggggagccagtacatgcaggtgggctccacacggagaggggcgcc gaccccgtgatagggctttacctggtacatcggggtggcgcgtgccagacaccaacggtc ggaaaccgccagacaccaacgctcggaatccacgccaggccacgacggagggcgactacc tcccttctgaccctgctgctggcgttcggaaaaaacgcagtccgtacagaacctgctaag gccatcaaacctattgatcggaagtcagtccatcagatttgctctgggccggtggtaccg agtctaagcactgcggtgaaggagttagtagaaaacagtctggatgctggtgccactaat attgatctaaagcttaaggactatggagtggatctcattgaagtttcaggcaatggatgt ggggtagaagaagaaaacttcgaaggcttaactctgaaacatcacacatctaagattcaa gagtttgccgacctacctcaggttgaaacttttggctttcggggggaagctctgagctca ctttgtgcactgagtgatgtcaccatttctacctgccatgtatcggcgaaggttgggact cgactggtgtttgatcactatgggaaaatcatccagaaaaccccctacccccaccccaga gggatgacagtcagtgtgaagcagttattttctacgctacctgtgcaccataaagaattt caaaggaatattaagaagaaacgtgcctgcttccccttcgccttctgccgtgattgtcag tttcctgaggcctccccagccatgcttcctgtacagcctgcagaacttgaattatcaaac ctgaggacgtggtgggaacctccaaatttgcagccagttggtgagaagtacatgcagtct gtggacacccaagcttgcagctgcatctga >gi568815591f:100274044_100499904|GENSCAN_predicted_peptide_3|794_aa MGRRTLALPWVLLTLRVTAVRMEATELSSFTIRCGFLESGSISLVTVSRGGPDGAGGTTL AVLHPELGIQQWAPARQARWETQSSISLILEGSGASSPCANTTFCCKFASFPEGSWEACG SLPPSSDPGLSVPPTPAPILRADLAGILGVSGVLLFDCGYLLHLLCRQKHREASSGSRTC GRRTSLCTSAKSSWTYRSGRLSWQSIKGTHLTITQALRQPLHRAPLLPGQLCWSPRPLEK NKAMGRPLLLPLLLLLQPPAFLQPGGSTGSGPSYLYGVTQPKHLSASMGGSVEIPFSFYY PWELAIVPNVRISWRRGHFHGQSFYSTRPPSIHKDYVNRLFLNWTEGQESGFLRISNLRK EDQSVYFCRVELDTRRSGRQQLQSIKGTKLTITQAVTTTTTWRPSSTTTIAGLRVTESKG HSESWHLSLDTAIRVALAVAVLKTVILGLLCLLLLWWRRRKGSFWISQGLRAPLLPGRLC WSPRPLEKNKAMGRPLLLPLLPLLLPPAFLQPSGSTGSGPSYLYGVTQPKHLSASMGGSV EIPFSFYYPWELATAPDVRISWRRGHFHRQSFYSTRPPSIHKDYVNRLFLNWTEGQKSGF LRISNLQKQDQSVYFCRVELDTRSSGRQQWQSIEGTKLSITQAVTTTTQRPSSMTTTWRL SSTTTTTGLRVTQGKRRSDSWHISLETAVGVAVAVTVLGIMILGLICLLRWRRRKGQQRT KATTPAREPFQNTEEPYENIRNEGQNTDPKLNPKDDGIVYASLALSSSTSPRAPPSHRPL KSPQNETLYSVLKA >gi568815591f:100274044_100499904|GENSCAN_predicted_CDS_3|2385_bp atggggcgccggaccctggccctgccctgggtgctgctgaccctgcgtgtcactgcagtt cggatggaggccaccgagctctcgtccttcaccatccgttgtgggttcctggagtctggc tccatctccctggtgactgtgagcagggggggccccgatggtgctggggggaccacgctg gctgtgttgcacccggaacttggcatccagcaatgggcccctgctcgccaggcccgctgg gaaacccagagcagcatctctctcatcctggaaggctctggggccagcagcccctgcgcc aacaccaccttctgctgcaagtttgcgtccttccctgagggctcctgggaggcctgtggg agcctcccgcccagctcagacccagggctctctgtcccgccgactcctgcccccattctg cgggcagacctggccgggatcttgggggtctcaggagtccttctctttgactgtggctac ctccttcatctgctgtgccgacagaagcaccgggaggcttcctcaggatctcgaacctgc ggaaggaggaccagtctgtgtacttctgccaagtccagctggacatacagatcagggagg ctgtcgtggcagtccatcaaggggacccacctcaccatcacccaggccctcaggcagccc ctccacagggcccctctcctgcctggacagctctgctggtctccccgtcccctggagaag aacaaggccatgggtcggcccctgctgctgcccctgctgctcctgctgcagccgccagca tttctgcagcctggtggctccacaggatctggtccaagctacctttatggggtcactcaa ccaaaacacctctcagcctccatgggtggctctgtggaaatccccttctccttctattac ccctgggagttagccatagttcccaacgtgagaatatcctggagacggggccacttccac gggcagtccttctacagcacaaggccgccttccattcacaaggattatgtgaaccggctc tttctgaactggacagagggtcaggagagcggcttcctcaggatctcaaacctgcggaag gaggaccagtctgtgtatttctgccgagtcgagctggacacccggagatcagggaggcag cagttgcagtccatcaaggggaccaaactcaccatcacccaggctgtcacaaccaccacc acctggaggcccagcagcacaaccaccatagccggcctcagggtcacagaaagcaaaggg cactcagaatcatggcacctaagtctggacactgccatcagggttgcattggctgtcgct gtgctcaaaactgtcattttgggactgctgtgcctcctcctcctgtggtggaggagaagg aaaggaagcttctggatcagccaagggctcagggcccctctcctgcctggacggctctgc tggtctccccgtcccctggagaagaacaaggccatgggtcggcccctgctgctgccccta ctgcccttgctgctgccgccagcatttctgcagcctagtggctccacaggatctggtcca agctacctttatggggtcactcaaccaaaacacctctcagcctccatgggtggctctgtg gaaatccccttctccttctattacccctgggagttagccacagctcccgacgtgagaata tcctggagacggggccacttccacaggcagtccttctacagcacaaggccgccttccatt cacaaggattatgtgaaccggctctttctgaactggacagagggtcagaagagcggcttc ctcaggatctccaacctgcagaagcaggaccagtctgtgtatttctgccgagttgagctg gacacacggagctcagggaggcagcagtggcagtccatcgaggggaccaaactctccatc acccaggctgtcacgaccaccacccagaggcccagcagcatgactaccacctggaggctc agtagcacaaccaccacaaccggcctcagggtcacacagggcaaacgacgctcagactct tggcacataagtctggagactgctgtgggggtggcagtggctgtcactgtgctcggaatc atgattttgggactgatctgcctcctcaggtggaggagaaggaaaggtcagcagcggact aaagccacaaccccagccagggaacccttccaaaacacagaggagccatatgagaatatc aggaatgaaggacaaaatacagatcccaagctaaatcccaaggatgacggcatcgtctat gcttcccttgccctctccagctccacctcacccagagcacctcccagccaccgtcccctc aagagcccccagaacgagaccctgtactctgtcttaaaggcctaa >gi568815591f:100274044_100499904|GENSCAN_predicted_peptide_4|544_aa MADLPQLRKRRRCFQLRMRLGPASAPPSGGASAQGVLVPFLGGGKPGLRCEGLAESLRQL VPPIAFRSSLLLKEETPGISSPETEARISLPKASLKKKEEKATMKNVPSREQEKKRKAQI NKQAEKKEKEKSSLTNAEFEEIVQIVLQKSLQECLEDEKVEKTQGGHEHRQEDRLKKTVQ DHSQIRDQQKGEISGFGQCLVWVQCSFPNCGKWRRLCGNIDPSVLPDNWSCDQNTADVQY NRCDIPEETWTGLESDVAYASYIPGSIIWAKQYGYPWWPGMIESDPDLGEYFLFTSHLDS LPSKYHVTFFGETVSRAWIPVNMLKNFQELSLELSVMKKRRNDCSQKLGVALMMAQEAEQ ISIQERVNLFGFWSRFNGSNSNGERKDLQLSGLNSPGSCLEKKEKEEELEKEEGEKTDPI LPIRKRVKIQTQKTKPRGPKKKFKAPQSKALAASFSEGKEVRTVPKNLGLSACKGACPSS AKEEPRHREPLTQEAGSVPLEDEASSDLDLEQLMEDVGRELGQSGELQHSNSDGEDFPVA LFGK >gi568815591f:100274044_100499904|GENSCAN_predicted_CDS_4|1635_bp atggcggatttacctcagctgcggaaacgcagacgctgtttccagttgcgcatgcgcctc ggccccgcgagcgcaccgccctcgggtggagctagtgctcagggcgtcctcgtgcctttt cttggtggcgggaaacctgggttgaggtgtgaggggcttgcggagtcgctgcggcagctg gttccgcccatcgcctttaggtcctccttgctcctgaaggaggagaccccggggatcagt tccccagagacagaggccaggataagcctgccaaaggccagtttaaagaagaaagaggaa aaagcaaccatgaagaatgttccaagcagggaacaggagaaaaaaagaaaggcacaaatc aacaagcaagcagagaagaaagaaaaggaaaaatcaagtcttaccaatgcagaatttgag gagattgtccagattgttctgcagaagtcccttcaggagtgcttggaagatgagaaggtg gagaaaactcaaggtggacatgagcacagacaggaagaccgactaaagaaaacagttcag gatcattctcagatcagggaccagcaaaaaggagagataagtggttttggtcaatgtctg gtctgggtccagtgttccttcccaaactgtgggaaatggaggcggctgtgtgggaacatt gacccctcagttctcccagataattggtcctgtgatcagaacacagcagatgtgcagtat aatcgctgtgatattcctgaggagacctggacagggcttgagagtgatgtggcctatgcc tcctacatcccaggatccatcatctgggccaagcaatacggttacccctggtggccaggc atgatagaatctgatcctgacttaggggaatattttctttttacttcccatcttgattcc ctgccgtctaagtaccatgtgacgttttttggagaaacagtttctcgtgcatggatccca gtcaacatgctaaagaacttccaggagctgtccctggagctatcagtcatgaaaaagcgc agaaatgactgcagccagaaactgggggtggccctgatgatggctcaagaggcagaacag atcagcattcaggaacgggttaacttgtttggtttctggagccgattcaacggatctaac agtaatggggaaagaaaagacttacagctctctggtttgaacagcccaggatcctgctta gagaaaaaggagaaagaggaagagttggaaaaggaggaaggagagaaaacagacccaatt ttgcccattcgtaagcgagtcaaaatacagacccaaaaaaccaagccaagaggccctaag aaaaaatttaaagctccccagagcaaggccttggcagccagcttttcagagggaaaagaa gttagaacagtgccaaagaacctgggcctatcagcgtgtaagggggcctgcccctcatct gcgaaagaagagcccagacaccgggaacccctgacccaggaggctggaagtgtccccctt gaggacgaagcctccagtgacctggacctggagcaactcatggaagatgttgggagagag ctggggcagagcggggagctgcagcacagcaacagtgatggcgaggacttccccgtggcg ctgtttgggaagtag >gi568815591f:100274044_100499904|GENSCAN_predicted_peptide_5|686_aa MAAEKEPFLVPAPPPPLKDESGGGGGPTVPPHQEAASGELRGGTERGPGRCAPSAGSPAA AVGRESPGAAATSSSGPQAQQHRGGGPQAQSHGEARLSDPPGRAAPPDVGEERRGGGGTE LGPPAPPRPRNGYQPHRPPGGGGGKRRNSCNVGGGGGGFKHPAFKRRRRVNSDCDSVLPS NFLLGGNIFDPLNLNSLLDEEVSRTLNAETPKSSPLPAKGRDPVEILIPKDITDPLSLNT CTDEGHVVLASPLKTGRKRHRHRGQHHQQQQAAGGSESHPVPPTAPLTPLLHGEGASQQP RHRGQNRDAPQPYELNTAINCRDEVVSPLPSALQGPSGSLSAPPAASVISAPPSSSSRHR KRRRTSSKSEAGARGGGQGSKEKGRGSWGGRHHHHHPLPAAGFKKQQRKFQYGNYCKYYG YRNPSCEDGRLRVLKPEWFRGRDVLDLGCNVGHLTLSIACKWGPSRMVGLDIDSRLIHSA RQNIRHYLSEELRLPPQTLEGDPGAEGEEGTTTVRKRSCFPASLTASRGPIAAPQVPLDG ADTSVFPNNVVFVTGNYVLDRDDLVEAQTPEYDVVLCLSLTKWVHLNWGDEGLKRMFRRI YRHLRPGGILVLEPQPWSSYGKRKTLTETIYKNYYRIQLKPEQFSSYLTSPDVGFSSYEL VATPHNTSKGFQRPVYLFHKARSPSH >gi568815591f:100274044_100499904|GENSCAN_predicted_CDS_5|2061_bp atggcggcggagaaggagccgtttctggtgccggccccgccgccgccgctcaaagatgag tcgggcggagggggcggccccacggtgccaccgcaccaagaggccgcctctggggagctc cgcggcgggacggagcgtggtccgggtcgttgcgcgccatctgcggggtccccagccgct gcggtcggtcgggaaagccccggggccgcggccacctcctccagtggtccccaggcgcag cagcaccgagggggcggcccccaggcgcagtcgcatggggaggcccgcctgtcggatccc ccggggcgagccgctcccccggacgtgggggaggagcgccggggagggggcgggacagag ctgggtccccctgctcctcctcgaccccgcaatggctatcagccccaccggccacctggg gggggcgggggcaagaggagaaatagctgtaatgtagggggaggcgggggaggcttcaaa catccggccttcaagaggcgcaggcgggtgaattcggactgtgactctgtgttaccctcc aacttcctcctggggggcaatatctttgatcccctgaacctgaatagcctcctggatgag gaagtgagccgcactctcaacgcggagacccctaagtcatccccccttccggccaaaggg cgagatccggtggagatcctcatccccaaagatattactgacccgctcagtctcaatact tgcactgatgagggccatgtagttcttgcttcgccactcaagactggtcggaagcggcat agacaccggggacagcaccaccagcagcagcaggcagccggagggagtgagagtcacccc gtgccgcccacagcccctctcacccccttactccacggggagggcgcctcacagcagccg cggcacaggggccagaaccgggatgccccccaaccctatgaactcaacacagccatcaac tgcagggatgaagtggtgtctccccttccatctgctctgcagggtccctcaggctcccta tcagcccctccagctgcctcagttatctctgcacccccatcttcctcctcccgacatcgc aaacgtcgcaggacttccagcaagtcggaggcaggggctaggggtggaggccagggttcc aaggaaaagggccgagggagttggggaggccgccaccaccaccaccacccactgcctgca gcaggcttcaaaaagcaacagcgcaagttccagtatgggaattattgcaaatactatggg taccgcaatccttcctgtgaggatgggcgccttcgggtgttgaagcctgagtggtttcgg ggccgggacgtcctagatctgggctgcaatgtgggccatctgaccctgagcattgcctgc aagtggggcccgtcccgcatggtgggcctggatatcgattcccggctcatccattctgcc cgccaaaacatccgacactacctttccgaggagctgcgtctcccaccccagactttggaa ggggacccgggggcagagggtgaggaagggaccaccaccgttcgaaagaggagctgcttc ccagcctcgctgactgccagccggggtcccatcgctgccccccaagtgcccttggatgga gcggacacatcagtcttccccaacaatgttgtcttcgtcacgggtaattatgtgctggat cgagatgacctggtggaggcccaaacacctgagtatgatgtggtgctctgcctcagcctc accaagtgggtgcatctgaactggggagacgagggcctgaagcgcatgtttcgccggatc taccggcacctacgccctgggggcatcctggtcctagagccccaaccctggtcgtcgtat ggcaagagaaagactcttacagaaacgatctacaagaactactaccgaatccaattgaag ccagagcagttcagttcctacctgacatccccagacgtgggcttctccagctatgagctt gtggccacaccccacaacacctctaaaggcttccagcgtcctgtgtacctgttccacaag gcccgatcccccagccactaa >gi568815591f:100274044_100499904|GENSCAN_predicted_peptide_6|253_aa MMMGCGESELKSADGEEAAAVPGPPPEPQVPQLRAPVPEPGLDLSLSPRPDSPQPRHGSP GRRKGRAERRGAARQRRQVRFRLTPPSPVRSEPQPAVPQELEMPVLKSSLALGLELRAAA GSHFDAAKAVEEQLRKSFQIRCGLEESVSEGLNVPRSKRLFRDLVSLQVPEEQVLNAALR EKLALLPPQARAPHPKEPPGPGPDMTILCDPETLFYESPHLTLDGLPPLRLQLRPRPSED TFLMHRTLRRWEA >gi568815591f:100274044_100499904|GENSCAN_predicted_CDS_6|762_bp atgatgatgggttgtggggagtcagagctgaagtcggcggacggggaagaagccgcggcg gtcccggggccacccccggagccccaagtcccgcaactccgagccccagtgcccgagccc ggcctggacttgagcctgagcccgcggcccgacagccctcagccgcggcacggcagcccc gggcggcggaaggggcgggcggagcggcggggcgcggctcggcagcggcggcaggtccgc ttccgcctgacgccgccctccccggtgcggtccgagccgcagcctgcggtgccgcaggag ctggagatgcccgtgctgaagagcagcctggccttgggcctggagctgcgggccgcagcc gggagccactttgatgctgcgaaggccgtggaggaacagctgagaaagtcgttccagatc cgctgcggcctggaggagagcgtgtccgaggggctgaacgtgccgcgctccaagcggctc ttccgggacctggtgagcctgcaggtgccggaggaacaggttctgaatgccgcgctcagg gagaaattggctctcctgccgccacaggctcgagccccgcacccaaaggagccacctggg cctgggccagacatgaccatcttgtgtgacccagaaacgctattttatgaatctccacac ctgaccctggacggtctgccccctctccgacttcaactccggccccgcccttcagaggac accttcctcatgcaccggacactgaggcgatgggaagcgtag >gi568815591f:100274044_100499904|GENSCAN_predicted_peptide_7|602_aa MSGGKKKSSFQITSVTTDYEGPGSPGASDPPTPQPPTGPPPRLPNGEPSPDPGGKGTPRN GSPPPGAPSSRFRVVKLPHGLGEPYRRGRWTCVDVYERDLEPHSFGGLLEGIRGASGGAG GRSLDSRLELASLGLGAPTPPSGLSQGPTSWLRPPPTSPGPQARSFTGGLGQLVVPSKAK AEKPPLSASSPQQRPPEPETGESAGTSRAATPLPSLRVEAEAGGSGARTPPLSRRKAVDM RLRMELGAPEEMGQVPPLDSRPSSPALYFTHDASLVHKSPDPFGAVAAQKFSLAHSMLAI SGHLDSDDDSGSGSLVGIDNKIEQAMDLVKSHLMFAVREEVEVLKEQIRELAERNAALEQ ENGLLRALASPEQLAQLPSSGVFFWRQKIKPTISGHPDSKKHSLKKMEKTLQVVETLRLV ELPKEAKPKLGESPELADPCVLAKTTEETEVELGQQGQSLLQLPRTAVKSVSTLMVSALQ SGWQMCSWKPPRDGDTAKVKVSGKTLREVQLCLFPVTAEIVQIATYYILDLLNSDLGIRK EVAGKQSLSWTVEKPHTNISLSNRLSWTVEKPHTNISLSNPEASMESLSTPRPPNISCVL IE >gi568815591f:100274044_100499904|GENSCAN_predicted_CDS_7|1809_bp atgagcgggggcaagaagaagagtagtttccaaatcaccagcgtcaccacggactatgag ggccctgggagcccaggggcttcggatccccctaccccacagcccccaaccgggcccccg ccccgcctgcccaatggggagcccagccccgatccggggggcaagggcaccccccggaat ggctccccaccacctggggccccttcctcccgtttccgggtggtgaagctgccccacggc ctgggagagccttatcgccgcggtcgctggacgtgtgtggatgtttatgagcgagacctg gagccccacagcttcggcggactcctggagggaattcgaggggcctcagggggcgccggg ggcagatctttggattccaggttggagctggccagcctcggcctgggcgcccccacccca ccgtcaggcctgtctcagggccccacctcctggctccgtccaccccccacctctcctgga cctcaggcccgctccttcactgggggactgggccagctggtggtgcccagcaaagccaag gcagagaaacccccactgtcggcctcctcaccccagcagcgccccccagagcctgagacc ggtgagagtgcgggcacatcccgggctgccacgcccctgccctctctgagggtggaagcg gaggctgggggctcaggggccaggacccctccactgtcccggaggaaagctgtagacatg cggctgcggatggagttgggtgctccagaagagatggggcaggtgcccccacttgactct cgccccagctccccagccctctacttcacccacgatgccagcctggttcacaaatctcca gaccccttcggagcagtagcagctcagaagttcagcctggcccactccatgttggccatc agtggtcacctagacagcgacgatgatagtggctccggaagcctggttggcattgacaac aaaatcgagcaagccatggacttggtgaagtcccacctcatgtttgcggtccgggaggag gtggaggtgctgaaggagcagatccgggaattggcggagcggaacgctgcgctggagcag gagaatgggctgctgcgcgccctggccagcccggagcagctggctcagctgccctcctcg ggggttttcttctggaggcaaaaaattaaaccaaccatctcaggacaccctgactccaag aaacactcattgaagaagatggagaagactctccaggtggttgagactttgaggttggtc gagctcccaaaagaggctaagcccaagttgggtgagtcccccgagctggcagatccctgc gtgttggccaagactacagaggagaccgaggtggagctgggccaacagggccaatcccta ctgcagctgccgaggacggccgtcaagtctgtctccacgctcatggtctctgccctgcag agcggctggcagatgtgcagctggaagccccccagagatggggatactgccaaggtgaag gtttcggggaagaccctcagggaggttcagctgtgtttgtttcctgtgactgctgaaatt gtacagattgccacatactacatcctagacctcctgaattctgatttggggatcaggaag gaggttgctgggaagcagagcctctcttggactgtggaaaaacctcacaccaacatctcc ttgtccaatcgcctctcttggactgtggaaaaacctcacaccaacatctccttgtccaat ccagaggcttccatggaatcactctccaccccaagaccccccaacatctcctgtgtgctc attgagtaa >gi568815591f:100274044_100499904|GENSCAN_predicted_peptide_8|842_aa MNLLYRKTKLEWRQHKEEEAKRSSSKEVAPAGSAGPAAGQGPGVRVRDIASLRRSLRMGF MTMPASQEHTPHPCRSAMAPRSLSCHSVGSMDSVGGGPGGASGGLTEDSSTRRPPAKPRR HPSTKLSMVGPGSGAETPPSKKAGSQKPTPEGRESSRKVPPQKPRRSPNTQLSVSFDESC PPGPSPRGGNLPLQRLTRGSRVAGDPDVGAQEEPVYIEMVGDVFRGGGRSGGGLAGPPLG GGGPTPPAGADSDSEESEAIYEEMKYPLPEEAGEGRANGPPPLTATSPPQQPHALPPHAH RRPASALPSRRDGTPTKTTPCEIPPPFPNLLQHRPPLLAFPQAKSASRTPGDGVSRLPVL CHSKEPAGSTPAPQVPARERETPPPPPPPPAANLLLLGPSGRARSHSTPLPPQGSGQPRG ERELPNSHSMICPKAAGAPAAPPAPAALLPGPPKDKAVSYTMVYSAVKVTTHSVLPAGPP LGAGEPKTEKEISVLHGMLCTSSRPPVPGKTSPHGGAMGAAAGVLHHRGCLASPHSLPDP TVGPLTPLWTYPATAAGLKRPPAYESLKAGGVLNKGCGVGAPSPMVKIQLQEQGTDGGAF ASISCAHVIASAGTPEEEEEEVGAATFGAGWALQRKVLYGGRKAKELDTEVEDGARAWNG SAEGPGKVEREDRGPGTSGIPVRSQGAEGLLARIHHGDRGGSRTALPIPCQTFPACHRNG DFTGGYRLGRSASTSGVRQVVLHTPRPCSQPRDALSQPHPALPLPLPLPPQPARERDGKL LEVIERKRCVCKEIKARHRPDRGLCKQESMPILPSWRRGPEPRKSGTPPCRRQHTVLWDT AI >gi568815591f:100274044_100499904|GENSCAN_predicted_CDS_8|2529_bp atgaacctcctctaccgaaaaaccaagctggagtggaggcagcacaaggaagaggaggcc aagaggagctccagtaaggaggtggcccccgctggctcggctgggcccgcggccggccag gggcctggggtccgcgtgcgggacatcgcctcgctgcggcgctccctcaggatgggtttc atgacgatgcccgcctcccaggagcacaccccgcacccctgccgcagcgccatggcccca cgctccctctcctgccactcggtgggcagcatggacagtgtcgggggtggccctggcggg gccagtgggggcctcacagaggacagcagcacccgaagaccccctgccaagccccggaga caccccagcaccaagctcagcatggtggggcctgggtctggggcagagacgccccccagc aagaaagcaggctcacagaagccaaccccagagggccgagagtccagccggaaggttcct ccgcagaagcccaggcgaagccctaacacccagctctctgtctccttcgatgagtcctgc cccccaggcccctctcctcgaggggggaacctgcctcttcagcgcctcactagggggtcc cgagtagctggggaccctgatgtgggtgcccaggaagagcctgtgtacattgagatggtg ggggacgtctttaggggaggaggacgaagtggaggaggcctggctgggccccctcttggg ggtgggggcccgacccctccagcgggcgccgactcggactctgaagagagtgaggccatc tatgaagagatgaagtacccgctgccggaagaggctggggaaggccgggccaatggccct ccaccattgacggcaacatccccgccacaacagcctcacgcccttccgccccatgcccac cgccgcccagcttcagccctcccgagccggagggacgggacgcccaccaagaccactcct tgtgaaatccccccgcccttccccaacctccttcagcaccggcctccactcctggccttc ccccaagccaagtctgcttcccgaacccctggcgatggggtctcaaggctacctgtcctc tgccactccaaggagccagccggctccaccccagctccccaagtgcctgcacgggagcgg gagacgcctcccccaccgcctccacctcctgctgccaacctgctgctgctgggaccatcg ggccgggcccggagccactcgacaccgttgccaccccagggctctggccagccccggggg gagcgggagctccccaactcccacagcatgatctgccctaaggcggcgggggcgccggca gccccccctgccccggccgccttgctccccggcccccccaaggacaaggccgtgtcttac accatggtgtactcggcggtcaaggtgaccacgcactctgtcctgccagctggtccaccc ctgggtgctggggagccaaagacggagaaggagatctcggtcctccatgggatgctgtgt accagctcaaggccccctgtgccagggaagaccagcccccacggtggggccatgggcgca gcagctggggtcctccaccaccgcggctgcctggcctccccccacagccttccggaccca actgtaggccccctgaccccgctgtggacctacccagccacagcagctgggctcaagaga ccccctgcctatgagagcctcaaggctgggggggtgctgaataagggctgtggtgtgggg gccccatcccccatggtcaagatccagctgcaggagcaagggaccgatgggggtgctttt gccagcatctcctgtgcccacgtcatcgccagcgcagggacaccagaggaggaagaagag gaggtgggcgccgcgacatttggggcaggctgggccctgcagaggaaggtcctctatgga gggagaaaagcaaaggagttggacacagaggtcgaggacggtgcccgggcctggaatggc agtgccgagggtccaggcaaggtggagcgtgaggacaggggccctgggacatcggggatc ccagtgagaagccagggggcagagggactgctggccaggatccaccatggagaccgagga gggagccgcaccgcgctgcccattccctgccagaccttccccgcctgccaccgcaatgga gacttcacgggaggctaccgcctggggcgctccgcctccacctccggagtccggcaggtc gtgctccacacaccccggccctgcagccagcccagggatgccctgagccagccccacccc gcgctgccgctgcctctgcccctgccgccccagccggcccgcgagcgtgacgggaagctg ctggaggtgatcgagcgcaagcgctgcgtgtgcaaggagatcaaggcgcgccaccgcccg gaccgaggcctctgcaagcaggagagcatgcccatcctccccagctggcggcggggaccc gagccccgcaagtccggcaccccgccctgccgccggcagcacacggtcctctgggacacc gccatctga