GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:30:01 Sequence gi568815591r:100301017_100517181 : 216165 bp : 50.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6800 6975 176 2 2 42 94 137 0.573 9.28 1.02 Intr + 7958 8176 219 0 0 42 54 280 0.827 18.27 1.03 Intr + 9344 9562 219 0 0 42 54 280 0.883 18.27 1.04 Intr + 10734 10952 219 1 0 42 54 270 0.883 17.27 1.05 Intr + 12124 12342 219 2 0 42 54 280 0.883 18.27 1.06 Intr + 13516 13734 219 2 0 42 54 291 0.904 19.37 1.07 Intr + 14769 14827 59 1 2 87 116 22 0.924 3.50 1.08 Intr + 15813 15943 131 2 2 53 17 88 0.685 -2.31 1.09 Intr + 18549 18792 244 0 1 80 90 204 0.820 17.30 1.10 Term + 19875 19979 105 2 0 112 43 87 0.998 5.01 1.11 PlyA + 21158 21163 6 1.05 2.09 PlyA - 21171 21166 6 1.05 2.08 Term - 22865 22763 103 1 1 35 48 94 0.241 -2.15 2.07 Intr - 24763 24675 89 1 2 65 74 38 0.224 -1.13 2.06 Intr - 28003 27820 184 0 1 81 87 121 0.917 11.09 2.05 Intr - 29268 29166 103 1 1 75 44 27 0.981 -3.77 2.04 Intr - 29533 29447 87 2 0 62 97 121 0.999 10.34 2.03 Intr - 31565 31426 140 1 2 50 111 93 0.912 7.91 2.02 Intr - 35347 35062 286 2 1 37 70 199 0.032 9.40 2.01 Init - 41970 41853 118 0 1 111 92 37 0.160 6.77 2.00 Prom - 48128 48089 40 -4.46 3.00 Prom + 48472 48511 40 -5.26 3.01 Init + 51405 51462 58 2 1 104 109 168 0.976 19.97 3.02 Intr + 51572 51898 327 0 0 -99 75 342 0.838 9.87 3.03 Intr + 51981 52107 127 1 1 76 97 69 0.690 6.34 3.04 Intr + 55734 55867 134 0 2 91 96 24 0.207 3.79 3.05 Intr + 57204 57350 147 1 0 84 105 96 0.119 11.11 3.06 Intr + 57674 58063 390 0 0 128 96 279 0.965 27.40 3.07 Intr + 58321 58521 201 2 0 76 115 18 0.061 2.56 3.08 Intr + 71632 71659 28 2 1 86 89 8 0.076 -2.13 3.09 Intr + 72580 72704 125 1 2 104 75 117 0.899 12.23 3.10 Intr + 73028 73417 390 0 0 122 96 251 0.027 24.00 3.11 Intr + 88872 89090 219 1 0 93 115 27 0.361 4.17 3.12 Intr + 96863 96896 34 0 1 126 109 27 0.813 5.88 3.13 Intr + 98275 98324 50 1 2 104 113 31 0.836 5.52 3.14 Intr + 98565 98596 32 1 2 82 121 8 0.863 1.35 3.15 Term + 98769 98891 123 2 0 90 41 60 0.909 -0.12 3.16 PlyA + 99059 99064 6 1.05 4.14 PlyA - 99876 99871 6 1.05 4.13 Term - 100320 99998 323 1 2 66 37 315 0.980 19.18 4.12 Intr - 101560 101500 61 1 1 118 99 25 0.995 4.91 4.11 Intr - 102769 102678 92 2 2 112 61 59 0.990 5.31 4.10 Intr - 103228 103162 67 1 1 86 97 24 0.990 1.58 4.09 Intr - 104077 103997 81 1 0 74 75 74 0.892 4.53 4.08 Intr - 105782 105678 105 2 0 45 89 74 0.661 3.61 4.07 Intr - 106287 106212 76 2 1 98 117 -49 0.716 -1.48 4.06 Intr - 107646 107523 124 1 1 94 58 155 0.296 12.74 4.05 Intr - 108528 108412 117 1 0 122 75 19 0.948 4.44 4.04 Intr - 115081 114959 123 2 0 126 97 70 0.981 12.26 4.03 Intr - 118173 118095 79 0 1 112 47 84 0.600 5.82 4.02 Intr - 118781 118614 168 2 0 59 76 82 0.501 4.24 4.01 Init - 127358 127140 219 2 0 85 63 193 0.821 15.13 4.00 Prom - 127996 127957 40 -7.96 5.00 Prom + 128392 128431 40 -7.36 5.01 Init + 129012 130673 1662 2 0 91 107 1176 0.653 111.67 5.02 Intr + 131903 132121 219 1 0 116 79 263 0.998 26.70 5.03 Intr + 132247 132373 127 0 1 99 116 199 0.999 24.05 5.04 Term + 132486 132538 53 1 2 131 42 59 0.999 3.19 5.05 PlyA + 133071 133076 6 1.05 6.05 PlyA - 134271 134266 6 1.05 6.04 Term - 134517 134344 174 0 0 63 41 109 0.643 1.46 6.03 Intr - 134751 134615 137 1 2 98 59 219 0.997 20.29 6.02 Intr - 135048 134832 217 0 1 97 72 391 0.950 36.48 6.01 Init - 135358 135125 234 1 0 76 99 153 0.496 13.40 6.00 Prom - 137646 137607 40 -7.46 7.08 PlyA - 138644 138639 6 1.05 7.07 Term - 142533 142365 169 2 1 68 42 170 0.965 7.75 7.06 Intr - 145369 145197 173 1 2 31 105 95 0.474 4.24 7.05 Intr - 162670 162347 324 1 0 119 82 229 0.943 21.37 7.04 Intr - 166152 165988 165 0 0 66 -17 297 0.066 17.16 7.03 Intr - 166584 166536 49 2 1 115 95 72 0.931 9.28 7.02 Intr - 173424 173258 167 0 2 119 82 160 0.960 17.26 7.01 Init - 177022 176261 762 1 0 94 110 437 0.999 41.20 7.00 Prom - 180842 180803 40 -6.36 8.00 Prom + 181212 181251 40 -9.55 8.01 Init + 184296 184363 68 2 2 69 94 89 0.975 8.14 8.02 Intr + 185805 186166 362 0 2 116 94 303 0.998 28.56 8.03 Intr + 187136 188650 1515 0 0 107 79 971 0.996 87.71 8.04 Intr + 189498 189713 216 1 0 51 51 128 0.610 3.88 8.05 Intr + 189970 190079 110 2 2 132 94 90 0.999 14.00 8.06 Term + 192630 192887 258 2 0 50 54 386 0.753 26.95 8.07 PlyA + 194089 194094 6 1.05 9.03 PlyA - 195860 195855 6 1.05 9.02 Term - 205553 205443 111 2 0 47 36 57 0.100 -5.04 9.01 Intr - 208611 208494 118 2 1 114 67 83 0.499 9.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 58104 58159 56 1 2 110 54 38 0.895 0.32 S.002 Term + 73028 73527 500 0 2 122 38 333 0.960 26.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_1|603_aa XVQKKTKDRTETSFGEIGQILGKIMTSHQPQPQEEQSPQRSTSGYPLQEVVDDEVSGPSA PGVDPSPPRRSLGCKRKRECLDESDDEPEKELAPEPEETWVAETLCGLKMKAKRRRVSLV LPEYYEAFNRLLAPGVDPSPPRRSLGCKRKRECLDESDDEPEKELAPEPEETWVAETLCG LKMKAKRRRVSLVLPEYYEAFNRLLAPGVDPSPPRRSLGCKRKRECLDESDDEPEKELAP EPEETWVAETLCGLKMKAKRRRVSLVLPEYYEAFNRLLAPGVDPSPPRRSLGCKRKRECL DESDDEPEKELAPEPEETWVAETLCGLKMKAKRRRVSLVLPEYYEAFNRLLAPGVDPSPP RRSLGCKRKRECLDESDDEPEKELAPEPEETWVAETLCGLKMKAKRRRVSLVLPEYYEAF NRLLEDPVIKRFLAWDKDLRVSDKIPSEPTILGASPKTLPPASRICIRPSNTPPPRNFHM STVTPMLSYLANDMEEDDEAPKQKIFYFLYGKTHSHIPLRPKHWFQLCRPMNPRARKNCS QIALFQKRRFQFFCSMRCRAWVSPEELEENTGPRGDVDFQQELYSSANGRHQEGGEEPFV QII >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_1|1812_bp nntgtccagaagaagaccaaggacagaacagagactagcttcggtgagattggacagatt ttgggaaagatcatgacgagccatcaaccgcagccccaggaagagcagagcccccagcgg agcacctcagggtaccccctccaggaggtggtggatgatgaagtgtcgggaccatcagcc cctggggtagatcccagccccccacgtaggtcccttggctgcaaaaggaagagggagtgt ttggatgaatctgatgatgagccagagaaggagctcgcccctgagcctgaggagacctgg gtggcggagacgctgtgtggcctcaagatgaaggcgaagcgacggcgagtgtcgctcgtg ctccctgagtactacgaggccttcaacaggctgcttgcccctggggtagatcccagcccc ccacgtaggtcccttggctgcaaaaggaagagggagtgtttggatgaatctgatgatgag ccagagaaggagctcgcccctgagcctgaggagacctgggtggcggagacgctgtgtggc ctcaagatgaaggcgaagcgacggcgagtgtcgctcgtgctccctgagtactacgaggcc ttcaacaggctgcttgcccctggggtagatcccagccccccacgtaggtcccttggctgc aaaaggaagagggagtgtttggatgaatctgatgatgagccagagaaggagctcgcccct gagcctgaggagacctgggtggcagagacgctgtgtggcctcaagatgaaggcgaagcga cggcgagtgtcgctcgtgctccctgagtactacgaggccttcaacaggctgcttgcccct ggggtagatcccagccccccacgtaggtcccttggctgcaaaaggaagagggagtgtttg gatgaatctgatgatgagccagagaaggagctcgcccctgagcctgaggagacctgggtg gcggagacgctgtgtggcctcaagatgaaggcgaagcgacggcgagtgtcgctcgtgctc cctgagtactacgaggccttcaacaggctgcttgcccctggggtagatcccagccccccg cgtaggtcccttggctgcaaaaggaagagggagtgtttggatgaatctgatgatgagcca gagaaggagctcgcccctgagcctgaggagacctgggtggcggagacgctgtgtggcctc aagatgaaggcgaagcgacggcgagtgtcgctcgtgctccctgagtactacgaggccttc aacaggctgcttgaggatcctgtcattaaaagattcctggcctgggacaaagatctgagg gtgtcagacaagatcccatcggagcccaccatcctgggagcatcacctaaaacccttcct ccggcttctcggatttgcatccgaccttcgaatacccctccaccccgcaatttccacatg agcacagtcaccccaatgctgagctatctggccaatgacatggaggaggacgacgaggcc cccaaacaaaagatcttctacttcctgtacgggaagacccactctcacatacccttgcgc cctaagcattggttccagttatgccgtcccatgaacccgagggccaggaagaactgctct cagatagccttgttccagaagcgtcggttccagttcttctgttccatgcgctgcagggct tgggtttccccggaggagttggaggagaacaccggacccaggggagatgtggattttcag caggaactttattccagtgctaatggcagacatcaggaaggaggagaggagccatttgtg cagatcatctag >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_2|369_aa MDVGWDLGEDCWQEHLHTAFSFDCWPCSQNGDRVPSVNPVGGGAPRSYLQVASAECWAAA PAVHVGEPVHAGGLHTERGADPVIGLYLVHRGGACQTPTVGNRQTPTLGIHARPRRRATT SLLTLLLAFGKNAVRTEPAKAIKPIDRKSVHQICSGPVVPSLSTAVKELVENSLDAGATN IDLKLKDYGVDLIEVSGNGCGVEEENFEGLTLKHHTSKIQEFADLPQVETFGFRGEALSS LCALSDVTISTCHVSAKVGTRLVFDHYGKIIQKTPYPHPRGMTVSVKQLFSTLPVHHKEF QRNIKKKRACFPFAFCRDCQFPEASPAMLPVQPAELELSNLRTWWEPPNLQPVGEKYMQS VDTQACSCI >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_2|1110_bp atggatgttggctgggacctcggtgaggactgctggcaagaacacctacacacggccttt tcctttgactgctggccttgctcacagaatggtgaccgggttcccagtgtgaacccagtg ggcgggggggccccgcgctcctacctgcaagtggccagtgccgagtgctgggccgccgct cctgccgtgcatgttggggagccagtacatgcaggtgggctccacacggagaggggcgcc gaccccgtgatagggctttacctggtacatcggggtggcgcgtgccagacaccaacggtc ggaaaccgccagacaccaacgctcggaatccacgccaggccacgacggagggcgactacc tcccttctgaccctgctgctggcgttcggaaaaaacgcagtccgtacagaacctgctaag gccatcaaacctattgatcggaagtcagtccatcagatttgctctgggccggtggtaccg agtctaagcactgcggtgaaggagttagtagaaaacagtctggatgctggtgccactaat attgatctaaagcttaaggactatggagtggatctcattgaagtttcaggcaatggatgt ggggtagaagaagaaaacttcgaaggcttaactctgaaacatcacacatctaagattcaa gagtttgccgacctacctcaggttgaaacttttggctttcggggggaagctctgagctca ctttgtgcactgagtgatgtcaccatttctacctgccatgtatcggcgaaggttgggact cgactggtgtttgatcactatgggaaaatcatccagaaaaccccctacccccaccccaga gggatgacagtcagtgtgaagcagttattttctacgctacctgtgcaccataaagaattt caaaggaatattaagaagaaacgtgcctgcttccccttcgccttctgccgtgattgtcag tttcctgaggcctccccagccatgcttcctgtacagcctgcagaacttgaattatcaaac ctgaggacgtggtgggaacctccaaatttgcagccagttggtgagaagtacatgcagtct gtggacacccaagcttgcagctgcatctga >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_3|794_aa MGRRTLALPWVLLTLRVTAVRMEATELSSFTIRCGFLESGSISLVTVSRGGPDGAGGTTL AVLHPELGIQQWAPARQARWETQSSISLILEGSGASSPCANTTFCCKFASFPEGSWEACG SLPPSSDPGLSVPPTPAPILRADLAGILGVSGVLLFDCGYLLHLLCRQKHREASSGSRTC GRRTSLCTSAKSSWTYRSGRLSWQSIKGTHLTITQALRQPLHRAPLLPGQLCWSPRPLEK NKAMGRPLLLPLLLLLQPPAFLQPGGSTGSGPSYLYGVTQPKHLSASMGGSVEIPFSFYY PWELAIVPNVRISWRRGHFHGQSFYSTRPPSIHKDYVNRLFLNWTEGQESGFLRISNLRK EDQSVYFCRVELDTRRSGRQQLQSIKGTKLTITQAVTTTTTWRPSSTTTIAGLRVTESKG HSESWHLSLDTAIRVALAVAVLKTVILGLLCLLLLWWRRRKGSFWISQGLRAPLLPGRLC WSPRPLEKNKAMGRPLLLPLLPLLLPPAFLQPSGSTGSGPSYLYGVTQPKHLSASMGGSV EIPFSFYYPWELATAPDVRISWRRGHFHRQSFYSTRPPSIHKDYVNRLFLNWTEGQKSGF LRISNLQKQDQSVYFCRVELDTRSSGRQQWQSIEGTKLSITQAVTTTTQRPSSMTTTWRL SSTTTTTGLRVTQGKRRSDSWHISLETAVGVAVAVTVLGIMILGLICLLRWRRRKGQQRT KATTPAREPFQNTEEPYENIRNEGQNTDPKLNPKDDGIVYASLALSSSTSPRAPPSHRPL KSPQNETLYSVLKA >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_3|2385_bp atggggcgccggaccctggccctgccctgggtgctgctgaccctgcgtgtcactgcagtt cggatggaggccaccgagctctcgtccttcaccatccgttgtgggttcctggagtctggc tccatctccctggtgactgtgagcagggggggccccgatggtgctggggggaccacgctg gctgtgttgcacccggaacttggcatccagcaatgggcccctgctcgccaggcccgctgg gaaacccagagcagcatctctctcatcctggaaggctctggggccagcagcccctgcgcc aacaccaccttctgctgcaagtttgcgtccttccctgagggctcctgggaggcctgtggg agcctcccgcccagctcagacccagggctctctgtcccgccgactcctgcccccattctg cgggcagacctggccgggatcttgggggtctcaggagtccttctctttgactgtggctac ctccttcatctgctgtgccgacagaagcaccgggaggcttcctcaggatctcgaacctgc ggaaggaggaccagtctgtgtacttctgccaagtccagctggacatacagatcagggagg ctgtcgtggcagtccatcaaggggacccacctcaccatcacccaggccctcaggcagccc ctccacagggcccctctcctgcctggacagctctgctggtctccccgtcccctggagaag aacaaggccatgggtcggcccctgctgctgcccctgctgctcctgctgcagccgccagca tttctgcagcctggtggctccacaggatctggtccaagctacctttatggggtcactcaa ccaaaacacctctcagcctccatgggtggctctgtggaaatccccttctccttctattac ccctgggagttagccatagttcccaacgtgagaatatcctggagacggggccacttccac gggcagtccttctacagcacaaggccgccttccattcacaaggattatgtgaaccggctc tttctgaactggacagagggtcaggagagcggcttcctcaggatctcaaacctgcggaag gaggaccagtctgtgtatttctgccgagtcgagctggacacccggagatcagggaggcag cagttgcagtccatcaaggggaccaaactcaccatcacccaggctgtcacaaccaccacc acctggaggcccagcagcacaaccaccatagccggcctcagggtcacagaaagcaaaggg cactcagaatcatggcacctaagtctggacactgccatcagggttgcattggctgtcgct gtgctcaaaactgtcattttgggactgctgtgcctcctcctcctgtggtggaggagaagg aaaggaagcttctggatcagccaagggctcagggcccctctcctgcctggacggctctgc tggtctccccgtcccctggagaagaacaaggccatgggtcggcccctgctgctgccccta ctgcccttgctgctgccgccagcatttctgcagcctagtggctccacaggatctggtcca agctacctttatggggtcactcaaccaaaacacctctcagcctccatgggtggctctgtg gaaatccccttctccttctattacccctgggagttagccacagctcccgacgtgagaata tcctggagacggggccacttccacaggcagtccttctacagcacaaggccgccttccatt cacaaggattatgtgaaccggctctttctgaactggacagagggtcagaagagcggcttc ctcaggatctccaacctgcagaagcaggaccagtctgtgtatttctgccgagttgagctg gacacacggagctcagggaggcagcagtggcagtccatcgaggggaccaaactctccatc acccaggctgtcacgaccaccacccagaggcccagcagcatgactaccacctggaggctc agtagcacaaccaccacaaccggcctcagggtcacacagggcaaacgacgctcagactct tggcacataagtctggagactgctgtgggggtggcagtggctgtcactgtgctcggaatc atgattttgggactgatctgcctcctcaggtggaggagaaggaaaggtcagcagcggact aaagccacaaccccagccagggaacccttccaaaacacagaggagccatatgagaatatc aggaatgaaggacaaaatacagatcccaagctaaatcccaaggatgacggcatcgtctat gcttcccttgccctctccagctccacctcacccagagcacctcccagccaccgtcccctc aagagcccccagaacgagaccctgtactctgtcttaaaggcctaa >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_4|544_aa MADLPQLRKRRRCFQLRMRLGPASAPPSGGASAQGVLVPFLGGGKPGLRCEGLAESLRQL VPPIAFRSSLLLKEETPGISSPETEARISLPKASLKKKEEKATMKNVPSREQEKKRKAQI NKQAEKKEKEKSSLTNAEFEEIVQIVLQKSLQECLEDEKVEKTQGGHEHRQEDRLKKTVQ DHSQIRDQQKGEISGFGQCLVWVQCSFPNCGKWRRLCGNIDPSVLPDNWSCDQNTADVQY NRCDIPEETWTGLESDVAYASYIPGSIIWAKQYGYPWWPGMIESDPDLGEYFLFTSHLDS LPSKYHVTFFGETVSRAWIPVNMLKNFQELSLELSVMKKRRNDCSQKLGVALMMAQEAEQ ISIQERVNLFGFWSRFNGSNSNGERKDLQLSGLNSPGSCLEKKEKEEELEKEEGEKTDPI LPIRKRVKIQTQKTKPRGPKKKFKAPQSKALAASFSEGKEVRTVPKNLGLSACKGACPSS AKEEPRHREPLTQEAGSVPLEDEASSDLDLEQLMEDVGRELGQSGELQHSNSDGEDFPVA LFGK >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_4|1635_bp atggcggatttacctcagctgcggaaacgcagacgctgtttccagttgcgcatgcgcctc ggccccgcgagcgcaccgccctcgggtggagctagtgctcagggcgtcctcgtgcctttt cttggtggcgggaaacctgggttgaggtgtgaggggcttgcggagtcgctgcggcagctg gttccgcccatcgcctttaggtcctccttgctcctgaaggaggagaccccggggatcagt tccccagagacagaggccaggataagcctgccaaaggccagtttaaagaagaaagaggaa aaagcaaccatgaagaatgttccaagcagggaacaggagaaaaaaagaaaggcacaaatc aacaagcaagcagagaagaaagaaaaggaaaaatcaagtcttaccaatgcagaatttgag gagattgtccagattgttctgcagaagtcccttcaggagtgcttggaagatgagaaggtg gagaaaactcaaggtggacatgagcacagacaggaagaccgactaaagaaaacagttcag gatcattctcagatcagggaccagcaaaaaggagagataagtggttttggtcaatgtctg gtctgggtccagtgttccttcccaaactgtgggaaatggaggcggctgtgtgggaacatt gacccctcagttctcccagataattggtcctgtgatcagaacacagcagatgtgcagtat aatcgctgtgatattcctgaggagacctggacagggcttgagagtgatgtggcctatgcc tcctacatcccaggatccatcatctgggccaagcaatacggttacccctggtggccaggc atgatagaatctgatcctgacttaggggaatattttctttttacttcccatcttgattcc ctgccgtctaagtaccatgtgacgttttttggagaaacagtttctcgtgcatggatccca gtcaacatgctaaagaacttccaggagctgtccctggagctatcagtcatgaaaaagcgc agaaatgactgcagccagaaactgggggtggccctgatgatggctcaagaggcagaacag atcagcattcaggaacgggttaacttgtttggtttctggagccgattcaacggatctaac agtaatggggaaagaaaagacttacagctctctggtttgaacagcccaggatcctgctta gagaaaaaggagaaagaggaagagttggaaaaggaggaaggagagaaaacagacccaatt ttgcccattcgtaagcgagtcaaaatacagacccaaaaaaccaagccaagaggccctaag aaaaaatttaaagctccccagagcaaggccttggcagccagcttttcagagggaaaagaa gttagaacagtgccaaagaacctgggcctatcagcgtgtaagggggcctgcccctcatct gcgaaagaagagcccagacaccgggaacccctgacccaggaggctggaagtgtccccctt gaggacgaagcctccagtgacctggacctggagcaactcatggaagatgttgggagagag ctggggcagagcggggagctgcagcacagcaacagtgatggcgaggacttccccgtggcg ctgtttgggaagtag >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_5|686_aa MAAEKEPFLVPAPPPPLKDESGGGGGPTVPPHQEAASGELRGGTERGPGRCAPSAGSPAA AVGRESPGAAATSSSGPQAQQHRGGGPQAQSHGEARLSDPPGRAAPPDVGEERRGGGGTE LGPPAPPRPRNGYQPHRPPGGGGGKRRNSCNVGGGGGGFKHPAFKRRRRVNSDCDSVLPS NFLLGGNIFDPLNLNSLLDEEVSRTLNAETPKSSPLPAKGRDPVEILIPKDITDPLSLNT CTDEGHVVLASPLKTGRKRHRHRGQHHQQQQAAGGSESHPVPPTAPLTPLLHGEGASQQP RHRGQNRDAPQPYELNTAINCRDEVVSPLPSALQGPSGSLSAPPAASVISAPPSSSSRHR KRRRTSSKSEAGARGGGQGSKEKGRGSWGGRHHHHHPLPAAGFKKQQRKFQYGNYCKYYG YRNPSCEDGRLRVLKPEWFRGRDVLDLGCNVGHLTLSIACKWGPSRMVGLDIDSRLIHSA RQNIRHYLSEELRLPPQTLEGDPGAEGEEGTTTVRKRSCFPASLTASRGPIAAPQVPLDG ADTSVFPNNVVFVTGNYVLDRDDLVEAQTPEYDVVLCLSLTKWVHLNWGDEGLKRMFRRI YRHLRPGGILVLEPQPWSSYGKRKTLTETIYKNYYRIQLKPEQFSSYLTSPDVGFSSYEL VATPHNTSKGFQRPVYLFHKARSPSH >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_5|2061_bp atggcggcggagaaggagccgtttctggtgccggccccgccgccgccgctcaaagatgag tcgggcggagggggcggccccacggtgccaccgcaccaagaggccgcctctggggagctc cgcggcgggacggagcgtggtccgggtcgttgcgcgccatctgcggggtccccagccgct gcggtcggtcgggaaagccccggggccgcggccacctcctccagtggtccccaggcgcag cagcaccgagggggcggcccccaggcgcagtcgcatggggaggcccgcctgtcggatccc ccggggcgagccgctcccccggacgtgggggaggagcgccggggagggggcgggacagag ctgggtccccctgctcctcctcgaccccgcaatggctatcagccccaccggccacctggg gggggcgggggcaagaggagaaatagctgtaatgtagggggaggcgggggaggcttcaaa catccggccttcaagaggcgcaggcgggtgaattcggactgtgactctgtgttaccctcc aacttcctcctggggggcaatatctttgatcccctgaacctgaatagcctcctggatgag gaagtgagccgcactctcaacgcggagacccctaagtcatccccccttccggccaaaggg cgagatccggtggagatcctcatccccaaagatattactgacccgctcagtctcaatact tgcactgatgagggccatgtagttcttgcttcgccactcaagactggtcggaagcggcat agacaccggggacagcaccaccagcagcagcaggcagccggagggagtgagagtcacccc gtgccgcccacagcccctctcacccccttactccacggggagggcgcctcacagcagccg cggcacaggggccagaaccgggatgccccccaaccctatgaactcaacacagccatcaac tgcagggatgaagtggtgtctccccttccatctgctctgcagggtccctcaggctcccta tcagcccctccagctgcctcagttatctctgcacccccatcttcctcctcccgacatcgc aaacgtcgcaggacttccagcaagtcggaggcaggggctaggggtggaggccagggttcc aaggaaaagggccgagggagttggggaggccgccaccaccaccaccacccactgcctgca gcaggcttcaaaaagcaacagcgcaagttccagtatgggaattattgcaaatactatggg taccgcaatccttcctgtgaggatgggcgccttcgggtgttgaagcctgagtggtttcgg ggccgggacgtcctagatctgggctgcaatgtgggccatctgaccctgagcattgcctgc aagtggggcccgtcccgcatggtgggcctggatatcgattcccggctcatccattctgcc cgccaaaacatccgacactacctttccgaggagctgcgtctcccaccccagactttggaa ggggacccgggggcagagggtgaggaagggaccaccaccgttcgaaagaggagctgcttc ccagcctcgctgactgccagccggggtcccatcgctgccccccaagtgcccttggatgga gcggacacatcagtcttccccaacaatgttgtcttcgtcacgggtaattatgtgctggat cgagatgacctggtggaggcccaaacacctgagtatgatgtggtgctctgcctcagcctc accaagtgggtgcatctgaactggggagacgagggcctgaagcgcatgtttcgccggatc taccggcacctacgccctgggggcatcctggtcctagagccccaaccctggtcgtcgtat ggcaagagaaagactcttacagaaacgatctacaagaactactaccgaatccaattgaag ccagagcagttcagttcctacctgacatccccagacgtgggcttctccagctatgagctt gtggccacaccccacaacacctctaaaggcttccagcgtcctgtgtacctgttccacaag gcccgatcccccagccactaa >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_6|253_aa MMMGCGESELKSADGEEAAAVPGPPPEPQVPQLRAPVPEPGLDLSLSPRPDSPQPRHGSP GRRKGRAERRGAARQRRQVRFRLTPPSPVRSEPQPAVPQELEMPVLKSSLALGLELRAAA GSHFDAAKAVEEQLRKSFQIRCGLEESVSEGLNVPRSKRLFRDLVSLQVPEEQVLNAALR EKLALLPPQARAPHPKEPPGPGPDMTILCDPETLFYESPHLTLDGLPPLRLQLRPRPSED TFLMHRTLRRWEA >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_6|762_bp atgatgatgggttgtggggagtcagagctgaagtcggcggacggggaagaagccgcggcg gtcccggggccacccccggagccccaagtcccgcaactccgagccccagtgcccgagccc ggcctggacttgagcctgagcccgcggcccgacagccctcagccgcggcacggcagcccc gggcggcggaaggggcgggcggagcggcggggcgcggctcggcagcggcggcaggtccgc ttccgcctgacgccgccctccccggtgcggtccgagccgcagcctgcggtgccgcaggag ctggagatgcccgtgctgaagagcagcctggccttgggcctggagctgcgggccgcagcc gggagccactttgatgctgcgaaggccgtggaggaacagctgagaaagtcgttccagatc cgctgcggcctggaggagagcgtgtccgaggggctgaacgtgccgcgctccaagcggctc ttccgggacctggtgagcctgcaggtgccggaggaacaggttctgaatgccgcgctcagg gagaaattggctctcctgccgccacaggctcgagccccgcacccaaaggagccacctggg cctgggccagacatgaccatcttgtgtgacccagaaacgctattttatgaatctccacac ctgaccctggacggtctgccccctctccgacttcaactccggccccgcccttcagaggac accttcctcatgcaccggacactgaggcgatgggaagcgtag >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_7|602_aa MSGGKKKSSFQITSVTTDYEGPGSPGASDPPTPQPPTGPPPRLPNGEPSPDPGGKGTPRN GSPPPGAPSSRFRVVKLPHGLGEPYRRGRWTCVDVYERDLEPHSFGGLLEGIRGASGGAG GRSLDSRLELASLGLGAPTPPSGLSQGPTSWLRPPPTSPGPQARSFTGGLGQLVVPSKAK AEKPPLSASSPQQRPPEPETGESAGTSRAATPLPSLRVEAEAGGSGARTPPLSRRKAVDM RLRMELGAPEEMGQVPPLDSRPSSPALYFTHDASLVHKSPDPFGAVAAQKFSLAHSMLAI SGHLDSDDDSGSGSLVGIDNKIEQAMDLVKSHLMFAVREEVEVLKEQIRELAERNAALEQ ENGLLRALASPEQLAQLPSSGVFFWRQKIKPTISGHPDSKKHSLKKMEKTLQVVETLRLV ELPKEAKPKLGESPELADPCVLAKTTEETEVELGQQGQSLLQLPRTAVKSVSTLMVSALQ SGWQMCSWKPPRDGDTAKVKVSGKTLREVQLCLFPVTAEIVQIATYYILDLLNSDLGIRK EVAGKQSLSWTVEKPHTNISLSNRLSWTVEKPHTNISLSNPEASMESLSTPRPPNISCVL IE >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_7|1809_bp atgagcgggggcaagaagaagagtagtttccaaatcaccagcgtcaccacggactatgag ggccctgggagcccaggggcttcggatccccctaccccacagcccccaaccgggcccccg ccccgcctgcccaatggggagcccagccccgatccggggggcaagggcaccccccggaat ggctccccaccacctggggccccttcctcccgtttccgggtggtgaagctgccccacggc ctgggagagccttatcgccgcggtcgctggacgtgtgtggatgtttatgagcgagacctg gagccccacagcttcggcggactcctggagggaattcgaggggcctcagggggcgccggg ggcagatctttggattccaggttggagctggccagcctcggcctgggcgcccccacccca ccgtcaggcctgtctcagggccccacctcctggctccgtccaccccccacctctcctgga cctcaggcccgctccttcactgggggactgggccagctggtggtgcccagcaaagccaag gcagagaaacccccactgtcggcctcctcaccccagcagcgccccccagagcctgagacc ggtgagagtgcgggcacatcccgggctgccacgcccctgccctctctgagggtggaagcg gaggctgggggctcaggggccaggacccctccactgtcccggaggaaagctgtagacatg cggctgcggatggagttgggtgctccagaagagatggggcaggtgcccccacttgactct cgccccagctccccagccctctacttcacccacgatgccagcctggttcacaaatctcca gaccccttcggagcagtagcagctcagaagttcagcctggcccactccatgttggccatc agtggtcacctagacagcgacgatgatagtggctccggaagcctggttggcattgacaac aaaatcgagcaagccatggacttggtgaagtcccacctcatgtttgcggtccgggaggag gtggaggtgctgaaggagcagatccgggaattggcggagcggaacgctgcgctggagcag gagaatgggctgctgcgcgccctggccagcccggagcagctggctcagctgccctcctcg ggggttttcttctggaggcaaaaaattaaaccaaccatctcaggacaccctgactccaag aaacactcattgaagaagatggagaagactctccaggtggttgagactttgaggttggtc gagctcccaaaagaggctaagcccaagttgggtgagtcccccgagctggcagatccctgc gtgttggccaagactacagaggagaccgaggtggagctgggccaacagggccaatcccta ctgcagctgccgaggacggccgtcaagtctgtctccacgctcatggtctctgccctgcag agcggctggcagatgtgcagctggaagccccccagagatggggatactgccaaggtgaag gtttcggggaagaccctcagggaggttcagctgtgtttgtttcctgtgactgctgaaatt gtacagattgccacatactacatcctagacctcctgaattctgatttggggatcaggaag gaggttgctgggaagcagagcctctcttggactgtggaaaaacctcacaccaacatctcc ttgtccaatcgcctctcttggactgtggaaaaacctcacaccaacatctccttgtccaat ccagaggcttccatggaatcactctccaccccaagaccccccaacatctcctgtgtgctc attgagtaa >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_8|842_aa MNLLYRKTKLEWRQHKEEEAKRSSSKEVAPAGSAGPAAGQGPGVRVRDIASLRRSLRMGF MTMPASQEHTPHPCRSAMAPRSLSCHSVGSMDSVGGGPGGASGGLTEDSSTRRPPAKPRR HPSTKLSMVGPGSGAETPPSKKAGSQKPTPEGRESSRKVPPQKPRRSPNTQLSVSFDESC PPGPSPRGGNLPLQRLTRGSRVAGDPDVGAQEEPVYIEMVGDVFRGGGRSGGGLAGPPLG GGGPTPPAGADSDSEESEAIYEEMKYPLPEEAGEGRANGPPPLTATSPPQQPHALPPHAH RRPASALPSRRDGTPTKTTPCEIPPPFPNLLQHRPPLLAFPQAKSASRTPGDGVSRLPVL CHSKEPAGSTPAPQVPARERETPPPPPPPPAANLLLLGPSGRARSHSTPLPPQGSGQPRG ERELPNSHSMICPKAAGAPAAPPAPAALLPGPPKDKAVSYTMVYSAVKVTTHSVLPAGPP LGAGEPKTEKEISVLHGMLCTSSRPPVPGKTSPHGGAMGAAAGVLHHRGCLASPHSLPDP TVGPLTPLWTYPATAAGLKRPPAYESLKAGGVLNKGCGVGAPSPMVKIQLQEQGTDGGAF ASISCAHVIASAGTPEEEEEEVGAATFGAGWALQRKVLYGGRKAKELDTEVEDGARAWNG SAEGPGKVEREDRGPGTSGIPVRSQGAEGLLARIHHGDRGGSRTALPIPCQTFPACHRNG DFTGGYRLGRSASTSGVRQVVLHTPRPCSQPRDALSQPHPALPLPLPLPPQPARERDGKL LEVIERKRCVCKEIKARHRPDRGLCKQESMPILPSWRRGPEPRKSGTPPCRRQHTVLWDT AI >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_8|2529_bp atgaacctcctctaccgaaaaaccaagctggagtggaggcagcacaaggaagaggaggcc aagaggagctccagtaaggaggtggcccccgctggctcggctgggcccgcggccggccag gggcctggggtccgcgtgcgggacatcgcctcgctgcggcgctccctcaggatgggtttc atgacgatgcccgcctcccaggagcacaccccgcacccctgccgcagcgccatggcccca cgctccctctcctgccactcggtgggcagcatggacagtgtcgggggtggccctggcggg gccagtgggggcctcacagaggacagcagcacccgaagaccccctgccaagccccggaga caccccagcaccaagctcagcatggtggggcctgggtctggggcagagacgccccccagc aagaaagcaggctcacagaagccaaccccagagggccgagagtccagccggaaggttcct ccgcagaagcccaggcgaagccctaacacccagctctctgtctccttcgatgagtcctgc cccccaggcccctctcctcgaggggggaacctgcctcttcagcgcctcactagggggtcc cgagtagctggggaccctgatgtgggtgcccaggaagagcctgtgtacattgagatggtg ggggacgtctttaggggaggaggacgaagtggaggaggcctggctgggccccctcttggg ggtgggggcccgacccctccagcgggcgccgactcggactctgaagagagtgaggccatc tatgaagagatgaagtacccgctgccggaagaggctggggaaggccgggccaatggccct ccaccattgacggcaacatccccgccacaacagcctcacgcccttccgccccatgcccac cgccgcccagcttcagccctcccgagccggagggacgggacgcccaccaagaccactcct tgtgaaatccccccgcccttccccaacctccttcagcaccggcctccactcctggccttc ccccaagccaagtctgcttcccgaacccctggcgatggggtctcaaggctacctgtcctc tgccactccaaggagccagccggctccaccccagctccccaagtgcctgcacgggagcgg gagacgcctcccccaccgcctccacctcctgctgccaacctgctgctgctgggaccatcg ggccgggcccggagccactcgacaccgttgccaccccagggctctggccagccccggggg gagcgggagctccccaactcccacagcatgatctgccctaaggcggcgggggcgccggca gccccccctgccccggccgccttgctccccggcccccccaaggacaaggccgtgtcttac accatggtgtactcggcggtcaaggtgaccacgcactctgtcctgccagctggtccaccc ctgggtgctggggagccaaagacggagaaggagatctcggtcctccatgggatgctgtgt accagctcaaggccccctgtgccagggaagaccagcccccacggtggggccatgggcgca gcagctggggtcctccaccaccgcggctgcctggcctccccccacagccttccggaccca actgtaggccccctgaccccgctgtggacctacccagccacagcagctgggctcaagaga ccccctgcctatgagagcctcaaggctgggggggtgctgaataagggctgtggtgtgggg gccccatcccccatggtcaagatccagctgcaggagcaagggaccgatgggggtgctttt gccagcatctcctgtgcccacgtcatcgccagcgcagggacaccagaggaggaagaagag gaggtgggcgccgcgacatttggggcaggctgggccctgcagaggaaggtcctctatgga gggagaaaagcaaaggagttggacacagaggtcgaggacggtgcccgggcctggaatggc agtgccgagggtccaggcaaggtggagcgtgaggacaggggccctgggacatcggggatc ccagtgagaagccagggggcagagggactgctggccaggatccaccatggagaccgagga gggagccgcaccgcgctgcccattccctgccagaccttccccgcctgccaccgcaatgga gacttcacgggaggctaccgcctggggcgctccgcctccacctccggagtccggcaggtc gtgctccacacaccccggccctgcagccagcccagggatgccctgagccagccccacccc gcgctgccgctgcctctgcccctgccgccccagccggcccgcgagcgtgacgggaagctg ctggaggtgatcgagcgcaagcgctgcgtgtgcaaggagatcaaggcgcgccaccgcccg gaccgaggcctctgcaagcaggagagcatgcccatcctccccagctggcggcggggaccc gagccccgcaagtccggcaccccgccctgccgccggcagcacacggtcctctgggacacc gccatctga >gi568815591r:100301017_100517181|GENSCAN_predicted_peptide_9|76_aa XAPTPGASQLPHHEDIHATYRAAQEELRPAANNQHQLACRCLWVIPALQGAVPLLRLCTE ASIKVVTLAGHECSNL >gi568815591r:100301017_100517181|GENSCAN_predicted_CDS_9|231_bp nnagcccccactccgggggcaagccagctgccgcatcacgaggacatccacgcaacctat agagctgctcaggaggaactgaggccggctgccaataaccagcaccaacttgcctgccgt tgcctatgggtcatccctgctctgcaaggagccgtacctttgctgcggctgtgcactgaa gcttcaataaaagttgtaacactggctggacacgagtgctcaaacctataa