GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:26:55 Sequence gi568815591r:100366962_100578038 : 211077 bp : 50.81% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4900 5057 158 0 2 55 107 35 0.379 1.48 1.02 Intr + 6635 6759 125 2 2 104 75 117 0.909 12.23 1.03 Intr + 7083 7472 390 1 0 122 96 251 0.027 24.00 1.04 Intr + 22927 23145 219 2 0 93 115 27 0.361 4.17 1.05 Intr + 30918 30951 34 1 1 126 109 27 0.813 5.88 1.06 Intr + 32330 32379 50 2 2 104 113 31 0.836 5.52 1.07 Intr + 32620 32651 32 2 2 82 121 8 0.863 1.35 1.08 Term + 32824 32946 123 0 0 90 41 60 0.909 -0.12 1.09 PlyA + 33114 33119 6 1.05 2.14 PlyA - 33931 33926 6 1.05 2.13 Term - 34375 34053 323 2 2 66 37 315 0.980 19.18 2.12 Intr - 35615 35555 61 2 1 118 99 25 0.995 4.91 2.11 Intr - 36824 36733 92 0 2 112 61 59 0.990 5.31 2.10 Intr - 37283 37217 67 2 1 86 97 24 0.990 1.58 2.09 Intr - 38132 38052 81 2 0 74 75 74 0.892 4.53 2.08 Intr - 39837 39733 105 0 0 45 89 74 0.661 3.61 2.07 Intr - 40342 40267 76 0 1 98 117 -49 0.716 -1.48 2.06 Intr - 41701 41578 124 2 1 94 58 155 0.296 12.74 2.05 Intr - 42583 42467 117 2 0 122 75 19 0.948 4.44 2.04 Intr - 49136 49014 123 0 0 126 97 70 0.981 12.26 2.03 Intr - 52228 52150 79 1 1 112 47 84 0.600 5.82 2.02 Intr - 52836 52669 168 0 0 59 76 82 0.501 4.24 2.01 Init - 61413 61195 219 0 0 85 63 193 0.821 15.13 2.00 Prom - 62051 62012 40 -7.96 3.00 Prom + 62447 62486 40 -7.36 3.01 Init + 63067 64728 1662 0 0 91 107 1176 0.653 111.67 3.02 Intr + 65958 66176 219 2 0 116 79 263 0.998 26.70 3.03 Intr + 66302 66428 127 1 1 99 116 199 0.999 24.05 3.04 Term + 66541 66593 53 2 2 131 42 59 0.999 3.19 3.05 PlyA + 67126 67131 6 1.05 4.05 PlyA - 68326 68321 6 1.05 4.04 Term - 68572 68399 174 1 0 63 41 109 0.643 1.46 4.03 Intr - 68806 68670 137 2 2 98 59 219 0.997 20.29 4.02 Intr - 69103 68887 217 1 1 97 72 391 0.950 36.48 4.01 Init - 69413 69180 234 2 0 76 99 153 0.496 13.40 4.00 Prom - 71701 71662 40 -7.46 5.08 PlyA - 72699 72694 6 1.05 5.07 Term - 76588 76420 169 0 1 68 42 170 0.965 7.75 5.06 Intr - 79424 79252 173 2 2 31 105 95 0.474 4.24 5.05 Intr - 96725 96402 324 2 0 119 82 229 0.943 21.37 5.04 Intr - 100207 100043 165 1 0 66 -17 297 0.066 17.16 5.03 Intr - 100639 100591 49 0 1 115 95 72 0.931 9.28 5.02 Intr - 107479 107313 167 1 2 119 82 160 0.960 17.26 5.01 Init - 111077 110316 762 2 0 94 110 437 0.999 41.20 5.00 Prom - 114897 114858 40 -6.36 6.00 Prom + 115267 115306 40 -9.55 6.01 Init + 118351 118418 68 0 2 69 94 89 0.975 8.14 6.02 Intr + 119860 120221 362 1 2 116 94 303 0.998 28.56 6.03 Intr + 121191 122705 1515 1 0 107 79 971 0.996 87.71 6.04 Intr + 123553 123768 216 2 0 51 51 128 0.610 3.88 6.05 Intr + 124025 124134 110 0 2 132 94 90 0.999 14.00 6.06 Term + 126685 126942 258 0 0 50 54 386 0.753 26.95 6.07 PlyA + 128144 128149 6 1.05 7.00 Prom + 140803 140842 40 -6.66 7.01 Init + 145954 146058 105 0 0 74 80 98 0.754 7.82 7.02 Term + 150692 150838 147 1 0 78 43 97 0.724 2.10 7.03 PlyA + 153057 153062 6 1.05 8.00 Prom + 154756 154795 40 -6.86 8.01 Init + 172392 172606 215 2 2 74 76 350 0.666 28.62 8.02 Intr + 181861 181999 139 1 1 117 75 64 0.781 8.47 8.03 Intr + 183435 183550 116 2 2 111 91 118 0.999 13.65 8.04 Intr + 186386 186539 154 2 1 63 93 41 0.654 2.17 8.05 Intr + 187132 187297 166 0 1 62 98 60 0.651 3.93 8.06 Intr + 188649 188774 126 1 0 109 97 47 0.994 8.35 8.07 Intr + 189633 189665 33 1 0 56 111 34 0.500 0.69 8.08 Intr + 195298 195418 121 2 1 46 96 45 0.659 0.65 8.09 Intr + 195633 195721 89 0 2 100 105 112 0.997 13.71 8.10 Intr + 195902 195985 84 0 0 51 115 58 0.523 4.59 8.11 Intr + 196873 197001 129 2 0 99 95 36 0.977 6.07 8.12 Intr + 197257 197342 86 2 2 103 101 54 0.984 7.74 8.13 Intr + 198086 198168 83 1 2 117 67 35 0.732 2.74 8.14 Intr + 201120 201256 137 0 2 79 70 40 0.605 1.51 8.15 Term + 203101 203441 341 2 2 74 43 393 0.712 28.10 8.16 PlyA + 203883 203888 6 -0.45 9.16 PlyA - 205291 205286 6 1.05 9.15 Term - 205349 205326 24 2 0 128 38 0 0.524 -2.78 9.14 Intr - 205790 205693 98 0 2 78 94 111 0.999 10.43 9.13 Intr - 206052 205899 154 0 1 113 82 56 0.998 7.15 9.12 Intr - 206234 206145 90 2 0 91 92 40 0.820 4.89 9.11 Intr - 206439 206336 104 1 2 78 91 80 0.233 7.19 9.10 Intr - 208343 208154 190 2 1 128 -44 294 0.226 19.36 9.09 Intr - 208821 208744 78 0 0 88 64 89 0.961 6.25 9.08 Intr - 209047 208910 138 1 0 83 49 192 0.998 15.46 9.07 Intr - 209362 209277 86 2 2 108 99 81 0.905 10.74 9.06 Intr - 209816 209733 84 0 0 91 94 11 0.576 1.79 9.05 Intr - 210044 209941 104 1 2 114 75 -76 0.736 -6.48 9.04 Intr - 210428 210312 117 1 0 109 47 90 0.825 6.58 9.03 Intr - 210597 210536 62 0 2 133 80 0 0.972 1.23 9.02 Intr - 210779 210703 77 0 2 72 78 130 0.999 9.63 9.01 Intr - 210951 210861 91 0 1 89 109 131 0.994 14.87 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 7083 7582 500 1 2 122 38 333 0.960 26.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_1|376_aa MGVQLDPGPESDINMHVASKETARGSIGQHVPSRATQRLLAISSMWPPHWTAWAPLLPGR LCWSPRPLEKNKAMGRPLLLPLLPLLLPPAFLQPSGSTGSGPSYLYGVTQPKHLSASMGG SVEIPFSFYYPWELATAPDVRISWRRGHFHRQSFYSTRPPSIHKDYVNRLFLNWTEGQKS GFLRISNLQKQDQSVYFCRVELDTRSSGRQQWQSIEGTKLSITQAVTTTTQRPSSMTTTW RLSSTTTTTGLRVTQGKRRSDSWHISLETAVGVAVAVTVLGIMILGLICLLRWRRRKGQQ RTKATTPAREPFQNTEEPYENIRNEGQNTDPKLNPKDDGIVYASLALSSSTSPRAPPSHR PLKSPQNETLYSVLKA >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_1|1131_bp atgggggttcagctggatccagggcctgaaagtgacataaacatgcatgttgcatctaag gagactgcacgtggcagcattgggcagcatgtgccctccagagctacccagaggctgttg gctataagttctatgtggcccccacattggacagcctgggcccctctcctgcctggacgg ctctgctggtctccccgtcccctggagaagaacaaggccatgggtcggcccctgctgctg cccctactgcccttgctgctgccgccagcatttctgcagcctagtggctccacaggatct ggtccaagctacctttatggggtcactcaaccaaaacacctctcagcctccatgggtggc tctgtggaaatccccttctccttctattacccctgggagttagccacagctcccgacgtg agaatatcctggagacggggccacttccacaggcagtccttctacagcacaaggccgcct tccattcacaaggattatgtgaaccggctctttctgaactggacagagggtcagaagagc ggcttcctcaggatctccaacctgcagaagcaggaccagtctgtgtatttctgccgagtt gagctggacacacggagctcagggaggcagcagtggcagtccatcgaggggaccaaactc tccatcacccaggctgtcacgaccaccacccagaggcccagcagcatgactaccacctgg aggctcagtagcacaaccaccacaaccggcctcagggtcacacagggcaaacgacgctca gactcttggcacataagtctggagactgctgtgggggtggcagtggctgtcactgtgctc ggaatcatgattttgggactgatctgcctcctcaggtggaggagaaggaaaggtcagcag cggactaaagccacaaccccagccagggaacccttccaaaacacagaggagccatatgag aatatcaggaatgaaggacaaaatacagatcccaagctaaatcccaaggatgacggcatc gtctatgcttcccttgccctctccagctccacctcacccagagcacctcccagccaccgt cccctcaagagcccccagaacgagaccctgtactctgtcttaaaggcctaa >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_2|544_aa MADLPQLRKRRRCFQLRMRLGPASAPPSGGASAQGVLVPFLGGGKPGLRCEGLAESLRQL VPPIAFRSSLLLKEETPGISSPETEARISLPKASLKKKEEKATMKNVPSREQEKKRKAQI NKQAEKKEKEKSSLTNAEFEEIVQIVLQKSLQECLEDEKVEKTQGGHEHRQEDRLKKTVQ DHSQIRDQQKGEISGFGQCLVWVQCSFPNCGKWRRLCGNIDPSVLPDNWSCDQNTADVQY NRCDIPEETWTGLESDVAYASYIPGSIIWAKQYGYPWWPGMIESDPDLGEYFLFTSHLDS LPSKYHVTFFGETVSRAWIPVNMLKNFQELSLELSVMKKRRNDCSQKLGVALMMAQEAEQ ISIQERVNLFGFWSRFNGSNSNGERKDLQLSGLNSPGSCLEKKEKEEELEKEEGEKTDPI LPIRKRVKIQTQKTKPRGPKKKFKAPQSKALAASFSEGKEVRTVPKNLGLSACKGACPSS AKEEPRHREPLTQEAGSVPLEDEASSDLDLEQLMEDVGRELGQSGELQHSNSDGEDFPVA LFGK >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_2|1635_bp atggcggatttacctcagctgcggaaacgcagacgctgtttccagttgcgcatgcgcctc ggccccgcgagcgcaccgccctcgggtggagctagtgctcagggcgtcctcgtgcctttt cttggtggcgggaaacctgggttgaggtgtgaggggcttgcggagtcgctgcggcagctg gttccgcccatcgcctttaggtcctccttgctcctgaaggaggagaccccggggatcagt tccccagagacagaggccaggataagcctgccaaaggccagtttaaagaagaaagaggaa aaagcaaccatgaagaatgttccaagcagggaacaggagaaaaaaagaaaggcacaaatc aacaagcaagcagagaagaaagaaaaggaaaaatcaagtcttaccaatgcagaatttgag gagattgtccagattgttctgcagaagtcccttcaggagtgcttggaagatgagaaggtg gagaaaactcaaggtggacatgagcacagacaggaagaccgactaaagaaaacagttcag gatcattctcagatcagggaccagcaaaaaggagagataagtggttttggtcaatgtctg gtctgggtccagtgttccttcccaaactgtgggaaatggaggcggctgtgtgggaacatt gacccctcagttctcccagataattggtcctgtgatcagaacacagcagatgtgcagtat aatcgctgtgatattcctgaggagacctggacagggcttgagagtgatgtggcctatgcc tcctacatcccaggatccatcatctgggccaagcaatacggttacccctggtggccaggc atgatagaatctgatcctgacttaggggaatattttctttttacttcccatcttgattcc ctgccgtctaagtaccatgtgacgttttttggagaaacagtttctcgtgcatggatccca gtcaacatgctaaagaacttccaggagctgtccctggagctatcagtcatgaaaaagcgc agaaatgactgcagccagaaactgggggtggccctgatgatggctcaagaggcagaacag atcagcattcaggaacgggttaacttgtttggtttctggagccgattcaacggatctaac agtaatggggaaagaaaagacttacagctctctggtttgaacagcccaggatcctgctta gagaaaaaggagaaagaggaagagttggaaaaggaggaaggagagaaaacagacccaatt ttgcccattcgtaagcgagtcaaaatacagacccaaaaaaccaagccaagaggccctaag aaaaaatttaaagctccccagagcaaggccttggcagccagcttttcagagggaaaagaa gttagaacagtgccaaagaacctgggcctatcagcgtgtaagggggcctgcccctcatct gcgaaagaagagcccagacaccgggaacccctgacccaggaggctggaagtgtccccctt gaggacgaagcctccagtgacctggacctggagcaactcatggaagatgttgggagagag ctggggcagagcggggagctgcagcacagcaacagtgatggcgaggacttccccgtggcg ctgtttgggaagtag >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_3|686_aa MAAEKEPFLVPAPPPPLKDESGGGGGPTVPPHQEAASGELRGGTERGPGRCAPSAGSPAA AVGRESPGAAATSSSGPQAQQHRGGGPQAQSHGEARLSDPPGRAAPPDVGEERRGGGGTE LGPPAPPRPRNGYQPHRPPGGGGGKRRNSCNVGGGGGGFKHPAFKRRRRVNSDCDSVLPS NFLLGGNIFDPLNLNSLLDEEVSRTLNAETPKSSPLPAKGRDPVEILIPKDITDPLSLNT CTDEGHVVLASPLKTGRKRHRHRGQHHQQQQAAGGSESHPVPPTAPLTPLLHGEGASQQP RHRGQNRDAPQPYELNTAINCRDEVVSPLPSALQGPSGSLSAPPAASVISAPPSSSSRHR KRRRTSSKSEAGARGGGQGSKEKGRGSWGGRHHHHHPLPAAGFKKQQRKFQYGNYCKYYG YRNPSCEDGRLRVLKPEWFRGRDVLDLGCNVGHLTLSIACKWGPSRMVGLDIDSRLIHSA RQNIRHYLSEELRLPPQTLEGDPGAEGEEGTTTVRKRSCFPASLTASRGPIAAPQVPLDG ADTSVFPNNVVFVTGNYVLDRDDLVEAQTPEYDVVLCLSLTKWVHLNWGDEGLKRMFRRI YRHLRPGGILVLEPQPWSSYGKRKTLTETIYKNYYRIQLKPEQFSSYLTSPDVGFSSYEL VATPHNTSKGFQRPVYLFHKARSPSH >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_3|2061_bp atggcggcggagaaggagccgtttctggtgccggccccgccgccgccgctcaaagatgag tcgggcggagggggcggccccacggtgccaccgcaccaagaggccgcctctggggagctc cgcggcgggacggagcgtggtccgggtcgttgcgcgccatctgcggggtccccagccgct gcggtcggtcgggaaagccccggggccgcggccacctcctccagtggtccccaggcgcag cagcaccgagggggcggcccccaggcgcagtcgcatggggaggcccgcctgtcggatccc ccggggcgagccgctcccccggacgtgggggaggagcgccggggagggggcgggacagag ctgggtccccctgctcctcctcgaccccgcaatggctatcagccccaccggccacctggg gggggcgggggcaagaggagaaatagctgtaatgtagggggaggcgggggaggcttcaaa catccggccttcaagaggcgcaggcgggtgaattcggactgtgactctgtgttaccctcc aacttcctcctggggggcaatatctttgatcccctgaacctgaatagcctcctggatgag gaagtgagccgcactctcaacgcggagacccctaagtcatccccccttccggccaaaggg cgagatccggtggagatcctcatccccaaagatattactgacccgctcagtctcaatact tgcactgatgagggccatgtagttcttgcttcgccactcaagactggtcggaagcggcat agacaccggggacagcaccaccagcagcagcaggcagccggagggagtgagagtcacccc gtgccgcccacagcccctctcacccccttactccacggggagggcgcctcacagcagccg cggcacaggggccagaaccgggatgccccccaaccctatgaactcaacacagccatcaac tgcagggatgaagtggtgtctccccttccatctgctctgcagggtccctcaggctcccta tcagcccctccagctgcctcagttatctctgcacccccatcttcctcctcccgacatcgc aaacgtcgcaggacttccagcaagtcggaggcaggggctaggggtggaggccagggttcc aaggaaaagggccgagggagttggggaggccgccaccaccaccaccacccactgcctgca gcaggcttcaaaaagcaacagcgcaagttccagtatgggaattattgcaaatactatggg taccgcaatccttcctgtgaggatgggcgccttcgggtgttgaagcctgagtggtttcgg ggccgggacgtcctagatctgggctgcaatgtgggccatctgaccctgagcattgcctgc aagtggggcccgtcccgcatggtgggcctggatatcgattcccggctcatccattctgcc cgccaaaacatccgacactacctttccgaggagctgcgtctcccaccccagactttggaa ggggacccgggggcagagggtgaggaagggaccaccaccgttcgaaagaggagctgcttc ccagcctcgctgactgccagccggggtcccatcgctgccccccaagtgcccttggatgga gcggacacatcagtcttccccaacaatgttgtcttcgtcacgggtaattatgtgctggat cgagatgacctggtggaggcccaaacacctgagtatgatgtggtgctctgcctcagcctc accaagtgggtgcatctgaactggggagacgagggcctgaagcgcatgtttcgccggatc taccggcacctacgccctgggggcatcctggtcctagagccccaaccctggtcgtcgtat ggcaagagaaagactcttacagaaacgatctacaagaactactaccgaatccaattgaag ccagagcagttcagttcctacctgacatccccagacgtgggcttctccagctatgagctt gtggccacaccccacaacacctctaaaggcttccagcgtcctgtgtacctgttccacaag gcccgatcccccagccactaa >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_4|253_aa MMMGCGESELKSADGEEAAAVPGPPPEPQVPQLRAPVPEPGLDLSLSPRPDSPQPRHGSP GRRKGRAERRGAARQRRQVRFRLTPPSPVRSEPQPAVPQELEMPVLKSSLALGLELRAAA GSHFDAAKAVEEQLRKSFQIRCGLEESVSEGLNVPRSKRLFRDLVSLQVPEEQVLNAALR EKLALLPPQARAPHPKEPPGPGPDMTILCDPETLFYESPHLTLDGLPPLRLQLRPRPSED TFLMHRTLRRWEA >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_4|762_bp atgatgatgggttgtggggagtcagagctgaagtcggcggacggggaagaagccgcggcg gtcccggggccacccccggagccccaagtcccgcaactccgagccccagtgcccgagccc ggcctggacttgagcctgagcccgcggcccgacagccctcagccgcggcacggcagcccc gggcggcggaaggggcgggcggagcggcggggcgcggctcggcagcggcggcaggtccgc ttccgcctgacgccgccctccccggtgcggtccgagccgcagcctgcggtgccgcaggag ctggagatgcccgtgctgaagagcagcctggccttgggcctggagctgcgggccgcagcc gggagccactttgatgctgcgaaggccgtggaggaacagctgagaaagtcgttccagatc cgctgcggcctggaggagagcgtgtccgaggggctgaacgtgccgcgctccaagcggctc ttccgggacctggtgagcctgcaggtgccggaggaacaggttctgaatgccgcgctcagg gagaaattggctctcctgccgccacaggctcgagccccgcacccaaaggagccacctggg cctgggccagacatgaccatcttgtgtgacccagaaacgctattttatgaatctccacac ctgaccctggacggtctgccccctctccgacttcaactccggccccgcccttcagaggac accttcctcatgcaccggacactgaggcgatgggaagcgtag >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_5|602_aa MSGGKKKSSFQITSVTTDYEGPGSPGASDPPTPQPPTGPPPRLPNGEPSPDPGGKGTPRN GSPPPGAPSSRFRVVKLPHGLGEPYRRGRWTCVDVYERDLEPHSFGGLLEGIRGASGGAG GRSLDSRLELASLGLGAPTPPSGLSQGPTSWLRPPPTSPGPQARSFTGGLGQLVVPSKAK AEKPPLSASSPQQRPPEPETGESAGTSRAATPLPSLRVEAEAGGSGARTPPLSRRKAVDM RLRMELGAPEEMGQVPPLDSRPSSPALYFTHDASLVHKSPDPFGAVAAQKFSLAHSMLAI SGHLDSDDDSGSGSLVGIDNKIEQAMDLVKSHLMFAVREEVEVLKEQIRELAERNAALEQ ENGLLRALASPEQLAQLPSSGVFFWRQKIKPTISGHPDSKKHSLKKMEKTLQVVETLRLV ELPKEAKPKLGESPELADPCVLAKTTEETEVELGQQGQSLLQLPRTAVKSVSTLMVSALQ SGWQMCSWKPPRDGDTAKVKVSGKTLREVQLCLFPVTAEIVQIATYYILDLLNSDLGIRK EVAGKQSLSWTVEKPHTNISLSNRLSWTVEKPHTNISLSNPEASMESLSTPRPPNISCVL IE >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_5|1809_bp atgagcgggggcaagaagaagagtagtttccaaatcaccagcgtcaccacggactatgag ggccctgggagcccaggggcttcggatccccctaccccacagcccccaaccgggcccccg ccccgcctgcccaatggggagcccagccccgatccggggggcaagggcaccccccggaat ggctccccaccacctggggccccttcctcccgtttccgggtggtgaagctgccccacggc ctgggagagccttatcgccgcggtcgctggacgtgtgtggatgtttatgagcgagacctg gagccccacagcttcggcggactcctggagggaattcgaggggcctcagggggcgccggg ggcagatctttggattccaggttggagctggccagcctcggcctgggcgcccccacccca ccgtcaggcctgtctcagggccccacctcctggctccgtccaccccccacctctcctgga cctcaggcccgctccttcactgggggactgggccagctggtggtgcccagcaaagccaag gcagagaaacccccactgtcggcctcctcaccccagcagcgccccccagagcctgagacc ggtgagagtgcgggcacatcccgggctgccacgcccctgccctctctgagggtggaagcg gaggctgggggctcaggggccaggacccctccactgtcccggaggaaagctgtagacatg cggctgcggatggagttgggtgctccagaagagatggggcaggtgcccccacttgactct cgccccagctccccagccctctacttcacccacgatgccagcctggttcacaaatctcca gaccccttcggagcagtagcagctcagaagttcagcctggcccactccatgttggccatc agtggtcacctagacagcgacgatgatagtggctccggaagcctggttggcattgacaac aaaatcgagcaagccatggacttggtgaagtcccacctcatgtttgcggtccgggaggag gtggaggtgctgaaggagcagatccgggaattggcggagcggaacgctgcgctggagcag gagaatgggctgctgcgcgccctggccagcccggagcagctggctcagctgccctcctcg ggggttttcttctggaggcaaaaaattaaaccaaccatctcaggacaccctgactccaag aaacactcattgaagaagatggagaagactctccaggtggttgagactttgaggttggtc gagctcccaaaagaggctaagcccaagttgggtgagtcccccgagctggcagatccctgc gtgttggccaagactacagaggagaccgaggtggagctgggccaacagggccaatcccta ctgcagctgccgaggacggccgtcaagtctgtctccacgctcatggtctctgccctgcag agcggctggcagatgtgcagctggaagccccccagagatggggatactgccaaggtgaag gtttcggggaagaccctcagggaggttcagctgtgtttgtttcctgtgactgctgaaatt gtacagattgccacatactacatcctagacctcctgaattctgatttggggatcaggaag gaggttgctgggaagcagagcctctcttggactgtggaaaaacctcacaccaacatctcc ttgtccaatcgcctctcttggactgtggaaaaacctcacaccaacatctccttgtccaat ccagaggcttccatggaatcactctccaccccaagaccccccaacatctcctgtgtgctc attgagtaa >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_6|842_aa MNLLYRKTKLEWRQHKEEEAKRSSSKEVAPAGSAGPAAGQGPGVRVRDIASLRRSLRMGF MTMPASQEHTPHPCRSAMAPRSLSCHSVGSMDSVGGGPGGASGGLTEDSSTRRPPAKPRR HPSTKLSMVGPGSGAETPPSKKAGSQKPTPEGRESSRKVPPQKPRRSPNTQLSVSFDESC PPGPSPRGGNLPLQRLTRGSRVAGDPDVGAQEEPVYIEMVGDVFRGGGRSGGGLAGPPLG GGGPTPPAGADSDSEESEAIYEEMKYPLPEEAGEGRANGPPPLTATSPPQQPHALPPHAH RRPASALPSRRDGTPTKTTPCEIPPPFPNLLQHRPPLLAFPQAKSASRTPGDGVSRLPVL CHSKEPAGSTPAPQVPARERETPPPPPPPPAANLLLLGPSGRARSHSTPLPPQGSGQPRG ERELPNSHSMICPKAAGAPAAPPAPAALLPGPPKDKAVSYTMVYSAVKVTTHSVLPAGPP LGAGEPKTEKEISVLHGMLCTSSRPPVPGKTSPHGGAMGAAAGVLHHRGCLASPHSLPDP TVGPLTPLWTYPATAAGLKRPPAYESLKAGGVLNKGCGVGAPSPMVKIQLQEQGTDGGAF ASISCAHVIASAGTPEEEEEEVGAATFGAGWALQRKVLYGGRKAKELDTEVEDGARAWNG SAEGPGKVEREDRGPGTSGIPVRSQGAEGLLARIHHGDRGGSRTALPIPCQTFPACHRNG DFTGGYRLGRSASTSGVRQVVLHTPRPCSQPRDALSQPHPALPLPLPLPPQPARERDGKL LEVIERKRCVCKEIKARHRPDRGLCKQESMPILPSWRRGPEPRKSGTPPCRRQHTVLWDT AI >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_6|2529_bp atgaacctcctctaccgaaaaaccaagctggagtggaggcagcacaaggaagaggaggcc aagaggagctccagtaaggaggtggcccccgctggctcggctgggcccgcggccggccag gggcctggggtccgcgtgcgggacatcgcctcgctgcggcgctccctcaggatgggtttc atgacgatgcccgcctcccaggagcacaccccgcacccctgccgcagcgccatggcccca cgctccctctcctgccactcggtgggcagcatggacagtgtcgggggtggccctggcggg gccagtgggggcctcacagaggacagcagcacccgaagaccccctgccaagccccggaga caccccagcaccaagctcagcatggtggggcctgggtctggggcagagacgccccccagc aagaaagcaggctcacagaagccaaccccagagggccgagagtccagccggaaggttcct ccgcagaagcccaggcgaagccctaacacccagctctctgtctccttcgatgagtcctgc cccccaggcccctctcctcgaggggggaacctgcctcttcagcgcctcactagggggtcc cgagtagctggggaccctgatgtgggtgcccaggaagagcctgtgtacattgagatggtg ggggacgtctttaggggaggaggacgaagtggaggaggcctggctgggccccctcttggg ggtgggggcccgacccctccagcgggcgccgactcggactctgaagagagtgaggccatc tatgaagagatgaagtacccgctgccggaagaggctggggaaggccgggccaatggccct ccaccattgacggcaacatccccgccacaacagcctcacgcccttccgccccatgcccac cgccgcccagcttcagccctcccgagccggagggacgggacgcccaccaagaccactcct tgtgaaatccccccgcccttccccaacctccttcagcaccggcctccactcctggccttc ccccaagccaagtctgcttcccgaacccctggcgatggggtctcaaggctacctgtcctc tgccactccaaggagccagccggctccaccccagctccccaagtgcctgcacgggagcgg gagacgcctcccccaccgcctccacctcctgctgccaacctgctgctgctgggaccatcg ggccgggcccggagccactcgacaccgttgccaccccagggctctggccagccccggggg gagcgggagctccccaactcccacagcatgatctgccctaaggcggcgggggcgccggca gccccccctgccccggccgccttgctccccggcccccccaaggacaaggccgtgtcttac accatggtgtactcggcggtcaaggtgaccacgcactctgtcctgccagctggtccaccc ctgggtgctggggagccaaagacggagaaggagatctcggtcctccatgggatgctgtgt accagctcaaggccccctgtgccagggaagaccagcccccacggtggggccatgggcgca gcagctggggtcctccaccaccgcggctgcctggcctccccccacagccttccggaccca actgtaggccccctgaccccgctgtggacctacccagccacagcagctgggctcaagaga ccccctgcctatgagagcctcaaggctgggggggtgctgaataagggctgtggtgtgggg gccccatcccccatggtcaagatccagctgcaggagcaagggaccgatgggggtgctttt gccagcatctcctgtgcccacgtcatcgccagcgcagggacaccagaggaggaagaagag gaggtgggcgccgcgacatttggggcaggctgggccctgcagaggaaggtcctctatgga gggagaaaagcaaaggagttggacacagaggtcgaggacggtgcccgggcctggaatggc agtgccgagggtccaggcaaggtggagcgtgaggacaggggccctgggacatcggggatc ccagtgagaagccagggggcagagggactgctggccaggatccaccatggagaccgagga gggagccgcaccgcgctgcccattccctgccagaccttccccgcctgccaccgcaatgga gacttcacgggaggctaccgcctggggcgctccgcctccacctccggagtccggcaggtc gtgctccacacaccccggccctgcagccagcccagggatgccctgagccagccccacccc gcgctgccgctgcctctgcccctgccgccccagccggcccgcgagcgtgacgggaagctg ctggaggtgatcgagcgcaagcgctgcgtgtgcaaggagatcaaggcgcgccaccgcccg gaccgaggcctctgcaagcaggagagcatgcccatcctccccagctggcggcggggaccc gagccccgcaagtccggcaccccgccctgccgccggcagcacacggtcctctgggacacc gccatctga >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_7|83_aa MQEAMRAMKDERDSGQFPANLKFLDFQFQPTYVLMDVILIVIDNIMLICLGLEVSKIPQV LLRRGCNGVVRYNWQKCKDLSHQ >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_7|252_bp atgcaggaggctatgcgggccatgaaagatgaaagagacagtggccagttcccagccaac ctcaaattcttagattttcagttccagcccacctatgtccttatggatgtcatcctgatc gtcattgacaatatcatgctgatttgccttggacttgaagtatcaaagatcccccaggtc ttgctgagacgtggatgtaatggggttgtgagatataactggcagaaatgcaaggacctg tcacatcagtag >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_8|672_aa MAAKKGPGPGGGVSGGKAEAEAASEVWCRRVRELGGCSQAGNRHCFECAQRGVTYVDITV GSFVCTTCSGLLRGLNPPHRVKSISMTTFTEPEVVFLQSRGNEVSCHRWSGEKVTYLQVC RKIWLGLFDARTSLVPDSRDPQKVKEFLQEKYEKKRWYVPPDQVKGPTYTKGSASTPVQG SIPEGKPLRTLLGDPAPSLSVAASTSSQPVSQSHARTSQARSTQPPPHSSVKKASTDLLA DIGGDPFAAPQMAPAFAAFPAFGGQTPSQGGFANFDAFSSGPSSSVFGSLPPAGQASFQA QPTPAASRMLTESYSFGSSQGTPFGATPLAPASQPNSLADVGSFLGPGVPAAGVPSSLFG MAGQVPPLQSVTMGGGGGSSTGLAFGAFTNPFTAPAAQSPLPSTNPFQPNGLAPGPGFGM SSAGPGFPQAVPPTGAFASSFPAPLFPPQTPLVQQQNGSSFGDLGSAKLGQRPLSQPAGI STNPFMSSGDHLPVGISMGLGDGGGANALLGAYSTQGSVTQGGLELVTVIAAVISQGRVL QGDLVDGEWEGEGIKRSSSATLLPSPLNEARRRRPRGEPGMAAVPLSQPLPWACPADVRF CGHLRADPPRPECSESEQKFRACRARLQRSLSLAGACTTSKCADARQHHLIVLYTRDSSL RVAAAGEAEQQA >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_8|2019_bp atggcggcgaagaagggcccgggcccgggcggcggggtcagcgggggcaaggcggaggcg gaggcggcctcggaggtgtggtgccgtcgggtgcgggagctgggtggctgcagccaggcc gggaaccgccactgcttcgagtgcgcccagcgcggggtcacctacgtggatatcaccgtg ggcagcttcgtgtgcaccacctgctccggcctcctgagagggctgaacccccctcatcgt gtcaagtcaatctccatgacaactttcactgagcctgaagtagtattcctgcaatcccgt ggaaatgaggtgagctgccaccgatggagtggtgagaaggtcacctatctgcaggtttgc cggaagatttggttgggtctgtttgatgctcggacatctttagtaccagattccagggat cctcagaaagtgaaggagtttctccaggaaaaatatgagaagaagagatggtatgtcccc ccagaccaagtcaaggggcccacttataccaaaggcagtgcctccacccctgtgcagggc tccatcccagaagggaagccccttcggacacttctgggtgatcctgcaccgtctctctca gttgctgcctccacctcgagccagcccgtcagtcagtctcacgctcggacatcccaggcc cggagcactcagccacctccccactcctctgtcaaaaaagccagtactgacctgctggct gacatcggtggagacccctttgctgcaccccagatggcaccagcttttgctgcattccct gcctttgggggccagacaccttcccaaggaggctttgccaactttgatgcctttagcagt ggccccagctcttctgtgtttggaagcctccctccagctggtcaagcctcgttccaggcc cagccaactcctgcagccagtcggatgctaactgaaagttacagctttgggagcagccag gggactccatttggtgccactcccctggcacccgccagtcagccaaacagcctcgcagac gtgggcagcttcctgggacccggggtgcccgctgcaggtgttcctagcagcctcttcggg atggctggccaggtccccccgctccagtctgtcacgatgggcggcggcggcggcagcagc acagggctggcctttggagccttcactaaccctttcacagctcccgccgcccagtccccg ctgccttccaccaacccgttccagcccaatggcttggcgccagggcccggctttgggatg agcagtgctgggcctggcttcccccaggcagtgccacccactggggcctttgccagctcc ttcccagcaccgctgttccccccgcagaccccgcttgttcagcagcagaatggctcttcc ttcggggacttaggatcagccaagttggggcagaggccactgagccagccagctgggatc tccaccaaccccttcatgagctctggtgaccacttgcctgtgggcatttctatgggcctt ggggatggtggaggtgctaatgctttgcttggggcctacagcacccaaggttcagtgaca cagggtggtttggagctggtcactgtcatagcagctgtgatttcacaaggaagggtgctg cagggggacctggttgatggggagtgggaaggggaaggaataaagagatcttcctcagcc actttattgccttcgcccctgaatgaagcccgcaggcggcggccccggggagagccggga atggccgccgtgcccctcagccagccgctgccctgggcctgccctgccgacgtgcggttc tgcggccacctgcgcgcagacccgccgcgtcccgagtgttccgagagcgagcagaagttt cgcgcgtgccgagcgcggctccagcgcagcctgagcctggcgggtgcgtgcaccaccagc aagtgcgcagatgcgcgccagcaccacctgatcgtcctctacacccgcgacagcagcctg cgcgtggcggcggcaggcgaggcggagcagcaggcgtga >gi568815591r:100366962_100578038|GENSCAN_predicted_peptide_9|498_aa STDEFSELSFRISELAREPRGPRERKEDGSADGDPVQIDFIDSHVPGEDEERGTVEEQRP PELSPGAGDRERAPSSRREEPAGEERRRPDTLQLWQERERRQQQQSGAWGAPRKDSGSPK SSASQAGAAAGQGAPAPAPASQEPLPIAGPATAPAPRPLGSIQRPNSFLFRSSSQSGSGP SSPDSVLRPRRYPQVPDEKDLMTQLRQVLESRLQRPLPEDLAEALASGVILCQLANQLRP RSVPFIHVPSPAVPKLSALKARKNVESFLEACRKMGVPEADLCSPSDLLQGTARGLRTAL EAVKRVGGKALPPLWPPSGLGGFVVFYVVLMLLLYVTYTRLLAHSPHPVALPGSGESSCC CPTTCQARPLSSPPEPQVAWEVAPSRMTPLAPWDPKYEAKAGPRPVWGANCSSGASFSGR TLCHPSFWPLYEAASGRGLRPVAPATGHWNGQQAPPDAGFPVVCCEDVFLSDPLLPRGQR VPLYLSKAPQQTPDTHCP >gi568815591r:100366962_100578038|GENSCAN_predicted_CDS_9|1497_bp tcaacagatgaattttcagagctgtcattccggatctcagagctggcccgggagccccgg ggacccagagaacgcaaggaggatggctcagcggacggagaccctgtgcagattgacttc atcgacagccatgtccccggggaggatgaagagcgaggcactgtggaggagcagcgacca cccgaattaagccctggggcaggggacagggagagggcaccaagcagcaggcgggaggag ccggcaggggaggagcggcggcgcccggacaccttgcagctgtggcaggagcgggaacgg cggcagcagcagcagagcggggcgtggggggccccgaggaaggatagcggctcgcctaag tccagtgcctcccaagcaggggctgcagcggggcagggagcccccgcccctgcccctgcc tcccaagagccccttcccatagctggaccagcgacagcacctgctccacggccacttggc tccattcagagaccaaacagcttcctcttccgttcctcctctcagagtggctcaggccct tcctcaccagactctgtcctgagacctcggcggtacccccaggttccagatgagaaggac ttaatgactcagctgcgccaggtccttgagtcccggctgcagcggcccctgcctgaggac ctggccgaggctctggccagtggggtcatcctgtgccagctggccaaccagctacggccg cgctccgtgcccttcatccatgtgccctcccctgctgtgccaaaactcagtgccctcaag gctcggaagaatgtggagagttttctagaagcctgtcgaaaaatgggggtgcctgaggct gacctgtgctcgccctcggatctcctccagggcactgcccgggggctgcggaccgcgctg gaggccgtgaagcgggtggggggcaaggccctaccgcccctctggcccccctctggtctg ggcggcttcgtcgtcttctacgtggtcctcatgctgctgctctatgtcacctacactcgg ctcctggcccacagcccccatcccgtagcccttcctggatctggagagagcagctgctgc tgccccaccacctgccaggcccggccactgagcagccccccggagccccaggtggcctgg gaggtggccccctcgaggatgactccactagcgccctgggaccccaagtatgaagccaaa gcaggacctcggccggtgtggggggccaactgtagctcaggagcctcgttctcaggccgg acgctgtgtcacccctcattctggccgctgtatgaagcagcctcgggcaggggtctcagg cccgtggcccctgccacagggcactggaatggacagcaggcgcccccagatgcagggttc ccggtggtgtgctgtgaagatgtcttcctctcggaccctctgctgccccgggggcagcgt gttcccctgtacctgtccaaggccccccagcagaccccagacacccattgtccatag