GENSCAN 1.0    Date run: 3-Nov-116    Time: 19:33:33

Sequence gi568815591r:75324055_75611113 : 287059 bp : 47.78% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   1675   1589   87  2  0   62   97   126 0.985  10.84
 1.02 Intr -   3709   3570  140  0  2   50  111   107 0.960   9.31
 1.01 Init -   4038   3971   68  0  2   76   48    50 0.950   0.34
 1.00 Prom -   5432   5393   40                              -7.56

 2.00 Prom +   7995   8034   40                              -7.66
 2.01 Init +   8841   8916   76  2  1   68   64    93 0.573   4.15
 2.02 Intr +   9815   9905   91  0  1   36   89    71 0.796   1.05
 2.03 Intr +  13038  13213  176  0  2   42   94    94 0.360   4.98
 2.04 Intr +  14209  14427  219  2  0   42   54   265 0.446  16.77
 2.05 Intr +  15460  15518   59  2  2   86  116    35 0.939   4.70
 2.06 Intr +  16505  16635  131  1  2   53   17    89 0.776  -2.21
 2.07 Intr +  18537  18780  244  0  1   80   90   268 0.974  23.70
 2.08 Term +  19861  19965  105  0  0  110   43    91 0.993   5.21
 2.09 PlyA +  21131  21136    6                               1.05

 3.06 PlyA -  21144  21139    6                               1.05
 3.05 Term -  22780  22724   57  1  0   25   48    95 0.169  -3.11
 3.04 Intr -  27930  27782  149  1  2    3   87   109 0.625   2.25
 3.03 Intr -  29601  29515   87  1  0   62   97   126 0.997  10.84
 3.02 Intr -  31634  31495  140  1  2   50  111   103 0.956   8.91
 3.01 Init -  31963  31896   68  1  2   76   48    46 0.944  -0.06
 3.00 Prom -  33338  33299   40                              -7.56

 4.00 Prom +  34156  34195   40                              -9.16
 4.01 Init +  35341  35472  132  0  0   94   91    96 0.963  10.64
 4.02 Intr +  37918  38034  117  0  0   79   98   107 0.997  11.46
 4.03 Intr +  41201  41299   99  1  0   73  100   176 0.997  17.61
 4.04 Intr +  41510  41589   80  1  2  109   78    63 0.999   5.75
 4.05 Intr +  41898  41951   54  0  0  108   -1   118 0.783   2.99
 4.06 Intr +  42270  42384  115  0  1   84   56    76 0.693   4.45
 4.07 Intr +  42696  42847  152  2  2   81   29   107 0.048   3.06
 4.08 Intr +  59186  59234   49  2  1  125   13    53 0.015   0.18
 4.09 Term +  65236  65352  117  0  0   37   48   109 0.163   0.34
 4.10 PlyA +  66541  66546    6                               1.05

 5.06 PlyA -  67911  67906    6                               1.05
 5.05 Term -  68301  68152  150  0  0   86   43    79 0.845   1.11
 5.04 Intr -  69513  69311  203  1  2  124   49    88 0.934   7.50
 5.03 Intr -  70302  70213   90  1  0   95   61    26 0.597   0.57
 5.02 Intr -  71081  70964  118  2  1   60   73   239 0.913  19.64
 5.01 Init -  71227  71171   57  1  0   75   94    24 0.229   3.01
 5.00 Prom -  71332  71293   40                              -5.56

 6.00 Prom +  73399  73438   40                              -8.06
 6.01 Init +  74880  75278  399  2  0   43   81   545 0.929  43.97
 6.02 Intr +  79591  79686   96  0  0   78   72   242 0.966  21.81
 6.03 Intr +  80785  81015  231  0  0   74   92   400 0.998  36.97
 6.04 Term +  83778  83882  105  2  0   89   33    88 0.519   1.81
 6.05 PlyA +  85618  85623    6                               1.05

 7.00 Prom +  85927  85966   40                              -5.36
 7.01 Init +  86314  86406   93  0  0  103   74   176 0.999  18.32
 7.02 Intr +  86552  86672  121  1  1   90   49   281 0.992  24.47
 7.03 Intr +  87601  87705  105  2  0   70   94    15 0.532   0.49
 7.04 Intr +  91066  91180  115  2  1   82   96    77 0.491   7.41
 7.05 Intr +  91538  91716  179  2  2  117   80   291 0.738  30.76
 7.06 Intr +  91927  92137  211  2  1   60   92   193 0.664  14.87
 7.07 Term +  92221  92365  145  1  1  104   48    87 0.571   3.68
 7.08 PlyA +  92711  92716    6                               1.05

 8.13 PlyA -  92755  92750    6                               1.05
 8.12 Term -  94839  94742   98  1  2   96   38   112 0.996   5.13
 8.11 Intr -  95388  95266  123  1  0  115  111    36 0.996   9.16
 8.10 Intr -  99149  97455 1695  0  0   99   94  1378 0.986 126.82
 8.09 Intr - 100574 100472  103  2  1   99   79   106 0.857  10.55
 8.08 Intr - 101141 101020  122  0  2   45   52    93 0.685   1.61
 8.07 Intr - 101654 101581   74  1  2   85   73    39 0.748   1.15
 8.06 Intr - 102358 102249  110  1  2   31   81   101 0.425   2.78
 8.05 Intr - 113632 113461  172  0  1   81   87   142 0.984  13.35
 8.04 Intr - 115170 115090   81  2  0   81   97    27 0.757   1.65
 8.03 Intr - 117061 116900  162  0  0   71   87   111 0.993   8.39
 8.02 Intr - 117593 117378  216  1  0  101   76   105 0.990   8.22
 8.01 Init - 118665 118091  575  0  2   85   93   236 0.979  16.97
 8.00 Prom - 124341 124302   40                              -3.76

 9.00 Prom + 127498 127537   40                              -5.86
 9.01 Init + 127552 127726  175  0  1   71   77   133 0.727  10.01
 9.02 Intr + 162511 162559   49  2  1  135   20    70 0.009   2.64
 9.03 Intr + 169978 170153  176  1  2   49   94   120 0.133   8.28
 9.04 Intr + 171102 171320  219  1  0   39   54   189 0.689   8.87
 9.05 Intr + 172620 172850  231  1  0   49   54   223 0.799  12.74
 9.06 Intr + 173884 173942   59  2  2   86  116    42 0.949   5.40
 9.07 Intr + 175177 175262   86  0  2   94   65     5 0.767  -2.58
 9.08 Intr + 177308 177701  394  2  1   80   92   353 0.692  29.56
 9.09 Term + 178779 178883  105  2  0  107   43    98 0.999   5.61
 9.10 PlyA + 180051 180056    6                               1.05

10.42 PlyA - 180064 180059    6                               1.05
10.41 Term - 183467 183435   33  2  0   72   43    32 0.219  -5.31
10.40 Intr - 187060 186877  184  0  1   86   87   164 0.916  15.89
10.39 Intr - 188357 188255  103  0  1   33   44    33 0.558  -7.37
10.38 Intr - 188621 188535   87  0  0   40   97   109 0.991   6.94
10.37 Intr - 190725 190591  135  1  0   30  111    66 0.920   3.64
10.36 Intr - 191774 191664  111  0  0   94  111    68 0.952  10.05
10.35 Intr - 192166 192040  127  1  1   78   68   159 0.982  13.15
10.34 Intr - 201909 201851   59  1  2  154  109    22 0.969   9.50
10.33 Intr - 203654 203511  144  0  0   67  110     1 0.528   0.45
10.32 Intr - 209846 209704  143  1  2  103  -21    69 0.044  -2.50
10.31 Intr - 213431 213248  184  0  1   60   70   106 0.550   4.85
10.30 Intr - 215377 215269  109  1  1   69  121   169 0.999  18.16
10.29 Intr - 217926 217865   62  1  2  107   94    62 0.605   7.15
10.28 Intr - 218920 218797  124  1  1   79   68   180 0.999  15.26
10.27 Intr - 220746 220641  106  2  1   81   97    80 0.993   8.42
10.26 Intr - 221134 221034  101  1  2   87   78    55 0.998   3.31
10.25 Intr - 222978 222885   94  2  1   91   84   128 0.556  12.67
10.24 Intr - 223759 223701   59  1  2   49   71    44 0.570  -3.52
10.23 Intr - 224947 224837  111  1  0   69   75   187 0.969  16.08
10.22 Intr - 229535 229399  137  0  2   53   65   203 0.904  14.79
10.21 Intr - 230472 230386   87  1  0  127  116    73 0.999  13.94
10.20 Intr - 231497 231362  136  2  1   67   86    82 0.982   6.04
10.19 Intr - 232115 232059   57  2  0   75  105   100 0.998   9.48
10.18 Intr - 232757 232656  102  2  0  114   99    74 0.998  11.47
10.17 Intr - 233716 233600  117  1  0  102  113   172 0.999  21.76
10.16 Intr - 234201 234113   89  1  2   63  116   162 0.995  16.19
10.15 Intr - 235861 235678  184  1  1   42   61   409 0.682  32.96
10.14 Intr - 237347 237275   73  1  1   89   60    54 0.819   2.11
10.13 Intr - 238116 238019   98  0  2  106   81   161 0.869  16.01
10.12 Intr - 239021 238881  141  2  0   73   94   167 0.999  16.35
10.11 Intr - 239209 239134   76  0  1  102   99   130 0.697  15.02
10.10 Intr - 244202 244145   58  0  1  120  106    94 0.999  12.34
10.09 Intr - 249847 249707  141  2  0   88   76   186 0.959  17.72
10.08 Intr - 257244 257183   62  2  2   52  109    91 0.967   5.98
10.07 Intr - 258097 258021   77  1  2  143   89   118 0.999  15.71
10.06 Intr - 262779 262699   81  0  0  107   87   100 0.999  11.63
10.05 Intr - 268079 268002   78  2  0  116   92    37 0.685   6.65
10.04 Intr - 275193 275068  126  0  0  128   55    99 0.428  11.48
10.03 Intr - 275726 275618  109  1  1   39   42    90 0.514  -0.21
10.02 Intr - 281235 281122  114  2  0   51   92    58 0.501   2.06
10.01 Init - 286676 286555  122  2  2  110   -8   117 0.777   3.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 151134 151008  127  0  1   78   68   136 0.945  10.85
S.002 Intr - 160202 160144   59  0  2  154  109    24 0.932   9.70

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_1|99_aa
MLSVFKKEDTIIAKDFGNLRDTITEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKKIVGN
SLDAGATNIDLKLKDYGMDLIEVSGNGCGVEEENFEGLX

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_1|297_bp
atgctttcagttttcaagaaagaagacaccattattgccaaagattttggtaatttgaga
gatacaattacagaacctgctaaggccatcaaacctattgatcggaagtcagtccatcag
atttgctctgggccggtggtactgagtctaagcactgcggtgaagaagatagtaggaaac
agtctggatgctggtgccactaatattgatctaaagcttaaggactatggaatggatctc
attgaagtttcaggcaatggatgtggggtagaagaagaaaacttcgaaggcttaann

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_2|366_aa
MLDIMVSVSWAHYLPASASRSAGITEGKFSSRYRNHDSAENRREPQRWQHDQTTDSVQKK
TKDRTETRFGEMGQILGKIMMSHQPQPQEERSPQRSTSGYPLQEVVDDEVLGPSAPGVDP
SPPRRSLGWKRKRECLDESDDEPEKELAPEPEETWVAETLCGLKMKAKRRRVSLVLPEYY
EAFNRLLEDPVIKRLLAWDKDLRVSDKIPSEPTILGASPKTLPPASRICIRPSNTPPPRN
FHMSTVTPTLSYLANDMEEDDEAPKQNIFYFLYEETRSHIPLLSELWFQLCRYMNPRARK
NCSQIALFRKYRFHFFCSMRCRAWVSLEELEENTGPRGDVDFQQELYSNANGRHQEGGEE
PFVQII

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_2|1101_bp
atgttggacatcatggtctcggtctcttgggctcattatctgcctgcctcggcctcccga
agtgctggcattacagaggggaagttctcttcaaggtacagaaaccacgactccgcagag
aaccgcagagaaccgcagagatggcagcatgaccagaccacggacagtgtccagaagaag
accaaggacagaacagagactaggtttggtgagatgggacagattttgggaaagatcatg
atgagccatcaaccgcagccccaggaagagcggagcccccagcggagcacctcagggtac
cccctccaggaggtggtggatgatgaagtgttgggaccatcagcccctggggtagatccc
agccccccacgtaggtcccttggctggaaaaggaagagggaatgtttggatgaatctgat
gatgagccagagaaggagctcgcccctgagcctgaggagacctgggtggcggagacgctg
tgtggcctcaagatgaaggcgaagcgacggcgagtgtcgctcgtgctccctgagtactac
gaggccttcaacaggctgcttgaggatcctgtcattaaaagactcctggcctgggacaaa
gatctgagggtgtcggacaagatcccatcggagcccaccatcctgggagcatcaccaaaa
acccttcctccggcttctcggatttgcatccgaccttcgaatacccctccaccccgcaat
ttccacatgagcacagtcaccccaacactgagctatctggccaatgacatggaggaggac
gacgaggcccccaaacaaaacatcttctacttcctgtacgaggagacccgctctcatata
cccttgctcagtgagctttggttccagttatgccgttacatgaacccgagggccaggaag
aactgctctcagatagccttgttccggaagtatcggttccacttcttttgttccatgcgc
tgcagggcttgggtttccctggaggagttggaagagaacaccggacccaggggagatgtg
gattttcagcaggaactttattccaatgctaatggcagacatcaggaaggaggagaggaa
ccatttgtgcagatcatctag

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_3|166_aa
MLSVFKKEDTIIAKDFGNLRDTITEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKKIVEN
SLDAGAANIDLKLKDYGMDLIEVSGNGCGVEEENFEGLSWDSTGVFDHDGKIIQKTPYPH
PRGTTVSVKQLFSTLPVRHKEFQRNIKKPVGEKYMRSVDTQACSCI

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_3|501_bp
atgctttcagttttcaagaaagaagacaccattattgctaaagattttggtaatttgaga
gatacaattacagaacctgctaaggccatcaaacctattgatcggaagtcagtccatcag
atttgctctgggccggtggtactgagtctaagcactgcggtgaagaagatagtagaaaac
agtctggatgctggtgccgctaatattgatctaaagcttaaggactatggaatggatctc
attgaagtttcaggcaatggatgtggggtagaagaagaaaacttcgaaggcttaagttgg
gactcgactggtgtttttgatcacgatgggaaaatcatccagaaaaccccctacccccac
cccagagggaccacagtcagcgtgaagcagttattttctacgctacctgtgcgccataag
gaatttcaaaggaatattaagaagccagttggtgagaagtacatgcggtctgtggacacc
caagcttgcagctgcatctga

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_4|304_aa
MPMYQVKPYHRVCAPLRVEPTCMYWLPNMHGRSGGPALGTGHLQVAKYPKKGSQAVHRHS
RKQSEPPANDLFNAAKAAKSDMQHREVRVKCVKALKGLYGNRDLTARLELFTGRFKDWMV
SMIMDREYSVAVEAVRLLILILKNMEGVLMDVDCESVYPIVLFYPECEIRTMGGREQRQS
PGAQRTFFQLLLSFFVESKLHDHAAYLVDNLWDCAGTQLKDWEGLTSLLLEKDQSTCHME
PGPGTFHLLGESPGISAIVQGAIEKSGERSSPPPKRIGSMKEGPMVLPLALDTQCWFPVE
VPLD

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_4|915_bp
atgccgatgtaccaggtaaagccctatcaccgggtctgcgcccctctccgtgtggagccc
acctgcatgtactggctccccaacatgcacggcaggagcggcggcccagcacttggcact
ggccacttgcaggtggcaaaatatccaaagaaagggtcccaagcggtacatcgtcatagc
cggaaacagtcagagccaccagccaatgatcttttcaatgctgcgaaagctgccaaaagt
gacatgcagcaccgagaagtccgcgtgaagtgcgtgaaggctctgaaagggctgtacggt
aaccgggacctgaccgcacgcctggagctcttcactggccgcttcaaggactggatggtt
tccatgatcatggacagagagtacagtgtggcagtggaggccgtcagattactgatactt
atccttaagaacatggaaggggtgctgatggacgtggactgtgagagcgtctaccccatt
gtacttttctaccctgagtgcgagataagaacgatgggtggaagagagcaacgccagagc
ccaggtgcccagaggactttcttccagcttctgctgtccttctttgtggagagcaagctc
cacgaccacgctgcttacttagtagacaacctgtgggactgtgcagggactcagctgaag
gactgggagggtctgacaagcctgctgctggagaaggaccagagcacgtgccacatggag
ccagggccagggaccttccacctcctaggagaatctccaggaatttctgccattgtgcag
ggagcaattgaaaaatcaggcgagaggagctcgcccccacccaaacgtattggttcgatg
aaggaagggcccatggttctgccactggccctggacacccagtgctggtttcccgtggaa
gtccccctggactga

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_5|205_aa
MGGSALNQGVLEGDDAPGQSLYERLSQRMLDISGDRGVLKDVIREGAGDLVAPDASVLVK
YYGYLEHLDRPFDSNYFRKTPRLMKLGEDITLWGMELGLLSMQRGELARFLFKPNYAYGT
LGSPPLIPPNTTVLFKIELLDFLDCAESDKFCALSAPHHSSLDLQLRTPGGLRTTPSRAS
ELPLRFLLRNLLPTFLRVVGDSSRG

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_5|618_bp
atggggggaagcgcgttaaaccagggagtcctggaaggggacgacgcccccggccagtcc
ctgtacgagcggttaagtcagaggatgctggacatctcgggggaccggggcgtgctgaag
gacgtcatccgagaaggagctggagacctagtggcgcctgatgcttcggtgctagtgaaa
tactatggatacctggaacacttggacagacccttcgattctaattactttaggaaaact
cctcggctaatgaaacttggagaggatattacattgtggggcatggagctgggccttctg
agcatgcagagaggagagctggccaggtttctgttcaaaccgaactacgcctatggaacg
ctgggctcccctcccttgatccccccaaacaccactgtcctgttcaagattgagctgctt
gacttcctagactgtgctgagtcagacaagttttgtgctctctcagctcctcaccactcc
tccctggacctgcagctccgcacccccgggggcctcagaactaccccttccagggcctca
gaactacccctacggtttctcctgcgtaaccttctgcctaccttcctgagagtggttggt
gacagcagccggggctag

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_6|276_aa
MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHLDTKVRCPMCWQVV
DGSSSLPNVSLAWVIEALRLPGDPEPKVCVHHRNPLSLFCEKDQELICGLCGLLGSHQHH
PVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSWVIRREFQ
ELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLEQFGNEDHHEF
IWLQVVAAMPWCFLACDRIIPISASIFTWPFLYVPV

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_6|831_bp
atggcttggcaggtgagcctgctggagctggaggaccggcttcagtgtcccatctgcctg
gaggtcttcaaggagtccctaatgctacagtgcggccactcctactgcaagggctgcctg
gtttccctgtcctaccacctggacaccaaggtgcgctgccccatgtgctggcaggtggtg
gacggcagcagctccttgcccaacgtctccctggcctgggtgatcgaagccctgaggctc
cctggggacccggagcccaaggtctgcgtgcaccaccggaacccgctcagccttttctgc
gagaaggaccaggagctcatctgtggcctctgcggtctgctgggctcccaccaacaccac
ccggtcacgcccgtctccaccgtctgcagccgcatgaaggaggagctcgcagccctcttc
tctgagctgaagcaggagcagaagaaggtggatgagctcatcgccaaactggtgaaaaac
cggacccgaatcgtcaatgagtcggatgtcttcagctgggtgatccgccgcgagttccag
gagctgcgccacccggtggacgaggagaaggcccgctgcctggaggggatagggggtcac
acccgtggcctggtggcctccctggacatgcagctggagcaggcccagggaacccgggag
cggctggcccaagccgagtgtgtgctggaacagttcggcaatgaggaccaccatgagttc
atctggcttcaggtggtggctgccatgccctggtgtttcctggcctgtgaccgcatcatt
ccaatctctgcctccatcttcacgtggccttttctgtatgtgccggtctaa

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_7|322_aa
MGLYAAVAGVLAGVESRQGSIKGLVYSSNFQNVKQLYALVCETQRYSAVLDAVISSAGLL
SAKKLQPHLAKASQLPRFVRVNTLKTCSVYVVISRDKVSPIRVGLPGQLSPSHAAGPLPG
SHVIDACAAPGNKTSHLAALLKNQGKIFAFDLDAKRLASMATLLAWVGVSCCELAEEDFL
AVSPLDPRYREVHYVLLDPSCSGSGMPSRQLEDPGAGTPSPVRLHALAGFQQRALCHALT
FPSLQRLVYSMCSLCQEENEDMVPDALQQNPGAFRLAPALPARPHRGLSTFPGAEHCLRA
SPKTTLSGGFFVAVIERVEMPM

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_7|969_bp
atggggctgtacgctgcggtggcaggcgtgctggccggcgtggagagccgccagggctct
atcaaggggctggtgtactccagcaacttccagaacgtgaagcagctgtacgcgctggtg
tgcgaaacgcagcgctactccgccgtgctggatgccgtgatctccagcgccggcctcctc
agtgcgaagaagctgcagccgcacctggccaaggcctcccagctgcctcgatttgtgcgt
gtgaacactctcaagacctgctccgtttatgtagttatttcaagagacaaggtttctcct
atcagggtcgggcttccaggccagctgtctcccagccatgctgctggacccctgccaggc
tcccatgtcatcgatgcctgtgccgccccaggcaataagaccagtcacttggctgctctt
ctgaagaaccaagggaagatctttgcctttgacctggatgccaagcggctggcatccatg
gccacgctgctggcctgggttggcgtctcctgctgtgagctggctgaggaggacttcctg
gcggtctcccccttagatccgcgctatcgtgaggtccactatgtcctgctggatccttcc
tgcagtggctcgggtatgccgagcagacagctggaggatcccggggcagggacacctagc
ccggtgcgtctgcatgccctggcagggttccagcagcgagccctgtgccacgcgctcact
ttcccttccctgcagcggctcgtctactccatgtgctccctctgccaggaggagaatgaa
gacatggtaccagatgcgctgcagcagaacccgggcgccttcaggctagctcccgccctg
cctgcccggccccaccgaggcctgagcacgttcccgggtgccgagcactgcctccgggct
tcccccaagaccacgcttagcggtggcttcttcgttgctgtaattgaacgggtcgagatg
ccgatgtga

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_8|1176_aa
MSPAAAAAGAGERRRPIASVRDGRGRGCGGPAGAALLGLSLVGLLLYLVPAAAALAWLAV
GTTAAWWGLSREPRGSRPLSSFVQKARHRRTLFASPPAKSTANGNLLEPRTLLEGPDPAE
LLLMGSYLGKPGPPQPAPAPEGQDLRNRPGRRPPARPAPRSTPPSQPTHRVHHFYPSLPT
PLLRPSGRPSPRDRGTLPDRFVITPRRRYPIHQTQYSCPGVLPTVCWNGYHKKAVLSPRN
SRMVCSPVTVRIAPPDRRFSRSAIPEQIISSTLSSPSSNAPDPCAKETVLSALKEKKKKR
TVEEEDQIFLDGQENKRRRHDSSGSGHSAFEPLVASGVPASFVPKPGSLKRGLNSQSSDD
HLNKRSRSSSMSSLTGAYTSGIPSSSRNAITSSYSSTRGISQPSLIPLPDTGEASKENKV
FGILLQFSFATWTEGGEEREEELCHHSSSSTPLAADKESQGEKAADTTPRKKQNSNSQST
PGSSGQRKRKVQLLPSRRGEQLTLPPPPQLGYSITAEDLDLEKKASLQWFNQALEDKSES
AGAATTEALSPPKTPSLLPPLGLSQSGPPGLLPSPSFDSKPPTTLLGLIPAPSMVPATDT
KAPPTLQAETATKPQATSAPSPAPKQSFLFGTQNTSPSSPAAPAASSASPMFKPIFTAPP
KSEKEGLTPPGPSVSATAPSSSSLPTTTSTTAPTFQPVFSSMGPPASVPLPAPFFKQTTT
PATAPTTTAPLFTGLASATSAVAPITSASPSTDSASKPAFGFGINSVSSSSVSTTTSTAT
AASQPFLFGAPQASAASFTPAMGSIFQFGKPPALPTTTTVTTFSQSLPTAVPTATSSSAA
DFSGFGSTLATSAPATSSQPTLTFSNTSTPTFNIPFGSSAKSPLPSYPGANPQPAFGAAE
GQPPGAAKPALTPSFGSSFTFGNSAAPAPATAPTPAPASTIKIVPAHVPTPIQPTFGGAT
HSAFGLKATASAFGAPASSQPAFGGSTAVFSFGAATSSGFGATTQTASSGSSSSVFGSTT
PSPFTFGGSAAPAGSGSFGINVATPGSSATTGAFSFGAGQSGSTATSTPFTGGLGQNALG
TTGQSTPFAFNVGSTTESKPVFGGTATPTFGQNTPAPGVGTSGSSLSFGASSAPAQGFVG
VGPFGSAAPSFSIGAGSKTPGARQRLQARRQHTRKK

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_8|3531_bp
atgtctccggcggctgcggcggctggagcaggcgagcggcggcggccgatagcgagtgtc
agggacggccggggccggggctgcggcgggccggccggggcggcgcttctcggcctgtcg
ctggtcggcctcctactgtacctcgtgcctgctgcggctgcgctggcctggctggccgtg
gggactaccgcggcctggtggggactgagccgcgagccccgaggttcgcgccccttgtcc
tccttcgttcagaaggcgcgacatcggcgaacactgttcgcttcgcctccggccaagtcg
acagccaacggaaacctcctagagccgcggaccctgctcgaaggacctgaccctgccgaa
ctgctcctcatgggcagttacctgggcaagcccgggccgccgcagcccgcccccgctccg
gagggccaggacctgcggaataggcctggccgccgcccacccgcccgcccggcgccgcgc
tccacaccgccctcccagccgacccatcgcgttcaccacttttacccctctctccccact
cctcttctccgaccctccgggaggccttccccacgggatcgtgggactttaccagatcgg
tttgtaataacacctcgaagacgctatccgatccatcagacccagtattcctgtccgggg
gtacttcccacagtgtgctggaatggttatcacaagaaggctgtgctgtcccctcgcaac
tccaggatggtgtgtagcccagtgactgtgaggatcgcccctcctgacagaagattttca
cgttctgcgataccagagcagataatcagctcaacactgtcgtcaccatcaagtaatgcc
ccagacccatgtgcaaaggagactgtactgagtgccctcaaagagaagaagaagaaaagg
acagtggaggaagaagaccaaatattccttgatggccaggaaaataaaagaaggcgccat
gatagcagtggcagtggacattcagcatttgagcccctggtggccagtggagtccccgct
tcttttgtgcctaagcctgggtctctgaagagaggcctcaattctcagagctcagatgac
cacttgaataagagatcccgaagctcttccatgagctccttgacaggcgcttacacaagt
ggcatccctagctccagccgcaatgccattaccagttcctacagctccactcgaggcatc
tcacagcccagcctcatcccgctcccagacaccggagaggccagcaaagaaaataaggta
ttcggcattctcctgcagttttcatttgctacgtggacagaagggggtgaggaaagagaa
gaagagctgtgtcatcattccagttcttcaactccattggcagcagacaaggagtcccag
ggagaaaaggctgcagatacaaccccaaggaagaaacaaaactcgaattctcagtctaca
cctggcagctctgggcagcgtaagcggaaagttcagctgctgccttctcggcgaggggaa
cagctgaccttgcctccacctccccagcttggctattcgatcactgccgaggacctagac
ttagagaagaaggcttcattacagtggttcaaccaggccttggaggacaagagtgaatct
gctggagcagcaaccactgaggccctctcacctccaaagacacccagcctcctacccccg
ctgggtttatcacagtcagggccgccagggctgctccccagcccctcctttgactccaaa
cccccgaccactttgctggggctgatccctgctccatccatggtaccagccactgacacc
aaggcacctccaacccttcaggcagagacggctaccaaaccccaagccacatctgccccg
tcccccgcccccaagcaaagcttcctgtttggaacacagaacacctcaccttccagccct
gccgcccctgctgcatcttcagcatctcccatgttcaagcccattttcacggctccaccc
aagagtgagaaggaaggcctcacaccgcctggcccttcagtctcagccacagcgccctcc
agctcctccctccccacgaccaccagcaccacagccccgaccttccagcctgtctttagc
agcatggggccacctgcatctgtgcccttgcctgctcccttcttcaagcagacaactact
cccgccactgctcccaccacaactgccccgctcttcactggcctggccagcgccacctct
gctgtggctcccatcacctctgccagtccatccacagactctgcttcgaagcctgcgttt
ggctttggcataaacagtgtgagcagcagcagtgtgagtaccacgaccagcaccgccact
gccgcctcacagcctttcctcttcggggcgccccaggcctctgctgccagcttcaccccg
gccatgggctccatattccagtttggcaaacctcctgccttgcccacaaccaccacagtc
accaccttcagccagtccctgcccactgccgtgccaacggccaccagcagcagcgctgcc
gactttagtggttttggcagcaccctcgccacctccgccccggccaccagcagccagccc
actctgacgttcagtaacacgagcacccccacgttcaacattccctttggctcaagcgcc
aagtccccgctcccatcatatccgggagccaacccccagcccgcatttggggccgctgag
gggcagccaccgggggccgccaagccagcccttacccccagctttggcagctctttcact
tttggaaactctgcagccccggccccggctactgcacccacacctgcacctgcgtccacg
atcaagatcgtgcctgcgcacgtgcctacgcccatccagcctacctttggcggtgccacg
cactcggcgtttggattgaaagccacggcttccgccttcggcgctcccgccagctcacag
cccgcctttggcggctccactgctgtcttctccttcggtgcagccaccagctccggcttt
ggagccaccacccagaccgccagcagcgggagcagcagctcggtgtttggcagcacaaca
ccatcacccttcacgtttgggggttcggcagcccccgctggcagtgggagctttgggatc
aacgtggccaccccaggctccagcgccaccaccggagctttcagctttggagcaggacag
agtgggagcacagccacctccacccccttcacagggggcttaggtcagaacgccctgggc
accaccggccagagcacaccgtttgccttcaacgtgggcagcacaactgagagcaaacct
gtgtttggaggcaccgccacccccacctttggtcagaacacccctgcgcctggagtgggc
acatcgggcagcagcctctcctttggggcatcttcagcacccgcccaaggctttgttggt
gttggaccgttcggatcggcggccccttcattttccattggtgcgggatccaagacccca
ggggctcgacagcgactgcaggcccgaaggcagcacacccgcaaaaagtag
>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_9|497_aa
MGKVFMTRTPKAIATKAKIDKWDLIKLKSFCIAKETSIQVNRQPTEREKIFAIYPSDKGS
NPGLRYWVLEPKSSGDQKKAMDRTETRFRKRGQITEKITTSRQPQPQNEQSPQRSTSGYP
LQEVVDDEVLGPSAPGVDPSPPCRSLGWKRKREWSDESAEEPEKELAPEPEETWVVEMLC
GLKMKLKQQRVSPILPEHHKGFNSQLAPGVDPSPPHRSFCWKRKMEWWDESEESLEEEPR
KVLAPEPEEIWVAEMLCGLKMKLKRRRVSLVLPEHHEAFNRLLEDPVIKRFLAWDKDLRV
SDKYLLAMVIAYFSRAGFPSWQYQRIHFFLALYLANDMEEDDEDSKQNIFHFLYGKNRSR
IPLLRKRWFQLGRSMNPRARKKRSRIPLLRKRRFQLGRSMNPRARKNRSRIPLLRKRRFQ
LGRSMNLRARKNRSQIVLFQKRRFQFFCSMSGRAWVSPEELEENTGPRGDVDFQQELYSN
ANGRHQEGGEEPFVQII

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_9|1494_bp
atgggcaaagtcttcatgactagaacaccaaaagcaattgcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcatagcaaaagaaactagcatccaagtg
aacaggcagcctacagaacgagagaaaatttttgcaatctacccatctgacaaagggtca
aatccagggctccgttactgggtcctggagcccaagtcctctggtgatcagaagaaggcc
atggacagaacggagactaggttccgtaagaggggacagattacggaaaagatcacgacc
agccgtcaaccgcaaccccagaatgagcagagtccccagcggagcacctcggggtacccc
ctccaggaggtggtggatgatgaagtgttgggaccatcagcccctggggtagatcccagc
cccccatgtaggtcccttggctggaaaaggaagagggagtggtcagatgaatctgcggag
gagccggagaaggagctcgcccctgaacctgaggagacctgggtagtggagatgctgtgt
gggctcaagatgaagctgaagcaacagcgagtgtcacccatcctccctgagcaccacaag
ggcttcaacagtcagcttgcccctggggtagatcccagccccccgcataggtccttttgc
tggaaaaggaagatggagtggtgggacgaatctgaggagtcgttggaggaggagccacgg
aaggtgctcgcccctgagcctgaggagatctgggtggcggagatgctgtgtggcctcaag
atgaagctgaagcgacggcgagtgtcgctcgtgctccctgagcaccacgaggccttcaac
aggctgcttgaggatcctgtcattaaaagattcctggcctgggacaaagatctgagggtg
tcggacaagtatctcctggctatggtcatagcgtatttcagccgggccggcttcccctcc
tggcaataccaacgcattcatttcttcctggctctctacctggccaatgacatggaggag
gacgacgaggactccaaacaaaacatcttccacttcctgtatgggaagaaccgctctcgc
atacccttgctccgtaagcgttggttccagttaggccgttccatgaacccgagggccagg
aagaagcgctctcgcatacccttgctccgtaagcgtcggttccagttaggccgttccatg
aacccgagggccaggaagaaccgctctcgcatacccttgctccgtaagcgtcggttccag
ttaggccgttccatgaacctgagggccaggaagaaccgctctcagatagtcctgttccag
aaacgtcggttccagttcttctgttccatgagcggcagggcttgggtttccccggaggag
ttggaggagaacaccggacccaggggagatgtggattttcagcaggaactttattccaat
gctaatggcagacaccaggaaggaggagaggaaccatttgtgcagatcatctag

>gi568815591r:75324055_75611113|GENSCAN_predicted_peptide_10|1446_aa
MPGLYLVHFRGESAGKQPVLMTYGLMTCLRDCGPTGKEKAGRPVICVQNPRPSPLQARTP
PQYRAGAGSGLGTKEHDLGKYKGICLEVGKCPRVWENKDLVTTAGAGQPEKALWKTVSIN
KAINTQEVAVKEKHARNILLDVAWKTDPEGGQLMGTEAVLLPLQVLKDSLRYRNELSDMS
RMWGHLSEGYGQLCSIYLKLLRTKMEYHTKNPRFPGNLQMSDRQLDEAGESDVNNFFQLT
VEMFDYLECELNLFQTVFNSLDMSRSVSVTAAGQCRLAPLIQVILDCSHLYDYTVKLLFK
LHSCLPADTLQGHRDRFMEQFTKLKDLFYRSSNLQYFKRLIQIPQLPENPPNFLRASALS
EHISPVVVIPAEASSPDSEPVLEKDDLMDMDASQQNLFDNKFDDIFGSSFSSDPFNFNSQ
NGVNKDEKDHLIERLYREISGLKAQLENMKTESQRVVLQLKGHVSELEADLAEQQHLRQQ
AADDCEFLRAELDELRRQREDTEKAQRSLSEIERKAQANEQRYSKLKEKYSELVQNHADL
LRKNAEVTKQVSMARQAQVDLEREKKELEDSLERISDQGQRKTQEQLEVLESLKQELATS
QRELQVLQGSLETSAQSEANWAAEFAELEKERDSLESMCQLAKDQRKMLLVGSRKAAEQV
IQDALNQLEEPPLISCAGSADHLLSTVTSISSCIEQLEKSWSQYLACPEALTEACKQYGR
ETLAYLASLEEEGSLENADSTAMRNCLSKIKAIGEELLPRGLDIKQEELGDLVDKEMAAT
SAAIETATARIEEMLSKSRAGDTGVKLEVNERILGCCTSLMQAIQVLIVASKDLQREIVE
SGRGTASPKEFYAKNSRWTEGLISASKAVGWGATVMVDAADLVVQGRGKFEELMVCSHEI
AASTAQLVAASKVKADKDSPNLAQLQQASRGVNQATAGVVASTISGKSQIEETDNMDFSS
MTLTQIKRQEMDSQVRVLELENELQKERQKLGELRKKHYELAGVAEGWEEGHTPGAESPG
PPNCGSSSDGAAQASRCSISASTLTKCWPTQSMLQGQAELLSDSFPQKAEGEDCTTCSIE
NHHHGPASHRRSKPASCLLPNSFLSQVQTKYKISTTPWCCPARAPREQNSLGEVDRRGPR
EQTRAPATAAPPRPLGSRGAEAAEPQEGLSATVSACFQEQQEMNTLQGPVSFKDVAVDFT
QEEWRQLDPDEKIAYGDVMLENYSHLVSVGYDYHQAKHHHGVEVKEVEQGEEPWIMEGEF
PCQHSPEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKELVENSLDAGATNIDLKLKDYGV
DLIEVSDNGCGVEEENFEGLTLKHHTCKIQEFADLTEVETFGFQGEALSSLCALSDVTIS
TCHASVKVGTRLVFDHDGKIIQETPYPHPRGTTVSVKQLFSTLPVRHKEFQRNIKKSGME
KPPCNW

>gi568815591r:75324055_75611113|GENSCAN_predicted_CDS_10|4341_bp
atgcctggcctctacctggtacactttagaggcgagagtgctgggaagcagccagtcctg
atgacttatggacttatgacctgccttcgcgactgtgggcctacaggaaaggagaaggca
ggaaggcctgtcatctgtgtccagaaccccagacccagtcccctgcaggcccgaacccct
ccacagtacagagcgggtgctggttcagggcttggaaccaaagagcatgaccttgggaag
tacaagggaatttgcttagaagttggaaaatgcccaagagtgtgggaaaacaaagactta
gtgaccaccgccggtgctggccagccggagaaggctctgtggaagactgtcagcatcaat
aaggccattaatacgcaggaagtggctgtaaaggaaaaacacgccagaaatatccttttg
gatgttgcttggaagaccgaccctgagggaggtcagctcatggggactgaggctgttctt
ttgcccctgcaggtcctgaaggactctctgagatacagaaatgaattgagtgacatgagc
aggatgtggggccacctgagcgaggggtatggccagctgtgcagcatctacctgaaactg
ctaagaaccaagatggagtaccacaccaaaaatcccaggttcccaggcaacctgcagatg
agtgaccgccagctggacgaggctggagaaagtgacgtgaacaactttttccagttaaca
gtggagatgtttgactacctggagtgtgaactcaacctcttccaaacagtattcaactcc
ctggacatgtcccgctctgtgtccgtgacggcagcagggcagtgccgcctcgccccgctg
atccaggtcatcttggactgcagccacctttatgactacactgtcaagcttctcttcaaa
ctccactcctgcctcccagctgacaccctgcaaggccaccgggaccgcttcatggagcag
tttacaaagttgaaagatctgttctaccgctccagcaacctgcagtacttcaagcggctc
attcagatcccccagctgcctgagaacccacccaacttcctgcgagcctcagccctgtca
gaacatatcagccctgtggtggtgatccctgcagaggcctcatcccccgacagcgagcca
gtcctagagaaggatgacctcatggacatggatgcctctcagcagaatttatttgacaac
aagtttgatgacatctttggcagttcattcagcagtgatcccttcaatttcaacagtcaa
aatggtgtgaacaaggatgagaaggaccacttaattgagcgactatacagagagatcagt
ggattgaaggcacagctagaaaacatgaagactgagagccagcgggttgtgctgcagctg
aagggccacgtcagcgagctggaagcagatctggccgagcagcagcacctgcggcagcag
gcggccgacgactgtgaattcctgcgggcagaactggacgagctcaggaggcagcgggag
gacaccgagaaggctcagcggagcctgtctgagatagaaaggaaagctcaagccaatgaa
cagcgatatagcaagctaaaggagaagtacagcgagctggttcagaaccacgctgacctg
ctgcggaagaatgcagaggtgaccaaacaggtgtccatggccagacaagcccaggtagat
ttggaacgagagaaaaaagagctggaggattcgttggagcgcatcagtgaccagggccag
cggaagactcaagaacagctggaagttctagagagcttgaagcaggaacttgccacaagc
caacgggagcttcaggttctgcaaggcagcctggaaacttctgcccagtcagaagcaaac
tgggcagccgagttcgccgagctagagaaggagcgggacagcctggaatctatgtgccag
cttgccaaagaccaacgaaaaatgcttctggtggggtccaggaaggctgcggagcaggtg
atacaagacgccctgaaccagcttgaagaacctcctctcatcagctgcgctgggtctgca
gatcacctcctctccacggtcacatccatttccagctgcatcgagcaactggagaaaagc
tggagccagtatctggcctgcccagaagcactgaccgaggcctgtaagcagtatggcagg
gaaaccctcgcctacctggcctccctggaggaagagggaagccttgagaatgccgacagc
acagccatgaggaactgcctgagcaagatcaaggccatcggcgaggagctcctgcccagg
ggactggacatcaagcaggaggagctgggggacctggtggacaaggagatggcggccact
tcagctgctattgaaactgccacggccagaatagaggagatgctcagcaaatcccgagca
ggagacacaggagtcaaattggaggtgaatgaaaggatccttggttgctgtaccagcctc
atgcaagctattcaggtgctcatcgtggcctctaaggacctccagagagagattgtggag
agcggcaggggtacagcatcccctaaagagttttatgccaagaactctcgatggacagaa
ggacttatctcagcctccaaggctgtgggctggggagccactgtcatggtggatgcagct
gatctggtggtacaaggcagagggaaatttgaggagctaatggtgtgttctcatgaaatt
gctgctagcacagcccagcttgtggctgcatccaaggtgaaagctgataaggacagcccc
aacctagcccagctgcagcaggcctctcggggagtgaaccaggccactgccggcgttgtg
gcctcaaccatttccggcaaatcacagatcgaagagacagacaacatggacttctcaagc
atgacgctgacacagatcaaacgccaagagatggattctcaggttagggtgctagagcta
gaaaatgaattgcagaaggagcgtcaaaaactgggagagcttcggaaaaagcactacgag
cttgctggtgttgctgagggctgggaagaagggcacacccctggggctgagtctccaggg
ccccccaactgtggtagctccagcgatggtgctgcccaggcctctcggtgctccatctcc
gcctccacactgaccaagtgctggcccacccagtccatgctccagggtcaggcggagctg
ctgagtgacagctttcctcaaaaagcagaaggagaagactgcaccacctgtagcatagaa
aaccaccaccacggacctgccagccatcggcgcagcaaaccagccagctgtttgctgcca
aattccttcctctcccaggtccagaccaaatacaagatcagcaccacaccgtggtgttgc
ccggcccgggcccctcgggagcagaacagccttggtgaggtggacaggaggggacctcgc
gagcagacgcgcgcgccagcgacagcagccccgccccggcctctcgggagccggggggca
gaggctgcggagccccaggagggtctatcagccacagtctctgcatgtttccaagagcaa
caggaaatgaacacattgcaggggccagtgtcattcaaagatgtggctgtggatttcacc
caggaggagtggcggcaactggaccctgatgagaagatagcatacggggatgtgatgttg
gagaactacagccatctagtttctgtggggtatgattatcaccaagccaaacatcatcat
ggagtggaggtgaaggaagtggagcagggagaggagccgtggataatggaaggtgaattt
ccatgtcaacatagtccagaacctgctaaggccatcaaacctattgatcggaagtcagtc
catcagatttgctctgggccagtggtactgagtctaagcactgcagtgaaggagttagta
gaaaacagtctggatgctggtgccactaatattgatctaaagcttaaggactatggagtg
gatctcattgaagtttcagacaatggatgtggggtagaagaagaaaactttgaaggctta
actctgaaacatcacacatgtaagattcaagagtttgccgacctaactgaagttgaaact
ttcggttttcagggggaagctctgagctcactgtgtgcactgagcgatgtcaccatttct
acctgccacgcgtcggtgaaggttgggactcgactggtgtttgatcacgatgggaaaatc
atccaggaaaccccctacccccaccccagagggaccacagtcagcgtgaagcagttattt
tctacgctacctgtgcgccataaggaatttcaaaggaatattaagaagtctggcatggaa
aagcccccgtgcaactggtaa