GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:55:08 Sequence gi568815578r:33607709_33820207 : 212499 bp : 49.31% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3445 3627 183 0 0 77 103 188 0.654 19.18 1.02 Intr + 15407 15588 182 1 2 74 83 202 0.915 16.97 1.03 Intr + 17056 17309 254 1 2 99 99 110 0.907 10.28 1.04 Intr + 20642 20727 86 0 2 57 66 71 0.956 1.34 1.05 Intr + 22011 22206 196 2 1 79 91 93 0.969 7.69 1.06 Intr + 28932 28961 30 1 0 124 92 11 0.652 3.20 1.07 Intr + 32633 32823 191 0 2 83 55 280 0.949 23.50 1.08 Term + 36639 36938 300 2 0 101 52 236 0.856 16.52 1.09 PlyA + 37527 37532 6 1.05 2.13 PlyA - 39567 39562 6 1.05 2.12 Term - 40527 40513 15 0 0 78 49 0 0.210 -6.66 2.11 Intr - 41745 41674 72 0 0 56 66 105 0.830 4.70 2.10 Intr - 42548 42390 159 2 0 96 72 97 0.943 9.08 2.09 Intr - 46842 46802 41 1 2 62 109 29 0.781 0.34 2.08 Intr - 50325 50234 92 2 2 45 85 204 0.997 15.44 2.07 Intr - 50846 50769 78 1 0 87 86 78 0.986 6.17 2.06 Intr - 51126 51014 113 0 2 82 73 143 0.681 11.38 2.05 Intr - 51842 51789 54 2 0 93 85 62 0.924 5.58 2.04 Intr - 52024 51945 80 2 2 78 100 133 0.944 12.77 2.03 Intr - 52295 52177 119 1 2 110 85 82 0.999 10.21 2.02 Intr - 52687 52551 137 1 2 110 80 213 0.988 22.07 2.01 Init - 52955 52911 45 2 0 95 76 -15 0.760 -1.08 2.00 Prom - 53191 53152 40 -15.35 3.00 Prom + 53518 53557 40 -7.36 3.01 Init + 54638 54763 126 1 0 72 40 160 0.845 9.76 3.02 Intr + 55824 56131 308 2 2 99 7 368 0.897 25.05 3.03 Intr + 58793 58811 19 2 1 97 94 11 0.723 -0.79 3.04 Term + 59289 60527 1239 2 0 2 47 1796 0.817 158.50 3.05 PlyA + 60785 60790 6 1.05 4.07 PlyA - 61127 61122 6 -0.45 4.06 Term - 61399 61388 12 1 0 74 35 11 0.466 -7.40 4.05 Intr - 61764 61667 98 1 2 93 105 133 0.715 15.23 4.04 Intr - 62004 61979 26 2 2 102 89 25 0.995 1.67 4.03 Intr - 63084 62976 109 1 1 114 72 122 0.998 12.54 4.02 Intr - 64714 64690 25 1 1 127 101 36 0.992 6.40 4.01 Init - 66644 66516 129 2 0 121 86 205 0.964 21.75 4.00 Prom - 67619 67580 40 -13.87 5.09 PlyA - 67995 67990 6 1.05 5.08 Term - 69271 69024 248 2 2 110 48 471 0.999 41.25 5.07 Intr - 69622 69397 226 1 1 89 98 81 0.916 6.76 5.06 Intr - 69810 69718 93 0 0 43 78 86 0.656 3.26 5.05 Intr - 70645 70471 175 0 1 111 37 275 0.995 24.64 5.04 Intr - 72266 72047 220 0 1 70 37 458 0.997 36.26 5.03 Intr - 72708 72618 91 0 1 122 77 60 0.987 7.87 5.02 Intr - 78071 77946 126 2 0 85 42 68 0.712 2.78 5.01 Init - 78556 78296 261 1 0 94 80 484 0.998 43.26 5.00 Prom - 81351 81312 40 -3.76 6.05 PlyA - 82159 82154 6 1.05 6.04 Term - 100261 99998 264 1 0 103 38 314 0.992 23.41 6.03 Intr - 103045 102847 199 0 1 29 103 235 0.942 18.45 6.02 Intr - 107028 106966 63 2 0 106 86 98 0.863 9.23 6.01 Init - 112499 112387 113 2 2 72 56 213 0.500 16.32 6.00 Prom - 112934 112895 40 -4.86 7.03 PlyA - 114424 114419 6 1.05 7.02 Term - 124420 124019 402 1 0 77 35 303 0.990 19.05 7.01 Init - 132357 132355 3 0 0 113 81 0 0.138 1.80 7.00 Prom - 135087 135048 40 -6.86 8.00 Prom + 135216 135255 40 -5.86 8.01 Init + 137385 137591 207 2 0 63 68 271 0.617 21.42 8.02 Intr + 141215 141364 150 1 0 93 110 191 0.998 22.16 8.03 Intr + 145464 145715 252 2 0 78 83 138 0.438 9.93 8.04 Intr + 149440 149635 196 0 1 113 86 169 0.930 18.19 8.05 Intr + 151008 151111 104 1 2 115 65 116 0.578 11.89 8.06 Intr + 154125 154347 223 2 1 107 94 255 0.968 25.70 8.07 Intr + 159143 159333 191 0 2 135 96 233 0.967 28.10 8.08 Intr + 162376 162584 209 0 2 87 108 373 0.531 37.08 8.09 Intr + 173583 173679 97 0 1 110 98 103 0.999 13.51 8.10 Intr + 173782 173898 117 0 0 122 43 59 0.978 5.46 8.11 Intr + 176024 176160 137 1 2 119 48 126 0.714 11.07 8.12 Intr + 180240 180340 101 0 2 110 -3 124 0.788 5.25 8.13 Intr + 181155 181266 112 1 1 111 80 150 0.997 15.94 8.14 Intr + 181810 181880 71 1 2 70 119 70 0.999 7.13 8.15 Term + 183280 183809 530 2 2 111 49 591 0.999 51.92 8.16 PlyA + 184538 184543 6 1.05 9.00 Prom + 187265 187304 40 -1.66 9.01 Init + 203761 203950 190 0 1 98 77 424 0.825 41.47 9.02 Intr + 203996 204189 194 0 2 90 32 59 0.394 -0.29 9.03 Term + 204638 204718 81 1 0 98 44 45 0.521 -1.21 9.04 PlyA + 205267 205272 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 190656 190767 112 2 1 56 95 103 0.811 8.17 S.002 Sngl - 201038 200856 183 2 0 77 43 165 0.953 5.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_1|473_aa RFSNGPASSTSSALTNQQLPATCGARQLSKLKRFLTTLQQFGNDISPEIGEKVRTLVLAL VANLPLLQRELLHCARAAKQTPSQYLAQHEHLLLNTSIASPADSSELLMEVHGNGKRPSP ERREENSFDRDTIAPEPPAKRVCTISPAPRHSPALTVPLMNPGGQFHPTPPPLQHYTLED IATSHLYREPNKMLEHREVRDRHHSLGLNGGYQDELVDHRLTEREWADEWKHLDHALNCI MEMVEKTRRSMAVLRRCQESDREELNYWKRRYNENTELRKTGTELVSRQHSPGSADSLSN DSQREFNSRPEEAVNKVKIQAMSEVQKAVAEAEQKAFEVIATERARMEQTIADVKRQAAE DAFLVINEQEESTENCWNCGRKASETCSGCNIARYCGSFCQHKDWERHHRLCGQNLHGQS PHGQGRPLLPVGRGSSARSADCSVPSPALDKTSATTSRSSTPASVTAIDTNGL >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_1|1422_bp agattcagcaatggtcctgcctcctccacatcatctgcactcacaaatcagcaattgcca gccacttgtggtgctcgacaactcagcaagttgaaacgctttcttaccactctgcaacag tttggcaatgacatctcccctgagattggggagaaggtgcggactcttgttcttgcactg gtggccaacctgcccctgctgcagcgggaactgctgcactgcgctcgggcggccaagcag accccatcccagtacctggctcagcacgaacaccttctgctcaacacaagcattgcatcg cctgctgactcgtcagagttgctcatggaggtgcacggaaatgggaagaggcccagtcca gagaggagagaagagaatagttttgatagagacacaattgctcctgagcctcctgccaag agagtatgtaccatcagccctgctcctcggcacagtcctgctctcactgtgcccctcatg aatcccgggggccaattccatcctacccctccacctcttcagcattacaccttagaggat attgcaacttctcacctgtatcgggaacccaacaagatgctagagcatcgagaagttcgt gatagacaccacagtcttggtctaaatggaggctatcaagatgagttggtagatcatcgt ttgacagaaagggaatgggctgatgaatggaaacatcttgaccatgcgctgaattgcatt atggaaatggtagagaaaacaaggcgctctatggcagttctgcggcgctgtcaggaatca gatcgtgaagaactcaactactggaaaagacggtacaatgaaaacacagagctgaggaaa acggggaccgagttggtctccaggcagcacagccctgggagtgcagattctctcagcaat gattctcagagagagttcaacagcaggccagaagaagctgtgaataaggtgaaaattcag gccatgtcagaagtacagaaggccgtcgctgaggcagagcagaaagcctttgaagtgatt gcaacagagagagcacgaatggagcaaaccatagcggatgtcaagcggcaggccgcagag gatgctttcctcgtcatcaatgagcaagaggagtccacggagaactgctggaactgtggc cgcaaagccagcgagacatgcagtggctgcaatatcgcgcgatactgtggctctttctgc cagcacaaggactgggagcggcaccaccgcctctgtggtcagaacctgcatggccagagc ccccacggccagggccggccgctgcttcctgtaggcaggggctcctctgccaggtccgcc gactgcagcgtgcccagcccagccctcgacaagacctcggcaaccacatcgcgttcctca acacctgcttctgtgacagctatcgacaccaacggactctga >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_2|334_aa MRSREARHFAGQMPPEYERASKVDQFVTRFLLRETVSQLQALQSSLEGASDTLEAQAHGW RSDAESVEAQSRLCGSRRAGRRALRSVSRSSTWSPGSSDTGRSSEAEMQWRLQVNRLQEL IDQLECKAPRLEPLREEDLAKGPDLHILMAQRQVQVAEEGLQDFHRALRCYVDFTGAQSH CLHVSAQKMLDGASFTLYEFWQDEASWRRHQQSPGSKAFQRILIDHLRAPDTLTTVFFPA TRPPQKRFHCKEQGTDQLVQLAFQRGQFQAHGVQTPPASCPWMLFRPRLEPVEAPLSLFL KDLGIRLTMEKVTCTIIFGNEHVDARGKRQFLGF >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_2|1005_bp atgaggtccagagaggccaggcacttcgctgggcagatgccccctgagtacgagagggcc tccaaagtggaccagtttgtgacgcgcttcctgctgcgggagacggtgagccagctgcaa gcccttcagagctcgctggagggggcgtcagataccctggaggcccaggcccatggctgg cggtcagatgcagagagcgtggaggcgcagagcaggctctgcggcagccggcgggcagga cgccgagccctgaggagtgtcagccggtcatccacctggtcccccggctcttctgacaca gggcgcagctcagaggccgagatgcagtggcggctccaggtgaaccgcctccaggagctc atcgaccagctcgagtgcaaggccccccggctggaacccctgcgtgaagaggacctggcc aaggggcctgacttgcacatcctcatggcccagaggcaggtccaggtggcagaggaaggc ctgcaggacttccaccgagccctgcgctgctatgtggacttcacaggggcccagagccat tgtctgcatgtgtccgcccagaagatgctggacggtgcctccttcaccctgtatgagttc tggcaggatgaggcctcctggagaaggcaccagcagtcgcctggcagcaaggccttccag cgcatcctcatcgaccacctgcgggccccggacaccctcaccactgtgttcttcccagcc acccgtccgccacagaaacgcttccactgtaaggagcagggcactgaccagttggtgcag ctggctttccagagaggccagttccaggcacatggggtgcagacaccccctgcttcatgc ccttggatgctcttcaggccacgactagaaccagttgaggcccctctctccctcttcctc aaggatttgggaattaggctcacgatggaaaaggtcacctgcaccatcatctttggaaat gagcacgtcgatgcccgaggaaagaggcagtttctaggtttctga >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_3|563_aa MGNYSSHKRTKAPKQARKERPADMDKAWWKSFLNHLTRKKPATRIVLILPLDKRQPLANA GQRIDYASGAGLGSPAAPRLRGAGEGSEREPRMPVLLLLRRQEARRPEEGGARAALSWPR LLSRFRSPGKAPREAGPAEEQPRKRNFRQAQDRGREVSGRRSRVARRSGAFSPAAGPSPK VASVSGSRRVHRPSSLGRIAVVVDQGSGFTKAGFAGENQPRIVLKSSSLVPSWDRPVLPG APGCELAGGVARAHPIKHGVVADWEALEGLWERLLVGGLRVCPEQWPVLVSDSPLAPPAG RERVAELLFETLAVPACHMASTALLALCSTGAFSGLAVEAGAGVCHATPIYAGHSWHQAT FRLNVAGSTLSRYLRDLLVAANPDLLQQALPRKAITHLKKRSCYVSLDFEGDLRDPARHH PASFSVGNGCCVCLSSERFRCPEPIFQPGLLGQAEQGLPALAFRALQKMPKTLRTRLADT VVLAGGSTLFPGFAERLDKELEAQCRRHGYAALRPHLVAKHGRGMAVWTGGSMVASLHSF QRRWITRAMYQECGSRLLYDVFN >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_3|1692_bp atgggaaactatagttcccacaaaaggaccaaagcacccaagcaggcccgcaaggagagg ccggctgacatggacaaggcctggtggaaatcgttcctcaaccacctcactcggaagaag ccggctaccaggatcgtgctgattctccccctggacaagcggcagccgctggccaacgct gggcaacggattgactacgcgtccggcgctgggctgggctccccggcggcacccagattg cgcggagcgggcgaaggtagcgagcgcgagccgaggatgccggtactgctgctgctgcgg cggcaagaggcgcggcggccggaagagggcggggccagggcagctctgagctggccgcgg ctgctctcgcgcttccggtccccggggaaggctccccgcgaagccggccccgccgaggag cagccgcgcaaacggaacttccgtcaagcccaggacagaggccgggaggtatcgggccga cgaagccgagtggcgcggaggagcggagccttcagccccgcggctgggccgagcccgaag gtggcgtcggtgtcggggagccgccgcgtgcaccggccgtcctccctgggccgcatcgcg gtggtggtggaccagggctcgggcttcaccaaggcgggcttcgcgggcgagaaccagccg cgcatagtgctgaagagctctagcttggtgcccagctgggaccggccggtgctgcccgga gcgccgggctgcgagctggcgggcggcgtggcgcgggcgcaccccatcaagcacggcgtg gtggcggactgggaggcgctggaagggctgtgggagcgcctgctggtgggcggcctgcgg gtgtgcccagagcagtggcccgtgctggtgagcgactcgccgttggcgccgcccgcgggc cgcgaaagggtggcggagctgctgttcgagaccctggcagtgcccgcgtgccacatggct agcaccgcgttgctggcgctctgctccaccggcgcgttcagcgggctggccgtggaggcg ggcgcgggcgtgtgccacgccacgcccatctacgcgggtcactcgtggcaccaggccacc ttccgactgaacgtggcaggcagcaccctgtcgcgctacctgcgggatctgctggtggcg gcgaaccctgacctcttgcagcaggccctgccccgcaaggccatcacacatctcaagaag cgcagctgctacgtgtccctggacttcgagggcgacctccgcgaccccgcccgccaccat ccggccagtttcagcgtgggtaacgggtgctgcgtctgcctcagcagtgagcgcttccgc tgccccgaacccatcttccagccgggcctgctgggccaggctgagcaggggctgcccgcg ctggccttccgggcgctgcagaagatgcccaaaacgctgcggacacgcctggcagacacc gtggtgctagccggcggctccacactgtttcctggcttcgccgagcgcctggacaaggag ctggaggcgcagtgccggcggcacggctacgcggccctgcggccccacctggtggccaag catgggcgtggcatggctgtgtggaccggcggctccatggtggcctccctgcactccttc cagcgccgctggataactcgggccatgtaccaggagtgtggctccaggctgctgtacgat gtgttcaactga >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_4|132_aa MACAGLLTVCLLRPPAPQPQPQTPRHPQLAPDPGPAGHTLFQDVFRRADKNDDGKLSFEE FQNYFADGVLSLGELQELFSGIDGHLTDNLETEKLCDYFSEHLGVYRPVLAALESLNRAV LAAMDATKLHPE >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_4|399_bp atggcgtgcgcggggctgctcaccgtgtgcctgctccggccgcccgcgccccagccccag ccccagaccccgcggcacccccagctcgcgcccgacccggggcccgccggacacacgctc ttccaggacgttttccgcagagcagacaagaatgatgatgggaagctctcatttgaggaa ttccagaattactttgccgatggggttctcagcctgggggagctgcaggaactgttcagc ggcattgatgggcatctcaccgacaatttagaaacagaaaaactgtgtgactacttctca gagcacctgggtgtctaccggccggtgctggctgcattggaatcgctgaaccgtgcagtg ctcgctgccatggatgccaccaagctgcacccagaataa >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_5|479_aa MALAGAPAGGPCAPALEALLGAGALRLLDSSQIVIISAAQDASAPPAPTGPAAPAAGPCD PDLLLFATPQAPRPTPSAPRPALGRPPLPHRSLPSVGLSDNNSPRVFNWRLLHARPLGDF KPAKTHNPAVKRRLDLETDHQYLAESSGPARGRGRHPGKGVKSPGEKSRYETSLNLTTKR FLELLSHSADGVVDLNWAAEVLKVQKRRIYDITNVLEGIQLIAKKSKNHIQWLGSHTTVG VGGRLEGLTQDLRQLQESEQQLDHLMNICTTQLRLLSEDTDSQRYPWIGRRDLRSIADPA EQMVMVIKAPPETQLQAVDSSENFQISLKSKQGPIDVFLCPEETVGGISPGKTPSQEVTS EEENRATDSATIVSPPPSSPPSSLTTDPSQSLLSLEQEPLLSRMGSLRAPVDEDRLSPLV AADSLLEHVREDFSGLLPEEFISLSPPHEALDYHFGLEEGEGIRDLFDCDFGDLTPLDF >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_5|1440_bp atggccttggccggggcccctgcgggcggcccatgcgcgccggcgctggaggccctgctc ggggccggcgcgctgcggctgctcgactcctcgcagatcgtcatcatctccgccgcgcag gacgccagcgccccgccggctcccaccggccccgcggcgcccgccgccggcccctgcgac cctgacctgctgctcttcgccacaccgcaggcgccccggcccacacccagtgcgccgcgg cccgcgctcggccgcccgccgcttcctcaccggagcctccccagcgtcggactcagtgat aataatagcccacgtgtattcaactggcgtttactgcatgccaggcctttgggcgacttc aagccagctaaaactcacaaccccgctgtgaagcggaggctggacctggaaactgaccat cagtacctggccgagagcagtgggccagctcggggcagaggccgccatccaggaaaaggt gtgaaatccccgggggagaagtcacgctatgagacctcactgaatctgaccaccaagcgc ttcctggagctgctgagccactcggctgacggtgtcgtcgacctgaactgggctgccgag gtgctgaaggtgcagaagcggcgcatctatgacatcaccaacgtccttgagggcatccag ctcattgccaagaagtccaagaaccacatccagtggctgggcagccacaccacagtgggc gtcggcggacggcttgaggggttgacccaggacctccgacagctgcaggagagcgagcag cagctggaccacctgatgaatatctgtactacgcagctgcgcctgctctccgaggacact gacagccagcgatatccttggattggccgtagggaccttcgtagcattgcagaccctgca gagcagatggttatggtgatcaaagcccctcctgagacccagctccaagccgtggactct tcggagaactttcagatctcccttaagagcaaacaaggcccgatcgatgttttcctgtgc cctgaggagaccgtaggtgggatcagccctgggaagaccccatcccaggaggtcacttct gaggaggagaacagggccactgactctgccaccatagtgtcaccaccaccatcatctccc ccctcatccctcaccacagatcccagccagtctctactcagcctggagcaagaaccgctg ttgtcccggatgggcagcctgcgggctcccgtggacgaggaccgcctgtccccgctggtg gcggccgactcgctcctggagcatgtgcgggaggacttctccggcctcctccctgaggag ttcatcagcctttccccaccccacgaggccctcgactaccacttcggcctcgaggagggc gagggcatcagagacctcttcgactgtgactttggggacctcacccccctggatttctga >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_6|212_aa MAAPPQLRALLVVVNALLRKRRYHAALAVLKGFRNGAVYGAKIRAPHALVMTFLFRNGSL QEKLWAILQATYIHSWNLARFVFTYKGLRALQSYIQGKTYPAHAFLAAFLGGILVFGENN NINSQINMYLLSRVLFALSRLAVEKGYIPEPRWDPFPLLTAVVWGLVLWLFEYHRSTLQP SLQSSMTYLYEDSNVWHDISDFLVYNKSRPSN >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_6|639_bp atggcagccccgccgcagctaagggctctgctcgtagtcgtcaacgcactgctgcgcaag cgccgctaccacgctgcgttggccgtgcttaagggcttccggaacggggctgtctatgga gccaaaatccgggcccctcacgcgctggtcatgacctttctcttccggaatggcagcctc caggagaagctgtgggccatactgcaggccacatatatccactcctggaacctggcacgg tttgtgttcacctacaagggtctccgtgccctgcagtcctacatacaaggcaagacctac ccagcacacgcattcctggcggccttcctcgggggtatcctggtgtttggagaaaacaat aacatcaacagccagatcaacatgtacctgttgtcacgcgtcctgtttgccctgagccgc ctggctgtagagaagggctacatccctgaacccaggtgggacccgttcccgctgctcact gcggtggtgtgggggctggtgctgtggctctttgagtatcaccgatccaccctgcagccc tcgctgcagtcctccatgacctacctctatgaggacagcaatgtatggcacgacatctca gacttcctcgtctataacaagagccgtccctccaattaa >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_7|134_aa MAGRCEGAAGGGAARNSRLRRPRRRSPSRASKIACAILEPPPSPPQEPKPGYASIVGAVA VESGPPGGAQEPPPSPQPWKPNLGSRDPSISGLPWLHSITQSPGHRVWTPPHLNDQVYTD LISAAHAELPIPII >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_7|405_bp atggccgggcgctgcgagggcgcggcgggagggggcgcagcgcggaacagccgcctccgc cggccccgccgccgctcaccctccagggcctcaaagatcgcctgcgccatcttggagccg ccgccgtcgccgccacaggaaccgaagcccggctacgcgagcattgtgggagccgtggcg gttgagtcggggccgccagggggcgcccaggagccgccgccgagcccccagccctggaaa cccaacctaggctcgcgggaccccagcatctctgggctgccctggctccactcgataacc cagtcaccgggccaccgagtgtggaccccgcctcacttgaacgaccaagtctacaccgac cttatatccgccgcacacgcggaacttcccatccctattatctaa >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_8|898_aa MTPDDEDVFLCGKCKKQFNSLPAFMTHKREQCQGNAPALATVSLATNSIYPPSAAPTAVQ QAPTPANRQISTYITVPPSPLIQTLVQGNILVSDDVLMSAMSAFTSLDQPMPQGPPPVQS SLNMHSVPSYLTQPPPPPPPPPPLPPPPPPQPPPPPPQSLGPPGRPNPGGNGVVEVYSAA APLAGSGTVEIQALGMQPYPPLEVPNQCVEPPVYPTPTVYSPGKQGFKPKGPNPAAPMTS ATGGTVATFDSPATLKTRRAKGARGLPEAAGKPKAQKLKCSYCDKSFTKNFDLQQHIRRY TCVEGTDKLVFHSHTGEKPFQCIACGRAFAQKSNVKKHMQTHKVWPPGHSGGTVSRNSVT VQVMALNPSRQEDEESTGLGQPLPGAPQPQALSTAGEEEGDKPESKQVVLIDSSYLCQFC PSKFSTYFQLKSHMTQHKNEQVYKCVVKSCAQTFPKLDTFLEHIKSHQEELSYRCHLCGK DFPSLYDLGVHQYSHSLLPQHSPKKDNAVYKCVKCVNKYSTPEALEHHLQTATHNFPCPH CQKDTQAGLRGCSTDDTVSLPGHLIHHIPDVAMSGLLLEPGWVFPCERYLRRHLPTHGSG GRFKCQVCKKFFRREHYLKLHAHIHSGSHSQSSGKGCRLCLPNMAVLGHFLVILTSTLIR AGEKPYKCSVCESAFNRKDKLKRHMLIHEPFKKYKCPFSTHTGCSKEFNRPDKLKAHILS HSGMKLHKCALCSKSFSRRAHLAEHQRAHTGNYKFRCAGCAKGFSRHKYLKDHRCRLGPQ KDKDLQTRRPPQRRAAPRSCGSGGRKVLTPLPDPLGLEELKDTGAGLVPEAVPGKPPFAE PDAVLSIVVGGAVGAETELVVPGHAEGLGSNLALAELQAGAEGPCAMLAVPVYIQASE >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_8|2697_bp atgaccccagatgacgaggatgtatttctctgcgggaagtgtaagaagcaattcaactcg ctgccagcgtttatgacccacaagcgggaacagtgccaggggaatgcccccgccctggcc acagtctcactggccaccaacagcatctacccaccttcggcagcacccacagcggtccag caggccccaactcctgccaatcgccagatctccacatacatcacagtgcccccgtcccca ctgatccagaccctggtgcaggggaacatcttggtgagcgatgatgtgctcatgtctgcc atgtcagccttcacatccctggaccagcccatgccccagggccccccacctgtgcagagc agcctgaacatgcattccgtgcccagctacctcacccagcctccacctcctcctccacct cctccaccactgcccccaccgccaccacctcagcctccaccacctccaccccagagcctg ggcccccctgggcgtcccaaccctggtgggaacggtgtggtggaggtgtacagtgctgct gcgcccctggctgggagtggaacggtggagatccaggcactggggatgcagccctaccca cccctagaggtgccaaaccagtgtgtggagcctccagtatatcccacccccacagtgtac agccctggcaaacagggattcaaacccaaaggaccaaaccccgccgcccccatgaccagc gccaccgggggcacggtggccacctttgactctccagcaacgctgaagacccgacgagct aaaggtgccaggggactcccggaagctgcagggaagccaaaggctcagaaactcaagtgc tcatactgtgacaagtcattcaccaaaaactttgacctgcagcagcacatccgaaggtac acatgcgtggagggcactgacaagcttgtcttccacagccacaccggtgagaagcccttc cagtgcattgcatgtggccgtgcctttgcccagaagtctaatgttaagaaacacatgcag acccacaaggtgtggcctccaggacacagtggtggcaccgtgtctcgaaactctgtgacc gtacaggtcatggccctgaaccccagcaggcaggaggacgaggaaagcacagggttgggc cagcccctgccgggtgcgccacagccccaggccttgtccacagctggtgaggaagagggg gacaagccggagtccaagcaggtggtcctcatcgacagctcctacctgtgccaattctgc cccagcaaattcagcacctacttccagctcaagtctcacatgacccagcataagaatgag caggtgtacaagtgtgtggtcaaaagctgtgcccagacgttcccaaagctcgacacattt ctggagcacatcaagagccaccaggaggagctgagctaccgctgccacctctgcggcaag gacttcccctcgctgtacgacctgggcgtgcaccagtactcccacagcctcctgccacag cacagccccaagaaggacaatgccgtctacaagtgtgtcaaatgtgtcaacaaatactcc acccctgaggccctggagcaccacctgcagaccgccactcacaacttcccctgcccacac tgccagaaggacacgcaggcagggctcaggggctgctccacagatgacacagtgtctctg cctggccatctcatccatcacattccagatgtggccatgtcggggctgctgctggagcca ggctgggtgtttccttgtgaacgctacctgcggcgtcatctgcccacccacggcagcggg ggcaggttcaagtgccaagtgtgcaagaagttcttccggcgggagcattatctcaaactg catgctcacatccactcgggtagtcacagccagtccagtggcaagggctgccgactgtgc cttcccaacatggctgtcctgggccacttcctggttatcctcacctccaccctcattcga gctggtgagaagccctacaaatgctcagtgtgcgagtctgcgttcaaccgcaaggacaaa ctgaagagacacatgttgatccacgagcccttcaagaaatacaaatgccctttctcgacg cacacaggctgcagtaaggagttcaaccggccggacaagctgaaggcccacatcctctcc cactctggcatgaagctccacaaatgcgccctgtgcagcaagtccttcagccgccgtgcc cacctcgccgagcatcagcgcgcccacacgggcaactacaagttccgctgtgctggctgc gccaagggcttttcccgccacaaatacctcaaagatcaccgctgtcgtctcggcccccaa aaggacaaggacctgcaaacccggcggcccccccagaggagggcagccccccgcagttgc ggcagtggtgggcgcaaggtgctgacccccttgcctgacccgctggggctggaggagctg aaggacacaggggctgggctggtgcccgaggctgtccccggcaagccgcccttcgcagag ccggacgcggtgctgtccatcgttgtgggtggtgcggtgggcgcggaaactgagctggtg gtacctggacacgctgaggggctgggctccaacctggctctggcggagctgcaggctggg gccgagggcccatgtgccatgctcgctgtgcccgtctacatccaggcctccgagtga >gi568815578r:33607709_33820207|GENSCAN_predicted_peptide_9|154_aa MSVFGKLFGAGGGKAGKGGPTPQEAIQRLRDTEEMLSKKQEFLEKKIEQELTAAKKHGTK NKRGSPGPDFTFIRLASGSGLTLELSGSDFLPGLGPPPRLPSMSPRSLDFSIVRLILPSP LSLSPPRQLDLTKVSRGASLHPMTHRFPIRLNQK >gi568815578r:33607709_33820207|GENSCAN_predicted_CDS_9|465_bp atgtcggtgttcgggaagctgttcggggctggagggggtaaggccggcaagggcggcccg accccccaggaggccatccagcggctgcgggacacggaagagatgttaagcaagaaacag gagttcctggagaagaaaatcgagcaggagctgacggccgccaagaagcacggcaccaaa aacaagcgcggctccccgggcccggacttcaccttcatcagactcgcctcggggtctggt ctgaccctggaactctccgggtcagacttcttgccgggtctgggaccccctccccgactc ccttcaatgtctccccgatccctggacttttccatcgtcagacttattctgccctctccc ctcagcctctctccaccccgccagcttgaccttaccaaagtttcccgtggcgcaagcctt caccccatgactcaccgcttccccatccgtctgaaccagaaatag