GENSCAN 1.0 Date run: 3-Nov-116 Time: 04:56:12 Sequence gi568815579r:47330092_47566403 : 236312 bp : 52.38% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 10709 11722 1014 1 0 56 44 1062 0.999 96.10 1.02 PlyA + 12269 12274 6 1.05 2.00 Prom + 14086 14125 40 -6.01 2.01 Init + 22940 23644 705 1 0 69 109 842 0.928 79.44 2.02 Intr + 24948 25259 312 2 0 133 113 427 0.796 46.63 2.03 Intr + 27775 28029 255 0 0 77 64 407 0.946 35.37 2.04 Intr + 29877 29979 103 2 1 129 107 150 0.999 21.25 2.05 Intr + 32385 32602 218 1 2 86 94 293 0.999 28.45 2.06 Intr + 36890 37064 175 1 1 76 97 238 0.652 23.53 2.07 Intr + 42639 42832 194 1 2 89 94 336 0.846 34.03 2.08 Intr + 43508 43609 102 1 0 111 100 154 0.999 19.77 2.09 Intr + 45375 45656 282 2 0 110 86 392 0.723 39.36 2.10 Intr + 45833 46006 174 1 0 140 75 342 0.999 38.75 2.11 Intr + 46352 46469 118 1 1 118 77 255 0.996 27.94 2.12 Intr + 47009 47115 107 0 2 87 80 223 0.994 21.83 2.13 Intr + 49619 49894 276 1 0 119 75 447 0.828 44.75 2.14 Intr + 50725 50901 177 0 0 116 115 133 0.964 19.43 2.15 Intr + 51095 51233 139 1 1 39 115 194 0.989 17.74 2.16 Intr + 51889 52016 128 2 2 112 4 142 0.113 9.10 2.17 Intr + 55861 55921 61 0 1 117 77 72 0.137 7.80 2.18 Intr + 64784 65058 275 0 2 17 38 166 0.018 2.30 2.19 Intr + 65418 65504 87 2 0 43 115 64 0.944 5.26 2.20 Term + 66243 66437 195 2 0 46 33 118 0.641 -0.17 2.21 PlyA + 68603 68608 6 1.05 3.15 PlyA - 68618 68613 6 1.05 3.14 Term - 69143 69121 23 0 2 125 32 5 0.140 -2.64 3.13 Intr - 76880 76797 84 0 0 98 102 92 0.984 11.89 3.12 Intr - 77046 76988 59 2 2 80 92 88 0.999 7.42 3.11 Intr - 77337 77261 77 0 2 34 105 154 0.900 10.51 3.10 Intr - 79156 79008 149 2 2 63 89 176 0.855 15.66 3.09 Intr - 79456 79345 112 1 1 106 100 19 0.985 5.36 3.08 Intr - 84489 84304 186 0 0 14 45 152 0.620 3.90 3.07 Intr - 84775 84626 150 1 0 102 80 309 0.977 32.37 3.06 Intr - 85010 84960 51 2 0 106 105 105 0.999 13.59 3.05 Intr - 86611 86561 51 1 0 123 85 46 0.994 7.39 3.04 Intr - 86872 86713 160 0 1 137 80 172 0.922 21.80 3.03 Intr - 87259 87087 173 1 2 88 92 270 0.999 26.66 3.02 Intr - 89508 89341 168 0 0 78 82 17 0.512 0.76 3.01 Init - 92999 92997 3 2 0 76 81 0 0.302 -1.97 3.00 Prom - 93208 93169 40 -1.51 4.12 PlyA - 94211 94206 6 1.05 4.11 Term - 100374 99998 377 1 2 110 49 917 0.999 85.26 4.10 Intr - 102354 102076 279 1 0 55 52 550 0.410 46.09 4.09 Intr - 107470 107371 100 1 1 113 111 58 0.892 10.78 4.08 Intr - 107882 107758 125 0 2 67 55 179 0.689 13.31 4.07 Intr - 111095 111078 18 0 0 116 113 -13 0.649 1.06 4.06 Intr - 111349 111246 104 0 2 83 94 120 0.951 12.42 4.05 Intr - 118293 117718 576 2 0 66 58 1196 0.053 106.84 4.04 Intr - 119644 119581 64 2 1 107 71 67 0.597 5.17 4.03 Intr - 124989 124955 35 2 2 124 56 -16 0.740 -2.85 4.02 Intr - 127503 126839 665 0 2 101 94 1723 0.975 165.39 4.01 Init - 136312 135638 675 1 0 82 99 1323 0.719 126.06 4.00 Prom - 138220 138181 40 -3.51 5.13 PlyA - 138378 138373 6 1.05 5.12 Term - 145453 145325 129 1 0 122 47 36 0.496 1.39 5.11 Intr - 146623 146441 183 1 0 124 68 193 0.953 21.40 5.10 Intr - 146847 146712 136 2 1 42 75 222 0.945 17.38 5.09 Intr - 147690 147615 76 1 1 69 85 99 0.996 6.77 5.08 Intr - 149849 149772 78 0 0 98 84 100 0.998 10.62 5.07 Intr - 150316 150195 122 0 2 39 77 144 0.771 9.04 5.06 Intr - 150742 150669 74 1 2 50 63 100 0.926 2.40 5.05 Intr - 150942 150867 76 2 1 90 78 54 0.830 4.61 5.04 Intr - 153124 153070 55 2 1 97 64 54 0.857 2.43 5.03 Intr - 153288 153204 85 0 1 109 86 165 0.990 18.29 5.02 Intr - 153493 153411 83 2 2 92 77 125 0.999 11.65 5.01 Init - 154066 153844 223 1 1 94 80 411 0.947 39.60 5.00 Prom - 154304 154265 40 -9.07 6.11 PlyA - 154461 154456 6 1.05 6.10 Term - 158298 158197 102 0 0 92 46 251 0.999 19.88 6.09 Intr - 159670 159620 51 1 0 114 109 125 0.979 16.79 6.08 Intr - 160765 160697 69 1 0 100 106 68 0.997 9.67 6.07 Intr - 162289 161924 366 1 0 94 64 337 0.464 27.80 6.06 Intr - 162954 162870 85 2 1 91 75 205 0.966 19.82 6.05 Intr - 163083 163028 56 0 2 113 68 148 0.999 13.57 6.04 Intr - 163402 163325 78 1 0 88 113 147 0.996 17.44 6.03 Intr - 165505 165459 47 2 2 119 91 46 0.565 6.62 6.02 Intr - 166207 165984 224 0 2 68 75 93 0.689 4.30 6.01 Init - 167264 167152 113 2 2 61 77 20 0.245 -2.15 6.00 Prom - 168953 168914 40 -4.61 7.21 PlyA - 169743 169738 6 1.05 7.20 Term - 170658 170429 230 1 2 101 44 286 0.912 22.62 7.19 Intr - 173411 173332 80 1 2 96 68 73 0.177 5.79 7.18 Intr - 184878 184752 127 1 1 -34 72 194 0.006 5.84 7.17 Intr - 185518 185358 161 0 2 51 94 96 0.022 6.64 7.16 Intr - 191286 191137 150 2 0 96 7 266 0.459 19.09 7.15 Intr - 191563 191388 176 1 2 124 75 33 0.995 4.85 7.14 Intr - 191903 191763 141 2 0 81 75 107 0.602 9.76 7.13 Intr - 198947 198859 89 0 2 92 58 109 0.554 8.49 7.12 Intr - 199561 199486 76 1 1 105 66 110 0.058 10.08 7.11 Intr - 201706 201551 156 1 0 46 94 136 0.475 10.72 7.10 Intr - 202324 202216 109 0 1 31 40 66 0.424 -3.11 7.09 Intr - 202881 202818 64 1 1 106 97 115 0.908 12.47 7.08 Intr - 208348 208051 298 1 1 91 109 332 0.997 32.59 7.07 Intr - 208792 208724 69 1 0 42 25 135 0.837 2.47 7.06 Intr - 209787 209614 174 0 0 92 64 74 0.979 6.05 7.05 Intr - 210244 210085 160 0 1 94 72 98 0.969 9.30 7.04 Intr - 210860 210802 59 2 2 59 110 28 0.511 0.27 7.03 Intr - 215889 214035 1855 2 1 99 81 1699 0.472 158.52 7.02 Intr - 219394 219139 256 2 1 107 45 206 0.424 15.24 7.01 Init - 225765 225459 307 0 1 92 100 301 0.730 29.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 199621 199486 136 1 1 72 66 270 0.938 21.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:47330092_47566403|GENSCAN_predicted_peptide_1|337_aa MGNDSVSYEYGDYSDLSDRPVDCLDGACLAIDPLRVAPLPLYAAIFLVGVPGNAMVAWVA GKVARRRVGATWLLHLAVADLLCCLSLPILAVPIARGGHWPYGAVGCRALPSIILLTMYA SVLLLAALSADLCFLALGPAWWSTVQRACGVQVACGAAWTLALLLTVPSAIYRRLHQEHF PARLQCVVDYGGSSSTENAVTAIRFLFGFLGPLVAVASCHSALLCWAARRCRPLGTAIVV GFFVCWAPYHLLGLVLTVAAPNSALLARALRAEPLIVGLALAHSCLNPMLFLYFGRAQLR RSLPAACHWALRESQGQDESVDSKKSTSHDLVSEMEV >gi568815579r:47330092_47566403|GENSCAN_predicted_CDS_1|1014_bp atggggaacgattctgtcagctacgagtatggggattacagcgacctctcggaccgccct gtggactgcctggatggcgcctgcctggccatcgacccgctgcgcgtggccccgctccca ctgtatgccgccatcttcctggtgggggtgccgggcaatgccatggtggcctgggtggct gggaaggtggcccgccggagggtgggtgccacctggttgctccacctggccgtggcggat ttgctgtgctgtttgtctctgcccatcctggcagtgcccattgcccgtggaggccactgg ccgtatggtgcagtgggctgtcgggcgctgccctccatcatcctgctgaccatgtatgcc agcgtcctgctcctggcagctctcagtgccgacctctgcttcctggctctcgggcctgcc tggtggtctacggttcagcgggcgtgcggggtgcaggtggcctgtggggcagcctggaca ctggccttgctgctcaccgtgccctccgccatctaccgccggctgcaccaggagcacttc ccagcccggctgcagtgtgtggtggactacggcggctcctccagcaccgagaatgcggtg actgccatccggtttctttttggcttcctggggcccctggtggccgtggccagctgccac agtgccctcctgtgctgggcagcccgacgctgccggccgctgggcacagccattgtggtg gggttttttgtctgctgggcaccctaccacctgctggggctggtgctcactgtggcggcc ccgaactccgcactcctggccagggccctgcgggctgaacccctcatcgtgggccttgcc ctcgctcacagctgcctcaatcccatgctcttcctgtattttgggagggctcaactccgc cggtcactgccagctgcctgtcactgggccctgagggagtcccagggccaggacgaaagt gtggacagcaagaaatccaccagccatgacctggtctcggagatggaggtgtag >gi568815579r:47330092_47566403|GENSCAN_predicted_peptide_2|1360_aa MPPPRTREGRDRRDHHRAPSEEEALEKWDWNCPETRRLLEDAFFREEDYIRQGSEECQKF WTFFERLQRFQNLKTSRKEEKDPGQPKHSIPALADLPRTYDPRYRINLSVLGPATRGSQG LGRHLPAERVAEFRRALLHYLDFGQKQAFGRLAKLQRERAALPIAQYGNRILQTLKEHQV VVVAGDTGCGKSTQVPQYLLAAGFSHVACTQPRRIACISLAKRVGFESLSQYGSQVGYQI RFESTRSAATKIVFLTVGLLLRQIQREPSLPQYEVLIVDEVHERHLHNDFLLGVLQRLLP TRPDLKVILMSATINISLFSSYFSNAPVVQVPGRLFPITVVYQPQEAEPTTSKSEKLDPR PFLRVLESIDHKYPPEERGDLLVFLSGMAEISAVLEAAQTYASHTQRWVVLPLHSALSVA DQDKVFDVAPPGVRKCILSTNIAETSVTIDGIRFVVDSGKVKEMSYDPQAKLQRLQEFWI SQASAEQRKGRAGRTGPGVCFRLYAESDYDAFAPYPVPEIRRVALDSLVLQMKSMSVGDP RTFPFIEPPPPASLETAILYLRDQGALDSSEALTPIGSLLAQLPVDVVIGKMLILGSMFS LVEPVLTIAAALSVQSPFTRSAQSSPECAAARRPLESDQGDPFTLFNVFNAWVQVKSERS RNSRKWCRRRGIEEHRLYEMANLRRQFKELLEDHGLLAGAQAAQVGDSYSRLQQRRERRA LHQLKRQHEEGAGRRRKVLRLQEEQDGGSSDEDRAGPAPPGASDGVDIQVGAMGCGVWGF TKDVKFKLRHDLAQLQAAASSAQDLSREQLALLKLVLGRGLYPQLAVPDAFNSSRKDSDQ IFHTQAKQGAVLHPTCVFAGSPEVLHAQELEASNCDGSRDDKDKMSSKHQLLSFVSLLET NKPYLVNCVRIPALQSLLLFSRSLDTNGDCSRLVADGWLELQLADSESAIRLLAASLRLR ARWESALDRQLAHQAQQQLEEEEEDTPVSPKEVATLSKELLQFTASKIPYSLRRLTGLEV QNMYVGPQTIPATPHLPGLFGSSTLSPHPTKGGYAVTDFLTYNCLTNDTDLYSDCLRTFW TCPHCGLHAPLTPLERIAHENTCPQAPQDGPPGAEEAALETLQKTSVLQRPYHCEACGKD FLFTPTEVLRHRKQHNKGPIRGDVLGVSVSFWEEEVEGLKKAAYKAVNYDKLKETTQGKE ENPAQFVAHLAATLRRYTALDPEGPEGRLILNMHFITQSTPDIRKKLQKLESGPQTPQQE LSNLAFKRLKTDAARSPRKPPRPSQTPSFMQLSQWKPLFAFTWTDPDTHQAQQTTWAVLP QGFTDSPHYFSQAQISSSSVTYLSIIIIKTQVLSLLIVSN >gi568815579r:47330092_47566403|GENSCAN_predicted_CDS_2|4083_bp atgcctcctcctagaacaagggagggcagggatcgccgagaccaccaccgggctcccagc gaggaagaggccttggagaaatgggactggaattgtccagagacgcgtcgcctcttggaa gatgccttcttccgtgaagaggattacatccgtcagggttctgaggaatgtcagaagttt tggaccttctttgaacgcctgcagagattccagaatctcaagacctccaggaaggaggag aaagaccctggacagcccaagcacagcatcccagcgctggccgacctacctcgcacttac gacccacgttaccgcatcaacctctctgttcttggccctgccacgcggggctctcaggga ctgggcaggcacttgcccgcggagagagtggctgagttccgccgagccctgttgcactac ctggactttggccagaagcaggcatttgggcgtctggccaagctgcagcgtgagcgggca gccctccccatcgcccagtatgggaaccgcatcctgcagacgctgaaggagcaccaggtg gtggtagtggccggtgacaccggctgtggcaagtccactcaggtgccccagtacctgctg gctgctggcttcagtcatgtggcgtgcacccagccccggcggatcgcctgcatctcactg gccaagcgtgtgggctttgagagcctcagtcagtatggctcacaggtcggctaccagatc cgctttgagagcacacgttcggcggccaccaagattgtattcctgacagtggggctgctc ctgcgacaaatccagcgggaacccagcctgccccagtatgaggtcctgattgtggatgaa gtccatgagcggcatctccacaacgatttcctcctgggcgtcctccagcgcctgttgccc acgcggcctgacctcaaggtcatcctcatgtcggccaccatcaacatctcgctcttctcc agctatttcagcaatgcccctgtggtacaggtgcctgggaggctgttccccatcacggtt gtgtaccagccgcaggaggcggagccgaccacgtccaagtcagagaagctggacccgcgg cctttcctgagggtgctggagtccattgaccacaagtacccgcctgaggagcggggtgac ctcctcgtcttcctcagcggcatggcggagatcagcgccgtgctggaggctgcccagacc tatgccagccacacccagcgctgggtggtactgccactgcacagcgccctgtctgtggcc gaccaggacaaggtatttgatgtggcaccccctggagtccggaaatgcatcctctccacc aacattgctgagacctcagtcaccattgacgggatccgcttcgtagtagattccggaaag gtgaaggagatgagctacgatccgcaggccaagctgcaacggctgcaggagttctggatt agtcaggccagcgcagagcagcggaagggccgggcgggccgcacgggccccggagtctgc ttccgcctctatgccgaatcggactatgatgccttcgccccctaccccgtcccagaaatt cggagggtggccctggactcgttggtgctgcagatgaagagcatgagtgtgggggacccc cgaaccttccccttcatcgagcccccaccaccagccagcctggaaaccgccatcctctac ctccgggaccagggggccctggacagctcagaggccctcacacccattgggtccctgcta gcccagctgcctgtggacgttgtgattgggaagatgctgatcctgggctccatgttcagc ctggtggagcctgtgctcaccatcgcagccgcacttagcgtccagtcgcccttcacccgc agcgcccagagcagcccagagtgcgcggcagcacggcggccgctggagagcgaccagggt gaccccttcacgctcttcaacgtcttcaacgcctgggtgcaggtgaaatctgaacggagc agaaactctcgcaagtggtgccgccgccggggcatagaggagcatcgactgtacgaaatg gccaaccttcggcgccagttcaaggagctgttggaggaccacgggctgctggctggggcc caggccgcgcaggtaggggacagctacagtcggttgcagcagcgccgggagcgccgggcc ctgcaccagctgaaacgccagcacgaggagggcgcggggcgcaggcgcaaggtgctgcgg ctgcaggaggagcaggacggcggctccagtgacgaggacagggctggcccagccccccca ggggccagtgatggcgtggacatccaggtgggcgccatgggctgtggggtgtgggggttt accaaggatgtgaagttcaagcttcggcatgacctggcgcagctgcaggccgctgccagc tcagcccaggacctgagccgcgagcagctggctctgctgaagctggtgctgggccggggc ctgtacccacagctggccgtccccgacgccttcaacagcagccgaaaggactcagaccag attttccacacgcaggccaagcagggcgccgtgctgcaccccacctgcgtcttcgctggc agccccgaggtgctgcacgcacaggagctggaggccagcaactgcgacggaagccgagac gacaaggacaagatgagcagcaaacaccagctcctcagcttcgtgtccctgctggagacc aacaagccgtacctggtgaactgcgtccgcatccctgccctccagtccctcctgcttttt agccggtctttggacaccaatggtgactgctcccgcctggtggccgatggctggctggag ctgcagctagcagacagtgaaagtgccatccgactcctggcggcttccctgcggctccgt gcccgctgggaaagtgccctggaccggcagctggcgcaccaggcccagcagcagctggag gaggaggaggaggatacgccagtcagccccaaggaggtggccaccctgagcaaggaactc ctgcaattcacggcatccaagattccttacagcctccggcggctcacagggctagaagtc cagaacatgtatgtgggaccccagaccatcccagccaccccccatcttcctggcctcttt ggcagctccaccctgtccccccaccccacaaaggggggctacgcagtcactgacttcctc acctacaactgcctcacgaatgacacagacctgtacagcgactgtctccgaaccttctgg acctgcccccactgtggcctgcatgcgcccctcacgcccctggagcgcatcgcccatgag aacacctgcccccaggccccacaggatgggcccccaggggctgaggaagctgccctcgaa accctccagaagacatctgtcctgcagaggccctaccactgcgaggcctgcgggaaggac ttcctctttacacccacagaggtgctgcgccaccggaagcagcacaacaaaggccccatc agaggggacgtgctgggtgtcagcgtcagcttctgggaggaggaagttgaagggcttaaa aaggcagcttacaaagctgttaattatgacaaacttaaagaaactacccaaggtaaagag gaaaacccagcccagttcgtggcccacttagcagcaacacttagacgctataccgcccta gacccagaagggccagaaggccgccttattcttaatatgcattttatcactcagtccact cctgacattaggaaaaaacttcaaaaattagaatctggccctcaaaccccacaacaggaa ttaagcaacctcgccttcaagcggctgaagactgatgctgcccgatcgcctcggaagccc cctagaccatcacagacgccaagcttcatgcaactctcacagtggaagcctctcttcgct ttcacttggactgaccctgacacccatcaggctcagcaaactacctgggctgtactgccg caaggcttcacagacagcccccattacttcagtcaagcccaaatttcatcctcatctgtt acctatctcagcataattatcataaaaacacaggtgctctccctgctgatcgtgtccaat taa >gi568815579r:47330092_47566403|GENSCAN_predicted_peptide_3|481_aa MLCRLTLAHPLSRSLVVLFNFHKTRERPGTGSAAASTPEPPPDPPSPRAAPRPLRSPYDE LPHYPGIVDGPAALASFPETVPAVPGPYGPHRPPQPLPPGLDSDGLKREKDEIYGHPLFP LLALVFEKCELATCSPRDGAGAGLGTPPGGDVCSSDSFNEDIAAFAKQVRSERPLFSSNP ELDNLMIQAIQVLRFHLLELEKVHDLCDNFCHRYITCLKGKMPIDLVIEDRDGGCREDFE DYPASCPSLPDQPHTHYTTATQTRMRLGNSAQTHAHQRLQMPPTDSATPTQVHIQPHQLS NSHFCSQPDSSVHRNNMWIRDHEDSGSVHLGTPGPSSGGLASQSGDNSSDQGDGLDTSVA SPSSGGEDEDLDQERRRNKKRGIFPKVATNIMRAWLFQHLSHPYPSEEQKKQLAQDTGLT ILQVNNWFINARRRIVQPMIDQSNRTGQGAAFSPEGQPIGGYTETQPHVAVRPPESGNAS H >gi568815579r:47330092_47566403|GENSCAN_predicted_CDS_3|1446_bp atgctctgcaggctcactctggctcatcccctatcccgctccttggtggttctgtttaat ttccacaagacaagggagagaccgggaacggggagcgcggctgccagcacccctgagccg ccgccggaccctccgtcgccccgggccgccccccgccccctgcggtccccgtatgatgag ctgccgcactacccaggcatcgtggatggccccgcagccctggctagcttcccagagaca gtgcccgcagtaccagggccctatggcccgcaccggcctccccagcccctgcccccaggc ttggacagcgacggcctgaagagggagaaggatgagatctatggacacccgctcttcccc ctcttggccctggtctttgagaaatgtgaactggctacatgctctccccgtgacggggcc ggagctgggctggggacaccccctggaggtgacgtctgctcctctgattccttcaacgag gacatcgctgcctttgccaagcaggttcgctctgagaggcccctcttctcctccaaccca gaactggacaatctgatgatccaggccatccaggtgctgcggttccacctgctggagctg gagaaggtccacgacctgtgcgacaacttctgtcaccgctacatcacctgcctcaaggga aagatgcccatcgacctggtcatcgaggatcgggacggcggctgcagggaggacttcgag gactacccagcctcctgccccagcctcccagaccagccacacacccactacacaacagct acgcagactcggatgcggctgggcaacagtgcacagacacacgcacaccagcggctgcag atgccacccacagattcagctacacccacacaggtccacatacagccacaccaactcagc aacagccatttctgctcacagccagacagtagcgtccaccggaataatatgtggattcga gaccatgaggatagtgggtctgtacatttggggaccccaggtccatccagtgggggcctg gcctcccagagtggggacaactccagtgaccaaggagacgggctggacaccagcgtggcc tctcccagttctggtggagaagatgaggacttggaccaggagcgacggcgaaacaagaag agggggatcttccccaaggtggccaccaacatcatgcgagcctggttgttccagcacctc tcgcacccgtacccctcggaggagcagaagaaacagctggcgcaggacacggggctcacc atcctgcaagtcaacaactggttcattaacgcccggagacgcatcgtgcaacctatgatc gatcaatccaaccgcacagggcagggtgcagccttcagcccagagggccagcccatcggg ggctataccgagacgcagccacacgtggccgtccggcctccggaatctggaaatgcctct cattaa >gi568815579r:47330092_47566403|GENSCAN_predicted_peptide_4|1005_aa MAPLALVGVTLLLAAPPCSGAATPTPSLPPPPANDSDTSTGGCQGSYRCQPGVLLPVWEP DDPSLGDKAARAVVYFVAMVYMFLGVSIIADRFMAAIEVITSKEKEITITKANGETSVGT VRIWNETVSNLTLMALGSSAPEILLSVIEVCGHNFQAGELGPGTIVGSAAFNMFVVIAVC IYVIPAGESRKIKHLRVFFVTASWSIFAYVWLYLILAVFSPGVVQVWEALLTLVFFPVCV VFAWMADKRLLFYKYVYKRYRTDPRSGIIIGAEGDPPKSIELDGTFVGAEAPGELGGLGP GPAEARELDASRREVIQILKDLKQKHPDKDLEQLVGIANYYALLHQQKSRAFYRIQATRL MTGAGNVLRRHAADASRRAAPAEGAGEDEDDGASRIFFEPSLYHCLENCGSVLLSVTCQG GEGNSTFYVDYRTEDGSAKAGSDYEYRVSHLLPRLEYSVPRRGDPSLSQILSPVRLSSPS ITCLMPVDPHTWEFFYGTYVSSGKHVQTPPVQHTADDRPSPRSSPSPPRRSEGTLVFKPG ETQKELRIGIIDDDIFEEDEHFFVRLLNLRVGDAQGMFEPDGGGRPKGRLVAPLLATVTI LDDDHAGIFSFQDRLLHVSECMGTVDVRVVRSSGARGTVRLPYRTVDGTARGGGVHYEDA CGELEFGDDETMKTLQVKIVDDEEYEKKDNFFIELGQPQWLKRGISALLLNQGDGDRKLT AEEEEARRIAEMGKPVLGENCRLEVIIEESYDFKNTVDKLIKKTNLALVIGTHSWREQFL EAITVSAGDEEEEEDGSREERLPSCFDYVMHFLTVFWKVLFACVPPTEYCHGWACFGVSI LVIGLLTALIGDLASHFGCTVGLKDSVNAVVFVALGTSIPDTFASKVAALQDQCADASIG NVTGSNAVNVFLGLGVAWSVAAVYWAVQGRPFEVRTGTLAFSVTLFTVFAFVGIAVLLYR RRPHIGGELGGPRGPKLATTALFLGLWLLYILFASLEAYCHIRGF >gi568815579r:47330092_47566403|GENSCAN_predicted_CDS_4|3018_bp atggctcccctggccttggtgggggtcacactcctcctggcggctcccccatgctccggg gcagccaccccaaccccctccctgccgcctcccccggccaatgacagcgacaccagcaca gggggctgccaggggtcctaccgctgccagccgggggtgctgctgcccgtgtgggagccc gacgacccgtcgctgggtgacaaggcggcacgggcagtggtgtactttgtggccatggtc tacatgtttctgggagtgtccatcatcgccgaccgtttcatggcggccatcgaggtcatc acgtcaaaagagaaggagatcaccatcaccaaggccaacggtgagaccagcgtgggcacc gttcgcatctggaatgagacggtgtccaacctcacgctcatggccctgggctcctccgca cctgagatcctgctgtcagtcatcgaagtctgcggccacaacttccaggcgggtgagctg ggcccaggcaccatcgtgggcagcgctgccttcaacatgtttgtggtcatcgccgtgtgc atctacgtcatcccagccggcgagagccgcaagatcaagcacctgagagtcttctttgtc actgcctcttggagcatcttcgcctatgtctggctttatctcatccttgctgttttttcc cccggtgtggtccaggtgtgggaggcgctgctgaccctggtcttcttcccggtgtgcgtg gtattcgcctggatggccgacaagcggctgctcttctacaagtacgtgtacaagcgctac cgcaccgacccacgcagcggcatcatcataggcgccgagggcgaccccccgaagagcatc gagctggacggcacgttcgtgggcgccgaggccccaggtgagctgggcggcctgggcccg ggccccgccgaggcgcgcgagctggacgccagccgccgcgaggtcatccagatcctcaag gacctcaagcagaagcacccggacaaggatctggagcagctggtgggcatcgccaactac tacgcgctgctgcaccagcagaagagccgcgccttctaccgcatccaggccacgcggctg atgaccggcgccgggaacgtgctgcgcagacacgcggcggacgcctcgcgcagggcggcg ccggccgagggcgcgggcgaggacgaagacgacggcgccagccgcatcttcttcgagcct agcctctaccactgcctggagaactgcggctccgtgctgctgtccgtcacgtgccagggc ggcgagggcaacagcaccttctacgtggactaccgcactgaggacggctctgccaaggcg ggctccgactacgagtacagagtttcgcacttgttgcccagactggagtacagtgttcct cgaagaggtgacccgtccctaagtcagatcctgtcccctgtccggctgtccagtcccagt attacttgtttaatgcctgtcgaccctcatacttgggagttcttctatggcacctacgtc tcctctgggaagcacgtccagactccccccgtacaacacacagccgatgaccgaccctca ccccgctcttccccctccccaccccgccgcagcgagggcacgctggtgttcaaaccaggc gagacgcagaaggagctgcgcatcggcatcatcgacgacgacatcttcgaggaggacgag catttcttcgtgcggctgctgaacctgcgcgtgggcgacgcgcagggcatgttcgagccg gacggcggcgggcggcccaaggggcggctggtggcgccgctgctggccaccgtcaccatc ctggacgacgaccacgcaggcatcttctccttccaggaccgcctgctgcacgtgagcgag tgcatgggcaccgtggacgtgcgcgtcgtgcgcagctcgggcgcgcgcggcaccgtgcgc cttccctaccgcacggtggacggcacggcgcgcggcggcggcgtgcactacgaggacgcg tgcggagagctggagtttggcgacgacgagaccatgaaaactcttcaggtgaagatagtt gatgacgaggaatatgagaaaaaggataatttcttcattgagctgggccagccccagtgg cttaagcgagggatttcagctctgctactcaatcaaggggatggggacaggaagctaaca gccgaggaggaggaggctcggaggatagcagagatgggcaagccagttcttggggagaac tgccggctggaggtcatcatcgaggagtcatatgattttaagaacacggtggataaactc atcaagaaaacgaacttggccttggtaattgggacccattcatggagggagcagttttta gaggcaattacggtgagcgcaggggacgaggaggaggaggaggacgggtcccgggaggag cggctgccgtcgtgctttgactacgtgatgcacttcctgacggtgttctggaaggtgctc ttcgcctgtgtgccccccaccgagtactgccacggctgggcctgctttggtgtctccatc ctggtcatcggcctgctcaccgccctcattggggacctcgcctcccacttcggctgcacc gttggcctcaaggactctgtcaatgctgttgtcttcgttgccctgggcacctccatccct gacacgttcgccagcaaggtggcggcgctgcaggaccagtgcgccgacgcgtccatcggc aacgtgaccggctccaacgcggtgaacgtgttccttggcctgggcgtcgcctggtctgtg gccgccgtgtactgggcggtgcagggccgccccttcgaggtgcgcactggcacgctggcc ttctccgtcacgctcttcaccgtcttcgccttcgtgggcattgccgtgctgctgtaccgg cgccggccgcacatcggcggcgagctgggcggcccgcgcggacccaagctcgccaccacc gcgctcttcctgggcctctggctcctgtacatcctcttcgccagcctggaggcgtactgc cacatccggggcttctag >gi568815579r:47330092_47566403|GENSCAN_predicted_peptide_5|439_aa MGEAAVAAGPCPLREDSFTRFSSQSNVYGLAGGAGGRGELLAATLKGKVLGFRYQDLRQK IRPVAKELQFNYIPVDAEIVSIDTFNKSPPKRGLVVGITFIKDSGDKGSPFLNIYCDYEP GSEYNLDSIAQSCLNLELQFTPFQLCHAEVQVGDQLETVFLLSGNDPAIHLYKENEGLHQ FEEQPVENLFPELTNLTSSVLWLDVHNFPGTSRRLSALGCQSGYVRVAHVDQRSRGEGRE VLQMWSVLQDGPISRVIVFSLSAAKETKDRPLQDEYSVLVASMLEPAVVYRDLLNRGLED QLLLPGSDQFDSVLCSLVTDVDLDGRPEVLVATYGQELLCYKYRGPESGLPEAQHGFHLL WQRSFSSPLLAMAHVDLTGDGLQELAVVSLKGVHILQHSLIQASELVLTRLRHQVEQRRR RLQGLEDGAGAGPAENAAS >gi568815579r:47330092_47566403|GENSCAN_predicted_CDS_5|1320_bp atgggggaggcggccgtggccgcggggccttgtccgttgcgcgaggacagcttcacgcgc ttctcgtcgcagagcaatgtgtacgggctggcaggcggcgccggcgggcgcggggagctg ctggccgccacccttaaaggcaaggtgctcggcttccgctaccaagacctccgacagaaa atccggccagtggccaaggagctgcagttcaactacattcccgtggatgcggagattgtc tccatcgacactttcaacaagtcaccccccaagcggggtctggttgtggggatcacgttc atcaaggattcaggggacaagggcagccccttcctgaacatttactgcgactacgagccc ggctctgagtacaaccttgactctattgcccagagctgcctgaacctggagctccagttc actccgttccagctgtgccatgcggaggtccaggtcggggatcaacttgagactgtgttt ctcttgagtgggaacgacccggccattcatctctacaaggagaacgaggggctgcatcag tttgaggaacagcccgtggaaaacctcttcccagagctgacgaacctgaccagtagcgtc ctctggctggacgtccacaacttccccggcacgtcccggcgcctctcagctctgggctgt cagagtggttatgtccgtgtcgcccacgtggaccagcggagtcgaggtgagggccgcgag gttctgcagatgtggtcggtcctgcaggacggtcccatctcccgagtgattgtgttcagc ctctcggccgccaaggagaccaaggacaggccactacaagatgagtacagcgtgctcgtg gccagcatgttggagccagcagtggtgtatcgggacctgctgaaccggggtcttgaagac cagcttctcctgcccggcagtgaccagtttgacagcgtcctctgcagcctggtcaccgat gtggatttggatgggcggccagaagtcctggtggccacctatggacaggaactgctgtgt tataagtaccggggcccagagtcggggcttcctgaggcccagcacgggttccatctgctg tggcagcggagcttctccagtcccctgctggccatggctcacgtggacctgaccggggat gggctgcaggagcttgccgtggtctccctgaagggcgtgcacatcctgcagcacagcctg attcaggcctcagagctggtcttgacccggcttcgacatcaagtggagcagaggagacgt cggctacaggggttggaggacggggcaggtgcagggcctgctgagaatgcagcctcttaa >gi568815579r:47330092_47566403|GENSCAN_predicted_peptide_6|396_aa MLTAGAPEPVPAGPGGSCRSHCSLRSSSLCRLSPQGARFFGKTFEELVGGRTRLKGLSRG LPTAELRGRGGSGEHTEGSVLMTSSLLVPEVQVLLVQEQKLLRILQRWWQSPEAINCLMR AIEIYTDMGRFTIAAKHHISIAEIYETELVDIEKAIAHYEQSADYYKGEESNSSANKCLL KVAGYAALLEQYQKAIDIYEQALRRPTNTWLQPPPPPSHHHTLILVAHALAADLSKLVPG SLPLGSWVNRASPRDMNQLPAGPPVPGSGSGPLAACPELTSATSPWLQVGTNAMDSPLLK YSAKDYFFKAALCHFCIDMLNAKLAVQKYEELFPAFSDSRECKLMKKLLEAHEEQNVDSY TESVKEYDSISRLDQWLTTMLLRIKKTIQGDEEDLR >gi568815579r:47330092_47566403|GENSCAN_predicted_CDS_6|1191_bp atgctgacggcgggtgctccagagccagtgccagcaggcccaggtggcagctgcagatct cactgctcactgcggagcagcagcctttgccgcctcagcccacagggtgccaggtttttt ggcaaaacctttgaggagctggtgggtggcaggactaggttaaagggactgagcagaggg ctcccgactgctgagctacgaggaagagggggcagtggagagcacactgagggctcagtg ttgatgacatccagcctcctcgtgccagaggtccaggtcctcctggtgcaggagcagaag ctgctcaggatcctgcagagatggtggcagagcccagaggccattaactgtttgatgcga gcaatcgagatctacacagacatgggccgattcacgattgcggccaagcaccacatctcc attgctgagatctatgagacagagttggtggacatcgagaaggccattgcccactacgag cagtctgcagactactacaaaggcgaggagtccaacagctcagccaacaagtgtctgctg aaggtggctggttacgctgcgctgctggagcagtatcagaaggccattgacatctacgaa caggctctcagaaggcccacgaacacctggctacagcctccacccccacccagccaccat cacaccctgatcttggtcgctcacgcactggccgctgacctctccaagctggtccctggc tccctgcccttggggtcctgggttaacagggcctcacctcgggacatgaaccagctccca gctggccccccagtgcctggcagtggctctggccctctggctgcttgccctgagctcacc agtgccacttctccatggctacaggtggggaccaatgccatggacagccccctcctcaag tacagcgccaaagactacttcttcaaggcggccctctgccacttctgcatcgacatgctc aacgccaagctggctgtccaaaagtatgaggagctgttcccagctttctctgattcccgg gaatgcaagttgatgaaaaaattgctagaggcccacgaggagcagaatgtggacagctac accgagtcggtgaaggaatacgactccatctcccggctggaccagtggctcaccaccatg ctgctgcgcatcaagaagaccatccagggcgatgaggaggacctgcgctaa >gi568815579r:47330092_47566403|GENSCAN_predicted_peptide_7|1578_aa MDQYSLGDEGALPSEMHLPSFSESQGLNCSDTLNRDLGPNTRGFLYAGLSGLDPDPSLPT PDMSSEVLEDNLDTLSLYSGKDSDSVKLLEEYADSESQASLQDLGLGVLKAKEADEGGRA TSGSARKGKRQHSSPQNPLLDCSLCGKVFSSASSLSKHYLTHSQERKHVCKICSKAFKRQ DHLYVGVWTGHMLTHQKTKPFVCIEQGCSKSYCDYRSLRRHYEVHHGLCILKEAPPEEEA CGDSPHAHESAGQPPPSSLRSLVPPEARSPGSLLPHRDLLRRIVSSIVHQKTPSPGPAPA GASDSEGRNTACPCPASSGSSSCTPAGPHAAPAALDTELPEEPCLPQKEPATDVFTAPNS RAAENGAPDPPEPEPDTALLQARSTAECWPEGGSVPACLPLFRGQTVPASSQPSSHSFQW LRNLPGCPKSKGNNVFVVHKPSAVPSREGSESGPGPSSGSPSEESPPGPGGGLEDALPFP AALLRVPAEAPSDPRSASGEDDPCAPKKVKVDCDSFLCQNPGEPGLQEAQKAGGLPADAS PLFRQLFLKSQEPLVSHEQMQVFQMITKSQRIFSHAQVAAVSSQLPAPEGKPAALRPLQG PWPQQPPPLAPAVDSLHAGPGNPEAEGSPARRRKTTPGVPREASPGSTRRDAKGGLKVAA VPTPLAAPSLDPSRNPDISSLAKQLRSSKGTLDLEDIFPSTGQRQTQLGGEEPPGASLPG KQAPAENGAASRITKGEKGPACSRGGGYRLLGNPRAPRFSGFRKEKAKMDMCCAASPSQV AMASFSSAGPPADPSKSKLTIFSRIQGGNIYRLPHPVKEENVAGRGNQQNGSPTDWTKPR STFVCKNCSQMFYTEKGLSSHMCFHSDQWPSPRGKQEPQVFGTEFCKPLRQVLRPEGDRH SPPGTKKPLDPTAAAPLVVPQSIPVVPVTRHIGSMAMPVLEDPLLKKLLLDSGTQSVDLK GQEKDGEERDSKESSQQRKRKKRPPPSTAGEPGPAGCHQSRLRSPMFLVDCLLKGLFQCS PYTPPPMLSPIREGSGVYFNTLCSTSTQASPDQLISSMLDQVDGSFGICVVKDDTKISIE PTYLMHLRQQRVSMEQAVAQEQGIQSVSISLPISVMFHLMTPVCLLFLMSFLETVTELCN VACSSVMPGGGTNLELALHCLHEAQGNVQVALETLLLRGPHKPRTHLLADYRYTGSDVWT PIEKRLFKKAFYAHKKDFYLIHKMIQTKTVAQCVEYYYIWKKMIKFDCGRAPGLEKRVKR EPEEVERTEEKVPCSPRERPSHHPTPKLKTKSYRRESILSSSPNAGSKRTPELLGSAESQ GIFPCRECERVFDKIKSRNAHMKRHRLQDHVEPIIRVKWPVKPFQLKEEELGADIGPLQC SSSTGRQREQEKAERFPFPTNAFPSEETGETSRGGGTKFCAELSNPGVGPELSRALLSPF VAAMDNSGKEAEAMALLAEAERKVKNSQSFFSGLFGGSSKIEEACEIYARAANMFKMAKN WSAAGNAFCQAAQLHLQLQSKHDAATCFVDAGNAFKKADPQGEGLCGPHGLAVRSAILAP CIDSSGFLAALVFPVADM >gi568815579r:47330092_47566403|GENSCAN_predicted_CDS_7|4737_bp atggaccagtacagccttggagacgagggtgccctcccatcagaaatgcacctcccttca ttttcagagagccaagggctcaactgcagcgacaccctcaaccgggatttgggtcccaac acgcgaggctttctttatgctggcctgagtggtctggacccggaccccagcctcccaacg cctgacatgtccagcgaggtgctggaggacaacttagacaccttgtccctgtactccggg aaggacagtgattctgtgaagctgctggaggagtacgcagattcggagtctcaggcatcc ttacaagacctggggctaggtgtgcttaaggctaaagaggctgacgaaggaggaagggcc acctcggggagtgccaggaaaggaaagcggcagcacagttcccctcaaaacccacttctg gactgcagcctgtgcgggaaggtgttcagcagcgccagttctctgagcaagcactacctg acacacagccaggaaaggaagcacgtctgcaaaatctgcagcaaggcctttaagcgccag gaccacctgtatgtgggagtctggaccgggcacatgctcacccaccagaagacaaagccc ttcgtgtgcatcgagcagggctgcagcaagagctactgcgactaccgctctctgcgccgg cactacgaggtccatcacggcctgtgcatcctgaaggaagcccccccggaggaagaggcc tgcggggactccccccacgcccacgagtcggccggccagccgccccccagcagcctgcgg tccctggtgcccccagaggccaggtcccccggctccctcctgccccaccgggacctcctg cgccgcatcgtgagtagcatcgtccaccagaagaccccttctcctggcccagccccggcg ggggcttcagacagcgaagggaggaacactgcctgtccctgccccgcctcatcggggtcc tcgtcctgcaccccagccggcccccacgcggccccagcagcgctggacaccgagcttccc gaggagccttgcctcccacagaaagagccggccactgacgtgttcacagcccctaattcc agggccgccgagaacggcgcccccgacccgccagagccggagccagataccgcgctgctc caggcccggtccaccgcggagtgctggcccgaaggcggctccgtgcctgcctgcctgcct ctcttccgaggccagacggtccctgccagttcccagccatcgagccacagcttccagtgg ctccggaacctgccgggctgtcccaaaagcaaaggcaacaacgtgtttgttgtccacaag ccctcggccgtgccctcgcgggagggctccgagtctggcccgggacccagcagcggaagc ccctcggaggagtccccgcccggccccggcggtggcctggaggatgctctgcccttccct gccgcgctcctcagggtccccgcggaggccccgagcgaccccaggtcggccagcggggaa gatgacccctgcgcccccaagaaggtcaaggtcgactgcgactccttcctgtgccagaac cccggggagcccggcctccaggaggcccagaaggcaggcgggctccctgcggatgcctcg ccgctcttccgccagctcttcctcaagtcgcaggaacctcttgtgagccacgagcagatg caggtgttccagatgatcaccaagtcccagcggatcttctcccatgcccaggtggcagca gtctcctcccagctccctgcgcccgagggcaaaccagccgccctgaggccgctgcagggg ccgtggccgcagcagcccccaccactggctcctgctgtggactctctccacgccggccct ggaaaccccgaggcagagggctccccagcccgcaggagaaaaaccacacccggggttccc agagaggcctcccccggcagcacgagacgagacgcaaaggggggactgaaagtggccgcg gttccaacgccccttgcagcaccgtctctggacccttccaggaatccagacatctcttct ctggccaagcagctgcgatcctctaaagggaccttggacctggaggacatcttcccctcc acaggccaacggcagacccagttaggtggggaggagccaccaggagcctcgctgccaggg aagcaggccccagccgagaatggcgcggcttcaaggatcacaaaaggtgaaaagggccca gcctgctcccggggtggaggctaccggctcttgggcaaccccagggccccgcgattctcc ggcttccggaaagagaaggcgaagatggatatgtgctgtgcggcttctccgagccaggta gccatggcctccttctcatcggccgggcccccggcagatccctccaagtccaagctgaca atattcagcagaatccagggtggaaacatctacaggctcccccatccggtgaaggaagag aatgtggcaggcagaggtaaccagcaaaacggcagtcccacagactggacgaagcccagg agcacttttgtctgcaagaactgcagccagatgttttatacggagaaagggctgagcagc cacatgtgttttcacagcgaccagtggccgtcacctcgagggaagcaggaaccgcaggtg tttggcacagagttttgcaagccgctaagacaggtgctgaggccagaaggggacaggcat agtcccccaggaaccaagaagcccttggaccccacagctgcagcccctttggtggtcccc caatcgatccccgtggttccagtgacccgacacatagggagcatggccatgcctgtactg gaggaccccctactgaagaagctcctgctggactcggggacacagtccgtggacctgaag ggacaagagaaagacggggaggagcgagacagcaaggagagcagccagcagagaaagcgg aagaagcggcccccaccctccacggctggggagcctggccctgcaggatgccaccagagc cgcctgcggtcacccatgttcctggtggactgcctcctgaagggcttattccagtgctcc ccctacacaccacccccaatgctcagccccatccgggagggctctggggtgtacttcaac accctctgttccacgtccactcaggccagccctgaccagctcatcagctccatgctcgat caagtggatgggtcctttggcatctgtgtggtgaaggatgacaccaaaatcagcattgag cccacgtacctgatgcacctgagacagcagagggtttccatggagcaggctgtggcccaa gagcagggcatacagagcgtctccatttcacttcccatctcagtcatgtttcacctgatg actcctgtgtgcctcctcttcctcatgtccttcttggaaacagtgaccgagctctgcaat gtggcatgctccagcgtgatgccaggagggggcaccaacctggagctcgctctgcactgc ctgcacgaggctcagggcaacgttcaggtcgccctggagactcttctgctccgagggccc cacaagccacggacacacctgctcgctgactatcgctacacaggttcagacgtctggacc cctatagagaagaggctttttaagaaggcgttctatgcccacaagaaggacttctacttg atacacaagatgatccagacaaagacggtagctcagtgcgttgagtattattacatctgg aaaaaaatgatcaagtttgactgtggccgagccccagggctagaaaagagggtcaagaga gagccggaggaagtggaaaggacagaggaaaaggtcccatgcagccctcgggagagaccc agccaccatccaactcccaagttaaagaccaagagttataggagggagtccatcctcagc tccagcccaaatgcaggctccaagcggacccccgagctgttgggaagtgcagagagccag ggcatttttccatgcagagagtgtgaaagggtgtttgacaagatcaagagtcgaaatgcc catatgaagcggcaccgccttcaggaccacgtggagcccatcatcagggtgaagtggcca gtgaagcccttccagctaaaggaagaggagctgggagctgacatcggccccctgcagtgc agttctagtactggccgacagagggagcaagaaaaagcagaacggtttccttttcccaca aacgcctttccaagtgaagaaaccggcgagaccagccgcgggggcgggactaagttctgc gcagaactgtcaaacccgggcgtgggccccgagctgtcacgcgctttgctgagtcccttt gtggccgccatggacaattccgggaaggaagcggaggcgatggcgctgttggccgaggcg gagcgcaaagtgaagaactcgcagtccttcttctctggcctctttggaggctcatccaaa atagaggaagcatgcgaaatctacgccagagcagcaaacatgttcaaaatggccaaaaac tggagtgctgctggaaacgcgttctgccaggctgcacagctgcacctgcagctccagagc aagcacgacgcagccacctgctttgtggacgctggcaacgcattcaagaaagccgacccc caaggtgagggcctctgcgggccacacgggctggctgtccggagcgccatcctagcaccc tgcattgattcctcaggtttcctggcggctctggtttttccagtggccgacatgtga