GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:47:51

Sequence gi568815597r:151192855_151416481 : 223627 bp : 47.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6929   6981   53  1  2   81   75    94 0.750   8.03
 1.02 Intr +  25257  25384  128  0  2   41   84    66 0.690   1.82
 1.03 Intr +  31391  31425   35  0  2  121  105    31 0.957   6.04
 1.04 Intr +  34466  34546   81  1  0   96   93    27 0.931   3.83
 1.05 Intr +  38817  38947  131  2  2   89   92    84 0.999   8.39
 1.06 Intr +  39394  39511  118  1  1   65   99   172 0.999  16.47
 1.07 Intr +  39697  39849  153  0  0   78   91    90 0.994   8.57
 1.08 Intr +  41343  41642  300  2  0  103   97   255 0.994  24.83
 1.09 Intr +  43704  43909  206  2  2   83  100   126 0.988  11.30
 1.10 Intr +  45328  45411   84  1  0  100   94    36 0.951   4.34
 1.11 Intr +  46276  46324   49  1  1   83   83    24 0.890   0.08
 1.12 Intr +  47101  47185   85  0  1  125   95   107 0.994  14.49
 1.13 Intr +  49584  49713  130  1  1  106   81    30 0.960   3.85
 1.14 Intr +  69307  69480  174  1  0   41   80   107 0.514   4.25
 1.15 Intr +  71060  71174  115  2  1   76   84   203 0.995  19.15
 1.16 Intr +  71978  72064   87  1  0   64   76   146 0.996  11.17
 1.17 Intr +  72312  72380   69  2  0   85  105    55 0.984   6.28
 1.18 Intr +  72540  72755  216  2  0   77   85   248 0.998  22.10
 1.19 Intr +  73150  73258  109  0  1   86   62   126 0.820   9.66
 1.20 Intr +  73454  73585  132  0  0  104   99   210 0.998  24.32
 1.21 Intr +  73666  73733   68  2  2   69  100    54 0.999   3.32
 1.22 Intr +  74319  74485  167  2  2   93   44   294 0.014  24.26
 1.23 Intr +  90334  90464  131  1  2   57   86   101 0.723   7.14
 1.24 Intr +  93421  95552 2132  2  2  100   82  1463 0.983 134.24
 1.25 Intr +  95674  95852  179  0  2   74   51    91 0.998   2.72
 1.26 Intr +  96241  96417  177  1  0  111   61    95 0.992   8.13
 1.27 Intr +  96524  96686  163  2  1   99   87    94 0.995  10.38
 1.28 Intr +  96824  97153  330  1  0  108  113   210 0.995  21.33
 1.29 Intr +  97268  97380  113  1  2  100  111    17 0.999   4.38
 1.30 Intr +  97578  97719  142  0  1  121   96   125 0.999  16.96
 1.31 Term +  97861  98355  495  0  0   83   34   427 0.999  31.37
 1.32 PlyA +  98594  98599    6                              -1.75

 2.14 PlyA -  98967  98962    6                               1.05
 2.13 Term - 100179  99998  182  1  2  111   47   395 0.994  35.47
 2.12 Intr - 100608 100546   63  1  0  128   65    50 0.940   5.39
 2.11 Intr - 101284 101164  121  1  1  126   81   206 0.959  23.77
 2.10 Intr - 101687 101555  133  1  1  116   76   195 0.798  21.75
 2.09 Intr - 106219 105954  266  1  2   68   96   219 0.956  17.01
 2.08 Intr - 109113 108990  124  2  1  123  110   141 0.998  20.39
 2.07 Intr - 109444 109340  105  0  0   56  113    96 0.992   8.23
 2.06 Intr - 110796 110687  110  0  2   83   72   148 0.988  11.78
 2.05 Intr - 113509 113282  228  1  0   78   78   386 0.974  34.67
 2.04 Intr - 114947 114720  228  2  0   36   71   284 0.970  19.57
 2.03 Intr - 117401 117357   45  2  0   86   94    28 0.676   1.81
 2.02 Intr - 123655 122719  937  0  1  112   83   989 0.640  91.99
 2.01 Init - 124174 124164   11  1  2   92   31     5 0.274  -4.78
 2.00 Prom - 125951 125912   40                              -4.96

 3.13 PlyA - 128511 128506    6                               1.05
 3.12 Term - 133519 133322  198  1  0   60   50    94 0.815   0.20
 3.11 Intr - 134584 134417  168  1  0   14   99   166 0.384  10.44
 3.10 Intr - 150324 149431  894  0  0   92   59   160 0.118   5.01
 3.09 Intr - 150588 150488  101  1  2   88  110   112 0.518  13.23
 3.08 Intr - 151028 150827  202  2  1   52   70   139 0.544   7.36
 3.07 Intr - 151424 151301  124  1  1   14   98    -7 0.211  -6.51
 3.06 Intr - 151682 151563  120  1  0   93    7   141 0.543   6.11
 3.05 Intr - 151993 151874  120  0  0   85   72   134 0.620  11.11
 3.04 Intr - 152334 152252   83  0  2   97  100    72 0.997   7.74
 3.03 Intr - 153107 153074   34  1  1  115  119    30 0.997   7.03
 3.02 Intr - 153479 153351  129  1  0   86   88    45 0.300   4.11
 3.01 Init - 162561 162527   35  0  2  114   53    27 0.163  -0.54
 3.00 Prom - 163817 163778   40                              -4.26

 4.12 PlyA - 164626 164621    6                               1.05
 4.11 Term - 171851 171689  163  1  1  102   43   190 0.999  13.31
 4.10 Intr - 172190 172072  119  2  2   73   94   138 0.999  12.16
 4.09 Intr - 172427 172335   93  2  0   70  116    61 0.819   7.26
 4.08 Intr - 172830 172709  122  1  2   90   49   232 0.659  19.71
 4.07 Intr - 172992 172914   79  0  1   79  109   104 0.999  10.72
 4.06 Intr - 173599 173421  179  2  2   81   46   233 0.994  18.04
 4.05 Intr - 174050 173868  183  0  0  113   85   195 0.999  21.46
 4.04 Intr - 175465 175345  121  1  1  129   77   147 0.899  17.77
 4.03 Intr - 176335 176186  150  1  0   79   10   225 0.898  14.16
 4.02 Intr - 176700 176588  113  1  2  105   66   136 0.999  13.20
 4.01 Init - 177045 176859  187  0  1   75   86   154 0.957  11.13
 4.00 Prom - 200909 200870   40                              -1.86

 5.00 Prom + 205075 205114   40                              -4.06
 5.01 Init + 206734 206873  140  0  2   62  100    77 0.975   3.92
 5.02 Intr + 207127 207333  207  1  0  111   78   253 0.998  24.79
 5.03 Intr + 207588 207734  147  0  0   83   12   130 0.949   4.25
 5.04 Intr + 207910 207991   82  1  1   49  111    99 0.999   7.94
 5.05 Intr + 208385 208501  117  1  0   65   76   212 0.999  18.36
 5.06 Intr + 208688 208776   89  1  2   73  109   110 0.848  10.37
 5.07 Term + 208963 208975   13  1  1   95   44    14 0.823  -4.63
 5.08 PlyA + 209065 209070    6                               1.05

 6.10 PlyA - 209894 209889    6                               1.05
 6.09 Term - 213610 211948 1663  0  1  107   48  1305 0.981 117.20
 6.08 Intr - 213777 213753   25  1  1  108   87   -26 0.701  -3.62
 6.07 Intr - 214169 214057  113  1  2   73   98    55 0.708   5.02
 6.06 Intr - 214437 214381   57  2  0  104   94     6 0.625   0.80
 6.05 Intr - 215386 215246  141  0  0  119  108    87 0.999  13.27
 6.04 Intr - 215727 215555  173  0  2  101    1   108 0.953   2.14
 6.03 Intr - 215974 215840  135  1  0   80   98   138 0.997  14.76
 6.02 Intr - 218917 218771  147  1  0   77   79   100 0.926   8.43
 6.01 Intr - 219542 219442  101  0  2   81   72    27 0.377   0.23

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 74319 74489 171 2 0 93 53 310 0.986 25.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:151192855_151416481|GENSCAN_predicted_peptide_1|2183_aa MGLKRYRKVIGRFRFLERQKGIFYASEDLNVCKTVKLRFQVSDLKVSAPLEYWHCLIQNK AASGIKRPMASEVPYASGMPIKKIGHRSVDSSGETTYKKTTSSALKGAIQLGITHTVGSL STKPERDVLMQDFYVVESIFFPSEGSNLTPAHHYNDFRFKTYAPVAFRYFRELFGIRPDD YLYSLCSEPLIELCSSGASGSLFYVSSDDEFIIKTVQHKEAEFLQKLLPGYYMNLNQNPR TLLPKFYGLYCVQAGGKNIRIVVMNNLLPRSVKMHIKYDLKGSTYKRRASQKEREKPLPT FKDLDFLQDIPDGLFLDADMYNALCKTLQRDCLVLQSFKIMDYSLLMSIHNIDHAQREPL SSETQYSVDTRRPAPQKALYSTAMESIQGEARRGGTMETDDHMGGIPARNSKGERLLLYI GIIDILQSYRFVKKLEHSWKALVHDGDTVSVHRPGFYAERFQRFMCNTVFKKIPCVHLGR PDVLPQTPPLEEISEGSPIPDPSFSPLVGETLQMLTTSVDNSEYMRNGDFLPTRLQAQQD AVNIVCHSKTRSNPENNVGLITLAKYGGQRRRGLSSDCEVLTTLTPDTGRILSKLHTVQP KGKITFCTGIRVAHLALKHRQGKNHKMRIIAFVGSPVEDNEKDLVKLAKRLKKEKVNVDI INFGEEEVNTEKLTAFVNTLNGKDGTGSHLVTVPPGPSLADALISSPILAGEGGAMLGLG ASDFEFGVDPSADPELALALRVSMEEQRQRQEEEARRAAAASAAEAGIATTGTEDSDDAL LKMTISQQEFGRTGLPDLSSMTEEEQIAYAMQMSLQGAEFGQAESADIDASSAMDTSEPA KEEDDYDVMQDPEFLQSVLENLPGVDPNNEAIRNAMGSLASQATKDGKKDKKEEDKKTCS PRQPSADGRDLIPSATQLGLNLIAASPSPSSALQALGPGKGLGSADMGDMKTPDFDDLLA AFDIPDIDANEAIHSGPEENEGPGGPGKPEPGVGSESEDTAAASAGDGPGVPAQASDHGL PPPDISVVSVIVKNTVCPEQSEALAGGSAGDGAQAAGVTKEGPVGPHRMQNGFGSPEPSL PGTPHSPAPPSGGTWKEKGMEGKTPLDLFAHFGPEPGDHSDPLPPSAPSPTREGALTPPP FPSSFELAQENGPGMQPPVSSPPLGALKQESCSPHHPQVLAQQGSGSSPKATDIPASASP PPVAGVPFFKQSPGHQSPLASPKVPVCQPLKEEDDDEGPVDKSSPGSPQSPSSGAEAADE DSNDSPASSSSRPLKVRIKTIKTSCGNITRTVTQVPSDPDPPAPLAEGAFLAEASLLKLS PATPTSEGPKVVSVQLGDGTRLKGTVLPVATIQNASTAMLMAASVARKAVVLPGGTATSP KMIAKNVLGLVPQALPKADGRAGLGTGGQKVNGASVVMVQPSKTATGPSTGGGTVISRTQ SSLVEAFNKILNSKNLLPAYRPNLSPPAEAGLALPPTGYRCLECGDAFSLEKSLARHYDR RSMRIEVTCNHCARRLVFFNKCSLLLHAREHKDKGLVMQCSHLVMRPVALDQMVGQPDIT PLLPVAVPPVSGPLALPALGKGEGAITSSAITTVAAEAPVLPLSTEPPAAPATSAYTCFR CLECKEQCRDKAGMAAHFQQLGPPAPGATSNVCPTCPMMLPNRCSFSAHQRMHKNRPPHV 
CPECGGNFLQANFQTHLREACLHVSRRVGYRCPSCSVVFGGVNSIKSHIQTSHCEVFHKC PICPMAFKSGPSAHAHLYSQHPSFQTQQAKLIYKCAMCDTVFTHKPLLSSHFDQHLLPQR VSVFKCPSCPLLFAQKRTMLEHLKNTHQSGRLEETAGKGAGGALLTPKTEPEELAVSQGG AAPATEESSSSSEEEEVPSSPEPPRPAKRPRRELGSKGLKGGGGGPGGWTCGLCHSWFPE RDEYVAHMKKEHGKSVKKFPCRLCERSFCSAPSLRRHVRVNHEGIKRVYPCRYCTEGKRT FSSRLILEKHVQVRHGLQLGAQSPGRGTTLARGSSARAQGPGRKRRQSSDSCSEEPDSTT PPAKSPRGGPGSGGHGPLRYRSSSSTEQSLMMGLRVEDGAQQCLDCGLCFASPGSLSRHR FISHKKRRGVGKASALGLGDGEEEAPPSRSDPDGGDSPLPASGGPLTCKVCGKSCDSPLN LKTHFRTHGMAFIRARQGAVGDN >gi568815597r:151192855_151416481|GENSCAN_predicted_CDS_1|6552_bp atgggacttaaaagatatcgaaaagttattggacgctttcgatttctggagagacaaaaa ggtattttctatgctagtgaggaccttaacgtctgtaaaactgtgaaactgcgctttcaa gtatcagacttgaaagtttctgctcctctagaatactggcattgcctaattcagaacaaa gcagcatctggaatcaagagacccatggcatctgaggtgccttatgcctctggcatgccc atcaagaaaataggccatagaagtgttgattcctcaggagagacaacatataaaaagaca acctcatcagccttgaaaggtgccatccagttaggcattacccacactgtggggagcctg agtaccaaaccagagcgtgatgtcctcatgcaagatttctacgtggttgagagtatcttc tttcccagtgaagggagcaacctgacccctgctcatcactacaatgactttcgtttcaag acctatgcacctgttgccttccgctacttccgggagctatttggtatccggcccgatgat tacttgtattccctctgcagtgagccgctgattgaactctgtagctctggagctagtggt tccctattctatgtgtccagcgacgatgagttcattattaagacagtccaacataaagag gcggaatttctgcagaagctgcttccaggatactacatgaacctcaaccagaaccctcgg actttgctgcctaaattctatggactgtactgtgtgcaggcaggtggcaagaacattcgg attgtggtgatgaacaatcttttaccaagatcggtaaaaatgcatatcaaatatgacctc aaaggctcaacctacaaacggcgggcttcccagaaagagcgagagaagcctcttcccaca tttaaagacctagacttcttacaagacatccctgatggtctttttttggatgctgacatg tacaacgctctctgtaagaccctgcagcgtgactgtttggtgctgcagagcttcaagata atggattacagcctcttgatgtcaatccataatatagatcatgcacaacgagagccctta agcagtgaaacacagtactcagttgatactcgaagaccggccccccaaaaggctctgtat tccacagccatggaatccatccagggagaggctcgacggggtggtaccatggagactgat gaccatatgggtggcatccctgcccggaatagtaaaggggaaaggcttctgctttatatt ggcatcattgacattctacagtcttacaggtttgttaagaagttggagcactcttggaaa gccctggtacatgacggagacactgtctcagtgcatcgcccaggcttctacgctgaacgg 
ttccagcgcttcatgtgcaacacagtatttaagaagattccctgcgttcaccttggtcgt cctgatgttttacctcagactccacctttggaggaaatcagtgagggctcgcctattcct gaccccagtttctcacctctagttggagagactttgcaaatgctaactacaagtgtggac aacagtgagtatatgcggaatggagacttcttacccaccaggctgcaggcccagcaggat gctgtcaacatagtttgtcattcaaagacccgcagcaaccctgagaacaacgtgggcctt atcacactggctaagtatgggggacagaggaggaggggactcagtagtgactgtgaagtg ctgaccacactcaccccagacactggccgtatcctgtccaagctacatactgtccaaccc aagggcaagatcaccttctgcacgggcatccgcgtggcccatctggctctgaagcaccga caaggcaagaatcacaagatgcgcatcattgcctttgtgggaagcccagtggaggacaat gagaaggatctggtgaaactggctaaacgcctcaagaaggagaaagtaaatgttgacatt atcaattttggggaagaggaggtgaacacagaaaagctgacagcctttgtaaacacgttg aatggcaaagatggaaccggttctcatctggtgacagtgcctcctgggcccagtttggct gatgctctcatcagttctccgattttggctggtgaaggtggtgccatgctgggtcttggt gccagtgactttgaatttggagtagatcccagtgctgatcctgagctggccttggccctt cgtgtatctatggaagagcagcggcagcggcaggaggaggaggcccggcgggcagctgca gcttctgctgctgaggccgggattgctacgactgggactgaagactcagacgatgccctg ctgaagatgaccatcagccagcaagagtttggccgcactgggcttcctgacctaagcagt atgactgaggaagagcagattgcttatgccatgcagatgtccctgcagggagcagagttt ggccaggcggaatcagcagacattgatgccagctcagctatggacacatctgagccagcc aaggaggaggatgattacgacgtgatgcaggaccccgagttccttcagagtgtcctagag aacctcccaggtgtggatcccaacaatgaagccattcgaaatgctatgggctccctggcc tcccaggccaccaaggacggcaagaaggacaagaaggaggaagacaagaagacttgctcc ccgcgccagccctcggcagatggcagggacttaattccgtctgctacccagcttggcctc aacctaatcgccgccagcccctcgccctcctctgcgctgcaggccttgggcccgggcaaa ggtctgggatctgccgatatgggggatatgaagacccctgattttgatgacctccttgct gcctttgacatccctgacattgatgcgaatgaagccatccattctgggccagaagaaaat gaggggcctggaggcccagggaagccagaaccaggtgtaggaagtgaatctgaagacaca gcagcagcctctgctggggatggccctggagttccagcccaggcctctgaccatggcctg ccaccgccagacatttctgtagtcagtgtcattgtcaagaacactgtgtgtcccgagcag tctgaggccctggctggaggctcagcaggagacggggcccaggctgctggggtaactaaa gaagggcctgtggggcctcatcgaatgcagaatggttttgggagccctgaaccttccctc ccaggaactccccactctcctgctcctcccagtgggggcacctggaaagaaaaaggcatg 
gaaggcaaaactcccttggacctgtttgctcattttggccctgagccaggggaccactca gatccgctgcctccctctgcaccctctcccactcgggagggggctctgaccccgcctcct ttcccctcttcctttgagctggcccaggagaatggcccaggcatgcagccacctgtttct tccccaccattgggggccttgaagcaggagagctgcagcccccatcatccccaggtccta gcccaacaaggctcaggctccagccctaaggccacggacatccctgccagtgcctcgcct cccccagttgctggggtgcccttcttcaagcagtctccagggcaccagagccctcttgcc tcccccaaagtgcccgtctgtcagcccttgaaggaagaagatgatgatgaggggccagtg gacaagtcttccccaggaagtccccagagtccctctagtggggccgaggctgcagatgag gacagcaatgactcccctgcctccagctcctctaggcctcttaaggtgcggatcaagacc attaaaacatcctgcgggaatatcacaaggactgtaactcaggtcccctcagatcctgat ccacctgcccccttggctgagggggccttcttggctgaggctagcctcttgaagctgtcc cctgcaacacctacttctgagggtccaaaggtggtgagcgtacagttgggtgatggtaca aggctgaaaggcactgtgctgcctgtggccaccatccagaacgccagtactgccatgctg atggcagccagtgtggctcgcaaggctgtggtgctgcctggggggactgccaccagccct aagatgattgctaagaacgtgctaggcctggtgccccaagccctgcctaaggctgacggg cgggcagggctggggactgggggacagaaggtgaatggtgcctcggtggtgatggtgcaa ccttcaaagacagctactgggccaagtacagggggcggcacagtgatatcacggacccag tccagcctggtggaggccttcaacaagatcctcaacagcaagaacctgctccctgcctat aggccaaacctgagcccaccagctgaggctgggctggccctgcctcccaccggctaccgc tgcctggagtgtggggatgccttctcattggagaagagcctggcacggcactatgaccgt cggagcatgcgcatcgaggtcacctgcaaccactgcgcccgccgcctggtcttcttcaac aagtgcagcctgctcctgcatgcacgtgaacacaaggacaaggggctcgtcatgcagtgc tcacatttggtcatgaggcctgtagcccttgaccagatggtggggcagccggacatcaca ccgctgctgcctgtagctgtcccacctgtctctggacctctggccttgcctgccttgggc aagggtgagggggccatcacctcctctgccattactacagttgctgctgaggcccctgtc ctgccgctctccacagagccgcctgctgccccggccacctctgcttacacatgctttcgc tgcctggagtgcaaggaacagtgccgggacaaggctggcatggcagctcacttccagcag ctcggcccccctgcccctggggccaccagcaatgtgtgcccaacctgccccatgatgctc cccaatcgctgcagcttcagcgcccaccagcgcatgcataagaatcgacccccccatgtc tgtcctgagtgtgggggcaacttcctgcaagccaattttcagacccatctccgggaggcc tgtctgcacgtctctcgccgtgtaggatacaggtgccccagctgttcagtggtgtttggg ggtgtgaactccatcaagtcccacatccagacgtcgcactgcgaggttttccacaagtgc 
cccatctgccccatggccttcaagtctgggccaagtgcccatgcccacctctactcccag catcccagcttccaaactcagcaggccaagctgatctacaagtgcgccatgtgcgacaca gtcttcactcacaaacccctcctctcctcacacttcgaccagcacttgctgccccagcgt gtcagtgtctttaagtgcccgtcttgtcctctgctctttgcccaaaaaaggaccatgctg gaacatctcaagaacacccatcagtctgggcgcttggaggagactgctgggaaaggggcc gggggtgccctgctgacccccaagactgagcctgaggagctggctgtttctcagggaggg gcagcccctgctactgaggagtcgtcttcatcttcagaagaggaggaagtacccagctcc cctgagcccccccgtccagccaaacggcctcggcgggaactagggagcaaaggcctcaag ggtgggggtggggggcctggaggctggacctgtggcctgtgtcactcctggttccctgag cgtgatgaatacgtggcccacatgaagaaggagcatggcaagtcagtgaaaaagttcccc tgtcgcctgtgtgagcgctccttctgctccgcccccagcctgaggcgccatgtcagagtt aatcacgagggcatcaagcgagtttacccctgcaggtattgcacagagggaaaacgcacc ttcagcagccgcctgatcctagagaaacatgtccaggtccggcacggcttgcagcttggg gcccagtcccctggccgggggaccaccttggctcggggttccagtgccagagcccagggg ccaggtcggaaacgccgccagtcttctgactcttgcagtgaggagcctgacagcacgaca ccgccagccaagtcccccaggggcggacctggatctggaggccatggccctctgcgctac cggagcagcagctccacagaacagagcctcatgatggggttgagggtggaggatggtgcc cagcagtgcctcgactgtggcttgtgctttgcctcccctggctccctgagccgacaccgt ttcatcagccacaagaagagacggggtgtgggtaaagccagtgccctggggctgggggat ggggaggaagaggcccctccatcaaggtctgaccccgatggtggagactcacccctgcct gcttctggaggcccactgacctgtaaggtctgtggcaagagctgcgacagccctctaaac ctcaagacccacttccgcacgcatggcatggcgttcatcagggctcggcagggggctgtt ggggacaactag >gi568815597r:151192855_151416481|GENSCAN_predicted_peptide_2|850_aa MVKPLEARSLAVAMGDTVVEPAPLKPTSEPTSGPPGNNGGSLLSVITEGVGELSVIDPEV AQKACQEVLEKVKLLHGGVAVSSRGTPLELVNGDGVDSEIRCLDDPPAQIREEEDEMGAA VASGTAKGARRRRQNNSAKQSWLLRLFESKLFDISMAISYLYNSKEPGVQAYIGNRLFCF RNEDVDFYLPQLLNMYIHMDEDVGDAIKPYIVHRCRQSINFSLQCALLLGAYSSDMHIST QRHSRGTKLRKLILSDELKPAHRKRELPSLSPAPDTGLSPSKRTHQRSKSDATASISLSS NLKRTASNPKVENEDEELSSSTESIDNSFSSPVRLAPEREFIKSLMAIGKRLATLPTKEQ KTQRLISELSLLNHKLPARVWLPTAGFDHHVVRVPHTQAVVLNSKDKAPYLIYVEVLECE NFDTTSVPARIPENRIRSTRSVENLPECGITHEQRAGSFSTVPNYDNDDEAWSVDDIGEL QVELPEVHTNSCDNISQFSVDSITSQESKEPVFIAAGDIRRRLSEQLAHTPTAFKRDPED 
PSAVALKEPWQEKVRRIREGSPYGHLPNWRLLSVIVKCGDDLRQELLAFQVLKQLQSIWE QERVPLWIKPYKILVISADSGMIEPVVNAVSIHQVKKQSQLSLLDYFLQEHGSYTTEAFL SAQRNFVQSCAGYCLVCYLLQVKDRHNGNILLDAEGHIIHIDFGFILSSSPRNLGFETSA FKLTTEFVDVMGGLDGDMFNYYKMLMLQGLIAARKHMDKVVQIVEIMQQGCRRCSGSSPS GPMMTVAQVICSQLPCFHGSSTIRNLKERFHMSMTEEQLQLLVEQMVDGSMRSITTKLYD GFQYLTNGIM >gi568815597r:151192855_151416481|GENSCAN_predicted_CDS_2|2553_bp atggtgaaacccttggaagctcgaagtctggctgtggccatgggagatacagtagtggag cctgcccccttgaagccaacttctgagcccacttctggcccaccagggaataatgggggg tccctgctaagtgtcatcacggagggggtcggggaactatcagtgattgaccctgaggtg gcccagaaggcctgccaggaggtgttggagaaagtcaagcttttgcatggaggcgtggca gtctctagcagaggcaccccactggagttggtcaatggggatggtgtggacagtgagatc cgttgcctagatgatccacctgcccagatcagggaggaggaagatgagatgggggccgct gtggcctcaggcacagccaaaggagcaagaagacggcggcagaacaactcagctaaacag tcttggctgctgaggctgtttgagtcaaaactgtttgacatctccatggccatttcatac ctgtataactccaaggagcctggagtacaagcctacattggcaaccggctcttctgcttt cgcaacgaggacgtggacttctatctgccccagttgcttaacatgtacatccacatggat gaggacgtgggtgatgccattaagccctacatagtccaccgttgccgccagagcattaac ttttccctccagtgtgccctgttgcttggggcctattcttcagacatgcacatttccact caacgacactcccgtgggaccaagctacggaagctgatcctctcagatgagctaaagcca gctcacaggaagagggagctgccctccttgagcccggcccctgacacagggctgtctccc tccaaaaggactcaccagcgctctaagtcagatgccactgccagcataagtctcagcagc aacctgaaacgaacagccagcaaccctaaagtggagaatgaggatgaggagctctcctcc agcaccgagagtattgataattcattcagttcccctgttcgactggctcctgagagagaa ttcatcaagtccctgatggcgatcggcaagcggctggccacgctccccaccaaagagcag aaaacacagaggctgatctcagagctctccctgctcaaccataagctccctgcccgagtc tggctgcccactgctggctttgaccaccacgtggtccgtgtaccccacacacaggctgtt gtcctcaactccaaggacaaggctccctacctgatttatgtggaagtccttgaatgtgaa aactttgacaccaccagtgtccctgcccggatccccgagaaccgaattcggagtacgagg tccgtagaaaacttgcccgaatgtggtattacccatgagcagcgagctggcagcttcagc actgtgcccaactatgacaacgatgatgaggcctggtcggtggatgacataggcgagctg caagtggagctccccgaagtgcataccaacagctgtgacaacatctcccagttctctgtg gacagcatcaccagccaggagagcaaggagcctgtgttcattgcagcaggggacatccgc 
cggcgcctttcggaacagctggctcataccccgacagccttcaaacgagacccagaagat ccttctgcagttgctctcaaagagccctggcaggagaaagtacggcggatcagagagggc tccccctacggccatctccccaattggcggctcctgtcagtcattgtcaagtgtggggat gaccttcggcaagagcttctggcctttcaggtgttgaagcaactgcagtccatttgggaa caggagcgagtgcccctttggatcaagccatacaagattcttgtgatttcggctgatagt ggcatgattgaaccagtggtcaatgctgtgtccatccatcaggtgaagaaacagtcacag ctctccttgctcgattacttcctacaggagcacggcagttacaccactgaggcattcctc agtgcacagcgcaattttgtgcaaagttgtgctgggtactgcttggtctgctacctgctg caagtcaaggacagacacaatgggaatatccttttggacgcagaaggccacatcatccac atcgactttggcttcatcctctccagctcaccccgaaatctgggctttgagacgtcagcc tttaagctgaccacagagtttgtggatgtgatgggcggcctggatggcgacatgttcaac tactataagatgctgatgctgcaagggctgattgccgctcggaaacacatggacaaggtg gtgcagatcgtggagatcatgcagcaaggttgtcgccgttgctcaggatcatccccatct ggccccatgatgacggtggcccaggtcatctgttctcagcttccttgcttccatggctcc agcaccattcgaaacctcaaagagaggttccacatgagcatgactgaggagcagctgcag ctgctggtggagcagatggtggatggcagtatgcggtctatcaccaccaaactctatgac ggcttccagtacctcaccaacggcatcatgtga >gi568815597r:151192855_151416481|GENSCAN_predicted_peptide_3|735_aa MPGRLHLLTGKFPHAGMAEDEPDAKSPKTGGRAPPGGAEAGEPTTLLQRLRGTISKAVQN KVEGILQDVQKFSDNDKLYLYLQLPSGPTTGDKSSEPSTLSNEEYMYAYRWIRNHLEEHT DTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDIKARRLGGRGQSKYCYSGI RRKTLVSMPPLPGLDLKGSESVSTKSPSNSTLLSQPEMGPEVTPAPRDELVEAACALTCD WAERILKRSFSSIVEVARFLLQQHLISARSAHAHVLKAMGLAEEDEHAPRERSSKPKNGL ENPEGGAHKKPERLAQPPKDLEARTGAGPLARGERKKSVVESSAPGANNLQVNALVARLP LLLPRAPRSLIPPIPVSPPILAPRLSSGALKVATLPLSSRAGAPPAAVPIINMILPTVPA LPGPGPGPGRAPPGGLTQPRGTENREVGIGGDQGPHDKGVKRTAEVPVSEASGQAPPAKA AKQDIEDTASDAKRKRGRPRKKSGGSGERNSTPLKSAAAMESAQSSRLPWETWGSGGEGN SAGGAERPGPMGEAEKGAVLAQGQGDGTVSKGGRGPGSQHTKEAEDKIPLVPSKVSVIKG SRSQKEAFPLAKGEAAPRAAPQPGPGAASAAAVREAQAANGTRERSLLLRRQCRQSNRDC PHPLRGGPPEDQLNLELRRKMCCEQGEPQLPQAVQDRRVFLKAGASFEEVPIRLHLVDYS GAATKRDEQACVEIE >gi568815597r:151192855_151416481|GENSCAN_predicted_CDS_3|2208_bp atgcccggccgactccacctcttgacagggaagttccctcatgccgggatggcagaagat 
gagcctgatgctaagagccccaagactgggggaagggcccccccaggtggtgctgaggct ggggaacctaccacccttcttcagaggctccgaggtaccatttccaaggccgtgcagaac aaagtagaggggatcctgcaagatgtacagaaattttctgacaatgacaagctgtatctc taccttcagctcccctcaggacccaccactggagacaaaagctcagagccaagtacactg agcaatgaggagtacatgtatgcctataggtggatccgcaaccacctggaagagcacact gacacctgtctgccaaagcaaagtgtttatgatgcctatcggaagtactgtgagagtctt gcctgttgccgcccactcagcacagccaactttggcaagatcatcagagagatcttccct gacatcaaagctcgaaggcttggtggccggggccagtccaaatattgctacagtggcata aggaggaagaccttggtgtctatgccacccctgcctggacttgacctaaagggttctgag agtgtaagtaccaaatcaccttccaattccactcttctctcccagccagaaatgggccca gaagtaaccccagcacctcgagatgaactggtggaggcagcgtgtgccctgacctgtgac tgggcagagcggatcctgaaacggtccttcagttccatcgttgaggtcgcccgcttcctg ctacagcagcatctcatctctgcccgatctgcacatgcccatgtgcttaaggccatgggg ctcgctgaagaggacgaacatgcacctcgggaacggtcatctaaaccaaagaatggttta gagaacccagagggtggagcccacaagaagccagagagactggcccagcctcctaaggat ctggaagcccgaactggggccggtcctctcgcacgtggagagcggaagaagagtgtagtt gagagctcggccccaggagccaataacctgcaggttaatgccctagtggctcggctgcct ctgctccttccccgggcccctcgctcactaattccgccaatcccagtctctccacctatt ctggcccccaggctttcttcaggtgccctgaaagtggctacactgcctctgtctagtagg gccggggcacccccagcagctgtgcccatcattaacatgatcttaccaactgttcctgct ttgcctggacctggacctgggcctgggcgagctccacctgggggactcactcagccccgg ggcacagagaacagagaggtaggcataggtggtgaccaaggaccacatgacaagggtgtc aagaggacagctgaagtacctgtgagtgaggccagtgggcaggctccaccagctaaagca gcaaagcaggatatagaggatacagcaagtgatgccaaaaggaaacgggggcgccctcga aaaaagtcaggtggaagtggggaaaggaattctacccctctcaagtcagcagctgccatg gaatctgcccagtcctcaaggttaccatgggagacatggggctcaggaggggaaggcaac tcagctggaggggcagagaggccagggccaatgggagaggctgaaaagggggcagtactt gcccagggtcagggagatggtactgtttccaaaggaggaaggggccccggttcccagcat accaaagaagcagaagataaaattcccttggtcccctcaaaagtgagtgtcatcaagggc agcagaagccaaaaggaggcttttcctttggcaaagggagaggcggcgccgcgggcagcc ccgcagccggggcctggtgcagcctccgcggccgctgtcagggaagcgcaggcggccaat ggaacccgggagcggtcgctgctgctgaggcggcagtgtcggcagtccaaccgcgactgc 
ccgcaccccctccgcgggggtcccccagaggatcaactaaaccttgaactaagaagaaaa atgtgttgtgagcagggggagcctcagctgcctcaggccgttcaggacagaagggtgttt ctgaaggccggagcaagttttgaagaagtccctatcagattacacttggttgactactcc ggagcagccactaagagggatgaacaggcctgcgtggaaattgaatga >gi568815597r:151192855_151416481|GENSCAN_predicted_peptide_4|502_aa MRLEWGPRPAALPWPAGMCAAERAEGAFTLQSVAQPMRPIASTATKCGNCGPGYSTPLEA MKGPREEIVYLPCIYRNTGTEAPDYLATVDVDPKSPQYCQVIHRLPMPNLKDELHHSGWN TCSSCFGDSTKSRTKLVLPSLISSRIYVVDVIEPKDIHAKCELAFLHTSHCLASGEVMIS SLGDVKGNGKGGFVLLDGETFEVKGTWERPGGAAPLGYDFWYQPRHNVMISTEWAAPNVL RDGFNPADVEAGLYGSHLYVWDWQRHEIVQTLSLKDGLIPLEIRFLHNPDAAQGFVGCAL SSTIQRFYKNEGGTWSVEKVIQVPPKKVKGWLLPEMPGLITDILLSLDDRFLYFSNWLHG DLRQYDISDPQRPRLTGQLFLGGSIVKGGPVQVLEDEELKSQPEPLVVKGKRVAGGPQMI QLSLDGKRLYITTSLYSAWDKQFYPDLIREGSVMLQVDVDTVKGGLKLNPNFLVDFGKEP LGPALAHELRYPGGDCSSDIWI >gi568815597r:151192855_151416481|GENSCAN_predicted_CDS_4|1509_bp atgaggctggagtggggacctaggccagccgcactgccgtggcccgctgggatgtgtgct gcagaacgtgcggagggagccttcaccctccagagcgtggcccagccaatgcgccccatt gcttccacagctacgaaatgtgggaattgtggacccggctactccacccctctggaggcc atgaaaggacccagggaagagatcgtctacctgccctgcatttaccgaaacacaggcact gaggccccagattatctggccactgtggatgttgaccccaagtctccccagtattgccag gtcatccaccggctgcccatgcccaacctgaaggacgagctgcatcactcaggatggaac acctgcagcagctgcttcggtgatagcaccaagtcgcgcaccaagctggtgctgcccagt ctcatctcctctcgcatctatgtggtggacgtcattgagcccaaggacatccatgccaag tgcgaactggcctttctccacaccagccactgcctggccagcggggaagtgatgatcagc tccctgggagacgtcaagggcaatggcaaagggggttttgtgctgctggatggggagacg ttcgaggtgaaggggacatgggagagacctgggggtgctgcaccgttgggctatgacttc tggtaccagcctcgacacaatgtcatgatcagcactgagtgggcagctcccaatgtctta cgagatggcttcaaccccgctgatgtggaggctggactgtacgggagccacttatatgta tgggactggcagcgccatgagattgtgcagaccctgtctctaaaagatgggcttattccc ttggagatccgcttcctgcacaacccagacgctgcccaaggctttgtgggctgcgcactc agctccaccatccagcgcttctacaagaacgagggaggtacatggtcagtggagaaggtg atccaggtgccccccaagaaagtgaagggctggctgctgcccgaaatgccaggcctgatc accgacatcctgctctccctggacgaccgcttcctctacttcagcaactggctgcatggg 
gacctgaggcagtatgacatctctgacccacagagaccccgcctcacaggacagctcttc ctcggaggcagcattgttaagggaggccctgtgcaagtgctggaggacgaggaactaaag tcccagccagagcccctagtggtcaagggaaaacgggtggctggaggccctcagatgatc cagctcagcctggatgggaagcgcctctacatcaccacgtcgctgtacagtgcctgggac aagcagttttaccctgatctcatcagggaaggctctgtgatgctgcaggttgatgtagac acagtaaaaggagggctgaagttgaaccccaacttcctggtggacttcgggaaggagccc cttggcccagcccttgcccatgagctccgctaccctgggggcgattgtagctctgacatc tggatttga >gi568815597r:151192855_151416481|GENSCAN_predicted_peptide_5|264_aa MEAFLGSRSGLWAGGPAPGQFYRIPSTPDSFMDPASALYRGPITRTQNPMVTGTSVLGVK FEGGVVIAADMLGSYGSLARFRNISRIMRVNNSTMLGASGDYADFQYLKQVLGQMVIDEE LLGDGHSYSPRAIHSWLTRAMYSRRSKMNPLWNTMVIGGYADGESFLGYVDMLGVAYEAP SLATGYGAYLAQPLLREVLEKQPVLSQTEARDLVERCMRVLYYRDARSYNRFQIATVTEK GVEIEGPLSTETNWDIAHMISGFE >gi568815597r:151192855_151416481|GENSCAN_predicted_CDS_5|795_bp atggaagcgtttttggggtcgcggtccggactttgggcggggggtccggccccaggacag ttttaccgcattccgtccactcccgattccttcatggatccggcgtctgcactttacaga ggtccaatcacgcggacccagaaccccatggtgaccgggacctcagtcctcggcgttaag ttcgagggcggagtggtgattgccgcagacatgctgggatcctacggctccttggctcgt ttccgcaacatctctcgcattatgcgagtcaacaacagtaccatgctgggtgcctctggc gactacgctgatttccagtatttgaagcaagttctcggccagatggtgattgatgaggag cttctgggagatggacacagctatagtcctagagctattcattcatggctgaccagggcc atgtacagccggcgctcgaagatgaaccctttgtggaacaccatggtcatcggaggctat gctgatggagagagcttcctcggttatgtggacatgcttggtgtagcctatgaagcccct tcgctggccactggttatggtgcatacttggctcagcctctgctgcgagaagttctggag aagcagccagtgctaagccagaccgaggcccgcgacttagtagaacgctgcatgcgagtg ctgtactaccgagatgcccgttcttacaaccggtttcaaatcgccactgtcaccgaaaaa ggtgttgaaatagagggaccattgtctacagagaccaactgggatattgcccacatgatc agtggctttgaatga >gi568815597r:151192855_151416481|GENSCAN_predicted_peptide_6|851_aa XKCKICEWAFESEPLFLQHMKDTHKPGEMPYVCQVCQYRSSLYSEVDVHFRMIHEDTRHL LCPYCLKVFKNGNAFQQHYMRHQKRNVYHCNKCRLQFLFAKDKIEHKLQHHKTFRKPKQL EGLKPGTKVTIRASRGQPRTVPVSSNDTPPSALQEAAPLTSSMDPLPVFLYPPVQRSIQK RAVRKMSVMGRQTCLECSFEIPDFPNHFPTYVHCSLCRYSTCCSRAYANHMINNHVPRKS 
PKYLALFKNSVSGIKLACTSCTFVTSVGDAMAKHLVFNPSHRSSSILPRGLTWIAHSRHG QTRDRVHDRNVKNMYPPPSFPTNKAATVKSAGATPAEPEELLTPLAPALPSPASTATPPP TPTHPQALALPPLATEGAECLNVDDQDEGSPVTQEPELASGGGGSGGVGKKEQLSVKKLR VVLFALCCNTEQAAEHFRNPQRRIRRWLRRFQASQGENLEGKYLSFEAEEKLAEWVLTQR EQQLPVNEETLFQKATKIGRSLEGGFKISYEWAVRFMLRHHLTPHARRAVAHTLPKDVAE NAGLFIDFVQRQIHNQDLPLSMIVAIDEISLFLDTEVLSSDDRKENALQTVGTGEPWCDV VLAILADGTVLPTLVFYRGQMDQPANMPDSILLEAKESGYSDDEIMELWSTRVWQKHTAC QRSKGMLVMDCHRTHLSEEVLAMLSASSTLPAVVPAGCSSKIQPLDVCIKRTVKNFLHKK WKEQAREMADTACDSDVLLQLVLVWLGEVLGVIGDCPELVQRSFLVASVLPGPDGNINSP TRNADMQEELIASLEEQLKLSGEHSESSTPRPRSSPEETIEPESLHQLFEGESETESFYG FEEADLDLMEI >gi568815597r:151192855_151416481|GENSCAN_predicted_CDS_6|2556_bp nccaagtgcaagatctgtgaatgggcgtttgaaagtgagccactatttctccagcatatg aaggatactcataagcctggagagatgccttatgtttgccaggtgtgtcaatatcgctcc tcactctactctgaggtagatgtccattttcggatgatccatgaggatacccggcatctg ctctgcccttattgcctgaaggtcttcaaaaatggcaatgcattccaacagcattacatg aggcaccagaagagaaatgtttatcactgcaacaaatgccggctgcagtttctctttgcc aaggacaaaattgaacacaagcttcaacaccataaaaccttccgtaaacccaagcagctg gagggcttgaaaccaggcaccaaggtgacaatccgggcttcccgagggcagccacgaact gttcctgtatcctctaatgatacacctcccagcgccttgcaggaggcagcaccgctgacc tcctcaatggaccctctgcctgtcttcctttatccccctgtccagcgcagcatccagaag agagctgttaggaaaatgagtgtcatgggccggcagacatgcctggagtgcagcttcgag atcccagacttccctaatcatttccctacttacgtacactgctctctgtgtcgctatagc acctgctgttctcgagcttatgccaaccacatgatcaacaatcatgttccacggaagagc cccaagtatttggctttgtttaaaaattctgtgagtggaatcaagctggcctgcacttca tgtacctttgttacctctgtgggcgatgctatggccaagcatttggtattcaacccctct cacagatccagcagcatcctgccacggggactcacttggatagctcactcaaggcatggc cagactcgtgaccgagtgcatgaccggaacgtgaagaatatgtaccctcctccttccttc cccactaacaaagctgccactgtgaaatctgcgggggccaccccagctgagcctgaagag ctactaactcccttagccccagcactcccatcaccagcctcaactgcaaccccaccacca acccccactcacccgcaggctttagcccttccaccgctggctacagagggagccgaatgt ctgaatgttgatgatcaggatgaagggagcccagtcacccaagaacctgagctagcatca ggtggtggtggtagtggtggagttggcaaaaaggagcagctgtctgtgaagaagcttcga 
gtagtactgtttgctctatgctgcaatacagaacaggcagctgaacacttccgaaatccc cagcgacgtattcgccgttggcttcgacgtttccaggcctcccagggggagaatctagag ggcaaatatctgagctttgaggcagaagagaaactggctgagtgggtgctaacccagcgc gaacaacagctacctgtaaatgaggagaccttgttccagaaggccaccaaaataggacgt tctttggaaggggggtttaagatctcctatgagtgggctgtgcgtttcatgctgcggcac cacctgactccccatgcccggcgagctgtggcccacaccctacctaaggatgtagcagag aatgcaggactcttcattgattttgtacaacggcagattcacaaccaggacttacccttg tctatgattgtggctattgatgagatctctttgttcctggatacagaggtgctgagcagt gatgatcgaaaggagaatgccctgcagacagtgggcacaggggaaccttggtgtgatgta gtcctagccattctggcagatggcactgtccttcccaccctggttttctacagagggcag atggatcagcctgctaacatgccagactccatattgctagaggcaaaggagagtggctac agtgatgacgagatcatggagctgtggtcaactcgagtgtggcagaagcacacagcttgc cagcgcagcaaaggcatgcttgtgatggactgtcatcgcactcacttgtcagaagaggta ctggctatgcttagtgcctctagcactttgcctgcagtggtcccagcaggctgtagctcc aaaattcagccattagatgtatgcatcaaaagaactgtcaagaacttcctgcataaaaaa tggaaggaacaggctcgggaaatggcagatactgcatgtgattctgatgtcctgcttcag ctggtgcttgtctggctgggtgaagtgctaggtgtcattggggactgtccagagctagtt cagcgctccttcctggtggctagtgttctgcctggccccgatggcaacattaactcacct acaagaaatgctgacatgcaggaggagctaattgcctccctagaggagcaactgaagctg agtggggaacattctgagtcttccactccacgacccagatcatctcctgaagagacaatt gagcctgaaagtcttcaccagctctttgagggtgaaagtgagaccgagtctttctatggc tttgaagaagctgacctagatctgatggagatttga