GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:31:43 Sequence gi568815579r:40322839_40523546 : 200708 bp : 51.61% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 1238 1168 71 0 2 119 87 47 0.899 7.09 1.07 Intr - 3648 3496 153 1 0 91 79 243 0.867 24.26 1.06 Intr - 5712 5575 138 1 0 114 81 133 0.973 16.14 1.05 Intr - 11091 11013 79 0 1 109 89 51 0.967 6.92 1.04 Intr - 13386 13272 115 2 1 106 94 171 0.996 20.45 1.03 Intr - 13569 13482 88 1 1 59 66 153 0.999 9.73 1.02 Intr - 19052 19001 52 2 1 147 92 6 0.992 5.87 1.01 Init - 19229 19167 63 2 0 86 89 79 0.857 6.99 1.00 Prom - 24168 24129 40 -2.31 2.00 Prom + 26322 26361 40 -2.41 2.01 Init + 28878 28939 62 2 2 82 90 33 0.062 3.85 2.02 Term + 39332 39350 19 2 1 105 41 31 0.017 -1.83 2.03 PlyA + 40393 40398 6 1.05 3.00 Prom + 42153 42192 40 -2.01 3.01 Init + 43646 43672 27 1 0 86 92 33 0.982 3.12 3.02 Intr + 43772 43846 75 1 0 99 97 114 0.999 13.61 3.03 Intr + 43935 44077 143 2 2 144 91 157 0.997 21.26 3.04 Intr + 44858 45041 184 2 1 48 82 274 0.996 23.11 3.05 Intr + 47070 47190 121 2 1 116 89 208 0.987 24.27 3.06 Intr + 47272 47399 128 2 2 123 62 206 0.999 22.40 3.07 Intr + 48835 49035 201 0 0 82 101 310 0.994 31.70 3.08 Intr + 51643 51782 140 0 2 86 84 174 0.999 16.57 3.09 Intr + 53771 53936 166 2 1 88 103 290 0.994 30.98 3.10 Intr + 54948 55047 100 2 1 104 109 175 0.983 21.38 3.11 Term + 55148 55335 188 0 2 106 54 277 0.999 24.07 3.12 PlyA + 55630 55635 6 -3.24 4.05 PlyA - 56451 56446 6 -4.04 4.04 Term - 56931 56749 183 0 0 133 45 91 0.974 7.16 4.03 Intr - 58330 57485 846 1 0 92 83 1268 0.696 118.94 4.02 Intr - 61301 60945 357 2 0 124 116 699 0.936 72.31 4.01 Init - 67064 66600 465 2 0 97 113 1027 0.986 101.78 4.00 Prom - 68914 68875 40 -3.71 5.10 PlyA - 70949 70944 6 1.05 5.09 Term - 75132 71128 4005 0 0 129 48 3077 0.844 294.43 5.08 Intr - 75978 75782 197 1 2 90 80 406 0.986 39.65 5.07 Intr - 81024 80868 157 0 1 75 86 213 0.939 19.90 5.06 Intr - 85193 85068 126 2 0 119 105 106 0.968 16.78 5.05 Intr - 85418 85320 99 2 0 103 81 -10 0.457 0.61 5.04 Intr - 85545 85502 44 1 2 60 99 34 0.758 0.15 5.03 Intr - 88701 88566 136 0 1 58 82 80 0.067 5.05 5.02 Intr - 100708 100043 666 1 0 115 56 720 0.011 64.06 5.01 Init - 111297 111217 81 0 0 66 32 107 0.596 1.82 5.00 Prom - 117775 117736 40 -4.51 6.04 PlyA - 118027 118022 6 1.05 6.03 Term - 119248 118652 597 1 0 132 41 404 0.121 35.04 6.02 Intr - 120449 120358 92 0 2 -8 56 81 0.067 -4.49 6.01 Init - 121596 121377 220 0 1 54 77 144 0.109 6.68 6.00 Prom - 124400 124361 40 -6.70 7.06 PlyA - 124991 124986 6 1.05 7.05 Term - 125208 125051 158 1 2 86 42 215 0.997 15.11 7.04 Intr - 128654 128526 129 0 0 102 87 259 0.999 28.37 7.03 Intr - 135406 135317 90 2 0 83 63 183 0.932 15.76 7.02 Intr - 135707 135543 165 0 0 132 -52 240 0.677 14.85 7.01 Init - 139174 139120 55 1 1 72 76 44 0.266 3.04 7.00 Prom - 140470 140431 40 -3.41 8.00 Prom + 142495 142534 40 -4.71 8.01 Init + 142747 142797 51 0 0 71 77 101 0.206 6.41 8.02 Intr + 149769 149952 184 2 1 85 82 202 0.505 19.18 8.03 Intr + 164859 165010 152 1 2 38 81 310 0.743 25.69 8.04 Intr + 167237 167410 174 1 0 80 60 393 0.999 36.35 8.05 Intr + 170125 170216 92 0 2 88 94 61 0.991 5.99 8.06 Intr + 171135 171194 60 0 0 118 23 84 0.901 3.24 8.07 Intr + 172059 172139 81 0 0 100 81 110 0.998 10.75 8.08 Intr + 174651 174766 116 0 2 99 105 258 0.999 29.19 8.09 Intr + 179083 179195 113 2 2 109 70 194 0.989 20.30 8.10 Intr + 179290 179477 188 0 2 108 61 350 0.998 33.31 8.11 Intr + 179552 179669 118 2 1 87 100 122 0.999 14.27 8.12 Intr + 179937 180095 159 2 0 111 87 165 0.828 19.40 8.13 Intr + 180992 181294 303 1 0 49 55 517 0.996 41.83 8.14 Intr + 183398 183548 151 1 1 71 96 245 0.993 23.85 8.15 Intr + 189768 190716 949 1 1 69 105 1396 0.992 130.28 8.16 Intr + 192473 192610 138 2 0 76 91 180 0.998 17.19 8.17 Intr + 196563 197313 751 0 1 117 49 1285 0.339 119.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 119242 118652 591 1 0 100 41 398 0.860 32.87 S.002 Init + 120346 120423 78 0 0 74 86 20 0.860 1.42 S.003 Intr + 120840 120940 101 2 2 89 43 96 0.822 4.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:40322839_40523546|GENSCAN_predicted_peptide_1|253_aa MVVAALSQADFLPAAPGYSWAAPGSSVRARETMVSVTMATSEWIQFFKEAGIPPGPAVNY AVMFVDNRIQKSMLLDLNKEIMNELGVTVVGDIIAILKHAKVVHRQDMCKAATESVPCSP SPLAGEIRRGTSAASRMITNSLNHDSPPSTPPRRPDTSTSKISVTVSNKMAAKSAKATAA LARREEESLAVPAKRRRVTAEMEGKYVINMPKGTTPRTRKILEQQQAAKGLHRTSVFDRL GAETKADTTTGSK >gi568815579r:40322839_40523546|GENSCAN_predicted_CDS_1|759_bp atggtggtggcagctctgtcccaggctgacttcctgccagctgctccaggctattcctgg gcggctccagggagcagtgtcagggccagggagacgatggtctccgtgactatggccact tccgagtggatccagttctttaaggaagccggcattcctccaggacctgccgtcaattat gccgtgatgtttgtggataataggattcagaagagcatgctgctggatctcaataaggag ataatgaatgagctgggcgtgaccgtggtgggtgacatcatcgccattctcaagcatgcc aaagtggtgcaccgtcaggacatgtgcaaagctgccactgagtcagtaccctgcagccct agcccccttgcaggcgaaattcgccgtggcaccagtgctgcctcccgaatgatcaccaac agcctgaaccatgactctccacccagcacaccccccaggcgcccggacaccagcacctcc aagatctcggtcactgtgtccaacaagatggcagcaaagagtgccaaggccactgcagcc ctggcccgccgggaggaggagagcctggctgttcctgccaagcggcgccgggtcactgct gagatggaggggaagtacgtcatcaacatgcccaaaggcaccacaccccgcacccgcaag atcctggagcagcagcaggctgcaaaaggtctccataggacgtctgtgtttgaccgcctc ggcgccgagaccaaggcagacaccacgacagggagtaaa >gi568815579r:40322839_40523546|GENSCAN_predicted_peptide_2|26_aa MGHIVEGLVSHAEGWLLLPVSCIPSA >gi568815579r:40322839_40523546|GENSCAN_predicted_CDS_2|81_bp atggggcacatcgtggaaggccttgtgagccatgcagaaggctggcttttactcccagtg agctgtatccccagtgcctaa >gi568815579r:40322839_40523546|GENSCAN_predicted_peptide_3|490_aa MKPKLMYQELKVPAEEPANELPMNEIEAWKAAEKKARWVLLVLILAVVGFGALMTQLFLW EYGDLHLFGPNQRPAPCYDPCEAVLVESIPEGLDFPNASTGNPSTSQAWLGLLAGAHSSL DIASFYWTLTNNDTHTQEPSAQQGEEVLRQLQTLAPKGVNVRIAVSKPSGPQPQADLQAL LQSGAQVRMVDMQKLTHGVLHTKFWVVDQTHFYLGSANMDWRSLTQVKELGVVMYNCSCL ARDLTKIFEAYWFLGQAGSSIPSTWPRFYDTRYNQETPMEICLNGTPALAYLASAPPPLC PSGRTPDLKALLNVVDNARSFIYVAVMNYLPTLEFSHPHRFWPAIDDGLRRATYERGVKV RLLISCWGHSEPSMRAFLLSLAALRDNHTHSDIQVKLFVVPADEAQARIPYARVNHNKYM VTERATYIGTSNWSGNYFTETAGTSLLVTQNGRGGLRSQLEAIFLRDWDSPYSHDLDTSA DSVGNACRLL >gi568815579r:40322839_40523546|GENSCAN_predicted_CDS_3|1473_bp atgaagcctaaactgatgtaccaggagctgaaggtgcctgcagaggagcccgccaatgag ctgcccatgaatgagattgaggcgtggaaggctgcggaaaagaaagcccgctgggtcctg ctggtcctcattctggcggttgtgggcttcggagccctgatgactcagctgtttctatgg gaatacggcgacttgcatctctttgggcccaaccagcgcccagccccctgctatgaccct tgcgaagcagtgctggtggaaagcattcctgagggcctggacttccccaatgcctccacg gggaacccttccaccagccaggcctggctgggcctgctcgccggtgcgcacagcagcctg gacatcgcctccttctactggaccctcaccaacaatgacacccacacgcaggagccctct gcccagcagggtgaggaggtcctccggcagctgcagaccctggcaccaaagggcgtgaac gtccgcatcgctgtgagcaagcccagcgggccccagccacaggcggacctgcaggctctg ctgcagagcggtgcccaggtccgcatggtggacatgcagaagctgacccatggcgtcctg cataccaagttctgggtggtggaccagacccacttctacctgggcagtgccaacatggac tggcgttcactgacccaggtcaaggagctgggcgtggtcatgtacaactgcagctgcctg gctcgagacctgaccaagatctttgaggcctactggttcctgggccaggcaggcagctcc atcccatcaacttggccccggttctatgacacccgctacaaccaagagacaccaatggag atctgcctcaatggaacccctgctctggcctacctggcgagtgcgcccccacccctgtgt ccaagtggccgcactccagacctgaaggctctactcaacgtggtggacaatgcccggagt ttcatctacgtcgctgtcatgaactacctgcccactctggagttctcccaccctcacagg ttctggcctgccattgacgatgggctgcggcgggccacctacgagcgtggcgtcaaggtg cgcctgctcatcagctgctggggacactcggagccatccatgcgggccttcctgctctct ctggctgccctgcgtgacaaccatacccactctgacatccaggtgaaactctttgtggtc cccgcggatgaggcccaggctcgaatcccatatgcccgtgtcaaccacaacaagtacatg gtgactgaacgcgccacctacatcggaacctccaactggtctggcaactacttcacggag acggcgggcacctcgctgctggtgacgcagaatgggaggggcggcctgcggagccagctg gaggccattttcctgagggactgggactccccttacagccatgaccttgacacctcagct gacagcgtgggcaacgcctgccgcctgctctga >gi568815579r:40322839_40523546|GENSCAN_predicted_peptide_4|616_aa MSTIQSETDCYDIIEVLGKGTFGEVAKGWRRSTGEMVAIKILKNDAYRNRIIKNELKLLH CMRGLDPEEAHVIRFLEFFHDALKFYLVFELLEQNLFEFQKENNFAPLPARHIRTVTLQV LTALARLKELAIIHADLKPENIMLVDQTRCPFRVKVIDFGSASIFSEVRYVKEPYIQSRF YRAPEILLGLPFCEKVDVWSLGCVMAELHLGWPLYPGNNEYDQVRYICETQGLPKPHLLH AACKAHHFFKRNPHPDAANPWQLKSSADYLAETKVRPLERRKYMLKSLDQIETVNGGSVA SRLTFPDREALAEHADLKSMVELIKRMLTWESHERISPSAALRHPFVSMQQLRSAHETTH YYQLSLRSYRLSLQVEGKPPTPVVAAEDGTPYYCLAEEKEAAGMGSVAGSSPFFREEKAP GMQRAIDQLDDLSLQEAGHGLWGETCTNAVSDMMVPLKAAITGHHVPDSGPEPILAFYSS RLAGRHKARKPPAGSKSDSNFSNLIRLSQVSPEDDRPCRGSSWEEGEHLGASAEPLAILQ RDEDGPNIDNMTMEAERPDPELFDPSSCPGEWLSEPDCTLESVRGPRAQGLPPRRSHQHG PPRGATSFLQHVTGHH >gi568815579r:40322839_40523546|GENSCAN_predicted_CDS_4|1851_bp atgtccaccatccagtcggagactgactgctacgacatcatcgaggtcttgggcaagggg accttcggggaggtagccaagggctggcggcggagcacgggcgagatggtggccatcaag atcctcaagaatgacgcctaccgcaaccgcatcatcaagaacgagctgaagctgctgcac tgcatgcgaggcctagaccctgaagaggcccacgtcatccgcttccttgagttcttccat gacgccctcaagttctacctggtctttgagctgctggagcaaaaccttttcgagttccag aaggagaacaacttcgcgcccctccccgcccgccacatccgtacagtcaccctgcaggtg ctcacagccctggcccggctcaaggagctggctatcatccacgctgatctcaagcctgag aacatcatgctggtggaccagacccgctgccccttcagggtcaaggtgattgacttcgga tccgccagcattttcagcgaggtgcgctacgtgaaggagccatacatccagtcgcgcttc taccgggcccctgagatcctgctggggctgcccttctgcgagaaggtggacgtgtggtcc ctgggctgcgtcatggctgagctgcacctgggctggcctctctaccccggcaacaacgag tacgaccaggtgcgctacatctgcgaaacccagggcctgcccaagccacacctgttgcac gccgcctgcaaggcccaccacttcttcaagcgcaacccccaccctgacgctgccaacccc tggcagctcaagtcctcggctgactacctggccgagacgaaggtgcgcccattggagcgc cgcaagtatatgctcaagtcgttggaccagattgagacagtgaatggtggcagtgtggcc agtcggctaaccttccctgaccgggaggcgctggcggagcacgccgacctcaagagcatg gtggagctgatcaagcgcatgctgacctgggagtcacacgaacgcatcagccccagtgct gccctgcgccaccccttcgtgtccatgcagcagctgcgcagtgcccacgagaccacccac tactaccagctctcgctgcgcagctaccgcctctcgctgcaagtggaggggaagcccccc acgcccgtcgtggccgcagaagatgggaccccctactactgtctggctgaggagaaggag gctgcgggtatgggcagtgtggccggcagcagccccttcttccgagaggagaaggcacca ggtatgcaaagagccatcgaccagctggatgacctgagtctgcaggaggctgggcatggg ctgtggggtgagacctgcaccaatgcggtctccgacatgatggtccccctcaaggcagcc atcactggccaccatgtgcccgactcgggccctgagcccatcctggccttctacagcagc cgcctggcaggccgccacaaggcccgcaagccacctgcgggttccaagtccgactccaac ttcagcaacctcattcggctgagccaggtctcgcctgaggatgacaggccctgccggggc agcagctgggaggaaggagagcatctcggggcctctgctgagccactggccatcctgcag cgagatgaggatgggcccaacattgacaacatgaccatggaagctgagaggccagaccct gagctcttcgaccccagcagctgtcctggagaatggctgagtgagccagactgcaccctg gagagcgtcaggggcccacgggctcaggggctcccaccccgccgctcccaccagcatggt ccaccccggggggccaccagcttcctccagcatgtcaccgggcaccactga >gi568815579r:40322839_40523546|GENSCAN_predicted_peptide_5|1836_aa MGPRPALGVAGRPSPSPLPTLRPGPAGMLSKGLKRKREEEEEKEPLAVDSWWLDPGHTAV AQAPPAVASSSLFDLSVLKLHHSLQQSEPDLRHLVLVVNTLRRIQASMAPAAALPPVPSP PAAPSVADNLLASSDAALSASMASLLEDLSHIEGLSQAPQPLADEGPPGRSIGGAAPSLG ALDLLGPATGCLLDDGLEGLFEDIDTSMYDNELWAPASEGLKPGPEDGPGKEEAPELDEA ELDYLMDVLAPGMTLTATGLGQRADDPGDPLYYHLQGSGFSCPQLRTLCCVSVLERRTQE AQGAGGDPQAARTPRKGVSPADSCAAPRAGLLLGGSACTQAAAQRQLLHAELKLVLQQKG ERTQEPGVQVPPSNAMEARSRSAEELRRAELVEIIVETEAQTGVSGINVAGGGKEGIFVR ELREDSPAARSLSLQEGDQLLSARVFFENFKYEDALRLLQCAEPYKVSFCLKRTVPTGDL ALRPGTVSGYEIKGPRAKVAKLNIQSLSPVKKKKMVPGALGVPADLAPVDVEFSFPKFSR LRRGLKAEAVKGPVPAAPARRRLQLPRLRVREVAEEAQAARLAAAAPPPRKAKVEAEVAA GARFTAPQVELVGPRLPGAEVGVPQVSAPKAAPSAEAAGGFALHLPTLGLGAPAPPAVEA PAVGIQVPQVELPALPSLPTLPTLPCLETREGAVSVVVPTLDVAAPTVGVDLALPGAEVE ARGEAPEVALKMPRLSFPRFGARAKEVAEAKVAKVSPEARVKGPRLRMPTFGLSLLEPRP AAPEVVESKLKLPTIKMPSLGIGVSGPEVKVPKGPEVKLPKAPEVKLPKVPEAALPEVRL PEVELPKVSEMKLPKVPEMAVPEVRLPEVELPKVSEMKLPKVPEMAVPEVRLPEVQLLKV SEMKLPKVPEMAVPEVRLPEVQLPKVSEMKLPEVSEVAVPEVRLPEVQLPKVPEMKVPEM KLPKVPEMKLPEMKLPEVQLPKVPEMAVPDVHLPEVQLPKVPEMKLPEMKLPEVKLPKVP EMAVPDVHLPEVQLPKVPEMKLPKMPEMAVPEVRLPEVQLPKVSEMKLPKVPEMAVPDVH LPEVQLPKVCEMKVPDMKLPEIKLPKVPEMAVPDVHLPEVQLPKVSEIRLPEMQVPKVPD VHLPKAPEVKLPRAPEVQLKATKAEQAEGMEFGFKMPKMTMPKLGRAESPSRGKPGEAGA EVSGKLVTLPCLQPEVDGEAHVGVPSLTLPSVELDLPGALGLQGQVPAAKMGKGERVEGP EVAAGVREVGFRVPSVEIVTPQLPAVEIEEGRLEMIETKVKPSSKFSLPKFGLSGPKVAK AEAEGAGRATKLKVSKFAISLPKARVGAEAEAKGAGEAGLLPALDLSIPQLSLDAHLPSG KVEVAGADLKFKGPRFALPKFGVRGRDTEAAELVPGVAELEGKGWGWDGRVKMPKLKMPS FGLARGKEAEVQGDRASPGEKAESTAVQLKIPEVELVTLGAQEEGRAEGAVAVSGMQLSG LKVSTAGQVVTEGHDAGLRMPPLGISLPQVELTGFGEAGTPGQQAQSTVPSAEGTAGYRV QVPQVTLSLPGAQVAGGELLVGEGVFKMPTVTVPQLELDVGLSREAQAGEAATGEGGLRL KLPTLGARARVGGEGAEEQPPGAERTFCLSLPDVELSPSGGNHAEYQVAEGEGEAGHKLK VRLPRFGLVRAKEGAEEGEKAKSPKLRLPRVGFSQSEMVTGEGSPSPEEEEEEEEEGSGE GASGRRGRVRVRLPRVGLAAPSKASRGQEGDAAPKSPVREKSPKFRFPRVSLSPKARSGS GDQEEGGLRVRLPSVGFSETGAPGPARMEGAQAAAV >gi568815579r:40322839_40523546|GENSCAN_predicted_CDS_5|5511_bp atgggcccgcggcccgccctgggcgtggcgggaaggcccagtccctccccgctgcccacc ctccgccctgggccggccgggatgctgagcaagggtctgaagcggaaacgggaggaggag gaggagaaggaacctctggcagtcgactcctggtggctagatcctggccacacagcggtg gcacaggcacccccggccgtggcctctagctccctctttgacctctcagtgctcaagctc caccacagcctgcagcagagtgagccggacctgcggcacctggtgctggtcgtgaacact ctgcggcgcatccaggcgtccatggcacccgcggctgccctgccacctgtgcctagccca cctgcagcccccagtgtggctgacaacttactggcaagctcggacgctgccctttcagcc tccatggccagcctcctggaggacctcagccacattgagggcctgagtcaggctccccaa cccttggcagacgaggggccaccaggccgtagcatcgggggagcagcgcccagcctgggt gccttggacctgctgggcccagccactggctgtctactggacgatgggcttgagggcctg tttgaggatattgacacctctatgtatgacaatgaactttgggcaccagcctctgagggc ctcaaaccaggccctgaggatgggccgggcaaggaggaagctccggagctggacgaggcc gaattggactacctcatggatgtgctggctccgggcatgaccctcacagccacgggcctg ggacagagagctgatgacccaggagaccccctctactaccacctacaaggttcaggcttc tcgtgtccccagctcaggactctgtgctgtgtatcagtcctggagcgccggacccaggag gcccaaggagctggaggtgaccctcaggcagcaagaaccccacggaagggcgtgagccct gcagacagctgtgcggcacctcgggctgggctcctgttaggaggaagtgcctgcacccag gcagcggctcagaggcagctgctccatgcagaactgaagctggttctgcagcagaaaggg gagaggacacaggagcctggggtgcaggtgcctcccagcaacgccatggaggccaggagc cggagtgccgaggagctgaggcgggcggagttggtggaaattatcgtggagacggaggcg cagaccggggtcagcggcatcaacgtagcgggcggcggcaaagagggaatcttcgttcgg gagctgcgcgaggactcacccgccgccaggagcctcagcctgcaggaaggggaccagctg ctgagtgcccgagtgttcttcgagaacttcaagtacgaggacgcactacgcctgctgcaa tgcgccgagccttacaaagtctccttctgcctgaagcgcactgtgcccaccggggacctg gctctgcggcccgggaccgtgtctggctacgagatcaagggcccgcgggccaaggtggcc aagctgaacatccagagtctgtcccctgtgaagaagaagaagatggtgcctggggctctg ggggtccccgctgacctggcccctgttgacgtcgagttctcctttcccaagttctcccgc ctgcgtcggggcctcaaagccgaggctgtcaagggtcctgtcccggctgcccctgcccgc cggcgcctccagctgcctcggctgcgtgtacgagaagtggccgaagaggctcaggcagcc cggctggccgccgccgctcctccccccaggaaagccaaggtggaggctgaggtggctgca ggagctcgtttcacagcccctcaggtggagctggttgggccgcggctgccaggggcggag gtgggtgtcccccaggtctcagcccccaaggctgccccctcagcagaggcagctggtggc tttgccctccacctgccaacccttgggctcggagccccggctccgcctgctgtggaggcc ccagccgtgggaatccaggtcccccaggtggagctgcctgccttgccctcactgcccact ctgcccacacttccctgcctagagacccgggaaggggctgtgtcggtagtggtgcccacc ctggatgtggcagcaccgactgtgggggtggacctggccttgccgggtgcagaggtggag gcccggggagaggcacctgaggtggccctgaagatgccccgccttagttttccccgattt ggggctcgagcaaaggaagttgctgaggccaaggtagccaaggtcagccctgaggccagg gtgaaaggtcccagacttcgaatgcccacctttgggctttccctcttggagccccggccc gctgctcctgaagttgtagagagcaagctgaagctgcccaccatcaagatgccctccctt ggcatcggagtgtcagggcccgaggtcaaggtgcccaagggacctgaagtgaagctcccc aaggctcctgaggtcaagcttccaaaagtgcccgaggcagcccttccagaggttcgactc ccagaggtggagctccccaaggtgtcagagatgaaactcccaaaggtgccagagatggct gtgccggaggtgcggcttccagaggtagagctgcccaaagtgtcagagatgaaactccca aaggtgccagagatggctgtgccggaggtgcggcttccagaggtacagctgctgaaagtg tcggagatgaaactcccaaaggtgccagagatggctgtgccggaggtgcggcttccagag gtacagctgccgaaagtgtcagagatgaaactcccagaggtgtcagaggtggctgtgcca gaggtgcggcttccagaggtgcagctgccgaaagtgccagagatgaaagtccctgagatg aagcttccaaaggtgcctgagatgaaacttcctgagatgaaactccctgaagtgcaactc ccgaaggtgcccgagatggccgtgcccgatgtgcacctcccagaagtgcagcttccaaaa gtcccagagatgaagctccctgagatgaaactccctgaggtgaaactcccgaaggtgccc gagatggctgtgcccgatgtgcacctcccggaagtgcagctcccgaaagtcccagagatg aaactccctaaaatgcctgagatggctgtgccagaggttcgactccccgaggtgcagctg ccaaaagtctcagagatgaaactccccaaggtgcctgaaatggccgtgcccgatgtgcac ctcccagaggtgcagctgcccaaagtctgtgaaatgaaagtccctgacatgaagctccca gagataaaactccccaaggtgcctgagatggctgtgcccgatgtgcacctccccgaggtg cagctgccgaaagtgtcagagattcggctgccggaaatgcaagtgccgaaggttcccgac gtgcatcttccgaaggcaccagaggtgaagctgcccagggctccggaggtgcagctaaag gccaccaaggcagaacaggcagaagggatggaatttggcttcaagatgcccaagatgacc atgcccaagctagggagggcagagtccccatcacgtggcaagccaggcgaggcgggtgct gaggtctcagggaagctggtaacacttccctgtctgcagccagaggtggatggtgaggct catgtgggtgtcccctctctcactctgccttcagtggagctagacctgccaggagcactt ggcctgcaggggcaggtcccagccgctaaaatgggcaagggagagcgggtggagggccct gaggtggcagcaggggtcagggaagtgggcttccgagtgccctctgttgaaattgtcacc ccacagctgcccgccgtggaaattgaggaagggcggctggagatgatagagacaaaagtc aagccctcttccaagttctccttacctaagtttggactctcggggccaaaggtggctaag gcagaggctgagggggctgggcgagctaccaagctgaaggtatccaaatttgccatctca ctccccaaggctcgggtgggggctgaggctgaggccaaaggggctggggaggcaggcctg ctgcctgccctcgatctgtccatcccacagctcagcctggatgcccacctgccctcaggc aaggtagaggtggcaggggccgacctcaagttcaaggggcccaggtttgctctccccaag tttggggtcagaggccgggacactgaggcagcagaactagtgccaggggtggctgagttg gagggcaagggctggggctgggatgggagggtgaagatgcccaagctgaagatgccttcc tttgggctggctcgagggaaggaagcagaagttcaaggtgatcgtgccagcccgggggaa aaggctgagtccaccgctgtgcagcttaagatccccgaggtggagctggtcacgctgggc gcccaggaggaagggagggcagagggggctgtggccgtcagtggaatgcagctgtcaggc ctgaaggtgtccacagccgggcaggtggtcactgagggccatgacgcggggctgaggatg cctccgctgggcatctccctgccacaggtggagctgaccggctttggggaggcaggtacc ccagggcagcaggctcagagtacagtcccttcagcagagggcacagcaggctacagggtt caggtgccccaggtgaccctgtctctgcctggagcccaggttgcaggtggtgagctgctg gtgggtgagggtgtctttaagatgcccaccgtgacagtgccccagcttgagctggacgtg gggctaagccgagaggcacaggcgggcgaggcggccacaggcgagggtgggctgaggctg aagttgcccacactgggggccagagctagggtggggggcgagggtgctgaggagcagccc ccaggggccgagcgtaccttctgcctctcactgcccgacgtggagctctcgccatccggg ggcaaccatgccgagtaccaggtggcagagggggagggagaggccggacacaagctcaag gtacggctgccccggtttggcctggtgcgggccaaggagggggccgaggagggtgagaag gccaagagccccaaactcaggctgccccgagtgggcttcagccaaagtgagatggtcact ggggaagggtcccccagccccgaggaggaggaggaggaggaggaagagggcagtggggaa ggggcctcgggtcgccggggccgggtccgggtccgcttgccacgtgtaggcctggcggcc ccttctaaagcctctcgggggcaggagggcgatgcagcccccaagtcccccgtcagagag aagtcacccaagttccgcttccccagggtgtccctaagccccaaggcccggagtgggagt ggggaccaggaagagggtggattgcgggtgcggctgcccagcgtggggttttcagagaca ggggctccaggcccggccaggatggagggggctcaggctgcggctgtctga >gi568815579r:40322839_40523546|GENSCAN_predicted_peptide_6|302_aa MAPPSQSTPRGGRDSSQSRLAARLRGGAGGPEARRRGPAEVSEAPTGHWWVLALAGRNAV GVPGGGRGEPGSCVRARRQFVYLAAVGPRFPEAPGPNWGVPGCSGIMVGGLKRKHSDLEE EEERWEWSPAGLQSYQQALLRISLDKVQRSLGPRAPSLRRHVLIHNTLQQLQAALRLAPA PALPPEPLFLGEEDFSLSATIGSILRELDTSMDGTEPPQNPVTPLGLQNEVPPQPDPVFL EALSSRYLGDSGLDDFFLDIDTSAVEKEPARAPPEPPHNLFCAPGSWEWNELDHIMEIIL GS >gi568815579r:40322839_40523546|GENSCAN_predicted_CDS_6|909_bp atggctccgcccagccaatcgacgccccgcggcgggcgggactccagccaatcgcggctc gcggctcggctccggggcggggctggtgggccagaggccagacggagaggccccgccgag gtgagcgaggctccaaccggtcactggtgggtcctggcactggcgggtcgcaacgctgtg ggcgttccaggaggtggtcgtggcgaacctggcagctgcgtgcgtgctcgacgtcaattc gtgtacctggcggctgttggcccccgtttcccagaagcccctgggccaaattggggcgtt cctggatgctctggcatcatggtgggaggcttgaagaggaaacactctgatttggaagag gaggaggagaggtgggagtggagtccagcaggccttcagagctaccagcaagccctgctc cgcatctccctagacaaagtccagcgcagcctgggcccccgagcacccagcctccgcagg catgtcctcatccataacaccctccaacagctgcaggctgcacttcgcctggctcccgcc cctgccctgccccccgagcccctcttcctgggcgaggaggatttctccctgtcagccacc attggctctatcctcagggagctggacacctccatggatgggactgagccccctcagaat ccagtgactccccttggcctccagaatgaagtgccaccccagcctgatccagtcttctta gaagctctgagctcccggtacttgggggactctggcctggatgacttctttctggacatt gacacatctgcggtagaaaaggagcctgcacgggccccaccagagcctcctcacaacctc ttctgtgccccaggttcttgggagtggaatgaactggatcacatcatggaaatcattctg gggtcctaa >gi568815579r:40322839_40523546|GENSCAN_predicted_peptide_7|198_aa MAENMTQSLDPQGAWFAAGYEVTVLVRDSSRLPSEGPRPAHVVVGDVLQAADVDKTVAGQ DAVIVLLGTRNDLSPTTVMSEGARNIVAAMKAHGVDKVVACTSAFLLWDPTKVPPRLQAV TDDHIRMHKVLRESGLKYVAVMPPHIGDQPLTGAYTVTLDGRGPSRVISKHDLGHFMLRC LTTDEYDGHSTYPSHQYQ >gi568815579r:40322839_40523546|GENSCAN_predicted_CDS_7|597_bp atggcagaaaacatgacacagtccctggacccacagggtgcttggtttgctgcaggttac gaagtgacagtgctggtgcgggactcctccaggctgccatcagaggggccccggccggcc cacgtggtagtgggagatgttctgcaggcagccgatgtggacaagaccgtggctgggcag gacgctgtcatcgtgctgctgggcacccgcaatgacctcagtcccacgacagtgatgtcc gagggcgcccggaacattgtggcagccatgaaggctcatggtgtggacaaggtcgtggcc tgcacctcggctttcctgctctgggaccctaccaaggtgcccccacgactgcaggctgtg actgatgaccacatccggatgcacaaggtgctgcgggaatcaggcctgaagtacgtggct gtgatgccgccacacataggagaccagccactaactggggcgtacacagtgaccctggat ggacgagggccctcaagggtcatctccaaacatgacctgggccatttcatgctgcgctgc ctcaccaccgatgagtacgacggacacagcacctacccctcccaccagtaccagtag >gi568815579r:40322839_40523546|GENSCAN_predicted_peptide_8|1260_aa MPRPPRLMPACTACARVASPSPMAQVPGEVDNMEGLPAPNNNPAARWESPDRGWEREQPA ASTAAASLFECSRIKALADEREAVQKKTFTKWVNSHLARVGCHIGDLYVDLRDGFVLTRL LEVLSGEQLPRPTRGRMRIHSLENVDKALQFLKEQRVHLENVGSHDIVDGNHRLTLGLVW TIILRFQIQVIKIETEDNRETRSAKDALLLWCQMKTAGAYTGKFKVLGRFSVAELVAGYP EVNIQNFTTSWRDGLAFNALIHRHRPDLVDFSKLTKSNANYNLQRAFRTAEQHLGLARLL DPEDVNMEAPDEKSIITYVVSFYHYFSKMKALAVEGKRIGKVLDQVLEVGKIIERYEELA AELLAWIHRTVGLISNQKFANSLSGVQQQLQAFTAYCTLEKPVKFQEKGNLEVLLFSIQS KLRACNRRLFVPREGCGIWDIDKAWGELEKAEHEREAALRAELIRQEKLELLAQRFDHKV AMRESWLNENQRLVSQDNFGYELPAVEAAMKKHEAIEADIAAYEERVQGVAELAQALAAE GYYDIRRVAAQRDSVLRQWALLTGLVGARRTRLEQNLALQKVFQEMVYMVDWMEEMQAQL LSRECGQHLVEADDLLQKHGLLEGDIAAQSERVEALNAAALRFSQLQGYQPCDPQVICNR VNHVHGCLAELQEQAARRRAELEASRSLWALLQELEEAESWARDKERLLEAAGGGGAAGA AGAAGTAGGAHDLSSTARLLAQHKILQGELGGRRALLQQALRCGEELVAAGGAVGPGADT VHLVGLAERAASARRRWQRLEEAAARRERRLQEARALHQFGADLDGLLDWLRDAYRLAAA GDFGHDEASSRRLARQHRALTGEVEAHRGPVSGLRRQLATLGGASGAGPLVVALQVRVVE AEQLFAEVTEVAALRRQWLRDALAVYRMFGEVHACELWIGEKEQWLLSMRVPDSLDDVEV VQHRFESLDQEMNSLMGRVLDVNHTVQELVEGGHPSSDEVRSCQDHLNSRWNRIVELVEQ RKEEMSAVLLVENHVLEVAEVRAQVREKRRAVESAPRAGGALQWRLSGLEAALQALEPRQ AALLEEAALLAERFPAQAARLHQGAEELGAEWGALASAAQACGEAVAAAGRLQRFLHDLD AFLDWLVRAQEAAGGSEGPLPNSLEEADALLARHAALKEEVDQREEDYARIVAASEALLA ADGAELGPGLALDEWLPHLELGWHKLLGLWEARREALVQAHIYQLFLRDLRQALVVLRNQ >gi568815579r:40322839_40523546|GENSCAN_predicted_CDS_8|3780_bp atgccccgcccgccccggctcatgcctgcttgcaccgcctgcgccagggtggcctcacct tccccgatggcgcaggtaccaggggaagtggacaacatggagggcctgcctgctcctaac aacaaccctgctgcccgctgggagagtccggatcggggctgggagcgggagcagccggct gcgtccaccgcagcggcctcgctctttgagtgctcccggatcaaggccttggcagatgag cgggaagccgtgcagaagaaaaccttcaccaagtgggtgaactcgcacctcgcccgcgtg ggctgccacatcggggacctctatgtggacctccgggacggcttcgtgctcacgcggctc ctggaagtgctgtctggggagcagctgcccaggcccacgcgcggccgcatgcggatccac tcactggagaacgtggacaaggcgctgcagtttctgaaggagcagcgcgtgcacctggag aacgtgggttcgcatgacatcgtggatgggaatcaccggctgacgctggggctggtctgg accatcatcctgcgcttccagattcaagtcatcaaaattgagactgaggacaacagagag acacgctcagccaaggatgctctgctcttgtggtgtcagatgaagacagctggggcatac acaggaaagttcaaagtccttggtcgcttcagtgtggctgagctggtggcgggttaccct gaggtaaacatccagaatttcaccaccagctggcgggatggcttggccttcaatgccctc attcaccggcacaggcctgatctcgtggacttcagcaaactcaccaagtccaatgccaac tacaacctgcagagagccttccgcacagctgagcagcacctggggctggcgcggctgctg gatcctgaagatgtgaacatggaggctccagatgagaagtccatcatcacctacgtggtc tctttctaccactatttctccaagatgaaggctctggctgtggaggggaagcgtatcggg aaggtcttggaccaggtattggaggtggggaagatcatagaacgctacgaggagctggcg gctgagctgctggcctggatccaccgcaccgtgggcctcatcagcaatcagaaatttgcc aactccttaagtggggtgcagcagcaactccaggctttcacggcctattgcacgctggag aagcctgtcaagttccaggagaaggggaacctagaggtgctgctcttcagcatccagagc aaactgcgtgcctgcaaccgtcgcctctttgtgcctcgggagggctgtggcatctgggat attgacaaggcatggggtgagctggagaaggctgagcatgagcgggaggctgccctacgg gctgagctgattcggcaggagaagctggaactactggcacagaggtttgaccacaaggtg gctatgagggagagctggctgaatgagaaccagcgtctggtctcccaggacaactttggg tatgagctgcccgcagtggaggcagccatgaagaaacacgaagcgatcgaggcagacatt gcggcctacgaggagcgggtgcagggtgtggcggagctggcccaggcattggcagccgaa ggctactacgatatccggcgggtggcagcccagcgtgacagcgtcctgcgccagtgggcc ctgctaactgggcttgtgggtgcccggcggacacgacttgagcagaaccttgccctgcag aaggtcttccaggagatggtgtacatggtggactggatggaggagatgcaggctcagctg ctgtcccgggagtgtgggcagcacctggtggaggcagacgacctgttgcagaagcatgga ctgctggagggagacattgccgcccagagcgagcgggtggaggctctcaatgccgctgcc ctgcgcttctcccagctgcagggctaccagccctgcgacccgcaggtcatctgcaaccgc gtgaaccacgtgcacggctgcctggcggagctgcaggagcaggcagcgcggcgacgcgcg gagctggaggcttcgcggagcctgtgggcgctgctgcaggagctggaggaggccgagagc tgggcgcgcgacaaggagcgtctcctggaggctgcgggcggcggcggtgcggcgggcgca gcgggcgcagcgggaacagcgggcggcgcgcatgacctgtccagcacagcgcgcctcctg gcccagcacaagatcctgcagggcgagctgggcgggcggcgagcgttgctgcagcaggcc ctgcggtgtggcgaggagctggttgcggccggcggtgccgtcggcccgggagcagacacc gtgcacctggtaggcctggcggagcgcgcggcgagcgcccggcgccgctggcagaggctg gaagaggcggcggcgcggcgagagcggcggctgcaggaggcgcgggcgctgcaccagttc ggcgctgacctcgacgggctgctggactggcttcgcgacgcttaccgcctggcagccgcc ggtgacttcggccacgacgaagcttccagccgccgcctggcgcgccagcaccgcgcgctc accggggaggtggaggcacatcgcgggcccgtgagcggcctgcggcgccagctggcgaca ctcgggggtgccagtggcgcagggccactggtggtggcgctgcaggtgcgcgtggtggaa gcagagcagttgttcgctgaggtgaccgaagtggcggcgctgaggcgccagtggctgcgg gacgcgctcgctgtctaccgcatgtttggcgaggtgcacgcgtgtgagctgtggatcggc gagaaggagcaatggctgctctccatgcgtgtgccggattcactcgacgacgtcgaggtg gtgcagcaccgattcgagagcctggaccaagagatgaacagcctgatgggccgcgttctg gacgtgaaccacacagtccaggagctggtggaaggaggccaccccagttcagatgaggtg cgttcctgccaggaccacctcaacagcaggtggaaccgcatcgtggagctagtggaacag cgcaaagaggaaatgagcgcggtgctgctggtggagaaccacgtgctggaggtggccgag gtgcgcgcccaggtgcgtgagaagcggagagctgtggagagcgcgccccgggccggcggc gccctgcagtggcgtcttagcggcctagaggccgctctgcaggcgctggagccgcgccag gcggcccttctggaggaggcagccctgctggctgagcgcttcccggcgcaggcggcgcgg ctgcaccagggcgcggaggagctgggcgccgagtggggcgcgctagctagcgcggctcag gcctgcggcgaggcggtggcggcagcagggcgcctgcagcgcttcctacatgacctcgac gctttcctggactggctcgtgcgcgcccaggaggcggcgggcggcagcgaggggcccctg cccaacagcctagaagaggcggacgcgctgctggcgcgccacgctgcgctcaaggaggag gtggaccagcgcgaggaagactatgctcgcatcgtggcggccagcgaggcgctgctggcc gccgacggcgcagagctgggcccgggcctggcactagacgagtggctgccacacctcgaa cttggctggcataaactgctcggcttgtgggaggcgcgcagggaggcgctggtccaggcg cacatctaccagctcttcctgcgggatctacgccaggcgctcgtggtgctgcgtaaccag