GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:36:57 Sequence gi568815579r:40279590_40489902 : 210313 bp : 51.09% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 5389 5853 465 0 0 44 45 357 0.998 21.50 1.02 PlyA + 10661 10666 6 1.05 2.10 PlyA - 11602 11597 6 1.05 2.09 Term - 42787 42293 495 1 0 111 41 531 0.991 45.66 2.08 Intr - 44487 44417 71 1 2 119 87 47 0.998 7.09 2.07 Intr - 46897 46745 153 2 0 91 79 243 0.867 24.26 2.06 Intr - 48961 48824 138 2 0 114 81 133 0.973 16.14 2.05 Intr - 54340 54262 79 1 1 109 89 51 0.967 6.92 2.04 Intr - 56635 56521 115 0 1 106 94 171 0.996 20.45 2.03 Intr - 56818 56731 88 2 1 59 66 153 0.999 9.73 2.02 Intr - 62301 62250 52 0 1 147 92 6 0.992 5.87 2.01 Init - 62478 62416 63 0 0 86 89 79 0.857 6.99 2.00 Prom - 67417 67378 40 -2.31 3.00 Prom + 69571 69610 40 -2.41 3.01 Init + 72127 72188 62 0 2 82 90 33 0.062 3.85 3.02 Term + 82581 82599 19 0 1 105 41 31 0.017 -1.83 3.03 PlyA + 83642 83647 6 1.05 4.00 Prom + 85402 85441 40 -2.01 4.01 Init + 86895 86921 27 2 0 86 92 33 0.982 3.12 4.02 Intr + 87021 87095 75 2 0 99 97 114 0.999 13.61 4.03 Intr + 87184 87326 143 0 2 144 91 157 0.997 21.26 4.04 Intr + 88107 88290 184 0 1 48 82 274 0.996 23.11 4.05 Intr + 90319 90439 121 0 1 116 89 208 0.987 24.27 4.06 Intr + 90521 90648 128 0 2 123 62 206 0.999 22.40 4.07 Intr + 92084 92284 201 1 0 82 101 310 0.994 31.70 4.08 Intr + 94892 95031 140 1 2 86 84 174 0.999 16.57 4.09 Intr + 97020 97185 166 0 1 88 103 290 0.994 30.98 4.10 Intr + 98197 98296 100 0 1 104 109 175 0.983 21.38 4.11 Term + 98397 98584 188 1 2 106 54 277 0.999 24.07 4.12 PlyA + 98879 98884 6 -3.24 5.05 PlyA - 99700 99695 6 -4.04 5.04 Term - 100180 99998 183 1 0 133 45 91 0.974 7.16 5.03 Intr - 101579 100734 846 2 0 92 83 1268 0.696 118.94 5.02 Intr - 104550 104194 357 0 0 124 116 699 0.936 72.31 5.01 Init - 110313 109849 465 0 0 97 113 1027 0.986 101.78 5.00 Prom - 112163 112124 40 -3.71 6.10 PlyA - 114198 114193 6 1.05 6.09 Term - 118381 114377 4005 1 0 129 48 3077 0.844 294.43 6.08 Intr - 119227 119031 197 2 2 90 80 406 0.986 39.65 6.07 Intr - 124273 124117 157 1 1 75 86 213 0.939 19.90 6.06 Intr - 128442 128317 126 0 0 119 105 106 0.968 16.78 6.05 Intr - 128667 128569 99 0 0 103 81 -10 0.457 0.61 6.04 Intr - 128794 128751 44 2 2 60 99 34 0.758 0.15 6.03 Intr - 131950 131815 136 1 1 58 82 80 0.067 5.05 6.02 Intr - 143957 143292 666 2 0 115 56 720 0.011 64.06 6.01 Init - 154546 154466 81 1 0 66 32 107 0.596 1.82 6.00 Prom - 161024 160985 40 -4.51 7.04 PlyA - 161276 161271 6 1.05 7.03 Term - 162497 161901 597 2 0 132 41 404 0.121 35.04 7.02 Intr - 163698 163607 92 1 2 -8 56 81 0.067 -4.49 7.01 Init - 164845 164626 220 1 1 54 77 144 0.109 6.68 7.00 Prom - 167649 167610 40 -6.70 8.06 PlyA - 168240 168235 6 1.05 8.05 Term - 168457 168300 158 2 2 86 42 215 0.997 15.11 8.04 Intr - 171903 171775 129 1 0 102 87 259 0.999 28.37 8.03 Intr - 178655 178566 90 0 0 83 63 183 0.932 15.76 8.02 Intr - 178956 178792 165 1 0 132 -52 240 0.677 14.85 8.01 Init - 182423 182369 55 2 1 72 76 44 0.266 3.04 8.00 Prom - 183719 183680 40 -3.41 9.00 Prom + 185744 185783 40 -4.71 9.01 Init + 185996 186046 51 1 0 71 77 101 0.206 6.41 9.02 Intr + 193018 193201 184 0 1 85 82 202 0.505 19.18 9.03 Intr + 208108 208259 152 2 2 38 81 310 0.742 25.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 162491 161901 591 2 0 100 41 398 0.860 32.87 S.002 Init + 163595 163672 78 1 0 74 86 20 0.860 1.42 S.003 Intr + 164089 164189 101 0 2 89 43 96 0.822 4.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_1|154_aa MARRHRPGPPRLVPPPPLGHPHSAPEGGARGPRSPRRARVVQPIAAPRGARTRPSWGGRS GTRAGALTCHRQPPSLSGGALPPPIATLRWFPFLVFPGSGNGAGSGSGGGDASSEAGPTA STARPPPPPPPSPRLPPPPTLWTEIPPSRATNRK >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_1|465_bp atggcccgccgccaccgccccggccccccgaggctggtcccaccgcccccgctcggccac ccccactcagctcctgaaggaggggctcgagggccgcggtccccccgccgggcacgggtg gtacagcccatcgcggcaccgcggggggcccggacgcgaccatcgtggggggggcgttcg gggacacgcgctggcgcactcacctgtcaccggcagccgcccagcctctccggcggggct ctacccccgcccatcgccacgctgcgctggttccctttccttgtgtttcccggcagcggc aacggcgccggcagcggcagcggcggcggcgacgcctcctccgaggcaggcccaacggct agcacggcgcggccccccccgccccctccccccagcccccgcttaccgccccctcccacc ctctggacggaaataccgccctctcgcgccacaaacaggaagtga >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_2|417_aa MVVAALSQADFLPAAPGYSWAAPGSSVRARETMVSVTMATSEWIQFFKEAGIPPGPAVNY AVMFVDNRIQKSMLLDLNKEIMNELGVTVVGDIIAILKHAKVVHRQDMCKAATESVPCSP SPLAGEIRRGTSAASRMITNSLNHDSPPSTPPRRPDTSTSKISVTVSNKMAAKSAKATAA LARREEESLAVPAKRRRVTAEMEGKYVINMPKGTTPRTRKILEQQQAAKGLHRTSVFDRL GAETKADTTTGSKPTGVFSRLGATPETDEDLAWDSDNDSSSSVLQYAGVLKKLGRGPAKA SPQPALTVKAKATSSATTAAAPTLRRLALSSRSGLERKPESLSKVSIIKRLGAAALVPEA QDSQVTSTKSKSSAEVKVTIKRTLVGPRGSSSSEGLGAQMDHAGTVSVFKRLGRRTF >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_2|1254_bp atggtggtggcagctctgtcccaggctgacttcctgccagctgctccaggctattcctgg gcggctccagggagcagtgtcagggccagggagacgatggtctccgtgactatggccact tccgagtggatccagttctttaaggaagccggcattcctccaggacctgccgtcaattat gccgtgatgtttgtggataataggattcagaagagcatgctgctggatctcaataaggag ataatgaatgagctgggcgtgaccgtggtgggtgacatcatcgccattctcaagcatgcc aaagtggtgcaccgtcaggacatgtgcaaagctgccactgagtcagtaccctgcagccct agcccccttgcaggcgaaattcgccgtggcaccagtgctgcctcccgaatgatcaccaac agcctgaaccatgactctccacccagcacaccccccaggcgcccggacaccagcacctcc aagatctcggtcactgtgtccaacaagatggcagcaaagagtgccaaggccactgcagcc ctggcccgccgggaggaggagagcctggctgttcctgccaagcggcgccgggtcactgct gagatggaggggaagtacgtcatcaacatgcccaaaggcaccacaccccgcacccgcaag atcctggagcagcagcaggctgcaaaaggtctccataggacgtctgtgtttgaccgcctc ggcgccgagaccaaggcagacaccacgacagggagtaaacccacaggagtcttcagccgc ctgggggccaccccagaaacggacgaggatctggcttgggacagcgacaatgacagcagc agctctgtcttgcagtatgccggggtcctgaagaagctaggacggggcccagccaaggcc agtccccagccagcactgactgtcaaagccaaggccacaagctcagcgacaacggctgct gccccgacactgcggcgcctggcgctttcctcacggtctgggcttgagaggaagccggag tccttgtctaaagtcagcatcatcaagagactgggcgcagctgcccttgtgcccgaggcc caggacagccaggtcaccagcaccaagagtaagtcctcagccgaggtcaaggtcaccatt aagaggactctggtggggccccgggggagcagctccagcgagggccttggtgcccagatg gaccacgcgggcactgtgagcgtgttcaaaagactgggccgcaggaccttctag >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_3|26_aa MGHIVEGLVSHAEGWLLLPVSCIPSA >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_3|81_bp atggggcacatcgtggaaggccttgtgagccatgcagaaggctggcttttactcccagtg agctgtatccccagtgcctaa >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_4|490_aa MKPKLMYQELKVPAEEPANELPMNEIEAWKAAEKKARWVLLVLILAVVGFGALMTQLFLW EYGDLHLFGPNQRPAPCYDPCEAVLVESIPEGLDFPNASTGNPSTSQAWLGLLAGAHSSL DIASFYWTLTNNDTHTQEPSAQQGEEVLRQLQTLAPKGVNVRIAVSKPSGPQPQADLQAL LQSGAQVRMVDMQKLTHGVLHTKFWVVDQTHFYLGSANMDWRSLTQVKELGVVMYNCSCL ARDLTKIFEAYWFLGQAGSSIPSTWPRFYDTRYNQETPMEICLNGTPALAYLASAPPPLC PSGRTPDLKALLNVVDNARSFIYVAVMNYLPTLEFSHPHRFWPAIDDGLRRATYERGVKV RLLISCWGHSEPSMRAFLLSLAALRDNHTHSDIQVKLFVVPADEAQARIPYARVNHNKYM VTERATYIGTSNWSGNYFTETAGTSLLVTQNGRGGLRSQLEAIFLRDWDSPYSHDLDTSA DSVGNACRLL >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_4|1473_bp atgaagcctaaactgatgtaccaggagctgaaggtgcctgcagaggagcccgccaatgag ctgcccatgaatgagattgaggcgtggaaggctgcggaaaagaaagcccgctgggtcctg ctggtcctcattctggcggttgtgggcttcggagccctgatgactcagctgtttctatgg gaatacggcgacttgcatctctttgggcccaaccagcgcccagccccctgctatgaccct tgcgaagcagtgctggtggaaagcattcctgagggcctggacttccccaatgcctccacg gggaacccttccaccagccaggcctggctgggcctgctcgccggtgcgcacagcagcctg gacatcgcctccttctactggaccctcaccaacaatgacacccacacgcaggagccctct gcccagcagggtgaggaggtcctccggcagctgcagaccctggcaccaaagggcgtgaac gtccgcatcgctgtgagcaagcccagcgggccccagccacaggcggacctgcaggctctg ctgcagagcggtgcccaggtccgcatggtggacatgcagaagctgacccatggcgtcctg cataccaagttctgggtggtggaccagacccacttctacctgggcagtgccaacatggac tggcgttcactgacccaggtcaaggagctgggcgtggtcatgtacaactgcagctgcctg gctcgagacctgaccaagatctttgaggcctactggttcctgggccaggcaggcagctcc atcccatcaacttggccccggttctatgacacccgctacaaccaagagacaccaatggag atctgcctcaatggaacccctgctctggcctacctggcgagtgcgcccccacccctgtgt ccaagtggccgcactccagacctgaaggctctactcaacgtggtggacaatgcccggagt ttcatctacgtcgctgtcatgaactacctgcccactctggagttctcccaccctcacagg ttctggcctgccattgacgatgggctgcggcgggccacctacgagcgtggcgtcaaggtg cgcctgctcatcagctgctggggacactcggagccatccatgcgggccttcctgctctct ctggctgccctgcgtgacaaccatacccactctgacatccaggtgaaactctttgtggtc cccgcggatgaggcccaggctcgaatcccatatgcccgtgtcaaccacaacaagtacatg gtgactgaacgcgccacctacatcggaacctccaactggtctggcaactacttcacggag acggcgggcacctcgctgctggtgacgcagaatgggaggggcggcctgcggagccagctg gaggccattttcctgagggactgggactccccttacagccatgaccttgacacctcagct gacagcgtgggcaacgcctgccgcctgctctga >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_5|616_aa MSTIQSETDCYDIIEVLGKGTFGEVAKGWRRSTGEMVAIKILKNDAYRNRIIKNELKLLH CMRGLDPEEAHVIRFLEFFHDALKFYLVFELLEQNLFEFQKENNFAPLPARHIRTVTLQV LTALARLKELAIIHADLKPENIMLVDQTRCPFRVKVIDFGSASIFSEVRYVKEPYIQSRF YRAPEILLGLPFCEKVDVWSLGCVMAELHLGWPLYPGNNEYDQVRYICETQGLPKPHLLH AACKAHHFFKRNPHPDAANPWQLKSSADYLAETKVRPLERRKYMLKSLDQIETVNGGSVA SRLTFPDREALAEHADLKSMVELIKRMLTWESHERISPSAALRHPFVSMQQLRSAHETTH YYQLSLRSYRLSLQVEGKPPTPVVAAEDGTPYYCLAEEKEAAGMGSVAGSSPFFREEKAP GMQRAIDQLDDLSLQEAGHGLWGETCTNAVSDMMVPLKAAITGHHVPDSGPEPILAFYSS RLAGRHKARKPPAGSKSDSNFSNLIRLSQVSPEDDRPCRGSSWEEGEHLGASAEPLAILQ RDEDGPNIDNMTMEAERPDPELFDPSSCPGEWLSEPDCTLESVRGPRAQGLPPRRSHQHG PPRGATSFLQHVTGHH >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_5|1851_bp atgtccaccatccagtcggagactgactgctacgacatcatcgaggtcttgggcaagggg accttcggggaggtagccaagggctggcggcggagcacgggcgagatggtggccatcaag atcctcaagaatgacgcctaccgcaaccgcatcatcaagaacgagctgaagctgctgcac tgcatgcgaggcctagaccctgaagaggcccacgtcatccgcttccttgagttcttccat gacgccctcaagttctacctggtctttgagctgctggagcaaaaccttttcgagttccag aaggagaacaacttcgcgcccctccccgcccgccacatccgtacagtcaccctgcaggtg ctcacagccctggcccggctcaaggagctggctatcatccacgctgatctcaagcctgag aacatcatgctggtggaccagacccgctgccccttcagggtcaaggtgattgacttcgga tccgccagcattttcagcgaggtgcgctacgtgaaggagccatacatccagtcgcgcttc taccgggcccctgagatcctgctggggctgcccttctgcgagaaggtggacgtgtggtcc ctgggctgcgtcatggctgagctgcacctgggctggcctctctaccccggcaacaacgag tacgaccaggtgcgctacatctgcgaaacccagggcctgcccaagccacacctgttgcac gccgcctgcaaggcccaccacttcttcaagcgcaacccccaccctgacgctgccaacccc tggcagctcaagtcctcggctgactacctggccgagacgaaggtgcgcccattggagcgc cgcaagtatatgctcaagtcgttggaccagattgagacagtgaatggtggcagtgtggcc agtcggctaaccttccctgaccgggaggcgctggcggagcacgccgacctcaagagcatg gtggagctgatcaagcgcatgctgacctgggagtcacacgaacgcatcagccccagtgct gccctgcgccaccccttcgtgtccatgcagcagctgcgcagtgcccacgagaccacccac tactaccagctctcgctgcgcagctaccgcctctcgctgcaagtggaggggaagcccccc acgcccgtcgtggccgcagaagatgggaccccctactactgtctggctgaggagaaggag gctgcgggtatgggcagtgtggccggcagcagccccttcttccgagaggagaaggcacca ggtatgcaaagagccatcgaccagctggatgacctgagtctgcaggaggctgggcatggg ctgtggggtgagacctgcaccaatgcggtctccgacatgatggtccccctcaaggcagcc atcactggccaccatgtgcccgactcgggccctgagcccatcctggccttctacagcagc cgcctggcaggccgccacaaggcccgcaagccacctgcgggttccaagtccgactccaac ttcagcaacctcattcggctgagccaggtctcgcctgaggatgacaggccctgccggggc agcagctgggaggaaggagagcatctcggggcctctgctgagccactggccatcctgcag cgagatgaggatgggcccaacattgacaacatgaccatggaagctgagaggccagaccct gagctcttcgaccccagcagctgtcctggagaatggctgagtgagccagactgcaccctg gagagcgtcaggggcccacgggctcaggggctcccaccccgccgctcccaccagcatggt ccaccccggggggccaccagcttcctccagcatgtcaccgggcaccactga >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_6|1836_aa MGPRPALGVAGRPSPSPLPTLRPGPAGMLSKGLKRKREEEEEKEPLAVDSWWLDPGHTAV AQAPPAVASSSLFDLSVLKLHHSLQQSEPDLRHLVLVVNTLRRIQASMAPAAALPPVPSP PAAPSVADNLLASSDAALSASMASLLEDLSHIEGLSQAPQPLADEGPPGRSIGGAAPSLG ALDLLGPATGCLLDDGLEGLFEDIDTSMYDNELWAPASEGLKPGPEDGPGKEEAPELDEA ELDYLMDVLAPGMTLTATGLGQRADDPGDPLYYHLQGSGFSCPQLRTLCCVSVLERRTQE AQGAGGDPQAARTPRKGVSPADSCAAPRAGLLLGGSACTQAAAQRQLLHAELKLVLQQKG ERTQEPGVQVPPSNAMEARSRSAEELRRAELVEIIVETEAQTGVSGINVAGGGKEGIFVR ELREDSPAARSLSLQEGDQLLSARVFFENFKYEDALRLLQCAEPYKVSFCLKRTVPTGDL ALRPGTVSGYEIKGPRAKVAKLNIQSLSPVKKKKMVPGALGVPADLAPVDVEFSFPKFSR LRRGLKAEAVKGPVPAAPARRRLQLPRLRVREVAEEAQAARLAAAAPPPRKAKVEAEVAA GARFTAPQVELVGPRLPGAEVGVPQVSAPKAAPSAEAAGGFALHLPTLGLGAPAPPAVEA PAVGIQVPQVELPALPSLPTLPTLPCLETREGAVSVVVPTLDVAAPTVGVDLALPGAEVE ARGEAPEVALKMPRLSFPRFGARAKEVAEAKVAKVSPEARVKGPRLRMPTFGLSLLEPRP AAPEVVESKLKLPTIKMPSLGIGVSGPEVKVPKGPEVKLPKAPEVKLPKVPEAALPEVRL PEVELPKVSEMKLPKVPEMAVPEVRLPEVELPKVSEMKLPKVPEMAVPEVRLPEVQLLKV SEMKLPKVPEMAVPEVRLPEVQLPKVSEMKLPEVSEVAVPEVRLPEVQLPKVPEMKVPEM KLPKVPEMKLPEMKLPEVQLPKVPEMAVPDVHLPEVQLPKVPEMKLPEMKLPEVKLPKVP EMAVPDVHLPEVQLPKVPEMKLPKMPEMAVPEVRLPEVQLPKVSEMKLPKVPEMAVPDVH LPEVQLPKVCEMKVPDMKLPEIKLPKVPEMAVPDVHLPEVQLPKVSEIRLPEMQVPKVPD VHLPKAPEVKLPRAPEVQLKATKAEQAEGMEFGFKMPKMTMPKLGRAESPSRGKPGEAGA EVSGKLVTLPCLQPEVDGEAHVGVPSLTLPSVELDLPGALGLQGQVPAAKMGKGERVEGP EVAAGVREVGFRVPSVEIVTPQLPAVEIEEGRLEMIETKVKPSSKFSLPKFGLSGPKVAK AEAEGAGRATKLKVSKFAISLPKARVGAEAEAKGAGEAGLLPALDLSIPQLSLDAHLPSG KVEVAGADLKFKGPRFALPKFGVRGRDTEAAELVPGVAELEGKGWGWDGRVKMPKLKMPS FGLARGKEAEVQGDRASPGEKAESTAVQLKIPEVELVTLGAQEEGRAEGAVAVSGMQLSG LKVSTAGQVVTEGHDAGLRMPPLGISLPQVELTGFGEAGTPGQQAQSTVPSAEGTAGYRV QVPQVTLSLPGAQVAGGELLVGEGVFKMPTVTVPQLELDVGLSREAQAGEAATGEGGLRL KLPTLGARARVGGEGAEEQPPGAERTFCLSLPDVELSPSGGNHAEYQVAEGEGEAGHKLK VRLPRFGLVRAKEGAEEGEKAKSPKLRLPRVGFSQSEMVTGEGSPSPEEEEEEEEEGSGE GASGRRGRVRVRLPRVGLAAPSKASRGQEGDAAPKSPVREKSPKFRFPRVSLSPKARSGS GDQEEGGLRVRLPSVGFSETGAPGPARMEGAQAAAV >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_6|5511_bp atgggcccgcggcccgccctgggcgtggcgggaaggcccagtccctccccgctgcccacc ctccgccctgggccggccgggatgctgagcaagggtctgaagcggaaacgggaggaggag gaggagaaggaacctctggcagtcgactcctggtggctagatcctggccacacagcggtg gcacaggcacccccggccgtggcctctagctccctctttgacctctcagtgctcaagctc caccacagcctgcagcagagtgagccggacctgcggcacctggtgctggtcgtgaacact ctgcggcgcatccaggcgtccatggcacccgcggctgccctgccacctgtgcctagccca cctgcagcccccagtgtggctgacaacttactggcaagctcggacgctgccctttcagcc tccatggccagcctcctggaggacctcagccacattgagggcctgagtcaggctccccaa cccttggcagacgaggggccaccaggccgtagcatcgggggagcagcgcccagcctgggt gccttggacctgctgggcccagccactggctgtctactggacgatgggcttgagggcctg tttgaggatattgacacctctatgtatgacaatgaactttgggcaccagcctctgagggc ctcaaaccaggccctgaggatgggccgggcaaggaggaagctccggagctggacgaggcc gaattggactacctcatggatgtgctggctccgggcatgaccctcacagccacgggcctg ggacagagagctgatgacccaggagaccccctctactaccacctacaaggttcaggcttc tcgtgtccccagctcaggactctgtgctgtgtatcagtcctggagcgccggacccaggag gcccaaggagctggaggtgaccctcaggcagcaagaaccccacggaagggcgtgagccct gcagacagctgtgcggcacctcgggctgggctcctgttaggaggaagtgcctgcacccag gcagcggctcagaggcagctgctccatgcagaactgaagctggttctgcagcagaaaggg gagaggacacaggagcctggggtgcaggtgcctcccagcaacgccatggaggccaggagc cggagtgccgaggagctgaggcgggcggagttggtggaaattatcgtggagacggaggcg cagaccggggtcagcggcatcaacgtagcgggcggcggcaaagagggaatcttcgttcgg gagctgcgcgaggactcacccgccgccaggagcctcagcctgcaggaaggggaccagctg ctgagtgcccgagtgttcttcgagaacttcaagtacgaggacgcactacgcctgctgcaa tgcgccgagccttacaaagtctccttctgcctgaagcgcactgtgcccaccggggacctg gctctgcggcccgggaccgtgtctggctacgagatcaagggcccgcgggccaaggtggcc aagctgaacatccagagtctgtcccctgtgaagaagaagaagatggtgcctggggctctg ggggtccccgctgacctggcccctgttgacgtcgagttctcctttcccaagttctcccgc ctgcgtcggggcctcaaagccgaggctgtcaagggtcctgtcccggctgcccctgcccgc cggcgcctccagctgcctcggctgcgtgtacgagaagtggccgaagaggctcaggcagcc cggctggccgccgccgctcctccccccaggaaagccaaggtggaggctgaggtggctgca ggagctcgtttcacagcccctcaggtggagctggttgggccgcggctgccaggggcggag gtgggtgtcccccaggtctcagcccccaaggctgccccctcagcagaggcagctggtggc tttgccctccacctgccaacccttgggctcggagccccggctccgcctgctgtggaggcc ccagccgtgggaatccaggtcccccaggtggagctgcctgccttgccctcactgcccact ctgcccacacttccctgcctagagacccgggaaggggctgtgtcggtagtggtgcccacc ctggatgtggcagcaccgactgtgggggtggacctggccttgccgggtgcagaggtggag gcccggggagaggcacctgaggtggccctgaagatgccccgccttagttttccccgattt ggggctcgagcaaaggaagttgctgaggccaaggtagccaaggtcagccctgaggccagg gtgaaaggtcccagacttcgaatgcccacctttgggctttccctcttggagccccggccc gctgctcctgaagttgtagagagcaagctgaagctgcccaccatcaagatgccctccctt ggcatcggagtgtcagggcccgaggtcaaggtgcccaagggacctgaagtgaagctcccc aaggctcctgaggtcaagcttccaaaagtgcccgaggcagcccttccagaggttcgactc ccagaggtggagctccccaaggtgtcagagatgaaactcccaaaggtgccagagatggct gtgccggaggtgcggcttccagaggtagagctgcccaaagtgtcagagatgaaactccca aaggtgccagagatggctgtgccggaggtgcggcttccagaggtacagctgctgaaagtg tcggagatgaaactcccaaaggtgccagagatggctgtgccggaggtgcggcttccagag gtacagctgccgaaagtgtcagagatgaaactcccagaggtgtcagaggtggctgtgcca gaggtgcggcttccagaggtgcagctgccgaaagtgccagagatgaaagtccctgagatg aagcttccaaaggtgcctgagatgaaacttcctgagatgaaactccctgaagtgcaactc ccgaaggtgcccgagatggccgtgcccgatgtgcacctcccagaagtgcagcttccaaaa gtcccagagatgaagctccctgagatgaaactccctgaggtgaaactcccgaaggtgccc gagatggctgtgcccgatgtgcacctcccggaagtgcagctcccgaaagtcccagagatg aaactccctaaaatgcctgagatggctgtgccagaggttcgactccccgaggtgcagctg ccaaaagtctcagagatgaaactccccaaggtgcctgaaatggccgtgcccgatgtgcac ctcccagaggtgcagctgcccaaagtctgtgaaatgaaagtccctgacatgaagctccca gagataaaactccccaaggtgcctgagatggctgtgcccgatgtgcacctccccgaggtg cagctgccgaaagtgtcagagattcggctgccggaaatgcaagtgccgaaggttcccgac gtgcatcttccgaaggcaccagaggtgaagctgcccagggctccggaggtgcagctaaag gccaccaaggcagaacaggcagaagggatggaatttggcttcaagatgcccaagatgacc atgcccaagctagggagggcagagtccccatcacgtggcaagccaggcgaggcgggtgct gaggtctcagggaagctggtaacacttccctgtctgcagccagaggtggatggtgaggct catgtgggtgtcccctctctcactctgccttcagtggagctagacctgccaggagcactt ggcctgcaggggcaggtcccagccgctaaaatgggcaagggagagcgggtggagggccct gaggtggcagcaggggtcagggaagtgggcttccgagtgccctctgttgaaattgtcacc ccacagctgcccgccgtggaaattgaggaagggcggctggagatgatagagacaaaagtc aagccctcttccaagttctccttacctaagtttggactctcggggccaaaggtggctaag gcagaggctgagggggctgggcgagctaccaagctgaaggtatccaaatttgccatctca ctccccaaggctcgggtgggggctgaggctgaggccaaaggggctggggaggcaggcctg ctgcctgccctcgatctgtccatcccacagctcagcctggatgcccacctgccctcaggc aaggtagaggtggcaggggccgacctcaagttcaaggggcccaggtttgctctccccaag tttggggtcagaggccgggacactgaggcagcagaactagtgccaggggtggctgagttg gagggcaagggctggggctgggatgggagggtgaagatgcccaagctgaagatgccttcc tttgggctggctcgagggaaggaagcagaagttcaaggtgatcgtgccagcccgggggaa aaggctgagtccaccgctgtgcagcttaagatccccgaggtggagctggtcacgctgggc gcccaggaggaagggagggcagagggggctgtggccgtcagtggaatgcagctgtcaggc ctgaaggtgtccacagccgggcaggtggtcactgagggccatgacgcggggctgaggatg cctccgctgggcatctccctgccacaggtggagctgaccggctttggggaggcaggtacc ccagggcagcaggctcagagtacagtcccttcagcagagggcacagcaggctacagggtt caggtgccccaggtgaccctgtctctgcctggagcccaggttgcaggtggtgagctgctg gtgggtgagggtgtctttaagatgcccaccgtgacagtgccccagcttgagctggacgtg gggctaagccgagaggcacaggcgggcgaggcggccacaggcgagggtgggctgaggctg aagttgcccacactgggggccagagctagggtggggggcgagggtgctgaggagcagccc ccaggggccgagcgtaccttctgcctctcactgcccgacgtggagctctcgccatccggg ggcaaccatgccgagtaccaggtggcagagggggagggagaggccggacacaagctcaag gtacggctgccccggtttggcctggtgcgggccaaggagggggccgaggagggtgagaag gccaagagccccaaactcaggctgccccgagtgggcttcagccaaagtgagatggtcact ggggaagggtcccccagccccgaggaggaggaggaggaggaggaagagggcagtggggaa ggggcctcgggtcgccggggccgggtccgggtccgcttgccacgtgtaggcctggcggcc ccttctaaagcctctcgggggcaggagggcgatgcagcccccaagtcccccgtcagagag aagtcacccaagttccgcttccccagggtgtccctaagccccaaggcccggagtgggagt ggggaccaggaagagggtggattgcgggtgcggctgcccagcgtggggttttcagagaca ggggctccaggcccggccaggatggagggggctcaggctgcggctgtctga >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_7|302_aa MAPPSQSTPRGGRDSSQSRLAARLRGGAGGPEARRRGPAEVSEAPTGHWWVLALAGRNAV GVPGGGRGEPGSCVRARRQFVYLAAVGPRFPEAPGPNWGVPGCSGIMVGGLKRKHSDLEE EEERWEWSPAGLQSYQQALLRISLDKVQRSLGPRAPSLRRHVLIHNTLQQLQAALRLAPA PALPPEPLFLGEEDFSLSATIGSILRELDTSMDGTEPPQNPVTPLGLQNEVPPQPDPVFL EALSSRYLGDSGLDDFFLDIDTSAVEKEPARAPPEPPHNLFCAPGSWEWNELDHIMEIIL GS >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_7|909_bp atggctccgcccagccaatcgacgccccgcggcgggcgggactccagccaatcgcggctc gcggctcggctccggggcggggctggtgggccagaggccagacggagaggccccgccgag gtgagcgaggctccaaccggtcactggtgggtcctggcactggcgggtcgcaacgctgtg ggcgttccaggaggtggtcgtggcgaacctggcagctgcgtgcgtgctcgacgtcaattc gtgtacctggcggctgttggcccccgtttcccagaagcccctgggccaaattggggcgtt cctggatgctctggcatcatggtgggaggcttgaagaggaaacactctgatttggaagag gaggaggagaggtgggagtggagtccagcaggccttcagagctaccagcaagccctgctc cgcatctccctagacaaagtccagcgcagcctgggcccccgagcacccagcctccgcagg catgtcctcatccataacaccctccaacagctgcaggctgcacttcgcctggctcccgcc cctgccctgccccccgagcccctcttcctgggcgaggaggatttctccctgtcagccacc attggctctatcctcagggagctggacacctccatggatgggactgagccccctcagaat ccagtgactccccttggcctccagaatgaagtgccaccccagcctgatccagtcttctta gaagctctgagctcccggtacttgggggactctggcctggatgacttctttctggacatt gacacatctgcggtagaaaaggagcctgcacgggccccaccagagcctcctcacaacctc ttctgtgccccaggttcttgggagtggaatgaactggatcacatcatggaaatcattctg gggtcctaa >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_8|198_aa MAENMTQSLDPQGAWFAAGYEVTVLVRDSSRLPSEGPRPAHVVVGDVLQAADVDKTVAGQ DAVIVLLGTRNDLSPTTVMSEGARNIVAAMKAHGVDKVVACTSAFLLWDPTKVPPRLQAV TDDHIRMHKVLRESGLKYVAVMPPHIGDQPLTGAYTVTLDGRGPSRVISKHDLGHFMLRC LTTDEYDGHSTYPSHQYQ >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_8|597_bp atggcagaaaacatgacacagtccctggacccacagggtgcttggtttgctgcaggttac gaagtgacagtgctggtgcgggactcctccaggctgccatcagaggggccccggccggcc cacgtggtagtgggagatgttctgcaggcagccgatgtggacaagaccgtggctgggcag gacgctgtcatcgtgctgctgggcacccgcaatgacctcagtcccacgacagtgatgtcc gagggcgcccggaacattgtggcagccatgaaggctcatggtgtggacaaggtcgtggcc tgcacctcggctttcctgctctgggaccctaccaaggtgcccccacgactgcaggctgtg actgatgaccacatccggatgcacaaggtgctgcgggaatcaggcctgaagtacgtggct gtgatgccgccacacataggagaccagccactaactggggcgtacacagtgaccctggat ggacgagggccctcaagggtcatctccaaacatgacctgggccatttcatgctgcgctgc ctcaccaccgatgagtacgacggacacagcacctacccctcccaccagtaccagtag >gi568815579r:40279590_40489902|GENSCAN_predicted_peptide_9|129_aa MPRPPRLMPACTACARVASPSPMAQVPGEVDNMEGLPAPNNNPAARWESPDRGWEREQPA ASTAAASLFECSRIKALADEREAVQKKTFTKWVNSHLARVGCHIGDLYVDLRDGFVLTRL LEVLSGEQL >gi568815579r:40279590_40489902|GENSCAN_predicted_CDS_9|387_bp atgccccgcccgccccggctcatgcctgcttgcaccgcctgcgccagggtggcctcacct tccccgatggcgcaggtaccaggggaagtggacaacatggagggcctgcctgctcctaac aacaaccctgctgcccgctgggagagtccggatcggggctgggagcgggagcagccggct gcgtccaccgcagcggcctcgctctttgagtgctcccggatcaaggccttggcagatgag cgggaagccgtgcagaagaaaaccttcaccaagtgggtgaactcgcacctcgcccgcgtg ggctgccacatcggggacctctatgtggacctccgggacggcttcgtgctcacgcggctc ctggaagtgctgtctggggagcagctg