GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:34:09

Sequence gi568815579r:53166763_53367665 : 200903 bp : 43.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  16314  16379   66  2  0   85   89    10 0.232   1.87
 1.02 Intr +  22494  22588   95  2  2   72   29    93 0.022   0.56
 1.03 Intr +  30404  30500   97  2  1   71   99    18 0.362   1.21
 1.04 Intr +  37254  37380  127  2  1   77   68   111 0.970   8.25
 1.05 Intr +  38190  38318  129  1  0   54  110    39 0.672   3.37
 1.06 Intr +  44219  45676 1458  0  0   43  -58   733 0.143  43.87
 1.07 Term +  45858  46501  644  1  2   44   42   319 0.695  17.23
 1.08 PlyA +  48154  48159    6                               1.05

 2.07 PlyA -  49284  49279    6                               1.05
 2.06 Term -  58002  57209  794  1  2   91   28   418 0.856  29.56
 2.05 Intr -  71795  70320 1476  0  0   73   50   449 0.883  29.31
 2.04 Intr -  77135  76982  154  2  1   67   46   125 0.764   5.85
 2.03 Intr -  88183  88072  112  0  1   89  100    56 0.075   7.28
 2.02 Intr -  91845  91751   95  0  2   68   29    65 0.016  -2.64
 2.01 Init - 100903 100190  714  1  0   90   93   204 0.441  14.00
 2.00 Prom - 107825 107786   40                              -3.46

 3.00 Prom + 110670 110709   40                               0.24
 3.01 Init + 112939 113010   72  0  0   86   76    51 0.838   4.77
 3.02 Intr + 113794 113966  173  0  2   66   83     7 0.891  -3.26
 3.03 Intr + 114738 114915  178  0  1   81   56   151 0.941  11.12
 3.04 Intr + 115362 115564  203  2  2  142  116    55 0.992  11.78
 3.05 Intr + 116047 116155  109  1  1   48   69    54 0.735  -0.21
 3.06 Term + 117238 118185  948  0  0   92   54   218 0.851  10.88
 3.07 PlyA + 118765 118770    6                               1.05

 4.02 PlyA - 118831 118826    6                               1.05
 4.01 Sngl - 123612 122902  711  0  0   67   35   271 0.992  15.93
 4.00 Prom - 125546 125507   40                              -6.16

 5.07 PlyA - 125704 125699    6                               1.05
 5.06 Term - 132951 132286  666  0  0   78   54   131 0.575   2.53
 5.05 Intr - 134112 134004  109  2  1   54   96    45 0.851   2.19
 5.04 Intr - 134798 134596  203  2  2  142  116    67 0.993  12.98
 5.03 Intr - 135422 135245  178  1  1   81   56   170 0.913  13.02
 5.02 Intr - 142730 142706   25  0  1  109   77    -5 0.027  -2.42
 5.01 Init - 143739 143682   58  0  1   55   78    41 0.056   1.27
 5.00 Prom - 144037 143998   40                              -6.56

 6.00 Prom + 144061 144100   40                              -3.46
 6.01 Init + 145400 145481   82  1  1   89  113    16 0.308   5.33
 6.02 Intr + 148693 149090  398  2  2   76   93   157 0.627   9.30
 6.03 Intr + 152903 153001   99  1  0   82   75    34 0.043   1.81
 6.04 Intr + 178744 178870  127  0  1   86   68    63 0.067   4.35
 6.05 Term + 184056 186826 2771  1  2   59   43  2086 0.020 186.60
 6.06 PlyA + 187165 187170    6                               1.05

 7.00 Prom + 193331 193370   40                              -7.06
 7.01 Init + 195035 195254  220  1  1   71  103   158 0.918  14.39
 7.02 Term + 196952 196998   47  0  2   73   45    69 0.196  -1.53
 7.03 PlyA + 197850 197855    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  88139  88072   68  0  2  101  100    82 0.835   9.32
S.002 Sngl + 184076 186826 2751  1  0   71   43  2075 0.889 194.22

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579r:53166763_53367665|GENSCAN_predicted_peptide_1|871_aa MVLCCPLHQKRCRDEKLLQEKQRRESVEITTQDKEIKEKAAAPGGPLPPMHGDRLNLRFA VPLLPSPGSGGVPTDLKMTTPRPPPQGRLTFRDVAIEFSQEEWKCLDPAQRALYRDVMLE NYRNLVSLGISLSEMSITSMLGQGKEPWTVESEVKITKTSSGWECIKGVNTDVSPTHMIK ELPPKENNSTGKVFQAVMLERHEIHDIQDFYFREMQQNIHDFECQCHNDERNYREIPAIK IKNITGRRDQGDGRNAGNKPVENQLELSFWLHLAELQGFQIERKIYECNQIEKSVTRGSS VSLLQTNLPCVNTSISNIYRNDFMHPSLLTQDQKAYIREKPYKCSDCGKAFNQRSNLTTH QRIHTGQKPHKCDICGKGFRRIANLASHHRIHTGEKPYRCNECGKTFNQTFNLTTHQRIH TGQKPYKCDRCGKGFRQIGNLASHHRIHTGEKPYRCNECGKTFNRMFHLTRHQRIHTGQK PYKCDICGKGFRQIANLASHHRIHTGEKPYKCSDCGKTFNYRSHLTRHQRIHTGQKPYKC DTCGKDFSQNSYLENHQRIHTREKSYRCNECGKTFNKMTNLTTHQRIHTGQKTYKCDICG KGFRQIGNLASHHRIHTGEKPYRCNECGKTFNRMFHLTRHQRIHTGQKPYKCDICGKKFK IVKHHRIHSGEKQNKSNECSKAFIQNSSLVDHQRTHTGEKPYKCNECGKTFIGHSSLTNH QVIHTGEKPYKCNECGKAYVKWSHLRHHESIQTGEKPYKCTKCSKAFRQWADIRIHQKIY AGEKPHKYDECGKTFTQASHLTIHQIIHTGEKPYEYDIYGKVFSQNSHFKSYHRICTEEK PYKCVWQGPQSKFTPCKSSENSYWREILQMS >gi568815579r:53166763_53367665|GENSCAN_predicted_CDS_1|2616_bp atggtcctctgctgcccactgcaccagaaaagatgcagggatgagaaactcctacaggaa
aaacaacgacgagagagtgtagaaataacgacacaagacaaagagataaaagaaaaggca gctgcgcccgggggaccactaccaccaatgcacggagaccgattaaatctgcgcttcgca gtcccccttcttccatcccccgggtctggaggggtccctacagatcttaaaatgaccaca ccgcgccctccaccccagggaaggttgacattcagggacgtggccatcgaattctctcag gaggagtggaaatgcctggaccctgctcagagggctttgtacagggacgtgatgttggag aactacaggaacctggtctccctgggaatctctctttctgaaatgagtattacctccatg ttggggcaagggaaagagccctggactgtggagagtgaagtgaaaataacaaaaacatca agtggttgggaatgcatcaaaggtgtgaacacagatgtctctcctacacatatgatcaag gaattaccaccaaaagagaacaatagcacaggaaaagtattccaagcagtgatgttggaa agacatgaaatccatgacatccaagatttttacttcagagaaatgcagcaaaatattcat gactttgagtgtcagtgtcacaatgatgagagaaattacagagaaatacctgcaattaaa attaaaaacatcactggaagaagagatcaaggtgatggaaggaatgcaggaaataagcca gttgaaaatcagcttgaactaagcttttggttgcatctggctgaactgcagggatttcaa attgaaaggaaaatttatgaatgtaatcaaattgagaagtctgtcacccgtgggtcctca gtttcattacttcaaacaaatcttccttgtgtcaacaccagcatttctaatatatacagg aatgattttatgcatccttcattactcacacaagaccagaaagcatacattagggaaaaa ccttacaaatgtagtgattgtggcaaggcctttaatcagaggtccaaccttactacacat cagagaatccatactggacagaaaccacataaatgtgacatatgtggcaaaggtttcagg cgaattgcaaacctagcaagtcatcatagaattcatactggagagaaaccttacagatgt aatgagtgtggcaagaccttcaatcagacgttcaaccttactacacatcagagaatccat actggacagaaaccatataaatgtgatagatgtggcaaaggtttcaggcaaattggaaac ctagcaagtcatcatagaattcatactggagagaaaccttacagatgtaacgagtgtggc aagacctttaacaggatgttccaccttactagacatcagagaatccatactggacagaaa ccatataaatgtgatatatgtggcaaaggtttcaggcaaattgcaaacctagcaagtcat catagaattcatactggagagaaaccttacaaatgtagtgattgtggcaagacctttaat tacaggtcccaccttacaagacatcagagaatccatactggacagaaaccgtataaatgt gatacatgtggcaaagatttcagtcaaaattcataccttgaaaatcatcagagaattcac actagagagaaatcttacagatgtaatgagtgtggcaagacctttaataagatgaccaac cttactacacatcagagaatccatactggacagaaaacatataaatgtgatatatgtggc aaaggtttcaggcaaattggaaacctagcaagtcatcatagaattcatactggagagaaa ccttacagatgtaatgagtgtggcaagaccttcaataggatgttccaccttactagacat cagagaatccatactggacagaaaccatataaatgtgatatatgtggcaaaaagttcaaa 
attgtaaaacatcacaggattcattctggagagaaacaaaacaaaagtaatgaatgtagt aaggcatttattcaaaattcaagtctggtggaccatcagagaactcacactggagagaaa ccttacaagtgtaatgagtgtggtaaaacctttattgggcattcaagcctaactaaccat caggtaattcatactggagagaaaccttacaaatgtaatgagtgtggaaaggcttatgtg aagtggtcccaccttagacatcatgagagtattcaaactggagagaagccatacaaatgt accaaatgcagcaaggcctttagacaatgggcggacatcaggattcaccaaaaaatctat gctggagagaaacctcacaagtatgatgagtgtggaaaaacctttacccaggcctctcac ctcactatacatcagattatccatactggagagaaaccatatgaatatgacatatatggc aaagtcttcagtcaaaattcacatttcaaaagttatcataggatttgtactgaagagaag ccttacaaatgtgtgtggcaaggtcctcagtcaaaattcacaccttgtaaatcatcagag aattcatactggagagaaatcctacagatgtcatga >gi568815579r:53166763_53367665|GENSCAN_predicted_peptide_2|1114_aa MASRYVAVGMILSQTVVGVLGSFSVLLHYLSFYCTGCRLRSTDLIVKHLIVANFLALRCK GVPQTMAAFGVRYFLNALGCKLVFYLHRVGRGVSIGTTCLLSVFQVITVSSRKSRWAKLK EKAPKHVGFSVLLCWIVCMLVNIIFPMYVTGKWNYTNITVNEDLGYCSGGGNNKIAQTLR AMLLSFPDVLCLGLMLWVSSSMVCILHRHKQRVQHIDRSNLSPRASPENRATQSILILRQ ESVEIKTQDKEMKEKTAVSGGPLPPRRGDRCPRPAPALAPCAAPVHSRTRRFLPTRKRIS RGYWRSRGLFTFKDVAIEFSQEEWECLDPAQRALYRDVMLENYRNLLSLDEDNIPPEDDI SVGFTSKGLSPKENNKEELYHLVILERKESHGINNFDLKEVWENMPKFDSLWDYDVKNYK GMPLTCNKNLTHRKDQQHNKSSIHFSLKQSVSIRDSAHQYFIHDKPFIRNLLKLKNNIRY AGNKYVKCFENKIGLSLQAQLAELQRFQTGEKMYECNPVEKSINSSSVSPLPPCVKNICN KYRKILKYPLLHTQYGRTHIREKSYKCNDCGKAFSKSSNLTNHQRIHSGQRPYKCNECGK AFNQCSNLTRHQRVHTGEKPYQCNICGKVCSQNSNLASHQRMHTGEKPYKCNECGKAFIQ RSHLWGHERIHTGEKPYKCNECDKAFAERSSLTQHKRIHTGEKPYICNECGKAFKQCSHL TRHQNIHPGEKPHKCNVCGRAFIQSSSLVEHQRIHTGEKPYKCNKCDKAFIKRSHLWGHQ RTHTGEKPYKCTECGKAFTERSNLTQHKKIHTGEKPYKCTECGKAFTQFANLTRHQKIHI EKKHCKHNIHVLGAGEPWALHGPAGGGKDTTALALLGRPLQPHRKTCEELHKTAMQNGAG GALFVHRDTPENNPDTPFDFTPENYKRIEAIVKNYPEGHKAAAVLPVLDLAQRQNGWLPI SAMNKVAEVLQVPPIRVYEVATFYTMYNRKPVEKYHIQVCTTTPCMLPNSDSILEAIQKK LGIKLGETTPDKLFTLVEVECLGACVNAPMVQINDNYYEDLTAKDIEEIIDELKAGKIPK PGPRSGRFSCEPAGGLTPLTEPPKGPGFGIQAGL >gi568815579r:53166763_53367665|GENSCAN_predicted_CDS_2|3345_bp atggcctcccggtatgtggcagtgggaatgatcttatcacagaccgtggtgggagtcctg 
gggagcttctctgttcttctccattatctctccttttactgcactgggtgcaggttaagg tccacagatttgattgttaagcacctgattgtagccaacttcttagctctccgctgtaaa ggagtcccccagacaatggcagcttttggggttagatattttctcaatgctcttgggtgc aaacttgttttctatctccatagagtgggcaggggagtgtccattggcaccacctgcctc ttgagtgtcttccaggtgatcacggtcagctccaggaaatccaggtgggcaaaacttaaa gagaaagcccccaagcatgttggcttttctgttctcctgtgctggatcgtgtgcatgttg gtaaacatcatctttcccatgtatgtgactggcaaatggaactacacaaacatcacagtg aacgaggatttgggatactgttctgggggaggcaacaacaaaatcgcacagacactgcgt gcaatgttgttatcattccctgatgtgttgtgtctggggctcatgctctgggtcagcagc tccatggtttgcatactgcacaggcacaagcagcgggtccagcacattgataggagcaat ctctcccccagagcctccccagagaacagagctacgcagagcatcctcatcctgagacaa gagagtgtagaaataaagacacaagacaaagagatgaaagaaaagacagctgtgtctggg ggaccactaccaccaagacgcggagaccgttgtccgcgccctgcccctgcccttgcccct tgcgccgccccagtgcactcgcgtacgcgcaggttcctgccgacccggaagcggatctcg cggggctactggcgctctcggggactgtttacattcaaggatgtggccatagaattctct caagaggagtgggagtgcctggaccctgcccagagggccttgtacagggacgtgatgttg gagaactacaggaacctgctttctctcgatgaggataacatccctccagaagatgatatt tctgttggatttacaagcaagggattatcaccaaaggaaaataataaagaggaattatac catctggtgatattagaaagaaaggaaagccatggcatcaacaattttgacctcaaggaa gtctgggaaaatatgcctaagtttgacagcctgtgggactatgatgtaaaaaattacaaa ggaatgcctttgacctgtaacaaaaatctcactcacagaaaagatcaacaacataataaa tcctcaatacatttctctttaaagcagagtgtttctataagagatagtgcacaccagtat ttcatccatgacaagccatttataaggaatttgttaaaactgaaaaataacataaggtat gccggaaacaaatacgtgaagtgttttgaaaataaaattggattaagcttacaggcacag ctggctgaactacagagatttcaaactggggagaaaatgtatgaatgtaatccagttgag aagtctatcaatagttcctcagtttcaccacttcctccttgtgtcaaaaacatttgtaat aaatataggaagattttgaaataccctttattacatacacagtatgggagaacacacatt agagaaaaatcatacaagtgtaatgactgtggaaaggcttttagcaaaagttcgaacctc actaatcatcagagaattcactctggacagagaccttacaaatgtaacgagtgtggcaaa gcctttaaccagtgttcgaacctcactaggcatcagagagtccatacaggagagaaacca tatcaatgtaatatatgtggcaaggtctgtagtcaaaattcaaatcttgcaagtcatcag aggatgcatactggagagaaaccttacaaatgtaatgaatgtggtaaggcatttatccag 
cgttcacacctttggggtcatgaaagaattcatactggagagaaaccttacaaatgtaat gaatgtgacaaagcctttgctgaacgttcaagccttacccaacataagagaatccatact ggagagaagccttacatatgtaatgagtgtggcaaagcttttaagcagtgctcacatctc actaggcatcagaatatacatcctggagagaaaccacacaaatgtaatgtgtgtggcagg gcttttatccaaagttcaagtcttgtggaacatcagagaattcacactggagaaaaacct tacaaatgtaataaatgtgataaagcttttatcaaacgttcacacctttggggtcatcag agaactcatactggagagaaaccttacaaatgtactgaatgtggcaaagcctttactgaa cgttcaaatcttactcagcataagaaaatccatactggagagaaaccttacaaatgtact gaatgtggcaaagcttttacccaatttgcaaacctcactagacaccagaaaatacacatt gaaaagaaacattgtaaacataatatacatgttttgggcgcaggagagccctgggctcta cacgggcctgcaggcggcggcaaggacactacggccttagccctcttagggaggccgctg cagccccaccggaagacatgtgaggaattgcataagacagctatgcaaaatggagctgga ggagctttatttgtgcacagagatactcctgagaataaccctgatactccatttgatttc acaccagaaaactataagaggatagaggcaattgtaaaaaactatccagaaggccataaa gcagcagctgttcttccagtcctggatttagcccaaaggcagaatgggtggttgcccatc tctgctatgaacaaggttgcagaagttttacaagtacctccaataagagtatatgaagta gcaactttttatacaatgtataatcgaaagccagttgaaaagtatcacattcaggtctgc actactacaccctgcatgcttccaaactctgacagcatactggaggccattcagaaaaag cttggaataaagcttggggagactacacctgacaaacttttcactcttgtagaagtggaa tgtttaggggcctgtgtgaacgcaccaatggttcaaataaatgacaattactatgaagat ttgacagctaaggatattgaagaaattattgatgagctcaaggctggcaaaatcccaaaa cctgggccaaggagtggacgcttctcttgtgagccagctggaggtcttacccctttgact gaaccacccaagggacctggatttggcatacaagcaggcctttaa >gi568815579r:53166763_53367665|GENSCAN_predicted_peptide_3|560_aa MDPNTLSMEQGKQFLKQCGIPQRDIDLMTEKWVVLASVEVLLRFPLKPGEDPTARYVSNK KCQPSVDWPTTISQRRGYQEKWKSNHHLMKKNDRKFKDLPQMARHSVHHQAQRPPRAKNP QEQQRRPMGQTTLPAEQEESRVKCKNCGAFGHSARSKTCPIKRWSGALPLQALGSHKEKE NLKPAKAQLPFTTPGPFTTNDREKERSPSPQQQQSEAPTQTFPRTPQEKMQEAWKEPAED CLFLRHPTMPLPVHTTKKRSVLGPVSTGPPPVNKPEMRLLCPSGHNDSPQLSTCGPTKGH GRDVTASLLPVLKSSHQTPTLSARLPANRPDMSSHGALQPAMQALALGPGLKSQAEIKHP DADAKPRPQQVRKQCGQDSRTQAPDKEPAPVPTQTFQNPAKKARFSSFQTPALRTQLPDV GAVQTLQPPRTATGLGSKEAPKATAETAATKTATLQPRVNLQPAPSSPFLGPAQGCPVLQ 
PGPPIHVPGRPGSVTFMRGDKGQKSPRFRMPPTSRPPENSASAQSPRFSRQPEGQGPQVS TSVLYEDLLVTSSSEDSDSD >gi568815579r:53166763_53367665|GENSCAN_predicted_CDS_3|1683_bp atggaccccaacactctctccatggaacaggggaagcagttcttaaagcagtgcgggatc ccacaaagagatattgacttgatgacagagaaatgggttgttctcgcctctgtggaggtt cttctgcgatttcctctgaagccaggagaggatccgaccgcccgctatgtctcaaataaa aaatgccaaccttcagtggactggcccaccacaatttcacagagacgtggataccaagag aagtggaaatccaaccatcatttaatgaagaagaatgatcggaagttcaaggacctgcca cagatggcccgtcattccgtccatcatcaggctcagaggccccccagagccaaaaaccca caggagcagcagaggagacccatggggcagacgactctcccagcagagcaggaggaatcc agggtgaaatgcaagaactgcggggcctttgggcactcagccaggagcaagacctgcccc attaagaggtggagtggggcccttcctctgcaggccctgggctcacacaaggagaaggag aacctgaaaccagcaaaggcccagctaccctttacgactccagggccctttacgacgaat gacagagaaaaggagcgaagtccaagtccccagcagcagcagagcgaagctccgacgcag acatttcccagaactccccaagagaaaatgcaggaagcctggaaggagccagcagaagat tgtttgttcctgaggcatcctaccatgccactgcctgtccacaccaccaagaagagatct gtcctgggccctgtgtccacaggtccaccgcctgtcaacaaacccgagatgagattactc tgcccttcgggtcacaacgattcacctcaactgagcacctgtggacccaccaaaggacat ggcagggacgttactgcctccctgctccctgttctgaagagctcccaccagacccccact ctcagtgccaggctgccagccaacaggcctgacatgtcctcccatggtgctctccagcct gccatgcaggcgcttgccctgggtcctggccttaaatcccaggcagaaatcaaacatccc gacgcagatgcaaagcccagaccacagcaagtcagaaaacagtgtggccaggactccaga acccaggcaccagacaaggagcctgcccccgtccccacccagactttccagaaccccgca aagaaagcaagattcagctccttccagacccctgcactgagaactcagctcccggatgtg ggcgctgtgcagacactccagcctccccgcactgcaactggacttggatccaaagaggca cccaaggcgaccgcagagacagcagccaccaagacagcaaccctgcagcccagagtcaac ctccagcccgcacccagctcacctttcctgggcccagcccagggctgccccgtcctccag cctggaccacccattcatgttccagggaggcccggcagtgtcaccttcatgagaggggac aagggacagaagagccccaggttcagaatgcctcccacatcccgtcctcctgaaaactct gcttctgctcagagccctcgcttctcaaggcagcctgaggggcagggtccccaggtctca acgagtgtcctctatgaggaccttctggtcacttcctcctctgaggacagtgacagtgac tga >gi568815579r:53166763_53367665|GENSCAN_predicted_peptide_4|236_aa MTGYEARLITFGTWMYSVNKEQLARAGFYAIGQEDKVQCFHCGGGLANWKPKEDPWEQHA 
KWYPGCKYLLEEKGHEYINNIHLTRSLEGALVQTTKKTPSLTKRISDTIFPNPMLQEAIR MGFDFKDVKKIMEERIQTSGSNYKTLEVLVADLVSAQKDTTENELNQTSLQREISPEEPL RRLQEEKLCKICMDRHIAVVFIPCGHLVTCKQCAEAVDRCPMCSAVIDFKQRVFMS >gi568815579r:53166763_53367665|GENSCAN_predicted_CDS_4|711_bp atgacgggttatgaagcccggctcattacttttgggacatggatgtactccgttaacaaa gagcagcttgcaagagctggattttatgctataggtcaagaggataaagtacagtgcttt cactgtggaggagggctagccaactggaagcccaaggaagatccttgggaacagcatgct aaatggtatccaggttgcaaatatctgctagaagagaagggacatgaatatataaacaac attcatttaacccgttcacttgagggagctctggtacaaactaccaagaaaacaccatca ctaactaaaagaatcagtgataccatcttccctaatcctatgctacaagaagctatacga atgggatttgatttcaaggacgttaagaaaataatggaggaaagaattcaaacatctggg agcaactataaaacgcttgaggttcttgttgcagatctagtgagcgctcagaaagacact acagaaaatgaattgaatcagacttcattgcagagagaaatcagccctgaagagccgcta aggcgtctgcaagaggagaagctttgtaaaatctgcatggacagacatatcgctgttgtt tttattccttgtggacatctggtcacttgtaaacaatgtgctgaagcagttgacagatgt cccatgtgcagcgcggttattgatttcaagcaaagagtttttatgtcttaa >gi568815579r:53166763_53367665|GENSCAN_predicted_peptide_5|412_aa MSVEDVSIPPSRKHCILFPAHQLGHCGTKSNNHLMKKNDQKFKDLPQMAGHPIHHQAQGP PGAKNPQEQQRRPMGQTTLPAEQEESRVKCKNCGAFGHSARNKTCPIKRWSGALPLQVLG SHKEKENLKPAKAQLPFTTPGPFTTNDREKERSPSPQQQQNRAPKQTFPRTPQEKTQEAW KEPAEACSFLRPAMQALALGPGLKSQAEIKHPVADAKPRPQQVRKQCGQDSRTQAPDKEP GPVPTQTFQNPAKKARFSSFQTPALRTQLPDVGAVQTLQPPRTATGLGSKQAPEATAQTA ATKTATLQPRVNLQPAPSSPFLGPAQGCPVLQPGPPIHVPGRPGSVTFMRGDKGQKSPRF RTPPTSRPPENSASAQSPRFSRQPEGRGPQVSTSVLYEDLLVTSSSEDSDSD >gi568815579r:53166763_53367665|GENSCAN_predicted_CDS_5|1239_bp atgagtgtggaagatgtttctatacctccttctcggaaacactgcatcctatttcctgcg catcaactcggacattgcgggacgaaatccaacaaccacttaatgaagaagaatgatcag aagttcaaggacctgccacagatggccggtcatcccatccatcatcaggctcaggggccc cccggagccaagaacccacaggagcagcagaggagacccatggggcagacgactctccca gcagagcaggaggaatccagggtgaaatgcaagaactgcggggcctttgggcactcagcc aggaacaagacctgccccattaagaggtggagtggggcccttcctctgcaggtcctgggc tcacacaaggagaaggagaacctgaaaccagcaaaggcccagctaccctttacgactcca 
gggccctttacgacgaatgacagagaaaaggagcgaagtccaagtccccagcagcagcag aacagagctccgaagcagacatttcccagaactccccaagagaaaacgcaggaagcctgg aaggagccagcggaagcctgttcattcctgaggcctgccatgcaggcgcttgccctgggt cctggccttaaatcccaggcagaaatcaaacatcccgttgcagatgcaaagcccagacca cagcaagtcagaaaacagtgtggccaggactccagaacccaggcaccagacaaggagcct ggccccgtccccacccagactttccagaaccccgcaaagaaagcaagattcagctccttc cagacccctgcactgagaactcagctcccggatgtgggcgctgtgcagacactccagcct ccccgcactgcaactggacttggatccaaacaggcacccgaggcgactgcacagacagca gccaccaagacagcaaccctgcagcccagagtcaacctccagcccgcacccagctcacct ttcctgggcccagcccagggctgccccgtcctccagcctggaccacccattcatgttcca gggaggcccggcagtgtcaccttcatgagaggggacaagggacagaagagccccaggttc agaacgcctcccacatcccgtcctcctgaaaactctgcttctgctcagagccctcgcttc tcaaggcagcctgaggggcggggtccccaggtctcaacgagtgtcctctatgaggacctt cttgtcacttcctcctctgaggacagtgacagtgactga >gi568815579r:53166763_53367665|GENSCAN_predicted_peptide_6|1158_aa MTAAILQIGTWCQDTNTCADAASVVFKGDHGQLQEIQVGKLKEKAPKHVGFSVLLCWIVC MLVNIIFPMYVTGKWNYTNITVNEDLGYCSGGGNNKIAQTLRAMLLSFPDVLCLGLMLWV SSSMVCILHRHKQRVQHIDRSDLSPRASPENRATQSILILASYQKVKKNFLQCDCFSLWE ACSDNLEFEPEEKGLLTFRDVAIEFSQEEWKCLDPAQRTLYRDVMLENYRNLVSLDISSK CMMKEFSSTAQGNTEVIHTGTLQRHERHHIGDFCFQEMEKDIHDFEFQWKEDERNSHEAP MTEIKQLTGSTNRHDQRHAGNKPIKDQLGSSFHSHLPELHMFQTEGKIGNQVEKSINSAS LVSTSQRISCRPKTHISKNYGNNFLNSSLLTQKQEVHMREKSFQCNESGKAFNYSSVLRK HQIIHLGAKQYKCDVCGKVFNQKRYLACHRRCHTGKKPYKCNDCGKTFSQELTLTCHHRL HTGEKHYKCSECGKTFSRNSALVIHKAIHTGEKSYKCNECGKTFSQTSYLVYHRRLHTGE KPYKCEECDKAFSFKSNLERHRKIHTGEKPYKCNECSRTFSRKSSLTRHRRLHTGEKPYK CNDCGKTFSQMSSLVYHRRLHTGEKPYKCEECDEAFSFKSNLERHRRIHTGEKPYKCNDC GKTFSQTSSLVYHRRLHTGEKPYKCEECDEAFSFKSNLERHRIIHTGEKLYKCNECGKTF SRKSSLTRHCRLHTGEKPYQCNECGKAFRGQSALIYHQAIHGIGKLYKCNDCHQVFSNAT TIANHWRIHNEERSYKCNRCGKFFRHRSYLAVHWRTHSGEKPYKCEECDEAFSFKSNLQR HRRIHTGEKPYRCNECGKTFSRKSYLTCHRRLHTGEKPYKCNECGKTFGRNSALIIHKAI HTGEKPYKCNECGKAFSQKSSLTCHLRLHTGEKPYKCEECDKVFSRKSSLEKHRRIHTGE KPYKCKVCDKAFGRDSHLAQHTRIHTGEKPYKCNECGKNFRHNSALVIHKAIHSGEKPYK CNECGKTFRHNSALEIHKAIHTGEKPYKCSECGKVFNRKANLSRHHRLHTGEKPYKCNKC 
GKVFNQQAHLACHHRIHTGEKPYKCNECGKTFRHNSVLVIHKTIHTGEKPYKCNECGKVF NRKAKLARHHRIHTGKKH >gi568815579r:53166763_53367665|GENSCAN_predicted_CDS_6|3477_bp atgaccgccgctattctccagataggaacctggtgtcaagatacaaatacgtgtgctgac gctgcgagtgtggtgtttaaaggtgatcacggtcagctccaggaaatccaggtgggaaaa cttaaagagaaagcccccaagcatgttggcttttctgttctcctgtgctggatcgtgtgc atgttggtaaacatcatctttcccatgtatgtgactggcaaatggaactacacaaacatc acagtgaacgaggatttgggatactgttctgggggaggcaacaacaaaatcgcacagaca ctgcgtgcaatgttgttatcattccctgatgtgttgtgtctggggctcatgctctgggtc agcagctccatggtttgcatcctgcacaggcacaagcagcgggtccagcacattgatagg agcgatctctcccccagagcctccccagagaacagagctacgcagagcatcctcatcctg gccagttaccaaaaggtaaagaaaaacttcctgcaatgtgattgcttctccctatgggaa gcctgttcagataacctggaatttgaacctgaagaaaagggtctattgacattcagggat gtggccatagaattctctcaggaagagtggaagtgcctggaccctgctcagaggactcta tacagggacgtgatgctggagaattataggaacctggtctccctggatatctcttccaaa tgcatgatgaaggagttctcatcaacagcacaaggcaatacagaagtgatccacacaggg acattgcaaagacatgaacgtcatcacattggagatttttgcttccaggaaatggagaaa gatattcatgattttgagtttcagtggaaagaagatgaaagaaatagccatgaagcaccc atgacagaaatcaaacagttgacgggtagtacaaaccgacatgatcaaaggcatgctgga aacaagcctattaaagatcagcttggatcaagctttcattcgcatctgcctgaactccac atgtttcagaccgaagggaaaattggtaatcaagttgagaagtctatcaacagtgcttcg ttggtttcaacatcccaaagaatttcttgtaggcctaaaacccacatttctaagaactat gggaataatttcctgaattcttcattactcacacaaaagcaggaagtacacatgagagaa aaatctttccaatgtaatgagagtggcaaagcctttaattatagctcagtcttaaggaaa catcagataatccatttaggagcgaaacaatataaatgtgatgtgtgtggcaaggtcttt aatcaaaagcgatatcttgcatgtcatcgtagatgtcacactggcaagaaaccttacaag tgtaatgattgtggcaagaccttcagtcaggagttaacccttacatgccatcatagactt catactggagagaaacattacaagtgcagtgagtgtggcaagaccttcagtcgaaattca gcccttgtaattcataaggcaattcatactggagagaaatcttacaagtgtaatgaatgt ggcaagaccttcagtcaaacgtcataccttgtgtaccatcgtagacttcatactggagag aaaccttacaaatgtgaagaatgtgacaaagctttcagtttcaaatcaaaccttgaaaga cataggaaaattcatactggagagaaaccttacaagtgtaatgaatgcagcaggaccttt agtcggaagtcatcccttacacgccatcgtagacttcatactggagagaaaccttataag 
tgtaatgattgtggcaagaccttcagtcagatgtcatcccttgtataccatcgtagactt catactggagagaaaccttacaaatgtgaagaatgtgatgaagctttcagtttcaaatcg aaccttgaaagacataggagaattcatactggagagaaaccttacaagtgtaatgattgt ggcaagaccttcagtcagacatcatcccttgtataccatcgtagacttcatactggagag aaaccttacaaatgtgaagaatgtgatgaagctttcagtttcaaatcaaaccttgaaaga cataggataattcatactggagagaaactttacaagtgtaatgaatgtggcaagaccttt agtcggaagtcatcccttacacgccattgtagacttcatactggagagaaaccttaccag tgtaatgagtgtggcaaagcctttcgtgggcagtcagcacttatttaccatcaagcaatc catggtatagggaaactttacaaatgtaatgattgtcaccaagtctttagtaatgctaca accattgcaaatcattggagaatccataatgaagagagatcgtacaagtgtaatagatgt ggcaaatttttcagacatcgttcataccttgcagttcattggcgaactcatagtggagag aaaccttacaaatgtgaagaatgtgatgaagctttcagtttcaaatcaaaccttcaaaga cataggagaattcatactggagagaaaccttacaggtgtaatgaatgtggcaagaccttt agtcggaagtcataccttacatgccatcgtagacttcatactggagagaaaccttacaag tgtaatgagtgtggcaagaccttcggtcgaaattcagcccttataattcacaaggcaatt catactggagagaaaccttacaagtgtaatgagtgtggcaaggccttcagtcagaagtca tcccttacatgccatcttagacttcatactggagagaaaccttacaaatgtgaagaatgt gacaaagttttcagtcgcaaatcaagccttgaaaaacacaggagaattcatactggagag aaaccatacaaatgtaaggtttgtgacaaagcttttgggcgtgattcacacctggcacaa catactagaattcacactggagagaaaccttacaagtgtaatgaatgtggcaagaacttc cgtcacaattcagcccttgtaattcataaggcaattcatagtggagagaaaccttacaag tgtaatgagtgtggcaagaccttccgtcacaattcagcccttgaaattcataaggcaatt catactggagaaaaaccttacaagtgtagtgaatgtggcaaggtttttaatagaaaagca aacctttcacgtcatcatagacttcatactggagagaaaccttacaagtgtaataaatgt ggtaaggtttttaatcaacaagcacaccttgcatgtcatcatagaattcatactggagag aaaccttacaagtgtaatgagtgtggcaaaaccttccgtcacaattcagtccttgtaatt cataagacaattcatactggagagaaaccttacaagtgtaatgaatgtggcaaggttttt aatcgaaaagcaaaacttgcacgtcatcatagaattcatactggaaagaaacattag >gi568815579r:53166763_53367665|GENSCAN_predicted_peptide_7|88_aa METPQSNTIDSQGEQNGDVRRTDEVAIHQESRAADLGTIKEADTVTKKKPREHKGDTNSR EHAACSFDDCINGGVPSSSEETATIENG >gi568815579r:53166763_53367665|GENSCAN_predicted_CDS_7|267_bp atggaaacaccacaatcaaacaccatcgattcacaaggtgaacaaaatggtgatgtcaga 
agaacagatgaagttgccatccaccaagaaagcagagccgccgacttgggcacaattaaa gaagctgacacagttaccaaaaaaaagcctagagaacacaaaggtgacacaaactccaga gaacacgctgcttgcagctttgatgattgtatcaacggtggtgtacccagcagctccgaa gagacagcgaccattgagaatgggtga