GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:15:39

Sequence gi568815579r:57808553_58012191 : 203639 bp : 46.40% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -    551    234  318  0  0   92   78    85 0.251   3.83
 1.04 Intr -   4868   4742  127  2  1   47  106   125 0.912  10.45
 1.03 Intr -   6209   6159   51  2  0   63   91    36 0.313   0.50
 1.02 Intr -  11245  11055  191  2  2   75    8   181 0.297   8.10
 1.01 Init -  14360  14339   22  2  1   91   37    11 0.044  -3.74
 1.00 Prom -  15702  15663   40                               -6.66

 2.00 Prom +  15856  15895   40                               -5.86
 2.01 Init +  18204  18267   64  2  1   78   42    83 0.029   2.03
 2.02 Intr +  27258  27325   68  1  2   79  100    37 0.125   2.62
 2.03 Intr +  30471  30597  127  2  1   47  106   107 0.972   8.65
 2.04 Term +  32286  34024 1739  1  2   92   48   654 0.466  49.53
 2.05 PlyA +  34180  34185    6                                1.05

 3.00 Prom +  37429  37468   40                               -8.36
 3.01 Init +  40995  41049   55  2  1   71   99    46 0.831   5.45
 3.02 Intr +  41548  41603   56  2  2  112   17    60 0.059  -0.00
 3.03 Intr +  47555  47681  127  1  1   43  106    99 0.753   7.45
 3.04 Term +  50024  51588 1565  0  2   93   48   803 0.638  67.35
 3.05 PlyA +  57687  57692    6                                1.05

 4.07 PlyA -  57733  57728    6                                1.05
 4.06 Term -  66674  64270 2405  0  2   92   48  1140 0.422  95.78
 4.05 Intr -  68490  68364  127  0  1   52  106   101 0.978   8.55
 4.04 Intr -  71109  71042   68  1  2   79  100    37 0.161   2.62
 4.03 Intr -  80186  80157   30  0  0  101   54    59 0.112   1.90
 4.02 Intr -  80325  80215  111  1  0   53   91    53 0.726   2.45
 4.01 Init -  84308  84239   70  2  1   94   35    22 0.109  -1.29
 4.00 Prom -  87044  87005   40                               -2.66

 5.00 Prom +  91670  91709   40                               -3.56
 5.01 Init +  93122  93235  114  1  0  101  103    66 0.893   9.61
 5.02 Term +  95217  95321  105  2  0  107   54    39 0.834   0.81
 5.03 PlyA +  95755  95760    6                                1.05

 6.04 PlyA -  96065  96060    6                               -0.45
 6.03 Term - 101562  99998 1565  1  2   80   48   833 0.924  69.05
 6.02 Intr - 103634 103508  127  2  1   44  106    99 0.737   7.55
 6.01 Init - 108171 108067  105  0  0   89   83    62 0.452   6.02
 6.00 Prom - 114003 113964   40                               -7.66

 7.05 PlyA - 115404 115399    6                                1.05
 7.04 Term - 119244 117598 1647  0  0   32   48  1169 0.914  96.88
 7.03 Intr - 119495 119425   71  0  2   89   76    13 0.558  -0.90
 7.02 Intr - 122002 121876  127  1  1   29  106    51 0.400   1.25
 7.01 Init - 126641 126528  114  2  0   82   46   129 0.593   6.34
 7.00 Prom - 127600 127561   40                               -8.66

 8.18 PlyA - 127999 127994    6                                1.05
 8.17 Term - 134095 132372 1724  2  2  106   43   802 0.716  65.65
 8.16 Intr - 135508 135382  127  1  1   63   82   125 0.997   9.65
 8.15 Intr - 137717 137574  144  2  0   23   89   108 0.485   4.88
 8.14 Intr - 138928 138890   39  1  0   92   91    89 0.981   8.02
 8.13 Intr - 141066 140871  196  2  1   70   74    37 0.107  -0.08
 8.12 Intr - 144829 144649  181  2  1   13   44   200 0.269   7.03
 8.11 Intr - 152999 152839  161  1  2   46   72   156 0.530   9.43
 8.10 Intr - 158080 157978  103  2  1  109   92    42 0.917   5.93
 8.09 Intr - 163952 163911   42  0  0   59  121    28 0.017   1.41
 8.08 Intr - 171727 169892 1836  2  0  115   41   738 0.003  59.13
 8.07 Intr - 179750 179655   96  0  0   90   94    72 0.932   7.98
 8.06 Intr - 180162 180043  120  1  0  -38   82   232 0.699  10.47
 8.05 Intr - 180479 180377  103  2  1   63   70   158 0.785  11.25
 8.04 Intr - 191344 191158  187  0  1  108   45    40 0.391   1.39
 8.03 Intr - 192791 192737   55  0  1   68   84    53 0.424   0.94
 8.02 Intr - 193705 193572  134  0  2   76   40    86 0.579   2.89
 8.01 Init - 194529 194228  302  0  2   29   78   256 0.647  13.43

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 14338 14305 34 1 1 76 105 25 0.807 0.80 S.002 Init + 47346 47480 135 2 0 81 57 57 0.847 2.05 S.003 Sngl - 171368 169749 1620 2 0 64 38 847 0.881 72.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:57808553_58012191|GENSCAN_predicted_peptide_1|237_aa MVTHTLSVPRSSPGPLTGQTRVSQINVAATRGPRKSRPYIRRTHGNASFRALIGSDGIHS TQSFLSNVVDQLTPRGPMAAAALRFPVQGTVTFEDVAVKFTQEEWNLLSEAQRCLYRDVT LENLALMSSLGCWCGVEDEAAPSKQSIYIQRETQVRTPMAGVSPKKAHPCEMCGPILGDI LHVADHQGTHHKQKLHRCEAWGNKLYDSGNFHQHQNEHIGEKPYRGSVEEALFAKRX >gi568815579r:57808553_58012191|GENSCAN_predicted_CDS_1|711_bp atggtcactcacaccctctcagtcccacgtagcagtccggggccactcactgggcaaaca cgcgtatcgcagatcaacgtggccgctacaagaggaccccggaagtctcggccctacatt aggcgcacacacggaaatgcttctttccgagccctcattggctctgacggcattcattca acgcagagctttctgagtaatgtagttgaccagcttaccccacgtggtcccatggcggcg gccgcgctaaggttccccgttcagggcacagtgacttttgaagacgtggctgtgaaattt acccaggaggaatggaatctccttagtgaggctcagagatgcctgtaccgtgatgtgacg ctggagaacctggcacttatgtcctccctgggttgttggtgtggagtggaagatgaggcg gcaccttctaagcagagtatttatatacaaagagagactcaggtcaggactcctatggca ggtgtgtctcccaagaaggcccacccctgtgagatgtgtggcccgatcttgggagacatt ttgcatgtggcagatcatcagggaacacatcacaagcagaaactgcacaggtgtgaggcc tgggggaataaattgtatgacagtggaaactttcatcagcaccagaatgagcacattgga gagaaaccctacagagggagtgttgaggaggcgttgtttgcgaagaggtnn >gi568815579r:57808553_58012191|GENSCAN_predicted_peptide_2|665_aa MVSTCTPMAAADAGWAGSATGGRDIWEGIDIGDIQYMIANVWTKGTVTFEDVAVKFTQEE WNLLSEAQRCLYRDVTLENLALMSSLGCWCGVEDEAAPSKQSIYIQRETQVRTPVTGVSP KKAHPCEMCGPILGDILHVADHQGTHHKQKLHRCEAWGNKLYDSGNFHQHQNEHIGEKPY RGSVEEALFVKRCKLHVSGESSVFSESGKDFLPRSGLLQQEASHTGEKSNSKTECVSPFQ CGGAHYSHGDSMKHFSTKHILSQHQRLLPREECYVCCECGKSFSKYVSFSNHQRVHSGKR PYECGECEKSFSQKSSLIQHQQFHTGGKPYGCEECGKYFSLEGYLRRHQKVHAGKGPYEC GECGKSFSSNVNLKSHQRIHTGERPYKCGECEKSFSRKPSLSYHQRIHTEVRPYKCGECG KSYISKGHLRIHQRMHTGERPYKCGDCGKSFNEKGHLRSHQRVHTTERPYKCGECGKCFS 
HKGNLILHQHGHTRKRPYMCWECGKLFKKKSHLLVHQRIHSGEKPYACEACQKFFRHKCH LTAHQRVHTGERPYECSDCGKSFTHSCAFIVHKRVHTGQKPYECSECGKSFAASSYLTSH RRVHTGQKPYECSECGKSFAGISSLTNHRRVHTGEKPYGCSECEKKFRKSSSLRYHQRVH ERKAL >gi568815579r:57808553_58012191|GENSCAN_predicted_CDS_2|1998_bp atggtttccacctgcactcccatggcggcggccgatgctgggtgggcgggtagcgcgacg ggcggtcgggatatctgggaaggaattgatattggagacatccagtatatgatagctaat gtgtggactaagggcacagtgacttttgaagacgtggctgtgaaatttacccaggaggaa tggaatctccttagtgaggctcagagatgcctgtaccgtgatgtgactctggagaacctg gcacttatgtcctccctgggttgttggtgtggagtggaagatgaggcggcaccttctaag cagagtatttatatacaaagagagactcaggtcaggactcctgtgacaggtgtgtctccc aagaaggcccacccctgtgagatgtgtggcccgatcttgggagacattttgcatgtggca gatcatcagggaacacatcacaagcagaaactgcacaggtgtgaggcctgggggaataaa ttgtatgacagtggaaactttcatcagcaccagaatgagcacattggagagaaaccctac agagggagtgttgaggaggcgttgtttgtgaagaggtgtaagttgcatgtgtcaggggag tcatctgtcttcagtgagagtgggaaggactttttgcccaggtcaggattactccagcag gaggccagtcacactggggagaagtcaaacagcaaaactgagtgtgtgtctccctttcag tgtgggggagctcactatagccatggagattccatgaaacattttagcaccaaacatata ctcagtcagcaccagagacttctccctcgagaagaatgttatgtgtgctgtgaatgtggg aaatcctttagcaaatatgttagcttcagtaatcatcagagagttcacagtggaaaaaga ccttatgaatgtggagaatgtgagaaatcttttagtcaaaagagcagcctcattcaacat cagcaatttcacactggaggaaaaccttatgggtgtgaagaatgtgggaaatattttagc ttagaaggatatcttaggcgccatcaaaaagttcacgctggaaaagggccttatgagtgt ggagaatgtgggaaatcttttagttcaaacgtgaaccttaagagtcatcagcgcattcac actggagagagaccttacaagtgtggagaatgtgagaaatcttttagtcggaagcccagc cttagttaccatcagcgcattcacactgaagtaagaccttacaagtgtggagaatgtggg aaatcttatatttcaaaggggcaccttaggatccatcagcgcatgcacactggagaaaga ccttacaagtgtggagactgtgggaaatcttttaatgaaaaaggacaccttaggagtcat cagcgagttcacactacagaaagaccttataagtgtggggaatgtgggaaatgttttagt cacaagggtaacctcattctacaccagcatggccatactagaaaaaggccttatatgtgt tgggaatgtggaaaattatttaagaagaagtctcacctccttgtacaccagagaattcac agtggagagaagccatatgcttgtgaggcttgtcagaaattttttaggcacaagtgccac ctcactgcacaccagagagttcacactggagaaaggccatatgaatgcagtgattgtggg 
aagtcatttacccacagctgtgcattcattgttcataagagagttcacactggtcagaag ccttatgagtgcagtgaatgtgggaaatcttttgctgcaagctcctatctcactagtcac aggagagttcacactggtcagaagccttatgagtgcagtgaatgtgggaaatcttttgct ggaatctccagtctcactaatcacaggagagttcacactggagaaaagccttatgggtgt agtgaatgtgaaaaaaaatttaggaaaagctcttcacttcgttaccatcagagagttcat gaaagaaaggccttatga >gi568815579r:57808553_58012191|GENSCAN_predicted_peptide_3|600_aa MPAPPTVDGSSHTEGPSSGHPIVTQVLNQRGSGSCSWGTVTFEDVAVNFSQEEWCLLSEA QRCLYRDVMLENLALISSLGCWCGSKDEEAPCKQRISVQRESQSRTPRAGVSPKKAHPCE MCGLILEDVFHFADHQETHHKQKLNRSGACGKNLDDTAYLHQHQKQHIGEKFYRKSVREA SFVKKRKLRVSQEPFVFREFGKDVLPSSGLCQEEAAVEKTDSETMHGPPFQEGKTNYSCG KRTKAFSTKHSVIPHQKLFTRDGCYVCSDCGKSFSRYVSFSNHQRDHTAKGPYDCGECGK SYSRKSSLIQHQRVHTGQTAYPCEECGKSFSQKGSLISHQLVHTGEGPYECRECGKSFGQ KGNLIQHQQGHTGERAYHCGECGKSFRQKFCFINHQRVHTGERPYKCGECGKSFGQKGNL VHHQRGHTGERPYECKECGKSFRYRSHLTEHQRLHTGERPYNCRECGKLFNRKYHLLVHE RVHTGERPYACEVCGKLFGNKHSVTIHQRIHTGERPYECSECGKSFLSSSALHVHKRVHS GQKPYKCSECGKSFSECSSLIKHRRIHTGERPYECTKCGKTFQRSSTLLHHQSSHRRKAL >gi568815579r:57808553_58012191|GENSCAN_predicted_CDS_3|1803_bp atgcctgctccacccacagttgatggcagcagccacactgagggacctagctcaggtcac cccatcgtcacccaggtcctaaaccagcgagggagcggctcctgctcgtggggcactgtg acctttgaagatgtggctgtgaacttttcccaggaggagtggtgtcttcttagtgaggct cagaggtgcttgtaccgtgatgtgatgctagagaacctggctctcatatcctcgctgggt tgttggtgtggatcaaaagatgaggaggcaccttgtaagcagagaatttctgtacaaaga gagtctcagagcaggactcctagggcaggtgtttctcctaagaaggctcacccctgtgaa atgtgtggcctcatcttggaggatgtttttcactttgctgaccaccaggaaactcatcac aagcagaagctgaacaggagtggagcatgtggaaaaaacttggatgacactgcatacctt catcagcaccagaagcagcatattggagagaaattctacagaaagagtgtcagagaagca tcgtttgtaaagaaacgtaagctcagggtgtcacaggagccatttgtcttccgcgagttt gggaaggacgttctgcccagttcaggattgtgccaagaagaagctgctgtagagaagaca gacagtgaaactatgcatggcccaccctttcaggagggaaaaactaattacagttgtgga aaacgcacaaaagccttcagcaccaaacactcagttattccacaccagaaacttttcact agagatggatgttatgtgtgcagtgattgtggaaaatcctttagcagatatgtcagcttc agtaatcatcagcgagatcacactgcaaaaggaccttatgattgtggagagtgtgggaaa 
tcttatagtcgaaagagcagccttattcaacatcagcgagtccacactggacagacagct tatccctgtgaggagtgcgggaaatcttttagtcagaagggcagccttattagccatcag cttgttcacactggagaagggccttatgagtgtagagaatgtgggaaatcttttggtcaa aagggtaacctcattcaacatcagcaaggtcacactggagagagagcttatcactgtggg gaatgtgggaaatcttttcgtcagaagttctgctttattaaccatcagcgtgttcacact ggagaaaggccttacaagtgtggagaatgtgggaaatcttttggtcaaaagggcaacctc gttcaccatcagcgaggtcatactggagaaaggccctatgagtgcaaggaatgtgggaaa tcatttaggtacagatcccacctcactgaacaccagagacttcacactggggaaagacct tacaattgtagggaatgtgggaaattatttaacaggaagtatcatcttctggttcatgag agagttcacactggagaaaggccatatgcgtgtgaggtatgtgggaaattatttggcaat aagcacagcgtgactatacatcagaggattcacactggagaaaggccgtatgaatgcagt gaatgtgggaaatcatttctttccagctctgcgcttcatgttcataaaagagttcattct ggacaaaagccttataagtgcagtgaatgtggaaaatccttttctgaatgttccagtctc attaaacacaggagaattcacactggagaaaggccttatgaatgcaccaaatgtggaaaa acatttcagcgaagctctaccctccttcatcatcagagttcacacaggagaaaggcctta tga >gi568815579r:57808553_58012191|GENSCAN_predicted_peptide_4|936_aa MVTLNNRKDKKGKGDREKHCLWQGYGPSCPYLLAGRPILPTLLKTTWFDGCRGYAEALRS GHLIVTQKQRGRDIWEGIDIGDIQYMIANVWTKGTVTFEDVAVNFTWEEWNLLSEAQRCL YRDVTLENLALISSLGCWCGVEDEAAPSKQSIYIQRETQVRTPMAGVSPKKAHPCEMCGP ILGDILHVADHQGTHHKQKLHRCEAWGNKLYDSGNFHQHQNEHIGEKPYRGSVEEALFAK RCKLHVSGESSVFSESGKDFLPRSGLLQQEASHTGEKSNSKTECVSPIQCGGAHYSCGES MKHFSTKHILSQHQRLLTREECYVCCECGKSFSKYASLSNHQRVHTEKKHECGECGKSFS KYVSFSNHQRVHTEKKHECGECGKSFSKYVSFSNHQRVHTGKRPYECGECGKSFSKYASF SNHQRVHTEKKHYECGECGKSFSKYVSFSNHQRVHTGKRPYECGECGKSFSKYASFSNHQ RVHTDKKHYECGECGKSFSQKSSLIQHQRFHTGEKPYGCEECGKSFSSEGHLRSHQRVHA GERPFKCGECVKSFSHKRSLVHHQRVHSGERPYQCGECGKSFSQKGNLVLHQRVHTGARP YECGECGKSFSSKGHLRNHQQIHTGDRLYECGECGKSFSHKGTLILHQRVHPRERSYGCG ECGKSFSSIGHLRSHQRVHTGERPYECGECGKSFSHKRSLVHHQRMHTGERPYKCGDCGK SFNEKGHLRNHQRVHTTERPFKCGECGKCFSHKGNLILHQHGHTGERPYVCRECGKLFKK KSHLLVHQRIHNGEKPYACEACQKFFRNKYQLIAHQRVHTGERPYECNDCGKSFTHSSTF CVHKRIHTGEKPYECSECGKSFAESSSFTKHKRVHTGEKPYECSECGKSFAESSSLTKHK RVHTGEKPYKCEKCGKLFNKKSHLLVHQSSHWRKAI 
>gi568815579r:57808553_58012191|GENSCAN_predicted_CDS_4|2811_bp atggtcaccctgaataacagaaaagataagaaagggaaaggagacagagaaaaacattgc ctgtggcagggatacggaccgtcgtgcccatatctcctggctggtcgccctatcctcccg actctgcttaaaaccacgtggttcgatggctgccgcggctacgctgaggctctccgctca ggtcacctcatcgtcacccaaaagcagcgaggtcgggatatctgggaaggaattgatatt ggagacatccagtatatgatagctaatgtgtggactaagggcacagtgacttttgaagat gtggctgtgaactttacctgggaggaatggaatctccttagtgaggctcagagatgcctg taccgtgatgtgacgctggagaacctggcacttatatcctccctgggttgttggtgtgga gtggaagatgaggcggcaccttctaagcagagtatttatatacaaagagagactcaggtc aggactcctatggcaggtgtgtctcccaagaaggcccacccctgtgagatgtgtggcccg atcttgggagacattttgcatgtggcagatcatcagggaacacatcacaagcagaaactg cacaggtgtgaggcctgggggaataaattgtatgacagtggaaactttcatcagcaccag aatgagcacattggagagaaaccctacagagggagtgttgaggaggcgttgtttgcaaag aggtgtaagttgcatgtgtcaggggagtcatctgtcttcagtgagagtgggaaggacttt ttgcccaggtcaggattactccagcaggaggccagtcacactggggagaagtcaaacagc aaaactgagtgtgtgtctcccattcagtgtgggggagctcactacagctgtggagaatcc atgaaacattttagcaccaaacatatactcagtcagcaccagagactgctcactagagaa gagtgttatgtgtgctgtgaatgtgggaagtcctttagcaaatatgctagcttgagtaat catcagagagttcacactgaaaaaaaacatgaatgtggagaatgtgggaaatcctttagc aaatatgttagcttcagtaatcatcagagagttcacactgaaaaaaaacatgaatgtgga gaatgtgggaaatcctttagcaaatatgttagcttcagtaatcatcagagagttcacact gggaaaagaccttatgaatgtggagaatgtgggaaatcgtttagcaaatatgctagcttc agtaatcatcagagagttcacactgaaaaaaaacattatgaatgtggagaatgtgggaaa tcctttagcaaatatgttagcttcagtaatcatcagagagttcacactgggaaaagacct tatgaatgtggagaatgtgggaaatcgtttagcaaatatgctagcttcagtaatcatcag agagttcacactgacaaaaaacattatgaatgtggagaatgtgggaaatcctttagtcaa aagagcagcctcattcaacatcagcgatttcacactggagaaaaaccttatgggtgtgaa gaatgtgggaaatcttttagttcagaaggacatcttaggagccatcaacgagttcacgcc ggagaaagacctttcaagtgtggagaatgtgtgaaatctttcagtcataagcgcagcctt gttcaccatcagcgagttcacagtggagaaagaccttatcagtgtggagaatgtgggaaa tctttcagtcaaaagggcaacctcgttctacaccagcgagttcacactggagcaagacct tatgagtgtggagaatgtgggaaatcatttagttcaaaaggacatcttaggaaccatcag 
caaattcacactggggacagactttatgagtgtggagagtgtgggaaatcttttagtcat aaaggcaccctcattctacatcagcgagttcaccctagagaaagatcttatgggtgtgga gaatgtgggaaatcttttagttcaatcgggcaccttaggagccatcagcgcgttcatact ggagagaggccttatgagtgtggagaatgtgggaaatcttttagtcataagcgcagcctt gttcaccatcagcgcatgcacactggagaaagaccttacaagtgtggagactgtgggaaa tcttttaatgaaaaaggacaccttaggaatcatcagcgagttcacactacagaaagacct tttaagtgtggggaatgtgggaaatgttttagtcacaagggtaacctcattctacaccag catggccatactggagaaagaccttatgtatgtagggaatgtggaaaattatttaagaag aagtctcacctccttgtacaccagagaattcacaatggagaaaagccatatgcttgtgaa gcttgtcagaaattttttagaaacaagtaccaactcattgcacatcagagagttcacact ggagaaaggccttatgaatgcaatgattgtggaaaatcatttacccacagctctacattc tgtgttcataagcgaattcacactggagaaaagccttatgagtgcagtgaatgtggaaaa tctttcgctgaaagctccagtttcacaaaacacaaaagagttcacactggagaaaagcct tatgagtgcagtgaatgtggaaaatcttttgctgaaagctccagtctcactaaacacaag agagttcacactggagaaaagccttataaatgtgagaaatgtgggaaattatttaacaag aagtctcacctccttgtacaccagagttcacactggagaaaagccatatga >gi568815579r:57808553_58012191|GENSCAN_predicted_peptide_5|72_aa MGTNQETGTDAEDNLRAQNKSKTTSNLMVAMMVVLTVAMRKPLLERKPLPRTSNDSCGGF AGQKRETCWQKS >gi568815579r:57808553_58012191|GENSCAN_predicted_CDS_5|219_bp atggggacaaatcaggaaactggcacagatgcagaggacaatctgagagcacagaacaaa tcaaaaacaactagtaacctaatggtggctatgatggtggtacttactgtggcgatgcgg aagccgcttctggagagaaagccgctgccaagaacaagcaatgatagctgtggcggtttt gcaggacaaaaaagggagacatgttggcagaagagctga >gi568815579r:57808553_58012191|GENSCAN_predicted_peptide_6|598_aa MESGELHYPESSALNDSRSEPMSTRKEGISTCARNGTVTFEDVAVNFSQEEWCLLSEAQR CLYRDVMLENLALISSLGCWCGSKDEEAPCKQRISVQRESQSRTPRAGVSPKKAHPCEMC GLILEDVFHFADHQETHHKQKLNRSGACGKNLDDTAYLHQHQKQHIGEKFYRKSVREASF VKKRKLRVSQEPFVFREFGKDVLPSSGLCQEAAAVEKTDSETMHGPPFQEGKTNYSCGKR TKAFSTKHSVIPHQKLFTRDGCYVCSDCGKSFSRYVSFSNHQRDHTAKGPYDCGECGKSY SRKSSLIQHQRVHTGKTAYPCEECGKSFSQKGSLISHQRVHTGERPYECREYGKSFGQKG NLIQHQQGHTGERAYHCGECGKSFRQKFCFINHQRVHTGERPYKCGECGKSFGQKGNLVQ HQRGHTGERPYECKECGKSFRYRSHLTEHQRLHTGERPYNCRECGKLFNRKYHLLVHERV 
HTGERPYACEVCGKLFGNKNCVTIHQRIHTGERPYECNECGKSFLSSSALHVHKRVHSGQ KPYKCSECGKSFAECSSLIKHRRIHTGERPYECTKCGKTFQRSSTLLHHQSSHRRKAL >gi568815579r:57808553_58012191|GENSCAN_predicted_CDS_6|1797_bp atggaaagtggggaactacattacccagaaagctctgcgttaaacgacagccggtcagag ccaatgagcactcggaaagaaggcatttccacgtgtgcacgtaacggcactgtgaccttt gaagatgtggctgtgaacttttcccaggaggagtggtgtcttcttagtgaggctcagagg tgcttgtaccgtgatgtgatgctagagaacctggctctcatatcctcgctgggttgttgg tgtggatcaaaagatgaggaggcaccttgtaagcagagaatttctgtacaaagagagtct cagagcaggactcctagggcaggtgtttctcctaagaaggctcacccctgtgaaatgtgt ggcctcatcttggaggatgtttttcactttgctgaccaccaggaaactcatcacaagcag aagctgaacaggagtggagcatgtggaaaaaacttggatgacactgcataccttcatcag caccagaagcagcatattggagagaaattctacagaaagagtgtcagagaagcatcgttt gtaaagaaacgtaagctcagggtgtcacaggagccatttgtcttccgcgagtttgggaag gacgttctgcccagttcaggattgtgccaagaagcagctgctgtagagaagacagacagt gaaactatgcatggcccaccctttcaggagggaaaaactaattacagttgtggaaaacgc acaaaagccttcagcaccaaacactcagttattccacaccagaaacttttcactagagat ggatgttatgtgtgcagtgattgtggaaaatcctttagcagatatgtcagcttcagtaat catcagcgagatcacactgcaaaaggaccttatgattgtggagagtgtgggaaatcttat agtcgaaagagcagccttattcaacatcagcgagtccacactggaaagacagcttatccc tgtgaggagtgcgggaaatcttttagtcagaagggcagccttattagccatcagcgtgtt cacactggagaaaggccttatgagtgtagagaatatgggaaatcttttggtcaaaagggt aacctcattcaacatcagcaaggtcacactggagagagagcttatcactgtggggaatgt gggaaatcttttcgtcagaagttctgctttattaaccatcagcgtgttcacactggagaa aggccttacaagtgtggagaatgtggaaaatcttttggtcaaaagggcaacctcgttcaa catcagcgaggtcatactggagaaaggccctatgagtgcaaggaatgtgggaaatcattt aggtacagatcccacctcactgaacaccagagacttcacactggggaaagaccttacaat tgtagggaatgtgggaaattatttaacaggaagtatcatcttctcgttcatgagagagtt cacactggagaaaggccatatgcgtgtgaggtatgtgggaaattatttggtaataagaac tgcgtgactatacatcagaggattcacactggagaaaggccgtatgaatgcaatgaatgt gggaaatcatttctttccagctctgcgcttcatgttcataaaagagttcattctggacaa aagccttataagtgcagtgaatgtggaaaatcctttgctgaatgttccagtctcattaaa cacaggagaattcacactggagaaaggccttatgaatgtaccaaatgtggaaaaacattt 
cagcgaagctctaccctccttcatcatcagagttcacacaggagaaaggccttatga >gi568815579r:57808553_58012191|GENSCAN_predicted_peptide_7|652_aa MAAAALRLPAQVIVPSLPSGHLILTRILKQRESGDCSQGTVAFEDVAVNFSQEEWSLLSE VQRCLYHDVMLENWVLISSLGCWCGSEDEEAPSKKSISIQRVSQNQYLGEKPYRSSVEEA LFVKRCKFHVSEESSIFIQSGKDFLPSSGLLLQEATHTGEKSNSKPECESPFQWGDTHYS CGECMKHSSTKHVFVQQQRLPSREECYCWECGKSFSKYDSVSNHQRVHTGKRPYECGECG KSFSHKGSLVQHQRVHTGKRPYECGECGKSFSHKGSLVQHQRVHTGERPYECGECGKSFS QNGTLIKHQRVHTGERPYECEECGKCFTQKGNLIQHQRGHTSERPYECEECGKCFSQKGT LTEHHRVHTRERPYECGECGKSFSRKGHLRNHQRGHTGERPYECGECGKSFSRKGNLIQH QRSHTGERPYECRECRKLFRGKSHLIEHQRVHTGERPYECNECGKSFQDSSGFRVHQRVH TGEKPFECSECGKSFPQSCSLLRHRRVHTGERPYECGECGKSFHQSSSLLRHQKTHTAER PYECRECGKFFSSLLEHRRVHTGERPYECRECGKTFTRRSAHFKHQRLHTRGKPYECSEC GKSFAETFSLTEHRRVHTGERPYECSECGKSFHRSSSLLRHQRVHTERSPYK >gi568815579r:57808553_58012191|GENSCAN_predicted_CDS_7|1959_bp atggcggcggccgcgctgaggctcccggctcaggtaattgtgccttccctgccctcaggt cacctcatcctaacccgaatcctgaagcagcgagagagcggcgactgttcacagggcact gtggcatttgaagatgtggctgtgaacttttcccaggaggagtggagtctccttagtgag gttcagagatgcctttaccatgacgtgatgctggagaactgggtacttatatcctccctg ggttgttggtgtggatcagaagatgaggaggcaccttctaagaagagcatttctatacaa agagtgtctcagaatcagtaccttggagagaaaccctatagaagcagtgttgaggaagca ttgtttgtgaagaggtgtaagttccatgtgtcagaggagtcatctatcttcattcagagt ggaaaggactttttgcccagctcaggattactgctgcaggaggccactcacactggggag aagtcaaacagcaaacctgagtgtgagtctccctttcagtggggagatactcattacagc tgtggagaatgcatgaaacattctagcaccaaacacgtatttgttcaacagcagagactt ccctctagagaggaatgttattgctgggaatgtgggaaatcctttagcaaatatgatagc gtcagtaatcatcagagagttcacactgggaaaagaccttatgaatgtggagaatgtggg aaatcttttagtcataagggcagccttgttcagcatcagcgagttcacactgggaaaaga ccttatgaatgtggagaatgtgggaaatcttttagtcataagggcagccttgttcagcat cagcgagttcatactggagaaagaccttatgagtgtggagaatgtgggaaatcttttagt caaaatggtactctcattaaacatcaacgagttcacactggagaaagaccttatgagtgt gaagaatgtgggaaatgttttactcagaagggcaatctcattcaacatcaacgaggtcac actagtgaaagaccttatgagtgtgaagaatgtggaaaatgttttagtcaaaagggcacc 
ctaactgaacatcatcgagttcacactagagaacgaccttatgagtgtggagaatgtggg aaatcttttagtcgaaagggacaccttaggaaccatcagcgaggtcacactggagaaaga ccttacgagtgtggagaatgtgggaaatcttttagtcgaaagggcaacctcattcagcat cagcgaagccacactggagaaaggccttatgagtgtagagagtgtaggaaattatttagg ggcaagtcccacctcattgaacaccagagagttcacactggagaaaggccatatgaatgt aatgaatgtgggaaatcatttcaagacagctctgggtttcgtgttcatcagagagttcac actggagaaaaaccgtttgagtgtagtgaatgtgggaagtcatttcctcaaagctgttcc ctccttcgacatcggagagttcatactggagaaaggccttatgaatgtggagaatgtgga aagtcatttcatcagagctcttccctccttcgacatcagaaaactcacactgcagaaaga ccttatgagtgcagagaatgtgggaaattcttctccagtctccttgaacacaggagagtt cacactggagaaaggccttatgaatgcagggaatgtggaaaaacatttactcgaaggtct gcgcattttaaacatcagagacttcatactcgaggaaagccttacgagtgcagcgaatgt gggaaatcctttgctgaaaccttcagtcttactgaacacaggagagtacacactggagaa aggccttatgagtgcagtgaatgtggaaaatcatttcatcgaagctcttctctccttcga catcagagagttcacacagaaagaagtccttacaagtga >gi568815579r:57808553_58012191|GENSCAN_predicted_peptide_8|1849_aa MGKPRPAGPRLTQAFLQWEATVEEGGLGPGQQSPSCRRLRRRCQKGPNPAQAGRGGAAQE PVMAAPLPDAGSARLELQLPADPAARSGRRRGSSRHCPRLSRRFSPCQFTQSTVKDVRPG LETAPGSLKHAVTAAVTVKLQIMVQSAWSISPRDGSHQPVGLLGSVSSVSCLARGGKPGG GEEGHWAPSSPGSGEMGLSCSQRMMSRLSLVIPVDAALQFTTVGVWKRGVAKVGDIASVL EFGKSLRRDLLSFAVVIVDPVTFKDVAVDFTQEEWGQLDLVQRTLYRDVMLETYGHLLSV GNQIAKPEVISLLEQGEEPWSVEQACPQRTCPEWVRNLESKALIPAQSIFEEEQSHGMKL ERYIWDDPWFSRLEVLGCKDQLEMYHMNQSTAMRQMVFMQKQVLSQRSSEFCGLGAEFSQ NLNFVPSQRVSQIEHFYKPDTHAQSWRCDSAIMYADKVTCENNDYDKTVYQSIQPIYPAR IQTGDNLFKCTDAVKSFNHIIHFGDHKGIHTGEKLYEYKECHQIFNQSPSFNEHPRLHVG ENQYNYKEYENIFYFSSFMEHQKIGTVEKAYKYNEWEKVFGYDSFLTQHTSTYTAEKPYD YNECGTSFIWSSYLIQHKKTHTGEKPYECDKCGKVFRNRSALTKHERTHTGIKPYECNKC GKAFSWNSHLIVHKRIHTGEKPYVCNECGKSFNWNSHLIGHQRTHTGEKPFECTECGKSF SWSSHLIAHMRMHTGEKPFKCDECEKAFRDYSALSKHERTHSGAKPYKCTECGKSFSWSS HLIAHQRTHTGEKPYNCQECGKAFRERSALTKHEIIHSGIKPYECNKCGKSCSQMAHLVR HQRTHTGEKPYECNKCGKSFSQSCHLVAHRRIHTGEKPYKCNQCERSFNCSSHLIAHRRT HTGEKPYRCNECGKAFNESSSLIVHLRNHTGEKPYKCNHCEKAFSASPTNPMKFLRNKAI IRHRPALVKVILISSVAFSIALICGMAISYMIYRLAQAEERQQLESLYKNLRIPLLGDEE 
EGSEDEGESTHLLPENENELEKFIHSGESKCVQNKQILMEHILMEHVHELKAEKAQKKLL AAQAEAHRSENKQAHKWLKSTLRPRRRCWCLQQKPPAVCLVQSQTCKEPAPVPAPGAARP AAAASVPGCVHWLDPSLARSHTPCGSAPGSPLSQMAAAELTAPAQGVVQGADIQDVAVVR TKVSELWVTVRRGKLQVLVVKLLVGQDWEDDDRGIVTFEDVAVYFSWKEWGLLDEAQKCL YHDVMLENLTLTTSLGGSGAGDEEAPYQQSTSPQRVSQVRIPKALPSPQKTNPCEICGPV LRQILHLVEHQGTHHGQKLYTDGACRKQLQFTAYLHQHQKQHVGQKHFRSNGGRDMFLSS CTFEVSGKPFTCKEVGKDFLVRSRFLQQQAAHTRKKSNRTKSAVAFHSVKNHYNWGECVK AFSYKHVRVQHQGDLIRERSYMCSECGKSFSTSCSLSDHLRVHTSEKPYTCGECGKSYRQ SSSLITHRRIHTGVRPHQCDECGKLFNRKYDLLIHQRVHTGERPYKCSECGKSFSHSSSL ITHQRIHTGMRPYECSECGKSFIHSSSLITHQRVHTGTRPYMCSECGKSFSQSCHLIKHR RLHIGEGPYECSECGKLFTYRSRFFQHQRVHTGVRSHECHECGKLFSRKFDLIVHERVHT GERPYECSECGKSFTCKSYLISHWKVHTGARPYECGECGKSFTHSSTLLQHQRVHTGERP YECNECGKFFSQSSSLIRHRRSHTGERPYECSECWKSFSNHSSLVKHRRVHTGERPYECS ECGKSFSQSSNLTNHQRIHSGERPYECSDCGKFFTFNSNLLKHQNVHKG >gi568815579r:57808553_58012191|GENSCAN_predicted_CDS_8|5550_bp atgggcaaaccgaggcccgcggggccgcgacttacccaggcctttttgcagtgggaggcg acggtggaggagggtggtctgggaccgggacaacaaagtccatcctgccgccggcttcga cgcaggtgccagaaaggtcccaaccccgcccaggcagggcgaggcggcgcggcccaggaa cctgtcatggcggcgccgctgcccgacgccggaagtgcccgcctggaactacagctccca gcagaccccgcggcgcgctccggtcgacgccggggaagcagccgccattgtccgcggctg agccgtcgtttctccccctgccagttcacacaaagcactgtgaaggacgtcagacctggg ttggaaacagctccgggttcccttaaacatgcagtgacggcggcggtcactgtcaagttg caaataatggtgcaaagtgcctggtcaatcagtcctcgggatggcagccatcaacccgtg ggcctcctgggctctgtgtcctcagtatcctgcctggcacgtggagggaagcctggagga ggggaggagggccactgggctcccagcagcccaggttcaggtgagatggggctgtcctgt tcccagaggatgatgtccaggttatccctggtcatcccagttgatgcggctctccagttt acaacagttggagtttggaagaggggtgttgccaaggtgggggacatagccagtgtgctg gaatttggcaagagtcttcgaagagacctgctcagctttgctgtggttattgtggaccca gtgaccttcaaggacgtggccgtggacttcacccaagaagagtgggggcagctggacctt gttcagaggaccctgtaccgtgatgtgatgctggagacctatggtcacctgctctctgtg ggaaatcagattgccaagcctgaggtcatctccctgttggagcaaggagaagagccgtgg tcagtggagcaggcatgtcctcaacgcacttgtccagaatgggtgagaaatcttgaaagc 
aaagcattgatcccagcacagagcatttttgaggaagaacaatcccatggcatgaagttg gaaagatatatatgggatgatccttggttctccaggttagaagttttgggatgtaaagac caattagaaatgtaccacatgaaccagagtacagctatgaggcagatggtcttcatgcaa aagcaagtactatcccagagaagctctgaattctgtggacttggggcagagtttagccag aacttaaactttgttccatctcagagagtttctcagatagaacatttctataagcctgat acacatgctcaaagttggagatgtgactcagccataatgtatgcagataaggttacctgt gaaaataatgattatgacaaaactgtttatcagtccattcaacctatttaccctgcaaga atacaaactggagataatcttttcaaatgtactgatgctgttaaatctttcaatcatata atacattttggtgatcataaaggaattcacacaggagaaaaactctatgaatataaggaa tgccatcaaatctttaaccagagcccatcatttaatgaacacccaaggcttcatgttgga gaaaaccagtataattacaaagaatatgagaatatcttttatttctcatcctttatggaa catcaaaaaattggtactgtagagaaagcgtataaatacaatgaatgggagaaagtcttt gggtatgactctttccttactcaacatacaagcacttacactgcagagaaaccctatgac tacaatgaatgtgggacgtctttcatctggagctcttaccttattcaacataagaaaact catactggagaaaaaccctatgaatgtgataaatgtggaaaagtttttaggaatcgctca gcccttacgaaacatgaacggactcacactggaataaaaccctatgaatgtaataaatgt ggaaaagccttcagctggaattctcatcttattgtacataagagaattcatacaggagaa aaaccttatgtttgtaatgagtgtgggaaatctttcaactggaactctcatcttattgga catcagaggactcatacaggagagaaaccttttgaatgtactgaatgtgggaaatcattc agctggagctcccatcttattgcccatatgagaatgcatactggagagaaaccctttaaa tgtgatgaatgtgaaaaagcttttagggactactcagcccttagtaaacatgaaagaact cattctggagcaaaaccatataaatgtactgaatgtggaaaatccttcagctggagctcc catcttattgcccatcagagaactcacacgggagagaaaccatataactgtcaggaatgt ggcaaagcattcagagaacgctcagccctcactaaacatgagataattcattctggaatt aagccctatgaatgtaataaatgtggaaaatcctgtagccagatggctcaccttgttaga catcaaaggactcatactggagaaaaaccctatgaatgcaataaatgtggaaaatccttc agtcagagctgtcaccttgttgctcatcggagaattcacactggtgagaaaccctataaa tgtaatcagtgtgaaagatcctttaactgtagttctcacctcattgcacaccggagaact catactggagagaaaccatacaggtgtaatgaatgtgggaaagcatttaatgagagttca tcccttattgtacacctaagaaaccatactggagaaaagccctacaaatgtaatcattgt gaaaaagcattttctgcatcccctacgaaccccatgaaattcctgaggaataaagcaata attcggcatagacctgctcttgttaaagtaattttaatttcgagcgtagccttcagcatt 
gccctgatatgtgggatggcaatctcctatatgatatatcgactggcacaggctgaggaa agacaacagctcgagtcactttataagaacctcaggataccgttattaggagatgaagaa gagggctcagaggacgagggtgagtccacgcacctacttccagagaacgaaaatgagctg gaaaagttcatccactcaggtgaaagcaaatgtgttcaaaacaagcagattctcatggag cacatcctcatggagcacgtccacgagctgaaggcagaaaaggcccaaaagaagctcctg gctgcccaggctgaggcccacaggtctgagaacaagcaagcacacaagtggctgaagagt acactcaggccaagaaggagatgctggtgcctgcagcagaagccgcctgcagtatgtctg gtccagtcgcagacttgcaaggagccggcgcctgtgccagcgcctggagctgcccgcccc gccgcagcagccagcgtgcctggctgtgtgcactggctggacccttcacttgctcgctca cacaccccttgcggctccgcgcctggctcgcccttgagtcagatggcggcggccgagctg acggccccggcccagggagtggttcagggtgcagatattcaagatgtggctgttgtgcgc accaaggttagtgagctgtgggtcacagttaggcggggcaagctccaggttttggttgta aagctgttagtgggccaagactgggaggatgatgatagaggcattgtgacctttgaggac gtggctgtttacttctcctggaaggagtggggtcttcttgatgaggctcagaaatgcctg taccacgatgtgatgctggagaacttgacacttacaacctccctgggtggttctggagca ggggatgaggaggcaccttatcagcagagcacttctccacagcgggtgtcacaggttagg attcctaaggcccttccttctccccagaagaccaacccctgtgagatatgtggcccagtc ttgagacagattttgcacttggttgaacaccaaggaacacaccatggtcagaaactgtat acagacggggcatgtaggaaacaattacaatttactgcataccttcatcagcaccagaag cagcatgttggacagaaacacttcagaagcaatgggggcagagacatgtttttgagcagc tgcacatttgaagtatctgggaagcccttcacttgcaaggaggttgggaaggatttcctg gtgagatcaagatttcttcagcaacaggctgctcacaccagaaagaagtcaaacagaacc aagagtgcagtggcctttcacagtgtaaaaaatcattacaactggggagaatgtgtgaaa gctttcagctacaaacatgtacgtgttcagcaccagggagacctcattagggaaagatct tacatgtgcagtgaatgtgggaaatcttttagcacaagctgtagcctcagtgatcatttg agagttcacacttcagaaaagccttatacatgtggagaatgtgggaaatcctataggcaa agctctagccttattacgcaccgaagaattcacactggagtaagacctcatcaatgtgat gaatgtggaaaattatttaacaggaagtatgaccttcttatacatcagagagttcatact ggagaaaggccttacaagtgcagtgaatgtgggaaatcctttagccatagctctagcctc attacacaccagagaattcatactggaatgaggccttatgagtgcagtgaatgtgggaaa tcttttatccatagttctagccttattacacaccagagagttcacactggtacaaggcct tatatgtgcagtgaatgtgggaaatcctttagccagagctgtcacctcattaaacaccgg 
agacttcacattggagaagggccttatgagtgtagtgaatgtgggaaattgtttacttat agatctcgtttcttccaacaccagagagttcatactggagtaagatctcatgaatgtcat gaatgtggaaaattatttagcaggaaatttgacctcattgtacatgagagagttcacaca ggagaaaggccatatgagtgcagtgaatgtggaaaatcctttacctgtaaatcctacctc atctcacactggaaagttcatactggagcaaggccttatgaatgtggggagtgtgggaaa tcatttactcatagctctacgctccttcaacaccagagagttcacactggagaaaggcct tatgagtgcaatgaatgtgggaagttttttagccagagctccagcctcattagacatagg agaagtcacaccggagaaaggccttatgagtgcagtgagtgttggaaatcctttagtaac cactctagcctcgttaaacaccgaagagttcataccggagaaaggccttatgaatgcagt gaatgtggaaaatcctttagccagagctctaacctcactaatcaccagcgaattcacagt ggggaaaggccttatgagtgtagtgactgtggaaaattttttaccttcaactccaacctc ctaaaacatcagaacgttcacaagggataa