GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:18:22 Sequence gi568815581r:45410243_45650331 : 240089 bp : 47.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8742 8916 175 2 1 84 45 99 0.038 4.71 1.02 Intr + 13882 13915 34 2 1 121 47 5 0.017 -3.02 1.03 Term + 14179 14332 154 1 1 47 49 125 0.291 1.89 1.04 PlyA + 14573 14578 6 1.05 2.14 PlyA - 15592 15587 6 1.05 2.13 Term - 20055 19246 810 0 0 115 49 1134 0.815 105.39 2.12 Intr - 21410 21379 32 0 2 67 86 -7 0.465 -5.15 2.11 Intr - 22107 21974 134 2 2 118 81 100 0.719 12.59 2.10 Intr - 22692 22510 183 2 0 66 34 108 0.577 2.10 2.09 Intr - 24907 24815 93 0 0 80 80 30 0.301 0.48 2.08 Intr - 27727 27644 84 0 0 99 20 114 0.477 4.64 2.07 Intr - 29392 29235 158 1 2 70 85 247 0.994 21.41 2.06 Intr - 33920 33836 85 1 1 136 49 15 0.667 2.22 2.05 Intr - 35421 35228 194 0 2 61 83 300 0.918 25.09 2.04 Intr - 40521 40376 146 1 2 97 131 198 0.976 25.00 2.03 Intr - 44030 43113 918 0 0 113 55 1348 0.999 125.23 2.02 Intr - 44246 44215 32 1 2 62 93 11 0.875 -3.23 2.01 Init - 45328 45165 164 1 2 69 80 181 0.877 14.60 2.00 Prom - 47594 47555 40 -6.96 3.07 PlyA - 49229 49224 6 1.05 3.06 Term - 54967 54872 96 1 0 108 32 52 0.628 -0.43 3.05 Intr - 58351 57967 385 0 1 52 117 206 0.677 14.75 3.04 Intr - 65484 64858 627 2 0 82 70 731 0.406 62.02 3.03 Intr - 67905 67658 248 0 2 68 52 351 0.999 25.76 3.02 Intr - 76807 76625 183 1 0 112 78 58 0.729 7.18 3.01 Init - 80532 80410 123 0 0 79 97 87 0.893 6.89 3.00 Prom - 81912 81873 40 -5.66 4.00 Prom + 88410 88449 40 -5.96 4.01 Sngl + 90693 91028 336 2 0 65 45 204 0.878 10.06 4.02 PlyA + 91822 91827 6 1.05 5.13 PlyA - 94959 94954 6 1.05 5.12 Term - 98124 98021 104 1 2 36 36 110 0.476 -0.96 5.11 Intr - 98297 98243 55 2 1 107 88 63 0.296 6.75 5.10 Intr - 103047 102943 105 0 0 103 79 57 0.987 6.71 5.09 Intr - 103913 103626 288 2 0 6 78 253 0.578 13.54 5.08 Intr - 104925 104510 416 1 2 42 94 217 0.593 11.52 5.07 Intr - 105137 104997 141 0 0 30 38 175 0.674 7.02 5.06 Intr - 107040 106914 127 0 1 103 5 41 0.742 -2.45 5.05 Intr - 107906 107619 288 2 0 6 78 253 0.563 13.54 5.04 Intr - 108918 108761 158 1 2 42 59 118 0.713 4.03 5.03 Intr - 109130 108990 141 0 0 30 38 175 0.470 7.02 5.02 Intr - 111183 111065 119 2 2 98 72 5 0.462 0.01 5.01 Init - 118848 118787 62 0 2 106 36 58 0.626 3.12 5.00 Prom - 128539 128500 40 -1.86 6.03 PlyA - 129236 129231 6 1.05 6.02 Term - 139813 137481 2333 2 2 103 48 1197 0.069 102.58 6.01 Init - 145614 145470 145 0 1 67 66 98 0.048 5.78 6.00 Prom - 151251 151212 40 -3.06 7.00 Prom + 153860 153899 40 -3.56 7.01 Init + 175752 176667 916 2 1 75 69 986 0.946 87.63 7.02 Intr + 176707 176784 78 2 0 129 81 40 0.944 6.92 7.03 Term + 186758 186804 47 0 2 33 40 91 0.006 -3.83 7.04 PlyA + 187440 187445 6 1.05 8.03 PlyA - 187728 187723 6 1.05 8.02 Term - 191644 190627 1018 0 1 69 42 1648 0.730 149.89 8.01 Init - 192158 191773 386 2 2 82 26 419 0.563 31.41 8.00 Prom - 194950 194911 40 -5.66 9.00 Prom + 197390 197429 40 -8.56 9.01 Init + 198329 198672 344 1 2 75 75 382 0.139 32.31 9.02 Intr + 205797 205897 101 0 2 71 26 46 0.002 -3.55 9.03 Intr + 222808 222932 125 2 2 109 55 51 0.688 4.20 9.04 Term + 226041 226202 162 2 0 70 43 158 0.892 7.44 9.05 PlyA + 227690 227695 6 1.05 10.04 PlyA - 228137 228132 6 1.05 10.03 Term - 235678 235545 134 2 2 73 36 75 0.401 -0.95 10.02 Intr - 236495 236348 148 2 1 81 8 66 0.337 -2.29 10.01 Intr - 239561 239476 86 0 2 116 93 50 0.434 7.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 174726 174582 145 0 1 92 94 181 0.991 17.53 S.002 Sngl + 198329 198676 348 1 0 75 49 395 0.839 30.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_1|120_aa GPPTKWVLLQLDPSSGEQSHINSLCDSSAGGTERTTKPVAATRPQHSRDPGRTKDAITGD PQPGEGKEAGHGDSTTSLIISYALVTIMDFPIPQGLALTTHPGWHSVLMPTPPCTGRSQL >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_1|363_bp ggcccccccaccaagtgggtactcctgcagctcgaccccagctctggggagcaaagtcat ataaactcactctgtgacagctcagcagggggaacagagagaaccaccaaaccagttgcc gccaccagaccccagcacagccgggacccaggacgcactaaagatgccatcacaggggac ccccagccaggagagggcaaggaggcagggcatggagactcgaccacatccctgatcatc agctacgccctggtcacgatcatggactttcccattccccagggcctggcactgaccacc catccaggctggcattcagtactgatgcccacccctccctgtactggacgcagtcagctc tag >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_2|1010_aa MRTIQEEAGWQQEFILDMSAEILVDMYGTGGYMNVDFKNKVQAGDKNWELKPESSEIQDA RAAVDGLSNPFRGLMKLGTVERRGAMGIWKELFCELSPLEFRLYLSNEEHTCVENCSLLR CESVGPAHSDGRFELVFSGKKLALRASSQDEAEDWLDRVREALQKVRPQQEDEWVNVQYP DQPEEPPEAPQGCLSPSDLLSEPAALQGTQFDWSSAQVPEPDAIKESLLYLYMDRTWMPY IFSLSLEALKCFRIRNNEKMLSDSHGVETIRDILPDTSLGGPSFFKIITAKAVLKLQAGN AEEAALWRDLVRKVLASYLETAEEAVTLGGSLDENCQEVLKFATRENGFLLQYLVAIPME KGLDSQGCFCAGCSRQIGFSFVRPKLCAFSGLYYCDICHQDDASVIPARIIHNWDLTKRP ICRQALKFLTQIRAQPLINLQMVNASLYEHVERMHLIGRRREQLKLLGDYLGLCRSGALK ELSKRAGPESASLKTASSILAAVLPEPGLLRARIADGVYEGFLKALIEFASQHVYHCDLC TQRGFICQICQHHDIIFPFEFDTTVRCAECKTVFHQSCQAVVKKGCPRCARRRKTVDGRP GEEKRNWEREAQRRATARDLVLPCRPAPVREEVLPGALRRCLPGYIVEQETRQRGSRTVE EGARDAEPTRACRGKWRRSRRADAPRVGEGTEVRSPSYPARDIELRWASLRWESRSEFFQ ARPGCSGKGPAHGLPEARPADENAAAAMAADVVGDVYVLVEHPFEYTGKDGRRVAIRPNE RYRLLRRSTEHWWHVRREPGGRPFYLPAQYVRELPALGNPAAAAPPGPHPSPAAPEPLAY DYRFVSAAATAGPDGAPEESGGRASSLCGPAQRGAATQRSSLAPGLPACLYLRPAAPVRP AQSLNDLACAAVSPPAGLLGSSGSFKACSVAGSWVCPRPLARSDSENVYEVIQDLHVPPP EESAEQVPPRALGRGGGWRARDRARTEPGRKETRSAQRRARRPPLSEDFG >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_2|3033_bp atgaggaccattcaggaggaagcaggctggcagcaggaattcatcctggacatgtcagct gagatcctagtggacatgtatggcacaggtggatatatgaatgtggatttcaagaacaag gtccaggctggcgataaaaattgggagctaaagcctgagagcagtgaaatacaggacgca agagcagcagtggacggactgtccaacccattccggggtctcatgaagctgggcaccgtg gagcggcggggggcaatgggcatctggaaggagctcttctgcgagctctccccgctggag ttccgcctctacctgagcaacgaggagcacacctgtgtggagaactgctcgctgcttcgc tgtgagtctgtggggccagcccatagtgatgggcgctttgagctggtcttctctggcaag aagctggccctgcgcgcctcctcccaggacgaagctgaggactggctggaccgggtgcgg gaggccctgcagaaggtccggcctcagcaggaggatgagtgggtgaacgtgcagtaccca gaccagcctgaggaaccccccgaggcgccccagggctgcctctctccctcagacctgctc tcggagcccgcggccctccagggcacacagtttgactggtcgtccgcccaggttccagag ccagatgccatcaaggagtccctgctgtacttgtacatggacaggacctggatgccctat atattttctctgtccttggaggctctgaaatgtttccgcatcaggaacaatgagaagatg ctgagtgacagccacggcgtggagaccatccgggacatcctgccagacaccagccttggg ggcccatccttcttcaaaatcatcacggccaaggctgtcctgaagctgcaggccggaaac gccgaggaagccgccctgtggagggatctggtccgcaaagtcctggcatcctacttggag acagccgaggaggcggtgaccctgggcgggagcctggatgaaaactgtcaggaggtgctg aaatttgccacccgggagaatggcttcctgctgcagtacctggtggctatccccatggag aaaggccttgactcccaaggctgcttctgcgcaggctgctcccggcagatcggcttctcc tttgtacgacccaagctctgtgccttctctggcctctattactgtgacatctgccaccaa gacgatgcctcagtgattccggccaggatcatccacaactgggacctcaccaagcgcccg atctgcaggcaggccctgaagtttctgacacagatccgggcccagcccctcatcaacctg cagatggtgaacgcgtctctgtacgagcatgtggagcggatgcacctcattgggaggaga cgggagcagctgaagctcctgggggattacctgggcctgtgccggagtggcgccctgaag gagctcagcaagagggctggcccagagagcgcctctctgaagactgcctcctccatcctg gctgcagtgctcccagagccgggcctcctcagggccaggatcgcagacggggtgtatgaa ggattcctcaaggccctgattgaatttgcctcccagcatgtctaccactgcgacctgtgc acccagcgcggcttcatctgccagatctgccagcaccacgacatcatcttcccctttgag tttgacaccacagtcaggtgtgccgagtgcaagaccgtcttccaccagagctgccaggct gtggtgaagaagggctgcccccgctgtgcccgccggcgcaagacagtagatggtaggccc ggagaagagaagagaaactgggagagggaggcccagaggagggccacagcacgtgacctg gtgctgccctgccgcccagccccagtccgggaggaagttctgcccggcgctctccgccgg tgcctccctggttatattgttgaacaggaaacccggcagcgcgggagccgcacagtcgag gagggagcgcgggacgccgagcccacgcgcgcctgccggggcaagtggaggcgaagccgg cgagcggacgccccgagggtcggggaaggaaccgaggtccggtccccttcttatccagcc cgagacatcgagctccggtgggcgtccctgcgctgggaatcccgctcggagtttttccag gcccggccgggctgctccgggaaaggccctgcccatgggctccctgaggcccgcccagcc gacgaaaacgccgcggctgcgatggcggcggacgtggtgggggacgtgtacgtgctggtg gagcaccccttcgagtacaccggcaaggacgggcgccgcgtggccatccggccgaatgag cgctaccggctgctgcggcgcagcaccgagcactggtggcacgtgcggcgtgagcccggc ggccgccccttctacctgcctgcgcagtacgtgcgcgagctgcccgcgctgggcaaccct gccgccgccgcgccgccaggtccccacccgagccccgcggcccctgagccgctcgcctac gactaccggtttgtgagcgcggcggcgaccgcgggccccgacggcgcccccgaggagtcc ggaggccgagccagctccctgtgcggccctgcgcaacgcggcgccgcgacccagcgcagc agcctggcgcccggcctgccagcctgcctgtacctgcggcccgcggcgcccgtgcggccc gcgcagtccctgaacgacctggcctgcgccgccgtctcgcctcccgccggcctcctagga agcagcggcagcttcaaggcctgcagcgtggcgggctcctgggtgtgcccgcggcctctg gcgcgcagcgactcagagaacgtctacgaggtcatccaggacttgcacgtcccgccgccg gaggagagcgcagagcaggtacctccccgggcgctggggcgcggaggcgggtggcgcgct agggaccgcgcccgcacggagccggggcgcaaggagacccgctccgctcagcgtcgggca cgacgcccacctctgtccgaagacttcggatga >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_3|553_aa MASGAGAPLPVVSGRETASSARRLRGCRGRERPSPRSSLRLVGLLEATPLFTLVLCVNEE RFEDRKWNQFHRMVDSESGPPGWVTQYCPHSQETQSLLLIPAVIKKKLVGSVKALQKQYV SLDTVVTSEDGDANTMCSALEAVFIHGLHAKHIRAEAGGKRKKSAHQKPLPQPVFWPLLK AVTHKHIISELEHLTFVNTDVGRCRAWLRLALNDGLMECYLKLLLQEQARLHEYYQPTAL LRDAEEGEFLLSFLQGLTSLSFELSYKSAILNEWTLTPLALSGLCPLSELDPLSTSGAEL QRKESLDSISHSSGSEDIEVHHSGHKIRRNQKLTASSLSLDTASSSQLSCSLNSDSCLLQ ENGSKSPDHCEEPMSCDSDLGTANAEDSDRSLQEVLLEFSKAQVNSVPTNGLSQETEIPT PQASLSLHGLNTSTYLHCEAPAEPLPAQAASGTQDGVHVQEPRPQAPSPLDLQQPVESTS GQQPSSTVSETAREVGQGNGLQKAQAHDGAGLKLVVSSPTSPNISSMIQSTRPVHFCIVS TKHCTWEKVGMDE >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_3|1662_bp atggcgtcaggggccggggcgccgcttcctgttgtcagtggccgagagaccgcatcgtcg gctcggaggctgaggggctgccgcggccgggagcgcccctcgcctcgctcctcgctccgc ttggtaggtctccttgaagccacacctctcttcactttggttctttgtgtcaatgaagag cgttttgaggacagaaagtggaaccagttccataggatggtagattcagaatctgggcca ccagggtgggtgacgcagtactgtccccacagtcaagagacacaaagcctccttctgatt cccgccgtcatcaagaagaagctggtgggatccgtgaaggccttgcagaagcagtacgtg tccctggacacggtggtcactagtgaagacggagatgccaacaccatgtgcagcgccctg gaggccgtatttatccatggcctgcacgccaagcacatccgagctgaggccggaggaaaa aggaagaaaagtgcccaccagaagcctctgccccagcctgtcttctggcccctcctgaaa gctgtcacccacaaacacatcatctcagagttggagcacctgacgtttgtcaacacggat gtgggccgctgccgggcatggctgcggctggccctgaacgatggcctgatggagtgctac ctgaagctgctgctgcaggagcaggcccgcttgcatgagtactaccagcccaccgccctg ctccgggatgctgaggagggcgagttcctccttagcttcctgcagggcctcacgtccttg tccttcgaactctcctacaagtctgccatcttaaatgagtggacgctcaccccattggcc ctgtctgggctttgcccgctttctgagctggaccctctctctacctctggtgcagaacta cagcggaaggaatctctggattccatttcccattcttcaggctctgaagacatcgaagtc catcactcgggccataagatacggaggaaccagaagctgactgcctcctccctcagcctg gacacggccagttcatcccagctgtcctgcagcctaaactctgatagctgcttactccaa gagaatggctccaagagtccagaccattgcgaggagcccatgtcctgtgactcagacctg ggcacagcaaatgctgaggactcagaccggtctctgcaagaggtattgttggaattcagc aaagcccaggtaaactctgtgccaaccaacggactgagccaagaaacagagatccccaca ccacaggcctcgctctccctccatggcctcaacaccagcacatacctgcactgtgaggca cctgcagagccccttcctgcccaggcagcctctggaactcaagatggtgtccacgtgcag gagccgcgtccccaggcgcccagccccctggacttacagcagcctgtagagagcacctca ggccagcagccttctagtactgtcagcgagacagccagagaagtgggccaagggaatggc ctgcagaaggcccaggctcatgacggagctggtctgaagctggtagtttcctcacccacc agtccgaatattagctctatgatacagagcaccaggcctgttcacttctgtattgtcagc accaagcactgtacctgggagaaagtaggcatggatgaatag >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_4|111_aa MVSLWVEDTFLSPGFGFAHVACSGLGMKQKRKAASSEPTSEVALGGSAGPVRSHLHPEGL LWCSRCFFSLRPKGTEPPGRSAGLQGATERSGWTSIQAQAQACENLVRAAV >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_4|336_bp atggtttcgttgtgggtggaggatactttcctgtcccctggctttgggtttgcccacgtg gcttgctctggccttggaatgaagcagaaacgaaaggctgccagttccgagcccacgtct gaagtcgccttaggtggttccgcgggccccgtgcgctcccaccttcacccagagggcctt ctctggtgcagccgctgcttcttcagcctccgcccaaaaggaacggagccccctggccga tccgcaggcctacagggagccacagagcgcagcggctggaccagcattcaagcccaagca caggcctgcgagaaccttgttcgagccgccgtttag >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_5|667_aa MPSISIVFEEKSAINLIEDPLILPSHMACCLCQLKNSIEAVCKTVKLHCNSACLTNTIHC PEEESVGNPEGAFMKMLQARKNYTSTELTVEPEEPSDSSGINLSGFGTVNLDVESMLLPF IKLPTTGNSLAKIQTVGQNRQKVNRVLMGPMSIQKRHFKEFEIQLTQQLQSLIPNNNVRR LISHVIRTLKMDCSGAHVQVTCAKLISRTGHLMKLLSGQQEVKASEIEWDTDQWKTENYI NESTEAQSEQKEKSLELTKEVPGYGYTDKLILALIVTEILMILIILFCLIVVRTIINSAE EESVGNPEGAFMKMLQARKNYTSTELTVEPEEPSDSSGINLSGFGTVNLDVESMLLPFIK LPTTGNSLAKIQTVGQNWQKVNRVLMGPMSIQKRHFKEVGRQSIRREQGAQASVENAAEE KRLGSPAPRELEQPHTQQGPEKLAGNAIYTKPSFSQEHKAAVSVLTPFSKGAPSTSSPAK ALPQFEIQLTQQLQSLIPNNNVRRLISHVIRTLKMDCSGAHVQVTCAKLISRTGHLMKLL SGQQEVKASEIEWDTDQWKTENYINESTEAQSEQKEKSLELTKEVPGYGYTDKLILALIV TEILMILIILFCLIVGHFQVSATEEMLFTKGESDMYKPLSATRVNNHAWKLHKKSSNEDE ILNRDPG >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_5|2004_bp atgcccagcatctccattgtttttgaagagaaatcagctattaatcttattgaggatccc ttgatcttacctagccatatggcctgctgcctctgccaacttaaaaacagcattgaggct gtctgcaagacagtcaagctgcattgcaacagtgcatgtctgacaaacaccatacattgt cctgaagaagaatctgtagggaatccagaaggagcattcatgaagatgttacaagcccgg aagaattacacaagcactgagctgactgttgagccggaggagccctcagacagcagtggc atcaacttgtcaggctttgggacagtaaacctagatgtggaatcaatgttactaccgttc attaaactgccaaccacaggaaacagcctggcaaagattcaaactgtaggccaaaaccgg caaaaagtgaatagagtcctcatgggcccaatgagcatccagaaaaggcacttcaaagag tttgaaattcagctaacccagcagctacagtcccttatccccaacaacaatgtgagaagg ctcatttctcatgttatccggaccttgaagatggactgctctggggcccatgtgcaagtg acctgtgccaagctcatctccaggacaggccacctgatgaagcttctcagtgggcagcag gaagtaaaggcatctgagatagaatgggatacggaccaatggaagactgagaactacatt aatgagagcacggaagcccagagtgaacagaaagagaagtcgcttgagctcacaaaagaa gttccaggatacggctatactgacaaactcatcttggcattaattgtgactgaaatacta atgattttgattatacttttctgcctcattgtggtaaggacaataattaattcagctgaa gaagaatctgtagggaatccagaaggagcattcatgaagatgttacaagcccggaagaat tacacaagcactgagctgactgttgagccggaggagccctcagacagcagtggcatcaac ttgtcaggctttgggacagtaaacctagatgtggaatcaatgttactaccgttcattaaa ctgccaaccacaggaaacagcctggcaaagattcaaactgtaggccaaaactggcaaaaa gtgaatagagtcctcatgggcccaatgagcatccagaaaaggcacttcaaagaggtggga aggcagagcatcaggagggaacagggtgcccaggcatctgtggagaacgctgccgaagaa aaaaggctcgggagtccagccccaagggagctggaacagcctcacacacagcaggggcct gagaagttagcgggaaacgccatctacaccaagccttcgttcagccaagagcataaggca gcagtctctgtgctgacacccttctccaagggcgcgccttctacctccagccctgcaaaa gccctaccacagtttgaaattcagctaacccagcagctacagtcccttatccccaacaac aatgtgagaaggctcatttctcatgttatccggaccttgaagatggactgctctggggcc catgtgcaagtgacctgtgccaagctcatctccaggacaggccacctgatgaagcttctc agtgggcagcaggaagtaaaggcatctgagatagaatgggatacggaccaatggaagact gagaactacattaatgagagcacggaagcccagagtgaacagaaagagaagtcgcttgag ctcacaaaagaagttccaggatacggctatactgacaaactcatcttggcattaattgtg actgaaatactaatgattttgattatacttttctgcctcattgtggggcattttcaggtt tctgccacggaggagatgctcttcacgaagggagagtcagatatgtacaaacctctcagt gccacaagagtaaataatcacgcatggaagctgcacaagaagtcatctaatgaggacgag atcctcaacagggaccctgggtaa >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_6|825_aa MITQIGILNPLVWVGFKTVVIPMKGKIAQIAIEKALSDAFQKLLIVVLEMPAPPQESTEN LVPFLDTWDSAGEQPLEPEQFLASQQDLKDKLSPQERLPVSPKKLKKDPAQRWSLAEIIG IARQLSTPQSQKQTLQNEYSSTDTPYPSSLPPELRVKSDEPPGPSEQVGPSQFHLEPETQ NPETLEDIQSSSLQQEAPAQLPQLLEEEPSSMQQEAPALPPESSMESLTLPNHEVSVQPP GEDQAYYHLPNITVKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIE PPVSPMEHELSISEQQQPVQPSESSREVESSLTQQETPGQPPEHHEVTVSPPGHHQTHHL DSPSVSVKPPDVQLTIAAEPSAEVGTSLVHQEATARLSGSGNDVEPPAIQHGGPPLLPES SEEAGPLAVQQETSFQSPEPINNENPSPTQQEAAAEHPQTAEEGESSLTHQEAPAQTPEF PNVVVAQPPEHSHLTQATVQPLDLGFTITPESMTEVELSPTMKETPTQPPKKVVPQLRVY QGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEPTTEVGHSTAPKRTIVSPKHPEVTL PHPDQVQTQHSHLTRATVQPLDLGFTITPKSMTEVEPSTALMTTAPPPGHPEVTLPPSDK GQAQHSHLTQATVQPLDLELTITTKPTTEVKPSPTTEETSTQPPDLGLAIIPEPTTETGH STALEKTTAPHPDRVQSLHRSLTEVTGPPTELEPAQDSLVQSESYTQNKALTAPEEQKAS TSTNICELCTCGDEMLSCIDLNPEQRLRQVPVPEPNTHNGTFTIL >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_6|2478_bp atgatcactcagataggcatcttaaatccattagtttgggttggatttaagaccgttgtc attcctatgaaagggaagatagcccagattgctattgagaaggctttgtcagatgcattc cagaaactgttgattgtggttctagagatgccagccccaccccaggaatcgactgaaaat ttggttccattcctggacacctgggattcagctggagagcagcccctggagccagagcag ttcttggcttcacagcaggatttaaaggacaagctgagtccacaggaaagactccctgtt tcgcccaagaagctgaagaaagatccagctcagcgttggagccttgctgagattattgga attgcacgccaattatccacacctcagagtcagaaacagactttgcagaatgaatattcc agtacagatacaccgtatcccagtagcctgcctccagaactccgggtgaagtcagatgag cctccagggccctctgagcaagttggaccttctcaattccatctagagcccgaaactcaa aatccagagacccttgaagacatccagtcctcttcactccagcaagaagccccagcacag cttccacagctccttgaggaagaaccttcttcaatgcagcaggaggccccagctctgcct ccagagtcctctatggagagtctaactctaccgaatcatgaggtgtcagttcaacctcca ggtgaggatcaagcttattatcacttgcccaacattacagttaaacctgcagatgtggag gttaccataacttcagagcctaccaatgagacagaatcttcccaagcccagcaggagacc ccaattcagtttccagaggaggtggaaccttctgcaacccaacaggaggccccaattgag cctccagtttctcctatggagcatgaactttccatcagtgagcagcagcagccagttcag ccttctgagtcttctagggaggtcgaatcttctctgacccagcaggagaccccaggtcag cctccagaacatcatgaagtcacagtttcacctccaggtcaccatcaaactcatcattta gattcacccagtgtctctgtgaagcctccagacgtgcagctcaccatagcagcagagcct agtgcagaggtgggaacttctctagtccaccaggaggctacagctcggctctcagggtca ggtaatgatgtagaacctcccgccatccagcacgggggcccacctctgcttccagagtca tcagaagaagctggacctttagcagttcaacaggagacttcatttcaatctccggaacct attaataatgagaacccctctccaacccagcaggaggctgcagctgagcatccacagacc gctgaggagggtgagtcttccctaacccatcaggaggccccagctcagactccagagttc cctaatgtagttgtagctcaacctccagagcattcacacctgactcaagccacagttcaa cctttggatctggggtttaccatcactccagaatccatgacagaggttgaactttctcca accatgaaggagaccccaactcagcctcctaagaaagttgtaccccaacttcgagtatat caaggggtaacaaatccaacaccaggtcaggatcaagctcagcatccagtgtcacccagc gttacagttcaacttttggacctgggacttaccatcactccagaacccactacggaggtt ggacattctacagccccgaagaggactatagtttctccaaagcatcctgaggtgacactt ccacatccagaccaggttcagactcagcattcacacctgactcgagccacagttcaacct ttggacctggggtttaccatcactccaaaatccatgacagaggttgaaccttctacagcc ctgatgactacagctcctcctccaggacaccctgaggtgacacttccaccttcagacaag ggtcaggctcagcattcacacctgactcaagccaccgttcaacctctggacctggagctt accataactacaaaacctactacagaggttaaaccatctccaaccacagaggagacctca actcagcctccagacctgggacttgccatcattccagaacccactacagagactggacat tctacagccctggagaagactacagctcctcatccagaccgggttcagagtctgcatcga agcctgactgaagtcacaggtccacctactgaactagaacctgctcaggattcactggtg cagtctgaaagttacacccaaaataaggctttaactgcaccagaggaacagaaggcctcc acaagcaccaacatatgtgagctctgtacctgcggagatgagatgttgtcatgtattgat ctcaacccagagcagaggctccgccaagtgcctgtgccagagcccaacacccacaatggc accttcaccatcttgtaa >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_7|346_aa MAGHPQAGWAARRRLGQRCSSGGCLRKCMSTSYPAVPARGPPLRVPPDDDLQRPEPRLRI CPLQLAARRAGRHRPLHNHPLRPSCPLLLCRSTGKCELSVDCLPPNLTRTALQPALQPLG PGLQEARLLPSPGPAPGQIALLKFSSHWTAAMAKKALEEGQPHLCGEQVAVEWLKPELKQ RLRQQLVGPSLWSPQPDGSQLALARDKLGSQGARATLQLLCQRMKLGSSVFLTKCLGIGP AGWHRFWYQVVIPGHPVPFSGLIWVVLILDGRDGHEVAKDAVSVRLLQALIESGANLLWS AGAEAGSPNGHAQPVSGPNPADLGGHYLTPKVESMDEEPADLEGQL >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_7|1041_bp atggcgggccacccccaggctgggtgggcagcccgccgccggctgggtcagaggtgttca tcgggcggctgcctcaggaagtgtatgagcaccagctatcctgctgttccagcgcgtggg ccgcctctacgagttccgcctgatgatgaccttcagcggcctgaaccgcggcttcgcata tgcccgctgcagctcgcggcgcggcgcgcaggccgccatcgcccgctgcacaaccacccg ctgcggccgtcctgcccgctgcttctgtgccgcagcaccgggaagtgtgagctgagcgtt gactgcctgccgccgaatctgacccgcaccgcgctgcagcccgcgctgcagccgctgggt cccggcctgcaggaggcgcggctgctgcccagccccggaccggcgcccgggcagatcgct ctgctcaaattcagctcgcactggaccgctgccatggccaaaaaggccctggaggaaggg cagccacacctctgtggagagcaggtggctgtggagtggctcaagccagaactgaagcag cgacttcgccagcagcttgtgggtccctccttgtggtccccacagccagacggcagccag ttggccttggcaagggacaagttagggtcccaaggggctcgggctaccctgcagttgctg tgccaacgaatgaagctgggcagctctgtgttcctcaccaagtgtttgggcataggacct gctggctggcaccgcttctggtaccaggtggtgattcctgggcatccggtgcccttcagc ggcctcatctgggttgtgctgatcctagatggccgggatgggcatgaggtggccaaggat gctgtgtctgtacggctgctgcaggcactcattgagtctggggccaacctcctgtggtct gctggggctgaggcaggcagcccgaatgggcatgcacagcctgtgtcaggccccaaccca gcagacctgggtggccactatctgacccccaaagttgaatccatggatgaggaacctgca gatctggagggccagctgtag >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_8|467_aa MKRIKKERRKEKKKEKKREGSAGGGGAGSRLQAEMLQMDLIDATGDTPGAEDDEEDDDEE RAARRPGAGPPKAESGQEPASRGQGQSQGQSQGPGSGDTYRPKRPTTLNLFPQVQLSQDT LNNNSLGKKSRHLHQQPTLLVDEHAQLEPVSLRPCFGDYSDESDSAIVYDNCASVSSPYE SAIGEEYEEASRPQPPACLSKDSTPDEPDVHFSKKFLNIFMSGRSRSSSAESFGLFSCII NREEQEQTHRTIFRFVPRHEDEPELEVDDPLLVELQAEDYWYEAYNMRTGARGFFTAYYA IEVTKEPEHMAALAKNSDWVDQFRVKFLGSVQVPYHKGDVVLSAAMQKIATTRRLTVHFN PPSSCVLEISVRGVKIGVKADDSQEAKGNKCSHFFQLKNISFRGYHPKNNKYFGFITKHL ADHRFACHVFVSEDSTKALAESVGRAFQQFHKQFVEYTCPTEDIYLE >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_8|1404_bp atgaagagaataaagaaagaaagaagaaaagaaaagaaaaaagaaaagaaaagggagggc tctgcgggtggcggcggcgcggggagccggttgcaggccgagatgctgcagatggacctg atcgacgcgacgggggacactcccggggccgaggacgacgaggaggacgacgacgaggag cgcgcggcccggcggccgggagcggggccgcccaaggccgagtccggccaggagccggcg tcccgcggccagggccagagccaaggccagagccagggcccgggcagcggggacacgtac cggcccaagcggcccaccacgctcaacctctttccgcaggtgcagttgtctcaggacaca ctgaataataattctctgggcaaaaaatcgaggcacctccaccaacagcccacgctgctg gtagatgagcacgcgcagctggagccggtgagcctgcggccgtgcttcggagactacagt gacgagagtgactcggccatcgtctatgacaactgtgcctccgtctcctcgccctatgag tcagccatcggagaggaatatgaggaggcctcccggccccagcctcctgcctgcctctcc aaggactccacgcctgacgaacccgacgtccatttctccaagaagttcctgaacatcttc atgagtggccgctcccgctcctccagtgccgagtccttcgggctgttctcctgcatcatc aaccgggaggagcaggagcagacccaccggaccatattcaggtttgtgcctcgacacgaa gacgaacctgagctggaagtggatgaccctctgctagtggagctccaggctgaagactac tggtacgaggcctacaacatgcgcactggtgcccggggcttctttactgcctattacgcc atcgaagtcaccaaggagcccgagcacatggcagccctggctaaaaacagtgactgggtg gaccagttccgggtgaagttcctgggctcagtccaggttccctatcacaagggcgatgtc gtcctctctgccgctatgcaaaagattgccaccacccgccggctaaccgtgcactttaac ccgccctccagctgtgtcctggagatcagcgtgcggggtgtgaagataggtgtcaaggcc gatgactcccaggaggccaaggggaataaatgtagccactttttccagttaaaaaacatc tctttccgcggatatcatccaaagaacaacaagtactttgggttcatcaccaagcacctc gccgaccaccggtttgcctgccacgtctttgtgtctgaagactccaccaaagccctggca gagtccgtggggagagcattccagcagtttcacaagcagtttgtggagtacacctgcccc acagaagatatctacctggagtag >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_9|243_aa MTKKRRNNGRAKKGRGHVQPIRCTNCARCVPKDKAIKKFVIRNIVEAAAVRDISEVSVFD AYVLPKLYVKLHYCVSCAIHSKVVRNRSREARKDRTPPPRFRPAGAAPGPPPKPIQDLST YYLQDSPYVNLYSNSRMDVLLLRELRDRGLIFMVDSNDREQIDEAWEVLTYLLEDDELRN AVLLVFANKQDLPNTMNAAEITDKLGLHSLRYRNWHIQATCATTGHGLYEGLNWLANQFQ NQN >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_9|732_bp atgacaaagaaaagaaggaacaatggtcgtgccaaaaagggccgcggccacgtgcagcct attcgctgcactaactgtgcccgatgcgtgcccaaggacaaggccattaagaaattcgtc attcgaaacatagtggaggccgcagcagtcagggacatttctgaagtgagcgtcttcgat gcctatgtgcttcccaaactgtatgtgaagctacattactgtgtgagttgtgcaattcac agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga tttagacctgcgggtgctgccccaggtcccccaccaaagcccatacaggatcttagcacc tactatttgcaagactccccctatgttaacctttacagcaactctcggatggatgtttta ttattgagagaactgagagacagaggtctgatttttatggttgacagtaatgacagagag cagattgatgaggcctgggaagtgctaacttacttgttagaggacgatgagctcagaaat gcagttttattggtatttgccaataaacaagatctccctaatactatgaacgcggcagag ataacggacaagctcggcctccattccctccgctacagaaactggcacattcaggctact tgtgccactactggacatgggctttacgaaggcctgaactggctcgccaaccagttccag aaccagaactga >gi568815581r:45410243_45650331|GENSCAN_predicted_peptide_10|122_aa XTAGQMVSSLCQVPRSLPARLSENVSRVKVEEAEEGATPVDSILTLAGENRNDGFTLLTA PPGTTSFSPLPDPSPGFPEERIIFNLERIINMIMGKSGSKTDDAALKENPSALWEPNTLN SN >gi568815581r:45410243_45650331|GENSCAN_predicted_CDS_10|369_bp naaacagccggccagatggtcagctcgctttgccaggtgcccagatccctgccagcacgg ctgtctgaaaatgtcagcagagtaaaggtcgaggaagcagaggagggagccactcctgtg gactcaattctcaccctcgctggtgaaaacagaaatgatgggttcacattgctcacagct ccccctggaacgacttccttctctccacttccagacccaagcccaggcttcccagaagag agaataatcttcaacctggaaaggataataaacatgattatggggaaatcgggatctaag acagatgatgcagcactcaaggagaacccttccgctttatgggagcccaacactcttaac tccaactaa