GENSCAN 1.0	Date run: 5-Nov-116	Time: 12:26:51

Sequence gi568815594r:87237279_87491070 : 253792 bp : 39.77% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   4067   4202  136  2  1  112   48    84 0.846   3.31
 1.02 PlyA +   4566   4571    6                               1.05

 2.00 Prom +   5660   5699   40                              -3.45
 2.01 Init +  21688  21751   64  0  1   87   56    75 0.857   5.56
 2.02 Intr +  27664  27782  119  2  2   38   80    86 0.047   2.06
 2.03 Intr +  28500  28597   98  2  2   71   71    10 0.010  -4.51
 2.04 Intr +  33434  33613  180  2  0   40   86   129 0.043   5.96
 2.05 Intr +  41346  41430   85  0  1   61   66   120 0.447   6.00
 2.06 Intr +  48301  48348   48  0  0   72  115    12 0.277   0.26
 2.07 Intr +  52627  52785  159  0  0   85   73    94 0.813   6.86
 2.08 Term +  53789  53812   24  1  0   77   38    54 0.257  -3.45
 2.09 PlyA +  54873  54878    6                               1.05

 3.09 PlyA -  54897  54892    6                               1.05
 3.08 Term -  57915  57413  503  1  2   18   42   206 0.037   2.66
 3.07 Intr -  69111  69035   77  2  2   33   85    34 0.082  -4.16
 3.06 Intr -  73081  72965  117  0  0   98  100    28 0.763   3.66
 3.05 Intr -  76682  76545  138  1  0   63  116    89 0.711   7.86
 3.04 Intr -  78321  78215  107  0  2   23  111    76 0.584   1.49
 3.03 Intr -  79945  79814  132  1  0   37   92   102 0.681   5.42
 3.02 Intr -  81158  81051  108  2  0   56   94    98 0.950   6.76
 3.01 Init -  85563  85354  210  0  0   74   90   150 0.883  10.87
 3.00 Prom -  87059  87020   40                              -5.45

 4.05 PlyA -  87318  87313    6                               1.05
 4.04 Term -  97485  97144  342  0  0   -4   45   264 0.627   6.43
 4.03 Intr - 103233 103212   22  2  1   50  115    17 0.114  -2.77
 4.02 Intr - 111376 110506  871  2  1   75   53   182 0.229   2.86
 4.01 Init - 113589 111432 2158  0  1   44   86   867 0.286  69.64
 4.00 Prom - 113847 113808   40                              -5.25

 5.12 PlyA - 113929 113924    6                              -4.04
 5.11 Term - 115055 113947 1109  0  2   17   43   843 0.001  63.92
 5.10 Intr - 124727 124531  197  1  2   62   96    86 0.462   4.94
 5.09 Intr - 126337 126050  288  0  0   32   44   161 0.543   1.54
 5.08 Intr - 126727 126634   94  2  1   53   92    86 0.888   3.60
 5.07 Intr - 131822 131748   75  0  0   23   96    76 0.071   0.47
 5.06 Intr - 135467 135431   37  2  1   57   84    15 0.092  -5.08
 5.05 Intr - 137552 137421  132  2  0   63   92    91 0.822   6.92
 5.04 Intr - 145084 144977  108  1  0   97   62   156 0.904  13.46
 5.03 Intr - 153734 153583  152  0  2   96   87    93 0.050   8.96
 5.02 Intr - 158929 158579  351  2  0   33   44   310 0.015  15.47
 5.01 Init - 162400 162139  262  1  1   61   43   186 0.110   7.11
 5.00 Prom - 162724 162685   40                              -6.05

 6.06 PlyA - 164029 164024    6                               1.05
 6.05 Term - 167353 165938 1416  1  0  -24   48   409 0.252  15.70
 6.04 Intr - 167622 167407  216  0  0   77    7   115 0.436   0.08
 6.03 Intr - 168517 168402  116  2  2   27   52   143 0.484   3.85
 6.02 Intr - 169098 168712  387  1  0   93   32   176 0.156   6.34
 6.01 Init - 170724 170025  700  0  1   17  -33   467 0.112  22.69
 6.00 Prom - 174942 174903   40                              -6.05

 7.00 Prom + 176542 176581   40                              -4.15
 7.01 Init + 185628 185734  107  2  2   79   66   132 0.012   7.85
 7.02 Intr + 197703 197942  240  0  0  113   99   137 0.073  13.04
 7.03 Intr + 200999 201094   96  2  0   65   92    51 0.700   1.41
 7.04 Intr + 204551 204637   87  2  0  111  110    23 0.917   4.97
 7.05 Intr + 208115 208229  115  2  1   47   87    60 0.595   1.33
 7.06 Intr + 214311 214457  147  2  0   34   83   130 0.709   6.61
 7.07 Intr + 217093 217177   85  0  1   58   93   111 0.999   7.07
 7.08 Term + 220565 220743  179  0  2   60   37   259 0.999  14.87
 7.09 PlyA + 221041 221046    6                               1.05

 8.07 PlyA - 222757 222752    6                               1.05
 8.06 Term - 230563 230379  185  2  2   82   40    84 0.231  -0.28
 8.05 Intr - 236803 236708   96  2  0   97   78    67 0.817   5.66
 8.04 Intr - 242300 242152  149  1  2   49   94   148 0.933  10.46
 8.03 Intr - 243242 243094  149  2  2  105   82   117 0.995  10.91
 8.02 Intr - 245422 245146  277  0  1   54  106   116 0.120   6.60
 8.01 Init - 248219 247180 1040  2  2   81   29   397 0.127  26.60
 8.00 Prom - 249395 249356   40                              -9.05

 9.05 PlyA - 249691 249686    6                               1.05
 9.04 Term - 250878 250597  282  0  0   61   42   222 0.179   9.34
 9.03 Intr - 251255 251137  119  0  2   26   21   113 0.059  -2.34
 9.02 Intr - 253115 252995  121  2  1  106   64    82 0.266   6.75
 9.01 Intr - 253600 253482  119  2  2   69  108    44 0.768   3.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  10676  10491  186  2  0   65   39   132 0.874   2.51
S.002 Sngl - 114963 113947 1017  0  0   88   43   788 0.989  71.07
S.003 Init - 153792 153583  210  0  0   39   87   182 0.931  10.09
S.004 Term + 182489 182562   74  0  2  130   47    54 0.905   2.59
S.005 Init - 193788 193680  109  0  1   53   99   132 0.844  11.23

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_1|45_aa
XSLEVLKINNLAIMTAMKTSSLAVSEDGQNGFGTLKSPKPRELSL
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_1|138_bp
nnctccctagaagtcttgaaaattaacaaccttgctatcatgacagccatgaaaaccagc
agtctagcagtctctgaagatggccagaatggatttggaactctcaaaagcccaaagccc
agagaactgtcattatag
>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_2|258_aa
MEQEGPHQMQPLNLGLPGLQNCHTYLDKHLDVLSVHLGSRSKGMESPPKAIGAASARELC
KLLGICSLHKGSQYLHDQQPHRGHLVITEPFLDRYYQLTSVLVQILQKADTKVENVSGET
PVEIMERKLGEDCESSQTAMWVLPSAGERGQEKSKVLKKEDSTSYEEHGDPNEMQPPLVI
RYSWVCHYQQHENGLIQMGQLSAPEEEEPACISNQDLGVLSGELEDRIEREPIRLSCLLA
LPFKSLEVSRLVDRQVEK
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_2|777_bp
atggagcaagaaggccctcaccagatgcagcccctcaatcttggacttcccggactccag
aactgccatacatatttggataaacatctggatgttctctcagtgcatcttgggtctagg
agcaagggtatggagtccccacctaaggcaataggagctgcttctgccagagagctgtgc
aaattactggggatatgcagcctacacaagggttcccaatacctgcatgatcaacagccc
cacagaggccatttagtcattacagaaccatttttagatagatattatcagttgacttct
gtgttagtccagattctccaaaaggcagacacaaaggtggaaaatgtatcaggagaaacg
cctgtagagatcatggagaggaaactgggagaagactgtgagagttcccagactgccatg
tgggtcttacccagtgcaggagagaggggacaggagaaaagcaaggttctgaaaaaagag
gacagcacctcctatgaagagcatggagaccctaacgagatgcagccacctctggtgata
cgttattcttgggtatgtcactatcagcagcatgaaaatggactaatacagatgggtcaa
ctgtcagcaccagaagaagaggaaccagcatgcatcagcaaccaggacctaggagtgcta
agtggagagcttgaggacagaatagagagggagcctatcaggctgagctgcctcttggct
cttcccttcaaatcacttgaagtgtcaagattggtggatcggcaggttgagaaataa
>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_3|463_aa
MNIILEILLLLITIIYSYLESLVKFFIPQRRKSVAGEIVLITGAGHGIGRQTTYEFAKRQ
SILVLWDINKRGVEETAAECRKLGVTAHAYVVDCSNREEIYRSLNQVKKEVGDVTIVVNN
AGTVYPADLLSTKDEEITKTFEVNILGHFWITKALLPSMMERNHGHIVTVASVCGHEGIP
YLIPYCSSKFAAVGFHRGLTSELQALGKTGIKTSCLCPVFVNTGFTKNPSTRLWPVLETD
EVVRSLIDGILTNKKMIFVPSYINIFLRLQKVHNRYSVNSKGEVKACIEWILVVSLVTAL
ATTAGVALHRSIQTAHFVNDRQANSTQMWNAQQGIGQKLANQINDLRQSVIWLGDRLMSL
KHRMQMQCDWNTSDFCITPYSYEIDHSREMVKGHLLGREDNLSLDITKLKKQIFEASQAH
LSIVPGAETLDQVAESLYGLNPTTWIKSIRGSTVVTLELCFSV
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_3|1392_bp
atgaacatcatcctagaaatccttctgcttctgatcaccatcatctactcctacttggag
tcgttggtgaagtttttcattcctcagaggagaaaatctgtggctggggagattgttctc
attactggagctgggcatggaataggcaggcagactacttatgaatttgcaaaacgacag
agcatattggttctgtgggatattaataagcgcggtgtggaggaaactgcagctgagtgc
cgaaaactaggcgtcactgcgcatgcgtatgtggtagactgcagcaacagagaagagatc
tatcgctctctaaatcaggtgaagaaagaagtgggtgatgtaacaatcgtggtgaataat
gctgggacagtatatccagccgatcttctcagcaccaaggatgaagagattaccaagaca
tttgaggtcaacatcctaggacatttttggatcacaaaagcacttcttccatcgatgatg
gagagaaatcatggccacatcgtcacagtggcttcagtgtgcggccacgaagggattcct
tacctcatcccatattgttccagcaaatttgccgctgttggctttcacagaggtctgaca
tcagaacttcaggccttgggaaaaactggtatcaaaacctcatgtctctgcccagttttt
gtgaatactgggttcaccaaaaatccaagcacaagattatggcctgtattggagacagat
gaagtcgtaagaagtctgatagatggaatacttaccaataagaaaatgatttttgttcca
tcgtatatcaatatctttctgagactacagaaagtgcataatagatattcagtaaatagt
aaaggagaggtgaaggcttgcatagaatggattctggtggtgtctcttgtcactgcactg
gccaccactgccggagtggcattacaccgatctattcaaacggctcattttgttaatgat
cggcaagccaattccacccaaatgtggaatgctcaacagggcattggtcaaaaattagct
aatcaaattaatgatttaagacagtctgttatttggcttggagatcggctaatgagtctc
aaacatcgcatgcaaatgcagtgcgattggaatacttctgatttctgtatcacaccatat
tcctacgagattgatcattcacgggaaatggtcaaaggacaccttctgggtagggaagat
aatttatccttggacataactaaattaaagaaacaaatatttgaagcctctcaagctcat
ttatccattgtgcctggagctgagacgttagatcaggtggcagaaagtctttatggacta
aaccccacaacttggattaagtctattaggggctccactgtagtaacattggaattatgt
ttctctgtttaa
>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_4|1130_aa
MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR
QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCK
RTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE
NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ
EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKG
DITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEI
VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP
KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRK
SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT
ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS
LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMGELP
FTIASKRIKYLGIQHCEGPLQGELQTTAQGNKRGDKQMEEHSMLMGRKNQYRENGHTAQE
LEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQW
NRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKI
NSRWIKDLNVKPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKDKIDKWDLIKLKSF
CTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRH
FSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNSFFNNIGKSFS
SPVQKALLEGIELAVNTAVVAETQKNLGVYSPYSGEKMNDYSIMGILNFDKKKERKRRKG
KGRRRRREEEEGKRKREKKKKRRRRGRKEEEERSFPTLVSKSESLSMLFE
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_4|3393_bp
atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat
acaggagcacccagattcataaagcaagtcctcagtgacctacaaagagacttagactcc
cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga
cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcagacctaata
gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac
cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa
agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg
attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat
gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag
aacaaagacaccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa
tttatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacaccctaaca
tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa
gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa
atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca
agactaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg
gatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc
tacgcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc
ccaagactaaaccaggaagaagttgaatctctgaatcgaccaataacaggctctgaaatt
gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc
gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca
atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca
aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt
gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt
atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa
tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca
atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat
aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca
gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaaccggcacaaga
cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggca
atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc
ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc
cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca
caagcattcttatacaccaacaacagacaaacagagagccaaatcatgggtgaactccca
ttcacaattgcttcaaagagaataaaatacctaggaatccaacactgtgaaggacctctt
caaggagaactacaaaccactgctcaaggaaataaaagaggagacaaacaaatggaagaa
cattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagaa
ttggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcattgccaagtca
atcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactac
aaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatgg
aacagaacagagccctcagaaataatgccgcatatctacaactatctgatctttgacaaa
cctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaac
tggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatc
aattcaagatggattaaagatttaaacgttaaacctaaaaccataaaaaccctagaagaa
aacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaacacca
aaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaactaaagagcttc
tgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaatt
tttgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaa
atttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacac
ttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatcactg
gccatcagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatg
gcaatcattaaaaagtcaggaaacaacagcttttttaacaacattggaaagtccttcagc
agtcctgtgcagaaggcattgttagaaggcattgaactagctgtgaacacagcagtagta
gcagaaacccagaagaaccttggagtctatagcccttactctggagaaaagatgaatgac
tatagtatcatggggattttgaattttgataagaagaaggagaggaagaggaggaagggg
aaggggaggaggaggaggagggaagaagaagaagggaagaggaagagggagaagaagaag
aaaagaagaagaagaggaagaaaagaagaagaagaaagaagctttccaacattagtctca
aagtctgaaagcttgtcaatgttatttgaatga
>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_5|934_aa
MCTSGRLVTALGLGLSFAPKSEQVPVAGRVQAAGARTSESAEARGLPGPLRVQRCPGPQL
QLDSCSYRGSYLLLAPKSTGMPRSRAMGNERLSTRASSCGGCTGSPSSASPPALCSISHR
ALAAFPQGRARDLQPSMPEPPTHSVGSCAARASPRSTNPCSTAPSPVDRPRAEECKRTAQ
DWQAAPPAAPVRDPLGEASWAPESESFVKLFIPKRRKSVTGEIVLITGAGHGIGRLTAYE
FAKLKSKLVLWDINKHGLEETAAKCKGLGAKVHTFVVDCSNREDIYSSAKKVKAEIGDVS
ILVNNAGVVYTSDLFATQDPQIEKTFEVNVLAHFWLDMSRSPSYWLTEMLMERSPDPDLK
RGFLDLVQERIQVLFKEEILSELLLPDFSHAKRPIFPASELYQRQSPREGTAMRLKGQAP
KHIKQGGDFIGFFVCFRDLQPSLLLTSLLGSLEKRAYRCSKPMFYPEVPLYTEKRIHSTK
YTSLRLALEFFFALIKSLQRGLFSIVTLWSARNMLQDPNTYPKVAIGSGFLHFSPFVVAR
KMLPERGPDPDPKRGFLDLMKERIQEGKLTTRKDIYTENPSVHHHHQRPKVDKTTKMGEK
QNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQTKGK
EVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSAMEDE
MNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDVENGTKLENTLQDIIQE
NFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVTLKG
KPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDKQMLR
DFVTTRPALKELLKEALNMERNNRYQPLQNHAKM
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_5|2805_bp
atgtgcacatctggccggcttgtgacagcgctggggctgggcctcagctttgctccaaaa
tcagagcaggtgcctgtagcggggagagtccaggctgcaggagcacgcacttctgagtct
gctgaagcgcgggggcttcctgggcccctgagagtgcagagatgcccgggtccccagctg
cagctggatagctgcagttacaggggctcctacctgctcctggcccccaagagcacaggg
atgcccaggtccagagccatgggcaatgagagacttagcacccgggccagcagctgcgga
gggtgtactgggtcccccagcagtgccagcccaccggcgctgtgctcgatttctcaccga
gccttagctgccttcccacagggcagggctcgggacctgcagccctccatgcctgagcct
cccacccactccgtgggttcctgtgcggcccgagcctccccaaggagcaccaacccctgc
tccacggcgcccagtcccgtcgaccgcccaagggctgaggaatgcaagcgcacggcacag
gactggcaggcagctccacctgcagcccctgtgcgggatccactaggtgaagccagctgg
gctcctgagtctgagtccttcgtgaagctttttattcctaagaggagaaaatcagtcacc
ggcgaaatcgtgctgattacaggagctgggcatggaattgggagactgactgcctatgaa
tttgctaaacttaaaagcaagctggttctctgggatataaataagcatggactggaggaa
acagctgccaaatgcaagggactgggtgccaaggttcatacctttgtggtagactgcagc
aaccgagaagatatttacagctctgcaaagaaggtgaaggcagaaattggagatgttagt
attttagtaaataatgctggtgtagtctatacatcagatttgtttgctacacaagatcct
cagattgaaaagacttttgaagttaatgtacttgcacatttctggctggacatgtctcgg
tccccttcttactggcttactgaaatgttaatggaaaggagtcccgatccagacctcaag
agagggttcttggatctcgtgcaagaaagaattcaggtgctctttaaagaagaaatcctt
tcagagctcttattacccgactttagccatgccaagcggccaatatttccagcttctgaa
ctttaccaaagacagagcccaagagaaggtactgccatgcggttaaaaggtcaagctccc
aagcacataaaacaaggtggagacttcatcgggttttttgtttgtttcagggacctgcaa
ccaagtttgttactgaccagcttgctgggtagtcttgaaaagcgggcttacaggtgttct
aagcccatgttttatcctgaagtacccctctacacggaaaaacgaattcatagcacaaaa
tacaccagcttaagactagccttagaattctttttcgcattaattaagtctttacagagg
ggtctcttcagtattgtaactctatggtctgccagaaatatgttacaggaccccaacact
tacccaaaggtagccatagggtcagggtttctgcactttagtcccttcgtggttgccaga
aagatgttaccagaaaggggtcccgatccagaccccaagagagggttcttggatctcatg
aaagaaagaattcaggaaggaaaactaacaaccagaaaggacatctacaccgaaaaccca
tctgtacatcaccatcatcaaagaccaaaagtagataaaaccacaaagatgggggaaaaa
cagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcctccaaaggaacgc
agttcctcaccagcaacagaacaaagctggatggagaatgattttgacgagctgagagaa
gaaggcttcagacgatcaaattactctgagctacgggaggacattcaaaccaaaggcaaa
gaagttgaaaactttgaaaaaaatttagaagaatgtataactagaataaccaatacagag
aagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaactacgtgaagaatgc
agaagcctcaggagccgatgcgatcaactggaagaaagggtatcagcaatggaagatgaa
atgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaaagaaatgagcaa
agcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctgattggtgtacct
gaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggatattatccaggag
aacttccccaatctagcaaggcaggccaacgttcagattcaggaaatacagagaacgcca
caaagatactcctcgagaagagcaactccaagacacataattgtcagattcaccaaagtt
gaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggttaccctcaaagga
aagcccatcagactaacagcggatctctcggcagaaaccctacaagccagaagagagtgg
gggccaatattcaacattcttaaagaaaagaattttcaacccagaatttcatatccagcc
aaactaagcttcataagtgaaggagaaataaaatactttatagacaagcaaatgttgaga
gattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcgctaaacatggaa
aggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa
>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_6|944_aa
MWALIKAALEPFQTDDEADSDEEEEDECKKLPSDSECEEQEMEEIKEKKGKLKKVCFTSP
LALPAELSERPPPLSPLNGREDELATKLTAPVVATLKSGAIGDAIQNSIQKARVEADLEA
WQFPITIIQQEGQNIANWTTFPFKLIKEFKQAISQYGRNSPFVQTLLKNVSLNNMLIPYD
WKTLTKSVLTSSQYLQFKTWWADEAQTQTRENTQARPPVPVSFEQLMGVGPDWGERIAQL
LLLPYIKLGSSTMKRTGGFGRTNPTGNAVYWVNQVSDKRPICTVTIQGKDFEGLVDTGAD
VSIIAINQWPWDWPKQKAFIGIFGVGASSEVFQSSFILPHQGPNGLEGTIQPIITPIPLN
LWEAAIVEPLAPIPLVWLTAKLVWVDQWLLKQEKLEALKELALEELQLIEEKIQQAQVER
INPMQPLQFLVFPMKHSPTGVIVQQDDLVEWLFLPHNTTKMFTLYLLDQIAVLIIVLLTK
QQIQQAYVNSQEWQVNLAGFVGILDNHYPKSKIFQFLKLISRILPSITQKAPTKGALTVF
TDGSSNGKASFAGAQKQVLQTDFASDQRAELMAVITVLKTFKQPVNIVSDSAYVVQATQN
IERALIQNVTDEQLNPLFHSLQQALQQRHSPFYITHMRVHTNLPGPLTKLNQRVDALGSA
AFADAQTFHSLTHLNAAGLRKRYGVSWKQAKEIVQHSSACQVLHSPHQGAGVNPRGLSPN
SIWQMDVTHIPAFGKLSFVHVSVDTYSHFIWATCQTGEATTYVKRHLLSCFSVMGIPEKI
KTDNSPGYCSKAMATFFQQWNIAHTMGIPYNSQGQAIVERANCTLKTQIQKQKAGDQEYK
TLHMQLHLALLKLNFLNLQKDQLIAAEQHLTGQKENKKAGQDTWWRDAHTKNWGKGKIIT
WGRVFACVSPGDNQVPVWVPAKHLKIYHGPQHLVDPPVQCELKV
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_6|2835_bp
atgtgggcgctaataaaagcagctcttgagccatttcaaacagatgatgaggcagattca
gatgaggaagaggaggatgagtgtaaaaaactaccctcagattctgaatgtgaggaacag
gaaatggaggaaattaaagaaaagaaagggaaactgaaaaaagtatgttttactagcccg
ttggctctacctgctgaattaagtgaaagaccacctcctctctctccccttaatgggcga
gaagatgaattagctacaaaacttactgctcctgtagttgcaacattaaaatctggagca
attggtgatgctatacaaaattctattcaaaaggctagagtggaggcagaccttgaagca
tggcaatttcccataactataatccagcaggaaggacagaatatagctaattggaccacc
tttccttttaagctgataaaggaattcaagcaagccattagtcaatatgggcgaaactct
ccttttgtgcaaactttattaaaaaatgtttctctcaataatatgttaataccgtatgat
tggaaaactttaacaaaatctgttctcacttcatctcagtacttgcagtttaaaacctgg
tgggctgatgaagctcaaactcagacaagggaaaatacacaagcacgaccacctgtgcct
gtttcctttgaacagttaatgggagttggacctgattggggagaaagaattgctcagttg
ttgctgttaccttacataaaactaggaagcagcacaatgaagagaacaggaggctttggt
agaactaatccaacaggaaatgctgtatattgggttaatcaagtgtctgacaaaagacct
atttgcacagtaactattcagggaaaagattttgaaggactagtagatactggagctgat
gtctctattattgctataaatcagtggccctgggactggcctaagcaaaaggcattcatt
ggtatttttggagtaggagcttcctcggaagtttttcaaagttcctttattttgccacat
caagggccaaatggtctggaagggacaattcagcctatcattacacctattcctctcaac
ttatgggaagcggccattgttgagcctctggctcccattcctctcgtttggctaactgcc
aaactggtttgggtggaccaatggctgctgaaacaggaaaaactggaggctttaaaagag
ctggcattggaggaacttcaattaattgaggaaaaaattcagcaagcacaagtagaacgg
attaatccaatgcaaccattacagtttttagtttttcctatgaagcattcacctacagga
gttatagttcaacaggatgatctggttgagtggctttttctgcctcacaatacaaccaaa
atgttcactctgtacttgttggatcaaattgctgtgctaattatagttctgttaactaaa
caacaaattcaacaagcctatgttaattcccaagaatggcaagttaatttggcaggtttt
gttggcattcttgataatcattatccaaaatctaagatatttcagtttctaaaattaata
tcccggatattgccttctattactcaaaaagcccctactaaaggggcccttactgttttt
actgatggatctagtaatggaaaagcctcatttgcaggagctcaaaaacaagttttgcaa
actgactttgcttctgatcaaagggctgaacttatggctgtgataacagtgttaaaaact
tttaaacagccagtaaacattgtttctgattcagcctatgtagtgcaagccacacaaaat
attgaacgtgccttaattcaaaatgtgactgatgaacaacttaatcctttatttcattct
ttacagcaagcactacaacaaaggcattcacctttctatatcactcatatgagagtacat
actaacctccctggccctttaactaaacttaatcaaagggtggatgcattggggtctgca
gcttttgctgatgcacagacatttcattctttaacccatcttaatgctgcaggccttagg
aaaagatatggtgtatcatggaaacaagctaaagaaattgtgcaacactcttctgcctgc
caagtcctgcattcgccacatcaaggagcaggagtcaaccctagaggtttatctccaaac
tccatctggcagatggatgtaacacatattcctgcttttggaaaattgtcctttgttcat
gtttcagtagatacctattcacattttatctgggccacatgtcaaacaggggaggctaca
acttatgttaaaagacatcttttatcttgcttttctgttatgggaatcccagaaaaaatc
aaaactgataacagcccaggatactgtagtaaagccatggctacattctttcaacaatgg
aatattgcccatactatgggtattccatataactcacaaggacaggcaatagtggaaaga
gctaattgtactttaaaaactcaaatacaaaagcagaaggcaggagaccaggaatataaa
acattgcatatgcaattgcatctagctttattaaaattaaattttttaaatttacaaaaa
gatcaactcatagcagcagaacaacacctgacaggacagaaggaaaataaaaaggctgga
caagatacatggtggagggatgcacatacaaagaactggggaaaaggaaagataattaca
tggggaagagtatttgcttgtgtctcgccaggtgacaatcaggtgcctgtgtgggtgcct
gccaaacatctgaagatctatcatggaccacagcatctagtggacccacctgtacagtgt
gaattgaaagtctga
>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_7|351_aa
MAGRLLGKALAAVSLSLALASVTIRSSRCRGIQAFRNSFSSSWFHLNTNVMSGSNGSKEN
SHNKARTSPYPGSKVERSQVPNEKVGWLVEWQDYKPVEYTAVSVLAGPRWADPQISESNF
SPKFNEKDGHVERKSKNGLYEIENGRPRNPAGRTGLVGRGLLGRWGPNHAADPIITSMHF
MIQNSVVQDFINFFRSSGAGVKFSDSQASLLEIVQGMVDPGEKISATLKREFGEEALNSL
QKTSAEKREIEEKLHKLFSQDHLVIYKGYVDDPRNTDNAWMETEAVNYHDETGEIMDNLM
LEAGDDAGKVKWVDINDKLKLYASHSQFIKLVAEKRDAHWSEDSEADCHAL
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_7|1056_bp
atggcgggacgcctcctgggaaaggctttagccgcggtgtctctctctctggccttggcc
tctgtgactatcaggtcctcgcgctgccgcggcatccaggcgttcagaaactcgttttca
tcttcttggtttcatcttaataccaacgtcatgtctggttctaatggttccaaagaaaat
tctcacaataaggctcggacgtctccttacccaggttcaaaagttgaacgaagccaggtt
cctaatgagaaagtgggctggcttgttgagtggcaagactataagcctgtggaatacact
gcagtctctgtcttggctggacccaggtgggcagatcctcagatcagtgaaagtaatttt
tctcccaagtttaacgaaaaggatgggcatgttgagagaaagagcaagaatggcctgtat
gagattgaaaatggaagaccgagaaatcctgcaggacggactggactggtgggccggggg
cttttggggcgatggggcccaaatcacgctgcagatcccattataaccagtatgcacttc
atgattcaaaacagcgttgttcaggatttcataaactttttcagaagctccggggcaggt
gttaaattttcggattcccaggcatctctcctagagattgtccaggggatggtggatcca
ggagagaagattagtgccacactgaaaagagaatttggtgaggaagctctcaactcctta
cagaaaaccagtgctgagaagagagaaatagaggaaaagttgcacaaactcttcagccaa
gaccacctagtgatatataagggatatgttgatgatcctcgaaacactgataatgcatgg
atggagacagaagctgtgaactaccatgacgaaacaggtgagataatggataatcttatg
ctagaagctggagatgatgctggaaaagtgaaatgggtggacatcaatgataaactgaag
ctttatgccagtcactctcaattcatcaaacttgtggctgagaaacgagatgcacactgg
agcgaggactctgaagctgactgccatgcgttgtag
>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_8|631_aa
MDAKILNKILANQIQEHIKKLIYHNQVTFIPGMQGWFNIRKSMRVSHHINRTNDKNHMII
SIDAEKVFNKIQYPFMLKTLNKLGIDGTYLKIITAIYDKPIANIILNGQKLETFPLKTST
RQGCPLSPLLFNIVLEVLARAVRQEKEIKRIQIGRQEVKLSLFADDMIAYLENPIISVQN
LLKLISNFSKVSGYKINVRKSQAFLYTNNRQAESQIISECPFAIATKRIKYLGIQLTMDV
KDLFKENYKPLLKEIREDTNKCSWIGRINIVKMVVLPKVIYRFNPIPIKLPLTFFTELEK
NYFKVGNQKRAHISKTILSKNNKAGGITLTNFKLYYKATVTKTALYWAEDHCHNNGDLRK
LCQLPEVVKLPIMRTCHRTQISPKITVHFLLVTAIPTCTDFEVIQFPLRMRDWLKNILMQ
LYEANSEHAGYLNEKQRNKVKKIYLDEKRLLAGDHPIDLLLRDFKKNYHMYVYPVHWQFS
ELDQHPMDRVLTHSELAPLRASLVPMEHCITRFFEECDPNKDKHITLKEWGHCFGIKEVV
LNPAAKLPAEFSKRLHAISPPLGILILESRGPLSPTSATAPGKHGFGESEIAKRIYLLDG
GYETFQILILKYEALGSPQESAIQTSLPYSL
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_8|1896_bp
atggatgcgaaaatcctcaataaaatactggcaaaccaaatccaggagcacatcaaaaag
cttatctaccacaatcaagtcaccttcatccctgggatgcaaggatggttcaacatacgc
aaatcaatgcgtgtaagccatcacataaacagaactaatgacaaaaaccacatgattatc
tcaatagatgcagaaaaggtcttcaacaaaattcagtatcccttcatgctaaaaactctc
aataaactaggtattgatggaacatatctcaaaataataacagctatttatgacaaaccc
atagcaaatatcatactgaatgggcagaagctggaaacattccctttgaaaaccagcaca
agacaaggatgccctctctcaccactcctattcaacatagtattggaagttctggccagg
gcagtcaggcaagagaaagaaataaagcgtattcaaataggaagacaggaagtcaaattg
tccctgtttgcagatgacatgattgcatatttagaaaaccccatcatctcagtccaaaat
ctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacgaaaa
tcacaagcattcctatataccaataacagacaagcagagagccaaatcataagtgaatgc
ccatttgcaattgctacaaagagaataaaatacctaggaatacaacttacaatggatgtg
aaggacctcttcaaggagaactacaaaccattgctgaaggaaataagagaggacacaaac
aaatgctcatggataggaagaatcaatatcgtgaaaatggttgtactgcccaaagtaatt
tatagattcaatcctatccccatcaagctaccactgactttcttcacagaattagaaaaa
aactactttaaagttgggaaccaaaaaagagcccatatatccaagacaatcctaagcaaa
aacaacaaagctggaggcatcacgctaactaacttcaaactatactacaaggctacagta
actaaaacagcattgtactgggctgaggaccactgtcataataatggggacctgagaaag
ctctgtcagttgccagaagtcgttaagcttcctattatgaggacttgccataggacacaa
ataagcccaaagattacagtacattttctgcttgtaacagctattcctacttgtacggac
tttgaagtgattcagtttcctctacggatgagagactggctcaagaatatcctcatgcag
ctttatgaagccaactctgaacacgctggttatctaaatgagaagcagagaaataaagtc
aagaaaatttacctggatgaaaagaggcttttggctggggaccatcccattgatcttctc
ttaagggactttaagaaaaactaccacatgtatgtgtatcctgtgcactggcagtttagt
gaacttgaccaacaccctatggatagagtcttgacacattctgaacttgctcctctgcga
gcatctctggtgcccatggaacactgcataacccgtttctttgaggagtgtgaccccaac
aaggataagcacatcaccctgaaggagtggggccactgctttggaattaaagaagtggtt
ctcaaccctgcagccaaattacctgcagagttttctaaacgtctacatgccattagtcca
cccctagggattctgattctagagtccagaggtcctctttcccctacctccgcaaccgcc
cctggtaagcatgggtttggggaatctgaaatagctaaaaggatttatttgttggatggt
ggttatgaaactttccaaattttgatactgaaatatgaggctttaggcagtccccaggaa
tcagctattcaaacttctctcccatatagcttatag
>gi568815594r:87237279_87491070|GENSCAN_predicted_peptide_9|213_aa
XSCMSFQCKRGHICKADQQGKPHCVCQDPVTCPPTKPLDQVCGTDNQTYASSCHLFATKC
RLEGTKKGHQLQLDYFGACKWVDRHLIQESSGWHLAGTPLGRSFQRKEQAAMFAVLKPLL
DHNSLPAREQKWMENEFEEVTEVGFRTWVITNSSELKEHVITQYKEAKNLEKRLEELLTR
ITSLEKNINDLMELKNTARELREAYTSINIQID
>gi568815594r:87237279_87491070|GENSCAN_predicted_CDS_9|642_bp
nattcttgcatgagcttccagtgtaaaagaggccacatctgtaaggcagaccaacaggga
aaacctcactgtgtctgccaggatccagtgacttgtcctccaacaaaaccccttgatcaa
gtttgtggcactgacaatcagacctatgctagttcctgtcatctattcgctactaaatgc
agactggaggggaccaaaaaggggcatcaactccagctggattattttggagcctgcaaa
tgggtcgacagacatctcatacaggagagctctggctggcatctagcaggtacccctctg
ggacgaagcttccagaggaaggaacaggcagcaatgtttgctgttctgaagcctctgctg
gatcacaactccttgccagcaagggaacaaaagtggatggagaatgagtttgaggaagtg
acagaagtaggcttcagaacatgggtaataacaaactcttctgagttaaaggaacatgtt
ataacccaatacaaggaagctaagaaccttgaaaaaaggttagaggaattgctaaccaga
ataaccagtttagagaagaacataaatgacctgatggagctgaaaaacacagcacgagaa
cttcgtgaagcatacacaagtatcaatatccaaatcgattaa