GENSCAN 1.0 Date run: 3-Nov-116 Time: 10:50:00 Sequence gi568815594f:94108395_94389531 : 281137 bp : 38.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 90 151 62 0 2 75 84 54 0.108 1.16 1.02 Term + 28664 28815 152 0 2 43 42 167 0.396 4.69 1.03 PlyA + 28829 28834 6 1.05 2.05 PlyA - 29960 29955 6 1.05 2.04 Term - 31655 31482 174 2 0 42 38 144 0.343 1.58 2.03 Intr - 36351 35922 430 2 1 26 36 225 0.305 3.89 2.02 Intr - 37757 36669 1089 1 0 36 72 464 0.018 27.82 2.01 Init - 39234 39014 221 0 2 88 72 147 0.029 11.25 2.00 Prom - 41630 41591 40 -3.55 3.00 Prom + 43937 43976 40 -6.65 3.01 Init + 46262 46437 176 1 2 65 84 127 0.222 8.87 3.02 Intr + 48697 48830 134 1 2 -21 78 124 0.402 -0.13 3.03 Term + 50256 50647 392 1 2 78 44 171 0.809 5.86 3.04 PlyA + 50940 50945 6 -0.45 4.03 PlyA - 51431 51426 6 1.05 4.02 Term - 53451 53241 211 2 1 75 41 93 0.908 -0.92 4.01 Init - 54288 54203 86 0 2 36 69 148 0.913 8.04 4.00 Prom - 54560 54521 40 -5.25 5.06 PlyA - 55623 55618 6 1.05 5.05 Term - 56802 56656 147 0 0 82 40 79 0.086 -0.58 5.04 Intr - 58527 58426 102 0 0 65 81 45 0.042 0.95 5.03 Intr - 62552 62421 132 2 0 10 101 66 0.009 0.02 5.02 Intr - 88110 87751 360 0 0 26 72 331 0.449 19.79 5.01 Init - 92801 92793 9 2 0 58 100 6 0.537 -0.93 5.00 Prom - 93690 93651 40 -6.95 6.00 Prom + 98660 98699 40 -7.25 6.01 Init + 100001 100190 190 1 1 64 109 156 0.979 14.52 6.02 Intr + 105287 105376 90 0 0 26 105 66 0.377 1.15 6.03 Intr + 117725 117902 178 0 1 114 58 75 0.020 5.16 6.04 Intr + 125560 125728 169 1 1 26 24 167 0.017 3.13 6.05 Intr + 128558 128624 67 1 1 69 85 109 0.822 6.26 6.06 Intr + 136765 136914 150 2 0 -21 -5 275 0.903 6.51 6.07 Intr + 137486 137676 191 0 2 44 52 131 0.850 3.48 6.08 Intr + 141260 141361 102 1 0 92 109 71 0.994 9.05 6.09 Intr + 142358 142439 82 1 1 32 86 94 0.990 1.89 6.10 Intr + 144222 144613 392 1 2 68 95 262 0.788 18.42 6.11 Intr + 156313 156512 200 0 2 62 84 80 0.512 2.23 6.12 Intr + 162334 162424 91 1 1 65 115 52 0.933 4.68 6.13 Intr + 165223 165322 100 0 1 79 58 70 0.933 1.86 6.14 Intr + 166496 166571 76 0 1 76 113 46 0.995 3.55 6.15 Intr + 167945 168080 136 2 1 42 64 195 0.951 12.15 6.16 Intr + 170222 170339 118 1 1 99 10 73 0.919 -0.28 6.17 Intr + 170535 170656 122 1 2 44 116 98 0.948 7.49 6.18 Intr + 172198 172386 189 0 0 68 62 126 0.988 6.86 6.19 Intr + 173078 173196 119 1 2 67 81 11 0.876 -3.36 6.20 Intr + 174727 174909 183 1 0 95 97 101 0.994 9.68 6.21 Intr + 176566 176675 110 1 2 64 113 49 0.980 4.01 6.22 Term + 179107 179174 68 2 2 88 44 72 0.791 -0.08 6.23 PlyA + 179306 179311 6 1.05 7.04 PlyA - 179446 179441 6 1.05 7.03 Term - 188292 188122 171 0 0 63 42 114 0.198 1.14 7.02 Intr - 189793 189588 206 2 2 83 19 193 0.827 9.90 7.01 Init - 189981 189924 58 0 1 42 119 20 0.873 2.02 7.00 Prom - 196975 196936 40 -2.95 8.03 PlyA - 198834 198829 6 1.05 8.02 Term - 202955 202059 897 2 0 -19 37 306 0.024 6.18 8.01 Init - 204360 203323 1038 0 0 59 41 547 0.026 41.13 8.00 Prom - 204450 204411 40 -5.45 9.08 PlyA - 204620 204615 6 1.05 9.07 Term - 205614 204848 767 1 2 -40 43 527 0.017 27.98 9.06 Intr - 206046 205884 163 0 1 71 71 97 0.010 4.93 9.05 Intr - 213120 212086 1035 0 0 29 81 306 0.703 13.78 9.04 Intr - 214824 213947 878 1 2 33 41 357 0.120 15.62 9.03 Intr - 226244 226103 142 2 1 137 108 43 0.743 10.21 9.02 Intr - 226691 226582 110 0 2 55 60 47 0.357 -2.32 9.01 Init - 227642 227594 49 2 1 86 58 42 0.408 0.16 9.00 Prom - 234247 234208 40 -5.75 10.05 PlyA - 234370 234365 6 1.05 10.04 Term - 236172 235969 204 0 0 100 47 172 0.996 10.69 10.03 Intr - 238660 238552 109 0 1 55 47 90 0.028 0.97 10.02 Intr - 258981 258922 60 2 0 92 109 58 0.048 5.23 10.01 Init - 273865 273729 137 1 2 41 24 177 0.087 6.26 10.00 Prom - 274508 274469 40 -5.35 11.02 PlyA - 274593 274588 6 1.05 11.01 Term - 278545 278375 171 1 0 60 42 182 0.256 7.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 39234 38935 300 0 0 88 54 189 0.958 11.04 S.002 Sngl - 117014 116799 216 2 0 63 39 190 0.817 6.52 S.003 Sngl - 204360 203137 1224 0 0 59 41 571 0.909 45.05 S.004 Sngl - 205681 204848 834 1 0 63 43 564 0.968 44.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_1|71_aa XNVEFDIGAEPREQMRRQKGRENKKGGQTVQSVRYYLHIRREEKVTEKKEDTNDQIRKER GDITNDDREIL >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_1|216_bp nnaaacgtagagtttgatattggagcagagcccagggaacagatgaggagacaaaaagga agggaaaataaaaaaggaggtcaaacagttcaatctgtgaggtactatcttcatattcga agagaagaaaaagtgaccgagaaaaaggaagatacaaatgaccaaatcagaaaggaaaga ggagacatcactaatgatgatagagaaatactgtga >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_2|637_aa MGRKQSRKAENSKNQSASSSKDCSSLPATEQSWTENDFDELTEVGFRRSVITNFSELKMY VRTHCKEAKNLEKRSTQFIKQVLRDLQRDLDSHTIIMGDFNTPLSILERSTTQKVNNDTQ ELNSALHQVDLIDIYRTLHPKSTEYTFFLAPHHTYSKIDHIVRSKALFSKCKRTEITTNC LSDHSAIKLELRIKKLTQNHTTTWKLNNLLLNDYWVNNEMRAEIKMFFETNENKDTMYQN LWDTFKAVCRGKFIALNAHKRKQERSKIDNLTSQLKELEMQEQTHSKASRRQETTKIRAE LKEIETQKTLQKITESRSWCFEKINKIDRLLARLIMKKREKNQIDTIKNDKGDITTDPTE IQTTISGYYKHLSTNKLENLEEMDKLLDTYTLPRLYQEQVESLNTPITGPEIEAIVNSPP TKKIPVSDEFTAEFYQRTNDKNHMIISIDAEKAFDESQQHFMLKTLNKLGIHGTHLKILR AVYDKPTANIILNGQKLEAFPLKTRTREGCPPSPLLFNIVLEVLTRAIRQEKEIKGIQLG KEEVKLSLFADDVIVYLENPIVSAQKLLKLISNFSKVSGLRVLANVYGSFQDMETQGLRF FGQNSVPQQLRPYNGASLQTLESGEKASAPAKDHCLV >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_2|1914_bp atggggagaaaacagagcagaaaagctgaaaattctaaaaaccagagtgcctcttcttca aaggattgcagttccttgccagcaactgaacaaagctggacagagaatgactttgatgag ttgacagaagtaggtttcagaaggtcagtaataacaaacttctccgagctaaaaatgtat gttcgaacccattgcaaggaagctaaaaaccttgaaaaaaggagcacccaattcataaaa caagtccttagagacctacaaagagacttagattcccacacaataataatgggagacttt aacactccactgtcaatattagaaagatcaacgacacagaaggttaacaacgatactcag gaattgaactcagctctgcaccaagtggacctaatagacatctacagaactctccacccc aaatcaacagaatatacattcttcttagcaccacatcatacttattccaaaattgaccac atagttagaagtaaagcactcttcagcaaatgtaaaagaacagaaatcacaacaaactgt ctctcagaccacagtgcaatcaaattagaactcaggattaagaaactcactcaaaaccac acaactacatggaaactgaacaacctgctcctgaatgactactgggtaaataatgaaatg agagcagaaataaagatgttctttgaaaccaatgagaacaaagacacaatgtaccagaat ctctgggacacatttaaagcagtgtgtagagggaaatttatagcactaaatgcccacaag agaaagcaggaaagatctaaaattgacaacctaacatcacaattaaaagaactagagatg caagagcaaacacattcaaaagccagcagaaggcaagaaacaactaagatcagagcagaa ctgaaggagatagagacacaaaaaacccttcaaaaaatcactgaatccaggagctggtgt tttgaaaagatcaacaaaattgatagactgctagcaagactaataatgaagaaaagagag aagaatcaaatagacacaataaaaaatgataaaggggatataaccacagatcccacagaa atacaaactaccatcagtggatactataaacacctctccacaaataaactagaaaatcta gaagaaatggataaattgctggacacatacaccctcccaagactataccaggaacaagtt gaatctctgaatacaccaataacaggccctgaaattgaggcaatagttaatagcccacca accaaaaaaattccagtatcagacgaattcacagctgaattctaccagagaaccaacgac aaaaaccacatgattatctcaatagatgcagaaaaggccttcgacgaaagtcaacagcac ttcatgctgaaaactcttaataaactaggtattcatggaacgcatctcaaaatattaaga gctgtttatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattc cctttgaaaacccgcacaagagaaggatgccctccgtcaccactcctattcaacatagtg ttggaagttctgaccagggcaatcaggcaagagaaagaaataaagggtattcaattagga aaagaggaagtcaaattgtccctgtttgcagatgacgtgattgtatatttagaaaacccc attgtctcagcccaaaaactccttaagctgataagcaatttcagcaaagtctcaggactt cgagtgctagcgaacgtctatgggagcttccaggacatggaaacacaagggctgaggttc tttgggcagaattcagtcccccaacagctgcgcccctacaatggtgcctcgctgcagaca ctcgagtctggggaaaaggcaagtgcccctgcaaaagatcactgtctagtgtaa >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_3|233_aa MIPTTSPVNSPIWPVQKTDGSWKMVVDYCKLNQVVTTIATAVPDVVSLLEQINTSPGTWP PACWGRKKPVVAQSESKSLKTREADSAAFRLWLKAQDPLANHWSDCMVSTHTEDRSSSPS PLTQMSISSGNIHTDTPKNNTLPAIWAPLHPIKLTNNINHHSTVSFRASIHASRNQGVEV EVAPLAITPSDLASKIFASCSCHIMFCWPRSLISRGRNAATRGHNNDSMELGS >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_3|702_bp atgatccctaccacatccccagtcaactctcctatttggcctgttcagaagacagatgga tcttggaaaatggtagtggattattgtaagcttaaccaagtggtgactacaattgcaact gctgtaccagatgtggtttcattgcttgagcaaattaacacatctcctggtacctggcca cctgcatgctggggaagaaagaagccagtagtggctcagtctgagtccaaaagcctcaaa accagggaagctgacagtgcagcctttcgtctgtggctgaaggcccaagaccccctggca aaccactggtctgattgcatggtgtccacccacactgaggacaggtcttcctctcccagc ccactgactcaaatgtcaatctcctctggcaacatccacacagacacacccaagaacaat actttaccagccatctgggcaccccttcatccaatcaagttgacaaataatattaaccat cacagtactgtttctttcagagccagtattcatgcgtccaggaatcaaggggtggaagtg gaagtggcaccactcgcaattacccctagtgacctggctagcaaaatttttgcttcctgt tcctgccacattatgttctgctggcctagaagtcttatttccagagggaggaatgctgcc accaggggacacaacaatgactctatggaactgggaagttaa >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_4|98_aa MRCIIEEQPPGFGDRTPTEGQCPEDAVVRDMHEAGNHHSQQSNTGTENQTPYVLTHKWEL NNENTWTQGGEQHTMGPVRGSGAREGRALGQKPNACGA >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_4|297_bp atgagatgcatcattgaggagcagccccctggttttggtgacagaactcctactgaagga cagtgccctgaggatgctgtggtgagggacatgcatgaagctggaaaccatcattctcag caaagtaacacgggaacagaaaaccaaacaccatatgttctcactcataagtgggagttg aacaatgagaacacatggacacagggaggagaacaacacacaatgggacctgtccggggg tcgggggcaagggaagggagagcattaggacaaaaacctaatgcatgtggggcttaa >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_5|249_aa MHKKPKGKISAYAFFGQTCREEHKKKNPEVPVNFAEFSTKCSERWKTMSGKEKSKFDETA KVDKERYDRKMKDYGPAKRGKKEKDPNAPKGHHLGSSYSVQNSAPRSNPQTLASLLEMWQ KSWSVLQFLKAACPEFVPSDVQMCLEFLPSGGFMVSLASGVKLQTFAPACTQVIKNFIAH TKPVWWSLHTDAHESDHFREKISKGRTQGLQYYGSILHETEGTLDARMVRDVWNQKHIVT LSMIGEMKI >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_5|750_bp atgcacaagaaaccaaagggcaagatctctgcttatgccttctttgggcagacgtgcaga gaagaacataagaagaaaaacccagaagtccctgtcaattttgcagaattttccacgaag tgctctgagaggtggaagacaatgtctgggaaagaaaagtctaaatttgatgaaacggcc aaggtggataaagaacgctatgatcggaaaatgaaggattatggaccagctaaaagaggc aagaaggagaaggatcctaatgcccccaaaggccaccatctgggttcttcctattctgtt cagaattccgccccaagatcaaatccacaaaccttggcatctctattggagatgtggcaa aaaagctggtcagtgctacagttcttaaaggcagcatgtccggagtttgttccttctgat gttcagatgtgtttggagtttcttccttctggtgggttcatggtctccctggcgtcagga gtgaagctgcagaccttcgcgcccgcctgcacccaggtgattaaaaactttattgctcac acaaagcctgtttggtggtctcttcacacggacgcgcatgaaagtgaccatttcagggaa aagatctctaagggacgtacccagggcctgcaatattatgggtcaatacttcatgaaaca gaaggtactcttgatgcaaggatggtaagggatgtttggaatcagaaacatatagtcact ctgagtatgataggagagatgaagatatga >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_6|1040_aa MNLFNLDRFRFEKRNKIEEAPEATPQPSQPGPSSPISLSAEEENAEGEVSRANTPDSDIT EKTGDQLNVEDLQITQMFLAWTAGGLTPWTELREDSSVPETPDNERKASISYFKNQRGIQ YIDLSSDSEDVVSPNCSNTVQEKTFNKDTVIIVSEPSEDEESQGLPTMARRNDDISELED LSELEDLKDAKLQTLKELFPQRSDNDLLKLIESTSTMDGAIAAALLMFGDAEKQFGLGAV KNEDIEKFLGSCPDDLTTENIQQLAAYGCMDAEDVDDSDNETKLTVVSRRGHDDLSLKLS KTIFPLGLAKDFIQHPYLQQILGESLHLLVTKPKWRSTLYCTLDLGEESNESAESSSNWE KQESIVLKLQKEFPNFDKQELREVLKEHEWMYTEALESLKVFAEDQDMQYVSQSEVPNGK EVSSRSQNYPKNATKTKLKQKFSMKAQNGFNKKRKKNVFNPKRVVEDSEYDSGSDVGSSL DEDYSSGEEVMEDGYKGKILHFLQDASIGELTLIPQCSQKKAQKITELRPFNSWEALFTK MSKTNGLSEDLIWHCKTLIQERDVVIRLMNKCEDISNKLTKQVTMLTGNGGGWNIEQPSI LNQSLSLKPYQKVGLNWLALVHKHGLNGILADEMGLGKTIQAIAFLAYLYQEGNNGPHLI VVPASTIGSQEERKQIRFNIHSRYEDYNVIVTTYNCAISSSDDRSLFRRLKLNYAIFDEG HMLKNMGSIRYQHLMTINVLKQLPPKKDRIELCAMSEKQEQLYLGLFNRLKKSINNLEKN TEMCNVMMQLRKMANHPLLHRQYYTAEKLKEMSQLMLKEPTHCEANPDLIFEDMEVMTDF ELHVLCKQYRHINNFQLDMDLILDSGKFRVLGCILSELKQKGDRVVLFSQFTMMLDILEV LLKHHQHRYLRLDGKTQISERIHLIDEFNTDMDIFVFLLSTKAGGLGINLTSANVVILHD IDCNPYNDKQAEDRCHRVGQTKEVLVIKLISQGTIEESMLKINQQKLKLEQDMTTVDEEL ALVVTRVPAAGAASPKKLAI >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_6|3123_bp atgaatcttttcaacctggaccgttttcgctttgagaaaaggaataagattgaggaagcg cccgaagcaacccctcaaccttcccagcctggcccttcttcaccaatttctcttagtgct gaagaggagaatgctgaaggggaagttagcagggcaaacactcctgattcagatataact gaaaaaacaggtgatcagttgaatgtggaggatttacagataactcaaatgttccttgct tggacagctggaggattgacaccatggactgaattaagagaagattctagtgttccagaa actccagataatgaaagaaaagcaagtatatcatatttcaaaaatcaaagaggaatacag tatattgatttgtcttctgatagtgaagatgtcgtttccccaaattgctccaatacagtt caagagaaaacattcaacaaagatacagtgattatagtttctgagccatctgaagatgaa gagtcccaaggccttcctaccatggcacgtagaaatgatgatatttcagaactggaagac ctttcggaattggaagaccttaaagatgctaaacttcagactttgaaggaactttttcca caaagaagtgacaatgatttacttaagttgattgaatcaacaagcactatggatggagca attgctgctgccttgctgatgtttggtgatgcagaaaaacagtttggactcggagcagtc aaaaatgaggatattgagaagtttctagggtcctgtcctgatgatctaaccacagaaaac attcagcaacttgctgcttatggctgtatggatgcagaagatgtcgatgacagtgataat gaaacaaaactgactgtggtcagtagaagaggacatgatgacctttctttaaaactctct aaaacgattttcccactggggcttgccaaagacttcatacaacatccttacctgcaacag attcttggagaatcattgcatctgctggtcacgaaacccaagtggcggagtaccttgtat tgcactctggacctgggagaggaatcaaatgagtctgcagaatctagcagtaattgggaa aagcaggaaagtattgtactgaaattgcaaaaggaatttcccaattttgataaacaggaa cttagagaagtactcaaggaacatgaatggatgtacacagaagctttagaatctctaaaa gtgtttgcagaagaccaagatatgcaatatgtatcacaaagtgaggttccaaatggaaaa gaagtttcttcaagaagtcaaaattaccctaaaaatgcaactaaaacaaaactaaaacag aaattttcaatgaaagcacaaaatggctttaacaagaaacgtaaaaaaaatgtttttaat ccaaagagagttgttgaagactctgaatatgattcaggttctgatgtcggtagttcacta gatgaggactatagtagtggtgaagaagtgatggaggatggctataaaggtaaaattctt cacttccttcaagatgcttcaattggtgaacttactttgattcctcagtgttctcagaaa aaggctcagaagataacagaactccggccctttaatagttgggaggctctgttcacaaag atgtccaaaactaatggcttatcagaagatttgatatggcactgtaaaacactgatccaa gaaagagatgtagttataaggcttatgaacaaatgtgaagacatttcaaataaattgacc aaacaagttaccatgcttactggaaatggaggtggatggaacatagaacaaccttccatt ctaaaccaaagtttgtcactcaagccctatcagaaggttggtttgaattggctggcattg gtacataaacatggacttaatggcattttggcagatgaaatgggcctaggaaaaactatt caagccattgcatttctggcatacctctatcaggagggtaataatggtcctcatttgatc gttgttccagcttcaactataggttctcaagaagaacgtaaacaaattagatttaacatt catagtagatatgaagattacaatgtaattgtgaccacatataactgtgcgatcagcagt tctgatgaccgtagtctgtttcgacggctgaaacttaattacgcaatttttgatgagggc catatgctgaagaatatgggctccattcgctaccagcaccttatgacaattaatgttctc aagcagttaccccccaagaaagatcgaattgagttgtgtgcaatgtcggagaagcaggag caactctatttgggtcttttcaacagattgaaaaaatctatcaataacttggaaaaaaac acagaaatgtgcaatgtcatgatgcagttgaggaaaatggccaatcatcctttattacat cgccaatattacacagctgaaaaactcaaggaaatgtctcagcttatgctaaaggaacct acacattgtgaggctaaccctgacctgatctttgaagatatggaagttatgacagacttc gaactacatgtactttgtaaacagtaccgacacattaataactttcagttagacatggac ttgattttagattctggaaaatttcgagttttaggatgcatcttgtctgaattgaaacag aagggtgatagagttgtgttatttagccaatttaccatgatgctggatatcttagaggtt ctattaaaacatcatcagcataggtacctcagattagatggaaagactcagatttctgaa aggattcatctaattgatgagtttaataccgatatggatatctttgtgtttctgctatca acaaaagctggtggattaggaataaatctgacttcagcaaatgttgttatacttcacgat attgactgtaatccttataatgacaaacaagcagaagatagatgccatagagtaggccag actaaagaagtactagttataaaactaataagccaagggacgattgaagaatccatgcta aaaattaaccaacagaaattgaaactagaacaggatatgactacagtagatgaagagctt gccttagttgttaccagagtaccagcagcaggagcagcatctcctaagaaattggcaatt tga >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_7|144_aa MTSGPQTAQPKKHHTNFKSETKETRFIRGPKTPAPVTDWEGSLPLVFNQCRDASLIIHPC IKGVRPRRDACLGPSPLAASPAFLGKGQELATSARNLTTRPRNACSPGFLLRHVPSVRDP TGNWTVQLTWQPLPEPLELWPKAL >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_7|435_bp atgacctcaggtcctcagaccgcccagcccaagaaacatcacaccaatttcaaatccgag acaaaagagacacgttttatccgtggacccaaaactccggcgccggtcacggactgggaa ggcagccttcccttggtgtttaatcaatgcagggacgcctctctgattatacacccatgt atcaagggtgtcagaccacgcagggacgcctgccttggtccttcacccttagcggcaagt cccgcttttctggggaaggggcaagagcttgctacaagtgccagaaatctgaccactagg ccaaggaatgcctgcagcccaggattcctcctaagacatgtcccatctgtgcgggacccc actggaaattggactgttcaactcacctggcagccactcccagagcccctggaactctgg cccaaggctctctga >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_8|644_aa MGDFNTPLSTLDRSMRQKVNKDIQELKSALHQVDLIDIYRTLHPKSTEYTFFSAPHHTYS KVDHIVGSKALLSKCKRTEITTNCLSDHSAIKLELRIKKLTQNRSTTCKLNNLLLNDYWV HNEMKAEITMFFETNENKDTTYQNLWDTFKVVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSLFFEKINKIDRLLATLIK KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYTNKLENLEEMDTFLDTYTLPRLN QEEVESLNSPITDSQIEAIIYSLPTKKSPGPDGLTAEFYQRYKEELPFMLKTLNKLGIDG TYLKIVRAICDKPTANIILNGQKLEAFPLKTGTRQGCPLTPLLFNTVLEVLARAIRQEKE INVIQLGKGEVKLSPFADDMIVYLENPIISAKNLLKLISNFSKVSGYKINVQKSQVFLYT NNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKTLLNKIKEDTNKWKNIPC SWVGRINVVKMAILPKVIYRFSAVPIKLPVTFFTELEKNYFKVHMEPKKSLHCQVNPKQK EQGWKHHATRLQTILQGYSNQNSMVLVPKQRYRQTSGTEQSPQK >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_8|1935_bp atgggagattttaacaccccactgtcaacattagacagatcaatgagacagaaagttaac aaggatatccaggaattgaagtcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacattcttctcagcaccacaccacacttactcc aaagttgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatc acaacaaactgtctctcagaccacagtgcaatcaaattagaactcaggattaagaaactc actcaaaaccgctcaactacatgtaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataacgatgttctttgaaaccaatgagaacaaagacaca acgtaccagaatctctgggacacatttaaagtagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctaaaatcgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggagatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagcttattttttgaaaagatcaacaaaattgatagactgctggcaacactaataaag aagaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatataaccaca gatcccacagaaatacaaactaccatcagagaatactataaacacctctacacaaataaa ctagaaaatctagaagaaatggatacattcctcgacacatacaccctcccaaggctaaac caggaagaagttgaatctctgaatagtccaataacagactctcaaattgaggcaataatt tatagcttaccaaccaaaaaaagtccaggaccagatggactcacagctgaattctaccag agatacaaggaggagctgcccttcatgctaaaaactctcaataaattaggtatagatggg acgtatctcaaaatagtaagagctatctgtgacaaacccacagccaatatcatactgaat gggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctcaca ccactcctattcaacacagtgttggaagttctggccagggcaatcaggcaagagaaggaa ataaatgttattcaattaggaaaaggggaagtcaaattgtccccgtttgcagatgacatg attgtatatttagaaaaccccatcatctcagccaaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagtattcttatacacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggatgtgaaggacctctttaaggagaat tacaaaacactgctcaacaaaataaaagaggatacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcaatgttgtgaaaatggccatactgcccaaggtaatttataga tttagtgccgttcccatcaagctaccagtgactttcttcacagaattggaaaaaaactac tttaaagttcatatggaaccaaaaaagagcctgcattgccaagtcaatcctaagcaaaaa gaacaaggctggaagcatcacgctacccgacttcaaactatactacaaggctacagtaac caaaacagcatggtactggtaccaaaacagagatataggcagaccagtggaacagaacag agccctcagaaataa >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_9|1047_aa MGFHHVGQAGLELLASALGCPYPDVLINSELCHKGKQNRQNIKYYTLPVTKRRNCTMPNY KLTYFNMRGRAEIIRYIFAYLDIQYEDHRIEQADWPEIKSTPHHTYSKIDHIVGSKALLS KCKRTEITTNCLSDHSAIKLELRIKKLTQNHTNTWKLNNLLLNDYWVHNEMKAEINIFFE TNENKDTMYQNLWDTFKAVCTGKFIALNAHKRKQKRSEIDTLTSQLRELEKQEQTHSKAS RRQEISKIRTELKETEIQKTLQKISEPRSLFFEKINKIDRLLARLIKKKREKNQIDAMKN DKGDITTDPTEIQTTIREYYKHLYTNKLENLEELDKLLDTYTFPKINQEEVESLNRPITG SEIEAIINSLPTKKSPGPGGFTAEFYQRYKEELRIKYLGIQLKRDVKDLFKENYKPLLNK IKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPSHLQVTFFTELEKTTLKFIWNQ KRACIAKSILSKKNKAGDIMLPDFKLYYKPTVTKTAWYWYQNRDIDQWNRTEASEIIPHI YNHLIFDKPDKNKKWGKDSLFNKWCWENWLAIWRKLKLDPFLTPYTKIHSRWIKDFNVRP KTIKTLEENLGNTIQDIGMGKDFMTKTPKAMATKAKIDKWDLIKLRSFCTVKETTIRVNR QPAEWEKNFAIYPSDKWLISRIYKELKQIYKKKSNNPIKKWAKDMNRHFSKKDIYAVNRH MKKCSSSLVITEMQIKTMPPLLIPRQKGSGVDLWQTPTDLQLRVLTVKRKTNKQKGHPHQ NSICTSPSSKTKEKSLKDLMELKTMARELHDECTSFSSWFDQLEERVSVMEDQKNEMKQE EKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQDIIQENFPNPAK QANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAKEKGRVTHKGKPIRLTA DLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGKIKSFPDKQMLRDFVTIRP ALQELLKEALNMEWNNQYQPLQKHAKL >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_9|3144_bp atggggtttcaccatgttggccaggctggtctcgaactcctggcttcagcccttggctgt ccttacccagatgtcctcataaactctgagctgtgtcacaagggcaaacaaaacaggcaa aatatcaaatattacacattaccagtgactaaaaggaggaattgcaccatgccaaactac aaactcacttattttaatatgagggggagagcagaaattattcgttacatatttgcttat ttggacatacagtatgaagaccacagaatagaacaagctgactggcctgaaatcaaatca acaccacaccacacttattccaaaattgaccacatagttggaagtaaagcactcctcagc aaatgtaaaagaacagaaatcacaacaaactgtctctcagaccacagtgcaatcaaacta gaactcaggattaagaaactcactcaaaaccacacaaatacatggaaactgaacaacctg ctcctgaatgactactgggtacataacgaaatgaaggcagaaataaacatattctttgaa accaatgagaacaaagacacaatgtaccagaatctctgggatacatttaaagcagtgtgt acagggaaatttatagcactaaatgcccacaagagaaagcagaaaagatctgaaattgac accctaacatcccaattaagagaactagagaagcaagagcaaacacattcaaaagctagc agaaggcaagaaataagtaagatcagaacagaactgaaggagacagagatacaaaaaacc cttcaaaaaatcagcgaacccaggagcttattttttgaaaagatcaacaaaattgataga ctgctagcaagactaataaagaagaaaagagagaagaatcaaatagatgcaatgaaaaat gataaaggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatactat aaacacctctacacaaataaactagaaaatctagaagaactggataaattgctggacaca tacaccttccccaaaataaaccaggaagaagttgaatccctgaatagaccaataacagga tctgaaattgaggcaataattaatagcctaccaaccaaaaaaagtccaggaccaggtgga ttcacagctgaattctaccagagatacaaggaggagctgagaataaaatacctaggaatc caacttaaaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacaaa ataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaat atcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccagccac ctacaagtgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaa aaaagagcctgcattgccaagtcaatcctaagcaaaaagaacaaagctggagacatcatg ctacctgacttcaaactatactacaagcctacagtaaccaaaacagcatggtactggtac caaaacagagatatagaccaatggaacagaacagaggcctcagaaataataccacacatc tacaaccatctgatctttgacaaacctgacaaaaacaagaaatgggggaaggattcccta tttaataaatggtgctgggaaaactggctagccatatggagaaagctgaaactggatccc ttccttacaccttatacaaaaattcattcaagatggattaaagacttcaatgtcagacct aaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggc aaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgg gatctaattaaactaaggagcttctgcacagtaaaagaaactaccattagagtgaacagg caacctgcagaatgggagaaaaattttgcaatctatccatctgacaaatggctaatatcc agaatctacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatcaaaaag tgggcgaaggatatgaacagacacttctcaaaaaaagacatttatgcagtcaacagacac atgaaaaaatgctcgtcatcactggtcatcacagaaatgcaaatcaaaaccatgcctccg ctgctgatacccaggcaaaaagggtccggagtggacctctggcaaactccaacagacctg cagctgagggtcctgactgttaaaaggaaaactaacaaacagaaaggacatccacaccaa aactccatctgtacgtcaccatcatcaaagactaaagagaagtccttaaaggacctgatg gagctgaaaaccatggcacgagaactacatgatgaatgcacaagcttcagtagctggttt gatcaactggaagaaagggtatcagtgatggaagatcaaaagaatgaaatgaagcaagaa gagaagtttagagaaaaaagaataaaaagaaatgaacaaagcctccaagaaatatgggac tatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgatggggagaatgga accaagttggaaaacactctgcaggatattatccaggagaacttccccaatccagcaaag caggccaacattcaaattcaggaaatacagagaacgccacaaagatactcctcgagaaga gcaactccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgtta agggcagccaaagagaaaggtcgggttacccacaaaggaaagcccatcagactaacagct gatctctcagcagaaactctacaagccagaagagagtgggggccaatattcaacattctt aaagaaaagaattttcaacccagaatttcatatccagccaaactaagcttcataagtgaa ggaaaaataaaatcctttccagacaagcaaatgctgagagattttgtcaccattaggcct gccctacaagagctcctgaaggaagcactaaacatggaatggaacaaccagtaccagccc ctgcaaaaacatgccaaattgtaa >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_10|169_aa MKTTAGEDAVNIVEMTIKDLEYDINLVEKAVAGFERMTSILKEVLLDTKLSNYPHKRAPS SEPKIRTEVQRGDMTLLGNTGQGTGKGGTYCTVLPPTAQVVMGGRKTTEKKWLCWVLGRW NLRQRCDETTTLGQEGSKMRMLQELQRAPGVEAMHTRTMDASRAASVDR >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_10|510_bp atgaaaactactgctggtgaagatgctgtgaacattgttgaaatgacaataaaggattta gaatatgacataaacttagttgagaaagcagtagcaggatttgagaggatgacttcaatt ttgaaagaagttctactggacaccaaattgagcaactatccgcacaaaagagcaccttct tcagaaccaaaaatcagaactgaggttcagagaggagacatgactcttttggggaacaca ggacagggcacaggcaaaggtgggacttactgcacagtccttcctcctacagcacaagtt gtaatgggaggtagaaagaccacagagaagaagtggctctgttgggtactgggcaggtgg aatctcaggcagcgttgtgatgaaaccaccacactggggcaagaaggtagcaaaatgaga atgctacaggagctgcagagagctcctggcgtggaggccatgcacacaaggacaatggat gcctctagagcagcctctgttgatcggtga >gi568815594f:94108395_94389531|GENSCAN_predicted_peptide_11|56_aa ELATSAGNLATGSRNACSPGFLLSHVPSVRDPTEDRTVQLTWQPLPEPLELWPKAL >gi568815594f:94108395_94389531|GENSCAN_predicted_CDS_11|171_bp gagcttgctacaagtgctggaaatctggccaccgggtcaaggaatgcctgcagcccagga ttcctcctaagccatgtcccatctgtgcgggaccccactgaagatcggactgttcaactc acctggcagccactcccagagcccctggaactctggcccaaggctctctga