GENSCAN 1.0    Date run: 8-Nov-116    Time: 15:33:34

Sequence gi568815595f:149029724_149272219 : 242496 bp : 38.55% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.21  PlyA -    363    358    6                               1.05
1.20  Term -   2649   2497  153  0  0   88   28   124 0.991   3.34
1.19  Intr -   5275   5195   81  1  0   61   87   102 0.985   6.32
1.18  Intr -   9506   9326  181  1  1  107  100   123 0.999  14.35
1.17  Intr -   9970   9858  113  1  2   61   97    66 0.999   3.06
1.16  Intr -  10433  10308  126  2  0   75   98    79 0.994   7.56
1.15  Intr -  11945  11767  179  0  2   68   92    51 0.555   2.22
1.14  Intr -  12567  12443  125  2  2  111   96    18 0.983   4.21
1.13  Intr -  16536  16357  180  2  0   38   86    73 0.449   0.16
1.12  Intr -  18440  18305  136  0  1   90   97    18 0.817   1.61
1.11  Intr -  19278  19140  139  0  1   53   89    63 0.738   2.02
1.10  Intr -  25677  25580   98  1  2   55   87    69 0.526   2.31
1.09  Intr -  31135  31056   80  0  2   31   87   110 0.473   3.48
1.08  Intr -  33801  33708   94  1  1   60   92    11 0.438  -3.20
1.07  Intr -  35143  35068   76  1  1   49   94    72 0.892   2.07
1.06  Intr -  38612  38517   96  2  0   33   98    61 0.594   0.89
1.05  Intr -  41720  41529  192  2  0   69   19   171 0.979   7.07
1.04  Intr -  41934  41860   75  0  0   53   57   110 0.899   3.19
1.03  Intr -  43599  43502   98  1  2   54   69    80 0.890   1.51
1.02  Intr -  44625  44492  134  2  2   51  106    68 0.998   4.27
1.01  Init -  46309  46158  152  1  2   40   57   203 0.764  11.96
1.00  Prom -  51282  51243   40                              -6.45

2.00  Prom +  52099  52138   40                              -5.55
2.01  Init +  55483  55540   58  0  1   59   45    58 0.647  -1.88
2.02  Intr +  56353  56526  174  2  0   13   77   257 0.782  16.09
2.03  Intr +  56609  56755  147  0  0   76  115    74 0.977   8.19
2.04  Intr +  67304  67429  126  0  0   26  105    88 0.007   4.03
2.05  Intr +  79541  79654  114  0  0   78   48    61 0.116   0.60
2.06  Intr +  79940  80061  122  0  2   81   28   114 0.091   3.99
2.07  Intr +  80175  80399  225  2  0   45   26   199 0.099   6.66
2.08  Intr +  89107  89241  135  0  0   25   45   113 0.084   0.54
2.09  Intr +  89376  89549  174  2  0   68   97   108 0.149   8.91
2.10  Term +  96315  96500  186  2  0  -22   36   242 0.732   4.51
2.11  PlyA +  97305  97310    6                               1.05

3.00  Prom +  98643  98682   40                              -7.25
3.01  Init + 100001 100217  217  1  1   80  105   148 0.849  14.82
3.02  Intr + 103896 104015  120  1  0   71   -5   120 0.081   0.55
3.03  Intr + 110281 110775  495  2  0   72   94   330 0.433  24.04
3.04  Intr + 111294 111465  172  1  1   34   35   108 0.968  -1.82
3.05  Intr + 111572 111657   86  2  2   84  119    66 0.963   7.84
3.06  Intr + 115631 115823  193  0  1   86   79    49 0.945   1.43
3.07  Intr + 120876 120957   82  0  1   91  116    61 0.883   7.92
3.08  Intr + 127627 127808  182  0  2   85   94   151 0.781  13.14
3.09  Intr + 128943 129123  181  0  1   85  -16   131 0.961   1.35
3.10  Intr + 130323 130556  234  2  0   81   64    98 0.733   3.66
3.11  Intr + 132476 132610  135  1  0   29   76    96 0.373   2.34
3.12  Intr + 132901 133155  255  0  0   25   99   207 0.556  12.22
3.13  Intr + 134119 134226  108  0  0   71  111     9 0.633   1.06
3.14  Intr + 137311 137517  207  0  0   88   67    89 0.745   5.15
3.15  Term + 142478 142534   57  1  0   75   53    28 0.053  -5.29
3.16  PlyA + 143210 143215    6                               1.05

4.22  PlyA - 143524 143519    6                               1.05
4.21  Term - 146689 146489  201  1  0   28   42    90 0.195  -5.19
4.20  Intr - 148256 148117  140  0  2   54  116   103 0.995   8.96
4.19  Intr - 148908 148692  217  0  1   78   80   167 0.926  12.05
4.18  Intr - 151371 151250  122  1  2    7   69   162 0.962   5.49
4.17  Intr - 152410 152282  129  2  0   96   78    67 0.911   6.25
4.16  Intr - 153882 153743  140  2  2   72   77   183 0.921  14.79
4.15  Intr - 155723 155516  208  0  1   52   15   143 0.801   0.61
4.14  Intr - 158479 158329  151  1  1   80   93   106 0.942   9.11
4.13  Intr - 168819 168644  176  1  2   84  111   159 0.608  16.54
4.12  Intr - 170141 169989  153  0  0   83   81    29 0.604   0.82
4.11  Intr - 172518 172379  140  2  2  107   92   169 0.999  18.39
4.10  Intr - 174090 173986  105  2  0   78    6   111 0.543   0.31
4.09  Intr - 176616 176445  172  1  1  111   86   195 0.999  19.68
4.08  Intr - 177894 177640  255  1  0   86   97   245 0.995  21.69
4.07  Intr - 179676 179488  189  1  0   -5   50   241 0.376   9.64
4.06  Intr - 180656 180444  213  0  0   86   83   144 0.972  11.46
4.05  Intr - 182975 182728  248  1  2  104  110   131 0.998  13.08
4.04  Intr - 188301 188143  159  2  0   42   41   131 0.666   1.98
4.03  Intr - 192089 191924  166  0  1  109  113    15 0.783   4.20
4.02  Intr - 197090 196939  152  1  2   79   23   116 0.893   3.09
4.01  Init - 198350 198097  254  2  2   68   98   123 0.586   8.16
4.00  Prom - 200720 200681   40                              -4.25

5.00  Prom + 203062 203101   40                              -3.65
5.01  Sngl + 207135 207491  357  2  0   60   49   160 0.251   5.11
5.02  PlyA + 207732 207737    6                               1.05

6.15  PlyA - 208848 208843    6                               1.05
6.14  Term - 209096 208967  130  1  1   90   38    82 0.297   0.07
6.13  Intr - 211778 211586  193  0  1  106   69    79 0.345   5.43
6.12  Intr - 214732 214531  202  1  1  108   44   183 0.822  13.84
6.11  Intr - 216046 215940  107  2  2   86   94    -1 0.808  -0.69
6.10  Intr - 217773 217645  129  1  0  109   93    16 0.765   3.95
6.09  Intr - 219438 219293  146  2  2   59   64   119 0.407   5.61
6.08  Intr - 220446 220272  175  1  1  110    0    84 0.429  -0.12
6.07  Intr - 223335 223123  213  1  0   57   71    80 0.545   0.96
6.06  Intr - 224545 224395  151  1  1  108   56   139 0.988  11.51
6.05  Intr - 226418 226207  212  0  2   62  109   127 0.267   9.91
6.04  Intr - 229940 229651  290  1  2   12   54   217 0.048   6.67
6.03  Intr - 237314 237143  172  0  1  115   90    75 0.036   8.48
6.02  Intr - 240339 239992  348  1  0   48   54   170 0.240   4.00
6.01  Intr - 241197 241027  171  1  0   68   26   198 0.996  10.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:149029724_149272219|GENSCAN_predicted_peptide_1|835_aa MVALQRDPNNPYDKNAIKVNNVNGNQVGHLKKELAGALAYIMDNKLAQIEGVVPFGANNA FTMPLHMTFWGKEENRKAVSDQLKKHGFKLGPAPKTLGFNLESGWGSGRAGPSYSMPVHA AVQMTTEQLKTEFDKLFEDLKEDDKTHEMEPAEAIETPLLPHQKQALAWMVSRENSKELP PFWEQRNDLYYNTITNFSEKDRPENVHGGILADDMGLGKTLTAIAVILTNFHDGRPLPIE RVKKNLLKKEYNVNDDSMKLGGNNTSEKADGLSKDASRCSEQPSISDIKEKSKFRMSELS SSRPKRRKTAVQYIESSDSEEIETSELPQKMKGACAVEGSKKTDVEERPRTTLIICPLSV LSNWITKGDSPLHSIRWLRVILDEGHAIRNPNAQQTKAVLDLESERRWVLTGTPIQNSLK DLWSLLSFLKLKPFIDREWWHRTIQRPVTMGDEGGLRRLQSLIKNITLRRTKTSKIKGKP VLELPERKVFIQHITLSDEERKIYQSVKNEGRATIGRYFNEGTVLAHYADVLGLLLRLRQ ICCHTYLLTNAVSSNGPSGNDTPEELRKKLIRKMKLILSSGSDEECAICLDSLTVPVITH CAHVFCKPCICQVIQNEQPHAKCPLCRNDIHEDNLLECPPEELARDSEKKSDMEWTSSSK INALMHALTDLRKKNPNIKSLVVSQFTTFLSLIEIPLKASGFVFTRLDGSMAQKKRVESI QCFQNTEAGSPTIMLLSLKAGGVGLNLSAASRVFLMDPAWNPAAEDQCFDRCHRLGQKQE VIITKFIVKDSVEENMLKIQNKKRELAAGAFGTKKPNADEMKQAKINEIRTLIDL >gi568815595f:149029724_149272219|GENSCAN_predicted_CDS_1|2508_bp atggttgcattacaacgagatcctaataacccttatgataagaatgcaattaaagtaaac aatgtgaatggaaatcaagttggccatttaaagaaagagcttgcaggtgctttggcctat atcatggacaacaaattggcacaaattgaaggggtagttccttttggtgcaaacaatgct tttaccatgcctctgcatatgactttttggggaaaagaagaaaatagaaaagcggtttca gatcagttgaagaaacatggatttaaattgggtcctgcaccaaaaactttaggattcaat ttggaaagtggttggggctctggaagagctggaccaagctatagtatgccagtgcatgct gcagtacagatgacaactgaacagcttaaaacagaatttgacaaattgtttgaagattta aaagaagatgataaaacccatgaaatggaaccagctgaggctattgaaacaccactgctt ccacatcaaaaacaagctctagcttggatggtgtcacgggaaaatagcaaagaacttcca ccattctgggaacagcgaaatgacttatactataacacaataacaaatttttctgagaag gaccgaccagaaaatgtccatggaggaattttagctgatgatatgggtttgggtaaaact cttacggccattgcagtaatccttaccaacttccatgatggcagacctcttcctattgaa agagttaaaaagaatctactgaagaaggaatataatgttaacgatgactctatgaaactt ggaggaaacaataccagtgaaaaggcagatggactaagcaaagacgcatctagatgtagt 
gaacaacccagtatttcagatatcaaggagaagagtaagtttcgcatgtcagaattgtct agctcccgccccaaaagaagaaaaactgctgtccagtacatagaaagcagtgattcagag gaaattgaaacaagtgaattgccgcagaaaatgaaaggagcttgtgcagtggaggggtca aagaaaactgatgttgaggagagaccaagaacaacactgatcatctgtccgctttctgtg ttaagcaactggattactaaaggagatagtccattacatagcataaggtggctaagagtg atcctggatgaaggacatgccatacgaaatccaaatgctcagcagacaaaagctgtactt gacttagaatcagaaagaagatgggttttgacaggtactccaatccagaattctttaaag gacttgtggtctcttctttcctttttaaaacttaaaccatttattgatagagaatggtgg catagaacaatacagcgtcctgtcacaatgggagatgaaggaggacttaggcgtttacag tccctaattaaaaatattacacttagaagaacaaagacaagcaaaattaaaggaaaacct gttttggagttaccagaacgtaaagtatttattcagcacattacactttcagatgaagag agaaagatttatcagtctgtgaaaaatgaaggcagagccactattggaaggtattttaat gaagggactgtcctggcacattatgcagatgtcctgggtcttttgcttagactgcggcaa atttgttgccatacttaccttcttacaaatgcagtgtcttccaatggcccctcaggaaat gatacacctgaagaactgagaaagaagttaataaggaagatgaagttaattctgagctca ggttcagatgaggaatgtgcaatttgcctggattctttaacagttcctgtgataacacat tgtgcacatgtattttgtaaaccctgtatttgccaagtcattcagaatgagcagccacat gctaaatgccctttatgcagaaatgatatacatgaagataatttattagaatgtcctcca gaagaattagcacgtgacagtgagaaaaagtctgatatggaatggacatccagttcaaag attaatgcgctaatgcacgcattgactgacttaagaaagaagaatcccaacataaaaagt ttggttgtttctcagtttacaacattcctgtctttaatagaaataccacttaaagcctct ggatttgtgtttactcgtttggatggttccatggcccaaaagaaaagagttgaatcaatt cagtgttttcaaaacactgaagcaggatctccaactataatgcttctgtccttaaaagca ggtggagttggtttgaatctgtctgcagcttctcgagtgtttttaatggatccagcctgg aatcctgctgctgaagatcagtgctttgacagatgccatagacttggtcagaagcaagaa gttatcatcacaaaattcattgtaaaggactctgttgaagaaaatatgctgaaaatacaa aacaaaaagagagaacttgcagcaggagcctttggaactaaaaaaccaaatgctgacgaa atgaaacaagccaaaattaatgaaatcagaacattaattgacttataa >gi568815595f:149029724_149272219|GENSCAN_predicted_peptide_2|486_aa MGRMRWLTPGIPALWEAEAGPPQSVTEISQVPNTGLALGAPTRQRRQCIYFSGGHICDQQ NEYSCTNRPGNAEERGEGHGAEWDDKRSASAPLDRFRAASIRLLPGPAALKPGTNSERRI RSARLKGVKLRTFTVSVTALKVARLELFVPPGGFVVSLASGVKLQTFAASAPILAMLEEP 
FSPPLHCGEPLSGLAKARAGSLSLRGARASPTSTVPCSTATGPIDHPRAEECRCTAGDWQ AAPPAARLKVCKHTNQHPVSSSGFVNAPIDTLHLANLAGTWRTFVPSSGIVNTPISTLSK QTNQLSVKQTNRLAVKWTNQQDRGKRKGNSRTHPTPGGTRPALGDGHMCPRGTRSGPSLE HSADAEQTTEARINPYLSLCRGNQGSPRRLSPGEGVCEASLRLDSLAPHGVNKVPKIETL DGVEKSSEKGGIGSSKGDRGGKSEVDCQKPSEENDLRRESDTMLLTGQDESISFGNVGDL KGKDTL >gi568815595f:149029724_149272219|GENSCAN_predicted_CDS_2|1461_bp atgggccggatgcggtggctcacgcctggaatcccagcactttgggaggcagaggcaggc cccccacagtcggtgacagagatttcccaagtccctaacacgggactcgccctaggagcc cctactcgccagcgaagacaatgcatttatttctccggcggccacatatgcgaccaacag aacgaatacagctgcacaaatcgcccagggaacgcagaggaacgcggggaaggacatggc gctgagtgggatgacaagaggagcgcctcggctcccctggatcgttttcgagccgcctcg atacgcctccttccaggccccgcagccctgaagccggggacaaattccgagcgccggatc aggagcgcacgactgaaaggagtgaagctgcggaccttcacggtgagtgttacagctctt aaggtggcgcgtctggagttgtttgttcctcccggtgggttcgtggtctcgctggcttca ggagtgaagctgcagaccttcgctgcctcggcgcccattctggccatgcttgaggagccc ttcagcccaccgctgcactgtggggagcccctttctgggctggccaaggccagagctggc tccctcagcttgcggggagcccgagcctccccaacgagcaccgtcccctgctccacagcc accggtcccatcgaccacccaagggctgaggagtgcaggtgcacagcaggggactggcag gcagctccacctgcagcccggctcaaggtttgtaagcacaccaatcagcaccctgtgtct agctcagggtttgtgaatgcaccaatcgacactctgcatctggctaatctagcggggacg tggagaacttttgtgcctagctcagggattgtaaacacaccaatcagcaccctctcaaaa cagaccaatcagctctctgtaaaacagaccaatcggcttgctgtaaaatggaccaatcag caggatagaggcaaacggaagggtaacagtcgcacccaccccacccctggtggcaccagg ccagccttgggggatggccacatgtgcccacgtggcacaagatcaggccccagccttgag cacagtgcagatgccgagcagactacagaggccagaattaatccatatctttccctttgc cgagggaaccaaggcagcccccgccggctttctcctggcgaaggtgtctgcgaagcctct cttagacttgacagtttggcacctcatggtgtcaataaagttccaaaaattgagactttg gatggagtggaaaagagttcagagaaaggagggataggatccagtaaaggagacagagga ggaaaatcagaagtggactgtcagaagcccagtgaagagaatgacttaagaagagagagt gatacaatgctgctgacgggtcaagatgagagcattagttttggcaatgtgggagatctt aaaggcaaggataccctctaa >gi568815595f:149029724_149272219|GENSCAN_predicted_peptide_3|907_aa MVQLYNLHPFGSQQVVPCKLEPDRFCGGGRDALFVAAGCKVEAFAVAGQELCQPRCAFST 
LGRVLRLAYSEAGFATATPASSDHHPDQSAAISIGARPFTSEKICLVKGLDDRDYLVAIE EKNKATFLRAYVNWRNKRTENSRVCIRMIGHNVEGPFSKAFRDQMYIIEMPLSEAPLCIS CCPVKGDLLVGCTNKLVLFSLKYQIINEEFSLLDFERSLIIHIDNITPVEVSFCVGYVAV MSDLEVLIVKLESGPKNGERVHHHPHKTNNRIRRTEEGISNEISQLESDDFVICQKPLEL LGEKSEQSGLSVTLESTGLADEKRKYSHFQHLLYRRFAPDISSYVLSDDIKLHSLQLLPI YQTGSLTSDGKNLSQEKELLSLFCFFSLPHVGYLYMVVKSVELMSVYQYPEKSQQAVLTP QFLHVITSNNLQCFTVRCSAAAAREEDPYMDTTLKVDYSNTYKTVKTQSCIHLLSEAHLL VRAALMDASQLEPGEKAELLEAFKESCGHLGDCYSRLDSQHSHLTLPYYKMSGLSMAEVL ARTDWTVEDGLQKYERGLIFYINHSLYENLDEELNEELAAKVVQMFYVAEPKQVPHILCS PSMKNINPLTAMSYLRKLDTSGFSSILVTLTKAAVALKMGDLDMHRNEMKSHSERKGQIV PTELALHLKETQPGLLVASVLGLQKNNKIGIEEADSFFKPSCRFISAIFIWNIEAPFTVF KVLCAKDEDTIPQLLVDFWEAQLVACLPDVVLQELFFKLTSQYIWRLSKRQPPDTTPLRT SEDLINACSHYGLIYPWVHVVISSDSLADKNYTEDLSKLQSLICGPSFDIASIIPFLEPL SEDTIAGLSVHVLCRTRLKEYEQCIDILLERCPEAVIPYANHELKEENRSKETIDLKVSF EKYHNGI >gi568815595f:149029724_149272219|GENSCAN_predicted_CDS_3|2724_bp atggtgcagctgtacaacctgcacccgttcgggtcgcagcaggtggtgccctgcaagctg gagccggaccggttctgtggcggggggcgtgacgcgcttttcgtggcggcgggctgcaag gtggaggcgttcgcggtggccggccaggagctgtgccagccgcggtgcgccttctccacg ctgggccgggtgttgcgcctggcctacagcgaggctggatttgccacagccaccccagcc tccagtgaccaccaccctgatcagtcagcagccatcagcatcggggcaagacccttcacc agtgaaaagatatgtctcgttaaaggcttggatgatcgagattatttggtagcaattgaa gagaaaaacaaagctacatttctacgtgcttatgtgaactggagaaataaaaggactgaa aactctcgtgtgtgtatccgaatgattgggcataatgtggagggaccattcagcaaagcc ttcagagaccagatgtacattattgaaatgccgctttcggaggcccccttgtgcatttcc tgttgccctgtgaaaggagaccttctcgttggctgcacaaataaattagtcttatttagt ttgaagtaccagatcattaatgaggaattctcactattggactttgaacgttctttaatt atacacatagataatatcactcctgttgaggtttctttttgtgttggatatgttgctgtc atgtcagacttagaagtcttaatcgtaaaactggagtcaggccctaaaaatggagagaga gttcaccaccatccacataagaccaacaatcgaataagacggacagaagaaggcatcagt aatgaaatttcacagcttgagtcagatgattttgtcatctgccagaagcccctggaactt cttggtgaaaaaagtgaacagtctggattatctgttacactggagtctacgggattagct gatgaaaaaagaaaatattcccactttcagcacctgctctatagacgttttgctcctgat 
atttcgtcctatgtcttgtctgatgacatcaagctacattccctccagctgctacccatt taccagaccggttctcttacatctgatggaaaaaatttgtctcaggaaaaagaattgctg agtctcttttgctttttctccttacctcatgtgggctatctctacatggttgtcaaatct gttgaattgatgtcagtctaccagtatcctgaaaagtctcagcaggcagtactcacgcca caatttttgcacgtcattacaagtaacaacctgcagtgtttcactgtgcggtgcagtgcg gcggcagctcgtgaggaggacccgtacatggacaccaccctgaaggtagactatagcaat acctataagactgtcaaaacccagagctgcattcaccttctcagtgaggctcatctgtta gtgcgagctgccctgatggatgccagtcagctggaacctggagagaaggcagagcttttg gaagcatttaaggaaagctgtgggcaccttggggactgttacagcaggcttgactcccag cattctcatctcaccttgccatactataagatgtctggtttgtctatggctgaagttctg gcccgcacggactggacagtagaggatggattacagaaatacgagagaggattaatcttt tacattaatcattcactttatgaaaacctggatgaagaattaaatgaagaattagcagca aaagtggttcagatgttttatgtggctgagccaaagcaagtgccccatattctctgtagt ccttctatgaagaatattaatcctttaactgccatgagctatctaaggaagctggatact tctgggttttcatcgatcttagtgacattgaccaaggcagcagtggctctgaaaatggga gatcttgacatgcacagaaatgaaatgaaaagccattcagagagaaagggacagattgtt ccaaccgagcttgcacttcacttgaaggaaactcagcctggattgcttgtggcttcagtt ctgggcttgcagaagaacaacaaaattggaattgaagaagcagattccttttttaagccc agctgtcgctttatcagtgctatatttatctggaatatagaggctccttttactgttttt aaggtgctttgtgctaaggatgaagatacaattcctcagctcttggtagacttttgggaa gctcagctagtggcatgtctcccagatgtggtacttcaggaactctttttcaaactcaca tcacagtacatctggagattgtctaagaggcagcctcctgacaccacaccattgcgaaca tcggaggatctgataaatgcctgtagtcattatggcttaatttatccatgggttcacgtc gtaatatcatctgattctttagctgataaaaattatacagaagatctttcaaaattacag tctcttatatgtggtccttcatttgacatagcttccattattccgttcttggagccactt tcagaagacactattgccggcctcagtgtccatgttctgtgtcgtacacgcttgaaagag tatgaacagtgcatagacatactgttagagagatgcccggaggcagtcattccatatgct aatcatgaactgaaagaagagaaccggtcgaaagaaaccattgacttaaaggtatcattt gaaaaataccataatggcatttga >gi568815595f:149029724_149272219|GENSCAN_predicted_peptide_4|1229_aa MVAAAPHVTSTLKAEKMEKGGVSVTSEETTANFCLFLIGQNHYYLATPNCKGDWKRTFVA SIMKGTRETGLEMSCGLANSLCFRQDMDGTGSHYPQQTNAGVEKQAPHALTYKWELNKKT HGHREGNNTYWGLSGGSKKGKKMKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASD 
HGEKKLISVDTSYKHYYRVFHSIRGFQNKWYSEDVTISRPNDQEEITAQILTKKLDNKEH STLREHSNIYLQNGPDRIGRLYKKALYLQYTDETFRTTIEKPVWLGFLGPIIKAETGDKV YVHLKNLASRPYTFHSHGITYYKEHEGAIYPDNTTDFQRADDKVYPGEQYTYMLLATEEQ SPGEGDGNCVTRIYHSHIDAPKDIASGLIGPLIICKKVTLTPDSLDKEKEKHIDREFVVM FSVVDENFSWYLEDNIKTYCSEPEKVDKDNEDFQESNRMYSVNGYTFGSLPGLSMCAEDR VKWYLFGMGNEVDVHAAFFHGQALTNKNYRIDTINLFPATLFDAYMVAQNPGEWMLSCQN LNHLKAGLQAFFQVQECNKSSSKDNIRGKHVRHYYIAAEEIIWNYAPSGIDIFTKENLTA PGRCKVHVDKEESTLLMVKSSTHSRIPSKEQVFDAFVNDSAVFFEQGTTRIGGSYKKLVY REYTDASFTNRKERGPEEEHLGILGPVIWAEVGDTIRVTFHNKGAYPLSIEPIGVRFNKN NEGTYYSPNYNPQSRKTFTYEWTVPKEVGPTNADPVCLAKMYYSAVEPTKDIFTGLIGPM KICKKGSLHANGRQKDVDKEFYLFPTVFDENESLLLEDNIRMFTTAPDQVDKEDEDFQES NKMHWTFNVECLTTDHYTGGMKQKYTVNQCRRQSEDSTFYLGERTYYIAAVEVEWDYSPQ REWEKELHHLQEQNVSNAFLDKGEFYIGSKYKKVVYRQYTDSTFRVPVERKAEEEHLGIL GPQLHADVGDKVKIIFKNMATRPYSIHAHGVQTESSTVTPTLPVTLRGQGQNLGSVKDHC NNLGKIIIIIIIIEIWNMVEAAEVDLYSGLIGPLIVCRRPYLKVFNPRRKLEFALLFLVF DENESWYLDDNIKTYSDHPEKVNKDDEEFIESNKMHAINGRMFGNLQGLTMHVGDEVNWY LMGMGNEIDLHTVHFHGHSFQYKHRGVYSSDVFDIFPGTYQTLEMFPRTPGIWLLHCHVT DHIHAGMETTYTVLQNEGEYPGSNSRSHI >gi568815595f:149029724_149272219|GENSCAN_predicted_CDS_4|3690_bp atggttgctgcagctccacatgttacatccacattgaaggcagagaagatggaaaaaggg ggagtatcagtcacatctgaagagacaacagccaatttctgcctttttcttattggtcaa aaccattattatctggctactcctaactgcaaaggagactggaaaagaacttttgtagcc tctatcatgaaaggaacaagggagactgggttagaaatgtcttgtgggttggctaacagc ctttgctttagacaggacatggatggaactggaagccattatcctcagcaaactaatgca ggagtagaaaaacaagcaccgcatgctctcacttataagtgggagctgaacaagaaaaca catgggcacagggaggggaacaacacatactggggcctgtcggggggctccaagaagggg aaaaaaatgaagattttgatacttggtatttttctgtttttatgtagtaccccagcctgg gcgaaagaaaagcattattacattggaattattgaaacgacttgggattatgcctctgac catggggaaaagaaacttatttctgttgacacttcttacaaacattattaccgggttttt cattccattagaggtttccaaaacaaatggtacagtgaagatgtcactataagcaggcct aatgaccaagaagaaataacagctcaaatcctgaccaaaaaactagacaacaaagaacat tccaccttacgggaacattccaatatctatcttcaaaatggcccagatagaattgggaga ctatataagaaggccctttatcttcagtacacagatgaaacctttaggacaactatagaa 
aaaccggtctggcttgggtttttaggccctattatcaaagctgaaactggagataaagtt tatgtacacttaaaaaaccttgcctctaggccctacacctttcattcacatggaataact tactataaggaacatgagggggccatctaccctgataacaccacagattttcaaagagca gatgacaaagtatatccaggagagcagtatacatacatgttgcttgccactgaagaacaa agtcctggggaaggagatggcaattgtgtgactaggatttaccattcccacattgatgct ccaaaagatattgcctcaggactcatcggacctttaataatctgtaaaaaagtaacttta actccagattctctagataaagaaaaagaaaaacatattgaccgagaatttgtggtgatg ttttctgtggtggatgaaaatttcagctggtacctagaagacaacattaaaacctactgc tcagaaccagagaaagttgacaaagacaacgaagacttccaggagagtaacagaatgtat tctgtgaatggatacacttttggaagtctcccaggactctccatgtgtgctgaagacaga gtaaaatggtacctttttggtatgggtaatgaagttgatgtgcacgcagctttctttcac gggcaagcactgactaacaagaactaccgtattgacacaatcaacctctttcctgctacc ctgtttgatgcttatatggtggcccagaaccctggagaatggatgctcagctgtcagaat ctaaaccatctgaaagccggtttgcaagcctttttccaggtccaggagtgtaacaagtct tcatcaaaggataatatccgtgggaagcatgttagacactactacattgccgctgaggaa atcatctggaactatgctccctctggtatagacatcttcactaaagaaaacttaacagca cctggaagatgtaaggtccacgtggacaaggaagagtccaccttgcttatggttaagtca tcaacacatagcaggattcccagcaaagagcaggtgtttgatgcatttgtcaatgactca gcggtgttttttgaacaaggtaccacaagaattggaggctcttataaaaagctggtttat cgtgagtacacagatgcctccttcacaaatcgaaaggagagaggccctgaagaagagcat cttggcatcctgggtcctgtcatttgggcagaggtgggagacaccatcagagtaaccttc cataacaaaggagcatatcccctcagtattgagccgattggggtgagattcaataagaac aacgagggcacatactattccccaaattacaacccccagagcagaaaaacattcacctat gaatggactgtccccaaagaagtaggacccactaatgcagatcctgtgtgtctagctaag atgtattattctgctgtggaacccactaaagatatattcactgggcttattgggccaatg aaaatatgcaagaaaggaagtttacatgcaaatgggagacagaaagatgtagacaaggaa ttctatttgtttcctacagtatttgatgagaatgagagtttactcctggaagataatatt agaatgtttacaactgcacctgatcaggtggataaggaagatgaagactttcaggaatct aataaaatgcactggacttttaatgttgaatgccttacaactgatcattacacaggcggc atgaagcaaaaatatactgtgaaccaatgcaggcggcagtctgaggattccaccttctac ctgggagagaggacatactatatcgcagcagtggaggtggaatgggattattccccacaa agggagtgggaaaaggagctgcatcatttacaagagcagaatgtttcaaatgcattttta 
gataagggagagttttacataggctcaaagtacaagaaagttgtgtatcggcagtatact gatagcacattccgtgttccagtggagagaaaagctgaagaagaacatctgggaattcta ggtccacaacttcatgcagatgttggagacaaagtcaaaattatctttaaaaacatggcc acaaggccctactcaatacatgcccatggggtacaaacagagagttctacagttactcca acattaccagtgaccctaagggggcaaggacagaatctggggtcagttaaggaccactgc aataatctgggtaaaatcatcatcatcatcatcatcatcgagatttggaatatggtggaa gcggctgaagtggacctctacagtggattaattggccccctgattgtttgtcgaagacct tacttgaaagtattcaatcccagaaggaaactggaatttgcccttctgtttctagttttt gatgagaatgaatcttggtacttagatgacaacatcaaaacatactctgatcaccccgag aaagtaaacaaagatgatgaggaattcatagaaagcaataaaatgcatgctattaatgga agaatgtttggaaacctacaaggcctcacaatgcacgtgggagatgaagtcaactggtat ctgatgggaatgggcaatgaaatagacttacacactgtacattttcacggccatagcttc caatacaagcacaggggagtttatagttctgatgtctttgacattttccctggaacatac caaaccctagaaatgtttccaagaacacctggaatttggttactccactgccatgtgacc gaccacattcatgctggaatggaaaccacttacaccgttctacaaaatgaaggtgaatat ccaggtagtaattctagaagccatatataa >gi568815595f:149029724_149272219|GENSCAN_predicted_peptide_5|118_aa MQRTPVRYSMRRSTPRHIIIKFSKIEMKEKMLRAARDKGQVTYKGKPVRLTVDFSVETLQ ARRDWGPIFNILKEKNFQPRISYLAKLSFFVSEGKIRLFSDKQMQKEFVTIILLCKSS >gi568815595f:149029724_149272219|GENSCAN_predicted_CDS_5|357_bp atgcagagaaccccagtaagatactccatgagaagatcaaccccaaggcacataatcatc aaattctccaagattgaaatgaaagaaaaaatgttaagggcagctagggacaaaggccag gtcacctacaaagggaagcccgtcagactaacagtggacttctcagtggaaaccctacaa gccagaagagattgggggccaatattcaacattcttaaagaaaagaatttccaacccaga atttcatatctggccaaactaagcttcttcgtaagtgaaggaaaaataagattattttcg gacaagcaaatgcagaaggaatttgtcaccatcatcctgctttgcaagagctcctga >gi568815595f:149029724_149272219|GENSCAN_predicted_peptide_6|879_aa XTLNGDTEKDIDRSSFLMFSTTDESRSWYSDENIRAFTESGKINTSDPRFEESMSMQSIN GYIYGNLPNLTMCAEDRVQWYFVGMGGVADIHPVYLRGQTLISRNHRKDTIMLFPSSLED AFMVAKAPGVWMLGCQIHGSDLLLLRDTESENFQGHSPLQMPYLTNEETDIQEESMQAFF KVSNCQKPSTEAFVTGTHVIHYYIAAKEILWNYAPSGIDFFTKKNLTAAGRWVPAEVMAA GWADLSSYPGSSVQIQRWLVEKVDPQASITLVGSGRSGTPPETPGEEECIDAGDGRQGRL IPRSLDSTCGHWQRWHQAEWTCSQALDGTPPPSSHVSPGTTFVYTWEVPKDVGPTSTDPN 
CLTWFYYSSVNGKKDINSGLLGPLLICRNGSLGDDGKQKGVDKEFYLLATIFDENESNLL DENIRTFITEPENIDKEDTDCQASNKMYSINGYMYGNLPGLDTCLGDNVLWHVFSVGSVE DLHGIYFSGNTFTSLGARRDTIPMFPYTSQTLLMTPDSIGTFDLVCMTIKHNLGGMKHKY HVRQCGKPNPDQTQYQEEKIIITIAAEEMEWDYSPSRNQTSMYVDRSGTLLGSKYKKVLY RQYDDNTFTNQTKRNEGEKHLDILGIGPLILLNPGQIIQIIFKNKAARPYSIHAHGVKTN NSTVVPTQPGEIQIYTWQIPDRTGPTSLDFECIPWFYYSTVSVAKDLHSGLVGPLSVCRK DINPNIVHRVLHFMIFDENESWYFEDSINTYASKPNKVDKENDNFQLSNQMHAINGRLFG NNQGITFHVGDVVNWYLIGIGNEADLHTVHFHGHSFEYKVRAFYSPYPVMSCKPVIRSLN KLFQSIANQEIFESTYDLEASPPPVVPPFQTEPVYILCV >gi568815595f:149029724_149272219|GENSCAN_predicted_CDS_6|2640_bp nggacactgaatggagacactgaaaaagatattgacaggtcttcttttctgatgttttct acaactgatgaaagcagaagctggtatagtgatgaaaatattcgtgcatttactgaatct ggcaagattaatactagtgatccccgttttgaggagagcatgagcatgcaatcaataaat ggatacatctatggaaatctgcccaatctcaccatgtgtgctgaagatagggtccagtgg tattttgttggcatgggtggcgtggctgacatacaccccgtctacctccgcggacaaact ctgatctctcggaatcacagaaaggacaccattatgctcttcccctcctcactggaagat gccttcatggtggccaaggcccctggagtgtggatgctgggatgccagatacatggtagt gatctattacttttgcgtgatacagagtcagagaacttccaaggtcatagcccattacaa atgccttaccttacaaatgaagaaaccgatatccaagaagagagtatgcaggcatttttc aaagtaagtaattgccagaaaccttcaacagaagcctttgttactgggacacatgttata cattactatattgctgctaaagaaattctttggaactatgctccatctggtatagatttc ttcactaaaaaaaatttaacagcagctggaaggtgggtgccagcagaggtgatggcagca ggttgggcagacctgtcctcataccctggaagtagtgtacaaattcagcggtggttggtg gagaaggttgatcctcaggcttcgataacacttgtgggctctggcaggtcaggcacgccc cctgagactcctggggaggaggagtgcatagacgctggtgatggcaggcagggcagattg atccccaggtccctggacagcacatgtgggcattggcagaggtggcatcaggcagagtgg acctgttctcaggccctcgatggtacccctccaccctcttcacatgtaagtcctggcaca acatttgtctatacatgggaagttccaaaagatgtgggtcccacctccacagatcccaac tgcttgacctggttctattactcttcagtaaatgggaaaaaagacatcaacagtggcctt ctggggcctctccttatatgtagaaatggaagtcttggagacgatggcaaacagaaagga gtagacaaagagttttacctacttgccacaatatttgatgaaaatgaaagtaatctcttg gatgaaaatatcagaacatttatcacagagcctgaaaacatagataaagaggatacagac tgccaagcctcaaataagatgtactccataaatggatacatgtatggaaatctgcctgga 
ttggacacgtgcttaggagacaacgttttgtggcacgtttttagtgtaggatcagtggaa gatttacacgggatatatttttcaggaaataccttcacttctttaggagcaagaagggac acaatacctatgtttccttatacttctcagacgcttttgatgacacctgattctatagga acttttgatttggtttgcatgacaataaagcacaatctaggaggcatgaaacataaatat cacgtgaggcaatgtgggaagccaaaccctgatcaaacacaataccaggaggagaaaata attattaccattgcagccgaggaaatggaatgggattattctcctagtagaaaccaaacg agcatgtatgtggacagaagtggaacacttcttgggtccaaatacaagaaagtcttatat cgtcaatatgatgataacacgttcacaaatcaaacaaaaaggaatgaaggtgaaaaacat ctcgatatactaggtattggtccattaatattgctcaaccctggtcaaataattcaaatt atctttaaaaataaagccgcaagaccgtattctattcatgctcatggagtgaaaacaaat aattccactgttgttccaactcagccaggagagattcaaatatatacttggcagatacct gatagaactggtcctacctcactggactttgaatgcataccttggttttactattcaact gtatctgtggctaaggaccttcacagtggactggtaggccctctctctgtatgccgcaaa gacatcaaccccaacatagttcaccgtgttctccacttcatgatatttgatgagaatgaa tcctggtacttcgaagacagtatcaacacctatgcttcaaaaccaaacaaagtggacaag gaaaatgataattttcaactcagcaaccaaatgcacgcaattaacggaagactgtttgga aataaccaaggtataacattccatgttggggatgtagtgaattggtatctgattggcata gggaatgaagctgacctgcacacagttcactttcatggccatagctttgaatacaaggta agagccttctacagtccatatcctgtgatgtcttgtaaaccagttatcaggtctttaaat aaactctttcaatcaattgccaatcaggaaatctttgaatctacctatgacctggaagcc tcacctcctccagttgtcccacctttccagactgaaccagtgtacatcttatgtgtatga