GENSCAN 1.0    Date run: 5-Nov-116    Time: 21:52:42

Sequence gi568815581r:41877261_42117416 : 240156 bp : 50.98% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.24 Intr -    936    843   94  2  1   92   73   121 0.817  10.97
 1.23 Intr -   1664   1537  128  2  2  115  100   129 0.999  16.28
 1.22 Intr -   5972   5862  111  2  0   80   64   143 0.995  11.68
 1.21 Intr -   7014   6933   82  2  1  104  105    65 0.999   9.44
 1.20 Intr -   9048   8852  197  0  2  125   89   312 0.999  33.21
 1.19 Intr -  10443  10339  105  0  0   48   48   240 0.999  16.41
 1.18 Intr -  15187  15019  169  0  1   88   76   259 0.999  24.65
 1.17 Intr -  15914  15773  142  0  1   81   94   209 0.999  20.21
 1.16 Intr -  20579  20489   91  2  1  105   88    44 0.991   5.67
 1.15 Intr -  21525  21371  155  1  2  114   86   220 0.997  24.19
 1.14 Intr -  24577  24436  142  1  1   74  103   155 0.873  15.53
 1.13 Intr -  27530  27469   62  0  2   57   87   145 0.977   9.75
 1.12 Intr -  28398  28262  137  2  2   76   81   238 0.711  22.11
 1.11 Intr -  29386  29268  119  1  2   87   60    93 0.995   5.66
 1.10 Intr -  30349  30182  168  1  0   27   94   164 0.429  11.04
 1.09 Intr -  31808  31721   88  1  1  105   45   113 0.426   8.67
 1.08 Intr -  31983  31925   59  0  2   94   57    27 0.517  -2.12
 1.07 Intr -  32440  32225  216  1  0   37   23   355 0.497  22.60
 1.06 Intr -  33024  32962   63  0  0  112   91    90 0.989  10.61
 1.05 Intr -  35282  35160  123  2  0   97   79   102 0.903  10.98
 1.04 Intr -  36636  36455  182  1  2   72   99   178 0.461  16.89
 1.03 Intr -  41919  41620  300  1  0   78   83   131 0.366   8.31
 1.02 Intr -  42621  42530   92  2  2   55   95   -17 0.178  -4.66
 1.01 Init -  42945  42845  101  0  2   72   65    65 0.230   2.33
 1.00 Prom -  46778  46739   40                              -9.46

 2.00 Prom +  47655  47694   40                              -2.66
 2.01 Init +  53069  53408  340  1  1   62    2   224 0.903   8.41
 2.02 Intr +  53444  53577  134  0  2   82   92   215 0.999  21.66
 2.03 Intr +  57957  58088  132  2  0   82   57    73 0.493   4.44
 2.04 Intr +  58339  58489  151  0  1   96   81   206 0.999  20.44
 2.05 Intr +  59213  59274   62  0  2  140   94    40 0.999   8.25
 2.06 Intr +  59502  59667  166  2  1   58    8   262 0.919  14.73
 2.07 Intr +  61297  61521  225  2  0  110   25   472 0.974  41.06
 2.08 Intr +  61705  61912  208  2  1   90   73   221 0.951  18.94
 2.09 Intr +  67876  67962   87  1  0   99   94    88 0.969   9.59
 2.10 Intr +  71893  72089  197  1  2  117   86   285 0.999  30.26
 2.11 Intr +  77957  78057  101  0  2   91   39   174 0.913  12.63
 2.12 Intr +  84122  84206   85  1  1   85  105   138 0.997  14.59
 2.13 Term +  87733  88223  491  2  2   87   48   257 0.950  16.42
 2.14 PlyA +  89201  89206    6                               1.05

 3.00 Prom +  89307  89346   40                             -11.82
 3.01 Init +  89625  89627    3  2  0   85   81     0 0.866  -1.00
 3.02 Intr +  90808  91480  673  0  1  117   51  1471 0.995 137.55
 3.03 Intr +  94632  94771  140  1  2  119   76   134 0.875  15.48
 3.04 Term +  96215  96664  450  1  0  118   48   482 0.984  42.39
 3.05 PlyA +  96805  96810    6                              -3.24

 4.15 PlyA -  97174  97169    6                              -0.45
 4.14 Term -  98075  97951  125  0  2   42   49   111 0.411   1.15
 4.13 Intr - 100063 100001   63  2  0   85  100    79 0.925   7.49
 4.12 Intr - 104747 104595  153  0  0   80   89   235 0.999  22.84
 4.11 Intr - 105141 104995  147  1  0   47   80   207 0.945  16.01
 4.10 Intr - 106376 106303   74  1  2   67  115   113 0.954  10.95
 4.09 Intr - 110650 110559   92  1  2   35  103    80 0.937   2.99
 4.08 Intr - 111636 111472  165  0  0   88  121    84 0.999  11.86
 4.07 Intr - 112297 112144  154  0  1   70   89   133 0.999  11.67
 4.06 Intr - 113122 113004  119  1  2  126   57   110 0.999  10.96
 4.05 Intr - 117684 117610   75  0  0   65   87   109 0.876   8.21
 4.04 Intr - 119164 119051  114  1  0   88   80    99 0.972   9.74
 4.03 Intr - 119979 119855  125  1  2   93   79    67 0.961   6.60
 4.02 Intr - 123310 123222   89  0  2  130   66    63 0.982   7.91
 4.01 Init - 140156 140080   77  2  2   89   76   167 0.876  16.16
 4.00 Prom - 141465 141426   40                              -9.06

 5.00 Prom + 142392 142431   40                              -7.76
 5.01 Init + 144318 144411   94  2  1   79   92    93 0.944   9.61
 5.02 Intr + 145139 145380  242  0  2   60   85   389 0.930  33.07
 5.03 Term + 146394 146633  240  2  0  117   49   292 0.999  24.23
 5.04 PlyA + 147518 147523    6                              -0.45

 6.10 PlyA - 148337 148332    6                               1.05
 6.09 Term - 149873 149637  237  2  0   79   41   120 0.967   2.57
 6.08 Intr - 150468 150358  111  0  0   46   65   104 0.693   4.48
 6.07 Intr - 150986 150790  197  0  2   99   75    70 0.873   5.93
 6.06 Intr - 151675 151523  153  2  0   91   82    18 0.674   1.54
 6.05 Intr - 154524 154359  166  0  1  127   86    56 0.991   8.83
 6.04 Intr - 157075 156965  111  1  0   90   81    57 0.973   5.78
 6.03 Intr - 160625 160477  149  0  2  116   68   101 0.912  10.85
 6.02 Intr - 163929 162486 1444  0  1  106   20  1635 0.953 147.71
 6.01 Init - 166116 165574  543  0  0   79   94   712 0.769  65.80
 6.00 Prom - 167972 167933   40                              -9.46

 7.07 PlyA - 169599 169594    6                              -0.45
 7.06 Term - 171530 171390  141  2  0   -4   49   106 0.361  -4.57
 7.05 Intr - 173375 173297   79  1  1   64   75    85 0.631   4.35
 7.04 Intr - 178137 178040   98  0  2  133   55    35 0.539   3.51
 7.03 Intr - 185296 185132  165  1  0   92   81    35 0.300   3.36
 7.02 Intr - 185798 185620  179  0  2   95   55   208 0.657  17.84
 7.01 Init - 196632 196575   58  0  1  111   99    42 0.531   7.97
 7.00 Prom - 199510 199471   40                              -4.46

 8.04 PlyA - 199553 199548    6                              -0.45
 8.03 Term - 201462 201346  117  0  0   72   53    75 0.691   0.94
 8.02 Intr - 203901 203848   54  0  0   79   75    53 0.696   2.28
 8.01 Init - 204917 204840   78  2  0   89  103    15 0.736   4.18
 8.00 Prom - 206770 206731   40                              -4.96

 9.22 PlyA - 207355 207350    6                               1.05
 9.21 Term - 224686 224501  186  1  0  117   47   275 0.993  23.79
 9.20 Intr - 225052 224956   97  0  1  113   94   115 0.990  14.61
 9.19 Intr - 226538 226348  191  2  2  105   43   311 0.996  26.68
 9.18 Intr - 227667 227506  162  0  0  122   52   314 0.997  31.37
 9.17 Intr - 227907 227758  150  0  0  101   78   177 0.989  18.36
 9.16 Intr - 228729 228476  254  1  2   43   64   278 0.681  18.05
 9.15 Intr - 230535 230344  192  1  0  112   77   315 0.994  32.26
 9.14 Intr - 230848 230722  127  1  1   64   85   198 0.978  17.35
 9.13 Intr - 231317 231024  294  2  0   86   80    41 0.439   0.31
 9.12 Intr - 232126 232010  117  1  0   81  101    83 0.987   9.56
 9.11 Intr - 233653 233463  191  2  2   67   76   291 0.991  25.10
 9.10 Intr - 234237 234036  202  0  1  103   81   291 0.978  28.76
 9.09 Intr - 234633 234465  169  2  1   40   96   203 0.243  16.25
 9.08 Intr - 236582 236393  190  0  1   99   32   547 0.249  48.84
 9.07 Intr - 236824 236740   85  1  1   62   73   166 0.961  11.89
 9.06 Intr - 237022 236959   64  0  1   83   81   107 0.974   8.22
 9.05 Intr - 237134 237098   37  0  1   68   84   -13 0.809  -6.38
 9.04 Intr - 237344 237230  115  2  1   98   62   234 0.736  21.82
 9.03 Intr - 237775 237629  147  1  0   69   42   402 0.821  34.13
 9.02 Intr - 238573 238463  111  1  0  137   60   296 0.985  32.28
 9.01 Intr - 239901 239775  127  2  1  116  105   180 0.986  23.18

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_1|1042_aa
MGSKFSYLTPGSNRIGGGGLGEDFVCKCRNTVAVTDFTYSAPQVHQHLSDWMSQFLVWKI
RSVSGSQEAVDGPAGLQVPAGPGARPRGAEPSQQRIGRSPGAQARERVWPIAGPFSPRAD
GGGEKSGWAGTKAGSREATGCWGAPDFAGFVGPVEEAPRTDFGRGRAGLSAAMSAKAISE
QTGKELLYKFICTTSAIQNRFKYARVTPDTDWARLLQDHPWLLSQNLVVKPDQLIKRRGK
LGLVGVNLTLDGVKSWLKPRLGQEATVGKATGFLKNFLIEPFVPHSQAEEFYVCIYATRE
GDYVLFHHEGGVDVGDVDAKAQKLLVGVDEKLNPEDIKKHLLVHAPEDKKELSERDGREW
LKSYVQGAGVAEEMEGPVGILASFISGLFNFYEDLYFTYLEINPLGNWSRCLMIASAPLE
VVTKDGVYVLDLAAKVDATADYICKVKWGDIEFPPPFGREAYPEEAYIADLDAKSGASLK
LTLLNPKGRIWTMVAGGGASVVYSDTICDLGGVNELANYGEYSGAPSEQQTYDYAKTILS
LMTREKHPDGKILIIGGSIANFTNVAATFKSHVTLGFQGIVRAIRDYQGPLKEHEVTIFV
RRGGPNYQEGLRVMGEVGKTTGIPIHVFGTETHMTAIVGMALGHRPIPNQPPTAAHTANF
LLNASGSTSTPAPSRTASFSESRADEVAPAKKAKPAMPQGKSTTLFSRHTKAIVWGMQTR
AVQGMLDFDYVCSRDEPSVAAMVYPFTGDHKQKFYWGHKEILIPVFKNMADAMRKHPEVD
VLINFASLRSAYDSTMETMNYAQIRTIAIIAEGIPEALTRKLIKKADQKGVTIIGPATVG
GIKPGCFKIGNTGGMLDNILASKLYRPGSVAYVSRSGGMSNELNNIISRTTDGVYEGVAI
GGDRYPGSTFMDHVLRYQDTPGVKMIVVLGEIGGTEEYKICRGIKEGRLTKPIVCWCIGT
CATMFSSEVQFGHAGACANQASETAVAKNQALKEAGVFVPRSFDELGEIIQSVYEDLVAN
GVIVPAQEVPPPTVPMDYSWAR

>gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_1|3126_bp
atgggtagcaagttcagttaccttactcctggcagcaacaggattgggggagggggtctt
ggagaagattttgtctgcaaatgtcgcaacactgttgcagtgactgatttcacatattct
gccccacaagttcaccagcacctgtcagattggatgtcccagtttttggtttggaaaatt
cgatctgtctcaggctcccaagaggcggtggatgggccagcgggactacaagtcccagca
ggccccggggcccgccctcgtggggcggagccaagccagcagcgaattgggaggagccct
ggcgctcaggctagggaacgcgtgtggccaatcgcggggccgttctcgccgcgagccgat
gggggcggggaaaagtccggctgggccgggacaaaagccggatcccgggaagctaccggc tgctggggtgctccggattttgcggggttcgtcgggcctgtggaagaagcgccgcgcacg gacttcggcagaggtagagcaggtctctctgcagccatgtcggccaaggcaatttcagag cagacgggcaaagaactcctttacaagttcatctgtaccacctcagccatccagaatcgg ttcaagtatgctcgggtcactcctgacacagactgggcccgcttgctgcaggaccacccc tggctgctcagccagaacttggtagtcaagccagaccagctgatcaaacgtcgtggaaaa cttggtctcgttggggtcaacctcactctggatggggtcaagtcctggctgaagccacgg ctgggacaggaagccacagttggcaaggccacaggcttcctcaagaactttctgatcgag cccttcgtcccccacagtcaggctgaggagttctatgtctgcatctatgccacccgagaa ggggactacgtcctgttccaccacgaggggggtgtggacgtgggtgatgtggacgccaag gcccagaagctgcttgttggcgtggatgagaaactgaatcctgaggacatcaaaaaacac ctgttggtccacgcccctgaagacaagaaagagttgagtgagagggacggtagggagtgg ctgaagagctatgtccagggagcaggggttgcagaggagatggaagggccagtgggaatt ctggccagttttatctccggcctcttcaatttctacgaggacttgtacttcacctacctc gagatcaatccccttggtaactggagccggtgtctgatgattgcctctgctccccttgaa gtagtgaccaaagatggagtctatgtccttgacttggcggccaaggtggacgccactgcc gactacatctgcaaagtgaagtggggtgacatcgagttccctccccccttcgggcgggag gcatatccagaggaagcctacattgcagacctcgatgccaaaagtggggcaagcctgaag ctgaccttgctgaaccccaaagggaggatctggaccatggtggccgggggtggcgcctct gtcgtgtacagcgataccatctgtgatctagggggtgtcaacgagctggcaaactatggg gagtactcaggcgcccccagcgagcagcagacctatgactatgccaagactatcctctcc ctcatgacccgagagaagcacccagatggcaagatcctcatcattggaggcagcatcgca aacttcaccaacgtggctgccacgttcaagtcacatgtcactttgggcttccagggcatc gtgagagcaattcgagattaccagggccccctgaaggagcacgaagtcacaatctttgtc cgaagaggtggccccaactatcaggagggcttacgggtgatgggagaagtcgggaagacc actgggatccccatccatgtctttggcacagagactcacatgacggccattgtgggcatg gccctgggccaccggcccatccccaaccagccacccacagcggcccacactgcaaacttc ctcctcaacgccagcgggagcacatcgacgccagcccccagcaggacagcatctttttct gagtccagggccgatgaggtggcgcctgcaaagaaggccaagcctgccatgccacaagga aagagcaccaccctcttcagccgccacaccaaggccattgtgtggggcatgcagacccgg gccgtgcaaggcatgctggactttgactatgtctgctcccgagacgagccctcagtggct gccatggtctaccctttcactggggaccacaagcagaagttttactgggggcacaaagag 
atcctgatccctgtcttcaagaacatggctgatgccatgaggaagcatccggaggtagat gtgctcatcaactttgcctctctccgctctgcctatgacagcaccatggagaccatgaac tatgcccagatccggaccatcgccatcatagctgaaggcatccctgaggccctcacgaga aagctgatcaagaaggcggaccagaagggagtgaccatcatcggacctgccactgttgga ggcatcaagcctgggtgctttaagattggcaacacaggtgggatgctggacaacatcctg gcctccaaactgtaccgcccaggcagcgtggcctatgtctcacgttccggaggcatgtcc aacgagctcaacaatatcatctctcggaccacggatggcgtctatgagggcgtggccatt ggtggggacaggtacccgggctccacattcatggatcatgtgttacgctatcaggacact ccaggagtcaaaatgattgtggttcttggagagattgggggcactgaggaatataagatt tgccggggcatcaaggagggccgcctcactaagcccatcgtctgctggtgcatcgggacg tgtgccaccatgttctcctctgaggtccagtttggccatgctggagcttgtgccaaccag gcttctgaaactgcagtagccaagaaccaggctttgaaggaagcaggagtgtttgtgccc cggagctttgatgagcttggagagatcatccagtctgtatacgaagatctcgtggccaat ggagtcattgtacctgcccaggaggtgccgcccccaaccgtgcccatggactactcctgg gccagg >gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_2|792_aa MAVRRAHILTTYASSLFGTLPPTLFLFSWLSYKGAFSGSNSRENRQSVQQQTLTSPAVSH AIITPWPSARQNWRKRTPAGCTQNPGPCQTGAEPEVAVHLMVAKKRSFHKPDRASLNPVT MSDPEGETLRSTFPSYMAEGERLYLCGEFSKAAQSFSNALYLQDGDKNCLVARSKCFLKM GDLERSLKDAEASLQSDPAFCKGILQKAETLYTMGDFEFALVFYHRGYKLRPDREFRVGI QKAQEAINNSVGSPSSIKLENKGDLSFLSKQAENIKAQQKPQPMKHLLHPTKGEPKWKAS LKSEKTVRQLLGELYVDKEYLEKLLLDEDLIKGTMKGGLTVEDLIMTGINYLDTHSNFWR QQKPIYARERDRKLMQEKWLRDHKRRPSQTAHYILKSLEDIDMLLTSGSAEGSLQKAEKV LKKVLEWNKEEVPNKDELVGNLYSCIGNAQIELGQMEAALQSHRKDLEIAKEYDLPDAKS RALDNIGRVFARVGKFQQAIDTWEEKIPLAKTTLEKTWLFHEIGRCYLELDQAWQAQNYG EKSQQCAEEEGDIEWQLNASVLVAQAQVKLRDFESAVNNFEKALERAKLVHNNEAQQAII SALDDANKGIIRELRKTNYVENLKEKSEGEASLYEDRIITREKDMRRVRDEPEKVVKQWD HSEDEKETDEDDEAFGEALQSPASGKQSVEAGKARSDLGAVAKGLSGELGTRSGETGRKL LEAGRRESREIYRRPSGELEQRLSGEFSRQEPEELKKLSEVGRREPEELGKTQFGEIGET KKTGNEMEKEYE >gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_2|2379_bp atggcggtacgccgtgcacatatccttaccacgtatgcctcatccctattcggaactttg cctccaaccctcttcctgttttcttggctgtcttacaagggagccttttcaggcagcaac tccagagagaaccgccagagcgtgcaacagcaaacactaaccagcccagccgtcagccac 
gcaatcatcactccctggcccagtgcgcggcagaactggcgcaagcgcacgccggcaggt tgtacccagaatccgggcccttgccagacgggggcggaaccggaagtcgctgtacatctc atggttgctaagaaacggagcttccacaaaccagatagagcgtctctaaatccggtcacc atgtcggaccccgaaggcgagaccttgcgaagcacctttccctcttatatggccgaaggc gagcggctctacctgtgcggggaattttctaaagccgcgcagagcttcagcaacgctctt taccttcaggatggagacaagaactgcctggttgctcgctcaaagtgcttcctgaagatg ggagacttggagagatccctgaaggatgctgaggcttcgctccagagtgacccagctttc tgtaaggggattttgcaaaaggctgagacactgtacaccatgggagactttgagtttgcc ttggtattctatcatcgaggctacaagctgaggcctgatcgggaattcagagttggcatt cagaaagcccaggaagccatcaacaactcagtgggaagtccttcttccattaagctggag aacaaaggggacctctccttcttaagcaagcaggctgagaatataaaagcccagcagaag cctcagcccatgaaacacctcttacaccccaccaagggagagcccaagtggaaggcctcg ctcaagagtgagaagactgtccgccagcttctgggggagctctacgtggacaaagagtat ttggagaagctcctattggatgaagacctgatcaaaggcaccatgaagggcggcctgact gtggaggacctcatcatgacgggcatcaactacctggatactcacagcaacttctggagg cagcagaagccgatctacgccagggagcgggaccggaagctgatgcaagagaaatggctg cgggaccacaaacgccgtccctcacagacagcccattacatcctcaagagcctggaggac attgatatgttgctcacaagtggcagtgctgaagggagtcttcagaaagctgagaaagtg ctgaagaaggtactggaatggaacaaggaagaggtacccaacaaggatgaactggttgga aacttgtatagctgcatagggaatgcccagattgagctggggcagatggaggcagccctg cagagccacagaaaggacctggagatcgccaaggaatatgaccttcctgatgcaaaatcg agagcccttgacaacattggcagagtttttgccagagttgggaaattccagcaagccatt gacacgtgggaagaaaagatccctctggcaaaaaccaccctggagaagacctggctgttc cacgagatcggccgctgctacttggagctggaccaggcctggcaggcccagaattatggc gagaagtcccagcagtgtgccgaggaggaaggggacattgagtggcaactgaatgccagt gttctggtggcccaggcacaagtgaagctgagagacttcgagtcagccgtgaacaatttt gagaaggccctggagagagcaaagcttgtgcataacaacgaggcgcagcaggccatcatc agtgccttggacgatgccaacaagggtatcatcagagaactgaggaaaaccaactacgtg gagaatctcaaagaaaaaagcgagggagaagcttcactgtatgaagatagaataataaca agagagaaggacatgaggagagtgagagatgagcccgagaaggtggtgaagcagtgggac catagtgaggatgagaaagagacagatgaggacgatgaggcttttggggaagctctgcag agcccagcaagcggaaagcagagtgtggaagcaggaaaagccagaagcgatttgggagca 
gttgccaagggcctgtcaggagaattaggcacaagatcaggagaaacaggcaggaagcta ctagaagctggcagaagagagtcaagagaaatttataggaggccttcgggagaattagag caaagactctcaggagaattcagcagacaggaaccagaagaactaaagaaactttcagaa gtgggcagaagagagccagaagaactgggaaaaacacaatttggagaaataggagaaacg aaaaaaacaggaaatgagatggaaaaggaatatgaatga >gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_3|421_aa MNRGFSRKSHTFLPKIFFRKMSSSGAKDKPELQFPFLQDEDTVATLLECKTLFILRGLPG SGKSTLARVIVDKYRDGTKMVSADAYKITPGARGAFSEEYKRLDEDLAAYCRRRDIRILV LDDTNHERERLEQLFEMADQYQYQVVLVEPKTAWRLDCAQLKEKNQWQLSADDLKKLKPG LEKDFLPLYFGWFLTKKSSETLRKAGQVFLEELGNHKAFKKELRQFVPGDEPREKMDLVT YFGKRPPGVLHCTTKFCDYGKAPGAEEYAQQDVLKKSYSKAFTLTISALFVTPKTTGARV ELSEQQLQLWPSDVDKLSPTDNLPRGSRAHITLGCAADVEAVQTGLDLLEILRQEKGGSR GEEVGELSRGKLYSLGNGRWMLTLAKNMEVRAIFTGYYGKGKPVPTQGSRKGGALQSCTI I >gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_3|1266_bp atgaacagaggcttctcccgaaaaagccacacattcctgcccaagatcttcttccgcaag atgtcatcctcaggggccaaggacaagcctgagctgcagtttcccttccttcaggatgag gacacagtggccacgctgctagagtgcaagacgctcttcatcttgcgcggcctgccagga agcggcaagtccacgctggcacgggtcatcgtggacaagtaccgtgatggcaccaagatg gtgtcggctgacgcttacaagatcacccccggcgctcgaggagccttctccgaggagtac aagcggctcgatgaggacctggctgcctactgccgccgccgggacatcagaattcttgtg cttgatgacaccaaccacgaacgggaacggctggagcagctctttgaaatggccgaccag taccagtaccaggtggtgctggtggagcccaagacggcgtggcggctggactgtgcccag ctcaaggagaagaaccagtggcagctgtcggctgatgacctgaagaagctgaagcctggg ctggagaaggacttcctgccgctctacttcggctggttcctgaccaagaagagctctgag accctccgcaaagccggccaggtcttcctggaagagctggggaaccacaaggccttcaag aaggagctgcgacaattcgtccctggggatgagcccagggagaagatggacttggtcacc tactttggaaagagacccccaggcgtgctgcattgcacaaccaagttttgtgactacggg aaggctcccggggcagaggagtacgctcaacaagatgtgttaaagaaatcttactccaag gccttcacgctgaccatctctgccctctttgtgacacccaagacgactggggcccgggtg gagttaagcgagcagcaactgcagttgtggccgagtgatgtggacaagctgtcacccact gacaacctgccgcgggggagccgcgcccacatcaccctcggctgtgcagctgacgtagag gccgtgcagacgggccttgacctcttagagattctgcggcaggagaaggggggcagccga 
ggcgaggaggtgggcgagctaagccggggcaagctctattccttgggcaatgggcgctgg atgctgaccctggccaagaacatggaggtcagggccatcttcacggggtactacgggaaa ggcaaacctgtgcccacgcaaggtagccggaaggggggcgccttgcagtcctgcaccatc atatga >gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_4|523_aa MAAAAECDVVMAATEPELLDDQEAKREAETFKEQGNAYYAKKDYNEAYNYYTKAIDMCPK NASYYGNRAATLMMLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMAACRSFQ RALELDHKNAQAQQEFKNANAVMEYEKIAETDFEKRDFRKVVFCMDRALEFAPACHRFKI LKAECLAMLGRYPEAQSVASDILRMDSTNADALYVRGLCLYYEDCIEKAVQFFVQALRMA PDHEKACIACRNAKALKAKKEDGNKAFKEGNYKLAYELYTEALGIDPNNIKTNAKLYCNR GTVNSKLRKLDDAIEDCTNAVKLDDTYIKAYLRRAQCYMDTEQYEEAVRDYEKVYQTEKT KEHKQLLKNAQLELKKSKRKDYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAEV QKEEEKKFKEVGEAFTILSDPKKKTRYDSGQDLDEEGMNMGDFDPNNIFKAFFGGPGGFS FEANTVQLPNLTTAVPGQQRGDRTDGPKLWDKASPNTPSPRED >gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_4|1572_bp atggcggctgccgcggagtgcgatgtggtaatggcggcgaccgagccggagctgctcgac gaccaagaggcgaagagggaagcagagactttcaaggaacaaggaaatgcatactatgcc aagaaagattacaatgaagcttataattattatacaaaagccatagatatgtgtcctaaa aatgctagctattatggtaatcgagcagccaccttgatgatgcttggaaggttccgggaa gctcttggagatgcacaacagtcagtgaggttggatgacagttttgtccggggacatcta cgagagggcaagtgccacctctctctggggaatgccatggcagcatgtcgcagcttccag agagccctagaactggatcataaaaatgctcaggcacaacaagagttcaagaatgctaat gcagtcatggaatatgagaaaatagcagaaacagattttgagaagcgagattttcggaag gttgttttctgcatggaccgtgccctagaatttgcccctgcctgccatcgcttcaaaatc ctcaaggcagaatgtttagcaatgctgggtcgttatccagaagcacagtctgtggctagt gacattctacgaatggattccaccaatgcagatgctctgtatgtacgaggtctttgcctt tattacgaagattgtattgagaaggcagttcagtttttcgtacaggctctcaggatggct cctgaccacgagaaggcctgcattgcctgcagaaatgccaaagcactcaaagcaaagaaa gaagatgggaataaagcatttaaggaaggaaattacaaactagcatatgaactgtacaca gaagccctggggatagaccccaacaatataaaaacaaatgctaaactctactgtaatcgg ggtacggttaattccaagcttaggaaactagatgatgcaatagaagactgcacaaatgca gtgaagcttgatgacacttacataaaagcctacttgagaagagctcagtgttacatggac acagaacagtatgaagaagcagtacgagactatgaaaaagtataccagacagagaaaaca 
aaagaacacaaacagctcctaaaaaatgcgcagctggaactgaagaagagtaagaggaaa gattactacaagattctaggagtggacaagaatgcctctgaggacgagatcaagaaagct tatcggaaacgggccttgatgcaccatccagatcggcatagtggagccagtgctgaggtt cagaaggaggaggagaagaagttcaaggaagttggagaggcctttactatcctctctgat cccaagaaaaagactcgctatgacagtggacaggacctagatgaggagggcatgaatatg ggtgattttgatccaaacaatatcttcaaggcattctttggcggtcctggcggcttcagc tttgaagccaacactgttcagctgcccaacctcaccacagctgtgcccggccagcagagg ggagacaggaccgatggccccaagctgtgggacaaagccagtcctaatacacccagccct agggaagactga >gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_5|191_aa MGKSCKVVVCGQASVGKTSILEQLLYGNHVVGSEMIETQEDIYVGSIETDRGVREQVRFY DTRGLRDGAELPRHCFSCTDGYVLVYSTDSRESFQRVELLKKEIDKSKDKKEVTIVVLGN KCDLQEQRRVDPDVAQHWAKSEKVKLWEVSVADRRSLLEPFVYLASKMTQPQSKSAFPLS RKNKGSGSLDG >gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_5|576_bp atggggaagagctgcaaggtggtcgtgtgtggccaggcgtctgtgggcaaaacttcaatc ctggagcagcttctgtatgggaaccatgtagtgggttcggagatgatcgagacgcaggag gacatctacgtgggctccattgagacagaccggggggtgcgagagcaggtgcgtttctat gacacccgggggctccgagatggggccgaactgccccgacactgcttctcttgcactgat ggctacgtcctggtctatagcacagatagcagagagtcttttcagcgtgtggagctgctc aagaaggagattgacaaatccaaggacaagaaggaggtcaccatcgtggtccttggcaac aagtgtgacttacaggagcagcggcgtgtagacccagatgtggctcagcactgggccaag tcagagaaggtgaagctgtgggaggtgtcagtggcggaccggcgctccctcctggagccc tttgtctacttggccagcaagatgacgcaaccccagagcaagtctgccttccccctcagc cggaagaacaagggcagcggctccttggatggctga >gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_6|1036_aa MVPPGKKPAGEASNSNKKCKRYFNEHWKEEFTWLDFDYERKLMFCLECRQALVRNKHGKA ENAFTVGTDNFQRHALLRHVTSGAHRQALAVNQGQPPFEGQAEGGGACPGLATTPASRGV KVELDPAKVAVLTTVYCMAKEDVPNDRCSALLELQRFNLCQALLGTEHGDYYSPRRVRDM QVAIASVLHTEACQRLKASPYVGLVLDETRDWPESHSLALFATSVSPCDGQPATTFLGSV ELQEGEATAGQLLDILQAFGVSAPKLAWLSSSLPSERLGSVGPQLRATCPLLAELHCLPG RTDPEPPAYLGQYESILDALFRLHGGPSSHLVPELRAALDLAAIDLAGPRPVPWASLLPV VEAVAEAWPGLVPTLEAAALASPVAGSLALALRQFTFVAFTHLLLDALPSVQKLSLVLQA EEPDLALLQPLVMAAAASLQAQRGSGGARLQGFLQELASMDPDASSGRCTYRGVELLGYS 
EAAVRGLEWLRGSFLDSMRKGLQDSYPGPSLDAVAAFAAIFDPRRYPQAPEELGTHGEGA LRVLLRGFAPAVVRQRALGDFALFKRVVFGLGRLGPRALCTQLACAHSELHELFPDFAAL AALALALPAGAGLLDKVGRSRELRWWGQSGAGEGRGGHMVKIAVDGPPLHEFDFGLAVEF LETGPASGAPSPLLASLPLPTRPLQPPLDFKHLLAFHFNGAAPLSLFPNFSTMDPVQKAV ISHTFGVPSPLKKKLFISCNICHLRFNSANQAEAHYKGHKHARKLKAVEAAKSKQRPHTQ AQDGAVVSPIPTLASGAPGEPQSKEPGREAPGPEPAAAAVGSSMSGEGRSEKGHLYCPTC KVTVNSASQLQAHNTGAKHRWMMEGQRGAPRRSRGRPVSRGGAGHKAKRVTGGRGGRQGP SPAFHCALCQLQVNSETQLKQHMSSRRHKDRLAGKTPKPSSQHSKLQKHAALAVSILKSK LALQKQLTKTLAARFLPSPLPTAATAICALPGPLALRPAPTAATTLFPAPILGPALFRTP AGAVRPATGPIVLAPY >gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_6|3111_bp atggtgcccccagggaagaaaccagcgggagaggcctccaactccaacaagaagtgtaag cgttacttcaacgagcactggaaagaggagtttacctggctggactttgactatgagcgg aagctgatgttctgcctcgagtgccgccaggccctggtacggaacaagcatggcaaagcc gaaaacgccttcactgtgggcacagacaacttccagcgccacgccctgctgcgccacgtg acctcaggagcccaccgccaggctctggctgtcaaccagggccagcccccttttgagggc caggctgaaggtggaggggcctgcccaggcctggctacgacccctgcctccaggggcgtc aaggtggagttagacccggccaaagtggctgtgctgactactgtgtactgcatggcaaag gaggatgtgcccaatgaccgctgctctgccctgctcgagctgcagaggttcaacctgtgc caggcactgctgggcacagagcatggcgattactacagtcccaggagggtgagggacatg caggtggccattgccagtgtcttgcacacagaggcctgccagcgcctgaaggcatcccca tatgtggggctggtgttggacgagaccagagactggccggagtcacacagcctggccctg tttgccacttcagtgtccccctgtgatggccagcccgccaccaccttcctgggcagtgtg gagctacaggagggcgaggccactgctggccagctcttggacatcctgcaggctttcggc gtatctgcacccaagctggcctggctcagctcaagcctccccagtgagcgcctggggagt gtgggcccacagctccgggccacttgcccactgctggcagagctgcattgtctccctggc cggacagatcctgagcccccggcctacttgggtcaatatgagagcatattggatgcccta ttccgcctccatggtggccctagttcccacttggtccctgagctccgggcagcactggac cttgcagctattgacttggcagggcctcggccagtgccctgggcctccctgctgcctgta gtggaagcagtggccgaggcctggcctggcctggtgcccaccctggaggctgcagccctt gcctcacctgtggcggggtcactggccctggccctgcgccagttcaccttcgtggccttc acccacctgctgctggatgccctgccctctgtgcagaagctctcccttgtcctgcaggca gaagagccggacttggccttgctgcagcctctggtgatggcggctgcggcctccctccaa 
gctcagcgcggctcaggtggggcccgcctccagggcttcctgcaggaactggcatccatg gaccctgacgccagcagcggacgctgcacctaccgcggcgtggagctgctcggttactcc gaggctgcggtccggggcttggagtggctccggggatccttcctggactccatgcggaag ggcctacaggactcctaccccgggccttcgctggacgccgtggccgccttcgcagcgatc ttcgacccccgacgctacccgcaggcgccggaggagctgggcacgcatggcgagggggcg ctgcgggtgctgctgcgcggctttgctcctgccgtggtgcgccagcgggcgctgggcgac ttcgcgctgtttaagcgcgtagtattcggccttgggcggctcggcccgcgggccctgtgc acccagctggcgtgcgcgcactcggagctgcacgagctcttccccgacttcgccgcccta gccgccttggctttggcgctgcccgcgggcgctggcctgctggacaaggtcggccgcagc cgggagctgcggtggtgggggcagagtggggccggggaaggccgggggggccacatggtg aagatcgcagtggatgggcccccgctgcacgagtttgacttcgggttggctgtggagttc ttagagacaggcccggcctccggcgcccccagccccctgctggcctccctgcccctgccc acccggcctctgcagcccccgctggacttcaagcacttgctcgccttccacttcaatggc gctgccccgctcagtctcttccccaacttcagcacgatggacccggtccagaaagctgtc atcagccacacgtttggtgtcccctcccctctgaagaagaagctgttcatttcctgtaac atctgtcacctgaggttcaactcagcgaaccaggccgaggcacattataaaggccacaaa cacgccagaaaactcaaggctgtcgaggctgccaagagcaagcagaggccacacacccag gcccaggatggggctgtagtgtccccaatcccaacgctggccagtggagcccctggagag ccacagagtaaagagcctgggagagaggcaccggggcctgagccagcggcagctgccgtg ggaagcagcatgagtggggaaggcaggagtgagaaggggcacctctactgccccacgtgt aaggtgacagtgaactcggcctcccagcttcaggctcacaacacaggagccaagcaccgg tggatgatggaaggtcagcgaggggctccccggaggagccggggccgcccggtgtccagg ggaggtgccggacacaaagccaagagagtcacagggggccggggcggccggcaggggccc agccctgccttccactgtgctctctgtcagctccaggtcaattcagagacccaactgaag cagcacatgagcagcaggaggcacaaagaccgcctggccgggaagacccccaagccctcc agccagcacagcaagctgcagaagcacgcagcgctggctgtgagtatcctcaagtctaaa ctggccttgcagaagcaactcaccaagacgttggcagcccgcttcctgcccagcccgctc cccaccgcagccactgccatctgtgctctgccagggcccctggccctccgccctgcccct acagcagccactaccctcttcccggctcccatcctgggcccagctctgtttcgcacccca gcaggagctgtccgccctgccacaggacctatcgtccttgccccttattag >gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_7|239_aa MGPPLCVAWSPVGLPQCQADMKRPLSPPPPAEKETPISGAAECLPRPPEPPKPKRERKRP 
SYTLCDVCNIQLNSAAQAQGTLPVGSFGIRTPKQHFSSLEPPGSHRLSDKGLICGVGVPG SSSGLWSLKAKVSKDHQEKQGCLGTELGGLAQIPALEEDKDSVCVSGGCCRSVEPKETAA RASTAQDGGSCVSAKDKGGLDKAARQWCGEKGQVRSGFWRKNQQDLWMEQVLDVEENRD >gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_7|720_bp atggggcctcctctctgtgtggcctggtccccagtaggcctgccccagtgccaggcagat atgaagcggccactgagcccacccccaccggctgagaaggagacccccatatctggagct gctgagtgcctccctcggcccccagaaccacctaagcccaagcgagaaagaaagcggcca tcgtacacgctctgtgatgtctgcaacatccagctgaactcggcggcccaggcccaggga actctccctgtgggatcttttgggatcaggactcctaagcagcacttctccagcctggag cctcctggtagccacaggctttcagacaagggcctcatctgcggggtgggggtcccaggc tccagctcaggtctgtggtctctgaaggccaaggtcagcaaggaccaccaggaaaagcaa ggctgtctgggtacagaattagggggcttagctcagatccctgccctggaagaagacaaa gattctgtctgtgtctctggtggctgctgtcgctccgtggagcccaaggagaccgcggcc agagcgtccaccgcccaagatggtggctcctgtgtctccgcaaaagataaaggtggcctg gacaaggccgccaggcagtggtgtggagagaagggacaagttagaagtggattttggagg aagaatcaacaagacctatggatggagcaggttctggatgtagaagagaacagggattga >gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_8|82_aa MKHAMEQDGIGASAKGLRWWEASGFMICKFHEGPLDFRSLLDPKCSGASTWPVSGAHLVM VQEKVTVGKSGESRVSAAAASF >gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_8|249_bp atgaagcacgccatggaacaggatggcataggggcttcagccaaggggctgaggtggtgg gaagcttcaggcttcatgatttgcaagttccatgagggccctttggacttccgttcactg ctagatcccaagtgcagtggagcctccacgtggcctgtatctggagctcatctggtcatg gtccaggagaaggtgacagtgggaaagtcgggggagtcaagggtgtcagctgctgctgct tctttttga >gi568815581r:41877261_42117416|GENSCAN_predicted_peptide_9|1069_aa XKHKTLALIKDGRVIGGICFRMFPTQGFTEIVFCAVTSNEQVKGYGTHLMNHLKEYHIKH NILYFLTYADEYAIGYFKKQGFSKDIKVPKSRYLGYIKDYEGATLMECELNPRIPYTELS HIIKKQKEVIIKKLIERKQAQIRKVYPGLSCFKEGVRQIPVESVPGIRETGWKPLGKEKG KELKDPDQLYTTLKNLLAQIKSHPSAWPFMEPVKKSEAPDYYEVIRFPIDLKTMTERLRS RYYVTRKLFVADLQRVIANCREYNPPDSEYCRCASALEKFFYFKLKEGGLIDKMELRSYQ WEVIMPALEGKNIIIWLPTGAGKTRAAAYVAKRHLETVDGAKVVVLVNRVHLVTQHGEEF RRMLDGRWTVTTLSGDMGPRAGFGHLARCHDLLICTAELLQMALTSPEEEEHVELTVFSL IVVDECHHTHKDTVYNVIMSQYLELKLQRAQPLPQVLGLTASPGTGGASKLDGAINHVLQ 
LCANLDTWCIMSPQNCCPQLQEHSQQPCKQYNLCHRRSQGLVALFSLWASKRALPCRAEP SSVNSVSRAVLPSTDSRVQYELTQTGYIASGVRACPPNSLSHSGPDVRLGLFLDQSTSMC LTLGITVPRPKGKSTSRDPFGDLLKKLMDQIHDHLEMPELSRKFGTQMYEQQVVKLSEAA ALAGLQEQRVYALHLRRYNDALLIHDTVRAVDALAALQDFYHREHVTKTQILCAERRLLA LFDDRKNELAHLATHGPENPKLEMLEKILQRQFSSSNSPRGIIFTRTRQSAHSLLLWLQQ QQGLQTVDIRAQLLIGAGNSSQSTHMTQRDQQEVIQKFQDGTLNLLVATSVAEEGLDIPH CNVVVRYGLLTNEISMVQARGRARADQSVYAFVATEGSRELKRELINEALETLMEQAVAA VQKMDQAEYQAKIRDLQQAALTKRAAQAAQRENQRQQFPVEHVQLLCINCMVAVGHGSDL RKVEGTHHVNVNPNFSNYYNVSRDPVVINKVFKDWKPGGVISCRNCGEVWGLQMIYKSVK LPVLKVRSMLLETPQGRIQAKKWSRVPFSVPDFDFLQHCAENLSDLSLD >gi568815581r:41877261_42117416|GENSCAN_predicted_CDS_9|3210_bp nngaagcacaagactctggccttgatcaaggatgggcgggtcatcggtggcatctgcttc cgcatgtttcccacccagggcttcacggagattgtcttctgtgctgtcacctcgaatgag caggtcaagggttatgggacccacctgatgaaccacctgaaggagtatcacatcaagcac aacattctctacttcctcacctacgccgacgagtacgccatcggctacttcaaaaagcag ggtttctccaaggacatcaaggtgcccaagagccgctacctgggctacatcaaggactac gagggagcgacgctgatggagtgtgagctgaatccccgcatcccctacacggagctgtcc cacatcatcaagaagcagaaagaggtgatcatcaagaagctgattgagcgcaaacaggcc cagatccgcaaggtctacccggggctcagctgcttcaaggagggcgtgaggcagatccct gtggagagcgttcctggcattcgagagacaggctggaagccattggggaaggagaagggg aaggagctgaaggaccccgaccagctctacacaaccctcaaaaacctgctggcccaaatc aagtctcaccccagtgcctggcccttcatggagcctgtgaagaagtcggaggcccctgac tactacgaggtcatccgcttccccattgacctgaagaccatgactgagcggctgcgaagc cgctactacgtgacccggaagctctttgtggccgacctgcagcgggtcatcgccaactgt cgcgagtacaaccccccggacagcgagtactgccgctgtgccagcgccctggagaagttc ttctacttcaagctcaaggagggaggcctcattgacaaaatggagcttcggtcctaccaa tgggaggtgatcatgcctgccctggagggcaagaatatcatcatctggctgcccacgggt gccgggaagacccgggcggctgcttatgtggccaagcggcacctagagactgtggatgga gccaaggtggttgtattggtcaacagggtgcacctggtgacccagcatggtgaagagttc aggcgcatgctggatggacgctggaccgtgacaaccctgagtggggacatgggaccacgt gctggctttggccacctggcccggtgccatgacctgctcatctgcacagcagagcttctg cagatggcactgaccagccccgaggaggaggagcacgtggagctcactgtcttctccctg 
atcgtggtggatgagtgccaccacacgcacaaggacaccgtctacaacgtcatcatgagc cagtacctagaacttaaactccagagggcacagccgctaccccaggtgctgggtctcaca gcctccccaggcactggcggggcctccaaactcgatggggccatcaaccacgtcctgcag ctctgtgccaacttggacacgtggtgcatcatgtcaccccagaactgctgcccccagctg caggagcacagccaacagccttgcaaacagtacaacctctgccacaggcgcagccagggg ttagtagcattattctctctctgggcgagcaagcgtgccctgccgtgccgagctgaacca agctccgtgaacagcgtcagcagggcagttttaccttctactgacagtagagtccagtat gaacttacacaaacaggttatatagcaagtggagtacgtgcctgccccccaaactcgctg agtcactctggcccggatgtccgcctcggcctattccttgaccaaagcacgtccatgtgc cttacactaggcatcaccgtgcccaggccaaagggtaagagcacatcgcgggatccgttt ggggacttgctgaagaagctcatggaccaaatccatgaccacctggagatgcctgagttg agccggaaatttgggacgcaaatgtatgagcagcaggtggtgaagctgagtgaggctgcg gctttggctgggcttcaggagcaacgggtgtatgcgcttcacctgaggcgctacaatgac gcgctgctcatccatgacaccgtccgcgccgtggatgccttggctgcgctgcaggatttc tatcacagggagcacgtcactaaaacccagatcctgtgtgccgagcgccggctgctggcc ctgttcgatgaccgcaagaatgagctggcccacttggcaactcatggcccagagaatcca aaactggagatgctggaaaagatcctgcaaaggcagttcagtagctctaacagccctcgg ggtatcatcttcacccgcacccgccaaagcgcacactccctcctgctctggctccagcag cagcagggcctgcagactgtggacatccgggcccagctactgattggggctgggaacagc agccagagcacccacatgacccagagggaccagcaagaagtgatccagaagttccaagat ggaaccctgaaccttctggtggccacgagtgtggcggaggaggggctggacatcccacat tgcaatgtggtggtgcgttatgggctcttgaccaatgaaatctccatggtccaggccagg ggccgtgcccgggccgatcagagtgtatacgcgtttgtagcaactgaaggtagccgggag ctgaagcgggagctgatcaacgaggcgctggagacgctgatggagcaggcagtggctgct gtgcagaaaatggaccaggccgagtaccaggccaagatccgggatctgcagcaggcagcc ttgaccaagcgggcggcccaggcagcccagcgggagaaccagcggcagcagttcccagtg gagcacgtgcagctactctgcatcaactgcatggtggctgtgggccatggcagcgacctg cggaaggtggagggcacccaccatgtcaatgtgaaccccaacttctcgaactactataat gtctccagggatcctgtggtcatcaacaaagtcttcaaggactggaagcctgggggtgtc atcagctgcaggaactgtggggaggtctggggtctgcagatgatctacaagtcagtgaag ctgccagtgctcaaagtccgcagcatgctgctggagacccctcaggggcggatccaggcc aaaaagtggtcccgcgtgcccttctccgtgcctgactttgacttcctgcagcattgtgcc gagaacttgtcggacctctccctggactga