GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:23:09 Sequence gi568815597r:158599681_158800652 : 200972 bp : 37.07% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 6759 7700 942 2 0 71 49 198 0.327 10.48 1.02 PlyA + 8847 8852 6 1.05 2.54 PlyA - 10838 10833 6 1.05 2.53 Term - 11709 11584 126 0 0 109 38 78 0.985 2.20 2.52 Intr - 13281 13137 145 2 1 120 57 171 0.999 16.56 2.51 Intr - 14187 14041 147 2 0 120 75 177 0.999 18.03 2.50 Intr - 14626 14573 54 0 0 112 78 63 0.967 4.88 2.49 Intr - 15723 15536 188 0 2 114 11 276 0.998 20.07 2.48 Intr - 17908 17857 52 0 1 90 106 53 0.959 5.29 2.47 Intr - 19654 19542 113 1 2 44 69 76 0.963 -0.44 2.46 Intr - 20786 20490 297 2 0 87 76 298 0.938 24.65 2.45 Intr - 23512 23303 210 1 0 91 9 266 0.979 17.19 2.44 Intr - 24269 24153 117 2 0 79 70 47 0.769 1.74 2.43 Intr - 26542 26466 77 2 2 84 94 73 0.913 5.82 2.42 Intr - 27327 27159 169 0 1 128 52 131 0.909 12.10 2.41 Intr - 28043 27945 99 2 0 59 105 93 0.991 7.49 2.40 Intr - 34995 34863 133 2 1 44 94 156 0.985 11.53 2.39 Intr - 36354 36233 122 0 2 41 117 131 0.738 9.67 2.38 Intr - 37081 36961 121 0 1 91 91 134 0.999 13.58 2.37 Intr - 38561 38353 209 2 2 -12 36 307 0.958 12.45 2.36 Intr - 40006 39902 105 1 0 106 107 87 0.998 11.89 2.35 Intr - 40327 40190 138 1 0 66 76 145 0.999 10.84 2.34 Intr - 42862 42731 132 1 0 104 98 218 0.942 24.32 2.33 Intr - 43015 42990 26 2 2 54 19 46 0.940 -8.87 2.32 Intr - 43296 43190 107 2 2 48 33 177 0.551 7.14 2.31 Intr - 43745 43642 104 2 2 81 71 163 0.999 11.95 2.30 Intr - 44716 44573 144 1 0 76 90 138 0.985 12.36 2.29 Intr - 45705 45508 198 0 0 143 92 258 0.999 30.33 2.28 Intr - 45914 45815 100 1 1 60 98 70 0.931 4.39 2.27 Intr - 48040 47859 182 1 2 75 80 196 0.997 15.24 2.26 Intr - 48973 48829 145 0 1 74 100 222 0.834 21.36 2.25 Intr - 50267 50176 92 2 2 112 90 105 0.994 10.97 2.24 Intr - 51788 51687 102 2 0 66 93 97 0.775 7.45 2.23 Intr - 52973 52787 187 1 1 80 100 263 0.999 25.47 2.22 Intr - 53745 53594 152 0 2 86 100 242 0.999 23.24 2.21 Intr - 55068 54931 138 0 0 103 57 133 0.999 11.44 2.20 Intr - 56976 56884 93 0 0 53 110 60 0.931 3.94 2.19 Intr - 58014 57797 218 1 2 85 74 387 0.997 34.30 2.18 Intr - 61729 61607 123 2 0 90 95 110 0.999 11.54 2.17 Intr - 63265 63022 244 1 1 92 45 227 0.998 14.95 2.16 Intr - 66817 66636 182 2 2 91 72 192 0.923 16.57 2.15 Intr - 68382 68178 205 0 1 90 93 216 0.999 20.15 2.14 Intr - 69883 69728 156 1 0 80 102 202 0.999 20.09 2.13 Intr - 70106 70029 78 2 0 74 63 122 0.936 7.13 2.12 Intr - 71773 71663 111 1 0 109 55 152 0.999 13.66 2.11 Intr - 72516 72379 138 0 0 72 82 164 0.780 13.94 2.10 Intr - 74750 74649 102 2 0 64 84 217 0.999 18.35 2.09 Intr - 74995 74860 136 0 1 122 75 231 0.982 24.85 2.08 Intr - 76615 76461 155 1 2 60 40 140 0.873 4.35 2.07 Intr - 78154 78010 145 0 1 7 90 174 0.999 8.86 2.06 Intr - 78854 78721 134 2 2 102 86 103 0.994 10.02 2.05 Intr - 81049 80903 147 1 0 91 78 218 0.813 20.61 2.04 Intr - 81987 81847 141 0 0 75 69 218 0.993 18.23 2.03 Intr - 83816 83691 126 2 0 36 71 114 0.924 4.46 2.02 Intr - 85667 85428 240 2 0 73 43 250 0.960 15.82 2.01 Init - 86837 86814 24 2 0 90 103 4 0.886 1.69 2.00 Prom - 88353 88314 40 -1.45 3.05 PlyA - 89236 89231 6 1.05 3.04 Term - 95394 94948 447 0 0 42 55 148 0.390 1.13 3.03 Intr - 96591 96472 120 0 0 75 60 68 0.355 2.47 3.02 Intr - 101083 100031 1053 1 0 104 57 424 0.271 30.54 3.01 Init - 118195 117785 411 1 0 78 55 311 0.001 23.36 3.00 Prom - 120827 120788 40 -5.15 4.02 PlyA - 121636 121631 6 1.05 4.01 Sngl - 125140 124433 708 1 0 78 38 393 0.617 28.07 4.00 Prom - 129110 129071 40 -6.45 5.00 Prom + 130080 130119 40 -7.05 5.01 Init + 135576 135638 63 2 0 55 81 74 0.177 4.60 5.02 Intr + 152466 152673 208 2 1 69 31 117 0.011 1.83 5.03 Intr + 153623 153726 104 0 2 39 110 77 0.022 3.97 5.04 Term + 155208 156167 960 2 0 93 37 510 0.149 37.46 5.05 PlyA + 156597 156602 6 1.05 6.03 PlyA - 157631 157626 6 1.05 6.02 Term - 158670 158656 15 0 0 145 37 5 0.510 -1.64 6.01 Init - 163771 162704 1068 1 0 60 42 365 0.391 23.47 6.00 Prom - 164511 164472 40 -8.45 7.02 PlyA - 165322 165317 6 1.05 7.01 Sngl - 167002 166064 939 1 0 71 44 437 0.972 32.95 7.00 Prom - 174175 174136 40 -6.45 8.04 PlyA - 174562 174557 6 1.05 8.03 Term - 177961 177002 960 1 0 75 38 452 0.189 29.96 8.02 Intr - 188343 188196 148 2 1 49 83 101 0.055 5.02 8.01 Init - 197613 197483 131 0 2 79 75 73 0.576 4.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 152355 152203 153 2 0 115 95 33 0.831 4.97 S.002 Init - 152642 152455 188 2 2 65 78 154 0.900 8.91 S.003 Term - 194905 194767 139 0 1 -5 39 197 0.867 1.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:158599681_158800652|GENSCAN_predicted_peptide_1|313_aa MGQTNVTSWRDFVFLGFSSSGELQLLLFALFLSLYLVTLTSNVFIIIAIRLDSHLHTPMY LFLSFLSFSETCYTLGIIPRMLSGLAGGDQAISYVGCAAQMFFSASWACTNCFLLAAMGF DRYVAICAPLHYASHMNPTLCAQLVITSFLTGYLFGLGMTLVIFHLSFCSSHEIQHFFCD TPPVLSLACGDTGPSELRIFILSLLVLLVSFFFITISYAYILAAILRIPSAEGQKKAFST CASHLTVVIIHYGCASFVYLRPKASYSLERDQLIAMTYTVVTPLLNPIVYSLRNRAIQTA LRNAFRGRLLGKG >gi568815597r:158599681_158800652|GENSCAN_predicted_CDS_1|942_bp atggggcagaccaacgtaacctcctggagggattttgtcttcctgggcttctccagttct ggggagttgcagctccttctctttgccttgttcctctctctgtatctagtcactctgacc agcaatgtcttcattatcatagccatcaggctggatagccatctgcacacccccatgtac ctcttcctttccttcctatccttctctgagacctgctacactttgggcatcatccctaga atgctctctggcctggctgggggggaccaggctatctcctatgtgggctgtgctgcccag atgttcttttctgcctcatgggcctgtactaactgcttccttctggctgccatgggcttt gacagatatgtggccatctgtgctccactccactatgccagccacatgaatcctaccctc tgtgcccagctggtcattacttccttcctgactggatacctctttggactgggaatgaca ctagttattttccacctctcattctgcagctcccatgaaatccagcactttttttgtgac acgccacctgtgctgagcctagcctgtggagatacaggcccgagtgagctgaggatcttt atcctcagtcttttggtcctcttggtctccttcttcttcatcaccatctcctacgcctac atcttggcagcaatactgaggatcccctctgctgaggggcagaagaaggccttctccact tgtgcctcgcaccttacagtggtcattattcattatggctgtgcttccttcgtgtacctg aggcccaaagccagctactctcttgagagagatcagcttattgccatgacctatactgta gtgacccccctccttaatcccattgtttatagtctaaggaatagggctatacagacagct ctgaggaatgctttcagagggagattgctgggtaaaggatga >gi568815597r:158599681_158800652|GENSCAN_predicted_peptide_2|2442_aa MEQFPKETVVESSGPKVLETAEEIQERRQEVLTRYQSFKERVAERGQKLEDSYHLQVFKR DADDLGKWIMEKVNILTDKSYEDPTNIQGKYQKHQSLEAEVQTKSRLMSELEKTREERFT MGHSAHEETKAHIEELRHLWDLLLELTLEKGDQLLRALKFQQYVQECADILEWIGDKEAI ATSVELGEDWERTEVLHKKFEDFQVELVAKEGRVVEVNQYANECAEENHPDLPLIQSKQN EVNAAWERLRGLALQRQKALSNAANLQRFKRDVTEAIQWIKEKEPVLTSEDYGKDLVASE GLFHSHKGLERNLAVMSDKVKELCAKAEKLTLSHPSDAPQIQEMKEDLVSSWEHIRALAT SRYEKLQATYWYHRFSSDFDELSGWMNEKTAAINADELPTDVAGGEVLLDRHQQHKHEID SYDDRFQSADETGQDLVNANHEASDEVREKMEILDNNWTALLELWDERHRQYEQCLDFHL FYRDSEQVDSWMSRQEAFLENEDLGNSLGSAEALLQKHEDFEEAFTAQEEKIITVDKTAT KLIGDDHYDSENIKAIRDGLLARRDALREKAATRRRLLKESLLLQKLYEDSDDLKNWINK KKKLADDEDYKDIQNLKSRVQKQQVFEKELAVNKTQLENIQKTGQEMIEGGHYASDNVTT RLSEVASLWEELLEATKQKGTQLHEANQQLQFENNAEDLQRWLEDVEWQVTSEDYGKGLA EVQNRLRKHGLLESAVAARQDQVDILTDLAAYFEEIGHPDSKDIRARQESLVCRFEALKE PLATRKKKLLDLLHLQLICRDTEDEEAWIQETEPSATSTYLGKDLIASKKLLNRHRVILE NIASHEPRIQEITERGNKMVEEGHFAAEDVASRVKSLNQNMESLRARAARRQNDLEANVQ FQQYLADLHEAETWIREKEPIVDNTNYGADEEAAGALLKKHEAFLLDLNSFGDSMKALRN QANACQQQQAAPVEGVAGEQRVMALYDFQARSPREVTMKKGDVLTLLSSINKDWWKVEAA DHQGIVPAVYVRRLAHDEFPMLPQRRREEPGNITQRQEQIENQYRSLLDRAEERRRRLLQ RYNEFLLAYEAGDMLEWIQEKKAENTGVELDDVWELQKKFDEFQKDLNTNEPRLRDINKV ADDLLFEGLLTPEGAQIRQELNSRWGSLQRLADEQRQLLGSAHAVEVFHREADDTKEQIE KKCQALSAADPGSDLFSVQALQRRHEGFERDLVPLGDKVTILGETAERLSESHPDATEDL QRQKMELNEAWEDLQGRTKDRKESLNEAQKFYLFLSKARDLQNWISSIGGMVSSQELAED LTGIEILLERHQEHRADMEAEAPTFQALEDFSAELIDSGHHASPEIEKKLQAVKLERDDL EKAWEKRKKILDQCLELQMFQGNCDQVESWMVARENSLRSDDKSSLDSLEALMKKRDDLD KAITAQEGKITDLEHFAESLIADEHYAKEEIATRLQRVLDRWKALKAQLIDERTKLGDYA NLKQFYRDLEELEEWIKTSESDVEQRKYLKHQTFAHEVDGRSEQVHGVINLGNSLIECSA CDGNEEAMKEQLEQLKEHWDHLLERTNDKGKKLNEASRQQRFNTSIRDFEFWLSEAETLL AMKDQARDLASAGNLLKKHQLLEREMLAREDALKDLNTLAEDLLSSGTFNVDQIVKKKDN VNKRFLNVQELAAAHHEKLKEAYALFQFFQDLDDEESWIEEKLIRVSSQDYGRDLQGVQN LLKKHKRLEGELVAHEPAIQNVLDMAEKLKDKAAVGQEEIQLRLAQFVEHWEKLKELAKA RGLKLEESLEYLQFMQNAEEEEAWINEKNALAVRGDCGDTLAATQSLLMKHEALENDFAV HETRVQNVCAQGEDILNKVLQEESQNKEISSKIEALNEKTPSLAKAIAAWKLQLEDDYAF QEFNWKADVVEAWIADKETSLKTNGNGADLGDFLTLLAKQLEQLNAEHQVTRLHKSGSPG PSPGTPTCYPVPKSLPHFQDTLDASLQSFQQERLPEITDLKDKLISAQHNQSKAIEERYA ALLKRWEQLLEASAVHRQKLLEKQLPLQKAEDLFVEFAHKASALNNWCEKMEENLSEPVH CVSLNEIRQLQKDHEDFLASLARAQADFKCLLELDQQIKALGVPSSPYTWLTVEVLERTW KHLSDIIEEREQELQKEEARQVKNFEMCQEFEQNASTFLQWILETRSLLKETGTLESQLE ANKRKQKEIQAMKRQLTKIVDLGDNLEDALILDIKYSTIGLAQQWDQLYQLGLRMQHNLE QQIQAKDIKGVSEETLKEFSTIYKHFDENLTGRLTHKEFRSCLRGLNYYLPMVEEDEHEP KFEKFLDAVDPGRKGYVSLEDYTAFLIDKESENIKSSDEIENAFQALAEGKSYITKEDMK QALTPEQVSFCATHMQQYMDPRGRSHLSGYDYVGFTNSYFGN >gi568815597r:158599681_158800652|GENSCAN_predicted_CDS_2|7329_bp atggagcaatttccaaaggaaaccgttgtggagagcagtgggccaaaggttttggaaaca gcagaagagatccaggagaggcgtcaggaagtgttgactcggtatcaaagtttcaaggag cgggtcgctgagaggggtcagaagcttgaggattcctatcacttacaagttttcaagcga gatgcagatgatctggggaagtggatcatggagaaagtcaatatcttaaccgataagagc tatgaagacccaactaatatacaggggaaatatcagaagcatcaatcccttgaagcagag gtgcaaacaaaatcaagactcatgtctgaactggaaaaaacaagggaagaacgatttacc atgggtcattctgcccacgaagaaacgaaggcccatatagaggagctacgccacctgtgg gacctgctgttagagctgaccctggagaagggtgaccagttgctgcgggccctgaagttc cagcagtatgtacaggagtgtgctgacatcttagagtggattggagacaaggaggctata gcgacatcagtggagctaggtgaagactgggagcgcaccgaagttctgcataagaaattt gaagacttccaagtggagctggtagctaaagaagggagagttgttgaagtgaaccaatat gccaatgagtgtgccgaggaaaaccatcctgacctacccttaattcagtctaagcaaaat gaggtgaatgctgcctgggagcgccttcgtggtttggctctccagagacagaaagctctg tccaatgctgcaaacttacaacgattcaaaagggatgtgactgaagccatccagtggatc aaggagaaggaacctgtactcacctctgaggactatggcaaagaccttgttgcctctgaa ggactgtttcacagtcacaagggacttgagagaaatcttgctgtcatgagtgacaaggtg aaggagttatgtgctaaagcagagaagctgacactttcccatccttcagatgcacctcag atccaggagatgaaagaagatctggtctccagctgggagcatattcgtgccctggccacc agcagatatgaaaaactgcaggctacttattggtaccatcgattttcatctgactttgat gaactctcaggctggatgaacgagaagactgctgcgatcaatgctgatgagctgccaaca gatgtggctggtggagaagttctgctggacaggcatcagcagcataagcatgagattgac tcttacgatgaccgatttcaatctgctgatgagactggtcaagacctcgtgaatgccaat catgaagcctctgatgaagttcgggaaaagatggaaatacttgacaacaactggactgcc ctgctggaactgtgggacgagcgtcatcgtcagtatgagcagtgcttggactttcatctc ttctacagagacagtgagcaagtggacagttggatgagtagacaagaggccttcctggaa aacgaggatctgggaaactcactgggcagtgcagaagcccttcttcagaagcatgaagac tttgaggaagcctttactgcccaggaagagaagatcataactgtagacaagactgcaacc aaattgattggtgatgaccattatgattcagagaacatcaaggctatccgtgacgggctg ttagcccggcgggatgccctacgtgaaaaggctgccactagacgtagattgctgaaggag tcattgcttctgcaaaaactgtatgaggactcagatgacctaaagaactggatcaacaag aagaaaaagttggcagatgatgaagattacaaggacatacagaacttgaagagcagggtt caaaagcagcaagtctttgaaaaggagttggcagttaataagacccagctggaaaacata cagaaaactggccaagagatgattgagggtggtcactatgcctctgacaatgtgaccact cgtctgagtgaagttgccagcctctgggaggagttgctggaggctacaaaacagaaaggg acccagttgcatgaggccaaccagcagctgcaatttgaaaataatgcagaagatttgcag cgctggctggaggatgttgagtggcaagtcacctctgaggattatgggaaaggcctggcc gaggtacagaatcgactcaggaaacacggcctcctggagtcggctgtggctgctcgtcag gatcaggtggatatccttacagacctggctgcatattttgaagaaataggccatcctgat tctaaggatataagggcaaggcaagagtccttggtatgccgatttgaagctctgaaagag ccactggccacccgaaagaagaagctcttagaccttctccatctgcagctgatttgtaga gacacagaggatgaggaggcctggatccaagagactgaaccctcagctacttccacctac cttggaaaggacctgattgcttccaaaaagcttctgaataggcatagagtcatcctggag aacattgccagccatgaaccacgcattcaagagataacagaaaggggaaacaaaatggta gaggaaggacactttgctgcagaagatgtggcctctagggtcaagagtttgaaccagaat atggagtctctccgtgctcgagctgctaggcgacaaaatgatcttgaagccaatgtccag ttccagcagtacctggctgacctgcatgaagcagaaacatggatcagagagaaggaacct attgtagataatactaactatggtgctgatgaagaagcagctggggctcttctaaagaag catgaggcctttctattagatctcaattcatttggagacagtatgaaagctctgcggaat caggcaaacgcctgccagcaacaacaggctgcaccagtggagggagttgctggagaacaa agggtcatggctttatatgacttccaggcccgcagcccccgagaagtcaccatgaagaaa ggtgatgtcttaacgctgctcagttccatcaataaggactggtggaaggtggaagctgct gatcatcagggcattgtcccagctgtctatgtcagaagactggcccacgatgagttcccg atgctcccacagcggcgacgagaagagccaggaaacatcacccagcgccaggagcagatt gagaaccaataccgctccctcttggatcgggcagaagaacgcagacgtcgtctattgcaa cgttataatgaatttttattggcctatgaggcaggagacatgctggaatggattcaagag aaaaaggcagaaaacactggagtggaactagatgatgtttgggagctgcagaaaaagttt gatgagttccaaaaggatttgaataccaatgagcctcggctaagggatatcaacaaggta gctgatgatctactatttgaaggacttctaacaccagaaggagctcaaatccggcaggaa ttgaattcccgctggggttctttgcagaggcttgcagatgaacagcggcagctgctgggc agtgcccatgctgttgaagtgtttcacagagaagcagatgacacgaaggagcagattgag aagaaatgccaggccctcagtgctgcagaccctggctcagatctgttcagtgttcaggct cttcagcgacggcatgagggctttgaaagggacctcgtacccctgggagataaggtgacc atactgggggagacagcagagcggctcagtgagtcccatccagatgccactgaggacctg cagagacagaaaatggagctgaatgaggcctgggaagacctgcaggggcgtacaaaggat cgtaaggagagcctaaatgaggcccagaaattctacctgttcctcagcaaggccagggat ctgcagaactggatcagtagcattggtggcatggtatcatcacaggagctggccgaagac ttaactggcatagagatcttgctggagagacatcaggagcaccgtgctgacatggaggca gaggctcccaccttccaggccttagaggacttcagtgcagaacttatcgacagtgggcac catgctagccctgaaattgaaaaaaagcttcaagctgtcaagctagagagagatgatttg gagaaggcttgggaaaaacgcaagaagatcctagaccagtgcctggagttgcagatgttc caggggaactgtgatcaagttgagagctggatggtggcacgtgagaattccctgaggtca gatgacaaaagttccttagacagtctggaggctttgatgaagaaacgggacgatttggac aaagcaatcactgcccaggaagggaagatcactgacctagaacattttgctgagagcctc attgctgatgaacactatgccaaagaagagattgctacgcggctccaacgtgtactagac aggtggaaggctctcaaagcacaactgattgatgagcggacaaagcttggagactatgcc aacctaaaacaattctaccgagaccttgaggagctggaagaatggatcaagacttcagag tctgatgtggagcagaggaaatacctgaaacaccagacctttgcacatgaagtcgatggc cgatctgagcaggtgcatggcgtcatcaacctggggaactccctgattgagtgtagcgct tgtgatggcaatgaagaggccatgaaggagcaactggaacagctgaaggaacattgggat catctgcttgagagaacaaatgacaaagggaagaagctcaatgaggccagtcgtcaacag aggttcaacacaagcatccgggactttgagttctggctctcagaggcagagacattgctg gccatgaaagatcaggccagggacttggcttcagcaggaaacctactcaagaagcatcag ctattggagagagagatgttggctcgagaggatgcactcaaggacctgaatacattggct gaagatttgctctccagcgggactttcaacgttgatcagattgtgaagaaaaaagataat gtcaacaagcgtttcctgaatgtccaagaattggcagctgcacaccacgaaaaattgaaa gaggcctatgccttgttccagttcttccaggatctagatgatgaggaatcctggatagag gagaagttgatacgagtgagctcccaggactatgggagagatcttcagggggttcagaac ttgctgaagaagcacaaacgcctagagggggagctggtggcccatgagcctgccatccag aatgtgctggatatggcagagaagctgaaagacaaggctgctgtggggcaagaggagatc cagttgcggctggctcagtttgttgaacactgggagaagctcaaagagttggccaaggcc cgaggacttaagttggaagaatccctagaatacttgcaattcatgcagaatgctgaggaa gaggaagcttggatcaatgaaaagaatgctttggctgtccgaggagattgtggagataca ttagctgctactcagagcttgctaatgaagcatgaagctttggaaaatgactttgctgtc catgagacccgagtacaaaatgtgtgtgcacaaggagaagacatcctaaataaggtgttg caggaggaaagtcagaacaaagagatttcttccaagatagaggctctgaatgaaaagacc ccttctctggctaaggcaatagctgcttggaagttgcaattggaagacgattatgccttt caggaattcaactggaaggctgatgtggtagaggcttggatagctgataaggaaacaagc ctaaagaccaatggcaatggtgcagaccttggtgacttcctcactcttctggcaaaacag ctagaacagctgaatgcagagcaccaagtcactaggctacacaaatcagggagccctggg cccagcccaggaacccctacctgttacccggttccaaagtcacttccacattttcaggac actctggatgccagtctgcagagtttccagcaagagagacttcccgagatcactgacctg aaggacaaactgatttctgctcaacacaaccagtctaaagccattgaagagcgttatgcc gctctgctgaagcgctgggaacagttgctggaagcctcggcagtccacagacagaaattg ctggagaaacagctgcctctacagaaggctgaggacctgttcgtggaatttgcacataag gcttcagctttgaacaactggtgtgaaaagatggaagaaaacttgtcagagcctgtgcac tgtgtctccctgaatgaaattcggcagctgcagaaagaccatgaggacttcttggcctcc ctggctagggctcaagcagactttaaatgtttgctggagctagaccagcagattaaggcc ttaggtgtgccttccagcccttatacctggttaacagtggaggtgctggaaaggacctgg aagcacctatctgacatcattgaggaacgggagcaggagctgcaaaaggaagaggcaaga caggtcaagaactttgagatgtgtcaggagtttgaacagaatgccagtaccttccttcaa tggatcctggaaaccagatcattgctcaaagaaacaggaactctggaatctcagctggaa gcaaataaaagaaaacagaaggagatccaggcgatgaagcgtcaactaaccaagattgtg gacctgggggacaacttggaagacgctctgatccttgatatcaaatacagcaccattgga ttggctcagcagtgggaccagctctaccagcttgggttgcggatgcaacacaacctggag caacagatccaggccaaggacatcaaaggtgtgagtgaagagactctaaaggaatttagc acaatctataaacactttgatgagaatttgacagggcgcctgactcacaaagagttccgg tcctgcctgagaggactcaattactacttgcccatggtggaggaggatgaacatgagccc aagtttgagaagttcctggatgctgtggatccagggaggaagggctatgtctcactggag gactatactgctttcctgattgacaaggagtcagaaaacatcaagtccagtgatgaaata gagaatgccttccaagccctggcagagggcaagtcatatattaccaaagaagacatgaag caggcccttaccccagagcaagtgtcattctgtgccacacatatgcagcaatatatggac ccacggggtcgaagccatctctctggctatgactacgttggcttcaccaattcctacttt ggcaactaa >gi568815597r:158599681_158800652|GENSCAN_predicted_peptide_3|676_aa MLSNLISEKKAISMTGCILQMYFFHSLENSEGILLTTMAIDRYVAICNPLRYQMIMTPRL CAQLSAGSCLFGFLILLPEIVMISTLPFCGPNQIHQIFCDLVPVLSLACTDTSMILIEDV IHAVTIIITFLIIALSYGSIQTLASCKCIPLSNSRCLLEGLSALEIETLTFNLEMESPNR TTIQEFIFSAFPYSWVKSVVCFVPLLFIYAFIVVGNLVIITVVQLNTHLHTPMYTFISAL SFLEIWYTTATIPKMLSSLLSERSISFNGCLLQMYFFHSTGICEVCLLTVMAFDHYLAIC SPLHYPSIMTPKLCTQLTLSCCVCGFITPLPEIAWISTLPFCGSNHLEHIFCDFLPVLRL ACTDTRAIVMIQVVDVIHAVEIITAVMLIFMSYDGIVAVILRIHSAGGRRTAFSTCVSHF IVFSLFFGSVTLMYLRFSATYSLFWDIAIALAFAVLSPFFNPIIYSLRNKEIKEAIKKHI GQAKIFFSGTEILKDVLKNLLCTRHTLPSLHCGIYGITVQFQAFPLLLVILGCCICGFFT LLPEIAWISTLPFCGPNQIHNIFCDLDPILNLACVDTGPVVLIKVVDIVHAVEIITAIML VTLAYVQIIAVILRNCSADGCQKAFSTYAFHLAIFLIFFGSVALMYLLFSAKYSFFWDTT ISLMFAVLSPTQSSVV >gi568815597r:158599681_158800652|GENSCAN_predicted_CDS_3|2031_bp atgctctccaacctcatcagtgaaaagaaggccatctcaatgactggctgcatcttgcag atgtatttcttccactcacttgaaaactcagaggggatcttgctgaccaccatggccatt gacagatacgttgccatctgcaaccctcttcgctatcaaatgatcatgaccccccggctc tgtgctcaactctctgcaggttcctgcctcttcggtttccttatcctgcttcccgagatt gtgatgatttccacactgcctttctgtgggcccaaccaaatccatcagatcttctgtgac ttggtccctgtgctaagcctggcctgtacagacacgtccatgattctgattgaggatgtg attcatgctgtgaccatcatcattaccttcctaatcattgccctgtcctatggctctatt cagacgctggcttcttgtaagtgtattcctttatccaatagtagatgcctcctagaaggc ttgagtgcactggaaattgaaactctcactttcaacttggagatggagagccccaatcga accaccattcaggagtttatcttctccgctttcccttattcctgggttaagtctgttgtc tgctttgttccactgctcttcatctatgctttcattgttgttggaaacctggtcatcatc acagtggtccagttgaatactcacctccacactcccatgtatacttttatcagtgctctt tctttcctggagatttggtataccacagccacaatcccaaagatgctgtctagcctgctt agtgagaggagcatttccttcaatggttgtctcctgcagatgtatttcttccattccacc ggcatctgtgaggtgtgtctcttgacagttatggcctttgaccactacctggccatatgc agccctcttcattatccctctatcatgacccccaagctatgtacccaactgactttaagt tgctgtgtttgtggctttatcacaccccttcctgagattgcctggatctctacactgcca ttttgtggttcgaatcaccttgaacatatcttctgtgacttcctcccagtgctgcgtctg gcctgcacagacacacgagccatcgtcatgattcaggtagtggatgtcattcatgcagtg gagattattacagctgtgatgctcatcttcatgtcctacgatggtattgtggctgtaatt ctacgtattcattcagctggaggccgccgcacagcattttccacgtgtgtctctcacttc attgtcttttcgctcttctttggcagtgtgactctcatgtacctacgcttctctgccacc tactctttgttctgggatatagccattgctctggcctttgcagttttgtctcccttcttc aaccccattatctatagcctgaggaataaagaaataaaagaagctataaaaaagcacata ggtcaagctaagatatttttttccggcacggaaatactaaaagacgtgctaaagaatctc ctgtgtaccagacatactcttccttcccttcactgtggaatctatggaataacagtccaa tttcaagcatttcctctgcttttggtgattctaggttgttgcatctgtggcttcttcacg ctgctccctgagattgcttggatatccacactgccattttgtggtccaaatcaaatccac aacattttctgtgaccttgatcctatcctgaatctagcatgtgtagacactggcccagtt gttttaatcaaggttgtggacattgtacatgctgtggagatcatcacagctataatgctt gtgactttggcttacgtccaaattattgcagtgatcctaagaaactgctctgctgatgga tgccaaaaggcattttctacctatgctttccaccttgctattttcttaatcttttttgga agtgtagccctgatgtacctgctcttctctgccaagtactcctttttctgggacacaacc atcagcctaatgtttgcagtgctgtcaccgacacaatcatctgtagtctga >gi568815597r:158599681_158800652|GENSCAN_predicted_peptide_4|235_aa MLSILISRQRTISMVGCLLQMYFFHSLGNSEGILLTTMAIDRYVAICNPLRYPTIMTPGL CVQLSVGSCIFGFLVLLPEIAWISTLPFCGPNQIHQIFCDFEPVLRLACTDTSMILIEDV IHAVAIVFSVLIIALSYIRIITVILRIPSVEGRQKAFSTCAAHLSVFLMFYGSVSLMYLR FSATFPPILDTAVALMFAVLAPFFNPIIYSFRNKDMKIAIKKLFCPQKMVNLSVD >gi568815597r:158599681_158800652|GENSCAN_predicted_CDS_4|708_bp atgctctccatcctcatcagcaggcagaggaccatctccatggttggctgcctcttgcag atgtacttcttccattcactgggaaattcagaggggattttgttgaccaccatggccatt gataggtacgttgccatctgtaaccctctccgctacccaaccatcatgacccccgggctc tgtgttcagctctctgtggggtcctgcatctttggctttcttgtgttgctcccagagatt gcatggatttccacactgcccttctgtggacccaaccaaatccaccagatcttctgtgat tttgaacctgtgctgcgcttggcctgtacagacacgtccatgattctgattgaggatgtg atccatgctgtggccattgtattctctgtcctgattattgccctttcttatatcagaatc atcactgtaatcctgaggattccctctgttgaaggccgccagaaggccttttctacctgt gccgcccatcttagtgtctttctgatgttctatggcagtgtatccctcatgtacctgcgt ttctctgccactttcccaccgattttggacacagctgttgcactgatgtttgcagttctt gctccctttttcaaccctatcatctatagctttagaaataaggacatgaagattgcaatt aaaaagcttttctgccctcagaagatggttaatttatctgtagattaa >gi568815597r:158599681_158800652|GENSCAN_predicted_peptide_5|444_aa MGKAAPPTPGSVQDQCIAACEIGVDIDSREKPGSRSRHFQTCKRRGTFPGPQERRGAWVR SRGLGGYTCMEEWEAMIHSHDLGGCSPNLLVIPGRGFRDICTLIDRLGEEVAAEILRSIL MVSGRMTQLTASGNQTMVTEFLFSMFPHAHRGGLLFFIPLLLIYGFILTGNLIMFIVIQV GMALHTPLYFFISVLSFLEICYTTTTIPKMLSCLISEQKSISVAGCLLQMYFFHSLGITE SCVLTAMAIDRYIAICNPLRYPTIMIPKLCIQLTVGSCFCGFLLVLPEIAWISTLPFCGS NQIHQIFCDFTPVLSLACTDTFLVVIVDAIHAAEIVASFLVIALSYIRIIIVILGMHSAE GHHKAFSTCAAHLAVFLLFFGSVAVMYLRFSATYSVFWDTAIAVTFVILAPFFNPIIYSL KNKDMKEAIGRLFHYQKRAGWAGK >gi568815597r:158599681_158800652|GENSCAN_predicted_CDS_5|1335_bp atggggaaagctgctcctcctacaccaggctctgtgcaggaccaatgcattgcagcctgt gaaattggagtggacattgacagcagggagaagccaggcagcaggagcaggcactttcaa acctgcaagagaagggggaccttcccaggaccccaggagcgcagaggtgcctgggtccgc agccgtggtttgggtggctacacttgcatggaggagtgggaggccatgatccacagtcat gacttgggtggctgtagcccgaacctcctggttattcctggaagaggcttcagagacatt tgtacccttattgacaggctaggagaagaggtggctgcagagattttgaggagcattctc atggtgtcaggcaggatgacacagttgacggccagtgggaatcagacaatggtgactgag ttcctcttctctatgttcccgcatgcgcacagaggtggcctcttattctttattcccttg cttctcatctacggatttatcctaactggaaacctaataatgttcattgtcatccaggtg ggcatggccctgcacacccctttgtatttctttatcagtgtcctctccttcctggagatc tgctataccacaaccaccatccccaagatgctgtcctgcctaatcagtgagcagaagagc atttccgtggctggctgcctcctgcagatgtactttttccactcacttggtatcacagaa agctgtgtcctgacagcaatggccattgacaggtacatagctatctgcaatccactccgt tacccaaccatcatgattcccaaactttgtatccagctgacagttggatcctgcttttgt ggcttcctccttgtgcttcctgagattgcatggatttccaccttgcctttctgtggctcc aaccagatccaccagatattctgtgatttcacacctgtgctgagcttggcctgcacagat acattcctagtggtcattgtggatgccatccatgcagcggaaattgtagcctccttcctg gtcattgctctatcctacatccggattattatagtgattctgggaatgcactcagctgaa ggtcatcacaaggccttttccacctgtgctgctcaccttgctgtgttcttgctatttttt ggcagtgtggctgtcatgtatttgagattctcagccacctactcagtgttttgggacaca gcaattgctgtcacttttgttatccttgctccctttttcaaccccatcatctatagcctg aaaaacaaggacatgaaagaggctattggaaggcttttccactatcagaagagggctggt tgggctgggaaatag >gi568815597r:158599681_158800652|GENSCAN_predicted_peptide_6|360_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENFKALLNKIKEDTNKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPVTFFTELEKTTLKFIWNQKRACITKSILSQKNKAGGIMLP DFKLYYKATVTKTAWCWYQNRDIDQWNRTETSEITPHIYNYLIFDKPEKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGITIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISTI YNELKQMYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTALIH >gi568815597r:158599681_158800652|GENSCAN_predicted_CDS_6|1083_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaacttcaaagcactgctcaacaaaataaaa gaggacacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca gtgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcctgcatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctaccc gacttcaaactgtactacaaggctacagtaaccaaaacagcatggtgctggtaccaaaac agagatatagatcaatggaacagaacagagacctcagaaataacgccgcatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagacttaaacattagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaacatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccacaatc tacaatgaactcaaacaaatgtacaagaaaaaaacaaacaaccccatcaaaaagtgggcg aaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaa aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacagccttgatccac tga >gi568815597r:158599681_158800652|GENSCAN_predicted_peptide_7|312_aa MDTGNWSQVAEFIILGFPHLQGVQIYLFLLLLLIYLMTVLGNLLIFLVVCLDSRLHTPMY HFVSILSFSELGYTAATIPKMLANLLSEKKTISFSGCLLQIYFFHSLGATECYLLTAMAY DRYLAICRPLHYPTLMTPTLCAEIAIGCWLGGLAGPVVEISLISRLPFCGPNRIQHVFCD FPPVLSLACTDTSINVLVDFVINSCKILATFLLILCSYVQIICTVLRIPSAAGKRKAIST CASHFTVVLIFYGSILSMYVQLKKSYSLDYDQALAVVYSVLTPFLNPFIYSLRNKEIKEA VRRQLKRIGILA >gi568815597r:158599681_158800652|GENSCAN_predicted_CDS_7|939_bp atggacacagggaactggagccaggtagcagaattcatcatcttgggcttcccccatctc cagggtgtccagatttatctcttcctcttgttgcttctcatttacctcatgactgtgttg ggaaacctgctgatattcctggtggtctgcctggactcccggcttcacacacccatgtac cactttgtcagcattctctccttctcagagcttggctatacagctgccaccatccctaag atgctggcaaacttgctcagtgagaaaaagaccatttcattctctgggtgtctcctgcag atctatttctttcactcccttggagcgactgagtgctatctcctgacagctatggcctac gataggtatttagccatctgccggcccctccactacccaaccctcatgaccccaacactt tgtgcagagattgccattggctgttggttgggaggcttggctgggccagtagttgaaatt tccttgatttcacgcctcccattctgtggccccaatcgcattcagcacgtcttttgtgac ttccctcctgtgctgagtttggcttgcactgatacgtctataaatgtcctagtagatttt gttataaattcctgcaagatcctagccaccttcctgctgatcctctgctcctatgtgcag atcatctgcacagtgctcagaattccctcagctgccggcaagaggaaggccatctccacg tgtgcctcccacttcactgtggttctcatcttctatgggagcatcctttccatgtatgtg cagctgaagaagagctactcactggactatgaccaggccctggcagtggtctactcagtg ctcacacccttcctcaaccccttcatctacagcttgcgcaacaaggagatcaaggaggct gtgaggaggcagctaaagagaattgggatattggcatga >gi568815597r:158599681_158800652|GENSCAN_predicted_peptide_8|412_aa MDSQLSQHLGFPGRRPVPASTHRKVHKYLNGEKFLKNDRDKGGRRLSISTVITALTRSAQ IISAIILSDKGHCLRLLSSRMPGGYKALKVSVLPPMDQYNHSSLAEFVFLGFASVGYVRG WLFVLLLLAYLFTICGNMLIFSVIRLDAALHTPMYHFVSVLSFLELWYTATTIPKMLSNI LSEKKTISFAGCLLQTYFFHSLGASECYLLTAMAYDRYLAICRPLHYPIIMTTTLCAKMA AACWTCGFLCPISEVILASQLPFCAYNEIQHIFCDFPPLLSLACKDTSANILVDFAINAF IILITFFFIMISYARIIGAVLKIKTASGRKKAFSTCASHLAVVLIFFGSIIFMYVRLKKS YSLTLDRTLAIVYSVLTPMVNPIIYSLRNKEIIKAIKRTIFQKGDKASLAHL >gi568815597r:158599681_158800652|GENSCAN_predicted_CDS_8|1239_bp atggacagccagctttctcaacatcttgggtttcctggcaggagacctgttcctgcctca acacacagaaaggtgcacaagtacttgaatggagaaaaattcctgaagaatgacagagac aaagggggcaggaggcttagcatttccactgtgatcactgcccttaccaggtctgctcaa attatctctgccatcatcctgtcagacaaaggacactgcctgcgtcttctgagttccagg atgcctggtggatataaggctctaaaagtctctgtattacctcccatggatcaatacaac cattcaagcctggctgaatttgtgttccttggctttgccagtgtgggctatgtcaggggc tggctttttgtcctgctgctattggcatacctgttcaccatctgtggtaacatgctcatc ttctcagtcatccgactggatgcagctctgcacacacctatgtaccactttgtcagtgtt ctttccttcttggagttgtggtatacagctaccactatccctaagatgttgtctaatatt ctcagtgagaagaaaaccatttcttttgcaggatgcctccttcagacctacttcttccac tccttgggagcgtctgaatgctaccttcttacagccatggcctatgatagatacctggcc atttgtcggcccctccactaccctataattatgaccaccacactctgtgccaagatggct gctgcttgttggacttgtggcttcctgtgtcccatttctgaggtcatccttgcctcccag ctcccattttgtgcttacaatgaaatccaacacattttctgtgactttccacctttgctg agcttggcctgcaaggacacatctgctaacattctggtggactttgccattaatgctttc ataattcttatcactttcttctttatcatgatttcttatgcaaggatcattggggctgtg ctgaagataaaaacagcatcaggaagaaagaaggccttttctacctgtgcctcacatctt gctgtggtcctcatcttctttgggagcatcatcttcatgtatgtgcggctaaagaagagc tattccctgacccttgaccgaacacttgctatagtttactccgtactaacaccaatggtc aatccaattatctacagtcttcgtaacaaggaaatcattaaagctatcaagaggaccatc ttccagaagggagataaagctagtcttgctcatctttga