GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:11:40

Sequence gi568815594r:120595114_121022608 : 427495 bp : 37.48% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    812    807    6                               1.05
 1.03 Term -  19152  19053  100  2  1   90   42    62 0.636  -1.58
 1.02 Intr -  22821  22690  132  2  0   18   63   136 0.017   2.94
 1.01 Init -  49868  48742 1127  2  2   60   53   331 0.473  20.50
 1.00 Prom -  50011  49972   40                              -9.25

 2.02 PlyA -  50076  50071    6                               1.05
 2.01 Sngl -  52107  51451  657  0  0   71   43   403 0.931  30.02
 2.00 Prom -  53071  53032   40                              -7.45

 3.00 Prom +  54972  55011   40                              -7.95
 3.01 Sngl +  56271  57473 1203  2  0   70   54   515 0.805  41.88
 3.02 PlyA +  57826  57831    6                               1.05

 4.05 PlyA -  57986  57981    6                               1.05
 4.04 Term -  72920  72616  305  0  2   43   48   181 0.483   3.95
 4.03 Intr -  78982  78831  152  0  2   71   59    65 0.061   0.79
 4.02 Intr -  79206  79118   89  0  2   77   53    68 0.023   0.05
 4.01 Init -  93458  93327  132  2  0   64   33    88 0.062   1.09
 4.00 Prom -  97389  97350   40                              -4.25

 5.06 PlyA -  98346  98341    6                               1.05
 5.05 Term - 100162  99998  165  1  0   81   38   195 0.997  10.73
 5.04 Intr - 115300 115196  105  1  0   50   76   164 0.974  10.89
 5.03 Intr - 127766 127630  137  0  2   99   19   102 0.390   3.77
 5.02 Intr - 128254 128157   98  0  2   96   38    53 0.140  -0.17
 5.01 Init - 136659 136505  155  0  2   81   77   109 0.303   8.60
 5.00 Prom - 138914 138875   40                              -7.25

 6.00 Prom + 143322 143361   40                              -5.25
 6.01 Sngl + 147569 148225  657  1  0   65   43   380 0.797  27.12
 6.02 PlyA + 148453 148458    6                               1.05

 7.00 Prom + 148622 148661   40                              -6.15
 7.01 Init + 148715 151871 3157  1  1   44   60   879 0.365  72.50
 7.02 Intr + 167988 168216  229  1  1   71  105   120 0.016   8.01
 7.03 Intr + 189901 190081  181  1  1   71   38   164 0.020   8.65
 7.04 Intr + 192348 192447  100  2  1   43   77    39 0.004  -2.94
 7.05 Intr + 194880 194987  108  1  0   20   99   105 0.040   4.14
 7.06 Intr + 204233 204303   71  0  2   87   83    31 0.105   0.48
 7.07 Term + 207703 207966  264  0  0   55   37   298 0.739  15.92
 7.08 PlyA + 208050 208055    6                               1.05

 8.00 Prom + 209043 209082   40                              -6.15
 8.01 Sngl + 209993 210538  546  1  0   70   41   229 0.680  12.25
 8.02 PlyA + 210560 210565    6                               1.05

 9.07 PlyA - 212309 212304    6                               1.05
 9.06 Term - 213338 213075  264  2  0   38   48   152 0.050   0.72
 9.05 Intr - 216336 216257   80  1  2   84   86    34 0.082   1.15
 9.04 Intr - 221461 221340  122  0  2   60   81   121 0.659   7.82
 9.03 Intr - 223414 223240  175  2  1   61   41   166 0.765   7.28
 9.02 Intr - 226232 226058  175  2  1   32   67   183 0.906   9.19
 9.01 Init - 231109 231026   84  1  0   14   99    74 0.411   1.97
 9.00 Prom - 233694 233655   40                              -6.45

10.00 Prom + 236829 236868   40                              -5.15
10.01 Init + 240563 240572   10  1  1   91   93     3 0.533   1.88
10.02 Intr + 243711 244096  386  1  2   71   19   275 0.171  12.54
10.03 Term + 249933 250175  243  2  0    8   39   228 0.316   4.82
10.04 PlyA + 251376 251381    6                               1.05

11.03 PlyA - 253993 253988    6                               1.05
11.02 Term - 261920 261737  184  1  1   65   48   103 0.327   0.03
11.01 Init - 264793 264666  128  1  2   75   35    94 0.428   2.48
11.00 Prom - 288940 288901   40                              -2.75

12.13 PlyA - 289640 289635    6                               1.05
12.12 Term - 300533 300397  137  0  2   30   49   180 0.216   5.50
12.11 Intr - 306800 306638  163  2  1   72   37    72 0.103  -0.87
12.10 Intr - 312444 312361   84  0  0   47   78    81 0.103   2.10
12.09 Intr - 315155 315110   46  1  1   61  107    38 0.016   0.49
12.08 Intr - 318272 318142  131  2  2  110   91    37 0.025   4.77
12.07 Intr - 327096 327013   84  0  0  131    4    88 0.338   3.80
12.06 Intr - 327469 327403   67  0  1   23   98    70 0.075  -0.51
12.05 Intr - 327724 327639   86  1  2   79   59    79 0.061   1.80
12.04 Intr - 329558 329391  168  2  0   91    6   196 0.106  10.92
12.03 Intr - 333920 333836   85  1  1   71   87    27 0.787  -0.20
12.02 Intr - 335296 335204   93  0  0   22   94    85 0.404   0.66
12.01 Init - 341813 341728   86  2  2   78  101    40 0.485   4.64
12.00 Prom - 341873 341834   40                              -7.25

13.00 Prom + 344589 344628   40                              -4.85
13.01 Sngl + 350415 351017  603  2  0   36   43   245 0.874  10.74
13.02 PlyA + 351243 351248    6                               1.05

14.00 Prom + 351412 351451   40                              -6.15
14.01 Init + 352154 352540  387  1  0   22   41   199 0.273   5.45
14.02 Term + 368609 368767  159  1  0  106   52    52 0.063   0.36
14.03 PlyA + 369298 369303    6                               1.05

15.06 PlyA - 369388 369383    6                               1.05
15.05 Term - 376052 375968   85  1  1  125   38    30 0.029  -2.15
15.04 Intr - 386980 386843  138  0  0   61   66   108 0.209   4.56
15.03 Intr - 390717 390438  280  1  1    5   11   188 0.049  -1.59
15.02 Intr - 394998 394725  274  0  1    7   71   220 0.599   8.39
15.01 Init - 402018 401770  249  0  0   82   68   123 0.349   7.21
15.00 Prom - 402072 402033   40                              -5.65

16.02 PlyA - 402188 402183    6                               1.05
16.01 Sngl - 402713 402321  393  2  0   60   43   199 0.342   8.59
16.00 Prom - 404612 404573   40                              -6.15

17.00 Prom + 407688 407727   40                              -7.25
17.01 Init + 408829 409015  187  0  1   78   56   140 0.569   9.07
17.02 Intr + 410760 410896  137  1  2   98   64    39 0.181   1.87
17.03 Term + 423533 423646  114  1  0  131   38    89 0.192   5.79
17.04 PlyA + 425872 425877    6                               1.05

18.02 PlyA - 425949 425944    6                               1.05
18.01 Sngl - 427076 426831  246  2  0   77   42   189 0.620   8.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 182168 182075   94  2  1  100   80    74 0.911   6.42
S.002 Intr - 186190 186030  161  2  2   98  106    29 0.941   4.49
S.003 Intr - 189978 189885   94  0  1  120   98    61 0.943   8.92
S.004 Init + 319475 319528   54  1  0   81   64    67 0.809   4.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_1|452_aa
MSELPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV
KMAILPKVIYRFNAIPIKLQMPFFTELEKTTLKFIWNQKRARIAKSILRQKNKAGGITLP
DFKLYYKATVTKTAWHWYQNRDIDQWNRTEPSEITPHIYNYLIFDKTEKNKQWGKDSLFN
KWCWENWLAICRKLKLDPFLTPYTKINSRWIKELNIRPKTIKTLEENLGITIEDIGMGKD
FMSKTPKAMATKDKIGKWDLTKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRI
YNELKQIYKKKTNNPIKKWAKDMNRHFPKEDIYAAKKHMKKCSSSLAIREMQIKITMRYH
LTPIRMAIIKKSGNNREGILATVCRIKCKEEKQGLGNVVLCKDEHQALVSDRPVFETQAK
FFPLKCGCEKQMFSKAESLGIFINASQLNTLG

>gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_1|1359_bp
atgagtgaactcccattcacaattgctacaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg
aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacaa atgcctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcctaagacaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggcactggtaccaaaat agagatatagatcaatggaacagaacagagccctcagaaataacgccacatatctacaac tatctgatctttgacaaaactgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagagttaaatattagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattgaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattggcaaatgggatcta actaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaaaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggca aaggacatgaacagacacttcccaaaagaagacatttatgcagccaaaaaacacatgaaa aaatgctcatcatcactggccatcagagaaatgcaaatcaaaatcacaatgagataccat ctcacaccaattagaatggcaatcattaaaaagtcaggaaacaacagggagggtattttg gcaacagtgtgtagaataaaatgcaaagaagaaaaacaaggacttggaaatgtggtgctt tgcaaggatgagcatcaggcactggtttcagacagacctgtgtttgaaacacaagccaaa tttttccctttaaaatgtggttgtgaaaagcagatgtttagtaaagctgaaagtcttgga atattcatcaatgcatcgcaactcaataccttagggtag >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_2|218_aa MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQADVQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKAKMLRAAREKGWV TLKGKPIRLTADLLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISKGEIKYFTDK QMLRDFVTTRPALKELLKEALNMERNNRYQLLQNHAKM >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_2|657_bp atggaagatgaaatgaatgaaatgaagcgagaaggaaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacacgctgcaggat attatccaggagaacttccccaatctagcaaggcaggccgatgttcagattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggcaaaaatgttaagggcagccagagagaaaggttgggtt accctcaaagggaagcccatcagactaacagcggatctcttggcagaaaccctacaagcc 
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagcaaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagca ctaaacatggaaaggaacaaccggtaccagctgctgcaaaatcatgccaaaatgtaa >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_3|400_aa MDKFLDTYTLPRLNKEEAESLNRSITGSEIVAIINSLPEIVAIINSLPTKKSPGPDGFTA DFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLINT DAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTQDKNHTIIS IDAEKAFDKIQQRFMLKTLNKLRIDRTNLKIIRPIYDKPTANIILNGQKLEAFPLKTGTR QGCPLSPLLFNIVLEVLARAIRQEKEIKDIQLGKEEVKLSLFADEMIVYLENPIVSAQNL LKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVK DLFKENYKPLLNEIKEDTNKWKNIPLFSKYHSMCMEFSLR >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_3|1203_bp atggataaattcctcgacacatacaccctcccaagactaaacaaggaagaagctgaatct ctgaacagatcaataacaggctctgaaattgtggcaataatcaatagcttacctgaaatt gtggcaataatcaatagcttaccaaccaaaaaaagtccaggaccagatggattcacagcc gacttctaccagaggtacaaggaggagctggtaccattccttctgaaactattccaatca atagaaaaagagggaattctccctaactcgttttatgaggccagcatcatcctgatacca aagcctggcagagacacaaccaaaaaagagaattttagaccaatatccttgattaacact gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacatatgcaaa tcgataaatgtaatccagcatataaacagaacccaagacaaaaaccacacgattatctca atagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgctaaaaactctcaat aaattacgtattgataggacgaatctcaaaataataagacctatctacgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcaccactcctattcaacatagtgttggaagttctggccagggca atcaggcaggagaaggaaataaaggatattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagatgagatgattgtatatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaa 
tggaagaacattccattattttcaaaataccattcgatgtgcatggaatttagtttgaga tga >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_4|225_aa MLYGIISGNDFLDRTPKAKTTKAKIDKWNYIKLKNFCTTNEIIRNGEGLQSWTAVIVIAL LGLVTQWGYQALGRFNVLASSNAGYANSKLVMRTDSGCLVSQDVAGSGVGSRFPLPGCSV IQPKGQVGSSISSFNVTADGQKPVWESRSNWKTFDFMELCCQRAYHQMPNESSRMKYQQL RWQRLIGDKTDGILLKQQVTKSLVLQIQQLSKSGLSLCDHLWPRL >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_4|678_bp atgctttatggcatcatatctggcaatgatttcttggataggacaccaaaagcaaagaca acaaaagcaaaaatagacaaatggaactacatcaaacttaaaaacttctgcacaacaaat gaaataatcagaaatggggaaggactgcagagctggactgcagtgattgttattgccctt ctgggtctagtcactcagtggggctaccaggctctgggcaggtttaatgtgctggcttcc tcaaatgctggttatgctaatagtaaacttgtcatgcggacagattcagggtgtctagtt agccaggatgttgcaggcagtggtgttggaagccgttttccccttcctgggtgcagtgtt attcagccaaaagggcaggttgggagcagcataagtagctttaatgtcactgcagatgga cagaagccagtatgggagtccagaagcaactggaagacctttgacttcatggaactgtgc tgccaaagagcatatcaccagatgccaaatgaaagttcacggatgaaatatcaacagcta aggtggcaaagactgattggagataaaacagatggtatcctgctgaaacaacaagtgacc aagagtttagtacttcaaattcaacagctgtcaaaatcaggcttgagtctctgtgaccat ctctggcctagactatga >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_5|219_aa MDFATLAITSSFQATGRRRQEKVTGYAPADYTSTKELSGSPTHQLPHDHTWRITKDILYN AQVRCSQNSYQQIFFGFLCNSCYVSSSSRAQHMAAAAPAASPYVVCINRQKRPFPESPID LPSLLSNRSQEKPYKCSECSKAFSQKRGLDEHKRTHTGEKPFQCDVCDLAFSLKKMLIRH KMTHNPNRPLAECQFCHKKFTRNDYLKVHMDNIHGVADS >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_5|660_bp atggattttgccactctggccatcacatcctcattccaggcaacaggaaggaggaggcaa gagaaggtaacaggatacgcccctgctgattacacttccactaaggagctttctggaagt cctacccaccaacttccacatgaccacacttggcggattacaaaggacattctttacaat gcacaagtacgttgttctcagaattcttatcagcagatttttttcggcttcctttgtaac tcttgttatgtgtccagctcctcgcgtgctcagcacatggctgcagcagctccagcagcc tctccttatgtggtctgtatcaacaggcagaagaggccttttccagaatctcccatagac ttaccttctcttctcagtaatcgcagccaggagaagccgtacaagtgctcagagtgcagc aaggccttcagccagaagcgaggcctggatgagcacaagaggacgcacactggagaaaag 
ccttttcagtgtgatgtttgtgatttggcttttagcctgaagaaaatgctgattcgacac aagatgactcataatcccaatcgtcccctggcagaatgccagttttgccataagaagttt acaaggaatgactacctcaaagtgcacatggacaatatccatggtgtagctgacagctaa >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_6|218_aa MEGEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANVQVQEIQRMPQRYSSRRATPRHIIVRFTKVEMKAKMLRAAREKGRV TFKGKPIRLIADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK EMLRDFVTTRPALKELLKEALNKERNNRYQPLQNHAKM >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_6|657_bp atggaaggtgaaatgaatgaaatgaagcgagaaggaaagtttagagaaaaaagaataaaa agaaacgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaatttccccaatctagcaaggcaggccaacgttcaggttcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggcaaaaatgttaagggcagccagagagaaaggtcgggtt accttcaaagggaagcccatcagactaatagcggatctctcggcagaaaccctacaagcc agaagagagtgggggccgatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag gaaatgctgagagattttgtcaccactaggcctgccctaaaagagctcctgaaggaagca ctaaacaaggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_7|1369_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLTDIYRTLHPKSTEYTFFSAPHHTYS KIDHILGSKALLSKCKRREIITNYLSDHSAIKLERRIKNLTQNRSTTWKLNNVLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCREKFIALNAHKRKEERSKIDTLTSQLK ELEKQEQTHSKASRRQQITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDARKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGSEIVARINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYKASIILIPKPGRDTTKKENFRPISLINTDAKILNKILANRIQQHIKKLIHHD QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAVRQE KEIKGIQLGKEEVKLSLFADEMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI 
PCSWVGRINIVKMAILPKVIYRFNVIPIKLPMPFFTELEKTTLKFIWNQKRARISKSILS QKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITSHIYNYLIFDKPDK NKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLG ITIQDIGMGKDFMSKTPKAMATKAKMDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFAT YSSDKGLISRIYNELQQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSPAIR EMQIKTTMRYHLTPVTMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLLQPLWKSVWRFLR DLELEIPFDPAIPLLGIYPKGYKSCCYKDTCTQSYSYSIPPSFLKRCQLKGHEVHWEEGK KQDSHVLTNFSVGNHKKPDWVDLLRVKQKVNFTDANFFKKEYEMQNEKRCLCKENGVRNK ALHSSHWNGLSSEWVCRGKTLSQEDLESTSHYNERVELGFMMRKRKEGKDALMAVLTTKC RGKHKVEKQQQCSRREIIRAYPGQREVAKRRNNHRFSDEVIEGSNKDSKSGNLPLNDTAN LKGRHQYPISQMKNKETEAQGGERSSLPATEQSWMENDFDELREEGFRRSNFSELKEEVQ THRREAKNLEKRLDEWLTRISSVGKSLNDLMEQKTMAQELHDKCTSFGS >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_7|4110_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagttaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaacagacatctacaga actcttcaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaagagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaacgcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacgtgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca acttaccagaatctctgggacacattcaaagcagtgtgtagagagaaatttatagcacta aatgcccacaagagaaaggaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacactcaaaagctagcaggaggcaacaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacgcaagaaaaaatgataaaggggatattaccacg gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattccttgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaagaatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttataaggccagcatcattctgataccaaagccaggcaga 
gacacaaccaaaaaagagaattttagaccaatatccttgattaacactgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaggcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggacatatttcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctggccagggcagttaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat gagatgattgtatatctagaaaaccccatcgtctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattccta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccgctgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt tacagattcaatgtcatccccatcaagctaccaatgcctttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcatttccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataacgtcgcatatctacaactatctgatctttgacaaacctgacaaa aacaagaaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattccaga tggattaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatg gcaacaaaagccaaaatggacaaatgggatctaattaaactaaagagcttctgcacagca aaagaaactaccatcagagtgaacaggcaacctacaaaatgggagaaaattttcgcaacc tactcatctgacaaagggctaatatccagaatctacaatgaactccaacaaatttacaag aaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaa gaagacatttatgcagccaaaaaacacatgaaaaaatgctcatcatcaccggccatcaga gaaatgcaaatcaaaaccacaatgagataccatctcacaccagttacaatggcaatcatt aaaaagtcaggaaacaacaggtgctggagaggatgtggagaaataggaacacttttacac tgttggtgggactgtaaactacttcaaccattgtggaagtcagtgtggcgattcctcagg 
gatctagaactagaaataccatttgaccctgccatcccattactgggtatatacccaaag ggctataaatcatgctgctataaagacacatgcacacaatcttacagttacagtatccct ccctcatttctgaagagatgccaactcaaaggacatgaggttcattgggaagaaggaaaa aaacaggacagccacgtgctcacgaacttttcagtcggaaatcataaaaagccagattgg gtggatttactgagggtgaaacaaaaagtgaattttactgatgcaaatttttttaaaaaa gaatacgagatgcagaatgaaaagaggtgtctctgtaaagaaaatggggtccggaacaaa gctttacattcttcacattggaacggtctctcctcagagtgggtctgcagaggaaaaaca ctgagtcaggaggatctagaatcaaccagtcattacaatgaacgagtggagcttggattc atgatgcgaaagaggaaagaagggaaggatgccttgatggctgtgttgacaacaaaatgt aggggaaaacacaaagtggagaagcaacagcaatgcagcaggagagagataattagagct tatccagggcaaagggaagttgctaagagaagaaataaccacagattttcagatgaagtt atagaaggtagcaacaaagattccaagtcaggaaaccttccactcaacgacactgctaat ttgaaaggtaggcaccagtatcccatttcacaaatgaagaataaagaaactgaggctcaa ggaggagaacgcagctccttgccagcaacagaacaaagctggatggagaatgactttgac gagttgagagaagaaggcttcagacgatcaaacttctccgagctaaaggaagaagttcaa acccatcgcagagaagctaaaaaccttgaaaaaagattagacgaatggctaactagaata agcagtgtagggaagtccttaaatgacctgatggagcagaaaaccatggcacaagaacta catgacaaatgcacaagcttcggtagctga >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_8|181_aa MDTFLDTYTLPRLKQEKGESLNRPITGSEIEAIINSLPTKKSPRPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFCEASIILIRKPGRDRIKKENFGPISLKNIDAKILDKILA NQIQQHIKKLIHHDQVGFIPGMQVWFNIHKSKNVIQYINRIKDKNRMIISIDTEKAFDKI Q >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_8|546_bp atggatacattcctcgacacatacaccctcccaagactaaagcaggaaaaaggtgaatcc ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcctaccaaccaaa aaaagtccaagaccagacggattcacagccgaattctaccagaggtacaaggaggagctg gtaccatttcttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttgtgaggccagcatcatcctgatacgaaagcctggcagagacagaataaaaaaagag aattttggaccaatatccctgaagaacatcgatgcaaaaatcctcgataaaatactggca aatcaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcattcct gggatgcaagtctggttcaacatacacaaatcaaaaaatgtaatccagtatataaacaga atcaaagacaaaaaccgcatgattatctcaatagatacagaaaaggcctttgacaaaatt caatag 
>gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_9|299_aa MEEEEQVHLKEIWGQENDMKIQMMGVHLEGENIFYLAVEDIETDTELLIGYLDSDMEAEE EEQQIMTVIKEGEVENSRRQSTAGRKDRLGCKEDYACPQCESSFTSEDILAEHLQTLHQK PTEEKEFKCKNCGKKFPVKQALQRHFEQHQETCRGDARFVCKADSCGKRLKSKDALKRHQ ENVHTGDPKKKLICSVCNKKCSSASSLQEHRKFKVYKHTNQHPVSSSGFVNAPINTLYLA NGDVENFCVHTLYLANLVGRWRTVVSSSGIVNAPISTLSKRTNQLSVKWTNQQDVGRAR >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_9|900_bp atggaagaagaggagcaggtacatttgaaggagatttggggacaagagaatgacatgaaa attcagatgatgggagttcacttggaaggagaaaacattttctatttggcagttgaagat atagaaacagacacggagcttctgattggctacctggatagtgacatggaggctgaggag gaagaacagcaaattatgacagtcatcaaagaaggggaagttgaaaattctagaagacaa tcaacagcgggcagaaaagatcgccttggctgtaaagaggactatgcttgtcctcaatgt gaatcgagttttaccagtgaggatattcttgctgagcatctccagacattgcaccagaaa cccacagaggagaaagaatttaagtgcaagaactgtgggaagaaattcccagttaagcag gctttgcaaagacattttgagcagcaccaggagacttgccggggggatgccaggtttgtg tgcaaggctgacagctgtggaaagaggctgaagagcaaggatgccctgaaaagacaccag gaaaatgtccacactggagatcctaagaaaaagcttatatgttcagtgtgcaataaaaag tgttcttcagcatcaagcctacaggaacatagaaagttcaaggtttataaacacaccaat cagcaccctgtgtctagctcagggtttgtgaatgcaccaatcaacactctgtatctagct aatggggacgtggagaacttttgtgtccacactctgtatctagctaatctagtggggagg tggagaactgttgtctctagctcagggattgtgaatgcaccaatcagcaccctgtcaaaa cggaccaatcagctctctgtaaaatggaccaatcagcaggatgtgggtagggccagataa >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_10|212_aa MRNCCVQLTLMAWIPHLPRETAWSGKGCESECGVWPMHSQICRLLQKGGQLQVPAQVPAL CQAAAGTGTPQVASTADTREHSGIQKLGDTRNCSTPKKESQPWLGELPGLGSLKGCSSSL LLFTRNVASKEQAERDKEAAEERLDASRIMRSKERSHLYNTKVQGEAARADGEAAASYPE DLSKILIIKKTGYTKQQIFNVQEIVLLEEDTI >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_10|639_bp atgaggaactgctgtgttcagctcacactaatggcctggattccacacctgccaagggag actgcatggagcggcaaggggtgtgagagtgagtgtggggtctggccaatgcacagtcag atatgccggctgctgcagaagggtgggcagctccaggtgccagcacaggtgccagctctc tgccaggctgcagctgggacaggcacaccgcaagtagcttccacagctgacaccagggaa 
cacagtggcatccagaagctcggagacaccaggaactgcagcaccccaaagaaggagtca cagccctggcttggggagctcccaggtctgggatccctgaaaggctgcagctcttctctc cttctcttcactcgcaatgtggcaagcaaggagcaggctgagagagacaaggaagctgca gaagaaaggctggatgctagcagaattatgaggtctaaggaaagaagccatctctataac acaaaagtacaaggtgaagcagcaagagctgatggagaagctgcagcaagttatccagaa gatctatctaagatattaataattaagaaaactggctacactaaacaacagattttcaat gtacaagaaatagttctattggaagaagataccatctag >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_11|103_aa MRITRTVMELKIMSKKEGVICFVEEKRREGGILAFRNLKGYRWKPVQIKQVFELRWFLLH DETVDFIIGPRYAEDDPKSDSNQIHFISKKIQIKEDGIWKTKS >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_11|312_bp atgaggataacccggacagtaatggaactgaaaatcatgtccaaaaaggaaggagttatt tgttttgtggaagagaaacgtagagagggagggattcttgccttcagaaatttgaagggc tatagatggaagcctgtgcaaatcaaacaggttttcgaactgaggtggtttttactacac gatgagactgttgacttcatcattggccccaggtatgcagaagatgatcctaagtcagat agtaaccaaatacacttcatttccaagaaaattcaaataaaagaagatggtatttggaag actaagtcttga >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_12|409_aa MKHRSYYGGWYQLNKKKTIEKTFHAMGKIFWWPEGDGRDTPLVLEIAGEILGKMTSAGER IPAEYMLHQSCRIGSTKGSFHFLLVGTLRSKAVEILVPKETVAEHLTESPGWLKGGWKPV FDKELNAPGKCQGPTFWTVGPVDLQEWQERIRAEPDPGRGGAEPRGILESAGRFSLKSSR VQDGMGLYTARRVRKEVSAAGASLFLGYGCELRLCVGEGTRGREAGRWNKMFVAISSVAQ NVRAEVFAILLDSRWLLQPQLSCSYSSHENSSLFDELFVTDPGEKFGPFAGEKRMPEDLD ENMDYRLMWESLNVSVDAVGVQWCDLMQVESYAMPQFSTLGGPSGKTAFTVCRILKNQIQ TKISNKEVEGEEADVHNDDEEDLKHILMKKMRAEGKKMKMVLMWKREEG >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_12|1230_bp atgaagcatagatcatactatggaggatggtatcagttaaataagaagaaaacaatagag aagacatttcacgcaatgggtaaaattttctggtggccagaaggagatggccgggatacc ccactggttctggagattgcaggtgagatcctgggaaaaatgacaagtgctggagagaga atccctgctgagtatatgcttcaccaaagctgtcgtataggctccaccaaaggaagcttc cactttcttcttgtaggaactttgcgatcaaaagctgtggaaattctcgtgcccaaagag acagtggcagagcatttgactgagagccctggatggcttaagggtggctggaagcctgtg tttgacaaagagctgaatgctccaggaaaatgccagggccccaccttctggactgtgggc 
ccagtggatctgcaggaatggcaggaacggatccgggccgagcccgacccgggacgagga ggggctgagccgcgggggatcctggaatcggcagggaggttctccctgaagtcctcccgg gttcaggacggcatggggctctacacggcccgcagagtgcgaaaggaagtgagcgcggcg ggagcctccctcttcctcggttacgggtgtgagctgcgtctttgcgttggcgaaggcacg cgtgggcgggaagctggcagatggaacaagatgtttgtggccattagttcagtggctcaa aatgtcagggcagaagtctttgcaattctcttggactcaaggtggctgcttcagccccag ctatcatgctcatattccagccatgaaaattcaagtttatttgatgaactctttgtaaca gacccgggtgaaaagttcggaccctttgctggagagaagagaatgcctgaagacttggat gaaaatatggattacaggttgatgtgggagtctttgaacgtttctgtggatgcagtgggt gttcagtggtgtgatctgatgcaagtggagtcatatgctatgccacagttcagtacactg ggtggtccctctggaaagacagcttttactgtgtgcagaatcctaaagaaccagatacaa actaaaatttcaaataaagaagtggaaggagaagaagctgatgttcataatgatgatgaa gaagacttgaaacatattttgatgaagaagatgagggcagaggggaagaagatgaagatg gtgttgatgtggaagagggaggaaggttga >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_13|200_aa MKRNDQRIQEILDSVKRPNLHLIGIPESDGENGTRLENTLQDIIQENFTNLARQANIQIK EIQRLTQRYSSRRATIRLMIIRFTKVEMKEKTLRAVREKGQVTHKGKPIRLTADVSAETL QARREWGPIFNILKEKNFQPRISYPAKLSFISKGEIRPFTDKQMLRDFVTTRPALQELLM KALNIERNNWYQPLQKHTKL >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_13|603_bp atgaaaaggaatgatcaaagaatccaagaaatattggactctgtgaaaagaccaaatcta catttgattggtatacctgaaagtgatggggagaatggaaccaggttggaaaacactctt caggatattatccaggagaacttcaccaacctagcaagacaggccaatattcaaattaag gaaatacagaggctaacacaaagatactcctcaagaagagcaaccataagactcatgatc atcagattcaccaaggttgaaatgaaggaaaaaacattaagggcagtcagagagaaaggt caggttactcacaaagggaagcccatcagactaacagcagatgtctctgcagaaacccta caagccagaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaaccc agaatttcatatccagccaaactaagcttcataagcaaaggagaaatacgaccctttaca gacaagcaaatgctgagagattttgtcaccaccaggcctgccttacaagagctcctgatg aaagcactaaacatagaaaggaacaactggtaccagccactgcaaaaacataccaaattg taa >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_14|181_aa MNPGAGFFEKINKIDRPLARLIKKKREKNQIDTVKNDTGDIITDPTEIQTTIREYYKHLY PNKLENLEEMNKFLDTYTLPTLDQEEVESLNRRITSCEIGAVINSLPTKNSPGPDQFTAK 
PYQRYRKELSGVDSNRTVGDIRPGTNCIDMNSTQTTCSLLILNFVFLKRNQRHNLLSSTI K >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_14|546_bp atgaatccaggagctggattttttgaaaagattaacaaaatagatagaccactagcaaga ttaataaagaagaaaagagagaagaatcaaatagacacagtaaaaaatgatacaggggat atcatcactgatcccacagaaatacaaactaccatcagagaatactataaacacctctac ccaaataaactagaaaatctagaagaaatgaataaatttctggacacatacacactccca actctagaccaggaagaagtcgaatccctgaatagaagaataacaagttgtgaaattggg gcagtaattaatagcctaccaaccaaaaacagtccaggaccagaccaattcacagccaaa ccctaccagagatacagaaaggagctgtctggtgttgacagcaacaggacagttggggat ataaggcctggaacaaattgcattgacatgaactcaacgcaaaccacatgttcactactg attttgaactttgtgttcctcaaaagaaatcaaagacataatctactttcttcaacaatt aaatga >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_15|341_aa MGRDFKSKTPKAMATKAKIDKWDLMKLKSFCTAKETTIRMNRQPTEWEKIFAIYPSDKGP ISRIYRELKQIYKNTTNNPIKKWESSDWHLAGAHLGRSFQSKEQAAIFAVLQPLLLIPRK TRSGVDLQQTPADLQQRGLAVRRKTNKQKGIASTSPKGRPLRGPIRRSPSSKTKEKNTGN AIQDIGMGKNFMTKTPKAVATKAKIDKWDLIKLKSFCTAKETIVRVNRQPTEWEKIFAIY PSDKGPMSRIYKELKQIYKNTTNNPIKNFLTFIQTLCPITFFDNFPLLVKANIKTLRLNC FLGSSFPYEGSSVMILFTGLSFRKMEPQKEASSRQVVIVDL >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_15|1026_bp atgggcagggacttcaagtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaatgaaattaaagagcttctgcacagcaaaagaaactaccatcaggatg aacaggcaacctacagaatgggagaaaatttttgcaatctacccatctgacaaagggcca atatccagaatctacagagaacttaaacaaatttacaagaatacaacaaacaaccccatc aaaaagtgggagagctctgactggcatctggcaggtgcccatctgggacgaagcttccag agcaaggaacaggcagcaatctttgctgttctgcagcctctgctgttgatacccaggaaa acaaggtctggagtggacctccagcaaactccagcagacctgcagcagaggggcctggct gttagaaggaaaactaacaaacagaaaggaatagcatcaacatcaccaaaaggacgtcca ctcagaggccccatccgaaggtcaccatcatcaaagaccaaagaaaaaaacacaggcaat gccattcaggacataggcatgggcaaaaacttcatgactaaaacaccaaaggcagtggca acaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactatcgtcagagtgaacaggcaacctacagaatgggagaaaatttttgctatctat ccatctgacaaagggccaatgtccagaatctacaaggaacttaaacaaatttacaagaat 
acaacaaacaaccccatcaaaaatttcctaaccttcattcagaccctttgtcccatcaca tttttcgacaactttccactcttggtcaaagctaacataaaaacactcaggcttaactgc ttcttgggatcttcatttccttatgaaggctccagtgtcatgatactgtttactggctta agttttaggaaaatggagccacaaaaggaagcatcaagcaggcaagttgtgattgttgac ttataa >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_16|130_aa MSELPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKGKNIPCSCIGRINIV LPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKTMISQKNNIGRIMLPDFKL YYKATVTKTA >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_16|393_bp atgagtgaactcccattcacaattgctaccaagagaataaaatacctaggaatacaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactactcaatgaaataaaa gaggacacaaacaaagggaaaaacattccatgctcatgtataggaagaatcaatatcgta ctgcccaaggtaatttatagattcaatgccatccccatcaagttaccaatgactttcttc acagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccacattgcc aagacaatgataagccaaaagaacaacattggacgtatcatgctacctgacttcaaacta tactacaaggctacagtaacaaaaacagcatga >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_17|145_aa MPLASHHVGEEKERRAVALRRSQTWSSLSQDCDFLFGALWFLVSPSVLVSPCSLVPAGEA ACGNTFSVIGCKAEKAIASEKVSQARTESQQTKDNPQLLNSRWMLNSGDPIEVTMLHLIV MSLGLLLVEIISQAIFDFDDLDSFE >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_17|438_bp atgccccttgctagccaccatgtgggtgaagagaaggagagaagagctgtggcccttcgg agatcccagacctggagctccctgagccaggactgtgacttcctctttggggccctgtgg ttcctggtatctccaagcgtcttggtgtcaccatgttccctggtgccagctggggaagct gcttgtggtaatacattctctgtcataggatgtaaggctgagaaagctattgcttcggaa aaagttagtcaggccaggacagaaagccagcaaacaaaagacaaccctcaactcttaaat tctagatggatgttaaactctggggatcccattgaagttaccatgctacatttaatagtc atgtctcttgggctcctcttggttgagataatttctcaggctatctttgattttgatgac cttgacagttttgagtag >gi568815594r:120595114_121022608|GENSCAN_predicted_peptide_18|81_aa MGSEPSCRVPTGAPSSGAVRREPLSSRSRNGKSTYSLHHVPGKATDTQCQPVKAARRGYT LQSHNSSCLRPWEPTSCISVT >gi568815594r:120595114_121022608|GENSCAN_predicted_CDS_18|246_bp atggggtcagagccctcatgcagagtccctactggtgcaccatctagtggagctgtgaga agagaaccactgtcctccagatcccggaatggtaaatccacctacagcttgcaccatgtg 
cctggaaaagccacagacactcagtgccagcccgtgaaagcagccaggaggggctatacc ctgcaaagccacaattcaagctgcctaaggccatgggagcctacctcctgcatcagtgtg acctga