GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:59:49 Sequence gi568815593r:72120912_72420221 : 299310 bp : 40.73% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1335 1497 163 0 1 88 20 160 0.572 8.26 1.02 Term + 4580 4714 135 1 0 69 54 55 0.367 -2.76 1.03 PlyA + 5130 5135 6 1.05 2.06 PlyA - 5697 5692 6 1.05 2.05 Term - 12188 11953 236 0 2 7 36 236 0.497 5.80 2.04 Intr - 12439 12346 94 1 1 88 95 38 0.498 3.12 2.03 Intr - 14955 14560 396 0 0 -3 99 184 0.503 4.25 2.02 Intr - 17528 17430 99 2 0 49 83 63 0.650 1.29 2.01 Init - 22881 22684 198 0 0 76 63 98 0.357 5.05 2.00 Prom - 28774 28735 40 -5.05 3.00 Prom + 58842 58881 40 -3.15 3.01 Init + 60318 60348 31 2 1 73 89 41 0.821 2.67 3.02 Intr + 62832 62914 83 1 2 105 99 114 0.998 12.64 3.03 Intr + 63970 64215 246 0 0 53 1 172 0.198 1.73 3.04 Intr + 65703 65843 141 2 0 98 92 216 0.910 22.63 3.05 Intr + 72955 79456 6502 0 1 73 96 7195 0.929 705.25 3.06 Intr + 82652 82890 239 0 2 57 94 244 0.618 18.31 3.07 Term + 84173 84328 156 1 0 145 42 175 0.997 15.55 3.08 PlyA + 84822 84827 6 1.05 4.00 Prom + 94074 94113 40 -4.65 4.01 Init + 94763 94832 70 1 1 39 93 67 0.592 3.59 4.02 Term + 95000 95142 143 0 2 114 47 57 0.790 1.31 4.03 PlyA + 98468 98473 6 1.05 5.11 PlyA - 98635 98630 6 1.05 5.10 Term - 100237 99998 240 1 0 94 38 244 0.994 15.04 5.09 Intr - 103005 102772 234 0 0 82 63 261 0.475 19.96 5.08 Intr - 105288 105146 143 1 2 89 80 101 0.769 8.55 5.07 Intr - 107457 107355 103 0 1 58 99 30 0.900 -0.07 5.06 Intr - 108304 108116 189 1 0 67 49 96 0.768 2.46 5.05 Intr - 111647 111532 116 0 2 74 84 55 0.766 2.95 5.04 Intr - 113286 113208 79 0 1 87 116 6 0.347 1.51 5.03 Intr - 117217 117103 115 0 1 83 113 34 0.089 4.93 5.02 Intr - 130756 130688 69 0 0 79 70 80 0.367 2.68 5.01 Init - 131579 131566 14 2 2 45 115 2 0.238 -1.61 5.00 Prom - 140662 140623 40 -6.35 6.00 Prom + 140812 140851 40 -4.45 6.01 Init + 144808 144950 143 0 2 56 74 90 0.556 4.05 6.02 Intr + 151764 151946 183 0 0 6 35 177 0.038 2.18 6.03 Intr + 168143 168250 108 2 0 94 52 111 0.825 6.58 6.04 Intr + 169998 170123 126 0 0 102 69 86 0.655 6.97 6.05 Intr + 186879 187024 146 0 2 52 79 40 0.573 -1.49 6.06 Term + 187200 187777 578 1 2 27 42 285 0.392 11.44 6.07 PlyA + 189635 189640 6 1.05 7.00 Prom + 191457 191496 40 -5.15 7.01 Init + 192070 192144 75 0 0 74 93 42 0.297 4.44 7.02 Intr + 199137 199365 229 2 1 55 -17 223 0.001 5.02 7.03 Intr + 199464 199598 135 1 0 68 53 135 0.001 7.62 7.04 Intr + 201261 201353 93 1 0 95 85 46 0.002 4.02 7.05 Intr + 205701 205830 130 1 1 94 94 40 0.006 4.03 7.06 Intr + 210347 210464 118 2 1 79 30 67 0.113 -0.45 7.07 Intr + 214107 214185 79 2 1 74 106 34 0.494 2.01 7.08 Intr + 214883 214974 92 0 2 90 99 62 0.569 6.29 7.09 Intr + 217711 217824 114 0 0 96 103 55 0.986 7.52 7.10 Intr + 222051 222125 75 2 0 111 101 1 0.608 2.49 7.11 Intr + 231730 231843 114 0 0 70 89 34 0.392 1.42 7.12 Intr + 237292 237465 174 0 0 108 49 144 0.928 11.71 7.13 Intr + 237529 237656 128 0 2 24 36 101 0.040 -2.94 7.14 Intr + 238693 238748 56 1 2 104 63 33 0.008 0.10 7.15 Intr + 246402 246546 145 1 1 3 83 147 0.019 4.12 7.16 Intr + 247842 247942 101 0 2 54 121 37 0.023 2.43 7.17 Intr + 257533 257602 70 2 1 91 78 35 0.037 0.12 7.18 Term + 261207 261345 139 0 1 45 49 162 0.606 4.45 7.19 PlyA + 261900 261905 6 1.05 8.03 PlyA - 262053 262048 6 -0.45 8.02 Term - 262984 262865 120 1 0 99 39 89 0.870 2.59 8.01 Init - 263610 263455 156 0 0 58 35 147 0.709 6.36 8.00 Prom - 270468 270429 40 -4.55 9.00 Prom + 275264 275303 40 -3.85 9.01 Init + 282280 282406 127 0 1 53 78 93 0.105 5.17 9.02 Intr + 296218 296484 267 2 0 70 56 144 0.579 5.98 9.03 Term + 296728 296954 227 2 2 13 53 354 0.783 20.36 9.04 PlyA + 298377 298382 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 151764 151956 193 0 1 6 47 185 0.875 2.11 S.002 Intr - 245467 245334 134 0 2 94 115 27 0.825 5.37 S.003 Init - 249778 249663 116 1 2 29 70 134 0.832 5.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_1|99_aa XEQSDRQLLMLSALTPVLPLRLCRPSSRQPSHPHLHPFLAVKEPDLPRRRLEFYSQGTVQ FLALQAIRRLVSKSCLQLFTIEYGNKITNETSQSIISAK >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_1|300_bp nnggaacagagtgatcggcagctcctcatgctgtccgctctcaccccagtgctcccattg agactttgccgcccaagcagcagacagcccagtcacccccacctgcatccttttctagcg gtgaaagagccagaccttccaaggagaagattagaattttatagtcaaggcacagttcag ttcctagctctgcaggccataagaagacttgtaagtaaatcctgtcttcagttgtttaca attgagtatggaaataagataactaatgaaacaagccagtctataataagtgccaaatga >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_2|340_aa MAEITRNEEKGMEQILPQYPQKEPAFGLLECETIHFCYLRHPVDDTLLWQPWGTNIEEYL ETSTLKKVVYIQIAEYEGAYLVLFASDQWYHLLKPCNLEVIGHEKSNEEKEIAHQLPFKA KQECKWKGPREIWMENYHLTRGSDLDWIHPFSQDDSVPGKLDARNILGSKYGELRTIVRV SKNSQEKFTRTLFLPVLWENSKNLKFQSQSNPGDPQSHSYWVYSTRRKSRLRLTSDPNRA GGRGESCSALQPELDSCLRKNTSSHARCMAAQGPWTQGGGYTPEPDFQVLDERVKLPAPK WAALLRSPYNPLLANAEPGQGVSGIPEPPSYTGRRKQTEL >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_2|1023_bp atggcagaaatcactagaaatgaggagaaaggtatggaacagattctccctcagtacccc cagaaggaaccagccttcggcctcttggaatgtgagacaatacatttctgttacttacgc cacccagttgatgatactttgttatggcagccctggggaactaacatagaagagtactta gaaacaagcacactgaagaaggttgtttatatccaaatagcagaatatgaaggtgcctat cttgtcctctttgccagtgatcagtggtatcatttgttaaaaccttgcaacttggaggta attggacatgaaaagtccaatgaggaaaaagaaatagctcaccagctgcctttcaaggca aaacaagaatgcaaatggaaaggtcccagggaaatctggatggaaaattatcacctcaca agagggtctgatctggattggattcatcctttttcacaggatgactctgttccagggaaa ctagatgctaggaacattctgggatccaaatatggagaactgagaactattgtaagggtt tctaaaaactcccaagaaaaattcacaaggactctttttcttccagtgctttgggaaaat tcaaaaaacctgaaattccaaagccagtctaatccaggtgaccctcaatctcatagctac tgggtctactccacaaggaggaaatcaagactgcgcctcacctcagatcccaacagagct gggggtagaggagagagctgttctgcgttacagcctgagctggacagttgtctcaggaaa aacacctcatcccacgccagatgcatggctgctcagggaccatggacccagggtggaggt tatacacctgaacctgacttccaggttctagatgagcgggtaaaattgccagctcccaag tgggcagccttgctaaggagtccctacaaccctctgctggcaaatgctgaacctggacag ggagtttcagggattcctgaacctccttcttatacaggaaggaggaaacaaacagagctc taa >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_3|2465_aa MGLVTVMVVGGQKILHHRSDVLETVVLINPSDEAVSTECIYKVLQPSPLSNSRIRLSRPK SPELISSRSPFFPDADNCEPACPDLPILDISYKYNDILCGLACLAYFSLTVSRFIHVVAC VRLMITDAARHKLLVLTGQCFENTGELILQSGSFSFQNFIEIFTDQEIGELLSTTHPANK ASLTLFCPEEGDWKNSNLDRHNLQDFINIKLNSASILPEMEGLSEFTEYLSESVEVPSPF DILEPPTSGGFLKLSKPCCYIFPGGRGDSALFAVNGFNMLINGGSERKSCFWKLIRHLDR VDSILLTHIGDDNLPGINSMLQRKIAELEEEQSQGSTTNSDWMKNLISPDLGVVFLNVPE NLKNPEPNIKMKRSIEEACFTLQYLNKLSMKPEPLFRSVGNTIDPVILFQKMGVGKLEMY VLNPVKSSKEMQYFMQQWTGTNKDKAEFILPNGQEVDLPISYLTSVSSLIVWHPANPAEK IIRVLFPGNSTQYNILEGLEKLKHLDFLKQPLATQKDLTGQVPTPVVKQTKLKQRADSRE SLKPAAKPLPSKSVRKESKEETPEVTKVNHVEKPPKVESKEKVMVKKDKPIKTETKPSVT EKEVPSKEEPSPVKAEVAEKQATDVKPKAAKEKTVKKETKVKPEDKKEEKEKPKKEVAKK EDKTPIKKEEKPKKEEVKKEVKKEIKKEEKKEPKKEVKKETPPKEVKKEVKKEEKKEVKK EEKEPKKEIKKLPKDAKKSSTPLSEAKKPAALKPKVPKKEESVKKDSVAAGKPKEKGKIK VIKKEGKAAEAVAAAVGTGATTAAVMAAAGIAAIGPAKELEAERSLMSSPEDLTKDFEEL KAEEVDVTKDIKPQLELIEDEEKLKETEPVEAYVIQKEREVTKGPAESPDEGITTTEGEG ECEQTPEELEPVEKQGVDDIEKFEDEGAGFEESSETGDYEEKAETEEAEEPEEDGEEHVC VSASKHSPTEDEESAKAEADAYIREKRESVASGDDRAEEDMDEAIEKGEAEQSEEEADEE DKAEDAREEEYEPEKMEAEDYVMAVVDKAAEAGGAEEQYGFLTTPTKQLGAQSPGREPAS SIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEISSEPTPMDEMSTPRDVM SDETNNEETESPSQEFVNITKYESSLYSQEYSKPADVTPLNGFSEGSKTDATDGKDYNAS ASTISPPSSMEEDKFSRSALRDAYCSEVKASTTLDIKDSISAVSSEKVSPSKSPSLSPSP PSPLEKTPLGERSVNFSLTPNEIKVSAEAEVAPVSPEVTQEVVEEHCASPEDKTLEVVSP SQSVTGSAGHTPYYQSPTDEKSSHLPTEVIEKPPAVPVSFEFSDAKDENERASVSPMDEP VPDSESPIEKVLSPLRSPPLIGSESAYESFLSADDKASGRGAESPFEEKSGKQGSPDQVS PVSEMTSTSLYQDKQEGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPT QIDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGVAEDTYSHMEGVASVSTASVATSSFP EPTTDDVSPSLHAEVGSPHSTEVDDSLSVSVVQTPTTFQETEMSPSKEECPRPMSISPPD FSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEQSLAMDFSRQSPDHPTVGAGVLHITENG PTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTSSAHTPSQIAS PLQEDTLSDVAPPRDMSLYASLTSEKVQSLEGEKLSPKSDISPLTPRESSPLYSPTFSDS TSAVKEKTATCHSSSSPPIDAASAEPYGFRASVLFDTMQHHLALNRDLSTPGLEKDSGGK TPGDFSYAYQKPEETTRSPDEEDYDYESYEKTTRTSDVGGYYYEKIERTTKSPSDSGYSY ETIGKTTKTPEDGDYSYEIIEKTTRTPEEGGYSYDISEKTTSPPEVSGYSYEKTERSRRL LDDISNGYDDSEDGGHTLGDPSYSYETTEKITSFPESEGYSYETSTKTTRTPDTSTYCYE TAEKITRTPQASTYSYETSDLCYTAEKKSPSEARQDVDLCLVSSCEYKHPKTELSPSFIN PNPLEWFASEEPTEESEKPLTQSGGAPPPPGGKQQGRQCDETPPTSVSESAPSQTDSDVP PETEECPSITADANIDSEDESETIPTDKTVTYKHMDPPPAPVQDRSPSPRHPDVSMVDPE ALAIEQNLGKALKKDLKEKTKTKKPGTKTKSSSPVKKSDGKSKPLAASPKPAGLKESSDK VSRVASPKKKESVEKAAKPTTTPEVKAARGEEKDKETKNAANASASKSAKTATAGPGTTK TTKSSAVPPGLPVYLDLCYIPNHSNSKNVDVEFFKRVRSSYYVVSGNDPAAEEPSRAVLD ALLEGKAQWGSNMQVTLIPTHDSEVMREWYQETHEKQQDLNIMVLASSSTVVMQDESFPA CKIEL >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_3|7398_bp atgggcttggtaacagtgatggtggttggtggacaaaagatccttcatcaccgaagtgac gttttagaaacagtggtcctgatcaacccttctgatgaagcagtcagcaccgagtgtatt tacaaagttttgcaaccatcacccttatctaattctagaatacgtctatcacgccccaaa agccctgagctcatcagcagtcggtctccattcttccccgatgccgacaactgcgaacct gcctgtcctgatttgccaattctggacatttcatataaatataatgatatactatgtggc cttgcgtgtctggcttattttagcctaacagtttcaaggttcatccatgtcgtagcatgt gtgcgcttaatgatcactgatgctgcccgacacaagctgctcgtgctgaccgggcagtgc tttgaaaataccggagagctcattctccagtccggctctttctccttccagaacttcata gagattttcaccgatcaagagatcggggagttactaagcaccacccatcctgccaacaaa gccagcttaaccctgttctgtcctgaagaaggggactggaagaactccaatcttgacaga cacaatctccaagacttcatcaatattaaactcaattcagcttctatcttgccagaaatg gaaggactttctgagtttaccgagtatctctcagaatcagtggaagtcccatctcccttt gacatcttggaacctcccacatcgggtggatttctgaagctctccaagccctgctgttat atttttccaggagggaggggcgattctgccttgtttgcagtgaatggtttcaatatgctc atcaatggcggatcagagagaaaatcctgcttctggaagctcatccgacacttagaccga gtggactccatcctgctcacccacattggggatgacaatttgcctggaataaacagcatg ttacagcggaaaattgcagagctcgaggaagaacagtcccagggctccaccacaaatagt gactggatgaaaaacctcatctcccctgacttaggagttgtatttctcaatgtacctgaa aatctcaaaaatccagagccaaacatcaagatgaagagaagcatagaagaagcctgcttc actctccagtacctaaacaaattgtccatgaaaccagaacctctgtttagaagtgtaggc aatactattgatcctgtcattcttttccaaaaaatgggagtaggtaaacttgagatgtat gtgcttaatccagtcaagagcagcaaggaaatgcagtattttatgcagcagtggactggt accaacaaagacaaggctgaattcattctgcctaatggtcaagaagtagatctcccgatt tcctacttaacttcagtctcatctttgattgtgtggcatccagcaaaccctgcggagaaa atcatccgagtcctgtttcctgggaacagcacccagtacaacatcctggaagggttggaa aagctcaaacatctagactttctgaagcagccactggccacccaaaaggatctcactggc caggtgcccactcctgtggtgaaacaaacaaaactgaaacagagggctgatagccgagaa agtctgaagccagccgcaaaaccacttcctagcaaatccgtgcgcaaggagtcaaaagaa gaaacccctgaggtcacaaaagtgaatcacgtggaaaagccacccaaagttgaaagcaaa gaaaaggtaatggtgaaaaaagacaagccaataaaaacagagaccaaaccttcagtgact gaaaaggaggttcccagcaaagaagagccatctccagtgaaagccgaggtggctgagaag caagccacagatgtcaaacccaaagctgccaaggagaagacggtgaaaaaggaaacaaag gtaaagcctgaagacaagaaagaggagaaagaaaagccaaagaaagaagtggctaaaaag gaggacaaaacacctatcaagaaggaggaaaaaccaaaaaaggaagaggtgaaaaaagaa gtcaaaaaagagatcaagaaagaagagaaaaaagaacccaagaaagaggttaagaaagaa acaccgccaaaggaagtcaagaaggaagttaagaaggaagagaagaaggaagtgaaaaag gaagaaaaggaacccaaaaaagaaattaagaagctccctaaagacgcaaagaaatcatct actcctctgtctgaagcaaaaaaaccagctgctttaaaaccaaaagtacccaagaaggaa gagtctgtcaagaaagattctgttgctgccggaaagccaaaggagaaggggaaaataaaa gtcattaagaaggaaggcaaggccgcagaggctgtcgctgcagctgtcggcactggagcc accacagcagctgtcatggcggcagctggaatagcagccattggccctgccaaagaactc gaagctgagaggtcccttatgtcatctcctgaggatctaaccaaggactttgaagagtta aaggctgaagaggtcgatgtaacaaaggacatcaagcctcagctggagctaatcgaagac gaagagaaactgaaggaaactgagccagtcgaagcctacgtcatccagaaggagagagaa gtcaccaaaggtcctgccgagtcccctgatgagggaatcactaccactgaaggggagggc gaatgtgaacagacacctgaggagctggagcccgtcgagaagcagggagtagacgacatt gaaaaatttgaagatgaaggagccggttttgaagaatcttcagagactggagactatgaa gagaaggcagaaactgaggaggctgaggagccagaagaggatggggaggaacacgtatgt gtgagcgcctccaagcacagccccactgaggatgaggaaagtgccaaggcggaggctgat gcatacatcagggagaagagggagtctgtggccagtggggatgaccgagccgaagaagac atggatgaggccattgagaaaggagaggctgaacaatctgaagaggaggctgatgaggag gacaaagctgaagatgccagagaggaggaatatgagccggaaaaaatggaagctgaagac tatgtgatggctgtggtcgacaaggctgcagaggctggtggtgccgaggagcagtatgga ttcctcaccacaccaaccaagcaactaggagcccagtctcctggccgagaacctgcatct tcaattcatgatgagactttacctggaggctcagagagcgaggccaccgcttctgatgag gagaatcgagaagaccagcctgaggaattcactgccacctctggctacactcagtctact attgagatatccagtgagcccacccccatggatgagatgtctacccctcgagacgtgatg agtgatgagaccaacaatgaagagacggagtccccttctcaggaattcgtaaatatcacc aaatatgaatcttcattgtattctcaggaatactctaaacctgctgatgttacaccgctc aacggattttctgaaggatcaaaaacagatgccactgatggcaaggattacaatgcttca gcctctaccatatcaccaccctcttccatggaggaagacaaattcagcagatctgcttta cgtgatgcttactgctctgaagtgaaagccagcaccactttggacatcaaagatagcatc tcagctgtttcaagtgaaaaggtcagcccatcgaagagcccgtccctgagtccatctcca ccatcacccttagaaaagacccccctgggtgaacgtagtgtgaacttctctctgacgccc aatgagattaaagtctctgcagaggcagaagtagccccggtgtctcctgaggtgacccaa gaagtagttgaagaacattgtgctagtcctgaggacaagactctggaagtggtgtcacca tctcagtccgtgactggcagtgctggtcacacaccttactatcaatctcctactgacgag aaatccagtcatctccctacagaagtcattgaaaaaccaccagcagttccagtgagtttt gaattcagtgatgccaaagatgagaatgaaagggcttcagtaagccccatggatgagccc gtgcctgactcagagtctcctattgaaaaagttttgtctcctttacgcagcccgcccctc attggatccgagtctgcttatgaaagttttctaagtgctgatgacaaggcttctggcaga ggtgccgaaagtccttttgaagaaaagagtggaaaacaaggctctccagaccaagtaagt ccagtttctgaaatgacttctactagtctttaccaagacaaacaggaagggaaaagcaca gactttgcaccaataaaagaagactttggccaagaaaagaaaactgatgatgttgaagcc atgagttctcaaccagcactggctctggatgaaaggaaattaggagatgtttctcccaca caaatagatgtcagtcagtttggatcttttaaagaagacactaagatgtccatttctgaa ggtactgtctcagacaagtcagctactcctgttgatgagggcgtagcagaagacacgtac tctcatatggagggtgtggcctcagtgtccacagcctcagtggctacgagctcatttcca gagccaacaacagatgatgtgtctccatctctgcatgctgaggttggctccccacattcc acagaagtagatgactccctttcagtgtctgttgtgcaaacacctaccacattccaggaa acagaaatgtctccatctaaagaagaatgcccaagaccgatgtcaatttctccaccagat ttctcccctaaaactgcaaagtccaggacacccgttcaagatcacagatctgaacagtcc tcaatgtctattgaatttggccaagaatctcctgagcaatcccttgctatggacttcagt cgacagtctccagatcaccctacagtgggtgcaggcgtgcttcacatcactgaaaatggg ccaactgaagtggactacagtccttctgacatgcaggactccagtttatcacataagata ccacctatggaggagccgtcctacacccaagataatgatctttctgagctcatctcagta tctcaggtagaggcctccccgtccacctcttctgctcataccccttctcagatcgcttct cctctccaagaagatactctatccgatgttgctcctcccagagatatgtccttatatgcc tcactcacctctgaaaaagtgcaaagtctggaaggagagaagctctctccaaaatctgat atctctccactcaccccacgagagtcctctcctttatattcacctactttttcagattct acctctgcagtcaaagagaaaacagcaacttgccacagttcctcttctccaccaatagat gcagcatccgcagagccctatggcttccgtgcctcagtgttattcgatacaatgcaacac catctagccttgaatagagatttgtccacacctggcctggagaaggacagtggagggaag acacctggtgactttagctatgcctatcaaaagcctgaggaaacaaccaggtccccagat gaagaagattatgactatgagtcttatgagaagaccacccggacctcagatgtgggtggc tattactatgagaagatagagagaaccacaaaatctccaagtgacagtggctactcctat gagaccattgggaaaactaccaagacccctgaagatggtgactattcctatgaaattatt gagaagaccacacggacccctgaagagggtgggtactcatatgacataagtgaaaagacc accagcccccccgaagtgagtggttacagctatgaaaagactgagaggtctagaaggctt ctggatgacatcagcaatggctatgatgactctgaggatggtggccacacacttggggac cccagctactcttatgaaaccactgagaaaattaccagtttccctgagtctgaaggttat tcctatgagacatctacaaagacaacacgaacccctgatacttccacatactgttacgag actgcagagaaaatcactagaacccctcaggcatccacatattcctacgagacttcagac ctatgctacactgcagaaaagaagtccccctcagaagcccgtcaggatgtcgatttatgc ctcgtgtcctcttgtgaatacaagcaccccaagacagagctttcaccctctttcattaat cccaatcctcttgagtggtttgccagtgaagaacccactgaagaatctgaaaagcccctc actcaatcagggggagccccaccgcctccaggaggaaagcaacagggccgacagtgtgat gaaacccctcccacctcagtcagcgagtcagccccatcccagaccgactctgatgttccc ccggagactgaagagtgcccctccatcacggccgatgccaatatcgactctgaagacgag tcggaaaccatccccacagacaaaactgtcacgtacaaacacatggacccacctccagct cccgtgcaagaccgcagcccttcgccacgccaccctgatgtgtccatggtggacccagag gccttggccattgagcagaacctgggcaaagctctaaagaaagatctgaaagagaagacc aaaaccaaaaagccaggtacaaagaccaagtcatcttcacctgtcaaaaagagtgatggg aagtctaagcccttggcagcttcaccaaaaccagcgggcttgaaagaatcctcggataaa gtgtccagggtggcttctcctaagaagaaagaatctgtggaaaaggcagcaaaacccacc accactcctgaggtcaaagctgcacgtggggaagagaaagacaaggagaccaagaatgct gccaatgcctctgcatccaagtcggccaagaccgccactgcaggaccaggaactaccaag acgaccaagtcatctgctgtgcccccaggcctccctgtgtatttggacctgtgctacatt cctaaccacagcaatagtaagaatgttgatgtggaatttttcaagagagtgcggtcttcc tactacgtggtgagtgggaatgaccctgctgctgaggagcccagccgggctgtcctggac gctttgttggaaggaaaggctcagtggggcagcaacatgcaggtgacactgatcccaact catgactcagaagtgatgagggaatggtaccaggagacccatgagaaacagcaagatctc aacatcatggttttagcaagcagcagcacagtggttatgcaagatgaatccttccctgca tgcaagattgaactgtaa >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_4|70_aa MFPRGPGGTPLIEAIGNALGKGTDVGPPPEGFPHPIPIPPPFIFHRLYPPKTAFTVTSDS DADLLPGGPK >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_4|213_bp atgttccctcgagggccaggtggcacgccgctcatcgaggccattgggaatgcactgggg aagggcactgatgtaggacctccacctgaaggctttcctcacccaatcccaattccacct cctttcatttttcacaggctttatccccccaaaactgctttcactgttacctctgactct gatgctgatctgcttcctggggggcctaagtga >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_5|433_aa MLGTWHPHSLAVYLNQRLVGTELTQSSRFRHSPNCWYLRNWTIHTWIRQCLKYDAQDKAL YTLVNKVQYGIFPDNFTFNLLMDSFIKKENYKDALSVVFEVMMQEAFEVPSTQLLSLYVL FHCLAKKTDFSRPVQGNLSLVFYVLKPELGLQLCPLCTHVAPRRSQIPLGSCQGSPLAAL PGSPPVIVVTSLQQWEEERNFGASLLLPGLKQKNSVGFSSQLYGYALLGKVELQQGLRAV YHNMPLIWKPGYLDRALQVMEKVAASPEDIKLCREAPFFTMCFWAIKHLVADPLSAPQLD VLGAVLKALTSADGASEEQSQNDEDNQGSEKLVEQLDIEETEQSKLPQYLERFKALHSKL QALGKIESEGLLSLTTQLVKEKLSTCEAEDIATYEQNLQQWHLDLVQLIQREQQQREQAK QEYQAQKAAKASA >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_5|1302_bp atgcttggcacatggcatccacactccttagctgtctacctcaaccagaggctagtgggc actgagctgacacaaagctcaaggtttcgacacagccccaactgctggtacctgagaaac tggactatccacacctggattaggcagtgtctaaaatatgatgcacaagacaaagcccta tatacccttgtaaataaggttcaatatggaatttttccagataactttacattcaattta ctgatggattctttcataaagaaagaaaattacaaagatgctttatctgtggtttttgag gtcatgatgcaagaagcctttgaagtgccttccacccaacttctctccctctatgtttta tttcattgcctggcaaagaagacagacttcagtagaccagtgcagggcaacctgagctta gtgttctatgttttgaagccagagttagggctccagctctgtccactgtgtactcatgtg gcaccaagaagatcacagatcccactggggagttgccaaggctctcccctggccgcattg ccaggaagtcctccagtcatcgttgttactagtctgcagcagtgggaagaggagaggaac tttggtgcatcccttttgcttccaggcctaaaacaaaagaactcagtgggtttcagttcc cagttgtatggctatgcacttcttgggaaggtggagttgcagcaagggctacgggctgtg taccacaacatgcctctgatatggaaaccaggctaccttgacagagcccttcaagtgatg gagaaagtggctgcctccccagaagacataaagctgtgtagagaagcgcctttcttcacc atgtgcttttgggccataaaacatttggttgctgaccctctttctgctccacagctcgat gtgctgggtgcagtgctgaaggctctgacttcagctgatggggcttcagaggagcagtcc caaaatgatgaagacaaccaggggtcagaaaaactggtggagcagttagacatcgaggaa acagagcagtccaagcttcctcaatacctggaacgatttaaggccttacattctaagctt caagctctgggcaaaattgagtcagaaggtcttttaagtctgaccacccagcttgtcaag gaaaaactctccacctgtgaagcagaggacatcgccacctatgagcagaatctgcagcag tggcatctagaccttgtacagttgatccagagagaacagcaacagagggagcaagcgaag caggagtaccaggctcagaaagcagcaaaggcatctgcctaa >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_6|427_aa MMFTSLFYNWELQIQEAINITRKTQGMLTVLIEKEAWLCYENKENDFSCKYEYRLCPVSL PWNLKIVLETPDTEWVKKKDSNDYNRNCVACKAHIIYYLALYRKFVNPWLAANPGPGDMQ AAICLGPAVPDLLSISNQGYLGSARTFAATTGKEHDFLPHQGLGDGDIKIKRKCSCLAPC QRWSPDRFTRFVVIHQCLYYDLHTFQYVYYTPIKKYFKEEIDYFNVGGRSYGRISGPRQL HWRPPRPRRWETALTLGPSPRSSLAQSATRTKRGGTGTASPLREARTSLSPYSPAGHGSG ETALWHMKPTPTRRSVPCCLDLQWGWERRKPHIGPPPSPNQRARFEHLTALRLAAPNHSA PHAGRGPARARYPTAPRPLAAFHPRGGGFPQGPHCHLLSHLQSWSHGRSPYHAASLLSAS SSTQTLK >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_6|1284_bp atgatgttcacatctctgttttacaactgggaattgcaaatccaagaagccattaatata actcgtaaaacacaaggaatgttgactgtcttaattgaaaaagaagcctggctctgttac gaaaacaaggaaaatgacttcagctgcaagtacgagtataggctgtgtcctgtgagtctg ccatggaacctgaagatcgtactggagactcctgacacagaatgggtgaagaaaaaagat agcaacgactacaacagaaactgtgtggcctgcaaagcccacattatttactatttggcc ctttacagaaagtttgtcaacccctggctggctgccaacccaggacctggagacatgcag gctgccatctgccttggcccagctgttcctgacctgctatccatctccaaccagggatac ctgggatctgccagaacctttgctgctaccactgggaaagagcatgacttcttacctcac caaggactgggtgatggtgatatcaaaataaagcggaaatgctcctgtctagcaccctgc cagcgctggtctcctgacaggtttactcgctttgtggtaattcatcaatgtctatattac gacttgcacacttttcagtatgtatattatactccaataaaaaagtattttaaagaagaa atcgactactttaatgtaggcggaagatcctatggtagaatctcaggtcctagacagctg cattggcgtccaccgcggccccggcgctgggaaactgccctcacgctagggccctccccg aggagctctctggcacaatcagccacgaggacgaagagagggggcaccggaaccgcaagc ccactccgagaggccagaacctctctctcgccgtactccccggcagggcacggctcaggg gagacggcgctctggcacatgaagcccacacccacacgccgcagtgtcccttgctgcctc gatctccaatggggatgggagcggcgaaagcctcatatcggccctcctccgtcccccaat cagcgagcccgctttgagcacctcacggccctccggctggcggcgcccaatcacagcgct ccacatgctgggcgcggccctgcgcgcgcacgctacccgacagccccgcgcccactcgcc gcgttccacccgcggggcggcggattcccgcaggggccacactgccacctactgtcccat ctccagagctggagccacgggcgctctccctaccatgcggcctccctgctaagtgccagt agctcaacccaaacgctcaaataa >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_7|688_aa MAKEEYDKVLVIRRLPWASYRQNETQSDSRNVSSPLESPPLPSPKKQNNQGPNEAVRPTC RRELRKNHLPRQEHPAPHYGGSHLGAYQKEQPTGYGQRKGRVVGMVRDSMAAAFRPSNRV LLQALQILVYPGVGGSGSVSCRCPLGAKRYLLTDNVVKLKEFQQKKVAVACNLSGTKETY FRNLKKKLTQNKLILKGELITLLHLCESRDHVELAKNVIYRYHAENKNFTLGEYKFGPLF VRLCYELDLEESAVELMKDQHLRGFFSDSTSFNILMDMLFIKGKYKSALQVLIEMKNQDV KFTKDTYVLAFAICYKLNSPESFKICTTLREEALLKGEILSRRASCFAVALALNQNEMAK AVSIFSQIMNPESIACINLNIIIHIQSNMLENLIKTLKNAAEGNLSKFVKRHVFSEEVLA KVREKVKDVPALVAKFDEIYGTLHITGQVTTDSLDAVLCHTPRDRKSHTLLLNKRMSTYG SEGPASSELLPFLRSQVSHFSRQCADTWSSPEIPKFTEWLRKRSMQERPNLNCRLCPKTR AQEEERTDALVQDDFFMGIDRYILENHPKYLEETLVQLCIVQQLMRKVTRTLGAVHLLPK CVSQCPASYAAFDMSKVPQGVSPSLNIRFLIYTPNSLYTSGSCPSLNKAGKFPSLFMHNE NIVARVDEVKSTINFLTEVLCLAMTNAM >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_7|2067_bp atggcaaaagaagagtatgataaagtacttgttataagaaggctaccctgggcctcttat agacagaatgagactcaatcagattccagaaacgtttcctctcctcttgaaagcccaccg ctcccttcacccaaaaaacagaataatcaggggcctaatgaagctgtccggccaacctgc aggagagagctgaggaagaaccacttgccgcgccaggagcatcccgcgccgcactatgga ggcagccatcttggagcgtaccaaaaggaacagccaacgggttacggacagaggaagggg cgggtagttggtatggtccgagacagtatggctgctgcatttcggccctcgaatcgagtt ctcctgcaggcgctgcagattttggtgtatcctggggtgggaggctccggctctgtcagc tgccgctgccctctcggagctaaaagatacctacttacagataatgtggtgaaattaaaa gaatttcaacaaaagaaagtggctgttgcatgtaatctttctggcactaaagaaacgtat tttagaaacttgaaaaagaaactgacccagaacaagctcatcttgaagggggagttgata accttactacatttgtgtgagtctcgggaccatgtggaactggctaaaaatgtcatttac aggtaccatgcagagaacaaaaatttcactttgggggagtataaatttggaccgcttttt gtgaggttgtgttacgagttggatctcgaggaatctgcagtggagctcatgaaagaccag catttacgaggtttcttctcagactccacatcattcaatattttgatggatatgttattt atcaaaggcaaatataaaagtgctttgcaagtattgatagagatgaaaaaccaagatgtg aagttcaccaaagatacctatgttcttgcttttgcaatttgctacaaactgaatagccct gagtctttcaaaatctgtactacattaagagaagaagctctactcaaaggagaaattctc tccaggagagcatcctgtttcgctgtggcattagctctgaatcagaatgagatggcaaaa gctgtgtccattttttctcaaatcatgaatccagaaagcatagcctgcattaatttaaat attataatccatatccagtcaaatatgttggaaaacctgataaagactctaaaaaatgct gcagaaggaaatttatcaaaatttgtgaaaagacatgtgttctcggaggaagtgctggcc aaagtgagggaaaaagtgaaggatgtgcctgcccttgtggccaaatttgatgagatctat gggacactgcacatcactggccaggtcaccactgattctttggatgctgtgctctgccac acccccagggacaggaaatctcacacgttgctattaaacaagaggatgtccacctatgga tctgaggggcctgcttctagtgagttattacctttcctaagaagccaggtatcgcacttc agcagacagtgtgctgacacttggtcttctcctgaaattcccaaattcactgaatggtta aggaaacgatccatgcaagagaggcccaatctcaactgtagactgtgtcccaagaccaga gctcaggaggaagaaaggacagatgcccttgttcaagatgacttcttcatgggcatcgat agatacattcttgaaaaccacccaaagtacctagaggagaccctagtgcaactgtgcatt gtacaacaactgatgagaaaggtgacaaggacacttggagctgtccatttgctccccaaa tgtgtctcccaatgcccagcatcttatgcagcctttgacatgagtaaagtccctcaaggt gtctcaccttctctgaacatcaggttcctcatctatacaccaaattctctatacacatca ggttcatgccccagcctgaataaagctggcaaattcccttccctgttcatgcacaatgaa aacatcgtggccagagttgatgaggtgaagtccacaatcaacttcctgacggaggtgctg tgtctggccatgaccaatgccatgtga >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_8|91_aa MWRIEHSGKGDQVRQWSRRERTWTQGRAVAMEGFERHLGKIIDGAQSLIAWRIFQALFGD CESTDGCCLTGTGEIYRRQNVAIVVGQEAKI >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_8|276_bp atgtggagaatagaacactcgggaaaaggagaccaggtgaggcagtggtctaggcgagag aggacgtggacccaaggtagagcagtggccatggaaggatttgagaggcacttaggaaag ataatcgatggagcccagagcctcattgcatggaggatttttcaggctttatttggagat tgtgagtccacagatgggtgctgtctaactggaactggagaaatttacagacgacagaat gtagcaatagttgtaggacaagaggcaaaaatctga >gi568815593r:72120912_72420221|GENSCAN_predicted_peptide_9|206_aa MEKVEACLSAGSLNIPFQKANWHLVSHSELKGRKTPGSGASFFYQNIAGSPLSALRRPIP PAFLGLYLLSSARCVCSPTAFLAFQVLETSSIIATSSGVNLACGLYCVLSLHKLSCSDDR CHMCLKTSELDGQEGWGFISGNDTKEDVFCTADCHKAAAAQESPSQRGRGELWSLVLKEQ RVRRRQMPRALVAVQYEAVHAATATL >gi568815593r:72120912_72420221|GENSCAN_predicted_CDS_9|621_bp atggaaaaggtggaagcttgtctgtcagcagggtcactaaatattccattccagaaagcc aactggcacttggtcagccattcagagttgaaaggccgcaagacacctggatcaggtgca tcctttttctaccagaacattgctgggtccccactctctgctctaagacgccccatacca cctgctttcctgggcctttacctattgtcatctgcaagatgtgtctgttctccaactgcc ttcctggccttccaggtgctagaaacttcctccatcatagccacttcgtcgggggttaat ctggcctgtggattatactgtgtcctgtccctccacaaactgagttgtagcgatgacaga tgtcacatgtgccttaaaacttctgaattagatggacaagaaggatggggtttcatcagc gggaatgacaccaaggaagatgtattttgtacagcagactgccataaagcagcagcagct caggaatcaccttcgcagcgtggtcgtggagagctgtggagtttggtgttgaaggagcaa agggtgcggaggcggcaaatgccacgggccctggtggcggtccagtacgaggcagtacat gcagcgaccgcaaccctttga