GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:37:42 Sequence gi568815593r:138057211_138277667 : 220457 bp : 44.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 26880 26947 68 2 2 83 119 50 0.998 6.20 1.02 Intr + 27040 27073 34 1 1 116 97 24 0.999 4.33 1.03 Intr + 27288 27426 139 2 1 68 103 129 0.998 12.44 1.04 Intr + 30596 30721 126 0 0 88 111 113 0.995 14.25 1.05 Intr + 31717 31859 143 2 2 25 68 115 0.859 3.27 1.06 Intr + 33318 33846 529 2 1 36 82 367 0.507 23.41 1.07 Term + 36625 36833 209 2 2 80 41 99 0.477 1.90 1.08 PlyA + 37527 37532 6 1.05 2.31 PlyA - 38305 38300 6 1.05 2.30 Term - 39320 39255 66 2 0 94 32 77 0.134 0.64 2.29 Intr - 53602 53496 107 2 2 120 26 21 0.007 -0.97 2.28 Intr - 71369 71269 101 1 2 41 113 51 0.586 2.65 2.27 Intr - 72258 72053 206 0 2 92 91 29 0.580 1.60 2.26 Intr - 81543 81442 102 0 0 30 93 107 0.691 5.77 2.25 Intr - 83672 83495 178 1 1 78 84 196 0.939 18.12 2.24 Intr - 86860 86749 112 2 1 44 -2 115 0.588 -2.46 2.23 Intr - 87163 86941 223 1 1 38 101 91 0.398 3.10 2.22 Intr - 88035 87942 94 2 1 89 81 59 0.534 5.27 2.21 Intr - 88668 88579 90 2 0 103 96 20 0.833 3.41 2.20 Intr - 92587 92430 158 1 2 84 61 108 0.957 6.51 2.19 Intr - 93798 93535 264 0 0 42 94 75 0.675 1.21 2.18 Intr - 95550 95272 279 0 0 103 77 173 0.786 15.37 2.17 Intr - 102963 102859 105 0 0 99 100 116 0.966 14.31 2.16 Intr - 103858 103681 178 0 1 124 80 152 0.999 17.92 2.15 Intr - 104654 104586 69 1 0 93 83 81 0.988 6.40 2.14 Intr - 106124 105920 205 0 1 -8 62 254 0.323 11.46 2.13 Intr - 107203 107110 94 1 1 85 99 23 0.684 2.64 2.12 Intr - 107956 107504 453 1 0 88 65 420 0.999 33.16 2.11 Intr - 108898 108618 281 2 2 73 88 393 0.999 35.00 2.10 Intr - 109517 109308 210 0 0 71 96 72 0.408 5.18 2.09 Intr - 110868 110724 145 0 1 95 111 -4 0.869 2.46 2.08 Intr - 112148 112012 137 0 2 72 84 53 0.777 3.59 2.07 Intr - 113199 113135 65 2 2 56 98 90 0.997 5.16 2.06 Intr - 113702 113622 81 1 0 100 42 65 0.783 1.85 2.05 Intr - 113950 113828 123 0 0 75 48 134 0.995 7.80 2.04 Intr - 114200 114151 50 2 2 89 87 30 0.900 0.48 2.03 Intr - 114924 114855 70 2 1 67 106 40 0.895 2.88 2.02 Intr - 120457 120361 97 2 1 99 87 65 0.904 6.57 2.01 Init - 121404 121386 19 0 1 115 73 30 0.930 2.74 2.00 Prom - 121872 121833 40 -9.16 3.00 Prom + 122244 122283 40 -13.52 3.01 Init + 122471 122635 165 1 0 91 103 137 0.987 15.13 3.02 Intr + 124212 124301 90 2 0 53 111 35 0.829 2.49 3.03 Intr + 124399 124518 120 0 0 117 80 109 0.905 13.69 3.04 Intr + 125113 125251 139 0 1 56 113 73 0.999 6.64 3.05 Intr + 125376 125563 188 1 2 108 88 159 0.996 17.31 3.06 Intr + 125651 125780 130 1 1 101 80 151 0.992 15.87 3.07 Intr + 125959 126153 195 2 0 77 98 173 0.998 16.59 3.08 Intr + 126260 126371 112 0 1 81 109 150 0.999 15.84 3.09 Intr + 126752 126895 144 2 0 81 105 107 0.996 11.10 3.10 Intr + 127029 127194 166 0 1 85 94 155 0.997 15.76 3.11 Intr + 127302 127466 165 2 0 79 78 138 0.993 12.06 3.12 Intr + 127597 127736 140 0 2 110 79 137 0.959 14.26 3.13 Intr + 127885 127987 103 1 1 100 109 97 0.999 13.18 3.14 Intr + 128302 128500 199 0 1 59 100 138 0.999 11.02 3.15 Intr + 128751 128842 92 1 2 97 86 110 0.999 11.41 3.16 Intr + 129084 129221 138 2 0 119 66 38 0.974 5.36 3.17 Term + 129886 130203 318 0 0 70 48 306 0.977 19.68 3.18 PlyA + 130715 130720 6 1.05 4.15 PlyA - 130945 130940 6 1.05 4.14 Term - 131938 131768 171 1 0 80 33 68 0.717 -1.67 4.13 Intr - 132542 132423 120 2 0 122 76 33 0.830 6.19 4.12 Intr - 134727 134652 76 2 1 104 77 85 0.997 8.52 4.11 Intr - 135178 135059 120 0 0 84 111 56 0.989 7.11 4.10 Intr - 135447 135294 154 1 1 55 28 90 0.960 -1.17 4.09 Intr - 141070 140989 82 1 1 112 91 23 0.975 4.21 4.08 Intr - 141313 141216 98 2 2 103 84 40 0.994 4.83 4.07 Intr - 141572 141395 178 2 1 68 105 46 0.979 3.79 4.06 Intr - 144029 143897 133 1 1 139 54 108 0.999 13.15 4.05 Intr - 144238 144133 106 2 1 71 84 105 0.998 7.67 4.04 Intr - 144945 144903 43 0 1 71 68 60 0.736 0.11 4.03 Intr - 149474 149337 138 2 0 88 81 133 0.999 13.26 4.02 Intr - 155853 155781 73 2 1 74 94 -2 0.975 -1.59 4.01 Init - 156102 155942 161 0 2 87 81 213 0.996 17.81 4.00 Prom - 172086 172047 40 -6.26 5.10 PlyA - 172214 172209 6 1.05 5.09 Term - 193077 193066 12 0 0 140 53 -4 0.741 -0.50 5.08 Intr - 196165 196077 89 2 2 103 91 11 0.864 2.59 5.07 Intr - 196690 196556 135 2 0 129 70 140 0.997 16.84 5.06 Intr - 196950 196847 104 2 2 28 73 78 0.960 0.12 5.05 Intr - 200741 200429 313 0 1 128 89 528 0.990 52.14 5.04 Intr - 202439 202347 93 0 0 99 64 101 0.990 8.74 5.03 Intr - 207220 207051 170 0 2 79 73 128 0.567 9.99 5.02 Intr - 216848 216695 154 0 1 60 60 54 0.557 -1.07 5.01 Init - 217214 217124 91 2 1 94 117 256 0.989 27.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:138057211_138277667|GENSCAN_predicted_peptide_1|415_aa MGNLFMLWAALGICCAAFSASAWSVNNFLITGPKAYLTYTTSVALGAQSGIEECKFQFAW ERWNCPENALQLSTHNRLRSATRETSFIHAISSAGVMYIITKNCSMGDFENCGCDGSNNG KTGGHGWIWGGCSDNVEFGERISKLFVDSLEKGKDARALMNLHNNRAGRLAVRATMKRTC KCHGISGSCSIQTCWLQLAEFREMGDYLKAKYDQALKIEMDKRQLRAGNSAEGHWVPAEA FLPSAEAELIFLEESPDYCTCNSSLGIYGTEGRECLQNSHNTSRWERRSCGRLCTECGLQ VEERKTEVISSCNCKFQWCCTVKCDQCRHVVSKYYCARSPGSAQSLELSVTPTNLPTWTL CQKQQEFGFLYIHRLPAKDSFQGNTASFRFVSYSPISLPFWFILNKLAIIKVTEQ >gi568815593r:138057211_138277667|GENSCAN_predicted_CDS_1|1248_bp atggggaacctgtttatgctctgggcagctctgggcatatgctgtgctgcattcagtgcc tctgcctggtcagtgaacaatttcctgataacaggtcccaaggcctatctgacctacacg actagtgtggccttgggtgcccagagtggcatcgaggagtgcaagttccagtttgcttgg gaacgctggaactgccctgaaaatgctcttcagctctccacccacaacaggctgagaagt gctaccagagagacttccttcatacatgctatcagctctgctggagtcatgtacatcatc accaagaactgtagcatgggtgacttcgaaaactgtggctgtgatgggtcaaacaatgga aaaacaggaggccatggctggatctggggaggctgcagcgacaatgtggaatttggggaa aggatctccaaactctttgtggacagtttggagaaggggaaggatgccagagccctgatg aatcttcacaacaacagggccggcagactggcagtgagagccaccatgaaaaggacatgc aaatgtcatggcatctctgggagctgcagcatacagacatgctggctgcagctggctgaa ttccgggagatgggagactacctaaaggccaagtatgaccaggcgctgaaaattgaaatg gataagcggcagctgagagctgggaacagcgccgagggccactgggtgcccgctgaggcc ttccttcctagcgcagaggcggaactgatctttttagaggaatcaccagattactgtacc tgcaattccagcctgggcatctatggcacagagggtcgtgagtgcctacagaacagccac aacacatccaggtgggagcgacgtagctgtgggcgcctgtgcactgagtgtgggctgcag gtggaagagaggaaaactgaggtcataagcagctgtaactgcaaattccagtggtgctgt acggtcaagtgtgaccagtgtaggcatgtggtgagcaagtattactgcgcacgctcccca ggcagtgcccagtccctggagctatcagtcacaccaaccaatcttcccacatggaccctg tgccagaaacaacaggaatttggatttctctacatccatcgccttcctgccaaggattca ttccaaggcaacacagcctcattcagatttgtcagttacagccctatatcccttcccttt tggttcatccttaacaagctggctatcattaaggtgacagaacagtaa >gi568815593r:138057211_138277667|GENSCAN_predicted_peptide_2|1453_aa MATGTGKHKLLSTGPTEPWSIREKLCLASSVMRSGDQNWVSVSRAIKPFAEPGRPPDWFS QKHCASQYSELLETTETPKRKRGEKGEVVETVEDVIVRKLTAERVEELKKVIKETQERYR RLKRDAELIQAGHMDSRLDELCNDIATKKKLEEEEAEVKRKATDAAYQARQAVKTPPRRL PTVMVRSPIDSASPGGDYPLGDLTPTTMEEATSGVNESEMAVASGHLNSTGVLLEVGGVL PMIHGGEIQQTPNTVAASPAASGAPTLSRLLEAGPTQFTTPLASFTTVASEPPVKLVPPP VESVSQATIVMMPALPAPSSAPAVSTTESVAPVSQPDNCVPMEAVGDPHTVTVSMDSSEI SMIINSIKEECFRSGVAEAPVGSKAPSIDGKEELDLAEKMDIAVSYTGEELDFETVGDII AIIEDKVDDHPEVLDVAAVEAALSFCEENDDPQSLPGPWEHPIQQERDKPVPLPAPEMTV KQERLDFEETENKGIHELVDIREPSAEIKVEPAEPEPVISGAEIVAGVVPATSMEPPELR SQDLDEELGSTAAGEIVEADVAIGKGDETPLTNVKTEASPESMLSPSHGSNPIEDPLEAE TQHKFEMSGEDEEEDGVSEAASLEEPKEEDQGEGYLSEMDNEPPVSESDDGFSIHNATLQ SHTLADSIPSSPASSQLYANVFLQPVTDDIAPGYHSIVQRPMDLSTIKKNIENGLIRSTA EFQRDIMLMFQNAVMYNSSDHDVYHMAVEMQRDVLEQIQQFLATQLIMQTSESGISAKSL RGRDSTRKQDASEKMGHEWVWLDSEQDHPNDSELSNDCRSLFSSWDSSLDLDVGNWRETE DPEAEELEESSPEREPSELLVGDGGSEESQEAARKASHQNLLHFLSEVAYLMEPLCISSN ESSEGCCPPSGTRQEGREIKASEGERELCRETEELSAKGDPLVAEKPLGENGKPEVASAP SVICTVQGLLTESEEGEAQQESKGEDQGEVYVSEMEDQPPSGECDDAFNIKETPLVDTLF SHATSSKLTDLSQDDPVQDHLLFKKTLLPVWKMIASHRFSSPFLKPVSERQAPGYKDVVK RSVKKGSRKFLKMVCPEFVPSDVQMCPEFLPPGGFVVSLSSGVKLQTFTASVTALKGGAS GVVRSSQWVGGLAGFRSEAADLGGAKRLTFTVSVTVHKGSADPKSEEQQDLLRSERTKPP QPMDLTSLKRNLSKGRIRTMAQFLRDLMLMFQNAVMYNDSDHHVYHMAVEMRQEVLEQIQ IYVEKTLAIIKPDIVDKEEEIQDIILRSGFTIVQRRKLRLSPEQCSNFYVEKYGKMFFPN LTAYMSSGPLVAMILARHKAISYWLELLGPNNSLVAKETHPDSLRAIYGTDDLRNALHGS NDFAAAEREIRFMFPEDQIVFIAYPLPALSKAPLPSAPEGIPSSGVLGSSCRKLKFMYLK TVWEKETTGEKLK >gi568815593r:138057211_138277667|GENSCAN_predicted_CDS_2|4362_bp atggcgacgggaacgggcaaacacaagctgctaagcactggccccacagagccatggtcc atccgagagaagctatgtttagcatcttctgtcatgagaagtggcgatcaaaattgggta tcagttagcagagcaatcaagccctttgcagaacctggccgccctccagactggttctct caaaaacattgtgcttcccagtactcggagcttttagagaccactgagacaccaaaacgg aaacgaggtgaaaagggagaagtggtggaaactgttgaagatgttattgttcggaaattg actgctgagcgagttgaagaactaaagaaagtgataaaggaaacccaggagagatataga cggctaaagagagatgcagaactaattcaagctggacacatggacagcagactggatgag ctttgcaatgacattgcaacgaaaaagaaattggaagaagaggaggctgaagtaaagagg aaggctacagatgctgcataccaggctcgtcaagcagtaaaaacacccccccggaggtta cccactgtgatggttcgctctcctatagattctgcctccccaggaggtgattatccactt ggggacttgactccaaccactatggaagaggctacctctggggtcaatgagagtgaaatg gctgtggcttctggccacctgaacagtacaggtgtcctcctggaggtaggcggggtcctt cccatgatacatggtggggagatacagcaaacacccaatactgttgcagcctcccctgct gcatcaggtgctcccactctttcccggcttttagaagctggtcctacacagttcaccaca cctcttgcttccttcactactgttgccagtgagcctccagttaaacttgtgccaccccct gtagagtctgtgtcccaagctaccattgtcatgatgcctgcgctgccagcaccatcctct gctccggctgtctccactactgaaagtgtagctccagtgagtcaacccgacaactgtgtt cccatggaggctgtgggggatccacatactgtgactgtttccatggacagcagtgaaata tccatgatcatcaattctatcaaagaagagtgttttcgatcaggggtagcagaggctcct gttggatcaaaggctcccagcatagatgggaaggaagaattagatctggctgagaagatg gatattgctgtgtcttacacaggtgaagagctggattttgagactgttggagacatcatt gccatcattgaggacaaggtagatgatcatcctgaagtgctggatgtggcagcagtggaa gcagcactgtcattttgtgaagaaaatgatgatcctcagtccctgcctggcccctgggag catcctatccagcaggagcgggacaagccagtacctctccctgcaccagaaatgacggtc aagcaagagagactggactttgaggaaacggaaaacaagggaatacatgaactggtggac atcagggagcccagtgcagagatcaaggtggaacctgcagaaccagagccagtcatttca ggagccgaaatagtagctggagttgttccagccacaagtatggagccaccagaactcagg agtcaggacttagatgaggaactgggaagtactgcagctggagagattgttgaagcagat gttgccattgggaaaggcgatgagactccacttacaaatgtgaagacagaggcatcccct gaaagcatgttgtctccatcacatggctcaaatcccattgaagatcctttagaggcagag actcagcacaagtttgaaatgtcaggtgaggatgaggaggaagatggtgtcagtgaagcg gccagcctagaggagcctaaggaagaggatcaaggagaaggctacttgtcagaaatggat aatgaacctcctgtgagcgagagtgatgatggcttcagcatacacaatgctacactgcag tcacacacactggcagactccatccccagcagccctgcttcttcacagttgtatgccaat gtcttcctgcagcctgttacagatgacatagcacctggctaccacagcattgtgcagagg cctatggatttgtcaactattaagaaaaacatagaaaatggactgatccgaagcacagct gaatttcagcgtgacattatgctgatgtttcagaatgctgtaatgtacaatagctcagac catgatgtctatcacatggcagtggagatgcagcgagatgtcttggaacagatccagcaa ttcttggccacgcagttgattatgcaaacatccgagtctgggatcagtgctaaaagtctt cgagggagagattctacccgcaaacaggatgcttcagagaagatgggacacgagtgggtt tggctggattctgaacaagatcatcccaatgactctgagttgagcaatgactgcaggtcc ctcttcagctcatgggactccagtctggatcttgatgtgggcaactggagggaaactgag gatccagaggctgaggaactagaggaaagcagcccggagagagaacctagtgaactgctt gttggggatggaggcagtgaggaatctcaggaagcggcaaggaaagccagccaccagaac ctcctccactttctctctgaggtagcttatttaatggagcccttgtgcatcagcagcaac gaatcaagtgaaggctgctgccctccatctggtaccagacaagaaggaagggaaattaaa gctagcgaaggagaaagggagctctgcagagagactgaagagctttcagctaaaggagac cccttagtagctgaaaagccactgggagaaaatggaaagccagaggtggcttcagctccc tcagttatttgtacagttcagggactactcacagagagtgaagagggggaggctcagcaa gaatccaaaggggaggaccagggtgaagtatatgtgtcagagatggaagaccagccccct tcaggcgagtgtgatgatgcctttaacattaaggagactcccttggtggatacacttttc agccatgctacctcctcaaagctgactgatctaagccaggatgaccctgttcaggatcat ttgctatttaagaagactctcctgccagtctggaagatgattgccagtcacaggttcagc agtccatttctgaagcctgtgtcagaaaggcaggccccagggtacaaggatgtggtgaaa aggtcagtgaaaaaggggtcaagaaagttcttaaagatggtgtgtccagagtttgttcct tcggatgttcagatgtgtccggagtttcttcctcccggtgggtttgtggtctcgctgagt tcaggagtgaagctgcagaccttcacagcgagtgttacagctcttaaaggtggcgcctct ggagttgttcgttcctcccagtgggttggtggtctcgctggcttcaggagtgaagctgca gaccttggcggagcgaagcggctgaccttcacggtgagtgttacagttcataaaggtagt gcggacccaaagagtgaggagcagcaagatttattgcggagtgaaagaacaaagcctcca caacccatggacttaactagcctgaagagaaatctctctaagggtcggattcgcaccatg gcccaattcctgcgagacctgatgctgatgttccaaaatgctgtaatgtacaatgactct gatcatcatgtataccatatggctgtggagatgcggcaagaagtcctggagcagattcag atatatgtagaaaaaactctggccattatcaaaccagatattgttgacaaagaggaggag atacaagatattattcttagatccggattcaccattgttcagagaagaaaactacgcctc agccctgagcaatgtagtaacttttatgtggaaaagtatggaaaaatgtttttccccaac ttaacagcttacatgagttctggaccacttgtcgccatgatattagctagacataaagcc atctcttattggttagaacttttgggaccaaataatagcttagtagcgaaggagacacat ccagacagtctgagggcaatttatggcacagatgacctaaggaatgcacttcatgggagt aatgactttgctgctgcggaaagagaaatacgttttatgtttcctgaagaccaaattgtt tttattgcctaccccctgccggcactttccaaagcaccactgccatcagctccagaggga atcccttcctcaggcgtcctgggcagcagctgcaggaaactcaagttcatgtatctgaag acagtctgggaaaaagagacaactggtgaaaaactgaagtaa >gi568815593r:138057211_138277667|GENSCAN_predicted_peptide_3|867_aa MSQGILSPPAGLLSDDDVVVSPMFESTAADLGSVVRKNLLSDCSVVSTSLEDKQQVPSED SMEKVKVYLRVRPLLPSELERQEDQGCVRIENVETLVLQAPKDSFALKSNERGIGQATHR FTFSQIFGPEVGQASFFNLTVKEMVKDVLKGQNWLIYTYGVTNSGKTHTIQGTIKDGGIL PRSLALIFNSLQGQLHPTPDLKPLLSNEVIWLDSKQIRQEEMKKLSLLNGGLQEEELSTS LKRSVYIESRIGTSTSFDSGIAGLSSISQCTSSSQLDETSHRWAQPDTAPLPVPANIRFS IWISFFEIYNELLYDLLEPPSQQRKRQTLRLCEDQNGNPYVKDLNWIHVQDAEEAWKLLK VGRKNQSFASTHLNQNSSRRLSLCDLAGSERCKDQKSGERLKEAGNINTSLHTLGRCIAA LRQNQQNRSKQNLVPFRDSKLTRVFQGFFTGRGRSCMIVNVNPCASTYDETLHVAKFSAI ASQLVHAPPMQLGFPSLHSFIKEHSLQVSPSLEKGAKADTGLDDDIENEADISMYGKEEL LQVVEAMKTLLLKERQEKLQLEMHLRDEICNEMVEQMQQREQWCSEHLDTQKELLEEMYE EKLNILKESLTSFYQEEIQERDEKIEELEALLQEARQQSVAHQQSGSELALRRSQRLAAS ASTQQLQEVKAKLQQCKAELNSTTEELHKYQKMLEPPPSAKPFTIDVDKKLEEGQKNIRL LRTELQKLGESLQSAERACCHSTGAGKLRQALTTCDDILIKQDQTLAELQNNMVLVKLDL RKKAACIAEQYHTVLKLQGQVSAKKRLGTNQENQQPNQQPPGKKPFLRNLLPRTPTCQSS TDCSPYARILRSRRSPLLKSGPFGKKY >gi568815593r:138057211_138277667|GENSCAN_predicted_CDS_3|2604_bp atgtcgcaagggatcctttctccgccagcgggcttgctgtccgatgacgatgtcgtagtt tctcccatgtttgagtccacagctgcagatttggggtctgtggtacgcaagaacctgcta tcagactgctctgtcgtctctacctccctagaggacaagcagcaggttccatctgaggac agtatggagaaggtgaaagtatacttgagggttaggcccttgttaccttcagagttggaa cgacaggaagatcagggttgtgtccgtattgagaatgtggagacccttgttctacaagca cccaaggactcttttgccctgaagagcaatgaacggggaattggccaagccacacacagg ttcaccttttcccagatctttgggccagaagtgggacaggcatccttcttcaacctaact gtgaaggagatggtaaaggatgtactcaaagggcagaactggctcatctatacatatgga gtcactaactcagggaaaacccacacgattcaaggtaccatcaaggatggagggattctc ccccggtccctggcgctgatcttcaatagcctccaaggccaacttcatccaacacctgat ctgaagcccttgctctccaatgaggtaatctggctagacagcaagcagatccgacaggag gaaatgaagaagctgtccctgctaaatggaggcctccaagaggaggagctgtccacttcc ttgaagaggagtgtctacatcgaaagtcggataggtaccagcaccagcttcgacagtggc attgctgggctctcttctatcagtcagtgtaccagcagtagccagctggatgaaacaagt catcgatgggcacagccagacactgccccactacctgtcccggcaaacattcgcttctcc atctggatctcattctttgagatctacaacgaactgctttatgacctattagaaccgcct agccaacagcgcaagaggcagactttgcggctatgcgaggatcaaaatggcaatccctat gtgaaagatctcaactggattcatgtgcaagatgctgaggaggcctggaagctcctaaaa gtgggtcgtaagaaccagagctttgccagcacccacctcaaccagaactccagccgcagg ctgtcactctgtgatctggctggctcagagcgctgcaaagatcagaagagtggtgaacgg ttgaaggaagcaggaaacattaacacctctctacacaccctgggccgctgtattgctgcc cttcgtcaaaaccagcagaaccggtcaaagcagaacctggttcccttccgtgacagcaag ttgactcgagtgttccaaggtttcttcacaggccgaggccgttcctgcatgattgtcaat gtgaatccctgtgcatctacctatgatgaaactcttcatgtggccaagttctcagccatt gctagccagcttgtgcatgccccacctatgcaactgggattcccatccctgcactcgttc atcaaggaacatagtcttcaggtatcccccagcttagagaaaggggctaaggcagacaca ggccttgatgatgatattgaaaatgaagctgacatctccatgtatggcaaagaggagctc ctacaagttgtggaagccatgaagacactgcttttgaaggaacgacaggaaaagctacag ctggagatgcatctccgagatgaaatttgcaatgagatggtagaacagatgcaacagcgg gaacagtggtgcagtgaacatttggacacccaaaaggaactattggaggaaatgtatgaa gaaaaactaaatatcctcaaggagtcactgacaagtttttaccaagaagagattcaggag cgggatgaaaagattgaagagctagaagctctcttgcaggaagccagacaacagtcagtg gcccatcagcaatcagggtctgaattggccctacggcggtcacaaaggttggcagcttct gcctccacccagcagcttcaggaggttaaagctaaattacagcagtgcaaagcagagcta aactctaccactgaagagttgcataagtatcagaaaatgttagaaccaccaccctcagcc aagcccttcaccattgatgtggacaagaagttagaagagggccagaagaatataaggctg ttgcggacagagcttcagaaacttggtgagtctctccaatcagcagagagagcttgttgc cacagcactggggcaggaaaacttcgtcaagccttgaccacttgtgatgacatcttaatc aaacaggaccagactctggctgaactgcagaacaacatggtgctagtgaaactggacctt cggaagaaggcagcatgtattgctgagcagtatcatactgtgttgaaactccaaggccag gtttctgccaaaaagcgccttggtaccaaccaggaaaatcagcaaccaaaccaacaacca ccagggaagaaaccattccttcgaaatttacttccccgaacaccaacctgccaaagctca acagactgcagcccttatgcccggatcctacgctcacggcgttcccctttactcaaatct gggccttttggcaaaaagtactaa >gi568815593r:138057211_138277667|GENSCAN_predicted_peptide_4|550_aa MAASTSMVPVAVTAAVAPVLSINSDFSDLREIKKQLLLIAGLTRERGLLHSSKWSAELAF SLPALPLAELQPPPPITEEDAQDMDAYTLAKAYFDVKEYDRAAHFLHGCNSKKAYFLYMY SRYLSGEKKKDDETVDSLGPLEKGQVKNEALRELRVELSKKHQARELDGFGLYLYGVVLR KLDLVKEAIDVFVEATHVLPLHWGAWLELCNLITDKEMLKFLSLPDTWMKEFFLAHIYTE LQLIEEALQKYQNLIDVGFSKSSYIVSQIAVAYHNIRDIDKALSIFNELRKQDPYRIENM DTFSNLLYVRSMKSELSYLAHNLCEIDKYRVETCCVIGNYYSLRSQHEKAALYFQRALKL NPRYLGAWTLMGHEYMEMKNTSAAIQAYRHAIEVNKRDYRAWYGLGQTYEILKMPFYCLY YYRRAHQLRPNDSRMLVALGECYEKLNQLVEAKKEIVEHLEESTAFRYLAQYYFKCKLWD EASTCAQKCCAFNDTREEGKALLRQILQLRNQGETPTTEVPAPFFLPASLSANNTPTRRV SPLNLSSVTP >gi568815593r:138057211_138277667|GENSCAN_predicted_CDS_4|1653_bp atggctgcgagtacctccatggtcccggtggctgtgacggcggcagtggcgcctgtcctg tccataaacagcgatttctcagatttgcgggaaattaaaaagcaactgctgcttattgcg ggccttacccgggagcggggcctactacacagtagcaaatggtcggcggagttggctttc tctctccctgcattgcctctggccgagctgcaaccgcctccgcctattacagaggaagat gcccaggatatggatgcctataccctggccaaggcctactttgacgttaaagagtatgat cgggcagcacatttcctgcatggctgcaatagcaagaaagcctattttctgtatatgtat tccagatatctgtctggagaaaaaaagaaggacgatgaaacagttgatagcttaggcccc ctggaaaaaggacaagtgaaaaatgaggcgcttagagaattgagagtggagctcagcaaa aaacaccaagctcgagaacttgatggatttggactttatctgtatggtgtggtgcttcga aaactggacttggttaaagaggccattgatgtgtttgtggaagctactcatgttttgccc ttgcattggggagcctggttagaactctgtaacctgatcacagacaaagagatgctgaag ttcctgtctttgccagacacctggatgaaagagttttttctggctcatatatacacagag ttgcagttgatagaggaggccctgcaaaagtatcagaatctcattgatgtgggcttctct aagagctcgtatattgtttcccaaattgcagttgcctatcacaatatcagagatattgac aaagccctctccatttttaatgagctaaggaaacaagacccttacaggattgaaaatatg gacacattctccaaccttctttatgtcaggagcatgaaatcggagttgagttatctggct cataacctctgtgagattgataaatatcgtgtagaaacgtgctgtgtaattggcaattat tacagtttacgttctcagcatgagaaagcagccttatatttccagagagccctgaaatta aatcctcggtatcttggtgcctggacactaatgggacatgagtacatggagatgaagaac acgtctgctgctatccaggcttatagacatgccattgaggtcaacaaacgggactacaga gcttggtatggcctcgggcagacctatgaaatccttaagatgccattttactgcctttat tattatagacgggcccaccagcttcgacccaatgattctcgcatgctggttgctttagga gaatgttacgagaaactcaatcaactagtggaagccaaaaaggaaatagtagaacacttg gaggaaagcactgcctttcgctatctggcccagtactattttaagtgcaaactgtgggat gaagcttcaacttgtgcacaaaagtgttgtgcatttaatgatacccgggaagaaggtaag gccttactccggcaaatcctacagcttcggaaccaaggcgagactcctaccaccgaggtg cctgctccctttttcctacctgcttcactctctgctaacaatacccccacacgcagagtt tctccactcaacttgtcttctgtcacgccatag >gi568815593r:138057211_138277667|GENSCAN_predicted_peptide_5|386_aa MVRPLNPRPLPPVVLMLLLLLPPSPLPLAAGRSTQSQNNPQPGHYQDSVSKRRGESDQKQ SVICSTWALTLPSALRTEEETSISTPLPSEEPSVPADCLEAAQQLRNSSLIGCMCHRRMK NQVACLDIYWTVHRARSLGNYELDVSPYEDTVTSKPWKMNLSKLNMLKPDSDLCLKFAML CTLNDKCDRLRKAYGEACSGPHCQRHVCLRQLLTFFEKAAEPHAQGLLLCPCAPNDRGCG ERRRNTIAPNCALPPVAPNCLELRRLCFSDPLCRSRLVDFQTHCHPMDILGTCATEQSRC LRAYLGLIGTAMTPNFVSNVNTSVALSCTCRGSGNLQEECEMLEGFFSHNPCLTEAIAAK MRFHSQLFSQDWPHPTFAVMAHQEDL >gi568815593r:138057211_138277667|GENSCAN_predicted_CDS_5|1161_bp atggtgcgccccctgaacccgcgaccgctgccgcccgtagtcctgatgttgctgctgctg ctgccgccgtcgccgctgcctctcgcagccggtaggtctacacagtcacagaacaaccca cagccaggtcactaccaagacagcgtctctaagaggcggggagaatcagatcagaaacag agtgtcatttgctccacgtgggctttaactcttccatctgctttaaggactgaggaagaa actagcataagcaccccactgccctcagaggagccttcggtccctgctgactgcctggag gcagcacagcaactcaggaacagctctctgataggctgcatgtgccaccggcgcatgaag aaccaggttgcctgcttggacatctattggaccgttcaccgtgcccgcagccttggtaac tatgagctggatgtctccccctatgaagacacagtgaccagcaaaccctggaaaatgaat ctcagcaaactgaacatgctcaaaccagactcagacctctgcctcaagtttgccatgctg tgtactctcaatgacaagtgtgaccggctgcgcaaggcctacggggaggcgtgctccggg ccccactgccagcgccacgtctgcctcaggcagctgctcactttcttcgagaaggccgcc gagccccacgcgcagggcctgctactgtgcccatgtgcccccaacgaccggggctgcggg gagcgccggcgcaacaccatcgcccccaactgcgcgctgccgcctgtggcccccaactgc ctggagctgcggcgcctctgcttctccgacccgctttgcagatcacgcctggtggatttc cagacccactgccatcccatggacatcctaggaacttgtgcaacagagcagtccagatgt ctacgagcatacctggggctgattgggactgccatgacccccaactttgtcagcaatgtc aacaccagtgttgccttaagctgcacctgccgaggcagtggcaacctgcaggaggagtgt gaaatgctggaagggttcttctcccacaacccctgcctcacggaggccattgcagctaag atgcgttttcacagccaactcttctcccaggactggccacaccctacctttgctgtgatg gcacaccaggaagatctctga