GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:24:43 Sequence gi568815579r:50522693_50737411 : 214719 bp : 52.99% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 PlyA - 107 102 6 1.05 1.09 Term - 2236 1980 257 2 2 59 54 141 0.701 3.88 1.08 Intr - 2654 2374 281 1 2 12 37 240 0.652 8.96 1.07 Intr - 7868 7661 208 0 1 100 44 32 0.588 -1.34 1.06 Intr - 8744 8460 285 0 0 127 64 144 0.906 13.76 1.05 Intr - 16709 16598 112 2 1 34 71 75 0.177 0.86 1.04 Intr - 19654 19529 126 1 0 28 82 59 0.053 0.68 1.03 Intr - 26099 25850 250 1 1 71 85 269 0.861 22.87 1.02 Intr - 26181 26104 78 2 0 91 74 73 0.899 5.36 1.01 Init - 28336 28188 149 1 2 89 75 83 0.533 4.79 1.00 Prom - 34543 34504 40 -1.21 2.07 PlyA - 35314 35309 6 1.05 2.06 Term - 36191 36166 26 0 2 87 47 -3 0.042 -5.83 2.05 Intr - 37892 37761 132 0 0 121 99 60 0.858 11.52 2.04 Intr - 39687 39621 67 0 1 82 90 52 0.854 3.77 2.03 Intr - 42402 42339 64 2 1 55 60 48 0.010 -2.09 2.02 Intr - 55391 55218 174 1 0 -1 55 166 0.010 3.97 2.01 Init - 69129 69116 14 0 2 108 80 32 0.222 3.00 2.00 Prom - 74931 74892 40 -1.51 3.15 PlyA - 76454 76449 6 1.05 3.14 Term - 76964 76879 86 0 2 71 43 71 0.317 -0.99 3.13 Intr - 82007 81651 357 0 0 68 77 201 0.358 12.79 3.12 Intr - 83641 83546 96 2 0 10 77 131 0.130 4.68 3.11 Intr - 100063 99995 69 2 0 117 86 20 0.157 4.35 3.10 Intr - 102602 102385 218 1 2 27 69 315 0.177 22.17 3.09 Intr - 102872 102701 172 0 1 82 81 486 0.999 46.82 3.08 Intr - 103325 103205 121 2 1 66 94 160 0.999 14.97 3.07 Intr - 106820 106602 219 2 0 53 69 428 0.999 36.43 3.06 Intr - 107479 107092 388 0 1 104 63 548 0.978 49.26 3.05 Intr - 110119 109594 526 2 1 116 94 293 0.954 25.30 3.04 Intr - 114734 114572 163 2 1 80 76 305 0.763 28.56 3.03 Intr - 115428 115260 169 2 1 67 35 31 0.625 -3.84 3.02 Intr - 117271 117098 174 0 0 -15 100 159 0.573 6.37 3.01 Init - 122465 122443 23 2 2 90 93 21 0.331 2.09 3.00 Prom - 125294 125255 40 -4.31 4.00 Prom + 125318 125357 40 -6.50 4.01 Init + 126753 126819 67 2 1 78 84 67 0.404 6.48 4.02 Intr + 133358 133430 73 0 1 111 80 82 0.181 8.66 4.03 Intr + 133534 133654 121 1 1 118 97 197 0.999 24.60 4.04 Intr + 135297 135436 140 2 2 53 117 252 0.994 24.37 4.05 Term + 136255 136450 196 1 1 110 48 367 0.999 32.10 4.06 PlyA + 137830 137835 6 1.05 5.26 PlyA - 138649 138644 6 -3.74 5.25 Term - 139990 139273 718 0 1 87 45 592 0.961 48.20 5.24 Intr - 146593 143500 3094 2 1 111 99 2367 0.552 228.65 5.23 Intr - 161031 160824 208 0 1 33 49 124 0.665 2.17 5.22 Intr - 163663 163545 119 2 2 85 80 115 0.897 11.09 5.21 Intr - 164120 164052 69 0 0 111 52 115 0.998 9.85 5.20 Intr - 164970 164890 81 1 0 113 80 117 0.998 13.51 5.19 Intr - 165366 165231 136 0 1 84 42 223 0.773 17.95 5.18 Intr - 166276 166152 125 2 2 114 78 195 0.999 21.91 5.17 Intr - 166587 166505 83 2 2 84 96 93 0.983 9.48 5.16 Intr - 172473 172211 263 0 2 35 97 183 0.811 10.72 5.15 Intr - 173268 173040 229 2 1 77 92 14 0.634 -0.90 5.14 Intr - 174972 174897 76 1 1 31 94 158 0.877 9.77 5.13 Intr - 175264 175151 114 2 0 72 90 60 0.891 5.52 5.12 Intr - 179968 179775 194 0 2 106 75 188 0.975 18.96 5.11 Intr - 181138 180808 331 2 1 82 91 313 0.999 26.24 5.10 Intr - 181494 181428 67 0 1 145 92 98 0.999 14.87 5.09 Intr - 181822 181745 78 1 0 70 41 124 0.926 6.14 5.08 Intr - 188795 188679 117 2 0 99 52 118 0.672 10.47 5.07 Intr - 189422 189255 168 2 0 91 47 167 0.996 13.56 5.06 Intr - 191257 191106 152 2 2 88 89 125 0.999 12.99 5.05 Intr - 191598 191490 109 0 1 56 75 209 0.999 16.76 5.04 Intr - 193038 192967 72 0 0 67 100 123 0.998 11.50 5.03 Intr - 193786 193583 204 1 0 91 57 391 0.796 36.22 5.02 Intr - 194216 193973 244 1 1 38 79 270 0.550 19.13 5.01 Init - 196638 196580 59 0 2 39 59 154 0.546 6.33 5.00 Prom - 197950 197911 40 -7.50 6.00 Prom + 199292 199331 40 -1.31 6.01 Init + 200834 200980 147 1 0 33 105 69 0.534 1.19 6.02 Intr + 201213 201399 187 2 1 83 75 124 0.831 10.38 6.03 Intr + 201718 201909 192 2 0 106 96 332 0.982 35.78 6.04 Term + 202330 202775 446 2 2 98 44 709 0.999 63.28 6.05 PlyA + 202995 203000 6 1.05 7.02 PlyA - 203721 203716 6 -0.45 7.01 Sngl - 205759 205433 327 1 0 72 48 212 0.822 9.94 7.00 Prom - 214102 214063 40 1.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 24989 24879 111 2 0 75 34 87 0.809 0.86 S.002 Init + 133309 133430 122 0 2 85 80 138 0.815 10.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:50522693_50737411|GENSCAN_predicted_peptide_1|581_aa MAAAGLEAGVGAVGSHRPCGVPRPQIQALRFGPALWLRGCGLDHRQGLKSPPASTRSMNI EDGACPRLPVPPAAARMSWPHGALLFLWLFSPPLGAGGGGVAVTSAAGGGSPPATSCPVA CSCSNQASRVICTRRDLAEVPASIPVNTRYLNLQENGIQLTCPGPLGEDRVQIFASISSG PGGKVHHLGWAQPVHSSLLSQDDFSHLMDVEMEAQALSDLSKVAQQAVAPMGLPCASEAG SLRAERRAASGCPPVATLLSRSSVQRHQGDLENTVGWSPRCTLPGTGSPAESLDFTEPWS PVSEMTCDAHLQGCRGSQQEPGVPQRLRELQAGGLLYLHSINKQWGNEWRPLGWVQEGEQ ELSRWMKGKGVLSRGNSMCKGGVPGERVQGLILANVYPSEGCEAWYKFLNANDNDNNEND DVLSTPEHWDLISSPSPSEVLFIILLVVRVSPEVQRAQVTGPTTRTGEAGFEQGSALHTT PVTSVSVSVHGERRWVGLLDGQKVTVSTERPWAPSKPAASHRAAASIGAAGWARGLHALL LRGALDALGVKHMLGFLGFRSSTIKSVKWLNRKFASDDRFK >gi568815579r:50522693_50737411|GENSCAN_predicted_CDS_1|1746_bp atggcagcggcaggcctggaggccggggtgggtgccgtgggctcccacaggccctgcggg gtcccccggcctcaaatccaggccctgcgctttgggccagccctttggctaagagggtgt gggctggatcatcgccagggtcttaagagccccccagcgtccacgcggagcatgaacatt gaggatggcgcgtgcccgcggctccccgtgccccccgctgccgcccggatgtcctggccc cacggggcattgctcttcctctggctcttctccccacccctgggggccggtggaggtgga gtggccgtgacgtctgccgccggagggggctccccgccggccacctcctgccccgtggcc tgctcctgcagcaaccaggccagccgggtgatctgcacacggagagacctggccgaggtc ccagccagcatcccggtcaacacgcggtacctgaacctgcaagagaacggcatccagctc acatgccctggccccctgggagaagaccgggtgcagatcttcgcctccatctcctcggga cctggggggaaggtgcaccacctcggctgggcccagcctgtccactcctcgctcctcagt caggatgacttctcccatcttatggacgtggaaatggaggcacaggctctgagtgactta tccaaggtcgcccagcaagctgtagcccccatgggcttgccatgtgcctcagaagctggg tcactgcgtgctgagcgccgagcagcctcaggctgtccgcctgtggccacgctgttgtca cggtcctctgtccagaggcaccaaggagacttggaaaacactgtgggctggagccccagg tgcacgctccccgggaccggcagccctgctgagtcacttgacttcaccgagccttggtct cctgtctctgaaatgacgtgtgatgctcacctccagggctgccgggggagtcagcaggag cctggggtgccgcagaggctccgggagctgcaggcaggaggcctgctgtatctccattca atcaataagcaatgggggaatgagtggagacctttgggctgggttcaagagggtgagcag gagctttcccggtggatgaaagggaagggcgtcctgagcagagggaacagcatgtgcaaa ggtggagtgcccggcgagagggtccaaggactgatcctggctaacgtgtatccgagtgag ggctgcgaggcatggtacaagtttctcaacgccaacgataacgataacaatgagaatgac gacgtgctgagcacccctgaacactgggacctcatctcctctccctcaccctctgaagtg cttttcatcatcctcctcgtggtaagagtgagcccagaggtacagagagctcaagtgaca ggccctacaaccaggacgggggaggcgggatttgaacagggctctgctcttcacactaca cccgtgacctctgtttccgtgtctgtccacggggaacggcgatgggtggggctgctggat ggacaaaaggtgactgtgtccacagagcgtccctgggcaccgagtaaacccgcagcatct cacagggcagccgccagcatcggggctgctggctgggctcggggactgcatgcgttgcta ctgagaggtgccctggacgccctgggtgtaaaacacatgttgggtttccttggatttcga agctccaccataaagagtgtaaaatggctcaatcgcaagtttgcatctgatgacaggttc aaatga >gi568815579r:50522693_50737411|GENSCAN_predicted_peptide_2|158_aa MVLARRFCGQCCQVEDWDAGREVAIECAVRRPLVALDKDSLVGSGMQKPSWCLQRDYEMK KERDSPESVPLATGLLRDLLAVREATLQGGTVIIPGLTDGETEAQRGRCDEVDEEKEDLL DPGISSLSIFLLCPRHFPKLITDIRPSPPTAVISLASP >gi568815579r:50522693_50737411|GENSCAN_predicted_CDS_2|477_bp atggtgctggccaggaggttctgtggccagtgctgccaagtggaagactgggatgcgggt agagaagtggccattgagtgtgcagtacggaggccattggtggccttggacaaggacagc ttggtcggaagtgggatgcagaagcccagctggtgtttgcagagggactatgagatgaag aaggagagagacagccctgagagtgttcctctcgccacggggctcctcagggacctcctc gccgtccgagaggcaaccctgcaaggaggcacagtcatcatccctggtttgacagatggg gaaactgaggcccagagaggacggtgtgatgaggtagacgaggagaaggaggatcttctg gatcctggaataagttcactgagcattttccttttatgccccaggcatttccctaaactc atcacagatatcagaccttccccacctacagctgtgatctccctggcttctccatga >gi568815579r:50522693_50737411|GENSCAN_predicted_peptide_3|926_aa MAIALAKRPSDFGFLAAGSEWGKEGVQGEEPRTCDSGEGRSRAGTAAGARGCGGGGGGSG GSGGSGCVLWSASHRQSDSDRFVYVFLFACLLYVSSRWIVTPGGQGTPGPCSLLDSQQPA QHDQTATMSGDYEDDLCRRALILVSDLCARVRDADTNDRCQEFNDRIRGYPRGPDADISV SLLSVIVTFCGIVLLGVSLFVSWKLCWVPWRDKGGSAVGGGPLRKDLGPGVGLAGLVGGG GHHLAAGLGGHPLLGGPHHHAHAAHHPPFAELLEPGSLGGSDTPEPSYLDMDSYPEAAAA AVAAGVKPSQTSPELPSEGGAGSGLLLLPPSGGGLPSAQSHQQVTSLAPTTRYPALPRPL TQQTLTSQPDPSSEERPPALPLPLPGGEEKAKLIGQIKPELYQGTGPGGRRSGGGPGSGE AGTGAPCGRISFALRYLYGSDQLVVRILQALDLPAKDSNGFSDPYVKIYLLPDRKKKFQT KVHRKTLNPVFNETFQFSVPLAELAQRKLHFSVYDFDRFSRHDLIGQVVLDNLLELAEQP PDRPLWRDIVEGGSEKADLGELNFSLCYLPTAGRLTVTIIKASNLKAMDLTGFSDPYVKA SLISEGRRLKKRKTSIKKNTLNPTYNEALVFDVAPESVENVGLSIAVVDYDCIGHNEVIG VCRVGPDAADPHGREHWAEMLANPRKPVEHWHQLVEVSRGPTPTPDKHPCTLGPFIHATN ALHVRKDCDQLHKRQQRTIRERELRVRASARWSKAASKAGTSPCDVTSRQPAAAACELAL LAEDIPLRERHAKGRVDPHFQASSYVQALTHQASDQSNGSNERQQVSNQVGQGCQDDTSL SIGLPGSVSSSAERPLALGKARPSTPASARWSKAASKAGTSPCDVTSRQPAAAACELAPM TVSVNSFNDRDTGPSSRTRGSMFAEH >gi568815579r:50522693_50737411|GENSCAN_predicted_CDS_3|2781_bp atggccatagccctggccaaaaggccaagtgatttcggctttcttgccgcgggttcggag tgggggaaggagggggtgcagggggaggagccgcggacctgcgacagtggcgaggggagg agccgagccggcaccgcggctggcgcgcggggctgcggcggcggcggcggcggcagcggc ggcagcggcggctccggctgtgtcctgtggagcgcttctcacagacagtctgacagtgac cggtttgtgtatgtgtttttgtttgcttgtttactctatgtctcctctcgttggattgtg acaccgggaggtcagggaactccaggaccttgttctctgctggattcgcagcaaccagca cagcacgaccagacggccaccatgtcaggagactacgaggatgacctctgccggcgggca ctcatcctggtctcggacctctgtgcgcgggtccgagatgctgacaccaacgacaggtgc caggagttcaatgaccgaatccgaggctatccccggggtccagatgcagacatctccgtg agcctgctgtcggtcatcgtgacattctgtggcattgtccttctgggtgtctctctcttc gtgtcctggaagttgtgctgggtgccctggcgggacaagggaggctcggcagtgggcggt ggccccctgcgcaaagacctaggccctggtgtcgggctggcaggcctggtaggcggaggc gggcaccacctggcggctggcctgggtggccatcctctgctgggcggcccacaccaccat gcccatgccgcccaccatccaccctttgctgagctgctggagccaggcagcctggggggt tctgacacccctgagccctcctacttggacatggactcgtatccagaggctgcagcagca gcagtggccgctggggtcaaaccgagccaaacatcccctgagctgccctctgagggggga gcaggctctgggttgctcctgctgccccccagtggtgggggcttgcccagtgcccagtca catcagcaggtcacaagcctggcacccactaccaggtacccagccctgccccgacccctc acccagcagactctgacctcccagccggaccccagcagtgaggagcggccacctgccctg cccttacccctgcctggaggcgaggaaaaagccaaactcattgggcagattaagccagag ctgtaccaggggactggccctggtggccggcggagcggtgggggcccaggctctggagag gcaggcacaggggcaccctgtggccgtatcagcttcgccctgcggtacctctatggctcg gaccagctggtggtgaggatcctgcaggccctggacctccctgccaaggactccaacggc ttctcagacccctacgtcaagatctacctgctgcctgaccgcaagaaaaagtttcagacc aaggtgcacaggaagaccctgaaccccgtcttcaatgagacgtttcaattctcggtgccc ctggccgagctggcccaacgcaaactgcacttcagcgtctatgactttgaccgcttctcg cggcacgacctcatcggccaggtggtgctggacaacctcctggagctggccgagcagccc cctgaccgcccgctctggagggacatcgtggagggcggctcggaaaaagcagatcttggg gagctcaacttctcactctgctacctccccacggccgggcgcctcaccgtgaccatcatc aaagcctctaacctcaaagcgatggacctcactggcttctcagacccctacgtgaaggcc tccctgatcagcgaggggcggcgtctgaagaagcggaaaacctccatcaagaagaacacg ctgaaccccacctataatgaggcgctggtgttcgacgtggcccccgagagcgtggagaac gtggggctcagcatcgccgtggtagactacgactgcatcgggcacaacgaggtgatcggc gtgtgccgtgtgggccccgacgctgccgacccgcacggccgcgagcactgggcagagatg ctggccaatccccgcaagcccgtggagcactggcatcagctagtggaggtgagtaggggt cccacaccgacccctgacaagcacccgtgcaccctcggccccttcatccatgcaacgaac gccctgcatgttcgaaaagactgtgaccagcttcacaaaaggcagcaaaggactatcaga gaaagagaactccgagtgagggcaagcgcgcgctggtcgaaggcagccagtaaggcaggg acaagcccctgcgacgtcactagtcgccagccggccgccgcggcctgcgagctcgcgctg cttgcggaagatatcccgctaagagagagacatgccaaaggtagggtagatccacatttc caggcatccagttatgtacaagccttgacccaccaggcatccgatcagagtaatggatcc aacgaacggcagcaggtatccaatcaggtaggacagggttgccaggatgacacgagtctc tccatagggctcccagggtcagtgagctctagcgctgagcgccccctggcgctgggaaag gcgcggccgtctactccagcaagcgcgcgctggtcgaaggcagccagtaaggcagggaca agcccctgcgacgtcactagtcgccagccggccgccgcggcctgcgagctcgcgcctatg actgtctcggtaaattcctttaatgaccgcgacaccggccccagcagtcgcacccgcggc agtatgtttgcagaacactaa >gi568815579r:50522693_50737411|GENSCAN_predicted_peptide_4|198_aa MQPEVEPVCFPAMGSPTMHRKAGALLMDLETPEEMQARSLGRPIKSSKQYLRQVIAEYEA LDRELPCIRKFPTPPASQPLCLCMETLPEEDFTHLEVLQALEAQLPGAMESGRVSSIRFE NMNVICGTAGRRNRWLIAVTDFQTRSRLLRSGLSPRGLAHQIVRHDDLLLGDYRLHLRRS LVRRRMLEALGAEPNEEA >gi568815579r:50522693_50737411|GENSCAN_predicted_CDS_4|597_bp atgcagccagaggtggagcccgtgtgcttccctgccatgggcagccccaccatgcacagg aaggcaggagccctccttatggacctggagaccccagaggagatgcaggctcggagcctg ggcaggcccatcaaatcctcaaagcagtacctgcggcaggtcattgcagagtacgaggca ctggaccgagaactcccgtgcatccggaagttccccacaccaccagcttcccagcccctc tgcctctgcatggagaccttgcccgaggaggattttacccacctggaggtgctgcaagcc ctggaggcccagttaccaggggccatggagagcgggcgcgtgagcagcatccgctttgag aacatgaacgtcatctgtgggactgctgggcgccggaaccggtggctcatcgcggtcacg gacttccagacgcgctcgcgcttgctgcgctccgggctcagtccccgcgggcttgcgcac cagatcgtgcgccacgacgacctcctgctgggcgactaccgcctgcacctgcgccgctcc ctggtccggcggcgcatgctcgaggccctgggggcggagccgaacgaggaggcctga >gi568815579r:50522693_50737411|GENSCAN_predicted_peptide_5|2369_aa MSLARRRAPGLGVRLGVRVSPATSEDEERHSASECPEGGSESDSSPDGPGRGPRGTRGQG SGAPGSLASVRGLQGRSMSVPDDAHFSMMVFRIGIPDLHQTKCLRFNPDATIWTAKQQVL CALSESLQDVLNYGLFQPATSGRDANFLEEERLLREYPQSFEKGVPYLEFRYKTRVYKQT NLDEKQLAKLHTKTGLKKFLEYVQLGTSDKVARLLDKGLDPNYHDSDSGETPLTLAAQTE GSVEVIRTLCLGGAHIDFRARDGMTALHKAACARHCLALTALLDLGGSPNYKDRRGLTPL FHTAMVGGDPRCCELLLFNRAQLGIADENGWQEIHQACQRGHSQHLEHLLFYGAEPGAQN ASGNTALHICALYNKETCARILLYRGADKDVKNNNGQTPFQVAVIAGNFELGELIRNHRE QDVVPFQESPKYAARRRGPPGTGLTVPPALLRANSDTSMALPDWMVFSAPGAASSGAPGP TSGSQGQSQPSAPTTKLSSGTLRSASSPRGARARSPSRGRHPEDAKRQPRGRPSSSGTPR EGPAGGTGGSGGPGGSLGSRGRRRKLYSAVPGRSFMAVKSYQAQAEGEISLSKGEKIKVL SIGEGGFWEGQVKGRVGWFPSDCLEEVANRSQESKQESRSDKAKRLFRHYTVGSYDSFDA PSQGPTPASLPELGGPSVCLPSWRVPPPPRAGSGSTLSRGAVVVCSPLRLGFSGGGGVLL MALGPSLNVLAFLADCVEHEAGPGGPDTGVRPSGYGDAAAAAAAGAAGTGTRAGAGAAAA MALSAVGGGGLGGGPGGSSLPQPPPALSSSWPALGPRRRSVWYIYSDYIIKEKTVLLQKK DSEGFGFVLRGAKAQTPIEEFTPTPAFPALQYLESVDEGGVAWRAGLRMGDFLIEVNGQN VVKVGHRQVVNMIRQGGNTLMVKVVMVTRHPDMDEAVHKKAPQQAKRLPPPTISLRSKSM TSELEEMEYEQQPAPVPSMEKKRTVYQMALNKLDEILAAAQQTISASESPGPGGLASLGK HRPKGFFATETRLNEGAFSFRLLYCERVSFLGLFSATCFASLGLLVVISAFKRSPKHGAE ELSSVPKCEKAVMCLMEKIRAAEDDRPYLAPPAMKFSRSLSVPGSEDIPPPPTTSPPEPP YSTPPVPSSSGRLTPSPRGGPFNPGSGGPLPASSPASFDGPSPPDTRVGSREKSLYHSGP LPPAHHHPPHHHHHHAPPPQPHHHHAHPPHPPEMETGGSPDDPPPRLALGPQPSLRGWRG GGPSPTPGAPSPSHHGSAGGGGGSSQGPALRYFQLPPRAASAAMYVPARSGRGRKGPLVK QTKVEGEPQKGGGLPPAPSPTSPASPQPPPAVAAPSEKNSIPIPTIIIKAPSTSSSGRSS QGSSTEAEPPTQPEPTGGGGGGGSSPSPAPAMSPVPPSPSPVPTPASPSGPATLDFTSQF GAALVGAARREGGWQNEARRRSTLFLSTDAGDEDGGDGGLGTGAAPGPRLRHSKSIDEGM FSAEPYLRLESAGSGAGYGGYGAGSRAYGGGGGSSAFTSFLPPRPLVHPLTGKALDPASP LGLALAARERALKESSEGGGAPQPPPRPPSPRYEAPPPTPHHHSPHAHHEPVLRLWGASP PDPARRELGYRAGLGSQEKSLPASPPAARRSLLHRLPPTAPGVGPLLLQLGTEPPAPHPG VSKPWRSAAPEEPERLPLHVRFLENCQPRAPVTSGRGPPSEDGPGVPPPSPRRSVPPSPT SPRASEENGLPLLVLPPPAPSVDVEDGEFLFVEPLPPPLEFSNSFEKPESPLTPGPPHPL PDTPAPATPLPPVPPPAVAAAPPTLDSTASSLTSYDSEVATLTQGASAAPGDPHPPGPPA PAAPAPAAPQPGPDPPPGTDSGIEEVDSRSSSDHPLETISSASTLSSLSAEGGGSAGGGG GAGAGVASGPELLDTYVAYLDGQAFGGSSTPGPPYPPQLMTPSKLRGRALGASGGLRPGP SGGLRDPVTPTSPTVSVTGAGTDGLLALRACSGPPTAGVAGGPVAVEPEVPPVPLPTASS LPRKLLPWEEGPGPPPPPLPGPLAQPQASALATVKASIISELSSKLQQFGGSSAAGGALP WARGGSGGGGDSHHGGASYVPERTSSLQRQRLSDDSQSSLLSKPVSSLFQNWPKPPLPPL PTGTGVSPTAAAAPGATSPSASSSSTSTRHLQGVEFEMRPPLLRRAPSPSLLPASEHKVS PAPRPSSLPILPSGPLYPGLFDIRGSPTGGAGGSADPFAPVFVPPHPGISGGLGGALSGA SRSLSPTRLLSLPPDKPFGAKPLGFWTKFDVADWLEWLGLAEHRAQFLDHEIDGSHLPAL TKEDYVDLGVTRVGHRMNIDRALKFFLER >gi568815579r:50522693_50737411|GENSCAN_predicted_CDS_5|7110_bp atgtccttggcccggcgccgggctcctggcctgggtgtgcggctgggggtacgggtgagc cccgcgacaagcgaggacgaggaacgccacagtgccagcgagtgtcccgaggggggctca gagtccgacagctccccagacgggccaggtcgaggcccccgggggacccggggccagggc agtggggcacctggtagcctggcctctgttagaggcctccagggccgctcaatgtccgtc ccagacgacgcccacttcagcatgatggtcttcaggattggcatcccggacctgcaccag acaaaatgccttcgcttcaaccccgatgccaccatctggacggccaagcagcaggtgctc tgtgccctgagcgagagcctgcaggatgtgctcaactatggcctgttccaaccggccacc tccggccgcgatgccaacttcctggaggaggagaggctgctgcgggagtacccccagtcc tttgagaagggggtcccctacctggagttccgatacaagacccgagtttacaaacagacc aacctggatgagaagcagctggccaagttgcacacgaagacggggttgaagaagttcctg gagtatgtgcagctcgggacatctgacaaggtggcgcggctgctggacaaggggctggac cccaattaccatgactcggattcgggagagacccccttgacactggcggcccagaccgaa ggctctgtagaggtgattcgaaccctgtgcctgggcggggcccacattgacttccgggcc cgggatggcatgaccgcactgcataaggccgcatgcgcccgacactgcctggcactcacg gcgctcctggaccttgggggttcccccaactacaaggaccgtcgggggctgacccctctg ttccacacggccatggtgggtggtgacccccgatgctgcgagctgctcctgttcaacagg gcccagctgggcatagctgatgagaacggctggcaggaaatccaccaggcctgccagcgg ggtcactctcagcacctggagcatctgcttttctacggggctgagcctggagcccagaac gcctcggggaacacggctctgcacatctgcgccctctacaacaaggagacctgtgccagg atcctcctgtatcgaggtgccgacaaggatgtgaagaacaacaacggacagacccccttc caggtggcagtgattgctgggaattttgagctgggggagctgatccgaaaccaccgagaa caggatgtggtgcccttccaggagtcccccaagtacgcggcccggcgacgggggccccca ggcacagggctgacggtgcccccggcgctgctgcgggccaacagtgacaccagcatggcg ctgcccgactggatggtgttctccgccccgggggccgcgtcctctggggcccctggccct acctcagggtcccagggccagtcgcagccctcggcccccaccaccaagctcagcagcggg accctccgaagtgccagcagcccccggggtgccagggcccgctctccatcccgagggagg caccctgaggacgccaagaggcagccccgaggccggcccagctccagcgggacaccccgg gaagggccagccgggggcacggggggctcagggggccccgggggctccctgggcagccgc gggaggcggaggaagctctactcagcggtacccggacgctccttcatggctgtgaagtcc taccaggcccaagccgagggggagatctccctgagcaagggcgagaagatcaaagtactt agcatcggggaaggaggcttctgggaaggccaggtcaaaggtcgtgttggctggttcccc tctgactgcctggaagaagtggcgaatcgctctcaggagagcaagcaagaaagccgcagt gacaaggcaaagagactcttccggcattataccgtgggctcctacgacagctttgatgcc ccaagccagggcccaaccccagcctccctgccggagctgggcggtccttccgtgtgtctc ccctcctggcgtgtgccgccccctccccgcgctggctcggggtccacgctctctcgggga gctgtggtggtgtgttcgcccctccgcctgggcttctctggtgggggaggggtgttgctc atggccttggggcccagtctgaatgtcctggcctttcttgccgactgtgtcgagcatgag gcggggcctgggggcccggacacgggggttcggccgagcgggtacggggacgcggcggcg gcggcggcggcgggggcggccggcaccgggacccgggccggggctggggccgcggcggcg atggctttgtctgcggtcggcggcgggggtctcgggggcggccccggtgggagctccctg ccgcagccgccccccgcgctgtcctccagctggccagccctggggccccggcggcgcagc gtctggtacatctacagcgattacatcattaaggagaagacagtcttgctgcagaagaag gacagtgaggggtttgggttcgtgctccggggggccaaggcgcagacccccatcgaggag ttcacccccaccccggccttcccggcgctgcagtacctggagtcggtggacgagggtggc gtggcatggcgagctggactgcgaatgggagacttcctcatcgaggtgaacgggcagaat gtggtgaaggtcggccaccgacaggtggtgaacatgatccgccaagggggcaacacgctg atggtgaaggtggtgatggtcaccaggcacccggacatggatgaggcagtgcacaagaaa gcaccccagcaggccaagcggctgccgcccccaaccatctccctgcgttccaaatctatg acctcagagctggaggagatggagtacgagcagcagccggcgccggtgcccagcatggag aaaaagcggaccgtgtatcagatggctctcaacaaactggacgaaatcctggccgcagct caacagaccatcagtgcaagcgaaagccctggtcccggtggcctcgcgtccctgggcaaa caccgacccaaaggtttctttgccactgagacgaggctgaacgagggtgccttctcattt cggctcttatactgtgaacgagtgtcttttctcggtctgtttagtgccacgtgctttgca tctttgggccttttagtggtgatatcagcgtttaaaaggtcccccaagcatggtgctgaa gagctgtctagcgttcctaagtgcgagaaggccgtgatgtgccttatggagaaaatacgt gcggcagaagatgacagaccttacctagcacccccagccatgaaattcagccgcagcctg tctgtgcctggttcggaggacattcccccgccacccaccacgtccccaccggagcctccc tacagcacacctccagtcccctcctcctcagggcgcctcaccccctcccctcggggaggg cccttcaaccctggctctggtggccccctccccgcctcctcccctgcatcctttgacggg ccctcccctcccgacactcgcgtggggagccgcgagaagagcctgtaccacagtgggccc ctgcccccggcccaccaccacccgccccaccaccaccaccaccacgccccgccccctcag ccccaccaccaccacgcccacccccctcatcctcccgagatggagacaggcggctctccc gacgaccctccaccccgcctggctctggggccccagcccagcctgcgaggctggaggggc ggcgggcccagcccgaccccgggggccccgtccccatcgcaccacggcagcgcgggcggg ggcggcggctcctcccagggcccggctctacgctatttccagctgcccccgcgggcggcc agcgcagccatgtacgtgcccgcccgctcgggccgcggccgcaagggcccgctggtcaag cagaccaaggtggaaggcgagccccagaagggcggcggcctcccgcccgcgccgtcgccc acgtccccggcctccccgcagccgccgcccgccgtggccgcgccctcggagaagaactcc atccccatccccaccatcatcatcaaggccccgtccaccagtagcagcggccgcagcagc cagggcagcagcaccgaggcggagccccccacccagccggagcccacgggaggcggcggc ggcggcggctcctcgcccagccccgccccggccatgtcacccgtgcccccgtccccctcg cccgtgcccacccccgcctcgcccagcggcccggccacgctggacttcacgagccagttc ggggccgccctggtgggggcggcccggagggaggggggctggcagaatgaggcgcgccgg cgctccacgctgttcctgtccaccgacgcgggggacgaggacggcggggacggcgggctg ggcacaggggcggccccgggcccgcggctgcgccactccaaatccatcgacgagggcatg ttctccgccgagccctacctccgactggagtctgcgggcagcggcgcgggctacggcggc tacggggccggtagccgagcctacgggggtggcgggggcagcagcgccttcaccagcttc ctgcccccgcgacccctggtgcacccgctgaccggcaaggccctggatcccgcctccccg ctggggctggccctggccgcccgcgagcgagcgctgaaggagtcctcggagggcggcggg gccccccagccgcctcccaggcccccatcgccccgctacgaggccccgccgcccaccccg caccaccactcgccccacgcccaccacgagccagtgctgcgtctctggggggcctccccg ccggaccctgcgcgccgggagctggggtacagggccgggctgggcagccaggagaagtcc cttcccgccagcccgcccgccgcccggcgttccctgctacaccgcctgccgcccaccgct cccggggtggggcccctcctgctgcagctggggacggagcccccggccccgcaccccgga gtaagcaagccctggaggtccgcagcccccgaagaacccgagcggctgccgctgcacgtg cggttccttgaaaactgccagccccgggcccctgtgacgagcggaaggggtcccccctcg gaggacgggccgggggtcccgccgcccagcccacgccggtccgtgcccccctccccgacc tccccgagggccagcgaagagaacgggctgcccctgctggtcctgccgcctcccgccccc tcggtggatgtggaagatggcgaattccttttcgtggaaccgctgcctccgcctctggaa ttctccaacagcttcgaaaagccagagtcgcccctcacgcctgggcctccccacccgctg cccgacacacctgcccctgccaccccgttaccccctgtgccacccccggctgtggccgca gcccctcccaccctggactccaccgcatccagcctgacatcctatgacagcgaggtggcc accctgacccagggggcctccgccgctcctggggacccccatccaccaggcccgcctgcc ccagcagcaccggctcccgctgccccacagcctggcccggaccctccgcctggcacggat tctggcatcgaggaggtggacagtcggagcagcagtgaccacccactggagaccatcagc agcgcctccacgctgagcagcctatctgccgaaggtggtggcagcgcagggggtgggggc ggggctggggccggtgtggccagtgggccggagcttctggacacctatgtggcctacctg gacggccaggcctttgggggcagcagtactcccggcccgccataccctcctcagctcatg actccctctaagctccggggccgggcgctaggagccagcggaggcctgcggcctggcccc agcgggggactccgagaccctgttacccccaccagccccaccgtctcggtgacaggggct ggaaccgatgggctgctggccctgcgtgcttgttcaggaccccccacggcaggcgtggcg gggggtccggtggctgtagagccagaagtcccaccggtgcccttgccgacggcctcctct ctgccccggaagctgctgccctgggaggagggcccgggcccaccgccaccacctctgccc gggcccttggcccagcctcaggcctcagccttggccacagtaaaagccagcatcatcagt gaactcagctccaagcttcagcagtttgggggctcctcggcagctggcggcgctctgccc tgggcccgaggaggcagtgggggaggcggagacagccaccacgggggagccagctatgtc cccgagaggacctcctccctgcagcggcagagactctccgacgactcccagtcctcactc ctctccaagcctgtcagcagcctgtttcagaactggcccaaaccacctctgccgccactc cccaccggaacaggggtctcccctacagccgctgcggccccaggggccacctcaccctca gcctcctcctcctccacgtccacccgccacctccagggcgtggagttcgagatgcggccc cctctgctccgccgggcccccagcccctcgctgctgcccgcctcggagcacaaggtcagc cctgcgcccaggccctcgtccctgcccatcctgccttccggacccctctacccaggcctc tttgacatccgtggctccccaactggaggggcaggaggctcggctgaccccttcgcccca gtctttgtgccgccacacccggggatatccggggggctcgggggagccttgtcaggggcc tcgcgctccctctcaccgacccgcctgctctcgctgcccccggacaagccgtttggcgct aaacctctggggttctggaccaagttcgacgtggctgattggctggagtggctgggtttg gcggagcaccgagcccagttcctggaccacgagatcgatggctcccacctgcccgccttg accaaggaggactacgtcgatctaggtgtgaccagggtgggccaccgcatgaacatcgac cgggctctcaaattcttcctggagaggtga >gi568815579r:50522693_50737411|GENSCAN_predicted_peptide_6|323_aa MQAAWLLGALVVPQLLGFGHGARGAEREWEGGWGGAQEEEREREALMLKHLQEALGLPAG RGDENPAGTVEGKEDWEMEEDQGEEEEEEATPTPSSGPSPSPTPEDIVTYILGRLAGLDA GLHQLHVRLHALDTRVVELTQGLRQLRNAAGDTRDAVQALQEAQGRAEREHGRLEGCLKG LRLGHKCFLLSRDFEAQAAAQARCTARGGSLAQPADRQQMEALTRYLRAALAPYNWPVWL GVHDRRAEGLYLFENGQRVSFFAWHRSPRPELGAQPSASPHPLSPDQPNGGTLENCVAQA SDDGSWWDHDCQRRLYYVCEFPF >gi568815579r:50522693_50737411|GENSCAN_predicted_CDS_6|972_bp atgcaggcagcctggcttttgggggctttggtggtcccccagctcttgggctttggccat ggggctcggggagcagagagggagtgggagggaggctggggaggtgcccaggaggaggag cgggagagggaggccctgatgctgaagcatctgcaggaagccctaggactgcctgctggg aggggggatgagaatcctgccggaactgttgagggaaaagaggactgggagatggaggag gaccagggggaggaagaggaggaggaagcaacgccaaccccatcctccggccccagcccc tctcccacccctgaggacatcgtcacttacatcctgggccgcctggccggcctggacgca ggcctgcaccagctgcacgtccgtctgcacgcgttggacacccgcgtggtcgagctgacc caggggctgcggcagctgcggaacgcggcaggcgacacccgcgatgccgtgcaagccctg caggaggcgcagggtcgcgccgagcgcgagcacggccgcttggagggctgcctgaagggg ctgcgcctgggccacaagtgcttcctgctctcgcgcgacttcgaagctcaggcggcggcg caggcgcggtgcacggcgcggggcgggagcctggcgcagccggcagaccgccagcagatg gaggcgctcactcggtacctgcgcgcggcgctcgctccctacaactggcccgtgtggctg ggcgtgcacgatcggcgcgccgagggcctctacctcttcgaaaacggccagcgcgtgtcc ttcttcgcctggcatcgctcaccccgccccgagctcggcgcccagcccagcgcctcgccg catccgctcagcccggaccagcccaacggtggcacgctcgagaactgcgtggcgcaggcc tctgacgacggctcctggtgggaccacgactgccagcggcgtctctactacgtctgcgag ttccccttctag >gi568815579r:50522693_50737411|GENSCAN_predicted_peptide_7|108_aa MRLLLGLLGFIALQARAAAGSSSSVESSQASCPCSTTSLGSLNASIPISCRGLPWSMALA LHTTSAPQRARERTLANSGPQEVWGGFLEERVGFPRRPESGISAECHI >gi568815579r:50522693_50737411|GENSCAN_predicted_CDS_7|327_bp atgcgcctgctgctcggcctcctgggctttatcgccctgcaagcccgtgcggccgctggt tcttcctcctccgtcgaatcatcccaggcctcctgcccttgttctactacttccctggga tcccttaatgcatctattccaatcagctgccgaggcctcccctggtccatggctctggcc ttgcacactaccagcgctccgcagagagccagggagaggacattggcaaactctggcccg caggaagtctgggggggcttcctggaggagagagtaggtttcccacgcagaccagagtca ggaatcagcgccgaatgccacatatag