GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:26:18 Sequence gi568815595r:101581138_101786557 : 205420 bp : 41.87% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4300 4374 75 0 0 78 95 110 0.648 9.49 1.02 Intr + 18175 18381 207 0 0 97 5 163 0.016 7.25 1.03 Intr + 31207 31334 128 0 2 111 46 90 0.054 5.66 1.04 Term + 42929 43052 124 2 1 88 47 92 0.416 1.98 1.05 PlyA + 46262 46267 6 1.05 2.00 Prom + 50382 50421 40 -4.65 2.01 Init + 51953 51992 40 1 1 66 56 56 0.146 -1.21 2.02 Intr + 54363 54644 282 1 0 -6 60 267 0.128 10.97 2.03 Intr + 55229 55476 248 0 2 73 67 136 0.286 6.26 2.04 Intr + 56042 56190 149 1 2 77 88 104 0.997 7.41 2.05 Intr + 56411 56958 548 2 2 36 82 363 0.879 22.10 2.06 Intr + 57115 57666 552 2 0 -9 81 279 0.269 9.40 2.07 Intr + 57735 57892 158 1 2 23 61 143 0.775 3.91 2.08 Intr + 58201 58386 186 0 0 63 -21 172 0.475 2.76 2.09 Intr + 58693 58809 117 0 0 67 87 149 0.927 12.44 2.10 Term + 59695 59916 222 0 0 103 36 140 0.939 6.23 2.11 PlyA + 60856 60861 6 1.05 3.17 PlyA - 63343 63338 6 1.05 3.16 Term - 70546 70029 518 2 2 111 48 412 0.977 32.99 3.15 Intr - 71534 71359 176 1 2 96 88 108 0.991 10.26 3.14 Intr - 71801 71643 159 1 0 32 95 99 0.714 3.18 3.13 Intr - 73684 73567 118 2 1 56 89 58 0.750 1.30 3.12 Intr - 75111 74967 145 0 1 107 95 42 0.978 5.73 3.11 Intr - 78891 78646 246 0 0 10 103 137 0.629 4.13 3.10 Intr - 83577 83401 177 0 0 70 100 152 0.999 13.79 3.09 Intr - 84671 83827 845 0 2 88 80 674 0.966 56.76 3.08 Intr - 90224 89993 232 2 1 71 95 204 0.929 15.82 3.07 Intr - 91076 90841 236 0 2 99 78 270 0.998 23.58 3.06 Intr - 95803 95468 336 2 0 77 78 408 0.777 33.27 3.05 Intr - 96148 95838 311 0 2 35 42 190 0.377 4.13 3.04 Intr - 101770 101634 137 1 2 111 94 166 0.996 17.95 3.03 Intr - 104791 104681 111 1 0 86 123 109 0.999 13.86 3.02 Intr - 105420 105345 76 2 1 112 93 46 0.605 6.10 3.01 Init - 105539 105535 5 2 2 101 74 0 0.594 -0.58 3.00 Prom - 109930 109891 40 -4.75 4.00 Prom + 110687 110726 40 -8.85 4.01 Init + 111125 111180 56 1 2 65 21 42 0.124 -3.99 4.02 Intr + 111666 111832 167 0 2 53 103 107 0.529 7.38 4.03 Intr + 112019 113867 1849 0 1 34 -33 1174 0.101 86.27 4.04 Intr + 115680 117717 2038 0 1 51 89 557 0.033 38.47 4.05 Intr + 118823 118961 139 1 1 70 -2 179 0.322 6.22 4.06 Intr + 123079 123266 188 2 2 60 49 86 0.145 0.39 4.07 Intr + 127872 127982 111 2 0 36 110 63 0.649 2.96 4.08 Term + 131405 131611 207 1 0 35 54 304 0.838 18.06 4.09 PlyA + 133569 133574 6 1.05 5.00 Prom + 138000 138039 40 -6.35 5.01 Init + 143540 143582 43 1 1 90 116 70 0.999 9.06 5.02 Intr + 145457 145599 143 0 2 88 96 56 0.996 5.55 5.03 Intr + 146246 146404 159 1 0 85 63 159 0.996 12.36 5.04 Intr + 150703 150816 114 0 0 63 83 36 0.550 0.32 5.05 Intr + 151351 151517 167 0 2 66 92 139 0.575 9.94 5.06 Intr + 155571 155813 243 0 0 -19 72 199 0.198 3.29 5.07 Intr + 156096 156301 206 0 2 22 81 104 0.706 1.02 5.08 Intr + 158969 159408 440 0 2 58 29 202 0.298 3.61 5.09 Term + 167422 167523 102 0 0 73 50 31 0.007 -4.90 5.10 PlyA + 167835 167840 6 1.05 6.02 PlyA - 167866 167861 6 1.05 6.01 Sngl - 170022 169252 771 0 0 66 41 396 0.710 28.50 6.00 Prom - 170484 170445 40 -6.75 7.02 PlyA - 170652 170647 6 1.05 7.01 Sngl - 171877 171419 459 1 0 88 48 423 0.999 34.22 7.00 Prom - 172613 172574 40 -8.55 8.00 Prom + 172624 172663 40 -12.72 8.01 Init + 172730 172758 29 1 2 68 74 52 0.687 -0.56 8.02 Intr + 174293 174457 165 2 0 65 99 85 0.903 5.45 8.03 Intr + 175926 176059 134 0 2 48 83 101 0.942 4.97 8.04 Intr + 176497 177286 790 2 1 83 90 567 0.973 46.00 8.05 Intr + 181348 181423 76 1 1 53 97 51 0.977 1.10 8.06 Term + 183710 184414 705 1 0 92 39 562 0.992 44.22 8.07 PlyA + 185473 185478 6 1.05 9.02 PlyA - 186057 186052 6 1.05 9.01 Term - 197981 197744 238 1 1 30 45 252 0.925 9.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 18345 18106 240 0 0 46 42 276 0.893 13.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_1|177_aa KPKETVPTLAPKTLSVAAAFNEDEDDGFAVLGGNIVSDSSTVRFVAHQQHFQLPDVVDQE LPEATGQHVLCFFAALITSVGHQGLALEPSMNPVTKYLSQDSGGGGGDNSEFLSDILALQ AKHEMQQPEVLFPFLSWQPILIASNPYTSHMDTPNLISATPIQTSPLHECYVLTPIV >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_1|534_bp aagcctaaagaaactgttccaactcttgctccaaaaactctttcagtagcagcagctttt aatgaagatgaagatgatggctttgcggttcttggaggaaacattgtgagtgatagcagc acagtaagatttgttgcacatcagcagcacttccagctccctgacgttgtagatcaggaa cttccagaagccactgggcagcatgtgctttgtttttttgctgctctcataaccagtgtt gggcatcaaggtctggcccttgaaccttctatgaaccctgttacaaaatatctgagccag gattctggaggtggaggtggggacaatagcgagtttctgagtgacatccttgctttgcaa gctaagcacgagatgcagcagcctgaggtcctctttcctttcctctcctggcagccaatc cttattgcctccaatccttatacttcccacatggatactcccaacttgatctctgccact ccaatccagaccagccctctacatgaatgctatgttttgacaccaattgtctga >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_2|833_aa MLKDRVLVSVRLLFVQGLDESPGVFLERLQEAYRTHTPFDPAAPENSRAINLAFVAQAAP DIKRKLQKLEGFAGMNISQLLEVAQKVCDNREFEKQKQVAQAAERAPAQQAHLPLGDTTP PTAQLLLTTPLSEEYLLVSPSQPLENKTNSLLLDLQTLFPRVWAESNPLGLAKHHPPVVV ELLATALPVQELGYRVSAKKAQLCLPRVSYLGYEIKEKGHSPVLGRKPSCKSPLPPPRDS QGVAKGVLTQTLRPCRRPVACLSKRLDPVASGWPSCLGAVAATASLVQEADKLTLGQNLT LMAPHAVETLLRSASGKWMSNARILQYQNLLLDQPRLTFSPTMCLNPATLLPDPDFTTPV HDCQELLETTETGRPDLQDVPLKEVDTTMFTDGSSYLEQGVQKAGVAITTETDILWAQTL PAAGKIIKNKEEILALLEAIWLPQQVAVIHCKEPQREDTAIARGNQRADSAAQEAARLPV APLTLLPAVSFLQPDSPAHPEYSTEEERQALDFQASKNQEGWWIIPHSRIVMPRALGKTL ISRLHSTTHLGGVKWAQLLRSRFKIPHLQDLANQAALRCTACAQVNAKQRPKPRPGHCLW GDYQEKDTFSGWTEAFATRNETATTVVRLLLNEIIRRHGLPAAIGSNNGLAFTLSIARSV QDIIQPLVWGAHPNPVPDQMGPCHSFQPGDLVYVKKFQKEGLTLAWKRPHTVILSTPTAL KSLPVPMKNNAICRSWEWGASTLLQDLETLEARLDVEAPKVYVYSGPEAQLLIAPPELHS RFRRAAPLLVPLPASLSIAGSAAIGTAALVQGETGLMSLSQQVDADLSNLQQP >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_2|2502_bp atgctgaaggacagagtcctggtttctgtcaggcttctctttgtccaggggcttgatgag tcacctggagtctttttagaacgcctccaggaggcctatcgaactcacaccccttttgac ccggcggctcctgagaatagccgtgctattaatttggcatttgtggctcaggcagcccct gatattaaaagaaaattacaaaaactggagggatttgctggaatgaacattagccagctt ttagaagtagcccagaaagtttgtgacaatcgagagtttgaaaaacaaaaacaagtagct caggcagctgaaagggctcccgcccagcaggctcacctcccgttaggggacacaacgccc cctactgcccaactcctgctaactacccctctgtcagaggaatatcttttggtttcacca tcacaaccactggagaataaaactaattctctcctattggatctacagactctctttcct cgagtctgggccgagtcaaaccccctgggactagcaaagcatcatccgccggtagttgtg gaacttctggccactgccctgccagtccaggagcttgggtatcgggtctcagccaagaag gcccaactttgtcttcccagagtgtcctacctggggtatgagataaaggaaaaagggcac tcaccagtgctcggaaggaagccatcctgcaaatccccactcccaccaccaagagacagc caaggagtcgctaaaggggtactcactcaaactttgcggccatgtcgacgcccagtggcc tgtttatctaaaagactggatcctgttgcctccgggtggccaagttgtctgggagctgta gcagccacagcaagcctggttcaggaggctgataaactgactctaggtcaaaatttaacc cttatggcgcctcatgccgtagagactttgctgcgaagtgcttctggcaaatggatgtca aatgctcgcattttgcagtatcagaatttactgttagatcagcctcgtttaactttctct cccacaatgtgtttaaatccagctactttgctcccagatccagacttcaccacacctgtc catgactgccaggaactgttagagactacagaaactggccgacctgatctccaagatgtg cctttaaaggaggtggacaccaccatgtttacagacggtagcagctaccttgagcaggga gttcaaaaggctggtgtggccatcactacggagacagatatactgtgggcccagacacta ccagcagcaggaaagattatcaagaataaagaagaaattttagccctgcttgaagccatt tggctccctcagcaggtggctgtaattcactgcaaagaacctcaaagagaggacacagcc atagctcgtggtaaccaaagagcagactctgcagctcaggaggcagcccggctcccagtc gcacctctgaccctactgcctgcagtgtcctttctgcaacctgactcaccagctcaccca gaatactccacagaagaagaaagacaggctttggattttcaggccagtaaaaatcaggaa ggttggtggattattcctcattccagaatcgtcatgccccgagccctcgggaaaacttta atcagtcgtctgcattctaccacccatttgggaggagtaaaatgggcccagcttctgagg agccgtttcaagatcccccaccttcaggacttagctaaccaagcagctcttcggtgtaca gcttgtgctcaggtaaacgccaagcagcgtcctaaacccagaccaggccactgcctctgg ggagactaccaggagaaagacaccttttccggatggactgaggcatttgccaccagaaac gagactgccaccacggtagttaggcttttactcaatgagatcatccgtcggcatgggctg cctgctgccatagggtctaataacggactggccttcaccttgtccatagctcggtcagta caagacatcatccagccgcttgtctggggagctcatcccaatccagttcctgaccagatg gggccctgccactctttccagccaggtgacctggtgtatgttaaaaagttccagaaagaa ggactcactcttgcctggaaacgacctcataccgtcatcctcagcacgccgacggctctg aaaagcctgcccgtacccatgaagaacaacgcaatctgccggtcatgggagtggggagcg tcaaccttgctgcaggatttggaaactctggaagccagactggatgtggaagctccaaag gtctatgtgtatagcgggccagaagcacaacttctcattgctccccctgaattacattcc aggtttcgccgagcagccccactcctcgtaccccttccggccagtcttagcatagccgga tcagcagccattggcacggctgccctggttcagggagaaactgggctaatgtccctatct caacaagtagatgctgatttaagcaatctccaacagccataa >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_3|1275_aa MKVELCSFSGYKIYPGHGRRYARTDGKVFQFLNAKCESAFLSKRNPRQINWTVLYRRKHK KGQSEEIQKKRTRRAVKFQRAITGASLADIMAKRNQKPEVRKAQREQAISEGSYPLPKCP PAFLTSLDRLHQARKRECAPIRAQSPPLPCVTSGGAAAILSVASGSRALEKCSSPWLPTR PLFLRTKGAVEERSPKAQPFLPTVPPLLLQPPVGALREPRSAMSSEESYRAILRYLTNER EPYAPGTEGNVKRKIRKAAACYVVRGGTLYYQRRQRHRKTFAELEVVLQPERRRDLIEAA HLGPGGTHHTRHQTWHYLSKTYWWRGILKQVKDYIKQCSKCQEKLDRSRPISDVSEMLEE LGLDLESGEESNESEDDLSNFTSSPTTASKPAKKKPVSKHELVFVDTKGVVKRSSPKHCQ AVLKQLNEQRLSNQFCDVTLLIEGEEYKAHKSVLSANSEYFRDLFIEKGAVSSHEAVVDL SGFCKASFLPLLEFAYTSVLSFDFCSMADVAILARHLFMSEVLEICESVHKLMEEKQLTV YKKGEVQTVASTQDLRVQNGGTAPPVASSEGTTTSLPTELGDCEIVLLVNGELPEAEQNG EVGRQPEPQVSSEAESALSSVGCIADSHPEMESVDLITKNNQTELETSNNRENNTVSNIH PKLSKENVISSSPEDSGMGNDISAEDICAEDIPKHRQKVDQPLKDQENLVASTAKTDFGP DDDTYRSRLRQRSVNEGAYIRLHKGMEKKLQKRKAVPKSAVQQVAQKLVQRGKKMKQPKR DAKENTEEASHKCGECGMVFQRRYALIMHKLKHERARDYKCPLCKKQFQYSASLRAHLIR HTRKDAPSSSSSNSTSNEASGTSSEKGRTKREFICSICGRTLPKLYSLRIHMLKHTGVKP HACQVCGKTFIYKHGLKLHQSLHQSQKQFQCELCVKSFVTKRSLQEHMSIHTGESKYLCS VCGKSFHRGSGLSKHFKKHQPKPEVRGYHCTQCEKSFFEARDLRQHMNKHLGVKPFQCQF CDKCYSWKKDWYSHVKSHSVTEPYRCNICGKEFYEKALFRRHVKKATHGKKGRAKQNLER VCEKCGRKFTQLREYRRHMNNHEGVKPFECLTCGVAWADARSLKRHVRTHTGERPYVCPV CSEAYIDARTLRKHMTKFHRDYVPCKIMLEKDTLQFHNQGTQVAHAVSILTAGMQEQESS GPQELETVVVTGETMEALEAVAATEEYPSVSTLSDQSIMQVVNYVLAQQQGQKLSEVAEA IQTVKVEVAHISGGE >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_3|3828_bp atgaaggtcgagctgtgcagttttagcgggtacaagatctaccccggacacgggaggcgc tacgccaggaccgacgggaaggttttccagtttcttaatgcgaaatgcgagtcggctttc ctttccaagaggaatcctcggcagataaactggactgtcctctacagaaggaagcacaaa aagggacagtcggaagaaattcaaaagaaaagaacccgccgagcagtcaaattccagagg gccattactggtgcatctcttgctgatataatggccaagaggaatcagaaacctgaagtt agaaaggctcaacgagaacaagctatcagtgaagggtcctacccgcttccaaaatgtcca cccgcattcctgacgtcactggatcggctccatcaagccagaaaacgagagtgcgctcca atcagggctcagtccccgcccttgccttgcgtcacttccggtggtgcagcagccattttg tcagtcgccagcggatcccgcgcgctggagaagtgcagttccccctggttaccaactcgc ccgcttttccttcggactaagggagccgtcgaagagcgctcgccaaaggcgcagccgttt ctccctacggtgccgccgctcctcctgcagccgcccgtaggcgccctcagggagccgcgg tccgcgatgtcaagcgaggaaagctaccgggccatcctgcgttacctgacgaacgagcgc gagccgtatgcgccgggcaccgagggcaatgtcaagcgtaaaatccgaaaagctgccgcc tgctacgtggtgcgcggcgggactctgtattaccagcggcggcagcggcaccgcaagacc ttcgcggagctggaggtggtgctgcagccggagcgacgccgggacctcatcgaggcggcg cacctgggtcccggcggcactcaccacacccggcatcagacctggcactacttgtccaag acgtactggtggcgaggtatattgaagcaagtcaaagattacattaaacagtgtagcaaa tgccaggagaaactagatcgatcccgtccaatatcagatgtttcagaaatgttggaagaa ttgggactagaccttgaatctggagaagaaagtaatgaatcggaagatgacctgagcaac tttacttcatctccaactacagcatccaagcctgcaaaaaagaagccagtatccaaacat gaacttgtgtttgttgacaccaaaggagtggtaaaacgttcttctccaaaacattgtcag gctgtcttaaaacagctgaacgaacagagactttccaaccagttctgtgatgttactttg ttaattgaaggagaagagtacaaagctcataaatctgttttgtcagcaaatagcgagtat tttcgagatctttttattgagaaaggagctgtttccagtcatgaggctgtggtggatctt tctggtttttgtaaggccagcttccttcctttactggaatttgcctatacttctgtacta agttttgatttctgtagcatggctgatgtagccatcttagctcgtcatcttttcatgtca gaagtcttagagatttgtgaaagtgtacataagctaatggaagagaagcagctaacagta tataagaagggcgaagtacaaacagttgcatccacccaggacttacgagtacagaatgga ggtacagcacctcctgttgctagcagtgagggaaccacaacaagtttacctactgaactt ggggattgtgaaattgtactactggtaaatggagaattgccagaagctgagcagaatgga gaggtaggacgacagcctgagccccaggtttcttcagaggctgaatctgccctgtcatca gtaggatgtatagctgattcccatcctgaaatggagtctgttgatttaataacaaaaaac aaccagacagaactagaaacttcaaacaacagagaaaataacacagtttctaatatacac cctaaactttcaaaagagaatgtaattagtagctcgccagaggatagtggtatgggaaat gatatatcagctgaggatatttgtgccgaagacattccaaaacataggcagaaagttgac caacctttaaaagatcaggaaaatctagttgcatcaacagcaaagacagactttggccct gatgatgatacttatagaagcaggcttcgacaacgttctgttaatgaaggggcatatatt cgactacacaagggaatggagaaaaagctgcagaaacggaaagccgttcccaagtcagca gttcaacaggtggctcagaagttagttcaaagaggaaaaaagatgaaacagccaaaaaga gatgctaaagagaacacagaagaagcatctcataaatgtggggaatgtggaatggttttt cagagacgatacgcccttataatgcacaaactgaaacatgaaagagctagagattacaaa tgtccattgtgtaaaaaacagtttcagtacagtgcctctttgcgagcacatcttattcgt cataccagaaaagatgcaccctcttcatcctcgtccaattccacgtctaatgaagcatcg ggaacatcatctgagaagggcagaaccaagcgggaatttatatgttccatatgtggaaga acattacctaaattatattctctccgaatacatatgttaaagcacacaggtgtaaagcca catgcatgccaggtctgtggaaagacttttatctataagcatggtctaaaattacatcag agtcttcatcaatcacagaagcagttccagtgtgaactgtgtgttaagtcatttgttacc aaacggagtcttcaagaacatatgagtattcacacaggagagtccaagtacctttgctca gtttgtggaaagtcttttcataggggctctggactcagcaagcacttcaagaaacaccaa ccaaagcctgaggttcgaggctatcattgtactcaatgtgaaaaaagtttctttgaagct agagatcttcgccagcacatgaacaaacatcttggtgtgaagccattccagtgccaattt tgtgataagtgctatagttggaagaaagattggtattcccatgtgaagtctcattctgtc actgagccttataggtgtaatatatgtggcaaagaattttatgaaaaagctttgttcaga aggcatgtaaagaaagctacccatgggaagaaaggaagagcaaagcaaaacctggaacgg gtgtgtgaaaaatgtggaagaaaattcactcagctaagagagtataggagacacatgaac aaccatgaaggagttaagccatttgagtgcttaacatgtggagtagcttgggctgatgcc cgatctctaaaacgccatgtcagaacacatactggtgaacggccctatgtctgtcctgta tgtagcgaagcctacatagatgctcgaacactccgtaaacatatgactaaattccacaga gactatgtgccttgcaaaattatgctggaaaaagacacccttcagtttcataaccaagga actcaagtggcacatgctgttagcatcttaacagcaggcatgcaggaacaagaaagcagt ggtcctcaagaacttgagactgtggtagtgacaggagaaactatggaagctctggaagct gttgcagctactgaagagtatccatcggtatctacactttctgaccaaagtattatgcaa gtggttaattatgtattagcacaacagcaaggacagaagctatctgaagttgcagaagct attcaaactgttaaagtagaggtagcacatatttcaggaggagaatga >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_4|1584_aa MASWEGKDLTVPQPDTRKGLSFHLTRNTHRCGGATHPFIWCPTWRLFSRVKVRSSVVIED KLTRDPEYIYSQPCEQGTLDLKDWKRIGEELKQAGRKGNIIPLTVWNDWAIIKAALEPFQ TKEDSVSVSDAPGSCVIDCNEKTGRKSQKETESLHCEYVTEPVMAQSTQNVDYNQLQGVI YPETLKLEGKGPELVGPSESKPRGPSPLPAGQVPVTLQPQTQVKENKTQPPVAYQYWPPA ELQYLPPPESQYGYPGMPPALQGRAPYPQPPTVRLNPTASRSGQGGTLHAVIDEARKQGD LEAWRFLVILQLVQAGEETQVGAPARAETRCEPFTMKMLKDIKEGVKQYGSNSPYIRTLL DSIAHGNRLTPYDWESLAKSSLSSSQYLQFKTWWIDGVQEQVRKNQATKPTVNIDADQLL GTGPNWSTINQQSVMQNEAIEQVRAICLRAWGKIQDPGTAFPINSIRQGSKEPYPDFVAR LQDAAQKSITDDNARKVIVELMAYENANPECQSAIKPLKGKVPAGVDVITEYVKACDGIG GAMHKAMLMAQAMRGLTLGGQVRTFGKKCYNCGQIGHLKRSCPVLNKQNIINQAITAKNK KPSGLCPKCGKGKHWANQCHSKFDKDGQPLSGNRKRGQPQAPQQTGAFPVQLFVPQGFQG QQPLQKIPPLQGVSQLQQSNSCPAPQQAAPHTIKTFTLYLDQMATLIGQGRLRIITLCGN DPDKITVPFNKQQVRQAFISSGAWQIGLANFLGIIDNHYPKTKIFQFLKLTTWILPKITR REPLENALTVFTDGSSNGKAAYTGPKERVIKTPYQSAQRAELVAVITVLQDFDQPINIIS DSAYVVQATRDVETALIKYSTDDHLNQLFNLLQQTVRKRNFPFYITHIRAHTNLPGPLTK ANEQADLLVSSAFIKAQELLALTHVNAAGLKNKFDVTWKQAKDIVQHCTQCQVLHLSTQE AGVNPRGLCPNALWQMDGTHVPSFGRLSYVHVTVDTYSHFIWATCQTGESTSHVKKHLLS CFAVMGVPEKIKTDNGPGYCSKAFQKFLSQWKISHTTGIPYNSQGQAIVERTNRTLKTQL VKQKEGGDSKECTTPQMQLNLALYTLNFLNIYRNQTTTSAKQHLTGKKHSPHEGKLIWWK DNKNKTWEIGKVITWGRGFACVSPGENQLPVWIPTRHLKFYNEPIGDAKKRASTEMVTPV TWMDNPIEVYVNDSVWVPGPTDDRCPAKPEEEGMMINISIVYRYPPICLGRAPGCLMPAV QNWLVEVPTVSPNSRFTYHMVSGMSLRPRVNYLQDFSYQRSLKFRPKGKPCPKEIPKESK NTEVLVWEECVANSAVILQNNEFGTIIDWAPRGQFYHNCSGQTQSCPSAQVYPTAPKRQR PSRTGHDDDGGFVEKKRGKCGEKQERSDCYCVCVERTLRRFFPISVDGTYNRVLYRDIPL PRDGQETDAFLLSQLQEACLPLTLILLSTDPLRVSGWGTTQQPTSIDVKEEISRLTERTF CGKKVSKDKQDPRPCQESLKELEEEAEEEQRILQQSVVKTYEDMTLEELEDHEDEFNEED ECAIEMCRQQRLAEWKATKLKNKF >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_4|4755_bp atggcctcgtgggaagggaaagacctgaccgtcccccagcctgacacccgtaaaggtctc tcgttccaccttacgagaaacacccacaggtgtggaggggcaacccaccccttcatctgg tgcccaacgtggaggcttttctctagggtgaaggtacgctcgagcgtggtcattgaggac aagttgacgagagatcccgagtacatctacagtcagccttgcgaacaaggaactttagat ctaaaagattggaaaagaattggcgaggaactaaaacaagcaggtagaaagggtaatatc attccacttacagtatggaatgattgggccattattaaagcagctttagaaccatttcaa acaaaagaagatagcgtttcagtttctgatgcccctggaagctgtgtaatagattgtaat gaaaagacagggagaaaatcccagaaagaaacagaaagtttacattgcgaatatgtaaca gagccagtaatggctcagtcaacgcaaaatgttgactataatcaattacagggggtgata tatcctgaaacgttaaaattagaaggaaaaggtccagaattagtggggccatcagagtct aaaccacgagggccaagtcctcttccagcaggtcaggtgcccgtaacattacaacctcaa acgcaggttaaagaaaataagacccaaccgccagtagcttatcaatactggccgccggct gaacttcagtatctgccacccccagaaagtcagtatggatatccaggaatgcccccagca ctacagggcagggcgccatatcctcagccgcccactgtgagacttaatcctacagcatca cgtagtggacaaggtggtacactgcacgcagtcattgatgaagccagaaaacagggagat cttgaggcatggcggttcctggtaattttacaactggtacaggccggggaagagactcaa gtaggagcgcctgcccgagctgagactagatgtgaacctttcaccatgaaaatgttaaaa gatataaaggaaggagttaaacaatatggatccaactccccttatataagaacattatta gattccattgctcatggaaatagacttactccttatgactgggaaagtttggccaaatct tccctttcatcctctcagtatctacagtttaaaacctggtggattgatggagtacaagaa caggtacgaaaaaatcaggctactaagcccactgttaatatagacgcagaccaattgtta ggaacaggtccaaattggagcaccattaaccaacaatcagtgatgcagaatgaggctatt gaacaagtaagggctatttgcctcagggcctggggaaaaattcaggacccaggaacagct ttccctattaattcaattagacaaggctctaaagagccatatcctgactttgtggcaaga ttacaagatgctgctcaaaagtctattacagatgacaatgcccgaaaagttattgtagaa ttaatggcctatgaaaatgcaaatccagaatgtcagtcggccataaagccattaaaagga aaagttccagcaggagttgatgtaattacagaatatgtgaaggcttgtgatgggattgga ggagctatgcataaggcaatgctaatggctcaagcaatgagggggctcactctaggagga caagttagaacatttgggaaaaaatgttataattgtggtcaaatcggtcatctgaaaagg agttgcccagtcttaaataaacagaatataataaatcaagctattacagcaaaaaataaa aagccatctggcctgtgtccaaaatgtggaaaaggaaaacattgggccaatcaatgtcat tctaaatttgataaagatgggcaaccattgtcgggaaacaggaagaggggccagcctcag gccccccaacaaactggggcattcccagttcaactgtttgttcctcagggttttcaagga caacaacccctacagaaaataccaccacttcagggagtcagccaattacaacaatccaac agctgtcccgcgccacagcaggcagcgccacatacaattaagacttttacattgtacttg gatcaaatggctacattaattggtcagggaagattacgaataataacattgtgtggaaat gacccagataaaatcactgttcctttcaacaagcaacaagttagacaagcctttatcagt tctggtgcatggcagattggtcttgctaattttctgggaattattgataatcattaccca aaaacaaaaatcttccagttcttaaaattgactacttggattctacctaaaattaccaga cgtgaacctttagaaaatgctctaacagtatttactgatggttccagcaatggaaaagcg gcttacacagggccgaaagaacgagtaatcaaaactccgtatcaatcagctcaaagagca gagttggttgcagtcattacagtgttacaagattttgaccaacctatcaatattatatca gattctgcatatgtagtacaggctacaagggatgttgagacagctctaattaaatatagc acggacgatcatttaaaccagctattcaatttattacaacaaactgtaagaaaaagaaat ttcccattttatattactcatattcgagcacacactaatttaccagggcctttgactaaa gcaaatgaacaagctgacttactggtatcatctgcattcataaaagcacaagaacttctt gctttgactcatgtaaatgcagcaggattaaaaaacaaatttgatgtcacatggaaacag gcaaaagatattgtacaacattgcacccagtgtcaagtcttacacctgtccactcaagag gcaggagttaatcccagaggtctgtgtcctaatgcgttatggcaaatggatggcacgcat gttccttcatttggaagattatcatatgttcatgtaacagttgatacttattcacatttc atatgggcaacttgccaaacaggagaaagtacttcccatgttaaaaaacatttattatct tgttttgctgtaatgggagttccagaaaaaatcaaaactgacaatggaccaggatattgt agtaaagctttccaaaaattcttaagtcagtggaaaatttcacatacaacaggaattcct tataattcccaaggacaggccatagttgaaagaactaatagaacactcaaaactcaatta gttaaacaaaaagaagggggagacagtaaggagtgtaccactcctcagatgcaacttaat ctagcactctatactttaaattttttaaacatttatagaaatcagactactacttctgca aaacaacatcttactggtaaaaagcacagcccacatgaaggaaaactaatttggtggaaa gataataaaaataagacatgggaaatagggaaggtgataacgtgggggagaggttttgct tgtgtttcaccaggagaaaatcagcttcctgtttggatacccactagacatttgaagttc tacaatgaacccatcggagatgcaaagaaaagggcctccacagagatggtaaccccagtc acatggatggataatcctatagaagtatatgttaatgatagtgtatgggtacctggcccc acagatgatcgctgccctgccaaacctgaggaagaagggatgatgataaatatttccatt gtgtatcgttatcctcctatttgcctagggagagcaccaggatgtttaatgcctgcagtc caaaattggttggtagaagtacctactgtcagtcctaacagtagattcacttatcacatg gtaagcgggatgtcactcaggccacgggtaaattatttacaagacttttcttatcaaaga tcattaaaatttagacctaaagggaaaccttgccccaaggaaattcccaaagaatcaaaa aatacagaagttttagtttgggaagaatgtgtggccaatagtgcggtgatattacaaaac aatgaattcggaactattatagattgggcacctcgaggtcaattctaccacaattgctca ggacaaactcagtcgtgtccaagtgcacaagtgtacccaacagctccgaagagacagcga ccatcgagaacgggccatgatgacgatggcggttttgtcgaaaagaaaagggggaaatgt ggggaaaagcaagagagatcagattgttactgtgtctgtgtagaaagaaccctaaggcgg tttttccctatctcagtagatggaacgtacaatcgggttttataccgagacattccattg cccagggacgggcaggagacagatgccttcctcttgtctcaactgcaagaggcatgcctt cctcttacactaatcctcctcagcacagaccctttacgggtgtcgggctgggggacgaca caacaaccgaccagcattgatgttaaagaagagatatcaagactgacagaaaggactttt tgtggcaaaaaggtatcaaaagataaacaagacccaaggccatgccaggaaagtctgaaa gaattggaagaggaggcagaagaggagcagcgcatcctccagcagtcagtggtgaaaaca tatgaagatatgactttggaagagctggaggatcatgaagacgagtttaatgaggaggat gaatgtgctattgaaatgtgcagacagcagagactggctgagtggaaagcaactaaactg aagaataaattttga >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_5|538_aa MAVARVDAALPPGEGSVVNWSGQGLQKLGPNLPCEADIHTLILDKNQIIKLENLEKCKRL IQLSVANNRLVRMMGVAKLTLLRVLNLPHNSIGCVEGLKELVHLEWLNLAGNNLKTLLLH GNIITSLRMAPAYLPRSLAILSLAENEIRDLNEISFLASLTELEQLSIMNNPCVMATPSI PGFDYRPYIVSWCLNLRVLDGYVISQKERPIVDKSTKMRKNQCKKAENSKNQSASSPAKD HNCLPAKEQNWTENEFHKLTEVGFRIITNSVELKEHILTQCKEAKNLEQSDRENGTKLGN TLQDIIQENVPNLARQANIQIQEIKRTPQRYSSRRATPRHIIARFAKVEMKEKMLRATIL EVLAKAIRQEKEIKGIQIGREEVKLSLFADDMIVYLENPIVPAQNLLKLISNFSKVSGYK INVQKSQAFLYNIIESLIMSELPFTIAAKRLKYLGIQLTRDVKDLFKENYKPVLKEIRED TNKWKNIPCSCIGRINIVKMAILPKENVCRLRADFMKILNDFDLVHRDVSMEEYFFFF >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_5|1617_bp atggcggtggcgcgcgtggacgcggctttgcctcccggagaaggatcagtggtcaattgg tcaggacagggactacagaaattaggtccaaatttaccctgtgaagctgatattcacact ttgattctggataaaaatcagattattaaattggaaaatctggagaaatgcaaacgatta atacagttatcagtagctaataatcggctggttcggatgatgggtgtggccaagctgacg ttgcttcgtgtattaaatttgcctcataatagcattggctgtgtggaagggctaaaggaa ctagtacatctggaatggctgaatttggcaggaaataatcttaagaccctgcttttacat ggaaacatcatcacctctcttagaatggcacctgcttacctacccagaagtcttgctata ctttctttggcagaaaatgaaatccgagacttaaatgagatctcttttttggcatcctta actgaattggaacagttgtcgattatgaacaatccttgtgtgatggcaacaccatccatc ccaggatttgactatcggccgtacatcgtcagctggtgcctaaacctcagagtcctagat ggatatgtgatttctcagaaggaaagaccaatagtagataaatccacgaagatgaggaaa aaccagtgcaaaaaggctgaaaattccaaaaaccagagtgcttcttctcccgcaaaggat cacaactgcttgccagcaaaagaacaaaactggacagagaatgagtttcacaaattgaca gaagtaggcttcagaataataacaaactccgttgagctaaaggagcatattctaacccaa tgcaaggaagctaagaaccttgaacaaagtgacagggagaatggaaccaagttaggaaac actcttcaggatattatccaggagaacgtccccaacctagcaagacaggccaacattcaa attcaggaaataaagagaacaccacaaagatactcctcgagaagagcaaccccaagacac ataattgccagattcgccaaagttgaaatgaaggaaaaaatgttaagggcaaccatattg gaagttttggccaaggcaatcaggcaagagaaagaaataaagggtattcagataggaaga gaggaagtcaaattgtctctgtttgcagatgacatgattgtatatttggaaaaccccatt gttccagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcattcctatacaacataatagagagcctaatcatgagt gaacttccattcacaattgctgcaaagagactaaaatacctaggaatacaacttacaagg gatgtaaaggacctcttcaaggagaactacaaaccagtgctcaaggaaataagagaggac acaaacaaatggaaaaacattccatgctcatgcataggaagaatcaatattgtgaaaatg gccatactgcccaaagagaatgtatgcagacttagagcagatttcatgaagattctaaat gattttgatttagtccacagggatgtttccatggaagaatacttctttttcttttga >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_6|256_aa MKAEIKMFFETNKNKDTTYQNLWHTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRRQEITNIRAELKEIETQKTLQTINESRSWFFERINKIDRPLPRLIKKKR EKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENQEEMDKFLDTYTLPRLNQEE VESLNRPITGSEIVAIINTLPTKKSPGPDGFTAKFYQRYKEELVPFLLKLFQSIEKEGTK LEASHYLTSNYTTRLQ >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_6|771_bp atgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacacaacataccag aatctctggcacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaagaactagaa aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaacatcagagca gaactgaaggaaatagagacacaaaaaacccttcaaacaattaatgaatccaggagctgg ttttttgaaaggatcaacaaaattgatagaccactaccaagactaataaagaagaaaaga gagaagaatcaaatagacacaataaaaaatgacaaaggggatatcaccaccgatcctaca gaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaat caagaagaaatggataaattcctcgacacatacaccctcccaaggctaaaccaggaagaa gttgaatctctgaacagaccaataacaggctctgaaattgtggcaataatcaatacctta ccaaccaaaaagagtccaggaccagatggattcacagccaaattctaccagaggtacaag gaggaactggtaccattccttctgaaactattccaatcaatagaaaaagagggaacaaag ctggaggcatcacactacctgacttcaaactatactacaaggctacagtaa >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_7|152_aa MGKKQSRKTGNSKNQSTSPPPKECSSSPATEQSWTENDFDELREEGFRRSNYSELQEEIE TKGKEVKNFEKNLDECITRIINTEKCLKELMELKAKARELREECRRLRSRCDQLEERVSV MEDEMNEMKREGKFTEKRIKRNEQSLQEIGTM >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_7|459_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaaccagagcacctctcctcct ccaaaggaatgcagttcctcaccagcaacggaacaaagctggacggagaatgactttgac gagctgagagaagaaggcttcagaagatcaaactactccgagctacaggaggaaattgaa accaaaggcaaagaagttaaaaactttgaaaaaaatttagacgaatgtataactagaata atcaatacagagaagtgcttaaaggagctgatggagctgaaagccaaggctcgagaacta cgtgaagaatgcagaagactcaggagccgatgcgatcaactggaagaaagggtatcagtg atggaagatgaaatgaatgaaatgaagcgagaagggaagtttacagaaaaaagaataaaa agaaacgaacaaagcctccaagaaatagggactatgtga >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_8|632_aa MPRPASACARLKAEWLYSQGKGRAYRPGQHIQLVQYLATVCPLTSTLGLQTAEDAKLEKI LSKQRFHQRQLMNQSQNEELSPLVPVETRASLIPEHSSPVQDCQISQESEPVIQVNSWVG INSNDDQLFAVKNNFPASVHTTRYSRNDLHLEDIQTDEDKLNCSLLSSESTFMPVASGLS PLSPTVELRLQGINLGLEDDGVADESVKGLESQVLDKEEEQPLWAANENSVQMMRSEINT EVNEKAGLLPCPEPTIISAILKDDNHSLTFFPESTEQKQSDIKKPENTQPENKETISQAT SEKLPMILTQRSVALGQDKVALQKLNDAATKLQACWRGFYARNYNPQAKDVRYEIRLRRM QEHIVCLTDEIRRLRKERDEERIKKFVQEEAFRFLWNQVRSLQVWQQTVDQRLSSWHTDV PPISSTLVPSKHPLFTQSQESSCDQNADWFIASDVAPQEKSLPEFPDSGFHSSLTEQVHS LQHSLDFEKSSTEGSESSIMGNSIDTVRYGKESDLGDVSEEHGEWNKESSNNEQDNSLLE QYLTSVQQLEDADERTNFDTETRDSKLHIACFPVQLDTLSDGASVDESHGISPPLQGEIS QTQENSKLNAEVQGQQPECDSTFQLLHVGVTV >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_8|1899_bp atgcctcgccctgcttcggcttgcgcacgtttgaaagctgaatggctctatagtcaaggc aaggggagagcatatcggcctggccagcacatccagcttgtccaatatctggctacagtc tgccccctcacttctacactaggtcttcaaactgcagaggatgccaaactagagaagatt ttgagcaaacagaggtttcaccagaggcagttgatgaaccaaagccaaaatgaagagttg tctcctcttgttcctgttgaaacaagggcatcccttattcctgagcattcaagccctgtt caagattgccagatatcccaggaaagtgaacccgtcattcaagtgaattcttgggttggg ataaacagtaatgatgatcagttatttgcggttaagaataattttccagcctctgtacac actacgagatattctcgaaatgatctgcacctggaagacatacagacggatgaggacaag ttaaactgtagtcttctctcttcagagtctacttttatgccagttgcatcaggactgtct ccactatcacctacagttgagctgaggctgcagggcattaacttgggcctagaagatgat ggtgttgcagatgaatctgtgaaagggctggaaagccaggtgttggataaggaagaggaa cagcctttatgggctgcaaatgagaattctgttcaaatgatgagaagtgaaatcaataca gaggtaaatgagaaagctggactattaccttgtcctgagccaacaataatcagtgctatc ttgaaggatgataaccacagtcttacattttttcctgagtcaactgagcagaaacaatca gacataaagaaaccagaaaatacacaaccagaaaataaagaaaccatatctcaagcaact tcagagaaacttcccatgattttaacccagagatctgttgctttgggacaagacaaagtt gcccttcagaaattaaatgatgcagccaccaagcttcaggcctgttggcggggattttat gccaggaactacaaccctcaagccaaagatgtgcgttacgaaatccggctacgcagaatg caagagcacattgtctgcttaactgatgaaataaggagattacgaaaagaaagagatgaa gaacgtattaaaaaatttgtacaagaagaagctttcagattcctttggaaccaggtaagg tctctacaggtttggcaacagacagtggaccagcgtctaagttcctggcatactgatgtt cctcctatatcaagtactcttgtgccatcgaaacatccattatttacccaaagccaggag tcctcttgtgatcaaaatgctgattggtttattgcttctgatgtagctcctcaagagaaa tcattaccagaatttccagactctggttttcattcctctctaacagaacaagttcattca ttgcagcattctttggattttgagaaaagttccacagaaggcagtgaaagctccataatg gggaattccattgacacagtcagatatggcaaagaatcagatttaggggatgttagtgaa gaacatggtgaatggaataaggaaagctcaaataacgagcaggacaatagtctgcttgaa cagtatttaacttcagttcaacagctggaagatgctgatgagaggaccaattttgataca gagacaagagatagcaaacttcacattgcttgtttcccagtacagttagatacattgtct gacggtgcttctgtagatgagagtcatggcatatctcctcctttgcaaggtgaaattagc cagacacaagagaattctaaattaaatgcagaagttcaggggcagcagccagaatgtgat tctacatttcagctattgcatgttggtgttactgtgtag >gi568815595r:101581138_101786557|GENSCAN_predicted_peptide_9|79_aa XPGSEVGTLALYAAKLLLATRHCSCRPADRGLAPLTTQLLVNFGRLSFLGASTYLKAFRL SAKVPTTAVDSSRAVFPVE >gi568815595r:101581138_101786557|GENSCAN_predicted_CDS_9|240_bp nngcctggaagcgaagttggcaccctggcgctctacgccgccaagctcctgctcgcgacg cgacactgcagctgtcgccccgcggatcgaggtctggcccctctcacgacccaactcctc gtgaacttcggacgtctgagtttcctgggagcatccacctacctcaaggcttttcgcttg tcagcgaaagtgcccacaaccgcagttgactctagtagggcagtctttcctgtggagtga