GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:24:29 Sequence gi568815592r:33316653_33517668 : 201016 bp : 48.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 264 10 255 0 0 95 29 229 0.634 12.93 1.00 Prom - 1292 1253 40 -8.76 2.08 PlyA - 1928 1923 6 1.05 2.07 Term - 2150 2091 60 2 0 121 37 95 0.685 5.50 2.06 Intr - 2567 2345 223 1 1 109 116 -17 0.635 1.33 2.05 Intr - 3202 2728 475 2 1 151 90 385 0.998 37.12 2.04 Intr - 3528 3359 170 2 2 52 91 425 0.599 38.79 2.03 Intr - 3939 3717 223 1 1 90 44 175 0.568 10.49 2.02 Intr - 4915 4084 832 1 1 76 91 990 0.983 89.07 2.01 Init - 5273 5067 207 2 0 53 83 250 0.847 19.82 2.00 Prom - 8746 8707 40 -3.26 3.00 Prom + 9234 9273 40 -7.56 3.01 Init + 14418 14504 87 2 0 69 98 67 0.736 4.44 3.02 Term + 22163 22843 681 1 0 -39 48 760 0.487 52.96 3.03 PlyA + 23709 23714 6 1.05 4.02 PlyA - 30981 30976 6 1.05 4.01 Sngl - 49576 48896 681 1 0 73 36 760 0.996 65.49 4.00 Prom - 63263 63224 40 -4.86 5.03 PlyA - 64836 64831 6 1.05 5.02 Term - 74957 74752 206 0 2 105 42 105 0.926 5.13 5.01 Init - 75444 75018 427 0 1 80 80 103 0.603 3.86 5.00 Prom - 79263 79224 40 -3.06 6.02 PlyA - 81819 81814 6 1.05 6.01 Sngl - 83904 83407 498 0 0 87 47 520 0.991 43.85 6.00 Prom - 85397 85358 40 -3.16 7.00 Prom + 86167 86206 40 -9.85 7.01 Init + 86622 86715 94 2 1 40 93 8 0.610 -2.91 7.02 Intr + 86833 86883 51 2 0 78 92 73 0.938 5.58 7.03 Intr + 87077 87477 401 0 2 94 94 282 0.729 23.62 7.04 Intr + 88200 88979 780 2 0 76 53 755 0.981 62.48 7.05 Intr + 89544 89834 291 2 0 114 81 271 0.982 26.33 7.06 Intr + 89940 90013 74 2 2 54 76 89 0.983 2.50 7.07 Intr + 90148 90223 76 1 1 75 86 95 0.917 7.52 7.08 Intr + 93741 93893 153 2 0 38 64 87 0.496 1.57 7.09 Intr + 95856 95937 82 2 1 95 99 7 0.902 1.71 7.10 Intr + 96046 96141 96 2 0 96 81 26 0.821 2.68 7.11 Intr + 96544 96644 101 2 2 69 98 45 0.988 3.43 7.12 Intr + 96757 96905 149 0 2 103 64 100 0.999 8.13 7.13 Intr + 97084 97179 96 1 0 98 94 94 0.999 10.12 7.14 Intr + 97389 97457 69 0 0 76 93 49 0.756 2.50 7.15 Intr + 97591 97714 124 1 1 79 94 30 0.745 3.29 7.16 Intr + 97825 97892 68 0 2 98 117 53 0.878 6.90 7.17 Intr + 98073 98184 112 0 1 103 2 27 0.759 -4.02 7.18 Intr + 98319 98492 174 2 0 51 84 80 0.730 4.04 7.19 Intr + 98583 98677 95 2 2 98 82 139 0.889 13.06 7.20 Intr + 98938 99018 81 1 0 79 94 79 0.799 6.35 7.21 Term + 99158 99446 289 2 1 123 47 73 0.904 1.55 7.22 PlyA + 99768 99773 6 1.05 8.06 PlyA - 99918 99913 6 1.05 8.05 Term - 100124 99998 127 1 1 65 45 103 0.905 1.46 8.04 Intr - 100317 100268 50 0 2 116 89 -16 0.749 -1.22 8.03 Intr - 100490 100445 46 1 1 87 81 58 0.813 3.41 8.02 Intr - 100658 100599 60 1 0 92 84 77 0.973 5.55 8.01 Init - 101016 100829 188 0 2 28 91 294 0.968 20.23 8.00 Prom - 102721 102682 40 -5.86 9.00 Prom + 102841 102880 40 -11.14 9.01 Init + 103613 103679 67 1 1 52 94 64 0.454 3.16 9.02 Intr + 106825 106946 122 2 2 123 105 163 0.884 21.71 9.03 Intr + 109146 109251 106 2 1 87 64 126 0.996 9.89 9.04 Intr + 111939 112108 170 1 2 96 89 20 0.526 2.57 9.05 Intr + 115293 115380 88 2 1 90 66 5 0.505 -1.96 9.06 Intr + 115509 115600 92 1 2 105 109 48 0.991 8.31 9.07 Intr + 116033 116193 161 1 2 126 23 162 0.563 12.29 9.08 Intr + 118500 118653 154 0 1 86 69 78 0.679 5.77 9.09 Intr + 118863 118961 99 2 0 95 67 81 0.983 7.01 9.10 Intr + 121016 121639 624 1 0 66 113 761 0.982 68.84 9.11 Intr + 121767 121911 145 2 1 75 52 195 0.991 14.46 9.12 Intr + 122123 122267 145 0 1 83 85 192 0.999 17.74 9.13 Intr + 124077 124313 237 0 0 106 61 285 0.996 24.33 9.14 Intr + 124521 124722 202 0 1 83 82 188 0.968 16.99 9.15 Intr + 124929 125107 179 2 2 42 99 218 0.921 17.02 9.16 Intr + 125801 125842 42 2 0 109 94 77 0.995 7.76 9.17 Intr + 126237 127308 1072 0 1 89 96 704 0.922 61.53 9.18 Intr + 127792 127965 174 0 0 73 63 260 0.983 22.14 9.19 Intr + 129923 130134 212 1 2 32 94 396 0.982 32.31 9.20 Intr + 131191 131281 91 1 1 81 44 176 0.999 12.50 9.21 Intr + 132525 132648 124 2 1 122 96 -62 0.086 -1.84 9.22 Term + 138378 139870 1493 1 2 121 42 779 0.869 67.33 9.23 PlyA + 140868 140873 6 1.05 10.06 PlyA - 141519 141514 6 1.05 10.05 Term - 149786 149661 126 2 0 96 44 69 0.648 1.58 10.04 Intr - 182068 182009 60 1 0 90 71 49 0.351 2.33 10.03 Intr - 186155 185870 286 1 1 66 45 175 0.398 8.34 10.02 Intr - 195855 195789 67 1 1 105 61 8 0.118 -2.24 10.01 Init - 197699 197666 34 2 1 77 82 70 0.583 5.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 48275 48261 15 2 0 144 49 13 0.833 1.24 S.002 Init - 48429 48373 57 0 0 103 81 121 0.826 12.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_1|85_aa MEPSPLSPSGAALPLPLSLAPPPLPLPAAAVVHVSFPEVTSALLESLNQQRLQGQLCDVS IRVQGREFRAHRAVLAASSPYFHDQ >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_1|255_bp atggagccatctcctctgtctcccagtggggcagcacttcccctgccgctgtcgctggct ccgcccccactacccctgccagcagctgcagtggtacatgtgtccttccctgaggtgacc agtgccctcttggagtccctcaatcagcagcgtctgcagggccagctctgcgatgtatct atcagagtgcagggccgggagttccgggctcatcgggctgtcctggctgcctcctcccct tacttccatgatcag >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_2|729_aa MATANSIIVLDDDDEDEAAAQPGPSHPLPNAASPGAEAPSSSEPHGARGSSSSGGKKCYK LENEKLFEEFLELCKMQTADHPEVVPFLYNRQQRAHSLFLASAEFCNILSRVLSRARSRP AKLYVYINELCTVLKAHSAKKKLNLAPAATTSNEPSGNNPPTHLSLDPTNAENTASQSPR TRGSRRQIQRLEQLLALYVAEIRRLQEKELDLSELDDPDSAYLQEARLKRKLIRLFGRLC ELKDCSSLTGRVIEQRIPYRGTRYPEVNRRIERLINKPGPDTFPDYGDVLRAVEKAAARH SLGLPRQQLQLMAQDAFRDVGIRLQERRHLDLIYNFGCHLTDDYRPGVDPALSDPVLARR LRENRSLAMSRLDEVISKYAMLQDKSEEGERKKRRARLQGTSSHSADTPEASLDSGEVWM GAETDDEDDEESDEEEEEEEEEEEEEATDSEEEEDLEQMQEGQEDDEEEDEEEEAAAGKD GDKSPMSSLQISNEKNLEPGKQISRSSGEQQNKGRIVSPSLLSEEPLAPSSIDAESNGEQ PEELTLEEESPVSQLFELEIEALPLDTPSSVETDISSSRKQSEEPFTTVLENGAGMVSST SFNGGVSPHNWGDSGPPCKKSRKEKKQTGSGPLGNSYVERQRSVHEKNGKKICTLPSPPS PLASLAPVADSSTRVDSPSHGLVTSSLCIPSPARLSQTPHSQPPRPGTCKTSVATQCDPE EIIVLSDSD >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_2|2190_bp atggccaccgctaacagcatcatcgtgctggatgatgatgacgaagatgaagcagctgct cagccagggccctcccacccactccccaatgcggcctcacctggggcagaagcccctagc tcctctgagcctcatggggccagaggaagcagtagttcgggcggcaagaaatgctacaag ctggagaatgagaagctgttcgaagagttccttgaactttgtaagatgcagacagcagac caccctgaggtggtcccattcctctataaccggcagcaacgtgcccactctctgtttttg gcctcggcggagttctgcaacatcctctctagggtcctgtctcgggcccggagccggcca gccaagctctatgtctacatcaatgagctctgcactgttctcaaggcccactcagccaaa aagaagctgaacttggcccctgccgccaccacctccaatgagccctctgggaataaccct cccacacacctctccttggaccccacaaatgctgaaaacactgcctctcagtctccaagg acccgtggttcccggcggcagatccagcgtttggagcagctgctggcgctctatgtggca gagatccggcggctgcaggaaaaggagttggatctctcagaattggatgacccagactcc gcatacctgcaggaggcacggttgaagcgtaagctgatccgcctctttgggcgactatgt gagctgaaagactgctcttcactgaccggccgtgtcatagagcagcgcatcccctaccgt ggcacccgctacccagaggttaacaggcgcattgagcggctcatcaacaagccagggcct gataccttccctgactatggggatgtgcttcgggctgtagagaaggcagctgcccgacac agccttggcctcccccgacagcagctccagctcatggctcaggatgccttccgagatgtg ggcatcaggttacaggagcgacgtcacctcgatctcatctacaactttggctgccacctc acagatgactataggccaggcgttgaccctgcactatcagatcctgtgttggcccggcgc cttcgggaaaaccggagtttggccatgagtcggctggatgaggtcatctccaaatatgca atgttgcaagacaaaagtgaggagggcgagagaaaaaagagaagagctcggctccaaggc acctcttcccactctgcagacacccccgaagcctccttggattctggtgaggtgtggatg ggagctgagacagatgacgaagacgatgaggagagtgatgaggaagaggaggaggaggag gaagaagaagaggaggaggccacagattctgaagaggaggaggatctggaacagatgcag gagggtcaggaggatgatgaagaggaggacgaagaggaagaagcagcagcaggtaaagat ggagacaagagccccatgtcctcactacagatctccaatgaaaagaacctggaacctggc aaacagatcagcagatcttcaggggagcagcaaaacaaaggacgcatagtgtcaccatcg ttactgtcagaagaacccctggccccctccagcatagatgctgaaagcaatggagaacag cctgaggagctgaccctggaggaagaaagccctgtgtctcagctctttgagctagagatt gaagctttgcccctggataccccttcctctgtggagacggacatttcctcttccaggaag caatcagaggagcccttcaccactgtcttagagaatggagcaggcatggtctcttctact tccttcaatggaggcgtctctcctcacaactggggagattctggtcccccctgcaaaaaa tctcggaaggagaagaagcaaacaggatcagggccattaggaaacagctatgtggaaagg caaaggtcagtgcatgagaagaatgggaaaaagatatgtaccctgcccagcccaccttcc cccttggcttccttggccccagttgctgattcctccacgagggtggactctcccagccat ggcctggtgaccagctccctctgcatcccttctccagcccggctgtcccaaaccccccat tcacagcctcctcggcctggtacttgcaagacaagtgtggccacacaatgcgatccagaa gagatcatcgtgctctcagactctgattag >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_3|255_aa MARSRLTATSASRVQTILLPQPPKSLELQCPVKEHQIFEEDFRHEQKPKIKSKNDTALCS SPSLIVWRRAAHFWCLALELNNHHVEQKGKTKTTKKLPQRTTSNVFAMFDQSQIQEFKEA FNMIDQNRDGFINKEDLHDMLVSLGKNPTDAYLDAIMNEAPGPIDFTMFLTIFGEKLNGT DPEDVIGNAFACFDEEATGIIQEDYLRELLITMWDRFTDEEVDELYREAPINKKGNFNYI EFTCILKHGAKDKDD >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_3|768_bp atggcacgatctcggctcactgcaacctccgcctcccgggttcaaaccattctcctgcct cagcctcccaagtcgctggaattacagtgcccagttaaggaacaccagatatttgaggaa gacttcagacatgagcaaaaacccaaaattaaaagtaaaaacgacacagcattgtgctct tcgccttccctcatcgtctggcgcagggcagcccacttctggtgtttggcgctggaatta aacaaccaccatgtggagcaaaaaggcaagaccaagaccaccaaaaagctccctcagcgc acaacatccaacgtgtttgccatgtttgaccagtcacagattcaggagttcaaagaggcc ttcaacatgattgatcagaacagagatggtttcatcaacaaagaagatttgcatgatatg cttgtttccctagggaagaatcccaccgatgcataccttgatgccataatgaatgaggca ccagggcccatcgatttcaccatgttcctcaccatatttggtgagaagttaaatggcaca gatcctgaagatgtcattggaaatgcttttgcttgctttgatgaagaagcaacaggcatt attcaggaagattacctgagagagctgctgataaccatgtgggatcggtttacggatgag gaagtggatgagctgtacagagaagcgcctattaacaaaaaggggaatttcaattacatc gagttcacatgcatcctgaaacatggagcaaaagacaaagacgactga >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_4|226_aa MSVPLLNDAATVSGAERETAVVIFLHGLGDTGHSWADALSTIRLPHVKYICSHEPRIPVT LNMKMVMPSWFDLMGLSPDAPEDEAGIKKAAENIKALIEHEMKNGIPANQIILGGFSQGR ALSLYMALTCPHPLAGILALSCWPPLHRAFPQAANGSAKDLAILQCHGELDPMVPVRFGA LMAEKLRSVVTPARVQFQTYLGVMHSSCPQEMAAVKEFLEKLLPPV >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_4|681_bp atgtctgtgcccctgctcaacgatgctgccaccgtgtctggagctgagcgggaaacggcc gtggttatttttttacatggacttggagacacagggcacagctgggctgacgccctctcc accattcggctccctcacgtcaagtacatctgttcccatgagcctaggatccctgtgacc ctcaacatgaagatggtgatgccctcctggtttgacctgatggggctgagtccagatgcc ccagaggacgaggctggcatcaagaaggcagcagagaacatcaaggccttgattgagcat gaaatgaagaacgggatccctgccaatcaaatcatcctgggaggcttttcacagggccgg gccctgtccctctacatggccctcacctgcccccaccctctggctggcatcctggctttg agctgctggccgcctctgcaccgggccttcccccaggcagctaatggcagtgccaaggac ctggccatcctccagtgccatggggagctggaccccatggtgcccgtacggtttggggcc ctgatggctgagaagctccggtctgttgtcacacctgccagggtccagttccagacatac ctgggtgtcatgcacagctcctgtcctcaggagatggcagctgtgaaggaatttcttgag aagctgctgcctcctgtctaa >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_5|210_aa MTKADREPGARQHRSGQPSSPSPGQDTCAAPTHLRIHVHLGPTRAGGMQWKRTGCRGSWD SRSLARSRSRVENSHGSGRYQPRATPATFEFQRPPPTLTALCTQGPIVAVATAEGQSQRS PPPEATPRPTGALSFLFRRRRRAAIVLDFRSAPRRRSLRQKMRGRRRTNATGQAKLVAAG GEGAGTLEERNDACANEDPCRETGGTSASW >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_5|633_bp atgacaaaggccgaccgggagccgggggcgcgacagcatcggagcggtcagccttcgtcc ccatccccagggcaggacacctgcgccgcccctactcacctgcggatccatgtccacctc ggtcccacacgcgccgggggaatgcagtggaagagaactgggtgccggggatcctgggac tcgcgttctctcgcccgctcgcgaagcagggtagagaactcgcacggctccggccgctac cagccccgcgccacacccgccacttttgaattccaacggccaccacccactctcaccgcg ctctgcacgcagggaccaatcgtcgctgtcgccacagccgagggccaatcgcagcgttct ccgccacccgaagccacaccccgcccgacaggcgccttgtcttttctgtttcgcaggcgc aggagagcggcaatagtgctggacttccgctcggctccccgccgtcgctcgctacgtcag aaaatgcgtggacgtcgccgcacgaacgcaactggccaagcgaaactggtggcggccgga ggagaaggggcggggacgctggaggaaagaaatgacgcgtgcgcaaacgaggacccgtgc cgggagacaggcgggactagcgcctcctggtga >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_6|165_aa MPPKFDPNEIKVVYLRCTRGEVGATSALAPKIGPLGLSPKKVGDDIAKATGDWKGLRITV KLTIENRQAQIEVVPSASAPIIKALKKPPRDRKKQKNIKHNGNITFDEIVNIARQMRHRS LARELSGTIKEIPGTAQSMGCNVDGHHPHDIIDDINSGAVECPAS >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_6|498_bp atgccaccgaagttcgaccccaacgagatcaaggtcgtatacctgaggtgcaccagaggt gaagtcggtgccacttctgccctggcccccaagatcggccccctgggtctgtctccaaaa aaggttggtgatgacattgccaaggcaacgggtgactggaagggcctgaggattacagtg aaactgaccattgagaacagacaggcccagattgaggtggtgccttctgcctctgccccg atcatcaaagccctcaagaaaccaccaagagacagaaagaaacagaaaaacattaaacac aatgggaatatcacttttgatgagatcgtcaacattgctcgacagatgcggcaccgatcc ttagccagagaactctctggaaccattaaagagattccggggactgcccagtctatgggc tgtaatgttgatggccaccaccctcatgacatcatagatgacatcaacagtggtgctgtg gaatgcccagctagttaa >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_7|1151_aa MIILPLLSPISWAAQKVSKKTGPRCSTAIATGLKNQKPVPAVPVQKSGTSGVPPMAGGKK PSKRPAWDLKGQLCDLNAELKRCRERTQTLDQENQQLQDQLRDAQQQVKALGTERTTLEG HLAKVQAQAEQGQQELKNLRACVLELEERLSTQEGLVQELQKKQVELQEERRGLMSQLEE KERRLQTSEAALSSSQAEVASLRQETVAQAALLTEREERLHGLEMERRRLHNQLQELKGN IRVFCRVRPVLPGEPTPPPGLLLFPSGPGGPSDPPTRLSLSRSDERRGTLSGAPAPPTRH DFSFDRVFPPGSGQDEVFEEIAMLVQSALDGYPVCIFAYGQTGSGKTFTMEGGPGGDPQL EGLIPRALRHLFSVAQELSGQGWTYSFVASYVEIYNETVRDLLATGTRKGQGGECEIRRA GPGSEELTVTNARYVPVSCEKEVDALLHLARQNRAVARTAQNERSSRSHSVFQLQISGEH SSRGLQCGAPLSLVDLAGSERLDPGLALGPGERERLRETQAINSSLSTLGLVIMALSNKE SHVPYRNSKLTYLLQNSLGGSAKMLMFVNISPLEENVSESLNSLRFASKPWKHGAGRPRE AARTPADSLAPTADGRRPVKEGLSPPRCRAQEGTRVRKRAVDSAREVCLVQFEDDSQFLV LWKDISPAALPGEELLCCVCRSETVVPGNRLVSCEKCRHAYHQDCHVPRAPAPGEGEGTS WVCRQCVFAIATKRGGALKKGPYARAMLGMKLSLPYGLKGLDWDAGHLSNRQQSYCYCGG PGEWNLKMLQCRSCLQWFHEACTQCLSKPLLYGDRFYEFECCVCRGGPEKVRRLQLRWVD VAHLVLYHLSVCCKKKYFDFDREILPFTSENWDSLLLGELSDTPKGERSSRLLSALNSHK DRFISGREIKKRKCLFGLHARMPPPVEPPTGDGALTRSLGPGGGVSRPLGKRRRPEPEPL RRRQKGKVEELGPPSAVRNQPEPQEQRERAHLQRALQASVSPPSPSPNQSYQGSSGYNFR PTDARCLPSSPIRMFASFHPSASTAGTSGDSGPPDRSPLELHIGFPTDIPKSAPHSMTAS SSSVSSPSPGLPRRSAPPSPLCRSLSPGTGGGVRGGVGYLSRGDPVRVLARRVRPDGSVQ YLVEWGGGGIF >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_7|3456_bp atgatcattctacctttgctctctcccatctcctgggcagctcaaaaagtttccaagaag acaggaccccggtgttccacagctattgccacagggttgaagaaccagaagccagttcct gctgttcctgtccagaagtctggcacatcaggtgttcctcccatggcaggagggaagaaa cccagcaaacgtccagcctgggacttaaagggtcagttatgtgacctaaatgcagaacta aaacggtgccgtgagaggactcaaacgttggaccaagagaaccagcagcttcaggaccag ctcagagatgcccagcagcaggtcaaggccctggggacagagcgcacaacactggagggg catttagccaaggtacaggcccaggctgagcagggccaacaggagctgaagaacttgcgt gcttgtgtcctggagctggaagagcggctgagcacgcaggagggcttggtgcaagagctt cagaaaaaacaggtggaattgcaggaagaacggaggggactgatgtcccaactagaggag aaggagaggaggctgcagacatcagaagcagccctgtcaagcagccaagcagaggtggca tctctgcggcaggagactgtggcccaggcagccttactgactgagcgggaagaacgtctt catgggctagaaatggagcgccggcgactgcacaaccagctgcaggaactcaagggcaac atccgtgtattctgccgggtccgccctgtcctgccgggggagcccactccaccccctggc ctcctcctgtttccctctggccctggtgggccctctgatcctccaacccgccttagcctc tcccggtctgacgagcggcgtgggaccctgagtggggcaccagctcccccaactcgccat gatttttcctttgaccgggtattcccaccaggaagtggacaggatgaagtgtttgaagag attgccatgcttgtccagtcagccctggatggctatccagtatgcatctttgcctatggc cagacaggcagtggcaagaccttcacaatggagggtgggcctgggggagacccccagttg gaggggctgatccctcgggccctgcggcacctcttctctgtggctcaggagctgagtggt cagggctggacctacagctttgtagcaagctacgtagagatctacaatgagactgtccgg gacctgctggccactggaacccggaagggtcaagggggcgagtgtgagattcgccgtgca gggccagggagtgaggagctcactgtcaccaatgctcgatatgtccctgtctcctgtgag aaagaagtggacgccctgcttcatctggcccgccagaatcgggctgtggcccgcacagcc cagaatgaacggtcatcacgcagccacagtgtattccagctacagatttctggggagcac tccagccgaggcctgcagtgtggggcccccctcagtcttgtggacctggccgggagtgag cgacttgaccccggcttagccctcggccccggggagcgggaacgccttcgggaaacacag gccattaacagcagcctgtccacgctggggctggttatcatggccctgagcaacaaggag tcccacgtgccttaccggaacagcaaactgacctacctgctgcagaactctctgggtggt agtgctaagatgctcatgtttgtgaacatttctccactggaagagaacgtctccgagtcc ctcaactctctacgctttgcctccaagccctggaagcacggggcgggacgtccacgggaa gcggcgcgcacgcccgccgactccctcgcgccaaccgccgacggccgccgcccggtgaag gaggggctcagtcctcccaggtgccgcgcgcaggaggggacacgcgtgcgcaaaagggcg gtggacagtgctagggaggtgtgtctggtccagtttgaggatgattcgcagtttctggtt ctatggaaagacattagccctgctgccctccctggagaggaactcctctgttgtgtctgt cgctctgagactgtggtccctgggaaccggctggtcagctgtgagaagtgtcgccatgct tatcaccaggactgccatgttcccagggctccagcccctggagagggagagggcacatcc tgggtatgccgccagtgtgtctttgcgatcgccaccaagaggggaggtgccctgaagaag ggcccctatgcccgggccatgctgggtatgaagctttctctgccatatggactgaagggg ctggactgggatgctggacatctgagcaaccgacagcagagttactgttactgtggtggc cctggggagtggaacctgaaaatgctgcagtgccggagctgcctgcagtggttccatgag gcctgcacccagtgtctgagcaagcccctcctctatggggacaggttctatgaatttgaa tgctgtgtgtgtcgcgggggccctgagaaagtccggagactacagcttcgctgggtggat gtggcccatcttgtcctgtatcacctcagtgtttgctgtaagaagaaatactttgatttt gatcgtgagatcctccccttcacttctgagaattgggacagtttgctcctgggggagctt tcagacacccccaaaggagaacgttcttccaggctcctctctgctcttaacagccacaag gaccgtttcatttcagggagagagattaagaagaggaaatgtttgtttggtctccatgct cggatgcctccccctgtggagccccctactggagatggagcactcaccaggtcactgggc cctgggggaggggtctcacgtcccctggggaagcgccggaggccggagccagagcccctg aggaggaggcagaaggggaaagtggaggagctggggccaccctcagcagtgcgcaatcag cccgagccccaggagcagagggagcgggctcatctgcagagggcactgcaggcctcagtg tctccaccatcccccagccctaaccagagttaccagggcagcagcggctacaacttccgg cccacagatgcccgctgcctgcccagcagccccatccggatgtttgcttccttccaccct tctgccagcaccgcagggacctctggggacagtggacccccagacaggtcacccctggaa cttcacattggtttccccacagacatccctaaaagtgccccccactcgatgactgcctca tcttcctcagtttcatccccatccccaggtcttcctagacgctcagcacccccttctccc ctgtgccgtagtttgtctcctgggactgggggaggagtccgaggtggggttggttacctg tcccgaggggaccctgtccgggtccttgctcggagagtacggcctgatggctctgtgcag tacctggttgagtggggaggagggggcatcttctga >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_8|156_aa MPALLPVASRLLLLPRVLLTMASGSPPTQPSPASDSGSGYVPGSVSAAFVTCPNEKVAKE IARAVVEKRLAACVNLIPQITSIYEWKGKIEEDSEVLMMIKTQSSLVPALTDFVRSVHPY EVAEVIALPVEQGNFPYLQWVRQVTESVSDSITVLP >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_8|471_bp atgccggcgctgctgcctgtggcctcccgccttttgttgctaccccgagtcttgctgacc atggcctctggaagccctccgacccagccctcgccggcctcggattccggctctggctac gttccgggctcggtctctgcagcctttgttacttgccccaacgagaaggtcgccaaggag atcgccagggccgtggtggagaagcgcctagcagcctgcgtcaacctcatccctcagatt acatccatctatgagtggaaagggaagatcgaggaagacagtgaggtgctgatgatgatt aaaacccaaagttccttggtcccagctttgacagattttgttcgttctgtgcacccttac gaagtggccgaggtaattgcattgcctgtggaacaggggaactttccgtacctgcagtgg gtgcgccaggtcacagagtcagtttctgactctatcacagtcctgccatga >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_9|1932_aa MSRSRASIHRGSIPAMSYAPFRDVRGPSMHRTQYVHSPYDRPGWNPRFCIISGNQLLMLD EDEIHPLLIRDRRSESSRNKLLRRTVSVPVEGRPHGEHGGCWASHGLVGQSAALLVHVTS DPEHAYPWLGLTSCQASVCGNVWPYEQDFLCPWWQGAPAQVPCPLLPAASLSAVAALPAA FRGVEYHLGRSRRKSVPGGKQYSMEGAPAAPFRPSQGFLSRRLKSSIKRTKSQPKLDRTS SFRQILPRFRSADHDRYRGWSMWDEIDVMARLMQSFKESHSHESLLSPSSAAEALELNLD EDSIIKPVHSSILGQEFCFEVTTSSGTKCFACRSAAERDKWIENLQRAVKPNKDNSRRVD NVLKLWIIEARELPPKKRYYCELCLDDMLYARTTSKPRSASGDTVFWGEHFEFNNLPAVR ALRLHLYRDSDKKRKKDKAGYVGLVTVPVATLAGRHFTEQWYPVTLPTGSGGSGGMGSGG GGGSGGGSGGKGKGGCPAVRLKARYQTMSILPMELYKEFAEYVTNHYRMLCAVLEPALNV KGKEEVASALVHILQSTGKAKDFLSDMAMSEVDRFMEREHLIFRENTLATKAIEEYMRLI GQKYLKDAIGEFIRALYESEENCEVDPIKCTASSLAEHQANLRMCCELALCKVVNSHCVF PRELKEVFASWRLRCAERGREDIADRLISASLFLRFLCPAIMSPSLFGLMQEYPDEQTSR TLTLIAKVIQNLANFSKFTSKEDFLGFMNEFLELEWGSMQQFLYEISNLDTLTNSSSFEG YIDLGRELSTLHALLWEVLPQLSKEALLKLGPLPRLLNDISTALRNPNIQRQPSRQSERP RPQPVVLRGPSAEMQGYMMRDLNSSIDLQSFMARGLNSSMDMARLPSPTKEKPPPPPPGG GKDLFYVSRPPLARSSPAYCTSSSDITEPEQKMLSVNKSVSMLDLQGDGPGGRLNSSSVS NLAAVGDLLHSSQASLTAALGLRPAPAGRLSQGSGSSITAAGMRLSQMGVTTDGVPAQQL RIPLSFQNPLFHMAADGPGPPGGHGGGGGHGPPSSHHHHHHHHHHRGGEPPGDTFAPFHG YSKSEDLSSGVPKPPAASILHSHSYSDEFGPSGTDFTRRQLSLQDNLQHMLSPPQITIGP QRPAPSGPGGGSGGGSGGGGGGQPPPLQRGKSQQLTVSAAQKPRPSSGNLLQSPEPSYGP ARPRQQSLSKEGSIGGSGGSGGGGGGGLKPSITKQHSQTPSTLNPTMPASERTVAWVSNM PHLSADIESAHIEREEYKLKEYSKSMDESRLDRVKEYEEEIHSLKERLHMSNRKLEEYER RLLSQEEQTSKILMQYQARLEQSEKRLRQQQAEKDSQIKSIIGRLMLVEEELRRDHPAMA EPLPEPKKRLLDAQCMSVTPHFTRASLGAGGGFVNGVEAMMGWRILAIGAVLTAAAFIPR GVYPQALLLFPILVTFEEAMETPTPLPPVPASPTCNPAPRTIQIEFPQHSSSLLESLNRH RLEGKFCDVSLLVQGRELRAHKAVLAAASPYFHDKLLLGDAPRLTLPSVIEADAFEGLLQ LIYSGRLRLPLDALPAHLLVASGLQMWQVVDQCSEILRELETSGGGISARGGNSYHALLS TTSSTGGWCIRSSPFQTPVQSSASTESPASTESPVGGEGSELGEVLQIQVEEEEEEEEDD DDEDQGSATLSQTPQPQRVSGVFPRPHGPHPLPMTATPRKLPEGESAPLELPAPPALPPK IFYIKQEPFEPKEEISGSGTQPGGAKEETKVFSGGDTEGNGELGFLLPSGPGPTSGGGGP SWKPVDLHGNEILSGGGGPGGAGQAVHGPVKLGGTPPADGKRFGCLCGKRFAVKPKRDRH IMLTFSLRPFGCGICNKRFKLKHHLTEHMKTHAGALHACPHCGRRFRVHACFLRHRDLCK GQGWATAHWTYK >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_9|5799_bp atgagcaggtctcgagcctccatccatcgggggagcatccccgcgatgtcctatgccccc ttcagagatgtacggggaccctctatgcaccgaacccaatacgttcattccccgtatgat cgtcctggttggaaccctcggttctgcatcatctcggggaaccagctgctcatgctggat gaggatgagatacaccccctactgatccgggaccggaggagcgagtccagtcgcaacaaa ctgctgagacgcacagtctccgtgccggtggaggggcggccccacggcgagcatgggggc tgttgggccagccacgggcttgtggggcagtccgcggccttgctggtccatgtcacctct gaccctgagcatgcatatccctggctggggctcacctcctgtcaggcttctgtgtgtggg aatgtgtggccctatgagcaggattttctgtgtccctggtggcagggggctcctgctcag gttccttgccccctccttcccgctgccagcctctccgccgtcgctgctcttcctgctgct ttccggggggtagaataccacttgggtcgctcgaggaggaagagtgtcccaggggggaag cagtacagcatggagggtgcccctgctgcgcccttccggccctcgcaaggcttcctgagc cgacggctaaaaagctccatcaaacgaacgaagtcacaacccaaacttgaccggaccagc agctttcgccagatcctgcctcgcttccgaagtgctgaccatgaccggtacaggggctgg agcatgtgggatgagattgatgtaatggcccggctgatgcaaagctttaaggagtcacac tctcatgagtccttgctgagtcctagcagtgcagctgaggcattggagctcaacttggat gaagattccattatcaagccagtgcacagctccatcctgggccaggagttctgttttgag gtaacaacttcatcaggaacaaaatgctttgcctgtcggtctgcggccgaaagagacaaa tggattgagaatctgcagcgggcagtaaagcccaacaaggacaacagccgccgggtagac aatgtgctaaagctgtggatcatagaggcccgggagctgccccccaagaagcggtactac tgtgagctctgcctggatgacatgctgtatgcacgcaccacctccaagccccgctctgcc tctggggacaccgtcttctggggcgagcacttcgagtttaacaacctgccggctgtccgt gccctgcggctgcatctgtaccgtgactcagacaaaaagcgcaagaaggacaaggcaggc tatgtcggcctggtgactgtgccagtggccaccctggctgggcgccacttcacagagcag tggtaccctgtaaccctgccaacaggcagtgggggatctgggggcatgggttcgggaggg ggagggggctcggggggtggctcagggggcaagggcaaaggaggttgcccggctgtgcgg ctgaaagcacgttaccagacaatgagcatcttgcccatggagctatataaagagtttgca gagtatgtcaccaaccattatcggatgctgtgtgcagtcttggagcccgccctgaatgtc aaaggcaaggaggaggttgccagtgcactagttcacatcctgcagagtacaggcaaggcc aaggacttcctttcagacatggccatgtctgaggtagaccggttcatggaacgggagcac ctcatattccgcgagaacacgcttgccactaaagccatagaagagtatatgagactgatt ggtcagaaatacctcaaggatgccattggagaattcatccgtgctctgtatgaatctgag gaaaactgcgaggtagaccctatcaagtgcacagcatccagtttggcagagcaccaggcc aacctgcgaatgtgctgtgagttggccctgtgcaaggtggtcaactcccactgcgtgttc ccgagggagctgaaggaggtgtttgcttcgtggcggctgcgctgcgcagagcgaggccgg gaggacatcgcagacaggcttatcagcgcctcactcttcctgcgcttcctctgcccagcg attatgtcgcccagtctctttgggcttatgcaggagtacccagatgagcagacctcacga accctcaccctcattgccaaggtcatccagaacctggccaacttttccaagtttacctca aaggaggactttctgggcttcatgaatgagtttctggagctggaatggggttccatgcag cagtttttgtatgagatctccaatctggacacgctaaccaacagcagtagctttgagggt tacatcgacttgggccgagagctctccacactgcatgccctactctgggaggtgctgccc cagctcagcaaggaagccctcctgaagctgggtccactgccccggctcctcaacgacatc agcacagctctgaggaaccccaacatccaaaggcagccaagccgccagagtgagcggccc cggcctcagcctgtggtactgcgggggccatcggctgagatgcagggctacatgatgcgg gacctcaacagctccatcgaccttcagtccttcatggctcgaggcctcaacagctctatg gacatggctcgcctcccctccccaaccaaggaaaagccacccccaccaccgcctggtggt ggtaaagacctgttctatgtaagccgtccacccctggcccgttcctcaccagcatactgc acgagcagctcggacatcacagagccagagcagaagatgctgagtgtcaacaagagtgtg tccatgctggacttacagggtgatgggcctggtggccgcctcaacagcagcagtgtttcg aacctggcggccgtaggggacctgctgcactcaagccaggcctcgctgacagcagccttg gggctacggcctgcgcctgccggacgcctctcccaggggagtggctcatccatcacggcg gctggcatgcgcctcagccagatgggtgtcaccacagacggtgtccctgcccagcaactg cgaatccccctctccttccagaaccctctcttccacatggctgctgatgggccaggtccc ccaggcggccatggagggggcggtggccatggcccaccttcctcccatcaccaccaccac caccatcaccaccaccgaggtggagagccccctggggacacctttgccccattccatggc tatagcaagagtgaggacctctcttccggggtccccaagccccctgctgcctccatcctt catagccacagctacagtgatgagtttggaccctctggcactgacttcacccgtcggcag ctttcactccaggacaacctgcagcacatgctgtcccctccccagatcaccattggtccc cagaggccagccccctcagggcctggaggtgggagcggtgggggcagcggtgggggtggc gggggccagccgcctccattgcagaggggcaagtctcagcagttgacagtcagcgcagcc cagaaaccccggccatccagcgggaatctattgcagtccccagagccaagttatggcccc gcccgtccacggcaacagagcctcagcaaggagggcagcattgggggcagcgggggcagc ggtggcggagggggtggggggctgaagccctccatcaccaagcagcattctcagacacca tccacattgaaccccacaatgccagcctctgagcggacagtggcctgggtctccaacatg cctcacctgtcggctgacatcgagagtgcccacatcgagcgggaagagtacaagctcaag gagtactcaaaatcgatggatgagagccggctggatagggtgaaggagtacgaggaggag attcactcactgaaagagcggctgcacatgtccaaccggaagctggaagagtatgagcgg aggctgctgtcccaggaagaacaaaccagcaaaatcctgatgcagtatcaggcccgactg gagcagagtgagaagaggctaaggcagcagcaggcagagaaggattcccagatcaagagc atcattggcaggctgatgctggtggaggaggagctgcgccgggaccaccccgccatggct gagccgctgccagaacccaagaagaggctgctcgacgctcagtgtatgtctgtcaccccc catttcaccagagcgtccttaggggctgggggtgggtttgttaatggggtggaggcaatg atgggttggaggatcttggctataggggctgtgctgactgcagcagccttcatcccgcgt ggagtctacccccaagcccttctcctcttcccaattcttgtcaccttcgaggaggccatg gaaaccccaacacctttgccgcctgtacccgcctccccgacctgcaacccagccccacgg acaatccagatcgagttcccacagcatagctcgtctctgctggaatctctgaaccgccac aggctagagggaaagttctgtgatgtgtccctcctggtgcagggccgggaacttagggct cataaagcagtgttagctgctgcctctccttacttccatgacaagctgcttctgggggat gcgcctcgtctcactctaccgagtgtcattgaagccgatgccttcgaggggctgctccag ctcatttattcagggcgtctccgcctgccactggatgctcttcctgctcatctccttgtg gccagtggccttcaaatgtggcaggtagtagatcagtgctcagaaattcttagagaatta gaaacttcaggtggtggaatttcagcccgtggaggaaactcctaccatgcccttctttcc actacatcctctacaggaggctggtgcattcgctcttcgcctttccagaccccagtacag tcctctgcttctactgaaagccctgcttccactgagagccctgtgggaggggagggaagt gaactgggagaagtgctgcaaattcaggtggaagaagaagaggaggaggaggaagatgat gatgatgaggaccaggggtcagccacactctctcagactcctcagccccagagagtatca ggggtttttccccgtcctcatggaccccacccactgcccatgactgctactccccgaaag cttccagagggtgagagtgcaccacttgagcttcctgcccctcctgcactgccccccaaa atcttctacattaagcaggaacccttcgagcctaaggaggagatatcaggaagcggaact cagcctggaggagcaaaggaggaaaccaaagtgttttctggaggggacactgaagggaat ggggagctagggttcttgttgccttcagggccagggccaacatctgggggagggggtcca tcctggaaaccagtggatcttcatgggaatgaaatcctgtcagggggtggaggacctggg ggagcaggccaggccgtgcatgggcctgtgaagctaggggggacaccccctgcagatgga aaacgctttggttgcctgtgtgggaagcggtttgcagtgaagccaaagcgtgaccggcac atcatgctgaccttcagccttcggccttttggctgtggcatctgcaacaagcgcttcaag ctgaagcaccatctgacagagcacatgaagacccatgctggagccctgcatgcctgtccc cactgtggccgtcggttccgagtccatgcctgttttctccgccaccgggacctatgcaag ggccagggctgggccactgcccactggacttacaagtga >gi568815592r:33316653_33517668|GENSCAN_predicted_peptide_10|190_aa MAIASKFCQWVAHNTAGDGEGPTLGCSPGVTEDEASGAGRPLRVWGHTHLELALARDRLP RLSLHIFLQAEGAGSDLGQPREGLPQCGGGLKGSSSTVRVGVEAEQTPRASEGCQHDVTS HQHFGKPRQVHREAKGEEDEEDLYLVLEQAQHHMEATKAWGFHPLKPQPKLYIGPFQSWL ELLGHRTPGP >gi568815592r:33316653_33517668|GENSCAN_predicted_CDS_10|573_bp atggccatcgcctccaagttctgccagtgggtggcccataacactgcgggggatggagaa gggccgaccctcggatgctcacctggagtaacggaggatgaggccagtggcgctggccgg ccactccgagtgtggggccacacccacctggaactcgcgctggcccgtgatcgtctccca cgcctctccctccacatcttcctgcaagcagagggagccggctccgacctcggccagccc agagaggggctcccacagtgcggcggcgggctgaagggctcttcaagcacggtcagagtg ggcgtcgaggccgagcagacaccaagagcgagcgagggctgccagcacgatgtcacctct caccagcactttgggaagccaaggcaggtacacagagaagcaaagggtgaagaagatgaa gaggacctttatttagtgttagaacaggctcaacaccacatggaagctaccaaggcttgg ggcttccaccctctgaagccacagcccaagctgtacattggcccctttcagtcatggctg gagttgctgggacacaggacaccaggcccctag