GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:48:21 Sequence gi568815581f:41921578_42123890 : 202313 bp : 51.26% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 3338 3377 40 0.09 1.01 Init + 8752 9091 340 0 1 62 2 224 0.892 8.42 1.02 Intr + 9127 9260 134 2 2 82 92 215 0.999 22.17 1.03 Intr + 13640 13771 132 1 0 82 57 73 0.496 4.95 1.04 Intr + 14022 14172 151 2 1 96 81 206 0.999 20.95 1.05 Intr + 14896 14957 62 2 2 140 94 40 0.999 8.74 1.06 Intr + 15185 15350 166 1 1 58 8 262 0.917 15.25 1.07 Intr + 16980 17204 225 1 0 110 25 472 0.972 41.58 1.08 Intr + 17388 17595 208 1 1 90 73 221 0.816 19.46 1.09 Intr + 23559 23645 87 0 0 99 94 88 0.957 10.08 1.10 Intr + 27576 27772 197 0 2 117 86 285 0.997 30.78 1.11 Intr + 33640 33740 101 2 2 91 39 174 0.872 13.13 1.12 Intr + 39805 39889 85 0 1 85 105 138 0.996 15.09 1.13 Term + 43416 43906 491 1 2 87 48 257 0.843 16.81 1.14 PlyA + 44884 44889 6 1.05 2.00 Prom + 44990 45029 40 -9.07 2.01 Init + 45308 45310 3 1 0 85 81 0 0.849 -1.07 2.02 Intr + 46491 47163 673 2 1 117 51 1471 0.990 138.17 2.03 Intr + 50315 50454 140 0 2 119 76 134 0.874 15.99 2.04 Term + 51898 52347 450 0 0 118 48 482 0.984 42.77 2.05 PlyA + 52488 52493 6 -3.24 3.17 PlyA - 52857 52852 6 -0.45 3.16 Term - 53819 53634 186 2 0 51 49 100 0.529 0.21 3.15 Intr - 54322 54259 64 0 1 99 113 7 0.646 3.51 3.14 Intr - 54825 54741 85 1 1 68 75 23 0.526 -1.62 3.13 Intr - 55746 55684 63 1 0 85 100 79 0.931 7.98 3.12 Intr - 60430 60278 153 2 0 80 89 235 0.999 23.36 3.11 Intr - 60824 60678 147 0 0 47 80 207 0.945 16.52 3.10 Intr - 62059 61986 74 0 2 67 115 113 0.931 11.44 3.09 Intr - 66333 66242 92 0 2 35 103 80 0.929 3.49 3.08 Intr - 67319 67155 165 2 0 88 121 84 0.999 12.37 3.07 Intr - 67980 67827 154 2 1 70 89 133 0.999 12.19 3.06 Intr - 68805 68687 119 0 2 126 57 110 0.999 11.47 3.05 Intr - 73367 73293 75 2 0 65 87 109 0.823 8.71 3.04 Intr - 74847 74734 114 0 0 88 80 99 0.916 10.25 3.03 Intr - 75662 75538 125 0 2 93 79 67 0.961 7.11 3.02 Intr - 78993 78905 89 2 2 130 66 63 0.981 8.41 3.01 Init - 95839 95763 77 1 2 89 76 167 0.871 16.11 3.00 Prom - 97148 97109 40 -6.30 4.00 Prom + 98075 98114 40 -5.01 4.01 Init + 100001 100094 94 1 1 79 92 93 0.972 9.56 4.02 Intr + 100822 101063 242 2 2 60 85 389 0.955 33.60 4.03 Term + 102077 102316 240 1 0 117 49 292 0.999 24.56 4.04 PlyA + 103201 103206 6 -0.45 5.10 PlyA - 104020 104015 6 1.05 5.09 Term - 105556 105320 237 1 0 79 41 120 0.967 2.90 5.08 Intr - 106151 106041 111 2 0 46 65 104 0.698 4.98 5.07 Intr - 106669 106473 197 2 2 99 75 70 0.870 6.45 5.06 Intr - 107358 107206 153 1 0 91 82 18 0.720 2.06 5.05 Intr - 110207 110042 166 2 1 127 86 56 0.986 9.35 5.04 Intr - 112758 112648 111 0 0 90 81 57 0.975 6.28 5.03 Intr - 116308 116160 149 2 2 116 68 101 0.908 11.36 5.02 Intr - 119612 118169 1444 2 1 106 20 1635 0.938 148.51 5.01 Init - 121799 121257 543 2 0 79 94 712 0.769 65.85 5.00 Prom - 123655 123616 40 -6.70 6.07 PlyA - 125282 125277 6 -0.45 6.06 Term - 127213 127073 141 1 0 -4 49 106 0.342 -4.26 6.05 Intr - 129058 128980 79 0 1 64 75 85 0.610 4.85 6.04 Intr - 133820 133723 98 2 2 133 55 35 0.438 4.01 6.03 Intr - 140979 140815 165 0 0 92 81 35 0.306 3.87 6.02 Intr - 141481 141303 179 2 2 95 55 208 0.673 18.36 6.01 Init - 152315 152258 58 2 1 111 99 42 0.267 7.92 6.00 Prom - 155193 155154 40 -1.71 7.04 PlyA - 155236 155231 6 -0.45 7.03 Term - 157145 157029 117 2 0 72 53 75 0.725 1.24 7.02 Intr - 159584 159531 54 2 0 79 75 53 0.799 2.76 7.01 Init - 160600 160523 78 1 0 89 103 15 0.808 4.13 7.00 Prom - 162453 162414 40 -2.21 8.34 PlyA - 163038 163033 6 1.05 8.33 Term - 180369 180184 186 0 0 117 47 275 0.993 24.11 8.32 Intr - 180735 180639 97 2 1 113 94 115 0.990 15.11 8.31 Intr - 182221 182031 191 1 2 105 43 311 0.984 27.20 8.30 Intr - 183350 183189 162 2 0 122 52 314 0.997 31.89 8.29 Intr - 183590 183441 150 2 0 101 78 177 0.989 18.87 8.28 Intr - 184412 184159 254 0 2 43 64 278 0.678 18.59 8.27 Intr - 186218 186027 192 0 0 112 77 315 0.994 32.78 8.26 Intr - 186531 186405 127 0 1 64 85 198 0.977 17.86 8.25 Intr - 187000 186707 294 1 0 86 80 41 0.497 0.85 8.24 Intr - 187809 187693 117 0 0 81 101 83 0.987 10.07 8.23 Intr - 189336 189146 191 1 2 67 76 291 0.992 25.63 8.22 Intr - 189920 189719 202 2 1 103 81 291 0.977 29.28 8.21 Intr - 190316 190148 169 1 1 40 96 203 0.078 16.76 8.20 Intr - 192265 192076 190 2 1 99 32 547 0.086 49.36 8.19 Intr - 192507 192423 85 0 1 62 73 166 0.962 12.39 8.18 Intr - 192705 192642 64 2 1 83 81 107 0.983 8.71 8.17 Intr - 192817 192781 37 2 1 68 84 -13 0.858 -5.89 8.16 Intr - 193027 192913 115 1 1 98 62 234 0.780 22.32 8.15 Intr - 193458 193312 147 0 0 69 42 402 0.821 34.64 8.14 Intr - 194256 194146 111 0 0 137 60 296 0.980 32.78 8.13 Intr - 195584 195458 127 1 1 116 105 180 0.986 23.69 8.12 Intr - 196019 195811 209 2 2 94 48 396 0.999 34.60 8.11 Intr - 196237 196101 137 2 2 77 64 249 0.999 22.20 8.10 Intr - 196440 196330 111 1 0 134 45 45 0.958 5.65 8.09 Intr - 196826 196720 107 1 2 57 91 65 0.844 4.06 8.08 Intr - 197207 197171 37 0 1 84 81 18 0.502 -1.49 8.07 Intr - 197643 197477 167 2 2 59 88 48 0.587 2.02 8.06 Intr - 197859 197668 192 2 0 76 80 236 0.995 20.73 8.05 Intr - 198141 197960 182 0 2 80 96 153 0.995 14.48 8.04 Intr - 198542 198453 90 2 0 137 94 111 0.999 17.29 8.03 Intr - 198793 198648 146 2 2 99 105 189 0.999 22.21 8.02 Intr - 199252 199129 124 1 1 85 82 85 0.885 8.26 8.01 Init - 199727 199389 339 2 0 103 81 137 0.991 9.77 8.00 Prom - 200223 200184 40 -8.77 9.00 Prom + 200507 200546 40 -9.64 9.01 Sngl + 201274 201753 480 0 0 27 41 590 0.577 44.64 9.02 PlyA + 202119 202124 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 192265 192072 194 2 2 99 47 562 0.914 51.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_1|792_aa MAVRRAHILTTYASSLFGTLPPTLFLFSWLSYKGAFSGSNSRENRQSVQQQTLTSPAVSH AIITPWPSARQNWRKRTPAGCTQNPGPCQTGAEPEVAVHLMVAKKRSFHKPDRASLNPVT MSDPEGETLRSTFPSYMAEGERLYLCGEFSKAAQSFSNALYLQDGDKNCLVARSKCFLKM GDLERSLKDAEASLQSDPAFCKGILQKAETLYTMGDFEFALVFYHRGYKLRPDREFRVGI QKAQEAINNSVGSPSSIKLENKGDLSFLSKQAENIKAQQKPQPMKHLLHPTKGEPKWKAS LKSEKTVRQLLGELYVDKEYLEKLLLDEDLIKGTMKGGLTVEDLIMTGINYLDTHSNFWR QQKPIYARERDRKLMQEKWLRDHKRRPSQTAHYILKSLEDIDMLLTSGSAEGSLQKAEKV LKKVLEWNKEEVPNKDELVGNLYSCIGNAQIELGQMEAALQSHRKDLEIAKEYDLPDAKS RALDNIGRVFARVGKFQQAIDTWEEKIPLAKTTLEKTWLFHEIGRCYLELDQAWQAQNYG EKSQQCAEEEGDIEWQLNASVLVAQAQVKLRDFESAVNNFEKALERAKLVHNNEAQQAII SALDDANKGIIRELRKTNYVENLKEKSEGEASLYEDRIITREKDMRRVRDEPEKVVKQWD HSEDEKETDEDDEAFGEALQSPASGKQSVEAGKARSDLGAVAKGLSGELGTRSGETGRKL LEAGRRESREIYRRPSGELEQRLSGEFSRQEPEELKKLSEVGRREPEELGKTQFGEIGET KKTGNEMEKEYE >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_1|2379_bp atggcggtacgccgtgcacatatccttaccacgtatgcctcatccctattcggaactttg cctccaaccctcttcctgttttcttggctgtcttacaagggagccttttcaggcagcaac tccagagagaaccgccagagcgtgcaacagcaaacactaaccagcccagccgtcagccac gcaatcatcactccctggcccagtgcgcggcagaactggcgcaagcgcacgccggcaggt tgtacccagaatccgggcccttgccagacgggggcggaaccggaagtcgctgtacatctc atggttgctaagaaacggagcttccacaaaccagatagagcgtctctaaatccggtcacc atgtcggaccccgaaggcgagaccttgcgaagcacctttccctcttatatggccgaaggc gagcggctctacctgtgcggggaattttctaaagccgcgcagagcttcagcaacgctctt taccttcaggatggagacaagaactgcctggttgctcgctcaaagtgcttcctgaagatg ggagacttggagagatccctgaaggatgctgaggcttcgctccagagtgacccagctttc tgtaaggggattttgcaaaaggctgagacactgtacaccatgggagactttgagtttgcc ttggtattctatcatcgaggctacaagctgaggcctgatcgggaattcagagttggcatt cagaaagcccaggaagccatcaacaactcagtgggaagtccttcttccattaagctggag aacaaaggggacctctccttcttaagcaagcaggctgagaatataaaagcccagcagaag cctcagcccatgaaacacctcttacaccccaccaagggagagcccaagtggaaggcctcg ctcaagagtgagaagactgtccgccagcttctgggggagctctacgtggacaaagagtat ttggagaagctcctattggatgaagacctgatcaaaggcaccatgaagggcggcctgact gtggaggacctcatcatgacgggcatcaactacctggatactcacagcaacttctggagg cagcagaagccgatctacgccagggagcgggaccggaagctgatgcaagagaaatggctg cgggaccacaaacgccgtccctcacagacagcccattacatcctcaagagcctggaggac attgatatgttgctcacaagtggcagtgctgaagggagtcttcagaaagctgagaaagtg ctgaagaaggtactggaatggaacaaggaagaggtacccaacaaggatgaactggttgga aacttgtatagctgcatagggaatgcccagattgagctggggcagatggaggcagccctg cagagccacagaaaggacctggagatcgccaaggaatatgaccttcctgatgcaaaatcg agagcccttgacaacattggcagagtttttgccagagttgggaaattccagcaagccatt gacacgtgggaagaaaagatccctctggcaaaaaccaccctggagaagacctggctgttc cacgagatcggccgctgctacttggagctggaccaggcctggcaggcccagaattatggc gagaagtcccagcagtgtgccgaggaggaaggggacattgagtggcaactgaatgccagt gttctggtggcccaggcacaagtgaagctgagagacttcgagtcagccgtgaacaatttt gagaaggccctggagagagcaaagcttgtgcataacaacgaggcgcagcaggccatcatc agtgccttggacgatgccaacaagggtatcatcagagaactgaggaaaaccaactacgtg gagaatctcaaagaaaaaagcgagggagaagcttcactgtatgaagatagaataataaca agagagaaggacatgaggagagtgagagatgagcccgagaaggtggtgaagcagtgggac catagtgaggatgagaaagagacagatgaggacgatgaggcttttggggaagctctgcag agcccagcaagcggaaagcagagtgtggaagcaggaaaagccagaagcgatttgggagca gttgccaagggcctgtcaggagaattaggcacaagatcaggagaaacaggcaggaagcta ctagaagctggcagaagagagtcaagagaaatttataggaggccttcgggagaattagag caaagactctcaggagaattcagcagacaggaaccagaagaactaaagaaactttcagaa gtgggcagaagagagccagaagaactgggaaaaacacaatttggagaaataggagaaacg aaaaaaacaggaaatgagatggaaaaggaatatgaatga >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_2|421_aa MNRGFSRKSHTFLPKIFFRKMSSSGAKDKPELQFPFLQDEDTVATLLECKTLFILRGLPG SGKSTLARVIVDKYRDGTKMVSADAYKITPGARGAFSEEYKRLDEDLAAYCRRRDIRILV LDDTNHERERLEQLFEMADQYQYQVVLVEPKTAWRLDCAQLKEKNQWQLSADDLKKLKPG LEKDFLPLYFGWFLTKKSSETLRKAGQVFLEELGNHKAFKKELRQFVPGDEPREKMDLVT YFGKRPPGVLHCTTKFCDYGKAPGAEEYAQQDVLKKSYSKAFTLTISALFVTPKTTGARV ELSEQQLQLWPSDVDKLSPTDNLPRGSRAHITLGCAADVEAVQTGLDLLEILRQEKGGSR GEEVGELSRGKLYSLGNGRWMLTLAKNMEVRAIFTGYYGKGKPVPTQGSRKGGALQSCTI I >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_2|1266_bp atgaacagaggcttctcccgaaaaagccacacattcctgcccaagatcttcttccgcaag atgtcatcctcaggggccaaggacaagcctgagctgcagtttcccttccttcaggatgag gacacagtggccacgctgctagagtgcaagacgctcttcatcttgcgcggcctgccagga agcggcaagtccacgctggcacgggtcatcgtggacaagtaccgtgatggcaccaagatg gtgtcggctgacgcttacaagatcacccccggcgctcgaggagccttctccgaggagtac aagcggctcgatgaggacctggctgcctactgccgccgccgggacatcagaattcttgtg cttgatgacaccaaccacgaacgggaacggctggagcagctctttgaaatggccgaccag taccagtaccaggtggtgctggtggagcccaagacggcgtggcggctggactgtgcccag ctcaaggagaagaaccagtggcagctgtcggctgatgacctgaagaagctgaagcctggg ctggagaaggacttcctgccgctctacttcggctggttcctgaccaagaagagctctgag accctccgcaaagccggccaggtcttcctggaagagctggggaaccacaaggccttcaag aaggagctgcgacaattcgtccctggggatgagcccagggagaagatggacttggtcacc tactttggaaagagacccccaggcgtgctgcattgcacaaccaagttttgtgactacggg aaggctcccggggcagaggagtacgctcaacaagatgtgttaaagaaatcttactccaag gccttcacgctgaccatctctgccctctttgtgacacccaagacgactggggcccgggtg gagttaagcgagcagcaactgcagttgtggccgagtgatgtggacaagctgtcacccact gacaacctgccgcgggggagccgcgcccacatcaccctcggctgtgcagctgacgtagag gccgtgcagacgggccttgacctcttagagattctgcggcaggagaaggggggcagccga ggcgaggaggtgggcgagctaagccggggcaagctctattccttgggcaatgggcgctgg atgctgaccctggccaagaacatggaggtcagggccatcttcacggggtactacgggaaa ggcaaacctgtgcccacgcaaggtagccggaaggggggcgccttgcagtcctgcaccatc atatga >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_3|593_aa MAAAAECDVVMAATEPELLDDQEAKREAETFKEQGNAYYAKKDYNEAYNYYTKAIDMCPK NASYYGNRAATLMMLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMAACRSFQ RALELDHKNAQAQQEFKNANAVMEYEKIAETDFEKRDFRKVVFCMDRALEFAPACHRFKI LKAECLAMLGRYPEAQSVASDILRMDSTNADALYVRGLCLYYEDCIEKAVQFFVQALRMA PDHEKACIACRNAKALKAKKEDGNKAFKEGNYKLAYELYTEALGIDPNNIKTNAKLYCNR GTVNSKLRKLDDAIEDCTNAVKLDDTYIKAYLRRAQCYMDTEQYEEAVRDYEKVYQTEKT KEHKQLLKNAQLELKKSKRKDYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAEV QKEEEKKFKEVGEAFTILSDPKKKTRYDSGQDLDEEGMNMGDFDPNNIFKAFFGGPGGFS FEVCEPELSPTWTWGAASCCMRAVGTQPSGRRHTAHGCSGMHHAVLSSVVFKPTPTFCMS LRPASAFCLLMKANTVQLPNLTTAVPGQQRGDRTDGPKLWDKASPNTPSPRED >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_3|1782_bp atggcggctgccgcggagtgcgatgtggtaatggcggcgaccgagccggagctgctcgac gaccaagaggcgaagagggaagcagagactttcaaggaacaaggaaatgcatactatgcc aagaaagattacaatgaagcttataattattatacaaaagccatagatatgtgtcctaaa aatgctagctattatggtaatcgagcagccaccttgatgatgcttggaaggttccgggaa gctcttggagatgcacaacagtcagtgaggttggatgacagttttgtccggggacatcta cgagagggcaagtgccacctctctctggggaatgccatggcagcatgtcgcagcttccag agagccctagaactggatcataaaaatgctcaggcacaacaagagttcaagaatgctaat gcagtcatggaatatgagaaaatagcagaaacagattttgagaagcgagattttcggaag gttgttttctgcatggaccgtgccctagaatttgcccctgcctgccatcgcttcaaaatc ctcaaggcagaatgtttagcaatgctgggtcgttatccagaagcacagtctgtggctagt gacattctacgaatggattccaccaatgcagatgctctgtatgtacgaggtctttgcctt tattacgaagattgtattgagaaggcagttcagtttttcgtacaggctctcaggatggct cctgaccacgagaaggcctgcattgcctgcagaaatgccaaagcactcaaagcaaagaaa gaagatgggaataaagcatttaaggaaggaaattacaaactagcatatgaactgtacaca gaagccctggggatagaccccaacaatataaaaacaaatgctaaactctactgtaatcgg ggtacggttaattccaagcttaggaaactagatgatgcaatagaagactgcacaaatgca gtgaagcttgatgacacttacataaaagcctacttgagaagagctcagtgttacatggac acagaacagtatgaagaagcagtacgagactatgaaaaagtataccagacagagaaaaca aaagaacacaaacagctcctaaaaaatgcgcagctggaactgaagaagagtaagaggaaa gattactacaagattctaggagtggacaagaatgcctctgaggacgagatcaagaaagct tatcggaaacgggccttgatgcaccatccagatcggcatagtggagccagtgctgaggtt cagaaggaggaggagaagaagttcaaggaagttggagaggcctttactatcctctctgat cccaagaaaaagactcgctatgacagtggacaggacctagatgaggagggcatgaatatg ggtgattttgatccaaacaatatcttcaaggcattctttggcggtcctggcggcttcagc tttgaagtctgtgagccagagctgagcccgacctggacttggggtgcagcaagctgctgc atgcgggctgtgggcacccagcctagtggcaggagacacactgctcacggatgcagcggc atgcaccatgctgtcctgtcaagcgtggtgttcaagccaacccccaccttctgtatgagt ctgaggccagcctctgctttctgcctcctcatgaaagccaacactgttcagctgcccaac ctcaccacagctgtgcccggccagcagaggggagacaggaccgatggccccaagctgtgg gacaaagccagtcctaatacacccagccctagggaagactga >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_4|191_aa MGKSCKVVVCGQASVGKTSILEQLLYGNHVVGSEMIETQEDIYVGSIETDRGVREQVRFY DTRGLRDGAELPRHCFSCTDGYVLVYSTDSRESFQRVELLKKEIDKSKDKKEVTIVVLGN KCDLQEQRRVDPDVAQHWAKSEKVKLWEVSVADRRSLLEPFVYLASKMTQPQSKSAFPLS RKNKGSGSLDG >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_4|576_bp atggggaagagctgcaaggtggtcgtgtgtggccaggcgtctgtgggcaaaacttcaatc ctggagcagcttctgtatgggaaccatgtagtgggttcggagatgatcgagacgcaggag gacatctacgtgggctccattgagacagaccggggggtgcgagagcaggtgcgtttctat gacacccgggggctccgagatggggccgaactgccccgacactgcttctcttgcactgat ggctacgtcctggtctatagcacagatagcagagagtcttttcagcgtgtggagctgctc aagaaggagattgacaaatccaaggacaagaaggaggtcaccatcgtggtccttggcaac aagtgtgacttacaggagcagcggcgtgtagacccagatgtggctcagcactgggccaag tcagagaaggtgaagctgtgggaggtgtcagtggcggaccggcgctccctcctggagccc tttgtctacttggccagcaagatgacgcaaccccagagcaagtctgccttccccctcagc cggaagaacaagggcagcggctccttggatggctga >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_5|1036_aa MVPPGKKPAGEASNSNKKCKRYFNEHWKEEFTWLDFDYERKLMFCLECRQALVRNKHGKA ENAFTVGTDNFQRHALLRHVTSGAHRQALAVNQGQPPFEGQAEGGGACPGLATTPASRGV KVELDPAKVAVLTTVYCMAKEDVPNDRCSALLELQRFNLCQALLGTEHGDYYSPRRVRDM QVAIASVLHTEACQRLKASPYVGLVLDETRDWPESHSLALFATSVSPCDGQPATTFLGSV ELQEGEATAGQLLDILQAFGVSAPKLAWLSSSLPSERLGSVGPQLRATCPLLAELHCLPG RTDPEPPAYLGQYESILDALFRLHGGPSSHLVPELRAALDLAAIDLAGPRPVPWASLLPV VEAVAEAWPGLVPTLEAAALASPVAGSLALALRQFTFVAFTHLLLDALPSVQKLSLVLQA EEPDLALLQPLVMAAAASLQAQRGSGGARLQGFLQELASMDPDASSGRCTYRGVELLGYS EAAVRGLEWLRGSFLDSMRKGLQDSYPGPSLDAVAAFAAIFDPRRYPQAPEELGTHGEGA LRVLLRGFAPAVVRQRALGDFALFKRVVFGLGRLGPRALCTQLACAHSELHELFPDFAAL AALALALPAGAGLLDKVGRSRELRWWGQSGAGEGRGGHMVKIAVDGPPLHEFDFGLAVEF LETGPASGAPSPLLASLPLPTRPLQPPLDFKHLLAFHFNGAAPLSLFPNFSTMDPVQKAV ISHTFGVPSPLKKKLFISCNICHLRFNSANQAEAHYKGHKHARKLKAVEAAKSKQRPHTQ AQDGAVVSPIPTLASGAPGEPQSKEPGREAPGPEPAAAAVGSSMSGEGRSEKGHLYCPTC KVTVNSASQLQAHNTGAKHRWMMEGQRGAPRRSRGRPVSRGGAGHKAKRVTGGRGGRQGP SPAFHCALCQLQVNSETQLKQHMSSRRHKDRLAGKTPKPSSQHSKLQKHAALAVSILKSK LALQKQLTKTLAARFLPSPLPTAATAICALPGPLALRPAPTAATTLFPAPILGPALFRTP AGAVRPATGPIVLAPY >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_5|3111_bp atggtgcccccagggaagaaaccagcgggagaggcctccaactccaacaagaagtgtaag cgttacttcaacgagcactggaaagaggagtttacctggctggactttgactatgagcgg aagctgatgttctgcctcgagtgccgccaggccctggtacggaacaagcatggcaaagcc gaaaacgccttcactgtgggcacagacaacttccagcgccacgccctgctgcgccacgtg acctcaggagcccaccgccaggctctggctgtcaaccagggccagcccccttttgagggc caggctgaaggtggaggggcctgcccaggcctggctacgacccctgcctccaggggcgtc aaggtggagttagacccggccaaagtggctgtgctgactactgtgtactgcatggcaaag gaggatgtgcccaatgaccgctgctctgccctgctcgagctgcagaggttcaacctgtgc caggcactgctgggcacagagcatggcgattactacagtcccaggagggtgagggacatg caggtggccattgccagtgtcttgcacacagaggcctgccagcgcctgaaggcatcccca tatgtggggctggtgttggacgagaccagagactggccggagtcacacagcctggccctg tttgccacttcagtgtccccctgtgatggccagcccgccaccaccttcctgggcagtgtg gagctacaggagggcgaggccactgctggccagctcttggacatcctgcaggctttcggc gtatctgcacccaagctggcctggctcagctcaagcctccccagtgagcgcctggggagt gtgggcccacagctccgggccacttgcccactgctggcagagctgcattgtctccctggc cggacagatcctgagcccccggcctacttgggtcaatatgagagcatattggatgcccta ttccgcctccatggtggccctagttcccacttggtccctgagctccgggcagcactggac cttgcagctattgacttggcagggcctcggccagtgccctgggcctccctgctgcctgta gtggaagcagtggccgaggcctggcctggcctggtgcccaccctggaggctgcagccctt gcctcacctgtggcggggtcactggccctggccctgcgccagttcaccttcgtggccttc acccacctgctgctggatgccctgccctctgtgcagaagctctcccttgtcctgcaggca gaagagccggacttggccttgctgcagcctctggtgatggcggctgcggcctccctccaa gctcagcgcggctcaggtggggcccgcctccagggcttcctgcaggaactggcatccatg gaccctgacgccagcagcggacgctgcacctaccgcggcgtggagctgctcggttactcc gaggctgcggtccggggcttggagtggctccggggatccttcctggactccatgcggaag ggcctacaggactcctaccccgggccttcgctggacgccgtggccgccttcgcagcgatc ttcgacccccgacgctacccgcaggcgccggaggagctgggcacgcatggcgagggggcg ctgcgggtgctgctgcgcggctttgctcctgccgtggtgcgccagcgggcgctgggcgac ttcgcgctgtttaagcgcgtagtattcggccttgggcggctcggcccgcgggccctgtgc acccagctggcgtgcgcgcactcggagctgcacgagctcttccccgacttcgccgcccta gccgccttggctttggcgctgcccgcgggcgctggcctgctggacaaggtcggccgcagc cgggagctgcggtggtgggggcagagtggggccggggaaggccgggggggccacatggtg aagatcgcagtggatgggcccccgctgcacgagtttgacttcgggttggctgtggagttc ttagagacaggcccggcctccggcgcccccagccccctgctggcctccctgcccctgccc acccggcctctgcagcccccgctggacttcaagcacttgctcgccttccacttcaatggc gctgccccgctcagtctcttccccaacttcagcacgatggacccggtccagaaagctgtc atcagccacacgtttggtgtcccctcccctctgaagaagaagctgttcatttcctgtaac atctgtcacctgaggttcaactcagcgaaccaggccgaggcacattataaaggccacaaa cacgccagaaaactcaaggctgtcgaggctgccaagagcaagcagaggccacacacccag gcccaggatggggctgtagtgtccccaatcccaacgctggccagtggagcccctggagag ccacagagtaaagagcctgggagagaggcaccggggcctgagccagcggcagctgccgtg ggaagcagcatgagtggggaaggcaggagtgagaaggggcacctctactgccccacgtgt aaggtgacagtgaactcggcctcccagcttcaggctcacaacacaggagccaagcaccgg tggatgatggaaggtcagcgaggggctccccggaggagccggggccgcccggtgtccagg ggaggtgccggacacaaagccaagagagtcacagggggccggggcggccggcaggggccc agccctgccttccactgtgctctctgtcagctccaggtcaattcagagacccaactgaag cagcacatgagcagcaggaggcacaaagaccgcctggccgggaagacccccaagccctcc agccagcacagcaagctgcagaagcacgcagcgctggctgtgagtatcctcaagtctaaa ctggccttgcagaagcaactcaccaagacgttggcagcccgcttcctgcccagcccgctc cccaccgcagccactgccatctgtgctctgccagggcccctggccctccgccctgcccct acagcagccactaccctcttcccggctcccatcctgggcccagctctgtttcgcacccca gcaggagctgtccgccctgccacaggacctatcgtccttgccccttattag >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_6|239_aa MGPPLCVAWSPVGLPQCQADMKRPLSPPPPAEKETPISGAAECLPRPPEPPKPKRERKRP SYTLCDVCNIQLNSAAQAQGTLPVGSFGIRTPKQHFSSLEPPGSHRLSDKGLICGVGVPG SSSGLWSLKAKVSKDHQEKQGCLGTELGGLAQIPALEEDKDSVCVSGGCCRSVEPKETAA RASTAQDGGSCVSAKDKGGLDKAARQWCGEKGQVRSGFWRKNQQDLWMEQVLDVEENRD >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_6|720_bp atggggcctcctctctgtgtggcctggtccccagtaggcctgccccagtgccaggcagat atgaagcggccactgagcccacccccaccggctgagaaggagacccccatatctggagct gctgagtgcctccctcggcccccagaaccacctaagcccaagcgagaaagaaagcggcca tcgtacacgctctgtgatgtctgcaacatccagctgaactcggcggcccaggcccaggga actctccctgtgggatcttttgggatcaggactcctaagcagcacttctccagcctggag cctcctggtagccacaggctttcagacaagggcctcatctgcggggtgggggtcccaggc tccagctcaggtctgtggtctctgaaggccaaggtcagcaaggaccaccaggaaaagcaa ggctgtctgggtacagaattagggggcttagctcagatccctgccctggaagaagacaaa gattctgtctgtgtctctggtggctgctgtcgctccgtggagcccaaggagaccgcggcc agagcgtccaccgcccaagatggtggctcctgtgtctccgcaaaagataaaggtggcctg gacaaggccgccaggcagtggtgtggagagaagggacaagttagaagtggattttggagg aagaatcaacaagacctatggatggagcaggttctggatgtagaagagaacagggattga >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_7|82_aa MKHAMEQDGIGASAKGLRWWEASGFMICKFHEGPLDFRSLLDPKCSGASTWPVSGAHLVM VQEKVTVGKSGESRVSAAAASF >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_7|249_bp atgaagcacgccatggaacaggatggcataggggcttcagccaaggggctgaggtggtgg gaagcttcaggcttcatgatttgcaagttccatgagggccctttggacttccgttcactg ctagatcccaagtgcagtggagcctccacgtggcctgtatctggagctcatctggtcatg gtccaggagaaggtgacagtgggaaagtcgggggagtcaagggtgtcagctgctgctgct tctttttga >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_8|1682_aa MAEPSQAPTPAPAAQPRPLQSPAPAPTPTPAPSPASAPIPTPTPAPAPAPAAAPAGSTGT GGPGVGSGGAGSGGDPARPGLSQQQRASQRKAQVRGLPRAKKLEKLGVFSACKANETCKC NGWKNPKPPTAPRMDLQQPAANLSELCRSCEHPLADHVSHLENVSEDEINRLLGMVVDVE NLFMSVHKEEDTDTKQVYFYLFKLLRKCILQMTRPVVEGSLGSPPFEKPNIEQGVLNFVQ YKFSHLAPRERQTMFELSKMFLLCLNYWKLETPAQFRQRSQAEDVATYKVNYTRWLCYCH VPQSCDSLPRYETTHVFGRSLLRSIFTVTRRQLLEKFRVEKDKLVPEKRTLILTHFPKIW PQVHILPVVPFFPGRLPGLVPPLPPWAFWDLGVYLADLPMAQKQLASTSLGMAVPPAIRW EQEMARFLSMLEEEIYGANSPIWESGFTMPPSEGTQLVPRPASVSAAVVPSTPIFSPSMG GGSNSSLSLDSAGAEPMPGEKRTLPENLTLEDAKRLRVMGDIPMELVNEVMLTITDPAAM LGPETSLLSANAARDETARLEERRGIIEFHVIGNSLTPKANRRVLLWLVGLQNVFSHQLP RMPKEYIARLVFDPKHKTLALIKDGRVIGGICFRMFPTQGFTEIVFCAVTSNEQVKGYGT HLMNHLKEYHIKHNILYFLTYADEYAIGYFKKQGFSKDIKVPKSRYLGYIKDYEGATLME CELNPRIPYTELSHIIKKQKEVIIKKLIERKQAQIRKVYPGLSCFKEGVRQIPVESVPGI RETGWKPLGKEKGKELKDPDQLYTTLKNLLAQIKSHPSAWPFMEPVKKSEAPDYYEVIRF PIDLKTMTERLRSRYYVTRKLFVADLQRVIANCREYNPPDSEYCRCASALEKFFYFKLKE GGLIDKMELRSYQWEVIMPALEGKNIIIWLPTGAGKTRAAAYVAKRHLETVDGAKVVVLV NRVHLVTQHGEEFRRMLDGRWTVTTLSGDMGPRAGFGHLARCHDLLICTAELLQMALTSP EEEEHVELTVFSLIVVDECHHTHKDTVYNVIMSQYLELKLQRAQPLPQVLGLTASPGTGG ASKLDGAINHVLQLCANLDTWCIMSPQNCCPQLQEHSQQPCKQYNLCHRRSQGLVALFSL WASKRALPCRAEPSSVNSVSRAVLPSTDSRVQYELTQTGYIASGVRACPPNSLSHSGPDV RLGLFLDQSTSMCLTLGITVPRPKGKSTSRDPFGDLLKKLMDQIHDHLEMPELSRKFGTQ MYEQQVVKLSEAAALAGLQEQRVYALHLRRYNDALLIHDTVRAVDALAALQDFYHREHVT KTQILCAERRLLALFDDRKNELAHLATHGPENPKLEMLEKILQRQFSSSNSPRGIIFTRT RQSAHSLLLWLQQQQGLQTVDIRAQLLIGAGNSSQSTHMTQRDQQEVIQKFQDGTLNLLV ATSVAEEGLDIPHCNVVVRYGLLTNEISMVQARGRARADQSVYAFVATEGSRELKRELIN EALETLMEQAVAAVQKMDQAEYQAKIRDLQQAALTKRAAQAAQRENQRQQFPVEHVQLLC INCMVAVGHGSDLRKVEGTHHVNVNPNFSNYYNVSRDPVVINKVFKDWKPGGVISCRNCG EVWGLQMIYKSVKLPVLKVRSMLLETPQGRIQAKKWSRVPFSVPDFDFLQHCAENLSDLS LD >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_8|5049_bp atggcggaaccttcccaggccccgaccccggccccggctgcgcagccccggccccttcag tccccagcccctgccccaactccgactcctgcacccagcccggcttcagccccgattccg actcccaccccggcaccagcccctgccccagctgcagccccagccggcagcacagggact ggggggcccggggtaggaagtgggggggccgggagcgggggggatccggctcgacctggc ctgagccagcagcagcgcgccagtcagaggaaggcgcaagtccgggggctgccgcgcgcc aagaagcttgagaagctaggggtcttctcggcttgcaaggccaatgaaacctgtaagtgt aatggctggaaaaaccccaagccccccactgcaccccgcatggatctgcagcagccagct gccaacctgagtgagctgtgccgcagttgtgagcaccccttggctgaccacgtatcccac ttggagaatgtgtcagaggatgagataaaccgactgctggggatggtggtggatgtggag aatctcttcatgtctgttcacaaggaagaggacacagacaccaagcaggtctatttctac ctcttcaagctactgcggaaatgcatcctgcagatgacccggcctgtggtggaggggtcc ctgggcagccctccatttgagaaacctaatattgagcagggtgtgctgaactttgtgcag tacaagtttagtcacctggctccccgggagcggcagacgatgttcgagctctcaaagatg ttcttgctctgccttaactactggaagcttgagacacctgcccagtttcggcagaggtct caggctgaggacgtggctacctacaaggtcaattacaccagatggctctgttactgccac gtgccccagagctgtgatagcctcccccgctacgaaaccactcatgtctttgggcgaagc cttctccggtccattttcaccgttacccgccggcagctgctggaaaagttccgagtggag aaggacaaattggtgcccgagaagaggaccctcatcctcactcacttccccaagatttgg ccccaagttcacatcctccctgttgtccccttttttccaggaaggcttcctggattggtc cctcctctccctccatgggccttttgggatctgggcgtctacctggcagacttgcccatg gcccagaagcaacttgctagtactagtctggggatggcagtgccccctgccatcaggtgg gaacaagagatggcaagattcctgtccatgctggaggaggagatctatggggcaaactct ccaatctgggagtcaggcttcaccatgccaccctcagaggggacacagctggttccccgg ccagcttcagtcagtgcagcggttgttcccagcacccccatcttcagccccagcatgggt gggggcagcaacagctccctgagtctggattctgcaggggccgagcctatgccaggcgag aagaggacgctcccagagaacctgaccctggaggatgccaagcggctccgtgtgatgggt gacatccccatggagctggtcaatgaggtcatgctgaccatcactgaccctgctgccatg ctggggcctgagacgagcctgctttcggccaatgcggcccgggatgagacagcccgcctg gaggagcgccgcggcatcatcgagttccatgtcatcggcaactcactgacgcccaaggcc aaccggcgggtgttgctgtggctcgtggggctgcagaatgtcttttcccaccagctgccg cgcatgcctaaggagtatatcgcccgcctcgtctttgacccgaagcacaagactctggcc ttgatcaaggatgggcgggtcatcggtggcatctgcttccgcatgtttcccacccagggc ttcacggagattgtcttctgtgctgtcacctcgaatgagcaggtcaagggttatgggacc cacctgatgaaccacctgaaggagtatcacatcaagcacaacattctctacttcctcacc tacgccgacgagtacgccatcggctacttcaaaaagcagggtttctccaaggacatcaag gtgcccaagagccgctacctgggctacatcaaggactacgagggagcgacgctgatggag tgtgagctgaatccccgcatcccctacacggagctgtcccacatcatcaagaagcagaaa gaggtgatcatcaagaagctgattgagcgcaaacaggcccagatccgcaaggtctacccg gggctcagctgcttcaaggagggcgtgaggcagatccctgtggagagcgttcctggcatt cgagagacaggctggaagccattggggaaggagaaggggaaggagctgaaggaccccgac cagctctacacaaccctcaaaaacctgctggcccaaatcaagtctcaccccagtgcctgg cccttcatggagcctgtgaagaagtcggaggcccctgactactacgaggtcatccgcttc cccattgacctgaagaccatgactgagcggctgcgaagccgctactacgtgacccggaag ctctttgtggccgacctgcagcgggtcatcgccaactgtcgcgagtacaaccccccggac agcgagtactgccgctgtgccagcgccctggagaagttcttctacttcaagctcaaggag ggaggcctcattgacaaaatggagcttcggtcctaccaatgggaggtgatcatgcctgcc ctggagggcaagaatatcatcatctggctgcccacgggtgccgggaagacccgggcggct gcttatgtggccaagcggcacctagagactgtggatggagccaaggtggttgtattggtc aacagggtgcacctggtgacccagcatggtgaagagttcaggcgcatgctggatggacgc tggaccgtgacaaccctgagtggggacatgggaccacgtgctggctttggccacctggcc cggtgccatgacctgctcatctgcacagcagagcttctgcagatggcactgaccagcccc gaggaggaggagcacgtggagctcactgtcttctccctgatcgtggtggatgagtgccac cacacgcacaaggacaccgtctacaacgtcatcatgagccagtacctagaacttaaactc cagagggcacagccgctaccccaggtgctgggtctcacagcctccccaggcactggcggg gcctccaaactcgatggggccatcaaccacgtcctgcagctctgtgccaacttggacacg tggtgcatcatgtcaccccagaactgctgcccccagctgcaggagcacagccaacagcct tgcaaacagtacaacctctgccacaggcgcagccaggggttagtagcattattctctctc tgggcgagcaagcgtgccctgccgtgccgagctgaaccaagctccgtgaacagcgtcagc agggcagttttaccttctactgacagtagagtccagtatgaacttacacaaacaggttat atagcaagtggagtacgtgcctgccccccaaactcgctgagtcactctggcccggatgtc cgcctcggcctattccttgaccaaagcacgtccatgtgccttacactaggcatcaccgtg cccaggccaaagggtaagagcacatcgcgggatccgtttggggacttgctgaagaagctc atggaccaaatccatgaccacctggagatgcctgagttgagccggaaatttgggacgcaa atgtatgagcagcaggtggtgaagctgagtgaggctgcggctttggctgggcttcaggag caacgggtgtatgcgcttcacctgaggcgctacaatgacgcgctgctcatccatgacacc gtccgcgccgtggatgccttggctgcgctgcaggatttctatcacagggagcacgtcact aaaacccagatcctgtgtgccgagcgccggctgctggccctgttcgatgaccgcaagaat gagctggcccacttggcaactcatggcccagagaatccaaaactggagatgctggaaaag atcctgcaaaggcagttcagtagctctaacagccctcggggtatcatcttcacccgcacc cgccaaagcgcacactccctcctgctctggctccagcagcagcagggcctgcagactgtg gacatccgggcccagctactgattggggctgggaacagcagccagagcacccacatgacc cagagggaccagcaagaagtgatccagaagttccaagatggaaccctgaaccttctggtg gccacgagtgtggcggaggaggggctggacatcccacattgcaatgtggtggtgcgttat gggctcttgaccaatgaaatctccatggtccaggccaggggccgtgcccgggccgatcag agtgtatacgcgtttgtagcaactgaaggtagccgggagctgaagcgggagctgatcaac gaggcgctggagacgctgatggagcaggcagtggctgctgtgcagaaaatggaccaggcc gagtaccaggccaagatccgggatctgcagcaggcagccttgaccaagcgggcggcccag gcagcccagcgggagaaccagcggcagcagttcccagtggagcacgtgcagctactctgc atcaactgcatggtggctgtgggccatggcagcgacctgcggaaggtggagggcacccac catgtcaatgtgaaccccaacttctcgaactactataatgtctccagggatcctgtggtc atcaacaaagtcttcaaggactggaagcctgggggtgtcatcagctgcaggaactgtggg gaggtctggggtctgcagatgatctacaagtcagtgaagctgccagtgctcaaagtccgc agcatgctgctggagacccctcaggggcggatccaggccaaaaagtggtcccgcgtgccc ttctccgtgcctgactttgacttcctgcagcattgtgccgagaacttgtcggacctctcc ctggactga >gi568815581f:41921578_42123890|GENSCAN_predicted_peptide_9|159_aa MQRVGNTFSNESRVASRCPSVGLAERNRVATMPVRLLRDSPAAQEDNDHARDGFQMKLDA HGFAPEELVVQVDGQWLMVTGQQQLDVRDPERVSYRMSQKVHRKMLPSNLSPTAMTCCLT PSGQLWVRGQCVALALPEAQTGPSPRLGSLGSKASNLTR >gi568815581f:41921578_42123890|GENSCAN_predicted_CDS_9|480_bp atgcagagagtcggtaacaccttctccaacgagagccgggtggcatcccggtgtcccagc gtgggccttgctgaacggaaccgggtggccacaatgccggtgcggctgctcagggacagt ccagcggctcaggaggacaatgaccatgccagagacggtttccaaatgaagctggatgcc cacggcttcgccccggaggaactggtggtgcaggtggatggccaatggctgatggtgacc ggacagcagcaactggacgtcagggacccggaaagggtcagttaccgcatgtcacagaag gtgcaccggaaaatgctcccgtccaacctgagtcctaccgccatgacctgctgcctgacc ccctccgggcagctgtgggtcagaggccagtgtgtggcgctggccctccctgaagcccaa acaggaccgtccccgagactcgggagcctcggctctaaggcttccaacctgacccggtaa