GENSCAN 1.0 Date run: 1-Aug-119 Time: 17:03:57 Sequence gi568815582r:67728270_67933859 : 205590 bp : 51.44% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 242 104 139 1 1 102 70 194 0.750 19.97 1.08 Intr - 1215 1011 205 1 1 89 51 147 0.995 9.88 1.07 Intr - 1559 1411 149 1 2 101 76 140 0.998 14.49 1.06 Intr - 1777 1669 109 2 1 94 108 220 0.999 24.44 1.05 Intr - 3315 3203 113 2 2 91 78 61 0.973 5.93 1.04 Intr - 6773 6589 185 2 2 140 81 252 0.980 28.80 1.03 Intr - 9766 9744 23 2 2 161 117 31 0.110 11.05 1.02 Intr - 16186 16019 168 2 0 142 62 239 0.223 27.13 1.01 Init - 24080 24050 31 2 1 51 45 41 0.159 -4.00 1.00 Prom - 24387 24348 40 -3.11 2.00 Prom + 26674 26713 40 -2.41 2.01 Init + 29149 29251 103 0 1 92 88 -3 0.228 0.72 2.02 Intr + 30953 31025 73 0 1 88 86 27 0.538 1.46 2.03 Intr + 32061 32216 156 0 0 46 95 68 0.758 2.94 2.04 Intr + 35314 35375 62 1 2 102 113 56 0.940 8.37 2.05 Term + 46380 46624 245 1 2 82 42 81 0.312 -0.71 2.06 PlyA + 46712 46717 6 1.05 3.06 PlyA - 49746 49741 6 -0.45 3.05 Term - 51483 51308 176 1 2 68 36 108 0.002 1.84 3.04 Intr - 76021 76005 17 0 2 87 89 35 0.044 -0.83 3.03 Intr - 77270 77159 112 0 1 89 94 73 0.606 7.94 3.02 Intr - 78292 78033 260 0 2 84 69 193 0.939 14.64 3.01 Init - 79032 78833 200 0 2 38 19 244 0.327 10.91 3.00 Prom - 83088 83049 40 -1.51 4.00 Prom + 84096 84135 40 -3.71 4.01 Init + 85246 85250 5 0 2 70 55 0 0.064 -5.70 4.02 Intr + 86033 86132 100 2 1 98 53 69 0.182 5.01 4.03 Intr + 92570 92682 113 1 2 92 98 98 0.741 10.88 4.04 Intr + 92830 92956 127 1 1 92 71 190 0.999 18.89 4.05 Intr + 95357 95450 94 1 1 54 86 117 0.997 8.14 4.06 Intr + 96314 96510 197 0 2 51 53 303 0.726 22.75 4.07 Intr + 96868 97003 136 0 1 86 115 210 0.571 24.15 4.08 Intr + 97398 97567 170 1 2 80 53 204 0.999 16.28 4.09 Intr + 97648 97807 160 0 1 57 100 296 0.893 27.77 4.10 Intr + 97883 98013 131 0 2 82 58 242 0.890 21.52 4.11 Intr + 98168 98293 126 1 0 60 71 226 0.994 19.48 4.12 Intr + 98423 98575 153 1 0 101 105 117 0.998 15.48 4.13 Intr + 98694 98803 110 2 2 66 52 127 0.999 6.48 4.14 Intr + 98980 99106 127 1 1 91 90 117 0.997 13.39 4.15 Intr + 99204 99310 107 2 2 101 74 124 0.991 11.81 4.16 Term + 99484 99724 241 1 1 76 49 405 0.999 31.23 4.17 PlyA + 99777 99782 6 -3.44 5.13 PlyA - 99908 99903 6 1.05 5.12 Term - 100121 99998 124 1 1 121 40 164 0.999 13.06 5.11 Intr - 100309 100205 105 0 0 91 81 176 0.999 17.03 5.10 Intr - 100574 100398 177 1 0 80 89 110 0.473 9.85 5.09 Intr - 101247 101154 94 1 1 106 47 -2 0.848 -2.98 5.08 Intr - 101819 101496 324 0 0 87 110 239 0.935 22.30 5.07 Intr - 102279 102121 159 1 0 93 73 94 0.906 8.87 5.06 Intr - 103089 102947 143 2 2 83 80 100 0.924 9.21 5.05 Intr - 103839 103743 97 1 1 109 86 56 0.957 7.07 5.04 Intr - 104046 103959 88 0 1 35 119 87 0.872 6.54 5.03 Intr - 104240 104186 55 1 1 63 110 2 0.687 -0.63 5.02 Intr - 105775 105481 295 2 1 16 71 291 0.742 16.81 5.01 Init - 110183 110135 49 2 1 80 58 55 0.673 0.75 5.00 Prom - 111625 111586 40 -7.50 6.00 Prom + 113006 113045 40 -4.91 6.01 Sngl + 114286 115230 945 0 0 99 43 1467 0.999 140.32 6.02 PlyA + 117949 117954 6 1.05 7.03 PlyA - 121381 121376 6 1.05 7.02 Term - 127809 127686 124 2 1 76 54 261 0.610 19.66 7.01 Init - 128895 128879 17 0 2 74 97 5 0.468 -0.12 7.00 Prom - 130276 130237 40 -6.30 8.00 Prom + 130521 130560 40 -1.11 8.01 Init + 136862 136960 99 1 0 60 102 52 0.943 4.02 8.02 Intr + 140071 140142 72 0 0 94 84 67 0.992 7.00 8.03 Intr + 140232 140330 99 2 0 113 90 100 0.962 13.51 8.04 Term + 142531 142644 114 0 0 130 43 131 0.993 11.67 8.05 PlyA + 143021 143026 6 1.05 9.00 Prom + 143050 143089 40 -13.74 9.01 Init + 143377 143592 216 0 0 74 91 87 0.794 6.29 9.02 Intr + 144775 144919 145 0 1 81 88 51 0.786 4.77 9.03 Intr + 144988 145074 87 2 0 10 89 228 0.790 15.54 9.04 Intr + 145269 145304 36 1 0 131 100 8 0.968 5.02 9.05 Intr + 147676 147832 157 2 1 85 84 141 0.993 12.98 9.06 Intr + 148219 148330 112 1 1 63 100 64 0.998 5.98 9.07 Intr + 148604 148703 100 1 1 71 84 143 0.653 12.38 9.08 Intr + 148948 149160 213 2 0 56 32 242 0.969 14.61 9.09 Intr + 149221 149387 167 2 2 64 76 209 0.978 17.49 9.10 Intr + 149472 149576 105 2 0 103 115 91 0.998 14.21 9.11 Intr + 149897 150006 110 1 2 55 116 117 0.999 10.78 9.12 Intr + 150091 150174 84 1 0 108 61 43 0.891 3.03 9.13 Intr + 150267 150362 96 0 0 121 111 34 0.997 8.62 9.14 Intr + 150468 150570 103 0 1 95 102 89 0.994 11.68 9.15 Intr + 150688 150868 181 0 1 83 93 238 0.969 23.76 9.16 Intr + 150968 151040 73 0 1 96 78 42 0.833 2.96 9.17 Intr + 151143 151234 92 0 2 136 70 123 0.999 15.44 9.18 Intr + 151318 151505 188 2 2 56 78 132 0.997 8.93 9.19 Intr + 151581 151702 122 2 2 76 94 185 0.997 17.70 9.20 Intr + 151794 151947 154 0 1 81 89 60 0.919 5.99 9.21 Intr + 152288 152721 434 1 2 107 100 280 0.994 24.12 9.22 Intr + 152807 152911 105 2 0 39 81 142 0.549 8.43 9.23 Intr + 152996 153148 153 2 0 103 45 170 0.996 13.90 9.24 Intr + 153228 153264 37 0 1 67 111 27 0.972 1.65 9.25 Intr + 153399 153840 442 2 1 35 84 255 0.523 13.41 9.26 Intr + 153943 154058 116 2 2 139 43 130 0.999 14.27 9.27 Intr + 154160 154349 190 1 1 86 81 274 0.572 26.18 9.28 Intr + 154410 154596 187 1 1 61 89 225 0.998 19.17 9.29 Intr + 154689 154908 220 0 1 117 96 318 0.999 34.33 9.30 Intr + 155299 155462 164 0 2 123 111 116 0.999 16.69 9.31 Intr + 155687 155765 79 2 1 91 85 147 0.410 14.75 9.32 Intr + 156605 156713 109 1 1 7 109 58 0.135 0.16 9.33 Intr + 157453 157585 133 2 1 79 86 158 0.671 14.91 9.34 Term + 157705 157990 286 1 1 106 33 191 0.911 10.62 9.35 PlyA + 160547 160552 6 1.05 10.00 Prom + 162698 162737 40 -8.38 10.01 Init + 165007 165366 360 0 0 80 12 315 0.069 18.20 10.02 Intr + 169107 169270 164 2 2 40 87 27 0.009 -2.91 10.03 Intr + 180411 181437 1027 0 1 122 105 1431 0.133 139.50 10.04 Term + 199056 199373 318 2 0 103 52 485 0.986 41.63 10.05 PlyA + 201030 201035 6 -3.64 11.08 PlyA - 201327 201322 6 1.05 11.07 Term - 201826 201665 162 1 0 138 49 129 0.977 12.25 11.06 Intr - 202049 201916 134 0 2 108 115 89 0.999 14.47 11.05 Intr - 202319 202139 181 2 1 65 45 126 0.939 5.96 11.04 Intr - 202538 202412 127 1 1 90 94 94 0.598 11.39 11.03 Intr - 202730 202651 80 2 2 66 109 75 0.995 6.24 11.02 Intr - 202932 202829 104 1 2 -36 86 95 0.740 -2.71 11.01 Init - 203583 203532 52 0 1 84 84 93 0.846 7.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 13061 13489 429 1 0 57 47 242 0.964 13.63 S.002 Init + 180352 181437 1086 0 0 72 105 1453 0.803 138.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_1|374_aa MAASSIFKASSWDKHSYGYHGDDGHSFCSSGTGQPYGPTFTTGDVIGCCVNLINGTCFYT KNGHSLGIAFTDLPANLYPTVGLQTPGEIVDANFGQQPFLFDIEDYMREWRAKVQGTVHC FPISARLGEWQAVLQNMVSSYLVHHGYCATATAFARMTETPIQEEQASIKNRQKIQKLVL EGRVGEAIETTQRFYPGLLEHNPNLLFMLKCRQFVEMVNGTDSEVRSLSSRSPKSQDSYP GSPSLSPRHGPSSSHMHNTGADSPSCSNGVASTKSKQNHSKYPAPSSSSSSSSSSSSSSP SSVNYSESNSTDSTKSQHHSSTSNQETSDSEMEMEAEHYPNGVLGSMSTRIVNGAYKHED LQTDESSMGESHGP >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_1|1122_bp atggctgcatcctccatcttcaaagccagcagttgggacaaacattcctatggttaccat ggtgatgatgggcattcgttctgctcctcggggactggccagccctatggtcccacattc accacaggagacgtgatcggctgctgtgtcaacctcatcaatggcacctgcttctacacc aagaatggccacagccttggtatagccttcacagacctcccggccaacctctaccccacc gtaggcctgcagacacctggggagattgtggacgccaactttgggcagcagcccttcctg tttgacattgaggactacatgcgggagtggcgtgccaaggtccagggcacggtccactgc ttccccatcagtgcccggcttggcgagtggcaggcagtgctgcagaacatggtttcatct tacctcgtgcatcatgggtattgtgccacagccacggcttttgctcgaatgactgaaacc ccgattcaggaagaacaggcgtccataaagaacagacaaaagatccagaagctggtgctg gagggccgtgtgggcgaggccatcgagaccacccagcgcttctacccagggctgctggag cacaaccccaacctcctcttcatgctcaagtgccggcagtttgtggagatggtgaatggg acggacagtgaggtccgaagtttgagctcccgaagccccaagtcccaggacagctaccct ggctcccccagcctcagtccccgacatggccccagtagttcccacatgcacaacacagga gcagacagtcccagctgtagcaatggcgtcgcgtccaccaagagcaaacagaaccacagt aaataccctgcacccagctcctcatcctcgtcctcctcctcctcctcgtcctcttcccca tcctccgtcaattactccgagtccaactcaacagactccaccaagtcccagcaccacagc agtaccagtaaccaggagaccagcgacagtgagatggagatggaggcagagcactacccc aacggtgtgctaggaagcatgtccacacgcattgttaatggtgcctacaagcatgaggac ctgcagacggatgagtccagcatgggtgagagccacggcccg >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_2|212_aa MPSGSQNVTLEAGCGGMSAALAWPECFLSMGPCPGHVPGVSRNEKQQNEKGREGQGSTRP LVFCPDLCYQWFLFHGTRRCEISAVCRTCPTQQKQLPVGPMPLTAHSEQAWVWQSTAIIH RPHKSSGSLKEGVNDTGSSPTGAQVAYSNEVTAESGFKANKPPLFKYWVTLSLFQKHGVL KLPKDALFYLPETLHKLKDYHPTNLRETKQTL >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_2|639_bp atgcccagtggctctcagaatgtcactcttgaggctggctgtggagggatgagtgctgcg ctggcttggccagagtgcttcctgagcatggggccgtgtccaggccatgtgccaggggtc tctcggaacgagaaacaacagaatgagaaaggcagagaggggcaaggcagcaccagaccc ctggtgttctgccctgatctctgctatcagtggttcctcttccatggcacacgcagatgt gaaatatcagctgtttgcagaacgtgtcctacccagcagaagcagcttcctgttgggcct atgcctctaacagctcactcagagcaagcctgggtctggcaaagcacagccatcattcat aggcctcacaagtcttctgggtccctcaaggaaggtgtcaatgacacagggagcagcccc acaggagcccaggttgcatattccaatgaggtcacagctgagtcagggttcaaggcgaat aagcccccactcttcaaatactgggtgacattaagtcttttccaaaagcatggtgtttta aagctaccaaaagatgcactcttctatcttccagaaactcttcataaacttaaagactat catccaaccaatctgagagaaacaaagcagaccctgtga >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_3|254_aa MVRGQQEVQRSGSSAISLSRSTQTLISALIPASAYLDADEKLWYRDCWQAMTHGQAGNQH PRPGKLCDTIVAPSKMAAATADPGAGNPQPGDSSGGGAGGGLPSPGEQELSRRLQRLYPA VNQQETPLPRSWSPKDKYNYIGLSQGNLRVHYKGHGKNHKDAASVRATHPIPAACGIYYF EVKIVSKGRDGDCDEAASYWVQGIFVDDMNDWKNLYEKKFITPVFTELLNVSKDPRAPNN SAIFRCLQRSIEEG >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_3|765_bp atggtcagaggccagcaggaggtgcagaggtccggatcaagtgccatttccctctctaga agtactcaaaccctcatctctgccctcatccctgccagcgcctacctcgacgcggacgag aagctgtggtaccgcgactgctggcaggccatgacccacgggcaggcgggcaatcagcac ccgcggcctgggaagctgtgtgacacaatagtagctccctccaagatggcggcagcgacg gcagacccgggagctgggaacccgcagcctggggactcctccggcgggggcgctgggggc gggctgccgtcccctggggagcaggagctgagccggcgcttgcagcgcctgtatcccgcg gtcaaccagcaagagactccgctgccgcgctcctggagccccaaggacaaatacaactac attggtctctcccagggcaacctccgcgtccactacaaaggtcatggcaaaaatcacaaa gatgcggcctcagtgcgtgccacccaccccatacctgctgcctgtggcatttattacttt gaagtgaagattgtcagcaaaggaagagatggtgattgtgacgaagcagcatcatactgg gtacagggcatttttgttgatgacatgaatgactggaagaacctctatgaaaagaaattt atcacacctgtttttactgaattactcaacgttagcaaggacccaagagctcctaataac tcagccatattcaggtgcctgcagagatcaatagaggaagggtag >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_4|698_aa MRLQPRPSGVTIDESFLTEDKSTQNRKLLQKRRTLTGQFSMGGHLSPWPTYTSGQTILQN RKPCSDDYRKRVGSCQQHPFRTAKPQYLEELENYLRKELLLLDLGTDSTQELRLQPYREI FEFFIEDFKTYKPLLSSIKNAYEGMLAHQREKIRALEPLKAKLVTVNEDCNERILAMRAE EKYEISLLKKEKMNLLKLIDKKNEEKISLQSEVTKLRKNLAEEYLHYLSERDACKILIAD LNELRYQREDMSLAQSPGIWGEDPVKLTLALKMTRQDLTRTQMELNNMKANFGDVVPRRD FEMQEKTNKDLQEQLDTLRASYEEVRKEHEILMQLHMSTLKERDQFFSELQEIQRTSTPR PDWTKCKDVVAGGPERWQMLAEGKNSDQLVDVLLEEIGSGLLREKDFFPGLGYGEAIPAF LRFDGLVENKKPSKKDVVNLLKDAWKERLAEEQKETFPDFFFNFLEHRFGPSDAMAWAYT IFENIKIFHSNEVMSQFYAVLMGKRSENVYVTQKETVAQLLKEMTNADSQNEGLLTMEQF NTVLKSTFPLKTEEQIQELMEAGGWHPSSSNADLLNYRSLFMEDEEGQSEPFVQKLWEQY MDEKDEYLQQLKQELGIELHEEVTLPKLRGGLMTIDPSLDKQTVNTYMSQAFQLPESEMP EEGDEKEEAVVEILQTALERLQVIDIRRVGPREPEPAS >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_4|2097_bp atgagattacagccacggccttcaggagtgaccatagacgaatcctttctcacagaagac aagagcacccagaatcgcaagcttcttcagaaacgaaggacgctgactggtcagttctcc atgggtgggcacctgtccccatggcccacatacaccagtggccagaccattttgcaaaat cgaaaaccctgttcagatgactaccggaagcgagtagggagctgccagcagcaccccttt cgcactgccaagccccagtacttggaggaactggaaaactacctacgcaaggagctcctc ctgctggacctgggcacagattccacccaggaactaaggctgcagccttacagagagatc tttgagttcttcatagaggacttcaaaacgtacaagccattactatcctccatcaagaat gcgtatgaggggatgctggcccaccaaagggagaagattcgggctctggagcccctgaag gccaagcttgtcactgtgaatgaggactgcaatgagaggatcctggccatgagagctgag gagaaatatgaaatctccctgctcaagaaagagaagatgaacttgctaaaactcatcgac aaaaagaatgaggagaagatttcattgcagagcgaggtgaccaaactgaggaagaacttg gctgaggagtacctgcactacctcagtgagcgagatgcctgtaagatcctcatcgcagac ctgaatgagctgcggtaccagcgggaggacatgtcattagcccagtcgccaggcatctgg ggggaggaccctgtgaagttaaccctggctcttaagatgacccggcaagacctgacccgc acgcagatggaactcaacaacatgaaggccaactttggagatgtggtccccaggagggac tttgaaatgcaggagaagaccaacaaggatcttcaggagcagctggacaccctgagagcc agctacgaggaggttcgcaaggagcatgagatcctcatgcagctgcacatgagcacgctg aaggaacgggaccaattcttctctgagctgcaggagatccagcgcacttccacgccgcgg cctgactggaccaagtgcaaagatgtggtggctgggggcccagagcgctggcagatgctg gctgagggcaagaacagcgaccagctggtggacgtgctcctggaagagattggttcgggg ctgctgcgggagaaagacttcttccctggtctgggctatggggaagccatccctgctttt cttcggtttgatggcctcgtggagaacaagaagccaagcaagaaggacgtggtcaacctc ctcaaggatgcctggaaggaacgtcttgctgaggagcagaaagagacgttcccagatttc ttcttcaatttcctggagcatcgctttgggcccagtgatgccatggcctgggcttatact atttttgaaaatatcaagatcttccactccaacgaggttatgagtcagttctatgcagtc ttgatgggaaagcggagtgagaatgtgtatgtcacccagaaggagacagtagcccagctg ctgaaggagatgacaaatgctgacagtcagaacgaggggctactaaccatggagcagttc aacactgtcctcaagagtaccttccctctcaagacagaagagcaaatccaggagctgatg gaggcagggggctggcatcccagcagcagcaatgcagacttgctcaactaccgctcactg tttatggaggatgaggagggccagagtgagccctttgtgcaaaaactctgggaacaatac atggatgagaaggacgagtacttacagcagctaaagcaggagcttggcatagaactccat gaggaagtgactctgcccaagctgcgagggggcctgatgaccatcgaccccagcctggac aagcagacagtgaacacctacatgagccaggccttccagctccctgagtcggaaatgcca gaggagggtgacgagaaggaagaagccgtggtggaaatcctccagactgccctggagcgg cttcaggtgattgacatcaggcgtgtgggacctcgagagccagagcctgcaagctag >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_5|569_aa MRFHHVVRAGLELLTPARLPPKHPEFATITVHPASCDSARTTIPVGLCGSIARVTLTVAA GRRSLQLAGRLGGPGAETMADHNPDSDSTPRTLLRRVLDTADPRTPRRPRSARAGKLSGQ TRTIARGRSHGARSVGRSAHIQASGHLEEQTPRTLLKNILLTAPESSILMPESVVKPVPA PQAVQPSRQESSCGRSLNLTFATPLQPQSVQRPGLARRPPARRAVDVGAFLRDLRDTSLA PPNIVLEDTQPFSQPMVGSPNVYHSLPCTPHTGAEDAEQAAGRKTQSSGPGLQKNSPGKP AQFLAGEAEEVNAFALGFLSTSSGVSGEDEVEPLHDGVEEAEKKMEEEGVSVSEMEATGA QGPSRVEEAEGHTEVTEAEGSQGTAEADGPGASSGDEDASGRAASPESASSTPESLQARR HHQFLEPAPAPGAAVLSSEPAEPLLVRHPPRPRTTGPRPRQDPHKAGLSHYVKLFSFYAK MPMERKALEMVEKCLDKYFQHLCDDLEVFAAHAGRKTVKPEDLELLMRRQGLVTDQVSLH VLVERHLPLEYRQLLIPCAYSGNSVFPAQ >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_5|1710_bp atgaggtttcaccacgttgtccgggctggtctcgaactcctgaccccagcgaggctgcct ccaaagcatccggagtttgcaaccatcactgtccaccccgcgagctgcgatagtgcgaga actacaattcccgtggggctgtgcgggagcatcgcgagagttacattaactgtggcggcg ggaaggcggagcctgcagctggctgggcggttaggagggcccggggccgagacgatggct gaccacaaccctgacagcgactccacgccgcgcacgctgctgcgacgcgtgctggataca gcggacccgcgcaccccgcggcgaccccggagtgctcgggctgggaagttgagtggccaa acaaggacgatagccagagggcgttcccatggagccaggtctgttggcagatcggcccat attcaggccagtgggcacttggaggaacagacacctcggacgctgctgaagaacatccta ctaactgccccagaatcttccatcctgatgcctgagtcggtagtgaagccagtgccagca ccgcaggcggtccaaccctccagacaagagagcagttgcggcagatccctcaacctgacc tttgccacacctcttcagccacagtcagtgcagaggcctggcttggcccgcagacctcca gcccgccgagctgtagacgtgggtgcctttttgcgggatctgcgagatacttccctggct cctccaaacattgtgttggaggacacccagccgttctctcagcccatggttggctccccc aacgtgtatcactccctgccctgcacgcctcacactggggctgaagacgctgagcaggct gccggtcgcaagacacagagcagtgggcctgggctgcagaagaatagccctgggaaacca gcccagtttctggcaggagaggcagaggaggtcaatgcctttgctctgggcttcctgagc accagcagtggtgtctctggagaagatgaagtagagcccttacacgatggagttgaagag gcagagaaaaagatggaagaagaaggtgtgagtgtgagtgaaatggaggcaacaggagca caaggacccagcagggtagaagaggctgagggacacacagaggtgacagaagcagaggga tcccaggggactgctgaggctgacgggccaggagcatcttcaggggatgaggatgcctct ggcagggcagcaagtccagagtcggcctccagcacccctgagtctctccaggccaggcga catcatcagtttcttgagccagccccagcgcctggtgctgcagtcttatcttcagagcct gcagagcctctgttggtcaggcatccccctaggccccggaccaccggccccaggccccgg caagatccccacaaggctggactgagccactatgtgaaactctttagcttctatgccaag atgcccatggagaggaaggctcttgagatggtggagaagtgcctagataaatatttccag catctttgtgatgatctggaggtatttgctgctcatgctggccgcaagactgtgaagcca gaggacctggagctgctgatgcggcggcagggcctggtcactgaccaagtctcactgcac gtgctagtggagcggcacctgcccctggagtaccggcagctgctcatcccctgtgcatac agtggcaactctgtcttccctgcccagtag >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_6|314_aa MPGFTCCVPGCYNNSHRDKALHFYTFPKDAELRRLWLKNVSRAGVSGCFSTFQPTTGHRL CSVHFQGGRKTYTVRVPTIFPLRGVNERKVARRPAGAAAARRRQQQQQQQQQQQQQQQQQ QQQQQQQQQQQQSSPSASTAQTAQLQPNLVSASAAVLLTLQATVDSSQAPGSVQPAPITP TGEDVKPIDLTVQVEFAAAEGAAAAAAASELQAATAGLEAAECPMGPQLVVVGEEGFPDT GSDHSYSLSSGTTEEELLRKLNEQRDILALMEVKMKEMKGSIRHLRLTEAKLREELREKD RLLAMAVIRKKHGM >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_6|945_bp atgcctggctttacgtgctgcgtgccaggctgctacaacaactcgcaccgggacaaggcg ctgcacttctacacgtttccaaaggacgctgagttgcggcgcctctggctcaagaacgtg tcgcgtgccggcgtcagtgggtgcttctccaccttccagcccaccacaggccaccgtctc tgcagcgttcacttccagggcggccgcaagacctacacggtacgcgtccccaccatcttc ccgctgcgcggcgtcaatgagcgcaaagtagcgcgcagacccgctggggccgcggccgcc cgccgcaggcagcagcagcaacagcagcagcagcagcaacagcagcaacagcagcagcag cagcaacagcagcagcagcagcagcagcagcagcagtcctcaccctctgcctccactgcc cagactgcccagctgcagccgaacctggtatctgcttccgcggccgtgcttctcaccctt caggccactgtagacagcagtcaggctccgggatccgtacagccggcgcccatcactccc actggagaagacgtgaagcccatcgatctcacagtgcaagtggagtttgcagccgcagag ggcgcagccgctgcggccgccgcgtcggagttacaggctgctaccgcagggctggaggct gccgagtgccctatgggcccccagttggtggtggtaggggaagagggcttccctgatact ggctccgaccattcgtactccttgtcgtcaggcaccacggaggaggagctcctgcgcaag ctgaatgagcagcgggacatcctggctctgatggaagtgaagatgaaagagatgaaaggc agcattcgccacctgcgtctcactgaggccaagctgcgcgaagaactgcgtgagaaggat cggctgcttgccatggctgtcatccgcaagaagcacggaatgtga >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_7|46_aa MNLIVRDVQIGHIVTVGECRPLSKTVRFNVLKVTKAAGTKKQFQKF >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_7|141_bp atgaacttgattgtcagggacgtccagattggccacatcgtaacagtgggtgagtgccgg cccctgagcaagacagtacgcttcaacgtgctcaaggtcaccaaggccgctggcaccaag aagcagttccagaagttctga >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_8|127_aa MGDKPIWEQIGSSFIQHYYQLFDNDRTQLGAIYIDASCLTWEGQQFQGKAAIVEKLSSLP FQKIQHSITAQDHQPTPDSCIISMVVGQLKADEDPIMGFHQMFLLKNINDAWVCTNDMFR LALHNFG >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_8|384_bp atgggagacaagccaatttgggagcagattggatccagcttcattcaacattactaccag ttatttgataatgatagaacccaactaggcgcaatttacattgacgcgtcatgccttacg tgggaaggacaacagttccaggggaaagctgccattgtggagaagttgtctagccttccg ttccagaaaattcagcacagcatcaccgcgcaggaccatcagcccactccagatagctgc atcatcagcatggttgtgggccagcttaaggcggatgaagaccccatcatggggttccac cagatgttcctattaaagaacatcaacgatgcttgggtttgcaccaatgacatgttcagg ctcgccctgcacaactttggctga >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_9|1731_aa MILNPVEAGEALAKWLITFRPSGLSCQQYQGRGLLYPQKDQDHIQDAPHTPSQAEGSSAP VPSALDIFTQRQVPGASARGCRKWRRLVGLAGLSDAARVAVCELRVDCVVTGVPLSRPVA GPMASCASIDIEDATQHLRDILKLDRPAGGCSALSFSVSVEGPSAESPRPSSAYNGDLNG LLVPDPLCSGDSTSANKTGLRTMPPINLQEKQVICLSGDDSSTCIGILAKEVEIVASSDS SISSKARGSNKVKIQPVAKYDWEQKYYYGNLIAVSNSFLAYAIRAANNGSAMVRVISVST SERTLLKGFTGSVADLAFAHLNSPQLACLDEAGNLFVWRLALVNGKIQYPFLPVGALNTL LREEILVHIRQPEGTPLNHFRRIIWCPFIPEESEDCCEESSPTVALLHEDRAEVWDLDML RSSHSTWPVDVSQIKQGFIVVKGHSTCLSEGALSPDGTVLATASHDGYVKFWQIYIEGQD EPRCLHEWKPHDGRPLSCLLFCDNHKKQDPDVPFWRFLITGADQNRELKMWCTVSWTCLQ TIRFSPDIFSSVSVPPSLKVCLDLSAEYLILSDVQRKVLYVMELLQNQEEGHACFSSISE FLLTHPVLSFGIQVVSRCRLRHTEVLPAEEENDSLGADGTHGAGAMESAAGVLIKLFCVH TKALQDVQIRFQPQLNPDVVAPLPTHTAHEDFTFGESRPELGSEGLGSAAHGSQPDLRRI VELPAPADFLSLSSETKPKLMTPDAFMTPSASLQQITASPSSSSSGSSSSSSSSSSSLTA VSAMSSTSAVDPSLTRPPEELTLSPKLQLDGSLTMSSSGSLQASPRGLLPGLLPAPADKL TPKGPGQVPTATSALSLELQEVEPLGLPQASPSRTRSPDVISSASTALSQDIPEIASEAL SRGFGSSAPEGLEPDSMASAASALHLLSPRPRPGPELGPQLGLDGGPGDGDRHNTPSLLE AALTQEASTPDSQVWPTAPDITRETCSTLAESPRNGLQEKHKSLAFHRPPYHLLQQRDSQ DASAEQSDHDDEVASLASASGGFGTKVPAPRLPAKDWKTKGSPRTSPKLKRKSKKDDGDA AMGSRLTEHQVAEPPEDWPALIWQQQRELAELRHSQEELLQRLCTQLEGLQSTVTGHVER ALETRHEQERILETGSTTWHRDGGSILGLGRSTRPAPGPFLSYGAERRLERALAEGQQRG GQLQEQLTQQLSQALSSAVAGRLERSIRDEIKKTVPPCVSRSLEPMAGQLSNSVATKLTA VEGSMKENISKLLKSKNLTDAIARAAADTLQGPMQAAYREAFQSVVLPAFEKSCQAMFQQ INDSFRLGTQECEWGHMAQDLQQLESHMKSRKAREQEAREPVLAQLRGLVSTLQSATEQM AATVAGSVRAEVQHQLHVAVGSLQESILAQVQRIVKGEVSVALKEQQAAVTSSIMQAMRS AAGTPVPSAHLDCQAQQAHILQLLQQGHLNQAFQQALTAADLNLVLYVCETVDPAQVFGQ PPCPLSQPVLLSLIQQLASDLGTRTDLKLSYLEEAVMHLDHSDPITRDHMGSVMAQLAPA LGSQPGMMRCCRRRCCCRQPPHALRPLLLLPLVLLPPLAAAAAGPNRCDTIYQGFAECLI RLGDSMGRGGELETICRSWNDFHACASQVLSGCPEEAAAVWESLQQEARQAPRPNNLHTL CGAPVHVRERGTGSETNQETLRATAPALPMAPAPPLLAAALALAYLLRPLA >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_9|5196_bp atgatcctgaatcctgttgaagctggagaggccttggcaaaatggctcatcacgttcagg ccctccgggctgagttgtcagcagtatcaagggaggggcctgctctatccccagaaggat caggatcatatccaggatgccccacatacaccaagccaggcagagggcagctcagctcct gtcccatctgctttggatatctttacccaaaggcaggttccgggagcctcggctcgtggg tgccggaagtggaggcggttggtggggttggcggggctcagcgacgctgcgcgggtggcg gtttgcgaactgcgggtggactgtgtagtgaccggcgtcccgctgtctcgccccgtggcg gggcccatggcctcctgcgcgagcatcgacatcgaggacgccacgcagcacctgcgggac atcctcaagctggaccggcccgcgggcggctgttctgccttgtccttcagcgtctcggtt gaaggccccagtgcagagagcccacggccatccagtgcctacaatggggacctcaatgga cttctggtcccagacccgctctgctcaggtgatagtacctcagcaaacaagactggtctt cggaccatgccacccattaacctgcaagagaagcaggtcatctgtctctcaggagatgat agctccacctgcattgggattttggccaaggaggtggagattgtggctagcagtgactct agcatttcaagcaaggcccggggaagcaacaaggtgaaaattcagcctgtcgccaagtat gactgggaacagaagtactactatggcaacctgattgctgtgtctaactccttcttggcc tatgccattcgggctgccaacaatggctctgccatggtgcgggtgatcagcgtcagcact tcggagcggaccttgctcaagggcttcacaggcagtgtggctgatctggctttcgcgcac ctcaactctccacagctggcctgcctggatgaggcaggcaacctgttcgtgtggcgcttg gctctggttaatggcaaaattcagtatccattccttcctgtgggtgcccttaacaccctg ctcagagaagagatcttggtccatattcggcagccagagggcacgccactgaaccacttt cgcaggatcatctggtgccccttcatccctgaggagagcgaagactgctgtgaggagagc agcccaacagtggccctgctgcatgaagaccgggctgaggtgtgggacctggacatgctc cgctccagccacagtacctggcctgtggatgttagccagatcaagcagggcttcattgtg gtaaaaggtcatagcacgtgcctcagtgaaggagccctctctcctgatgggactgtgctg gctactgcgagccacgatggctatgtcaagttctggcagatctacattgaggggcaagat gagccaaggtgtctgcacgagtggaaacctcatgatgggcggcccctctcctgcctcctg ttctgtgacaaccataagaaacaagaccctgatgtccctttctggaggttccttattact ggtgctgaccagaaccgagagttaaagatgtggtgtacagtatcctggacctgcctgcag actattcgcttctccccagatatcttcagctcagtgagtgtgccccctagcctcaaggtt tgcttggacctctcagcagaatacctgattctcagcgatgtgcaacggaaggtcctctat gtgatggagctgctgcaaaaccaggaggagggccacgcctgcttcagctccatctcggag ttcctgctcacccaccctgtgctgagctttggtatccaggttgtgagtcgctgccggcta cggcacactgaggtgctgcctgccgaagaggaaaatgacagcctgggtgctgatggtacc catggagccggtgccatggagtctgcggccggtgtgctcatcaagctcttttgtgtgcat actaaggcactgcaagatgtgcagatccgcttccagccacagctgaaccctgatgtggtg gccccactgcccacccacactgcccacgaggacttcacatttggagagtctcggcccgaa ctgggctctgagggcctggggtcagccgctcacggctcccagcctgacctccgacgaatc gtggagctgcctgcacctgccgacttcctcagtctgagcagtgagaccaagcccaagttg atgacacctgacgccttcatgacacctagcgcctccttgcagcagatcactgcctctccc agcagcagcagcagcggtagcagcagcagcagcagcagtagcagcagctcccttacagct gtgtctgccatgagcagcacctcagctgtggacccctccttgaccaggccacctgaggag ctgaccttgagccccaagctgcagctggatggcagcctgacaatgagcagcagtggcagc cttcaggcaagcccgcgtggcctcctgcctggcctgctcccagccccagctgacaaactg actcccaaggggccgggccaggtgcctactgccacctctgcactgtccctggagctgcag gaagtggagcccctggggctaccccaagcctcccctagccgcactcgttcccctgatgtc atctcctcagcttccactgccctgtcccaggacatccctgagattgcatctgaggccctg tcccgtggttttggctcctctgcaccagagggccttgagccagacagtatggcttcagcc gcctcggcactgcacctgctgtccccacggccccggccagggcccgagctcggcccccag ctcgggcttgatggaggccctggggatggagatcggcataataccccctccctcctggag gcagccttgacccaggaggcctcgactcctgacagtcaggtttggcccacagcacctgac attactcgtgagacctgcagcaccctggcagaaagccccaggaatggccttcaggaaaag cacaagagcctggccttccaccgaccaccatatcacctgctgcagcaacgtgacagccag gatgccagtgctgagcaaagtgaccatgatgatgaggtggccagccttgcctctgcttca ggaggctttggcaccaaagttcctgctccacggctgcctgccaaggactggaagaccaag ggatcccctcgaacctcacccaagctcaagaggaaaagcaagaaggatgatggggatgca gccatgggatcccggctcacagagcaccaggtggcagagccccctgaggactggccagca ctaatttggcaacagcagagagagctggcagagctgcggcacagccaggaagagctgctg cagcgtctgtgtacccaactcgaaggcctgcagagcacagtcacaggccacgtagaacgt gcccttgagactcggcacgagcaggaacgtatccttgagactggtagcacaacatggcat agggacgggggcagcattcttggcctgggaaggagtacacgacctgctccaggcccgttc cttagctatggcgcagagcggcggctggagcgagcactggctgaggggcagcagcgggga gggcagctgcaggagcagctgacacaacagttgtcccaagcactgtcgtcagctgtagct gggcggctagagcgcagcatacgggatgagatcaagaagacagtccctccatgtgtctca aggagtctggagcctatggcaggccaactgagcaactcagtggctaccaagctcacagct gtggagggcagcatgaaagagaacatctccaagctgctcaagtccaagaacttgactgat gccatcgcccgagcagctgcagacacattacaagggccgatgcaggctgcctaccgggaa gccttccagagtgtggtgctgccggcctttgagaagagctgccaggccatgttccagcaa atcaatgatagcttccggctggggacacaggaatgtgagtggggtcatatggcccaagac ttgcagcagctagaaagccacatgaagagccggaaggcacgggaacaggaggccagggag cctgtgctagcccagctgcggggcctggtcagcacactgcagagtgccactgagcagatg gcagccaccgtggccggcagtgttcgtgctgaggtgcagcaccagctgcatgtggctgtg ggcagcctgcaggagtccattttagcacaggtacagcgcatcgttaagggtgaggtgagt gtggcgctcaaggagcagcaggccgccgtcacctccagcatcatgcaggccatgcgctca gctgctggcacacctgtcccctctgcccaccttgactgccaggcccagcaagcccatatc ctgcagctgctgcagcagggccacctcaatcaggccttccagcaggcgctgacagctgct gacctgaacctggtgctgtatgtgtgtgaaactgtggacccagcccaggtttttgggcag ccaccctgcccgctctcccagcctgtgctcctttccctcatccagcagctggcatctgac cttggcactcgaactgacctcaagctcagctacctggaagaggccgtgatgcacctggac cacagtgaccccatcactcgggaccacatgggctccgttatggcccagctagctcctgca ctaggctctcagccagggatgatgcgctgctgccgccgccgctgctgctgccggcaacca ccccatgccctgaggccgttgctgttgctgcccctcgtccttttacctcccctggcagca gctgcagcgggcccaaaccgatgtgacaccatataccagggcttcgccgagtgtctcatc cgcttgggggacagcatgggccgcggaggcgagctggagaccatctgcaggtcttggaat gacttccatgcctgtgcctctcaggtcctgtcaggctgtccggaggaggcagctgcagtg tgggaatcactacagcaagaagctcgccaggccccccgtccgaataacttgcacactctg tgcggtgccccggtgcatgttcgggagcgcggcacaggctccgaaaccaaccaggagacg ctgcgggctacagcgcctgcactccccatggcccctgcgcccccactgctggcggctgct ctggctctggcctacctcctgaggcctctggcctag >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_10|622_aa MAAAAAAAAAAAIARRWPAEPPRRRRARRPREVRRPGPGRRPDGREGQPNCGRGRFLALR GLPLVSLQRRPVPRAPGPTGLESRPGDGCSGSRRPSLDWPRNVLSCWIARLQARLGLVDV PSVAMVSDVMLRWKHVMFSFAGRKFSFQAASLEMASQGILRVMYQGTLGVEPSRLCRRGT AFRAGPASLAGEDALVSVMGCGTSKVLPEPPKDVQLDLVKKVEPFSGTKSDVYKHFITEV DSVGPVKAGFPAASQYAHPCPGPPTAGHTEPPSEPPRRARVAKYRAKFDPRVTAKYDIKA LIGRGSFSRVVRVEHRATRQPYAIKMIETKYREGREVCESELRVLRRVRHANIIQLVEVF ETQERVYMVMELATGGELFDRIIAKGSFTERDATRVLQMVLDGVRYLHALGITHRDLKPE NLLYYHPGTDSKIIITDFGLASARKKGDDCLMKTTCGTPEYIAPEVLVRKPYTNSVDMWA LGVIAYILLSGTMPFEDDNRTRLYRQILRGKYSYSGEPWPSVSNLAKDFIDRLLTVDPGA RMTALQALRHPWVVSMAASSSMKNLHRSISQNLLKRASSRCQSTKSAQSTRSSRSTRSNK SRRVRERELRELNLRYQQQYNG >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_10|1869_bp atggcggcggcggcggcggcggcggcggccgctgccattgcccggagatggccggcagag ccgccgagacgccgaagagcccgccgcccgcgcgaggtgaggcggcccggcccaggacga cggcccgacgggcgggaggggcagccgaactgtggtcgtggtcgctttctggcgctccgg ggcctgcccctggtcagcctgcagcgtcggcccgtgccccgtgcgccagggcctacaggg ttggagagcaggccgggcgatggctgctcgggttcccggcgtccgagtctggactggcct cggaacgtgctgtcgtgctggatagcccgtctccaggcccgcttgggcctggtggacgtg ccctctgttgccatggtgtctgatgtgatgctaaggtggaaacatgtgatgttttcattt gctgggagaaagttttctttccaagctgccagccttgaaatggcctctcagggcatccta agagtcatgtatcaagggactcttggtgtggagccatccaggctgtgtagacggggcact gccttcagagcaggtcctgccagcctcgctggagaggatgccctcgtgtccgtgatgggc tgtgggacaagcaaggtccttcccgagccacccaaggatgtccagctggatctggtcaag aaggtggagcccttcagtggcactaagagtgacgtgtacaagcacttcatcacagaggtg gacagtgttggccctgtcaaagccgggttcccagcagcaagtcagtatgcacacccctgc cccggtcccccgactgctggccacacggagcctccctcagaaccaccacgcagggccagg gtagctaagtacagggccaagtttgacccacgtgttacagctaagtatgacatcaaggcc ctaattggccgaggcagcttcagccgagtggtacgtgtagagcaccgggcaacccggcag ccgtatgccatcaagatgattgagaccaagtaccgggaggggcgggaggtgtgtgagtcg gagctgcgtgtgctgcgtcgggtgcgtcatgccaacatcatccagctggtggaggtgttc gagacacaggagcgggtgtacatggtgatggagctggccactggtggagagctctttgac cgcatcattgccaagggctccttcaccgagcgtgacgccacgcgggtgctgcagatggtg ctggatggcgtccggtatctgcatgcactgggcatcacacaccgagacctcaaacctgag aatctgctctactaccatccgggcactgactccaagatcatcatcaccgacttcggcctg gccagtgctcgcaagaagggtgatgactgcttgatgaagaccacctgtggcacgcctgag tacattgccccagaagtcctggtccgcaagccatacaccaactcagtggacatgtgggcg ctgggcgtcattgcctacatcctactcagtggcaccatgccgtttgaggatgacaaccgt acccggctgtaccggcagatcctcaggggcaagtacagttactctggggagccctggcct agtgtgtccaacctggccaaggacttcattgaccgcctgctgacagtggaccctggagcc cgtatgactgcactgcaggccctgaggcacccgtgggtggtgagcatggctgcctcttca tccatgaagaacctgcaccgctccatatcccagaacctccttaaacgtgcctcctcgcgc tgccagagcaccaaatctgcccagtccacgcgttccagccgctccacacgctccaataag tcacgccgtgtgcgggaacgggagctgcgggagctcaacctgcgctaccagcagcaatac aatggctga >gi568815582r:67728270_67933859|GENSCAN_predicted_peptide_11|279_aa MLLLSLTLSLVLLGSSWGCGIPAIKPALSFSQRIVNGENAVLGSWPWQVSLQDSSGFHFC GGSLISQSWVVTAAHCNVSPGRHFVVLGEYDRSSNAEPLQVLSVSRVSAWAADTEEKWAV QAITHPSWNSTTMNNDVTLLKLASPAQYTTRISPVCLASSNEALTEGLTCVTTGWGRLSG VGNVTPAHLQQVALPLVTVNQCRQYWGSSITDSMICAGGAGASSCQGDSGGPLVCQKGNT WVLIGIVSWGTKNCNVRAPAVYTRVSKFSTWINQVIAYN >gi568815582r:67728270_67933859|GENSCAN_predicted_CDS_11|840_bp atgttgctgctcagcctgaccctaagcctggttctcctcggctcctcctggggctgcggc attcctgccatcaaaccggcactgagcttcagccagaggattgtcaacggggagaatgca gtgttgggctcctggccctggcaggtgtccctgcaggacagcagcggcttccacttctgc ggtggttctctcatcagccagtcctgggtggtcactgctgcccactgcaatgtcagccct ggccgccattttgttgtcctgggcgagtatgaccgatcatcaaacgcagagcccttgcag gttctgtccgtctctcgggtgagtgcctgggctgcagacacggaggaaaagtgggcagtg caggccattacacaccctagctggaactctaccaccatgaacaatgacgtgacgctgctg aagctcgcctcgccagcccagtacacaacacgcatctcgccagtttgcctggcatcctca aacgaggctctgactgaaggcctcacgtgtgtcaccaccggctggggtcgcctcagtggc gtgggcaatgtgacaccagcacatctgcagcaggtggctttgcccctggtcactgtgaat cagtgccggcagtactggggctcaagtatcactgactccatgatctgtgcaggtggcgca ggtgcctcctcgtgccagggtgactccggaggccctcttgtctgccagaagggaaacaca tgggtgcttattggtattgtctcctggggcaccaaaaactgcaatgtgcgcgcacctgct gtgtatactcgagttagcaagttcagcacctggatcaaccaggtcatagcctacaactga