GENSCAN 1.0	Date run: 5-Nov-116	Time: 10:08:11

Sequence gi568815587r:62692479_62903979 : 211501 bp : 48.83% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -    319    185  135  1  0  105  109   214 0.866  25.96
 1.02 Intr -   2255   2090  166  1  1   11  123   158 0.536  11.56
 1.01 Init -  13034  12823  212  2  2   91   94   164 0.894  15.64
 1.00 Prom -  13268  13229   40                               -8.46

 2.00 Prom +  13369  13408   40                              -13.96
 2.01 Init +  13739  13806   68  1  2  100   47    87 0.965   4.36
 2.02 Intr +  14714  14869  156  2  0   71   48   126 0.645   6.03
 2.03 Intr +  15817  15916  100  1  1   97  100    95 0.997  11.71
 2.04 Term +  16200  16328  129  2  0   85   49   160 0.998   9.98
 2.05 PlyA +  16702  16707    6                                1.05

 3.00 Prom +  16889  16928   40                               -7.46
 3.01 Sngl +  17252  17488  237  1  0   81   44   183 0.778   7.56
 3.02 PlyA +  18576  18581    6                                1.05

 4.12 PlyA -  19314  19309    6                                1.05
 4.11 Term -  22901  22821   81  2  0   93   48   131 0.968   7.29
 4.10 Intr -  23129  23022  108  2  0  123   76   142 0.999  16.98
 4.09 Intr -  23516  23386  131  0  2   55   84   167 0.471  13.41
 4.08 Intr -  24711  24511  201  1  0   78  101   292 0.996  28.76
 4.07 Intr -  27713  27545  169  2  1   77   98   127 0.995  12.12
 4.06 Intr -  29464  29342  123  1  0  114  110    36 0.998   9.18
 4.05 Intr -  29902  29639  264  1  0   91   69   205 0.971  16.61
 4.04 Intr -  30235  30123  113  2  2   56   87   151 0.860  11.90
 4.03 Intr -  31512  31436   77  2  2   44   33    96 0.919  -1.14
 4.02 Intr -  31948  31813  136  2  1  113   93   120 0.919  14.63
 4.01 Init -  34678  34141  538  1  1  101   72  1020 0.618  96.73
 4.00 Prom -  35047  35008   40                               -5.46

 5.00 Prom +  36271  36310   40                              -10.45
 5.01 Init +  36371  36451   81  1  0   62   68   185 0.747  14.87
 5.02 Intr +  42918  43086  169  2  1   72  121   228 0.716  24.02
 5.03 Term +  45810  45904   95  1  2   59   42   124 0.970   2.89
 5.04 PlyA +  47117  47122    6                                1.05

 6.02 PlyA -  47777  47772    6                                1.05
 6.01 Sngl -  61186  59612 1575  1  0   70   49  1003 0.979  89.98
 6.00 Prom -  66989  66950   40                               -6.16

 7.00 Prom +  67517  67556   40                               -9.46
 7.01 Init +  68292  68357   66  2  0   16   86    31 0.363  -3.23
 7.02 Intr +  69317  69426  110  1  2  111   81   202 0.996  20.88
 7.03 Intr +  70389  70548  160  0  1  120  100   200 0.992  24.39
 7.04 Intr +  72704  72754   51  1  0   92  103    41 0.971   5.10
 7.05 Intr +  72862  72927   66  0  0   87  105    11 0.774   1.80
 7.06 Intr +  73175  73246   72  1  0   94   68   162 0.999  14.40
 7.07 Intr +  73765  73798   34  0  1   97   92    53 0.954   4.40
 7.08 Intr +  77668  77746   79  2  1   71   56    31 0.195  -3.19
 7.09 Intr +  78809  79012  204  2  0   55   99    76 0.235   3.72
 7.10 Intr +  83293  83452  160  1  1   65   86   283 0.356  25.79
 7.11 Intr +  83906  83992   87  1  0  115  105   136 0.997  18.17
 7.12 Intr +  85500  85650  151  2  1  111   24   111 0.378   6.74
 7.13 Intr +  85807  85857   51  2  0   66   90    60 0.705   2.88
 7.14 Intr +  86391  86485   95  1  2   97  105    63 0.976   8.58
 7.15 Intr +  89635  89855  221  0  2  132   28   220 0.999  17.50
 7.16 Intr +  90215  90347  133  2  1   92  105   112 0.995  13.95
 7.17 Intr +  93782  93910  129  1  0  127  105   123 0.999  18.79
 7.18 Term +  94039  94818  780  0  0   90   47   873 0.999  76.76
 7.19 PlyA +  94844  94849    6                                1.05

 8.00 Prom +  94887  94926   40                              -13.06
 8.01 Init +  94954  95072  119  0  2  121   96   240 0.793  25.77
 8.02 Intr +  95236  95272   37  1  1   75   60    35 0.231  -2.34
 8.03 Intr +  96545  96732  188  1  2   68   62   105 0.445   4.39
 8.04 Intr +  96814  96948  135  1  0   90   63   109 0.895   8.28
 8.05 Term +  97123  97231  109  1  1   53   44    70 0.503  -2.92
 8.06 PlyA +  97572  97577    6                               -4.33

 9.03 PlyA -  97890  97885    6                                1.05
 9.02 Term -  98437  98145  293  2  2  107   49   169 0.975  10.41
 9.01 Init -  99516  99201  316  0  1   97  103   615 0.999  59.40
 9.00 Prom -  99718  99679   40                               -4.66

10.22 PlyA -  99775  99770    6                               -3.84
10.21 Term - 100036  99998   39  1  0  100   43    40 0.598  -2.11
10.20 Intr - 100223 100163   61  1  1   80  116    52 0.599   5.94
10.19 Intr - 101962 101780  183  0  0   69   81   119 0.612   8.20
10.18 Intr - 102529 102457   73  2  1   75  110    31 0.949   2.46
10.17 Intr - 103465 103423   43  1  1   85  116     9 0.999   1.21
10.16 Intr - 103703 103588  116  0  2   86   72   207 0.951  19.07
10.15 Intr - 103867 103809   59  0  2  101   94    40 0.945   4.43
10.14 Intr - 104089 103982  108  0  0  113   65    93 0.992   8.90
10.13 Intr - 104760 104705   56  0  2   64   98     7 0.990  -2.92
10.12 Intr - 104908 104840   69  1  0  116   97    81 0.873  11.18
10.11 Intr - 106097 106061   37  1  1   95  119    22 0.989   4.26
10.10 Intr - 108008 107899  110  2  2  106   82   144 0.999  14.68
10.09 Intr - 108723 108616  108  0  0  110   44    73 0.899   5.58
10.08 Intr - 108939 108851   89  1  2   68   94    94 0.717   7.69
10.07 Intr - 109153 109084   70  1  1  110   80    85 0.576   8.65
10.06 Intr - 109341 109261   81  0  0  100  100    43 0.983   6.53
10.05 Intr - 109568 109464  105  2  0   95   61   178 0.752  16.21
10.04 Intr - 109782 109699   84  0  0  118   94    32 0.980   6.82
10.03 Intr - 111094 110941  154  0  1   66   79    16 0.936  -1.43
10.02 Intr - 111500 111314  187  0  1   91   73   131 0.948  10.65
10.01 Init - 112878 112851   28  0  1   70  115    41 0.939   4.82
10.00 Prom - 112999 112960   40                               -6.16

11.11 PlyA - 113473 113468    6                               -0.45
11.10 Term - 115150 114991  160  0  1  107   46   306 0.982  25.71
11.09 Intr - 131809 131688  122  1  2   71   78   120 0.739   8.59
11.08 Intr - 132087 131981  107  1  2  121   80   223 0.996  24.73
11.07 Intr - 132639 132558   82  0  1   74  106    32 0.573   2.81
11.06 Intr - 132861 132805   57  0  0  110   78    25 0.780   2.78
11.05 Intr - 133061 132945  117  2  0   83   75   133 0.998  12.16
11.04 Intr - 134747 134677   71  0  2   85   99   124 0.999  12.10
11.03 Intr - 134920 134865   56  0  2   69   65    67 0.999   1.12
11.02 Intr - 135153 135083   71  0  2   86   89    76 0.999   5.48
11.01 Init - 138765 138541  225  0  0   43  110   313 0.999  27.47
11.00 Prom - 139229 139190   40                              -14.70

12.03 PlyA - 139860 139855    6                                1.05
12.02 Term - 140100 139898  203  1  2   91   46   255 0.844  19.15
12.01 Init - 140272 140131  142  1  1  100   72   133 0.711  13.20
12.00 Prom - 140411 140372   40                              -16.72

13.15 PlyA - 140452 140447    6                                1.05
13.14 Term - 140653 140474  180  1  0   96   48   133 0.998   7.71
13.13 Intr - 141459 141314  146  1  2  113  115   -49 0.914   0.30
13.12 Intr - 141853 141798   56  0  2   88  113    85 0.978   9.62
13.11 Intr - 142049 141949  101  2  2   46   89    80 0.981   2.81
13.10 Intr - 143054 142953  102  2  0   77  110    64 0.994   7.87
13.09 Intr - 143363 143217  147  2  0   81   92    40 0.895   4.13
13.08 Intr - 143558 143483   76  1  1   96   53    49 0.923   1.72
13.07 Intr - 146757 146636  122  0  2  104   94    68 0.955   8.29
13.06 Intr - 146950 146844  107  2  2   82  109   107 0.986  12.13
13.05 Intr - 147228 147063  166  0  1   19    3   193 0.596   3.43
13.04 Intr - 149737 149603  135  1  0   51   70    48 0.440   0.06
13.03 Intr - 149986 149895   92  2  2   86   70    62 0.406   3.91
13.02 Intr - 162073 162041   33  2  0  132   98     7 0.223   4.29
13.01 Init - 163507 163387  121  1  1   41  103   131 0.241   8.38
13.00 Prom - 183824 183785   40                               -4.96

14.00 Prom + 184502 184541   40                               -9.46
14.01 Init + 188546 188969  424  1  1  103  100   656 0.991  64.81
14.02 Intr + 189415 189588  174  2  0   60   20   174 0.972   7.71
14.03 Intr + 190430 190521   92  0  2  114  113   145 0.941  19.31
14.04 Intr + 191979 192047   69  2  0  100  113    70 0.999  10.08
14.05 Intr + 192154 192212   59  0  2  141   94    48 0.999   8.38
14.06 Intr + 192699 192879  181  0  1  133   65    67 0.842   8.77
14.07 Intr + 192987 193130  144  2  0   74   84   102 0.779   8.88
14.08 Intr + 195657 195740   84  2  0  106  116    62 0.989  10.82
14.09 Term + 195853 196215  363  0  0   85   38   363 0.998  25.57
14.10 PlyA + 196357 196362    6                                1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_1|171_aa
MVNDPPVPALLWAQEVGQVLAGRARRLLLQFGVLFCTILLLLWVSVFLYGSFYYSYMPTV
SHLSPVHFYYRNGGYLWQVLMYGQPYRVTLELELPESPVNQDLGMFLVTISCYTRGGRII
STSSRSVMLHYRSDLLQMLDTLVFSSLLLFGFAEQKQLLEVELYADYRENS

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_1|513_bp
atggtcaacgaccctccagtacctgccttactgtgggcccaggaggtgggccaagtcttg
gcaggccgtgcccgcaggctgctgctgcagtttggggtgctcttctgcaccatcctcctt
ttgctctgggtgtctgtcttcctctatggctccttctactattcctatatgccgacagtc
agccacctcagccctgtgcatttctactacagaaatgggggctacctttggcaggtgctg
atgtatggacagccgtatcgtgttaccttagagcttgagctgccagagtcccctgtgaat
caagatttgggcatgttcttggtcaccatttcctgctacaccagaggtggccgaatcatc
tccacttcttcgcgttcggtgatgctgcattaccgctcagacctgctccagatgctggac
acactggtcttctctagcctcctgctatttggctttgcagagcagaagcagctgctggag
gtggaactctacgcagactatagagagaactcg

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_2|150_aa
METSPPAATGLPTHQPRALPGQPHLPDEPLLTLDLPLSHLWLKREVAIQTLIPVAHHIFL
DMENGGSLGEKRRFRMKGETPVNSTMSIGQARKMVEQLKIEASLCRIKVSKAAADLMTYC
DAHACEDPLITPVPTSENPFREKKFFCALL

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_2|453_bp
atggaaaccagcccgccggctgccacagggctgccgactcaccagcctcgcgcgctgcca
gggcaaccacatcttcctgacgagcctctgttgactctggatcttccactgagtcacttg
tggctaaaacgtgaagtggcgatccagacgctgatacctgtggcgcatcacattttcctg
gatatggaaaatggagggtccctgggagagaaacgaagattcaggatgaaaggtgagacc
ccggtgaacagcactatgagtattgggcaagcacgcaagatggtggaacagcttaagatt
gaagccagcttgtgtcggataaaggtgtccaaggcagcagcagacctgatgacttactgt
gatgcccacgcctgtgaggatcccctcatcacccctgtgcccacttcggagaaccccttc
cgggagaagaagttcttctgtgctctcctctga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_3|78_aa
MAAPALTKPGPAPCNPEIWISVSSRLSRRHTGGHNEESLNKGTPGSTAWPTLVAWILRGN
LGFYILPRKERLPLERKY

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_3|237_bp
atggctgcccctgccctaaccaagccggggcccgcgccctgcaaccccgaaatctggatt
tcggtgtcctcccgattgagtcgaaggcacacgggcggccataacgaagaatcactaaac
aaggggaccccaggaagtacagcgtggccgacgctggtcgcgtggatcttgcgagggaat
ctaggtttctacattttgccaagaaaggagaggctccctttggagcggaaatattag

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_4|646_aa
MEVKRLKVTELRSELQRRGLDSRGLKVDLAQRLQEALDAEMLEDEAGGGGAGPGGACKAE
PRPVAASGGGPGGDEEEDEEEEEEDEEALLEDEDEEPPPAQALGQAAQPPPEPPEAAAME
AAAEPDASEKPAEATAGSGGVNGGEEQGLGKREEDEPEERSGDETPGSEVPGDKAAEEQG
DDQDSEKSKPAGSDGERRGVKRQRDEKDEHGRAYYEFREEAYHSRSKSPLPPEEEAKDEE
EDQTLVNLDTCEDEFSYGFDGRGLKAENGQFEEFGQTFGENDVIGCFANFETEEVELSFS
KNGEDLGVAFWISKDSLADRALLPHVLCKNCVVELNFGQKEEPFFPPPEEFVFIHAVPVE
ERVRTAVPPKTIEECEVILMVGLPGSGKTQWALKYAKENPEKRYNVLGAETVLNQMRCNV
YNSGQRRKLLLFKTFSRKVVVVVPNEEDWKKRLELRKEVEGDDVPESIMLEMKANFSLPE
KCDYMDEVTYGELEKEEAQPIVTKYKEEARKLLPPSEKRTNRRNNRNKRNRQNRSRGQGY
DVEPGPHASCLSALSISLPVGGQRRGYDNRAYGQQYWGQPGNRGGYRNFYDRYRGDYDRF
YGRDYEYNRYRDYYRQYNRDWQSYYYHHPQDRDRYYRNYYGYQGYR

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_4|1941_bp
atggaggtgaagcggctgaaagtgaccgagctgcggtcggagctgcagcggcggggcctg
gactcgcgcggcctcaaggtggatctggcgcagcggctgcaggaggcgctggacgccgag
atgctcgaggacgaggccggcggcggcggggccgggcccggcggggcctgcaaggcggag
cctcggcctgtggccgcgtcgggcggcggcccgggcggggacgaggaggaggacgaagag
gaggaggaggaggacgaggaggcgctgcttgaggacgaggacgaggagccaccccctgct
caagccttgggtcaggccgcgcagccgccgccggagcccccggaggcggcagccatggag
gccgcggccgagccagatgcttccgagaagccggcggaggccacggccgggtcaggcggg
gtaaatggtggcgaagagcagggcctcggcaagagggaggaagacgaacccgaggagcgg
agcggggacgagacgccgggatccgaggtgccgggtgacaaggccgccgaggaacaggga
gatgaccaggatagtgaaaagtcaaaaccagcaggctcagatggtgagcggcggggggta
aagagacagcgggatgagaaggatgaacatggccgagcttactatgaattccgagaggag
gcttaccacagccgctcaaagtctccactgcctcctgaagaagaggcaaaagatgaggag
gaggatcaaactcttgtgaacctggacacgtgtgaagatgaattctcttacggtttcgat
ggacgaggactcaaggcagaaaatggacaatttgaggaatttggccagacttttggggag
aatgatgttattggctgctttgctaattttgagactgaagaagtagaactttccttctcc
aagaatggagaagacctaggtgtggcattctggatcagcaaggattccctggcagaccgg
gcccttctaccccatgtcctctgcaaaaattgtgttgtagaattaaacttcggtcagaag
gaggagcccttcttcccaccaccagaagagtttgtgttcattcatgctgtgcctgttgag
gagcgtgtacgcactgcagtccctcccaagaccatagaggaatgtgaggtgattctgatg
gtgggactacccggatctggaaagacccagtgggcactgaaatatgcaaaagaaaaccct
gagaaaagatacaatgtcctgggagctgagactgtgctcaatcaaatgaggtgtaatgtg
tacaattctggccaacggcggaagctattgctgttcaagaccttctctcggaaagtggtg
gtggttgtccctaatgaggaagattggaagaagaggctggagttgaggaaggaagtagag
ggagatgatgtgcctgaatctataatgctggagatgaaagccaacttctctttgcctgaa
aaatgcgactatatggatgaggtgacatatggggagctggagaaggaggaagctcagccc
attgtcactaagtacaaggaggaggcaaggaagcttctgcccccctccgagaagcggaca
aatcgccgaaacaaccgaaacaagcgtaaccggcagaaccgaagccggggccaaggctat
gatgtggagccaggaccccatgccagctgtctgagcgcgctctccatttctcttccagtg
ggcgggcagcgccgaggctacgacaaccgggcctacgggcagcagtactgggggcagcct
ggaaacagagggggttaccgtaatttctatgatcgatacaggggagactatgatcgattt
tacgggcgagattatgagtacaacagatacagagactattacagacaatacaatcgggat
tggcagagttactactaccaccacccccaggacagagaccgatactacaggaattactac
gggtaccaagggtatcggtga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_5|114_aa
MEKRLQEAQLYKEEGNQRYREGKYRDAMEPVNYERVREYSQKVLERQPDNAKALYRAGVA
FFHLQDYDQARHYLLAAVNRQPKDANVRRYLQLTQSELSSYHRKEKQLYLGMFG

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_5|345_bp
atggagaagcgtctgcaggaggctcagctgtacaaggaggaagggaaccagcgctaccgg
gaagggaagtaccgagatgctatggagcccgtgaactacgaacgagtgagagaatatagt
cagaaagtcctggaacgacagcctgataatgccaaggccttgtatcgggccggagtggcc
tttttccatctgcaggactatgaccaggcccgccactacctcctggctgccgtgaatagg
cagcctaaagatgccaacgtccggcggtacctccagctgacacagtcagaactcagcagc
taccatagaaaagagaagcagctctacctgggcatgtttggttaa

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_6|524_aa
MEFPEHSQQLLQSLREQRSQGFLCDCTVMVGSTQFLAHRAVLASCSPFFQLFYKERELDK
RDLVCIHNEIVTAPAFGLLLDFMYAGQLTLRGDTPVEDVLAAASYLHMNDIVKVCKRRLQ
ARALAEADSTKKEEETNSQLPSLEFLSSTSRGTQPSLASAETSGHWGKGEWKGSAAPSPT
VRPPDEPPMSSGADTTQPGMEVDAPHLRAPHPPVADVSLASPSSSTETIPTNYFSSGISA
VSLEPLPSLDVGPESLRVVEPKDPGGPLQGFYPPASAPTSAPAPVSAPVPSQAPAPAEAE
LVQVKVEAIVISDEETDVSDEQPQGPERAFPSGGAVYGAQPSQPEAFEDPGAAGLEEVGP
SDHFLPTDPHLPYHLLPGAGQYHRGLVTSPLPAPASLHEPLYLSSEYEAAPGSFGVFTED
VPTCKTCGKTFSCSYTLRRHATVHTRERPYECRYCLRSYTQSGDLYRHIRKAHNEDLAKR
SKPDPEVGPLLGVQPLPGSPTADRQSSSGGGPPKDFVLAPKTNI

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_6|1575_bp
atggagttcccagaacacagtcagcagctgctgcagagcctccgggagcagcggtcccag
ggtttcctttgtgactgcaccgtgatggtgggtagtacccagttcttggcccatcgggct
gtgctggcctcctgcagcccattcttccagcttttctacaaggagcgggaattggacaag
agggatctggtgtgtattcacaatgaaattgtcacagccccagcctttgggctgcttctg
gactttatgtatgctggccagctgaccctgagaggggatacccctgtggaggatgtgctg
gcagctgccagctacttgcacatgaatgacatcgtcaaggtgtgtaagcggcggcttcaa
gcccgggccctggcagaggcagacagtaccaagaaggaggaggaaaccaactcacagctt
cctagtttggagtttttgtctagtacttcccgtggcacccaaccttcgttggcatctgct
gagacatcaggccactggggcaaaggggaatggaaaggctctgctgctccctcacctact
gtccgtcctccagatgagccaccaatgtctagtggggctgacactacacagcctggcatg
gaggttgacgcaccacatctgcgggcacctcatccaccagtggctgatgtctctcttgcc
agccctagtagctccactgagaccattcctacaaactacttctcttctggcatctcagca
gtttcattggagccactgccatctcttgatgtgggtcctgagagtctgagggtggtggaa
ccaaaggatcctggaggaccactgcaaggcttctatcccccagcctcagccccaacgtca
gccccagcccctgtctcagctccagttccatcccaggctccagccccagctgaagctgag
ctggtccaggtgaaagttgaagctattgtgatctctgatgaagagactgatgtgtcagat
gaacagcctcagggtcctgagagagctttcccatctggaggagcagtgtatggggcacag
ccctcccagccagaggcttttgaagacccaggggcagcaggactggaggaggtggggcca
agtgaccacttcctgccaacagaccctcatctaccctaccatctgctgccaggtgcaggg
cagtatcatcgaggactggtgacctcacctctgcctgcacccgcatccctgcatgagcca
ctttacctgtcttctgagtacgaagcagctccaggaagctttggggtttttactgaggat
gttcccacctgcaagacatgtgggaagaccttctcatgctcttacacactacggcgacat
gccacggtgcacacacgtgagcgaccctatgagtgccgctactgcctgcggagctacacg
cagtcaggggacctctaccgccacatccgcaaggctcacaatgaggacctggccaaacgc
agcaaaccagatccagaagtgggaccccttctaggggtgcagcctctccctggctcccca
acagcagacagacagagcagcagtggtggagggccacctaaagattttgtattggcccca
aaaactaacatctaa

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_7|882_aa
MRIKGENYIKRLAQCLASRKKVISLEHEILLHPRYFGPNLLNTVKQKLFTEVEGTCTGKY
GFVIAVTTIDNIGAGVIQPGRGFVLYPVKYKAIVFRPFKGEVVDAVVTQVNKVGLFTEIG
PMSCFISRHSIPSEMEFDPNSNPPCYKTMDEDIVIQQDDEIRLKIVGTRVDKNDIFAIGS
LMDDYLECSHSSLSANYPILGELKGIHCMPCDILFTLANQEPGSLLEGRKLSKAPADWLY
WGGRGRGPGRGNECELVSGRRRHRPRRRRLGSSLRHAGVFSSTGAMSEREERRFVEIPRE
SVRLMAESTGLELSDEVAALLAEDVCYRLREATQNSSQFMKHTKRRKLTVEDFNRALRWS
SVEAVCGYGSQEALPMRPAREGELYFPEDREVNLVELALATNIPKGCAETAVRVHVSYLD
GKGNLAPQGSVPSAVSSLTDDLLKYYHQVTRAVLGDDPQLMKVKSVSHDLEQLHRLLQVA
RSLFRNPHLCLGPYVRCLVGSVLYCVLEPLAASINPLNDHWTLRDGAALLLSHIFWTHGD
LVSGLYQHILLSLQKILADPVRPLCCHYGAVVGLHALGWKAVERVLYPHLSTYWTNLQAV
LDDYSVSNAQVKADGHKVYGAILVAVERLLKMKAQAAEPNRGGPGGRGCRRLDDLPWDSL
LFQESSSGGGAEPSFGSGLPLPPGGAGPEDPSLSVTLADIYRELYAFFGDSLATRFGTGQ
PAPTAPRPPGDKKEPAAAPDSVRKMPQLTASAIVSPHGDESPRGSGGGGPASASGPAASE
SRPLPRVHRARGAPRQQGPGTGTRDVFQKSRFAPRGAPHFRFIIAGRQAGRRCRGRLFQT
AFPAPYGPSPASRYVQKLPMIGRTSRPARRWALSDYSLYLPL

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_7|2649_bp
atgagaattaagggagagaattatatcaagcgcttagcacagtgtctggcatctaggaag
aaagtaatctccctagagcacgaaatcctgctgcacccgcgctacttcggccccaacttg
ctcaacacggtgaagcagaagctcttcaccgaggtggaggggacctgcacagggaagtat
ggctttgtaattgctgtcaccaccattgacaatattggtgctggtgtgatccagccaggc
cgaggctttgtcctttatccagttaagtacaaggccattgttttccggccatttaaaggg
gaggtcgtggatgctgttgtcactcaggtcaacaaggttggactcttcacagaaattggg
cccatgtcttgcttcatctctcgacattccatcccttcagagatggagtttgatcctaac
tccaacccaccatgttacaagacaatggatgaggatattgtgattcagcaggacgatgag
atccgcttaaagattgtggggacccgtgtggacaagaatgacatttttgctattggctcc
ctgatggacgattacttggaatgttcccacagttcactcagtgcaaactaccccatcttg
ggtgaacttaaaggaattcactgtatgccttgtgacattctgttcacactggccaatcaa
gagccaggaagcctcctggagggccggaaactttccaaggcgcccgccgactggctgtat
tggggagggcggggccggggccccgggagagggaatgagtgtgagctcgtgagtgggcgc
cgccgccaccgcccccgccgccgtcgtctcggtagcagccttcgccacgccggggtcttc
agctccactggggccatgtcagagcgagaagagcggcggtttgtggagatccctcgggag
tctgtccggctcatggcggagagcacgggcctggagctgagcgatgaggtggcggcgctg
ctcgcagaggacgtgtgctatcgtctgagagaggccacgcagaatagctctcagttcatg
aagcacaccaaacgccggaagctgacggttgaggacttcaacagggccctcagatggagc
agcgtggaggctgtgtgtggttacggatcacaggaggcactgcccatgcgccccgccagg
gagggtgaactctactttcctgaggatcgagaggtgaacctggtggagctggccctggct
accaacatccccaaaggctgtgctgagacagctgtcagagttcatgtctcctacctggat
ggcaaagggaacctggcacctcaaggatcggtgcccagtgctgtgtcttcactgacagat
gaccttctcaagtactatcaccaggtgactcgtgctgtgctaggggatgatccgcaactg
atgaaggtgaaatctgtaagccatgacctggagcaactgcaccggctgctgcaggtggca
cggagcctatttcgtaatccgcacctgtgcttggggccctatgtccgctgtctggtgggc
agtgtcctctactgtgtcctggagccactggctgcctccatcaaccccctgaatgaccac
tggactctgcgggatggggctgccctcctgctcagccacatcttctggactcatggggac
cttgtaagtggcctctatcagcatatcctgctatccctgcagaagatcctggcagatcct
gtgcggccgctctgctgccactatggagccgtggtggggctgcatgctcttggctggaag
gcagtagaacgagtcctgtacccacacctgtccacctactggacaaacttgcaggctgtg
ctggatgattattcagtatctaatgcccaggtcaaagcagatggacacaaagtctatgga
gccattctggtggcggtagagcgactgctgaagatgaaggcccaggcagcagagcccaac
aggggtggcccaggtggcagggggtgccggcgcctggacgacctgccatgggacagcctt
ctctttcaagagtcgtcctccgggggcggtgcagaacccagctttgggtccggcctcccg
ctgccgccagggggcgcggggccggaggacccttctctttcggtgaccctggccgacatc
taccgggagctctacgccttcttcggtgacagcttggccacacgctttggcaccggccag
cctgcacccacggctccgcggccgcccggggacaagaaggagccggcggcagccccggac
tcggtgcggaagatgccgcagctgacggcaagcgccatagtcagcccgcacggcgacgag
agcccccggggcagcggcggaggcggccccgcgtcggcctctgggcccgccgcctctgag
agcaggcccttgccgcgcgtgcatcgggcgcgcggggcaccccggcagcagggccccggg
accggcacccgcgacgttttccagaagagccgtttcgccccgcgcggcgccccgcacttt
cgtttcatcatagccgggcggcaggctgggaggcgctgccgcgggcgccttttccagact
gccttccccgcgccgtacgggcctagcccggcctcgcgctacgtgcagaaactgcccatg
atcggccgtaccagccgccccgcccgccggtgggcgctctcggactactcgctgtacttg
ccgctctga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_8|195_aa
MALSWLQRVELALFAAAFLCGAVAAAAMTRTQVRLRGGVRSYFEVVPSPKGRGSFSGRCP
LYGVATLNGSSLALSRPSAPSLCYFVAGASGLLALYCLLLLLFWIYSSCIEDSHRGAIGL
RIALAISAIAVFLVLVSACILRFGTRSLCNSIISLNTTISCSEAQKIPWTPPGTALQFYS
NLHNAEVRPKDRRKN

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_8|588_bp
atggcgctgtcctggctgcagcgcgtcgagcttgcgctctttgctgccgccttcctgtgc
ggggccgtggcggccgcggcgatgactcggacccaggtgcggctgcggggcggggtcagg
tcatacttcgaagttgtcccctcgccaaagggacggggctccttcagtggtagatgtccc
ctgtatggtgtggccaccctgaatggctcctccctggccttatcccgtccctcagcacca
tccctgtgctactttgtagctggggcctctggcctcttggccctctactgcctcctgctt
ttgctcttctggatctacagcagctgcatcgaggactcccacagaggtgctatagggctg
cgcattgcactggccatctcagctatagccgtcttcctggtcttggtgtctgcctgtatc
cttcgatttggcaccaggtctctctgcaactccatcatctccttgaacactacaattagc
tgttctgaagcccagaaaattccatggacaccccctggaactgctctgcagttttactcc
aacctacacaatgctgaagtgagacccaaggataggaggaaaaactga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_9|202_aa
MAAPWRRWPTGLLAVLRPLLTCRPLQGTTLQRDVLLFEHDRGRFFTILGLFCAGQGVFWA
SMAVAAVSRPPVPVQPLDAEVPNRGPFDLRSALWRYGLAVGCGAIGALVLGAGLLFSLRS
VRSVVLRAGGQQVTLTTHAPFGLGAHFTVPLKQVSCMAHRGEVPAMLPLKVKGRRFYFLL
DKTGHFPNTKLFDNTVGAYRSL

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_9|609_bp
atggcggcgccttggaggcgatggcccacggggctgctagccgtgctgcggcccctgctc
acctgccggcccctgcaaggcacgacgctgcaacgggatgtgctgctctttgagcatgat
cggggccgcttcttcaccatcctcgggctgttctgcgcgggccagggcgtcttctgggct
tccatggctgtggcagccgtgtcccggcccccggttccggtgcagcctctggatgcggag
gtcccaaatcgtggccccttcgacctgcgctccgcgctctggcgctacggtctggccgtc
ggctgcggcgccatcggagccctcgtactcggtgctggtcttctcttctctctccggtct
gtgcgctcagtggtgcttcgagctggagggcagcaggtgaccctcaccactcatgccccc
tttggcttgggggcccatttcacagttcctttgaagcaggtatcttgcatggcccaccgg
ggtgaagtccctgccatgctacctctgaaagtcaaaggccgacgcttctatttcctcttg
gacaaaactggacacttccctaacacaaaactctttgacaatactgtgggtgcctaccgg
agcttgtga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_10|619_aa
MADEGKSYSEHDDERVNFPQRKKKGRGPFRWKYGEGNRRSGRGGSGIRSSRLEEDDGDVA
MSDAQDGPRVRYNPYTTRPNRRGDTWHDRDRIHVTVRRDRAPPERGGAGTSQDGTSKNWF
KITIPYGRKYDKAWLLSMIQSKCSVPFTPIEFHYENTRAQFFVEDASTASALKAVNYKIL
DRENRRISIIINSSAPPHTILNELKPEQVEQLKLIMSKRYDGSQQALDLKGLRSDPDLVA
QNIDVVLNRRSCMAATLRIIEENIPELLSLNLSNNRLYRLDDMSSIVQKAPNLKILNLSG
NELKSERELDKIKGLKLEELWLDGNSLCDTFRDQSTYISAIRERFPKLLRLDGHELPPPI
AFDVEAPTTLPPCKGSYFGTENLKSLVLHFLQQYYAIYDSGDRQGLLDAYHDGACCSLSI
PFIPQNPARSSLAEYFKDSRNVKKLKDPTLRFRLLKHTRLNVVAFLNELPKTQHDVNSFV
VDISAQTSTLLCFSVNGVFKEVDGKSRDSLRAFTRTFIAVPASNSGLCIVNDELFVRNAS
SEEIQRAFAMPAPTPSSSPVPTLSPEQQEMLQAFSTQSGMNLEWSQKCLQDNNWDYTRSA
QAFTHLKAKGEIPEVAFMK

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_10|1860_bp
atggcggacgaggggaagtcgtacagcgaacacgatgatgaacgcgttaatttccctcaa
agaaagaagaaaggccggggtcccttccggtggaaatatggtgaaggaaaccgtaggtct
ggaagaggcggttctggtattcggtcttcccgccttgaggaagatgatggagatgtggca
atgagtgatgcccaggatggtccccgagtacgatacaacccctataccacccgacctaac
cgtcggggtgatacttggcatgatcgagatcgcattcatgttactgtgcggagagacaga
gctcctccagagagaggaggggctggcaccagccaggatgggacctcaaagaactggttc
aagattacaattccttatggcagaaagtatgacaaggcatggctcctgagcatgattcag
agcaagtgcagtgtgcccttcacccctattgagtttcactatgagaatacacgggcccag
ttcttcgttgaagacgccagtactgcctctgcattgaaggctgtcaactataagattttg
gatcgggagaaccgaaggatatctatcatcatcaactcttctgctccaccccacactata
ctgaatgaactgaagccagaacaagtagaacagctaaagctgatcatgagcaaacgatac
gatggctcccaacaagcccttgacctcaaaggcctccgttcagacccagatttggtggcc
cagaacattgacgttgtcctgaatcgcagaagctgtatggcagctaccctgaggatcatt
gaagagaacatccctgagctattgtccttgaacttgagcaacaacaggctgtacaggctg
gatgacatgtctagcattgttcagaaggcacccaacctgaagatcctaaacctttctgga
aatgaattgaagtctgagcgggaattggacaagataaaggggctgaagctagaagagctc
tggctcgatggaaactccctgtgtgacaccttccgagaccagtccacctacatcagcgcc
attcgcgaacgatttcccaagttactacgcctggatggccatgagctacccccaccaatt
gcctttgatgttgaagcccccacgacgttaccgccctgcaagggaagctattttggaaca
gaaaacttgaagagtctggtcttgcacttcctgcaacagtactatgcaatttacgactct
ggagaccgacaagggctcctggatgcctaccatgatggggcctgctgttccctgagcatt
cctttcattcctcagaaccctgcccgaagcagcttagccgagtatttcaaggatagcaga
aatgtgaagaagcttaaagaccctaccttgcggttccggctgctgaagcacacgcgtctc
aacgttgttgccttcctcaatgagttgcccaaaacccagcacgacgtcaattccttcgtg
gtagacataagcgcccagacaagcacattgctgtgtttttctgtcaatggagtcttcaag
gaagtggacggaaagtcccgggattctttgcgagccttcacccggacattcattgctgtt
cctgctagcaattcagggctatgtattgtaaatgatgagctatttgtgcggaatgccagt
tctgaagagatccaaagagccttcgctatgcctgcacccacgccttcctccagcccggtg
cccaccctctctccagagcagcaggaaatgttgcaagcattctctacccagtctggcatg
aacctcgagtggtcccagaagtgccttcaggacaacaactgggactacaccagatctgcc
caggccttcactcatctcaaggccaagggcgagatcccagaagtggcattcatgaagtga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_11|355_aa
MIPRKRYGSKNTDQGVYLGLSKTQVLSPATAGSSSSDIAPLPPPVTLVPPPPDTMSCRDR
TQEFLSACKSLQTRQNGIQTNKPALRAVRQRSEFTLMAKRIGKDLSNTFAKLEKLTILAK
RKSLFDDKAVEIEELTYIIKQDINSLNKQIAQLQDFVRAKGSQSGRHLQTHSNTIVVSLQ
SKLASMSNDFKSVLEVRTENLKQQRSRREQFSRAPVSALPLAPNHLGGGAVVLGAESHAS
KDVAIDMMDSRTSQQLQLIDEQDSYIQSRADTMQNIESTIVELGSIFQQLAHMVKEQEET
IQRIDENVLGAQLDVEAAHSEILKYFQSVTSNRWLMVKIFLILIVFFIIFVVFLA

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_11|1068_bp
atgatcccgcggaaacgctacgggtctaagaacacggatcagggtgtctacctgggtctc
tcaaagacacaggtcctgtcccctgcaactgctggcagtagcagcagcgacatcgcccct
ctgccccccccagtgaccctcgtccctccccctcccgacaccatgtcctgccgggatcgg
acccaggagtttctgtctgcctgcaagtcgctgcagacccgtcagaatggaatccagaca
aataagccagctttgcgtgctgtccgacaacgcagtgaattcaccctcatggccaagcgc
attgggaaagaccttagcaacacatttgccaagctggagaagctgacaatcttggcaaag
cgcaagtccctctttgatgataaagcagtggaaattgaagagctaacatatatcatcaaa
caggacatcaatagcctcaacaaacaaattgctcagctccaggatttcgtgagagccaag
ggcagccagagtggccggcacctgcagacccactccaacaccattgtggtctccttgcag
tcgaaactggcttctatgtccaatgacttcaaatcggttttagaagtgaggacagagaac
ctgaagcagcagaggagccggagagagcagttctcccgggcacctgtgtcagccctgccc
cttgcccctaaccacctgggcggtggtgctgtggttctgggggcagagtcccatgcctcc
aaggatgtcgccatcgacatgatggactctcggaccagccagcagctgcagctcattgac
gagcaggattcctacatccagagtcgggcagacaccatgcagaacattgagtcgacaatt
gttgagttgggctccatctttcagcagttggcacacatggttaaggaacaggaggaaacc
attcagaggatcgacgagaacgtgctaggagcccagctggacgttgaggccgcccattca
gagatcctcaagtacttccagtctgtcacctccaaccggtggctcatggtcaaaatcttc
ctcatcctcattgtcttcttcatcatctttgtggtcttccttgcttga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_12|114_aa
MGCCQDKDFEMSDEQSKEEESEDGREDETTDTQRGPRECERGLPEGRGAEDIDLNSPDHP
NHKSNESLLITVLWRRLSTFGRRGSSRPSKRQPDQIRKQESPIREGNQEEPEKG

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_12|345_bp
atgggctgctgccaagacaaggactttgagatgtctgatgagcagtccaaggaggaagag
tctgaggacggcagggaagatgagaccacagacacacaaagagggcccagggagtgtgag
agggggcttcccgagggtaggggcgccgaagatatcgacttgaactccccggaccatccg
aatcacaagtcgaacgaaagccttctgattaccgtgctgtggcggcgactatccacgttc
ggtcgtcggggctcctcgcggccaagcaagaggcaaccagaccagattcggaagcaggag
agtccgatccgagaaggcaaccaggaggagccggagaagggatga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_13|527_aa
MRSTYARALAVAPPPNPVGLCGMVFNRLPGEIRSHFSTARGIPDDSLPLSYMKGQMFAGG
QYKCKVRVVPRPYVLSSHKLRRKGPGEGRRHLQHCPAAPWKAEKPLGWETNATIGEKEAT
RALKSRLQNKHGSFTVSALQTSSNYTALHRTRRPLVTHTLRRKCELSASRLCHGGCCCTL
EPWVNLQRKQAANFTAGGQPRREEAVSALCWGTGGETQMLVGCADRTVKHFSTEDGIFQG
QRHCPGGEGMFRGLAQADGTLITCVDSGILRVWHDKDKDTSSDPLLELRVGPGVCRMRQD
PAHPHVVATGGKENALKIWDLQGSEEPVFRAKNVRNDWLDLRVPIWDQDIQFLPGSQKLV
TCTGYHQVRVYDPASPQRRPVLETTYGEYPLTAMTLTPGGNSVIVGNTHGQLAEIDLRQG
RLLGCLKGLAGSVRGLQCHPSKPLLASCGLDRVLRIHRIQNPRGLEHKDEPQEPQEPNKV
PLEDTETDELWASLEAAAKRKLSGLEQPQGALQTRRRKKKRPGSTSP

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_13|1584_bp
atgcggtcaacgtatgcgcgtgcgctcgccgtagccccgcccccaaatcccgtgggcctc
tgcggcatggtatttaatcgtctccccggggagattcgttctcatttttctactgctcgt
ggtattcctgatgacagtctgcctctatcttacatgaagggccagatgtttgctggcgga
cagtacaaatgcaaggtgcgggtcgtgcctcgcccctatgtcctcagttctcacaaactc
cggagaaaaggtcccggagaggggcgccgccacctgcagcactgtcccgccgccccctgg
aaagccgaaaaacccctaggatgggagactaacgccacaatcggggaaaaggaggccacg
cgggcattgaaatcacgactgcaaaacaagcatggaagctttactgtttcggctcttcaa
acttccagcaactacactgcgctgcatcggactcgacgcccgctggtgacgcacacgctg
cgccggaagtgtgaactgtctgcctccaggctttgtcatggcggctgctgctgcacgctg
gaaccatgggtaaatcttcagcgaaaacaggcggcgaacttcacggccggaggacagccg
cggcgcgaggaggcagtgagcgccctgtgttggggcaccggcggcgagacccagatgctg
gtgggctgcgcggacaggacggtgaagcacttcagcaccgaggatggcatattccagggt
cagagacactgcccgggcggggagggcatgttccgtggcctcgcccaggccgacggcacc
ctcatcacatgtgtggattctgggattctcagagtctggcatgacaaggacaaggacaca
tcctctgacccactcctggaactgagagtgggccctggggtgtgtaggatgcgccaagac
ccagcacacccccatgtggttgccacaggtgggaaagagaatgctttgaagatatgggac
ctgcagggctctgaggaacctgtgttcagggccaagaacgtgcggaatgactggctggac
ttgcgggttcccatctgggaccaggacatacagtttctcccaggatcacagaagcttgtc
acctgcacagggtaccaccaggtccgtgtttatgatccagcatccccccagcgccggcca
gtcctagagaccacctatggagagtacccactaacagccatgaccctcactccgggaggc
aactcagtgattgtgggaaacactcatgggcagctggcagaaattgaccttcggcaaggg
cgtctactgggctgtctgaaggggctggcaggcagtgtgcgtgggttgcagtgccaccct
tcaaagcctctactagcctcctgtggcttggacagagtcttgaggatacacaggatccag
aatccacggggtctggagcataaggatgagccccaagagcctcaagaacccaacaaggtg
cccctagaagacacagagacagatgaactttgggcatccttggaggcagctgccaagcgg
aagctctcgggtttggagcagccccaaggagctctccaaacgagacggagaaagaagaag
cggcctgggtccaccagcccctga

>gi568815587r:62692479_62903979|GENSCAN_predicted_peptide_14|529_aa
MSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKF
TGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIVRAPRCRELPAQKWWHT
GALYRIGDLQAFQGHGAGNLAGLKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQID
PNFGSKEDFDSLLQSAKKKSIRVILDLTPNYRGENSWFSTQVDTVATKVKDALEFWLQAG
VDGFQVRDIENLKDASSFLAEWQNITKGFSEDRLLIAGTNSSDLQQILSLLESNKDLLLT
SSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWSLSQARLLTSFLPAQLLRLYQLMLFTLP
GTPVFSYGDEIGLDAAALPGQPMEAPVMLWDESSFPDIPGAVSANMTVKGQSEDPGSLLS
LFRRLSDQRSKERSLLHGDFHAFSAGPGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQAS
DLPASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA

>gi568815587r:62692479_62903979|GENSCAN_predicted_CDS_14|1590_bp
atgagccaggacaccgaggtggatatgaaggaggtggagctgaatgagttagagcccgag
aagcagccgatgaacgcggcgtctggggcggccatgtccctggcgggagccgagaagaat
ggtctggtgaagatcaaggtggcggaagacgaggcggaggcggcagccgcggctaagttc
acgggcctgtccaaggaggagctgctgaaggtggcaggcagccccggctgggtacgcacc
cgctgggcactgctgctgctcttctggctcggctggctcggcatgcttgctggtgccgtg
gtcataatcgtgcgagcgccgcgttgtcgcgagctaccggcgcagaagtggtggcacacg
ggcgccctctaccgcatcggcgaccttcaggccttccagggccacggcgcgggcaacctg
gcgggtctgaaggggcgtctcgattacctgagctctctgaaggtgaagggccttgtgctg
ggtccaattcacaagaaccagaaggatgatgtcgctcagactgacttgctgcagatcgac
cccaattttggctccaaggaagattttgacagtctcttgcaatcggctaaaaaaaagagc
atccgtgtcattctggaccttactcccaactaccggggtgagaactcgtggttctccact
caggttgacactgtggccaccaaggtgaaggatgctctggagttttggctgcaagctggc
gtggatgggttccaggttcgggacatagagaatctgaaggatgcatcctcattcttggct
gagtggcaaaatatcaccaagggcttcagtgaagacaggctcttgattgcggggactaac
tcctccgaccttcagcagatcctgagcctactcgaatccaacaaagacttgctgttgact
agctcatacctgtctgattctggttctactggggagcatacaaaatccctagtcacacag
tatttgaatgccactggcaatcgctggtgcagctggagtttgtctcaggcaaggctcctg
acttccttcttgccggctcaacttctccgactctaccagctgatgctcttcaccctgcca
gggacccctgttttcagctacggggatgagattggcctggatgcagctgcccttcctgga
cagcctatggaggctccagtcatgctgtgggatgagtccagcttccctgacatcccaggg
gctgtaagtgccaacatgactgtgaagggccagagtgaagaccctggctccctcctttcc
ttgttccggcggctgagtgaccagcggagtaaggagcgctccctactgcatggggacttc
cacgcgttctccgctgggcctggactcttctcctatatccgccactgggaccagaatgag
cgttttctggtagtgcttaactttggggatgtgggcctctcggctggactgcaggcctcc
gacctgcctgccagcgccagcctgccagccaaggctgacctcctgctcagcacccagcca
ggccgtgaggagggctcccctcttgagctggaacgcctgaaactggagcctcacgaagg
ctgctgctccgcttcccctacgcggcctga