GENSCAN 1.0    Date run: 4-Nov-116    Time: 18:47:11

Sequence gi568815584r:21252676_21483927 : 231252 bp : 42.80% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  14739  14883  145  0  1   52   77    62 0.486   0.86
 1.02 Intr +  16252  16437  186  0  0   32   37   189 0.884   7.16
 1.03 Intr +  16682  16823  142  1  1   43   30   154 0.347   4.11
 1.04 Intr +  35264  35386  123  0  0  106   88    77 0.848   9.14
 1.05 Intr +  42002  42134  133  0  1   56   75   147 0.862   8.98
 1.06 Intr +  48291  48562  272  0  2   71   64   197 0.712  11.86
 1.07 Intr +  49813  49909   97  2  1   71   74    57 0.709   0.75
 1.08 Intr +  50737  50868  132  1  0   58  103   146 0.388  11.94
 1.09 Intr +  55056  55161  106  0  1   67  105    87 0.982   7.60
 1.10 Intr +  55556  55601   46  1  1   80  115     0 0.627  -1.04
 1.11 Intr +  59066  59295  230  0  2   29   84   193 0.611   9.67
 1.12 Intr +  59764  59831   68  0  2   26  113    91 0.187   2.18
 1.13 Intr +  65021  65175  155  2  2   37  102   226 0.274  17.69
 1.14 Intr +  67342  67502  161  2  2   58   94   158 0.999  12.19
 1.15 Intr +  68584  68742  159  0  0   86   47   167 0.593  11.66
 1.16 Intr +  69179  69296  118  1  1   35   63   128 0.985   4.12
 1.17 Intr +  71943  72395  453  1  0   87  111   291 0.941  23.60
 1.18 Intr +  72557  72708  152  0  2   74   78   122 0.999   8.76
 1.19 Intr +  73156  73498  343  0  1   60   81   317 0.999  22.28
 1.20 Intr +  74948  75132  185  0  2  101   93   167 0.961  17.09
 1.21 Intr +  75701  75952  252  1  0  103   94   117 0.631  10.51
 1.22 Intr +  77577  77712  136  2  1   47   27    73 0.700  -3.68
 1.23 Intr +  81930  82030  101  1  2   73   92   118 0.514   9.61
 1.24 Intr +  90334  90553  220  0  1   52   72   166 0.326   8.35
 1.25 Intr +  92438  92522   85  0  1   37   72   171 0.509   8.36
 1.26 Intr +  95497  95627  131  1  2   35   79   126 0.979   5.82
 1.27 Term +  98429  98541  113  0  2   84   49    98 0.930   3.24
 1.28 PlyA +  98819  98824    6                              -0.45

 2.27 PlyA -  98837  98832    6                               1.05
 2.26 Term - 100143  99998  146  1  2   59   32   178 0.734   6.39
 2.25 Intr - 100890 100813   78  1  0   98    2   146 0.786   5.60
 2.24 Intr - 101157 101028  130  0  1   31   92   284 0.996  22.45
 2.23 Intr - 101865 101736  130  2  1   66   92   183 0.923  16.28
 2.22 Intr - 104691 104522  170  0  2   58  100   203 0.999  16.32
 2.21 Intr - 105327 105252   76  2  1   67   92    57 0.979   2.60
 2.20 Intr - 105752 105640  113  2  2  107   43   120 0.997   7.66
 2.19 Intr - 106934 106809  126  2  0   91   72   200 0.999  18.66
 2.18 Intr - 107858 107740  119  0  2  126   69    67 0.999   7.86
 2.17 Intr - 108297 108171  127  0  1   76   56   125 0.999   7.43
 2.16 Intr - 108538 108403  136  0  1   60   83   228 0.999  19.15
 2.15 Intr - 109649 109522  128  2  2   69   82   101 0.999   6.16
 2.14 Intr - 110272 110119  154  0  1   89   79   169 0.998  15.25
 2.13 Intr - 110474 110359  116  2  2   53   63   200 0.999  12.33
 2.12 Intr - 110653 110558   96  1  0   83   89   151 0.999  13.99
 2.11 Intr - 110828 110763   66  2  0  105   97    22 0.885   2.98
 2.10 Intr - 112264 112152  113  2  2  101   85   114 0.999  11.58
 2.09 Intr - 112468 112395   74  0  2  124  115    42 0.999   8.63
 2.08 Intr - 113854 113764   91  2  1   92   63   124 0.968   8.43
 2.07 Intr - 115766 115594  173  1  2   86   77   177 0.999  15.06
 2.06 Intr - 116680 116529  152  1  2   51  107   162 0.999  12.44
 2.05 Intr - 117221 117075  147  2  0   41   84   199 0.996  14.31
 2.04 Intr - 117813 117661  153  0  0   33   69   155 0.772   7.45
 2.03 Intr - 119369 119199  171  2  0   89   77   174 0.777  15.62
 2.02 Intr - 120755 120663   93  2  0   89   94   165 0.865  16.44
 2.01 Init - 131252 131187   66  2  0   77   76   166 0.999  15.42
 2.00 Prom - 132085 132046   40                             -15.49

 3.38 PlyA - 132544 132539    6                               1.05
 3.37 Term - 133501 132938  564  1  0  113   48   502 0.318  42.10
 3.36 Intr - 138388 138272  117  1  0   62  100    47 0.891   3.04
 3.35 Intr - 139006 138788  219  1  0   91  101   195 0.946  18.68
 3.34 Intr - 139271 139158  114  2  0   51  105    81 0.982   5.82
 3.33 Intr - 140134 139832  303  1  0  102   69   298 0.982  25.16
 3.32 Intr - 140579 140431  149  0  2   59   87   168 0.997  12.83
 3.31 Intr - 141520 140801  720  2  0  125   78   630 0.998  56.17
 3.30 Intr - 141810 141602  209  2  2  115  103   268 0.982  28.80
 3.29 Intr - 142387 142237  151  2  1   60   50   242 0.582  15.90
 3.28 Intr - 142677 142623   55  0  1   73   92    91 0.989   5.63
 3.27 Intr - 143217 143142   76  2  1   80   94    85 0.999   6.90
 3.26 Intr - 145277 145148  130  0  1  116   98   142 0.998  16.83
 3.25 Intr - 147030 146927  104  2  2   91   88    77 0.994   6.90
 3.24 Intr - 147395 147306   90  1  0  132   98    64 0.999   9.99
 3.23 Intr - 147632 147476  157  0  1  101   82    91 0.999   7.95
 3.22 Intr - 147937 147738  200  0  2   70   91   211 0.999  17.67
 3.21 Intr - 148396 148200  197  1  2   71   86   241 0.999  19.49
 3.20 Intr - 148807 148728   80  2  2  -37  113    85 0.256  -3.15
 3.19 Intr - 149461 149299  163  1  1   85   15   271 0.471  18.13
 3.18 Intr - 149828 149661  168  2  0   72   64   216 0.994  16.82
 3.17 Intr - 150537 150342  196  2  1   79  103   331 0.999  32.20
 3.16 Intr - 150988 150778  211  2  1  102   94   165 0.999  15.55
 3.15 Intr - 152789 152534  256  2  1   61   77   151 0.761   7.39
 3.14 Intr - 153189 153046  144  0  0  114   91    97 0.930  12.16
 3.13 Intr - 154357 154181  177  1  0   70  105   106 0.513   9.69
 3.12 Intr - 155880 155637  244  2  1   53  100   151 0.508   9.38
 3.11 Intr - 156150 156029  122  0  2   88  103    61 0.976   5.97
 3.10 Intr - 157313 157176  138  2  0   89  103   160 0.999  17.34
 3.09 Intr - 160321 160238   84  1  0   49  105    93 0.986   6.20
 3.08 Intr - 161743 161626  118  0  1   94   76    12 0.524   0.15
 3.07 Intr - 162318 162263   56  0  2   88   56    64 0.518   0.06
 3.06 Intr - 162967 162899   69  1  0   96   78   139 0.999  12.16
 3.05 Intr - 163232 163050  183  2  0  102   91   237 0.999  24.46
 3.04 Intr - 173567 173453  115  1  1   98   98   125 0.972  14.03
 3.03 Intr - 175579 175194  386  1  2   39   89   492 0.685  37.12
 3.02 Intr - 176660 176289  372  2  0   97   94   333 0.681  29.13
 3.01 Init - 178968 178126  843  0  0   90   62   943 0.903  86.62
 3.00 Prom - 202626 202587   40                              -1.15

 4.09 PlyA - 202915 202910    6                               1.05
 4.08 Term - 203975 203901   75  2  0   63   55   101 0.144   1.16
 4.07 Intr - 209743 209675   69  1  0  121   94    38 0.976   6.16
 4.06 Intr - 211092 210981  112  2  1   79   64   111 0.996   7.26
 4.05 Intr - 215774 215682   93  1  0   89   83    38 0.742   1.56
 4.04 Intr - 216033 215995   39  2  0   79  105    75 0.121   4.72
 4.03 Intr - 222259 222076  184  2  1   78   62   111 0.067   5.42
 4.02 Intr - 223924 223853   72  2  0   91   97    48 0.106   4.46
 4.01 Intr - 224694 224395  300  1  0   81   28   178 0.047   6.88

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg
P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 26111 26273 163 1 1 51 100 55 0.840 2.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:21252676_21483927|GENSCAN_predicted_peptide_1|1481_aa XHSIQNKVECHVKSSRDLITNVSCLNSLQKEHQPPSIPESGTSMRGKKKKNGGNRRHRRY VNFDYINEHHDLHFHDPVVLAVVENERKAAVNTGRAKKGHFPWRSWPRGHQRSRLLLEVG NAATTAQSSSLHKMAPESPLPPRRRPSPRPIPENLPIGLSSGISYSLGTEIMSHLVDPTS GDLPVRDIDAIPLVLPASKGKNMKTQPPLSRMNREELEDSFFRLREDHMLVKELSWKQQD EIKRLRTTLLRLTAAGRDLRVAEEAAPLSETARRGQKAGWRQRLSMHQRPQMHRLQGHFH CVGPASPRRAQPRVQVGHRQLHTAGAPVPEKPKRGPRDRLSYTAPPSFKEHATNENRGEV ASKPSELAHIMASNTMQVEEPPKSPEKMWPKDENFEQRSSLECAQKAAELRASIKEKVEL IRLKKLLHERNASLVMTKAQLTEVQEIFTVMIFFKPDNSPQENARVAGVWRPLVLSDMTI PRGTFLLTQNQGILSAAHEALLKQVNELRAELKEESKKAVSLKSQLEDVSILQMTLKEER VEDLEKERKLLNDNYDKLLESMLDSSDSSSQPHWSNELIAEQLQQQVSQLQDQLDAELED KRKVLLELSREKAQNEDLKLEVTNILQKHKQEVELLQNAATISQPPDRQSEPATHPAVLQ ENTQIEPSEPKNQEEKKLSQVLNELQVSHAETTLELEKTRDMLILQRKINVCYQVQGKME ELEAMMTKADNDNRDHKEKLERLTRLLDLKNNRIKQLEEQLKDVAYGTRPLSLCLETLPA HGDEDKVDISLLHQGENLFELHIHQAFLTSAALAQAGDTQPTTFCTYSFYDFETHCTPLS VGPQPLYDFTSQYVMETDSLFLHYLQEASARLDIHQAMASEHSTLAAGWICFDRVLETVE KVHGLATLIGAGGEEFGVLEYWMRLRFPIKPSLQACNKRKKAQVYLSTDVLGGRKAQEEE FRSESWEPQNELWIEITKCCGLRSRWLGTQPSPYAVYRFFTFSDHDTAIIPASNNPYFRD QARFPVLVTSDLDHYLRREALSIHVFDDEDLEPGSYLGRARVPLLPLAKNESIKGDFNLT DPAEKPNGSIQVQLDWKFPYIPPESFLKPEAQTKGKDTKDSSKISSEEEKASFPSQVVKL PACNAISDLSIQDQMASPEVPIEAGQYRSKRKPPHGGERKEKEHQVVSYSRRKHGKRIGV QGKNRMEYLSLNILNGNTPEVNYTEWKFSETNSFIGDGFKNQHEEEEMTLSHSALKQKEP LHPVNDKESSEQGSEVSEAQTTDSDDVIVPPMSQKYPKAELNKHFPYQDSEKMCIEIVSL AFYPEAEVMSDENIKQVYVEYKFYDLPLSETETPVSLRKPRAGEEIHFHFSKVIDLDPQE QQGRRRFLFDMLNGQDPDQGHLKFTVVSDPLDEEKKECEEVGYAYLQLWQILESGRDILE QELDIVSPEDLATPIGRLKVSLQAAAVLHAIYKEMTEDLFS >gi568815584r:21252676_21483927|GENSCAN_predicted_CDS_1|4446_bp nnccattccatccaaaacaaggtcgaatgccatgtcaaatcctctagagatttaattaca aacgtaagctgtttaaattctctacagaaggaacatcaaccaccaagtattcccgagagt 
ggcactagcatgagggggaaaaaaaagaagaacggaggtaatagaaggcatcggcgctac gttaacttcgactatattaatgagcatcacgatcttcattttcatgatcccgtagtcctc gcggtcgtagaaaatgaaaggaaagcagccgttaacacaggaagagcgaaaaaaggacat tttccctggcgatcgtggcctagaggacaccagagaagccgactgctgctggaggtcggc aacgcggccacaaccgctcagtcttcgtctcttcacaaaatggcccccgagtctcctctg ccgccgaggcgaaggcctagtccccgccccattcctgagaatcttcctattggcttgtcc tctgggatctcttacagcttgggaacagagatcatgtcacatctggtggaccctacatca ggagacttgccagttagagacatagatgctatacctctggtgctaccagcctcaaaaggt aagaatatgaaaactcaaccacccttgagcaggatgaaccgggaggaattggaggacagt ttctttcgacttcgcgaagatcacatgttggtgaaggagctttcttggaagcaacaggat gagatcaaaaggctgaggaccaccttgctgcggttgaccgctgctggccgggacctgcgg gtcgcggaggaggcggcgccgctctcggagaccgcaaggcgcgggcagaaggcgggatgg cggcagcgcctctccatgcaccagcgcccccagatgcaccgactgcaagggcatttccac tgcgtcggccctgccagcccccgccgcgcccagcctcgcgtccaagtgggacacagacag ctccacacagccggtgcaccggtgccggagaaacccaagagggggccaagggacaggctg agctacacagcccctccatcgtttaaggagcatgcgacaaatgaaaacagaggtgaagta gccagtaaacccagtgaacttgcccacatcatggccagcaataccatgcaagtggaagag ccacccaagtctcctgagaaaatgtggcctaaagatgaaaattttgaacagagaagctca ttggagtgtgctcagaaggctgcagagcttcgagcttccattaaagagaaggtagagctg attcgacttaagaagctcttacatgaaagaaatgcttcattggttatgacaaaagcacaa ttaacagaagttcaagagatatttacagtcatgatcttttttaagcctgacaacagccct caagagaatgctagggttgctggtgtctggagaccactcgtgctgagtgatatgaccatt cccagaggtactttccttttgacccagaatcagggaatcctgagtgcagcccatgaggcc ctcctcaagcaagtgaatgagctcagggcagagctgaaggaagaaagcaagaaggctgtg agcttgaagagccaactggaagatgtgtctatcttgcagatgactctgaaggaggagaga gttgaagatttggaaaaagaacgaaaattgctgaatgacaattatgacaaactcttagaa agcatgctggacagcagtgacagctccagtcagccccactggagcaacgagctcatagcg gaacagctacagcagcaagtctctcagctgcaggatcagctggatgctgagctggaggac aagagaaaagttttacttgagctgtccagggagaaagcccaaaatgaggatctgaagctt gaagtcaccaacatacttcagaagcataaacaggaagtagagctcctccaaaatgcagcc acaatttcccaacctcctgacaggcaatctgaaccagccactcacccagctgtattgcaa gagaacactcagatcgagccaagtgaacccaaaaaccaagaagaaaagaaactgtcccag 
gtgctaaatgagttgcaagtatcacacgcagagaccacattggaactagaaaagaccagg gacatgcttattctgcagcgcaaaatcaacgtgtgttatcaggtgcaaggaaagatggag gaactggaggcaatgatgacaaaagctgacaatgataatagagatcacaaagaaaagctg gagaggttgactcgactactagacctcaagaataaccgtatcaagcagctggaagaacag ctcaaagatgttgcttatggcacccgaccgttgtcgttatgtttggaaacactgccagcc catggagatgaggataaagtggatatttctctgctgcatcagggtgagaatctttttgaa ctgcacatccaccaggccttcctgacatctgccgccctagctcaggctggagatacccaa cctaccactttctgcacctattccttctatgactttgaaacccactgtaccccattatct gtggggccacagcccctctatgacttcacctcccagtatgtgatggagacagattcgctt ttcttacactaccttcaagaggcttcagcccggcttgacatacaccaggccatggccagt gaacacagcactcttgctgcaggatggatttgctttgacagggtgctagagactgtggag aaagtccatggcttggccacactgattggagctggtggagaagagttcggggttctagag tactggatgaggctgcgtttccccataaaacccagcctacaggcgtgcaataaacgaaag aaagcccaggtctacctgtcaaccgatgtgcttggaggccggaaggcccaggaagaggag ttcagatcggagtcttgggaacctcagaacgagctgtggattgaaatcaccaagtgctgt ggcctccggagtcgatggctgggaactcaacccagtccatatgctgtgtaccgcttcttc accttttctgaccatgacactgccatcattccagccagtaacaacccctactttagagac caggctcgattcccagtgcttgtgacctctgacctggaccattatctgagacgggaggcc ttgtctatacatgtttttgatgatgaagacttagagcctggctcgtatcttggccgagcc cgagtgcctttactgcctcttgcaaaaaatgaatctatcaaaggtgattttaacctcact gaccctgcagagaaacccaacggatctattcaagtgcaactggattggaagtttccctac ataccccctgagagcttcctgaaaccagaagctcagactaaggggaaggataccaaggac agttcaaagatctcatctgaagaggaaaaggcttcatttccttcccaggttgttaaacta ccagcttgtaatgctatctctgatctttctattcaggatcagatggcatctcctgaggtt cccattgaagctggccagtatcgatctaagagaaaacctcctcatgggggagaaagaaag gagaaggagcaccaggttgtgagctactcaagaagaaaacatggcaaaagaataggtgtt caaggaaagaatagaatggagtatcttagccttaacatcttaaatggaaatacaccagag gtgaattacactgagtggaagttctcagagactaacagcttcataggtgatggctttaaa aatcagcacgaggaagaggaaatgacattatcccattcagcactgaaacagaaggaacct ctacatcctgtaaatgacaaagaatcctctgaacaaggttctgaagtcagtgaagcacaa actaccgacagtgatgatgtcatagtgccacccatgtctcagaaatatcctaaggcagaa ctaaataaacattttccttatcaggattcagagaagatgtgcattgaaattgtctccctg 
gccttctacccagaggcagaagtgatgtctgatgagaacataaaacaggtgtatgtggag tacaaattctacgacctacccttgtcggagacagagactccagtgtccctaaggaagcct agggcaggagaagaaatccactttcactttagcaaggtaatagacctggacccacaggag cagcaaggccgaaggcggtttctgttcgacatgctgaatggacaagatcctgatcaagga catttaaagtttacagtggtaagtgatcctctggatgaagaaaagaaagaatgtgaagaa gtgggatatgcatatcttcaactgtggcagatcctggagtcaggaagagatattctagag caagagctagacattgttagccctgaagatctggctaccccaataggaaggctgaaggtt tcccttcaagcagctgctgtcctccatgctatttacaaggagatgactgaagatttgttt tcatga >gi568815584r:21252676_21483927|GENSCAN_predicted_peptide_2|1047_aa MAVTLDKDAYYRRVKRLYSNWRKGEDEYANVDAIVVSVGVDEEIVYAKSTALQTWLFGYE LTDTIMVFCDDKIIFMASKKKVEFLKQIANTKGNENANGAPAITLLIREKNESNKSSFDK MIEAIKESKNGKKIGVFSKDKFPGEFMKSWNDCLNKEGFDKIDISAVVAYTIAVKEDGEL NLMKKAASITSEVFNKFFKERVMEIVDADEKVRHSKLAESVEKAIEEKKYLAGADPSTVE MCYPPIIQSGGNYNLKFSVVSDKNHMHFGAITCAMGIRFKSYCSNLVRTLMVDPSQEVQE NYNFLLQLQEELLKELRHGVKICDVYNAVMDVVKKQKPELLNKITKNLGFGMGIEFREGS LVINSKNQYKLKKGMVFSINLGFSDLTNKEGKKPEEKTYALFIGDTVLVDEDGPATVLTS VKKKVKNVGIFLKNEDEEEEEEEKDEAEDLLGRGSRAALLTERTRNEMTAEEKRRAHQKE LAAQLNEEAKRRLTEQKGEQQIQKARKSNVSYKNPSLMPKEPHIREMKIYIDKKYETVIM PVFGIATPFHIATIKNISMSVEGDYTYLRINFYCPGSALGRNEGNIFPNPEATFVKEITY RASNIKAPGEQTVPALNLQNAFRIIKEVQKRYKTREAEEKEKEGIVKQDSLVINLNRSNP KLKDLYIRPNIAQKRMQGSLEAHVNGFRFTSVRGDKVDILYNNIKHALFQPCDGEMIIVL HFHLKNAIMFGKKRHTDVQFYTEVGEITTDLGKHQHMHDRDDLYAEQMEREMRHKLKTAF KNFIEKVEALTKEELEFEVPFRDLGFNGAPYRSTCLLQPTSSALVNATEWPPFVVTLDEV ELIHFERVQFHLKNFDMVIVYKDYSKKVTMINAIPVASLDPIKEWLNSCDLKYTEGVQSL NWTKIMKTIVDDPEGFFEQGGWSFLEPEGEGSDAEEGDSESEIEDETFNPSEDDYEEEEE DSDEDYSSEAEESDYSKESLGSEEESGKDWDELEEEARKADRESRYEEEEEQSRSMSRKR KASVHSSGRGSNRGSRHSSAPPKKKRK >gi568815584r:21252676_21483927|GENSCAN_predicted_CDS_2|3144_bp atggctgtgactctggacaaagacgcttattatcggcgagtgaagagactgtacagcaat tggcggaaaggagaagatgagtatgccaacgttgatgccattgttgtatcagtgggtgtt gatgaagaaattgtttatgccaaatcaactgccttacagacatggctctttggttatgaa ctaactgatactatcatggtcttttgtgatgacaaaatcatctttatggccagcaagaaa aaagtggagttcttgaaacagattgccaacactaagggcaatgagaatgctaatggagcc 
cctgccatcacactgctaatacgagaaaagaatgaaagtaataagagtagctttgacaaa atgattgaagccattaaagaaagcaagaatggcaagaagattggagtgttcagcaaagac aaattccctggagagttcatgaagagctggaatgactgcctcaacaaagaaggctttgac aaaatagatatcagtgcagttgtggcatataccatcgctgtaaaggaggatggggagctc aacctaatgaagaaagcagccagcatcacttctgaagtcttcaacaaattcttcaaggaa agagtcatggaaatagttgatgcagatgagaaagttcgacacagcaaactggctgagtct gtggaaaaggccattgaagagaaaaaataccttgctggggcagacccttctactgtggaa atgtgttaccctcctatcattcagagtggtggcaactataatctcaagttcagtgtggtg agtgacaagaatcatatgcactttggggctatcacttgtgccatgggtattcgcttcaag tcttactgctccaaccttgttcgcactttgatggttgatccttctcaagaagttcaagaa aattataactttttgctccagcttcaagaggagctgctgaaggaattaagacatggtgtg aagatatgtgacgtgtataacgctgtcatggacgtggttaaaaagcagaagccagaactg ctgaacaaaattaccaaaaacctagggtttgggatgggaattgaattccgtgaaggctcc ctagtaatcaatagcaaaaatcaatacaaactgaagaaaggaatggttttcagcatcaat ttaggattctcagacctgactaacaaggaggggaaaaagccagaagagaaaacctatgcc ctgttcattggtgacacagtgcttgtggatgaggatggcccagctactgttctcacttct gtgaagaagaaagtgaagaatgtggggattttcctaaagaatgaagatgaggaagaagag gaggaggagaaagatgaggcagaggaccttttgggaagaggttctcgggcagcattactt acagaaagaacaagaaatgaaatgactgcagaagagaagcgaagagcacatcagaaagaa ctagcggctcaactcaatgaagaagcaaagaggcgattgactgaacaaaagggagaacag cagattcagaaagctcgcaagtctaatgtgtcctataaaaacccatctctgatgcctaag gaaccacatattcgggaaatgaagatctacatcgataagaaatatgagactgtaataatg cccgtgtttggcattgcaacaccgtttcacattgccacaatcaagaatataagtatgtcc gtggaaggagattatacttacttgcgaatcaacttttattgcccaggcagtgctctgggc aggaatgaaggcaacatctttcctaaccctgaagcgacttttgtcaaggaaattacatac cgagcatcaaatattaaggcacccggagaacagacagtaccagccttgaaccttcagaat gctttccgaattattaaagaagtacagaaacgttataaaactcgagaagctgaagagaaa gagaaggaggggattgtaaaacaagactcactggtgatcaatctaaaccggagtaatccg aaactgaaagatctatacattcgcccaaatattgcccaaaagaggatgcaaggctcactg gaggcccatgtcaatggcttccgcttcacatctgttcgaggagacaaagtggatattttg tacaataatattaagcatgctttgttccagccctgtgatggagaaatgattattgtcttg cactttcacctcaagaatgccatcatgtttgggaagaagcggcacacggatgtgcagttc 
tacacagaagtgggagagataaccacggacttggggaaacatcagcatatgcatgaccga gatgacctctatgctgagcagatggaacgagaaatgaggcacaaactgaaaacagccttt aaaaatttcattgagaaagtagaggctctaactaaggaggaactggaatttgaagtgcct tttagggacttgggatttaacggagctccctataggagtacctgcctccttcagcccact agtagtgcgctggtaaatgctacggaatggccaccttttgtggtgacattggatgaggta gagctgatccactttgagcgggtccagtttcacctgaagaactttgatatggtaatcgtc tacaaggactacagcaagaaagtgaccatgatcaacgccattcctgtagcctctcttgac cccatcaaggaatggttgaattcctgcgacctgaaatacacagaaggagtacagtccctc aactggactaaaatcatgaagaccattgttgatgaccctgagggcttcttcgaacaaggt ggctggtctttcctggagcctgagggtgaggggagtgatgctgaagaaggggattcagag tctgaaattgaagatgagacttttaatccttcagaagatgactatgaagaggaagaggag gacagtgatgaagattattcatcagaagcagaagagtcagactattctaaggagtcattg ggtagtgaagaagagagtggaaaggattgggatgaactggaggaagaagcccgaaaagcg gaccgagaaagtcgttacgaggaagaagaagaacaaagtcgaagtatgagccggaagagg aaggcatctgtgcacagttcgggccgtggctctaaccgtggttccagacacagctctgca ccccccaagaaaaagaggaagtaa >gi568815584r:21252676_21483927|GENSCAN_predicted_peptide_3|2559_aa MADPIMDLFDDPNLFGLDSLTDDSFNQVTQDPIEEALGLPSSLDSLDQMNQDGGGGDVGN SSASELVPPPEETAPTELSKESTAPAPESITLHDYTTQPASQEQPAQPVLQTSTPTSGLL QVSKSQEILSQGNPFMGVSATAVSSSSAGGQPPQSAPKIVILKAPPSSSVTGAHVAQIQA QGITSTAQPLVAGTANGGKVTFTKVLTGTPLRPGVSIVSGNTVLAAKVPGNQAAVQRIVQ PSRPVKQLVLQPVKGSAPAGNPGATGPPLKPAVTLTSTPTQGESKRITLVLQQPQSGGPQ GHRHVVLGSLPGKIVLQGNQLAALTQAKNAQGQPAKVVTIQLQVQQPQQKIQIVPQPPSS QPQPQQPPSTQPVTLSSVQQAQIMGPGQSPGQRLSVPVKVVLQPQAGSSQGASSGLSVVK VLSASEVAALSSPASSAPHSGGKTGMEENRRLEHQKKQEKANRIVAEAIARARARGEQNI PRVLNEDELPSVRPEEEGEKKRRKKSAGERLKEEKPKKSKTSGASKTKGKSKLNTITPVV GKKRKRNTSSDNSDVEVMPAQSPREDEESSIQKRRSNRQVKRKKYTEDLDIKITDDEEEE EVDVTGPIKPEPILPEPVQEPDGETLPSMQFFVENPSEEDAAIVDKVLSMRIVKKELPSG QYTEAEEFFVKYKNYSYLHCEWATISQLEKDKRIHQKLKRFKTKMAQMRHFFHEDEEPFN PDYVEVDRILDESHSIDKDNGEPVIYYLVKWCSLPYEDSTWELKEDVDEGKIREFKRIQS RHPELKRVNRPQASAWKKLELSHEYKNRNQLREYQLEGVNWLLFNWYNRQNCILADEMGL GKTIQSIAFLQEVYNVGIHGPFLVIAPLSTITNWEREFNTWTEMNTIVYHGSLASRQMIQ QYEMYCKDSRGRLIPGAYKFDALITTFEMILSDCPELREIEWRCVIIDEAHRLKNRNCKL 
LDSLKHMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSQFPSESEFLKDFGDLKTEEQVQK LQAILKPMMLRRLKEDVEKNLAPKQETIIEVELTNIQKKYYRAILEKNFSFLSKGAGHTN MPNLLNTMMELRKCCNHPYLINGAEEKILTEFREACHIIPHDFHLQAMVRSAGKLVLIDK LLPKLKAGGHKVLIFSQMVRCLDILEDYLIQRRYLYERIDGRVRGNLRQAAIDRFSKPDS DRFVFLLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKAVKVYRLITR NSYEREMFDKASLKLGLDKAVLQSMSGRDGNITGIQQFSKKEIEDLLRKGAYAAIMEEDD EGSKFCEEDIDQILLRRTTTITIESEGKDISLDDPNFWQKWAKKADLDMDLLNSKNNLVI DTPRVRKQTRHFSTLKDDDLVEFSDLESEDDERPRSRRHDRHHAYGRTDCFRVEKHLLVY GWGRWRDILSHGRFKRRMTERDVETICRAILVYCLLHYRGDENIKGFIWDLISPAENGKT KELQNHSGLSIPVPRGRKGKKVKSQSTFDIHKADWIRKYNPDTLFQDESYKKHLKHQCNK VLLRVRMLYYLRQEVIGDQAEKVLGGAIASEIDIWFPVVDQLEVPTTWWDSEADKSLLIG VFKHGYEKYNTMRADPALCFLEKAGRPDDKAIAAEHRVLDNFSDIVEGVDFDKDCEDPEY KPLQGPPKDQDDEGDPLMMMDEEISVIDGDEARLRRLVTAYQRSYKREQMKIEAAERGDR RRRRCEAAFKLKEIARREKQQRWTRREQTDFYRVVSTFGVEYDPDTMQFHWDRFRTFARL DKKTDESLTKYFHGFVAMCRQVCRLPPAAGDEPPDPNLFIEPITEERASRTLYRIELLRR LREQVLCHPLLEDRLALCQPPGPELPKWWEPVRHDGELLRGAARHGVSQTDCNIMQDPDF SFLAARMNYMQNHQAGAPAPSLSRCSTPLLHQQYTSRTASPLPLRPDAPVEKSPEETATQ VPSLESLTLKLEHEVVARSRPTPQDYEMRVSPSDTTPLVSRSVPPVKLEDEDDSDSELDL SKLSPSSSSSSSSSSSSSSTDESEDEKEEKLTDQSRSKLYDEESLLSLTMSQDGFPNEDG EQMTPELLLLQERQRASEWPKDRVLINRIDLVCQAVLSGKWPSSRRSQEMVTGGILGPGN HLLDSPSLTPGEYGDSPVPTPRSSSAASMAEEEASAVSTAAAQFTKLRRGMDEKEFTVQI KDEEGLKLTFQKHKLMANGVMGDGHPLFHKKKGNRKKLVEKSMCSWGWILGLQLEVECME EPNHLDVDLETRIPVINKVDGTLLVGEDAPRRAELEMWLQGHPEFAVDPRFLAYMEDRRK QKWQRCKKNNKAELNCLGMEPVQTANSRNGKKGHHTETVFNRVLPGPIAPESSKKRARRM RPDLSKMMALMQGGSTGSLSLHNTFQHSSSGLQSVSSLGHSSATSASLPFMPFVMGGAPS SPHVDSSTMLHHHHHHPHPHHHHHHHPGLRAPGYPSSPVTTASGTTLRLPPLQPEEDDDE DEEDDDDLSQGYDSSERDFSLIDDPMMPANSDSSEDADD >gi568815584r:21252676_21483927|GENSCAN_predicted_CDS_3|7680_bp atggcagaccccatcatggatctgttcgatgacccaaatttatttggcctggactctctg actgatgacagctttaaccaggtcacacaagaccccattgaggaagcccttggactgcca agctctctggactccttggatcagatgaaccaggatggtggaggtggtgatgtggggaat tcatcagcaagtgaactggtccctccaccagaggaaacagctcccacagaactttccaaa gaatccacagctccagctccagaatccataaccttgcatgattataccactcagcctgcc 
agccaggagcagccagcccaacctgtcttacagacatcgacgccaacatcaggacttttg caagtctccaagagccaggagatcctgagccaagggaatcctttcatgggtgtctctgcc acagctgtctcctccagtagtgctggagggcagccacctcagtcagcccctaagattgtt atccttaaggccccaccaagctcctcagtcactggtgcccatgtggcacaaattcaggcc caaggtatcaccagcacagctcagcccctggtggcaggcacagccaatggtggaaaagtc acttttaccaaagtgctaaccggcacaccccttcgaccaggtgtttccattgtctctggt aatacagtgttggccgccaaggtccctgggaaccaggctgctgttcagcgcattgtccag cccagccgaccagtaaagcagctggtcctccagccagttaagggttcagctcctgctgga aaccctggggccacagggcccccactgaagcctgcagttacactgacctctacacctacc cagggtgaatcgaaacgcatcaccctggtcctccagcagccacagtctggaggtccccaa ggacatcggcatgttgtgctagggagtctaccaggcaagatagtgttacagggcaaccag ctggcagccctgactcaagccaagaatgcccaagggcagcctgccaaggtagtaactatc cagctgcaggtgcagcagccacagcaaaaaatccagattgtaccacaaccaccatcatcg cagccacagccccagcagccaccctccacccagccagtgactctgtcctctgtacagcag gctcagataatgggaccaggacaaagcccaggacaaagactttcagtaccagtcaaggtg gtactgcagccacaggctggctcttcccaaggggcctcttctgggctctctgtagttaaa gttctgagtgccagtgaagtggcagctttgtcatcaccagcaagctctgctcctcattcg gggggaaagacaggaatggaggaaaaccgcagattggaacaccagaagaagcaagagaaa gcaaatcggattgtagcagaggccattgcgagagcccgtgcccgcggtgagcagaacata cctcgagtcttaaatgaggacgagttgcccagcgttcggccagaggaggaaggcgagaag aaacgcaggaagaagagtgctggggagaggctgaaagaggagaagccaaagaagagtaaa acatctggtgcctccaaaacaaagggcaagagcaagctcaacaccatcactcctgtagtg ggtaagaagagaaaacgtaatacctcatctgataattcagatgtggaagtcatgcctgca cagtcacctcgagaagatgaagaaagcagcattcagaagagacgctcaaaccgccaagtt aagcgaaaaaaatatacagaggacctggatataaagatcacagatgatgaagaagaagaa gaggtggatgtaactggtccaataaaacctgagcctatcctccctgaaccagtgcaagaa ccagatggcgagactttgccttccatgcagttctttgtggagaatcccagtgaagaagat gcagccattgtagacaaagtgctttctatgcggattgtgaagaaggagctcccttctgga caatatactgaagcagaagaattctttgtcaagtacaagaactactcctatctgcattgt gaatgggctactatctcccaactagagaaggataagaggatacatcaaaaattaaagcgc ttcaaaaccaaaatggctcagatgagacacttcttccatgaggatgaagagccctttaat ccagactacgtagaggtggataggatattggatgagtctcacagtattgacaaggacaat 
ggggagcccgttatttactacctggtgaaatggtgctctctgccctatgaggatagcaca tgggagctaaaagaagatgttgatgagggcaagattcgagaatttaaacggattcagtca aggcacccagaactcaaaagggtgaatcgtccgcaggcaagtgcctggaagaaattggag ctatcacatgaatataaaaacagaaaccagctacgggaatatcagttggaaggggttaat tggctgctctttaattggtataacaggcagaactgcatcctggctgatgagatgggattg ggcaaaactattcagtccattgccttcttgcaggaagtatataatgtgggcatccatggt cccttcttggtcattgccccactgtccacaattactaactgggagcgagaatttaataca tggacagaaatgaacactattgtgtaccatggcagtctggccagcaggcagatgattcaa cagtatgaaatgtactgcaaagattcacggggacgcctcatcccaggcgcatacaagttt gacgctctgatcaccacttttgagatgattttgtcagattgtcctgagcttcgtgaaatt gaatggcgttgtgttatcattgatgaagcccatcgactgaaaaaccgtaattgcaagctg cttgatagtctcaagcacatggacctggaacacaaggtgctactcacaggaacaccattg caaaatactgtagaagaactgtttagcttgcttcatttcttggaaccgtcacaatttccc tcagaatcagagtttctcaaggactttggggatctcaagacagaggaacaggttcaaaag ctacaggccattcttaagccaatgatgctgagaagactcaaagaggatgttgaaaaaaac ttggcacccaaacaggaaacaattattgaagtagagctgactaatatccagaagaaatac tatcgggctattttggagaagaatttctccttcctttccaaaggggcaggtcataccaac atgcctaatctacttaacacaatgatggagttgcgcaagtgctgcaaccacccatatctc atcaatggtgctgaagaaaaaatcctaacagaattccgtgaagcttgccatattatacct catgactttcacctgcaggccatggttcgttcagccggcaaactggttcttattgacaag ttgcttccaaagcttaaagctggtggccataaagttctgatcttctctcagatggtgcgc tgcctagacatcctagaggattatttaatccagaggaggtacttatatgaacgtattgat gggcgagttagaggcaaccttcgacaggctgccattgaccgcttcagcaagcctgactca gaccgctttgtcttcttactgtgtacccgggctggtggacttggtattaatcttacagct gctgatacctgcatcatctttgattcagactggaatccacaaaatgacctgcaggcccag gcacgatgtcatcgaattgggcagagcaaagctgtgaaggtgtaccgcctcatcactcgt aattcctacgagagagagatgtttgataaggccagcctcaagttggggttggataaggct gtgcttcaatccatgagtggtcgggatggcaacattactggaatccaacagttctctaag aaggagattgaagatcttttaagaaaaggagcatatgcagccatcatggaggaagatgat gaaggctccaagttttgtgaagaggacattgaccagatcttgttaagacgaactacaacc atcaccattgaatctgaaggaaaagatatttctttggatgaccccaacttttggcaaaag tgggccaaaaaggctgacctagacatggatctgctcaacagcaagaataatttggtaatt 
gacacacctagagtacgaaaacaaacgcgccactttagcactctgaaagatgatgacctg gtggaattctctgatttggaaagtgaggatgatgagcggccacgctcccgcagacatgac cgtcatcatgcctatgggcgcactgactgctttcgggtggaaaagcatctcctggtatat ggttggggacgatggcgagatattttatctcatggacgcttcaagcgacgtatgactgaa cgagatgtggagaccatttgtcgggccattctcgtgtactgtcttctacactaccgtggg gatgaaaatattaaaggcttcatctgggacttgattagcccagctgaaaatggcaagaca aaagaattgcagaatcattcaggtctatctatccctgtgcctcgtggacgcaaaggaaaa aaagtaaagtcacaaagcacttttgatatccataaggcagattggatccggaaatataac cctgacactttgttccaagatgaaagttataagaagcacttgaaacatcagtgtaacaag gtactgttgcgggtacgaatgctatactacctgaggcaggaggttattggagaccaagca gaaaaggtgttagggggtgcgattgccagtgagattgacatatggttcccagtagtggat caactggaggttccaacaacttggtgggacagtgaggctgacaagtcgctgctcattgga gtctttaaacatggctatgagaaatataataccatgagggcagacccagccttatgtttc ctagaaaaggctggccgaccagatgacaaagcaattgcagcagaacatcgagtgttggat aacttctctgacatagtagaaggggttgactttgataaagattgtgaagatcctgaatat aaaccactccaaggtcccccaaaggaccaagatgatgagggtgatcccttgatgatgatg gatgaggagatctcagtgattgatggagatgaagctaggcttcggcgtctagtaacagcg tatcagcgcagctacaagagagaacaaatgaagatagaggctgcagaacgtggggaccgg cgaaggcggcgttgtgaagcagccttcaagctgaaagaaattgcacggcgggagaaacaa caacgatggacaaggcgtgaacaaactgatttttatcgagtggtgtctacgtttggtgtg gaatatgaccctgacaccatgcagttccattgggatcgcttccgcacttttgctcgacta gacaaaaagacagatgaaagccttaccaagtacttccatggctttgtggccatgtgccgc caagtatgccgccttcccccagcagctggagatgaaccccccgaccctaacctgttcatt gagcccatcactgaggagagagcctcacggactctctaccgtatagaattgcttcggcgc ttacgggaacaagttttatgccacccccttttggaagatcggctggcattgtgtcagcct cctggtcctgaattgcccaaatggtgggagcctgttcggcatgatggggagcttctaaga ggggcagcccgccatggggtgagccaaacagactgcaacatcatgcaggacccagacttc tcttttctggctgcccgtatgaattatatgcagaaccatcaagcaggagcaccagctcca tccttgtcacgctgctctactccactgctgcaccagcagtatacctcacgcactgcctca ccactgcccctgcgcccagatgctcctgttgaaaagtcacccgaggagacagctacccag gtccccagtctggagagtctgactttaaagctagagcacgaggtggtggccaggagccga ccaaccccacaagactatgagatgcgagtatccccctctgatactacccctctggtttcc 
cggagtgttccaccagtcaaactggaggatgaggatgattcggactctgagctggacttg agcaagctgtcaccatcttcttcttcttcctcatcctcatccagctccagctccagcact gatgagagtgaggatgagaaggaagagaagctaactgaccagtcccgctcaaagctctat gatgaagagagtctcctgtccctcactatgtcccaagatggattcccaaatgaagatgga gaacaaatgacccctgagcttctgctactgcaggaaagacaaagagcctctgagtggccc aaggatcgtgtcctgataaaccgtattgacctcgtctgccaggctgtactctcagggaag tggccttctagccgtaggagccaggaaatggtaacaggaggaattttggggccaggcaac cacttgctagacagtccctcattgactcctggagaatatggtgactctccagtccccaca ccacgaagtagtagtgcagcttccatggcagaggaggaagcatctgcagtcagcacagcg gcagcccagttcaccaaacttcgccgaggcatggatgaaaaggagtttacagttcaaatc aaagatgaggaaggattgaagttaacattccagaagcacaagttgatggcgaatggagta atgggagatggacatccactgtttcataagaagaaggggaacagaaagaagctagtagag aagagtatgtgctcatgggggtggattctgggtttgcagctggaggtggagtgcatggaa gagcctaatcaccttgatgtggacctggagacccggatccctgtcatcaataaggtggat ggtactttgctggtgggtgaggatgcccctcgccgggctgaactggagatgtggttacag ggtcatccagagtttgctgttgatccccgatttctagcgtatatggaggatcgcagaaaa cagaagtggcaaagatgtaaaaaaaataataaggcagaattgaactgtttgggaatggaa ccagtacagacagctaactctagaaatgggaaaaagggtcatcacactgaaacggtgttc aaccgggttttgccagggcctattgcaccagagagcagcaagaagcgggcccgtaggatg cgaccagacctttctaagatgatggccctcatgcagggtggaagcactgggtctctatct ctgcataacacgttccaacacagcagtagtggcctacagtctgtgtcatctttgggtcac agcagtgccacttctgcatctttgccttttatgccatttgtgatgggtggtgcaccatca tcccctcatgtagactccagcaccatgcttcatcaccaccaccaccacccccacccccac catcaccaccatcaccatccaggcttgagagcccctggctacccctcttcaccagtgact accgcctctggtactaccttgcggttgccaccactgcaacctgaggaggatgacgatgag gatgaagaagatgatgatgacttatctcagggctatgatagctcagaaagggacttctca ctcattgatgatcctatgatgccagctaactcagactccagtgaagatgctgatgactga >gi568815584r:21252676_21483927|GENSCAN_predicted_peptide_4|314_aa XASPPGFPSLLQPGRPRSLASLLSYQVPTSIFTQPLPPSQTLIAPTAPTGLGTAVTSART CVLSADWKERRGAAESQALGYRPLEFGGIRACRSARSVPGGVGKSCLLLQFTDKRFQPVH DLTIGVEFGARMVNIDGKQIKLQIWDTVRVQNNLVSDQYPSITRKADAKERGDEGGTNGG DKVGERGAAGALLVYDITRRETFNHLTSWLEDARQHSSSNMVIMLIGNKSDLESRRDVKR 
EEGEAFAREHGLIFMETSAKTACNVEEAFINTAKEIYRKIQQGLFDVHNELSERVVVVVA RNHIRHYSRVSRRD >gi568815584r:21252676_21483927|GENSCAN_predicted_CDS_4|945_bp ntcgcctctccgcccggcttcccgtccctgcttcaacccggtcggccgcgttctctcgcc agccttcttagctatcaggttcctacctccatcttcacacaaccgctcccaccgtctcag accctcatcgctcccaccgcccccactggactcggaactgccgtcacttccgcaaggacg tgtgttctctctgctgattggaaggagcgccgtggggctgcagagagtcaggcgctgggc tatcgccccctggagtttggagggataagggcatgtagaagtgctagaagcgtccctgga ggtgtggggaagtcatgtctcctcctgcagtttacagataagcggttccagcctgtccac gacctcacaataggtgtggagtttggagctcgtatggtcaacattgatggaaaacaaatc aaactgcaaatctgggatacggtgagagtacaaaataatttagtctctgaccaatatcca agtataacaagaaaagctgatgcaaaggaaaggggcgatgagggtggaactaatggtggg gataaagtaggggaaaggggagcagctggagcactgctggtgtacgacattacaaggcgt gaaaccttcaaccacctgacctcatggttagaggatgcccggcagcactctagttccaac atggttatcatgctcattgggaataagagtgacctagagtcccgcagggatgtgaagaga gaagaaggagaggcctttgctagggagcatggacttatattcatggaaacttcagccaaa acagcctgcaatgttgaagaggccttcattaacacagccaaagaaatatataggaagatc cagcagggtttatttgatgtccacaatgagttaagtgagagggtggtggtggttgttgcc cgtaaccacatacggcattactccagggtttcacgtcgtgactga