GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:52:15 Sequence gi568815581r:58205061_58427920 : 222860 bp : 46.77% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 PlyA - 833 828 6 1.05 1.15 Term - 1110 1019 92 1 2 81 36 66 0.626 -1.42 1.14 Intr - 1320 1223 98 2 2 92 77 35 0.863 2.45 1.13 Intr - 1487 1405 83 2 2 50 109 101 0.994 6.84 1.12 Intr - 2158 2025 134 2 2 93 57 133 0.987 11.06 1.11 Intr - 2941 2834 108 2 0 105 91 109 0.884 13.16 1.10 Intr - 3114 3045 70 0 1 92 105 55 0.988 6.35 1.09 Intr - 3556 3453 104 2 2 94 56 30 0.628 0.29 1.08 Intr - 5664 5599 66 1 0 70 99 103 0.805 8.48 1.07 Intr - 5962 5920 43 1 1 110 123 9 0.999 4.41 1.06 Intr - 7374 7318 57 0 0 75 101 103 0.992 9.38 1.05 Intr - 8030 7922 109 1 1 64 69 159 0.982 11.89 1.04 Intr - 8809 8705 105 0 0 107 94 123 0.701 14.13 1.03 Intr - 9327 9199 129 2 0 87 87 204 0.968 20.01 1.02 Intr - 14221 14091 131 1 2 74 23 151 0.242 6.69 1.01 Init - 23672 23664 9 2 0 80 115 11 0.032 2.87 1.00 Prom - 31426 31387 40 -3.56 2.00 Prom + 33605 33644 40 -5.46 2.01 Init + 37920 37995 76 2 1 63 105 136 0.998 12.05 2.02 Intr + 38934 39021 88 1 1 48 98 93 0.990 5.33 2.03 Intr + 42418 42578 161 1 2 91 92 165 0.887 16.83 2.04 Intr + 44000 44117 118 0 1 84 43 55 0.841 0.12 2.05 Intr + 44506 44635 130 1 1 55 81 243 0.871 21.00 2.06 Intr + 45355 45561 207 0 0 125 64 255 0.994 26.07 2.07 Intr + 47122 47446 325 0 1 101 109 438 0.995 42.55 2.08 Intr + 49751 49911 161 0 2 59 91 213 0.995 18.41 2.09 Intr + 59662 59914 253 0 1 108 68 234 0.967 20.41 2.10 Intr + 61093 61266 174 2 0 91 105 248 0.999 26.71 2.11 Intr + 62289 62595 307 1 1 62 40 184 0.984 6.51 2.12 Term + 62727 62934 208 0 1 126 50 232 0.991 20.01 2.13 PlyA + 63431 63436 6 -3.24 3.13 PlyA - 64823 64818 6 1.05 3.12 Term - 65803 65596 208 0 1 67 44 238 0.988 14.11 3.11 Intr - 66832 66595 238 2 1 95 85 369 0.992 33.97 3.10 Intr - 67858 67688 171 2 0 125 94 219 0.999 26.11 3.09 Intr - 68609 68354 256 2 1 22 121 508 0.996 44.52 3.08 Intr - 70642 70482 161 2 2 58 96 196 0.996 17.11 3.07 Intr - 73085 72767 319 2 1 93 76 770 0.996 71.83 3.06 Intr - 74154 73948 207 0 0 97 32 306 0.978 25.17 3.05 Intr - 74366 74237 130 1 1 105 93 269 0.989 29.80 3.04 Intr - 74586 74463 124 1 1 122 80 183 0.994 20.54 3.03 Intr - 74954 74779 176 1 2 78 44 243 0.503 18.48 3.02 Intr - 75399 75306 94 1 1 100 81 183 0.999 17.82 3.01 Init - 75620 75545 76 2 1 89 92 136 0.645 13.36 3.00 Prom - 75911 75872 40 -5.56 4.33 PlyA - 78130 78125 6 1.05 4.32 Term - 79196 79080 117 2 0 90 42 65 0.036 0.64 4.31 Intr - 97387 97274 114 1 0 111 70 11 0.254 2.24 4.30 Intr - 99800 99732 69 2 0 70 94 75 0.964 5.68 4.29 Intr - 100111 100001 111 1 0 111 82 26 0.877 4.88 4.28 Intr - 100398 100327 72 0 0 137 36 117 0.955 11.00 4.27 Intr - 100583 100480 104 0 2 81 78 106 0.998 8.79 4.26 Intr - 101353 101282 72 2 0 79 98 15 0.648 0.98 4.25 Intr - 101908 101740 169 1 1 96 80 174 0.992 16.92 4.24 Intr - 102702 102551 152 1 2 86 57 149 0.778 11.48 4.23 Intr - 102881 102782 100 2 1 107 71 6 0.760 0.48 4.22 Intr - 104320 103481 840 1 0 62 33 573 0.365 40.78 4.21 Intr - 105098 104907 192 2 0 101 92 314 0.998 32.79 4.20 Intr - 106153 106085 69 1 0 99 89 61 0.792 6.68 4.19 Intr - 106662 106511 152 1 2 105 75 206 0.999 20.88 4.18 Intr - 107662 106832 831 2 0 115 91 578 0.929 52.17 4.17 Intr - 111072 110963 110 2 2 100 97 211 0.999 23.13 4.16 Intr - 111518 111365 154 0 1 64 80 90 0.693 4.93 4.15 Intr - 112710 112578 133 0 1 51 71 52 0.669 0.02 4.14 Intr - 113392 113220 173 2 2 99 76 64 0.946 5.96 4.13 Intr - 114234 114030 205 0 1 20 116 135 0.978 8.27 4.12 Intr - 115521 115471 51 0 0 88 75 60 0.929 3.80 4.11 Intr - 117352 117248 105 1 0 45 75 150 0.953 9.81 4.10 Intr - 117716 117594 123 2 0 88 100 86 0.996 10.58 4.09 Intr - 117979 117890 90 1 0 72 81 66 0.917 4.49 4.08 Intr - 118321 118238 84 1 0 66 93 193 0.999 17.62 4.07 Intr - 118485 118408 78 0 0 86 119 85 0.918 11.15 4.06 Intr - 119597 119301 297 2 0 66 92 54 0.324 0.67 4.05 Intr - 119942 119751 192 2 0 84 83 245 0.834 23.29 4.04 Intr - 120653 120474 180 2 0 86 86 144 0.953 14.06 4.03 Intr - 121361 121233 129 2 0 53 65 235 0.972 18.59 4.02 Intr - 121730 121623 108 2 0 78 81 155 0.856 14.28 4.01 Init - 122860 122528 333 1 0 86 116 144 0.535 14.37 4.00 Prom - 125486 125447 40 -6.36 5.18 PlyA - 125845 125840 6 1.05 5.17 Term - 128113 128088 26 2 2 145 38 32 0.359 2.29 5.16 Intr - 132516 132415 102 1 0 29 48 105 0.083 0.75 5.15 Intr - 142181 142128 54 0 0 130 85 34 0.984 6.25 5.14 Intr - 142524 142469 56 2 2 147 89 41 0.999 8.72 5.13 Intr - 146448 146342 107 0 2 98 94 73 0.987 7.91 5.12 Intr - 147118 146986 133 0 1 40 3 167 0.009 4.05 5.11 Intr - 150007 149887 121 2 1 84 19 101 0.020 2.35 5.10 Intr - 153763 152408 1356 2 0 93 105 475 0.929 38.18 5.09 Intr - 155191 155089 103 1 1 84 91 61 0.756 5.75 5.08 Intr - 155884 155723 162 1 0 98 110 21 0.703 5.47 5.07 Intr - 157588 157484 105 1 0 81 105 143 0.617 15.71 5.06 Intr - 158346 158215 132 0 0 75 94 178 0.991 17.94 5.05 Intr - 158540 158439 102 2 0 131 81 105 0.498 14.47 5.04 Intr - 165973 165851 123 1 0 112 69 176 0.996 18.88 5.03 Intr - 170918 170885 34 1 1 114 70 5 0.031 -0.47 5.02 Intr - 197790 197748 43 1 1 70 99 8 0.015 -2.60 5.01 Init - 210517 210319 199 1 1 64 93 234 0.953 18.68 5.00 Prom - 213050 213011 40 -4.96 6.03 PlyA - 213444 213439 6 1.05 6.02 Term - 217370 217300 71 0 2 60 55 131 0.971 5.10 6.01 Init - 220169 220121 49 2 1 86 80 40 0.828 2.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 141253 141186 68 2 2 97 43 66 0.897 1.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:58205061_58427920|GENSCAN_predicted_peptide_1|445_aa MDTENRAARFPAQSRRVAAVMAETVWSTDTGEAVYRSRDPVRNLRLREGGILKSRIVTWE PSEEFVRNNHVINTPLQTMHIMADLGPYKKLGYKKYEHVLCTLKVDSNGVITVKPDFTGL KGPYRIETEGEKQELWKYTIDNVSPHAQPEEEERERRVFKDLYGRHKEYLSSLVGTDFEM TVPGALRLFVNGEVVSAQGYEYDNLYVHFFVELPTARELTIFGLWLSDWSSPAFQQLSGV TQTCTTKSLAMDKVAHFSYPFTFEAFFLHEDESSDALPEWPVLYCEVLSLDFWQRYRVEG YGAVVLPATPGSHTLTVSTWRPVELGTVAELRRFFIGGSLELEDLSYVRIPGSFKGERLS RFGLRTETTGTVTFRLHCLQQSRAFMESSSLQKRMRSVLDRLEGFSQQSSIHNVLEAFRR ARRRMQEARESLPQDLVSPSGTLVS >gi568815581r:58205061_58427920|GENSCAN_predicted_CDS_1|1338_bp atggacacggaaaacagagcggcgcgctttccggcgcagtcgcggcgcgtcgcagctgtc atggcggagaccgtctggagcactgacaccggggaggcagtgtatcgctcccgggacccc gtgcgcaacttgcgcctccgggagggcggcatcctcaagtcacgcatcgtcacctgggag ccctcagaagagtttgtcaggaacaaccacgtcattaacacccctcttcagacaatgcac atcatggcagacctggggccctataaaaagcttggctataagaagtatgaacatgtcctg tgtactctgaaggtggatagcaatggtgtgatcacagtaaagcctgacttcacgggcctc aaaggaccctacaggattgagacggagggggagaagcaggagctgtggaaatatacgatc gacaatgtttccccccacgcacagccggaggaggaggagcgggaacggcgagtgttcaag gatctttatggccggcacaaggagtatctcagcagcctcgtaggcaccgactttgagatg actgtcccaggtgccctccggctctttgtaaatggagaggtcgtttcagcccaaggctat gagtatgacaatctctacgtccacttctttgtagaattgccaactgctcgcgagcttact atatttggtctttggctttcagactggtcaagcccagcattccagcagctctcaggagta acacagacctgcaccaccaagtccctggcaatggacaaggtggctcacttctcctaccca ttcacgtttgaagccttcttcctccatgaggatgaatcttctgatgcactcccggagtgg cctgtgctctactgtgaggtcctctcgctggacttctggcagaggtaccgtgtggaaggc tatggggctgtggtgctgcctgccactccaggctcacacaccctgacagtctccacgtgg agacctgtggagcttggcacggtggctgagctgaggaggtttttcattggcggttctctg gaactggaggacctctcctatgtacggataccaggatccttcaagggggaacgcctgagc cgctttggactccgcacagagaccacaggcactgtcaccttccgcttgcactgtctgcag cagtccagggccttcatggaatcgagctcccttcagaaaaggatgcggagtgtgttggac cgtctggaagggttcagccagcagagttccattcacaatgtgctagaggccttccgtcga gcccggcgccgcatgcaggaggcccgggaaagcctcccgcaggacctagtgagcccctct ggaaccctggtctcctag >gi568815581r:58205061_58427920|GENSCAN_predicted_peptide_2|735_aa MRVLLHLPALLASLILLQAAASTTRAQTTRTSAISDTVSQAKVQVNKAFLDSRTRLKTAM SSETPTSRQLSEYLKHAKGRTRTAIRNGQVWEESLKRLRQKASLTNVTDPSLDLTSLSLE VGCGAPAPVVRCDPCSPYRTITGDCNNRRKPALGAANRALARWLPAEYEDGLSLPFGWTP GKTRNGFPLPLAREVSNKIVGYLNEEGVLDQNRSLLFMQWGQIVDHDLDFAPDTELGSSE YSKAQCDEYCIQGDNCFPIMFPPNDPKAGTQGKCMPFFRAGFVCPTPPYKSLAREQINAL TSFLDASFVYSSEPSLASRLRNLSSPLGLMAVNQEVSDHGLPYLPYDSKKPSPCEFINTT ARVPCFLAGDSRASEHILLATSHTLFLREHNRLARELKRLNPQWDGEKLYQEARKILGAF VQIITFRDYLPILLGDHMQKWIPPYQGYSESVDPRISNVFTFAFRFGHLEVPSSMFRLDE NYQPWGPEPELPLHTLFFNTWRMVKDGGIDPLVRGLLAKKSKLMKQNKMMTGELRNKLFQ PTHRIHGFDLAAINTQRCRDHGQPGYNSWRAFCDLSQPQTLEELNTVLKSKMLAKKLLGL YGTPDNIDIWIGAIAEPLVERGRVGPLLACLLGKQFQQIRDGDRQVRPQPGREGQGPSPR GVSQGPAFWWENPGVFTNEQKDSLQKMSFSRLVCDNTRITKVPRDPFWANSYPYDFVDCS AIDKLDLSPWASVKN >gi568815581r:58205061_58427920|GENSCAN_predicted_CDS_2|2208_bp atgagggtccttctccatctcccagccctcctggcttccctcatcttgcttcaggctgca gcatctaccacaagagcgcagactaccagaacctctgccatctccgatactgtgagtcag gccaaggtccaagtcaacaaggccttcctggactcccgaaccaggctgaagaccgccatg agctctgagactcccaccagccgacagctctcagaatacctcaagcatgccaaaggccgg acgcgcacagccatccgcaatggacaggtgtgggaggagtctttaaagagactgaggcag aaggcatccttgaccaatgtcacagatcccagcctggacttgacttcactgtctctggag gtgggctgtggtgctcctgctcccgtggtgagatgcgacccgtgcagcccttaccgcacc attacgggagactgcaataacaggaggaagcctgcgctgggcgccgccaacagggctctg gcgcgctggctgcccgcggagtacgaggacgggctctccctgcccttcggctggacgccg gggaagacgcgcaacggcttccctctcccgctggcccgggaggtatctaacaagattgtt ggctatctgaatgaggagggtgttctggaccaaaacaggtccctgctcttcatgcagtgg ggtcagattgtggatcatgacctggactttgcccctgacaccgagctggggagtagcgag tactccaaagcccagtgtgatgagtactgtatccagggagacaactgcttccccatcatg ttcccacccaatgaccccaaggcggggactcaagggaaatgcatgcctttcttccgagct gggttcgtctgccccactccaccctacaagtccctggcccgagagcagatcaacgctctg acctccttcctggatgccagctttgtgtacagctccgagccaagcctggccagccgcctc cgcaacctcagcagccccctgggcctcatggctgtcaaccaggaggtctcagaccatgga ctaccctacctgccctatgacagcaagaagccaagcccctgtgagttcatcaacaccact gcccgtgtgccctgcttcctggcaggagattctcgagcctcagagcatattctgctggcc acatcccacaccctctttctccgcgagcataaccggctggccagagaactaaagagactc aaccctcagtgggatggagagaagctctaccaggaagcccggaaaatcctgggagccttc gtgcagattatcacctttagggactacctacccattttgctaggtgaccacatgcagaag tggatacccccatatcaaggctacagtgaatctgtggatcccagaatttccaatgtcttc accttcgccttccgctttggccacttggaggtcccctctagtatgttccgcctggatgag aattatcagccatgggggccagaaccagaactccccctccacaccctcttcttcaacact tggaggatggtcaaagatggtggaattgatcctctggtgcggggcctgctggccaagaaa tccaagctgatgaaacagaataaaatgatgactggagagctgcgcaacaagcttttccag ccaactcacaggatccatggctttgacctggctgccatcaacacacagcgttgccgggac catgggcaacctgggtacaattcctggagagccttctgtgacctctcacagccgcagaca ctagaggagttgaacacagtgctgaagagcaagatgctggccaagaagttactgggtctc tacgggacccctgacaacatcgacatctggataggggccattgctgagccgctggtggaa aggggtcgggtggggcctctcctggcctgcctcttgggcaagcagttccagcagatccgt gatggagacaggcaagtgcgtcctcagccagggagggaagggcagggcccttctccaaga ggggtgtcccaaggtcctgcgttctggtgggaaaaccctggggtcttcacgaacgagcag aaggactctctacagaaaatgtccttctcacgccttgtctgtgacaacacccgcatcacc aaggtcccacgggacccattctgggccaacagctacccctatgacttcgtggattgctca gccatcgacaagctggacctgtcaccctgggcctcagtgaagaattag >gi568815581r:58205061_58427920|GENSCAN_predicted_peptide_3|719_aa MKLLLALAGLLAILATPQPSEGAAPAVLGEVDTSLVLSSMEEAKQLVDKAYKERRESIKQ RLRSGSASPMELLSYFKQPVAATRTAVRAADYLHVALDLLERKLRSLWRRPFNVTDVLTP AQLNVLSKSSGCAYQDVGVTCPEQDKYRTITGMCNNRRSPTLGASNRAFVRWLPAEYEDG FSLPYGWTPGVKRNGFPVALARAVSNEIVRFPTDQLTPDQERSLMFMQWGQLLDHDLDFT PEPAARASFVTGVNCETSCVQQPPCFPLKIPPNDPRIKNQADCIPFFRSCPACPGSNITI RNQINALTSFVDASMVYGSEEPLARNLRNMSNQLGLLAVNQRFQDNGRALLPFDNLHDDP CLLTNRSARIPCFLAGDTRSSEMPELTSMHTLLLREHNRLATELKSLNPRWDGERLYQEA RKIVGAMVQIITYRDYLPLVLGPTAMRKYLPTYRSYNDSVDPRIANVFTNAFRYGHTLIQ PFMFRLDNRYQPMEPNPRVPLSRVFFASWRVVLEGGIDPILRGLMATPAKLNRQNQIAVD EIRERLFEQVMRIGLDLPALNMQRSRDHGLPGYNAWRRFCGLPQPETVGQLGTVLRNLKL ARKLMEQYGTPNNIDIWMGGVSEPLKRKGRVGPLLACIIGTQFRKLRDGDRFWWENEGVF SMQQRQALAQISLPRIICDNTGITTVSKNNIFMSNSYPRDFVNCSTLPALNLASWREAS >gi568815581r:58205061_58427920|GENSCAN_predicted_CDS_3|2160_bp atgaagctgcttctggccctagcagggctcctggccattctggccacgccccagccctct gaaggtgctgctccagctgtcctgggggaggtggacacctcgttggtgctgagctccatg gaggaggccaagcagctggtggacaaggcctacaaggagcggcgggaaagcatcaagcag cggcttcgcagcggctcagccagccccatggaactcctatcctacttcaagcagccggtg gcagccaccaggacggcggtgagggccgctgactacctgcacgtggctctagacctgctg gagaggaagctgcggtccctgtggcgaaggccattcaatgtcactgatgtgctgacgccc gcccagctgaatgtgttgtccaagtcaagcggctgcgcctaccaggacgtgggggtgact tgcccggagcaggacaaataccgcaccatcaccgggatgtgcaacaacagacgcagcccc acgctgggggcctccaaccgtgcctttgtgcgctggctgccggcggagtatgaggacggc ttctctcttccctacggctggacgcccggggtcaagcgcaacggcttcccggtggctctg gctcgcgcggtctccaacgagatcgtgcgcttccccactgatcagctgactccggaccag gagcgctcactcatgttcatgcaatggggccagctgttggaccacgacctcgacttcacc cctgagccggccgcccgggcctccttcgtcactggcgtcaactgcgagaccagctgcgtt cagcagccgccctgcttcccgctcaagatcccgcccaatgacccccgcatcaagaaccaa gccgactgcatcccgttcttccgctcctgcccggcttgccccgggagcaacatcaccatc cgcaaccagatcaacgcgctcacttccttcgtggacgccagcatggtgtacggcagcgag gagcccctggccaggaacctgcgcaacatgtccaaccagctggggctgctggccgtcaac cagcgcttccaagacaacggccgggccctgctgccctttgacaacctgcacgatgacccc tgtctcctcaccaaccgctcagcgcgcatcccctgcttcctggcaggggacacccgttcc agtgagatgcccgagctcacctccatgcacaccctcttacttcgggagcacaaccggctg gccacagagctcaagagcctgaaccctaggtgggatggggagaggctctaccaggaagcc cggaagatcgtgggggccatggtccagatcatcacttaccgggactacctgcccctggtg ctggggccaacggccatgaggaagtacctgcccacgtaccgttcctacaatgactcagtg gacccacgcatcgccaacgtcttcaccaatgccttccgctacggccacaccctcatccaa cccttcatgttccgcctggacaatcggtaccagcccatggaacccaacccccgtgtcccc ctcagcagggtcttttttgcctcctggagggtcgtgctggaaggtggcattgaccccatc ctccggggcctcatggccacccctgccaagctgaatcgtcagaaccaaattgcagtggat gagatccgggagcgattgtttgagcaggtcatgaggattgggctggacctgcctgctctg aacatgcagcgcagcagggaccacggcctcccaggatacaatgcctggaggcgcttctgt gggctcccgcagcctgaaactgtgggccagctgggcacggtgctgaggaacctgaaattg gcgaggaaactgatggagcagtatggcacgcccaacaacatcgacatctggatgggcggc gtgtccgagcctctgaagcgcaaaggccgcgtgggcccactcctcgcctgcatcatcggt acccagttcaggaagctccgggatggtgatcggttttggtgggagaacgagggtgtgttc agcatgcagcagcgacaggccctggcccagatctcattgccccggatcatctgcgacaac acaggcatcaccaccgtgtctaagaacaacatcttcatgtccaactcatatccccgggac tttgtcaactgcagtacacttcctgcattgaacctggcttcctggagggaagcctcctag >gi568815581r:58205061_58427920|GENSCAN_predicted_peptide_4|1902_aa MEQLTTLPRPGDPGAMEPWALPTWHSWTPGRGGEPSSAAPSIADTPPAALQLQELRSEES SKPKGDGSSRPVGGTDPEGAEACLPSLGQQASSSGPACQRPEDEEVEAFLKAKLNMSFGD RPNLELLRALGELRQRCAILKEENQMLRKSSFPETEEKVRRLKRKNAELAVIAKRLEERA RKLQETNLRVVSAPLPRPGTSLELCRKALARQRARDLSETASALLAKDKQIAALQRECRE LQARLTLVGKEGPQWLHVRDFDRLLRESQREVLRLQRQIALRNQRETLPLPPSWPPGPAL QARAGAPAPGAPGEGPPPSLPCSYLRRALPWGLQGSALHRPAPSGAWPAAPAALPVVRPG RKSLVVLAAARGPARAASSRRRRRAGTPPSAPAWLARPGPSYPGAAVRPPRLQATPQEDA DNLPVILGEPEKEQRVQQLESELSKKRKKCESLEQEARKKQRRCEELELQLRQAQNENAR LVEENSRLSGRATEKEQVEWENAELRGQLLGVTQERDSALRKSQGLQSKLESLEQVLKHM REVAQRRQQLEVEHEQARLSLREKQEEVRRLQQAQAEAQREHEGAVQLLEARVRELEEQC RSQTEQFSLLAQELQAFRLHPGPLDLLTSALDCGSLGDCPPPPCCCSIPQPCRGSGPKDL DLPPGSPGRCTPKSSEPAPATLTGVPRRTAKKAESLSNSSHSESIHNSPKSCPTPECGAH RMCAQHGAAVIITKLLGITHSSGRQLILLSTHKPQKVEEAGPEEKPALSVCPQVDTASEV EELEADSVSLLPAAPEGSRGGARIQVFLARYSYNPFEGPNENPEAELPLTAGEYIYIYGN MDEDGFFEGELMDGRRGLVPSNFVERVSDDDLLTSLPPELADLSHSSGPELSFLSVGGGG SSSGGQSSVGRSQPRPEEEDAGDELSLSPSPEGLGEPPAVPYPRRLVVLKQLAHSVVLAW EPPPEQVELHGFHICVNGELRQALGPGAPPKAVLENLDLWAGPLHISVQALTSRGSSDPL RCCLAVGARAGVVPSQLRVHRLTATSAEITWVPGNSNLAHAIYLNGEECPPASPSTYWAT FCHLRPGTPYQAQVEAQLPPQGPWEPGWERLEQRAATLQFTTLPAGPPDAPLDVQIEPGP SPGILIISWLPVTIDAAGTSNGVRVTGYAIYADGQKIMEVASPTAGSVLVELSQLQLLQK EDTAELGVHLVNSLVDHGRNSDLSDIQEEEEEEEEEEEEELGSRTCSFQKQVAGNSIREN GAKSQPDPFCETDSDEEILEQILELPLQQFCSKKLFSIPEEEEEEEEDEEEEKSGAGCSS RDPGPPEPALLGLGCDSGQPRRPGQCPLSPESSRAGDCLEDMPGLVGGSSRRRGGGSPEK PPSRRRPPDPREHCSRLLSNNGPQASGRLGPTRERGGLPVIEGPRTGLEASGRGRLGPSR RCSRGRALEPGLASCLSPKCLEISIEYDSEDEQEAGSGGISITSSCYPGDGEAWGTATVG RPRGPPKANSGPKPYPRLPAWEKGEPERRGRSATGRAKEPLSRATETGEARGQDGSGRRG PQKRGVRVLRPSTAELVPARSPSETLAYQHLPVRIFVALFDYDPVSMSPNPDAGEEELPF REGQILKVFGDKDADGFYQGEGGGRTGYIPCNMVAEVAVDSPAGRQQLLQRGYLSPDILL EGSGNGPFVYSTAHTTGPPPKPRRSKKGPPKLVPSADLKAPHSMVAAFDYNPQESSPNMD VEAELPFRAGDVITVFGGMDDDGFYYGELNGQRGLVPSNFLEGPGPEAGGLDREPRTPQA ESQRDDSCDPDPQAISPAANTWWLTQDWGCTTQGSPGPPGGPCTPSSGSAPRIERGEPQG RSEKRTVHTATRAGELSKVTWTMHFRFTDLMTPWGFGTIISQ >gi568815581r:58205061_58427920|GENSCAN_predicted_CDS_4|5709_bp atggagcaactgacaaccctcccacggcctggggaccctggagccatggagccatgggca ctgcccacctggcatagctggactccaggtcgagggggtgaacctagcagtgcagcccca agcatcgctgatactcctccggcagctctgcagcttcaagaactgaggtctgaggagagt tccaagcccaaaggagacgggagctccaggcccgtggggggaactgaccctgaaggagca gaggcttgtctgcccagcctgggccagcaagcatccagctctggacccgcctgccagagg ccagaggatgaggaagtggaggctttcctgaaggccaagctgaatatgagctttggggac aggcccaatctggagctgctgagggccctgggggagctgcggcagcgctgtgccatcctt aaggaggaaaaccagatgctgaggaagagcagcttccctgagacagaagagaaggtgcgg aggctgaagaggaagaacgccgagctggcggtcattgccaagcgcctggaggagagggcc cgaaagctgcaggaaacgaacctgagggtggtgagtgcccccttgccccggccggggacc agcttggagttgtgtcggaaggccctagcccgccagcgagcccgggacctcagtgagaca gccagtgcactgctggccaaggacaagcagattgctgccttgcagcgggagtgcagggag ctgcaggccaggctcactctggtgggcaaggagggtccccagtggctccacgtgcgggac ttcgatcggctgctgcgcgagtcccagcgggaggtgctgcggctgcagaggcagatcgcg ctgcgcaaccagcgggagacgctcccgctcccgccgtcctggcccccgggccctgctctc caggccagagcaggggcgcctgctcccggggccccgggagagggtccacctccaagcctc ccgtgctcgtacctgcgccgtgccctcccctgggggctccagggttccgccctccatagg cctgcgcccagcggggcctggccggccgcccctgccgcgctgccagtggtacggcccggc cgcaaatccctggttgtcctggcagccgcccggggcccagcccgcgccgcctcctcccgc cgccgccgtagagcggggacgcccccgtcagcgcccgcctggctggccaggcctggcccc agctacccgggggcggcggtgcgtccgccccggctccaggccacgccccaggaggatgcg gacaacctacccgtgattctaggggagccagagaaagagcagagggtgcagcagctggaa tcggagctcagcaagaagcggaagaaatgcgagagcctggagcaggaagcccggaaaaag cagaggcgatgtgaggagctggaactgcagctgagacaagcgcagaatgagaatgcccgc ctggtggaggagaactcccggctcagtgggagagccacagagaaggagcaggtggagtgg gagaatgcggagctgaggggccagctcctgggggtgacacaggagagggactcagccctt cgcaagagccagggcctgcagagcaagctggagagcctggagcaagtgctgaagcacatg cgggaggtggcccagcggcggcagcagctggaggtggagcatgaacaggctcggctcagc ctacgggagaagcaggaggaggtccggagactgcagcaggcccaggctgaagcccagagg gaacatgaaggagccgtgcagctgctggaggcccgggttcgagagctcgaagaacagtgc cgcagccaaaccgagcagttcagcctcctggcacaggaactccaggctttccgcctgcac ccgggccccttggatctgctcacatctgccctggactgtgggagccttggagactgccca ccacccccctgctgctgctccattccccagccttgccgggggtctggccccaaagacctt gacctcccgccgggctcccctgggcgctgcaccccaaagtcttccgagcctgcccctgcc actctcactggggtccctcgaaggacagccaagaaggcagagtctctctccaactcctcc cactccgagtccatccacaacagccccaagtcatgccctacacctgagtgtggggcacac aggatgtgtgctcaacacggggcagctgttattattactaagctcctgggcatcactcat agctctggtcgtcaactcatccttctttctacccacaaaccccagaaggtggaggaggca ggccctgaagagaaaccagccctttctgtttgcccccaggtggacacagccagtgaggta gaggagctggaggcagacagtgtctccctgctcccagctgcgccagagggcagccgggga ggagccaggatccaggtcttcctagcacgttatagctacaacccctttgagggtcccaat gagaatccagaagcagagcttccgctgacagctggcgagtacatctacatctatggcaac atggatgaggatggcttttttgaaggagagctcatggatggccgaaggggcctggtccct tccaattttgtagagcgtgtgtcggatgatgacctcctgacctccctccctccagagctg gccgatttgtcccacagctcaggccctgaactcagtttcctgagtgtaggtgggggtggc agcagtagcgggggccaaagcagtgtgggaaggagccagcccagacctgaggaggaggat gcaggggacgagctcagtctgagcccatcaccggagggcctgggcgagcctcctgccgtg ccttacccccgccgtctggtggtcctcaagcagctggcccacagcgtggtgctggcctgg gagccgcctcctgagcaagtggagctacacggcttccatatctgtgtgaatggggagctg cgacaggccctggggcctggggcgccacccaaggccgtgctggagaacctggacctgtgg gccgggccccttcacatttctgtccaggccctgactagccggggcagctctgacccactg cgctgttgcttggcggtgggtgcccgggccggagtggtgcccagccagctgcgggtccat cggttgacagccacatctgctgagatcacctgggtgcccggcaatagcaacttggcccat gccatctacctcaatggggaagagtgcccacctgccagccccagtacctactgggccacc ttctgccacttacggcctggcacaccctatcaggcccaagtggaggctcagctcccaccc caagggccctgggaaccaggctgggagaggctggagcagcgggctgccaccctgcagttc accacactcccagcaggcccacctgatgcccctctggatgtgcagatcgagcctgggccc tcccctgggatcttgatcatcagttggctcccagtcaccatcgatgctgctggcacatcc aacggtgtccgggtcacaggctatgccatctacgctgatgggcagaagatcatggaggtg gcctcacccacggcaggcagtgtactggtggagttgtcccagctgcagctgctgcagaag gaggacacagcagagcttggggttcatctggtgaactccctcgtggaccacggccgcaac tcagacctgtcagacatccaggaggaagaggaagaggaggaggaggaggaggaagaggag ctgggttccaggacttgctccttccagaagcaggttgctggcaacagcatcagggagaat ggggccaagtcccagcccgaccccttttgtgagactgacagcgatgaggagatcttggag cagatcctggagctgcccctccagcagttctgtagcaagaagctctttagcatcccggag gaggaggaagaggaagaggaggacgaggaggaggagaagtcaggggcaggctgttcttcc cgagaccctggcccgcctgaacctgcattgctggggctgggctgtgacagtggtcagccc cgaagacctggccagtgtcccttgtctcctgagtcctccagggctggagactgcctggaa gacatgcctggattagttggtggaagcagccggaggagaggagggggctcccctgagaag cccccaagccgcaggcggcctccagatccccgagaacactgcagccgacttctcagcaac aatgggccccaggcctctggacgactgggccccacacgggagaggggtggcctccccgta attgagggccccaggactggactagaggctagcgggagaggccggctgggcccttcccgg aggtgctcccgtggccgggcgctggagcctggcctggccagctgcctttcccccaagtgc ttggaaatcagcattgaatatgattcggaggatgagcaggaggcgggcagcgggggcatc agcatcaccagctcctgctaccctggagatggggaggcctggggcacagcaactgtagga aggcccagggggcctccgaaggccaattcaggccccaagccctacccacgcctcccagcc tgggagaaaggggaaccagagcggagaggccgcagtgcgacgggcagagccaaggagcca ctctcccgggcaacagagaccggagaggccagagggcaggacggctctgggcggaggggc ccccagaagagaggtgtccgagtcctcaggccaagcactgcagagctagtccctgcgagg agcccctcagaaacactggcttaccagcacctacccgtcaggatctttgtggctctgttt gactatgaccccgtgtcaatgtcgcccaatcctgatgctggagaagaagagcttcccttc cgagagggtcagatcctgaaggtgtttggggacaaggatgccgatggcttctaccagggc gaaggtgggggccggacaggctacattccctgcaacatggtggctgaggtggctgtggac agccctgctgggagacagcaactgctccagcggggttatttgtccccagatattctcctt gagggctcagggaatggtccgtttgtgtactccacagcccacacaactgggcctcctccc aagccccgccgctccaagaaaggcccccctaagctggtcccctctgctgacctgaaagct ccccactccatggtggctgcatttgactacaacccccaggagagttcccccaatatggac gtggaggcagagctgcccttccgggcaggggatgtcattactgtgtttgggggcatggac gatgacggtttctactatggggaattaaatggacaaaggggcctggttccatccaacttc ctggagggccctgggcctgaggcaggcggcctggacagggaacccaggacaccccaggct gagagtcagagggatgatagctgtgaccctgaccctcaggccatctcgccagctgccaac acctggtggctgacgcaggactggggctgcaccacacaagggtccccagggcccccaggt gggccttgtacccccagctctggcagcgcccccaggattgaacgtggggagccccagggc agaagcgagaagcgcactgtccacactgcaaccagagcaggtgagctctccaaagtaact tggaccatgcacttccgcttcacagacctcatgactccctggggcttcgggacaataatt tctcagtag >gi568815581r:58205061_58427920|GENSCAN_predicted_peptide_5|985_aa MSGGHQLQLAALWPWLLMATLQAGFGRTGLVLAAAVESERSAEQKAIIRVIPLKMDPTGK LNLTLEGKPQHPEGLNSGLGRPFCLQDTHTSSSHPLYLCNASDDDNLEPGFISIVKLESP RRAPRPCLSLASKARMAGERGASAVLFDITEDRAAAEQVPRDICVFKLQQPLGLTWPVVL IWGNDAEKLMEFVYKNQKAHVRIELKEPPAWPDYDVWILMTVVGTIFVIILASVLRIRCR PRHSRPDPLQQRTAWAISQLATRRYQASCRQARGEWPDSGSSCSSAPVCAICLEEFSEGQ ELRVISCLHEFHRNCVDPWLHQHRTCPLCMFNITEGDSFSQSLGPSRSYQEPGRRLHLIR QHPGHAHYHLPAAYLLGPSRSAVARPPRPGPFLPSQEPGMGPRHHRFPRAAHPRAPGEQQ RLAGAQHPYAQGWGLSHLQSTSQHPAACPVPLRRARPPDSSGSGESYCTERSGYLADGPA SDSSSGPCHGSSSDSVVNCTDISLQGVHGSSSTFCSSLSSDFDPLVYCSPKGDPQRVDMQ PSVTSRPRSLDSVVPTGETQVSSHVHYHRHRHHHYKKRFQWHGRKPGPETGVPQSRPPIP RTQPQPEPPSPDQQVTRSNSAAPSGRLSNPQCPRALPEPAPGPVDASSICPSTSSLFNLQ KSSLSARHPQRKRRGGPSEPTPGSRPQDATVHPACQIFPHYTPSVAYPWSPEAHPLICGP PGLDKRLLPETPGPCYSNSQPVWLCLTPRQPLEPHPPGEGPSEWSSDTAEGRPCPYPHCQ VLSAQPATLCCRREKQVPGLTLNDLFPPRSLNTGSEEELEELCEQAVLWSSISLLFFPSA KMALETVPKDLRHLRACLLCSLVKVSVGDLVTIDQFEYDGCDNCDAYLQMKGNREMVYDC TSSSFDGIIAMMSPEDSWVSKWQRVSNFKPGVYAVSVTGRLPQERGGERDESKLVRRAVL TEQSLSNSHHLFQLVLGCPKESVNT >gi568815581r:58205061_58427920|GENSCAN_predicted_CDS_5|2958_bp atgagtggtggccaccagctgcagctggctgccctctggccctggctgctgatggctacc ctgcaggcaggctttggacgcacaggactggtactggcagcagcggtggagtctgaaaga tcagcagaacagaaagctattatcagagtgatccccttgaaaatggaccccacaggaaaa ctgaatctcactttggaaggaaaacctcagcaccctgagggcctgaactcaggtcttggc aggcccttctgccttcaggacactcacacttcctcatcccacccgctgtacctgtgcaat gccagtgatgacgacaatctggagcctggattcatcagcatcgtcaagctggagagtcct cgacgggccccccgcccctgcctgtcactggctagcaaggctcggatggcgggtgagcga ggagccagtgctgtcctctttgacatcactgaggatcgagctgctgctgagcaggtaccc agggacatttgcgtgttcaagctgcagcagccgctggggctgacctggccagtggtgttg atctggggtaatgacgctgagaagctgatggagtttgtgtacaagaaccaaaaggcccat gtgaggattgagctgaaggagcccccggcctggccagattatgatgtgtggatcctaatg acagtggtgggcaccatctttgtgatcatcctggcttcggtgctgcgcatccggtgccgc ccccgccacagcaggccggatccgcttcagcagagaacagcctgggccatcagccagctg gccaccaggaggtaccaggccagctgcaggcaggcccggggtgagtggccagactcaggg agcagctgcagctcagcccctgtgtgtgccatctgtctggaggagttctctgaggggcag gagctacgggtcatttcctgcctccatgagttccatcgtaactgtgtggacccctggtta catcagcatcggacttgccccctctgcatgttcaacatcacagagggagattcattttcc cagtccctgggaccctctcgatcttaccaagaaccaggtcgaagactccacctcattcgc cagcatcccggccatgcccactaccacctccctgctgcctacctgttgggcccttcccgg agtgcagtggctcggcccccacgacctggtcccttcctgccatcccaggagccaggcatg ggccctcggcatcaccgcttccccagagctgcacatccccgggctccaggagagcagcag cgcctggcaggagcccagcacccctatgcacaaggctggggactgagccacctccaatcc acctcacagcaccctgctgcttgcccagtgcccctacgccgggccaggccccctgacagc agtggatctggagaaagctattgcacagaacgcagtgggtacctggcagatgggccagcc agtgactccagctcagggccctgtcatggctcttccagtgactctgtggtcaactgcacg gacatcagcctacagggggtccatggcagcagttctactttctgcagctccctaagcagt gactttgaccccctagtgtactgcagccctaaaggggatccccagcgagtggacatgcag cctagtgtgacctctcggcctcgttccttggactcggtggtgcccacaggggaaacccag gtttccagccatgtccactaccaccgccaccggcaccaccactacaaaaagcggttccag tggcatggcaggaagcctggcccagaaaccggagtcccccagtccaggcctcctattcct cggacacagccccagccagagccaccttctcctgatcagcaagtcaccagatccaactca gcagccccttcggggcggctctctaacccacagtgccccagggccctccctgagccagcc cctggcccagttgacgcctccagcatctgccccagtaccagcagtctgttcaacttgcaa aaatccagcctctctgcccgacacccacagaggaaaaggcgggggggtccctccgagccc acccctggctctcggccccaggatgcaactgtgcacccagcttgccagatttttccccat tacacccccagtgtggcatatccttggtccccagaggcacaccccttgatctgtggacct ccaggcctggacaagaggctgctaccagaaaccccaggcccctgttactcaaattcacag ccagtgtggttgtgcctgactcctcgccagcccctggaaccacatccacctggggagggg ccttctgaatggagttctgacaccgcagagggcaggccatgcccttatccgcactgccag gtgctgtcggcccagcctgccacactctgctgcaggcgagagaagcaggtccctggcctg accctcaatgacctctttcctccccgctctctaaatacaggctcagaggaggaactcgag gagctgtgtgaacaggctgtgctgtggtcgtctatctccctgttgttcttcccatcggcg aagatggccctggagacggtgccgaaggacctgcggcatctgcgggcctgtttgctgtgt tcgctggtcaaggtgtcagtcggggacctggttactatagaccagtttgaatatgatggt tgtgacaattgtgatgcatatctacaaatgaagggtaaccgagagatggtatatgactgc actagctcttcctttgatggaatcattgcgatgatgagtccagaggacagctgggtctcc aagtggcagcgagtcagtaactttaagccaggtgtatatgcggtgtcagtcactggtcgc ctgccccaagaacgtggaggtgaacgggatgaatccaagctggttcgcagggcagtcctc actgagcagtctctttccaactctcaccaccttttccagctggtcctgggatgtcccaaa gagagcgtgaacacctag >gi568815581r:58205061_58427920|GENSCAN_predicted_peptide_6|39_aa MGFHHVGQAGLELLTLDLHLLVDVACKQERFPKEEELKE >gi568815581r:58205061_58427920|GENSCAN_predicted_CDS_6|120_bp atggggtttcaccatgttggtcaggctggtctcgaactcctgaccttggatcttcatctg ctggtggacgtggcctgcaagcaggagcgctttccaaaggaggaagaattaaaagagtga