GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:59:37 Sequence gi568815586r:1693658_2018473 : 324816 bp : 44.43% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 372 367 6 1.05 1.03 Term - 19704 18825 880 2 1 24 38 256 0.516 6.22 1.02 Intr - 20216 20103 114 1 0 71 68 93 0.531 5.16 1.01 Init - 21175 20628 548 1 2 41 50 316 0.412 17.48 1.00 Prom - 22660 22621 40 -2.16 2.03 PlyA - 23064 23059 6 1.05 2.02 Term - 41689 41237 453 1 0 15 48 301 0.867 13.96 2.01 Init - 42695 42516 180 2 0 79 41 155 0.798 9.09 2.00 Prom - 59485 59446 40 -3.56 3.02 PlyA - 59548 59543 6 1.05 3.01 Sngl - 63940 63647 294 1 0 72 55 284 0.876 19.20 3.00 Prom - 64297 64258 40 -6.56 4.00 Prom + 66067 66106 40 -3.36 4.01 Init + 77833 77838 6 0 0 74 96 10 0.656 0.71 4.02 Intr + 79185 79304 120 2 0 95 116 76 0.860 11.79 4.03 Intr + 82005 82023 19 2 1 112 78 -9 0.202 -3.32 4.04 Intr + 90223 90416 194 2 2 82 106 160 0.886 16.41 4.05 Term + 92287 92415 129 0 0 127 43 79 0.976 5.48 4.06 PlyA + 92589 92594 6 1.05 5.17 PlyA - 92732 92727 6 1.05 5.16 Term - 100102 99998 105 1 0 103 47 35 0.571 -0.69 5.15 Intr - 101833 101642 192 1 0 38 64 83 0.316 0.59 5.14 Intr - 102123 101982 142 2 1 97 40 96 0.608 6.06 5.13 Intr - 103878 103761 118 1 1 117 63 287 0.993 28.62 5.12 Intr - 104900 104833 68 1 2 61 89 12 0.728 -2.85 5.11 Intr - 105129 104965 165 2 0 68 38 121 0.437 4.18 5.10 Intr - 107987 107917 71 2 2 127 100 35 0.960 6.58 5.09 Intr - 116683 116621 63 1 0 112 105 131 0.989 16.11 5.08 Intr - 116930 116886 45 2 0 104 84 50 0.795 4.81 5.07 Intr - 118066 118005 62 2 2 108 96 21 0.818 3.35 5.06 Intr - 120180 119986 195 1 0 91 67 71 0.506 4.69 5.05 Intr - 122750 122723 28 2 1 107 77 13 0.193 -0.21 5.04 Intr - 127172 126979 194 0 2 99 28 144 0.362 8.71 5.03 Intr - 127516 127279 238 1 1 61 91 95 0.169 4.29 5.02 Intr - 132979 132893 87 1 0 110 49 19 0.098 0.37 5.01 Init - 134493 134437 57 0 0 103 53 77 0.183 5.07 5.00 Prom - 135237 135198 40 -5.46 6.00 Prom + 136260 136299 40 -12.87 6.01 Init + 136654 136657 4 0 1 91 33 0 0.083 -5.04 6.02 Intr + 137278 137868 591 2 0 50 83 732 0.797 61.47 6.03 Term + 140610 141064 455 1 2 91 53 682 0.961 60.32 6.04 PlyA + 141759 141764 6 -1.75 7.37 PlyA - 142362 142357 6 -4.33 7.36 Term - 142970 142879 92 0 2 97 47 82 0.661 2.88 7.35 Intr - 147162 147082 81 1 0 110 91 155 0.887 17.61 7.34 Intr - 150872 150745 128 1 2 70 92 261 0.998 25.02 7.33 Intr - 153032 152937 96 1 0 94 94 143 0.508 14.62 7.32 Intr - 160387 160294 94 2 1 55 41 166 0.216 7.62 7.31 Intr - 162452 162355 98 1 2 85 65 203 0.988 17.35 7.30 Intr - 162572 162527 46 0 1 68 119 25 0.923 1.07 7.29 Intr - 164987 164920 68 1 2 119 98 61 0.979 8.75 7.28 Intr - 166549 166488 62 1 2 99 94 112 0.999 10.43 7.27 Intr - 167498 167286 213 2 0 48 17 137 0.076 1.51 7.26 Intr - 171356 170981 376 1 1 5 48 569 0.129 39.62 7.25 Intr - 172168 171738 431 1 2 13 13 281 0.353 5.61 7.24 Intr - 175830 175683 148 2 1 55 90 89 0.161 6.04 7.23 Intr - 182597 182560 38 2 2 49 86 24 0.039 -4.64 7.22 Intr - 184732 184658 75 1 0 49 82 107 0.777 5.91 7.21 Intr - 185379 185299 81 0 0 117 105 90 0.992 13.43 7.20 Intr - 186224 186147 78 2 0 126 113 86 0.999 14.65 7.19 Intr - 189343 189210 134 2 2 97 105 336 0.999 36.56 7.18 Intr - 190664 190586 79 2 1 69 113 52 0.959 4.92 7.17 Intr - 191224 191111 114 1 0 83 55 249 0.937 21.74 7.16 Intr - 191419 191330 90 1 0 61 49 159 0.858 9.49 7.15 Intr - 192382 192308 75 1 0 57 105 113 0.998 9.61 7.14 Intr - 192716 192566 151 1 1 108 41 167 0.655 14.16 7.13 Intr - 193412 193352 61 0 1 48 99 73 0.384 2.19 7.12 Intr - 213914 213783 132 0 0 55 110 133 0.938 12.82 7.11 Intr - 214531 214218 314 0 2 25 63 470 0.965 34.03 7.10 Intr - 215265 214934 332 0 2 -27 64 445 0.699 25.03 7.09 Intr - 216308 216249 60 2 0 89 89 122 0.754 11.33 7.08 Intr - 219482 219366 117 2 0 55 78 216 0.992 17.96 7.07 Intr - 221278 221197 82 0 1 59 86 80 0.289 4.54 7.06 Intr - 224772 224590 183 2 0 109 85 90 0.297 9.70 7.05 Intr - 231739 231715 25 2 1 95 109 22 0.240 2.08 7.04 Intr - 235059 234948 112 0 1 56 52 110 0.114 4.15 7.03 Intr - 242296 242126 171 1 0 90 89 7 0.599 1.14 7.02 Intr - 243370 243227 144 1 0 85 100 27 0.800 4.08 7.01 Init - 247969 247481 489 1 0 87 95 332 0.566 29.00 7.00 Prom - 248144 248105 40 -9.16 8.00 Prom + 248589 248628 40 -4.96 8.01 Init + 249858 250113 256 2 1 43 40 223 0.247 10.39 8.02 Term + 250353 251332 980 1 2 47 47 272 0.459 11.23 8.03 PlyA + 251431 251436 6 -0.45 9.10 PlyA - 251936 251931 6 1.05 9.09 Term - 252629 252549 81 2 0 70 48 82 0.929 0.09 9.08 Intr - 255506 255429 78 2 0 83 115 95 0.717 11.45 9.07 Intr - 259631 258759 873 2 0 85 99 494 0.982 41.73 9.06 Intr - 261903 261775 129 0 0 64 96 71 0.968 6.39 9.05 Intr - 272036 271901 136 1 1 76 113 159 0.998 17.77 9.04 Intr - 274253 274187 67 0 1 95 106 45 0.607 4.96 9.03 Intr - 299734 299607 128 0 2 92 76 24 0.199 1.92 9.02 Intr - 304318 304278 41 1 2 72 116 3 0.205 -1.48 9.01 Init - 310774 310625 150 1 0 86 65 398 0.939 35.34 9.00 Prom - 315622 315583 40 -2.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 166637 166581 57 2 0 85 40 103 0.865 4.51 S.002 Term - 171356 170768 589 1 1 5 36 586 0.844 39.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_1|513_aa MLLLDQTLAFNEKNAALVAAQEFGDTWHLSHVNDRMTVEERDKFPTGQQAILSVDPHRVL DSDHGDWSREHLLTCVLEGLSRIRKKPMNYSMTSTITQGKEENPTAFLERLQEALRKYTS LLPDSLEGQLILKDKFITQSAADIRIKLQKLALAPEQNLDALLNLATSVFYSRDREEQAE KEKRLSSRSVTIRGSVLISCPGQLSSRSITIRGILGQSVTRSTKTSQWRLVQDLRLINEA VIPLYPVVPKPYTLFSQIPKEAESFTVLDLKDAFFCIPLHSDSQFLFAFEDPTDHTSQLT WTVLPQRFRDSPHLFGQALVQDLGHFSSPGTLVLQFVDDLLLAASSEASCQQATRDLLNF LANQGYKASKSKAQLCLQQVKYLGLILARGTRALSKKRIQPILAYPRPKTLKQLRGFLGI TSFCRLQIPGYSKMARPLYTLIKETQRANTHLVEWEPEAETAFKSLKQALVQAPALSLPT RQNFSLYVRERAGIALEVLTQTHGTTLQPVAKG >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_1|1542_bp atgctattgttagatcaaaccctggcctttaatgaaaagaatgcagctttagttgcagcc caagagtttggagatacctggcatctcagtcacgtaaatgacagaatgacagtcgaagaa agggacaaattccctactggtcagcaagccattctcagtgtggatccccaccgggtcctc gactcagatcatggggactggagtcgtgaacatctgttaacctgtgtcctagaaggacta agcagaattaggaaaaagcccatgaattattcaatgacatccaccataactcagggaaag gaagaaaatcctaccgccttcctcgagcggctacaggaggccttaagaaaatatacttcc ctgttacccgactcactcgagggtcaattgatcctaaaagataagtttattacccagtca gctgcagatatcaggataaagctccaaaagctagccctggcccctgaacaaaatttggac gcattattaaacctggcaacctccgtgttctacagtagggatcgagaagaacaggctgaa aaggaaaaacggctgtcctcaaggtccgttaccatccgaggctcagtgttaatctcctgt cccggacagctgtcctcaaggtccattaccatccgaggaatcctgggacagtctgtaacc aggagtacaaaaaccagtcagtggagactagtgcaagatcttagactcatcaatgaagca gtaattcctctatatccagttgtacccaagccctataccctgttctctcaaataccaaag gaagcagaatcgttcacagttctggacctcaaggatgccttcttctgtattcccctgcac tctgactcccagtttctctttgcctttgaggatcctacagatcacacgtcccaacttacg tggacagtcttgccccaaaggttcagggatagccctcacctgtttggtcaggcactggtc caagatctaggccacttctcaagtccaggcactctggtccttcagtttgtggatgattta cttttggctgccagttcggaagcctcatgccagcaagctactcgagatctcttgaacttt ctagctaatcaagggtacaaggcatctaaatcaaaggcccagctctgcctacaacaagtc aaatatctaggcctaatcttagccagaggaaccagggccctcagcaagaaacgaatacag cctatactggcttatcctcgccctaagacattaaaacagttgcgggggttccttggaatc accagcttttgccgactacagatccctggatacagcaagatggccaggccactctatact ctaataaaggagacccagagggcaaatactcatctagtagaatgggaaccagaggcagaa acagccttcaaatccttaaagcaggccctagtacaagctccagccttaagccttcccaca agacaaaacttctctttatacgtcagagagagagcaggaatagctcttgaagtccttact cagactcacgggacaaccctacaaccagtggcaaaaggctag >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_2|210_aa MDKFLDTYTLPRLNQEEVESLNKPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEEL RIKYLGIQLTRDVKDLFKENYKPLLNEVKEDTNKWKNIPCSWVGRISIVKMAVLPKVIYR FNAIPIKLPMTFFTEFEKTTLKFIWKQKRAHIAKSILSQKNKAGGITLPDFKLYYKATVT KTAWYWYQNRDIDQNRALRNSAAYLQLSDL >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_2|633_bp atggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaataaaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggagctg agaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggagaac tacaaaccgctgctcaatgaagtaaaagaggacacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcagtatcgtgaaaatggccgtactgcccaaggtaatttataga ttcaatgccatccccatcaagctaccaatgactttcttcacagaattcgaaaaaactact ttaaagttcatatggaagcaaaaaagagcccacattgccaagtcaatcctaagccaaaag aacaaagctggaggcatcacgctacctgacttcaaactgtactacaaggctacagtaacc aaaacagcatggtactggtaccaaaacagagatatagaccagaacagagccctcagaaat agtgccgcatatctacaactatctgatctttga >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_3|97_aa MPGTIRYPDSLIKVNDTIQTDLEIGKITNFIKFDNGNLCMVTVGANLGRTGVITNRERHS GSFDVVHMKDANGNSFATQLSNIFVIGKGNKPQTSLP >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_3|294_bp atgcccggcaccatccgctaccctgattccctcatcaaggtgaatgataccattcagaca gatttggagattggcaagattaccaatttcatcaagtttgacaatggtaacctatgcatg gtgactgtaggtgctaacctgggaagaactggtgtgataaccaacagagagaggcactct ggatcttttgatgtggttcacatgaaagatgccaatggaaacagctttgccactcaactt tccaacatttttgttattggcaagggcaacaaaccacagacttctcttccctga >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_4|155_aa MKSSEEHEYSDEAPQEDEGFMGMSPLLQAHHAMEKMEEFVCKKIKFPQRVFLGLGLSGII PTLHYVISEGFLKAATIGQIGWLMLMASLYITGAALYAARIPERFFPGKCDIWFHSHQLF HIFVVAGAFVHFHGVSNLQEFRFMIGGGCSEEDAL >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_4|468_bp atgaagagctctgaggaacatgaatacagtgatgaagctcctcaggaagatgagggcttt atgggcatgtcccctctcttacaagcccatcatgctatggaaaaaatggaagaatttgtt tgtaagaaaataaaattcccccaaagagtgtttttgggcctaggcctgagtggaatcatt cctaccttgcactatgtcatctcggaggggttccttaaggccgccaccatagggcagata ggctggttgatgctgatggccagcctctacatcacaggagctgccctgtatgctgcccgg atccccgaacgctttttccctggcaaatgtgacatctggtttcactctcatcagctgttt catatctttgtggttgctggagcttttgttcacttccatggtgtctcaaacctccaggag tttcgtttcatgatcggcgggggctgcagtgaagaggatgcactgtga >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_5|609_aa MVRPTALRALSLWGERSPWRGCQSAEGPAGVGRGVTRLGTDVGLQAAQSRAQRQHKALGC FSHAAGTGFYATRDCHLAPALPSSPLPSRIHSSCPQHCGLRVPSVVHLWDAQVPEPEQLL RPQQLPAATRYSPVFTLTANAFNQVALNPPVRPSWDSYRNPDTTGRQSAVGTGARVSTQR EQRLARSTQHQEVMTGPPGDEALIFVMSQQMGLPVPFQPTPFSYLELKATKTPPKHSNVP ISQDLFKKASQPEHHPRIAMRLQGRGAAGVQMKLEFLQRKFWAATRQCSTVDGPCTQSCE DSDLDCFVIDNNGFILISKRSRETGRFLGEVDGAVLTQLLSMGVFSHSCILQKLKMIPLN LPAQNFSSLGYLSPGAPPPGLAYTDVSVPSEGALDLESQAGAQASVPSPGDKTHSPGRCG LAPLAHKHKKQDPLQPCDTEYPVFVYQPAIREANGIVECGPCQKVFVVQQIPNSNLLLLV TDPTCDCSIFPPVLQEATEVKYILQCSGMCALLAGQLVAPEQVLPDLFPVCRMGSGPEIL TLTVASAHNASVKCDRMRSQKLRRRPDSCHAFHPEENAQDCGGASDTSASPPLLLLPVCA WGLLPQLLR >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_5|1830_bp atggtgaggcccacggctctccgggccctgtcgctctggggagagaggagcccctggcga gggtgccagtctgccgaaggccccgcaggcgttggcagaggggtcacaagactgggcaca gatgtgggattgcaggcagcccagagccgggcacaacgacagcacaaggctcttggctgc ttctcccatgctgcagggaccggtttctatgccacccgggactgtcatctggcgccagcg ctgccctcctccccacttcccagcagaatccacagctcttgtccccagcactgtgggctc agagttcccagtgtcgtgcacctttgggatgctcaggtgcctgagcccgagcagcttctt cggccccagcaattgcctgcagccactcggtactcacccgtctttactctgactgcaaac gccttcaatcaggtggctctcaaccctccagtaagaccctcctgggattcctatagaaat ccagacaccaccggcaggcagagcgcagtgggcactggtgccagagtctccactcagcgg gagcagcggctggcacgcagcacacagcaccaggaggtcatgacaggccctcctggggac gaagctctcatctttgtcatgagccagcaaatgggtctccccgtccccttccagcccact cctttcagctacctggaactcaaagcaacaaagacacccccaaagcactcaaatgttccc atatcacaggacctgttcaagaaggccagccagccggagcaccaccccaggattgccatg agactccaagggaggggagccgcgggcgtccaaatgaagctggaattcctccagcgcaaa ttctgggcggcaacgcggcagtgcagcactgtggatgggccgtgcacacagagctgcgag gacagtgatctggactgcttcgtcatcgacaacaacgggttcattctgatctccaagagg tcccgagagacgggaagatttctgggggaggtggatggtgctgtcctgacccagctgctc agcatgggggtgttcagccacagctgcatcttgcaaaagttgaagatgatccccctcaat ctgcctgctcagaatttctcttctctggggtacctgtcccctggagctcctccccctggt ctggcctacacagatgtcagtgtccccagtgaaggcgcactggacttggagtcccaggcc ggggcgcaggcttctgtgcccagccctggggacaaaactcacagccctggccgctgtggg cttgctcctctcgcccacaaacacaagaagcaggacccgctgcagccctgcgacacggag taccccgtgttcgtgtaccagccggccatccgggaggccaacgggatcgtggagtgcggg ccctgccagaaggtatttgtggtgcagcagattcccaacagtaacctcctcctcctggtg acagaccccacctgtgactgcagcatcttcccaccagtgctgcaggaggcgacagaagtc aaatatatcctgcaatgctcggggatgtgtgcgctgcttgcagggcaactggtggcacca gagcaggttcttccagacctttttccagtctgcagaatgggctccggtcctgagatattg accttaacagtggcttctgcacataatgcctctgtcaaatgtgaccggatgcgctcccag aagctccgccggcgaccagactcctgccacgccttccatccagaggagaatgcccaggac tgcggcggcgcctcggacacctcagcctcgccgcccctactcctgctgcctgtgtgtgcc tgggggctactgccccaactcctgcggtga >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_6|349_aa MGITCWIALYAVEALPTCPFSCKCDSRSLEVDCSGLGLTTVPPDVPAATRTLLLLNNKLS ALPSWAFANLSSLQRLDLSNNFLDRLPRSIFGDLTNLTELQLRNNSIRTLDRDLLRHSPL LRHLDLSINGLAQLPPGLFDGLLALRSLSLRSNRLQNLDRLTFEPLANLQLLQVGDNPWE CDCNLREFKHWMEWFSYRGGRLDQLACTLPKELRGKDMRMVPMEMFNYCSQLEDENSSAG LDIPGPPCTKASPEPAKPKPGAEPEPEPSTACPQKQRHRPASVRRAMGTVIIAGVVCGVV CIMMVVAAAYGCIYASLMAKYHRELKKRQPLMGDPEGEHEDQKQISSVA >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_6|1050_bp atggggatcacctgctggatcgccctgtatgctgtggaggccctccccacctgccctttc tcctgcaagtgtgacagccgcagcctggaggtggactgcagtggccttggcctcaccacg gtgcccccagacgtgcccgcagccacccgaaccctcttgctcttgaacaataagctgagt gccctgccaagctgggctttcgccaacctctccagcctgcagcggttggacctgtccaac aacttcctggaccggctgccccgctccattttcggggacctgacgaatctgactgagctt cagctgcgcaataacagcatcaggaccctggacagggacctgctgcggcactcgccgctg ctccgccacctggacctgtccatcaacggcctggcccagttgccccctggtcttttcgac gggctcctggctctgcgctccctctcgcttcgctccaaccgtctgcagaatctggaccgg ctgacatttgaacccctagcaaacctgcagctgctgcaggtcggggataacccctgggag tgtgactgtaacctgcgtgagttcaaacactggatggagtggttctcctaccgaggggga cgcttggaccagcttgcctgcaccctgcccaaggagctgagggggaaggacatgcggatg gtccccatggagatgttcaactactgctcccagctggaggacgagaatagctcagctggg ctggatattcctgggccaccctgcaccaaggccagtccagagcctgctaagcccaagccc ggggctgagccggagccggagcccagcacagcctgcccacagaagcagaggcaccggccg gcgagcgtgaggcgagccatgggcacggtgatcattgcaggggtcgtgtgcggcgtcgtc tgcatcatgatggtggtggccgctgcctatggctgcatctacgcctccctcatggccaag taccaccgggagctcaaaaagcgccagcccctgatgggggaccccgagggcgagcacgag gaccagaagcagatctcttctgtggcctga >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_7|1689_aa MGKKQCKKAKNSKNQNASSPPKDHNSSPAGEQNWMENELTEVGFRRWVVTDSSKLKEHVL TQCKEAKNLEKRLQELLTRITSLEKNIYDLMELKNTARELRDAYISISSRIDQAEKRISE IEDQLNEIKREDKIREKNEKDEQGLQEIWDYVKRPNLHLIGVPGLLYSDMCRLLPSPRNQ PALQALERGVILEVKCVVCSTQAGAARRGVKISIKGKGFSVVSVVGTLQWLLWARAAHAP HWRFLRWMAALWDVPGKTGPSPISLTGQRGNRGPESSSILRGVPKDFSTGTSAQLRRAMG LAPEGGGFQAFFPRPTMPATPNFLANPSSSSRWIPLQPMPVAWAFVQKTSALLWLLLLGT SLSPAWGQAKIPLETVKLWADTFGGDLYNTVTKYSGSLLLQKKYKDVESSLKIEEVDGLE LVRKFSEDMENMLRRKVEAVQNLVEAAEEADLNHEFNESLVEPGVGVGVGMSVTQSGVGV GVGMSVTQSGVGVGVGMSITLSGVGVGVGMSVRQSGVGVGVGMSVTQSGVGVGVGMSVTQ SGVGVGVGMSVRQSGVGVGVGMSVTQSWGVFSAQRAAAGACVDSDGRPAPALSSSHLRRF SSSLSACPGARAASVGLTRPPQFDYYNSVLINERDEKGNFVELGAEFLLESNAHFSNLPV NTSISSVQLPTNVYNKDPDILNGVYMSEALNAVFVENFQRDPTLTWQYFGSATGFFRIYP GIKWTPDENGVITFDCRNRGWYIQAATSPKDIVILVDVSGSMKGLRMTIAKHTITTILDT LGENDFINIIAYNDYVHYIEPCFKGILVQADRDNREHFKLLVEELMVKGVGVVDQALREA FQILKQFQEAKQGSLCNQAIMLISDGAVEDYEPVFEKYNWPDCKVRVFTYLIGREVSFAD RMKWIACNNKGYYTQISTLADTQENVMEYLHVLSRPMVINHDHDIIWTEAYMDSKLLSSQ AQSLTLLTTVAMPVFSKKNETRSHGILLGVVGSDVALRELMKLAPRYKLGVHGYAFLNTN NGYILSHPDLRPLMKMQKADAIKSGRMADDPQTEPVPSMRSCILTRLGLRETEGGEPAAA GGFESGFGPCLESTTQTENGIFLVQERHRPSGQRPATASPRPGSPHTGSPRACRPGCIWW SPRLPLVLLLLLGLVRGAAAADAVKKGNGTACVMASFAAFSMNYDSKSGSKNVTFDLPSN AEVLDSGSCSKENTSDPRHAIVLWRKTDTDLPFHKKCDTQRERGAHLETRERRGEGTTVL GFRLGMNAGSSRIFLRGIQLNTTLPDPRDPTFKAANDSLRALRAAIGNSTAGAERVRGTK AFSINIFPVWGQAFKVEGNQFGSAEERLLGENNTPIPIAVGGALFPKSKKESQKEVKGIR TSPGPTAKCGFPNDMNLCGGQDNPQNLSTPNAPSRQPRAREAVDNPDLQMRTRSEKRVLF LTNDYFFTDISDTPFSLGVVLSRGHGEYILLGNTSVEEGLHDLLHPDLALAGDWIYCITD IDPDHRKLSQLEAMIRFLTRKDPDLECDEELVREVLFDAVVTAPMEAYWTALALNMSEES EHVVDMAFLGTRAGLLRSSLFVGSEKVSDRKFLTPEDEASVFTLDRFPLWYRQASEHPAG SFVFNLRWAEGPESAGEPMVVTASTAVAVTVDKRTAIAAVPLENPQIWYQPTRQKKEDCG HGQGRLGSQ >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_7|5070_bp atggggaaaaaacagtgcaaaaaggctaaaaattccaaaaaccagaatgcctcttctcct ccaaaggatcacaactcctcgccagcaggggaacaaaactggatggagaatgaattgaca gaagtaggcttcagaaggtgggtcgttacagactcctccaagctaaaggagcatgttcta acccaatgcaaggaagctaagaaccttgaaaaaaggttacaggaactgctaactagaata accagtttagagaagaacatatatgacctgatggagctgaaaaacacagcacgagaactt cgtgatgcatacataagtatcagtagccgaattgatcaagcagaaaaaaggatatcagag attgaagatcaacttaatgaaataaagcgtgaagacaagattagagaaaaaaatgaaaag gatgaacaaggcctccaagaaatatgggactatgtgaaaagaccaaacctacatttgatt ggtgtacctggcctgctctattcagacatgtgccgcctcctcccttctccaagaaatcag cctgccttgcaggctttggaacgtggagtcatcttggaggtcaaatgcgtcgtttgcagc acgcaggctggggcagcgcggcgtggtgtaaagataagtatcaaggggaaaggattttct gtggtgtcagtcgtaggaaccttgcaatggttgctttgggccagagccgcgcatgctcca cactggcgttttctacgttggatggcagccctgtgggatgtgccaggcaaaactggccct tcccccatcagcctcacgggtcagagagggaaccggggtccggagtcctcctctatcctc cgaggcgtgcccaaggatttcagcacgggaacatcagcccaactaaggcgagccatgggg ctggcccctgagggcggaggtttccaggccttcttccccaggcccaccatgcctgcaact cccaacttcctcgcaaaccccagctccagcagccgctggattcccctccagccaatgccc gtggcctgggcctttgtgcagaagacctcggccctcctgtggctgctgcttctaggcacc tccctgtcccctgcgtggggacaggccaagattcctctggaaacagtgaagctatgggct gacaccttcggcggggacctgtataacactgtgaccaaatactcaggctctctcttgctg cagaagaagtacaaggatgtggagtccagtctgaagatcgaggaggtggatggcttggag ctggtgaggaagttctcagaggacatggagaacatgctgcggaggaaagtcgaggcggtc cagaatctggtggaagctgccgaggaggccgacctgaaccacgaattcaatgaatccctg gtggaacctggcgtgggagttggcgtggggatgtccgtgacgcagtccggcgtgggagtt ggcgtggggatgtccgtgacgcagtccggcgtgggagttggcgtggggatgtccataacg ctgtccggcgtgggagttggcgtggggatgtccgtgaggcagtccggcgtgggagttggc gtggggatgtccgtgacgcagtccggcgtgggagttggcgtggggatgtccgtgacgcag tccggcgtgggagttggcgtggggatgtccgtgaggcagtccggcgtgggagttggcgtg gggatgtccgtgacgcagtcctggggggtgttcagtgcccagcgcgccgccgcgggtgct tgtgtagactctgatggccgcccggccccggccctctcgtcctctcacctgcgccgtttc tcttcctctctctccgcctgtcccggtgctcgggccgcctccgtgggcctcacccgtcca ccccagttcgactattacaactcggtcctgatcaacgagagggacgagaagggcaacttc gtggagctgggcgccgagttcctcctggagtccaatgctcacttcagcaacctgccggtg aacacctccatcagcagcgtgcagctgcccaccaacgtgtacaacaaagacccagatatt ttaaatggagtctacatgtctgaagccttgaatgctgtcttcgtggagaacttccagaga gacccaacgttgacctggcaatattttggcagtgcaactggattcttcaggatctatcca ggtataaaatggacacctgatgagaatggagtcattacttttgactgccgaaaccgcggc tggtacattcaagctgctacttctcccaaggacatagtgattttggtggacgtgagcggc agtatgaaggggctgaggatgactattgccaagcacaccatcaccaccatcttggacacc ctgggggagaatgacttcattaatatcatagcgtacaatgactacgtccattacatcgag ccttgttttaaagggatcctcgtccaggcggaccgagacaatcgagagcatttcaaactg ctggtggaggagttgatggtcaaaggtgtgggggtcgtggaccaagccctgagagaagcc ttccagatcctgaagcagttccaagaggccaagcaaggaagcctctgcaaccaggccatc atgctcatcagcgacggcgccgtggaggactacgagccggtgtttgagaagtataactgg ccagactgtaaggtccgagttttcacttacctcattgggagagaagtgtcttttgctgac cgcatgaagtggattgcatgcaacaacaaaggctactacacgcagatctcaacgctggcg gacacccaggagaacgtgatggaatacctgcacgtgctcagccgccccatggtcatcaac cacgaccacgacatcatctggacagaggcctacatggacagcaagctcctcagctcgcag gctcagagcctgacactgctcaccactgtggccatgccagtcttcagcaagaagaacgaa acgcgatcccatggcattctcctgggtgtggtgggctcagatgtggccctgagagagctg atgaagctggcgccccggtacaagcttggagtgcacggatacgcctttctgaacaccaac aatggctacatcctctcccatcccgacctccggcccctgatgaaaatgcagaaggcagat gccataaagtctggaaggatggcagatgacccgcagacggagccagtgccttccatgagg agctgcatcctcaccaggttaggtctcagggaaactgaaggaggagagccagcagcagcg ggtggctttgaatctggcttcggcccatgcctagaaagcacgacgcaaacagaaaatgga atatttttggtccaggaacgccaccgtccttcaggccagcgccccgcaaccgcctccccg cgccccggttccccacacaccggctcccctcgcgcctgccgcccgggctgcatctggtgg agcccccggctgcccctggtgctgctgctgctgctgggcctcgtgcgtggtgcagccgca gcggatgcggtgaaaaagggcaacgggacagcctgcgtaatggccagcttcgctgccttc tcgatgaactacgactctaagagtggctctaagaatgtgacctttgacctgccatccaat gcggaagtgcttgacagtggctcttgcagtaaagagaacacttccgaccccaggcacgcg attgtgctttggaggaagacagacactgaccttccatttcacaagaagtgcgacacccag cgggagcgcggcgcccacctggagacccgggagcggcgcggcgagggcaccacggtcctg ggcttccggttggggatgaatgcaggttctagccggattttcctacgaggaatccagctg aatacgactcttcctgaccccagagaccccacttttaaagctgccaacgactccctgcga gcattgcgggcagccatcggcaattctacagccggagcggagcgtgtccgcggcacgaag gcgttttcaatcaacatattcccagtgtggggccaggctttcaaggtggaaggcaaccag ttcggatctgcggaggagcgtctgctgggcgagaacaacacgccgatccccatcgccgtc ggcggcgccctgtttccaaaatcaaagaaggaaagtcagaaagaagtaaagggaataaga acaagtcctgggccaactgcaaaatgtggctttcccaatgacatgaacttgtgtggtgga caggacaatccacagaacctcagcaccccgaatgcaccctcacggcagcctcgtgcaaga gaggccgtggacaatcctgatttacaaatgagaactcgaagtgaaaagcgagttcttttc ctgaccaatgactacttcttcacggacatcagcgacacccctttcagtttgggggtggtg ctgtcccggggccacggagaatacatccttctggggaacacgtctgtggaagaaggcctg catgacttgcttcacccagacctggccctggccggtgactggatctactgcatcacagat attgacccagaccaccggaagctcagccagctagaggccatgatccgcttcctcaccagg aaggacccagacctggagtgtgacgaggagctggtccgggaggtgctgtttgacgcggtg gtgacagcccccatggaagcctactggacagcgctggccctcaacatgtccgaggagtct gaacacgtggtggacatggccttcctgggcacccgggctggcctcctgagaagcagcttg ttcgtgggctccgagaaggtctccgacaggaagttcctgacacctgaggacgaggccagc gtgttcaccctggaccgcttcccgctgtggtaccgccaggcctcagagcatcctgctggc agcttcgtcttcaacctccgctgggcagaaggaccagaaagtgcgggtgaacccatggtg gtgacggcaagcacagctgtggcggtgaccgtggacaagaggacagccattgctgcagtc cctttagaaaatccccagatctggtaccagcccacacgccaaaagaaggaagactgtggc catggccagggccgcctggggtcccagtaa >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_8|411_aa MNIKAKILNKILANRIQQHIKKLIHHDHVSFIPRMQGWFNIHKPINVIHHINRTNDKNHM IISIDAEKAFDKIQHPFTLKTLNKLDDMTVYLENPIVSAQNLKLISNFSKVSGYKINVQK SQAFLYTNNRQTESQIISELPFTIATKIIKYLGIQLTRDVKDLFKETYKPQLKEIREDTK KWKNIPCSWLGRISMIKMATLPKVIYRFNAIPIKLPLTFFTELEKTTLNFIWNQKRARIA KTILSKKNKAAGIMLPDFKLYYKATVTKTAWYWYQNRYTDQGTEQRPSEIMPHIYNHLTF DKPDKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTL EENLGNTVQDIGMCKDFMTKTPKAMATKAKIDKWDLIKLKSFCTSKETISE >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_8|1236_bp atgaacatcaaagcgaaaatccttaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccacgatcatgtcagcttcatccctcggatgcaaggctggttcaac atacacaaaccaataaacgtcatccatcacataaacagaaccaatgacaaaaaccacatg attatctcaatagatgcagaaaaggccttcgataaaattcaacaccccttcacgctaaaa actctcaataaactagatgacatgactgtatatttagaaaaccccatcgtctcagcccaa aatcttaaactcataagcaatttcagcaaagtctcaggatacaaaatcaatgtgcaaaaa tcacaagcattcctatacaccaataatagacaaacagagagccaaatcataagtgaactc ccattcacaattgctacaaagataataaaatacctaggaatacaacttacaagggatgtg aaggatctcttcaaggagacctacaaaccacaactcaaggaaataagagaggacacaaag aaatggaaaaacattccatgctcatggctaggaagaatcagtatgatcaaaatggccaca ctgcccaaagtaatttatagattcaatgctatccccatcaagctaccattgactttcttc acagaattagaaaaaactactttaaatttcatatggaaccaaaaaagagcccgtatagcc aagacaatcctaagtaaaaagaacaaagctgcaggcatcatgttacctgacttcaaacta tactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagatatacagac caaggaacagaacagaggccctcagaaataatgccacacatctacaaccatctgactttt gacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataagtggtgttgg gaaaactggctagccatatgcagaaagctgaaactggaccccttccttacaccttataca aaaattaactcaagatggattaaagacttaaatgtaagacctaaaaccataaaaacccta gaagaaaacctaggcaataccgttcaggacataggcatgtgcaaagacttcatgactaaa acaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctgattaaactaaag agtttctgcacatcaaaagaaactatatcagaatga >gi568815586r:1693658_2018473|GENSCAN_predicted_peptide_9|560_aa MAAVAAGGLVGKGRDISLAALQRHDPYINRIVDVASQVALYTFGHRANEWEKTDVEGTLF VYTRSASPKHGFTIMNRLSMENRTEPITKDLDFQLQDPFLLYRNARLSIYGIWFYDKEEC QRIAELMKNLTQYEQLKAHQGTGAGISPVILNSGEGKEVDILRMLIKAKDEYTKCKTCSE PKKITSSSAIYDNPNLIKPIPVKPSENQQQRIPQPNQTLDPEPQHLSLTALFGKQDKATC QETVEPPQTLHQQQQQQQQQQEKLPIRQGVVRSLSYEEPRRHSPPIEKQLCPAIQKLMVR SADLHPLSELPENRPCENGSTHSAGEFFTGPVQPGSPHNIGTSRGVQNASRTQNLFEKLQ STPGAANKCDPSTPAPASSAALNRSRAPTSVTPVAPGKGLAQPPQAYFNGSLPPQTVGHQ AHGREQSTLPRQTLPISGSQTGSSGVISPQELLKKLQIVQQEQQLHASNRPALAAKFPVL AQSSGTGKPLESWINKTPNTEQQTPLFQSPEPSVITSSPLTKLQLQEALLYLIQNDDNFL NIIYEAYLFSMTQAAMKKTM >gi568815586r:1693658_2018473|GENSCAN_predicted_CDS_9|1683_bp atggcagccgtggcggcaggcggcctggtgggaaaggggcgcgacatcagcctagcggcc ctgcagcgccacgacccctatatcaaccgcatcgtggacgtggccagccaggtggctctg tacaccttcggccatcgggccaacgagtgggagaaaactgatgtggaaggaaccttattt gtttatacaaggtctgcttctccaaagcatggattcaccattatgaataggctgagcatg gaaaataggacagaacctattactaaagacttggatttccaactccaggaccctttcctt ctctacagaaatgccagattgtccatctatggaatttggttttatgataaggaagaatgc caaagaattgcagagcttatgaaaaacctaactcagtatgaacagttgaaagcccatcag ggaactggagcaggaatttccccagtgatcctcaattcaggagagggcaaagaagtagac attttacgaatgctcatcaaggccaaagacgaatacacaaagtgtaaaacctgttctgag ccaaaaaagataaccagttcctctgccatctatgacaatccaaatctcatcaaaccaatt ccagtgaaacccagtgaaaaccagcaacagcgtatacctcagcccaaccagaccttagac cctgaaccccaacacttatccttgacagctctgtttgggaagcaggacaaagctacatgt caggaaactgtggagcctccgcagactctccaccagcagcagcagcagcagcagcagcag caagagaagcttccaattaggcagggggttgtacgctccctgtcctatgaggaacccaga agacactcaccccccattgagaagcagctctgtccagccattcagaaactcatggtcagg agcgcagacctccacccattgtcagagctgcctgaaaaccggccttgtgaaaatggcagt acccattctgcgggagaattttttacaggacctgtccagccagggtctcctcacaacatt ggaacttctcgtggtgtacaaaatgcttccagaactcagaacctgttcgagaaacttcag agtaccccaggggcagcaaacaagtgtgaccctagtacaccagcacctgccagctcagct gccctgaaccgcagcagagctcccacttctgtcacccctgtggctccaggaaagggtctg gctcagccaccacaggcctatttcaatggctcccttccacctcagacagtaggacatcag gctcatggaagagaacagtccacactcccaagacaaacactccccatctctggtagtcag actggcagctctggagtgatctcccctcaagagttactgaagaagcttcagattgtacag caggagcagcagctgcatgcctctaaccggccagccttggccgctaagtttcctgtgctc gctcagagctctggaacagggaaacccttggaatcctggatcaacaagacacccaacaca gaacagcagactcctcttttccagagcccggagccctccgtgatcaccagcagcccactc accaagctccagctccaggaggcactgctgtacctcattcagaatgatgacaacttctta aatataatctatgaagcctatctcttcagcatgactcaagcagccatgaaaaagactatg tga