GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:18:22

Sequence gi568815580f:518001_745098 : 227098 bp : 42.97% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   1153   1302  150  0  0   90   39   167 0.524   8.93
 1.02 PlyA +   4495   4500    6                               1.05

 2.00 Prom +   9927   9966   40                              -6.45
 2.01 Sngl +  18606  19619 1014  2  0   88   43   756 0.996  67.86
 2.02 PlyA +  19637  19642    6                              -4.04

 3.00 Prom +  19719  19758   40                              -5.25
 3.01 Sngl +  19977  23033 3057  2  0   44   39  1068 0.666  88.67
 3.02 PlyA +  23711  23716    6                               1.05

 4.09 PlyA -  26013  26008    6                               1.05
 4.08 Term -  27123  27058   66  0  0   99   39    98 0.366   2.96
 4.07 Intr -  29510  29405  106  1  1   74   41    63 0.168  -0.50
 4.06 Intr -  48080  47891  190  0  1   49   80   134 0.096   6.42
 4.05 Intr -  48384  48356   29  2  2   81   73    54 0.522   0.04
 4.04 Intr -  49751  49678   74  2  2   47  115    42 0.579   0.09
 4.03 Intr -  55969  55844  126  1  0  106   95    95 0.958  11.96
 4.02 Intr -  58856  58790   67  1  1  100   88    71 0.821   6.29
 4.01 Init -  59785  59703   83  1  2   21   36    88 0.677  -2.61
 4.00 Prom -  61546  61507   40                              -4.95

 5.00 Prom +  61738  61777   40                              -6.05
 5.01 Sngl +  62409  62927  519  2  0   92   45   839 0.992  75.79
 5.02 PlyA +  63594  63599    6                               1.05

 6.00 Prom +  70059  70098   40                              -8.75
 6.01 Init +  70157  70168   12  1  0   88   68     0 0.088  -1.70
 6.02 Intr +  78802  79449  648  0  0   81   41   290 0.285  14.81
 6.03 Intr +  88411  88747  337  0  1   82   45   112 0.015   0.47
 6.04 Intr +  92555  92653   99  0  0   77   32   122 0.209   4.66
 6.05 Intr + 101213 101361  149  0  2   42   92   163 0.935  11.13
 6.06 Intr + 106865 107032  168  1  0   51  116   144 0.999  12.72
 6.07 Intr + 109097 109533  437  1  2   89    6   268 0.031  10.15
 6.08 Intr + 111758 111904  147  2  0   78   68    61 0.022   1.53
 6.09 Intr + 112029 112062   34  0  1   82   78    26 0.041  -1.69
 6.10 Intr + 113686 113761   76  0  1   56   86    72 0.637   1.97
 6.11 Intr + 123327 123541  215  1  2   80  103   262 0.859  24.41
 6.12 Intr + 126910 127097  188  0  2   88  107   101 0.678   9.57
 6.13 Term + 134275 134434  160  1  1   44   42    84 0.046  -4.17
 6.14 PlyA + 134825 134830    6                               1.05

 7.08 PlyA - 135376 135371    6                               1.05
 7.07 Term - 137531 137469   63  2  0   67   48    51 0.419  -4.09
 7.06 Intr - 139963 139842  122  2  2  100   84   182 0.757  18.29
 7.05 Intr - 140267 140097  171  0  0   44   56   133 0.785   4.69
 7.04 Intr - 141043 140770  274  1  1   14   82   160 0.867   4.19
 7.03 Intr - 141776 141399  378  2  0  -14   42   316 0.415  11.04
 7.02 Intr - 143393 143333   61  1  1   42  113    69 0.654   2.62
 7.01 Init - 147276 145175 2102  0  2   70   53   602 0.597  42.44
 7.00 Prom - 147590 147551   40                             -10.45

 8.00 Prom + 148128 148167   40                              -8.95
 8.01 Init + 148169 148266   98  1  2   31   52   110 0.916   1.53
 8.02 Intr + 148885 149273  389  1  2 -108   19   845 0.607  51.51
 8.03 Intr + 149294 149605  312  0  0  -98   19   661 0.249  35.83
 8.04 Intr + 152692 152867  176  2  2   79   86   236 0.833  21.24
 8.05 Term + 154860 154997  138  2  0   80   46   190 0.988  10.98
 8.06 PlyA + 155181 155186    6                               1.05

 9.15 PlyA - 155934 155929    6                               1.05
 9.14 Term - 156406 156305  102  1  0   73   42   121 0.958   3.30
 9.13 Intr - 157402 157321   82  0  1   93  121    37 0.914   6.22
 9.12 Intr - 159444 159345  100  1  1   37   68    57 0.541  -3.15
 9.11 Intr - 159872 159743  130  2  1   69   80   124 0.823   9.05
 9.10 Intr - 160737 160696   42  0  0   71  109    35 0.510   1.42
 9.09 Intr - 165380 165246  135  2  0   80   97    78 0.951   7.74
 9.08 Intr - 168008 167921   88  1  1   51   27   154 0.963   4.65
 9.07 Intr - 168092 168047   46  0  1   23   92    91 0.234  -0.45
 9.06 Intr - 170608 170519   90  2  0  118   92    55 0.584   7.95
 9.05 Intr - 173276 173204   73  2  1   85   95    57 0.553   4.16
 9.04 Intr - 176104 175970  135  1  0   19   -9   196 0.435   2.84
 9.03 Intr - 176334 176248   87  0  0   65   86    82 0.378   4.95
 9.02 Intr - 179355 179240  116  1  2   -2   82    96 0.662  -0.75
 9.01 Init - 188638 188470  169  1  1   26   97   282 0.717  20.64
 9.00 Prom - 188818 188779   40                              -7.85

10.00 Prom + 189471 189510   40                              -9.85
10.01 Init + 191183 191405  223  1  1   55   86   251 0.648  20.36
10.02 Intr + 193483 193564   82  2  1   68   45    61 0.313  -2.42
10.03 Intr + 193882 194138  257  1  2   81   32   155 0.173   5.36
10.04 Intr + 194269 194621  353  2  2   45   37   364 0.137  21.12
10.05 Term + 199216 199266   51  0  0  113   43     9 0.281  -4.65
10.06 PlyA + 201182 201187    6                               1.05

11.07 PlyA - 203008 203003    6                               1.05
11.06 Term - 206632 206424  209  2  2   81   31   186 0.988   8.72
11.05 Intr - 214965 214834  132  1  0   27  115    66 0.214   2.90
11.04 Intr - 218961 218808  154  0  1   61   93   137 0.977  10.22
11.03 Intr - 221811 221735   77  1  2  101   70    60 0.989   3.82
11.02 Intr - 225097 224918  180  2  0   74   95   155 0.999  13.72
11.01 Intr - 225415 225260  156  2  0   40   86   175 0.895  11.56

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 109097 109540  444  1  0   89   42   263 0.931  16.05

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:518001_745098|GENSCAN_predicted_peptide_1|49_aa AYLERREICLAVGLSLLFSDDLSRGGYEQLLSPEPGPPNEKCEADHVAW >gi568815580f:518001_745098|GENSCAN_predicted_CDS_1|150_bp gcctatttggagcgacgtgaaatatgccttgctgttgggctctccctcctgttctctgat gatctgtcacgagggggatatgaacaactgctgtccccagagcctgggcccccgaatgag aaatgtgaagcagaccacgtggcctggtag >gi568815580f:518001_745098|GENSCAN_predicted_peptide_2|337_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGAPESDVENGTKLENTLQN IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFKVEMKEKMLRAAREKGRVT LKGKPIRLTADLSAETLQARREWGPIFNIHKEKNFQPRISYPAKLSFISEGEIKYFIDKQ MLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815580f:518001_745098|GENSCAN_predicted_CDS_2|1014_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacacagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagcg atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggtgcacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcagaat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga tttaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggttacc ctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagccaga agagagtgggggccaatattcaacattcataaagaaaagaattttcaacccagaatttca tatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaagcaa atgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcgcta aatatggaaaggaacaaccggtaccagccactgcaaaatcatgccaaaatgtaa >gi568815580f:518001_745098|GENSCAN_predicted_peptide_3|1018_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCK RTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKG DITTNPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSET VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIHK SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFTLKTLNKLGIDGTYFKIIRAIYDKPT ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNRQTESQIMSELPF TIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILP KVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYY KATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWEN WLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTP KAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTQWEKIFATYSSDKGLISRIYNELKQ IYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIKRNANQNHYEISSHTS >gi568815580f:518001_745098|GENSCAN_predicted_CDS_3|3057_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcggacctaata 
gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacaccacataccagaatctctgggatgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaactcattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca agactaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccaatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacgcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaact gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatc gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacacaaa tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcacgctaaaaactctcaat aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca gccaatatcatattgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgtcctctctcaccactcctattcaacatagtgttggaagttctggccagggca atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacagacaaacagagagccaaatcatgagtgaactcccattc acaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggac ctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaatgg aagaacattccatgctcatgggtaggaagaatcaatattgtgaaaatggccatactgccc 
aaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcacagaa ttggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcattgccaagtca atcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactac aaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatgg aacagaacagagccctcagaaataatgccgcatatctacaactatctgatctttgacaaa cctgagaaaaacaaacaatggggaaaggattccctatttaataaatggtgctgggaaaac tggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatc aattcaagatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaa aacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaacacca aaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttc tgcacagcaaaagaaactaccatcagagtgaacaggcaacctacacaatgggagaaaatt ttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaa atttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacac ttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatcactg gccatcaagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttag >gi568815580f:518001_745098|GENSCAN_predicted_peptide_4|246_aa MCAINSYNKGPPRAPFIETVPIVTVSSWCTCEVALLPGVTPKVLGLTATEIPDQPPIKHC FDGNRLRKDKIYVDINSKQNLMGFAIRQLSHPSTIQFIHRWMKLPNSPSRETTISTRGWC DFASGGEIGEREKKSSSTIVIGREQFPSFSGSSEYLLTELKTHLRVPLEEAPVFSRETVQ QGGKSRPCCRDLVPCIPIVPALAERAQHRELWLQRVEAPNLGSFHPTQYEDDENEDLYEG PLPLNE >gi568815580f:518001_745098|GENSCAN_predicted_CDS_4|741_bp atgtgtgcaatcaacagttacaacaaagggccaccaagagctcccttcatagaaacagtg cccatagtcacagtcagctcttggtgtacttgtgaagtggccttgttgcctggggtgaca cccaaggttctcggtctcacggccacggagatcccagatcagcccccaattaagcattgc tttgatggaaacaggctgagaaaagacaagatctatgttgacataaactccaagcagaac ctcatggggtttgcaattcggcagttgtcccaccctagcaccatccagtttatccatagg tggatgaaactgcccaactcgccatcccgagaaacaacaatcagcaccaggggttggtgt gactttgcctctggtggagagataggagagagagaaaaaaagagttcctccacaattgtg attggaagagaacaatttccttcattttctggctccagtgagtacctcctaacagagctg aaaactcatcttcgggttcctctagaagaggctccagtgtttagcagagaaacagttcag cagggtggaaagagccggccgtgttgcagggacttggtgccctgcatcccaattgttcca gctttggctgaaagggcccaacatagggagttgtggcttcagagagtggaagccccaaac 
cttggcagcttccaccctactcaatatgaagacgacgagaatgaagacctttatgagggt ccacttccacttaatgagtag >gi568815580f:518001_745098|GENSCAN_predicted_peptide_5|172_aa MASGFKKPSAASTGQKRKVAPKPELTEDQKQEVREAFDLFDVDGSGTIDAKELKVAMRAL GFEPRKEEMKKMISEVDREGTGKISFNDFLAVMTQKMSEKDTKEEILKAFRLFDDDETGK ISFKNLKRVANELGENLTDEELQEMIDEADRDGDGEVNEEEFLRIMKKTSLY >gi568815580f:518001_745098|GENSCAN_predicted_CDS_5|519_bp atggcttccggcttcaagaagcccagcgctgcctccaccggccaaaagagaaaggtggca cctaagcccgagctcactgaggatcagaagcaagaagttcgggaagcatttgacctcttc gacgtggacggaagtgggaccatcgacgcgaaggagctgaaggtggccatgagagcgctg ggcttcgaacccaggaaggaagagatgaagaaaatgatctccgaggtggacagggaaggc acggggaagatcagcttcaatgacttcctggccgtgatgacgcagaagatgtccgagaag gacaccaaagaagaaatcctgaaggccttcaggctctttgatgacgatgagaccgggaag atctcgttcaaaaacctgaagcgtgtggccaacgagctgggggagaacctcacggatgag gagctgcaggagatgatcgacgaagctgatcgggatggggacggcgaagtgaacgaggag gagttccttcggatcatgaagaagaccagcctttactga >gi568815580f:518001_745098|GENSCAN_predicted_peptide_6|889_aa MGVQVPPTRRLQAPPTRRHLQAPPTRRLQAPPTRRHLQAPPTRRLQAPPTRRHLQAPPTR RLQAPPTLRHLQARAAGLVSTLEVADTLCPRLTSSRWHRRLQGAALKRILGMTGENILRA PDFSPRRSCPFLRRARESRSQVQGRAPQTPGPSGADRTKPLFLPGKVEILWAASEAASKT QAISWAVPIRLFSFSKQEEEVEAGRHTIPAKLLAKTKRSRDQPIVHNVEVQLPFKSEFFL HDISLQSPGTHSSETLAVPQPTPSSLSSPLVRRSQNISQLPSNQAWLSHPGLAPEQPPAS FSVAKRLLESSQMKEILSKAWRRKEKEDYIIKRNFIEREFQAVEIRAPRTKPQGRETSDG GQRKKGFSEVGEIDADEEVKKALTGIKQMKIMMERKEKEHTNLMSTLKKCREEKQEALKL LNEVQEHLEEEERLCRESLADSWGECRSCLENNCMRIYTTCQPSWSSVKNKIERFFRKIY QFLFPFHEDNEKDLPISEKLIEEDAQLTQMEDVFSQLTVDVNSLFNRSFNVFRQMQQEFD QTFQSHFISDTDLTEPYFFPAFSKEPMTKADLEQCWDIPNFFQLFCNFSVSIYESVSETI TKMLKAIEDLPKQDKGNIPPLARQYTEATAEETLSSQMSKCGLFIKNSQAGALQHLNVVS ITFLDRTCEFMLCGKGKELSPEKGKQISEQLHHHIVGKMAEEDCPDVPALHTELDEAIRL VNVSNQQYGQILQMTRKHLEDTAYLVEKMRGQFGWVSELANQAPETEIIFNSIQVVPRIH EGNISKQDETMMTDLSILPSSNFTLKIPLEESAESSNFIGYVVAKALQHFKEHFKTCFQL PEINCGFKILCGKFQKYIVSFQLHAIKSHAVPDPFLSGGECSLCPVAPR >gi568815580f:518001_745098|GENSCAN_predicted_CDS_6|2670_bp atgggggtgcaggtcccgcccacccggcgtctgcaggccccgcccacccggcgtcacctg 
caggccccgcccacccggcgtctgcaggccccgcccacccggcgtcacctgcaggccccg cccacccggcgtctgcaggccccgcccacccggcgtcacctgcaggccccgcccacccgg cgtctgcaggccccgcccaccctgcgtcacctgcaggcccgggccgcggggttggtttcc accctggaggttgctgacaccctgtgccctcggctgacttccagccggtggcacagacgc ctccagggggcagcactcaagcgcatcttaggaatgacaggtgagaacatcctccgggcc ccagatttctctcctcgccgctcttgcccatttctccggagagccagagaaagccgctcc caagtccaaggccgagctccgcagacgcccggcccctccggcgcggacagaacaaagcca ctgttcttgccggggaaggtagaaatactgtgggctgcttcagaggctgccagcaaaact caggcaatctcctgggctgttccaatacgtttattctctttttcaaaacaggaggaggag gtagaggcggggagacacaccatccctgcaaaactactggcaaaaactaagcggagccgg gaccagcccatcgtccacaacgtggaagtccagcttccgttcaaatcggagttctttctt catgacatttctttgcaaagtcccggaacccacagctctgagactctggctgtcccccaa cccaccccatcttccttgtcctcacccctggtcaggagaagccaaaacatcagtcagctt cccagtaatcaagcctggctttctcacccagggctcgccccagaacaaccaccggcttct ttcagtgtagccaaaaggctattggagtcttctcaaatgaaagagattttatcaaaggct tggagaagaaaagaaaaagaggattatataataaaacgtaacttcatcgaaagagagttt caggcagtagaaataagagcacccaggacaaagccccagggaagagaaacatctgacgga ggacagaggaagaagggtttttctgaggtgggggagatagatgcagatgaagaggtgaag aaggctttgactggtattaagcaaatgaaaatcatgatggaaagaaaagagaaggaacac accaatctaatgagcaccctgaagaaatgcagagaagaaaagcaggaggccctgaaactt ctgaatgaagttcaagaacatctggaggaagaagaaaggctatgccgggagtctttggca gattcctggggtgaatgcaggtcttgcctggaaaataactgcatgagaatttatacaacc tgccaacctagctggtcctctgtgaaaaataagattgaacggtttttcaggaagatatat caatttctatttcctttccatgaagataatgaaaaagatctccccatcagtgaaaagctc attgaggaagatgcacaattgacccaaatggaggatgtgttcagccagttgactgtggat gtgaattctctctttaacaggagttttaacgtcttcagacagatgcagcaagagtttgac cagacttttcaatcacatttcatatcagatacagacctaactgagccttacttttttcca gctttctctaaagagccgatgacaaaagcagatcttgagcaatgttgggacattcccaac ttcttccagctgttttgtaatttcagtgtctctatttatgaaagtgtcagtgaaacaatt actaagatgctgaaggcaatagaagatttaccaaaacaagacaaaggcaatatccctccc ctagccagacagtacacagaagctaccgcagaggagacactgtcttcccagatgagcaaa tgtggactgtttatcaagaatagtcaggcaggcgctctacagcacttgaatgtggtttcc 
atcacttttctggacagaacctgtgaatttatgttatgtggcaaagggaaagagctcagt ccagagaaaggaaaacagattagtgaacaattacatcaccatattgtgggtaaaatggca gaagaagactgtcctgatgtacctgctctgcacacagaattagacgaggcgatcaggttg gtcaatgtatccaatcagcagtatggccagattctccagatgacccggaagcacttggag gacaccgcctatctggtggagaagatgagagggcaatttggctgggtgtctgaactggca aaccaggccccagaaacagagatcatctttaattcaatacaggtagttccaaggattcat gaaggaaatatttccaaacaagatgaaacaatgatgacagacttaagcattctgccttcc tctaatttcacactcaagatccctcttgaagaaagtgctgagagttctaacttcattggc tacgtagtggcaaaagctctacagcattttaaggaacattttaaaacctgctttcagtta cccgagatcaactgcggttttaaaatattatgtggaaaattccagaaatacatagtaagt tttcaattgcatgccattaaatctcatgctgtccctgaccccttcctctccggaggtgaa tgctccctttgtccagtggctccacgatga >gi568815580f:518001_745098|GENSCAN_predicted_peptide_7|1056_aa MDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKLGRDTTKKENFRPISLMNIDAKFLNKILA NRIQQHIKKLIHHDHVGFIPGMQGWFNIHKSINVIQHINRSKDKNHMIISIDAEKAFDKI QQHIMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV SGYKINLQKSQAFLYTNNRQTESQIMSELPLTIASKRIKYLGIHLKRDVKDLFKENYKPL LNEIKEDTKKWKTIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFI WNQKRAHIAKSILSQKNKAGDITLPDFKLYYKATVTKTVWYWYQNRHIDQWNRTEPSEIT PHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICKKLKLDPFLTPYTKINSRWIKDLN VRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIR VNRQPTKWEKIFATYSSDKELISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAA KKHMKKCSPSLAIREMQIKTTMRHHLTPVRMAIIKKSGNNSAGASTPVFDRVTNGVTPTI KDLTGCCVENRLLTSNSSDFFTLINHSNSSKTPFQNTRLVVSRGNSSEKQFAIRFQDGKT DHAIQLSSGKKTALGREALEHPESLDSRKVGQRSRWSSQAASPISGPIQAETALLCPGDQ WTQEFHTKGGLKMRSMAHQRPLGLELQKSGSLSSDKTPHAVPRQSATAAAMYSSLDGSVG RRAFPLATVNLLHHEWVSPFKTTSTPVSRLRSIYLSGQALGKRCPMTPASGATASLGRLR ARPRSRWDAAYLPAVAAVCVARASHVPNGTLRFGVWARGVTSQAVARLHAEYRQGAGARA VVLPDAAAEDVLDLPQGFSRTVESQPYVGIPPPHTS >gi568815580f:518001_745098|GENSCAN_predicted_CDS_7|3171_bp atggataaattccttgacacatacactctcccaagactaaaccaggaagaagttgaatct 
ctgaatagaccaataacaggagctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaagaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaagccagcatcatcctgataccaaagctgggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaattcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcatgtgggcttcatccct gggatgcaaggctggttcaatatacacaaatcaataaatgtaatccagcatataaacaga tccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacacatcatgctaaaaactctcaataaattaggtattgatgggacatatttcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatcta gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatataaaatcaatctgcaaaaatcacaagcattcttatacaccaataacagacaa acagagagccaaatcatgagtgaacttccactcacaattgcttcaaagagaataaaatac ctaggaatccaccttaaaagggatgtgaaggacctcttcaaggagaactacaaaccactg ctcaatgaaataaaagaggatacaaagaaatggaagaccattccatgctcatgggtagga agaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata tggaaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagctgga gacatcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagtatgg tactggtaccaaaacagacatatagatcaatggaacagaacagagccctcagaaataaca ccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaag gattccctatttaataaatggtgctgggaaaactggctagccatatgtaaaaagctgaaa ctggatcccttccttacaccttatacaaaaatcaattcaagatggattaaagacttaaac gttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacata ggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaatt gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga gtgaacaggcaacccacaaaatgggagaaaatttttgcaacctactcatctgacaaagag ctaatatccagaatctacaatgaactcaaacaaatttataagaaaaaaacaaacaacccc atcaaaaagtgggcgaaggacatgaacagacacttctcaaaagaagacatttatgcagcc 
aaaaaacacatgaaaaaatgctcaccatcactggccatcagagaaatgcaaatcaaaacc acaatgagacaccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaac agtgcaggtgcctcaactcctgtatttgatcgagtcaccaatggagtaactccaactatc aaggatctcactggctgctgtgttgagaacagactactgacttctaatagcagcgacttc tttaccttgataaaccacagcaactcctccaaaacacccttccagaacacacgtttggtt gtcagcagagggaattcatctgaaaaacaatttgccatacggtttcaagatgggaagaca gatcatgccatccaactttcttcagggaagaaaactgccctgggacgtgaggccctggag catccagaaagcctagactcccgcaaagtgggccagagatcacggtggagctcccaggct gcaagtcccatcagtggcccgattcaggctgagacagcacttctgtgccccggtgaccag tggacacaggaattccacacaaaaggtgggttaaaaatgaggtctatggctcatcaaagg cccctgggcttggagctgcagaaaagtggatccctctcctcagacaaaactccccacgct gtcccacgccaatcagccacagcagctgctatgtattcgtctctagatgggtccgtgggc cggagggcttttccgctagctacagtgaatctgcttcaccacgaatgggtttccccattc aaaaccacctccacccctgtgtcaaggctgcgctccatttatctgagcggtcaagcactg ggcaaacgctgcccgatgacgcccgcctcgggggccacggcatcactggggcgactgcga gcccggccgcggagccgctgggacgcggcttacctcccggctgtcgctgctgtgtgtgtt gcccgcgccagtcacgtccctaatgggaccctccgtttcggcgtctgggcccgcggcgtc acctctcaggctgtagcgcgcctgcatgccgaataccgacagggtgccggtgcccgtgcg gtcgtccttcctgacgccgcagcggaggatgtgttggatctgccccagggctttagcagg actgtggaaagccaaccatatgttggcattcccccaccccacaccagctga >gi568815580f:518001_745098|GENSCAN_predicted_peptide_8|370_aa MIGTYWALGTFHTVSSADSCMVGVRAGEWEVHRSGDGDGDGDGDGDGDGDGDGDGDGDGD GDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGD GDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDDGDGDGDGDGDGDGDGDG DGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDGDDGDGDGDGDGDGDGDGDGDGDGD GDGDGDGDGDGDGDGDGDGDGDGDGDDLPLMALPPCHALCQFYVVNSELSCQLYQRSGDM GLGVPFNIASYALLTYMIAHITGLKLQREPRPFPKLRILRKVEKIDDFKAEDFQIEGYNP HPTIKMEMAV >gi568815580f:518001_745098|GENSCAN_predicted_CDS_8|1113_bp atgattggcacttactgggcgcttggcactttccatactgtgtcatcggcagatagctgc atggttggtgttcgtgctggggaatgggaagttcatcgttcgggagatggtgatggagat ggtgatggtgatggagatggtgatggtgatggagatggtgatggtgatggagatggagat ggtgatggagatggagatggtgatggtgatggagatggtgatggtgatggagatggagat 
ggtgatggagatggtgatggtgatggagatggagatggtgatggtgatggagatggtgat ggtgatggagatggagatggagatggtgatggtgatggtgatggagatggtgatggtgat ggtgatggtgatggagatggtgatggtgatggtgatggtgatggagatggtgatggtgat ggtgatggagatggtgatggtgatggagatggagatggtgatggtgatggtgatggagat ggtgatgatggtgatggtgatggtgatggagatggtgatggtgatggagatggtgatgga gatggtgatggtgatggagatggtgatggtgatggtgatggtgatggagatggtgatggt gatggtgatggtgatggtgatggagatggtgatggtgatggtgatggtgatgatggagat ggtgatggtgatggtgatggtgatggtgatggagatggtgatggtgatggtgatggtgat ggagatggtgatggtgatggtgatggagatggtgatggtgatggagatggtgatggtgat ggtgatggtgatggtgatgatcttcctctgatggcgctgcctccatgccatgccctctgc cagttctatgtggtgaacagtgagctgtcctgccagctgtaccagagatcgggagacatg ggcctcggtgtgcctttcaacatcgccagctacgccctgctcacgtacatgattgcgcac atcacgggcctgaagcttcagcgagaacccagacctttcccaaagctcaggattcttcga aaagttgagaaaattgatgacttcaaagctgaagactttcagattgaagggtacaatccg catccaactattaaaatggaaatggctgtttag >gi568815580f:518001_745098|GENSCAN_predicted_peptide_9|464_aa MVLFGFLTLALFRRGTLLLQHTDPDYSAAYVVIETDAEDGIKGCGITFTLGKGTEVVVCA VNALAHHVLNKDLKDIVGDFRGFYRQLTSDGQLRWIGPEKGVVHLATAAVLNAVWDLWAK QEGKVLAVGRELQEEEKEETGWRKAQAAVEGGVGTWWLTASIRAANAFTDPRMLVSCIDF RYITDVLTEEDALALCPGAEGWLDQVSVMMDLTFPVGGRRDSGKTVTSAETPDALAKSRF KVKVGADLQDDMRRCQIIRDMIGPEKTLMMDANQRWDVPEAVEWMSKLAKFKPLWIEEPT SPDDILGHATISKALVPLGIGIATGEQCHNRVIFKQLLQAKALQFLQIDSCRLGSVNENL SVLLMAKKFEIPVCPHAGGVGLCELVQHLIIFDYISVSASLENRVCEYVDHLHEHFKYPV MIQRASYMPPKDPGYSTEMKEESVKKHQYPDGEVWKKLLPAQEN >gi568815580f:518001_745098|GENSCAN_predicted_CDS_9|1395_bp atggttcttttcgggtttctcacactggcattgtttcggcggggaactcttctcctgcag cacacggaccctgactactcggctgcctatgtcgtcatagaaactgatgcagaagatgga atcaaggggtgtggaattaccttcactctgggaaaaggcactgaagttgttgtctgtgct gtgaatgccctcgcccaccatgtgctcaacaaggacctcaaggacattgttggtgacttc agaggcttctataggcagctcacaagtgatgggcagctcagatggattggtccagaaaag ggcgtggtgcacctggcgacagcggccgtcctaaacgcggtgtgggacttgtgggccaag caggagggaaaggttttggcagtaggtagggaactgcaggaggaggagaaagaggagaca ggatggcggaaggcgcaggcagcagtagaggggggtgtggggacctggtggctgacagcc 
agcattagagctgccaacgcgtttactgatcccaggatgctggtatcctgcatagatttc aggtacatcactgatgtcctgactgaggaggatgccctagctctgtgcccaggcgctgaa ggatggctggaccaggtgagtgtgatgatggacctgactttcccagttggcggcaggaga gactcaggcaagacggtcacttctgcagaaactccagatgcccttgccaagtccaggttt aaagtaaaggtgggtgctgatctccaggatgacatgcgaagatgccaaatcatccgagac atgattggaccggaaaagactttgatgatggatgccaaccagcgctgggatgtgcctgag gcggtggagtggatgtccaagctggccaagttcaagccattgtggattgaggagccaacc tcccctgatgacattctggggcacgccaccatttccaaggcactggtcccattaggaatt ggcattgccacaggagaacagtgccacaatagagtgatatttaagcaactcctacaggcg aaggccctgcagttcctccagattgacagttgcagactgggcagtgtcaatgagaacctc tcagtattgctgatggccaaaaagtttgaaattcctgtttgcccccatgctggtggagtt ggcctctgtgaactggtgcagcacctgattatatttgactacatatcagtttctgcaagc cttgaaaatagggtgtgtgagtatgttgaccacctgcatgagcatttcaagtatcccgtg atgatccagcgggcttcctacatgcctcccaaggatcccggctactcaacagaaatgaag gaggaatctgtaaagaaacaccagtatccagatggtgaagtttggaagaaactccttcct gctcaagaaaattaa >gi568815580f:518001_745098|GENSCAN_predicted_peptide_10|321_aa MRSGSLRHKGWHRSQNAERVFMGVCRETGKTLRPEPQRGKKGPVEEEGAVRKAGGHEGRE HTKKRQPTAENVSEAYFHTFTTYPYNSAFLARPISSVDYEGQRVQETQRGLELLQVTQQV VEAGSPLKTPSPGLPVLPQRQPRRVQGTEEASERCALGFPLAGTQQNHFAAIAYYAFYHF QQNVSRKKAMGSKSGSCPGPAELQSAGREGPGPQAARTMASALTMASALTMASALTMASA LTMASALTMASALTMASALTMASALTMASAPWPPSDVGKRTSRTESREILPRTMAPAPRG RGPRAVTDSLLCVADLQEGLG >gi568815580f:518001_745098|GENSCAN_predicted_CDS_10|966_bp atgagatctgggagtctccggcataaagggtggcaccgaagtcagaatgcggagagggtc ttcatgggagtgtgtagggagaccggaaagacactgaggcctgagcctcagagagggaag aaggggccagtggaggaagaaggagctgtcagaaaggcaggaggccacgaggggagagag cacacgaagaagaggcagccgacagcggaaaacgtatctgaggcatactttcacacattc actacatacccatacaactctgcttttctagcgcggccaatctcatcagtggactacgag gggcaacgggtccaagagacccagagaggtctggaacttctccaagtcacacagcaagtt gtggaagcgggatcaccactcaaaactcccagccccgggcttcccgttctccctcagcgc caaccccgtcgcgttcagggtacagaggaagccagcgagcgctgtgctctagggtttcca ctggcgggaacccaacaaaatcattttgctgctattgcctattatgcattctaccatttc cagcaaaatgtgagtcgtaaaaaagcaatgggttcgaagtcgggaagctgcccggggccc 
gcagagctgcagtcggcggggcgggagggacccgggccgcaagccgcgcgtaccatggcg tccgcgcttaccatggcgtccgcgcttaccatggcgtccgcgcttaccatggcgtccgcg cttaccatggcgtccgcgcttaccatggcgtccgcgcttaccatggcgtccgcgcttacc atggcgtccgcgcttaccatggcgtccgcgccgtggcccccaagcgacgtggggaagcgc acgtcccggaccgagagccgggagatcctgccgcgcaccatggcccctgcgccccgtggc cgcggcccccgtgcggtcactgactcactgttatgtgtagctgatttgcaggaaggtttg gggtag >gi568815580f:518001_745098|GENSCAN_predicted_peptide_11|302_aa XHADGLCHKLTTVCPTVKPQTQGLAKDAWEIPRESLRLEVKLGQGCFGEVWMGTWNGTTK VAIKTLKPGTMMPEAFLQEAQIMKKLRHDKLVPLYAVVSEEPIYIVTEFMSKGSLLDFLK EGDGKYLKLPQLVDMAAQIADGMAYIERMNYIHRDLRAANILVGENLVCKIADFGLARLI EDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVWSFGILQTELVTKGRVPYPGMVNREV LEQVERGYRMPCPQGCPESLHELMNLCWKKDPDERPTFEYIQSFLEDYFTATEPQYQPGE NL >gi568815580f:518001_745098|GENSCAN_predicted_CDS_11|909_bp naacatgctgatggtttatgccacaagttgacaactgtgtgtccaactgtgaaacctcag actcaaggtctagcaaaagatgcttgggaaatccctcgagaatctttgcgactagaggtt aaactaggacaaggatgtttcggcgaagtgtggatgggaacatggaatggaaccacgaaa gtagcaatcaaaacactaaaaccaggtacaatgatgccagaagctttccttcaagaagct cagataatgaaaaaattaagacatgataaacttgttccactatatgctgttgtttctgaa gaaccaatttacattgtcactgaatttatgtcaaaaggaagcttattagatttccttaag gaaggagatggaaagtatttgaagcttccacagctggttgatatggctgctcagattgct gatggtatggcatatattgaaagaatgaactatattcaccgagatcttcgggctgctaat attcttgtaggagaaaatcttgtgtgcaaaatagcagactttggtttagcaaggttaatt gaagacaatgaatacacagcaagacaaggtgcaaaatttccaatcaaatggacagctcct gaagctgcactgtatggtcggtttacaataaagtctgatgtctggtcatttggaattctg caaacagaactagtaacaaagggccgagtgccatatccaggtatggtgaaccgtgaagta ttagaacaagtggagcgaggatacaggatgccgtgccctcagggctgtccagaatccctc catgaattgatgaatctgtgttggaagaaggaccctgatgaaagaccaacatttgaatat attcagtccttcttggaagactacttcactgctacagagccacagtaccagccaggagaa aatttataa