GENSCAN 1.0    Date run: 5-Nov-116    Time: 20:03:27

Sequence gi568815584r:23017490_23219091 : 201602 bp : 46.95% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    367    362    6                                 1.05
 1.03 Term -   8886   8600  287  1  2  100   48   446 0.967  37.37
 1.02 Intr -  16185  15879  307  0  1   69  105   291 0.945  24.92
 1.01 Init -  17392  17195  198  1  0   94   70    71 0.500   4.88
 1.00 Prom -  18421  18382   40                                -6.56

 2.00 Prom +  22335  22374   40                                -7.96
 2.01 Sngl +  24602  25639 1038  1  0   78   54  1075 0.453  98.23
 2.02 PlyA +  29585  29590    6                                 1.05

 3.12 PlyA -  29603  29598    6                                 1.05
 3.11 Term -  30990  30491  500  1  2   80   53  1069 0.999  97.29
 3.10 Intr -  31786  31538  249  2  0  112   97   160 0.531  16.71
 3.09 Intr -  32249  32138  112  2  1   88   92   153 0.976  15.65
 3.08 Intr -  32454  32333  122  1  2   73   99   162 0.991  16.01
 3.07 Intr -  35120  34984  137  1  2  110   85   100 0.999  12.11
 3.06 Intr -  36260  36007  254  2  2  111   96   338 0.999  33.13
 3.05 Intr -  36839  36652  188  0  2   63   77   264 0.999  22.21
 3.04 Intr -  37184  37017  168  0  0   64   61   286 0.994  23.42
 3.03 Intr -  37377  37258  120  1  0   91   81   158 0.809  15.87
 3.02 Intr -  37864  37570  295  1  1   92   97   413 0.996  39.18
 3.01 Init -  38244  38044  201  0  0   76  116   228 0.977  21.28
 3.00 Prom -  41053  41014   40                             -12.96

 4.20 PlyA -  41099  41094    6                                 1.05
 4.19 Term -  41985  41659  327  0  0   84   37   355 0.976  24.71
 4.18 Intr -  43695  43595  101  1  2   79  109   108 0.999  11.83
 4.17 Intr -  44133  43809  325  0  1  137  100   367 0.995  38.15
 4.16 Intr -  44786  44679  108  2  0   92   94    79 0.998   9.38
 4.15 Intr -  45034  44927  108  1  0   80   89    16 0.695   1.38
 4.14 Intr -  45585  45440  146  1  2   55  111   117 0.988  10.70
 4.13 Intr -  46088  45947  142  2  1   63  113   134 0.999  13.33
 4.12 Intr -  46768  46616  153  1  0   45  110   215 0.962  19.67
 4.11 Intr -  46999  46866  134  2  2   34  100    75 0.980   3.66
 4.10 Intr -  50476  50341  136  1  1   35   89    96 0.711   4.54
 4.09 Intr -  52128  51987  142  2  1   82   82   211 0.948  20.26
 4.08 Intr -  60777  60662  116  0  2   90   75    51 0.710   3.25
 4.07 Intr -  61549  61331  219  1  0   69   65    71 0.455   1.40
 4.06 Intr -  63320  62058 1263  2  0   70   87   772 0.970  62.60
 4.05 Intr -  64347  64259   89  1  2   91   93    86 0.910   9.09
 4.04 Intr -  72612  72493  120  1  0   87   62    73 0.969   5.07
 4.03 Intr -  73144  73033  112  1  1   95  121    79 0.992  11.85
 4.02 Intr -  76055  75990   66  2  0  100   94    16 0.824   2.50
 4.01 Init -  77623  77486  138  1  0  100   60   245 0.489  23.04
 4.00 Prom -  79662  79623   40                                -6.36

 5.00 Prom +  80136  80175   40                                -9.75
 5.01 Sngl +  80284  80592  309  0  0   64   44   235 0.448  12.60
 5.02 PlyA +  80974  80979    6                                 1.05

 6.10 PlyA -  81594  81589    6                                 1.05
 6.09 Term -  82687  82678   10  0  1   93   37     4 0.198  -6.53
 6.08 Intr -  83590  83361  230  1  2  101   98    97 0.513   8.67
 6.07 Intr -  84075  83803  273  0  0   66   94    96 0.720   5.73
 6.06 Intr -  84276  84180   97  2  1   50   84   130 0.647   8.81
 6.05 Intr -  84422  84353   70  0  1   69   89    59 0.400   2.34
 6.04 Intr -  84777  84603  175  0  1   78   72   144 0.974  11.31
 6.03 Intr -  85562  85437  126  2  0   78   17   117 0.946   4.48
 6.02 Intr -  86329  86186  144  1  0   83   57    65 0.880   3.38
 6.01 Init -  87431  87339   93  2  0   64   39   252 0.924  16.29
 6.00 Prom -  94383  94344   40                                -5.46

 7.03 PlyA -  97522  97517    6                                 1.05
 7.02 Term - 100333  99998  336  1  0  123   54   494 0.656  44.07
 7.01 Init - 101602 101093  510  1  0   80  116   562 0.999  53.13
 7.00 Prom - 103840 103801   40                                -7.36

 8.10 PlyA - 107829 107824    6                                 1.05
 8.09 Term - 109854 109688  167  1  2   87   55   212 0.999  15.88
 8.08 Intr - 110707 110530  178  1  1   90   97   137 0.942  14.29
 8.07 Intr - 112310 112161  150  2  0   79  100   239 0.914  24.56
 8.06 Intr - 114068 113972   97  1  1  110   70    84 0.982   8.81
 8.05 Intr - 120535 120432  104  1  2  109  109    60 0.997   9.17
 8.04 Intr - 122058 121935  124  2  1  133   83   142 0.990  18.79
 8.03 Intr - 123135 122982  154  1  1  104   89   202 0.999  21.03
 8.02 Intr - 125715 125590  126  1  0   65  113    44 0.953   5.25
 8.01 Init - 125802 125742   61  0  1   78   66    56 0.813   3.81
 8.00 Prom - 127114 127075   40                                -5.16

 9.09 PlyA - 127306 127301    6                                 1.05
 9.08 Term - 136369 136227  143  2  2   68   42   129 0.303   4.39
 9.07 Intr - 136842 136728  115  0  1   72   67    43 0.184   0.62
 9.06 Intr - 142251 142207   45  0  0   93   80    70 0.250   5.31
 9.05 Intr - 145924 145833   92  2  2   41   89    12 0.204  -3.69
 9.04 Intr - 147947 147796  152  1  2  102   92   240 0.999  25.61
 9.03 Intr - 149051 148847  205  0  1   82   86   328 0.986  30.26
 9.02 Intr - 164705 164661   45  0  0   65  123    14 0.663   0.98
 9.01 Init - 165425 165275  151  2  1   60   95   161 0.996  14.20
 9.00 Prom - 167345 167306   40                                -3.26

10.03 PlyA - 168845 168840    6                                 1.05
10.02 Term - 182193 182140   54  0  0  115   49    38 0.078   0.16
10.01 Intr - 189968 189911   58  1  1   74   93    41 0.006   2.09

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 103570 103407  164  2  2  106   44   133 0.849   8.80
S.002 Term - 164202 164102  101  1  2   68   53    88 0.892   1.59
S.003 Init - 185841 185781   61  0  1   54   87    88 0.808   6.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_1|263_aa
MALASVLERPLPVNQRGFFGLGGRADLLDLGPGSLSDGLSLAAPGWGVPEEPGIEMLHGT
TTLAFKFRHGVIVAADSRATAGAYIASQTVKKVIEINPYLLGTMAGGAADCSFWERLLAR
QCRIYELRNKERISVAAASKLLANMVYQYKGMGLSMGTMICGWDKRGPGLYYVDSEGNRI
SGATFSVGSGSVYAYGVMDRGYSYDLEVEQAYDLARRAIYQATYRDAYSGGAVNLYHVRE
DGWIRVSSDNVADLHEKYSGSTP

>gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_1|792_bp
atggcgcttgccagcgtgttggagagaccgctaccggtgaaccagcgcgggtttttcgga
cttgggggtcgtgcagatctgctggatctaggtccagggagtctcagtgatggtctgagc
ctggccgcgccaggctggggtgtcccagaagagccaggaatcgaaatgcttcatggaaca
accaccctggccttcaagttccgccatggagtcatagttgcagctgactccagggctaca
gcgggtgcttacattgcctcccagacggtgaagaaggtgatagagatcaacccatacctg
ctaggcaccatggctgggggcgcagcggattgcagcttctgggaacggctgttggctcgg
caatgtcgaatctatgagcttcgaaataaggaacgcatctctgtagcagctgcctccaaa
ctgcttgccaacatggtgtatcagtacaaaggcatggggctgtccatgggcaccatgatc
tgtggctgggataagagaggccctggcctctactacgtggacagtgaagggaaccggatt
tcaggggccaccttctctgtaggttctggctctgtgtatgcatatggggtcatggatcgg
ggctattcctatgacctggaagtggagcaggcctatgatctggcccgtcgagccatctac
caagccacctacagagatgcctactcaggaggtgcagtcaacctctaccacgtgcgggag
gatggctggatccgagtctccagtgacaatgtggctgatctacatgagaagtatagtggc tctaccccctga >gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_2|345_aa MKRQLTHLPGRFWLWPSFSVASLLSHQTPATNSWLASSKLHSAPGMALQDVCKWQSPDTQ GPSPHLPRAGGWAVPRGCDPQTFLQIHGPRLAHGTTTLAFRFRHGVIAAADTRSSCGSYV ACPASCKVIPVHQHLLGTTSGTSADCATWYRVLQRELRLRELREGQLPSVASAAKLLSAM MSQYRGLDLCVATALCGWDRSGPELFYVYSDGTRLQGDIFSVGSGSPYAYGVLDRGYRYD MSTQEAYALARCAVAHATHRDAYSGGSVDLFHVRESGWEHVSRSDACVLYVELQKLLEPE PEEDASHAHPEPATAHRAAEDRELSVGPGEVTPGDSRMPAGTETV >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_2|1038_bp atgaagcgtcagctcacacaccttcctggccggttctggctgtggcccagcttctctgta gcgtccctcctatcccaccagaccccagccacaaattcctggcttgcttcttccaaactt cattcagccccagggatggctctgcaggatgtgtgcaagtggcagtcccctgacacccag ggaccatcacctcacctgcctcgggctggcggctgggctgtgccccggggttgtgaccct caaaccttcctgcagatccatggccccagactggcccacggcaccaccactctggccttc cgcttccgtcatggagtcattgctgcagctgacacgcgttcctcctgtggcagctatgtg gcgtgtccagcctcatgcaaggtcatccctgtgcaccagcacctcctgggtaccacctct ggcacctctgccgactgtgctacctggtatcgggtattacagcgggagctgcggcttcgg gaactgagggagggtcagctgcccagtgtggccagtgctgccaagctcttgtcagccatg atgtctcaataccggggactggatctctgtgtggccactgccctctgcggctgggaccgc tctggccctgagctcttctacgtctatagcgacggcacccgcctgcagggggacatcttc tctgtgggctctggatctccctatgcctacggcgtgctagaccgtggctatcgctacgac atgagcacccaggaagcctacgccctggctcgctgcgccgtggcccacgccacccaccgt gatgcctattcagggggctctgtagaccttttccacgtgcgggagagtggatgggagcat gtgtcacgcagtgatgcctgtgtgctgtacgtggagttacagaagctcctggagccggag ccagaggaggatgccagccatgcccatcctgagcctgccactgcccacagagctgcagaa gatagagagctctctgtggggccaggggaggtgacaccaggagactccaggatgccagca gggactgagacggtgtga >gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_3|781_aa MWGLVRLLLAWLGGWGCMGRLAAPARAWAGSREHPGPALLRTRRSWVWNQFFVIEEYAGP EPVLIGKLHSDVDRGEGRTKYLLTGEGAGTVFVIDEATGNIHVTKSLDREEKAQYVLLAQ AVDRASNRPLEPPSEFIIKVQDINDNPPIFPLGPYHATVPEMSNVGTSVIQVTAHDADDP SYGNSAKLVYTVLDGLPFFSVDPQTGVVRTAIPNMDRETQEEFLVVIQAKDMGGHMGGLS GSTTVTVTLSDVNDNPPKFPQSLYQFSVVETAGPGTLVGRLRAQDPDLGDNALMAYSILD 
GEGSEAFSISTDLQGRDGLLTVRKPLDFESQRSYSFRVEATNTLIDPAYLRRGPFKDVAS VRVAVQDAPEPPAFTQAAYHLTVPENKAPGTLVGQISAADLDSPASPIRYSILPHSDPER CFSIQPEEGTIHTAAPLDREARAWHNLTVLATELDSSAQASRVQVAIQTLDENDNAPQLA EPYDTFVCDSAAPGQLIQVIRALDRDEVGNSSHVSFQGPLGPDANFTVQDNRDGSASLLL PSRPAPPRHAPYLVPIELWDWGQPALSSTATVTVSVCRCQPDGSVASCWPEAHLSAAGLS TGALLAIITCVGALLALVVLFVALRRQKQEALMVLEEEDVRENIITYDDEGGGEEDTEAF DITALQNPDGAAPPAPGPPARRDVLPRARVSRQPRPPGPADVAQLLALRLREADEDPGVP PYDSVQVYGYEGRGSSCGSLSSLGSGSEAGGAPGPAEPLDDWGPLFRTLAELYGAKEPPA P >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_3|2346_bp atgtggggcctggtgaggctcctgctggcctggctgggtggctggggctgcatggggcgt ctggcagccccagcccgggcctgggcagggtcccgggaacacccagggcctgctctgctg cggactcgaaggagctgggtctggaaccagttctttgtcattgaggaatatgctggtcca gagcctgttctcattggcaagctgcactcggatgttgaccggggagagggccgcaccaag tacctgttgaccggggagggggcaggcaccgtatttgtgattgatgaggccacaggcaat attcatgttaccaagagccttgaccgggaggaaaaggcgcaatatgtgctactggcccaa gccgtggaccgagcctccaaccggcccctggagcccccatcagagttcatcatcaaagtg caagacatcaacgacaatccacccatttttccccttgggccctaccatgccaccgtgccc gagatgtccaatgtcgggacatcagtgatccaggtgactgctcacgatgctgatgacccc agctatgggaacagtgccaagctggtgtacactgttctggatggactgcctttcttctct gtggacccccagactggagtggtgcgtacagccatccccaacatggaccgggagacacag gaggagttcttggtggtgatccaggccaaggacatgggcggccacatgggggggctgtca ggcagcactacggtgactgtcacgctcagcgatgtcaacgacaacccccccaagttccca cagagcctataccagttctccgtggtggagacagctggacctggcacactggtgggccgg ctccgggcccaggacccagacctgggggacaacgccctgatggcatacagcatcctggat ggggaggggtctgaggccttcagcatcagcacagacttgcagggtcgagacgggctcctc actgtccgcaagcccctagactttgagagccagcgctcctactccttccgtgtcgaggcc accaacacgctcattgacccagcctatctgcggcgagggcccttcaaggatgtggcctct gtgcgtgtggcagtgcaagatgccccagagccacctgccttcacccaggctgcctaccac ctgacagtgcctgagaacaaggccccggggaccctggtaggccagatctccgcggctgac ctggactcccctgccagcccaatcagatactccatcctcccccactcagatccggagcgt tgcttctctatccagcccgaggaaggcaccatccatacagcagcacccctggatcgcgag gctcgcgcctggcacaacctcactgtgctggctacagagctcgacagttctgcacaggcc 
tcgcgcgtgcaagtggccatccagaccctggatgagaatgacaatgctccccagctggct gagccctacgatacttttgtgtgtgactctgcagctcctggccagctgattcaggtcatc cgggccctggacagagatgaagttggcaacagtagccatgtctcctttcaaggtcctctg ggccctgatgccaactttactgtccaggacaaccgagatggctccgccagcctgctgctg ccctcccgccctgctccaccccgccatgccccctacttggttcccatagaactgtgggac tgggggcagccggcgctgagcagcactgccacagtgactgttagtgtgtgccgctgccag cctgacggctctgtggcatcctgctggcctgaggctcacctctcagctgctgggctcagc accggcgccctgcttgccatcatcacctgtgtgggtgccctgcttgccctggtggtgctc ttcgtggccctgcggcggcagaagcaagaagcactgatggtactggaggaggaggacgtc cgagagaacatcatcacctacgacgacgagggcggcggcgaggaggacaccgaggccttc gacatcacggccttgcagaacccggacggggcggcccccccggcgcccggccctcccgcg cgccgagacgtgttgccccgggcccgggtgtcgcgccagcccagaccccccggccccgcc gacgtggcgcagctcctggcgctgcggctccgcgaggcggacgaggaccccggcgtaccc ccgtacgactcggtgcaggtgtacggctacgagggccgcggctcctcttgcggctccctc agctccctgggctccggcagcgaagccggcggcgcccccggccccgcggagccgctggac gactggggtccgctcttccgcaccctggccgagctgtatggggccaaggagcccccggcc ccctga >gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_4|1314_aa MAELEEVTLDGKPLQALRVTDLKAALEQRGLAKSGQKSALVKRLKGALMLENLQKHSTPH AAFQPNSQIGEEMSQNSFIKQYLEKQQELLRQRLEREAREAAELEEASAESEDEMIHPEG VASLLPPDFQSSLERPELELSRHSPRKSSSISEEKGDSDDEKPRKGERRSSRVRQARAAK LSEGSQPAEEEEDQETPSRNLRVRADRNLKTEEEEEEEEEEEEDDEEEEGDDEGQKSREA PILKEFKEEGEEIPRVKPEEMMDERPKTRSQEQEVLERGGRFTRSQEEARKSHLARQQQE KEMKTTSPLEEEEREIKSSQGLKEKSKSPSPPRLTEDRKKASLVALPEQTASEEETPPPL LTKEASSPPPHPQLHSEEEIEPMEGPAPAVLIQLSPPNTDADTRELLVSQHTVQLVGGLS PLSSPSDTKAESPAEKVPEESVLPLVQKSTLADYSAQKDLEPESDRSAQPLPLKIEELAL AKGITEECLKQPSLEQKEGRRASHTLLPSHRLKQSADSSSSRSSSSSSSSSRSRSRSPDS SGSRSHSPLRSKQRDVAQARTHANPRGRPKMGSRSTSESRSRSRSRSRSASSNSRKSLSP GVSRDSSTSYTETKDPSSGQEVATPPVPQLQVCEPKERTSTSSSSVQARRLSQPESAEKH VTQRLQPERGSPKKCEAEEAEPPAATQPQTSETQTSHLPESERIHHTVEEKEEVTMDTSE NRPENDVPEPPMPIADQVSNDDRPEGSVEDEEKKELESLRRCQPQLSEEKYSDLAAECLP GPGVFTYPQATFIRGSMIPLAATKGVPAGNSDTEGGQPGRKRRWGASTATTQKKPSISIT TESLKSLIPDIKPLAGQEAVVDLHADDSRISEDETERNGDDGTHDKGLKICRTVTQVVPA 
EGQENGQREEEEEEKEPEAEPPVPPQVSVEVALPPPAEHEVKKVTLGDTLTRRSISQQKS GVSITIDDPVRTAQVPSPPRGKISNIVHISNLVRPFTLGQLKELLGRTGTLVEEAFWIDK IKSHCFVTYSTVEEAVATRTALHGVKWPQSNPKFLCADYAEQDELDYHRGLLVDRPSETK TEEQGIPRPLHPPPPPPVQPPQHPRAEQREQERAVREQWAEREREMERRERTRSEREWDR DKVREGPRSRSRSRDRRRKERAKSKEKKSEKKEKAQEEPPAKLLDDLFRKTKAAPCIYWL PLTDSQIVQKEAERAERAKEREKRRKEQEEEEQKEREKEAERERNRQLEREKRREHSRER DRERERERERDRGDRDRDRERDRERGRERDRRDTKRHSRSRSRSTPVRDRGGRR >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_4|3945_bp atggcggagctggaggaggtgactctggacgggaagcctcttcaggcgctgcgggtgacc gacctgaaggccgcactggagcagcgaggcctagccaagagcgggcagaagagtgccctg gtcaagcggctcaaaggggctctaatgctagaaaatttacagaaacactcaacaccccat gctgcattccagccaaattcccagattggtgaggaaatgagccagaacagtttcataaaa cagtatctggaaaagcagcaggagctacttaggcagcgtctggaacgtgaagctcgagaa gctgcagaacttgaagaagcttcagctgagtcggaggacgagatgatccatcctgaggga gtggcttccctgctgcctcctgactttcagagcagcctggagagaccagagctggagctc agcagacattcgcccagaaaaagctcctcaatttctgaagagaaaggtgactctgatgat gagaaaccaaggaaaggagaaagacgatcatctagggtcagacaggcaagagcagctaaa ctgtctgagggcagccaacctgctgaggaggaagaggatcaagaaacaccttccagaaac ctaagggtcagagcagatcgaaatttgaaaacagaggaggaagaagaggaggaggaggag gaggaagaagatgatgaagaagaggaaggtgatgatgagggacaaaaatctagggaggca ccaatcctgaaagagtttaaggaagaaggggaagagatacctagagtaaaaccagaggag atgatggatgagagacccaaaacaagatcccaggaacaggaggtgttagagagaggaggg agatttacaagatcccaggaagaggctagaaaaagtcatctggccagacagcagcaggag aaggaaatgaaaacaacatctccccttgaggaggaagaaagagaaataaaatcttcacaa ggcttaaaggaaaaatcgaagtctccttcccctcctcgactgactgaagatcgaaagaag gcctcacttgtagcgctgccagagcaaactgccagcgaggaggagactcctccaccttta ctaacaaaggaagcatcttctccaccacctcatccacagctccatagcgaagaagaaata gagcccatggaaggcccagcccccgctgtcctcattcagttatctcctcctaatacagat gctgacaccagggagctattagtatctcagcatactgtccagttggtaggaggcctgtct cctttgtcaagtccttcagacaccaaagcagaatctccagcagagaaagtgccagaggag agtgtcctgcctctggttcagaaaagcacactggctgactactcagcccagaaggatctt gaacctgagtcagacagatctgctcagcccctccctctaaaaattgaggaattagcactg 
gccaaaggaatcactgaagaatgtctgaaacagccatctttggaacagaaggaaggcaga agagcttctcatacccttctcccaagccacagattgaaacagtcagctgattcatcctct agccggtcctcctcatcttcctcctccagttctagatcaagatctcgctctcctgacagt tcaggttctcggtctcattcaccgctcagatccaagcagagagatgtagcccaggcacgt actcatgccaaccctcgtggtagacccaagatgggctccagatcaacatcagagtccaga tcaaggtcacgttcacgttctcgttcagcatcaagcaacagcagaaaatctctgagccct ggagtctccagggacagcagcaccagctatactgaaaccaaagatccctcttctggtcag gaggttgcaactccaccagtgccacaactgcaggtctgtgagccaaaggagaggacttcc acctcctcatcctctgtccaagcaaggcgtctgagtcagcctgaatcagctgaaaagcat gtgacccagaggttacagcctgagcgggggagcccaaagaagtgtgaagctgaagaggca gagccaccagctgccacacagccccaaacctcagagactcagacctctcatctgccagaa tcagaaagaattcatcacactgttgaggagaaggaggaagtgaccatggacacaagtgaa aacagacctgaaaatgatgttccagaacctcccatgcctattgcagaccaagtcagcaat gatgaccgcccggagggcagtgttgaagatgaggagaagaaagagctggagtccctgagg agatgccagccacaactctccgaagagaagtattctgaccttgctgctgaatgtctgcca ggtcctggggtattcacctacccccaggctaccttcatcagaggctccatgatcccactc gcagctaccaagggggtgccagctggaaacagtgacacagaggggggccagcctggtcgg aaacgacgctggggagccagcacagccaccacacagaagaaaccttccatcagtatcacc actgaatcactaaagagcctcatccccgacatcaaacccctggcggggcaggaggctgtt gtggatcttcatgctgatgactctcgcatctctgaggatgagacagagcgtaatggcgat gatgggacccatgacaaggggctgaaaatatgccggacagtcactcaggtagtacctgca gagggccaggagaatgggcagagggaagaagaggaagaagagaaggaacctgaagcagaa cctcctgtacctccccaggtgtcagtagaggtggccttgcccccacctgcagagcatgaa gtaaagaaagtgactttaggagataccttaactcgacgttccattagccagcagaagtcc ggagtttccattaccattgatgacccagtccgaactgcccaggtgccctccccaccccgg ggcaagattagcaacattgtccatatctccaatttggtccgtcctttcactttaggccag ctaaaggagttgttggggcgcacaggaaccttggtggaagaggccttctggattgacaag atcaaatctcattgctttgtaacgtactcaacagtagaggaagctgttgccacccgcaca gctctgcacggggtcaaatggccccagtccaatcccaaattcctttgtgctgactatgcc gagcaagatgagctggattatcaccgaggcctcttggtggaccgtccctctgaaactaag acagaggagcagggaataccacggcccctgcaccccccacccccacccccggtccagcca ccacagcacccccgggcagagcagcgggagcaggaacgggcagtgcgggaacagtgggca 
gaacgggaacgggaaatggagcggcgggagcggactcgatcagagcgtgaatgggatcgg gacaaagttcgagaagggccccgttcccgatcaaggtcccgtgaccgccgccgcaaggaa cgtgcgaagtctaaagaaaagaagagtgagaagaaagagaaagcccaggaggaaccacct gccaagctgctggatgaccttttccgaaagaccaaggcagctccctgcatctattggctc ccactgactgacagccagatcgttcagaaagaggcagagcgggccgaacgggccaaggag cgggagaagcggcgaaaggagcaagaagaagaagagcaaaaggagcgggagaaggaagcc gagcgggaacggaaccgacagctggagcgagagaaacgtcgggagcacagtcgggagagg gacagggagagagagagagaaagggagcgggacaggggggaccgagatcgggatagggaa agggaccgagaacgaggcagggaaagggatcgcagggacaccaagcgccacagcagaagc cggagtcggagcacacctgtgcgggaccggggtgggcgccgctag >gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_5|102_aa MKCILHWFANWSGPQRERFLEDLVAKAVPEKLQPLLDSLEQLSVSGADRPPSIFECQLHL WDQWFRGWAEQERNEFVRQLEFSEPDFVAKFYQAVAATAGKD >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_5|309_bp atgaagtgtattcttcactggtttgccaattggtcaggtccccagcgtgaacgtttccta gaggacctggtagctaaggcagtgccagaaaaattacaaccactgctggatagtctggag cagcttagtgtgtctggggcagaccgaccaccttctatctttgagtgccagctacatctt tgggatcagtggtttcgaggctgggctgagcaggagcgcaatgaatttgtcagacagctg gagttcagtgagccagacttcgtggcaaagttttaccaagcagtggctgctacagctggt aaggactga >gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_6|405_aa MLLLLLLLLLLPPLVLRVAASRCLHDETQKSIPDTHLRGYALWPEQGPPQLVQPDGPGVQ NTDFLLYVRVAHTSKCHQEPSVIAYAACCQLDSEDRPLAGTIVYCAQHLTSPSLSHSDIV MEGLLSSHWEARLLQGSLMTATFDGAQRTRLDPITLAAFKDSGWYQVNHSAAEELLWGQG SGPEFGLVTTCGTGSSDFFCTGSGLGCHYLHLDKGSCSSDPMLEGCRMYKPLANGSECWK KENGFPAGVDNPHGEIYHPQSRCFFANLTSQLLPGDKPRHPSLTPHLKEAELMGRCYLHQ CTGRGAYKVQVEGSPWVPCLPGKVIQIPGYYGLLFCPRGRLCQTNEDINAVTSPPVSLST PDPLFQLSLELAGPPGHSLGKEQQEGLAEAVLEALASKGGTGRLK >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_6|1218_bp atgctgctgctgctgctgctgctgctgctgctgccgccactagtcctcagggttgctgca agccgatgtctacatgatgagacacagaagtctattcctgacacccatcttcgcggttat gccttgtggccggagcagggtcccccacaactggtccagccagatgggcctggggtccaa aacactgattttctcctgtatgtgcgagttgctcacacttccaagtgccaccaagagccc tctgtcatagcctatgctgcctgctgccagctggactcagaagacaggcccctcgctggt 
accattgtctactgtgcccaacatctcaccagccccagcctcagccacagtgacatcgtc atggagggccttctgtcctcgcactgggaggccagactactccagggttctttaatgact gctacctttgatggagcccagcgcactcgactcgacccaatcaccctcgctgccttcaaa gactcaggctggtaccaggtcaaccacagcgctgcagaggagctgttgtggggccaggga tctggcccagaatttggcttggtgaccacatgtgggactggctcctcagacttcttctgt actggcagtgggctgggctgccactacctgcacctggacaagggaagctgctcctcagac cccatgctggaaggctgccgcatgtacaagcccttagccaatgggagtgaatgctggaag aaggaaaacggattccctgctggggtggataatccccatggggagatctaccatccccag agccgttgcttctttgccaacctcacttcacagctgctccctggggataagcccaggcat ccttctcttaccccacacctcaaggaagcagagctcatgggccgctgctacttacatcaa tgcacagggaggggagcttacaaggtgcaggtggagggctcgccttgggtcccatgcctt cctggaaaggttatacagatacctgggtactatggtcttctcttctgtccccggggtcgg ctgtgtcagactaatgaagatatcaatgctgttacttccccacctgtgagtctttcaacc ccagatccactattccagctctctttagaattagctgggcctccaggacactctctgggg aaggaacagcaagaagggctagctgaagcagtactggaggctttggcgagcaaaggcggc actggcaggctcaagtga >gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_7|281_aa MSHGTYYECEPRGGQQPLEFSGGRAGPGELGDMCEHEASIDLSAYIESGEEQLLSDLFAV KPAPEARGLKGPGTPAFPHYLPPDPRPFAYPPHTFGPDRKALGPGIYSSPGSYDPRAVAV KEEPRGPEGSRAASRGSYNPLQYQVAHCGQTAMHLPPTLAAPGQPLRVLKAPLATAAPPC SPLLKAPSPAGPLHKGKKAVNKDSLEYRLRRERNNIAVRKSRDKAKRRILETQQKVLEYM AENERLRSRVEQLTQELDTLRNLFRQIPEAANLIKGVGGCS >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_7|846_bp atgtcccacgggacctactacgagtgtgagccccggggtggccagcagccactcgagttc tcagggggccgagctgggcccggggagctaggggacatgtgtgagcatgaggcctccatt gacctctccgcctacatcgagtctggggaagagcagcttctctccgatctctttgccgtg aagccagcgcctgaggccagaggcctcaagggccccggaacccctgccttcccccactac ttgccgcctgaccctcggccctttgcctaccctccacataccttcggcccagacaggaag gcgctggggcctggcatctacagcagcccagggagctacgaccccagggctgtggcggtg aaggaggagccccgggggccagagggcagccgagctgccagccgaggcagctacaatccc ctgcagtaccaagtggcacactgtgggcagacagccatgcacctgcccccaactctggca gcacccggccagcctctgcgcgttctcaaggcccctttggccactgccgcacccccctgc agtcccctcctgaaggcgccctccccggctggccccttacacaagggcaagaaggcagtg 
aacaaagatagccttgagtaccggctgaggcgggagcgcaacaacatcgccgtgcgcaag agccgagacaaggccaagaggcgcattctggagacgcagcagaaggtgctggagtacatg gcagagaacgagcgcctccgcagccgcgtggagcagctcacccaggagctagacaccctc cgcaacctcttccgccagattcctgaggcggccaacctcatcaagggcgtggggggttgc agctga >gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_8|386_aa MREQFCLCALGIMQPDTAPEVLLTWVNCSSVRWATRVQDIFTAGKLLALALIIIMGIVQI CKGEYFWLEPKNAFENFQEPDIGLVALAFLQGSFAYGGWNFLNYVTEELVDPYKNLPRAI FISIPLVTFVYVFANVAYVTAMSPQELLASNAVAVTFGEKLLGVMAWIMPISVALSTFGG VNGSLFTSSRLFFAGAREGHLPSVLAMIHVKRCTPIPALLFTCISTLLMLVTSDMYTLIN YVGFINYLFYGVTVAGQIVLRWKKPDIPRPIKINLLFPIIYLLFWAFLLVFSLWSEPVVC GIGLAIMLTGVPVYFLGVYWQHKPKCFSDFIELLTLVSQKMCVVVYPEVERGSGTEEANE DMEEQQQPMYQPTPTKDKDVAGQPQP >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_8|1161_bp atgagggagcagttctgcttgtgtgccttgggcatcatgcagccagatacagcccctgaa gtgctcctcacatgggtcaactgttccagtgtgcggtgggccacccgggttcaagacatc ttcacagctgggaagctcctggccttggccctgattatcatcatggggattgtacagata tgcaaaggagagtacttctggctggagccaaagaatgcatttgagaatttccaggaacct gacatcggcctcgtcgcactggctttccttcagggctcctttgcctatggaggctggaac tttctgaattacgtgactgaggagcttgttgatccctacaagaaccttcccagagccatc ttcatctccatcccactggtcacatttgtgtatgtctttgccaatgtcgcttatgtcact gcaatgtccccccaggagctgctggcatccaacgccgtcgctgtgacttttggagagaag ctcctaggagtcatggcctggatcatgcccatttctgttgccctgtccacatttggagga gttaatgggtctctcttcacctcctctcggctgttcttcgctggagcccgagagggccac cttcccagtgtgttggccatgatccacgtgaagcgctgcaccccaatcccagccctgctc ttcacatgcatctccaccctgctgatgctggtcaccagcgacatgtacacactcatcaac tatgtgggcttcatcaactacctcttctatggggtcacggttgctggacagatagtcctt cgctggaagaagcctgatatcccccgccccatcaagatcaacctgctgttccccatcatc tacttgctgttctgggccttcctgctggtcttcagcctgtggtcagagccggtggtgtgt ggcattggcctggccatcatgctgacaggagtgcctgtctatttcctgggtgtttactgg caacacaagcccaagtgtttcagtgacttcattgagctgctaaccctggtgagccagaag atgtgtgtggtcgtgtaccccgaggtggagcggggctcagggacagaggaggctaatgag gacatggaggagcagcagcagcccatgtaccaacccactcccacgaaggacaaggacgtg gcggggcagccccagccctga 
>gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_9|315_aa MEEGARHRNNTEKKHPGGGESDASPEAGSGGGGVALKKEIGLVSACGIIVDGEVMEEKRG QIVPLGNIIGSGIFVSPKGVLENAGSVGLALIVWIVTGFITVVGALCYAELGVTIPKSGG DYSYVKDIFGGLAGFLRLWIAVLVIYPTNQAVIALTFSNYVLQPLFPTCFPPESGLRLLA AICLCLLTDLEAEECRLQNQAACACISSPPFLTAVKASNCLVQNAVTEKLLEGRESVGAG QSVAEGREVNAPAKGEAEKPTLLAGHDPDKEEWPGSGNVKGKPSLSKPVQSTPEKGASQR DREIQMEFREFGGNM >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_9|948_bp atggaagaaggagccaggcaccgaaacaacaccgaaaagaaacacccaggtgggggcgag tcggacgccagccccgaggctggttccggagggggcggagtagccctgaagaaagagatc ggattggtcagtgcctgtggtatcatcgtagatggggaagttatggaagaaaagagggga cagattgtgcctctcgggaacatcatcggctctggaatctttgtctcgccaaagggagtg ctggagaatgctggttctgtgggccttgctctcatcgtctggattgtgacgggcttcatc acagttgtgggagccctctgctatgctgaactcggggtcaccatccccaaatctggaggt gactactcctatgtcaaggacatcttcggaggactggctgggttcctgaggctgtggatt gctgtgctggtgatctaccccaccaaccaggctgtcatcgccctcaccttctccaactac gtgctgcagccgctcttccccacctgcttccccccagagtctggccttcggctcctggct gccatctgcttatgcctgctcacagatctggaggcagaggaatgcaggctccaaaaccag gctgcctgcgcctgcatctccagtccaccatttctaactgcagtgaaagcctccaactgc ctggtgcagaatgctgtcacagagaaactcctggaggggagggagtctgtgggagctggc cagagcgtggcagaaggcagagaagtgaacgccccagcaaaaggtgaggctgagaagcca acccttctggcaggccatgacccagacaaggaggagtggcctggttctggaaatgttaaa ggaaaaccaagcctttccaagccagtccagtccactcctgagaaaggtgcaagccagcga gaccgagaaatccaaatggagttccgggagtttggagggaacatgtga >gi568815584r:23017490_23219091|GENSCAN_predicted_peptide_10|37_aa XFTCFQQEGKFSSFHDDERQDMRLHNLLKIFVRDLPV >gi568815584r:23017490_23219091|GENSCAN_predicted_CDS_10|114_bp nnttttacttgcttccaacaggaaggcaagttttcctccttccatgacgatgaaaggcag gacatgagacttcacaacctgctaaaaatcttcgttcgggacctacctgtctag