GENSCAN 1.0    Date run: 8-Nov-116    Time: 15:15:22

Sequence gi568815590f:18300004_18500873 : 200870 bp : 39.48% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   8122   8325  204  0  0   66   45   157 0.465   4.14
 1.02 PlyA +  10067  10072    6                               1.05

 2.03 PlyA -  11336  11331    6                               1.05
 2.02 Term -  12986  12898   89  0  2  120   53    80 0.696   4.54
 2.01 Init -  17741  17639  103  2  1   34   77   130 0.539   6.95
 2.00 Prom -  20541  20502   40                              -5.95

 3.00 Prom +  21090  21129   40                              -4.25
 3.01 Init +  30296  30420  125  1  2   56   49    89 0.214   1.49
 3.02 Intr +  32707  32774   68  1  2   59   93    86 0.224   3.83
 3.03 Intr +  42999  43125  127  1  1   14   47   117 0.081  -0.98
 3.04 Intr +  47821  48001  181  1  1   43   91    91 0.142   3.85
 3.05 Intr +  52866  53020  155  2  2   26   55    99 0.095  -1.65
 3.06 Intr +  59934  60030   97  0  1  123   67    49 0.278   5.39
 3.07 Term +  70940  71224  285  1  0  106   48   156 0.061   7.82
 3.08 PlyA +  71494  71499    6                               1.05

 4.02 PlyA -  71515  71510    6                               1.05
 4.01 Sngl -  83774  83382  393  2  0   36   41   431 0.990  29.19
 4.00 Prom -  84085  84046   40                              -8.95

 5.00 Prom +  84416  84455   40                              -8.25
 5.01 Init +  85578  85788  211  2  1   91   40   116 0.590   6.09
 5.02 Intr +  86250  86599  350  1  2   61   81    95 0.197   0.25
 5.03 Intr +  87198  87584  387  2  0   62   43   249 0.441  11.86
 5.04 Intr +  95970  95981   12  2  0  114  121     5 0.181   0.46
 5.05 Intr +  98300  98326   27  1  0   77  115    19 0.203   0.89
 5.06 Term +  99995 100873  879  1  0   79   39   506 0.314  36.35
 5.07 PlyA + 100886 100891    6                               1.05

 6.03 PlyA - 101024 101019    6                               1.05
 6.02 Term - 109568 109394  175  1  1  126   42    97 0.259   5.15
 6.01 Init - 122884 122736  149  1  2   42   30   148 0.019   4.01
 6.00 Prom - 139273 139234   40                              -4.85

 7.00 Prom + 149079 149118   40                              -5.75
 7.01 Init + 150006 150114  109  2  1   90  110    80 0.873  10.83
 7.02 Term + 150930 151207  278  1  2  -23   39   195 0.450  -1.86
 7.03 PlyA + 153622 153627    6                               1.05

 8.00 Prom + 157568 157607   40                              -3.65
 8.01 Sngl + 167920 168222  303  0  0   83   54   247 0.888  16.38
 8.02 PlyA + 168807 168812    6                               1.05
 9.00 Prom + 172579 172618   40                              -3.95
 9.01 Sngl + 176518 176724  207  0  0   61   48   202 0.566   8.44
 9.02 PlyA + 177165 177170    6                               1.05

10.02 PlyA - 179084 179079    6                               1.05
10.01 Sngl - 186356 185931  426  2  0   67   52   306 0.673  20.94
10.00 Prom - 193031 192992   40                              -3.45

11.02 PlyA - 193196 193191    6                               1.05
11.01 Term - 198997 198890  108  1  0   87   54   145 0.953   8.53

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 136449 136511   63  2  0   69   93    67 0.876   6.50

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_1|67_aa
MLKIANHQENANQHHNITSHPLGWLLPKTNNNNKKITSIGIGKNVEKLEPLCIPDGNAKR
YSHYGKQ

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_1|204_bp
atgctcaaaattgccaatcaccaggaaaacgcaaatcaacaccacaatatcacctcacac
ccattaggatggctactaccgaaaacaaacaacaacaacaagaaaataacaagcattggc
attggcaagaatgtggagaaactggaacccttgtgcattcctgatgggaatgcaaaacgg
tacagtcactacggaaaacagtag

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_2|63_aa
MGSCWKTAVPNVGTALEHHPPFTPFGMGISAGENASSAQRRGDVNTSESEMTAVYRPREE
PSE

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_2|192_bp
atggggagctgttggaagacagctgttcccaatgtggggacagctcttgagcatcatcct
ccgttcacccctttcggcatgggaatctctgctggagagaatgcctcaagtgcacaaaga
agaggtgatgtgaacacaagtgagagtgagatgacagctgtctataggccaagagaagag
ccctcagaatga

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_3|345_aa
MYRKLKQDNEQKQTDPLKKWAKDMNRHFSKEDIQAAKKPEKWCLSEDIPWPEQLKGRHES
SRAHFLEAEKAKIKAPASGKALLPSHAATTEGKRAERNPVSSHGRKVKCRREETPLLLGG
RSGKGTGALQPSASPALVKHRTRQNPEASDSRPLPPHGTSRSTPDIQDHTSSVAMAPKQN
ENFEVTDRKLKIWIIKMLGEIQEKLENQLKETRKIIQDMLLVLLFLDFSQALTFHVTHGK
SRIRLPDSRNQVTVNGKYHITDGQFGSSFQMWQPLGFISGKDQPQVSLIFCLRERVDSAT
WITLEDVSTFQTKIFLILISGKKRNNKKKSTPLLLNPEQLKILSL

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_3|1038_bp
atgtataggaaacttaaacaagacaacgagcaaaaacaaacagacccattaaaaaaatgg
gcaaaggacatgaacagacacttctcaaaagaagacatacaagcagccaagaaacctgaa
aaatggtgcttgtctgaagacatcccctggcctgaacaactaaaaggacggcatgagagt
tctcgtgctcactttctggaggctgagaaagccaagatcaaggcaccagcttctggcaaa
gcccttcttccatcacatgctgccaccacagaaggcaaaagggcagagaggaatcctgtg
tcttcacatggcagaaaagtaaagtgcaggagagaggagacaccccttctgcttggggga
aggagtggaaagggtacaggggccttgcaacctagtgccagccctgctctggtaaaacac
agaacaaggcagaatcctgaagcatcagattccagaccattacccccacatggcacttcc
agatccaccccagatatacaggatcatactagttctgtagcaatggctcctaagcaaaat
gaaaattttgaagtgacagatagaaaactcaaaatatggattataaaaatgctcggtgag
atccaggagaaacttgaaaatcaacttaaagaaacaagaaaaataattcaggatatgtta
ttggttctgcttttcctggacttttcccaagctctcaccttccacgtaacacatgggaag
tctcgcattaggcttccagatagcagaaaccaggtgaccgttaacggcaagtaccacatc
actgatggtcagtttggaagctccttccagatgtggcagcctctggggtttatttctggg
aaggatcagcctcaggtgtcactcattttctgcttgagagagagagtggattctgctacc
tggatcaccttagaagatgtcagcacatttcaaactaagattttcttaattctgatctcc
gggaagaaaagaaataacaaaaaaaaatctactcctttactcctgaatcctgaacaattg
aagattttgagtctgtga

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_4|130_aa
MLLSLKSTPHSKFSLCSLGDQQHREEARAVGIPDVNIEMMKELNESKKLVNTLDKKNDAF
VASESLIKQIPRILGPGLNKAGKFPSLLTHNENMVAKGDEVKSTIKFQMKKVLCLAVTVG
HVKMTGDEFV

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_4|393_bp
atgcttctcagtcttaagtccactccccactccaagttctccctctgttctttgggggac
cagcagcaccgtgaagaggcaagggccgtgggtatccccgacgtgaacatcgagatgatg
aaagaactcaacgagagtaagaaactggtcaatacgctggacaagaagaatgacgcattt
gtggcttcagagtctctgatcaagcagatcccacgaatcctcggcccaggcctaaataag
gcaggaaagttcccttctctgctcacacacaacgaaaacatggtggccaaaggggacgag
gtgaagtccacaatcaagttccaaatgaagaaggtgttatgtctggctgtgactgttggc
catgtgaagatgacaggcgatgagtttgtgtaa

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_5|621_aa
MQSEQSQITHGEPWSSLPIKGVLSHWVEATHTVHVLSSMAEADAEGNGLDCQQSTLLAAE
WPVISQKRSRAPPSARKLLLSDSRRVVRHVSHCPTTGASNWFRGGHVAQLTSMIWKGTAP
FHPLNFELRNAARATKADKRMRLKQAEAKRAEKTMVFCPGHTVPGWAPASPECQVPIHPT
PGLSSLPVLAPPLLLHLVRPALLPLAAPPPPLPRFSLRLKPLGQALGVPDCRALRLVRQV
CPRPLHVGLQLSCLPLVPAPEFSSARTSSLSAARGSPSAPACSSSLRLAGRLVLIHLHLV
DDQPRRLLRLLAGHLWFSQKGSYDLQSNKGIMDIEAYFERIGYKNSRNKLDLETLTDILE
HQIRAVPFENLNMHCGQAMELGLEAIFDHIVRRNRGGWCLQVNQLLYWALTTIGFQTTML
GGYFYIPPVNKYSTGMVHLLLQVTIDGRNYIVDAGSGSSSQMWQPLELISGKDQPQVPCI
FCLTEERGIWYLDQIRREQYITNKEFLNSHLLPKKKHQKIYLFTLEPRTIEDFESMNTYL
QTSPTSSFITTSFCSLQTPEGVYCLVGFILTYRKFNYKDNTDLVEFKTLTEEEVEEVLRN
IFKISLGRNLVPKPGDGSLTI

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_5|1866_bp
atgcagagtgaacagtctcaaataactcacggggaaccgtggagcagtctgcctattaaa
ggagtgcttagtcactgggtggaggccacccacacagtgcatgtccttagctcaatggct
gaagcagatgcagaaggcaatggtctggactgtcaacaatctacactccttgcagctgaa
tggccagttatttctcaaaaaagatcaagagctccaccctctgccaggaaacttttactt
tctgactccaggagggtagtgaggcatgtcagtcactgccccacaactggagccagcaac
tggttcagaggtggccatgtggcccaacttacatccatgatctggaaaggtacagctccc
ttccaccctttgaattttgagctgcggaatgcggcccgggctactaaggcagataaaaga
atgagactcaaacaggcagaagccaagagagcagagaagaccatggtgttctgtcccgga
cacacggtgcctggatgggctccagcttctcccgagtgccaggtgcccatacaccccacc
ccggggttgtcctcccttccagttttggcgccgccgcttcttctccacctcgtccgcccc
gcgctcctcccgctggcggcgccgccgccgccgctcccgcgcttctccctgcgcctcaag
cccctagggcaggcgctaggcgttccggattgtcgcgccctccgcctggttcggcaggtc
tgtccgcggccccttcacgttggcctccagctctcgtgccttccgctggtcccggccccg
gagttcagcagcgcccggacttcctccctttccgctgcgcgaggctccccttcagctcct
gcctgttcctcctctttgcgccttgctggccggctcgtgctcattcacctccaccttgtg
gacgaccagccccgacggctcctgcggctccttgcaggccacctctggttttctcaaaag
ggatcttatgacctacaatccaacaaggggatcatggacattgaagcatattttgaaaga
attggctataagaactctaggaacaaattggacttggaaacattaactgacattcttgag
caccagatccgggctgttccctttgagaaccttaacatgcattgtgggcaagccatggag
ttgggcttagaggctatttttgatcacattgtaagaagaaaccggggtgggtggtgtctc
caggtcaatcaacttctgtactgggctctgaccacaatcggttttcagaccacaatgtta
ggagggtatttttacatccctccagttaacaaatacagcactggcatggttcaccttctc
ctgcaggtgaccattgacggcaggaattacattgtcgatgctgggtctggaagctcctcc
cagatgtggcagcctctagaattaatttctgggaaggatcagcctcaggtgccttgcatt
ttctgcttgacagaagagagaggaatctggtacctggaccaaatcaggagagagcagtat
attacaaacaaagaatttcttaattctcatctcctgccaaagaagaaacaccaaaaaata
tacttatttacgcttgaacctcgaacaattgaagattttgagtctatgaatacatacctg
cagacgtctccaacatcttcatttataaccacatcattttgttccttgcagaccccagaa
ggggtttactgtttggtgggcttcatcctcacctatagaaaattcaattataaagacaat
acagatctggtcgagtttaaaactctcactgaggaagaggttgaagaagtgctgagaaat
atatttaagatttccttggggagaaatctcgtgcccaaacctggtgatggatcccttact
atttag

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_6|107_aa
MPYYGTPEGFGAEGNVRRSGKRQLAAGRYWSGEVSGDCPMTVQEVENCLKCTKIGKLKAD
SELGVGCMPAAAGRRVGTDTQRSLPDKHNKKTRKQSKPVINSHTLNP

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_6|324_bp
atgccctattatggcacccctgaaggctttggggctgaaggtaatgtgaggagatccgga
aagaggcagctggctgcaggaagatactggagtggggaagtcagtggggactgtcctatg
acagtccaggaagttgaaaactgcctcaagtgcactaagatagggaagctaaaagcagac
tcagaattgggggtagggtgtatgcctgcagctgcaggaagacgtgtgggaacagacaca
caacgctccctcccagacaagcacaacaaaaagacacggaagcagtcaaagcctgtgata
aactctcacaccctgaatccttaa

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_7|128_aa
MTSSNSKAPSPNTITLEVKDSVYEFVKDTNSQPLTPGRSQLIGKGWIYKRKRRQSIDKIP
EKTELDGVQSRDRKINLQQEVGHALCDKRGKRQRGGKEDVRSLEDKRLKISEDSYFLCEN
GGQPIYYV

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_7|387_bp
atgacctcatctaactctaaggccccatctccaaataccatcacattagaggtgaaagac
tcagtatatgaatttgtgaaagatacaaacagtcagcccttaacaccaggaagaagccag
ttaatagggaaaggttggatttataagagaaaaagaaggcaatcaattgataagatccct
gagaagacagagctagatggggtccagagcagagatagaaagattaacctgcaacaggag
gtgggacatgccctttgtgataagagggggaaaaggcaacgaggggggaaggaagatgtc
cgcagtttggaggataagagattgaagatatctgaggactcctattttctttgtgaaaat
ggaggccagcccatctactacgtgtaa

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_8|100_aa
MGRNQSRKGENSKNQSTSSPPKDHSSSPAMEQNWTENDFDELTEVGFRRLLITNFSKLKE
DVRIHCKEAKNLAKRLDEWLTRINSIEKTLNDQMELKTMA

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_8|303_bp
atggggagaaaccagagcagaaaaggtgaaaattctaaaaaccagagcacctcttctcct
ccaaaggatcacagctcatcaccagcaatggaacaaaactggacagagaatgactttgat
gagctgacagaagtaggcttcagaaggttgctaataacaaacttctccaagctaaaggag
gatgttcgaatccattgcaaggaagctaaaaaccttgcaaaaagattagacgaatggcta
actagaataaacagcatagaaaagaccttaaatgaccagatggagctgaaaaccatggca
tga

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_9|68_aa
MYYNPGQSDSGMELRYHREKGSQRSGVAELVERITDWTCWPPSLPPAGVRKPKNSYGGGK
RNIFNYII

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_9|207_bp
atgtactacaacccagggcagtctgactcaggaatggaacttcggtatcacagagaaaaa
ggctctcagcgttctggggtggctgagctggtagaaagaataacagactggacctgctgg
ccaccatctctgccaccagctggtgtgaggaagccaaaaaatagctatggaggtggaaag
aggaatattttcaattacatcatctga

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_10|141_aa
MEHDQARPVNSVSSGIRHTCAEFLPHHFLCELGTGHNITYGDCEIMMGKVARSKCSVHVS
VHHDDDDDDDDNEEEELLHSSMPLNLHGSNAWQSAPTDTNFSISCDSHAYFGFSLYKTQL
FILPFQNHPVLLPLILDFLLP

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_10|426_bp
atggagcacgaccaagcacggccagttaatagtgtcagttctggaattagacacacctgc
gctgaattcctgccccaccacttcttgtgtgaattgggcacaggacataacattacctat
ggtgactgtgaaatcatgatgggaaaggtggcacgcagcaaatgctcagtgcatgtcagc
gttcaccatgatgatgatgatgatgatgatgacaatgaggaggaagaactgctccactcc
agcatgcctctaaacctgcatgggagcaatgcctggcagagcgctcccacagacacgaac
ttttccatttcctgtgattcacatgcttattttggtttctcgttatataagacccagctc
ttcattttgccatttcaaaatcacccagtgctcctaccactcatcctcgatttccttctt
ccatga

>gi568815590f:18300004_18500873|GENSCAN_predicted_peptide_11|35_aa
LCACCQNENEQCQCHDVQLSCDLDSGLAYTEATES

>gi568815590f:18300004_18500873|GENSCAN_predicted_CDS_11|108_bp
ctctgtgcgtgctgccaaaatgaaaatgaacaatgtcagtgccatgatgtccagttgagc
tgtgacctggattctggtctcgcatacacagaagcaactgaaagctga