GENSCAN 1.0	Date run: 4-Nov-116	Time: 02:17:40

Sequence gi568815591f:137621987_137822850 : 200864 bp : 40.15% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4424   4524  101  1  2   77   69    88 0.316   3.79
 1.02 Term +   9469   9613  145  1  1   55   48   151 0.362   4.20
 1.03 PlyA +  10277  10282    6                               1.05

 2.12 PlyA -  12169  12164    6                               1.05
 2.11 Term -  14983  14560  424  0  1  -22   50   231 0.013   2.08
 2.10 Intr -  15638  15592   47  2  2  111  111    27 0.016   3.59
 2.09 Intr -  17727  17529  199  2  1   54   86   180 0.004  12.83
 2.08 Intr -  23551  23424  128  1  2  102   61    46 0.006   1.86
 2.07 Intr -  30665  30552  114  2  0   73   86    57 0.030   3.72
 2.06 Intr -  34554  34480   75  0  0   21  116    57 0.018   0.59
 2.05 Intr -  36051  35997   55  2  1  101   15    82 0.019   0.26
 2.04 Intr -  36676  36539  138  0  0  102   95    59 0.847   6.66
 2.03 Intr -  43565  43471   95  2  2   51   85    61 0.033  -0.06
 2.02 Intr -  56666  56571   96  2  0   86   73    70 0.119   4.59
 2.01 Init -  66681  66580  102  0  0   76   78    57 0.103   3.79
 2.00 Prom -  69570  69531   40                              -5.75

 3.00 Prom +  71528  71567   40                              -4.15
 3.01 Sngl + 100001 100816  816  1  0   94   39  1081 0.999  99.30
 3.02 PlyA + 101416 101421    6                               1.05

 4.03 PlyA - 103884 103879    6                               1.05
 4.02 Term - 104437 104255  183  1  0   48   42   172 0.446   5.16
 4.01 Init - 130912 130781  132  1  0   47   63   119 0.241   5.49
 4.00 Prom - 131289 131250   40                              -7.25

 5.00 Prom + 131887 131926   40                              -7.55
 5.01 Init + 131938 132115  178  0  1   62   81   173 0.747  13.47
 5.02 Intr + 144094 144166   73  2  1  116   69    78 0.540   6.25
 5.03 Intr + 144663 144760   98  0  2   66   65     9 0.454  -4.87
 5.04 Term + 144878 145269  392  0  2   30   48   259 0.541  10.26
 5.05 PlyA + 145272 145277    6                               1.05

 6.00 Prom + 148015 148054   40                              -3.65
 6.01 Init + 149705 149784   80  1  2   86   92    16 0.613   2.38
 6.02 Term + 152834 152957  124  2  1   82   48   107 0.715   2.98
 6.03 PlyA + 153820 153825    6                               1.05

 7.00 Prom + 155239 155278   40                              -4.05
 7.01 Init + 155949 156018   70  2  1   75  110    58 0.758   7.96
 7.02 Intr + 164117 164266  150  0  0   24   59   108 0.081   0.71
 7.03 Intr + 198409 198511  103  2  1   66   66   127 0.288   6.61
 7.04 Intr + 198702 198895  194  0  2   40   72    78 0.137  -0.39
 7.05 Term + 198978 199099  122  1  2   37   44   119 0.249   0.06
 7.06 PlyA + 199646 199651    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  14961  14560  402  0  0   76   50   227 0.933  13.72
S.002 Intr +  22302  22388   87  2  0  126   93    77 0.823  11.25
S.003 Term +  50362  50459   98  2  2  114   53    86 0.837   4.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:137621987_137822850|GENSCAN_predicted_peptide_1|81_aa
GQQQPPQSPTPESCNRLNRTAYKRTQSLVGQRVRAMCLIREGEEAAKSEKNYISKELAAG
PNHYVLAGMGESGGLDPEGEG

>gi568815591f:137621987_137822850|GENSCAN_predicted_CDS_1|246_bp
ggccagcaacaacccccacagtccccaacacctgagagttgcaaccgcctaaacagaaca
gcttacaagaggacacagagtctagtgggccaaagggtgagagcaatgtgcctgatcagg
gaaggtgaagaggctgctaagagtgaaaaaaactacatctcaaaagaactagctgcagga
cctaatcactatgtactagcaggaatgggagagagtgggggactggatcctgagggtgaa
ggatga

>gi568815591f:137621987_137822850|GENSCAN_predicted_peptide_2|490_aa
MMRVYYVSRNRKSWFDIHDVPNNPVVCSSRAFFEENAVNGEHLWLETNVSGDLCYLGEEN
CQVRFANRTKHAVRRASSEFSCGTREAIPTIIEHETVSDFRNRLRNPWNSIELNAERSVK
HEARLGQELLARREAFQASLNHSLKRKSESDKSICKQNKGISKSALRRKCAVCKIVVHTA
CIEQLEKHGSNRKVVGHAQGEESRRRMIGLWCLSATQSADSVGQKNFVRHHWVHRRRQEG
KCKQCGKVRAQPPHGLAGGSPGLCKQLSKYSALVTLGPFVLSSVVPAHLSTFFWLLLLGL
LVLVNPDDLHGEQRPKPQARVAMRTLPVMVFRSQGRINPKVHPQLIKQVSSAGCTRSMAP
ASGEGFRKLLLVVQGEGGALSVEITWQERKKRGASVTQQSDVTGTNREFTHCCECSTKPL
IGDLPPSPKHLLTVPTSNTGDQISAGDSRGDRYPSCISSSDSFRLRAIPEDGRVTHLKTF
SFQDSSQLRE

>gi568815591f:137621987_137822850|GENSCAN_predicted_CDS_2|1473_bp
atgatgagagtgtattatgtatcaaggaaccgaaagagttggtttgatattcacgatgtt
ccaaacaaccctgtagtctgcagcagcagggccttctttgaggagaatgccgtgaatgga
gaacacctgtggctggagaccaacgtctcgggagacctctgctaccttggagaggagaac
tgccaagtcagatttgcaaacagaacaaaacacgctgttcgaagagcatcaagtgagttt
tcctgcggtactcgagaggcaattccaaccataatagagcatgagactgtcagtgacttt
cgtaacagactaagaaatccctggaactccatagaacttaatgcagagcgtagtgtcaag
cacgaagctagactaggacaggagttattggcaaggagggaggcttttcaggcatcctta
aatcacagcctgaagaggaaatccgagagtgataaaagcatctgcaagcagaataaagga
atctcaaaatcagctctcaggaggaagtgtgcagtctgtaaaatcgtcgtccacaccgcc
tgcattgagcagctagaaaagcatggcagcaataggaaggttgtgggccatgcgcagggt
gaggaaagcaggagaagaatgatagggctgtggtgcctcagtgccacccagagtgcagac
agtgtgggccagaagaattttgtacgtcatcactgggtgcacaggcgtcggcaggagggg
aaatgtaagcagtgtggtaaggtaagagcccagccacctcatggcctcgcaggaggctcc
ccaggcctctgcaagcaactgagcaaatacagtgccctggtcaccctggggccctttgtt
ctttcttcagtagttcccgcccacttgtctaccttcttctggctgctgctgttgggcctg
ctggtacttgtgaacccagatgacctgcacggggagcagaggcccaaacctcaggccagg
gtggcaatgaggacacttcctgtcatggtgttcaggtcccagggcaggattaatccaaaa
gtacatccacagttgattaaacaagtcagttctgcaggctgtacaagaagcatggcacca
gcatctggtgagggcttcaggaaacttctgctcgtggtgcagggggaaggaggagccctg
agtgtagagatcacatggcaagagagaaagaaaaggggtgccagtgtcacccaacaatca
gatgtcactggaactaatagagaattcactcattgctgtgaatgcagcaccaagccactc
ataggggatctgcctccatcacccaaacacctcctaacggttcccacttccaacactggg
gatcaaatttcagcaggagattctagaggggacagatatccaagctgcatcagctcaagt
gactcttttaggctcagggcaattcctgaagatgggagggtgactcatttgaagaccttc
agcttccaagactcttctcaactgagggaatga

>gi568815591f:137621987_137822850|GENSCAN_predicted_peptide_3|271_aa
MAGEKVEKPDTKEKKPEAKKADAGGKVKKGNLKAKKPKKGKPHCSRNPVIVRGIGRYSRS
AMYSRKAMYKRKYSAAKSKVEKKKKEKVLATVTKPVGGDKNGSTRVVKLRKMPRYYPTED
VPRKLLSHSKKPFSQHVRKLRASITPGTILIILTGRHRGKRVVFLKQLASGLLLVTGPLV
LNRVPLRRTHQKFVIATSTKIDISNVKIPKHLTDAYFKKKKLRKPRHQEGEIFDTEKEKY
EITEQCKIEQKAVDSQILPKIKAIPQLQGYL

>gi568815591f:137621987_137822850|GENSCAN_predicted_CDS_3|816_bp
atggcgggtgaaaaagttgagaagccagatactaaagagaagaaacctgaagccaagaag
gctgatgctggtggcaaggtgaaaaagggtaacctcaaggctaaaaagcccaagaagggg
aagccccattgcagccgcaaccctgtcattgtcagaggaattggcaggtattcccgatct
gctatgtattccagaaaggccatgtacaagaggaagtactcagccgctaaatccaaggtt
gaaaagaaaaagaaggagaaggttcttgcaactgttacaaaaccagttggtggtgacaag
aacggcagtacccgggtggttaaacttcgcaaaatgcctagatattatcctactgaagat
gtgcctcgaaagctgttgagccacagcaaaaaacccttcagtcagcacgtgagaaaactg
cgagccagcattacccccgggaccattctgatcatcctcactggacgccacaggggcaag
agggtggttttcctgaagcagctggctagtggcttgttacttgtgactggacctctggtc
ctcaatcgagttcctctacgaagaacacaccagaaatttgtcattgccacctcaaccaaa
atcgatatcagcaatgtaaaaatcccaaaacatcttactgatgcttacttcaagaagaag
aagctgcggaagcccagacaccaggaaggtgagatcttcgacacagaaaaagagaaatat
gagattacggagcagtgcaagattgagcagaaagctgtggactcacaaattttaccaaaa
atcaaagctattcctcagctccagggctacctgtga

>gi568815591f:137621987_137822850|GENSCAN_predicted_peptide_4|104_aa
MAGEWRCSRTSRRQNPSDTELKKEGLYSAGSFGKTHISKNQAPQNSDHEKNILELDLDWK
GKVEDRRNRSEAFAAIYVDMAQIGLWGRGDAVNGELQDIWEREN

>gi568815591f:137621987_137822850|GENSCAN_predicted_CDS_4|315_bp
atggctggtgagtggaggtgtagcaggacaagccgcagacaaaacccctcagacactgag
ttaaagaaggaagggctttattcggccgggagcttcggcaagactcacatctccaaaaac
caagctccccaaaatagtgaccatgagaaaaatattttggaattggacttggattggaag
ggaaaagtggaggacaggagaaatcgatcagaagcatttgcagcaatctatgtggacatg
gctcagattggcctgtggggcagaggggatgcagtgaatggagagcttcaagacatatgg
gagagagagaattga

>gi568815591f:137621987_137822850|GENSCAN_predicted_peptide_5|246_aa
MGNQCFYYNIKKEEEEEERGGGGEEEVVVPLTGNYVVSEEEQKQGSLESKRGHGGLCKED
EDKRRKELEQDCSGLTILRGKEESTPRAGQEEFLFLEKLKEGSSGTRDSNREGLQGAGAQ
PQEQVWVVLLWRFWPAQGKSFQMLTSEDPPSEQICDPRPLCSGGPLPDEPGPPTEGVLSA
LQSLIHVCDLMGGPLPDEPGPPTEVILSALQSLVHVCDLMAADLLPLQESHQYERQGQST
QLGRKN

>gi568815591f:137621987_137822850|GENSCAN_predicted_CDS_5|741_bp
atgggcaatcaatgtttctattacaacattaaaaaagaggaggaggaggaggagcgtgga
gggggaggagaagaagaagtagtagttcccctaactgggaactatgttgtcagtgaggag
gagcaaaaacagggtagtctagaatccaagaggggtcatggggggttatgtaaagaagat
gaagataagaggagaaaagagctagaacaagattgttcaggacttacgatcctaagagga
aaggaagaaagcacacccagggcaggacaggaggaatttctttttctagagaaactgaaa
gagggatcctcaggaactagggactcaaatagggagggcttgcagggtgcaggtgcacag
ccccaggaacaagtctgggtggttctattatggagattctggccagcccaagggaaaagc
ttccagatgctgacatctgaggatcctccaagtgaacagatctgtgatcctaggccactc
tgcagtggggggccactaccagacgagccggggccacccacagaaggtgtcctgtcagct
cttcagagccttattcatgtctgtgacctgatgggggggccactaccagatgagcctggg
ccacccacggaagttatcctgtcagctcttcagagccttgttcatgtatgtgacctgatg
gctgcagatctcctgccattgcaggaaagccaccagtatgaaagacagggccaaagtact
caattaggaagaaagaactga

>gi568815591f:137621987_137822850|GENSCAN_predicted_peptide_6|67_aa
MVFMESVTSLTTEQSDRFIQICLTDPGSYMKRRREGLKGERERQAGVKISVTSQDCYEWL
LTISEKT

>gi568815591f:137621987_137822850|GENSCAN_predicted_CDS_6|204_bp
atggtcttcatggaaagtgtgacctcattaacgactgaacaatctgacagattcattcaa
atatgtctaacagaccctgggtcctatatgaaaaggaggagggaagggcttaaaggagaa
cgagaaagacaagcaggagtcaagatttctgtgacgtcacaggactgttatgagtggctg
ctaaccatttctgaaaagacatga

>gi568815591f:137621987_137822850|GENSCAN_predicted_peptide_7|212_aa
MWLFSPFTFPYFLNKNTRRLAITVLEVLARAIRQEKEIKGIQIGKEEVKLLLCADDIIVY
LENPRVIQKAPRTENSATALAETLADENGCESPAPCRSREAQLVGALSCLSNVWENPVSD
HGLSCLQVKPARKGSQEISQLAATLEVGLCETRPASQQDARASPSRHQALLKALEGESSS
SPAAANLHLISNSLSPVTVPQKSQTEHLVGAK

>gi568815591f:137621987_137822850|GENSCAN_predicted_CDS_7|639_bp
atgtggctattcagtccgttcaccttcccttatttcctcaacaagaatactaggcgattg
gcaatcacagtactagaagtcctggctagagcaatcaggcaagagaaagaaataaagggc
atccaaattggtaaagaggaagtcaaactgttgctgtgtgctgatgatatcattgtatac
ctagaaaaccctagagttatccagaaagctcctagaactgaaaactctgccacagctctg
gcagagacccttgcagatgaaaatggctgtgagagccctgcaccttgcagaagcagagaa
gcgcagctggtgggagcactcagctgtctctccaatgtctgggaaaaccccgtgtctgat
catggcctctcctgcttgcaagtaaagccagccagaaaaggcagccaagaaattagccag
ctggcagcaactctggaggtggggctgtgtgagaccagaccagcctctcagcaagacgca
agggcctcccccagccggcatcaggccttgctaaaggccctagagggggagagcagctcc
tccccagctgctgccaatttgcacctcattagcaacagcctctccccagtgacagtccct
cagaaatcccagactgagcacctcgttggtgctaaatga