GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:18:39 Sequence gi568815578r:32590183_32790815 : 200633 bp : 50.31% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 19421 19481 61 1 1 89 26 96 0.287 2.91 1.02 Intr + 20871 20935 65 1 2 101 58 70 0.554 3.74 1.03 Term + 21191 21304 114 1 0 77 53 81 0.622 2.07 1.04 PlyA + 23129 23134 6 1.05 2.04 PlyA - 24897 24892 6 1.05 2.03 Term - 32842 32469 374 2 2 133 46 148 0.917 9.95 2.02 Intr - 33661 33571 91 1 1 52 53 41 0.739 -3.43 2.01 Init - 37289 37224 66 2 0 72 76 101 0.750 6.39 2.00 Prom - 40367 40328 40 -3.96 3.08 PlyA - 41469 41464 6 1.05 3.07 Term - 48621 48516 106 2 1 129 54 52 0.459 3.78 3.06 Intr - 53781 53742 40 1 1 121 89 -24 0.096 -1.72 3.05 Intr - 59821 59598 224 0 2 117 72 80 0.591 7.07 3.04 Intr - 61580 61551 30 1 0 102 94 22 0.274 1.45 3.03 Intr - 77248 77103 146 1 2 100 98 72 0.347 8.48 3.02 Intr - 99235 99173 63 1 0 130 92 -31 0.005 0.41 3.01 Init - 100633 100022 612 1 0 76 56 661 0.025 56.43 3.00 Prom - 110555 110516 40 -6.56 4.10 PlyA - 110659 110654 6 1.05 4.09 Term - 113276 113200 77 0 2 77 34 54 0.315 -3.10 4.08 Intr - 113889 113841 49 0 1 87 95 7 0.463 -0.45 4.07 Intr - 114307 114258 50 2 2 122 108 7 0.870 4.50 4.06 Intr - 114722 114632 91 2 1 102 78 61 0.974 6.07 4.05 Intr - 116438 116401 38 0 2 132 70 25 0.681 3.08 4.04 Intr - 116578 116522 57 2 0 121 69 29 0.581 3.16 4.03 Intr - 137813 137711 103 2 1 106 101 34 0.935 6.25 4.02 Intr - 153364 153126 239 2 2 22 80 243 0.355 14.23 4.01 Init - 160295 160148 148 2 1 97 73 111 0.780 10.75 4.00 Prom - 166899 166860 40 -3.66 5.00 Prom + 167570 167609 40 -9.85 5.01 Init + 171105 171181 77 2 2 81 78 3 0.459 -0.83 5.02 Intr + 172247 172517 271 2 1 57 99 220 0.196 17.54 5.03 Intr + 190136 190283 148 1 1 123 24 238 0.792 20.71 5.04 Intr + 191171 191232 62 0 2 96 80 22 0.761 0.65 5.05 Intr + 194576 194677 102 1 0 52 87 108 0.976 7.47 5.06 Intr + 196320 196445 126 2 0 122 7 87 0.938 4.88 5.07 Intr + 197048 197269 222 1 0 104 80 138 0.949 12.92 5.08 Intr + 198672 198830 159 2 0 84 105 134 0.997 14.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 100633 99998 636 1 0 76 41 692 0.934 58.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:32590183_32790815|GENSCAN_predicted_peptide_1|79_aa MPCSLRLGLQLLLSPPRLTAWMLQLPRLLMEDERASLAELKEPRHGRQLSFLKAQYRLAS WQTPEAREMLASNSFCNHQ >gi568815578r:32590183_32790815|GENSCAN_predicted_CDS_1|240_bp atgccctgctctctgcggctgggactccaactactcctgtcccctccccggctcactgcc tggatgcttcagcttcctcgtctgctcatggaggatgagagggcctccctggcagagctg aaggagcctcgccatggtaggcagctgagcttcctgaaagctcagtacagactcgcaagc tggcagaccccagaggcccgagagatgctggcttccaactccttctgtaaccaccagtga >gi568815578r:32590183_32790815|GENSCAN_predicted_peptide_2|176_aa MAVLPTVPRAGIVPGQGAAARRAGPVNRGPLACPLLGEYCHQQARQVAARAAGEEWGQTK PKPNKIAGSLTGLTMVCKMPCSSRMCCQLGVQMGTLEQGTPGQRSAQMVKAQEATARPPE PEPGLQRRAVEVAVAPRPPGRCFLLVGLSCPGRALPLAVALRDTKHRQPSGILLPP >gi568815578r:32590183_32790815|GENSCAN_predicted_CDS_2|531_bp atggctgtcctccccactgtacctcgggcaggcatcgtgccaggccagggtgcggctgcc cggcgggctgggccggtgaaccgaggccccctagcctgcccgctgctcggggaatattgt caccagcaggccaggcaggtggcggctcgtgctgcaggagaggagtggggccaaaccaaa ccaaaaccaaacaaaattgctggaagcctgacaggtctaacgatggtttgtaagatgccc tgctcgagccggatgtgctgtcagcttggcgtccagatgggcaccctagagcagggcacg cctggccagcgcagtgcacagatggtgaaagcacaggaggccacagccaggcctccggag ccagagccaggactgcagcgacgggcagtggaggtggcggtggcccctcggcctccaggc cgttgcttcctcctggtgggcctcagctgcccgggtcgggctctgccactggcagttgca ctgcgggacaccaagcatcgccagccttcaggaatccttctccctccctga >gi568815578r:32590183_32790815|GENSCAN_predicted_peptide_3|406_aa MASGQGPGPPRQECGEPALPSASEEQVAQDTEEVFRSYVFYHHQQEQEAEGAAAPADPEM VTLPLQPSSTMGQVGRQLAIIGDDINRRYDSEFQTMLQHLQPTAENAYEYFTKIASSLFE SGINWGRVVALLGFSYRLALHIYQRGLTGFLGQVTRFVVDFMLHHCIARWIAQRGGWVAA LNLGNGPILNVLVVLGVVLLGQFVTLAPNPFTTGEGSHPHPWGPWPCQYSLKLFCGCHQR SSRPPSAHLGISLGSQSHLLRSPPGSSQSQLQRQVPDCSCHMKRAQKAPLRMLDLRTGAH ALHPALSSQILEAQSKLWAKGAGRAEACFVSAGSGELGALAGAHGPGPNQPPSVSLKKGL RVPHRAMMLSWWASAWQPAHVEAWRQVVHGFCGHFSGEKRKQGQLC >gi568815578r:32590183_32790815|GENSCAN_predicted_CDS_3|1221_bp atggcctcggggcaaggcccaggtcctcccaggcaggagtgcggagagcctgccctgccc tctgcttctgaggagcaggtagcccaggacacagaggaggttttccgcagctacgttttt taccaccatcagcaggaacaggaggctgaaggggcggctgcccctgccgacccagagatg gtcaccttacctctgcaacctagcagcaccatggggcaggtgggacggcagctcgccatc attggggacgacatcaaccgacgctatgactcagagttccagaccatgttgcagcacctg cagcccacggcagagaatgcctatgagtacttcaccaagattgcctccagcctgtttgag agtggcatcaattggggccgtgtggtggctcttctgggcttcagctaccgtctggcccta cacatctaccagcgtggcctgactggcttcctgggccaggtgacccgctttgtggtggac ttcatgctgcatcactgcattgcccggtggattgcacagaggggtggctgggtggcagcc ctgaacttgggcaatggtcccatcctgaacgtgctggtggttctgggtgtggttctgttg ggccagtttgtgacacttgctcccaacccattcactacaggtgaaggctctcacccccat ccctgggggccttggccttgtcagtattccctgaagctcttctgtggctgtcatcaacgc agctctaggccaccttcagcccacctgggcatctcactgggcagccaatctcacctgctc cgaagccctccaggcagcagccagagtcagcttcaaagacaagtgccagactgcagctgc cacatgaagagagcccagaaagctcctctgagaatgctggacctcaggacaggggcccat gctctgcacccggccctcagcagccagatcctagaggcccagtccaagctgtgggccaag ggagcggggcgggctgaggcctgctttgtgtcggcagggtctggggaattgggagcactg gcgggagcccatggtccggggccgaaccagcccccttctgtttccctgaagaaggggctg agagtgccccacagggccatgatgctttcatggtgggccagtgcatggcagccagcccat gtggaggcctggagacaggtggtccacggcttctgtggccatttttcaggagagaagagg aagcaaggacagctttgctga >gi568815578r:32590183_32790815|GENSCAN_predicted_peptide_4|283_aa MGDRCCHGNVLRLDCINVNILVMTLWDLCIEGNRVKATRNVSVSFLKTVSQGGRGRSALR RPWRKCGCGCAERPAAVAAAAEEEAEQGGGGGGTCGAGARAMGRLHCTEDPVPEAVGGDM QQLNQLGAQVERFLAQLSEFATTNQISLGSLRSIVKSLLLVPNGALKKSLTAKQVQADFI TLGLSEEKATYFSEKWKQNAPTLARWAIGQTLMINQLIDMEWKFGVTSGSSELEKVGSIF LQLKLVVKKGNQTENVYIELTLPQFYSFLHEMERVRTSMECFC >gi568815578r:32590183_32790815|GENSCAN_predicted_CDS_4|852_bp atgggagatcgttgctgccatggaaatgttctgcgtcttgactgtatcaatgtcaatatc ctggtgatgacactgtgggatctgtgcatcgagggaaaccgggtaaaggccacaaggaat gtctctgtatcatttcttaaaactgtatcccagggcggccgggggaggagtgcgctgcga cggccgtggcgaaagtgcggttgtggatgcgcggagaggccggcagcggtggcagcggca gcggaggaggaagctgagcagggcggcggcggcggtggaacctgcggggctggggcgcgc gccatgggccgcctgcactgcactgaggacccggtgccggaggccgtgggcggcgacatg cagcagctgaaccagctgggcgcgcaggtggaaagatttctggctcagctctctgaattt gccaccaccaatcagatcagtcttggctccctcagaagcatcgtgaaaagcctccttctg gttccaaatggtgctttgaagaagagtctcacagccaagcaggtccaggcggatttcata actctgggtcttagtgaggagaaagccacttacttttctgaaaagtggaagcagaatgct cccacccttgctcgatgggccataggtcagactctgatgattaaccagctcatagatatg gagtggaaatttggagtgacatctgggagcagcgaattggagaaagtgggaagtatattt ttacaactaaagttggtggttaagaaaggaaatcaaaccgaaaatgtgtatatagaatta accttgcctcagttctacagcttcctgcacgagatggagcgagtcagaaccagcatggag tgtttctgctga >gi568815578r:32590183_32790815|GENSCAN_predicted_peptide_5|389_aa MDRGDFRWSHREKPLHKVTFWEKTEGSCSRLRGRSPRGRSERPPTDGTGSLAVGRAGGNA ARPAALGLSGPSKPSSAIGAGDSRAQRPARPPAGLPPASPDPRLRRPAAPQPALRQESMK GDTRHLNGEEDAGGREDSILVNGACSDQSSDSPPILEAIRTPEIRGRRSSSRLSKREVSS LLSYTQDLTGDGDGEDGDGSDTPVMPKLFRETRTRSESPAVRTRNNNSVSSRERHRPSPR STRGRQGRNHVDESPVEFPATRSLRRRATASAGTPWPSPPSSYLTIDLTDDTEDTHGTPQ SSSTPYARLAQDSQQGGMESPQVEADSGDGDSSEYQDGKEFGIGDLVWGKIKGFSWWPAM VVSWKATSKRQAMSGMRWVQWFGDGKFSE >gi568815578r:32590183_32790815|GENSCAN_predicted_CDS_5|1167_bp atggacaggggagacttcaggtggagtcatcgggaaaagcctctccataaagtgaccttc tgggagaaaaccgagggcagctgctcccggctccgcggccgcagcccgcgtggacgctcc gagcgccccccgacggacgggaccggctccctggcggtcgggcgagcgggcggcaacgct gcccggccggcagcgctggggttaagtggcccaagtaaacctagctcggcgatcggcgcc ggagattcgcgagcccagcgccctgcacggccgccagccggcctcccgccagccagcccc gacccgcggctccgccgcccagccgcgccccagccagccctgcggcaggaaagcatgaag ggagacaccaggcatctcaatggagaggaggacgccggcgggagggaagactcgatcctc gtcaacggggcctgcagcgaccagtcctccgactcgcccccaatcctggaggctatccgc accccggagatcagaggccgaagatcaagctcgcgactctccaagagggaggtgtccagt ctgctaagctacacacaggacttgacaggcgatggcgacggggaagatggggatggctct gacaccccagtcatgccaaagctcttccgggaaaccaggactcgttcagaaagcccagct gtccgaactcgaaataacaacagtgtctccagccgggagaggcacaggccttccccacgt tccacccgaggccggcagggccgcaaccatgtggacgagtcccccgtggagttcccggct accaggtccctgagacggcgggcaacagcatcggcaggaacgccatggccgtcccctccc agctcttaccttaccatcgacctcacagacgacacagaggacacacatgggacgccccag agcagcagtaccccctacgcccgcctagcccaggacagccagcaggggggcatggagtcc ccgcaggtggaggcagacagtggagatggagacagttcagagtatcaggatgggaaggag tttggaataggggacctcgtgtggggaaagatcaagggcttctcctggtggcccgccatg gtggtgtcttggaaggccacctccaagcgacaggctatgtctggcatgcggtgggtccag tggtttggcgatggcaagttctccgag