GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:28:27 Sequence gi568815583f:48778177_48978737 : 200561 bp : 39.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 387 382 6 1.05 1.12 Term - 1898 1692 207 2 0 48 39 123 0.475 -0.24 1.11 Intr - 3183 3049 135 0 0 62 42 96 0.405 2.24 1.10 Intr - 4054 3963 92 2 2 113 89 87 0.878 10.09 1.09 Intr - 5944 5797 148 1 1 104 94 86 0.998 9.69 1.08 Intr - 10825 10625 201 1 0 46 47 280 0.974 18.26 1.07 Intr - 13200 13061 140 1 2 40 82 166 0.937 10.46 1.06 Intr - 15285 15145 141 1 0 65 116 132 0.968 13.10 1.05 Intr - 17984 17834 151 2 1 60 39 109 0.966 2.01 1.04 Intr - 19403 19125 279 2 0 76 72 247 0.939 18.65 1.03 Intr - 25951 25838 114 1 0 76 81 72 0.571 5.02 1.02 Intr - 27480 27387 94 2 1 87 107 148 0.025 15.65 1.01 Init - 39611 39427 185 2 2 58 76 78 0.021 2.24 1.00 Prom - 40057 40018 40 -4.15 2.00 Prom + 40514 40553 40 -3.55 2.01 Init + 61263 61473 211 2 1 62 82 117 0.722 7.49 2.02 Term + 61901 61968 68 0 2 97 54 87 0.986 3.32 2.03 PlyA + 62029 62034 6 1.05 3.10 PlyA - 62444 62439 6 1.05 3.09 Term - 64927 64884 44 2 2 142 50 53 0.956 3.24 3.08 Intr - 65412 65233 180 1 0 57 100 97 0.942 6.72 3.07 Intr - 68809 68726 84 2 0 96 53 116 0.922 7.77 3.06 Intr - 73636 73504 133 1 1 67 76 114 0.888 7.40 3.05 Intr - 77948 77777 172 1 1 97 76 148 0.998 13.52 3.04 Intr - 79639 79516 124 2 1 91 82 55 0.838 3.92 3.03 Intr - 83398 83201 198 2 0 99 69 29 0.155 0.50 3.02 Intr - 89693 89642 52 2 1 87 64 64 0.244 1.46 3.01 Init - 91727 91674 54 2 0 79 85 23 0.367 2.45 3.00 Prom - 91769 91730 40 -4.35 4.00 Prom + 98264 98303 40 -6.55 4.01 Sngl + 100001 100564 564 1 0 69 33 900 0.881 78.49 4.02 PlyA + 101436 101441 6 1.05 5.00 Prom + 109283 109322 40 -2.35 5.01 Init + 117722 117724 3 1 0 99 101 0 0.507 2.45 5.02 Intr + 128354 128419 66 1 0 95 65 61 0.001 2.68 5.03 Intr + 134695 134815 121 0 1 77 23 69 0.000 -1.55 5.04 Intr + 135556 135657 102 2 0 86 103 47 0.001 5.23 5.05 Intr + 144018 144148 131 1 2 67 3 111 0.002 -0.01 5.06 Intr + 149063 149203 141 1 0 104 90 46 0.692 6.03 5.07 Intr + 152224 152298 75 0 0 75 111 49 0.502 4.69 5.08 Intr + 161042 161221 180 1 0 34 94 120 0.251 6.34 5.09 Term + 166131 166187 57 2 0 88 54 27 0.023 -3.99 5.10 PlyA + 167358 167363 6 1.05 6.06 PlyA - 168589 168584 6 1.05 6.05 Term - 175424 175321 104 0 2 80 43 104 0.881 2.56 6.04 Intr - 177870 177790 81 1 0 136 70 38 0.929 5.49 6.03 Intr - 178570 178495 76 1 1 47 92 64 0.886 0.87 6.02 Intr - 184889 184255 635 0 2 76 110 365 0.967 28.42 6.01 Init - 185197 185143 55 1 1 59 52 118 0.807 4.80 6.00 Prom - 190053 190014 40 -5.05 7.00 Prom + 190402 190441 40 -5.55 7.01 Init + 193834 194505 672 0 0 69 38 1011 0.820 87.43 7.02 Term + 194569 194853 285 0 0 -33 48 393 0.959 17.62 7.03 PlyA + 195396 195401 6 1.05 8.02 PlyA - 197038 197033 6 -0.45 8.01 Term - 198422 198277 146 0 2 95 43 155 0.532 8.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 27473 27387 87 2 0 66 107 149 0.971 15.29 S.002 Sngl - 54899 54639 261 2 0 108 47 149 0.929 7.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:48778177_48978737|GENSCAN_predicted_peptide_1|628_aa MANKDVVGDIASALSRPPGVPQSTSVYPILGFCVFCFCVTVHSENRCRLVKLLWVTKNQN GLRTMSLDFGSVALPVQNEDEEYDEEDYEREKELATVPTSHTDMGLCTQTCLLHSGKFSA RALRAFEFMTLGYNEIQSLYAGEKCGNVWEENRSKTEDRHPVYHPEEGGDEGGSGYSPPS KCEQTDLYHLPENFRPYTNGQKQEFNNQATNVIKFSDPQWNHFQGPSCQGLEPYNKVTYK PYQSSAQNNGSPAQEITGSDTFEGLQQQFLGANENSAENMQIIQLQVLNKAKERQLENLI EKLNESERQIRYLNHQLVIIKDEKDGLTLSLRESQKLFQNGKEREIQLEAQIKALETQIQ ALKVNEEQMIKKSRTTEMALESLKQQLVDLHHSESLQRAREQHESIVMGLTKKYEEQVLS LQKNLDATVTALKEQEDICSRLKDHVKQLERNQEAIKLEKTEIINKLTRSLEESQKQCAH LLQSGSVQEVAQLQFQLQQAQKAHAMSANMNKALQEELTELKDEISLYESAAKLGIHPSD SEGELNIELTESYVDLGIKKRARARVGKDSIHPSASSSALARRKLWKPNRLELLSITANS STHLRVFDPHQNVHSYDPKLSKKGFQFG >gi568815583f:48778177_48978737|GENSCAN_predicted_CDS_1|1887_bp atggcaaacaaggatgttgtaggagatatagccagtgctttgtccagaccccctggagtc cctcaaagcacttctgtctaccccatccttggcttctgtgtgttttgcttctgtgtcact gttcattctgagaacagatgcaggctagtgaagctgctttgggtcaccaagaaccagaac ggcctgaggaccatgtcattagactttggcagtgtggcactaccagtgcaaaatgaagat gaagagtatgacgaagaggactatgaaagagagaaagagctggccacagtccccacttct cacacagacatgggcttatgcactcagacctgcctgcttcactctggtaaattctctgcc agggccctgagggcatttgagtttatgaccctgggctataatgaaattcagagtttatat gctggagaaaaatgtggtaatgtctgggaagaaaatagaagtaaaactgaagaccgacat cctgtgtaccatcctgaagaaggtggagatgaaggtggaagtggttatagtcctccaagt aaatgtgaacagactgatttatatcaccttcctgaaaactttaggccatataccaatggt cagaagcaggaatttaataaccaagcaaccaatgtaattaaattttcagatcctcaatgg aaccattttcagggtcccagttgtcaaggtttggaaccgtataataaagtgacatataaa ccttatcagtcttctgcccagaataatggctcaccagcccaggagataacaggaagtgac acattcgaaggcctgcaacaacaatttttaggagctaatgagaactctgcagaaaatatg cagattattcaacttcaggttcttaacaaagcaaaagagagacaactggagaacttaatt gaaaagttaaatgaaagtgaacgtcaaattcgatatctgaatcaccagcttgtaataata aaagatgaaaaggatggtttgactctcagccttcgagaatcacagaaactctttcagaat ggaaaagaaagagagatacagcttgaagctcaaataaaagcactggagactcagatacaa gcattaaaagtcaatgaagaacagatgatcaagaagtccagaacaactgaaatggctctg gaaagcttgaagcagcagctggtggaccttcatcattctgaatcacttcaacgagctaga gaacagcatgagagcattgttatgggcctcacaaagaagtacgaagagcaagtattgtcc ttacaaaagaatttggatgccacagtcaccgcacttaaagaacaggaagacatttgctct cgtctgaaagatcacgtgaaacaactggaaaggaatcaagaagcaatcaagttagaaaag actgagatcattaataagttgacaagaagtctagaggagagtcaaaagcagtgtgcccac ttgttgcagtccgggtcagtacaagaggtggctcagctacagttccagctgcagcaagca cagaaggcacatgctatgagtgcaaacatgaacaaggctttgcaagaagaattaacagaa ctaaaagatgaaatttctctctatgaatctgctgcaaaactaggaatacatccaagtgac tcagaaggagaattaaatatagaactcactgaatcgtatgtggatttgggtattaaaaag agggctagggcaagggttgggaaagacagtatccatccttcggccagcagctcagcctta gcaaggagaaaactatggaaacccaacaggctagagttgctgagtatcacagcaaatagc tctacacatttacgtgtgtttgatccacatcaaaatgtccacagttatgatcccaagttg tcaaagaaaggcttccagtttggttag >gi568815583f:48778177_48978737|GENSCAN_predicted_peptide_2|92_aa MVGTTQMASSGSTDDSGHQKSFPKAEPGSKQIAFSTSSSCSLHNSSLARFGHRCGPVTAV PHCCIPPFCESECKCDDEGPSSHFGPEMETRC >gi568815583f:48778177_48978737|GENSCAN_predicted_CDS_2|279_bp atggtaggaacaacccagatggcatcatctggatccacagatgacagtggacaccaaaag tcctttcctaaagctgaaccaggatccaagcaaattgctttctccacttccagctcatgt agtcttcacaactctagcctggccagatttggtcatcgttgtggaccagtgactgctgtc ccacactgttgcattcctcccttttgtgaatctgaatgcaaatgtgatgatgagggccca agcagccactttggaccagagatggaaaccagatgttga >gi568815583f:48778177_48978737|GENSCAN_predicted_peptide_3|346_aa MGKFVPAALRAPLCSLKLDTTDYVAYVAKDPVNQRAENWAILGQAFGIGFNPGKWAYLYV IQGPPLSCPMTNLKKTISIEQHSMSMRGNNGTVKVHVLPEGACHILECHNGMAQDVISTI GQAFELRFKQYLKNPSLNTSCESEEVHIDSHAEEREDHEYYNEIPGKQPPVGGVSDMRIK VQATEQMAYCPIQCEKLCYLCCQTNLSKDTVHLKGIKWIPPPQYIADESCMYRDDRSSLD KIFKVLYCALKGGSFQERLDINTDVFGVILRDGNVHPRGVQSQRDTSLLKHTCRVDLFDD PCYINTQALQSTPGSAGNQRSAQPLGSPWHCGRQQSLDLGGDEMIV >gi568815583f:48778177_48978737|GENSCAN_predicted_CDS_3|1041_bp atgggaaagtttgtacctgctgcattgagagcacctctgtgctctctaaagctggatact acagactatgttgcctacgtagctaaagatccagttaatcaacgagctgaaaattgggca attcttggccaagcttttggaattggttttaatcctgggaaatgggcatatttatatgtg attcaaggcccacctctatcctgtcccatgaccaacctaaaaaagaccatttccatagag caacatagcatgtctatgagaggcaacaatggaactgtgaaagtacatgtgctccctgaa ggtgcctgtcacatattggaatgccacaatggaatggcccaagacgtcataagtaccata gggcaggcttttgaactccggtttaaacagtacttgaaaaatccttctttgaatacttct tgtgaaagtgaggaggtgcatattgatagccatgccgaggagagagaagatcatgaatat tacaatgaaattccagggaagcagccaccagtaggtggtgtttcagatatgcggatcaaa gttcaagccacggaacaaatggcttactgccccatacagtgtgaaaagttgtgctatttg tgctgccaaactaatctttctaaagacactgttcatctcaaaggcatcaagtggatccct cccccacagtatattgctgacgagagttgtatgtacagggatgatcgaagttctttggat aagattttcaaggtactctattgtgcgcttaaaggtggcagttttcaagagaggctagat ataaacaccgatgtttttggtgtcatcctacgggatggtaatgtccatccaagaggggtg cagtcccagcgagatacctcattattgaagcacacgtgccgagtggatctctttgatgac ccctgctacattaatacacaggctcttcaaagtacacctggctctgctggaaatcaaagg tcagcccaaccactggggagcccatggcactgcggaagacaacagtcactggatttagga ggagatgaaatgattgtatga >gi568815583f:48778177_48978737|GENSCAN_predicted_peptide_4|187_aa MSEMAELSELYEESSDLQMDVMPGEGDLPQMEVGSGSRELSLRPSRSGAQQLEEEGPMEE EEAQPMAAPEGKRSLANGPNAGEQPGQVAGADFESEDEGEEFDDWEDDYDYPEEEQLSGA GYRVSAALEEADKMFLRTREPALDGGFQMHYEKTPFDQLAFIEELFSLMVVNRLTEELGC DEIIDRE >gi568815583f:48778177_48978737|GENSCAN_predicted_CDS_4|564_bp atgtcggaaatggctgagttgtccgagctgtatgaagagagcagtgacctgcagatggat gtgatgcctggcgagggtgaccttccgcagatggaggtaggcagcgggagccgggagcta tccctgcgtccctcccgcagcggggcccaacagctcgaggaggaaggcccaatggaggag gaggaggcccagccaatggcggcgccagaggggaaacggagccttgctaacgggcccaac gctggggagcagccaggccaggtggcgggcgcagacttcgagagcgaggacgagggcgag gaatttgatgactgggaggacgactacgactatcccgaagaggagcagctcagtggtgcc ggctacagagtatcagccgctcttgaagaagccgacaagatgtttctgagaacaagagaa ccagccctggatggcgggtttcagatgcattatgagaagaccccgtttgatcagttagct tttatcgaagagcttttttcactgatggttgtcaatcgtctgaccgaagaactcggctgt gatgagattattgatagagagtag >gi568815583f:48778177_48978737|GENSCAN_predicted_peptide_5|291_aa MDYHGTTILPQYSAVLVKDVWSRPPSESTWLWAGSGGRLHRVLRCELSMGLSAVDTSACS SGGGRNDLLGGPASSQGLSTASSTLGFRSALQIDSAPVCFPSNSVNSMRVEALSYSLEQF QELYGYSINIRVKKKESMNEWRPLPPQSLGLASLYPAVLALERVDSPHRLESRPGSPPIP PGGLVGEENIPASWLGVWENIFQEKFKAQALHLECDEHWDARGYGSVSERLQTSLRESGK KGQRKDAGASYEGQAMRGDGGGISRQWAQHVQEIPKLPFLSFPGSQEPKEM >gi568815583f:48778177_48978737|GENSCAN_predicted_CDS_5|876_bp atggattaccatggaactaccatattgccacaatacagtgctgtcctggttaaagatgtc tggtccaggccacccagtgagtctacctggctctgggctggttctgggggtcgtctgcac agagtcctgcgatgtgaactgtctatgggtctctcagctgtggataccagtgcctgttcc agtggagggggcaggaatgacctgctagggggcccagcgagctcccagggcctttctact gcttcctctacccttggatttcgttcagctctccaaattgactcagctccagtctgtttc cccagcaacagtgtaaattccatgagggtagaggctttgtcttattcacttgaacagttt caggaactttatggatactccataaacattcgagtgaaaaagaaagaatcaatgaatgaa tggcgccctcttccaccccagagcctaggtcttgcgtccctatatcctgcagtcctggct ctggaaagagtggactccccacacaggctggagagcaggcctggatcccctccaataccc ccaggaggtcttgttggtgaggagaatatacctgcatcttggctaggagtatgggagaat atttttcaagagaagtttaaggcacaggccctgcacttggagtgtgatgagcactgggat gcaaggggctatgggagtgtgtcagagaggctccagaccagcctcagggaatcagggaag aagggtcagaggaaggacgctggggcgagttatgaagggcaggcaatgagaggtgatggt ggaggaatttccagacaatgggcacagcatgtgcaagagatccccaaactgccatttctc agcttccctggaagccaggagcccaaggagatgtga >gi568815583f:48778177_48978737|GENSCAN_predicted_peptide_6|316_aa MQRAPRPALARVLRLRVGHDEAIYATVSCFSTAKAMRERGQDSLAGLVLYVGLFGHPGML HRAKYSRFRNESITSLDEGSSGGSVGNKGSPQPPHPALAPHLPTEDATLPSQESPTPLCT LIPRMASMKLANPATLLSLKNFCLGTKEVPRLKLQESRDPGSSGPSSPETSLSRSGTAPP PQQDLVGHRATALTPDSCPLPGPGEPTLRSRQDRHFLQHLLGMGMNYCVRHPLTNTELAS PVRQKGGCYTGMGGGNLHISLDWSSPFMPLDTRALTARVLEIRSLHQLPHNDPVEWVDGE ETDTKRGPLSGLDAYK >gi568815583f:48778177_48978737|GENSCAN_predicted_CDS_6|951_bp atgcagcgggccccccgaccggcgctcgcccgggttctgcgcctaagagttggccacgac gaggcgatttatgcaacagtatcctgtttcagcactgccaaggctatgcgagaacgcggc caggacagcctggcaggactcgtgctgtatgtaggactcttcgggcaccccgggatgctg cacagggccaagtacagccgctttcggaacgagtcgatcacgtccttggacgaaggtagc tccggaggctcggtcgggaacaagggctcgccgcagcctccccaccccgccctggcacct cacctgccgactgaagatgccaccttgccgtcgcaggagagccccaccccactgtgcacc ttgatcccccgcatggcaagcatgaagctggccaacccggccactttgctgagtctgaaa aacttttgcctgggtaccaaagaggtgcctcggctgaagctccaggaaagccgggaccca ggttccagcggcccctcttccccagaaaccagtttaagtaggtccgggactgcacctcca ccgcagcaggacctggtgggacacagggcaaccgccctaacccctgattcgtgcccgctt cctggccctggggagccaacacttaggagcaggcaggacaggcactttctacagcacctg ttggggatgggcatgaactactgtgtgaggcaccctcttactaacactgagctggcttca ccagtaaggcagaaaggagggtgttatactggcatgggtggaggcaaccttcacatatcc cttgattggtccagtcctttcatgcccctggacactagagccttaactgcaagagtctta gagatcagaagcttgcatcaattacctcacaatgaccctgtggagtgggtagatggagag gaaactgacaccaagagaggcccattgtctggccttgatgcttataagtag >gi568815583f:48778177_48978737|GENSCAN_predicted_peptide_7|318_aa MSWAPAWVCVEAMRGQRMGGITAVTVNQNVLSPLNLEVDPNMQGMHTQEKEQVKTLNKSA SFVDKVLFLEQQNKMLETNWSLLQQQKPAQSNMDNMFESYINNLLGQLEILGQEKLKLEA ELGNMQGLCMFFVKEDFKKKYEDEINKPTEMENEFVLIKKDVDDAYMNKVELEFRLEGLT DEINFLRQLYEEEIPELQSQILDTSVVLSMDNSRSLDMDSIIAEIKYEELQTLAGKHWDD RRLTKTKISEMNRNISRLQAEIEGLPQRPEQRASLESVIADAEQRGELAIKDAKTKLSEL EAAQQRAKQDMAHGTAAA >gi568815583f:48778177_48978737|GENSCAN_predicted_CDS_7|957_bp atgtcctgggcaccagcatgggtctgtgtggaggctatgcggggccagcgtatgggaggc atcacagccgtcacagtcaaccagaacgtgctgagcccccttaacctggaagtggacccc aacatgcagggcatgcacacccaggagaaagagcaggtcaagaccctcaacaagtctgcc tccttcgttgacaaggtactgttcctggaacagcagaacaagatgctggagaccaactgg agcctcctgcagcagcagaagccggctcagagcaacatggacaacatgttcgagagctac atcaacaaccttctggggcagctagagatcctgggccaggagaagctgaaactggaggca gagcttggcaacatgcaggggctctgcatgtttttcgtgaaggaggacttcaagaaaaag tacgaggacgagatcaataagcctacagagatggagaatgaatttgtcctcatcaagaag gatgtggatgacgcttacatgaacaaggtagagctagagtttcgcctggaggggctgact gacgagatcaacttcctcaggcagctgtatgaagaggagatcccagagctgcagtcccag atcttggacacatctgtggtgctgtccatggacaacagccgctccctggacatggacagc atcatcgcggagatcaagtatgaggagctgcagacactggctgggaagcactgggatgac cggcgtcttacaaagacgaagatttctgagatgaaccggaacatcagccggctccaggct gagattgagggcctgcctcaaaggccagagcagagagcttccctggagtccgtcatcgca gatgctgagcagcggggggagctggccattaaggacgccaagactaagctgtccgagctg gaggccgcccagcagcgggccaagcaggacatggcacatggcacggcagctgcctga >gi568815583f:48778177_48978737|GENSCAN_predicted_peptide_8|48_aa XQVRIAVQHVCIGAFRTAGSSNLYWRPPARRSWPEDSALDRNTDRCPS >gi568815583f:48778177_48978737|GENSCAN_predicted_CDS_8|147_bp nggcaagttcgcattgctgtacaacacgtgtgcataggcgccttccggacggctggctcc agtaatttgtactggcgtccccctgctcgtaggtcctggccagaggactcagctctcgat agaaacactgaccgctgtcccagctag