GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:17:44 Sequence gi568815597r:7820838_8039994 : 219157 bp : 44.31% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1385 1387 3 1 0 108 81 0 0.441 1.30 1.02 Intr + 4054 4081 28 0 1 102 105 -9 0.681 -0.11 1.03 Intr + 5643 5873 231 1 0 80 102 167 0.947 15.04 1.04 Intr + 6281 6486 206 0 2 112 4 162 0.502 9.12 1.05 Intr + 9074 9324 251 1 2 22 72 220 0.381 10.04 1.06 Intr + 16162 16312 151 1 1 107 82 181 0.924 19.56 1.07 Term + 16572 16583 12 2 0 64 46 -7 0.215 -9.10 1.08 PlyA + 17062 17067 6 1.05 2.07 PlyA - 17360 17355 6 1.05 2.06 Term - 19330 19046 285 1 0 -27 43 298 0.912 9.20 2.05 Intr - 22900 22783 118 0 1 101 91 -3 0.603 1.77 2.04 Intr - 25670 25547 124 0 1 50 39 96 0.539 0.54 2.03 Intr - 30085 29975 111 2 0 54 98 60 0.528 3.95 2.02 Intr - 55594 55532 63 2 0 75 93 29 0.000 0.79 2.01 Init - 80198 80129 70 2 1 35 111 51 0.158 3.31 2.00 Prom - 95556 95517 40 -3.36 3.00 Prom + 105015 105054 40 -0.16 3.01 Sngl + 133081 133419 339 0 0 59 41 152 0.547 3.63 3.02 PlyA + 133959 133964 6 1.05 4.00 Prom + 140119 140158 40 -3.06 4.01 Init + 141949 142038 90 0 0 72 98 117 0.975 11.62 4.02 Intr + 144487 144588 102 0 0 74 68 74 0.925 4.37 4.03 Intr + 150057 150126 70 2 1 61 66 107 0.932 4.55 4.04 Intr + 156815 156901 87 0 0 66 107 59 0.900 5.54 4.05 Intr + 164057 164187 131 0 2 120 11 161 0.066 12.01 4.06 Intr + 165120 165191 72 2 0 17 99 69 0.052 0.50 4.07 Intr + 169231 169884 654 0 0 -77 19 1633 0.819 131.97 4.08 Term + 182546 182668 123 1 0 75 50 70 0.294 0.28 4.09 PlyA + 185872 185877 6 1.05 5.04 PlyA - 186500 186495 6 1.05 5.03 Term - 193559 192373 1187 0 2 95 41 496 0.844 37.52 5.02 Intr - 194547 194471 77 2 2 88 76 13 0.609 -0.74 5.01 Init - 194707 194658 50 1 2 95 106 21 0.536 5.02 5.00 Prom - 196856 196817 40 -8.26 6.00 Prom + 197167 197206 40 -0.16 6.01 Init + 206288 206324 37 1 1 84 89 37 0.322 3.67 6.02 Term + 206585 206706 122 0 2 86 44 95 0.328 3.54 6.03 PlyA + 206736 206741 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 164057 164217 161 0 2 120 43 183 0.892 15.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:7820838_8039994|GENSCAN_predicted_peptide_1|293_aa MVVFSSHECPARDATLFCEPWTLNMQPAPLTSEEFKHVGLTAAVLSAHTQKEEQNYVDKF REKILSSPYSSYLQQESRSKAKYSYFQGDSTSKQTRSAGCRKGKHKRKKLPEPPDSSSSN TGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMGESIPSYCQRSVHRIASHEESIPS YCQRSVHRIASHEESIPSYCQHTVHGIASQQDSIPSYCHCSVHGVTSQRIPIQNWFSSIR VKEVVLKEDLEKLESMRQQQPQFSHGQKEELAKVYNWIQSQTVTQEIDIQLEC >gi568815597r:7820838_8039994|GENSCAN_predicted_CDS_1|882_bp atggtggttttttcctcccatgaatgccctgccagggatgctaccctcttctgtgagccc tggaccctgaacatgcagccagcccctttgacctcggaagaatttaaacacgtggggctc acagcggctgttctgtcagcgcacacccagaaggaagagcagaattatgttgataaattc cgagaaaagatcctgtcatcaccctacagctcctatcttcagcaagaaagcaggagcaaa gctaaatattcatattttcaaggagattctacttccaagcagacgcggtcggccggctgc aggaaagggaagcacaagcggaagaagctgccggagccgccagacagcagcagctcgaac accggctctggtccccgcaggggagcgcatcagaacgcacagccctgctgcccctccgcg gcctcctctccgcacacctcgagcccgaccttcccacctgccgccatgggagaatccatc ccatcctactgccagcgctctgtccacaggatcgcctcccatgaagaatccatcccatcc tactgccagcgctctgtccacaggatcgcctcccatgaagaatccatcccatcctactgc cagcacactgtccatgggattgcctcccagcaggactccatcccatcctactgccactgt tctgtccacggggtcacctcccagcgaatccccatccagaactggttcagcagcatcagg gttaaagaagttgtactaaaagaagacctggaaaagctagaaagtatgaggcagcagcag ccccagttttctcatgggcaaaaggaggagctggctaaggtgtataattggattcaaagc cagactgtcactcaagaaatcgacattcaattggaatgttag >gi568815597r:7820838_8039994|GENSCAN_predicted_peptide_2|256_aa MIREIETEVKIFVKLNLDLDDFPAHAGTSIGGNRLGGPIFRPPKAPHEDARLTPEELERA SLLQILPEMLGAERGDILRKAEGSATTKEQHTTCVNLVTDHHKFPREYRSQSVYPKDIKN LAGPHNTALFKGKQKAVPVRPVVGLERPLLYKLIGKKELTVLRDSKADLSRQKKEYTYED RTIETVKSDEQKEKRLKKSQQGLEDMWETIEWTNISTTGAPEEEKEKEAERLFEKIMAEN SPSLMKNMNVKVQEAK >gi568815597r:7820838_8039994|GENSCAN_predicted_CDS_2|771_bp atgattagggaaattgaaacagaagtaaaaatctttgtcaaattaaacttagacctggat gactttccagcacatgctggcaccagcattggtggcaacaggctgggtggacccatcttc aggcccccaaaagcacctcatgaagacgcgcgcttaactccggaggagctagaaagagct tcccttctacagatactgccagagatgctgggtgcagaaagaggggatattctcaggaaa gcagaggggtctgcaacaaccaaagagcaacacacgacttgtgtcaatcttgtcactgac catcacaagtttcctagagagtacaggtctcagagcgtttaccctaaagacatcaagaac ttggccggtccccacaacactgccctgttcaagggcaaacaaaaggcagtgccagtcaga cctgtggtagggctggagcgacctcttttgtataaattgattggaaagaaagagttaact gttttgagggattcaaaagccgatctaagcaggcagaagaaagaatacacgtatgaagac cggacaattgaaactgtcaagtccgacgaacagaaagaaaaaagactgaagaaaagtcaa cagggcctggaggacatgtgggaaaccatcgagtggaccaacataagcactacaggggcc ccagaggaagaaaaagagaaagaggcagaaagattattcgaaaaaataatggctgaaaac tccccaagtttgatgaaaaacatgaatgtaaaagtccaagaggctaaatga >gi568815597r:7820838_8039994|GENSCAN_predicted_peptide_3|112_aa MGTGKSSSFFAEPAGCGHQNESGPSRCQGLDLPPCIRQAAHTPRKSARSLRQSDSRAQRP TRDALKEVTFRGCYSQTRPSSTRRIYVGSDPCKHRRDRASGWEHATVPPTAT >gi568815597r:7820838_8039994|GENSCAN_predicted_CDS_3|339_bp atgggaacaggcaaatcgtcgtcgttctttgctgagcctgcgggctgcggtcaccagaac gaatctgggcccagcaggtgccaagggctcgacctcccgccttgcatccggcaggccgca cacacccctcggaagtcggcgagatcgctgaggcaaagcgactccagggcgcagagaccc acacgcgacgctctaaaggaagtgacctttaggggctgttactctcagaccaggcccagc agcacccggcgcatttacgtcggatctgacccctgcaagcaccggcgcgaccgcgctagc ggctgggaacacgcgacggtcccgcccactgcgacctga >gi568815597r:7820838_8039994|GENSCAN_predicted_peptide_4|442_aa MASKRALVILAKGAEEMETVIPVDVMRRAGIKVTVAGLAGKDPVQCSRDVVICPDASLED AKKESAAVKEILKEQENRKGLIAAICAGPTALLAHEIGFGSKVTTHPLAKDKMMNGGHYT YSENRVEKDGLILTSRGPGTSFEFALAIVEALNGKEVAAQFLLGGHMIQTQPIRALFTTG RVTQNVNAGFTTITITTTTITITISIIINIPTVVTTTTITITIITITITIITITTTVITI TLTTLTITLTTITTLTTTTITTIITTSITLTTITITTTITFTTTTTITTSITLTITLTTT ITTLTTTITITTSITLTTTITITTTITFTTTTITTSITFTITLTTTITTTNTITLTTTII TITTTITITLTTTITISITTTTTTVTIIITTIIIFFTSCSLQEKDLPTSGFVTRKGIENI SSKSGYQILEGNLSDLESVMEK >gi568815597r:7820838_8039994|GENSCAN_predicted_CDS_4|1329_bp atggcttccaaaagagctctggtcatcctggctaaaggagcagaggaaatggagacggtc atccctgtagatgtcatgaggcgagctgggattaaggtcaccgttgcaggcctggctgga aaagacccagtacagtgtagccgtgatgtggtcatttgtcctgatgccagccttgaagat gcaaaaaaagagtctgctgctgtgaaggagatactgaaggagcaggaaaaccggaagggc ctgatagccgccatctgtgcaggtcctactgctctgttggctcatgaaataggttttgga agtaaagttacaacacaccctcttgctaaagacaaaatgatgaatggaggtcattacacc tactctgagaatcgtgtggaaaaagacggcctgattcttacaagccgggggcctgggacc agcttcgagtttgcgcttgcaattgttgaagccctgaatggcaaggaggtggcggctcaa ttcctcctgggcgggcacatgatccagacccagccaatccgagccctgtttaccactggc cgtgtgacccagaatgtgaatgctggattcaccaccatcaccatcaccaccaccacaatc actatcaccatctccatcatcatcaacatccccactgttgtcaccaccaccaccatcact atcaccatcattaccatcaccatcaccatcatcaccatcactaccaccgtcatcaccatc accctcaccaccctcaccatcaccctcaccactatcaccacactcaccaccaccaccatc accaccatcattaccaccagcatcaccctcaccaccatcaccatcaccaccaccatcacc ttcaccaccaccaccaccatcaccaccagcatcaccctcaccatcaccctcaccaccacc atcaccacactcaccaccaccatcaccatcaccaccagcatcaccctcaccaccaccatc accatcaccaccaccatcaccttcaccaccaccaccatcaccaccagcatcaccttcacc atcaccctcaccaccaccatcaccactaccaacaccatcaccctcaccactaccatcatc accatcaccaccaccatcaccatcaccctcaccaccaccatcaccatcagcatcaccacc accaccaccaccgtcaccatcattatcaccaccatcatcatctttttcacttcttgcagc ttgcaggaaaaagacctgccaacatcaggatttgtaaccagaaaaggcattgaaaacata tcctcaaaatcaggctatcaaattttagaaggaaacctttcagacctagaatcagttatg gagaagtga >gi568815597r:7820838_8039994|GENSCAN_predicted_peptide_5|437_aa MGNMRKTYWSSRSEFKNNFLNIDPITMAYSLNSSAQERLIPLGHASKSAPMNGHCFAENG PSQKSSLPPLLIPPSENLGPHEEDQVVCGFKKLTVNGVCASTPPLTPIKNSPSLFPCAPL CERGSRPLPPLPISEALSLDDTDCEVEFLTSSDTDFLLEDSTLSDFKYDVPGRRSFRGCG QINYAYFDTPAVSAADLSYVSDQNGGVPDPNPPPPQTHRRLRRSHSGPAGSFNKPAIRIS NCCIHRASPNSDEDKPEVPPRVPIPPRPVKPDYRRWSAEVTSSTYSDEDRPPKVPPREPL SPSNSRTPSPKSLPSYLNGVMPPTQSFAPDPKYVSSKALQRQNSEGSASKVPCILPIIEN GKKVSSTHYYLLPERPPYLDKYEKFFREAEETNGGAQIQPLPADCGISSATEKPDSKTKM DLGGHVKRKHLSYVVSP >gi568815597r:7820838_8039994|GENSCAN_predicted_CDS_5|1314_bp atggggaatatgaggaagacctactggagcagtcgcagtgagtttaaaaacaacttttta aatattgacccgataaccatggcctacagtctgaactcttctgctcaggagcgcctaata ccacttgggcatgcttccaaatctgctccgatgaatggccactgctttgcagaaaatggt ccatctcaaaagtccagcttgccccctcttcttattcccccaagtgaaaacttgggacca catgaagaggatcaagttgtatgtggttttaagaaactcacagtgaatggggtttgtgct tccacccctccactgacacccataaaaaactccccttcccttttcccctgtgcccctctt tgtgaacggggttctaggcctcttccaccgttgccaatctctgaagccctctctctggat gacacagactgtgaggtggaattcctaactagctcagatacagacttccttttagaagac tctacactttctgatttcaaatatgatgttcctggcaggcgaagcttccgtgggtgtgga caaatcaactatgcatattttgataccccagctgtttctgcagcagatctcagctatgtg tctgaccaaaatggaggtgtcccagatccaaatcctcctccacctcagacccaccgaaga ttaagaaggtctcattcgggaccagctggctcctttaacaagccagccataaggatatcc aactgttgtatacacagagcttctcctaactccgatgaagacaaacctgaggttcccccc agagttcccatacctcctagaccagtaaagccagattatagaagatggtcagcagaagtt acttcgagcacctatagtgatgaagacaggcctcccaaagtaccgccaagagaacctttg tcaccgagtaactcgcgcacaccgagtcccaaaagccttccgtcttacctcaatggggtc atgcccccgacacagagctttgcccctgatcccaagtatgtcagcagcaaagcactgcaa agacagaacagcgaaggatctgccagtaaggttccttgcattctgcccattattgaaaat gggaagaaggttagttcaacacattattacctactacctgaacgaccaccatacctggac aaatatgaaaaattttttagggaagcagaagaaacaaatggaggcgcccaaatccagcca ttacctgctgactgcggtatatcttcagccacagaaaagccagactcaaaaacaaaaatg gatctgggtggccacgtgaagcgtaaacatttatcctatgtggtttctccttag >gi568815597r:7820838_8039994|GENSCAN_predicted_peptide_6|52_aa MKDVKGQCGTVDGLCLLRRPELAPTSFGFGEATDAQGQSPSGPVGGGFSGRQ >gi568815597r:7820838_8039994|GENSCAN_predicted_CDS_6|159_bp atgaaggatgtcaagggacagtgtgggacagtggacggtttgtgtttactcaggcgacct gagctggccccgacgtcttttggctttggagaggccacggatgcccaaggacaaagcccc agcgggcctgtgggaggaggtttctcggggcgccagtga