GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:32:12

Sequence gi568815592r:158536830_158741362 : 204533 bp : 46.32% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   2830   2714  117  0  0   15  105    83 0.149   2.28
 1.01 Init -   4943   4765  179  2  2   76  105    91 0.713   8.33
 1.00 Prom -   5458   5419   40                              -4.96

 2.00 Prom +  15811  15850   40                              -5.46
 2.01 Init +  23260  23403  144  0  0   32   78   247 0.172  16.22
 2.02 Term +  24823  24912   90  0  0   70   54    47 0.087  -2.78
 2.03 PlyA +  25087  25092    6                              -0.45

 3.00 Prom +  25337  25376   40                              -4.76
 3.01 Init +  28194  28324  131  2  2   86   20   173 0.451   8.00
 3.02 Intr +  36591  36694  104  0  2  112   94   206 0.804  23.42
 3.03 Intr +  48475  48596  122  2  2  117   84   115 0.987  14.21
 3.04 Intr +  52843  52953  111  0  0   77   98   106 0.977  11.08
 3.05 Intr +  68438  68518   81  1  0   53   74   102 0.348   5.13
 3.06 Intr +  70415  70514  100  1  1   62   64    43 0.581  -1.02
 3.07 Intr +  71504  71634  131  0  2   83   99   187 0.806  19.71
 3.08 Intr +  73100  73174   75  1  0   55   70    75 0.556   2.11
 3.09 Intr +  88275  88377  103  2  1  114  106    61 0.681  10.25
 3.10 Intr +  92895  92990   96  1  0   69  103    37 0.904   3.28
 3.11 Intr +  94494  94560   67  1  1   65  106    96 0.995   7.06
 3.12 Term +  94981  95059   79  1  1   51   52   198 0.999   9.74
 3.13 PlyA +  96243  96248    6                               1.05

 4.08 PlyA -  96292  96287    6                               1.05
 4.07 Term - 100068  99998   71  1  2  143   43    76 0.995   6.70
 4.06 Intr - 101065 100942  124  1  1   91   20    80 0.960   1.66
 4.05 Intr - 104531 104490   42  2  0  119   94    33 0.907   5.44
 4.04 Intr - 117724 117555  170  2  2  108    1    82 0.014   1.17
 4.03 Intr - 118489 118437   53  0  2   72   89    18 0.381  -1.15
 4.02 Intr - 119527 119346  182  1  2   73   76    56 0.357   1.57
 4.01 Init - 122912 122820   93  2  0   51   61    92 0.367   3.18
 4.00 Prom - 125164 125125   40                              -8.06

 5.00 Prom + 125425 125464   40                              -6.56
 5.01 Init + 126440 126549  110  1  2   80   78    84 0.963   6.29
 5.02 Intr + 128512 128784  273  1  0   62  115   208 0.935  17.55
 5.03 Intr + 146096 146160   65  2  2  103   84    71 0.472   6.56
 5.04 Term + 154033 154115   83  2  2   69   35    89 0.359  -0.44
 5.05 PlyA + 154353 154358    6                               1.05

 6.00 Prom + 163132 163171   40                              -6.46
 6.01 Init + 166395 166555  161  2  2   61   28   216 0.511  10.25
 6.02 Intr + 171493 171562   70  1  1   84   59    85 0.982   4.38
 6.03 Intr + 176241 176378  138  2  0   82   64    61 0.930   3.76
 6.04 Intr + 176971 177049   79  0  1  101  106    58 0.814   8.02
 6.05 Term + 181258 181391  134  2  2  101   52    67 0.426   2.65
 6.06 PlyA + 183267 183272    6                               1.05

 7.03 PlyA - 184571 184566    6                               1.05
 7.02 Term - 189154 188940  215  2  2   58   38   163 0.793   5.69
 7.01 Intr - 204169 204059  111  2  0   67   47    80 0.166   2.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  80093  79050 1044  2  0   43   41   272 0.991  15.15
S.002 Init - 107879 107853   27  2  0   81   76    33 0.857   1.07
S.003 Term - 117724 117480  245  2  2  108   53    99 0.859   4.46
S.004 Sngl + 139149 139802  654  2  0   86   54   179 0.964  10.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:158536830_158741362|GENSCAN_predicted_peptide_1|99_aa
MRPSDMLTSKSVQMRQEITLALNKKHGSQTFKPECNLIKGVKAERTVGQHGGVETEHEPS
KYANSHAAEQRGQSCFVNGALAAVFFASPCYEKTANHVS

>gi568815592r:158536830_158741362|GENSCAN_predicted_CDS_1|297_bp
atgaggccctctgatatgcttacatcaaaatcagtacagatgcgtcaggaaataacgttg
gccttaaataaaaaacatggctcccagacctttaaacctgaatgtaacctgatcaaaggt
gtcaaagcagaaaggacggttggccagcatggaggcgtggaaacggaacatgagccaagc
aaatatgcaaacagccatgctgctgaacagaggggccaatcatgttttgtaaacggagct
ttagcagctgtgttctttgcttcgccctgttatgagaaaactgccaaccacgtgagn

>gi568815592r:158536830_158741362|GENSCAN_predicted_peptide_2|77_aa
MSFPRPAAAQPLSLRLRLRLPLPRLLRGAWRARDARAGAEGSGRRDGADRELDQRKQLTK
ERSRGGQAKRSLNFVRV

>gi568815592r:158536830_158741362|GENSCAN_predicted_CDS_2|234_bp
atgagttttccgcggccggccgctgctcagccgctgtcgctccggctccggctgcggctg
ccgctgccgaggctgctgcgcggcgcctggcgggctcgggacgcgcgggccggggccgag
ggctctgggcgccgagatggagccgacagggaactagaccaaagaaagcagttgaccaag
gaaaggtcccgaggaggacaagcaaaacgcagcttgaattttgtgcgagtttga

>gi568815592r:158536830_158741362|GENSCAN_predicted_peptide_3|399_aa
MKVSLTPLGPLLRMLVLLGSVEVGMLVLASLTSLSLSLQDSNLRLAPMRLYTLSKRHFVL
VFVVFFICFGLTIFVGIRETSIKTSFPMTVKVDGVAQDGTTMYIHNKVHNRTRTLTCAGK
CAEIIVAHLGYLNYTQYTVIVGFEHLKLPIKGMNFTWKTYNPAFSRLEIWFRFFFVVLTF
IVTCLFAHSLRKFSMRDWGIEQKWMSVLLPLLLLYNDPFFPLSFLVNSWLPGMLDDLFQS
MFLCALLLFWLCVYHGIRVQVEEYVRTAMRPTDVGKVLQQGFHGQGMKVFFMVVAAVYIL
YLLFLIVRACSELRHMPYVAPPAEFLSFYGLLNFYLYTLAFVYSPSKNALYESQLKDNPA
FSMLNDSDDDVIYGSDYEEMPLQNGQAIRAKYKEESDSD

>gi568815592r:158536830_158741362|GENSCAN_predicted_CDS_3|1200_bp
atgaaggtgtctctcacaccccttgggcccctgctgcggatgctggtgttgctgggttct
gtagaagtgggaatgctggtgctggcttccctcacatctctgtccctctcgctgcaggac
agcaatttgaggctggcgcccatgcggctctacacgctctccaagcgccactttgtcctc
gtgtttgtcgtcttcttcatctgctttggcctgaccatcttcgttgggatcagagaaact
tctattaagacaagctttcccatgactgttaaagtcgatggtgtagctcaagatggaacc
acgatgtacattcataacaaagttcacaaccggacaaggaccctcacatgtgcagggaaa
tgtgcggagattattgtggctcaccttggctacctgaactacactcagtatacagtgata
gtgggatttgaacacctgaagctccccatcaagggaatgaacttcacatggaagacttat
aaccctgccttctcccggttggaaatctggttccggtttttctttgtggtgctcaccttc
atcgtcacttgcctgtttgcgcattccctccggaaattttccatgagagactggggcatc
gagcagaagtggatgtctgttctcctgcctctgctgctactttacaatgatccgttcttc
cccctctccttcctggtcaacagctggctcccagggatgctggatgacctctttcagtcc
atgttcctgtgcgccctgctgctcttctggctgtgcgtgtaccacgggattcgtgtccag
gtggaggagtacgtcaggactgccatgaggcccaccgacgttgggaaagtacttcagcag
gggttccatggccagggaatgaaggtcttcttcatggtggtggcagcggtgtacattctg
tacctcttgttcttgatagtgcgggcgtgttccgagctacgtcacatgccttatgtggca
ccaccagccgagttcttatctttctatggcctgttgaacttctatctctacaccttggcc
tttgtatattctccatcgaagaatgccctctatgagtcccagctgaaagacaatcctgcc
ttctccatgctgaatgactcggatgatgatgtgatttatgggagtgactatgaggaaatg
ccgctgcagaacggccaggccatccgggccaagtacaaggaggagtcagatagtgactga

>gi568815592r:158536830_158741362|GENSCAN_predicted_peptide_4|244_aa
MKSQDAVYSFNTLKEIQKDILKMQTTKTIAQANDHMQPPRAAVMLLNGPRGRGRLTGDRK
GQDTCLAETPPPGLGEEGSCVRIGKEEKASLGAGSPYSSTAGSKDGQGARRPKSRKLSHC
SGIGRDAGSHGENFNLGVSIGLYLVMLYPSGTSASRQTQFGYGIECTAFVVDEVSNIVKE
AIESAIGGNAYQHSKVNQWTTNVVEQTLSQLTKLGKPFKYIGSCTVRWENKTMYCIVSAF
GLSI

>gi568815592r:158536830_158741362|GENSCAN_predicted_CDS_4|735_bp
atgaaaagtcaagatgctgtctatagctttaatacactgaaagaaatacagaaagacatc
ctcaaaatgcaaacgactaagacaattgcccaggcaaacgaccacatgcaaccacccaga
gcagcagtgatgttgctgaatggaccacgaggaagaggaagactgacaggtgacaggaaa
ggacaggacacctgcctggctgagacgccaccgccaggcctgggagaagaggggagttgt
gtgaggattggaaaggaggagaaggcatctctaggggcaggaagcccatattcaagtaca
gcaggaagtaaggacgggcagggggcaagaaggcccaagagtaggaagttaagccactgt
tctgggattgggagggacgcgggcagccatggggaaaacttcaatctaggtgtttccatt
ggtctttatctggtcatgctgtatcccagtggaacttcggcatccagacaaacacagttt
ggttacggcattgagtgtactgcttttgttgttgatgaagtgagcaacattgtaaaagag
gctatagaaagcgcaattggtggtaacgcttatcaacacagcaaagtgaaccagtggacc
acaaatgtagtagaacaaactttaagccaactcaccaagctgggaaaaccatttaaatac
atcgggagctgcactgtgcgatgggagaataagaccatgtactgcatcgtcagtgccttc
ggactgtctatttga

>gi568815592r:158536830_158741362|GENSCAN_predicted_peptide_5|176_aa
MAQEIDLSALKELEREAILQVLYRDQAVQNTEEERTRLGSGVPSNTIFFNSEGCRKLKTH
LQHLRWKGAKNTDWEHKEKCCARCQQVLGFLLHRGAVCRGCSHRVCAQCRVFLRGTHAWK
CTVCFEDRNVKIKTGEWFYEERAKKFPTGGSSPGQPGIITFCASIHGHLAIRAEGF

>gi568815592r:158536830_158741362|GENSCAN_predicted_CDS_5|531_bp
atggcccaagaaatagatctgagtgctctcaaggagttagaacgcgaggccattctccag
gtcctgtaccgagaccaggcggttcaaaacacagaggaggagaggacacgcctggggtca
ggtgttccatctaatactatcttctttaactctgagggctgcaggaaactgaaaacacac
ctgcagcatctccggtggaaaggagcgaagaacacggactgggagcacaaagagaagtgc
tgtgcgcgctgccagcaggtgctggggttcctgctgcaccggggcgccgtgtgccggggc
tgcagccaccgcgtgtgtgcccagtgccgagtgttcctgagggggacccatgcctggaag
tgcacggtgtgcttcgaggacaggaatgtcaaaataaaaactggagaatggttctatgag
gaacgagccaagaaatttccaactggaggaagctccccaggacaacctggcatcatcacc
ttctgtgcctccatccacgggcatttggcaatccgtgcagagggtttttaa

>gi568815592r:158536830_158741362|GENSCAN_predicted_peptide_6|193_aa
MLRAVPGRHGCQPAMCLRQALAASDLGPGLQLFLSLAKTGIPAKVTRQPDRAGCKISVVP
PTPPPVSESQCSRSPGRFQTETGVTFKILLYFLLEISNSPESFQYLRAHDVFIGQMDDWM
GGNLQEFGQFRGFNKSVENLFLSLATHVKKLSKSQNDMTSEKHLLATGPRQCVGQTERRS
QSDTAVNVTTRVP

>gi568815592r:158536830_158741362|GENSCAN_predicted_CDS_6|582_bp
atgctccgcgccgtgcctggcagacacggctgccaaccagccatgtgcctgagacaagcc
ctggcagcctcagacctgggccctggcctgcagctgttcctgtccctggcaaagacgggt
atccctgctaaagtcactcgccagccagaccgggcagggtgcaaaatttctgtggttcct
cctactccacctcctgtcagcgagagccagtgcagccgcagtcctggcaggtttcaaact
gagactggggtgaccttcaaaatcctgctctactttctcctggaaatttcaaattctcca
gagtcgttccagtacctacgtgctcatgatgtatttattggacagatggatgattggatg
ggtggtaatttacaggaatttggtcagtttagaggatttaataagtccgtggaaaatttg
tttctgtctcttgctacccacgtgaaaaagctctccaaatcccagaatgatatgacttct
gagaagcatcttctcgccacgggccccaggcagtgtgtgggacagacagagagacggagc
cagtctgacactgcggtcaacgtcaccaccagggtaccctga

>gi568815592r:158536830_158741362|GENSCAN_predicted_peptide_7|108_aa
XYYFFKHCENFVQGKHNQKDAENAFQEFIESQSTDFYAPTTWKKLTGALLPYLFTRVAEC
CIIESYRALVEWIDNESADTPGVTPRHRPEDNATLLKPVEAFKEWKDG

>gi568815592r:158536830_158741362|GENSCAN_predicted_CDS_7|327_bp
nactactacttcttcaagcattgtgagaactttgtgcagggaaaacacaaccagaaggat
gcagaaaatgctttccaagagttcattgaatcccaaagcacagatttttatgctccaacc
acttggaagaagctgaccggcgccctcctgccctatctgtttacgcgagttgcagaatgc
tgcattatagaaagctacagagcactggtggagtggattgataatgaatctgctgacaca
cctggagtaactccaagacacaggcctgaggataacgcaactttactaaaacctgtggaa
gcctttaaggaatggaaggatggataa