GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:16:38 Sequence gi568815592r:170435223_170653242 : 218020 bp : 43.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1378 1373 6 1.05 1.03 Term - 2913 2765 149 1 2 84 47 66 0.415 0.16 1.02 Intr - 3585 3400 186 1 0 79 70 104 0.466 7.36 1.01 Init - 4447 4195 253 1 1 75 50 83 0.210 0.70 1.00 Prom - 8629 8590 40 -5.86 2.00 Prom + 10848 10887 40 -4.26 2.01 Init + 14750 14987 238 1 1 89 45 238 0.895 15.77 2.02 Intr + 26988 27165 178 1 1 40 115 192 0.703 16.08 2.03 Intr + 27842 28032 191 2 2 58 81 42 0.286 -0.27 2.04 Intr + 29622 29702 81 1 0 68 113 113 0.563 11.41 2.05 Term + 33889 34139 251 2 2 -20 53 181 0.160 -0.33 2.06 PlyA + 36020 36025 6 1.05 3.05 PlyA - 37480 37475 6 1.05 3.04 Term - 38433 38116 318 0 0 40 42 147 0.020 0.18 3.03 Intr - 43784 43405 380 0 2 -67 74 334 0.380 11.18 3.02 Intr - 49797 49708 90 1 0 104 95 39 0.393 6.17 3.01 Init - 63607 63472 136 1 1 101 47 86 0.578 4.13 3.00 Prom - 91386 91347 40 -3.56 4.08 PlyA - 91615 91610 6 1.05 4.07 Term - 100183 99998 186 1 0 80 39 172 0.997 8.99 4.06 Intr - 102118 102012 107 2 2 80 95 68 0.998 6.63 4.05 Intr - 108508 108379 130 1 1 69 70 124 0.594 8.97 4.04 Intr - 110962 110881 82 0 1 73 71 52 0.383 1.64 4.03 Intr - 113891 113784 108 1 0 80 87 90 0.357 7.50 4.02 Intr - 118004 117908 97 0 1 54 60 39 0.013 -3.23 4.01 Init - 120166 119965 202 1 1 47 108 79 0.069 4.84 4.00 Prom - 120680 120641 40 -7.36 5.00 Prom + 121104 121143 40 -8.46 5.01 Init + 121808 121861 54 1 0 78 90 15 0.695 2.23 5.02 Intr + 126569 127011 443 1 2 132 73 491 0.172 44.25 5.03 Intr + 129323 129410 88 2 1 66 72 69 0.296 3.07 5.04 Intr + 131696 131787 92 1 2 85 53 46 0.285 -0.41 5.05 Intr + 134390 134557 168 2 0 107 97 26 0.469 4.46 5.06 Intr + 136188 136282 95 0 2 92 111 -45 0.742 -2.19 5.07 Term + 136964 137043 80 0 2 111 39 81 0.832 3.43 5.08 PlyA + 137612 137617 6 1.05 6.08 PlyA - 138088 138083 6 1.05 6.07 Term - 142495 142337 159 1 0 101 43 156 0.964 10.34 6.06 Intr - 143748 143635 114 0 0 54 80 39 0.559 0.34 6.05 Intr - 144883 144780 104 2 2 86 103 39 0.998 5.09 6.04 Intr - 147966 147835 132 1 0 108 94 105 0.997 13.72 6.03 Intr - 149447 149077 371 1 2 63 83 400 0.468 31.75 6.02 Intr - 153680 153553 128 2 2 99 103 -8 0.659 1.28 6.01 Init - 159489 159292 198 0 0 55 79 122 0.557 6.90 6.00 Prom - 162934 162895 40 -4.06 7.00 Prom + 166271 166310 40 -5.56 7.01 Init + 174665 174814 150 1 0 89 80 80 0.480 7.43 7.02 Intr + 191345 191575 231 1 0 83 60 90 0.033 3.67 7.03 Term + 204624 204782 159 2 0 53 47 216 0.186 11.94 7.04 PlyA + 205273 205278 6 1.05 8.02 PlyA - 205649 205644 6 1.05 8.01 Sngl - 213481 213122 360 1 0 70 38 159 0.535 5.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 117926 118111 186 1 0 52 45 200 0.872 9.59 S.002 Sngl - 196901 196713 189 2 0 72 42 161 0.819 5.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:170435223_170653242|GENSCAN_predicted_peptide_1|195_aa MNGLLQLPLSAPQLKLKTQELPLGSGISSNHPSAIAVHPRSLPEQLSRTSSWDCGPWTAL LTANCSPCVLITPGLQNGGTDVHCPSSPPPSLLIPSPQASASPTQPHSRAQIKVKPEQQM FVEPPPIKGFVARVTGTLHQLCEQMVNLVFPAFGLAPQNEVPNLSTITSMRIAQGERILV ITSLIASMIAATAFH >gi568815592r:170435223_170653242|GENSCAN_predicted_CDS_1|588_bp atgaacgggctgctgcagcttccattatcagctcctcaactcaagctaaaaactcaggaa ctccctctggggtctgggatcagcagcaaccacccctccgcaattgcagtgcaccctagg tcccttccggagcaactgagcagaacgtcttcctgggactgtggtccatggacggccctg ctaacagctaattgcagcccatgtgtcctcatcacaccaggtctccaaaacgggggcaca gatgtccactgcccctccagcccaccccccagcctgctgatccccagcccccaggcctca gcatcccccacccagccccattccagagcacagataaaagtgaaacctgagcagcaaatg tttgtggagccgccgccaatcaagggcttcgttgccagggtaaccgggacacttcatcag ctctgtgaacagatggtcaaccttgtattccctgcatttggtttggcccctcagaatgag gtccccaatctctccaccatcaccagcatgaggatagctcagggagaaaggatattggtg attacatcgttgattgccagcatgattgcagcaactgcctttcactga >gi568815592r:170435223_170653242|GENSCAN_predicted_peptide_2|312_aa MVGKGLRSGSLVVRAGLRTGSGSLVVRAGLRTGSGSLVVRAGLRTGSGSNSSSVLFSKSL SEPQFLQPARRAYNRIGVAAKLVTRRLPCGEDQADKMETTIQQVVRPQPQSPRLLKTVKA RSPGQPPPSTSLQLVVATRLRARRAPLTTRGSTWSIGQVWNGDLHIGRVWNGDLHIGRVW NGGLQSANIAKHLATHTLWCGDEDGNSHLLVVEGAQGVEDCVGCGIICKDKTIRLRGEPA DHLVQCSSQGATESPVCSETADVLWPQGATESPVCSETTDVLWPQGATESPVCSETTDVL WPQGAHGISRVL >gi568815592r:170435223_170653242|GENSCAN_predicted_CDS_2|939_bp atggtggggaagggcctccgctcagggtcgctcgtggtcagggccggcctgcggaccggc tcggggtcgctcgtggtcagggccggcctgcggaccggatcggggtcgctcgtggtcagg gccggcctgcggaccggctcgggctctaattccagctcggtgctcttcagcaagtcactc tctgagcctcagtttcttcagccagcaagaagagcttacaataggattggggtggcagca aagctggtcacccgacggctgccctgtggagaggaccaggcagacaaaatggaaaccacc atccagcaggtggtccggccacagccccagtcaccgagactcctgaagacggtcaaggcc aggagcccaggccagcccccaccttccacctccctgcagctggttgtggccacaaggctc agagctcgccgggcacccctcacaaccagggggtccacctggagcatcgggcaggtttgg aacggtgacctccacatcgggcgtgtttggaacggtgacctccacatcgggcgtgtttgg aacggaggcctccaatcagcgaacattgcaaagcacctagccacgcacacactctggtgt ggggatgaggacgggaattcacacctgctggtggtggagggggcccaaggtgtggaggac tgtgttggctgcggcatcatctgcaaagacaagaccatcagactcagaggagaacctgca gaccatctggtccagtgttctagccagggggccacagaatctcccgtgtgctctgagacc gctgacgtgctatggcctcagggggccacggaatctcccgtgtgctctgagaccactgac gtgctctggcctcagggggccacggaatctcccgtgtgctctgagaccactgacgtgctc tggcctcagggggcccacggaatctcccgtgtgctctga >gi568815592r:170435223_170653242|GENSCAN_predicted_peptide_3|307_aa MQWVLPALLPPSFCSRMLAVGYPNSTPQPYSHALGLHMSKRVFRLAHLPSIIKGHGNATL AKCCCLDLGLPGLRNGSENRWGSLYSSSHPNTVKGLTPVHLDTVKGLTPVHPDAVKGLTP VPPDAVKGLTPVPPDAVKGLTPVPPDAVKGLTPVPPDAVKGLTPVPPDAVKGLTPVPPDA VKGLTPVHPDAVKGLTPVPPGGQLLYVRHSARCRNRAELVAGSCGCSWRKGGLNIGWKKS YEEPGKEPVKFNCEHQSLGRQRDKMGQLAEEQHLCPWQCEHLSVSSCLWWKPAGSLYLLA VGGDTQH >gi568815592r:170435223_170653242|GENSCAN_predicted_CDS_3|924_bp atgcagtgggtcctgcctgctttgctacccccttcattctgttcacgcatgttagctgtg ggctaccccaactccactccccagccttacagccatgccctgggcctccacatgtccaag agggtcttcagactagcccatctgccttctatcattaaaggacacggcaacgcgaccctc gccaaatgctgttgcctggatcttggactccctggcctccgtaatggctctgagaaccgc tggggaagtctgtacagctccagtcatcccaatactgtgaagggcctcacccctgttcat ctcgacactgtgaagggcctcacccctgttcatcccgacgctgtgaagggcctcacccct gttcctcccgacgctgtgaagggcctcacccctgttcctcccgacgctgtgaagggcctc acccctgttcctcccgacgctgtgaagggcctcacccctgttcctcccgacgctgtgaag ggcctcacccctgttcctcccgacgctgtgaagggcctcacccctgttcctcccgacgct gtgaagggcctcacccctgttcatcccgacgctgtgaagggcctcacccctgttcctcct ggtgggcaactgctgtacgtcaggcattctgccaggtgcagaaacagagctgagctcgtg gccgggtcctgtggttgctcctggaggaaagggggcctgaatattggatggaaaaagtct tatgaggagcctggaaaagaacctgtcaagttcaactgtgagcaccagagcctgggcagg cagagagacaagatggggcagctggcagaggagcagcatctctgcccttggcagtgtgag catctgtctgtcagcagctgcctctggtggaagcctgcaggcagcttgtacctcctggca gtgggaggagacacccagcactaa >gi568815592r:170435223_170653242|GENSCAN_predicted_peptide_4|303_aa MEFRKCSMNRSNLARLDEFERSKTGGRKFRFRNLRLRQEIMTAGFKAIISMIRDLSRTFG GFRPLFATMYSAPGRDLGMEPHRAAGPLQLRFSPYVFNGGTILAIAGEDFAIVASDTRLS EGFSIHTRDSPKCYKLTDKTVIGCSGFHGDCLTLTKIIEARLKMYKHSNNKAMTTGAIAA MLSTILYSRRFFPYYVYNIIGGLDEEGKGAVYSFDPVGSYQRDSFKAGGSASAMLQPLLD NQVGFKNMQNVEHVPLSLDRAMRLVKDVFISAAERDVYTGDALRICIVTKEGIREETVSL RKD >gi568815592r:170435223_170653242|GENSCAN_predicted_CDS_4|912_bp atggaatttcggaagtgcagtatgaacagatctaatttggcaagactagatgaatttgag aggagtaaaactggaggacggaagtttcgtttcaggaatctaaggttgaggcaagagata atgacggcaggatttaaagccattataagtatgattagagatttatccaggaccttcggt ggcttcaggccactttttgcaaccatgtattcggctcctggcagagacttggggatggaa ccgcacagagccgcgggccctttgcagctgcgattttcgccctacgttttcaacggaggt actatactggcaattgctggagaagattttgcaattgttgcttctgatactcgattgagt gaagggttttcaattcatacgcgggatagccccaaatgttacaaattaacagacaaaaca gtcattggatgcagcggttttcatggagactgtcttacgctgacaaagattattgaagca agactaaagatgtataagcattccaataataaggccatgactacgggggcaattgctgca atgctgtctacaatcctgtattcaaggcgcttctttccatactatgtttacaacatcatc ggtggacttgatgaagaaggaaagggggctgtatacagctttgatccagtagggtcttac cagagagactccttcaaggctggaggctcagcaagtgccatgctacagcccctgcttgac aaccaggttggttttaagaacatgcagaatgtggagcatgttccgctgtccttggacaga gccatgcggctggtgaaagatgtcttcatttctgcggctgagagagatgtgtacactggg gacgcactccggatctgcatagtgaccaaagagggcatcagggaggaaactgtttcctta aggaaggactga >gi568815592r:170435223_170653242|GENSCAN_predicted_peptide_5|339_aa MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQ QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQAVAAAAVQQSTSQQATQGTSGQAPQ LFHSQTLTTAPLPGTTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDL KTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARV VQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRI VLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT >gi568815592r:170435223_170653242|GENSCAN_predicted_CDS_5|1020_bp atggatcagaacaacagcctgccaccttacgctcagggcttggcctcccctcagggtgcc atgactcccggaatccctatctttagtccaatgatgccttatggcactggactgacccca cagcctattcagaacaccaatagtctgtctattttggaagagcaacaaaggcagcagcag caacaacaacagcagcagcagcagcagcagcagcaacagcaacagcagcagcagcagcag cagcagcagcagcagcagcagcagcagcagcagcagcagcaacaggcagtggcagctgca gccgttcagcagtcaacgtcccagcaggcaacacagggaacctcaggccaggcaccacag ctcttccactcacagactctcacaactgcacccttgccgggcaccactccactgtatccc tcccccatgactcccatgacccccatcactcctgccacgccagcttcggagagttctggg attgtaccgcagctgcaaaatattgtatccacagtgaatcttggttgtaaacttgaccta aagaccattgcacttcgtgcccgaaacgccgaatataatcccaagcggtttgctgcggta atcatgaggataagagagccacgaaccacggcactgattttcagttctgggaaaatggtg tgcacaggagccaagagtgaagaacagtccagactggcagcaagaaaatatgctagagtt gtacagaagttgggttttccagctaagttcttggacttcaagattcagaatatggtgggg agctgtgatgtgaagtttcctataaggttagaaggccttgtgctcacccaccaacaattt agtagttatgagccagagttatttcctggtttaatctacagaatgatcaaacccagaatt gttctccttatttttgtttctggaaaagttgtattaacaggtgctaaagtcagagcagaa atttatgaagcatttgaaaacatctaccctattctaaagggattcaggaagacgacgtaa >gi568815592r:170435223_170653242|GENSCAN_predicted_peptide_6|401_aa MSQMLRKRFQEDPEDPLVAAISRLMLRRTPTVTSNTHQTQPPTWGQIKKLSQLAEENLRK AGQPVTDTIGETGCLPMLLLDLRSQLNPLGKTVLITLSTQLLYRVPDLWCPPLAFRPGAR FPPSDPAVGCAPRQPAPRMAAAGARPVELGFAESAPAWRLRSEQFPSKVGGRPAWLGAAG LPGPQALACELCGRPLSFLLQVYAPLPGRPDAFHRCIFLFCCREQPCCAGLRDHLDHIIP DHNFLFPEFEIVIETEDEIMPEVVEKEDYSEIIGSMGEALEEELDSMAKHESREDKIFQK FKTQIALEPEQILRYGRGIAPIWISGENIPQEKDIPDCPCGAKRILEFQVMPQLLNYLKA DRLGKSIDWGILAVFTCAESCSLGTGYTEEFVWKQDVTDTP >gi568815592r:170435223_170653242|GENSCAN_predicted_CDS_6|1206_bp atgagccagatgctgaggaagagattccaggaggatcctgaggaccccctggttgcagcc atctcaagactgatgctgaggaggaccccaactgtcacgagcaacacccatcaaacacag ccacccacttggggacagatcaagaagctgtcacagttggcagaagaaaacctgaggaaa gcaggacaaccagtcacagacacaattggggaaactggttgtttaccaatgcttttactt gacttacggagccaattaaacccccttggaaaaaccgtcctcataactttgtctacacag ctcctgtacagggttcctgacctgtggtgcccgcctcttgccttccggcccggcgcccga tttccgccttccgacccagctgtgggctgcgccccacgccagcccgcgccccgcatggct gccgccggggccaggcctgtggagctgggcttcgccgagtcggcgccggcgtggcgactg cgcagcgagcagttccccagcaaggtgggcgggcggccggcatggctgggcgcggccggg ctgccggggccccaggccctggcctgcgagctgtgcggccgcccgctctccttcctgctg caggtgtatgcgccgctgcctggccgcccggacgccttccaccgctgcatcttcctcttc tgctgccgcgagcagccgtgctgtgccggcctgcgagatcatctggaccatataattcca gaccacaacttcctttttccagaatttgaaattgtaatagaaacagaagatgagattatg cctgaggttgtggaaaaggaagattactcagagattatagggagcatgggtgaagcactt gaggaagaactggattccatggcaaaacatgaatccagggaagataaaatttttcagaag tttaaaactcagatagcccttgaaccagaacagattcttagatatggcagaggtattgcc cccatctggatttctggtgaaaatattcctcaagaaaaggatattccagattgcccctgt ggtgccaagagaatattggaattccaggtcatgcctcagctcctaaactacctgaaggct gacagactgggcaagagcattgactggggcatcctggctgtcttcacctgtgctgagagc tgcagcttgggtactggctatacagaagaatttgtgtggaagcaggatgtaacagataca ccgtaa >gi568815592r:170435223_170653242|GENSCAN_predicted_peptide_7|179_aa MKFHDRGQIQPKPSPAEAVNSAPQREPEGSILRAADPLVWAAKAMSASRAPRDLVLCVPA APATAKRGQGTAQAMASEGASLKLWQLPCGVEPVDSQKSRIEVWEPPPRFQRMYGNAWIS RQKFAAGMIYDLFRKRKVISFGGCIAQIFFIHVIGGVEMVLLIAMAFDSYVALLSPSTI >gi568815592r:170435223_170653242|GENSCAN_predicted_CDS_7|540_bp atgaagtttcatgacagaggccaaatccagcccaaacccagcccagcagaggctgtcaac tcagcgccccagcgagagccggaaggttccatcctcagagctgcagaccctctcgtgtgg gctgcaaaggccatgtctgcatcccgggcgcctagggacttggtgctctgtgtcccagct gctccagccacagctaaaaggggtcaaggtacagctcaggccatggcttcagagggtgca agcctcaagctttggcagcttccatgtggtgttgagcctgtggattcacagaagtccaga atcgaggtatgggaacctccacctagatttcagagaatgtatggaaatgcctggatctcc agacagaagtttgctgcagggatgatttatgacctgttcagaaagcgcaaagtcatctcc tttggaggctgcatcgctcaaatcttcttcatccacgtcattggtggtgtggagatggtg ctgctcatagccatggcctttgacagttatgtggccctattaagcccctccactatctga >gi568815592r:170435223_170653242|GENSCAN_predicted_peptide_8|119_aa MRTSCSMHKDQALIRSSGRKISETEYQLNEINQEDKIREKRMKRNEQSLQEIWDYVKRPN LRLIAVPESDGENGTKLENTLWDIIQENFPNLARQANIQIQKYGEHHKDTPQEKQPQDT >gi568815592r:170435223_170653242|GENSCAN_predicted_CDS_8|360_bp atgagaacttcatgcagcatgcacaaggatcaagcactgattcggtcaagcggaagaaag atatcagagactgaatatcaacttaatgaaataaatcaagaagacaagattagagaaaaa agaatgaaaagaaatgaacaaagcctccaagaaatatgggactatgtgaaacgaccaaat ctacgtttgattgctgtacctgaaagtgatggggagaatggaaccaagttagaaaacact ctttgggatattatccaggagaacttccctaacctagcaaggcaggccaatattcaaatt cagaaatatggagaacatcacaaagacactcctcaagaaaagcaaccccaagacacatag