GENSCAN 1.0    Date run: 8-Nov-116    Time: 10:08:43

Sequence gi568815584f:50213021_50422719 : 209699 bp : 41.67% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  17943  18239  297  2  0   74   71   132 0.738   7.37
 1.02 Intr +  18475  18652  178  0  1   78   89    60 0.709   3.67
 1.03 Term +  19453  19610  158  2  2    7   43   150 0.558  -0.49
 1.04 PlyA +  19963  19968    6                               1.05

 2.11 PlyA -  22266  22261    6                               1.05
 2.10 Term -  34233  34038  196  2  1  122   36   236 0.918  17.70
 2.09 Intr -  52469  52338  132  1  0  105   82    60 0.730   5.94
 2.08 Intr -  54890  54733  158  2  2   72   60    98 0.764   3.29
 2.07 Intr -  56310  56143  168  0  0   77   94   116 0.890  10.32
 2.06 Intr -  65534  65500   35  0  2  122   98     9 0.866   2.32
 2.05 Intr -  71013  70851  163  0  1   78  103   180 0.953  17.13
 2.04 Intr -  81226  81095  132  1  0   58  110    79 0.990   7.02
 2.03 Intr -  89148  88997  152  1  2  100   86   110 0.997  10.96
 2.02 Intr -  89997  89882  116  2  2   19   77   137 0.927   4.87
 2.01 Init -  99130  98991  140  1  2   51  109    58 0.750   2.63
 2.00 Prom -  99836  99797   40                              -3.45

 3.00 Prom + 104168 104207   40                              -2.85
 3.01 Init + 106214 106281   68  1  2   66   30    83 0.483   0.90
 3.02 Intr + 109491 109699  209  0  2   84   28   218 0.602  13.20
 3.03 Intr + 110925 111096  172  1  1   89   71   102 0.490   6.68
 3.04 Intr + 112589 112625   37  2  1  100   46    -8 0.010  -6.45
 3.05 Intr + 119002 119216  215  0  2   70   91   174 0.078  12.49
 3.06 Term + 122439 122538  100  0  1   84   41   135 0.560   5.02
 3.07 PlyA + 122902 122907    6                              -0.45

 4.21 PlyA - 122931 122926    6                              -3.74
 4.20 Term - 123200 122973  228  2  0   39   48   177 0.781   4.45
 4.19 Intr - 126009 125927   83  1  2   87  101    84 0.935   8.04
 4.18 Intr - 128212 128012  201  2  0   36  106   360 0.704  30.94
 4.17 Intr - 129202 129112   91  1  1   56   61    52 0.166  -2.05
 4.16 Intr - 130038 129886  153  0  0   28   72    94 0.056   1.15
 4.15 Intr - 145124 145004  121  1  1   38   76    64 0.164  -0.22
 4.14 Intr - 146129 146008  122  2  2   91   96    75 0.505   6.97
 4.13 Intr - 149374 149166  209  2  2   37   52   124 0.593   1.57
 4.12 Intr - 150017 149870  148  2  1   58   73   163 0.965  10.69
 4.11 Intr - 163399 163317   83  2  2  111   57     2 0.060  -2.16
 4.10 Intr - 164709 164521  189  1  0   71   81   155 0.895  11.74
 4.09 Intr - 165412 165157  256  1  1   88  103   102 0.633   7.79
 4.08 Intr - 167212 167085  128  2  2   47   63    64 0.319  -0.72
 4.07 Intr - 177164 177109   56  1  2   81   91    32 0.683   0.50
 4.06 Intr - 177432 177257  176  0  2   -5   36   192 0.116   2.52
 4.05 Intr - 182904 182681  224  1  2    6   86   276 0.772  16.12
 4.04 Intr - 184261 184038  224  0  2   64   37   231 0.648  12.45
 4.03 Intr - 184616 184566   51  1  0   91   37   139 0.567   6.20
 4.02 Intr - 197628 197426  203  0  2   29   49   239 0.003  11.26
 4.01 Init - 203441 203406   36  2  0   63   98    44 0.029   2.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 150221 150201   21  2  0   60  101    13 0.832  -0.81
S.002 Term + 154378 154555  178  1  1   81   47   163 0.867   7.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:50213021_50422719|GENSCAN_predicted_peptide_1|210_aa
MKSSFMIITKPSFRDQAKKKNLRNVLQNATGNCEPPIASQRETARPRGLEPCGRDDLLPV
RSSGGHGPPAGRPAPSTDLPQGRDQQSPPFRTVLLAEELRPQRKRGDPQARATPGGDLLS
RRRWPASGAAPPPLSGFAARQGYSWPSPAPTPATAAAGGPSLARAPVRSQNHIVFTKLSR
YFRIVFLIVSSKLCRLQVLQNLDRRLGNTH

>gi568815584f:50213021_50422719|GENSCAN_predicted_CDS_1|633_bp
atgaaatccagttttatgatcataacaaaaccttccttccgggaccaggcaaaaaaaaaa
aacttgagaaacgtccttcagaatgcaacaggcaattgtgagccccccattgccagccaa
cgagaaacggctcggccccggggactcgagccctgtgggagggacgacctgctccccgtt
agaagctcggggggccacgggccacccgccggccgccccgcccctagcactgaccttccg
cagggccgagaccaacagtccccgccatttcggactgttctcctcgctgaagaactccgc
ccgcagaggaagcgcggcgacccgcaagcccgggcgaccccgggcggcgacctcctttcc
cggcggcgctggcccgcttctggggcggctcctcctcctttgtctgggttcgcggcccgt
caggggtacagctggcccagccccgccccgactcccgctacggccgcggcgggcggtcct
tcacttgcacgggcccctgtgcgttcacaaaatcatatcgtttttacaaaactttcaaga
tattttcgaattgttttcttaatcgtgtcatccaaattgtgtaggcttcaggtcctacaa
aacctggatcggcgtctaggaaacactcattag

>gi568815584f:50213021_50422719|GENSCAN_predicted_peptide_2|463_aa
MVPALRYLVGACGRARGLFAGGSPGACGFASGRPRPLCGGSRSASTSSFDIVIVGGGIVG
LASARALILRHPSLSIGVLEKEKDLAVHQTGHNSGVIHSGIYYKPESLKAKLCVQGAALL
YEYCQQKGISYKQCGKLIVAVEQEEIPRLQALYEKGLQNGVPGLRLIQQEDIKKKEPYCR
GLMAIDCPHTGIVDYRQVALSFAQDFQEAGGSVLTNFEVKGIEMAKESPSRSIDGMQYPI
VIKNTKGEEIRCQYVVTCAGLYSDRISELSGCTPDPRIVPFRGDYLLLKPEKCYLVKGNI
YPVPDSRFPFLGVHFTPRMDGSIWLGPNAVLAFKREGYRPFDFSATDVMDIIINSGLIKL
ASQNFSYGVTEMYKACFLGATVKYLQKFIPEITISDILRGPAGVRAQALDRDGNLVEDFV
FDAGVGDIGNRILHVRNAPSPAATSSIAISGMIADEVQQRFEL

>gi568815584f:50213021_50422719|GENSCAN_predicted_CDS_2|1392_bp
atggtgccagcgctgcgttatttggttggtgcctgcggacgggcccgcgggcttttcgcc
ggtggctcccctggggcgtgcgggttcgcgtctgggaggccaagaccgctgtgtggaggt
agccgcagcgccagcaccagctcatttgatatagtcatcgttggtggcggaattgtgggg
cttgcctctgccagagcactcatcctgcgacatccatcactttctattggtgttctggaa
aaggagaaagatttagctgttcaccagactggacataacagtggtgtcatacatagtgga
atttattataaacctgagtctctgaaagccaaattatgtgtacaaggtgcagccctcctc
tatgagtactgtcagcaaaagggaatttcctacaagcagtgtggcaagcttatagtagct
gttgaacaagaagaaattcccagacttcaggccctatatgagaaaggcctccagaatggt
gtcccgggcctgaggctgatccagcaggaggatataaaaaagaaggagccatattgtagg
ggtctaatggctattgattgtccacatactggcattgtggactatcggcaggtggctttg
tcatttgcccaggatttccaagaagcaggtggctctgtcttgaccaattttgaagtaaaa
ggtattgaaatggctaaagaaagtccttcaagaagtatagatggaatgcaatatccaatt
gttataaagaatacaaagggagaggaaattcgatgtcagtatgttgtgacatgtgcagga
ctttactcagaccgtatttcagagttgagtggctgcactcctgatcctcgaattgtacca
ttccggggagattacctgcttttgaagccagaaaaatgttatcttgtaaaaggaaatatt
tatccggtcccagatagccggtttcctttcctaggagttcacttcacaccaaggatggat
ggcagtatttggctagggcctaatgcagttcttgcctttaaacgagagggttacagaccc
tttgacttcagtgccacagatgttatggatataattatcaatagtggcttgattaaactg
gcatcccagaatttttcctatggagttactgaaatgtataaagcatgttttcttggtgca
acagtgaagtatcttcaaaaattcatccctgaaattactatcagtgatatacttaggggc
ccagctggagtaagagcccaggccctggatagagatggaaatctggtagaagattttgta
tttgatgcaggagttggggatattggaaatcgcattcttcatgtgagaaatgcaccttct
cctgctgctacttcttccattgcaatttctggaatgattgcagatgaagtacaacaaaga
tttgaattataa

>gi568815584f:50213021_50422719|GENSCAN_predicted_peptide_3|266_aa
MIRGSGAEACSNEDGCSEKGMKGVDYDRIRDVGPDRAASEWLLRCGAMVRYHGQERWQKD
YNHLPTGPLDKYKIQAIDATDSCIMSIGFDHMEGLEHVEKIRLCKCHYIEDDCLLRLSQL
ENLQKTILEMEIISCGNITDKGIIALRHLRNLKYLLLSDLPGAGNPGPCVASGSSRTRAP
LGFRERGLQETDVPGALVDRSYDLSPSAGMRMLELKATTAILKSRFQLSVPGGSLLFKQS
PVGHSSISFWPYKAILRQSRAGDNQE

>gi568815584f:50213021_50422719|GENSCAN_predicted_CDS_3|801_bp
atgataagaggtagtggagcagaggcttgcagtaatgaagacggctgtagtgaaaaaggg
atgaaaggggtggattatgatcgcatcagggatgttggccctgacagggcggcatccgag
tggttgctgcgctgtggggccatggtgcgctaccatggccaggagaggtggcagaaggac
tacaaccaccttccaacaggccctctggacaaatacaagattcaggcgatcgacgccacc
gactcttgtatcatgagcattggatttgatcacatggagggcctagagcatgttgaaaaa
ataaggctgtgcaagtgtcattatatcgaggatgactgtttgctgagacttagtcaactt
gaaaatttacaaaaaaccatattggaaatggaaataatatcctgtgggaatatcacagac
aaaggcatcattgctttgcgtcatttaagaaacctcaaatatttgttgttaagtgatctt
cctggagctggaaaccctggcccctgtgtggcatcaggaagcagcaggacaagggcacct
ttaggattcagggagagggggcttcaggagacagatgtccctggggccctggttgataga
tcctatgacctgtctcctagtgctggaatgaggatgctggagctgaaagccacaactgcc
atcctcaagtctagattccaactctctgtaccaggtggcagcctgctgtttaaacagtct
cctgtggggcattcgtcaatctccttttggccgtataaggcgattttgcgccagtcacgt
gctggagacaatcaggaataa

>gi568815584f:50213021_50422719|GENSCAN_predicted_peptide_4|993_aa
MELSTDTGWNQKSNFPPCDSRVSKEEAQLPFPTPSSYGADSSNQTYAHKSQSEREQHKEV
RAVLNHCSGLDGGEESGLCGTSTRIDAGTGNGTDDKVTDQHRHRRCLQGTKGNNRGSEVW
GLLLQGNVDRSGGAPSAGVLLRRRGYSCALHGLRKFANLAGLLSRQQDSARGVSHHSRLK
IHFKKIYSSMMEKYEKIGKIGEGSYGVVFKCRNRDTGQIVAIKKFLESEDDPVIKKIALR
EIRMLKAPSPYAAEPSLCGMKMVRRGKKEFLPAVAEKVDAPSGVGGQGQDSVTVGSLGRR
STYGRKQEKQVRQREGIIYCYVAVLLRIYYFDQGCVAREEEQFQELVFGPFCHIGSYFTG
HRTNVRPYILLLSRPSPFKTAAGTYEAGLVILECSYFLAEQEPYCPTQALQQPHPIIGPW
ALEGGGVESKEDRHPPPKEAPASCEGFLRSAVPKQAYTPFKTSPDKRLSDCVATPPWAPP
TPLIISSGVLVAICSMIDPVPEFHSEGLLAKATSGSAGILVWIFLCNDAFIYGKYILRSG
VLWVRGLPGFKSEAAALCHKCYSSADPKREQQQDLLQRVKEQSFHSVNGAQARHKGSPSP
HQTQEPSWPHPVDPTPGHRWSCLPVPCRAPALLSPWVVDGTGCCGAGGGSDRGGSAAQEP
TQLKHPNLVNLLEVFRRKRRLHLVFEYCDHTVLHELDRYQRGICNIFVCTGRRLGEHTEA
LSKKKKKGGGGPFLKLRAASCRITLFKNVGCGLETTGDLSLNSGGGAASRGVAAALRALV
CGTELTSSDSPQRCIHRDVKPENILITKHSVIKLCDFGFARLLTGPSDYYTDYVATRWYR
SPELLVGDTQYGPPVDVWAIGCVFAELLSGVPLWPGKSDVDQLYLIRKTLGDLIPRHQQV
FSTNQYFSGVKIPDPEDMSLCLSVTLTEGGLLASGAVKRSQMGSSVSQATSWPHPDIVAE
TAELDDIAMARQTPVMLRFNRQKEQEKYLSYGA

>gi568815584f:50213021_50422719|GENSCAN_predicted_CDS_4|2982_bp
atggagctgtccactgatactggctggaatcagaagagtaatttcccgccttgtgactct
cgtgtctccaaagaagaagcccaactccctttcccaacacctagcagctacggtgcagac
tcctccaaccagacttacgcacacaagtctcagtctgaaagagagcagcacaaggaagta
cgtgctgtcctgaaccattgttctggcttggatggtggagaggagtctggcctttgtggg
acaagtacccgcattgatgctggcactgggaacgggacggatgataaagtcactgatcag
cacagacacagaagatgcctccaagggaccaaagggaataaccgcggttctgaggtgtgg
ggactgctgttgcagggaaatgtggaccgctcaggaggggctcctagcgcaggcgtcttg
ctccgcagacgcggttacagctgcgctttgcacggcctgcggaagtttgcaaatctagca
ggtctgctctcccggcagcaggactcggcccgcggcgtgagccaccattctcggctgaag
atccattttaagaagatttattcctctatgatggagaagtatgaaaaaattgggaaaatt
ggagaaggatcctatggagttgttttcaaatgtagaaacagggacacgggtcagattgtg
gccatcaagaagtttctggaatcagaagatgaccctgtcataaagaaaattgcccttcgg
gaaatccgaatgctcaaggccccaagcccctatgctgctgaaccttctctctgtggaatg
aaaatggtaagaagaggaaagaaagaatttctgccagcagtggcagaaaaagtagatgct
cctagcggagttggaggtcaaggacaggacagtgtcactgtgggatccctgggcagacgt
tctacctatggcagaaaacaagaaaaacaagtgagacaaagagaaggaatcatctactgc
tatgtggcagttttgcttcggatttactattttgaccagggttgtgttgccagggaggaa
gagcagtttcaggagcttgtgtttggccctttctgccatattggctcttacttcacaggt
catagaaccaatgtgaggccttacattttactactatcacggcccagccccttcaagacg
gctgctggtacctatgaggcaggtcttgtcattcttgagtgcagttatttcctcgcagag
caagaaccttattgccctactcaggcactgcaacaacctcatcctattataggtccgtgg
gccctagaaggaggtggggtagaatctaaggaggacaggcatccccctcctaaggaggca
cctgcatcatgtgagggatttttaaggtctgcagtgccgaagcaagcttacaccccattc
aaaaccagccctgacaaaaggcttagtgactgtgttgctacacctccctgggctcctccc
accccacttatcatcagcagtggggtccttgttgccatctgctccatgattgatccagtt
ccagaattccattctgaaggcctgctagctaaggccacttcaggaagtgctggaatcctg
gtatggatatttttatgcaatgatgcttttatttatggaaagtatattctcagaagtgga
gtgctatgggttcgtggtctccctggcttcaagagtgaagctgcagccctttgccataag
tgttacagcagtgcggacccaaagcgtgagcagcagcaagatttattgcaaagagtgaaa
gaacaaagcttccacagtgtgaatggtgcccaggctagacataaaggttctccaagtccc
caccagactcaggagcccagctggcctcacccagtggatcccacaccaggccacaggtgg
agctgcctgccagtcccatgccgtgcgcccgcactcctcagtccttgggtggtcgatggg
accgggtgctgcggagcagggggcggttctgatcgtggaggctcggccgcgcaggagccc
acgcaactcaagcatcccaaccttgttaacctcctggaagtcttcaggaggaaacggagg
cttcacctggtgtttgaatattgtgaccacacagttctccatgagttggacagataccaa
agaggcatatgcaacatctttgtgtgcactggaagaagacttggtgagcacacagaggcc
ttgtccaagaaaaagaaaaaaggaggagggggtcccttcctgaagttgagggcagcctct
tgcaggatcaccctgtttaagaatgttggctgtgggctggaaaccacaggtgatttgagt
ctaaattctggagggggggccgcatctcgaggggtggctgcagctctcagagcacttgtg
tgtgggacagagctgacatctagcgactctccccagaggtgcatacatagagacgtgaag
ccagaaaatatcctcatcacgaaacattccgtgattaagctttgtgactttggatttgct
cggcttttgactggaccgagtgactactatacagactacgtggctaccaggtggtaccgc
tcccctgagctgctggtgggggacacgcagtacggccccccggtggatgtttgggcaatt
ggctgtgtctttgctgagctgctgtcaggagtgcctctgtggccaggaaaatcggatgtg
gatcagctgtatctgattaggaagaccttgggggatctcattcctaggcaccagcaagtg
tttagcacgaatcagtacttcagtggagtgaaaattccagaccctgaagatatgtccctc
tgcctgtcagtaaccctgacagagggaggcctgcttgcttctggggctgtgaaacgctca
cagatgggctccagcgtatcacaggcaaccagttggcctcatccagacattgttgctgag
acagcagagctcgatgatatagcaatggcacgtcaaaccccagtgatgctcagattcaac
cgacagaaagaacaagagaaatatttaagctatggagcatga