GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:04:36 Sequence gi568815574f:3479245_3679799 : 200555 bp : 36.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 349 450 102 0 0 81 75 55 0.598 3.79 1.02 Intr + 14893 14981 89 0 2 32 92 117 0.150 4.35 1.03 Intr + 49352 49670 319 2 1 37 81 120 0.002 1.24 1.04 Intr + 49784 49943 160 1 1 80 56 96 0.278 4.24 1.05 Term + 74370 74560 191 1 2 41 46 135 0.271 1.23 1.06 PlyA + 74693 74698 6 1.05 2.00 Prom + 80283 80322 40 -4.65 2.01 Init + 100001 100447 447 1 0 57 -13 564 0.782 37.21 2.02 Term + 100492 100725 234 0 0 36 38 391 0.855 24.24 2.03 PlyA + 102103 102108 6 1.05 3.02 PlyA - 102609 102604 6 1.05 3.01 Sngl - 104245 104048 198 1 0 87 33 241 0.807 13.42 3.00 Prom - 104304 104265 40 -9.65 4.05 PlyA - 104342 104337 6 1.05 4.04 Term - 104641 104424 218 2 2 14 42 172 0.429 1.52 4.03 Intr - 104916 104755 162 1 0 33 65 145 0.202 5.73 4.02 Intr - 110970 110691 280 0 1 22 46 175 0.206 2.83 4.01 Init - 111599 111399 201 2 0 60 86 119 0.848 7.82 4.00 Prom - 111659 111620 40 -10.15 5.03 PlyA - 111805 111800 6 1.05 5.02 Term - 113813 112921 893 0 2 30 41 304 0.421 11.50 5.01 Init - 115254 115032 223 0 1 88 96 176 0.819 17.16 5.00 Prom - 120311 120272 40 -6.45 6.00 Prom + 120383 120422 40 -5.25 6.01 Sngl + 121182 121547 366 2 0 21 42 275 0.899 12.04 6.02 PlyA + 122476 122481 6 1.05 7.07 PlyA - 122806 122801 6 1.05 7.06 Term - 144115 143736 380 2 2 39 36 312 0.357 15.07 7.05 Intr - 152978 152902 77 1 2 42 68 96 0.009 1.24 7.04 Intr - 154662 154611 52 1 1 89 97 28 0.049 0.85 7.03 Intr - 163428 163268 161 2 2 110 90 16 0.010 2.71 7.02 Intr - 170040 169876 165 2 0 52 61 89 0.019 0.75 7.01 Init - 197784 197723 62 0 2 77 65 90 0.766 6.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 571 912 342 0 0 43 47 142 0.858 -0.87 S.002 Term - 9685 9552 134 2 2 76 45 80 0.867 -0.23 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815574f:3479245_3679799|GENSCAN_predicted_peptide_1|286_aa MGRESPLLKRRGKSEKDFVLWLGCQLRHRRIEQQLASGNGSLQMDIGNLGAGDRQIPDGT LGAGRNAQVKAGQQCCGPLTAETRSPQLEQQRLAAVGCVAHLHFPPTKVALGFINTLGGM HMCWASLLSLWPWSGKDLACGCGSCKGLVSDLWECALRGTHSHSCSVQMKLSVLTVPECV QPPHSLPGVGAARVEAAVKGLSVTSGSGALKRMLNCGQCSDGDGMIFNPYNNHKENIYEI YTKRNKKRIKTCYNKNSTKQEKEEVMEGMMNAKINKHAENDNEIAI >gi568815574f:3479245_3679799|GENSCAN_predicted_CDS_1|861_bp atggggagagaatccccgcttctgaaaagaagaggaaaaagtgaaaaagactttgtcttg tggcttggatgccagctcagacacagacgaatagagcagcagctggcatcagggaacggt agtctccaaatggatataggaaacctaggggctggtgatcggcagattcctgatgggacc ttgggagctgggagaaatgctcaggtgaaggcagggcagcagtgctgtgggcctttaact gcagagaccaggtcccctcagctggagcaacagagactggcagctgtggggtgtgtggca cacttgcacttccctcctacaaaagtagctctgggttttattaatactcttgggggcatg cacatgtgctgggcctccttgctctctctctggccctggagtggcaaggacttagcctgt ggctgtggcagctgcaaggggctagtcagcgacctctgggagtgtgctctcagaggaaca cacagccacagctgcagtgttcagatgaagttatctgtgctgacagtgccagaatgtgtc cagcctccccattccctccctggcgtaggggcagcaagggtggaggcagctgtaaagggc ttgtcagtgacctctgggagtggtgctctcaaaagaatgctgaactgtggccagtgttca gatggagatggtatgatattcaatccctataataaccacaaggaaaatatctatgaaata tacacaaaaagaaataagaagagaattaaaacatgctataacaaaaactcaactaaacaa gagaaagaagaagtaatggagggaatgatgaatgcaaaaattaataagcatgcagaaaat gataacgagattgcaatataa >gi568815574f:3479245_3679799|GENSCAN_predicted_peptide_2|226_aa MEAAADGPAETQSPVEKDSPAKTQSPAQDTSIMSRNNADTGRVLALPEHKKKRKGNLPAE SVKILRDWMYKHRFKAYPSEEEKQMLSEKTNLSLLRISNWFINARRRILPDMLQQRRNDP IIGHKTGKDAHATHLQSTEASVPAKSGPVGQMSREKQPDPESAPSQKLTGIAQPKKKVKI SITSPSSPELVSPEEYADFSSFLLLVDAAVQTAAELELEKKQEPNP >gi568815574f:3479245_3679799|GENSCAN_predicted_CDS_2|681_bp atggaggccgctgcagacggcccggctgagacccaaagcccggtggaaaaagacagcccg gcgaagacccaaagcccagcccaagacacctcaatcatgtcgagaaataacgcagataca ggcagagttcttgccttaccagagcacaagaagaagcgcaagggaaacttgccagccgag tccgttaagatcctccgcgactggatgtataagcatcggtttaaggcctacccttcagaa gaagagaagcaaatgctgtcagagaagaccaatttgtctttgttgcggatttctaactgg tttatcaatgctcgcagacgcattctcccggatatgcttcaacagcgtagaaacgacccc atcattggccacaaaacgggcaaagatgcccatgccacccacctgcagagcaccgaggcg tctgtgccggccaagtcagggccagtgggccagatgtcaagagagaagcaaccagatccg gagtcggcccctagccagaagctcaccggaatagcccagccaaagaaaaaggtcaagatt tctatcacttccccgtcttctccagaacttgtgtctccagaggagtacgccgacttcagc agcttcctgctgctagtcgatgcagcagtacaaacggctgccgagctggagctagagaag aagcaagagcctaatccatga >gi568815574f:3479245_3679799|GENSCAN_predicted_peptide_3|65_aa MNAPEGRNSEHIRTTGGTNSRRATLRAVTLTTRVHGFILEVSETKNPPIPDTILHLLTVY FRHSK >gi568815574f:3479245_3679799|GENSCAN_predicted_CDS_3|198_bp atgaacgcaccagaaggaagaaattccgaacacatccgaacaacaggaggaacaaactcc agacgcgccaccttaagagctgtaacactcaccacgagggtccacggcttcattcttgaa gtcagtgagaccaagaacccaccaattccggacacaatcctacatcttttgactgtatat ttccgtcattccaaatag >gi568815574f:3479245_3679799|GENSCAN_predicted_peptide_4|286_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV KTAILPKIKDLNVRPRTIKILEQNLDNTIQDTGTGKDFMSETPKALATKAKIDKWHLIKL KSFCTAKETIIRVNRQPTEWEKIFAIYPSDKELISRIYKEPSPVSAAPCSTVPSPIDHPR AEECRHMARDWQAAPPAALVWDPLGEASWAPESGGFVNAPINTVSSYSGGDLEHLCVDTR YLANLVGMWRTFASSSGIVNAPISALSKQTTQLYQSAGCGWDQIRE >gi568815574f:3479245_3679799|GENSCAN_predicted_CDS_4|861_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatccaactt acaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtg aaaacggccatactgcccaagatcaaagacttaaacgtaagacctaggaccataaaaatc ctagaacaaaacctggacaataccattcaggacacaggcacgggcaaagacttcatgtct gaaacaccaaaagcattggcaacgaaagccaaaatagacaaatggcatctaattaaacta aagagcttctgcacagcaaaagaaactatcatcagagtgaacaggcaacctacagaatgg gagaaaatttttgcaatctatccatctgacaaagagctaatatccagaatctacaaagaa ccctccccggtgagcgccgccccctgctccacggtgcctagtcccatcgaccacccaagg gctgaggagtgcaggcacatggcacgggactggcaggcagctccacctgcagccctggtg tgggatccactgggtgaagccagctgggctcctgagtctggtgggtttgtgaatgcacca atcaacactgtatctagctactctggtggggacttggagcacctttgtgtggacactcgg tatctagctaatctggtggggatgtggagaacctttgcgtctagctctgggattgtaaac gcaccaatcagcgccctgtcaaaacagaccactcagctctaccaatcagcaggatgtggg tgggaccagataagagaataa >gi568815574f:3479245_3679799|GENSCAN_predicted_peptide_5|371_aa MGKSQCKKAENSKNQNASPPPKDHNSSPAREQNWTENEFDELTEVGFRRWVITNNSELKE HVLSQCKEAKNLEKGGPRFIKQVLRDLQRDLDSHTITVGDFNTPLSVLDRSTRQKINNDV QDLNLALDQVEPIDLYRTLHPKSTEYTFSSAPHCTYSKKDHIIGSKTLLSKCKRMEIITN SLSDHSAIKLELRIKKLIQNHTNTWKMNNLLLNDYCVNNKIKAEINKFFKTNENEDTTYQ NLWDTFKAVFRGKFIALNAHNRKQEISKIDILTSKLKELEKQQQTNSKASRRHETTKIRA ELKELETQITLQKINESRSCFLKKINKIDRLLARLIKEKTEKNQIDAIKNDIRDITTDLT EICGYSHQRIL >gi568815574f:3479245_3679799|GENSCAN_predicted_CDS_5|1116_bp atggggaaaagccagtgcaaaaaggctgaaaattccaaaaaccagaatgcctctcctcct ccaaaggatcacaactcatctccagcaagggaacaaaactggacagagaatgagtttgat gaactgacagaagtaggcttcagaaggtgggtaataacaaacaactccgagctaaaggag catgttctatcccaatgcaaggaagctaagaatcttgaaaaaggaggacccagattcata aagcaagttcttagagacctacaaagagacttagactctcacacaataacagtgggagac tttaacaccccactgtcagtattggacagatcaacaagacagaaaattaacaacgatgtt caggacttgaacttagctctagaccaagtggaaccaatagacctctacagaactctccac cccaaatcaacagaatatacattctcttcagcacctcattgcacttattctaaaaaggac cacataattggaagtaaaacactcctcagcaaatgcaaaagaatggaaatcataacaaac agtctctcagaccacagtgcaatcaaattagaactcaggattaagaaactcattcaaaac cacacaaatacatggaaaatgaacaacctgctcctgaatgactactgtgtaaataacaaa attaaggcagaaataaataagttcttcaaaaccaatgagaatgaagacacaacgtaccag aatctctgggacacatttaaagcagtgtttagaggaaaatttatagcactaaatgcccac aacagaaagcaggaaatatctaaaattgacatcctaacatcaaaattaaaagaactagag aagcaacagcaaacaaattcaaaagctagcagaagacatgaaacaactaagatcagagca gaactgaaggagttagagacacaaataacccttcaaaaaatcaatgaatccaggagctgc tttttgaaaaaaatcaacaaaatagatagattgctagccagattaataaaggagaaaaca gagaagaatcaaatagatgcaataaaaaatgatataagggatatcaccactgatctcaca gaaatatgtgggtattcccatcagagaatactgtaa >gi568815574f:3479245_3679799|GENSCAN_predicted_peptide_6|121_aa MSRGAEEQSGREWQRAVDSSRVSQKRRKEEASEHREFSRELPDPSTFQLPIHLAELHLQH SIKACTHPLSPRVIGSFWDTGQVLRIQKAVTLALCPCDKAEGPLSLFTIKQSADGKTERP L >gi568815574f:3479245_3679799|GENSCAN_predicted_CDS_6|366_bp atgtcaagaggagcagaggaacagagcggcagagagtggcagagagcagtagacagcagc agggtgtcacagaaaagaaggaaggaagaagcgtctgaacatcgagagttcagcagagaa ctccccgacccctccaccttccagctccccatccatcttgctgagctccacctccaacat tcaataaaagcttgcactcatcctttgagcccacgtgtgattggatccttctgggacacg gggcaagtgctcaggatacagaaggctgtcacactggccctctgcccttgcgataaggca gagggtccattgagcttatttaccatcaagcaatctgcagatggcaaaactgaaagacct ttgtag >gi568815574f:3479245_3679799|GENSCAN_predicted_peptide_7|298_aa MIRGSHRKDPKQHGILHWKLRNQEKAQEREIRDTRKRQKLSWLPCRLTNPRIPEKHLLTA TNMASHGTLGKTALEEDVLEKGYTVAAAHFWLILTYCSESPVKQPKFSLPQSADCRKLQS NIGIGWNREGLEIFVHFENDMERFRRRNNAYHWLAVNKYVGKSLFGAVSSEGCNKGLSTR ASGCGRCTRSPSSASPPAPRSISHGALAAFPQGRAQDLQPAMPEPPTHSIGSCVARASPT STTPCSRVPSPLDHLRAEECERMARDWQAAPPAAPVRDPLREASWAPESGEDVESLYV >gi568815574f:3479245_3679799|GENSCAN_predicted_CDS_7|897_bp atgatcagaggctcccatcgaaaagatcccaaacagcatggaatcctgcactggaaacta aggaatcaggagaaggctcaggagagagagataagagacacaagaaaaaggcagaagctc tcatggctgccttgcaggctcacaaatcccagaatccctgagaagcacctgttaactgct acaaatatggcaagccatggcactttaggaaaaactgccctggaagaggatgtcttagaa aagggctatactgttgcagctgcccacttttggttgatacttacatattgctcagaatca cccgtaaagcagcccaagttttctcttcctcaatcagctgattgtagaaaactccaatca aacataggtataggatggaatagagaaggcttggagatctttgtgcattttgagaatgac atggagaggttcagaagaagaaataatgcttatcactggcttgctgtcaataaatacgtg ggtaaatctctgttcggggctgtcagctctgaaggctgcaataaaggacttagcacccgg gccagcggctgcggaaggtgtactaggtcccccagcagtgccagcccaccggcaccgcgc tcgatttctcacggagccttagcggccttcccgcagggcagggctcaggacctgcagcct gccatgcctgagcctcccacccactccataggctcctgtgtggccagagcctccccgacg agcaccaccccctgctccagggtgcccagtcccctcgaccacctaagggctgaagagtgc gagcgcatggcgcgggactggcaggcagctccacctgcagccccggtgcgagatccacta cgtgaagccagctgggctcctgagtctggtgaggacgtggagagtctttatgtctag