GENSCAN 1.0 Date run: 2-Nov-116 Time: 21:50:55 Sequence gi568815587f:49932874_50133791 : 200918 bp : 40.19% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10348 10539 192 0 0 66 97 89 0.147 6.61 1.02 Intr + 17100 17151 52 2 1 100 3 78 0.089 -1.94 1.03 Intr + 19521 20150 630 1 0 91 29 322 0.038 17.81 1.04 Intr + 27678 27735 58 1 1 85 28 53 0.002 -3.98 1.05 Term + 31694 31970 277 2 1 65 41 180 0.024 4.95 1.06 PlyA + 32021 32026 6 1.05 2.00 Prom + 32633 32672 40 -3.75 2.01 Sngl + 35678 36121 444 1 0 71 49 243 0.944 14.69 2.02 PlyA + 36461 36466 6 1.05 3.03 PlyA - 38413 38408 6 1.05 3.02 Term - 44617 44516 102 1 0 27 49 87 0.195 -4.00 3.01 Init - 49628 48756 873 2 0 55 83 306 0.387 21.36 3.00 Prom - 53398 53359 40 -7.35 4.00 Prom + 57721 57760 40 -5.85 4.01 Init + 62351 62740 390 1 0 55 82 218 0.494 14.72 4.02 Term + 69549 70073 525 2 0 17 38 294 0.837 10.57 4.03 PlyA + 70903 70908 6 1.05 5.00 Prom + 71285 71324 40 -10.05 5.01 Init + 74486 75114 629 1 2 44 19 285 0.586 12.06 5.02 Intr + 75514 76243 730 1 1 30 73 238 0.713 7.00 5.03 Intr + 80609 80760 152 1 2 36 97 118 0.633 5.54 5.04 Intr + 86192 86324 133 2 1 61 93 148 0.898 12.33 5.05 Intr + 90712 90927 216 0 0 16 18 214 0.128 4.98 5.06 Intr + 94286 94383 98 1 2 25 93 43 0.043 -3.61 5.07 Intr + 94697 94821 125 2 2 27 94 114 0.122 5.21 5.08 Intr + 100194 100592 399 1 0 74 29 335 0.266 19.75 5.09 Intr + 111627 111745 119 1 2 48 89 96 0.795 4.96 5.10 Term + 111948 112031 84 2 0 91 39 73 0.792 -0.63 5.11 PlyA + 112063 112068 6 1.05 6.00 Prom + 118117 118156 40 -5.85 6.01 Sngl + 141711 142094 384 2 0 101 54 265 0.658 20.34 6.02 PlyA + 148267 148272 6 1.05 7.04 PlyA - 148559 148554 6 1.05 7.03 Term - 170024 169891 134 0 2 96 38 98 0.236 2.87 7.02 Intr - 178328 178304 25 2 1 85 98 12 0.076 -1.42 7.01 Intr - 183497 183291 207 2 0 77 83 69 0.062 3.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 27097 26879 219 1 0 87 103 122 0.839 12.28 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:49932874_50133791|GENSCAN_predicted_peptide_1|402_aa MTTSSNAKTPNKFIGIMKDQTNMIPPKEINKSTVIDPKEMEIYELSDKEFKIIVFKNSWE YITQFLRLTLRGPEPCKDPFVCNVINHHLKYMANRNNVTEFILLGLTENPKMQKIIFVVF SVIYINAMIGNVLIVVTITASPSLRSPMYFFLAYLSFIDACYSSVNTPKLITDSLYENKT ILFNGCMTQVFGEHFFRGVEVILLTVMAYDHYVAICKPLHYTTVMKQHVCSLLVGVSWVG GFLHATIQILFICQLPFCGPNVIDHFMCDLYTLINLACTNTHTLGLFIAANTQRCFDAAE MPLPLPGGLPWNTTSRKQGNKAGWSLNMFDKLTEVGFRRWVITYSSKLNKHVLSQCKEAK KLEKRLEEWLTGITNVEKSLNDLMELKNTAQDLCEAYTSFNT >gi568815587f:49932874_50133791|GENSCAN_predicted_CDS_1|1209_bp atgacaacttcttcaaatgcaaagacaccaaacaagtttataggaatcatgaaggatcag acaaacatgataccaccaaaggaaataaataaatcaacagtaattgaccctaaagaaatg gagatatatgaactgtctgacaaggaattcaaaataattgtcttcaagaattcatgggaa tacataacacagtttcttcgtcttaccctgcggggaccagagccatgtaaagatccgttt gtgtgtaatgtaattaaccatcatttgaaatacatggcgaatagaaacaatgtgacagag tttattctattggggcttacagagaatccaaaaatgcagaaaatcatatttgttgtgttt tctgtcatctacatcaacgccatgataggaaatgtgctcattgtggtcaccatcactgcc agcccatcactgagatcccccatgtactttttcctggcctatctctcctttattgatgcc tgctattcctctgtcaatacccctaagctgatcacagattcactctatgaaaacaagact atcttattcaatggatgtatgactcaagtctttggagaacattttttcagaggtgttgag gtcatcctacttactgtaatggcctatgaccactatgtggccatctgcaagcccttgcac tataccaccgtcatgaagcagcatgtttgtagcctgctagtgggagtgtcatgggtagga ggctttcttcatgcaaccatacagatcctcttcatctgtcaattacctttctgtggtcct aatgtcatagatcactttatgtgtgatctctacactttgatcaatcttgcctgcactaat acccacactctaggactcttcattgctgccaacacacaaaggtgctttgatgcagcagaa atgcctctgccccttcctggagggttgccctggaacacaacttctcgtaaacaagggaac aaagctggatggagtttgaatatgtttgacaaattgacagaagtaggcttcagaaggtgg gtaataacatactcctccaagctaaacaagcatgttctaagccaatgcaaggaagctaaa aaacttgaaaaaaggttagaggaatggctaactggaataacaaatgtagagaagagctta aatgacctgatggagctgaaaaacacagcacaagatctttgtgaagcatacacaagtttc aatacctga >gi568815587f:49932874_50133791|GENSCAN_predicted_peptide_2|147_aa MGKDFMSETPKAMTTKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWENIFAIFSSDKGL ISRIYNELKQIDKKKTNNAIKKWAKYRNKHFSKEDIYAAKRHMKKCSSSLAIREMQIKTT MRYHLTPVRMVIIKNQETTGAGEDVEK >gi568815587f:49932874_50133791|GENSCAN_predicted_CDS_2|444_bp atgggcaaggacttcatgtctgaaacaccaaaagcaatgacaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacagacaacctacagaatgggagaacatttttgcaatcttctcatctgacaaagggcta atatccagaatctacaatgaactcaaacaaattgacaagaaaaaaacaaacaacgccatc aaaaagtgggcaaagtataggaacaaacacttttcaaaagaagacatttatgcagccaaa agacacatgaaaaaatgctcatcatcactggccatcagagagatgcaaatcaaaaccaca atgagataccatctcacaccagttagaatggtgatcattaaaaatcaggaaacaacaggt gctggagaggatgtggagaaatag >gi568815587f:49932874_50133791|GENSCAN_predicted_peptide_3|324_aa MEKKKNVTEFILIGLTQNPIMEKVTFVVFLVLYMITLSGNLLIVVTITTSQALSSPMYFF LTHLSLIDTVYSSSSAPKLIVDSFQEKKIISFNGCMAQAYAEHIFGATEIILLTVMACDC YVAICKPLNYTTIMSHSLCILLVAVAWVGGFLHATIQILFTVWLPFCGPNVIGHFMCDLY PLLKLVCIDTHTLGLFVAVNSGFICLLNFLILVVSYVIILRSLKNNSLEGRCKALSTCIS HIIVVVLFFVPCIFVYLRSVTTLPIDKAVAVFYTMVVPMLNPVVYTLRNAESIIAGGLRE REQNTLSDAAPILSGAIVQKRARL >gi568815587f:49932874_50133791|GENSCAN_predicted_CDS_3|975_bp atggagaagaaaaagaatgtgactgaattcattttaataggtcttacacagaaccccata atggagaaagtcacgtttgtagtatttttggttctttacatgataacactttcaggcaac ctgctcattgtggttaccattaccaccagccaggctctgagctcccccatgtacttcttc ctgacccacctttctttgatagacacagtttattcttcttcttcagctcctaagttgatt gtggattcctttcaagagaagaaaatcatctcctttaatgggtgtatggctcaagcctat gcagaacacatttttggtgctactgagatcatcctgctgacagtgatggcctgtgactgc tatgtggccatctgcaaacctctgaactacacaaccattatgagccacagcctgtgcatt ctcctggtggcagtggcctgggtgggaggatttcttcatgcaactattcagattctcttt acagtatggctgcccttctgtggccccaatgtcataggccacttcatgtgtgacttgtac ccattgttaaaacttgtttgcatagacactcatacccttggtctctttgttgctgtgaac agtgggtttatctgcttattaaacttccttatcttggtggtatcctatgtgatcatcttg agatctttaaagaacaatagcttggaggggaggtgtaaagccctctccacctgtatttct cacatcatagtagttgtcttattctttgtgccctgtatatttgtgtatctgcgctcagtg accactctgcccattgataaagctgttgctgtattttatactatggtggtcccaatgtta aatcccgtggtctacacactcagaaatgctgagtcaatcatcgctggtgggctccgggag agggaacaaaataccctaagtgatgcagctcccattttatctggtgcaattgtccagaaa agggcaaggctgtga >gi568815587f:49932874_50133791|GENSCAN_predicted_peptide_4|304_aa MVRVAASLTRPGDKLAGEIMENPPAPSGCWAYWLCFKPVSLPGAPSHRMGLKEVPEQLRI SGQDYTLVLSEGFWTNSSLQLPNQPSAIGSPAFLSQFPPFLPAIVMSPIISIYAMLRKFL QFSEIVLLGKHHLLAGGQPTPNQCTKQKYNQEASQNPPHSPATSTGARCWYPQLKDLKTD DIKGLFVDTPQYHSGARQLHWVTRPKRAKTITAVCLSVRPIHRGMGRIPHQGSTLGDKRI GTAALEPDIFPLTQSTQMRRNQKNNSSDMTKQGSLTPPKDHTSSPTVDPNPDEISELPEK EFRS >gi568815587f:49932874_50133791|GENSCAN_predicted_CDS_4|915_bp atggttagagtggctgccagccttacaagaccaggggacaagcttgctggggagatcatg gagaatcccccagcaccctcaggttgctgggcatattggctatgtttcaaaccagtttcc cttcctggagcacccagccatcgtatggggcttaaagaggtcccggagcaactaagaatt tctggccaggactacaccctggtgttatctgagggcttctggaccaactccagccttcaa ctgcccaaccaaccgtcggcaataggatctccagcttttctatcacaatttcctcctttc ctaccggcaattgtcatgtctcctatcatctctatatatgcaatgctccggaaattttta cagttcagcgaaatagtcctgttaggaaagcaccacctcctggctggaggccaaccaaca ccaaaccagtgcaccaaacaaaaatacaaccaagaagcttcacaaaatccacctcactcc cctgctacctccactggagcaaggtgctggtatccacagctgaaagacttgaagacagac gacattaaaggactctttgttgatactccccagtaccactctggagcacggcagctccac tgggtgactagacccaaaagagcaaaaacaataactgcagtttgcctctcagtaaggccc atccataggggaatggggagaataccacatcaagggagcaccttgggggacaaaagaata ggaacagcagcacttgagcccgatatcttccctctgacacagtctacccaaatgagaagg aaccagaaaaacaattctagtgatatgacaaaacaaggttctttaacacccccaaaagat cacaccagctcaccaacagtggatccaaacccagatgaaatctctgaattgccagaaaaa gaattcagaagttga >gi568815587f:49932874_50133791|GENSCAN_predicted_peptide_5|894_aa MVKGSIQQEELTILNRYAPNTGAPRFIKQVLGDLQRDSDSHTIIMGDFNTPLSTLDRSMR QKVNKDIQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYTEIDHMVGSKALLSKCK RTEIITNCLSDHSTIKLELRIKKLTQNCSTTWKLNNLPLNDYWVHNEMKAEIKMFFETNE NNDTAHQNLWDTFKTVCRGKFIALNAHKRKLNQEEVESLNRPITGSQIEAIINSLPTKKS PGPDEFTAEFYQRYKGELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENF RPISLMNIDAEILNKMLANRIQQHIKMLIHHDQVGFIPGMQGWLNIPKSINVIQHINRTN DKNHMIISIDAEKAFDKIQQPFMLKILNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEA FPLKTNTRQGCSLLALVFNIVLEFLARATRQEKTLPDPIHVEDVPGPKSEFCSVFPSTPQ ASGNFQNQLFQDSAEAIPLDEGRSACMTQLFAEHFFGRVEIILLVVMAYDRYETICKPLY YLITMNRKSSPEDSVAIKRTGLQQLESAVLTDTVLLSLEISTWNLPPKALSTSKTLSCKT NGPPGPDKEILFVYIALPGLSWLKGAKVQMRPLIQRVQGPSFHMVLGLQVYRRATGMELP KALRAQLLFHQCGLDVRHVVKGDYFGALGFNDYPDIFYSSSIAPKMIFDLISENNTISFN GCMTQLFTEHFFAAAETILLSVMAYDCYVAICKPLHYATIMTQSMCGFLMVVAGILGFVH GGIQTLFIAQLPFCGPNVIDHFMCDLVPLLELACTDTHTLGPLIAANNFLFDEETPRTLT HIINLGQFRKIFQEYGNKAFPFPTRKRVQEKNTMTYTCDPQEDLKLLFNKHTDK >gi568815587f:49932874_50133791|GENSCAN_predicted_CDS_5|2685_bp atggtaaagggatcaattcaacaagaagaactaactatcctaaatagatatgcacccaat acaggagcacccagattcataaagcaagtccttggagacctacaaagagactcagactcc cacacaataataatgggagactttaacaccccactgtcaacattagacagatcaatgaga cagaaagttaacaaggatatccaggaattgaactctgctctgcatcaagcggacctaata gacatctacagaactctccaccccaaatcaacagaatatacattcttttcagcaccacac cacacctataccgaaattgaccacatggttggaagtaaagcactcctcagcaaatgtaaa agaacagaaattataacaaactgtctctcagaccacagtacaatcaaactagaactcagg attaagaaactcactcaaaactgctcaactacatggaaactgaacaacctgcccctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgag aacaacgacacagcacaccagaatctctgggacacattcaaaacagtgtgtagagggaaa tttatagcactaaatgcccacaagagaaaactaaaccaggaagaagtggaatctctgaat agaccaataacaggctctcaaattgaggcaataattaatagcttaccaaccaaaaaaagt ccaggaccggatgaattcacagccgaattctaccagaggtacaagggggagctggtacca ttccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttat gaggccagcatcatcctgataccaaagcctggcagagacacaacaaagaaagagaatttt agaccaatatccctgatgaatatcgatgcagaaatcctcaataaaatgctggcaaaccga atccagcagcacatcaaaatgcttatccatcatgatcaagtgggcttcatccctgggatg caaggctggctcaacatacccaaatcaataaacgtaatccagcatataaacagaaccaac gacaaaaaccatatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaa cctttcatgctaaaaattctcaataaattaggtattgatgggacgtatctcaaaataata agagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagca ttccctttgaaaactaacacaagacagggatgctctctcttagcactcgtattcaacata gtgttggaatttctggccagggcaaccaggcaggaaaagaccctaccagatcccattcat gtagaggatgtccctggtcctaagtctgaattctgttcagtcttcccttccactccacag gcatcaggaaatttccagaatcaactattccaagacagcgctgaagcaattcccctggat gagggccgaagtgcgtgcatgacccagctctttgcagaacatttctttgggagagttgag atcattctgctcgtggtaatggcctatgaccgctatgagacaatctgcaagcccctgtac tacctgatcacaatgaacaggaagtcctcaccagaggactctgtggccataaaaagaaca ggacttcagcaactcgaatcagctgtcttaacagacactgtcttgttgtcacttgagata agcacctggaatctgccaccgaaggctctgtcgacatcaaagacgctttcttgcaagacc aatggaccacctggcccagacaaggaaattctttttgtctacattgctctccctggactg tcatggctaaaaggggccaaagtacagatgaggccattgattcagagggtgcaaggccca agcttccacatggtgttgggcctacaggtgtacagaagagccacagggatggagctgccc aaggccttgagagcccaacttctttttcatcagtgtggcctggatgtgagacatgtagtc aaaggagattactttggagctttaggatttaatgactacccagatattttctactcttct tccatagcccctaaaatgatctttgacttgatctctgaaaacaacaccatatccttcaat ggctgcatgactcagctcttcacagaacatttctttgcggcagctgagaccatcttatta agtgtcatggcctacgactgctatgtggccatctgtaagcccttgcactatgcaaccatc atgacccaatctatgtgtggattcctgatggtggtggctggaattctgggatttgtgcat ggaggaatccagactttgttcatagcccagttaccattctgtggccccaatgtcatcgac cactttatgtgtgatttagtacctcttctggagctggcctgcacagacactcacacttta gggcctctgattgctgccaacaatttcctgtttgatgaggagacacctagaacgctcact catattatcaacttaggccagtttcgaaagatctttcaagaatatggaaataaagcattt ccctttcccacaaggaaaagagtgcaagagaagaacaccatgacctacacttgtgatcct caagaagatttgaagctgttgtttaacaaacacacagataagtag >gi568815587f:49932874_50133791|GENSCAN_predicted_peptide_6|127_aa MRRQEAGPREAEMRTILACRVRWEEELGRERPTRGSSGPVEDAKKQTLSLERVSRGMSLV YRVHCDAGDEPVEADFWTILGCRHRWEEELGLDMLTGGSFGPGVDVKKQNLGEKRPPGGP SLAYKGH >gi568815587f:49932874_50133791|GENSCAN_predicted_CDS_6|384_bp atgaggagacaggaggctggccctcgagaggccgagatgaggactattttggcctgcaga gtccgctgggaggaagagctgggccgggagaggccaactagaggaagttcagggcctgta gaggatgcaaagaagcaaacgctaagcttggaaagggtgtcgagaggcatgagtttggtc tacagagtccactgcgatgcaggagatgagcctgtagaggctgatttctggacaattttg ggctgcagacaccgttgggaggaagagcttggcctggacatgctgactggaggaagtttt gggcctggagtggatgtcaaaaagcaaaatcttggcgagaaaagaccaccaggaggcccg agccttgcctataaaggacattga >gi568815587f:49932874_50133791|GENSCAN_predicted_peptide_7|121_aa QVGNTRFVESVRRHLGGHFDPWGKTKYPQVKTRREKSVKLLGDVWIHLTELNLSFDLAGW NTLFVESVKKVVSGCLPAQALGLIEAVKVPYSVFQSNPEFLCVEGLPEGIPFRSPTCFGI P >gi568815587f:49932874_50133791|GENSCAN_predicted_CDS_7|366_bp caggttggaaacactcgttttgtagaatctgtgaggagacatttgggaggccattttgac ccatggggaaaaaccaaatatccccaggtaaaaactagaagggagaaatctgtgaaacta cttggtgatgtgtggatacatctcacagagttaaatctttcttttgatttagcaggttgg aacactctttttgtagaatctgtgaagaaagttgtaagtggctgcctgccagctcaagct cttggactcattgaggcagtaaaagtaccatattctgtgtttcaatcaaaccccgagttc ctatgtgtagaaggcttgccagaggggattccctttcgaagccctacctgctttggaatt ccatga