GENSCAN 1.0    Date run: 6-Nov-116    Time: 06:12:11

Sequence gi568815594f:9110657_9215404 : 104748 bp : 45.69% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1654   1660    7  0  1   95   81    11 0.127   1.51
 1.02 Intr +  12988  13095  108  2  0   55   67    91 0.156   3.96
 1.03 Intr +  14198  14794  597  0  0   64   38   181 0.566   3.21
 1.04 Term +  17580  17812  233  1  2   41   48   187 0.718   6.64
 1.05 PlyA +  18399  18404    6                               1.05

 2.00 Prom +  18994  19033   40                              -4.16
 2.01 Init +  30465  30598  134  2  2   87   37   161 0.595  10.51
 2.02 Intr +  30646  30952  307  1  1 -101   98   476 0.706  26.15
 2.03 Intr +  42586  42735  150  0  0   92   37   146 0.467  10.26
 2.04 Intr +  44868  44930   63  2  0   83  116    59 0.991   7.11
 2.05 Intr +  46823  46903   81  1  0   62   86    71 0.944   4.13
 2.06 Intr +  48499  48600  102  0  0   61   64   196 0.997  14.87
 2.07 Intr +  49834  49967  134  0  2   87   60   118 0.999   8.34
 2.08 Intr +  50050  50232  183  1  0   74   23   191 0.992  10.10
 2.09 Intr +  50587  50837  251  1  2   70   66    45 0.491  -2.42
 2.10 Intr +  51147  51296  150  1  0   43   71   235 0.752  17.43
 2.11 Intr +  51962  52134  173  0  2   26  -57   162 0.273  -4.84
 2.12 Term +  54760  54966  207  0  0   70   47   132 0.472   4.64
 2.13 PlyA +  55070  55075    6                               1.05

 3.05 PlyA -  55293  55288    6                              -0.45
 3.04 Term -  55772  55641  132  2  0   78   41   108 0.959   3.19
 3.03 Intr -  56840  56765   76  1  1  100   91   119 0.927  12.92
 3.02 Intr -  57959  57845  115  0  1   87   46   103 0.912   5.51
 3.01 Init -  58384  58336   49  1  1   86   58    28 0.421  -1.29
 3.00 Prom -  59599  59560   40                             -11.23

 4.00 Prom +  61367  61406   40                              -1.66
 4.01 Init +  62517  62639  123  2  0   74  110    81 0.747   9.09
 4.02 Intr +  63134  63333  200  1  2  122  105    66 0.999   9.85
 4.03 Intr +  63792  63900  109  0  1  103   96   103 0.999  12.89
 4.04 Term +  64549  65511  963  0  0  134   53   456 0.997  38.76
 4.05 PlyA +  66053  66058    6                               1.05

 5.04 PlyA -  66308  66303    6                               1.05
 5.03 Term -  67614  67316  299  1  2   29   43   222 0.221   7.23
 5.02 Intr -  67826  67697  130  2  1   54   71    62 0.218   1.37
 5.01 Init -  76196  76095  102  2  0   52   92    50 0.164   2.04
 5.00 Prom -  79669  79630   40                              -3.56

 6.03 PlyA -  79694  79689    6                               1.05
 6.02 Term -  87439  87321  119  2  2   85   50   145 0.943   9.10
 6.01 Init -  90324  90267   58  0  1   83   81    17 0.336   1.98
 6.00 Prom -  93713  93674   40                              -5.26

 7.00 Prom +  95372  95411   40                              -2.56
 7.01 Sngl + 100001 101593 1593  1  0  106   39  1122 0.976 104.47
 7.02 PlyA + 102321 102326    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:9110657_9215404|GENSCAN_predicted_peptide_1|314_aa
MAKVQYGSQAVRPVPNSKALYQQPKAMAFDLTVPPSGQGFSGTTTPTVNTTISGNQPITI
RQLSPATAGSAAVDLCSTQMISLLPGEPPQKIPTGVYGPLPEGMVGLILGRSSLNLKGVQ
IHTGVIDSDYKGEIQLVISSTVPWSANPGDRIAQLLLLPYIKIGDSKTETGVFGSTNTAG
KAVYWASQLSENRPVCTVTIQGKQFEGLVDTGADVSIIALNQWQKNQPKQKPVTGFVGEN
QLPVWLPTRHLKFYNEPIRDAREGASAETENPQSNINDSQGEQNGDIRRTDEVAIHQESG
AADLGPAKEADTVS
>gi568815594f:9110657_9215404|GENSCAN_predicted_CDS_1|945_bp
atggccaaggttcagtatggatctcaggcggtgcgtcctgtgccaaatagcaaggcacta
tatcaacaacccaaggcgatggcgtttgatcttacagtaccacctagtggacaagggttt
tcagggacaacaacccccacagtaaataccaccatttcaggaaatcagccaattacaata
cgacaattatcccctgccacagcaggcagtgctgccgtagatttatgttctactcaaatg
atttctttactccctggagagccccctcaaaagattcctacaggggtatatgggccgctg
ccagaagggatggtaggccttattttaggaagatctagtctaaatttgaaaggagttcaa
attcatactggggtaattgactcagattataaaggggaaattcagttagtgatcagctct
actgttccctggagtgccaatccaggtgatagaattgctcaattactgctcttgccttat
attaaaattggggatagcaaaacagaaacaggagtgtttggaagtaccaacactgctgga
aaagctgtttattgggctagtcagctctcagagaatagacctgtgtgtacagttactatt
cagggaaaacagtttgaaggattagtggatactggggctgatgtttccatcattgcctta
aatcaatggcaaaaaaatcagcctaaacaaaagcctgttacaggatttgttggagaaaat
cagcttcctgtttggctacccactagacatttgaagttctataatgaacccatcagagat
gcaagggaaggcgcctccgcagagacagagaacccgcaatcgaacatcaacgactcgcag
ggtgaacaaaatggtgatatcagaaggacagatgaagttgccatccaccaagaaagtggg
gccgccgacctgggcccagctaaagaagctgacacagttagctga
>gi568815594f:9110657_9215404|GENSCAN_predicted_peptide_2|644_aa
MNWFTKEDFDFVTLCYREPDNVGHRFGPEAENRKLMIQQIGRTIGVIITRDHGVTTEKKR
PNVNKIPLSNYIKFRDWVKFDIVGYGGFGMPLPKSGQEEALYQALKNAYPHLHIYKKEEF
PEHFHIAKHDRVLPIVMYANSGYTINGSRPGFLQVSRQRCGSAHVMAPEENAGTELWLQG
FERRFLAARSLRSFPWQSLEAKLRDSSDSELLRDILQKTVKHPVCVKHPPSVKYARCFLS
ELIKKHEAVHTEPLDELYEVLAETLMAKESTQGHRSYLLPSGGSFTLSEITAIISHGTTG
LVTWDATLYLAEWAIENPAAFTNRGVLELGSGAGLTGLAICKMCRPQAYIFSDCHSRVLE
QLRGNVLLNGLSLEADITANLDAPGDHRRKTTTSGTRTGPLRKGGVWLGHRKPLTPASTL
SPLSGGTELCLWPWVPALKPTGPAVARDTGPPLQASRPNGRHLKQEVHDVLYCPEAIVSL
VGVLRRLAACREHKQAPEVYLAFTVRNPETCQLFTTELAPPEHSPSWKPCAQMHPQQPLP
AHRDTDNPVPVHVGQPVNYRANKQASTRRHTGFHDRRALGDGNAEPHTVGLTHDSNGLVR
IKSLSWEEFLYGKADKTFIALECLENVKFQIRNHKLPSNKTLAI
>gi568815594f:9110657_9215404|GENSCAN_predicted_CDS_2|1935_bp
atgaactggttcaccaaggaagactttgactttgtgactctgtgctacagagagccagat
aacgtgggacaccgattcgggccagaggcagagaacaggaagttgatgattcagcaaatc
ggcaggaccatcggcgtcatcatcacacgagaccatggggtgaccaccgagaagaagaga
cccaatgtcaacaagatccccttgtccaactacatcaagttcagggactgggtcaagttt
gatattgtgggctatggtggctttgggatgcccctgcccaaatcggggcaagaagaagcc
ctttaccaggcactgaagaatgcgtaccctcacctccacatctacaagaaggaggagttt
ccagaacacttccatatcgctaaacatgaccgggttctgccaatcgtgatgtatgccaac
tctggttacactatcaatgggtccaggcccggcttcctccaggtctccaggcaacgctgc
ggctccgcccacgtcatggcgcccgaggagaacgcggggacagaactctggctgcagggt
ttcgagcgccgcttcctggcggcgcgctcactgcgctccttcccctggcagagcttagag
gcaaagttaagagactcatcagattctgagctgctgcgggatattttgcagaagactgtg
aagcatcccgtgtgtgtgaagcacccgccatcagtcaagtatgcccggtgctttctctca
gaactcatcaaaaagcacgaggctgtccacacggagcctttggacgagctgtacgaggtg
ctggcggagactctgatggccaaggagtccacccagggccaccggagctatttgctgccc
tcgggaggctcgttcacactttccgagatcacagccatcatctcccatggtactacaggc
ctggtcacatgggacgccaccctctaccttgcagaatgggccatcgagaacccagcagcc
ttcactaacaggggtgtcctagagcttggcagtggcgctggcctcacaggcctggccatc
tgcaagatgtgtcgcccccaggcatacatcttcagcgactgtcacagccgggtcctcgag
cagctccgagggaatgtccttctcaatggcctctcattagaggcagacatcactgccaac
ttagacgccccaggagaccacaggagaaaaacaaccacttctgggacgaggacagggccc
ttgagaaaaggtggtgtttggctgggccaccgaaaacccctcacccctgccagcacactc
agtcccctctctggtggaacagagctctgcctgtggccctgggtcccagccctgaaaccc
acaggtccagcggtggccagggacacaggcccacccctgcaagccagcagaccaaacggc
agacacctgaaacaagaagttcacgacgtgctgtattgcccagaagccatcgtgtcactg
gtcggggtcctgcggaggctggctgcctgccgggagcacaagcaggctcctgaggtctac
ctggcctttaccgtccgcaacccagagacgtgccagctgttcaccaccgagctagctccc
cctgagcatagcccctcctggaagccatgtgcacagatgcacccgcagcagcctctgcct
gcacacagagacacggacaatccagtgcctgtccacgtggggcagcccgttaactacaga
gccaacaaacaagccagcacacgaagacatactgggttccacgacagaagagcacttgga
gatggcaatgctgaacctcacactgtaggactcacacacgactccaacgggcttgtgaga
attaagtcactctcgtgggaagaatttttatatgggaaagcggataaaactttcattgca
ctggaatgtttggaaaatgttaaattccaaatcaggaaccacaaactgccctctaataag
acattggctatctaa
>gi568815594f:9110657_9215404|GENSCAN_predicted_peptide_3|123_aa
MGFHHVGQAGLELLTSGSVDLGVCLDTSSSGLGLPMKVVDMFRSCLPVCAVNFKCLHELV
KHEENGLVFEDSEELAAQLQMLFSNFPDPAGKLNQFWKNLRESQQLRWDESWVQTVLPLV
MDT
>gi568815594f:9110657_9215404|GENSCAN_predicted_CDS_3|372_bp
atggggtttcaccatgttggccaggctggtcttgaactcctgacctcagggtcggtggac
ctgggtgtctgtctggacacctcctccagtggcctgggcctgcccatgaaggtggtggac
atgttcaggagctgtttgcctgtgtgtgccgtgaacttcaagtgtttacatgagctagtg
aaacatgaagaaaacggcctggtctttgaggactcagaggaactggcagctcagctgcag
atgcttttctcaaacttccctgatcctgcaggcaagctaaaccagttctggaagaacctg
cgggagtcgcagcagctccgatgggatgagagttgggtgcagactgtgcttcctttggtt
atggacacataa
>gi568815594f:9110657_9215404|GENSCAN_predicted_peptide_4|464_aa
MMACRDPKPGAKRLVRAQTLQKQRRAPVGPRAPPPDEEDPRLKCKNCGAFGHMARSTRCP
MKCWKAALVPPTLGKKEGKENLKPWKPQVEANPGPLNKDKGEKEERPRQQDPQRKALLHI
FSGKPPEKPLPNRKGSTESSVYLRVASGPMPVHTTSKRPRVDPVLADRSATEMSDRGSAL
ASLSPLRKASLSSSSSLGPKERQTGAAADIPQPAVRHQGPEPLLVVKPTHSSPEGGCREV
PQAASKTHGLLQAISPQAQDKRPAVTSQPCPPAATHSLGLGSNLSFGPGAKRPAQAPIQA
CLNFPKKPRLGPFQIPESAIQGGELGAPEYLQPPPATTELGPSTSPQMGRRTPAQVSSVD
RQPPHSRPCLPTAQACTMSHHPATSHDGAQPLRVLFRRLENGRWSSSLLAAPSFHSPEKP
GAFLAQSPHVSEKSEVPRVRVPPNVLYEDLQVSSSSEDSDSDLE
>gi568815594f:9110657_9215404|GENSCAN_predicted_CDS_4|1395_bp
atgatggcatgtcgtgaccccaaacctggggcaaagagactggtgagagcccagaccctc
cagaagcagcggagagccccagttgggccaagggctcccccgcccgatgaagaagatccc
aggctcaagtgcaaaaactgtggggcctttggtcacatggccagaagtaccaggtgcccc
atgaagtgctggaaggcagccctggttccaccgaccttggggaaaaaggaagggaaggaa
aacctgaaaccatggaagccccaggttgaagcaaacccggggcccttgaacaaggataag
ggagagaaggaagagagaccaaggcaacaagacccgcagaggaaggctctcctccacata
ttttccgggaaacctccagagaagccgctgccaaatcgaaaaggatccacggaatcttct
gtttatctgagggttgcaagcgggccaatgccggtccacacaaccagtaagaggccgcgt
gtggaccctgtcctcgctgatcgctcagctaccgaaatgtctgacaggggctccgccttg
gcttcactgtctcccctcagaaaagccagtctgagctcctcctcaagtcttggaccaaag
gaaagacagacaggggctgcggccgacatccctcagcctgcagtcagacaccagggcccc
gagcctctcctcgtggtgaagccgacacacagcagccctgagggtggctgccgagaagtt
ccccaggctgcctccaaaacccacggcctgctccaggccatcagcccccaggcacaagac
aaacgtcctgcggtgacctcacagccctgcccaccagccgccacacatagcttgggtcta
ggctccaatctcagcttcgggccaggagccaagagacctgcccaggctccgattcaggct
tgcctgaacttccccaagaaaccgaggctgggtcccttccagatccccgaaagcgccatc
cagggaggtgagctgggggccccggagtatctccaacctccgccggcaacaaccgaactt
ggaccaagtacgtcgccccagatgggcaggaggacacccgcccaggtgtccagcgtcgac
cggcagcctccgcacagcagaccttgcctgcctactgcccaggcctgcaccatgtcccat
cacccagcgaccagccatgatggggcccagcctctcagagtgctcttccggagactggaa
aacggacgctggagctccagcctcctggcggccccctcatttcactctcctgagaagccg
ggagccttcctcgctcagagccctcatgtctcagagaagtctgaggttccccgtgttcgt
gtcccaccgaacgtcctctatgaggaccttcaggtttcctcctcctcagaggacagcgac
tctgacctggagtga
>gi568815594f:9110657_9215404|GENSCAN_predicted_peptide_5|176_aa
MWVHSFDQKTLLNGNISYKYFGKRLRFMNKQPFIVIWIIKITTLNSGGKGAAIWCVSENP
AQFRGLLDVESCSELFPGSGGEPWKGPVSDIRKEDGMNVLPLKYIPNVGVNFTFAGVYLA
SETLPGSFAHPEATSRGAVANGTTHLASAVEPNGDSWYKERSPRVSVREIRLAEFY
>gi568815594f:9110657_9215404|GENSCAN_predicted_CDS_5|531_bp
atgtgggtgcacagttttgaccagaaaaccttgttaaatgggaacataagctacaaatat
tttggcaaaagattaaggtttatgaacaaacaaccatttatagtgatctggataatcaag
ataacgaccctcaacagcggcggaaagggagcagccatttggtgtgtctcagaaaatccc
gctcagttccgaggcctcctagatgtggaatcctgctcagagttgttcccaggatcaggg
ggagaaccatggaaaggcccggtgtcagacatccggaaagaagacgggatgaacgtttta
cctctgaagtacatcccaaatgtgggagttaacttcacctttgctggggtctatttggct
agtgaaactctgcctggttcattcgcacatccggaagccacttcacggggggccgtcgca
aatggaaccacacacttggcatcggcggttgagccaaatggggactcgtggtacaaggaa
cgctccccacgtgttagcgtgcgtgagattcggttggcagaattttactag
>gi568815594f:9110657_9215404|GENSCAN_predicted_peptide_6|58_aa
MVEGKTEAGTFFHKVAGERSTNLIFYYFADISSANTRQIETMTFRRTQTTLTAGKKIQ
>gi568815594f:9110657_9215404|GENSCAN_predicted_CDS_6|177_bp
atggtggaaggcaagacagaagcaggcaccttttttcacaaggtggcaggagagagaagc
accaatttgatcttctactactttgcagacatctcttcggcaaacaccagacaaattgag
acaatgaccttccgcaggacccaaaccacccttactgcaggaaagaagatccagtga
>gi568815594f:9110657_9215404|GENSCAN_predicted_peptide_7|530_aa
MEDDSLYLGGEWQFNHFSKLTSSRPDAAFAEIQRTSLPEKSPLSCETRVDLCDDLAPVAR
QLAPREKPPLSSRRPAAVGAGLQNMGNTCYVNASLQCLTYKPPLANYMLFREHSQTCHRH
KGCMLCTMQAHITRALHIPGHVIQPSQALAAGFHRGKQEDAHEFLMFTVDAMRKACLPGH
KQVDRHSKDTTLIHQIFGGYWRSQIKCLHCHGISDTFDPYLDIALDIQAAQSVQQALEQL
VKPEELNGENAYHCGVCLQRAPASKTLTLHNSAKVLILVLKRFPDVTGNKIAKNVQYPEC
LDMQPYMSQQNTGPLVYVLYAVLVHAGWSCHNGHYSSYVKAQEGQWYKMDDAEVTASSIT
SVLSQQAYVLFYIQKSEWERHSESVSRGREPRALGVEDTDRRATQGELKRDHPCLQAPEL
DEHLVERATQESTLDHWKFLQEQNKTKPEFNVRRVEGTVPPDVLVIHQSKYKCRMKNHHP
EQQSSLLNLSSTTPTDQESMNTGTLASLRGRTRRSKGKNKHSKRALLVCQ
>gi568815594f:9110657_9215404|GENSCAN_predicted_CDS_7|1593_bp
atggaggacgactcactctacttgggaggtgagtggcagttcaaccacttttcaaaactc
acatcttctcggccagatgcagcttttgctgaaatccagcgtacttctctccctgagaag
tcaccactctcatgtgagacccgtgtcgacctctgtgatgatttggctcctgtggcaaga
cagcttgctcccagggagaagcctcctctgagtagcaggagacctgctgcggtgggggct
gggctccagaatatgggaaatacctgctacgtgaacgcttccctgcagtgcctgacatac
aaaccgccacttgccaactacatgctgttccgggagcactctcaaacgtgtcatcgtcac
aagggctgcatgctctgtactatgcaagctcacatcacaagggccctccacattcctggc
catgtcatccagccctcacaggcattggctgctggcttccatagaggcaagcaggaagat
gcccatgaatttctcatgttcactgtggatgccatgagaaaggcatgccttcccgggcac
aagcaggtagatcgtcactctaaggacaccaccctcatccaccaaatatttggaggctac
tggagatctcaaatcaagtgtctccactgccacggcatttcagacacttttgacccttac
ctggacatcgccctggatatccaggcagctcagagtgtccagcaagctttggaacagttg
gtgaagcccgaagaactcaatggagagaatgcctatcattgtggtgtttgtctccagagg
gcgccggcctccaagacgttaactttacacaactctgccaaggtcctcatccttgtattg
aagagattccccgatgtcacaggcaacaaaattgccaagaatgtgcaatatcctgagtgc
cttgacatgcagccatacatgtctcagcagaacacaggacctctcgtctatgtcctctat
gctgtgctggtccacgctgggtggagttgtcacaacggacattactcctcttatgtcaaa
gctcaagaaggccagtggtataaaatggatgatgccgaggtcaccgcctctagcatcact
tctgtcctgagtcaacaggcctacgtcctcttttacatccagaagagtgaatgggaaaga
cacagtgagagtgtgtcaagaggcagggaaccaagagcccttggcgtagaagacacagac
aggcgagcaacgcaaggagagctcaagagagaccacccctgcctccaggcccccgagttg
gacgagcacttggtggaaagagccactcaggaaagcaccttagaccactggaaattcctt
caagagcaaaacaaaacgaagcctgagttcaacgtcagaagagtcgaaggtacggtgcct
cccgacgtacttgtgattcatcaatcaaaatacaagtgtcggatgaagaaccatcatcct
gaacagcaaagctccctgctaaacctctcttcgacgaccccgacagatcaggagtccatg
aacactggcacactcgcttccctacgagggaggaccaggagatccaaagggaagaacaaa
cacagcaagagggctctgcttgtgtgccagtga