GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:30:41 Sequence gi568815586r:68101848_68325756 : 223909 bp : 38.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5044 5091 48 2 0 60 111 36 0.109 0.83 1.02 Term + 11800 12062 263 2 2 8 55 218 0.210 5.20 1.03 PlyA + 12784 12789 6 1.05 2.05 PlyA - 13152 13147 6 1.05 2.04 Term - 16710 16601 110 1 2 121 41 64 0.019 2.69 2.03 Intr - 34562 34384 179 1 2 44 66 115 0.224 3.54 2.02 Intr - 40971 40749 223 1 1 46 93 135 0.139 6.06 2.01 Init - 46573 46519 55 1 1 74 100 84 0.601 9.70 2.00 Prom - 48203 48164 40 -4.25 3.05 PlyA - 49061 49056 6 1.05 3.04 Term - 53640 53506 135 0 0 118 39 173 0.968 12.44 3.03 Intr - 56248 56066 183 1 0 77 85 148 0.999 12.46 3.02 Intr - 56412 56344 69 0 0 120 99 39 0.990 6.66 3.01 Init - 57768 57655 114 0 0 67 78 59 0.795 3.15 3.00 Prom - 57930 57891 40 -2.95 4.00 Prom + 72310 72349 40 -3.05 4.01 Init + 73079 73143 65 1 2 78 67 29 0.414 0.47 4.02 Intr + 75795 76034 240 0 0 72 115 112 0.755 8.04 4.03 Intr + 76117 76235 119 1 2 34 87 57 0.621 -0.61 4.04 Term + 83526 83662 137 1 2 29 48 157 0.616 3.00 4.05 PlyA + 83821 83826 6 1.05 5.04 PlyA - 85469 85464 6 1.05 5.03 Term - 97025 96828 198 2 0 42 45 156 0.302 3.12 5.02 Intr - 101389 101262 128 2 2 66 24 118 0.568 2.68 5.01 Init - 104168 104003 166 2 1 49 -18 232 0.716 8.54 5.00 Prom - 108128 108089 40 -2.25 6.19 PlyA - 108320 108315 6 1.05 6.18 Term - 122337 122218 120 0 0 20 41 91 0.300 -4.91 6.17 Intr - 123436 123302 135 1 0 70 75 83 0.586 5.04 6.16 Intr - 141069 140834 236 1 2 46 93 128 0.116 5.58 6.15 Intr - 145056 144978 79 0 1 80 77 38 0.055 0.11 6.14 Intr - 149731 149666 66 1 0 64 97 103 0.990 6.98 6.13 Intr - 150800 150657 144 2 0 133 75 49 0.951 7.66 6.12 Intr - 150982 150917 66 1 0 123 58 66 0.944 5.28 6.11 Intr - 151632 151416 217 2 1 75 59 202 0.560 13.58 6.10 Intr - 170685 170573 113 0 2 91 37 92 0.190 2.66 6.09 Intr - 188597 188551 47 0 2 79 91 38 0.010 0.41 6.08 Intr - 193519 193418 102 2 0 81 50 59 0.124 0.63 6.07 Intr - 201025 200773 253 1 1 76 46 266 0.996 17.28 6.06 Intr - 211705 211596 110 2 2 107 92 27 0.555 4.08 6.05 Intr - 211906 211797 110 0 2 46 17 82 0.456 -4.09 6.04 Intr - 213418 213101 318 0 0 46 76 526 0.729 41.35 6.03 Intr - 214406 214231 176 2 2 48 72 203 0.999 12.52 6.02 Intr - 221393 221226 168 2 0 63 61 70 0.233 1.02 6.01 Intr - 223728 223594 135 0 0 57 70 56 0.211 0.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 123909 123739 171 0 0 58 100 82 0.840 4.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:68101848_68325756|GENSCAN_predicted_peptide_1|103_aa XNPVSEMFSAFPSYPVDIPNAMEEGFKQALQNLFQTHRCCWKVQELIHFEQNRKHIVIQV PKNPVAIFNGDRHQSFNGGYHQEELLRASATQADGRSPGDKRD >gi568815586r:68101848_68325756|GENSCAN_predicted_CDS_1|312_bp ngaaacccagtatctgagatgttcagtgcctttcccagttacccagttgacatccccaat gcaatggaagagggcttcaagcaggctcttcaaaaccttttccagactcatagatgctgc tggaaggttcaggaactaattcactttgagcagaataggaaacatatagtgatccaagtt cctaagaacccagtagccatttttaatggagacagacaccagagtttcaatgggggctac caccaggaggagctcctgagagcctctgccacacaagctgatggtaggagcccaggtgac aaaagagactga >gi568815586r:68101848_68325756|GENSCAN_predicted_peptide_2|188_aa MGDKGKNARELEASGTRKGSSGSGTAVHRACKTHLASGIWSRKGRAPWRLVRLWCGSFEE KGWEHLDRDPNLGDKGGKCDFEESIYKRRGHHCLQLPGSAPEVMRESILARCCSNHSSRK VISVILRGKMIMSLKLGFIGLATENLGGRSKLGLCLPLRCHDLRGWAAVIIPATFQRPLS YPLLDLRK >gi568815586r:68101848_68325756|GENSCAN_predicted_CDS_2|567_bp atgggagacaaaggcaagaatgccagagaacttgaagcctctggcacacggaaaggatca tcagggagtgggactgccgtgcatagggcttgcaaaactcacctagcttcagggatttgg tcacgaaagggcagagcaccctggcggttggtgagactgtggtgtggcagttttgaggag aaagggtgggagcatctagaccgtgatccaaatcttggagacaagggcggcaagtgtgat tttgaggaatccatctataaaagacgaggccaccactgcctacagcttcctggaagtgcc cctgaagtgatgagagaaagcatcctggccagatgttgcagcaaccacagcagtaggaaa gtgatcagcgtaattctaagaggaaaaatgatcatgagtctcaagctgggcttcataggt ctggccacagagaacctcggaggaaggtcaaaactaggtctctgtcttcctctgcgctgt catgaccttcggggctgggctgctgtgatcatccctgccaccttccagagaccactctca taccctctgcttgatctcagaaaatag >gi568815586r:68101848_68325756|GENSCAN_predicted_peptide_3|166_aa MKYTSYILAFQLCIVLGSLGCYCQDPYVKEAENLKKYFNAGHSDVADNGTLFLGILKNWK EESDRKIMQSQIVSFYFKLFKNFKDDQSIQKSVETIKEDMNVKFFNSNKKKRDDFEKLTN YSVTDLNVQRKAIHELIQVMAELSPAAKTGKRKRSQMLFRGRRASQ >gi568815586r:68101848_68325756|GENSCAN_predicted_CDS_3|501_bp atgaaatatacaagttatatcttggcttttcagctctgcatcgttttgggttctcttggc tgttactgccaggacccatatgtaaaagaagcagaaaaccttaagaaatattttaatgca ggtcattcagatgtagcggataatggaactcttttcttaggcattttgaagaattggaaa gaggagagtgacagaaaaataatgcagagccaaattgtctccttttacttcaaacttttt aaaaactttaaagatgaccagagcatccaaaagagtgtggagaccatcaaggaagacatg aatgtcaagtttttcaatagcaacaaaaagaaacgagatgacttcgaaaagctgactaat tattcggtaactgacttgaatgtccaacgcaaagcaatacatgaactcatccaagtgatg gctgaactgtcgccagcagctaaaacagggaagcgaaaaaggagtcagatgctgtttcga ggtcgaagagcatcccagtaa >gi568815586r:68101848_68325756|GENSCAN_predicted_peptide_4|186_aa MEYYAAIKRNEIMSFARTWMELFYYILYFSALVSVYQQVAFARTFVVTSDENSSHTILIK ERISETGTQVLTGFVSLAVTSASLCETMSILPPTEQHSPAGREMQIKTTRYPHTHYEMAK IWNTDNTKCWLRRGASGTLTEGMDLKTKENGWVHEGPDAGTSSEEVFLGPRGSERWPGVK TEQAER >gi568815586r:68101848_68325756|GENSCAN_predicted_CDS_4|561_bp atggaatactatgcagccataaaaaggaatgagatcatgtcctttgcaagaacatggatg gaacttttttattacatcctttatttttcggcattagtgtcagtataccaacaagttgca tttgccaggacttttgtggtgacaagtgacgaaaattccagtcacactattttgatcaaa gaaaggatctcagagacaggtactcaagtgttgacaggatttgtctctctagctgtcact tctgcttctctttgtgagacaatgtcaatcctgcctcccacagagcagcattcaccagct ggaagggaaatgcaaattaaaacaacaagatacccacacacccattatgaaatggcaaaa atctggaacactgacaacaccaaatgctggctgagacgtggagcatcaggaactctgact gaagggatggatctcaagacaaaggaaaatggttgggtgcacgaggggccagatgctgga accagttctgaagaagtgttcctggggccaagaggatctgagaggtggccaggtgtgaag actgaacaagctgagcgttaa >gi568815586r:68101848_68325756|GENSCAN_predicted_peptide_5|163_aa MDLGEIQELVDTAPQELAEDDLMEMSVSEPVPDDEEEDIEEAIPENKLTLDNWHKGDSCA QIGFENSLAQSGLVVSPGQPAYKPAARDTSLLLLLISKKYMKLEPDHFNLPTDTSLEKVK PRCSGFKMVSELPSFFPEMEGRKHTHPVSHSMVLMTVALDRGC >gi568815586r:68101848_68325756|GENSCAN_predicted_CDS_5|492_bp atggatcttggagaaattcaagagctagtagacactgcaccacaggaattagcagaagac gacttgatggagatgagtgtttctgaaccagtgccagacgatgaggaagaagacatagaa gaagcaatcccagaaaacaaattgacattagacaattggcataagggtgattcctgtgct cagatagggtttgagaattcacttgcacaaagcggactggtagtttcccctggccagcca gcctacaagccagctgctcgagatactagcctgctgcttctcctaatttcgaaaaaatat atgaaattagaaccagatcattttaacctgcccacagataccagtttagaaaaagtcaag ccacgatgttcaggtttcaaaatggtttcagagctcccgtcattttttccggagatggaa ggacgaaagcacactcacccagtcagtcactccatggttctgatgactgtggctcttgac agagggtgctag >gi568815586r:68101848_68325756|GENSCAN_predicted_peptide_6|864_aa LDRLLRKKAGLTVVPSYNALRNSEYQRQFVWKTSKETAPAFAANQVFHNKSQFVPPFKGN SVIHETEYKRNFKGLSPVKEPKLRNDLRENRNLETVSPERKVKELREKAEFYRKRVQGTH FSRDHLNQILSDSNCCWDVSSTTSSEGTVSSNIRALDLAGDPTSHKTLQKCPSTEPEEKG NIVEEQPQKNTTEKLGVSAPTIPVRRRLAWDTENTSEDVQKQPGEKEEEDDNEEEGDRKT GKQAFMGEQEKLDVREKSKADKMKEGSDSSVSSEKGGRLPTPKLRELGGIQRTHHDLTTP AVGGAVLVSPSKMKPPAPEQRKRMTSQDCLETSKNDFTKKESRAVSLLTSPAAGIKTVDP LPLREDSEDNIHKFAEATLPVSKIPKYPTNPPGQLPSPPHVPSYWHPSRRIQGSLRDPEF QHNDEDRLSEISARSAASSLRAFQTLARAKKRKENFWDLSVVTYIQLTQRQFQIFFFPPG KANHTVGGLECPVTFGDNLKPPRKENISSLCHQLLELELSAMAALQKSVSSFLMGTLATS CLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMLAKEASLADNNTDVRLIGEKL FHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEVVPFLARLSNRLSTCHIEGDDL HIQRNVQKLKDTVKKSFSSQLLVSLWQLMVTGQGCSPVLPKEAGKQTNTLTILVQQSWKP KVTAIEAQKQALISLSGVVLQNQHALDVLTTKAGGTCVLLSETCCFYINTSGQIEESLKK KNCQFQEQLLSFFMEDVFGQLQLQGCKKIRFVEDFHSLRQKLSHCPPAVVACRWYKGWVG SLIAILGKPVGKELFILGIMVTET >gi568815586r:68101848_68325756|GENSCAN_predicted_CDS_6|2595_bp ttggatagacttctgcgtaagaaagctggattgactgttgttccttcatataatgccttg agaaattctgaatatcaaaggcagtttgtttggaagacttctaaagaaactgctccagct tttgcagccaatcaggttttccacaataaaagccagtttgttccaccattcaaaggtaac tcagtcatccatgaaactgaatacaaaagaaatttcaagggtttatctccagtgaaagaa ccaaaattaagaaatgatttgagagaaaacagaaatcttgaaacagtgtctcctgaaagg aaggttaaagaactccgagaaaaggctgagttttataggaagcgagttcaggggacgcat ttttctcgggaccatctgaatcagattttatctgatagcaactgctgttgggatgtctcc tcaaccacaagctcagaaggaaccgttagtagcaacatcagagcattagatcttgctgga gatcctacaagccataagactttgcagaaatgtccttctacagaaccagaagaaaaagga aatatcgtggaagaacagccccagaaaaataccacggagaaattgggtgtgtcagctccc accatacccgttagaaggcggctggcttgggatacagagaacacaagtgaagacgtacag aaacagcccggggagaaagaggaggaggacgacaatgaagaggaaggggacaggaaaacg ggcaagcaggcttttatgggagagcaagagaagttggatgtacgtgagaaatctaaggca gataagatgaaagaagggtcagattcttctgtatcctcagaaaaaggaggccggcttcct actcccaagctgagagaacttggtggaatccagaggactcatcatgatctcactactcca gctgttggtggtgctgttttagtgtctccatctaagatgaagcctccagccccagaacag aggaaaagaatgacctctcaggattgtttagaaacttcaaagaatgattttactaagaaa gaaagtcgtgctgtatccctactgacttctccagctgctggtataaaaacagttgatcct ctgcctttgcgggaagattctgaagacaatatccacaaatttgctgaggcaactcttcca gtttcaaaaattccaaaatacccaacaaatccccctggacagttgccttctccaccacat gttccatcctactggcatccctctcgacgaattcagggctctcttagagatccagagttt cagcacaatgatgaggacagattgtctgagatttctgctcgctctgcagcttctagtctc cgggcttttcaaactctggcacgagctaagaaaaggaaggagaatttctgggacctatca gttgtgacctacattcagcttacccagaggcagtttcagatattcttctttcctcctggc aaagctaaccatacggtcggaggacttgaatgcccagtgacgtttggtgacaatctgaag ccaccaaggaaagaaaatatttcttctttgtgtcaccagttgctcgagttagaattgtct gcaatggccgccctgcagaaatctgtgagctctttccttatggggaccctggccaccagc tgcctccttctcttggccctcttggtacagggaggagcagctgcgcccatcagctcccac tgcaggcttgacaagtccaacttccagcagccctatatcaccaaccgcaccttcatgctg gctaaggaggctagcttggctgataacaacacagacgttcgtctcattggggagaaactg ttccacggagtcagtatgagtgagcgctgctatctgatgaagcaggtgctgaacttcacc cttgaagaagtgctgttccctcaatctgataggttccagccttatatgcaggaggtggtg cccttcctggccaggctcagcaacaggctaagcacatgtcatattgaaggtgatgacctg catatccagaggaatgtgcaaaagctgaaggacacagtgaaaaagtctttcagcagccag cttctagtatctctctggcaattgatggtaactggccagggctgttctccggtgctgcct aaagaagcaggaaaacagactaatacactcaccatcttagtccagcaatcttggaaacct aaagtcacagccattgaggcccagaaacaggccctaatttcattgtcaggggttgtcctg cagaatcaacatgctttagatgtgcttaccaccaaggcaggaggcacttgtgtgctgtta agtgaaacctgctgcttttacatcaatacttcaggtcaaatagaagaaagtttaaaaaag aaaaactgtcaatttcaagaacagcttctgtccttcttcatggaagacgtttttggtcaa ctgcaattgcaaggctgcaagaaaatacgctttgtggaggactttcatagccttaggcag aaattgagccactgtcctcctgcagttgtagcttgcagatggtacaaaggttgggttggg agccttattgcaatccttgggaagccagttggaaaggagctcttcatcctgggcatcatg gtcactgagacttaa