GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:30:34 Sequence gi568815594r:75545433_75762025 : 216593 bp : 40.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 PlyA - 1072 1067 6 1.05 1.14 Term - 7027 6885 143 2 2 -32 42 255 0.009 6.01 1.13 Intr - 10281 10135 147 1 0 57 78 75 0.019 2.69 1.12 Intr - 16779 16660 120 1 0 65 65 83 0.010 3.25 1.11 Intr - 38199 38025 175 0 1 30 30 172 0.012 4.19 1.10 Intr - 46493 46387 107 0 2 63 92 69 0.290 3.81 1.09 Intr - 50908 50811 98 0 2 109 71 32 0.544 2.33 1.08 Intr - 51804 51503 302 0 2 58 80 185 0.922 9.31 1.07 Intr - 52780 52645 136 0 1 63 99 86 0.980 6.85 1.06 Intr - 54937 54849 89 1 2 82 65 54 0.977 0.35 1.05 Intr - 58524 58385 140 1 2 92 84 88 0.982 8.06 1.04 Intr - 60202 60090 113 0 2 82 82 34 0.979 1.30 1.03 Intr - 61929 61751 179 0 2 74 71 106 0.923 5.30 1.02 Intr - 69017 68823 195 2 0 60 101 49 0.755 2.09 1.01 Init - 80556 80389 168 0 0 46 113 167 0.831 14.59 1.00 Prom - 81316 81277 40 -7.65 2.04 PlyA - 82566 82561 6 1.05 2.03 Term - 84821 84606 216 2 0 54 42 115 0.576 -0.34 2.02 Intr - 84952 84873 80 2 2 104 42 90 0.901 4.35 2.01 Init - 85472 85028 445 2 1 81 84 233 0.567 16.43 2.00 Prom - 98722 98683 40 -3.35 3.14 PlyA - 99318 99313 6 1.05 3.13 Term - 100270 99998 273 1 0 55 48 369 0.999 24.09 3.12 Intr - 101024 100906 119 0 2 96 106 128 0.999 14.66 3.11 Intr - 101725 101597 129 2 0 70 80 162 0.999 13.35 3.10 Intr - 103309 103207 103 1 1 73 109 146 0.999 14.03 3.09 Intr - 109814 109634 181 1 1 43 72 294 0.992 22.25 3.08 Intr - 110438 110336 103 0 1 54 79 154 0.995 9.41 3.07 Intr - 111582 111492 91 0 1 51 100 105 0.999 6.65 3.06 Intr - 112298 112125 174 2 0 34 100 143 0.997 9.31 3.05 Intr - 113492 113411 82 1 1 94 101 94 0.995 10.02 3.04 Intr - 116617 116499 119 1 2 94 99 45 0.252 4.54 3.03 Intr - 127951 127776 176 2 2 65 85 216 0.192 17.74 3.02 Intr - 128408 128094 315 0 0 21 53 263 0.746 11.21 3.01 Init - 129863 129803 61 2 1 65 96 78 0.993 7.76 3.00 Prom - 136038 135999 40 -4.75 4.00 Prom + 141301 141340 40 -6.55 4.01 Init + 141953 142022 70 1 1 84 91 25 0.577 3.69 4.02 Intr + 143258 143364 107 0 2 52 115 43 0.612 2.41 4.03 Intr + 146740 146768 29 0 2 115 111 -15 0.348 -0.50 4.04 Intr + 149937 150012 76 0 1 85 121 -9 0.422 0.70 4.05 Term + 151192 151308 117 0 0 96 35 88 0.761 1.86 4.06 PlyA + 152126 152131 6 1.05 5.04 PlyA - 154185 154180 6 1.05 5.03 Term - 179315 179115 201 2 0 66 39 222 0.941 11.51 5.02 Intr - 179943 179668 276 0 0 29 23 218 0.432 6.19 5.01 Init - 186769 186731 39 1 0 64 81 73 0.400 4.44 5.00 Prom - 199788 199749 40 -2.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 18682 18820 139 2 1 145 37 103 0.898 9.50 S.002 Init - 38229 38025 205 0 1 89 30 176 0.811 11.26 S.003 Init + 85703 85829 127 1 1 73 82 196 0.972 17.87 S.004 Term + 89582 89745 164 0 2 88 43 89 0.822 1.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:75545433_75762025|GENSCAN_predicted_peptide_1|703_aa MEKYENLGLVGEGSYGMVMKCRNKDTGRIVAIKKFLESDDDKMVKKIAMREIKLLKQLRH ENLVNLLEVCKKKKRWYLVFEFVDHTILDDLELFPNGLDYQVVQKYLFQIINGIGFCHSH NIIHRDIKPENILVSQSGVVKLCDFGFARTLAAPGEVYTDYVATRWYRAPELLVGDVKYG KAVDVWAIGCLVTEMFMGEPLFPGDSDIDQLYHIMMCLGNLIPRHQELFNKNPVFAGVRL PEIKEREPLERRYPKLSEVVIDLAKKCLHIDPDKRPFCAELLHHDFFQMDGFAERFSQEL QLKVQKDARNVSLSKKSQNRKKEKEKDDSLVEERKTLVVQDTNADPKIKDYKLFKIKGSK IDGEKAEKGNRASNASCLHDSRTSHNKIVPSTSLKDCSNVSVDHTRNPSVAIPPLTHNLS AVAPSINSGMGTETIPIQGYRVDEKTKKCSIPFVKPNRHSPSGIYNINVTTLVTRNSRLT KKESKILSESRIPSLAAIDLHTPSITLHQMCGLLAKCLQFHIVGAFIVSLGVAAVCKIAV AEPRKKTYADFYRNYDSVKDLEEMGKAVPHTYILFHRRTLEMSVVSPTHLLAMADWTTDR HSTHSLSAYLLKKRNKRERALPGGFLTPVVCSGELGIILRPPTRVYAASLLCFCPGGPSG SIPEEGMVIIADGSSMHVIAPEDLPVEQDVEVEDSDSDDPDPV >gi568815594r:75545433_75762025|GENSCAN_predicted_CDS_1|2112_bp atggaaaaatatgaaaacctgggtttggttggagaagggagttatggaatggtgatgaag tgtaggaataaagatactggaagaattgtggccataaagaagttcttagaaagtgacgat gacaaaatggttaaaaagattgcaatgcgagaaatcaagttactaaagcaacttaggcat gaaaacttggtgaatctcttggaagtgtgtaagaaaaaaaaacgatggtacctagtcttt gaatttgttgaccacacaattcttgatgacttggagctctttccaaatggactagactac caagtagttcaaaagtatttgtttcagattattaatggaattggattttgtcacagtcac aatatcatacacagagatataaagccagagaatatattagtctcccagtctggcgttgtc aagctatgcgattttggatttgcgcgaacattggcagctcctggggaggtttatactgat tatgtggcaacccgatggtacagagctccagaactattggttggtgatgtcaagtatggc aaggctgttgatgtgtgggccattggttgtctggtaactgaaatgttcatgggggaaccc ctatttcctggagattctgatattgatcagctatatcatattatgatgtgtttaggtaat ctaattccaaggcatcaggagctttttaataaaaatcctgtgtttgctggagtaaggttg cctgaaatcaaggaaagagaacctcttgaaagacgctatcctaagctctctgaagtggtg atagatttagcaaagaaatgcttacatattgaccccgacaaaagacccttctgtgctgag ctcctacaccatgatttctttcaaatggatggatttgctgagaggttttcccaagaacta cagttaaaagtacagaaagatgccagaaatgtttctttatctaaaaaatcccaaaacaga aagaaggaaaaagaaaaagatgattccttagttgaagaaagaaaaacacttgtggtacag gataccaatgctgatcccaaaattaaggattataaactatttaaaataaaaggctcaaaa attgatggagaaaaagctgaaaaaggcaatagagcttcaaatgccagctgtctccatgac agtaggacaagccacaacaaaatagtgccttcaacaagcctcaaagactgcagcaatgtc agcgtggaccacacaaggaatccaagcgtggcaattcccccacttacacacaatctttct gcagttgctcccagcattaattctggaatggggactgagactataccaattcagggttac agagtggatgagaaaactaagaagtgttctattccatttgttaaaccgaacagacattcc ccatcaggcatttataacattaatgtgaccacattagtaactcgaaattccaggctaaca aagaaagagagcaaaattctttcagaatctcgaattccttctctggctgctattgacctg cacacccccagtattacattacatcagatgtgtggtcttctggccaaatgtctgcaattt catattgttggagcctttattgtatccctgggggttgcagctgtctgtaagattgctgtg gctgaaccaagaaagaagacatatgcagatttctacagaaattatgattccgtgaaagat ttggaggagatggggaaggctgtccctcatacatatattctgtttcatcgacgaaccctg gaaatgtcagttgtcagtcctacgcaccttctggctatggctgattggaccactgataga cactcaactcattcactgtctgcttacctacttaaaaagcgaaacaagagggagagggca ttacccggaggcttcctgaccccggtggtttgcagtggagagttggggatcattcttagg cccccaaccagggtttatgctgcttctctgctctgcttttgtccaggaggtccttcagga agtattccagaagaaggcatggttatcatagcagatggcagctctatgcatgttattgcc cctgaagatcttccagtggaacaagatgtggaggtggaagacagtgacagtgatgatcct gaccccgtgtag >gi568815594r:75545433_75762025|GENSCAN_predicted_peptide_2|246_aa MESRSGRAYATQGARRPPLPQRRCYEGNRIAFCIVFAVLHNRHLYPSRGHHNRLLPLGKA ERRLLTRTKACGDAGPLRVREQNLGRALGEGAGWLALSQSWLVTTRLGQSEEGRPGALGA GEKPPPLAAPRPTPSPRGSWSGAGSQAPGVGACRERILTWAVSVSQDFVPMVGDRCELAA DRGPPASLPASPWTRDCVRRRNHNSAGIASLLGSIFWTCEPYAFHLKFVSKLKTFGSFYA NGLKKR >gi568815594r:75545433_75762025|GENSCAN_predicted_CDS_2|741_bp atggaatcccggtcaggccgcgcctacgcgactcagggcgcccggcgcccgcccctgccc cagcggcgatgctatgagggaaaccgtatcgcattttgcatagtcttcgcagtcctacat aaccgccacctttacccttcgcgtgggcatcacaatcgcctcctcccgctggggaaggca gaaaggcgcctcctgacgagaaccaaggcgtgtggggacgcagggcctctgcgtgtcagg gagcagaacctgggccgagccctaggtgaaggggcggggtggttggccctgagccaatca tggctcgtgacgactcggctcggccaatcagaagaagggaggcctggcgctctcggggcg ggtgagaaaccgcccccccttgcagctccgcggccaacgccttcgcccaggggtagttgg agcggtgcaggttcccaggctccaggtgttggtgcctgccgtgaacgcattctgacctgg gccgtatctgtctcccaagactttgtgcctatggttggggacagatgtgagcttgcggcg gaccgaggcccacctgcctccctgcctgcttcgccctggactcgtgactgcgtccgcaga agaaatcacaacagcgctggaattgctagtttgctaggcagcatcttttggacctgcgaa ccatatgcatttcacctcaaatttgtttccaagttgaaaacctttgggtctttctatgcg aacggattgaagaaacggtaa >gi568815594r:75545433_75762025|GENSCAN_predicted_peptide_3|641_aa MVKNIYNSIARKSSKPETLEARLQQQPADTYSERLLNSLLPINKKDQNISLQKTLRCCEP ALACAIRKKGGKAGASGPEKGSLLVTSNQRTLILALGRFRFPGFPGASVCERGSVLRVLR RGTREAPGAREEVVVRQTNSRLSGFRGFRVVGGRGRRAIRTRRTRSSATLLTSARRTRRR RWLEHLTLCSKEMVMEKPSPLLVGREFVRQYYTLLNKAPEYLHRFYGRNSSYVHGGVDAS GKPQEAVYGQNDIHHKVLSLNFSECHTKIRHVDAHATLSDGVVVQVMGLLSNSGQPERKF MQTFVLAPEGSVPNKFYVHNDMFRYEDEVFGDSEPELDEESEDEVEEEQEERQPSPEPVQ ENANSGYYEAHPVTNGIEEPLEESSHEPEPEPESETKTEELKPQVEEKNLEELEEKSTTP PPAEPVSLPQEPPKPRVEAKPEVQSQPPRVREQRPRERPGFPPRGPRPGRGDMEQNDSDN RRIIRYPDSHQLFVGNLPHDIDENELKEFFMSFGNVVELRINTKGVGGKLPNFGFVVFDD SEPVQRILIAKPIMFRGEVRLNVEEKKTRAARERETRGGGDDRRDIRRNDRGPGGPRGIV GGGMMRDRDGRGPPPRGGMAQKLGSGRGTGQMEGRFTGQRR >gi568815594r:75545433_75762025|GENSCAN_predicted_CDS_3|1926_bp atggtgaagaatatttataatagcattgcacggaagagctccaaaccggaaacacttgaa gctcgtctacagcagcaacctgcagacacctattcagagagattactaaattccctcctt cccattaataaaaaggaccagaatatcagtctacaaaagacactgcgttgctgcgaacca gctctcgcttgcgcgatcaggaagaagggcggcaaggctggagcctcgggaccggagaaa ggcagcctgcttgtgacgtcaaatcagcggactcttatcttggctttaggccggttccgg ttccccggctttccgggcgcgagcgtgtgcgagcgcggcagcgtactgcgcgtgctccgc agagggacacgggaagcgcctggcgcccgggaagaggtggttgtgaggcagacgaactcg cggctctccggcttccgaggcttccgagttgtcggaggaagggggcggcgagcaataaga acccgccgcacccggtcctcagcgactcttctgacctccgcgcgacgtacccgccgccgc cgttggctggagcatttgacattgtgcagcaaagaaatggttatggagaagcccagtccg ctgcttgtagggcgggagtttgtgaggcaatattatactttgctgaataaagctccggaa tatttacacaggttttatggcaggaattcttcctatgttcatggtggagtagatgctagt ggaaagccccaggaagctgtttatggccaaaatgatatacaccacaaagtattatctctg aacttcagtgaatgtcatactaaaattcgtcatgtggatgctcatgcaaccttgagtgat ggagtagttgtccaggtcatgggtttgctgtctaacagtggacaaccagaaagaaagttt atgcaaacctttgttctggctcctgaaggatctgttccaaataaattttatgttcacaat gatatgtttcgttatgaagatgaagtgtttggtgattctgagcctgaacttgatgaagaa tcagaagatgaagtagaagaggaacaagaagaaagacaaccatctcctgaacctgtgcaa gaaaatgctaacagtggttactatgaagctcaccctgtgactaatggcatagaggagcct ttggaagaatcctctcatgaacctgaacctgagccagaatctgaaacaaagactgaagag ctgaaaccacaagtggaggagaagaacttagaagaactagaggagaaatctactactcct cctccggcagaacctgtttctctgccacaagaaccaccaaagccaagagtcgaagctaaa ccagaagttcaatctcagccacctcgtgtgcgtgaacaacgacctagagaacgacctggt tttcctcctagaggaccaagaccaggcagaggagatatggaacagaatgactctgacaac cgtagaataattcgctatccagatagtcatcaactttttgttggtaacttgccacatgat attgatgaaaatgagctaaaggaattcttcatgagttttggaaacgttgtggaacttcgc atcaataccaagggtgttgggggaaagcttccaaattttggttttgtggtttttgatgac tctgaaccagttcagagaatcttaattgcaaaaccgattatgtttcgaggggaagtacgt ttaaatgtggaagagaaaaaaacaagagctgcaagagagcgagaaaccagaggtggtggt gatgatcgcagggatattaggcgcaatgatcgaggtcccggtggtccacgtggaattgtg ggtggtggaatgatgcgtgatcgtgatggaagaggacctcctccaaggggtggcatggca cagaaacttggctctggaagaggaaccgggcaaatggagggccgcttcacaggacagcgt cgctga >gi568815594r:75545433_75762025|GENSCAN_predicted_peptide_4|132_aa MIVRPPQPCGTVSPLNLFFIPVPGQKPWMIHKKKFHREQISVKTTLPCLARIPLARLRMN PVPSACSLTSVPGGDRYRCLGSEGQDRSWAKKSKCPVALVKNIRCVNDVSMADNCQLSSS CSQSSLSSWKED >gi568815594r:75545433_75762025|GENSCAN_predicted_CDS_4|399_bp atgattgtgagacctcctcagccatgtggaactgtaagtccgctgaacctctttttcatc ccagtcccagggcaaaaaccttggatgatccataaaaagaaattccacagggaacagatt tctgtaaaaactactttgccatgtttagcaagaattccactagctcgtttgagaatgaac cctgtgcctagtgcttgctctctaacgtctgtgcctggaggtgacaggtacaggtgtcta gggtctgagggtcaagacagaagctgggctaagaagtctaagtgcccagtggccttggta aaaaacattagatgtgtaaatgatgtgagcatggcagacaactgccagctttctagcagt tgttcccaatcatcattgtcgtcatggaaagaggactaa >gi568815594r:75545433_75762025|GENSCAN_predicted_peptide_5|171_aa MGAVNTGDNNGGKPNALTNSWVLNLLAVFPNAPVSDCEGGSRTSGSSRPVQTSERERGRG PRPRGAWEAVRASSAAHPSLPVRYHLRTTLLARASGPTELRGAAAGLGPNPALYTLLLLP PPPPTRRPLQPSKPGNSRQQRPRCACAGTWHYVAARRLKAKGERRREEPLS >gi568815594r:75545433_75762025|GENSCAN_predicted_CDS_5|516_bp atgggagcagtcaacactggggacaacaacggggggaagccaaatgctctcaccaacagc tgggttttaaaccttctcgcagttttccccaacgccccagtttcagactgcgagggaggg agcaggacttcaggctcctctcggccagtgcagacgagcgaaagagaaagagggaggggc ccacggccccgaggggcgtgggaggcagttcgggccagctcggccgcgcatccgtccctt cccgtgcggtaccacctgcggaccactctcctagcccgagcttcagggcctacagagctg cggggcgcggccgccggcctgggccccaatcccgcactctacacactcctactgctgcca ccaccgcctccaactcggcggccactccagccctctaagcccggaaacagccgccagcag cggccaagatgcgcatgcgcggggacgtggcattacgtggcggctcgaaggttgaaggca aaaggggagcggaggcgagaggaacctcttagctag