GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:23:53

Sequence gi568815585r:47977142_48190405 : 213264 bp : 40.77% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -   2631   2523  109  2  1   91   87    23 0.013   1.87
 1.03 Intr -  11840  11741  100  0  1   73   99    92 0.456   7.05
 1.02 Intr -  19882  19702  181  1  1   89   47   199 0.191  14.42
 1.01 Init -  24128  23991  138  2  0   82   72   123 0.405  10.40
 1.00 Prom -  27598  27559   40                              -5.85

 2.00 Prom +  29414  29453   40                              -7.15
 2.01 Init +  32938  33063  126  0  0   57  100   131 0.898  11.41
 2.02 Intr +  34664  34802  139  1  1   44   99     9 0.050  -3.28
 2.03 Term +  38748  39838 1091  1  2   23   49   332 0.022  14.06
 2.04 PlyA +  40655  40660    6                               1.05

 3.00 Prom +  47295  47334   40                              -4.95
 3.01 Init +  54752  54898  147  1  0   75   65   110 0.826   7.54
 3.02 Intr +  60587  60843  257  1  2   32   29   328 0.901  16.62
 3.03 Intr +  61837  61941  105  1  0   98   75    73 0.959   5.41
 3.04 Intr +  63779  63975  197  2  2   88   69   152 0.997  11.44
 3.05 Intr +  68519  68631  113  0  2  123   48    80 0.699   6.68
 3.06 Intr +  81723  81801   79  2  1   64  115    86 0.498   7.11
 3.07 Term +  82360  82739  380  2  2   70   39   245 0.438  11.77
 3.08 PlyA +  84341  84346    6                               1.05

 4.13 PlyA -  84457  84452    6                               1.05
 4.12 Term -  90494  90364  131  0  2   51   33   126 0.608   0.76
 4.11 Intr - 100170 100012  159  1  0  102   67   128 0.334  11.14
 4.10 Intr - 102834 102703  132  1  0   73   95   113 0.983  10.20
 4.09 Intr - 104590 104504   87  2  0   72   97    21 0.525   0.42
 4.08 Intr - 106287 106230   58  0  1   37   96    44 0.961  -2.36
 4.07 Intr - 109311 109141  171  0  0   70  119   159 0.998  16.42
 4.06 Intr - 113277 113211   67  2  1   87  111     8 0.958   0.99
 4.05 Intr - 118135 117919  217  2  1   69   69   214 0.170  14.24
 4.04 Intr - 129368 129236  133  2  1   25   61    82 0.032  -1.50
 4.03 Intr - 131409 131218  192  0  0  104   98    18 0.473   3.17
 4.02 Intr - 148750 148629  122  2  2   67   58    98 0.375   3.99
 4.01 Init - 150878 150818   61  2  1   54  119    16 0.478   2.76
 4.00 Prom - 162026 161987   40                              -5.55

 5.00 Prom + 163764 163803   40                              -6.35
 5.01 Init + 173588 173636   49  1  1   65   58    83 0.410   4.16
 5.02 Intr + 181048 181116   69  2  0   19   69   120 0.014   1.44
 5.03 Intr + 189360 189441   82  1  1  119   86    50 0.447   5.68
 5.04 Intr + 205510 205630  121  1  1   11   77   122 0.016   2.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  19882  19664  219  1  0   89   54   219 0.809  14.66
S.002 Sngl +  38732  39838 1107  1  0   58   49   322 0.907  21.92

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:47977142_48190405|GENSCAN_predicted_peptide_1|176_aa
MAASMFYGRLVAVATLRNHRPRTAQRAAAQVMPSSPLSSRGSAGERVLGSSGLFNNHGLQ
VQQQQQRNLSLHEYMSMELLQEAGVSVPKGYVAKSPDEAYAIAKKLGSKDVVIKAQVLAG
GRGKGTFESGLKGGVKIVFSTSSAVLNRSGKSGHPCFVSDVRGKAFSFSFLCMMLA

>gi568815585r:47977142_48190405|GENSCAN_predicted_CDS_1|528_bp
atggcggcctccatgttctacggcaggctagtggccgtggccacccttcggaaccaccgg
cctcggacggcccagcgggctgctgctcaggtaatgccttccagtccgctgtcgtctcga
gggtcggcaggagaaagggttctgggaagttctggattgtttaataaccatggactccaa
gtacagcagcaacagcaaaggaatctctcactacatgaatacatgagtatggaattattg
caagaagctggtgtctccgttcccaaaggatatgtggcaaagtcaccagatgaagcttat
gcaattgccaaaaaattaggttcaaaagatgtcgtgataaaggcacaggttttagctggt
ggtagaggaaaaggaacatttgaaagtggcctcaaaggaggagtgaagatagttttctcg
acatctagtgctgtgttgaacagaagcggcaagagtgggcatccttgctttgtttctgat
gttagaggaaaagctttcagcttctcatttttgtgtatgatgttggct

>gi568815585r:47977142_48190405|GENSCAN_predicted_peptide_2|451_aa
MLKFAILKVEGPGDNSVVEVHGRGREISGNEETGVFEGPPRKFRAVLHSSVRTNEYKSLY
SQEKLCKVPLSLTSKVIVSEILWKASSPENPIVSAQNLLKLISNFSKVSGYKINVQKSQA
FLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDEKDLFKENYKPLLNEIKEDTKKWK
NIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMPFFTELEKTTLKFIWNQKRAHITKST
LSQKNEAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKS
DKNKKWGKDSLFNKWCWENWLAICRKLKLDSFLTPYTKINSRWIKDLNVRPKTIKTLEEN
LGNTIQDIGMGKDFMCETPKAIATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIF
ATYSSDKGLISRIYNELKQIYKKKTTPSKSG

>gi568815585r:47977142_48190405|GENSCAN_predicted_CDS_2|1356_bp
atgcttaaatttgctattttgaaggtagaaggtccaggtgataacagtgtcgtggaagta
catggaagaggtagagagattagtggaaatgaggaaactggagtttttgagggccctcca
cggaagtttcgggcagttcttcatagtagtgtgagaacaaacgaatacaaaagtctatac
tctcaagagaagctttgcaaagtgcctctgtccctgacttccaaagtcattgtttcagaa
attctctggaaagcttcctccccagaaaaccccatcgtctcagcccaaaatctcctcaaa
ctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagca
ttcttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcaca
attgcttcaaagagaataaaatacctaggaatccaacttacaagggatgagaaggacctc
ttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaagaaatggaag
aacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaag
gtaatttatagattcaatgccatccccatcaagctaccaatgcctttcttcacagaactg
gaaaaaactactttaaagttcatatggaaccaaaaaagagcccacatcaccaagtcaacc
ctaagccaaaagaatgaagctggaggcatcacactacctgacttcaaactatactacaag
gctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaatggaac
agaacagagccctcagaaataatgccgcatatctacaactatctgatctttgacaaatct
gacaaaaacaagaaatggggaaaggattccctatttaataaatggtgctgggaaaactgg
ctagccatatgtagaaagctgaaactggattccttccttacaccttatacaaaaattaat
tcaagatggattaaagacttaaatgttagacctaaaaccataaaaaccctggaagaaaac
ctaggcaataccattcaggacataggtatgggcaaggacttcatgtgtgaaacaccaaaa
gcaatagcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgc
acagcaaaagaaactaccatcagagtgaacaggcaacctacagaatgggagaaaattttt
gcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatt
tacaagaaaaaaacaaccccatcaaaaagtgggtga

>gi568815585r:47977142_48190405|GENSCAN_predicted_peptide_3|425_aa
MKSGRVKGMRKDSLREKVGPGGQRKYGGCEGPELWKPTLFVGDQTKKQVARPPARYDGQR
TAARAAARSRSRSRGDQLQASALRPPGEEERLGWSWQFPTPWRSSGVRVSSRGREETGAC
AEGRTHAVLASVCRAVLEEEREQAIQVTDAALIHPLVSSGHDIHMCAFSGETWEECAQRE
TWEEAALHLKNVHFASVVNSFIEKENYHYVTILMKGEVDVTHDSEPKNVEPEKNESWEWV
PWEELPPLDQLFWGLRCLKEQGYDPFKEDLNHLLKTDTARSPRKPTGPSQTLWVTLTVEE
PDKPYKLVQDLRLINQIVLPIHPVVPNPYTLLSSIPPSTTHYSVLDLKHAFFTIPLHPSS
QPLFTFTWTDADTHQAQQFTWAVLPQAFTDSPHYFSQAQISSSSVTCLGVILIKTHVHSL
LIVSD

>gi568815585r:47977142_48190405|GENSCAN_predicted_CDS_3|1278_bp
atgaagtccggcagagtcaaaggaatgagaaaagacagtttgagagagaaagtgggtcca
gggggccaacgcaagtatggaggctgtgaaggccctgagctctggaagcccacgctattt
gttggtgatcaaacaaagaaacaggtggcgcgtcctcccgcgcgctatgacggccagcgc
acagccgcgcgggcggcggccaggagtcggagtcggagtcgtggtgaccagctgcaagca
tccgcgttgcgtcctcctggggaagaggaaaggctcggttggagctggcagtttccaact
ccctggaggtcatctggagttcgggtgagcagccgcggacgcgaggaaaccggggcgtgc
gcagagggacgaactcacgccgtgcttgcttccgtgtgccgtgctgtgttggaggaggag
agagagcaagccattcaggtgactgatgctgctctgatccatccattggtgtcctctggg
catgatatccacatgtgtgctttctcaggtgaaacctgggaagaatgtgctcaaagggaa
acctgggaagaagcagctcttcacctgaaaaatgttcactttgcctcagttgtgaattct
ttcattgagaaggagaattaccattatgttactatattaatgaaaggagaagtggatgtg
actcatgattcagaaccaaagaatgtagagcctgaaaaaaatgaaagttgggagtgggtt
ccttgggaagaactacctcccctggaccagcttttctggggactgcgttgtttaaaagaa
caaggctatgatccatttaaagaagatctgaaccatctgctgaagactgacactgcccga
tcgcctcggaagcctacaggaccatcacagacactctgggtaactctcacagtggaagaa
ccagacaagccttacaagttagttcaggatctgcgccttatcaaccaaattgttttgcct
atccaccccgtggtgccaaacccatatactctcctatcctcaatacctccctcaacaacc
cattattctgttctggatctcaaacatgctttctttactattcctttgcacccttcatcc
cagcctctcttcactttcacttggactgacgctgacacccatcaggctcagcaatttacc
tgggctgtactgccacaagccttcactgacagcccccattacttcagtcaagcccaaatt
tcttcctcatctgttacctgtctcggtgtaattctcataaaaacacatgtgcactccctg
ctgatcgtgtctgactga

>gi568815585r:47977142_48190405|GENSCAN_predicted_peptide_4|509_aa
MHGKKETPKNISKFSTVSISGEWNVLGEAEALEFEVKESKEGSVHDLKRYFLLVKMLTGH
LDLIQDTTLPLVTVSPLAPLGCQFLRASLFLMALNILKSMGQVFCRKPFTWDLSDGVRVW
RKTPKFRYSSPLNLLWKLDPQCWRRSLGEGVWIMWQIPHELLGGILLVMRFGPESPRRPN
RWRRCPQCASRPKGGALARPLFPLLGDSSAFRARLLATHLRQLALRVRRWRDSGENGCVF
EWELIEMLAISRNQKLLQAGEENQVLELLIHRDGEFQELMKLALNQGKIHHEMQVLEKEV
EKRDSDIQQLQKQLKEAEQILATAVYQAKEKLKSIEKARKGAISSEEIIKYAHRISASNA
VCAPLTWVPGDPRRPYPTDLEMRSGLLGQMNNPSTNGVNGHLPGDALAAGRLPDVLAPQY
PWQSNDMSMNMLPPNHSSDFLLEPPGHNKENEDDVEIMSTDSSSSSSHALKDVKQDGGLQ
RSLLLTIQKVMQSTPDWLQLKTNLTDPFS

>gi568815585r:47977142_48190405|GENSCAN_predicted_CDS_4|1530_bp
atgcatgggaaaaaggaaaccccgaaaaatataagcaaattctcaactgtgagtatttct
ggagagtggaatgtccttggagaggccgaggcactggaatttgaagtgaaagaaagtaaa
gagggctctgttcatgacttaaaacgttatttccttcttgtcaagatgcttactggccac
ctggatctcattcaggacaccacattaccattagtcacagtgtctcctctggctcctctt
ggctgtcagtttctcagagcgtccttgtttttaatggccttgaatattttgaagagtatg
ggtcaggtgttttgtagaaagcccttcacttgggatttatctgatggagttagagtttgg
aggaaaaccccaaagtttagatattcatcccctctaaatctcttgtggaaacttgatcct
cagtgttggaggcggagcctcggagaaggtgtttggatcatgtggcagatccctcatgaa
cttcttggtggcatcctcctggtaatgaggtttggaccagaaagccctcgccggcccaac
agatggcgccggtgtccgcagtgcgcctccaggcccaaagggggcgctctcgccaggcct
ctttttcctctcctgggtgactcttccgccttccgagcccgccttctcgccacgcacctg
cgtcagctcgctctgcgcgtgcgccggtggcgggactctggggaaaatggctgcgtcttc
gagtgggaacttatagaaatgctggcaatttcaagaaaccaaaagttgttacaggctgga
gaggaaaaccaggtcctggagttgttaattcaccgagatggggaatttcaagaactaatg
aaattggcacttaatcagggaaaaattcatcatgaaatgcaagttttagaaaaagaagta
gagaagagagacagtgatattcagcagctacaaaaacagctaaaggaagcagaacaaata
ctggcaacagctgtttaccaagcgaaggagaaactcaagtcaatagaaaaagcaagaaaa
ggtgctatctcctctgaagaaataattaagtatgcacataggatcagtgcaagtaatgct
gtatgtgctccactgacctgggttccaggggacccccggagaccctacccaactgattta
gagatgagaagtgggttactgggtcagatgaacaatccttccactaatggcgtgaatggc
catttaccaggagatgcacttgcagcaggaagattgccagatgtccttgctccacagtat
ccatggcagtcaaatgacatgtcgatgaatatgttaccaccaaatcatagtagtgacttt
ttgttggaacctcctgggcataataaagaaaatgaagatgatgtagagattatgtcaacg
gactcctcaagcagtagtagtcatgctctcaaggatgtaaaacaagatggaggcctgcag
cgaagtttgctgctgactatacagaaagtcatgcaaagcacaccagattggctacagctt
aagacaaaccttacagatcctttttcatag

>gi568815585r:47977142_48190405|GENSCAN_predicted_peptide_5|107_aa
MGPQAKGCGQPLEVGKENKNESSDRARNSGKRADRQRAGGDLKIWNNSQPGAYSAATAAA
GRTLSLGMSERDLLEALRRMKYQFEKSLYQSNMNSAKIEERANIKLM

>gi568815585r:47977142_48190405|GENSCAN_predicted_CDS_5|321_bp
atggggccacaagccaagggatgtggacagcctttagaagttggaaaagaaaataagaat
gaaagctctgacagagccaggaacagtggcaagagagctgatcggcaaagagcaggtggt
gacttgaaaatttggaataattcacaacctggtgcttattcagcagcaacagcagcagca
ggcagaactctgagtttgggcatgtctgaacgggatctccttgaggcactaagaaggatg
aaatatcagtttgaaaagagcctctatcagagcaacatgaattctgctaaaattgaagaa
agagcaaatatcaaattgatg