GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:36:09 Sequence gi568815586r:43695588_43904574 : 208987 bp : 38.33% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 537 532 6 1.05 1.03 Term - 3717 3606 112 2 1 28 49 196 0.622 6.75 1.02 Intr - 14801 14716 86 2 2 99 44 43 0.246 -1.30 1.01 Init - 15666 15601 66 0 0 68 108 27 0.486 3.82 1.00 Prom - 22583 22544 40 -3.85 2.08 PlyA - 23424 23419 6 1.05 2.07 Term - 35115 34789 327 0 0 132 41 160 0.966 9.42 2.06 Intr - 41072 40794 279 2 0 27 106 151 0.479 7.55 2.05 Intr - 46968 46870 99 0 0 56 56 98 0.812 2.79 2.04 Intr - 50651 50459 193 1 1 75 116 113 0.999 11.37 2.03 Intr - 53022 52863 160 1 1 82 75 163 0.999 12.52 2.02 Intr - 59674 58749 926 0 2 149 91 565 0.984 52.84 2.01 Init - 64019 63721 299 2 2 75 78 141 0.178 8.70 2.00 Prom - 67577 67538 40 -5.85 3.00 Prom + 69881 69920 40 -6.35 3.01 Init + 72525 72685 161 2 2 57 87 78 0.819 3.95 3.02 Intr + 75633 75778 146 0 2 57 97 85 0.994 5.31 3.03 Intr + 76593 76775 183 1 0 44 113 122 0.965 9.14 3.04 Intr + 77325 77485 161 1 2 68 92 186 0.988 15.79 3.05 Intr + 78378 78442 65 2 2 68 106 99 0.484 6.30 3.06 Intr + 86720 86903 184 2 1 54 103 222 0.998 19.17 3.07 Intr + 88799 88934 136 1 1 52 47 88 0.629 0.32 3.08 Intr + 90100 90284 185 2 2 34 119 52 0.477 1.49 3.09 Intr + 90812 90970 159 1 0 13 91 194 0.953 11.46 3.10 Term + 96047 96217 171 1 0 -14 43 141 0.138 -3.76 3.11 PlyA + 96287 96292 6 1.05 4.09 PlyA - 96949 96944 6 1.05 4.08 Term - 100168 99998 171 1 0 88 39 180 0.954 9.94 4.07 Intr - 102246 102121 126 0 0 63 91 147 0.986 12.46 4.06 Intr - 103915 103811 105 1 0 87 107 44 0.978 5.69 4.05 Intr - 104943 104848 96 0 0 21 62 109 0.090 0.89 4.04 Intr - 106877 106699 179 0 2 27 103 77 0.106 1.82 4.03 Intr - 108985 108908 78 2 0 89 49 65 0.095 1.30 4.02 Intr - 110443 110225 219 2 0 74 -8 171 0.117 3.45 4.01 Init - 110760 110634 127 0 1 53 87 237 0.508 18.47 4.00 Prom - 135087 135048 40 -1.05 5.00 Prom + 147108 147147 40 -8.05 5.01 Init + 149065 149341 277 0 1 54 111 114 0.377 7.50 5.02 Intr + 161326 161450 125 2 2 29 116 22 0.310 -1.52 5.03 Term + 168300 168812 513 2 0 69 37 271 0.610 13.46 5.04 PlyA + 169180 169185 6 1.05 6.00 Prom + 189313 189352 40 -3.25 6.01 Sngl + 191971 192150 180 0 0 95 45 167 0.905 7.88 6.02 PlyA + 194019 194024 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 104931 104848 84 0 0 42 62 112 0.844 4.87 S.002 Term - 128903 128766 138 2 0 96 50 103 0.929 4.28 S.003 Init - 130603 130526 78 1 0 91 51 56 0.898 1.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:43695588_43904574|GENSCAN_predicted_peptide_1|87_aa MNKTENDPSSEDGGRELVNNKQEKRGKSGEDYVLHVGYQLSYSRIGHQPESFRTNAPKDM ALMSGGMPGSSSRVELEKTTQAQEEQF >gi568815586r:43695588_43904574|GENSCAN_predicted_CDS_1|264_bp atgaacaagacagagaatgatccttctagtgaagatggtgggagggagctggtaaacaat aaacaagagaagagagggaagagtggggaagactatgtcttgcatgttggataccagctc agctacagcaggatagggcaccagccagagtcattccgaaccaatgcaccaaaagatatg gctctgatgagtggaggaatgccaggatcttcatctcgagttgaattagaaaaaacgaca caggcacaggaggagcagttttaa >gi568815586r:43695588_43904574|GENSCAN_predicted_peptide_2|760_aa MSHRTWTGFAFIPLSPSGFQADRYTFFPHLRMYSSDPISRLHFLIPWKLGGRRNKRITIV WRLSELLAPQASVRCWPDFLVLGKRDFISQSGSRNPGPERHCYRRMEEDTDYRIRFSSLC FFNDHVGFHGTIKSSPSDFIVIEIDEQGQLVNKTIDEPIFKISEIQLEPNNFPKKPKLDL QNLSLEDGRNQEVHTLIKYTDGDQNHQSGSEKEDTIVDGTSKCEEKADVLSSFLDEKTHE LLNNFACDVREKWLSKTELIGLPPEFSIGRILDKNQRASLHSAIRQKFPFLVTVGKNSEI VVKPNLEYKELCHLVSEEEAFDFFKYLDAKKENSKFTFKPDTNKDHRKAVHHFVNKKFGN LVETKSFSKMNCSAGNPNVVVTVRFREKAHKRGKRPLSECQEGKVIYTAFTLRKENLEMF EAIGFLAIKLGVIPSDFSYAGLKDKKAITYQAMVVRKVTPERLKNIEKEIEKKRMNVFNI RSVDDSLRLGQLKGNHFDIVIRNLKKQINDSANLRERIMEAIENVKKKGFVNYYGPQRFG KGRKVHTDQIGLALLKNEMDAKGTLSLMPEFKVRERALLEALHRFGMTEEGCIQAWFSLP HSMRIFYVHAYTSKIWNEAVSYRLETYGARVVQGDLVCLDEDIDDENFPNSKVVLPVLGY NIQYPKNKVGQWYHDILSRDGLQTCRFKVPTLKLNIPGCYRQILKHPCNLSYQLMEDHDI DVKTKGSHIDETALSLLISFDLDASCYATVCLKEIMKHDV >gi568815586r:43695588_43904574|GENSCAN_predicted_CDS_2|2283_bp atgagccaccgcacctggacagggtttgcttttatccctttgtctccctctggcttccaa gctgacaggtacacttttttcccccatctgcgaatgtactcatctgatcccataagccgc ctccactttctaattccctggaagcttggaggccggagaaacaagcgaattacaatcgtg tggaggctcagcgagctgctggcgccacaggcatctgtcaggtgctggcctgacttcctg gttctggggaaaagagacttcatttcgcagtcgggttcacggaatccagggcctgagagg cactgttatagaagaatggaagaagatacagattatagaatcaggtttagttctttgtgt ttctttaatgatcacgttggatttcatggcactataaaaagctcaccaagtgactttatt gttattgaaattgatgaacagggacagttagttaataagaccatcgatgagcctattttc aagattagtgaaatacaacttgagccaaataattttcccaaaaaaccaaaactagatctt caaaatctgtccttagaagatggaagaaaccaagaagttcatactttgattaagtacact gatggtgaccaaaatcatcagtctggttcagaaaaggaagatactatcgttgatggaact tccaaatgtgaagaaaaagctgatgttttaagctcctttttggatgaaaaaactcatgag ttactgaataattttgcctgtgatgtaagagagaagtggctttctaaaacagagctaatt ggactacctcctgaattctcaataggcagaatccttgacaaaaaccagagggctagttta cacagtgccattaggcagaaatttccatttttagtaactgtaggaaaaaacagtgaaatt gttgtaaaaccaaatcttgaatataaagaactttgtcatttggtatctgaagaggaagca tttgacttttttaaatatttggatgcaaagaaagaaaattccaaatttacctttaaacct gatacaaacaaagaccacagaaaagctgtccaccattttgtcaacaaaaagtttggaaac cttgtggaaaccaaatctttttctaaaatgaattgcagtgctggtaatccgaatgtggtg gtaacagtaagatttcgggaaaaagcacacaaacgtgggaaaaggcctctttctgaatgc caagaaggaaaagttatatatacagcttttaccctacgaaaggaaaacctggaaatgttt gaagcgattggttttttagctatcaaacttggtgttattccttcggattttagttatgca ggccttaaagacaagaaagccatcacctatcaagcaatggttgttagaaaagtgactcca gagaggttgaaaaatattgaaaaagaaattgaaaagaaaagaatgaatgtctttaatatt cggtctgtagatgattccctgagacttggtcagctcaaaggaaatcactttgatattgtc attagaaatttaaaaaaacaaataaatgattctgcaaacctgagggagagaattatggaa gcaatagaaaatgttaagaaaaaaggctttgtgaattactatggaccacagagatttggg aagggaaggaaagttcacacagaccaaattggactagctttgctgaagaatgaaatggat gctaaaggcacactttcattgatgcctgaattcaaagtgcgtgagagagcattgttggag gcattgcaccgctttggcatgaccgaggaaggttgtatccaggcatggttctctttaccc cattccatgcgcatattctatgttcacgcatataccagcaaaatttggaatgaggcagta tcttacagacttgaaacctatggagcaagagtagtgcagggtgatttggtctgtttggat gaagacattgatgacgagaatttcccaaatagtaaagtggttcttccagtacttggatac aatattcagtacccgaagaacaaagtagggcagtggtaccatgacatacttagcagagat ggactacagacatgtaggtttaaagtacctactctgaaactgaatataccaggttgctat agacagattttgaaacatccctgtaatctctcataccaactaatggaagatcatgacatt gatgtcaaaacgaaaggttcccacattgatgaaacagctttgtctcttttgatctctttt gatcttgatgcttcatgctatgctaccgtttgtctgaaggaaataatgaagcatgacgtt taa >gi568815586r:43695588_43904574|GENSCAN_predicted_peptide_3|516_aa MNKPITPSTYVRCLNVGLIRKLSDFIDPQEGWKKLAVAIKKPSGDDRYNQFHIRRFEALL QTGKSPTSELLFDWGTTNCTVGDLVDLLIQNEFFAPASLLLPDAVPKTANTLPSKEAITV QQKQMPFCDKDRTLMTPVQNLEQSYMPPDSSSPENKSLEVSDTRFHSFSFYELKNVTNNF DERPISVGGNKMGEGGFGVVYKGYVNNTTVAVKKLAAMVDITTEELKQQFDQEIKVMANA NILLDEAFTAKISDFGLARASEKFAQTVMTSRIVGTTAYMAPEALRGEITPKSDIYSFGV HQSQIPQVKSSVPKTAIPPDTSCKFRPPELLTNCLQVGVPMTSSVGIHSEKCVVRQFHCV NLIECTYTNLDDMTYDIPGLYGIAPRLQTYAIQHNTLLNTVSNCNTLLDIKEEIEDEEKT IEDYIDKKMNDADSTSVEAMYSVASQCLHEKKNKRPDIKKNQLNEDPYWKAFDAQEWVST AILCGRTHGFRRARDFASASEFSCGQKENDGGQILW >gi568815586r:43695588_43904574|GENSCAN_predicted_CDS_3|1551_bp atgaacaaacccataacaccatcaacatatgtgcgctgcctcaatgttggactaattagg aagctgtcagattttattgatcctcaagaaggatggaagaagttagctgtagctattaaa aaaccatctggtgatgatagatacaatcagtttcacataaggagatttgaagcattactt caaactggaaaaagtcccacttctgaattactgtttgactggggcaccacaaattgcaca gttggtgatcttgtggatcttttgatccaaaatgaattttttgctcctgcgagtcttttg ctcccagatgctgttcccaaaactgctaatacactaccttctaaagaagctataacagtt cagcaaaaacagatgcctttctgtgacaaagacaggacattgatgacacctgtgcagaat cttgaacaaagctatatgccacctgactcctcaagtccagaaaataaaagtttagaagtt agtgatacacgttttcacagtttttcattttatgaattgaagaatgtcacaaataacttt gatgaacgacccatttctgttggtggtaataaaatgggagagggaggatttggagttgta tataaaggctacgtaaataacacaactgtggcagtgaagaagcttgcagcaatggttgac attactactgaagaactgaaacagcagtttgatcaagaaataaaagtaatggcaaatgca aatatcttactggatgaagcttttactgctaaaatatctgactttggccttgcacgggct tctgagaagtttgcccagacagtcatgactagcagaattgtgggaacaacagcttatatg gcaccagaagctttgcgtggagaaataacacccaaatctgatatttacagctttggtgtg catcagagtcagatcccacaggttaagagctcagtccccaagactgccatcccaccagac accagttgtaagtttaggcctccagaacttctgaccaactgccttcaagttggagttcct atgacctcctctgtgggaatacattccgagaaatgtgttgttaggcagtttcattgtgtg aacctcatagagtgtacttacacaaacctagatgatatgacctacgacatacctgggcta tatggtattgctcctaggctacaaacgtatgctatacagcataatactctcctgaatact gtaagcaattgtaacacactgctagatattaaagaagaaattgaagatgaagaaaagaca attgaagattatattgataaaaagatgaatgatgctgattccacttcagttgaagctatg tactctgttgctagtcaatgtctgcatgaaaagaaaaataagagaccagacattaagaag aatcagctgaatgaagatccctactggaaggcatttgatgctcaggaatgggtcagtact gccattttgtgtggtagaacccatggctttcgcagggcaagagattttgccagtgctagt gagttttcctgtggccagaaagaaaatgatggaggccagatcctctggtga >gi568815586r:43695588_43904574|GENSCAN_predicted_peptide_4|366_aa MTRRGRRRSSHFLGPPAGAAGCTQRRSRELAAAAMSHQTGIQARSLLPSRCRLIRNSGER RKLAFEAAWLKGACLPPSPTVAQLSPFWSRREELGQWRRLGLQYAVENQASFLFHASEDV KEIFARARNGKYRLLKISIENEQLVIGSYSQPSDSWDKDYDSFVLPLLEDKQPCYILFRL DSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDEVFGTVKEDVSLHG YKKYLLSQSSPAPLTAAEEELRQIKINEVQTDVGVDTKHQTLQGVAFPISREAFQALEKL NNRQLNYVQLIEIDNGDELTADFLYEEVHPKQHAHKQSFAKPKGPAGKRGIRRLIRGPAE TEATTD >gi568815586r:43695588_43904574|GENSCAN_predicted_CDS_4|1101_bp atgacgcgccggggccggcggaggagcagccacttcctggggccgccggccggggccgct ggctgcactcagcgccggagccgggagctagcggccgccgccatgtcccaccagaccggc atccaagcccggtccctgctgccgtcacgttgcagattgatccgaaattcgggggagcga aggaaattggctttcgaggcggcctggttgaagggggcgtgccttccaccgtccccgaca gtcgcccaactttcccccttttggtcacgtagggaagagctgggacagtggaggcgactg gggcttcagtacgctgtcgagaaccaggcgagttttctttttcatgcaagtgaagatgtt aaagagatctttgccagagccagaaatggaaagtacagacttctgaaaatatctattgaa aatgagcaacttgtgattggatcatatagtcagccttcagattcctgggataaggattat gattcctttgttttacccctgttggaggacaaacaaccatgctatatattattcaggtta gattctcagaatgcccagggatatgaatggatattcattgcatggtctccagatcattct catgttcgtcaaaaaatgttgtatgcagcaacaagagcaactctgaagaaggaatttgga ggtggccacattaaagatgaagtatttggaacagtaaaggaagatgtatcattacatgga tataaaaaatacttgctgtcacaatcttcccctgccccactgactgcagctgaggaagaa ctacgacagattaaaatcaatgaggtacagactgacgtgggtgtggacactaagcatcaa acactacaaggagtagcatttcccatttctcgagaagcctttcaggctttggaaaaattg aataatagacagctcaactatgtgcagttgatcgagatagacaatggggatgagttgact gcagacttcctttatgaagaagtacatcccaagcagcatgcacacaagcaaagttttgca aaaccaaaaggtcctgcaggaaaaagaggaattcgaagactaattaggggcccagcggaa actgaagctactactgattaa >gi568815586r:43695588_43904574|GENSCAN_predicted_peptide_5|304_aa MGKDFRYYFQHPWSRMIVAYLVIFFNFLIFAEDPVSHSQTEANVIVVGNCFSFVTNKYPR GVGWRILKVLLWLLAILTGLIAGKFLFHQRLFGTPFFRGRFGVLELLNFVALFKKLILLL CVPVMFSLLEQESKGGVEGEAQVGTGAAPGACGPARVPGGHGLGGPTLGAAGQPALDSEG LSTRASSCGGCAGSPSSAGPPALRWISHGALAASLGGRAQDLQPAMPESPSLPPQWVPAR PEPPRPAPPPASWHPVPPTAQGLRSAGTLRRDWQAALSAAPVRDPLGEASWASESSGDLE NLYV >gi568815586r:43695588_43904574|GENSCAN_predicted_CDS_5|915_bp atgggtaaagactttcgttactatttccagcatccctggtctcgcatgattgtggcttac ttggtgatcttctttaacttcttaatatttgcggaggacccagtttctcatagccaaaca gaagccaatgttattgttgttggaaactgtttttcatttgttacaaataaataccctaga ggagttggctggaggattttgaaggtgcttctatggctacttgccattctcacaggacta atagctggcaaatttctgttccatcagcgtttgtttggaacacccttcttcagagggaga tttggtgtgttggagctactgaactttgtagcccttttcaaaaagctgatcttattactt tgtgttcctgttatgttctctctcctagagcaggaaagcaagggaggtgtggagggagag gcacaggtgggaaccggggctgcgcccggcgcttgcgggccagctagagttccgggtggg catgggcttggcggccccacactcggagctgccggccagccagccctggacagtgagggg cttagcacccgggccagcagctgtggagggtgcgctgggtcccccagcagtgctggccca ccagcgctgcgctggatttctcacggggctttagctgcctccctggggggcagggctcag gacctgcagcctgccatgcctgagtctccctcactccccccgcagtgggttcctgcgcgg cctgagcctccccgaccagcacctccccctgcttcatggcacccagtcccacccaccgcc caagggctgaggagtgcaggcacactgcgcagggactggcaggcagctctatctgcagcc ccagtgcgagatccactgggtgaagccagctgggcttctgagtctagtggggacttggag aacctttatgtctag >gi568815586r:43695588_43904574|GENSCAN_predicted_peptide_6|59_aa MPFSQCRHLISGQVPLLQKTASTLTPTALMHLTAKAWMEKNSAEVTLMTAESSAEVVLI >gi568815586r:43695588_43904574|GENSCAN_predicted_CDS_6|180_bp atgcccttcagccagtgcagacatttgatctcaggacaggtgcccctgctgcagaagacc gccagcaccctgactccaacagctctgatgcatctcactgccaaggcctggatggaaaaa aattcagcagaagttacacttatgacggcagaaagttcagcagaagttgtacttatttag