GENSCAN 1.0	Date run: 4-Nov-116	Time: 08:40:38

Sequence gi568815586r:43630379_43855245 : 224867 bp : 37.83% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  19765  19858   94  0  1  109   75    79 0.168   7.32
 1.02 Intr +  29530  29712  183  2  0  -20   36   212 0.067   4.04
 1.03 Intr +  30168  30314  147  1  0  -20   21   214 0.106   3.19
 1.04 Term +  30386  30723  338  0  2   60   47   398 0.582  26.65
 1.05 PlyA +  31321  31326    6                               1.05

 2.03 PlyA -  31989  31984    6                               1.05
 2.02 Term -  45539  45053  487  1  1    9   48   268 0.023   8.09
 2.01 Init -  55772  55438  335  2  2   56   54   164 0.054   6.61
 2.00 Prom -  68581  68542   40                              -6.05

 3.04 PlyA -  68630  68625    6                               1.05
 3.03 Term -  68926  68815  112  0  1   28   49   196 0.569   6.75
 3.02 Intr -  80010  79925   86  0  2   99   44    43 0.232  -1.30
 3.01 Init -  80875  80810   66  1  0   68  108    27 0.483   3.82
 3.00 Prom -  87792  87753   40                              -3.85

 4.08 PlyA -  88633  88628    6                               1.05
 4.07 Term - 100324  99998  327  1  0  132   41   160 0.966   9.42
 4.06 Intr - 106281 106003  279  0  0   27  106   151 0.479   7.55
 4.05 Intr - 112177 112079   99  1  0   56   56    98 0.812   2.79
 4.04 Intr - 115860 115668  193  2  1   75  116   113 0.999  11.37
 4.03 Intr - 118231 118072  160  2  1   82   75   163 0.999  12.52
 4.02 Intr - 124883 123958  926  1  2  149   91   565 0.984  52.84
 4.01 Init - 129228 128930  299  0  2   75   78   141 0.178   8.70
 4.00 Prom - 132786 132747   40                              -5.85

 5.00 Prom + 135090 135129   40                              -6.35
 5.01 Init + 137734 137894  161  0  2   57   87    78 0.819   3.95
 5.02 Intr + 140842 140987  146  1  2   57   97    85 0.994   5.31
 5.03 Intr + 141802 141984  183  2  0   44  113   122 0.965   9.14
 5.04 Intr + 142534 142694  161  2  2   68   92   186 0.988  15.79
 5.05 Intr + 143587 143651   65  0  2   68  106    99 0.484   6.30
 5.06 Intr + 151929 152112  184  0  1   54  103   222 0.998  19.17
 5.07 Intr + 154008 154143  136  2  1   52   47    88 0.629   0.32
 5.08 Intr + 155309 155493  185  0  2   34  119    52 0.477   1.49
 5.09 Intr + 156021 156179  159  2  0   13   91   194 0.953  11.46
 5.10 Term + 161256 161426  171  2  0  -14   43   141 0.138  -3.76
 5.11 PlyA + 161496 161501    6                               1.05

 6.09 PlyA - 162158 162153    6                               1.05
 6.08 Term - 165377 165207  171  2  0   88   39   180 0.954   9.94
 6.07 Intr - 167455 167330  126  1  0   63   91   147 0.986  12.46
 6.06 Intr - 169124 169020  105  2  0   87  107    44 0.978   5.69
 6.05 Intr - 170152 170057   96  1  0   21   62   109 0.090   0.89
 6.04 Intr - 172086 171908  179  1  2   27  103    77 0.106   1.82
 6.03 Intr - 174194 174117   78  0  0   89   49    65 0.095   1.30
 6.02 Intr - 175652 175434  219  0  0   74   -8   171 0.117   3.45
 6.01 Init - 175969 175843  127  1  1   53   87   237 0.508  18.47
 6.00 Prom - 200296 200257   40                              -1.05

 7.02 PlyA - 203080 203075    6                               1.05
 7.01 Term - 205385 205226  160  1  1   71   34   138 0.046   3.13

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  55772  55398  375  2  0   56   50   172 0.852   6.09
S.002 Init - 170140 170057   84  1  0   42   62   112 0.844   4.87
S.003 Term - 194112 193975  138  0  0   96   50   103 0.929   4.28
S.004 Init - 195812 195735   78  2  0   91   51    56 0.898   1.61

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:43630379_43855245|GENSCAN_predicted_peptide_1|253_aa
VVQTQISKTADELISYWGTSFPPPFAASLTLYELEYCITIDISLWKFETSKYYATVIDAP
GHRDFIKNMITGTSQADCAVLIFAAGVGEFEAVTFAPVNVTTEVKSVEMHHEVLSEALPG
DNVDFDVKNVSVKEVHHGNVAGQISAGCAPVLDCHMAHIACKFAKLKKKTGSTSGKKLED
GPTFLKSGDAAIVDMVPGKPMCVESFSVYPPLSRFAVCDMRQTVAVGVIEAMDKKAAGAG
KVTKSTQKAQKAK

>gi568815586r:43630379_43855245|GENSCAN_predicted_CDS_1|762_bp
gtagtgcagacccaaatcagcaaaactgctgatgagctgatttcctactggggaacaagc
ttcccacctccctttgctgcatccttaaccttgtatgagcttgaatattgtatcaccatt
gatatctccctgtggaaatttgagaccagcaagtactacgcgactgtcattgatgcccca
ggacacagagacttcatcaaaaacatgattacagggacatctcaggctgactgtgctgtc
ctgatttttgctgctggtgttggtgaatttgaagctgtcacctttgctccggtcaatgtc
acaactgaagtcaagtctgttgaaatgcaccatgaagttttgagtgaagctcttcctggg
gacaatgtggacttcgatgtcaagaatgtgtctgtcaaggaggttcatcatggcaacgtt
gctggccaaatcagtgctggctgtgctcccgtactggattgccacatggctcacattgca
tgcaagtttgctaagctgaagaaaaagactggtagcacttctggtaaaaagctggaagat
ggccctactttcttgaagtctggtgatgctgccatcgttgatatggttcctggcaagccc
atgtgtgttgagagcttctcagtctacccacctctgagtcgctttgctgtttgtgatatg
agacagacagttgctgtgggtgtcatcgaagcaatggacaagaaggctgctggagctggc
aaggtcaccaagtctacccagaaagctcagaaggctaaatga

>gi568815586r:43630379_43855245|GENSCAN_predicted_peptide_2|273_aa
MILNEHAAFKHLFNKAHLAPPLIHLTLSGYSTRFREHRVGGKVTDQQDPKAEEFILVQNK
MKSLPCLLLSTQTRQPSDFSIFSPPFPAFYSTKPPLSSPPVLNELLGTPPRRGSTGVSLA
MGGVCGSQGSLPTLLPRQLEAAKLDTAGSGWFRLSRAGTGADLGEVLGQLSGHWDNRPGK
GRGISPVPQSQLRGRGVAWGFQRSRELWGLLSSHTLIWWVFLLSVSGSGNRPNQLGYFQV
IYAQITEPFQAFQSIKFPVAESKAIRPDTFQVL

>gi568815586r:43630379_43855245|GENSCAN_predicted_CDS_2|822_bp
atgattcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccatttaaccctgagtggatacagcacacgtttcagagagcacagggttggg
ggtaaggtcaccgatcaacaggatcccaaggcagaagaatttatcttagtacagaacaaa
atgaaaagtctcccatgtctactactttctacacagacacggcaaccatccgatttctca
atcttttccccacctttccccgctttctattccacaaaaccgccattgtcatccccgccc
gttctcaatgagctgttgggtacacctcccagacggggaagtactggggtgtccctggcg
atgggtggggtgtgtggctctcaggggtcactccccacactgctgccaaggcagctagag
gcagcaaagctggacactgctgggtcaggctggttcaggctctctagggcaggtacaggt
gctgaccttggtgaagttctggggcagctctcaggccactgggacaaccgtccagggaag
ggcagaggcatctctcctgtgccacagagccagctcagaggaagaggggtggcctggggt
tttcagcgcagtagggagctgtggggcctgctcagctcccacaccctgatctggtgggtc
ttcctcctctcagtgtctggctctggcaacaggcccaaccagttaggctacttccaagtc
atctatgcccagatcactgagccattccaggcattccagagtataaaattccctgtggca
gaaagtaaggctatcaggccagacacattccaggtcctgtga

>gi568815586r:43630379_43855245|GENSCAN_predicted_peptide_3|87_aa
MNKTENDPSSEDGGRELVNNKQEKRGKSGEDYVLHVGYQLSYSRIGHQPESFRTNAPKDM
ALMSGGMPGSSSRVELEKTTQAQEEQF

>gi568815586r:43630379_43855245|GENSCAN_predicted_CDS_3|264_bp
atgaacaagacagagaatgatccttctagtgaagatggtgggagggagctggtaaacaat
aaacaagagaagagagggaagagtggggaagactatgtcttgcatgttggataccagctc
agctacagcaggatagggcaccagccagagtcattccgaaccaatgcaccaaaagatatg
gctctgatgagtggaggaatgccaggatcttcatctcgagttgaattagaaaaaacgaca
caggcacaggaggagcagttttaa

>gi568815586r:43630379_43855245|GENSCAN_predicted_peptide_4|760_aa
MSHRTWTGFAFIPLSPSGFQADRYTFFPHLRMYSSDPISRLHFLIPWKLGGRRNKRITIV
WRLSELLAPQASVRCWPDFLVLGKRDFISQSGSRNPGPERHCYRRMEEDTDYRIRFSSLC
FFNDHVGFHGTIKSSPSDFIVIEIDEQGQLVNKTIDEPIFKISEIQLEPNNFPKKPKLDL
QNLSLEDGRNQEVHTLIKYTDGDQNHQSGSEKEDTIVDGTSKCEEKADVLSSFLDEKTHE
LLNNFACDVREKWLSKTELIGLPPEFSIGRILDKNQRASLHSAIRQKFPFLVTVGKNSEI
VVKPNLEYKELCHLVSEEEAFDFFKYLDAKKENSKFTFKPDTNKDHRKAVHHFVNKKFGN
LVETKSFSKMNCSAGNPNVVVTVRFREKAHKRGKRPLSECQEGKVIYTAFTLRKENLEMF
EAIGFLAIKLGVIPSDFSYAGLKDKKAITYQAMVVRKVTPERLKNIEKEIEKKRMNVFNI
RSVDDSLRLGQLKGNHFDIVIRNLKKQINDSANLRERIMEAIENVKKKGFVNYYGPQRFG
KGRKVHTDQIGLALLKNEMDAKGTLSLMPEFKVRERALLEALHRFGMTEEGCIQAWFSLP
HSMRIFYVHAYTSKIWNEAVSYRLETYGARVVQGDLVCLDEDIDDENFPNSKVVLPVLGY
NIQYPKNKVGQWYHDILSRDGLQTCRFKVPTLKLNIPGCYRQILKHPCNLSYQLMEDHDI
DVKTKGSHIDETALSLLISFDLDASCYATVCLKEIMKHDV

>gi568815586r:43630379_43855245|GENSCAN_predicted_CDS_4|2283_bp
atgagccaccgcacctggacagggtttgcttttatccctttgtctccctctggcttccaa
gctgacaggtacacttttttcccccatctgcgaatgtactcatctgatcccataagccgc
ctccactttctaattccctggaagcttggaggccggagaaacaagcgaattacaatcgtg
tggaggctcagcgagctgctggcgccacaggcatctgtcaggtgctggcctgacttcctg
gttctggggaaaagagacttcatttcgcagtcgggttcacggaatccagggcctgagagg
cactgttatagaagaatggaagaagatacagattatagaatcaggtttagttctttgtgt
ttctttaatgatcacgttggatttcatggcactataaaaagctcaccaagtgactttatt
gttattgaaattgatgaacagggacagttagttaataagaccatcgatgagcctattttc
aagattagtgaaatacaacttgagccaaataattttcccaaaaaaccaaaactagatctt
caaaatctgtccttagaagatggaagaaaccaagaagttcatactttgattaagtacact
gatggtgaccaaaatcatcagtctggttcagaaaaggaagatactatcgttgatggaact
tccaaatgtgaagaaaaagctgatgttttaagctcctttttggatgaaaaaactcatgag
ttactgaataattttgcctgtgatgtaagagagaagtggctttctaaaacagagctaatt
ggactacctcctgaattctcaataggcagaatccttgacaaaaaccagagggctagttta
cacagtgccattaggcagaaatttccatttttagtaactgtaggaaaaaacagtgaaatt
gttgtaaaaccaaatcttgaatataaagaactttgtcatttggtatctgaagaggaagca
tttgacttttttaaatatttggatgcaaagaaagaaaattccaaatttacctttaaacct
gatacaaacaaagaccacagaaaagctgtccaccattttgtcaacaaaaagtttggaaac
cttgtggaaaccaaatctttttctaaaatgaattgcagtgctggtaatccgaatgtggtg
gtaacagtaagatttcgggaaaaagcacacaaacgtgggaaaaggcctctttctgaatgc
caagaaggaaaagttatatatacagcttttaccctacgaaaggaaaacctggaaatgttt
gaagcgattggttttttagctatcaaacttggtgttattccttcggattttagttatgca
ggccttaaagacaagaaagccatcacctatcaagcaatggttgttagaaaagtgactcca
gagaggttgaaaaatattgaaaaagaaattgaaaagaaaagaatgaatgtctttaatatt
cggtctgtagatgattccctgagacttggtcagctcaaaggaaatcactttgatattgtc
attagaaatttaaaaaaacaaataaatgattctgcaaacctgagggagagaattatggaa
gcaatagaaaatgttaagaaaaaaggctttgtgaattactatggaccacagagatttggg
aagggaaggaaagttcacacagaccaaattggactagctttgctgaagaatgaaatggat
gctaaaggcacactttcattgatgcctgaattcaaagtgcgtgagagagcattgttggag
gcattgcaccgctttggcatgaccgaggaaggttgtatccaggcatggttctctttaccc
cattccatgcgcatattctatgttcacgcatataccagcaaaatttggaatgaggcagta
tcttacagacttgaaacctatggagcaagagtagtgcagggtgatttggtctgtttggat
gaagacattgatgacgagaatttcccaaatagtaaagtggttcttccagtacttggatac
aatattcagtacccgaagaacaaagtagggcagtggtaccatgacatacttagcagagat
ggactacagacatgtaggtttaaagtacctactctgaaactgaatataccaggttgctat
agacagattttgaaacatccctgtaatctctcataccaactaatggaagatcatgacatt
gatgtcaaaacgaaaggttcccacattgatgaaacagctttgtctcttttgatctctttt
gatcttgatgcttcatgctatgctaccgtttgtctgaaggaaataatgaagcatgacgtt
taa

>gi568815586r:43630379_43855245|GENSCAN_predicted_peptide_5|516_aa
MNKPITPSTYVRCLNVGLIRKLSDFIDPQEGWKKLAVAIKKPSGDDRYNQFHIRRFEALL
QTGKSPTSELLFDWGTTNCTVGDLVDLLIQNEFFAPASLLLPDAVPKTANTLPSKEAITV
QQKQMPFCDKDRTLMTPVQNLEQSYMPPDSSSPENKSLEVSDTRFHSFSFYELKNVTNNF
DERPISVGGNKMGEGGFGVVYKGYVNNTTVAVKKLAAMVDITTEELKQQFDQEIKVMANA
NILLDEAFTAKISDFGLARASEKFAQTVMTSRIVGTTAYMAPEALRGEITPKSDIYSFGV
HQSQIPQVKSSVPKTAIPPDTSCKFRPPELLTNCLQVGVPMTSSVGIHSEKCVVRQFHCV
NLIECTYTNLDDMTYDIPGLYGIAPRLQTYAIQHNTLLNTVSNCNTLLDIKEEIEDEEKT
IEDYIDKKMNDADSTSVEAMYSVASQCLHEKKNKRPDIKKNQLNEDPYWKAFDAQEWVST
AILCGRTHGFRRARDFASASEFSCGQKENDGGQILW

>gi568815586r:43630379_43855245|GENSCAN_predicted_CDS_5|1551_bp
atgaacaaacccataacaccatcaacatatgtgcgctgcctcaatgttggactaattagg
aagctgtcagattttattgatcctcaagaaggatggaagaagttagctgtagctattaaa
aaaccatctggtgatgatagatacaatcagtttcacataaggagatttgaagcattactt
caaactggaaaaagtcccacttctgaattactgtttgactggggcaccacaaattgcaca
gttggtgatcttgtggatcttttgatccaaaatgaattttttgctcctgcgagtcttttg
ctcccagatgctgttcccaaaactgctaatacactaccttctaaagaagctataacagtt
cagcaaaaacagatgcctttctgtgacaaagacaggacattgatgacacctgtgcagaat
cttgaacaaagctatatgccacctgactcctcaagtccagaaaataaaagtttagaagtt
agtgatacacgttttcacagtttttcattttatgaattgaagaatgtcacaaataacttt
gatgaacgacccatttctgttggtggtaataaaatgggagagggaggatttggagttgta
tataaaggctacgtaaataacacaactgtggcagtgaagaagcttgcagcaatggttgac
attactactgaagaactgaaacagcagtttgatcaagaaataaaagtaatggcaaatgca
aatatcttactggatgaagcttttactgctaaaatatctgactttggccttgcacgggct
tctgagaagtttgcccagacagtcatgactagcagaattgtgggaacaacagcttatatg
gcaccagaagctttgcgtggagaaataacacccaaatctgatatttacagctttggtgtg
catcagagtcagatcccacaggttaagagctcagtccccaagactgccatcccaccagac
accagttgtaagtttaggcctccagaacttctgaccaactgccttcaagttggagttcct
atgacctcctctgtgggaatacattccgagaaatgtgttgttaggcagtttcattgtgtg
aacctcatagagtgtacttacacaaacctagatgatatgacctacgacatacctgggcta
tatggtattgctcctaggctacaaacgtatgctatacagcataatactctcctgaatact
gtaagcaattgtaacacactgctagatattaaagaagaaattgaagatgaagaaaagaca
attgaagattatattgataaaaagatgaatgatgctgattccacttcagttgaagctatg
tactctgttgctagtcaatgtctgcatgaaaagaaaaataagagaccagacattaagaag
aatcagctgaatgaagatccctactggaaggcatttgatgctcaggaatgggtcagtact
gccattttgtgtggtagaacccatggctttcgcagggcaagagattttgccagtgctagt
gagttttcctgtggccagaaagaaaatgatggaggccagatcctctggtga

>gi568815586r:43630379_43855245|GENSCAN_predicted_peptide_6|366_aa
MTRRGRRRSSHFLGPPAGAAGCTQRRSRELAAAAMSHQTGIQARSLLPSRCRLIRNSGER
RKLAFEAAWLKGACLPPSPTVAQLSPFWSRREELGQWRRLGLQYAVENQASFLFHASEDV
KEIFARARNGKYRLLKISIENEQLVIGSYSQPSDSWDKDYDSFVLPLLEDKQPCYILFRL
DSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDEVFGTVKEDVSLHG
YKKYLLSQSSPAPLTAAEEELRQIKINEVQTDVGVDTKHQTLQGVAFPISREAFQALEKL
NNRQLNYVQLIEIDNGDELTADFLYEEVHPKQHAHKQSFAKPKGPAGKRGIRRLIRGPAE
TEATTD

>gi568815586r:43630379_43855245|GENSCAN_predicted_CDS_6|1101_bp
atgacgcgccggggccggcggaggagcagccacttcctggggccgccggccggggccgct
ggctgcactcagcgccggagccgggagctagcggccgccgccatgtcccaccagaccggc
atccaagcccggtccctgctgccgtcacgttgcagattgatccgaaattcgggggagcga
aggaaattggctttcgaggcggcctggttgaagggggcgtgccttccaccgtccccgaca
gtcgcccaactttcccccttttggtcacgtagggaagagctgggacagtggaggcgactg
gggcttcagtacgctgtcgagaaccaggcgagttttctttttcatgcaagtgaagatgtt
aaagagatctttgccagagccagaaatggaaagtacagacttctgaaaatatctattgaa
aatgagcaacttgtgattggatcatatagtcagccttcagattcctgggataaggattat
gattcctttgttttacccctgttggaggacaaacaaccatgctatatattattcaggtta
gattctcagaatgcccagggatatgaatggatattcattgcatggtctccagatcattct
catgttcgtcaaaaaatgttgtatgcagcaacaagagcaactctgaagaaggaatttgga
ggtggccacattaaagatgaagtatttggaacagtaaaggaagatgtatcattacatgga
tataaaaaatacttgctgtcacaatcttcccctgccccactgactgcagctgaggaagaa
ctacgacagattaaaatcaatgaggtacagactgacgtgggtgtggacactaagcatcaa
acactacaaggagtagcatttcccatttctcgagaagcctttcaggctttggaaaaattg
aataatagacagctcaactatgtgcagttgatcgagatagacaatggggatgagttgact
gcagacttcctttatgaagaagtacatcccaagcagcatgcacacaagcaaagttttgca
aaaccaaaaggtcctgcaggaaaaagaggaattcgaagactaattaggggcccagcggaa
actgaagctactactgattaa

>gi568815586r:43630379_43855245|GENSCAN_predicted_peptide_7|53_aa
XAAAAGEGRTQEPGGGWGAADRKSTILQGTNEGPTGMSSKSYKLSKAISGRKT

>gi568815586r:43630379_43855245|GENSCAN_predicted_CDS_7|162_bp
nnagcggcagctgcaggcgagggacggactcaggaacctggtgggggttggggggcggct
gacagaaaatccaccatcctgcagggaaccaatgagggtcccactggaatgtcctcgaag
agctacaaactttcgaaagcaatctcaggtcgcaaaacatga