GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:33:45 Sequence gi568815586f:43668112_43886559 : 218448 bp : 38.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 532 527 6 1.05 1.02 Term - 7806 7320 487 2 1 9 48 268 0.024 8.09 1.01 Init - 18039 17705 335 0 2 56 54 164 0.055 6.61 1.00 Prom - 30848 30809 40 -6.05 2.04 PlyA - 30897 30892 6 1.05 2.03 Term - 31193 31082 112 1 1 28 49 196 0.569 6.75 2.02 Intr - 42277 42192 86 1 2 99 44 43 0.232 -1.30 2.01 Init - 43142 43077 66 2 0 68 108 27 0.483 3.82 2.00 Prom - 50059 50020 40 -3.85 3.08 PlyA - 50900 50895 6 1.05 3.07 Term - 62591 62265 327 2 0 132 41 160 0.966 9.42 3.06 Intr - 68548 68270 279 1 0 27 106 151 0.479 7.55 3.05 Intr - 74444 74346 99 2 0 56 56 98 0.812 2.79 3.04 Intr - 78127 77935 193 0 1 75 116 113 0.999 11.37 3.03 Intr - 80498 80339 160 0 1 82 75 163 0.999 12.52 3.02 Intr - 87150 86225 926 2 2 149 91 565 0.984 52.84 3.01 Init - 91495 91197 299 1 2 75 78 141 0.178 8.70 3.00 Prom - 95053 95014 40 -5.85 4.00 Prom + 97357 97396 40 -6.35 4.01 Init + 100001 100161 161 1 2 57 87 78 0.819 3.95 4.02 Intr + 103109 103254 146 2 2 57 97 85 0.994 5.31 4.03 Intr + 104069 104251 183 0 0 44 113 122 0.965 9.14 4.04 Intr + 104801 104961 161 0 2 68 92 186 0.988 15.79 4.05 Intr + 105854 105918 65 1 2 68 106 99 0.484 6.30 4.06 Intr + 114196 114379 184 1 1 54 103 222 0.998 19.17 4.07 Intr + 116275 116410 136 0 1 52 47 88 0.629 0.32 4.08 Intr + 117576 117760 185 1 2 34 119 52 0.477 1.49 4.09 Intr + 118288 118446 159 0 0 13 91 194 0.953 11.46 4.10 Term + 123523 123693 171 0 0 -14 43 141 0.138 -3.76 4.11 PlyA + 123763 123768 6 1.05 5.09 PlyA - 124425 124420 6 1.05 5.08 Term - 127644 127474 171 0 0 88 39 180 0.954 9.94 5.07 Intr - 129722 129597 126 2 0 63 91 147 0.986 12.46 5.06 Intr - 131391 131287 105 0 0 87 107 44 0.978 5.69 5.05 Intr - 132419 132324 96 2 0 21 62 109 0.090 0.89 5.04 Intr - 134353 134175 179 2 2 27 103 77 0.106 1.82 5.03 Intr - 136461 136384 78 1 0 89 49 65 0.095 1.30 5.02 Intr - 137919 137701 219 1 0 74 -8 171 0.117 3.45 5.01 Init - 138236 138110 127 2 1 53 87 237 0.508 18.47 5.00 Prom - 162563 162524 40 -1.05 6.00 Prom + 174584 174623 40 -8.05 6.01 Init + 176541 176817 277 2 1 54 111 114 0.377 7.50 6.02 Intr + 188802 188926 125 1 2 29 116 22 0.310 -1.52 6.03 Term + 195776 196288 513 1 0 69 37 271 0.610 13.46 6.04 PlyA + 196656 196661 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 18039 17665 375 0 0 56 50 172 0.852 6.09 S.002 Init - 132407 132324 84 2 0 42 62 112 0.844 4.87 S.003 Term - 156379 156242 138 1 0 96 50 103 0.929 4.28 S.004 Init - 158079 158002 78 0 0 91 51 56 0.898 1.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:43668112_43886559|GENSCAN_predicted_peptide_1|273_aa MILNEHAAFKHLFNKAHLAPPLIHLTLSGYSTRFREHRVGGKVTDQQDPKAEEFILVQNK MKSLPCLLLSTQTRQPSDFSIFSPPFPAFYSTKPPLSSPPVLNELLGTPPRRGSTGVSLA MGGVCGSQGSLPTLLPRQLEAAKLDTAGSGWFRLSRAGTGADLGEVLGQLSGHWDNRPGK GRGISPVPQSQLRGRGVAWGFQRSRELWGLLSSHTLIWWVFLLSVSGSGNRPNQLGYFQV IYAQITEPFQAFQSIKFPVAESKAIRPDTFQVL >gi568815586f:43668112_43886559|GENSCAN_predicted_CDS_1|822_bp atgattcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaaccctgagtggatacagcacacgtttcagagagcacagggttggg ggtaaggtcaccgatcaacaggatcccaaggcagaagaatttatcttagtacagaacaaa atgaaaagtctcccatgtctactactttctacacagacacggcaaccatccgatttctca atcttttccccacctttccccgctttctattccacaaaaccgccattgtcatccccgccc gttctcaatgagctgttgggtacacctcccagacggggaagtactggggtgtccctggcg atgggtggggtgtgtggctctcaggggtcactccccacactgctgccaaggcagctagag gcagcaaagctggacactgctgggtcaggctggttcaggctctctagggcaggtacaggt gctgaccttggtgaagttctggggcagctctcaggccactgggacaaccgtccagggaag ggcagaggcatctctcctgtgccacagagccagctcagaggaagaggggtggcctggggt tttcagcgcagtagggagctgtggggcctgctcagctcccacaccctgatctggtgggtc ttcctcctctcagtgtctggctctggcaacaggcccaaccagttaggctacttccaagtc atctatgcccagatcactgagccattccaggcattccagagtataaaattccctgtggca gaaagtaaggctatcaggccagacacattccaggtcctgtga >gi568815586f:43668112_43886559|GENSCAN_predicted_peptide_2|87_aa MNKTENDPSSEDGGRELVNNKQEKRGKSGEDYVLHVGYQLSYSRIGHQPESFRTNAPKDM ALMSGGMPGSSSRVELEKTTQAQEEQF >gi568815586f:43668112_43886559|GENSCAN_predicted_CDS_2|264_bp atgaacaagacagagaatgatccttctagtgaagatggtgggagggagctggtaaacaat aaacaagagaagagagggaagagtggggaagactatgtcttgcatgttggataccagctc agctacagcaggatagggcaccagccagagtcattccgaaccaatgcaccaaaagatatg gctctgatgagtggaggaatgccaggatcttcatctcgagttgaattagaaaaaacgaca caggcacaggaggagcagttttaa >gi568815586f:43668112_43886559|GENSCAN_predicted_peptide_3|760_aa MSHRTWTGFAFIPLSPSGFQADRYTFFPHLRMYSSDPISRLHFLIPWKLGGRRNKRITIV WRLSELLAPQASVRCWPDFLVLGKRDFISQSGSRNPGPERHCYRRMEEDTDYRIRFSSLC FFNDHVGFHGTIKSSPSDFIVIEIDEQGQLVNKTIDEPIFKISEIQLEPNNFPKKPKLDL QNLSLEDGRNQEVHTLIKYTDGDQNHQSGSEKEDTIVDGTSKCEEKADVLSSFLDEKTHE LLNNFACDVREKWLSKTELIGLPPEFSIGRILDKNQRASLHSAIRQKFPFLVTVGKNSEI VVKPNLEYKELCHLVSEEEAFDFFKYLDAKKENSKFTFKPDTNKDHRKAVHHFVNKKFGN LVETKSFSKMNCSAGNPNVVVTVRFREKAHKRGKRPLSECQEGKVIYTAFTLRKENLEMF EAIGFLAIKLGVIPSDFSYAGLKDKKAITYQAMVVRKVTPERLKNIEKEIEKKRMNVFNI RSVDDSLRLGQLKGNHFDIVIRNLKKQINDSANLRERIMEAIENVKKKGFVNYYGPQRFG KGRKVHTDQIGLALLKNEMDAKGTLSLMPEFKVRERALLEALHRFGMTEEGCIQAWFSLP HSMRIFYVHAYTSKIWNEAVSYRLETYGARVVQGDLVCLDEDIDDENFPNSKVVLPVLGY NIQYPKNKVGQWYHDILSRDGLQTCRFKVPTLKLNIPGCYRQILKHPCNLSYQLMEDHDI DVKTKGSHIDETALSLLISFDLDASCYATVCLKEIMKHDV >gi568815586f:43668112_43886559|GENSCAN_predicted_CDS_3|2283_bp atgagccaccgcacctggacagggtttgcttttatccctttgtctccctctggcttccaa gctgacaggtacacttttttcccccatctgcgaatgtactcatctgatcccataagccgc ctccactttctaattccctggaagcttggaggccggagaaacaagcgaattacaatcgtg tggaggctcagcgagctgctggcgccacaggcatctgtcaggtgctggcctgacttcctg gttctggggaaaagagacttcatttcgcagtcgggttcacggaatccagggcctgagagg cactgttatagaagaatggaagaagatacagattatagaatcaggtttagttctttgtgt ttctttaatgatcacgttggatttcatggcactataaaaagctcaccaagtgactttatt gttattgaaattgatgaacagggacagttagttaataagaccatcgatgagcctattttc aagattagtgaaatacaacttgagccaaataattttcccaaaaaaccaaaactagatctt caaaatctgtccttagaagatggaagaaaccaagaagttcatactttgattaagtacact gatggtgaccaaaatcatcagtctggttcagaaaaggaagatactatcgttgatggaact tccaaatgtgaagaaaaagctgatgttttaagctcctttttggatgaaaaaactcatgag ttactgaataattttgcctgtgatgtaagagagaagtggctttctaaaacagagctaatt ggactacctcctgaattctcaataggcagaatccttgacaaaaaccagagggctagttta cacagtgccattaggcagaaatttccatttttagtaactgtaggaaaaaacagtgaaatt gttgtaaaaccaaatcttgaatataaagaactttgtcatttggtatctgaagaggaagca tttgacttttttaaatatttggatgcaaagaaagaaaattccaaatttacctttaaacct gatacaaacaaagaccacagaaaagctgtccaccattttgtcaacaaaaagtttggaaac cttgtggaaaccaaatctttttctaaaatgaattgcagtgctggtaatccgaatgtggtg gtaacagtaagatttcgggaaaaagcacacaaacgtgggaaaaggcctctttctgaatgc caagaaggaaaagttatatatacagcttttaccctacgaaaggaaaacctggaaatgttt gaagcgattggttttttagctatcaaacttggtgttattccttcggattttagttatgca ggccttaaagacaagaaagccatcacctatcaagcaatggttgttagaaaagtgactcca gagaggttgaaaaatattgaaaaagaaattgaaaagaaaagaatgaatgtctttaatatt cggtctgtagatgattccctgagacttggtcagctcaaaggaaatcactttgatattgtc attagaaatttaaaaaaacaaataaatgattctgcaaacctgagggagagaattatggaa gcaatagaaaatgttaagaaaaaaggctttgtgaattactatggaccacagagatttggg aagggaaggaaagttcacacagaccaaattggactagctttgctgaagaatgaaatggat gctaaaggcacactttcattgatgcctgaattcaaagtgcgtgagagagcattgttggag gcattgcaccgctttggcatgaccgaggaaggttgtatccaggcatggttctctttaccc cattccatgcgcatattctatgttcacgcatataccagcaaaatttggaatgaggcagta tcttacagacttgaaacctatggagcaagagtagtgcagggtgatttggtctgtttggat gaagacattgatgacgagaatttcccaaatagtaaagtggttcttccagtacttggatac aatattcagtacccgaagaacaaagtagggcagtggtaccatgacatacttagcagagat ggactacagacatgtaggtttaaagtacctactctgaaactgaatataccaggttgctat agacagattttgaaacatccctgtaatctctcataccaactaatggaagatcatgacatt gatgtcaaaacgaaaggttcccacattgatgaaacagctttgtctcttttgatctctttt gatcttgatgcttcatgctatgctaccgtttgtctgaaggaaataatgaagcatgacgtt taa >gi568815586f:43668112_43886559|GENSCAN_predicted_peptide_4|516_aa MNKPITPSTYVRCLNVGLIRKLSDFIDPQEGWKKLAVAIKKPSGDDRYNQFHIRRFEALL QTGKSPTSELLFDWGTTNCTVGDLVDLLIQNEFFAPASLLLPDAVPKTANTLPSKEAITV QQKQMPFCDKDRTLMTPVQNLEQSYMPPDSSSPENKSLEVSDTRFHSFSFYELKNVTNNF DERPISVGGNKMGEGGFGVVYKGYVNNTTVAVKKLAAMVDITTEELKQQFDQEIKVMANA NILLDEAFTAKISDFGLARASEKFAQTVMTSRIVGTTAYMAPEALRGEITPKSDIYSFGV HQSQIPQVKSSVPKTAIPPDTSCKFRPPELLTNCLQVGVPMTSSVGIHSEKCVVRQFHCV NLIECTYTNLDDMTYDIPGLYGIAPRLQTYAIQHNTLLNTVSNCNTLLDIKEEIEDEEKT IEDYIDKKMNDADSTSVEAMYSVASQCLHEKKNKRPDIKKNQLNEDPYWKAFDAQEWVST AILCGRTHGFRRARDFASASEFSCGQKENDGGQILW >gi568815586f:43668112_43886559|GENSCAN_predicted_CDS_4|1551_bp atgaacaaacccataacaccatcaacatatgtgcgctgcctcaatgttggactaattagg aagctgtcagattttattgatcctcaagaaggatggaagaagttagctgtagctattaaa aaaccatctggtgatgatagatacaatcagtttcacataaggagatttgaagcattactt caaactggaaaaagtcccacttctgaattactgtttgactggggcaccacaaattgcaca gttggtgatcttgtggatcttttgatccaaaatgaattttttgctcctgcgagtcttttg ctcccagatgctgttcccaaaactgctaatacactaccttctaaagaagctataacagtt cagcaaaaacagatgcctttctgtgacaaagacaggacattgatgacacctgtgcagaat cttgaacaaagctatatgccacctgactcctcaagtccagaaaataaaagtttagaagtt agtgatacacgttttcacagtttttcattttatgaattgaagaatgtcacaaataacttt gatgaacgacccatttctgttggtggtaataaaatgggagagggaggatttggagttgta tataaaggctacgtaaataacacaactgtggcagtgaagaagcttgcagcaatggttgac attactactgaagaactgaaacagcagtttgatcaagaaataaaagtaatggcaaatgca aatatcttactggatgaagcttttactgctaaaatatctgactttggccttgcacgggct tctgagaagtttgcccagacagtcatgactagcagaattgtgggaacaacagcttatatg gcaccagaagctttgcgtggagaaataacacccaaatctgatatttacagctttggtgtg catcagagtcagatcccacaggttaagagctcagtccccaagactgccatcccaccagac accagttgtaagtttaggcctccagaacttctgaccaactgccttcaagttggagttcct atgacctcctctgtgggaatacattccgagaaatgtgttgttaggcagtttcattgtgtg aacctcatagagtgtacttacacaaacctagatgatatgacctacgacatacctgggcta tatggtattgctcctaggctacaaacgtatgctatacagcataatactctcctgaatact gtaagcaattgtaacacactgctagatattaaagaagaaattgaagatgaagaaaagaca attgaagattatattgataaaaagatgaatgatgctgattccacttcagttgaagctatg tactctgttgctagtcaatgtctgcatgaaaagaaaaataagagaccagacattaagaag aatcagctgaatgaagatccctactggaaggcatttgatgctcaggaatgggtcagtact gccattttgtgtggtagaacccatggctttcgcagggcaagagattttgccagtgctagt gagttttcctgtggccagaaagaaaatgatggaggccagatcctctggtga >gi568815586f:43668112_43886559|GENSCAN_predicted_peptide_5|366_aa MTRRGRRRSSHFLGPPAGAAGCTQRRSRELAAAAMSHQTGIQARSLLPSRCRLIRNSGER RKLAFEAAWLKGACLPPSPTVAQLSPFWSRREELGQWRRLGLQYAVENQASFLFHASEDV KEIFARARNGKYRLLKISIENEQLVIGSYSQPSDSWDKDYDSFVLPLLEDKQPCYILFRL DSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDEVFGTVKEDVSLHG YKKYLLSQSSPAPLTAAEEELRQIKINEVQTDVGVDTKHQTLQGVAFPISREAFQALEKL NNRQLNYVQLIEIDNGDELTADFLYEEVHPKQHAHKQSFAKPKGPAGKRGIRRLIRGPAE TEATTD >gi568815586f:43668112_43886559|GENSCAN_predicted_CDS_5|1101_bp atgacgcgccggggccggcggaggagcagccacttcctggggccgccggccggggccgct ggctgcactcagcgccggagccgggagctagcggccgccgccatgtcccaccagaccggc atccaagcccggtccctgctgccgtcacgttgcagattgatccgaaattcgggggagcga aggaaattggctttcgaggcggcctggttgaagggggcgtgccttccaccgtccccgaca gtcgcccaactttcccccttttggtcacgtagggaagagctgggacagtggaggcgactg gggcttcagtacgctgtcgagaaccaggcgagttttctttttcatgcaagtgaagatgtt aaagagatctttgccagagccagaaatggaaagtacagacttctgaaaatatctattgaa aatgagcaacttgtgattggatcatatagtcagccttcagattcctgggataaggattat gattcctttgttttacccctgttggaggacaaacaaccatgctatatattattcaggtta gattctcagaatgcccagggatatgaatggatattcattgcatggtctccagatcattct catgttcgtcaaaaaatgttgtatgcagcaacaagagcaactctgaagaaggaatttgga ggtggccacattaaagatgaagtatttggaacagtaaaggaagatgtatcattacatgga tataaaaaatacttgctgtcacaatcttcccctgccccactgactgcagctgaggaagaa ctacgacagattaaaatcaatgaggtacagactgacgtgggtgtggacactaagcatcaa acactacaaggagtagcatttcccatttctcgagaagcctttcaggctttggaaaaattg aataatagacagctcaactatgtgcagttgatcgagatagacaatggggatgagttgact gcagacttcctttatgaagaagtacatcccaagcagcatgcacacaagcaaagttttgca aaaccaaaaggtcctgcaggaaaaagaggaattcgaagactaattaggggcccagcggaa actgaagctactactgattaa >gi568815586f:43668112_43886559|GENSCAN_predicted_peptide_6|304_aa MGKDFRYYFQHPWSRMIVAYLVIFFNFLIFAEDPVSHSQTEANVIVVGNCFSFVTNKYPR GVGWRILKVLLWLLAILTGLIAGKFLFHQRLFGTPFFRGRFGVLELLNFVALFKKLILLL CVPVMFSLLEQESKGGVEGEAQVGTGAAPGACGPARVPGGHGLGGPTLGAAGQPALDSEG LSTRASSCGGCAGSPSSAGPPALRWISHGALAASLGGRAQDLQPAMPESPSLPPQWVPAR PEPPRPAPPPASWHPVPPTAQGLRSAGTLRRDWQAALSAAPVRDPLGEASWASESSGDLE NLYV >gi568815586f:43668112_43886559|GENSCAN_predicted_CDS_6|915_bp atgggtaaagactttcgttactatttccagcatccctggtctcgcatgattgtggcttac ttggtgatcttctttaacttcttaatatttgcggaggacccagtttctcatagccaaaca gaagccaatgttattgttgttggaaactgtttttcatttgttacaaataaataccctaga ggagttggctggaggattttgaaggtgcttctatggctacttgccattctcacaggacta atagctggcaaatttctgttccatcagcgtttgtttggaacacccttcttcagagggaga tttggtgtgttggagctactgaactttgtagcccttttcaaaaagctgatcttattactt tgtgttcctgttatgttctctctcctagagcaggaaagcaagggaggtgtggagggagag gcacaggtgggaaccggggctgcgcccggcgcttgcgggccagctagagttccgggtggg catgggcttggcggccccacactcggagctgccggccagccagccctggacagtgagggg cttagcacccgggccagcagctgtggagggtgcgctgggtcccccagcagtgctggccca ccagcgctgcgctggatttctcacggggctttagctgcctccctggggggcagggctcag gacctgcagcctgccatgcctgagtctccctcactccccccgcagtgggttcctgcgcgg cctgagcctccccgaccagcacctccccctgcttcatggcacccagtcccacccaccgcc caagggctgaggagtgcaggcacactgcgcagggactggcaggcagctctatctgcagcc ccagtgcgagatccactgggtgaagccagctgggcttctgagtctagtggggacttggag aacctttatgtctag