GENSCAN 1.0    Date run: 5-Nov-116    Time: 06:09:52

Sequence gi568815580f:54777586_54989410 : 211825 bp : 38.35% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1947   2044   98  2  2   70   68   105 0.224   6.53
 1.02 Intr +   3015   3150  136  0  1   70   59    39 0.204  -1.15
 1.03 Term +  14089  14439  351  0  0   31   34   303 0.350  12.70
 1.04 PlyA +  14874  14879    6                               1.05

 2.04 PlyA -  14901  14896    6                               1.05
 2.03 Term -  15321  15191  131  1  2   61   41    94 0.002  -0.64
 2.02 Intr -  29999  29853  147  0  0   71  116    78 0.725   8.19
 2.01 Init -  33988  33961   28  1  1   74   94    21 0.299   1.13
 2.00 Prom -  42519  42480   40                              -4.05

 3.05 PlyA -  43279  43274    6                               1.05
 3.04 Term -  46978  46914   65  2  2  110   48    89 0.863   4.17
 3.03 Intr -  51170  50842  329  1  2   13   82   261 0.181  12.42
 3.02 Intr -  51360  51245  116  0  2   51   50   134 0.280   4.23
 3.01 Init -  68080  67979  102  1  0   50  100    63 0.697   3.99
 3.00 Prom -  69586  69547   40                              -4.05

 4.05 PlyA -  70828  70823    6                               1.05
 4.04 Term -  77623  77446  178  0  1   57   37   140 0.310   1.98
 4.03 Intr -  89060  89029   32  2  2  140   98    19 0.351   4.11
 4.02 Intr -  89325  89173  153  0  0   93   61   129 0.750  10.05
 4.01 Init -  98498  98493    6  2  0   84   87     0 0.334   0.44
 4.00 Prom -  99690  99651   40                              -7.75

 5.00 Prom +  99917  99956   40                              -8.05
 5.01 Init + 100001 100153  153  1  0   68   89   108 0.888   8.96
 5.02 Intr + 101784 101869   86  2  2   82   32    88 0.068   0.30
 5.03 Intr + 106748 106851  104  2  2   71   80    20 0.021  -1.60
 5.04 Intr + 110410 110533  124  2  1   73  106   157 0.999  14.72
 5.05 Term + 111639 111828  190  0  1   60   38   256 0.989  13.74
 5.06 PlyA + 112372 112377    6                               1.05

 6.00 Prom + 117230 117269   40                              -4.25
 6.01 Init + 127064 127139   76  1  1   63   70   116 0.839   8.60
 6.02 Term + 128959 129056   98  2  2  101   39    33 0.530  -3.15
 6.03 PlyA + 129899 129904    6                               1.05

 7.14 PlyA - 130005 130000    6                               1.05
 7.13 Term - 132332 132096  237  2  0   76   43   245 0.351  13.98
 7.12 Intr - 139226 139086  141  2  0  102   53    31 0.157   0.63
 7.11 Intr - 140411 140328   84  2  0   77  110    18 0.225   2.00
 7.10 Intr - 148223 148115  109  1  1   78   49    59 0.195   0.37
 7.09 Intr - 156976 156948   29  1  2  110   94    13 0.672   0.10
 7.08 Intr - 157363 157235  129  1  0   55  107   145 0.985  13.07
 7.07 Intr - 159373 159248  126  1  0   52   81   144 0.996  10.06
 7.06 Intr - 160512 160372  141  0  0   82   86   119 0.983  10.73
 7.05 Intr - 163498 163412   87  1  0   90   85    22 0.712   1.35
 7.04 Intr - 165218 165090  129  2  0   37   95   115 0.070   7.07
 7.03 Intr - 172159 172008  152  2  2   71   86    80 0.079   5.06
 7.02 Intr - 174968 174873   96  0  0   37   60   108 0.135   1.96
 7.01 Init - 175258 175210   49  1  1   86   58    47 0.163   0.70
 7.00 Prom - 179450 179411   40                              -6.45

 8.00 Prom + 180082 180121   40                              -3.95
 8.01 Init + 181626 182012  387  2  0   46   39   276 0.047  15.36
 8.02 Intr + 197742 197832   91  2  1  120   67    65 0.044   6.25
 8.03 Intr + 197971 198067   97  2  1   73   69    58 0.027   0.55
 8.04 Term + 205626 205755  130  0  1   30   49   125 0.013  -0.53
 8.05 PlyA + 206545 206550    6                               1.05

 9.02 PlyA - 206817 206812    6                               1.05
 9.01 Term - 207820 207616  205  0  1   55   42   144 0.192   2.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 101784 101873   90  2  0   82   38   101 0.880   1.24
S.002 Init - 165206 165090  117  2  0   63   95   120 0.900  10.45

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_1|194_aa
MTKLKCSTAREESTQQRTKVQDVKAIPSVGKNSKDTIRIGLEPTHMTSYNYLFKGPVSKY
GHILRYWQLGFQHTDLGETTGPAHPGPIKTGDPSRPTHRRPDVKRSISADKDTSSWTERG
HLERTLAEEHTGTCWQASRPSTRGTRWSLAGQSEESRGRQAAQLQGKTISLLALHGGTAP
LIKLCTQSPSPGVI

>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_1|585_bp
atgaccaaactcaagtgcagcacagccagggaggaatctacacagcaacgtacaaaggtc
caggatgtgaaagctattccttcagttggaaaaaatagtaaggacaccattcgtattgga
ttagagccaacccacatgacctcatataactacctctttaaaggtcctgtctccaaatat
ggtcatattctgagatattggcagttaggatttcaacatacagatttgggggagaccact
gggcctgcccatcctgggcctataaaaaccggagaccctagcaggccgactcacaggcgg
ccagacgtcaagaggagcatatcagcagacaaagacacaagcagctggacagagagagga
catctagagcgcacgctggctgaagagcacactggcacatgctggcaagccagcaggcca
tcaacccgcgggacgaggtggagtttggcagggcagtcggaggagagtaggggccgccaa
gctgcccaactccaggggaaaaccatctcccttctggctctccatggtgggacagctcct
ttaataaaactttgcactcaatctccaagcccaggtgtgatctga

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_2|101_aa
MQIHRARLKPGNQMIRILIELRNELVVSKAGDLVMSQVIMFNPVKLQVPEEIMSLKQEVK
TEERLYYAIDTGDNFANLLYLSIFYFMSTTGTLEYLFVRNL

>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_2|306_bp
atgcagattcaccgagccagactaaagcctggcaaccagatgatacgcatattgatagag
ctaagaaatgagctggttgtgtcaaaggctggagacttagtgatgagtcaagttatcatg
ttcaatcctgtaaagctccaagttcccgaggagattatgtccttaaaacaagaagtcaaa
actgaggaacgactgtattatgctattgacactggtgacaattttgcaaatctcttatac
ctgagtattttctattttatgagtactaccggaactttagaatacctttttgttagaaat
ttgtaa

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_3|203_aa
MQKARAKTRQTYSRGQSGGPTSDSGPWEQTRKNKLFDKQTLKESQVELAHAIRAAHSTTV
FFARISAGNRDRSTRVGGNAFQQQQPAFPTHQPAKFPCRESRCQISERFPSPSQALELPG
STVSKETGSAPQGDEHGDSYLSPPTCGSGWVQGRAVRSPCPSGLRVAGLVAPPSESAAAA
GLVRRCEEPTKSSKPLEDMKAPR
>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_3|612_bp
atgcaaaaggctagagcaaagacaagacaaacatattccagggggcagtcagggggtcca
accagcgactcagggccctgggaacagactaggaaaaacaagctgtttgataagcaaaca
ctcaaggagagccaggtcgagttggcccatgccatccgtgcagcacacagcactactgtc
ttctttgcaaggatttctgctggaaaccgcgatcgaagcacccgggtgggtgggaacgcg
ttccagcagcagcagcccgccttcccaactcaccagcctgcgaagtttccttgccgggag
agcaggtgtcagatttcagagaggtttccttccccttcccaagccctggaactcccgggc
tccaccgtcagcaaggagacaggatctgctccacagggcgatgagcacggggacagctac
ttatctcctccaacctgtggctccggatgggtccaaggccgggcggtccgcagcccctgc
ccgtcaggactgcgagtggcagggctggtggcaccaccctcggagtccgccgcggcggcg
gggctggtgcgcagatgcgaagaaccaacgaagagctctaagcccctagaagacatgaaa
gcccctagatga

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_4|122_aa
MIDPYIKTFELVSRYGGALNWCLPFKIRGAAPHRVFSRTPRRKNGLAPSARMMVGQLGIW
LSARSAAALDSHRSMNSIVNCTCKGSGLHASYENLMPDDLSLSPITPRWDSPVAGKQAQG
SH

>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_4|369_bp
atgattgatccgtacataaaaacctttgaacttgtttcccgttatggtggtgccttgaat
tggtgtcttccatttaaaatacgaggagctgccccacatcgggttttctccaggacaccg
aggagaaaaaatggtttggctcccagtgccagaatgatggtaggacaactaggcatctgg
ctgtctgccagatcagcagcagcattagattctcacaggagcatgaattctattgtgaac
tgcacatgcaagggatctgggttgcacgcttcttatgagaatctaatgcctgatgatctg
tcactgtctcccatcacccccagatgggacagtccagttgcgggaaaacaagctcagggc
tcccactga

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_5|218_aa
MTDGDYDYLIKLLALGDSGVGKTTFLYRYTDNKFNPKFITTVGIDFREKRVVYNAQGPNG
SSGKAFKVHLQLWDTAGQERFRSLTTAFFRDAMGFLLMFDLTSQQSFLNVRNWMSQLQAN
AYCENPDIVLIGNKADLPDQREVNERQARELADKYGIPYFETSAATGQNVEKAVETLLDL
IMKRMEQCVEKTQIPDTVNGGNSGNLDGEKPPEKKCIC

>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_5|657_bp
atgaccgatggagactatgattatctgatcaaactcctggccctcggggattcaggggtg
gggaagacaacatttctttatagatacacagataataaattcaatcccaaattcatcact
acagtaggaatagactttcgggaaaaacgtgtggtttataatgcacaaggaccgaatgga
tcttcagggaaagcatttaaagtgcatcttcagctttgggacactgcgggacaagagcgg
ttccggagtctcaccactgcatttttcagagacgccatgggcttcttattaatgtttgac
ctcaccagtcaacagagcttcttaaatgtcagaaactggatgagccaactgcaagcaaat
gcttattgtgaaaatccagatatagtattaattggcaacaaggcagacctaccagatcag
agggaagtcaatgaacggcaagctcgggaactggctgacaaatatggcataccatatttt
gaaacaagtgcagcaactggacagaatgtggagaaagctgtagaaacccttttggactta
atcatgaagcgaatggaacagtgtgtggagaagacacaaatccctgatactgtcaatggt
ggaaattctggaaacttggatggggaaaagccaccagagaagaaatgtatctgctag

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_6|57_aa
MREHEGSRREEEPVSAIPPGEDIPQRSKQLGLAIVLHHPNGNGKDGREQNISHPSFP

>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_6|174_bp
atgagggagcatgagggaagccggagagaggaagagcctgtatctgccattccacctggt
gaagacattccccagaggtccaagcagctgggactggccatagtgcttcaccatcccaat
gggaatggtaaagatggaagggagcaaaatattagccatccttccttcccttag

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_7|502_aa
MGFRHVGQSALDLLTSGNCQAVVRSSSTFSIPISNRQEFQLFHILTRTAPGSEHEQVRDL
SCKTHTDGASAYLGRRFKHKLAKTCQTSSMVLFNQQPPDPEFTMTTVTVTTEIPPRDKME
DNSALYESTSAHIIEETEYVKKIRTTLQKIRTQMFKDEIRHDSTNHKLDAKHCGNLQQGS
DSEMDPSCCSLDLLMKKIKGKDLQLLEMNKENEVLKIKLQASREAGAAALRNVAQRLFEN
YQTQSEEVRKKQEDSKQLLQVNKLEKEQKLKQHVENLNQVAEKLEEKHSQITELENLVQR
MEKEICDLERGTSYPLFHWGPIVSDVYTHGSPEKHWILNRRQYESPQMVMEGLKNNLKEQ
DKRIENLREKVNILEAQVRIITIQGEIWVRTQSQTISTPLLALSCCPLLAMPLPCRKCNE
APAKAKRRKEELWPFGQPRPGSSLSQGCDSLFGALRFLASPSFWVPPCSPVPAVEAACGA
PGLAAASQSQHPCQYPALCAPL

>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_7|1509_bp
atggggtttcgccatgttggccagtctgctctcgatctcctgacctcaggaaactgccaa
gctgttgtccgcagcagctccacattttccattcccatcagcaacagacaggagttccaa
ctgttccacatcctcaccagaactgctccagggagtgagcatgaacaagtaagagacctt
tcctgcaagactcatacagatggtgcctcagcatatctgggaaggagattcaagcacaag
ctagccaaaacctgccagacatccagcatggtgcttttcaatcagcagcccccagaccca
gaattcacaatgacaacagtgacagtgaccacagaaattcccccaagggataagatggaa
gataattctgccttgtatgagtctacgtccgctcacattattgaagaaaccgagtatgtg
aaaaagattcgaactactctgcaaaagatcaggacccagatgtttaaagatgaaataaga
catgacagtacaaatcacaaactagatgcaaagcactgtggaaaccttcaacagggctct
gattctgaaatggatccttcttgttgcagtttggatttgcttatgaaaaagataaaagga
aaagacctacagctcttagaaatgaacaaagagaatgaagtattgaaaatcaagctgcaa
gcctccagagaagcaggagcagcagctctgagaaacgtggcccagagattatttgaaaac
taccaaacgcaatctgaagaagtgagaaagaagcaggaggacagtaaacaattactccag
gttaacaagcttgaaaaagaacagaaattgaaacaacatgttgaaaatctgaatcaagtt
gctgaaaaacttgaagaaaaacacagtcaaattacagaattggagaaccttgtacagaga
atggaaaaggaaatatgtgacttagaaaggggaaccagttatcctttattccactggggg
cccatagtcagtgatgtttacactcatggaagtccagagaaacattggatcctaaacaga
aggcagtatgagtccccacagatggtaatggaaggattaaaaaataatttaaaagaacaa
gacaaaagaattgaaaatctcagagaaaaggttaacatacttgaagcccaggtgaggatt
attacaattcaaggtgagatttgggtgaggacacagagccaaaccatatcaacccccttg
ctagcactcagctgctgtcctctgctggcgatgcccctgccttgcaggaagtgcaatgaa
gcccctgccaaggcaaagagaaggaaagaagagctgtggccctttgggcagcccagacct
gggagctccctgagccagggctgtgactctctctttggggccttgcggttcctggcatct
ccaagcttctgggtgccaccgtgttccccggtgccagctgtggaagctgcttgtggtgca
cctggtctagccgcagcctcgcagagccagcacccatgccagtacccggcgctgtgtgcc
ccactgtag

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_8|234_aa
MSLMPGTPLGKPLERHLDTHTHGGDAGARRRDRARRWPDSPTLWGAQPTRGPGAAPPLGD
RTLGPGSAARALRVQPPPPAGLAPPRGARPGQSPAPVRPRGGVAAGRVRPALESQALPLT
PLARVLYRRGPSSFALALGSAEGKRRHSIPKSSLDFWSCRKQGTFICCAELRPQRAQVTA
VLLSLHLELLGRIPVRNLDSADSKYDNMLPPSGHFIKRLDIQVPETVYVFRILE

>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_8|705_bp
atgtccttgatgccaggaactccactggggaagccgctggaaaggcacctggacacccac
acacatggaggggatgcgggggcgcgccgcagggaccgggcccgaaggtggccagactcg
cctaccttgtggggcgcccagccgacgcggggtcccggtgctgctcctcccctgggcgat
cggaccttggggccagggtcggctgccagggccctgcgagtgcagccgccaccgcccgca
ggcttggctccgccccgcggggcgcgcccagggcagagtccggcgccggtgaggccccgg
ggaggagtcgcggcaggacgcgtacgcccagcgctggagagccaggctctgcccctcacg
cccctcgctagggtgctgtacaggcggggtccttctagctttgctctagctcttggttca
gcagaaggaaaaagaagacacagtatccccaagtcctccctggacttctggtcttgtagg
aagcagggcaccttcatttgttgtgctgagctcaggcctcagagggcacaagtgaccgcc
gtattgctcagtcttcatcttgagctccttggcaggatccctgttagaaatttagacagt
gcggacagtaaatatgacaacatgttgccacctagtggacatttcataaaacgtctagac
atccaggttcctgaaacagtgtacgtatttaggatcctggagtag

>gi568815580f:54777586_54989410|GENSCAN_predicted_peptide_9|68_aa
XNAQASVDQEQGGDRRGRVIRKVREVAGSKLSKAMKAIMGTSSLEKKWEPLEGFEHESRH
TLIAESHS

>gi568815580f:54777586_54989410|GENSCAN_predicted_CDS_9|207_bp
nnaaatgcacaggccagtgtggaccaagaacaaggtggggacaggcgtggaagagtaata
agaaaagtcagagaggtagcggggtcaaaactgtctaaagccatgaaggccattatgggt
acttcgtccttggagaaaaaatgggagcccttggagggctttgagcatgaaagccgccac
actctgatagctgagtcacattcttaa