GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:20:27 Sequence gi568815578f:17500737_17707143 : 206407 bp : 46.92% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 8260 8153 108 1 0 105 101 153 0.990 18.78 1.04 Intr - 11332 11240 93 1 0 81 94 117 0.970 11.76 1.03 Intr - 14080 13985 96 1 0 67 84 86 0.217 6.31 1.02 Intr - 24172 24112 61 0 1 91 105 39 0.066 4.64 1.01 Init - 30593 30217 377 2 2 85 82 773 0.229 72.81 1.00 Prom - 47333 47294 40 -3.96 2.03 PlyA - 47549 47544 6 1.05 2.02 Term - 52365 52328 38 1 2 122 39 39 0.140 -0.10 2.01 Init - 69013 68914 100 1 1 100 44 133 0.682 8.52 2.00 Prom - 71642 71603 40 -6.16 3.00 Prom + 74949 74988 40 -1.26 3.01 Init + 75619 75672 54 0 0 56 65 59 0.532 1.69 3.02 Intr + 93402 93494 93 2 0 88 52 50 0.505 1.56 3.03 Intr + 100002 100309 308 2 2 83 91 168 0.774 11.85 3.04 Intr + 103819 103895 77 1 2 79 95 62 0.975 5.16 3.05 Term + 106301 106410 110 0 2 85 33 83 0.949 1.17 3.06 PlyA + 106469 106474 6 1.05 4.23 PlyA - 106593 106588 6 1.05 4.22 Term - 113484 113446 39 0 0 130 44 33 0.360 0.29 4.21 Intr - 114144 114001 144 0 0 44 35 208 0.721 11.58 4.20 Intr - 115318 115190 129 1 0 -4 55 211 0.266 9.49 4.19 Intr - 116103 115987 117 0 0 95 89 29 0.638 4.36 4.18 Intr - 117943 117860 84 1 0 87 82 78 0.984 7.12 4.17 Intr - 118992 118897 96 0 0 68 84 192 0.964 17.01 4.16 Intr - 119634 119563 72 0 0 58 91 66 0.911 3.50 4.15 Intr - 120071 119979 93 2 0 99 71 155 0.782 15.06 4.14 Intr - 120811 120722 90 1 0 128 101 149 0.995 20.39 4.13 Intr - 121037 120954 84 2 0 96 123 136 0.890 17.92 4.12 Intr - 121211 121119 93 2 0 72 91 60 0.446 4.86 4.11 Intr - 123932 123840 93 2 0 134 77 179 0.999 21.56 4.10 Intr - 124866 124776 91 2 1 67 82 164 0.948 13.70 4.09 Intr - 126646 126612 35 1 2 72 111 17 0.981 -0.58 4.08 Intr - 126946 126768 179 2 2 95 76 163 0.994 15.44 4.07 Intr - 129225 129087 139 0 1 110 107 223 0.978 26.44 4.06 Intr - 130383 130192 192 0 0 125 11 47 0.337 0.39 4.05 Intr - 132877 132724 154 0 1 62 109 307 0.998 30.27 4.04 Intr - 134928 134810 119 0 2 90 82 182 0.988 17.06 4.03 Intr - 135993 135841 153 0 0 112 98 295 0.645 33.17 4.02 Intr - 141183 141061 123 0 0 90 87 200 0.998 20.88 4.01 Init - 142530 142243 288 0 0 79 86 387 0.822 34.72 4.00 Prom - 144776 144737 40 -6.86 5.03 PlyA - 146719 146714 6 1.05 5.02 Term - 147527 147469 59 0 2 99 41 88 0.226 3.05 5.01 Init - 159771 157860 1912 0 1 91 119 1983 0.308 189.23 5.00 Prom - 171995 171956 40 -4.66 6.00 Prom + 178824 178863 40 -8.26 6.01 Sngl + 180636 181415 780 2 0 80 42 321 0.998 20.70 6.02 PlyA + 182764 182769 6 1.05 7.00 Prom + 190849 190888 40 -5.56 7.01 Init + 193131 193252 122 2 2 82 27 96 0.447 2.56 7.02 Intr + 199259 199319 61 2 1 48 105 45 0.399 0.94 7.03 Intr + 200141 200280 140 1 2 93 70 32 0.672 1.16 7.04 Intr + 200621 200892 272 2 2 41 37 203 0.733 7.69 7.05 Intr + 200943 201028 86 1 2 28 91 76 0.970 1.44 7.06 Term + 201479 201688 210 1 0 23 32 153 0.666 0.49 7.07 PlyA + 204546 204551 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:17500737_17707143|GENSCAN_predicted_peptide_1|245_aa MYRRSYVFQTRKEQYEHADEASRAAEPERPADEGWAGATSLAALQGLGERVAAHVQRARA LEQRHAGLRRQLDAFQRLGELAGPEDALARQVESNRQRVRDLEAERARLERQGTEAQRAL DEFRSKYENECECQLLLKEMLERLNKEADEALLHNLRLQLEAQFLQDDISAAKDRHKKNL LEVQTYISILQQIIHTTPPASIVTSGMREEKLLTEREVAALRSQLEEGREVLSHLQAQRV ELQAQ >gi568815578f:17500737_17707143|GENSCAN_predicted_CDS_1|735_bp atgtaccggcgcagctacgtcttccagacccgcaaggagcagtacgagcacgccgacgag gcttcgcgcgccgccgagcccgagcgcccggccgacgagggctgggctggggcaacgagc ctggcggcgctgcaggggctcggcgagcgcgtggccgcccacgtccagcgggcccgcgcc ctcgagcagcgccatgccgggctccggaggcagctggatgccttccagcgcctgggcgag ctggccgggcccgaggacgccctcgcccgccaagtcgagagcaaccgccagcgcgtccgg gacctggaggccgagcgcgcccggctggagcgccagggcaccgaggcgcagcgcgcgctc gacgagttccgaagcaagtatgaaaatgagtgcgaatgtcaactcctgctaaaagaaatg cttgaacggcttaacaaggaagctgatgaagccttgctgcataacctacgccttcagctg gaagcccaatttctgcaagatgatatcagtgcggcaaaggacaggcacaagaagaatctt ctggaagttcagacctatatcagcatcctgcagcagatcatccacaccactcctccagca tccattgtgacgagtgggatgagggaggagaagctcctgacggagcgggaggtggccgcc ctgcggagtcagctggaggagggccgggaggtgctctcccacctgcaggcgcagagagtg gagctgcaggcacag >gi568815578f:17500737_17707143|GENSCAN_predicted_peptide_2|45_aa MARAGRQPPGQVCLPLGRGEAVEGKRRGEPGERGSFAGSSSSPNR >gi568815578f:17500737_17707143|GENSCAN_predicted_CDS_2|138_bp atggcccgcgcggggaggcagccacctggccaggtgtgcctgccgctggggcggggcgag gccgtggaggggaaaaggaggggagagccgggggagaggggctcctttgctggttcctct tcatctcctaaccgctga >gi568815578f:17500737_17707143|GENSCAN_predicted_peptide_3|213_aa MVILFDRVHSFERDAHGVINGKPLDCSKYGDQIHVYVEDELEGKGDGQLASGVQVADEVC RIFYDMKVRKCSTPEEIKKRKKAVIFCLSADKKCIIVEEGKEILVGDVGVTITDPFKHFV GMLPEKDCRYALYDASFETKESRKEELMFFLWAPELAPLKSKMIYASSKDAIKKKFQGIK HECQANGPEDLNRACIAEKLGGSLIVAFEGCPV >gi568815578f:17500737_17707143|GENSCAN_predicted_CDS_3|642_bp atggtcatcctctttgaccgagtgcatagctttgagagggatgcacatggagtgatcaat ggaaagccactggactgttctaagtatggtgatcagattcatgtttatgtagaggatgaa ttggagggtaaaggtgatgggcagttggcctcaggagtgcaagtagctgatgaagtatgt cgcattttttatgacatgaaagttcgtaaatgctccacaccagaagaaatcaagaaaaga aagaaggctgtcattttttgtctcagtgcagacaaaaagtgcatcattgtagaagaaggc aaagagatcttggttggagatgttggtgtaaccataactgatcctttcaagcattttgtg ggaatgcttcctgaaaaagattgtcgctatgctttgtatgatgcaagctttgaaacaaaa gaatccagaaaagaagagttgatgttttttttgtgggcaccagaactagcacctctgaaa agtaaaatgatctatgcaagctccaaggatgcaattaaaaagaaatttcaaggcataaaa catgaatgtcaagcaaatggaccagaagatctcaatcgggcttgtattgctgaaaagtta ggtggatccttaattgtagcctttgaaggatgccctgtgtag >gi568815578f:17500737_17707143|GENSCAN_predicted_peptide_4|868_aa MLSLLTPSSQRALRGCDQNLLGVMGIVATCVVAYGPKSQLFPSCCTGPPDADGPLYLPYK TLVSTVGSMVFNEGEAQRLIEILSEKAGIIQDTWHKATQKGDPVAILKRQLEEKEKLLAT EQEDAAVAKSKLRELNKEMAAEKAKAAAGEAKVKKQLVAREQEITAVQARMQASYREHVK EVQQLQGKIRTLQEQLENGPNTQLARLQQENSILRDALNQATSQVESKQNAELAKLRQEL SKVSKELVEKSEAVRQDEQQRKALEAKAAAFEKQVLQLQELELSNPLLRSNAESTAHPKA APSFHCTVHVNHGLVATSLVMASLAPATPLPGWCSSEQLPQLQASHRESEEALQKRLDEV SRELCHTQSSHASLRADAEKAQEQQQQMAELHSKLQSSEAEVRSKCEELSGLHGQLQEAR AENSQLTERIRSIEALLEAGQARDAQDVQASQAEADQQQTRLKELESQVSGLEKEAIELR EAVEQQKVKNNDLREKNWKAMEALATAEQACKEKLLSLTQAKEESEKQLCLIEAQTMEAL LALLPELSVLAQQNYTEWLQDLKEKGPTLLKHPPAPAEPSSDLASKLREAEETQSTLQAE CDQYRSILAETEGMLRDLQKSVEEEEQVWRAKVGAAEEELQKSRVTVKHLEEIVEKLKGE LESSDQVREHTSHLEAELEKHMAAASAECQNYAKEVAGLRQLLLESQSQLDAAKSEAQKQ SDELALVRQQLSEMKSHVEDGDIAGAPASSPEAPPAEQDPVQVRAGSSPGCRVMFAVFAQ LKTQLEWTEAILEDEQTQRQKLTAEFEEERLEKEKKLTSDLGRAATRLQELLKTTQEQLA REKDTVKKLQEQLEKAEDGSSSKEGTSV >gi568815578f:17500737_17707143|GENSCAN_predicted_CDS_4|2607_bp atgctcagcctgctgaccccaagcagtcagagagctcttcgtggctgtgaccagaacctt cttggtgtgatgggcattgtggccacgtgtgttgtggcttatgggcctaagtctcagctc tttccctcttgctgcacagggcccccagatgccgacggccctctctacctcccctacaag acgctggtctccacggttgggagcatggtgttcaacgagggcgaggcccagcggctcatc gagatcctgtctgagaaggctggcatcattcaggacacctggcacaaggccactcagaag ggtgaccctgtggcgattctgaaacgccagctggaagagaaggaaaaactgctggccaca gaacaggaagatgcggctgtcgccaagagcaaactgagggagctcaacaaggagatggca gcagaaaaggccaaagcagcagccggggaggccaaagtgaaaaagcagctggtggcccgg gagcaggagatcacggctgtgcaggcacgcatgcaggccagctaccgggagcacgtgaag gaggtgcagcagctgcagggcaagatccggactcttcaggagcagctggagaatggcccc aacacgcagctggcccgcctgcagcaggagaactccatcctgcgggatgccttgaaccag gccacgagccaggtggagagcaagcagaacgcagagctggccaagcttcggcaggagctc agcaaggtcagcaaagagctggtggagaagtcagaggctgtgcggcaagatgagcagcag cggaaagctctggaagccaaggcagctgccttcgagaagcaggtcctgcagctgcaggag ttggaactgagtaatcccctgctacgttctaatgcagaaagcactgcccatcctaaagct gcccccagttttcactgcacagtacacgtgaaccacggcctagtggccacctccttggtg atggcttcactggctccagccacgcctctgcccggatggtgctcatctgagcagttaccc cagctccaggcgtcccacagggagagtgaggaggccctgcagaagcgcctggacgaggtc agccgggagctgtgccacacgcagagcagccacgccagcctccgggcggatgccgagaag gcccaggagcaacagcagcagatggccgagctgcacagcaagttacagtcctccgaggcg gaggtgcgcagcaaatgcgaggagctgagtggcctccacgggcagctccaggaggccagg gcggagaactcccagctcacagagagaatccgttccattgaggccctgctggaggcgggc caggcgcgggatgcccaggacgtccaggccagccaggcggaggctgaccagcagcagact cgcctcaaggagctggagtcccaggtgtcgggtctggagaaggaggccatcgagctcagg gaggccgtcgagcagcagaaagtgaagaacaatgacctccgggagaagaactggaaggcc atggaggcactggccacggccgagcaggcctgcaaggagaagctgctctccctgacccag gccaaggaggaatcggagaagcagctctgtctgattgaggcgcagaccatggaggccctg ctggctctgctcccagaactctctgtcttggcacaacagaattacaccgagtggctgcag gatctcaaagagaaaggccccacgctgctgaagcacccgccagctcccgcggagccctcc tcggacctggcctccaagttgagggaggccgaggagacgcagagcacactgcaggccgag tgtgaccagtaccgcagcatcctggcggagacggagggcatgctcagagacctgcagaag agcgtggaggaggaggagcaggtgtggagggccaaggtgggcgccgcagaggaggagctc cagaagtcccgggtcacagtgaagcatctcgaagagattgtagagaagctaaaaggagaa cttgaaagttcggaccaggtgagggagcacacgtcgcatttggaggcagagctggaaaag cacatggcggccgccagcgccgagtgccagaactacgccaaggaggtggcagggctgagg caacttctcctagaatctcaatctcagctcgatgccgccaagagcgaagcccagaaacag agcgatgagcttgccctggtcaggcagcagttgagtgaaatgaagagccacgtagaggat ggtgacatagctggggccccagcttcctccccagaggcgcccccagccgagcaggacccc gttcaggttagggcgggttcctcaccgggctgccgggttatgtttgcggtgtttgcacag ctgaagacgcagctggagtggacagaagccatcctggaggatgagcagacacagcggcag aagctcacggccgagtttgaggaggagagactagaaaaagagaagaagttaacaagtgac ctggggcgcgccgccacgagactgcaggagcttctgaagacgacccaggagcagctggca agggagaaggacacggtgaagaagctgcaggaacagctggaaaaggcagaggacggcagc agctcaaaggagggcacctctgtctga >gi568815578f:17500737_17707143|GENSCAN_predicted_peptide_5|656_aa MDIYDTQTLGVVVFGGFMVVSAIGIFLVSTFSMKETSYEEALANQRKEMAKTHHQKVEKK KKEKTVEKKGKTKKKEEKPNGKIPDHDPAPNVTVLLREPVRAPAVAVAPTPVQPPIIVAP VATVPAMPQEKLASSPKDKKKKEKKVAKVEPAVSSVVNSIQVLTSKAAILETAPKEVPMV VVPPVGAKGNTPATGTTQGKKAEGTQNQSKKAEGAPNQGRKAEGTPNQGKKTEGTPNQGK KAEGTPNQGKKAEGTPNQGKKAEGAQNQGKKVDTTPNQGKKVEGAPTQGRKAEGAQNQAK KVEGAQNQGKKAEGAQNQGKKGEGAQNQGKKAEGAQNQGKKAEGAQNQGKKAEGAQNQGK KAEGAQNQGKKAEGAQNQGKKAEGAQNQGKKVEGAQNQGKKAEGAQNQGKKAEGAQNQGK KAEGAQNQGKKAEGAQNQGKKAEGAQNQGKKAEGAQNQGKKAEGAQNQGKKVEGAQNQGK KAEGAQNQGKKAEGAQNQGKKAEGAQNQGQKGEGAQNQGKKTEGAQGKKAERSPNQGKKG EGAPIQGKKADSVANQGTKVEGITNQGKKAEGSPSEGKKAEGSPNQGKKADAAANQGKKT ESASVQGRNTDVAQSPEAPKQEAPAKKKSGSKKKGEPGSGGAFTATVQERAVRGQQ >gi568815578f:17500737_17707143|GENSCAN_predicted_CDS_5|1971_bp atggatatttacgacactcaaaccttgggggttgtggtctttggaggattcatggttgtt tctgccattggcatcttcctggtgtcgactttctccatgaaggaaacgtcatatgaagaa gccctagccaaccagcgcaaggagatggcgaaaactcaccaccagaaagtcgagaagaaa aagaaggagaaaacagtggagaagaaaggaaagaccaagaaaaaggaagagaaacctaat gggaagatacctgatcatgatccagcccccaatgtgactgtcctccttcgagaaccagtg cgggctcctgctgtggctgtggctccaaccccagtgcagccccccattatcgttgctcct gtcgccacagttccagccatgccccaggagaagctggcctcctcccccaaggacaaaaag aagaaggagaaaaaagtggcaaaagtggaaccagctgtcagctctgtagtgaattccatc caggttctcacttcgaaggctgccatcttggaaactgctcccaaggaggtgccgatggtg gtggtgcccccagtgggtgccaagggcaacacaccagccactggcactactcagggcaaa aaggcggaggggactcagaatcaaagcaaaaaggctgaaggagccccaaaccagggcaga aaggcagagggaaccccaaaccagggcaaaaagacagagggaaccccaaaccaagggaaa aaggcagagggaaccccaaaccaaggcaaaaaggcagaaggaaccccaaaccaaggcaaa aaggcggagggggcccagaaccagggtaaaaaggtagatacaaccccaaaccaggggaaa aaggtggagggggccccaacccagggcagaaaggccgagggggctcagaaccaggccaaa aaggtagaaggggcccagaaccagggcaaaaaggcagagggggcccagaatcagggcaaa aagggagagggggcccagaaccagggcaagaaggccgagggggcccagaatcagggcaag aaggccgagggggcccagaatcagggcaagaaggccgagggggcccagaatcagggcaag aaggccgagggggcccagaatcagggcaagaaggctgagggggctcagaaccagggcaaa aaggccgagggggctcagaaccagggcaaaaaagtagaaggggcccagaaccagggcaag aaggctgagggtgcccagaaccagggcaaaaaggccgagggggcccagaatcagggcaaa aaggccgagggggcccagaaccagggcaagaaggccgagggggcccagaaccagggcaag aaggccgagggggcccagaaccagggcaagaaggccgagggggcccagaaccagggcaag aaggccgagggtgctcagaaccagggcaaaaaagtagaaggggcccagaaccagggcaag aaggctgagggggcccagaaccagggcaagaaggccgagggggctcagaaccagggcaaa aaggccgagggagcccagaaccagggccaaaaaggagagggagcccagaatcagggtaaa aagacagaaggggctcagggcaaaaaggcagaaaggagtcccaaccaaggcaaaaaagga gagggagctcccatccagggcaaaaaggcagattcggttgctaatcagggcacaaaggta gagggtattacaaaccaggggaaaaaagcagaagggtcccccagtgaaggcaaaaaggca gaagggtcccccaaccaaggcaaaaaggcagacgcagctgccaatcagggtaaaaagaca gagtcagcttctgtccagggcagaaatacagatgtggcccagagcccagaggcaccaaag caagaggctcctgccaagaagaagtctggttcaaagaaaaaaggtgagcctggctcagga ggagccttcacggccacagtccaagaacgagcagttagaggccagcagtga >gi568815578f:17500737_17707143|GENSCAN_predicted_peptide_6|259_aa MDRPGPCAASPAAPARPVGGRPSPAKRQRAAARDRRARPVGPHSIPPSSPRPASAAFRPH PTPPARTPARQRHPAASRNLTPSSASCAPPRPAPPPTFFAQLRSALAGPTALPGGREAGS GRHVAPVCRRSSPDGEGRGVHTTGRALPTPRYGACPAPPPRVRPRPRWCTAAAAGGGRRA GRGRGRRGDVSLAAQRGRPRPPDGPARARPLQGPAKFLTCSQRRERESERAGAAAFAAAA EPGERSRPSLRQETAPEQL >gi568815578f:17500737_17707143|GENSCAN_predicted_CDS_6|780_bp atggacagacccggtccctgcgccgcgagccctgcggccccggcccgccccgtcgggggc cgtcccagccccgcgaagcgccagcgggccgcggcccgggaccggcgcgctcggcccgtg ggcccccattcaatcccgccctcctcccctcggccggcctccgccgccttccgcccgcac cccacccctccggcccggactccagcccggcaacgtcatccggcggcttcccgaaactta accccttcctccgccagctgtgcgccgccccgccccgccccgccgccaactttcttcgcc caacttcggagcgcgctggccggcccgacggccctgccgggagggagggaggcagggagc ggtcgccacgtcgcgcccgtctgccgccgcagctccccggacggcgagggccgcggggtc cacaccacggggcgcgccctgcctacgccccggtacggcgcttgcccggcacccccgccc cgagtccgaccccgaccccgatggtgcacggccgcggcggcaggcggcgggcggcgggcg ggccgggggaggggccgccgaggagacgtgtcactcgcggcgcagcgcggccggccccgg ccccccgacggcccggcccgcgcgcggccgctgcagggcccggccaagtttcttacctgc agccagagacgcgagagggaaagcgagagggcaggagccgccgccttcgcagccgccgcg gagccgggagagaggagccggccaagcctccgccaagaaaccgcccccgagcagctgtag >gi568815578f:17500737_17707143|GENSCAN_predicted_peptide_7|296_aa MPSLFPHLHTTVADLEPDGQWQAFRSMADGHVLDGFLQPLGSRSKVTETNCQLPLAYQEH LSVYGPFTICECQHVHTAAAASLLPQRSGFYRDPMILRRLLALLDCVRCSQVHSTDIETE DCSVLVPRLKRVINLEHAFTLYRLSPTLELSEKRMGCFARQQECAGHGSSISMEGNGILV ISWELDQKTFKGCFQIQDCGSRLKDGCKLSVRQGELGRTIEISYGPQANCHRCAKFHSAY FQLLLDVALMDAELPVGPPVSLARPLTLMGQDGAMSIHGQMLCTKEENLLIESSFS >gi568815578f:17500737_17707143|GENSCAN_predicted_CDS_7|891_bp atgccctccttattcccgcatcttcacaccacggtggctgacttggagcctgatgggcag tggcaagcgttcagatctatggccgacgggcacgtcttggacggattccttcagccactg ggttccagatccaaggtgactgagacaaactgccagctgccactggcttatcaggagcac ctgtctgtctacggtccattcaccatctgtgagtgccaacacgtccacacagctgcagct gcgtcccttctcccacaaaggagtggcttctatagggaccccatgattttaagaaggctc ctggctctccttgactgtgtaaggtgttctcaggttcattctacagacatagaaactgaa gactgttctgtgctggtgccacgtctaaagagggtcattaacctggagcatgccttcact ctctatcgactatccccaacattagagctctctgaaaagagaatgggctgctttgcaagg cagcaagagtgtgcaggccacggctcttccatcagcatggaagggaacgggattcttgtg atcagctgggagctggaccaaaagacctttaaaggctgtttccaaatccaagattgtggg tcacgccttaaggacggctgcaaactttctgttaggcaaggagaactgggaagaaccata gagatcagctacgggccccaggcaaactgccatcgatgtgccaagttccacagtgcctat ttccagctcctcttggacgttgccctgatggatgctgaactccctgtgggcccgcctgtt tcactggcccggcccctcactctgatggggcaagatggagcgatgtctatacacggacag atgttatgcaccaaggaagaaaatctgctaatcgaaagcagcttcagctaa