GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:27:15

Sequence gi568815591r:76229580_76458808 : 229229 bp : 50.86% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5449   5720  272  0  2  113   72   409 0.720  38.14
 1.02 Intr +  18609  18710  102  0  0  133   84   132 0.674  16.59
 1.03 Intr +  30217  30303   87  1  0   87   77    34 0.689   1.29
 1.04 Intr +  30327  30455  129  0  0   53   76   257 0.668  20.81
 1.05 Intr +  30538  30618   81  1  0  140   97    64 0.999  11.25
 1.06 Intr +  31773  31835   63  0  0   87   80   120 0.981   8.93
 1.07 Intr +  35186  35236   51  2  0  113   99    56 0.954   7.22
 1.08 Intr +  37679  37904  226  2  1   57   40   183 0.573   8.39
 1.09 Intr +  44505  44621  117  2  0   69   77    71 0.587   4.76
 1.10 Intr +  51862  52223  362  0  2  111  101   140 0.808  11.62
 1.11 Intr +  53069  53293  225  2  0  101  117   142 0.998  15.50
 1.12 Intr +  53385  53522  138  0  0   96   80   168 0.984  16.38
 1.13 Term +  56036  56264  229  2  1   97   52   236 0.905  17.00
 1.14 PlyA +  57687  57692    6                               1.05

 2.00 Prom +  73059  73098   40                              -6.86
 2.01 Init +  73134  73497  364  2  1  102  105   742 0.678  72.51
 2.02 Intr +  74223  74286   64  1  1  118   94   144 0.999  15.78
 2.03 Term +  74405  74594  190  2  1   94   42   288 0.965  21.72
 2.04 PlyA +  74692  74697    6                               1.05

 3.05 PlyA -  75643  75638    6                               1.05
 3.04 Term - 100654  99998  657  1  0  113   48  1256 0.999 118.05
 3.03 Intr - 129245 129143  103  1  1  100   92   291 0.633  30.88
 3.02 Intr - 145905 145798  108  2  0   40   47   109 0.195   1.40
 3.01 Init - 153855 153713  143  0  2   83   70    65 0.347   3.81
 3.00 Prom - 157707 157668   40                              -6.46

 4.11 PlyA - 159783 159778    6                               1.05
 4.10 Term - 160796 160480  317  0  2  116   46   277 0.999  21.40
 4.09 Intr - 162462 162385   78  1  0   88   90    27 0.836   2.42
 4.08 Intr - 164137 163808  330  2  0  111  100   563 0.577  55.40
 4.07 Intr - 164325 164251   75  1  0   75  106    14 0.652   1.39
 4.06 Intr - 165751 165674   78  2  0  134   60    -5 0.481   0.82
 4.05 Intr - 168253 167939  315  2  0   95   80   506 0.909  46.54
 4.04 Intr - 169218 169141   78  1  0   99   68    47 0.695   3.32
 4.03 Intr - 171012 170707  306  1  0   78  101   411 0.947  37.82
 4.02 Intr - 178795 178631  165  2  0   73  103    50 0.612   4.93
 4.01 Init - 179424 179415   10  0  1   65   98    -1 0.566  -0.76
 4.00 Prom - 187250 187211   40                              -3.56

 5.00 Prom + 189555 189594   40                              -6.46
 5.01 Init + 193737 193793   57  2  0   60   91     0 0.358  -1.15
 5.02 Intr + 195473 195697  225  1  0  110  110   236 0.693  26.18
 5.03 Intr + 199936 200054  119  0  2   97   67   228 0.983  20.86
 5.04 Intr + 203348 203451  104  2  2   79   90    97 0.910   8.82
 5.05 Intr + 203891 204068  178  0  1   58   80   208 0.999  15.98
 5.06 Intr + 204459 204576  118  0  1   87   95    88 0.992   9.87
 5.07 Intr + 210671 210762   92  1  2   71   94   122 0.997   9.89
 5.08 Intr + 210896 211032  137  2  2   77   80    37 0.898   2.01
 5.09 Term + 212263 212477  215  2  2   85   42   146 0.824   7.09
 5.10 PlyA + 216156 216161    6                               1.05

 6.02 PlyA - 216569 216564    6                               1.05
 6.01 Term - 226221 226089  133  2  1  104   40    79 0.478   2.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:76229580_76458808|GENSCAN_predicted_peptide_1|693_aa
GQRPRPAAPGPATMSSTVNNGAASMQSTPDAANGFPQPSSSSGTWPRAEEELRAAEPGLV
KRAHREILDHERKRRVELKCMELQEMMEEQGYSEEEIRQKVGTFRQMLMEKEGVLTREDR
PGGHIRVAARRLAGLGQAPCAEKRGEEKSSRDWCVAETPRLTEGAEPGLEYAPFDDDDGP
VDCDCPASCYRGHRGYRTKHWSSSSASPPPKKKKKKKGGHRRSRCGSSSPLRKKKKSVKK
HRRDRSRSSKCKRKEKNKEKKRLSPKHRDEGRKTGSQRSSGSRSPSPSGGSGWGSPQRNG
GSGQRSGAHGGRPGSAHSPPDVRTLRFAEGSRAAGCAPPGLRSPRWPSGPFRAKGSDRGA
SAGPPGLQILEEVLAKKPSSPSPRVRDKAAAAAPTPPARGKESPSPRSAPSSQGRGGRAA
GGAGRRRRRRRRRRRSRSSASAPRRRGRRRPRPAPPRGSSRSLSRARSSSDSGSGRGAPG
PGPEPGSERGHGGHGKRAKERPPRARPASTSPSPGAHGRRGGPEGKSSSRSPGPHPRSWS
SSRSPSKSRSRSAEKRPHSPSRSPSPKKPLSRDKDGEGRARHSEAEATRARRRSRSYSPI
RKRRRDSPSFMEPRRITSARKRPIPYYRPSPSSSSSCLSSDYSTRSHSRSPSPGHSHGSY
SSRSHGTRSRTRSPSRTPSPSYHSRSSSESGGF

>gi568815591r:76229580_76458808|GENSCAN_predicted_CDS_1|2082_bp
ggccagcggcccaggccagcggctccagggccagccacgatgtcctccaccgtgaacaac
ggggcggccagcatgcagtccacacccgacgccgcgaacggcttcccgcagcccagctcc
tcctcggggacctggccgcgggcggaagaggagctgcgcgccgcggagccgggcctggtg
aagcgcgcgcaccgcgagatcctggaccacgagcgcaagcggcgggtggagctcaagtgc
atggagctgcaggagatgatggaggagcaggggtattcggaggaggagattcggcagaaa
gtggggacattccggcagatgctgatggagaaggagggagtgctcaccagggaggaccgg
cctgggggccacatccgggtagcagcccgcagacttgcggggctaggtcaggccccctgc
gccgagaaaaggggtgaagaaaagtcgtcccgagactggtgtgtggcggagaccccgcgg
ctgaccgagggcgctgagccgggcctggagtacgcgccctttgacgatgacgacggccca
gtggactgtgactgcccggcctcctgctaccgcggccaccgcgggtacaggaccaagcat
tggtctagcagctcggcatcgccccctcccaagaaaaagaagaaaaagaaaggcggccac
cggagaagccgctgtgggagctcctcacccctccgcaagaagaagaagagtgtgaagaag
catcgccgagacagatctcgaagctccaagtgcaaaagaaaagagaagaacaaagagaag
aagaggctgagccccaagcaccgagacgaagggcgaaagacgggcagccagcggtccagc
ggaagccggtcgccttccccgtcgggcggcagcggatgggggtcgccccagcggaacggc
ggcagcgggcagcggagcggagcgcacgggggccgccccggctcggcgcacagcccgccc
gatgtacgtacgcttcgctttgcggagggttcccgcgccgcgggctgcgcccccccggga
cttcggtcaccccgctggccttcagggccctttcgggcaaaaggcagtgacagaggagcc
agtgcggggcctcctggccttcagatcctggaggaagttctggccaagaagcccagctcg
ccctcgcccagggtccgtgacaaggcggcggccgccgcacccacgccgcccgcgcggggg
aaggagagcccgagcccgcgctcggcgccgtcgtcccaaggtcgcggaggccgcgcggcg
ggcggggcgggcaggcggcggcggcggcggcgtaggcggcggcgctcgcggtcctcggcg
tccgcgccccgccgcaggggtcgccggcgcccccggcccgcgcccccccggggctcgtcg
cgctcgctcagcagggcccgctccagcagcgactccggcagcggccgcggcgcccccggc
cccgggcccgagcccggctctgagcgaggccacggcggacacgggaaacgggccaaggag
cggcccccgcgcgcgcggcccgccagcacctctccgtccccgggcgcgcacggccggcgc
ggcggcccagaagggaagagctcgtcgcgcagccccggcccgcacccccgctcctggagc
tccagccgctcgccctccaaatctcgctcgcgctctgcggagaagcggccccacagcccc
agccgctcgccgtcgcccaagaagcccctcagccgggacaaggacggcgagggccgcgca
aggcactctgaggccgaggccacccgcgcccggcgccgctcccgcagctactcgcccatc
cgcaagcggcgccgggactcgccaagcttcatggagccgcggcgcatcaccagcgcccgc
aagcgtcctattccatactaccggcccagcccctcttcctcctccagctgcttgagcagc
gactactcgacccggagccacagccgcagccccagccccggccacagccacgggagctac
agcagtcgcagccatgggacccgcagccggacacgcagcccctcgaggacccccagtccc
agctaccacagccggagcagctctgagagcgggggcttctga

>gi568815591r:76229580_76458808|GENSCAN_predicted_peptide_2|205_aa
MTERRVPFSLLRGPSWDPFRDWYPHSRLFDQAFGLPRLPEEWSQWLGGSSWPGYVRPLPP
AAIESPAVAAPAYSRALSRQLSSGVSEIRHTADRWRVSLDVNHFAPDELTVKTKDGVVEI
TGKHEERQDEHGYISRCFTRKYTLPPGVDPTQVSSSLSPEGTLTVEAPMPKLATQSNEIT
IPVTFESRAQLGGPEAAKSDETAAK

>gi568815591r:76229580_76458808|GENSCAN_predicted_CDS_2|618_bp
atgaccgagcgccgcgtccccttctcgctcctgcggggccccagctgggaccccttccgc
gactggtacccgcatagccgcctcttcgaccaggccttcgggctgccccggctgccggag
gagtggtcgcagtggttaggcggcagcagctggccaggctacgtgcgccccctgcccccc
gccgccatcgagagccccgcagtggccgcgcccgcctacagccgcgcgctcagccggcaa
ctcagcagcggggtctcggagatccggcacactgcggaccgctggcgcgtgtccctggat
gtcaaccacttcgccccggacgagctgacggtcaagaccaaggatggcgtggtggagatc
accggcaagcacgaggagcggcaggacgagcatggctacatctcccggtgcttcacgcgg
aaatacacgctgccccccggtgtggaccccacccaagtttcctcctccctgtcccctgag
ggcacactgaccgtggaggcccccatgcccaagctagccacgcagtccaacgagatcacc
atcccagtcaccttcgagtcgcgggcccagcttgggggcccagaagctgcaaaatccgat
gagactgccgccaagtaa

>gi568815591r:76229580_76458808|GENSCAN_predicted_peptide_3|336_aa
MAMNLLAMKRFGKCYSRSFNGNLAWPAEEPRSVVIGAKKNVLTNQRSRVTPLGFLQQFVQ
MEEMPQLPGLVASTFPEQLSFLRRPSPAKMVDREQLVQKARLAEQAERYDDMAAAMKNVT
ELNEPLSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTSADGNEKKIEMVRAYREKIEK
ELEAVCQDVLSLLDNYLIKNCSETQYESKVFYLKMKGDYYRYLAEVATGEKRATVVESSE
KAYSEAHEISKEHMQPTHPIRLGLALNYSVFYYEIQNAPEQACHLAKTAFDDAIAELDTL
NEDSYKDSTLIMQLLRDNLTLWTSDQQDDDGGEGNN

>gi568815591r:76229580_76458808|GENSCAN_predicted_CDS_3|1011_bp
atggccatgaaccttttggctatgaaacggtttgggaagtgctattccaggtccttcaac
ggaaacttagcctggcctgcagaagaaccacggagcgttgtgattggagcaaagaagaat
gtgctaacgaatcagagaagccgggtcaccccgctgggctttctgcagcagtttgtgcag
atggaagagatgccccagctacctggcctggtggcctccaccttccctgagcagctgtct
ttcttgcgcagacccagccccgcgaagatggtggaccgcgagcaactggtgcagaaagcc
cggctggccgagcaggcggagcgctacgacgacatggccgcggccatgaagaacgtgaca
gagctgaatgagccactgtcgaatgaggaacgaaaccttctgtctgtggcctacaagaac
gttgtgggggcacgccgctcttcctggagggtcatcagtagcattgagcagaagacatct
gcagacggcaatgagaagaagattgagatggtccgtgcgtaccgggagaagatagagaag
gagttggaggctgtgtgccaggatgtgctgagcctgctggataactacctgatcaagaat
tgcagcgagacccagtacgagagcaaagtgttctacctgaagatgaaaggggactactac
cgctacctggctgaagtggccaccggagagaaaagggcgacggtggtggagtcctccgag
aaggcctacagcgaagcccacgagatcagcaaagagcacatgcagcccacccaccccatc
cgattaggcctggctcttaactactccgtcttctactatgagatccagaacgccccagag
caagcgtgccacttggccaagaccgcgttcgacgacgccatcgccgagcttgacaccctc
aacgaggactcctacaaggactccacgctcatcatgcagctcctccgcgacaacctcacg
ctctggacgagcgaccagcaggacgacgatggcggcgaaggcaacaattaa

>gi568815591r:76229580_76458808|GENSCAN_predicted_peptide_4|583_aa
MIRAPISQLSPVQAEGGQKVIIPLGAVLGPMNGVLKSLCLHPDRLSSSPQPDFVHQGTEL
RLVGGPSRCRGRLEVMHGGSWGSVCDDDWDVVDANVVCRQLGCGLALPVPRPLAFGQGRG
PILLDNVECRGQEAALSECGSRGWGVHNCFHYEDVAVLCDEFLPTQPPTRKMLTSRAPPT
TLPNGKSEGSVRLVGGANLCQGRVEILHSGLWGTVCDDDWGLPDAAVVCRQLGCGAAMAA
TTNAFFGYGTGHILLDNVHCEGGEPRLAACQSLGWGVHNCGHHEDAGALCAGLGPPTLTA
LPSSATREDWAWQTDPSATGVGPQPSRETALLTTAAWAAGKKSGRLRLVGGPGPCRGRVE
VLHAGGWGTVCDDDWDFADARVACREAGCGPALGATGLGHFGYGRGPVLLDNVGCAGTEA
RLSDCFHLGWGQHNCGHHEDAGALCAGEADSEGPEELGLQVQQDGSETTRVPTPRPRDGH
LRLVNGAHRCEGRVELYLGQRWGTVCDDAWDLRAAGVLCRQLGCGQALAAPGEAHFGPGR
GPILLDNVKCRGEESALLLCSHIRWDAHNCDHSEDASVLCQPS

>gi568815591r:76229580_76458808|GENSCAN_predicted_CDS_4|1752_bp
atgataagggctcccatctcccagctgtccccagtccaggcagaaggaggacaaaaggtg
attattccactgggggctgtcctgggccccatgaatggggtgttgaaatccctgtgtctg
catccagataggctttcatcttctccccaaccagactttgtacaccagggcacagagctg
aggctggtggggggccccagccgctgccggggccgcctggaagtcatgcacggtggctcc
tggggcagcgtctgtgatgacgactgggacgtggtggacgccaacgtagtgtgtcgccag
ctgggctgtggcctggcactgcccgtgccacggccccttgcctttggccaaggccgaggc
cccatcctgctggacaacgtggagtgccgcgggcaggaagctgcgctgagcgagtgcggc
agccgcggctggggcgtccacaattgctttcactacgaggatgtggctgtcctgtgtgat
gaattcttgccaacgcagcccccaacaaggaagatgttaaccagtagagcacctcctacg
acactgccgaatggaaaaagtgagggcagcgtacgcctggtagggggcgcgaacctgtgt
cagggccgagtggagatcctgcacagtggcctgtggggcaccgtgtgtgacgacgactgg
gggctgccggatgccgctgtggtctgtcgtcagctgggctgcggggcggccatggccgcc
accaccaacgccttcttcggctatggcaccggacacatcctgctggacaacgtgcactgc
gaaggcggcgagccccgcctggcagcctgccagagcctgggctggggtgtgcacaactgc
ggccaccacgaggacgcgggcgcgctctgcgcaggcctgggtcccccaacgctcacagca
ctgccatcctcagccacaagagaggactgggcttggcagacagatccgtccgctacagga
gttggcccccagccttcccgggagacagcactgctcaccaccgccgcctgggccgcgggg
aagaaaagtggacggctgcgactggtgggcggcccgggtccgtgccgcggccgcgtggag
gtgttgcacgccgggggctggggcaccgtgtgcgacgatgactgggactttgcggacgcg
cgcgtggcctgccgcgaagcgggctgcgggcctgcgctgggcgctacgggactgggccac
ttcggctacggccgcggccccgtgctgctggacaacgtgggctgcgccggcaccgaggct
cgcctgagcgactgcttccacctgggctggggccagcacaactgcggccaccacgaggac
gcgggagcgctctgcgcaggtgaggctgacagcgaaggcccagaggagctgggactgcaa
gtccagcaggatggttctgagaccacgcgggtgcccactcctcggcccagggacgggcat
ctacgtctggtcaatggagcccaccgatgcgagggacgtgtagagctctacctagggcaa
cggtggggcactgtctgtgatgatgcttgggacctgcgggcagccggtgtcctgtgccgc
cagctgggctgtggccaggccctcgcagcccctggcgaggctcactttggcccaggccga
ggccccattctcctggacaatgtcaagtgccgtggggaagaaagtgctctgctgctctgc
tctcatatccgctgggatgcccacaactgtgaccacagcgaggatgccagtgtcctgtgc
cagccttcatga

>gi568815591r:76229580_76458808|GENSCAN_predicted_peptide_5|414_aa
MGIYIRDRACRLCWAKHVKGGASHPETSVQPVLVECQEATLMVMVSKDLFGTGKLIRAAD
LTLGPEACEPLVSMDTEDVVRFEVGLHECGNSMQVTDDALVYSTFLLHDPRPVGNLSIVR
TNRAEIPIECRYPRQGNVSSQAILPTWLPFRTTVFSEEKLTFSLRLMEENWNAEKRSPTF
HLGDAAHLQAEIHTGSHVPLRLFVDHCVATPTPDQNASPYHTIVDFHGCLVDGLTDASSA
FKVPRPGPDTLQFTVDVFHFANDSRNMIYITCHLKVTLAEQDPDELNKACSFSKPSNSWF
PVEGSADICQCCNKGDCGTPSHSRRQPHVMSQWSRSASRNRRHVTEEADVTVGPLIFLDR
RGDHEVEQWALPSDTSVVLLGVGLAVVVSLTLTAVILVLTRRCRTASHPVSASE

>gi568815591r:76229580_76458808|GENSCAN_predicted_CDS_5|1245_bp
atgggaatttacatcagggatagggcatgccggctgtgctgggcaaagcacgtgaagggt
ggagccagccatcctgagacgtccgtacagcccgtactggtggagtgtcaggaggccact
ctgatggtcatggtcagcaaagacctttttggcaccgggaagctcatcagggctgctgac
ctcaccttgggcccagaggcctgtgagcctctggtctccatggacacagaagatgtggtc
aggtttgaggttggactccacgagtgtggcaacagcatgcaggtaactgacgatgccctg
gtgtacagcaccttcctgctccatgacccccgccccgtgggaaacctgtccatcgtgagg
actaaccgcgcagagattcccatcgagtgccgctaccccaggcagggcaatgtgagcagc
caggccatcctgcccacctggttgcccttcaggaccacggtgttctcagaggagaagctg
actttctctctgcgtctgatggaggagaactggaacgctgagaagaggtcccccaccttc
cacctgggagatgcagcccacctccaggcagaaatccacactggcagccacgtgccactg
cggttgtttgtggaccactgcgtggccacaccgacaccagaccagaatgcctccccttat
cacaccatcgtggacttccatggctgtcttgtcgacggtctcactgatgcctcttctgca
ttcaaagttcctcgacccgggccagatacactccagttcacagtggatgtcttccacttt
gctaatgactccagaaacatgatatacatcacctgccacctgaaggtcaccctagctgag
caggacccagatgaactcaacaaggcctgttccttcagcaagccttccaacagctggttc
ccagtggaaggctcggctgacatctgtcaatgctgtaacaaaggtgactgtggcactcca
agccattccaggaggcagcctcatgtcatgagccagtggtccaggtctgcttcccgtaac
cgcaggcatgtgacagaagaagcagatgtcaccgtggggccactgatcttcctggacagg
aggggtgaccatgaagtagagcagtgggctttgccttctgacacctcagtggtgctgctg
ggcgtaggcctggctgtggtggtgtccctgactctgactgctgttatcctggttctcacc
aggaggtgtcgcactgcctcccaccctgtgtctgcttccgaataa

>gi568815591r:76229580_76458808|GENSCAN_predicted_peptide_6|44_aa
XSHLGPELAPSPETPGDVHSSEIMSKAAVASNQPIPPGCQMEAN

>gi568815591r:76229580_76458808|GENSCAN_predicted_CDS_6|135_bp
nngtcacacctgggtcctgagctggcgccatcaccggagacgccaggtgatgtccattca
tcagaaataatgagcaaagcagctgtggcttctaatcagcccatccctccagggtgtcag
atggaggccaattag