GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:22:11 Sequence gi568815590r:38044276_38250818 : 206543 bp : 44.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 171 166 6 1.05 1.03 Term - 2163 2096 68 1 2 89 37 43 0.775 -2.60 1.02 Intr - 2439 2355 85 0 1 83 75 104 0.630 7.99 1.01 Init - 7127 7041 87 2 0 55 103 78 0.587 6.64 1.00 Prom - 7509 7470 40 -6.36 2.00 Prom + 11419 11458 40 -1.06 2.01 Init + 12141 12159 19 2 1 53 68 -16 0.499 -6.84 2.02 Intr + 12806 12985 180 0 0 147 101 228 0.999 29.84 2.03 Intr + 18856 18869 14 2 2 74 99 0 0.002 -5.80 2.04 Intr + 28284 28388 105 2 0 116 45 107 0.392 9.61 2.05 Intr + 28773 28821 49 2 1 82 101 56 0.352 4.55 2.06 Term + 34682 34800 119 0 2 -24 45 123 0.050 -4.50 2.07 PlyA + 34959 34964 6 1.05 3.06 PlyA - 34998 34993 6 1.05 3.05 Term - 40726 40565 162 1 0 26 33 137 0.067 -0.06 3.04 Intr - 43781 43600 182 0 2 41 69 124 0.201 5.39 3.03 Intr - 45460 45060 401 0 2 83 81 170 0.222 9.95 3.02 Intr - 53985 53795 191 0 2 29 110 47 0.001 -0.62 3.01 Init - 56836 56834 3 1 0 97 81 0 0.003 0.20 3.00 Prom - 58135 58096 40 -6.16 4.00 Prom + 58934 58973 40 -4.96 4.01 Init + 61276 61463 188 0 2 104 94 111 0.875 9.84 4.02 Intr + 62103 62169 67 0 1 70 94 57 0.931 3.41 4.03 Intr + 62746 62891 146 0 2 57 80 132 0.876 8.38 4.04 Intr + 66104 66192 89 2 2 70 71 47 0.885 0.81 4.05 Intr + 66464 66558 95 0 2 93 75 72 0.929 6.08 4.06 Intr + 70630 70725 96 0 0 61 88 88 0.987 6.31 4.07 Intr + 72375 72450 76 2 1 80 115 43 0.999 5.29 4.08 Intr + 74995 75088 94 2 1 67 59 140 0.985 8.02 4.09 Intr + 76657 76874 218 1 2 106 87 115 0.997 11.35 4.10 Intr + 84016 84183 168 2 0 87 87 152 0.979 14.92 4.11 Intr + 84483 84676 194 1 2 74 113 144 0.998 14.71 4.12 Intr + 89179 89271 93 0 0 73 105 48 0.960 5.16 4.13 Intr + 91393 91491 99 0 0 71 100 26 0.822 2.41 4.14 Term + 94689 94796 108 2 0 107 47 61 0.964 2.41 4.15 PlyA + 95406 95411 6 1.05 5.08 PlyA - 97493 97488 6 1.05 5.07 Term - 100111 99998 114 1 0 64 49 162 0.951 8.47 5.06 Intr - 101040 100947 94 2 1 90 67 27 0.460 0.77 5.05 Intr - 101872 101688 185 1 2 50 90 236 0.995 18.59 5.04 Intr - 102172 102014 159 1 0 104 105 313 0.849 34.78 5.03 Intr - 104052 103925 128 1 2 58 111 143 0.998 14.00 5.02 Intr - 104479 104366 114 2 0 72 123 80 0.346 10.32 5.01 Init - 106543 106480 64 1 1 68 84 55 0.272 4.41 5.00 Prom - 118553 118514 40 -6.16 6.05 PlyA - 119084 119079 6 1.05 6.04 Term - 119565 119395 171 0 0 87 33 226 0.998 14.83 6.03 Intr - 125642 125527 116 0 2 47 92 92 0.930 5.67 6.02 Intr - 132062 132000 63 0 0 69 79 93 0.963 5.19 6.01 Init - 132596 132422 175 2 1 99 78 90 0.975 8.62 6.00 Prom - 134260 134221 40 -6.06 7.00 Prom + 141120 141159 40 -4.36 7.01 Init + 150429 150440 12 2 0 110 89 4 0.685 2.65 7.02 Intr + 163345 163491 147 0 0 50 109 55 0.384 4.23 7.03 Intr + 164738 164992 255 1 0 85 97 146 0.997 12.84 7.04 Term + 165733 166218 486 0 0 72 48 342 0.890 23.30 7.05 PlyA + 167225 167230 6 1.05 8.00 Prom + 179158 179197 40 -2.96 8.01 Init + 188810 188939 130 1 1 79 82 40 0.232 2.81 8.02 Intr + 190119 190309 191 1 2 54 51 122 0.222 4.40 8.03 Intr + 193814 193934 121 1 1 63 40 54 0.435 -1.83 8.04 Intr + 197975 198110 136 0 1 77 56 48 0.487 0.13 8.05 Intr + 201467 201675 209 2 2 88 60 133 0.816 9.22 8.06 Intr + 201958 202025 68 2 2 81 86 14 0.860 -0.88 8.07 Intr + 203438 203560 123 1 0 50 58 119 0.459 5.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 49480 49615 136 0 1 59 38 121 0.811 4.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:38044276_38250818|GENSCAN_predicted_peptide_1|79_aa MPNTKNEKYKHVLQPGCRESSNCHPKSQQIGDDNDDDDDGSANFIGLLCGFEQWLTQCPR KPRTIINFSMFVSQYPSGF >gi568815590r:38044276_38250818|GENSCAN_predicted_CDS_1|240_bp atgccaaacaccaagaatgagaagtacaagcacgttctccaacctggatgcagggagagc tccaactgtcaccccaaatcacaacagattggcgacgacaatgatgatgatgatgatggt agtgccaacttcataggattactgtgtggatttgaacagtggctgacacaatgcccacgg aaaccaagaaccatcataaacttcagcatgtttgttagccagtatccttcaggattttaa >gi568815590r:38044276_38250818|GENSCAN_predicted_peptide_2|161_aa MHFFHRGTRIIYDRKFLMECRNSPVTKTPPRDLPTIPGVTSPSSDEPPMEASQSHLRNSP EDKRAGGSKRQIFGFLVLHSFIHTLSASTMIHRLQVPAEKPTYHILHSAGTLTRDIRYGL THEMKAQKLQQRLNDCSIEMNYRIIIFIMTLTNTESFVLLN >gi568815590r:38044276_38250818|GENSCAN_predicted_CDS_2|486_bp atgcattttttccaccgtggtaccaggatcatctatgaccggaaattcctgatggagtgt cggaactcacctgtgaccaaaacacccccaagggatctgcccaccattccgggggtcacc agcccttccagtgatgagccccccatggaagccagccagagccacctgcgcaatagccca gaagataagcgggcgggcgggtctaagagacagatatttgggtttcttgtactacacagc ttcattcatactctaagtgccagcaccatgattcatcgcctgcaggttccagcagaaaaa cccacctaccacatcctgcattctgcagggactctcacaagagacattcgctatggccta acccatgaaatgaaggcccagaagcttcagcagagactgaatgactgctccattgagatg aactaccgcataattatcttcatcatgactttaacaaacacagaaagctttgttttgctt aattga >gi568815590r:38044276_38250818|GENSCAN_predicted_peptide_3|312_aa MCYYREQETKGKYWTLEGLAELGGCTPAGAWISVFTLCSICTKLRKSHSKDERKHASEVL VLPGRHFQLEEHTLSYMEKPQALIDLMQSIFLTHNPTWADCKQLLLSLFNTEEHRRVIQA AHQRLEKNAPVGTGDVRQCARQALPIETDPGWDPNQAQDLLKLLRYQEALIQGIKTEGKK ATNTGKVSEVYQKPDESPMLKPSGDYQPVQDLRAVNQVAAILHAIVPNPYTVLGQIPASA AWFTCLDIKDAFFCILLAPSTKTCSLSSRPMDESARSRSSNSTDLAEEPRRSRKRQLPCY DHTRGWSVNARL >gi568815590r:38044276_38250818|GENSCAN_predicted_CDS_3|939_bp atgtgttattatagggaacaggagaccaagggaaaatactggacactggaaggcctggct gagcttggaggatgcacacctgctggggcatggatttctgtcttcacactttgctccatc tgcacaaagctacgtaagagtcacagcaaagatgaacggaaacatgcatctgaggttttg gttttgccaggcagacattttcaactggaagagcatactctctcctatatggaaaaaccc caggctcttatcgacctaatgcagtccatcttcttaactcacaacccaacctgggctgac tgcaaacagctccttctgtcactgttcaatacagaagaacaccgcagagtaatacaagcg gctcatcagaggctagaaaaaaatgccccagtaggtacaggagatgtcagacagtgtgct cggcaggctttgccaatagaaactgacccaggctgggacccaaatcaggcccaagatctg ctgaagttgctgagataccaagaggctctaatacaaggaataaagactgaagggaagaag gcaacaaacactggaaaggtttcagaagtctatcagaaaccagatgaaagccccatgtta aaaccttctggtgactaccagcctgtacaagatttaagggcagtcaaccaggtagctgct atactgcatgctattgtgcctaacccgtacactgtgcttggacaaatacctgctagtgct gcttggttcacatgcttggacattaaagatgcattcttctgcatcctgttagccccttca actaaaacctgcagcctcagttcaagaccaatggacgagtcagcaagatccagatcatcc aactcgactgatcttgcagaggaaccaaggcgcagcaggaaaagacaactgccctgctac gaccacaccagaggctggtcggtcaacgcacggctgtag >gi568815590r:38044276_38250818|GENSCAN_predicted_peptide_4|576_aa MAAAGAGPGQEAGAGPGPGAVANATGAEEGEMKPVAAGAAAPPGEGISAAPTVEPSSGEA EGGEANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGIC TKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQS RTQDEHPKTMFSKDKSKERDVFLVKEHPDPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQK QSSAVSTSGNLNGGIAAGSSGKGRGAKRKQQDGGTTGTTKKARSDPLFSAQRLPPHGYPL EHPFNKDGYRYILAEPDPHAPDPEKLELDCWAGKPIPGDLYRACLYERVLLALHDRAPQL KISDDRLTVVGEKGYSMVRASHGVRKGAWYFEITVDEMPPDTAARLGWSQPLGNLQAPLG YDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGDVLGFYINLPEDTETAKSLPDTYKDKALI KFKSYLYFEEKDFVDKAEKSLKQTPHSEIIFYKNGVNQGVAYKDIFEGVYFPAISLYKSC TMSDMGWGAVVEHTLADVLYHVETEVDGRRSPPWEP >gi568815590r:38044276_38250818|GENSCAN_predicted_CDS_4|1731_bp atggcggcggcaggagcaggacctggccaggaagcgggtgccgggcctggcccaggagcg gtcgcaaatgcaacaggggcagaagagggggagatgaagccggtggcagcgggagcagcc gctcctcctggagaggggatctctgctgctccgacagttgagcccagttccggggaggct gaaggcggggaggcaaacttggtcgatgtaagcggtggcttggagacagaatcatctaat ggaaaagatacactagaaggtgctggggatacatcagaggtgatggatactcaggcgggc tccgtggatgaagagaatggccgacagttgggtgaggtagagctgcaatgtgggatttgt acaaaatggttcacggctgacacatttggcatagatacctcatcctgtctacctttcatg accaactacagttttcattgcaacgtctgccatcacagtgggaatacctatttcctccgg aagcaagcaaacttgaaggaaatgtgccttagtgctttggccaacctgacatggcagtcc cgaacacaggatgaacatccgaagacaatgttctccaaagataagagtaaagaaagagat gtattcttggtaaaggaacacccagatccaggcagtaaagatccagaagaagattacccc aaatttggacttttggatcaggaccttagtaacattggtcctgcttatgacaaccaaaaa cagagcagtgctgtgtctactagtgggaatttaaatgggggaattgcagcaggaagcagc ggaaaaggacgaggagccaagcgcaaacagcaggatggagggaccacagggaccaccaag aaggcccggagtgaccctttgttttctgctcagcgccttccccctcatggctacccattg gaacacccgtttaacaaagatggctatcggtatattctagctgagcctgatccgcacgcc cctgaccccgagaagctggaacttgactgctgggcaggaaaacctattcctggagacctc tacagagcctgcttgtatgaacgggttttgttagccctacatgatcgagctccccagtta aagatctcagatgaccggctgactgtggttggagagaagggctactctatggtgagggcc tctcatggagtacggaaaggtgcctggtattttgaaatcactgtggatgagatgccacca gataccgctgccagactgggttggtcccagcccctaggaaaccttcaagctcctttaggt tatgataaatttagctattcttggcggagcaaaaagggaaccaagttccaccagtccatt ggcaaacactactcttctggctatggacagggagacgtcctgggattttatattaatctt cctgaagacacagagacagccaagtcattgccagacacatacaaagataaggctttgata aaattcaagagttatttgtattttgaggaaaaagactttgtggataaagcagagaagagc ctgaagcagactccccatagtgagataatattttataaaaatggtgtcaatcaaggtgtg gcttacaaagatatttttgagggggtttacttcccagccatctcactgtacaagagctgc acgatgagtgacatgggctggggcgccgtggtagagcacaccctggctgacgtcttgtat cacgtggagacagaagtggatgggaggcgcagtcccccatgggaaccctga >gi568815590r:38044276_38250818|GENSCAN_predicted_peptide_5|285_aa MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPTPSTWINQVRRRSSLLG SRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQDNGDKVMSKVVPDVGKVF RLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFITHELAAEAAGNLVG PRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMVLHPLAGSPSKTKLT WLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC >gi568815590r:38044276_38250818|GENSCAN_predicted_CDS_5|858_bp atgctgctagcgacattcaagctgtgcgctgggagctcctacagacacatgcgcaacatg aaggggctgaggcaacaggctgtgatggccatcagccaggagctgaaccggagggccctg gggggccccacccctagcacgtggattaaccaggttcggcggcggagctctctactcggt tctcggctggaagagactctctacagtgaccaggagctggcctatctccagcagggggag gaggccatgcagaaggccttgggcatccttagcaaccaagagggctggaagaaggagagt cagcaggacaatggggacaaagtgatgagtaaagtggtcccagatgtgggcaaggtgttc cggctggaggtcgtggtggaccagcccatggagaggctctatgaagagctcgtggagcgc atggaagcaatgggggagtggaaccccaatgtcaaggagatcaaggtcctgcagaagatc ggaaaagatacattcattactcacgagctggctgccgaggcagcaggaaacctggtgggg ccccgtgactttgtgagcgtgcgctgtgccaagcgccgaggctccacctgtgtgctggct ggcatggccacagacttcgggaacatgcctgagcagaagggtgtcatcagggcggagcac ggtcccacttgcatggtgcttcacccgttggctggaagtccctctaagaccaaacttacg tggctactcagcatcgacctcaaggggtggctgcccaagagcatcatcaaccaggtcctg tcccagacccaggtggattttgccaaccacctgcgcaagcgcctggagtcccaccctgcc tctgaagccaggtgttga >gi568815590r:38044276_38250818|GENSCAN_predicted_peptide_6|174_aa MGSAALKRFPPRSPRPTHRAGARGRLRMESSEGGRTTPANPRPGPAPSAAEPRKLRDGAL FQFKMNYMPGTASLIEDIDTNLVLHQTVERIHVGKKYGDIPRGIFVVRGENVVLLGEIDL EKESDTPLQQVSIEEILEEQRVEQQTKLEAEKLKVQALKDRGLSIPRADTLDEY >gi568815590r:38044276_38250818|GENSCAN_predicted_CDS_6|525_bp atgggatccgctgccctgaagcgcttcccgccccgctccccacggccgacccatcgcgca ggcgcccgcgggcgattgcgaatggagagcagcgaagggggtcggacgacgccagccaat cccaggcccgggcccgccccttctgctgccgaaccccgaaagctgagggacggagcatta tttcagttcaaaatgaactatatgcctggcaccgccagcctcatcgaggacattgacaca aacttagtgctacatcagactgtggagcgtattcatgtgggcaaaaaatacggtgatatt cctcgagggatttttgtggtcagaggagaaaatgtggtcctactaggagaaatagacttg gaaaaggagagtgacacacccctccagcaagtatccattgaagaaattctagaagaacaa agggtggaacagcagaccaagctggaagcagagaagttgaaagtgcaggccctgaaggac cgaggtctttccattcctcgagcagatactcttgatgagtactaa >gi568815590r:38044276_38250818|GENSCAN_predicted_peptide_7|299_aa MPGQTSYSTEVPSTYRSSGNSPTPVSRWIYPQQDCQTEAPPLRGQVPGYPPSQNPGMTLP HYPYGDGNRSVPQSGPTVRPQEDAWASPGAYGMGGRYPWPSSAPSAPPGNLYMTESTSPW PSSGSPQSPPSPPVQQPKDSSYPYSQSDQSMNRHNFPCSVHQYESSGTVNNDDSDLLDSQ VQYSAEPQLYGNATSDHPNNQDQSSSLPEECVPSDESTPPSIKKIIHVLEKVQYLEQEVE EFVGKKTDKAYWLLEEMLTKELLELDSVETGGQDSVRQARKEAVCKIQAILEKLEKKGL >gi568815590r:38044276_38250818|GENSCAN_predicted_CDS_7|900_bp atgcctggccagaccagttactccacagaagttccaagtacttaccgttcatctggcaac agcccaactccagtctctcgttggatctatccccagcaggactgtcagactgaagcaccc cctcttagggggcaggttccaggatatccgccttcacagaaccctggaatgaccctgccc cattatccttatggagatggtaatcgtagtgttccacaatcaggaccgactgtacgacca caagaagatgcgtgggcttctcctggtgcttatggaatgggtggccgttatccctggcct tcatcagcgccctcagcaccacccggcaatctctacatgactgaaagtacttcaccatgg cctagcagtggctctccccagtcacccccttcacccccagtccagcagcccaaggattct tcatacccctatagccaatcagatcaaagcatgaaccggcacaactttccttgcagtgtc catcagtacgaatcctcggggacagtgaacaatgatgattcagatcttttggattcccaa gtccagtatagtgctgagcctcagctgtatggtaatgccaccagtgaccatcccaacaat caagatcaaagtagcagtcttcctgaagaatgtgtaccttcagatgaaagtactcctccg agtattaaaaaaatcatacatgtgctggagaaggtccagtatcttgaacaagaagtagaa gaatttgtaggaaaaaagacagacaaagcatactggcttctggaagaaatgctaaccaag gaacttttggaactggattcagttgaaactgggggccaggactctgtacggcaggccaga aaagaggctgtttgtaagattcaggccatactggaaaaattagaaaaaaaaggattatga >gi568815590r:38044276_38250818|GENSCAN_predicted_peptide_8|326_aa MDAGSLYEPVSPHWFYCKIIDSKETWIPFNSEDSQQLEEAYSSGKGCNGRVVPTDGGRYD VHLGERMRYAVYWDELASEVRRCTWFYKGDKDNKYVPYSESFSQVLELMVHYQPVAGSDD WGSTPTEQGRPRTVKRGVENISVDIHCVNDFRSVSLNLLQTHFKKAQENQQIGRVEFLPV NWHSPLHSTGVDVDLQRITLPSINRLRHFTNDTILDVFFYNSPTYCQTIVDTVASEMNRI YTLFLQRNPDFKGGVSIAGHSLGSLILFDILTNQKDSLGDIDSEKDSLNIVMDQGDTPTL EEDLKKLQLSEFFDIFEKEKVDKEAL >gi568815590r:38044276_38250818|GENSCAN_predicted_CDS_8|978_bp atggatgctggcagcttgtatgaaccagtttctccccattggttttattgtaagataata gattctaaggagacatggattcctttcaactctgaggattcacagcagctggaagaggca tatagctctggaaaaggttgtaatgggagagttgttcctactgatgggggcagatatgat gttcatttgggggagaggatgcggtatgctgtatactgggatgaactggcatcggaagtg agacgatgtacgtggttttacaagggggacaaagacaataagtatgttccctactcggag agcttcagccaagttttagagcttatggtgcattaccagccagttgcagggtctgatgat tggggttcaacacccacggagcagggtcgaccaagaactgtgaagagaggagttgagaac atctctgttgacattcattgtgttaatgattttcgcagtgtttccttgaacttgctacag acacattttaagaaagcccaagaaaatcagcagattgggagggtagaatttcttccagtc aactggcacagtcctttgcattctactggtgtggatgtagatctgcagcgaataaccctg cccagcattaaccgcctcaggcacttcaccaatgacacaattctggatgtcttcttctac aatagtcccacctactgtcagactattgtggacacagttgcttctgaaatgaaccgaata tacacactttttctacagaggaaccctgatttcaaagggggtgtatccattgctggtcat agtttaggttcgcttatattgtttgatatcctaacaaatcagaaagattctttgggggat attgacagtgaaaaggattcgctaaatattgtaatggatcaaggagatacacctacacta gaggaagatttgaagaaacttcagctctctgaattctttgatatctttgagaaggagaaa gtagataaggaagctctg