GENSCAN 1.0 Date run: 3-Nov-116 Time: 16:13:25 Sequence gi568815590f:38005551_38239068 : 233518 bp : 44.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 25024 25168 145 0 1 79 80 211 0.648 19.54 1.02 Intr + 51531 51710 180 1 0 147 101 228 0.999 29.84 1.03 Intr + 57581 57594 14 0 2 74 99 0 0.002 -5.80 1.04 Intr + 67009 67113 105 0 0 116 45 107 0.392 9.61 1.05 Intr + 67498 67546 49 0 1 82 101 56 0.352 4.55 1.06 Term + 73407 73525 119 1 2 -24 45 123 0.050 -4.50 1.07 PlyA + 73684 73689 6 1.05 2.06 PlyA - 73723 73718 6 1.05 2.05 Term - 79451 79290 162 2 0 26 33 137 0.067 -0.06 2.04 Intr - 82506 82325 182 1 2 41 69 124 0.201 5.39 2.03 Intr - 84185 83785 401 1 2 83 81 170 0.222 9.95 2.02 Intr - 92710 92520 191 1 2 29 110 47 0.001 -0.62 2.01 Init - 95561 95559 3 2 0 97 81 0 0.003 0.20 2.00 Prom - 96860 96821 40 -6.16 3.00 Prom + 97659 97698 40 -4.96 3.01 Init + 100001 100188 188 1 2 104 94 111 0.875 9.84 3.02 Intr + 100828 100894 67 1 1 70 94 57 0.931 3.41 3.03 Intr + 101471 101616 146 1 2 57 80 132 0.876 8.38 3.04 Intr + 104829 104917 89 0 2 70 71 47 0.885 0.81 3.05 Intr + 105189 105283 95 1 2 93 75 72 0.929 6.08 3.06 Intr + 109355 109450 96 1 0 61 88 88 0.987 6.31 3.07 Intr + 111100 111175 76 0 1 80 115 43 0.999 5.29 3.08 Intr + 113720 113813 94 0 1 67 59 140 0.985 8.02 3.09 Intr + 115382 115599 218 2 2 106 87 115 0.997 11.35 3.10 Intr + 122741 122908 168 0 0 87 87 152 0.979 14.92 3.11 Intr + 123208 123401 194 2 2 74 113 144 0.998 14.71 3.12 Intr + 127904 127996 93 1 0 73 105 48 0.960 5.16 3.13 Intr + 130118 130216 99 1 0 71 100 26 0.822 2.41 3.14 Term + 133414 133521 108 0 0 107 47 61 0.964 2.41 3.15 PlyA + 134131 134136 6 1.05 4.08 PlyA - 136218 136213 6 1.05 4.07 Term - 138836 138723 114 2 0 64 49 162 0.951 8.47 4.06 Intr - 139765 139672 94 0 1 90 67 27 0.460 0.77 4.05 Intr - 140597 140413 185 2 2 50 90 236 0.995 18.59 4.04 Intr - 140897 140739 159 2 0 104 105 313 0.849 34.78 4.03 Intr - 142777 142650 128 2 2 58 111 143 0.998 14.00 4.02 Intr - 143204 143091 114 0 0 72 123 80 0.346 10.32 4.01 Init - 145268 145205 64 2 1 68 84 55 0.272 4.41 4.00 Prom - 157278 157239 40 -6.16 5.05 PlyA - 157809 157804 6 1.05 5.04 Term - 158290 158120 171 1 0 87 33 226 0.998 14.83 5.03 Intr - 164367 164252 116 1 2 47 92 92 0.930 5.67 5.02 Intr - 170787 170725 63 1 0 69 79 93 0.963 5.19 5.01 Init - 171321 171147 175 0 1 99 78 90 0.975 8.62 5.00 Prom - 172985 172946 40 -6.06 6.00 Prom + 179845 179884 40 -4.36 6.01 Init + 189154 189165 12 0 0 110 89 4 0.685 2.65 6.02 Intr + 202070 202216 147 1 0 50 109 55 0.384 4.23 6.03 Intr + 203463 203717 255 2 0 85 97 146 0.997 12.84 6.04 Term + 204458 204943 486 1 0 72 48 342 0.890 23.30 6.05 PlyA + 205950 205955 6 1.05 7.04 PlyA - 206139 206134 6 1.05 7.03 Term - 217844 217813 32 0 2 54 42 38 0.168 -6.18 7.02 Intr - 220673 220479 195 0 0 40 116 82 0.486 5.59 7.01 Init - 226255 226111 145 1 1 52 100 134 0.748 9.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 88205 88340 136 1 1 59 38 121 0.811 4.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:38005551_38239068|GENSCAN_predicted_peptide_1|203_aa MSGGSSCSQTPSRAIPATRRVVLGDGVQLPPGDYSTTPGGTLFSTTPGGTRIIYDRKFLM ECRNSPVTKTPPRDLPTIPGVTSPSSDEPPMEASQSHLRNSPEDKRAGGSKRQIFGFLVL HSFIHTLSASTMIHRLQVPAEKPTYHILHSAGTLTRDIRYGLTHEMKAQKLQQRLNDCSI EMNYRIIIFIMTLTNTESFVLLN >gi568815590f:38005551_38239068|GENSCAN_predicted_CDS_1|612_bp atgtccgggggcagcagctgcagccagaccccaagccgggccatccccgccactcgccgg gtggtgctcggcgacggcgtgcagctcccgcccggggactacagcacgacccccggcggc acgctcttcagcaccaccccgggaggtaccaggatcatctatgaccggaaattcctgatg gagtgtcggaactcacctgtgaccaaaacacccccaagggatctgcccaccattccgggg gtcaccagcccttccagtgatgagccccccatggaagccagccagagccacctgcgcaat agcccagaagataagcgggcgggcgggtctaagagacagatatttgggtttcttgtacta cacagcttcattcatactctaagtgccagcaccatgattcatcgcctgcaggttccagca gaaaaacccacctaccacatcctgcattctgcagggactctcacaagagacattcgctat ggcctaacccatgaaatgaaggcccagaagcttcagcagagactgaatgactgctccatt gagatgaactaccgcataattatcttcatcatgactttaacaaacacagaaagctttgtt ttgcttaattga >gi568815590f:38005551_38239068|GENSCAN_predicted_peptide_2|312_aa MCYYREQETKGKYWTLEGLAELGGCTPAGAWISVFTLCSICTKLRKSHSKDERKHASEVL VLPGRHFQLEEHTLSYMEKPQALIDLMQSIFLTHNPTWADCKQLLLSLFNTEEHRRVIQA AHQRLEKNAPVGTGDVRQCARQALPIETDPGWDPNQAQDLLKLLRYQEALIQGIKTEGKK ATNTGKVSEVYQKPDESPMLKPSGDYQPVQDLRAVNQVAAILHAIVPNPYTVLGQIPASA AWFTCLDIKDAFFCILLAPSTKTCSLSSRPMDESARSRSSNSTDLAEEPRRSRKRQLPCY DHTRGWSVNARL >gi568815590f:38005551_38239068|GENSCAN_predicted_CDS_2|939_bp atgtgttattatagggaacaggagaccaagggaaaatactggacactggaaggcctggct gagcttggaggatgcacacctgctggggcatggatttctgtcttcacactttgctccatc tgcacaaagctacgtaagagtcacagcaaagatgaacggaaacatgcatctgaggttttg gttttgccaggcagacattttcaactggaagagcatactctctcctatatggaaaaaccc caggctcttatcgacctaatgcagtccatcttcttaactcacaacccaacctgggctgac tgcaaacagctccttctgtcactgttcaatacagaagaacaccgcagagtaatacaagcg gctcatcagaggctagaaaaaaatgccccagtaggtacaggagatgtcagacagtgtgct cggcaggctttgccaatagaaactgacccaggctgggacccaaatcaggcccaagatctg ctgaagttgctgagataccaagaggctctaatacaaggaataaagactgaagggaagaag gcaacaaacactggaaaggtttcagaagtctatcagaaaccagatgaaagccccatgtta aaaccttctggtgactaccagcctgtacaagatttaagggcagtcaaccaggtagctgct atactgcatgctattgtgcctaacccgtacactgtgcttggacaaatacctgctagtgct gcttggttcacatgcttggacattaaagatgcattcttctgcatcctgttagccccttca actaaaacctgcagcctcagttcaagaccaatggacgagtcagcaagatccagatcatcc aactcgactgatcttgcagaggaaccaaggcgcagcaggaaaagacaactgccctgctac gaccacaccagaggctggtcggtcaacgcacggctgtag >gi568815590f:38005551_38239068|GENSCAN_predicted_peptide_3|576_aa MAAAGAGPGQEAGAGPGPGAVANATGAEEGEMKPVAAGAAAPPGEGISAAPTVEPSSGEA EGGEANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGIC TKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQS RTQDEHPKTMFSKDKSKERDVFLVKEHPDPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQK QSSAVSTSGNLNGGIAAGSSGKGRGAKRKQQDGGTTGTTKKARSDPLFSAQRLPPHGYPL EHPFNKDGYRYILAEPDPHAPDPEKLELDCWAGKPIPGDLYRACLYERVLLALHDRAPQL KISDDRLTVVGEKGYSMVRASHGVRKGAWYFEITVDEMPPDTAARLGWSQPLGNLQAPLG YDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGDVLGFYINLPEDTETAKSLPDTYKDKALI KFKSYLYFEEKDFVDKAEKSLKQTPHSEIIFYKNGVNQGVAYKDIFEGVYFPAISLYKSC TMSDMGWGAVVEHTLADVLYHVETEVDGRRSPPWEP >gi568815590f:38005551_38239068|GENSCAN_predicted_CDS_3|1731_bp atggcggcggcaggagcaggacctggccaggaagcgggtgccgggcctggcccaggagcg gtcgcaaatgcaacaggggcagaagagggggagatgaagccggtggcagcgggagcagcc gctcctcctggagaggggatctctgctgctccgacagttgagcccagttccggggaggct gaaggcggggaggcaaacttggtcgatgtaagcggtggcttggagacagaatcatctaat ggaaaagatacactagaaggtgctggggatacatcagaggtgatggatactcaggcgggc tccgtggatgaagagaatggccgacagttgggtgaggtagagctgcaatgtgggatttgt acaaaatggttcacggctgacacatttggcatagatacctcatcctgtctacctttcatg accaactacagttttcattgcaacgtctgccatcacagtgggaatacctatttcctccgg aagcaagcaaacttgaaggaaatgtgccttagtgctttggccaacctgacatggcagtcc cgaacacaggatgaacatccgaagacaatgttctccaaagataagagtaaagaaagagat gtattcttggtaaaggaacacccagatccaggcagtaaagatccagaagaagattacccc aaatttggacttttggatcaggaccttagtaacattggtcctgcttatgacaaccaaaaa cagagcagtgctgtgtctactagtgggaatttaaatgggggaattgcagcaggaagcagc ggaaaaggacgaggagccaagcgcaaacagcaggatggagggaccacagggaccaccaag aaggcccggagtgaccctttgttttctgctcagcgccttccccctcatggctacccattg gaacacccgtttaacaaagatggctatcggtatattctagctgagcctgatccgcacgcc cctgaccccgagaagctggaacttgactgctgggcaggaaaacctattcctggagacctc tacagagcctgcttgtatgaacgggttttgttagccctacatgatcgagctccccagtta aagatctcagatgaccggctgactgtggttggagagaagggctactctatggtgagggcc tctcatggagtacggaaaggtgcctggtattttgaaatcactgtggatgagatgccacca gataccgctgccagactgggttggtcccagcccctaggaaaccttcaagctcctttaggt tatgataaatttagctattcttggcggagcaaaaagggaaccaagttccaccagtccatt ggcaaacactactcttctggctatggacagggagacgtcctgggattttatattaatctt cctgaagacacagagacagccaagtcattgccagacacatacaaagataaggctttgata aaattcaagagttatttgtattttgaggaaaaagactttgtggataaagcagagaagagc ctgaagcagactccccatagtgagataatattttataaaaatggtgtcaatcaaggtgtg gcttacaaagatatttttgagggggtttacttcccagccatctcactgtacaagagctgc acgatgagtgacatgggctggggcgccgtggtagagcacaccctggctgacgtcttgtat cacgtggagacagaagtggatgggaggcgcagtcccccatgggaaccctga >gi568815590f:38005551_38239068|GENSCAN_predicted_peptide_4|285_aa MLLATFKLCAGSSYRHMRNMKGLRQQAVMAISQELNRRALGGPTPSTWINQVRRRSSLLG SRLEETLYSDQELAYLQQGEEAMQKALGILSNQEGWKKESQQDNGDKVMSKVVPDVGKVF RLEVVVDQPMERLYEELVERMEAMGEWNPNVKEIKVLQKIGKDTFITHELAAEAAGNLVG PRDFVSVRCAKRRGSTCVLAGMATDFGNMPEQKGVIRAEHGPTCMVLHPLAGSPSKTKLT WLLSIDLKGWLPKSIINQVLSQTQVDFANHLRKRLESHPASEARC >gi568815590f:38005551_38239068|GENSCAN_predicted_CDS_4|858_bp atgctgctagcgacattcaagctgtgcgctgggagctcctacagacacatgcgcaacatg aaggggctgaggcaacaggctgtgatggccatcagccaggagctgaaccggagggccctg gggggccccacccctagcacgtggattaaccaggttcggcggcggagctctctactcggt tctcggctggaagagactctctacagtgaccaggagctggcctatctccagcagggggag gaggccatgcagaaggccttgggcatccttagcaaccaagagggctggaagaaggagagt cagcaggacaatggggacaaagtgatgagtaaagtggtcccagatgtgggcaaggtgttc cggctggaggtcgtggtggaccagcccatggagaggctctatgaagagctcgtggagcgc atggaagcaatgggggagtggaaccccaatgtcaaggagatcaaggtcctgcagaagatc ggaaaagatacattcattactcacgagctggctgccgaggcagcaggaaacctggtgggg ccccgtgactttgtgagcgtgcgctgtgccaagcgccgaggctccacctgtgtgctggct ggcatggccacagacttcgggaacatgcctgagcagaagggtgtcatcagggcggagcac ggtcccacttgcatggtgcttcacccgttggctggaagtccctctaagaccaaacttacg tggctactcagcatcgacctcaaggggtggctgcccaagagcatcatcaaccaggtcctg tcccagacccaggtggattttgccaaccacctgcgcaagcgcctggagtcccaccctgcc tctgaagccaggtgttga >gi568815590f:38005551_38239068|GENSCAN_predicted_peptide_5|174_aa MGSAALKRFPPRSPRPTHRAGARGRLRMESSEGGRTTPANPRPGPAPSAAEPRKLRDGAL FQFKMNYMPGTASLIEDIDTNLVLHQTVERIHVGKKYGDIPRGIFVVRGENVVLLGEIDL EKESDTPLQQVSIEEILEEQRVEQQTKLEAEKLKVQALKDRGLSIPRADTLDEY >gi568815590f:38005551_38239068|GENSCAN_predicted_CDS_5|525_bp atgggatccgctgccctgaagcgcttcccgccccgctccccacggccgacccatcgcgca ggcgcccgcgggcgattgcgaatggagagcagcgaagggggtcggacgacgccagccaat cccaggcccgggcccgccccttctgctgccgaaccccgaaagctgagggacggagcatta tttcagttcaaaatgaactatatgcctggcaccgccagcctcatcgaggacattgacaca aacttagtgctacatcagactgtggagcgtattcatgtgggcaaaaaatacggtgatatt cctcgagggatttttgtggtcagaggagaaaatgtggtcctactaggagaaatagacttg gaaaaggagagtgacacacccctccagcaagtatccattgaagaaattctagaagaacaa agggtggaacagcagaccaagctggaagcagagaagttgaaagtgcaggccctgaaggac cgaggtctttccattcctcgagcagatactcttgatgagtactaa >gi568815590f:38005551_38239068|GENSCAN_predicted_peptide_6|299_aa MPGQTSYSTEVPSTYRSSGNSPTPVSRWIYPQQDCQTEAPPLRGQVPGYPPSQNPGMTLP HYPYGDGNRSVPQSGPTVRPQEDAWASPGAYGMGGRYPWPSSAPSAPPGNLYMTESTSPW PSSGSPQSPPSPPVQQPKDSSYPYSQSDQSMNRHNFPCSVHQYESSGTVNNDDSDLLDSQ VQYSAEPQLYGNATSDHPNNQDQSSSLPEECVPSDESTPPSIKKIIHVLEKVQYLEQEVE EFVGKKTDKAYWLLEEMLTKELLELDSVETGGQDSVRQARKEAVCKIQAILEKLEKKGL >gi568815590f:38005551_38239068|GENSCAN_predicted_CDS_6|900_bp atgcctggccagaccagttactccacagaagttccaagtacttaccgttcatctggcaac agcccaactccagtctctcgttggatctatccccagcaggactgtcagactgaagcaccc cctcttagggggcaggttccaggatatccgccttcacagaaccctggaatgaccctgccc cattatccttatggagatggtaatcgtagtgttccacaatcaggaccgactgtacgacca caagaagatgcgtgggcttctcctggtgcttatggaatgggtggccgttatccctggcct tcatcagcgccctcagcaccacccggcaatctctacatgactgaaagtacttcaccatgg cctagcagtggctctccccagtcacccccttcacccccagtccagcagcccaaggattct tcatacccctatagccaatcagatcaaagcatgaaccggcacaactttccttgcagtgtc catcagtacgaatcctcggggacagtgaacaatgatgattcagatcttttggattcccaa gtccagtatagtgctgagcctcagctgtatggtaatgccaccagtgaccatcccaacaat caagatcaaagtagcagtcttcctgaagaatgtgtaccttcagatgaaagtactcctccg agtattaaaaaaatcatacatgtgctggagaaggtccagtatcttgaacaagaagtagaa gaatttgtaggaaaaaagacagacaaagcatactggcttctggaagaaatgctaaccaag gaacttttggaactggattcagttgaaactgggggccaggactctgtacggcaggccaga aaagaggctgtttgtaagattcaggccatactggaaaaattagaaaaaaaaggattatga >gi568815590f:38005551_38239068|GENSCAN_predicted_peptide_7|123_aa MARPRPLPQPGRGAAEQGGTCSPAGLAPNTRRHGAPSGRVAGPDARRKGLSSTGRNAATR NHNDSTELEIQTDTQPLWAPHASESKGQEGNYYVAGVTDPDYQGEIGLLLPSEATTEKFL PST >gi568815590f:38005551_38239068|GENSCAN_predicted_CDS_7|372_bp atggcccggccgcggcctttaccccagcccgggcggggcgcggcggagcagggcggaacc tgcagccccgccgggctcgcgccgaacaccaggcgccacggcgctcccagtgggcgagta gcgggcccggatgccaggcgaaaaggtcttagttccacagggaggaatgctgccaccagg aaccacaacgattccactgaactggaaattcagactgacacccagccactttgggctcct catgcatctgagtcaaaaggccaagaagggaattactatgttgctggggtgactgatcca gactatcaaggggaaattggcctactactccctagtgaagccacaactgaaaagttcctg ccctcaacgtga