GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:04:17 Sequence gi568815590r:106661040_106870190 : 209151 bp : 37.46% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12015 12159 145 2 1 60 54 107 0.698 4.83 1.02 Intr + 18171 18253 83 1 2 65 93 70 0.867 3.64 1.03 Intr + 22160 22267 108 1 0 93 67 47 0.840 2.66 1.04 Intr + 23207 23320 114 1 0 107 93 66 0.765 8.72 1.05 Intr + 31689 31838 150 2 0 -46 98 148 0.020 1.84 1.06 Intr + 36476 36763 288 1 0 100 44 216 0.000 14.92 1.07 Intr + 41867 42051 185 1 2 107 66 173 0.250 14.66 1.08 Intr + 45343 46106 764 1 2 49 106 678 0.205 55.88 1.09 Intr + 49583 49751 169 0 1 85 93 56 0.598 3.88 1.10 Intr + 52784 52946 163 2 1 78 94 174 0.995 16.06 1.11 Intr + 59639 59704 66 1 0 72 87 40 0.546 0.48 1.12 Intr + 76481 76561 81 1 0 78 49 103 0.948 4.32 1.13 Intr + 78419 78544 126 1 0 74 86 138 0.999 12.16 1.14 Intr + 79304 79456 153 1 0 17 80 166 0.965 8.05 1.15 Intr + 81183 81278 96 2 0 84 94 44 0.903 3.89 1.16 Intr + 84750 84823 74 2 2 50 110 45 0.491 0.19 1.17 Term + 89767 89902 136 1 1 48 36 93 0.228 -3.39 1.18 PlyA + 91499 91504 6 1.05 2.05 PlyA - 91671 91666 6 1.05 2.04 Term - 100475 99998 478 1 1 92 46 418 0.994 31.33 2.03 Intr - 102121 102084 38 1 2 57 77 52 0.550 -2.86 2.02 Intr - 104431 104284 148 0 1 28 20 164 0.313 2.92 2.01 Init - 109151 108484 668 2 2 102 100 569 0.974 54.23 2.00 Prom - 115370 115331 40 -3.05 3.00 Prom + 127504 127543 40 -3.65 3.01 Init + 128034 128132 99 2 0 49 16 44 0.185 -6.30 3.02 Intr + 128181 128318 138 2 0 95 -2 98 0.558 1.24 3.03 Intr + 128979 129154 176 2 2 -27 44 232 0.469 5.12 3.04 Intr + 129647 130149 503 2 2 102 4 311 0.338 15.80 3.05 Term + 132261 132523 263 1 2 107 43 131 0.416 5.20 3.06 PlyA + 132850 132855 6 1.05 4.00 Prom + 157944 157983 40 -2.85 4.01 Init + 160802 160913 112 1 1 16 113 96 0.563 5.32 4.02 Term + 161427 161524 98 1 2 90 31 54 0.683 -2.95 4.03 PlyA + 162220 162225 6 1.05 5.05 PlyA - 162269 162264 6 1.05 5.04 Term - 167820 167468 353 1 2 43 42 205 0.136 5.06 5.03 Intr - 168641 168475 167 1 2 22 62 171 0.065 6.58 5.02 Intr - 177113 176899 215 2 2 -15 115 103 0.062 -0.71 5.01 Init - 183459 183253 207 0 0 83 100 114 0.729 10.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 37014 36332 683 1 2 56 37 585 0.878 43.03 S.002 Term - 175040 174950 91 1 1 82 55 94 0.880 1.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:106661040_106870190|GENSCAN_predicted_peptide_1|966_aa MWKRLWNWVTGRGWNSLEGSEEDRKMWESLELPRNLSNGFDQNADSDNDTGQKKTLDKKD GRRMSFQKPKGTIEYTVESRDSLNSIALKFDTTPNELVQLNKLFSRAVVTGQVLYVPDPE YVSSVESSPSLSPVSPLSPTSSEAEFDKTTNPDVHPTEATPSSTFTGIRPARVVSSTSEE EEAFTEKFLKINCKYITSGKLVIREVPRILLGFLREPVGIPREEPIIPGYCQPTQIPQRP LHTGHVLSFPEVHSLEDVSGVNAIALSCLQELRDLLHLLEGHGGGLDLLYWDLPLGGTVS GVLLVTPNNIMFDPHKNDPLVQENGCEEYGIMCPMEEVMSAAMYKEILDSKIKESLPIDI DQLSGRDFCHSKKMTGSNTEEIDSRIRDAGNDSASTAPRSTEESLSEDVFTESELSPIRE ELVSSDELRQDKSSGASSESVQTVNQAEVESLTVKSESTGTPGHLRSDTEHSTNEVGTLC HKTDLNNLEMAIKEDQIADNFQGISGPKEDSTSIKGNSDQDSFLHENSLHQEESQKENMP CGETAEFKQKQSVNKGKQGKEQNQDSQTEAEELRKLWKTHTMQQTKQQRENIQQVSQKEA KHKITSADGHIESSALLKEKQRHRLHKFLCLRVGKPMRKTFVSQASATMQQYAQRDKKHE YWFAVPQERTDHLYAFFIQWSPEIYAEDTGEYTREPGFIVVKKIEESETIEDSSNQAAAR EWEDFHHVVYSEILQGEEDQKTYQAVVSVAEYHRRIDALNTEELRTLCRRLQITTREDIN SKQVATVKADLESESFRPNLSDPSELLLPDQIEKLTKHLPPRTIGYPWTLVYGTGKHGTS LKTLYRTMTGLDTPVLMVIKDSDGQVFGALASEPLKVSDGFYGTGETFVFTFCPEFEVFK WTGDNMFFIKGDMDSLAFGGGGGEFALWLDGDLYHGRSHSCKTFGNRTLSKKEDFFIQDI EIWAFE >gi568815590r:106661040_106870190|GENSCAN_predicted_CDS_1|2901_bp atgtggaagcgactttggaactgggtaacaggcagaggttggaacagtttggagggctca gaagaagacaggaagatgtgggaaagtttggaacttcctagaaacttgtcgaatggcttt gaccaaaatgctgatagtgataatgacactggccaaaagaagaccctagacaagaaagat ggaagacgaatgtcttttcagaaacctaaagggactattgagtatactgttgaatcaagg gattctttgaatagcatagccctgaagtttgatacaacacctaacgaacttgttcaatta aataagttattctcccgagcagttgttactggacaggttctgtatgttcctgatcctgaa tatgtctccagtgttgagagctctccatctctaagccccgtaagtcctctgtcaccaaca tcatctgaggctgaatttgataagaccactaatcctgatgtccatccaacagaagcaact ccctcatctactttcactggtattcgacctgcacgagttgtatcttcaacttctgaggag gaggaagcatttactgagaaatttcttaaaattaattgcaaatatattaccagtggcaag ctggttatccgagaagttccgaggattctccttggatttcttagggaaccagttgggatc cccagagaagagcccatcatcccaggctactgccaacccacccagattcctcagcgtccg ctgcacacaggccatgttctttccttcccagaggtccacagtttggaagatgtcagtggt gttaatgccatagcgctcagctgcttgcaggaactgagagatcttctccatctgcttgaa ggccatggtggaggcctggatcttctttactgggacctgccccttgggggcacagtcagt ggtgtgctgctagttacaccaaataatataatgtttgatccacataaaaatgaccctttg gttcaagagaatggctgtgaggaatatggcatcatgtgtccaatggaagaggtgatgtca gctgcaatgtacaaagaaattttggatagcaaaataaaggaatctttacccatagatata gatcagctatcaggaagggacttctgccattcaaagaaaatgacaggaagtaacactgag gaaatagactcaagaatccgagatgcaggtaatgatagtgccagcactgctcctaggagc actgaggagtctctttctgaagatgtgttcacagaatcagaactttcccctatacgagag gagcttgtatcttcagatgaactgcgacaagataaatcttctggtgcgtcatcagaatct gtgcaaactgtcaatcaggctgaagtagaaagtctgacagtcaaatcagaatctactggt actcctggtcacttaagatctgatactgaacattctacaaatgaagttgggactttatgt cataaaactgatttaaataatcttgaaatggccattaaggaagatcagattgcagataac tttcaaggaatatcaggtcctaaagaagacagcacaagtataaaaggtaattcagaccag gattcttttcttcatgagaattcgttacaccaagaagagagtcaaaaagaaaatatgcct tgtggggaaacagcagaatttaaacaaaagcaaagtgttaacaaaggaaaacaaggaaag gagcaaaatcaggactcacagacagaggcagaagagctacgcaaactttggaaaacccat actatgcaacaaactaaacagcaaagggaaaatattcaacaagtgtcacaaaaagaagct aagcataaaattacatctgctgatggacacatagaaagttctgcacttttaaaagaaaag caaaggcatcgattacataagttcttgtgtctcagagttggaaaaccaatgaggaaaacg tttgtatctcaagcaagtgctacaatgcaacagtatgcacagagagataagaaacatgaa tattggtttgctgtgccacaagaaaggacagatcacttgtatgccttcttcattcagtgg agtccagaaatatatgcagaagatactggcgaatataccagagaacctggatttatagta gtaaaaaagattgaggagtctgaaacaattgaggattctagtaatcaagcagcagccaga gaatgggaggacttccatcatgttgtgtactcagagatactacaaggagaggaagatcaa aaaacataccaggctgtagtgtcagtggctgagtatcaccgcaggatcgatgctctaaat actgaagaactgcgcacactctgcagacgcctccagattactacaagggaagacataaat tcaaagcaggttgctacagtgaaagcagacctggagtctgaatcttttcgaccaaaccta agtgatcccagtgaacttttactgccagatcaaattgaaaagcttaccaagcatcttcca ccaagaacaattggctatccatggactcttgtttatggtactggaaaacatggcacaagc ttgaaaactctttatcgaacaatgacaggtttagacaccccagtgctgatggtgattaaa gacagtgatggacaggtttttggtgcgttagcatctgagccactgaaagtgagtgatggc ttttatggtactggagagacctttgtttttacattctgtccggagtttgaggtctttaag tggacaggagataatatgttttttatcaaaggagacatggattcactagctttcggtggt ggaggaggagaatttgcgctttggcttgatggagatctctaccatggaagaagccattct tgtaaaacgtttgggaatcgtacactttctaagaaggaagatttctttatccaagatatt gaaatctgggcttttgaataa >gi568815590r:106661040_106870190|GENSCAN_predicted_peptide_2|443_aa MAPGEKESGEGPAKSALRKIRTATLVISLARGWQQWANENSIRQAQEPTGWLPGGTQDSP QAPKPITPPTSHQKAQSAPKSPPRLPEGHGDGQSSEKAPEVSHIKKKEVSKTVVSKTYER GGDVSHLSHRYERDAGVLEPGQPENDIDRILHSHGSPTRRRKCANLVSELTKGWRVMEQE EPTWRSDSVDTEDSGYGGEAEERPEQDGVQVAVVRIKRPLPSQGGCLHEEEKYSLYRKIY GVGSESPFSLSPYRSETEEKIMPLGVPCILKLPLGNQISGSEFGWVNRFTEKLNCKAQQK YSPVGNLKGRWQQWADEHIQSQKLNPFSEEFDYELAMSTRLHKGDEGYGRPKEGTKTAER AKRAEEHIYREMMDMCFIICTMARHRRDGKIQVTFGDLFDRYVRISDKVVGILMRARKHG LVDFEGEMLWQGRDDHVVITLLK >gi568815590r:106661040_106870190|GENSCAN_predicted_CDS_2|1332_bp atggctccgggcgaaaaggaaagcggggagggcccagccaagagcgccctccggaagata cgcacagccaccctggtcatcagcttggcccgaggttggcagcagtgggcgaatgagaac agcatcaggcaggcccaggagcctacaggctggctgccgggagggacccaggactcacct caagctcctaaaccaatcacaccccctacttcacaccagaaagctcagagtgccccaaag tcgccaccccgcctgccagaaggacatggagatggacaaagctcagagaaagcccctgag gtttctcacatcaaaaagaaagaggtgtccaaaacggtggtcagcaagacttatgagaga ggaggggacgtgagccacctcagccacaggtacgagagggatgctggtgtgcttgaacct gggcagccagagaatgacattgacagaatcctccacagccacggctccccaacgcggagg agaaaatgtgccaacctggtgtctgagctaaccaagggctggagagtgatggagcaggag gagcccacatggaggagtgacagcgtagacacagaggacagcggctatggaggagaggct gaggagaggcccgagcaggatggagtgcaggtggctgtggtcaggatcaagcgccccttg ccctcccaaggagggtgcctgcatgaggaggagaaatactccctctacaggaagatctat ggagttggcagtgaaagcccattttcactttctccatacagatcggagacagaagagaaa atcatgcctttaggcgtcccatgcattttgaagctgcctttgggtaaccagatttcggga agtgagtttggatgggtaaacagatttacagagaaactcaactgcaaagcccaacagaaa tatagcccagtgggcaacttgaaagggagatggcagcagtgggctgatgaacacatacaa tcccagaagctcaatcctttcagtgaagagtttgattacgagctggccatgtccacccgc ctacacaaaggagatgagggctatggccgccccaaagaaggaaccaaaactgctgaaagg gccaagcgtgctgaggagcacatctacagggaaatgatggacatgtgcttcattatctgc acaatggctcgccacagacgagatggcaagatccaggttacttttggagatctctttgac agatacgttcgtatttcagataaagtagtgggcattctcatgcgtgccaggaaacatgga ctggtagactttgaaggagagatgctatggcaaggccgagatgaccatgttgtgattacg ctactcaagtga >gi568815590r:106661040_106870190|GENSCAN_predicted_peptide_3|392_aa MVSFLKSARPRPTGRNQLRTYLGDPDGTIAKEWNSGAKYWAPVGQLKATSAATGLKTRVS GFLGKDSLTTPDSSELGALVFENASVRANKSDFPRSSLWSKRKIKVFAAVSVNATIPISR VQGPFRVLGQEEFLLLHQKETSKEISKGPQKPLGYWLCPLQAVGGGEFGPTWVHVPFSLS DLKQIKAYLGKFSDDPDRYVDVLQGLGHTFDLTWREVMLLLDQTLAFHEKNAALVAAREF GDTWYLSQVNDRMTAEERDKFPSGQQAIPSMDPHWDLDSDHGDWSRKHLLTCVLEGLGRI RKKPMSPVTAILLLLAFGPCIFNLLVKFVSSRIEAIQLQMVLQMEPQMSSTNNFYRGPLD QPAGPLTGLKSSPLEDTTTAGPLLHPYPAESS >gi568815590r:106661040_106870190|GENSCAN_predicted_CDS_3|1179_bp atggtttcattcttgaagtcagcgagaccaagacccactggaaggaaccaactccggaca tatcttggcgacccagatgggactattgccaaggaatggaattcgggggctaaatactgg gcacctgtcggccagttaaaagcaactagtgcagccactggactaaagacacgggtgtca ggctttctgggaaaggactctctaacaacccctgactcttcagagttgggagcattggtt ttcgagaatgcatctgtaagggccaataaatccgactttcctcggtcctctttgtggtct aagaggaaaattaaggtttttgctgctgtgtcagtgaatgcaactattccaatcagcagg gtccagggaccatttcgggttcttgggcaagaggagtttctgctgctgcatcagaaggaa acaagcaaagaaatctccaagggaccacaaaaacccctgggctattggttatgtcccctt caagctgtagggggaggggaatttggcccaacctgggtacatgtccccttctccctctct gatttaaagcagatcaaggcatacctggggaagttttcagatgatcctgataggtatgta gatgtcctacagggtctaggtcataccttcgacctcacttggagagaagtcatgctattg ttagatcaaaccctggcctttcatgaaaagaatgcggctttagttgcagcccgagagttt ggagatacctggtatcttagtcaagtaaatgatagaatgacagccgaagaaagggacaaa ttccctagcggtcagcaagccatccccagtatggatccccactgggatctagactcagat catggagactggagtcgtaaacatctgctcacctgtgttctagaaggactagggagaatt aggaaaaagcccatgagtcctgtgacagccatcttgctattactcgccttcgggccctgt atttttaacctccttgtcaaatttgtttcctctaggattgaggccatccagctacagatg gtcttacaaatggaaccccaaatgagctcaactaacaacttctaccgaggacccctggac caaccagctggccctttgactggcctaaagagttcccctctggaggacactacaactgca gggccccttcttcacccctatccagcagaaagtagctag >gi568815590r:106661040_106870190|GENSCAN_predicted_peptide_4|69_aa MNRRQIEGKRSLIVSRVGSFWWVLGLADFKNEAADLHDRQKSSPSPHSTQEVQLASPLIT FNSLLQLDI >gi568815590r:106661040_106870190|GENSCAN_predicted_CDS_4|210_bp atgaatcgtagacaaattgagggcaaaaggagccttattgtgtccagagttggttccttc tggtgggttcttggtctcgctgacttcaagaatgaagctgcagaccttcacgatagacag aaaagttctccaagtccccactcgacccaggaagtccagctggcttcacctctcattacc tttaactccttgctccaattagatatttaa >gi568815590r:106661040_106870190|GENSCAN_predicted_peptide_5|313_aa MAEGKGEAKALSYMAAGKRACAGELSFIKPSDLVRLIHYCENSMKEPTPMIELSPPGPIL YTWGLLQFKMFIAAIHNSQKVEISKVFIMDKQNLVYTYNGMGKQMGKQNVVYTYNGRVFS DIKGKKHCYMLQHGRILKALCQRDDNFSETFLKDEVFMREQKDDQKKPEKLEKVDYCKLI RQHREAEGGTWLGKQSECGRHDFQGQVIQSNMTAAWLSLETHVFGALSHHVRSLKTEATM LKRALGEATETQRKMPRRPSVADAAYSRPLSLEAFKKTQPQPLWDCSPMREPEREPPTWT QPNLSTFRDNNHQ >gi568815590r:106661040_106870190|GENSCAN_predicted_CDS_5|942_bp atggcagaaggtaaaggagaagcaaaggcactctcttacatggcagctggcaagagagct tgtgcgggggaactgtcatttataaaaccatcagatctcgtgagacttattcactactgt gagaacagtatgaaggaacccacccccatgattgaattatctccacctggccccatcctt tacacgtggggattattacaattcaagatgttcatagcagctattcacaatagtcaaaag gtggaaatatccaaagtgttcataatggataaacaaaatttggtatatacatacaatgga atgggtaaacaaatgggtaaacaaaatgtggtatatacatacaatggaagagtcttcagt gatataaaaggaaagaagcactgctacatgctacaacatggacgaatcttgaaagcatta tgccaaagagatgataacttcagtgaaacctttctgaaggacgaggtcttcatgcgagaa caaaaagatgatcagaaaaagcctgaaaaattggaaaaggtagactattgcaagttgata agacaacacagagaagctgaaggaggaacctggttgggaaagcagtcagaatgtggaaga catgacttccagggccaggtcatacaaagcaacatgactgctgcttggctgtctcttgag acacatgtgtttggagccctgagtcaccatgtaagaagtctgaaaactgaagccaccatg ctgaagagagcccttggagaggccacagagacccagagaaagatgcccaggagacccagt gtggcagacgcagcctacagcagacctttgagtttggaggccttcaagaagacccagcct cagccactatgggactgcagtcccatgagagaacctgagcgagagccacctacctggact cagccaaacctcagcactttcagagataataatcatcagtga