GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:49:15 Sequence gi568815592r:77362234_77563403 : 201170 bp : 36.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 377 285 93 1 0 70 84 89 0.009 4.86 1.07 Intr - 1646 1508 139 0 1 35 94 37 0.009 -2.40 1.06 Intr - 6815 6653 163 2 1 49 98 214 0.049 17.13 1.05 Intr - 21364 21140 225 1 0 58 82 92 0.023 2.86 1.04 Intr - 29052 28955 98 1 2 16 85 99 0.002 1.21 1.03 Intr - 56122 56031 92 0 2 68 100 82 0.231 6.12 1.02 Intr - 69705 69611 95 0 2 45 95 43 0.055 -1.46 1.01 Init - 70037 69867 171 2 0 56 105 79 0.149 5.89 1.00 Prom - 94002 93963 40 -2.95 2.09 PlyA - 95031 95026 6 1.05 2.08 Term - 101202 99998 1205 1 2 44 38 1276 0.955 109.04 2.07 Intr - 101983 101584 400 1 1 73 66 220 0.396 11.45 2.06 Intr - 102834 102745 90 0 0 103 9 74 0.382 0.27 2.05 Intr - 145607 145466 142 1 1 47 75 81 0.412 2.13 2.04 Intr - 146281 146089 193 2 1 94 30 100 0.450 2.43 2.03 Intr - 147032 146758 275 1 2 -8 52 235 0.298 6.56 2.02 Intr - 147928 147504 425 1 2 38 -32 298 0.437 4.54 2.01 Init - 148309 148136 174 1 0 101 71 155 0.845 14.64 2.00 Prom - 149327 149288 40 -8.35 3.00 Prom + 154334 154373 40 -5.75 3.01 Init + 155969 156045 77 1 2 55 98 -31 0.768 -4.78 3.02 Intr + 156752 157209 458 2 2 108 71 276 0.080 19.84 3.03 Intr + 159533 159763 231 0 0 31 72 107 0.008 0.22 3.04 Intr + 182603 182760 158 0 2 50 77 149 0.231 8.81 3.05 Intr + 199098 199206 109 2 1 81 86 91 0.478 7.14 3.06 Intr + 199445 199575 131 0 2 46 77 104 0.421 4.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 228 326 99 2 0 58 69 107 0.904 6.11 S.002 Intr + 4649 4703 55 1 1 114 102 51 0.828 6.73 S.003 Intr + 114044 114107 64 1 1 128 86 44 0.827 5.57 S.004 Sngl + 159918 160364 447 2 0 39 42 201 0.817 6.57 S.005 Init + 162029 162111 83 1 2 83 27 77 0.883 1.59 S.006 Term + 164772 164988 217 0 1 89 53 111 0.840 3.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:77362234_77563403|GENSCAN_predicted_peptide_1|359_aa MVTSYGRRILQLLSFSFQPWASTKFSEKQTPRRRWPFPGNGQVCHYPATLRPVIRPEGLM VRSRTPTDVHLWHRIVGKSFTIPKSLKLLPPATLSQQVMLRGDVANSGFPLMVVTEEWEA SLQGEEKVECTSHTPTSAAAFQDTGFYHTTLKWLYYFYISTNSGNSPFPHQQEVARKNTP PGPFYNYRDGFTEQERGASPSWTSAAILKFTLIKNCLNPKGISLMAKALEGTHAQVAGLR EEFIGEEDASGWSWRGLFDERRQKSTAKKSQDTESPLYLQQDPFKAGSLWVIQQMGCNNI SKCAICYLENGKSFTKYCIFYFELEFARMLPVALLPSAPPQDAATCILVTLALSADDSS >gi568815592r:77362234_77563403|GENSCAN_predicted_CDS_1|1077_bp atggtaacttcatatggaagaagaattcttcagcttctcagtttctccttccagccatgg gcttcgacaaagttctcagagaaacagactcccagacgacgctggccattccctggcaac ggccaggtttgccattatccagccactctgagacctgttatcaggcctgagggcctgatg gtcaggtccaggactcctacagatgtgcatctgtggcacaggattgtgggcaaaagcttc acaatacccaaatcccttaagctcctgcctcctgcaacactttcccagcaggtaatgtta cgaggtgatgttgcaaattcaggctttccactgatggtagtgacggaggagtgggaagct tctctgcaaggagaggagaaagtggaatgcacatcccataccccaacgtctgcagctgca tttcaagatactggcttctatcacactacactcaagtggctatactatttttacatttcc accaacagtggaaactctccattccctcatcagcaggaagtagccagaaagaacacgccc cctggtcctttttataactatagggatggatttacagagcaggagcgaggagcgtcacca tcttggacaagcgctgccattttaaagttcaccttgatcaaaaactgcctaaatccaaag ggcatcagcctaatggctaaggccctagaaggcacacacgcacaagtggctggacttcgt gaggaattcataggggaagaagatgcaagtggctggtcatggagaggactttttgacgag cgccggcaaaagagcacagcgaagaaatcccaggatacggaaagccctctgtacttgcaa caagacccattcaaagctggcagcctttgggtcattcaacaaatgggttgtaacaacata tccaaatgtgccatttgttatcttgaaaatggaaagagtttcaccaaatactgtatcttc tactttgagttagaatttgcaaggatgctgcctgtagctctgttgccctctgcaccacct caggatgctgctacctgcatcctggtcacattggctctatctgcagatgactccagn >gi568815592r:77362234_77563403|GENSCAN_predicted_peptide_2|967_aa MTLFRMWQGPLEIGLPLEMKVTGFRTISQVVFKPLAPAAQTAITAQELQVILEIEGRKRE LTHMGASILMAPGQSLCLPLVEANINPEVWAAQGRIGRAVTTRLVQIHLKDPTIFPNQKQ YPLKQDARKGPEAIINNLKMQGLLKRCKSPCNTPILTVQNQRGMETFQDLHLINEFINPI YLVVPNPYTLLTEIPEGTKWILQAMETWYSEIAHPLYHLIRETQAAKTHILTWETEAQKA FKQLKQALLKAIALSLPVGRAFNLYVSERKGMALGVLMQAQGPAQQPVGYLSTSAQLAEL RALTRALELSKGKVANIYNDAKYAFLVLHTHAAIWKERHFPTTNGSAIKYPQEFNRMGSY PGEDWQIDFTHMPKMKHPIPPVWVDTFINWVEAFPCCTEKASEVGCWKEHAYWQLFSRGT EALSSWKRMVEQKCAAPGPPDPRTQWHGSEWLPWDQGGGGGGAARAAATPAPVSPLLWLR LRGAARPSGQRVKREGGQSSGARRGAALLLDFTPPSSGGRCSPPKVPQLGARGGNARSRD LSLACKLWSLHLGNSCGQSLQIQKRPELRSGAGARRAMEEPGAQCAPPPPAGSETWVPQA NLSSAPSQNCSAKDYIYQDSISLPWKVLLVMLLALITLATTLSNAFVIATVYRTRKLHTP ANYLIASLAVTDLLVSILVMPISTMYTVTGRWTLGQVVCDFWLSSDITCCTASILHLCVI ALDRYWAITDAVEYSAKRTPKRAAVMIALVWVFSISISLPPFFWRQAKAEEEVSECVVNT DHILYTVYSTVGAFYFPTLLLIALYGRIYVEARSRILKQTPNRTGKRLTRAQLITDSPGS TSSVTSINSRVPDVPSESGSPVYVNQVKVRVSDALLEKKKLMAARERKATKTLGIILGAF IVCWLPFFIISLVMPICKDACWFHLAIFDFFTWLGYLNSLINPIIYTMSNEDFKQAFHKL IRFKCTS >gi568815592r:77362234_77563403|GENSCAN_predicted_CDS_2|2904_bp atgaccctgttcagaatgtggcagggaccactggaaatcggactgcccctggagatgaag gtgactgggttcagaacaatctcacaggtggtgttcaaacccctggctccagcagctcaa actgccattacagcacaggagctccaggtgattctggaaattgaaggaagaaagagagag ttaactcacatgggggccagcatccttatggccccaggacaatctctttgtcttcccctg gtggaagctaatatcaatccagaagtgtgggcagctcaaggaagaataggtcgagctgta accactagactagtccagatccatcttaaggatcccaccatttttcctaaccagaaacaa tatcccctgaagcaagatgctaggaaagggccagaagctattattaataacctgaagatg cagggcctcctcaaacgctgtaagagcccctgcaacactccaatattaacagtgcaaaac caacggggaatggagacttttcaggacctccacctcattaatgagttcataaacccaatc tatctggtagtacctaatccctataccttgctgaccgaaatacctgagggaactaaatgg attttgcaggctatggaaacctggtacagtgagatagctcatcctctatatcacctcata agagaaactcaagcagctaaaactcatatcctaacctgggaaactgaagctcaaaaggcc tttaagcagctaaagcaggccctacttaaggcaatagctctcagccttcccgttggaaga gccttcaatctgtatgtatcagaaaggaagggaatggccctgggagtcttaatgcaggcc caaggaccagctcaacaaccagtgggttatctgagcacaagcgctcaactagctgaactg agagctcttacaagagcacttgagttaagcaagggaaaggtagctaacatttacaatgat gccaagtatgctttcttggttctccatactcatgctgccatttggaaggaaagacacttt cctaccaccaatggatctgctataaaataccctcaggaatttaacaggatgggaagctat ccaggggaggactggcagatagactttacccacatgccaaagatgaagcatccaatacct cctgtatgggtagatacttttattaactgggtagaagcatttccatgctgtacagaaaag gcctctgaggtaggctgttggaaggaacacgcttactggcaactgttttccagaggcact gaagcactgagcagctggaagaggatggtggagcagaagtgtgccgcgccaggtcctcca gacccgcgcacccagtggcatggctccgagtggctcccgtgggaccagggtggcggtggc ggcggcgcggcccgagcagccgcaactccagcccccgtgtccccccttttatggctccgt ctccgcggggcagctcgtccgagtggccagagagtgaaaagagagggagggcagagctcc ggcgcgaggcgcggcgcagcgctgctcctagacttcaccccacccagctctggcggccgc tgcagccccccaaaagtgccccagcttggggcgaggggtgggaatgcaagatctcgggac ctctcgctggcctgcaagctttggtctctacacctaggaaactcctgtgggcaaagtctg cagatccaaaagcgtccagagctgcgctccggagctggggcgaggagagccatggaggaa ccgggtgctcagtgcgctccaccgccgcccgcgggctccgagacctgggttcctcaagcc aacttatcctctgctccctcccaaaactgcagcgccaaggactacatttaccaggactcc atctccctaccctggaaagtactgctggttatgctattggcgctcatcaccttggccacc acgctctccaatgcctttgtgattgccacagtgtaccggacccggaaactgcacaccccg gctaactacctgatcgcctctctggcggtcaccgacctgcttgtgtccatcctggtgatg cccatcagcaccatgtacactgtcaccggccgctggacactgggccaggtggtctgtgac ttctggctgtcgtcggacatcacttgttgcactgcctccatcctgcacctctgtgtcatc gccctggaccgctactgggccatcacggacgccgtggagtactcagctaaaaggactccc aagagggcggcggtcatgatcgcgctggtgtgggtcttctccatctctatctcgctgccg cccttcttctggcgtcaggctaaggccgaagaggaggtgtcggaatgcgtggtgaacacc gaccacatcctctacacggtctactccacggtgggtgctttctacttccccaccctgctc ctcatcgccctctatggccgcatctacgtagaagcccgctcccggattttgaaacagacg cccaacaggaccggcaagcgcttgacccgagcccagctgataaccgactcccccgggtcc acgtcctcggtcacctctattaactcgcgggttcccgacgtgcccagcgaatccggatct cctgtgtatgtgaaccaagtcaaagtgcgagtctccgacgccctgctggaaaagaagaaa ctcatggccgctagggagcgcaaagccaccaagaccctagggatcattttgggagccttt attgtgtgttggctacccttcttcatcatctccctagtgatgcctatctgcaaagatgcc tgctggttccacctagccatctttgacttcttcacatggctgggctatctcaactccctc atcaaccccataatctataccatgtccaatgaggactttaaacaagcattccataaactg atacgttttaagtgcacaagttga >gi568815592r:77362234_77563403|GENSCAN_predicted_peptide_3|388_aa MTWIPWQDGRIGTAPVCSSQRDQCRRTSLKERQQPQSAAYRQNFHLPGDEKRAPGGRGSC GCSFSRLKRSCLPSLKRAADLPAQYSSSAKGQTASSSWSLIPVHPDRKTPPNRGQQSPYT GKLWLASGRCPSGTKLPEEGASSSLCCSAAFATAVASTSTKRRPCKNSIERSPTAKTKEI QTTIREYYQHLYANKPESLEEMHKFLEIYTLPRLNQEKVNTLNRPITISEIDTAVNNLPT KKAQDQLDSQPNFARETEQGEVKGGKVAKRKGKTGSFDNHNELEDVLVNVDAKDAGLRMT DQSSGLQDHPTNWKQVEHIEELGLTTAICNDIKVKEKMKVPSSKVSEDDIWVCEQELGVT AWCQRENGPCQNAASQVYTASERVGFPK >gi568815592r:77362234_77563403|GENSCAN_predicted_CDS_3|1164_bp atgacatggattccctggcaagatggccgaataggaacagctccagtctgcagctcccag cgagaccaatgcagaaggacatctctgaaagaaaggcagcagccccagtcagcagcttat agacaaaactttcatctccctggggatgaaaaaagagcacctgggggaaggggcagctgt gggtgcagcttcagcagacttaaacgttcctgcctgccatctctgaagagagcagcagat ctcccagcacagtactcgagctcagctaagggacagactgcctcctcaagttggtccctg atccccgtacatcctgacaggaaaacacctcccaacaggggtcaacaatcaccttataca ggaaagctctggctggcatctggcaggtgcccctctgggacaaagcttccagaggaagga gcaagcagcagtctttgctgttctgcagcctttgctacagcagtagcatcgacatcaaca aaaaggagaccatgcaaaaactccatcgaaaggtcaccaacagcaaagaccaaagaaata caaactaccatcagagaatactatcaacatctctatgcaaataaaccagaaagtcttgaa gaaatgcataaatttttggaaatatacaccctcccaagactaaaccaggaaaaagtcaac accctgaatagaccaataacaatttctgaaattgatacagcagttaataacctaccaaca aaaaaagcccaggaccagctggattcacagccaaattttgccagagagactgagcaggga gaagtgaaagggggaaaagtagcaaagagaaagggaaaaactgggtcattcgacaatcac aatgaacttgaagatgttttagttaatgtggatgcaaaggatgctgggctgaggatgact gatcagagctcaggcttgcaagaccatcctaccaattggaagcaggtggaacatatagaa gaactaggtcttactacagcaatctgcaatgatatcaaggtgaaggaaaaaatgaaagta ccctcttctaaagtatcagaggatgatatttgggtctgtgaacaagagttgggagtcact gcttggtgtcagagagaaaatgggccatgtcagaatgcagcttcccaggtatacacagca tctgagagggttggctttcctaag