GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:47:32 Sequence gi568815587f:86145230_86378522 : 233293 bp : 39.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4949 5088 140 1 2 30 64 124 0.068 2.54 1.02 Intr + 13312 13484 173 1 2 26 25 210 0.148 7.16 1.03 Intr + 16224 16327 104 1 2 98 64 72 0.518 4.77 1.04 Intr + 17116 17269 154 0 1 74 72 63 0.503 2.02 1.05 Intr + 20485 20721 237 2 0 22 20 194 0.010 2.66 1.06 Intr + 38340 38483 144 1 0 41 44 116 0.654 1.83 1.07 Intr + 39568 39842 275 2 2 21 101 159 0.800 6.83 1.08 Term + 41066 41461 396 1 0 69 42 181 0.965 5.69 1.09 PlyA + 41554 41559 6 1.05 2.00 Prom + 42414 42453 40 -7.35 2.01 Sngl + 49905 50204 300 2 0 71 38 403 0.895 28.64 2.02 PlyA + 50231 50236 6 -4.04 3.00 Prom + 50247 50286 40 -14.43 3.01 Sngl + 50356 50973 618 0 0 64 32 383 0.916 26.24 3.02 PlyA + 51517 51522 6 1.05 4.00 Prom + 51686 51725 40 -3.55 4.01 Init + 63056 63085 30 1 0 71 101 24 0.598 1.79 4.02 Intr + 67130 67271 142 1 1 75 39 77 0.836 0.51 4.03 Intr + 68050 68161 112 2 1 84 59 88 0.530 4.02 4.04 Term + 68480 68588 109 2 1 90 49 79 0.805 1.20 4.05 PlyA + 68889 68894 6 1.05 5.00 Prom + 85053 85092 40 -4.75 5.01 Init + 100001 100114 114 1 0 46 101 132 0.486 10.57 5.02 Intr + 105067 105219 153 0 0 103 95 66 0.953 8.15 5.03 Intr + 106919 107011 93 1 0 71 70 48 0.513 0.54 5.04 Intr + 112286 112367 82 1 1 18 115 63 0.730 0.29 5.05 Intr + 117040 117123 84 2 0 65 72 56 0.433 0.57 5.06 Intr + 118943 119034 92 0 2 56 87 86 0.842 4.09 5.07 Intr + 120854 120987 134 1 2 73 92 30 0.679 0.42 5.08 Intr + 123227 123332 106 2 1 129 86 -15 0.686 1.70 5.09 Intr + 131751 131909 159 2 0 95 64 133 0.943 10.86 5.10 Intr + 132689 132762 74 1 2 45 108 75 0.996 2.49 5.11 Term + 133170 133296 127 0 1 111 38 150 0.999 9.07 5.12 PlyA + 133489 133494 6 1.05 6.00 Prom + 138099 138138 40 -8.85 6.01 Sngl + 139176 139490 315 2 0 83 33 205 0.968 10.20 6.02 PlyA + 142287 142292 6 1.05 7.00 Prom + 142427 142466 40 -8.85 7.01 Sngl + 150110 150352 243 1 0 102 42 303 0.773 21.93 7.02 PlyA + 151427 151432 6 1.05 8.03 PlyA - 152317 152312 6 1.05 8.02 Term - 157246 157032 215 2 2 72 47 155 0.427 6.21 8.01 Init - 164418 164112 307 0 1 71 19 172 0.147 6.10 8.00 Prom - 164472 164433 40 -6.45 9.02 PlyA - 164589 164584 6 1.05 9.01 Sngl - 166952 166242 711 2 0 53 43 348 0.586 22.77 9.00 Prom - 177256 177217 40 -6.05 10.00 Prom + 189448 189487 40 -5.85 10.01 Init + 192179 192301 123 1 0 90 95 100 0.206 11.12 10.02 Intr + 199374 199492 119 2 2 57 60 122 0.170 4.64 10.03 Term + 210783 210906 124 0 1 84 48 88 0.320 1.28 10.04 PlyA + 211925 211930 6 1.05 11.05 PlyA - 212368 212363 6 1.05 11.04 Term - 213340 213182 159 1 0 74 42 75 0.352 -1.54 11.03 Intr - 215098 214879 220 0 1 66 68 116 0.241 4.78 11.02 Intr - 219610 219512 99 0 0 80 70 50 0.084 0.71 11.01 Intr - 230597 230342 256 0 1 109 68 61 0.101 1.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 3630 3435 196 2 1 104 54 128 0.825 6.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_1|540_aa KRQNQINIVKLGTEHVEERSKREVVEGYAFGEEEEEKEFLQSLASKRNFRSSQETIESGL FQPDRVKALLSVRACHMQKHEPSCKEVMCQWGVRKYLLRYARQKVLDSWIWAPDLIPDSS RKTHAKYHGVSDNEQTLSYEMECWFHVLEDTASCQKEDLGLISMCASIQMRLSGLSWMSH GNGQTSAASWEVGTPNTRSRLDLGDTIQDIGTGKYFMTKMRRAIATKAKIDKWDLIKLKS FCTAKEAINRVNRPLIEGEKSIANYAFDKEVDVEKRIQAKVVYLVDDPRKPQSESGEARC RREGADKVCAIKDVTTEVDTSAATSTNSKPSEFLGSFASNLESQAVISGGPRLGHMLVRQ LPGTQEANRLACFPASIRGRHILPPSMSHAMAVSPDTGRGLRRWAAKKNGFHTKSEEITR RKCLPREKEPRGSAKNVRLYGGSRYELLSKPDSVVGLRAWLLSRERLSNPTRMISKTALL ALLTFASALFKEETIYPDAGPLRLHSLARSSQELGRVEDSTFMKPFSHTVPAQPLKMCAL >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_1|1623_bp aaaaggcagaaccagattaacatagtaaaactggggacagagcatgtggaagaaaggagc aagagggaggtggtagaaggttacgcatttggggaggaagaggaagagaaggaatttttg cagagccttgcctcaaagaggaattttcgatctagtcaagagacaatagagagtgggctc ttccaacctgatcgggtaaaagcactgctgtcagtgagggcctgccacatgcaaaaacat gagccttcctgcaaggaggtgatgtgccagtggggagtaaggaagtacctcctacgatat gcaagacaaaaagtactggacagctggatttgggctccagatctaatccctgattccagc aggaaaacacatgcaaaatatcatggagtctcagacaatgagcagactctgtcttatgaa atggagtgctggttccatgtgctagaagacacagcttcctgccagaaagaggatctgggc ctcatctccatgtgtgccagcattcagatgcggctgtctgggctttcatggatgagtcat ggcaatggccagacctcagctgcaagttgggaagtaggaacaccaaacacaagaagcagg ttggacctaggtgataccattcaggacataggcacgggcaaatatttcatgacaaagatg cgaagagcaattgcaacaaaagcaaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaagctatcaacagagtaaacagaccacttatagaaggggagaaa agtattgcaaactatgcatttgacaaagaagtggacgttgagaaaaggattcaagcaaag gtagtttatttggtggacgatcctaggaaaccccagtcagaaagtggggaagcgaggtgc agaagagaaggagccgataaagtgtgtgctatcaaagacgttaccactgaggtggacacc agtgctgccaccagcacaaattctaaaccgtccgagtttcttgggtcatttgcttcaaac ttagagtctcaggcagtaatatcaggggggccaagattaggtcatatgcttgtgcgccag ctgccagggacccaggaggccaaccgtctggcctgttttccagcttccatcaggggaaga cacattctacctcccagcatgtctcatgcaatggcagtttccccagacacaggaaggggg ctcagacgctgggcagccaagaagaatgggtttcatacaaagagtgaggagattacaaga aggaaatgtcttccaagagagaaagaacccagaggttctgccaaaaatgtgagactatat gggggaagcagatatgagctactttcaaagcctgactccgtggtgggactgagagcctgg ctcctgagtcgagagaggctttccaatcccacaagaatgatttcaaaaacagccctcttg gcattactgacatttgcctctgcccttttcaaggaggaaacaatttacccagatgcaggc ccgcttaggctccattctctagcaagatcaagccaagaacttggaagggtggaggacagc acattcatgaagccattttcacatactgttcctgcacagcctttgaaaatgtgtgccttg taa >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_2|99_aa MAATEGVGEAAQGGEPRQLEQPPPQPHPPLPQEQHEEEMAAEAGEAVASPMDDGFLSLDS PSYVPYRDRTEWADTDPVPQNDGPNPVVQIIYSDKFRDV >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_2|300_bp atggcggccaccgagggggtcggggaggctgcgcaaggcggtgagccccggcagctggag cagcccccgccccagccgcacccaccgctgccccaggagcagcacgaggaagagatggca gcagaggctggggaagccgtggcgtcccccatggacgacgggtttctgagcctggattcg ccctcctatgtcccgtacagggacagaacagaatgggctgatacagatccagtgccgcag aatgatggccccaatcccgtggtccagatcatttatagtgacaaatttagagatgtttga >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_3|205_aa MNYITEIIEGQPKNYLVCHNRRVIVEWLRDPSQEPEFIDNILNQDAKNYHAWQHRQWVIQ EFKLWDNELQYVDQLLKEDVRNNSVWNRRYFVISNTTGYNDCAVLEREVQYTLEMIQLVP HNSSWNYLKGILQDRGLSKYPNLLNQLLDLQPSHSSPYLIAFLVGIYEDMLENQCVNKED ILNNALGLCEILAKEKDMIRKEYWR >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_3|618_bp atgaactacatcactgaaataattgaggggcagcccaaaaactatctagtttgtcacaat aggcgagtaatagtggaatggctaagagatccatctcaggagcctgaatttattgataat attcttaatcaggatgcaaagaattatcatgcctggcagcatcgacaatgggttattcag gaatttaaactttgggataatgagctgcagtatgtggaccaacttctcaaagaggatgtg agaaataactctgtctggaaccgaagatattttgttatttccaacaccactggctacaat gattgtgctgtattggagagagaagtccaatacactctggaaatgattcaactagtacca cataatagttcatggaactatttgaaaggaattttgcaggatcgtggtctttccaaatat cctaatctgttaaatcaattacttgatttacaaccaagtcatagttccccctacctaatt gcctttcttgtgggtatctatgaagacatgctagaaaaccagtgtgtcaataaggaagac attcttaataatgcattagggttatgtgaaatcctagctaaagaaaaggacatgataaga aaggaatattggagataa >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_4|130_aa MVNITNYQENRRELTLRRCKVQDHMTHGQKHQDVDLALSSFKAQAFFLILYHLLGWILPG FYVHEILTVPLGESMARYAGSGGRSRAVTLPLTSRAHSLSKQGDAVEEKKPELNGAVLNP KTREYHWEQQ >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_4|393_bp atggtcaacatcactaattatcaggaaaatagaagggagctgacactccgaaggtgcaaa gtccaagatcatatgactcatggacagaagcaccaagatgtggacctagctttgtcttcc ttcaaagcccaggcctttttcctcatactataccatctcttgggttggatcttgccaggg ttctatgtgcatgaaatcttgactgtccctcttggggagtccatggcaagatatgctgga agcggcggaaggtccagggcagtgactctgcctctaacctctagggcacacagcctcagc aagcagggcgatgcggtcgaagaaaagaagccagaactcaatggggcagtgttgaacccc aaaaccagagaataccactgggagcaacagtag >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_5|405_aa MSEREVSTAPAGTDMPAAKKQKLSSDENSNPDLSGDENDDAVSIESGTNTERPDTPTNTP NAPGRKSWGKGKWKSKKCKYSFKCVNSLKEDHNQPLFGVQFNWHSKEGDPLVFATVGSNR HYVGHGNAINELKFHPRDPNLLLSVSKDHFSFYDKQPEEAMHHCECFDAQIFLLPDHALR LWNIQTDTLVAIFGGVEGHRDEVLSADYDLLGEKIMSCGMDHSLKLWRINSKRMMNAIKE SYDYNPNKTNRPFISQKIHFPDFSTRDIHRNYVDCVRWLGDLILSKSCENAIVCWKPGKM EDDIDKIKPSESNVTILGRFDYSQCDIWYMRFSMDFWQKMLALGNQVGKLYVWDLEVEDP HKAKCTTLTHHKCGAAIRQTSFSRDSSILIAVCDDASIWRWDRLR >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_5|1218_bp atgtccgagagggaagtgtcgactgcgccggcgggaacagacatgcctgcggccaagaag cagaagctgagcagtgacgagaacagcaatccagacctctctggagacgagaatgatgac gctgtcagtatagaaagtggtacaaacactgaacgccctgatacacctacaaacacgcca aatgcacctggaaggaaaagttggggaaagggaaaatggaagtcaaagaaatgcaaatat tctttcaaatgtgtaaatagtctcaaggaagatcataaccaaccattgtttggagttcag tttaactggcacagtaaagaaggagatccattagtgtttgcaactgtaggaagcaacaga cactatgttggccatggaaatgctatcaatgagctgaaattccatccaagagatccaaat cttctcctgtcagtaagtaaagatcatttctcattttatgataagcagccagaagaagcc atgcatcactgtgaatgctttgatgctcagatatttcttctgccagatcatgctttacga ttatggaatatccagacggacactctggtggcaatatttggaggcgtagaagggcacaga gatgaagttctaagtgctgattatgatcttttgggtgaaaaaataatgtcctgtggtatg gatcattctcttaaactttggaggatcaattcaaagagaatgatgaatgcaattaaggaa tcttatgattataatccaaataaaactaacaggccatttatttctcagaaaatccatttt cctgatttttctaccagagacatacataggaattatgttgattgtgtgcgatggttaggc gatttgatactttctaagtcttgtgaaaatgccattgtgtgctggaaacctggcaagatg gaagatgatatagataaaattaaacccagtgaatctaatgtgactattcttgggcgattt gattacagccagtgtgacatttggtacatgaggttttctatggatttctggcaaaagatg cttgcattgggcaatcaagttggcaaactttatgtttgggatttagaagtagaagatcct cataaagccaaatgtacaacactgactcatcataaatgtggtgctgctattcgacaaacc agttttagcagggatagcagcattcttatagctgtttgtgatgatgccagtatttggcgc tgggatcgacttcgataa >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_6|104_aa MDLISFLDYNCTLDSEGSFGSAHLADLPSNSKVDKTCSEVLSTPTLNVHSGQKYALCPKG TKVMMTYARAWVKDTVRILGFQNKVFLEPPVLHEAYVQLVVKEM >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_6|315_bp atggacctcatttccttcctggattacaactgcaccttggacagtgagggctcttttgga agtgcacacctagctgacctgccctccaactcaaaagtggacaagacctgttcagaggtg ttaagcacccccacactcaatgtccattcaggccaaaagtatgctctttgccccaaggga accaaagtcatgatgacatatgccagagcttgggtaaaggatacagtgaggattcttgga tttcagaacaaagtattcctggagccccctgtacttcatgaagcctatgtccagcttgtt gtcaaagaaatgtag >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_7|80_aa MGNQRASSPGLLAILMQVQMNEERSSKMVFGQINYSITWFPIMDDEEGEGEKEDEDRDED EGEDKGDEGEEGEEDEGEDD >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_7|243_bp atggggaaccagagagcttcttcacctggcttactggccattctgatgcaggtgcagatg aatgaggaaaggtcatcaaagatggtatttggccaaatcaattacagtattacttggttc ccaataatggatgatgaggaaggagaaggagaaaaagaagatgaagacagggatgaggat gaaggtgaagataaaggtgatgaaggggaggaaggagaggaagatgaaggagaagatgac taa >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_8|173_aa MGKEFMTKTPKAMATKAKIDKWDLIKLKNFCTAKETTIRVNRQPTEWEKIFAIYPSDKGL ISRIYKELKQIYKKKIKQPHQKVDNGYEHTLLKRRHVCSQQTPSPPPSSQTWRRQLRSAG SDLKPATRSPGATALVETKNLPEFAGGCLPLHYIVTGPAARMPLPPRTLTFSS >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_8|522_bp atgggcaaggaattcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagaacttctgcacagcaaaagaaactaccatcagagtg aacaggcaacccacagaatgggagaaaatttttgcaatctacccatctgacaaagggcta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaaatcaaacagccccat caaaaagtggacaacggatatgaacacacacttctcaaaagaagacatgtatgtagccaa cagacaccctccccgccaccaagcagccaaacatggcgacggcagctccgttcagccggc agtgacctgaagccggcgactaggagtcctggggctactgccctagtcgagactaagaac cttccagaatttgcgggagggtgtctgcccctccactacatagtgactgggccagcggca agaatgcctctgcccccgcgaactctgaccttctcttcctaa >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_9|236_aa MIMGDFNTPLSTLDRSTRQKVNKDIQELNSALHQADLIDIYRTLHTKSTEYTFFSAPHRT YSKIDHIVGGKALLSKCKRKKIITNCLSDHRAFKLELRIKKRTQNRSTTRKLNNLLLNDY WVHNEMKAEIKMFFETNKNKDTAYQKLWDTFKAVCRGKFIALNAHKRKQERSTIDTLTSQ LKELEKQEQTHSKASRRQEIAKIRAELKEIETEKTLQKINESRSWFFERSTKYIDH >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_9|711_bp atgataatgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaa gttaacaaggatatccaggaacttaactcagctctgcaccaagcagacctaatagacatc tacagaactctccacaccaaatcaacagaatatacattcttctcagcaccacatcgcact tattccaaaattgaccacatagttggaggtaaagcactcctcagcaaatgtaaaagaaaa aaaattataacaaactgtctctcagaccacagggcattcaaactagaactcaggattaag aaacgcactcaaaaccgctcaactacaaggaaattgaacaatctgctcctgaacgactac tgggtacataatgaaatgaaagcagaaataaagatgttctttgaaaccaataagaataaa gacacagcataccagaaactctgggacacattcaaagcagtgtgtagagggaaatttata gcactaaatgcccacaagagaaagcaggaaagatctacaattgacaccctaacgtcacaa ttaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaata gctaagatcagagcagaactgaaggagatagagacagaaaaaacccttcagaaaatcaat gaatccaggagctggttttttgaaagatcaacaaaatacatagaccactag >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_10|121_aa MNIVRTPSVAQIGISVELLDSMAQQTPVGNAAVSSVDSFTQFTQKMLDNFYNFASSFAVS QAQMTPSPSEMFIPANVVLKCQILSSEEARAEVAADLYGFVTVTKWFAYVIMIILNNDKP K >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_10|366_bp atgaatattgtccgaactccatctgttgctcagattggaatttcagtggaattattagac agtatggctcagcagactcctgtaggtaatgctgctgtatcctcagttgactcattcact cagttcacacaaaagatgttggacaatttctacaattttgcttcatcatttgctgtctct caggcccagatgacaccaagcccatctgaaatgttcattccggcaaatgtggttctgaaa tgtcaaattctttcttctgaggaggcaagagctgaggttgctgcagacctgtatggattc gtcactgtaacaaaatggtttgcttatgtgataatgattattctcaataatgacaagccc aaatga >gi568815587f:86145230_86378522|GENSCAN_predicted_peptide_11|244_aa XRSQPHFGVTLLQTMLKTDLVLLLTKPNEFSLRRCQVPFQSTYPVAFYDHLIFLIQRHDV KGNKINRLCMNSQKEPFRVPASLFSRVKNKMGIFTTTAVIHHYAVGPREFYKTRKRNRRS TLCVILVGLPFLVTTFHYQSRRKQILALLDSAGPGARYVTQYQSSDALSGAQTVELLTGR QQENVEVISTGMERPTLINGVAVLSPLYFLNKLAFTLHCGLALNSFLHDIQEPSLGVWIG TPLL >gi568815587f:86145230_86378522|GENSCAN_predicted_CDS_11|735_bp nacagatctcagcctcactttggggtgactcttctacaaacgatgctaaaaacagatctt gtcttgctactaaccaagcctaatgagttttctctgagaaggtgccaggtccccttccaa agtacttaccctgtagctttttatgatcatctcattttcttaatacaacggcatgatgtc aaaggcaacaaaatcaaccgtctttgcatgaactctcaaaaggagccatttagggtccct gcatctcttttctctagagtcaagaataagatggggatctttactaccacagctgttatt catcattatgctgtaggcccaagagagttctacaagacaagaaagagaaatagaagatct acattgtgtgtaatcttagtggggctgccattcctggtgactacattccattaccaaagc cgaaggaaacagatccttgccctccttgactctgcaggaccaggggccaggtatgtgacc caatatcagtcatcagatgctctttctggggctcagactgttgagctgttgacaggcaga cagcaggaaaatgtggaagttatttccacagggatggaacgtcctactcttatcaatgga gtagctgttctttcaccactttattttcttaataaacttgcttttactttgcactgtgga ctcgccctgaattctttcttgcatgatatccaagaaccctctcttggggtctggattggg acccctctcctgtaa