GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:39:52 Sequence gi568815584r:69785343_69986140 : 200798 bp : 44.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 965 755 211 1 1 100 116 277 0.683 30.62 1.01 Init - 11813 11458 356 2 2 74 96 517 0.991 47.71 1.00 Prom - 14765 14726 40 -5.06 2.06 PlyA - 15300 15295 6 1.05 2.05 Term - 17901 17711 191 1 2 56 42 148 0.952 4.61 2.04 Intr - 23573 23475 99 0 0 83 53 81 0.331 4.18 2.03 Intr - 37656 37570 87 1 0 75 94 21 0.349 1.34 2.02 Intr - 38572 38484 89 0 2 44 108 41 0.495 1.31 2.01 Init - 39004 38766 239 1 2 81 36 199 0.712 11.49 2.00 Prom - 43880 43841 40 -5.76 3.00 Prom + 45246 45285 40 -4.56 3.01 Init + 48880 48886 7 0 1 40 84 0 0.172 -4.25 3.02 Intr + 57377 57523 147 0 0 43 58 213 0.559 14.01 3.03 Intr + 60413 60558 146 0 2 89 45 41 0.090 -0.10 3.04 Term + 74063 74272 210 1 0 108 42 67 0.013 1.39 3.05 PlyA + 75618 75623 6 1.05 4.00 Prom + 86670 86709 40 -4.76 4.01 Init + 94337 94435 99 1 0 101 110 201 0.913 21.86 4.02 Term + 96433 96549 117 0 0 63 53 31 0.531 -4.36 4.03 PlyA + 97525 97530 6 1.05 5.02 PlyA - 97661 97656 6 1.05 5.01 Sngl - 100798 99998 801 1 0 94 36 910 0.992 82.24 5.00 Prom - 113268 113229 40 -1.76 6.00 Prom + 128859 128898 40 -4.56 6.01 Init + 146668 146755 88 0 1 87 96 53 0.424 7.00 6.02 Intr + 148727 148795 69 0 0 103 89 36 0.253 4.35 6.03 Intr + 157603 157697 95 2 2 87 63 68 0.071 3.88 6.04 Intr + 163636 163731 96 0 0 33 94 97 0.921 5.01 6.05 Intr + 166796 166961 166 1 1 66 113 147 0.960 14.53 6.06 Intr + 168078 168190 113 1 2 95 94 120 0.705 13.40 6.07 Intr + 184523 184705 183 1 0 82 78 24 0.084 0.78 6.08 Intr + 190373 190462 90 1 0 82 42 52 0.055 0.19 6.09 Intr + 193060 193179 120 0 0 86 72 36 0.759 2.49 6.10 Intr + 193816 194073 258 0 0 26 100 156 0.451 8.26 6.11 Intr + 198135 198169 35 2 2 88 58 12 0.249 -4.78 6.12 Term + 198522 199032 511 0 1 73 43 162 0.591 4.05 6.13 PlyA + 200254 200259 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 158043 158126 84 2 0 87 92 64 0.806 7.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:69785343_69986140|GENSCAN_predicted_peptide_1|189_aa MEAHNASAPFNFTLPPNFGKRPTDLALSVILVFMLFFIMLSLGCTMEFSKIKAHLWKPKG LAIALVAQYGIMPLTAFVLGKVFRLKNIEALAILVCGCSPGGNLSNVFSLAMKGDMNLSI VMTTCSTFCALGMMPLLLYIYSRGIYDGDLKDKVPYKGIVISLVLVLIPCTIGIVLKSKR PQYMRYVIK >gi568815584r:69785343_69986140|GENSCAN_predicted_CDS_1|567_bp atggaggcccacaacgcgtctgccccattcaacttcaccctgccacccaactttggcaag cgccccacagacctggcactgagcgtcatcctggtgttcatgttgttcttcatcatgctc tcgctgggctgcaccatggagttcagcaagatcaaggctcacttatggaagcctaaaggg ctggccatcgccctggtggcacagtatggcatcatgcccctcacggcctttgtgctgggc aaggtcttccggctgaagaacattgaggcactggccatcttggtctgtggctgctcacct ggagggaacctgtccaatgtcttcagtctggccatgaagggggacatgaacctcagcatt gtgatgaccacctgctccaccttctgtgcccttggcatgatgcctctcctcctgtacatc tactccagggggatctatgatggggacctgaaggacaaggtgccctataaaggcatcgtg atatcactggtcctggttctcattccttgcaccatagggatcgtcctcaaatccaaacgg ccacaatacatgcgctatgtcatcaag >gi568815584r:69785343_69986140|GENSCAN_predicted_peptide_2|234_aa MPSGTIWILEARHSHSGVLLWPIYRVTRKAASFEREPEQDKALQQAQSAVQAALPLGLYD PADPMVLEVSVADNDAGPHRHKVGRAQQHPIIQWKSYILDQAQTGPEGTKRAEICPHQVA GLIDLDYQDEISLLIHNRAMQHQIQKFTNVLDGTERVQKYSLSDIAASAIGSHKMPCSFD LSLVEQLLWGHQLPDKKSDYSEITMQMTYPQQPYEDPKQKPSAEPVISQNCERE >gi568815584r:69785343_69986140|GENSCAN_predicted_CDS_2|705_bp atgcctagtgggactatttggattttggaagcaagacattctcattcgggtgtgttactc tggcccatttatcgagtgacccgaaaggctgccagttttgagcgagaaccagaacaggac aaggcgctgcaacaggcccagtctgctgtgcaagctgctctgccacttgggctatatgac ccagcagatccaatggtgcttgaggtgtcagtggcagataatgatgctggcccccataga cataaagtgggtcgtgcacagcagcatcccatcatccaatggaagtcgtatatacttgat caggctcaaacaggtcctgaaggcacaaaaagggcagaaatttgtcctcaccaggtggct gggctgattgacctggactatcaagatgaaattagtctactaatccataacagagctatg cagcaccaaattcagaaatttacaaatgtgcttgatggcacagagagagttcagaagtac tcactgagtgatattgctgcttcagccataggaagtcataagatgccttgcagctttgac ttaagcctcgtggagcaattgctctggggccatcagctaccagataagaagtctgactac tctgagatcacgatgcagatgacttatcctcagcaaccatatgaggaccccaagcaaaag ccatcagctgaacctgtcatatcccagaactgtgaaagagaataa >gi568815584r:69785343_69986140|GENSCAN_predicted_peptide_3|169_aa MQAAKRPSDEWEGKAEGGEEKEKEEEEEEEEEEEEVEEVAVAVQARQSGMECDSPQFFGA SIAPSQSGPNLELKQVAGGSQSGLLIPLAPVIEYVAQPEAVRSERGCLEGPLLWPQKPHS RSPQAPLSMRTVDPKAKEKSKKLGPDAMQLTPKFASKLGVREKLRIPKH >gi568815584r:69785343_69986140|GENSCAN_predicted_CDS_3|510_bp atgcaagctgcaaagaggcccagtgatgaatgggaggggaaagcggaaggtggggaggag aaggagaaggaggaggaggaggaggaggaggaggaggaggaggaggtggaggaggtggca gtggcggtacaagcaaggcagagtgggatggagtgtgacagcccccaattttttggtgca tccattgctccctcacaatcaggccccaacttagagctgaagcaggtggcaggtggaagc cagtcagggcttctcattcctctggctccagtaattgagtatgtggctcagccagaagca gtgaggtcagaaagagggtgccttgaaggcccccttctgtggcctcagaaaccacactcc cgcagtccccaggctcccctcagcatgaggacagtggatccaaaggcaaaggagaagagt aagaagctggggcccgatgccatgcaattaacccccaagtttgcttcaaaacttggtgta agagaaaaactgaggatccccaagcactaa >gi568815584r:69785343_69986140|GENSCAN_predicted_peptide_4|71_aa MLPARCARLLTPHLLLVLVQLSPARGHRTTGPRITIRATTYEHLGWALYMISYSNENKKL GLLKAILQMTK >gi568815584r:69785343_69986140|GENSCAN_predicted_CDS_4|216_bp atgctgcccgcgcgctgcgcccgcctgctcacgccccacttgctgctggtgttggtgcag ctgtcccctgctcgcggccaccgcaccacaggccccaggataacaataagagctaccact tatgaacatctgggctgggcactttacatgatctcttactcgaacgagaacaagaagttg ggtcttttaaaagccattttacagatgacaaagtga >gi568815584r:69785343_69986140|GENSCAN_predicted_peptide_5|266_aa MPKGKKAKGKKVAPAPAVVKKQEAKKVVNPLFEKRPKNFGIGQDIQPKRDLTRFVKWPRY IRLQRQRAILYKRLKVPPAINQFTQALDRQTATQLLKLAHKYRPETKQEKKQRLLARAEK KAAGKGDVPKKRPPVLRAGVNTVTTLVENKKAQLVVISHDMDPIELVVFLPALCRKMGVP YCIIKGKARLGRLVHRKTCTTVAFTQVNSEDKGALAKLVEAIRTNYNDRYDEIRRHWGGN VLGPKSVARIAKRKKAKAKELATKLG >gi568815584r:69785343_69986140|GENSCAN_predicted_CDS_5|801_bp atgccgaaaggaaagaaggccaagggaaagaaggtggctccggccccagctgtcgtgaag aagcaggaggctaagaaagtggtgaatcccctgtttgagaaaaggcctaagaattttggc attggacaggacatccagcccaaaagagacctcacccgctttgtgaaatggccccgctat atcaggttgcagcggcagagagccatcctctataagcggctgaaagtgcctcctgcgatt aaccagttcacccaggccctggaccgccaaacagctactcagctgcttaagctggcccac aagtacaggccagagacaaagcaagagaagaagcagagactgttggcccgggccgagaag aaagctgctggcaaaggggacgtcccgaagaagagaccacctgtccttcgagcaggagtt aacactgtcaccaccttggtggagaacaagaaagctcagctggtggtgatttcacacgac atggatcccatcgagctggttgtcttcttgcctgccctatgtcgtaaaatgggggtccct tactgcattatcaaggggaaggcaagactgggacgtctagtccacaggaagacctgcacc actgtcgccttcacacaggtgaactcagaagacaaaggcgctttggctaagctggtggaa gctatcaggaccaattacaatgacagatacgatgagatccgccgtcactggggtggcaat gtcctgggtcctaagtctgtggctcgtatcgccaagcgcaaaaaggcaaaggctaaagaa cttgccactaaactgggttaa >gi568815584r:69785343_69986140|GENSCAN_predicted_peptide_6|607_aa MEHPPVLAVQRVALPSRSSEPELFLNPPTDEETEAHRGMYLDQGPRTKDWIPVAPGVREP SPLGPGPSSLWLLVVGGTVEEGSRKHRPTCKKDFTKSKTVDQAPTLCARHQMLNIQFLIS DRDPQCNLHCSRTQPKPICASDGRSYESMCEYQRAKCRDPTLGVVHRGRCKDAGQSKCRL ERAQALEQAKKPQEAVFVPECGEDGSFTQNTSCIHSSVYQIGENSSLTFLEKREESGHLQ RDWFPLWFPRKARVIGRRVKEEATKKRVWEVQCHTYTGYCWCVTPDGKPISGSSVQNKTP HIQILFTKARLALWACDLCSCYPEILNNSVFESDGTVKMRLQGDQGKALGPASSAYAMGW EEEAKSPTLNAGDKDLENLLPPNAGAPRRCGSGTGDTTTHQKTGTNGPDGLLLFPWAKMR KWLEREVEENLQRSSRLLSEGHRTRVVKAEELSLSGSIAVILCLQKERCEIQGLIILKFL RKGNGTRIAETILKKKTKVGGTILSDFKMNKARVLEIVWYLWSNRCMNQWNRIEDPETDP QTNGALAIGHPQTKQIKLTNRPQSLNLNLRPDMKMNSKWIVDLNVKCEAIKTFRRKQEKI FTTKNII >gi568815584r:69785343_69986140|GENSCAN_predicted_CDS_6|1824_bp atggagcatccgcctgttctggcagtccaaagagtggctttgccatcccgttcctcagaa cccgagctgttcctcaaccctccaacagatgaggaaactgaggctcatagaggtatgtat cttgatcaaggtccaaggaccaaggactggatcccagtggcacctggtgtaagagagcct tctcccctgggccctggcccatcatccctgtggctgctggtggtggggggcacagtggag gagggaagtcggaaacaccggcccacctgcaaaaaggacttcacaaagtcaaaaactgtt gaccaagcacctactctgtgtgccagacaccagatgctgaacatccagtttctaataagt gaccgtgacccacagtgcaacctccactgctccaggactcaacccaaacccatctgtgcc tctgatggcaggtcctacgagtccatgtgtgagtaccagcgagccaagtgccgagacccg accctgggcgtggtgcatcgaggtagatgcaaagatgctggccagagcaagtgtcgcctg gagcgggctcaagccctggagcaagccaagaagcctcaggaagctgtgtttgtcccagag tgtggcgaggatggctcctttacccagaacacaagctgcattcattcatcagtgtatcaa attggagagaacagcagcctcacatttctggagaaaagagaagaaagtgggcacctgcag agagactggtttcctctctggtttcctaggaaagcaagggtgattggtaggagagtgaaa gaagaagcaacaaagaagcgagtctgggaggtgcagtgccatacttacactgggtactgc tggtgtgtcaccccggatgggaagcccatcagtggctcttctgtgcagaataaaactcct cacatccagattctcttcactaaggcgaggctggctttgtgggcatgcgacctgtgcagt tgctatcctgaaattctgaataattctgtctttgagtctgatgggacagtgaagatgcga ttgcagggagaccaaggtaaagccctgggccctgcctccagtgcttatgccatgggatgg gaagaagaggcgaagtcacccactctgaatgcaggagataaggatctggagaacctgctg cccccaaatgcaggtgctccaagaagatgtggaagtggaacaggtgataccaccacccac cagaaaacagggaccaatgggccagatgggctactgctctttccctgggcaaaaatgaga aagtggttggaaagggaagtggaggaaaacctccagcggagcagcaggcttctctcagag ggccacagaactcgtgtggtaaaggccgaggaactgtcactctcaggttccattgcagtg atcctgtgcttgcagaaagaaagatgtgagattcaagggctgattattctcaagtttctt cggaaaggcaatggaactagaatagctgaaacaattttgaaaaagaaaactaaagtggga ggaaccattttatctgattttaagatgaataaagctagagttctagagatagtgtggtat ttgtggagtaacagatgcatgaatcaatggaacagaatagaggacccagaaacagatcca caaacaaatggtgctctagcaattggacatccacagaccaaacaaataaaattaacaaat cgaccccaaagcctcaacctaaacctcagacctgatatgaaaatgaattcaaaatggatc gtagacttaaatgtaaaatgtgaagctattaaaacttttagaagaaaacaagagaaaatc ttcaccaccaaaaacataatctag