GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:32:18 Sequence gi568815584r:35302016_35504644 : 202629 bp : 44.08% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5979 6073 95 1 2 86 95 32 0.805 3.38 1.02 Intr + 6899 6980 82 1 1 83 91 26 0.900 1.61 1.03 Intr + 8725 8880 156 2 0 43 81 141 0.981 8.88 1.04 Intr + 10866 11044 179 1 2 118 100 49 0.992 8.74 1.05 Intr + 12346 12440 95 0 2 78 109 38 0.949 3.66 1.06 Intr + 29937 30018 82 0 1 79 23 55 0.059 -2.26 1.07 Intr + 30539 30608 70 1 1 77 78 67 0.133 3.35 1.08 Intr + 37431 37514 84 1 0 67 67 55 0.494 1.09 1.09 Intr + 38785 38883 99 2 0 113 80 -11 0.286 0.68 1.10 Intr + 39713 39750 38 0 2 125 66 -14 0.173 -1.92 1.11 Term + 50700 50774 75 2 0 85 47 133 0.613 6.74 1.12 PlyA + 55295 55300 6 1.05 2.00 Prom + 56983 57022 40 -4.26 2.01 Init + 65839 65899 61 0 1 51 76 83 0.904 4.83 2.02 Term + 67283 67569 287 0 2 36 42 213 0.840 7.07 2.03 PlyA + 69105 69110 6 1.05 3.04 PlyA - 72901 72896 6 1.05 3.03 Term - 86328 86106 223 2 1 41 41 306 0.411 17.59 3.02 Intr - 86909 86330 580 0 1 39 20 500 0.378 30.06 3.01 Init - 88808 88724 85 2 1 69 100 68 0.566 7.38 3.00 Prom - 98025 97986 40 -7.56 4.08 PlyA - 98588 98583 6 1.05 4.07 Term - 100045 99998 48 1 0 107 53 58 0.853 1.50 4.06 Intr - 100648 100379 270 1 0 76 105 304 0.895 28.64 4.05 Intr - 100844 100756 89 0 2 105 77 115 0.916 11.79 4.04 Intr - 101345 101135 211 2 1 91 80 146 0.992 12.59 4.03 Intr - 101783 101675 109 1 1 132 85 216 0.999 25.99 4.02 Intr - 102857 102403 455 2 2 87 99 613 0.683 54.36 4.01 Init - 111724 111662 63 1 0 81 41 57 0.441 1.45 4.00 Prom - 112558 112519 40 -4.16 5.00 Prom + 118105 118144 40 -2.86 5.01 Sngl + 128353 128616 264 0 0 46 47 306 0.450 17.50 5.02 PlyA + 129298 129303 6 1.05 6.03 PlyA - 129392 129387 6 1.05 6.02 Term - 172605 172320 286 2 1 50 38 354 0.992 21.48 6.01 Init - 177977 177925 53 2 2 96 87 63 0.998 7.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 15234 15291 58 0 1 126 41 38 0.858 0.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:35302016_35504644|GENSCAN_predicted_peptide_1|351_aa XYAFKAINQGGLTSVAVRGKDCAVIVTQKKVPDKLLDSSTVTHLFKITENIGCVMTGMTA DSRSQVQRARYEAANWKYKYGYEIPVDMLCKRIADISQVYTQNAEMRPLGCCMILIGIDE EQGPQVYKCDPAGYYCGFKATAAGVKQTESTSFLEKKVKKKFDWTFEQTVETAITCLSTV LSIDFKPSEIEVGVVTVENPKFRACGSSTGPCCSPLSLRGLSHIQRYEGNELIQVLIKRE GKKLSVKMAVLAFWFLLEYSSLSTDCIAISYNCLEAKDLGTGAGNTGVSHRGGHEKGLEI PFWKSKIHVNYVRAEFTTQEARPAGWEAFVESTKDKHSLAVKPVGACMLRT >gi568815584r:35302016_35504644|GENSCAN_predicted_CDS_1|1056_bp naatatgcttttaaggctattaaccagggtggccttacatcagtagctgtcagagggaaa gactgtgcagtaattgtcacacagaagaaagtacctgacaaattattggattccagcaca gtgactcacttattcaagataactgaaaacattggttgtgtgatgaccggaatgacagct gacagcagatcccaggtacagagggcacgctatgaggcagctaactggaaatacaagtat ggctatgagattcctgtggacatgctgtgtaaaagaattgccgatatttctcaggtctac acacagaatgctgaaatgaggcctcttggttgttgtatgattttaattggtatagatgaa gagcaaggccctcaggtatataagtgtgatcctgcaggttactactgtgggtttaaagcc actgcagcgggagttaaacaaactgagtcaaccagcttccttgaaaaaaaagtgaagaag aaatttgattggacatttgaacagacagtggaaactgcaattacatgcctgtctactgtt ctatcaattgatttcaaaccttcagaaatagaagttggagtagtgacagttgaaaatcct aaattcagggcttgtggcagctccaccgggccctgctgcagccccttgtcattgagaggc ctctcccacatccagagatacgaggggaatgaacttatacaagtcctcatcaagagggag ggcaaaaaattgtcggtgaagatggctgtgctggcattttggtttcttctggaatattca tctttgtccactgactgcatcgccataagctataactgcttagaggccaaggacttaggc acaggtgctgggaatacaggcgtgagccaccgcggcggccatgaaaagggcttggagatt cctttctggaaatctaaaatccatgtgaattatgttagggcagagttcaccacacaggag gcaaggccagcgggctgggaggcctttgtggagagcaccaaggacaagcatagcctggct gttaaacccgtgggcgcctgcatgctacgcacctga >gi568815584r:35302016_35504644|GENSCAN_predicted_peptide_2|115_aa MGPLISPGTFPYIRLQLEAFALTLVAAPRWALAFVNGSFIKLSPNTHGMRVTADSFLPVT PAMCTKSISDPLLTPPDPVKKASMNATLHAGVSGPARSQLSGFEGTDGPGVWQCE >gi568815584r:35302016_35504644|GENSCAN_predicted_CDS_2|348_bp atggggccccttatctcccccggcactttcccctacatccggctgcaactagaagctttc gcactaaccctcgtggctgccccacgctgggccctggcctttgtaaacggttcctttatt aaactctctccaaatacccacggcatgcgtgtgacagcggactccttcctgcccgtcaca ccggcgatgtgcacaaaaagtatctcagaccctctcctcaccccacctgatcccgtcaaa aaggcctccatgaacgccaccctccatgctggggtctcggggcctgcaagatcccagctc agtggatttgaaggaactgatgggcctggagtctggcagtgtgaatga >gi568815584r:35302016_35504644|GENSCAN_predicted_peptide_3|295_aa MKRRAVMGPPQVQAWKALDLWESSQVMEGKAVVPMGRNPMMRKATQGQLENSPALEKLLP PLQGNVGFAFTKEDLTEVRDLLLANKVPAATRAGAIGPCEVTVPAQNTGLGPEKISFFQA LGITTKISRGTTEILSDVQLIKTGDRVGASEATLLNTPNISPFSFGLVIQQVFNNGSIYN PEVLDITEETLYSGFLEGVRNVASVSANWLPTCCISTPFYHHGYKGCLALSVETDYTFPP AEKIKTFLADPSAFVAAAPVATTATAAPAAAAAPTKFENEESEELDEDMGFGLFD >gi568815584r:35302016_35504644|GENSCAN_predicted_CDS_3|888_bp atgaaacggagggctgtcatgggcccaccccaggtgcaagcctggaaggccctggacctc tgggagtcctctcaagtcatggaagggaaggccgtggtgccaatgggcaggaaccccatg atgcgcaaggccacccaaggacaactggaaaacagcccagctctggagaaactgttgcct cctctgcaggggaatgtgggctttgcgttcaccaaggaggacctcactgaggtcagggat ctgctgctggccaataaggtgccagctgccacccgtgctggtgccattggcccatgtgaa gtcactgtaccagcccagaacactggtctgggacccgagaagatttcttttttccaggct ttaggcattaccactaaaatctccaggggcaccactgaaatcctgagtgatgtgcagctg atcaagactggagacagagtgggagccagcgaagccacactgctgaacacgccgaacatc tctcccttctcctttgggctggtcatccagcaggtgttcaacaatggtagtatctacaac cctgaagtgcttgacatcacagaagaaactctgtattctggcttcctggagggtgtccgc aatgttgccagtgtgtctgcaaactggctacccacctgttgcatcagtacccccttctat catcatgggtacaaaggatgcctggctctatctgtggagactgattacaccttcccgcct gctgaaaagatcaagaccttcttggctgatccatctgcctttgtggctgctgcccctgtg gccactaccgccacggctgctcctgctgctgcggcagccccaactaagtttgaaaacgag gagtcggaggagttggatgaggatatgggatttggtctctttgactaa >gi568815584r:35302016_35504644|GENSCAN_predicted_peptide_4|414_aa MVQLSPHYRGDSEDGRKRMVRQRTKPVLFFWSDWLGNSPSLTPPQRNPQPAFIGRRGGAA EPTAVRAAVPPASAPARKQRAARGPAHPQQRPQLVRAMFQAAERPQEWAMEGPRDGLKKE RLLDDRHDSGLDSMKDEEYEQMVKELQEIRLEPQEVPRGSEPWKQQLTEDGDSFLHLAII HEEKALTMEVIRQVKGDLAFLNFQNNLQQTPLHLAVITNQPEIAEALLGAGCDPELRDFR GNTPLHLACEQGCLASVGVLTQSCTTPHLHSILKATNYNGHTCLHLASIHGYLGIVELLV SLGADVNAQEPCNGRTALHLAVDLQNPDLVSLLLKCGADVNRVTYQGYSPYQLTWGRPST RIQQQLGQLTLENLQMLPESEDEESYDTESEFTEFTEDELPYDDCVFGGQRLTL >gi568815584r:35302016_35504644|GENSCAN_predicted_CDS_4|1245_bp atggtacaactcagcccgcactaccgtggggacagtgaagatggaaggaagaggatggtc cggcagaggacgaagccagttctctttttctggtctgactggcttggaaattccccgagc ctgaccccgccccagagaaatccccagccagcgtttatagggcgccgcggcggcgctgca gagcccacagcagtccgtgccgccgtcccgcccgccagcgccccagcgaggaagcagcgc gcagcccgcggcccagcgcacccgcagcagcgcccgcagctcgtccgcgccatgttccag gcggccgagcgcccccaggagtgggccatggagggcccccgcgacgggctgaagaaggag cggctactggacgaccgccacgacagcggcctggactccatgaaagacgaggagtacgag cagatggtcaaggagctgcaggagatccgcctcgagccgcaggaggtgccgcgcggctcg gagccctggaagcagcagctcaccgaggacggggactcgttcctgcacttggccatcatc catgaagaaaaggcactgaccatggaagtgatccgccaggtgaagggagacctggccttc ctcaacttccagaacaacctgcagcagactccactccacttggctgtgatcaccaaccag ccagaaattgctgaggcacttctgggagctggctgtgatcctgagctccgagactttcga ggaaatacccccctacaccttgcctgtgagcagggctgcctggccagcgtgggagtcctg actcagtcctgcaccaccccgcacctccactccatcctgaaggctaccaactacaatggc cacacgtgtctacacttagcctctatccatggctacctgggcatcgtggagcttttggtg tccttgggtgctgatgtcaatgctcaggagccctgtaatggccggactgcccttcacctc gcagtggacctgcaaaatcctgacctggtgtcactcctgttgaagtgtggggctgatgtc aacagagttacctaccagggctattctccctaccagctcacctggggccgcccaagcacc cggatacagcagcagctgggccagctgacactagaaaaccttcagatgctgccagagagt gaggatgaggagagctatgacacagagtcagagttcacggagttcacagaggacgagctg ccctatgatgactgtgtgtttggaggccagcgtctgacgttatga >gi568815584r:35302016_35504644|GENSCAN_predicted_peptide_5|87_aa MKLFAQLEIKRKEREAKEMHERKRQREEEIEAQEKAKREREWQKNFEEGRDGRVDSWRNF QVNTKGKKEKKNRTFLRPPKVKMEQHE >gi568815584r:35302016_35504644|GENSCAN_predicted_CDS_5|264_bp atgaaactctttgctcagctggaaattaaaaggaaagagagagaagccaaagagatgcat gaaaggaaacggcaaagggaagaagagattgaagctcaagaaaaagccaaacgagaaaga gagtggcagaaaaactttgaggaaggtcgagatggtcgtgtggacagctggcgaaacttc caagtcaatacaaaggggaagaaagagaagaaaaatcggaccttcctgagaccaccgaaa gtaaaaatggagcagcatgagtga >gi568815584r:35302016_35504644|GENSCAN_predicted_peptide_6|112_aa MPSLEVSEREKSDVPAERRRRRRRGKEKEGRGGKRKKRKEEEEEEKEEEEEEEEEEEEEE GEGEEERRMRRRRKKKEEGGGGGRRRRKKKKKKKKKLLHYCYYCRYHHIFSV >gi568815584r:35302016_35504644|GENSCAN_predicted_CDS_6|339_bp atgcccagcctggaagtttcagaaagagaaaaatcggatgttcctgcagaaagaagaagg aggaggaggagggggaaggagaaggagggaagaggaggtaagaggaagaagaggaaggaa gaggaagaagaggaaaaggaggaagaggaggaagaagaagaagaggaggaagaggaagaa ggagaaggagaagaagaaagaagaatgaggaggaggaggaagaagaaagaagaaggagga ggaggaggaagaagaagaagaaagaagaagaagaagaagaagaagaaactactacactac tgctactactgccgctaccaccacattttttctgtctga