GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:44:49 Sequence gi568815584f:35192477_35417303 : 224827 bp : 44.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1050 1045 6 1.05 1.02 Term - 26778 26675 104 1 2 81 45 86 0.625 2.04 1.01 Init - 29329 29242 88 1 1 89 40 73 0.674 3.40 1.00 Prom - 36560 36521 40 -4.76 2.05 PlyA - 36853 36848 6 1.05 2.04 Term - 43083 42788 296 1 2 49 50 264 0.556 14.17 2.03 Intr - 49372 49246 127 1 1 71 105 -2 0.439 0.05 2.02 Intr - 49562 49506 57 2 0 106 70 13 0.268 0.38 2.01 Init - 52702 52700 3 1 0 81 101 0 0.372 0.60 2.00 Prom - 54905 54866 40 -3.36 3.00 Prom + 62278 62317 40 -2.86 3.01 Init + 73046 73114 69 1 0 74 37 35 0.505 -1.95 3.02 Intr + 74251 74399 149 0 2 81 103 106 0.773 10.43 3.03 Intr + 77982 78120 139 0 1 57 95 103 0.820 8.37 3.04 Intr + 99858 99947 90 2 0 39 54 125 0.009 4.39 3.05 Intr + 116438 116519 82 1 1 83 91 26 0.896 1.61 3.06 Intr + 118264 118419 156 2 0 43 81 141 0.981 8.88 3.07 Intr + 120405 120583 179 1 2 118 100 49 0.992 8.74 3.08 Intr + 121885 121979 95 0 2 78 109 38 0.949 3.66 3.09 Intr + 139476 139557 82 0 1 79 23 55 0.059 -2.26 3.10 Intr + 140078 140147 70 1 1 77 78 67 0.133 3.35 3.11 Intr + 146970 147053 84 1 0 67 67 55 0.494 1.09 3.12 Intr + 148324 148422 99 2 0 113 80 -11 0.286 0.68 3.13 Intr + 149252 149289 38 0 2 125 66 -14 0.173 -1.92 3.14 Term + 160239 160313 75 2 0 85 47 133 0.613 6.74 3.15 PlyA + 164834 164839 6 1.05 4.00 Prom + 166522 166561 40 -4.26 4.01 Init + 175378 175438 61 0 1 51 76 83 0.904 4.83 4.02 Term + 176822 177108 287 0 2 36 42 213 0.840 7.07 4.03 PlyA + 178644 178649 6 1.05 5.04 PlyA - 182440 182435 6 1.05 5.03 Term - 195867 195645 223 2 1 41 41 306 0.411 17.59 5.02 Intr - 196448 195869 580 0 1 39 20 500 0.378 30.06 5.01 Init - 198347 198263 85 2 1 69 100 68 0.566 7.38 5.00 Prom - 207564 207525 40 -7.56 6.08 PlyA - 208127 208122 6 1.05 6.07 Term - 209584 209537 48 1 0 107 53 58 0.853 1.50 6.06 Intr - 210187 209918 270 1 0 76 105 304 0.895 28.64 6.05 Intr - 210383 210295 89 0 2 105 77 115 0.916 11.79 6.04 Intr - 210884 210674 211 2 1 91 80 146 0.992 12.59 6.03 Intr - 211322 211214 109 1 1 132 85 216 0.999 25.99 6.02 Intr - 212396 211942 455 2 2 87 99 613 0.657 54.36 6.01 Init - 221263 221201 63 1 0 81 41 57 0.334 1.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 115518 115612 95 1 2 86 95 32 0.825 3.38 S.002 Term + 124773 124830 58 0 1 126 41 38 0.858 0.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:35192477_35417303|GENSCAN_predicted_peptide_1|63_aa MTVIFKNEENMVLEYQNNRSEKGHDHGRPVAFLAGSGLGNTSGRRLYGEPYWLGVPYLIT GQE >gi568815584f:35192477_35417303|GENSCAN_predicted_CDS_1|192_bp atgactgtgattttcaaaaacgaagagaatatggttcttgaataccagaacaaccgctca gaaaaggggcatgaccatgggaggccagtcgcattcctggcaggcagtggactgggaaac accagtggtcgcagactgtatggagaaccttactggctgggcgtgccctatctgatcaca ggacaggagtaa >gi568815584f:35192477_35417303|GENSCAN_predicted_peptide_2|160_aa MVLPEARVDIRVNKTWSLPSIKESANAKAQMSEMLRYVERITRILGPVECTVQGGREEKL RRGVNTITTLEENKKAQLVVTAHEVDPIKLVVFLLALCHKMGVPYCIIKGKARLGHLVHR KTCTTVNFTQVNSEDKDTLAKLVEAIRTNYNDRYDEIRHH >gi568815584f:35192477_35417303|GENSCAN_predicted_CDS_2|483_bp atggtactacctgaagcacgggtggatataagggtaaacaagacatggtccctgccctca attaaagaatcagcaaatgcaaaggcacaaatgagtgaaatgttacggtatgttgagaga ataacaagaattttaggaccagtggagtgtacagtgcagggtgggagagaagaaaagctg aggagaggagttaacaccatcaccaccttggaggagaacaagaaggctcagctggtggtg actgcacatgaagtggatcccatcaagctagttgtcttcctgcttgccttgtgtcataaa atgggggtcccttactgcattatcaaggggaaggcaagactgggacatctagtccacagg aagacctgcaccactgtcaactttacacaggttaactcggaagacaaagacactttggct aagctggtggaagctatcaggaccaattacaatgacagatatgatgaaatccgccatcac tag >gi568815584f:35192477_35417303|GENSCAN_predicted_peptide_3|468_aa MKWRPVGSGENGDKVTDWKVGYRLLNVVSQLAKRNLRLLVLGRKHMLRRSSQWSRDEMEE VQKQASCFFADDMFITRDLMRDHKACLPDAKTQRLFFKWQQGHQLAIVNRFPGSKLTFQS REFGMQERKKRGWYPGSSRCNFREVLVCLDKLLDSSTVTHLFKITENIGCVMTGMTADSR SQVQRARYEAANWKYKYGYEIPVDMLCKRIADISQVYTQNAEMRPLGCCMILIGIDEEQG PQVYKCDPAGYYCGFKATAAGVKQTESTSFLEKKVKKKFDWTFEQTVETAITCLSTVLSI DFKPSEIEVGVVTVENPKFRACGSSTGPCCSPLSLRGLSHIQRYEGNELIQVLIKREGKK LSVKMAVLAFWFLLEYSSLSTDCIAISYNCLEAKDLGTGAGNTGVSHRGGHEKGLEIPFW KSKIHVNYVRAEFTTQEARPAGWEAFVESTKDKHSLAVKPVGACMLRT >gi568815584f:35192477_35417303|GENSCAN_predicted_CDS_3|1407_bp atgaagtggaggccagtgggaagtggagagaatggggataaagtaactgactggaaagta ggctacaggctcttgaatgtcgtctctcaactagccaaacggaatctgcgactgctggtc ctaggccggaagcacatgctaagacggagttcccagtggagtcgggatgagatggaagag gtgcaaaagcaagccagctgtttttttgctgatgacatgtttatcacaagagacctgatg cgggaccacaaggcctgtctgcctgatgccaagacccaacgcctgttttttaagtggcag cagggacatcagctggcaattgtaaataggtttccaggatcaaaactaacctttcagagc cgtgagttcggcatgcaagagcggaagaaacgcggctggtaccccggaagcagtcgctgc aacttccgggaggtgcttgtgtgcctggacaaattattggattccagcacagtgactcac ttattcaagataactgaaaacattggttgtgtgatgaccggaatgacagctgacagcaga tcccaggtacagagggcacgctatgaggcagctaactggaaatacaagtatggctatgag attcctgtggacatgctgtgtaaaagaattgccgatatttctcaggtctacacacagaat gctgaaatgaggcctcttggttgttgtatgattttaattggtatagatgaagagcaaggc cctcaggtatataagtgtgatcctgcaggttactactgtgggtttaaagccactgcagcg ggagttaaacaaactgagtcaaccagcttccttgaaaaaaaagtgaagaagaaatttgat tggacatttgaacagacagtggaaactgcaattacatgcctgtctactgttctatcaatt gatttcaaaccttcagaaatagaagttggagtagtgacagttgaaaatcctaaattcagg gcttgtggcagctccaccgggccctgctgcagccccttgtcattgagaggcctctcccac atccagagatacgaggggaatgaacttatacaagtcctcatcaagagggagggcaaaaaa ttgtcggtgaagatggctgtgctggcattttggtttcttctggaatattcatctttgtcc actgactgcatcgccataagctataactgcttagaggccaaggacttaggcacaggtgct gggaatacaggcgtgagccaccgcggcggccatgaaaagggcttggagattcctttctgg aaatctaaaatccatgtgaattatgttagggcagagttcaccacacaggaggcaaggcca gcgggctgggaggcctttgtggagagcaccaaggacaagcatagcctggctgttaaaccc gtgggcgcctgcatgctacgcacctga >gi568815584f:35192477_35417303|GENSCAN_predicted_peptide_4|115_aa MGPLISPGTFPYIRLQLEAFALTLVAAPRWALAFVNGSFIKLSPNTHGMRVTADSFLPVT PAMCTKSISDPLLTPPDPVKKASMNATLHAGVSGPARSQLSGFEGTDGPGVWQCE >gi568815584f:35192477_35417303|GENSCAN_predicted_CDS_4|348_bp atggggccccttatctcccccggcactttcccctacatccggctgcaactagaagctttc gcactaaccctcgtggctgccccacgctgggccctggcctttgtaaacggttcctttatt aaactctctccaaatacccacggcatgcgtgtgacagcggactccttcctgcccgtcaca ccggcgatgtgcacaaaaagtatctcagaccctctcctcaccccacctgatcccgtcaaa aaggcctccatgaacgccaccctccatgctggggtctcggggcctgcaagatcccagctc agtggatttgaaggaactgatgggcctggagtctggcagtgtgaatga >gi568815584f:35192477_35417303|GENSCAN_predicted_peptide_5|295_aa MKRRAVMGPPQVQAWKALDLWESSQVMEGKAVVPMGRNPMMRKATQGQLENSPALEKLLP PLQGNVGFAFTKEDLTEVRDLLLANKVPAATRAGAIGPCEVTVPAQNTGLGPEKISFFQA LGITTKISRGTTEILSDVQLIKTGDRVGASEATLLNTPNISPFSFGLVIQQVFNNGSIYN PEVLDITEETLYSGFLEGVRNVASVSANWLPTCCISTPFYHHGYKGCLALSVETDYTFPP AEKIKTFLADPSAFVAAAPVATTATAAPAAAAAPTKFENEESEELDEDMGFGLFD >gi568815584f:35192477_35417303|GENSCAN_predicted_CDS_5|888_bp atgaaacggagggctgtcatgggcccaccccaggtgcaagcctggaaggccctggacctc tgggagtcctctcaagtcatggaagggaaggccgtggtgccaatgggcaggaaccccatg atgcgcaaggccacccaaggacaactggaaaacagcccagctctggagaaactgttgcct cctctgcaggggaatgtgggctttgcgttcaccaaggaggacctcactgaggtcagggat ctgctgctggccaataaggtgccagctgccacccgtgctggtgccattggcccatgtgaa gtcactgtaccagcccagaacactggtctgggacccgagaagatttcttttttccaggct ttaggcattaccactaaaatctccaggggcaccactgaaatcctgagtgatgtgcagctg atcaagactggagacagagtgggagccagcgaagccacactgctgaacacgccgaacatc tctcccttctcctttgggctggtcatccagcaggtgttcaacaatggtagtatctacaac cctgaagtgcttgacatcacagaagaaactctgtattctggcttcctggagggtgtccgc aatgttgccagtgtgtctgcaaactggctacccacctgttgcatcagtacccccttctat catcatgggtacaaaggatgcctggctctatctgtggagactgattacaccttcccgcct gctgaaaagatcaagaccttcttggctgatccatctgcctttgtggctgctgcccctgtg gccactaccgccacggctgctcctgctgctgcggcagccccaactaagtttgaaaacgag gagtcggaggagttggatgaggatatgggatttggtctctttgactaa >gi568815584f:35192477_35417303|GENSCAN_predicted_peptide_6|414_aa MVQLSPHYRGDSEDGRKRMVRQRTKPVLFFWSDWLGNSPSLTPPQRNPQPAFIGRRGGAA EPTAVRAAVPPASAPARKQRAARGPAHPQQRPQLVRAMFQAAERPQEWAMEGPRDGLKKE RLLDDRHDSGLDSMKDEEYEQMVKELQEIRLEPQEVPRGSEPWKQQLTEDGDSFLHLAII HEEKALTMEVIRQVKGDLAFLNFQNNLQQTPLHLAVITNQPEIAEALLGAGCDPELRDFR GNTPLHLACEQGCLASVGVLTQSCTTPHLHSILKATNYNGHTCLHLASIHGYLGIVELLV SLGADVNAQEPCNGRTALHLAVDLQNPDLVSLLLKCGADVNRVTYQGYSPYQLTWGRPST RIQQQLGQLTLENLQMLPESEDEESYDTESEFTEFTEDELPYDDCVFGGQRLTL >gi568815584f:35192477_35417303|GENSCAN_predicted_CDS_6|1245_bp atggtacaactcagcccgcactaccgtggggacagtgaagatggaaggaagaggatggtc cggcagaggacgaagccagttctctttttctggtctgactggcttggaaattccccgagc ctgaccccgccccagagaaatccccagccagcgtttatagggcgccgcggcggcgctgca gagcccacagcagtccgtgccgccgtcccgcccgccagcgccccagcgaggaagcagcgc gcagcccgcggcccagcgcacccgcagcagcgcccgcagctcgtccgcgccatgttccag gcggccgagcgcccccaggagtgggccatggagggcccccgcgacgggctgaagaaggag cggctactggacgaccgccacgacagcggcctggactccatgaaagacgaggagtacgag cagatggtcaaggagctgcaggagatccgcctcgagccgcaggaggtgccgcgcggctcg gagccctggaagcagcagctcaccgaggacggggactcgttcctgcacttggccatcatc catgaagaaaaggcactgaccatggaagtgatccgccaggtgaagggagacctggccttc ctcaacttccagaacaacctgcagcagactccactccacttggctgtgatcaccaaccag ccagaaattgctgaggcacttctgggagctggctgtgatcctgagctccgagactttcga ggaaatacccccctacaccttgcctgtgagcagggctgcctggccagcgtgggagtcctg actcagtcctgcaccaccccgcacctccactccatcctgaaggctaccaactacaatggc cacacgtgtctacacttagcctctatccatggctacctgggcatcgtggagcttttggtg tccttgggtgctgatgtcaatgctcaggagccctgtaatggccggactgcccttcacctc gcagtggacctgcaaaatcctgacctggtgtcactcctgttgaagtgtggggctgatgtc aacagagttacctaccagggctattctccctaccagctcacctggggccgcccaagcacc cggatacagcagcagctgggccagctgacactagaaaaccttcagatgctgccagagagt gaggatgaggagagctatgacacagagtcagagttcacggagttcacagaggacgagctg ccctatgatgactgtgtgtttggaggccagcgtctgacgttatga