GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:36:29 Sequence gi568815576r:44396840_44597556 : 200717 bp : 49.11% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4962 5025 64 0 1 99 109 27 0.683 4.62 1.02 Intr + 10461 10576 116 2 2 58 30 119 0.104 2.35 1.03 Intr + 12550 12794 245 1 2 40 34 119 0.263 -1.16 1.04 Intr + 38898 39081 184 1 1 25 98 162 0.616 9.75 1.05 Intr + 42388 42521 134 1 2 22 94 103 0.701 4.59 1.06 Intr + 44242 44349 108 2 0 116 74 17 0.501 3.36 1.07 Intr + 70617 70662 46 1 1 110 70 25 0.003 0.37 1.08 Intr + 75531 75654 124 0 1 17 37 100 0.005 -1.61 1.09 Term + 83016 83279 264 2 0 13 39 654 0.994 48.51 1.10 PlyA + 84110 84115 6 1.05 2.02 PlyA - 86623 86618 6 1.05 2.01 Sngl - 100717 99998 720 1 0 103 48 547 0.488 48.43 2.00 Prom - 118956 118917 40 -3.46 3.00 Prom + 121070 121109 40 -4.36 3.01 Init + 131602 131694 93 0 0 72 113 59 0.408 7.19 3.02 Term + 139460 139504 45 1 0 107 47 27 0.015 -2.29 3.03 PlyA + 140619 140624 6 1.05 4.05 PlyA - 140834 140829 6 1.05 4.04 Term - 141325 141194 132 1 0 36 39 169 0.770 4.89 4.03 Intr - 143893 143819 75 1 0 65 70 56 0.504 1.21 4.02 Intr - 147631 147578 54 1 0 91 94 44 0.592 4.48 4.01 Init - 153140 152979 162 2 0 69 26 135 0.448 5.13 4.00 Prom - 164555 164516 40 -6.66 5.00 Prom + 166627 166666 40 -5.66 5.01 Init + 168821 168828 8 1 2 55 70 0 0.267 -4.60 5.02 Intr + 170126 170209 84 2 0 88 65 109 0.954 7.54 5.03 Intr + 170411 170828 418 2 1 66 61 672 0.929 56.33 5.04 Term + 170848 171411 564 0 0 -24 48 935 0.893 72.69 5.05 PlyA + 171451 171456 6 1.05 6.00 Prom + 171868 171907 40 -6.86 6.01 Init + 174089 174189 101 1 2 53 41 97 0.609 1.23 6.02 Intr + 174341 174493 153 2 0 80 71 58 0.751 2.49 6.03 Intr + 174537 174653 117 0 0 46 105 103 0.930 7.38 6.04 Intr + 175770 175971 202 0 1 83 81 63 0.600 4.39 6.05 Intr + 183529 183690 162 0 0 92 36 76 0.320 2.97 6.06 Intr + 187065 187161 97 2 1 53 18 95 0.101 -1.42 6.07 Intr + 189620 189745 126 0 0 81 98 23 0.156 3.25 6.08 Intr + 193203 193408 206 1 2 2 80 118 0.302 1.32 6.09 Intr + 194067 194158 92 2 2 82 49 62 0.823 0.49 6.10 Term + 194719 194890 172 1 1 95 53 113 0.454 5.80 6.11 PlyA + 197393 197398 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 101257 101376 120 0 0 100 61 172 0.897 13.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:44396840_44597556|GENSCAN_predicted_peptide_1|428_aa XSKIQAQAGPSEFSFAVWESEQSSVGVYTLILLTRSLALWSALADGTAAETHEQTIRELT CREQPRRSLALLSPAALTPVLSQASLLQELAAADASSPSGREKSIMANPTPPVIPELCAV ERLAPNPLWPPPSTSLPLRTGAEFQIDNTLLESAFGKDAEVFGGAREHKARCLVRLRIYV ASASHFLPFSASDEENQVRLDSRGFSICVMAMPKVAHVETFSVAGTTLGPQHPPLIHTAI RRHHPSSPAAALSGENGDEGGGPGFHTTHSRLWLGLQEGRDRPYVRLDFQSNAEEKWGSR PQYSSPLKDATTRRHFGSREQPSPDTNPADTLILDLLASRTRLYRSVITTIITITIITTI TIIIITIITIINITIITIITIIIIIIAIITIIIITIITIIIIIMLEETFLKHLDDTWTWQ TWIQDDLR >gi568815576r:44396840_44597556|GENSCAN_predicted_CDS_1|1287_bp nnctcaaaaatccaggcacaggctggaccctcagaattctccttcgcggtttgggagtcg gagcagagttcagtgggggtctacactctcatcctgctgactcggagcctggccttgtgg tctgctttggcagatggaacagcagccgagacacacgagcagaccatccgcgagcttacg tgtagagagcagcccagacgaagtctggccctgctgagcccagcagccctcacacctgtg ctcagccaagccagcctgctgcaggagttggctgcggctgatgccagctcgccctcaggc cgagaaaaatccatcatggctaatcccactcccccagtgattcctgagctgtgtgccgtg gaaaggctggccccaaatcctctctggcctccccccagcaccagccttcctctgcgaacc ggggcggaatttcaaattgataacactctcttagaatcggcgtttggcaaggacgctgag gtttttggaggagcacgggaacataaagcaaggtgcctggtgcggctccggatttatgta gcctcagcatcccatttcttgccattttctgcttcggatgaagagaatcaagtacgtctg gattcccggggtttctccatctgtgtcatggcgatgccaaaggtagcccacgtggagacc ttctctgtggcaggcacaactctaggtcctcagcatcctcctttaatccacacggcaatt cgacgccatcaccccagttctccagccgcagctttgtcgggggagaatggggatgaaggt gggggccctggtttccataccacccacagcagactctggctcgggcttcaagagggcagg gaccggccctatgtaaggcttgattttcagtccaatgctgaagagaagtgggggagcagg ccacagtattcctccccactaaaggatgctacaacaagacgccattttggaagcagagag cagccctcaccagacaccaatcctgctgacaccttgatcttggaccttctggcctccaga actaggctatacaggagtgtcatcaccaccatcatcaccatcaccatcatcaccaccatc accatcatcatcatcaccatcatcaccatcatcaacatcaccatcatcaccatcatcacc atcatcattatcatcatcgccatcatcaccatcatcattatcaccataatcaccatcatc atcatcattatgctagaagagacgtttctcaaacatctggatgatacctggacgtggcag acatggatccaagatgatctcagatag >gi568815576r:44396840_44597556|GENSCAN_predicted_peptide_2|239_aa MVQPQTSKAESPALAASPNAQMDDVIDTLTSLRLTNSALRREASTLRAEKANLTNMLESV MAELTLLRTRARIPGALQITPPISSITSNGTRPMTTPPTSLPEPFSGDPGRLAGFLMQMD RFMIFQASRFPGEAERVAFLVSRLTGEAEKWAIPHMQPDSPLRNNYQGFLAELRRTYKSP LRHARRAQIRKTSASNRAVRERQMLCRQLASAGTGPCPVHPASNGTSPAPALPARARNL >gi568815576r:44396840_44597556|GENSCAN_predicted_CDS_2|720_bp atggtgcagccgcagacgtccaaagctgaaagcccagccttggcagcgtctccgaatgcc cagatggatgacgttattgacaccctgacctccctgcgcctcaccaactcggcgctgagg cgggaggcttccaccctgcgggcggagaaggccaatctcaccaacatgctggagagcgtg atggcagagctgaccttgttacgcaccagggcgcggatcccgggggctctgcagatcacc ccgcccatctcctcaattacttcaaacgggactcgacccatgaccacacctccaacctct ctgcccgagcccttttccggggacccaggccggttggcggggttcctgatgcagatggac agattcatgatcttccaggcctcccgcttcccgggtgaggccgagcgtgtggccttcctt gtgtctcgactgactggggaggcggagaagtgggctatcccccacatgcaacctgacagc cccttgcgcaacaactatcaggggttcctggcagagttgcggagaacctacaagtctccg ctccggcatgcgcggcgcgcccaaatcaggaagacttctgcctctaatagggctgtgcga gagaggcagatgctctgccgccagctggcctctgcgggcacggggccttgcccagtgcat ccagcttccaacgggactagtccagcgccagccctgcctgcccgagcacggaatctttaa >gi568815576r:44396840_44597556|GENSCAN_predicted_peptide_3|45_aa MLKAEMEWVKEAFLKALTFDLRPGRGAGTSHDQEPAPTARSWSQA >gi568815576r:44396840_44597556|GENSCAN_predicted_CDS_3|138_bp atgctgaaggctgagatggaatgggttaaggaagccttcttgaaggcgttgacctttgat ttgagacctggacgaggagcaggtaccagccacgaccaagaacctgcccccacggcccgt tcttggagccaggcttga >gi568815576r:44396840_44597556|GENSCAN_predicted_peptide_4|140_aa MDIGQWTQTLPLRKEEKEHLGHQLAPRHCGPQSGESLSLLGAPGESFEKGMLRGMAPGLS NDTMLSNADDAEAREALANASVFGGGHAGLPGRALCRANRFPVPLLTTFSLALKKSVFGS LLRRWRLLGDKRLSDAFNKT >gi568815576r:44396840_44597556|GENSCAN_predicted_CDS_4|423_bp atggatataggacaatggacacagacactcccactcaggaaagaggagaaggagcatctg gggcaccagcttgcaccgaggcactgtggaccacagtctggcgagagcctgagcctgctg ggggccccaggggagagcttcgagaagggaatgcttcgagggatggccccaggcctctca aatgacacaatgctgagcaatgctgatgatgcagaggccagagaggctcttgccaatgcc agtgtatttggtggaggccatgctggccttcctggaagagccctgtgcagagcaaacagg tttccagtcccgctgctgacgaccttctcattggcattgaaaaagagcgttttcggctcc ctgctcagacgctggcggcttctgggtgacaagcggctgtcagatgcttttaacaagact tag >gi568815576r:44396840_44597556|GENSCAN_predicted_peptide_5|357_aa MPSLSFTTCSTFSTNYRSLGPAQAPSYGTGRVRHLETKNRKLESKIWEHLEKKGPQVRDW SHYFKTIRNQRAQSLAITVDNACIVLQINNTHLAADDFRVKYETELAMCQSVESNIHGLC KVNDDTNVTRLQLETEIKALKEELLFMKKNHEEEVKGLQAQIASSGLTMEHLAKIMAAIR AQYDELAWKNGEELDKYLSQQIEESTTVVTTQSAKAGAAEMTLTELRCTVQSLEINLNSM RNLKASLENSVREVKACYTLQMEQLNGILLHLGSELAQTQAKGQCQAQEYEALLNIKVKL EAEIATYCHLLEDGKDFNLGDALDSSNSMQTIPKTTTHQRVDGKVVSETNDTKVLRH >gi568815576r:44396840_44597556|GENSCAN_predicted_CDS_5|1074_bp atgcccagcttgagcttcaccacttgctccacattctccaccaactaccgatccctgggc cctgcccaggcgcccagctacggcaccggccgagtgaggcacctggagaccaagaaccgg aagctggagagcaaaatctgggagcacctggagaagaagggaccccaggtcagagactgg agccattacttcaagaccatcaggaaccagagggctcagagcttggcaattactgtagac aatgcctgcattgttctgcagatcaacaacacccatcttgctgctgatgactttagagtc aagtatgagacagagctggccatgtgccagtctgtggagagcaacatccatggactctgt aaggtcaatgatgacaccaacgtcactcggctgcagctggagacagagatcaaggctctc aaggaggagctgctcttcatgaagaagaaccacgaagaggaagtaaaaggcctacaagcc cagattgctagctctgggttgaccatggagcacctcgccaagatcatggcagccatccgg gcccaatatgacgagctggcttggaagaacggagaggagctggacaagtacttgtctcag cagattgaggagagcaccacagtggtcaccacgcagtctgccaaggctggagctgctgag atgacgctcacagagctgagatgtacagtccagtccttggagatcaacctgaactccatg agaaatctgaaagccagtttggagaacagcgtgagggaggtgaaggcctgctacaccttg cagatggagcagctcaacgggatcctgctgcacctggggtcagagctggcacagacccag gcaaagggacagtgccaggcccaggagtatgaggccctgctgaacatcaaggtcaagctg gaggctgagatcgccacctactgccacctgctggaagatggcaaggacttcaatcttgga gatgccctggacagcagcaactccatgcaaaccattccaaagaccaccacccaccagaga gtggatggcaaagtggtgtctgagaccaacgacaccaaagttctgagacattaa >gi568815576r:44396840_44597556|GENSCAN_predicted_peptide_6|475_aa MLTIAKHDAGLIGEAKKSSSFQSLIREEKGHEGRAPGKTFSPTGLPLEYPLLPHWKVFCF LELGLPREEEKMVHVPREPMMCRPSPTHFTDEEMEAREMKLADFLWNFLWTSGGTSCQQS RLDWLSWVCPRGNGRGAGEQAEGTQPPETQLKVGHSHFRFVLLTNMDHRAKPRVRGGQAN PAPVKRAAPKKERKLRLGPGMPESGFEPTSFSPDPQVTLSTSLPPEGTGEQKPLEAIRVH GAMGQAEPPAALPYAAPSKPLMVPAIAPYCPRPSQTTATEAQSSFSVGSTHRQPHLSSSA LSCPSPDHLLPPPSKRTQQVPEETLELAERSDRPQARPSGPSSHASTGTKGLLHLAQLAN LTMQGLQPHLANGKAHIRLANLSEAIRQNLLQSLRHTVAYESGCFSAIKGWKQHLGQGSC PTASMNWEVKDASFRTLPGCPACDHTAADPSPRKALPHSAAGRPPIQPPVLARRL >gi568815576r:44396840_44597556|GENSCAN_predicted_CDS_6|1428_bp atgctaacaatagccaagcacgatgccggcctgattggagaggctaaaaaaagcagctca ttccaaagcctcatcagggaagaaaagggccacgaaggcagggcaccggggaagaccttc agccctactgggctaccccttgagtaccctctcctcccacattggaaggtcttctgcttc ctggagcttggcctgcctcgggaggaggagaaaatggtccatgtccccagagagcccatg atgtgccggcctagccctacccactttacggatgaggaaatggaggctcgggagatgaag ctggcagacttcctctggaacttcctctggacatctggagggacgtcctgtcagcagagc aggctggactggctgagctgggtgtgtcctcgtggcaatggcagaggtgcaggagagcaa gcagagggtacacagccccccgagactcagctcaaggtgggccacagccacttccgcttt gtcctgttgaccaacatggatcacagggccaagcccagagtgagaggtgggcaggccaac ccagccccggtgaaacgggcagctccaaagaaggagaggaagctgaggctggggccaggc atgccggagtcaggatttgaacccacatcattcagtcccgatccccaggtcaccctgagc acctctcttcctcccgaaggcactggagagcagaagcccttggaagccatcagagtccat ggagctatggggcaggcagagccaccagcagctctcccgtacgctgctccaagcaagccc ttaatggtccccgccatcgccccctactgcccccgacccagccagaccacagccactgaa gcacagagctccttctcagtgggaagcactcaccgccagccccacctttccagctcggcg ctctcatgccccagccccgaccatctcttgccccctccttccaaacgaacacagcaagtc ccagaggagaccctcgagttagcagagaggtcggacagacctcaggccagaccttctggg ccctccagccatgcaagcactggcaccaagggtctgctccatcttgcacagttagccaac ctcacaatgcaagggctacagccccacttggcaaatggcaaagctcacatcagactggcc aacttgtctgaggctatacggcaaaaccttctgcagagcctgaggcacaccgtggcctat gagagcgggtgctttagtgcaatcaagggctggaagcaacacttgggccaaggcagctgc ccaaccgcctccatgaactgggaggtaaaagacgcctccttccgcaccctgccgggctgt cctgcctgtgaccacactgctgccgacccaagtcctcgcaaggccctgccccactctgca gcaggcaggcccccaattcagccccctgtccttgcccggagactctga