GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:01:29 Sequence gi568815580r:47741830_47996756 : 254927 bp : 40.82% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5902 6089 188 0 2 65 64 108 0.346 4.90 1.02 Term + 6765 7275 511 0 1 28 38 290 0.511 10.86 1.03 PlyA + 7885 7890 6 -1.75 2.03 PlyA - 10619 10614 6 -0.45 2.02 Term - 11332 11235 98 2 2 85 50 95 0.488 2.55 2.01 Init - 16177 15901 277 1 1 78 23 239 0.586 13.64 2.00 Prom - 18840 18801 40 -8.35 3.00 Prom + 27882 27921 40 -4.25 3.01 Init + 29326 29580 255 0 0 53 110 96 0.489 5.58 3.02 Intr + 32647 32699 53 0 2 75 55 76 0.411 -0.21 3.03 Intr + 38245 38410 166 1 1 42 44 149 0.299 4.94 3.04 Intr + 53457 53865 409 2 1 60 81 184 0.219 7.81 3.05 Intr + 55291 55418 128 2 2 63 73 73 0.797 2.78 3.06 Intr + 55550 55654 105 1 0 82 91 93 0.993 8.49 3.07 Intr + 56527 56647 121 0 1 108 -3 101 0.680 2.15 3.08 Intr + 57648 57962 315 1 0 124 56 90 0.826 4.51 3.09 Intr + 60954 61062 109 1 1 72 60 111 0.716 5.12 3.10 Intr + 67215 67331 117 0 0 74 34 127 0.785 4.56 3.11 Intr + 82241 82365 125 2 2 64 62 118 0.017 6.11 3.12 Term + 99168 99301 134 1 2 55 38 124 0.314 1.37 3.13 PlyA + 99317 99322 6 -0.45 4.15 PlyA - 99436 99431 6 1.05 4.14 Term - 101347 101133 215 2 2 73 49 175 0.872 8.51 4.13 Intr - 103971 103834 138 1 0 100 80 33 0.893 3.21 4.12 Intr - 106858 106646 213 2 0 117 77 96 0.993 9.16 4.11 Intr - 109498 109445 54 2 0 78 106 12 0.503 0.03 4.10 Intr - 123304 123230 75 2 0 56 98 77 0.935 4.07 4.09 Intr - 127607 127414 194 1 2 123 91 168 0.996 18.81 4.08 Intr - 142135 141899 237 0 0 77 58 105 0.341 2.21 4.07 Intr - 154980 154692 289 1 1 40 115 236 0.121 16.88 4.06 Intr - 162682 162564 119 0 2 75 108 37 0.009 3.59 4.05 Intr - 170180 170050 131 2 2 70 30 105 0.002 1.47 4.04 Intr - 188617 188532 86 2 2 114 103 76 0.203 10.32 4.03 Intr - 189711 189526 186 1 0 112 60 183 0.533 16.64 4.02 Intr - 191437 191356 82 1 1 87 -17 61 0.119 -6.21 4.01 Init - 194333 194217 117 2 0 78 74 187 0.487 16.55 4.00 Prom - 202093 202054 40 -7.15 5.04 PlyA - 204063 204058 6 1.05 5.03 Term - 213737 213587 151 1 1 78 49 239 0.899 15.40 5.02 Intr - 214669 214399 271 2 1 59 41 215 0.623 9.58 5.01 Init - 216600 216447 154 0 1 85 91 48 0.847 4.99 5.00 Prom - 221818 221779 40 -4.35 6.00 Prom + 225821 225860 40 -7.45 6.01 Init + 232968 233024 57 2 0 105 78 45 0.691 6.56 6.02 Term + 238133 238249 117 1 0 122 49 65 0.775 3.56 6.03 PlyA + 238518 238523 6 1.05 7.04 PlyA - 239042 239037 6 1.05 7.03 Term - 242295 242002 294 0 0 45 55 207 0.703 7.42 7.02 Intr - 242744 242634 111 2 0 81 97 31 0.696 2.96 7.01 Init - 245100 245047 54 0 0 107 70 18 0.630 3.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 73049 73270 222 1 0 71 42 163 0.828 4.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:47741830_47996756|GENSCAN_predicted_peptide_1|232_aa MGRSEGTRGTLSWSQLAQSHGGGEVNQPEFLLYSLPADRNKGQEEQALQLKGSLGGPSQI PCRVTLEVPEGPEDLEGPKDDSGRDSVPDPTIWFVSALGVSCKTPHRGSQFPHPSCFRRS GIRRIRGHTLGKSLTVSQMSEGTLWPPGSQICPPHEPRPRSSSAVRLNVNTQGKLQGKHT ADKNLMLYVTKENEYLIRAFMRNHITRQPHAAPLPSPAPGSEAVKRLATSSS >gi568815580r:47741830_47996756|GENSCAN_predicted_CDS_1|699_bp atgggacgcagcgaaggcacccgtgggaccctgtcctggtcccagcttgcccagtctcac ggtggtggtgaagtgaaccaacctgagtttcttctctattctctgcctgctgacaggaac aaaggccaggaggaacaggccttacagctaaaaggttcccttggaggcccatctcaaatt ccctgcagggtgacgctggaggttcccgaggggcctgaagacttagaaggtcccaaagat gacagtggtagagactccgtgccggaccccacaatctggtttgtgtcagccctgggagtg tcatgcaagacccctcaccgcgggtctcagtttccccatccatcctgcttccgtcgcagc ggtattagaaggatccggggacacacgcttgggaaaagtctaacggtgtcacagatgtca gagggaaccctctggcctccaggatcacagatctgtcctccccacgagccaaggcccagg agttcctctgcagtccgcctgaacgttaacacgcagggaaaacttcagggaaaacataca gctgataaaaatttaatgctatatgtaacaaaggagaatgaatatttaatccgtgctttt atgcgaaaccacataaccaggcagccccacgcggcgcccctcccctcgcccgcgcccggc agcgaagcagttaagcggctggcaacaagcagctcctaa >gi568815580r:47741830_47996756|GENSCAN_predicted_peptide_2|124_aa MGWTPALTNLTFWTLLIVDQSGKQRSMAGGGLAEEQPERSDWAHDPDHYLPSHLPRTAGP VLGVEILEVLPAEDTVLTDGLALAFWVSCRDWGHSGWHVAGGGIEAGTWGKCGGQLEATA MVQV >gi568815580r:47741830_47996756|GENSCAN_predicted_CDS_2|375_bp atgggctggacccctgctctcacgaaccttacattctggacactcctgattgtggaccag agcgggaagcagaggagcatggctggaggaggccttgcagaggagcagcctgagaggagt gattgggcccatgaccctgatcattatctgccatctcatctccccagaacagctggccct gttttaggggttgagatcctggaggttctgccggccgaggacacagtactcacagatggc ctcgcccttgccttctgggtttcgtgcagggattggggtcattctggctggcatgtggca ggtggagggatagaggcggggacatggggcaagtgtggaggacagttagaagctactgca atggtccaggtgtga >gi568815580r:47741830_47996756|GENSCAN_predicted_peptide_3|678_aa MNGATKRKAFPPRIAMNFTQTHSILHSQVSTETARPRHRTHEHCSQETHSCWDSRQNQGI LAPLCLSGEPGSCLSSSEHLRILVQPPILLTNVEALYSRTPEKGSTKVKAKNNKKKTPDG SRGTAFPKSGRCHFPKTTAATFLIPHGFPDYRTTLLRKVLLSGHGWVGAESLPHLQRQWP LSERKAAKSTVAARRTRGPPNSGHHLDVLQAKSQRKEQWDPPLPRSQSPTPRPVLVTHSS PGSGLWRKWPRSAPEELCRDKPQEGPWSPGRKNRARIKITTVGNSNNLQGTWVSALAPLG KKNEHWQCGVPEKPPLWPASGWRRESRVGVTEHCGQNAHANCPREPDTAGHRELPVTAAG SKGSSFAQYPLQARANCLQVPDAALGSRAQIPRVTGHILAPAWLKGPGETDWHKHAPLPP PPWTYANNPARKFPLRQEPTFQGFTTRPSSHYLSPTLAPTPPELALSGLRPSPPLPCLLP SQGQDASQTQLSPHLSLALEEHWRLLSPGSALPRGEEEAEGRQLEKGAQWAVSVSTDHAT VTTNPSLRARLLQSGPSEKPMVPKAVLKEEFSAAVLNNRSQKQGCPPRASLGRMQDQTNN PGKVYSVTKGQSKPNYKMIHHRYFRKNIFIKRGKELTGASSINATITGDDVMLCCFDYED KNGNGKKRLGKVFSMVSL >gi568815580r:47741830_47996756|GENSCAN_predicted_CDS_3|2037_bp atgaatggagccacgaaaagaaaggcctttccccccaggattgccatgaacttcacacaa actcacagcatcctgcacagtcaagtctccactgagacagctcgaccaagacacagaact catgaacactgcagccaggagacccactcatgctgggactcgaggcagaaccaaggaatc cttgctcctttgtgtctttctggggaacctggtagctgtctgtcctcctctgaacatctc agaattcttgtccagccaccaatacttctcacaaacgtggaagccctctattctagaaca ccggagaaagggtcaactaaagttaaggcaaaaaataacaagaagaagacacctgatgga agcagagggactgcatttcccaagagcggcagatgccattttccaaaaaccaccgcagca acgtttctgatcccacatggatttcctgattatcggacaaccctcttacgaaaggtgttg ctaagcggccatggctgggtaggagcagaaagcctcccccacctccagagacagtggccc ctctcagaacggaaagctgcgaaaagcaccgtggctgccaggagaaccaggggcccgcca aactcaggtcatcacttagacgtgctccaggcaaagagccaaagaaaagaacagtgggac cccccactccccagaagccagtcccctacccctagacccgttctagtgactcactcctcc ccagggagtggtctttggaggaaatggccacggagtgccccggaggagctttgcagggac aaacctcaggaaggtccctggtctccaggaaggaaaaacagagcaaggattaaaatcaca acggttggcaacagcaacaatctccagggcacgtgggtgtctgcactggcaccgttagga aagaagaatgaacattggcagtgtggagtccccgagaagccacccctgtggcctgcttct ggctggagaagagagtcacgagttggggtcacagagcactgtggtcagaatgcacatgcc aattgcccccgggagcctgacactgctggccacagagagttgccagtcactgctgcaggc tccaaagggtcctcctttgcccagtaccccctgcaggcacgtgccaattgcctgcaggtg cctgatgctgcgcttggctccagagcgcagattccgagagtaacaggacacatccttgcc cccgcgtggctcaaggggccaggggaaacagattggcataaacatgctccccttccaccc ccaccatggacttacgctaataatccagccaggaaattcccgcttagacaggaacccacc ttccagggctttacaaccagaccttcctctcattacctatccccgaccctggctcccacc cctcctgaattggccttgagtggtctccgcccatccccacccctgccatgcctgttgcca tctcaaggccaggatgcgtcccagacacaactcagcccccacctctcgttagccttggag gaacactggcgtttgctctctccgggctcagcactgcctaggggagaagaagaggcagag ggaagacagttggagaaaggcgcacagtgggctgtatcggtcagcacggaccatgccact gtaacaaccaaccccagcctcagggcccggctgcttcagagcggacctagtgaaaaacca atggtgcctaaggcggtgctgaaggaagaattctcagcagcagttctcaacaacagatca cagaaacaaggctgtcctcctcgggcatctctggggaggatgcaggaccagaccaataac ccgggaaaggtctattctgtcaccaagggacaatcaaagcctaactacaagatgattcat catagatactttcggaaaaacatcttcatcaagagagggaaagaattaactggtgccagc agtatcaatgcaactattactggtgatgatgtcatgttatgctgttttgattatgaggac aaaaatgggaatggaaaaaagaggcttggaaaggttttcagtatggtttccttataa >gi568815580r:47741830_47996756|GENSCAN_predicted_peptide_4|711_aa MGVHADQEQEEESVGSARAEKVDTTCTIHGTKWSQDFGEILEGFLNGRVMDPKSETLEKW SNLDLRRASSALRGASPWDEHVSAAGLTTDGGRGGRPLQAPPTTLARDRAVTGNRLNARQ GAGLSLSAPPSIPSEEEGTKGPGPPGSDGAGPGATLQDKTVSHDTTVSVVGSSRAYFVAQ SRNPTGAKGSQVHFLLPDSPSICVPLSGTGGSNVSSLPVAVEYGGYCYRVKGMFVCGRWA SSIQEAVFLAWLAAFGKNMSSILPFTPPVVKRLLGWKKSAGGSGGAGGGEQNGQEEKWCE KAVKSLVKKLKKTGRLDELEKAITTQNCNTKCVTIPRWSTWAFLPACPDSRQKGKRKWWG LKPISSQVSEMELDFLFNTNRINKKKTTLRYIMVKLMKEGILKQLVKNGTQHTNDRSLDG RLQVSHRKGLPHVIYCRLWRWPDLHSHHELKAIENCEYAFNLKKDEVCVNPYHYQRVETP ETPPPGYISEDGETSDQQLNQSMDTGSPAELSPTTLSPVNHSLDLQPVTYSEPAFWCSIA YYELNQRVGETFHASQPSLTVDGFTDPSNSERFCLGLLSNVNRNATVEMTRRHIGRGVRL YYIGGEVFAECLSDSAIFVQSPNCNQRYGWHPATVCKIPPAFLGISEKFLGTEMQIPAVG SCLEHTEKSKGYSGALIPSASESDILGSTVLNYIEHVLLGDEELIMGFSHI >gi568815580r:47741830_47996756|GENSCAN_predicted_CDS_4|2136_bp atgggtgtgcatgcagaccaggaacaggaagaagagtcagtgggcagtgccagagcagaa aaggtggatactacctgcacaatccatggcactaagtggtcacaggactttggagaaatc ctggaaggcttcctaaatggaagagttatggacccaaagtctgagactttggagaaatgg tctaacttagacctcaggcgcgcgtcctccgcgctgcgcggggcctccccgtgggatgag cacgtgtccgctgccggcctcacgacggacggtggacgaggcggcaggcccctacaggcc ccacccacgacgctggcgagggatcgggcggtcaccgggaatcgtcttaatgcgcggcaa ggcgcgggcctctccctctccgcccccccctccatcccatcggaagaggaaggaacaaaa ggtcccggaccccccggatctgacggggcgggacctggcgccaccttgcaggataagact gtgtcacatgatactacggtgtctgtagttggctcctctagagcatactttgtggcacag agtaggaatccaacaggggccaaaggtagccaagtacactttttgctgcctgatagtccc agcatctgtgtgcctttgtctgggactggtggtagcaatgtttcctcccttcccgtggca gtagaatatggtggttattgttacagagtgaaaggcatgtttgtgtgtgggagatgggca agttcgatacaagaggctgttttcctagcgtggcttgctgcctttggtaagaacatgtcg tccatcttgccattcacgccgccagttgtgaagagactgctgggatggaagaagtcagct ggtgggtctggaggagcaggcggaggagagcagaatgggcaggaagaaaagtggtgtgag aaagcagtgaaaagtctggtgaagaagctaaagaaaacaggacgattagatgagcttgag aaagccatcaccactcaaaactgtaatactaaatgtgttaccataccaaggtggagcacc tgggctttcttaccagcttgcccagattcgaggcagaaggggaagaggaaatggtggggc ttgaaacctatcagcagtcaagtatcagaaatggagttggatttcttattcaacacaaac aggataaataaaaagaaaaccacacttagatatatcatggtcaaactcatgaaagaggga atcttaaagcagctggtgaaaaatggcacgcagcatacaaatgacaggtctcttgatggt cgtctccaggtatcccatcgaaaaggattgccacatgttatatattgccgattatggcgc tggcctgatcttcacagtcatcatgaactcaaggcaattgaaaactgcgaatatgctttt aatcttaaaaaggatgaagtatgtgtaaacccttaccactatcagagagttgagacacca gaaacgccacctcctggatatatcagtgaagatggagaaacaagtgaccaacagttgaat caaagtatggacacaggctctccagcagaactatctcctactactctttcccctgttaat catagcttggatttacagccagttacttactcagaacctgcattttggtgttcgatagca tattatgaattaaatcagagggttggagaaaccttccatgcatcacagccctcactcact gtagatggctttacagacccatcaaattcagagaggttctgcttaggtttactctccaat gttaaccgaaatgccacggtagaaatgacaagaaggcatataggaagaggagtgcgctta tactacataggtggggaagtttttgctgagtgcctaagtgatagtgcaatctttgtgcag agccccaattgtaatcagagatatggctggcaccctgcaacagtgtgtaaaattccacca gcctttttgggtatatcagagaagttcctgggcacagaaatgcagatacctgcagttgga agttgtctagagcacactgaaaaatccaaagggtacagtggggcactgataccatctgcc tcagaaagtgacatccttggaagtacagttttaaattacattgaacatgtgttgcttgga gatgaagagttgataatgggcttttctcatatttga >gi568815580r:47741830_47996756|GENSCAN_predicted_peptide_5|191_aa MELRRGRNQGGFQGLDDRNREASRIRYGKVYTEKNKRLRIPSTLESITCVEGNWETRSIV RSLLNEPRTGMKTLLLGCMRFLRSGSADESPAASLHLDCVRVSQCWALSLSLITQCPLEL LRLWLSSRSDPSLRIATGPLDRQCGSKVPAQDSGDVDTWVLVEQWNALEKDSLETKLEVT QQQPSVSSKGH >gi568815580r:47741830_47996756|GENSCAN_predicted_CDS_5|576_bp atggagcttaggagaggaagaaatcaaggaggattccaaggcttggatgacagaaatagg gaagcttctagaattagatacggtaaagtgtatacagaaaagaacaaaaggctaaggatt ccctcaaccttggagagtatcacatgtgtagaaggtaactgggaaaccagatccattgtc agaagcttgctgaatgagcccagaaccggaatgaaaacactcctgcttggatgcatgcgc tttctcagaagtggtagtgcagatgaatctcctgccgcctctctccatctggactgtgta cgcgtctctcagtgctgggccctaagtctttctttaatcacacagtgccccttagagctg ctgaggttatggctttcttcacgctctgatccaagcctaaggatagccactggtcccctg gacagacaatgtggcagcaaggtacctgcccaggacagtggagatgtggacacctgggtg ctggtagagcagtggaacgccctggagaaggacagcctggagacaaaacttgaggtaaca cagcagcagccatcagtatcgtcaaagggccactag >gi568815580r:47741830_47996756|GENSCAN_predicted_peptide_6|57_aa MAKGVAFVEPGQLHKCENELGKSPLPQVEATFSGIHLPKPMTSPTPPSGMQGPLSPA >gi568815580r:47741830_47996756|GENSCAN_predicted_CDS_6|174_bp atggcaaaaggggttgcattcgtggaacctggacaattacataaatgtgaaaacgagctc ggtaaatctcctttacctcaagtcgaggccaccttctctggcatccatcttcctaagccc atgacatcccccacacctccctcagggatgcagggccctttaagccctgcttag >gi568815580r:47741830_47996756|GENSCAN_predicted_peptide_7|152_aa MAEHSWGTDYGALWVQEKETSQFWESTLQGAQLLYSKSASPWLQAASQAGEEQSKSLSAA CILAYVSQGGPASPDGCELAAQIQSQLITRDVCARGEAQRQTQSHSRERRKKAREKKERG GGSEKGQEAGQPRFFLIDEPKLLAGKPSTPEG >gi568815580r:47741830_47996756|GENSCAN_predicted_CDS_7|459_bp atggctgagcattcatggggaactgactatggggctctctgggtgcaggagaaggaaacc tctcagttttgggaaagcactttgcaaggggcccagcttttatattccaagtcagccagc ccttggcttcaggctgcttcccaagcaggggaggaacaaagcaagagtctttctgccgcc tgtattttagcatatgtcagtcagggtggcccagcgtctccagatggctgtgagcttgct gcacaaatccagagccagctcatcactagggacgtatgtgcacgtggagaggcccagaga cagacgcagagccatagcagggagagaaggaaaaaggcaagggagaaaaaggagagaggg ggagggagtgaaaagggacaggaagcaggacagcctcgattcttcctcatagatgaacct aagcttcttgcagggaagccttctactccagaaggttga