GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:49:31 Sequence gi568815591f:116426523_116659284 : 232762 bp : 38.92% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6351 6597 247 2 1 28 115 196 0.565 12.41 1.02 Intr + 18726 18801 76 1 1 96 55 26 0.009 -2.25 1.03 Term + 26735 26903 169 2 1 121 47 55 0.103 1.07 1.04 PlyA + 27546 27551 6 1.05 2.00 Prom + 30880 30919 40 -3.95 2.01 Init + 47207 47295 89 1 2 81 39 145 0.618 9.06 2.02 Intr + 48455 48608 154 2 1 0 68 114 0.063 -0.25 2.03 Term + 50049 50207 159 2 0 27 49 194 0.996 6.36 2.04 PlyA + 51113 51118 6 1.05 3.00 Prom + 53325 53364 40 -5.25 3.01 Init + 64528 64682 155 0 2 101 35 69 0.787 2.40 3.02 Term + 65240 65351 112 2 1 88 43 104 0.862 2.95 3.03 PlyA + 65500 65505 6 1.05 4.00 Prom + 70260 70299 40 -5.95 4.01 Init + 70369 70427 59 0 2 98 100 96 0.978 12.63 4.02 Intr + 73148 73409 262 2 1 29 67 302 0.861 18.77 4.03 Intr + 73738 73925 188 0 2 94 85 206 0.954 18.47 4.04 Intr + 78965 79103 139 2 1 19 16 109 0.216 -3.65 4.05 Term + 79449 79637 189 2 0 128 37 115 0.903 6.87 4.06 PlyA + 80119 80124 6 1.05 5.00 Prom + 82901 82940 40 -2.95 5.01 Sngl + 85300 85503 204 0 0 76 41 185 0.941 7.54 5.02 PlyA + 87402 87407 6 1.05 6.00 Prom + 93220 93259 40 -6.75 6.01 Init + 98879 99037 159 1 0 41 66 150 0.835 6.06 6.02 Intr + 100003 100167 165 0 0 116 121 375 0.998 42.84 6.03 Term + 132424 132765 342 0 0 102 36 340 0.418 23.73 6.04 PlyA + 132886 132891 6 1.05 7.00 Prom + 144930 144969 40 -3.65 7.01 Init + 153607 153738 132 0 0 84 85 50 0.409 2.76 7.02 Intr + 154967 155120 154 1 1 75 87 67 0.824 4.02 7.03 Intr + 159342 159479 138 1 0 59 37 91 0.121 0.61 7.04 Intr + 161448 161527 80 1 2 66 40 70 0.080 -1.65 7.05 Intr + 166729 166859 131 0 2 79 85 38 0.098 1.17 7.06 Term + 168620 168734 115 2 1 99 44 81 0.097 1.86 7.07 PlyA + 169497 169502 6 1.05 8.03 PlyA - 170473 170468 6 1.05 8.02 Term - 171243 171053 191 1 2 49 50 174 0.389 6.33 8.01 Init - 192821 192671 151 2 1 78 75 57 0.835 3.65 8.00 Prom - 193646 193607 40 -3.35 9.05 PlyA - 194419 194414 6 1.05 9.04 Term - 196582 196466 117 1 0 18 49 130 0.692 -0.34 9.03 Intr - 198103 198002 102 1 0 113 71 49 0.590 5.15 9.02 Intr - 209239 209137 103 0 1 110 66 75 0.651 6.76 9.01 Init - 218475 218342 134 0 2 65 73 89 0.680 4.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 48482 48608 127 2 1 25 68 121 0.881 3.56 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_1|163_aa YEIHKEKLTASWPFAKEQVQKDTHILFGSGPTSNRYGLVPRCLLTEEAKNTAPCSHQWTP PQQQGLPSPLPMREMVAVQQENGSWHQTMPGMELFLKVEGLMALYHRRPFQVFPSPPGIK ANFLSWLTRLYMIRLYLSPHYLSDFHGFPLDSILATLASSRLC >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_1|492_bp tatgaaattcacaaagagaaacttactgcttcttggccttttgctaaagagcaagtgcag aaagacacacatattctctttggcagtgggcctacatcaaatcgttatggtctcgtacca agatgtcttttgactgaggaagctaagaatacggctccttgttctcaccagtggactcct cctcagcagcaggggctcccttccccactacccatgagggagatggtggcagttcaacag gaaaatggttcctggcatcagacaatgcctggaatggaactattcctaaaagtggaaggg ctgatggcattgtaccatcgtagacctttccaagtcttcccatctccccctggaataaaa gcaaatttcttatcgtggcttacaagactttacatgattaggctgtacctgagtcctcat tacctctctgatttccatggctttccccttgactccattctagccacgctggcctcctca cgactctgctga >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_2|133_aa MAKESLSKMNKSKGITLPDYEEAIVTKTACNPTIAYLPRGKEVIIRKRYLHVHVYSRTIC DCKNVEPAKTPINQREDKENVWHPTSSPSLNLTEEPNACSTDGDTNPKAVPCMLNPEDLK TITLWDSKAELDP >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_2|402_bp atggccaaagaaagcctaagcaaaatgaacaaatccaaaggcatcacattacccgactac gaggaagccatagtcaccaaaacagcatgcaatcccactattgcgtatctacccagagga aaagaagtcattatacgaaaaagatacctgcacgtgcatgtttacagcaggacaatttgc gattgcaaaaatgtggaaccagccaaaacgcccatcaaccagcgagaggataaagaaaat gtgtggcatccaacatcctctccttcattgaaccttactgaagaacctaacgcctgctcc actgacggtgacaccaacccaaaagctgtgccatgcatgctgaatcctgaagatctcaaa accattacactttgggattccaaagcagaactggacccatga >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_3|88_aa MEPVEVSQVRSEPRPGRRSEMGEGESTDFEGRTNRPHWRRSNEGNGGMKADTPSVTRYGN SFREKENYKKYLETVTREKQLKEMELLS >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_3|267_bp atggagcctgtggaggtttctcaggtgagatctgaacccagaccagggaggagaagtgaa atgggagagggtgaaagcactgattttgaaggtagaactaacaggcctcattggagaaga agtaatgagggaaatggaggaatgaaagctgacacaccaagtgtcacaagatatgggaac agcttcagagaaaaagaaaattacaagaaatatttggaaacagtgacacgtgaaaaacaa ctgaaagaaatggagctgcttagctag >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_4|278_aa MAEGKQGVSPHVATEGAKERARRGEGGRGRALREGEAARTGSRTAPAGLQRPRTKAAMGL ETEKADVQLFMDDDSYSHHSGLEYADPEKFADSDQDRDPHRLNSHLKLGFEDVIAEPVTT HSFDKVWICSHALFEISKYVMYKFLTVFLAIPLAFIAGILFATLSCLHICSAGCTGSTVP ASTLGKDLRKLPLIVEGEGGAGTSHGKKGSKRENVEDFNAFCKDLPNGSAFSADNMEECD RCYHCSIVYERRTMLLFCQPATEPGLNTWTPGLEIGIL >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_4|837_bp atggcggaaggtaaacagggagtatcacctcacgtggccacagaaggagccaaagaaagg gctaggcgaggcgagggggggcggggccgggcgctacgggaaggggaggccgcgcggacc gggagccgcaccgcgccagccgggctgcagcggccgcgcaccaaggctgcgatggggctg gagacggagaaggcggacgtacagctcttcatggacgacgactcctacagccaccacagc ggcctcgagtacgccgaccccgagaagttcgcggactcggaccaggaccgggatccccac cggctcaactcgcatctcaagctgggcttcgaggatgtgatcgcagagccggtgactacg cactcctttgacaaagtgtggatctgcagccatgccctctttgaaatcagcaaatacgta atgtacaagttcctgacggtgttcctggccattcccctggccttcattgcgggaattctc tttgccaccctcagctgtctgcacatctgttctgcgggctgtacaggaagcacagtgcca gcatctactttaggcaaggacctcaggaagcttccacttatagtggaaggagaaggagga gcaggcacgtcacatggcaagaaagggagcaagagagagaatgtggaggattttaatgcc ttttgtaaagacctgcctaatggttctgccttcagtgcagacaatatggaagagtgtgac agatgttatcattgctccattgtgtacgagcgtaggacgatgcttctcttctgtcagcct gcaactgagccaggattgaatacttggaccccaggtctggagattgggatactgtaa >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_5|67_aa MRALLEPCALKTMPKRATENENETVTCDHLSVSSFALPLCAVDCCGMSAAKKDHAAVREI RGHHFNN >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_5|204_bp atgagggctctgctggaaccatgtgccctaaagacaatgccgaagcgtgcaactgaaaat gaaaatgaaactgtaacttgtgaccatctatctgtgtcttctttcgccttgcctctctgt gctgttgactgttgcggcatgtctgcggcaaagaaagaccatgctgcagtcagagaaatc agaggacaccacttcaacaattaa >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_6|221_aa MLPCRGTPAVRPCLLGVRRGGVQGGGVIYPSPGDSPRDSPPGAQTGRSRRRRAGHLYTVP IREQGNIYKPNNKAMADELSEKQVYDAHTKEIDLVNRDPKHLNDDVVKIDFEDVIAEPEG THSFDGIWKASFTTFTVTKYWFYRLLSALFGIPMALIWGIYFAILSFLHIWAVVPCIKSF LIEIQCISRVYSIYVHTVCDPLFEAVGKIFSNVRINLQKEI >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_6|666_bp atgctcccttgtcgcgggacccccgcggtccggccctgcctgctgggggttcgaagaggt ggagtgcagggtggaggtgttatttacccgagtcctggggacagtccccgggactctccg ccaggcgcccagaccggcaggtcccgcaggcggcgcgcgggacatctctacaccgttccc atccgggaacagggcaacatctacaagcccaacaacaaggccatggcagacgagctgagc gagaagcaagtgtacgacgcgcacaccaaggagatcgacctggtcaaccgcgaccctaaa cacctcaacgatgacgtggtcaagattgactttgaagatgtgattgcagaaccagaaggg acacacagttttgacggcatttggaaggccagcttcaccaccttcactgtgacgaaatac tggttttaccgcttgctgtctgccctctttggcatcccgatggcactcatctggggcatt tacttcgccattctctctttcctgcacatctgggcagttgtaccatgcattaagagcttc ctgattgagattcagtgcatcagccgtgtctattccatctacgtccacaccgtctgtgac ccactctttgaagctgttgggaaaatattcagcaatgtccgcatcaacttgcagaaagaa atataa >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_7|249_aa MVSVNSLALLAYLLLGRARQRNFHCSAERKKTMGPTMQILKRTKDQNLFGPRSFGCDLPI LQHCLVLPSNAQKTEPYSQCLCHWPLQTKETLLEAVALGMLWGQPSTPGCVVGTLARITL ATEGPWTGLEGSNTEETLTEGVGHRATLPTEELSPPATIIVAVVPASFEPIQKSVDHIKF PFLTSYPRILYNIPPNLTPSEHQVPSLTKKESPSYVSLFDCFHDMIPSPVNILVTLFRAG CNLPVPVLL >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_7|750_bp atggtgagtgttaattctctggcacttctggcctacctgctcctgggcagagcaagacag agaaattttcactgttctgcagagagaaaaaagactatgggccccaccatgcagatactg aaaagaacaaaggatcagaacctgtttgggcccagatcatttggctgtgaccttcctatc ctgcaacactgcctggttcttccaagtaatgcccagaagactgaaccctattctcaatgt ctctgccactggcctcttcaaactaaagagactctcctagaagcagttgctctaggtatg ctctggggacagcctagtactccaggctgtgtagttggcaccctggccaggatcaccctt gccactgagggcccctggacaggacttgaagggtcaaacaccgaggaaacactgacagag ggtgtgggacacagagccaccctccccacagaggagctctctcctcctgctaccattatc gtggcagtagttcctgcctcttttgaacctatccaaaaatctgtggatcatataaagttt ccttttcttacctcctacccccgaatactttacaacattccaccaaatcttactccctcg gagcatcaggttcctagtctaaccaagaaagaaagcccaagctacgttagtttatttgac tgttttcatgacatgataccaagccctgtcaacatcttagtcactctcttccgagctggc tgcaacttgcctgtccctgtgcttctctga >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_8|113_aa MRMEELISGRRHHKRSVVGGCIEVGLCKIGRVQEAQNRKTENTEGEKYRTELEDGVYSCG RNITVDKTGLMLSSRGRAALPAARFLVVQWPATFEALPGALASSGAACFWYQL >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_8|342_bp atgaggatggaagagcttatctcaggcagaaggcatcataagaggtctgtggtaggagga tgcattgaagtagggctttgcaagataggcagagtacaggaggcacagaatagaaaaact gaaaacactgagggggaaaagtataggacagagcttgaggatggggtgtatagttgtggc agaaatataactgtggataaaacagggctgatgctctctagccgtggaagagcagctctt ccagctgcccgctttctggttgttcagtggcctgccactttcgaggcactaccaggagca ctggctagctctggagcagcatgcttctggtatcagttatga >gi568815591f:116426523_116659284|GENSCAN_predicted_peptide_9|151_aa MESRSETGDEALDLCEVDAGTVCYTNGALIGNGTTETLPSGAGPGLSLILLYSPPAGGSV GKISKCCKVNEGGDPGKGCDLTWPFGGLQTSHPAAMGVAGYKEVTGFPGQLFKMRQQKVQ GKREESGKNSSVPITKTVEGVNLENSSGPTL >gi568815591f:116426523_116659284|GENSCAN_predicted_CDS_9|456_bp atggagtccagatcagagacaggagatgaagccttagatctgtgtgaggtagacgctggg acagtctgctacacaaacggggcccttataggaaatgggactacagaaactctaccctct ggggcaggcccaggactttccctgattcttctttactccccacctgcaggtggatcagta ggcaaaatcagcaagtgttgtaaggtcaatgagggtggtgatccaggcaagggctgcgac ttgacttggcctttcggtggtctgcaaacctctcatccagcagccatgggtgtggcaggc tacaaagaagtgactggatttcctgggcaattgtttaagatgagacagcaaaaagtacaa ggcaagagagaagaaagtggcaagaactccagtgtgcctataacaaagactgtggaggga gtgaatctggagaactcatcgggacccacgttgtga