GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:50:00 Sequence gi568815591f:116399782_116606118 : 206337 bp : 38.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10071 10345 275 2 2 71 58 276 0.044 19.39 1.02 Term + 11208 12921 1714 0 1 24 43 627 0.034 38.48 1.03 PlyA + 13112 13117 6 1.05 2.00 Prom + 14612 14651 40 -3.65 2.01 Init + 17275 17346 72 0 0 95 111 -20 0.076 2.12 2.02 Intr + 33092 33338 247 1 1 28 115 196 0.302 12.41 2.03 Intr + 45467 45542 76 0 1 96 55 26 0.007 -2.25 2.04 Term + 53476 53644 169 1 1 121 47 55 0.102 1.07 2.05 PlyA + 54287 54292 6 1.05 3.00 Prom + 57621 57660 40 -3.95 3.01 Init + 73948 74036 89 0 2 81 39 145 0.618 9.06 3.02 Intr + 75196 75349 154 1 1 0 68 114 0.063 -0.25 3.03 Term + 76790 76948 159 1 0 27 49 194 0.996 6.36 3.04 PlyA + 77854 77859 6 1.05 4.00 Prom + 80066 80105 40 -5.25 4.01 Init + 91269 91423 155 2 2 101 35 69 0.787 2.40 4.02 Term + 91981 92092 112 1 1 88 43 104 0.862 2.95 4.03 PlyA + 92241 92246 6 1.05 5.00 Prom + 97001 97040 40 -5.95 5.01 Init + 97110 97168 59 2 2 98 100 96 0.978 12.63 5.02 Intr + 99889 100150 262 1 1 29 67 302 0.861 18.77 5.03 Intr + 100479 100666 188 2 2 94 85 206 0.954 18.47 5.04 Intr + 105706 105844 139 1 1 19 16 109 0.216 -3.65 5.05 Term + 106190 106378 189 1 0 128 37 115 0.903 6.87 5.06 PlyA + 106860 106865 6 1.05 6.00 Prom + 109642 109681 40 -2.95 6.01 Sngl + 112041 112244 204 2 0 76 41 185 0.941 7.54 6.02 PlyA + 114143 114148 6 1.05 7.00 Prom + 119961 120000 40 -6.75 7.01 Init + 125620 125778 159 0 0 41 66 150 0.835 6.06 7.02 Intr + 126744 126908 165 2 0 116 121 375 0.998 42.84 7.03 Term + 159165 159506 342 2 0 102 36 340 0.418 23.73 7.04 PlyA + 159627 159632 6 1.05 8.00 Prom + 171671 171710 40 -3.65 8.01 Init + 180348 180479 132 2 0 84 85 50 0.409 2.76 8.02 Intr + 181708 181861 154 0 1 75 87 67 0.824 4.02 8.03 Intr + 186083 186220 138 0 0 59 37 91 0.121 0.61 8.04 Intr + 188189 188268 80 0 2 66 40 70 0.077 -1.65 8.05 Intr + 193470 193600 131 2 2 79 85 38 0.092 1.17 8.06 Term + 195361 195475 115 1 1 99 44 81 0.096 1.86 8.07 PlyA + 196238 196243 6 1.05 9.02 PlyA - 197214 197209 6 1.05 9.01 Term - 197984 197794 191 0 2 49 50 174 0.519 6.33 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 11356 12921 1566 0 0 44 43 646 0.894 50.80 S.002 Intr + 75223 75349 127 1 1 25 68 121 0.881 3.56 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_1|662_aa MEQSWMENDFDELREEGFRRSNYSELREDIQTKGKEVENFEKSSEEYITRIANTQKCLKE LMELKTKAPELREECRSLRSQCDQLEERASVIDTHSLKIKGWRKIYQANGKQKKAGVAIL VSDKTDFKPTKIKRDKEGHYIMVKGSNQQEELTILNKYAPNTGAPRFIKQVLSDLQRDLD SHTLIMGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAP HHTYFKTDHIVGSKVLLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNHSTTWKLNNLLL NDYCVHNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTL TSQLKKLEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERISKTDKPL ARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDIYT LPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQ SIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDGKILNKILANRIQQHIKK LIHHDQVGFIPGIQGWFNIRKSINVIQHINRTKDKHHMIISIDAEKAFDKIQQPFMQKLS IN >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_1|1989_bp atggaacaaagctggatggagaatgactttgacgagctgagagaagaaggcttcagacga tcaaattactctgagctacgggaggacattcaaaccaaaggcaaagaagttgaaaacttt gaaaaaagttcagaagaatatataactagaatagccaatacacagaagtgcttaaaggag ctgatggagctgaaaaccaaggctccagaactacgtgaagaatgcagaagcctcaggagc caatgcgatcaactggaagaaagggcatcagtgatagacacacatagtctcaaaataaaa ggatggaggaagatctaccaagcaaatggaaaacaaaaaaaggcaggggttgcaatccta gtctctgataaaacagactttaaaccaacaaagatcaaaagagacaaagaaggccattac ataatggtaaagggatcaaatcaacaagaagagctaactatcctaaataaatatgcaccc aatacaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagac tcccatacattaataatgggagactttaacaccccactgtcaacattagacagatcaaca agacagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcggaccta atagacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcacca caccacacctatttcaaaactgaccacatagttggaagtaaagttctcctcagcaaatgt aaaagaacagaaattataacaaactgtctctcagaccacagtgcaatcaaactagaactc aggattaagaaactcactcaaaaccactcaactacatggaaactgaacaacttgctcctg aatgactactgcgtacataacgaaatgaaggcagaaataaagatgttttttgaaaccaac gagaacaaagacacaacataccagaatctctgggacgcattcaaagcagtgtgtagaggg aaatttatagcactaaatgcccacaagagaaagcaggaaagatccaaaattgacacccta acatcacaattaaaaaaactagaaaagcaagagcaaacacattcaaaagctagcagaagg caagaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaa aaaattaatgaatccaggagctggttttttgaaaggatcagcaaaactgataaaccgcta gcaagactaataaagaagaaaagagagaagaatcaaatagatgcaataaaaaatgataaa ggggatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacac ctctacgcaaataaactagaaaatctagaagaaatggataaattcctcgacatatacact ctcccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggagctgaa attgtggcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattcaca gccgaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaa tcaatagaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctgata ccaaagccaggcagagacacaacaaaaaaagagaattttagaccaatatccttgatgaac attgatggaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaag cttatccaccatgatcaagtgggcttcatccctgggatacaaggctggttcaatatacgc aaatcaatcaatgtaatccagcatataaacagaaccaaagacaaacaccacatgattatc tcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgcaaaaactctca ataaattag >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_2|187_aa MVSVTNFMIFIKTSKGCGKDLLTPYEIHKEKLTASWPFAKEQVQKDTHILFGSGPTSNRY GLVPRCLLTEEAKNTAPCSHQWTPPQQQGLPSPLPMREMVAVQQENGSWHQTMPGMELFL KVEGLMALYHRRPFQVFPSPPGIKANFLSWLTRLYMIRLYLSPHYLSDFHGFPLDSILAT LASSRLC >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_2|564_bp atggtctctgtcacaaatttcatgatttttattaagacctcaaaaggctgtggcaaagat ttgcttaccccgtatgaaattcacaaagagaaacttactgcttcttggccttttgctaaa gagcaagtgcagaaagacacacatattctctttggcagtgggcctacatcaaatcgttat ggtctcgtaccaagatgtcttttgactgaggaagctaagaatacggctccttgttctcac cagtggactcctcctcagcagcaggggctcccttccccactacccatgagggagatggtg gcagttcaacaggaaaatggttcctggcatcagacaatgcctggaatggaactattccta aaagtggaagggctgatggcattgtaccatcgtagacctttccaagtcttcccatctccc cctggaataaaagcaaatttcttatcgtggcttacaagactttacatgattaggctgtac ctgagtcctcattacctctctgatttccatggctttccccttgactccattctagccacg ctggcctcctcacgactctgctga >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_3|133_aa MAKESLSKMNKSKGITLPDYEEAIVTKTACNPTIAYLPRGKEVIIRKRYLHVHVYSRTIC DCKNVEPAKTPINQREDKENVWHPTSSPSLNLTEEPNACSTDGDTNPKAVPCMLNPEDLK TITLWDSKAELDP >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_3|402_bp atggccaaagaaagcctaagcaaaatgaacaaatccaaaggcatcacattacccgactac gaggaagccatagtcaccaaaacagcatgcaatcccactattgcgtatctacccagagga aaagaagtcattatacgaaaaagatacctgcacgtgcatgtttacagcaggacaatttgc gattgcaaaaatgtggaaccagccaaaacgcccatcaaccagcgagaggataaagaaaat gtgtggcatccaacatcctctccttcattgaaccttactgaagaacctaacgcctgctcc actgacggtgacaccaacccaaaagctgtgccatgcatgctgaatcctgaagatctcaaa accattacactttgggattccaaagcagaactggacccatga >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_4|88_aa MEPVEVSQVRSEPRPGRRSEMGEGESTDFEGRTNRPHWRRSNEGNGGMKADTPSVTRYGN SFREKENYKKYLETVTREKQLKEMELLS >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_4|267_bp atggagcctgtggaggtttctcaggtgagatctgaacccagaccagggaggagaagtgaa atgggagagggtgaaagcactgattttgaaggtagaactaacaggcctcattggagaaga agtaatgagggaaatggaggaatgaaagctgacacaccaagtgtcacaagatatgggaac agcttcagagaaaaagaaaattacaagaaatatttggaaacagtgacacgtgaaaaacaa ctgaaagaaatggagctgcttagctag >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_5|278_aa MAEGKQGVSPHVATEGAKERARRGEGGRGRALREGEAARTGSRTAPAGLQRPRTKAAMGL ETEKADVQLFMDDDSYSHHSGLEYADPEKFADSDQDRDPHRLNSHLKLGFEDVIAEPVTT HSFDKVWICSHALFEISKYVMYKFLTVFLAIPLAFIAGILFATLSCLHICSAGCTGSTVP ASTLGKDLRKLPLIVEGEGGAGTSHGKKGSKRENVEDFNAFCKDLPNGSAFSADNMEECD RCYHCSIVYERRTMLLFCQPATEPGLNTWTPGLEIGIL >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_5|837_bp atggcggaaggtaaacagggagtatcacctcacgtggccacagaaggagccaaagaaagg gctaggcgaggcgagggggggcggggccgggcgctacgggaaggggaggccgcgcggacc gggagccgcaccgcgccagccgggctgcagcggccgcgcaccaaggctgcgatggggctg gagacggagaaggcggacgtacagctcttcatggacgacgactcctacagccaccacagc ggcctcgagtacgccgaccccgagaagttcgcggactcggaccaggaccgggatccccac cggctcaactcgcatctcaagctgggcttcgaggatgtgatcgcagagccggtgactacg cactcctttgacaaagtgtggatctgcagccatgccctctttgaaatcagcaaatacgta atgtacaagttcctgacggtgttcctggccattcccctggccttcattgcgggaattctc tttgccaccctcagctgtctgcacatctgttctgcgggctgtacaggaagcacagtgcca gcatctactttaggcaaggacctcaggaagcttccacttatagtggaaggagaaggagga gcaggcacgtcacatggcaagaaagggagcaagagagagaatgtggaggattttaatgcc ttttgtaaagacctgcctaatggttctgccttcagtgcagacaatatggaagagtgtgac agatgttatcattgctccattgtgtacgagcgtaggacgatgcttctcttctgtcagcct gcaactgagccaggattgaatacttggaccccaggtctggagattgggatactgtaa >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_6|67_aa MRALLEPCALKTMPKRATENENETVTCDHLSVSSFALPLCAVDCCGMSAAKKDHAAVREI RGHHFNN >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_6|204_bp atgagggctctgctggaaccatgtgccctaaagacaatgccgaagcgtgcaactgaaaat gaaaatgaaactgtaacttgtgaccatctatctgtgtcttctttcgccttgcctctctgt gctgttgactgttgcggcatgtctgcggcaaagaaagaccatgctgcagtcagagaaatc agaggacaccacttcaacaattaa >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_7|221_aa MLPCRGTPAVRPCLLGVRRGGVQGGGVIYPSPGDSPRDSPPGAQTGRSRRRRAGHLYTVP IREQGNIYKPNNKAMADELSEKQVYDAHTKEIDLVNRDPKHLNDDVVKIDFEDVIAEPEG THSFDGIWKASFTTFTVTKYWFYRLLSALFGIPMALIWGIYFAILSFLHIWAVVPCIKSF LIEIQCISRVYSIYVHTVCDPLFEAVGKIFSNVRINLQKEI >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_7|666_bp atgctcccttgtcgcgggacccccgcggtccggccctgcctgctgggggttcgaagaggt ggagtgcagggtggaggtgttatttacccgagtcctggggacagtccccgggactctccg ccaggcgcccagaccggcaggtcccgcaggcggcgcgcgggacatctctacaccgttccc atccgggaacagggcaacatctacaagcccaacaacaaggccatggcagacgagctgagc gagaagcaagtgtacgacgcgcacaccaaggagatcgacctggtcaaccgcgaccctaaa cacctcaacgatgacgtggtcaagattgactttgaagatgtgattgcagaaccagaaggg acacacagttttgacggcatttggaaggccagcttcaccaccttcactgtgacgaaatac tggttttaccgcttgctgtctgccctctttggcatcccgatggcactcatctggggcatt tacttcgccattctctctttcctgcacatctgggcagttgtaccatgcattaagagcttc ctgattgagattcagtgcatcagccgtgtctattccatctacgtccacaccgtctgtgac ccactctttgaagctgttgggaaaatattcagcaatgtccgcatcaacttgcagaaagaa atataa >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_8|249_aa MVSVNSLALLAYLLLGRARQRNFHCSAERKKTMGPTMQILKRTKDQNLFGPRSFGCDLPI LQHCLVLPSNAQKTEPYSQCLCHWPLQTKETLLEAVALGMLWGQPSTPGCVVGTLARITL ATEGPWTGLEGSNTEETLTEGVGHRATLPTEELSPPATIIVAVVPASFEPIQKSVDHIKF PFLTSYPRILYNIPPNLTPSEHQVPSLTKKESPSYVSLFDCFHDMIPSPVNILVTLFRAG CNLPVPVLL >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_8|750_bp atggtgagtgttaattctctggcacttctggcctacctgctcctgggcagagcaagacag agaaattttcactgttctgcagagagaaaaaagactatgggccccaccatgcagatactg aaaagaacaaaggatcagaacctgtttgggcccagatcatttggctgtgaccttcctatc ctgcaacactgcctggttcttccaagtaatgcccagaagactgaaccctattctcaatgt ctctgccactggcctcttcaaactaaagagactctcctagaagcagttgctctaggtatg ctctggggacagcctagtactccaggctgtgtagttggcaccctggccaggatcaccctt gccactgagggcccctggacaggacttgaagggtcaaacaccgaggaaacactgacagag ggtgtgggacacagagccaccctccccacagaggagctctctcctcctgctaccattatc gtggcagtagttcctgcctcttttgaacctatccaaaaatctgtggatcatataaagttt ccttttcttacctcctacccccgaatactttacaacattccaccaaatcttactccctcg gagcatcaggttcctagtctaaccaagaaagaaagcccaagctacgttagtttatttgac tgttttcatgacatgataccaagccctgtcaacatcttagtcactctcttccgagctggc tgcaacttgcctgtccctgtgcttctctga >gi568815591f:116399782_116606118|GENSCAN_predicted_peptide_9|63_aa XLEDGVYSCGRNITVDKTGLMLSSRGRAALPAARFLVVQWPATFEALPGALASSGAACFW YQL >gi568815591f:116399782_116606118|GENSCAN_predicted_CDS_9|192_bp nagcttgaggatggggtgtatagttgtggcagaaatataactgtggataaaacagggctg atgctctctagccgtggaagagcagctcttccagctgcccgctttctggttgttcagtgg cctgccactttcgaggcactaccaggagcactggctagctctggagcagcatgcttctgg tatcagttatga