GENSCAN 1.0    Date run: 4-Nov-116    Time: 03:42:45

Sequence gi568815578f:10175492_10406194 : 230703 bp : 40.33% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  10078  10163   86  2  2  134   43   120 0.934   8.94
 1.02 PlyA +  11181  11186    6                               1.05

 2.05 PlyA -  12360  12355    6                               1.05
 2.04 Term -  13559  13461   99  2  0   84   54    91 0.277   2.45
 2.03 Intr -  26073  25950  124  2  1   76   36   118 0.655   5.07
 2.02 Intr -  26291  26168  124  0  1   85   96    20 0.501   1.22
 2.01 Init -  32248  32191   58  1  1   79  100    37 0.780   5.52
 2.00 Prom -  36052  36013   40                              -8.25

 3.00 Prom +  37754  37793   40                              -2.25
 3.01 Init +  43110  43182   73  2  1   62   62    93 0.517   5.38
 3.02 Intr +  43439  43576  138  0  0   67   34    90 0.328   1.01
 3.03 Intr +  44858  44976  119  0  2   45  103    73 0.486   3.76
 3.04 Intr +  56695  56918  224  0  2   80   72   122 0.091   5.80
 3.05 Intr +  61109  61242  134  2  2    6   31   137 0.012  -0.83
 3.06 Intr +  65318  65510  193  0  1   80   57   148 0.187   8.53
 3.07 Term +  71688  71811  124  0  1   54   45   107 0.233  -0.12
 3.08 PlyA +  77109  77114    6                               1.05

 4.00 Prom +  77961  78000   40                              -5.05
 4.01 Init + 100001 100072   72  1  0  104   94   137 0.725  17.02
 4.02 Intr + 106610 106768  159  1  0   49   76    61 0.417   0.26
 4.03 Intr + 107813 108009  197  1  2   50  102    86 0.670   3.49
 4.04 Intr + 121434 121559  126  0  0   93   99   151 0.999  15.57
 4.05 Term + 123777 123972  196  0  1   73   47   269 0.675  17.20
 4.06 PlyA + 125451 125456    6                              -0.45

 5.05 PlyA - 126296 126291    6                               1.05
 5.04 Term - 128160 128033  128  1  2  108   49    59 0.273   1.46
 5.03 Intr - 133513 133355  159  2  0    5   80   241 0.203  14.04
 5.02 Intr - 144685 144506  180  2  0   64   80    73 0.077   3.02
 5.01 Init - 147125 147119    7  2  1   86   98     0 0.136   1.89
 5.00 Prom - 149399 149360   40                              -6.05

 6.00 Prom + 155914 155953   40                              -6.65
 6.01 Init + 156891 157005  115  2  1   48  111    60 0.750   4.72
 6.02 Intr + 166624 166785  162  2  0   -9   56   153 0.334   1.43
 6.03 Term + 166940 167067  128  0  2   91   45    69 0.383   0.36
 6.04 PlyA + 168375 168380    6                               1.05

 7.07 PlyA - 169293 169288    6                               1.05
 7.06 Term - 173971 173666  306  1  0   -2   37   384 0.886  18.43
 7.05 Intr - 177251 177130  122  0  2  110   23    60 0.359   0.99
 7.04 Intr - 185462 185310  153  0  0   96   37   105 0.004   5.32
 7.03 Intr - 188249 188083  167  1  2   58   55   100 0.008   2.38
 7.02 Intr - 188897 188763  135  1  0   43   68   143 0.004   6.56
 7.01 Init - 199884 199823   62  0  2   77   65    75 0.070   4.87
 7.00 Prom - 202703 202664   40                              -5.95

 8.00 Prom + 210797 210836   40                              -3.95
 8.01 Init + 210894 210986   93  2  0   84   36    65 0.793   1.33
 8.02 Intr + 211287 211791  505  2  1   59   23   644 0.656  46.62
 8.03 Term + 211828 212003  176  2  2  -11   32   259 0.839   7.34
 8.04 PlyA + 212187 212192    6                               1.05

 9.00 Prom + 212993 213032   40                              -3.45
 9.01 Init + 215010 215101   92  2  2   90   69    77 0.835   6.01
 9.02 Term + 223366 223513  148  1  1   83   49    90 0.787   0.99
 9.03 PlyA + 224113 224118    6                               1.05

10.02 PlyA - 224448 224443    6                               1.05
10.01 Term - 230196 229756  441  0  0   81   48   375 0.991  27.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 188399 188950  552  1  0   49   39   313 0.929  18.36

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_1|28_aa
XLWVECKLLEVFEDVTKEAMRIGNEAVV

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_1|87_bp
ngcttatgggtggaatgcaagctattggaagtatttgaagatgtgaccaaggaagctatg
agaattggaaatgaggcagttgtttag

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_2|134_aa
MERLDSDSNPHKTIKSTGKGVFGNERQPFCQNYGLPHFSTPGLHGFKALDQRKALAWGSG
RLKGFAKAQMTIVAGVCPAEASFNQQNGQVDPQVDESHKMVVRFLAEKPVSKGKLTLSFL
RGPPDIFPSTTPTI

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_2|405_bp
atggagagactggacagtgactcaaatccacataaaacaataaagagtactggtaaaggg
gtatttgggaatgagaggcagcctttctgccagaattatgggcttcctcatttctctacc
ccaggactgcatgggtttaaggcactggaccaaaggaaagccttggcatggggttctgga
agactaaagggatttgccaaagcccagatgactatagttgcaggagtgtgtcctgctgag
gccagcttcaaccagcagaacggccaagttgacccacaggttgatgagtcacacaaaatg
gttgttcggtttttggctgagaagccagtgagcaaagggaagctcactctcagctttcta
cgcggtcctcccgacatcttcccaagcactaccccgaccatctga

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_3|334_aa
MNAFAQDRVLKWSHKSRPVRVFIEVATCRGPENTTLPRSPGKNPPPQVVALARDGCIPRR
DSSWAVRVGLELLEVFAISTGYRRLIALDFDNIKTGLPQENPSVSFRVMQKHTHAICTLW
TSTMEKELTIVVCILDSKIPRKISDWHSLGQVLNQLWVWSGMEECDFHAGTTQFSWEEPV
AKQTHPLPNEHFGNGNTREEAMEKNSEGLSAVLPRHGKQAKQLRGSLAAEEARLAEPWKS
RQGALEHSVFQCLPLATWGVMRMASIAPWSQQSQQALGVPDMSWRKASVKQLPGIWEAAE
STLVLPRRLPLSSYTDDQQQSGQVVPPTTTCSDQ

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_3|1005_bp
atgaacgcgtttgcccaagaccgagtgttaaaatggagtcataagagtcgccccgtgcgg
gtgtttatagaagttgcaacctgcagaggcccggagaacacaaccctcccgagaagccca
ggtaagaacccccctccccaggtcgtggctctggctcgggatggctgcatacccagaagg
gacagcagctgggcagtccgagtgggcttggagttactggaagtatttgctatcagcact
ggttatcgcaggctgatagctttagattttgacaatattaagactggccttcctcaggag
aatccaagtgtgtccttcagggtaatgcagaaacatacacatgcgatttgtactctctgg
acttccactatggaaaaggaactgacaattgttgtctgtatcctcgattcaaaaatacca
aggaagatctctgattggcatagcttgggccaagtgctcaatcaattgtgggtgtggagt
ggcatggaggaatgtgatttccatgctggaaccacacagttcagttgggaagaaccagtt
gcaaagcagacacatcctctgcccaatgagcactttggtaatgggaacacaagggaggag
gcaatggaaaaaaactctgaaggcctttcagctgtgttgccgcgtcatggaaaacaagcc
aagcaactgcgtggaagcctcgcagcagaggaggctaggctcgcagagccctggaagagc
aggcagggggctctggagcattcagtctttcagtgtttgccattagcgacctggggagtg
atgcgaatggcttcaattgctccgtggagccagcaatcacagcaggctttgggggtccct
gacatgagctggagaaaagcctctgtgaagcagctgccagggatttgggaagctgcagag
agcaccctagttcttcccagaaggctgcccctgagctcttatactgatgaccaacaacag
agtggtcaagtggtgccaccaacaaccacttgttctgatcagtga

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_4|249_aa
MAEDADMRNELEEMQRRADQLADECLEQSRGSINTRKEGGKEGRMEGRKEGRKEGRKEGR
KEGRKEGKGRKESKKEPENGGTEQSGHLPEAPQLGNEITSIPAQSLSPQSALSRTPEATL
GQPCPYYILTLPLKPIKVCSKARLKSSDAYKKAWGNNQDGVVASQPARVVDEREQMAISG
GFIRRVTNDARENEMDENLEQVSGIIGNLRHMALDMGNEIDTQNRQIDRIMEKVSTWQSA
SPYCESLLK

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_4|750_bp
atggccgaagacgcagacatgcgcaatgagctggaggagatgcagcgaagggctgaccag
ttggctgatgagtgcctagagcagagtagaggctcaataaacactaggaaggaaggaggg
aaggaaggaaggatggaaggaaggaaggaaggaaggaaggaaggaaggaaggaaggaagg
aaggaaggaaggaaggaagggaaaggaaggaaggaaagtaaaaaggagcctgagaacgga
ggcacagaacagtcaggacacttgcctgaggccccacagttagggaatgaaatcaccagc
attccagcccagagtctgtctccacagtctgcgctgtcaaggactcctgaggccaccctg
ggccagccctgtccatactacatcctgactttgccactaaagcctatcaaagtgtgctca
aaggcaaggcttaaatcaagtgatgcttacaaaaaagcctggggcaataatcaggacgga
gtggtggccagccagcctgctcgtgtagtggacgaacgggagcagatggccatcagtggc
ggcttcatccgcagggtaacaaatgatgcccgagaaaatgaaatggatgaaaacctagag
caggtgagcggcatcatcgggaacctccgtcacatggccctggatatgggcaatgagatc
gatacacagaatcgccagatcgacaggatcatggagaaggtgagcacgtggcagtcagca
agtccctactgcgagtcacttctgaaatga

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_5|157_aa
MPDMAMGKEGAEERDIRRWHRQDAVTDLMRGWFLASVPRQRSLEGGGGEHVMVVHDEFEV
PVERGGGGGRGDGSDDDDVGRGDGWSGGGGDGGSGSGGGDGGDDGDYNGGGNKGGAWEEE
EEAARQCGETKLFFSQPVSLNDYQLCTYCGAVLPNFY

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_5|474_bp
atgccagatatggccatgggaaaggagggggcagaggaaagagatattcggaggtggcat
agacaagacgcggtgactgatttaatgagggggtggttcctggcttcagtcccaaggcag
agaagtttagaagggggcggtggtgagcatgtcatggtggttcatgatgagtttgaagtg
cctgtggagagaggtggaggaggtggcagaggtgatggaagtgatgatgatgatgttggc
agaggagatggatggtcaggtggtggtggtgatggtggcagtgggagcggtggtggtgat
ggaggtgatgatggtgattataatggtggtggtaataaaggtggagcctgggaagaagag
gaggaggctgccaggcagtgtggggaaaccaaactgtttttctctcaaccagtatcactc
aatgattaccagctttgtacctattgtggtgctgtacttcccaatttttattga

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_6|134_aa
MSTRNGLEKAQKRGREETSGTMSTKGRETQSFRMEKKVGEREERRAAALLEAQTKSSQNQ
GYDTQFRALPFLVSPRFWAPTGSPVPAVEATCGVGSRLGAKAECSLPGQVGGTSPVGPSK
TQAKVPLATDVPSW

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_6|405_bp
atgagcacaaggaacggcctggaaaaggcacagaaaagaggcagagaggaaacttccggc
actatgtccacaaaaggcagggaaacacaaagcttcagaatggaaaaaaaggttggagaa
agagaggagagaagagctgctgcccttctggaagcccagactaagagctcccagaaccag
ggctatgacacccaatttagggctctgccattcctggtgtctccaagattctgggcacca
acaggttccccagtgcctgcagtggaagccacttgtggtgtgggatccaggctaggagca
aaagccgaatgcagcctgccaggccaagtgggtggaacaagtccagtgggcccaagcaaa
actcaggcaaaggtgccattggccacagatgttcccagctggtga

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_7|314_aa
MWKKKIYGQRKESEVRKTEVSIKIHASGFDKLLESMFCILLVVEAFSLQKVVKMLEEVVA
GWREVRKVDVELFANFPCNCKRISFEDCSQLVVVNFRWSATMLLIFKALISFAKLFEPPL
YCLITLINYHLLKGSLEHRTSPGLRHQPVLSLSLAGSVASEFPAKVLKMFENFPLITKPT
PLAGDMPTNPKALVTQRKQPFFRAFITCNHAFYRQPKYPRKSTPRRNKLGHYVIIKFLLT
TESPMKKTEDNSTPVFIVDVKANKHQIKQAVKKLCDIDVAKVNTLVRTDREKKAHILLGP
DYDALDVANKIGII

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_7|945_bp
atgtggaagaagaagatttatggacagagaaaagaaagtgaggtacggaaaacagaagtg
agcataaaaatccatgcttcgggatttgacaaactcttggaaagcatgttctgcatcctg
ctggtagtggaagcattttccctgcaaaaagttgtcaagatgctggaagaagtggtagct
ggttggcgagaggtcagaaaggtcgatgttgagttatttgccaacttcccgtgtaattgt
aagaggatcagctttgaagattgctctcaattggttgttgtaaatttccgatggtcagcc
actatgctcctcatcttcaaggctctcatctcctttgcaaagctttttgaaccaccactg
tactgtctgataactttaataaactaccatcttcttaaaggctctctggagcacaggact
tcacctgggctccggcatcaaccagtattaagtttatccttagcaggttcagtggcctct
gagtttcctgctaaggtgctgaaaatgtttgaaaactttccactgatcacaaaacccaca
ccactcgctggtgatatgcccacgaatcccaaggctttagtcacacaaagaaaacagccg
ttcttccgcgctttcataacatgtaaccatgccttttacaggcagcccaaatatcctcga
aagagcacccccaggagaaacaagcttggccactatgtcatcatcaagtttctgctgacc
actgagtctcccatgaagaagacagaagacaacagcacacctgtgttcattgtggatgtt
aaagccaacaagcaccagatcaaacaggctgtgaaaaagctctgtgacattgatgtggcc
aaggtcaacaccctggttaggactgatagagagaagaaggcacatattcttctgggtcct
gattatgatgctttggatgttgccaacaaaattgggatcatctaa

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_8|257_aa
MSLDVMIELYRGNIWNDAKTVNVITTAHFSKVSAAPPKRSNKDPPFAAQASHHLVPPEII
QSLLMTVANNLVTDKNSGEVMTVGINAIKEKTARCSLAMTEELLQDLAQYKTHKDKNVMM
CARTLIQLFRTLNPQMLQKKFQGKPTEASIEARVQEYGELDAKDYIPGAEVLEGEKEENA
ENDEDGWENTSLSEEQDADEQQEISKKLNRMPMEEWKAKAAAISTSQVLTQEDFQKIRMA
QLRKELDAAPGKSQKRK

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_8|774_bp
atgtctttagatgtaatgattgaactctacagagggaacatctggaatgatgccaaaact
gtcaatgttatcacaactgcacatttctctaaggtttctgcagccccaccaaagagaagt
aacaaagatcctccgtttgctgcacaagcatctcatcacctagtacccccagagattatt
caatcattgcttatgactgtggcaaacaatttggttaccgacaagaattctggagaagtc
atgacagtaggaatcaatgctataaaagagaaaacagctcgatgttctctggccatgact
gaagaacttctccaagacctggctcagtataaaacacacaaggataagaatgtaatgatg
tgtgctagaactttgattcagctcttccgaacactgaatcctcagatgctgcagaagaaa
ttccagggtaagcctactgaggcctccatagaagcaagagtacaagaatatggagaatta
gatgctaaagattacattccaggagcagaagttctggaaggtgagaaagaagagaatgct
gaaaatgatgaagatggatgggaaaataccagtctcagtgaggagcaggatgctgatgaa
cagcaagaaatctccaagaagctgaacaggatgcccatggaggagtggaaagccaaagct
gcagccatcagcaccagccaagttttaactcaggaagacttccagaaaatccgcatggcc
caactgagaaaagaacttgatgctgcccctgggaaatcccagaagaggaaataa

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_9|79_aa
MAPASAPGERLRLLLLMAEGKGKLACSGITCPGEGFTSWQFFTICGELNDVPPKDMSKDL
SPATCECDLIWKQGLCKCN

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_9|240_bp
atggcgccagcatctgctcctggtgagagactcaggctgcttctactcatggcagaaggc
aaagggaagctggcctgttcaggaatcacgtgccctggagaaggatttacatcttggcag
ttttttaccatctgtggtgagttgaatgatgtccccccaaaagatatgtccaaagaccta
agtcctgctacctgtgaatgtgaccttatttggaaacagggtctttgcaagtgcaattaa

>gi568815578f:10175492_10406194|GENSCAN_predicted_peptide_10|146_aa
THNDPESILKDDECTQTELQLIAEAFCSALESVVGSLEHDGGEILTDMKYGHLWSVQADS
PCVANWPDLLSQCGCGLYNSQEELNWSFLRSTRRPFVPQSCLPHEAVGSASNLTLDCLTA
KLSGLQVAVETANLILDLSYVIEDKN

>gi568815578f:10175492_10406194|GENSCAN_predicted_CDS_10|441_bp
actcacaacgacccagaaagcattctcaaagatgatgaatgtactcaaacagaacttcaa
ttaattgctgaagcattttgcagtgccctagaatctgttgttggctctttagaacatgat
ggaggtgaaattctcactgacatgaagtatggacacctttggtcagttcaggcagattct
ccctgtgttgctaactggccagatttgctttcacagtgtggctgtggattatacaatagc
caggaagaactcaactggtctttcttaagaagcacacgtcgtccatttgtgccacaaagc
tgccttccacatgaagctgtgggctcagccagcaacctgaccttggactgtttgactgca
aagcttagtggcctacaggtggctgtagagacagccaatttgattttggatctttcatat
gttattgaagataaaaactaa