GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:10:29 Sequence gi568815593r:115731807_115941693 : 209887 bp : 38.35% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 953 948 6 1.05 1.03 Term - 3982 3872 111 1 0 78 44 145 0.804 6.68 1.02 Intr - 13508 13330 179 0 2 34 80 135 0.647 6.02 1.01 Init - 15313 15217 97 1 1 54 103 94 0.958 8.02 1.00 Prom - 16293 16254 40 -4.25 2.04 PlyA - 17152 17147 6 1.05 2.03 Term - 19474 19410 65 2 2 54 32 106 0.621 -1.33 2.02 Intr - 19584 19513 72 1 0 105 87 6 0.585 0.66 2.01 Init - 20530 20374 157 1 1 68 39 139 0.993 7.12 2.00 Prom - 21387 21348 40 0.45 3.00 Prom + 32852 32891 40 -4.95 3.01 Init + 44473 44568 96 0 0 48 35 105 0.248 1.56 3.02 Intr + 53948 54182 235 1 1 56 89 216 0.706 14.84 3.03 Intr + 63923 64020 98 0 2 61 105 55 0.143 3.31 3.04 Term + 70231 70386 156 0 0 55 52 110 0.400 1.05 3.05 PlyA + 72106 72111 6 1.05 4.10 PlyA - 72948 72943 6 1.05 4.09 Term - 73656 73627 30 0 0 111 49 34 0.632 -1.32 4.08 Intr - 74712 74543 170 1 2 42 82 117 0.754 5.24 4.07 Intr - 79509 79355 155 2 2 32 47 82 0.687 -2.71 4.06 Intr - 81452 81375 78 1 0 99 98 90 0.976 8.85 4.05 Intr - 84630 84422 209 0 2 66 99 369 0.090 32.75 4.04 Intr - 100858 100796 63 1 0 90 83 52 0.615 2.90 4.03 Intr - 105958 105822 137 2 2 101 56 156 0.846 13.07 4.02 Intr - 109760 109584 177 0 0 8 27 294 0.413 14.27 4.01 Init - 111131 110474 658 2 1 29 64 341 0.678 21.27 4.00 Prom - 119528 119489 40 -3.85 5.04 PlyA - 119793 119788 6 1.05 5.03 Term - 125231 125098 134 0 2 35 38 138 0.270 0.77 5.02 Intr - 139872 139781 92 2 2 125 69 73 0.210 7.82 5.01 Init - 145703 145555 149 2 2 83 1 92 0.418 -0.39 5.00 Prom - 145882 145843 40 -3.85 6.02 PlyA - 146400 146395 6 -1.75 6.01 Sngl - 147652 146615 1038 1 0 42 42 267 0.887 14.36 6.00 Prom - 147811 147772 40 -10.35 7.02 PlyA - 148182 148177 6 1.05 7.01 Sngl - 149138 148497 642 2 0 50 34 302 0.803 16.93 7.00 Prom - 149348 149309 40 -7.25 8.02 PlyA - 149635 149630 6 1.05 8.01 Sngl - 150521 150129 393 2 0 79 42 297 0.757 20.19 8.00 Prom - 157386 157347 40 -6.65 9.00 Prom + 157865 157904 40 -4.05 9.01 Init + 160430 160447 18 1 0 121 6 27 0.167 -2.13 9.02 Intr + 163281 163352 72 2 0 126 74 69 0.689 7.98 9.03 Intr + 171079 171186 108 0 0 89 94 61 0.633 6.36 9.04 Term + 181556 181684 129 1 0 80 42 97 0.488 1.50 9.05 PlyA + 182149 182154 6 1.05 10.03 PlyA - 182298 182293 6 1.05 10.02 Term - 189814 189690 125 2 2 33 48 159 0.501 3.97 10.01 Init - 193912 193702 211 1 1 86 85 73 0.460 5.79 10.00 Prom - 194130 194091 40 -4.45 11.00 Prom + 198784 198823 40 -8.05 11.01 Init + 202903 203141 239 0 2 76 47 189 0.556 11.13 11.02 Intr + 204349 204459 111 1 0 54 41 143 0.513 4.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 84591 84422 170 0 2 67 99 362 0.801 34.25 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_1|128_aa MHSEATKWITLEGNDRDKAKENALNLESLFLDALDSSEQREALYVWEKVRDENKSLCLVI QRILDGPRPSRWYLHKSARTTVLLDLQYSLKQGSFGEQYGQFIQVENSCPPYVPAGRWGS NARDSLLS >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_1|387_bp atgcacagtgaagccacaaaatggataactcttgagggaaatgacagagacaaagccaaa gagaatgcactgaatcttgagagtcttttcctggatgctttagatagctcagaacagaga gaagccctgtatgtttgggagaaagtaagggacgagaacaagagtctctgcctggtaatc cagagaattctggatggtccaagaccatcaaggtggtacctccacaagtctgcaagaacc acagtgttactggatttgcagtactccctaaagcaggggagtttcggtgaacagtacggg cagttcattcaagtggaaaattcctgtcctccgtatgtgccagcaggtcggtggggttcc aatgcccgtgacagcttgttgtcttga >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_2|97_aa MDAAGGDYPKQTNTGTENQMLHVLTYKWELHNENTWIPGGKQETSKAYLRVEVQHAGTSM NCFHPTDFCWSLWEALGEMQTVFKQPDSIPDGAGIEA >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_2|294_bp atggatgcagctggaggcgattatcctaagcaaactaacacaggaacagaaaatcaaatg ctgcatgttctcacttataagtgggagctacacaacgagaacacatggataccaggaggg aaacaagagacatcgaaggcctacttgagagtggaggtccagcatgctggtaccagcatg aactgtttccatccaactgacttttgctggagtttgtgggaagcccttggtgaaatgcag actgtattcaagcaacctgattccataccggatggagctggaattgaagcttaa >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_3|194_aa MSKAKTYGEDCSKEEQVTVQVKVPSTVLRTNQITANRQGSPSSTLAPESASQQQPSSSDH SKVAENPSVSRHFTSESPAVSECQDSENLHKINSGFAPADYLTSTVLTVSYLQHQNPPDF WGPLLKVCISGLHPRPTDLDSLEDLDNKGEKKEMTLSDNPKCFRECLEISLLDSVIGKFL SGKMASELKPDGQE >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_3|585_bp atgagcaaagcaaagacatatggagaggattgcagcaaggaggagcaggtgacagtgcaa gtaaaagtacctagcacagtgcttcgcacaaaccagataacagccaatcggcaagggtct ccctccagtacccttgctccagagtcagccagtcaacagcagccttcctccagtgaccac agtaaggtagctgaaaatccttcagtctctaggcacttcacttctgaaagcccggcagtc tctgaatgtcaggattctgaaaacctacataagatcaactctggctttgctccagcagac tatcttacaagcacagttcttactgtgtcttaccttcagcatcagaatcccccagatttc tggggaccgttgttaaaagtgtgtatttctggtcttcatcctagacccacagacttggac tctctggaggatttagacaataaaggagagaaaaaagaaatgacactttcagataaccct aagtgcttcagagagtgcttagagatttcactgttagatagtgtgattgggaaattcctc tctgggaagatggcatctgaactgaaacctgatgggcaagaatga >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_4|558_aa MRANSHTWKNAKPEAEIITFRYRTTAPISQLQTSPVTSRVNNLQPAGKKPDKNIRSSHIF LFNKETPESHTGQSVTYRNMQKRDCTERAVIKRRKPKFLCYERTTVKNGSPSVQRTERSP PGKDGSRRRGARRTSSTPSSSSRANCSRGRQEGSRAGWALGDWAQGSAARLGCGCPIGEG RAPGSLQGLARAARKACSPPTGGGTARAKPRRWLTARQGVSPSKMAEEPQSVLQLPTSIA AGGEGLTDVSPETTTPEPPSSAAVSPGTEEPAGDTKKKIDILLKAVGDTPIMKTKKWAVE RTRTIQGLIDFIKKFLKLVASEQLFIYVNQSFAPSPDQEVGTLYEPSSETRAQPAPHEME QTEVLKPRTLADLIRILHQLFAGDEVNVEEVQAIMEAYESDPTEWAMYAKFDQYRYTRNL VDQGNGKFNLMILCWGEGHGSSIHDHTNSHCFLKMLQGNLKETLFAWPDKKSNEMVKKSE RVLRENQCAYINDSIGLHRVENISHTEPAVSLHLYSPPFDTCHAFDQRTGHKNKVTMTFH SKFGIRTPNATSGSLENN >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_4|1677_bp atgagggcaaattcccatacgtggaaaaatgccaaaccagaagctgaaatcataactttt aggtatagaaccacagctccaatcagtcaactacaaacgtcgccggttacttcgagggta aataatttgcagccagctggtaaaaagcctgataaaaatattaggtcctcacatatcttc cttttcaataaggagacacctgaaagtcacactgggcagtctgtcacgtatcgaaacatg cagaagagagactgtacagaacgtgcagttatcaaacgcaggaagccaaagttcctgtgc tatgagaggaccacagtgaaaaacggtagcccatctgtccaaagaacagaaagaagtcca cctggaaaggatgggagtcggcggaggggggcgagaagaacgagttccacgccgtcttcc tcctcccgggctaactgcagccgcgggcggcaggaaggcagccgcgcggggtgggctctt ggggactgggcgcagggctccgcggcgcggctgggctgcggctgcccgatcggcgagggc cgggccccgggaagtctccagggcctcgcccgcgcagccaggaaagcatgcagcccaccg acgggcggaggaaccgcccgggcaaagccacgtcggtggctgaccgcgcggcagggagtg tctccaagcaagatggcggaggagccgcagtctgtgttgcagcttcctacttcaattgct gctggaggggaaggacttacggatgtctccccagaaacaaccaccccggagcccccgtct tccgctgcagtttccccgggaacagaggaacctgctggcgacaccaagaaaaaaattgac attttgctaaaggctgtgggagacactcctattatgaaaacaaagaagtgggcagtagag cgaacacgaaccatccaaggactcattgacttcatcaaaaagtttcttaaacttgtggcc tcagaacagttgtttatttatgtgaatcagtcctttgctccttccccagaccaagaagtt ggaactctctatgagcccagcagtgagacgcgcgcgcagccagctccccacgagatggaa cagaccgaagtgctgaagccacggaccctggctgatctgatccgcatcctgcaccagctc tttgccggcgatgaggtcaatgtagaggaggtgcaggccatcatggaagcctacgagagc gaccccaccgagtgggcaatgtacgccaagttcgaccagtacaggtatacccgaaatctt gtggatcaaggaaatggaaaatttaatctgatgattctctgttggggtgaaggacatggc agcagtattcatgatcataccaactcccactgctttctgaagatgctacagggaaatcta aaggagacattatttgcctggcctgacaaaaaatccaatgagatggtcaagaagtctgaa agagtcttgagggaaaaccagtgtgcctacatcaatgattccattggcttacatcgagta gagaacatcagccatacggaacctgctgtgagccttcacttgtacagtccaccttttgat acatgccatgcctttgatcaaagaacaggacataaaaacaaagtcacaatgacattccat agtaaatttggaatcagaactccaaatgcaacttcgggctcgctggagaacaactaa >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_5|124_aa MDEAGNHHSQQTNTGTENQTLHVLIHKWELHNENTWTQGGEHHTPGLVRGYVQMYGCGQG VQCTQGYITTMIMKFKLDKEKCWQYSTKKISSGGYQRLVVGIKGGTGKAKMLIKGHKVSV RLEE >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_5|375_bp atggatgaagctggaaaccatcactctcagcaaacgaacacgggaacagaaaaccaaaca ctgcatgttctcattcataagtgggagttgcacaatgagaacacatggacacagggaggg gaacatcacacaccgggacttgtcagggggtatgttcagatgtatgggtgtgggcaagga gttcagtgtacacaaggctatatcactaccatgattatgaaatttaagctggataaggaa aaatgctggcaatactcaactaagaaaatcagtagcggtggttaccagcggctggtggtg ggaataaaaggaggaacggggaaagcaaaaatgttgatcaaagggcacaaagtttcagtc agactggaggaataa >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_6|345_aa MIISIDAEKAFDKIQHPFTLKTLSKLRIDGTYLKIIRAIYDKTIANIILNGQKLEAFPLK TGTRQGCPLSPLLFNTVLEVLARAIRQEKEINCIHRGREKVKLSLFADDMIVYLENSIVS AQNLLKLISNFSKVSGYKTNVQKSQAFLYTNNRQTESQIRSELPFTIATKRIKYLGIQLT RDVKGLFKENYKLLLKEIRQNTNKWKNIPCSWIGRISIVKMATLSNVIYRFNVIPIKLPL TFFTELQKSTLNFIWNQKRACTAKTILNKRNKAEGITIPDFKLYYKARVTQTAWYWCQNR YTHQGNRTEASEIMPHIYNHLTFGKSDKNKKWGKDPCLIDVGKTG >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_6|1038_bp atgattatttcaatagatgcagaaaaggcctttgataaaattcaacaccccttcacacta aaaactctcagtaaactacgtatagatggaacatatctcaaaataataagagctatttat gacaaaaccatagccaatatcatactgaatgggcaaaagctggaagcattccctttgaaa actggcacaagacaaggatgccctctctcaccactcctattcaacacagtattggaagtt ctggccagggcaatcaggcaagagaaagaaataaactgtattcatagaggaagagagaaa gtcaaactgtctctgtttgcagatgacatgattgtatatttagaaaactccatagtctca gcccaaaatctcctgaagctgataagtaacttcagcaaagtctcaggatacaaaaccaat gtgcaaaagtcacaagcattcctatacaccaataatagacaaacggagagccaaattagg agtgaactcccattcacaattgctacaaagagaataaaatacctaggaatacaacttaca agggatgtgaagggcctcttcaaggagaattacaaactactgctcaaggaaataagacag aacacaaacaaatggaaaaacattccatgctcatggataggaagaatcagtatcgtgaaa atggccacactgtccaacgtaatttatagattcaatgttatccccatcaagctaccattg actttcttcacagaattacaaaaaagtactttaaacttcatatggaaccaaaaaagagcc tgtacagccaagacaatcctaaacaaaaggaataaagctgaaggcatcacaatacctgac ttcaaactatactataaggctagagtaacccaaacagcatggtactggtgccaaaacagg tatacacaccaagggaacaggacagaggcctcagaaataatgccacacatctacaaccat ctgacctttggtaaatctgacaaaaacaagaaatggggaaaagatccctgtttaatagat gttgggaaaactggctag >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_7|213_aa MVKGSTQQEELAFLNIYAPNTGAPRFIKQVLRNIQRDSDSHIIVGEFNTPLSILDRSTRQ KINKDIRNFNSAVDQVDLIDIYRMLHPKSTEYTFFSAPHHTYSKIDHIIGSKILPSKCKR TEIIMNRLSDHSAIKLELRIKKLTQNLTTTWKLNNLLQSDYWVNNEIKVEINEFFETNEN KDTMYQNFWDTAKAVFQWKFIALKSPQEKAGKI >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_7|642_bp atggtaaagggatcaacccaacaagaagagctagctttcctaaatatatatgcacccaac acaggagcacccagattcataaagcaagttcttagaaacatacaaagagattcagactcc cacataatagtgggagagtttaacaccccattgtcaatattagacagatccacaagacag aaaattaacaaggatattcggaacttcaactcagctgtggaccaagtggacctaatagac atctacagaatgctccaccccaaatcaacagaatatacattcttctcagcaccacatcac acttattctaaaattgaccacataattggaagtaaaatactccccagcaaatgcaaaaga acagaaatcataatgaacagactctcagaccacagtgctatcaaattagaactcaggatt aagaaactcactcaaaaccttacaactacatggaaactgaacaacctgctccagagtgac tactgggtaaataatgaaattaaggtagaaataaatgagttctttgaaaccaatgagaac aaagatacaatgtaccagaatttctgggacacagctaaagcagtgtttcagtggaaattc atagcactaaaatctccacaggagaaagcgggaaagatctaa >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_8|130_aa MRKNQCKKSENSKSQTASSPPKVHSSSKAREQNWKVNEFDELTEVGFRWWVITNSSELKE HVLTQCKEAKSLDKRSEELLTRITSLERNINDLMELKNTAGELHEAHTVSIAKSIKQKKG YRRLKLNLMK >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_8|393_bp atgaggaaaaaccagtgcaaaaagtctgaaaattccaaaagccagactgcctcttctcct ccaaaggtccacagttcctcaaaagcaagggaacaaaactggaaggtgaatgagtttgat gaattgacagaagtaggcttcagatggtgggtaataacaaactcctctgagctaaaggag catgttctaacccaatgcaaggaagctaagagccttgataaaaggtcagaggaattgcta actagaataaccagtttagagaggaacataaatgacctgatggagctgaaaaacacagca ggagaacttcatgaagcacacacagtatcaatagccaaatcaatcaagcagaagaaagga tatcggagattgaagctcaacttaatgaaataa >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_9|108_aa MGSGLEVFVETLDKCFENVCELDLIFHVDKVHNILAEMVMGGMVLETNMNEIVTQIDAQN KLEKSEAGLAGAPARAVSAVKNMNLPEIPRNINIGDISIKVPNLPSFK >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_9|327_bp atggggagcggtttggaggtatttgtggaaacattagacaaatgttttgaaaatgtctgt gagctggatttgattttccatgtagacaaggttcacaatattcttgcagaaatggtgatg gggggaatggtattggagacaaatatgaatgagattgttacacaaattgatgcacaaaat aagctggaaaaatctgaggctggcttagcaggagctccagcccgtgctgtatcagctgta aagaatatgaatcttcctgagatcccaagaaatattaacattggtgacatcagtataaaa gtgccaaacctgccctcttttaaataa >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_10|111_aa MQPGKESSYPEYVSIPVRINLLPIQDGRHPKLLATKWLVSSSEGAESGAEPCSLLLVDGI FDTSSSSISLGRPCEEPWLWVTQISTAGGSSQRVTQLSGHCQQLKKRVPQS >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_10|336_bp atgcagccaggaaaggaaagttcatatccagaatatgtatcaattccagtcagaattaat ttgctccctattcaagatggaaggcatccaaagttacttgccaccaagtggctggtttcc tcaagcgaaggtgctgaatcaggggctgagccttgctctctgttactggtagatgggata tttgacacaagcagtagctcaatcagccttggacgtccttgtgaggagccatggctttgg gtgacgcagatctctacagctggtggcagttctcaaagagtaactcaactgagtggtcac tgccagcagctgaagaaacgagtgccgcagtcctga >gi568815593r:115731807_115941693|GENSCAN_predicted_peptide_11|117_aa MPKDVDGPAEKQGIPRDSLGTTGRRAVGLKESFADVPPSPTSSLLVWGDHVNPHLKTCSS DTSTQSLGLKAWGATLPLEWQDSNLVVDQKALLPERLLPHTLEEGMLHRETKKNLNS >gi568815593r:115731807_115941693|GENSCAN_predicted_CDS_11|351_bp atgcctaaggatgtagatggcccagcagagaagcagggaatccctcgagacagtttagga accacaggacgccgggccgtggggctgaaggagagctttgctgatgtgccaccctcaccc acatcaagcctccttgtctggggagatcacgtgaatccacatttgaagacctgctcttcg gataccagcacacagagcctgggtcttaaagcttggggtgccacattgcctcttgaatgg caggactctaatctggttgtggatcaaaaggccctgcttccagagaggctcctgccccat actctggaggaaggaatgctgcacagagagaccaagaagaatctgaacagn