GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:15:39

Sequence gi568815583r:59542136_59752277 : 210142 bp : 41.71% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1263   1386  124  2  1   73  121    63 0.890   8.48
 1.02 Intr +   2912   3062  151  0  1   46   61    96 0.619   1.00
 1.03 Intr +   5977   6093  117  1  0   36   72   111 0.758   2.96
 1.04 Intr +   9890  10093  204  2  0   38   77   117 0.523   2.99
 1.05 Term +  10748  11006  259  2  1   46   35   172 0.378   1.74
 1.06 PlyA +  13394  13399    6                               1.05

 2.03 PlyA -  16799  16794    6                               1.05
 2.02 Term -  27207  27052  156  0  0   64   42   104 0.480   0.35
 2.01 Init -  33242  33123  120  2  0   21  110    71 0.320   2.84
 2.00 Prom -  37632  37593   40                              -3.65

 3.00 Prom +  49733  49772   40                              -4.35
 3.01 Init +  53978  54114  137  1  2   76   79    93 0.915   6.86
 3.02 Intr +  54183  54219   37  0  1   85   85    50 0.615   1.75
 3.03 Term +  54264  54317   54  2  0  113   49    15 0.482  -3.32
 3.04 PlyA +  56689  56694    6                               1.05

 4.00 Prom +  57175  57214   40                              -2.95
 4.01 Init +  67766  67818   53  1  2   92   42   103 0.591   6.78
 4.02 Intr +  71853  71970  118  0  1   43  100    50 0.359   1.25
 4.03 Term +  72082  72264  183  0  0   44   48   142 0.376   2.36
 4.04 PlyA +  72799  72804    6                               1.05

 5.00 Prom +  75111  75150   40                              -6.95
 5.01 Sngl +  76104  77420 1317  2  0   80   53  1017 0.999  91.21
 5.02 PlyA +  77852  77857    6                               1.05

 6.03 PlyA -  79877  79872    6                               1.05
 6.02 Term -  86098  86013   86  2  2   55   41   115 0.421   0.34
 6.01 Init -  91043  90965   79  2  1   56  100    72 0.343   6.47
 6.00 Prom -  93106  93067   40                              -5.65

 7.10 PlyA -  93146  93141    6                               1.05
 7.09 Term -  93373  93243  131  2  2   14   41   153 0.439   0.56
 7.08 Intr - 100107 100001  107  2  2   19  106    70 0.048   0.84
 7.07 Intr - 115181 114958  224  2  2  -20   76   259 0.209   9.90
 7.06 Intr - 118902 118830   73  2  1   66   95    73 0.973   4.29
 7.05 Intr - 121985 121938   48  1  0   59   98    88 0.756   3.78
 7.04 Intr - 130604 130502  103  0  1  107   97    58 0.970   6.91
 7.03 Intr - 135952 135776  177  2  0   63   78   166 0.996  12.07
 7.02 Intr - 137633 137457  177  0  0   87   94   161 0.998  15.57
 7.01 Init - 138190 138106   85  1  1   36   99   123 0.567   9.23
 7.00 Prom - 138822 138783   40                              -6.55

 8.00 Prom + 140248 140287   40                              -8.25
 8.01 Init + 141854 141981  128  1  2   68   92    52 0.188   3.28
 8.02 Intr + 146686 146876  191  1  2   32   55   132 0.231   2.61
 8.03 Intr + 146942 147183  242  0  2   31   42   321 0.387  18.15
 8.04 Intr + 159722 159766   45  1  0   99   87    34 0.427   2.09
 8.05 Intr + 163982 164171  190  1  1   54   53   155 0.639   6.84
 8.06 Term + 168041 168108   68  0  2  101   44    47 0.601  -1.28
 8.07 PlyA + 168713 168718    6                               1.05

 9.00 Prom + 173226 173265   40                              -7.65
 9.01 Init + 174595 174654   60  0  0   47  100    39 0.419   1.88
 9.02 Intr + 180520 180669  150  0  0  104   80    81 0.628   8.34
 9.03 Intr + 183294 183394  101  2  2   40   80    67 0.493  -0.91
 9.04 Term + 186316 186622  307  1  1   90   39   125 0.495   1.40
 9.05 PlyA + 186729 186734    6                               1.05

10.00 Prom + 187421 187460   40                              -1.55
10.01 Init + 189575 189622   48  1  0   61   82    50 0.530   2.93
10.02 Term + 193935 194042  108  2  0  109   54    48 0.522   1.03
10.03 PlyA + 195478 195483    6                               1.05

11.03 PlyA - 197574 197569    6                               1.05
11.02 Term - 199936 199763  174  1  0   57   40    75 0.083  -3.62
11.01 Intr - 204890 204647  244  1  1   74  103   197 0.971  16.38

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_1|284_aa
MVSPEETGKNRGALPKCLNPGGAAPRKREGTNCAVPRAGQEETFWTSSGSTFGLTGKSES
TAVKCAGETVGCPHGSLVNSYGAGQRQPTRIDNYKGEVGDRKEKKSGPIHKAIRSHKQMV
HFQTKSLVEAGHVFYSVLYSVHIDENFSRIKFKGVELINEKCANRAAPRITGDSQRPQGF
HVVEDLLTKRNGIQKLEVSMLGDTADSSSLAVVTLVGRSFLNSTHSLDVYNITFLIDSHI
CGQRNDSMFSKRTREHISDASPLSLCVGHFGELLEDGGSRRKAP

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_1|855_bp
atggttagcccggaagagacaggaaagaacagaggagctctgcccaagtgccttaaccct
ggaggggctgcgccgaggaagagggaggggaccaactgtgcggttccaagggcaggacag
gaagagacgttttggacttcctcaggcagtacctttggactgacaggcaagtcagaaagc
acagcagtcaagtgtgctggagagacagtggggtgcccacatgggtccttggtcaactca
tatggagcagggcaaaggcaaccaacacgaattgataattacaagggtgaagttggagac
agaaaggaaaagaaatcggggcccatacacaaagccatacgttcacacaaacaaatggtg
cacttccaaacaaaatccctggtggaggctggccacgttttttattccgttttgtattct
gtccatattgatgaaaacttcagccgaattaaatttaaaggagttgaattgatcaatgaa
aaatgtgcgaatcgagcagcccccagaatcacaggagattcacagagaccccagggcttc
cacgtggtggaagatttattgacaaagagaaatggcatacagaaattggaagtgagcatg
ctgggggacacggcagactcttccagtttagccgtggtaacacttgtggggcgttccttt
ttgaacagtacccattcccttgatgtctacaatatcacctttcttatagattcacatata
tgtggccaaaggaacgactccatgttttctaaaaggactagagaacatatatcggatgcc
tctcctctttccctttgtgttggtcattttggcgaattactggaagatggcggttcccgc
cgaaaggcgccttag

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_2|91_aa
MVHRVSEAQRPRKESLDPVEGPAKEARKSHLNACPISKDRVFLKLQDFSGGIMLPTQRAF
RAHFLSRSNRHELYRRALKYLQLSDKEWASP

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_2|276_bp
atggtccacagagtttctgaagcacaaaggccgagaaaggaatccctggacccagttgag
ggtccagcaaaagaggcaaggaagagccatttaaatgcgtgccccatttccaaagacagg
gttttcctcaagttacaggatttcagtggtggcattatgctccctacccaaagggccttc
agggcccattttctgagtcgaagtaacagacatgaactctacagaagagctttgaagtac
ttgcaactgtccgataaagaatgggcttccccctaa

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_3|75_aa
MGSSPQPGQTPAAVPEPPGQGYRRKHCSKTKLISDELELYYMTAGRERREEAHDWLLQKG
RLSAQAQHKVSVIHA

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_3|228_bp
atgggctctagtccacaaccaggccagactccagctgctgttccagagccccctgggcag
ggctatcgcaggaaacactgctcaaagacaaagctcatctctgatgagctggaattatat
tatatgacagccggaagggagagaagagaagaggcccatgactggctcctacagaaaggt
agactgtctgcccaggcccagcacaaggtgtcagtgatccatgcctga

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_4|117_aa
MAVAALCFRGAEQQQGQLLGDSETLSLKNTYTNKSPQLFKVLSLTEGIYNAVFMGMKRFY
TQQWKVTENPSTAENSLWSRAHVGVQMTLLPERGPDPDPNRGFLDLAQERIQGESVK

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_4|354_bp
atggccgtggctgccctttgcttccgtggtgctgagcagcagcaggggcagctcctgggt
gacagcgagaccctgtctctgaaaaacacatacacaaacaaaagccctcaactattcaag
gtgttgtctcttactgagggaatctataatgcagtttttatgggaatgaagagattttat
actcaacaatggaaagtcactgagaatcccagtaccgcagaaaacagcctgtggagcagg
gcccacgtaggggtgcagatgaccctgttaccagaaaggggtcccgatccagaccccaac
agagggttcttggatcttgcgcaagaaagaattcagggcgagtccgtaaagtga

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_5|438_aa
MVQWKRLCQLHYLWALGCYMLLATVALKLSFRLKCDSDHLGLESRESQSQYCRNILYNFL
KLPAKRSINCSGVTRGDQEAVLQAILNNLEVKKKREPFTDTHYLSLTRDCEHFKAERKFI
QFPLSKEEVEFPIAYSMVIHEKIENFERLLRAVYAPQNIYCVHVDEKSPETFKEAVKAII
SCFPNVFIASKLVRVVYASWSRVQADLNCMEDLLQSSVPWKYFLNTCGTDFPIKSNAEMV
QALKMLNGRNSMESEVPPKHKETRWKYHFEVVRDTLHLTNKKKDPPPYNLTMFTGNAYIV
ASRDFVQHVLKNPKSQQLIEWVKDTYSPDEHLWATLQRARWMPGSVPNHPKYDISDMTSI
ARLVKWQGHEGDIDKGAPYAPCSGIHQRAICVYGAGDLNWMLQNHHLLANKFDPKVDDNA
LQCLEEYLRYKAIYGTEL

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_5|1317_bp
atggttcaatggaagagactctgccagctgcattacttgtgggctctgggctgctatatg
ctgctggccactgtggctctgaaactttctttcaggttgaagtgtgactctgaccacttg
ggtctggagtccagggaatctcaaagccagtactgtaggaatatcttgtataatttcctg
aaacttccagcaaagaggtctatcaactgttcaggggtcacccgaggggaccaagaggca
gtgcttcaggctattctgaataacctggaggtcaagaagaagcgagagcctttcacagac
acccactacctctccctcaccagagactgtgagcacttcaaggctgaaaggaagttcata
cagttcccactgagcaaagaagaggtggagttccctattgcatactctatggtgattcat
gagaagattgaaaactttgaaaggctactgcgagctgtgtatgcccctcagaacatatac
tgtgtccatgtggatgagaagtccccagaaactttcaaagaggcggtcaaagcaattatt
tcttgcttcccaaatgtcttcatagccagtaagctggttcgggtggtttatgcctcctgg
tccagggtgcaagctgacctcaactgcatggaagacttgctccagagctcagtgccgtgg
aaatacttcctgaatacatgtgggacggactttcctataaagagcaatgcagagatggtc
caggctctcaagatgttgaatgggaggaatagcatggagtcagaggtacctcctaagcac
aaagaaacccgctggaaatatcactttgaggtagtgagagacacattacacctaaccaac
aagaagaaggatcctcccccttataatttaactatgtttacagggaatgcgtacattgtg
gcttcccgagatttcgtccaacatgttttgaagaaccctaaatcccaacaactgattgaa
tgggtaaaagacacttatagcccagatgaacacctctgggccacccttcagcgtgcacgg
tggatgcctggctctgttcccaaccaccccaagtacgacatctcagacatgacttctatt
gccaggctggtcaagtggcagggtcatgagggagacatcgataagggtgctccttatgct
ccctgctctggaatccaccagcgggctatctgcgtttatggggctggggacttgaattgg
atgcttcaaaaccatcacctgttggccaacaagtttgacccaaaggtagatgataatgct
cttcagtgcttagaagaatacctacgttataaggccatctatgggactgaactttga

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_6|54_aa
MEAISKLQAQGKEIETEPGSLAELRKVPPPSATTTLDQSAAINIETRPTSAKRS

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_6|165_bp
atggaggcaatatccaagctgcaggcacagggaaaggaaatagaaacagaacctggcagt
cttgcggagttgaggaaagtgcccccaccttcagcaaccaccacccttgaccagtcagca
gccatcaacattgagacaagacctacatcagcaaaaagatcatga

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_7|374_aa
MPVSSRPLPEDDSIEADILAITGPEDQPGSLEVNGNKVRKKLMAPDISLTLDPSDGSVLS
DDLDESGEIDLDGLDTPSENSNEFEWEDDLPKPKTTEVIRKGSITEYTAAEEKEDGRRWR
MFRIGEQDHRVDMKAIEPYKKVISHGGYYGDGLNAIVVFAVCFMPESSQPNYRYLMDNLF
KVDQELNGKQDEPKNEQEVPYVTADGNISCMQQENLQAIRKSLRALRSWDLPVTDRCRQG
PAAGTGEGRRGGGDGRVPWREARRSRGAHRTHLGEVFSPPPYFQPGECLKASTSALFCDN
VWTFVLNDVEFREVTELIKVDKVKIVACDGKKLTGQDANPWSHDEVHRSEPFKAEWNLND
DLMTSTSSFSAQEN

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_7|1125_bp
atgcctgtgtcttctagacctttaccagaagatgatagtattgaagcagatatactagct
ataactggaccagaggaccagcctggctcactagaagttaatggaaataaagtgagaaag
aaactaatggctccagacattagcctgacactggatcctagtgatggctctgtattgtca
gatgatttggatgaaagtggggagattgacttagatggcttagacacaccgtcagagaat
agtaatgagtttgagtgggaagatgatcttccaaaacccaagactactgaagtaattagg
aaaggctcaattactgaatacacagcagcagaggaaaaagaagatggacgacgctggcgt
atgttcaggattggagaacaggaccacagggttgatatgaaggcaattgaaccctataaa
aaagttatcagccatgggggatattatggggatggattaaatgccattgttgtgtttgct
gtctgtttcatgcctgaaagtagtcagcctaactatagatacctgatggacaatcttttt
aaagttgatcaagaacttaatggaaaacaagatgaaccgaaaaatgaacaggaagtacca
tatgtgactgctgatggtaacatttcttgtatgcagcaggaaaacctgcaggctatacgc
aagtctctgagggcccttcgctcctgggatctgcccgttaccgaccgttgccggcaaggc
cccgcggccggcaccggagaagggcggcgaggcggcggtgatggtcgcgtcccgtggcgg
gaggctcgtcgttcacggggcgcccacagaacccacctaggggaggtcttcagcccacca
ccgtattttcagccaggtgaatgtcttaaagcatccactagcgccctattctgcgataat
gtgtggacttttgtactgaatgatgttgaattcagagaggtgacagaacttattaaagtg
gataaagtgaaaattgtagcctgtgatggtaaaaaacttactggccaagatgccaaccca
tggtcacatgatgaggttcacagatcagaacctttcaaagctgaatggaacctcaacgat
gatctaatgacttcaacctcctctttcagtgcacaggaaaattaa

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_8|287_aa
MSYSYLICYVNIQKMHKVTKKKALCSVMTKLQWKYPEVLDFTRVKGWGSGGGARFHPEHN
YQKRKSDGVGGQTAKYKQDPSQGRGPKQGGDPGSTSRAGSMAPPTKGRFPVQLRSHRAEF
SQAALTPEEALAPSSSSPLQAGDVGLTARSQKAGPSGARSPRSAVETPAQSPGRSGTASA
AAADPDTDYSTIHWINCVDPTKDLSSGCYMASKRVEEKRMLSVGAKTKLEILVHICINML
SGLGSPSYRRPLHHPEHRLLKYFWPDLGCIPQLQPHPLQGSEQIGLI

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_8|864_bp
atgtcctatagctatcttatctgctacgtcaacattcaaaagatgcacaaagttacaaag
aaaaaagccttatgttccgtaatgacaaaattacaatggaaatatcctgaagtgttagat
ttcacaagggtaaaagggtgggggagcggaggaggggcaaggttccatcccgagcacaac
taccagaaacggaaaagcgacggggtggggggccaaactgccaaatacaaacaggaccca
agccaagggcggggacctaagcagggcggggatccaggaagcacctcccgagcaggttct
atggctccccctaccaagggccggttcccagtccagctccggagccaccgtgccgagttc
tcccaggccgcactcaccccggaggaagccttggccccctcgtcctcttcgcccctccag
gccggcgacgtggggctgacggccaggtcgcaaaaagcagggccgagcggagcccgctcc
cctcggtcggcggtggagaccccggcccaatcccccggccgcagcggtacggcgtcggcg
gcagcagctgacccggacacagactatagtaccattcactggatcaactgtgttgatcct
acaaaggatttaagcagcggttgctacatggcatcaaagagagtggaggagaaaaggatg
ctgtcagttggtgccaaaaccaaactggaaatcctggtgcatatttgcataaatatgctg
tctgggcttggaagcccttcttaccgcagacctcttcatcaccctgaacatcgccttttg
aaatatttctggccagatctagggtgtattccacagctgcagccccatccattgcaaggt
tctgagcagattggcttgatatga

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_9|205_aa
MSERHLPALVLAAKALACDPQLSSLTMRAELTRHFKSCYQLNGSLVKKSTSSFSLSILII
ADGLSIISSWTLPVKYTDKNRTTPGAPFLGNDANCFKLDSGLSRAGERSWASAVEGLSSL
QLSVEADDEASTGIAAVVTLYGDMCLGTNINIFSCFKGFSKGKSPVMMTSLRLEYRFLRE
IFKFCIFTTVAFNKHASGIGGIMKM

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_9|618_bp
atgtcagaaagacacctgccagcacttgtcctggcagctaaggcattggcctgtgaccca
cagctgagctccctgaccatgcgagctgagctgacacggcatttcaagtcctgttaccag
ctcaacggttctcttgtaaaaaaaagcacctcttcatttagcctctccatcttgatcatt
gctgatgggctatcaataatttcaagttggaccttgcctgtgaaatacacagataaaaat
cgtaccacacctggtgccccctttttggggaatgatgcaaattgtttcaaattggacagt
ggactctccagggcaggggaaagatcttgggcaagtgctgtagaaggtctttcttccctg
cagctctctgttgaggctgacgatgaagcaagcactggaattgcagccgttgttacactg
tatggcgatatgtgtttgggtaccaacatcaacattttttcatgctttaaaggtttttct
aaagggaaatcgccagtcatgatgacctcactacgactagaatacaggtttttaagggaa
atttttaaattctgtatttttactactgtggctttcaataaacatgcttctggcataggt
ggcatcatgaaaatgtaa

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_10|51_aa
MPKGKLLWLKQGEGLQLTPIMLKATDHLFLASPTEASICQFPTYRAAKMPA

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_10|156_bp
atgcctaaaggaaagctgctctggctgaagcagggtgaagggctgcagctgacacctatc
atgttgaaggctactgatcatcttttcctagcctcaccaacagaagcatcaatatgccaa
tttccaacctaccgtgcagcaaagatgccggcctga

>gi568815583r:59542136_59752277|GENSCAN_predicted_peptide_11|139_aa
XSECQVDSPGNPTDDTAVEKPGQEAKRLTLQARGEAPSGCICGALLKSAGNWAPQLWLAQ
EASLRRPEVALKRCSIHSACHTLADTATLRDIPLKEGQAQPCGRALSHLSLPSPPPTNLD
SERNGSLHPWTVSTCICID

>gi568815583r:59542136_59752277|GENSCAN_predicted_CDS_11|420_bp
nngtctgagtgccaagtggactcccctgggaatccaacggatgacacagctgtagagaag
ccagggcaggaagcaaagaggctcacgctgcaggccagaggcgaagccccatcaggctgc
atctgtggagctttgttgaagtctgcagggaactgggcaccacagctgtggctggcacaa
gaggcaagtctgagaagacctgaagtggcgctcaagaggtgttcaatacacagtgcatgt
cacacgttggcagatacagcaactttaagagacatccctctgaaagaaggacaagctcaa
ccctgtggcagagctctgtcccacctcagcctccccagcccaccccccaccaacctcgac
tcagagcgcaatggctctttgcatccttggacagtttctacttgcatctgcatcgattaa