GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:45:26 Sequence gi568815583r:59564072_59782457 : 218386 bp : 41.00% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 180 175 6 1.05 1.02 Term - 5271 5116 156 0 0 64 42 104 0.378 0.35 1.01 Init - 11306 11187 120 2 0 21 110 71 0.273 2.84 1.00 Prom - 15696 15657 40 -3.65 2.00 Prom + 27797 27836 40 -4.35 2.01 Init + 32042 32178 137 1 2 76 79 93 0.915 6.86 2.02 Intr + 32247 32283 37 0 1 85 85 50 0.616 1.75 2.03 Term + 32328 32381 54 2 0 113 49 15 0.482 -3.32 2.04 PlyA + 34753 34758 6 1.05 3.00 Prom + 35239 35278 40 -2.95 3.01 Init + 45830 45882 53 1 2 92 42 103 0.591 6.78 3.02 Intr + 49917 50034 118 0 1 43 100 50 0.359 1.25 3.03 Term + 50146 50328 183 0 0 44 48 142 0.376 2.36 3.04 PlyA + 50863 50868 6 1.05 4.00 Prom + 53175 53214 40 -6.95 4.01 Sngl + 54168 55484 1317 2 0 80 53 1017 0.999 91.21 4.02 PlyA + 55916 55921 6 1.05 5.03 PlyA - 57941 57936 6 1.05 5.02 Term - 64162 64077 86 2 2 55 41 115 0.421 0.34 5.01 Init - 69107 69029 79 2 1 56 100 72 0.343 6.47 5.00 Prom - 71170 71131 40 -5.65 6.10 PlyA - 71210 71205 6 1.05 6.09 Term - 71437 71307 131 2 2 14 41 153 0.439 0.56 6.08 Intr - 78171 78065 107 2 2 19 106 70 0.048 0.84 6.07 Intr - 93245 93022 224 2 2 -20 76 259 0.209 9.90 6.06 Intr - 96966 96894 73 2 1 66 95 73 0.973 4.29 6.05 Intr - 100049 100002 48 1 0 59 98 88 0.756 3.78 6.04 Intr - 108668 108566 103 0 1 107 97 58 0.970 6.91 6.03 Intr - 114016 113840 177 2 0 63 78 166 0.996 12.07 6.02 Intr - 115697 115521 177 0 0 87 94 161 0.998 15.57 6.01 Init - 116254 116170 85 1 1 36 99 123 0.567 9.23 6.00 Prom - 116886 116847 40 -6.55 7.00 Prom + 118312 118351 40 -8.25 7.01 Init + 119918 120045 128 1 2 68 92 52 0.188 3.28 7.02 Intr + 124750 124940 191 1 2 32 55 132 0.231 2.61 7.03 Intr + 125006 125247 242 0 2 31 42 321 0.387 18.15 7.04 Intr + 137786 137830 45 1 0 99 87 34 0.427 2.09 7.05 Intr + 142046 142235 190 1 1 54 53 155 0.639 6.84 7.06 Term + 146105 146172 68 0 2 101 44 47 0.601 -1.28 7.07 PlyA + 146777 146782 6 1.05 8.00 Prom + 151290 151329 40 -7.65 8.01 Init + 152659 152718 60 0 0 47 100 39 0.419 1.88 8.02 Intr + 158584 158733 150 0 0 104 80 81 0.628 8.34 8.03 Intr + 161358 161458 101 2 2 40 80 67 0.493 -0.91 8.04 Term + 164380 164686 307 1 1 90 39 125 0.495 1.40 8.05 PlyA + 164793 164798 6 1.05 9.00 Prom + 165485 165524 40 -1.55 9.01 Init + 167639 167686 48 1 0 61 82 50 0.530 2.93 9.02 Term + 171999 172106 108 2 0 109 54 48 0.522 1.03 9.03 PlyA + 173542 173547 6 1.05 10.04 PlyA - 175638 175633 6 1.05 10.03 Term - 178000 177827 174 1 0 57 40 75 0.083 -3.62 10.02 Intr - 182954 182711 244 1 1 74 103 197 0.974 16.38 10.01 Init - 193929 193739 191 0 2 46 98 109 0.964 6.23 10.00 Prom - 196832 196793 40 -6.95 11.00 Prom + 202609 202648 40 -6.45 11.01 Sngl + 204281 205075 795 1 0 105 39 1030 0.998 95.42 11.02 PlyA + 205099 205104 6 1.05 12.03 PlyA - 205364 205359 6 1.05 12.02 Term - 214930 214876 55 0 1 124 43 61 0.378 1.25 12.01 Intr - 217424 217193 232 0 1 46 72 150 0.194 5.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_1|91_aa MVHRVSEAQRPRKESLDPVEGPAKEARKSHLNACPISKDRVFLKLQDFSGGIMLPTQRAF RAHFLSRSNRHELYRRALKYLQLSDKEWASP >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_1|276_bp atggtccacagagtttctgaagcacaaaggccgagaaaggaatccctggacccagttgag ggtccagcaaaagaggcaaggaagagccatttaaatgcgtgccccatttccaaagacagg gttttcctcaagttacaggatttcagtggtggcattatgctccctacccaaagggccttc agggcccattttctgagtcgaagtaacagacatgaactctacagaagagctttgaagtac ttgcaactgtccgataaagaatgggcttccccctaa >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_2|75_aa MGSSPQPGQTPAAVPEPPGQGYRRKHCSKTKLISDELELYYMTAGRERREEAHDWLLQKG RLSAQAQHKVSVIHA >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_2|228_bp atgggctctagtccacaaccaggccagactccagctgctgttccagagccccctgggcag ggctatcgcaggaaacactgctcaaagacaaagctcatctctgatgagctggaattatat tatatgacagccggaagggagagaagagaagaggcccatgactggctcctacagaaaggt agactgtctgcccaggcccagcacaaggtgtcagtgatccatgcctga >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_3|117_aa MAVAALCFRGAEQQQGQLLGDSETLSLKNTYTNKSPQLFKVLSLTEGIYNAVFMGMKRFY TQQWKVTENPSTAENSLWSRAHVGVQMTLLPERGPDPDPNRGFLDLAQERIQGESVK >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_3|354_bp atggccgtggctgccctttgcttccgtggtgctgagcagcagcaggggcagctcctgggt gacagcgagaccctgtctctgaaaaacacatacacaaacaaaagccctcaactattcaag gtgttgtctcttactgagggaatctataatgcagtttttatgggaatgaagagattttat actcaacaatggaaagtcactgagaatcccagtaccgcagaaaacagcctgtggagcagg gcccacgtaggggtgcagatgaccctgttaccagaaaggggtcccgatccagaccccaac agagggttcttggatcttgcgcaagaaagaattcagggcgagtccgtaaagtga >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_4|438_aa MVQWKRLCQLHYLWALGCYMLLATVALKLSFRLKCDSDHLGLESRESQSQYCRNILYNFL KLPAKRSINCSGVTRGDQEAVLQAILNNLEVKKKREPFTDTHYLSLTRDCEHFKAERKFI QFPLSKEEVEFPIAYSMVIHEKIENFERLLRAVYAPQNIYCVHVDEKSPETFKEAVKAII SCFPNVFIASKLVRVVYASWSRVQADLNCMEDLLQSSVPWKYFLNTCGTDFPIKSNAEMV QALKMLNGRNSMESEVPPKHKETRWKYHFEVVRDTLHLTNKKKDPPPYNLTMFTGNAYIV ASRDFVQHVLKNPKSQQLIEWVKDTYSPDEHLWATLQRARWMPGSVPNHPKYDISDMTSI ARLVKWQGHEGDIDKGAPYAPCSGIHQRAICVYGAGDLNWMLQNHHLLANKFDPKVDDNA LQCLEEYLRYKAIYGTEL >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_4|1317_bp atggttcaatggaagagactctgccagctgcattacttgtgggctctgggctgctatatg ctgctggccactgtggctctgaaactttctttcaggttgaagtgtgactctgaccacttg ggtctggagtccagggaatctcaaagccagtactgtaggaatatcttgtataatttcctg aaacttccagcaaagaggtctatcaactgttcaggggtcacccgaggggaccaagaggca gtgcttcaggctattctgaataacctggaggtcaagaagaagcgagagcctttcacagac acccactacctctccctcaccagagactgtgagcacttcaaggctgaaaggaagttcata cagttcccactgagcaaagaagaggtggagttccctattgcatactctatggtgattcat gagaagattgaaaactttgaaaggctactgcgagctgtgtatgcccctcagaacatatac tgtgtccatgtggatgagaagtccccagaaactttcaaagaggcggtcaaagcaattatt tcttgcttcccaaatgtcttcatagccagtaagctggttcgggtggtttatgcctcctgg tccagggtgcaagctgacctcaactgcatggaagacttgctccagagctcagtgccgtgg aaatacttcctgaatacatgtgggacggactttcctataaagagcaatgcagagatggtc caggctctcaagatgttgaatgggaggaatagcatggagtcagaggtacctcctaagcac aaagaaacccgctggaaatatcactttgaggtagtgagagacacattacacctaaccaac aagaagaaggatcctcccccttataatttaactatgtttacagggaatgcgtacattgtg gcttcccgagatttcgtccaacatgttttgaagaaccctaaatcccaacaactgattgaa tgggtaaaagacacttatagcccagatgaacacctctgggccacccttcagcgtgcacgg tggatgcctggctctgttcccaaccaccccaagtacgacatctcagacatgacttctatt gccaggctggtcaagtggcagggtcatgagggagacatcgataagggtgctccttatgct ccctgctctggaatccaccagcgggctatctgcgtttatggggctggggacttgaattgg atgcttcaaaaccatcacctgttggccaacaagtttgacccaaaggtagatgataatgct cttcagtgcttagaagaatacctacgttataaggccatctatgggactgaactttga >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_5|54_aa MEAISKLQAQGKEIETEPGSLAELRKVPPPSATTTLDQSAAINIETRPTSAKRS >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_5|165_bp atggaggcaatatccaagctgcaggcacagggaaaggaaatagaaacagaacctggcagt cttgcggagttgaggaaagtgcccccaccttcagcaaccaccacccttgaccagtcagca gccatcaacattgagacaagacctacatcagcaaaaagatcatga >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_6|374_aa MPVSSRPLPEDDSIEADILAITGPEDQPGSLEVNGNKVRKKLMAPDISLTLDPSDGSVLS DDLDESGEIDLDGLDTPSENSNEFEWEDDLPKPKTTEVIRKGSITEYTAAEEKEDGRRWR MFRIGEQDHRVDMKAIEPYKKVISHGGYYGDGLNAIVVFAVCFMPESSQPNYRYLMDNLF KVDQELNGKQDEPKNEQEVPYVTADGNISCMQQENLQAIRKSLRALRSWDLPVTDRCRQG PAAGTGEGRRGGGDGRVPWREARRSRGAHRTHLGEVFSPPPYFQPGECLKASTSALFCDN VWTFVLNDVEFREVTELIKVDKVKIVACDGKKLTGQDANPWSHDEVHRSEPFKAEWNLND DLMTSTSSFSAQEN >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_6|1125_bp atgcctgtgtcttctagacctttaccagaagatgatagtattgaagcagatatactagct ataactggaccagaggaccagcctggctcactagaagttaatggaaataaagtgagaaag aaactaatggctccagacattagcctgacactggatcctagtgatggctctgtattgtca gatgatttggatgaaagtggggagattgacttagatggcttagacacaccgtcagagaat agtaatgagtttgagtgggaagatgatcttccaaaacccaagactactgaagtaattagg aaaggctcaattactgaatacacagcagcagaggaaaaagaagatggacgacgctggcgt atgttcaggattggagaacaggaccacagggttgatatgaaggcaattgaaccctataaa aaagttatcagccatgggggatattatggggatggattaaatgccattgttgtgtttgct gtctgtttcatgcctgaaagtagtcagcctaactatagatacctgatggacaatcttttt aaagttgatcaagaacttaatggaaaacaagatgaaccgaaaaatgaacaggaagtacca tatgtgactgctgatggtaacatttcttgtatgcagcaggaaaacctgcaggctatacgc aagtctctgagggcccttcgctcctgggatctgcccgttaccgaccgttgccggcaaggc cccgcggccggcaccggagaagggcggcgaggcggcggtgatggtcgcgtcccgtggcgg gaggctcgtcgttcacggggcgcccacagaacccacctaggggaggtcttcagcccacca ccgtattttcagccaggtgaatgtcttaaagcatccactagcgccctattctgcgataat gtgtggacttttgtactgaatgatgttgaattcagagaggtgacagaacttattaaagtg gataaagtgaaaattgtagcctgtgatggtaaaaaacttactggccaagatgccaaccca tggtcacatgatgaggttcacagatcagaacctttcaaagctgaatggaacctcaacgat gatctaatgacttcaacctcctctttcagtgcacaggaaaattaa >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_7|287_aa MSYSYLICYVNIQKMHKVTKKKALCSVMTKLQWKYPEVLDFTRVKGWGSGGGARFHPEHN YQKRKSDGVGGQTAKYKQDPSQGRGPKQGGDPGSTSRAGSMAPPTKGRFPVQLRSHRAEF SQAALTPEEALAPSSSSPLQAGDVGLTARSQKAGPSGARSPRSAVETPAQSPGRSGTASA AAADPDTDYSTIHWINCVDPTKDLSSGCYMASKRVEEKRMLSVGAKTKLEILVHICINML SGLGSPSYRRPLHHPEHRLLKYFWPDLGCIPQLQPHPLQGSEQIGLI >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_7|864_bp atgtcctatagctatcttatctgctacgtcaacattcaaaagatgcacaaagttacaaag aaaaaagccttatgttccgtaatgacaaaattacaatggaaatatcctgaagtgttagat ttcacaagggtaaaagggtgggggagcggaggaggggcaaggttccatcccgagcacaac taccagaaacggaaaagcgacggggtggggggccaaactgccaaatacaaacaggaccca agccaagggcggggacctaagcagggcggggatccaggaagcacctcccgagcaggttct atggctccccctaccaagggccggttcccagtccagctccggagccaccgtgccgagttc tcccaggccgcactcaccccggaggaagccttggccccctcgtcctcttcgcccctccag gccggcgacgtggggctgacggccaggtcgcaaaaagcagggccgagcggagcccgctcc cctcggtcggcggtggagaccccggcccaatcccccggccgcagcggtacggcgtcggcg gcagcagctgacccggacacagactatagtaccattcactggatcaactgtgttgatcct acaaaggatttaagcagcggttgctacatggcatcaaagagagtggaggagaaaaggatg ctgtcagttggtgccaaaaccaaactggaaatcctggtgcatatttgcataaatatgctg tctgggcttggaagcccttcttaccgcagacctcttcatcaccctgaacatcgccttttg aaatatttctggccagatctagggtgtattccacagctgcagccccatccattgcaaggt tctgagcagattggcttgatatga >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_8|205_aa MSERHLPALVLAAKALACDPQLSSLTMRAELTRHFKSCYQLNGSLVKKSTSSFSLSILII ADGLSIISSWTLPVKYTDKNRTTPGAPFLGNDANCFKLDSGLSRAGERSWASAVEGLSSL QLSVEADDEASTGIAAVVTLYGDMCLGTNINIFSCFKGFSKGKSPVMMTSLRLEYRFLRE IFKFCIFTTVAFNKHASGIGGIMKM >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_8|618_bp atgtcagaaagacacctgccagcacttgtcctggcagctaaggcattggcctgtgaccca cagctgagctccctgaccatgcgagctgagctgacacggcatttcaagtcctgttaccag ctcaacggttctcttgtaaaaaaaagcacctcttcatttagcctctccatcttgatcatt gctgatgggctatcaataatttcaagttggaccttgcctgtgaaatacacagataaaaat cgtaccacacctggtgccccctttttggggaatgatgcaaattgtttcaaattggacagt ggactctccagggcaggggaaagatcttgggcaagtgctgtagaaggtctttcttccctg cagctctctgttgaggctgacgatgaagcaagcactggaattgcagccgttgttacactg tatggcgatatgtgtttgggtaccaacatcaacattttttcatgctttaaaggtttttct aaagggaaatcgccagtcatgatgacctcactacgactagaatacaggtttttaagggaa atttttaaattctgtatttttactactgtggctttcaataaacatgcttctggcataggt ggcatcatgaaaatgtaa >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_9|51_aa MPKGKLLWLKQGEGLQLTPIMLKATDHLFLASPTEASICQFPTYRAAKMPA >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_9|156_bp atgcctaaaggaaagctgctctggctgaagcagggtgaagggctgcagctgacacctatc atgttgaaggctactgatcatcttttcctagcctcaccaacagaagcatcaatatgccaa tttccaacctaccgtgcagcaaagatgccggcctga >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_10|202_aa MSCYVPRRRCRVSVKATKNQHRDLSVCCYHRIPVLTTIAGLIERYSSYLIISQCAPIGPQ ASEGSECQVDSPGNPTDDTAVEKPGQEAKRLTLQARGEAPSGCICGALLKSAGNWAPQLW LAQEASLRRPEVALKRCSIHSACHTLADTATLRDIPLKEGQAQPCGRALSHLSLPSPPPT NLDSERNGSLHPWTVSTCICID >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_10|609_bp atgtcgtgctatgtgccaaggagaagatgcagagtatcagtgaaagccactaagaaccag cacagagatctttcagtttgctgctatcacagaatcccagtacttaccacaattgcagga cttattgaacggtattccagttacctgattattagtcagtgtgcccccataggaccccaa gcttctgaagggtctgagtgccaagtggactcccctgggaatccaacggatgacacagct gtagagaagccagggcaggaagcaaagaggctcacgctgcaggccagaggcgaagcccca tcaggctgcatctgtggagctttgttgaagtctgcagggaactgggcaccacagctgtgg ctggcacaagaggcaagtctgagaagacctgaagtggcgctcaagaggtgttcaatacac agtgcatgtcacacgttggcagatacagcaactttaagagacatccctctgaaagaagga caagctcaaccctgtggcagagctctgtcccacctcagcctccccagcccaccccccacc aacctcgactcagagcgcaatggctctttgcatccttggacagtttctacttgcatctgc atcgattaa >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_11|264_aa MAVGKNKRLTKGGKKGAKKKVVDPFSKKDWYDVKAPAMFNIRNIGKTLVTRTQGTKIASD GLKGRVFEVSLADLQNDEVAFRKFKLITEDVQGKNCLTNFHGMDLTRDKMCSMVKKWQTM IEAHVDVKTTDGYLLRLFCVGFTKKRKNQIRKTSYAQHQQVRQIRKKMMEIMIREVQTND LKEVVNKLIPDSIGKDIEKACQSIYPLHDVFVRKVKMLKKPKFELGKLMELHGEGSSSGK ATGDETGAKVERADGYEPPVQESV >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_11|795_bp atggctgttggcaagaacaagcgccttacgaaaggcggcaaaaagggagccaagaagaaa gtggttgatccattttctaagaaagattggtatgatgtgaaagcacctgctatgttcaat ataagaaatattggaaagacgctcgtcaccaggacccaaggaaccaaaattgcgtctgat ggtctcaagggtcgtgtgtttgaagtgagtcttgctgatttgcagaatgatgaagttgca tttagaaaattcaagctgattactgaagatgttcagggtaaaaactgcctgactaacttc catggcatggatcttacccgtgacaaaatgtgttccatggtcaaaaaatggcagacaatg attgaagctcacgttgatgtcaagactaccgatggttacttgcttcgtctgttctgtgtt ggttttactaaaaaacgcaaaaatcagatacggaagacctcttatgctcagcaccaacag gtccgccaaatccggaagaagatgatggaaatcatgatccgagaggtgcagacaaatgac ttgaaagaagtggtcaataaattgattccagacagcattggaaaagacatagaaaaggct tgccaatctatttatcctctccatgatgtcttcgttagaaaagtaaaaatgctgaagaag cccaagtttgaattgggaaagctcatggagcttcatggtgaaggcagtagttctggaaaa gccactggggacgagacaggtgctaaagttgaacgagctgatggatatgaaccaccagtc caagaatctgtttaa >gi568815583r:59564072_59782457|GENSCAN_predicted_peptide_12|95_aa XIQTTIREYYKRLYANKLENLEEMDKFLDTETLPRLNQEEVESLNRPITGSEIEAIINSL PTKKSPGPDGFTAKFYQRTGGDDLPASVPVQGLGP >gi568815583r:59564072_59782457|GENSCAN_predicted_CDS_12|288_bp naaatacaaactaccatcagagaatactataaacgcctctatgcaaataaactagaaaat ctagaagaaatggataaattcctagacacagagaccctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggctctgaaattgaggcaataattaatagctta ccaaccaaaaaaagtccaggaccagatggattcacagccaaattctaccagagaacaggt ggagatgatcttccggcatcagtcccagtccaaggtttgggcccctga