GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:10:55 Sequence gi568815576r:36041203_36245601 : 204399 bp : 46.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 19395 19446 52 2 1 44 86 51 0.429 1.82 1.02 Term + 42051 42343 293 1 2 114 45 148 0.833 8.61 1.03 PlyA + 42734 42739 6 1.05 2.00 Prom + 49627 49666 40 -2.46 2.01 Init + 56200 56210 11 0 2 68 94 15 0.821 -0.80 2.02 Intr + 56338 56364 27 1 0 123 103 39 0.794 6.13 2.03 Intr + 63508 63557 50 1 2 87 71 60 0.644 2.52 2.04 Intr + 82912 83094 183 2 0 53 53 156 0.435 8.36 2.05 Intr + 92785 92839 55 2 1 118 27 44 0.089 -0.66 2.06 Term + 94832 94991 160 2 1 107 43 92 0.219 4.01 2.07 PlyA + 94996 95001 6 1.05 3.04 PlyA - 96283 96278 6 1.05 3.03 Term - 100856 99998 859 1 1 87 46 569 0.999 44.83 3.02 Intr - 104397 104271 127 1 1 46 91 134 0.709 9.24 3.01 Init - 107853 107844 10 0 1 114 96 -2 0.763 3.65 3.00 Prom - 112640 112601 40 -3.76 4.06 PlyA - 113333 113328 6 1.05 4.05 Term - 118478 118360 119 0 2 119 48 85 0.961 6.30 4.04 Intr - 119587 119467 121 1 1 96 111 25 0.879 5.67 4.03 Intr - 125555 125479 77 0 2 62 33 56 0.139 -3.27 4.02 Intr - 129287 129135 153 0 0 74 -4 110 0.249 0.44 4.01 Init - 139300 138922 379 1 1 74 55 250 0.664 17.37 4.00 Prom - 139657 139618 40 -7.66 5.00 Prom + 143007 143046 40 -4.96 5.01 Init + 145231 145321 91 0 1 72 103 65 0.650 6.38 5.02 Term + 146863 147086 224 2 2 56 54 126 0.912 3.08 5.03 PlyA + 148861 148866 6 1.05 6.05 PlyA - 148976 148971 6 1.05 6.04 Term - 150710 149873 838 1 1 87 47 507 0.999 38.94 6.03 Intr - 154235 154109 127 0 1 78 91 143 0.997 13.34 6.02 Intr - 158174 158128 47 1 2 81 121 0 0.157 0.65 6.01 Init - 160834 160791 44 1 2 100 93 37 0.495 5.52 6.00 Prom - 162518 162479 40 -4.06 7.00 Prom + 173371 173410 40 -6.16 7.01 Init + 181996 182155 160 0 1 71 72 94 0.614 4.90 7.02 Intr + 183637 183728 92 2 2 88 47 35 0.788 -0.89 7.03 Intr + 184809 185009 201 2 0 60 97 163 0.858 13.88 7.04 Term + 185263 185613 351 0 0 62 37 68 0.427 -6.41 7.05 PlyA + 185813 185818 6 1.05 8.07 PlyA - 186152 186147 6 -5.32 8.06 Term - 187078 186202 877 0 1 94 46 574 0.999 45.85 8.05 Intr - 190264 190138 127 2 1 90 115 112 0.999 13.84 8.04 Intr - 192039 191951 89 2 2 70 96 58 0.820 4.41 8.03 Intr - 192253 192200 54 0 0 121 105 1 0.700 3.19 8.02 Intr - 198033 197934 100 1 1 23 100 37 0.301 -2.43 8.01 Intr - 198359 198239 121 2 1 79 111 25 0.406 3.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 175497 175431 67 0 1 73 99 73 0.819 6.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:36041203_36245601|GENSCAN_predicted_peptide_1|114_aa MRMQRHKNDTMDFEDSEAISPAIRMPIPCCDDPGPDLRCPDHKISGEGGLLELGGPGHSS ISRGHSVAGTIICVTFFLSETRKSYLVVVETLKIVEKPNEQLEIIHEHCQDSSS >gi568815576r:36041203_36245601|GENSCAN_predicted_CDS_1|345_bp atgaggatgcaaaggcataagaatgatacaatggactttgaagactcggaggccatctct ccagccatacgcatgcccatcccctgctgcgatgaccctgggccagacctgcgatgccct gatcacaagatatcaggggaaggtggcttgttggagttgggtggtccagggcactcaagt atctcaagaggccactctgtagcaggcaccattatttgtgtaacattttttctctcggag acacgcaagtcatacctggttgttgtagaaactttgaaaattgtagaaaaacctaatgaa caattggaaatcatccatgagcactgccaagacagcagtagctag >gi568815576r:36041203_36245601|GENSCAN_predicted_peptide_2|161_aa MLIRYLAIPVYRMGLLIGEPKLSQTVIYASPGLRLRSADSTMRVPTAPPPGSAEAVGNTQ GSQCARFSTAALKIQHEAWLGELEWLTLWSDSQCSSQYGLESELQNGKESGLLWNASQGR FLKSNGAHLTVQAAWKELHPRTRICIVSWVTTKELAGISTF >gi568815576r:36041203_36245601|GENSCAN_predicted_CDS_2|486_bp atgttgatcaggtatttggctattccagtataccgcatgggtttgcttattggagaaccc aagctaagccaaacggtcatctatgcaagtcccgggctgcggctgcgcagcgccgacagc acgatgcgtgtgccgacagcgccacctcctggctctgccgaggctgttggcaacacgcag ggctcgcagtgcgcccgcttttctactgcagccctcaaaattcagcacgaggcttggcta ggggagcttgagtggctgacactgtggtctgacagccagtgttcaagtcagtatggcctt gagagtgaactgcagaatggaaaggagtctggcctgctgtggaacgcctctcaaggacgg ttcctgaagagcaatggagcacacctcacagtccaggctgcctggaaggagctgcatcct cggaccaggatctgcattgtttcctgggtaacaactaaggaattagctggaatttcaaca ttttag >gi568815576r:36041203_36245601|GENSCAN_predicted_peptide_3|331_aa MDSEKKRFTEEATKYFRERVSPVHLQILLTNNEAWKRFVTAAELPRDEADALYEALKKLR TYAAIEDEYVQQKDEQFREWFLKEFPQVKRKIQESIEKLRALANGIEEVHRGCTISNVVS SSTGAASGIMSLAGLVLAPFTAGTSLALTAAGVGLGAASAVTGITTSIVEHSYTSSAEAE ASRLTATSIDRLKVFKEVMRDITPNLLSLLNNYYEATQTIGSEIRAIRQARARARLPVTT WRISAGSGGQAERTIAGTTRAVSRGARILSATTSGIFLALDVVNLVYESKHLHEGAKSAS AEELRRQAQELEENLMELTQIYQRLNPCHTH >gi568815576r:36041203_36245601|GENSCAN_predicted_CDS_3|996_bp atggactcagaaaagaaacgctttactgaagaggccaccaaatacttccgggagagagtc agcccagtgcatctgcaaatcctgctgactaacaatgaagcctggaagagattcgtgact gcggctgaattgcccagggatgaggcagatgctctctacgaagctctgaagaagcttaga acatatgcagctattgaggacgaatatgtgcagcagaaagatgagcagtttagggaatgg tttttgaaagagtttccccaagtcaagaggaagatccaggagtccatagaaaagcttcgt gcccttgcaaatggtattgaagaggtccacagaggctgcaccatctccaatgtggtgtcc agctccactggcgctgcctctggcatcatgtcccttgctggtcttgttttggcaccattt acagcagggacgagtctggcccttactgcagctggggtagggctgggagcagcgtctgct gtgactgggatcaccaccagcatcgtggagcactcatacacatcatcagcagaagctgaa gccagcaggctgactgcaaccagcattgaccgattgaaggtatttaaggaagttatgcgt gacatcacacccaacttactttcccttcttaataattattacgaagccacacaaaccatt gggagtgaaatccgtgccatcaggcaagccagagccagggcccgactccctgtgaccacc tggcgaatctcagctggaagtggtggtcaagcagagagaacgattgcaggcaccacccgg gcagtgagcagaggagcccggatcctgagtgcgaccacttcaggcatcttccttgcactg gatgtggtcaaccttgtatacgagtcaaagcacttgcatgagggggcaaagtctgcatct gctgaggagctgaggcggcaggctcaggagctggaggagaatctaatggagctcactcag atctatcagcgtctgaatccatgccatacccactga >gi568815576r:36041203_36245601|GENSCAN_predicted_peptide_4|282_aa MWLPYKSDHQGAAMRWAKAKTVGFPAHCADNLAAVQNPGTCSWLRLERFPPHRVTYIEPY NLCGTGESILLAPPPQSGGKRYDLTCVDTTMRLLQAFSVKRATQLETMKRLTALSVTNGM AKRTDRDIRSPEVLKRHIHLTWYERATEGYTGVECLVGMSCQTAIIQASPENGPMFLGVW GGGHEAAGTFAQDETVPSERVMLGISQSLENVSGYYADARLEVGSTQLRTAGSCSHSFKR SFLGLWVIEKKEAVLFPSGKDGPDRGSENGMLQNGVCHMIPA >gi568815576r:36041203_36245601|GENSCAN_predicted_CDS_4|849_bp atgtggctaccttataagagcgaccaccagggggcagccatgaggtgggccaaagcaaag actgtaggcttccctgcccactgtgcagataatctagcagctgttcagaaccccgggacc tgttcatggctgcgactagaaaggttccctccacaccgggtcacatacatcgagccatac aacctgtgcgggactggcgagtcgattttattggcccccccgccccagagcggagggaaa cggtatgacttaacctgtgtggatacaacaatgaggctactacaggccttcagtgtaaaa cgtgccacccaactggagaccatgaagcgtctcaccgctcttagtgtcacgaatggcatg gctaaaaggacagatagagatattaggtcccctgaagtcttgaagagacatattcatctt acttggtacgagagggccactgaaggctatactggagtggagtgcctagtgggcatgtcc tgtcagacagcaataattcaagcatcccctgagaatggccctatgttccttggtgtatgg ggtggtggccacgaggctgcagggacatttgcccaggatgaaacagttcccagtgaaagg gtcatgttgggtatatctcagagcctggagaacgtgtctggttattatgcagatgcacgg ctggaggtgggatccacacagctcagaacagctggatcttgctcacactctttcaagaga agcttccttggcctctgggtgatagagaagaaagaagctgtgctgttccctagtgggaaa gatggcccagacaggggcagtgagaacggcatgttgcagaatggtgtctgccacatgatc ccagcttga >gi568815576r:36041203_36245601|GENSCAN_predicted_peptide_5|104_aa MQKRAMVCVALFGEEASAFVFVVRMAGFITGELRPQQPTLDSMKALGFCAKSNRKPLKTW KPVFDEICVFMGQLGPMWRKSWPSGKGCVSAKTILEAFAGVWAK >gi568815576r:36041203_36245601|GENSCAN_predicted_CDS_5|315_bp atgcagaagagggcgatggtgtgtgtggctctctttggagaagaagcaagtgcatttgtc tttgtcgttcggatggctggattcataacaggggaactcaggccacagcagcctacatta gacagcatgaaggctctgggcttttgtgctaagagtaataggaagccattgaagacatgg aagccggtgtttgatgaaatctgtgtttttatgggtcaacttggcccaatgtggaggaag agctggcctagcggcaagggctgtgtatcagcgaagaccattttggaagcttttgcagga gtctgggcgaagtga >gi568815576r:36041203_36245601|GENSCAN_predicted_peptide_6|351_aa MEGAALLKIFVVCIWVQQNHPGWTVAGQFQEKKRFTEEVIEYFQKKVSPVHLKILLTSDE AWKRFVRVAELPREEADALYEALKNLTPYVAIEDKDMQQKEQQFREWFLKEFPQIRWKIQ ESIERLRVIANEIEKVHRGCVIANVVSGSTGILSVIGVMLAPFTAGLSLSITAAGVGLGI ASATAGIASSIVENTYTRSAELTASRLTATSTDQLEALRDILRDITPNVLSFALDFDEAT KMIANDVHTLRRSKATVGRPLIAWRYVPINVVETLRTRGAPTRIVRKVARNLGKATSGVL VVLDVVNLVQDSLDLHKGAKSESAESLRQWAQELEENLNELTHIHQSLKAG >gi568815576r:36041203_36245601|GENSCAN_predicted_CDS_6|1056_bp atggagggagctgctttgctgaaaatctttgtcgtctgcatctgggtgcagcaaaaccat ccaggctggacagtggctggacagttccaagaaaagaaacgcttcactgaagaagtcatt gaatacttccagaagaaagttagcccagtgcatctgaaaatcctgctgactagcgatgaa gcctggaagagatttgtgcgtgtggctgaattgcccagggaagaggcagatgctctctat gaagctctgaagaatcttacaccatatgtggctattgaggacaaagacatgcagcaaaaa gaacagcagtttagggagtggtttttgaaagagtttcctcaaatcagatggaagattcag gagtccatagaaaggcttcgtgtcattgcaaatgagattgaaaaggtccacagaggctgc gtcatcgccaatgtggtgtctggctccactggcatcctgtctgtcattggcgttatgttg gcaccatttacagcagggctgagcctgagcattactgcagctggggtagggctgggaata gcatctgccacggctgggatcgcctccagcatcgtggagaacacatacacaaggtcagca gaactcacagccagcaggctgactgcaaccagcactgaccaattggaggcattaagggac attctgcgtgacatcacacccaatgtgctttcttttgcacttgattttgacgaagccaca aaaatgattgcgaatgatgtccatacactcaggagatctaaagccactgttggacgccct ttgattgcttggcgatatgtacctataaatgttgttgagacactgagaacacgtggggcc cccacccggatagtgagaaaagtagcccggaacctgggcaaggccacttcaggtgtcctt gttgtgctggatgtagtcaaccttgtgcaagactcactggacttgcacaagggggcaaaa tccgagtctgctgagtcgctgaggcagtgggctcaggagctggaggagaatctcaatgag ctcacccatatccatcagagtctaaaagcaggctag >gi568815576r:36041203_36245601|GENSCAN_predicted_peptide_7|267_aa MVCSSSRLGCQLCPGLDEPPSSKCWPLRCDAKNSEMTWYDKEEGGLNVHRRLEGPQATTG YFRQHHGLHLCAKSNGKPLEGWKQHFKFPWNFIGPSVSPGLEEHRPLAKAPTTLLPSQPP SPPLREKLETDGPQSLEGAERPKNDSNKSSLGSSRGHAPFSNLEGTRHMTTTPTTIMKKP LADCDTPPEGQGRVDAGKTERETITRERLYSWKDIKPGGGSVVELLFLLLYPTVLTLVFS IFSSFFSWPSHCWFLQAPPLFFAQYIL >gi568815576r:36041203_36245601|GENSCAN_predicted_CDS_7|804_bp atggtttgctccagcagcaggctgggatgccagctgtgcccagggctggatgagccaccc tcaagtaaatgctggccacttcgttgtgacgccaaaaattcagaaatgacttggtatgac aaagaagaaggaggactcaacgtgcataggagattggaaggaccccaggccacaacaggc tactttaggcagcatcatggcctacatctttgtgctaagagtaatgggaagccattagag ggttggaagcagcacttcaagttcccctggaacttcatcggtccatcggtgtcccctggt ctggaagagcaccgaccacttgccaaggcccccaccactctgctgccatcacaaccacca tcaccaccattacgagaaaaacttgagactgatggcccccaaagcttagaaggagctgag agaccaaagaatgactcgaacaagtccagcttgggttcaagcaggggacatgcacccttt agtaacctggaggggacccgtcacatgacaaccaccccaaccaccatcatgaagaagcca ctggctgactgtgatacacccccagaaggacaagggagagtggatgctggaaagacagag cgagagaccatcaccagggaaagactttattcttggaaggacatcaaacctggggggggg tcggtagtggagctgctgtttcttctcctgtatccaacagttctaactctggttttctcc attttcagctctttcttttcctggccttctcattgctggttcctgcaagctccccctcta ttcttcgcccaatatattctttag >gi568815576r:36041203_36245601|GENSCAN_predicted_peptide_8|455_aa GISGRPEDVSGYYTDAQLDVGSTQLRTVGSCSVSVRGRSLERGGENSLDGFTKLPGLSSG SDTLRHGQLTPEERTRGPCLGVRVREEEAGTRVKENLPVWTVTGELQGKPLGNPAAGTMN PESSIFIEDYLKYFQDQVSRENLLQLLTDDEAWNGFVAAAELPRDEADELRKALNKLASH MVMKDKNRHDKDQQHRQWFLKEFPRLKRELEDHIRKLRALAEEVEQVHRGTTIANVVSNS VGTTSGILTLLGLGLAPFTEGISFVLLDTGMGLGAAAAVAGITCSVVELVNKLRARAQAR NLDQSGTNVAKVMKEFVGGNTPNVLTLVDNWYQVTQGIGRNIRAIRRARANPQLGAYAPP PHVIGRISAEGGEQVERVVEGPAQAMSRGTMIVGAATGGILLLLDVVSLAYESKHLLEGA KSESAEELKKRAQELEGKLNFLTKIHEMLQPGQDQ >gi568815576r:36041203_36245601|GENSCAN_predicted_CDS_8|1368_bp ggtatatctgggaggccggaggacgtgtctggttattacacagatgcacagctggacgtg ggatccacacagctcagaacagttggatcttgctcagtctctgtcagaggaagatccctt gagagaggtggggaaaacagcttagatgggttcacaaagctccccgggctcagctctggt tctgacaccctcagacacggccagctgaccccagaagaaaggacaagaggaccctgcctt ggtgtgagagtgagggaagaggaagctggaacgagggttaaggaaaaccttccagtctgg acagtgactggagagctccaaggaaagcccctcggtaacccagccgctggcaccatgaac ccagagagcagtatctttattgaggattaccttaagtatttccaggaccaagtgagcaga gagaatctgctacaactgctgactgatgatgaagcctggaatggattcgtggctgctgct gaactgcccagggatgaggcagatgagctccgtaaagctctgaacaagcttgcaagtcac atggtcatgaaggacaaaaaccgccacgataaagaccagcagcacaggcagtggtttttg aaagagtttcctcggttgaaaagggagcttgaggatcacataaggaagctccgtgccctt gcagaggaggttgagcaggtccacagaggcaccaccattgccaatgtggtgtccaactct gttggcactacctctggcatcctgaccctcctcggcctgggtctggcacccttcacagaa ggaatcagttttgtgctcttggacactggcatgggtctgggagcagcagctgctgtggct gggattacctgcagtgtggtagaactagtaaacaaattgcgggcacgagcccaagcccgc aacttggaccaaagcggcaccaatgtagcaaaggtgatgaaggagtttgtgggtgggaac acacccaatgttcttaccttagttgacaattggtaccaagtcacacaagggattgggagg aacatccgtgccatcagacgagccagagccaaccctcagttaggagcgtatgccccaccc ccgcatgtcattgggcgaatctcagctgaaggcggtgaacaggttgagagggttgttgaa ggccccgcccaggcaatgagcagaggaaccatgatcgtgggtgcagccactggaggcatc ttgcttctgctggatgtggtcagccttgcatatgagtcaaagcacttgcttgagggggca aagtcagagtcagctgaggagctgaagaagcgggctcaggagctggaggggaagctcaac tttctcaccaagatccatgagatgctgcagccaggccaagaccaatga