GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:30:16 Sequence gi568815584r:64976479_65202339 : 225861 bp : 44.40% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 347 473 127 2 1 76 29 120 0.427 5.58 1.02 Intr + 10395 10619 225 2 0 71 94 136 0.453 10.68 1.03 Intr + 13542 13631 90 2 0 -23 99 104 0.141 0.59 1.04 Intr + 27771 27835 65 2 2 80 121 77 0.420 7.72 1.05 Intr + 32641 32659 19 1 1 75 105 -21 0.262 -5.09 1.06 Intr + 35138 35319 182 1 2 101 29 177 0.307 11.77 1.07 Intr + 35839 35911 73 1 1 68 82 46 0.565 1.41 1.08 Intr + 39147 39238 92 2 2 97 70 70 0.696 4.89 1.09 Intr + 50975 51121 147 2 0 69 92 192 0.998 17.05 1.10 Intr + 51220 51303 84 1 0 111 105 61 0.989 9.04 1.11 Intr + 56132 56218 87 2 0 102 100 100 0.992 11.69 1.12 Intr + 64312 64441 130 1 1 105 83 86 0.977 10.50 1.13 Intr + 67833 67965 133 2 1 88 105 113 0.999 13.22 1.14 Intr + 76760 76871 112 0 1 105 89 102 0.987 11.44 1.15 Intr + 78097 78211 115 1 1 92 99 103 0.995 12.25 1.16 Term + 84703 84834 132 0 0 117 44 128 0.959 9.39 1.17 PlyA + 85834 85839 6 1.05 2.00 Prom + 87579 87618 40 -5.86 2.01 Init + 88790 88854 65 1 2 97 82 -1 0.513 0.34 2.02 Intr + 89385 89555 171 0 0 29 100 89 0.450 3.26 2.03 Term + 93523 93631 109 1 1 70 42 123 0.529 3.88 2.04 PlyA + 97136 97141 6 1.05 3.09 PlyA - 98670 98665 6 1.05 3.08 Term - 100185 99998 188 1 2 68 46 266 0.993 18.05 3.07 Intr - 101558 101435 124 2 1 80 105 151 0.923 16.16 3.06 Intr - 117337 117230 108 1 0 115 113 58 0.941 11.48 3.05 Intr - 125864 125826 39 2 0 42 86 90 0.192 2.62 3.04 Intr - 133496 133361 136 1 1 48 75 32 0.166 -1.53 3.03 Intr - 142143 142032 112 1 1 89 82 46 0.608 3.54 3.02 Intr - 149068 148951 118 1 1 77 58 61 0.665 2.04 3.01 Init - 154274 154158 117 2 0 110 87 45 0.562 4.85 3.00 Prom - 157054 157015 40 -6.66 4.04 PlyA - 157623 157618 6 1.05 4.03 Term - 158579 158410 170 0 2 134 53 112 0.962 10.34 4.02 Intr - 159400 159368 33 2 0 109 82 40 0.679 3.69 4.01 Init - 173122 173053 70 1 1 101 35 42 0.255 1.41 4.00 Prom - 183883 183844 40 -1.66 5.05 PlyA - 183943 183938 6 1.05 5.04 Term - 186892 186656 237 1 0 88 49 130 0.497 5.27 5.03 Intr - 194680 194600 81 1 0 76 50 76 0.236 2.43 5.02 Intr - 206729 206581 149 0 2 49 58 90 0.306 2.05 5.01 Init - 209342 209294 49 2 1 31 72 77 0.267 1.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 204411 204253 159 0 0 77 47 84 0.817 1.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:64976479_65202339|GENSCAN_predicted_peptide_1|604_aa XHTEHAPKGTTVGPAIATAEQLESCTNRQSITKTQRGWTMGLPRFSPLEFQCALLLNEAE SYTLSAALLIMASPSSFTYYCPPSSSPVWSEPLYSLRPEHARERLQDDSVETVTSIEQMR EEAIMDLVQEYSGQRKQQAQRTKAAECQAKVEEKIQEVFSSYKFNHLVPRKDFWIESHRL FCQTKKEVQPLCFARPVQVAMGGTFHLKASIADLKAKDSDWLQNTVLWPGLWVTRGRLVL QREKHFHYLKRGLRQLTDAYECLDASRPWLCYWILHSLELLDEPIPQIVATDVCQFLELC QSPEGGFGGGPGQYPHLAPTYAAVNALCIIGTEEAYDIINREKLLQYLYSLKQPDGSFLM HVGGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIGGVPGMEAHGGYTF CGLAALVILKRERSLNLKSLLQWVTSRQMRFEGGFQGRCNKLVDGCYSFWQAGLLPLLHR ALHAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDFYHTCYCLSGLSIA QHFGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPVPGFEELKDETSAE PATD >gi568815584r:64976479_65202339|GENSCAN_predicted_CDS_1|1815_bp nnccacactgagcatgcacccaaaggcaccactgtgggacctgccatagcaacagctgaa cagctggagagttgcaccaataggcagtccatcaccaagactcaacgcggctggaccatg gggctcccccgctttagcccgctcgagtttcaatgcgcgttgttgcttaacgaagcagag tcctacacactgtctgctgctctcctgatcatggcttctccgagttctttcacctactat tgccctccatcttcctcccccgtctggtcagagccgctgtacagtctgaggcccgagcac gcgcgagagcggttgcaggacgactcggtggaaacagtcacgtccatagaacagatgagg gaggaagccatcatggatctggtgcaagagtattctgggcagagaaagcagcaagcacaa aggaccaaggcagcagaatgccaggcaaaagtagaagaaaagatccaagaggtcttcagt tcttacaagttcaaccaccttgtaccaagaaaagatttttggatagagtcacacaggctc ttctgccagaccaagaaagaagtgcagcctctgtgctttgcccggcctgtgcaggtggcc atgggcggcacattccatcttaaagccagtattgctgacttgaaagccaaggacagcgac tggctgcagaacacggtcctctggccagggctgtgggtcacacgtgggaggcttgttttg cagagggagaagcacttccattatctgaaaagaggccttcgacaactgacagatgcctat gagtgtctggatgccagccgcccatggctctgctattggatcctgcacagcttggaactg ctagatgaacccatcccccagatagtggctacagatgtgtgtcagttcctggagctgtgt cagagcccagaaggtggctttggaggaggacccggtcagtatccacaccttgcacccaca tatgcagcagtcaatgcattgtgcatcattggcaccgaggaggcctatgacatcattaac agagagaagcttcttcagtatttgtactccctgaagcaacctgacggctcctttctcatg catgtcggaggtgaggtggatgtgagaagcgcatactgtgctgcctccgtagcctcgctg accaacatcatcactccagacctctttgagggcactgctgaatggatagcaaggtgtcag aactgggaaggtggcattggcggggtaccagggatggaagcccatggtggctataccttc tgtggcctggccgcgctggtaatcctcaagagggaacgttccttgaacttgaagagctta ttacaatgggtgacaagccggcagatgcgatttgaaggaggatttcagggccgctgcaac aagctggtggatggctgctactccttctggcaggcggggctcctgcccctgctccaccgc gcactgcacgcccaaggtgaccctgcccttagcatgagccactggatgttccatcagcag gccctgcaggagtacatcctgatgtgctgccagtgccctgcgggggggcttctggataaa cctggcaagtcgcgtgatttctaccacacctgctactgcctgagcggcctgtccatagcc cagcacttcggcagcggagccatgttgcatgatgtggtcctgggtgtgcccgaaaacgct ctgcagcccactcacccagtgtacaacattggaccagacaaggtgatccaggccactaca tactttctacagaagccagtcccaggttttgaggagcttaaggatgagacatcggcagag cctgcaaccgactag >gi568815584r:64976479_65202339|GENSCAN_predicted_peptide_2|114_aa MALITWLPRPTWPLDFQLYETMWVKQAYYLPFTTGETGWRAYVNHPRFPSCQVVVPEPEP VSAASKAHALPPPQASEGRAVDGTSRQETMKLSSTVHGSHGLPGCQAASRIPDE >gi568815584r:64976479_65202339|GENSCAN_predicted_CDS_2|345_bp atggccctgatcacatggctcccgaggcccacgtggcctctggattttcagttatatgag acaatgtgggtgaagcaggcgtattacctgccttttacaactggagaaacaggttggaga gcttacgtaaatcacccaaggtttcccagctgtcaagtggtggtgcctgagcccgagccc gtatctgctgcctccaaagcccatgcccttccaccaccgcaggcctctgaaggaagagct gttgatggcaccagcagacaggaaacaatgaagctctcaagcactgtgcatgggagccat gggctgccgggctgccaggctgccagccggattccggatgagtga >gi568815584r:64976479_65202339|GENSCAN_predicted_peptide_3|313_aa MPGRNLMVLKCGALPLLLSRSLLLPSKTCFAYPLPLAIILWLCDVPPPPLPSTMIVFPEA SPEAQQMPASRVLYSLQNRQLAREGRPDRLEEQTSSSSCQRPKDTVFLGPLTKPARRYQV LWMNGGLQGRSELDDHQHIHDSGHEHLEERTEPQISPATNQEMSDNDDIEVESDADKRAH HNALERKRRDHIKDSFHSLRDSVPSLQGEKASRAQILDKATEYIQYMRRKNHTHQQDIDD LKRQNALLEQQVRALEKARSSAQLQTNYPSSDNSLYTNAKGSTISAFDGGSDSSSESEPE EPQSRKKLRMEAS >gi568815584r:64976479_65202339|GENSCAN_predicted_CDS_3|942_bp atgcctggcagaaatctgatggttttaaagtgtggggcacttcccctcttgctgtctcgc tccctcctgctgccctctaagacatgctttgcctaccctttaccattggccattattctc tggctatgcgatgtgcctcctcctcctttgccttccactatgattgtgtttcctgaggcc tctccagaagcccagcagatgccagcatcacgtgtcctgtacagcttgcagaaccgtcag ctagccagagagggtcggcctgaccgactagaagaacagacatcttccagcagctgtcag cgtccaaaggacactgtgttccttggacctcttaccaaaccagccaggagatatcaagtg ctatggatgaatggagggcttcaaggaaggtctgagctagatgaccatcaacatatacat gacagtggacatgaacatctagaggaaagaactgaaccccaaatatctccagcaaccaat caggaaatgagcgataacgatgacatcgaggtggagagcgacgctgacaaacgggctcat cataatgcactggaacgaaaacgtagggaccacatcaaagacagctttcacagtttgcgg gactcagtcccatcactccaaggagagaaggcatcccgggcccaaatcctagacaaagcc acagaatatatccagtatatgcgaaggaaaaaccacacacaccagcaagatattgacgac ctcaagcggcagaatgctcttctggagcagcaagtccgtgcactggagaaggcgaggtca agtgcccaactgcagaccaactacccctcctcagacaacagcctctacaccaacgccaag ggcagcaccatctctgccttcgatgggggctcggactccagctcggagtctgagcctgaa gagccccaaagcaggaagaagctccggatggaggccagctaa >gi568815584r:64976479_65202339|GENSCAN_predicted_peptide_4|90_aa MRAAYTVYCTNPEVAMCMHFSVNENLLSVLEEMGDHVVLDPETTSAYLQEAECNLSVMNT GMKMSVPSNPQRFSYFSVLGTVGFSSGRHD >gi568815584r:64976479_65202339|GENSCAN_predicted_CDS_4|273_bp atgagggctgcatatactgtgtactgcacaaatccagaagttgccatgtgcatgcacttc agtgtaaatgagaacctgctgagtgtcctggaggaaatgggagaccatgtggtcctggac ccagaaactaccagtgcttacctccaagaagcagaatgtaatctgagtgtaatgaacaca ggcatgaagatgtctgtgccttccaacccacagaggttctcctacttcagtgtgctgggg actgtgggattctcatcaggcagacacgactga >gi568815584r:64976479_65202339|GENSCAN_predicted_peptide_5|171_aa MGIRSNRYWMVTTDIEGVENIFDKIQHPFMMKTLNKLGIEGMNLNTIYIKPIAIIILNSE KLKAFPPHCPLLEVSALLPLAFTSCRQDAALSEKKYTFSSSFPINWILEFSKLVHIYGQG QISFTPERSTPNRKWGMLAGSSPTHRQVALVKGLLHLSFSPFFQSFSYRTN >gi568815584r:64976479_65202339|GENSCAN_predicted_CDS_5|516_bp atgggaatcagatcgaacaggtactggatggtaaccaccgacatcgagggtgtggaaaac atatttgacaaaattcaacatcctttcatgatgaaaactctcaacaaattaggtatagaa ggaatgaacctcaacacaatatacattaaacccatagctatcattatactcaacagtgaa aaattgaaagcctttcctccccactgccccctcctggaagtcagcgcactgctgccgctg gcattcacatcgtgtcggcaggacgccgcgttgtcagagaaaaaatataccttctcatct tccttcccaatcaactggattctggaattctccaagttggtgcacatctacgggcaggga cagatcagcttcaccccagaaagatcaaccccaaacaggaaatggggaatgctggctggg tcttcccctacacatcgtcaggtggctctggtcaaaggtctacttcacctttccttctct cccttcttccagtccttctcctatcgaaccaactga