GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:02:26 Sequence gi568815591r:129918667_130151366 : 232700 bp : 44.37% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 30260 30299 40 -0.96 1.01 Init + 33502 33683 182 0 2 68 35 150 0.469 6.36 1.02 Intr + 39866 39887 22 2 1 142 96 -5 0.497 3.25 1.03 Intr + 45016 45116 101 0 2 37 103 65 0.250 1.81 1.04 Intr + 75725 75749 25 2 1 71 96 53 0.017 2.43 1.05 Term + 82849 82950 102 0 0 96 48 23 0.024 -2.62 1.06 PlyA + 85297 85302 6 1.05 2.08 PlyA - 85314 85309 6 1.05 2.07 Term - 90274 89981 294 1 0 6 35 172 0.067 -0.99 2.06 Intr - 103859 103653 207 2 0 119 81 98 0.993 11.47 2.05 Intr - 105057 104845 213 0 0 93 109 128 0.991 14.31 2.04 Intr - 110402 110236 167 0 2 81 101 20 0.188 2.28 2.03 Intr - 122419 122285 135 2 0 41 95 109 0.386 7.44 2.02 Intr - 129318 129284 35 2 2 44 44 33 0.090 -7.63 2.01 Init - 132700 132555 146 1 2 104 77 105 0.555 10.60 2.00 Prom - 137562 137523 40 -2.26 3.00 Prom + 141013 141052 40 -4.16 3.01 Init + 141917 141930 14 1 2 114 34 5 0.237 -2.49 3.02 Intr + 151750 151834 85 1 1 113 115 40 0.694 9.02 3.03 Intr + 151993 152143 151 0 1 30 109 95 0.488 5.54 3.04 Intr + 186403 186543 141 2 0 49 67 96 0.037 3.92 3.05 Intr + 197779 198000 222 2 0 77 94 137 0.810 11.30 3.06 Intr + 202083 202237 155 1 2 74 81 50 0.666 2.69 3.07 Intr + 203388 203536 149 2 2 87 78 144 0.538 12.33 3.08 Intr + 205785 205869 85 0 1 83 71 33 0.309 0.92 3.09 Term + 211871 212080 210 1 0 63 55 81 0.097 -0.41 3.10 PlyA + 217019 217024 6 1.05 4.03 PlyA - 217870 217865 6 1.05 4.02 Term - 222138 221939 200 1 2 85 36 130 0.736 5.06 4.01 Init - 229095 228975 121 0 1 96 63 108 0.559 9.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 82769 82817 49 1 1 87 94 51 0.822 6.72 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:129918667_130151366|GENSCAN_predicted_peptide_1|143_aa MGSVVESEATGRAKRKKAVSGVQTSRRKVTGSGDTGRASSNAYFGVLRRKGELKRLSAPG RAEPTLDQALYQGALLTTNQPGRTDVILLVAQMKNLTFKEVMLYALYVYQGKYSRYKYHI DRYDAVCPLPANAGECVNKPTPG >gi568815591r:129918667_130151366|GENSCAN_predicted_CDS_1|432_bp atgggctcggtggtagagagtgaggcgactggtcgtgcaaaaagaaaaaaggccgtctcg ggagtccaaacgtcgcggaggaaagtcacaggctctggggatacaggaagagcgagcagc aacgcgtatttcggggtgctgcggagaaaaggcgagctgaaaaggctctcggcccctggg agagctgagcccacccttgaccaggcactgtaccaaggtgcattacttacaaccaaccaa cctggcaggacagatgttattctgcttgttgcacagatgaagaatcttacgttcaaagaa gtcatgttatatgccctctatgtgtaccagggtaaatacagtagatataagtatcatatc gatagatatgatgctgtatgtcccttacctgccaatgctggggagtgtgtaaacaagcca acccctggctag >gi568815591r:129918667_130151366|GENSCAN_predicted_peptide_2|398_aa MAAPCEGQAFAVGVEKNWGAVVRSPEGTPQKIRQLIDEGIAPEEGGVDAKVHNLAADFHQ SKPFELSPLVCAKYGWVTVECDMLKCSSCQAFLCASLQPAFDFDRFYTNIIFQFTFSTDR FGMLPLDEPAILVSEFLDRFQSLCHLDLQLPSLRPEDLKTMAEKSPGPIVSRTRSWDSSS PVDRPEPEAASPTTRTRPVTRSMGTGDTPGLEVPSSPLRKAKRARLCSSSSSDTSSRSFF DPTSQHRDWCPWVNITLGKESRENGGTEPDASAPAEPGWKAVLTILLAHKQSSQPAETDS MIWLQTVKSLVGPDNSSETQSLLCGKQNVNRGLRVFGKSAPRIRDGPGSSFSLSSIPCYT RSHSLCPTGPQGALGSLAKSQHCPGPCSRSTVTGLNSH >gi568815591r:129918667_130151366|GENSCAN_predicted_CDS_2|1197_bp atggcggcgccctgtgagggacaagcgtttgccgtaggggttgaaaagaattggggtgca gtagttcgctccccagaagggaccccccagaaaatccggcagctgatagatgaggggatt gccccggaagagggaggcgtggacgcgaaagttcacaacttggcagctgacttccatcag agtaagccctttgagctgtctccactcgtctgtgcaaaatatggctgggtcacagtggaa tgtgatatgctcaagtgctctagctgtcaagcttttctctgtgccagtttacaaccagct tttgactttgacagattttacaccaatataatcttccaatttactttttccacagaccga tttgggatgttgcccctggatgagcctgctattcttgttagtgaattcctagatcgtttt caaagcctttgtcacttggacctccagcttccttccctaaggccggaggacttgaaaact atggctgaaaagagccctggtcccattgtctctcgaactcggagctgggactcttccagt cctgttgaccgtcctgagccagaggctgctagccccaccaccagaactcgcccagtgacc cgaagcatgggaacaggagacacccctggcctggaggtaccatctagccctctgcggaaa gccaagcgagctcgcctctgctcctccagcagttcggacacatcttcccgaagcttcttt gatcccacctctcagcatagagactggtgcccttgggtgaatatcacacttggcaaagaa agcagggagaatggtggaactgaaccagatgccagcgccccagcagagccaggctggaaa gcagtgctgaccatcctcttggcgcacaaacagtctagccagccagctgaaacggactcc atgatctggctccaaacagtgaagagcctggtggggcctgacaacagttcagagacgcag agcctcctctgtgggaaacagaatgtgaaccgaggccttagagtgtttggaaaatcagct ccacgaatcagagatggaccgggcagcagcttctccctctcctccatcccatgctacacc cgtagccactctctgtgccccactggtcctcagggggccctgggctctcttgccaagtct cagcactgcccaggcccatgcagccggtcaacagtgactgggctcaactcccactaa >gi568815591r:129918667_130151366|GENSCAN_predicted_peptide_3|403_aa MPGQLHSYGYHVSLRLPPPTKKTTDHWLPFTKAGWDRNRRRGGGAAGAGGGGSGAGGGSG GSGGRGTGQLNRFVQLSGRPHLPGIATLKRLTVAIPSIGKDMEQQEFSHAALGYVKWYNH FENILALPNEGHRPPPARSGHRCVADNTNLYVFGGYNPDYDESGGPDNEDYPLFRELWRY HFATGVWHQMGTDGYMPRELASMSLVLHGNNLLVFGGTGIPFGESNGNDVHVCNVKYKRW ALLSCRGKKPSRIYGQAMAIINGSLYVFGGTTGYIYSTDLHKLDLNTREWTQLKPNNLSC DLPEERYRHEIAHDGQRIYILGGGTSWTAYSLNKAGCMYIHGGVVNIHENKRTGSLFKIW LVVPSLLELAWEKLLAAFPNLANLSRTQLLHLGLTQGLIERLK >gi568815591r:129918667_130151366|GENSCAN_predicted_CDS_3|1212_bp atgcccggccagctccatagctatggttaccacgtttctctccgcctcccgccacctacc aagaaaacgactgaccattggctgcctttcaccaaggcgggctgggacaggaaccgccgg aggggaggaggcgccgccggcgctggtggcggaggtagcggggccggcgggggcagtggg ggcagcgggggtcgggggactggccagctcaaccgcttcgtgcaactctccgggcggccg cacctgccaggaattgctacattaaaaagactgacagtggcaataccaagtattggcaag gatatggaacaacaggaattctcacatgctgcccttggatatgtaaagtggtacaaccat tttgaaaacattttggctttaccaaacgaaggccacagacctccaccagcacgaagtgga catcgttgtgtggcagataataccaacctatatgtgtttggaggttataacccagattat gatgaatcgggagggcctgataatgaagactatcctctcttcagggaactctggaggtat cattttgctacaggagtatggcaccagatgggcacagatggctacatgccccgggaattg gcatctatgtcacttgtgctgcatggaaacaacctgttagtatttggaggtacgggcatc ccatttggagagagcaacggcaatgacgtccatgtgtgtaatgtgaagtataagagatgg gctttgctcagctgtcgggggaagaaacccagtcgtatatatggacaggctatggccatc atcaatggctccctttatgtctttggaggtacaaccggctatatttacagcacagacctg cacaagttagatctcaataccagagagtggacacaactgaaaccaaacaacctatcctgt gatctaccagaagagagataccgacatgaaattgcacatgacgggcagaggatttacatc ttgggaggtggtacttcctggacagcatattccttaaacaaggctggttgcatgtacatt catggaggagtggtgaacatccatgaaaacaaacggactgggtcattgtttaagatctgg ctggtggtacctagcctgctggaactggcatgggagaagctgcttgcggccttccctaac cttgcaaacctctcccgaacacaacttctgcaccttggactcacacagggactcatcgaa cgcttgaaatga >gi568815591r:129918667_130151366|GENSCAN_predicted_peptide_4|106_aa MAEAKLVERDGVEETACSPKGEKSTLGKAPCNGSEGWRQPAMCAAGGKSMASGRVAQRTG PWGAARSTDRPGPWLHRPCAAAPNPESREARGDFTSPKALLKEFSQ >gi568815591r:129918667_130151366|GENSCAN_predicted_CDS_4|321_bp atggcggaggccaaactggtggaaagagatggtgttgaggagacagcctgctccccgaag ggtgagaagagcaccctgggaaaagccccctgtaacggcagcgagggctggaggcagccg gccatgtgcgctgcaggcggcaagtccatggccagcgggcgggtagcccagcgcaccggg ccttggggagccgcccgcagcacggaccgccctggcccctggcttcaccgcccctgcgcg gctgctcccaacccagaatcccgggaagccaggggtgacttcacgtccccgaaagccctt ctgaaagaattttctcagtaa