GENSCAN 1.0	Date run: 4-Nov-116	Time: 09:47:03

Sequence gi568815580r:45984285_46198231 : 213947 bp : 43.86% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 PlyA -    330    325    6                               1.05
 1.06 Term -   4475   4426   50  0  2  116   38    36 0.440  -1.13
 1.05 Intr -   7918   7822   97  1  1   56   68    90 0.877   3.38
 1.04 Intr -   9419   9321   99  2  0   45  100    86 0.922   5.81
 1.03 Intr -  13544  13429  116  0  2  107   55   148 0.631  13.57
 1.02 Intr -  14555  14510   46  2  1   92   94    42 0.882   3.18
 1.01 Init -  15259  15152  108  1  0   51   75   185 0.987  13.72
 1.00 Prom -  16870  16831   40                              -8.06

 2.00 Prom +  19845  19884   40                              -2.66
 2.01 Init +  29925  29973   49  2  1   96   91    -7 0.112  -0.29
 2.02 Term +  44088  44428  341  1  2   39   40   529 0.127  37.90
 2.03 PlyA +  46052  46057    6                               1.05

 3.05 PlyA -  46296  46291    6                               1.05
 3.04 Term -  55542  55524   19  2  1  107   54    20 0.806  -1.71
 3.03 Intr -  55763  55663  101  2  2  107  102   197 0.847  21.91
 3.02 Intr -  56184  56142   43  2  1   58   94    46 0.108   0.44
 3.01 Init -  62104  62010   95  1  2   71   20   109 0.088   2.26
 3.00 Prom -  74007  73968   40                              -5.46

 4.00 Prom +  76647  76686   40                              -4.46
 4.01 Init +  82245  82316   72  2  0   60  105    10 0.310   0.97
 4.02 Intr +  86652  86786  135  2  0   53   96    68 0.343   4.86
 4.03 Intr +  87566  87641   76  1  1  112   64    37 0.875   2.79
 4.04 Term +  87993  88249  257  1  2   89   36   161 0.821   6.75
 4.05 PlyA +  88718  88723    6                               1.05

 5.11 PlyA -  88907  88902    6                               1.05
 5.10 Term - 100079  99998   82  1  1   90   37    77 0.954  -0.03
 5.09 Intr - 100370 100220  151  0  1  101   80   139 0.999  13.62
 5.08 Intr - 101973 101829  145  0  1   79   87   112 0.969  10.06
 5.07 Intr - 102948 102724  225  0  0   86   67   212 0.985  17.08
 5.06 Intr - 103208 103057  152  0  2   48   48   212 0.999  13.08
 5.05 Intr - 103973 103825  149  1  2   59   67   148 0.999   9.68
 5.04 Intr - 105448 105282  167  1  2   83   38   223 0.998  15.56
 5.03 Intr - 105712 105539  174  1  0   81  100   235 0.999  24.14
 5.02 Intr - 107567 107398  170  0  2   77   94   105 0.189   9.67
 5.01 Init - 113947 113731  217  1  1   84   49   268 0.109  19.36
 5.00 Prom - 114148 114109   40                              -7.96

 6.00 Prom + 114641 114680   40                              -6.76
 6.01 Init + 119942 119995   54  1  0   43   98    49 0.872   2.69
 6.02 Intr + 120910 121084  175  0  1   66  100   142 0.469  12.71
 6.03 Intr + 138372 138421   50  1  2  100  111    -4 0.358   1.50
 6.04 Intr + 141460 141507   48  0  0   80   94    28 0.671   1.48
 6.05 Intr + 142054 142189  136  0  1   66   64    73 0.296   2.84
 6.06 Term + 173206 173495  290  2  2   87   55   100 0.290   2.14
 6.07 PlyA + 175411 175416    6                               1.05

 7.04 PlyA - 176814 176809    6                               1.05
 7.03 Term - 179749 179623  127  0  1   48   49   116 0.726   1.46
 7.02 Intr - 180258 180221   38  0  2  104  113    56 0.719   6.76
 7.01 Init - 181583 181536   48  2  0   70   58    38 0.366  -2.03
 7.00 Prom - 187519 187480   40                              -4.46

 8.03 PlyA - 188716 188711    6                               1.05
 8.02 Term - 189609 188916  694  2  1   67   41   406 0.302  26.84
 8.01 Intr - 190418 190258  161  2  2   88   64    89 0.359   5.29

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  43988  44428  441  1  0  -55   40   607 0.851  36.86
S.002 Term + 113619 114052  434  1  2   92   46   373 0.818  28.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580r:45984285_46198231|GENSCAN_predicted_peptide_1|171_aa
MTKAKKNYEQKCRDKDEAEQAVSRSANLVNPKQQEKLFVKLATSKTAVEDSDKAYMLHIG
TLDKVREEWQSEHIKACEVPVTGSCSGLGWAFEAQECERINFFRNALWLHVNQLSQQCVT
SDEMYEQVRKSLEMCSIQRDIEYFVNQRKTGQIPPDDPNYSLVDDYSLLYQ

>gi568815580r:45984285_46198231|GENSCAN_predicted_CDS_1|516_bp
atgacaaaggcaaagaagaactatgagcagaaatgccgggacaaagatgaggcagaacag
gccgtcagccggagtgccaacctggtgaacccgaagcaacaagaaaagctttttgtgaaa
ctggcaacttcaaagaccgcagtagaggactcagacaaagcatacatgctgcacatcggc
accctggataaggtccgagaagagtggcagagtgagcacatcaaggcctgcgaggtaccc
gtaaccggaagctgctcaggactggggtgggcatttgaggctcaagaatgtgaacgaata
aacttcttccggaatgcattgtggttacatgtgaatcagctgtcacaacaatgtgtcacc
agtgatgaaatgtacgaacaagtccgaaagagtttagaaatgtgcagcattcagagggac
attgaatactttgtgaatcaacgcaaaactggacagattccaccagatgatcccaattac
tctttggttgatgactacagtttgctctatcagtaa
>gi568815580r:45984285_46198231|GENSCAN_predicted_peptide_2|129_aa
MEFYHVGQTSLELLTSAKTAGQAKRGEGRGETAPAAEPPDAGEDKPAVEWCLEELVFGDV
EDDEDTLLRRLRGPWVQGHEDLGDSEAENEAKGNCAPQKKPIWVDEEDEDEEMVDMMNNR
FRKDNDEKC

>gi568815580r:45984285_46198231|GENSCAN_predicted_CDS_2|390_bp
atggagttttaccatgttggccagactagtcttgaactcctgacctcagcgaaaaccgcc
ggccaggccaagcgcggcgagggaagaggagagacggctccggcagcggaaccgcccgac
gctggagaagacaaaccggccgtggagtggtgcctggaggagctggtcttcggcgatgtc
gaggacgacgaggacacgctgctgcggcgtctgcggggtccgtgggttcaaggacatgaa
gacttgggagactcagaagcggagaatgaagcaaaaggtaattgtgcacctcaaaagaag
ccaatttgggtggatgaagaagatgaagatgaggaaatggttgacatgatgaacaatcgg
tttcggaaagataatgatgaaaaatgctag

>gi568815580r:45984285_46198231|GENSCAN_predicted_peptide_3|85_aa
MRGTGWRCSTELDVRMLSSGTRNPVGGFGDCGLSKVNMTVISICYQSADILSTIGYDNII
QHLNNGRKNCKEFEDFLKERLWNDT

>gi568815580r:45984285_46198231|GENSCAN_predicted_CDS_3|258_bp
atgagaggaacgggctggagatgcagcacagaactagatgtgcggatgctcagcagcggg
acccgtaaccctgttggtggctttggagactgcggtctgtcgaaggtcaacatgactgtg
atcagcatttgttatcagagtgcagacatcctcagcaccatcggctatgacaacattatc
caacatctgaacaatggccgcaagaactgcaaagagtttgaagactttctaaaagaaagg
ttatggaatgatacctga

>gi568815580r:45984285_46198231|GENSCAN_predicted_peptide_4|179_aa
MEKKFLSSTSCCVFSVGHKVSHQKSLFPTVPAMGSPSPPFYLKNSQASRESHLWCPWFLC
EAATTLHPREHALLGARAAEGGPWSPSPRDSRARDSTSAFHTRAGKPPPGPKPLPIATAA
VGALRSLRETWAGPSALTRPGAAARTRTQAHLKPHSVGPRSIYQPEASCLLLEVFGYWK

>gi568815580r:45984285_46198231|GENSCAN_predicted_CDS_4|540_bp
atggagaagaagttcctctcttcaacaagttgctgtgttttctccgttggtcacaaggtc
tctcaccagaagagcctttttcccacagtgcctgccatgggatcccccagtcctccattc
tacctgaaaaactcgcaagcctcccgtgagagccatctgtggtgcccttggttcctctgt
gaggctgccaccaccctgcaccctagggagcacgcgctgctgggggcgcgggcggcagag
ggtgggccctggagtccgagcccccgggactcgcgggcacgcgactcaacttctgcgttc
cacactcgcgcagggaagccgcccccgggtcctaaaccactcccgattgcgacggccgcc
gtcggcgctctgcgctccctgcgcgagacctgggcgggaccctcggcgctcacccggcca
ggggctgcagcccgcacccggacccaggcccatttaaagccccacagcgtcgggccacga
tcaatatatcagcccgaagctagctgcctccttctggaagtctttgggtattggaaataa
>gi568815580r:45984285_46198231|GENSCAN_predicted_peptide_5|543_aa
MLSVRVAAAVVRALPRRAGLVSTEGRHDAGGRVGLQGGGAPARALSAGGRGAVANAAILH
PWLLRLDRAGDTGTAEMSSILEERILGADTSVDLEETGRVLSIGDGIARVHGLRNVQAEE
MVEFSSGLKGMSLNLEPDNVGVVVFGNDKLIKEGDIVKRTGAIVDVPVGEELLGRVVDAL
GNAIDGKGPIGSKTRRRVGLKAPGIIPRISVREPMQTGIKAVDSLVPIGRGQRELIIGDR
QTGKTSIAIDTIINQKRFNDGSDEKKKLYCIYVAIGQKRSTVAQLVKRLTDADAMKYTIV
VSATASDAAPLQYLAPYSGCSMGEYFRDNGKHALIIYDDLSKQAVAYRQMSLLLRRPPGR
EAYPGDVFYLHSRLLERAAKMNDAFGGGSLTALPVIETQAGDVSAYIPTNVISITDGQVA
GTMKLELAQYREVAAFAQFGSDLDAATQQLLSRGVRLTELLKQGQYSPMAIEEQVAVIYA
GVRGYLDKLEPSKITKFENAFLSHVVSQHQALLGTIRADGKISEQSDAKLKEIVTNFLAG
FEA

>gi568815580r:45984285_46198231|GENSCAN_predicted_CDS_5|1632_bp
atgctgtccgtgcgcgttgctgcggccgtggtccgcgcccttcctcggcgggccggactg
gtgagcaccgaaggccggcatgatgcaggcggccgggtggggctgcagggtggtggtgcg
ccggctcgggcgctctctgcaggagggcgaggggctgtggcgaatgccgccatcttgcac
ccgtggcttctccggctggacagagcaggcgacacagggactgctgagatgtcctctatt
cttgaagagcgtattcttggagctgatacctctgttgatcttgaagaaactgggcgtgtc
ttaagtattggtgatggtattgcccgcgtacatgggctgaggaatgttcaagcagaagaa
atggtagagttttcttcaggcttaaagggtatgtccttgaacttggaacctgacaatgtt
ggtgttgtcgtgtttggaaatgataaactaattaaggaaggagatatagtgaagaggaca
ggagccattgtggacgttccagttggtgaggagctgttgggtcgtgtagttgatgccctt
ggtaatgctattgatggaaagggtccaattggttccaagacgcgtaggcgagttggtctg
aaagcccccggtatcattcctcgaatttcagtgcgggaaccaatgcagactggcattaag
gctgtggatagcttggtgccaattggtcgtggtcagcgtgaactgattattggtgaccga
cagactgggaaaacctcaattgctattgacacaatcattaaccagaaacgtttcaatgat
ggatctgatgaaaagaagaagctgtactgtatttatgttgctattggtcaaaagagatcc
actgttgcccagttggtgaagagacttacagatgcagatgccatgaagtacaccattgtg
gtgtcggctacggcctcggatgctgccccacttcagtacctggctccttactctggctgt
tccatgggagagtattttagagacaatggcaaacatgctttgatcatctatgacgactta
tccaaacaggctgttgcttaccgtcagatgtctctgttgctccgccgaccccctggtcgt
gaggcctatcctggtgatgtgttctacctacactcccggttgctggagagagcagccaaa
atgaacgatgcttttggtggtggctccttgactgctttgccagtcatagaaacacaggct
ggtgatgtgtctgcttacattccaacaaatgtcatttccatcactgacggacaggtagca
ggtaccatgaagctggaattggctcagtatcgtgaggttgctgcttttgcccagttcggt
tctgacctcgatgctgccactcaacaacttttgagtcgtggcgtgcgtctaactgagttg
ctgaagcaaggacagtattctcccatggctattgaagaacaagtggctgttatctatgcg
ggtgtaaggggatatcttgataaactggagcccagcaagattacaaagtttgagaatgct
ttcttgtctcatgtcgtcagccagcaccaagccttgttgggcactatcagggctgatgga
aagatctcagaacaatcagatgcaaagctgaaagagattgtaacaaatttcttggctgga
tttgaagcttaa

>gi568815580r:45984285_46198231|GENSCAN_predicted_peptide_6|250_aa
MPEPSTSLAQGLNGQESQVAAWLKKIFGDHPIPQYEVNPRTTEILHHLSERNRVRDRDVY
LVIEDLKQKASEYESEARHYYSLFTDERTIEKLNPSLAQVKIEEAKRELWKKGNILRDIE
YSRQKEFAKAYKLKHGGLTQAVASDKVRLLRNMKCLSTCNPECPRRETPGQVIPTGNCNN
DWRVNVDQGVSWGTIQKRRVRKKLQHKMEIRGKNEVSGAIIICLPDAHSKSICRDTGLKQ
RKRFNCSVTE

>gi568815580r:45984285_46198231|GENSCAN_predicted_CDS_6|753_bp
atgccagagccttcgaccagccttgcacaggggctcaacggccaggagtctcaggttgct
gcgtggttaaaaaaaatatttggagatcatcctattccacagtatgaggtgaacccacgg
accacagagattttacatcacctttcagaacgcaacagggtccgggacagggatgtctac
ctggtaatagaggacttgaagcagaaagcaagtgaatacgagtcagaagctaggcattat
tattccctttttacagatgagaggactatagagaagttgaatccgtctcttgctcaagtg
aaaattgaagaagcaaagcgagaactatggaaaaaggggaatatattgagagatatagaa
tattctaggcaaaaagaatttgcaaaggcatataagctgaaacatggaggacttactcaa
gctgtagcaagtgataaagtccggctgcttagaaacatgaaatgtttgtccacatgcaat
ccagagtgtcccaggagggaaaccccggggcaggtgatacctacaggcaactgcaacaat
gactggagggtaaatgtggaccaaggtgtgagctggggcaccatacaaaagagaagggtg
aggaagaagctgcagcacaaaatggagattagagggaaaaacgaggttagtggagccatc
atcatctgcttgcccgatgcacacagcaagtcaatatgccgagacaccgggttgaagcag
agaaagaggtttaactgtagtgtcactgaatga

>gi568815580r:45984285_46198231|GENSCAN_predicted_peptide_7|70_aa
MGFSMLVRLVLSSRPQVAVKAAIPPDTRRTGLNLETHHNSRGGRCRIGTCRMEKLIPKST
KDFLEEEAFD

>gi568815580r:45984285_46198231|GENSCAN_predicted_CDS_7|213_bp
atggggttctccatgttggtcaggctggtcttgagctcccgacctcaggtggcagtcaaa
gcggccattcccccagacaccagaagaacagggctcaacctagaaacacatcacaatagc
agagggggtcgatgcagaatagggacatgtcgcatggagaaattaattcccaagagcacc
aaggacttcttggaagaggaagcttttgactag
>gi568815580r:45984285_46198231|GENSCAN_predicted_peptide_8|284_aa
PGDCGTSSLPSAPPEKDSPQPGPSVWGLGVPEPTPPGREDARVTNGGRARRSERPRGVKA
PAPRASFPRSLLLLRPSPSRNVFPGGWESHSMQAKDNAFSKGQRSLFCALLSTHPLLKPF
RKQLAVSESQSRSSGEPVPFSPRFFADADSWARGDSAELFSGSGTNICSSFCPGTLQALQ
IDRHLLVGRGPLGRPRSAPNSASAVALRFAVSPGEGGGNEVDPNPEEGEEGAGPLAGGAF
PKEEQEMLKTYLKTQDLAKDPRSPKAKQCFEKKNNNNKALRRKM

>gi568815580r:45984285_46198231|GENSCAN_predicted_CDS_8|855_bp
cccggggactgcggcacctccagcctcccctcagcacccccggaaaaggactcgcctcaa
cccggccccagcgtctgggggcttggagtgccggagccaactccgccgggccgggaggac
gctagggtcaccaacggcggacgagcaaggaggtcggagagaccgcggggcgtgaaggcg
ccggctccccgggcttctttcccgcgctccctccttctgttgagaccgtcaccttcccga
aatgtctttcccggggggtgggaaagccactcaatgcaagccaaggacaatgccttcagc
aaaggacagcgctcccttttctgtgcgctgctcagtactcaccccctcctgaagcccttc
cgaaagcagcttgcggtgtctgaatcccagagtcgctccagcggagagcccgtgcccttc
tctccccgctttttcgcggatgccgatagctgggctcggggagactcggctgagctcttc
tcgggcagcggcaccaatatctgctcaagtttctgtccgggcaccttgcaggccttgcag
atcgaccgccacctgctggtgggacgtggcccgctggggagaccccgctctgccccgaat
tccgcctctgcagtcgccctccgatttgcagtcagccctggggagggtgggggaaacgag
gtggacccgaaccccgaagaaggggaggagggcgcaggcccactggcgggtggagcgttt
cccaaggaggagcaggagatgctgaagacatatctcaagactcaagatctggccaaggac
ccgcgctcccccaaagcaaagcagtgttttgagaagaaaaataataataataaagctttg
aggaggaaaatgtaa