GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:36:10 Sequence gi568815597f:70121730_70350802 : 229073 bp : 36.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2381 2494 114 1 0 88 90 74 0.675 7.32 1.02 Term + 5126 5449 324 1 0 80 41 134 0.249 1.68 1.03 PlyA + 6933 6938 6 1.05 2.09 PlyA - 7081 7076 6 1.05 2.08 Term - 24176 24071 106 1 1 77 38 137 0.986 4.50 2.07 Intr - 26943 26758 186 2 0 45 88 104 0.296 3.98 2.06 Intr - 34067 33960 108 1 0 28 116 78 0.136 3.08 2.05 Intr - 54253 54081 173 1 2 58 97 138 0.784 9.52 2.04 Intr - 63185 63056 130 1 1 32 83 60 0.162 -0.32 2.03 Intr - 65609 65536 74 2 2 46 98 72 0.155 1.29 2.02 Intr - 67544 67363 182 0 2 -12 95 105 0.030 -0.13 2.01 Init - 83811 83661 151 0 1 67 116 125 0.286 13.67 2.00 Prom - 84618 84579 40 -4.45 3.00 Prom + 97219 97258 40 -3.05 3.01 Init + 99908 100110 203 1 2 89 64 179 0.982 14.01 3.02 Intr + 106693 106826 134 1 2 88 76 93 0.927 7.47 3.03 Intr + 110539 110648 110 2 2 42 79 87 0.852 2.28 3.04 Intr + 112967 113059 93 1 0 49 74 74 0.713 1.34 3.05 Intr + 113772 113821 50 2 2 76 82 77 0.959 2.46 3.06 Intr + 115696 115823 128 1 2 86 97 221 0.999 22.20 3.07 Intr + 122955 123086 132 1 0 31 75 201 0.656 12.80 3.08 Intr + 125091 125178 88 1 1 -2 110 104 0.992 1.71 3.09 Intr + 128636 128774 139 2 1 66 53 212 0.999 15.05 3.10 Term + 128882 129076 195 1 0 33 42 290 0.933 15.33 3.11 PlyA + 129726 129731 6 1.05 4.09 PlyA - 129829 129824 6 1.05 4.08 Term - 141118 140988 131 2 2 75 36 198 0.960 10.66 4.07 Intr - 149227 149127 101 0 2 93 82 37 0.980 2.43 4.06 Intr - 153089 152991 99 1 0 92 81 21 0.502 0.11 4.05 Intr - 155115 155036 80 0 2 109 106 90 0.997 10.33 4.04 Intr - 170820 170659 162 0 0 76 60 189 0.792 14.15 4.03 Intr - 193831 193752 80 2 2 25 99 116 0.446 4.75 4.02 Intr - 203228 203124 105 0 0 82 54 63 0.365 1.57 4.01 Intr - 214370 214329 42 0 0 108 96 20 0.400 2.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:70121730_70350802|GENSCAN_predicted_peptide_1|145_aa FRNEEKRKGILDQLEKLSDMPKHLTVNSTAMLHTENRQRVKLTCLHCCSCCYRSVPPTPH VFPKAVDVCFYPAETQIRDPLRVIAVRRALSINRWLQLPIWEELSWVTQEPGVVALTKAQ RWSAYNVTDNNKQSLCAMVGVAMPL >gi568815597f:70121730_70350802|GENSCAN_predicted_CDS_1|438_bp ttcagaaatgaggaaaagagaaaagggatccttgaccaactggaaaagttatctgatatg ccaaagcacctcactgtcaatagtactgccatgttgcatactgagaacaggcagcgtgta aaactgacctgtcttcattgctgcagttgctgctatagatccgtccctccaactccccat gtttttccaaaggcagtggatgtctgcttctatcctgctgaaacacagataagagaccct cttagagtgatagctgttcggagagccttgagtatcaacagatggttacagctcccaatt tgggaggaactttcttgggtaacgcaggaacctggagtagtagcattaactaaggcccaa agatggagtgcttataatgttactgacaacaacaaacaaagcctgtgtgcaatggtgggg gtggcaatgcctttataa >gi568815597f:70121730_70350802|GENSCAN_predicted_peptide_2|369_aa MSRLKRIAGQDLRAGFKAGGRDCGTSVPQGLLKAARKSGQLNLSGRNLSEVPQCVWRINV DIPEEANQNLSFGATERWWEQTDLTKLIISNNKLQSLTDDLRLLPALTVLDIHDNQLTSL PSAIRELENLQKLNVSHNKLKILPEEITNLRNLKCLYLQHNELTCISEGFEQLSNLEDLE LHVGENQIEMLEAEHLKHLNSILVLDLRDNKLKSVPDEIILLRSLERLDLSNNDISSDKQ ATLIPDEVFDAVKSNIVTSINFSKNQLCEIPKRFKMLPEVLYRIFTLETILISNNQVGSV DPQKMKMMENLTTLDLQNNDLLQIPPELGNCVNLRTLLLDGNPFRVPRAAILMKGTAAIL EYLRDRIPT >gi568815597f:70121730_70350802|GENSCAN_predicted_CDS_2|1110_bp atgtcgcgcctgaagcggatagcggggcaggatctccgcgctggtttcaaagcaggtgga agagactgcggtacctcggtaccccaagggctgttgaaggcagcgaggaagagcggccag ttaaacctgtcgggtagaaacctcagtgaagtgccgcagtgtgtctggagaataaatgtg gatatccctgaggaagctaatcagaatctttcgtttggtgctactgaaagatggtgggag cagacagatttgaccaaactaataatatcaaacaataaacttcagtcacttacagatgac ctgcgactcttgcctgcactgactgttcttgatatacatgataatcagttgacatccctt ccttctgctataagagagctagaaaatcttcagaaacttaatgtcagccataataaactg aaaatactccctgaagaaattacaaacctaagaaacctgaagtgcctgtatctccagcat aatgaattaacctgcatatcagagggatttgaacaactttccaatttagaagatttagaa ttgcacgtaggtgaaaaccagattgaaatgttagaggcagaacatcttaaacatctgaat tcaattcttgtgctagacctgagggataacaagttaaaatctgttccagatgaaattata ctactacggtccttggaaaggcttgacctaagcaacaatgatattagtagtgataaacaa gcaactttgattcctgatgaggtgtttgatgcagtaaaaagcaacatcgtcacttctatt aacttcagtaagaatcaactatgtgaaattccaaaaaggtttaaaatgctacctgaagtt ctatatcgtatcttcacacttgaaacaattctgattagtaataatcaggttggatctgtg gaccctcagaaaatgaagatgatggaaaatctgaccacgttggaccttcaaaataatgac ctcttacaaattccaccagagctcggtaattgtgtaaacttaagaacattactactggat ggaaatccattccgagttcctcgagcagccatattaatgaaaggaacagctgctatactt gaatatttgagagaccgaattcctacttaa >gi568815597f:70121730_70350802|GENSCAN_predicted_peptide_3|423_aa MSNTTVVPSTAGPGPSGGPGGGGGGGGGGGGTEVIQVTNVSPSASSEQMRTLFGFLGKID ELRLFPPDDSPLPVSSRVCFVKFHDPDSAVVAQHLTNTVFVDRALIVVPYAEGVIPDEAK ALSLLAPANAVAGLLPGGGLLPTPNPLTQIGAVPLAALGAPTLDPALAALGLPGANLNSQ SLAADQLLKLMSTVDPKLNHVAAGLVSPSLKSDTSSKEIEEAMKRVREAQSLISAAIEPG GQEADRDGGHILSLGVGDDPKAQGGEDLIPEKEVEGQGAHQKQDKKKEDKEKKRSKTPPK SYSTARRSRSASRHKKEKKKDKDKERSRDERERSTSKKKKSKDKEKDRERKSESDKDVKV TRDYDEEEQGYDSEKEKKEEKKPIETGSPKTKECSVEKGTGDSLRESKVNGDDHHEEDMD MSD >gi568815597f:70121730_70350802|GENSCAN_predicted_CDS_3|1272_bp atgagcaacactaccgtcgtccccagcactgcaggtccgggccccagcggcgggcccggt ggcggaggtggtggtggcggcggaggcggcggcaccgaggtaatccaggtgactaatgtc tccccgagcgctagctctgagcagatgcggactctcttcggtttcctaggcaagatcgac gaactgcgcctcttcccgccggatgattcgcctttgccagtctcatctcgtgtctgcttt gttaagttccatgatccagactcagcagttgtggcacagcatctgacaaacactgtattc gttgacagagctttgatagtcgtaccatatgcagaaggagttattcctgatgaagctaaa gctttgtctctgttggcaccagctaatgcagtggcaggtcttctgcctggtggtggactc ctgcctactcctaacccacttacccagattggcgctgttccactggctgctttgggggct cctactcttgatcctgcccttgctgcacttgggcttcctggagcaaacttgaactctcag tctcttgctgcagatcagttgctgaagcttatgagtactgttgatcccaagttgaatcat gtagctgctggtctcgtttcaccaagtctgaaatcggatacctctagtaaagaaatagag gaagctatgaaaagagtacgagaagcacagtccctaatttctgctgctatagaaccaggc ggtcaagaagcagatcgagacggcggtcacattctaagtctaggagtcggcgacgatcca aaagcccaaggcggagaagatctcattccagagaaagaggtagaaggtcaaggagcacat caaaaacaagacaaaaagaaagaagacaaagaaaagaaacgttctaaaacaccaccaaaa agttacagcacagccagacgttctagaagtgcaagcagacataaaaaggagaagaagaaa gataaagacaaagaaagaagtagggatgaaagagaacgatcaacaagcaagaagaagaag agtaaagataaggaaaaggaccgggaaagaaaatcagagagtgataaagatgtaaaagtt acacgggattatgatgaagaggaacaggggtatgacagtgagaaagagaaaaaagaagag aagaaaccaatagaaacaggttcccctaaaacaaaggaatgttctgtggaaaagggaact ggtgattcactaagagaatccaaagtgaatggggatgatcatcatgaagaagacatggat atgagtgactga >gi568815597f:70121730_70350802|GENSCAN_predicted_peptide_4|266_aa XNTPLHLAVMLGNKECAHLLLAHNAPVKVKNAQGWSPLAEAISYGDRQMTLLRKLKQQSR ESVEEKRPRLLKALKEERVGNFLADFYLVNGLVLESRKRREHLSEEDILRNKAIMESLSK GGNIMEQNFEPIRRQSLTPPPQNTITWEEYISAENGKAPHLGRELVCKESKKTFKATIAM SQEFPLGIELLLNVLEVVAPFKHFNKLREFVQMKLPPGFPVKLDIPVFPTITATVTFQEF RYDEFDGSIFTIPDDYKEDPSRFPDL >gi568815597f:70121730_70350802|GENSCAN_predicted_CDS_4|801_bp ngaaatactcctttacaccttgctgtgatgttaggaaataaagaatgtgcccatttactt ttggctcacaatgctccagtcaaggtgaaaaatgctcagggatggagccctctggcggaa gccatcagctatggagataggcagatgactcttttgaggaagcttaagcagcaatccagg gaaagtgttgaagaaaaacgacctcgattattaaaagccctgaaagaggaaagagtagga aactttttggcagacttttacctggtgaatggacttgttttagaatcaaggaaaagaaga gaacatctcagtgaagaggatattcttcgaaataaggccatcatggagagtttgagtaaa ggtggaaacataatggaacagaattttgagccgattcgaagacagtctcttacacctcct cctcagaacactattacatgggaagaatatatatctgctgaaaatggaaaagctcctcat ctgggtagagaattggtgtgcaaagagagtaagaaaacgtttaaagctacgatagccatg agccaggaatttcccttagggatagagttattattgaatgttttagaagtagtagctccc ttcaagcactttaacaagcttagagaatttgttcagatgaagcttcctccaggctttcct gtaaaattagatatacctgtgtttcccacaatcacagccactgtgacttttcaggagttt cgatacgatgaatttgatggctccatctttactatacctgatgactacaaggaagaccca agccgttttcctgatctttaa