GENSCAN 1.0    Date run: 2-Nov-116    Time: 23:52:01

Sequence gi568815588r:68315042_68571864 : 256823 bp : 43.08% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  16375  16604  230  0  2   80   16   195 0.273   9.13
 1.02 Intr +  16656  16768  113  0  2   20   76    97 0.196   1.72
 1.03 Intr +  17345  17410   66  0  0   98   22    95 0.161   2.78
 1.04 Intr +  17728  17832  105  2  0  122   42    32 0.129   2.19
 1.05 Intr +  22158  22292  135  1  0   87   77    72 0.878   6.54
 1.06 Intr +  24099  24185   87  1  0   48   83   128 0.926   8.24
 1.07 Intr +  24399  24514  116  1  2   89   82    12 0.935   0.87
 1.08 Intr +  26133  26268  136  2  1   27   85   113 0.943   5.04
 1.09 Intr +  26544  26639   96  1  0   98   71    67 0.979   5.98
 1.10 Intr +  26718  26810   93  1  0   82  115    42 0.974   6.24
 1.11 Term +  37269  37279   11  1  2   99   48    10 0.107  -3.54
 1.12 PlyA +  37990  37995    6                               1.05

 2.12 PlyA -  40949  40944    6                               1.05
 2.11 Term -  47444  47384   61  1  1  105   43    49 0.147  -0.62
 2.10 Intr -  61931  61812  120  1  0   10   68   139 0.235   3.71
 2.09 Intr -  64480  64383   98  1  2   51   87    72 0.955   2.21
 2.08 Intr -  66358  66191  168  1  0   17  110   167 0.994  11.94
 2.07 Intr -  68873  68757  117  2  0   71   96    66 0.982   6.36
 2.06 Intr -  69111  69010  102  0  0   76  107    26 0.926   3.67
 2.05 Intr -  79410  79287  124  2  1   67  108   114 0.930  11.89
 2.04 Intr -  86696  86579  118  0  1   75   72   134 0.820  10.02
 2.03 Intr -  89803  89630  174  2  0   94   64    56 0.042   3.71
 2.02 Intr -  92249  92145  105  0  0   28   82   130 0.005   6.59
 2.01 Init - 103575 103521   55  0  1   95   61     0 0.019  -0.62
 2.00 Prom - 105573 105534   40                              -1.46

 3.00 Prom + 108984 109023   40                              -0.76
 3.01 Init + 109519 109770  252  0  0   93   61   454 0.714  40.54
 3.02 Term + 109828 109956  129  0  0    2   36   154 0.694  -0.22
 3.03 PlyA + 110741 110746    6                               1.05

 4.08 PlyA - 113502 113497    6                               1.05
 4.07 Term - 117627 117592   36  0  0   81   55    62 0.513  -0.46
 4.06 Intr - 135206 134987  220  1  1  127   75   138 0.933  14.70
 4.05 Intr - 147582 147520   63  2  0   42  110    59 0.691   1.33
 4.04 Intr - 150771 150626  146  0  2   74  101    21 0.935   1.08
 4.03 Intr - 153265 153082  184  0  1   81   61    85 0.974   4.89
 4.02 Intr - 155122 154940  183  0  0   54   51   137 0.911   5.50
 4.01 Init - 156823 156750   74  1  2   83   42   115 0.747   6.94
 4.00 Prom - 165983 165944   40                              -2.96

 5.05 PlyA - 167580 167575    6                               1.05
 5.04 Term - 168547 168391  157  0  1   39   42   135 0.933   1.41
 5.03 Intr - 172171 172103   69  0  0  106   95    59 0.944   6.70
 5.02 Intr - 188654 188591   64  0  1   91  110    43 0.076   4.58
 5.01 Init - 212334 212205  130  0  1   68   90   225 0.993  19.04
 5.00 Prom - 219781 219742   40                              -2.66

 6.03 PlyA - 220334 220329    6                               1.05
 6.02 Term - 227542 227334  209  2  2   42   41   116 0.452  -0.20
 6.01 Init - 229792 229471  322  1  1   69   35   143 0.471   4.64

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  89759  89630  130  2  1   83   64    64 0.942   3.81
S.002 Sngl +  92147  92560  414  1  0   66   39   252 0.930  12.40

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:68315042_68571864|GENSCAN_predicted_peptide_1|395_aa
MNTPKKPGSRAAHWDEARWRAASNSGPGEGGGHPAREPKSAPVDAFIFAAQKHSYFSPHN
ANMVDQKIRPAFAGRTTICSRWGQITGELLQSATARFLLHTGRTAIAIAPRRQNGRAACR
WEDLLRWNSVDDAEARISAGLPGLTGLRVTSDADSQDERGWSWGRKANGDEAFKSNGIEM
DWVMKHNGPNDASDGTVRLRGLPFGCSKEEIVQFFQGYGGFDDYGGYNNYGYGNDGFDDR
MRDGRGMGGHGYGGAGDASSGFHGGHFVHMRGLPFRATENDIANFFSPLNPIRVHIDIGA
DGRATGEADVEFVTHEDAVAAMSKDKNNMQHRYIELFLNSTPGGGSGMGGSGMGGYGRDG
MDNQGGYGSVGRMGMGNNYSGGYGTPDGLGGYGSY

>gi568815588r:68315042_68571864|GENSCAN_predicted_CDS_1|1188_bp
atgaatacacccaagaaacccggatcccgcgcggcacattgggacgaagcgcgctggcgg
gcggccagcaactctgggccaggggaaggaggcggacacccagcccgagagccgaaatcg
gccccagtggacgctttcatcttcgcggcccagaaacactcctatttttcaccgcacaat
gcaaacatggtggaccagaaaatccgccccgcgtttgccggtcgaaccacaatttgttca
cggtgggggcagatcaccggagaacttctccagagcgctacggcacggttccttctacac
actggcagaactgccattgccatcgccccgaggcggcagaatgggagggcggcttgccga
tgggaggacttgctcaggtggaattcagtggacgacgccgaggcccgaatctcggcaggg
ctacctgggctgacgggactgcgagtgacttctgacgcagattcccaagatgagagaggc
tggagctgggggaggaaagccaatggcgacgaagcatttaaatcaaacggtattgagatg
gattgggttatgaaacataatggtccaaatgacgctagtgatgggacagtacgacttcgt
ggactaccatttggttgcagcaaagaggaaatagttcagttctttcaaggttatggaggt
tttgatgactatggtggctataataattacggctatgggaatgatggctttgatgacaga
atgagagatggaagaggtatgggaggacatggctatggtggagctggtgatgcaagttca
ggttttcatggtggtcatttcgtacatatgagagggttgccttttcgtgcaactgaaaat
gacattgctaatttcttctcaccactaaatccaatacgagttcatattgatattggagct
gatggcagagccacaggagaagcagatgtagagtttgtgacacatgaagatgcagtagct
gccatgtctaaagataaaaataacatgcaacatcgatatattgaactcttcttgaattct
actcctggaggcggctctggcatgggaggttctggaatgggaggctacggaagagatgga
atggataatcagggaggctatggatcagttggaagaatgggaatggggaacaattacagt
ggaggatatggtactcctgatggtttgggtggttatggaagctactga

>gi568815588r:68315042_68571864|GENSCAN_predicted_peptide_2|413_aa
MAFSGLGVRGGNLVDGSTAGLAADAVVAAERLEGAAGQAEGASARDRAAAAAMATKDPTA
VERANLLNMAKLSIKGLIESALSFGRTLDSDYPPLQQFFVVMEHCLKHGLKVRKSFLSYN
KTIWGPLELVEKLYPEAEEIGASVRDLPGLNEFYEYHALMMEEEGAVIVGLLVGLNVIDA
NLCVKGEDLDSQLAIAKNNIIKLQEENHQLRSENKLILMKTQQHLEVTKVDVETELQTYK
HSRQGLDEMYNEARRQLRDESQLRQDVENELAVQVSMKHEIELAMKLLEKDIHEKQDTLI
GLRQQLEEVKAINIEMYQKLQGSEDGLKEKNEIIARLEEKTNKITAAMRQLEQRLQQAEK
AQMEAEDEDEKYLQECLSKSDSLQKQISQKEKQLLCFGVAALRLIEHIARYLE

>gi568815588r:68315042_68571864|GENSCAN_predicted_CDS_2|1242_bp
atggcctttagcgggttgggggtgaggggcgggaacctggtggatgggtcaacagcgggc
ctggctgcagatgcggtggtggccgccgagcgcctggaaggagctgctggacaggccgag
ggagcctccgcccgagaccgcgcagccgccgccgccatggctacaaaagaccccacagct
gtagagagagcaaacttgttaaacatggctaaactgagtatcaaaggactcattgaatct
gctctgagctttggccgcactttggattctgactatccccccttgcagcaattctttgtt
gttatggaacattgcctgaaacacggtcttaaagtaagaaaatcatttttgagttacaac
aaaaccatctggggccctttggaactggtggagaagctgtaccccgaagcagaggaaata
ggagctagtgtccgggatctacctggtctgaatgagttttatgagtatcacgcactaatg
atggaagaagaaggagcagtaattgttgggctgctggttggcctgaatgtgatcgatgct
aatctgtgtgtgaagggagaggatttagactcacaattagcaatagcaaagaataacatc
attaaactccaggaagaaaatcatcaattacgaagtgaaaataaattgattttaatgaaa
acacagcagcacctagaggttaccaaagtagatgtggaaactgagcttcaaacatataag
cattctcgtcaggggctagatgaaatgtacaatgaagccagaaggcagcttcgagatgaa
tctcagttacgacaggatgtagagaatgagctagcagtacaagttagtatgaagcatgag
attgaacttgccatgaagttgctggagaaagatatccatgagaaacaagatactctgata
ggccttcgacaacaactagaggaagttaaagcaattaacatagagatgtatcaaaagttg
cagggttctgaagatggcttgaaagaaaaaaatgaaataattgcccgactagaagaaaaa
accaataaaattactgcagccatgaggcagctggaacaaagattgcagcaagcagagaag
gcgcaaatggaagctgaagatgaggatgagaaatatctacaagaatgtctcagtaaatct
gatagtctgcagaaacaaatctcccaaaaggagaaacagcttctgtgttttggggtagca
gctttgcgcctgattgaacatattgccagatatttggaataa

>gi568815588r:68315042_68571864|GENSCAN_predicted_peptide_3|126_aa
MKFNPFVNLDRSKNRKRHFHAPLHVHRKIMSSPLSKELRQKYNVRSTPIRKDDEVQVVQG
HYKGQQIGKVVQVYRKKDVIYTEQVVITRLNLNKDRKKIIEHKAKSRQVRKEKGKYKEEL
IEKMQE

>gi568815588r:68315042_68571864|GENSCAN_predicted_CDS_3|381_bp
atgaagttcaatcccttcgtgaacttggaccgcagcaaaaaccgcaaacgtcacttccat
gcccccttgcacgtgcaccggaagatcatgtcatccccgctctccaaggagctgcggcag
aagtacaatgtccgctccacacccatccgcaaggacgacgaggtccaggtagttcaagga
cactacaaaggtcagcaaattggcaaggtagtccaggtgtacagaaagaaagatgtcatc
tacactgagcaggtggttatcaccaggctaaatctcaacaaggatcggaaaaaaattatt
gaacacaaagccaagtctcgacaagtcagaaaagagaaaggcaaatataaggaggaactt
attgagaaaatgcaggaataa

>gi568815588r:68315042_68571864|GENSCAN_predicted_peptide_4|301_aa
MEQLNELELLMEKSFWEEAELPAELFQKKVVASFPRTVLSTGMDNRYLVLAVNTVQNKEG
NCEKRLVITASQSLENKELCILRNDWCSVPVEPGDIIHLEGDCTSDTWIIDKDFGYLILY
PDMLISGTSIASSIRCMRRAVLSETFRSSDPATRQMLIGTVLHEVFQKAINNSFAPEKLQ
ELAFQTIQEIRHLKEIVVIKIAELKHKEGTGIIILGGPSDNSKDNSTCNIEVVKPMDIEE
SIWSPRFGLKGKIDVTVGVKIHRGYKTKYKIMPLELKTGKESNSIEHRSQGLMLSLIGKC
D

>gi568815588r:68315042_68571864|GENSCAN_predicted_CDS_4|906_bp
atggagcagctgaacgaactggagctgctgatggagaagagtttttgggaggaggcggag
ctgccggcggagctatttcagaagaaagtggtagcttcctttccaagaacagttctgagc
acaggaatggataaccggtacctggtgttggcagtcaatactgtacagaacaaagaggga
aactgtgaaaagcgcctggtcatcactgcttcacagtcactagaaaataaagaactatgc
atccttaggaatgactggtgttctgttccagtagagccaggagatatcattcatttggag
ggagactgcacatctgacacttggataatagataaagattttggatatttgattctgtat
ccagacatgctgatttctggcaccagcatagccagtagtattcgatgtatgagaagagct
gtcctgagtgaaacttttaggagctctgatccagccacacgccaaatgctaattggtacg
gttctccatgaggtgtttcaaaaagccataaataatagctttgccccagaaaagctacaa
gaacttgcttttcaaacaattcaagaaataagacatttgaaggaaatagtagttattaag
attgcggagcttaaacataaagaaggaactggaataatcatcctgggagggccaagtgat
aatagtaaggataattcaacatgtaacattgaagtcgtgaaaccaatggatattgaagaa
agcatttggtcccctaggtttggattgaaaggcaaaatagatgttacagttggtgtgaaa
atacatcgagggtataaaacaaaatacaagataatgccgctggaacttaaaactggcaaa
gaatcaaattctattgaacaccgtagtcagggcctcatgctgtccttgattggcaagtgt
gactga

>gi568815588r:68315042_68571864|GENSCAN_predicted_peptide_5|139_aa
MAAATAAAALAAADPPPAMPQAAGAGGPTTRRDFYWLRSFLAGVNYYEAGNFRSCAQING
WIHGSYPFDVTRRRMQLGTVLPEFEKCLTMRDTMKYVYGHHGIRKGLYRGLSLNYIRCIP
SQAVAFTTYELMKQFFHLN

>gi568815588r:68315042_68571864|GENSCAN_predicted_CDS_5|420_bp
atggcggcggcgacggccgcggcagccctggcggcggccgatccccctcccgcaatgccg
caggcggcaggggccggagggcccacaacccgcagagacttctactggctgcgctccttt
ctggccggagttaattactacgaagctgggaatttcaggtcatgtgcacagattaatggc
tggatccatggcagctacccatttgatgtgactcgtcggcgaatgcaattaggaactgtt
ctgccggaatttgaaaagtgccttaccatgcgggatactatgaagtatgtctatggacac
catggaattcgaaaaggactctatcgtggtttatctcttaattacattcgctgtattccc
tctcaagcagtggcttttacaacatacgaacttatgaagcagttttttcacctcaactaa

>gi568815588r:68315042_68571864|GENSCAN_predicted_peptide_6|176_aa
MEKPLFPLVPLHWFGFGYTALVVSGGIVGYVKTGRAPSLAAGLLFGSLAGVGAYQLYQDP
RNVWDFLAATSVTFVGIMGMRSYYYGKFMPVGLIAGASLLMAAKVGVHWDANIWNYWEPT
SEPDKDTEVIWEVEDNYRPLLRVCSRKQKPYTYILAISNSLCRNLRRSKIIPQKNY

>gi568815588r:68315042_68571864|GENSCAN_predicted_CDS_6|531_bp
atggagaagcccctcttcccattagtgcctttgcattggtttggctttggctacacagca
ctggttgtttctggtgggatcgttggctatgtaaaaacaggcagggcgccgtccctggct
gcagggctgctctttggcagtctagccggcgtgggtgcttaccagctgtatcaggatcca
aggaatgtttgggatttcctagccgctacatctgttacttttgttggtattatgggaatg
agatcctactactatggaaaattcatgcctgtaggtttaattgcaggtgccagtttgctg
atggctgccaaagttggagttcactgggatgcaaatatttggaattattgggaaccaact
agtgagccagataaagacacagaggttatctgggaggtggaggataattatcgtcccctt
ttgagggtatgcagcaggaagcaaaagccctatacgtacatcctggccataagcaatagc
ctttgcaggaatcttcgacgatccaagattattcctcagaagaattattag