GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:16:00 Sequence gi568815586r:49129724_49286834 : 157111 bp : 45.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 634 885 252 0 0 58 42 189 0.576 6.68 1.02 PlyA + 3330 3335 6 1.05 2.05 PlyA - 3742 3737 6 1.05 2.04 Term - 56267 55287 981 2 0 117 39 1076 0.983 97.67 2.03 Intr - 56735 56587 149 0 2 115 84 359 0.999 38.15 2.02 Intr - 57110 56888 223 2 1 115 97 237 0.947 25.00 2.01 Init - 58696 58694 3 1 0 91 101 0 0.427 1.60 2.00 Prom - 64970 64931 40 -5.06 3.00 Prom + 72434 72473 40 -5.66 3.01 Init + 86499 86613 115 2 1 68 85 61 0.570 4.17 3.02 Intr + 98368 98443 76 2 1 62 116 72 0.323 5.97 3.03 Term + 102187 102235 49 1 1 77 52 50 0.078 -2.92 3.04 PlyA + 102537 102542 6 1.05 4.03 PlyA - 103097 103092 6 1.05 4.02 Term - 104344 104067 278 2 2 66 37 180 0.634 6.32 4.01 Init - 114580 114532 49 1 1 88 65 41 0.517 2.91 4.00 Prom - 132767 132728 40 -4.46 5.00 Prom + 135320 135359 40 -0.26 5.01 Init + 135459 135704 246 2 0 74 47 107 0.298 0.80 5.02 Intr + 139742 139964 223 1 1 129 97 270 0.974 29.70 5.03 Intr + 140105 140253 149 0 2 108 119 344 0.999 39.45 5.04 Term + 142530 143504 975 2 0 80 37 1113 0.991 97.46 5.05 PlyA + 143586 143591 6 1.05 6.03 PlyA - 143841 143836 6 1.05 6.02 Term - 148044 147935 110 1 2 103 44 103 0.422 6.07 6.01 Init - 155330 155180 151 2 1 73 70 69 0.447 1.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:49129724_49286834|GENSCAN_predicted_peptide_1|83_aa MNEQFRARALAPPGGAAEARNGHVCKGAAQAAHPPLPAGIGLSLTPPQQSERSRRLSAAQ IWVKIRFPFPQNPLLPPPAPLNS >gi568815586r:49129724_49286834|GENSCAN_predicted_CDS_1|252_bp atgaatgagcaattccgggcgcgcgctcttgcgccccccggcggtgctgcagaggcacga aatggccacgtctgcaaaggcgctgcccaagccgcccatcctcccttgcctgccggcatc gggctcagcctaacacctccacagcagtctgagcgcagtaggagattatcggcggcccag atctgggtcaagatccgatttccgttccctcaaaatcccttactgccaccacctgctccc ctgaattcctaa >gi568815586r:49129724_49286834|GENSCAN_predicted_peptide_2|451_aa MRECISIHVGQAGVQIGNACWELYCLEHGIQPDGQMPSDKTIGGGDDSFNTFFSETGAGK HVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLITGKEDAANNYARGHYTIGKEIIDLVLD RIRKLADQCTGLQGFLVFHSFGGGTGSGFTSLLMERLSVDYGKKSKLEFSIYPAPQVSTA VVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLDIERPTYTNLNRLIGQIVSSITA SLRFDGALNVDLTEFQTNLVPYPRIHFPLATYAPVISAEKAYHEQLSVAEITNACFEPAN QMVKCDPRHGKYMACCLLYRGDVVPKDVNAAIATIKTKRTIQFVDWCPTGFKVGINYQPP TVVPGGDLAKVQRAVCMLSNTTAIAEAWARLDHKFDLMYAKRAFVHWYVGEGMEEGEFSE AREDMAALEKDYEEVGVDSVEGEGEEEGEEY >gi568815586r:49129724_49286834|GENSCAN_predicted_CDS_2|1356_bp atgcgtgagtgcatctccatccacgttggccaggctggtgtccagattggcaatgcctgc tgggagctctactgcctggaacacggcatccagcccgatggccagatgccaagtgacaag accattgggggaggagatgattccttcaacaccttcttcagtgagacgggggctggcaag catgtgccccgggcagtgtttgtagacttggaacccacagtcattgatgaagttcgcact ggcacctaccgccagctcttccaccctgagcaacttatcacaggcaaagaagatgctgcc aataactatgcccgagggcactacaccattggcaaggagatcattgacctcgtgttggac cgaattcgcaagctggccgaccagtgcacgggtctccagggcttcttggttttccacagc tttggtgggggaactggttctgggttcacctcgctgctcatggaacgtctctcagttgat tatggcaagaagtccaagctggagttctctatttacccggcgccccaggtttccacagct gtagttgagccctacaactccatcctcaccacccacaccaccctggagcactctgattgt gccttcatggtagacaatgaggccatctatgacatctgtcgtagaaacctcgatattgag cgtccaacctatactaacctgaataggttaataggtcaaattgtgtcctccatcactgct tccctgagatttgatggagccctgaatgttgacctgacagaattccagaccaacctggtg ccctatccccgcatccacttccctctggccacatatgcccctgtcatctctgctgagaaa gcctaccatgaacagctttctgtagcagagatcaccaatgcttgctttgagccagccaac cagatggtgaaatgtgaccctcgccatggtaaatacatggcttgctgcctgttgtaccgt ggtgacgtggttcccaaagatgtcaatgctgccattgccaccatcaagaccaagcgtacc atccagtttgtggattggtgccccactggcttcaaggttggcatcaactaccagcctccc actgtggtgcctggtggagacctggccaaggtacagagagctgtgtgcatgctgagcaac accacagccattgctgaggcctgggctcgcctggaccacaagtttgacctgatgtatgcc aaacgtgcctttgttcactggtacgttggggaggggatggaggaaggtgagttttcagag gcccgtgaggacatggctgcccttgagaaggattatgaggaggttggtgtggattctgtt gaaggagagggtgaggaagaaggagaggaatactaa >gi568815586r:49129724_49286834|GENSCAN_predicted_peptide_3|79_aa MASLSLTGHPTDEEPRPESGVTMPKRQSQVVAEPRWKPGLFSSLRLLTHAHPAKAEENLA VIQSFRIIVEACLNVTYRT >gi568815586r:49129724_49286834|GENSCAN_predicted_CDS_3|240_bp atggcatccctgtccctcactggtcatcctactgacgaggagcctagaccagaaagtgga gtgaccatgccaaagaggcagagccaggtggtggcggaaccacgctggaagccaggactc ttcagctccctgcgccttttaacacatgcacatccagcaaaagcagaggagaacctggct gtgattcaaagcttcagaattattgtagaagcttgcctgaacgtcacctatagaacctga >gi568815586r:49129724_49286834|GENSCAN_predicted_peptide_4|108_aa MTKQCVEFENENTVVSGASAIICLLRVTVIQWESLVVPPFSTYGCGPQEDDGLRFCSGAS PVAGNCNPQDDARAQLPSFYVAEFMLPCTEQTLSLTQPCPSPCPVIPE >gi568815586r:49129724_49286834|GENSCAN_predicted_CDS_4|327_bp atgaccaaacaatgtgtggaatttgaaaatgaaaatactgtggtgagtggagcctctgca atcatctgcttattgcgcgtcaccgtcatccagtgggagagccttgtggtaccacctttc tccacctatggctgcggcccgcaggaagatgacgggttgcgcttctgctctggagccagc cctgttgccgggaactgcaacccgcaagatgatgccagagctcagcttccctctttttat gttgcagagtttatgctgccctgcactgagcagacgctttcgcttacgcagccctgccct tcaccttgcccagtgattccggaataa >gi568815586r:49129724_49286834|GENSCAN_predicted_peptide_5|530_aa MVSGVPSGLGKSARPRGRRARKLLPAPRAAPRTAPDYPGPLRLTWLVAAGLEGRVHLADT SSGRKTWPGCGHQWKWKALLILRECISIHVGQAGVQIGNACWELYCLEHGIQPDGQMPSD KTIGGGDDSFNTFFSETGAGKHVPRAVFVDLEPTVIDEVRTGTYRQLFHPEQLITGKEDA ANNYARGHYTIGKEIIDLVLDRIRKLADQCTGLQGFLVFHSFGGGTGSGFTSLLMERLSV DYGKKSKLEFSIYPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLDI ERPTYTNLNRLISQIVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLATYAPVISAE KAYHEQLTVAEITNACFEPANQMVKCDPRHGKYMACCLLYRGDVVPKDVNAAIATIKTKR TIQFVDWCPTGFKVGINYQPPTVVPGGDLAKVQRAVCMLSNTTAVAEAWARLDHKFDLMY AKRAFVHWYVGEGMEEGEFSEAREDMAALEKDYEEVGADSADGEDEGEEY >gi568815586r:49129724_49286834|GENSCAN_predicted_CDS_5|1593_bp atggtgagtggggttccctcggggctggggaagagtgcgcgtcccaggggacggcgggcc cggaaactactgcctgcacctcgggccgcgcccaggacagctccagactaccccgggccc ctccggttaacctggcttgtggcggccgggctggaaggtcgagttcacttggcagacacc agttcgggccggaaaacctggcccgggtgcggccatcagtggaaatggaaagccctcttg atcctacgtgagtgcatctccatccacgttggccaggctggtgtccagattggcaatgcc tgctgggagctctactgcctggaacacggcatccagcccgatggccagatgccaagtgac aagaccattgggggaggagatgattccttcaacaccttcttcagtgaaacgggtgctggc aagcatgtgccccgggcagtgtttgtagacttggaacccacagtcattgatgaagttcgc actggcacttaccgccagctcttccaccctgagcaactcatcacaggcaaggaagatgct gccaataactatgcccgagggcactacaccattggcaaggagatcattgacctcgtgttg gaccgaattcgcaagctggctgaccagtgcaccggtcttcagggcttcttggttttccac agctttggtgggggaactggttctgggttcacctcgctgctcatggaacgtctctcagtt gattatggcaagaagtccaagctggagttctccatttacccggcgccccaggtttccaca gctgtagttgagccctacaactccatcctcaccacccacaccaccctggagcactctgat tgtgccttcatggtagacaatgaggccatctatgacatctgtcgtagaaacctcgatatc gagcgcccaacctacactaaccttaaccgccttattagccagattgtgtcctccatcact gcttccctgagatttgatggagccctgaatgttgacctgacagaattccagaccaacctg gtgccctacccccgcatccacttccctctggccacatatgcccctgtcatctctgctgag aaagcctaccacgaacagcttactgtagcagagatcaccaatgcttgctttgagccagcc aaccagatggtgaaatgtgaccctcgccatggtaaatacatggcttgctgcctgttatac cgtggtgacgtggttcccaaagatgtcaatgctgccattgccaccatcaaaaccaagcgt accatccagtttgtggattggtgccccactggcttcaaggttggcattaattaccagcct cccactgtggtgcctggcggagacctggccaaggtacagagagctgtgtgcatgctgagc aataccacagctgttgccgaggcctgggctcgcctggaccacaagtttgacctgatgtat gccaagcgtgcctttgttcactggtacgtgggtgaggggatggaggaaggcgagttttca gaggcccgtgaggacatggctgcccttgagaaggattatgaggaggttggagcagatagt gctgacggagaggatgagggtgaagagtattaa >gi568815586r:49129724_49286834|GENSCAN_predicted_peptide_6|86_aa MLPKLVSNSWAQAICLEWQGLEWLNSRGCDEGQVIGSEASRTREKEVFEEDSRPGIGLLE PSAWVISASFTLPPQLTGTDHSGPGA >gi568815586r:49129724_49286834|GENSCAN_predicted_CDS_6|261_bp atgttgcccaagctggtctcgaactcctgggctcaggcaatctgcctggaatggcagggg cttgaatggctgaactccagggggtgtgatgaaggacaagttattgggagtgaggccagc aggactagggagaaagaggtttttgaggaagatagtcgccctggcattgggctcctggag ccatctgcctgggtcatctctgccagcttcaccttgccgcctcagctgacgggcactgac cactctggccctggagcatag