GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:45:58 Sequence gi568815580r:47929263_48141107 : 211845 bp : 47.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 Intr - 1184 1099 86 0 2 114 103 33 0.755 6.94 1.06 Intr - 2278 2093 186 2 0 112 60 180 0.857 17.26 1.05 Intr - 16401 16338 64 0 1 115 53 19 0.022 -0.61 1.04 Intr - 26304 26196 109 2 1 78 77 143 0.230 12.49 1.03 Intr - 41880 41845 36 2 0 95 75 45 0.048 1.28 1.02 Intr - 47599 47543 57 0 0 71 98 41 0.070 1.40 1.01 Init - 50750 50719 32 2 2 76 88 39 0.103 1.92 1.00 Prom - 71894 71855 40 -3.16 2.03 PlyA - 72987 72982 6 1.05 2.02 Term - 79039 78867 173 2 2 59 47 125 0.146 3.49 2.01 Init - 90279 90222 58 0 1 75 98 81 0.759 7.29 2.00 Prom - 97641 97602 40 -6.86 3.15 PlyA - 98090 98085 6 1.05 3.14 Term - 100649 99998 652 1 1 93 44 1163 0.764 105.91 3.13 Intr - 102013 101938 76 2 1 92 85 41 0.230 2.77 3.12 Intr - 107388 107363 26 2 2 129 30 22 0.075 -1.83 3.11 Intr - 111861 110638 1224 2 0 125 85 2090 0.595 199.70 3.10 Intr - 125158 125055 104 1 2 141 28 29 0.021 1.17 3.09 Intr - 131661 131492 170 1 2 79 65 41 0.019 0.57 3.08 Intr - 145237 145076 162 2 0 59 22 131 0.164 3.55 3.07 Intr - 146484 146431 54 1 0 117 70 27 0.290 2.75 3.06 Intr - 153149 153123 27 0 0 94 80 38 0.121 1.69 3.05 Intr - 154656 154589 68 2 2 57 106 10 0.096 -1.75 3.04 Intr - 155296 155177 120 0 0 106 68 27 0.603 2.11 3.03 Intr - 155994 155856 139 1 1 125 102 13 0.869 5.92 3.02 Intr - 162679 162573 107 0 2 61 64 61 0.261 0.86 3.01 Init - 164988 164927 62 0 2 47 82 46 0.232 0.62 3.00 Prom - 166041 166002 40 -0.26 4.00 Prom + 174349 174388 40 -2.86 4.01 Init + 185393 185429 37 1 1 86 106 0 0.212 1.78 4.02 Term + 190060 190196 137 2 2 64 47 130 0.387 4.68 4.03 PlyA + 191354 191359 6 1.05 5.03 PlyA - 193134 193129 6 1.05 5.02 Term - 195761 195702 60 2 0 86 55 55 0.650 -0.20 5.01 Init - 198345 198175 171 0 0 70 110 134 0.973 13.26 5.00 Prom - 201397 201358 40 -3.36 6.04 PlyA - 201637 201632 6 1.05 6.03 Term - 206659 206576 84 1 0 71 36 87 0.471 -0.55 6.02 Intr - 207519 207365 155 1 2 31 83 58 0.261 -0.61 6.01 Init - 207871 207745 127 1 1 32 94 149 0.374 8.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 104790 104610 181 2 1 70 39 135 0.825 3.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:47929263_48141107|GENSCAN_predicted_peptide_1|190_aa MDAREGGLDLRIVPYHRLDVQYAVGHSDTRFAGMMSGKRCEGQCGSKVPAQDSGDVDTWV LVEQWNALEKDSLETKLEGFKGSPQTNPVLDHFRQPEARGASSALRGASPWDEHVSAAGL TTDGGRGGRPLQAPPTTLARDRAVTGNRLNARQGAGLSLSAPPSIPSEEEGTKGPGPPGS DGAGPGATLQ >gi568815580r:47929263_48141107|GENSCAN_predicted_CDS_1|570_bp atggatgccagagaaggtggcctcgacttgagaattgtcccatatcacagactggatgtc cagtatgctgtggggcacagtgatacaagatttgcaggaatgatgtctggcaaacgctgt gaaggacaatgtggcagcaaggtacctgcccaggacagtggagatgtggacacctgggtg ctggtagagcagtggaacgccctggagaaggacagcctggagacaaaacttgagggcttt aaaggatctccacaaaccaaccctgtattagatcactttcgtcagcctgaggcaagaggc gcgtcctccgcgctgcgcggggcctccccgtgggatgagcacgtgtccgctgccggcctc acgacggacggtggacgaggcggcaggcccctacaggccccacccacgacgctggcgagg gatcgggcggtcaccgggaatcgtcttaatgcgcggcaaggcgcgggcctctccctctcc gcccccccctccatcccatcggaagaggaaggaacaaaaggtcccggaccccccggatct gacggggcgggacctggcgccaccttgcag >gi568815580r:47929263_48141107|GENSCAN_predicted_peptide_2|76_aa MVKPLPVTLGAALNILLPEVRSAGNKARGGAGSVRRLGPGTGILGSTDARNKSSCASRHA VGDYSEERKLGGNEAL >gi568815580r:47929263_48141107|GENSCAN_predicted_CDS_2|231_bp atggtcaagcccctccctgtgacactgggagctgcactaaacatcctcttgcctgaagtg cgcagcgcagggaacaaagcccgcggcggggcggggtcggttcggcggctgggtccgggc accgggattctgggaagcaccgacgcccgaaacaaaagcagctgtgcttctaggcacgct gttggagattattccgaggagcgcaagctgggagggaatgaggcgctgtga >gi568815580r:47929263_48141107|GENSCAN_predicted_peptide_3|996_aa MRGTQPDDIIIQAAVTTLPIRTGGVGCEEKHWPKCEYTEHWTLPNGINGGHRVASEAYSQ LYLQSLLSGQSLRPLESKVCVFFAFLPTPTRPAVKPSRGPGSRHTVGPGSCWMPQKSADM PSGVRSEEQRSCRDLKSHLPPLLIGSSPNGSTEALPFFRDKIGGENKEQDFGNVGRMTLT CVVSFLPFYLGKVVLIHSYDSRLLSTYCVPGQRAMETEEVPASHVQGAVLYEGKSDISGM TAPQMEAPFTLRAMGVPDPPTLTLQHPLWSADPIPSTSHPRAISPALPRPITIAPIFHLH LHVLPPPTAAPPRPDPTWRSGVEKGRGVAVNNAERLPGALAENMANDIDELIGIPFPNHS SEVLCSLNEQRHDGLLCDVLLVVQEQEYRTHRSVLAACSKYFKKLFTAGTLASQPYVYEI DFVQPEALAAILEFAYTSTLTITAGNVKHILNAARMLEIQCIVNVCLEIMEPGGDGGEED DKEDDDDDEDDDDEEDEEEEEEEEEDDDDDTEDFADQENLPDPQDISCHQSPSKTDHLTE KAYSDTPRDFPDSFQAGSPGHLGVIRDFSIESLLRENLYPKANIPDRRPSLSPFAPDFFP HLWPGDFGAFAQLPEQPMDSGPLDLVIKNRKIKEEEKEELPPPPPPPFPNDFFKDMFPDL PGGPLGPIKAENDYGAYLNFLSATHLGGLFPPWPLVEERKLKPKASQQCPICHKVIMGAG KLPRHMRTHTGEKPYMCTICEVRFTSKDPDMLPSVPASIHGNVLFSPALPLENHFENESW QDKLKIHMRKHTGERPYLCIHCNAKFVHNYDLKNHMRIHTGVRPYQCEFCYKSFTRSDHL HRHIKRQSCRMARPRRGRKPAAWRAASLLFGPGGPAPDKAAFVMPPALGEVGGHLGGAAV CLPGPSPAKHFLAAPKGALSLQELERQFEETQMKLFGRAQLEAERNAGGLLAFALAENVA AARPYFPLPDPWAAGLAGLPGLAGLNHVASMSEANN >gi568815580r:47929263_48141107|GENSCAN_predicted_CDS_3|2991_bp atgagagggactcagccagatgatatcattatccaggcagctgtgaccacattacccatc aggactggaggagttgggtgtgaagagaagcactggcctaagtgtgaatacacagagcac tggactctgccaaatgggataaacggaggccaccgagtggcctctgaagcatactcccag ctctacctccaaagcctgctctcaggacagagcctaaggccccttgaaagcaaggtctgt gtcttcttcgcctttttacctacccccacccgccctgccgtaaagcccagcagaggacca ggctcaaggcatactgtgggtccaggttcatgctggatgccccagaaaagtgcagacatg cccagtggagtcaggtcagaggagcagaggagctgcagggacctcaagagccacctgccg ccactgctgattggaagttcaccaaatgggtccactgaagctcttcctttcttcagggac aagataggaggggaaaataaggaacaggacttcggtaatgtagggaggatgacgctgacc tgtgtcgtgtcatttctacctttctacctggggaaagtggtgctcatccattcctatgac agtcgtctgctgagcacctactgtgtaccaggccagagggccatggagacagaggaggtc cctgcttctcatgtgcaaggggctgtgttatatgaaggaaaaagcgatatctcaggaatg actgccccccagatggaggcccctttcactctacgggctatgggtgtacctgatcccccg acactcaccctgcagcacccgctgtggtctgctgaccccatccccagcacctcacacccc agagctatttctcctgctctccccaggcccatcacaatagcacctattttccacttacat ctccatgtgctgccaccccccacggcagctcctccccggccagatcccacatggaggtct ggggtagagaagggcaggggagtggccgtgaataatgcagagcggcttcctggggctctg gctgagaacatggccaatgacattgatgagctcattggcattcccttccccaaccacagc agtgaggtcctgtgcagcctcaatgagcaacggcacgatggcctgctgtgtgacgtgctc ctggtggtgcaggagcaggagtatcggacccaccgctccgtcctggctgcctgcagcaag tacttcaagaagcttttcacagccggcaccctagccagccagccctacgtctatgagatc gactttgtccagcctgaggctctggctgctatcctggagttcgcctacacctccacgctc accatcaccgctggcaatgtcaagcacatcctcaacgcagccaggatgctggagatccag tgcatcgtgaacgtgtgcctggagatcatggagcctgggggggacgggggggaggaggat gacaaggaggacgatgacgacgacgaagatgatgatgatgaggaggacgaagaggaggag gaggaagaggaggaggatgacgatgatgacacggaggactttgctgaccaagaaaacttg cctgacccccaggacatcagctgccaccaaagcccttccaagacagaccatctcacagag aaggcctattcagacacccccagggacttccctgactccttccaggctggcagtcctggc catctgggggtgatccgggacttctccatcgaatctctgctaagggagaacctgtacccc aaggccaacatccccgacaggagaccctccttgtctccattcgccccggacttctttcca cacctctggccaggggacttcggtgcctttgcccagctgcctgagcagcccatggacagt gggccactggatctggtcatcaagaatcggaagatcaaggaggaggagaaggaggagctg cccccacccccaccgccacccttccctaatgacttcttcaaggacatgttccctgacctg ccgggggggcctctgggacccatcaaggcggagaacgactacggtgcctatctcaacttc ctgagtgccacccacctgggaggcctcttcccaccctggcccctggtagaagagcgcaag ctgaagcccaaggcctctcagcagtgccccatctgccacaaagtcatcatgggggccggg aagctgccgcggcacatgaggacccataccggggagaagccatacatgtgcaccatctgc gaggtccgcttcaccagcaaggacccagacatgctgccctcagtgcctgccagtattcat gggaatgtccttttctctcctgctctgcccctggagaaccattttgaaaatgagagctgg caggacaagctgaaaatccacatgcggaagcacacaggggagcggccctacctgtgcatc cactgcaacgccaagttcgtgcacaactacgacctcaagaaccacatgcgcatccacacg ggcgtgcggccctaccagtgcgagttctgctacaagagcttcacgcgctctgaccacctg caccgccacatcaagcgccagagctgccgcatggcacggccccgacgcggccgcaagcct gctgcgtggagggccgccagcctgctcttcgggcccggcggcccggcccccgacaaggcg gccttcgtgatgccccctgcgctgggcgaggtgggcggccacctgggcggcgcagctgtg tgcctcccgggccccagccccgccaagcacttcctggcagcgcccaagggcgccctgagc ctgcaagagctggagcggcagttcgaggagacacagatgaagctgttcgggcgcgcgcag ctggaggctgagaggaacgcggggggcctcctggccttcgcgctggccgagaacgtggcg gcggcgcggccctacttcccgctgcccgacccttgggccgccggcctggccggcctccct gggctcgccggcctcaaccacgtggcctccatgtccgaagccaacaactag >gi568815580r:47929263_48141107|GENSCAN_predicted_peptide_4|57_aa MQTSPILLTPKMGEKEPEHQASAGATPAQAQAQCKRVFVPTASPMLMEDLRSTHSFP >gi568815580r:47929263_48141107|GENSCAN_predicted_CDS_4|174_bp atgcagacatctccaatattactgaccccaaaaatgggtgagaaggagccagaacatcaa gccagtgcaggcgcaaccccagcgcaggcgcaagcccagtgcaagcgagtctttgttcct actgccagccccatgctcatggaagacctgcgttccacacattccttcccctga >gi568815580r:47929263_48141107|GENSCAN_predicted_peptide_5|76_aa MLAHKAFLTTSGGHRRRCLISKDKAAPTVCGPHADPCRTPMQVFGPFLLKSCHPFSQWPL HSACDRGVSPATADES >gi568815580r:47929263_48141107|GENSCAN_predicted_CDS_5|231_bp atgctggcccacaaggccttcctcacaaccagtgggggccatcgaaggcgatgcctcatc agcaaagacaaagcagcacctactgtgtgcggaccccatgctgacccctgtaggacaccc atgcaggtctttggccccttccttctaaagagttgtcatccatttagccagtggcccctg cattctgcctgtgaccgtggcgtttctcctgccacggctgatgaaagctga >gi568815580r:47929263_48141107|GENSCAN_predicted_peptide_6|121_aa MTTSRRLRQAPASSAAAWIPSPGPWRRSGHGPGTSAASPGRPASPELLRGVGPRTSSEIG ERIWNSAESNSRHCPIAAGPRQRAPLGRGASRRNLVPSFAQKEAPLQRLIGRLVADHFQF F >gi568815580r:47929263_48141107|GENSCAN_predicted_CDS_6|366_bp atgacaacctcccgccggctgcggcaggctcccgcgagcagcgcggccgcgtggattccc agcccgggaccctggcggcgctcaggacacggcccgggcacgagcgcggcgtccccgggg cggccagcctcgccggagctcctgcggggcgtggggccacgcacctcttccgaaatagga gagaggatctggaactctgcagagagcaactctcgccactgccccattgccgccggccca cgccagcgggctcctttggggcgaggggccagccgaaggaacctggtacccagcttcgcg cagaaggaagccccactccagcgacttattggtcggctagtggcagaccacttccagttc ttctaa