GENSCAN 1.0    Date run: 4-Nov-116    Time: 17:31:10

Sequence gi568815579f:33279240_33479689 : 200450 bp : 50.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -    869    690  180  0  0   62   61   147 0.306   9.24
 1.05 Intr -   3080   2938  143  1  2   97   36    49 0.185   0.60
 1.04 Intr -   5613   5591   23  0  2   77   75    48 0.143  -1.26
 1.03 Intr -  13989  13804  186  0  0   30  113   130 0.271   9.59
 1.02 Intr -  18442  18349   94  0  1   18   81    93 0.104   1.57
 1.01 Init -  23175  22103 1073  0  2   78   49  2271 0.131 216.04
 1.00 Prom -  51275  51236   40                               -4.26

 2.06 PlyA -  52103  52098    6                                1.05
 2.05 Term -  62267  62125  143  0  2   63   44   138 0.118   4.99
 2.04 Intr -  71560  71500   61  1  1   87   42    71 0.006   0.71
 2.03 Intr -  81856  81306  551  2  2   25   85   572 0.099  43.19
 2.02 Intr -  82057  81979   79  1  1   42   57   105 0.840   1.92
 2.01 Init -  82207  82067  141  1  0  114   -5   215 0.843  15.01
 2.00 Prom -  87299  87260   40                               -1.96

 3.00 Prom +  91377  91416   40                               -1.46
 3.01 Init +  94456  94538   83  0  2  102  -29   174 0.242   5.54
 3.02 Intr +  98992  99019   28  1  1   92   78    17 0.095  -0.78
 3.03 Term +  99905 100453  549  1  0   71   41   388 0.215  26.70
 3.04 PlyA + 100734 100739    6                                1.05

 4.18 PlyA - 101202 101197    6                                1.05
 4.17 Term - 108242 108105  138  2  0   68   43   275 0.808  18.96
 4.16 Intr - 108842 108651  192  2  0  133   84   389 0.986  42.69
 4.15 Intr - 110074 109974  101  2  2   69   76     5 0.291  -2.77
 4.14 Intr - 111516 111432   85  0  1   86  110    25 0.633   3.89
 4.13 Intr - 112240 112056  185  2  2  108   78   308 0.843  31.31
 4.12 Intr - 122630 122482  149  1  2  118   99   276 0.951  31.58
 4.11 Intr - 126726 126524  203  0  2   33   60   140 0.301   3.78
 4.10 Intr - 129152 129029  124  1  1   95   59    97 0.802   8.09
 4.09 Intr - 131952 131827  126  2  0   40   33   108 0.359   0.29
 4.08 Intr - 132510 132433   78  2  0   96  103   107 0.874  11.67
 4.07 Intr - 134404 134336   69  0  0  113  115   151 0.965  18.60
 4.06 Intr - 136549 136492   58  2  1   61   67    80 0.460   1.14
 4.05 Intr - 136963 136776  188  0  2   79   32    93 0.458   2.13
 4.04 Intr - 139834 139706  129  0  0   55   16   107 0.147   0.01
 4.03 Intr - 144160 144000  161  1  2   68   59    63 0.002   0.19
 4.02 Intr - 148358 148243  116  0  2    7   84    93 0.010   0.97
 4.01 Init - 157146 157083   64  0  1   59  105    92 0.575   7.31
 4.00 Prom - 160605 160566   40                               -4.26

 5.11 PlyA - 161468 161463    6                                1.05
 5.10 Term - 162201 162106   96  0  0   73   44   125 0.862   4.57
 5.09 Intr - 168045 167943  103  2  1   95   97    64 0.800   8.18
 5.08 Intr - 168860 168770   91  0  1   77   76    54 0.864   2.15
 5.07 Intr - 168950 168881   70  2  1   52   78   110 0.934   5.15
 5.06 Intr - 182201 182093  109  1  1   84   95    22 0.038   2.79
 5.05 Intr - 183802 183756   47  1  2   87  121    56 0.976   6.01
 5.04 Intr - 184823 184748   76  1  1   47  107   103 0.178   7.62
 5.03 Intr - 189491 189228  264  1  0   64   80   118 0.048   5.23
 5.02 Intr - 198032 197841  192  1  0   55   43   135 0.196   4.31
 5.01 Intr - 199397 199355   43  0  1  114   89    34 0.762   3.40

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  23175  22099 1077  0  0   78   54  2285 0.851 221.21

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:33279240_33479689|GENSCAN_predicted_peptide_1|567_aa
MESADFYEAEPRPPMSSHLQSPPHAPSSAAFGFPRGAGPAQPPAPPAAPEPLGGICEHET
SIDISAYIDPAAFNDEFLADLFQHSRQQEKAKAAVGPTGGGGGGDFDYPGAPAGPGGAVM
PGGAHGPPPGYGCAAAGYLDGRLEPLYERVGAPALRPLVIKQEPREEDEAKQLALAGLFP
YQPPPPPPPSHPHPHPPPAHLAAPHLQFQIAHCGQTTMHLQPGHPTPPPTPVPSPHPAPA
LGAAGLPGPGSALKGLGAAHPDLRASGGSGAGKAKKSVDKNSNEYRVRRERNNIAVRKSR
DKAKQRNVETQQKVLELTSDNDRLRKRVEQLSRELDTLRGIFRQLPESSLVKAMGNCAAV
TDITVISWISWESVNVGFGPDTCDEPLGLPRRPQDKGFIPEQSEGTVVCHAEIYASFSIT
SATCLEPGPLGMELKATEPWLTVGLQSRTGHLSALDVSRPPSPNFPNLGCQSHGRTQSKW
VWEDSGVTGQHSPQGPAGGCLTAAETDISGSAQRGRQQVDTATVNRTPRQPTPLEPSLWL
PHLPVLQLPETLPQVVCCSNSGSTGLX
>gi568815579f:33279240_33479689|GENSCAN_predicted_CDS_1|1701_bp
atggagtcggccgacttctacgaggcggagccgcggcccccgatgagcagccacctgcag
agccccccgcacgcgcccagcagcgccgccttcggctttccccggggcgcgggccccgcg
cagcctcccgccccacctgccgccccggagccgctgggcggcatctgcgagcacgagacg
tccatcgacatcagcgcctacatcgacccggccgccttcaacgacgagttcctggccgac
ctgttccagcacagccggcagcaggagaaggccaaggcggccgtgggccccacgggcggc
ggcggcggcggcgactttgactacccgggcgcgcccgcgggccccggcggcgccgtcatg
cccgggggagcgcacgggcccccgcccggctacggctgcgcggccgccggctacctggac
ggcaggctggagcccctgtacgagcgcgtcggggcgccggcgctgcggccgctggtgatc
aagcaggagccccgcgaggaggatgaagccaagcagctggcgctggccggcctcttccct
taccagccgccgccgccgccgccgccctcgcacccgcacccgcacccgccgcccgcgcac
ctggccgccccgcacctgcagttccagatcgcgcactgcggccagaccaccatgcacctg
cagcccggtcaccccacgccgccgcccacgcccgtgcccagcccgcaccccgcgcccgcg
ctcggtgccgccggcctgccgggccctggcagcgcgctcaaggggctgggcgccgcgcac
cccgacctccgcgcgagtggcggcagcggcgcgggcaaggccaagaagtcggtggacaag
aacagcaacgagtaccgggtgcggcgcgagcgcaacaacatcgcggtgcgcaagagccgc
gacaaggccaagcagcgcaacgtggagacgcagcagaaggtgctggagctgaccagtgac
aatgaccgcctgcgcaagcgggtggaacagctgagccgcgaactggacacgctgcggggc
atcttccgccagctgccagagagctccttggtcaaggccatgggcaactgcgctgctgtc
acggatattactgttatctcctggatctcctgggaatctgtgaacgtggggtttggccct
gacacctgcgatgagccactgggcctgcccaggaggccacaggacaaaggatttatcccg
gaacaatctgaaggcactgtggtctgtcatgctgaaatatatgcctcattcagcatcaca
agtgctacttgcctggagccaggaccactgggcatggagctgaaggctacagaaccctgg
ctcacagtgggccttcagtctcgtaccggacacctcagtgccctggacgtgtcaaggccc
ccctcacctaatttcccaaaccttggctgccagagccacgggaggacccagtccaagtgg
gtgtgggaggactctggagtcaccgggcagcacagcccgcagggcccagctggcggatgc
ctgacggccgctgagacggatatcagtggaagtgcccagagaggaaggcagcaggtggac
acagccacagtgaaccgcacaccgaggcagccgacaccgctggagccctctttgtggctg
ccccacctgcccgttctccagcttccggagacgttgccccaggttgtctgctgtagcaac
agcggcagcactgggctcann
>gi568815579f:33279240_33479689|GENSCAN_predicted_peptide_2|324_aa
MASHLVLNNGTKMPILGLGTWKSPPGRVTEAVNVAINVGYCHIDCAHHENEVGVAIQEKL
REQVVKREELFIVTWEGIFPLNELGNVVPSDTNILDTWAAMEELVDEWLMKAIGISNFNH
LQVERLLNKPGLKYKPVVNQIECHPYLTQEKLIQYCQSKGIVVTAYSPVGTPSRPWVNPE
DPSLLRDPRIKAIAAKHNKTTAQVLIRFPMQRNLGVIPKSVTPELIADNFKVFDFELNSQ
DMTALLSYNRNWRVCALKPLESHRNVITEPPTSIGIQGKIHAMEVRQAVFALNIFSNQLQ
FSKCNFIILQINRARFKHMTLKAN
>gi568815579f:33279240_33479689|GENSCAN_predicted_CDS_2|975_bp
atggccagccaccttgtgctcaacaatggcaccaagatgcccatcctggggctgggcacc
tggaagtcccctccaggccgggtgactgaggccgtgaatgtggccatcaatgtcgggtac
tgccacattgactgtgcccaccatgagaatgaggtgggggtggccattcaggagaagctc
agggagcaggtggtaaagcgtgaggagctcttcatcgttacctgggagggaatttttcca
ttgaatgagttgggtaatgtggttcccagtgacaccaacattctggacacatgggcagcc
atggaagagctggtggatgaatggctgatgaaagctattggcatctccaacttcaaccac
ctccaagtggagaggctcttaaacaaacctggcttaaagtataagcctgtggttaaccag
attgagtgccacccttacctcactcaggagaagttaatccagtactgccagtccaaaggc
atcgtggtgaccgcctatagccccgttggcactcccagcaggccctgggtcaaccccgag
gacccttccctcctgagggatcccaggatcaaggcgattgcagccaagcacaataaaact
acagcccaggtcctgatccggttccccatgcagaggaatttgggtgtgatccccaagtct
gtgacaccagaactcattgctgacaactttaaggtttttgactttgaactgaacagccag
gatatgaccgccttactcagctacaacaggaattggagagtctgtgccttgaaaccactt
gagtcacacaggaacgtcatcactgagccgcccaccagtatcggcatccaaggtaagatc
catgccatggaagttaggcaggcagtttttgccctcaacatcttcagtaatcagcttcaa
ttttctaagtgcaacttcatcattctgcagatcaacagggctcgcttcaaacacatgacg
cttaaggccaactga
>gi568815579f:33279240_33479689|GENSCAN_predicted_peptide_3|219_aa
MAAAARAAPASRGSRWPRLRNGRRLPVSQQHRDASELVHVKIFWQLSVETIDHPALISTC
SVLARESAQMSKISQQNSTPGVNGISVIHTQAHASGLQQVPQLVPAGPGGGGKAVAPSKQ
SKKSSPMDRNSDEYRQRRERNNMAVKKSRLKSKQKAQDTLQRVNQLKEENERLEAKIKLL
TKELSVLKDLFLEHAHNLADNVQSISTENTTADGDNAGQ
>gi568815579f:33279240_33479689|GENSCAN_predicted_CDS_3|660_bp
atggcggcggcggcgcgcgcggccccggcgagcaggggaagccggtggccgcggctgcgg
aacgggcggaggctgccggtttcgcagcagcacagggatgcttcagagctggtacatgtg
aagattttttggcagcttagcgtggaaaccattgatcaccctgctctcatttctacctgt
tctgtgttggcaagggagagtgcccaaatgagcaagatatcgcagcaaaacagcactcca
ggggtgaacggaattagtgttatccatacccaggcacatgccagcggcttacagcaggtt
cctcagctggtgcctgctggccctgggggaggaggcaaagctgtggctcccagcaagcag
agcaaaaagagttcgcccatggatcgaaacagtgacgagtatcggcaacgccgagagagg
aacaacatggctgtgaaaaagagccggttgaaaagcaagcagaaagcacaagacacactg
cagagagtcaatcagctcaaagaagagaatgaacggttggaagcaaaaatcaaattgctg
accaaggaattaagtgtactcaaagatttgtttcttgagcatgcacacaaccttgcagac
aacgtacagtccattagcactgaaaatacgacagcagatggcgacaatgcaggacagtag
>gi568815579f:33279240_33479689|GENSCAN_predicted_peptide_4|721_aa
MLRTAPACLLGRAWGLLGRLPGLSYPKVEFARVIPGILRLEDFLELALCLWDVSSKGLGH
EFSPRTVKRHKAMRGHHTERQKKPPSLTAGSWLSKLEAVVLLIQPFTYSVTQQRTGVGAA
GRGSEELRAAEQAAEAEHRICCRRPVSDAAMLVPATRTTRQALAQLCGGTRCVRIKHGPQ
FGMSTPQCSPLEDQNCAAPGSHASTGPLCRHSCPELGCPLCGVVNVAGKGDQCRAALCSL
FEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCKSAVLTRGCVQG
FPEHDVPCSQFFIEATTRRCGQALRPPPLGGHLAQRGPSSGKSSRPGVLTELGRPGPDAH
FQQLVLQHRLETPRRPTGRSCSRHPPRAPGIAGSGDQKLLLSWAGGMGSFPVCRGPGGVA
VTRKGGRDAQERTGERSPCLFDMGGEYYCFASDITCSFPANGKFTADQKAVYEAVLRSSR
AVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGI
DVHDVGGYPEAFWGPLADKGGWGPSPDKVPRPSEDHPEAPGQRPRGKCCGAGATPARLWC
LLIASRLACSLQGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDHLLDEALADPARA
SFLNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGP
K
>gi568815579f:33279240_33479689|GENSCAN_predicted_CDS_4|2166_bp
atgctgaggacggctcccgcctgcctcttaggcagggcatggggcctgctgggaaggctg
cctggtttaagctaccccaaagttgaatttgcaagagttattcctggcatcctccgattg
gaggattttttggaacttgccctgtgtctctgggatgtcagcagcaaaggtttaggacat
gaatttagcccgaggacagtgaagaggcacaaggccatgcgtggccatcatacagagagg
cagaagaagcctccatctctaacagcaggaagttggttaagtaaattagaggctgtggtg
ctgctcatacagccattcacttattcagtcactcaacagaggactggtgtaggagcggca
ggaagagggagtgaggagctccgggcagcagagcaagctgctgaagccgagcaccgcatc
tgctgcagacggcctgtgtctgacgctgccatgttggtgccggctacacgcaccactcgc
caagctctggcccagctgtgcgggggcacacgttgcgtccgcatcaagcatgggccccag
tttggaatgtccacccctcagtgctcaccgcttgaagatcagaactgtgccgcgccgggc
tcacacgcttccacaggccccctgtgccgccacagctgcccagaacttgggtgccctttg
tgtggagttgtcaacgtggccggcaaaggtgaccagtgtcgggctgcactttgcagcctc
ttcgagcactactgctactcccggggcggcatgcgccacagctcctacacctgcatctgc
ggcagtggtgagaactcagccgtgctacactacggacacgccggagctcccaacgaccga
acgatccagaatggggatatgtgcaagtcagcagtgctgacccgtggctgcgtgcagggc
tttcctgagcacgatgtgccgtgctcacagttcttcatagaagccacgacaaggagatgt
gggcaggccctgcggcctccgcccctcggtggccacctggcacagcgtgggccctcttcg
ggcaagagcagcaggcccggtgtcctcaccgagctgggcaggccaggccctgatgcccac
ttccagcagctggttctgcagcacaggctggagacacccaggaggcccacaggacgcagc
tgctcccgtcacccaccgcgtgcaccagggattgctggctctggggaccagaagctgctg
ctgtcatgggctggcggcatgggcagctttcctgtctgccgaggccctggaggagtggcc
gtcacacgaaaagggggcagagatgcccaggaacgcactggagagaggtccccttgcctg
ttcgacatgggcggtgagtattactgcttcgcttccgacatcacctgctcctttcccgcc
aacggcaagttcactgcagaccagaaggccgtctatgaggcagtgctgcggagctcccgt
gccgtcatgggtgccatgaagccaggtgtctggtggcctgacatgcaccgcctggctgac
cgcatccacctggaggagctggcccacatgggcatcctgagcggcagcgtggacgccatg
gtccaggctcacctgggggccgtgtttatgcctcacgggcttggccacttcctgggcatt
gacgtgcacgacgtgggaggctacccagaggcattctggggcccgttggcagataaggga
ggctgggggccatctccagacaaggttccccggccctcagaggaccacccagaggctccc
ggtcagcggcccagaggcaagtgttgtggggccggggccacaccagccaggctgtggtgt
ctcctcatcgcctcccgcctggcctgctctctccagggcgtggagcgcatcgacgagccc
ggcctgcggagcctgcgcactgcacggcacctgcagccaggcatggtgctcaccgtggag
ccgggcatctacttcatcgaccacctcctggatgaggccctggcggacccggcccgcgcc
tccttccttaaccgcgaggtcctgcagcgctttcgcggttttggcggggtccgcatcgag
gaggacgtcgtggtgactgacagcggcatagagctgctgacctgcgtgccccgcactgtg
gaagagattgaagcatgcatggcaggctgtgacaaggcctttacccccttctctggcccc
aagtag
>gi568815579f:33279240_33479689|GENSCAN_predicted_peptide_5|363_aa
XFDEKSANNLIEDPLAWDQGGILKDEWASAVEKEGVGESDPGKRTTRSQKQEKGWWLCVA
SAEPGSRAREQLERQAVAGRFCGLGKVSLRCRVPASFPSDKWLAVPAVQRHGDRVSQRLA
GMGCSLEVPKGLLPLMGVGEPPGPSHSRHPRLPRGLCLGCLDGGQLSRVFKTDMELEVLR
YTNKISSEAHREVMKAVKVGMKEYELESVPEPDPAQPGDRAAWAMGQGPGDLEYPPFLSP
HAFQKILIQTYHLLVRKTLPAGVIIPGPLSSVDDMNGFIQEGLSPYLPAGMSALEAWSHP
ALVQQCSPGATLHECATSLVDAPVTALGVPSLAAVFPGDVVTNPSEPDSPVVIEESGLHY
WAG
>gi568815579f:33279240_33479689|GENSCAN_predicted_CDS_5|1092_bp
ngttttgatgagaaatcagctaataatctaattgaggatcccttagcttgggaccaagga
ggtatcctgaaagatgaatgggcatcggctgtggaaaaagaaggagtaggagagagtgac
cctggcaaaaggaccactcgttcacagaaacaagagaaagggtggtggctctgtgtggcc
agtgcggagccagggagcagagcccgagaacagttggagagacaggcagtggctggaagg
ttctgtggtctcgggaaggtgtcgctgcgctgcagggtgcccgcctcgtttcccagtgac
aaatggctggctgtcccagctgtgcagcgtcacggagaccgtgtgagtcagcggctggct
ggcatgggctgctcccttgaagttcccaagggcttgctgcctctgatgggagtgggggag
ccaccaggcccctcgcattcccgccatcctcgcctgcctcggggcctctgtctgggctgc
ctcgatggaggccagctcagccgagtgtttaagacggatatggagctggaggttctgcgc
tataccaataaaatctccagcgaggcccaccgtgaggtaatgaaggctgtaaaagtggga
atgaaagaatatgagttggaaagcgtcccagagcccgaccctgcccaacctggggacagg
gctgcatgggctatgggtcaggggccaggggacctggaatacccgccgttccttagccct
catgccttccagaaaatcctcatccagacctatcacctgctggtgcgaaagacgctccct
gcgggagtaattattccgggccctctgagttctgtggatgacatgaatggtttcatccag
gaagggctgtctccatacctgccggcaggcatgtctgctcttgaagcctggagtcatccg
gcactggtgcagcagtgttcccctggggccaccctccatgagtgtgccacaagccttgtg
gatgcacctgtgacagcacttggggtcccctcattggcggccgtatttcctggggatgtg
gtcaccaaccccagtgagcctgatagcccagtggttatcgaggaatcaggtctgcactac
tgggctggctga