GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:35:30 Sequence gi568815585f:72628941_72855213 : 226273 bp : 37.47% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5024 5095 72 1 0 41 106 83 0.694 6.52 1.02 Intr + 26400 26435 36 2 0 127 71 15 0.249 1.24 1.03 Intr + 51334 51424 91 0 1 44 100 67 0.161 2.15 1.04 Intr + 54020 54159 140 0 2 54 64 81 0.470 1.56 1.05 Term + 61614 61715 102 2 0 126 43 134 0.986 10.00 1.06 PlyA + 62299 62304 6 -0.45 2.00 Prom + 63150 63189 40 -4.95 2.01 Sngl + 63941 64303 363 1 0 55 41 217 0.614 9.53 2.02 PlyA + 66993 66998 6 1.05 3.00 Prom + 73357 73396 40 -4.05 3.01 Sngl + 74817 75041 225 2 0 97 47 125 0.607 4.19 3.02 PlyA + 75212 75217 6 1.05 4.05 PlyA - 76570 76565 6 1.05 4.04 Term - 86157 85992 166 2 1 79 50 99 0.426 1.61 4.03 Intr - 86739 86612 128 0 2 53 62 103 0.639 2.76 4.02 Intr - 90157 90012 146 2 2 100 53 124 0.636 9.18 4.01 Init - 98662 98584 79 1 1 92 113 179 0.994 20.07 4.00 Prom - 99757 99718 40 -10.45 5.00 Prom + 99820 99859 40 -10.94 5.01 Init + 100001 100153 153 1 0 34 100 80 0.040 3.83 5.02 Intr + 102341 102447 107 1 2 57 123 81 0.924 6.59 5.03 Intr + 106020 106065 46 0 1 71 78 57 0.612 0.49 5.04 Intr + 109022 109103 82 1 1 60 82 52 0.100 0.09 5.05 Intr + 116041 116267 227 2 2 46 89 97 0.372 2.38 5.06 Intr + 117004 117136 133 0 1 64 91 42 0.836 1.40 5.07 Intr + 117561 118171 611 1 2 109 110 519 0.958 47.41 5.08 Intr + 124750 124881 132 0 0 97 102 61 0.474 8.32 5.09 Term + 126103 126129 27 0 0 74 39 22 0.198 -7.00 5.10 PlyA + 126361 126366 6 -0.45 6.19 PlyA - 126482 126477 6 1.05 6.18 Term - 127995 127633 363 0 0 61 55 176 0.520 5.18 6.17 Intr - 131711 131589 123 2 0 9 110 72 0.576 1.36 6.16 Intr - 132581 132423 159 2 0 62 80 84 0.938 4.26 6.15 Intr - 132874 132706 169 0 1 81 68 153 0.991 11.63 6.14 Intr - 133197 132983 215 0 2 112 63 107 0.999 7.19 6.13 Intr - 134667 134511 157 2 1 85 86 109 0.999 9.49 6.12 Intr - 137118 137032 87 2 0 84 58 93 0.941 4.07 6.11 Intr - 139972 139845 128 1 2 81 38 162 0.997 9.06 6.10 Intr - 142956 142855 102 0 0 76 94 75 0.987 6.35 6.09 Intr - 143335 143219 117 1 0 91 51 104 0.991 6.74 6.08 Intr - 143899 143753 147 1 0 66 115 76 0.992 7.61 6.07 Intr - 144881 144744 138 2 0 30 75 128 0.977 5.44 6.06 Intr - 145119 145006 114 0 0 62 105 21 0.650 0.92 6.05 Intr - 146435 146271 165 2 0 79 60 237 0.931 19.24 6.04 Intr - 148553 148480 74 0 2 63 63 61 0.885 -0.69 6.03 Intr - 149440 149247 194 0 2 81 89 170 0.999 14.61 6.02 Intr - 152063 151906 158 2 2 16 63 166 0.965 4.79 6.01 Init - 152844 152665 180 0 0 84 96 173 0.925 15.00 6.00 Prom - 153537 153498 40 -10.55 7.00 Prom + 154383 154422 40 -9.25 7.01 Init + 154530 154781 252 2 0 49 116 202 0.995 16.49 7.02 Intr + 163507 163607 101 0 2 52 110 97 0.993 6.29 7.03 Intr + 166419 166617 199 0 1 46 84 218 0.999 15.63 7.04 Intr + 168967 169086 120 0 0 51 68 123 0.649 6.37 7.05 Intr + 192909 193042 134 2 2 70 107 169 0.987 15.52 7.06 Intr + 198070 198178 109 1 1 63 99 47 0.978 2.67 7.07 Intr + 198793 198950 158 0 2 55 5 193 0.020 5.59 7.08 Intr + 206303 206428 126 2 0 80 98 174 0.052 16.47 7.09 Intr + 225117 225215 99 0 0 38 115 180 0.206 14.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 98903 99067 165 1 0 51 94 123 0.891 8.90 S.002 Intr + 99986 100153 168 1 0 72 100 72 0.910 6.02 S.003 Init - 154061 154043 19 2 1 78 102 -13 0.918 -0.67 S.004 Term + 198793 199005 213 0 0 55 37 200 0.921 7.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:72628941_72855213|GENSCAN_predicted_peptide_1|146_aa MTAPNKELTIEQCNSGAKVEEKNWIKIYDGYSQKQKVLKIQLGKEMTVTKYFTDCLRKVI RDDGGEGETPLLVSFRRSAFKLIPQGYLSSPRGPSSPRISLLHVDSSEHSESQALNRVDD DTRIEEDNLPYPVRQFKCQPYLETLS >gi568815585f:72628941_72855213|GENSCAN_predicted_CDS_1|441_bp atgactgcccctaataaagaacttacaattgagcagtgcaactctggggcaaaggtagaa gaaaagaattggataaaaatatatgatggttatagtcagaaacaaaaggttttaaaaatt caacttggcaaggaaatgacagttactaaatatttcactgactgtctcaggaaagttatt agagatgatggaggagaaggggagaccccactgttggtcagcttcagaagatctgctttc aagctcattccccagggctacctcagttctccacgtggcccctcatctcccaggatctct cttctccatgtggactcttcagagcatagtgagagccaggccctcaatagagtggatgat gacacccgcattgaggaggacaatctaccttacccagtccgccaattcaaatgtcaacct tatttggaaacactttcatag >gi568815585f:72628941_72855213|GENSCAN_predicted_peptide_2|120_aa MADKEKAFWGEEFKLAVKQVLATEICITKREPTANSQENGENVWKAFWRFKWQPLPSQAL RSRRKEWFCGPDPGPHCPAQPWEIAPCILTPWLDLWLKVPGYSSGCCFRGCELFALTASM >gi568815585f:72628941_72855213|GENSCAN_predicted_CDS_2|363_bp atggcagataaagaaaaagctttctggggagaggaattcaagctggctgtgaagcaagta cttgctacagaaatctgcataactaaaagagagccaactgctaatagccaagagaatggg gaaaatgtctggaaggcattttggagatttaagtggcagcccctcccatcacaggccctg aggtctaggaggaaagaatggttttgtgggccagacccagggcctcactgccctgcgcag ccttgggaaattgctccctgcatcctgaccccttggctcgatttgtggctcaaagttccc ggatacagctcaggctgctgcttcagagggtgtgagctcttcgccttgacggcttccatg tga >gi568815585f:72628941_72855213|GENSCAN_predicted_peptide_3|74_aa MELSLWPRICEIYKKGDIVDIKGMVTLQVLPWRNWKSLQRYPAGCWQCCQQVKGKILAKR TKNTLSTLRVETAS >gi568815585f:72628941_72855213|GENSCAN_predicted_CDS_3|225_bp atggagttgtccctttggccacgtatatgcgaaatctacaagaaaggtgatattgtagac atcaagggaatggttactcttcaggtgttaccatggcgaaactggaagagtctacaacgt tatccagcaggctgctggcagtgttgtcaacaagttaagggcaagattcttgccaagaga actaagaacacattaagtactctaagagttgagacagcttcttga >gi568815585f:72628941_72855213|GENSCAN_predicted_peptide_4|172_aa MASSSGAGAAAAAAAANLNAVRETMDVLLEISRILNTGLDMETLSICVRLCEQGINPEAL SSVIKELRKATEALKRVQHQEDGAKPFMRDEFSQLPPGPTSNIGDYNGDYNIGNYISRVK PPWLLSWAGIVECLQLFKAQGTSCQWIYHSGVGERWPSSHSSIRQFPSEDIV >gi568815585f:72628941_72855213|GENSCAN_predicted_CDS_4|519_bp atggcgagtagcagcggtgctggggcggcggcggcggccgcggcggcgaatctgaatgcg gtgcgggagaccatggacgttctgcttgagatttcaagaattttgaatactggcttagat atggaaactctgtctatttgtgtacggctttgtgaacaaggaattaacccagaagcttta tcatcggttattaaggagcttcgcaaggctactgaagcactgaagcgagtacagcaccaa gaggatggtgctaaaccattcatgagggatgaattcagtcaactcccaccaggccccacc tccaacattggagattacaatggagattacaacattggaaattacatttcaagggttaag cccccatggctgctttcgtgggctggcattgttgagtgtctgcagcttttcaaggctcag ggtacaagctgtcagtggatctaccattctggggttggagaacggtggccctcttctcac agctccattcggcagttccccagtgaggacattgtgtag >gi568815585f:72628941_72855213|GENSCAN_predicted_peptide_5|505_aa MGDVKESKMQITPETPGRIPVLNPFESPSDYSNLHEQTLASPSVFKSTKLPTPGKFRWSI DQLAVINPVEIDPEDIHRQALYLSHSRIDKDVEDKRQKAIEEFFTKDVIVPSPWTDHEGK QLSQCHSSKCDYFRADEFADQSPGNLSSSSLRRKLFLDGNGSISDSLPSASPGSPHSGVQ TSLEMFYSIDLSPVKCRSPLQTPSSGQFSSSPIQASAKKYSLGSITSPSPISSPTFSPIE FQIGETPLSEQRKFTVHSPDASSGTNSNGITNPCIRSPYIDGCSPIKNWSPMRLQMYSGG TQYRTSVIQIPFTLETQGEDEEDKENIPSTDVSSPAMDAAGIHLRQFSNEASTHGTHLVV TAMSVTQNQSSASEKELALLQDVEREKDNNTVDMVDPIEIADETTWIKEPVDNGSLPMTD FVSGIAFSIENSHMCMSPLAESSVIPCESSNIQMDSGYNTQNCGSNIMDTVGAESYCKES DAQTCEVESKSQAFNMKSSAKETCF >gi568815585f:72628941_72855213|GENSCAN_predicted_CDS_5|1518_bp atgggagatgtcaaggaatcaaagatgcaaataacaccagaaactccaggaaggatccct gttttaaatccttttgaaagtcctagtgattattctaatctccatgaacaaactctcgcc agtccttctgtttttaaatcaacaaaattaccaactccagggaaatttagatggtctatt gatcaactagctgtaataaatcctgtagaaatagacccagaagatattcatcgtcaagct ttatacttaagtcattctcgaatagataaagatgtggaagacaaaagacaaaaagccatt gaagagtttttcactaaagatgtcatcgtaccctctccttggactgatcatgaagggaaa cagctttcacaatgtcattccagtaaatgtgactattttagagctgatgaatttgcagat caatctcctggaaacctcagttcttcatccctcagaagaaagctgtttttagatgggaac ggaagcatctccgactccttaccttcggcttctcccggaagtcctcacagtggtgttcaa acatcactagagatgttttattcaatagatttgtctcctgtaaagtgtaggagccccttg cagacaccaagttcggggcagttttcttctagccctattcaggctagtgcaaaaaaatac agcttgggaagcataactagtccttcgcctatttcttcacccactttctcaccaattgaa tttcagataggagagactccactctcagaacaaaggaagtttactgttcattctcctgat gcttcatctggaacaaattctaatgggataactaatccgtgtatcagaagtccttatata gatggctgctcgccaattaaaaattggtctcctatgagacttcagatgtatagtggtggt actcagtatcggacctcagtgattcagataccttttactcttgagactcaaggtgaagat gaggaagataaagagaatattccttccacagatgtctcatcacccgccatggatgctgct ggaatacacctacggcagtttagtaatgaggcttctacccatggtacacatttggttgtg actgccatgtctgttacacaaaatcagtccagtgcttctgagaaagaattagcactgttg caggatgttgaaagggagaaagacaataacactgtggatatggttgatcctatagagata gcagatgagaccacttggattaaggagccggttgataatggcagtttacccatgactgat tttgtaagtggcattgccttcagtattgaaaactctcatatgtgcatgtcacctcttgct gaaagcagtgtcattccttgtgaaagcagtaacattcagatggatagtggctataatacg cagaattgtggaagcaatattatggatacagttggggcagaaagttactgcaaagaaagt gatgcacaaacatgtgaagttgagagtaaatctcaagcatttaatatgaagagttcagct aaagaaacgtgcttttag >gi568815585f:72628941_72855213|GENSCAN_predicted_peptide_6|929_aa MKIVREHYLRDDIGCGAPGCAACGGAHEGPALEPQPQDPASSVCPQPHYLLPDTNVLLHQ IDVLEDPAIRNVIVLQTVLQEVRNRSAPVYKRIRDVTNNQEKHFYTFTNEHHRETYVEQE QGENANDRNDRAIRVAAKWYNEHLKKMSADNQLQVIFITNDRRNKEKAIEEGIPAFTCEE YVKSLTANPELIDRLACLSEEGIILQGLKHLNRAVHEDIVAVELLPKSQWVAPSSVVLHD EGQNEEDVEKEEETERMLKTAVSEKMLKPTGRVVGIIKRNWRPYCGMLSKSDIKESRRHL FTPADKRIPRIRIETRQASTLEGRRIIVAIDGWPRNSRYPNGHFVRNLGDVGEKETETEV LLLEHDVPHQPFSQAVLSFLPKMPWSITEKDMKNREDLRHLCICSVDPPGCTDIDDALHC RELENGNLEVGVHIADVSHFIRPGNALDQESARRGTTVYLCEKASLTYAEAQLRIDSANM NDDITTSLRGLNKLAKILKKRRIEKGALTLSSPEVRFHMDSETHDPIDLQTKELRETNSM VEEFMLLANISVAKKIHEEFSEHALLRKHPAPPPSNYEILVKAARSRNLEIKTDTAKSLA ESLDQAESPTFPYLNTLLRILATRCMMQAVYFCSGMDNDFHHYGLASPIYTHFTSPIRRY ADVIVHRLLAVAIGADCTYPELTDKHKLADICKNLNFRHKMAQYAQRASVAFHTQLFFKS KGIVSEEAYILFVRKNAIVVLIPKYGLEGTVFFEEKDKPNPQLIYDDEIPSLKIEDTVFH VFDKVKVKIMLDSSNLQHQKIRMSLVEPQMRKWRFMVCSHTATKKYPKLGNYYKGKRFNG LSSALLVRPQDTYTIMAEKGEAGTFFTGRQDENDCQQWKCQTLIKPSDLMRPAHYHRNSM EETAPMIQLPPPGPTLDTWGLLQFKVRFG >gi568815585f:72628941_72855213|GENSCAN_predicted_CDS_6|2790_bp atgaagatcgtgcgcgagcactacctgcgagacgacatcggctgcggtgcgcccgggtgc gcagcgtgtggaggggcgcacgaggggccggccctggagccgcagccccaggacccggcg agcagcgtctgcccgcaaccgcactacttgctgcccgacactaatgtgttactgcaccag attgatgttcttgaggaccctgccatcaggaatgtaattgtgctacaaacagttcttcaa gaagtgagaaatcgcagtgcccccgtatataaacgcatccgagatgtgactaataaccaa gagaagcatttctatactttcactaatgagcaccatagagaaacctatgtagaacaagaa cagggagaaaatgctaatgacaggaatgatagagcgattcgagtagcagcaaaatggtac aatgaacatttgaaaaaaatgtcagcagacaaccagctgcaagttatcttcataacaaat gacaggagaaacaaagagaaagccatagaagaaggaataccagctttcacttgtgaagaa tatgtaaagagcctaactgctaaccccgaactcatagatcgtcttgcttgtttgtctgaa gaagggataatcttacagggacttaaacatttaaacagagctgttcacgaagatattgtg gctgtggagcttctccccaagagtcagtgggtagcaccatcttctgtggttttacatgat gaaggtcaaaatgaagaagatgtggagaaagaagaagagacagaacgaatgcttaagact gctgtaagcgagaaaatgttgaagcctacaggtagagttgtaggaataataaaaaggaat tggagaccatattgtggcatgctttccaagtctgacattaaggagtcaagaagacatctc tttacacctgctgataagagaatccctcgaattcgcatagaaaccagacaggcttccaca ttagaaggacggagaattattgttgctattgatggttggcccagaaattccagatatcca aatggacactttgtgagaaatttaggtgatgttggagagaaagagactgaaacagaagtt ttgttacttgaacacgatgttccccatcagcctttttcacaggctgttcttagttttctg ccaaagatgccctggagcattactgaaaaggacatgaaaaaccgagaagacctgaggcat ctgtgtatttgtagtgtagacccaccaggatgtactgatatagacgatgctctacattgt cgagaactcgaaaatggaaatttggaggttggtgttcatattgctgatgtgagccatttt attaggccaggaaatgccttggatcaagaatcagccagaagaggaacaactgtgtatctt tgtgaaaaggcatctctgacgtatgctgaagctcagttgagaattgattcagcaaacatg aatgatgatattaccactagtctccgtggactgaataaactagccaaaattctgaagaaa agaaggattgaaaaaggggctttgactctatcctctcctgaagttcgattccacatggac agtgaaactcacgatcctatagatctgcagaccaaggaacttagggaaacaaattccatg gttgaagaatttatgttacttgccaatatttctgttgcaaaaaaaattcatgaggaattt tctgaacatgctctgcttcgaaaacatcctgctccacctccatcaaattatgaaattctt gttaaggcagccaggtcaaggaatttggaaattaagactgatacagccaagtctttggct gagtctttggatcaggccgaatctcctacttttccatatctaaacactctgttgagaata ttagccactcgctgtatgatgcaagctgtgtacttctgttctggaatggataatgatttt catcactatggcttagcgtctccaatatacacacattttacttcacccattagaagatac gcagatgtcattgttcatcggcttttggctgtggctattggggctgactgtacttatcca gagttgacagacaaacacaagcttgcagatatatgtaaaaatctaaatttccggcacaaa atggctcaatatgcccaacgtgcatcagtggcttttcatacccagttattcttcaaaagc aaaggaatagtaagtgaagaagcctatattttatttgtaagaaagaatgccattgtggta ttaattccaaagtatggtttagaagggacagtcttttttgaagaaaaggacaaaccaaac ccacagcttatttatgatgatgagataccctcacttaaaatagaagatacagtgttccat gtatttgataaagttaaagtgaaaatcatgttagactcatctaatcttcaacatcagaag atccgaatgtccctggtagaaccacagatgagaaaatggaggtttatggtctgttctcac actgctacgaagaaataccccaaactgggtaattattataaaggaaagaggtttaatgga ctcagttccgcattgctggtgaggcctcaggacacttacacaatcatggcagaaaaagga gaagcaggcacctttttcacagggcggcaggatgaaaatgactgccagcagtggaaatgc cagacgcttataaaaccttcagatctcatgagaccggctcactatcacaggaacagcatg gaggaaaccgcccccatgatccagttacctccacctggtcccacccttgacacgtgggga ttgctacaattcaaggttagatttgggtga >gi568815585f:72628941_72855213|GENSCAN_predicted_peptide_7|433_aa MSRKISKESKKVNISSSLESEDISLETTVPTDDISSSEEREGKVRITRQLIERKELLHNI QLLKIELSQKTMMIDNLKVDYLTKIEELEEKLNDALHQKQLLTLRLDNQLAFQQKDASKY QELMKQEMETILLRQKQLEETNLQLREKAGDVRRNLRDFELTEEQYIKLKAFPEDQLSIP EYVSVRFYELVNPLRKEICELQVKKNILAEELSTNKNQLKQLTETYEEDRKNYSEVQIRC QRLALELADTKQLIQQGDYRQENYDKVKSERDALEQEVIELRRKHEILEASHMIQTKERS ELSKEVVTLEQTVTLLQKDKEYLNRQNMELSVRCAHEEDRLERLQAQLEESKKAREEIDH YKTEYENKLHDELEQIRLKTNQEIDQLRNASREMYERENRNLREARDNAVAEKERAVMAE KDALEKHDQLLDS >gi568815585f:72628941_72855213|GENSCAN_predicted_CDS_7|1299_bp atgtctcgaaaaatttcaaaggagtcaaaaaaagtgaacatctctagttctctggaatct gaagatattagtttagaaacaacagttcctacggatgatatttcctcatcagaagagcga gagggcaaagtcagaatcaccaggcagctaattgaacgaaaagaactacttcataatatt cagttactaaaaattgagctatcccagaaaactatgatgatcgacaatttgaaagtggat tatcttacaaagattgaagaattggaggagaaacttaatgatgcacttcaccagaagcag ctactaacattgagattagacaaccaattggcttttcaacagaaagatgccagcaaatat caagaattaatgaaacaagaaatggaaaccattttgttgagacagaaacaactagaagag acaaatcttcagctaagagaaaaagctggagatgttcgtcgaaacctgcgtgactttgag ttgacagaagagcaatatattaaattaaaagcttttcctgaagatcagctttctattcct gaatatgtatctgttcgcttctatgagctagtgaatccattaagaaaggaaatctgtgaa ctacaagtgaaaaagaatatcctagcagaagaattaagtacaaacaaaaaccaactgaag cagctgacagagacatatgaggaagatcgaaaaaactactctgaagttcaaattagatgt caacgtttggccttagaattagcagacacaaaacagttaattcagcaaggtgactaccgt caagagaactatgataaagtcaagagtgaacgtgatgcacttgaacaggaagtaattgag cttaggagaaaacatgaaatacttgaagcctctcacatgattcaaacaaaagaacgaagt gaattatcaaaagaggtagtcaccttagagcaaactgttactttactgcaaaaggataaa gaatatcttaatcgccaaaacatggagcttagtgttcgctgtgctcatgaagaggatcgc cttgaaagacttcaagctcaactggaagaaagcaaaaaggctagagaagagatagaccat tataaaacagaatatgaaaataaactacatgatgaactagaacaaatcagattgaaaacc aaccaagaaattgatcaacttcgaaatgcctctagggaaatgtatgaacgagaaaacaga aatctccgagaagcaagggataatgctgtggctgaaaaggaacgagcagtgatggctgaa aaggatgctttagaaaaacacgatcagctcttagacagn