GENSCAN 1.0 Date run: 3-Nov-116 Time: 10:39:41 Sequence gi568815585r:72659798_72881832 : 222035 bp : 37.08% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 20504 20567 64 1 1 48 100 52 0.572 3.76 1.02 Intr + 23163 23302 140 1 2 54 64 81 0.474 1.56 1.03 Term + 30757 30858 102 0 0 126 43 134 0.986 10.00 1.04 PlyA + 31442 31447 6 -0.45 2.00 Prom + 32293 32332 40 -4.95 2.01 Sngl + 33084 33446 363 2 0 55 41 217 0.614 9.53 2.02 PlyA + 36136 36141 6 1.05 3.00 Prom + 42500 42539 40 -4.05 3.01 Sngl + 43960 44184 225 0 0 97 47 125 0.607 4.19 3.02 PlyA + 44355 44360 6 1.05 4.05 PlyA - 45713 45708 6 1.05 4.04 Term - 55300 55135 166 0 1 79 50 99 0.426 1.61 4.03 Intr - 55882 55755 128 1 2 53 62 103 0.639 2.76 4.02 Intr - 59300 59155 146 0 2 100 53 124 0.636 9.18 4.01 Init - 67805 67727 79 2 1 92 113 179 0.994 20.07 4.00 Prom - 68900 68861 40 -10.45 5.00 Prom + 68963 69002 40 -10.94 5.01 Init + 69144 69296 153 2 0 34 100 80 0.040 3.83 5.02 Intr + 71484 71590 107 2 2 57 123 81 0.924 6.59 5.03 Intr + 75163 75208 46 1 1 71 78 57 0.612 0.49 5.04 Intr + 78165 78246 82 2 1 60 82 52 0.100 0.09 5.05 Intr + 85184 85410 227 0 2 46 89 97 0.372 2.38 5.06 Intr + 86147 86279 133 1 1 64 91 42 0.836 1.40 5.07 Intr + 86704 87314 611 2 2 109 110 519 0.958 47.41 5.08 Intr + 93893 94024 132 1 0 97 102 61 0.474 8.32 5.09 Term + 95246 95272 27 1 0 74 39 22 0.198 -7.00 5.10 PlyA + 95504 95509 6 -0.45 6.19 PlyA - 95625 95620 6 1.05 6.18 Term - 97138 96776 363 1 0 61 55 176 0.520 5.18 6.17 Intr - 100854 100732 123 0 0 9 110 72 0.576 1.36 6.16 Intr - 101724 101566 159 0 0 62 80 84 0.938 4.26 6.15 Intr - 102017 101849 169 1 1 81 68 153 0.991 11.63 6.14 Intr - 102340 102126 215 1 2 112 63 107 0.999 7.19 6.13 Intr - 103810 103654 157 0 1 85 86 109 0.999 9.49 6.12 Intr - 106261 106175 87 0 0 84 58 93 0.941 4.07 6.11 Intr - 109115 108988 128 2 2 81 38 162 0.997 9.06 6.10 Intr - 112099 111998 102 1 0 76 94 75 0.987 6.35 6.09 Intr - 112478 112362 117 2 0 91 51 104 0.991 6.74 6.08 Intr - 113042 112896 147 2 0 66 115 76 0.992 7.61 6.07 Intr - 114024 113887 138 0 0 30 75 128 0.977 5.44 6.06 Intr - 114262 114149 114 1 0 62 105 21 0.650 0.92 6.05 Intr - 115578 115414 165 0 0 79 60 237 0.931 19.24 6.04 Intr - 117696 117623 74 1 2 63 63 61 0.885 -0.69 6.03 Intr - 118583 118390 194 1 2 81 89 170 0.999 14.61 6.02 Intr - 121206 121049 158 0 2 16 63 166 0.965 4.79 6.01 Init - 121987 121808 180 1 0 84 96 173 0.925 15.00 6.00 Prom - 122680 122641 40 -10.55 7.00 Prom + 123526 123565 40 -9.25 7.01 Init + 123673 123924 252 0 0 49 116 202 0.995 16.49 7.02 Intr + 132650 132750 101 1 2 52 110 97 0.993 6.29 7.03 Intr + 135562 135760 199 1 1 46 84 218 0.999 15.63 7.04 Intr + 138110 138229 120 1 0 51 68 123 0.649 6.37 7.05 Intr + 162052 162185 134 0 2 70 107 169 0.987 15.52 7.06 Intr + 167213 167321 109 2 1 63 99 47 0.978 2.67 7.07 Intr + 167936 168093 158 1 2 55 5 193 0.022 5.59 7.08 Intr + 175446 175571 126 0 0 80 98 174 0.059 16.47 7.09 Intr + 194260 194358 99 1 0 38 115 180 0.711 14.01 7.10 Term + 209520 209556 37 0 1 93 37 50 0.230 -3.77 7.11 PlyA + 209826 209831 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 68046 68210 165 2 0 51 94 123 0.891 8.90 S.002 Intr + 69129 69296 168 2 0 72 100 72 0.910 6.02 S.003 Init - 123204 123186 19 0 1 78 102 -13 0.918 -0.67 S.004 Term + 167936 168148 213 1 0 55 37 200 0.918 7.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:72659798_72881832|GENSCAN_predicted_peptide_1|101_aa MTVTKYFTDCLRKVIRDDGGEGETPLLVSFRRSAFKLIPQGYLSSPRGPSSPRISLLHVD SSEHSESQALNRVDDDTRIEEDNLPYPVRQFKCQPYLETLS >gi568815585r:72659798_72881832|GENSCAN_predicted_CDS_1|306_bp atgacagttactaaatatttcactgactgtctcaggaaagttattagagatgatggagga gaaggggagaccccactgttggtcagcttcagaagatctgctttcaagctcattccccag ggctacctcagttctccacgtggcccctcatctcccaggatctctcttctccatgtggac tcttcagagcatagtgagagccaggccctcaatagagtggatgatgacacccgcattgag gaggacaatctaccttacccagtccgccaattcaaatgtcaaccttatttggaaacactt tcatag >gi568815585r:72659798_72881832|GENSCAN_predicted_peptide_2|120_aa MADKEKAFWGEEFKLAVKQVLATEICITKREPTANSQENGENVWKAFWRFKWQPLPSQAL RSRRKEWFCGPDPGPHCPAQPWEIAPCILTPWLDLWLKVPGYSSGCCFRGCELFALTASM >gi568815585r:72659798_72881832|GENSCAN_predicted_CDS_2|363_bp atggcagataaagaaaaagctttctggggagaggaattcaagctggctgtgaagcaagta cttgctacagaaatctgcataactaaaagagagccaactgctaatagccaagagaatggg gaaaatgtctggaaggcattttggagatttaagtggcagcccctcccatcacaggccctg aggtctaggaggaaagaatggttttgtgggccagacccagggcctcactgccctgcgcag ccttgggaaattgctccctgcatcctgaccccttggctcgatttgtggctcaaagttccc ggatacagctcaggctgctgcttcagagggtgtgagctcttcgccttgacggcttccatg tga >gi568815585r:72659798_72881832|GENSCAN_predicted_peptide_3|74_aa MELSLWPRICEIYKKGDIVDIKGMVTLQVLPWRNWKSLQRYPAGCWQCCQQVKGKILAKR TKNTLSTLRVETAS >gi568815585r:72659798_72881832|GENSCAN_predicted_CDS_3|225_bp atggagttgtccctttggccacgtatatgcgaaatctacaagaaaggtgatattgtagac atcaagggaatggttactcttcaggtgttaccatggcgaaactggaagagtctacaacgt tatccagcaggctgctggcagtgttgtcaacaagttaagggcaagattcttgccaagaga actaagaacacattaagtactctaagagttgagacagcttcttga >gi568815585r:72659798_72881832|GENSCAN_predicted_peptide_4|172_aa MASSSGAGAAAAAAAANLNAVRETMDVLLEISRILNTGLDMETLSICVRLCEQGINPEAL SSVIKELRKATEALKRVQHQEDGAKPFMRDEFSQLPPGPTSNIGDYNGDYNIGNYISRVK PPWLLSWAGIVECLQLFKAQGTSCQWIYHSGVGERWPSSHSSIRQFPSEDIV >gi568815585r:72659798_72881832|GENSCAN_predicted_CDS_4|519_bp atggcgagtagcagcggtgctggggcggcggcggcggccgcggcggcgaatctgaatgcg gtgcgggagaccatggacgttctgcttgagatttcaagaattttgaatactggcttagat atggaaactctgtctatttgtgtacggctttgtgaacaaggaattaacccagaagcttta tcatcggttattaaggagcttcgcaaggctactgaagcactgaagcgagtacagcaccaa gaggatggtgctaaaccattcatgagggatgaattcagtcaactcccaccaggccccacc tccaacattggagattacaatggagattacaacattggaaattacatttcaagggttaag cccccatggctgctttcgtgggctggcattgttgagtgtctgcagcttttcaaggctcag ggtacaagctgtcagtggatctaccattctggggttggagaacggtggccctcttctcac agctccattcggcagttccccagtgaggacattgtgtag >gi568815585r:72659798_72881832|GENSCAN_predicted_peptide_5|505_aa MGDVKESKMQITPETPGRIPVLNPFESPSDYSNLHEQTLASPSVFKSTKLPTPGKFRWSI DQLAVINPVEIDPEDIHRQALYLSHSRIDKDVEDKRQKAIEEFFTKDVIVPSPWTDHEGK QLSQCHSSKCDYFRADEFADQSPGNLSSSSLRRKLFLDGNGSISDSLPSASPGSPHSGVQ TSLEMFYSIDLSPVKCRSPLQTPSSGQFSSSPIQASAKKYSLGSITSPSPISSPTFSPIE FQIGETPLSEQRKFTVHSPDASSGTNSNGITNPCIRSPYIDGCSPIKNWSPMRLQMYSGG TQYRTSVIQIPFTLETQGEDEEDKENIPSTDVSSPAMDAAGIHLRQFSNEASTHGTHLVV TAMSVTQNQSSASEKELALLQDVEREKDNNTVDMVDPIEIADETTWIKEPVDNGSLPMTD FVSGIAFSIENSHMCMSPLAESSVIPCESSNIQMDSGYNTQNCGSNIMDTVGAESYCKES DAQTCEVESKSQAFNMKSSAKETCF >gi568815585r:72659798_72881832|GENSCAN_predicted_CDS_5|1518_bp atgggagatgtcaaggaatcaaagatgcaaataacaccagaaactccaggaaggatccct gttttaaatccttttgaaagtcctagtgattattctaatctccatgaacaaactctcgcc agtccttctgtttttaaatcaacaaaattaccaactccagggaaatttagatggtctatt gatcaactagctgtaataaatcctgtagaaatagacccagaagatattcatcgtcaagct ttatacttaagtcattctcgaatagataaagatgtggaagacaaaagacaaaaagccatt gaagagtttttcactaaagatgtcatcgtaccctctccttggactgatcatgaagggaaa cagctttcacaatgtcattccagtaaatgtgactattttagagctgatgaatttgcagat caatctcctggaaacctcagttcttcatccctcagaagaaagctgtttttagatgggaac ggaagcatctccgactccttaccttcggcttctcccggaagtcctcacagtggtgttcaa acatcactagagatgttttattcaatagatttgtctcctgtaaagtgtaggagccccttg cagacaccaagttcggggcagttttcttctagccctattcaggctagtgcaaaaaaatac agcttgggaagcataactagtccttcgcctatttcttcacccactttctcaccaattgaa tttcagataggagagactccactctcagaacaaaggaagtttactgttcattctcctgat gcttcatctggaacaaattctaatgggataactaatccgtgtatcagaagtccttatata gatggctgctcgccaattaaaaattggtctcctatgagacttcagatgtatagtggtggt actcagtatcggacctcagtgattcagataccttttactcttgagactcaaggtgaagat gaggaagataaagagaatattccttccacagatgtctcatcacccgccatggatgctgct ggaatacacctacggcagtttagtaatgaggcttctacccatggtacacatttggttgtg actgccatgtctgttacacaaaatcagtccagtgcttctgagaaagaattagcactgttg caggatgttgaaagggagaaagacaataacactgtggatatggttgatcctatagagata gcagatgagaccacttggattaaggagccggttgataatggcagtttacccatgactgat tttgtaagtggcattgccttcagtattgaaaactctcatatgtgcatgtcacctcttgct gaaagcagtgtcattccttgtgaaagcagtaacattcagatggatagtggctataatacg cagaattgtggaagcaatattatggatacagttggggcagaaagttactgcaaagaaagt gatgcacaaacatgtgaagttgagagtaaatctcaagcatttaatatgaagagttcagct aaagaaacgtgcttttag >gi568815585r:72659798_72881832|GENSCAN_predicted_peptide_6|929_aa MKIVREHYLRDDIGCGAPGCAACGGAHEGPALEPQPQDPASSVCPQPHYLLPDTNVLLHQ IDVLEDPAIRNVIVLQTVLQEVRNRSAPVYKRIRDVTNNQEKHFYTFTNEHHRETYVEQE QGENANDRNDRAIRVAAKWYNEHLKKMSADNQLQVIFITNDRRNKEKAIEEGIPAFTCEE YVKSLTANPELIDRLACLSEEGIILQGLKHLNRAVHEDIVAVELLPKSQWVAPSSVVLHD EGQNEEDVEKEEETERMLKTAVSEKMLKPTGRVVGIIKRNWRPYCGMLSKSDIKESRRHL FTPADKRIPRIRIETRQASTLEGRRIIVAIDGWPRNSRYPNGHFVRNLGDVGEKETETEV LLLEHDVPHQPFSQAVLSFLPKMPWSITEKDMKNREDLRHLCICSVDPPGCTDIDDALHC RELENGNLEVGVHIADVSHFIRPGNALDQESARRGTTVYLCEKASLTYAEAQLRIDSANM NDDITTSLRGLNKLAKILKKRRIEKGALTLSSPEVRFHMDSETHDPIDLQTKELRETNSM VEEFMLLANISVAKKIHEEFSEHALLRKHPAPPPSNYEILVKAARSRNLEIKTDTAKSLA ESLDQAESPTFPYLNTLLRILATRCMMQAVYFCSGMDNDFHHYGLASPIYTHFTSPIRRY ADVIVHRLLAVAIGADCTYPELTDKHKLADICKNLNFRHKMAQYAQRASVAFHTQLFFKS KGIVSEEAYILFVRKNAIVVLIPKYGLEGTVFFEEKDKPNPQLIYDDEIPSLKIEDTVFH VFDKVKVKIMLDSSNLQHQKIRMSLVEPQMRKWRFMVCSHTATKKYPKLGNYYKGKRFNG LSSALLVRPQDTYTIMAEKGEAGTFFTGRQDENDCQQWKCQTLIKPSDLMRPAHYHRNSM EETAPMIQLPPPGPTLDTWGLLQFKVRFG >gi568815585r:72659798_72881832|GENSCAN_predicted_CDS_6|2790_bp atgaagatcgtgcgcgagcactacctgcgagacgacatcggctgcggtgcgcccgggtgc gcagcgtgtggaggggcgcacgaggggccggccctggagccgcagccccaggacccggcg agcagcgtctgcccgcaaccgcactacttgctgcccgacactaatgtgttactgcaccag attgatgttcttgaggaccctgccatcaggaatgtaattgtgctacaaacagttcttcaa gaagtgagaaatcgcagtgcccccgtatataaacgcatccgagatgtgactaataaccaa gagaagcatttctatactttcactaatgagcaccatagagaaacctatgtagaacaagaa cagggagaaaatgctaatgacaggaatgatagagcgattcgagtagcagcaaaatggtac aatgaacatttgaaaaaaatgtcagcagacaaccagctgcaagttatcttcataacaaat gacaggagaaacaaagagaaagccatagaagaaggaataccagctttcacttgtgaagaa tatgtaaagagcctaactgctaaccccgaactcatagatcgtcttgcttgtttgtctgaa gaagggataatcttacagggacttaaacatttaaacagagctgttcacgaagatattgtg gctgtggagcttctccccaagagtcagtgggtagcaccatcttctgtggttttacatgat gaaggtcaaaatgaagaagatgtggagaaagaagaagagacagaacgaatgcttaagact gctgtaagcgagaaaatgttgaagcctacaggtagagttgtaggaataataaaaaggaat tggagaccatattgtggcatgctttccaagtctgacattaaggagtcaagaagacatctc tttacacctgctgataagagaatccctcgaattcgcatagaaaccagacaggcttccaca ttagaaggacggagaattattgttgctattgatggttggcccagaaattccagatatcca aatggacactttgtgagaaatttaggtgatgttggagagaaagagactgaaacagaagtt ttgttacttgaacacgatgttccccatcagcctttttcacaggctgttcttagttttctg ccaaagatgccctggagcattactgaaaaggacatgaaaaaccgagaagacctgaggcat ctgtgtatttgtagtgtagacccaccaggatgtactgatatagacgatgctctacattgt cgagaactcgaaaatggaaatttggaggttggtgttcatattgctgatgtgagccatttt attaggccaggaaatgccttggatcaagaatcagccagaagaggaacaactgtgtatctt tgtgaaaaggcatctctgacgtatgctgaagctcagttgagaattgattcagcaaacatg aatgatgatattaccactagtctccgtggactgaataaactagccaaaattctgaagaaa agaaggattgaaaaaggggctttgactctatcctctcctgaagttcgattccacatggac agtgaaactcacgatcctatagatctgcagaccaaggaacttagggaaacaaattccatg gttgaagaatttatgttacttgccaatatttctgttgcaaaaaaaattcatgaggaattt tctgaacatgctctgcttcgaaaacatcctgctccacctccatcaaattatgaaattctt gttaaggcagccaggtcaaggaatttggaaattaagactgatacagccaagtctttggct gagtctttggatcaggccgaatctcctacttttccatatctaaacactctgttgagaata ttagccactcgctgtatgatgcaagctgtgtacttctgttctggaatggataatgatttt catcactatggcttagcgtctccaatatacacacattttacttcacccattagaagatac gcagatgtcattgttcatcggcttttggctgtggctattggggctgactgtacttatcca gagttgacagacaaacacaagcttgcagatatatgtaaaaatctaaatttccggcacaaa atggctcaatatgcccaacgtgcatcagtggcttttcatacccagttattcttcaaaagc aaaggaatagtaagtgaagaagcctatattttatttgtaagaaagaatgccattgtggta ttaattccaaagtatggtttagaagggacagtcttttttgaagaaaaggacaaaccaaac ccacagcttatttatgatgatgagataccctcacttaaaatagaagatacagtgttccat gtatttgataaagttaaagtgaaaatcatgttagactcatctaatcttcaacatcagaag atccgaatgtccctggtagaaccacagatgagaaaatggaggtttatggtctgttctcac actgctacgaagaaataccccaaactgggtaattattataaaggaaagaggtttaatgga ctcagttccgcattgctggtgaggcctcaggacacttacacaatcatggcagaaaaagga gaagcaggcacctttttcacagggcggcaggatgaaaatgactgccagcagtggaaatgc cagacgcttataaaaccttcagatctcatgagaccggctcactatcacaggaacagcatg gaggaaaccgcccccatgatccagttacctccacctggtcccacccttgacacgtgggga ttgctacaattcaaggttagatttgggtga >gi568815585r:72659798_72881832|GENSCAN_predicted_peptide_7|444_aa MSRKISKESKKVNISSSLESEDISLETTVPTDDISSSEEREGKVRITRQLIERKELLHNI QLLKIELSQKTMMIDNLKVDYLTKIEELEEKLNDALHQKQLLTLRLDNQLAFQQKDASKY QELMKQEMETILLRQKQLEETNLQLREKAGDVRRNLRDFELTEEQYIKLKAFPEDQLSIP EYVSVRFYELVNPLRKEICELQVKKNILAEELSTNKNQLKQLTETYEEDRKNYSEVQIRC QRLALELADTKQLIQQGDYRQENYDKVKSERDALEQEVIELRRKHEILEASHMIQTKERS ELSKEVVTLEQTVTLLQKDKEYLNRQNMELSVRCAHEEDRLERLQAQLEESKKAREEIDH YKTEYENKLHDELEQIRLKTNQEIDQLRNASREMYERENRNLREARDNAVAEKERAVMAE KDALEKHDQLLDRDMDEIGNHHSQ >gi568815585r:72659798_72881832|GENSCAN_predicted_CDS_7|1335_bp atgtctcgaaaaatttcaaaggagtcaaaaaaagtgaacatctctagttctctggaatct gaagatattagtttagaaacaacagttcctacggatgatatttcctcatcagaagagcga gagggcaaagtcagaatcaccaggcagctaattgaacgaaaagaactacttcataatatt cagttactaaaaattgagctatcccagaaaactatgatgatcgacaatttgaaagtggat tatcttacaaagattgaagaattggaggagaaacttaatgatgcacttcaccagaagcag ctactaacattgagattagacaaccaattggcttttcaacagaaagatgccagcaaatat caagaattaatgaaacaagaaatggaaaccattttgttgagacagaaacaactagaagag acaaatcttcagctaagagaaaaagctggagatgttcgtcgaaacctgcgtgactttgag ttgacagaagagcaatatattaaattaaaagcttttcctgaagatcagctttctattcct gaatatgtatctgttcgcttctatgagctagtgaatccattaagaaaggaaatctgtgaa ctacaagtgaaaaagaatatcctagcagaagaattaagtacaaacaaaaaccaactgaag cagctgacagagacatatgaggaagatcgaaaaaactactctgaagttcaaattagatgt caacgtttggccttagaattagcagacacaaaacagttaattcagcaaggtgactaccgt caagagaactatgataaagtcaagagtgaacgtgatgcacttgaacaggaagtaattgag cttaggagaaaacatgaaatacttgaagcctctcacatgattcaaacaaaagaacgaagt gaattatcaaaagaggtagtcaccttagagcaaactgttactttactgcaaaaggataaa gaatatcttaatcgccaaaacatggagcttagtgttcgctgtgctcatgaagaggatcgc cttgaaagacttcaagctcaactggaagaaagcaaaaaggctagagaagagatagaccat tataaaacagaatatgaaaataaactacatgatgaactagaacaaatcagattgaaaacc aaccaagaaattgatcaacttcgaaatgcctctagggaaatgtatgaacgagaaaacaga aatctccgagaagcaagggataatgctgtggctgaaaaggaacgagcagtgatggctgaa aaggatgctttagaaaaacacgatcagctcttagacagggacatggatgaaattggaaat catcattctcagtaa