GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:09:56 Sequence gi568815597f:103625744_103750469 : 124726 bp : 36.42% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 177 172 6 1.05 1.05 Term - 24655 24537 119 2 2 103 33 122 0.929 5.92 1.04 Intr - 25336 25231 106 1 1 32 16 149 0.895 0.97 1.03 Intr - 26808 26586 223 2 1 35 86 147 0.716 6.41 1.02 Intr - 28279 27885 395 1 2 8 67 319 0.404 14.33 1.01 Init - 28792 28595 198 1 0 64 55 120 0.311 5.25 1.00 Prom - 29705 29666 40 -10.65 2.00 Prom + 29743 29782 40 -5.25 2.01 Init + 30588 30755 168 2 0 78 94 106 0.685 9.43 2.02 Intr + 31101 31247 147 2 0 70 98 30 0.757 1.71 2.03 Intr + 32051 32248 198 1 0 88 91 148 0.999 13.73 2.04 Intr + 32697 32927 231 2 0 80 94 130 0.996 9.85 2.05 Intr + 33667 33800 134 0 2 85 33 125 0.993 5.22 2.06 Intr + 34617 34739 123 0 0 52 66 71 0.496 0.08 2.07 Intr + 34834 34933 100 1 1 65 106 76 0.996 6.29 2.08 Intr + 36914 37032 119 1 2 42 100 114 0.999 6.34 2.09 Intr + 37143 37268 126 0 0 72 119 102 0.999 10.57 2.10 Term + 38588 38777 190 2 1 88 38 143 0.998 5.24 2.11 PlyA + 38868 38873 6 1.05 3.04 PlyA - 38951 38946 6 1.05 3.03 Term - 39855 39832 24 0 0 105 43 9 0.455 -4.65 3.02 Intr - 42525 42328 198 0 0 78 91 192 0.708 17.13 3.01 Init - 51162 51157 6 0 0 82 52 4 0.126 -3.01 3.00 Prom - 53740 53701 40 -5.75 4.10 PlyA - 53904 53899 6 1.05 4.09 Term - 61895 61706 190 1 1 88 38 143 0.997 5.24 4.08 Intr - 63340 63215 126 0 0 72 119 102 0.999 10.57 4.07 Intr - 63569 63451 119 2 2 42 100 114 0.999 6.34 4.06 Intr - 65650 65551 100 0 1 65 106 76 0.996 6.29 4.05 Intr - 66817 66684 134 1 2 85 33 125 0.995 5.22 4.04 Intr - 67787 67557 231 2 0 80 94 130 0.996 9.85 4.03 Intr - 68433 68236 198 0 0 88 91 148 0.999 13.73 4.02 Intr - 69383 69237 147 2 0 70 98 30 0.757 1.71 4.01 Init - 69896 69729 168 2 0 78 94 106 0.679 9.43 4.00 Prom - 70741 70702 40 -5.25 5.00 Prom + 70779 70818 40 -10.65 5.01 Init + 71692 71889 198 0 0 64 55 120 0.377 5.25 5.02 Intr + 72205 72599 395 0 2 8 67 319 0.406 14.33 5.03 Intr + 73676 73898 223 2 1 35 86 147 0.716 6.41 5.04 Intr + 75148 75293 146 0 2 32 47 142 0.064 2.66 5.05 Intr + 75372 75447 76 0 1 70 62 62 0.038 0.40 5.06 Intr + 87980 88210 231 1 0 70 73 149 0.092 8.65 5.07 Intr + 89012 89145 134 1 2 64 33 125 0.983 3.12 5.08 Intr + 90176 90275 100 2 1 50 111 75 0.984 5.19 5.09 Intr + 92248 92366 119 0 2 49 100 111 0.999 6.74 5.10 Intr + 92478 92603 126 0 0 72 119 85 0.996 8.87 5.11 Term + 93939 94128 190 0 1 79 38 96 0.882 -0.36 5.12 PlyA + 96910 96915 6 1.05 6.07 PlyA - 97502 97497 6 1.05 6.06 Term - 118793 118675 119 0 2 103 33 122 0.899 5.92 6.05 Intr - 119474 119369 106 2 1 32 16 149 0.861 0.97 6.04 Intr - 120946 120724 223 0 1 35 86 158 0.835 7.51 6.03 Intr - 122417 122023 395 2 2 8 67 319 0.449 14.33 6.02 Intr - 123101 122733 369 2 0 40 55 205 0.389 6.78 6.01 Intr - 123618 123538 81 0 0 55 68 74 0.497 1.02 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 75148 75253 106 0 1 32 16 149 0.884 0.97 S.002 Term + 75829 75947 119 2 2 103 33 122 0.917 5.92 S.003 Init + 87938 88210 273 1 0 68 73 144 0.893 7.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:103625744_103750469|GENSCAN_predicted_peptide_1|346_aa MAPTVPSTPYPVGRPPSPEPTAPRPPRVDKNKSETAGKSTSLAARLTAQDGNSNALREQR YTRTDEDPNEGPDMERLRGYREALIERLKKGAQKGTNVNKVSEVIQGKEESPAQFYQRPC EAYCMYTPFDLESPENQPLINTALVIQSAEDIQRKLQKQARFAGMKTLQLLERANEVFVN RDATSGQESRKEGERQARFKKSPTIFGEASVGDLQMFPAKDLGCILLQYVDDLLLGHSMA VRCAKRTGALLRHLEDCGYKVPKKKAQICRQQHNAKQGPSVPRGIEASGAAHFEDLQVDF TEMPKCRSDIVWIKDWNVAPLWPQWKGPQTINLTTPTAVNVEGIPA >gi568815597f:103625744_103750469|GENSCAN_predicted_CDS_1|1041_bp atggcaccaacagtgccctcaaccccttatccagtggggaggcccccttctcctgagccc acagcccctagaccacccagagtagacaagaacaaaagtgaaactgcgggaaaatccact tccctggcagcccgcttaacggcccaagacgggaattcaaatgccctgagagagcagcga tatactaggacagatgaggatccaaatgaaggaccagatatggaaaggctaagagggtac cgagaggcattaattgaaaggttgaaaaaaggggctcaaaagggtaccaatgtaaataaa gtttctgaagtcatccaaggaaaggaggaaagcccagcccagttctatcaaagaccgtgt gaggcctattgcatgtacactcctttcgatctggagagtcctgaaaatcagccgttgatt aatacggccttagttattcagagtgcagaagatatccagagaaaattgcaaaaacaggct aggtttgcaggaatgaaaaccttgcagttactggaaagagctaatgaagtatttgtaaat agagatgcaacaagcggccaagaaagccgtaaggagggcgaacgccaggccaggttcaaa aagtcccccaccatctttggggaggcctcggttggagacctccaaatgtttcctgctaaa gacctaggttgcatcctgctccagtatgtagatgaccttctgctaggacactccatggca gtcaggtgtgcaaaaaggacgggtgccctgcttcgacacctggaggactgtggatataaa gtgcccaaaaagaaagctcagatctgcagacagcagcacaatgcgaagcaaggcccctct gtacctcggggaatagaagcctctggagcagctcattttgaagatcttcaagtggacttc acagaaatgcctaaatgtagaagtgacattgtgtggatcaaggactggaacgtggctccg ctgtggccacagtggaaaggaccccagacaattaacctgaccactcccacagctgtcaat gtagaaggaatcccagcctag >gi568815597f:103625744_103750469|GENSCAN_predicted_peptide_2|511_aa MKLFWLLFTIGFCWAQYSSNTQQGRTSIVHLFEWRWVDIALECERYLAPKGFGGVQVSPP NENVAIHNPFRPWWERYQPVSYKLCTRSGNEDEFRNMVTRCNNVGVRIYVDAVINHMCGN AVSAGTSSTCGSYFNPGSRDFPAVPYSGWDFNDGKCKTGSGDIENYNDATQVRDCRLSGL LDLALGKDYVRSKIAEYMNHLIDIGVAGFRIDASKHMWPGDIKAILDKLHNLNSNWFPEG SKPFIYQEVIDLGGEPIKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKNWGEGWG FMPSDRALVFVDNHDNQRGHGAGGASILTFWDARLYKMAVGFMLAHPYGFTRVMSSYRWP RYFENGKDVNDWVGPPNDNGVTKEVTINPDTTCGNDWVCEHRWRQIRNMVNFRNVVDGQP FTNWYDNGSNQVAFGRGNRGFIVFNNDDWTFSLTLQTGLPAGTYCDVISGDKINGNCTGI KIYVSDDGKAHFSISNSAEDPFIAIHAESKL >gi568815597f:103625744_103750469|GENSCAN_predicted_CDS_2|1536_bp atgaagctcttttggttgcttttcaccattgggttctgctgggctcagtattcctcaaat acacaacaaggacgaacatctattgttcatctgtttgaatggcgatgggttgatattgct cttgaatgtgagcgatatttagctcccaagggatttggaggggttcaggtctctccacca aatgaaaatgttgccattcacaaccctttcagaccttggtgggaaagataccaaccagtt agctataaattatgcacaagatctggaaatgaagatgaatttagaaacatggtgactaga tgcaacaatgttggggttcgtatttatgtggatgctgtaattaatcatatgtgtggtaat gctgtgagtgcaggaacaagcagtacctgtggaagttacttcaaccctggaagtagggac tttccagcagtcccatattctggatgggattttaatgatggtaaatgtaaaactggaagt ggagatatcgagaactataatgatgctactcaggtcagagattgtcgtctgtctggtctt ctcgatcttgcactggggaaggattatgtgcgttctaagattgccgaatatatgaaccat ctcattgacattggtgttgcagggttcagaattgatgcttccaagcacatgtggcctgga gacataaaggcaattttggacaaactgcataatctaaacagtaactggttcccggaaggt agtaaacctttcatttaccaggaggtaattgatctgggtggtgagccaattaaaagcagt gactactttggtaatggccgggtgacagaattcaagtatggtgcaaaactcggcacagtt attcgcaagtggaatggagagaagatgtcttacttaaagaactggggagaaggttggggt ttcatgccttctgacagagcgcttgtctttgtggataaccatgacaatcaacgaggacat ggcgctggaggagcctctatacttaccttctgggatgctaggctgtacaaaatggcagtt ggatttatgcttgctcatccttatggatttacacgagtaatgtcaagctaccgttggcca agatattttgaaaatggaaaagatgttaatgattgggttgggccaccaaatgataatgga gtaactaaagaagttactattaatccagacactacttgtggcaatgactgggtctgtgaa catcgatggcgccaaataaggaacatggttaatttccgcaatgtagtggatggccagcct tttacaaactggtatgataatgggagcaaccaagtggcttttgggagaggaaacagagga ttcattgttttcaacaatgatgactggacattttctttaactttgcaaactggtcttcct gctggcacatactgtgatgtcatttctggagataaaattaatggcaactgcacaggcatt aaaatctacgtttctgatgatggcaaagctcatttttctattagtaactctgctgaagat ccatttattgcaattcatgctgaatctaaattgtaa >gi568815597f:103625744_103750469|GENSCAN_predicted_peptide_3|75_aa MVVRIYVDAVINHMCGNAVSAGTSSTCGSYFNPGSRDFPAVPYSGWDFNDGKCKTGSGDI ENYNDATQLYKLIRP >gi568815597f:103625744_103750469|GENSCAN_predicted_CDS_3|228_bp atggtggttcgtatttatgtggatgctgtaattaatcatatgtgtggtaacgctgtgagt gcaggaacaagcagtacctgtggaagttacttcaaccctggaagtagggactttccagca gtcccatattctggatgggatttcaatgatggtaaatgtaaaactggaagtggagatatc gagaactacaatgatgctactcagctatacaagttgattaggccctag >gi568815597f:103625744_103750469|GENSCAN_predicted_peptide_4|470_aa MKLFWLLFTIGFCWAQYSSNTQQGRTSIVHLFEWRWVDIALECERYLAPKGFGGVQVSPP NENVAIHNPFRPWWERYQPVSYKLCTRSGNEDEFRNMVTRCNNVGVRIYVDAVINHMCGN AVSAGTSSTCGSYFNPGSRDFPAVPYSGWDFNDGKCKTGSGDIENYNDATQVRDCRLSGL LDLALGKDYVRSKIAEYMNHLIDIGVAGFRIDASKHMWPGDIKAILDKLHNLNSNWFPEG SKPFIYQEVIDLGGEPIKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKLYKMAVG FMLAHPYGFTRVMSSYRWPRYFENGKDVNDWVGPPNDNGVTKEVTINPDTTCGNDWVCEH RWRQIRNMVNFRNVVDGQPFTNWYDNGSNQVAFGRGNRGFIVFNNDDWTFSLTLQTGLPA GTYCDVISGDKINGNCTGIKIYVSDDGKAHFSISNSAEDPFIAIHAESKL >gi568815597f:103625744_103750469|GENSCAN_predicted_CDS_4|1413_bp atgaagctcttttggttgcttttcaccattgggttctgctgggctcagtattcctcaaat acacaacaaggacgaacatctattgttcatctgtttgaatggcgatgggttgatattgct cttgaatgtgagcgatatttagctcccaagggatttggaggggttcaggtctctccacca aatgaaaatgttgccattcacaaccctttcagaccttggtgggaaagataccaaccagtt agctataaattatgcacaagatctggaaatgaagatgaatttagaaacatggtgactaga tgcaacaatgttggggttcgtatttatgtggatgctgtaattaatcatatgtgtggtaat gctgtgagtgcaggaacaagcagtacctgtggaagttacttcaaccctggaagtagggac tttccagcagtcccatattctggatgggattttaatgatggtaaatgtaaaactggaagt ggagatatcgagaactataatgatgctactcaggtcagagattgtcgtctgtctggtctt ctcgatcttgcactggggaaggattatgtgcgttctaagattgccgaatatatgaaccat ctcattgacattggtgttgcagggttcagaattgatgcttccaagcacatgtggcctgga gacataaaggcaattttggacaaactgcataatctaaacagtaactggttcccggaaggt agtaaacctttcatttaccaggaggtaattgatctgggtggtgagccaattaaaagcagt gactactttggtaatggccgggtgacagaattcaagtatggtgcaaaactcggcacagtt attcgcaagtggaatggagagaagatgtcttacttaaagctgtacaaaatggcagttgga tttatgcttgctcatccttatggatttacacgagtaatgtcaagctaccgttggccaaga tattttgaaaatggaaaagatgttaatgattgggttgggccaccaaatgataatggagta actaaagaagttactattaatccagacactacttgtggcaatgactgggtctgtgaacat cgatggcgccaaataaggaacatggttaatttccgcaatgtagtggatggccagcctttt acaaactggtatgataatgggagcaaccaagtggcttttgggagaggaaacagaggattc attgttttcaacaatgatgactggacattttctttaactttgcaaactggtcttcctgct ggcacatactgtgatgtcatttctggagataaaattaatggcaactgcacaggcattaaa atctacgtttctgatgatggcaaagctcatttttctattagtaactctgctgaagatcca tttattgcaattcatgctgaatctaaattgtaa >gi568815597f:103625744_103750469|GENSCAN_predicted_peptide_5|645_aa MAPTVPSTPYPVGRPPSPEPTAPRPPRVDKNKSETAGKSTSLAARLTAQDGNSNALREQR YTRTDEDPNEGPDMERLRGYREALIERLKKGAQKGTNVNKVSEVIQGKEESPAQFYQRPC EAYCMYTPFDLESPENQPLINTALVIQSAEDIQRKLQKQARFAGMKTLQLLERANEVFVN RDATSGQESRKEGERQARFKKSPTIFGEASVGDLQMFPAKDLGCILLQYVDDLLLGHSMA VRCAKRTGALLRHLEDCGYKVPKKKAQICRQQHNAKQGPSVPRGIEASGAAHFEDLQVDF TEMPKCRSNKYLLVLVFTYSKFGLPLRIGSNNRPAFVADLIQKTAKVRDCRLTGLLDLAL EKDYVRSKIAEYMNHLIDIGVAGFRLDASKLMWPGDIKAILDKLHNLNSNWFPAGSKPFI YQEVIDLGGEPIKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKLYKMAVGFMLAH PYGFTRVMSSYRWPRQFQNGNDVNDWVGPPNNNGVIKEVTINPDTTCGNDCVCEHRWRQI RNMVIFRNVVDGQPFTNWYDNGSNQVAFGRGNRGFIVFNNDDWSFSLTLQTGLPAGTYCD VISGDKINGNCTGIKIYVSDDGKAHFSISNSAEDPFIAIHAESKL >gi568815597f:103625744_103750469|GENSCAN_predicted_CDS_5|1938_bp atggcaccaacagtgccctcaaccccttatccagtggggaggcccccttctcctgagccc acagcccctagaccacccagagtagacaagaacaaaagtgaaactgcgggaaaatccact tccctggcagcccgcttaacggcccaagacgggaattcaaatgccctgagagagcagcga tatactaggacagatgaggatccaaatgaaggaccagatatggaaaggctaagagggtac cgagaggcattaattgaaaggttgaaaaaaggggctcaaaagggtaccaatgtaaataaa gtttctgaagtcatccaaggaaaggaggaaagcccagcccagttctatcaaagaccgtgt gaggcctattgcatgtacactcctttcgatctggagagtcctgaaaatcagccgttgatt aatacggccttagttattcagagtgcagaagatatccagagaaaattgcaaaaacaggct aggtttgcaggaatgaaaaccttgcagttactggaaagagctaatgaagtatttgtaaat agagatgcaacaagcggccaagaaagccgtaaggagggcgaacgccaggccaggttcaaa aagtcccccaccatctttggggaggcctcggttggagacctccaaatgtttcctgctaaa gacctaggttgcatcctgctccagtatgtagatgaccttctgctaggacactccatggca gtcaggtgtgcaaaaaggacgggtgccctgcttcgacacctggaggactgtggatataaa gtgcccaaaaagaaagctcagatctgcagacagcagcacaatgcgaagcaaggcccctct gtacctcggggaatagaagcctctggagcagctcattttgaagatcttcaagtggacttc acagaaatgcctaaatgtagaagtaacaagtatttactggttctagtgtttacttactct aagtttggactgcctctacgaattggctcaaataataggccagcatttgttgctgacttg atacagaagacggcaaaggtcagagattgtcgtctgactggtcttcttgatcttgcactg gagaaggattacgtgcgttctaagattgccgaatatatgaaccatctcattgacattggt gttgcagggttcagacttgatgcttccaagctcatgtggcctggagacataaaggcaatt ttggacaaactgcataatctaaacagtaactggttccctgcaggaagtaaacctttcatt taccaggaggtaattgatctgggtggtgagccaattaaaagcagtgactactttggtaat ggccgggtgacagaattcaagtatggtgcaaaactcggcacagttattcgcaagtggaat ggagagaagatgtcttacttaaagctgtacaaaatggcagttggatttatgcttgctcat ccttacggatttacacgagtaatgtcaagctaccgttggccaagacagtttcaaaatgga aacgatgttaatgattgggttgggccaccaaataataatggagtaattaaagaagttact attaatccagacactacttgtggcaatgactgcgtctgtgaacatcgatggcgccaaata aggaacatggttattttccgcaatgtagtggatggccagccttttacaaattggtatgat aatgggagcaaccaagtggcttttgggagaggaaacagaggattcattgttttcaacaat gatgactggtcattttctttaactttgcaaactggtcttcctgctggcacatactgtgat gtcatttctggagataaaattaatggcaattgcacaggcattaaaatttacgtttctgat gatggcaaagctcatttttctattagtaactctgctgaagatccatttattgcaattcat gctgaatctaaattgtaa >gi568815597f:103625744_103750469|GENSCAN_predicted_peptide_6|430_aa CCVPLKGTEIVHLGSSDFETVAIRCSQRFLVTASFRSSTVVKGQAAAVVVAKGHLFQESS RSTGQEKPAPKVLFDPELEDSRQEMAPTVPSTPYPVGRPPSPEPTAPRPPRVDKNKSETA GKSTSLAARLTAQDGNSNALREQRYTRTDEDPNEGPDMERLRGYREALIERLKKGAQKGT NVNKVSEVIQGKEESPAQFYQRPCEAYCMYTPFDLESPENQPLINTALVIQSAEDIQRKL QKQARFAGMKTLQLLERANEVFVNRDATSGQESRKEGERQARFKKSPTIFREASVGDLQM FPAKDLGCILLQYVDDLLLGHSMAVRCAKRTGALLRHLEDCGYKVPKKKAQICRQQHNAK QGPSVPRGIEASGAAHFEDLQVDFTEMPKCRSDIVWIKDWNVAPLWPQWKGPQTINLTTP TAVNVEGIPA >gi568815597f:103625744_103750469|GENSCAN_predicted_CDS_6|1293_bp tgctgtgtgcccttaaaagggacagaaattgtgcacttggggagctcggattttgagaca gtagctatccgatgctcccagagattcctagttacagctagttttagatcctctacagtg gtaaaaggacaggcagcagcagtagtagtagcaaagggacatttatttcaggaaagttct cgctccaccggccaagagaagccagcacctaaagttctgtttgacccagagctcgaggac tcaaggcaggagatggcaccaacagtgccctcaaccccttatccagtggggaggccccct tctcctgagcccacagcccctagaccacccagagtagacaagaacaaaagtgaaactgcg ggaaaatccacttccctggcagcccgcttaacggcccaagacgggaattcaaatgccctg agagagcagcgatatactaggacagatgaggatccaaatgaaggaccagatatggaaagg ctaagagggtaccgagaggcattaattgaaaggttgaaaaaaggggctcaaaagggtacc aatgtaaataaagtttctgaagtcatccaaggaaaggaggaaagcccagcccagttctat caaagaccgtgtgaggcctattgcatgtacactcctttcgatctggagagtcctgaaaat cagccgttgattaatacggccttagttattcagagtgcagaagatatccagagaaaattg caaaaacaggctaggtttgcaggaatgaaaaccttgcagttactggaaagagctaatgaa gtatttgtaaatagagatgcaacaagcggccaagaaagccgtaaggagggcgaacgccag gccaggttcaaaaagtcccccaccatctttcgggaggcctcggttggagacctccaaatg tttcctgctaaagacctaggttgcatcctgctccagtatgtagatgaccttctgctagga cactccatggcagtcaggtgtgcaaaaaggacgggtgccctgcttcgacacctggaggac tgtggatataaagtgcccaaaaagaaagctcagatctgcagacagcagcacaatgcgaag caaggcccctctgtacctcggggaatagaagcctctggagcagctcattttgaagatctt caagtggacttcacagaaatgcctaaatgtagaagtgacattgtgtggatcaaggactgg aacgtggctccgctgtggccacagtggaaaggaccccagacaattaacctgaccactccc acagctgtcaatgtagaaggaatcccagcctag