GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:09:43 Sequence gi568815584f:76054364_76273694 : 219331 bp : 43.89% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 3689 3908 220 2 1 82 42 146 0.508 5.91 1.02 PlyA + 4161 4166 6 1.05 2.00 Prom + 8789 8828 40 -3.96 2.01 Init + 17225 17334 110 1 2 78 89 50 0.748 3.79 2.02 Intr + 20788 20925 138 1 0 122 67 29 0.403 3.78 2.03 Intr + 22234 22328 95 1 2 44 96 46 0.686 0.61 2.04 Intr + 27932 28004 73 0 1 108 96 59 0.997 7.16 2.05 Intr + 28254 28329 76 0 1 111 105 134 0.997 16.92 2.06 Intr + 28864 28926 63 0 0 124 77 91 0.985 10.51 2.07 Term + 29095 29214 120 0 0 61 52 52 0.325 -2.63 2.08 PlyA + 29310 29315 6 1.05 3.04 PlyA - 30633 30628 6 1.05 3.03 Term - 37492 37407 86 2 2 99 43 70 0.605 1.42 3.02 Intr - 40853 40784 70 2 1 64 100 67 0.338 4.25 3.01 Init - 44843 44724 120 2 0 17 96 117 0.301 4.17 3.00 Prom - 49816 49777 40 -5.26 4.08 PlyA - 52362 52357 6 1.05 4.07 Term - 59036 58949 88 1 1 82 54 113 0.700 4.43 4.06 Intr - 59812 59668 145 2 1 67 70 50 0.030 0.44 4.05 Intr - 63596 63455 142 2 1 82 24 60 0.009 -1.07 4.04 Intr - 71716 71614 103 0 1 107 48 71 0.598 5.18 4.03 Intr - 72053 71966 88 0 1 100 53 34 0.683 0.13 4.02 Intr - 76745 76654 92 1 2 89 84 38 0.740 3.14 4.01 Init - 79982 79936 47 2 2 100 90 24 0.762 4.20 4.00 Prom - 81381 81342 40 -4.66 5.00 Prom + 86459 86498 40 -1.76 5.01 Init + 100001 100662 662 1 2 69 81 674 0.573 59.53 5.02 Intr + 112300 112364 65 1 2 103 90 31 0.395 3.16 5.03 Intr + 117480 117656 177 1 0 69 105 69 0.865 6.59 5.04 Intr + 123529 123568 40 2 1 79 110 22 0.195 0.78 5.05 Intr + 141515 141609 95 2 2 84 60 74 0.050 3.81 5.06 Intr + 148074 148165 92 1 2 78 33 74 0.511 0.61 5.07 Term + 152580 152636 57 2 0 72 43 79 0.280 -0.51 5.08 PlyA + 152964 152969 6 1.05 6.05 PlyA - 153498 153493 6 1.05 6.04 Term - 174287 174250 38 0 2 111 53 62 0.266 2.50 6.03 Intr - 181505 181454 52 2 1 74 100 8 0.229 -0.92 6.02 Intr - 184643 184476 168 2 0 62 72 91 0.731 5.04 6.01 Init - 184871 184785 87 2 0 50 94 45 0.700 1.94 6.00 Prom - 185632 185593 40 -6.06 7.00 Prom + 186929 186968 40 -3.56 7.01 Init + 192332 192392 61 1 1 72 54 55 0.244 1.91 7.02 Intr + 195073 195166 94 2 1 90 84 38 0.398 2.62 7.03 Intr + 197688 197745 58 0 1 45 115 25 0.050 -0.21 7.04 Intr + 202090 202176 87 0 0 80 48 48 0.008 0.17 7.05 Intr + 209737 209790 54 0 0 93 100 36 0.017 4.48 7.06 Term + 210322 210465 144 0 0 96 48 142 0.038 8.91 7.07 PlyA + 211880 211885 6 1.05 8.03 PlyA - 212722 212717 6 1.05 8.02 Term - 213394 213342 53 2 2 76 44 39 0.415 -4.11 8.01 Init - 213922 213787 136 1 1 69 41 153 0.412 9.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 146607 146652 46 2 1 86 123 12 0.932 5.35 S.002 Init - 210432 210300 133 0 1 89 69 190 0.890 15.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:76054364_76273694|GENSCAN_predicted_peptide_1|73_aa XPRGWFLCTLTCENHYHRYACISQALSLLTHHHSEGPLVLRVKGTEKQFQSLKPELRGED IPFCSLQVASADS >gi568815584f:76054364_76273694|GENSCAN_predicted_CDS_1|222_bp nngccccgcggatggttcttatgcaccctgacgtgtgagaaccactaccatagatacgcg tgtatcagccaggccctgtcactcctcacccaccatcactcagaaggtcctctcgtcctt cgtgtgaagggcacagaaaagcagttccaaagtctgaagcctgaactgcgaggcgaggac atccctttttgctcgttgcaagtcgcaagtgctgacagctaa >gi568815584f:76054364_76273694|GENSCAN_predicted_peptide_2|224_aa MGSSTITRSLLPPYRYTIGTHSLKITDNRKTVKFRKWATRMLRMVSPVPHLLQHGSVATV AVFRLFFSYLPVTSTALDPALISGTQTGKQQLDLNACYHKTHHRDLGLASLEEADIPIIP DLEEVQEEDFVLQVAAPPSIQIKRVMTYRDLDNDLMKYSAIQTLDGEIDLKLLTKVLAPE HEVREDDVGWDWDHLFTEVSSEVLTEWDPLQTEKEDPAGQARHT >gi568815584f:76054364_76273694|GENSCAN_predicted_CDS_2|675_bp atgggcagcagcaccatcacacggtctttacttcctccatacaggtacacgataggcaca catagcctcaagatcacagataataggaaaacagtaaaattcaggaagtgggccacacgc atgctgcgtatggtcagtcctgttccacaccttttgcagcatggctccgtagccactgtc gcagttttcagactctttttctcctacctgccagtaacgagcacagccctggacccagct ctaatcagcggtacccaaacaggcaaacaacagctggatctgaacgcatgctatcacaaa acgcatcacagagatttggggctggcttcattggaagaggcagatattcctatcattccg gatctggaggaagtacaggaagaagactttgttttgcaggtggcagcccctcccagcatc cagataaagcgggtgatgacctaccgtgacctggacaatgacctcatgaagtactcagcc attcagacactggatggggagatcgacctgaaactcctcaccaaagtgctcgcgccggag cacgaagtccgggaggatgatgtcggctgggactgggaccatctgttcactgaggtgtcc tcagaggtcctcactgagtgggacccactgcagacggagaaggaggaccctgcggggcag gccaggcacacctga >gi568815584f:76054364_76273694|GENSCAN_predicted_peptide_3|91_aa MCAAALGLPERGAGGCCTCRALSLHPTEGSRRRRVQLRWQTQGKQLSGPLKGGQKENLMS LVQGLLSWVGFHIGAYLNERGICYWRPTGDP >gi568815584f:76054364_76273694|GENSCAN_predicted_CDS_3|276_bp atgtgtgcagcagccctggggttaccggagagaggggctggcggctgctgcacctgccgg gctctgtcgctgcaccccactgagggctcccggaggcgccgcgtgcagctgcgatggcag actcaaggaaaacaacttagtggcccacttaaaggtggtcagaaggaaaacctgatgtcc ttggtgcaaggccttctcagttgggttggattccacattggtgcttacctgaatgaaaga gggatctgctactggcgccccaccggggatccctga >gi568815584f:76054364_76273694|GENSCAN_predicted_peptide_4|234_aa MDGKPPLRRYRFISARIPCRGQRDVHWATCPHQMQSLEEMRVDESQGLFSPYDFSSRMTQ TSLYGDLELQERKQKLLCAEGFGEHLIEWRVLLSSEATDAAGSLGALTSPGVPLIKMNGL PRRSLGPHGWQPAGDSSPQLEQFICCNYWQDNTVEGSVASRETLNDTVLKEDMGCGQERD SLTYIALTTLIIITKALPVEYPLRTSSPVWASDMIPDSQGAAKNLLEKVSSQRA >gi568815584f:76054364_76273694|GENSCAN_predicted_CDS_4|705_bp atggacgggaagcctccgctgaggagatatcgcttcatttcggcaaggatcccctgcagg ggccagagggatgttcactgggcaacgtgtcctcaccaaatgcagagtttggaagagatg cgagtggatgagagccagggtcttttctcgccatatgacttctctagtaggatgacccag acctctttatatggagacttagaactccaagaaagaaagcagaagttgctttgtgcagag ggctttggggaacatctcattgaatggagggtgctgctctccagtgaggccacagatgct gctgggtcccttggtgctctaacttcacctggtgtacctcttattaaaatgaatggactc ccacggcgcagcctggggccgcatggctggcagccagccggggacagcagccctcaactg gagcagtttatctgctgtaattactggcaggacaacacagtggagggttctgtggcttca agggaaaccttgaacgacacagttttgaaggaggatatgggctgtggtcaagaaagagac agtctcacctacattgcattgacgaccttgataataataacaaaggcactgcctgttgag taccccctacgaaccagctccccagtctgggcctccgacatgatcccagacagccaggga gcagccaagaacctgctggaaaaagtcagctcccaacgagcctga >gi568815584f:76054364_76273694|GENSCAN_predicted_peptide_5|395_aa MDELVHDLASALEQTSEQNKLGELWEEMALSPRQQRRQLRKRRGRKRRSDFTHLAEHTCC YSEASESSLDEATKDCREVAPVTNFSDSDDTMVAKRHPALNAIVKSKQHSWHESDSFTEN APCRPLRRRRKVKRVTSEVAASLQQKLKVSDWSYERGCRFKSAKKQRLSRWKENTPWTSS GHGLCESAENRTFLSKTGRKERMECETDEQKQGSDENMSECETSSVCSSSDTGLFTNDEG RQGDDEQSDWFYEGECVPGFTVPNLLPKWAPDHCSEVERMDSGLDKFSDSTFLLPSRPAQ RVSGYHQLGKENEIRQANVHWGPPCSRDIKRKRKPVATASLSSPSADCLQKLRQLISSSS CIVFYLIFRVYVLLKEDLLKDLFEELTKDAIKGYV >gi568815584f:76054364_76273694|GENSCAN_predicted_CDS_5|1188_bp atggatgagctggtacacgacttagcctcagccttggagcagacatctgagcagaataag cttggtgaactgtgggaggagatggcgctgagcccccgacagcagaggcggcagcttcgg aaacgccgaggtcggaagcgtcgttctgacttcactcacctggcagagcatacctgctgc tacagcgaggcctctgagtcaagtctggatgaggccactaaggactgtcgagaagtggct ccggtgaccaattttagtgactctgatgacacaatggtagccaaacgacacccagctctc aatgccattgtcaagagtaagcaacattcttggcatgaatctgactcctttactgaaaat gcaccttgtcgaccactcaggcgcaggcggaaggtgaagcgagtgacatcagaggtggct gctagccttcagcagaagctgaaggtgtcagattggagctatgagagaggctgcaggttc aagtctgctaagaagcagcgtctgtcccgctggaaggagaatactccctggacctcatca ggccacggattgtgtgaatcagcagaaaataggactttcctaagcaaaacaggaaggaaa gaaaggatggagtgtgaaacagatgaacaaaaacagggctctgatgagaacatgtcagaa tgtgaaaccagcagtgtgtgtagcagcagtgacactgggctctttaccaatgatgaaggg cgacaaggtgatgacgaacagagtgattggttctatgaaggagaatgtgtcccaggattc actgtccctaatcttctgcccaagtgggctcctgatcattgttctgaagtagaaagaatg gattctggattggataaattttcagattccacattccttttaccttctcggccagctcaa agagtttctggttaccatcagctgggaaaagagaacgaaatcagacaggcaaatgtacac tggggaccaccatgttcacgtgacatcaagaggaagcggaaaccagtggccacagcatct ttgtctagccccagtgcagattgtctgcagaaactcagacagctcatcagcagcagcagc tgtattgtattttacctcatcttcagagtttatgttctactgaaagaggacctcctgaaa gatctttttgaagaattgacaaaggatgctatcaagggctatgtttag >gi568815584f:76054364_76273694|GENSCAN_predicted_peptide_6|114_aa MKIVEKFNTGQYRMRMLVKSKYLLPHDSHDGLRERKGSFFRTVVLHDTKTDSNSDTETNS STPPRTLLEMQILSPTWTYEISSSRKVTLRPVWAGCKEIMGEGIYSNETVYGLC >gi568815584f:76054364_76273694|GENSCAN_predicted_CDS_6|345_bp atgaaaatcgtggaaaaattcaacactggtcagtatagaatgcgcatgctcgtaaagtct aaatatttgcttcctcatgactcccacgatggtttaagggaaaggaaaggaagtttcttt agaacagtggttctccatgataccaagactgacagcaacagcgacactgagaccaacagc agcacaccacctagaaccctgttagaaatgcaaattctcagtcccacctggacctatgaa atcagcagctctaggaaagtcaccctaaggccagtgtgggcaggctgcaaggagatcatg ggggaaggaatttatagcaacgaaacagtctacgggctgtgctga >gi568815584f:76054364_76273694|GENSCAN_predicted_peptide_7|165_aa MTTLENLTDSKRTLVTEDALDGAFYLFLTTLFTGKESKKDYELFSVIDFLKGWSLLVKAK EQVGVLKMKWMCLAHNICKYLLDERASSTGFCGFAIMEGPVVRLLLAKDRQRREDGGKAR HHDAKCGLLIKDSIESWSLCGESSSVGSEPDLRRHGYRQFCCDSP >gi568815584f:76054364_76273694|GENSCAN_predicted_CDS_7|498_bp atgacaacgcttgaaaacctgactgactctaagagaacactggtgactgaagatgcactt gacggagctttctacttgtttctgaccacactatttactggtaaagaatccaagaaagat tatgaattatttagtgttattgacttcctaaaaggctggtcattgttggtgaaagctaag gagcaagtaggagttctcaagatgaaatggatgtgcttggcacataatatatgcaaatac ttgttggatgaaagggcctctagtactggtttctgtggatttgccatcatggaaggccct gtggtccggctgctgctggctaaggacaggcagaggagggaggatggaggaaaggccaga caccacgatgccaaatgcgggctgctcatcaaagactccattgagagctggtcgctgtgt ggggaaagcagcagtgttggctccgaacctgacctgaggaggcatggctaccgtcagttc tgttgtgacagcccttga >gi568815584f:76054364_76273694|GENSCAN_predicted_peptide_8|62_aa MKLLARTPHEQKDGCGTSRKQRCSSNDSSRIPLRRAQEGGGSKVDGGEQFPGNETRPEER RA >gi568815584f:76054364_76273694|GENSCAN_predicted_CDS_8|189_bp atgaagctgctggcccgtactccgcatgaacagaaggacggctgtgggacctctagaaaa cagcggtgttcctccaacgacagttctagaattcctctaagacgagcacaggaaggtgga ggttccaaggtggacggaggggagcagtttcctggaaatgaaactcgtccagaggaacgt agggcctag