GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:04:35 Sequence gi568815594f:133050141_133263282 : 213142 bp : 34.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 801 840 40 -3.15 1.01 Init + 1720 1782 63 0 0 78 100 31 0.112 4.50 1.02 Intr + 19235 19330 96 1 0 57 97 51 0.556 2.19 1.03 Term + 22241 22387 147 1 0 45 37 182 0.317 5.72 1.04 PlyA + 24743 24748 6 1.05 2.04 PlyA - 25046 25041 6 1.05 2.03 Term - 45810 45682 129 0 0 97 42 138 0.756 7.30 2.02 Intr - 65506 65356 151 0 1 79 86 33 0.288 1.44 2.01 Init - 66278 66229 50 2 2 72 92 16 0.493 0.88 2.00 Prom - 70402 70363 40 -1.75 3.00 Prom + 73914 73953 40 -8.25 3.01 Init + 74132 74210 79 1 1 83 45 48 0.456 1.27 3.02 Intr + 74838 75037 200 1 2 15 91 140 0.405 5.15 3.03 Term + 75272 75463 192 1 0 101 43 132 0.917 6.44 3.04 PlyA + 77460 77465 6 1.05 4.00 Prom + 90281 90320 40 -5.35 4.01 Sngl + 98776 99504 729 0 0 16 44 390 0.779 23.37 4.02 PlyA + 99696 99701 6 1.05 5.00 Prom + 99793 99832 40 -13.49 5.01 Init + 100034 102631 2598 1 0 38 94 3403 0.718 323.45 5.02 Intr + 104167 104225 59 0 2 92 80 17 0.347 -1.94 5.03 Intr + 104777 104883 107 2 2 68 119 129 0.732 12.94 5.04 Intr + 112837 113142 306 2 0 100 80 315 0.777 27.40 5.05 Intr + 183504 183618 115 1 1 5 92 104 0.378 1.09 5.06 Term + 184038 184146 109 0 1 85 49 90 0.382 1.80 5.07 PlyA + 184900 184905 6 1.05 6.03 PlyA - 186410 186405 6 1.05 6.02 Term - 190336 190149 188 2 2 78 36 210 0.705 11.47 6.01 Init - 194092 194047 46 1 1 62 80 72 0.697 4.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:133050141_133263282|GENSCAN_predicted_peptide_1|101_aa MDTKNGTTDTRTYLKMEAVRKALAVVSRSGVIPRPKTVDWGISGHTTALLPGKNIGTEKC GIAIKIPENVEATLELGNRQRLEQFGGLRRRQEDERSFGTS >gi568815594f:133050141_133263282|GENSCAN_predicted_CDS_1|306_bp atggacacaaagaatggaacgacagacaccaggacctacttgaagatggaggctgtgagg aaggcactggctgtggtaagcaggagtggagtaatccccaggcccaagacagtggattgg ggcatcagtggccacactacagccctgctaccagggaagaatattggtactgagaagtgt ggcattgctataaagatacccgaaaatgtagaagcaactttggaactgggtaacaggcag aggttggaacagtttggagggctcagaagaagacaggaagatgagagaagttttggaact tcctag >gi568815594f:133050141_133263282|GENSCAN_predicted_peptide_2|109_aa MSHGARPLRSLKVIYDGLMALFTTSLIALLSSRGKNVAIEYIKIHTIEKEDVHFCKQKIT NRMLKLKVLYGEDGNLLTQSQNALNPHILASGVLLLQYLFQTKSRKDSN >gi568815594f:133050141_133263282|GENSCAN_predicted_CDS_2|330_bp atgagccacggcgcccggcccttaagaagtcttaaagtcatctatgatgggctaatggca ctcttcacgacaagcctgattgcactgttaagctccagaggaaaaaatgtggctatagag tatattaaaatacatacaattgaaaaggaagatgttcatttttgcaagcagaagattacc aacagaatgctaaaattaaaggttctctatggtgaagatggcaacttgctgactcagtct cagaatgcgttaaatccacatattctggcaagtggagttctgcttctccaatatctcttc cagactaagtctcgaaaggacagtaattga >gi568815594f:133050141_133263282|GENSCAN_predicted_peptide_3|156_aa MDIKYGHIRKNSQVQFQVEVVYSLAESNRAASIVNSNLQSHGLETCDPLLPKPATSAESP SSPDYSTGSSKGGSARRISRPGMALGAPEMCNQHPQQISFIAVQPFPQDSSSPQDSSSPS PTVPLSNSSQNYMQPPQQIQIFHCPLGQYSNSNPKY >gi568815594f:133050141_133263282|GENSCAN_predicted_CDS_3|471_bp atggacatcaaatatggtcacatcagaaaaaacagccaagtccagttccaagttgaagtt gtgtatagcctagcagaatccaacagagcagcatcaatagtgaactcaaatcttcagagc catggccttgaaacatgtgaccctttgctaccaaagccagctacttcagcagaatctcca tcttcacctgactatagcactggaagcagtaaaggtggcagtgcaagaaggatctccagg ccaggtatggcactaggtgccccggaaatgtgcaaccagcatcctcagcagataagcttc attgcagtgcagcccttcccacaggacagctcctccccacaggacagctcctccccatct ccaactgttccactcagcaattcttcccagaactacatgcagccccctcaacagatccag atttttcactgtcctcttggacagtattctaattccaacccgaaatattga >gi568815594f:133050141_133263282|GENSCAN_predicted_peptide_4|242_aa MLASLEIPLLGEDIAAVAIMFSHLLMQTGEPLYSLPITCGWGLGTFRGIVPSQRLGTCGA VLALTLTLTPAARARVLSRPARLPRDVSVWRPVWPIIGSSVGGAPGRLVGRMRALRKNGE AERSELRNCQPRPQAQRLKQEEGRTGRKKRQVRGKEAWEENSRKETAHYTYREASNGGDE DRGNQDSERQKIQIERKRKKMSRRTSIRRNEENESFKLQSRSVLFRHKIISLILSPFAFA SR >gi568815594f:133050141_133263282|GENSCAN_predicted_CDS_4|729_bp atgttagcatcattggaaatcccgctcctaggagaggatatcgcagctgttgctattatg ttttcgcatttgctgatgcaaacaggggaacctctctattccctccccattacctgcggc tggggactgggaactttccgcggcattgtccccagccagcgcctcgggacctgcggggct gttctcgctctcacactcacactcaccccggccgctcgggcgagagttctgtcccgcccc gcccggctgccgcgtgacgtgtctgtttggcggccagtttggccaatcatcggcagctcg gtgggtggtgctcctgggcgattggttggcagaatgagggcgctgcgcaaaaacggagaa gccgagcgctcggagctcagaaactgccagcccagaccacaggctcagaggctgaagcag gaggaaggaaggactggaaggaaaaagagacaggttagagggaaagaggcttgggaagaa aacagcagaaaagaaactgctcattacacttacagagaggcaagtaacggtggagatgag gacagagggaaccaagactctgaaagacaaaaaatacaaatagagcgaaagaggaaaaaa atgtcaagaagaacatccatccggagaaatgaagagaatgaaagttttaagctgcagagc cgttctgtgcttttccggcacaaaattatatcgctgattttaagcccttttgcatttgcc agccgttga >gi568815594f:133050141_133263282|GENSCAN_predicted_peptide_5|1097_aa MVEGVFSQLHYTVQEEQEHGTFVGNIAEDLGLDITKLSARGFQTVPNSRTPYLDLNLETG VLYVNEKIDREQICKQSPSCVLHLEVFLENPLELFQVEIEVLDINDNPPSFPEPDLTVEI SESATPGTRFPLESAFDPDVGTNSLRDYEITPNSYFSLDVQTQGDGNRFAELVLEKPLDR EQQAVHRYVLTAVDGGGGGGVGEGGGGGGGAGLPPQQQRTGTALLTIRVLDSNDNVPAFD QPVYTVSLPENSPPGTLVIQLNATDPDEGQNGEVVYSFSSHISPRARELFGLSPRTGRLE VSGELDYEESPVYQVYVQAKDLGPNAVPAHCKVLVRVLDANDNAPEISFSTVKEAVSEGA APGTVVALFSVTDRDSEENGQVQCELLGDVPFRLKSSFKNYYTIVTEAPLDREAGDSYTL TVVARDRGEPALSTSKSIQVQVSDVNDNAPRFSQPVYDVYVTENNVPGAYIYAVSATDRD EGANAQLAYSILECQIQGMSVFTYVSINSENGYLYALRSFDYEQLKDFSFQVEARDAGSP QALAGNATVNILIVDQNDNAPAIVAPLPGRNGTPAREVLPRSAEPGYLLTRVAAVDADDG ENARLTYSIVRGNEMNLFRMDWRTGELRTARRVPAKRDPQRPYELVIEVRDHGQPPLSST ATLVVQLVDGAVEPQGGGGSGGGGSGEHQRPSRSGGGETSLDLTLILIIALGSVSFIFLL AMIVLAVRCQKEKKLNIYTCLASDCCLCCCCCGGGGSTCCGRQARARKKKLSKSDIMLVQ SSNVPSNPAQVPIEESGGFGSHHHNQNYCYQVCLTPESAKTDLMFLKPCSPSRSTDTEHN PCGAIVTGYTDQQPDIISNGSILSNETKHQRAELSYLVDRPRRVNSSAFQEADIVSSKDS GHGDSEQGDSDHDATNRAQSAGMDLFSNCTEECKALGHSDRCWMPSFVPSDGRQAADYRS NLHVPGMDSVPDTEVFETPEAQPGAERSFSTFGKEKALHSTLERKELDGLLTNTRAPYKP PYLNLQRENTNSGWRCEADTEVEEGGGWEPYVGYSKWLLGNGLGALQIPSHTQNLTQRVQ RTKPPADLGPSGLQHVT >gi568815594f:133050141_133263282|GENSCAN_predicted_CDS_5|3294_bp atggtggaaggagtcttttcccagcttcactacacggtacaggaggagcaggaacatggc actttcgtggggaatatcgctgaagatctgggtctggacattacaaaactttcggctcgc gggtttcagacggtgcccaactcaaggaccccttacttagacctcaacctggagacaggg gtgctgtacgtgaacgagaaaatagaccgcgaacaaatctgcaaacagagcccctcctgt gtcctgcacctggaggtctttctggagaaccccctggagctgttccaggtggagatcgag gtgctggacattaatgacaaccccccctctttcccggagccagacctgacggtggaaatc tctgagagcgccacgccaggcactcgcttccccttggagagcgcattcgacccagacgtg ggcaccaactccttgcgcgactacgagatcacccccaacagctacttctccctggacgtg cagacccagggggatggcaaccgattcgctgagctggtgctggagaagccactggaccga gagcagcaagcggtgcaccgctacgtgctgaccgcggtggacggaggaggtgggggagga gtaggagaaggagggggaggtggcgggggagcaggcctgcccccccagcagcagcgcacc ggcacggccctactcaccatccgagtgctggactccaatgacaatgtgcccgctttcgac caacccgtctacactgtgtccctaccagagaactctcccccaggcactctcgtgatccag ctcaacgccaccgacccggacgagggccagaacggtgaggtcgtgtactccttcagcagc cacatttcgccccgggcgcgggagcttttcggactctcgccgcgcactggcagactggag gtaagcggcgagttggactatgaagagagcccagtgtaccaagtgtacgtgcaagccaag gacctgggccccaacgccgtgcctgcgcactgcaaggtgctagtgcgagtactggatgct aatgacaacgcgccagagatcagcttcagcaccgtgaaggaagcggtgagtgagggcgcg gcgcccggcactgtggtggcccttttcagcgtgactgaccgcgactcagaggagaatggg caggtgcagtgcgagctactgggagacgtgcctttccgcctcaagtcttcctttaagaat tactacaccatcgttaccgaagcccccctggaccgagaggcgggggactcctacaccctg actgtagtggctcgggaccggggcgagcctgcgctctccaccagtaagtcgatccaggta caagtgtcggatgtgaacgacaacgcgccgcgtttcagccagccggtctacgacgtgtat gtgactgaaaacaacgtgcctggcgcctacatctacgcggtgagcgccaccgaccgggat gagggcgccaacgcccagcttgcctactctatcctcgagtgccagatccagggcatgagc gtcttcacctacgtttctatcaactctgagaacggctacttgtacgccctgcgctccttc gactatgagcagctgaaggacttcagttttcaggtggaagcccgggacgctggcagcccc caggcgctggctggtaacgccactgtcaacatcctcatagtggatcaaaatgacaacgcc cctgccatcgtggcgcctctaccagggcgcaacgggactccagcgcgtgaggtgctgccc cgctcggcggagccgggttacctgctcacccgcgtggccgccgtggacgcggacgacggc gagaacgcccggctcacttacagcatcgtgcgtggcaacgaaatgaacctctttcgcatg gactggcgcaccggggagctgcgcacagcacgccgagtcccggccaagcgcgacccccag cggccttatgagctggtgatcgaggtgcgcgaccatgggcagccgcccctttcctccacc gccaccctggtggttcagctggtggatggcgccgtggagccccagggcgggggcgggagc ggaggcggagggtcaggagagcaccagcgccccagtcgctctggcggcggggaaacctcg ctagacctcaccctcatcctcatcatcgcgttgggctcggtgtccttcatcttcctgctg gccatgatcgtgctggccgtgcgttgccaaaaagagaagaagctcaacatctatacttgt ctggccagcgattgctgcctctgctgctgctgctgcggtggcggaggttcgacctgctgt ggccgccaagcccgggcgcgcaagaagaaactcagcaagtcagacatcatgctggtgcag agctccaatgtacccagtaacccggcccaggtgccgatagaggagtccgggggctttggc tcccaccaccacaaccagaattactgctatcaggtatgcctgacccctgagtccgccaag accgacctgatgtttcttaagccctgcagcccttcgcggagtacggacactgagcacaac ccctgcggggccatcgtcaccggttacaccgaccagcagcctgatatcatctccaacgga agcattttgtccaacgagactaaacaccagcgagcagagctcagctatctagttgacaga cctcgccgagttaacagttctgcattccaggaagccgacatagtaagctctaaggacagt ggtcatggagacagtgaacagggagatagtgatcatgatgccaccaaccgtgcccagtca gctggtatggatctcttctccaattgcactgaggaatgtaaagctctgggccactcagat cggtgctggatgccttcttttgtcccttctgatggacgccaggctgctgattatcgcagc aatctgcatgttcctggcatggactctgttccagacactgaggtgtttgaaactccagaa gcccagcctggggcagagcggtccttttccacctttggcaaagagaaggcccttcacagc actctggagaggaaggagctggatggactgctgactaatacgcgagcgccttacaaacca ccatatttgaatcttcagagagaaaacaccaatagtggatggagatgtgaagcagacact gaggttgaggaaggaggaggctgggaaccctacgtggggtattccaaatggctcctggga aatggcctgggagcacttcagatacccagtcacacccagaacctgactcagagggtccaa aggacaaagccgccagctgatcttgggccctcagggctgcaacatgtaacttag >gi568815594f:133050141_133263282|GENSCAN_predicted_peptide_6|77_aa MDIGNDKEEKEHVGRASPTSTTPCSTVPSPIDHPRAEECEGTAQDWQAAPPAAPVWDPLG EASWAPESGGDVESLYV >gi568815594f:133050141_133263282|GENSCAN_predicted_CDS_6|234_bp atggatataggaaacgacaaagaagaaaaagagcatgtaggtcgagcctccccgacgagc accaccccctgctccacggtgcccagtcccatcgaccacccaagggctgaggaatgcgag ggcacggcacaggactggcaggcagctccacctgcagccccggtgtgggatccactaggt gaagccagctgggctcctgagtctggtggggacgtggagagtctttatgtctag