GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:15:56 Sequence gi568815595f:56973695_57174241 : 200547 bp : 45.94% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 3716 3669 48 2 0 95 93 102 0.865 10.38 1.00 Prom - 6820 6781 40 -2.36 2.08 PlyA - 9476 9471 6 1.05 2.07 Term - 24284 24186 99 2 0 99 44 76 0.416 2.43 2.06 Intr - 32636 32577 60 2 0 50 64 90 0.166 1.73 2.05 Intr - 38190 38146 45 0 0 123 36 43 0.138 1.21 2.04 Intr - 39320 39114 207 2 0 59 52 102 0.206 2.97 2.03 Intr - 40469 40346 124 1 1 30 101 65 0.071 2.59 2.02 Intr - 45726 45608 119 0 2 131 59 73 0.777 7.96 2.01 Init - 52880 52827 54 2 0 73 94 19 0.450 2.31 2.00 Prom - 53100 53061 40 -8.46 3.00 Prom + 54523 54562 40 -2.26 3.01 Init + 62604 62656 53 2 2 72 82 25 0.278 0.93 3.02 Intr + 65485 65618 134 1 2 110 77 39 0.769 5.29 3.03 Intr + 71634 71711 78 1 0 74 97 32 0.106 2.22 3.04 Term + 82063 82127 65 2 2 99 54 70 0.528 2.75 3.05 PlyA + 82393 82398 6 1.05 4.00 Prom + 84977 85016 40 -2.46 4.01 Init + 89866 89966 101 0 2 82 94 57 0.397 5.51 4.02 Term + 100706 100784 79 2 1 88 38 54 0.099 -2.36 4.03 PlyA + 101719 101724 6 1.05 5.13 PlyA - 107648 107643 6 1.05 5.12 Term - 122811 122699 113 1 2 144 42 24 0.870 2.12 5.11 Intr - 124844 123902 943 2 1 120 82 1009 0.931 94.23 5.10 Intr - 127669 127485 185 2 2 70 64 65 0.561 1.81 5.09 Intr - 128895 128785 111 1 0 53 65 118 0.946 6.35 5.08 Intr - 131513 131387 127 2 1 94 94 32 0.775 4.65 5.07 Intr - 132314 132163 152 0 2 89 99 257 0.950 26.78 5.06 Intr - 133983 133837 147 1 0 96 48 53 0.773 2.31 5.05 Intr - 135963 135843 121 0 1 132 105 -23 0.999 3.87 5.04 Intr - 136617 136499 119 1 2 45 93 104 0.968 6.78 5.03 Intr - 141123 140998 126 1 0 111 109 152 0.997 20.25 5.02 Intr - 146619 146562 58 0 1 134 98 78 0.981 11.86 5.01 Init - 147552 147511 42 0 0 30 89 20 0.100 -3.25 5.00 Prom - 148422 148383 40 -2.46 6.00 Prom + 152462 152501 40 -0.66 6.01 Sngl + 160667 161029 363 1 0 59 48 455 0.486 34.68 6.02 PlyA + 161087 161092 6 1.05 7.00 Prom + 167582 167621 40 -5.36 7.01 Init + 168574 168607 34 0 1 73 60 9 0.376 -3.92 7.02 Intr + 169272 169301 30 1 0 89 94 27 0.411 1.50 7.03 Term + 169602 169843 242 1 2 56 48 247 0.621 13.69 7.04 PlyA + 171786 171791 6 1.05 8.03 PlyA - 172571 172566 6 1.05 8.02 Term - 183600 183550 51 0 0 95 48 76 0.509 1.73 8.01 Init - 191592 191467 126 0 0 102 86 180 0.982 19.40 8.00 Prom - 199736 199697 40 -2.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:56973695_57174241|GENSCAN_predicted_peptide_1|16_aa MAMLMQVLALFPGAHL >gi568815595f:56973695_57174241|GENSCAN_predicted_CDS_1|48_bp atggcaatgctgatgcaagtgctggccctgttcccaggagcccatctg >gi568815595f:56973695_57174241|GENSCAN_predicted_peptide_2|235_aa MYIMSVSLLGNLIENSCQEHTEAEKDCDFLAIAYLGAHTWVQGKNKKPDPEELCREGRVK LQTFPLSVTALKGSRLELFVPPGGFVVSLALRVKLQTFAPGPGAKPLTAQGQQECRACQA RTHPEFTLACEPVRSPGSCPCFSLHIPLQAEGASFTSASPEMGSHSAACKSEALRALGSS EDGWFSRYLTLVIGIDSWELPWQMRKQRPEELACDVAFNCTHVHEKCDFGIVKVL >gi568815595f:56973695_57174241|GENSCAN_predicted_CDS_2|708_bp atgtacatcatgtcggtctccctcctgggaaatctgatagaaaattcctgccaggaacac accgaggctgagaaagactgtgactttcttgccattgcctacctgggtgcacacacttgg gtccagggaaaaaataagaaacctgatcctgaagaattgtgccgggaagggagagtgaag ctgcaaaccttcccgctgagtgttacagctcttaaggggtcacgtctggagttgttcgtt cctcctggcgggttcgtggtctcgctggctttaagagtgaagctccagaccttcgcgcct ggcccgggtgctaagcccctcactgcccagggccagcaggagtgcagggcctgccaagcc cgcacccacccggaattcacgctggcctgcgagcccgtgcgcagccccggttcctgccca tgcttctccctccacatccctctgcaagcagagggagccagcttcacctcggccagccca gagatgggttcccacagtgcagcgtgcaaatcagaggccctgcgagcactgggaagcagt gaagatggatggttctccagatacctcacgctggtcattggcatcgacagctgggagctc ccctggcagatgaggaaacagaggccagaagagcttgcatgtgacgttgccttcaactgt acacacgttcatgagaagtgtgactttggaattgtgaaagttttatga >gi568815595f:56973695_57174241|GENSCAN_predicted_peptide_3|109_aa MTTWTGKIPFGCMTPKFSWQDQHPLVFADTSCINLLQSTTASIGQKVLLILHLKISQEAP SVVMPFMDTQYVFVDGQRACLWIHGGPKEASDSSSPLKPNRIPMHIPTV >gi568815595f:56973695_57174241|GENSCAN_predicted_CDS_3|330_bp atgaccacctggactggtaaaataccattcgggtgcatgacgcccaagtttagctggcag gatcagcacccattggtatttgctgatacctcctgcatcaacctcctgcaaagcacaact gcgtccattggacagaaggtcctgctaattcttcatttaaaaatttcccaggaagcacca agtgttgtaatgcctttcatggacactcagtatgtttttgttgatggacagagagcatgc ttatggattcatggaggacccaaagaagcctctgattcctcctcgcccctgaagcccaac agaattccaatgcacatccctactgtttga >gi568815595f:56973695_57174241|GENSCAN_predicted_peptide_4|59_aa MTPQALGLSNQEVGVLLNRDVGRGSGESGILEARYHFFQEASLSHFPNITDEEIKIQKG >gi568815595f:56973695_57174241|GENSCAN_predicted_CDS_4|180_bp atgacgccccaggctttgggcctgagcaaccaggaggttggggtgctccttaacagagat gtgggaagaggatcaggagagtcgggcatcctagaagccagatatcactttttccaagaa gcctccctgtcacacttccccaatatcacagatgaagaaatcaagattcagaaaggttaa >gi568815595f:56973695_57174241|GENSCAN_predicted_peptide_5|747_aa MVPSNILLVEWKLQGVGPASRNSGLYNITFKYDNCTTYLNPVGKHVIADAQNITISQYAC HDQVAVTILWSPGALGIEFLKGFRVILEELKSEGRQCQQLILKDPKQLNSSFKRTGMESQ PFLNMKFETDYFVKVVPFPSIKNESNYHPFFFRTRGYLFNLPELLLSNFGGWSPWNRQQL LLAFLISLSVIQGLGWSVLSNIPSFWKPRNLNISQHGSDMQVSFDHAPHNFGFRFFYLHY KLKHEGPFKRKTCKQALSFMLLESIPLYETEAAVIPILQMRKLRLRDVTWLAQGRTAMHS PWAGPIRAVAITVPLVVISAFATLFTVMCRKKQQENIYSHLDEESSESSTYTAALPRERL RPRPKVFLCYSSKDGQNHMNVVQCFAYFLQDFCGCEVALDLWEDFSLCREGQREWVIQKI HESQFIIVVCSKGMKYFVDKKNYKHKGGGRGSGKGELFLVAVSAIAEKLRQAKQSSSAAL SKFIAVYFDYSCEGDVPGILDLSTKYRLMDNLPQLCSHLHSRDHGLQEPGQHTRQGSRRN YFRSKSGRSLYVAICNMHQFIDEEPDWFEKQFVPFHPPPLRYREPVLEKFDSGLVLNDVM CKPGPESDFCLKVEAAVLGATGPADSQHESQHGGLDQDGEARPALDGSAALQPLLHTVKA GSPSDMPRDSGIYDSSVPSSELSLPLMEGLSTDQTETSSLTESVSSSSGLGEEEPPALPS KLLSSGSCKADLGCRSYTDELHAVAPL >gi568815595f:56973695_57174241|GENSCAN_predicted_CDS_5|2244_bp atggttccctccaacatccttcttgttgagtggaaacttcagggagtggggccagccagc agaaacagtgggctgtacaacatcaccttcaaatatgacaattgtaccacctacttgaat ccagtggggaagcatgtgattgctgacgcccagaatatcaccatcagccagtatgcttgc catgaccaagtggcagtcaccattctttggtccccaggggccctcggcatcgaattcctg aaaggatttcgggtaatactggaggagctgaagtcggagggaagacagtgccaacaactg attctaaaggatccgaagcagctcaacagtagcttcaaaagaactggaatggaatctcaa cctttcctgaatatgaaatttgaaacggattatttcgtaaaggttgtcccttttccttcc attaaaaacgaaagcaattaccaccctttcttctttagaacccgaggttatctctttaat cttcctgagttgctgctttcaaactttggtggctggtcgccctggaacaggcagcagctc cttcttgccttcctgatcagtctttctgtcattcaaggccttggatggtcagtcctgtcc aacatcccctccttctggaagcctcggaacctgaacatcagccagcatggctcggacatg caggtgtccttcgaccatgcaccgcacaacttcggcttccgtttcttctatcttcactac aagctcaagcacgaaggacctttcaagcgaaagacctgtaagcaggccctgagctttatg ctccttgaatccataccactctatgaaacagaggctgctgtcatccccattttacagatg agaaagctgaggctcagggatgtcacctggcttgctcaaggccgcacagctatgcactcc ccgtgggccgggcccatcagagccgtggccatcacagtgccactggtagtcatatcggca ttcgcgacgctcttcactgtgatgtgccgcaagaagcaacaagaaaatatatattcacat ttagatgaagagagctctgagtcttccacatacactgcagcactcccaagagagaggctc cggccgcggccgaaggtctttctctgctattccagtaaagatggccagaatcacatgaat gtcgtccagtgtttcgcctacttcctccaggacttctgtggctgtgaggtggctctggac ctgtgggaagacttcagcctctgtagagaagggcagagagaatgggtcatccagaagatc cacgagtcccagttcatcattgtggtttgttccaaaggtatgaagtactttgtggacaag aagaactacaaacacaaaggaggtggccgaggctcggggaaaggagagctcttcctggtg gcggtgtcagccattgccgaaaagctccgccaggccaagcagagttcgtccgcggcgctc agcaagtttatcgccgtctactttgattattcctgcgagggagacgtccccggtatccta gacctgagtaccaagtacagactcatggacaatcttcctcagctctgttcccacttgcac tcccgagaccacggcctccaggagccggggcagcacacgcgacagggcagcagaaggaac tacttccggagcaagtcaggccggtccctatacgtcgccatttgcaacatgcaccagttt attgacgaggagcccgactggttcgaaaagcagttcgttcccttccatcctcctccactg cgctaccgggagccagtcttggagaaatttgattcgggcttggttttaaatgatgtcatg tgcaaaccagggcctgagagtgacttctgcctaaaggtagaggcggctgttcttggggca accggaccagccgactcccagcacgagagtcagcatgggggcctggaccaagacggggag gcccggcctgcccttgacggtagcgccgccctgcaacccctgctgcacacggtgaaagcc ggcagcccctcggacatgccgcgggactcaggcatctatgactcgtctgtgccctcatcc gagctgtctctgccactgatggaaggactctcgacggaccagacagaaacgtcttccctg acggagagcgtgtcctcctcttcaggcctgggtgaggaggaacctcctgcccttccttcc aagctcctctcttctgggtcatgcaaagcagatcttggttgccgcagctacactgatgaa ctccacgcggtcgcccctttgtaa >gi568815595f:56973695_57174241|GENSCAN_predicted_peptide_6|120_aa MGISKRKGTANAQMPGNVTWMRRMRILCWLLRRYCESKKIDHHTYHSLYLKVKGNVFKNK WILMEHILKLKADKAHKKLQADQAKARRSKTKEARKHHEDRLQAKEEIIKTLSKEEETEK >gi568815595f:56973695_57174241|GENSCAN_predicted_CDS_6|363_bp atgggcataagtaagcgaaagggtacagccaatgcccaaatgccagggaacgtaacttgg atgaggagaatgcggattctgtgctggctgctgagaagatactgtgaatctaagaagatt gatcaccacacatatcacagcctgtacctgaaggtgaaggggaatgtgttcaaaaacaag tggattctcatggaacacatcctcaagctgaaggcagacaaggcccacaagaagctgcag gctgaccaggctaaggcccgcaggtctaagaccaaggaagcacgcaagcaccatgaagac cgcctacaggccaaggaggagatcatcaagactttgtctaaggaggaagagaccgagaag tga >gi568815595f:56973695_57174241|GENSCAN_predicted_peptide_7|101_aa MGANTFLLGIQEGLMLALSIEGTKLSQCWRDLSPLYLDNLNKGYDNLNKGYVILISDEVK TGTPSEEPVLKVPCYCLPRVQGTSPGSTASVPEGCQYPYQK >gi568815595f:56973695_57174241|GENSCAN_predicted_CDS_7|306_bp atgggagccaatacattccttctgggaattcaggaggggttgatgctggctctgagtatt gaaggcacaaaactgtctcagtgctggagggacctatctccactctaccttgacaatctc aacaaaggctacgacaatctcaacaaaggctacgtcattctcatcagtgatgaggtaaaa acagggactccctcagaggagccagtcctgaaggtgccctgctactgcctccctagagtg caaggaacatctccagggtctacagcctcagtccctgaaggctgccagtacccctatcag aagtag >gi568815595f:56973695_57174241|GENSCAN_predicted_peptide_8|58_aa MAPWLQLCSVFFTVNACLNGSQLAVAAGGSGRARGADTCGWRRNLNGLLLLLILELPP >gi568815595f:56973695_57174241|GENSCAN_predicted_CDS_8|177_bp atggccccgtggctgcagctctgctccgtcttctttacggtcaacgcctgcctcaacggc tcgcagctggctgtggccgctggcgggtccggccgcgcgcggggcgccgacacctgtggc tggaggagaaacttgaacgggcttctgctgctcctcatcctagaactgcccccgtga