GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:24:31 Sequence gi568815590r:38500685_38701200 : 200516 bp : 45.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 1661 1598 64 2 1 113 80 77 0.154 10.71 1.00 Prom - 8161 8122 40 -4.56 2.00 Prom + 10774 10813 40 -4.06 2.01 Init + 18908 18980 73 1 1 77 89 51 0.255 5.33 2.02 Intr + 19532 19554 23 0 2 93 92 -1 0.116 -1.94 2.03 Intr + 27206 27390 185 1 2 79 58 84 0.123 3.09 2.04 Term + 27993 28020 28 0 1 147 52 25 0.663 2.35 2.05 PlyA + 28689 28694 6 1.05 3.07 PlyA - 30501 30496 6 1.05 3.06 Term - 43162 43103 60 1 0 100 49 69 0.782 2.00 3.05 Intr - 56175 55875 301 2 1 36 -78 882 0.075 63.14 3.04 Intr - 60856 60739 118 2 1 3 75 106 0.475 0.32 3.03 Intr - 66520 66370 151 1 1 77 23 52 0.061 -2.66 3.02 Intr - 67107 66913 195 0 0 110 78 65 0.248 7.31 3.01 Init - 75617 75516 102 2 0 63 66 81 0.184 3.67 3.00 Prom - 75974 75935 40 -9.95 4.00 Prom + 76016 76055 40 -5.06 4.01 Init + 76073 76158 86 1 2 104 30 52 0.250 1.29 4.02 Intr + 77361 77379 19 0 1 94 100 5 0.321 -1.09 4.03 Intr + 78095 78225 131 1 2 111 109 86 0.989 12.49 4.04 Intr + 87304 87415 112 1 1 57 92 48 0.113 2.48 4.05 Intr + 88946 89015 70 1 1 81 60 8 0.009 -3.95 4.06 Intr + 92906 93055 150 0 0 80 88 7 0.013 0.03 4.07 Intr + 98905 98934 30 2 0 93 103 41 0.018 4.20 4.08 Intr + 111939 112071 133 1 1 115 64 66 0.483 6.60 4.09 Intr + 119930 120026 97 2 1 74 31 66 0.064 -0.49 4.10 Term + 130967 131356 390 1 0 76 41 213 0.882 10.39 4.11 PlyA + 134310 134315 6 1.05 5.00 Prom + 141454 141493 40 -5.36 5.01 Init + 142316 142329 14 1 2 110 50 0 0.304 -2.03 5.02 Term + 149972 150413 442 2 1 87 42 247 0.927 14.73 5.03 PlyA + 150660 150665 6 1.05 6.00 Prom + 158230 158269 40 -1.36 6.01 Init + 161246 161299 54 1 0 77 98 -13 0.227 -2.10 6.02 Intr + 162106 162129 24 0 0 108 80 18 0.506 1.22 6.03 Intr + 162227 162355 129 1 0 49 108 44 0.674 3.39 6.04 Intr + 170796 170885 90 2 0 36 21 137 0.059 1.99 6.05 Term + 175456 175626 171 0 0 10 38 201 0.670 5.13 6.06 PlyA + 176301 176306 6 1.05 7.03 PlyA - 180029 180024 6 1.05 7.02 Term - 181719 181616 104 1 2 92 55 47 0.157 0.24 7.01 Init - 193473 193413 61 0 1 121 49 43 0.450 5.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 56175 55809 367 2 1 36 51 790 0.862 64.18 S.002 Term + 170796 170900 105 2 0 36 48 154 0.838 4.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:38500685_38701200|GENSCAN_predicted_peptide_1|22_aa METPLGIQDEQPDRKAPVGSGX >gi568815590r:38500685_38701200|GENSCAN_predicted_CDS_1|66_bp atggagacgccgctgggcattcaggacgagcagcctgacaggaaggctcccgtaggctct ggagnn >gi568815590r:38500685_38701200|GENSCAN_predicted_peptide_2|102_aa MGIIRMKENVCKTYSTMPTTGLILSRCAALNWFLPLQNEGLEMQDSKIVAISDSDNNPVS LKNSALLLHRGFQQEAGTDGFLWNVFRNVVCDVRKIFNICAI >gi568815590r:38500685_38701200|GENSCAN_predicted_CDS_2|309_bp atgggcatcattaggatgaaagaaaacgtgtgtaaaacttacagcacgatgcctacaaca gggctgatcctcagcagatgtgctgctttgaactggttcctccctctgcaaaacgaaggg cttgaaatgcaggattctaagattgtggccatctctgactctgataacaaccccgtgagc ctgaagaattcagcactgctgctccatagggggtttcagcaggaagcgggcacagatggc tttctctggaacgtgttcagaaatgtggtgtgtgatgtacggaagatcttcaatatctgt gccatttga >gi568815590r:38500685_38701200|GENSCAN_predicted_peptide_3|308_aa MRSLGEEEVHTSPRLLGIQIQSVFIKPMMEEAQQCPTFNQLNALFCHSESPCLNSEEVIV STPCYQCFSLIHSVDSFSSVSSMGQALPGEQLAAPILKECLSSLPSSWEDEQGPLWRGES RKETVPGAVGHRNNKNCTQHTPKEQQWDRVLKAAETAGGLRAGDGTATVKFRKITVASVW GVVKEQESSLGNTARLSLKNEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE EEEEEEEEEEEEEEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKGRKTDLGRTHTELNLYR RVDEDKHL >gi568815590r:38500685_38701200|GENSCAN_predicted_CDS_3|927_bp atgaggtccctgggggaagaagaggttcataccagtcccagacttttgggcattcaaatc cagtcagtgttcatcaagcccatgatggaggaggcacagcagtgtcccaccttcaatcag ctgaatgctctcttttgtcattctgaatctccttgtctgaactctgaagaggtcattgtt tctactccttgctaccagtgtttcagtttaattcattcagttgatagtttttcttcagtc tcctccatgggccaggccctgccaggtgaacagctcgcagccccaatcctcaaggagtgt ttatcttcccttccaagcagctgggaggatgagcagggtccactgtggagaggagaaagc aggaaagagactgtacctggagccgtgggtcaccggaacaacaagaactgcacacaacac accccgaaggaacagcagtgggacagagtcctgaaggctgcggaaactgctggagggctt cgagcaggggatggcacagccacagtgaagtttagaaaaatcactgtggcctcggtgtgg ggtgtagtgaaggagcaagagagcagcctgggcaacacagcaagactcagtctcaaaaat gaagaagaagaagaagaagaagaagaagaggaagaggaagaggaagaggaagaggaagag gaagaggaagaagaagaagaggaagaagaagaggaagaagaagaggaagaagaagaggaa gaagaagaagaagaagaagaggaggaagaggaggaggaggaggaggaggaggagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaaaggaaggaaaactgaccttggccgcacacatacagagctgaacttgtatcgc cgtgtagatgaagacaaacacctttag >gi568815590r:38500685_38701200|GENSCAN_predicted_peptide_4|405_aa MAMSVTSGRPHCYTPTSAMTVYKCHGNVRNSNEAKWTEPFPVALLAIQAMVVTLLHSLSP YIGPQIFPIFLYTLCTCFSLWSVTPLHSKGFIFVSAFLGEWAWLGDVTVDPGFCHWLHLI EPDSKAANSWAAHGLWESWEPTQVRRNQKTNSGNMTKQDSLTPPKNYTSSQQWTQTKKKS LIYLKKNSGDPGASHGTLGGPPVLATASHHPNSVQAWVLFDRLSPEYCSLVNKEFSEKKA EGLGASGSDWNLHHWLSDSQDFEIYRWLSWVSSLQMFHLHPKWKATQVNEWDLIIMVIII IITVVQSLLSASTMRSALCHLMKLKLNFTKEQTEAWNNLSESQRQKVIHREVCVTLAGAN SSLFLPLYSLLYLPLLWARHTAMLKVQRDLEKYSLPQEERLHTCD >gi568815590r:38500685_38701200|GENSCAN_predicted_CDS_4|1218_bp atggcgatgagtgtgacctctggtcgtcctcactgctacactcccaccagtgccatgaca gtttacaaatgccatggcaatgtcagaaattcaaatgaggcaaagtggacagagccattc cccgtggccctgctggcaatccaggcaatggttgtcaccctcttacattccctcagcccc tacattggtcctcagatcttcccaatcttcctctatactctctgcacctgcttcagcttg tggtctgttactcctctgcattcaaaaggattcatctttgtcagtgcctttctgggagag tgggcctggcttggagatgtcactgtggatccaggattttgccactggctgcacttgatt gaacctgactcaaaagcagccaactcatgggctgcccatggcctgtgggagtcttgggag cctacccaagtaagaaggaaccagaaaaccaactctggtaatatgacaaaacaagattct ttaacaccccccaaaaattacactagctcacagcaatggacccaaaccaagaagaaatcc ctgatttaccttaaaaagaattcaggagaccctggtgcttcccacgggaccctgggtggg cctcctgttcttgccacagcctcccaccaccccaactcagtccaggcctgggtcctcttt gacaggctgtcccctgagtactgttcattggtgaacaaagagttcagtgaaaagaaggca gaaggcctgggggcttcaggctcagactggaacctgcaccactggctctctgactctcag gactttgaaatataccgctggctttcctgggtctccagcttgcagatgtttcatctccat cctaaatggaaagctacccaagtgaatgagtgggacctcatcatcatggtcatcatcatc atcatcacagttgtgcagagcttactatctgccagcaccatgcgaagtgccctatgccat ttaatgaaattgaagcttaattttacaaaggagcaaacagaagcttggaataacttgtct gagtcacagagacagaaagtgattcatagagaggtctgtgtaactctggctggtgctaac agtagcttgttcctacctctgtactccctgctgtatctcccgctgctctgggcccggcac acagcgatgctcaaagtacagagagatctagagaaatacagcctgccccaggaggaacgg ctccatacatgtgactga >gi568815590r:38500685_38701200|GENSCAN_predicted_peptide_5|151_aa MPGQGIPLKRDSCNDFQMSASCLLADWGGCGKLRDGNQNPHQMCHVNVSLAHIWLQCRPP APEQMAAFPLYSASQRSPVKRRWGRNGRGLRGRDAATPSLGRGAGPVLAAVGPGSLEAEP RALFPRGAAVGWPGGQNEGLRDGVRKKIVRV >gi568815590r:38500685_38701200|GENSCAN_predicted_CDS_5|456_bp atgcctggtcaagggatccccttgaagcgtgacagctgcaatgacttccagatgtccgcc tcctgtctcctagccgactggggtggatgtggtaaactccgcgatggaaaccaaaacccc caccagatgtgccatgtcaacgtcagcctcgcgcacatctggcttcaatgccggccgcca gccccagaacaaatggcggctttcccgctgtattcagctagtcagcgttccccggttaaa aggcgctggggcaggaacggccggggccttcgggggcgcgacgcggcgacgcccagcctg ggaaggggcgcggggcccgtgttggccgcggtgggtcccggctccctggaggctgagccc cgggcgctctttcctcgcggcgctgccgtggggtggccgggagggcagaacgaggggctg cgggacggtgttcggaagaaaatcgtgcgagtttaa >gi568815590r:38500685_38701200|GENSCAN_predicted_peptide_6|155_aa MGQSWWLTPVIPALWEAKVYQNPFEKKYGESIPTGHFQIFHHCIKALNKEAFYILSQSLA KQKQLSMPQSQALHPCGLDEFPLTTKEEEEEECAVLVQQGYTKYIIMNFQDVGIKEKLQQ AFKEREREKSYMQIIKYQKGIGFLNSNPESQNAVK >gi568815590r:38500685_38701200|GENSCAN_predicted_CDS_6|468_bp atgggccagtcttggtggctcacaccagtaatcccagcactttgggaggctaaggtttac caaaatccttttgaaaaaaaatatggtgagtccatacctacaggtcactttcagatattc caccactgtatcaaagccctcaacaaagaagccttctacatcctttctcagtcattagcc aagcaaaagcaattgtcaatgcctcaaagccaagccctgcacccctgtggcttggatgag ttccctttgaccaccaaggaagaggaggaagaggagtgcgcagtcctcgtgcagcagggt tacaccaagtacatcatcatgaattttcaggatgttgggatcaaggaaaagctccaacaa gctttcaaagagagagagagagagaaaagttacatgcaaattatcaagtatcaaaaaggt attggatttctcaacagcaacccagaaagtcagaatgcagtgaagtaa >gi568815590r:38500685_38701200|GENSCAN_predicted_peptide_7|54_aa MAGLQAVFLLELGERGKDDGARGILFVFKSQQFSQNALKPAGFEQLERAVLAFC >gi568815590r:38500685_38701200|GENSCAN_predicted_CDS_7|165_bp atggctggcctccaagctgtcttcttactagaactgggagaaagaggaaaggatgatggg gcgcggggcatcctgtttgtttttaaaagccaacaattttcccagaatgcactgaaacca gcaggttttgaacaactcgagagagctgtgctggccttctgctga