GENSCAN 1.0	Date run: 3-Nov-116	Time: 16:32:48

Sequence gi568815593r:134097775_134325861 : 228087 bp : 44.75% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  16412  16579  168  1  0   77  103   107 0.813   8.88
 1.02 Intr +  17103  17381  279  2  0   42   83   527 0.854  45.27
 1.03 Intr +  17547  17613   67  2  1  120  113    50 0.873   9.18
 1.04 Intr +  18135  18259  125  1  2   80  119   140 0.740  16.60
 1.05 Intr +  24607  24698   92  0  2   73   89    25 0.091  -0.11
 1.06 Intr +  24938  25082  145  2  1   43   75    75 0.354   1.98
 1.07 Intr +  32030  32126   97  1  1   79   94    -3 0.349  -1.02
 1.08 Intr +  34746  34966  221  1  2  121   33    76 0.503   3.42
 1.09 Intr +  40285  40390  106  0  1  102   82    54 0.583   5.99
 1.10 Intr +  41177  41264   88  0  1  113  105     8 0.961   4.03
 1.11 Intr +  44411  44530  120  2  0  120   76   115 0.994  13.11
 1.12 Intr +  44947  45109  163  1  1   80   76   299 0.999  27.88
 1.13 Intr +  45219  45326  108  2  0  119   83   174 0.964  20.48
 1.14 Intr +  45818  45866   49  1  1  129   94    53 0.938   8.25
 1.15 Intr +  48596  48711  116  0  2  114   92     1 0.329   3.27
 1.16 Intr +  52636  52697   62  0  2   22   74    53 0.161  -5.17
 1.17 Term +  53971  54154  184  1  1   79   44   106 0.464   2.32
 1.18 PlyA +  54409  54414    6                               1.05

 2.15 PlyA -  55081  55076    6                               1.05
 2.14 Term -  59994  59959   36  0  0   85   44    54 0.902  -1.96
 2.13 Intr -  60821  60681  141  2  0   50   96   212 0.994  18.75
 2.12 Intr -  63356  63213  144  2  0   42  119   144 0.971  13.38
 2.11 Intr -  69469  69396   74  2  2   29  116    64 0.401   2.43
 2.10 Intr -  76248  76152   97  0  1  101  115    67 0.857  10.28
 2.09 Intr -  83453  83394   60  2  0   82  108     4 0.171   0.73
 2.08 Intr -  88595  88511   85  1  1   88   58    16 0.008  -1.58
 2.07 Intr - 101430 101312  119  0  2  108   78    46 0.876   4.86
 2.06 Intr - 104247 104074  174  0  0   82   91    78 0.994   7.64
 2.05 Intr - 108357 108148  210  0  0   89   86    72 0.941   6.21
 2.04 Intr - 128121 127923  199  2  1  -35   50   451 0.080  28.35
 2.03 Intr - 128295 128148  148  1  1   81    7   250 0.808  15.49
 2.02 Intr - 178311 178200  112  0  1   67   68    78 0.386   3.65
 2.01 Init - 182642 182640    3  2  0  113   81     0 0.345   1.80
 2.00 Prom - 184082 184043   40                              -4.86

 3.07 PlyA - 185593 185588    6                               1.05
 3.06 Term - 188785 188599  187  0  1   93   44   133 0.191   6.36
 3.05 Intr - 210692 210364  329  2  2   66   83   194 0.615  11.20
 3.04 Intr - 210953 210800  154  1  1   45  116    46 0.598   3.17
 3.03 Intr - 214606 214518   89  1  2  106   80    14 0.526   1.17
 3.02 Intr - 221723 221584  140  0  2  129   50    -4 0.469   0.08
 3.01 Intr - 224129 224017  113  1  2   76   69    60 0.433   2.92

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100070  99998   73  1  1   73   39    73 0.809  -1.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:134097775_134325861|GENSCAN_predicted_peptide_1|729_aa
MEDGARAATAPRSSNAAGSLGYEHSLKQPQALGSPQFKAQGALEPGHDTVHTTAAMRPAP
RRAERTMPQLDSGGGGAGGGDDLGAPDELLAFQDEGEEQDDKSRDSAAGPERDLAELKSS
LVNESEGAAGGAGIPGVPGAGAGARGEAEALGREHAAQRLFPDKLPEPLEDGLKAPECTS
GMYKETVYSAFNLLMHYPPPSGAGQHPQPQPPLALPGSTKPVSTAAHGNIHTSTATSSAT
LRLNHTVTPLTPCDWCFLLLCTNKALVEETGVLSGHRGPHPQWYSAQILLLQAPLSDVKA
RRWVDWPHAPSRALCNCGNSLVPAGLRPEETATFFAGKVLWGSSIWGFSESTVRGEGVSL
LAVARAVSVHTACEQTAAVYRENADCVYFSPEPAGSWGHKANQPPHGVPQLSLYEHFNSP
HPTPAPADISQKQVHRPLQTPDLSGFYSLTSGSMGQLPHTVSWFTHPSLMLGSGVPGHPA
AIPHPAIVPPSGKQELQPFDRNLKTQAESKAEKEAKKPTIKKPLNAFMLYMKEMRAKVIA
ECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYGKKKRRS
REKHQESTTELLTSPAEPAPTSPGLSTALSLPTPGPPQAPRSTLQSTQRGRKGISSCIRF
SENSTEQKGMLLNTGTQTRGSQQYTRLQGNKYIINELAIYHSNYVPHWKCVKQEHLRQHF
SATTNDHLH

>gi568815593r:134097775_134325861|GENSCAN_predicted_CDS_1|2190_bp
atggaggatggggcaagagccgcaacagcgccgcggagttccaacgctgccggttccctg
gggtacgagcacagcctcaagcagcctcaagccctaggaagcccccagttcaaagcacag
ggcgcattggagcctgggcacgatacagttcacaccacggctgcgatgcgccccgcgccc
cggcgggcggagcgcaccatgccgcagctggactccggcgggggcggcgcgggcggcggc
gacgacctcggcgcgccggacgagctgctggccttccaggatgaaggcgaggagcaggac
gacaagagccgcgacagcgccgccggtcccgagcgcgacctggccgagctcaagtcgtcg
ctcgtgaacgagtccgagggcgcggccggcggcgcagggatcccgggggtcccgggggcc
ggcgccggggcccgcggcgaggccgaggctctcgggcgggaacacgctgcgcagagactc
ttcccggacaaacttccagagcccctggaggacggcctgaaggccccggagtgcaccagc
ggcatgtacaaagagaccgtctactccgccttcaatctgctcatgcattacccacccccc
tcgggagcagggcagcacccccagccgcagcccccgctggctctaccaggttccaccaag
ccagtttccacggcagcacacgggaatattcataccagcactgccacctcctcagctact
ctgagactcaaccatacagtcacacccctcacaccttgtgactggtgcttcctgctgctg
tgtacaaacaaggctctggtggaggaaacaggtgttttatcaggtcacaggggcccccac
ccccaatggtacagtgcacagatcctcctgctgcaggcccctttatctgatgtcaaggcc
aggcggtgggtggactggccacacgcacctagcagagctctgtgcaactgtggcaacagc
cttgtgccagcaggcttgaggccagaggaaacagcaactttcttcgctgggaaagtgttg
tggggctcaagcatttgggggttttcagagtcaaccgtcagaggcgagggagtttctctg
ctggcggtggcaagggctgtcagcgtgcacacagcctgcgagcagaccgctgcggtctac
agggaaaatgccgactgtgtttatttttcacctgagcctgcaggaagttgggggcacaag
gccaatcagcccccccacggtgtcccccaactctctctctacgaacatttcaacagccca
catcccacccctgcacctgcggacatcagccagaagcaagttcacaggcctctgcagacc
cctgacctctctggcttctactccctgacctcaggcagcatggggcagctcccccacact
gtgagctggttcacccacccatccttgatgctaggttctggtgtacctggtcacccagca
gccatcccccacccggccattgtgcccccctcagggaagcaggagctgcagcccttcgac
cgcaacctgaagacacaagcagagtccaaggcagagaaggaggccaagaagccaaccatc
aagaagcccctcaatgccttcatgctgtacatgaaggagatgagagccaaggtcattgca
gagtgcacacttaaggagagcgctgccatcaaccagatcctgggccgcaggtggcacgcg
ctgtcgcgagaagagcaggccaagtactatgagctggcccgcaaggagaggcagctgcac
atgcagctatacccaggctggtcagcgcgggacaactacgggaagaagaagaggcggtcg
agggaaaagcaccaagaatccaccacagaactgcttactagccctgcggagccggcacct
acatccccaggtctctccactgctctcagcctcccaaccccagggcccccacaggccccc
cgcagcaccctgcagagcacacagagaggaagaaagggaatcagttcctgcatccgtttc
tcagaaaacagcacagaacagaagggaatgctgctcaatactggcacacagaccagaggt
agccagcagtacacaagactgcaaggcaacaagtacattatcaatgaattagccatctat
cactcaaactatgtcccgcattggaagtgtgtcaagcaagagcaccttcgacaacacttt
tcggccaccactaatgatcatctccactga

>gi568815593r:134097775_134325861|GENSCAN_predicted_peptide_2|533_aa
MWDNQKLKALEQLMAKPDGFVVSSYKELLMYCSGRADTQSRALEPQRAEEEAQRPTAEYC
GESQRASASLNSRQKYTRNRRRRVRVGPAERQPAGAGGIMDEKVFTKELDQWIEQLNECK
QLSESQVKSLCEKVSCSTAADSRRGAEPAEEKAAAKEILTKESNVQEVRCPVTVCGDVHG
QFHDLMELFRIGGKSPDTNYLFMGDYVDRGYYSVETVTLLVALKVRYRERITILRGNHES
RQITQVYGFYDECLRKYGNANVWKYFTDLFDYLPLTALVDGQGYNWCHDRNVVTIFSAPN
YCYRCGNQAAIMELDDTLKYSLLLCQFFPSSKYGNDPRLSTHTTFSLLFQRLFPFDSRNI
IWPLPGKFSQMPSIKLQSSDGEIFEVDVEIAKQSVTIKTMLEDLGMDDEGDDDPVPLPNV
NAAILKKVIQWCTHHKDDPPPPEDDENKEKRTDDIPVWDQEFLKVDQGTLFELILAANYL
DIKGLLDVTCKTVANMIKGKTPEEIRKTFNIKNDFTEEEEAQVRKENQWCEEK

>gi568815593r:134097775_134325861|GENSCAN_predicted_CDS_2|1602_bp
atgtgggataatcagaaactgaaagctcttgagcagttaatggctaagccagatggtttc
gtggtatcatcttacaaagagctccttatgtactgcagtggaagagcagacacacagagc
cgagctctggagcctcagcgagcggaggaggaggcgcagcggccgacggccgagtactgc
ggtgagagccagcgggccagcgccagcctcaacagccgccagaagtacacgaggaaccgg
cggcggcgtgtgcgtgtaggccccgcggagcggcagccggctggggcgggtggcatcatg
gacgagaaggtgttcaccaaggagctggaccagtggatcgagcagctgaacgagtgcaag
cagctgtccgagtcccaggtcaagagcctctgcgagaaggtgagctgtagtacggctgcg
gacagccgccgcggggccgagcccgccgaggaaaaggcggccgctaaagaaatcctgaca
aaagaatccaacgtgcaagaggttcgatgtccagttactgtctgtggagatgtgcatggg
caatttcatgatctcatggaactgtttagaattggtggcaaatcaccagatacaaattac
ttgtttatgggagattatgttgacagaggatattattcagttgaaacagttacactgctt
gtagctcttaaggttcgttaccgtgaacgcatcaccattcttcgagggaatcatgagagc
agacagatcacacaagtttatggtttctatgatgaatgtttaagaaaatatggaaatgca
aatgtttggaaatattttacagatctttttgactatcttcctctcactgccttggtggat
gggcagggatataactggtgccatgaccggaatgtagtaacgattttcagtgctccaaac
tattgttatcgttgtggtaaccaagctgcaatcatggaacttgacgatactctaaaatac
tctttactcctttgccagttcttcccatcctctaaatatggaaatgacccaaggcttagt
actcacacgactttttctctcctcttccagcgcttatttccatttgacagcagaaatata
atctggccacttcctggcaaattcagtcaaatgccttcaattaagttgcagagttctgat
ggagagatatttgaagttgatgtggaaattgccaaacaatctgtgactattaagaccatg
ttggaagatttgggaatggatgatgaaggagatgatgacccagttcctctaccaaatgtg
aatgcagcaatattaaaaaaggtcattcagtggtgcacccaccacaaggatgaccctcct
cctcctgaagatgatgagaacaaagaaaagcgaacagatgatatccctgtttgggaccaa
gaattcctgaaagttgaccaaggaacactttttgaactcattctggctgcaaactactta
gacatcaaaggtttgcttgatgttacatgcaagactgttgccaatatgatcaaggggaaa
actcctgaggagattcgcaagaccttcaatatcaaaaatgactttactgaagaggaggaa
gcccaggtacgcaaagagaaccagtggtgtgaagagaagtga

>gi568815593r:134097775_134325861|GENSCAN_predicted_peptide_3|337_aa
XPVDIWALGCMIIEMATGNPYLPSSSDLDLLHKIVLKVGNLSPHLQNIFSKSPIFAGVVL
PQVQHPKNARKKYPKLNGLLADIVHACLQIDPADRISSSDLLHHEYFTRDGFIEKFMPEL
KAKLLQEAKVNSLIKPKESSKENELRKDERKTVYTNTLLSSSVLGKEIEKEKKPKEIKVR
VIKVKGGRGDISEPKKKEYEGGLGQQDANENVHPMSPDTKLVTIEPPNPINPSTNCNGLK
ENPHCGGSVTMPPINLTNSNLMAANLSSNLFHPSVRYVFISGMKTDQYSELVPVEWGIAE
KIPKNVEATLELGNRQRFEQFGELRRRQENVGKFGTF

>gi568815593r:134097775_134325861|GENSCAN_predicted_CDS_3|1014_bp
nnacctgtggatatctgggctttgggctgtatgatcattgagatggccactggaaatccc
tatcttcctagtagttctgatttggatttactccataaaattgttttgaaagtgggcaat
ttgtcacctcacttgcagaatatcttttccaagagccccatttttgctggggtagttctt
cctcaagttcaacaccccaaaaatgcaagaaaaaaatatccaaagcttaatggattgttg
gcagatatagttcatgcttgtttacaaattgatcctgctgacaggatatcatctagtgat
cttttgcatcatgagtattttactagagatggatttattgaaaaattcatgccagaactg
aaagctaaattactgcaggaagcaaaagtcaattcattaataaagccaaaagagagttct
aaagaaaatgaactcaggaaagatgaaagaaaaacagtttataccaatacactgctaagt
agttcagttttgggaaaggaaatagaaaaagagaaaaagcccaaggagatcaaagtcaga
gttattaaagtcaaaggaggaagaggagatatctcagaaccaaaaaagaaagagtatgaa
ggtggacttggtcaacaggatgcaaatgaaaatgttcatcctatgtctccagatacaaaa
cttgtaaccattgaaccaccaaaccctatcaatcccagcactaactgtaatggcttgaaa
gaaaatccacattgcggaggttctgtgacaatgccacccatcaatctaactaacagtaat
ttgatggctgcaaatctcagttcaaatctctttcaccccagtgtgaggtacgtctttatc
agcggcatgaagacagaccaatacagtgaattggtaccagtagagtggggcattgctgaa
aagatacccaaaaatgtggaagcaactttggaactgggtaacaggcagaggtttgaacag
tttggagagctcagaagaagacaggaaaatgtgggaaagtttggaactttctag