GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:31:10 Sequence gi568815583r:82918070_83167151 : 249082 bp : 43.36% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 624 769 146 1 2 91 89 15 0.251 1.90 1.02 Intr + 1234 1287 54 0 0 127 52 8 0.285 0.28 1.03 Intr + 8068 8188 121 0 1 42 81 81 0.598 2.87 1.04 Intr + 10027 10178 152 2 2 68 91 84 0.757 6.58 1.05 Term + 24161 24367 207 1 0 85 37 147 0.310 6.64 1.06 PlyA + 25969 25974 6 1.05 2.00 Prom + 26230 26269 40 -3.96 2.01 Init + 34551 34685 135 2 0 53 -9 148 0.021 -0.26 2.02 Intr + 34812 35013 202 2 1 67 110 56 0.043 4.56 2.03 Intr + 35384 35523 140 0 2 88 52 77 0.046 4.28 2.04 Term + 45807 45914 108 2 0 72 37 107 0.302 2.51 2.05 PlyA + 46278 46283 6 1.05 3.00 Prom + 48317 48356 40 -2.16 3.01 Init + 70950 71091 142 2 1 56 48 156 0.937 8.70 3.02 Term + 71812 72020 209 2 2 63 39 201 0.992 10.20 3.03 PlyA + 72298 72303 6 1.05 4.12 PlyA - 73012 73007 6 1.05 4.11 Term - 81973 81902 72 1 0 88 38 74 0.777 0.31 4.10 Intr - 82262 82017 246 2 0 87 86 66 0.754 3.96 4.09 Intr - 90606 90479 128 1 2 74 100 38 0.952 4.00 4.08 Intr - 92294 92168 127 2 1 73 74 69 0.834 4.25 4.07 Intr - 100784 100638 147 2 0 63 69 100 0.636 6.03 4.06 Intr - 102693 102606 88 2 1 85 92 45 0.780 4.57 4.05 Intr - 112259 112067 193 0 1 100 70 39 0.553 1.85 4.04 Intr - 123856 123659 198 2 0 67 110 127 0.687 12.12 4.03 Intr - 132109 132004 106 1 1 56 97 46 0.351 2.09 4.02 Intr - 138476 138320 157 1 1 74 111 72 0.873 8.11 4.01 Init - 149082 148682 401 0 2 71 83 839 0.883 75.84 4.00 Prom - 153071 153032 40 -5.16 5.00 Prom + 159910 159949 40 -3.36 5.01 Init + 164764 164811 48 0 0 92 41 73 0.724 2.07 5.02 Intr + 164817 164934 118 2 1 60 33 111 0.285 2.84 5.03 Term + 175334 175461 128 0 2 79 43 99 0.642 2.94 5.04 PlyA + 175755 175760 6 -0.45 6.00 Prom + 176788 176827 40 -2.86 6.01 Init + 189612 189703 92 2 2 70 61 219 0.268 17.39 6.02 Intr + 194728 194831 104 1 2 97 88 104 0.279 11.12 6.03 Intr + 197776 197873 98 2 2 92 64 74 0.240 5.13 6.04 Intr + 201509 201612 104 1 2 63 83 87 0.631 4.67 6.05 Term + 218450 218603 154 2 1 69 42 71 0.055 -2.01 6.06 PlyA + 218832 218837 6 1.05 7.06 PlyA - 219112 219107 6 1.05 7.05 Term - 221206 221201 6 1 0 111 36 0 0.061 -5.03 7.04 Intr - 233292 233146 147 0 0 41 74 187 0.856 13.03 7.03 Intr - 239504 239346 159 2 0 85 27 123 0.939 6.08 7.02 Intr - 239972 239834 139 1 1 77 95 121 0.964 12.17 7.01 Intr - 246006 245930 77 0 2 69 111 43 0.861 3.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:82918070_83167151|GENSCAN_predicted_peptide_1|226_aa XTHTTGSVTMAETWESHTQHTAGTSPSLSSPQQPRLSASYLVVSSNGPQALLREVTTGCS SIQDQGQSTRAAITKYERLRIAYKQQKFISHSSDERSKVKVPVDLVSVLAVVSLLEMHCW PKDQPEVHYWAKSIVCKGNLHLDCLDAALFPLAATATQEGRDSVLRPVFLVSSPEKVLNI QVMVIGQRIKKGKRADELMKGDQKGIQIHTSKDGSPETYRDPETTD >gi568815583r:82918070_83167151|GENSCAN_predicted_CDS_1|681_bp ntgactcacaccactggatctgtcactatggcagaaacctgggaatctcacacccagcac acagcaggaacatctccctcgctgtcctctcctcagcagcccaggctgtcagcctcctac ctagtagtgtcctccaatgggccacaggccttactcagggaagtcacaacgggttgctcg tccatacaggatcaaggacagtccacacgggctgctatcacaaaatacgagaggctgcgt atagcttataaacaacagaaatttatttctcacagttctgatgagaggtccaaagtcaag gtgccagtagatttggtgtctgtattggcagtggtttctctgctagaaatgcactgctgg cctaaagaccagccagaagtacactactgggctaaatctattgtgtgcaagggtaacctc cacctggactgcttggacgcagctctcttcccactggcggccaccgcaacacaggaaggc agggactctgtcttacgacctgtcttcctggtgtctagcccagagaaggtactcaacatt caggtgatggtcattggccaaaggatcaagaaaggtaaaagagctgacgaactgatgaag ggtgatcagaaaggaattcagatccatacgagtaaggatgggagtccagagacatacagg gaccctgaaactactgactag >gi568815583r:82918070_83167151|GENSCAN_predicted_peptide_2|194_aa MRRPVRARLSPGAAPFRGAARLPLLPLPPPPPGAGGRGLGGRAGAPRSRRFSPGPPSPRC RPRVGAPPPHADPGPWRSAINILPDGLKVPFTPGGDFRATFTLQIREPRPGEGPEFLPKA GTGTNLTFIPRDFHGIIFCWEHTAHISLASQDSKKVKGQGLLLPPLLPKALDDPKFWQPH ALAASLQLSTDDEM >gi568815583r:82918070_83167151|GENSCAN_predicted_CDS_2|585_bp atgcggcggcccgtgcgcgcccggctcagccccggcgccgctccattccgcggggctgcg cggctgccactgctgccactgccgccaccgccgccaggcgcgggcgggcgggggctgggc ggccgcgctggggcgccgcgcagtcgccgcttttctcccgggccacccagtcctcggtgc cggccccgcgttggcgcgcccccgccccacgccgacccggggccctggcgcagtgccatc aacatcctgcctgatggacttaaagtcccattcacacctggcggggactttagggctacc tttactttgcagataagggaaccgaggcctggggagggtcctgagtttctaccaaaggct ggcactgggactaacctaaccttcatcccccgggacttccacgggatcatcttctgctgg gaacacaccgcccacataagcctggcatctcaggacagtaaaaaagtcaagggacagggc ctcttacttccacctctgctgccaaaagccttggatgatcccaagttctggcagccccat gccctggctgcttccctacaactgagcactgacgatgagatgtag >gi568815583r:82918070_83167151|GENSCAN_predicted_peptide_3|116_aa MTDTAEAVPKFEEMFASRFTENDKEYQEYLKRPPESPPIVEEWNSRAGCKTTDSSEAGTT DGGGQVTIDPISGMDDPGVTTTRNTDKNLTIPSNMDIMVTTSGLLTVTTDRNVGSF >gi568815583r:82918070_83167151|GENSCAN_predicted_CDS_3|351_bp atgactgacactgccgaagctgttccaaagtttgaagagatgtttgctagtagattcaca gaaaatgacaaggagtatcaggaatacctgaaacgccctcctgagtctcctccaattgtt gaggaatggaatagcagagctggttgcaagacaacagacagttcagaggcagggacaaca gatgggggtggccaagtgacaatcgatccaatcagtggcatggacgatcctggggtaaca actacccgcaacacagacaagaaccttactatccccagcaatatggacattatggttaca accagcggcctccttacggttactactgatagaaatgttggcagcttttag >gi568815583r:82918070_83167151|GENSCAN_predicted_peptide_4|620_aa MASLGPAAAGEQASGAEAEPGPAGPPPPPSPSSLGPLLPLQREPLYNWQATKASLKERFA FLFNSELLSDVRFVLGKGRGAAAAGGPQRIPAHRFVLAAGSAVFDAMFNGGMATTSAEIE LPDVEPAAFLALLRFLYSDEVQIGPETVMTTLYTAKKYAVPALEAHCVEFLTKHLRADNA FMLLTQARLFDEPQLASLCLDTIDKSTMDAISAEGFTDIDIDTLCAVLERDTLSIRESRL FGAVVRWAEAECQRQQLPVTFGNKQKVLGKALSLIRFPLMTIEEFAAGPAQSGILSDREV VNLFLHFTVNPKPRVEYIDRPRCCLRGKECCINRFQQVESRWGYSGTSDRIRFTVNRRIS IVGFGLYGSIHGPTDYQVNIQIIEYEKKQTLGQNDTGFSCDGTANTFRVMFKEPIEILPN VCYTACATLKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVN VAIAAPPSEGEANAELCRYLSKVLELRKSDVVLDKVSEIHVASVSENGSRLGNWQGANFL SRCQLINLSEEHNCQIDCIVGSASENILSLWEISLRESSFEGCLESLESEPGKLGARGWT YGISQPQKVFWDNTGDNDPS >gi568815583r:82918070_83167151|GENSCAN_predicted_CDS_4|1863_bp atggcctcactcgggcctgccgcagctggggagcaggcgtcgggggctgaggcggagccg ggccccgcggggccgccgccgccgccctcaccgtcctctctggggcccctgctccccctg cagcgggaacctctctacaactggcaggcgaccaaggcgtcgctgaaggagcgcttcgcc ttcctcttcaactcggagctgctgagcgatgtgcgcttcgtactgggcaagggtcgcggc gccgccgccgctgggggcccgcagcgcatccccgcccaccgcttcgtgctggcggccggc agcgccgtctttgacgccatgttcaacggcggcatggccaccacgtcggccgagatcgag ctgccggacgtggagcccgcagccttcctggcgctgctgagatttctatattcagatgaa gttcaaattggtccagaaacagttatgaccactctttatactgccaagaaatacgcagtc ccagccttggaagcacactgtgtagaatttctcaccaaacatcttagggcagataatgcc tttatgttacttactcaggctcgattatttgatgaacctcagcttgctagtctttgtcta gatacaatagacaaaagcacaatggatgcaataagtgcagaagggtttactgatattgat atagatacactctgtgcagttttagagagagacacactcagtattcgagaaagtcgactt tttggagctgttgtacgctgggcagaagcagaatgtcagagacaacaattacctgtgact tttgggaataaacaaaaagttctaggaaaagcactttccttaatccggttcccactgatg acaattgaggaatttgcagcaggtcctgctcaatctggaattttgtcagatcgtgaagtg gtaaacctctttcttcattttactgtcaaccctaaaccccgagttgaatacattgaccga ccaagatgctgtctcaggggaaaggaatgctgcatcaatagattccagcaagtagaaagc cgctggggttacagtgggacgagtgatcgaatcagattcacagttaatagaaggatctct atagttggatttggcttgtatggatctattcatggccctacagattatcaagtgaatata cagatcattgaatatgagaaaaagcaaaccctgggacagaatgataccggctttagttgt gatgggacagctaacacattcagggtcatgttcaaggaacccatagagatcctgcccaat gtgtgctacacagcatgtgcaacactcaaaggtaaaagccagagcaaggaaccagagaga ccacttcctcccttaggtcctgtggcagttgatcctaaaggatgcgtcaccatagccatc catgcaaaacctggctccaaacaaaatgctgtaacagatttgacagcagaggctgtaaat gtagctattgcagcacctccatcagagggagaggctaatgctgagctctgtcggtatctt tccaaggtcctagaactcaggaagagtgatgtggttttggataaggtttcagaaatccac gtagccagtgtttctgaaaatggatctaggctgggcaattggcaaggagctaatttctta tcaagatgccagcttattaacttgagtgaagaacacaactgccaaatagactgtatagtg ggatctgctagtgagaacattctgtccctttgggaaatctctttaagagaatcatcattt gaaggctgcttggagagcctggaatcagagccagggaaattaggggccaggggatggacc tatggtatcagccagccacagaaggtcttttgggacaacactggagataatgatccttca tag >gi568815583r:82918070_83167151|GENSCAN_predicted_peptide_5|97_aa MDLLGEGEVTLLLDLLGPDGQCGSDSVELMGSVKAKVRLSQLESGPLALTERNKAGLLYS CHPINARGPLSLTPGLQLLFPEPHVFLCIVYCLVLVE >gi568815583r:82918070_83167151|GENSCAN_predicted_CDS_5|294_bp atggacctgctgggagagggtgaggtgaccctattgctcgacctcctggggccagatgga caatgtggcagtgattcagtggagctcatggggtcagtgaaagccaaggtgaggctgagc cagctggagagtggccctttggctctgaccgagaggaacaaagctggcttgctgtacagc tgtcaccccatcaatgctaggggacccctcagcctcactcctgggctgcagctcctgttt cctgaaccccacgtcttcctctgcatcgtgtactgcctggttttggtggagtaa >gi568815583r:82918070_83167151|GENSCAN_predicted_peptide_6|183_aa MSASAATGVFVLSLSAIPVTYVFNHLAAQHDSWTIVGVAALILFLVALLARVLVKRKPPR DPLFYVYAVFGFTSVVNLIIGLEQDGIIDGFMTHYLREGEPYLNTAYGHMICYWDGSAHY LMYLVMVAAIAWETAYVYRVPEEAKILFLALNIAYGVLPQLLAYRCIYKPEFFIKTKAEE KVE >gi568815583r:82918070_83167151|GENSCAN_predicted_CDS_6|552_bp atgagtgcctctgcggccaccggggtcttcgtgctgtccctctcggccatcccggtcacc tatgtcttcaaccacctggcggcccagcatgattcctggactattgtaggggttgctgcc ctcatcctgttcctggtagcactgctggctcgtgtcctcgtcaaaagaaaaccaccccgg gacccactgttctatgtgtatgcagtttttggatttaccagcgtggtgaacctcatcata ggactggagcaagatggaatcattgacgggttcatgacacactacttgagagagggtgaa ccgtatctgaacaccgcatatgggcacatgatctgctactgggatggctctgctcattat ctgatgtacctggtgatggtggcagccatagcatgggaaactgcttatgtctacagagtc cctgaagaagcaaaaatcctttttttagcattaaacatagcatatggagttcttcctcag ctcttggcctatcgttgtatctacaaaccagagttcttcataaaaacaaaggcagaagaa aaagtggaataa >gi568815583r:82918070_83167151|GENSCAN_predicted_peptide_7|175_aa IDELPEGAVKPPANKYPIFFFGTHETAFLGPKDLFPYKEYKDKFGKSNKRKGFNEGLWEI ENNPGVKFTGYQAIQQQSSSETEGEGGNTADASSEEEGDRVEEDGKGKRKNEKAGSKRKK SYTSKKSSKQSRKSPGDEDDKDCKEEENKSSSEGGDAGNDTRNTTSDLQKTSEGT >gi568815583r:82918070_83167151|GENSCAN_predicted_CDS_7|528_bp attgatgaactcccagagggcgctgtgaagcctccagcaaacaagtatcctatcttcttt tttggcacccatgaaactgcatttctaggtcccaaagacctttttccatataaggagtac aaagacaagtttggaaagtcaaacaaacggaaaggatttaacgaaggattgtgggaaata gaaaataacccaggagtaaagtttactggctaccaggcaattcagcaacagagctcttca gaaactgagggagaaggtggaaatactgcagatgcaagcagtgaggaagaaggtgataga gtagaagaagatggaaaaggcaaaagaaagaatgaaaaagcaggctcaaaacggaaaaag tcatatacttcaaagaaatcctctaaacagtcccggaaatctccaggagatgaagatgac aaagactgcaaagaagaggaaaacaaaagcagctctgagggtggagatgcgggcaacgac acaagaaacacaacttcagacttgcagaaaaccagtgaagggacctaa