GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:11:38 Sequence gi568815597r:206668639_206872435 : 203797 bp : 45.40% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16592 16870 279 1 0 101 56 586 0.688 51.71 1.02 Intr + 25275 25422 148 2 1 82 70 50 0.063 2.41 1.03 Intr + 37661 37789 129 0 0 96 43 42 0.001 1.17 1.04 Intr + 37928 37976 49 0 1 96 74 23 0.000 -0.56 1.05 Intr + 58415 58511 97 2 1 113 65 49 0.220 5.11 1.06 Intr + 60072 60211 140 2 2 52 103 247 0.410 21.86 1.07 Intr + 60397 60461 65 1 2 44 60 56 0.943 -3.24 1.08 Intr + 60758 60837 80 0 2 119 45 110 0.695 9.07 1.09 Intr + 61334 61460 127 1 1 91 106 135 0.999 15.85 1.10 Intr + 62050 62125 76 2 1 79 76 92 0.988 5.67 1.11 Intr + 62500 62624 125 1 2 85 116 170 0.998 19.73 1.12 Intr + 63002 63087 86 0 2 108 103 114 0.999 14.44 1.13 Intr + 63201 63281 81 2 0 100 100 50 0.991 7.23 1.14 Term + 63937 64080 144 0 0 101 52 227 0.998 18.31 1.15 PlyA + 69259 69264 6 1.05 2.05 PlyA - 71721 71716 6 1.05 2.04 Term - 72727 72657 71 2 2 103 46 76 0.806 3.00 2.03 Intr - 86173 86017 157 1 1 43 100 56 0.165 1.88 2.02 Intr - 87831 87766 66 0 0 35 115 47 0.197 1.20 2.01 Init - 90412 90356 57 1 0 50 101 51 0.237 3.91 2.00 Prom - 97371 97332 40 -3.86 3.06 PlyA - 98987 98982 6 1.05 3.05 Term - 100090 99998 93 1 0 78 53 172 0.999 10.53 3.04 Intr - 101256 101191 66 0 0 113 105 72 0.991 10.50 3.03 Intr - 102421 102269 153 1 0 98 75 169 0.999 16.87 3.02 Intr - 102777 102718 60 0 0 105 100 72 0.998 9.03 3.01 Init - 103797 103633 165 0 0 69 59 165 0.982 9.33 3.00 Prom - 103891 103852 40 -2.96 4.00 Prom + 104266 104305 40 -10.35 4.01 Init + 105340 105450 111 0 0 98 38 58 0.860 2.01 4.02 Intr + 108873 109040 168 2 0 55 100 82 0.898 6.24 4.03 Intr + 116337 116582 246 2 0 56 74 149 0.174 7.96 4.04 Intr + 128471 128609 139 1 1 66 34 50 0.032 -2.56 4.05 Intr + 130223 130368 146 0 2 125 98 0 0.091 4.70 4.06 Intr + 140790 140870 81 2 0 130 58 6 0.032 1.63 4.07 Intr + 168320 168385 66 1 0 103 67 54 0.912 3.90 4.08 Intr + 171212 171364 153 1 0 66 56 205 0.848 15.37 4.09 Intr + 172366 172440 75 0 0 37 110 34 0.326 0.21 4.10 Intr + 175189 175298 110 0 2 122 75 2 0.484 1.38 4.11 Intr + 178630 178695 66 1 0 58 96 43 0.031 0.12 4.12 Intr + 185925 186031 107 0 2 68 80 26 0.029 -0.34 4.13 Intr + 196909 197048 140 2 2 60 83 53 0.505 2.18 4.14 Intr + 197185 197373 189 0 0 73 76 58 0.599 2.88 4.15 Intr + 197661 197726 66 2 0 77 95 21 0.633 0.80 4.16 Intr + 197846 197998 153 1 0 107 56 94 0.848 8.37 4.17 Term + 199849 199926 78 0 0 95 49 35 0.461 -1.94 4.18 PlyA + 200563 200568 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 38495 38317 179 2 2 41 94 135 0.932 8.13 S.002 Init + 153576 153718 143 2 2 86 68 107 0.805 8.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:206668639_206872435|GENSCAN_predicted_peptide_1|541_aa MLSNSQGQSPPVPFPAPAPPPQPPTPALPHPPAQPPPPPPQQFPQFHVKSGLQIKKNAII DDYKVTSQVLGLGINGKVLQIFNKRTQEKFALKLSHMQNVTECMGFFYLLTAVLMCVHGG VASSSTPQGSWRLVEQDEERGSATETQSHSDGGGAKPVPLIIASLLVEKGQMLLCSYPDH PAGRLGSKPLPRTSELTEAGMGMSLGHPSPLNVQSSWSVDCVGLILTSHEDAKSMLQDCP KARREVELHWRASQCPHIVRIVDVYENLYAGRKCLLIVMECLDGGELFSRIQDRGDQAFT EREASEIMKSIGEAIQYLHSINIAHRDVKPENLLYTSKRPNAILKLTDFGFAKETTSHNS LTTPCYTPYYVAPEVLGPEKYDKSCDMWSLGVIMYILLCGYPPFYSNHGLAISPGMKTRI RMGQYEFPNPEWSEVSEEVKMLIRNLLKTEPTQRMTITEFMNHPWIMQSTKVPQTPLHTS RVLKEDKERWEDVKEEMTSALATMRVDYEQIKIKKIEDASNPLLLKRRKKARALEAAALA H >gi568815597r:206668639_206872435|GENSCAN_predicted_CDS_1|1626_bp atgctgtccaactcccagggccagagcccgccggtgccgttccccgccccggccccgccg ccgcagccccccacccctgccctgccgcaccccccggcgcagccgccgccgccgcccccg cagcagttcccgcagttccacgtcaagtccggcctgcagatcaagaagaacgccatcatc gatgactacaaggtcaccagccaggtcctggggctgggcatcaacggcaaagttttgcag atcttcaacaagaggacccaggagaaattcgccctcaaactcagccacatgcagaatgtc accgagtgcatgggtttcttctacctcttgactgcagtattgatgtgtgtccatggaggt gtggcctcttcctctactccccagggatcctggagacttgtggagcaagatgaggaaagg ggctcagccacagagacccagagtcactcagatgggggaggagcaaagccagtccctctg atcattgcctctttgttagtggagaaagggcagatgctgctctgcagttatcccgaccac cctgcaggacgtctggggagcaagcccttgccaagaacctctgagctcaccgaggcgggt atgggcatgtctcttggacaccccagccctctgaatgtccaaagcagttggagtgtggac tgtgtagggctcatcttgacttcccatgaagatgctaaatctatgcttcaggactgcccc aaggcccgcagggaggtggagctgcactggcgggcctcccagtgcccgcacatcgtacgg atcgtggatgtgtacgagaatctgtacgcagggaggaagtgcctgctgattgtcatggaa tgtttggacggtggagaactctttagccgaatccaggatcgaggagaccaggcattcaca gaaagagaagcatccgaaatcatgaagagcatcggtgaggccatccagtatctgcattca atcaacattgcccatcgggatgtcaagcctgagaatctcttatacacctccaaaaggccc aacgccatcctgaaactcactgactttggctttgccaaggaaaccaccagccacaactct ttgaccactccttgttatacaccgtactatgtggctccagaagtgctgggtccagagaag tatgacaagtcctgtgacatgtggtccctgggtgtcatcatgtacatcctgctgtgtggg tatccccccttctactccaaccacggccttgccatctctccgggcatgaagactcgcatc cgaatgggccagtatgaatttcccaacccagaatggtcagaagtatcagaggaagtgaag atgctcattcggaatctgctgaaaacagagcccacccagagaatgaccatcaccgagttt atgaaccacccttggatcatgcaatcaacaaaggtccctcaaaccccactgcacaccagc cgggtcctgaaggaggacaaggagcggtgggaggatgtcaaggaggagatgaccagtgcc ttggccacaatgcgcgttgactacgagcagatcaagataaaaaagattgaagatgcatcc aaccctctgctgctgaagaggcggaagaaagctcgggccctggaggctgcggctctggcc cactga >gi568815597r:206668639_206872435|GENSCAN_predicted_peptide_2|116_aa MDIKIESGDHYVGVIDTKMPVNVKEKFLKEIKSDTPLNTQMAKCYQTASHATEKSVVKES INPANFTAVYFKKLLQPPQPSATTTLISSHQHQGICMPVREAGSPVKIKHQEAKQG >gi568815597r:206668639_206872435|GENSCAN_predicted_CDS_2|351_bp atggacatcaagattgagtctggagatcactatgtgggggtcattgatacaaagatgcca gtaaatgtgaaggaaaagtttctgaaggaaattaaaagtgatactccactgaatacacag atggcaaaatgctatcaaacagcgtcacatgctacagagaaatctgttgtgaaagagtca atcaatccagcaaacttcactgctgtctattttaagaaattgctacaaccacctcagcct tcagcaaccaccaccttaatcagcagccatcaacatcaaggcatttgcatgccagtcaga gaagcaggtagcccggtcaagatcaaacaccaggaggccaaacaaggctga >gi568815597r:206668639_206872435|GENSCAN_predicted_peptide_3|178_aa MHSSALLCCLVLLTGVRASPGQGTQSENSCTHFPGNLPNMLRDLRDAFSRVKTFFQMKDQ LDNLLLKESLLEDFKGYLGCQALSEMIQFYLEEVMPQAENQDPDIKAHVNSLGENLKTLR LRLRRCHRFLPCENKSKAVEQVKNAFNKLQEKGIYKAMSEFDIFINYIEAYMTMKIRN >gi568815597r:206668639_206872435|GENSCAN_predicted_CDS_3|537_bp atgcacagctcagcactgctctgttgcctggtcctcctgactggggtgagggccagccca ggccagggcacccagtctgagaacagctgcacccacttcccaggcaacctgcctaacatg cttcgagatctccgagatgccttcagcagagtgaagactttctttcaaatgaaggatcag ctggacaacttgttgttaaaggagtccttgctggaggactttaagggttacctgggttgc caagccttgtctgagatgatccagttttacctggaggaggtgatgccccaagctgagaac caagacccagacatcaaggcgcatgtgaactccctgggggagaacctgaagaccctcagg ctgaggctacggcgctgtcatcgatttcttccctgtgaaaacaagagcaaggccgtggag caggtgaagaatgcctttaataagctccaagagaaaggcatctacaaagccatgagtgag tttgacatcttcatcaactacatagaagcctacatgacaatgaagatacgaaactga >gi568815597r:206668639_206872435|GENSCAN_predicted_peptide_4|697_aa MDPDGGHFQCSASSSPFCISNGTTPVAAGSELAALEKLKPKSGEQASSLVRVALELQREG SGIRACWTLGLDSLLGNTVSRQRAGRLTMQAEKSTVRPVWVLVQSQELPAATGFEVLKSP NASIVSHGKRRAGREEWPVLETGYLTPVKTTVDEYSGHAGSGSADSWILEGGSMQCRDFA AGQDVNILGFCMVKLSHLSGVIVTQSLPNLFISPGPHGILPKRCLHTLTGVQECALRERF RTDLRVPYHSHMCTHISMCVCQCFGALFHGVFTLAQLQGKTLLANWVEYFSAQPTVLQAK DTFPNVTILSTLETLQIIKPLDVCCVTKNLLAFYVDRVFKDHQEPNPKILRKISSIANSF LYMQKTLRQCQEQRQCHCRQEATNATRVIHDNYDQAHCDPAIVVCSVFRKHTRHVLTIQC MHFVYMHTHKLRTWLLDGISEPALGNSGAHSHEGSASPDGSLPCPGNGSCSFCMANQAPG SLPGSPCYSGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIFSSREAWQFFLLLWS PDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNGFSEIRGSVQ AKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKISSLAN SFLTIKKDLRLCLEPQAAVVKALGELDILLQWMEETE >gi568815597r:206668639_206872435|GENSCAN_predicted_CDS_4|2094_bp atggacccagatggagggcatttccagtgttctgctagttcttcacctttctgcatcagc aatgggacaactccagtggcagcaggttctgagcttgctgcactagagaagctcaaaccc aagagtggggagcaggctagcagccttgtgagggtggctctggagctgcagagggaaggt tcagggatccgggcttgctggactctgggtttagacagcttgttgggcaacacggtcagc cggcagagagcaggcaggttgacgatgcaggcagagaagagcacagtgaggccagtctgg gtgctggttcagtcccaggagttgccagctgctacgggctttgaagtattaaagtccccc aatgcctccattgtatcccatgggaaaaggcgagcaggcagggaggaatggccagtgcta gaaacaggatatctgacccctgtgaaaaccactgtggatgaatactcgggtcatgcaggc tctggatcagcagattcttggatccttgaaggagggtccatgcagtgtagagactttgct gcgggacaggatgtgaacatccttggcttttgcatggtcaagttaagtcacttgtccggg gtcatagtgacacaatcattgccaaacctgttcatttcaccagggccccatgggattttg ccaaagcggtgcttgcacacactgacaggagtccaagaatgtgcactgagggagcgtttc cgcacagatctgcgtgttccttaccactcacacatgtgcacacacatatccatgtgtgtg tgccagtgctttggggctctgttccacggggtcttcactttagctcagcttcaagggaag actctgttggccaactgggtagaatatttctctgcccagcccacagttctgcaagctaag gacaccttcccaaatgtcactatcctgtccacattggagactctgcagatcattaagccc ttagatgtgtgctgcgtgaccaagaacctcctggcgttctacgtggacagggtgttcaag gatcatcaggagccaaaccccaaaatcttgagaaaaatcagcagcattgccaactctttc ctctacatgcagaaaactctgcggcaatgtcaggaacagaggcagtgtcactgcaggcag gaagccaccaatgccaccagagtcatccatgacaactatgatcaggctcactgtgaccca gccatagttgtttgctctgtgtttcgcaaacacaccaggcatgttctcaccatacagtgt atgcactttgtgtacatgcacactcataaactgaggacttggctcttggatggcatttct gaacctgccctggggaacagtggagcccacagccatgaagggtctgcatctccagatggc tctctgccatgtcctggcaatggctcttgtagtttctgcatggccaaccaggctccaggt agccttcctggttctccctgctactcaggccttacctgctgggcactaacggcggagcca ggatggggacagaataaaggagccacgacctgtgccaccaactcgcactcagactctgaa ctcagacctgaaatcttctcttcacgggaggcttggcagtttttcttactcctgtggtct ccagatttcaggcctaagatgaaagcctctagtcttgccttcagccttctctctgctgcg ttttatctcctatggactccttccactggactgaagacactcaatttgggaagctgtgtg atcgccacaaaccttcaggaaatacgaaatggattttctgagatacggggcagtgtgcaa gccaaagatggaaacattgacatcagaatcttaaggaggactgagtctttgcaagacaca aagcctgcgaatcgatgctgcctcctgcgccatttgctaagactctatctggacagggta tttaaaaactaccagacccctgaccattatactctccggaagatcagcagcctcgccaat tcctttcttaccatcaagaaggacctccggctctgtctggaacctcaggcagcagttgtg aaggctttgggggaactagacattcttctgcaatggatggaggagacagaatag