GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:57:08 Sequence gi568815579r:9754189_9960797 : 206609 bp : 49.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 1188 1183 6 1.05 1.10 Term - 4576 3287 1290 1 0 74 45 752 0.999 61.10 1.09 Intr - 5754 5672 83 1 2 76 91 55 0.994 3.96 1.08 Intr - 7980 7894 87 1 0 93 96 8 0.704 2.04 1.07 Intr - 9220 9094 127 1 1 -1 66 136 0.869 2.75 1.06 Intr - 14179 14101 79 0 1 84 105 15 0.789 2.35 1.05 Intr - 14800 14547 254 1 2 36 -14 228 0.403 3.63 1.04 Intr - 31891 31738 154 0 1 68 80 114 0.667 8.67 1.03 Intr - 39217 38688 530 1 2 17 85 172 0.018 1.53 1.02 Intr - 42745 42635 111 1 0 97 64 78 0.435 6.88 1.01 Init - 49141 49094 48 1 0 106 57 -15 0.148 -1.84 1.00 Prom - 50262 50223 40 -3.16 2.04 PlyA - 50689 50684 6 1.05 2.03 Term - 57529 56708 822 1 0 113 36 866 0.976 76.99 2.02 Intr - 64429 64357 73 0 1 114 88 112 0.999 13.21 2.01 Init - 64625 64540 86 2 2 103 80 160 0.999 15.05 2.00 Prom - 65672 65633 40 -8.86 3.00 Prom + 65801 65840 40 -1.66 3.01 Sngl + 66031 66270 240 0 0 59 55 465 0.998 35.38 3.02 PlyA + 66309 66314 6 1.05 4.00 Prom + 66365 66404 40 -7.16 4.01 Init + 72642 72678 37 2 1 49 80 5 0.472 -4.02 4.02 Intr + 73108 73244 137 2 2 80 83 90 0.508 7.99 4.03 Term + 73349 73483 135 1 0 85 43 100 0.378 3.22 4.04 PlyA + 75642 75647 6 1.05 5.00 Prom + 76371 76410 40 -4.16 5.01 Init + 81157 81214 58 0 1 101 55 96 0.984 9.07 5.02 Intr + 84248 84460 213 0 0 105 101 395 0.816 41.19 5.03 Intr + 91622 91747 126 0 0 31 80 122 0.621 6.35 5.04 Intr + 93842 93952 111 0 0 111 69 151 0.987 15.85 5.05 Term + 94902 95011 110 1 2 124 55 230 0.962 21.97 5.06 PlyA + 96137 96142 6 1.05 6.09 PlyA - 97716 97711 6 1.05 6.08 Term - 100675 99998 678 1 0 58 52 1242 0.999 111.29 6.07 Intr - 102725 102619 107 0 2 114 100 155 0.999 19.23 6.06 Intr - 103294 103075 220 1 1 41 59 386 0.997 28.87 6.05 Intr - 103673 103527 147 2 0 75 94 231 0.975 22.83 6.04 Intr - 106606 106457 150 1 0 86 100 155 0.505 16.86 6.03 Intr - 109219 109191 29 2 2 119 80 22 0.247 2.33 6.02 Intr - 114986 114887 100 2 1 87 7 108 0.010 2.28 6.01 Init - 159429 159295 135 0 0 88 78 318 0.979 30.94 6.00 Prom - 170739 170700 40 -3.96 7.03 PlyA - 172004 171999 6 1.05 7.02 Term - 176020 175916 105 1 0 84 42 19 0.101 -4.69 7.01 Init - 182178 182116 63 0 0 68 91 211 0.950 18.55 7.00 Prom - 204015 203976 40 -1.96 8.03 PlyA - 205391 205386 6 1.05 8.02 Term - 206369 206223 147 2 0 87 54 128 0.937 7.20 8.01 Intr - 206556 206463 94 2 1 -14 94 120 0.689 2.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 32357 32341 17 2 2 100 93 17 0.910 1.96 S.002 Term - 114986 114879 108 2 0 87 49 119 0.951 6.41 S.003 Term - 129380 129310 71 0 2 62 42 109 0.897 1.80 S.004 Init - 133372 133327 46 1 1 88 116 34 0.817 7.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:9754189_9960797|GENSCAN_predicted_peptide_1|920_aa MAGDTALRRPGEHVPKTLEFNVFSGNQHGRSEQSLLKHKDISYLIPHNHMIEGRKTLTYK NANKVVKRDVSVIYCRITNHQETAHPRTSRMNAEESEQRHFEVPPNRLPPSWSRPHPKPP PGSAPTVPSEAGGTGSNRCRLRGNWDFSPGSWRTRPPTLPPPHASETGSRALSRFRAPRG ESEVRVIRAAPPSQEPQLNQRFPLHQPLRPGGASGAASPRGGRKTRAGTRPPRPGPEPSV PSETLERNQKCAWPQRRLLPFSRRWPNGFPSSSLCDQVRLQSPPPTRPLALGFLVRRRSQ YRQIDLDPGPPGMRENGTFLYACRRGGPSSSLSFTSSSFQSSWGSAPTLNAVRNRKRKCQ PARSYTRKSISAVVEKGPSVLLSPTGRLQCVPHLVTFEDVAVDFTQEEWTLLDQAQRDLY RDVMLENYKNLIILAGSELFKRSLMSGLEQMEELRTGVTGVLQELDLQLKTKGSPLLQDI SAERSPNGVQLERSNTAEKLYDSNHSGKVFNEHPFLMTHMITHIGEKTSEDNQSGKALRK NFPHSFYKKSHAEGKMPKCVKHEKAFNQFPNLTRQNKTHTQEKLCECKDCWRTFLNQSSL KLHIRSHNGDKHYVCKECGKAFSNSSHLIGHGRIHSGEKPYVCKECGKAFTQSTGLKLHI RTHSGEKPYKCKECGKAFTHSSYLTDHTRIHSGKKPYVCMECGKAFTRSTGLILHMRIHT GEKPYECKECGKAFIHSSYLTKHVRIHSGEKLYLCKACGKAFTRSSGLVLHMRTHTGEKP YECKECGKAFNNSSMLSQHVRIHTGEKPYECKECGKAFTQSSGLSTHLRTHTGEKACECK ECGKAFARSTNLNMHMRTHTGEKPYACKECGKAFRYSTYLNVHTRTHTGAKPYECKKCGK NFTQSSALAKHLRTKACEKT >gi568815579r:9754189_9960797|GENSCAN_predicted_CDS_1|2763_bp atggccggtgatacagccctcaggaggccaggagaacatgtgcccaagactctagagttt aacgtcttcagtgggaatcagcatggacgttcagaacagtccctccttaaacacaaagac atttcctacctcatacctcataatcacatgatagaagggaggaagaccctgacctataaa aacgcaaacaaagtagtgaagagagatgtatcagttatctactgccgtataacaaaccac caggaaacagcgcatcctcgtacaagccgcatgaatgcagaggaaagcgagcagaggcat ttcgaggtcccgcccaacagactccctccatcgtggtcccgcccccaccccaagccgccg ccgggttctgcccccacggtgcccagtgaagccggagggacggggtccaaccgttgtcgg ctgcgcggaaactgggacttctccccagggtcgtggaggacgcggccgccgaccctccca ccgcctcacgcctcggagaccgggtccagagccctcagccgcttccgagccccgagaggg gaatcagaagtgcgcgtgatcagagcggcacccccttcccaggaaccgcagctgaaccaa cggtttcccctgcaccagcctttgcgaccaggtggggcttcaggtgcggcttcgccacgt ggaggcaggaagacccgggcagggactcggcctccgaggccgggtccagagccctcagtt ccttctgaaaccctggagcggaatcagaagtgcgcgtggccacagcggcgcctcctgccc tttagccgccgctggcccaacggtttccccagcagtagcctctgcgaccaggtgcggctt cagtccccgcctccaacccggcccttggcgctgggtttcctggtccggcgtcgttcccaa tatcgacagattgatctggatcctggtccacctgggatgcgagaaaacgggacttttctg tacgcgtgtcgaagaggtggccccagctcttctctcagcttcacatcatcctcattccaa agctcttggggctcggcccctacacttaatgccgtgcgcaaccggaagcggaagtgccag ccggcgcgcagctatacaaggaaaagcatctcagctgttgtggaaaagggccccagcgtc ctcttgtcaccgacagggcgattgcagtgcgttccgcacttagtgacctttgaggatgtg gctgtggactttacccaggaggagtggactttgttggatcaagcccagagagatctctac agagatgtgatgttggagaactacaagaatctcattatactagcagggtctgaattattc aaacgtagtctcatgtctggattggaacaaatggaagagctgaggacaggagtgacagga gttctgcaggaattggatttgcaactcaaaaccaaaggctccccactgctgcaagatatt tctgcagaaagatcaccaaatggagtacaattggagagaagcaatactgcagagaaactg tatgactctaaccattctggaaaagtcttcaatgaacacccatttcttatgactcacatg ataactcacattggagagaaaacttctgaggataatcagagtggaaaagccttaagaaag aactttcctcatagtttttacaagaaaagtcatgctgaggggaaaatgcctaagtgtgtt aaacatgaaaaagccttcaaccagtttccaaatcttactaggcagaataaaactcacaca caagagaaattgtgtgaatgcaaagactgttggagaacttttcttaatcagtcatccctt aagttacatataagatctcacaatggagacaaacactatgtatgtaaggaatgtgggaaa gccttcagtaattcctcacaccttataggacatggaagaattcacagtggagagaagccc tatgtctgtaaagaatgtggtaaagctttcactcaatccacaggacttaaattacacatc agaactcacagtggagaaaaaccatataaatgtaaagagtgtgggaaagccttcacccat tcttcataccttactgatcatacaagaatccacagtggaaagaagccctatgtatgtatg gaatgtggaaaagccttcactagatccacaggacttattttacacatgcgaattcacact ggagaaaagccatatgaatgtaaggagtgtggaaaagcttttattcattcctcatacctt acaaaacatgtaaggattcacagtggagagaagctgtatttatgtaaggcatgtgggaaa gcttttactcgttcctcaggacttgttttacacatgagaacacatactggagaaaagccc tatgaatgtaaagaatgtgggaaagcctttaataattcctcaatgcttagtcaacatgta aggattcacactggagagaagccatatgaatgcaaagaatgtgggaaagctttcactcaa tcctcgggccttagtacccatttaagaactcacactggagaaaaggcctgtgaatgtaag gaatgcggtaaagcatttgctcgttccacaaatcttaatatgcacatgcgaacgcacaca ggagaaaagccttatgcatgtaaagaatgtgggaaagccttcaggtattccacatacctt aacgttcacacacgaactcacactggagcaaaaccatatgaatgtaagaaatgtgggaaa aacttcactcaatcttcagcacttgctaaacatctaagaactaaggcatgtgaaaaaacc tga >gi568815579r:9754189_9960797|GENSCAN_predicted_peptide_2|326_aa MATLVELPDSVLLEIFSYLPVRDRIRISRVCHRWKRLVDDRWLWRHVDLTLYTMRPKVMW HLLRRYMASRLHSLRMGGYLFSGSQAPQLSPALLRALGQKCPNLKRLCLHVADLSMVPIT SLPSTLRTLELHSCEISMAWLHKQQDPTVLPLLECIVLDRVPAFRDEHLQGLTRFRALRS LVLGGTYRVTETGLDAGLQELSYLQRLEVLGCTLSADSTLLAISRHLRDVRKIRLTVRGL SAPGLAVLEGMPALESLCLQGPLVTPEMPSPTEILSSCLTMPKLRVLELQGLGWEGQEAE KILCKGLPHCMVIVRACPKESMDWWM >gi568815579r:9754189_9960797|GENSCAN_predicted_CDS_2|981_bp atggcgactttggtcgaactgccggactcggtcctgctcgagatcttctcttacctcccg gtacgggaccggatccgcatctccagggtctgtcaccgctggaagaggctggtggacgac cggtggctgtggcgacatgtcgacctgacgctctacacgatgcgacctaaagtcatgtgg cacctccttcgaaggtacatggcatcccggctccattccctgcggatgggtggctacctg ttctctggctcccaggccccccagttgtcccctgctctgttgagagccctgggccagaag tgccccaacctgaagcgcctctgcctgcacgtggccgacctgagcatggtgcccatcacc agcctgcccagcaccttgaggaccctggagctgcacagctgcgagatctccatggcctgg ctccacaagcagcaggaccccaccgtgctgcccctgcttgaatgcatcgtgctggaccgc gtccccgccttccgtgacgagcacctgcagggcctgacgcgcttccgggccttgcgctcg ctggtgctgggtggtacctaccgtgtgaccgagacagggctggatgctggcctgcaggag ctcagctatctgcagaggcttgaggtgctgggctgcaccctgtctgccgacagcaccctg ctggccatcagccgccacctccgagatgtgcgcaagatccggctgaccgtgaggggcctc tctgcccctggcctggctgtgctggagggaatgccggccctggagagtctgtgcctgcag ggtcccctcgtcaccccagaaatgccctcccccactgaaatcctctcctcctgcctcact atgcccaagctcagagtccttgagctgcaggggctggggtgggagggtcaggaggcggag aagatcctgtgtaaggggctgccccactgtatggtcatcgtcagggcttgccccaaagag tctatggactggtggatgtaa >gi568815579r:9754189_9960797|GENSCAN_predicted_peptide_3|79_aa MSIRTKLQNKEHAIEALRRAKFKFPGRQKIYISKKWGFTKFNADEFEDMVAEKRLIPDGC GVKYIPNRGPLDKWLALHS >gi568815579r:9754189_9960797|GENSCAN_predicted_CDS_3|240_bp atgtccatccgcaccaagctgcagaacaaggagcatgcgattgaggccttgcgcagggcc aagttcaagtttcctggccgccagaagatctacatctcaaagaagtggggcttcaccaag ttcaatgctgatgaatttgaagacatggtggctgagaagcggctcatcccagatggctgt ggggtcaagtacatccccaatcgtggccctctggacaagtggctggccctgcactcatga >gi568815579r:9754189_9960797|GENSCAN_predicted_peptide_4|102_aa MNGGGSQAHRGQRHQEPGTVRHHAVTAAVGQRRVYSRIKLRLLTGVRIPGERDGQREELP LQEASGSGSGAYPYSRDPLASPPCDRVSLQQTMTSNELGAGA >gi568815579r:9754189_9960797|GENSCAN_predicted_CDS_4|309_bp atgaatggaggcggctctcaagcacacaggggccaaagacaccaagagcctggaactgtc cggcaccatgcggtaacagcagctgttgggcagcgtagagtctactctaggatcaagctt cgcctactgactggagtccgtatcccgggggaacgcgatggccagagggaagagttgccg ctgcaagaggcttcgggctcaggatcaggagcttacccctactcccgggaccctctggcg tccccaccatgtgaccgagtctccctgcaacaaacaatgacttcaaacgagctgggcgcc ggcgcctga >gi568815579r:9754189_9960797|GENSCAN_predicted_peptide_5|205_aa MADEEKLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGNSSSGGKNGQGEPARVRCSHL LVKHSQSRRPSSWRQEKITRTKEEALELINGIPVAVSPALAAGGCGKVMLVQGLGLAPQT LSACRQSCFPEVGYIQKIKSGEEDFESLASQFSDCSSAKARGDLGAFSRGQMQKPFEDAS FALRTGEMSGPVFTDSGIHIILRTE >gi568815579r:9754189_9960797|GENSCAN_predicted_CDS_5|618_bp atggcggacgaggagaagctgccgcccggctgggagaagcgcatgagccgcagctcaggc cgagtgtactacttcaaccacatcactaacgccagccagtgggagcggcccagcggcaac agcagcagtggtggcaaaaacgggcagggggagcctgccagggtccgctgctcgcacctg ctggtgaagcacagccagtcacggcggccctcgtcctggcggcaggagaagatcacccgg accaaggaggaggccctggagctgatcaacgggatccctgtggctgtgtccccagccctg gccgctgggggctgtggcaaggtgatgcttgttcagggcctgggcctggctcctcagacg ctcagtgcctgcaggcagtcctgcttccccgaagtaggctacatccagaagatcaagtcg ggagaggaggactttgagtctctggcctcacagttcagcgactgcagctcagccaaggcc aggggagacctgggtgccttcagcagaggtcagatgcagaagccatttgaagacgcctcg tttgcgctgcggacgggggagatgagcgggcccgtgttcacggattccggcatccacatc atcctccgcactgagtga >gi568815579r:9754189_9960797|GENSCAN_predicted_peptide_6|521_aa MSVPLLKIGAVLSTMAMVTNWMSQTLPSLVGLNGTVSRAGASEKIDTENRINDQLSLSEL AAKELGAKLMARPYADCSEQPEVTVNTQTLFQNPEEGWQLYTSAQAPDGKCICTAVIPAQ STCSRDGRSRELRQLMEKVQNVSQSMEVLELRTYRDLQYVRGMETLMRSLDARLRAADGS LSAKSFQELKDRMTELLPLSSVLEQYKADTRTIVRLREEVRNLSGSLAAIQEEMGAYGYE DLQQRVMALEARLHACAQKLGCGKLTGVSNPITVRAMGSRFGSWMTDTMAPSADSRVWYM DGYYKGRRVLEFRTLGDFIKGQNFIQHLLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHF RSRSVLVQRSLPGAGYNNTFPYSWGGFSDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPH TLEVMRSWDTGYPKRSAGEAFMICGVLYVTNSHLAGAKVYFAYFTNTSSYEYTDVPFHNQ YSHISMLDYNPRERALYTWNNGHQVLYNVTLFHVISTSGDP >gi568815579r:9754189_9960797|GENSCAN_predicted_CDS_6|1566_bp atgtcggtgccgctgctcaagatcggggccgtgctgagcaccatggccatggtcaccaac tggatgtcgcagacgctgccctcgctcgtggggctcaacggcaccgtgtcccgtgcgggc gcctctgagaaaatcgacactgagaaccgcataaatgaccagctttccctttctgagttg gctgctaaggagcttggagccaaacttatggcccggccatatgctgattgttcagagcag ccagaggtcactgtgaacacccaaactctcttccagaacccagaagagggctggcagctg tacacctcagcccaggcccctgacgggaaatgcatctgcacggccgtgatcccagcgcag agtacctgctctcgagatggcaggagtcgggagctgcggcaactgatggagaaggtccag aacgtctcccagtccatggaggtccttgagttgcggacgtatcgcgacctccagtatgta cgcggcatggagaccctcatgcggagcctggatgcgcggctccgggcagctgatgggtcc ctctcggccaagagcttccaggagctgaaggacaggatgacggaactgttgcccctgagc tcggtcctggagcagtacaaggcagacacgcggaccattgtacgcttgcgggaggaggtg aggaatctctccggcagtctggcggccattcaggaggagatgggtgcctacgggtatgag gacctgcagcaacgggtgatggccctggaggcccggctccacgcctgcgcccagaagctg ggctgtgggaagctgaccggggtcagtaaccccatcaccgttcgggccatggggtcccgc ttcggctcctggatgactgacacgatggcccccagtgcggatagccgggtctggtacatg gatggctattacaaaggccgccgggtcctggagttccgtaccctgggagacttcatcaaa ggccagaactttatccagcacctgctgccccagccgtgggcgggcacgggccacgtggtg tacaacggctccctgttctataacaagtaccagagcaacgtggtggtcaaataccacttc cgctcgcgctctgtgctggtgcagaggagcctcccgggcgccggttacaacaacaccttc ccctactcctggggcggcttctccgacatggacttcatggtggacgagagcgggctctgg gctgtgtacaccaccaaccagaacgcgggcaacatcgtggtcagccggctggacccgcac accctcgaggtcatgcggtcctgggacaccggctaccccaagcgcagcgctggcgaggcc ttcatgatctgcggtgtgctctacgtgaccaactcccacctggctggggccaaggtctac ttcgcctattttaccaacacgtccagttacgagtacacggacgtgcccttccacaaccag tattcccacatctcgatgctggattacaacccccgggagcgcgccctctatacctggaac aacggccaccaggtgctctacaatgtcaccctgtttcacgtcatcagcacctctggggac ccctga >gi568815579r:9754189_9960797|GENSCAN_predicted_peptide_7|55_aa MWPLTVPPPLLLLLCSGLAGQACVCGVMRVCVCLVTVCMFRDHEHRDMLVSSSHQ >gi568815579r:9754189_9960797|GENSCAN_predicted_CDS_7|168_bp atgtggccgctcacggtcccgccgccgctgctgctgctgctgtgctcaggcctggccgga caggcatgtgtctgtggtgtcatgcgtgtatgtgtatgtttggtgactgtgtgcatgttc agggaccatgagcacagggacatgcttgtatcatcatcacaccagtga >gi568815579r:9754189_9960797|GENSCAN_predicted_peptide_8|80_aa XHSARFLGTNGEELSFNQTTAATVSVPQDGCRLRKGQTKTLFEFSSSRAGFLPLWDVAAT DFGQTNQKFGFELGPVCFSS >gi568815579r:9754189_9960797|GENSCAN_predicted_CDS_8|243_bp nnccactccgcccgcttccttggcaccaatggagaggagctgtctttcaaccagacgaca gcagccactgtcagcgtcccccaggatggctgccggctccggaaaggacagacgaagacc cttttcgaattcagctcttctcgagcgggatttctgcccctgtgggatgtggcggccact gactttggccagacgaaccaaaagtttgggtttgaactgggccccgtctgcttcagcagc tga