GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:22:20 Sequence gi568815580r:12694281_12984141 : 289861 bp : 43.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 4923 4699 225 0 0 51 86 116 0.538 5.88 1.02 Intr - 6833 6678 156 2 0 34 74 117 0.934 5.11 1.01 Init - 8268 8206 63 0 0 92 100 115 0.868 14.25 1.00 Prom - 21081 21042 40 -2.96 2.00 Prom + 24034 24073 40 -6.46 2.01 Init + 26309 26403 95 1 2 63 113 88 0.741 8.65 2.02 Intr + 30219 30339 121 0 1 69 89 49 0.573 3.60 2.03 Term + 54969 55145 177 2 0 100 49 70 0.164 1.99 2.04 PlyA + 57640 57645 6 1.05 3.06 PlyA - 59765 59760 6 1.05 3.05 Term - 61314 60568 747 0 0 49 42 191 0.764 4.05 3.04 Intr - 62656 62321 336 1 0 -5 84 146 0.342 0.62 3.03 Intr - 62892 62804 89 1 2 35 108 102 0.848 6.59 3.02 Intr - 63997 63877 121 1 1 81 55 48 0.336 0.87 3.01 Init - 64499 64434 66 2 0 74 72 41 0.707 2.17 3.00 Prom - 64910 64871 40 -2.26 4.00 Prom + 68546 68585 40 -3.36 4.01 Init + 71507 71578 72 1 0 45 56 70 0.470 0.61 4.02 Intr + 72738 72905 168 2 0 52 82 101 0.967 6.04 4.03 Intr + 74903 75037 135 1 0 37 60 130 0.873 5.86 4.04 Term + 76329 76397 69 2 0 43 55 69 0.646 -2.96 4.05 PlyA + 78248 78253 6 1.05 5.11 PlyA - 80398 80393 6 1.05 5.10 Term - 81586 81290 297 1 0 5 42 241 0.380 6.47 5.09 Intr - 85069 85015 55 0 1 90 44 29 0.024 -2.32 5.08 Intr - 100205 100104 102 1 0 101 32 137 0.716 8.69 5.07 Intr - 107871 107690 182 0 2 97 99 90 0.949 9.67 5.06 Intr - 120075 119923 153 0 0 90 97 24 0.841 3.77 5.05 Intr - 131664 131530 135 0 0 32 64 83 0.225 1.06 5.04 Intr - 136761 136663 99 0 0 150 109 107 0.992 19.31 5.03 Intr - 147186 147023 164 1 2 90 17 46 0.004 -2.61 5.02 Intr - 164974 164884 91 1 1 49 103 115 0.728 8.67 5.01 Init - 189861 189793 69 0 0 103 97 133 0.006 16.75 5.00 Prom - 201423 201384 40 -3.66 6.00 Prom + 202276 202315 40 -6.56 6.01 Init + 202376 202458 83 1 2 72 80 48 0.572 2.87 6.02 Intr + 202684 202819 136 1 1 47 30 83 0.314 -1.03 6.03 Intr + 206931 207047 117 2 0 63 77 107 0.592 7.76 6.04 Intr + 216991 217123 133 0 1 22 -53 226 0.456 2.12 6.05 Term + 217498 218078 581 2 2 16 49 392 0.894 22.75 6.06 PlyA + 220421 220426 6 1.05 7.08 PlyA - 221381 221376 6 1.05 7.07 Term - 226813 226754 60 1 0 89 50 38 0.218 -2.10 7.06 Intr - 227307 227176 132 0 0 41 23 110 0.097 0.64 7.05 Intr - 234597 234522 76 2 1 68 75 86 0.223 4.82 7.04 Intr - 235323 235219 105 2 0 104 37 65 0.131 2.33 7.03 Intr - 240464 240446 19 0 1 106 30 33 0.030 -5.03 7.02 Intr - 242104 241908 197 0 2 26 107 121 0.487 6.86 7.01 Init - 244087 244011 77 1 2 59 54 69 0.521 1.16 7.00 Prom - 251412 251373 40 -3.56 8.03 PlyA - 251603 251598 6 1.05 8.02 Term - 254311 253767 545 2 2 52 42 368 0.794 23.03 8.01 Init - 254397 254388 10 0 1 61 85 0 0.322 -2.22 8.00 Prom - 256436 256397 40 -2.76 9.00 Prom + 257467 257506 40 -5.36 9.01 Init + 258140 258188 49 1 1 96 58 45 0.121 1.41 9.02 Intr + 259364 259392 29 0 2 59 100 4 0.121 -3.47 9.03 Intr + 261183 261329 147 2 0 83 72 60 0.132 4.33 9.04 Intr + 268880 269091 212 1 2 112 115 -3 0.221 2.51 9.05 Intr + 276873 276971 99 0 0 112 82 110 0.702 12.03 9.06 Intr + 284472 284612 141 0 0 82 105 45 0.625 5.07 9.07 Intr + 288238 288395 158 1 2 68 116 -18 0.051 -1.35 9.08 Intr + 289760 289801 42 0 0 107 90 16 0.043 1.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 189620 189820 201 1 0 23 48 145 0.936 1.39 S.002 Term - 240464 240442 23 0 2 106 49 35 0.851 -0.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_1|148_aa MSLPPEKASELKQLIHQQLSKMDVHGRIREILAETIREELAPDQQHLSTEDLIKALRRRG IIDDVMKELNFVTLILIQHGGIFTFRFWVEKLSWNICKNLSLYLDKFVQRLLYVYIIETN VFVLNLFHVPVNQIFMMAFYLKYTEKAW >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_1|444_bp atgtcgctgcctccggagaaagcctccgagctgaagcagctcatccaccagcagctgagc aagatggatgtccatggtagaataagagaaatccttgctgagactatacgggaagaattg gcacctgatcaacagcatttatcaacagaagatttgatcaaagcccttagacgtcgagga atcattgacgatgtgatgaaagaacttaattttgttactctaatattgatccaacacgga ggtatctttaccttcaggttttgggtggaaaagctttcttggaacatctgcaagaacctg agcctttacctggacaagtttgttcaacgtttactttatgtttacattatcgaaaccaac gttttcgttctaaacctgttccatgtgcctgtgaaccagattttcatgatggctttttac ttgaagtacacagagaaagcttgg >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_2|130_aa MEKSRCIPEIDDSEFCIRIPGGGITKTLYDESCSKEIQMAVLLKFVSEGDNIPDALGLVE YLNEWLQILKPLPQPLLPFLHSWNHSHLGRGGCFLKSLETHSGAHAGPLDSLVSVACPVA TSPRMPLQGR >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_2|393_bp atggaaaaaagccggtgcattcctgaaatagatgattccgagttttgtatccgcattccg ggaggaggtatcacaaaaacactctatgatgaaagctgttctaaagaaatccaaatggca gttctgctgaaatttgtttcagaaggggacaacatcccagatgcattaggtcttgttgag tatcttaatgagtggcttcagatactcaaaccacttccccagcccttgctcccgtttctg cactcttggaatcactctcacctcgggcgtggtggctgcttcctgaagtctctggagaca cacagtggtgcacacgctggccctctggactcgctggtgtcggtcgcctgccccgtggcc acatctcctcggatgccgcttcaaggtcgctga >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_3|452_aa MHNWNRHTQQLAESHIGSLTGRGYINSVALCHNLIWRELDRFSLLQDITLVHYIDDIMLI GSTIKRVMHSSIPSSNGSGIYMIGLEQVRKAQIVLHDMQPPCENGTASALQPLSRKSLKD SSEGKSSQWAELRAVHLAVHVAWKEKWPDVRLDTDSWAVANGLARWSGTWKEHDRKIGDK EVWGRGTRIELSEWSKTVTIFVSHCFYQDYHPSVGSQNALYTNMVFHTALPLTKALTLRL KNCNSGLMLTEFTGLTMFPIIQGWGKVLQKAVYALNQRPIYGAVSPTDRIHMSRNQGVEV EVAPLTITPGDPLAKVLLPFPVTLLSAGLAVLVTEGGKLPTGDTTMSPLNWKLKLPPGHF GLLLPLSQQAKKGVTELAAVIDSNYQDEISPLLHNRGKEEYAWNIGDPLGRLLVLPCPVI KVNGKLQQPNPGRITNNPDPSGMKVWVIPPGK >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_3|1359_bp atgcataattggaatagacatactcagcagctggcagaatcccacattggctccctgact ggtagggggtatatcaactctgtggctttgtgtcataatcttatttggagagaacttgat cgcttttcgcttctgcaggatatcacactggtccattacattgatgacattatgctgatt ggatccaccataaagcgtgtcatgcacagcagcattccatcatcaaatggaagtggtata tacatgatcgggctcgagcaggtccggaaggcacaaatagttctgcacgatatgcagcca ccatgtgaaaatgggacagcttcagcactacagcctctttctaggaaatccttgaaggac agcagtgaagggaaatcttcccagtgggcagaacttcgagccgtgcacctggctgtgcac gttgcatggaaggagaaatggccagatgtgcgattagatactgattcatgggccgtagcc aatggtttggccagatggtcagggacttggaaggagcatgataggaaaattggtgacaaa gaagtctggggaagaggtacgcggatagaactctctgagtggtcaaaaactgtgacgata tttgtatcccattgcttctaccaagactaccatccatccgtgggctcacagaatgcctta tacaccaacatggtattccacacagcattgcctctgaccaaggcactcactttacggcta aagaattgcaacagtgggctcatgctcacggaattcactggtcttaccatgttccccatc atccagggttggggcaaagttctccagaaggccgtgtatgctctgaatcagcgtccaata tatggtgctgtttctcccacagacaggattcatatgtccaggaatcaaggggtggaagtg gaagtggcaccactcaccatcacccctggtgatccactagcaaaagttttgcttcctttt cctgtgacattactttctgctggcctagcggtcttagttacagagggaggaaagctgcca acaggagacacaacaatgagtccattaaactggaagttaaaattgccacctggacatttt gggctcctcttacctttaagtcaacaggctaagaagggagttacagagttggctgcagtg attgactcgaactatcaagatgaaatcagtccactactccacaacagaggaaaggaagag tatgcatggaatataggagatccattagggcgtctcttagtattaccatgccctgtgatt aaggtcaatgggaaactacaacagcccaatccaggcaggattacaaataacccagaccct tcaggaatgaaggtttgggtcattccaccaggaaaatag >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_4|147_aa MGDAAEEGRLELEPPHSTSCAETQGGHKGKPVGDLRAEVKQEVSRVVHIVACQWARLGSM GRQRPQGSTFPEPSPGSDVKCDCALEKVKESDVTGLEAGSELTPVPGDAKHHRGPPVRVG ACGGQPHSIFSGIHRIGLEQVKKEQVA >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_4|444_bp atgggagacgcagcagaagagggccgcctggagctcgagccgccccacagcacctcctgt gctgagacacagggtggtcataagggcaaacctgtgggagatctgagggcagaagtgaag caggaggtgtcacgtgtcgtacacatcgtcgcctgccagtgggctcgcctcgggagtatg ggaaggcagcggccgcagggctccaccttccctgagccaagccctgggtcagatgtaaag tgtgactgtgcgctggagaaggtgaaagaatcagatgttacaggactagaagctggctca gaattgacaccagttcctggagatgcaaaacaccaccgaggtcccccagtcagagtagga gcttgtggaggacagccgcattccatcttcagtggaatacacaggattgggctcgagcag gtcaagaaggaacaagttgcatga >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_5|448_aa MPTTIEREFEELDTQRRWQPLYLEIRNESHDYPHRVAKFPENRNRNRYRDVSPLYWCMKL PSSWFSPSSMLLRVIFMKIKCVYTSPLKNFTGALSRACACSTAHVPFRGPLPNTCCHFWL MVWQQKTKAVVMLNRIVEKESVKCAQYWPTDDQEMLFKETGFSVKLLSEDVKSYYTVHLL QLENINMEKGDDINIKQVLLNMRKYRMGLIQTPDQLRFSYMAIIEGAKCIKGDSSIQKRW KELSKEDLSPAFDHSPNKIMTEKYNGNRIGLEEEKLTGDRCTGLSSKMQDTMEENSESAL RKRIREDRKATTAQKVQQMKQRLNENERKRKRTHDGCGIPVIMFTFQECRKEKTANTAGH QKEQTGDTLPLRNITGTVRVHGFILEVSETKNPPNPGHKTTSISQRPKALVSLGPEVRRG TRGEDEKALEKEGGGRRWECGGANELCG >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_5|1347_bp atgcccaccaccatcgagcgggagttcgaagagttggatactcagcgtcgctggcagccg ctgtacttggaaattcgaaatgagtcccatgactatcctcatagagtggccaagtttcca gaaaacagaaatcgaaacagatacagagatgtaagcccattgtattggtgtatgaagttg ccttcttcttggttcagtccatcttctatgctgctgagagttattttcatgaaaatcaaa tgtgtttataccagtcctttaaagaacttcactggggccctgtccagagcctgtgcctgt agcacggcccacgtgcccttcaggggtccacttcctaacacatgctgccatttctggctt atggtttggcagcagaagaccaaagcagttgtcatgctgaaccgcattgtggagaaagaa tcggttaaatgtgcacagtactggccaacagatgaccaagagatgctgtttaaagaaaca ggattcagtgtgaagctcttgtcagaagatgtgaagtcgtattatacagtacatctacta caattagaaaatatcaatatggaaaaaggagatgatattaacataaaacaagtgttactg aacatgagaaaataccgaatgggtcttattcagaccccagatcaactgagattctcatac atggctataatagaaggagcaaaatgtataaagggagattctagtatacagaaacgatgg aaagaactttctaaggaagacttatctcctgcctttgatcattcaccaaacaaaataatg actgaaaaatacaatgggaacagaataggtctagaagaagaaaaactgacaggtgaccga tgtacaggactttcctctaaaatgcaagatacaatggaggagaacagtgagagtgctcta cggaaacgtattcgagaggacagaaaggccaccacagctcagaaggtgcagcagatgaaa cagaggctaaatgagaatgaacgaaaaagaaaaaggacgcatgatggctgcgggatccca gtcatcatgttcacattccaggagtgcagaaaagaaaaaactgccaacacagccggacat cagaaggaacaaactggggacacgctgcctttaagaaacataacaggcaccgtgagggtc cacggcttcattcttgaagttagtgagaccaagaacccacctaatcccgggcacaaaacc acctcaatctcccaaagaccgaaagccctggtgtcgctgggaccagaggtgagacgtgga acacgtggggaagatgagaaggcgctggagaaggagggtggtggtcgcagatgggaatgt ggtggagccaatgagctgtgcgggtaa >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_6|349_aa MPTLSLDTAAKAHPELRQSPLGSSRPRRTLPALASATLPTGLRQPAGASTERAREQRPSG AKTPEPSRPPPGQPTRVLIFLASAECHISPLFEDVMLTGPGEREVASPLAALGWFWERRK GIDLRTNQVVVIRAVDPEEAEDAQQENPALSCATAPTKEEPSHSDLHRMRAQFLIPKSSP SELEDRGKPLEELAGLARRGPRVLTLSRSSGSARRPRAPPTPFLPGFMGAAGAGNPRGTA RSPAPRIPTSPARQRTRIRPPRNPASLYSPAESTQQAVLEDGPARLAEARGARREAPCSS TLGGAPSGSAERSAGRAAAARPALELHEPVLHTAESPRRGPAEPGTERV >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_6|1050_bp atgcctacgttgtcactcgacacagccgccaaagctcacccggagctgcgtcagtcccca ctgggttcctccaggccaaggagaactcttcctgcgcttgcaagcgccacgctgcccaca ggactgcgccagccggccggcgcctcaacagagcgcgccagggagcagcgcccgtcggga gccaagacgcctgagccatcgaggccgccgccgggccagcccaccagggtcttgatcttc ttggcatctgcagaatgtcacatcagcccactgtttgaggatgtcatgttaactggacct ggtgagcgggaagttgcaagtcctctggctgccttgggctggttctgggagaggcgcaag ggcatcgacctccgcaccaatcaggtggtggtcattagggcagtggacccggaggaggcc gaggacgcccagcaggagaaccctgcgctcagctgtgctacagccccgaccaaggaggag ccttcgcactccgacctgcaccgcatgcgcgcccagttcctgatccccaagagcagcccg tctgagctggaggaccgcggcaagcccttggaggagctggcaggcctggcccgacgagga ccccgcgtcctgacactgtcaaggagctccgggagcgcaaggcgaccgcgcgctccgcca acaccttttcttccggggttcatgggcgctgccggcgctggaaatcccaggggcacggcg aggagcccagctcctaggatcccgacatcgccagcgcggcagaggacgcggatcaggcct ccccgcaacccggcgtccctctacagtccggccgagtccacgcagcaagctgtactggag gacggccctgcacggctcgcagaagcccgcggagcccgtcgagaggccccgtgttcgtcc acgctgggcggcgcgccttccgggagcgcagagagaagcgcaggcagagcggcggccgca agacccgccttggagctccacgagccggtgctgcacacggcagaatcccctcgacgtggg cccgcagagccggggacagagagagtctga >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_7|221_aa MQIGPDEQDAATALDLLVRNVHVRECLSSKGKSASTRKHNSDSTKLSVNVPPSHLGLLML LSQQAKKGAAVFSGVIGPGYQGKIGLLLPDGGSSVNCVCRGKELGAHSKKVAVHKLGPDP AGTLISDFQPPKLHVQELQSPYVFANIGVIIFLMLALLRPLKLTGRRSGADTERKVNIKT LSVMCTFHPCSVSSLCYTQIAEVTGKQVPFGYMNKFFSADF >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_7|666_bp atgcagattggacctgatgagcaagatgcagcaactgctctagacttattagtaagaaat gtgcatgtcagagaatgtcttagttccaaagggaagagtgcttccaccaggaaacacaac agtgattccaccaaactctcagttaacgtgccacccagccacttggggctcctcatgctt ctgagtcaacaggctaagaagggagctgctgtgttttctggggtgattggtcctggctac caagggaaaattgggctgctactccccgacggaggttcatccgtgaactgtgtgtgcaga gggaaggaactgggagcacacagcaagaaagtggctgttcacaagctaggacctgaccct gctggcaccttgatctcggacttccagcctccaaaactccatgtgcaagagcttcagtca ccctacgtctttgccaacattggtgtgatcatctttttgatgttagccctcctaagacca ctgaagctaacaggaagacgttctggtgctgatacagagaggaaagtcaacataaaaacg ctctcagtgatgtgcacattccatccctgttcggtgtcctccttgtgctacacgcagata gctgaggttactgggaaacaggtgccgtttggttacatgaataagttctttagcgctgat ttctga >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_8|184_aa MCPGRRVNSAGKTHDAEADGGMPLGSRPQPRQRGLGEGPAYPAGGSAHVAPAAQAHGRAA REAAVTPQYAGGEGFRSVQPGLARHPTASSGVTEPRHSPVPRAAHPLPFLPSGVGPARKR RAHLNALIAGAGCHPPPVEVERDIVDEILVVRRDAASHKHGFRRARASEEDSGGGSGGGA WPVA >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_8|555_bp atgtgccctggtcgtcgggtgaacagcgcgggcaagacccacgacgccgaggctgacggt gggatgcccctgggttcccgcccgcagccccggcagcgcggcctgggagagggcccggcg taccctgcgggcgggagcgcgcacgtggcaccagctgcgcaggcgcacgggagggcggcc cgcgaggccgccgtgaccccacaatacgcgggcggggagggcttccgctcagtccagcca gggctcgcccgccaccccacagcttccagcggcgtcactgaaccaagacacagccccgtt ccccgcgccgcccacccactccccttccttccttccggggtcggccccgcccgcaagcgc cgcgcgcaccttaacgctctgatcgctggagcaggttgccatccgccgcccgtggaagtc gaaagagacatcgtggatgagatccttgtggtccgccgcgatgctgcgagccacaaacat ggtttccgtcgggcccgcgcctccgaagaggacagtggcggcggcagcggcggcggcgcg tggcccgtagcctag >gi568815580r:12694281_12984141|GENSCAN_predicted_peptide_9|293_aa MGFRHVGQAGLKLLTSGLSKVPPPFKTHSGSVWRVTWAHPEFGQVLASCSFDRTAAVWEE IVGESNDKLRGQSHWVKRTTLVDSRTSVTDVKFAPKHMGLMLATCSADGIVRIYEAPDVM NLSQWSLQHEISCKLSCSCISWNPSSSRAHSPMIAVGSDDSSPNAMAKVQIFEYNENTRK YAKAETLMTVTDPVHDIAFAPNLGRSFHILAIATKDVRIFTLKPVRKELTSSGGPTKFEI HIVAQFDNHNSQVWRVSWNITGTVLASSGDDGCVRLWKANYMDNWKCTGILKX >gi568815580r:12694281_12984141|GENSCAN_predicted_CDS_9|879_bp atggggtttcgccatgttggccaggctggtctcaaactcctgacctcaggtctatccaag gtcccccccccttttaagacacatagtggatctgtatggcgtgtgacatgggcccatcct gaatttgggcaggttttggcttcctgttcttttgaccgaacagctgctgtatgggaagaa atagtaggagaatcaaatgataaactgcgaggacagagccactgggttaaaaggacaact ctggtggatagcagaacatctgttactgatgtgaagtttgctcccaagcacatgggtctt atgttagcaacctgttccgcagatggtatagtaagaatctatgaggcaccagatgttatg aatctcagccagtggtctttgcagcatgagatctcatgtaagctaagctgtagttgtatt tcttggaacccttcaagctctcgtgctcattcccccatgatcgccgtaggaagtgatgac agtagccccaacgcaatggccaaggttcagatttttgaatataatgaaaacaccaggaaa tatgcaaaagctgaaactcttatgacagtcactgatcctgttcatgatattgcattcgct ccaaatttgggaagatctttccatattctagcaatagcgaccaaagatgtgagaattttt acattaaagcctgtgaggaaagaactgacttcctctggtgggccaacaaagtttgaaatc catatagtggctcagttcgataatcataattctcaggtctggcgagtgagttggaatata acaggaacggtgctagcatcttcaggagatgatgggtgtgtaagattgtggaaagctaat tatatggacaattggaagtgtactggtattttgaaagnn