GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:28:46

Sequence gi568815586r:1492874_1694066 : 201193 bp : 47.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +    692    731   40                              -2.06
 1.01 Init +   3130   3206   77  0  2   83   76    47 0.490   3.56
 1.02 Intr +   7263   7419  157  0  1   80   66    59 0.237   2.91
 1.03 Intr +  12283  12417  135  0  0   77   38    59 0.038   0.56
 1.04 Term +  24003  24068   66  2  0  132   42    43 0.218   2.04
 1.05 PlyA +  24751  24756    6                               1.05

 2.09 PlyA -  26705  26700    6                               1.05
 2.08 Term -  32123  32013  111  2  0   80   43    80 0.651   1.26
 2.07 Intr -  36621  36563   59  1  2   79   80    47 0.790   1.60
 2.06 Intr -  38202  37505  698  2  2   62   10   180 0.421  -1.27
 2.05 Intr -  38714  38458  257  2  2   93   70   183 0.746  13.24
 2.04 Intr -  40620  40579   42  0  0   37  105    53 0.411   0.34
 2.03 Intr -  53676  53510  167  1  2   85   33    72 0.118   1.08
 2.02 Intr -  54384  54308   77  2  2   81  115    22 0.366   3.36
 2.01 Init -  78056  77875  182  2  2   60   31   129 0.132   2.21
 2.00 Prom -  80138  80099   40                              -1.06

 3.05 PlyA -  82014  82009    6                               1.05
 3.04 Term -  91863  91725  139  2  1   71   37   115 0.279   2.14
 3.03 Intr - 101217  99995 1223  0  2   82   94  1904 0.109 177.70
 3.02 Intr - 102078 101817  262  2  1  112  -24   191 0.179   7.79
 3.01 Init - 124136 124054   83  2  2   56   80   103 0.191   6.74
 3.00 Prom - 135460 135421   40                              -2.26

 4.00 Prom + 135726 135765   40                              -7.76
 4.01 Init + 138482 138561   80  1  2   95   89   190 0.995  18.33
 4.02 Intr + 139785 140032  248  0  2   96   82   181 0.998  15.40
 4.03 Intr + 146811 147103  293  1  2   54   98   654 0.999  59.85
 4.04 Term + 152921 153379  459  1  0  128   41   933 0.997  87.69
 4.05 PlyA + 153435 153440    6                              -3.94

 5.06 PlyA - 153527 153522    6                              -3.24
 5.05 Term - 153903 153779  125  1  2    7   49   119 0.093  -1.55
 5.04 Intr - 154841 154679  163  2  1  139   62    79 0.201   9.95
 5.03 Intr - 159938 159854   85  1  1   78   88    12 0.031   0.02
 5.02 Intr - 198666 198536  131  0  2   25   30   126 0.145  -0.01
 5.01 Intr - 198891 198808   84  0  0   47  100    66 0.325   3.72

Suboptimal exons with probability >
0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:1492874_1694066|GENSCAN_predicted_peptide_1|144_aa MDKVCSAQEANSNGRQPEGQGSQDLRAPVPGTQPRNKDFRATVPRPPPRLPQQPGPGLEV SSKPQDTGSDPRRAGQVQVCISHCTVRDSSDQTSPGQPAPTLPPPATRTRVDGGGQHRQA VPEVFGHHHSTFCISEFDYSEYLI >gi568815586r:1492874_1694066|GENSCAN_predicted_CDS_1|435_bp atggacaaggtctgcagtgcccaagaggccaactccaatggcaggcagcccgagggccag gggtcccaggatttacgtgctcccgtgccgggaacacaacccaggaacaaggacttccgt gcgacggtgccccggccaccaccacgcctcccgcagcagccaggcccgggccttgaggtc agcagtaaaccccaggacacaggcagtgaccctagacgagcagggcaggtccaggtttgc atcagtcactgcactgtcagggactctagtgaccaaacaagcccgggccagccggccccc accttgccacctcccgccactcgcacccgtgtggacggaggcggccagcatcggcaggct gtgcctgaggtcttcggccaccaccattctactttctgtatctctgagtttgactactct gagtacctcatataa >gi568815586r:1492874_1694066|GENSCAN_predicted_peptide_2|530_aa MELHYMATLGSLVYTITAPPIMTTPPGRTTKGKKIKRGRGELLPLGLDGHVIKDSFIDYS VVQGLCEGDWRIYWFVETKASITTVTGLPLLQLPVPVGAVRAGDPVENKPSSMSVTLQVC GNGCSVQDTLTRIPSFDPHSASITVKTLNRNQPRLPGRAGSPQTELPLPAVCSKYPGVRL RLERGLRVHRGSKKVKEEVSSRLSHGGDVERRSDRRSTKPSGARALPARSGQKRRLPPAS GGPTRRTPSAAAGPGRRRSRARFGSRGGGGGGGAVPGGEGRRLPACPAACQARGLRAPLG PARLFLRPAALGRRARRGPRGRRGRPAGPGADPARSSQRAGRSGRRLRLFIRGRGAGLAA RAGVLGSRGPAEASRAPPPMLGQPPRCPPEPGQAAPALGVCAPSCRPEGSLLRAGDPRTQ RAGTVGGAQPAADPEGGPGPEARAREAAVDLAGLRALLRSAGPGEVSAQVGCGAGNLIAP LGELPKRVSKEVKWNEAVEGVTKNWQNLETLANCNGKSFIISSVPLVKVP >gi568815586r:1492874_1694066|GENSCAN_predicted_CDS_2|1593_bp atggagctgcattacatggcgacactaggcagcttggtctacaccatcactgctcctccc atcatgaccacaccacctgggaggaccaccaaagggaaaaagatcaagagaggccgagga gaattactaccgcttggtctcgatggtcacgtcatcaaagattcattcattgactattca gttgttcagggtctctgtgagggagactggagaatttactggtttgtggaaacaaaagct tccatcacgactgtaacaggcctcccgcttctccagctgcccgtcccagtaggggcagtc agggcgggtgacccggtggaaaataaacctagcagcatgtccgtgacgcttcaggtttgt 
gggaatgggtgcagcgttcaagacactctcacgcgtatcccttcgtttgatcctcacagc gcctcaataacggtgaagacgctcaacagaaaccagccaaggctgcccggccgggcaggc tccccgcaaaccgaactgcctctcccagcggtctgcagcaaatacccgggggttcggctc cgactggagcgcgggctacgcgtccaccgaggctccaagaaagtgaaggaggaggtctcg tcccgcctctcccacggcggggacgtggagcgccggagcgaccggagaagcacaaagccg agcggcgctcgggctctacctgcgcggtccgggcagaaacgcaggctcccacccgcgtcc ggcggacccacccggcggacacccagcgcggccgcggggccgggaaggcggaggagccgc gcgaggttcggcagccggggcggcggcggcggcggcggggccgtccccgggggcgagggc cggcgcttacctgcatgtcccgccgcgtgtcaggcccgcgggctccgggcgccgctcggc ccggcccggctcttcctgcgcccggctgcgctgggtcggcgggcgaggcgcggcccgcgg gggcgcagggggcggccggcggggcccggggcagaccctgcccgctcctctcagcgcgcg ggccgctcgggccgccgcctccgactcttcatccgcggccggggcgcggggctggcggcg cgggcgggcgtgctcggctcccggggtcccgccgaggcgtctcgggccccccccccgatg ctggggcagcctccgaggtgtccaccggagcccggccaggcagccccggcgctcggagtc tgcgcgccctcctgccgcccggagggctccctgctccgcgccggggacccccgaactcag cgggcggggacggtcgggggcgcgcagcctgcggccgatccggagggaggccctgggccc gaggcgcgggctcgggaggccgccgtcgatctcgccgggctgcgcgccctgctccggagc gcgggacccggggaagtttcggcccaagttggctgcggggcgggtaacctcatagcacct ctgggtgagctgcccaagagggtgtccaaggaggtgaagtggaatgaagcagttgaaggt gtcactaaaaactggcaaaatctggagactctagccaattgcaatgggaagagtttcatc atttcatcagtaccactcgtcaaggtcccataa >gi568815586r:1492874_1694066|GENSCAN_predicted_peptide_3|568_aa MNGTAEGQHPWVHSTFNGEPQSLSSSVRVKNQGPRDQAELGFQGNGSTVPAAQARTIARA RGSGSGGGDALQGGGAGAGPGATKRTARLLARGLRRAALRPDGRGVLGAGAGSSQRRGGG RRKMETHISCLFPELLAMIFGYLDVRDKGRAAQVCTAWRDAAYHKSVWRGVEAKLHLRRA NPSLFPSLQARGIRRVQILSLRRSLSYVIQGMANIESLNLSGCYNLTDNGLGHAFVQEIG SLRALNLSLCKQITDSSLGRIAQYLKGLEVLELGGCSNITNTGLLLIAWGLQRLKSLNLR SCRHLSDVGIGHLAGMTRSAAEGCLGLEQLTLQDCQKLTDLSLKHISRGLTGLRLLNLSF CGGISDAGLLHLSHMGSLRSLNLRSCDNISDTGIMHLAMGSLRLSGLDVSFCDKVGDQSL AYIAQGLDGLKSLSLCSCHISDDGINRMVRQMHGLRTLNIGQCVRITDKGLELIAEHLSQ LTGIDLYGCTRITKRGLERITQLPCLKVLNLGLWQMTDSEKVRCTEFSQFSGICKSRMVK FCGPLNFIQLTADGCTKQKRSSSFQSLE >gi568815586r:1492874_1694066|GENSCAN_predicted_CDS_3|1707_bp 
atgaatggaacagcggaaggacagcacccctgggtccacagcaccttcaacggcgagccc cagagcctgtcttcatcagtcagggttaaaaatcaggggccacgggaccaggccgaactc ggtttccagggcaacggctccacagttccggcagcgcaggcccgtaccattgcgcgggcg cgggggagcgggagcggcggaggggacgcgctgcagggcggcggagccggggccggcccg ggcgctaccaaacgcacggcccgcctgctcgcccggggtctgcgccgagccgcgctccgg ccggacggccgcggcgtccttggtgcgggggccggcagctcccaacgccgcggagggggg aggaggaagatggagacccacatctcatgcctgttcccggagctgctggccatgatcttc ggctacctggacgtccgggacaaggggcgcgcggcgcaggtgtgcaccgcctggcgggac gccgcctaccacaagtcggtgtggcggggggtggaggccaagctgcacctgcgccgggcc aacccgtcgctgttccccagcctgcaggcccggggcatccgccgggtgcagatcctgagc ctccgccgcagcctcagctacgtgatccagggcatggccaacatcgagagcctcaacctc agcggctgctacaacctcaccgacaacgggctgggccacgcgtttgtgcaggagatcggc tccctgcgcgctctcaacctgagcctctgcaagcagatcactgacagcagcctgggccgc atagcccagtacctcaagggcctggaggtgctggagctgggaggttgcagcaacatcacc aacactggccttctgctcatcgcctggggtctgcagcgcctcaagagccttaacctccgc agctgccgccacctttcggatgtgggcatcgggcacctggccggcatgacgcgcagcgcg gcggagggctgcctgggcctggagcagctcacgctacaggactgccagaagctcacagat ctttctctaaagcacatctcccgagggctgacgggcctgaggctcctcaacctcagcttc tgtgggggaatctcggacgctggcctcctgcacctgtcgcacatgggcagcctgcgcagc ctcaacctgcgctcctgtgacaacatcagtgacacgggcatcatgcatctggccatgggc agcctgcgcctctcggggctggatgtttcgttctgtgacaaggtgggagaccagagtctg gcttacatagcccaggggctggatggcctcaagtctctctccctctgctcctgccacatc agtgatgatggcatcaaccgcatggtgcggcagatgcacgggctgcgcacgctcaacatt ggacagtgtgtgcgcatcacggacaagggcctggagctgatcgctgagcacctgagccaa ctcaccggcatagacctgtacggctgcacccgaatcaccaagcgcggcctggagcgcatc acgcagctgccgtgcctcaaggtactcaacctgggactctggcagatgacggacagtgag aaggtcagatgcacagaatttagccagttttcaggaatctgtaaaagcaggatggttaag ttctgtggccccctgaacttcatccagctgacagctgacggttgtacgaagcagaaaagg tcttcaagtttccagagcctggagtag >gi568815586r:1492874_1694066|GENSCAN_predicted_peptide_4|359_aa MPSLLLLFTAALLSSWAQLLTDANSWWSLALNPVQRPEMFIIGAQPVCSQLPGLSPGQRK LCQLYQEHMAYIGEGAKTGIKECQHQFRQRRWNCSTADNASVFGRVMQIGSRETAFTHAV SAAGVVNAISRACREGELSTCGCSRTARPKDLPRDWLWGGCGDNVEYGYRFAKEFVDARE 
REKNFAKGSEEQGRVLMNLQNNEAGRRAVYKMADVACKCHGVSGSCSLKTCWLQLAEFRK VGDRLKEKYDSAAAMRVTRKGRLELVNSRFTQPTPEDLVYVDPSPDYCLRNESTGSLGTQ GRLCNKTSEGMDGCELMCCGRGYNQFKSVQVERCHCKFHWCCFVRCKKCTEIVDQYICK >gi568815586r:1492874_1694066|GENSCAN_predicted_CDS_4|1080_bp atgcccagcctgctgctgctgttcacggctgctctgctgtccagctgggctcagcttctg acagacgccaactcctggtggtcattagctttgaacccggtgcagagacccgagatgttt atcatcggtgcccagcccgtgtgcagtcagcttcccgggctctcccctggccagaggaag ctgtgccaattgtaccaggagcacatggcctacataggggagggagccaagactggcatc aaggaatgccagcaccagttccggcagcggcggtggaattgcagcacagcggacaacgca tctgtctttgggagagtcatgcagataggcagccgagagaccgccttcacccacgcggtg agcgccgcgggcgtggtcaacgccatcagccgggcctgccgcgagggcgagctctccacc tgcggctgcagccggacggcgcggcccaaggacctgccccgggactggctgtggggcggc tgtggggacaacgtggagtacggctaccgcttcgccaaggagtttgtggatgcccgggag cgagagaagaactttgccaaaggatcagaggagcagggccgggtgctcatgaacctgcaa aacaacgaggccggtcgcagggctgtgtataagatggcagacgtagcctgcaaatgccac ggcgtctcggggtcctgcagcctcaagacctgctggctgcagctggccgagttccgcaag gtcggggaccggctgaaggagaagtacgacagcgcggccgccatgcgcgtcacccgcaag ggccggctggagctggtcaacagccgcttcacccagcccaccccggaggacctggtctat gtggaccccagccccgactactgcctgcgcaacgagagcacgggctccctgggcacgcag ggccgcctctgcaacaagacctcggagggcatggatggctgtgagctcatgtgctgcggg cgtggctacaaccagttcaagagcgtgcaggtggagcgctgccactgcaagttccactgg tgctgcttcgtcaggtgtaagaagtgcacggagatcgtggaccagtacatctgtaaatag >gi568815586r:1492874_1694066|GENSCAN_predicted_peptide_5|195_aa KKRTLTDGGLGKQDSRYQAGCLTRHLKKGAGATAPEGEQRAAGSDPPPRKARRGEQRAPD SSDSGRKEGGSRRVLPVIAFKVYMAFAVKTKRTKEGQPQGDNNISLPGSSSSALPSWPQI LESKIADSNTEETAKGIPITRALTGTQGTSQGSLDAAQVHVDERVREHQASYGTHLHSGQ DPEDIEVEAEFPPPS >gi568815586r:1492874_1694066|GENSCAN_predicted_CDS_5|588_bp aagaagcgaacactgacggacgggggcttgggaaaacaggactccaggtaccaagcaggc tgcctgacgcgccacctgaaaaagggagctggagcaactgcacccgaaggtgagcagagg gcagcaggctccgaccctccgccgcgaaaggcgaggcgaggggaacagcgggcgccggac tcctccgacagcggtcgcaaggagggcggctccagacgtgtcctgcctgtcatagcattc aaggtctatatggcttttgctgtcaaaacaaaaaggacaaaagaaggccagccacagggc 
gataataacatctcccttcctggatcctcaagctcagccttaccctcctggccccagata ctggagtccaaaattgcagacagcaacacagaggagacggcaaagggcatccccattacc cgtgctttaacagggacccagggaacaagccagggttcgcttgatgccgcacaagtgcac gtggatgaaagagtaagagagcaccaggcctcttacggaacccatctacattctggacaa gaccctgaagacatcgaggttgaagctgagttcccgccaccttcatag