GENSCAN 1.0    Date run: 6-Nov-116    Time: 05:27:22

Sequence gi568815588r:32169066_32446915 : 277850 bp : 40.71% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8346   8478  133  2  1  106   65    46 0.722   4.46
 1.02 Intr +   9931   9993   63  2  0   80   96    48 0.100   2.57
 1.03 Term +  13575  13717  143  1  2   75   54    91 0.056   1.51
 1.04 PlyA +  15744  15749    6                               1.05

 2.03 PlyA -  18135  18130    6                               1.05
 2.02 Term -  19683  19255  429  0  0   77   38   457 0.999  33.92
 2.01 Init -  23486  23433   54  2  0   79  115    41 0.633   7.23
 2.00 Prom -  24324  24285   40                              -6.95

 3.03 PlyA -  24466  24461    6                               1.05
 3.02 Term -  30749  30660   90  2  0   93   45    60 0.317  -1.06
 3.01 Init -  40188  39799  390  0  0   71   13   204 0.329   8.02
 3.00 Prom -  40242  40203   40                              -5.65

 4.05 PlyA -  40359  40354    6                               1.05
 4.04 Term -  41178  40963  216  0  0   67   54   179 0.463   8.56
 4.03 Intr -  58369  58327   43  0  1   89   92    25 0.017   0.32
 4.02 Intr -  58812  58678  135  2  0  103   -4   112 0.042   2.26
 4.01 Init -  61253  61123  131  2  2   59   92   102 0.948   7.37
 4.00 Prom -  70539  70500   40                              -5.75

 5.02 PlyA -  70548  70543    6                               1.05
 5.01 Sngl -  74035  73610  426  1  0   44   32   220 0.957   8.04
 5.00 Prom -  78165  78126   40                              -5.05

 6.16 PlyA -  79343  79338    6                               1.05
 6.15 Term -  84669  84509  161  1  2  101   44    64 0.084   0.42
 6.14 Intr -  84914  84742  173  1  2   67   31    87 0.012  -0.44
 6.13 Intr - 102852 102489  364  1  1  100  113   228 0.900  19.82
 6.12 Intr - 103102 102961  142  1  1   85  110   155 0.975  16.41
 6.11 Intr - 104216 104098  119  0  2   60   94   162 0.961  13.26
 6.10 Intr - 115985 115633  353  1  2   82   76    84 0.325   0.64
 6.09 Intr - 117777 117629  149  0  2   54  116    26 0.971   0.11
 6.08 Intr - 117950 117861   90  2  0   72   87    94 0.990   6.97
 6.07 Intr - 118209 118033  177  0  0   78   87   233 0.999  21.39
 6.06 Intr - 122257 122098  160  0  1   83   55   154 0.952  10.67
 6.05 Intr - 123579 123431  149  0  2   49   80    96 0.979   3.01
 6.04 Intr - 124129 123923  207  1  0   19  106   219 0.993  15.15
 6.03 Intr - 124672 124527  146  2  2   56   87   220 0.996  17.78
 6.02 Intr - 136866 136707  160  0  1  121   94    96 0.969  12.14
 6.01 Init - 149590 149513   78  1  0   43   36   101 0.271   1.51
 6.00 Prom - 163606 163567   40                              -5.65

 7.00 Prom + 174874 174913   40                              -5.65
 7.01 Init + 176568 176619   52  2  1   60   98    51 0.955   4.67
 7.02 Intr + 177343 178007  665  2  2   70   39   537 0.992  37.55
 7.03 Intr + 188101 188193   93  0  0   24   98    78 0.211   1.64
 7.04 Term + 195864 196019  156  2  0  105   50    57 0.279   0.55
 7.05 PlyA + 196276 196281    6                               1.05

 8.00 Prom + 197603 197642   40                              -6.05
 8.01 Init + 200074 200172   99  0  0   61   63   143 0.888   9.41
 8.02 Intr + 210031 210156  126  0  0   65   77   110 0.180   7.56
 8.03 Intr + 210472 210544   73  0  1   90   91    23 0.179   0.86
 8.04 Term + 210688 210743   56  2  2   63   47    22 0.103  -7.66
 8.05 PlyA + 210777 210782    6                               1.05

 9.04 PlyA - 211547 211542    6                               1.05
 9.03 Term - 212230 212202   29  2  2  127   43    50 0.848   1.66
 9.02 Intr - 215444 215346   99  0  0   67  116    93 0.918   9.16
 9.01 Init - 220838 220736  103  2  1   74   58    90 0.438   5.05
 9.00 Prom - 231022 230983   40                              -3.65

10.07 PlyA - 231890 231885    6                               1.05
10.06 Term - 240017 239878  140  0  2   65   54   108 0.484   2.24
10.05 Intr - 266625 266475  151  0  1    8   82   167 0.188   6.91
10.04 Intr - 268843 268777   67  0  1  101   65    57 0.175   2.69
10.03 Intr - 269299 269204   96  0  0   76   86    52 0.576   2.01
10.02 Intr - 272352 272316   37  1  1   88   86    14 0.096  -2.40
10.01 Init - 277085 276944  142  2  1   53   33   155 0.113   5.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  58812  58659  154  2  1  103   42   119 0.926   5.21

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_1|112_aa
MDLFWGATQIWVEVTQEVQVTPTPCYCPQSRSPQVRLHGIPNEEGKGSRSIPQERFLDVT
QEITQGGRPQLLPRVGGEMPAGNFISADTATARQPHRTGDKTRNKLLLPPHL

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_1|339_bp
atggacttgttctggggagcaactcagatttgggtggaggtaactcaggaggttcaagtc
acacccactccctgctactgtccccagagtaggagtccccaagtcaggctccatggtatc
cccaatgaagagggaaaggggtcccgatccataccccaagagaggttcttggatgtcaca
caagaaataactcagggaggaagaccccagctgctgccccgggtgggtggggagatgcca
gcgggaaacttcatctccgcagacacagcaactgcccgacagccccacaggactggggac
aagacacgaaataaattgcttctcccaccacacctctga

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_2|160_aa
MVVQQCECTLMPLKMAKMLFADKFPKTAENFRALSTGEKGFSYKGSCFHRIIPGFMYQGR
DFTRHNGTGGKSIYGEKFEDENFILKHTGPGILSMANAGPNTNGSQFFICTAKTEWLDGK
HVVFGKVKEGMNIMEAMERFGSRNGKTSKKITIADCGQLE

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_2|483_bp
atggttgtacaacaatgtgaatgtaccttaatgccgcttaaaatggctaaaatgctgttt
gcagacaagttcccaaagacagcagaaaattttcgtgctctgagcactggagagaaagga
tttagttataagggttcctgctttcacagaattattccagggtttatgtatcagggtcgt
gacttcacacgccataatggcactggtggcaagtccatctatggggagaaatttgaagat
gagaacttcatcctaaagcatacaggtcctggcatcttgtccatggcaaatgctggaccc
aacacaaatggttcccagtttttcatctgcactgccaagactgagtggttggatggcaag
catgtggtctttggcaaagtgaaagaaggcatgaatattatggaggccatggagcgcttt
gggtccaggaatggcaagaccagcaagaagatcaccattgctgactgtggacaactcgaa
taa

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_3|159_aa
MGKDFMAKTSKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYPSDKGL
ISRIYKELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAANGHMKKCSSSLAIREMQIKTT
MTYHLTPVRMLMINNKCPMQKVQGPQLKEIFHQNAINAL

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_3|480_bp
atgggcaaggacttcatggctaaaacatcaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg
aacaggcaacctacagaatgggagaaaatttttgcaatctacccatctgacaaagggcta
atatccagaatctacaaagaacttaaacaaatttacaagaaaaaaacaaacaaccccatc
aaaaagtgggtgaaggatatgaacagacacttctcaaaagaagacatttatgcagccaac
ggacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca
atgacataccatctcacaccagttagaatgctaatgattaataataagtgccccatgcaa
aaagtacaaggtcctcagctaaaggaaatctttcaccaaaatgccatcaatgcactttga

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_4|174_aa
MQTGPKAAQAESWLPVRRENFRELPGLKRPHGGKQWTSRVLLHGLQVPAHQTPSSSAFGL
LDLHQWFARDSRAFGHRPKAALMASLLLSPVLTCKMLETHIKQFNHVRKSKATVLNLPAL
QNALEGLLKPRLLGSTHSVSASVGLRRDWRICIAPASQVMPMLLTGDHTLRSIA

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_4|525_bp
atgcagacaggaccaaaggcagcccaggctgagtcctggttaccagtgagacgtgaaaac
ttcagagagctgccagggctgaaaaggccacatggagggaaacagtggacctctcgggtc
ctcctgcatggactccaagttcctgcccatcagactccaagttcctcagcttttggactc
ttggacttacaccagtggtttgccagggactctcgggcctttggccacagaccgaaagct
gcactgatggcttccctacttttgagcccagttttgacttgtaaaatgttggagacgcac
atcaaacagttcaaccacgttcgaaaatctaaagcaacagttctcaatttaccagcgctt
cagaatgccctggagggcttgttaaaacctagattgctgggcagcacccacagtgtgtcc
gcttcagtaggtctgaggcgggactggagaatttgcattgctcctgcttcccaggtgatg
ccgatgctgctgactggggaccacactttgagaagcattgcttga

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_5|141_aa
MEGATAVSGGLLFRGSQQPLTLKTSTAIAAIMDSAEKEMCLPTEGATTGLLELELSPSFL
LPATTCMPGLQSYGCSTHTCASDPSPQPLTSIHAAETDTAASVGMPEPQTLKLPPQQAHL
RLGPWSHNCSSLEKKRTGGPQ

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_5|426_bp
atggaaggtgccacagcagtctccggtggcctgctcttcagaggatcccaacaacctttg
acactgaagacctcaacagccattgcagctatcatggattctgcagaaaaagagatgtgc
ctccctacagaaggagccaccactgggctccttgagctggagctttcaccttccttcctc
ctgcctgcaaccacatgtatgcctggactccagagctatggctgctccacacacacctgt
gcttcagaccccagcccacagccactcacgagcatccatgctgcagaaacagacactgct
gcctcagtgggcatgcctgaacctcaaacattgaagctaccaccacagcaggcacacctg
cgccttgggccctggagccataattgctcctcattggagaaaaagagaacaggaggacct
cagtaa

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_6|875_aa
MTVKLADVGQEEEQIWRVMEQGCRCYEHHLQRAISAQQVYGEKRDNMVIPVPEAESNIAY
YESIYPGEFKMPKQLIHIQPFSLDAEQPDYDLDSEDEVFVNKLKKKMDICPLQFEEMIDR
LEKGSGQQPVSLQEAKLLLKEDDELIREVYEYWIKKRKNCRGPSLIPSVKQEKRDGSSTN
DPYVAFRRRTEKMQTRKNRKNDEASYEKMLKLRRDLSRAVTILEMIKRREKSKRELLHLT
LEIMEKRYNLGDYNGEIMSEVMAQRQPMKPTYAIPIIPITNSSQFKHQEAMDVKEFKVNK
QDKADLIRPKRKYEKKPKVLPSSAAATPQQTSPAALPVFNAKDLNQYDFPSSDEEPLSQV
LSGSSEAEEDNDPDGPFAFRRKAGCQYYAPHLDQTGNWPWTSPKDGGLGDVRYRYCLTTL
TVPQRCIGFARRRVGRGGRVLLDRAHSDYDSVFHHLDLEMLSSPQHSPVNQFANTSETNT
SDKSFSKDLSQILVNIKSCRWRHFRPRTPSLHDSDNDELSCRKLYRSINRTGTAQPGTQT
CSTSTQSKSSSGSAHFAFTAEQYQQHQQQLALMQKQQLAQIQQQQANSNSSTNTSQGFVS
KTLDSASAQFAASALVTSEQLMGFKMKDDVVLGIGVNGVLPASGVYKGLHLSSTTPTALV
HTSPSTAGSALLQPSNITQTSSSHSALSHQVTAANSATTQVLIGNNIRLTVPSSVATVNS
IAPINARHIPRTLSAVPSSALKLAAAANCQVSKVPSSSSVDSVPRVKGQDRHWFTLSSTA
LAVSFWLPLSRPYGSLGQWGREDSPPYVLSSQLASGGLQTPRNPWPLPPANKIILSTRCL
GQKDLNQQMLFAVFAGTACRSQTGAPSLGEAELRW

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_6|2628_bp
atgacagtgaaacttgctgacgtaggtcaggaggaggaacagatttggagagtcatggag
caggggtgtcgatgttatgaacatcatcttcagcgggctatttcagcacagcaggtgtat
ggcgagaagagggataatatggttataccggtcccagaggcagaaagtaatattgcttac
tatgagtctatatatcctggggaatttaagatgccaaagcagctcattcacatacagcct
tttagtttggatgctgaacagcctgattatgatttggattctgaagatgaagtatttgtg
aataaactgaaaaagaaaatggacatctgcccattgcaatttgaggagatgattgaccgc
ctagaaaaaggcagtggtcagcagccagtcagtctgcaggaagccaaactactgctaaaa
gaagatgatgaactaattagagaagtttatgaatattggattaaaaagagaaaaaactgt
cgagggccatctcttattccatcagtaaaacaagagaagcgagatggttccagcacaaat
gatccttatgtggcttttagaaggcgtactgaaaaaatgcagactcgaaaaaatcgcaaa
aatgatgaagcctcttacgaaaaaatgcttaagctgcgacgagatctaagtcgagctgtt
actattctagagatgataaaaagaagagaaaaaagtaaaagagagctattgcacttaaca
ctggaaattatggaaaagaggtataatttgggcgactacaatggagagatcatgtctgag
gttatggcacagagacagccaatgaaacctacttatgccatccccatcatccctattact
aatagcagtcaatttaaacaccaggaagcaatggatgtgaaggagttcaaagttaataag
caagataaagccgatcttatccgaccgaaacggaaatatgaaaagaagcccaaagtctta
ccatcgtctgccgctgctactccccaacagacgagtcctgctgcactgccagtcttcaat
gctaaagatctgaatcagtatgactttcccagctcagacgaagaacctctctcccaggtt
ttgtctggctcttcggaagctgaggaagacaatgatcctgatggtccttttgctttccgt
aggaaagcaggctgtcagtactatgctcctcacttagaccaaactggcaactggccttgg
actagtcctaaagatggaggattaggggatgtgcgatatagatactgcttaactactctc
accgtaccccaaaggtgtattggatttgcacgaagacgggttgggcgcggtggaagggtc
ttactggacagagctcattcagactatgacagtgtgtttcaccatctggatttggaaatg
ctttcctcaccacaacattctccagtcaatcagtttgccaatacctcagaaacaaatacc
tcggacaaatctttctctaaagacctcagtcagatactagtcaatatcaaatcatgtaga
tggcggcattttaggcctcggacaccatccctacatgacagtgacaatgatgaactctcc
tgtagaaaattatataggagtataaaccgaacaggaacagcacaacctgggacccagaca
tgcagtacctctacgcaaagtaaaagtagcagtggttcagcacactttgcatttacagcc
gaacaataccagcaacatcaacagcaactggcactcatgcagaaacagcagcttgcacaa
attcagcaacagcaagcaaatagtaattcctccaccaacacatcacagggttttgtttct
aagactttggattctgctagtgcacagtttgctgcttctgctttggtgacatcagaacaa
ctgatgggattcaagatgaaggatgatgtggtgcttggaatcggggtgaatggcgtcctt
ccagcctcaggagtatacaagggcttacacctcagtagtactacaccaacagcacttgta
catacaagtccatcaacggcaggttcagctttgttacagccttcaaatattacacagact
tcaagttcccacagtgcactgagtcatcaagtaactgctgccaattctgcaacaactcag
gttctgattgggaacaacattcgattaactgtaccttcatcagttgccactgtaaactct
attgccccaataaatgcacgacatatacctaggactttaagtgctgttccatcatctgcc
ttaaagctggccgctgcagcaaactgtcaagtttccaaggtcccatcttcatcctctgta
gattcagttccaagggtaaaggggcaggacagacactggttcacactgtcttccacagct
ctggcagttagcttctggttacctttatccaggccctacgggtcactggggcagtggggt
agggaggatagccccccatatgtgctttcatctcagctggccagtggaggtcttcagacc
ccaagaaacccctggcctttacctcctgccaataagattatcctctcaactagatgcctg
gggcagaaagacctaaatcaacaaatgctctttgcagtgtttgctggaacagcttgcaga
tcacagactggtgctcccagccttggagaagcggagctcaggtggtga

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_7|321_aa
MRGRFYYPGVRTQLPQQATQPVPLSGCTRPNPRTKTQNGCRQLRHVTPLCLSGPRTPRDP
GTSGQGQRVALEGAPLGRGGRGSRLGEAGYRPGRRLRVGGEEKMAAILCGFAAPPPPQAA
EGAGLTVGRTGELTRTDSSSFSIPVGICGTALFIEAYSCRSGRSSQRKTGSGFEASSARA
RNDSLLISGAADTSPLWGNGPGQRDHGEPGVRSPLANRCRGLEGRSAEPAVRALTPAGRN
RVHDVSSRVKAVEAWKPKTYMTSLHELLTQLIGYRLYIFWPKRILLLPMWPREAKRLDTP
GLHSKESTKRSIREKVLEDMP

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_7|966_bp
atgagaggaagattctattatcctggtgtccgaacccaacttccacaacaagccacccag
cccgttcccctctccggatgcacaaggcccaacccaagaactaagacacaaaatggctgc
cggcagcttcgtcacgtgacccctttgtgcctttcgggcccccgaactccgcgggacccg
ggtacaagtggccagggtcagcgggtggccttagaaggggccccgctgggccgcggcggc
cgggggtcgaggctgggggaggcggggtataggcccggccggcgactgagagtcggcggc
gaagagaagatggcggccattttgtgtgggtttgctgctccgccgccgccgcaggcagca
gagggagcgggcctgacggtgggtcggacaggggagttaacacgtaccgactcctcttcc
ttctccattccggtgggcatctgcggcacggccctgtttatcgaggcgtattcgtgcagg
tcgggcagatcctcacagcggaaaaccggcagcggcttcgaggcgtctagcgcccgcgcc
cgaaacgacagtttactcatctcaggcgcagcagatacctctccgctctggggaaacggc
cccggccagcgggatcatggagaaccgggggttcggtccccactcgccaaccgctgccgg
ggacttgaggggcggagcgcagagcccgccgtccgggcactaacaccagccgggaggaac
agagtacatgacgtatcttccagagttaaagctgtagaagcttggaaaccaaagacctac
atgacttctctacatgaattactgacacagctcatcggctatcgtttgtatattttttgg
cccaagaggattctgcttcttccaatgtggcccagggaagccaaaagattggatacccct
ggtttacatagtaaagaaagcactaagagaagtataagagaaaaagttctggaagacatg
ccctag

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_8|117_aa
MREIQSVKDQKYLEELIVLRDNVKEYDILPNSKHYHVFLEQIVDPFQPLDRSSACKPGHV
KTMSIKIFFRFDLMKEVIKIQNKYSTESTPLSLNSGETFGFQIKWAGKFDARSLVVS

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_8|354_bp
atgagagaaattcagtcagtcaaggatcagaaatatctggaagagttgattgttctacga
gacaatgtgaaagaatatgacattctgccaaattcaaagcattatcatgtttttcttgag
cagattgttgatccttttcaaccactcgatagatcctctgcatgcaaacctggccatgta
aaaacgatgtccatcaaaatattttttcgatttgatctcatgaaggaagtgataaaaatc
caaaataaatacagtactgaaagtacaccattgagtctgaattcgggggaaacctttggc
tttcaaattaagtgggctgggaaatttgatgctcggtctctagttgtgagctga

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_9|76_aa
MRPMMASQKSRVVTSRGDRVHGKGSKRMAATGAIATLSTKELPLVDTMEHRTPPRCGDHP
PSLKKGKDTQCEDNEE

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_9|231_bp
atgaggccaatgatggcaagtcaaaaatcaagggtggtcacgtcaagaggtgatcgtgta
catggtaaaggtagcaaacggatggctgccactggggcaatagctacccttagcactaaa
gagctccctttggttgacacaatggaacaccggactccacccaggtgtggtgaccatcct
ccaagcctgaagaagggcaaagatactcaatgtgaggacaatgaggaataa

>gi568815588r:32169066_32446915|GENSCAN_predicted_peptide_10|210_aa
MGPLKPRPRRPTPRRLGAAGVADAASVSCAPRAEEETQFGGCGDLPEGSLSQESLTERAS
HPRPHSHMKEVVIVTEIEATPETKSMDSLSPRNHKAVRRSHSLGSLDDTDEQDETKRGEK
AAEEKLEASRVWFMRFKEKSHRYNVTGQEGSVDVEATASYLKDLESAPRGHCQSLLLAPS
KAAAQTTPGPTKAMAGMAKKHYAEMQEAEF

>gi568815588r:32169066_32446915|GENSCAN_predicted_CDS_10|633_bp
atggggccgttgaagccacgtcccaggcgcccgacgccacgacgccttggcgccgccggc
gtcgctgacgctgcgagcgtgagctgcgcgccgagagcggaagaggagacgcagtttggc
ggctgtggggatttgccggaaggttctctgtctcaggaatctcttactgagagggccagc
caccccagaccacatagccacatgaaagaagtggtcatagtaacagagatagaggctaca
cctgagaccaaaagcatggactccctctcaccaaggaaccacaaggctgtgagaaggagt
cacagtcttggtagtctagatgacactgacgagcaggatgagactaagagaggtgagaaa
gctgcagaagaaaagttggaagctagtagagtttggttcatgaggtttaaggaaaaaagc
catcgctataacgtaacaggtcaagaaggaagtgttgatgtagaagctacagcaagttat
ctgaaagatctagagtcagcaccgcgtggacactgccaaagcttgctacttgcaccttcc
aaagcagcagcacaaaccacacctgggcctactaaagctatggctggcatggccaagaag
cattatgctgagatgcaggaagcggagttctga