GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:21:15 Sequence gi568815588r:121379860_121693817 : 313958 bp : 45.68% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2526 2703 178 2 1 97 68 156 0.479 13.93 1.02 Intr + 31184 31303 120 0 0 69 75 65 0.005 3.77 1.03 Intr + 35545 35620 76 2 1 45 78 57 0.002 -1.03 1.04 Intr + 46632 46765 134 0 2 56 69 73 0.090 2.49 1.05 Intr + 58909 58934 26 2 2 71 92 10 0.172 -2.56 1.06 Intr + 69499 69636 138 0 0 98 94 24 0.520 4.66 1.07 Term + 73402 73608 207 0 0 70 48 92 0.484 0.74 1.08 PlyA + 78221 78226 6 1.05 2.13 PlyA - 78538 78533 6 1.05 2.12 Term - 81076 80957 120 1 0 39 53 127 0.296 2.77 2.11 Intr - 81687 81574 114 0 0 43 55 77 0.213 0.54 2.10 Intr - 86141 86079 63 2 0 89 3 95 0.122 0.01 2.09 Intr - 103944 103839 106 2 1 71 94 69 0.901 6.02 2.08 Intr - 105673 105536 138 0 0 103 65 183 0.992 17.08 2.07 Intr - 107565 107495 71 0 2 78 56 34 0.986 -2.82 2.06 Intr - 108254 108132 123 2 0 80 101 134 0.999 14.68 2.05 Intr - 116863 116673 191 2 2 83 87 234 0.810 22.10 2.04 Intr - 118746 118636 111 1 0 64 101 95 0.995 8.75 2.03 Intr - 121088 120967 122 1 2 44 105 194 0.325 16.84 2.02 Intr - 124082 123931 152 2 2 75 78 128 0.421 9.46 2.01 Init - 127262 127152 111 2 0 57 89 60 0.893 3.24 2.00 Prom - 129592 129553 40 -3.96 3.14 PlyA - 130925 130920 6 1.05 3.13 Term - 134429 134349 81 2 0 96 44 73 0.776 1.39 3.12 Intr - 135460 135264 197 2 2 66 92 287 0.991 26.03 3.11 Intr - 137604 137511 94 0 1 117 58 105 0.862 9.94 3.10 Intr - 140310 140120 191 1 2 110 100 275 0.997 30.20 3.09 Intr - 146335 146296 40 1 1 93 86 29 0.241 1.00 3.08 Intr - 150381 150323 59 1 2 98 97 2 0.206 0.70 3.07 Intr - 158856 158733 124 0 1 115 92 128 0.642 16.06 3.06 Intr - 171600 171431 170 1 2 61 70 213 0.851 16.47 3.05 Intr - 184720 184643 78 2 0 34 89 127 0.940 6.92 3.04 Intr - 185845 185579 267 2 0 97 109 263 0.994 26.80 3.03 Intr - 200297 200112 186 0 0 122 83 51 0.826 7.66 3.02 Intr - 200654 200592 63 0 0 79 85 43 0.332 1.79 3.01 Init - 213958 213850 109 1 1 95 115 118 0.981 13.69 3.00 Prom - 214337 214298 40 -10.94 4.00 Prom + 214466 214505 40 -7.36 4.01 Init + 217161 217679 519 2 0 61 64 194 0.409 9.17 4.02 Intr + 218269 218478 210 0 0 42 101 125 0.454 8.31 4.03 Intr + 228919 229407 489 0 0 80 -5 212 0.397 4.30 4.04 Intr + 232674 232754 81 2 0 82 74 39 0.649 1.73 4.05 Intr + 235044 235128 85 2 1 89 100 47 0.873 5.39 4.06 Intr + 236521 236675 155 2 2 29 61 85 0.185 -0.31 4.07 Intr + 242344 242439 96 0 0 67 105 50 0.710 4.81 4.08 Term + 243188 243613 426 1 0 91 47 106 0.821 2.10 4.09 PlyA + 244761 244766 6 1.05 5.06 PlyA - 246128 246123 6 1.05 5.05 Term - 253112 252957 156 2 0 66 45 86 0.147 0.03 5.04 Intr - 259785 259696 90 0 0 93 36 58 0.157 1.29 5.03 Intr - 261681 261611 71 1 2 108 84 27 0.733 3.20 5.02 Intr - 265339 265191 149 0 2 67 75 43 0.065 0.78 5.01 Init - 275800 275745 56 1 2 77 71 73 0.125 3.29 5.00 Prom - 276955 276916 40 -4.46 6.00 Prom + 278996 279035 40 -4.26 6.01 Init + 281052 281215 164 2 2 64 80 96 0.617 5.66 6.02 Intr + 293448 293505 58 0 1 93 110 1 0.364 1.69 6.03 Intr + 296913 297195 283 2 1 70 7 191 0.498 6.19 6.04 Intr + 297544 297639 96 2 0 94 82 34 0.639 3.38 6.05 Intr + 305610 305857 248 1 2 21 64 117 0.091 -0.22 6.06 Term + 307710 308009 300 2 0 97 38 88 0.351 -0.08 6.07 PlyA + 308765 308770 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:121379860_121693817|GENSCAN_predicted_peptide_1|292_aa MPERPHGSVGSCAAEASQTNTAPCSTVPSPIDHPRAEECGRTAQDWQAAPPAAPVRDPLA MCSKRKTIPDRGGPIFSQGYRSSYAPLLEPPNKLSRKGSVLCQVTNLSFQPLKSKTRSCM PCTNFQSDASMCDEESGSKAEVMILLTLPTDKNWLGAFSLSAAGVSVRLRHRICSVAELS DAPDPPTAHLQQEILGFTILKRSLKKSPSYLGAEVVSLASPNILPHEHIVTNHLSAAFRP SLLTQPYTLARDLDTDQSLGICMQCKQLQMCNQRRTHQSTLGTQQRTESSCF >gi568815588r:121379860_121693817|GENSCAN_predicted_CDS_1|879_bp atgcctgagcgtccccatggctccgtgggctcctgtgcggctgaagcctcccagacgaac accgccccctgctccaccgtgcccagtcccatcgaccacccaagggctgaggagtgtggg cgcacagcgcaggactggcaggcagctccacctgcagccccagtgcgggatccactggct atgtgctccaagaggaagaccatccctgatagaggaggaccaatcttcagtcaagggtat aggagtagctatgctcccctgctagaacctccgaacaagctctcaagaaaaggctcagtt ctttgccaggtcaccaacttgtccttccagccactcaagagtaaaaccagaagctgtatg ccctgcaccaatttccaatcagatgcaagcatgtgtgatgaggaaagtggaagtaaggct gaagtaatgattctgttgactctccctactgataagaattggcttggagcatttagcctc tcggcagcaggtgttagtgtccggcttcgccatcggatctgctctgtagctgagctctca gatgctccagacccacctacagctcatctccagcaggagatcttgggttttacgatttta aaacgtagcttgaaaaagtctccatcttatctcggagcagaagtggtgtcacttgccagc ccaaatattctacctcatgaacacattgttaccaaccatctttctgctgcttttcgaccc tctctgctcactcaaccctacaccctagccagagatctggatacagatcagagcctgggc atctgtatgcaatgtaaacagctccagatgtgcaaccagaggagaacgcaccagtccact ctgggcacccagcaaagaacagagagttcctgcttttaa >gi568815588r:121379860_121693817|GENSCAN_predicted_peptide_2|473_aa MSTGRAATGLAVAHIEGCRDATKPSLVAEGRAESSDLVSAESSSSMNSNTPLVRITTRLS STADTPMLAGVSEYELPEDPKWEFPRDKLTLGKPLGEGCFGQVVMAEAVGIDKDKPKEAV TVAVKMLKDDATEKDLSDLVSEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGNL REYLRARRPPGMEYSYDINRVPEEQMTFKDLVSCTYQLARGMEYLASQKCIHRDLAARNV LVTENNVMKIADFGLARDINNIDYYKKTTNGRLPVKWMAPEALFDRVYTHQSDVWSFGVL MWEIFTLGGSPYPGIPVEELFKLLKEGHRMDKPANCTNELYMMMRDCWHAVPSQRPTFKQ LVEDLDRILTLTTNEPPDSIHGNYKTVGIDPFLPQPGRSRRQRKRKGHISPFIIIIVIII IIISSLSLPGSCFEGWSQYLGQANDQALAWSEMGMYPKSVQSEDFFLGIQDNE >gi568815588r:121379860_121693817|GENSCAN_predicted_CDS_2|1422_bp atgagcacaggaagggcagcaacgggattggctgttgcccacatcgaaggatgcagagat gccacaaaacctagcctggtagcagaaggaagggccgaaagcagtgatcttgtttcggct gagtccagctcctccatgaactccaacaccccgctggtgaggataacaacacgcctctct tcaacggcagacacccccatgctggcaggggtctccgagtatgaacttccagaggaccca aaatgggagtttccaagagataagctgacactgggcaagcccctgggagaaggttgcttt gggcaagtggtcatggcggaagcagtgggaattgacaaagacaagcccaaggaggcggtc accgtggccgtgaagatgttgaaagatgatgccacagagaaagacctttctgatctggtg tcagagatggagatgatgaagatgattgggaaacacaagaatatcataaatcttcttgga gcctgcacacaggatgggcctctctatgtcatagttgagtatgcctctaaaggcaacctc cgagaatacctccgagcccggaggccacccgggatggagtactcctatgacattaaccgt gttcctgaggagcagatgaccttcaaggacttggtgtcatgcacctaccagctggccaga ggcatggagtacttggcttcccaaaaatgtattcatcgagatttagcagccagaaatgtt ttggtaacagaaaacaatgtgatgaaaatagcagactttggactcgccagagatatcaac aatatagactattacaaaaagaccaccaatgggcggcttccagtcaagtggatggctcca gaagccctgtttgatagagtatacactcatcagagtgatgtctggtccttcggggtgtta atgtgggagatcttcactttagggggctcgccctacccagggattcccgtggaggaactt tttaagctgctgaaggaaggacacagaatggataagccagccaactgcaccaacgaactg tacatgatgatgagggactgttggcatgcagtgccctcccagagaccaacgttcaagcag ttggtagaagacttggatcgaattctcactctcacaaccaatgagcccccagacagcatc cacgggaactacaagactgtgggcattgacccttttcttccccagccaggaaggtcaagg cgtcagaggaaaaggaagggacacattagccccttcatcatcatcattgtcatcatcatc atcatcatctcatcactttcattacctgggagttgctttgagggatggagccagtacctg ggccaggccaatgaccaggctctggcctggtcggaaatgggcatgtaccctaagtcagtc caatcagaggactttttcctggggatacaggacaatgagtga >gi568815588r:121379860_121693817|GENSCAN_predicted_peptide_3|552_aa MVSWGRFICLVVVTMATLSLARPSFSLVEDTTLEPEGHNSVPSGVCTGFEMLARPELALA TLKVFLYSESAQCGRACSMGLGENVKRDLDIYLTFQGHTGQTYPLEYAPFIECLHQYQGE PPTKYQISQPEVYVAAPGESLEVRCLLKDAAVISWTKDGVHLGPNNRTVLIGEYLQIKGA TPRDSGLYACTASRTVDSETWYFMVNVTDAISSGDDEDDTDGAEDFVSENSNNKRAPYWT NTEKMEKRLHAVPAANTVKFRCPAGGNPMPTMRWLKNGKEFKQEHRIGGYKVRNQHWSLI MESVVPSDKGNYTCVVENEYGSINHTYHLDVVGDPAVVFMVFMLGLFSQSEKLFGVAKRL PFWRKERSPHRPILQAGLPANASTVVGGDVEFVCKVYSDAQPHIQWIKHVEKNGSKYGPD GLPYLKVLKAAGVNTTDKEIEVLYIRNVTFEDAGEYTCLAAPGREKEITASPDYLEIAIY CIGVFLIACMVVTVILCRMKNTTKKPDFSSQPAVHKLTKRIPLRRQRWKRKADVDAERLQ KHLKLGSDKGFF >gi568815588r:121379860_121693817|GENSCAN_predicted_CDS_3|1659_bp atggtcagctggggtcgtttcatctgcctggtcgtggtcaccatggcaaccttgtccctg gcccggccctccttcagtttagttgaggataccacattagagccagaaggccacaactct gtgccttcaggcgtctgcacgggtttcgagatgctggccaggcctgaacttgccttggcc actttaaaagtatttctttattcagaaagtgcgcagtgtgggagggcctgctctatgggc ttgggggaaaatgtcaaacgggatctggacatctatctgacctttcagggccatacaggg caaacgtatccgctggagtatgcaccatttattgaatgtttacatcaatatcagggagag ccaccaaccaaataccaaatctctcaaccagaagtgtacgtggctgcgccaggggagtcg ctagaggtgcgctgcctgttgaaagatgccgccgtgatcagttggactaaggatggggtg cacttggggcccaacaataggacagtgcttattggggagtacttgcagataaagggcgcc acgcctagagactccggcctctatgcttgtactgccagtaggactgtagacagtgaaact tggtacttcatggtgaatgtcacagatgccatctcatccggagatgatgaggatgacacc gatggtgcggaagattttgtcagtgagaacagtaacaacaagagagcaccatactggacc aacacagaaaagatggaaaagcggctccatgctgtgcctgcggccaacactgtcaagttt cgctgcccagccggggggaacccaatgccaaccatgcggtggctgaaaaacgggaaggag tttaagcaggagcatcgcattggaggctacaaggtacgaaaccagcactggagcctcatt atggaaagtgtggtcccatctgacaagggaaattatacctgtgtagtggagaatgaatac gggtccatcaatcacacgtaccacctggatgttgtgggagacccggctgttgtattcatg gtcttcatgcttggtttgttttcacagtcagagaagctctttggcgttgctaagagactg ccattttggaggaaagagcgatcgcctcaccggcccatcctccaagccggactgccggca aatgcctccacagtggtcggaggagacgtagagtttgtctgcaaggtttacagtgatgcc cagccccacatccagtggatcaagcacgtggaaaagaacggcagtaaatacgggcccgac gggctgccctacctcaaggttctcaaggccgccggtgttaacaccacggacaaagagatt gaggttctctatattcggaatgtaacttttgaggacgctggggaatatacgtgcttggcg gcgcctggaagagaaaaggagattacagcttccccagactacctggagatagccatttac tgcataggggtcttcttaatcgcctgtatggtggtaacagtcatcctgtgccgaatgaag aacacgaccaagaagccagacttcagcagccagccggctgtgcacaagctgaccaaacgt atccccctgcggagacagagatggaagcggaaggcagatgtagatgcagaacgtttacaa aagcatttgaaacttggttctgataaaggtttcttttga >gi568815588r:121379860_121693817|GENSCAN_predicted_peptide_4|686_aa MGALSLKAGLPLRVGFIELGERKEGLENGLQGSGVRGERRRLVVPERASCARSVVREGRS QRHAGPSAFFPSAAPWPKKMSAQVRTRLRFRHLPAAAAEQAGELEKVGPVRPLAPGALRS LQRRRRLRARGAQAPEPSPPQQCHPWPSPPLRDVTGPTPAAPSRKRGLRRAWMAAEERGH DARGLPSDLGNERKKGLRLGVASTKLCSRVAKAQSAQASCGLGAPGARGRPPPARRAVAA PGQGGAHRAPAPPCGREHRARKAGASAARGVGDGAGIKVEGGRAGRRGSPASAEPSPEVV SAVQMMSDPGRGESGAQVLAQAQLDEMSVTPTAGARDLAGTWCCSGLCLGLACGYLRLIL HSLRGFAPGRRPHVSSLWINKAENHLCTLAGKPAPTSGPLLGLLALLPVMSLSYLDSVYP SSKFATNSFSFREPIQICILAILQVPPALGEAALTCCTHRQGCGGGRVQAADPSSTVASR TQEAHGASHVHDHLQLIPEAATSTCQVQPSQEQEYDWLWHYSDEPSSPSMKEEMQKSICP CNLKKVDSHIPKQDSIHTSVCAQSTWGPCGNVEADSVVLVWGLRFCMSTGSQVMWGHAFY SVGPQNMDYSLPSPDDGDSPLPQGWHPREAEEGACKDLQNFRGWRSLTRPMSSCIAVPCK PNQVALAGTAVELRGVSTSCQGCQET >gi568815588r:121379860_121693817|GENSCAN_predicted_CDS_4|2061_bp atgggtgccctgagtttgaaagcaggtttgcctttaagggtaggttttattgaattggga gaaaggaaggagggtctggagaacggtctgcaagggagcggggtgcgaggagagcgacgg aggctggtggtccccgagcgggcctcctgcgccaggtccgtggtccgcgagggtcggagc cagcgccacgcagggccctcggcttttttccccagcgcggccccgtggccgaaaaaaatg agcgcgcaagttagaactcgcctgcgctttcgacatctcccagccgctgcggcagagcag gcaggcgaactagaaaaagtcggtccagttcgccccctagccccaggcgcactgcgctcc ctccagcgccgacggcggctgcgggcgaggggcgcccaggcaccggagccgtcgcctccc cagcagtgccacccctggccatcgccacccctacgggatgtcaccggccccaccccagcg gccccctcccgaaaaaggggtctccgcagggcctggatggctgcggaggagcgcgggcat gacgcccgcgggctgccctcggatttggggaacgagaggaagaaaggactcaggcttggc gttgcctccaccaaactttgctcgcgagttgcgaaggctcagagcgcgcaggcaagctgc ggcctcggggcccccggggctcgcggccggcccccgccagcccggagagcagtcgccgcg ccgggccagggaggagcgcatcgggcgccagcgcccccgtgtggccgcgagcaccgcgcg cgcaaagcgggggcgagcgccgcccggggcgtgggtgacggggctggcatcaaggtagag ggcggccgggcagggaggaggggcagccccgccagcgccgagcccagccctgaagtcgtt tcggctgtgcagatgatgagcgacccaggtcgaggcgaatcaggagcccaggtgcttgcc caagcacagctggacgaaatgtctgttaccccgacggctggggcgcgggaccttgcaggg acgtggtgctgcagcggcctctgcctgggcctggcgtgcggctacctcaggctaattctc cattcgctgcgtggattcgcccccgggaggcggccccacgtttccagcctgtggatcaac aaggcggagaaccacctttgcaccctcgccgggaaaccagctcctacctccggccccctg ttgggccttctggccctgctccctgtcatgtcactctcctatttggactcagtttaccca tcgtccaaatttgccacaaattccttcagcttcagagaaccaatccaaatctgcatcctg gccatccttcaagtcccacctgctctgggagaggccgccctcacgtgctgcacccatcgg caaggatgtgggggagggagagtccaagcagcagatccttctagtacagtcgccagcaga acccaggaggcccacggtgcaagccacgtacatgatcatcttcaactgatcccggaggcg gccacaagcacctgccaggtgcagcccagccaagagcaagaatatgactggctttggcat tactctgatgaaccatcaagccccagcatgaaagaggagatgcagaagagcatttgtcca tgcaatctaaaaaaggtggacagccacatcccaaagcaggactccattcatacttcagtg tgtgcacagagcacttggggaccttgtggaaatgtagaagctgactcagtagttctggtg tggggcctgagattctgcatgtctaccggttcccaggtgatgtggggccatgctttttac agtgtaggtccacagaacatggactactcattgccaagcccagacgatggtgatagcccc ttgccccaaggatggcacccgagggaagctgaggagggagcctgcaaagacctgcagaac ttccgaggttggagaagcctgaccagacccatgagcagctgcatagctgtgccttgcaaa cccaaccaagtggctttagcaggaactgcggtagagctcagaggtgtctcaaccagctgc cagggctgccaggaaacttaa >gi568815588r:121379860_121693817|GENSCAN_predicted_peptide_5|173_aa MSLRSALLGGHSTGLGSGSGFEDEAKFSLMISQRELIATLSDLAATSFINECFLNLNVLT DHLPILSQVVMSESGHNHPMVYLVVVFRFFKSPPQALGLQLYLRSNDYDLKEAPESSEKL CSCALPCVTVIDKQVSSPSWPANPLTASLIFGSLEHPTQCLQNEGLAGHMRPH >gi568815588r:121379860_121693817|GENSCAN_predicted_CDS_5|522_bp atgtccctccgctccgcactgcttggcggtcacagcactggcctgggatctggcagtggc tttgaggatgaagccaagttctccttaatgatatcacagcgggagttgattgctaccttg agtgatttggctgcaacaagcttcataaatgagtgtttcttaaacttgaatgtgcttaca gatcacctgccgatcttgtcacaagttgtaatgtctgaatcagggcacaaccacccaatg gtgtatttggttgttgtgtttcgtttcttcaaatctcctcctcaagccctggggctacag ctttatctaaggagtaatgactatgatctcaaggaggcccctgagtcatcagaaaagtta tgttcatgtgcattgccatgtgttacagttattgataaacaggtgtcatctccctcctgg cctgcaaacccactgacggccagtctcatctttggatcccttgagcacccaacgcagtgc ctgcagaatgagggccttgcaggacacatgcgaccacactga >gi568815588r:121379860_121693817|GENSCAN_predicted_peptide_6|382_aa MNDKKCLLHVRPGVQTSASGFQSREQFLWGKDVPKIFTYKKSWTWVLKDHWYSERPQTAV FTGSCCPGKPLACQPSFQEPMLLMHEQVQGSGGESEPWAKGLLIIINAVVFAVFVQGKTT LWKNITGKGGRDHNRSQDVLPDCVLRNAHIPGSTLLICSPAQEALAYSQHYLPTQRVLPV VLNAFINMHLEHQQFSALRKSHSLGELVSGPELKNVSGITGDVTFVLDLEHGWDLVGIDG DEARLQGVEESWGWSWGRCMRLSGRKLVRKEDSLGSPPEGPIQDASSDERGNFDPALTQP STGKEKPRRHLEPVKPEPAMGGTESAECRLLWKWKEIVAWGLLNPPEGSILRTIMPSGSA LAPEAQRNPGCAVFISEVLEHY >gi568815588r:121379860_121693817|GENSCAN_predicted_CDS_6|1149_bp atgaatgacaagaagtgtctgctgcatgtcagaccgggggtccagacatctgcctccgga tttcagagcagggagcagtttctgtggggcaaagatgtcccaaaaatcttcacatacaag aagtcctggacctgggttttgaaggatcactggtattcagagaggcctcagactgctgtc ttcactggctcctgctgcccaggaaaaccccttgcatgccagccaagcttccaggagcct atgctgctcatgcatgaacaggttcaagggagtggaggcgaatctgagccctgggcaaaa gggcttctcataataattaatgcggtagtttttgcagtcttcgtgcaaggtaaaaccacc ctttggaagaacatcactggcaaaggtggccgtgaccataatagatctcaagatgttctt cctgactgtgttcttcgaaatgcccatattccagggtcaacgctgctcatatgctcacca gcccaagaagcacttgcctactcacagcactatctcccaactcaaagggttcttcctgtt gtcttgaatgccttcattaacatgcacctggaacatcaacagttttcagcattaaggaaa tctcacagtctaggagagcttgtttctggcccagagttgaagaatgtttcaggaatcaca ggagatgtgacatttgtgctggatctggaacatgggtgggatctggtgggcatagatgga gatgaagcccggctgcagggagtcgaggagtcctggggctggagctggggaaggtgcatg cgtctgagtggccggaaacttgtcagaaaggaagattccctgggttcacccccagagggt cccattcaggatgcaagctctgatgaaagggggaactttgatcctgctctcactcagcca tccacagggaaggaaaagccaaggagacacctagagccagtaaagccggagcctgccatg ggtggaacagagtcagcagaatgcaggctcctctggaagtggaaagagattgttgcctgg ggcttactgaaccctccagaaggttccattttaagaaccatcatgccttcaggttctgct ttggccccagaagcccagagaaaccctggctgtgcagtcttcatttcagaggttctcgag cactactaa