GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:45:48 Sequence gi568815588r:121383698_121693817 : 310120 bp : 45.67% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 584 624 41 1 2 73 117 24 0.104 3.46 1.02 Intr + 21880 22040 161 1 2 115 70 -2 0.001 0.33 1.03 Intr + 27346 27465 120 2 0 69 75 65 0.005 3.77 1.04 Intr + 31707 31782 76 1 1 45 78 57 0.002 -1.03 1.05 Intr + 42794 42927 134 2 2 56 69 73 0.090 2.49 1.06 Intr + 55071 55096 26 1 2 71 92 10 0.172 -2.56 1.07 Intr + 65661 65798 138 2 0 98 94 24 0.520 4.66 1.08 Term + 69564 69770 207 2 0 70 48 92 0.484 0.74 1.09 PlyA + 74383 74388 6 1.05 2.13 PlyA - 74700 74695 6 1.05 2.12 Term - 77238 77119 120 0 0 39 53 127 0.296 2.77 2.11 Intr - 77849 77736 114 2 0 43 55 77 0.213 0.54 2.10 Intr - 82303 82241 63 1 0 89 3 95 0.122 0.01 2.09 Intr - 100106 100001 106 1 1 71 94 69 0.901 6.02 2.08 Intr - 101835 101698 138 2 0 103 65 183 0.992 17.08 2.07 Intr - 103727 103657 71 2 2 78 56 34 0.986 -2.82 2.06 Intr - 104416 104294 123 1 0 80 101 134 0.999 14.68 2.05 Intr - 113025 112835 191 1 2 83 87 234 0.810 22.10 2.04 Intr - 114908 114798 111 0 0 64 101 95 0.995 8.75 2.03 Intr - 117250 117129 122 0 2 44 105 194 0.325 16.84 2.02 Intr - 120244 120093 152 1 2 75 78 128 0.421 9.46 2.01 Init - 123424 123314 111 1 0 57 89 60 0.893 3.24 2.00 Prom - 125754 125715 40 -3.96 3.14 PlyA - 127087 127082 6 1.05 3.13 Term - 130591 130511 81 1 0 96 44 73 0.776 1.39 3.12 Intr - 131622 131426 197 1 2 66 92 287 0.991 26.03 3.11 Intr - 133766 133673 94 2 1 117 58 105 0.862 9.94 3.10 Intr - 136472 136282 191 0 2 110 100 275 0.997 30.20 3.09 Intr - 142497 142458 40 0 1 93 86 29 0.241 1.00 3.08 Intr - 146543 146485 59 0 2 98 97 2 0.206 0.70 3.07 Intr - 155018 154895 124 2 1 115 92 128 0.642 16.06 3.06 Intr - 167762 167593 170 0 2 61 70 213 0.851 16.47 3.05 Intr - 180882 180805 78 1 0 34 89 127 0.940 6.92 3.04 Intr - 182007 181741 267 1 0 97 109 263 0.994 26.80 3.03 Intr - 196459 196274 186 2 0 122 83 51 0.826 7.66 3.02 Intr - 196816 196754 63 2 0 79 85 43 0.332 1.79 3.01 Init - 210120 210012 109 0 1 95 115 118 0.981 13.69 3.00 Prom - 210499 210460 40 -10.94 4.00 Prom + 210628 210667 40 -7.36 4.01 Init + 213323 213841 519 1 0 61 64 194 0.409 9.17 4.02 Intr + 214431 214640 210 2 0 42 101 125 0.454 8.31 4.03 Intr + 225081 225569 489 2 0 80 -5 212 0.397 4.30 4.04 Intr + 228836 228916 81 1 0 82 74 39 0.649 1.73 4.05 Intr + 231206 231290 85 1 1 89 100 47 0.873 5.39 4.06 Intr + 232683 232837 155 1 2 29 61 85 0.185 -0.31 4.07 Intr + 238506 238601 96 2 0 67 105 50 0.710 4.81 4.08 Term + 239350 239775 426 0 0 91 47 106 0.821 2.10 4.09 PlyA + 240923 240928 6 1.05 5.06 PlyA - 242290 242285 6 1.05 5.05 Term - 249274 249119 156 1 0 66 45 86 0.147 0.03 5.04 Intr - 255947 255858 90 2 0 93 36 58 0.157 1.29 5.03 Intr - 257843 257773 71 0 2 108 84 27 0.733 3.20 5.02 Intr - 261501 261353 149 2 2 67 75 43 0.065 0.78 5.01 Init - 271962 271907 56 0 2 77 71 73 0.125 3.29 5.00 Prom - 273117 273078 40 -4.46 6.00 Prom + 275158 275197 40 -4.26 6.01 Init + 277214 277377 164 1 2 64 80 96 0.617 5.66 6.02 Intr + 289610 289667 58 2 1 93 110 1 0.364 1.69 6.03 Intr + 293075 293357 283 1 1 70 7 191 0.498 6.19 6.04 Intr + 293706 293801 96 1 0 94 82 34 0.639 3.38 6.05 Intr + 301772 302019 248 0 2 21 64 117 0.091 -0.22 6.06 Term + 303872 304171 300 1 0 97 38 88 0.351 -0.08 6.07 PlyA + 304927 304932 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:121383698_121693817|GENSCAN_predicted_peptide_1|300_aa MRSLVSDSQDAVTRLYHHMSLLKPGEHPQVVGPCAPDFLKPQICGICSPEPTYPSSPAVS TPQLSRLTMCSKRKTIPDRGGPIFSQGYRSSYAPLLEPPNKLSRKGSVLCQVTNLSFQPL KSKTRSCMPCTNFQSDASMCDEESGSKAEVMILLTLPTDKNWLGAFSLSAAGVSVRLRHR ICSVAELSDAPDPPTAHLQQEILGFTILKRSLKKSPSYLGAEVVSLASPNILPHEHIVTN HLSAAFRPSLLTQPYTLARDLDTDQSLGICMQCKQLQMCNQRRTHQSTLGTQQRTESSCF >gi568815588r:121383698_121693817|GENSCAN_predicted_CDS_1|903_bp atgaggagtctggtgtctgacagtcaggatgcagtaacccgattgtatcaccatatgtct ctcttgaaacctggagaacatccacaagtggtaggcccctgtgctcctgatttcctaaag ccccagatttgtggaatatgcagcccagagccaacctaccccagctccccagcagtcagc actccacagctgtctaggctcactatgtgctccaagaggaagaccatccctgatagagga ggaccaatcttcagtcaagggtataggagtagctatgctcccctgctagaacctccgaac aagctctcaagaaaaggctcagttctttgccaggtcaccaacttgtccttccagccactc aagagtaaaaccagaagctgtatgccctgcaccaatttccaatcagatgcaagcatgtgt gatgaggaaagtggaagtaaggctgaagtaatgattctgttgactctccctactgataag aattggcttggagcatttagcctctcggcagcaggtgttagtgtccggcttcgccatcgg atctgctctgtagctgagctctcagatgctccagacccacctacagctcatctccagcag gagatcttgggttttacgattttaaaacgtagcttgaaaaagtctccatcttatctcgga gcagaagtggtgtcacttgccagcccaaatattctacctcatgaacacattgttaccaac catctttctgctgcttttcgaccctctctgctcactcaaccctacaccctagccagagat ctggatacagatcagagcctgggcatctgtatgcaatgtaaacagctccagatgtgcaac cagaggagaacgcaccagtccactctgggcacccagcaaagaacagagagttcctgcttt taa >gi568815588r:121383698_121693817|GENSCAN_predicted_peptide_2|473_aa MSTGRAATGLAVAHIEGCRDATKPSLVAEGRAESSDLVSAESSSSMNSNTPLVRITTRLS STADTPMLAGVSEYELPEDPKWEFPRDKLTLGKPLGEGCFGQVVMAEAVGIDKDKPKEAV TVAVKMLKDDATEKDLSDLVSEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGNL REYLRARRPPGMEYSYDINRVPEEQMTFKDLVSCTYQLARGMEYLASQKCIHRDLAARNV LVTENNVMKIADFGLARDINNIDYYKKTTNGRLPVKWMAPEALFDRVYTHQSDVWSFGVL MWEIFTLGGSPYPGIPVEELFKLLKEGHRMDKPANCTNELYMMMRDCWHAVPSQRPTFKQ LVEDLDRILTLTTNEPPDSIHGNYKTVGIDPFLPQPGRSRRQRKRKGHISPFIIIIVIII IIISSLSLPGSCFEGWSQYLGQANDQALAWSEMGMYPKSVQSEDFFLGIQDNE >gi568815588r:121383698_121693817|GENSCAN_predicted_CDS_2|1422_bp atgagcacaggaagggcagcaacgggattggctgttgcccacatcgaaggatgcagagat gccacaaaacctagcctggtagcagaaggaagggccgaaagcagtgatcttgtttcggct gagtccagctcctccatgaactccaacaccccgctggtgaggataacaacacgcctctct tcaacggcagacacccccatgctggcaggggtctccgagtatgaacttccagaggaccca aaatgggagtttccaagagataagctgacactgggcaagcccctgggagaaggttgcttt gggcaagtggtcatggcggaagcagtgggaattgacaaagacaagcccaaggaggcggtc accgtggccgtgaagatgttgaaagatgatgccacagagaaagacctttctgatctggtg tcagagatggagatgatgaagatgattgggaaacacaagaatatcataaatcttcttgga gcctgcacacaggatgggcctctctatgtcatagttgagtatgcctctaaaggcaacctc cgagaatacctccgagcccggaggccacccgggatggagtactcctatgacattaaccgt gttcctgaggagcagatgaccttcaaggacttggtgtcatgcacctaccagctggccaga ggcatggagtacttggcttcccaaaaatgtattcatcgagatttagcagccagaaatgtt ttggtaacagaaaacaatgtgatgaaaatagcagactttggactcgccagagatatcaac aatatagactattacaaaaagaccaccaatgggcggcttccagtcaagtggatggctcca gaagccctgtttgatagagtatacactcatcagagtgatgtctggtccttcggggtgtta atgtgggagatcttcactttagggggctcgccctacccagggattcccgtggaggaactt tttaagctgctgaaggaaggacacagaatggataagccagccaactgcaccaacgaactg tacatgatgatgagggactgttggcatgcagtgccctcccagagaccaacgttcaagcag ttggtagaagacttggatcgaattctcactctcacaaccaatgagcccccagacagcatc cacgggaactacaagactgtgggcattgacccttttcttccccagccaggaaggtcaagg cgtcagaggaaaaggaagggacacattagccccttcatcatcatcattgtcatcatcatc atcatcatctcatcactttcattacctgggagttgctttgagggatggagccagtacctg ggccaggccaatgaccaggctctggcctggtcggaaatgggcatgtaccctaagtcagtc caatcagaggactttttcctggggatacaggacaatgagtga >gi568815588r:121383698_121693817|GENSCAN_predicted_peptide_3|552_aa MVSWGRFICLVVVTMATLSLARPSFSLVEDTTLEPEGHNSVPSGVCTGFEMLARPELALA TLKVFLYSESAQCGRACSMGLGENVKRDLDIYLTFQGHTGQTYPLEYAPFIECLHQYQGE PPTKYQISQPEVYVAAPGESLEVRCLLKDAAVISWTKDGVHLGPNNRTVLIGEYLQIKGA TPRDSGLYACTASRTVDSETWYFMVNVTDAISSGDDEDDTDGAEDFVSENSNNKRAPYWT NTEKMEKRLHAVPAANTVKFRCPAGGNPMPTMRWLKNGKEFKQEHRIGGYKVRNQHWSLI MESVVPSDKGNYTCVVENEYGSINHTYHLDVVGDPAVVFMVFMLGLFSQSEKLFGVAKRL PFWRKERSPHRPILQAGLPANASTVVGGDVEFVCKVYSDAQPHIQWIKHVEKNGSKYGPD GLPYLKVLKAAGVNTTDKEIEVLYIRNVTFEDAGEYTCLAAPGREKEITASPDYLEIAIY CIGVFLIACMVVTVILCRMKNTTKKPDFSSQPAVHKLTKRIPLRRQRWKRKADVDAERLQ KHLKLGSDKGFF >gi568815588r:121383698_121693817|GENSCAN_predicted_CDS_3|1659_bp atggtcagctggggtcgtttcatctgcctggtcgtggtcaccatggcaaccttgtccctg gcccggccctccttcagtttagttgaggataccacattagagccagaaggccacaactct gtgccttcaggcgtctgcacgggtttcgagatgctggccaggcctgaacttgccttggcc actttaaaagtatttctttattcagaaagtgcgcagtgtgggagggcctgctctatgggc ttgggggaaaatgtcaaacgggatctggacatctatctgacctttcagggccatacaggg caaacgtatccgctggagtatgcaccatttattgaatgtttacatcaatatcagggagag ccaccaaccaaataccaaatctctcaaccagaagtgtacgtggctgcgccaggggagtcg ctagaggtgcgctgcctgttgaaagatgccgccgtgatcagttggactaaggatggggtg cacttggggcccaacaataggacagtgcttattggggagtacttgcagataaagggcgcc acgcctagagactccggcctctatgcttgtactgccagtaggactgtagacagtgaaact tggtacttcatggtgaatgtcacagatgccatctcatccggagatgatgaggatgacacc gatggtgcggaagattttgtcagtgagaacagtaacaacaagagagcaccatactggacc aacacagaaaagatggaaaagcggctccatgctgtgcctgcggccaacactgtcaagttt cgctgcccagccggggggaacccaatgccaaccatgcggtggctgaaaaacgggaaggag tttaagcaggagcatcgcattggaggctacaaggtacgaaaccagcactggagcctcatt atggaaagtgtggtcccatctgacaagggaaattatacctgtgtagtggagaatgaatac gggtccatcaatcacacgtaccacctggatgttgtgggagacccggctgttgtattcatg gtcttcatgcttggtttgttttcacagtcagagaagctctttggcgttgctaagagactg ccattttggaggaaagagcgatcgcctcaccggcccatcctccaagccggactgccggca aatgcctccacagtggtcggaggagacgtagagtttgtctgcaaggtttacagtgatgcc cagccccacatccagtggatcaagcacgtggaaaagaacggcagtaaatacgggcccgac gggctgccctacctcaaggttctcaaggccgccggtgttaacaccacggacaaagagatt gaggttctctatattcggaatgtaacttttgaggacgctggggaatatacgtgcttggcg gcgcctggaagagaaaaggagattacagcttccccagactacctggagatagccatttac tgcataggggtcttcttaatcgcctgtatggtggtaacagtcatcctgtgccgaatgaag aacacgaccaagaagccagacttcagcagccagccggctgtgcacaagctgaccaaacgt atccccctgcggagacagagatggaagcggaaggcagatgtagatgcagaacgtttacaa aagcatttgaaacttggttctgataaaggtttcttttga >gi568815588r:121383698_121693817|GENSCAN_predicted_peptide_4|686_aa MGALSLKAGLPLRVGFIELGERKEGLENGLQGSGVRGERRRLVVPERASCARSVVREGRS QRHAGPSAFFPSAAPWPKKMSAQVRTRLRFRHLPAAAAEQAGELEKVGPVRPLAPGALRS LQRRRRLRARGAQAPEPSPPQQCHPWPSPPLRDVTGPTPAAPSRKRGLRRAWMAAEERGH DARGLPSDLGNERKKGLRLGVASTKLCSRVAKAQSAQASCGLGAPGARGRPPPARRAVAA PGQGGAHRAPAPPCGREHRARKAGASAARGVGDGAGIKVEGGRAGRRGSPASAEPSPEVV SAVQMMSDPGRGESGAQVLAQAQLDEMSVTPTAGARDLAGTWCCSGLCLGLACGYLRLIL HSLRGFAPGRRPHVSSLWINKAENHLCTLAGKPAPTSGPLLGLLALLPVMSLSYLDSVYP SSKFATNSFSFREPIQICILAILQVPPALGEAALTCCTHRQGCGGGRVQAADPSSTVASR TQEAHGASHVHDHLQLIPEAATSTCQVQPSQEQEYDWLWHYSDEPSSPSMKEEMQKSICP CNLKKVDSHIPKQDSIHTSVCAQSTWGPCGNVEADSVVLVWGLRFCMSTGSQVMWGHAFY SVGPQNMDYSLPSPDDGDSPLPQGWHPREAEEGACKDLQNFRGWRSLTRPMSSCIAVPCK PNQVALAGTAVELRGVSTSCQGCQET >gi568815588r:121383698_121693817|GENSCAN_predicted_CDS_4|2061_bp atgggtgccctgagtttgaaagcaggtttgcctttaagggtaggttttattgaattggga gaaaggaaggagggtctggagaacggtctgcaagggagcggggtgcgaggagagcgacgg aggctggtggtccccgagcgggcctcctgcgccaggtccgtggtccgcgagggtcggagc cagcgccacgcagggccctcggcttttttccccagcgcggccccgtggccgaaaaaaatg agcgcgcaagttagaactcgcctgcgctttcgacatctcccagccgctgcggcagagcag gcaggcgaactagaaaaagtcggtccagttcgccccctagccccaggcgcactgcgctcc ctccagcgccgacggcggctgcgggcgaggggcgcccaggcaccggagccgtcgcctccc cagcagtgccacccctggccatcgccacccctacgggatgtcaccggccccaccccagcg gccccctcccgaaaaaggggtctccgcagggcctggatggctgcggaggagcgcgggcat gacgcccgcgggctgccctcggatttggggaacgagaggaagaaaggactcaggcttggc gttgcctccaccaaactttgctcgcgagttgcgaaggctcagagcgcgcaggcaagctgc ggcctcggggcccccggggctcgcggccggcccccgccagcccggagagcagtcgccgcg ccgggccagggaggagcgcatcgggcgccagcgcccccgtgtggccgcgagcaccgcgcg cgcaaagcgggggcgagcgccgcccggggcgtgggtgacggggctggcatcaaggtagag ggcggccgggcagggaggaggggcagccccgccagcgccgagcccagccctgaagtcgtt tcggctgtgcagatgatgagcgacccaggtcgaggcgaatcaggagcccaggtgcttgcc caagcacagctggacgaaatgtctgttaccccgacggctggggcgcgggaccttgcaggg acgtggtgctgcagcggcctctgcctgggcctggcgtgcggctacctcaggctaattctc cattcgctgcgtggattcgcccccgggaggcggccccacgtttccagcctgtggatcaac aaggcggagaaccacctttgcaccctcgccgggaaaccagctcctacctccggccccctg ttgggccttctggccctgctccctgtcatgtcactctcctatttggactcagtttaccca tcgtccaaatttgccacaaattccttcagcttcagagaaccaatccaaatctgcatcctg gccatccttcaagtcccacctgctctgggagaggccgccctcacgtgctgcacccatcgg caaggatgtgggggagggagagtccaagcagcagatccttctagtacagtcgccagcaga acccaggaggcccacggtgcaagccacgtacatgatcatcttcaactgatcccggaggcg gccacaagcacctgccaggtgcagcccagccaagagcaagaatatgactggctttggcat tactctgatgaaccatcaagccccagcatgaaagaggagatgcagaagagcatttgtcca tgcaatctaaaaaaggtggacagccacatcccaaagcaggactccattcatacttcagtg tgtgcacagagcacttggggaccttgtggaaatgtagaagctgactcagtagttctggtg tggggcctgagattctgcatgtctaccggttcccaggtgatgtggggccatgctttttac agtgtaggtccacagaacatggactactcattgccaagcccagacgatggtgatagcccc ttgccccaaggatggcacccgagggaagctgaggagggagcctgcaaagacctgcagaac ttccgaggttggagaagcctgaccagacccatgagcagctgcatagctgtgccttgcaaa cccaaccaagtggctttagcaggaactgcggtagagctcagaggtgtctcaaccagctgc cagggctgccaggaaacttaa >gi568815588r:121383698_121693817|GENSCAN_predicted_peptide_5|173_aa MSLRSALLGGHSTGLGSGSGFEDEAKFSLMISQRELIATLSDLAATSFINECFLNLNVLT DHLPILSQVVMSESGHNHPMVYLVVVFRFFKSPPQALGLQLYLRSNDYDLKEAPESSEKL CSCALPCVTVIDKQVSSPSWPANPLTASLIFGSLEHPTQCLQNEGLAGHMRPH >gi568815588r:121383698_121693817|GENSCAN_predicted_CDS_5|522_bp atgtccctccgctccgcactgcttggcggtcacagcactggcctgggatctggcagtggc tttgaggatgaagccaagttctccttaatgatatcacagcgggagttgattgctaccttg agtgatttggctgcaacaagcttcataaatgagtgtttcttaaacttgaatgtgcttaca gatcacctgccgatcttgtcacaagttgtaatgtctgaatcagggcacaaccacccaatg gtgtatttggttgttgtgtttcgtttcttcaaatctcctcctcaagccctggggctacag ctttatctaaggagtaatgactatgatctcaaggaggcccctgagtcatcagaaaagtta tgttcatgtgcattgccatgtgttacagttattgataaacaggtgtcatctccctcctgg cctgcaaacccactgacggccagtctcatctttggatcccttgagcacccaacgcagtgc ctgcagaatgagggccttgcaggacacatgcgaccacactga >gi568815588r:121383698_121693817|GENSCAN_predicted_peptide_6|382_aa MNDKKCLLHVRPGVQTSASGFQSREQFLWGKDVPKIFTYKKSWTWVLKDHWYSERPQTAV FTGSCCPGKPLACQPSFQEPMLLMHEQVQGSGGESEPWAKGLLIIINAVVFAVFVQGKTT LWKNITGKGGRDHNRSQDVLPDCVLRNAHIPGSTLLICSPAQEALAYSQHYLPTQRVLPV VLNAFINMHLEHQQFSALRKSHSLGELVSGPELKNVSGITGDVTFVLDLEHGWDLVGIDG DEARLQGVEESWGWSWGRCMRLSGRKLVRKEDSLGSPPEGPIQDASSDERGNFDPALTQP STGKEKPRRHLEPVKPEPAMGGTESAECRLLWKWKEIVAWGLLNPPEGSILRTIMPSGSA LAPEAQRNPGCAVFISEVLEHY >gi568815588r:121383698_121693817|GENSCAN_predicted_CDS_6|1149_bp atgaatgacaagaagtgtctgctgcatgtcagaccgggggtccagacatctgcctccgga tttcagagcagggagcagtttctgtggggcaaagatgtcccaaaaatcttcacatacaag aagtcctggacctgggttttgaaggatcactggtattcagagaggcctcagactgctgtc ttcactggctcctgctgcccaggaaaaccccttgcatgccagccaagcttccaggagcct atgctgctcatgcatgaacaggttcaagggagtggaggcgaatctgagccctgggcaaaa gggcttctcataataattaatgcggtagtttttgcagtcttcgtgcaaggtaaaaccacc ctttggaagaacatcactggcaaaggtggccgtgaccataatagatctcaagatgttctt cctgactgtgttcttcgaaatgcccatattccagggtcaacgctgctcatatgctcacca gcccaagaagcacttgcctactcacagcactatctcccaactcaaagggttcttcctgtt gtcttgaatgccttcattaacatgcacctggaacatcaacagttttcagcattaaggaaa tctcacagtctaggagagcttgtttctggcccagagttgaagaatgtttcaggaatcaca ggagatgtgacatttgtgctggatctggaacatgggtgggatctggtgggcatagatgga gatgaagcccggctgcagggagtcgaggagtcctggggctggagctggggaaggtgcatg cgtctgagtggccggaaacttgtcagaaaggaagattccctgggttcacccccagagggt cccattcaggatgcaagctctgatgaaagggggaactttgatcctgctctcactcagcca tccacagggaaggaaaagccaaggagacacctagagccagtaaagccggagcctgccatg ggtggaacagagtcagcagaatgcaggctcctctggaagtggaaagagattgttgcctgg ggcttactgaaccctccagaaggttccattttaagaaccatcatgccttcaggttctgct ttggccccagaagcccagagaaaccctggctgtgcagtcttcatttcagaggttctcgag cactactaa