GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:18:09 Sequence gi568815591f:24185241_24389580 : 204340 bp : 40.07% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 22986 23067 82 2 1 50 72 60 0.652 1.78 1.02 Intr + 28124 28311 188 0 2 82 70 144 0.550 10.49 1.03 Term + 33248 33373 126 1 0 63 45 116 0.494 2.10 1.04 PlyA + 34947 34952 6 1.05 2.00 Prom + 37116 37155 40 -4.25 2.01 Init + 39733 39749 17 0 2 92 92 -5 0.180 -0.05 2.02 Term + 42525 43113 589 0 1 93 33 224 0.825 10.40 2.03 PlyA + 43297 43302 6 1.05 3.00 Prom + 45194 45233 40 -5.05 3.01 Init + 46886 46888 3 1 0 63 115 0 0.177 0.25 3.02 Intr + 65091 65206 116 2 2 55 86 113 0.309 6.13 3.03 Intr + 72215 72350 136 2 1 71 82 21 0.120 -0.55 3.04 Term + 74277 74360 84 2 0 87 52 88 0.570 1.77 3.05 PlyA + 74728 74733 6 1.05 4.03 PlyA - 76533 76528 6 1.05 4.02 Term - 77197 77175 23 2 2 112 43 33 0.328 -1.30 4.01 Init - 82799 82730 70 2 1 60 100 56 0.405 5.26 4.00 Prom - 88222 88183 40 -2.95 5.05 PlyA - 88986 88981 6 1.05 5.04 Term - 90218 89841 378 2 0 -51 42 260 0.345 1.30 5.03 Intr - 91200 90320 881 1 2 16 41 446 0.115 22.81 5.02 Intr - 92804 92561 244 2 1 94 -16 200 0.155 6.35 5.01 Init - 93659 93549 111 2 0 87 31 47 0.519 -0.84 5.00 Prom - 95012 94973 40 -5.45 6.00 Prom + 98916 98955 40 -4.55 6.01 Init + 99138 99386 249 2 0 46 62 254 0.586 14.16 6.02 Intr + 100001 100188 188 1 2 81 90 238 0.904 20.87 6.03 Intr + 104259 104410 152 0 2 82 51 149 0.085 9.49 6.04 Intr + 119871 120058 188 1 2 63 80 81 0.005 3.29 6.05 Intr + 133099 133216 118 0 1 75 88 -1 0.026 -2.28 6.06 Term + 134060 134214 155 0 2 56 54 175 0.678 8.00 6.07 PlyA + 134812 134817 6 1.05 7.00 Prom + 147126 147165 40 -5.75 7.01 Init + 147593 147805 213 1 0 76 55 97 0.560 3.99 7.02 Term + 152138 152263 126 1 0 99 47 112 0.934 5.50 7.03 PlyA + 153400 153405 6 1.05 8.00 Prom + 154344 154383 40 -7.55 8.01 Init + 156562 156602 41 0 2 81 101 53 0.962 3.73 8.02 Intr + 158642 158827 186 2 0 80 26 159 0.011 6.78 8.03 Intr + 165606 165719 114 0 0 2 60 140 0.004 1.24 8.04 Intr + 167438 167571 134 2 2 49 39 75 0.003 -1.93 8.05 Intr + 167839 168082 244 2 1 -12 113 183 0.076 6.43 8.06 Intr + 176125 176180 56 1 2 59 39 49 0.007 -5.20 8.07 Intr + 177000 177101 102 1 0 76 68 76 0.189 3.63 8.08 Intr + 182587 182766 180 2 0 48 96 93 0.311 5.02 8.09 Intr + 188018 188133 116 0 2 -62 94 159 0.243 0.75 8.10 Intr + 191460 191511 52 2 1 96 119 2 0.517 1.66 8.11 Term + 193535 193659 125 0 2 42 49 210 0.684 10.07 8.12 PlyA + 193753 193758 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 104259 104417 159 0 0 82 48 151 0.829 8.68 S.002 Term + 158642 158831 190 2 1 80 42 160 0.910 6.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:24185241_24389580|GENSCAN_predicted_peptide_1|131_aa MPIFKPTKVKETGTIRDGLRPVKNLPLAGSEPTSGIYSKEQLTNNCCQVSECTHMILEVA VSSSCDEVVSSELLLTHAVGPGMKRRKADLRPLLGGFGLSSNKHGYPVSQTTADAEETQG GERGATRCWRL >gi568815591f:24185241_24389580|GENSCAN_predicted_CDS_1|396_bp atgcccatatttaaaccaaccaaagtgaaggagactggaactatccgtgatgggcttaga cctgtcaagaatcttcctctggctggctcagaacccacttctggaatctactctaaagag caattgaccaacaactgctgccaagtttccgagtgcactcacatgattctggaagtggca gtttcaagcagctgtgatgaggtggtgagcagtgagttgctcctcactcatgcagtgggc cctggaatgaagaggaggaaagcagacctgcgccccttgcttggaggctttggactcagt tctaacaagcacggttatccagtttcccagaccacggcagatgcagaggaaacccaagga ggtgaaagaggtgccactcgctgctggaggctatga >gi568815591f:24185241_24389580|GENSCAN_predicted_peptide_2|201_aa MQWPARNGDGAQSPTASHWQARVSSEATQLCACLQPQTSSFLLTSALPALGQPLFLHVSP LGLVMMPGEPMLLCCVVSFGINLPDSYKGLETLGYGPVGGGHWQDHGNIQRAGAYVSSSY STTPNPENPPPVLTVALRLSPQPAALTPQALTSILTPPYSSQSKSFQEKLAFLVCSRTLL KEAYLFTPTPKALMSYLTVII >gi568815591f:24185241_24389580|GENSCAN_predicted_CDS_2|606_bp atgcagtggccagcgagaaatggagatggggctcagagtcccacagcctcccactggcag gcccgagtatcatctgaagccacccagctctgtgcctgcctgcagccccagacctccagc tttctactaacgtcagcattgcctgccctggggcaacccctgtttctccatgtgtctcct ctgggtctggtcatgatgcctggggagccaatgctgctttgctgtgttgtatcatttggg atcaacttgcctgactcttacaaaggacttgaaaccttggggtatggccctgttggggga ggtcactggcaggaccacggtaatatccaaagggcaggggcctatgtcagctccagctac tccaccaccccgaatccagaaaatcctcctcctgttcttaccgtggctctccggctgtct ccccaacctgcagctctaactccccaggccctgacctctatcctaacccctccctactca tcccaatccaaaagctttcaggagaagcttgcatttcttgtttgcagcaggacgctcctt aaagaagcttacctttttacccctacccccaaagctttgatgagctatttaacagtaatt atttaa >gi568815591f:24185241_24389580|GENSCAN_predicted_peptide_3|112_aa MKIQLATIQRPEQLFENPNARNKPETPMWSTELNGIKLEGPIMEQRYFEHRHGFFTLERG SLPSCPEARWGDQEFKQTHSSSYLQVEKFQGVPMQSPSKATEEPRGDFQEGK >gi568815591f:24185241_24389580|GENSCAN_predicted_CDS_3|339_bp atgaaaattcaactagcaactatccaaagaccagaacaactttttgaaaaccccaatgca aggaacaagcctgagacaccaatgtggtccacagaactgaatggaattaaattagaaggt cctattatggagcaaagatattttgaacacagacatggatttttcactttggaaagggga tcccttccatcttgtcctgaggccagatggggggaccaagaatttaagcagacacattca tcttcctatcttcaggtggagaaattccaaggtgttcccatgcagtctcccagcaaagcc actgaggagccaagaggagacttccaagaggggaagtga >gi568815591f:24185241_24389580|GENSCAN_predicted_peptide_4|30_aa MGKKGAYWSAGWNDQEERKSGTEVKKQHAQ >gi568815591f:24185241_24389580|GENSCAN_predicted_CDS_4|93_bp atggggaaaaagggagcatattggtctgctggatggaatgatcaggaagaaagaaaaagt ggcactgaagtgaaaaagcagcatgctcaatag >gi568815591f:24185241_24389580|GENSCAN_predicted_peptide_5|537_aa MTDCTGKIGTLPPKYCAVPTVLANSTPGNYIPHMAQQDCSSSPAMEQSWTENDFDKLTEV AFRKSVITNFSELKEDVRTHCKGAKNLENRLDEWLTKINSVGKPLNDLRKLKTMARELPP HHTYSKIDHIVGSKALLSKCKRTEITTNCLSDHSAIKFELRIKKLTQKHTTTWKLNNLLL NDYWVNNEMKAEIKMFFETNENKDKMYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDIL TSQLKELEKQEQTNSNASRRQEITKIRAELKEIKIQETLQKISESSSWFFEKISKIDRPL ARLIKKKKEKNQIDTIKVDKGDITTDPTEIQTTIREYYKHLYANEVENLEEMDKFLDTYT LPRLHQEEVKSLNRPITSSEMEVVINSLPTEKKSPGPDGFTVKFYQRYKEELRHNNNKKE NFRPISLMNINEKILNRILANRIQQHIKKLIHYDQACFIPGMQGSFNIRKSIHIIHHINR TNDKNHMIISIDAEKAFDKIQQDFMLKTLNKLGIDGTYLKIVRATYDKPTASIILNG >gi568815591f:24185241_24389580|GENSCAN_predicted_CDS_5|1614_bp atgactgactgtaccgggaaaattgggacactgccacctaaatactgtgctgttcccacg gttttagcaaacagcacaccaggaaattatatcccacacatggctcagcaggattgcagc tcttcaccagcaatggaacaaagctggacagagaatgactttgacaagttgacagaagta gccttcagaaagtcagtaataacaaacttctctgagctaaaggaggatgttcgaacccat tgcaagggagctaaaaaccttgaaaacagattagatgaatggctaactaaaataaacagt gtagggaagcccttaaatgacctgaggaagctgaaaaccatggcaagagaactaccacca catcacacttattccaaaattgaccacatagttggaagtaaagcactcctcagcaaatgt aaaagaacagaaattacaacaaactgtctctcagaccacagtgcaatcaaatttgaactc aggattaagaaactcactcaaaagcacacaactacatggaaactaaacaacctgctcctg aatgactactgggtaaataatgaaatgaaggcagaaataaagatgttctttgaaaccaat gagaacaaagacaaaatgtaccagaatctctgggacacatttaaagcagtgtgtagaggg aaattcatagcactaaatgcccacaagagaaagcaggaaagatctaaaatcgacatccta acatcacaattaaaagaactagagaagcaagagcaaacaaattcaaatgctagcagaagg caagaaataactaagatcagagcagaactgaaggagataaagatacaagaaactcttcaa aaaatcagtgaatccagcagttggttttttgaaaagatcagcaaaattgatagaccatta gcaagattaataaagaagaaaaaagagaagaatcaaatagacacaataaaagttgataaa ggggatatcaccactgatcccacagaaatacaaactaccatcagagaatactataaacac ctctacgcaaatgaagtagaaaatctagaagaaatggataaattcctggacacatacacc ctcccaaggctacaccaggaagaagtcaaatccctaaatagaccaataacaagttctgaa atggaggtagtaattaatagcttaccaaccgaaaaaaaaagcccaggaccagatggattc acagtcaaattctaccagaggtacaaagaggagctgagacacaacaacaacaaaaaagag aattttagaccaatatccctgatgaacatcaatgagaaaatcctcaatagaatactggca aaccgaatccagcagcacatcaaaaagcttatccactatgatcaagcgtgcttcatccct gggatgcaaggctcgttcaacatacgcaaatcaatacacattatccatcacataaacaga accaatgacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaggacttcatgctaaaaactctcaataaactaggtattgatggaacgtatctcaaa atagtaagagctacttatgacaaacccacagccagtatcatactgaatgggtaa >gi568815591f:24185241_24389580|GENSCAN_predicted_peptide_6|349_aa MIALHSPADYAGSAPRRGPALLAAKFHLFLCSPLIVHGPSPGPFPLGGSSELPFSRYGGR TRDRYPKAAGGLPAGRRLEAQEAMLGNKRLGLSGLTLALSLLVCLGALAEAYPSKPDNPG EDAPAEDMARYYSALRHYINLITRQRYGKRSSPETLISDLLMRESTENVPRTRYDKACDG DIVAELKVPRGGKSETGNSEIFVSLVTPITLQSETLIQSSHFRGDETEDSKVKYLSKSRD SNSILLTPRPVLYMINASQGSCSTSCDYLPAKHSKIQLAHILKRYSFSFLGLKEGGGRGQ MITPKSGETENIAEVCLPGTEATGAIDWQEHASGHFDKLLEVASGVAGE >gi568815591f:24185241_24389580|GENSCAN_predicted_CDS_6|1050_bp atgatcgcgctccactccccagcggactatgccggctccgcgccccgacgcggaccagcc ctcttggcggctaaattccacttgttcctctgctcccctctgattgtccacggcccttct cccgggcccttcccgctgggcggttcttctgagttaccttttagcagatatggagggaga acccgggaccgctatcccaaggcagctggcggtctccctgcgggtcgccgccttgaggcc caggaagcgatgctaggtaacaagcgactggggctgtccggactgaccctcgccctgtcc ctgctcgtgtgcctgggtgcgctggccgaggcgtacccctccaagccggacaacccgggc gaggacgcaccagcggaggacatggccagatactactcggcgctgcgacactacatcaac ctcatcaccaggcagagatatggaaaacgatccagcccagagacactgatttcagacctc ttgatgagagaaagcacagaaaatgttcccagaactcggtatgacaaggcttgtgatggg gacattgttgcagagctcaaggtgcccaggggagggaagtcagagacaggtaattctgag atttttgtgtcattggtcactccaatcaccttgcaaagtgaaacattaatccaatcttca catttcagaggtgacgagactgaggacagcaaagttaagtacctgtccaagagcagggac tcaaattcaattcttttgactccacgtccagtgctctacatgattaatgcttcacagggg agttgttcaacaagttgtgactatcttccagcaaagcacagcaaaatccagcttgcccac attttgaaaaggtacagctttagtttcctgggactgaaagagggagggggaagaggacaa atgatcaccccaaaatctggagagacagagaatatagcagaggtctgcttacctggaaca gaagcaactggagccatagactggcaagagcatgcaagtggtcactttgacaaattgttg gaagttgccagtggagtagctggagagtga >gi568815591f:24185241_24389580|GENSCAN_predicted_peptide_7|112_aa MKAPLGDFEQKLEGSERGHHIDTKELSSQANRNSMCKGPGVGTHLACSRISKESTSVDTG RAAMGRVVGDEESRMKAQTSTRMTVAGCHGNSEKALDSAFDFTVDHKPKHED >gi568815591f:24185241_24389580|GENSCAN_predicted_CDS_7|339_bp atgaaggctccactgggtgactttgaacaaaaacttgaaggaagtgagagaggtcaccat atagataccaaggagttgagcagtcaggcaaacagaaacagcatgtgcaaaggccctggg gtgggaacacatctggcatgttcaaggattagcaaggagtccaccagtgttgacactggg agagctgcaatggggagagtagtaggagatgaggagtctagaatgaaggcacaaacttcc accaggatgactgtggctggttgccatggaaatagcgagaaggcactggattcagcattt gacttcactgtggatcacaagcccaagcatgaagactga >gi568815591f:24185241_24389580|GENSCAN_predicted_peptide_8|449_aa MAGEWELVCVAITCTTCRNATVKEFRQFLMSPSEVIKKKNPSERLSSQKGHIDCHKDSPC PANRTEEGGREQCEEKEALGSAAAAVTSCNGGTRFLTRKPSPASVQLQMKHQKGAPGGRG GCGQSFGILKHSCLLALKRTADLRAQCSSSAKGQTASSKRNSININKKDVHTKTPSEGHQ HQRPKLDKSTKMRRSQRKKAENFKNQNASFPPKDHNSSPAKEQNWAENQFDELTEVGFRS NWIKERDAVLTGSTAYFPGMELRHSGVLCEHLLMTRTSKDLYPTNTSTAVHTACWRWCRG HGEEGKKEISVCLTDFRIGDTPKKQKVFHWKAQQTNWRRAATIATIGSDCSKGQTKHFSD SWNTEPPAEKSSWPEATVLVLRPPGYKERSQGEPLYSPNSGIFLNHPSEPQAPLAEEKKR RGEAAEHQRLRLDVKEQQLDLEGQLDGVA >gi568815591f:24185241_24389580|GENSCAN_predicted_CDS_8|1350_bp atggcaggagaatgggaactggtatgtgtggcgatcacatgtaccacttgcagaaatgcc actgttaaagaatttcgacaatttttgatgagccccagtgaggtgataaagaaaaagaat ccatctgagagactcagtagccaaaaagggcatattgattgtcacaaagacagcccatgc ccagcaaatagaactgaagaaggagggcgagaacaatgcgaagaaaaagaggctctagga tctgcagcagcagcagtgacatcctgcaatggaggcaccaggttcctcacccgcaaaccc tcaccagccagtgtgcagctgcaaatgaagcatcagaagggagcacctgggggaaggggc ggttgtgggcaaagcttcggcatacttaaacattcctgcctgctggctctgaagagaaca gcagatctccgagcacagtgctcgagctctgctaagggacagacagcctcctcaaaaagg aatagcatcaacatcaacaaaaaggacgtccacacaaaaaccccatccgaaggtcaccaa catcaaagaccaaagctagataaatccactaagatgaggagaagccagcgcaaaaaggct gaaaatttcaaaaaccagaatgcctcttttcctccaaaggatcacaactcctcgccagca aaggaacaaaactgggcagagaatcagtttgatgaattgacagaagtaggcttcagaagt aattggatcaaagaaagagatgctgtgttaacgggcagtacagcctattttccaggaatg gagcttaggcacagtggagtcctttgtgagcacctgctaatgaccaggactagcaaggac ctctacccaacaaacacatctacagctgtccacactgcgtgttggaggtggtgcagaggc catggggaggagggaaagaaggaaatatcagtgtgtctgacagacttcagaataggagac accccaaagaagcaaaaagtattccactggaaggcacagcagaccaactggagaagagca gctacaattgctacaattgggagtgactgcagtaagggtcaaacgaagcacttttctgac tcttggaacactgaaccgccagctgagaagtccagctggcctgaggccaccgtgctggta ctcaggccaccaggctacaaggagaggtcgcagggagaaccattgtattcaccaaacagt gggatcttcctaaaccatccttctgaaccccaggctccactggcagaggaaaagaagaga agaggagaagcagcagaacatcagagactacggttggatgtcaaagagcagcagcttgat ttagagggacagcttgatggcgttgcttag