GENSCAN 1.0    Date run: 3-Nov-116    Time: 15:12:18

Sequence gi568815592r:130758139_131056485 : 298347 bp : 38.64% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    778    773    6                               1.05
 1.02 Term -   6120   5985  136  2  1   85   42    85 0.142   0.11
 1.01 Init -  13632  13553   80  0  2   63   89    75 0.554   5.68
 1.00 Prom -  15845  15806   40                              -5.25

 2.06 PlyA -  16863  16858    6                               1.05
 2.05 Term -  19670  19602   69  2  0   99   47    99 0.483   3.86
 2.04 Intr -  26713  26618   96  1  0   84   87    53 0.539   4.09
 2.03 Intr -  35406  35361   46  2  1   45   95    28 0.104  -3.31
 2.02 Intr -  43575  43449  127  1  1   61   36   191 0.645  10.02
 2.01 Init -  44992  44872  121  1  1   68   74    57 0.520   2.70
 2.00 Prom -  52912  52873   40                              -6.55

 3.00 Prom +  54359  54398   40                              -4.95
 3.01 Sngl +  61845  62669  825  2  0   39   39   331 0.812  18.78
 3.02 PlyA +  62961  62966    6                               1.05

 4.00 Prom +  65171  65210   40                              -5.25
 4.01 Init +  69276  69513  238  2  1   70  110   226 0.981  21.22
 4.02 Term +  78778  78941  164  2  2   62   50   114 0.651   2.12
 4.03 PlyA +  78971  78976    6                               1.05

 5.03 PlyA -  79023  79018    6                               1.05
 5.02 Term -  83405  83196  210  2  0   94   47    88 0.059   1.61
 5.01 Init -  95539  95471   69  1  0   64   40   106 0.189   4.50
 5.00 Prom -  97372  97333   40                              -5.45

 6.09 PlyA -  99158  99153    6                               1.05
 6.08 Term - 100105  99998  108  1  0   80   42   153 0.999   7.43
 6.07 Intr - 105580 105500   81  1  0   71   99   141 0.999  12.42
 6.06 Intr - 107496 107398   99  0  0   33   98    70 0.767   1.89
 6.05 Intr - 109443 109321  123  0  0   88   87    95 0.969   9.26
 6.04 Intr - 111988 111425  564  1  0   68   84   683 0.945  57.96
 6.03 Intr - 114400 114233  168  1  0   88   42   119 0.906   6.52
 6.02 Intr - 114959 114864   96  2  0   53   50    81 0.341   0.09
 6.01 Init - 117398 117336   63  2  0   53   85    47 0.531   2.10
 6.00 Prom - 120757 120718   40                              -4.85

 7.13 PlyA - 120971 120966    6                               1.05
 7.12 Term - 125029 124364  666  1  0  -22   43   289 0.494   6.24
 7.11 Intr - 127130 126958  173  0  2   52   86   190 0.655  13.94
 7.10 Intr - 132328 132156  173  0  2  140   82   140 0.999  17.26
 7.09 Intr - 136303 136206   98  1  2   48   91   102 0.791   4.39
 7.08 Intr - 136981 136829  153  1  0   51   99    89 0.777   5.65
 7.07 Intr - 141440 141353   88  1  1   83   79    76 0.987   5.25
 7.06 Intr - 143042 142824  219  1  0  143   62   254 0.921  24.80
 7.05 Intr - 146402 146327   76  0  1   45   80    36 0.032  -3.85
 7.04 Intr - 150725 150683   43  2  1   83  100    38 0.002   1.39
 7.03 Intr - 168571 168467  105  1  0   60  106    66 0.728   5.09
 7.02 Intr - 197179 196967  213  1  0   84   78   293 0.807  25.89
 7.01 Init - 198347 197856  492  2  0   90   91   651 0.968  60.80
 7.00 Prom - 202778 202739   40                              -4.15

 8.08 PlyA - 202878 202873    6                               1.05
 8.07 Term - 213464 213309  156  2  0  105   40   124 0.977   6.25
 8.06 Intr - 219333 219249   85  2  1  107   85    44 0.338   4.90
 8.05 Intr - 227508 227277  232  1  1   61   40   133 0.011   1.71
 8.04 Intr - 235573 235459  115  1  1   59   14   146 0.054   3.40
 8.03 Intr - 250580 250260  321  2  0   61   37   258 0.008  13.13
 8.02 Intr - 267828 267632  197  1  2   99   23    95 0.018   2.31
 8.01 Init - 275758 275632  127  1  1   76   93    54 0.600   5.07
 8.00 Prom - 286838 286799   40                              -6.25

 9.02 PlyA - 287070 287065    6                               1.05
 9.01 Term - 296049 295863  187  2  1   24   53   243 0.869  10.38

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_1|71_aa MIESSYHSTSLPAFGVVTVLDFGLYNSLNWNMGVFNQDNTINPFKNEWYNTPLATQSINH SSQFGGETVPT >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_1|216_bp atgattgagagttcttatcactccacatccttgccagcatttggtgttgtcactgttttg gattttggcctttacaacagcttgaactggaatatgggtgttttcaatcaggacaacaca ataaacccatttaaaaatgaatggtataacactcctttggctactcagagcatcaaccac agctcacagtttggaggagagactgtgccaacataa >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_2|152_aa MTLKEHAAFKHLFNKAHLVPPLIHSTLRGYSTCFREHGVGVASTLSDSVGDHGGKANIRA AQPWLIRSANPGLRPVAPRRSGSFRKVFLAKHEWVKVLSLSAGEFKEKIVLRPVTVRKYK ALDGPGRMVKPTQCEDSEDEDFYDDPLPVYKE >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_2|459_bp atgactcttaaggagcatgctgccttcaagcatctgtttaacaaagcacatcttgtaccg cccttaatccattcaaccctgagaggatacagcacatgtttcagagagcacggggttggg gttgcttctactttatcggatagtgttggagatcatggagggaaggcaaatatacgcgcg gcccagccgtggctaatacggagtgcgaatccggggctccggcccgtggccccgcggcgg tccgggagcttcagaaaagtcttccttgcaaaacatgagtgggtgaaagtactgagcctg agtgctggagaatttaaagaaaagattgtattgaggccagtaactgtcagaaaatataag gctttagatggtcctggaaggatggttaagcctactcaatgtgaagacagtgaggatgaa gacttttatgatgatccacttccagtttataaagagtaa >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_3|274_aa MNIDAKILNKILANQIQQHIKKIIRHDQMGFIPGMQGWFNTCKSINVLHHMNRIKNKNHM IISIDAEKAFDKIQHRFMIKTLSKISIQGTYLNIMKAIYDKPTVNIILNGGKLKVFALRT GTRQGFPLSLLLFNIVLEVLARAIRQEKEIKSIQIGKEEVILSLFADYMIVCLENPKYFS RKLLELIREFSKVSRYKINVHKSIALLYNNSNQAENQIKNSTPFTIAAKKENKILRNIPN QGGERPLQGKLQNTAERNHRQHKQMETHLLLMDE >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_3|825_bp atgaatatagatgctaaaatcctcaacaaaatactagctaaccaaatccagcaacatatc aaaaagataatccgccatgatcaaatgggtttcataccagggatgcagggctggtttaac acatgcaagtcaataaatgtgttacaccacatgaacagaattaaaaacaaaaatcacatg atcatctcaatagatgcagaaaaagcattcgacaaaatccagcatcgctttatgattaaa 
actctcagcaaaatcagcatacaagggacatacctcaatataatgaaagccatctatgac aaacccacagtcaacataatactgaacgggggaaagttgaaagtgttcgctttgagaact ggaacaagacaaggattcccactctcactactcctcttcaacatagtactggaagtccta gccagagcaatcagacaagagaaagaaataaaaagcatccaaatcggtaaagaggaagtc atactgtcgttgtttgctgattatatgattgtttgtctagaaaaccctaaatatttctcc agaaagctcctagaactgataagagaattcagcaaagtttccagatacaaaattaatgta cacaaatcaatagctctcctatacaacaacagcaaccaagcagagaatcaaatcaagaac tcaaccccttttacaatagctgcaaaaaaagaaaataaaatacttaggaatatacctaac caaggaggtgaaagacctctacaaggaaaactacaaaacactgctgaaagaaatcacaga cagcacaaacaaatggaaacacatctcttgctcatggatgagtag >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_4|133_aa MLSKGRSPRRKQVQTQRKAALVLSVTPMVPVGSVWLAMSSVLSAFMRELPGWFLFFGVFL PVTLLLLLLIAYFRIKLIEDQVGSLPTLPYIQLHPMLQGLSPGARNAPTTLNGLTTTVPN LIPNAIVSETVHQ >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_4|402_bp atgctgagcaaaggccggagccccagaagaaaacaagtacagactcagaggaaagctgcc ctggtcctgagtgtgactcccatggtccccgtggggtctgtgtggttggcaatgagctct gtgctgtcagctttcatgagggagctccctggctggttcctgttctttggggtcttcctc cccgtgactttgctgctgctcctcctcatcgcctacttcaggatcaaactgattgaggac caagtcggcagtctccctacattaccctacattcagctgcatcctatgctccaaggcctt tcaccaggggccagaaatgctcccacaaccctcaatggtctcactacaacagttccgaat cttattccaaatgctatagtatctgaaacggttcaccaatag >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_5|92_aa MVKEQPAEDEPPKSTFLSGWQPALDSCTGLLTVVPASTPCRLQLIHHTEATAVFSKRKSH HDKPFLKTKEETIPTSFPGLIMPCMTGQPQVL >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_5|279_bp atggtaaaagaacaacctgcagaggacgaacctcccaagtccacctttctcagtggatgg cagccagctctggactcctgcactggtctcctgactgttgttcctgcctctactccttgc cggcttcagcttattcatcacactgaagccacagcagtgttttcaaaacgtaaatcacac catgacaaacccttcctcaaaaccaaagaagaaacgatcccaacctcctttcctgggctt atcatgccctgcatgactggccaaccccaagttctttga >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_6|433_aa MYPRVRTSHNTGTERPNDRGLRVKVVLRREIVGVGVVREGLTENADYEVGLRESLEEEIT SILFSEKGFSDSMKATFTSATTWTAEGAVVSANAPSEKLSSSPFSHLLESSHETLNIVEE 
KKRAEVGKDERVITEEMNGKEISPGSGPGEIRKVEPVTQKDSTSLSSESSSSSSESEEED VGEYRPHHRVTEGTIREEQEYEEEVEEEPRPAAKVVEREEAVPEASPVTQAGASVITVET VIQENVGAQKIPGEKSVHEGALKQDMGEEAEEEPQKVNGEVSHVDIDVLPQIICCSEPPV VKTEMVTISDASQRTEISTKEVPIVQTETKTITYESPQIDGGAGGDSGTLLTAQTITSES VSTTTTTHITKTVKGGISETRIEKRIVITGDGDIDHDQALAQAIREAREQHPDMSVTRVV VHKETELAEEGED >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_6|1302_bp atgtatccacgtgtaaggacgagccacaacacaggaactgaaagaccaaatgacaggggc ttgagagttaaggtggtattgaggagggagattgtaggagttggagtggtgagggaaggc cttacagaaaatgcagactatgaagtgggcctcagggagagcctagaagaggagataaca tcaattttgtttagtgaaaagggcttttctgatagcatgaaagccaccttcacttcagcc accacttggacagcagagggcgctgttgtgagcgctaatgcaccttctgaaaagctttct tcctcgcccttctcacacttgcttgagagttcacatgagactctgaatatagtggaggag aagaagcgggcagaggttgggaaagacgaaagagtaatcacagaagaaatgaatggtaaa gagatatcacctgggagtggtcctggggagattcgtaaggtggagcctgtgacacaaaaa gactccacctccctgtcttctgagagcagcagcagcagcagtgagagtgaggaggaagac gtgggagagtaccgtccccaccaccgagtgaccgagggcaccatcagggaggaacaggag tatgaagaagaggtggaggaagaaccccgcccggcagccaaggtagtagagagggaggaa gcagtgcccgaagccagcccagtcacacaagcaggtgccagtgtaatcacagtagaaaca gtgatccaggaaaatgtaggtgcccaaaagatacccggagagaagagtgtacacgaaggc gctcttaagcaagacatgggagaagaagcagaggaagagccacagaaagttaacggagag gtgtcccatgttgacattgatgttttgccacaaattatttgttgttcagagccaccagtg gtaaaaacagagatggtaacaatttctgatgcctcacaaaggacagaaatctccaccaag gaagtccccattgtccaaactgagaccaaaaccatcacatatgagtctccacagattgat ggcggggctggtggtgattcgggcacgttactgaccgcacaaaccatcacatctgagtcc gtgtcaacaacgacaaccacacacatcaccaagactgtaaaaggtggaatttctgaaaca agaattgagaaacgcattgtgatcacaggagatggagatattgatcatgaccaggcactg gctcaggcgatcagggaagccagagagcagcaccctgacatgtcggtcacaagagtggtg gtacacaaagaaacagagttggctgaggaaggggaagattaa >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_7|832_aa MTTEVGSVSEVKKDSSQLGTDATKEKPKEVAENQQNQSSDPEEEKGSQPPPAAESQSSLR RQKREKETSESRGISRFIPPWLKKQKSYTLVVAKDGGDKKEPTQAVVEEQVLDKEEPLPE EQRQAKGDAEEMAQKKQEIKVEVKEEKPSVSKEEKPSVSKVEMQPTELVSKEREEKVKET 
QEDKLEGGAAKRETKEVQTNELKAEKASQKVTKKTKTVQCKVTLLDGTEYSCDLEKHAKG QVLFDKVCEHLNLLEKDYFGLLFQESPEQKNWLDPAKEIKRQLRNLPWLFTFNVKFYPPD PSQLTEDITRYFLCLQLRQDIASGRLPCSFVTHALLGSYTLQAELGDYDPEEHGSIDLSE FQFAPTQTKELEEKVAELHKTHRGLSPAQADSQFLENAKRLSMYGVDLHHAKDSEGVDIK LGVCANGLLIYKDRLRINRFAWPKILKISYKRSNFYIKVRPAELEQFESTIGFKLPNHRA AKRLWKVCVEHHTFYRLVSPEQPPKAKFLTLGSKFRYSGRTQAQTRQASTLIDRPAPHFE RTSSKRVSRSLDGAPIGVMDQSLMKDFPGAAGEISAYGPGLVSIAVVQDGDGRREVRSPT KAPHLQLIEGKTVSMCPPSPLIHRTSSFSPNLLPVPEMDLLSDISEEDPFGEADQITLDS LEHLSTGEISEQGDSEVAMPDLVLDSAIAPSQPDCPSPIHGKTLKAADDSDSEFGYFSFS FSKCFPSGFPSLLDEDGYLAFPSLPKVWVSFLPAGVQHYVPITSPSFIPSLILIFGLLLS ASQSVPFSLAFSLPLALSLCYLEAKAASFNVSYDCDLNDKLEEEEVVAGMPT >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_7|2499_bp atgactactgaagtaggctctgtgtctgaagtgaagaaggactctagccagttaggaaca gatgcaaccaaggaaaaacctaaagaagtagcagaaaatcagcagaatcagtcttccgat ccagaggaggaaaaaggttcccagccacctcctgcagctgaaagccaaagtagtctacgc cgccagaagagagagaaggaaacatcggagagcaggggtatttctcggttcataccgcca tggcttaagaagcaaaagtcatataccttagtagtggccaaagatggaggagataaaaaa gagcctacccaagctgttgttgaagaacaggtcttagataaagaggaaccccttccagaa gaacagagacaggctaagggtgatgctgaagaaatggctcagaagaaacaagagattaaa gttgaagtcaaggaagaaaaaccctcagtgagcaaggaagaaaaaccctcagtgagcaaa gtggagatgcagcctactgaattagtaagtaaggagagagaagagaaggtaaaagaaaca caggaagacaaattagaaggaggagcagcaaaaagggagaccaaggaagtgcagaccaat gagctgaaagcagagaaggcatctcaaaaagtcaccaagaagaccaaaactgtccagtgt aaagtgaccctcttagatggcaccgaatacagctgtgacctggagaaacatgccaaggga caagtgttatttgacaaagtgtgtgaacacctcaatctcttggagaaagactactttgga cttttgtttcaggaaagccctgagcagaaaaactggttagatcctgctaaagaaataaag agacaactgagaaaccttccatggctattcacttttaatgtgaagttttatcctcctgat ccttctcaattgactgaagatatcaccagatacttcttgtgccttcagctccggcaggac attgcctctggccgcctgccctgctcttttgtgactcatgctctcctgggatcctacacc ctgcaggctgaacttggtgactatgacccagaagaacatggcagcatcgacctcagtgaa ttccagtttgcccctactcagactaaggagctggaagagaaggtggcagagctgcacaaa acccacaggggcttatcgccagcacaagctgattcccagttcttagaaaatgcaaagagg 
ctttccatgtatggtgttgacctacatcatgccaaggactcagaaggtgtggacatcaag ctgggcgtgtgtgctaatggacttctcatttacaaagacagactgcgaatcaatcgtttt gcttggccgaaaatcttaaaaatttcctataaacgcagtaacttctacattaaagtcaga ccggcagagctggaacagtttgagagtaccattggattcaaactgccaaaccaccgggca gcgaaaagactatggaaagtgtgcgtggagcatcatactttctacaggcttgtttctcca gagcagccaccaaaagccaagttcctgaccttggggtccaaatttcgctatagtggccgc acccaagcacagacccgccaggccagcaccctcatagataggccagcaccacactttgag cgcacttctagtaaacgggtctccaggagtctagatggagctccgattggtgtcatggac caaagtcttatgaaggattttcctggcgctgctggggagatttcagcctatggacctgga cttgtcagcattgccgtggtacaagatggggacggcaggagggaagtgagaagcccaact aaagccccacatttgcagctcattgaaggaaagactgtttccatgtgcccaccttcacct ctcatccaccgtacttcatctttttccccaaacctattgcctgtccctgaaatggacttg ctttcagacatttcagaggaagacccctttggggaggctgaccagatcacactagacagc ttagagcacctttccacaggtgaaatatcggagcagggggattctgaagttgctatgcct gatttggtcttggattctgccatagccccttctcagcctgactgtccttctcccatccat gggaagactcttaaagcagctgacgacagtgattctgagtttggttacttctccttcagt ttctccaaatgttttccctccggcttcccctcccttcttgatgaagatggatatcttgct ttccccagcctccccaaggtctgggtctccttcctccccgctggtgtgcagcactatgtc ccaatcacctccccttcattcattccttcccttatcctcatctttgggctacttctgtct gcttctcagtcagttcctttttctcttgccttttcccttcctctggctctatccctctgc tatctggaggctaaagccgcctcctttaatgtttcttatgactgtgacttgaatgacaag ctcgaggaagaggaagtagttgctggcatgcctacgtaa >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_8|410_aa MKYVHNAVQPSPLSISRTFSSSQTQTLYPLNSNSSSFFLAAPGDFERLCFDPLCDDFCIL LKDNLTFRFHFAPVYCIQFEVDQDCWQGSKSGSQLMGSASDCELALLLIMMQELGSHGLG KFHPCGFAGYSPTPGCFHGLALSVCGFSRDKVQAVGESTVLGSGGQWLSSHSFTMQCPRG DSVWGSNPIFPFRTALAEVLSESSTSAANFCLDIQFQGARSLAPAAGVSKIAEAGRELAG RHIPPEDREGGRLGHQLLLTMTQDSFPGHFASWGPSAGDVPSGPHLAMLPAAGDVTFTWP FQVESSLCTGSRVLVPRTGSRVLVAHPRRMSVSPLLIQAGQFATGKGIEIISKNLTSILD VNYPFAQWLHVVEAATCQLLSSRLGDQTDYRDITVLVFKSPLFYVIKVQE >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_8|1233_bp atgaagtatgttcacaatgctgtgcaaccatcaccactgtctatttccagaactttttca tcatcacaaacacagactctgtacccgttaaacagtaactcctcttcctttttccttgca 
gccccaggtgactttgaacggctttgctttgatccactttgtgatgacttttgtatactc ttaaaggataatcttacatttagatttcattttgcccctgtgtactgcatacaatttgaa gtggaccaagattgctggcagggaagtaaatcaggaagtcagctcatgggttcagcttct gactgtgagcttgctttgctgcttatcatgatgcaagagttgggctcccatggccttggg aagttccacccctgtggctttgcagggtacagccccactcccggctgctttcatgggctt gcattgagtgtctgtggcttttccagggacaaggtgcaagctgttggtgaatctaccgtt ctggggtctggaggacagtggctgtcttctcacagcttcaccatgcagtgccccagaggg gactctgtgtggggttccaaccctatatttcccttccgcactgccctagcagaggttctt tctgagagctcgacttctgcagcaaacttctgcctagacatccagttccaaggagcaagg tcacttgcgccagcagccggtgtcagcaagatagcagaagcaggaagagagctggccgga agacacataccccctgaagatcgagagggaggccgtctgggtcaccaactcctactcaca atgacacaggattcttttcctggtcactttgcaagctggggaccttcagctggcgatgtc ccatctggacctcacttggccatgctacctgctgcaggagacgtcacattcacttggccc ttccaggtcgaatctagcttatgcactggttcccgagttcttgtcccacgcactggttcc cgagttcttgtcgcgcacccaagaagaatgagtgtgtctccccttctcatccaggcagga cagtttgctacaggaaaaggcattgagattattagtaagaatctgacatctatcctggat gtgaattatccatttgcccagtggctccatgttgtagaagcagccacttgtcagttactt agtagccggcttggtgatcagactgactatcgtgatatcacagtgcttgtgttcaaatca cccttattctacgtaataaaagtacaagaatag >gi568815592r:130758139_131056485|GENSCAN_predicted_peptide_9|62_aa XCVRPWNGRLERSMDSSYRPCSPNTSHEPVESECEDGMRTDRSHTDINPHDMGIDQENHT GS >gi568815592r:130758139_131056485|GENSCAN_predicted_CDS_9|189_bp nngtgtgtgcgaccatggaatggaagactggagagatccatggattctagctacaggcct tgttcccctaatacgagccatgagccagttgaatctgaatgcgaagatggaatgaggacc gaccggagtcacactgacatcaaccctcatgacatggggatagatcaagaaaaccacaca ggaagctga