GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:43:39 Sequence gi568815588f:47821988_48023112 : 201125 bp : 44.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 51793 51897 105 0 0 82 42 107 0.314 3.91 1.02 PlyA + 54239 54244 6 1.05 2.00 Prom + 55411 55450 40 -4.26 2.01 Init + 59331 59395 65 2 2 64 100 43 0.601 3.72 2.02 Intr + 63638 63771 134 2 2 59 96 75 0.620 5.69 2.03 Intr + 73869 73938 70 1 1 122 85 14 0.027 2.74 2.04 Intr + 74750 74777 28 2 1 77 92 1 0.017 -2.48 2.05 Intr + 89493 89631 139 2 1 100 101 18 0.001 4.34 2.06 Intr + 95041 95184 144 2 0 110 46 36 0.002 1.85 2.07 Term + 99942 101128 1187 1 2 95 41 1450 0.014 132.92 2.08 PlyA + 103269 103274 6 1.05 3.03 PlyA - 104948 104943 6 1.05 3.02 Term - 111535 111301 235 0 1 90 54 65 0.171 -0.91 3.01 Init - 114137 114019 119 2 2 68 82 102 0.768 7.27 3.00 Prom - 116024 115985 40 -4.16 4.00 Prom + 116471 116510 40 -5.46 4.01 Init + 119804 119897 94 1 1 59 50 71 0.312 0.94 4.02 Intr + 136610 136657 48 0 0 121 63 34 0.080 2.85 4.03 Intr + 142371 142447 77 1 2 56 117 72 0.653 6.13 4.04 Intr + 144587 144710 124 1 1 95 60 53 0.307 3.36 4.05 Term + 145734 145777 44 1 2 96 49 42 0.301 -1.58 4.06 PlyA + 145916 145921 6 -1.95 5.08 PlyA - 145937 145932 6 1.05 5.07 Term - 146718 146314 405 0 0 39 47 332 0.105 19.39 5.06 Intr - 169800 169693 108 0 0 58 106 25 0.437 1.78 5.05 Intr - 170559 170308 252 0 0 41 34 141 0.256 1.63 5.04 Intr - 171801 171642 160 2 1 5 34 143 0.302 0.59 5.03 Intr - 173524 173395 130 2 1 68 79 166 0.557 13.45 5.02 Intr - 175752 175690 63 1 0 97 87 85 0.966 7.99 5.01 Init - 177778 177706 73 1 1 72 53 114 0.632 5.89 5.00 Prom - 180624 180585 40 -6.86 6.04 PlyA - 181387 181382 6 1.05 6.03 Term - 189658 188183 1476 1 0 102 35 1396 0.979 126.32 6.02 Intr - 190340 190289 52 1 1 80 115 21 0.976 2.91 6.01 Intr - 196381 196281 101 1 2 137 78 44 0.965 7.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 100001 101128 1128 1 0 67 41 1459 0.828 135.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:47821988_48023112|GENSCAN_predicted_peptide_1|34_aa VRALQGQQAPCKGANVDSKACQDSGSSEHPVHSC >gi568815588f:47821988_48023112|GENSCAN_predicted_CDS_1|105_bp gtcagagcacttcagggacagcaggctccgtgcaaaggggcaaatgtggacagcaaagcc tgccaggacagcggctcctctgagcacccggtgcattcctgctaa >gi568815588f:47821988_48023112|GENSCAN_predicted_peptide_2|588_aa MVEEFHYRETSGTQSVVVGAGRQNQAVSGIYSFWWVLGLGDFKNEAVDLTVSVTALKGGR LEFAPSESCHITLNGPENTRTPRRMKEGICPGDSLHCLQMTGQLFCSMSLTWVEADASSG LDSLACTFSRKITEAVLRFPCILSCDAVCGLVLGSGGGDKSCDLDIGEWGDKAAQADNQL GCYICLCSRAWEGGVIPQVYHLVQESWNLFTSTMNTSHLLALLLPKSPQGENRSKPLGTP YNFSEHCQDSVDVMVFIVTSYSIETVVGVLGNLCLMCVTVRQKEKANVTNLLIANLAFSD FLMCLLCQPLTAVYTIMDYWIFGETLCKMSAFIQCMSVTVSILSLVLVALERHQLIINPT GWKPSISQAYLGIVLIWVIACVLSLPFLANSILENVFHKNHSKALEFLADKVVCTESWPL AHHRTIYTTFLLLFQYCLPLGFILVCYARIYRRLQRQGRVFHKGTYSLRAGHMKQVNVVL VVMVVAFAVLWLPLHVFNSLEDWHHEAIPICHGNLIFLVCHLLAMASTCVNPFIYGFLNT NFKKEIKALVLTCQQSAPLEESEHLPLSTVHTEVSKGSLRLSGRSNPI >gi568815588f:47821988_48023112|GENSCAN_predicted_CDS_2|1767_bp atggtggaggaattccactacagggagacatcaggaacacaaagtgtggttgttggagct ggaagacaaaatcaagctgtgtcaggaatttattccttctggtgggttctcggtcttggt gacttcaagaatgaagctgtggacctcacagtgagtgttacagctcttaaaggtgggcgt ctggaatttgctccttcagaatcctgtcacattactcttaatggtccagaaaatacaagg actcctcgaaggatgaaagaagggatttgccctggagacagccttcactgcttgcagatg acaggccagttattctgtagtatgtccctcacctgggttgaggctgatgcttcttcaggg ttggattcacttgcatgcaccttcagcaggaagatcacagaagctgtgctacgttttcct tgcatcctgtcatgcgatgctgtgtgtggacttgtgctgggatctggaggcggagacaag agctgtgacctagacataggggagtggggagacaaagctgctcaggcagacaaccaacta ggctgctacatctgcctctgcagcagagcatgggagggtggcgtcatccctcaagtgtat cacttagttcaagagtcctggaatcttttcacatccactatgaacacctctcacctcctg gccttgctgctcccaaaatctccacaaggtgaaaacagaagcaaacccctgggcacccca tacaacttctctgaacattgccaggattccgtggacgtgatggtcttcattgtcacttcc tacagcattgagactgtcgtgggggtcctgggtaacctctgcctgatgtgtgtaactgtg aggcagaaggagaaagccaacgtgaccaacctgcttatcgccaacctggccttctctgac ttcctcatgtgcctcctctgccagccgctgaccgccgtctacaccatcatggactactgg atctttggagagaccctctgcaagatgtcggccttcatccagtgcatgtcggtgacggtc tccatcctctcgctcgtcctcgtggccctggagaggcatcagctcatcatcaacccaaca ggctggaagcccagcatctcacaggcctacctggggattgtgctcatctgggtcattgcc tgtgtcctctccctgcccttcctggccaacagcatcctggagaatgtcttccacaagaac cactccaaggctctggagttcctggcggataaggtggtctgtaccgagtcctggccactg gctcaccaccgcaccatctacaccaccttcctgctcctcttccagtactgcctcccactg ggcttcatcttggtctgttatgcacgcatctaccggcgcctgcagaggcaggggcgcgtg tttcacaagggcacctacagcttgcgagctgggcacatgaagcaggtcaatgtggtgctg gtggtgatggtggtggcctttgccgtgctctggctgcctctgcatgtgttcaacagcctg gaagactggcaccatgaggccatccccatctgccatgggaacctcatcttcttagtgtgc cacttgcttgccatggcctccacctgtgtcaacccattcatctatggctttctcaacacc aacttcaagaaggagatcaaggccctggtgctgacttgccagcagagcgcccccctggag gagtcagagcatctgcccctgtccacagtacatacggaagtctccaaagggtccctgagg ctaagtggcaggtccaatcccatttaa >gi568815588f:47821988_48023112|GENSCAN_predicted_peptide_3|117_aa MKDNIQYYSNYMAFRKRQNYGDGKKIGGCQGLTGKEVINRGCEVYQNLEEILWEERPSQT ASPQCCKNQISSMGDFQVSSAFSQSSLYPCHSVNHSNNSQLPAEKLDPEAGLDSLQK >gi568815588f:47821988_48023112|GENSCAN_predicted_CDS_3|354_bp atgaaagacaacatacagtattattccaactacatggctttccggaaaaggcaaaactac ggggatggtaaaaagatcggtggttgccagggtttaacaggaaaggaggtgataaatagg ggttgtgaggtgtatcagaaccttgaagagatactctgggaagaaaggccttcccagact gcctctccgcagtgctgtaagaatcagatttcttctatgggggacttccaggtctcttca gcattttctcagtcatcactgtatccctgccactcagtgaaccacagcaacaactcccag cttccagccgagaagcttgaccctgaagctggcctcgattctctacagaagtga >gi568815588f:47821988_48023112|GENSCAN_predicted_peptide_4|128_aa MGQAHGNWKQHYSEGKGMWRSSELTGESIFDCLWSDVESMWPPVYAGAFGGRHCRYPTEE ESDTVGGEGHSYPTHGCRPDWQGQEAGAKWDMSSVLAKHTLDADEPYAWETEKWGHNSNY QFNYNKGF >gi568815588f:47821988_48023112|GENSCAN_predicted_CDS_4|387_bp atgggccaggcccatggcaactggaagcagcactactcagaaggaaaaggcatgtggaga agctcagagctgacaggggagtctatatttgattgcctgtggtctgatgttgagtccatg tggcccccagtctatgccggagccttcggaggcaggcactgccgctatcccaccgaggag gagtctgacactgtgggcggtgaaggccacagttacccgacccacggctgtaggccagac tggcaggggcaagaggcaggagccaagtgggacatgtccagtgtcctggccaaacataca ctggacgcagatgagccatatgcctgggaaacagagaaatggggtcacaacagtaactat cagttcaactacaacaaaggtttctga >gi568815588f:47821988_48023112|GENSCAN_predicted_peptide_5|396_aa MLGGLGKLAAEGLAHRTEKATEGAIHAVEEVVKEVVGHAKETGEKAIAEAIKKAQESGDK KMKEITETVTNTVTNAITHAAESLDKLGHCFTNQTSSSDASEWSRGVVVAGQSQAGARVS LGGDGAEAITGLTVDQYGMLYKAPGTPQSNMKPVHERNQECLPPKKRDLPMTSCSTNHTS SSDASEWSRGVVVAGQSQAGARVSLGGDGAEAITGLTVDQHGMLYKLQEPDMWSPSRGQP VSLHLREKGAPEVKEMAWWKAWKYHTVNDHNCEVRKALSKQEMASASSSQRGRSGSGNFG GGRGGGFSGNDNFGIGGNFSGLGGFGGSRGGGGYGGSGDGCNGFGNDGSNFGGGGSYNDF GNYNNESSNFGPMKGGNFGGRSSGPCGNGGLYFAKP >gi568815588f:47821988_48023112|GENSCAN_predicted_CDS_5|1191_bp atgctgggaggcctggggaagctggctgccgagggcctggcccaccgcaccgagaaggcc accgagggagccattcatgccgtggaggaagtggtgaaggaggtggtgggacatgccaag gagactggagagaaagccattgctgaagccataaagaaagcccaggagtcaggggacaaa aagatgaaggaaatcaccgagacagtgaccaacacagtcacaaatgccatcacccatgca gcagaaagtctggacaaacttggacactgcttcactaaccagacatcctccagtgatgcc tctgaatggtcccgaggggttgtggtggccgggcagagccaggcaggagccagagtcagc ctggggggtgatggagctgaggccatcaccggtctgacagtggaccagtatggcatgctg tataaggctccaggaacaccacaaagcaatatgaaacctgttcatgagaggaaccaggaa tgccttccaccaaagaaacgagacctccccatgaccagctgctccactaaccacacatcc tccagtgatgcctctgaatggtcccgaggggttgtggtggccgggcagagccaggcagga gccagagtcagcctggggggtgatggagctgaggccatcaccggtctgacagtggaccag catggcatgctgtataagctgcaggagccagacatgtggagtcccagcagaggccaacct gtgtctcttcatctccgtgagaaaggtgcccccgaagtgaaagagatggcctggtggaaa gcctggaaataccatactgtgaatgaccacaactgtgaagttaggaaagccctgtcaaag caagagatggctagtgcttcatccagccaaagaggtcgaagtggttctggaaactttggt ggtggtcgtggaggtggtttcagtgggaatgacaactttggtattggaggaaacttcagt ggtcttggtggctttggtggcagtcgtggtggtggtggatatggtggcagtggggatggc tgtaatggatttggtaatgatggaagcaattttggaggtggtggaagctacaatgatttt ggcaattacaacaatgagtcttcaaattttggacccatgaagggaggaaattttggaggc agaagctctggcccctgtggcaatggaggcctatactttgcaaaaccatga >gi568815588f:47821988_48023112|GENSCAN_predicted_peptide_6|542_aa VSTERFSQQYSSCSTIFLDDSTAIQHYLTMTIISDADRSLSIPDEQLHSFAVSTVHIMKK RNGGGSLNNYSSSIPPTPSTSQEDPQFSVPPTANTPTPICKRSMRWSNLFTSEKGSDPDK ERKAPENHADTIGSGRAIPIKQGMLLKRSGKWLKTWKKKYVTLCSNGVLTYYSSLGDYMK NIHKKEIDLRTSTIKVPGKWPSLATSARAPISSSKSNGLSKDMDTGLGDSICFSPSISST TSPKLNPPPSPHANKKKHLKKKSTNNFMIVSATGQTWHFEATTYEERDAWVQAIQSQILA SLQSCESSKSKSQLTSQSEAMALRSIQNMRGNAHCVDCETQNPKWASLNLGVLMCIECSG IHRSLGTRLSRVRSLELDDWPVELRKVMSSIGNELANSIWEGSSQGQTKPSIKSTREEKE RWIRSKYEEKLFLAPLPCTELSLGQQLLRATADEDLQTAILLLAHGSREEVNETCGEGDG CTALHLACRKGNVVLEQLLIWYGVDVMARDAHGNTALTYARQASSQECINVLLQYGCPDE CV >gi568815588f:47821988_48023112|GENSCAN_predicted_CDS_6|1629_bp gtatctactgagcgtttcagtcaacaatacagctcgtgttcgacaatattccttgatgac agcacagccatccagcattatcttacaatgacaataatatcagatgcagatagatctttg agcatacctgatgaacagttacactcatttgcggtttccaccgtgcacattatgaagaaa agaaatggaggtgggagtttaaataactattcctcctccattccaccgactcccagcacc agccaggaggaccctcagttcagtgttcctcccactgccaacacacccacccccatttgc aagcggtccatgcgctggtccaacctgtttacatctgagaaagggagtgacccagacaaa gagaggaaagccccggagaatcatgctgacaccatcgggagcggcagagccatccccatt aaacagggcatgctcttaaagcgaagtgggaaatggctgaagacatggaaaaagaaatac gtcaccctgtgttccaatggcgtgctcacctattattcaagcttaggtgattatatgaag aatattcataaaaaagagattgaccttcggacatctaccatcaaagtcccaggaaagtgg ccatccctagccacatcagcccgtgcacccatctccagctctaaaagcaatggcctatcc aaggacatggacaccgggctgggtgactccatatgcttcagccccagtatctccagcacc accagccccaagctcaacccacccccctctcctcatgccaataaaaagaaacacctaaag aagaaaagcaccaacaactttatgattgtgtctgccactggccaaacgtggcactttgaa gccacgacgtatgaggagcgggatgcctgggtccaagccatccagagccagatcctggcc agcctgcagtcatgcgagagcagtaaaagcaagtcccagctgaccagccagagtgaggcc atggccctgcggtcgatccaaaacatgcgtgggaacgcccactgtgtggactgtgagacc cagaatcctaagtgggccagtttgaacttgggagtcctcatgtgtattgaatgctcaggt atccaccgcagtcttggcacccgcctttcccgtgtgcgatctctggagctggatgactgg ccagttgagctcaggaaggttatgtcgtctattggcaatgagctagccaacagcatctgg gaagggagcagccaggggcagacaaaaccctcaataaagtccacgagggaagagaaggaa cggtggatccgttccaaatatgaggagaagctctttctggccccactaccctgcactgag ctgtctctgggccagcagctgctgcgggccaccgctgatgaggacctgcagacagccatc ctgctgctggcacatggctcccgtgaggaggtgaacgagacctgtggggagggagacggc tgcacggcactccatctggcctgccgcaaggggaatgtggtcctggagcagctcctgatc tggtacggggtggacgtcatggcccgagatgcccacgggaacacagcgctgacctacgcc cggcaggcctccagccaggagtgcatcaacgtgcttctgcagtacggctgccccgacgag tgcgtgtag