GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:22:33 Sequence gi568815597f:7862786_8085051 : 222266 bp : 44.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3322 3370 49 0 1 87 80 68 0.768 5.31 1.02 Intr + 4017 4070 54 1 0 93 102 2 0.515 1.05 1.03 Term + 13351 13547 197 2 2 60 46 102 0.193 0.77 1.04 PlyA + 14666 14671 6 1.05 2.00 Prom + 63067 63106 40 -0.16 2.01 Sngl + 91133 91471 339 1 0 59 41 152 0.547 3.63 2.02 PlyA + 92011 92016 6 1.05 3.00 Prom + 98171 98210 40 -3.06 3.01 Init + 100001 100090 90 1 0 72 98 117 0.975 11.62 3.02 Intr + 102539 102640 102 1 0 74 68 74 0.925 4.37 3.03 Intr + 108109 108178 70 0 1 61 66 107 0.932 4.55 3.04 Intr + 114867 114953 87 1 0 66 107 59 0.900 5.54 3.05 Intr + 122109 122239 131 1 2 120 11 161 0.066 12.01 3.06 Intr + 123172 123243 72 0 0 17 99 69 0.052 0.50 3.07 Intr + 127283 127936 654 1 0 -77 19 1633 0.819 131.97 3.08 Term + 140598 140720 123 2 0 75 50 70 0.294 0.28 3.09 PlyA + 143924 143929 6 1.05 4.04 PlyA - 144552 144547 6 1.05 4.03 Term - 151611 150425 1187 1 2 95 41 496 0.844 37.52 4.02 Intr - 152599 152523 77 0 2 88 76 13 0.609 -0.74 4.01 Init - 152759 152710 50 2 2 95 106 21 0.536 5.02 4.00 Prom - 154908 154869 40 -8.26 5.00 Prom + 155219 155258 40 -0.16 5.01 Init + 164340 164376 37 2 1 84 89 37 0.323 3.67 5.02 Term + 164637 164758 122 1 2 86 44 95 0.328 3.54 5.03 PlyA + 164788 164793 6 1.05 6.02 PlyA - 165912 165907 6 1.05 6.01 Sngl - 194727 194455 273 0 0 65 36 250 0.944 12.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 122109 122269 161 1 2 120 43 183 0.892 15.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:7862786_8085051|GENSCAN_predicted_peptide_1|99_aa MVDHADHAHALGPLPVGLTMVPGLERALLLMIKLPPTQTSPGLEDESAQPSGAHEHCLGP GDQPAYLLPPACAYTTGDIEDRNTLPTTTSTHTHQGPRN >gi568815597f:7862786_8085051|GENSCAN_predicted_CDS_1|300_bp atggtcgaccacgctgaccatgcccacgctctgggccccttgcctgtggggctgaccatg gtaccgggcctggaaagggctctattactgatgatcaaattaccacccacacaaacatcc ccagggctggaggatgagtctgcccagccttctggtgcccatgaacattgtctgggcccg ggggatcaacctgcctacctgctgccaccagcatgtgcttacaccacgggggatattgag gataggaacaccttgccaactaccaccagcacccacacacatcaggggccgaggaattga >gi568815597f:7862786_8085051|GENSCAN_predicted_peptide_2|112_aa MGTGKSSSFFAEPAGCGHQNESGPSRCQGLDLPPCIRQAAHTPRKSARSLRQSDSRAQRP TRDALKEVTFRGCYSQTRPSSTRRIYVGSDPCKHRRDRASGWEHATVPPTAT >gi568815597f:7862786_8085051|GENSCAN_predicted_CDS_2|339_bp atgggaacaggcaaatcgtcgtcgttctttgctgagcctgcgggctgcggtcaccagaac gaatctgggcccagcaggtgccaagggctcgacctcccgccttgcatccggcaggccgca cacacccctcggaagtcggcgagatcgctgaggcaaagcgactccagggcgcagagaccc acacgcgacgctctaaaggaagtgacctttaggggctgttactctcagaccaggcccagc agcacccggcgcatttacgtcggatctgacccctgcaagcaccggcgcgaccgcgctagc ggctgggaacacgcgacggtcccgcccactgcgacctga >gi568815597f:7862786_8085051|GENSCAN_predicted_peptide_3|442_aa MASKRALVILAKGAEEMETVIPVDVMRRAGIKVTVAGLAGKDPVQCSRDVVICPDASLED AKKESAAVKEILKEQENRKGLIAAICAGPTALLAHEIGFGSKVTTHPLAKDKMMNGGHYT YSENRVEKDGLILTSRGPGTSFEFALAIVEALNGKEVAAQFLLGGHMIQTQPIRALFTTG RVTQNVNAGFTTITITTTTITITISIIINIPTVVTTTTITITIITITITIITITTTVITI TLTTLTITLTTITTLTTTTITTIITTSITLTTITITTTITFTTTTTITTSITLTITLTTT ITTLTTTITITTSITLTTTITITTTITFTTTTITTSITFTITLTTTITTTNTITLTTTII TITTTITITLTTTITISITTTTTTVTIIITTIIIFFTSCSLQEKDLPTSGFVTRKGIENI SSKSGYQILEGNLSDLESVMEK >gi568815597f:7862786_8085051|GENSCAN_predicted_CDS_3|1329_bp atggcttccaaaagagctctggtcatcctggctaaaggagcagaggaaatggagacggtc atccctgtagatgtcatgaggcgagctgggattaaggtcaccgttgcaggcctggctgga aaagacccagtacagtgtagccgtgatgtggtcatttgtcctgatgccagccttgaagat gcaaaaaaagagtctgctgctgtgaaggagatactgaaggagcaggaaaaccggaagggc ctgatagccgccatctgtgcaggtcctactgctctgttggctcatgaaataggttttgga agtaaagttacaacacaccctcttgctaaagacaaaatgatgaatggaggtcattacacc tactctgagaatcgtgtggaaaaagacggcctgattcttacaagccgggggcctgggacc agcttcgagtttgcgcttgcaattgttgaagccctgaatggcaaggaggtggcggctcaa ttcctcctgggcgggcacatgatccagacccagccaatccgagccctgtttaccactggc cgtgtgacccagaatgtgaatgctggattcaccaccatcaccatcaccaccaccacaatc actatcaccatctccatcatcatcaacatccccactgttgtcaccaccaccaccatcact atcaccatcattaccatcaccatcaccatcatcaccatcactaccaccgtcatcaccatc accctcaccaccctcaccatcaccctcaccactatcaccacactcaccaccaccaccatc accaccatcattaccaccagcatcaccctcaccaccatcaccatcaccaccaccatcacc ttcaccaccaccaccaccatcaccaccagcatcaccctcaccatcaccctcaccaccacc atcaccacactcaccaccaccatcaccatcaccaccagcatcaccctcaccaccaccatc accatcaccaccaccatcaccttcaccaccaccaccatcaccaccagcatcaccttcacc atcaccctcaccaccaccatcaccactaccaacaccatcaccctcaccactaccatcatc accatcaccaccaccatcaccatcaccctcaccaccaccatcaccatcagcatcaccacc accaccaccaccgtcaccatcattatcaccaccatcatcatctttttcacttcttgcagc ttgcaggaaaaagacctgccaacatcaggatttgtaaccagaaaaggcattgaaaacata tcctcaaaatcaggctatcaaattttagaaggaaacctttcagacctagaatcagttatg gagaagtga >gi568815597f:7862786_8085051|GENSCAN_predicted_peptide_4|437_aa MGNMRKTYWSSRSEFKNNFLNIDPITMAYSLNSSAQERLIPLGHASKSAPMNGHCFAENG PSQKSSLPPLLIPPSENLGPHEEDQVVCGFKKLTVNGVCASTPPLTPIKNSPSLFPCAPL CERGSRPLPPLPISEALSLDDTDCEVEFLTSSDTDFLLEDSTLSDFKYDVPGRRSFRGCG QINYAYFDTPAVSAADLSYVSDQNGGVPDPNPPPPQTHRRLRRSHSGPAGSFNKPAIRIS NCCIHRASPNSDEDKPEVPPRVPIPPRPVKPDYRRWSAEVTSSTYSDEDRPPKVPPREPL SPSNSRTPSPKSLPSYLNGVMPPTQSFAPDPKYVSSKALQRQNSEGSASKVPCILPIIEN GKKVSSTHYYLLPERPPYLDKYEKFFREAEETNGGAQIQPLPADCGISSATEKPDSKTKM DLGGHVKRKHLSYVVSP >gi568815597f:7862786_8085051|GENSCAN_predicted_CDS_4|1314_bp atggggaatatgaggaagacctactggagcagtcgcagtgagtttaaaaacaacttttta aatattgacccgataaccatggcctacagtctgaactcttctgctcaggagcgcctaata ccacttgggcatgcttccaaatctgctccgatgaatggccactgctttgcagaaaatggt ccatctcaaaagtccagcttgccccctcttcttattcccccaagtgaaaacttgggacca catgaagaggatcaagttgtatgtggttttaagaaactcacagtgaatggggtttgtgct tccacccctccactgacacccataaaaaactccccttcccttttcccctgtgcccctctt tgtgaacggggttctaggcctcttccaccgttgccaatctctgaagccctctctctggat gacacagactgtgaggtggaattcctaactagctcagatacagacttccttttagaagac tctacactttctgatttcaaatatgatgttcctggcaggcgaagcttccgtgggtgtgga caaatcaactatgcatattttgataccccagctgtttctgcagcagatctcagctatgtg tctgaccaaaatggaggtgtcccagatccaaatcctcctccacctcagacccaccgaaga ttaagaaggtctcattcgggaccagctggctcctttaacaagccagccataaggatatcc aactgttgtatacacagagcttctcctaactccgatgaagacaaacctgaggttcccccc agagttcccatacctcctagaccagtaaagccagattatagaagatggtcagcagaagtt acttcgagcacctatagtgatgaagacaggcctcccaaagtaccgccaagagaacctttg tcaccgagtaactcgcgcacaccgagtcccaaaagccttccgtcttacctcaatggggtc atgcccccgacacagagctttgcccctgatcccaagtatgtcagcagcaaagcactgcaa agacagaacagcgaaggatctgccagtaaggttccttgcattctgcccattattgaaaat gggaagaaggttagttcaacacattattacctactacctgaacgaccaccatacctggac aaatatgaaaaattttttagggaagcagaagaaacaaatggaggcgcccaaatccagcca ttacctgctgactgcggtatatcttcagccacagaaaagccagactcaaaaacaaaaatg gatctgggtggccacgtgaagcgtaaacatttatcctatgtggtttctccttag >gi568815597f:7862786_8085051|GENSCAN_predicted_peptide_5|52_aa MKDVKGQCGTVDGLCLLRRPELAPTSFGFGEATDAQGQSPSGPVGGGFSGRQ >gi568815597f:7862786_8085051|GENSCAN_predicted_CDS_5|159_bp atgaaggatgtcaagggacagtgtgggacagtggacggtttgtgtttactcaggcgacct gagctggccccgacgtcttttggctttggagaggccacggatgcccaaggacaaagcccc agcgggcctgtgggaggaggtttctcggggcgccagtga >gi568815597f:7862786_8085051|GENSCAN_predicted_peptide_6|90_aa MGVPYCTIKGKARLGRLVYRKTCTTVAFAWVNSEDKGALAKLVEAIRTNYKDRHDEICCH WGGNVLASKSMARITNLKKAKAKELATKLD >gi568815597f:7862786_8085051|GENSCAN_predicted_CDS_6|273_bp atgggggtcccttactgcactatcaaggggaaggcaagactgggacgtctagtctacagg aagacctgcaccactgttgccttcgcatgggttaactcggaagacaaaggagctttggct aagctggtggaagctatcaggaccaattacaaagacagacacgacgagatctgctgtcac tggggaggcaatgtcctggcttccaagtctatggcccgcattaccaatctcaaaaaggca aaggctaaagaacttgccaccaagctggattaa