GENSCAN 1.0    Date run: 6-Nov-116    Time: 18:38:08

Sequence gi568815597f:236042623_236308155 : 265533 bp : 43.35% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   6367   6068  300  1  0   83  121   181 0.969  17.83
 1.01 Init -  22457  22233  225  2  0   76   83   545 0.003  49.27
 1.00 Prom -  28435  28396   40                              -1.06

 2.00 Prom +  35508  35547   40                              -5.66
 2.01 Init +  37252  37347   96  0  0   58  107    64 0.823   5.61
 2.02 Intr +  45894  45992   99  2  0  109   73    -5 0.079   0.41
 2.03 Intr +  58320  58428  109  2  1   93   81    56 0.656   5.26
 2.04 Intr +  58941  58985   45  1  0   50   92    73 0.474   2.28
 2.05 Intr +  62192  62293  102  0  0   51   94    45 0.276   1.55
 2.06 Intr +  67431  67746  316  1  1   35   -3   392 0.074  19.92
 2.07 Intr +  68277  68557  281  0  2    9   42   288 0.244  13.42
 2.08 Term +  68603  69750 1148  0  2   15   55  1930 0.852 174.49
 2.09 PlyA +  70127  70132    6                              -0.45

 3.00 Prom +  72573  72612   40                              -4.56
 3.01 Init +  78496  78588   93  0  0   77   21    81 0.190   0.68
 3.02 Intr +  84458  84754  297  1  0   50   52   113 0.351   0.97
 3.03 Term +  86852  87172  321  1  0  134   44   125 0.388   7.52
 3.04 PlyA +  87751  87756    6                               1.05

 4.00 Prom +  96729  96768   40                              -5.56
 4.01 Init + 100001 100414  414  1  0   73  109   883 0.916  83.37
 4.02 Intr + 100618 100760  143  0  2   13   75   110 0.539   1.35
 4.03 Term + 100942 101050  109  1  1   15   49   116 0.690  -1.62
 4.04 PlyA + 101499 101504    6                              -0.45

 5.03 PlyA - 103071 103066    6                               1.05
 5.02 Term - 107893 107741  153  1  0   61   49   144 0.111   5.72
 5.01 Init - 113623 113549   75  1  0   76   44   149 0.374   8.39
 5.00 Prom - 114975 114936   40                              -6.06

 6.03 PlyA - 115502 115497    6                               1.05
 6.02 Term - 117257 117145  113  0  2   82   45    78 0.035   1.62
 6.01 Init - 130949 130877   73  2  1   77   55   113 0.413   8.13
 6.00 Prom - 131973 131934   40                              -2.46

 7.00 Prom + 132929 132968   40                              -4.16
 7.01 Init + 135781 136014  234  0  0   48  100   186 0.541  11.96
 7.02 Intr + 137257 137406  150  0  0  117   82   214 0.967  24.06
 7.03 Intr + 141156 141284  129  2  0  125   89   -34 0.163   1.29
 7.04 Intr + 162504 162628  125  2  2   86   90    85 0.800   7.88
 7.05 Term + 164188 164200   13  1  1  113   53     3 0.488  -3.03
 7.06 PlyA + 169093 169098    6                               1.05

 8.08 PlyA - 172012 172007    6                               1.05
 8.07 Term - 175106 175061   46  1  1  112   43    31 0.589  -2.32
 8.06 Intr - 178343 178210  134  2  2   66  116    44 0.690   4.44
 8.05 Intr - 182517 182448   70  2  1   77   93    27 0.891   1.28
 8.04 Intr - 183893 183647  247  0  1  108   69   141 0.957  10.72
 8.03 Intr - 187628 187602   27  0  0  127   92    -4 0.759   1.99
 8.02 Intr - 238489 238459   31  1  1   74  111     1 0.122  -1.30
 8.01 Init - 239161 239060  102  1  0   84  115   226 0.961  23.14
 8.00 Prom - 249678 249639   40                              -3.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  21625  21519  107  2  2   67   45   100 0.947   2.17
S.002 Intr + 126084 126133   50  1  2  125  131   -36 0.873   2.80

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:236042623_236308155|GENSCAN_predicted_peptide_1|175_aa
MLASSSRIRAAWTRALLLPLLLAGPVGCLSRQELFPFGPGQGDLELEDGDDFVSPALELS
GALRFYDRSDIDAVYVTTNGIIATSEPPAKESHPGLFPPTFGAVAPFLADLDTTDGLGKV
YYREDLSPSITQRAAECVHRGFPEISFQPSSAVVVTWESVAPYQGPSRDPDQKGK

>gi568815597f:236042623_236308155|GENSCAN_predicted_CDS_1|525_bp
atgttggcctcgagcagccggatccgggctgcgtggacgcgggcgctgctgctgccgctg
ctgctggcggggcctgtgggctgcctgagccgccaggagctctttcccttcggccccgga
cagggggacctggagctggaggacggggatgacttcgtctctcctgccctggagctgagt
ggggcgctccgcttctacgacagatccgacatcgacgcagtctacgtcaccacaaatggc
atcattgctacgagtgaacccccggccaaagaatcccatcccgggctcttcccaccaaca
ttcggtgcagtcgcccctttcctggcggacttggacacgaccgatggcctggggaaggtt
tattatcgagaagacttatccccctccatcactcagcgagcagcagagtgtgtccacaga
gggttcccggagatctctttccagcctagtagcgcggtggttgtcacttgggaatccgtg
gccccctaccaagggcccagcagggacccagaccagaaaggcaag

>gi568815597f:236042623_236308155|GENSCAN_predicted_peptide_2|731_aa
MYETVLPFLLGSKVVNVTNEVFEYSTPNLVFLASLQPRPWCTARNPSAPGAPLPIVTTEA
RVWNAKPGKAATLAQASDTARPRASHSPGPGKAEIKSRRWQGVFVGVVCVKVAVYVGCDR
DMNGEEGKGNGKGGRIVSKYFPRSRGRGLAVHKMAPYSPLVTRLQKALGVRQYHVASVLC
QRAKMAMSHFEPNEYIHYDLLEKNINIVCKRLNWPLTLLEKIVYGHLDDPASQEIEQGKT
PVAVAGPRGHAAVHQQNWGHHFRVPLQPQDEEVPKQDGRTDIANLADEFQDHLVPDPGCH
YDQLIEINLSELKPHINGPFTPDQAHPVAEVGKVAEKEGWPLDIRVGLTAVAKQALAHDF
KCKSHFTITPGSEQIHATIERDSYAQILRNVGGIVLANACGPCIGRWDTKDIKKGKKYTI
VTSYNRNFTGCNDANPETHVFVTSPEIVTALAIVGILKFNPETNYLMGKDRKKFKLEALD
ADDLPQEEFDPGQDPYQHPPQDSSGQHVDVSPTSQRLQLLEAFDKWDSKDLEDPQILIKV
KGKCTTDHISAAGPWLKFCGHLDNISNNLLTGAINIENSQANSVCSANIENTQEFGPVPD
TARYYKKHGIRWVVIGDENSSREHAVLEPPHLWGQAIITKSFARIHKTNPKKQGLLPLTF
ADPGDYNKVHPVDKLTIQGLKDFAPDKPLKCIIKHPNGTQETILLNHTFNEMRIKWFCAG
SALNRMKELQQ

>gi568815597f:236042623_236308155|GENSCAN_predicted_CDS_2|2196_bp
atgtatgaaactgtgctgccatttttattaggttccaaggttgttaacgtcactaatgaa
gttttcgagtattctactcctaacctggttttcctggcgtctttgcaacctagaccctgg
tgcacagcccggaatcccagtgctcctggtgcgccattgccaatagtaaccacagaagct
agagtgtggaacgcaaaacctggcaaagcagccacattggcccaagccagcgacaccgcg
cggccccgcgcttcccacagccctgggcccggcaaggctgaaatcaagtctagaagatgg
caaggtgtgtttgtgggtgttgtatgtgtgaaggtggccgtgtatgtgggctgtgaccgg
gacatgaatggagaggaaggaaagggaaatggtaaagggggcaggattgtctccaagtac
ttccccaggtccaggggaagagggctggcagtgcacaaaatggctccctacagcccactg
gtgacccggctgcagaaagctctgggtgtgcggcaataccatgtggcctcagtcctgtgc
caacgggccaagatggcgatgagccactttgagcccaatgagtacatccactatgacctg
ctagagaagaacattaacattgtttgcaaacgactgaactggcctctgaccctcttggag
aagatcgtgtatggacacctggatgacccggccagccaggaaatcgagcagggcaagaca
cctgtggctgtggccggaccacgtggccatgctgcagttcatcagcaaaattggggccac
cacttccgtgttcccttacaaccacaggatgaggaggtacctaagcaagacggccggaca
gacattgccaatctagctgatgaattccaggatcacttggtgcctgaccctggctgccat
tatgaccaactaattgaaattaacctcagtgagctgaagccgcacatcaatgggcccttc
acccccgaccaggctcaccctgtggcagaagtgggcaaggtggcagagaaggaaggatgg
cctctggacatccgagtaggtctgactgctgtggccaagcaggcactggcccatgacttc
aagtgcaagtcccacttcaccatcactccaggctccgagcagatccacgccaccattgag
cgggacagctacgcacagattttgaggaatgtgggtggcatcgtcctggccaatgcttgt
ggcccctgcattggccgctgggacacgaaggacatcaagaaggggaagaagtacacaatc
gtcacctcctacaacaggaacttcacgggctgcaatgatgcgaaccctgagacccatgtc
ttcgtcacgtccccagagattgtcacagccctggccattgtgggaatcctcaagttcaac
ccagagaccaactacctgatgggcaaagataggaagaagttcaagctagaggctctggat
gcagacgaccttccccaagaggagtttgacccagggcaggacccctaccagcacccccca
caggacagcagtggccagcacgtggatgtgagccccactagccagcgcctgcagctcctg
gaggcttttgacaagtgggatagcaaggacctggaggacccgcagatactcatcaaggtc
aaagggaagtgtactactgaccacatctcggctgctggcccctggctcaagttttgtggg
cacttggacaacatctccaacaacctgctcactggtgccatcaacattgaaaacagccag
gccaactccgtgtgcagtgccaacattgaaaacacccaggagtttggccccgtccctgac
actgcccgctactacaagaaacatggcatcaggtgggtggtgatcggggacgagaactcg
agccgggagcacgcagtgctggagcctccccacctctggggccaggccatcatcaccaag
agctttgccaggatccacaagaccaacccgaagaagcagggcctgctgcccctgactttt
gctgacccaggcgactacaacaaggttcaccctgtggacaagctgactattcagggcctg
aaggacttcgcccctgacaagcccctgaagtgcatcatcaagcaccccaatgggacccag
gagaccatcctcctgaaccacaccttcaatgagatgcggatcaagtggttctgtgccggc
agtgccctcaacagaatgaaggagctgcagcagtga

>gi568815597f:236042623_236308155|GENSCAN_predicted_peptide_3|236_aa
MVTKQEAELVVAADTGNSPSFAPREMCHPYILAVVESRRIQTKIQPSQQVRVKMSLLEQH
LLEQNRTGCSLALGLAALAAGGPESRAWLHIEGHQPERAAGGTAVRKVISSRGHYYGRGI
GCCHPRLAEVCPDEKTQDGSRPKKAFACAVGTEVFLPSLSTKQSLCSAEQRKNANGALGV
AKFLGPQVLACILYGVAFVFPFCRLCLKYEDYLKLILCQDTDTTINWVMSVNSKTI

>gi568815597f:236042623_236308155|GENSCAN_predicted_CDS_3|711_bp
atggttactaagcaggaggctgagctggtggttgcagctgacactggaaattcacccagc
ttcgccccccgggagatgtgccacccctacatccttgctgttgtggagtcaagaaggatc
cagaccaagatccaaccttcacagcaggtcagagttaaaatgtcattactcgagcagcac
ttactggagcagaacagaactggctgctcactggcccttggccttgctgcccttgctgcg
gggggaccagagtccagagcctggctgcatattgaaggacaccagccagaaagggcagca
gggggaactgcagttcgtaaggtcatcagctccaggggccactactatggccgtggaatt
ggctgttgtcatcccaggttggcggaagtgtgtccagatgagaagacacaggatggctct
cgtcccaagaaggccttcgcttgtgctgtgggcacagaggtctttctcccctctctgagc
acaaagcagtcactgtgctctgccgagcagaggaaaaatgcaaatggtgctctgggtgtg
gccaagttccttgggccccaagtgctggcctgcattctttatggggtagcctttgttttc
cctttctgcaggctgtgcctgaaatatgaagattacttaaaactcatactatgccaggac
acggacaccacaataaactgggtaatgagtgtcaactcaaagactatttaa

>gi568815597f:236042623_236308155|GENSCAN_predicted_peptide_4|221_aa
MRPERPRPRGSAPGPMETPPWDPARNDSLPPTLTPAVPPYVKLGLTVVYTVFYALLFVFI
YVQLWLVLRYRHKRLSYQSVFLFLCLFWASLRTVLFSFYFKDFVAANSLSPFVFWLLYCF
PVCLQFFTLTLMNLYFTQGQGEGRGGLEAAAHSLPLAPHRAVDSGRFSVRSAKQQRLHTP
VSTDGRSSTELRLTNSFHRIAKQQAGVTPSSELLPLIRVTV

>gi568815597f:236042623_236308155|GENSCAN_predicted_CDS_4|666_bp
atgaggcccgagcgtccccggccgcgcggcagcgcccccggcccgatggagaccccgccg
tgggacccagcccgcaacgactcgctgccgcccacgctgaccccggccgtgcccccctac
gtgaagcttggcctcaccgtcgtctacaccgtgttctacgcgctgctcttcgtgttcatc
tacgtgcagctctggctggtgctgcgttaccgccacaagcggctcagctaccagagcgtc
ttcctctttctctgcctcttctgggcctccctgcggaccgtcctcttctccttctacttc
aaagacttcgtggcggccaattcgctcagccccttcgtcttctggctgctctactgcttc
cctgtgtgcctgcagtttttcaccctcacgctgatgaacttgtacttcacgcaggggcaa
ggagaaggcaggggagggctggaggccgccgctcactcgctgcccctggctccgcatcgt
gcggtggattcggggcgcttctccgtgcgcagcgcgaagcagcagcgcctgcacacgcca
gttagtacggatggaaggagctctacagaactgcggcttactaactcattccaccggatt
gctaaacagcaggctggggtaacacccagctccgagctgctgccgctcatcagagttact
gtctga

>gi568815597f:236042623_236308155|GENSCAN_predicted_peptide_5|75_aa
MWPKKLVLPAPGLVLALGMWAPQTSGITEALMRSQPQLQREEHAAMARGEGSGNWAVCLE
SGLREVLGVNGRPRK

>gi568815597f:236042623_236308155|GENSCAN_predicted_CDS_5|228_bp
atgtggcccaagaagctggtcctgcctgctcctggcctcgtgctggcccttggaatgtgg
gccccccagacttcaggcatcaccgaggcactgatgcggtcacagcctcagctgcagaga
gaagagcacgctgccatggctcgaggagagggctcgggaaactgggctgtatgtttagaa
agtggcctgcgggaggtgctgggggtgaacggcagaccaagaaaataa

>gi568815597f:236042623_236308155|GENSCAN_predicted_peptide_6|61_aa
MRRVFKAANTTVPDTSYCERTEARVGQGLLIPDLWATCAPLVTQAFTPSFRMTQEGLELC
M

>gi568815597f:236042623_236308155|GENSCAN_predicted_CDS_6|186_bp
atgaggcgagtcttcaaggcagccaacacaacagtgcccgacacctcctactgtgagagg
acagaagcaagggtgggccaaggcctcctcatcccggacctgtgggcgacttgtgccccc
ttggtaacccaggccttcaccccatcctttcggatgacacaggaaggcctagagctctgc
atgtga

>gi568815597f:236042623_236308155|GENSCAN_predicted_peptide_7|216_aa
MPSRLPLYLASLFISLVFLLVNLTCAVLVKTGNWERKVIVSVRVAINDTLFVLCAVSLSI
CLYKISKMSLANIYLESKGSSVCQVTAIGVTVILLYTSRACYNLFILSFSQNKSVHSFDY
DWYNVSDQADLKNQLGDAGYVLFGVVLFVWELLPTTLVVYFFRVRNPTKDLTNPGMVPSH
GFSPRSYFFDNPRRYDSDDDLAWNIAPQGLQGGLWL

>gi568815597f:236042623_236308155|GENSCAN_predicted_CDS_7|651_bp
atgccttccaggttgcccctctacctggcctccctcttcatcagccttgttttcctgttg
gtgaatttaacctgtgctgtgctggtaaagacgggaaattgggagaggaaggttatcgtc
tctgtgcgagtggccattaatgacacgctcttcgtgctgtgtgccgtctctctctccatc
tgtctctacaaaatctctaagatgtccttagccaacatttacttggagtccaagggctcc
tccgtgtgtcaagtgactgccatcggtgtcaccgtgatactgctttacacctctcgggcc
tgctacaacctgttcatcctgtcattttctcagaacaagagcgtccattcctttgattat
gactggtacaatgtatcagaccaggcagatttgaagaatcagctgggagatgctggatac
gtattatttggagtggtgttatttgtttgggaactcttacctaccaccttagtcgtttat
ttcttccgagttagaaatcctacaaaggaccttaccaaccctggaatggtccccagccat
ggattcagtcccagatcttatttctttgacaaccctcgaagatatgacagtgatgatgac
cttgcctggaacattgcccctcagggacttcagggaggcctgtggctctga

>gi568815597f:236042623_236308155|GENSCAN_predicted_peptide_8|218_aa
MSQGVRRAGAGQGVAAAVQLLVTLSFLRSVVEAQLSCVCQVASPGESFYTWLEETWGKPS
WGPNIKEFKHRFDPVETKGEGPRRLKNLYFLYLIELRALSKVAPYFERSIVDLYTGNAEE
DADTKTLLLNIFQDTKSFPMHFDEKSMFAGDKKGAKSLKTQGLGTALKILFSEKEIQKLP
ENSPSKGFQLTRQEIVALLNAFGRNFRDKDILEIVSYF

>gi568815597f:236042623_236308155|GENSCAN_predicted_CDS_8|657_bp
atgagccaaggggtccgccgggcaggcgctgggcagggggtagcggccgcggtgcagctg
ctggtcaccctgagcttcctgcggagcgtcgtcgaggcgcagctgagctgtgtctgccaa
gtggcatcgccaggagaatcattctacacatggctagaagaaacctggggtaagcccagt
tggggacctaatattaaagaattcaaacaccgctttgaccctgtggaaaccaagggagaa
ggtccaagaaggctcaagaatctttactttttatacttgattgagcttcgagctttgtca
aaggtggctccatattttgagcgctcaattgtcgatctttacactggaaatgcagaagaa
gatgctgacacaaaaactcttctactgaatatctttcaagatacaaagtcctttcccatg
cactttgatgagaaatccatgtttgcaggtgacaaaaaaggggccaagtcactaaagact
cagggtttaggaactgccctgaagatattattctctgaaaaagaaatccaaaagcttcca
gagaatagtccatctaaaggcttccaactcacccgacaggaaatagttgctcttttaaat
gcttttggaagaaattttagggacaaagacattttggaaattgtcagttacttttag