GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:23:44 Sequence gi568815587f:43782653_44020007 : 237355 bp : 43.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15668 15775 108 0 0 120 58 109 0.993 11.36 1.02 Intr + 32785 32849 65 2 2 93 101 89 0.737 9.14 1.03 Intr + 33987 34020 34 2 1 97 80 44 0.827 2.30 1.04 Intr + 36751 36806 56 2 2 59 113 19 0.390 0.20 1.05 Intr + 48324 48358 35 2 2 121 113 -3 0.216 2.52 1.06 Intr + 55665 55746 82 0 1 101 93 -5 0.852 0.94 1.07 Intr + 57347 57412 66 1 0 94 109 69 0.925 8.70 1.08 Intr + 72063 72216 154 2 1 54 44 158 0.900 7.65 1.09 Term + 72492 72682 191 1 2 111 43 122 0.993 7.61 1.10 PlyA + 73519 73524 6 1.05 2.06 PlyA - 75398 75393 6 1.05 2.05 Term - 83329 83240 90 1 0 67 42 61 0.055 -2.88 2.04 Intr - 93118 93031 88 0 1 80 70 102 0.154 7.57 2.03 Intr - 98201 98093 109 0 1 47 78 50 0.074 -0.76 2.02 Intr - 107820 107690 131 2 2 96 92 80 0.865 9.54 2.01 Init - 109421 109417 5 2 2 61 87 0 0.349 -3.33 2.00 Prom - 112728 112689 40 -4.56 3.00 Prom + 113648 113687 40 -2.76 3.01 Init + 114804 115203 400 2 1 86 15 257 0.840 15.03 3.02 Intr + 115349 116185 837 0 0 78 -4 621 0.682 43.26 3.03 Intr + 116227 116528 302 2 2 -1 75 218 0.512 8.05 3.04 Intr + 116546 116782 237 1 0 -7 -7 280 0.446 6.81 3.05 Intr + 118864 119073 210 0 0 56 81 204 0.740 15.61 3.06 Intr + 136386 136484 99 2 0 67 94 49 0.032 3.71 3.07 Intr + 142642 142820 179 0 2 12 59 123 0.007 0.52 3.08 Intr + 143781 143895 115 0 1 126 77 15 0.930 4.65 3.09 Intr + 150441 150494 54 2 0 73 115 15 0.726 1.88 3.10 Intr + 153427 153607 181 0 1 79 68 96 0.950 6.14 3.11 Intr + 159732 160617 886 1 1 31 67 945 0.262 77.08 3.12 Intr + 167993 168121 129 2 0 71 76 136 0.332 10.51 3.13 Term + 175631 175739 109 2 1 59 44 116 0.832 2.28 3.14 PlyA + 178637 178642 6 1.05 4.08 PlyA - 179231 179226 6 1.05 4.07 Term - 183406 183324 83 2 2 28 36 114 0.022 -1.94 4.06 Intr - 198258 198177 82 0 1 91 78 47 0.253 3.21 4.05 Intr - 201769 201741 29 2 2 95 98 4 0.221 -0.07 4.04 Intr - 205776 205629 148 0 1 111 86 13 0.289 3.21 4.03 Intr - 207210 207145 66 0 0 107 66 42 0.205 3.00 4.02 Intr - 220287 220107 181 2 1 92 107 12 0.504 3.37 4.01 Init - 221325 221270 56 0 2 51 72 87 0.522 2.42 4.00 Prom - 229614 229575 40 -2.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 142690 142820 131 0 2 88 59 122 0.936 8.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:43782653_44020007|GENSCAN_predicted_peptide_1|263_aa XEKFKVETRTIAVDFASEDIYDKIKTGLAGLEIGILVNNVGMSYEYPEYFLDVPDLDNVI GEQVVFGYMNFILHAGSEHRSAYSVVITMTQLVLPGMVERSKGAILNISSGSGMLPVPLL TIYSATKTFVDFFSQCLHEEYRSKGVFVQSVLPYFVATKLAKIRKPTLDKPSPETFVKSA IKTVGLQSRTNGYLIHALMVGLDNLKPAFLDLFENSHEYEQVYTGSLSEENQEELSIDNC IVTWPDAPAYARSLQSTLLVLKI >gi568815587f:43782653_44020007|GENSCAN_predicted_CDS_1|792_bp naagaaaaattcaaagtggagacaagaaccattgctgttgactttgcatcagaagatatt tatgataaaattaaaacaggcttggctggtcttgaaatcggcatcttagtgaacaacgtg ggaatgtcgtatgagtatcctgaatactttttggatgttcctgacttggacaatgttatt ggggaacaggtggtgtttggttacatgaactttatccttcatgcaggatctgagcacaga tcagcatattcagtggtcattacgatgacacaattggtactgcctggcatggtggaaaga tccaaaggggctattctgaacatttcatctggcagtggcatgctccctgtcccactcttg accatctattctgcaaccaagacttttgtagatttcttctctcagtgcctccatgaggag tataggagcaagggcgtctttgtgcagagtgtcctgccatacttcgtagctacaaaactg gctaaaatccggaagccaactttggataagccctctccggagacgtttgtgaagtctgca attaaaacagtcggcctgcaatcccgaaccaatggatacctgatccatgctcttatggta gggctcgataatctcaaacctgccttcttggatttatttgaaaatagtcatgaatatgaa caagtctacacgggctcactatctgaagaaaaccaagaagaactaagcattgataactgc attgtaacttggccagatgctccagcatatgcacgttcactgcaaagcaccctactggtt ttgaaaatctga >gi568815587f:43782653_44020007|GENSCAN_predicted_peptide_2|140_aa MLKFKFTQCARKKFDARVQTPPSDRLWYNRHCWLVDARRDSGSNSVCDRNLKSEAPTPAT PNHRPRHQALPLDVTQRRHYFRSTSHPDILQNITLDNNNDDIVLIGYGEQKQGKDKLSME ENMTNFSLYGSLKIECLLYT >gi568815587f:43782653_44020007|GENSCAN_predicted_CDS_2|423_bp atgctaaaatttaaattcactcaatgtgccagaaaaaaattcgatgctagagtacagacg cccccaagtgaccgcctgtggtataacaggcactgttggcttgtggatgcccgcagagat tcaggttccaactcagtctgtgatcggaaccttaagagtgaagcaccgactccagcaact cccaatcacaggccgcggcatcaggcacttcctctcgacgttacgcagcgccgccactac ttccggtccacctctcatcctgacatcctacaaaacatcactctagacaacaataatgac gacattgtgctgattggatatggtgaacagaagcagggaaaggacaaactctctatggag gaaaatatgaccaacttcagtctttatggttctttaaaaatagaatgtctgctgtacact taa >gi568815587f:43782653_44020007|GENSCAN_predicted_peptide_3|1245_aa MVQKYQSPVRVYKYPFKLIMAAYERRFPTCPLIPTFMGSDTVNEFKSEDGAVHVIERRCK LDVDAPRLLKKIAGVDYVYFVQKNSLNSRERTLHIEGHNETFSNCYTVHPENEDWTRFEQ SASLDIKSFFGFEKTSSSSCKKQAASMAVVIPDAVLKEGLSGDALSSPSAPEPVVGIPDN KLDADYIKRYLGDLTPLQESCLIRLHGWLQETHKGEIPKDEHILQFLCAWDFNIDKAREI ICQSLAWRKQHQVDYILATWAVPQVLQNYYTGGWHHHDKDGWPLCMLRLGQMDTNGLVRA LGEEALLRYVLSINEEELRRCEENTKVFVWPISSWTCLADLEGVNMRHLWRPDVKVLRWI IEVVKASYPKRLGRLLILRSPRVFPVLWTLVSPFIDDNTRRKFLIYAGNDYQDFLSGECM CKVPEGGLVPKSLYWTMEELENEDLKLWTETIYHSASIFKGAPHKILIQIVDASSVTTWN FDMYKGDIVFNIYHSKRSPQPPKKDSLGAPPLQLIDKVWQWGRDYSMVELPLICKGEGVQ GSHVTTWPGLCILQWKFHSMPVCTISSLPQVDDVLASLQISSHNCKVMYYAKWHPVLRTL KNRIEENTGHTFNSLLCNLYRNEKDSVDWHSDDEPSLGRCPIIASLSFGATRTFEMRKKP PPEENGDYTYVERVKIPLDHGTLLIMEGATQADWQASVLSATLRGLVTRGNMGNKQPQKV TVPTGTALQGVVLIVSTLHQPGGWICGKDPCCSLRPLSNSVQNALACKSKQDYQAGILFK TRAFISRDCGSDAAEDSASKGETYTLTLEHKAFGGKKNVERGVDIRQGPRGPRLGISSNN TPTWFTSAVWNSTLGDMIRVQRCSGWLPLLNSRASRVLRGRGFSRNPRGRGLPSGAGWRG AGGAGEGAVTFPERRGDVRRKGAGRARFKWHSLSSELRAVWAAAGYISREPGRRGADGDS SGGERLGARRNSAPRAPCPPTGPPARPPSRGAPARAREGRRHPAADLDPPPGEPPAAASR GAPAQRPPSESPGAPPPGPADAGGAMAAKPGELMGICSSYQAVMPHFVCLADEFPQPVRP AKLPKGRGRLRRPRQSRFKTQPVTFDEIQEVEEEGVSPMEEEKAKKSFLQSLECLRRSTQ SLSLQREQLSSCKLRNSLDSSDSDSALGHRDVPTTQRDDQAHKPNGLGDARETRWKDPRS LNDLVEPNSLFFISHNQMAPPGPANWFYGPHPGTDSAEENSFDSL >gi568815587f:43782653_44020007|GENSCAN_predicted_CDS_3|3738_bp atggtgcaaaaataccagtcaccagtgagggtatacaaataccccttcaaattaattatg gctgcctatgaaaggaggttccctacgtgtcctttgattccgacattcatgggcagtgac actgtgaatgaattcaagagtgaagatggggctgttcatgtcattgaaaggcgctgcaag ctggatgtagatgcaccaagactgctgaagaagattgcaggagttgattacgtttatttt gtccagaagaactcactgaattctcgggaacgtactttgcacattgagggtcataatgaa acattttccaattgctataccgttcaccctgaaaatgaagattggacccgttttgaacag tctgcaagtttagatattaaatctttctttggttttgaaaagacatcttcgtcatcctgc aagaaacaagcagcgtccatggctgttgtcatcccagatgctgtcctcaaggaggggctg agtggcgatgccctcagcagccccagtgcacctgagcccgtggtgggcatccctgataac aaactagatgctgactacatcaagagatacctgggcgatttgactccgctgcaggagagc tgtctcattagacttcacgggtggctccaggagacccacaagggtgaaattccaaaagat gagcatattcttcagttcctatgtgcatgggattttaatattgacaaagccagagagatc atttgtcaatctttggcgtggaggaagcagcaccaggtagactacattcttgctacctgg gccgttccacaggtccttcagaattactacacgggaggctggcatcatcacgacaaagat gggtggcccctctgtatgctcaggctggggcagatggacaccaacggcttggtgagagca ctcggggaggaagccctgctgagatacgttctctccataaatgaagaagagctaaggcga tgtgaagagaatacaaaagtctttgtttggcctatcagctcatggacctgcctggcggac ttggaaggggtgaacatgcgccacttatggagacctgatgtcaaagtgctgcggtggatc atcgaggtggtgaaggccagttaccctaagagactgggccgacttctcatcctgcggtca cccagggtatttcctgtgctctggacgctggttagtccatttattgatgacaacaccaga aggaaattcctcatttatgcaggaaatgactaccaggatttcctgagtggagagtgcatg tgcaaagtgccagagggtggactggtccccaaatctctctactggaccatggaggagctg gagaatgaagacctcaagctctggactgagaccatctaccactctgcaagcatcttcaaa ggagccccacacaagattctcattcagattgtggatgcctcttcagtcaccacttggaat tttgacatgtacaaaggggacattgtctttaacatctatcactccaagaggtcgccacag ccacccaaaaaggactccctaggggccccacctctccagctcatagacaaagtctggcag tggggccgtgactacagcatggtggaattgcctctgatctgcaaaggagaaggcgtgcag ggctcgcatgtgaccacgtggccgggcctctgcatcctgcagtggaaattccacagcatg cccgtgtgcaccatcagcagcctgccccaggtggatgatgtgctcgcgtccctgcagatc tcttcgcacaactgtaaagtgatgtactacgccaaatggcaccctgtgctgcgcacacta aagaaccgcattgaagagaacactggccacaccttcaactccttactctgcaatctttat cgcaatgagaaggacagcgtggactggcacagtgatgatgaaccctcactagggaggtgc cccattattgcttcactaagttttggtgccacacgcacatttgagatgagaaagaagcca ccaccagaagagaatggagactacacatatgtggaaagagtgaagatacccttggatcat gggaccttgttaatcatggaaggagcgacacaagctgactggcaggccagtgttctttct gccactctgagaggcctggtgacaagagggaacatgggaaacaagcagccccagaaggtc acggtgcctactgggacagccctccaaggagtggtattgatcgtctccacgctgcaccag ccaggtggctggatatgtggcaaggatccctgctgcagcttgaggccactttcaaactct gtacagaacgccttggcctgcaagagtaaacaagattaccaggctggaattctgttcaag accagggcttttatatccagagattgtgggtcagatgcggcagaagactctgcttccaag ggagagacttatactttaaccttggagcataaggcctttggaggaaagaaaaatgttgaa agaggtgttgatatccgccaaggtccccgagggccacgtttgggcatcagcagtaacaac accccaacctggttcacttctgctgtgtggaactccacccttggagatatgattagagtg cagcgttgctcgggctggcttcctcttttaaattctcgggccagccgcgtgctccggggc cgcggcttctcccggaacccccgcgggcggggcctgccttccggagccggttggcggggg gcgggcggcgccggggaaggggccgttactttcccagagcggcggggcgacgtcaggcgg aagggcgcggggcgcgcacgctttaaatggcattcgctgtcatccgagctcagagccgtg tgggcagccgcgggctatataagccgcgagcctggccgccgcggggcagacggcgacagc agcggcggcgagcgcctcggagcgcggcggaacagcgccccccgagccccgtgccccccg acgggtccgcccgcccgcccgccctcccgaggagcgccggcccgggcccgcgagggccgc cgccaccccgcagcagatttggatcccccgcccggcgagcccccggctgctgcctcccgg ggggccccggcgcagcggccgccctcggagagccccggcgccccgccgcccggccccgca gacgccggaggcgccatggccgccaagcccggcgagctgatgggcatctgctccagttac caggcggtgatgccgcacttcgtgtgcctggccgacgagttcccgcagcccgtgcggccc gccaagctgcccaagggccggggccggctgcggcggccgcgccagtctcgcttcaagacg cagccggtgaccttcgacgagatccaggaggtggaggaggagggggtgtcccccatggag gaggagaaggccaagaagtcgttcctgcagagcctggagtgcctgcgccgcagcacgcag agcctgtcgctgcagcgggagcagctcagcagctgcaaactgaggaacagcctggactcc agcgactccgactcggccctggggcacagggatgtgcctacaacccagcgggatgatcaa gcccacaagcccaatggcctaggggatgccagagaaacaagatggaaggatcccagatcc ctgaatgaccttgtggagccgaacagcctgtttttcatttctcacaaccagatggcccca cctggacctgccaactggttctatggcccccacccaggaactgactcagcagaagagaac agcttcgactccctatga >gi568815587f:43782653_44020007|GENSCAN_predicted_peptide_4|214_aa MLTSKAWGNSSLLLAAARGKSPSPCRIQTPPALDKAMQPFPSTEQLPYPVPCQGLLLIVS FNLHNKKSLLARVTIPILPDPEEALEEAVWRVWKTPPPTPAKTLFWRIFPGSSLLAPSNK TPIDQNLSSDEELFVTHQANEPPCFGGNTKGQQITHSQQQVTYASEPHFLCLLNDDDDDN SAYPIVGETIDKSKVVKTYPYVFLQEFYGFSSYI >gi568815587f:43782653_44020007|GENSCAN_predicted_CDS_4|645_bp atgctcacctccaaggcctgggggaacagctccctgctactggctgctgccagaggaaag tcaccctcaccatgccgcatccagacccctccagccctggacaaggccatgcagccattc ccgtcaacagagcagctcccctacccggtgccttgccaggggcttttgcttattgtctca tttaacctccacaacaaaaaatcactcctagctagggtaactatccccattttaccggac cctgaggaggctctggaggaagctgtgtggagagtgtggaagactccacccccaacccct gcaaagacactgttttggagaatattcccagggagctccttacttgcgccaagtaataaa acgcctattgatcaaaacctgagttctgatgaagagttatttgttactcaccaggcaaac gaacccccgtgttttggaggtaacacgaagggccagcaaataacacattctcagcagcag gtgacgtacgcttctgagcctcactttctttgcctgttaaatgatgatgatgatgataat agcgcttaccccatagttggagaaaccattgacaaatccaaggtcgtgaagacttaccct tatgtcttcttgcaggagttttacggctttagctcttatatttag