GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:29:28 Sequence gi568815592r:158666917_158889371 : 222455 bp : 46.11% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 9062 9715 654 1 0 86 54 179 0.915 10.49 1.02 PlyA + 10627 10632 6 1.05 2.00 Prom + 23694 23733 40 -5.36 2.01 Init + 36308 36468 161 1 2 61 28 216 0.511 10.25 2.02 Intr + 41406 41475 70 0 1 84 59 85 0.982 4.38 2.03 Intr + 46154 46291 138 1 0 82 64 61 0.930 3.76 2.04 Intr + 46884 46962 79 2 1 101 106 58 0.814 8.02 2.05 Intr + 51171 51295 125 1 2 101 42 75 0.501 4.50 2.06 Intr + 58587 58721 135 2 0 113 94 -33 0.001 0.56 2.07 Intr + 78564 78742 179 2 2 75 94 136 0.005 11.62 2.08 Intr + 85012 85114 103 1 1 117 75 104 0.965 12.18 2.09 Intr + 90295 90465 171 0 0 124 59 216 0.867 22.44 2.10 Intr + 93724 93829 106 0 1 93 86 63 0.905 6.39 2.11 Term + 97579 97688 110 2 2 111 44 31 0.714 -0.33 2.12 PlyA + 97934 97939 6 1.05 3.19 PlyA - 98305 98300 6 1.05 3.18 Term - 100162 99998 165 1 0 49 42 384 0.999 27.82 3.17 Intr - 100596 100345 252 0 0 38 81 576 0.957 49.53 3.16 Intr - 102502 102410 93 1 0 87 96 128 0.948 13.66 3.15 Intr - 103028 102868 161 0 2 96 62 230 0.998 20.91 3.14 Intr - 103978 103848 131 0 2 80 55 244 0.999 20.64 3.13 Intr - 104491 104328 164 1 2 76 89 238 0.999 21.47 3.12 Intr - 109588 109492 97 0 1 93 95 90 0.983 10.21 3.11 Intr - 116750 116604 147 1 0 69 78 137 0.991 10.15 3.10 Intr - 117811 117728 84 0 0 94 89 72 0.991 6.84 3.09 Intr - 118718 118393 326 2 2 -4 105 438 0.953 30.87 3.08 Intr - 122455 122372 84 1 0 105 77 117 0.965 12.32 3.07 Intr - 128916 128798 119 1 2 104 54 31 0.668 1.48 3.06 Intr - 151250 151166 85 2 1 69 100 156 0.864 14.29 3.05 Intr - 167997 167879 119 1 2 78 57 66 0.262 2.68 3.04 Intr - 169181 169077 105 0 0 82 42 60 0.152 0.99 3.03 Intr - 175223 174963 261 0 0 49 4 238 0.020 8.96 3.02 Intr - 201438 201390 49 0 1 46 101 58 0.130 1.15 3.01 Init - 203593 203516 78 1 0 90 92 46 0.725 6.26 3.00 Prom - 208085 208046 40 -1.76 4.03 PlyA - 208293 208288 6 1.05 4.02 Term - 220189 219387 803 2 2 79 43 270 0.444 15.01 4.01 Init - 221750 221693 58 2 1 42 94 36 0.555 1.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 78600 78742 143 2 2 81 94 127 0.993 12.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:158666917_158889371|GENSCAN_predicted_peptide_1|217_aa MAILPKVIYIPIKLPMTFFTELEKTTFKFIWNQKRACIAKSILSQKNKAGGITLPDFKLY YKATVTKTTWYCYQNRDIDQWNRTEPSEIMLHIYNYLIFDKPDKNKQWGKDSLFNKWCWE NWLAICRKLKLDPFLTPYTKINSTWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKT PKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTE >gi568815592r:158666917_158889371|GENSCAN_predicted_CDS_1|654_bp atggccatactgcccaaggtaatttatatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttcaagttcatatggaaccaaaaaagagcctgcattgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacaacatggtactgttaccaaaacagagatatagaccaa tggaacagaacagagccctcagaaataatgttgcatatctacaactatctgatctttgac aaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa attaattcaacatggattaaagacttaaatgttagacctaaaaccataaaaaccctagaa gaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtctaaaaca ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaaccaccatcagagtgaacaggcaacctacagaatga >gi568815592r:158666917_158889371|GENSCAN_predicted_peptide_2|458_aa MLRAVPGRHGCQPAMCLRQALAASDLGPGLQLFLSLAKTGIPAKVTRQPDRAGCKISVVP PTPPPVSESQCSRSPGRFQTETGVTFKILLYFLLEISNSPESFQYLRAHDVFIGQMDDWM GGNLQEFGQFRGFNKSVENLFLSLATHVKKLSKSQNDMTSEKHLLATGPRQCVGQTERRS QSDTAVNVTTRKVSAPDILKPLNQEDPKCSTNPILKQQNLPSSPAPSTIFSGGFRHGSLI SIDSTCTEMGNFDNANVTGEIEFAIHYCFKTHSLEICIKACKNLAYGEEKKKKCNPYVKT YLLPDRSSQGKRKTGVQRNTVDPTFQETLKYQVAPAQLVTRQLQVSVWHLGTLARRVFLG EVIIPLATWDFEDSTTQSFRWHPLRAKAEKYEDSVPQSNGELTVRAKLVLPSRPRKLQEA QEEGDTAVGGDACSLSKLQWQKVLSSPNLWTDMTLVLH >gi568815592r:158666917_158889371|GENSCAN_predicted_CDS_2|1377_bp atgctccgcgccgtgcctggcagacacggctgccaaccagccatgtgcctgagacaagcc ctggcagcctcagacctgggccctggcctgcagctgttcctgtccctggcaaagacgggt atccctgctaaagtcactcgccagccagaccgggcagggtgcaaaatttctgtggttcct cctactccacctcctgtcagcgagagccagtgcagccgcagtcctggcaggtttcaaact gagactggggtgaccttcaaaatcctgctctactttctcctggaaatttcaaattctcca gagtcgttccagtacctacgtgctcatgatgtatttattggacagatggatgattggatg ggtggtaatttacaggaatttggtcagtttagaggatttaataagtccgtggaaaatttg tttctgtctcttgctacccacgtgaaaaagctctccaaatcccagaatgatatgacttct gagaagcatcttctcgccacgggccccaggcagtgtgtgggacagacagagagacggagc cagtctgacactgcggtcaacgtcaccaccaggaaggtcagtgcaccagatattctgaaa cctctcaatcaagaggatcccaaatgctctactaaccctattttgaagcaacagaatctc ccatccagtccggcacccagtaccatattctctggaggttttagacacggaagtttaatt agcattgacagcacctgtacagagatgggcaattttgacaatgctaatgtcactggagaa atagaatttgccattcattattgcttcaaaacccattctttagaaatatgcatcaaggcc tgtaagaaccttgcctatggagaagaaaagaagaaaaagtgcaatccgtatgtgaagacc tacctgttgcccgacagatcctcccagggaaagcgcaagactggagtccaaaggaacacc gtggacccgacctttcaggagaccttgaagtatcaggtggcccctgcccagctggtgacc cggcagctgcaggtctcggtgtggcatctgggcacgctggcccggagagtgtttcttgga gaagtgatcattcctctggccacgtgggactttgaagacagcacaacacagtccttccgc tggcatccgctccgggccaaggcggagaaatacgaagacagcgttcctcagagtaatgga gagctcacagtccgggctaagctggttctcccttcacggcccagaaaactccaagaggct caagaagagggagacacagctgttggcggggatgcatgctcactatcgaagctccagtgg cagaaagtcctttccagccccaatctatggacagacatgactcttgtcctgcactga >gi568815592r:158666917_158889371|GENSCAN_predicted_peptide_3|839_aa MVTGYYLHLKDGESLRFPDVFSVTDWWVRGLSDFKNEATDLRVLECPNLKLKKPPWLQVL SAMIVYALMVVSYFLVTGGIIYDVIVEPPSIGSMTDEHGHQRPVAFLAYRVNEQCIMEGL ASSFLFTIGASRMGVVPGPSFSLTAQSAQPSLAPAGPVLSYVVEEALEWKQGFLYDSLSG WGCPDGKAMLPGYSHDRQAGPSSWVGTASSLLLDSRVFGDRGYSPETENAETMIPKFGLL KITCGSDSFLGCGCLKSFCGDSDGEADVGSTVINVRVTTMDAELEFAIQPNTTGKQLFDQ WAFVFRICVESVACFVQVSAQEVRKENPLQFKFRAKFYPEDVAEELIQDITQKLFFLQVK EGILSDEIYCPPETAVLLGSYAVQAKFGDYNKEVHKSGYLSSERLIPQRVMDQHKLTRDQ WEDRIQVWHAEHRGMLKDNAMLEYLKIAQDLEMYGINYFEIKNKKGTDLWLGVDALGLNI YEKDDKLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRINKRILQLCMGN HELYMRRRKPDTIEVQQMKAQAREEKHQKQLERQQLETEKKRRETVEREKEQMMREKEEL MLRLQDYEEKTKKAERELSEQIQRALQLEEERKRAQEEAERLEADRMAALRAKEELERQA VDQIKSQEQLAAELAEYTAKIALLEEARRRKEDEVEEWQHRAKEAQDDLVKTKEELHLVM TAPPPPPPPVYEPVSYHVQESLQDEGAEPTGYSAELSSEGIRDDRNEEKRITEAEKNERV QRQLLTLSSELSQARDENKRTHNDIIHNENMRQGRDKYKTLRQIRQGNTKQRIDEFEAL >gi568815592r:158666917_158889371|GENSCAN_predicted_CDS_3|2520_bp atggtgaccggctattatctgcatcttaaagatggggaaagtttgaggtttcctgatgtg ttcagcgtcacagactggtgggttcgtggtctcagtgacttcaagaatgaagccacagac cttcgcgtgctcgaatgtcccaacctgaagctgaagaagccgccctggctgcaagtgctg tcggccatgattgtgtatgctctgatggtggtgtcttacttcctcgtcactggaggaata atttatgatgttattgttgaacctccaagcattggctctatgactgatgaacacgggcat cagaggccagtagctttcttggcctacagagtaaatgaacaatgtattatggaaggactt gcatccagcttcctgtttacaataggagcttcacggatgggcgtggttccaggtcctagc ttctccctgactgcccagtctgcccagccttccttggctccagcaggccctgtcctgagc tatgtcgttgaagaggccttggagtggaaacaagggttcttgtatgactccctgagtggc tggggatgtccagatggcaaggctatgctgcccggttactcacacgacaggcaagctggg ccatcgtcctgggttgggacagcgtcttcgctgctgctggatagtcgtgttttcggggat cgaggatactcaccagaaaccgaaaatgccgaaaccatgattcccaaatttggcctcctc aaaatcacttgtggaagtgacagcttcctgggctgcgggtgtttgaaaagcttctgtggg gattctgatggtgaggctgatgtgggaagcactgtgatcaatgtccgagttaccaccatg gatgcagagctggagtttgcaatccagccaaatacaactggaaaacagctttttgatcag tgggccttcgtcttccggatttgtgtggagagtgtggcttgtttcgtgcaggtgtctgcc caggaggtcaggaaggagaatcccctccagttcaagttccgggccaagttctaccctgaa gatgtggctgaggagctcatccaggacatcacccagaaacttttcttcctccaagtgaag gaaggaatccttagcgatgagatctactgcccccctgagactgccgtgctcttggggtcc tacgctgtgcaggccaagtttggggactacaacaaagaagtgcacaagtctgggtacctc agctctgagcggctgatccctcaaagagtgatggaccagcacaaacttaccagggaccag tgggaggaccggatccaggtgtggcatgcggaacaccgtgggatgctcaaagataatgct atgttggaatacctgaagattgctcaggacctggaaatgtatggaatcaactatttcgag ataaaaaacaagaaaggaacagacctttggcttggagttgatgcccttggactgaatatt tatgagaaagatgataagttaaccccaaagattggctttccttggagtgaaatcaggaac atctctttcaatgacaaaaagtttgtcattaaacccatcgacaagaaggcacctgacttt gtgttttatgccccacgtctgagaatcaacaagcggatcctgcagctctgcatgggcaac catgagttgtatatgcgccgcaggaagcctgacaccatcgaggtgcagcagatgaaggcc caggcccgggaggagaagcatcagaagcagctggagcggcaacagctggaaacagagaag aaaaggagagaaaccgtggagagagagaaagagcagatgatgcgcgagaaggaggagttg atgctgcggctgcaggactatgaggagaagacaaagaaggcagagagagagctctcggag cagattcagagggccctgcagctggaggaggagaggaagcgggcacaggaggaggccgag cgcctagaggctgaccgtatggctgcactgcgggctaaggaggagctggagagacaggcg gtggatcagataaagagccaggagcagctggctgcggagcttgcagaatacactgccaag attgccctcctggaagaggcgcggaggcgcaaggaggatgaagttgaagagtggcagcac agggccaaagaagcccaggatgacctggtgaagaccaaggaggagctgcacctggtgatg acagcacccccgcccccaccaccccccgtgtacgagccggtgagctaccatgtccaggag agcttgcaggatgagggcgcagagcccacgggctacagcgcggagctgtctagtgagggc atccgggatgaccgcaatgaggagaagcgcatcactgaggcagagaagaacgagcgtgtg cagcggcagctgctgacgctgagcagcgagctgtcccaggcccgagatgagaataagagg acccacaatgacatcatccacaacgagaacatgaggcaaggccgggacaagtacaagacg ctgcggcagatccggcagggcaacaccaagcagcgcatcgacgagttcgaggccctgtaa >gi568815592r:158666917_158889371|GENSCAN_predicted_peptide_4|286_aa MTSGPQTNQPKEHLTNFKLDERESFSLAQSRADNRRLHEPDLLEGIRAVPREDPQWNYQA DSPGTAGQDYMVSCLVEGLKKAAYKAGNYDELKETTRGKDENPAQVMARLAATLRRFTAL DPQGPEGRLILNIRFITQSAPDMRKKLQKLEPGPQIPQQELINLAFKVHNNREEVARQQH ISELQLLDSSGRQPTTTSPAYKNFRTSKPQLPGVLQNILVDLASNAKSLATGPQNAHSLG FLLSHALSVRAPTGGRTVRLTSLPLLKPLEPKPNVPWPTPSQISSA >gi568815592r:158666917_158889371|GENSCAN_predicted_CDS_4|861_bp atgacctcaggtcctcagaccaaccagcccaaggaacatctcaccaatttcaaattggat gaacgggaaagtttttctctagcccagtctcgtgctgataaccgccggcttcatgagcca gacctcctggaaggcattagagcagttccccgagaggatccccagtggaactatcaggca gattccccaggtacagctgggcaagattacatggtttcctgcctagttgaagggcttaaa aaggcagcttacaaagctggtaattatgacgaacttaaggaaactacccgaggtaaagac gaaaacccagcccaggtcatggcccgcttggcagctacccttagacgctttaccgcccta gacccacaggggccagaaggccgccttattctcaatatacgttttatcacccagtcagct cctgacatgagaaaaaagcttcaaaaattggaacccggccctcaaatcccacaacaggaa ttaatcaacctcgccttcaaggtgcacaataatagagaggaggtagccagacagcaacac atttctgagttacagctacttgactcctctggaagacaacccacaaccacgtctccagca tacaagaacttcaggacatccaagccacagctcccaggggttcttcaaaacatcctcgtg gaccttgcttcaaatgccaaaagcctggccactgggcctcagaatgcccacagcctggga ttcctcctaagccatgccctgtctgtgcgggcccccactggaggtcggactgtccgactc acatcactgccactcctaaagcccctggagcccaaacccaatgttccttggccgactcct tcccagatctcctcggcttag