GENSCAN 1.0    Date run: 7-Nov-116    Time: 21:25:32

Sequence gi568815597f:25298721_25499197 : 200477 bp : 44.12% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2226   2373  148  2  1   80   82   134 0.683  11.81
 1.02 Intr +   2800   2966  167  2  2   84  105   126 0.974  13.58
 1.03 Intr +   4602   4739  138  2  0   95  111    49 0.996   8.56
 1.04 Intr +   7876   8009  134  0  2   65   34   204 0.970  12.14
 1.05 Intr +  18280  18359   80  1  2  122   82    68 0.881   8.79
 1.06 Intr +  52906  52973   68  2  2   78   99    87 0.555   7.42
 1.07 Term +  63335  63376   42  1  0  113   41    24 0.090  -2.64
 1.08 PlyA +  63527  63532    6                               1.05

 2.12 PlyA -  63611  63606    6                               1.05
 2.11 Term -  70716  70571  146  1  2   78   47    28 0.211  -4.23
 2.10 Intr -  76708  76629   80  0  2  122   82    63 0.896   8.29
 2.09 Intr -  87124  86991  134  1  2   78   34   151 0.979   8.14
 2.08 Intr -  90393  90256  138  0  0   95  111    50 0.995   8.66
 2.07 Intr -  92195  92029  167  0  2   84   89    62 0.936   5.58
 2.06 Intr -  93421  93274  148  1  1   80   82    98 0.638   8.21
 2.05 Intr - 102982 102851  132  1  0  -12  116    67 0.407   0.34
 2.04 Intr - 104026 103876  151  0  1   89   92   137 0.388  14.36
 2.03 Intr - 110149 109963  187  2  1   33   72   204 0.381  12.05
 2.02 Intr - 110814 110755   60  1  0  111  100   -13 0.261   0.91
 2.01 Init - 122066 121919  148  2  1   60  100    98 0.541   6.64
 2.00 Prom - 123748 123709   40                              -7.66

 3.00 Prom + 124542 124581   40                              -7.96
 3.01 Init + 131253 131584  332  2  2   38   61   218 0.477   8.90
 3.02 Intr + 132372 132458   87  0  0   19   99   142 0.520   7.49
 3.03 Intr + 148042 148183  142  1  1   94   84    49 0.554   5.46
 3.04 Intr + 150088 150214  127  0  1  100  111   -79 0.220  -4.15
 3.05 Intr + 155539 155662  124  2  1   91  119     5 0.569   3.54
 3.06 Intr + 157933 158111  179  1  2   71  100   107 0.644   9.76
 3.07 Intr + 159671 160172  502  0  1   45  111   248 0.777  14.84
 3.08 Intr + 185396 185581  186  2  0   77   71   258 0.887  21.80
 3.09 Intr + 186893 187075  183  2  0   58   77   182 0.990  13.00
 3.10 Intr + 190453 190573  121  1  1   73  110   129 0.999  14.10
 3.11 Intr + 192690 192864  175  2  1   -1   97   212 0.903  12.71
 3.12 Term + 199544 199746  203  0  2   93   49   200 0.999  14.15
 3.13 PlyA + 199822 199827    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:25298721_25499197|GENSCAN_predicted_peptide_1|258_aa
TDYHMNMMHIYVFAAYFGLSVAWCLPKPLPEGTEDKDQTATIPSLSAMLGALFLWMFWPS
FNSALLRSPIERKNAVFNTYYAVAVSVVTAISGSSLAHPQGKISKTYVHSAVLAGGVAVG
TSCHLIPSPWLAMVLGLVAGLISVGGAKYLPGCCNRVLGIPHSSIMGYNFSLLGLLGEII
YIVLLVLDTVGAGNGMIGFQVLLSIGELSLAIVIALMSGLLTGLMQYRMDKSEVIVTVKV
VWVKQCNSEYAQGMPATL

>gi568815597f:25298721_25499197|GENSCAN_predicted_CDS_1|777_bp
acagactaccacatgaacatgatgcacatctacgtgttcgcagcctattttgggctgtct
gtggcctggtgcctgccaaagcctctacccgagggaacggaggataaagatcagacagca
acgatacccagtttgtctgccatgctgggcgccctcttcttgtggatgttctggccaagt
ttcaactctgctctgctgagaagtccaatcgaaaggaagaatgccgtgttcaacacctac
tatgctgtagcagtcagcgtggtgacagccatctcagggtcatccttggctcacccccaa
gggaagatcagcaagacttatgtgcacagtgcggtgttggcaggaggcgtggctgtgggt
acctcgtgtcacctgatcccttctccgtggcttgccatggtgctgggtcttgtggctggg
ctgatctccgtcgggggagccaagtacctgccggggtgttgtaaccgagtgctggggatt
ccccacagctccatcatgggctacaacttcagcttgctgggtctgcttggagagatcatc
tacattgtgctgctggtgcttgataccgtcggagccggcaatggcatgattggcttccag
gtcctcctcagcattggggaactcagcttggccatcgtgatagctctcatgtctggtctc
ctgacaggattaatgcagtatcgaatggacaagtccgaggtgatagttacagtgaaggtt
gtctgggtcaaacagtgcaattcagaatatgctcagggaatgccagccaccttgtaa

>gi568815597f:25298721_25499197|GENSCAN_predicted_peptide_2|496_aa
MSSKYPRSVRRCLPLCALTLEAALILLFYFFTHYDASLEDQKGLVASYQGASLPLTCHSS
LCSGSETGKVGQDLTVMAALGLGFLTSNFRRHSWSSVAFNLFMLALGVQWAILLDGFLSQ
FPPGKVVITLFSIRLATMSAMSVLISAGAVLGKVNLAQLVVMVLVEVTALGTLRMVISNI
FNSLTSSGSSKFVQPASSFSQCVSVTAHVLSLGEPVESLSLKAAVKTDYHMNLRHFYVFA
AYFGLTVAWCLPKPLPKGTEDNDQRATIPSLSAMLGALFLWMFWPSVNSALLRSPIQRKN
AMFNTYYALAVSVVTAISGSSLAHPQRKISMTYVHSAVLAGGVAVGTSCHLIPSPWLAMV
LGLVAGLISIGGAKCLPVCCNRVLGIHHISVMHSIFSLLGLLGEITYIVLLVLHTVWNGN
GMIGFQVLLSIGELSLAIVIALTSGLLTGHPLVNAVGRLDQQGPVLSQQVQANSGQAAAR
CCAFVSGRSKTGTQAF

>gi568815597f:25298721_25499197|GENSCAN_predicted_CDS_2|1491_bp
atgagctctaagtacccgcggtctgtccggcgctgcctgcccctctgcgccctaacactg
gaagcagctctcattctcctcttctatttttttacccactatgacgcttccttagaggat
caaaaggggctcgtggcatcctatcaaggtgcatctcttccactcacctgccacagcagc
ctctgctcagggtctgagactgggaaagtcggccaagatctgaccgtgatggcggccctt
ggcttgggcttcctcacctcaaatttccggagacacagctggagcagtgtggccttcaac
ctcttcatgctggcgcttggtgtgcagtgggcaatcctgctggacggcttcctgagccag
ttccctcctgggaaggtggtcatcacactgttcagtattcggctggccaccatgagtgct
atgtcggtgctgatctcagcgggtgctgtcttggggaaggtcaacttggcgcagttggtg
gtgatggtgctggtggaggtgacagctttaggcaccctgaggatggtcatcagtaatatc
ttcaacagcctcaccagcagtgggtccagcaagtttgtacagccagcatcttctttcagt
cagtgcgtgtcagtaactgcacatgtcctctcattgggagagcctgtcgaaagtctaagt
ttgaaggcagctgtgaagacagactaccacatgaacctgaggcacttctacgtgttcgca
gcctattttgggctgactgtggcctggtgcctgccaaagcctctacccaagggaacggag
gataatgatcagagagcaacgatacccagtttgtctgccatgctgggcgccctcttcttg
tggatgttctggccaagtgtcaactctgctctgctgagaagtccaatccaaaggaagaat
gccatgttcaacacctactatgctctagcagtcagtgtggtgacagccatctcagggtca
tccttggctcacccccaaaggaagatcagcatgacttatgtgcacagtgcggtgttggca
ggaggcgtggctgtgggtacctcgtgtcacctgatcccttctccgtggcttgccatggtg
ctgggtcttgtggctgggctgatctccatcgggggagccaagtgcctgccggtgtgttgt
aaccgagtgctggggattcaccacatctccgtcatgcactccatcttcagcttgctgggt
ctgcttggagagatcacctacattgtgctgctggtgcttcatactgtctggaacggcaat
ggcatgattggcttccaggtcctcctcagcattggggaactcagcttggccatcgtgata
gctctcacgtctggtctcctgacaggccatcctctagtcaatgctgtgggtaggctggac
cagcagggaccagtattgtcacagcaagtccaggccaacagtggtcaggctgctgcccgg
tgttgtgcctttgtgagtggcagatccaagaccggaacccaggccttctga

>gi568815597f:25298721_25499197|GENSCAN_predicted_peptide_3|786_aa
MSTKREGSPAPRPRLAGARTRRAGSRSGPARPGGRCFAVVPSRVLRTRRPRGPQRCPACG
RGARAAPPHTPRASLRYVARPHGSAPWRNPGRGRHLRLAGALSPRRGKLAGGRMKRRNAD
CSKLRRPLKRNRITEGIYGSTFLYLKFLVVWALVLLADFVLEFRFEYLWPFWLFIRSVYD
SFRYQGLAFSVFFVCVAFTSNIICLLFIPIQWLFFAASTYVWVQYVWHTERGVCLPTVSL
WILFVYIEAAIRFKDLKNFHVDLCRPFAAHCIGYPVVTLGFGFKSYVSYKMRLRKQKEVQ
KENEFYMQLLQQALPPEQQMLQKQEKEAEEAAKGLPDMDSSILIHHNGGIPANKKLSTTL
PEIEYREKGKEKDKDAKKHNLGINNNNILQPVDSKIQEIEYMENHINSKRLNNDLVGSTE
NLLKEDSCTASSKNYKNASGVVNSSPRSHSATNGSIPSSSSKNEKKQKCTSKSPSTHKDL
MENCIPNNQLSKPDALVRLEQDIKKLKADLQASRQVEQELRSQISSLSSTERGIRSEMGQ
LRQENELLQNKYVHLSGLWPLHNAVQMKQKDKQNISQLEKKLKAEQEARSFVEKQLMEEK
KRKKLEEATAARAVAFAAASRGECTETLRNRIRELEAEGKKLTMDMKVKEDQIRELELKV
QELRKYKENEKDTEVLMSALSAMQDKTQHLENSLSAETRIKLDLFSALGDAKRQLEIAQG
QILQKDQEIKDLKQKIAEVMAVMPSITYSAATSPLSPVSPHYSSKFVETSPSGLDPNASV
YQPLKK

>gi568815597f:25298721_25499197|GENSCAN_predicted_CDS_3|2361_bp
atgagtacgaagcgtgaagggtcgccggccccgcgtccccgccttgcaggagcccggacc
cgccgagcaggctcccggtccggcccggcccggcccggaggccgatgcttcgcggtagtg
ccctcgcgggtcctgcggactcggcgcccgcggggcccacagcgctgcccagcctgtgga
cgtggagcccgggccgcaccgccgcacacgccccgggcatctctgcgctacgtcgcgcgc
ccccacggctccgccccgtggcgtaacccggggcggggccgccacctccgattggccggc
gcgctgtcaccacgtcgcgggaagctggcaggcgggaggatgaagcggcggaacgccgac
tgcagtaagctccgccgccccctaaagcggaaccggatcaccgagggcatttacggcagt
acatttttatacctgaaattcctggtggtgtgggcacttgtcctcctagcagattttgtc
ctggagttcagatttgaatacctgtggccattctggcttttcatcagaagcgtctatgat
tccttcagataccagggactggccttctcagtattttttgtttgtgtagcattcacgtca
aatataatatgcctgctgttcatccccatacagtggcttttttttgctgctagcacatat
gtatgggttcagtacgtatggcacacagaaaggggagtgtgtttgcctacagtgtctctc
tggatcctctttgtttatattgaagcagccattagatttaaagatctcaaaaactttcat
gtagacctttgtcgtccatttgctgctcactgtattgggtaccctgtggtaactttgggg
tttggcttcaaaagttacgtaagctacaaaatgcggttaaggaagcaaaaagaagtacaa
aaagagaacgagttttacatgcaacttcttcaacaagctctccctccagagcaacagatg
ctacagaagcaagaaaaagaggccgaggaagcagccaaaggattacctgatatggattct
tcgatccttatacaccacaatggaggtatcccagccaacaaaaaactctccacaactttg
ccagagatagaataccgagaaaaagggaaagaaaaggacaaggatgccaaaaaacacaac
cttggaataaataacaacaatattctacaacctgtagactctaaaatacaagagattgag
tatatggaaaaccatatcaatagtaaaagattaaataatgatcttgtgggaagtacagaa
aatctcttgaaagaggactcatgcactgcttcctcaaaaaattacaaaaatgccagtgga
gttgtgaactcttcacctcgaagtcatagcgccacaaatgggagcattccttcctcatct
agtaaaaatgagaagaagcagaaatgcactagcaagagcccaagtacacacaaggactta
atggaaaactgtattcctaataaccagctaagcaaaccagacgcactggtcaggctggaa
caagacattaaaaagttaaaggctgacctgcaagccagcagacaagtggaacaagagctc
cgcagtcagatcagctccctttcgagcaccgagcgagggatccgctcagaaatgggccag
cttcggcaggagaacgagctgctgcagaacaagtacgtgcacctttcaggcctttggccg
ttacataatgctgtgcaaatgaagcaaaaagacaagcagaatatcagccagttggagaaa
aagctaaaagctgagcaggaagcccgaagttttgtagagaaacagttaatggaagagaaa
aagaggaagaagttagaagaagccactgctgcccgggctgttgcgtttgctgctgcatct
aggggagaatgcaccgaaaccttacggaatcggatcagagaactagaagcagagggcaag
aagctcacgatggacatgaaggtgaaagaagaccaaatcagagaactagaactaaaagtc
caggagcttcggaaatataaggaaaatgagaaggacactgaggtgttaatgtcagccctc
tcagccatgcaagacaaaacacagcacctggagaacagcttaagtgcagagacgagaatc
aagctggacctgttctccgcactgggcgatgcaaagcggcagctcgagattgcccaagga
caaatccttcagaaagatcaggaaatcaaggacctaaaacagaagatagccgaagtcatg
gccgtcatgcccagcataacatacagtgccgccaccagccccctgagccctgtttccccc
cactactcttccaaatttgtggagaccagcccctctggacttgaccccaatgcctctgtt
taccagcccctgaagaaatga