GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:59:20 Sequence gi568815596f:180882043_181162885 : 280843 bp : 35.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 1773 1658 116 2 2 96 92 46 0.009 4.97 1.01 Init - 12518 12418 101 2 2 95 83 27 0.078 2.68 1.00 Prom - 12820 12781 40 -2.45 2.00 Prom + 32131 32170 40 -1.75 2.01 Sngl + 82712 83248 537 1 0 20 44 377 0.401 22.33 2.02 PlyA + 83660 83665 6 1.05 3.00 Prom + 89511 89550 40 -4.95 3.01 Init + 100001 100194 194 1 2 77 90 245 0.823 22.09 3.02 Intr + 102001 102051 51 1 0 72 110 57 0.755 3.40 3.03 Term + 109626 109752 127 0 1 80 37 59 0.172 -3.23 3.04 PlyA + 111410 111415 6 1.05 4.00 Prom + 127257 127296 40 -4.55 4.01 Sngl + 150866 151564 699 1 0 60 48 183 0.710 7.45 4.02 PlyA + 152632 152637 6 1.05 5.03 PlyA - 152726 152721 6 1.05 5.02 Term - 154530 154067 464 1 2 93 39 130 0.876 2.83 5.01 Init - 156109 156079 31 1 1 96 92 28 0.911 4.00 5.00 Prom - 167705 167666 40 -3.55 6.00 Prom + 168953 168992 40 -5.55 6.01 Init + 172318 172394 77 0 2 68 75 54 0.641 2.71 6.02 Intr + 175651 175783 133 1 1 64 116 86 0.896 8.73 6.03 Term + 194154 194330 177 2 0 113 43 72 0.043 1.90 6.04 PlyA + 194519 194524 6 1.05 7.00 Prom + 197098 197137 40 -5.25 7.01 Sngl + 208214 208900 687 1 0 64 49 473 0.793 36.94 7.02 PlyA + 209467 209472 6 1.05 8.00 Prom + 211974 212013 40 -5.95 8.01 Init + 212261 212385 125 1 2 71 91 81 0.854 6.39 8.02 Intr + 246883 247035 153 1 0 70 23 117 0.001 1.67 8.03 Intr + 250804 250861 58 1 1 117 111 -6 0.088 2.57 8.04 Intr + 255711 255889 179 2 2 49 64 87 0.110 0.20 8.05 Intr + 263008 263156 149 1 2 60 79 109 0.312 6.16 8.06 Term + 267136 267221 86 2 2 66 48 64 0.274 -2.96 8.07 PlyA + 268644 268649 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:180882043_181162885|GENSCAN_predicted_peptide_1|73_aa MGLFFHLVLPDTLKGFDSSSFISKYFKEGPPEVRLAYAGTCVVRPKQSDLGASTWFAGML VVSASWPDVVHLX >gi568815596f:180882043_181162885|GENSCAN_predicted_CDS_1|219_bp atggggcttttttttcaccttgttcttccagacactctgaagggttttgattcttcaagt tttatttcaaagtacttcaaagagggacctcctgaagtgaggttggcttatgctggcact tgtgttgttaggcccaagcagtctgatttgggggcctccacatggtttgctgggatgttg gtagtttctgcttcctggcctgatgtggtacatctggnn >gi568815596f:180882043_181162885|GENSCAN_predicted_peptide_2|178_aa MICTNKWVHRSQEVRFENLRLHFRSCMETPRIPRQKFAAGVGPLWRTSARAVQKGNVGSE PPHRVPTGALPSGAVRRGPLSSRHQNGRSTDSLHCAAGKATGIQCQPMKAAGSGAVPCKA TGEELPKTMGIHLLHQLDLDVRHGVKGDHFGALRFHRRAQFRTCMGPTAPLLLPISPI >gi568815596f:180882043_181162885|GENSCAN_predicted_CDS_2|537_bp atgatatgcacaaacaaatgggtgcacagaagtcaagaagtgaggtttgaaaacctccgc ctgcatttcagaagttgtatggaaacacctagaatcccccggcagaaatttgctgcaggg gtggggcccttatggagaacctctgctagggcagtgcagaaaggaaatgtggggtcggag cccccacacagagttcctactggggcactgcctagtggagctgtgagaagagggccactg tcctccagacaccagaatggtagatccactgacagcttgcactgtgcagctggaaaagcc acaggcattcaatgccagcccatgaaagcagctgggagtggggctgtaccctgcaaagcc acaggggaggagctgccaaagaccatgggaatccacctcttgcatcagcttgacctggat gtgagacatggagtcaaaggagatcattttggagctttaagatttcaccgccgtgctcaa tttcggacttgcatggggcctacagctcctttgcttttgccaatttctcccatttga >gi568815596f:180882043_181162885|GENSCAN_predicted_peptide_3|123_aa MSSDRQRSDDESPSTSSGSSDADQRDPAAPEPEEQEERKPSATQQKKNTKLSSKTTAKLS TSAKRIQKELAEITLDPPPNCSSSAIVSVSVFYVWPKRVLLLLLWPRETKRLDTPGLEHI QYG >gi568815596f:180882043_181162885|GENSCAN_predicted_CDS_3|372_bp atgtccagtgataggcaaaggtccgatgatgagagccccagcaccagcagtggcagttca gatgcggaccagcgagacccagccgctccagagcctgaagaacaagaggaaagaaaacct tctgccacccagcagaagaaaaacaccaaactctctagcaaaaccactgctaagttatcc actagtgctaaaagaattcagaaggagctagctgaaataacccttgatcctcctcctaat tgcagctcatcagctattgttagtgttagtgtattttatgtgtggcccaagagagttctg ctgctgctgctgtggcccagggaaaccaaaagattggacactcctggactagagcatata cagtatggataa >gi568815596f:180882043_181162885|GENSCAN_predicted_peptide_4|232_aa MSELPFTIASKRIKYLGIQLTRDMKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV KTAILPKVIYRFNAIPIKVPTTFFTELETTTLKFIWHQKRARIAKTILSQKSKAGGITLP DFKLYYKATVTKTAGYWYQNGDIDQWNRTEPSEIIPHIYNYLIFDKPDKNKKWGKDFLFN KWCWEKWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEENLGNTI >gi568815596f:180882043_181162885|GENSCAN_predicted_CDS_4|699_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatatgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgta aaaacggccatactgcccaaggtaatttatagattcaatgccatccccatcaaggtacca acgactttcttcacagaattggaaacaactactttaaagttcatatggcaccaaaaaaga gcccgcattgccaagacaatcctaagccaaaagagcaaagctggaggcatcacgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcagggtactggtaccaaaac ggagatatagaccaatggaacagaacagagccttcagaaataataccacacatctacaac tatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggatttcctatttaat aaatggtgctgggaaaagtggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaatattagacctaaaacc ataaaaaccctagaagaaaacctaggcaataccatttag >gi568815596f:180882043_181162885|GENSCAN_predicted_peptide_5|164_aa MELSGLEVGLGGDNLGTPIHLTCTSLKFRREPEYTEKAHTDTGEDANSTQWPLPGIRFFP HERCKYNTMLNEPLFKDLLYFVYRYSVILTLYVEILSLLNYLGISAENQLTIYVWIYFWA PYSVSLIYMPNPCQYHTILITLCLPSTSESSSMSPPTSFFLCKN >gi568815596f:180882043_181162885|GENSCAN_predicted_CDS_5|495_bp atggagctttcaggactggaagttggtctgggtggggacaatttaggcacaccaattcac ttaacatgcacatctctgaaatttaggagggaaccagaatacacggagaaagcccacaca gacacgggagaagatgcaaactccacacaatggcccctgccaggaatcaggttttttcct catgaacgttgtaagtacaatacaatgttgaatgaaccattattcaaggacctgctgtat tttgtgtatagatactcagttattctaacactttatgtcgaaatcctttccctactgaat taccttggcatttctgctgaaaatcaactgaccatttatgtgtggatctacttctgggct ccttattctgtttcactgatctatatgcccaacccatgccaataccacactattttgatt actctatgtttacctagtacgtcagaatcaagtagtatgagtcctccaacttcgttcttt ctttgcaaaaattag >gi568815596f:180882043_181162885|GENSCAN_predicted_peptide_6|128_aa MDESSCYSTFSSAFVVGVLDLGHSNSAGPKGDNIYEWRSTILGPPGSVYEGGVFFLDITF SSDYPFKPPKKRCGGKVFQNQRLSTQQYKYKFFIPASRIPPLQPCLLTGIMKQLNSGVKK FHPVVEVQ >gi568815596f:180882043_181162885|GENSCAN_predicted_CDS_6|387_bp atggatgagagttcctgttactccacattttcttcagcatttgttgttggtgttttggat ttgggccattctaatagtgctgggcctaaaggagataacatttatgaatggagatcaact atacttggtccaccgggttctgtatatgaaggtggtgtgttttttctggatatcacattt tcatcagattatccatttaagccaccaaagaagaggtgtggtggaaaggtcttccagaac cagaggctaagcacacagcagtacaaatataaattctttattccagcttccagaattcct cctcttcagccttgcctgctcactggaataatgaagcaattaaacagtggtgtaaagaag ttccatccagtggttgaagttcaatag >gi568815596f:180882043_181162885|GENSCAN_predicted_peptide_7|228_aa MRKNQRKKAENFKNRNDSSLPKDHNSSPAREQNCMENGFDELTEVGFRRWVITNSSELKE HVLTQCREAKNLDKRLEELPTRITNLEKNTNDPMELKNTAQELCETYTSMNSQIKQAEER ISEIEGQLNEIKCEDKIREKRMKRNEQSLQQIWDYVKRPNLRLIGVPESDGDNGTKLENT LQDIIQENFFNLARQANIQIQEIQRMPQRYSLRRATPRHIIVRSPRLK >gi568815596f:180882043_181162885|GENSCAN_predicted_CDS_7|687_bp atgaggaaaaaccagcgcaaaaaggctgaaaatttcaaaaaccggaatgactcttctctt ccaaaggatcacaactcctcaccagcgagggaacaaaactgtatggagaatgggtttgat gaattgacagaagtaggcttcagaagatgggtaataacaaactcctccgagctaaaggag catgttttaacccaatgcagggaagctaagaaccttgataaaaggttagaggaattgcca actagaataaccaatttagagaagaacacaaatgacccaatggagctgaaaaacacagca caagaactttgtgaaacatacacaagtatgaatagccaaatcaaacaagcagaagaaagg atatcagagattgaaggccaacttaatgaaataaagtgtgaagataagattagagaaaaa agaatgaaaaggaatgaacaaagcctccaacaaatatgggactatgtgaaaagaccaaac ctacgtttgattggtgtacctgaaagtgacggggataatggaaccaagttggaaaacaca cttcaggatattatccaggaaaacttcttcaacctagcaagacaggccaatattcaaatt caggaaatacagagaatgccacaaagatactccttgagaagagcaactccaagacacata attgtcagatcaccaaggttgaaatga >gi568815596f:180882043_181162885|GENSCAN_predicted_peptide_8|249_aa MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRVNRDMDEAGNHHSQQTITRTE NQTPHVLTHKWELNNEKTWTQDEEHHTLEPVGGLTEESLTYNKLHIFKVYNLLLSLGEPG NIDIPVAINTSITQMFVYKCHPPYKKTEMLGEIANPRVWEENVHLEPSGDRMSEAANFRR ECDSDSSQSTQSCSFFPFPVVCPSLWVRGLAGFSSKAADFHSLSGKPETRWRVLDGTFVA YTGTLQGGD >gi568815596f:180882043_181162885|GENSCAN_predicted_CDS_8|750_bp atgggcaaagacttcatgagtaaaacaccaaaagcaatggcaacaaaagccaaaattgat aaatgggatcttatcaaattaaagagcttctgcacagcaaaagaaactatcatcagagtg aacagggacatggatgaagctggaaaccatcattctcagcaaactatcacaaggacagaa aaccaaacaccacatgttctcactcataagtgggagttgaacaatgagaaaacatggaca caggatgaggaacatcacacactggagcctgttgggggcttaactgaagagtcattgaca tataataaactgcacatatttaaagtgtacaacttgctcctttcactgggagaacctgga aacattgatatcccagtagcaataaatacatctatcacccagatgtttgtttataaatgc catcctccatacaagaaaacagagatgcttggagaaatagctaatcccagagtttgggaa gagaatgtacaccttgagccttctggtgacagaatgagtgaagctgcaaactttcgccgt gagtgtgacagtgacagttcacaaagcacccagagttgttccttcttcccattcccagtt gtctgtccctccctgtgggttcgtggtcttgctggcttcagtagtaaagctgcagacttt cacagtctctcaggaaaacctgaaactcggtggagggtgttagatgggacttttgtggct tacactggaacactgcagggtggtgattaa