GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:11:06

Sequence gi568815596r:187246187_187485595 : 239409 bp : 33.71% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9229   9419  191  0  2   54   49   148 0.085   4.96
 1.02 Intr +  18731  18786   56  2  2   90   59    50 0.423   0.00
 1.03 Intr +  20386  20502  117  2  0   64   65    64 0.236   1.22
 1.04 Term +  20867  21027  161  0  2   15   48   178 0.620   3.62
 1.05 PlyA +  23024  23029    6                               1.05

 2.02 PlyA -  23043  23038    6                               1.05
 2.01 Sngl -  30620  30450  171  2  0   57   48   160 0.840   3.48
 2.00 Prom -  41953  41914   40                              -3.65

 3.00 Prom +  53318  53357   40                              -3.75
 3.01 Sngl +  82667  82936  270  1  0   87   38   204 0.742  10.33
 3.02 PlyA +  84038  84043    6                               1.05

 4.04 PlyA -  84093  84088    6                               1.05
 4.03 Term - 100213  99998  216  1  0   77   48   138 0.900   4.86
 4.02 Intr - 106146 105928  219  0  0   94   80   198 0.957  17.18
 4.01 Init - 111542 111537    6  2  0   61   66    17 0.126  -3.27
 4.00 Prom - 114914 114875   40                              -5.85

 5.00 Prom + 118537 118576   40                              -4.65
 5.01 Init + 119608 119657   50  0  2   69   97     3 0.281   0.00
 5.02 Term + 131790 132048  259  0  1    9   47   452 0.870  27.24
 5.03 PlyA + 132758 132763    6                               1.05

 6.06 PlyA - 133076 133071    6                               1.05
 6.05 Term - 133658 133596   63  2  0  100   55    26 0.930  -2.59
 6.04 Intr - 134393 134281  113  0  2   67   95    36 0.945   1.38
 6.03 Intr - 134601 134491  111  1  0   30   98    85 0.880   3.13
 6.02 Intr - 137106 136987  120  1  0   62  121    72 0.546   7.45
 6.01 Init - 138801 138783   19  0  1   73  102    30 0.447   3.22
 6.00 Prom - 139747 139708   40                              -3.55

 7.00 Prom + 150591 150630   40                              -4.35
 7.01 Init + 155735 155859  125  1  2   66   99    60 0.906   4.59
 7.02 Term + 159464 159659  196  2  1   48   50   147 0.851   2.80
 7.03 PlyA + 160329 160334    6                               1.05

 8.00 Prom + 168204 168243   40                              -2.05
 8.01 Init + 169345 169656  312  0  0   42   69   405 0.079  31.37
 8.02 Term + 189035 189241  207  1  0   58   37   113 0.001  -0.44
 8.03 PlyA + 190357 190362    6                               1.05

 9.06 PlyA - 190687 190682    6                               1.05
 9.05 Term - 210941 210794  148  1  1   21   47   202 0.674   5.79
 9.04 Intr - 215609 215429  181  0  1   89   25    58 0.001  -2.40
 9.03 Intr - 221746 221567  180  2  0   75   75    46 0.009   0.92
 9.02 Intr - 238030 237938   93  2  0   85   91    44 0.859   3.42
 9.01 Intr - 238801 238625  177  2  0   58   91   164 0.985  12.67

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 169345 169812  468  0  0   42   50   428 0.908  30.28

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_1|174_aa
SVLLLTKNTLTVKQPHIGLSGDIPEKGIVIIGKENSMPVIIPENLLVKQAVEVKDSDADD
PDPVGDGHIGVGNRRAEEREKGWTTVMEQNSTQVTWGPANRKCQPGICVLSLRRKEMDGS
EASGRCIQINPTENQEQDAGQVGDTICAQNAEALASHAKVLTLEQHEAMKDSRQ

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_1|525_bp
agtgtactcctacttactaaaaatacattaactgtaaaacagcctcacataggtctttca
ggagatattccagaaaaaggcattgtcatcatagggaaagagaattctatgcctgttatt
atccctgaaaaccttctagtgaaacaagctgtggaagtcaaagacagtgatgcagatgat
cctgaccctgttggggacgggcacattggtgtgggaaataggagagctgaagagagagag
aaggggtggactactgtcatggaacagaattccactcaggtaacatggggcccagccaat
cgcaaatgccaaccaggcatttgtgtgctgtctttaagaagaaaggagatggatgggtct
gaggccagtgggagatgcatacagataaatccaacagagaatcaggagcaagatgcagga
caagtcggagatacaatttgtgctcagaatgccgaggcccttgccagccatgcaaaggtc
ttaaccctagagcagcatgaagccatgaaggattctaggcagtga

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_2|56_aa
MAKETLPLTLQKQENTLGDYYKHLYEHKLENLEETDKILETCNLPRLKKEETETPN

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_2|171_bp
atggcaaaggaaacattgccactgaccctgcagaaacaggaaaacacccttggagactat
tacaaacacctctatgaacacaaactagaaaacttagaagaaacggataaaatcctagaa
acatgcaacctcccaagattgaaaaaggaagaaactgaaacacctaactga

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_3|89_aa
MEPSRAVCRGNVRLEPPHRVPTGALPSGAVRRGPLPSRPENGRSIDSLHHASGKAAGTQC
QPVKGAMGPVPYRTTGVELPKDLEIYSLC

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_3|270_bp
atggagccctctagggcagtgtgcaggggaaatgtgaggttggagcccccacacagagtt
cccactggggcactgcctagtggtgctgtgagaagaggaccactgccctccagacctgag
aatggcagatccattgacagcttgcaccatgcatctggaaaagctgcaggcactcaatgc
cagcctgtgaaaggagccatggggcctgtaccctacagaaccacaggagtggaactgccc
aaggacttggaaatttattccttgtgttag

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_4|146_aa
MLVNLFFLLNIVRVLITKLKVTHQAESNLYMKAVRATLILVPLLGIEFVLIPWRPEGKIA
EEVYDYIMHILMHFQVQAILRRNWNQYKIQFGNSFSNSEALRSASYTVSTISDGPGYSHD
CPSEHLNGKSIHDIENVLLKPENLYN

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_4|441_bp
atgctggtgaatctttttttcttgttaaatattgtacgcgttctcatcaccaagttaaaa
gttacacaccaagcggaatccaatctgtacatgaaagctgtgagagctactcttatcttg
gtgccattgcttggcattgaatttgtgctgattccatggcgacctgaaggaaagattgca
gaggaggtatatgactacatcatgcacatccttatgcacttccaggttcaagcaattctg
agaagaaactggaatcaatacaaaatccaatttggaaacagcttttccaactcagaagct
cttcgtagtgcgtcttacacagtgtcaacaatcagtgatggtccaggttatagtcatgac
tgtcctagtgaacacttaaatggaaaaagcatccatgatattgaaaatgttctcttaaaa
ccagaaaatttatataattga

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_5|102_aa
MNRKKDCMLSLTCGSFNRRRKLEEEGGVGRGIREEEEEEEKEEQEEGEGEEETRKRRRRR
RKKKNKKKEKEKKKEEREGAAVAVAATGAKGAEDDYNAMKIQ

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_5|309_bp
atgaacagaaagaaagactgcatgctctcactcacttgtggaagctttaatagaagaagg
aagttggaggaagaaggaggggtaggaagaggaataagagaagaggaggaggaggaggaa
aaagaagaacaagaagaaggagaaggagaagaagaaacaagaaagagaaggaggaggagg
aggaaaaagaagaacaagaagaaggagaaggagaagaagaaagaagaaagagaaggagca
gcagtggcggtggcagccacaggagcaaaaggagcagaagatgactacaatgcgatgaag
atacaatga

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_6|141_aa
MGPKVEAELEESPEDSIQLGVTRNKIMTAQYECYQKIMQDPIQQAEGVYCNRTWDGWLCW
NDVAAGTESMQLCPDYFQDFDPSEKVTKICDQDGNWFRHPASNRTWTNYTQCNVNTHEKV
KLFDLEKAILHLSASVPSYMK

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_6|426_bp
atggggcccaaggtggaagcagaattagaagagagtcctgaggactcaattcagttggga
gttactagaaataaaatcatgacagctcaatatgaatgttaccaaaagattatgcaagac
cccattcaacaagcagaaggcgtttactgcaacagaacctgggatggatggctctgctgg
aacgatgttgcagcaggaactgaatcaatgcagctctgccctgattactttcaggacttt
gatccatcagaaaaagttacaaagatctgtgaccaagatggaaactggtttagacatcca
gcaagcaacagaacatggacaaattatacccagtgtaatgttaacacccacgagaaagtg
aagctgtttgaccttgaaaaagccattttacatctctctgcttcagtcccctcatatatg
aaatga

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_7|106_aa
MEKRNQGKKEGRKEEGKEGRKEEGKEGRKAGRKERNQYVYLRGRAKPVKLKVWDSLPGIS
TRLLVGDKFVQGDKMQTAETVRKKSLQPAIKLSRKFVSLEEWLLCS

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_7|321_bp
atggaaaaaagaaatcaaggaaagaaggaaggaagaaaggaagaagggaaggaaggaaga
aaggaagaagggaaagaaggaaggaaggcaggaaggaaggagagaaatcaatatgtgtat
ctcagaggaagagcaaaaccagtgaaactgaaggtgtgggattcattacctggaattagc
actcggttattagttggagataaatttgtgcaaggagataaaatgcagacggctgaaaca
gtaagaaagaaaagcttgcagccagcaattaagcttagcagaaagtttgttagtttggag
gaatggctgctgtgttcctga

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_8|172_aa
MFGTGNSANVSVVDLTCRLEKPANYDDVKKVVNQASEGPLKDILGYTEHQVVSSDFNSDT
HSSTFSEGAGIALNDHFVKLISLYDNEFGYTNRVVDHGPHGLQKSLRKSTIMTKGELTYP
HISYLVATEGAREGAERSYTLSNNQTLCELTKRELTYHQGDGARSFMREPSP

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_8|519_bp
atgtttgggacaggaaactctgccaatgtgtcagtggtggacctgacctgccgtctggaa
aaacctgccaattatgatgacgtcaagaaggtggtgaaccaggcatcggagggccccctc
aaggacattctgggctacactgagcaccaggttgtctcctccgacttcaacagtgacacc
cactcttccaccttcagtgagggggctggcattgctctcaacgaccactttgtcaagctc
atttccttgtatgacaatgaatttggctacaccaacagggtggtggatcatggtccacat
ggcctccagaaaagcctcaggaagtctacaatcatgacaaaaggggagctgacatatcct
catatctcctatctcgtggcaacagagggagcaagagagggagcagaaaggtcctatact
ctttcaaacaaccagactttgtgtgaactaactaagcgagaactcacttatcaccaaggg
gatggtgctaggtcattcatgagggagccatccccatga

>gi568815596r:187246187_187485595|GENSCAN_predicted_peptide_9|259_aa
XKPDFCFLEEDPGICRGYITRYFYNNQTKQCERFKYGGCLGNMNNFETLEECKNICEDGP
NGFQVDNYGTQLNAVNNSLTPQSTKVPSLFEFHGPSWCLTPADRGLCRANENRFYYNSVI
GKCRPFKYSGCGGNENNFTSKQECLRACKKVNGLLSCQCLSCALPLACSPRHPLDVRLLV
SSSADPLLWTSRRLCIYLPARVSVFIRPGWGLFRRTTIQKKALFIIRDDSSMRIIAPEDL
PVGQDVRVEDSDIDDPDHA

>gi568815596r:187246187_187485595|GENSCAN_predicted_CDS_9|780_bp
naaaagccagatttctgctttttggaagaagatcctggaatatgtcgaggttatattacc
aggtatttttataacaatcagacaaaacagtgtgaacgtttcaagtatggtggatgcctg
ggcaatatgaacaattttgagacactggaagaatgcaagaacatttgtgaagatggtccg
aatggtttccaggtggataattatggaacccagctcaatgctgtgaataactccctgact
ccgcaatcaaccaaggttcccagcctttttgaatttcacggtccctcatggtgtctcact
ccagcagacagaggattgtgtcgtgccaatgagaacagattctactacaattcagtcatt
gggaaatgccgcccatttaagtacagtggatgtgggggaaatgaaaacaattttacttcc
aaacaagaatgtctgagggcatgtaaaaaagtcaatggcctgctgtcctgccagtgtctg
tcatgtgctcttccactggcatgctcccctcgacatcctctcgatgtccggctgcttgtg
tcttcttccgctgatcccctcctctggacgtccagacgcttgtgtatctatctgcctgct
agggtctcagtttttatccgcccaggatgggggctgttcagaaggacgacaatccagaag
aaggcattgtttatcatcagagatgacagctccatgcgtattattgcccctgaagacctt
ccagtaggacaggatgtgagggtagaagacagtgatattgatgatcctgaccatgcgtag