GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:56:02 Sequence gi568815597f:20039428_20248398 : 208971 bp : 47.10% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 13968 14007 40 -2.56 1.01 Init + 34596 34638 43 2 1 72 69 58 0.357 2.88 1.02 Term + 37334 37449 116 0 2 86 33 108 0.424 3.83 1.03 PlyA + 37492 37497 6 1.05 2.00 Prom + 43543 43582 40 -3.96 2.01 Init + 46640 46800 161 1 2 96 85 248 0.816 24.60 2.02 Intr + 50362 50468 107 1 2 54 91 101 0.999 6.86 2.03 Term + 51141 51265 125 1 2 117 47 124 0.999 9.75 2.04 PlyA + 51712 51717 6 1.05 3.08 PlyA - 52234 52229 6 1.05 3.07 Term - 53863 53686 178 0 1 23 36 280 0.981 13.46 3.06 Intr - 68276 68110 167 2 2 81 103 84 0.708 7.96 3.05 Intr - 74183 74048 136 1 1 78 22 85 0.373 1.47 3.04 Intr - 74832 74694 139 1 1 94 -17 202 0.554 9.82 3.03 Intr - 76186 76080 107 0 2 94 68 127 0.995 11.16 3.02 Intr - 77050 76906 145 2 1 77 53 150 0.988 9.74 3.01 Init - 80071 80032 40 1 1 82 105 84 0.766 7.98 3.00 Prom - 83210 83171 40 -3.66 4.00 Prom + 83793 83832 40 -4.76 4.01 Init + 86690 86703 14 1 2 72 115 8 0.145 1.52 4.02 Intr + 100739 100791 53 2 2 114 101 89 0.816 11.35 4.03 Intr + 104019 104163 145 1 1 82 82 231 0.495 21.24 4.04 Intr + 105153 105262 110 0 2 120 73 193 0.999 20.93 4.05 Intr + 108763 108929 167 2 2 94 26 274 0.979 21.48 4.06 Intr + 110366 110550 185 1 2 98 11 82 0.032 0.09 4.07 Intr + 116145 116427 283 0 1 88 49 162 0.067 9.72 4.08 Intr + 117528 118209 682 2 1 44 77 346 0.546 20.26 4.09 Intr + 118680 118949 270 1 0 39 116 122 0.276 7.61 4.10 Term + 120203 120447 245 0 2 107 42 76 0.245 0.96 4.11 PlyA + 121119 121124 6 1.05 5.04 PlyA - 123651 123646 6 1.05 5.03 Term - 124730 124564 167 0 2 110 50 148 0.857 11.28 5.02 Intr - 133470 133367 104 2 2 5 59 177 0.694 6.32 5.01 Init - 135767 135580 188 2 2 74 94 159 0.483 11.79 5.00 Prom - 136436 136397 40 -7.46 6.00 Prom + 136654 136693 40 -10.55 6.01 Init + 137327 137405 79 1 1 70 50 76 0.422 3.22 6.02 Intr + 139525 139583 59 2 2 54 86 103 0.351 5.30 6.03 Intr + 142100 142291 192 1 0 15 121 65 0.408 2.19 6.04 Intr + 142559 142651 93 1 0 69 113 64 0.983 7.16 6.05 Intr + 143757 143820 64 2 1 52 94 94 0.920 4.69 6.06 Term + 146605 146726 122 2 2 76 43 63 0.804 -0.76 6.07 PlyA + 150959 150964 6 1.05 7.00 Prom + 150978 151017 40 -8.56 7.01 Sngl + 151135 151977 843 0 0 84 47 616 0.946 52.86 7.02 PlyA + 153800 153805 6 1.05 8.03 PlyA - 155668 155663 6 1.05 8.02 Term - 164437 164306 132 1 0 85 47 63 0.575 -0.01 8.01 Init - 164604 164551 54 0 0 125 94 23 0.647 6.04 8.00 Prom - 182074 182035 40 -0.86 9.04 PlyA - 182560 182555 6 1.05 9.03 Term - 184838 184754 85 1 1 39 55 62 0.032 -4.87 9.02 Intr - 187824 187713 112 1 1 54 64 90 0.128 2.64 9.01 Init - 204425 204383 43 2 1 88 103 42 0.486 6.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 111647 111751 105 1 0 111 52 62 0.917 3.31 S.002 Init + 159556 159654 99 0 0 80 95 52 0.811 5.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_1|52_aa MTVSAMKGKNRVTLGNARVDSISDEFYNIPETGKLPVERRIMEDFPEEEDLM >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_1|159_bp atgacggtcagtgccatgaaggggaaaaacagggtgaccttgggaaatgccagagtggac agcatatcagatgagttctataacattccagagacaggtaagctccctgtggaaagaagg atcatggaagacttcccggaggaggaggacttgatgtag >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_2|130_aa MGYLPGVPAVQGGLLDLKSMIEKVTGKNALTNYGFYGCYCGWGGRGTPKDGTDWCCWAHD HCYGRLEEKGCNIRTQSYKYRFAWGVVTCEPGPFCHVNLCACDRKLVYCLKRNLRSYNPQ YQYFPNILCS >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_2|393_bp atgggctatcttccaggtgtgcctgctgtgcaaggaggcttgctggacctaaaatcaatg atcgagaaggtgacagggaagaacgccctgacaaactacggcttctacggctgttactgc ggctggggcggccgaggaacccccaaggatggcaccgattggtgctgttgggcgcatgac cactgctatgggcggctggaggagaagggctgcaacattcgcacacagtcctacaaatac agattcgcgtggggcgtggtcacctgcgagcccgggcccttctgccatgtgaacctctgt gcctgtgaccggaagctcgtctactgcctcaagagaaacctacggagctacaacccacag taccaatactttcccaacatcctctgctcctag >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_3|303_aa MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK RNLDTYQKRLRFYWRPHCRGQTPGQALKSLQVEAMPVLNSVASLVPKATPEPRRSSINPS WVKPRQSRQEAIEVGSKCILLVGCLCWSPFQDPGLGWGERKPKLSLFPPQGFSPEMMQQL SLALRLGDKSKIPSEEEEGEGEGRGRRGGGREEEEEEEEEEEEEEEEEEEEEYPYTSYLI SPP >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_3|912_bp atggaacttgcactgctgtgtgggctggtggtgatggctggtgtgattccaatccagggc gggatcctgaacctgaacaagatggtcaagcaagtgactgggaaaatgcccatcctctcc tactggccctacggctgtcactgcggactaggtggcagaggccaacccaaagatgccacg gactggtgctgccagacccatgactgctgctatgaccacctgaagacccaggggtgcagc atctacaaggactattacagatacaacttttcccaggggaacatccactgctctgacaag ggaagctggtgtgagcagcagctgtgtgcctgtgacaaggaggtggccttctgcctgaag cgcaacctggacacctaccagaagcgactgcgtttctactggcggccccactgccggggg cagacccctgggcaagctctgaagtccctgcaggtggaggccatgcctgtcttaaactca gttgcatccctggtgcccaaagcaacaccagaaccaagaaggagctccataaatccttct tgggtgaagcctagacaaagccgccaggaggccattgaagtggggtcaaagtgcattctg ctcgtgggctgcctctgctggagccccttccaggatccagggctaggctggggtgagagg aagccaaagctgtccctgttccctcctcaaggattcagccctgagatgatgcagcagctg tcactggctctgcgcctgggtgacaagagcaaaattccatctgaagaagaagaaggagaa ggagaaggaagaggaagaagaggaggaggaagggaggaagaagaggaggaggaagaggag gaagaagaggaggaggaagaagaagaagaagaagaatatccatacacctcctacctcatc agcccaccttag >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_4|717_aa MTGTWSSLGMKKFFTVAILAGSVLSTAHGSLLNLKAMVEAVTGRSAILSFVGYGCYCGLG GRGQPKDEVDWCCHAHDCCYQELFDQGCHPYVDHYDHTIENNTEIVCSDLNKTECDKQTC MCDKNMVLCLMNQTYREEYRGFLNVYCQGPTPNCSIYEPPPEEVHLTFLLTCPSLNSGSA ASTEPAPPARKPLKPPSARTQQPPASSPGSAPGAPLSLGRESREGKNNTPSYTGKPQAII DLLQTIIQTHNPTWADCHQVLMYLFNTDERRRVLQAATKWLEEHVPADYQNPQEYVRIQL PGTDPQWDPDERQYAKAKPPGTKDYRLIQDLHLVNQATVTLHPTVPNPYTLLWLLPAEDS WFTYLGLKDAFFSIRLAPESQKLFAFQWEDPGSGVTTQYTWTQLPQGFKNSTTIFGEALA RDLQKFPTRDLGCVLLQYVDDLLLGHLMAVGCAKGMDTLLWHLEDCGYKVSKKKAQICQQ EVRYLGFTIQQGERSLGSERMQVICNLPEPKTKRQVREFLGAVGFCRLWIPNFAVLAKPL YEVIKGVSESPVERNYVEVLDTVYSSRPDLQDHPWTSVDWELYMDRSSFVNPQGERCAGY AVVTLDAVTEAKSLPQDTSAQKVKLIALIWALELSEGDRVWITDWNIASLQPRWRGPQTI VLTTPTAIKVEGIPAWIHHSHVKPKASETWEVRPSLDNPCKMTLKKMTSPAPVTPGS >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_4|2154_bp atgactggcacatggtctagcctgggtatgaagaagttcttcaccgtggccatccttgct ggcagcgttctgtccacagctcacggcagcctgctcaacctgaaggccatggtggaggcc gtcacagggaggagcgccatcctgtccttcgtgggctacggttgctactgtgggctgggg ggccgtggccagcccaaggatgaggtggactggtgctgccacgcccacgactgctgctac caggaactctttgaccaaggctgtcacccctatgtggaccactatgatcacaccatcgag aacaacactgagatagtctgcagtgacctcaacaagacagagtgtgacaagcagacatgc atgtgtgacaagaacatggttctgtgcctcatgaaccagacgtaccgagaggagtaccgt ggcttcctcaatgtctactgccagggccccacgcccaactgcagcatctatgaaccgccc cctgaggaggtgcaccttaccttcctccttacctgcccgagcctcaactcagggtcagca gcctccacggagccagccccacctgcccgaaaaccactgaagccaccctccgcccggact caacagccgccagcaagctcgcctggcagtgcccctggagcaccattgtcgctggggagg gagagccgggaagggaagaacaataccccatcctataccggaaagcctcaagctataatt gatttgctccaaactataatccagacccacaaccccacctgggctgattgccaccaggtg ctcatgtacctctttaacacagatgaaaggaggagagtgctccaagcagcaactaagtgg ctggaagaacatgttcctgctgattaccaaaacccccaagagtatgtgaggatccaatta ccaggaacagacccccagtgggacccagatgaaagacagtatgcaaaggctaaaccgcca gggaccaaggactacaggctgatacaggacttgcacttggtcaatcaagctacggtgact ttacatccaacagtacctaacccatacacattgttgtggttgctgccagctgaggacagc tggttcacctacctgggcctgaaggacgctttctttagcatcagactagcccctgagagc cagaaactgtttgcctttcagtgggaggatccagggtcaggtgtcaccactcagtacact tggacccagcttccccaagggttcaagaactccaccaccatcttcggggaggccctggct cgagacctgcaaaagtttcccaccagagacctaggctgcgtgttgctccagtatgtcgat gacctcctgctgggacacctcatggcagttgggtgtgccaagggaatggataccctgctc tggcacctggaggactgtgggtataaggtatccaagaagaaagctcagatatgtcaacag gaggtacgttacttgggatttactatccaacagggggagcgcagcctgggatcagaaaga atgcaggtcatttgcaacctaccggagcctaagaccaaaaggcaggtgagagaattctta ggagctgtggggttctgcaggttatggatcccaaactttgcagtactggccaaacctctg tacgaagtcataaagggagtgtcagagagcccagtcgaacgtaactatgtagaggtgttg gacacagtttattctagcaggcctgacctccaagaccatccttggacatcagtagactgg gagctgtacatggacaggagcagcttcgtcaacccacaaggagagaggtgtgcaggatat gcagtggtaaccctggatgctgtcactgaagccaaatcattgccccaggatacttcagcc caaaaggtcaaactcattgctttaatttgggccttagagctaagtgaaggtgatcgggta tggatcacggattggaacatagcctccttgcagccacggtggagaggaccccagaccatc gtcttgaccactcccacagccataaaggtagaaggaatcccagcctggattcaccacagc catgtaaaacctaaagcatctgagacctgggaggtgagaccaagcttggacaacccatgc aagatgactttgaagaagatgacaagccctgctccagtcacacctggaagctga >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_5|152_aa MLIATSFFLFFSSVVAAPTHSSFWQFQRRVKHITGRSAFFSYYGYGCYCGLGDKGIPVDD TDRHSPSSPSPYEKLKEFSCQPVLNSYQFHIVNGAVVCGCTLGPGASCHCRLKACECDKQ SVHCFKESLPTYEKNFKQFSSQPRCGRHKPWC >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_5|459_bp atgctcattgcaacttccttcttcctttttttctcatcggtggtggcagcccccacccac agcagtttctggcagtttcagaggagggtcaaacacatcacggggcgaagtgccttcttc tcatattacggatatggctgctactgtgggcttggggataaagggatccccgtggatgac actgacaggcacagcccctcatctccctctccctacgagaagctgaaggagttcagctgc cagcctgtgttgaacagctaccagttccacatcgtcaatggcgcagtggtttgtggatgc acccttggtcctggtgccagctgccactgcaggctgaaggcctgtgagtgtgacaagcaa tccgtgcactgcttcaaagagagcctgcccacctatgagaaaaacttcaagcagttctcc agccagcccaggtgtggcagacataagccctggtgctag >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_6|202_aa MDLDKPPLNESSKRVYQSPVDPKLFPRLGYQLLLTLSLTENHSLLQLSCFRAYPKHHREG KTQNSPSSEAGITTLLDSCFRCSSNKLLSFISQAQQPDRNGVRAASLNRKRGGDNDSGPP QNASKIPQAPEGENTWSTVAKPHNNPVLIITILEMRKLTSSEGRFRSRRKHVAPHPAALP GRPGLFTSRTEPGRRRRMERNG >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_6|609_bp atggatctggacaaacctccactcaacgaatccagcaaaagagtctatcagtctccagta gaccccaaactgtttcccaggctggggtaccagctgctgctgacgctgtccctcactgag aaccactctctgctgcagctctcctgcttccgtgcctatccaaaacaccaccgagaaggc aaaacccaaaacagcccaagcagcgaagcaggtatcacaacactcttagactcttgtttt agatgcagttctaacaagctgctcagcttcatttctcaggcccagcagccggacaggaat ggagtcagagcagcttctcttaataggaagagaggtggagacaatgattctggcccccca cagaatgccagcaagattccacaagctcctgaaggtgaaaatacttggtcaacagtagcc aagccgcacaacaatcctgtgcttatcatcaccattctagagatgcggaaactgacatcc agcgaaggccggttccggtcacgccggaagcacgtggccccccaccctgcggcgttgccc gggagaccagggttgtttaccagcaggacagagcccgggcgcaggcggcggatggagcgg aacggctag >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_7|280_aa MATEAPVNIAPPECSTVVSTAVDSLIWQPNSLNMHMIRPKSAKGRTRPSLQKSQGVEVCA HHIPSPPPAIPYELPSSQKPGACAPKSPNQGASDEIPELQQQVPTGASSSLNKYPVLPSI NRKNLEEEAVETVAKKASSLQLSSIRALYQDETGTMKTSEEDSRARACAVERKFIVRTKK QGSSRAGNLEEPSDQEPRLLLAVRSPTGQRFVRHFRPTDDLQTIVAVAEQKNKTSYRHCS IETMEVPRRRFSDLTKSLQECRIPHKSVLGISLEDGEGWP >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_7|843_bp atggccacagaagcccctgtgaatatagcaccacctgagtgtagcactgttgtcagcaca gcagttgacagcctcatttggcagccaaactcactaaatatgcacatgataaggcccaag tccgccaagggacggacaagaccgagtctgcagaaatcccagggcgtggaggtgtgcgct catcatataccatctccgcctccagccattccctatgagttgccaagcagccaaaaacca ggagcctgtgcacccaaatctccaaaccagggagcttctgatgagatccctgagctgcag cagcaagtacccactggggcttcctcttctctcaataagtatccagtccttccttccatc aacagaaagaacctggaggaggaggctgtggaaaccgttgccaaaaaggccagctcactg caactgagcagtatccgggctctttaccaagacgagacgggcaccatgaagacaagtgaa gaagattccagagctcgagcttgtgccgtggagaggaaattcatcgtccgaaccaagaaa cagggctcttccagggctggaaatctggaggaaccatcggaccaagaaccaaggttgctg cttgctgttagatcaccaacaggccaaaggtttgtacgccatttccggccaacagatgat ttgcaaaccattgttgctgtggccgaacagaaaaacaaaacctcctaccgacactgcagc attgaaacaatggaggtgcccaggaggcgattttctgacctcaccaaatctctgcaagag tgcagaatcccccacaagtctgtgctgggcatctcactggaagatggggaagggtggccc tga >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_8|61_aa MAKLLFLSESHFLLCRMELEIPPRFHRRNGTPGGHIGSSLGTFTVWTVQLRHRKNEALGK T >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_8|186_bp atggccaagttgctcttcctctctgagtcccatttcctcctctgtagaatggagctagaa attccccccaggttccaccgaagaaacggcaccccaggtggccacataggaagttctctg ggcactttcactgtctggacagtgcagcttcggcacagaaagaatgaggccttgggcaaa acatag >gi568815597f:20039428_20248398|GENSCAN_predicted_peptide_9|79_aa MAYGQQIERDGKISGAIVKADAEKARGGREVGNCSLKFYDKKQGKETLDKSWFHHHTVCE GYVGISRTQGPGDRNQTTD >gi568815597f:20039428_20248398|GENSCAN_predicted_CDS_9|240_bp atggcttatggtcagcagattgagagggatggtaaaatctcaggtgctattgtcaaggcc gatgcagagaaagcaagaggaggcagagaggtgggtaactgctctttgaagttctatgac aagaaacagggtaaagagactctggacaagtcctggtttcatcatcacacagtgtgtgaa ggttatgttggaatcagtaggacacaaggccctggtgacaggaaccagaccacggactga