GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:54:08

Sequence gi568815597f:20090562_20291401 : 200840 bp : 47.87% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -   1100   1095    6                               1.05
 1.07 Term -   2729   2552  178  1  1   23   36   280 0.979  13.46
 1.06 Intr -  17142  16976  167  0  2   81  103    84 0.708   7.96
 1.05 Intr -  23049  22914  136  2  1   78   22    85 0.372   1.47
 1.04 Intr -  23698  23560  139  2  1   94  -17   202 0.554   9.82
 1.03 Intr -  25052  24946  107  1  2   94   68   127 0.995  11.16
 1.02 Intr -  25916  25772  145  0  1   77   53   150 0.988   9.74
 1.01 Init -  28937  28898   40  2  1   82  105    84 0.766   7.98
 1.00 Prom -  32076  32037   40                              -3.66

 2.00 Prom +  32659  32698   40                              -4.76
 2.01 Init +  35556  35569   14  2  2   72  115     8 0.145   1.52
 2.02 Intr +  49605  49657   53  0  2  114  101    89 0.816  11.35
 2.03 Intr +  52885  53029  145  2  1   82   82   231 0.495  21.24
 2.04 Intr +  54019  54128  110  1  2  120   73   193 0.999  20.93
 2.05 Intr +  57629  57795  167  0  2   94   26   274 0.979  21.48
 2.06 Intr +  59232  59416  185  2  2   98   11    82 0.032   0.09
 2.07 Intr +  65011  65293  283  1  1   88   49   162 0.067   9.72
 2.08 Intr +  66394  67075  682  0  1   44   77   346 0.546  20.26
 2.09 Intr +  67546  67815  270  2  0   39  116   122 0.276   7.61
 2.10 Term +  69069  69313  245  1  2  107   42    76 0.245   0.96
 2.11 PlyA +  69985  69990    6                               1.05

 3.04 PlyA -  72517  72512    6                               1.05
 3.03 Term -  73596  73430  167  1  2  110   50   148 0.857  11.28
 3.02 Intr -  82336  82233  104  0  2    5   59   177 0.694   6.32
 3.01 Init -  84633  84446  188  0  2   74   94   159 0.483  11.79
 3.00 Prom -  85302  85263   40                              -7.46

 4.00 Prom +  85520  85559   40                             -10.55
 4.01 Init +  86193  86271   79  2  1   70   50    76 0.422   3.22
 4.02 Intr +  88391  88449   59  0  2   54   86   103 0.351   5.30
 4.03 Intr +  90966  91157  192  2  0   15  121    65 0.408   2.19
 4.04 Intr +  91425  91517   93  2  0   69  113    64 0.983   7.16
 4.05 Intr +  92623  92686   64  0  1   52   94    94 0.920   4.69
 4.06 Term +  95471  95592  122  0  2   76   43    63 0.804  -0.76
 4.07 PlyA +  99825  99830    6                               1.05

 5.00 Prom +  99844  99883   40                              -8.56
 5.01 Sngl + 100001 100843  843  1  0   84   47   616 0.946  52.86
 5.02 PlyA + 102666 102671    6                               1.05

 6.03 PlyA - 104534 104529    6                               1.05
 6.02 Term - 113303 113172  132  2  0   85   47    63 0.575  -0.01
 6.01 Init - 113470 113417   54  1  0  125   94    23 0.647   6.04
 6.00 Prom - 130940 130901   40                              -0.86

 7.00 Prom + 146797 146836   40                              -4.56
 7.01 Init + 150021 150126  106  2  1   61  111    31 0.116   3.08
 7.02 Intr + 167660 167715   56  0  2  120   59    40 0.436   3.00
 7.03 Term + 170103 170210  108  2  0  114   45    25 0.207  -0.69
 7.04 PlyA + 174609 174614    6                               1.05

 8.03 PlyA - 174747 174742    6                               1.05
 8.02 Term - 185007 184893  115  2  1  121   47    34 0.879   0.64
 8.01 Init - 185521 185436   86  1  2   76  107    54 0.437   6.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  60513  60617  105  2  0  111   52    62 0.917   3.31
S.002 Init + 108422 108520   99  1  0   80   95    52 0.811   5.36

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:20090562_20291401|GENSCAN_predicted_peptide_1|303_aa
MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDAT
DWCCQTHDCCYDHLKTQGCSIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLK
RNLDTYQKRLRFYWRPHCRGQTPGQALKSLQVEAMPVLNSVASLVPKATPEPRRSSINPS
WVKPRQSRQEAIEVGSKCILLVGCLCWSPFQDPGLGWGERKPKLSLFPPQGFSPEMMQQL
SLALRLGDKSKIPSEEEEGEGEGRGRRGGGREEEEEEEEEEEEEEEEEEEEEYPYTSYLI
SPP

>gi568815597f:20090562_20291401|GENSCAN_predicted_CDS_1|912_bp
atggaacttgcactgctgtgtgggctggtggtgatggctggtgtgattccaatccagggc
gggatcctgaacctgaacaagatggtcaagcaagtgactgggaaaatgcccatcctctcc
tactggccctacggctgtcactgcggactaggtggcagaggccaacccaaagatgccacg
gactggtgctgccagacccatgactgctgctatgaccacctgaagacccaggggtgcagc
atctacaaggactattacagatacaacttttcccaggggaacatccactgctctgacaag
ggaagctggtgtgagcagcagctgtgtgcctgtgacaaggaggtggccttctgcctgaag
cgcaacctggacacctaccagaagcgactgcgtttctactggcggccccactgccggggg
cagacccctgggcaagctctgaagtccctgcaggtggaggccatgcctgtcttaaactca
gttgcatccctggtgcccaaagcaacaccagaaccaagaaggagctccataaatccttct
tgggtgaagcctagacaaagccgccaggaggccattgaagtggggtcaaagtgcattctg
ctcgtgggctgcctctgctggagccccttccaggatccagggctaggctggggtgagagg
aagccaaagctgtccctgttccctcctcaaggattcagccctgagatgatgcagcagctg
tcactggctctgcgcctgggtgacaagagcaaaattccatctgaagaagaagaaggagaa
ggagaaggaagaggaagaagaggaggaggaagggaggaagaagaggaggaggaagaggag
gaagaagaggaggaggaagaagaagaagaagaagaatatccatacacctcctacctcatc
agcccaccttag

>gi568815597f:20090562_20291401|GENSCAN_predicted_peptide_2|717_aa
MTGTWSSLGMKKFFTVAILAGSVLSTAHGSLLNLKAMVEAVTGRSAILSFVGYGCYCGLG
GRGQPKDEVDWCCHAHDCCYQELFDQGCHPYVDHYDHTIENNTEIVCSDLNKTECDKQTC
MCDKNMVLCLMNQTYREEYRGFLNVYCQGPTPNCSIYEPPPEEVHLTFLLTCPSLNSGSA
ASTEPAPPARKPLKPPSARTQQPPASSPGSAPGAPLSLGRESREGKNNTPSYTGKPQAII
DLLQTIIQTHNPTWADCHQVLMYLFNTDERRRVLQAATKWLEEHVPADYQNPQEYVRIQL
PGTDPQWDPDERQYAKAKPPGTKDYRLIQDLHLVNQATVTLHPTVPNPYTLLWLLPAEDS
WFTYLGLKDAFFSIRLAPESQKLFAFQWEDPGSGVTTQYTWTQLPQGFKNSTTIFGEALA
RDLQKFPTRDLGCVLLQYVDDLLLGHLMAVGCAKGMDTLLWHLEDCGYKVSKKKAQICQQ
EVRYLGFTIQQGERSLGSERMQVICNLPEPKTKRQVREFLGAVGFCRLWIPNFAVLAKPL
YEVIKGVSESPVERNYVEVLDTVYSSRPDLQDHPWTSVDWELYMDRSSFVNPQGERCAGY
AVVTLDAVTEAKSLPQDTSAQKVKLIALIWALELSEGDRVWITDWNIASLQPRWRGPQTI
VLTTPTAIKVEGIPAWIHHSHVKPKASETWEVRPSLDNPCKMTLKKMTSPAPVTPGS

>gi568815597f:20090562_20291401|GENSCAN_predicted_CDS_2|2154_bp
atgactggcacatggtctagcctgggtatgaagaagttcttcaccgtggccatccttgct
ggcagcgttctgtccacagctcacggcagcctgctcaacctgaaggccatggtggaggcc
gtcacagggaggagcgccatcctgtccttcgtgggctacggttgctactgtgggctgggg
ggccgtggccagcccaaggatgaggtggactggtgctgccacgcccacgactgctgctac
caggaactctttgaccaaggctgtcacccctatgtggaccactatgatcacaccatcgag
aacaacactgagatagtctgcagtgacctcaacaagacagagtgtgacaagcagacatgc
atgtgtgacaagaacatggttctgtgcctcatgaaccagacgtaccgagaggagtaccgt
ggcttcctcaatgtctactgccagggccccacgcccaactgcagcatctatgaaccgccc
cctgaggaggtgcaccttaccttcctccttacctgcccgagcctcaactcagggtcagca
gcctccacggagccagccccacctgcccgaaaaccactgaagccaccctccgcccggact
caacagccgccagcaagctcgcctggcagtgcccctggagcaccattgtcgctggggagg
gagagccgggaagggaagaacaataccccatcctataccggaaagcctcaagctataatt
gatttgctccaaactataatccagacccacaaccccacctgggctgattgccaccaggtg
ctcatgtacctctttaacacagatgaaaggaggagagtgctccaagcagcaactaagtgg
ctggaagaacatgttcctgctgattaccaaaacccccaagagtatgtgaggatccaatta
ccaggaacagacccccagtgggacccagatgaaagacagtatgcaaaggctaaaccgcca
gggaccaaggactacaggctgatacaggacttgcacttggtcaatcaagctacggtgact
ttacatccaacagtacctaacccatacacattgttgtggttgctgccagctgaggacagc
tggttcacctacctgggcctgaaggacgctttctttagcatcagactagcccctgagagc
cagaaactgtttgcctttcagtgggaggatccagggtcaggtgtcaccactcagtacact
tggacccagcttccccaagggttcaagaactccaccaccatcttcggggaggccctggct
cgagacctgcaaaagtttcccaccagagacctaggctgcgtgttgctccagtatgtcgat
gacctcctgctgggacacctcatggcagttgggtgtgccaagggaatggataccctgctc
tggcacctggaggactgtgggtataaggtatccaagaagaaagctcagatatgtcaacag
gaggtacgttacttgggatttactatccaacagggggagcgcagcctgggatcagaaaga
atgcaggtcatttgcaacctaccggagcctaagaccaaaaggcaggtgagagaattctta
ggagctgtggggttctgcaggttatggatcccaaactttgcagtactggccaaacctctg
tacgaagtcataaagggagtgtcagagagcccagtcgaacgtaactatgtagaggtgttg
gacacagtttattctagcaggcctgacctccaagaccatccttggacatcagtagactgg
gagctgtacatggacaggagcagcttcgtcaacccacaaggagagaggtgtgcaggatat
gcagtggtaaccctggatgctgtcactgaagccaaatcattgccccaggatacttcagcc
caaaaggtcaaactcattgctttaatttgggccttagagctaagtgaaggtgatcgggta
tggatcacggattggaacatagcctccttgcagccacggtggagaggaccccagaccatc
gtcttgaccactcccacagccataaaggtagaaggaatcccagcctggattcaccacagc
catgtaaaacctaaagcatctgagacctgggaggtgagaccaagcttggacaacccatgc
aagatgactttgaagaagatgacaagccctgctccagtcacacctggaagctga

>gi568815597f:20090562_20291401|GENSCAN_predicted_peptide_3|152_aa
MLIATSFFLFFSSVVAAPTHSSFWQFQRRVKHITGRSAFFSYYGYGCYCGLGDKGIPVDD
TDRHSPSSPSPYEKLKEFSCQPVLNSYQFHIVNGAVVCGCTLGPGASCHCRLKACECDKQ
SVHCFKESLPTYEKNFKQFSSQPRCGRHKPWC

>gi568815597f:20090562_20291401|GENSCAN_predicted_CDS_3|459_bp
atgctcattgcaacttccttcttcctttttttctcatcggtggtggcagcccccacccac
agcagtttctggcagtttcagaggagggtcaaacacatcacggggcgaagtgccttcttc
tcatattacggatatggctgctactgtgggcttggggataaagggatccccgtggatgac
actgacaggcacagcccctcatctccctctccctacgagaagctgaaggagttcagctgc
cagcctgtgttgaacagctaccagttccacatcgtcaatggcgcagtggtttgtggatgc
acccttggtcctggtgccagctgccactgcaggctgaaggcctgtgagtgtgacaagcaa
tccgtgcactgcttcaaagagagcctgcccacctatgagaaaaacttcaagcagttctcc
agccagcccaggtgtggcagacataagccctggtgctag

>gi568815597f:20090562_20291401|GENSCAN_predicted_peptide_4|202_aa
MDLDKPPLNESSKRVYQSPVDPKLFPRLGYQLLLTLSLTENHSLLQLSCFRAYPKHHREG
KTQNSPSSEAGITTLLDSCFRCSSNKLLSFISQAQQPDRNGVRAASLNRKRGGDNDSGPP
QNASKIPQAPEGENTWSTVAKPHNNPVLIITILEMRKLTSSEGRFRSRRKHVAPHPAALP
GRPGLFTSRTEPGRRRRMERNG

>gi568815597f:20090562_20291401|GENSCAN_predicted_CDS_4|609_bp
atggatctggacaaacctccactcaacgaatccagcaaaagagtctatcagtctccagta
gaccccaaactgtttcccaggctggggtaccagctgctgctgacgctgtccctcactgag
aaccactctctgctgcagctctcctgcttccgtgcctatccaaaacaccaccgagaaggc
aaaacccaaaacagcccaagcagcgaagcaggtatcacaacactcttagactcttgtttt
agatgcagttctaacaagctgctcagcttcatttctcaggcccagcagccggacaggaat
ggagtcagagcagcttctcttaataggaagagaggtggagacaatgattctggcccccca
cagaatgccagcaagattccacaagctcctgaaggtgaaaatacttggtcaacagtagcc
aagccgcacaacaatcctgtgcttatcatcaccattctagagatgcggaaactgacatcc
agcgaaggccggttccggtcacgccggaagcacgtggccccccaccctgcggcgttgccc
gggagaccagggttgtttaccagcaggacagagcccgggcgcaggcggcggatggagcgg
aacggctag

>gi568815597f:20090562_20291401|GENSCAN_predicted_peptide_5|280_aa
MATEAPVNIAPPECSTVVSTAVDSLIWQPNSLNMHMIRPKSAKGRTRPSLQKSQGVEVCA
HHIPSPPPAIPYELPSSQKPGACAPKSPNQGASDEIPELQQQVPTGASSSLNKYPVLPSI
NRKNLEEEAVETVAKKASSLQLSSIRALYQDETGTMKTSEEDSRARACAVERKFIVRTKK
QGSSRAGNLEEPSDQEPRLLLAVRSPTGQRFVRHFRPTDDLQTIVAVAEQKNKTSYRHCS
IETMEVPRRRFSDLTKSLQECRIPHKSVLGISLEDGEGWP

>gi568815597f:20090562_20291401|GENSCAN_predicted_CDS_5|843_bp
atggccacagaagcccctgtgaatatagcaccacctgagtgtagcactgttgtcagcaca
gcagttgacagcctcatttggcagccaaactcactaaatatgcacatgataaggcccaag
tccgccaagggacggacaagaccgagtctgcagaaatcccagggcgtggaggtgtgcgct
catcatataccatctccgcctccagccattccctatgagttgccaagcagccaaaaacca
ggagcctgtgcacccaaatctccaaaccagggagcttctgatgagatccctgagctgcag
cagcaagtacccactggggcttcctcttctctcaataagtatccagtccttccttccatc
aacagaaagaacctggaggaggaggctgtggaaaccgttgccaaaaaggccagctcactg
caactgagcagtatccgggctctttaccaagacgagacgggcaccatgaagacaagtgaa
gaagattccagagctcgagcttgtgccgtggagaggaaattcatcgtccgaaccaagaaa
cagggctcttccagggctggaaatctggaggaaccatcggaccaagaaccaaggttgctg
cttgctgttagatcaccaacaggccaaaggtttgtacgccatttccggccaacagatgat
ttgcaaaccattgttgctgtggccgaacagaaaaacaaaacctcctaccgacactgcagc
attgaaacaatggaggtgcccaggaggcgattttctgacctcaccaaatctctgcaagag
tgcagaatcccccacaagtctgtgctgggcatctcactggaagatggggaagggtggccc
tga

>gi568815597f:20090562_20291401|GENSCAN_predicted_peptide_6|61_aa
MAKLLFLSESHFLLCRMELEIPPRFHRRNGTPGGHIGSSLGTFTVWTVQLRHRKNEALGK
T

>gi568815597f:20090562_20291401|GENSCAN_predicted_CDS_6|186_bp
atggccaagttgctcttcctctctgagtcccatttcctcctctgtagaatggagctagaa
attccccccaggttccaccgaagaaacggcaccccaggtggccacataggaagttctctg
ggcactttcactgtctggacagtgcagcttcggcacagaaagaatgaggccttgggcaaa
acatag

>gi568815597f:20090562_20291401|GENSCAN_predicted_peptide_7|89_aa
MSDRRSQDDAWVPGFNHSMKKGNTGGRAGLERFISDQVPGFSGLGLVSSTPVLRSPLGTS
YWKPKPRTDEDGVCSGRPLGTQSKSENSR

>gi568815597f:20090562_20291401|GENSCAN_predicted_CDS_7|270_bp
atgagcgacaggagatcacaggatgatgcctgggtccctggtttcaaccactccatgaag
aagggcaacactggaggtagagcaggcttggagagatttatttcagaccaagttcctggc
ttctccggcctggggttggtgtcctcaacaccagtgctcaggtctcctcttggtacctcc
tactggaagccaaagccaaggacagatgaagatggggtttgcagtggccggcctctgggg
acacagagcaaatccgagaacagcagatga

>gi568815597f:20090562_20291401|GENSCAN_predicted_peptide_8|66_aa
MGQADHQTLWGLTNMGFYLKGIQKPSENCFWNIGFTSRYCSLPGMLCLILSELTGSRVFP
DGSAPG

>gi568815597f:20090562_20291401|GENSCAN_predicted_CDS_8|201_bp
atgggacaggctgaccaccagaccctgtggggtctaacgaacatgggcttttatctcaaa
ggaattcagaagccatcagagaactgcttctggaacattggatttacttccagatactgc
tctctgccaggaatgctctgccttatcctttcagagctgactgggagcagagtttttcct
gatggctctgcccctggttga