GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:49:37 Sequence gi568815584r:31382967_31583965 : 200999 bp : 39.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 Intr - 805 555 251 2 2 119 90 204 0.970 19.93 1.15 Intr - 3609 3454 156 1 0 40 76 89 0.037 1.96 1.14 Intr - 4409 4154 256 2 1 87 47 68 0.054 -1.31 1.13 Intr - 6036 5879 158 1 2 47 47 191 0.047 9.71 1.12 Intr - 11260 11086 175 1 1 40 99 151 0.803 9.99 1.11 Intr - 12382 12233 150 1 0 21 93 108 0.933 4.04 1.10 Intr - 15815 15707 109 1 1 70 107 49 0.991 4.37 1.09 Intr - 17546 17335 212 2 2 102 87 179 0.996 15.99 1.08 Intr - 37298 37186 113 0 2 8 96 103 0.006 2.28 1.07 Intr - 44077 43919 159 2 0 79 89 145 0.108 12.74 1.06 Intr - 62753 62669 85 2 1 63 69 98 0.111 3.87 1.05 Intr - 65488 65214 275 2 2 84 36 132 0.224 3.93 1.04 Intr - 69331 69191 141 2 0 94 67 43 0.804 2.20 1.03 Intr - 70378 70309 70 1 1 70 71 69 0.837 1.24 1.02 Intr - 74134 73967 168 1 0 74 41 73 0.283 0.42 1.01 Init - 74427 74317 111 0 0 80 82 164 0.882 13.58 1.00 Prom - 96562 96523 40 -6.05 2.02 PlyA - 96600 96595 6 1.05 2.01 Sngl - 100765 99998 768 1 0 60 37 260 0.840 13.92 2.00 Prom - 108001 107962 40 -8.55 3.00 Prom + 108717 108756 40 -5.75 3.01 Init + 112472 113041 570 1 0 62 90 315 0.305 24.23 3.02 Term + 138588 138674 87 2 0 81 49 83 0.026 0.38 3.03 PlyA + 138821 138826 6 1.05 4.03 PlyA - 139885 139880 6 1.05 4.02 Term - 140523 140295 229 2 1 126 37 176 0.976 11.42 4.01 Init - 144988 144864 125 1 2 23 46 159 0.665 4.89 4.00 Prom - 152624 152585 40 -0.85 5.04 PlyA - 152639 152634 6 1.05 5.03 Term - 167081 166930 152 0 2 20 40 163 0.208 1.79 5.02 Intr - 178591 178458 134 0 2 93 42 180 0.526 13.27 5.01 Intr - 193874 193713 162 1 0 78 44 85 0.014 1.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 3601 3454 148 1 1 73 76 93 0.884 6.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:31382967_31583965|GENSCAN_predicted_peptide_1|863_aa MAEGSRIPQARALLQQCLHARLQIRPADGDVAAQWVEHHRSPAPVVAPERKSVSPFLGGP QVFETEFGELQALLSQTPCSSWSMLVGGKLVGQVQRGLVIYVCFFKGADKELLPKMEPLT EAVDDTVYTRCGSHQTWTTQVYIIIMEEFALAMCALPCGNKNRFNTLLNVKLSETENGKH VSILDLPGNILIIPQATLGGRLKGRNMQYHSNSGKEEGFELYSQFVTLCEKEVAANSKCA EARVVVEHGTYGNRQLFWPKVLKESSTDILSSQYMYDEDSAVERTFWQANMDEQTKDFVL IPFSPTFIDSPQTPPSHFLKSATDSIMQNLLSLATAGLESAEKGLTGAQGRLINSVVRSS SWTTGKSGFTCAQQNDVREKQKTLVEQLLSLLNSSPGPPTRKLLAKNLAILYSIGDTFSV HEAIDKCNDLIRSKDDSPSYLPTKLAAVVCLGSLYKKLGRILGNTFTDTVGNILKAMKSA ESQGRYEIMLSLQNILNGLGAAAAPCHRDVYKAARSCLTDRSMAVRCAAAKCLLELQNEA IFMWSTDLDSVATLCFKSFEGSNYDVRISVSKLLGIILAKAVISKHPGTASRQSIRRVSL EEVLELLGTGFLRGSSGFLRASGDMLKGTSSVSRDVRVGVTQAYVVFVSTLGGAWLEKNF AAFFSHILSLASPSHPKATQTQIDAVCCRRCVSFILRTTIGGLLGEKAQLAAVKDICQAI WKLKKVMDAVMSDGNLETRLGSTDVAASQHMLVCALQELGNLIHNLGTTAAPLLQDSSTG LLDSILSVILHPSISVRLAAAWCLHCIAVALPSYLTPLLDRCLERLTGHKSSPEAVTGFS FAVAALLGAVKHCPLGIPHGKGK >gi568815584r:31382967_31583965|GENSCAN_predicted_CDS_1|2589_bp atggctgagggtagccggattcctcaggcccgggcgctcctacagcagtgcctgcacgcc cggctgcaaattcgcccagccgatggggacgtcgcggcccagtgggtggagcatcaccgt tccccagcgccagttgtagcacccgagaggaagagcgtgtctcccttcttggggggacca caggtatttgaaactgagttcggtgaacttcaggccttactgtctcagaccccttgcagc tcgtggtctatgctggtcgggggcaagcttgtaggtcaggtccaaagaggactggtgatc tacgtgtgctttttcaagggagctgataaagaacttcttcccaaaatggagcccctaaca gaggcagtagatgacactgtatatacaagatgtgggtctcatcaaacctggacaactcaa gtttatatcattattatggaagaatttgcattagccatgtgtgcacttccttgtggaaat aagaatagatttaatacactgttaaatgtgaaattaagtgagacagaaaatggcaagcat gtctctatattggatctacctggcaacattcttattatccctcaagctacccttggagga agactaaaaggaagaaacatgcaatatcactctaactctggaaaagaagaagggtttgaa ctttactctcaatttgtgactctatgtgaaaaagaagtagctgctaatagcaagtgtgct gaagctagggttgtagtggaacatggcacttatgggaacaggcagctcttctggcccaag gtgctcaaagaatcttcaacagatattttatcttcacagtatatgtatgatgaggatagc gctgtggaaaggactttctggcaggcaaacatggatgagcagaccaaggactttgtttta attccgtttagcccgacttttattgattccccacaaacaccaccttcccacttcttaaag tctgctacagattccattatgcagaacctgctgtccctggccactgcagggctggagtcc gccgagaaggggctgactggggcacagggtcggctgataaacagcgtggtccgttcttcc tcctggactacgggaaagtctggcttcacctgtgcacaacagaatgatgtaagggagaaa cagaagactcttgttgaacagctcctgtctttgttgaacagctccccagggcctcctacc cgcaaactgcttgctaagaatctagccatactttatagtattggagacacattctccgtt catgaagcaatcgataaatgtaatgatcttattcgtagcaaagatgattctccaagttat cttcccactaagcttgctgctgtggtatgtttgggttccttgtacaagaagttgggtaga atactgggtaacacctttactgatacagtggggaatattcttaaagctatgaagagtgca gagtctcaaggccgatatgagattatgctaagtctgcaaaatatattgaatggactagga gctgccgctgcaccttgtcacagggatgtttataaagctgctagatcctgcttgacagat agatccatggctgttcgttgtgctgctgcaaagtgtctccttgaacttcagaatgaagcc atctttatgtggagtacggacctggacagtgtggccacactgtgttttaagtcctttgaa ggttccaattatgatgtgcggatttctgtttcaaagttactaggcataatattagctaaa gctgtaatttctaaacatccaggaacagcctcacgtcaaagcattcgcagagtatctttg gaggaagttctggaattactaggaacagggtttctacgtgggagttcaggattccttcga gccagtggagatatgctgaaaggaaccagttcagtcagtagggatgttcgagttggagtt actcaggcttatgtggtatttgtttcaacactaggaggagcttggctagagaaaaatttt gctgcctttttttctcatatcctaagccttgcgtcaccgtcacaccctaaagccacccaa actcagatcgatgccgtctgctgtcgccgttgtgtttcatttattcttcgaactactata ggtggtcttcttggagaaaaggctcagcttgctgctgtaaaggatatttgccaggccatc tggaagctaaagaaagttatggatgccgtaatgagtgatggtaatttggaaacccggctt ggttccacagatgtagccgctagccaacatatgctggtttgtgctttacaagaacttgga aatctcatacacaatcttggcaccacagcggcacctttgctacaggattcaagtacaggt ctccttgacagtatcttgtcagttattcttcatcctagcatttctgttcgactagcagca gcttggtgtttacactgcattgccgtggcattaccctcctacctaacaccactcttggat cgttgccttgaacggcttactggacataagtcttcacctgaagcagtgactggcttcagt tttgctgtagcagctttgttgggagcagtaaaacattgtcctttaggaattcctcatgga aaaggcaag >gi568815584r:31382967_31583965|GENSCAN_predicted_peptide_2|255_aa MILPFMATSQLQDNHWNFGTALCKVFNGTLSLGMFTSVFFLSAIGLDRYLLTLHPVWSQQ HRTPRWASSIVLGVWISAAALSIPYLIFRETHHDRKGKVTCQNNYAVSTNWESKEMQASR QWIHVACFISRFLLGFLLPFFIIIFCYERVASKVKERSLFKSSKPFKVMMTAIISFFVCW MPYHIHQGLLLTTNQSLLLELTLILTVLTTSFNTIFSPTLYLFVGENFKKVFKKSILALF ESTFSEDSSVERTQT >gi568815584r:31382967_31583965|GENSCAN_predicted_CDS_2|768_bp atgattctgccatttatggccacctcccaacttcaagacaatcactggaactttggaact gccttgtgcaaggtcttcaatggcactttgtctctggggatgttcacctctgttttcttc ctttcggccatcggtcttgatcgttaccttctcactcttcacccagtgtggtcccagcag caccgaaccccgcgctgggcttccagcattgtcctgggagtctggatttcagccgctgcc ctcagcatcccctatttgattttcagagagacacatcatgaccgtaaaggaaaggtgact tgccaaaataactatgctgtgtctactaactgggaaagcaaggagatgcaagcatcaagg cagtggattcatgtggcctgtttcatcagccgcttcttgctgggctttcttctgcctttc ttcatcatcatcttttgttatgaaagagtagccagcaaggtgaaagagaggagcctgttt aaatccagcaagcccttcaaagttatgatgactgccattatctctttctttgtgtgttgg atgccctaccatatacaccagggcttacttctcactacgaaccagtcactacttttagag ttgactttgatacttacagtgctaaccacttctttcaatactatcttttctcccacactc tacttatttgttggggagaatttcaaaaaggtcttcaagaagtccattcttgctctgttt gagtcaacatttagtgaagattcttctgtagaaaggacacaaacctaa >gi568815584r:31382967_31583965|GENSCAN_predicted_peptide_3|218_aa MRKTQHKNAENSINQNASSPPNDSNSSPARAQNGTENEFDKLAEVGFRRWVITNSSELKE HVLTQCKETKSLDKRLQELLTRKASFGRHRARCRKFFFIPQWRLEPQQDRTVHSSGKQAE AREPSGLTHINELMELRNTAREIREAYTGINSRTDQAEERISEIEDQLNEIKRENKIREK RMKRNEQSLQGKASFDELGSGVTPGSIANGNQWQSQLP >gi568815584r:31382967_31583965|GENSCAN_predicted_CDS_3|657_bp atgaggaaaacccagcacaaaaatgctgaaaattccataaaccagaatgcttcttctcct ccaaatgatagcaactcctctccagcaagggcacaaaatgggacggagaatgagtttgac aaattggcagaagtaggcttcagaaggtgggtaataacaaactcctctgagctaaaagag catgttctaacccaatgcaaggaaactaagagccttgataaaagattacaagaactgcta actagaaaagccagttttggcagacaccgagctcgctgcaggaagttttttttcataccc caatggcgcctggaaccccagcaagacagaactgttcactcttctggaaagcaggctgaa gccagggagccaagtggtctcactcacataaatgaactgatggaactgagaaacacagca agagaaattcgtgaagcatacacaggtatcaatagccgaactgatcaagcggaagaaagg atatcagagattgaagatcaacttaatgaaataaagcgtgaaaacaagattagagaaaaa agaatgaaaaggaatgaacaaagcctccaggggaaggcgtcatttgacgagcttgggtca ggtgtcacccctggatcaatcgccaatggcaatcaatggcaatcacagttaccttga >gi568815584r:31382967_31583965|GENSCAN_predicted_peptide_4|117_aa MHGGKATGGHSEKVATCKPGREPLEETNPASTLVLDFQPPELYLLNKYKAHVPVTLIVLS QNGIIKGTQSKNKQEFKNKASPLLVLSEHQEESTQFTSTLLKIPFYAVPSTVLTHPR >gi568815584r:31382967_31583965|GENSCAN_predicted_CDS_4|354_bp atgcacggaggaaaggccacaggaggacacagtgagaaggtggccacctgcaagccagga agagagcccttagaagaaaccaatccagctagcaccttggtcttggacttccagcctcca gaactatatttactaaataaatataaagctcatgtacctgtgacccttatagttctatcc cagaatggcatcatcaaaggcacacagtccaagaacaaacaggagttcaaaaataaagca agtcctttacttgtgttgtccgaacaccaagaagaaagcacacagttcacatcaacactt cttaaaatccctttctatgctgtcccatccactgtcctcacacatccccggtag >gi568815584r:31382967_31583965|GENSCAN_predicted_peptide_5|149_aa XAATKLVSEKPPDKLPMKSPFFTWNRKQDHVLGIRKLELDRGPDKSGNDLEEHLCTTDLA PTNHRSASPKRGSGPATSPERHPTKKQQTLPNPHDAERCSWAMEVRPSKKKKREKKEKEE EEEEEQEEEEKEEKEERKRGRRRRRKRRS >gi568815584r:31382967_31583965|GENSCAN_predicted_CDS_5|450_bp nnggcagctaccaaacttgtgagtgaaaagcctccagataagttaccaatgaaaagtcca tttttcacatggaataggaagcaagaccatgtgctaggaataaggaaattggaattggat agaggacctgataaaagtgggaacgatttagaagagcatctgtgcaccacggacctggcg cccacaaaccatcgctcggcttcccccaagcggggcagtggccccgccaccagcccggag cgacaccccaccaaaaagcagcagacgctgccaaatccccatgacgcggaacgctgctcc tgggcaatggaagtgagaccgtcaaaaaaaaaaaagagggagaagaaggagaaggaggag gaggaagaggaggagcaggaggaggaagagaaggaggagaaggaggagaggaagagagga agaaggagaaggagaaagagaagaagttga