GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:12:14 Sequence gi568815593r:133872774_134093012 : 220239 bp : 47.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 419 436 18 1 0 104 90 0 0.178 2.05 1.02 Term + 12561 12677 117 2 0 60 49 168 0.785 8.64 1.03 PlyA + 13676 13681 6 1.05 2.00 Prom + 14003 14042 40 -6.26 2.01 Init + 19658 19763 106 1 1 78 115 45 0.309 4.61 2.02 Intr + 27229 27315 87 2 0 48 113 47 0.486 3.14 2.03 Intr + 28467 28556 90 1 0 80 78 36 0.311 1.77 2.04 Intr + 44708 44796 89 0 2 39 84 92 0.100 3.59 2.05 Term + 51926 52105 180 1 0 105 49 52 0.109 0.61 2.06 PlyA + 55047 55052 6 1.05 3.02 PlyA - 59188 59183 6 -0.45 3.01 Sngl - 64014 63640 375 0 0 63 46 269 0.999 16.34 3.00 Prom - 68310 68271 40 -7.56 4.04 PlyA - 71193 71188 6 1.05 4.03 Term - 74183 74142 42 2 0 127 45 30 0.033 -0.24 4.02 Intr - 87247 86721 527 2 2 77 100 312 0.163 23.96 4.01 Init - 89767 89746 22 1 1 81 106 12 0.831 2.17 4.00 Prom - 91678 91639 40 -5.46 5.09 PlyA - 92385 92380 6 1.05 5.08 Term - 100089 99998 92 1 2 150 36 115 0.981 10.38 5.07 Intr - 101075 101018 58 2 1 57 91 15 0.844 -2.84 5.06 Intr - 103248 103098 151 2 1 115 72 222 0.884 23.46 5.05 Intr - 108183 107956 228 2 0 99 75 355 0.673 32.28 5.04 Intr - 118134 118082 53 0 2 70 86 44 0.964 0.11 5.03 Intr - 118381 118229 153 1 0 83 95 152 0.999 15.67 5.02 Intr - 119582 119533 50 0 2 114 111 4 0.999 3.70 5.01 Init - 120239 120173 67 2 1 74 101 90 0.946 10.13 5.00 Prom - 121337 121298 40 -6.06 6.00 Prom + 123885 123924 40 -3.56 6.01 Init + 129143 129386 244 1 1 70 95 123 0.787 9.28 6.02 Term + 132411 132751 341 1 2 86 54 123 0.476 3.40 6.03 PlyA + 132767 132772 6 1.05 7.00 Prom + 145186 145225 40 -3.56 7.01 Init + 145358 145384 27 1 0 70 81 57 0.316 0.91 7.02 Intr + 147073 147182 110 0 2 95 76 65 0.316 5.08 7.03 Intr + 154561 154606 46 1 1 107 75 19 0.559 0.91 7.04 Term + 158318 158416 99 1 0 79 45 96 0.893 2.53 7.05 PlyA + 159580 159585 6 1.05 8.06 PlyA - 165725 165720 6 1.05 8.05 Term - 174945 174851 95 1 2 61 44 89 0.121 -0.21 8.04 Intr - 180724 180601 124 1 1 87 64 35 0.103 1.16 8.03 Intr - 200031 199979 53 1 2 94 75 20 0.155 -0.07 8.02 Intr - 202117 202070 48 2 0 98 56 49 0.168 1.35 8.01 Intr - 207540 207387 154 0 1 58 79 144 0.906 10.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 71528 71629 102 1 0 57 103 61 0.823 4.74 S.002 Term + 71823 72035 213 2 0 55 43 139 0.935 3.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:133872774_134093012|GENSCAN_predicted_peptide_1|44_aa MGEAVQGPDSWWTPITTPYTQTSATGYPQIGLNSFFNQDKGEEK >gi568815593r:133872774_134093012|GENSCAN_predicted_CDS_1|135_bp atgggggaggctgtgcagggcccagacagctggtggacaccaattaccaccccgtacacc cagacttcagcaacaggctaccctcaaattggcctcaactccttcttcaaccaggacaag ggggaggagaaatag >gi568815593r:133872774_134093012|GENSCAN_predicted_peptide_2|183_aa MEMSGALALASGTSLCSLRMAGSGVQVQLEYGSLAARDLLRMQETEVKGRCRAMVSDCSI NEGPRLPSSGQECSICPSDPLPAQVMNELTSHHRDPLKAATRYGSPPCLQRKYEDFPDID WGKRNLCGFHLACATAYHWMSEDASFRLPLRFTSNHQLATLQLPDFKCPSCAPQFCITSP NPP >gi568815593r:133872774_134093012|GENSCAN_predicted_CDS_2|552_bp atggagatgagcggagcccttgctctggcctctgggacatctctttgctctctgaggatg gctgggtctggggttcaagtgcagctggaatatgggagtcttgcagccagagatctcctg cggatgcaagagacggaggtcaaaggcaggtgccgggccatggtgagtgactgctccata aatgagggaccacgcttaccttcatctggccaggagtgctccatctgcccctctgaccct ctgcctgcccaggtcatgaacgagctgacttcacatcacagagaccctctgaaagcagca acacgctatggttcccctccatgtctccagagaaaatacgaggacttccctgacattgac tgggggaaaaggaatctttgtggattccacctggcctgtgccactgcttatcactggatg tctgaggatgcaagcttcaggcttcctctgcgtttcaccagcaatcatcagcttgcaaca ctgcagcttcctgacttcaaatgtccttcatgtgctccccaattttgtatcaccagcccc aacccaccctga >gi568815593r:133872774_134093012|GENSCAN_predicted_peptide_3|124_aa MRSHRTGGGATELEEETQNQRRGHRTGGGATEPEEGPQNQRRGHRTGRRATEPKEEPQNQ GRSHRTRGGATEPEEESQNQRRSHRTRRGATELEEEPQNQRRSHRTGGGATEPEEETQNR TKSH >gi568815593r:133872774_134093012|GENSCAN_predicted_CDS_3|375_bp atgaggagccacagaaccggaggaggggccacagaactggaggaggagacacagaaccag aggaggggccacagaaccggaggaggggccacagaaccagaggaggggccacagaaccag aggaggggccacagaaccggacgaagagccactgaaccgaaggaggagccacagaaccag gggaggagccacagaacgagaggaggagccacagaaccagaggaggagtcacagaaccag aggaggagccacagaaccagaagaggagccacagaactggaggaggagccacagaaccag aggaggagtcacagaactggaggaggagccacagaaccggaggaggagacacagaaccgg acaaagagccactga >gi568815593r:133872774_134093012|GENSCAN_predicted_peptide_4|196_aa MTLELDEIVSRTDSPSPTVLNSHISTPNVNALTHENQTKPSISQISTTLPPTTSTKKSGG ASVVPHPSPTPLSQEEADNNEDPSIEEEDLLMLNSSPSTAKDTLDNGDYGEPDYDWTTGP RDDDESDDTLEENRGYMEIEQSVKSFKMPSSNIEEEDSHFFFHLIIFAFCIAVVYITYHN KRKKQMSELEDTNFTL >gi568815593r:133872774_134093012|GENSCAN_predicted_CDS_4|591_bp atgactttggagttggatgaaattgtatcacggactgattcaccgagcccaaccgtactc aactcacatatttctaccccaaatgtgaatgctttaacacatgaaaaccaaaccaaacct tctatttcccaaatcagcaccaccctccctcccacgacgagtaccaagaaaagtggagga gcatctgtggtccctcatccctcgcctactcctctgtctcaagaggaagctgataacaat gaagatcctagtatagaggaggaggatcttctcatgctgaacagttctccatccacagcc aaagacactctagacaatggcgattatggagaaccagactatgactggaccacgggcccc agggacgacgacgagtctgatgacaccttggaagaaaacaggggttacatggaaattgaa cagtcagtgaaatcttttaagatgccatcctcaaatatagaagaggaagacagccatttc ttttttcatcttattatttttgctttttgcattgctgttgtttacattacatatcacaac aaaaggaagaaacagatgtctgagcttgaggacaccaatttcacattatag >gi568815593r:133872774_134093012|GENSCAN_predicted_peptide_5|283_aa MAVPPTYADLGKSARDVFTKGYGFGLIKLDLKTKSENGLEFTSSGSANTETTKVTGSLET KYRWTEYGLTFTEKWNTDNTLGTEITVEDQLARGLKLTFDSSFSPNTGKKNAKIKTGYKR EHINLGCDMDFDIAGPSIRGALVLGYEGWLAGYQMNFETAKSRVTQSNFAVGYKTDEFQL HTNVNDGTEFGGSIYQKVNKKLETAVNLAWTAGNSNTRFGIAAKYQIDPDACFSAKVNNS SLIGLGYTQTLKPGIKLTLSALLDGKNVNAGGHKLGLGLEFQA >gi568815593r:133872774_134093012|GENSCAN_predicted_CDS_5|852_bp atggctgtgccacccacgtatgccgatcttggcaaatctgccagggatgtcttcaccaag ggctatggatttggcttaataaagcttgatttgaaaacaaaatctgagaatggattggaa tttacaagctcaggctcagccaacactgagaccaccaaagtgacgggcagtctggaaacc aagtacagatggactgagtacggcctgacgtttacagagaaatggaataccgacaataca ctaggcaccgagattactgtggaagatcagcttgcacgtggactgaagctgaccttcgat tcatccttctcacctaacactgggaaaaaaaatgctaaaatcaagacagggtacaagcgg gagcacattaacctgggctgcgacatggatttcgacattgctgggccttccatccggggt gctctggtgctaggttacgagggctggctggccggctaccagatgaattttgagactgca aaatcccgagtgacccagagcaactttgcagttggctacaagactgatgaattccagctt cacactaatgtgaatgacgggacagagtttggcggctccatttaccagaaagtgaacaag aagttggagaccgctgtcaatcttgcctggacagcaggaaacagtaacacgcgcttcgga atagcagccaagtatcagattgaccctgacgcctgcttctcggctaaagtgaacaactcc agcctgataggtttaggatacactcagactctaaagccaggtattaaactgacactgtca gctcttctggatggcaagaacgtcaatgctggtggccacaagcttggtctaggactggaa tttcaagcataa >gi568815593r:133872774_134093012|GENSCAN_predicted_peptide_6|194_aa MRPDPAEEPTGQLFSVATTGLHGKPKAADYRISEHQEGLPPSFGWSSTTHAPLPVTTTED TTPREIPSKAKWTSVLQEHLPDPPGVDVAGAGPKPRLGGQSGPRPPIKARVGPIPRGQSQ AQRGHWAPGAAQIQSRGGDCCSRAPRSGYNVSGASDPPARRLDNLRENPCAPILHYRLAI FSCSKATSSQFPCI >gi568815593r:133872774_134093012|GENSCAN_predicted_CDS_6|585_bp atgcgcccagacccagctgaggagcccactggtcaacttttctcagtggctaccacagga ctccatggaaaacccaaagccgcagactacagaatctctgagcaccaggaggggctgcct ccaagcttcggctggtcctcaaccacacatgcgcctcttccagtcactaccaccgaagac actacaccacgggaaattccatcaaaggcgaagtggacttcagtgcttcaggagcatctt cctgatcccccgggagttgatgtggccggcgctggacccaagcctcgcctgggtggacaa agtggcccccggccccccatcaaagcccgcgtgggccccatcccccgagggcagtcccag gcccagcgcggacactgggcgcctggggcggcgcagatacagagtagaggcggcgactgt tgctcccgagctccgcgctccggatacaatgtgtctggcgcctccgacccgcccgcgcgg cgtctagacaatctcagagaaaacccctgtgcgccgattttgcactacagattagcgatt ttctcctgtagcaaagctactagcagccaatttccatgcatttga >gi568815593r:133872774_134093012|GENSCAN_predicted_peptide_7|93_aa MVRASARLLPPPESGEDLLPHWPHTNTSSSVLVARNDFLSDMGNDRSPSDPLLHDQFPSS KVVLDDHLVWGSLTLEEAICSPFQLIKKELNPT >gi568815593r:133872774_134093012|GENSCAN_predicted_CDS_7|282_bp atggtgcgggcatctgctcggcttctgcctcctcctgaatctggagaggacttgcttcct cactggccccacaccaacacctcctcctctgtcctggttgccaggaacgactttctctct gacatgggcaatgacaggtctccctcagacccactgcttcatgaccagttcccgagcagc aaggtggtcctggatgaccacctggtgtggggatcattaaccttggaagaagccatttgc agtccattccagctgatcaagaaagaactaaatccaacatag >gi568815593r:133872774_134093012|GENSCAN_predicted_peptide_8|157_aa AGQKETIPVSKGLEIVCFVGVKATRDCESGHRTCEKSFVSDDGFLSRANDHSFPGKGAFG GGCLHGTSGLQEVTMGLEETITGAEDPPSWRLPLARLTGALDNDGGLGNGSIDPQQTDKM QSIPRHGTLNQKGKQKKQNKKHPNGLQLMDGACSIVS >gi568815593r:133872774_134093012|GENSCAN_predicted_CDS_8|474_bp gccgggcaaaaggagacaatcccagtgtccaagggcctagaaatagtctgttttgtggga gtaaaggccaccagggactgtgaaagtggccaccggacatgtgagaaatccttcgtgtct gacgatgggttcttatcgagggcaaatgaccacagttttcctggcaaaggagcctttggt ggtggatgtcttcatggcacatctggactgcaggaggtcacaatggggttggaggagacc atcactggggcagaggatcctccttcctggcgcctgcccctggcacggctcacaggagcc ctagacaatgatggagggttgggcaatgggtccattgacccacagcaaactgacaaaatg cagagtatccccaggcatggtacactgaatcaaaaaggaaaacagaaaaaacagaacaag aaacaccccaatgggttgcagcttatggatggtgcctgcagcattgtgagctga