GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:40:28 Sequence gi568815579r:47771458_47986257 : 214800 bp : 46.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9407 9460 54 1 0 93 78 106 0.992 9.28 1.02 Intr + 9651 9725 75 2 0 96 95 32 0.938 4.41 1.03 Term + 9833 9913 81 1 0 137 38 187 0.999 16.29 1.04 PlyA + 13203 13208 6 1.05 2.02 PlyA - 14605 14600 6 1.05 2.01 Sngl - 23445 23098 348 0 0 68 37 373 0.999 26.24 2.00 Prom - 24439 24400 40 -4.26 3.04 PlyA - 24482 24477 6 1.05 3.03 Term - 26733 26575 159 0 0 102 43 117 0.446 6.54 3.02 Intr - 32216 32047 170 0 2 126 41 251 0.629 23.87 3.01 Init - 47647 47515 133 1 1 78 66 122 0.753 9.30 3.00 Prom - 58563 58524 40 -6.26 4.00 Prom + 58630 58669 40 -8.56 4.01 Init + 62314 62344 31 0 1 49 61 50 0.456 -1.69 4.02 Intr + 62952 63086 135 1 0 111 89 106 0.890 13.54 4.03 Intr + 64786 64937 152 2 2 84 75 225 0.982 20.68 4.04 Term + 67863 68510 648 2 0 127 44 372 0.999 30.78 4.05 PlyA + 69670 69675 6 1.05 5.00 Prom + 75275 75314 40 -6.96 5.01 Init + 78383 78434 52 1 1 92 46 32 0.275 0.75 5.02 Intr + 88656 88825 170 1 2 126 41 251 0.999 23.87 5.03 Term + 89876 90055 180 1 0 43 50 200 0.611 9.31 5.04 PlyA + 90924 90929 6 1.05 6.05 PlyA - 91112 91107 6 1.05 6.04 Term - 100110 99998 113 1 2 133 39 136 0.998 11.92 6.03 Intr - 103377 103200 178 0 1 113 93 120 0.999 14.49 6.02 Intr - 107673 107579 95 1 2 68 89 115 0.944 9.28 6.01 Init - 114800 114665 136 2 1 73 110 0 0.425 1.00 6.00 Prom - 116945 116906 40 -5.76 7.00 Prom + 129549 129588 40 -5.16 7.01 Init + 130610 130658 49 1 1 86 58 34 0.087 -0.69 7.02 Intr + 131195 131255 61 0 1 80 73 45 0.048 -0.01 7.03 Intr + 135215 135406 192 2 0 81 53 89 0.064 3.31 7.04 Intr + 140591 140782 192 2 0 89 43 112 0.237 5.41 7.05 Intr + 145964 146155 192 2 0 81 53 116 0.264 6.01 7.06 Intr + 151315 151506 192 1 0 81 53 92 0.480 3.61 7.07 Intr + 156359 156550 192 2 0 89 43 110 0.697 5.21 7.08 Intr + 161717 161908 192 2 0 81 53 92 0.058 3.61 7.09 Intr + 167072 167263 192 2 0 89 43 112 0.360 5.41 7.10 Intr + 172458 172649 192 0 0 81 53 92 0.124 3.61 7.11 Intr + 177829 178020 192 1 0 89 43 112 0.493 5.41 7.12 Intr + 183217 183408 192 1 0 89 43 112 0.356 5.41 7.13 Intr + 184101 184210 110 0 2 99 82 31 0.564 3.53 7.14 Term + 185887 185927 41 2 2 91 43 55 0.544 -1.35 7.15 PlyA + 185956 185961 6 1.05 8.04 PlyA - 187180 187175 6 1.05 8.03 Term - 205397 205255 143 0 2 132 45 45 0.980 2.69 8.02 Intr - 206047 205916 132 2 0 25 113 75 0.933 4.32 8.01 Intr - 208142 208113 30 0 0 89 111 -4 0.207 0.10 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:47771458_47986257|GENSCAN_predicted_peptide_1|69_aa YLQLKKKLEDEFPGRLDICGEGTPQATGFFEVMVAGKLIHSKKKGDGYVDTESKFLKLVA AIKAALAQG >gi568815579r:47771458_47986257|GENSCAN_predicted_CDS_1|210_bp tatcttcagctcaagaagaagttagaagatgagttccccggccgcctggacatctgcggc gagggaactccccaggccaccgggttctttgaagtgatggtagccgggaagttgattcac tctaagaagaaaggcgatggctacgtggacacagaaagcaagtttctgaagttggtggcc gccatcaaagccgccttggctcagggctaa >gi568815579r:47771458_47986257|GENSCAN_predicted_peptide_2|115_aa MLPKAKEAPASPKAQAKPKALKAKKAVLKAVHSHTEEKIRRSPTFRRPKTARLRREPKYP QKSAPWRNKLGHSAIITFPPTTESAMKKTDDNNTLAFIIDVKAKEHQIKQAVKKL >gi568815579r:47771458_47986257|GENSCAN_predicted_CDS_2|348_bp atgctgccgaaagcgaaggaagctcctgcctctcctaaagcccaagccaaaccgaaggct ttaaaggccaagaaagcagtgttgaaagccgtccacagccacacagaagagaagatccgc aggtcacccaccttcaggcggcccaagacagcgcgactccggagggagcccaaatatcct cagaagagcgccccctggagaaacaagcttggccactctgcgatcatcacgtttccgccg accactgagtccgccatgaagaagacagacgacaacaacacacttgccttcattatagat gttaaagccaaggagcaccagatcaaacaggctgtgaagaagctctga >gi568815579r:47771458_47986257|GENSCAN_predicted_peptide_3|153_aa MEYYAAIKKDEFMSYAGTWMKLETIILSKLSQGQKTKHRMLSLIGPPLALDPPRRQRQER TVYTESQQKVLEFYFQKDQYPNYDQRLNLAEMLSLREQQLQSPVRRHLQNLSRVTEELKG RWDILVDDVKTLLKPVLAPSHASLLAVRLQLQQ >gi568815579r:47771458_47986257|GENSCAN_predicted_CDS_3|462_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctatgcagggacatggatg aagctggaaaccatcattctgagcaaactatcacaaggacagaaaaccaaacaccgcatg ctctcgctcataggccctcccctggccctggaccctccaaggagacagcggcaggagcgc acggtctacactgaaagccagcagaaagtgctagaattttactttcagaaggaccagtac ccgaactacgaccagcgactgaatctggcggagatgctcagcctcagggagcaacagctg cagagcccggtgaggcggcatctgcagaatctcagccgggtcacagaagagttgaagggg agatgggacatattggtggatgatgtcaaaacccttctcaagcccgtgctggcaccttct catgcctccttactggccgtgagacttcagttacagcagtag >gi568815579r:47771458_47986257|GENSCAN_predicted_peptide_4|321_aa MVKKEGILDEGPLTWASVSPKIMMAYMNPGPHYSVNALALSGPSVDLMHQAVPYPSAPRK QRRERTTFTRSQLEELEALFAKTQYPDVYAREEVALKINLPESRVQVWFKNRRAKCRQQR QQQKQQQQPPGGQAKARPAKRKAGTSPRPSTDVCPDPLGISDSYSPPLPGPSGSPTTAVA TVSIWSPASESPLPEAQRAGLVASGPSLTSAPYAMTYAPASAFCSSPSAYGSPSSYFSGL DPYLSPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGAYSPVDSLEFKDPTGTWKF TYNPMDPLDYKDQSAWKFQIL >gi568815579r:47771458_47986257|GENSCAN_predicted_CDS_4|966_bp atggtcaagaaagaagggatcttggatgagggccccctgacttgggcctcagtgtccccg aagatcatgatggcgtatatgaacccggggccccactattctgtcaacgccttggcccta agtggccccagtgtggatctgatgcaccaggctgtgccctacccaagcgcccccaggaag cagcggcgggagcgcaccaccttcacccggagccaactggaggagctggaggcactgttt gccaagacccagtacccagacgtctatgcccgtgaggaggtggctctgaagatcaatctg cctgagtccagggttcaggtttggttcaagaaccggagggctaaatgcaggcagcagcga cagcagcagaaacagcagcagcagcccccagggggccaggccaaggcccggcctgccaag aggaaggcgggcacgtccccaagaccctccacagatgtgtgtccagaccctctgggcatc tcagattcctacagtccccctctgcccggcccctcaggctccccaaccacggcagtggcc actgtgtccatctggagcccagcctcagagtcccctttgcctgaggcgcagcgggctggg ctggtggcctcagggccgtctctgacctccgccccctatgccatgacctacgccccggcc tccgctttctgctcttccccctccgcctatgggtctccgagctcctatttcagcggccta gacccctacctttctcccatggtgccccagctagggggcccggctcttagccccctctct ggcccctccgtgggaccttccctggcccagtcccccacctccctatcaggccagagctat ggcgcctacagccccgtggatagcttggaattcaaggaccccacgggcacctggaaattc acctacaatcccatggaccctctggactacaaggatcagagtgcctggaagtttcagatc ttgtag >gi568815579r:47771458_47986257|GENSCAN_predicted_peptide_5|133_aa MGIQLLTCTWGAGKDNDCPPLALDPPRRQRQERTVYTESQQKVLEFYFQKDQYPNYDQRL NLAEMLSLREQQLQSPYASNLSPDTQLYPDFTKLLPLLDRFEESSLSTTTSQYKEEDGFV DKNHSVPRSLLDL >gi568815579r:47771458_47986257|GENSCAN_predicted_CDS_5|402_bp atgggaatccagctcctgacatgcacctggggcgctgggaaagacaatgactgccctccc ctggccctggaccctccaaggagacagcggcaggagcgcacggtctacactgaaagccag cagaaagtgctagaattttactttcagaaggaccagtacccgaactacgaccagcgactg aatctggcggagatgctcagcctcagggagcaacagctgcagagcccctatgcctccaac ttgtcgccagacacccagttataccctgacttcaccaagctgctcccgctcctagaccgg ttcgaggaatcctcactctccaccacgacgtctcagtacaaagaggaggatggcttcgtg gacaaaaatcactcagtccccaggtcattactggatttatag >gi568815579r:47771458_47986257|GENSCAN_predicted_peptide_6|173_aa MSDDFLWFEGIAFPTMGFRSETLRKVRDEFVIRDEDVIILTYPKSVLYGSWFDHIHGWMP MREEKNFLLLSYEELKQDTGRTIEKICQFLGKTLEPEELNLILKNSSFQSMKENKMSNYS LLSVDYVVDKAQLLRKGVSGDWKNHFTVAQAEDFDKLFQEKMADLPRELFPWE >gi568815579r:47771458_47986257|GENSCAN_predicted_CDS_6|522_bp atgtcggacgatttcttatggtttgaaggcatagctttccctactatgggtttcagatcc gaaaccttaagaaaagtacgtgatgagttcgtgataagggatgaagatgtaataatattg acttaccccaaatcagtgctatatgggtcatggtttgaccacattcatggctggatgccc atgagagaggagaaaaacttcctgttactgagttatgaggagctgaaacaggacacagga agaaccatagagaagatctgtcaattcctgggaaagacgttagaacccgaagaactgaac ttaattctcaagaacagctcctttcagagcatgaaagaaaacaagatgtccaattattcc ctcctgagtgttgattatgtagtggacaaagcacaacttctgagaaaaggtgtatctggg gactggaaaaatcacttcacagtggcccaagctgaagactttgataaattgttccaagag aagatggcagatcttcctcgagagctgttcccatgggaataa >gi568815579r:47771458_47986257|GENSCAN_predicted_peptide_7|726_aa MGFHHVGQAGLQLLTSGSSILPEDLQNLQIFLLEKPHHTGLLTVPVAAGMCPHWASVLAV PTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQHFQDVVCHTGLLTVPVAAGMCPHRAS VLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTGLLTVPVAAGMCP HWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTGLLTVPVAA GMCPHWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQHFQDVVCHTGLLTV PVEAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTG LLTVPVAAGMCPHWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQHFQDVV CHTGLLTVPVAAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDF QDVVCHTGLLTVPVAAGMCPHWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGP NQHFQDVVCHTGLLTVPVAAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLN MAGPNQDFQDVVCHTGLLTVPVAAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMAT EQLNMAGPNQDFQDVVWGRHGTQGPHWPWGLGDVRGEPLVIGSLTLRKIKGPKVSFSTPY LDKDKK >gi568815579r:47771458_47986257|GENSCAN_predicted_CDS_7|2181_bp atggggtttcaccatgttggccaggctggtctccaactcctgacctcaggttcatcaatc ttgccagaagatctgcagaatctgcagatattcttactagagaagccgcaccacacgggg ctcctcactgttcccgtagcagcaggcatgtgcccccactgggcctctgtactggctgtt cccactgcccgaacaccctcatgcaccatctgcactgtccaatatggccgcctctggcca cacatggctactgagcagttgaacatggctggtccaaaccaacatttccaagacgtcgta tgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcctct gtactggctgttcccactgcccgaacaccctcatgcaccatctgcactgtccaatacggc cgcctctggccacacatggctactgagcagttgaacatggctggtccaaaccaagatttc caagacgtcgtgtgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccc cactgggcctctgtactggctgttcccactgcccgaacaccctcatgcaccatctgcact gtccaatacggccgcctctggccacacatggctactgagcagttgaacatggctggtcca aaccaagatttccaagacgtcgtatgccacacggggctcctcactgttcccgtagcagca ggcatgtgcccccactgggcctctgtactggctgttcccactgcccgaacaccctcatgc accatctgcactgtccaatacggccgcctctggccacacatggctactgagcagttgaac atggctggtccaaaccaacatttccaagacgtcgtatgccacacggggctcctcactgtt cccgtagaagcaggcatgtgcccccacagggcctctgtactggctgttcccactgcccga acaccctcatgcaccatctgcactgtccaatacggccgcctctggccacacatggctact gagcagttgaacatggctggtccaaaccaagatttccaagacgtcgtgtgccacacgggg ctcctcactgttcccgtagcagcaggcatgtgcccccactgggcctctgtactggctgtt cccactgcccgaacaccctcatgcaccatctgcactgtccaatacggccgcctctggcca cacatggctactgagcagttgaacatggctggtccaaaccaacatttccaagacgtcgta tgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcctct gtactggctgttcccactgcccgaacaccctcatgcaccatctgcactgtccaatacggc cgcctctggccacacatggctactgagcagttgaacatggctggtccaaaccaagatttc caagacgtcgtgtgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccc cactgggcctctgtactggctgttcccactgcccgaacaccctcatgcaccatctgcact gtccaatacggccgcctctggccacacatggctactgagcagttgaacatggctggtcca aaccaacatttccaagacgtcgtatgccacacggggctcctcactgttcccgtagcagca ggcatgtgcccccacagggcctctgtactggctgttcccactgcccgaacaccctcatgc accatctgcactgtccaatacggccgcctctggccacacatggctactgagcagttgaac atggctggtccaaaccaagatttccaagacgtcgtgtgccacacggggctcctcactgtt cccgtagcagcaggcatgtgcccccacagggcctctgtactggctgttcccactgcccga acaccctcatgcaccatctgcactgtccaatacggccgcctctggccacacatggctact gagcagttgaacatggctggtccaaaccaagatttccaagacgtcgtgtgggggcgccat ggaacgcagggccctcactggccctggggactgggtgacgtcaggggtgagcctctggtg attggctccctcaccctgcgtaagatcaaagggcctaaagttagcttttctacaccctac ctggataaggataagaaatag >gi568815579r:47771458_47986257|GENSCAN_predicted_peptide_8|101_aa XTITHFPEVTDGECVFPFHYKNGTYYDCIKSKARHKWCSLNKTYEGYWKFCSAEDFANCV FPFWYRRLIYWECTDDGEAFGKKWCSLTKNFNKDRIWKYCE >gi568815579r:47771458_47986257|GENSCAN_predicted_CDS_8|306_bp naaactataactcattttccagaagttacagatggggagtgtgtctttccattccactat aaaaatggaacatattatgactgcatcaagtccaaggcaagacacaagtggtgctcgtta aacaagacctacgaaggatactggaagttttgcagtgcagaagattttgcaaactgtgta tttcccttctggtacagacgcttgatctactgggagtgtactgatgatggggaagcattt gggaaaaaatggtgttcactgaccaagaattttaacaaggaccgaatttggaaatactgt gaatga