GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:40:08 Sequence gi568815579r:47701778_47902983 : 201206 bp : 50.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 30 638 609 2 0 62 41 1039 0.999 92.80 1.02 PlyA + 1469 1474 6 -1.75 2.00 Prom + 2682 2721 40 -4.16 2.01 Init + 5403 5571 169 2 1 81 78 143 0.985 12.31 2.02 Intr + 11647 11761 115 2 1 49 109 82 0.895 5.91 2.03 Intr + 14781 15239 459 0 0 122 69 1017 0.920 95.50 2.04 Intr + 16732 16829 98 1 2 150 101 130 0.997 20.15 2.05 Intr + 24035 24447 413 0 2 104 72 1140 0.669 107.91 2.06 Intr + 34592 34756 165 1 0 50 75 295 0.996 24.56 2.07 Term + 39104 39655 552 1 0 85 52 1283 0.975 118.71 2.08 PlyA + 41335 41340 6 1.05 3.00 Prom + 43290 43329 40 -8.16 3.01 Init + 43783 44006 224 0 2 101 87 204 0.971 19.63 3.02 Intr + 45190 45254 65 1 2 100 100 107 0.963 11.46 3.03 Intr + 48401 48509 109 0 1 116 70 86 0.997 8.94 3.04 Intr + 49131 49330 200 0 2 73 79 272 0.982 23.79 3.05 Intr + 49743 49813 71 1 2 102 100 72 0.974 8.70 3.06 Intr + 50735 50830 96 1 0 99 86 60 0.956 7.11 3.07 Intr + 52750 52854 105 0 0 89 99 178 0.999 19.41 3.08 Intr + 52932 53114 183 2 0 125 93 178 0.995 21.98 3.09 Intr + 53571 53746 176 2 2 123 52 186 0.999 17.24 3.10 Intr + 53979 54045 67 0 1 115 42 134 0.838 10.41 3.11 Intr + 54751 54827 77 0 2 114 80 100 0.999 10.11 3.12 Intr + 54911 54967 57 2 0 98 81 143 0.582 12.60 3.13 Intr + 76988 77168 181 2 1 43 31 93 0.053 -1.03 3.14 Intr + 79087 79140 54 0 0 93 78 106 0.992 9.28 3.15 Intr + 79331 79405 75 1 0 96 95 32 0.938 4.41 3.16 Term + 79513 79593 81 0 0 137 38 187 0.999 16.29 3.17 PlyA + 82883 82888 6 1.05 4.02 PlyA - 84285 84280 6 1.05 4.01 Sngl - 93125 92778 348 2 0 68 37 373 0.999 26.24 4.00 Prom - 94119 94080 40 -4.26 5.04 PlyA - 94162 94157 6 1.05 5.03 Term - 96413 96255 159 2 0 102 43 117 0.446 6.54 5.02 Intr - 101896 101727 170 2 2 126 41 251 0.629 23.87 5.01 Init - 117327 117195 133 0 1 78 66 122 0.753 9.30 5.00 Prom - 128243 128204 40 -6.26 6.00 Prom + 128310 128349 40 -8.56 6.01 Init + 131994 132024 31 2 1 49 61 50 0.456 -1.69 6.02 Intr + 132632 132766 135 0 0 111 89 106 0.890 13.54 6.03 Intr + 134466 134617 152 1 2 84 75 225 0.982 20.68 6.04 Term + 137543 138190 648 1 0 127 44 372 0.999 30.78 6.05 PlyA + 139350 139355 6 1.05 7.00 Prom + 144955 144994 40 -6.96 7.01 Init + 148063 148114 52 0 1 92 46 32 0.275 0.75 7.02 Intr + 158336 158505 170 0 2 126 41 251 0.999 23.87 7.03 Term + 159556 159735 180 0 0 43 50 200 0.611 9.31 7.04 PlyA + 160604 160609 6 1.05 8.06 PlyA - 160792 160787 6 1.05 8.05 Term - 169790 169678 113 0 2 133 39 136 0.998 11.92 8.04 Intr - 173057 172880 178 2 1 113 93 120 0.999 14.49 8.03 Intr - 177353 177259 95 0 2 68 89 115 0.945 9.28 8.02 Intr - 190047 189893 155 2 2 34 97 30 0.052 -1.78 8.01 Intr - 194373 194257 117 2 0 98 84 57 0.332 5.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:47701778_47902983|GENSCAN_predicted_peptide_1|202_aa MNGTVDHPPPAAPERKPLGTAPHCPRLPLRKTYRENVGGPGAPEGTPAGRARGGSPAPLP AKVDEATSGLIRELAAVEDELYQRMLKGPPPEPAASAAQGTGDPDWEAPGLPPAKRRKSE SPDVDQASFSSDSPQDDTLTEHLQSAIDSILNLQQAPGRTPAPSYPHAASAGTPASPPPL HRPEAYPPSSHNGGLGARTLTR >gi568815579r:47701778_47902983|GENSCAN_predicted_CDS_1|609_bp atgaacggcacggtggaccacccgccgcctgccgcccccgagcgcaagcccctgggcacc gccccgcactgcccgcgcctgccactgcgcaagacctaccgcgagaacgtggggggccct ggcgcgccggaggggacgcccgcaggcagggcacggggaggcagcccggcgccgctgccc gccaaagtggacgaggccaccagcgggctcatccgcgagctggcggccgtggaggacgag ctgtaccagcgtatgctgaagggccccccgccagagcccgcagccagcgccgcccaaggc accggggaccccgactgggaggcgcccgggctgccccctgccaagcggcgcaagtccgag tcgcccgacgtggaccaggccagcttctccagcgacagcccgcaggatgacacgctcacc gagcacctgcagagcgccatcgacagcatcctgaacctgcagcaggcccccggccggacg cccgcgccctcgtacccccacgctgcctcggccggcacccccgcatccccgccgcccctg cacaggcccgaggcctacccaccctccagtcacaacggtggcctcggcgccaggacgttg accagataa >gi568815579r:47701778_47902983|GENSCAN_predicted_peptide_2|656_aa MQEKREEATGELSGKSIQGRGNRQCKGPVAGACLECRGTAKKPMRLQPISDGERSGDGSG RDREHRPLRGRFGSSRDPTLSQPAQPAAAASAPRRQLSICTSLREPRERCAATMFSWLKR GGARGQQPEAIRTVTSALKELYRTKLLPLEEHYRFGAFHSPALEDADFDGKPMVLVAGQY STGKTSFIQYLLEQEVPGSRVGPEPTTDCFVAVMHGDTEGTVPGNALVVDPDKPFRKLNP FGNTFLNRFMCAQLPNQVLESISIIDTPGILSGAKQRVSRGYDFPAVLRWFAERVDLIIL LFDAHKLEISDEFSEAIGALRGHEDKIRVVLNKADMVETQQLMRVYGALMWALGKVVGTP EVLRVYIGSFWSQPLLVPDNRRLFELEEQDLFRDIQGLPRHAALRKLNDLVKRARLVRVH AYIISYLKKEMPSVFGKENKKKQLILKLPVIFAKIQLEHHISPGDFPDCQKMQELLMAHD FTKFHSLKPKLLEALDEMLTHDIAKLMPLLRQEELESTEVGVQGGAFEGTHMGPFVERGP DEAMEDGEEGSDDEAEWVVTKDKSKYDEIFYNLAPADGKLSGSKAKTWMVGTKLPNSVLG RIWKLSDVDRDGMLDDEEFALASHLIEAKLEGHGLPANLPRRLVPPSKRRHKGSAE >gi568815579r:47701778_47902983|GENSCAN_predicted_CDS_2|1971_bp atgcaggaaaagagggaggaagccacaggagagctctcagggaagagcatccaaggcaga gggaaccgccagtgcaaaggccctgtggcaggagcctgcctggagtgtcgaggaacagcg aaaaagccaatgcggctgcagcccataagtgatggggagaggtctggagacggctccgga cgggaccgcgagcacaggccgctccgcgggcgcttcggatcctcgcgggaccccaccctc tcccagcctgcccagcccgctgcagccgccagcgcgccccgtcggcagctctccatctgc acgtctctccgtgaaccccgtgagcggtgtgcagccaccatgttcagctggctgaagcgg ggcggggcacggggccagcagcccgaggccatccgcacggtgacctcggccctcaaggag ctgtaccgcacgaagctgctgccgctggaggagcactaccgctttggggccttccactcg ccggccctggaggacgcagacttcgacggcaagcccatggtgctggtggccggccagtac agcacgggcaagaccagcttcatccagtacctgctggagcaggaggtgcccggctcccgc gtggggcctgagcccaccaccgactgctttgtggccgtcatgcacggggacactgagggc accgtgcccggcaacgccctcgtcgtggacccggacaagcccttccgcaaactcaaccct ttcggaaacaccttcctcaacaggttcatgtgtgcccagctccctaatcaggtcctggag agcatcagcatcatcgacaccccgggtatcctgtcgggtgccaagcagagagtgagccgc ggctacgacttcccggccgtgctgcgctggttcgcggagcgcgtggacctcatcatcctg ctctttgatgcgcacaagctggagatctcggacgagttctcagaggccatcggcgcgttg cggggccatgaggacaagatccgcgtggtgctcaacaaggccgacatggtggagacgcag cagctgatgcgcgtctacggcgcgctcatgtgggcgctgggcaaggtggtgggcacgccc gaggtgctgcgcgtctacatcggctccttctggtcccagcccctcctcgtgcccgacaac cggcgcctcttcgagctggaggagcaggacctcttccgcgacatccagggcctgccccgg cacgcagccttgcgcaagctcaacgacctggtgaagagggcccggctggtgcgagttcac gcttacatcatcagctacctgaagaaggagatgccctctgtgtttgggaaggagaacaag aagaagcagctgatcctcaaactgcccgtcatctttgcgaagattcagctggaacatcac atctcccctggggactttcctgattgccagaaaatgcaggagctgctgatggcgcacgac ttcaccaagtttcactcgctgaagccgaagctgctagaggcactggacgagatgctgacg cacgacatcgccaagctcatgcccctgctgcggcaggaggagctggagagcaccgaggtg ggcgtgcaggggggcgcttttgagggcacccacatgggcccgtttgtggagcggggacct gacgaggccatggaggacggcgaggagggctcggacgacgaggccgagtgggtggtgacc aaggacaagtccaaatacgacgagatcttctacaacctggcgcctgccgacggcaagctg agcggctccaaggccaagacctggatggtggggaccaagctccccaactcagtgctgggg cgcatctggaagctcagcgatgtggaccgcgacggcatgctggatgatgaggagttcgcg ctggccagccacctcatcgaggccaagctggaaggccacgggctgcccgccaacctgccc cgtcgcctggtgccaccctccaagcgacgccacaagggctccgccgagtga >gi568815579r:47701778_47902983|GENSCAN_predicted_peptide_3|606_aa MAAGGSGVGGKRSSKSDADSGFLGLRPTSVDPALRRRRRGPRNKKRGWRRLAQEPLGLEV DQFLEDVRLQERTSGGLLSEAPNEKLFFVDTGSKEKGLTKKRTKVQKKSLLLKKPLRVDL ILENTSKVPAPKDVLAHQVPNAKKLRRKEQLWEKLAKQGELPREVRRAQARLLNPSATRA KPGPQDTVERPFYDLWASDNPLDRPLVGQDEFFLEQTKKKGVKRPARLHTKPSQAPAVEV APAGASYNPSFEDHQTLLSAAHEVELQRQKEAEKLERQLALPATEQAATQESTFQELCEG LLEESDGEGEPGQGEGPEAGDAEVCPTPARLATTEKKTEQQRRREKAVHRLRVQQAALRA ARLRHQELFRLRGIKAQVALRLAELARRQRRRQARREAEADKPRRLGRLKYQAPDIDVQL SSELTDSLRTLKPEGNILRDRFKSFQRRNMIEPRERAKFKRKYKVKLVEKRAFREIHGCG SPEPWLSPSESFIGKPSGQRPPSPTPAGTRFSEPGSGSPGERTHGSPCMGKGSPCMWYLQ LKKKLEDEFPGRLDICGEGTPQATGFFEVMVAGKLIHSKKKGDGYVDTESKFLKLVAAIK AALAQG >gi568815579r:47701778_47902983|GENSCAN_predicted_CDS_3|1821_bp atggcggcaggaggcagtggcgttggtgggaagcgcagctcgaaaagcgatgccgattct ggtttcctggggctgcggcccacttcggtggacccagcgctgaggcggcggcggcgaggc ccaagaaataagaagcggggctggcggcggcttgctcaggagccgctggggctggaggtt gaccagttcctggaagacgtgcggctacaggagcgcacgagcggtggcttgttgtcagag gccccaaatgaaaaactcttcttcgtggacactggctccaaggaaaaagggctgacaaag aagagaaccaaagtccagaagaagtcactgcttctcaagaaaccccttcgggttgacctc atcctcgagaacacatccaaagtccctgcccccaaagacgtcctcgcccaccaggtcccc aacgccaagaagctcaggcggaaggagcagctatgggagaagctggccaagcagggcgag ctgccccgggaggtgcgcagggcccaggcccggctcctcaacccttctgcaacaagggcc aagcccgggccccaggacaccgtagagcggcccttctacgacctctgggcctcagacaac cccctggacaggccgttggttggccaggatgagtttttcctggagcagaccaagaagaaa ggagtgaagcggccagcacgcctgcacaccaagccgtcccaggcacccgccgtggaggtg gcgcctgccggagcttcctacaatccatcctttgaagaccaccagaccctgctctcagcg gcccacgaggtggagttgcagcggcagaaggaggcggagaagctggagcggcagctggcc ctgcccgccacggagcaggccgccacccaggagtccacattccaggagctgtgcgagggg ctgctggaggagtcggatggtgagggggagccaggccagggcgaggggccggaggctggg gatgccgaggtctgtcccacgcccgcccgcctggccaccacagagaagaagacggagcag cagcggcggcgggagaaggctgtgcacaggctgcgggtacagcaggccgcgttgcgggcc gcccggctccggcaccaggagctgttccggctgcgcgggatcaaggcccaggtggccctg aggctggcggagctggcgcggcggcagaggcggcggcaggcgcggcgggaggctgaggct gacaagccccgaaggctggggcggctcaagtaccaggcacctgacatcgacgtgcagctg agctcggagctgacagactcgctcaggaccctgaagcccgagggcaacatccttcgagac cggttcaagagcttccagaggaggaatatgatcgagcctcgagagagagccaagttcaaa cgcaagtacaaggtgaagctggtggagaagcgggcgttccgtgagatccacggatgtggc agccccgagccatggctctcgccgtccgagtcgtttattggtaagcccagcggccagcgg cccccgtccccgacccccgccgggacccgattctcggagccggggtcagggagccccggg gagaggacccatgggagcccttgtatgggaaaagggagcccctgtatgtggtatcttcag ctcaagaagaagttagaagatgagttccccggccgcctggacatctgcggcgagggaact ccccaggccaccgggttctttgaagtgatggtagccgggaagttgattcactctaagaag aaaggcgatggctacgtggacacagaaagcaagtttctgaagttggtggccgccatcaaa gccgccttggctcagggctaa >gi568815579r:47701778_47902983|GENSCAN_predicted_peptide_4|115_aa MLPKAKEAPASPKAQAKPKALKAKKAVLKAVHSHTEEKIRRSPTFRRPKTARLRREPKYP QKSAPWRNKLGHSAIITFPPTTESAMKKTDDNNTLAFIIDVKAKEHQIKQAVKKL >gi568815579r:47701778_47902983|GENSCAN_predicted_CDS_4|348_bp atgctgccgaaagcgaaggaagctcctgcctctcctaaagcccaagccaaaccgaaggct ttaaaggccaagaaagcagtgttgaaagccgtccacagccacacagaagagaagatccgc aggtcacccaccttcaggcggcccaagacagcgcgactccggagggagcccaaatatcct cagaagagcgccccctggagaaacaagcttggccactctgcgatcatcacgtttccgccg accactgagtccgccatgaagaagacagacgacaacaacacacttgccttcattatagat gttaaagccaaggagcaccagatcaaacaggctgtgaagaagctctga >gi568815579r:47701778_47902983|GENSCAN_predicted_peptide_5|153_aa MEYYAAIKKDEFMSYAGTWMKLETIILSKLSQGQKTKHRMLSLIGPPLALDPPRRQRQER TVYTESQQKVLEFYFQKDQYPNYDQRLNLAEMLSLREQQLQSPVRRHLQNLSRVTEELKG RWDILVDDVKTLLKPVLAPSHASLLAVRLQLQQ >gi568815579r:47701778_47902983|GENSCAN_predicted_CDS_5|462_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctatgcagggacatggatg aagctggaaaccatcattctgagcaaactatcacaaggacagaaaaccaaacaccgcatg ctctcgctcataggccctcccctggccctggaccctccaaggagacagcggcaggagcgc acggtctacactgaaagccagcagaaagtgctagaattttactttcagaaggaccagtac ccgaactacgaccagcgactgaatctggcggagatgctcagcctcagggagcaacagctg cagagcccggtgaggcggcatctgcagaatctcagccgggtcacagaagagttgaagggg agatgggacatattggtggatgatgtcaaaacccttctcaagcccgtgctggcaccttct catgcctccttactggccgtgagacttcagttacagcagtag >gi568815579r:47701778_47902983|GENSCAN_predicted_peptide_6|321_aa MVKKEGILDEGPLTWASVSPKIMMAYMNPGPHYSVNALALSGPSVDLMHQAVPYPSAPRK QRRERTTFTRSQLEELEALFAKTQYPDVYAREEVALKINLPESRVQVWFKNRRAKCRQQR QQQKQQQQPPGGQAKARPAKRKAGTSPRPSTDVCPDPLGISDSYSPPLPGPSGSPTTAVA TVSIWSPASESPLPEAQRAGLVASGPSLTSAPYAMTYAPASAFCSSPSAYGSPSSYFSGL DPYLSPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGAYSPVDSLEFKDPTGTWKF TYNPMDPLDYKDQSAWKFQIL >gi568815579r:47701778_47902983|GENSCAN_predicted_CDS_6|966_bp atggtcaagaaagaagggatcttggatgagggccccctgacttgggcctcagtgtccccg aagatcatgatggcgtatatgaacccggggccccactattctgtcaacgccttggcccta agtggccccagtgtggatctgatgcaccaggctgtgccctacccaagcgcccccaggaag cagcggcgggagcgcaccaccttcacccggagccaactggaggagctggaggcactgttt gccaagacccagtacccagacgtctatgcccgtgaggaggtggctctgaagatcaatctg cctgagtccagggttcaggtttggttcaagaaccggagggctaaatgcaggcagcagcga cagcagcagaaacagcagcagcagcccccagggggccaggccaaggcccggcctgccaag aggaaggcgggcacgtccccaagaccctccacagatgtgtgtccagaccctctgggcatc tcagattcctacagtccccctctgcccggcccctcaggctccccaaccacggcagtggcc actgtgtccatctggagcccagcctcagagtcccctttgcctgaggcgcagcgggctggg ctggtggcctcagggccgtctctgacctccgccccctatgccatgacctacgccccggcc tccgctttctgctcttccccctccgcctatgggtctccgagctcctatttcagcggccta gacccctacctttctcccatggtgccccagctagggggcccggctcttagccccctctct ggcccctccgtgggaccttccctggcccagtcccccacctccctatcaggccagagctat ggcgcctacagccccgtggatagcttggaattcaaggaccccacgggcacctggaaattc acctacaatcccatggaccctctggactacaaggatcagagtgcctggaagtttcagatc ttgtag >gi568815579r:47701778_47902983|GENSCAN_predicted_peptide_7|133_aa MGIQLLTCTWGAGKDNDCPPLALDPPRRQRQERTVYTESQQKVLEFYFQKDQYPNYDQRL NLAEMLSLREQQLQSPYASNLSPDTQLYPDFTKLLPLLDRFEESSLSTTTSQYKEEDGFV DKNHSVPRSLLDL >gi568815579r:47701778_47902983|GENSCAN_predicted_CDS_7|402_bp atgggaatccagctcctgacatgcacctggggcgctgggaaagacaatgactgccctccc ctggccctggaccctccaaggagacagcggcaggagcgcacggtctacactgaaagccag cagaaagtgctagaattttactttcagaaggaccagtacccgaactacgaccagcgactg aatctggcggagatgctcagcctcagggagcaacagctgcagagcccctatgcctccaac ttgtcgccagacacccagttataccctgacttcaccaagctgctcccgctcctagaccgg ttcgaggaatcctcactctccaccacgacgtctcagtacaaagaggaggatggcttcgtg gacaaaaatcactcagtccccaggtcattactggatttatag >gi568815579r:47701778_47902983|GENSCAN_predicted_peptide_8|219_aa XNRNGGNKRNWNRSMRGIFKIQMSNQPEAVPTFRAQVQAIDYCQTWKEPKPDAGDKISGA HPSQSASAHVEPCLGFIYSFYNEMSHTYRKAVLYGSWFDHIHGWMPMREEKNFLLLSYEE LKQDTGRTIEKICQFLGKTLEPEELNLILKNSSFQSMKENKMSNYSLLSVDYVVDKAQLL RKGVSGDWKNHFTVAQAEDFDKLFQEKMADLPRELFPWE >gi568815579r:47701778_47902983|GENSCAN_predicted_CDS_8|660_bp nnaaatcggaatggggggaataaaagaaattggaataggagtatgagaggcatctttaag atacagatgtccaaccagcccgaagctgttccaaccttcagagcccaggtgcaagccata gattactgccagacatggaaagaacccaaaccagatgctggagataagatctcaggagct catcccagccaatctgccagtgcccacgtggagccatgtttaggttttatctattcattt tataatgaaatgtctcacacatatagaaaagcagtgctatatgggtcatggtttgaccac attcatggctggatgcccatgagagaggagaaaaacttcctgttactgagttatgaggag ctgaaacaggacacaggaagaaccatagagaagatctgtcaattcctgggaaagacgtta gaacccgaagaactgaacttaattctcaagaacagctcctttcagagcatgaaagaaaac aagatgtccaattattccctcctgagtgttgattatgtagtggacaaagcacaacttctg agaaaaggtgtatctggggactggaaaaatcacttcacagtggcccaagctgaagacttt gataaattgttccaagagaagatggcagatcttcctcgagagctgttcccatgggaataa