GENSCAN 1.0	Date run: 5-Nov-116	Time: 07:53:24

Sequence gi568815579f:47734444_47939964 : 205521 bp : 48.99% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1926   2090  165  2  0   50   75   295 0.996  24.56
 1.02 Term +   6438   6989  552  2  0   85   52  1283 0.975 118.71
 1.03 PlyA +   8669   8674    6                                1.05

 2.00 Prom +  10624  10663   40                               -8.16
 2.01 Init +  11117  11340  224  1  2  101   87   204 0.971  19.63
 2.02 Intr +  12524  12588   65  2  2  100  100   107 0.963  11.46
 2.03 Intr +  15735  15843  109  1  1  116   70    86 0.997   8.94
 2.04 Intr +  16465  16664  200  1  2   73   79   272 0.982  23.79
 2.05 Intr +  17077  17147   71  2  2  102  100    72 0.974   8.70
 2.06 Intr +  18069  18164   96  2  0   99   86    60 0.956   7.11
 2.07 Intr +  20084  20188  105  1  0   89   99   178 0.999  19.41
 2.08 Intr +  20266  20448  183  0  0  125   93   178 0.995  21.98
 2.09 Intr +  20905  21080  176  0  2  123   52   186 0.999  17.24
 2.10 Intr +  21313  21379   67  1  1  115   42   134 0.838  10.41
 2.11 Intr +  22085  22161   77  1  2  114   80   100 0.999  10.11
 2.12 Intr +  22245  22301   57  0  0   98   81   143 0.582  12.60
 2.13 Intr +  44322  44502  181  0  1   43   31    93 0.053  -1.03
 2.14 Intr +  46421  46474   54  1  0   93   78   106 0.992   9.28
 2.15 Intr +  46665  46739   75  2  0   96   95    32 0.938   4.41
 2.16 Term +  46847  46927   81  1  0  137   38   187 0.999  16.29
 2.17 PlyA +  50217  50222    6                                1.05

 3.02 PlyA -  51619  51614    6                                1.05
 3.01 Sngl -  60459  60112  348  0  0   68   37   373 0.999  26.24
 3.00 Prom -  61453  61414   40                               -4.26

 4.04 PlyA -  61496  61491    6                                1.05
 4.03 Term -  63747  63589  159  0  0  102   43   117 0.446   6.54
 4.02 Intr -  69230  69061  170  0  2  126   41   251 0.629  23.87
 4.01 Init -  84661  84529  133  1  1   78   66   122 0.753   9.30
 4.00 Prom -  95577  95538   40                               -6.26

 5.00 Prom +  95644  95683   40                               -8.56
 5.01 Init +  99328  99358   31  0  1   49   61    50 0.456  -1.69
 5.02 Intr +  99966 100100  135  1  0  111   89   106 0.890  13.54
 5.03 Intr + 101800 101951  152  2  2   84   75   225 0.982  20.68
 5.04 Term + 104877 105524  648  2  0  127   44   372 0.999  30.78
 5.05 PlyA + 106684 106689    6                                1.05

 6.00 Prom + 112289 112328   40                               -6.96
 6.01 Init + 115397 115448   52  1  1   92   46    32 0.275   0.75
 6.02 Intr + 125670 125839  170  1  2  126   41   251 0.999  23.87
 6.03 Term + 126890 127069  180  1  0   43   50   200 0.611   9.31
 6.04 PlyA + 127938 127943    6                                1.05

 7.05 PlyA - 128126 128121    6                                1.05
 7.04 Term - 137124 137012  113  1  2  133   39   136 0.998  11.92
 7.03 Intr - 140391 140214  178  0  1  113   93   120 0.999  14.49
 7.02 Intr - 144687 144593   95  1  2   68   89   115 0.944   9.28
 7.01 Init - 151814 151679  136  2  1   73  110     0 0.425   1.00
 7.00 Prom - 153959 153920   40                               -5.76

 8.00 Prom + 166563 166602   40                               -5.16
 8.01 Init + 167624 167672   49  1  1   86   58    34 0.087  -0.69
 8.02 Intr + 168209 168269   61  0  1   80   73    45 0.047  -0.01
 8.03 Intr + 172229 172420  192  2  0   81   53    89 0.064   3.31
 8.04 Intr + 177605 177796  192  2  0   89   43   112 0.233   5.41
 8.05 Intr + 182978 183169  192  2  0   81   53   116 0.255   6.01
 8.06 Intr + 188329 188520  192  1  0   81   53    92 0.457   3.61
 8.07 Intr + 193373 193564  192  2  0   89   43   110 0.643   5.21
 8.08 Intr + 198731 198922  192  2  0   81   53    92 0.009   3.61
 8.09 Intr + 204086 204277  192  2  0   89   43   112 0.007   5.41
 8.10 Intr + 204974 205083  110  2  2   86   82    31 0.005   2.23

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:47734444_47939964|GENSCAN_predicted_peptide_1|238_aa VHAYIISYLKKEMPSVFGKENKKKQLILKLPVIFAKIQLEHHISPGDFPDCQKMQELLMA HDFTKFHSLKPKLLEALDEMLTHDIAKLMPLLRQEELESTEVGVQGGAFEGTHMGPFVER GPDEAMEDGEEGSDDEAEWVVTKDKSKYDEIFYNLAPADGKLSGSKAKTWMVGTKLPNSV LGRIWKLSDVDRDGMLDDEEFALASHLIEAKLEGHGLPANLPRRLVPPSKRRHKGSAE >gi568815579f:47734444_47939964|GENSCAN_predicted_CDS_1|717_bp gttcacgcttacatcatcagctacctgaagaaggagatgccctctgtgtttgggaaggag aacaagaagaagcagctgatcctcaaactgcccgtcatctttgcgaagattcagctggaa catcacatctcccctggggactttcctgattgccagaaaatgcaggagctgctgatggcg cacgacttcaccaagtttcactcgctgaagccgaagctgctagaggcactggacgagatg ctgacgcacgacatcgccaagctcatgcccctgctgcggcaggaggagctggagagcacc gaggtgggcgtgcaggggggcgcttttgagggcacccacatgggcccgtttgtggagcgg ggacctgacgaggccatggaggacggcgaggagggctcggacgacgaggccgagtgggtg gtgaccaaggacaagtccaaatacgacgagatcttctacaacctggcgcctgccgacggc aagctgagcggctccaaggccaagacctggatggtggggaccaagctccccaactcagtg ctggggcgcatctggaagctcagcgatgtggaccgcgacggcatgctggatgatgaggag ttcgcgctggccagccacctcatcgaggccaagctggaaggccacgggctgcccgccaac ctgccccgtcgcctggtgccaccctccaagcgacgccacaagggctccgccgagtga >gi568815579f:47734444_47939964|GENSCAN_predicted_peptide_2|606_aa MAAGGSGVGGKRSSKSDADSGFLGLRPTSVDPALRRRRRGPRNKKRGWRRLAQEPLGLEV DQFLEDVRLQERTSGGLLSEAPNEKLFFVDTGSKEKGLTKKRTKVQKKSLLLKKPLRVDL ILENTSKVPAPKDVLAHQVPNAKKLRRKEQLWEKLAKQGELPREVRRAQARLLNPSATRA KPGPQDTVERPFYDLWASDNPLDRPLVGQDEFFLEQTKKKGVKRPARLHTKPSQAPAVEV APAGASYNPSFEDHQTLLSAAHEVELQRQKEAEKLERQLALPATEQAATQESTFQELCEG LLEESDGEGEPGQGEGPEAGDAEVCPTPARLATTEKKTEQQRRREKAVHRLRVQQAALRA ARLRHQELFRLRGIKAQVALRLAELARRQRRRQARREAEADKPRRLGRLKYQAPDIDVQL SSELTDSLRTLKPEGNILRDRFKSFQRRNMIEPRERAKFKRKYKVKLVEKRAFREIHGCG SPEPWLSPSESFIGKPSGQRPPSPTPAGTRFSEPGSGSPGERTHGSPCMGKGSPCMWYLQ LKKKLEDEFPGRLDICGEGTPQATGFFEVMVAGKLIHSKKKGDGYVDTESKFLKLVAAIK AALAQG >gi568815579f:47734444_47939964|GENSCAN_predicted_CDS_2|1821_bp 
atggcggcaggaggcagtggcgttggtgggaagcgcagctcgaaaagcgatgccgattct ggtttcctggggctgcggcccacttcggtggacccagcgctgaggcggcggcggcgaggc ccaagaaataagaagcggggctggcggcggcttgctcaggagccgctggggctggaggtt gaccagttcctggaagacgtgcggctacaggagcgcacgagcggtggcttgttgtcagag gccccaaatgaaaaactcttcttcgtggacactggctccaaggaaaaagggctgacaaag aagagaaccaaagtccagaagaagtcactgcttctcaagaaaccccttcgggttgacctc atcctcgagaacacatccaaagtccctgcccccaaagacgtcctcgcccaccaggtcccc aacgccaagaagctcaggcggaaggagcagctatgggagaagctggccaagcagggcgag ctgccccgggaggtgcgcagggcccaggcccggctcctcaacccttctgcaacaagggcc aagcccgggccccaggacaccgtagagcggcccttctacgacctctgggcctcagacaac cccctggacaggccgttggttggccaggatgagtttttcctggagcagaccaagaagaaa ggagtgaagcggccagcacgcctgcacaccaagccgtcccaggcacccgccgtggaggtg gcgcctgccggagcttcctacaatccatcctttgaagaccaccagaccctgctctcagcg gcccacgaggtggagttgcagcggcagaaggaggcggagaagctggagcggcagctggcc ctgcccgccacggagcaggccgccacccaggagtccacattccaggagctgtgcgagggg ctgctggaggagtcggatggtgagggggagccaggccagggcgaggggccggaggctggg gatgccgaggtctgtcccacgcccgcccgcctggccaccacagagaagaagacggagcag cagcggcggcgggagaaggctgtgcacaggctgcgggtacagcaggccgcgttgcgggcc gcccggctccggcaccaggagctgttccggctgcgcgggatcaaggcccaggtggccctg aggctggcggagctggcgcggcggcagaggcggcggcaggcgcggcgggaggctgaggct gacaagccccgaaggctggggcggctcaagtaccaggcacctgacatcgacgtgcagctg agctcggagctgacagactcgctcaggaccctgaagcccgagggcaacatccttcgagac cggttcaagagcttccagaggaggaatatgatcgagcctcgagagagagccaagttcaaa cgcaagtacaaggtgaagctggtggagaagcgggcgttccgtgagatccacggatgtggc agccccgagccatggctctcgccgtccgagtcgtttattggtaagcccagcggccagcgg cccccgtccccgacccccgccgggacccgattctcggagccggggtcagggagccccggg gagaggacccatgggagcccttgtatgggaaaagggagcccctgtatgtggtatcttcag ctcaagaagaagttagaagatgagttccccggccgcctggacatctgcggcgagggaact ccccaggccaccgggttctttgaagtgatggtagccgggaagttgattcactctaagaag aaaggcgatggctacgtggacacagaaagcaagtttctgaagttggtggccgccatcaaa gccgccttggctcagggctaa >gi568815579f:47734444_47939964|GENSCAN_predicted_peptide_3|115_aa MLPKAKEAPASPKAQAKPKALKAKKAVLKAVHSHTEEKIRRSPTFRRPKTARLRREPKYP 
QKSAPWRNKLGHSAIITFPPTTESAMKKTDDNNTLAFIIDVKAKEHQIKQAVKKL >gi568815579f:47734444_47939964|GENSCAN_predicted_CDS_3|348_bp atgctgccgaaagcgaaggaagctcctgcctctcctaaagcccaagccaaaccgaaggct ttaaaggccaagaaagcagtgttgaaagccgtccacagccacacagaagagaagatccgc aggtcacccaccttcaggcggcccaagacagcgcgactccggagggagcccaaatatcct cagaagagcgccccctggagaaacaagcttggccactctgcgatcatcacgtttccgccg accactgagtccgccatgaagaagacagacgacaacaacacacttgccttcattatagat gttaaagccaaggagcaccagatcaaacaggctgtgaagaagctctga >gi568815579f:47734444_47939964|GENSCAN_predicted_peptide_4|153_aa MEYYAAIKKDEFMSYAGTWMKLETIILSKLSQGQKTKHRMLSLIGPPLALDPPRRQRQER TVYTESQQKVLEFYFQKDQYPNYDQRLNLAEMLSLREQQLQSPVRRHLQNLSRVTEELKG RWDILVDDVKTLLKPVLAPSHASLLAVRLQLQQ >gi568815579f:47734444_47939964|GENSCAN_predicted_CDS_4|462_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctatgcagggacatggatg aagctggaaaccatcattctgagcaaactatcacaaggacagaaaaccaaacaccgcatg ctctcgctcataggccctcccctggccctggaccctccaaggagacagcggcaggagcgc acggtctacactgaaagccagcagaaagtgctagaattttactttcagaaggaccagtac ccgaactacgaccagcgactgaatctggcggagatgctcagcctcagggagcaacagctg cagagcccggtgaggcggcatctgcagaatctcagccgggtcacagaagagttgaagggg agatgggacatattggtggatgatgtcaaaacccttctcaagcccgtgctggcaccttct catgcctccttactggccgtgagacttcagttacagcagtag >gi568815579f:47734444_47939964|GENSCAN_predicted_peptide_5|321_aa MVKKEGILDEGPLTWASVSPKIMMAYMNPGPHYSVNALALSGPSVDLMHQAVPYPSAPRK QRRERTTFTRSQLEELEALFAKTQYPDVYAREEVALKINLPESRVQVWFKNRRAKCRQQR QQQKQQQQPPGGQAKARPAKRKAGTSPRPSTDVCPDPLGISDSYSPPLPGPSGSPTTAVA TVSIWSPASESPLPEAQRAGLVASGPSLTSAPYAMTYAPASAFCSSPSAYGSPSSYFSGL DPYLSPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGAYSPVDSLEFKDPTGTWKF TYNPMDPLDYKDQSAWKFQIL >gi568815579f:47734444_47939964|GENSCAN_predicted_CDS_5|966_bp atggtcaagaaagaagggatcttggatgagggccccctgacttgggcctcagtgtccccg aagatcatgatggcgtatatgaacccggggccccactattctgtcaacgccttggcccta agtggccccagtgtggatctgatgcaccaggctgtgccctacccaagcgcccccaggaag cagcggcgggagcgcaccaccttcacccggagccaactggaggagctggaggcactgttt gccaagacccagtacccagacgtctatgcccgtgaggaggtggctctgaagatcaatctg 
cctgagtccagggttcaggtttggttcaagaaccggagggctaaatgcaggcagcagcga cagcagcagaaacagcagcagcagcccccagggggccaggccaaggcccggcctgccaag aggaaggcgggcacgtccccaagaccctccacagatgtgtgtccagaccctctgggcatc tcagattcctacagtccccctctgcccggcccctcaggctccccaaccacggcagtggcc actgtgtccatctggagcccagcctcagagtcccctttgcctgaggcgcagcgggctggg ctggtggcctcagggccgtctctgacctccgccccctatgccatgacctacgccccggcc tccgctttctgctcttccccctccgcctatgggtctccgagctcctatttcagcggccta gacccctacctttctcccatggtgccccagctagggggcccggctcttagccccctctct ggcccctccgtgggaccttccctggcccagtcccccacctccctatcaggccagagctat ggcgcctacagccccgtggatagcttggaattcaaggaccccacgggcacctggaaattc acctacaatcccatggaccctctggactacaaggatcagagtgcctggaagtttcagatc ttgtag >gi568815579f:47734444_47939964|GENSCAN_predicted_peptide_6|133_aa MGIQLLTCTWGAGKDNDCPPLALDPPRRQRQERTVYTESQQKVLEFYFQKDQYPNYDQRL NLAEMLSLREQQLQSPYASNLSPDTQLYPDFTKLLPLLDRFEESSLSTTTSQYKEEDGFV DKNHSVPRSLLDL >gi568815579f:47734444_47939964|GENSCAN_predicted_CDS_6|402_bp atgggaatccagctcctgacatgcacctggggcgctgggaaagacaatgactgccctccc ctggccctggaccctccaaggagacagcggcaggagcgcacggtctacactgaaagccag cagaaagtgctagaattttactttcagaaggaccagtacccgaactacgaccagcgactg aatctggcggagatgctcagcctcagggagcaacagctgcagagcccctatgcctccaac ttgtcgccagacacccagttataccctgacttcaccaagctgctcccgctcctagaccgg ttcgaggaatcctcactctccaccacgacgtctcagtacaaagaggaggatggcttcgtg gacaaaaatcactcagtccccaggtcattactggatttatag >gi568815579f:47734444_47939964|GENSCAN_predicted_peptide_7|173_aa MSDDFLWFEGIAFPTMGFRSETLRKVRDEFVIRDEDVIILTYPKSVLYGSWFDHIHGWMP MREEKNFLLLSYEELKQDTGRTIEKICQFLGKTLEPEELNLILKNSSFQSMKENKMSNYS LLSVDYVVDKAQLLRKGVSGDWKNHFTVAQAEDFDKLFQEKMADLPRELFPWE >gi568815579f:47734444_47939964|GENSCAN_predicted_CDS_7|522_bp atgtcggacgatttcttatggtttgaaggcatagctttccctactatgggtttcagatcc gaaaccttaagaaaagtacgtgatgagttcgtgataagggatgaagatgtaataatattg acttaccccaaatcagtgctatatgggtcatggtttgaccacattcatggctggatgccc atgagagaggagaaaaacttcctgttactgagttatgaggagctgaaacaggacacagga agaaccatagagaagatctgtcaattcctgggaaagacgttagaacccgaagaactgaac 
ttaattctcaagaacagctcctttcagagcatgaaagaaaacaagatgtccaattattcc ctcctgagtgttgattatgtagtggacaaagcacaacttctgagaaaaggtgtatctggg gactggaaaaatcacttcacagtggcccaagctgaagactttgataaattgttccaagag aagatggcagatcttcctcgagagctgttcccatgggaataa >gi568815579f:47734444_47939964|GENSCAN_predicted_peptide_8|522_aa MGFHHVGQAGLQLLTSGSSILPEDLQNLQIFLLEKPHHTGLLTVPVAAGMCPHWASVLAV PTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQHFQDVVCHTGLLTVPVAAGMCPHRAS VLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTGLLTVPVAAGMCP HWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTGLLTVPVAA GMCPHWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQHFQDVVCHTGLLTV PVEAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTG LLTVPVAAGMCPHWASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQHFQDVV CHTGLLTVPVAAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDF QDVVWGRHGTQGPHWPWGLGDVRGEPLVIGSLTLRKIKGPKX >gi568815579f:47734444_47939964|GENSCAN_predicted_CDS_8|1566_bp atggggtttcaccatgttggccaggctggtctccaactcctgacctcaggttcatcaatc ttgccagaagatctgcagaatctgcagatattcttactagagaagccgcaccacacgggg ctcctcactgttcccgtagcagcaggcatgtgcccccactgggcctctgtactggctgtt cccactgcccgaacaccctcatgcaccatctgcactgtccaatatggccgcctctggcca cacatggctactgagcagttgaacatggctggtccaaaccaacatttccaagacgtcgta tgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcctct gtactggctgttcccactgcccgaacaccctcatgcaccatctgcactgtccaatacggc cgcctctggccacacatggctactgagcagttgaacatggctggtccaaaccaagatttc caagacgtcgtgtgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccc cactgggcctctgtactggctgttcccactgcccgaacaccctcatgcaccatctgcact gtccaatacggccgcctctggccacacatggctactgagcagttgaacatggctggtcca aaccaagatttccaagacgtcgtatgccacacggggctcctcactgttcccgtagcagca ggcatgtgcccccactgggcctctgtactggctgttcccactgcccgaacaccctcatgc accatctgcactgtccaatacggccgcctctggccacacatggctactgagcagttgaac atggctggtccaaaccaacatttccaagacgtcgtatgccacacggggctcctcactgtt cccgtagaagcaggcatgtgcccccacagggcctctgtactggctgttcccactgcccga acaccctcatgcaccatctgcactgtccaatacggccgcctctggccacacatggctact gagcagttgaacatggctggtccaaaccaagatttccaagacgtcgtgtgccacacgggg 
ctcctcactgttcccgtagcagcaggcatgtgcccccactgggcctctgtactggctgtt cccactgcccgaacaccctcatgcaccatctgcactgtccaatacggccgcctctggcca cacatggctactgagcagttgaacatggctggtccaaaccaacatttccaagacgtcgta tgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcctct gtactggctgttcccactgcccgaacaccctcatgcaccatctgcactgtccaatacggc cgcctctggccacacatggctactgagcagttgaacatggctggtccaaaccaagatttc caagacgtcgtgtgggggcgccatggaacgcagggccctcactggccctggggactgggt gacgtcaggggtgagcctctggtgattggctccctcaccctgcgtaagatcaaagggcct aaagnn