GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:54:10 Sequence gi568815596r:157637534_157899493 : 261960 bp : 39.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 8074 8113 40 -3.65 1.01 Init + 34629 34770 142 2 1 69 41 106 0.114 4.34 1.02 Intr + 37742 37899 158 0 2 46 55 164 0.034 7.71 1.03 Term + 52248 52472 225 2 0 12 42 206 0.055 4.20 1.04 PlyA + 53131 53136 6 1.05 2.00 Prom + 59303 59342 40 -4.15 2.01 Init + 67660 67949 290 0 2 91 71 162 0.701 9.92 2.02 Term + 70384 71272 889 1 1 24 42 286 0.328 8.54 2.03 PlyA + 71836 71841 6 1.05 3.00 Prom + 71953 71992 40 -5.65 3.01 Init + 72007 72366 360 0 0 62 42 230 0.602 12.92 3.02 Term + 78150 78176 27 2 0 106 41 34 0.440 -2.40 3.03 PlyA + 79008 79013 6 1.05 4.20 PlyA - 79491 79486 6 1.05 4.19 Term - 79892 79779 114 2 0 56 38 93 0.053 -1.31 4.18 Intr - 82242 81996 247 2 1 82 95 87 0.043 5.34 4.17 Intr - 99914 99738 177 1 0 48 97 105 0.042 5.51 4.16 Intr - 100772 100753 20 2 2 59 89 16 0.029 -6.71 4.15 Intr - 101037 100907 131 1 2 95 111 104 0.832 12.89 4.14 Intr - 112934 112625 310 2 1 50 16 166 0.059 0.56 4.13 Intr - 113842 113700 143 2 2 58 110 89 0.233 7.25 4.12 Intr - 123544 123347 198 2 0 105 78 185 0.351 17.60 4.11 Intr - 128663 128388 276 0 0 126 34 139 0.954 8.87 4.10 Intr - 132981 132835 147 1 0 87 97 24 0.838 2.49 4.09 Intr - 135328 135157 172 1 1 64 32 155 0.850 6.09 4.08 Intr - 136795 136742 54 1 0 99 84 43 0.702 3.26 4.07 Intr - 138070 137948 123 1 0 71 7 117 0.762 1.76 4.06 Intr - 140809 140598 212 2 2 93 92 177 0.932 16.31 4.05 Intr - 143067 142804 264 1 0 136 14 224 0.690 16.36 4.04 Intr - 161967 161894 74 2 2 107 115 40 0.050 6.73 4.03 Intr - 164871 164790 82 1 1 107 25 56 0.042 -1.02 4.02 Intr - 171157 171013 145 1 1 42 52 101 0.011 0.83 4.01 Init - 173247 173176 72 0 0 68 58 81 0.060 4.22 4.00 Prom - 178735 178696 40 -7.25 5.04 PlyA - 180575 180570 6 1.05 5.03 Term - 181026 180890 137 1 2 114 49 110 0.980 6.90 5.02 Intr - 182916 182761 156 1 0 50 53 92 0.591 0.96 5.01 Init - 192006 191856 151 0 1 54 48 104 0.678 3.25 5.00 Prom - 197066 197027 40 -5.75 6.00 Prom + 197132 197171 40 -5.65 6.01 Init + 203156 203398 243 1 0 40 52 181 0.566 7.48 6.02 Intr + 206534 206736 203 1 2 32 110 172 0.898 10.96 6.03 Intr + 222736 222803 68 1 2 48 110 66 0.263 2.43 6.04 Intr + 223252 223385 134 2 2 55 99 97 0.845 6.94 6.05 Intr + 229345 229508 164 0 2 57 82 79 0.277 2.05 6.06 Intr + 235382 235482 101 2 2 97 89 26 0.136 2.43 6.07 Term + 238762 238880 119 2 2 -62 38 224 0.079 0.12 6.08 PlyA + 239745 239750 6 1.05 7.00 Prom + 239755 239794 40 -6.55 7.01 Init + 240931 241180 250 0 1 63 106 130 0.530 10.03 7.02 Term + 257624 257856 233 0 2 54 42 123 0.314 -0.05 7.03 PlyA + 257894 257899 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:157637534_157899493|GENSCAN_predicted_peptide_1|174_aa MMRESEMNDVGVVMYVVLGYYLPSGATLERGSSASGDPGSWSHDDFDGFWLALKSGMPQN PILHHTCSHPEVRDKSLTSGKWQLKEELLSPFLPRVDDPQRIGQSFEDLKPGIDFSSLAM NVLGGIFFQYKAISSTLNRRCLVKPPSSIILARSGYLAADSTSALAASPCAFMF >gi568815596r:157637534_157899493|GENSCAN_predicted_CDS_1|525_bp atgatgagagaaagtgaaatgaatgacgtaggtgttgtgatgtatgtagtgttaggctat tatttaccttctggtgctacattagaaagaggatcatctgcttcaggtgatcctgggtca tggagccatgacgattttgatggcttttggttggcattgaagagcgggatgccccagaac cctattttacaccacacatgttcccaccctgaagttcgggacaaatctttgaccagtggg aagtggcagctcaaggaggaacttctttcaccgttcttacctcgggtagatgatcctcag agaattggtcagtcctttgaagatttgaagccaggcattgacttttcttctctggctatg aacgttctaggtggcatcttcttccagtataaggctatttcatctacactgaacagacgt tgtttagtgaagccaccttcatcaatcattttagctagatctggataccttgctgcagat tctacatcagcacttgcagcttcaccttgtgcttttatgttctga >gi568815596r:157637534_157899493|GENSCAN_predicted_peptide_2|392_aa MHLIQGSSGWHLVGAPLGQSFQREEQAAIFAILQPPLVIPRQTGSGVDLQQIPADLQQRG LTVRKKTNKQKGGASTSTKRTSTQKPHAKVTSIKDQRLNQEEVESLNRPIRSSEIEAIIK SLPTKKSPGPDRFTAEFYQRYKEELVPFLLKLFQTIEKEGLLPNSFYEASIILIPKPGRD ATKKENFRPISLMKIDAKILNKILANRIQQHIKKLIQHDQVGFIPGMQGWFNICKSINVI LHINRTNDKNHLIISIDAEKAFNKMQQLFMLKMLNKLGIDGTYLKIIRAIYDKPTANIIR NGQKLEASPLKTGTRKGCCLSPLLFNTVLDVLANAIRQEKEIKRIQIGREEVKLFLFADD MIVYLENPIISAQNLCKLISNFSKVSGYQINV >gi568815596r:157637534_157899493|GENSCAN_predicted_CDS_2|1179_bp atgcacctcatacaggggagctctggctggcatctggtgggtgcccctctgggacaaagc ttccaaagggaggaacaggcagcaatctttgctattctgcagcctccactggtgataccc aggcaaacagggtctggagtggacctccagcaaataccagcagacctgcagcagaggggc ctgactgttagaaagaaaactaacaaacagaaaggaggggcatcaacatcaacaaaaagg acgtccacacagaaaccccatgcaaaggtcaccagcatcaaagaccaaagactaaaccag gaagaagtcgaatccctgaatagaccaataagaagttctgaaattgaggcaataattaag agcctaccaaccaaaaaaagtccaggaccagacagattcacagctgaattctaccagagg tacaaagaggagctggtaccattccttctgaaactattccaaacaatagaaaaagaggga ctcctccctaactcattttatgaggccagcatcatcctgataccaaaacctggcagagac gcaacaaaaaaagaaaatttcaggccaatatccctgatgaaaatcgatgcaaaaatcctc aataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaacacgatcaa gttggcttcatccctgggatgcaaggctggttcaacatatgcaaatcaataaatgttatc cttcacataaacagaaccaatgacaaaaaccacttgattatctcaatagatgcagaaaag gctttcaataaaatgcaacaactcttcatgctaaaaatgctcaataaactgggtatagat ggaacatatctcaaaataataagagctatttatgacaaacccacagccaatatcatacgg aatgggcaaaagctggaagcatcccctttgaaaactggcacaagaaaaggatgctgtctc tctccactcctattcaacacagtgttggacgttctggccaacgcaatcaggcaagagaaa gaaataaagcgtattcaaataggaagagaggaagtcaaattgtttctgtttgcagatgac atgattgtatatttagaaaaccccatcatctcagcccaaaatctctgtaagctgataagc aacttcagcaaagtctcaggataccaaatcaatgtgtaa >gi568815596r:157637534_157899493|GENSCAN_predicted_peptide_3|128_aa MSKDFMTKTPKAMATEAKIDKWDLIKLKSFCTAKETIIRVDRQSTEWEKIFAIYPSDKGL ISRIYKELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYVANKQMKKTSSSPVIREMQIKTT SITVLSDV >gi568815596r:157637534_157899493|GENSCAN_predicted_CDS_3|387_bp atgagcaaagacttcatgactaaaacaccaaaagcaatggcaacagaagccaaaattgac aaatgggatctaattaaactaaaaagcttctgcacagcaaaagaaaccatcatcagagtg gacaggcaatctacagaatgggagaaaatttttgcaatctatccatctgacaaagggcta atatccagaatctacaaggaacttaaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggtgaaggatatgaacagacacttctcaaaagaagacatttatgtggccaac aaacagatgaaaaaaacctcatcatcaccggtcattagagaaatgcaaatcaaaaccaca agcatcacagtcctgtctgatgtttaa >gi568815596r:157637534_157899493|GENSCAN_predicted_peptide_4|986_aa MKRVDIGDLGAEEVVVVEIVAYDDEMYIQETDAVSEVIRDGSGPISWCLQLEIQAIPSNW DIVHQGVVWYTQAFLGIRRLFVTLPDPLCPGHSCVFSILSCTMVDGVMILPVLIMIALPS PSMEDEKPKVNPKLYMCVCEGLSCGNEDHCEGQQCFSSLSINDGFHVYQKGCFQVYEQGK MTCKTPPSPGQAVECCQGDWCNRNITAQLPTKGKSFPGTQNFHLEVGLIILSVVFAVCLL ACLLGVALRKFKRRNQERLNPRDVEYGTIEGLITTNVGDSTLAGSKFLPLDAQLLVILPI NSTSQWCSEEVKEDEIHTNQCIALRSFTCTLTGHVSRIAALHVYGYCHCLLLLASPPRQI VNGENKDGKAPNVLINFNQSCPLVLTSILFDYEQDYLDSGKGRYGEVWRGSWQGENVAVK IFSSRDEKSWFRETELYNTVMLRHENILGFIASDMTSRHSSTQLWLITHYHEMGSLYDYL QLTTLDTVSCLRIVLSIASGLAHLHIEIFGTQGKPAIAHRDLKSKNILVKKNGQCCIADL GLAVMHSQSTNQLDVGNNPRVGTKRYMAPEVLDETIQVDCFDSYKRVDIWAFGLVLWEVA RRMVSNVPPAALPKDRCKTSQKWLPQGPREPIGHFLLLPVPLYFIQLSKLSQLQAEINGG CGEGFLGIGMPGRLVLGHKWLWRQARHAGPLLLDGICEHQWNRMQWTHPCVSRHLAQGPA MAAVVDRLIIRLLGGLCGIDDDSSGMGQPLGLQAASLGIVEDYKPPFYDVVPNDPSFEDM RKVVCVDQQRPNIPNRWFSDPKGLDLEEMESICLPPQMAALTRQTSYPAMCWGDIKTTLT SLDDCELGISRTVHTAETNVGQTLLQRLSLATPKAKVSVQIELSSFVHILHFSIIVYDDL LRAKPVGGTGDKKQEVSLLLSKETAMQKCKGYSPCIRSPEPGALYCFGKKCTIGLGRKHS QPQGSKLKEVAKREGDERNQRRKQAF >gi568815596r:157637534_157899493|GENSCAN_predicted_CDS_4|2961_bp atgaagagggtggatattggagaccttggggctgaggaggtggtggtagttgaaattgtt gcttatgatgatgagatgtatattcaagagacagacgctgtctcagaggtcattagggat ggctctgggcctatcagctggtgtctgcaactagagatccaagcaatcccttccaactgg gatattgtccaccagggagttgtctggtacacccaagccttcttgggtattcgccgactc tttgtgactcttccagaccctttgtgtcctggccattcttgcgttttttcaatcctgagt tgtacaatggtagatggagtgatgattcttcctgtgcttatcatgattgctctcccctcc cctagtatggaagatgagaagcccaaggtcaaccccaaactctacatgtgtgtgtgtgaa ggtctctcctgcggtaatgaggaccactgtgaaggccagcagtgcttttcctcactgagc atcaacgatggcttccacgtctaccagaaaggctgcttccaggtttatgagcagggaaag atgacctgtaagaccccgccgtcccctggccaagccgtggagtgctgccaaggggactgg tgtaacaggaacatcacggcccagctgcccactaaaggaaaatccttccctggaacacag aatttccacttggaggttggcctcattattctctctgtagtgttcgcagtatgtctttta gcctgcctgctgggagttgctctccgaaaatttaaaaggcgcaaccaagaacgcctcaat ccccgagacgtggagtatggcactatcgaagggctcatcaccaccaatgttggagacagc actttagcaggttctaaatttcttccgttggatgctcagcttttggtaattttgccgatc aacagtacttcacagtggtgctcagaagaagtgaaagaagatgaaattcacacaaatcaa tgtattgctttaaggagctttacatgtacactaacaggccacgtgtcccggattgctgcc cttcatgtctacggctattgtcactgtttgctgcttctcgcttctcctccacggcagatt gtgaatggagaaaataaagatggaaaagccccaaatgtgctcattaatttcaaccagagt tgtcctcttgtactaactagtatcttatttgattatgaacaagattatctggatagtggg aaaggcaggtatggtgaggtgtggaggggcagctggcaaggggagaatgttgccgtgaag atcttctcctcccgtgatgagaagtcatggttcagggaaacggaattgtacaacactgtg atgctgaggcatgaaaatatcttaggtttcattgcttcagacatgacatcaagacactcc agtacccagctgtggttaattacacattatcatgaaatgggatcgttgtacgactatctt cagcttactactctggatacagttagctgccttcgaatagtgctgtccatagctagtggt cttgcacatttgcacatagagatatttgggacccaagggaaaccagccattgcccatcga gatttaaagagcaaaaatattctggttaagaagaatggacagtgttgcatagcagatttg ggcctggcagtcatgcattcccagagcaccaatcagcttgatgtggggaacaatccccgt gtgggcaccaagcgctacatggcccccgaagttctagatgaaaccatccaggtggattgt ttcgattcttataaaagggtcgatatttgggcctttggacttgttttgtgggaagtggcc aggcggatggtgagcaatgttcctccggcagcccttcccaaggaccgctgtaagacaagt cagaaatggcttccccagggaccaagagagcccatagggcatttcctgctgcttcctgtg cccctgtatttcattcagctctctaaattgtctcagctccaggctgaaattaatggaggc tgtggtgagggttttcttgggatagggatgccaggaagactggtccttgggcacaagtgg ttgtggcggcaagccaggcatgctggtcctctgctcctggatggcatatgtgagcaccaa tggaatcgaatgcaatggactcatccttgtgtctccaggcatcttgctcaagggccagca atggcagcggtggtagacaggctcatcatcaggcttctgggtggcttgtgtggcattgat gatgacagtagtggaatgggccagccattgggcctccaggcagcaagcttgggtatagtg gaggattacaagccaccgttctacgatgtggttcccaatgacccaagttttgaagatatg aggaaggtagtctgtgtggatcaacaaaggccaaacatacccaacagatggttctcagac ccgaagggtttagatctagaagaaatggaatccatctgtctccctccccaaatggctgct ttgacaaggcagacgtcgtacccagccatgtgttggggagacatcaaaaccaccctaacc tcgctcgatgactgtgaactgggcatttcacgaactgttcacactgcagagactaatgtt ggacagacactgttgcaaaggctcagcctggccactcccaaggctaaggtgtcagtgcag attgaactgtcctcattcgttcacatacttcatttttctataatcgtctatgatgatcta ctaagagccaagcctgtaggaggtacaggagacaaaaaacaagaggtatccttgctgctt tcaaaggagacagccatgcaaaaatgcaaaggatactctccgtgtatcaggagtcctgag ccaggggctctttattgctttggcaagaaatgtacaattggcttgggaagaaaacacagc cagccacaggggagcaagctcaaagaagttgcaaaacgagaaggagatgagagaaatcag agaaggaagcaagcattctga >gi568815596r:157637534_157899493|GENSCAN_predicted_peptide_5|147_aa MKIKKSTPARRTIDGFKQRRGTSERYSMNNLLHSWQGCRGDLEGVSWATAGRTAEHWPPL KMRLDFGRGESSFSVNLPDFPGPSILENTSISAVCSETQGFAAWSIGKRHTAKVRAAGEL IIPGTPLLLSEYPSDQSERSSERGHAA >gi568815596r:157637534_157899493|GENSCAN_predicted_CDS_5|444_bp atgaaaatcaagaaatctacaccagctaggagaaccattgatggctttaagcagaggaga ggcactagtgagcgttactcaatgaacaatttactacactcatggcagggatgtagagga gaccttgaaggagtcagttgggccactgctggcaggacagctgaacactggcctcctctt aaaatgcggttagactttggcagaggggaaagcagctttagtgtgaatctgccagatttc cctggccccagcatcttggaaaataccagtatttcagcagtttgttctgaaactcaaggg tttgcggcctggagcattggtaagcgtcacactgccaaagtgagagctgctggagaactc ataatcccaggaacgcctcttctactctccgagtaccccagtgaccagagtgagagaagc tctgaacgagggcacgcggcttga >gi568815596r:157637534_157899493|GENSCAN_predicted_peptide_6|343_aa MSVKPAEKKARKPAWGQGAPALRLGSGQPKHKHLQAVSRQGDHAPHPAAVHFIINYCLAS KGRVRFKNTGNFKAGGIFRTMGRQRRPCHAEDIVKPSSPYGKQGWGPPSAWPSNCAVGDQ VQVLTRVSSRFVKPTGKDSSEDEDCQEPSESLASQDIPQSPLNPDSQSPYSGGRLSKLSH PLPLPTLKSNVSQTFRSIQITGGYCCKADSDPAGFETSTHFFPFPKGSSPPGHSHPLTNH QYKSNSQYKAKAPNPISRVSVPFSQLLTGLKSLQHTHTPHNHYSEVGVSVQVQTHIYDYL LFLTRGGCGGGGGCKEQEPEREEREEEEEEVAAERSANECLEL >gi568815596r:157637534_157899493|GENSCAN_predicted_CDS_6|1032_bp atgtcagtgaagccagctgagaaaaaggccaggaagccagcatgggggcagggagcacct gctctccggctgggttcaggtcagccaaagcacaagcacctgcaggctgtgagccgccaa ggcgatcacgcgccacatcctgcagccgtccactttatcattaactactgtctggccagc aagggcagggtccgcttcaagaacactgggaactttaaggcaggaggcatattcagaacc atgggcaggcaaaggaggccctgtcatgcagaagacattgtgaaaccatcatctccttat ggaaagcaaggctgggggccgccctcggcatggccctccaactgtgctgtgggagatcaa gttcaagtgctcacaagagtgtcttccaggtttgtaaaacccactggaaaggattccagt gaagatgaagactgccaagagcccagtgaaagtctggcatcccaggatatccctcagtct ccgctaaacccggacagtcagtcaccctactcaggtggtcgactttccaagctctcccat cctttaccccttcctacgctcaagtccaatgtttctcaaacttttagaagcattcagatc actgggggatactgctgcaaagctgattctgatccagcaggatttgagacctcaacccac ttcttcccatttccaaaaggaagctctcccccaggacattctcatcctctcaccaatcac cagtacaaatccaatagccaatacaaagccaaagccccaaaccctatctccagagtttca gttccattttcccagcttctcacagggctcaagtccctgcaacatacacacactccacat aatcattattctgaagttggggtttcggtgcaagtacaaacgcatatttatgactacctc ctattcctaacaaggggaggctgcggcggcggcggcggctgcaaagagcaggagccggag cgggaggagcgggaggaggaggaggaggaagtggctgcggagcgctcggccaatgagtgt ctggagctgtga >gi568815596r:157637534_157899493|GENSCAN_predicted_peptide_7|160_aa MESFLLLLSATSYFVYGVGVLSPLCFLNKLAFTLHCGLALNSFLREIQEPSLGVWIGTPA LKHIPGNHRRDFSAETLTQRLPLVQTRTDGEKLCWKGLWRATEKRNMTTQCVTHQLAAAT SPSTQRAAYGTSPGAQEERAYCRKGAQLENIRFTCVSLKI >gi568815596r:157637534_157899493|GENSCAN_predicted_CDS_7|483_bp atggaatcgttcctcctcttactttcggcaacatcctactttgtctatggagtaggtgtc ctttcaccactttgctttcttaataaactggcttttactttacactgcggactcgccctg aattctttcttgcgtgagatccaagaaccctctcttggggtctggattgggacccctgcc ctaaaacatattcctggcaaccacagaagggactttagtgcagaaaccctgacccaacgg ctacctttggtgcaaacgagaacagatggggaaaagttatgttggaaagggctttggagg gccacagagaagagaaatatgactactcaatgtgtcacacaccaacttgcagctgccacc agcccttccacacaacgagctgcgtatggaaccagcccgggtgctcaggaggagcgggct tactgcaggaagggagcgcaattagagaacattcgctttacttgtgtttccttgaaaata tga