GENSCAN 1.0	Date run:  2-Nov-116	Time: 19:12:35

Sequence gi568815581f:5980659_6220658 : 240000 bp : 46.77% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  22096  22239  144  2  0   26  102    81 0.027   3.55
 1.02 Intr +  54434  54592  159  0  0   67   75    79 0.101   4.46
 1.03 Term +  64074  64204  131  1  2   74   37    97 0.419   1.54
 1.04 PlyA +  64658  64663    6                               1.05

 2.00 Prom +  69304  69343   40                              -2.26
 2.01 Init +  71755  71826   72  0  0   86   59    63 0.649   4.27
 2.02 Term +  78845  78916   72  1  0   89   55    38 0.117  -1.49
 2.03 PlyA +  83308  83313    6                               1.05

 3.04 PlyA -  83417  83412    6                               1.05
 3.03 Term -  83790  83581  210  0  0   81   42   173 0.908   9.29
 3.02 Intr -  84783  84747   37  2  1   67   56    23 0.342  -4.74
 3.01 Init -  86160  86057  104  0  2   46   61   136 0.744   6.41
 3.00 Prom -  88989  88950   40                              -6.46

 4.00 Prom +  92670  92709   40                              -1.36
 4.01 Init + 100001 100427  427  1  1   83   80   608 0.736  55.97
 4.02 Intr + 107332 107446  115  2  1  102  101   126 0.995  14.81
 4.03 Intr + 109663 109847  185  1  2   97   75   344 0.988  33.43
 4.04 Intr + 114444 114565  122  1  2  102   69    92 0.915   8.91
 4.05 Intr + 118931 119016   86  1  2  111   81    53 0.932   5.52
 4.06 Intr + 126940 127063  124  1  1  -10   59    98 0.105  -2.31
 4.07 Intr + 128949 129108  160  2  1   91   98   107 0.359  11.56
 4.08 Intr + 130113 130277  165  1  0   54  115    77 0.788   6.93
 4.09 Intr + 137330 137530  201  0  0   82   84   230 0.957  21.26
 4.10 Term + 139651 140003  353  2  2  109   44   765 0.902  68.85
 4.11 PlyA + 142493 142498    6                               1.05

 5.04 PlyA - 143763 143758    6                               1.05
 5.03 Term - 174629 174519  111  2  0   79   42   110 0.973   4.06
 5.02 Intr - 174933 174841   93  0  0   89  100    42 0.559   5.66
 5.01 Init - 182597 182541   57  2  0   85   42   121 0.444   6.51
 5.00 Prom - 185685 185646   40                              -5.46

 6.00 Prom + 186103 186142   40                              -2.66
 6.01 Init + 189683 189732   50  1  2   86   72    65 0.933   5.12
 6.02 Intr + 193596 193723  128  0  2   37   48   165 0.620   7.72
 6.03 Intr + 193945 194081  137  2  2   91   66    32 0.487   1.59
 6.04 Intr + 196747 196879  133  0  1   34   92    68 0.391   2.02
 6.05 Term + 198464 198630  167  0  2   67   43    59 0.444  -2.62
 6.06 PlyA + 199128 199133    6                               1.05

 7.02 PlyA - 199772 199767    6                               1.05
 7.01 Sngl - 213910 213047  864  1  0   36   42  1165 0.587 102.58
 7.00 Prom - 217174 217135   40                              -0.36

 8.00 Prom + 222266 222305   40                              -5.36
 8.01 Init + 223183 223322  140  0  2   78   65    71 0.060   3.41
 8.02 Term + 236273 236318   46  2  1  138   41    51 0.007   2.08
 8.03 PlyA + 238673 238678    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:5980659_6220658|GENSCAN_predicted_peptide_1|144_aa
XHCQVVESTCWIVLATVDEPLIQPVPVMGLPMVSTALVSVLSSRLGTKGSDSHLPIGHCG
QSGPSKHSIDSGAPQLRAPGGSYLSFMIKSMPLTWPSAPFTGSHGKLWDLEAMNQLYLAG
KKSVTKSVVAFGLGKEKCRKELIQ

>gi568815581f:5980659_6220658|GENSCAN_predicted_CDS_1|435_bp
ngccattgccaggtcgtagagagcacctgttggatcgtcttggccacagtggatgagccc
ctgattcaacctgtaccggtcatggggcttcccatggtgtccactgccctggtcagtgtg
cttagctcaaggttgggcactaaaggcagtgacagccatcttccaattggccactgtggc
cagagcggcccatcaaagcacagcattgactcaggagctccccagctcagggcccctggt
ggctcctacctctccttcatgatcaagtccatgcctctcacctggccctcagcgcctttc
acagggagtcatgggaagctatgggacctagaggccatgaatcagctttaccttgcaggc
aagaaaagtgttacaaagagtgtagtagccttcggtttgggaaaagaaaagtgccgaaaa
gaattaatacagtaa

>gi568815581f:5980659_6220658|GENSCAN_predicted_peptide_2|47_aa
MVNIPEAQDEEGWGTDEIEDEGSALGGNWVEAVREPDVHLSPVSPQV

>gi568815581f:5980659_6220658|GENSCAN_predicted_CDS_2|144_bp
atggtgaatatacctgaagcgcaggatgaggagggatggggaacagatgagattgaagac
gaaggcagtgctcttggtgggaactgggtggaggctgtcagagagcctgatgtacacctg
agccctgtcagccctcaggtgtga

>gi568815581f:5980659_6220658|GENSCAN_predicted_peptide_3|116_aa
MTRNLSVEEFREGFLEEVTCKLAFRGEVGIIWVTSLAVCSTKMLAHKGQQLGKSEGQKAL
RDHRLQKLVGLEAEPVGSAIHWKVHGYWGSKKALVQEDGKSGPCFAPPAYLGCTSQ

>gi568815581f:5980659_6220658|GENSCAN_predicted_CDS_3|351_bp
atgactcgcaacctcagtgtggaggaattcagggaaggcttcctggaggaggtgacatgt
aagctggcttttagaggagaagtgggcatcatctgggtcaccagcttggctgtatgctcc
accaagatgttggcccacaaggggcagcagctgggcaagtctgagggacagaaagcattg
agggaccacagattgcagaagttggtggggttagaggcagagcctgtgggatcagccatc
cactggaaggtgcacgggtactggggcagcaagaaagccctggtgcaagaggatggcaaa
agtggtccctgctttgctcctcctgcctacctggggtgcaccagccagtag

>gi568815581f:5980659_6220658|GENSCAN_predicted_peptide_4|645_aa
MAKPFFRLQKFLRRTQFLLFFLTAAYLMTGSLLLLQRVRVALPQGPRAPGPLQTLPVAAV
ALGVGLLDSRALHDPRVSPELLLGVDMLQSPLTRPRPGPRWLRSRNSELRQLRRRWFHHF
MSDSQGPPALGPEAARPAIHSRGTYIGCFSDDGHERTLKGAVFYDLRKMTVSHCQDACAE
RSYVYAGLEAGAECYCGNRLPAVSVGLEECNHECKGEKGSVCGAVDRLSVYRVDELQPGS
RKRRTATYRGCFRLPENITHAFPSSLIQANVTVGTCSGFCSQKVYRVSWPPPDNTMCSEH
HDDLHLHIKGLSSAHSSQHGQRYLRHRWAAVCRGHPGCLFLLLPVMKVLVWNEEFPLAIL
RGWECYCAYPTPRFNLRDAMDSSVCGQDPEAQRLAEYCEVYQTPVQDTRCTDRRFLPNKS
KVFVALSSFPGAGNTWARHLIEHATGFYTGSYYFDGTLYNKGFKGEKDHWRSRRTICVKT
HESGRREIEMFDSAILLIRNPYRSLVAEFNRKCAGHLGYAADRNWKSKEWPDFVNSYASW
WSSHVLDWLKYGKRLLVVHYEELRRSLVPTLREMVAFLNVSVSEERLLCVENNKEGSFRR
RGRRSHDPEPFTPEMKDLINGYIRTVDQALRDHNWTGLPREYVPR

>gi568815581f:5980659_6220658|GENSCAN_predicted_CDS_4|1938_bp
atggccaaacctttcttccgactccagaagtttctccgccgaacacagttcctgctgttc
ttcctcacggctgcctacctgatgaccggcagcctgctgctgctgcagcgggtccgcgtg
gctctcccacagggcccccgggcacccggccccctgcagaccttgccagtggccgccgtg
gcgctgggcgtgggcttgctggacagcagagccctgcacgaccctcgagtcagcccagag
ctgctgctgggtgtggacatgctgcagagccccctgacccggccccggcccggcccccgc
tggctccggagccgcaactcggagctgcgtcagttgcgtcgccgctggttccaccacttc
atgagtgactcccagggaccgcccgccctgggccccgaggctgccaggcccgccatccac
agccgaggcacctacattggatgcttcagtgacgatggccacgagaggactctgaaagga
gctgtgttttatgacttgagaaagatgactgtctcccactgccaggatgcgtgtgctgag
cggtcctatgtctacgccggcttggaggccggggcggagtgttactgcgggaaccggctg
ccagcggtgagcgtggggctggaagagtgtaaccatgagtgcaaaggcgagaagggctct
gtgtgcggggctgtggaccggctctccgtgtaccgtgtggacgagctgcagccgggctcc
aggaagcggcggaccgccacctaccgcggatgcttccgactgccagagaacatcacacat
gccttccccagctccctgatacaggccaatgtgaccgtggggacttgctcgggcttttgt
tcccagaaagtgtaccgagtgtcatggccacctccagataacaccatgtgttcagaacat
catgatgacctgcatttgcatattaaaggactaagttcagcccatagcagccagcatgga
cagaggtacctgagacacagatgggccgccgtgtgccgtggacatccgggctgcttgttt
ctgctgctccctgtgatgaaggtgttggtgtggaatgaggagttccccttggccattctc
aggggctgggaatgctactgtgcttaccctaccccccggttcaacctgcgggatgccatg
gacagctcagtatgtggccaggaccctgaggcacagaggctggcagaatactgtgaggtc
taccagacacctgtgcaagacactcgttgtacagacaggaggttcctgcctaacaaatcc
aaagtgtttgtggctttgtcaagcttcccaggagccgggaacacatgggcacggcacctc
attgagcatgccactggcttctatacagggagctactactttgatggaaccctctacaac
aaagggttcaagggcgaaaaggaccactggcggagccgacgcaccatctgtgtcaaaacc
cacgagagtggcaggagggagattgagatgtttgattcagccatcctgctaatccggaac
ccatacaggtccctggtggcagaattcaacagaaaatgtgccgggcacctgggatatgca
gctgaccgcaactggaagagcaaagagtggccggactttgtcaacagctacgcctcgtgg
tggtcctcgcacgtcctggactggctcaagtacgggaagcggctgctggtggtgcactac
gaggagctgcggcgcagcctggtgcccacgttacgggagatggtggccttcctcaacgtg
tctgtgagcgaggagcggctgctctgcgtggagaacaacaaggagggcagcttccggcgg
cgcggccggcgctcccacgaccctgagcccttcaccccggagatgaaagacttgatcaat
ggctacatccggacggtggaccaagccctgcgtgaccacaactggacggggctgcccagg
gagtatgtgcccagatga

>gi568815581f:5980659_6220658|GENSCAN_predicted_peptide_5|86_aa
MAQAPLALCTLATVALLMAMKMQTWNFTEVTTTGNQFWEQKGSRQSLESKLRQSKYKRRK
RRRIIVLQAASSVEELSLHPLDDSHQ

>gi568815581f:5980659_6220658|GENSCAN_predicted_CDS_5|261_bp
atggcccaggccccactggctctctgcacccttgccacggttgccctcctaatggctatg
aagatgcaaacttggaactttactgaggtgaccacaacaggcaaccaattctgggaacag
aagggcagcaggcagtctttggagagcaagctgaggcaatcgaagtataagagaaggaaa
aggaggaggattatcgtcctgcaagccgcctcctccgtggaggagctctcacttcaccct
ttggacgacagccaccaatga

>gi568815581f:5980659_6220658|GENSCAN_predicted_peptide_6|204_aa
MVEGKEEQVTSYMDGSRPAGRGLCATPHSIPISWLKEAKRSGRGESRYSRHGSSNHSKPE
KREATRQAWIIPSLSTNLPTHLHLYPHSERCELAHVPVPVAVAATKKHIKIQPDFGLELL
GARTSSEEHLGQVMLSQVPAVCELEKRLPVSADVDGAAQNGHFIPSSEAPSCYSCKAKSQ
GHFLSDEDPLPDPVAGQLYNSAYS

>gi568815581f:5980659_6220658|GENSCAN_predicted_CDS_6|615_bp
atggtggaaggcaaggaggagcaagtcacctcttacatggatggcagcagacctgctgga
cgtggcctgtgtgctacaccccacagtatccccatctcctggctgaaggaagcaaagagg
agtggtcggggtgagagccgctacagccgccacggcagcagcaaccacagcaaaccagag
aaaagagaagcaaccagacaagcctggatcatcccctctctctccacaaatcttccgact
cacctgcatctatacccacattctgaaagatgtgaattggcccatgtccctgtgcctgtg
gctgtggcagccacgaagaaacacattaaaatccagcctgactttggacttgagcttttg
ggtgctagaacatcttcagaagagcacctggggcaggtaatgctgtctcaggtccctgct
gtgtgtgaactggagaagcggcttccagtttctgctgacgtagatggtgctgctcagaat
ggccacttcatcccctcttctgaggccccttcctgttattcatgcaaagcaaaaagccag
ggtcacttccttagtgacgaggacccactgccagacccagtggcagggcagctatacaac
tctgcatattcataa

>gi568815581f:5980659_6220658|GENSCAN_predicted_peptide_7|287_aa
MGLASLQRHQRMSWCSPWIGSILATITITITNSITTTSFATISITTTSITTTPITITPIT
TTPITITTTPITITPIITTPITTTTITTTSITTTPITTTTSITTTSITTTTTTSITTTSI
TTTPITTTTITTTSITTTPITTTTSTTTTSITTTSITITTISITTTPITTTTTTSISTTP
ITTISITTTSITITTTSITTTSITTTSITTTSITTTSITTTSITTTSNNTTSISSCLAEA
IFFLALNFCHGPIKDENRNFSPPGSVGTSFLPLSLPPRSFLPRGEVS

>gi568815581f:5980659_6220658|GENSCAN_predicted_CDS_7|864_bp
atgggactggccagcctgcaaagacaccagaggatgtcttggtgtagcccgtggatagga
agcatccttgccaccatcaccatcaccatcaccaactccatcaccaccacctccttcgcc
accatctccatcaccaccacctccatcaccaccacccccatcaccatcacccccatcacc
accacccccatcaccatcaccaccacccccatcaccatcacccccatcatcaccaccccc
atcaccaccaccaccatcaccaccacctccatcaccaccacccccatcaccaccaccacc
tccatcaccaccacctccatcaccaccaccaccaccacctccatcaccaccacctccatc
accaccacccccatcaccaccaccaccatcaccaccacctccatcaccaccacccccatc
accaccaccacctccaccaccaccacctccatcaccaccacctccatcaccatcaccacc
atctccatcaccaccacccccatcaccaccaccaccaccacctccatcagcaccaccccc
atcaccaccatctccatcaccaccacctccatcaccatcaccaccacctccatcaccacc
acctccatcaccaccacctccatcaccaccacctccatcaccaccacctccatcaccacc
acctccatcaccaccacctccaacaacaccacctccatcagctcctgtctagcagaagcc
attttctttcttgccctaaatttctgccatggccccattaaagatgaaaataggaatttc
agccctcctggaagtgtgggcacatcgtttttacccttgtctctaccccctcgctcattc
ctccctaggggagaagtgtcttga

>gi568815581f:5980659_6220658|GENSCAN_predicted_peptide_8|61_aa
MENDRGDFAPETGALKTPSILSYSLDPWTTLFPELSKAWCCNVSLILDGTGKRLVTTILY
S

>gi568815581f:5980659_6220658|GENSCAN_predicted_CDS_8|186_bp
atggaaaatgaccggggtgactttgctcctgaaactggagccctgaaaacaccgtccatt
ctctcctactcactggatccttggacaactctctttcccgagctttccaaagcctggtgc
tgcaatgtatctttaattctagacggcactggcaaacgcttagttaccaccatcctatat
tcataa