GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:18:04 Sequence gi568815597f:180934008_181155262 : 221255 bp : 46.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1108 2663 1556 1 2 113 91 821 0.733 73.83 1.02 Intr + 4548 4704 157 1 1 81 113 104 0.945 11.27 1.03 Intr + 7038 7278 241 0 1 126 117 105 0.998 14.75 1.04 Intr + 10382 10509 128 1 2 91 80 72 0.920 6.18 1.05 Intr + 11296 11480 185 1 2 123 85 242 0.995 26.83 1.06 Intr + 16333 16430 98 2 2 128 33 196 0.077 17.83 1.07 Intr + 26381 26558 178 1 1 75 29 66 0.007 -1.11 1.08 Intr + 30509 30769 261 0 0 115 69 170 0.399 15.26 1.09 Term + 31632 31756 125 1 2 103 42 126 0.942 8.05 1.10 PlyA + 33610 33615 6 1.05 2.03 PlyA - 33862 33857 6 -0.45 2.02 Term - 35639 35333 307 1 1 96 38 141 0.244 4.49 2.01 Init - 36134 36082 53 2 2 55 55 63 0.523 0.33 2.00 Prom - 36709 36670 40 -3.16 3.03 PlyA - 38738 38733 6 1.05 3.02 Term - 42570 42442 129 0 0 69 38 137 0.860 4.98 3.01 Init - 44558 44508 51 2 0 85 83 27 0.831 3.07 3.00 Prom - 47113 47074 40 -3.36 4.05 PlyA - 47343 47338 6 1.05 4.04 Term - 50764 50653 112 0 1 96 42 70 0.339 1.23 4.03 Intr - 54338 54232 107 2 2 44 65 216 0.920 13.91 4.02 Intr - 56102 55977 126 2 0 125 80 152 0.999 18.98 4.01 Init - 65238 65236 3 0 0 68 115 0 0.363 0.70 4.00 Prom - 66463 66424 40 -4.46 5.03 PlyA - 67287 67282 6 1.05 5.02 Term - 71456 71246 211 1 1 82 42 189 0.968 10.47 5.01 Init - 88666 88632 35 1 2 85 96 57 0.237 5.54 5.00 Prom - 97538 97499 40 -1.46 6.00 Prom + 102932 102971 40 -4.86 6.01 Init + 112091 112292 202 1 1 97 44 102 0.701 3.83 6.02 Intr + 115045 115305 261 2 0 115 69 138 0.969 12.06 6.03 Intr + 116004 116279 276 1 0 107 103 246 0.998 25.49 6.04 Intr + 118228 118503 276 2 0 82 86 159 0.754 12.59 6.05 Intr + 119566 119670 105 2 0 81 78 34 0.780 1.89 6.06 Term + 143429 143604 176 0 2 54 49 133 0.052 3.92 6.07 PlyA + 145854 145859 6 1.05 7.00 Prom + 151024 151063 40 -3.66 7.01 Sngl + 154896 155879 984 2 0 46 52 1016 0.623 90.63 7.02 PlyA + 157250 157255 6 -0.45 8.06 PlyA - 157691 157686 6 1.05 8.05 Term - 159056 158995 62 0 2 79 53 70 0.161 0.57 8.04 Intr - 172236 172102 135 1 0 83 90 30 0.352 3.24 8.03 Intr - 179013 178972 42 1 0 117 61 19 0.232 0.31 8.02 Intr - 181360 181242 119 0 2 119 64 53 0.288 6.11 8.01 Init - 187096 187014 83 1 2 42 67 40 0.073 -2.26 8.00 Prom - 190307 190268 40 -2.36 9.05 PlyA - 190849 190844 6 1.05 9.04 Term - 194877 194836 42 0 0 162 42 6 0.642 0.56 9.03 Intr - 196527 196389 139 2 1 40 105 80 0.579 5.37 9.02 Intr - 196631 196581 51 1 0 71 91 39 0.567 0.52 9.01 Init - 197855 197728 128 2 2 110 84 81 0.962 8.92 9.00 Prom - 201702 201663 40 -3.26 10.00 Prom + 201842 201881 40 -6.56 10.01 Init + 205534 205615 82 0 1 90 40 71 0.813 1.81 10.02 Intr + 208199 208341 143 0 2 113 86 77 0.966 10.07 10.03 Term + 208511 208738 228 1 0 51 40 116 0.380 -0.27 10.04 PlyA + 209049 209054 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 16333 16526 194 2 2 128 46 198 0.912 17.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_1|976_aa XDGHRSPARDPRTTPACRDSLQNGHTSDSSSGESSGGHRPRRGPSPSHVRFEDESAREAE FRHLERLQQRQRQVLSTVLQAADQGPLRSKPDLADYINGAPRLRDAGQGTFHRLVGSLDR RGHPAPPAPGSERRCQACGSCIDDPRPAQGKAPPVPRTLQELQAACGMERVLGGLSSPLR LLPAEPRLHMEWIRETHIGDTVCPAEVDSALDSTDNSDNCRTDSEEAGTSQAGWACGRTQ GSSPRLRLRGSRPRGHRWSKKAEAELPWGLQAQQHLPRADDVEVENEVKEGRGHTPEGTL FLREDAKPPDLELKRVSLGPQWQPGPGLGSHQPHPLDSRTPCRTAYATTAPMTPESSGPG GQAQVTESHESLEIVSPSSLQQSHAEPSAPHQAWQPTASLCPEGWAPTPPPSRKTTSPVS HRKAALAGLLRLGDQTEPVGIPRPPSRSAVLRTCELPPSQTQPSRPQVRHPLLALSTNNC NNSAPRGLQEPYGGAVHEGRVERGPCSREPEPPLENSRDGGPQGFLGSADVATINSTGIT LSLSSEESESSKESEGSLQRTGSGSGGHVLSRASAGAGTGPGSPSAAPLDQNKKRSSSIA STLGLKKLFSALGQSSRPKLGKSRSYSVEQLQPAPPGLTSQSRAPSLQSLHPVSPSHQRR KAASFQNLHSLLSSKGNRSSLYLVAGPGDHSAAGRPAKTSPRRALSVEDVGAPSLARTVG RLVEVFPDGTSQLQLQRSPGGTFGFCVASGNGRPDSGIYVQEMADMSMAKLYSGLLGVGD EILEVNGAKFQSRFHIFGFLFSNAPLYWYQFTILVRFYTADKDIPKTGKKKRFSWTYSST WLGRIMAEGTHSLRYFRLGVSDPIHGVPEFISVGYVDSHPITTYDSVTQQKEPRAPWMAE NLVPDHWERYTQLLKGWQQMFRVELKRQQRHYNHSGSHTYQRMIGCELLEDGSTTGFLQY AYDGQNFLIFNKDTLS >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_1|2931_bp nnagatggccacagaagcccagcccgggaccccaggacgacccctgcctgcagagacagc ctccagaacgggcacacgagcgattcctccagcggagagtccagcggtgggcacaggccg aggcggggcccctcgccgtcgcacgtgcgctttgaggatgagtccgcccgcgaagccgag ttccgtcacctggagcggctgcagcagcgccagcgccaggtgctgagcaccgtgttgcag gccgcggaccagggccccctgcgctccaagcccgacctcgccgactacatcaacggggct ccccggctccgggacgcggggcaggggacattccacaggcttgtgggcagcctggaccgc aggggacacccggcaccgccggcaccgggcagcgagaggaggtgccaggcctgcggcagc tgcatcgacgacccgcgccccgcccaggggaaggcgccccccgtccccaggaccctccag gagctccaggctgcctgtgggatggagagggtgctgggtggcctgagctccccactccgg ctccttcctgcagagccccggctccacatggaatggatccgggaaacacacatcggagac accgtgtgccctgcggaggtggactctgccctggacagcacagacaactctgacaactgc aggaccgacagtgaggaggcggggacctctcaggctggctgggcgtgtgggcggacccaa ggcagcagcccgcgactgcgactgcggggctccaggcctcgaggccacaggtggtccaag aaggctgaggcggagctcccttggggccttcaggcccagcaacacctgcctagggctgat gatgtggaggtggaaaatgaggtgaaagagggcagaggacacacgcctgaaggaactcta tttttgagagaagatgccaagcctcctgacctggagttgaagcgggtgtccctgggaccc cagtggcagcctggaccagggctgggaagtcaccagcctcaccctttggattcccggact ccatgcaggacagcctatgccaccaccgcccccatgacgcctgaatcatcggggccagga ggccaggcccaggttacagaaagccacgagtccctggaaattgtctctccttcctccctg caacagagccatgcagagccttctgccccacaccaagcctggcagccaacagcttccttg tgtcctgaaggctgggcgccaacccctcccccttcgaggaaaaccacctcgccagtgtct cacaggaaggcagccctggctggactgctcaggctgggtgaccagacagagcctgtgggt atccctcggcctccttcaagaagcgcggttctcaggacctgtgagctgcccccatcacag acccagcccagccgccctcaggtcaggcacccactgctggccctgtccaccaacaactgc aacaacagcgcacctcgggggctgcaggagccctacgggggagccgtccacgagggtagg gtggagaggggcccctgcagccgggaaccggagccgcccctggagaacagcagagatgga ggaccccagggctttcttggctcagcagatgttgccaccatcaactccacgggcatcacc ctctccctgtcctcagaggagtcagagtccagcaaggaatcagagggaagcctgcagagg acagggtcaggatctggaggacatgtgctgtcaagagcatcagcaggagctggcacagga cccggctccccctcggctgcccctttggaccagaacaagaaaaggagcagcagcatagcc tccaccctggggctgaaaaagctcttctcagccctgggccagagttcccggcccaagctg ggcaagtcccgcagctacagtgtggagcagttgcagcccgccccgcctggcctgacgtca cagtccagggccccatcgttacaatccctgcacccggtgtcaccctctcaccagcgtcgg aaagctgcctcttttcagaacctccattctctgctgagcagcaaggggaaccggtccagc ctctacctggtagcagggccaggggaccacagtgcagctggcaggccggccaagacttca ccacggcgtgccctcagtgtggaggacgtgggtgctcccagcctggctcgcaccgtgggc cgcctggtggaggtgttcccagacggcaccagccagctgcagctgcagcgctccccaggg ggcactttcggcttctgcgtggcctctgggaatgggcgcccagactcagggatctacgtg caggagatggctgacatgagcatggccaagctgtactcagggctgctgggggtgggcgat gagatcctcgaggtgaacggggccaagttccaaagtcgcttccacattttcgggtttctt ttcagcaatgccccactctactggtaccaatttactatattagtccgtttttacactgct gacaaagacatacccaagactgggaagaaaaagaggtttagttggacttacagttccaca tggctggggaggatcatggcggaggggacgcactctctgagatattttcgcctgggcgtt tcggatcccatccatggggtccctgaatttatttcagttgggtatgtggactcgcaccct atcaccacatatgacagtgtcactcagcagaaggagccacgggccccatggatggcagag aacctcgtgcctgatcactgggagaggtacactcagctgctgaagggctggcagcagatg ttcagggtggaactgaagcgccagcagagacactacaatcactcagggtctcacacttac cagagaatgattggctgtgagctgctggaggatggaagcactacaggatttctgcagtat gcatatgatgggcagaatttcctgatcttcaataaagacaccctctcctga >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_2|119_aa MHDQATQRLPEAQCDMNCPFVSSEVWLLPLHLPQTQCHLARASASLAFQLPPSSAATQNL LAQTFIPNSRPGQIQKPLLVQPTRSGEKGERKSSSFLCFPAQGIGKSGWNLAFCFYHFE >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_2|360_bp atgcatgaccaggccactcagcggcttcctgaggcacagtgtgacatgaactgcccattt gtgtcctcagaggtctggctgctgccactgcacctgcctcagacccagtgtcacctcgcc agggcttcagccagcctggccttccagctgccacccagcagcgcagcaactcagaacctt ctggcccagaccttcatacccaactcccgcccagggcaaatacagaagcctctgctggtg cagccaacacggagtggggaaaaaggggagaggaaaagcagcagcttcctgtgcttccca gctcagggcattggtaagagtggatggaatttggcattctgcttctaccattttgaatga >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_3|59_aa MRGFFPIFAENLVKLLECCDGGASGCEFLLHMNRGEEEKLSTCDIAVYSHSYPGNILLH >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_3|180_bp atgaggggatttttcccaatattcgctgagaacctggtcaagctcctggagtgctgtgac ggcggggcctctgggtgcgagttcctcctgcatatgaaccgaggggaggaggagaagctg agcacgtgtgacattgccgtctactcacattcctatcctggaaacatactgctgcactga >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_4|115_aa MALLGDSGSQNWSTGTTDKYGRLDRELQRANSHFIEEQQAQQQLIVEQQDEQLELVSGSI GVLKNMSQRIGGELEEQAVMLEDFSHELESTQSRLDNVMKKLAKVSHMTSGMYGV >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_4|348_bp atggcactgctgggagacagtggcagccagaactggagcactggaacaacagataaatat gggcgtctggaccgagagctccagagagccaattctcatttcattgaggagcagcaggca cagcagcagttgatcgtggaacagcaggatgagcagttggagctggtctctggcagcatc ggggtgctgaagaacatgtcccagcgcatcggaggggagctggaggaacaggcagttatg ttggaagatttctctcacgaattggagagcactcagtcccggctggacaatgtgatgaag aaacttgcaaaagtatctcatatgaccagtggtatgtatggcgtttaa >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_5|81_aa MSMEDPFFVVKGEVQKAVNTAQGLFQRWTELLQDPSTATREEIDWTTNELRNNLRSIEWD LEDLDETINILFCGVCAIRDK >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_5|246_bp atgtccatggaggaccccttctttgtggtgaaaggagaggtacagaaagcagtcaacact gcccagggattgtttcagagatggacagagctcctccaggacccctccacagcaacaagg gaagaaatcgactggaccaccaacgagctgagaaataacctccggagcatagagtgggat ctagaggaccttgatgaaaccatcaatatccttttctgtggtgtctgtgccattcgggat aaatag >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_6|431_aa MPEPPTPSMGLCAARASPMSATPAPRRQSHDHPRAEECGRTAREWQAAPPAAPVGDPLGE ASWAPESGTHSLRYFRLGVSDPIHGVPEFISVGYVDSHPITTYDSVTRQKEPRAPWMAEN LAPDHWERYTQLLRGWQQMFKVELKRLQRHYNHSGSHTYQRMIGCELLEDGSTTGFLQYA YDGQDFLIFNKDTLSWLAVDNVAHTIKQAWEANQHELLYQKNWLEEECIAWLKRFLEYGK DTLQRTEPPLVRVNRKETFPGVTALFCKAHGFYPPEIYMTWMKNGEEIVQEIDYGDILPS GDGTYQAWASIELDPQSSNLYSCHVEHCGVHMVLQVPQESETIPLVMKAVSGSIVLVIVL AGVGVLVWRRRPRVMIGDISGPTEPGLAVPVDKEECERESFIDNEEEEVKVAGEESRKRR EGNREEGQSRQ >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_6|1296_bp atgcctgagcctcccaccccctccatgggcctctgtgcggcccgagcctccccgatgagc gccacccctgctccacggcgccagtcccatgaccacccaagggctgaggagtgcgggcgc acggcgcgggaatggcaggcagctccacctgcagccccagtgggggatccactgggtgaa gccagctgggctcctgagtctgggacgcactctctgagatattttcgcctgggcgtttcg gatcccatccatggggtccctgaatttatttcggttgggtacgtggactcgcaccctatc accacatatgacagtgtcactcggcagaaggagccacgggccccatggatggcagagaac ctcgcgcctgatcactgggagaggtacactcagctgctgaggggctggcagcagatgttc aaggtggaactgaagcgcctacagaggcactacaatcactcagggtctcacacttaccag agaatgattggctgtgagctgctggaggatggaagcaccacaggatttctgcagtatgca tatgacgggcaggatttcctgatcttcaataaagacaccctctcctggctggctgtagat aatgtggctcacaccatcaagcaggcatgggaggccaatcagcatgagttgctgtatcaa aagaattggctggaagaagaatgtattgcctggctaaagagattcctggagtatgggaaa gacaccctacaaagaacagagcccccactggtcagagtaaatcgcaaagaaacttttcca ggggttacagctctcttctgcaaagctcatggcttttaccccccagaaatttacatgaca tggatgaaaaacggggaagaaattgtccaagaaattgattatggagacattcttcccagt ggggatggaacctatcaggcgtgggcatcaattgagcttgatcctcagagcagcaacctt tactcctgtcatgtggagcactgcggtgtccacatggttcttcaggtcccccaggaatca gaaactatccctcttgtgatgaaagctgtctctgggtccattgtccttgtcattgtgctg gctggagttggtgttctagtctggagaagaaggccccgagtcatgataggagacatttca ggacccacagagcctgggctggcagtaccagttgacaaggaggagtgtgagagggagtct ttcatagacaatgaagaggaagaagtaaaagtggcaggcgaggagagcagaaaaaggaga gaaggaaacagggaggaaggacaaagccgacaatga >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_7|327_aa MEFKLEAHRIVSISLGKIYNSRVQRGGIKLHKNLLVSLVLRSARQVYLSDPCPGLYLAGP AGTPAPPPQQQPGEPAAGPPAGWGEPPPPAARASWPETEPQPERSSVSDAPRVGDEVPVA TVTGVGDVFQGGEADATEAAWSRVEGPRQAAAREAEGTAGGWGVFPEVSRAARRPCGCPL GGEDPPGTPAATPRAACCCAPQPAEDEPPAPPAVCPRKRCAAGVGGGPAGCPAPGSTPLK KPRRNLEQPPSGGEDDDAEEMETGNVANLISIFGSSFSGLLRKSPGGGREEEEGEESGPE AAEPGQICCDKPVLRDMNPWSTAIVAF >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_7|984_bp atggagttcaagctggaggctcatcgcatcgtcagcatctctctgggcaagatctacaac tcgcgggtccagcgcggcggcatcaagctgcataagaacctcctggtctcgctggtgctg cgcagcgcccgccaagtctacctgagcgacccgtgccccggcctctacctggccggtccc gctgggaccccggcgccgccaccgcagcagcagcccggggagccggcggccgggccaccc gccggctggggagagccgcccccgcccgccgctcgtgcctcttggccggagaccgagccg cagccggagcgctcctccgtctcagacgcgccgcgggtaggggacgaggtgccggtggcc acggtgactggagtcggggacgtttttcagggcggagaggcggacgcgacggaagctgcc tggagccgcgtggaggggccgcgccaggcggcggccagagaagccgagggtaccgccgga ggctggggcgtcttccccgaggtatctcgtgccgcgcgccgcccctgcggctgcccccta ggcggggaggacccgccgggtacaccggccgcgaccccccgcgctgcctgctgctgcgcg ccgcaaccagcggaggacgagccccccgcgccgcccgcggtgtgccccaggaagcgctgc gcggcgggggtgggcggcggcccagcgggctgcccggcgcccggctcgaccccgctcaag aagccccgccggaacttagagcagccgccgagtggaggagaggacgacgacgcggaggag atggagaccgggaacgtggctaacctcatcagcatcttcggttccagtttctcgggactc ctacggaaaagccccgggggcggcagagaggaagaggagggagaggagagcggtccggaa gccgccgagcccgggcagatctgctgcgataagccggtgctgagagacatgaacccctgg agcacagccatcgtggccttctga >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_8|146_aa MWHCKQSGSSPEGPFNEHNGFAFILTESFTTFLGKSVSTSVESHSQLGEMSVRTVRSAFL DASVVVVRRPEIVLSLFMAMGSQREKKKWKGNREKRQRHQESFWKSLEKGKDRKKMPIGR GEDNTKAYSSLKIELFDTIGKIIRLV >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_8|441_bp atgtggcactgcaaacaatcagggtccagtccagaagggcccttcaacgaacataatggg tttgcatttatcttgacagagagcttcaccacatttttggggaagagcgtttctacctct gttgaatcccactcccagttaggagagatgtcggtgagaactgtccgctcagcatttctg gatgcctctgttgtagttgttagaaggccagaaattgtactctcacttttcatggctatg ggctcccagagagaaaagaaaaagtggaaaggaaaccgtgaaaagaggcagagacatcaa gagagtttctggaagagtcttgaaaagggcaaggacagaaagaagatgccaataggaagg ggagaggacaatacaaaagcttactcctccctgaaaatagagttgtttgatacaattgga aaaatcatccggttggtgtga >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_9|119_aa MACTVAVSDPQHQLVTGALQLPGASKVPVPLGGAECKAKNHKGVLQGLIGTTCRAKPCTS WLSCEPVCRCTRTDISQDLCLLSAVRWPHLVTLLRDLLSRSLPFLKAHKRGQRHQMARC >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_9|360_bp atggcctgcacagtcgcagtctctgaccctcagcaccagctggtcactggggccctgcag ctccccggggcatccaaggttccagtgccactgggaggtgcagaatgtaaggccaagaat cacaaaggggtgctgcaaggactgattgggaccacctgccgtgcaaagccctgcacgtcg tggctcagctgtgagcctgtctgcagatgcacacgaacagacatctcccaagatctctgc ttgctgtcagctgtccgctggccacatctggtgaccttgctcagggatcttctgagccgc agcctgcccttcctcaaggcacacaaaaggggacagaggcaccagatggcccggtgctaa >gi568815597f:180934008_181155262|GENSCAN_predicted_peptide_10|150_aa MVEMLANCVLSEEGLRQWPLLLHDLHEGLLEGAECVGKSKNAYGEGEVKLPFLSLSRALS HYFTGGERRIRNKGPTASDPHGVKWILCQLESELKRLLCGPPQTDPSVLQGSPGVALRCP MLPPAGGSQTQPAPLGVEAQPVPHKWVFIT >gi568815597f:180934008_181155262|GENSCAN_predicted_CDS_10|453_bp atggtggaaatgctggcaaattgtgtcctgtcagaggaagggctgagacagtggcccttg cttctccacgacctgcatgaaggactcctcgaaggggctgagtgtgtgggaaaatctaaa aatgcttatggggagggggaggttaagctccccttcttgtccttgtcaagggctctctcc cactacttcactgggggtgaaaggcgcatccgcaacaagggccccactgcgtcagacccg catggagtaaaatggatcctttgtcagctggagtctgaattgaaacgccttctctgcggc cctccccagaccgaccccagcgttttacaagggagccctggcgttgccttacgctgcccg atgctgccccctgcaggcggctcgcagacacagcccgctccgctgggtgtggaggctcag cctgtcccacacaaatgggttttcatcacctag