GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:26:45 Sequence gi568815592f:159806956_160007930 : 200975 bp : 44.84% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1785 1861 77 1 2 92 84 103 0.371 9.53 1.02 Intr + 2060 2203 144 1 0 93 97 22 0.876 4.08 1.03 Intr + 3071 3140 70 1 1 24 91 40 0.977 -3.35 1.04 Intr + 4746 4831 86 1 2 115 90 75 0.983 9.94 1.05 Intr + 6646 6701 56 0 2 49 94 26 0.720 -2.92 1.06 Intr + 7245 7330 86 0 2 119 23 65 0.503 2.56 1.07 Intr + 11345 11468 124 0 1 57 42 98 0.804 1.74 1.08 Intr + 11600 11699 100 2 1 88 78 92 0.886 8.31 1.09 Intr + 11991 12166 176 2 2 83 33 140 0.894 6.74 1.10 Intr + 12299 12397 99 2 0 65 100 220 0.940 20.13 1.11 Intr + 15552 15801 250 0 1 103 -29 142 0.378 1.44 1.12 Intr + 16503 16670 168 2 0 83 94 9 0.207 1.14 1.13 Intr + 19107 19262 156 2 0 52 40 99 0.353 1.71 1.14 Term + 23570 23629 60 1 0 104 55 26 0.192 -1.30 1.15 PlyA + 24104 24109 6 -0.45 2.03 PlyA - 25500 25495 6 1.05 2.02 Term - 29651 29464 188 0 2 72 48 88 0.163 0.85 2.01 Init - 41018 40910 109 2 1 86 85 92 0.473 9.08 2.00 Prom - 49123 49084 40 -5.06 3.07 PlyA - 50003 49998 6 1.05 3.06 Term - 51606 51488 119 1 2 46 50 150 0.977 5.70 3.05 Intr - 52050 51990 61 0 1 91 95 51 0.732 4.41 3.04 Intr - 64640 64550 91 1 1 109 90 -17 0.074 0.60 3.03 Intr - 79832 79754 79 0 1 85 49 150 0.152 9.41 3.02 Intr - 81732 81575 158 2 2 5 55 136 0.606 1.65 3.01 Init - 82984 82617 368 1 2 53 43 138 0.631 2.30 3.00 Prom - 93579 93540 40 -4.86 4.00 Prom + 96751 96790 40 -5.66 4.01 Sngl + 100001 100978 978 1 0 54 48 994 0.998 88.79 4.02 PlyA + 101108 101113 6 -1.75 5.00 Prom + 103695 103734 40 -3.86 5.01 Init + 109070 109077 8 1 2 101 92 6 0.133 2.36 5.02 Intr + 109698 109780 83 0 2 73 89 41 0.147 1.98 5.03 Intr + 111320 111425 106 0 1 97 107 19 0.228 3.97 5.04 Intr + 111841 111929 89 1 2 113 83 34 0.186 5.01 5.05 Intr + 112575 112816 242 1 2 103 54 56 0.007 0.97 5.06 Intr + 125741 125906 166 1 1 68 70 73 0.111 3.03 5.07 Intr + 129004 129122 119 2 2 96 72 60 0.381 5.38 5.08 Intr + 134698 134794 97 0 1 74 77 57 0.284 2.78 5.09 Intr + 135735 135831 97 1 1 56 110 40 0.222 2.07 5.10 Intr + 139281 139476 196 0 1 117 51 19 0.147 0.52 5.11 Intr + 144625 144767 143 0 2 49 49 88 0.139 0.15 5.12 Term + 145136 145298 163 2 1 111 43 60 0.155 1.21 5.13 PlyA + 147722 147727 6 1.05 6.00 Prom + 154265 154304 40 -3.46 6.01 Init + 162292 162440 149 0 2 103 94 246 0.969 24.26 6.02 Intr + 184229 184368 140 2 2 102 106 74 0.795 10.71 6.03 Term + 191010 191068 59 1 2 81 54 34 0.079 -2.85 6.04 PlyA + 191877 191882 6 1.05 7.00 Prom + 193584 193623 40 -0.96 7.01 Init + 197383 197457 75 0 0 96 66 70 0.445 4.94 7.02 Term + 198628 198648 21 0 0 104 55 16 0.310 -1.79 7.03 PlyA + 199820 199825 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:159806956_160007930|GENSCAN_predicted_peptide_1|550_aa XFQAFEVQLVLRQALPNIWTVLKDEGVVVKKVSKQHRWYLQNTSCDRESCWKENILLSAR GFSVFFQMLVKAQKPLVGHNMMMDLLHLHEKFFRPLPESYDQFKQNIHSLFPVLIDTKSV TKDIWKEMNFPRVSNLSEVYEVLNRGTQSRQSVPSQQDEAREAFGTGMAPCSGLGPSPNH HQQMVTVWKLFFYVATENAFRLASWGSTQPEPQDFIDPVPESSFPQYLDVLAPYVNQVNL IRAGVPKINFSGPDYPSIRPPILILSVKRWPGVSEQQVYHKFQNLCKFDVRRLTRSQFLL LTNKFKDARNILKEYRDHPTLCISLYRYWRHSPNVNCLLQLSSTCFPSFSPLLPGAQPVI RAPWLPSHSSMALGGAAPLDALPKSLVTLFSIALNSSPGPLDVGVQGPGTIRRDSAGPQH DSESTWCCVAVSPGLALHRWAVEFWNLQGPSVSSQASAPHGWRLCCVFRQGFEFVAAAQF LAVYPEGVTRMLLVVVDERQEPALGPGGLQTLVLSCASSRRLEPSALLCPLEEPDSNICL LAIVFLRKTL >gi568815592f:159806956_160007930|GENSCAN_predicted_CDS_1|1653_bp ngcttccaggcctttgaggtccaactggtgctgaggcaggccctccccaacatctggacg gtgctgaaagatgagggggtggtagtgaagaaagtgagtaaacaacatcgttggtatctt cagaacacctcttgtgaccgagagagctgttggaaggaaaatattcttctctcagcaagg ggtttttctgtctttttccaaatgctggtgaaagcccagaagcccttagtgggacataat atgatgatggacctgctgcacctccatgagaagttcttcagacccctcccagaaagctac gatcaatttaagcagaatatccacagcctatttcctgttctcattgataccaagagtgta acaaaggatatctggaaggagatgaatttcccgagggtgtcgaatctttcggaagtctat gaagtcctgaacaggggcacccagagccgccagtctgtccccagtcagcaggacgaggca cgagaggcgtttggaacagggatggcaccctgttctggattgggaccctcaccaaaccat caccagcagatggtgactgtctggaagctcttcttctatgtggccacggagaatgctttt agacttgccagctggggaagcacccaaccagagccccaagacttcatcgaccccgtgccc gagtcatcctttcctcagtaccttgacgtgctggctccttacgtgaaccaagtgaacctc atccgagcgggggtcccaaagatcaatttttctggtccagattatcccagtatccgacct cccatcctcatcctcagcgtcaaaaggtggcctggggtcagcgagcagcaagtctaccat aagtttcagaatctctgcaagtttgatgtcaggcgactcacaagaagccagttcttactc ctgaccaataagtttaaggatgcgcggaacatcctgaaggagtaccgggaccacccgacc ctgtgcatctccctgtaccgctactggaggcactccccaaacgtcaactgcctgctccag ctttcgagcacgtgctttccatcgttctcgcccctgctgcccggggcccagccagtcatt agggctccctggcttccctctcactcctccatggccctgggtggtgcagctcctctggac gctctccccaagtctctggtcactctcttctcgattgccctcaactcttcccctggccct ttagatgttggtgttcaaggacctggcaccatccgccgtgacagtgccgggccccagcat gactctgaaagcacttggtgctgtgtggcagtgtcccctggcctggccctgcacaggtgg gcagtggagttctggaacctgcaggggccctcagtatcctcacaagccagtgctccccat ggctggaggctctgctgtgtttttcgacagggatttgagtttgttgctgctgctcagttt ttggcggtttaccctgagggggttacgaggatgctgctggtggtggtggacgagaggcag gagccggcgctggggccgggtggtctgcagactctcgtcctctcctgcgcctcctcacgt cggttagagccatcggccctcctctgtcccctggaagaaccagattcaaacatttgcctg ctggcaatagtgtttctgaggaagacactgtga >gi568815592f:159806956_160007930|GENSCAN_predicted_peptide_2|98_aa MVESKEEQVPSYTDGSRQRENEEDTKAETPVKTIRSAPRRPWAFQPWPPPRGCSSTYLFL DGKRLLIIVHFSTHDFPHKLGSIQEALLGICHAKFSAL >gi568815592f:159806956_160007930|GENSCAN_predicted_CDS_2|297_bp atggtggaaagcaaggaggagcaagtcccatcttacacggatggcagcaggcaaagagag aatgaggaagatacaaaagcagaaacccctgttaaaaccatcagatctgctcctcggaga ccctgggccttccagccctggcctcccccgcgcggctgctcgagcacgtatctcttcttg gatggaaaacggctccttatcatcgtgcatttttctacccatgacttccctcacaaactg ggcagcatccaggaggctctgctggggatatgccatgccaagttttcagcattgtga >gi568815592f:159806956_160007930|GENSCAN_predicted_peptide_3|291_aa MKFPSDCFRKQEDHSPRAEPNYSCGKPDQQLGPPSGKKKGERRELPLKPAGIYWVGSAEQ QINRPGELGLEVGKVFSLPGSDAPRKEKFSLLKNFSFPEDGQKLKRKQAPSKADCKAEKK EHGSLEKQHPSFCHQTTENHNEGGLVEGLMGAGKKEAYLPTATQQTHKWQYAQVVGTSLE KVRIESQVKMYEHFVDDSLDPTYQRPWYFGPNSSEPASRACTGLVAGSVPLKTGYRKEPM ESSTQKAVKKSPGPEGQPAITTAIASATSTAQGPEGQLAMSTATTSAASTA >gi568815592f:159806956_160007930|GENSCAN_predicted_CDS_3|876_bp atgaagttcccctccgactgttttcgtaagcaggaagaccatagtcccagggcagagcca aactactcttgtgggaagcctgaccagcagctcggccctcccagtgggaagaagaaaggg gagagaagagagcttcccctgaagccagcaggcatctactgggtaggcagtgctgagcag caaatcaacagaccaggtgaactcggtcttgaagttggaaaggtgttctcattacctggt tctgatgctccaaggaaggagaagttcagcctcctgaagaactttagtttcccagaggat ggtcagaaactaaaaaggaaacaggcaccatctaaagcagactgcaaggcagaaaagaag gagcacggttccctagagaaacaacatccctctttctgccatcaaacaacagagaaccat aatgaagggggcctggtggagggccttatgggagcagggaagaaggaagcttatttaccc acggccacgcagcagacccataagtggcagtacgcacaagtggtagggacatccctagag aaagttcgcattgagtcacaggtgaaaatgtacgagcacttcgtggatgactcgctcgat ccgacatatcaaagaccctggtattttgggccaaacagctctgagcctgcttccagggcc tgcacaggccttgtcgctgggtctgtcccactgaagactggctacaggaaagaacccatg gaatcctcaacacaaaaagcagtcaagaaaagcccagggcccgagggccagcctgccatc actactgccattgccagtgccacaagcactgcccagggacctgagggccagcttgccatg agcactgccaccaccagtgctgcaagcactgcctag >gi568815592f:159806956_160007930|GENSCAN_predicted_peptide_4|325_aa MDGSNVTSFVVEEPTNISTGRNASVGNAHRQIPIVHWVIMSISPVGFVENGILLWFLCFR MRRNPFTVYITHLSIADISLLFCIFILSIDYALDYELSSGHYYTIVTLSVTFLFGYNTGL YLLTAISVERCLSVLYPIWYRCHRPKYQSALVCALLWALSCLVTTMEYVMCIDREEESHS RNDCRAVIIFIAILSFLVFTPLMLVSSTILVVKIRKNTWASHSSKLYIVIMVTIIIFLIF AMPMRLLYLLYYEYWSTFGNLHHISLLFSTINSSANPFIYFFVGSSKKKRFKESLKVVLT RAFKDEMQPRRQKDNCNTVTVETVV >gi568815592f:159806956_160007930|GENSCAN_predicted_CDS_4|978_bp atggatgggtcaaacgtgacatcatttgttgttgaggaacccacgaacatctcaactggc aggaacgcctcagtcgggaatgcacatcggcaaatccccatcgtgcactgggtcattatg agcatctccccagtggggtttgttgagaatgggattctcctctggttcctgtgcttccgg atgagaagaaatcccttcactgtctacatcacccacctgtctatcgcagacatctcactg ctcttctgtattttcatcttgtctatcgactatgctttagattatgagctttcttctggc cattactacacaattgtcacattatcagtgacttttctgtttggctacaacacgggcctc tatctgctgacggccattagtgtggagaggtgcctgtcagtcctttaccccatctggtac cgatgccatcgccccaagtaccagtcggcattggtctgtgcccttctgtgggctctttct tgcttggtgaccaccatggagtatgtcatgtgcatcgacagagaagaagagagtcactct cggaatgactgccgagcagtcatcatctttatagccatcctgagcttcctggtcttcacg cccctcatgctggtgtccagcaccatcttggtcgtgaagatccggaagaacacgtgggct tcccattcctccaagctttacatagtcatcatggtcaccatcattatattcctcatcttc gctatgcccatgagactcctttacctgctgtactatgagtattggtcgacctttgggaac ctacaccacatttccctgctcttctccacaatcaacagtagcgccaaccctttcatttac ttctttgtgggaagcagtaagaagaagagattcaaggagtccttaaaagttgttctgacc agggctttcaaagatgaaatgcaacctcggcgccagaaagacaattgtaatacggtcaca gttgagactgtcgtctaa >gi568815592f:159806956_160007930|GENSCAN_predicted_peptide_5|502_aa MGSEWSTGTVASERQRHEANSTTHLKPVFANLLPSWYGLKACVLPDSYVEILTLHVTVFG GGVLGRQEYGEERNEVITLGKWPVRPDEPMFFLSRGFNGRFCIQLHQHCLQTRKEPREWR TLCRDLGMDSREYLEIPTSQVHLTWWLFLLKKDPVVLCPEGHKQHKCSRAEPGSGRIKIF DMDMQEKGMLPGLPYQQPIRTTQWSGSSYDQMAWAAKSGVQKPQTPHHLMERLRLCILAE AALSNGSLPAVPCHMGFFTMATYFMSDFSRPETGPTVLESLVAGAREMGITCRVTPFAKT GRHDQFGQENREKRIRKTLAGHDDSDSADERSPGPSCSLGTKRHDGSCGPSESVPFLGAR ASHPTNTESSWSSFIQTLGSVFHAQCSSGYEKMSLGSVWRESGEPLRDHWPPSYCSHVLA ALIPAPCEASLKPIYQSMYLRTAATQPLRKFTSTSTFIFCSRGILEHLGPSGDGLESLEA QAESTKTSGLSDNRPGILPNSS >gi568815592f:159806956_160007930|GENSCAN_predicted_CDS_5|1509_bp atgggcagtgagtggagcactggcactgttgccagtgagaggcaaagacacgaggccaat tcaacaacccacctcaaacctgtatttgcaaacctcctgcccagctggtatggcctgaaa gcttgtgtactcccagattcatatgttgaaatcctaacccttcatgtgacggtatttgga ggtggagttcttgggagacaagagtatggtgaagaacggaatgaagttataacgcttggg aagtggcctgtgaggcctgatgagcccatgttctttctctcacgtggttttaatggaagg ttctgcattcagctgcaccaacactgcttgcaaacaagaaaagaaccgagagagtggaga acactgtgcagggacttgggtatggattccagggagtatctggaaattcccacctcacag gtccatctcacctggtggttgttcctactgaaaaaggatcccgtggtcctgtgccctgag ggccacaaacagcataaatgttccagagcagagcctggaagtgggaggataaagatattt gacatggacatgcaggagaaagggatgctcccaggcctgccgtatcagcaaccaatcaga actactcaatggagtggcagctcctatgaccagatggcctgggcagctaagtctggagtt caaaaaccccagactccccatcacttgatggagaggttgaggctttgcattcttgctgaa gctgctctcagcaatggcagtttgcctgcagttccttgccatatgggcttcttcaccatg gccacttacttcatgtcagatttttcaaggccagagacaggacctactgtgttagagtct ctggtggcaggtgctcgggagatgggcatcacctgtcgtgtaacaccctttgcaaagacc gggagacatgaccagtttgggcaggaaaatagggaaaagagaatcagaaagaccctcgca ggccatgatgatagtgatagcgcagacgagagatcacctggacccagctgctctcttggc accaagagacatgatggcagctgtggccccagtgaaagcgttccttttctgggagccagg gcctctcaccccacaaatacagaatcctcctggagttcattcatccaaactttaggaagt gtgttccatgcacagtgctcttcaggctacgagaaaatgagcctgggttctgtttggcgg gagtcgggagagccccttcgggaccactggcccccaagctactgtagccatgtccttgcc gccctgatccccgcaccctgcgaagcttctctcaagcccatctaccagagcatgtacctc aggactgctgctacccaaccactcaggaagtttacttccacaagcaccttcatcttctgt tctagggggattttggagcatttggggccctcgggtgatggattggaaagtcttgaggca caggctgagtcaactaaaaccagtggattgtcagacaacaggccaggcatcctccccaac agcagctag >gi568815592f:159806956_160007930|GENSCAN_predicted_peptide_6|115_aa MGAAAGRSPHLGPAPARRPQRSLLLLQLLLLVAAPGSTQAQAAPFPELCSYTWEAVDTKN NVLYKINICGSVDIVQCGPSSAVCMHDLKTRTYHSVETIQTEKPGERDLRPGIEI >gi568815592f:159806956_160007930|GENSCAN_predicted_CDS_6|348_bp atgggggccgccgccggccggagcccccacctggggcccgcgcccgcccgccgcccgcag cgctctctgctcctgctgcagctgctgctgctcgtcgctgccccggggtccacgcaggcc caggccgccccgttccccgagctgtgcagttatacatgggaagctgttgataccaaaaat aatgtactttataaaatcaacatctgtggaagtgtggatattgtccagtgcgggccatca agtgctgtttgtatgcacgacttgaagacacgcacttatcattcagtggagaccatccag actgaaaaacctggtgagagggatttgaggcctgggattgagatctga >gi568815592f:159806956_160007930|GENSCAN_predicted_peptide_7|31_aa MGLLGLGGATWWACRMRLGGLTEQHDVTRTL >gi568815592f:159806956_160007930|GENSCAN_predicted_CDS_7|96_bp atgggcttgcttgggcttggaggtgcgacctggtgggcctgcagaatgcgacttggtggc ctgactgagcagcacgacgtgacccgcacattatga