GENSCAN 1.0	Date run: 7-Nov-116	Time: 22:26:39

Sequence gi568815578r:19723453_19923943 : 200491 bp : 43.74% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -     47     42    6                               1.05
 1.02 Term -   2609   2529   81  2  0   33   46   100 0.274  -2.01
 1.01 Init -   5435   5400   36  2  0   84   98    35 0.309   4.12
 1.00 Prom -  26314  26275   40                               0.34

 2.00 Prom +  29329  29368   40                              -4.86
 2.01 Init +  30921  31038  118  2  1   84  109    93 0.920  11.37
 2.02 Intr +  33486  33569   84  1  0  134   35    54 0.905   4.49
 2.03 Intr +  33977  34285  309  0  0    9   57   169 0.687   2.18
 2.04 Intr +  46040  46065   26  0  2   63   82    56 0.733   0.24
 2.05 Intr +  47214  47390  177  2  0  106   72    43 0.865   4.62
 2.06 Intr +  47824  47925  102  0  0   89   76    39 0.346   3.17
 2.07 Intr +  59383  59512  130  0  1   55   79   116 0.661   7.67
 2.08 Term +  85722  86176  455  1  2   62   43   206 0.670   8.82
 2.09 PlyA +  89178  89183    6                               1.05

 3.02 PlyA -  90365  90360    6                               1.05
 3.01 Sngl - 100491  99994  498  0  0   90   47   488 0.946  40.95
 3.00 Prom - 115006 114967   40                              -4.96

 4.03 PlyA - 116639 116634    6                               1.05
 4.02 Term - 121289 121116  174  2  0  -45   33   550 0.997  33.96
 4.01 Init - 122352 122236  117  0  0   71  -33   107 0.191  -2.90
 4.00 Prom - 129918 129879   40                              -2.26

 5.02 PlyA - 130675 130670    6                               1.05
 5.01 Sngl - 132088 131054 1035  1  0   58   37   385 0.898  27.52
 5.00 Prom - 134017 133978   40                              -6.46

 6.00 Prom + 137498 137537   40                              -6.56
 6.01 Init + 144555 144642   88  2  1   73   76    75 0.554   5.61
 6.02 Term + 151167 151249   83  1  2  129   47    46 0.492   2.46
 6.03 PlyA + 152188 152193    6                               1.05

 7.03 PlyA - 152755 152750    6                               1.05
 7.02 Term - 193486 193320  167  2  2   96   38    65 0.805   0.38
 7.01 Intr - 198321 198184  138  1  0  139   72    53 0.959   9.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  60716  60816  101  0  2   22   48   137 0.837   1.39

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:19723453_19923943|GENSCAN_predicted_peptide_1|38_aa
MSDSVSMRIQGTFSGISYWIDCPNKPEDVTSPNSAFDK

>gi568815578r:19723453_19923943|GENSCAN_predicted_CDS_1|117_bp
atgagtgactctgttagcatgcgcatccaaggcacattttctggaatcagttactggatt
gattgtcccaacaaaccagaagatgtgacatcaccaaattcagcttttgataaatag

>gi568815578r:19723453_19923943|GENSCAN_predicted_peptide_2|466_aa
MAYGASEAIGQHQSSAAKPRRSQSESLGPEFQGLGEWLPGLQLPQVRETFALFTDISQAL
ELRLAPAGWMLAGNRPRSSGVPACLRGLQLGLRRAPSADPNRVPGGVGGPLASEAEAKQP
AQASRAYPTLENLPRNQEPQPAQASRVGPFQPLQQIPGSVVGRRGSLWERVIIIDPPSLQ
EFHVFTYNLYVKKTKTCFSSALTTITNTEAFCDQMCGVFPHTPSSGHQLGILQLNSHRGL
PAVNQLISMQKHHFGDSKDFWSCMAGNRVKDQRCIDFMAIQQVFDGLTASPAYLYAGCPA
QACNRRSLECLAALDTGAEDQGKIADDLNDSSDHQQPTELPNQEIVRWKQRLQAKKAEAG
QGRNETVINVGVKRLTSPVHTPAQRPASAGGWWRPCGGWGLGSGGEHGSRIWQSPSVPAT
SPLKSVDDDGDLGASFECPEGETGEADTQLWEEIKGIKNSQHKSKV

>gi568815578r:19723453_19923943|GENSCAN_predicted_CDS_2|1401_bp
atggcctacggggcttccgaggcgatcgggcagcatcagtcttcagccgctaagccgaga
aggagtcagtcagagagcctcgggccagagttccaggggctcggggagtggctgccaggc
cttcagctcccccaggtgagggagacctttgctttgttcaccgacatatcccaagcatta
gaactgcgcctggcaccagcaggctggatgctggctgggaaccggccccgcagctccggc
gtccccgcctgcctgcgcggccttcagctggggttacggagggctcccagcgccgatcca
aaccgcgttccaggcggggtgggcgggcccctggccagcgaggccgaagcaaaacaaccc
gcccaggcgtccagagcgtatccaaccctggagaacctgccccggaaccaagagccgcag
ccagctcaggcctctcgagtggggcctttccagcccctgcaacagatcccgggatctgta
gttgggcggaggggctcgttatgggaaagggtcatcatcatcgaccctccatctctgcaa
gaattccatgttttcacgtataacctgtacgtgaaaaaaacaaaaacgtgtttttcctct
gctctcacaacaatcaccaacacagaagccttctgtgaccaaatgtgtggggtttttccc
cacacaccaagtagtggacaccagctgggcatcctccagctcaattcccacaggggactg
ccagctgtcaatcagctcattagcatgcagaaacatcactttggagattctaaggatttt
tggagttgtatggcaggaaacagggttaaagaccagagatgtattgacttcatggccatt
caacaggtgtttgacggtttgactgcatcccctgcctacctgtatgctggctgtccagct
caggcctgcaacaggaggagcctagaatgcctagcagctttggacactggggcagaagat
caggggaaaattgcggatgatctgaatgatagcagtgaccaccagcaacctacagagctg
cccaaccaggaaattgtcagatggaagcaacgcttgcaggcaaagaaggctgaagcaggc
caaggcaggaatgaaacagtcataaatgttggagtgaagcgactgacctccccggtccac
accccggcccagcgtcctgcatctgctggaggatggtggagaccatgtggaggatgggga
cttggcagtggaggagaacatggaagcaggatatggcagtcccctagcgtcccagccaca
agtcctctgaagagtgtggatgacgatggtgacttaggtgcatccttcgaatgtcctgag
ggggagactggagaagcagacacgcaattgtgggaagaaatcaaaggcataaagaactca
cagcacaagtccaaagtctga

>gi568815578r:19723453_19923943|GENSCAN_predicted_peptide_3|165_aa
MLPKFDPNEIKVVYLRCTGGEVGATSALAPKIGPLGLSPKKVGDDIAKAMGDWKGLRITV
KLTIQNRQAQIEVVPSASAMIIKALKEPPRDRKKQKNIKHSGNITFDEIVNIARQMRHQS
LARELSGTTKEILGTAQSVGCNVDGRHPHDIIDDINSGAVECPAS

>gi568815578r:19723453_19923943|GENSCAN_predicted_CDS_3|498_bp
atgctgccgaagttcgacccgaacgagatcaaagtcgtatacctgaggtgcaccggaggt
gaagtcggtgccacttctgccctggcccccaagatcggccccctgggtctgtctccaaaa
aaggttggtgatgacattgccaaggcaatgggtgactggaagggcctgaggattacagtg
aaactaaccattcagaacagacaggcccagattgaggtggtgccttctgcctctgccatg
atcatcaaagccctcaaggaaccaccaagagacagaaagaaacagaaaaacattaaacac
agtgggaatatcacttttgatgagatcgtcaacattgctcgacagatgcggcaccaatcc
ttagccagagaactctctggaaccactaaagagatcctggggactgcccagtctgtgggc
tgtaatgttgatggccgccaccctcatgacatcatagatgacatcaacagtggtgctgta
gaatgcccagctagttaa

>gi568815578r:19723453_19923943|GENSCAN_predicted_peptide_4|96_aa
MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRKKKKKKKKKKKKKKRKRKRKK
KKKEEEEEEEEEEEEEEEEEEEEEEEEEAAAAAAAL

>gi568815578r:19723453_19923943|GENSCAN_predicted_CDS_4|291_bp
atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaatagac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagaaag
aagaagaagaagaagaagaagaagaagaagaagaagaagaggaagaggaagaggaagaag
aagaagaaggaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaagaagaggaagaagcagcagcagcagcagcagctctctag

>gi568815578r:19723453_19923943|GENSCAN_predicted_peptide_5|344_aa
MANVPTRKSPGPDGFTTEFYQRYEEELAPFLLKLFQSIEKEGILPNSFYEASIILIPKPD
RDTTKKVNFRQISLMNIDAKVLNNILANRIQQHIKKLIHQDQVGFIPGMQGWFNIRKSIN
VIQHINRTKDKNHMIISIDAEKAFDEIQQPFMPKTLNKLGIDGTYLKIIRAIYDKPTANI
ILNGQKLEAFPLKTGTRQGCPLSPLLFNIALEVLARAIRQEKEIKGIQLGKEEVKLSLFA
DDMIIYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTI
ASKRTKYLGIQFIRDVKDLFKENYKPLLNETKEDTNKWKNIPCS

>gi568815578r:19723453_19923943|GENSCAN_predicted_CDS_5|1035_bp
atggcaaatgtaccaaccagaaaaagtccaggaccagacggattcacaactgaattctac
cagaggtatgaggaggagctggcaccattccttctgaaactattccaatcaatagaaaaa
gagggaatcctccctaattcattttatgaggccagcatcatcctgataccaaagcctgac
agagacacaacaaaaaaagtgaattttagacaaatatccctgatgaacattgatgcaaaa
gtcctcaataacatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccaa
gatcaagtgggcttcatccctgggatgcaaggctggttcaacatacgcaaatcaataaat
gtaattcagcatataaacagaaccaaagacaaaaaccacatgattatctcaatagatgca
gaaaaggcctttgacgaaattcaacagcccttcatgccaaaaactctcaataaattaggt
attgatgggacgtatctcaaaataataagagctatttatgacaaacccacagccaatatc
atactgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgc
cctctctcaccactcctattcaacatagccttggaagttctggccagggcaatcaggcag
gagaaagaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgca
gatgacatgattatatatttagaaaaccccatcgtctcagcccaaaatctccttaagctg
atcagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattc
ttatacaccaataacagacaaacagagagccaaatcatgagtgaactaccattcactatt
gcttcaaagagaacaaaatacctaggaatccaatttataagggatgtgaaggacctcttc
aaggagaactacaaaccactgctcaacgaaacaaaagaggacacaaacaaatggaagaac
attccatgctcatag

>gi568815578r:19723453_19923943|GENSCAN_predicted_peptide_6|56_aa
MSGVGHAGAQRAEPSSTAGTDTKPGPKEALFPVTTSEGNNLHSNASLRVGFQGNGD

>gi568815578r:19723453_19923943|GENSCAN_predicted_CDS_6|171_bp
atgtctggcgtggggcatgctggtgcacaaagagccgaaccaagctccacagcaggaact
gacaccaagccaggccccaaggaggccttgtttcctgtgactacctctgaaggaaacaat
ttgcacagcaatgcttctctcagagttggcttccaggggaatggagactaa

>gi568815578r:19723453_19923943|GENSCAN_predicted_peptide_7|101_aa
XSNPHFTMRFSWEHFLIATFTQFFGSGPVSGEPDLGHPQPSCLAPLDAYEFQMHRWSQDL
YIKQHLPKWFPPNTSSVGCEQARFQYKHYCSNNSGSTLNIR

>gi568815578r:19723453_19923943|GENSCAN_predicted_CDS_7|306_bp
ngctccaacccccacttcaccatgcgtttctcttgggagcacttcctgatagccactttc
acacagttcttcggctctgggcctgtttctggggaacccgacctaggacacccccagcct
tcctgcttggcgccgctcgatgcctatgaatttcaaatgcacagatggagccaggatctg
tatatcaagcagcatctccccaagtggtttccacccaacacaagcagtgtaggatgtgaa
caggcacgatttcaatacaaacactactgctcgaacaacagtggaagcacattaaacatt
cgataa