GENSCAN 1.0    Date run: 8-Nov-116    Time: 10:01:31

Sequence gi568815597f:9897259_10115829 : 218571 bp : 45.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4508   4676  169  2  1  103   82    88 0.726   9.65
 1.02 Intr +   6105   6147   43  2  1   36   80    43 0.275  -3.89
 1.03 Term +   8533   8609   77  2  2   82   42   120 0.477   4.80
 1.04 PlyA +   9316   9321    6                               1.05

 2.09 PlyA -   9895   9890    6                               1.05
 2.08 Term -  25100  24997  104  0  2   85   42   101 0.887   3.64
 2.07 Intr -  34714  34633   82  1  1   56   97    80 0.981   4.91
 2.06 Intr -  37602  37504   99  0  0  103   83    67 0.990   8.01
 2.05 Intr -  38369  38234  136  1  1   57  119   124 0.804  12.97
 2.04 Intr -  46303  46091  213  0  0   60   96   181 0.828  13.93
 2.03 Intr -  55211  55143   69  1  0  136   38    93 0.520   7.40
 2.02 Intr -  63551  63456   96  1  0  115   87    22 0.418   3.92
 2.01 Init -  64067  64057   11  2  2   67   86    11 0.298  -1.25
 2.00 Prom -  64527  64488   40                              -5.46

 3.00 Prom +  66037  66076   40                              -5.86
 3.01 Init +  74816  74930  115  1  1   67   91    79 0.264   6.49
 3.02 Intr +  78334  78517  184  2  1   53   63   244 0.998  17.25
 3.03 Intr +  83773  83912  140  1  2   66   76    42 0.994   0.91
 3.04 Term +  85043  85443  401  0  2   68   49   313 0.969  20.68
 3.05 PlyA +  87286  87291    6                               1.05

 4.00 Prom +  99710  99749   40                              -4.56
 4.01 Init + 100001 100073   73  1  1   95   82   183 0.776  19.63
 4.02 Intr + 110312 110490  179  0  2   41   93   244 0.784  19.84
 4.03 Term + 151954 152076  123  0  0   47   36   164 0.709   5.48
 4.04 PlyA + 152322 152327    6                              -0.45

 5.08 PlyA - 153298 153293    6                               1.05
 5.07 Term - 157370 157182  189  2  0  -36   48   183 0.093  -0.65
 5.06 Intr - 161729 161469  261  2  0  -11   82   156 0.248   2.78
 5.05 Intr - 162376 162083  294  1  0   43   49   147 0.409   3.51
 5.04 Intr - 163138 163039  100  0  1   81  103     7 0.692   1.61
 5.03 Intr - 163327 163185  143  1  2   34  111    84 0.947   4.45
 5.02 Intr - 165390 165304   87  0  0  106   86    25 0.839   4.27
 5.01 Init - 166832 166683  150  2  0   56  110    87 0.903   7.74
 5.00 Prom - 167825 167786   40                              -9.16

 6.00 Prom + 167881 167920   40                              -4.46
 6.01 Init + 172833 172835    3  2  0  108   81     0 0.563   1.30
 6.02 Intr + 174845 174956  112  1  1   99  115     3 0.563   4.05
 6.03 Intr + 198203 198338  136  0  1   88   93    49 0.404   5.03
 6.04 Intr + 203850 203937   88  0  1   31   81   132 0.400   6.77
 6.05 Intr + 205690 205834  145  0  1   88   97    86 0.985   9.36
 6.06 Intr + 208258 208486  229  2  1   76   68   141 0.992   7.83
 6.07 Intr + 209062 209325  264  1  0   88   36    87 0.494   0.13
 6.08 Intr + 209964 210116  153  0  0   62   69   282 0.722  22.89
 6.09 Term + 211806 211929  124  0  1   89   55    13 0.280  -4.04
 6.10 PlyA + 213245 213250    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:9897259_10115829|GENSCAN_predicted_peptide_1|96_aa
XPCLPNQMLDAAAPPLSLKGQLISTGSATYSDMTRAGENNIAQGYASLSWHPPLTPNKFL
HYNTTMKNGATEDKLHKSMPLKDRVLGYVHSIRNSA

>gi568815597f:9897259_10115829|GENSCAN_predicted_CDS_1|291_bp
nngccttgccttcctaatcagatgctggatgctgctgccccgcccctcagcttgaagggc
cagctgattagcaccgggagtgccacatacagtgacatgacaagggctggtgaaaataac
atcgcacaaggatatgcctcgctgtcctggcaccctcctctcactccaaataagtttcta
cactacaacacaaccatgaaaaatggagccacagaagataaacttcataagagcatgcct
cttaaagaccgtgttcttggctacgtccatagcatccgcaacagtgcctag

>gi568815597f:9897259_10115829|GENSCAN_predicted_peptide_2|269_aa
MEEGCSGPSVTMIHFCLAHTAYRLHCWLLRTHQLRLFYCAVCSDGAIATGTEVDVFCIWT
KSDQFLRGSLRGAYRALTVRRPECYRRSPAARTPASGTSRSHSHWFQGAGSACVQRKCSP
GAVRQASALREELDTDEYEETKKETLEQLSEFNDSLKKIMSGNMTLVDELSGMQLAIQAA
ISQAFKTPEVIRLFAKKQPGQLRTRLAELTADDEAFLSANAGAILSQFEKVSTDLGRPPS
YMNYLLDSHPSKNIDVPSKISFLLKIQDL

>gi568815597f:9897259_10115829|GENSCAN_predicted_CDS_2|810_bp
atggaagaagggtgctctgggcccagtgtcaccatgattcacttttgtttggcccacacg
gcctatcgcttgcactgctggctgctgagaactcaccagctcaggctgttttactgtgca
gtttgttcagatggagccattgccacgggcactgaagtggatgtcttctgcatttggaca
aaaagcgaccagtttctcaggggctccttgcggggagcttaccgcgccctcactgtccgc
cggcccgagtgctaccggagatcaccagcggcccggacgccagcgagtggaacatctcgg
tcgcactctcattggttccagggggccggaagtgcgtgcgtccagcggaagtgctccccc
ggcgcggtccgccaggccagtgccctcagagaggaacttgatacagatgaatatgaagaa
accaaaaaggaaactctggagcaactaagtgaatttaatgattcactaaagaaaattatg
tctggaaatatgactttggtagatgaactaagtggaatgcagctggctattcaggcagct
atcagccaggcctttaaaaccccagaggtcatcagattgtttgcaaagaaacaaccaggt
cagcttcggacaaggttagcagagctgactgcagatgatgaggccttcttgtcagcaaat
gcaggtgctatactcagccagtttgagaaagtctctacagaccttggacggcccccaagt
tacatgaactacctgctagactcacaccccagcaaaaatattgatgtccccagcaagatt
agcttcctgttaaagatccaggacctgtga

>gi568815597f:9897259_10115829|GENSCAN_predicted_peptide_3|279_aa
MENSEKTEVVLLACGSFNPITNMHLRLFELAKDYMNGTGRYTVVKGIISPVGDAYKKKGL
IPAYHRVIMAELATKNSKWVEVDTWESLQKEWKETLKVLRHHQEKLEASDCDHQQNSPTL
ERPGRKRKWTETQDSSQKKSLEPKTKAVPKVKLLCGADLLESFAVPNLWKSEDITQIVAN
YGLICVTRAGNDAQKFIYESDVLWKHRSNIHVVNEWIANDISSTKIRRALRRGQSIRYLV
PDLVQEYIEKHNLYSSESEDRNAGVILAPLQRNTAEAKT

>gi568815597f:9897259_10115829|GENSCAN_predicted_CDS_3|840_bp
atggaaaattccgagaagactgaagtggttctccttgcttgtggttcattcaatcccatc
accaacatgcacctcaggttgtttgagctggccaaggactacatgaatggaacaggaagg
tacacagttgtcaaaggcatcatctctcctgttggtgatgcctacaagaagaaaggactc
attcctgcctatcaccgggtcatcatggcagaacttgctaccaagaattctaaatgggtg
gaagttgatacatgggaaagtcttcagaaggagtggaaagagactctgaaggtgctaaga
caccatcaagagaaattggaggctagtgactgtgatcaccagcagaactcacctactcta
gaaaggcctggaaggaagaggaagtggactgaaacacaagattctagtcaaaagaaatcc
ctagagccaaaaacaaaagctgtgccaaaggtcaagctgctgtgtggggcagatttattg
gagtcctttgctgttcccaatttgtggaagagtgaagacatcacccaaatcgtggccaac
tatgggctcatatgtgttactcgggctggaaatgatgctcagaagtttatctatgaatcg
gatgtgctgtggaaacaccggagcaacattcacgtggtgaatgaatggatcgctaatgac
atctcatccacaaaaatccggagagccctcagaaggggccagagcattcgctacttggta
ccagatcttgtccaagaatacattgaaaagcataatttgtacagctctgagagtgaagac
aggaatgctggggtcatcctggcccctttgcagagaaacactgcagaagctaagacatag

>gi568815597f:9897259_10115829|GENSCAN_predicted_peptide_4|124_aa
MPADLSGTWTLLSSDNFEGYMLALGIDFATRKIAKLLKPQKVIEQNGDSFTIHTNSSLRN
YFVKFKVGEEFDEDNRGLDNRKCKVLRADYVEMLLEYDEDLVFTFSDLRLQLGTQERNQL
DFKC

>gi568815597f:9897259_10115829|GENSCAN_predicted_CDS_4|375_bp
atgcccgccgacctcagcggtacttggaccctgctcagcagcgacaacttcgagggctac
atgctggccctaggtattgactttgccactcgtaaaatagccaagttgctgaagccacag
aaagtgattgagcagaatggggattcttttaccatccacacgaacagcagcctaaggaac
tactttgtgaaatttaaagttggagaagaatttgatgaagataacagaggcctggacaac
agaaaatgcaaggtactgagggctgactatgtggagatgctgctcgaatatgacgaagac
ctggtctttaccttcagtgacctcaggctgcagctggggacacaggaacgcaaccagcta
gacttcaagtgctaa

>gi568815597f:9897259_10115829|GENSCAN_predicted_peptide_5|407_aa
MSVYGHGEAQTRKTSCNKLMTSCEYHPKTLNLFVKARKTSFEGFKQGNNQERFKVINKKC
IKAAGTAAYACSPNTLGAKENAIHKHSGLIRTFWNQLSWGSQRGNTVMGCITTFQSMSDH
QILLVVRYTKTHCYCTTAHRIQYSYMQYRFVAQEQYLGHTPRFHGFMHSLFVSWELHELN
GLQAPPVQFIDNGNTSGQAQLHDGPFRESLPRLDCARKAAARSLNQHLFPLLDLGNYFFI
PKGQSSDNSVLQTLTEEQAQATAPSRHADLYLALMVLDNSFCFVEASQTLIVPLQEVHQV
ITTGSHIWSVASNTVQGVGIAVFRPEVTQMSKSQPAFLSASQKAQRINMHIEHIKRSKGQ
HHFLKLMKENDQKKKKAKEKGTREAHCVRTNGKKPELLEPITYEFMA

>gi568815597f:9897259_10115829|GENSCAN_predicted_CDS_5|1224_bp
atgagtgtgtatggacatggtgaggcacagacaagaaagacaagctgcaacaagctcatg
acgagctgtgaataccatcctaagacactgaacttgttcgtgaaggcaagaaagacatct
tttgaaggatttaagcaggggaataaccaggaaagatttaaagtgattaacaaaaaatgc
attaaggcggccggtacagcggcttacgcctgcagccccaacactttgggagccaaggaa
aatgccatacataaacattcaggactcatccgaaccttctggaaccagctctcctgggga
agtcagagaggaaatacagtcatgggctgcataacgacatttcagtcaatgtcagatcac
cagatcttactggtggtccgatacacaaaaactcactgttattgtacaactgcccacagg
attcagtatagttacatgcagtacaggtttgtagcccaggagcaatacctaggccatacc
ccacgcttccatggctttatgcacagcctcttcgtctcctgggaactgcatgagctcaat
gggcttcaagctcctcctgttcaattcatagacaatgggaataccagtgggcaggctcag
ctccatgatggccccttccgggagagcctcccaaggctcgactgtgccagaaaagctgct
gccaggagcctcaaccaacacctgtttcccctcctggatctggggaattatttcttcatt
ccaaaagggcagagctctgacaatagtgtccttcagactcttacagaagagcaggcacaa
gctaccgcgcccagccggcatgcagatctttatctggccctcatggttcttgacaacagc
ttctgctttgttgaggccagtcagacccttatagtgccactccaggaggtgcaccaagtc
atcactactgggagccacatctggtcagtggcatccaatactgtccagggggtcggcatt
gctgtcttccgccctgaggtgacacagatgtcaaaatcacagccagcatttctcagtgcc
tcccagaaggctcagagaattaatatgcatattgagcacattaagcgctctaaaggccag
catcatttcctaaaactcatgaaggaaaatgatcagaaaaagaagaaagccaaagagaaa
ggtaccagagaagcacactgtgtgagaaccaatgggaagaagcctgagctgctggaacct
attacctatgaattcatggcataa

>gi568815597f:9897259_10115829|GENSCAN_predicted_peptide_6|417_aa
MRENPPGPPIAASAPGPSQSLGLNVHNMTPATSPIGASGVAHRSQSSEGVSSLSSSPSNS
LETQSQSLSRSQSMDIDGVSCEKSMSQVDVDSGIENMEVDENDRREKRSLSDKEPSSGPE
VSEEQALQLVCKIFRVSWKDRDRDVIFLSSLSAQFKQNPKEVFSDFKDLIGQILMEVLMM
STQTRDENPFASLTATSQPIAAAARSPDRNLLLNTGSNPGTSPMFCSVASFGASSLSSPH
SAASGTAAGSQPSSPRYRPYTVTHPWASSGVSILSSSPSPPALASSPQAVPASSSRQRPS
STGPPLPPASPSATSRRPSSLRISPSMYDNPFSFLFLALSGDSSDEEDEEEDDDDGDGDD
EGGGGGDDFSCVQFGSRRNKMYQVYKNSCKYSWSIVRQMWCQLCPCISPFTPRCSLL

>gi568815597f:9897259_10115829|GENSCAN_predicted_CDS_6|1254_bp
atgagggagaaccctccggggcctcccatagcggcatcagccccaggaccctctcagagt
cttggtctcaatgtccacaacatgaccccagctacctccccaataggtgcatcaggagta
gcccatcgaagccagagcagtgaaggagtcagttctctcagcagctcgccctctaatagc
cttgaaacgcaatctcagtctctctcacgttcccagagcatggatatcgatggtgtctca
tgtgagaaaagcatgtcccaggtggatgtggattcaggaattgaaaacatggaggttgat
gaaaatgatcgaagagaaaagcggagcctcagtgataaggagccttcctcgggccctgaa
gtgtctgaagagcaggccttacagctggtctgtaagatcttccgtgtctcttggaaggac
cgggacagagatgtcatctttctttcttctctttctgcacagtttaagcagaacccaaaa
gaagtattctccgattttaaggacttgattggccagattttaatggaagtgctaatgatg
tccactcagaccagagatgaaaacccatttgccagtctgacagccacatcacagccaatt
gctgcagcagcacggtcaccagacagaaatctcttgctaaacactggctccaatccagga
acaagccccatgttctgcagcgtggcttcctttggtgccagctctttgtctagtcctcac
agtgcagcctctggaactgctgcgggaagccagccttcatccccgcggtatcgcccctac
actgtcactcacccatgggcgtcctcaggcgtctccattctgtcgagctccccaagtccc
cctgccctcgccagtagcccccaagcagtgcccgccagcagttccagacagaggcccagc
agcacgggtccacccctaccacccgcctcacccagtgccacgagcagacgcccctcctcc
ctgaggatctctcctagtatgtacgacaatcctttctccttcctcttcctcgcactttct
ggggacagtagtgatgaagaagatgaagaagaagatgatgatgatggtgatggtgatgat
gaaggtggtggtggtggtgatgatttttcttgtgtccagtttgggtccaggaggaataaa
atgtaccaagtgtataaaaatagctgcaaatactcatggagcattgtgcgccagatgtgg
tgtcagctctgtccatgtatcagcccctttactcctcgctgtagcctgctgtga