GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:29:39

Sequence gi568815591r:98285797_98486584 : 200788 bp : 47.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    428    481   54  1  0   92   73    38 0.700   3.57
 1.02 Intr +   6284   6459  176  1  2   80  101    76 0.701   6.84
 1.03 Intr +   9510   9678  169  0  1  106   50    60 0.469   3.95
 1.04 Intr +  10185  10309  125  2  2   82   41    66 0.405   0.68
 1.05 Term +  11179  11284  106  1  1  126   48    61 0.589   3.78
 1.06 PlyA +  12273  12278    6                               1.05

 2.00 Prom +  12490  12529   40                              -6.26
 2.01 Init +  16077  16218  142  2  1   74  111    24 0.727   3.86
 2.02 Term +  16328  16557  230  0  2   42   34   159 0.271   2.69
 2.03 PlyA +  16605  16610    6                               1.05

 3.11 PlyA -  17733  17728    6                               1.05
 3.10 Term -  18298  18239   60  1  0  107   33    10 0.260  -4.80
 3.09 Intr -  18580  18400  181  0  1   -2   76   169 0.340   6.57
 3.08 Intr -  20720  20643   78  1  0   73   63   104 0.508   5.07
 3.07 Intr -  22100  21893  208  0  1   76   88   304 0.854  27.34
 3.06 Intr -  22274  22229   46  2  1   94   56    20 0.528  -2.62
 3.05 Intr -  26468  26301  168  2  0   81   46   139 0.682   9.14
 3.04 Intr -  29816  29664  153  2  0   92   94   107 0.984  11.97
 3.03 Intr -  34333  34262   72  1  0   74  111    56 0.971   6.10
 3.02 Intr -  34502  34441   62  0  2   91   63    98 0.965   6.05
 3.01 Init -  37041  36996   46  0  1   56   53    58 0.344  -0.06
 3.00 Prom -  41190  41151   40                              -4.16

 4.00 Prom +  49630  49669   40                              -0.16
 4.01 Init +  53769  53892  124  2  1  112   38   135 0.968   9.27
 4.02 Term +  55191  55312  122  1  2  136   43   -20 0.433  -3.06
 4.03 PlyA +  56012  56017    6                               1.05

 5.00 Prom +  60338  60377   40                              -2.96
 5.01 Sngl +  63480  63755  276  2  0   39   43   183 0.841   4.28
 5.02 PlyA +  64878  64883    6                               1.05

 6.06 PlyA -  65967  65962    6                               1.05
 6.05 Term -  68775  68711   65  1  2   77   42    65 0.788  -1.15
 6.04 Intr -  69332  69246   87  0  0   70   80   115 0.684   8.84
 6.03 Intr -  76636  76561   76  1  1   90  127    58 0.302   8.99
 6.02 Intr -  93578  93463  116  0  2   76   46   115 0.743   6.27
 6.01 Init -  95283  95238   46  0  1   84   62    59 0.540   3.74
 6.00 Prom -  96858  96819   40                              -6.76

 7.03 PlyA -  96866  96861    6                               1.05
 7.02 Term - 100438  99998  441  1  0  -11   39   478 0.934  28.26
 7.01 Init - 100788 100585  204  0  0  113   81   213 0.825  21.95
 7.00 Prom - 107409 107370   40                              -5.06

 8.06 PlyA - 107561 107556    6                               1.05
 8.05 Term - 108667 108557  111  1  0  102   44     0 0.158  -4.44
 8.04 Intr - 112508 112357  152  0  2   77   89    34 0.262   2.28
 8.03 Intr - 112976 112826  151  2  1   86   77    14 0.303  -0.16
 8.02 Intr - 115321 115006  316  0  1  113  102   206 0.234  20.57
 8.01 Init - 148833 148706  128  0  2   55   62   192 0.363  12.93
 8.00 Prom - 152088 152049   40                              -1.36

 9.04 PlyA - 152920 152915    6                               1.05
 9.03 Term - 158557 158321  237  1  0   89   37    82 0.035  -0.63
 9.02 Intr - 167560 167437  124  0  1   51   90    55 0.440   2.59
 9.01 Init - 168810 168419  392  0  2   86   62   439 0.595  37.35
 9.00 Prom - 180426 180387   40                              -1.16

10.05 PlyA - 180712 180707    6                               1.05
10.04 Term - 185371 185270  102  1  0   72   43    90 0.493   1.18
10.03 Intr - 186647 186589   59  0  2  101  110     3 0.525   2.40
10.02 Intr - 191153 190908  246  0  0   90   34   122 0.227   4.43
10.01 Intr - 198293 198105  189  0  0   72   48    75 0.089   1.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  34209  34049  161  0  2   51   80   106 0.834   4.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_1|209_aa
MAFSLGSSEPCCQPPLSLEKQMVTLLDLAGRSLKHRPRPLAYLHVTFPNLSLIGYYTYGK
AKGEGISHSPSTQQHTRSPGQGIWGQFIVCFSSLEHKATDLPCTSYRALNGRKSDFLRGT
QRFPSRPQRKSLLEQAGHTPMGTETLGQPWRNSPDQKHQTPVNGRVRTSFCADSSGGHLQ
ATGSTATGTQGPAGTACCMSCVPNKTGDH

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_1|630_bp
atggccttctccctgggcagctcagagccctgctgccagccccctctgtccttggagaag
cagatggtaacactgctggacctggctggaaggagcctgaagcatcggccacgtccactg
gcttacctgcatgtaacctttcccaatctcagcctcataggatactacacttatggcaag
gctaaaggagaggggataagtcacagccccagcacccagcaacacacacgatcccctggg
cagggcatctggggccaattcattgtctgcttttcatcactggagcacaaagccactgac
ctgccttgcacatcctacagggctttaaatggccgcaaaagtgacttcctgcggggcact
caacgcttccctagcaggcctcagaggaagtccttgctggaacaagcagggcacacaccc
atgggcaccgagacactcgggcagccctggagaaacagcccagaccagaaacaccagacg
cccgtcaatgggagagtcagaaccagcttctgtgcagacagcagtgggggacacctccaa
gccacaggcagcactgccaccggcacccagggccccgcgggcactgcctgctgcatgtcc
tgtgtccccaacaagacaggagaccactaa

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_2|123_aa
MGQLRWWRLQGTRSPPAGSPSSAQPSRSGPSGTVVCLKLCCPRKWEAVLSKEESVITEYT
DTYNQASSIYPIPCSKNGGDSTFFTKLYPHDKYADNDTFSFSNTEDYPGLAKVSVLSLTM
PEL

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_2|372_bp
atgggccagctgcgctggtggaggctgcagggcaccaggtcaccccctgcaggcagcccc
agctcagcccaaccctccaggtcagggccctcggggacagtggtgtgcttaaagctctgc
tgcccgaggaagtgggaagccgtgctaagtaaggaggaatctgtcattactgagtacact
gatacttacaaccaggcttcttccatttatccaattccctgcagtaagaatggcggcgac
agtactttcttcaccaagctttatccacacgataaatacgctgacaatgacacgttttca
ttttctaacactgaagactatccagggttggccaaagtgtcagtgctctcacttacaatg
ccagagctgtga

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_3|357_aa
MLRLHFGKIPGICEEGHVLIEISSTHKKLNESLDENFKKFHKEIIHELEKKIELDVKYMN
YVETVTSRQSEIQKFIADGCKEALLEEKRRFCFLVDKHCGFANHIHYYHLQSAELLNSKL
PRWQETCVDAIKVPEKIMNMIEEIKTPASTPVSGTPQASPMIERSNVSLFLLGNDQSEEI
GQGTSEDPSLQRSVSVATGLNMMKKQKVKTIFPHTAGSNKTLLSFAQGDVITLLIPEEKD
GWLYGEHDVSKARGWFPSSYTKLLEENETEAVTVPTPSPTPVRSISTVNLSENSSVVIPP
PDYLECLSMGAAADRRADSARTTSTFKAPASKPETAAPGESGVPGEEGGGDLLPLIL

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_3|1074_bp
atgttacgcctgcattttggaaagatccctggcatctgtgaggaaggacatgtcctcata
gagatttcaagtacccacaagaaactcaacgagagtcttgatgaaaattttaaaaaattc
cacaaagagattatccatgagctggagaagaagatagaacttgacgtgaaatatatgaac
tatgtggagaccgttacttctcgtcagagtgaaatccagaaattcattgcagatggttgc
aaagaggctctgcttgaagagaagaggcgcttctgctttctggttgataagcactgtggc
tttgcaaaccacatacattattatcacttacagtctgcagaactactgaattccaagctg
cctcggtggcaggagacctgtgttgatgccatcaaagtgccagagaaaatcatgaatatg
atcgaagaaataaagaccccagcctctacccccgtgtctggaactcctcaggcttcaccc
atgatcgagagaagcaatgtgtccctgttccttctgggcaatgatcaaagcgaggaaata
ggacagggtacttccgaagatcccagtttacagcgatcagtttcggttgcaacgggactg
aacatgatgaagaagcagaaagtgaagaccatcttcccgcacactgcgggctccaacaag
accttactcagctttgcacagggagatgtcatcacgctgctcatccccgaggagaaggat
ggctggctctatggagaacacgacgtgtccaaggcgaggggttggttcccgtcgtcgtac
acgaagttgctggaagaaaatgagacagaagcagtgaccgtgcccacgccaagccccaca
ccagtgagaagcatcagcaccgtgaacttgtctgagaatagcagtgttgtcatcccccca
cccgactacttggaatgcttgtccatgggggcagctgccgacaggagagcagattcggcc
aggacgacatccacctttaaggccccagcgtccaagcccgagaccgcggctcctggggag
agtggtgtccccggggaggagggaggaggggacttgctgcctttgatcctctag

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_4|81_aa
MVCPGMRLAWPGLTRFMILAASIFEYPSLQPVTGATPFAATCFINYNPTFLIILAETFPV
PKGRRKNIHPKIEMRKATFKV

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_4|246_bp
atggtgtgcccgggaatgcggctggcgtggccaggcctcactcgcttcatgattctagcc
gcatccatctttgagtatccctctcttcagcctgtcacgggagctacgccctttgctgct
acctgttttataaattataaccctacctttctaattattctagctgagacctttcctgtt
ccaaaagggagaagaaagaacatccacccaaaaattgagatgagaaaagccaccttcaag
gtctaa

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_5|91_aa
MRQQKCPARGGHIRRPVFLYGSTWRKGREILLRRHKSGAILLRRATPTLRLCDFHYPYPV
TPGVYGWNPSETTISKGTVTANHDTCQPFRH

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_5|276_bp
atgagacaacagaagtgcccggcacgcggtggacacatccggagacccgtgtttctgtat
ggaagcacgtggcgcaagggcagggaaatacttctaagacggcacaagagcggagcaata
cttctaagaagagccacgcccacgctacgcctctgtgatttccactatccttaccccgta
actcctggcgtttatggttggaaccccagtgaaaccaccatttccaaggggactgtgacc
gctaatcatgacacctgccagccattcagacactaa

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_6|129_aa
MAVDGSIGINALQTSAMCSDELVELRVQPAAGSSVGAGISYEIQVSEPAAAENTNVMEQF
NPGLRNLINLGKNYEKAVNAMILAGKAYYDGVAKIGEIATGSPVSTELGREGGYTLGSAR
CRGDLGFCL

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_6|390_bp
atggctgtggacggctccataggcataaatgctttgcagaccagtgcgatgtgttcagat
gagctggtggagctgagggtacagcctgcagctggttcgtccgtgggtgcaggcatctct
tatgaaatacaggtgtcagaacctgctgctgctgagaacactaatgttatggaacagttc
aatcctgggctgcgaaatttaataaacctggggaaaaattatgagaaagctgtaaacgct
atgatcctggcaggaaaagcctactacgatggagtggccaagatcggtgagattgccact
gggtcccccgtgtcaactgaactgggcagagagggcggttacaccctgggcagtgcccgc
tgccgaggggatctgggcttctgtttatag

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_7|214_aa
MAVGKNKRLTKGGKKGAKKKVVDPFSKKDWYDVKAPAMFSIRNIGKTLVTRTQGTKIASD
GLKGRVFETMIEAHVDVKTTDGYLLRLFCVGFTKKRNNQIRKTSYAQHQQVRQIRKKMIE
IMTREVQTNDLKEVVNKLIPDSIGKDIEKACQSIYPLHDVFVRKVKMLKKPKFELGKLME
LHGEGSSSGKATGDETGAKVERADGYEPPVQESV

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_7|645_bp
atggcggttggcaagaacaagcgccttacgaaaggcggcaaaaagggagccaagaagaaa
gtggttgatccattttctaagaaagattggtatgatgtgaaagcacctgctatgttcagt
ataagaaatattggaaagacgctcgtcaccaggacccaaggaaccaaaattgcatctgat
ggtctcaagggtcgtgtgtttgaaacaatgattgaagctcacgttgatgtcaagactacc
gatggttacttgcttcgtctgttctgtgttggttttactaaaaaacgcaacaatcagata
cggaagacctcttatgctcagcaccaacaggtccgccaaatccggaagaagatgatcgaa
atcatgacccgagaggtgcagacaaatgacttgaaagaagtggtcaataaattgattcca
gacagcattggaaaagacatagaaaaggcttgccagtctatttatcctctccatgatgtc
ttcgttagaaaagtaaaaatgctgaagaagcccaagtttgaattgggaaagctcatggag
cttcatggtgaaggcagtagttctggaaaagccactggggacgagacaggtgctaaagtt
gaacgagctgatggatatgaaccaccagtccaagaatctgtttaa

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_8|285_aa
MNALGHKTAKRMENTVNQETRSSMDEEVQAEDQQTAVTERVSRWGRPARPASSGGVRPLL
LCSSKKARAALPQVLTFSQRRRARALSGRRLRRRPTLQSPLVAPRPSRCPESGGHEAQPR
PPALARPAAAAMSRGPEEVNRLTESTYRTFSNNSLEGPYQGELQRHKCQPLGHVLLPKER
KRQGKSVDFIILVIRICRPLPHDVQAENSLSLIWQKSGSVDESSAFKKRCPLYYLTIYHI
LLIEMQHNQGVCNLGREPCGKSFMSEDKPRPRKFWSERPYSCGSH

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_8|858_bp
atgaacgccctcgggcacaagacggcaaaaagaatggagaacacggtcaaccaggagacc
aggtcctcaatggatgaggaggtacaagcggaagaccagcagacagcagtcaccgagagg
gtcagcaggtggggccgcccggcccgcccagcttcctctggcggcgtccggccgcttctc
ctctgctcctcgaagaaggccagggcggcgctgccgcaagttttgacattttcgcagcgg
agacgcgcgcgggcactctcgggccgacggctgcggcggcggccgaccctccagagcccc
ttagtcgcgccccggccctcccgctgcccggagtccggcggccacgaggcccagccgcgt
cctcccgcgcttgctcgcccggcggccgcagccatgtcccgggggcccgaggaggtgaac
cggctcacggagagcacctaccggacttttagtaataactccttagagggaccgtaccaa
ggcgaattgcagcgtcacaaatgccaaccacttgggcatgtcttattacccaaagaacgg
aaacggcagggaaaaagtgtggatttcatcattttggttatacgtatttgtagacccctg
ccccatgatgttcaagctgaaaacagtttaagcttaatatggcaaaaatctggcagtgtt
gatgaatctagtgcttttaagaagagatgtcctctatattacctaaccatctaccatatt
ttactaatagaaatgcaacacaaccagggtgtatgtaaccttggcagggagccctgtggg
aaatccttcatgtcagaagacaaaccaagacctaggaaattctggtctgagaggccatac
agttgtggtagtcactag

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_9|250_aa
MVNPTVFFDIAVIDEPLGRVSFKLFADKFPKTTENFHALSTGEKGYGYKGSCFHSIIPGF
MCQGGDFTLLNGTGGKSIYREKFDDENFILKHTGPGILSMANAGLNTNGSQFFICTAKTE
WWMAGMWSLASKLTTTGTSQGELMDRTTVSATGILSPSRNDSKFLRNGTPLKSGLLPDVD
DLRNHLHLTGEYPGPAVKVHKPVTLFSFLPSSLAQLPAVAPYPTSQLWPNLHRAASVICS
ISNEIVQSSS

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_9|753_bp
atggtcaaccccaccgtgttcttcgacattgctgttatcgacgagcccttgggccgcgtc
tccttcaagctgtttgcagacaagtttccaaagacaacagaaaactttcatgctctgagc
actggagagaaaggatatggttataaaggttcctgctttcacagcattattccagggttt
atgtgtcagggcggtgacttcacactccttaatggcactggtggcaagtccatctacagg
gagaaatttgatgatgagaacttcattctaaagcatacaggtcctggcatcttgtccatg
gcaaatgctggactcaacacaaatggttcccagttcttcatctgcacggccaagactgag
tggtggatggcaggcatgtggtctttggcaagcaaactcaccacaactggcacatcccaa
ggagagctcatggataggaccacagtgtctgccaccggaatcctttctccatcaagaaat
gactccaagttcttaagaaatgggacacctcttaagagcggccttttgcccgatgtcgat
gatctgcgaaatcatttacatctaacaggagaatacccaggcccagccgtgaaagtccac
aagcctgtgacacttttttcttttttaccatcatccctggcccagcttcctgcagtggcc
ccgtaccccacctcccaactttggcctaatctccatagagcagccagcgtgatctgttct
atcagcaatgagatcgtgcagtcctcttcataa

>gi568815591r:98285797_98486584|GENSCAN_predicted_peptide_10|198_aa
XWDVEGLLVPYPLAESGPRFETRIMKAPGWLPLPLGGQLPDGVNEEVRMQQRLKDGRIEI
ARDCMIRLELYWRSRKRALRKQSVPPCVTNHLARREDKRVLNAIASSRGTAEKIQGVILQ
LSRGQESSKHTPPPPMILRVHPDHQAQPRLPVGASHLAPSPPKDQKRSSREPAQPGNPEE
ITYQIEERQPSLKAPPTT

>gi568815591r:98285797_98486584|GENSCAN_predicted_CDS_10|597_bp
ngatgggacgtagaaggccttcttgtaccctacccattggctgaatctggccccagattt
gagacccggataatgaaagcccctggatggcttcccctgcccttggggggacaactcccc
gatggggttaatgaagaggtcagaatgcagcagaggctgaaagatgggaggatagagatt
gcccgagactgcatgatccgcttagaactttattggagatctcgtaaaagagcccttagg
aaacaatcagtgcctccatgtgtcaccaatcacttagcacgcagagaggacaaacgagtg
ctaaatgcaattgccagttcccgtggcactgcagagaaaattcaaggtgtcatcctgcag
ctttcccggggtcaagaaagctccaaacacaccccccccccacccatgatcctaagagtc
catcctgaccaccaagcccagcctcgactccctgttggggccagccatcttgccccttct
ccaccaaaggaccagaagcgctcctcccgagagccagctcagcctggcaaccctgaagaa
atcacttatcaaatcgaggagcgccagccttccctcaaggcaccccctactacctaa