GENSCAN 1.0    Date run: 3-Nov-116    Time: 17:34:38

Sequence gi568815587f:128658113_128910985 : 252873 bp : 46.12% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  10412  10457   46  1  1   81   96    21 0.039   3.04
 1.02 Intr +  28542  28589   48  1  0   53  116    51 0.058   3.05
 1.03 Term +  32348  32568  221  0  2    4   48   224 0.234   7.20
 1.04 PlyA +  34522  34527    6                               1.05

 2.04 PlyA -  34549  34544    6                              -0.45
 2.03 Term -  35099  34639  461  0  2   91   48   116 0.089   3.15
 2.02 Intr -  49324  49187  138  2  0  125   71     6 0.370   3.04
 2.01 Init -  53449  53371   79  1  1   63   82    27 0.164   0.82
 2.00 Prom -  54411  54372   40                              -5.76

 3.00 Prom +  54526  54565   40                              -3.16
 3.01 Init +  66214  66280   67  0  1   83   94    23 0.196   3.64
 3.02 Intr +  85990  86156  167  2  2   75  103   113 0.034  11.18
 3.03 Intr + 100003 100214  212  0  2  106  115   297 0.989  31.91
 3.04 Intr + 103133 103307  175  2  1   47   90    39 0.493  -0.06
 3.05 Intr + 106819 106922  104  0  2   48   89    72 0.213   2.27
 3.06 Intr + 110006 110160  155  2  2  103   91   281 0.999  29.62
 3.07 Intr + 111036 111161  126  1  0   40   75   105 0.845   5.05
 3.08 Intr + 114670 114873  204  2  0  105  106   176 0.547  20.27
 3.09 Intr + 123846 123911   66  1  0  117  115    23 0.734   6.78
 3.10 Intr + 136653 136687   35  1  2   89   64    31 0.045  -1.26
 3.11 Term + 142922 143080  159  1  0   73   43   170 0.173   8.94
 3.12 PlyA + 144295 144300    6                               1.05

 4.00 Prom + 146923 146962   40                              -6.56
 4.01 Init + 148142 148382  241  1  1   77  116   160 0.974  15.75
 4.02 Intr + 149068 149127   60  2  0  134   82     5 0.888   3.21
 4.03 Intr + 151045 151092   48  2  0   77  105    59 0.973   5.15
 4.04 Intr + 152347 152708  362  2  2   71   61   518 0.427  42.34
 4.05 Term + 162499 162519   21  0  0  123   34    24 0.118  -1.19
 4.06 PlyA + 163267 163272    6                               1.05

 5.03 PlyA - 165746 165741    6                               1.05
 5.02 Term - 166792 166427  366  1  0  108   52   101 0.473   3.10
 5.01 Init - 169932 169891   42  0  0   49   94    61 0.569   3.28
 5.00 Prom - 171547 171508   40                              -4.56

 6.03 PlyA - 171945 171940    6                               1.05
 6.02 Term - 182152 181013 1140  1  0   77   42   611 0.881  47.36
 6.01 Init - 184305 184270   36  0  0   85  111    55 0.987   7.51
 6.00 Prom - 194024 193985   40                              -1.96

 7.06 PlyA - 194190 194185    6                              -1.95
 7.05 Term - 194703 194432  272  1  2  108   47   177 0.994  11.25
 7.04 Intr - 196051 195955   97  1  1   85   61    88 0.730   5.38
 7.03 Intr - 201925 201901   25  0  1   65  107    52 0.236   2.83
 7.02 Intr - 202549 202495   55  2  1   78   58    45 0.554  -1.46
 7.01 Init - 206891 206828   64  2  1  104   78    41 0.522   6.01
 7.00 Prom - 209682 209643   40                              -7.26

 8.00 Prom + 211991 212030   40                              -5.76
 8.01 Init + 213001 213015   15  0  0   62  115    -5 0.028   0.01
 8.02 Intr + 218199 218444  246  2  0   67   55   129 0.006   5.16
 8.03 Intr + 222249 222384  136  2  1   25   44    98 0.003  -0.76
 8.04 Intr + 232335 232412   78  1  0   58   91    88 0.930   5.62
 8.05 Term + 232528 232871  344  2  2  117   38   109 0.354   3.47
 8.06 PlyA + 235332 235337    6                               1.05

 9.07 PlyA - 235527 235522    6                              -0.45
 9.06 Term - 236458 236331  128  2  2  103   47   106 0.983   6.44
 9.05 Intr - 242597 242450  148  2  1   40  113    85 0.706   6.01
 9.04 Intr - 243773 243702   72  2  0   51   94    62 0.416   2.70
 9.03 Intr - 244117 244057   61  0  1    6   53    60 0.163  -6.96
 9.02 Intr - 245342 245255   88  0  1   56  105    54 0.487   2.93
 9.01 Init - 246454 246319  136  1  1   61   95   131 0.813   9.71
 9.00 Prom - 250239 250200   40                              -3.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  86004  86156  153  2  0   83  103   121 0.866  13.08
S.002 Init - 143056 142930  127  1  1   65   22   112 0.805   2.84
S.003 Init + 186141 186165   25  2  1   95   82    43 0.851   4.09
S.004 Term - 218470 218121  350  2  2   31   39   243 0.850   8.35
S.005 Init + 231121 231130   10  0  1   39  116     9 0.923  -0.58

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_1|104_aa
MVGFPQEEKKRDWKPATHDVSDSPRHVVLHQVWGRRSPGSPCLESLDRVKRADIERAKTR
FDPPLPSRGKQDFRIFENTVKCEWCEASRLTSVPNENAPNPNIS
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_1|315_bp
atggtagggttccctcaggaagagaagaagagggactggaagccagctacccacgatgtt
tcagacagtccccgacacgtcgtcttacatcaagtctggggccgaagaagcccgggaagc
ccgtgtctggaaagtcttgaccgggtcaaaagggctgacattgagcgagccaagacacgt
ttcgacccgccgctcccctcccgggggaagcaggacttccgcatctttgaaaatactgtc
aagtgcgaatggtgcgaggcaagtcgtctcaccagcgttccaaatgaaaatgctccaaat
ccaaatatttcctga
>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_2|225_aa
MIRDYPFDKFQMPSFDKREIKYYILKEKCKKYRCPSSPSPLGKSYVAPKTVFEKEDLCPS
TNVFLLWRSRRAEGGDPRRPRPPIRIFSSQTPSAYPDPTAQALTPRQESPFLVPRELTLP
RDVPSRTVSRSQERSTQQRRADPDFSRGETRRPALWTVLPTQPGGRRSPAQPGDLPASLL
GRRAPQLEQPRGAGPFRPDSLLPDIQNRLPCPRSPDHALDLLTFI
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_2|678_bp
atgattcgtgactatccttttgataaattccaaatgccctcatttgataaaagggaaata
aagtattacattcttaaagaaaagtgcaaaaagtacaggtgcccttcaagtccctcacct
ctagggaagtcctatgttgcaccaaagaccgtatttgagaaagaggacttgtgcccttcg
accaatgtgttcttattatggagaagtcgaagggcagagggcggagatccgcggcgccct
aggccacctatccgcatcttctcgagccaaacccccagcgcttacccggacccgacggcc
caagcactgactcccaggcaagagtcgcccttcctggtccccagggaactgacactaccg
cgggacgttccctctaggacagtcagcaggtcccaggagcgcagcacgcaacagcgcaga
gcagacccagacttttcccggggtgagacccggaggcccgcgctctggaccgtcctccct
acccagcctggaggccggcgttccccggcccaacccggagacctccccgcctccctcctg
ggccgccgcgcgccccagttggagcagcctcgaggggctgggccctttcggccggactcc
cttctcccggacatccaaaacaggttaccttgtccacgctctccggaccacgctttggac
ttgctcacttttatttaa
>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_3|489_aa
MGWCQASESATDMGQQVRRQDTEGRAKMTINGAMKLMNTELSELTRRQSGHIMHQQKWDN
FERDKHFLEQQHKAELFQEALSVVSDDQSLFDSAYGAAAHLPKADMTASGSPDYGQPHKI
NPLPPQQEWINQPVRVNVKREYDHMNGSSHMTLSLTSFGFLYLNLTFSEASPGHLIETSA
SSLFTLLRVSVLVLITSLTDTWAGPETVLVQKPGTEDRAIFQRRNTLAGERPNRSPVQLE
SRESPVDCSVSKCSKLVGGGESNPMNYNSYMDEKNGPPPPNMTTNERRVIVPAENKIIVQ
SFSPLAGHSTPASTSKLSCLHKKGVYKLPRVELVPDPTLWTQEHVRQWLEWAIKEYSLME
IDTSFFQNMDGKELCKMNKEDFLRATTLYNTEVLLSHLSYLRESSLLAYNTTSHTDQSSR
LSVKEGICKIEKFNMGQKQTKIINWYTGADGHIFTHWAFGCHLKFVSSQEADSRHHIREG
LHARSGFFS
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_3|1470_bp
atgggatggtgccaggccagtgagagtgccacggatatggggcagcaggtgaggaggcag
gacacagaagggagagcaaagatgacaatcaatggggccatgaagctcatgaacactgag
ttatctgagcttactcgtagacagagtggccatataatgcatcaacaaaagtgggataat
tttgagcgggataagcatttcctggaacaacagcataaagcggaactgttccaggaggct
ctgtcggtggtgagcgacgaccagtccctctttgactcagcgtacggagcggcagcccat
ctccccaaggccgacatgactgcctcggggagtcctgactacgggcagccccacaagatc
aaccccctcccaccacagcaggagtggatcaatcagccagtgagggtcaacgtcaagcgg
gagtatgaccacatgaatggatccagtcatatgactctctccctcacgagttttgggttt
ctctacctgaacctcaccttttcagaggcctcccctggccacctgatcgaaaccagcgcc
tcctcactctttaccctgcttagagtgtctgtcctagttcttatcacctccctcactgac
acgtgggcagggcctgaaacggtcctagtgcagaagcctgggaccgaggacagggccatt
ttccagaggagaaacaccctggcgggcgagcggccaaaccgatcacctgtccagctggag
agcagggagtctccggtggactgcagcgttagcaaatgcagcaagctggtgggcggaggc
gagtccaaccccatgaactacaacagctatatggacgagaagaatggcccccctcctccc
aacatgaccaccaacgagaggagagtcatcgtccccgcagaaaataaaatcatcgttcag
tccttcagtccattggctggacactccaccccagctagcaccagcaagctcagctgcctg
cacaagaagggtgtttacaaactgccacgtgtagaactggtgccagaccccacactgtgg
acacaggagcatgtgaggcaatggctggagtgggccataaaggagtacagcttgatggag
atcgacacatcctttttccagaacatggatggcaaggaactgtgtaaaatgaacaaggag
gacttcctccgcgccaccaccctctacaacacggaagtgctgttgtcacacctcagttac
ctcagggaaagttcactgctggcctataatacaacctcccacaccgaccaatcctcacga
ttgagtgtcaaagaagggatctgtaagatcgagaagtttaacatgggtcagaaacagacc
aagatcatcaactggtacactggtgccgacggccacatttttacacactgggcctttggc
tgccacctcaagtttgtcagttctcaggaggcagactccaggcaccacattcgagagggc
ctccatgcacgctcaggcttcttctcatag
>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_4|243_aa
MHWSHKREREIKHTQNSFLLVGIKEGFVDLVALELALEGEAGFGPVAIWAGEDMSLACLT
VSEDLRFLELRMCDAEEVWKGPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQ
IQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSRALR
YYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKEHQ
IIQ
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_4|732_bp
atgcactggtcccataagcgggagagagaaatcaagcacacacagaactcctttcttctg
gttggtatcaaggaaggctttgtggacctggtagcacttgagttagctctggaaggagag
gcaggatttggacctgtggcgatatgggctggggaagacatgagcttggcatgtctgaca
gtcagtgaggacctcaggttcctggagcttaggatgtgtgatgcagaagaagtctggaaa
ggtcctccccttggaggggcacaaacgatcagtaagaatacagagcaacggccccagcca
gatccgtatcagatcctgggcccgaccagcagtcgcctagccaaccctggaagcgggcag
atccagctgtggcaattcctcctggagctgctctccgacagcgccaacgccagctgtatc
acctgggaggggaccaacggggagttcaaaatgacggaccccgatgaggtggccaggcgc
tggggcgagcggaaaagcaagcccaacatgaattacgacaagctgagccgggccctccgt
tattactatgataaaaacattatgaccaaagtgcacggcaaaagatatgcttacaaattt
gacttccacggcattgcccaggctctgcagccacatccgaccgagtcgtccatgtacaag
tacccttctgacatctcctacatgccttcctaccatgcccaccagcagaaggaacatcaa
atcatccagtga
>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_5|135_aa
MNPIILQLHELELQPSAPCGSQHPRPPPPELPPASPNIPSPKAAPNPQAAPRALGHPHPA
APELSPVSPGPQRPRPPPTRSCPLSSPHLPVPKAAPHPELLPEYPAPRALGCPSPLSCYP
RTPPPAPSAAPHPQG
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_5|408_bp
atgaatcctatcatcctacaactccacgaactggagctgcagccctcggccccctgtggc
tcccagcaccctcggccacccccacccgagctgccccctgcgtcccccaacattccctcg
cccaaggctgcccccaacccccaagctgctccccgcgccctcggccacccccacccggct
gctcccgagctgtcccctgtatcccccggcccccaacgccctcggccaccccccacccgg
agctgccccctgagttccccccacctccctgtgcccaaggctgccccccacccggagctg
ctcccggagtaccccgccccgcgcgctctcggctgcccctcacccttgagctgctacccg
cgtaccccgccccccgcgccctcggctgccccccacccccagggctga
>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_6|391_aa
MNASSRNVFDTLIRVLTESMFKHLRKWVVTRFFGHSRQRARLVSKDGRCNIEFGNVEAQS
RFIFFVDIWTTVLDLKWRYKMTIFITAFLGSWFFFGLLWYAVAYIHKDLPEFHPSANHTP
CVENINGLTSAFLFSLETQVTIGYGFRCVTEQCATAIFLLIFQSILGVIINSFMCGAILA
KISRPKKRAKTITFSKNAVISKRGGKLCLLIRVANLRKSLLIGSHIYGKLLKTTVTPEGE
TIILDQININFVVDAGNENLFFISPLTIYHVIDHNSPFFHMAAETLLQQDFELVVFLDGT
VESTSATCQVRTSYVPEEVLWGYRFAPIVSKTKEGKYRVDFHNFSKTVEVETPHCAMCLY
NEKDVRARMKRGYDNPNFILSEVNETDDTKM
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_6|1176_bp
atgaatgcttccagtcggaatgtgtttgacacgttgatcagggtgttgacagaaagtatg
ttcaaacatcttcggaaatgggtcgtcactcgcttttttgggcattctcggcaaagagca
aggctagtctccaaagatggaaggtgcaacatagaatttggcaatgtggaggcacagtca
aggtttatattctttgtggacatctggacaacggtacttgacctcaagtggagatacaaa
atgaccattttcatcacagccttcttggggagttggtttttctttggtctcctgtggtat
gcagtagcgtacattcacaaagacctcccggaattccatccttctgccaatcacactccc
tgtgtggagaatattaatggcttgacctcagcttttctgttttctctggagactcaagtg
accattggatatggattcaggtgtgtgacagaacagtgtgccactgccatttttctgctt
atctttcagtctatacttggagttataatcaattctttcatgtgtggggccatcttagcc
aagatctccaggcccaaaaaacgtgccaagaccattacgttcagcaagaacgcagtgatc
agcaaacggggagggaagctttgcctcctaatccgagtggctaatctcaggaagagcctt
cttattggcagtcacatttatggaaagcttctgaagaccacagtcactcctgaaggagag
accattattttggaccagatcaatatcaactttgtagttgacgctgggaatgaaaattta
ttcttcatctccccattgacaatttaccatgtcattgatcacaacagccctttcttccac
atggcagcggagacccttctccagcaggactttgaattagtggtgtttttagatggcaca
gtggagtccaccagtgctacctgccaagtccggacatcctatgtcccagaggaggtgctt
tggggctaccgttttgctcccatagtatccaagacaaaggaagggaaataccgagtggat
ttccataactttagcaagacagtggaagtggagacccctcactgtgccatgtgcctttat
aatgagaaagatgttagagccaggatgaagagaggctatgacaaccccaacttcatcttg
tcagaagtcaatgaaacagatgacaccaaaatgtaa
>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_7|170_aa
MKDLVCDAEELGQCPEGGENRGYCVSRPLGPADVGYKAHPRQGALLLLTCPLALGGVRDF
TLELPTQDRNQTPFSDLQSPEADTRIGQEQFKTFAGGRSWEVSRAVSCSERAPGWAVPGN
EDGGMLNASKKSPSEKTPAFAEGQLYGMEQRVTEAAGSAPSGSRAFPEST
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_7|513_bp
atgaaggacctagtatgtgatgctgaggaacttgggcagtgtccggaaggtggtgagaac
cggggctattgcgtctccaggccccttggtccagctgatgtgggttacaaggcccacccg
cgccagggggcgctgctgctgctgacctgccctctggcactgggaggtgtgagggacttc
accctggagctgccaacccaggatcgtaaccaaacaccgttctcagacctgcagagccca
gaggcagacacgaggattggtcaagagcagttcaagacatttgctggaggacgcagctgg
gaagtctcccgggcagtgtcctgttctgagagggctcctgggtgggctgtgcccggtaat
gaggatgggggcatgctaaatgccagcaagaagagccccagtgagaagactccagcattt
gctgaagggcagctctatggcatggaacaaagggtcaccgaagctgctggctctgctccc
tctggctccagggctttcccagaaagcacttga
>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_8|272_aa
MCLPQVDRVERPVDSIALSWEQNLPGLCTFNPAGSQLSAVRRKAFTLQFPGDCSQDAWVS
TTDSSAEGLLRGNFPPEKHRNETAMSPSHTGKGSSLREVELLDKRAEALLVMSTSHEDQA
GSGAWGHQKGEGACPGSCLSYGFHELKNTLWSSSVLKAGRVSSHRRPAAAARRGEIVRGE
QRARGQLAWVLSLDLLPLQLADQAPGPSLHGGNQSRRAGDAAPRLWLPRLPKGIPPSAPS
LSGNGEMLPTVRLAHRGATEQPRRGSEPRFLP
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_8|819_bp
atgtgtctcccccaggtagaccgcgtggagcgcccagtggactcgatagcgctgtcctgg
gaacagaaccttcccggactgtgcacttttaaccctgcaggttctcagctgtctgccgtc
agaaggaaagcattcacactccagttcccaggagactgcagccaggacgcttgggtttcc
accacggattcctctgctgaggggctgctccgcggtaactttcctcccgagaagcacagg
aatgaaacggcgatgagcccctctcacacagggaagggctccagcctgcgggaggtggag
ctgctggataaaagagctgaggcccttctggtgatgtccaccagccacgaggatcaggcg
gggtcaggggcctggggacaccagaaaggggagggtgcctgccctggcagctgcctttcc
tacggcttccacgaattaaagaacacactctggtcatcatctgtcctcaaggcagggcgc
gtttccagccatcgccgcccagccgccgccgcacgccggggagagatcgtccggggtgag
cagcgagcccggggacagctagcctgggtcctgtccctggacctactccccctgcagctg
gcggatcaggcccccggaccctcactccacggcgggaatcagagccgtcgggcgggggat
gcagcgcccagactgtggctcccccgccttcccaaaggaatccccccatctgctccatcc
ctctcgggaaacggcgaaatgctgccaaccgtccgcttagcgcaccgcggagcgactgag
cagccccgccgaggctccgagccaagatttctgccctaa
>gi568815587f:128658113_128910985|GENSCAN_predicted_peptide_9|210_aa
MLTRLVLSAHLSSTTSPPWTHAAISWELDNVLMPSPRIWPQVTPTAGQDVHAIVTRTCES
VLSSAVYTHGCGCVSGEGIVEGTAEVCEMSGMRPQRSLIAKLLSTLRKGAAGKADEASQD
SGIVDTETPTVKFYVIQTLGDLAIGLPGGPGFWLHNKGGWLVLNGATEENKNKTQKDSFV
ADDYLNSYLMTDPHLETLPALNDLSIAHTQ
>gi568815587f:128658113_128910985|GENSCAN_predicted_CDS_9|633_bp
atgttgacgcggctggtcctcagtgcacacctgagtagcacgacctctccgccctggacg
cacgctgccatcagctgggagctggacaacgtgctgatgcctagtcccagaatctggccc
caggtgactccaacagctgggcaggatgtgcatgccatagtaaccagaacctgtgagtct
gtgctgagctctgccgtctacacccacggctgtggctgtgtgagcggtgagggcatcgtg
gaagggactgcagaggtgtgcgagatgagtggaatgaggccccagcgttctctgattgcc
aagctgttaagcactctgcggaagggagctgctgggaaggcggacgaggcatctcaggat
tctggcattgtagacacagaaacaccgactgtgaagttttatgtaatacaaacactggga
gatttagcaataggcctgccaggcggccctggcttctggctgcacaacaaaggggggtgg
ctggtgctgaatggggcaacagaagaaaataaaaacaaaacacagaaagattccttcgtg
gctgacgactatcttaacagttatctgatgacagatccgcacttggaaacactgccagcc
ctcaatgacctgtctatagctcatacacagtga