GENSCAN 1.0	Date run: 4-Nov-116	Time: 18:26:25

Sequence gi568815596f:44842105_45044757 : 202653 bp : 47.31% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  26858  26902   45  0  0  119   66    80 0.487   7.28
 1.02 Intr +  27549  27656  108  1  0  108   28    45 0.387   0.76
 1.03 Intr +  30509  30534   26  0  2   89  103     0 0.234  -0.66
 1.04 Term +  32617  32628   12  0  0  150   42     0 0.433  -0.20
 1.05 PlyA +  33812  33817    6                               1.05

 2.04 PlyA -  37967  37962    6                               1.05
 2.03 Term -  44963  44888   76  1  1  101   41    95 0.788   3.41
 2.02 Intr -  47121  47022  100  1  1   62   76    51 0.715   0.47
 2.01 Init -  50785  50686  100  1  1   61   80   103 0.568   7.22
 2.00 Prom -  66010  65971   40                              -4.36

 3.05 PlyA -  69938  69933    6                               1.05
 3.04 Term -  81475  81410   66  1  0  110   49    53 0.406   1.54
 3.03 Intr -  87330  87261   70  2  1   90   71    95 0.515   7.18
 3.02 Intr -  89339  89150  190  0  1   33    8   192 0.464   4.34
 3.01 Init -  91673  91667    7  2  1   61   89     0 0.436  -1.57
 3.00 Prom -  94534  94495   40                              -5.36

 4.00 Prom +  96123  96162   40                              -6.76
 4.01 Init +  98896  99019  124  0  1   58   71    87 0.601   4.57
 4.02 Intr +  99993 100806  814  1  1  140   92  1248 0.972 120.90
 4.03 Term + 102464 102656  193  2  1  108   44   315 0.963  25.99
 4.04 PlyA + 102842 102847    6                              -0.45

 5.00 Prom + 105957 105996   40                              -4.06
 5.01 Init + 106713 106873  161  2  2   79  103    24 0.583   2.31
 5.02 Term + 107710 107818  109  1  1  128   48    72 0.567   5.18
 5.03 PlyA + 108577 108582    6                               1.05

 6.05 PlyA - 109923 109918    6                               1.05
 6.04 Term - 114239 113572  668  0  2   44   53   263 0.003  12.39
 6.03 Intr - 131804 131739   66  0  0   34  113    68 0.633   2.78
 6.02 Intr - 136411 136281  131  0  2  115   60    38 0.664   4.04
 6.01 Init - 149092 148956  137  1  2   72  116    55 0.420   6.31
 6.00 Prom - 151612 151573   40                              -6.16

 7.00 Prom + 151646 151685   40                              -7.96
 7.01 Init + 151936 151999   64  0  1   54   83    77 0.515   5.11
 7.02 Intr + 154487 154739  253  0  1   11   37   251 0.522   8.79
 7.03 Term + 162509 162935  427  2  1   64   44   129 0.178   0.88
 7.04 PlyA + 163127 163132    6                               1.05

 8.03 PlyA - 163822 163817    6                               1.05
 8.02 Term - 164381 164066  316  1  1  126   42   329 0.988  26.51
 8.01 Init - 167006 166447  560  2  2   97  102  1223 0.932 119.07
 8.00 Prom - 196141 196102   40                              -2.76

 9.02 PlyA - 197717 197712    6                               1.05
 9.01 Term - 201339 201228  112  2  1   85   41   116 0.457   4.63

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_1|63_aa
XMPVAIYIPEDFTKSSSAEVPLLKEAYPDYSSSQSPLPLLNYQNPSSHYSPSVSSLSSKM
VLW
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_1|192_bp
nggatgcctgtggccatctatatcccggaggactttaccaaaagcagctctgctgaagtc
ccactcctcaaggaagcctacccggactattccagctcacagtcacctctccctctactg
aactatcagaacccctcgagccactactcccccagtgtttctagtctttcctctaagatg
gtactatggtga
>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_2|91_aa
MWDSLELPRDLLNGFDQNADGNMDNEVQTEVVSVPKGGSSERGKGSIQSQHLQTIRPPPC
APSTSPLTQLQIPQEEYSKKSLLIDPNWSKI
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_2|276_bp
atgtgggacagtttggaacttcctagagacttgttgaatggctttgaccaaaatgctgat
ggtaatatggacaatgaagtccagactgaagtggtctcagtccctaaaggaggaagttca
gagaggggcaaaggttcaatccagtctcaacatttacagaccatccggccacccccttgt
gccccaagcacgtctcccctgactcaacttcagattcctcaagaagagtactccaagaag
tccctcctgattgatccaaactggtccaagatttaa
>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_3|110_aa
MPMECKSKKIHLDKEGGKKGLSGPASRMTTLQGAEEGPFCEEEINYPENATGGHVSADVV
ALETEDPAPSTPTRKQPSGLRQRPEAVAQESLETHQAEEEEGNISTQAEF
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_3|333_bp
atgccaatggagtgtaagtccaagaagattcatttagacaaagaaggtggaaaaaaagga
ctttctgggccagcaagtcggatgaccaccctccaaggggcagaggagggcccattttgt
gaagaagaaatcaactacccggaaaacgccacaggaggacatgtttctgcagatgtagtt
gccctagaaacagaagaccccgcgccctccaccccaacacgcaaacagccctcggggctg
cgtcagcggcctgaggccgtggcccaggaaagtctggagactcaccaagctgaggaggaa
gaagggaacatctcaacccaagctgaattctga
>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_4|376_aa
MGTDGGGKSGLQHPGNWAQAPWRQIPSESGIPGWRHGPLGQGQSMVFRSPLDLYSSHFLL
PNFADSHHRSILLASSGGGNGAGGGGGAGGGSGGGNGAGGGGAGGAGGGGGGGSRAPPEE
LSMFQLPTLNFSPEQVASVCETLEETGDIERLGRFLWSLPVAPGACEAINKHESILRARA
VVAFHTGNFRDLYHILENHKFTKESHGKLQAMWLEAHYQEAEKLRGRPLGPVDKYRVRKK
FPLPRTIWDGEQKTHCFKERTRSLLREWYLQDPYPNPSKKRELAQATGLTPTQVGNWFKN
RRQRDRAAAAKNRLQHQAIGPSGMRSLAEPGCPTHGSAESPSTAASPTTSVSSLTERADT
GTSILSVTSSDSECDV
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_4|1131_bp
atgggcaccgatgggggaggcaagagcggcctccagcaccctgggaattgggctcaggct
ccctggcgccagataccctcggaatccgggatccccggctggcgccacggcccgctgggc
caaggtcagtccatggtattccgctcccccctagacctctattcctcccacttcttgttg
ccaaacttcgccgattctcaccaccgctccatacttctggcgagtagcggcggcgggaac
ggtgcgggaggcggcggcggcgcgggaggcggcagcggcggcgggaacggtgcgggaggc
ggcggtgctggcggagcaggcggcggcggcggcggcggctccagggcccccccggaagag
ttgtccatgttccagctgcccaccctcaacttctcgccggagcaggtggccagcgtctgt
gagacgctggaggagacgggcgacatcgagcggctgggccgcttcctctggtcgctgccc
gtggcccccggggcgtgcgaggccatcaacaaacacgagtcgatcctgcgcgcgcgcgcc
gtggtcgccttccacacgggcaacttccgcgacctctaccacatccttgagaaccacaag
ttcaccaaggagtctcacggcaagctgcaggccatgtggctcgaggcgcactaccaggag
gccgagaagctgcgcggccgcccactcggcccggtggacaagtaccgcgtgcgcaagaag
ttcccgctgccacgcaccatctgggacggcgagcagaagacgcattgcttcaaggagcgg
actcggagcctgttgcgggagtggtacctacaggacccctaccccaaccccagcaagaaa
cgcgaactggcgcaggccaccggcctcactcccacacaagtaggcaactggtttaagaac
cggcggcagcgcgaccgcgccgcggcggccaagaacaggctccagcaccaggccattgga
ccgagcggcatgcgctcgctggccgagcccggctgccccacgcacggctcggcagagtcg
ccgtccacggcggccagcccgaccaccagcgtgtccagcctgacggagcgcgcagacacc
ggcacctccatcctctcggtaacctccagcgactcggaatgtgatgtatga
>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_5|89_aa
MQQPVGWLLCFWRSSESAPTTLHAAHFRLASQRSVLARSLPCVLRPPVYLSHLPAGAEMP
GPRDARPPGTRAEAPTASHRSGPFRPQLS
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_5|270_bp
atgcagcagccagtaggctggctcctctgcttctggcgttccagtgagtccgcacccact
accctgcacgcagcacacttccgcctggcatcccagcggagtgttctggctcggagcctt
ccctgtgtgcttagacctcctgtgtacctcagtcacttacccgctggagctgagatgccc
ggcccgcgggacgctcgaccccctggtacgcgcgctgaggcccccacggcttcccaccgc
tccggccctttccggccccagctgagctga
>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_6|333_aa
MERREHEDREGKSRMKRNEETGQKKEKRKEKAYRKKKTEEESRGIRQTNPGPVVPRPNPK
VLLYMNKGKSLGSGVRDLSSNADSAIVIPGFLVGVTLVHQRPDLGLSSYQNGPGSRLPLA
VSKVWQRQAYGRPSASPCGEERVLAPHGFPPDSLSPWKTAPHSRCRRPRGIYLQDKKLIV
PKKGNISMRPIASPGGQPRDQVVKQEKGARKVTPNPTGQGGGSYPRKMVDFGIRQRAVPQ
EEKQAIPVEEVLRKLEQKRTARQGKAPHTTHVHYQGPRCGGFACYAPSPRGGSKNPPVAP
ERAAFRLQPAVLCARVRTASARGVRPRNRGVQG
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_6|1002_bp
atggagagaagagaacatgaagacagagaaggaaaaagcagaatgaagagaaatgaagag
actggacaaaaaaaggaaaagagaaaagaaaaagcttacaggaaaaagaagacagaagag
gaatcaagaggtataaggcaaacaaaccctggccctgtggtccctagacccaaccccaaa
gttctcctttatatgaacaaaggaaagagcctgggatctggagtcagagatttgagttcc
aatgctgattctgccattgtaattccaggcttcctggttggtgtcaccctggttcatcag
agacctgatcttggactttccagctaccagaacgggccagggtctaggctgccattggct
gtctccaaggtgtggcagcgtcaggcatatggaaggccttcagcaagcccttgtggagag
gagagggtactagctccgcatggcttcccaccagacagcctatcaccatggaagacagcc
ccacactcccgctgtaggagacctagaggcatctatctgcaagacaagaagttaatagtg
ccgaagaagggaaatattagcatgagacctattgcaagtccaggtgggcagccacgggat
caagtagtaaagcaagagaaaggagctagaaaggtgactccaaatcccacaggccagggt
ggagggtcctacccacgcaaaatggtggactttggtataagacagcgcgcagtccctcaa
gaggagaagcaggctatcccggtggaagaggtgttgcgaaaactcgagcagaaaaggaca
gcgcggcagggcaaggccccccacaccacccacgtgcactaccaaggcccgcggtgcgga
gggttcgcatgctatgcaccatcgccgcgaggtggcagcaaaaacccgccagtggcccca
gagcgcgctgctttccgcctccaacctgcagtgctttgcgccagggtgcgcacagccagc
gcgaggggcgtccggccgcggaaccgcggggtccaaggctga
>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_7|247_aa
MELKFTEFADLLTVAQEGEDGGLFVLSLTAFKILENLVFGKGSQAKTFPEILLCLLLALF
ASGLIHRVCVTTCFIFSVVGLCYINKISTLYQAAAAVLTGPWLLPSFTANPQKQVCELEY
LCSTATRSTGTGRLELKSGGLSLGPANPRSASRPVSETATTAKEYARQETSAAQFLPQLW
HLRRHVPALGTSSLVFHFDLEVEKEAGKGCARRWCGEGSRLPLPASAASFPSQEKGGSCF
RNFSAFT
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_7|744_bp
atggaactgaagttcacagagtttgctgacttgctcacggttgcacaggaaggtgaagat
ggaggtctcttcgtgctctccctcactgccttcaagattctggagaatcttgtgtttggc
aaaggatcccaagcaaagaccttccctgagattctcctgtgcctcctgttggctctcttt
gcatctggcctcatccaccgagtctgtgtcaccacctgcttcatcttctccgtggttggt
ctgtgctacatcaacaagatctccactctgtatcaggcagcagctgcagtcctcacagga
ccttggctcttgccaagtttcactgcaaatccacagaagcaggtttgcgagctcgaatac
ctttgctccactgccacacgcagcaccgggactgggcgtctggagcttaagtctgggggt
ctgagcctgggaccggcaaatccgcgcagcgcatcgcgcccagtctcggagactgcaacc
accgccaaggagtacgcgcggcaggaaacttctgcggcccaatttcttccccagctttgg
catctccgaaggcacgtacccgccctcggcacaagctctctcgtcttccacttcgacctc
gaggtggagaaagaggctggcaagggctgtgcgcgtcgctggtgtggggagggcagcagg
ctgcccctccccgcttctgcagcgagttttcccagccaggaaaagggagggagctgtttc
aggaatttcagtgccttcacctag
>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_8|291_aa
MSMLPTFGFTQEQVACVCEVLQQGGNIERLGRFLWSLPACEHLHKNESVLKAKAVVAFHR
GNFRELYKILESHQFSPHNHAKLQQLWLKAHYIEAEKLRGRPLGAVGKYRVRRKFPLPRS
IWDGEETSYCFKEKSRSVLREWYAHNPYPSPREKRELAEATGLTTTQVSNWFKNRRQRDR
AAEAKERENNENSNSNSHNPLNGSGKSVLGSSEDEKTPSGTPDHSSSSPALLLSPPPPGL
PSLHSLGHPPGPSAVPVPVPGGGGADPLQHHHGLQDSILNPMSANLVDLGS
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_8|876_bp
atgtccatgctgcccaccttcggcttcacgcaggagcaagtggcgtgcgtgtgcgaggtg
ctgcagcagggcggcaacatcgagcggctgggccgcttcctgtggtcgctgcccgcctgc
gagcaccttcacaagaatgaaagcgtgctcaaggccaaggccgtggtggccttccaccgc
ggcaacttccgcgagctctacaagatcctggagagccaccagttctcgccgcacaaccac
gccaagctgcagcagctgtggctcaaggcacactacatcgaggcggagaagctgcgcggc
cgacccctgggcgccgtgggcaaataccgcgtgcgccgcaaattcccgctgccgcgctcc
atctgggacggcgaggagaccagctactgcttcaaggaaaagagtcgcagcgtgctgcgc
gagtggtacgcgcacaacccctacccttcaccccgcgagaagcgtgagctggcggaggcc
acgggcctcaccaccacacaggtcagcaactggttcaagaaccggcggcagcgcgaccgg
gcggccgaggccaaggaaagggagaacaacgagaactccaattctaacagccacaacccg
ctgaatggcagcggcaagtcggtgttaggcagctcggaggatgagaagactccatcgggg
acgccagaccactcatcatccagccccgcactgctcctcagcccgccgccccctgggctg
ccgtccctgcacagcctgggccaccctccgggccccagcgcagtgccagtgccggtgcca
ggcggaggtggagcggacccactgcaacaccaccatggcctgcaggactccatcctcaac
cccatgtcagccaacctcgtggacctgggctcctag
>gi568815596f:44842105_45044757|GENSCAN_predicted_peptide_9|37_aa
XFYETRGNSLLLFSVAVANSSNLLKCRLVGAVSLATD
>gi568815596f:44842105_45044757|GENSCAN_predicted_CDS_9|114_bp
nngttttatgagaccagaggtaattccctgctgctgttcagcgttgctgtggctaacagt
tccaaccttctgaagtgccggctggttggggctgtctctctggcaactgattag