GENSCAN 1.0 Date run: 8-Nov-116 Time: 02:08:54 Sequence gi568815579f:5355492_5556778 : 201287 bp : 50.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1548 1661 114 2 0 59 37 99 0.195 2.54 1.02 Intr + 9055 9157 103 0 1 22 93 65 0.053 0.15 1.03 Term + 20093 20334 242 0 2 21 42 335 0.454 18.39 1.04 PlyA + 21086 21091 6 1.05 2.04 PlyA - 22620 22615 6 1.05 2.03 Term - 34349 34189 161 0 2 106 37 36 0.026 -1.60 2.02 Intr - 41597 41454 144 0 0 93 80 36 0.196 3.55 2.01 Init - 46004 45974 31 2 1 110 67 60 0.353 4.77 2.00 Prom - 55159 55120 40 -3.26 3.00 Prom + 64541 64580 40 -1.96 3.01 Init + 64706 64721 16 1 1 71 69 16 0.070 -1.37 3.02 Intr + 80525 80806 282 0 0 107 85 118 0.545 10.79 3.03 Intr + 81216 81319 104 1 2 150 80 1 0.891 5.39 3.04 Term + 85419 85541 123 2 0 3 33 130 0.018 -2.62 3.05 PlyA + 89823 89828 6 1.05 4.00 Prom + 91913 91952 40 -3.26 4.01 Sngl + 100211 101290 1080 1 0 102 43 1861 0.993 178.10 4.02 PlyA + 101343 101348 6 1.05 5.09 PlyA - 101640 101635 6 1.05 5.08 Term - 108010 107876 135 1 0 25 38 261 0.643 12.82 5.07 Intr - 114171 114081 91 2 1 89 99 33 0.506 4.50 5.06 Intr - 124721 124666 56 2 2 77 75 26 0.256 -2.02 5.05 Intr - 125165 125021 145 1 1 109 94 65 0.814 9.48 5.04 Intr - 145314 145193 122 0 2 74 94 38 0.002 2.29 5.03 Intr - 157533 157447 87 0 0 94 78 18 0.194 1.57 5.02 Intr - 161205 161022 184 2 1 50 92 101 0.234 6.49 5.01 Init - 170309 170287 23 2 2 77 113 25 0.232 3.08 5.00 Prom - 176546 176507 40 -3.66 6.00 Prom + 176746 176785 40 -4.26 6.01 Init + 181268 181386 119 1 2 84 88 89 0.638 6.17 6.02 Term + 184883 184982 100 2 1 81 47 96 0.813 2.40 6.03 PlyA + 187412 187417 6 1.05 7.04 PlyA - 192151 192146 6 1.05 7.03 Term - 192564 192460 105 0 0 109 43 48 0.333 0.81 7.02 Intr - 192715 192634 82 0 1 69 52 31 0.187 -2.76 7.01 Init - 195494 195367 128 2 2 72 109 70 0.880 7.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 88089 88036 54 0 0 142 47 11 0.842 -0.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:5355492_5556778|GENSCAN_predicted_peptide_1|152_aa LWAPGPQEWLGQDGYKENLIAPELKPNVFEESCQDSVKKCLKLKFLNAERRVYGECLAPG AQPMFGICSLWAEEQEEEEEKEEREGEGEGEGEGEGEEEEEEGEGEGEGKGEGEEEEGEG EEEGEGGGGAEKKERKKKKKKEEEEEEEEEEG >gi568815579f:5355492_5556778|GENSCAN_predicted_CDS_1|459_bp ctgtgggcgccaggccctcaagagtggctggggcaggacggctacaaagagaatttgatt gctccggagttgaagcccaacgtgttcgaggagtcatgccaagattcagtgaagaaatgt ctaaaactcaagtttttgaatgcggaacgaagggtgtatggagagtgtttagctcctggt gcccagcccatgtttggcatctgcagcctctgggctgaagaacaagaggaagaggaagaa aaagaagaaagagaaggagaaggagagggagagggagagggagaaggagaggaagaggaa gaagaaggagaaggagagggagagggaaaaggagaaggagaggaagaggaaggagaagga gaagaagaaggagagggaggaggaggagcagagaagaaggagaggaagaagaagaagaag aaggaggaggaggaggaggaggaagaagaagaaggctga >gi568815579f:5355492_5556778|GENSCAN_predicted_peptide_2|111_aa MPGRLLLLEGYTSQGSRFSHCGDVETAGNLMSQAWVQILTLLSDLGDASVKGDEADATDI SSQIFSPPSKVFSGTFMGNLPQNLLEPLEFLNSQSSYGSPDSCLAGDFRDV >gi568815579f:5355492_5556778|GENSCAN_predicted_CDS_2|336_bp atgcctggccggctgctgcttcttgagggatacacttctcaagggagtagattcagccac tgtggtgatgtggaaacagcagggaacttgatgtcacaggcctgggttcaaatcctcaca ttgctgagtgaccttggggatgcgtctgtaaaaggggatgaggcagatgccacagatata tcatcacagatcttctcccctccgagcaaggttttctctggtactttcatgggcaattta ccacagaacctattggaaccccttgagtttcttaattcacagagcagctatgggagtccg gattcctgcctagctggagatttcagagatgtgtag >gi568815579f:5355492_5556778|GENSCAN_predicted_peptide_3|174_aa MVVCAYLTVEPGPNSVVPYPQHPVWASDCACSSIEWIFTAVLGCGGFYLPHFTQGKLRLK DMEPSPPPLVGASRMSVDHARSLEFCGRNAAVSGSSTNGGSSGAGPCNPSRVPGARALLP PTGRPQPMAAPLDEKRGEKRIEEKRIEETKIEEMRIKRRERERREEEKMQSDAI >gi568815579f:5355492_5556778|GENSCAN_predicted_CDS_3|525_bp atggtggtgtgcgcctacctcaccgtggagcctgggcccaactcggtggtgccgtatccc cagcaccccgtatgggcaagtgattgtgcttgttcctcgattgaatggatcttcacagca gtcctgggatgtggaggcttttatctaccccatttcacacagggaaaactgaggctcaaa gacatggagccctcccccccaccacttgtgggtgcttcgaggatgtccgtggaccatgcg cgctccctggaattctgtgggaggaatgctgcggtctcgggcagctccacaaacggagga tcctccggggctggaccctgcaacccctcccgggtcccgggagctcgcgccctgctcccg cccactggacgcccgcagccaatggccgcgccgcttgatgagaaaagaggagagaagaga attgaagaaaagagaattgaagagacaaaaattgaagagatgagaattaagagaagagaa agagagagaagagaagaagagaagatgcagtcagatgctatttag >gi568815579f:5355492_5556778|GENSCAN_predicted_peptide_4|359_aa MGWPRPGRALVAVKALLVLSLLQVPAQAVVRAVLEDNSSSVDFADLPALFGVPLAPEGIR GYLMEVKPANACHPIEAPRLGNRSLGAIVLIRRYDCTFDLKVLNAQRAGFEAAIVHNVHS DDLVSMTHVYEDLRGQIAIPSVFVSEAASQDLRVILGCNKSAHALLLPDDPPCHDLGCHP VLTVSWVLGCTLALVVSAFFVLNHLWLWAQACCSHRRPVKTSTCQKAQVRTFTWHNDLCA ICLDEYEEGDQLKILPCSHTYHCKCIDPWFSQAPRRSCPVCKQSVAATEDSFDSTTYSFR DEDPSLPGHRPPIWAIQVQLRSRRLELLGRASPHCHCSTTSLEAEYTTVSSAPPEAPGQ >gi568815579f:5355492_5556778|GENSCAN_predicted_CDS_4|1080_bp atggggtggccacggccgggccgagccctcgtggcagtcaaagccttgctggtcttgtcg ctgctccaggtgcccgcgcaggcagtggtacgggccgtgctggaagacaactcgagctcg gtggactttgcggatctgccggcgctgttcggcgtccccctggcccccgagggcatacgg ggctacctgatggaggtcaagccagccaacgcgtgccatcccatcgaggccccgcgactg ggcaaccgctctctgggcgccatcgtgctgatccgccgctacgactgcaccttcgacctc aaggtgctgaacgcccagcgcgccggcttcgaggcggccatcgtgcacaacgtccactcc gacgacctcgtgagcatgacccacgtctacgaggacttgaggggccagatcgccatcccc tcagtgttcgtgagcgaggccgcctcgcaggacctgcgggtcatcctgggctgcaacaag tcggcccacgcgctgctcctgcccgacgacccaccgtgccacgacctgggctgtcacccc gtgctgaccgtgtcctgggtgctgggctgtaccctggccctggtcgtatcagccttcttt gtcctgaaccacctgtggctctgggcccaggcctgctgcagccacagacggccggtgaag acgtctacctgccagaaggcccaggtccgcaccttcacgtggcacaacgacctgtgtgcc atctgcctggatgagtatgaggagggcgaccaactcaagatcctgccctgctcccacacc taccactgcaaatgcattgacccctggttctcccaagccccccggcgctcctgccccgtg tgcaaacagtcggtggccgccacagaagacagctttgactccaccacctacagcttcagg gacgaggacccctccctaccgggccaccggccccccatctgggccattcaagtccagcta cgctcccggaggctggagctgctgggccgcgccagtccccactgccactgcagcaccacg tccctggaggcagagtataccactgtctcctcagcccctcctgaggcccctggtcagtaa >gi568815579f:5355492_5556778|GENSCAN_predicted_peptide_5|280_aa MDGQMDESLGNKLRGCRRRRRRRRRRRRREEEEEEEERRGRRTKSSREQGHLNLEADLSL VTLRCRWPQDALQCRSVFLHQEAGERPGVGGIKDAEAEVGKLTPGKGQRIPGLGSSQLPD PLFRNCTAVLPVGMEQPVRVKAQVLPEAYQALHDLPQPLPALSSSLSPPHSLCSSHMGLL VVPPTPQLALPLPAAVQKFVWRTDASQPPCEGAAVDTNSFLLLGQRLPPSPDPLRQSKTL SKEEEEEEEEEEEEEEEEEEEEEEEEEERRRRKRRKEVYV >gi568815579f:5355492_5556778|GENSCAN_predicted_CDS_5|843_bp atggatgggcagatggatgaaagcctgggtaacaaactaagaggctgtcgaagaagaagg agaagaagaagaagaaggagaagaagagaagaggaagaggaagaagaagagagaagagga agaagaacaaagagcagccgcgagcaaggccacttgaacttggaagcagatctttctctg gtgaccttgagatgtcggtggccccaggacgctctgcagtgtcgctcagtctttttgcac caggaagctggagagcgacccggagtgggagggataaaggacgccgaggcggaggtgggg aaacttacccctggaaaagggcagagaattcctggcttgggtagtagccagcttcctgac ccccttttcaggaactgcactgctgtgctacctgtgggcatggaacagccagtcagggtc aaagcccaagtcctccccgaggcctaccaggccctgcacgacctgccccagcccctccct gccctctcctcctccttgtctccccctcactcactctgctccagccacatgggcctcctc gttgttcctccaacaccccagctcgctctgcccctccctgcagcagttcagaaattcgtc tggcgcacagatgccagccagcccccttgcgaaggtgcagccgtggacaccaactccttc cttctgctggggcaaaggctccccccttccccagaccccctcaggcagagcaagactctg tctaaagaagaggaagaagaagaggaagaagaagaagaggaagaggaagaagaagaagaa gaagaagaagaagaagaagaagaaagaaggagaaggaaaagaagaaaggaagtctatgtc taa >gi568815579f:5355492_5556778|GENSCAN_predicted_peptide_6|72_aa MMRAAPRQEGALPAFPALSLHGCAIPGHLCPSLGLRGQESAIGHSGTVAVVPAFRPPPAT VMLMGTHQSSLQ >gi568815579f:5355492_5556778|GENSCAN_predicted_CDS_6|219_bp atgatgcgtgctgccccccgccaggaaggagccctgcctgccttcccagcactgtcactg cacggctgtgcgatcccgggacatctctgcccctctctgggccttagagggcaagagagc gctatcgggcactcgggcaccgtagctgtggttccagcattccggccgccaccagccact gtcatgctgatgggaacccatcagtccagtctgcagtga >gi568815579f:5355492_5556778|GENSCAN_predicted_peptide_7|104_aa MVYGASEAIGQRQSSAAKPRRSGKESVREPWARVPGALGVAARIKWCEDQRQTPYLQADL SPLDAERCQMIPDCLILPHGHSTLWWSVQRLQHFRNANAINVGN >gi568815579f:5355492_5556778|GENSCAN_predicted_CDS_7|315_bp atggtctacggggcttccgaggcgatcgggcagcgtcagtcttcagccgctaagccgaga agatctgggaaggagtcagtcagagagccttgggccagagttccaggggctctgggagtg gctgccaggataaaatggtgtgaggatcagcggcagactccctatctccaggcagatctg tctcccttagatgctgaaagatgccagatgattccggattgcttgatcctgccacatggg cacagcactctctggtggtcggtgcaaaggctgcagcattttagaaatgcaaatgccatc aacgtgggcaattaa