GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:37:14 Sequence gi568815585f:38249999_38460775 : 210777 bp : 35.60% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9388 9390 3 0 0 113 81 0 0.592 1.85 1.02 Intr + 15391 15830 440 0 2 31 17 403 0.025 18.89 1.03 Intr + 16383 16712 330 0 0 -22 86 233 0.013 5.92 1.04 Term + 23786 24284 499 2 1 26 48 195 0.469 2.31 1.05 PlyA + 24316 24321 6 1.05 2.00 Prom + 24915 24954 40 -6.35 2.01 Init + 25670 25802 133 1 1 78 81 46 0.020 3.25 2.02 Term + 39618 41152 1535 1 2 39 37 568 0.001 36.43 2.03 PlyA + 41326 41331 6 1.05 3.00 Prom + 53040 53079 40 -3.25 3.01 Init + 55990 56058 69 0 0 72 35 96 0.573 3.80 3.02 Intr + 99787 99923 137 0 2 -11 103 111 0.849 1.15 3.03 Intr + 100001 100057 57 2 0 153 65 88 0.896 10.08 3.04 Intr + 104241 104298 58 0 1 65 100 62 0.176 3.07 3.05 Intr + 108095 108134 40 1 1 65 95 30 0.052 -1.72 3.06 Term + 141505 141695 191 2 2 107 38 130 0.055 6.53 3.07 PlyA + 143741 143746 6 1.05 4.05 PlyA - 143799 143794 6 1.05 4.04 Term - 154162 153991 172 0 1 31 49 201 0.054 6.82 4.03 Intr - 177063 176942 122 0 2 76 103 92 0.206 7.87 4.02 Intr - 182483 182368 116 0 2 73 91 -18 0.120 -3.75 4.01 Intr - 185651 185476 176 1 2 75 100 168 0.652 15.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 15391 15834 444 0 0 31 41 378 0.961 21.65 S.002 Sngl - 38940 38524 417 0 0 59 46 278 0.849 16.75 S.003 Init - 160261 160139 123 1 0 84 111 70 0.875 9.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:38249999_38460775|GENSCAN_predicted_peptide_1|423_aa MESSSWQQVGALLGGSFQRKEQAAVFAVVQPPLVRSRRTRCGVDPYQTAADLQTRVLTEE KQTESNNKNNTNKKDSTKSPFKGHQPQRSKEDKSMKMEKNQHKSAENPKNQNASSAPNDH NTSPARAQNGAEAELDEVIEVGFRGWAWQADKIREKRMKRNEQNLRELWDYVKRLNLKMI EVPERNGENGTKVENTLQDIIQENFPNLARQANIQIQNIQRTAIRYSMKSSTRRHIIMRF FKVEMKQKMLTAAREKARVNDKNHMIISIDAEKAFNKIQHPFMLKTLNKLTTDGAYLKII RALYDKPIANIILNGQKLEAFPLKTGIRQGCPLSQLLFNIILEVLARAIRQEKEIKGIQR GRQKVKLSLFADDVIVYLENSIVSAPKLLKLISNFSKVSGYKINVQNSEAFLYIKNRQAE RQS >gi568815585f:38249999_38460775|GENSCAN_predicted_CDS_1|1272_bp atggagagctccagctggcagcaggttggtgcccttctgggaggaagcttccagaggaag gagcaggcagcagtctttgctgttgtgcagcccccactggtgagatccaggcgaacaagg tgtggagtagacccctaccaaactgcagcagacctgcagacaagggtactgacagaagaa aaacaaacagaaagcaacaacaaaaacaacaccaacaaaaaagactccacaaaatcccca ttcaaaggtcatcagcctcaaagatcaaaggaagataaatccatgaagatggagaaaaac caacacaaaagtgctgaaaatcccaaaaaccagaatgcctcttctgctccaaatgatcat aacacttctccagcaagggcacagaatggggctgaagctgagttggatgaagtgatagaa gtaggcttcagaggctgggcatggcaggcagacaagattagagaaaaaagaatgaaaagg aatgaacaaaacctccgagaactatgggactatgtaaagaggctgaacctcaaaatgatt gaagtacctgaaagaaatggggagaatggaaccaaggtggaaaacactcttcaggatatc atccaggaaaacttccctaacctagcaagacaggccaatattcaaattcagaatatccag agaactgcaataagatattccatgaaaagttctacccgaagacacataatcatgaggttc ttcaaggttgaaatgaagcaaaaaatgttaacggcagccagagagaaggcaagagttaat gacaaaaaccacatgattatctcaattgatgcagaaaaggccttcaacaaaattcaacat cctttcatgctaaaaactctcaataaactaactactgatggagcatatctcaaaataata agagctctttatgacaaacccatagccaatatcatattgaatggacaaaagctggaagca ttccctttgaaaactggcataagacaaggatgccctctctcacaactcctattcaacata atattggaagttctggccagggcaattaggcaagagaaagaaataaagggtattcaaaga ggaagacagaaagtcaaattgtctctgtttgcagatgatgtgattgtatatttagaaaac tccatcgtctcagcccccaaactccttaagctgataagcaacttcagcaaagtctcagga tacaaaatcaatgtgcaaaactcagaagcattcctgtacatcaaaaatagacaagcagag agacaatcatga >gi568815585f:38249999_38460775|GENSCAN_predicted_peptide_2|555_aa MEYHIAIKKNEIMSLAGTWVKLETVILSKLTQEQKTKHHMFSLKAPPHTYSKIDHILGSK ALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNHSTTWKLNNLLLNDYWVHNEMKAEIK MFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDNLTSQLKELEKQEQTH SKASRRQEITKITAELKEIETKKNPSKINESRSWFFERVNKIGRPLGRLIKKKREKNQID TIKNDKGDITTDPTEIQTTIREYYKHLYANKLEYLEEMDKFVDTHTLPRLNQEEVESLNR PIAGSEIVAIINRLPTKKSPGPDGITAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYE ASIILIPKPGRDTTKKENFRPISLMNIDAKILSKILANRIQQHIKKLIHHDRVGFIPGMQ GWFNIRKSINVIQHINRTNGKNHMIISIDEEKAFDKIQQPFMLKTVNKLGIDGTYFKIIR AIYDNPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARTIRQEKEIKGIQLG KEEVKLSLFVGDMII >gi568815585f:38249999_38460775|GENSCAN_predicted_CDS_2|1668_bp atggaataccatatagccataaaaaagaatgagatcatgtcccttgcagggacatgggtg aagctggaaaccgtcattctcagcaaactaacacaggaacagaaaaccaaacaccacatg ttctcacttaaagcaccaccccacacctattccaaaattgaccacatacttggaagtaaa gctctcctcagcaaatgtaaaagaacagaaattataacaaactgtctctcagaccacagt gcaatcaaactagaactcaggattaagaaactcactcaaaaccactcaactacatggaaa ctgaacaacctgctcctgaatgactactgggtacataacgaaatgaaggcagaaataaag atgttctttgaaaccaacgagaacaaagacacaacataccagaatctctgggacacattc aaagcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaagcaggaaaga tccaaaattgacaacctaacatcacaattaaaagaactagaaaagcaagagcaaacacat tcaaaagctagcagaaggcaagaaataactaaaattacagcagaactgaaggaaatagag acaaaaaaaaacccttcaaaaattaatgaatccaggagctggttttttgaaagggtcaac aaaattggtagaccactaggaagactaataaagaaaaaaagagagaagaatcaaatagac acaataaaaaatgataaaggggatatcaccactgatcccacagaaatacaaactaccatc agagaatactacaaacacctctatgcaaataaactagaatatctagaagaaatggataaa ttcgttgacacacacactctcccaagactaaaccaggaagaagttgaatctctgaataga ccaatagcaggatctgaaattgtggcaataatcaatcgcttaccaaccaaaaagagtcca ggaccagatggaatcacagccgaattctaccagaggtacaaggaggaactggtaccattc cttctgaaactattccaatcaatagaaaaagagggaatcctccctaactcattttatgag gccagcatcatcctgataccaaagccgggcagagacacaaccaaaaaagagaattttaga ccaatatccttgatgaacattgatgcaaaaatcctcagtaaaatactggcaaaccgaatc cagcagcacatcaaaaagcttatccaccatgatcgagtgggcttcatccctgggatgcaa ggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacagaaccaatggc aaaaaccacatgattatctcaatagatgaagaaaaggcctttgacaaaattcaacaaccc ttcatgctaaaaactgtcaataaattaggtattgatgggacgtatttcaaaataataaga gctatctatgacaaccccacagccaatatcatactgaatgggcaaaaactggaagcattc cctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaacatagtg ttggaagttctggccaggacaattaggcaggagaaggaaataaagggtattcaattagga aaagaggaagtcaaattgtccctgtttgtaggcgacatgattatatag >gi568815585f:38249999_38460775|GENSCAN_predicted_peptide_3|183_aa MVEGEGEAGLSHGSSSSMRESREQQLRRSPDCRGRRGREWSGAVPSTLEEVVLPPRSCRV FWIHSGTTMSKVSFKITLTSDPRLPYKVLSVPESTPFTAVLKFAAEEFKVPAATSAIITN GMGLTVSQTAVTVIALLGLAQKGYQALAGARVCLQRVLLCYLSSGLPAMNTSTCSGGGGS RVK >gi568815585f:38249999_38460775|GENSCAN_predicted_CDS_3|552_bp atggtggaaggtgaaggagaagcaggtctttcacatggcagcagcagcagcatgagagag agcagggagcaacaacttcggaggtccccagattgcagagggagacgtggacgtgagtgg agcggggcggtccccagcacactagaggaagtcgtgctacccccgcggagttgtcgtgtg ttctggattcattccggcaccaccatgtcgaaggtttcctttaagatcacgctgacgtcg gacccacggctgccgtacaaagtactcagtgttcctgaaagtacacctttcacagcagtc ttaaagtttgcagcagaagaatttaaagttcctgctgcaacaagtgcaattattaccaat gggatggggcttactgtgagccagactgcagtgactgttatcgctcttctgggtctagcc cagaagggctaccaggctctggctggtgctagagtatgtctgcaaagagtcctgttatgc tatttgtcttcaggtcttccagcgatgaatactagcacctgctctggtggaggtggcagc agagtgaagtag >gi568815585f:38249999_38460775|GENSCAN_predicted_peptide_4|195_aa XIFVAQEVESGSKKAKQNMSQGVNSFLGGMSLNELVMRRGAFLMKGERETIESDGLIMQG NAWFVQRQKKRRNEIKTFNRSRKRIELKTYVSRGDTSQMVSEVESKIELPASPRSTEGPS TGPAGPWCPLFSHSNCRSCDGFCSGAAIGVSSSLSNYTFSNFSSSSNSNSSSSNSSRWLL LSIRTSSSSTERVGS >gi568815585f:38249999_38460775|GENSCAN_predicted_CDS_4|588_bp nntatcttcgtggcccaagaagtagagagcggtagtaagaaggcaaaacagaacatgtca caaggggtcaatagctttcttgggggaatgagtctaaatgaactagtcatgagaagagga gctttcctaatgaaaggagaaagagaaaccatagagagtgatggcttaataatgcaaggc aatgcctggtttgttcaaaggcagaaaaagagaagaaatgaaataaaaacatttaataga agcagaaaaagaattgagttgaaaacatatgtaagcaggggagatacttctcagatggtg tcagaggtggaatccaaaatagagcttccagcaagccccaggagcactgagggaccaagc acgggacctgctgggccctggtgtccactgttctctcacagcaactgcaggtcgtgtgat ggcttctgctcaggtgcagcaattggtgtctccagtagcctcagcaattacactttcagt aacttcagcagcagcagcaatagcaatagcagcagcagcaacagcagccgatggctcctg ctgagcatcagaacctccagcagctcaacagagcgtgtgggctcctga