GENSCAN 1.0    Date run: 3-Nov-116    Time: 04:20:54

Sequence gi568815588r:117909001_118146163 : 237163 bp : 38.54% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  13654  13693   40                              -1.35
 1.01 Init +  16546  16649  104  0  2   49  100    63 0.017   3.59
 1.02 Intr +  27037  27143  107  1  2  108   32   130 0.198   8.34
 1.03 Term +  42643  42758  116  2  2   62   38   126 0.320   2.75
 1.04 PlyA +  47095  47100    6                               1.05

 2.00 Prom +  48462  48501   40                              -5.35
 2.01 Init +  52422  52684  263  2  2   60   43   204 0.920   9.69
 2.02 Intr +  60298  60470  173  1  2  114   97    94 0.934  11.56
 2.03 Intr +  63319  63516  198  2  0    3   77   136 0.289   2.40
 2.04 Term +  67112  67317  206  0  2   53   49   115 0.507   0.65
 2.05 PlyA +  67686  67691    6                               1.05

 3.04 PlyA -  67741  67736    6                               1.05
 3.03 Term -  78261  78156  106  2  1   69   53    61 0.626  -2.40
 3.02 Intr -  79370  79200  171  1  0   97   73   118 0.723   9.34
 3.01 Init -  80363  80212  152  2  2   63   98    67 0.945   4.78
 3.00 Prom -  97866  97827   40                              -4.85

 4.18 PlyA -  98237  98232    6                               1.05
 4.17 Term - 100225  99998  228  1  0   90   38   282 0.779  19.05
 4.16 Intr - 106110 106065   46  2  1   67  131    24 0.803   2.09
 4.15 Intr - 130440 129972  469  1  1   44  121   278 0.463  17.93
 4.14 Intr - 131565 131123  443  2  2   96  103   261 0.989  20.57
 4.13 Intr - 137192 136811  382  0  1   13   89   225 0.339   8.14
 4.12 Intr - 138228 138088  141  1  0   37   80    82 0.105   1.70
 4.11 Intr - 144294 144102  193  0  1   57   41   174 0.173   7.74
 4.10 Intr - 148645 148463  183  1  0   64   87    92 0.363   5.76
 4.09 Intr - 154067 153911  157  1  1  -45   92    97 0.023  -4.11
 4.08 Intr - 158081 157935  147  1  0  120   39    60 0.027   2.73
 4.07 Intr - 172803 172727   77  0  2  134   82    53 0.249   6.69
 4.06 Intr - 194845 194657  189  1  0   90   76   142 0.717  12.06
 4.05 Intr - 195058 194897  162  1  0   61   81   119 0.886   7.75
 4.04 Intr - 201154 201065   90  1  0   43   78    70 0.049   0.77
 4.03 Intr - 203343 203224  120  0  0   40   43   112 0.065   1.67
 4.02 Intr - 208312 208198  115  0  1   78   37    98 0.140   3.23
 4.01 Intr - 233672 233572  101  2  2  103   28    84 0.072   1.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 144294 144097  198  0  0   57   43   183 0.801   7.12
S.002 Sngl - 160054 159740  315  1  0  101   32   179 0.825   9.30
S.003 Init - 196570 196526   45  1  0   80   95    13 0.847   1.93

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:117909001_118146163|GENSCAN_predicted_peptide_1|108_aa
MAALYIVAPQSPCMSPSTPSHSLGHQKSPDLSRIQFRNTARENERESPEQTEMSMAGAQG
ANGDWKLKHWELKPKRMGTNNFNSHFLNHELERHLDDETVDAFTVPFG

>gi568815588r:117909001_118146163|GENSCAN_predicted_CDS_1|327_bp
atggccgcgctgtacatagttgctcctcagagcccgtgcatgtcccctagcaccccctcc
cactctctgggccaccagaaatccccagacctctccaggatccagtttaggaacacagct
cgagaaaatgaaagagagagtcctgagcagacagagatgagcatggcaggagctcaggga
gcaaatggagactggaagctgaaacactgggaactaaaacccaagagaatgggaacaaat
aactttaactcccactttttgaatcatgagcttgaaaggcacctggatgatgaaactgtg
gacgctttcacagtcccctttgggtaa

>gi568815588r:117909001_118146163|GENSCAN_predicted_peptide_2|279_aa
MGHYILMEHTIALNFSEVQVWGSSYLGQLLECDGLRASDHASACCTPRRAQLHFTGITHL
FMNNEVQTVPFLNTTLTLKELEIQHSVSNWSKYCACLSLSKEISHPATQGMCLFVTKDED
VILHNSGNAGPSTAPTWAKSNSSFKDTLASPVQALLQGLQALGLADSHFDEISLFQGPGS
ILSVQGHPISVQKCYLGSRRSGKAVRLASPKDSTLSSLSFHHRYCGCEFQVAFVSSHVSL
PPLRTCLMTEDQKTATLGLQVNGTHPLALSLGILGSAIQ

>gi568815588r:117909001_118146163|GENSCAN_predicted_CDS_2|840_bp
atgggccattatattttgatggaacacacaattgccttgaacttctcagaagttcaggtg
tggggatccagctacctggggcagttgctggaatgtgatggtctgcgggcttctgatcat
gcatctgcctgctgcacccctcgtagggcccagctccatttcacgggaataactcaccta
ttcatgaacaatgaggttcaaactgtacctttccttaacacaacattaactctcaaagag
ctggaaattcagcactcggtttcaaattggtcaaagtattgtgcctgtttgtcactaagt
aaagagatatcacatcccgcaactcagggaatgtgcctgtttgtcaccaaggatgaagat
gtcatactgcacaactcaggaaatgcaggaccttccacagctcccacctgggccaaaagt
aactcttcctttaaagacacactggccagtccagtccaggctctgctgcagggactgcag
gccctgggacttgctgattcccattttgatgaaattagcctctttcagggccctggtagc
atactctcagtccaaggacaccctatttctgtccagaaatgctatttggggagtaggcgt
tcagggaaagcggttagattggccagtcccaaagattcaacattgtcctctctcagtttt
catcatcgatactgtggatgtgagttccaggtagccttcgtttcatcacatgtatcacta
ccaccacttagaacatgcctaatgacagaagatcagaaaactgccacattaggacttcaa
gtgaatggaacacatcctctggcactttctttgggtatactgggctcggcaattcagtga

>gi568815588r:117909001_118146163|GENSCAN_predicted_peptide_3|142_aa
MGRQRCAQYAPYGPAILYFEVYPRQMLRQVPKGLVQVFHSRTWKQLGRVGRNLVLPDQVV
EARIGPFAEMGEARQEQGGANSEPPFHHDMFQMPITYPGGEIKQTALILELTLSALLVLK
PSNSIWNYTDRSPEIQLANLRS

>gi568815588r:117909001_118146163|GENSCAN_predicted_CDS_3|429_bp
atgggaaggcagcgctgtgcacagtacgcaccttatggcccagcaatcctgtatttcgag
gtataccccagacagatgcttagacaggttcctaaaggactggtacaagtttttcatagt
aggacctggaagcaactggggagagtgggcagaaatttggttctaccagatcaagtggtt
gaggcaaggattgggccttttgctgaaatgggagaggcaagacaggagcagggtggggca
aattcagagccccctttccaccacgacatgtttcagatgcccattacatatccaggtggt
gaaattaaacagacagcacttatactggaacttacactgtctgctctcctggttctcaag
ccttcaaactcaatctggaactacactgataggtctcctgagatccagcttgccaacctc
agatcctga

>gi568815588r:117909001_118146163|GENSCAN_predicted_peptide_4|1080_aa
VNLKENALVTQDFGPRGESLQIPSKESFYLEEEKLLEHRAQLSLKSTEREEIRKGGGGKL
MAKAIHMAPPPQDCSAKAARFSESEEAHLHTVSPTGAVLSLRTAKLNNHKDRMFHDRWPL
RALSCFSNGSPADPQQIPWVGGVLSLSSSRILPGLWLLTLQRTMDLQLPLSSVCSGPNLL
SISSSDLQLAMNFLAMFPWSLVLPLQDVAIPQDDHDSLCKVHLVQLPPDIDFETRTLMQV
VYLGGEGNRKREVEKTDREAVYIGQKRIGWMDGWMDGWMDGWTNRIGKTTLKFIWNQKRA
RIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWSKQEDKRRKWALRRHKTGTWYQKARS
RGKGEICYQAGSQLPENQTEIRAEGQLHSRELVPPTSLSKGKSYADLRALALSLHPTTGH
GARAAMICPASRKADASSQPWKLRLKATCEKYISVGRKEGRKEGKGGEGEGEEGGEGEEG
EGEGEGKGGERGKGEGKGGEGEEREGEYGYTGTGHPGHSEHRSPALKDALVRGVSGSLMT
SSHERSPALPRAHLPASAGNSGAEKQDRMMLSEQAQKWFPTHVQVTVLQAKDLKPKGKSG
TNDTYTIIQLGKEKYSTSVAEKTLEPVWKEEASFELPGLLIQGSPEKYILFLIVMHRSLV
GLDKFLGQVAINLNDIFEDKQRRKTEWFRLESKQGKRIKNRGEIKVNIQFMRNNMTASMF
DLSMKDKTRSPFAKLKDKMKGRKNDGTFSDTSSAIIPSTHMPDANSEFSSGEIQMKSKPK
KPFLLGPQRLSSAHSMSDLSGSHMSSEKLKAGTIGQTHLLGHQLDSFGTVPESGSLKSPH
RRTLSFDTSKMNQPDSIVDEGELCFGRQNDPFTNVTASLPQKFATLPRKKNPFEESSETW
DSSMNLFSKPIEIRKENKREKREKVSLFERVTGKKDSRRSDKLNNGGSDSPCDLKSPNAF
SENRQDYFDYESTNPFTAKFRASNIMPSSSFHMSPTSNEDLRKIPDSNPFDATAGYRSLT
YEEVLQELVKHKELLRRKDTHIRELEDYIDNLLVRVMEETPSILRVPYEPSRKAGKFSNS

>gi568815588r:117909001_118146163|GENSCAN_predicted_CDS_4|3243_bp
gtgaacttaaaagaaaatgccttggttacccaagattttggtccaagaggagaaagtctt
cagataccttctaaggaatctttctatctggaagaagagaagctattggaacacagagct
cagctgagcctgaaatctactgagagagaagaaattagaaaaggaggaggggggaagtta
atggcaaaagctattcacatggcaccacccccacaggactgctccgcaaaggctgcccgg
ttcagtgaaagtgaagaagcacaccttcacacagtctctcccacaggggcggtcctcagt
ttgaggacagccaaattgaataatcataaagacaggatgtttcatgatcgttggccttta
agggcgcttagttgtttcagcaatgggagccctgctgacccacagcagatcccctgggta
ggaggggttctgagcctctcatcctcgcgtatcttgcctggcctgtggctcctgactctt
caacggacaatggatcttcaactgcctctgtcatcagtgtgttctgggcccaatctgctg
tcaatcagcagctctgacctacaacttgcaatgaatttcttggctatgttcccttggtcc
ctggtcctgcccctgcaggatgttgctattccccaagatgaccatgattctctctgcaag
gtgcatcttgttcagcttcctccagacattgactttgaaacaaggactctaatgcaagta
gtttatttgggaggtgaaggaaacaggaaaagggaagtagaaaaaacagacagggaggct
gtttatattggtcagaagagaattggatggatggatggatggatggatggatggatggat
ggatggacaaatagaattggaaaaactactttaaagttcatatggaaccaaaaaagagcc
cgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtcaaaacaagaagacaaa
aggagaaagtgggcactgagaaggcacaaaacagggacctggtaccagaaggctaggtca
cgaggtaagggcgagatctgttatcaggctggaagccaactccctgaaaaccaaacagag
ataagagctgaaggacagctccattcacgtgaactggtcccgcccacatccctaagtaaa
gggaaatcctatgcagatctcagagccctagctctctccctccacccaaccacaggtcac
ggagccagagctgctatgatatgtcctgcctcaagaaaagctgatgcttcctctcaacca
tggaagttaaggctaaaagctacttgtgagaaatacatttctgttggaaggaaggaagga
aggaaggaaggaaaaggaggagaaggagaaggagaagaaggaggagaaggagaagaaggg
gaaggggaaggggaagggaagggaggggaaaggggaaaaggggaggggaaaggaggagag
ggagaggaaagggaaggagagtatggatacactggcactgggcatccaggacacagcgaa
cacagatcccctgccctcaaagacgccctagttaggggcgtctccgggtctttgatgacc
tcctctcacgagcggtccccagccctccccagggctcacctgccggcctctgcagggaac
tcgggggcagagaaacaggacaggatgatgctgtccgagcaagcccaaaagtggtttcca
acccacgtgcaggtcacagtgctccaagccaaagatctgaagccaaaaggcaaaagtggt
accaatgacacatacactataattcagctgggcaaggaaaagtactccacctctgtagct
gagaaaacccttgagccagtttggaaggaggaggcctctttcgagctacctggattgcta
attcagggaagtccagagaaatacattcttttccttatagttatgcacaggtccctggtg
ggtctggataaatttttagggcaggtggcaatcaatctcaatgacatctttgaggacaaa
caaagaaggaaaacagagtggtttagattagaatccaaacaaggaaaacgaatcaaaaac
aggggtgagataaaggtcaatattcagtttatgaggaacaatatgaccgcaagtatgttt
gacttatcaatgaaggacaaaaccagatctccttttgcaaagttaaaagataagatgaag
ggtagaaaaaatgatggaacattttctgatacgtcttctgcaatcattccaagtactcac
atgcccgatgccaatagtgaattttcaagtggtgaaatacagatgaaatccaaaccaaaa
aagccttttctcttgggtcctcagcgactctcgtcagcgcattcaatgtctgatttatct
gggtcccatatgtcttctgagaaactgaaggctggcaccataggtcaaacacatcttctc
ggacaccagttagattcctttggaacagttccagaaagtggaagtctcaaatctccacac
agaagaacattaagctttgatacttctaaaatgaaccaacctgacagcattgtggatgaa
ggtgaattgtgtttcggaagacaaaatgacccatttacaaatgtgactgcttcattaccc
caaaaatttgcaacactgccaaggaagaaaaatccatttgaagaaagcagcgaaacatgg
gacagcagcatgaatttattttcaaaaccaattgaaataagaaaagaaaataaaagagag
aaaagggagaaagttagcctgtttgaaagagtgactggaaaaaaagatagcagaagatct
gataaacttaacaatgggggatctgatagcccttgtgacttgaaatcacctaatgcattt
agtgaaaatcgccaggactattttgattatgagtcaaccaatccatttacagcaaaattc
agggcttcaaatataatgccatcttcaagttttcatatgagtccaacaagcaatgaagac
ctcaggaaaatcccggacagcaacccctttgatgccactgcagggtatcgtagtctgacc
tatgaagaggttctacaggagctggtgaaacacaaagaactccttaggaggaaagacacc
cacatccgggaactcgaggactacatcgacaacctccttgtaagggtaatggaagaaacg
cccagtattctcagagtgccgtatgaaccatccaggaaagctggcaaattctctaacagt
taa