GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:45:27 Sequence gi568815589f:88901201_89102334 : 201134 bp : 43.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1071 1110 40 -2.66 1.01 Sngl + 6792 7130 339 2 0 82 37 257 0.990 16.03 1.02 PlyA + 7677 7682 6 1.05 2.00 Prom + 8211 8250 40 -4.96 2.01 Sngl + 8677 9738 1062 0 0 66 37 287 0.861 18.56 2.02 PlyA + 9931 9936 6 1.05 3.05 PlyA - 11223 11218 6 1.05 3.04 Term - 18408 18200 209 1 2 0 42 155 0.209 -0.40 3.03 Intr - 22296 22136 161 2 2 53 72 95 0.377 4.03 3.02 Intr - 24900 24775 126 2 0 77 11 110 0.065 1.99 3.01 Init - 35711 35557 155 2 2 60 63 144 0.456 8.56 3.00 Prom - 43238 43199 40 -3.26 4.03 PlyA - 43916 43911 6 1.05 4.02 Term - 53175 53070 106 2 1 67 54 112 0.227 3.58 4.01 Init - 70141 69921 221 1 2 95 34 161 0.214 9.60 4.00 Prom - 80278 80239 40 -3.66 5.00 Prom + 84777 84816 40 -3.86 5.01 Init + 90462 90495 34 2 1 73 78 49 0.034 2.48 5.02 Term + 90652 90788 137 2 2 53 42 167 0.187 6.78 5.03 PlyA + 92134 92139 6 -0.45 6.00 Prom + 93334 93373 40 -5.16 6.01 Init + 93426 93581 156 2 0 73 92 130 0.497 10.22 6.02 Term + 100031 101137 1107 1 0 70 43 1826 0.216 168.37 6.03 PlyA + 102586 102591 6 1.05 7.12 PlyA - 104593 104588 6 1.05 7.11 Term - 112375 112247 129 1 0 106 43 82 0.529 3.68 7.10 Intr - 123593 123510 84 2 0 54 86 48 0.004 1.22 7.09 Intr - 137088 136793 296 1 2 30 101 332 0.817 25.43 7.08 Intr - 138580 138317 264 2 0 -81 40 520 0.660 27.78 7.07 Intr - 138785 138586 200 1 2 -37 77 391 0.558 24.49 7.06 Intr - 139119 138838 282 2 0 39 -4 432 0.106 25.63 7.05 Intr - 140984 140816 169 0 1 35 -45 202 0.085 0.60 7.04 Intr - 144633 144534 100 0 1 37 100 55 0.454 1.28 7.03 Intr - 145794 145644 151 2 1 78 111 47 0.852 6.16 7.02 Intr - 150963 150837 127 1 1 70 109 121 0.926 12.14 7.01 Init - 151933 151885 49 1 1 68 54 35 0.107 -2.56 7.00 Prom - 152848 152809 40 -8.56 8.00 Prom + 153208 153247 40 -1.36 8.01 Sngl + 158055 158252 198 2 0 63 48 250 0.533 13.67 8.02 PlyA + 160485 160490 6 1.05 9.08 PlyA - 161945 161940 6 -0.45 9.07 Term - 163265 163081 185 0 2 29 55 140 0.919 2.51 9.06 Intr - 164380 164329 52 1 1 101 99 34 0.924 4.28 9.05 Intr - 170052 169999 54 0 0 50 131 50 0.581 4.68 9.04 Intr - 174028 173909 120 1 0 70 88 109 0.861 9.79 9.03 Intr - 176703 176640 64 2 1 87 79 92 0.788 6.92 9.02 Intr - 180723 180682 42 2 0 104 72 30 0.354 0.36 9.01 Init - 182591 182539 53 2 2 70 70 59 0.362 2.93 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 39386 39230 157 1 1 86 46 164 0.996 9.41 S.002 Init - 42758 42739 20 2 2 105 89 10 0.982 2.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_1|112_aa MGRNQNRKAKNSKNQSASAPPKDRSSLPAKEQSWMENDFDELTEVGFRRLVITNFSELKE DVRTHRKEAKNLEKRLEEWLTRINSIDKTLNDLMELKTMARELRDICTSFSS >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_1|339_bp atggggagaaaccagaacagaaaagctaaaaattctaaaaaccagagtgcctctgctcct ccaaaggatcgcagctccttgccagcaaaggaacaaagttggatggagaatgactttgac gagttgacagaagtaggtttcagaagattggtaataactaacttctcggaactaaaggag gatgttcgaacccatcgcaaggaagctaaaaaccttgaaaaaagattagaagaatggcta actagaataaacagcatagacaagaccttaaatgacctgatggagctgaaaaccatggca cgagaactgcgtgacatatgcacaagcttcagtagctga >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_2|353_aa MKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKLIALNTHKRKQERCKIETLTSQLKELE KQEQTHSKASRRQEIAKIRAELKEIETQKTLQKINESRSWFFERINKIDRLLARLIKKKR EKNQIDTIKNDKGDITIDPTEIQTTIRDYYKHLYANKLENLEEMDKFPDTYALPRLNQKE VESLNRLVTGSEIEAIINSLPTKRSPGPEGFTAEFYQRYKEELVPFLLKLFQSIEKEGML PNSFYEACIILIPKPGRDTTKKENFRPISLMNIDAKILSKILANRIQQHTKKLIHHDQVG FIPGMQGRFNIPKSINVFHHINRTKDKKPHDYLNRCRKVLQQNSTALHAKNSQ >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_2|1062_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacatttaaagcagtgtgtagagggaaacttatagcactaaatacccac aagagaaagcaggaaagatgtaaaattgaaaccctaacatcacaattaaaagaactagag aagcaagagcaaacacattctaaagctagcagaaggcaagaaatagctaagatcagagca gaactgaaggagatagagacacaaaaaacccttcaaaaaatcaatgaatccaggagctgg ttttttgaaaggatcaacaaaatagatagactgctagcacgactaataaagaagaaaaga gagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccattgatcccaca gaaatacaaactaccatcagagactactataaacacctctatgcaaataaactagaaaat ctagaagaaatggataaattcccggacacatacgcactcccaagactaaaccagaaagaa gttgaatccctgaatagactagtaacaggctctgaaattgaggcaataattaatagccta ccaaccaaaagaagtccaggaccagaaggattcacagccgaattctaccagaggtacaaa gaggagctggtaccattccttctgaaactattccaatcaatagaaaaagagggaatgctc cctaactcattttatgaggcctgcatcatcctgataccaaagcctggcagagacacaaca aaaaaagagaattttagaccaatatccctgatgaacatcgatgcaaaaatcctcagtaaa atactggcaaaccgaatccagcagcacaccaaaaagcttatccaccacgatcaagttggc ttcatccctgggatgcaaggccggttcaacatacccaaatcaataaacgtattccatcat ataaacagaacaaaggacaaaaaaccacatgattatctcaatagatgcagaaaagtcctt caacaaaattcaacagcccttcatgctaaaaactctcaataa >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_3|216_aa MPDDVLVDGNLDHVVEEVPARFLLHEAAIFLFRYTIDEMRVTKSSSYSGEGKKDAGKGCG VDLQCFQEYCCHQEAMAFGSVALQGVRGIWAAGGQLRREKQKVVKYLKEKRRGACLLRAL CEQMRNTLKHMPGNTWRKNGFINQSLAGRDWNSLEGSEEDRKMWESLELPRHLLNGFDKN ADNDVDNEIQAEVVLDGDKELVGNWSKDDSCYVLAK >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_3|651_bp atgccagatgacgtccttgttgatggaaacctggaccacgtggttgaggaagtgcctgcc agattcctcctccatgaagctgctatttttctctttcgttacacaattgatgagatgaga gtcaccaagtccagctcatactcaggggaggggaagaaagatgctgggaaaggctgtggg gttgatttacaatgtttccaggagtactgctgtcaccaggaagccatggcctttggcagc gtggccctgcagggtgtgaggggcatctgggcagcaggcggacagctgaggagagaaaag cagaaagtggtgaagtatctgaaggagaagaggagaggagcatgcctgcttcgagctctg tgtgagcagatgaggaacacacttaaacacatgcctgggaacacctggaggaaaaatggc ttcatcaaccagagtcttgcaggcagagattggaacagtttggagggctcagaagaagac aggaaaatgtgggaaagtttggaacttcctagacacttgttgaatggttttgacaaaaat gctgataatgatgtggacaatgaaatccaggctgaggtggtcttagatggagataaggaa cttgttgggaactggagtaaagatgactcttgttatgttttagcaaagtga >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_4|108_aa MAEGRARAEKPEKSQRAGAARGPEEEAEKPVKTKTVSSSNGGESFSRSTEKGQLKELQTS QRSLQRSPHWIRHSVLTSGLLSASQSLVFLHVVEEAESPEYVVCYPTA >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_4|327_bp atggcggaggggagggcgagagcggagaagcctgaaaagtcacagcgagctggagcagcc agaggacctgaagaggaagcagaaaaacctgtgaaaacgaagaccgtttcttctagtaat ggaggggaaagtttcagtcgcagcactgagaagggtcagctgaaggagctgcagacctcc caacggagtctacaaagatctccacattggattcgccatagcgtcctcacctctggactc ctgtcagcttctcagagccttgtcttcctgcatgtggttgaggaagctgaatcaccagaa tatgtggtgtgctacccgacagcctga >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_5|56_aa MPVAPAPSADGEPLQEQGGGLFHRTRSVYNGLELNTWMKVERLFVEKFHQSFSLDN >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_5|171_bp atgccggtggccccagcgccctcagccgacggagagccgctgcaggaacagggaggaggc cttttccaccgcacccggagcgtttacaacgggctggagctgaatacctggatgaaagtg gagaggctgttcgtggagaagttccatcagtcgttttccttggacaattaa >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_6|420_aa MVFTEADARGRGLSFLGHSLHIWGSLDYAQIAHNVPALFVMEKDLGKSHREAPVRGNETL REHYQYVGKLAGRLKEASEGSTLTTVLFLVICSFIVLENLMVLIAIWKNNKFHNRMYFFI GNLALCDLLAGIAYKVNILMSGKKTFSLSPTVWFLREGSMFVALGASTCSLLAIAIERHL TMIKMRPYDANKRHRVFLLIGMCWLIAFTLGALPILGWNCLHNLPDCSTILPLYSKKYIA FCISIFTAILVTIVILYARIYFLVKSSSRKVANHNNSERSMALLRTVVIVVSVFIACWSP LFILFLIDVACRVQACPILFKAQWFIVLAVLNSAMNPVIYTLASKEMRRAFFRLVCNCLV RGRGARASPIQPALDPSRSKSSSSNNSSHSPKVKEDLPHTAPSSCIMDKNAALQNGIFCN >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_6|1263_bp atggtgttcacggaggcagatgctcggggccggggcctgagtttcctggggcacagcctc catatctggggttccctggactatgcccagatagctcataatgtcccagccttatttgtg atggagaaggacttggggaaaagccaccgggaagcaccggtgcgggggaacgagaccctg cgggagcattaccagtacgtggggaagttggcgggcaggctgaaggaggcctccgagggc agcacgctcaccaccgtgctcttcttggtcatctgcagcttcatcgtcttggagaacctg atggttttgattgccatctggaaaaacaataaatttcacaaccgcatgtactttttcatt ggcaacctggctctctgcgacctgctggccggcatcgcttacaaggtcaacattctgatg tctggcaagaagacgttcagcctgtctcccacggtctggttcctcagggagggcagtatg ttcgtggcccttggggcgtccacctgcagcttactggccatcgccatcgagcggcacttg acaatgatcaaaatgaggccttacgacgccaacaagaggcaccgcgtcttcctcctgatc gggatgtgctggctcattgccttcacgctgggcgccctgcccattctgggctggaactgc ctgcacaatctccctgactgctctaccatcctgcccctctactccaagaagtacattgcc ttctgcatcagcatcttcacggccatcctggtgaccatcgtgatcctctacgcacgcatc tacttcctggtgaagtccagcagccgtaaggtggccaaccacaacaactcggagcggtcc atggcactgctgcggaccgtggtgattgtggtgagcgtgttcatcgcctgctggtcccca ctcttcatcctcttcctcattgatgtggcctgcagggtgcaggcgtgccccatcctcttc aaggctcagtggttcatcgtgttggctgtgctcaactccgccatgaacccggtcatctac acgctggccagcaaggagatgcggcgggccttcttccgtctggtctgcaactgcctggtc aggggacggggggcccgcgcctcacccatccagcctgcgctcgacccaagcagaagtaaa tcaagcagcagcaacaatagcagccactctccgaaggtcaaggaagacctgccccacaca gccccctcatcctgcatcatggacaagaacgcagcacttcagaatgggatcttctgcaac tga >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_7|616_aa MATALWLLQEGHVDAYACHILECCDGLAQDVIGSIGQAFELRFKQYLQCPTKIPALHDRM QSLDEPWTEEEGDGSDHPYYNSIPSKMPPPGGFLDTRLKPRPHAPDTAQFAGKEQTYYQG RHLGDTFGEDWQQTPLRQGERQGSSDIYSTPEGKLHVAPTGEAPTYVNTQQIPPQAWPAA VSSAESSPRKDLFDMSGFLRSKYSGADVMMVVMMVLLLMVILMVVVVVMVVVMMMMVMML LMVVVLIVMMMVMMMVVMLLLMVLVVVMMVVMMLMMVLVAKVVVVMVIMVVRCADGADGG DATAAAVTAAAPVAAGDSDGDDDGYAGDNDGDDDDDDNGDDGDNGDDPDDADGNGGGDNG DSDDSVDGGDAIAAAAVAGSDSDGGDNDGDDDGYDGNNGDDGDNGDDDGDDGGDADGGGG ANKSECGDGGDVGDDADDVDDVDDSKGEPFEDALKNQPLGPVLSKAASVECISPVSPRAP DAKMLEELQAETWYQGEMSRKEAEGLLEKDGDFLVRKSTTNPGSFVLTGMHNGQAKHLLL VDPEGTEALLMSIPCSESFWDAITSSVRLNCAVWIRTKDRVFDSISHLINHHLESSLPIV SAGSELCLQQPVERKQ >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_7|1851_bp atggcgacagccctgtggctccttcaggagggacatgtggacgcttatgcttgtcacatt ttggaatgctgtgatgggctggcccaggatgtcatcggctccatcggacaagcctttgag ctccggtttaagcaatatttacagtgtcctaccaagattcccgctctccatgatcgaatg cagagtctggatgagccatggacggaagaggagggagatggctcagaccacccatactac aacagcatcccaagcaagatgcctcctccagggggctttcttgatactagactgaaaccc agaccccatgctcctgacacagcccagtttgcaggaaaagagcagacttattaccaggga agacacttaggagacacttttggcgaagactggcagcaaacacctttaaggcaaggtgag aggcaagggtcctcggacatctacagcacgccagaagggaaactgcacgtggcccccacg ggagaagcacccacctacgtcaacactcagcagatcccaccacaggcctggccggctgcg gtcagcagtgctgagagcagcccaaggaaagacctctttgacatgagtgggtttcttaga tctaaatatagcggtgctgatgtgatgatggtggtgatgatggtgctgctgctgatggta attttgatggtggtggtggtggtgatggtggtggtgatgatgatgatggtgatgatgctg ctgatggtggtggtgttgatagtgatgatgatggtgatgatgatggtggtgatgctgctg ctgatggtgctggtggtggtgatgatggtggtgatgatgctgatgatggtgctggtggca aaggtggtggtggtgatggtgataatggtggtgagatgtgctgatggtgctgatggtggt gatgctactgctgctgctgttactgctgccgctcctgttgctgctggtgacagtgatggt gatgatgatggttatgctggtgacaatgatggtgatgatgatgatgatgacaatggggat gatggtgataatggtgatgatcctgatgatgctgatggtaatggtggtggtgataatggg gatagtgatgatagtgttgatggtggtgatgctattgctgctgctgctgttgctggtagt gacagtgatggtggtgacaatgatggtgatgatgatggttatgatggtaacaatggggat gatggtgataatggtgatgacgatggcgacgatggcggtgatgctgatggtggtggtggt gccaataaaagtgaatgtggggatggtggcgatgttggggatgatgctgacgatgttgat gatgtagatgatagcaaaggtgaaccttttgaagatgctctcaagaaccagcccttgggg cccgtgttaagcaaggcagcctccgtggagtgcatcagccctgtgtcacctagagcccca gatgccaagatgctggaggaactgcaagccgagacttggtaccaaggagagatgagcagg aaggaggcagaggggctgctggagaaagacggagacttcctggtcaggaagagcaccacc aacccgggctcctttgtcctcacgggcatgcacaatggccaggccaagcacctgctgctc gtggacccagaaggcacggaggctctgctgatgtccattccctgctcagaatccttctgg gatgccatcacctccagtgtaaggttaaactgtgcagtgtggatccggacaaaggacaga gtctttgacagtatcagccacctcatcaaccaccacctagaaagcagcctgcccattgtc tctgcagggagtgagctgtgtctccagcagccagtggagaggaagcagtga >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_8|65_aa MVVEDIVEDVVEDMVEDGGGGRCGGWWWRTVVEDVVEDAVEDVVEDAVEDDGGGFSGGHG GGQWC >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_8|198_bp atggtggtggaggacattgtagaagatgtggtggaggacatggtggaggacggtggtgga ggacgttgtggaggatggtggtggaggacggtggtggaggatgtagtggaggatgcggtg gaggacgtagtggaggatgcagtagaggatgatggtggaggatttagtggtggacatggt ggaggacagtggtgttag >gi568815589f:88901201_89102334|GENSCAN_predicted_peptide_9|189_aa MEKMTHEKKREAVPKMKDHPYLTKGSGAAAEREAISRVCEAVPGAKGAFKKRKPPSKMLS SILGKSNLQFAGMSISLTISTASLNLRTPDSKQIIANHHMRSISFASGGDPDTTDYVAYV AKDPVNRRGANQEYFLPTIQFLSSDMGIRIDSTCPVAASPGSLRFACGDTVKTNARPVMQ KKISTIFGA >gi568815589f:88901201_89102334|GENSCAN_predicted_CDS_9|570_bp atggagaaaatgacccatgagaaaaagagagaggccgtccccaagatgaaagatcatccc tacctcaccaaggggtcaggagctgcagctgaaagggaagccatcagccgcgtctgtgaa gctgtgcctggtgcgaagggagccttcaagaagagaaagcctccaagcaaaatgctgtcc agcatcttgggaaagagcaacctccagtttgcgggaatgagcatctctctgaccatctcc acggccagtctgaacctgcgaactccggactccaaacagatcatagcgaatcaccacatg cggtccatctccttcgcctctgggggagacccggacacaactgactatgttgcatatgtg gctaaggaccctgttaatcgcagaggggcaaaccaagagtactttctgcccacaatacag ttcctgtccagcgacatgggaatcaggatcgattccacatgtcccgtggctgcctcacca ggctctctcaggtttgcatgtggtgatactgtgaagacaaatgccagacctgtgatgcag aaaaagatctccaccatttttggtgcttga