GENSCAN 1.0	Date run: 7-Nov-116	Time: 16:04:08

Sequence gi568815593f:134014950_134246300 : 231351 bp : 47.33% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3182   3208   27  1  0   70   81    57 0.307   0.91
 1.02 Intr +   4897   5006  110  0  2   95   76    65 0.299   5.08
 1.03 Intr +  12385  12430   46  1  1  107   75    19 0.558   0.91
 1.04 Term +  16142  16240   99  1  0   79   45    96 0.893   2.53
 1.05 PlyA +  17404  17409    6                               1.05

 2.00 Prom +  17780  17819   40                              -5.76
 2.01 Init +  26513  26595   83  1  2   74   68    75 0.057   4.56
 2.02 Intr +  26638  26833  196  1  1   80   97    15 0.033   1.02
 2.03 Intr +  38504  38650  147  1  0   67   34    72 0.044   0.13
 2.04 Intr +  53664  53713   50  2  2   95   43    89 0.097   2.58
 2.05 Intr +  57559  57727  169  1  1   26   82   125 0.036   5.65
 2.06 Intr +  61365  61436   72  2  0   89   82    37 0.031   2.80
 2.07 Intr +  68638  68778  141  0  0   30   37   104 0.008   0.05
 2.08 Intr +  77954  78178  225  1  0   53   70    88 0.021   1.68
 2.09 Intr +  89324  89419   96  1  0   80   80    14 0.023   0.01
 2.10 Intr +  98711  98847  137  1  2   37   24   129 0.018   0.77
 2.11 Intr +  99203  99404  202  2  1   43  103    70 0.046   3.29
 2.12 Intr +  99928 100206  279  0  0   42   83   527 0.853  45.27
 2.13 Intr + 100372 100438   67  0  1  120  113    50 0.873   9.18
 2.14 Intr + 100960 101084  125  2  2   80  119   140 0.740  16.60
 2.15 Intr + 107432 107523   92  1  2   73   89    25 0.091  -0.11
 2.16 Intr + 107763 107907  145  0  1   43   75    75 0.354   1.98
 2.17 Intr + 114855 114951   97  2  1   79   94    -3 0.349  -1.02
 2.18 Intr + 117571 117791  221  2  2  121   33    76 0.503   3.42
 2.19 Intr + 123110 123215  106  1  1  102   82    54 0.583   5.99
 2.20 Intr + 124002 124089   88  1  1  113  105     8 0.961   4.03
 2.21 Intr + 127236 127355  120  0  0  120   76   115 0.994  13.11
 2.22 Intr + 127772 127934  163  2  1   80   76   299 0.999  27.88
 2.23 Intr + 128044 128151  108  0  0  119   83   174 0.964  20.48
 2.24 Intr + 128643 128691   49  2  1  129   94    53 0.938   8.25
 2.25 Intr + 131421 131536  116  1  2  114   92     1 0.329   3.27
 2.26 Intr + 135461 135522   62  1  2   22   74    53 0.161  -5.17
 2.27 Term + 136796 136979  184  2  1   79   44   106 0.464   2.32
 2.28 PlyA + 137234 137239    6                               1.05

 3.13 PlyA - 137906 137901    6                               1.05
 3.12 Term - 142819 142784   36  1  0   85   44    54 0.902  -1.96
 3.11 Intr - 143646 143506  141  0  0   50   96   212 0.994  18.75
 3.10 Intr - 146181 146038  144  0  0   42  119   144 0.971  13.38
 3.09 Intr - 152294 152221   74  0  2   29  116    64 0.401   2.43
 3.08 Intr - 159073 158977   97  1  1  101  115    67 0.857  10.28
 3.07 Intr - 166278 166219   60  0  0   82  108     4 0.171   0.73
 3.06 Intr - 171420 171336   85  2  1   88   58    16 0.008  -1.58
 3.05 Intr - 184255 184137  119  1  2  108   78    46 0.876   4.86
 3.04 Intr - 187072 186899  174  1  0   82   91    78 0.994   7.64
 3.03 Intr - 191182 190973  210  1  0   89   86    72 0.941   6.21
 3.02 Intr - 210946 210748  199  0  1  -35   50   451 0.080  28.35
 3.01 Intr - 211120 210973  148  2  1   81    7   250 0.810  15.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  65364  65211  154  0  1   58   79   144 0.877  10.15
S.002 Init +  99237  99404  168  2  0   77  103   107 0.824   8.88
S.003 Term - 182895 182823   73  2  1   73   39    73 0.809  -1.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:134014950_134246300|GENSCAN_predicted_peptide_1|93_aa
MVRASARLLPPPESGEDLLPHWPHTNTSSSVLVARNDFLSDMGNDRSPSDPLLHDQFPSS
KVVLDDHLVWGSLTLEEAICSPFQLIKKELNPT

>gi568815593f:134014950_134246300|GENSCAN_predicted_CDS_1|282_bp
atggtgcgggcatctgctcggcttctgcctcctcctgaatctggagaggacttgcttcct
cactggccccacaccaacacctcctcctctgtcctggttgccaggaacgactttctctct
gacatgggcaatgacaggtctccctcagacccactgcttcatgaccagttcccgagcagc
aaggtggtcctggatgaccacctggtgtggggatcattaaccttggaagaagccatttgc
agtccattccagctgatcaagaaagaactaaatccaacatag

>gi568815593f:134014950_134246300|GENSCAN_predicted_peptide_2|1179_aa
MANVLAEYKVLIQWSQDQLLREPWTSALGDGKDGGTELVGLISPDSNSEKRRPCLTSHSK
GPWSHAWDLGLLPSTQGALHGPMRSPQLQASLLGSCEPCQGQAPGRRILQEEALSAACME
SDPEDLTPGQLLWEEAWGETEQDFLIVFGVKKFHYNVARWLMFMDSTLGGKDQEFCELQG
DTALLWAETPDKCPLGLTISLQLDHPQVQKCPSHQISAGSSHRVEAFLSPPDSQVTFCKR
YVKELWNVQAPDFQASAGRGTPLHRCLHYEPSTGKDLPKATMCQARRPAELLEFPCLVVH
SFDQGLPKTPAAKASFKLMHNHPSMGIKVICGFMTLSHKSTSNLLQLETAAFWPLASLHV
LATTSPKKGGDLLTLSGCRGAGGCPQWVGIPERMQGLSQAKRAVEARVDADPGAIQMRWN
PGTFPRSANLDVSAWIERGRPRRKSLIGTGMEDGARAATAPRSSNAAGSLGYEHSLKQPQ
ALGSPQFKAQGALEPGHDTVHTTAAMRPAPRRAERTMPQLDSGGGGAGGGDDLGAPDELL
AFQDEGEEQDDKSRDSAAGPERDLAELKSSLVNESEGAAGGAGIPGVPGAGAGARGEAEA
LGREHAAQRLFPDKLPEPLEDGLKAPECTSGMYKETVYSAFNLLMHYPPPSGAGQHPQPQ
PPLALPGSTKPVSTAAHGNIHTSTATSSATLRLNHTVTPLTPCDWCFLLLCTNKALVEET
GVLSGHRGPHPQWYSAQILLLQAPLSDVKARRWVDWPHAPSRALCNCGNSLVPAGLRPEE
TATFFAGKVLWGSSIWGFSESTVRGEGVSLLAVARAVSVHTACEQTAAVYRENADCVYFS
PEPAGSWGHKANQPPHGVPQLSLYEHFNSPHPTPAPADISQKQVHRPLQTPDLSGFYSLT
SGSMGQLPHTVSWFTHPSLMLGSGVPGHPAAIPHPAIVPPSGKQELQPFDRNLKTQAESK
AEKEAKKPTIKKPLNAFMLYMKEMRAKVIAECTLKESAAINQILGRRWHALSREEQAKYY
ELARKERQLHMQLYPGWSARDNYGKKKRRSREKHQESTTELLTSPAEPAPTSPGLSTALS
LPTPGPPQAPRSTLQSTQRGRKGISSCIRFSENSTEQKGMLLNTGTQTRGSQQYTRLQGN
KYIINELAIYHSNYVPHWKCVKQEHLRQHFSATTNDHLH

>gi568815593f:134014950_134246300|GENSCAN_predicted_CDS_2|3540_bp
atggccaacgtcctagcagaatataaggtgcttatccaatggtcccaggaccagctgctc
agggagccctggacatcagcgcttggggatgggaaagatggtgggaccgagctggtgggc
ctcatttcaccagactcaaactcagagaaacgcaggccttgcctaacatcacacagcaag
gggccttggagccatgcctgggacttgggcctcctgccttccacacagggcgctttgcat
ggccccatgaggagcccccagctccaggcatccctgttgggctcctgtgagccgtgccag
gggcaggcgccaggaaggaggatcctgcaggaggaagcactgagcgcagcctgcatggag
tcagacccggaggacctcaccccagggcagctgctctgggaggaagcctggggggaaaca
gaacaggattttctcattgtctttggtgtcaagaagtttcactacaacgtggctaggtgg
ctgatgttcatggacagcacactggggggaaaggaccaggagttctgtgaactgcagggt
gacacagctctgctctgggcagaaacaccggacaaatgcccattgggtctgaccatcagc
ctgcaactggaccatcctcaggtgcagaagtgtccaagccaccaaatctctgctggatcc
tcacatcgtgtggaggccttcctgagtcccccagactcacaggtcactttctgcaagcgc
tacgtcaaagagctctggaatgtccaggctccagacttccaggcctcagctggccgtggt
acccctctgcacagatgtctccactatgagccttcaactggcaaggacttaccaaaagca
actatgtgccaggctaggagaccagcggagctgctagaatttccttgtctggtggttcac
agctttgaccaaggtttgccaaagacacctgcagcaaaggcctctttcaagctaatgcac
aatcatccttccatggggataaaggtgatttgtggattcatgaccctgagtcataaatct
accagcaatctgctgcagttggaaacagcagctttctggcccttggcaagcctgcatgta
cttgccactacaagccccaagaagggaggtgacttgctcacactcagtgggtgcaggggt
gctggcggctgcccacagtgggtgggcatccctgagcggatgcaggggctgagccaggcc
aagcgcgccgtggaggcccgcgtggacgcagacccgggtgccatacagatgcgctggaat
ccaggcactttcccgcgctccgcaaatctagatgtttcagcctggatcgagcgagggcgg
cctagaaggaagtccctgattggcacagggatggaggatggggcaagagccgcaacagcg
ccgcggagttccaacgctgccggttccctggggtacgagcacagcctcaagcagcctcaa
gccctaggaagcccccagttcaaagcacagggcgcattggagcctgggcacgatacagtt
cacaccacggctgcgatgcgccccgcgccccggcgggcggagcgcaccatgccgcagctg
gactccggcgggggcggcgcgggcggcggcgacgacctcggcgcgccggacgagctgctg
gccttccaggatgaaggcgaggagcaggacgacaagagccgcgacagcgccgccggtccc
gagcgcgacctggccgagctcaagtcgtcgctcgtgaacgagtccgagggcgcggccggc
ggcgcagggatcccgggggtcccgggggccggcgccggggcccgcggcgaggccgaggct
ctcgggcgggaacacgctgcgcagagactcttcccggacaaacttccagagcccctggag
gacggcctgaaggccccggagtgcaccagcggcatgtacaaagagaccgtctactccgcc
ttcaatctgctcatgcattacccacccccctcgggagcagggcagcacccccagccgcag
cccccgctggctctaccaggttccaccaagccagtttccacggcagcacacgggaatatt
cataccagcactgccacctcctcagctactctgagactcaaccatacagtcacacccctc
acaccttgtgactggtgcttcctgctgctgtgtacaaacaaggctctggtggaggaaaca
ggtgttttatcaggtcacaggggcccccacccccaatggtacagtgcacagatcctcctg
ctgcaggcccctttatctgatgtcaaggccaggcggtgggtggactggccacacgcacct
agcagagctctgtgcaactgtggcaacagccttgtgccagcaggcttgaggccagaggaa
acagcaactttcttcgctgggaaagtgttgtggggctcaagcatttgggggttttcagag
tcaaccgtcagaggcgagggagtttctctgctggcggtggcaagggctgtcagcgtgcac
acagcctgcgagcagaccgctgcggtctacagggaaaatgccgactgtgtttatttttca
cctgagcctgcaggaagttgggggcacaaggccaatcagcccccccacggtgtcccccaa
ctctctctctacgaacatttcaacagcccacatcccacccctgcacctgcggacatcagc
cagaagcaagttcacaggcctctgcagacccctgacctctctggcttctactccctgacc
tcaggcagcatggggcagctcccccacactgtgagctggttcacccacccatccttgatg
ctaggttctggtgtacctggtcacccagcagccatcccccacccggccattgtgcccccc
tcagggaagcaggagctgcagcccttcgaccgcaacctgaagacacaagcagagtccaag
gcagagaaggaggccaagaagccaaccatcaagaagcccctcaatgccttcatgctgtac
atgaaggagatgagagccaaggtcattgcagagtgcacacttaaggagagcgctgccatc
aaccagatcctgggccgcaggtggcacgcgctgtcgcgagaagagcaggccaagtactat
gagctggcccgcaaggagaggcagctgcacatgcagctatacccaggctggtcagcgcgg
gacaactacgggaagaagaagaggcggtcgagggaaaagcaccaagaatccaccacagaa
ctgcttactagccctgcggagccggcacctacatccccaggtctctccactgctctcagc
ctcccaaccccagggcccccacaggccccccgcagcaccctgcagagcacacagagagga
agaaagggaatcagttcctgcatccgtttctcagaaaacagcacagaacagaagggaatg
ctgctcaatactggcacacagaccagaggtagccagcagtacacaagactgcaaggcaac
aagtacattatcaatgaattagccatctatcactcaaactatgtcccgcattggaagtgt
gtcaagcaagagcaccttcgacaacacttttcggccaccactaatgatcatctccactga

>gi568815593f:134014950_134246300|GENSCAN_predicted_peptide_3|495_aa
XSRALEPQRAEEEAQRPTAEYCGESQRASASLNSRQKYTRNRRRRVRVGPAERQPAGAGG
IMDEKVFTKELDQWIEQLNECKQLSESQVKSLCEKVSCSTAADSRRGAEPAEEKAAAKEI
LTKESNVQEVRCPVTVCGDVHGQFHDLMELFRIGGKSPDTNYLFMGDYVDRGYYSVETVT
LLVALKVRYRERITILRGNHESRQITQVYGFYDECLRKYGNANVWKYFTDLFDYLPLTAL
VDGQGYNWCHDRNVVTIFSAPNYCYRCGNQAAIMELDDTLKYSLLLCQFFPSSKYGNDPR
LSTHTTFSLLFQRLFPFDSRNIIWPLPGKFSQMPSIKLQSSDGEIFEVDVEIAKQSVTIK
TMLEDLGMDDEGDDDPVPLPNVNAAILKKVIQWCTHHKDDPPPPEDDENKEKRTDDIPVW
DQEFLKVDQGTLFELILAANYLDIKGLLDVTCKTVANMIKGKTPEEIRKTFNIKNDFTEE
EEAQVRKENQWCEEK

>gi568815593f:134014950_134246300|GENSCAN_predicted_CDS_3|1488_bp
nagagccgagctctggagcctcagcgagcggaggaggaggcgcagcggccgacggccgag
tactgcggtgagagccagcgggccagcgccagcctcaacagccgccagaagtacacgagg
aaccggcggcggcgtgtgcgtgtaggccccgcggagcggcagccggctggggcgggtggc
atcatggacgagaaggtgttcaccaaggagctggaccagtggatcgagcagctgaacgag
tgcaagcagctgtccgagtcccaggtcaagagcctctgcgagaaggtgagctgtagtacg
gctgcggacagccgccgcggggccgagcccgccgaggaaaaggcggccgctaaagaaatc
ctgacaaaagaatccaacgtgcaagaggttcgatgtccagttactgtctgtggagatgtg
catgggcaatttcatgatctcatggaactgtttagaattggtggcaaatcaccagataca
aattacttgtttatgggagattatgttgacagaggatattattcagttgaaacagttaca
ctgcttgtagctcttaaggttcgttaccgtgaacgcatcaccattcttcgagggaatcat
gagagcagacagatcacacaagtttatggtttctatgatgaatgtttaagaaaatatgga
aatgcaaatgtttggaaatattttacagatctttttgactatcttcctctcactgccttg
gtggatgggcagggatataactggtgccatgaccggaatgtagtaacgattttcagtgct
ccaaactattgttatcgttgtggtaaccaagctgcaatcatggaacttgacgatactcta
aaatactctttactcctttgccagttcttcccatcctctaaatatggaaatgacccaagg
cttagtactcacacgactttttctctcctcttccagcgcttatttccatttgacagcaga
aatataatctggccacttcctggcaaattcagtcaaatgccttcaattaagttgcagagt
tctgatggagagatatttgaagttgatgtggaaattgccaaacaatctgtgactattaag
accatgttggaagatttgggaatggatgatgaaggagatgatgacccagttcctctacca
aatgtgaatgcagcaatattaaaaaaggtcattcagtggtgcacccaccacaaggatgac
cctcctcctcctgaagatgatgagaacaaagaaaagcgaacagatgatatccctgtttgg
gaccaagaattcctgaaagttgaccaaggaacactttttgaactcattctggctgcaaac
tacttagacatcaaaggtttgcttgatgttacatgcaagactgttgccaatatgatcaag
gggaaaactcctgaggagattcgcaagaccttcaatatcaaaaatgactttactgaagag
gaggaagcccaggtacgcaaagagaaccagtggtgtgaagagaagtga