GENSCAN 1.0    Date run: 3-Nov-116    Time: 22:24:02

Sequence gi568815593r:134058452_134274022 : 215571 bp : 46.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10162  10211   50  0  2   95   43    89 0.147   2.58
 1.02 Intr +  14057  14225  169  2  1   26   82   125 0.055   5.65
 1.03 Intr +  17863  17934   72  0  0   89   82    37 0.043   2.80
 1.04 Intr +  25136  25276  141  1  0   30   37   104 0.011   0.05
 1.05 Intr +  34452  34676  225  2  0   53   70    88 0.022   1.68
 1.06 Intr +  45822  45917   96  2  0   80   80    14 0.024   0.01
 1.07 Intr +  55209  55345  137  2  2   37   24   129 0.018   0.77
 1.08 Intr +  55701  55902  202  0  1   43  103    70 0.046   3.29
 1.09 Intr +  56426  56704  279  1  0   42   83   527 0.853  45.27
 1.10 Intr +  56870  56936   67  1  1  120  113    50 0.873   9.18
 1.11 Intr +  57458  57582  125  0  2   80  119   140 0.740  16.60
 1.12 Intr +  63930  64021   92  2  2   73   89    25 0.091  -0.11
 1.13 Intr +  64261  64405  145  1  1   43   75    75 0.354   1.98
 1.14 Intr +  71353  71449   97  0  1   79   94    -3 0.349  -1.02
 1.15 Intr +  74069  74289  221  0  2  121   33    76 0.503   3.42
 1.16 Intr +  79608  79713  106  2  1  102   82    54 0.583   5.99
 1.17 Intr +  80500  80587   88  2  1  113  105     8 0.961   4.03
 1.18 Intr +  83734  83853  120  1  0  120   76   115 0.994  13.11
 1.19 Intr +  84270  84432  163  0  1   80   76   299 0.999  27.88
 1.20 Intr +  84542  84649  108  1  0  119   83   174 0.964  20.48
 1.21 Intr +  85141  85189   49  0  1  129   94    53 0.938   8.25
 1.22 Intr +  87919  88034  116  2  2  114   92     1 0.329   3.27
 1.23 Intr +  91959  92020   62  2  2   22   74    53 0.161  -5.17
 1.24 Term +  93294  93477  184  0  1   79   44   106 0.464   2.32
 1.25 PlyA +  93732  93737    6                               1.05

 2.14 PlyA -  94404  94399    6                               1.05
 2.13 Term -  99317  99282   36  2  0   85   44    54 0.902  -1.96
 2.12 Intr - 100144 100004  141  1  0   50   96   212 0.994  18.75
 2.11 Intr - 102679 102536  144  1  0   42  119   144 0.971  13.38
 2.10 Intr - 108792 108719   74  1  2   29  116    64 0.401   2.43
 2.09 Intr - 115571 115475   97  2  1  101  115    67 0.857  10.28
 2.08 Intr - 122776 122717   60  1  0   82  108     4 0.171   0.73
 2.07 Intr - 127918 127834   85  0  1   88   58    16 0.008  -1.58
 2.06 Intr - 140753 140635  119  2  2  108   78    46 0.876   4.86
 2.05 Intr - 143570 143397  174  2  0   82   91    78 0.994   7.64
 2.04 Intr - 147680 147471  210  2  0   89   86    72 0.941   6.21
 2.03 Intr - 167444 167246  199  1  1  -35   50   451 0.080  28.35
 2.02 Intr - 167618 167471  148  0  1   81    7   250 0.808  15.49
 2.01 Init - 171105 171057   49  0  1   82   58    46 0.637   0.11
 2.00 Prom - 182459 182420   40                              -4.76

 3.00 Prom + 182873 182912   40                              -6.46
 3.01 Init + 185583 185599   17  2  2  110   80     9 0.127   1.75
 3.02 Intr + 188440 188506   67  1  1   74   79    47 0.089   1.31
 3.03 Term + 208956 209144  189  2  0   49   48   151 0.489   4.65
 3.04 PlyA + 209517 209522    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  21862  21709  154  1  1   58   79   144 0.863  10.15
S.002 Init +  55735  55902  168  0  0   77  103   107 0.824   8.88
S.003 Term - 139393 139321   73  0  1   73   39    73 0.809  -1.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:134058452_134274022|GENSCAN_predicted_peptide_1|1037_aa
DFLIVFGVKKFHYNVARWLMFMDSTLGGKDQEFCELQGDTALLWAETPDKCPLGLTISLQ
LDHPQVQKCPSHQISAGSSHRVEAFLSPPDSQVTFCKRYVKELWNVQAPDFQASAGRGTP
LHRCLHYEPSTGKDLPKATMCQARRPAELLEFPCLVVHSFDQGLPKTPAAKASFKLMHNH
PSMGIKVICGFMTLSHKSTSNLLQLETAAFWPLASLHVLATTSPKKGGDLLTLSGCRGAG
GCPQWVGIPERMQGLSQAKRAVEARVDADPGAIQMRWNPGTFPRSANLDVSAWIERGRPR
RKSLIGTGMEDGARAATAPRSSNAAGSLGYEHSLKQPQALGSPQFKAQGALEPGHDTVHT
TAAMRPAPRRAERTMPQLDSGGGGAGGGDDLGAPDELLAFQDEGEEQDDKSRDSAAGPER
DLAELKSSLVNESEGAAGGAGIPGVPGAGAGARGEAEALGREHAAQRLFPDKLPEPLEDG
LKAPECTSGMYKETVYSAFNLLMHYPPPSGAGQHPQPQPPLALPGSTKPVSTAAHGNIHT
STATSSATLRLNHTVTPLTPCDWCFLLLCTNKALVEETGVLSGHRGPHPQWYSAQILLLQ
APLSDVKARRWVDWPHAPSRALCNCGNSLVPAGLRPEETATFFAGKVLWGSSIWGFSEST
VRGEGVSLLAVARAVSVHTACEQTAAVYRENADCVYFSPEPAGSWGHKANQPPHGVPQLS
LYEHFNSPHPTPAPADISQKQVHRPLQTPDLSGFYSLTSGSMGQLPHTVSWFTHPSLMLG
SGVPGHPAAIPHPAIVPPSGKQELQPFDRNLKTQAESKAEKEAKKPTIKKPLNAFMLYMK
EMRAKVIAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDN
YGKKKRRSREKHQESTTELLTSPAEPAPTSPGLSTALSLPTPGPPQAPRSTLQSTQRGRK
GISSCIRFSENSTEQKGMLLNTGTQTRGSQQYTRLQGNKYIINELAIYHSNYVPHWKCVK
QEHLRQHFSATTNDHLH

>gi568815593r:134058452_134274022|GENSCAN_predicted_CDS_1|3114_bp
gattttctcattgtctttggtgtcaagaagtttcactacaacgtggctaggtggctgatg
ttcatggacagcacactggggggaaaggaccaggagttctgtgaactgcagggtgacaca
gctctgctctgggcagaaacaccggacaaatgcccattgggtctgaccatcagcctgcaa
ctggaccatcctcaggtgcagaagtgtccaagccaccaaatctctgctggatcctcacat
cgtgtggaggccttcctgagtcccccagactcacaggtcactttctgcaagcgctacgtc
aaagagctctggaatgtccaggctccagacttccaggcctcagctggccgtggtacccct
ctgcacagatgtctccactatgagccttcaactggcaaggacttaccaaaagcaactatg
tgccaggctaggagaccagcggagctgctagaatttccttgtctggtggttcacagcttt
gaccaaggtttgccaaagacacctgcagcaaaggcctctttcaagctaatgcacaatcat
ccttccatggggataaaggtgatttgtggattcatgaccctgagtcataaatctaccagc
aatctgctgcagttggaaacagcagctttctggcccttggcaagcctgcatgtacttgcc
actacaagccccaagaagggaggtgacttgctcacactcagtgggtgcaggggtgctggc
ggctgcccacagtgggtgggcatccctgagcggatgcaggggctgagccaggccaagcgc
gccgtggaggcccgcgtggacgcagacccgggtgccatacagatgcgctggaatccaggc
actttcccgcgctccgcaaatctagatgtttcagcctggatcgagcgagggcggcctaga
aggaagtccctgattggcacagggatggaggatggggcaagagccgcaacagcgccgcgg
agttccaacgctgccggttccctggggtacgagcacagcctcaagcagcctcaagcccta
ggaagcccccagttcaaagcacagggcgcattggagcctgggcacgatacagttcacacc
acggctgcgatgcgccccgcgccccggcgggcggagcgcaccatgccgcagctggactcc
ggcgggggcggcgcgggcggcggcgacgacctcggcgcgccggacgagctgctggccttc
caggatgaaggcgaggagcaggacgacaagagccgcgacagcgccgccggtcccgagcgc
gacctggccgagctcaagtcgtcgctcgtgaacgagtccgagggcgcggccggcggcgca
gggatcccgggggtcccgggggccggcgccggggcccgcggcgaggccgaggctctcggg
cgggaacacgctgcgcagagactcttcccggacaaacttccagagcccctggaggacggc
ctgaaggccccggagtgcaccagcggcatgtacaaagagaccgtctactccgccttcaat
ctgctcatgcattacccacccccctcgggagcagggcagcacccccagccgcagcccccg
ctggctctaccaggttccaccaagccagtttccacggcagcacacgggaatattcatacc
agcactgccacctcctcagctactctgagactcaaccatacagtcacacccctcacacct
tgtgactggtgcttcctgctgctgtgtacaaacaaggctctggtggaggaaacaggtgtt
ttatcaggtcacaggggcccccacccccaatggtacagtgcacagatcctcctgctgcag
gcccctttatctgatgtcaaggccaggcggtgggtggactggccacacgcacctagcaga
gctctgtgcaactgtggcaacagccttgtgccagcaggcttgaggccagaggaaacagca
actttcttcgctgggaaagtgttgtggggctcaagcatttgggggttttcagagtcaacc
gtcagaggcgagggagtttctctgctggcggtggcaagggctgtcagcgtgcacacagcc
tgcgagcagaccgctgcggtctacagggaaaatgccgactgtgtttatttttcacctgag
cctgcaggaagttgggggcacaaggccaatcagcccccccacggtgtcccccaactctct
ctctacgaacatttcaacagcccacatcccacccctgcacctgcggacatcagccagaag
caagttcacaggcctctgcagacccctgacctctctggcttctactccctgacctcaggc
agcatggggcagctcccccacactgtgagctggttcacccacccatccttgatgctaggt
tctggtgtacctggtcacccagcagccatcccccacccggccattgtgcccccctcaggg
aagcaggagctgcagcccttcgaccgcaacctgaagacacaagcagagtccaaggcagag
aaggaggccaagaagccaaccatcaagaagcccctcaatgccttcatgctgtacatgaag
gagatgagagccaaggtcattgcagagtgcacacttaaggagagcgctgccatcaaccag
atcctgggccgcaggtggcacgcgctgtcgcgagaagagcaggccaagtactatgagctg
gcccgcaaggagaggcagctgcacatgcagctatacccaggctggtcagcgcgggacaac
tacgggaagaagaagaggcggtcgagggaaaagcaccaagaatccaccacagaactgctt
actagccctgcggagccggcacctacatccccaggtctctccactgctctcagcctccca
accccagggcccccacaggccccccgcagcaccctgcagagcacacagagaggaagaaag
ggaatcagttcctgcatccgtttctcagaaaacagcacagaacagaagggaatgctgctc
aatactggcacacagaccagaggtagccagcagtacacaagactgcaaggcaacaagtac
attatcaatgaattagccatctatcactcaaactatgtcccgcattggaagtgtgtcaag
caagagcaccttcgacaacacttttcggccaccactaatgatcatctccactga

>gi568815593r:134058452_134274022|GENSCAN_predicted_peptide_2|511_aa
MVVCQVGQAGLELLTSESRALEPQRAEEEAQRPTAEYCGESQRASASLNSRQKYTRNRRR
RVRVGPAERQPAGAGGIMDEKVFTKELDQWIEQLNECKQLSESQVKSLCEKVSCSTAADS
RRGAEPAEEKAAAKEILTKESNVQEVRCPVTVCGDVHGQFHDLMELFRIGGKSPDTNYLF
MGDYVDRGYYSVETVTLLVALKVRYRERITILRGNHESRQITQVYGFYDECLRKYGNANV
WKYFTDLFDYLPLTALVDGQGYNWCHDRNVVTIFSAPNYCYRCGNQAAIMELDDTLKYSL
LLCQFFPSSKYGNDPRLSTHTTFSLLFQRLFPFDSRNIIWPLPGKFSQMPSIKLQSSDGE
IFEVDVEIAKQSVTIKTMLEDLGMDDEGDDDPVPLPNVNAAILKKVIQWCTHHKDDPPPP
EDDENKEKRTDDIPVWDQEFLKVDQGTLFELILAANYLDIKGLLDVTCKTVANMIKGKTP
EEIRKTFNIKNDFTEEEEAQVRKENQWCEEK

>gi568815593r:134058452_134274022|GENSCAN_predicted_CDS_2|1536_bp
atggtggtttgccaggttggccaggctggtctcgaactcctgacctcagagagccgagct
ctggagcctcagcgagcggaggaggaggcgcagcggccgacggccgagtactgcggtgag
agccagcgggccagcgccagcctcaacagccgccagaagtacacgaggaaccggcggcgg
cgtgtgcgtgtaggccccgcggagcggcagccggctggggcgggtggcatcatggacgag
aaggtgttcaccaaggagctggaccagtggatcgagcagctgaacgagtgcaagcagctg
tccgagtcccaggtcaagagcctctgcgagaaggtgagctgtagtacggctgcggacagc
cgccgcggggccgagcccgccgaggaaaaggcggccgctaaagaaatcctgacaaaagaa
tccaacgtgcaagaggttcgatgtccagttactgtctgtggagatgtgcatgggcaattt
catgatctcatggaactgtttagaattggtggcaaatcaccagatacaaattacttgttt
atgggagattatgttgacagaggatattattcagttgaaacagttacactgcttgtagct
cttaaggttcgttaccgtgaacgcatcaccattcttcgagggaatcatgagagcagacag
atcacacaagtttatggtttctatgatgaatgtttaagaaaatatggaaatgcaaatgtt
tggaaatattttacagatctttttgactatcttcctctcactgccttggtggatgggcag
ggatataactggtgccatgaccggaatgtagtaacgattttcagtgctccaaactattgt
tatcgttgtggtaaccaagctgcaatcatggaacttgacgatactctaaaatactcttta
ctcctttgccagttcttcccatcctctaaatatggaaatgacccaaggcttagtactcac
acgactttttctctcctcttccagcgcttatttccatttgacagcagaaatataatctgg
ccacttcctggcaaattcagtcaaatgccttcaattaagttgcagagttctgatggagag
atatttgaagttgatgtggaaattgccaaacaatctgtgactattaagaccatgttggaa
gatttgggaatggatgatgaaggagatgatgacccagttcctctaccaaatgtgaatgca
gcaatattaaaaaaggtcattcagtggtgcacccaccacaaggatgaccctcctcctcct
gaagatgatgagaacaaagaaaagcgaacagatgatatccctgtttgggaccaagaattc
ctgaaagttgaccaaggaacactttttgaactcattctggctgcaaactacttagacatc
aaaggtttgcttgatgttacatgcaagactgttgccaatatgatcaaggggaaaactcct
gaggagattcgcaagaccttcaatatcaaaaatgactttactgaagaggaggaagcccag
gtacgcaaagagaaccagtggtgtgaagagaagtga

>gi568815593r:134058452_134274022|GENSCAN_predicted_peptide_3|90_aa
MPGSTRNWKVRYGCSTLVVEQMCLPEAKPSWVTCMLLGGALPYRCRIPTQLKLIPDDMKE
QIYKLAKKGLTPSQVSVILRDAHVLHKYAL

>gi568815593r:134058452_134274022|GENSCAN_predicted_CDS_3|273_bp
atgcctggctccaccaggaactggaaagtcaggtatggctgcagcacactggtggtagag
cagatgtgcctgcctgaggccaagccatcatgggtcacatgcatgctccttggaggggct
ctgccctatcgctgcaggatccccactcagctgaagttgatacctgatgacatgaaggag
cagatttacaaactggccaaaaagggcctgactccctcacaagtcagtgtgatcctgaga
gatgcacatgtgttgcacaagtacgctttgtga