GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:37:59 Sequence gi568815593f:10342168_10564944 : 222777 bp : 44.31% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11732 11829 98 1 2 91 64 93 0.045 4.98 1.02 Intr + 31413 31506 94 0 1 111 43 91 0.035 6.87 1.03 Intr + 33041 33418 378 1 0 -8 49 256 0.006 7.26 1.04 Intr + 35631 35727 97 2 1 59 115 32 0.193 2.58 1.05 Intr + 39633 39776 144 1 0 74 84 24 0.238 0.85 1.06 Intr + 44827 44899 73 2 1 83 92 60 0.819 4.36 1.07 Intr + 48165 48333 169 0 1 96 78 -9 0.714 -1.15 1.08 Intr + 49375 49564 190 0 1 60 57 258 0.796 19.06 1.09 Intr + 58617 58675 59 1 2 83 94 10 0.670 -0.30 1.10 Intr + 61240 61374 135 0 0 114 107 21 0.575 7.36 1.11 Intr + 64935 65035 101 2 2 101 111 -15 0.163 1.01 1.12 Intr + 69166 69370 205 1 1 53 73 75 0.129 1.70 1.13 Intr + 72266 72335 70 1 1 101 80 22 0.105 1.45 1.14 Intr + 73321 73502 182 2 2 72 80 58 0.104 2.99 1.15 Intr + 75103 75237 135 0 0 99 99 -11 0.093 1.86 1.16 Intr + 81568 81657 90 0 0 67 115 3 0.673 1.09 1.17 Intr + 84223 84355 133 0 1 93 84 93 0.998 9.62 1.18 Intr + 87726 87861 136 1 1 104 108 96 0.989 12.83 1.19 Term + 91427 91517 91 2 1 59 39 122 0.956 1.59 1.20 PlyA + 92640 92645 6 1.05 2.00 Prom + 96579 96618 40 -5.56 2.01 Init + 100001 100131 131 1 2 81 86 216 0.559 20.32 2.02 Intr + 106093 106216 124 1 1 52 80 55 0.734 1.69 2.03 Intr + 107785 107946 162 0 0 66 63 87 0.218 4.17 2.04 Intr + 119017 119192 176 0 2 55 92 183 0.325 14.14 2.05 Term + 138429 138615 187 0 1 67 54 182 0.785 9.66 2.06 PlyA + 143419 143424 6 1.05 3.00 Prom + 150116 150155 40 -5.86 3.01 Init + 153715 153727 13 0 1 114 96 1 0.714 4.07 3.02 Intr + 160542 160758 217 1 1 90 66 101 0.302 5.66 3.03 Term + 182624 182798 175 2 1 51 55 125 0.012 2.73 3.04 PlyA + 183125 183130 6 1.05 4.04 PlyA - 185282 185277 6 1.05 4.03 Term - 196574 196398 177 2 0 125 49 47 0.490 2.19 4.02 Intr - 198067 197906 162 1 0 58 22 109 0.100 1.47 4.01 Init - 206974 206819 156 1 0 47 15 166 0.107 5.23 4.00 Prom - 208737 208698 40 -4.36 5.00 Prom + 212037 212076 40 -4.96 5.01 Init + 222301 222666 366 0 0 100 98 698 0.971 67.02 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 75106 75237 132 0 0 63 99 117 0.840 8.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:10342168_10564944|GENSCAN_predicted_peptide_1|859_aa MDTAEEGKSATRGARALASAPGFVPRPGPRRSRQQCASRTVFAIPQAIVNGRPVTMFLEG SVPRLEFRVGMGLVGLHSEQPAGPAGPGSERLGTRASGCRGCTGSPSSAGPPALCSISCR ALAAFPLGRARDLQPAMPEPPTHSMGSCAARASPTSTTLCSTAPSPIDHPRAEECQRMAR DWQAAPPAARTYVECVGQKEHLRNRFIILVYVLAVLSLSIKNVYSPDMPSRLPIQDIFAG LVTSIGTAIRYWFHYTLVAFAWLGVVPLTACRIYKCLFTGSVSSLLTLPLDMLSTENLLA DCLQGCFVVTCTLCAFISLVWLREQIVHGGAPIWLEHAAPPFNAAGHHQNEAPAGGNGAE NVAADQPANPPAENAVVGENPDAQDDQAEEEEEDNEEEDDAGVEDAADANNGAQAFCPYH IGHFSLVGLGFEEHEMFDATLKDRELSFQSAPGTTMFLHWLVGMVYVFYFASFILLLREI VFGSIVLLMLWLPIRIIKSVLPNFLPYNVMLYRDLHSYLLGDQEENENSANQQVNNNQHA RNNNAIPVVGEGLHAAHQAILQQGGPVGFQPYRRPLNFPLRIFLLIVFMCITLLIASLIC LTLPVFAGRWLMSFWTGTAKIHELYTAACGLYVCWLTIRAVTVMVAWMPQGRRVIFQKVK EWSLMIMKTLIVAVLLAGVVPLLLGLLFELVIVAPLRVPLDQTPLFYPWQDWALGVLHAK IIAAITLMGPQWWLKTVIEQVYANGIRNIDLHYIVRKLAAPVISVLLLSLCVPYVIASGV VPLLGVTAEMQNLVHRRIYPFLLMVVVLMAILSFQVRQFKRLYEHIKNDKYLVGQRLVNY ERKSGKQGSSPPPPQSSQE >gi568815593f:10342168_10564944|GENSCAN_predicted_CDS_1|2580_bp atggacaccgcggaggaaggtaagtcggcgacgcgcggcgcccgagcccttgcgtcggcg cccgggttcgtaccccggccaggtccgcggcgctcgaggcagcaatgtgccagcaggact gtctttgccataccccaggccatagtcaatggccgacccgtcacgatgttcctggagggc agcgtcccacggctggagttccgggtgggcatgggcttggtgggcctgcactcggagcag ccggccggccctgccggcccaggcagtgagagacttggcacccgggccagcggctgcaga gggtgtactgggtcccccagcagtgccggcccaccggcgctgtgctcgatttcttgccgg gccttagctgccttcccgctgggcagggctcgggacctgcagcccgccatgcctgagcct cccacccactccatgggctcctgtgcggcccgagcctccccgacgagcaccaccctctgc tccacggcgcccagtcccatcgaccacccaagggctgaggagtgccagcgcatggcgcgg gactggcaggcagctccacctgcagcccggacatatgtagagtgtgtcggtcagaaggaa cacctgagaaaccgctttatcatccttgtgtatgtactggcagtattaagtttatccatc aagaatgtttattctccagatatgccttcacggcttccaattcaagacatatttgctgga ctggttacaagtattggcactgcaatacgatattggtttcattatacacttgtggccttt gcatggttgggagttgttcctcttacagcatgccgcatctacaagtgcttgtttactggc tccgtgagctcactactgacgctgccattagatatgctgtcaacggaaaatttgttggca gattgtttgcagggttgttttgtggtgacgtgcacactgtgtgcattcatcagcctggtg tggttgagagagcagatagtccatgggggagcaccaatttggttggagcatgctgcccca ccgttcaatgctgcggggcatcaccaaaatgaggctccagcaggaggaaatggtgcagaa aatgttgctgctgatcagcctgctaacccaccagctgagaacgcagtggtgggggaaaac cctgatgcccaggatgaccaggcagaagaggaggaggaggacaatgaggaggaagatgac gctggtgtggaggatgcggcagatgctaataacggagcccaggcattttgcccttaccat attggtcatttctcccttgttggtttgggatttgaagaacacgaaatgtttgatgctact ctgaaagatcgagaactgagctttcagtcggctccaggtactaccatgtttctgcattgg ctagtgggaatggtatatgtcttctactttgcctccttcattctactactgagagagatt gtctttggctccattgtcctcctgatgctttggcttcctatacgtataattaagagtgtg ctgcctaattttcttccatacaatgtcatgctctacagggatcttcattcttatttattg ggagaccaggaagaaaatgaaaacagtgcaaatcaacaagttaacaataatcagcatgct cgaaataacaacgctattcctgtggtgggagaaggccttcatgcagcccaccaagccata ctccagcagggagggcctgttggctttcagccttaccgccgacctttaaattttccactc aggatatttctgttgattgtcttcatgtgtataacattactgattgccagcctcatctgc cttactttaccagtatttgctggccgttggttaatgtcgttttggacggggactgccaaa atccatgagctctacacagctgcttgtggtctctatgtttgctggctaaccataagggct gtgacggtgatggtggcatggatgcctcagggacgcagagtgatcttccagaaggttaaa gagtggtctctcatgatcatgaagactttgatagttgcggtgctgttggctggagttgtc cctctccttctggggctcctgtttgagctggtcattgtggctcccctgagggttcccttg gatcagactcctcttttttatccatggcaggactgggcacttggagtcctgcatgccaaa atcattgcagctataacattgatgggtcctcagtggtggttgaaaactgtaattgaacag gtttacgcaaatggcatccggaacattgaccttcactatattgttcgtaaactggcagct cccgtgatctctgtgctgttgctttccctgtgtgtaccttatgtcatagcttctggtgtt gttcctttactaggtgttactgcggaaatgcaaaacttagtccatcggcggatttatcca tttttactgatggtcgtggtattgatggcaattttgtccttccaagtccgccagtttaag cgcctttatgaacatattaaaaatgacaagtaccttgtgggtcaacgactcgtgaactac gaacggaaatctggcaaacaaggctcatctccaccacctccacagtcatcccaagaataa >gi568815593f:10342168_10564944|GENSCAN_predicted_peptide_2|259_aa MPLPDTMFCAQQIHIPPELPDILKQFTKAAIRTQPADVLRWSAGYFSALSRGDPLPVKDR MEMPTATQKTDTGLTQGLLKVLHKQCHHKRYVELTDLEQKWKNLCLPKEKFKALLQLDPC ENKIKWINFLALGCSMLGGSLNTALKHLCEILTDDPEGGPARIPFKTFSYVYRYLARLDS DVSPLETESYLASLKENMHASECGLMLSVSSSVSGSSECGPRTRSISIPWESLGNAESWD PAQTQSESDFQAARVLIDV >gi568815593f:10342168_10564944|GENSCAN_predicted_CDS_2|780_bp atgccgcttcccgacaccatgttctgcgctcagcagatccacattcccccggagctgccg gacatcctgaagcaattcaccaaggctgccatccgcacccagccggccgacgtgctgcgg tggtccgcgggctatttttcagctctgtcgagaggagatccacttcctgtaaaggacaga atggaaatgcccacggcaacccagaaaacagacacaggcctgactcaaggactcctgaaa gttttgcacaagcagtgtcaccacaagcggtatgtggaattaacagatcttgagcagaag tggaagaacttgtgcctgccgaaggaaaaattcaaagcgctcttacaactggatccttgt gaaaacaaaatcaagtggataaactttttagcgcttggatgcagcatgcttggtgggtcc ttgaacactgcgctgaagcacctgtgcgagatcctcacggacgatccggagggcgggccc gctcgcatccccttcaagacgttttcctacgtttaccgctacttggccagattagactca gatgtgtctcccttggagacggaatcctaccttgcctctctaaaggaaaatatgcatgcc agcgagtgtggccttatgttgagtgtgtcctccagtgtgagtggctcctccgagtgtggt ccaaggacccgcagcatcagcatcccctgggagagccttggaaatgcagaatcttgggat ccagctcagacccagtcagaatcagatttccaggcagctcgtgtgctcattgatgtctga >gi568815593f:10342168_10564944|GENSCAN_predicted_peptide_3|134_aa MPGQASALFTPDGRQRWSQESSAHDCAPGGSTSVCTQKAHRRDVVACKALLPARLHGRLH RLLGNDSRSKDEGYSARGLAIAKPPHVAFPEDSTITDLTVIGPIRRQLWTPDGPGVEPHC WWLSSLAHFLPQHL >gi568815593f:10342168_10564944|GENSCAN_predicted_CDS_3|405_bp atgcccggccaggcttccgccttattcacccctgatgggaggcagaggtggagccaggag agctctgctcatgactgtgctccaggaggctccacctccgtgtgcacccagaaggcccac aggagagatgttgtagcctgcaaagcactactgcctgcacgactgcacggccggctccac aggctcctcggaaacgacagcaggagcaaagatgagggctacagtgccagggggctggcc attgccaagcctccccacgtggcctttccagaagacagtaccatcactgacctcactgtc attggccccattcgccggcagctctggacaccagatgggcctggcgtggagcctcattgc tggtggttaagcagtttagctcacttcttacctcagcatctttga >gi568815593f:10342168_10564944|GENSCAN_predicted_peptide_4|164_aa MPTRTLTQTHANMLTRTPTHADTHTNTCRHAHQHMPTRTLTHADTRANTCRHALATLNWV SRRCKASWNELEEACAIDQVSRICWYGLNVCLPANSCVEALTPSMMVAAYREHTKPRIKG CKGDAKPRLGGCREDTKHRLEGCREDAKPSLRGCREYAKPRLRG >gi568815593f:10342168_10564944|GENSCAN_predicted_CDS_4|495_bp atgccgacacgcacgctaacgcagacacatgctaacatgctgacacgcacaccaacacat gccgacacgcacaccaacacatgccgacacgcacaccaacacatgccgacacgcacgcta acacatgctgacacgcgtgctaacacatgccgacacgccttggcaaccctgaactgggta tcccgtcggtgcaaagcatcgtggaatgaactagaggaggcctgtgccatagatcaggtc tcccgtatctgctggtatggactgaatgtttgtctccccgccaattcttgtgttgaagcc ctaacccccagtatgatggtggctgcctacagagagcacaccaagccaaggatcaaaggc tgcaaaggggacgccaagcccaggctcgggggctgcagagaggacaccaagcataggctc gaaggctgcagagaggatgccaagcccagcctcagaggctgcagagaatacgccaagccc aggctcagaggctga >gi568815593f:10342168_10564944|GENSCAN_predicted_peptide_5|122_aa MVLLAGTGPEGGGARCMTPPPPSPPRGAQVEEDPADYEEFEDFSSLPDTRSIASDDSFYP FEDEEEHGVESAESVPEGVPESVPETATLLRAACANNVGLLRTLVRRGVSVEEAQETDRN GR >gi568815593f:10342168_10564944|GENSCAN_predicted_CDS_5|366_bp atggtgctgctggccgggaccgggccggagggcggcggggcgcgctgcatgaccccacca ccgccgtccccaccccggggcgcgcaggtcgaggaggaccccgctgactacgaagagttt gaggacttctcgagtctgccagacacccgcagcatcgcctcggacgactctttctaccct ttcgaggacgaggaggagcacggcgtcgagagcgcggagagcgtcccggagggcgtcccg gaaagcgtcccggagacggcgaccctcctgcgcgccgcctgcgccaacaacgtggggctg ctgcggacgctggtgcggcgcggggtgagcgtcgaggaggcgcaggagactgaccgcaac ggcagg