GENSCAN 1.0    Date run: 8-Nov-116    Time: 09:59:38

Sequence gi568815591r:22718000_22922779 : 204780 bp : 41.83% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    666    720   55  2  1   69   93    40 0.416   4.10
 1.02 Intr +   8307   8669  363  1  0   -4   27   232 0.076   2.13
 1.03 Intr +   9445   9635  191  2  2   93   90   216 0.706  20.68
 1.04 Intr +  10694  10807  114  1  0   85   66   105 0.896   7.72
 1.05 Intr +  11515  11661  147  0  0  102   98   112 0.932  13.11
 1.06 Term +  13407  13574  168  2  0   94   39   201 0.998  12.70
 1.07 PlyA +  13897  13902    6                               1.05

 2.00 Prom +  15834  15873   40                              -6.55
 2.01 Sngl +  23178  23582  405  2  0   91   38   147 0.746   6.03
 2.02 PlyA +  23908  23913    6                               1.05

 3.03 PlyA -  23953  23948    6                               1.05
 3.02 Term -  48822  48693  130  2  1  -23   37   186 0.074  -0.93
 3.01 Init -  55994  55651  344  2  2   75   75   362 0.148  30.35
 3.00 Prom -  71163  71124   40                              -5.25

 4.03 PlyA -  71566  71561    6                               1.05
 4.02 Term -  78064  77616  449  2  2  -17   47   260 0.481   5.79
 4.01 Init -  78548  78449  100  2  1  100  113    49 0.404   9.08
 4.00 Prom -  87650  87611   40                              -4.55

 5.00 Prom +  93257  93296   40                              -2.65
 5.01 Init +  97344  97420   77  2  2   67   53    21 0.175  -2.88
 5.02 Intr + 101938 102060  123  1  0   63   64   113 0.311   5.18
 5.03 Intr + 104101 104204  104  1  2   72  110    55 0.175   5.00
 5.04 Intr + 104650 104681   32  2  2   80   99    11 0.168  -1.67
 5.05 Intr + 104772 104852   81  2  0   53   69   123 0.159   5.82
 5.06 Term + 115536 115691  156  2  0  120   55    71 0.806   3.95
 5.07 PlyA + 116224 116229    6                               1.05

 6.05 PlyA - 116745 116740    6                               1.05
 6.04 Term - 126711 126541  171  0  0   27   54   118 0.245  -0.86
 6.03 Intr - 128647 128546  102  1  0   94   92    57 0.506   6.15
 6.02 Intr - 135854 135661  194  0  2   60   46   130 0.254   4.29
 6.01 Init - 147357 147087  271  0  1   68   14   254 0.715  13.18
 6.00 Prom - 147633 147594   40                             -10.25

 7.00 Prom + 149010 149049   40                             -11.64
 7.01 Init + 150207 150275   69  2  0   51   80    88 0.041   5.40
 7.02 Intr + 157030 157213  184  0  1   91    7   225 0.013  13.14
 7.03 Intr + 163769 164016  248  0  2  113   81   119 0.577   9.96
 7.04 Term + 164089 164391  303  0  0  -25   48   386 0.607  17.49
 7.05 PlyA + 164414 164419    6                               1.05

 8.04 PlyA - 165087 165082    6                               1.05
 8.03 Term - 170614 170531   84  1  0   87   42   137 0.993   5.67
 8.02 Intr - 173110 172936  175  0  1   82   94   151 0.720  14.12
 8.01 Init - 174060 174020   41  0  2   22  103    68 0.598   1.41
 8.00 Prom - 178603 178564   40                              -7.55

 9.00 Prom + 185123 185162   40                              -4.45
 9.01 Init + 186939 186965   27  2  0   90   81    38 0.640   1.76
 9.02 Intr + 198111 198231  121  2  1   74   92    40 0.671   2.15
 9.03 Intr + 199319 199816  498  0  0   84   82   225 0.594  13.53

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  55994  55647  348  2  0   75   49   365 0.841  27.09
S.002 Term - 157193 157042  152  0  2   82   42   221 0.879  13.99

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_1|345_aa
MIQHVAGENLSAYNSVGQVISGLQETWGPCRCPSETVVKRLSGNGESTGSTRQTSGTESK
VLTGRIPKGSLGRGQGSSQPPLSGLKQVKKVAEATRWQKGVTHSTWRRLEVTARNLRMAR
QFYNSRSQGEPEHRRTQMTGAFGPVAFSLGLLLVLPAAFPAPVPPGEDSKDVAAPHRQPL
TSSERIDKQIRYILDGISALRKETCNKSNMCESSKEALAENNLNLPKMAEKDGCFQSGFN
EETCLVKIITGLLEFEVYLEYLQNRFESSEEQARAVQMSTKVLIQFLQKKAKNLDAITTP
DPTTNASLLTKLQAQNQWLQDMTTHLILRSFKEFLQSSLRALRQM

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_1|1038_bp
atgatccagcacgtggctggggaaaatctgagcgcatataattctgtaggacaagtaatt
tcagggctccaggagacctggggcccatgcaggtgccccagtgaaacagtggtgaagaga
ctcagtggcaatggggagagcactggcagcacaaggcaaacctctggcacagagagcaaa
gtcctcactgggaggattcccaaggggtcacttgggagagggcagggcagcagccaacct
cctctaagtgggctgaagcaggtgaagaaagtggcagaagccacgcggtggcaaaaagga
gtcacacactccacctggagacgccttgaagtaactgcacgaaatttgaggatggccagg
cagttctacaacagccgctcacagggagagccagaacacagaagaactcagatgactggc
gccttcggtccagttgccttctccctggggctgctcctggtgttgcctgctgccttccct
gccccagtacccccaggagaagattccaaagatgtagccgccccacacagacagccactc
acctcttcagaacgaattgacaaacaaattcggtacatcctcgacggcatctcagccctg
agaaaggagacatgtaacaagagtaacatgtgtgaaagcagcaaagaggcactggcagaa
aacaacctgaaccttccaaagatggctgaaaaagatggatgcttccaatctggattcaat
gaggagacttgcctggtgaaaatcatcactggtcttttggagtttgaggtatacctagag
tacctccagaacagatttgagagtagtgaggaacaagccagagctgtgcagatgagtaca
aaagtcctgatccagttcctgcagaaaaaggcaaagaatctagatgcaataaccacccct
gacccaaccacaaatgccagcctgctgacgaagctgcaggcacagaaccagtggctgcag
gacatgacaactcatctcattctgcgcagctttaaggagttcctgcagtccagcctgagg
gctcttcggcaaatgtag

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_2|134_aa
MDKWDHVKLKSFRVAKDTINKVKRQPTEWEKISANYPSDKELVIRIYKELKQPCRKKKSN
KSIKIWAKDLNRHFSKEGIQMANRYMKRCSTSLIIREIVSPQSKLLICKRQTITNAGENV
EKRELLYAVGGNVN

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_2|405_bp
atggacaaatgggatcacgtcaagttaaaaagcttccgcgtggcaaaggatacaatcaac
aaagtgaagagacaacccacagaatgggagaaaatatctgccaactacccatctgacaag
gaattagtaataagaatatacaaggagctcaaacaaccctgtaggaaaaaaaaatctaat
aaatcgatcaaaatatgggcaaaagatttgaatagacatttctcaaaagaaggcatacaa
atggcaaacaggtatatgaaaaggtgctcaacatcactgatcatcagagaaattgtctca
ccccagtcaaaactgcttatatgcaaaagacagacaataacaaatgctggcgagaatgta
gagaaaagggaactgttgtatgctgttggtgggaatgtaaattag

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_3|157_aa
MTKKRRNNVRAKKGRGHVQPIRCTNCARCVPKDNAIKKFVIRNIVEAAAVRDISEASVFD
AYVLPKLCVKLHYCVSCVIHSKVVRNRSREARKDRTPPPRFRPAGAAARPPPKPITLTTP
NADKDVKQQELSFIVGGNAKFGGQLVISYKTKHTFTI

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_3|474_bp
atgacaaagaaaagaaggaacaacgttcgtgccaaaaagggccgcggccacgtgcagcct
attcgctgcactaactgtgcccgatgcgtgcccaaggacaacgccattaagaaattcgtc
attcgaaacatagtggaggctgcagcagtcagggacatttctgaagcgagcgtcttcgat
gcctatgtgcttcccaagctgtgtgtgaagctacattactgtgtgagttgtgtaattcac
agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga
tttagacctgcgggtgctgccgcacgtcccccaccaaagcccataacactgacaacacca
aatgctgacaaggatgtgaagcaacaagaactctccttcattgttggtggaaatgcaaaa
tttggaggacagttggtgatttcttacaaaactaaacatacttttaccatatga

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_4|182_aa
MAERGQHRAQAMVSEGGSLKRWQLSCDVVPVSAQDLRECLDAQAEVCCRDGFLIWTASAR
AVQKGNVGWDPPHRVPTGAPPSRAVRRRPPSSRPQNDRSTGSLYHMPGKATDAQCQSMKA
AGREAVSCKATEAELPETMGTYLLHQRDPDVRHGVKGDHFGPLRFDCPAGFRTCVGPVAP
LF

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_4|549_bp
atggctgaaaggggccaacatagagctcaggccatggtttcagagggtggaagcctcaag
cgttggcagctttcatgtgatgttgtgcctgtgagtgcacaagatctacgggaatgcctg
gatgcccaggcagaagtttgttgtagggatgggttcctcatatggacagcctctgctagg
gcagtgcagaagggaaatgtggggtgggatcccccacacagagtccctactggggcacca
cctagtcgggctgtgagaagaaggccaccatcctccagaccccagaatgatagatccact
ggcagcttgtaccacatgcctggaaaagccacagatgctcaatgccagtccatgaaagca
gctgggagggaggctgtatcctgcaaagccacagaggcagagctacccgagaccatggga
acctacctcttgcatcagcgtgacccggatgtgagacatggagtcaaaggagatcatttt
ggacctttaagatttgactgccctgctggatttcggacttgtgtggggcctgtagcccct
ttgttttga

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_5|190_aa
MQVQPKICTRCNALSIPVQRTTKTASTILPEALKVMIEPKLHTKLNPQLLVLVRVEVEEV
KEQPGASQHSCATLAKKFLYGCFLTCKMGAVIEIIQTLHRTVTFPVLLPLTQLHHGDGRV
AQGGPLTATTASGIRKGKEGVFEQTNNFLLPGYSLELLSTPGKHMYEIKFLQRRDSSKSI
TGTPTKCKAL

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_5|573_bp
atgcaggtccagcctaaaatttgtacaagatgcaatgctttgtcaattcctgtccaaagg
accaccaagacggccagcactatactccctgaagcactgaaagttatgattgaacctaaa
ttacacacaaagttaaatccgcagttgctggttttagtgagagttgaggtggaggaagta
aaagaacaacctggagccagtcagcatagctgtgcgaccttggcaaaaaagtttctctac
ggctgtttcctcacctgtaaaatgggggcagtaatagaaatcatccaaacccttcatcgt
acagtcactttcccggttcttctcccactgacccagcttcaccatggcgacggccgtgtg
gcgcagggaggaccccttacagcaaccacagcgtcgggaatccgaaagggaaaggagggt
gtttttgagcagacaaacaatttcttattgccaggatacagcctggaactcttaagcaca
ccaggcaagcatatgtatgaaataaaatttctacaaagaagagattcgtcaaaaagcatc
actgggacaccaaccaagtgtaaggctctgtga

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_6|245_aa
MVDGGPDQDIHVEATAGWLEQPGTKRLAGALQRQTLEDKAHYLQIILQSRAKCGDQDRGN
QEAFTGPEVDLRENLEAQLAEKPIRQSEAGNSSNYVAEDWLPLKDVNISGGRGCKLIEEG
LKHGEDSWCLECCMLRMMATARKYLAPKARLCHWTIICSRTGVPNPQAADWYQSEACQEP
GSTAGGERWKRKRYQRPFSLPGTHEQRKDHVRHCEKAAAHTPGRESSPATIPAGTLILDL
ELSEL

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_6|738_bp
atggtggatgggggacctgatcaggacatccatgtggaagccacagcagggtggctggag
cagcctggaacaaaacgcctggctggtgctctgcaaaggcagacactggaggacaaagca
cattatctccagataattcttcagtccagagctaagtgcggggatcaggacagaggcaac
caggaggcctttacagggccagaggtggatctcagggagaatctagaagctcagctggct
gaaaaaccaatcaggcagtcagaagctgggaactcgtctaattacgtagcagaagactgg
ctcccgttaaaggatgtgaacattagcggaggtcgggggtgcaaactcatcgaagagggg
ttgaagcacggggaggactcctggtgcctggagtgctgcatgcttagaatgatggccacg
gcccgcaaatatcttgcccccaaagcacgtctgtgtcattggactattatttgctctagg
acaggggtccccaacccccaggctgcagactggtatcagtccgaggcctgtcaggaaccg
ggcagcacagcaggaggcgaacgatggaagaggaagagataccagaggcccttctctctc
cccggcacacatgaacagaggaaagaccatgtgaggcactgcgagaaggcagctgcccat
actccaggaagagagtcctcaccagcaaccatccctgctggtaccttgatcttggacttg
gagctttcagaactgtga

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_7|267_aa
MNDTWLMAEFRQQLFGADQEQRQLIRGSVPRAVPLTNQLASVVTRQLTIQSLWMVTVSAV
PARFAKATRVVTSAQQATYRRRGKGLTNKLLFQESLQKTATRLSAFSWEEAKVQLSWVVL
NLATSAMLPKFYPNEIKVTYLRCTGDEVGATSVLAPKISPLGLSSEKNREAQIEVVTSAS
ALIIKALKEPPRDRKKQKNIKHSGNITFDEIVNVAQHMWHRSLARELSGIIKEILGTPQS
VGCNVDGCHPHDIIDDINGGAVECPAN

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_7|804_bp
atgaacgacacatggttgatggcagagttcagacaacaattatttggagcagaccaggag
caaagacagctgattagaggatcagtacccagggcggtaccactcaccaaccagctggcc
tctgttgtcacccgtcagctcacgatccagtccctctggatggtgacggtcagcgccgtg
ccggccaggtttgccaaggccactagggtggtcaccagtgcacagcaggccacctaccga
aggaggggaaagggtcttacaaacaagctgttattccaagaatctcttcaaaaaacagca
acaaggctttcagctttcagctgggaggaggccaaggtgcaactttcttgggtcgtcctg
aatctagccacctctgccatgctgccgaagttctaccccaatgagatcaaagtcacatac
ctaaggtgtactggggatgaagtgggtgccacgtctgtgttggcccccaagatcagcccc
ctaggtctgtcttcagaaaagaacagagaggcccagattgaggtggtgacttctgcctct
gccctgatcatcaaagccctcaaggaaccgccaagagacagaaagaaacagaaaaacatt
aaacacagtggaaatatcacttttgatgagattgtcaatgttgctcaacacatgtggcac
cgatctttagccagagaactctctggaatcattaaagagatcctggggactccccagtct
gtgggctgcaatgttgatggctgccaccctcatgatatcatagatgacatcaacggtggt
gctgtggaatgcccagctaattaa

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_8|99_aa
MSVLDLKGVDGIKCSGPGKYSKRGVSFSNAWLCLPCGCPEMGWWNSKRSLASLQDSGAAE
PSSYHDYSAFQQHVVMVGELNDTENSGNEEKQTKATGTG

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_8|300_bp
atgtcagttttggaccttaaaggtgtggacggcataaagtgctcaggacctggtaaatac
agcaaacggggagtgagtttctcaaatgcctggctgtgtctgccctgtggctgccccgag
atggggtggtggaatagcaagaggagccttgccagcctgcaggactcaggtgctgctgag
ccctccagttaccacgactacagcgctttccagcagcatgtagtaatggtgggagaacta
aatgacactgaaaattctggaaatgaggagaaacagactaaagccacgggcaccggatga

>gi568815591r:22718000_22922779|GENSCAN_predicted_peptide_9|216_aa
MVLASTRLLHALKGLKPVIARLLQHGLLKPINSPYNSPILPSKNRTSLTGYTLLARLLES
LISFPSWKSILKEITSQCSICYSTTPEGLFRPPLFPTHQARGFAPVQDWQIDFTHMPRVR
KLKYLLVWVDTFTGWVEAFPTGPEKATVVISSLPSDIIPGFGLPTSIIQSDNGLAFISQI
AQAVSQALGIQWLLVLPQTTTLKSLLKWIKDLQWQX

>gi568815591r:22718000_22922779|GENSCAN_predicted_CDS_9|648_bp
atggtgctggcatccacaagactcctgcatgctttaaaaggattaaagcctgttatcgct
cgcctgttacagcatggccttttaaagcctataaactctccttacaattcccccatttta
ccatccaaaaaccggacaagtcttacaggttacacgctgctagcccgcctcttagaatct
ctcatttcctttccatcgtggaaatctattctcaaggaaataacttctcagtgttccatc
tgctattctacgactcctgagggattgttcaggccccctctcttccctacacatcaagct
cggggatttgcccctgtccaggactggcaaattgactttactcacatgccccgagtcagg
aaactaaaatacctcttggtctgggtagacactttcactggatgggtagaggcctttccc
acagggcctgaaaaggccactgtggtcatttcttcccttccgtcagacataattcctggg
tttggccttcccacctctataatacagtctgataacggattggcctttattagtcaaatc
gcccaagcagtttctcaggctcttggtattcagtggctcctggttttacctcaaaccacc
acccttaagtctctcttaaagtggataaaagatcttcagtggcaagnn
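
The peptide and CDS records above come in pairs, so each predicted peptide can be cross-checked by translating its CDS. The following is a minimal sketch of such a check, not part of GENSCAN itself. It assumes the standard genetic code, drops a trailing stop codon, and reports any codon containing an ambiguous base as `X` (which is how peptide 9 ends in `X` where CDS 9 ends in `nn`). The function names `read_records`, `translate`, and `check_pairs` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order (assumption:
# the predictions use the standard nuclear code).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def read_records(text):
    """Collect '>header' records; works on line-broken or space-flattened text."""
    records, name = {}, None
    for token in text.split():
        if token.startswith(">"):
            name = token[1:]
            records[name] = ""
        elif name is not None:  # report text before the first header is ignored
            records[name] += token
    return records


def translate(cds):
    """Translate a CDS; unknown codons become 'X', a trailing stop is dropped."""
    cds = cds.lower()
    usable = len(cds) - len(cds) % 3
    peptide = "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))
    return peptide.rstrip("*")


def check_pairs(records):
    """Map gene number -> True if the translated CDS equals the listed peptide."""
    results = {}
    for name, seq in records.items():
        label = name.split("|")[1]
        if label.startswith("GENSCAN_predicted_CDS_"):
            gene = label.rsplit("_", 1)[1]
            key = f"|GENSCAN_predicted_peptide_{gene}|"
            peptide = next(s for n, s in records.items() if key in n)
            results[gene] = translate(seq) == peptide
    return results


# Tiny synthetic example (not taken from the report above).
sample = ">seq|GENSCAN_predicted_peptide_1|2_aa MA >seq|GENSCAN_predicted_CDS_1|9_bp atggcctga"
print(check_pairs(read_records(sample)))  # {'1': True}
```

To apply it to this report, pass the full text of the file to `read_records` and inspect the dictionary returned by `check_pairs`; any gene mapped to `False` would indicate a peptide/CDS mismatch or a copying error.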