GENSCAN 1.0    Date run: 5-Nov-116    Time: 06:16:57

Sequence gi568815597r:205670345_205874956 : 204612 bp : 44.69% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    240    332   93  2  0  109   72    61 0.980   6.76
 1.02 Intr +   1784   1974  191  1  2   85   94    56 0.742   4.28
 1.03 Intr +   2429   2545  117  2  0   18   80    86 0.375   0.38
 1.04 Intr +   4837   4894   58  1  1   82   71    10 0.207  -2.41
 1.05 Term +   5470   5688  219  0  0   98   48    68 0.243   0.84
 1.06 PlyA +   6855   6860    6                              -1.75

 2.06 PlyA -   7434   7429    6                               1.05
 2.05 Term -   8450   8317  134  0  2  110   40   103 0.808   5.95
 2.04 Intr -  17134  17042   93  2  0   80   81    67 0.459   5.14
 2.03 Intr -  20014  19970   45  2  0  122   88    -2 0.317   1.58
 2.02 Intr -  23001  22969   33  1  0  122   92    13 0.162   3.29
 2.01 Init -  33617  33605   13  2  1   81   61     1 0.020  -2.86
 2.00 Prom -  38235  38196   40                              -2.36

 3.07 PlyA -  39599  39594    6                               1.05
 3.06 Term -  48135  47936  200  1  2   91   42   163 0.997   9.56
 3.05 Intr -  49332  49183  150  1  0   70   91   169 0.996  15.53
 3.04 Intr -  50309  50157  153  0  0   50   36   209 0.978  11.94
 3.03 Intr -  53637  53582   56  2  2   73   67    82 0.917   3.22
 3.02 Intr -  57489  57356  134  0  2  -14   82   142 0.798   2.84
 3.01 Init -  59233  59228    6  1  0   61  111     0 0.609   0.57
 3.00 Prom -  61767  61728   40                              -4.36

 4.00 Prom +  64906  64945   40                              -2.66
 4.01 Init +  78689  78782   94  1  1   65   80    63 0.782   3.74
 4.02 Intr +  79106  79206  101  0  2   87   81     9 0.630  -0.07
 4.03 Term +  79620  80051  432  2  0   40   54   182 0.561   5.30
 4.04 PlyA +  83712  83717    6                               1.05

 5.06 PlyA -  84179  84174    6                               1.05
 5.05 Term - 100109  99998  112  1  1   85   39    89 0.986   1.73
 5.04 Intr - 100510 100389  122  1  2   81  107    41 0.591   4.59
 5.03 Intr - 101309 101128  182  0  2  116   95   193 0.999  22.39
 5.02 Intr - 102223 102152   72  2  0   19  103   103 0.934   4.28
 5.01 Init - 104612 104489  124  2  1   86   78   198 0.970  19.18
 5.00 Prom - 107961 107922   40                              -1.46

 6.12 PlyA - 107968 107963    6                               1.05
 6.11 Term - 121374 121189  186  0  0   92   33   161 0.991   8.49
 6.10 Intr - 124674 124526  149  1  2  110   87   168 0.940  18.85
 6.09 Intr - 125134 125000  135  2  0   98   57   168 0.983  15.24
 6.08 Intr - 126659 126580   80  1  2   79   77    41 0.964   1.29
 6.07 Intr - 127707 127560  148  1  1   95   96   149 0.999  15.69
 6.06 Intr - 128471 128325  147  0  0  101   35   259 0.814  22.11
 6.05 Intr - 128757 128613  145  0  1  106   86   243 0.971  25.76
 6.04 Intr - 129486 129415   72  0  0  108  110    89 0.997  12.70
 6.03 Intr - 130716 130609  108  0  0   61   73   111 0.990   7.38
 6.02 Intr - 140021 139726  296  0  2  117   91   339 0.979  33.83
 6.01 Init - 143070 142776  295  0  1   64   51   244 0.931  13.65
 6.00 Prom - 154973 154934   40                              -2.16

 7.14 PlyA - 157701 157696    6                               1.05
 7.13 Term - 158399 158276  124  1  1   86   51   135 0.778   7.46
 7.12 Intr - 160035 159936  100  1  1   88  113   135 0.998  15.17
 7.11 Intr - 162464 162254  211  2  1   23   95   220 0.703  14.69
 7.10 Intr - 169979 169908   72  2  0   79   87   111 0.782   9.70
 7.09 Intr - 171545 171467   79  1  1  107   87    36 0.542   4.95
 7.08 Intr - 173442 173323  120  2  0   52  119    99 0.978   9.01
 7.07 Intr - 173873 173743  131  2  2   53   70   165 0.978  10.69
 7.06 Intr - 174553 174467   87  1  0   53   65   114 0.952   5.77
 7.05 Intr - 175213 174981  233  2  2  110    6   244 0.881  15.89
 7.04 Intr - 177627 177541   87  1  0   76   82    63 0.771   4.44
 7.03 Intr - 178054 177985   70  1  1   -9   57    37 0.149 -10.35
 7.02 Intr - 179502 179348  155  1  2  117   58   113 0.396  10.99
 7.01 Init - 179728 179560  169  1  1   83   61   227 0.954  17.20
 7.00 Prom - 182303 182264   40                              -3.66

 8.04 PlyA - 182468 182463    6                               1.05
 8.03 Term - 186466 186360  107  2  2   70   50    64 0.074  -0.63
 8.02 Intr - 199903 199784  120  2  0  101   67    11 0.535   0.77
 8.01 Init - 202267 202228   40  1  1   68  111    83 0.979   8.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:205670345_205874956|GENSCAN_predicted_peptide_1|225_aa
GLEARDLEFALLSCLFFTVTGSLLLSSLSERWRKGFLIPKQERLWLGETKYTFFSVSIPP
PHPRASPTTPAPLSAPSSKRNAKNATASPGAGPNSFYTCKMRANLDNAEGSLHLCNQVIS
HAPPTPGSCFFTGRNGVGPLCNEGLMTYYQTRQPPGNQAVKYPLSLLNLNCSNQAAPRVG
RCPRSSGSWICSPSPTAILPTSASAPGKQEAQALRRVGTQEKAYG

>gi568815597r:205670345_205874956|GENSCAN_predicted_CDS_1|678_bp
ggcctggaggccagggatttagagttcgccctgctgagctgccttttcttcaccgttact
ggaagccttctgcttagcagcttgtcagagaggtggaggaagggctttctcattcccaaa
caggagcggctgtggttaggggagaccaagtacactttcttctcggtttccattcccccc
cctcacccacgtgcctcccccaccacaccagcgcctctttcagcccctagcagcaagagg
aatgccaagaatgcaactgcctccccaggtgctggccccaacagtttctacacctgtaaa
atgagagctaatctggataatgccgaaggttctttacatctgtgcaaccaggtgatctca
catgcaccccctacccccggcagttgcttcttcacgggcaggaatggggtaggacccctc
tgcaatgagggccttatgacctactatcaaacaaggcagccccctggaaaccaagccgtc
aagtatcccctctccctgctaaatctcaactgctctaaccaggcagcccccagggttgga
cgctgcccacggtcctcagggtcttggatttgcagtccttcccccactgccatcctcccc
actagtgcttcagctcctggaaagcaggaagcacaggccctgagaagagtaggtacccaa
gagaaagcctatgggtaa

>gi568815597r:205670345_205874956|GENSCAN_predicted_peptide_2|105_aa
MEMIGKKSPVLPDFSGLDGLCKMVLCLGARGATLKPEVTSCVVKGSFVSNSKALPSCDDS
QGLSWSQDSVVLKVLRKCSLSQQNEFEDKNTAVDECTAVDRQAPI

>gi568815597r:205670345_205874956|GENSCAN_predicted_CDS_2|318_bp
atggaaatgatagggaagaaaagtcctgttttaccagacttctcaggcctggatggattg
tgcaaaatggtactatgccttggagcaagaggggcaacactcaaacctgaggtcaccagc
tgtgtagtgaagggctcatttgtcagcaactccaaggctctgccatcttgtgatgacagt
caaggcctgtcgtggagccaggactctgttgttctgaaagtgctcagaaaatgcagcttg
agtcagcagaatgagtttgaagataaaaatacagctgtggatgaatgtacagctgtggac
agacaggccccaatttag

>gi568815597r:205670345_205874956|GENSCAN_predicted_peptide_3|232_aa
MQLIKFEKHCLDEDYGRDSGPPTKKIRSSPREAKNKRRSGKNSQEDSEDSEDKDVKTKKD
DSHSAEDSEDEKEDHKNVRQQRQAASKAASKQREMLMEDVGSEEEQEEEDEAPFQEKDSG
SDEDFLMEDDDDSDYGSSKKKNKKMVKKSKPERKEKKMPKPRLKATVTPSPVKGKGKVGR
PTASKASKEKTPSPKEEDEEPESPPEKKTSTSPPPEKSGDEGSEDEAPSGED

>gi568815597r:205670345_205874956|GENSCAN_predicted_CDS_3|699_bp
atgcagttaattaagtttgaaaaacactgtttagatgaagattatggaagagattcgggc
cctcccactaagaaaattcgatcatctccccgagaagctaaaaataagaggcgatctgga
aagaattcacaggaagatagtgaggactcagaagacaaagatgtgaagaccaagaaggat
gattctcactcagcagaggatagtgaagatgaaaaagaagatcataaaaatgtgcgccaa
caacggcaggcggcatctaaagcagcttctaaacagagagagatgctcatggaagatgtg
ggcagtgaggaagaacaagaagaggaggatgaggcaccattccaggagaaagattccggc
agcgatgaagatttcctaatggaagatgatgacgatagtgactatggcagttcgaaaaag
aaaaacaaaaagatggttaagaagtccaaacctgaaagaaaagaaaagaaaatgcccaaa
cccagactaaaggctacagtgacgccaagtccagtgaaaggcaaagggaaagtgggtcgc
cccacagcttcaaaggcatcaaaggaaaagactccttctcccaaagaagaagatgaggaa
ccggaaagcccgccagaaaagaaaacatctacaagccccccacccgagaaatctggggat
gaagggtctgaagatgaagccccttctggggaggattaa

>gi568815597r:205670345_205874956|GENSCAN_predicted_peptide_4|208_aa
MKSDGYPNLALLLLKSLGDKRVTKISRDAYAGTPPSATERAQGPHPRASRWGKGQYGCLR
PSSLKAATCSLSKQDRVEKPKTRTPPTPRARRPTPPELQPMGPLLPNPELLASQTPLLFG
SGLLEQTSPPLPRLFKMDESNQPKMRQSRRATKPTRVCALLPNTPAHWRLLPDAPRSLVR
LNHHVFQAPPCSLIGSTFTVGVLLVKML

>gi568815597r:205670345_205874956|GENSCAN_predicted_CDS_4|627_bp
atgaagtctgatggttacccgaacctagcccttttattactaaagtcactcggggacaag
agggttacaaaaatatcccgagacgcgtacgcgggaacacccccgtccgctacagagagg
gctcagggcccccacccccgcgccagtcgctgggggaaggggcaatacgggtgcctccgc
ccctcctcgctgaaggccgcgacatgttcgctgtcgaaacaggaccgagtcgagaagcca
aagaccaggaccccccccaccccgcgcgctcggcgccccaccccccccgaacttcagccg
atgggaccgctgctgccgaaccccgagctgctggcttctcaaactccgctgctctttggt
tcagggctcctggaacagacgagccccccgctcccccgtctcttcaaaatggatgaatca
aaccagccgaaaatgcgccaaagccgccgtgcaaccaaacccactagggtttgtgcgctc
ctccccaacacgcctgctcattggagacttctgccagatgcgcccagatcattagtccgt
ttgaatcatcacgtcttccaggccccgccctgctctctgataggctccaccttcaccgtg
ggggtcctgttagtcaagatgctctga

>gi568815597r:205670345_205874956|GENSCAN_predicted_peptide_5|203_aa
MGSRDHLFKVLVVGDAAVGKTSLVQRYSQDSFSKHYKSTVGVDFALKVLQWSDYEIVRLQ
LWDIAGQERFTSMTRLYYRDASACVIMFDVTNATTFSNSQRWKQDLDSKLTLPNGEPVPC
LLLANKCDLSPWAVSRDQIDRFSKENGFTGWTETSVKENKNINEAMRVLIEKMMRNSTED
IMSLSTQGDYINLQTKSSSWSCC

>gi568815597r:205670345_205874956|GENSCAN_predicted_CDS_5|612_bp
atgggcagccgcgaccacctgttcaaagtgctggtggtgggggacgccgcagtgggcaag
acgtcgctggtgcagcgatattcccaggacagcttcagcaaacactacaagtccacggtg
ggagtggattttgctctgaaggttctccagtggtctgactacgagatagtgcggcttcag
ctgtgggatattgcagggcaggagcgcttcacctctatgacacgattgtattatcgggat
gcctctgcctgtgttattatgtttgacgttaccaatgccactaccttcagcaacagccag
aggtggaaacaggacctagacagcaagctcacactacccaatggagagccggtgccctgc
ctgctcttggccaacaagtgtgatctgtccccttgggcagtgagccgggaccagattgac
cggttcagtaaagagaacggtttcacaggttggacagaaacatcagtcaaggagaacaaa
aatattaatgaggctatgagagtcctcattgaaaagatgatgagaaattccacagaagat
atcatgtctttgtccacccaaggggactacatcaatctacaaaccaagtcctccagctgg
tcctgctgctag

>gi568815597r:205670345_205874956|GENSCAN_predicted_peptide_6|586_aa
MGAGTRALQAHGAGSRFPSLLLLRFPGAPPPGHEAKEGGRAEVDPAPSPVPDWLLLLSIR
ESIFDGAGGVPPSLPQITFATGSQPVGPESAGKKRLGAHGPGREPLAGTSEFLGPDGAGV
EVVIESRANAKGVREEDALLENGSQSNESDDVSTDRGPAPPSPLKETSFSIGLQVLFPFL
LAGFGTVAAGMVLDIVQHWEVFQKVTEVFILVPALLGLKGNLEMTLASRLSTAANIGHMD
TPKELWRMITGNMALIQVQATVVGFLASIAAVVFGWIPDGHFSIPHAFLLCASSVATAFI
ASLVLGMIMIGVIIGSRKIGINPDNVATPIAASLGDLITLALLSGISWGLYLELNHWRYI
YPLVCAFFVALLPVWVVLARRSPATREVLYSGWEPVIIAMAISSVGGLILDKTVSDPNFA
GMAVFTPVINGVGGNLVAVQASRISTFLHMNGMPGENSEQAPRRCPSPCTTFFSPDVNSR
SARVLFLLVVPGHLVFLYTISCMQGGHTTLTLIFIIFYMTAALLQVLILLYIADWMVHWM
WGRGLDPDNFSIPYLTALGDLLGTGLLALSFHVLWLIGDRDTDVGD

>gi568815597r:205670345_205874956|GENSCAN_predicted_CDS_6|1761_bp
atgggcgctgggacccgcgcgctccaggcgcatggagccggctcccggttcccgtcactc
ctcctactgcgtttccccggcgccccgcctcctgggcacgaagcgaaggaagggggccgg
gccgaggttgatcccgccccctccccagtccctgattggctgctgctgttgtccatccga
gaatctatttttgatggagcggggggggtgccacccagtctgccccagatcacgtttgcc
accggcagccaaccagttgggcccgagtccgcgggcaagaagcgattgggggcgcatggc
ccagggagagagcccttggctgggacctcagagttcctggggcctgatggggctggggta
gaggtggtgattgagtctcgggccaacgccaagggggttcgggaggaggacgccctgctg
gagaacgggagccagagcaacgaaagtgacgacgtcagcacagaccgtggccctgcgcca
ccttccccgctcaaggagacctccttttccatcgggctgcaagtactgtttccattcctc
ctggcaggctttgggaccgtggctgctggcatggtgttggacatcgtgcagcactgggaa
gtcttccagaaggtgacagaggtcttcatcctagtgcctgcgctgctggggctcaaaggg
aacctggaaatgaccctggcatcaaggctttccactgcagccaacattggacacatggac
acacccaaggagctctggcggatgatcactgggaacatggccctcatccaggtgcaggcc
acggtggtgggcttcctggcgtccatcgcagccgtcgtctttggctggatccctgatggc
cacttcagtattccgcacgccttcctgctctgtgctagcagcgtggccacagccttcatt
gcctccctggtactgggtatgatcatgattggagtcatcattggctctcgcaagattggg
atcaacccagacaacgtggccacacccattgctgccagcctgggcgacctcatcaccttg
gcgctgctctcaggcatcagctggggactctacctggaactgaatcactggcgatacatc
tacccactggtgtgtgctttctttgtggccctgctgcctgtctgggtggtgctggcccga
cgaagtccagccacaagggaggtgttgtactcgggctgggagcctgttatcattgccatg
gccatcagcagtgtgggaggcctcatcttggacaagactgtctcagaccccaactttgct
gggatggctgtcttcacgcctgtgattaatggtgttgggggcaatctggtggcagtgcag
gccagccgcatctccaccttcctgcacatgaatggaatgcccggagagaactctgagcaa
gctcctcgccgctgtcccagtccttgtaccaccttcttcagccctgatgtgaattctcgc
tcagcccgggtcctcttcctcctcgtggtcccaggacacctggtgttcctctacaccatc
agctgtatgcagggcgggcacaccaccctcacactcatcttcatcatcttctatatgaca
gctgcactgctccaggtgctgattctcctgtacatcgcagactggatggtgcactggatg
tggggccggggcctggacccggacaacttctccatcccatacttgactgctctgggggac
ctgcttggcactgggctcctagcactcagcttccatgttctctggctcataggggaccga
gacacggatgtcggggactag

>gi568815597r:205670345_205874956|GENSCAN_predicted_peptide_7|545_aa
MAQRCVCVLALVAMLLLVFPTVSRSMGPRSGEHQRASRIPSQFSKEERVAMKEALKGQPV
TPRFRRKASATAAGAPTRTPARPQHPSQHIVPLSGDATLPPYLAPGTLGPRTVGALRMMK
LGFIKAIGDFRGAIQIPTVTFSSEKSNTTALAEFGKYIHKVFPTVVSTSFIQHEVVEEYS
HLFTIQGSDPSLQPYLLMAHFDVVPAPEEGWEVPPFSGLERDGIIYGRGTLDDKNSVMAL
LQALELLLIRKYIPRRSFFISLGHDEESSGTGAQRISALLQSRGVQLAFIVDEGGFILDD
FIPNFKKPIALIAVSEKGSMNLMLQVNMTSGHSSAPPKETSIGILAAAVSRFMERNPLTN
AIIRTTTALTIFKAGVKFNVIPPVAQATVNFRIHPGQTVQEKSIRNRTPFVSTLEVLELT
KNIVADNRVQFHVLSAFDPLPVSPSDDKALGYQLLRQTVQSVFPEVNITAPVTSIGNTDS
RFFTNLTTGIYRFYPIYIQPEDFKRIHGVNEKISVQAYETQVKFIFELIQNADTDQEPVS
HLHKL

>gi568815597r:205670345_205874956|GENSCAN_predicted_CDS_7|1638_bp
atggctcagcggtgcgtttgcgtgctggccctggtggctatgctgctcctagttttccct
accgtctccagatcgatgggcccgaggagcggggagcatcaaagggcgtcgcgaatccct
tctcagttcagcaaagaggaacgcgtcgcgatgaaagaggcgctgaaaggtcaacccgtg
actccccgcttccgccgcaaagcatctgctactgcagctggagccccgacacgcacccca
gcccggccccaacacccttcccagcacatcgtgcccctgtctggggacgccacacttccc
ccatacctggctcctggaactctgggacccaggacagtgggtgccttgagaatgatgaag
cttggattcatcaaagcaattggtgatttcagaggtgccatccagattccaacagtgact
tttagctctgagaagtccaatactacagccctggctgagttcggaaaatacattcataaa
gtctttcctacagtggtcagcaccagctttatccagcatgaagtcgtggaagagtatagc
cacctgttcactatccaaggctcggaccccagcttgcagccctacctgctgatggctcac
tttgatgtggtgcctgcccctgaagaaggctgggaggtgcccccattctctgggttggag
cgtgatggcatcatctatggtcggggcacactggacgacaagaactctgtgatggcatta
ctgcaggccttggagctcctgctgatcaggaagtacatcccccgaagatctttcttcatt
tctctgggccatgatgaggagtcatcagggacaggggctcagaggatctcagccctgcta
cagtcaaggggcgtccagctagccttcattgtggacgaggggggcttcatcttggatgat
ttcattcctaacttcaagaagcccatcgccttgattgcagtctcagagaagggttccatg
aacctcatgctgcaagtaaacatgacttcaggccactcttcagctcctccaaaggagaca
agcattggcatccttgcagctgctgtcagccggtttatggagagaaatcccttaaccaat
gcaataatcaggaccaccacggcactcaccatattcaaagcaggggtcaagttcaatgtc
atccccccagtggcccaggccacagtcaacttccggattcaccctggacagacagtccaa
gagaaatcaataagaaatcgaaccccctttgtttctaccctggaggtcctagaactcacg
aagaacattgtggctgataacagagtccagttccatgtgttgagtgcctttgaccccctc
cccgtcagcccttctgatgacaaggccttgggctaccagctgctccgccagaccgtacag
tccgtcttcccggaagtcaatattactgccccagttacttctattggcaacacagacagc
cgattctttacaaacctcaccactggcatctacaggttctaccccatctacatacagcct
gaagacttcaaacgcatccatggagtcaacgagaaaatctcagtccaagcctatgagacc
caagtgaaattcatctttgagttgattcagaatgctgacacagaccaggagccagtttct
cacctgcacaaactgtga

>gi568815597r:205670345_205874956|GENSCAN_predicted_peptide_8|88_aa
MGAVREALRQYSPGGKCSPHGKNAPKMYSGEFGPVRVYVPFSLSDLKQIKIDLDGFYYFE
LCPFYADFAGGFHHKVMLDFGKCFFCIH

>gi568815597r:205670345_205874956|GENSCAN_predicted_CDS_8|267_bp
atgggggctgtccgcgaagccttgcggcagtacagcccaggtgggaaatgttccccacac
ggcaaaaatgcacctaagatgtattctggagaatttggcccagtcagagtgtatgtacct
ttttccctctcagacttgaagcaaattaaaatagacctagatggcttttattactttgag
ttatgtcccttctatgctgactttgctggtggctttcatcataaagtgatgctggatttt
ggcaaatgctttttctgcatccattga