GENSCAN 1.0	Date run: 4-Nov-116	Time: 17:35:56

Sequence gi568815592r:42598298_42822334 : 224037 bp : 43.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5291   5421  131  1  2  116   78    44 0.927   5.69
 1.02 Intr +   7424   7562  139  2  1   43   95    97 0.961   6.37
 1.03 Intr +   8292   8354   63  2  0   67  103    22 0.573   0.51
 1.04 Intr +  18943  19041   99  0  0   56  105    74 0.983   6.21
 1.05 Intr +  19112  19210   99  1  0   85   89    88 0.990   8.91
 1.06 Intr +  34255  34418  164  0  2   74  108    85 0.968   7.87
 1.07 Intr +  34508  34607  100  2  1  104   91     4 0.907   2.41
 1.08 Intr +  44119  44184   66  0  0   74   95    37 0.156   2.10
 1.09 Intr +  46176  46239   64  2  1   98  111    23 0.984   3.89
 1.10 Intr +  47169  47293  125  1  2   70   95    40 0.901   3.20
 1.11 Intr +  49821  49873   53  2  2   59   89    32 0.341  -1.89
 1.12 Intr +  51987  52089  103  0  1  106   86    40 0.350   5.78
 1.13 Intr +  60349  60527  179  0  2  -36   66   181 0.368   2.22
 1.14 Intr +  64961  65122  162  2  0   53  108    40 0.510   1.59
 1.15 Intr +  71795  71943  149  2  2   73   82    65 0.838   4.28
 1.16 Intr +  78486  78576   91  1  1   48  113    55 0.547   3.05
 1.17 Intr +  86497  86574   78  1  0  106   99    -4 0.603   1.17
 1.18 Intr +  89919  90089  171  0  0   27   94    96 0.741   3.16
 1.19 Intr +  91272  91373  102  0  0   84  110    13 0.822   2.39
 1.20 Term +  92735  92876  142  2  1  116   32    64 0.809   1.00
 1.21 PlyA +  92962  92967    6                               1.05

 2.04 PlyA -  92976  92971    6                               1.05
 2.03 Term - 100210  99998  213  1  0  100   55   357 0.960  30.83
 2.02 Intr - 106314 106068  247  2  1   51   80   350 0.902  27.96
 2.01 Init - 106936 106881   56  1  2   50   95   102 0.987   5.91
 2.00 Prom - 121592 121553   40                              -3.56

 3.02 PlyA - 121742 121737    6                              -0.45
 3.01 Sngl - 124037 123453  585  2  0  107   48   877 0.945  81.79
 3.00 Prom - 126825 126786   40                              -6.96

 4.00 Prom + 127465 127504   40                              -8.96
 4.01 Sngl + 128823 129227  405  2  0   25   53   835 0.682  67.98
 4.02 PlyA + 129875 129880    6                               1.05

 5.02 PlyA - 133124 133119    6                               1.05
 5.01 Sngl - 147776 146736 1041  2  0   92   47  1339 0.591 127.34
 5.00 Prom - 151886 151847   40                              -4.76

 6.03 PlyA - 152796 152791    6                               1.05
 6.02 Term - 159965 159765  201  2  0   76   54   131 0.975   5.89
 6.01 Init - 160093 160085    9  1  0   75   81    14 0.658  -0.91
 6.00 Prom - 169924 169885   40                              -3.66

 7.04 PlyA - 170786 170781    6                               1.05
 7.03 Term - 173675 173589   87  2  0  131   36    71 0.631   3.86
 7.02 Intr - 183423 183359   65  1  2   74   92    11 0.352  -1.46
 7.01 Init - 185060 184877  184  2  1   68   85   181 0.714  13.08
 7.00 Prom - 189793 189754   40                              -4.06

 8.04 PlyA - 189931 189926    6                              -0.45
 8.03 Term - 190770 190609  162  0  0  106   49    75 0.611   3.34
 8.02 Intr - 204197 204101   97  1  1   20  106    83 0.496   3.31
 8.01 Init - 210245 210169   77  2  2   77   42    92 0.574   4.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:42598298_42822334|GENSCAN_predicted_peptide_1|759_aa
DPLVHLSEDVIARTYNIFAITFRYAVEILTWEKESELPADLEMVEKSDTYYCMLFNDEVH
TYEQVIYTLQKAVNCTQKEAIGFATTVDRDGRRSVRYGDFQYCEQAKSVIVNYQQLQRDF
MEDDHERAVSVTALSVQFFTAPTLNYERLQSDYVTDDHDREFSVADLSVQIFTVPSLARM
LITEENLMSIIIKTFMDHLRHRDAQGRFQFERYTALQAFKFRRVQSLILDLKYVLISKPT
EWSDELRQKFLEGFDAFLELLKCMQIYYYHNVKCRREMFDKDVVMLQDVVQQNNTLIEEM
LYLIIMLVGERFSPGVGQVNATDEIKREIIHQLSIKPMAHSELVKSLPEDENKETGMESV
IEAVAHFKKPGLTGRGMYELKPECAKEFNLYFYHFSRAEQSKSSRDKDKAERKRKAEIAR
LRREKIMAQMSEMQRHFIDENKELFQQTLELDASTSAVLDHRYFDSVQAKEQRRQQRLRL
HTSYDVENGEFLCPLCECLSNTVIPLLLPPRNIFNKIPYSESIKEMLTTFGTATYKVGLK
VHPNEEDPRVPIMCWGSCAYTIQSIEENGMDQENPPCEEESAVLALYKTLHQYTGRYPRE
SNKLINLPEDYSSLINQASNFSCPKSGGDKSRAPTLCLVCGSLLCSQSYCCQTELEGEDV
GACTAHTYSCGSGVGIFLRVRECQVLFLAGKTKGCFYSPPYLDDYGETDQGLRRGNPLHL
CKERFKKIQKLWHQHSVTEEIGHAQEANQTLVGIDWQHL

>gi568815592r:42598298_42822334|GENSCAN_predicted_CDS_1|2280_bp
gatcctcttgttcatttatcagaagatgtgatagcaagaacttataacatttttgctatt
acgtttcggtatgcagtagaaatattaacctgggaaaaagaaagtgaattgccagcagat
ttagagatggtagagaagagtgacacctactattgcatgctgtttaatgatgaggttcac
acctatgaacaagttatttatactcttcagaaagctgttaactgtacacaaaaagaagct
attggttttgcaactacagtagatcgagatgggcgtaggtctgttcgatatggagatttt
cagtattgtgagcaagcaaaatcagtaattgtgaattaccagcagttgcagagagatttt
atggaggatgatcacgagcgagcagtgtcggtgactgctctatctgtccagttcttcacc
gcacctactctgaactatgagcgtttgcagagtgattatgtgacagatgaccacgacaga
gagttttcagtcgcagacctctcggttcagatattcacggttccttcacttgctcgaatg
ctcatcacagaagaaaacttaatgagcattatcattaagacttttatggatcatttgaga
catcgagatgcccagggcagatttcagtttgaacgatacactgctttacaagccttcaaa
tttaggagagtacagagccttattttagatctcaagtatgtgttaattagcaaaccaact
gaatggtcagatgagctgaggcagaagttcctagaagggtttgatgcctttttggaatta
ctaaaatgtatgcagatttattactaccataatgtgaaatgcagacgtgagatgtttgac
aaggatgtagtaatgcttcaggatgttgttcagcagaacaatactctaatagaagaaatg
ctatacctcattataatgcttgttggagagagatttagtcctggagttggacaggtaaat
gctacagatgaaatcaagcgagagattatccatcagttgagtatcaagcctatggctcat
agtgaattggtaaagtctttacctgaagatgagaacaaggagactggcatggagagtgta
atcgaagcagttgcccatttcaagaaacctggattaacaggacgaggcatgtatgaactg
aaaccagaatgtgccaaagagttcaacttgtatttctatcacttttcaagggcagaacag
tccaagagttcaagggacaaagacaaagctgagaggaagagaaaagcagagattgccaga
ctgcgcagagaaaagatcatggctcagatgtctgaaatgcagcggcattttattgatgaa
aacaaagaactctttcagcagacattagaactggatgcctcaacctctgctgttcttgat
cataggtattttgattccgttcaagctaaagaacagcgaaggcaacagagattacgctta
catacgagctatgatgtagaaaacggagaattcctttgccccctttgtgaatgcttgagt
aatactgttattcctctgctgcttcctccaagaaatatttttaacaagatcccttattct
gagagcataaaagaaatgctaacgacatttggaactgctacctacaaggtgggactaaag
gttcatcccaatgaagaggatcctcgtgttcccataatgtgttggggtagctgcgcgtac
accatccaaagcatagaagagaatggcatggatcaagaaaatcccccttgtgaagaagaa
tcagcagttcttgctttgtataaaacacttcaccagtatacgggaagatatccaagagaa
tctaacaaattaataaaccttccagaggattacagcagcctcattaatcaagcatccaat
ttctcgtgcccgaaatcaggtggtgataagagcagagccccaactctgtgccttgtgtgc
ggatctctgctgtgctcccagagttactgctgccagactgaactggaaggggaggatgta
ggagcctgcacagctcacacctactcctgtggctctggagtgggcatcttcctgagagta
cgggaatgtcaggtgctatttttagctggcaaaaccaaaggctgtttttattctcctcct
taccttgatgactatggggagaccgaccagggactcagacggggaaatcctttacattta
tgcaaagagcgattcaagaagattcagaagctctggcaccaacacagtgtcacagaggaa
attggacatgcacaggaagccaatcagacactggttggcattgactggcaacatttataa

>gi568815592r:42598298_42822334|GENSCAN_predicted_peptide_2|171_aa
MLGAPPISLENLLAAVSLSRIKSNVDGRYLVDGVPFSCCNPSSPRPCIQYQITNNSAHYS
YDHQTEELNLWVRGCRAALLSYYSSLMNSMGVVTLLIWLFEVTITIGLRYLQTSLDGVSN
PEESESESQGWLLERSVPETWKAFLESVKKLGKGNQVEAEGADAGQAPEAG

>gi568815592r:42598298_42822334|GENSCAN_predicted_CDS_2|516_bp
atgctgggagccccgcccatcagcctggagaacctgcttgcagctgtgtcccttagtcga
atcaagagcaacgtggatgggcggtacctggtggacggcgtccctttcagctgctgcaat
cctagctcgccacggccctgcatccagtatcagatcaccaacaactcagcacactacagt
tacgaccaccagacggaggagctcaacctgtgggtgcgtggctgcagggctgccctgctg
agctactacagcagcctcatgaactccatgggtgtcgtcacgctcctcatttggctcttc
gaggtgaccattacaattgggctgcgctacctacagacgtcgctggatggtgtgtccaac
cccgaggaatctgagagcgagagccagggctggctgctggagaggagcgtgccggagacc
tggaaggcctttctggagagtgtgaagaagctgggcaagggcaaccaggtggaagccgag
ggcgcagacgcaggccaggccccagaggctggctga

>gi568815592r:42598298_42822334|GENSCAN_predicted_peptide_3|194_aa
MALLKVKFDQKKRVKLAQGLWLMNWFSVLAGIIIFSLGLFLKIELRKRSDVMNNSESHFV
PNSLIGMGVLSCVFNSLAGKICYDALDPAKYARWKPWLKPYLAICVLFNIILFLVALCCF
LLRGSLENTLGQGLKNGMKYYRDTDTPGRCFMKKTIDMLQIEFKCCGNNGFRDWFEIQWI
SNRYLDFSSKEVKE

>gi568815592r:42598298_42822334|GENSCAN_predicted_CDS_3|585_bp
atggcgctactgaaagtcaagtttgaccagaagaagcgggtcaagttggcccaagggctc
tggctcatgaactggttctccgtgttggctggcatcatcatcttcagcctaggactgttc
ctgaagattgaactccgaaagaggagcgatgtgatgaataattctgagagccattttgtg
cccaactcattgatagggatgggggtgctatcctgtgtcttcaactcgctggctgggaag
atctgctacgacgccctggacccagccaagtatgccagatggaagccctggctgaagccg
tacctggctatctgtgttctcttcaacatcatcctcttccttgtggctctctgctgcttt
ctgcttcggggctcgctggagaacaccctgggccaagggctcaagaacggcatgaagtac
taccgggacacagacacccctggcaggtgtttcatgaagaagaccatcgacatgctgcag
atcgagttcaaatgctgcggcaacaacggttttcgggactggtttgagattcagtggatc
agcaatcgctacctggacttttcctccaaagaagtcaaagagtga

>gi568815592r:42598298_42822334|GENSCAN_predicted_peptide_4|134_aa
MLIVFAAACVLAFAPGPASPRSSPPPRPQSLPPPHPQTMSESKNGPEYASFFAVMAASAA
MVFSAPRAAYGTVKTGAGIAAMSVMRPELIMKSIIPVVTAGIIAIYGLVVTVLIASSPND
DISLYRSCLQLAPA

>gi568815592r:42598298_42822334|GENSCAN_predicted_CDS_4|405_bp
atgctgatcgtcttcgccgccgcctgcgtgctcgccttcgcgcccggcccggcctcgccc
cggtcttcgcctccgcctcggccgcagagcttgccccctccccacccgcagacaatgtcc
gagtccaagaacggccccgagtatgcttcgtttttcgccgtcatggcagcctcggccgcc
atggtcttcagcgccccgcgcgctgcctatggcacggtcaagaccggtgccggcatcgcg
gccatgtctgtcatgcggccggagctgatcatgaagtccatcatcccggtggtcacggct
ggcatcatcgccatctatggcctggtggtgacagtcctcatcgccagctccccgaatgac
gacatcagcctctacaggagctgcctccagctagcgccggcctga

>gi568815592r:42598298_42822334|GENSCAN_predicted_peptide_5|346_aa
MESVSCSAAAVRTGDMESQRDLSLVPERLQRREQERQLEVERRKQKRQNQEVEKENSHFF
VATFVRERAAVEELLERAESVERLEEAASRLQGLQKLINDSVFFLAAYDLRQGQEALARL
QAALAERRRGLQPKKRFAFKTRGKDAASSTKVDAAPGIPPAVESIQDSPLPKKAEGDLGP
SWVCGFSNLESQVLEKRASELHQRDVLLTELSNCTVRLYGNPNTLRLTKAHSCKLLCGPV
STSVFLEDCSDCVLAVACQQLRIHSTKDTRIFLQVTSRAIVEDCSGIQFAPYTWSYPEID
KDFESSGLDRSKNNWNDVDDFNWLARDMASPNWSILPEEERNIQWD

>gi568815592r:42598298_42822334|GENSCAN_predicted_CDS_5|1041_bp
atggagtccgtcagttgctccgctgctgctgtcaggaccggagacatggagtcccagcgg
gacctgagcctggtgcctgagcggcttcagagacgcgaacaagaacggcagctggaagtt
gaaaggcggaaacaaaagcggcagaaccaggaggtagagaaggagaacagccactttttc
gtcgccacctttgttcgggagcgagcggccgtggaagagcttctggagcgcgcggagtcg
gtcgagcggctggaggaggcggcctctcggctccaggggctgcagaaactaatcaacgac
tcagtttttttcctagccgcttacgacctgcggcagggacaagaggcgctggcgcggctg
caggcggccttggccgagcggcgccgggggctgcagcccaagaagcgtttcgctttcaag
acccggggaaaggatgctgcttcgtctaccaaagtagacgcggctcctggcatccccccg
gcagttgaaagcatacaggactccccgctgcccaagaaggcggaaggagacctcggcccc
agctgggtctgcggtttctccaacctggagtcccaagtcttggagaagagagccagcgag
ttgcaccagcgcgacgttcttttgaccgaactgagcaactgcacggtcagactgtatgga
aatcccaacaccctgcggctaaccaaggcccacagctgcaagctgctctgcggtccggtg
tctacctctgttttcctggaggactgcagtgactgcgtgctggcagtggcctgccaacag
ctccgcatacacagtacgaaagacacccgcatcttcctgcaggtgaccagcagggccatc
gtggaggactgcagtgggatccagttcgccccttacacctggagctacccggagatcgac
aaggacttcgagagctctggtttagataggagcaaaaataactggaacgatgttgacgat
tttaactggctggcccgggatatggcctccccaaactggagtattcttcctgaagaggag
cgaaatatccagtgggactaa

>gi568815592r:42598298_42822334|GENSCAN_predicted_peptide_6|69_aa
MRLGNNWLRYRELGDSGKPKGNETRWETVTIAQGEEDEEGAKPEEREPGQRDRERDQRAL
TVFVNWVDQ

>gi568815592r:42598298_42822334|GENSCAN_predicted_CDS_6|210_bp
atgaggctgggcaataactggctacggtatagagaactgggggattcagggaagcccaag
ggtaacgagaccagatgggaaactgtaacaatagcccagggagaagaagatgaggaagga
gccaaacctgaagagagggagccaggccagagagacagagagagagatcagagagcatta
acagtatttgtcaactgggtggaccagtga

>gi568815592r:42598298_42822334|GENSCAN_predicted_peptide_7|111_aa
MRRPSRTSLLAGAANSAGLAGDARLGRRSSARLGARPDAAADPRRCFEKAARRLGRGSPA
AEMYWFIQETVLFGFKTITKIHETGTYHNQISISCDSSTAADKCRNANIGE

>gi568815592r:42598298_42822334|GENSCAN_predicted_CDS_7|336_bp
atgcgccgtcccagccgcacatctcttctggccggcgcggcgaacagcgcgggcctcgca
ggagacgcccggctgggccgccggtcctccgcgcggctgggggctcggccggacgcggcc
gcggatcctcggcggtgcttcgaaaaggcagcgaggcggctgggaaggggcagtcccgcg
gcagaaatgtattggtttattcaagaaacagttttatttggctttaaaacaattactaag
atccacgagacaggcacataccacaaccaaatcagcatatcttgtgacagttccacagct
gctgacaaatgcagaaatgcgaatattggagaataa

>gi568815592r:42598298_42822334|GENSCAN_predicted_peptide_8|111_aa
MVNLAMETEIRENQDIHDYFSKQENSLGNKGETPSQNNNNNNNNNNNKKKHEKLKEPKLT
FQGSSSKSPPCNYSQASGTASPLSNLQGKTRILGELITCLLFAALITAECC

>gi568815592r:42598298_42822334|GENSCAN_predicted_CDS_8|336_bp
atggtgaatctggcaatggaaacagaaatccgtgaaaatcaagacatccatgattatttt
agcaagcaagagaatagcctgggcaacaagggcgaaactccatctcaaaacaacaacaac
aacaacaacaacaacaacaacaaaaaaaaacacgaaaagctgaaagagccaaaactaaca
tttcaaggttcatcatcaaaatcacccccttgcaactactctcaagcctctggcactgct
tctcctctgtcaaacctgcagggcaaaacacgaattttgggtgaactgatcacctgcctg
ctcttcgctgccctcataacagctgaatgttgctag