GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:23:19 Sequence gi568815583r:98339634_98580780 : 241147 bp : 44.53% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.27 PlyA - 836 831 6 1.05 1.26 Term - 20041 19973 69 1 0 106 36 68 0.272 1.34 1.25 Intr - 24668 24581 88 1 1 76 92 70 0.396 6.17 1.24 Intr - 30125 29988 138 1 0 62 59 80 0.325 2.08 1.23 Intr - 34042 33996 47 1 2 74 105 23 0.184 -0.19 1.22 Intr - 35832 35743 90 0 0 64 116 12 0.201 1.79 1.21 Intr - 55400 55296 105 2 0 28 91 63 0.068 1.01 1.20 Intr - 60298 60234 65 2 2 150 94 -12 0.061 4.04 1.19 Intr - 64521 64344 178 0 1 73 38 95 0.030 2.49 1.18 Intr - 83108 82846 263 0 2 74 110 50 0.084 3.01 1.17 Intr - 87899 87769 131 1 2 67 94 46 0.281 3.44 1.16 Intr - 89146 88857 290 1 2 43 46 143 0.539 1.64 1.15 Intr - 91244 91105 140 0 2 63 94 80 0.434 6.28 1.14 Intr - 94010 93925 86 1 2 90 65 38 0.189 1.16 1.13 Intr - 100117 100011 107 1 2 75 37 124 0.267 5.01 1.12 Intr - 101534 101438 97 1 1 5 86 54 0.108 -3.09 1.11 Intr - 102322 102156 167 1 2 61 105 58 0.166 3.56 1.10 Intr - 106495 106395 101 2 2 116 26 62 0.224 2.63 1.09 Intr - 109517 109310 208 2 1 16 35 172 0.306 3.35 1.08 Intr - 110749 110615 135 1 0 88 2 85 0.255 0.66 1.07 Intr - 111178 111098 81 1 0 127 61 5 0.329 1.53 1.06 Intr - 112377 112229 149 1 2 129 20 185 0.452 15.75 1.05 Intr - 118997 118959 39 0 0 121 93 -1 0.006 1.90 1.04 Intr - 135739 135719 21 2 0 78 107 18 0.168 0.22 1.03 Intr - 141144 140967 178 0 1 88 100 45 0.912 5.19 1.02 Intr - 146344 146259 86 2 2 84 113 43 0.968 5.94 1.01 Init - 147501 147399 103 0 1 74 74 63 0.715 3.90 1.00 Prom - 149191 149152 40 -4.16 2.00 Prom + 158708 158747 40 -5.26 2.01 Init + 161663 161746 84 1 0 86 66 38 0.217 2.30 2.02 Intr + 165562 165694 133 0 1 41 48 105 0.213 2.02 2.03 Term + 182323 182483 161 2 2 92 49 92 0.811 3.80 2.04 PlyA + 182643 182648 6 1.05 3.00 Prom + 187594 187633 40 -4.36 3.01 Init + 189007 189054 48 0 0 90 116 -11 0.608 2.85 3.02 Intr + 197930 198053 124 1 1 102 66 73 0.370 6.66 3.03 Intr + 210425 210550 126 0 0 65 44 108 0.017 4.75 3.04 Term + 221866 222080 215 2 2 73 41 118 0.556 2.99 3.05 PlyA + 223097 223102 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:98339634_98580780|GENSCAN_predicted_peptide_1|1053_aa MSRSPLLSSGTHVYTWNGSGDASEAESAGLGDAWVVAVYIKESWCSTEDALRTSDPAREG LMKVQSFGERVVLFILNAIIFGRLERNLDDDDMFFLPHSVKEQAKILWRRGAAVGFYTTK MKGGINPVAGKSGISSLFGSVAGRLCGDGTGACYLLPVFDTVFIRRKHWHRGLGTAMLRD FCETFPEDEALGVQEHSGAQAAGPLSTPMSFQALSSLFQGGKDSTGPRGGTHLAHAAPSN LAQGARTPEFLPVATVGLSGPHPMAEGFEEAAEEKFEASRGWLMKFKERRCLHNIKVQDE AAGADTEATASYLEDLAQIIDKGGYTTQQIFNVVCRQFLLVCLEERGRLWEVERRGAWGQ CTNIWLKGSEHHTHTYGFGGVGCGGAKEDKKVIPFQVGTDLVGEEGDSLNVGDYDLSCEQ NHRAAHPGNSEDVSRHARTSQNDRPRQPAPGDGSKERMCGEELEDTKDDPECGVEEEDAG LAGQPPGKLTRAQNITMAKQEGLELGILPSSNHSNAVTGGSLDTWCVVLLVSADASTPNP CPNQEVTLWPASSRSQPASSLSLATWASSSFRGKNTACRSQGSGHGPGGALEKTETRVAA VSPILAGGRKRRSQDRLAESADSRKKKRRGRLLAARPLSEGNTHGGHTMAALPKPKPAGR APSFFDVLFGFFSAAAAAAAKILWGQCASFIKWGQCLSGRIRWEDEECKPPECKHLALTD CLDAALGTNSLCRPGAGSDFNSWWIWAARAQRVGESSTFSSIVPPFPCPQSASLAFPEKS HTEALELCHCRWSRGCDEAPRAKCCGIGSKTYSPESAWVRGPWGPHGTDLKDLQGWKPGK QKSFIIAQKFKATGFKFWLLPPSRLELGGEKQGKESSLKIGAANIDKKIWKLARVNAGNC ANRKGLPGGQGWGDDAPYSHPPRPIDLYEFTISFLFAEDKPHSFTLAGPVNSSSSWPSCS SKDIPVLSALEPGTLGLHFMLHMDPEQHQRRLNLLVSWSKLLEFWEYVSLHAHLRGPLPL WSNSTTKASEAPIQHEDGEDKGPYDDPLPLNST >gi568815583r:98339634_98580780|GENSCAN_predicted_CDS_1|3162_bp atgtccaggtctcccctgctgagcagcgggacccatgtctacacttggaatggttctggg gatgcttctgaggcagagtcagcagggcttggagatgcctgggttgtggctgtttacata aaggaatcctggtgctccacggaagatgcactaagaacctctgatcctgcaagagaaggg ttgatgaaggttcagtcgtttggggaaagggttgtgcttttcattctaaacgccattatt tttggaagattggagagaaatttggatgatgatgacatgttttttctacctcactctgtg aaggaacaagctaaaatcctgtggagacgtggagcggctgttgggttttacacaaccaag atgaaaggtggaattaatcctgtggctggtaaatcaggtatttcttctttgtttgggtcc gttgctggcagactgtgtggtgatggcaccggtgcatgctacctgctgcctgtctttgac accgtgttcatcaggaggaaacactggcaccgaggcttgggaacagccatgctgcgggac ttctgtgagacattcccagaggacgaggccctgggggtgcaggagcattcaggggcccag gcagctggtcctctgtccacacccatgtccttccaggccctctcctctttgtttcagggg ggcaaggactccacgggcccaagaggaggtacccacctggctcatgctgctccttctaat ctggctcagggtgccaggaccccagaattcctgccagtggccactgtgggcctgagtgga cctcatcccatggctgagggatttgaggaagctgcagaagaaaagtttgaagctagcaga ggttggctcatgaagtttaaggaaagacgctgtctccataacataaaagtgcaggatgaa gcagcaggtgctgatacagaagctacagcaagttatctggaagatctagctcagatcatt gataaaggtggctacactacacaacagattttcaatgtagtctgcaggcagttcctgctg gtctgtctggaggagcgaggacgcctgtgggaggtggagcgccggggggcctggggccag tgcaccaatatctggctaaagggatctgaacaccacacacacacatatggatttgggggc gtggggtgtggaggtgcaaaggaggacaagaaagtcatccctttccaagtcggtactgat ttggtaggagaggagggagactcactgaacgtgggtgattatgacctctcttgtgagcaa aaccaccgggcagcacacccaggaaattccgaggacgtgagccgtcatgcgaggacttct cagaatgacagacccagacagcctgcccctggagacggcagcaaggagaggatgtgtgga gaagaactggaagacacaaaggatgatccagagtgtggagtggaagaggaggatgccggg ctggcagggcagccaccaggcaagcttacaagggcacaaaacatcaccatggcaaagcaa gagggattggagctgggtattctaccgagctcaaaccacagcaatgcagtcacgggaggc tccctggacacctggtgtgtggtcctgctggtgtctgcggatgcttctaccccaaacccc tgtcctaaccaggaagtgacgttgtggccggcaagttctcgctctcaacctgcatcgtct ctgtccctggccacatgggcaagttcttcctttagaggaaaaaacaccgcctgccgcagt caaggctcggggcacggcccaggaggtgccctggagaagactgagacccgggttgcagca gtgagccccattctggctggtgggagaaaacggcgatctcaggacagacttgcggagagc gcggactcccgtaaaaagaagaggcgcgggcgcctgctcgctgcccggccgctttccgag ggaaatacccacggcggccacacgatggcagccttgcccaaaccaaagcccgccgggcgc gctccgagtttttttgatgtgttatttggcttcttttcagcagcagcagcagcagcagcc aaaatactctggggccagtgtgcttcgtttataaaatggggacagtgtctctctggaagg attcgctgggaagatgaagaatgtaagcctccagagtgtaagcatcttgccctcactgac tgtctggatgctgccttggggaccaacagcctttgcaggcctggggctgggtccgacttc aactcctggtggatctgggctgccagagctcagcgggtgggagagagctccacgttcagc tccatcgtaccccctttcccatgcccccaaagcgcctctctggcttttcctgaaaagtcc cacactgaggctctggaactctgccattgtaggtggagcaggggatgcgatgaggcccca agggctaaatgttgtggcattggaagcaaaacctacagcccagagtcagcttgggtgaga ggcccatggggccctcacggcaccgacctcaaagacctacagggatggaagcccggcaaa cagaaatccttcataattgcccagaagttcaaagctactggtttcaagttctggcttctg cctccctcaaggctagagcttgggggagagaaacagggaaaagagagcagcctaaaaatt ggagctgcaaacatagataagaaaatctggaagcttgcacgggtgaatgctggcaactgt gccaatagaaaagggctacctgggggccagggttggggggatgatgcaccttattcccac cccccccgaccaattgatttgtatgagtttactattagttttctttttgcagaggataag ccacactccttcaccctagcaggtccagtcaattccagtagcagctggccaagctgttct tccaaggacatcccagtgttgtccgccctggagccaggcacactggggctgcatttcatg ctgcacatggacccagagcagcaccagagaaggctcaacttgctagtcagttggtcaaag cttttggaattctgggaatatgtcagcttgcatgcacatctccgaggccctcttcctctt tggagtaattcaacgaccaaggccagcgaggcacctattcaacatgaagatggcgaggat aaaggcccttatgatgatcctcttccacttaatagtacataa >gi568815583r:98339634_98580780|GENSCAN_predicted_peptide_2|125_aa MAALGSQGTIFPGSRVLLKLQPKEVELVVVPEAGSGKSPLCILRVHESAGPSHFKARKIA ITAPHAKTQTSLDKPGRKHLLGAPCGTAASPACFGSTIPPANHYSGIKDCNKWQKVSSLQ STKFA >gi568815583r:98339634_98580780|GENSCAN_predicted_CDS_2|378_bp atggctgctctgggcagccaaggtactattttcccaggaagcagggtactgctcaagctc cagcccaaggaggtagagctggtggtggtccctgaagcaggctccgggaaatctccactc tgcatccttcgtgttcatgagtctgcggggccatcacacttcaaagccagaaaaattgca atcaccgctccacacgccaaaacccagacttcactagataagcctggacgtaaacaccta cttggtgccccttgtggaacagcagcttcccctgcttgtttcggaagcaccattccccct gccaatcactattctggaatcaaagactgcaataagtggcaaaaagtgtcaagtcttcag tccaccaagtttgcctag >gi568815583r:98339634_98580780|GENSCAN_predicted_peptide_3|170_aa MVTAAHPSTQRRREQKMGRTTCERSTGRTLNVPAIQKTKSASDDYDVFTDADQQVSIAPR PIDHPTAEEYGHTAQDWQAAPPAASVQDPLGKASWAPESVRAPCVYIEPTRMTPESLPEF PLPGKVWYIHRDRGFGMDIFEGATILPTTSSKKPTDDSEAAAGAVEGRFT >gi568815583r:98339634_98580780|GENSCAN_predicted_CDS_3|513_bp atggtgacagcagcacatccctctacccagaggaggagagaacagaagatgggcaggact acctgtgagcgcagcactgggagaactttgaatgtcccagccattcagaaaactaaaagt gcaagtgatgattatgatgtcttcacagatgcagaccaacaggtctccatagcgcccagg cccatcgaccacccaacggctgaggagtacgggcacactgcacaggactggcaggcagct ccacctgcggcctccgtgcaggatccactgggtaaagccagctgggctcctgagtctgtt agggccccttgtgtttacattgagcccaccaggatgaccccagagagtctccctgaattt cctttgccaggtaaggtgtggtatatccatagggaccggggattcggaatggacatcttt gaaggagccacaattctgcctaccacatcttcaaagaagcctacagatgactctgaagct gcagccggcgcagttgagggacgtttcacctga