GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:25:40

Sequence gi568815584f:34896710_35129149 : 232440 bp : 41.72% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8788   8844   57  0  0  111   64    34 0.191   2.66
 1.02 Term +  23954  24298  345  1  0   43   48   309 0.497  15.91
 1.03 PlyA +  25061  25066    6                               1.05

 2.00 Prom +  27069  27108   40                              -4.85
 2.01 Init +  42616  42797  182  0  2   91 -128   339 0.308  11.31
 2.02 Intr +  42972  43202  231  0  0   70   51   219 0.575  12.37
 2.03 Term +  43218  43626  409  0  1  -31   39   450 0.981  21.90
 2.04 PlyA +  43760  43765    6                               1.05

 3.00 Prom +  64365  64404   40                              -1.95
 3.01 Init +  65537  65619   83  1  2   66   57    80 0.319   3.19
 3.02 Intr +  85219  85293   75  1  0   74   98    94 0.272   6.71
 3.03 Intr +  85701  85906  206  0  2   62   30   200 0.700   9.52
 3.04 Intr +  86421  86506   86  1  2   73  115    60 0.761   5.82
 3.05 Intr + 102849 102940   92  2  2  114   97    95 0.894  10.87
 3.06 Intr + 114800 114950  151  2  1   50   92   166 0.825  12.44
 3.07 Intr + 116637 116785  149  2  2   44   85   139 0.803   7.31
 3.08 Intr + 117093 117193  101  0  2  108   76    46 0.963   4.23
 3.09 Intr + 118035 118121   87  1  0  113   31   113 0.724   7.12
 3.10 Intr + 122282 122365   84  0  0   26  116    53 0.250   0.77
 3.11 Intr + 126201 126371  171  1  0   44  116   134 0.944  10.79
 3.12 Intr + 137489 137634  146  0  2   90   22    88 0.001   1.48
 3.13 Intr + 149299 149421  123  0  0   27   86   105 0.008   4.06
 3.14 Intr + 149719 149919  201  0  0   37   96   206 0.008  14.86
 3.15 Intr + 156569 156742  174  1  0   28   94   186 0.039  12.41
 3.16 Intr + 165107 165197   91  1  1   46   39   106 0.021   0.15
 3.17 Intr + 182218 182315   98  2  2   70   69   161 0.997  11.21
 3.18 Term + 184313 184519  207  1  0   92   38   248 0.977  16.56
 3.19 PlyA + 184996 185001    6                               1.05

 4.08 PlyA - 185203 185198    6                               1.05
 4.07 Term - 189069 188881  189  0  0  112   39   203 0.997  14.27
 4.06 Intr - 194498 194361  138  2  0  104  110    44 0.992   7.94
 4.05 Intr - 198475 198339  137  2  2   82   22   119 0.959   4.07
 4.04 Intr - 202675 202543  133  1  1  109   98    16 0.524   4.00
 4.03 Intr - 213920 213816  105  2  0   40  110    83 0.908   5.19
 4.02 Intr - 220028 219901  128  0  2   72  110    46 0.609   4.68
 4.01 Init - 220241 220205   37  2  1   57   34    53 0.182  -2.98
 4.00 Prom - 223125 223086   40                              -6.75

 5.00 Prom + 224826 224865   40                              -7.55
 5.01 Init + 225481 225573   93  0  0   59   13   105 0.080   0.53
 5.02 Intr + 225774 225926  153  2  0   66   84   102 0.095   6.95
 5.03 Intr + 225988 226058   71  0  2   44   72    86 0.050  -0.34
 5.04 Intr + 226102 226201  100  1  1   56   94    84 0.093   4.99
 5.05 Intr + 226243 226400  158  0  2  100   56    65 0.068   2.39
 5.06 Intr + 226566 227522  957  0  0   32   76   393 0.066  21.75
 5.07 Intr + 230770 230902  133  1  1   32  109   154 0.976  11.63

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100078   78  1  0   85   87    58 0.948   6.51
S.002 Term - 143154 143060   95  1  2   85   49    80 0.864   0.81
S.003 Intr - 149908 149771  138  2  0   71  119   228 0.916  23.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:34896710_35129149|GENSCAN_predicted_peptide_1|133_aa
MDGQARWLTPVIPALWEAEVFIEHMLYALNILRTVLGRARTLSLNHRCRLLLLSLLVLHC
VRSVRSWYLFCEAAAEKTLAFAMAEEKPKALSMGQIRFRFDSQPINETDTPVQVEMEDID
IIDVFHQQIGGVY

>gi568815584f:34896710_35129149|GENSCAN_predicted_CDS_1|402_bp
atggatggccaggcgcgatggctcacacctgtaatcccagcactttgggaggccgaggta
tttattgagcatatgttgtatgccctcaacattttaagaactgtacttggacgtgcacgc
accctgtccctcaaccaccgctgccgcctccttcttctgtcactcctggtgctgcactgt
gttcgttcagtgagaagctggtacctcttttgtgaagcggcagctgagaagactctggca
tttgccatggccgaggaaaagcccaaggcattgtcaatggggcagatcagattccgattt
gacagtcagccaatcaatgaaacagacacacctgtacaggtggaaatggaggatatagat
ataattgatgtgttccaccagcagataggaggtgtctactga

>gi568815584f:34896710_35129149|GENSCAN_predicted_peptide_2|273_aa
MAAEDKLLLPQLPELFETSKQLLDDVEVVTEPAGSRIVQKKVFKGLDLLEKAAEMLSQLD
LWQSLSCRKPRTQLKNHTANFSMAYPSLIAMASQRQAKRERHKQKELEHRLSAIKSAVET
GQADDERVREYYLLHLQSLEEIESIDQEIKILRERDSSREASTSNSSRQERPPVKPFILT
RNMAQVKVFGAGYPSLATMTVSDWYKQHWKYGALPDQGIAKATPEEFRKAAQQQEGQEEK
EEEDDEQTLYRVREWDDWKGIHPRDYGSQQSMG

>gi568815584f:34896710_35129149|GENSCAN_predicted_CDS_2|822_bp
atggctgctgaagacaaattactgctgccacagctccccgagctgtttgaaaccagcaaa
cagcttctggatgatgtagaagtagtgactgaacctgccggttcccggatagtccagaag
aaggtgttcaagggcctggacctccttgagaaggctgccgaaatgttatcgcagcttgac
ttgtggcagagcttgagctgccgaaaaccaagaactcagctgaaaaatcacactgctaat
ttctccatggcttatcctagcctcattgctatggcatctcaaagacaggctaaaagagag
agacacaagcagaaggagttggagcacaggttgtctgcaatcaaatctgctgtggaaact
ggtcaagcagatgatgagcgtgttcgtgaatattatcttcttcaccttcagagcttagaa
gagattgagagcattgaccaggaaataaagatcctgagagaaagagactcttcaagagag
gcatcaacttctaattcatctcgccaggagaggcctccagtgaaacccttcattctcact
cggaacatggcccaagtcaaagtatttggagctggttatccaagtctggcaactatgacg
gtgagtgactggtataagcaacattggaaatatggagcattaccagatcagggaatagcc
aaggcaacaccagaggaattcagaaaagcagctcagcaacaggaaggtcaagaagaaaag
gaggaagaggatgatgaacaaacactctacagagttcgggagtgggatgactggaagggc
atccatcctagggactatggcagccaacagagcatgggctga

>gi568815584f:34896710_35129149|GENSCAN_predicted_peptide_3|774_aa
MRHCTWPKSFYRVADGHRSYKPGKYVDGPFSTEQIHLRDPRRNSHRQAVASEGSGHLHLR
LVRIHCNSAWRSAGATVRSGRSITACLAVGNFTLLEQAAQALAGIAHAQAETAPPSAEQA
SAFLGFFRRRGASLRNSRPLRATALAVSFAVLNAMLKEVCTALLEADVNIKLVKQLRENV
NYTEMDPVIIASEGVEKFKNENFEIIIVDTSGRHKQEDSLFEEMLQVANAIQPDNIVYVM
DASIGQACEAQAKAFKDKVDVASVIVTKLDGHAKGGGALSAVAATKSPIIFIGTGEHIDD
FEPFKTQPFISKLLGMGDIEGLIDKVNELKLDDNEALIEKLKHDFMSKGNEQESMARLKK
LMTIMDSMNDQELDSTDGAKVFSKQPGRIQRVARGSGVSTRDVQELLTQYTKFAQMVKKM
GGIKGLFKGNSPPPGCFHRLVLSVAFPGAQCKQLVDLPFWRLEDGGPFLTAPLSSAPPLK
YNDTAVFILQVQKPRSCGVTAPDAAGDGLRRGRDARLQPGSASDWAGRPRMEVGLPAITL
FLTSASSPVVATTMDQEPVGGVERGEAVAASGAAAAAAFGESAGQMSNERGFENVELGVI
GKKKKVPRRVIHFVSGETMEEYSTDEDEVDGLEKKDVLPTVDPNLPLLPYELGKGDQGPG
VLSGAMLKVEPPFLCDFLGEKIASVLGISTPKYQYAIDEYYRMKKEEEEEEEENRMSEEA
EKQYQQNKLQTDSIVQTDQPETVISSSFVNVNFEMEGDSEVIMESKQNPVSVPP

>gi568815584f:34896710_35129149|GENSCAN_predicted_CDS_3|2325_bp
atgaggcattgcacctggccaaaaagtttttatcgggttgctgatggccacagaagctac
aagccaggaaagtatgtggatggaccattctctacagagcaaattcatctacgagatcca
agacgcaactcccaccgccaggcggtcgcctcagaaggctctggtcacttacacttacgg
ttagttcgcatccactgtaacagcgcctggcggtcggcaggagccacagtgcgaagcggc
cgcagcatcactgcctgcctcgcagtgggaaattttaccttgctggagcaagccgcgcag
gcactggctggcattgcgcatgcgcaagcagagaccgccccaccctccgcggaacaagcc
tccgctttcttgggtttcttccgacggcgtggggcctcgctaaggaattcccggcccctc
agggccacggctttagcggtgtcttttgcggtattgaatgctatgctaaaagaagtctgt
accgctttgttggaagcagatgttaatattaaactagtgaagcaactaagagaaaatgtt
aactatacagaaatggatcctgtcatcattgcttctgaaggagtagagaaatttaaaaat
gaaaattttgaaattattattgttgatacaagtggccgccacaaacaagaagactctttg
tttgaagaaatgcttcaagttgctaatgctatacaacctgataacattgtttatgtgatg
gatgcctccattgggcaggcttgtgaagcccaggctaaggcttttaaagataaagtagat
gtagcctcagtaatagtgacaaaacttgatggccatgcaaaaggaggtggtgcactcagt
gcagtcgctgccacaaaaagtccgattattttcattggtacaggggaacatatagatgac
tttgaacctttcaaaacacagccttttattagcaaacttcttggtatgggcgacattgaa
ggactgatagataaagtcaacgagttgaagttggatgacaatgaagcacttatagagaag
ttgaaacatgattttatgagcaaaggaaatgaacaggagtcaatggcaaggctaaagaaa
ttaatgacaataatggatagtatgaatgatcaagaactagacagtacggatggtgccaaa
gtttttagtaaacaaccaggaagaatccaaagagtagcaagaggatcgggtgtatcaaca
agagatgttcaagaacttttgacacaatataccaagtttgcacagatggtaaaaaagatg
ggaggtatcaaaggacttttcaaagggaacagcccccctcccggctgctttcacaggctg
gtgttgtctgtggcttttccaggtgcacagtgcaagcagttggtggatctaccattctgg
cgtctggaggatggtggcccttttctcacagctccactaagtagtgccccacctttgaag
tacaacgacacagctgtcttcattttacaagtgcaaaaaccgaggtcctgcggggtgact
gccccagacgccgctggtgacgggctgcgccgaggtcgagacgccaggcttcagcctggc
tcggccagcgactgggcggggagaccaaggatggaagtgggcttaccggccattaccctc
tttctcaccagcgccagcagccctgtggtggcgacgacgatggaccaggagccagtgggc
ggtgtggaacgaggagaagccgtcgcagcctcgggagctgcggccgccgcggcattcggg
gaatctgcagggcagatgagtaacgaaagaggctttgaaaatgtagaactgggagtcata
ggaaaaaagaagaaagtcccaaggagagtcatccactttgttagtggtgaaacaatggaa
gaatatagcacagatgaagacgaagttgatggcctggagaagaaagatgttttgcctact
gttgatccgaacttgcctctgctgccttatgaactgggtaaaggcgatcagggccctggt
gttctcagtggtgccatgctcaaggtagagcctccattcttgtgtgacttccttggagag
aagattgcatctgttttgggtatcagcaccccaaagtaccaatatgccattgatgaatat
tatcggatgaagaaggaggaagaagaagaagaagaagaaaacaggatgtctgaagaagca
gaaaaacaatatcaacagaataaattgcagactgattccattgttcagacagatcaacca
gagacagtgatatccagctcatttgtgaatgtcaattttgaaatggagggagacagtgaa
gtaattatggaaagcaagcaaaatccagtctctgtcccaccataa

>gi568815584f:34896710_35129149|GENSCAN_predicted_peptide_4|288_aa
MVEDVEEAASVEEKKSEQELKDEEMDLFTKYYSEWKGGRKNTNEFYKTIPRFYYRLPAED
EVLLQKLREESRAVFLQRKSRELLDNEELQDLENYILELIPTLPQLDGLEKSFYSFYVCT
AVRKFFFFLDPLRTGQYLNLDKDHNGMLSKEELSRYGTATMTNVFLDRVFQECLTYDGEM
DYKTYLDFVLALENRKEPAALQYIFKLLDIENKGYLNVFSLNYFFRDEIFDMVKPKDPLK
ISLQDLINSNQGDTVTTILIDLNGFWTYENREALVANDSENSADLDDT

>gi568815584f:34896710_35129149|GENSCAN_predicted_CDS_4|867_bp
atggttgaagatgtagaggaggctgcttcagtagaggaaaaaaaaagtgaacaagaatta
aaagatgaagaaatggatttatttacaaaatattactccgaatggaaaggaggtagaaaa
aacacaaatgaattctataagaccattccccggttttattataggctgcctgctgaagat
gaagtcttactacagaaattaagagaggaatcaagagctgtctttctacaaagaaaaagc
agagaactgttagataatgaagaattacaggatttagaaaactacatattggaacttatc
cctacgttgccacaattagatggtctggaaaaatctttctactccttttatgtttgtaca
gcagttaggaagttcttcttctttttagatcctttaagaacaggccagtacttgaatctt
gataaagatcacaatggcatgctcagtaaagaagaactctcacgctatggaacagctacc
atgaccaatgtcttcttagaccgtgttttccaggagtgtctcacttatgatggagaaatg
gactataagacctacttggactttgtccttgcattagaaaacagaaaggaacctgcagct
ctacaatatattttcaaactgcttgatattgagaacaaaggatacctgaatgtcttttca
cttaattatttctttagggatgaaatctttgacatggtaaaaccaaaggatcctttgaaa
atctctcttcaggatttaatcaacagtaatcaaggagacacagtaaccaccattctaatc
gatttgaatggcttctggacttacgagaacagagaggctcttgttgcaaatgacagtgaa
aactctgcagaccttgatgatacatga

>gi568815584f:34896710_35129149|GENSCAN_predicted_peptide_5|555_aa
MVLLQRRPKLTKPDFKGHDKPAFSFIVCLWRAERLSPGQSRPSVGAGAVHRGHCSFRRVY
GRVKSECRFELLNEVNFICQRSLTRHLLPPRPPTGGVLRSVESYAGLELGVPGYGFGFTP
PFQQASRGNLFLFRTAVIHVHELECKRHQRIPALSPGLRLATLDIPGLLFNRENSPAFLL
LSSPKSRTLICLFPKLWKSPYLGLGPGHSYVSLFLADRCGIRNQQRLFSLKTMSPQNTKA
TNLIAKARYLRKDEGSNKQVYSVPHFFLAGAAKERSQMNSQTEDHALAPVRNTIQLPTQP
LNSEEWDKLKEDLKENTGKTSFESWIISQMAGCHSSIDVAKSLLAWVAAKNNGIVSYDLL
VKYLYLCVFHMQTSEVIDVFEIMKARYKTLEPRGYSLLIRGLIHSDRWREALLLLEDIKK
VITPSKKNYNDCIQGALLHQDVNTAWNLYQELLGHDIVPMLETLKAFFDFGKDIKDDNYS
NKLLDILSYLRNNQLYPGESFAHSIKTWFESGQCSGCGKTIESIQLSPEEYECLKGKIMR
DVIDGGDQYRKTTPQ

>gi568815584f:34896710_35129149|GENSCAN_predicted_CDS_5|1665_bp
atggttctccttcagaggcgacccaagctcaccaagccggattttaaaggccacgataag
cccgcgttttcctttatcgtgtgcctttggcgcgcggaacgcctaagtccgggtcagtct
cgtccgtcggtcggggctggcgcggtgcatcgtgggcactgtagtttccgccgcgtttat
ggccgcgttaagtctgagtgccgctttgagttgttgaatgaagtgaacttcatttgtcag
cgttcgctgactcgccacctcctccctcctcgtccccccaccggaggagttttgcggtct
gtagagagctatgcagggctggagttgggggtccctggatacggttttggctttacaccc
cctttccagcaagcttcccgtgggaatctgttccttttcaggacagctgtgatccacgtt
catgaactggaatgtaagaggcaccagaggattcctgctctgtcccctggtttgcggctt
gcgacgttggacatccccggattgttgtttaatagagaaaactcacctgccttcttgctt
ttaagtagccccaaaagcagaaccttgatttgtctctttccgaagctttggaagagccca
taccttgggctaggcccagggcactcttatgtctcgctgtttctggcagaccgctgtggc
atcaggaaccagcagaggttgttttctcttaaaacaatgtctccacagaataccaaagca
acgaatctgattgccaaggccagatatctcaggaaagatgagggcagtaataagcaagtt
tattctgttcctcatttttttttagctggagcagctaaggagagatcacagatgaattct
caaactgaagatcatgccttggcacctgtgaggaacactattcaactcccaacacaacct
ttgaattcagaggagtgggataaacttaaggaagatttaaaagaaaacaccggaaagacc
agtttcgaaagttggatcatttcacagatggctggctgtcatagctctatagatgtggct
aaatctctgctggcatgggtagcagccaaaaataatggtattgtaagttacgatttactg
gtcaagtatttgtatctctgtgtctttcatatgcagacatctgaagttattgatgtcttt
gaaattatgaaagccagatataagactttagaacctagaggttacagtcttctcatccgg
ggattgatccattcagacagatggagagaagcattgttgctgttagaggacatcaaaaaa
gttataactccttcaaaaaagaactataatgactgtatccagggagctctccttcatcaa
gatgtaaacacagcttggaatttatatcaggaattgctaggtcatgatattgttcctatg
ttggaaactttaaaagctttctttgattttggaaaagacataaaggatgataactattca
aataaactactagatattctttcatatctaagaaataatcagctgtatccaggggagtca
tttgcacacagtataaaaacatggtttgagagtggccagtgttcgggctgtggaaaaacc
atagagtctattcagctgagtccagaagaatatgaatgtcttaagggaaaaatcatgagg
gatgtgatagatggaggtgaccagtacagaaagacaacacctcag