GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:54:03 Sequence gi568815590r:123403376_123641014 : 237639 bp : 44.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3744 3868 125 2 2 48 37 116 0.801 2.14 1.02 Term + 7816 8008 193 1 1 81 43 135 0.695 5.19 1.03 PlyA + 8213 8218 6 -0.45 2.00 Prom + 8274 8313 40 -5.36 2.01 Init + 13475 13557 83 1 2 86 81 135 0.997 13.06 2.02 Intr + 24549 24648 100 0 1 60 115 59 0.854 5.91 2.03 Intr + 33078 33226 149 2 2 71 78 28 0.218 -0.87 2.04 Intr + 33835 33959 125 1 2 102 83 21 0.299 3.23 2.05 Intr + 44163 44217 55 1 1 78 100 -7 0.153 -2.46 2.06 Intr + 45343 45548 206 1 2 55 101 96 0.157 6.44 2.07 Intr + 51231 51440 210 1 0 28 107 106 0.606 5.38 2.08 Intr + 81416 81507 92 0 2 69 90 73 0.892 5.31 2.09 Term + 81654 81740 87 2 0 135 46 36 0.617 1.76 2.10 PlyA + 82586 82591 6 1.05 3.10 PlyA - 84674 84669 6 1.05 3.09 Term - 100087 99998 90 1 0 95 41 45 0.673 -1.78 3.08 Intr - 101372 101229 144 2 0 70 95 72 0.936 6.58 3.07 Intr - 103199 103017 183 2 0 128 76 268 0.999 29.58 3.06 Intr - 110007 109823 185 1 2 57 98 129 0.813 10.31 3.05 Intr - 110958 110865 94 0 1 79 105 92 0.801 9.54 3.04 Intr - 128615 128523 93 2 0 86 114 98 0.998 12.36 3.03 Intr - 129865 129816 50 2 2 97 83 36 0.996 2.40 3.02 Intr - 131439 131327 113 2 2 82 58 120 0.995 8.42 3.01 Init - 137639 137524 116 2 2 74 94 214 0.796 20.28 3.00 Prom - 137867 137828 40 -4.86 4.00 Prom + 139813 139852 40 -4.16 4.01 Sngl + 159795 159998 204 2 0 74 38 199 0.538 8.69 4.02 PlyA + 161152 161157 6 1.05 5.00 Prom + 162052 162091 40 1.64 5.01 Sngl + 178859 179320 462 1 0 73 43 177 0.894 7.86 5.02 PlyA + 180436 180441 6 1.05 6.07 PlyA - 181595 181590 6 1.05 6.06 Term - 184998 184864 135 0 0 84 47 108 0.403 4.32 6.05 Intr - 216834 216778 57 0 0 98 94 60 0.329 6.68 6.04 Intr - 220838 220681 158 0 2 31 43 59 0.140 -4.57 6.03 Intr - 221252 221116 137 1 2 66 98 52 0.499 4.21 6.02 Intr - 230789 230662 128 2 2 126 50 10 0.264 0.48 6.01 Init - 230912 230850 63 2 0 52 102 24 0.281 1.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 60343 60260 84 1 0 136 47 43 0.840 2.65 S.002 Init - 153827 153808 20 2 2 84 119 7 0.823 2.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:123403376_123641014|GENSCAN_predicted_peptide_1|105_aa MPRFASNHCKPGEKHGTDSSSKSPERINLADALISDFQTLELIKAAKLQIILQMQPQMQS MTKIYRGPLDWPASPCSDVNDIEGTPPQEISTAQPLLRPNSARSS >gi568815590r:123403376_123641014|GENSCAN_predicted_CDS_1|318_bp atgccaaggtttgccagcaaccactgcaagccaggagagaagcatggaacagattcttcc tccaaatctccagaaagaatcaacctggctgacgctttgatttcagacttccagactctt gaactaatcaaagctgcaaaactacaaatcattcttcaaatgcagccccagatgcagtct atgactaagatctaccgcggacccctggactggcctgctagcccatgctccgatgttaat gacatcgaaggcacccctccccaggaaatttcaactgcacaacccctactacgccccaat tcagcaagaagcagttag >gi568815590r:123403376_123641014|GENSCAN_predicted_peptide_2|368_aa MEGNGPAAVHYQPASPPRDACVYSSCYCEENIWKLCEYIKNHDQYPLEECYAVFISNERK MDYHVVLLHVSSGGQNFIYDLDTVLPFPCLFDTYVEDAFKSDDDIHPQFRRKFRVIRADS YLKNFASDRSHMKDSSGNWREPPPPYPCIETGDEETDAERGFHIAHQVTSQMFTNPEHRR LSSPVVLHEVQEPCGKEAKWSVFSFRVQTFIGLSRGLEVLKDVFARGEKNMNRSLRENQC VAVGDALTSSLGAQTCELWQYCHWFRASTTSYGTEPQCHQVVSASGYCSSVSNGPFILPS RRSPTPGDHGHKCEVVSYSSNRKRMQGDSQPLRIKQQQGEARPSQKLQARNPQMQFSRST SQATKKTK >gi568815590r:123403376_123641014|GENSCAN_predicted_CDS_2|1107_bp atggaaggtaatggccccgctgctgtccactaccagccggccagccccccgcgggacgcc tgcgtctacagcagctgctactgtgaagaaaatatttggaagctctgtgaatacatcaaa aaccatgaccagtatcctttagaagaatgttatgctgtcttcatatctaatgagaggaag atggattaccatgttgttttgcttcatgtttcaagtggaggacagaacttcatttatgat ctcgatactgtcttgccatttccctgcctctttgacacttatgtagaagatgcctttaag tctgatgatgacattcacccacagtttaggaggaaatttagagtgatccgtgcagattca tatttgaagaactttgcttctgaccgatctcacatgaaagactccagtgggaattggaga gagcctccgccgccatatccctgcattgagactggagatgaggaaactgatgcagagaga ggttttcatatagctcaccaagtcactagccaaatgttcacaaaccctgaacacagacgt ctatccagtcctgttgttttgcatgaggttcaggagccttgtgggaaagaagccaaatgg tctgtgttctcatttagggttcagacgtttataggcttgtcccgtgggctagaggtcctc aaagatgtctttgctagaggagagaaaaacatgaacaggtccctgagagaaaatcaatgt gtagcagtcggagatgcactcacttctagccttggtgcccagacctgtgaactctggcag tactgtcactggttcagagcatctactacatcttatgggacagaaccccagtgtcaccag gtggtgtctgccagcggatactgtagctcagtctccaatgggccattcattctaccatca agaagatcacctacccctggggaccacggccacaaatgtgaggtagtttcttatagcagc aataggaagcggatgcagggggattcacagcccttacgcataaagcagcagcaaggggag gccaggcctagccagaagctacaggcaaggaacccacagatgcagttcagcaggtccacc tcccaggccacaaagaagacaaagtag >gi568815590r:123403376_123641014|GENSCAN_predicted_peptide_3|355_aa MPFLGQDWRSPGQNWVKTADGWKRFLDEKSGSFVSDLSSYCNKEVYNKENLFNSLNYDVA AKKRKKDMLNSKTKTQYFHQEKWIYVHKGSTKERHGYCTLGEAFNRLDFSTAILDSRRFN YVVRLLELIAKSQLTSLSGIAQKNFMNILEKVVLKVLEDQQNIRLIRELLQTLYTSLCTL VQRVGKSVLVGNINMWVYRMETILHWQQQLNNIQITRPAFKGLTFTDLPLCLQLNIMQRL SDGRDLVSLGQAAPDLHVLSEDRLLWKKLCQYHFSERQIRKRLILSDKGQLDWKKMYFKL VRCYPRKEQYGDTLQLCKHCHILSWKGTDHPCTANNPESCSVSLSPQDFINLFKF >gi568815590r:123403376_123641014|GENSCAN_predicted_CDS_3|1068_bp atgccattcctcgggcaggactggcggtcccccgggcagaactgggtgaagacggccgac ggctggaagcgcttcctggatgagaagagcggcagtttcgtgagcgacctcagcagttac tgcaacaaggaggtatacaataaggagaatcttttcaacagcctgaactatgatgttgca gccaagaagagaaagaaggacatgctgaatagcaaaaccaaaactcagtatttccaccaa gaaaaatggatctatgttcacaaaggaagtactaaagagcgccatggatattgcaccctg ggggaagctttcaacagactggacttctcaactgccattctggattccagaagatttaac tacgtggtccggctgttggagctgatagcaaagtcacagctcacatccctgagtggcatc gcccaaaagaacttcatgaatattttggaaaaagtggtactgaaagtccttgaagaccag caaaacattagactaataagggaactactccagaccctctacacatccttatgtacactg gtccaaagagtcggcaagtctgtgctggtcgggaacattaacatgtgggtgtatcggatg gagacgattctccactggcagcagcagctgaacaacattcagatcaccaggcctgccttc aaaggcctcaccttcactgacctgcctttgtgcctacaactgaacatcatgcagaggctg agcgacgggcgggacctggtcagcctgggccaggctgcccccgacctgcacgtgctcagc gaagaccggctgctgtggaagaaactctgccagtaccacttctccgagcggcagatccgc aaacgattaattctgtcagacaaagggcagctggattggaagaagatgtatttcaaactt gtccgatgttacccaaggaaagagcagtatggagatacccttcagctctgcaaacactgt cacatcctttcctggaagggcactgaccatccgtgcactgccaataacccagagagctgc tccgtttcactttcaccccaggactttatcaacttgttcaagttctga >gi568815590r:123403376_123641014|GENSCAN_predicted_peptide_4|67_aa MAAQTMPKIVPEKSLVRMLLASPLNRGHYRVYHCCHHGPWISDIVATAAGLTRMPLPLAL LRCIISF >gi568815590r:123403376_123641014|GENSCAN_predicted_CDS_4|204_bp atggcagcccagactatgcccaaaatcgtacctgagaagagtctagtgaggatgctgctg gcatccccgctgaacagaggacactaccgagtgtaccactgctgccatcatggtccctgg atatcagacattgttgctactgctgctggcctcaccagaatgcctcttccattggccctt cttcgttgcatcatcagcttctga >gi568815590r:123403376_123641014|GENSCAN_predicted_peptide_5|153_aa MKTIPSGIVSIARYLKSFSEHVDHWQKLDPFLTPYTKIGSRWIEDLNVKPQTIKTLEENL GNTIQYIGIGKDFMTKTAKTIATKAKIDKWDLIKLKSFCTAIETIIRVNRQPTEWEKTFV SYPSDKGLISRIYKELKPIYKKKTTPSKSGQRI >gi568815590r:123403376_123641014|GENSCAN_predicted_CDS_5|462_bp atgaagactataccttcagggatcgtttctatagctcgttacttgaaaagtttctctgaa catgtagatcactggcagaaactggaccccttccttacaccttatacaaaaattggctca agatggattgaagacttgaatgtaaaaccccaaaccataaaaactctagaagaaaaccta ggcaataccattcagtacataggcataggcaaagacttcatgactaaaacagcaaaaaca attgcaacaaaagcaaaaattgacaaatgggatctaatcaaactaaagagcttctgcaca gcaatagaaactatcatcagagtgaacaggcaacctacagaatgggagaaaacttttgta agctacccatctgacaaaggtctaatatccagaatctacaaggaacttaaaccaatttac aagaaaaaaacaaccccatcaaaaagtgggcaaaggatatga >gi568815590r:123403376_123641014|GENSCAN_predicted_peptide_6|225_aa MKMSKFLLPCASSFTTQLLHKGSLGKNVPRPHISVSITEGMGTCPCPQLLTRGDDETTPY STPLWTDWTWTDGSRGQQLTQPGYSRKSSKMATIDSFAPSMYMLLLISKGSESFSSSSAA PVQKPQLMLHGAEMRYPAKSRPDFWPINHEIKYNFFKPPSSGGLENVSGVFAEMGNWSRK EKSKPVIFGDAFHSWETGSHPQTKDRIAQFAQHNHQVPAVNAPDF >gi568815590r:123403376_123641014|GENSCAN_predicted_CDS_6|678_bp atgaagatgtccaagtttttgctaccttgtgccagctccttcaccactcaactgcttcac aagggatccttgggaaagaatgtaccaaggcctcatatctctgtgtccatcacagagggt atgggcacttgtccttgtccccagctcctcactagaggtgatgatgagaccacaccatac tcaactcctttatggaccgactggacatggactgatggatccaggggtcaacagctgacc caaccagggtatagcagaaagtcttcaaagatggccaccatcgattcctttgctccttct atgtatatgttacttctcatatcaaaaggaagtgaatccttcagctctagttcagcagcc ccagttcagaagccccagctgatgctgcatggagcagagatgagatacccagccaagtcc cgcccagacttctggcccatcaatcatgaaatcaaatacaatttttttaagccaccaagt tctgggggcctggaaaatgtttctggcgtctttgcagaaatgggaaactggagcaggaag gagaagtcaaagcccgtgatattcggggatgccttccactcgtgggaaacaggttcccat ccacaaaccaaagacagaatagcacaatttgcacaacacaaccaccaagtcccagcggtc aatgctccagacttttga