GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:00:27 Sequence gi568815591f:104567749_104769570 : 201822 bp : 39.50% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 2968 2853 116 2 2 56 84 101 0.028 5.75 1.01 Init - 19679 19565 115 2 1 83 35 97 0.151 4.32 1.00 Prom - 19856 19817 40 -3.65 2.02 PlyA - 21024 21019 6 1.05 2.01 Sngl - 22980 22555 426 0 0 49 43 209 0.878 8.54 2.00 Prom - 23073 23034 40 -7.55 3.02 PlyA - 23609 23604 6 1.05 3.01 Sngl - 24496 24188 309 1 0 98 44 356 0.989 27.85 3.00 Prom - 25999 25960 40 -6.15 4.05 PlyA - 26173 26168 6 1.05 4.04 Term - 42843 42702 142 2 1 75 38 131 0.900 3.22 4.03 Intr - 47014 46721 294 0 0 9 50 237 0.782 7.20 4.02 Intr - 48087 47937 151 1 1 125 23 117 0.660 7.20 4.01 Init - 61910 61736 175 2 1 75 48 123 0.130 6.56 4.00 Prom - 69763 69724 40 -1.95 5.00 Prom + 77266 77305 40 -5.75 5.01 Init + 100001 100967 967 1 1 125 15 1397 0.021 129.64 5.02 Intr + 121418 122197 780 0 0 92 91 295 0.165 20.72 5.03 Term + 122302 122717 416 2 2 -3 44 259 0.617 6.84 5.04 PlyA + 122831 122836 6 1.05 6.00 Prom + 131591 131630 40 -4.85 6.01 Init + 131975 132082 108 1 0 88 32 88 0.324 3.47 6.02 Intr + 138512 138647 136 1 1 -3 86 129 0.604 2.82 6.03 Intr + 140016 140099 84 1 0 55 83 59 0.426 0.97 6.04 Intr + 168927 169163 237 1 0 78 116 197 0.450 18.16 6.05 Term + 169448 169473 26 0 2 108 44 26 0.915 -2.39 6.06 PlyA + 169552 169557 6 1.05 7.05 PlyA - 169598 169593 6 1.05 7.04 Term - 171207 171133 75 0 0 94 36 93 0.370 1.56 7.03 Intr - 174909 174762 148 2 1 80 46 107 0.458 5.02 7.02 Intr - 177566 177283 284 2 2 91 102 72 0.474 4.19 7.01 Init - 181832 181095 738 2 0 82 70 209 0.257 13.29 7.00 Prom - 182765 182726 40 -5.65 8.07 PlyA - 183246 183241 6 1.05 8.06 Term - 184237 183342 896 2 2 -29 42 509 0.293 26.17 8.05 Intr - 184780 184256 525 2 0 34 30 444 0.481 25.19 8.04 Intr - 185043 184967 77 2 2 31 115 137 0.739 8.94 8.03 Intr - 186162 186108 55 1 1 63 51 45 0.420 -4.58 8.02 Intr - 192074 191955 120 0 0 103 94 110 0.913 12.65 8.01 Init - 194034 193896 139 0 1 72 55 71 0.725 2.55 8.00 Prom - 197073 197034 40 -6.15 9.02 PlyA - 197125 197120 6 1.05 9.01 Term - 198431 198258 174 2 0 93 42 121 0.671 4.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 25906 25319 588 1 0 49 48 257 0.977 13.73 S.002 Sngl + 100001 100996 996 1 0 125 41 1380 0.966 133.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_1|77_aa MDEAGNHHSQQTIAGTENQTLHVLTHTSELNNENTWTQGTPEPKIKVKKEKEQSLGGWTL MIAETAARHLFQDKQRQ >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_1|231_bp atggatgaagctggaaaccatcactctcagcaaactatcgcagggacagaaaaccaaaca ctgcatgttctcactcatacgtcggaattgaacaatgagaacacttggacacagggtacc cccgaacctaaaataaaagttaaaaaagaaaaagaacagtctcttggtggatggacactg atgatagcagagacagctgctaggcatttgttccaagacaaacagcgacag >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_2|141_aa MGDFNTPLSTLDRSMRQKVNRDIQELNSALHQADLIDNYRTLHPRSTEYTFFSAPHHTYS KTDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTTSKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTT >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_2|426_bp atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagttaac agggatatccaggaactgaactcagctctgcaccaagcggacctaatagacaattacaga actctccaccccagatcaacagaatatacattcttctcagcaccacatcacacttattcc aaaactgaccacatagttggaagtaaagcactcctcagtaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaaccgctcaactacatcgaaactgaacaacctgctcctgaatgactactgggta cataatgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acatag >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_3|102_aa MGRNQSRKAENSRNQNTSSPPKESSSSPAMEQSWMENDFDELREEGSRRSVITNFSELKE DVRTHRKEAKNVEKRLDEWLTRINSVEKTLNDLMELKTVARE >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_3|309_bp atggggagaaaccagagcagaaaagctgaaaattctagaaatcagaatacctcttctcct ccaaaggaaagcagctcctcaccagcaatggaacaaagctggatggagaatgactttgac gagttgagagaagaaggctctagacgatcagtaataacaaacttctctgagctaaaggag gatgttcgaacccatcgcaaagaagctaaaaacgttgaaaaaagattagacgaatggcta actagaataaacagtgtagagaagaccttaaatgacctgatggagctgaaaaccgtggca cgagaatga >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_4|253_aa MSSELEHRHSICTEARLGVALALQGWKQMTDNRRPANQLPFPGEKGIKRNVDFLRKIEGT WMKLETIIRQTNTGTENQTPHVLTHKWELNKENTWTPGGEHYTLGPVRGKKRKERKRKKE RKKERKKERKKERKKEGRKEGRKEGEKRREEKRREEKRREEKRREEREEKRRLFGQYRTV VLEEFGRKWAYSELSPHSTHTQCVFTRLRFELEVTLVALLVLKPVDPHRNYITISLGLQF ANQLEKSWISQPP >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_4|762_bp atgagcagtgagctggaacacagacactctatctgcactgaagccaggcttggtgtagca ttagctctgcagggttggaaacaaatgacagataacagaaggcctgctaaccagcttcca tttccgggtgaaaagggcatcaagagaaatgtggacttcttgaggaagatagaagggaca tggatgaagctggaaaccatcattcggcaaactaacacaggaaccgaaaaccaaacaccg catgttctcactcataagtgggagttgaacaaagagaacacatggacaccgggaggggaa cattacacactggggcctgtcaggggaaagaaaagaaaagagagaaagagaaagaaagaa agaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaagaaggaaggaaggaa ggaaggaaggaaggagagaagagaagagaagagaagagaagagaagagaagagaagagaa gagaagagaagagaagagagagaagagaagaggaggctctttggacagtaccgtactgta gtcctggaggaatttggaagaaagtgggcctattcagaactttctcctcattccacacat actcagtgtgtgtttaccagactcagatttgaactggaagttacactagtggctctcctg gttctcaagcctgtggacccacacaggaactatatcaccatctctcttgggctccagttt gccaaccagctggagaaatcttggatttctcaacctccatga >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_5|720_aa MAASAKKKNKKGKTISLTDFLAEDGGTGGGSTYVSKPVSWADETDDLEGDVSTTWHSNDD DVYRAPPIDRSILPTAPRAAREPNIDRSRLPKSPPYTAFLGNLPYDVTEESIKEFFRGLN ISAVRLPREPSNPERLKGFGYAEFEDLDSLLSALSLNEESLSNRRIRVDVADQALDKDRD DPPFGRDRNRDSDKTDTDWRARPATDTFDDYPPRRGDDSFGDKYRDRYDSDRYRDGYRDG PRRDMDRYGGRDRYDDRGSRDYDRGYDSRIGSGRRAFGSGYRRDDDYREGRDCYEDQYDR RDDRSWSSRDDYSRDDYRRDDREFRVGTALTQLENLNAMGIIGSQGGSVQVVALNCQRQG GCSYHSGQQRQSSNQNSLTPVELRHWLINHGVPRSEIDRKPTAFLLNLYNQETSRLSGQK TNLNYKNREPQPLSQFPDLNQFTDPEPLEWRGHWVPLRRDPTTLLTIYTVNLSPILPQQD LQPFSRVTVHWEKGNHQTFWGLLDTGSELMLIPGDPKYHCGPPVTVEAYGGQVINEVLAQ VQLTVGPFGPRTHSVMISPGPECIIGTGILSSWQNPHIGCLTGIAEISATIKDLKDAGVV IPNTSPFNSPIWPVPKTDGSWRTTVGYCKLTQVVAPVAAAVPDVVSLLKQINTSPGTWCA ATDLANVFFSIPIYKAHQKQFAFSWQGQQYTFTILPQGYINSPTLCHNLVRKDLDHFSLP >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_5|2163_bp atggcggcctcagcaaaaaagaagaataagaaggggaagactatctccctaacagacttt ctggctgaggatgggggtactggtggaggaagcacctatgtttccaaaccagtcagctgg gctgatgaaacggatgacctggaaggagatgtttctacaacttggcacagtaacgatgac gatgtgtacagggcgcctccaattgaccgttccatccttcccactgctccacgggctgct cgggaacccaatatcgaccggagccgtcttcccaaatcgccaccctacactgcttttcta ggaaacctaccctatgatgttacagaagagtcaattaaggaattctttcgaggattaaat atcagtgcagtgcgtttaccacgtgaacccagcaatccagagaggctgaaaggttttggt tatgctgaatttgaggacctggattccctgctcagtgccctgagtctcaatgaagagtct ctaagtaacaggagaattcgagtggacgttgctgatcaagcactggataaagacagggat gatcctccttttggccgtgatagaaatcgggattctgacaaaacagatacagactggagg gctcgtcctgctacagacacctttgatgactacccacctagaagaggtgatgatagcttt ggagacaagtatcgagatcgttatgattcagaccggtatcgggatgggtatcgggatggc ccacgccgggatatggatcgatatggtggccgggatcgctatgatgaccgaggcagcaga gactatgatagaggctatgattcccggataggcagtggcagaagagcatttggcagtggg tatcgcagggatgatgactacagagaaggcagggactgctatgaagaccaatatgacaga cgggatgatcggtcgtggagctccagagatgattactctcgggatgattataggcgtgat gatagagagtttagagtgggaactgcactcactcaattagaaaatttaaatgccatggga ataattggatcccaaggtggcagtgtccaagtagtggcactcaactgtcaaaggcaaggt gggtgtagttaccacagtggacagcagaggcaaagcagcaatcagaatagtctgactcct gtagagctcaggcattggttaattaatcatggtgttcctagaagtgaaattgataggaag cctactgcattcttacttaatttgtataaccaggaaacttccaggttgagtggacaaaag actaatttgaattataaaaatagagaaccacagcccctcagtcaatttccagacttgaac cagtttacagacccagaaccccttgaatggaggggacactgggtccccttaaggagggac cccactacactactgacaatttatactgttaatctttctcccatccttccccaacaagac ctccagccttttagcagagtaactgtgcattgggaaaaggggaatcatcagactttttgg ggactactggacactggctctgagctgatgttgattccaggggacccaaagtatcattgt ggtcctccagttacagtagaggcttatggaggtcaggtaattaatgaagttttagctcag gtccaacttacagtgggtccatttggtccccggactcattctgtgatgatttccccaggg ccagaatgcataattggcacaggcatacttagcagctggcagaatccccacattggctgc ctgacagggattgcagagattagtgccaccatcaaggacttaaaagatgcaggggtggtg attcccaacacatccccattcaactctcctatttggcctgtgccaaagacagatggatct tggagaacgacagtgggttattgtaagcttacccaagtggtggccccagttgcagctgct gtaccagatgtggtttcattgcttaaacaaattaacacatctcctggtacctggtgtgca gccactgatttggcaaatgtctttttctccattcctatttataaggcccaccagaagcaa tttgccttcagctggcaaggccagcaatataccttcactatcctacctcaggggtatatc aactctccgactttgtgtcataatcttgttcgcaaagatcttgatcacttttcccttcca taa >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_6|196_aa MTLSKQFSPASPRRTEFSYPHSFGARPLVYTLTSAGGITCCQPVSPPRGEPEKDITHKPL TRTLTIWCLALADDSASLISLECKEPLRFPIQCRKEDALEEKRYVFGNWTACLVLGCMIF PDGWDSDEVKRMCGEKTDKYTLGACSVRWAYILAIIGILDALILSFLAFVLGNRQDSLMA EELKAENKGLPSELNG >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_6|591_bp atgaccctatcaaagcagttttccccagcatcccctcgcaggacagaatttagttatcct catagctttggtgcaaggcccttggtttacacgctgacatcagctgggggcatcacctgc tgccagccagtgtcaccaccgagaggtgaaccagaaaaggatattactcacaagcctctg acaaggacactgactatatggtgtctagctctggccgatgactcagcatccctcatatca ctggaatgtaaagaaccattacgattccctatccagtgcaggaaggaggatgcattagag gagaagagatacgtgtttggaaactggactgcctgccttgtgcttggctgtatgattttc cctgatggctgggactcagatgaagtaaaacggatgtgtggagaaaagacagacaagtac actcttggggcttgctcagtccgctgggcatacatcctggctattattggaattttggat gccctgatcctctcatttctagcatttgtgcttggtaatcgacaagacagcttgatggca gaggaactgaaggcagaaaacaaaggtttgccatcagaacttaatggctaa >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_7|414_aa MIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPATRFQWKVLPQGMLNSPT ICQTFVGRALQPVRDKFSDCYIIHYIDDILCATETRDKLIDCYTFLQAEVANAGLAIASD KIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLKTLNDFQKLLGDINWIRPTLGIPTYAMS NLFSILRGNSDLNSKRMLTPEATKEIKLVEEKIQSAQINRIDPLAPRWAPDVGVGCPSTP VGVSRKSLDGGCSSCYLFYVPLGRRRKRKGQLLYQESKSFPGTLQPTPADNLLAREEREL GSVSQPPGSALAWIFCGEYQKLSFGNTKFEKLIRYSKKGIGPGSLILELLRYWPKAPQLV SETGLALGSLIQVDNLSTTPHRHSQCGHSMNKMEETTVTYANGEDDEHRKIMQR >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_7|1245_bp atgatcccaaaagattggcctttaattataattgatctaaaggattgcttttttaccatc cctctggcggagcaggattgcgaaaaatttgcctttactataccagccataaataataaa gaaccagccaccaggtttcagtggaaagtgttacctcagggaatgcttaatagtccaact atttgtcagacttttgtaggtcgagctcttcaaccagttagagacaagttttcagactgt tatattattcattatattgatgatattttatgtgctacagaaacgagagataaattaatt gactgttatacatttctgcaagcagaggttgccaatgcaggactggcaatagcatctgat aagatccaaacctctactccttttcattatttagggatgcagatagaaaatagaaaaatt aagccacaaaaaatagaaataagaaaagacacattaaaaacactaaatgattttcaaaaa ttgctgggagatattaattggattcggccaactctaggcattcctacttatgccatgtca aatttgttctctatcttaagaggaaactcagacttaaatagtaaaagaatgttaacccca gaggcaacaaaagaaattaaattagtggaagaaaaaattcagtcagcgcaaataaataga atagatcccttagccccacgttgggcgccagatgtaggggtgggttgcccctccacacct gtgggtgtttctcgtaagtcattggatggtggctgcagttcctgctatctcttttatgtt cctttaggaagaaggagaaagaggaaggggcagcttctgtatcaggaaagcaaaagtttc ccaggaaccctgcagccgactcctgcggacaatttgttggccagagaggaaagggagttg ggaagtgtcagccaaccaccagggtctgctttggcttggatattttgtggggagtatcag aagctcagttttggaaatactaagtttgagaagctaattagatattcaaaaaagggtatc gggccaggaagcttaattttggaactattaagatattggcccaaggctcctcagctggtg tcagagacaggactagcactgggcagcttgatccaagtggacaatctttccactacacca caccgccactcccagtgtggccatagcatgaacaagatggaagaaactacggtcacatat gcaaatggagaagatgatgaacacagaaaaattatgcaaagataa >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_8|603_aa MKFRLVTSCSEMKIPAGYVFEEIHSTTVLLASESNQMKGSCVYELYGLLWGKSDDVSPCS HSKCPENLGAFPLPFPEAELLRKNPRAREGKSMPMILSKFCQPERVKVHSSVVIEDKSTR DPEYVYSQPYEQGTLDLKDWKRIGKELKQAGRKGNIIPLTVWNDWAIIKAALEPFQTEED SVSVSDAPGSCLIDCNEKTRKKSQKETEGLHCEYVAEPVMAQSTQNVDYNQLQEVIYPET LKLEGKGPELVGPSESKPRGTSPLPAGQVPVTLQPQTQVKENKTQPPVAYQYWPPAELQY RPPPERMPPAPQGRAPYPQPPTRRLNPTAPPSRQGSELHEIIDKSRKEGDTEAWQFPVTL EPMPPGEGAQEGEPLTVEARYKSFSIKMLKDMKERVKQYGPNSPYMRTLLDSIAHGHRLI PYDWEILAKSSLSPSQFLQFKTWWIDGVQEQVRRNRAANPPVNIDADQLLGIGQNWSTIS QQALMQNEAIEQVRAICLRAWEKIQDPGSACPSFNTVRQGSKEPYPDFVARLQDVAQKSI ADEKACKVTVELMAYENANPECQSAIKPLKGKVPAGSDVISEYVKAWIESEELCIKLCLW LKQ >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_8|1812_bp atgaagttcaggcttgtgacaagctgcagtgaaatgaaaattccggcaggctatgtattt gaagaaattcactctacgactgttctacttgcttctgagtcaaatcaaatgaagggatcc tgcgtttatgaactgtacggtctgttgtggggaaagtcagatgatgtgagtccttgttca cacagtaagtgcccagaaaaccttggtgcttttcctcttcccttccctgaagctgaactt ctgcggaaaaatccaagagcacgtgaaggcaaatctatgcctatgatcctgtctaaattc tgccagcctgaaagggtgaaggtacactcgagcgtggtcattgaggacaagtcgacgaga gatcccgagtacgtctacagtcagccttacgaacaaggaactttagatctaaaagattgg aaaagaattggtaaggaactaaaacaagcaggtaggaagggtaatatcattccacttaca gtatggaatgattgggccattattaaagcagctttagaaccatttcaaacagaagaagat agcgtttcagtttctgatgcccctggaagctgtttaatagattgtaatgaaaagacaagg aaaaaatcccagaaagaaacagaaggtttacattgcgaatatgtagcagagccggtaatg gctcagtcaacgcaaaatgttgactataatcaattacaggaggtgatatatcctgaaacg ttaaaattagaaggaaaaggtcccgaattagtggggccatcagagtctaaaccacgaggc acaagtcctcttccagcaggtcaggtgcctgtaacattacaacctcaaacgcaggttaaa gaaaataagacccaaccgccagtagcctatcaatactggccgccggctgaacttcagtat cggccacccccagaaagaatgcccccagcaccacagggcagggcgccataccctcagccg cccactaggagacttaatcctacggcaccacctagtagacagggtagtgaattacatgaa attattgataaatcaagaaaggaaggagatactgaggcatggcaattcccagtaacgtta gaaccgatgccacctggagaaggagcccaagagggagagcctctcacagttgaggccaga tacaagtctttttcgataaaaatgctaaaagatatgaaagagagagtaaaacagtatgga cccaactccccttatatgaggacattattagattccattgctcatggacatagactcatt ccttatgattgggagattctggcaaaatcgtctctctcaccctctcaatttttacaattt aagacttggtggattgatggggttcaagaacaggtccgaagaaatagggctgccaatcct ccagttaacatagatgcagatcaactattaggaataggtcaaaattggagtactattagt caacaagcattaatgcaaaatgaggccattgagcaagttagagctatctgccttagagcc tgggaaaaaatccaagacccaggaagtgcctgcccctcatttaatacagtaagacaaggt tcaaaagagccctaccctgattttgtggcaaggctccaagatgttgctcaaaagtcaatt gccgatgaaaaagcctgtaaggtcacagtggagttgatggcatatgaaaacgccaatcct gagtgtcaatcagccattaagccattaaaaggaaaggttcctgcaggatcagatgtaatc tcagaatatgtaaaagcctggattgaatcggaggagctatgcataaagctatgcttatgg ctcaagcaataa >gi568815591f:104567749_104769570|GENSCAN_predicted_peptide_9|57_aa KAKIPTKLDITIHRNYFSIRIIPYHHMDFCANSGTNDGIAEKILKQAIQQGLANYES >gi568815591f:104567749_104769570|GENSCAN_predicted_CDS_9|174_bp aaagccaagataccaacaaagctcgacattaccatccacagaaactactttagcattagg atcattccgtatcatcatatggatttctgtgccaactcaggaacaaatgatggtatagca gaaaagatattgaaacaagccatacagcagggtttagcaaattatgagagctga