GENSCAN 1.0	Date run: 4-Nov-116	Time: 14:40:55

Sequence gi568815586f:12713368_12929551 : 216184 bp : 44.59% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4473   4947  475  2  1   53   47   489 0.644  36.94
 1.02 Term +   5458   5579  122  2  2   39   41    96 0.445  -1.36
 1.03 PlyA +   6123   6128    6                                1.05

 2.00 Prom +  53150  53189   40                               -3.26
 2.01 Sngl +  73392  74285  894  2  0   27   54  1376 0.202 122.24
 2.02 PlyA +  77698  77703    6                                1.05

 3.00 Prom +  82851  82890   40                               -6.36
 3.01 Init + 100001 100087   87  1  0   91   85    79 0.960   8.54
 3.02 Intr + 100764 100857   94  2  1   29  100    54 0.966   0.24
 3.03 Intr + 107841 108029  189  1  0  101  103    81 0.981  10.46
 3.04 Intr + 108598 108716  119  2  2   71   83   128 0.999  10.78
 3.05 Intr + 109294 109365   72  0  0   89   62    89 0.984   6.00
 3.06 Intr + 109836 109952  117  2  0   87   93    17 0.869   2.76
 3.07 Intr + 110503 110649  147  0  0   59  110     5 0.510   0.23
 3.08 Intr + 111173 111310  138  1  0   61  113    98 0.992  10.26
 3.09 Intr + 112633 112703   71  0  2  105   98    33 0.995   3.98
 3.10 Intr + 113879 114008  130  2  1   69   68   150 0.969  11.80
 3.11 Term + 116056 116187  132  0  0  126   37   184 0.991  15.19
 3.12 PlyA + 116587 116592    6                                1.05

 4.00 Prom + 120821 120860   40                               -2.96
 4.01 Init + 122357 122541  185  1  2   99   75    23 0.246   0.70
 4.02 Term + 126490 126583   94  1  1   77   54    89 0.805   1.70
 4.03 PlyA + 126990 126995    6                                1.05

 5.00 Prom + 136814 136853   40                               -2.66
 5.01 Sngl + 137498 138337  840  1  0   86   49   518 0.697  43.45
 5.02 PlyA + 138989 138994    6                                1.05

 6.00 Prom + 144741 144780   40                               -6.16
 6.01 Sngl + 162132 162743  612  2  0   96   52   956 0.890  87.00
 6.02 PlyA + 162748 162753    6                                1.05

 7.08 PlyA - 166378 166373    6                                1.05
 7.07 Term - 172472 172416   57  2  0   87   44    67 0.068  -0.11
 7.06 Intr - 177349 177262   88  0  1  118   18    41 0.171   0.07
 7.05 Intr - 178349 178131  219  1  0   84   72   110 0.639   6.42
 7.04 Intr - 179521 179439   83  1  2  104   64    68 0.906   4.44
 7.03 Intr - 187104 187069   36  0  0  121  108   -21 0.688   1.66
 7.02 Intr - 188011 187893  119  2  2   99  107   -41 0.350  -0.92
 7.01 Init - 190843 190690  154  1  1   58   63    86 0.474   3.24
 7.00 Prom - 193291 193252   40                               -4.16

 8.00 Prom + 193319 193358   40                              -11.14
 8.01 Init + 194883 195804  922  2  1   75   87   990 0.962  91.74
 8.02 Intr + 198717 198775   59  1  2  133  111    66 0.999  12.00
 8.03 Intr + 203041 203149  109  0  1   47   24    77 0.050  -2.94
 8.04 Intr + 208442 208489   48  0  0  123   83    -4 0.038   1.25
 8.05 Intr + 214337 214436  100  0  1   97  103    52 0.021   6.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 149921 149826   96  1  0   95   74    76 0.856   6.02

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:12713368_12929551|GENSCAN_predicted_peptide_1|198_aa
MSNVRVSNGSPSLERMDARQAEHPKPSACRNLFGPVDHEELTRDLEKHCRDMEEASQRKW
NFDFQNHKPLEGKYEWQEVEKGSLPEFYYRPPRPPKGACKVPAQESQDVSGSRPAAPLIG
APANSEDTHLVDPKTDPSDSQTGLAEQCAGIRKRPATDDSSTQNKRANRTEENVSDGSPN
AGSVEQTPKKPGLRRRQT

>gi568815586f:12713368_12929551|GENSCAN_predicted_CDS_1|597_bp
atgtcaaacgtgcgagtgtctaacgggagccctagcctggagcggatggacgccaggcag
gcggagcaccccaagccctcggcctgcaggaacctcttcggcccggtggaccacgaagag
ttaacccgggacttggagaagcactgcagagacatggaagaggcgagccagcgcaagtgg
aatttcgattttcagaatcacaaacccctagagggcaagtacgagtggcaagaggtggag
aagggcagcttgcccgagttctactacagacccccgcggccccccaaaggtgcctgcaag
gtgccggcgcaggagagccaggatgtcagcgggagccgcccggcggcgcctttaattggg
gctccggctaactctgaggacacgcatttggtggacccaaagactgatccgtcggacagc
cagacggggttagcggagcaatgcgcaggaataaggaagcgacctgcaaccgacgattct
tctactcaaaacaaaagagccaacagaacagaagaaaatgtttcagacggttccccaaat
gccggttctgtggagcagacgcccaagaagcctggcctcagaagacgtcaaacgtaa

>gi568815586f:12713368_12929551|GENSCAN_predicted_peptide_2|297_aa
MIHWKQTRSPSVAVAAPLNSCQVPAGVRAAGRERRLARRLQADRVSMSPQGMERPAAREP
HGPDALRRFQGLLLDRRGRLHGQVLRLREVARRLERLRRRSLVANVAGSSLSATGALAAI
VGLSLSPVTLGTSLLVSAVGLGVATAGGAVTITSDLSLIFCNSRELRRVQEIAATCQDQM
REILSCLEFFCRWQGCGDRQLLQCGRNASIALYNSVYFIVFFGSRGFLIPRRAEGDTKVS
QAVLKAKIQKLAESLESCTGALDELSEQLESRVQLCTKSSRGHDLKISADQRAGLFF

>gi568815586f:12713368_12929551|GENSCAN_predicted_CDS_2|894_bp
atgatccactggaaacagacccgttcccctagcgtggcagtggctgctccgctgaactcg
tgccaagttcccgctggcgtccgggcagcagggcgggagcggcggctggcacggagactc
caggctgaccgcgtgtctatgtccccgcagggaatggagaggccggcggcccgggagccg
catgggcccgacgcgctgcggcgcttccagggactgctgctggaccgccgaggccggctg
cacggccaggtgctgcgcctgcgcgaggtggcccggcgcctggagcgcctgcgcaggcgc
tccctcgtagccaacgtggccggcagctcgctgagcgcaacgggcgccctcgccgccatc
gtggggctctcgctcagcccggtcaccctggggacctcgctgctggtgtcggccgtgggg
ctgggggtggccacagccggaggggccgtcaccatcacgtccgatctctcgctgatcttc
tgcaactcccgggagctgcggagggtgcaggagatcgcggccacctgccaggaccagatg
cgagagatcctgagctgcctcgagtttttctgccgctggcagggctgcggggaccgccag
ctgctgcagtgcgggaggaacgcctccatcgccctgtacaattctgtctacttcatcgtc
ttctttggctcacgtggcttcctcatccccaggcgggcggagggggacaccaaggttagc
caggccgtgctgaaggccaagattcagaaactggccgagagcctggagtcctgcaccggg
gctctggacgaactcagcgagcagctggagtctcgggttcagctctgcaccaagtccagt
cgtggccacgacctcaagatctctgctgaccagcgtgcagggctgtttttctga

>gi568815586f:12713368_12929551|GENSCAN_predicted_peptide_3|431_aa
MAAPEEHDSPTEASQPIVEEEETKTFKDLGVTDVLCEACDQLGWTKPTKIQIEAIPLALQ
GRDIIGLAETGSGKTGAFALPILNALLETPQRLFALVLTPTRELAFQISEQFEALGSSIG
VQSATPGRLIDHLENTKGFNLRALKYLVMDEADRILNMDFETEVDKILKVIPRDRKTFLF
SATMTKKVQKLQRAALKNPVKCAVSSKYQTVEKLQQYYIFIPSKFKDTYLVYILNELAGN
SFMIFCSTCNNTQRTALLLRNLGFTAIPLHGQMSQSKRLGSLNKFKAKARSILLATDVAS
RGLDIPHVDVVVNFDIPTHSKDYIHRVGRTARAGRSGKAITFVTQYDVELFQRIEHLIGK
KLPGFPTQDDEVMMLTERVAEAQRFARMELREHGEKKKRSREDAGDNDDTEGAIGVRNKV
AGGKMKKRKGR

>gi568815586f:12713368_12929551|GENSCAN_predicted_CDS_3|1296_bp
atggcggcacccgaggaacacgattctccgaccgaagcgtcccagccgattgtggaagag
gaggaaactaaaacatttaaagacctgggtgtgacagatgtgttgtgtgaagcttgtgac
cagttgggatggacaaaacccaccaagatccagattgaagctattcctttggccttacaa
ggtcgtgatatcattgggcttgcagaaactggctctggaaagacaggcgcctttgctttg
cccattctaaacgcactgctggagaccccgcagcgtttgtttgccctagttcttaccccg
actcgggagctggcctttcagatctcagagcagtttgaagccctggggtcctctattgga
gtgcagagtgcaactcctggtcgactgattgaccacttggaaaatacgaaaggtttcaac
ttgagagctctcaaatacttggtcatggatgaagccgaccgaatactgaatatggatttt
gagacagaggttgacaagatcctcaaagtgattcctcgagatcggaaaacattcctcttc
tctgccaccatgaccaagaaggttcaaaaacttcagcgagcagctctgaagaatcctgtg
aaatgtgccgtttcctctaaataccagacagttgaaaaattacagcaatattatattttt
attccctctaaattcaaggatacctacctggtttatattctaaatgaattggctggaaac
tcctttatgatattctgcagcacctgtaataatacccagagaacagctttgctactgcga
aatcttggcttcactgccatccccctccatggacaaatgagtcagagtaagcgcctagga
tcccttaataagtttaaggccaaggcccgttccattcttctagcaactgacgttgccagc
cgaggtttggacatacctcatgtagatgtggttgtcaactttgacattcctacccattcc
aaggattacatccatcgagtaggtcgaacagctagagctgggcgctccggaaaggctatt
acttttgtcacacagtatgatgtggaactcttccagcgcatagaacacttaattgggaag
aaactaccaggttttccaacacaggatgatgaggttatgatgctgacagaacgcgtcgct
gaagcccaaaggtttgcccgaatggagttaagggagcatggagaaaagaagaaacgctcg
cgagaggatgctggagataatgatgacacagagggtgctattggtgtcaggaacaaggtg
gctggaggaaaaatgaagaagcggaaaggccgttaa

>gi568815586f:12713368_12929551|GENSCAN_predicted_peptide_4|92_aa
MQSLGGGPQGLLRVLELLFARSHPSPYTIQTQRMSPVTVTLQSSASAVKEAGRSRRNWER
NSIYVWVNSELCVITKGSLEGDERALQGQVSD

>gi568815586f:12713368_12929551|GENSCAN_predicted_CDS_4|279_bp
atgcagtctctaggaggtgggccccaaggtctgctcagagtcttggagcttttgtttgct
cgtagccatccatccccttataccatccaaacacaaaggatgtctcctgtcactgtcacc
cttcaatcatctgccagtgccgtcaaagaggccggaaggagcaggaggaattgggaaagg
aacagcatctatgtctgggtgaactcagaattatgtgtgatcaccaagggcagccttgaa
ggggacgagagggccctacaaggacaagtctctgactga

>gi568815586f:12713368_12929551|GENSCAN_predicted_peptide_5|279_aa
MVESKGGARHVLHGGKQERSRGELPLESHQVLEAFSCFKMKLNISFPVTGCQKLIEVDDE
CKLRTFYEKLMATEVAADTLGEEWKGYVVRISGGNNKQGFPMKQGVLTHGRVHLLLSKGH
SCYRPRRTGERKRKSVRGCIVDANLSILNLIIVKKKKKVKKDIPGLTDTMVPCRLGPKKA
SRICKLSNLSEEDDVRQYVVRKPSNKGGKKPRTKAPKIQHLVTPHFLQHKGQHIALKKPC
TKKNKEEAAEYAKLLGKGMKEAKEKRQEQIAKRHRLSSL

>gi568815586f:12713368_12929551|GENSCAN_predicted_CDS_5|840_bp
atggtggaaagcaaaggaggagcaaggcatgtcttacatggtggcaagcaagagagaagt
agaggggaactccctttagaaagccatcaggtcttggaggcattcagctgcttcaagatg
aagctgaacatctccttcccggtcactggctgccagaaactcattgaagtggatgatgaa
tgcaaacttcgtactttttatgagaagcttatggccacagaagttgctgctgacactctg
ggtgaagaatggaaaggttatgtggtccgaatcagtggtgggaacaacaaacaaggtttt
cccatgaagcagggtgtcttgacccatggccgtgtccacctgctactgagtaaggggcat
tcctgttacagaccaaggagaactggagaaagaaagagaaaatcagttcgtggttgcata
gtggatgccaatctgagcattctcaacttgattattgttaaaaaaaaaaaaaaagtaaag
aaggatattcctggactgactgatactatggtgccttgtcgcctggggcccaaaaaagct
agcagaatctgcaaactttccaatctctctgaagaagatgatgtccgccagtatgttgta
agaaagccctcaaacaaaggaggtaagaaacctaggaccaaagcacccaagattcagcat
cttgttactccacatttcctgcagcacaaagggcagcatattgctctgaagaagccgtgt
actaagaaaaataaggaagaggctgcagaatatgctaaacttttaggcaagggaatgaag
gaggctaaagagaagcgccaggaacaaattgcgaagagacacagactttcctctctgtga

>gi568815586f:12713368_12929551|GENSCAN_predicted_peptide_6|203_aa
MAEVQVLVLDGRGHLLGHLAAIVAKQVLLGRKVVVVCCEGINISGNFYRNKLKYLAFLRK
RMNSNPSRGPYPLQAPSRIFWQTMRGMPPHKTKPGQAALDCLKVFDGIPPPYDKKKRMVV
PAALKVVRLKPARKFAYLGRLAHEVGWKYQAVTATLEEKRKEKAKIHYRKKKQFMRLWKQ
AEKNVEKKIDKYTEVLKTHGLLV

>gi568815586f:12713368_12929551|GENSCAN_predicted_CDS_6|612_bp
atggcggaggtacaggtcctggtgcttgatggtcgaggccatctcctgggccacctggcg
gccatcgtggctaaacaggtactgctgggccggaaggtggtggttgtatgctgtgaaggc
atcaacatttctggcaatttctacagaaacaagttgaagtacctggctttcctccgcaag
cggatgaacagcaacccttcccgaggcccctaccccctccaggcccccagccgcatcttc
tggcagaccatgcgaggtatgccgccccacaagaccaagccaggccaggccgctctggac
tgcctcaaggtgtttgacggcatcccaccgccctacgacaagaaaaagcggatggtggtt
cctgctgccctcaaggttgtgcgtctgaagcctgcaagaaagtttgcctatctggggcgc
ctggctcacgaggttggctggaagtaccaggcagtgacagccaccctggaggagaagagg
aaagagaaagccaagatccactaccggaagaagaaacagttcatgaggctatggaaacag
gccgagaagaacgtggagaagaaaattgacaaatacacagaagtcctcaagacccacgga
ctcctggtctga

>gi568815586f:12713368_12929551|GENSCAN_predicted_peptide_7|251_aa
MRALPGYTVNVEEEDPISFKGGERKELRPCALHCVSPHPGRLITRVLSSRKGCQKLHFGS
EIRSAWPCLNPAPILVAAGGSQESMNPEVLLGEGVSAARLCEQPIVSEVAEVFPFRNEPT
AVADSAEPFNRRFKSGGPSALGSWRLTYPSAKALAVNLVLLPARKRAAGELLSAGGDLGQ
LSRPRTLSSCYKGGPRRREGTSEGIKPIGPKLVFVYVWNDPGVAARLCRGSSQQQYTFEM
YTFEIFAYASM

>gi568815586f:12713368_12929551|GENSCAN_predicted_CDS_7|756_bp
atgagggctctcccaggctacaccgtgaatgtggaagaggaagaccctatttccttcaag
ggcggcgagcggaaggaactcaggccctgtgcacttcactgtgtctctccacaccccggc
agattaatcaccagggtgttgtctagcaggaaaggctgccaaaaattgcactttgggtct
gagattaggagcgcctggccatgtctgaaccccgctcctatcttagtggccgcaggtggc
tcccaggaaagcatgaatcctgaggttttactgggagaaggggtttcagctgccaggctg
tgtgaacaacccattgtgtctgaagtagctgaagtgttccccttcaggaatgaacccaca
gcagtggctgattcggccgaacccttcaacagacgatttaaatccggaggccccagcgct
ctgggctcctggcgcctcacttaccctagtgccaaggcgttggccgtgaacttggtgctg
cttcccgcgcgcaagagggcagcaggcgagctcctcagtgctgggggagaccttggacag
ctatcccgccctcgcactctgagcagttgttataaaggcggccctcgccggagggaggga
acgagcgaggggatcaagccaatcggaccgaaactcgtctttgtttacgtgtggaacgat
cctggagtggctgcccgcctgtgtcggggctcaagccagcaacaatacacctttgaaatg
tacacctttgaaatttttgcatatgccagcatgtga

>gi568815586f:12713368_12929551|GENSCAN_predicted_peptide_8|413_aa
MATTVPDGCRNGLKSKYYRLCDKAEAWGIVLETVATAGVVTSVAFMLTLPILVCKVQDSN
RRKMLPTQFLFLLGVLGIFGLTFAFIIGLDGSTGPTRFFLFGILFSICFSCLLAHAVSLT
KLVRGRKPLSLLVILGLAVGFSLVQDVIAIEYIVLTMNRTNVNVFSELSAPRRNEDFVLL
LTYVLFLMALTFLMSSFTFCGSFTGWKRHGAHIYLTMLLSIAIWVAWITLLMLPDFDRRW
DDTILSSALAANGWVFLLAYVSPEFWLLTKQRNPMDYPVEDAFCKPQLVKKSYGVENRAY
SQEEITQGFEETGDTLYAPYSTHFQLQMALTLQASTLMASQPCKGSHLKSVSCVGHQAEL
QTPSENGNALHSWDGPSPFSAGRPLRKCGQPTPREESGEKEHKTPEACQRIKP

>gi568815586f:12713368_12929551|GENSCAN_predicted_CDS_8|1239_bp
atggctacaacagtccctgatggttgccgcaatggcctgaaatccaagtactacagactt
tgtgataaggctgaagcttggggcatcgtcctagaaacggtggccacagccggggttgtg
acctcggtggccttcatgctcactctcccgatcctcgtctgcaaggtgcaggactccaac
aggcgaaaaatgctgcctactcagtttctcttcctcctgggtgtgttgggcatctttggc
ctcaccttcgccttcatcatcggactggacgggagcacagggcccacacgcttcttcctc
tttgggatcctcttttccatctgcttctcctgcctgctggctcatgctgtcagtctgacc
aagctcgtccgggggaggaagcccctttccctgttggtgattctgggtctggccgtgggc
ttcagcctagtccaggatgttatcgctattgaatatattgtcctgaccatgaataggacc
aacgtcaatgtcttttctgagctttccgctcctcgtcgcaatgaagactttgtcctcctg
ctcacctacgtcctcttcttgatggcgctgaccttcctcatgtcctccttcaccttctgt
ggttccttcacgggctggaagagacatggggcccacatctacctcacgatgctcctctcc
attgccatctgggtggcctggatcaccctgctcatgcttcctgactttgaccgcaggtgg
gatgacaccatcctcagctccgccttggctgccaatggctgggtgttcctgttggcttat
gttagtcccgagttttggctgctcacaaagcaacgaaaccccatggattatcctgttgag
gatgctttctgtaaacctcaactcgtgaagaagagctatggtgtggagaacagagcctac
tctcaagaggaaatcactcaaggttttgaagagacaggggacacgctctatgccccctat
tccacacattttcagctgcagatggcgctgacacttcaggcatcaaccctcatggcctct
cagccttgcaaaggcagccacttaaagtcggtgtcctgtgtggggcaccaagctgagctg
cagacacccagtgaaaatggaaatgctcttcactcttgggatgggccctcaccttttagt
gctggcaggccactgcgcaagtgtggacagcctacaccaagggaagagtcaggggagaag
gaacacaagactccggaagcatgccagcgtataaaaccn