GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:10:19 Sequence gi568815583f:92611808_92812107 : 200300 bp : 44.78% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2930 3007 78 1 0 42 68 89 0.787 3.37 1.02 Intr + 4492 4656 165 0 0 118 41 36 0.586 2.06 1.03 Intr + 4756 4803 48 0 0 74 100 40 0.631 2.68 1.04 Intr + 5994 6248 255 2 0 62 105 48 0.296 1.54 1.05 Intr + 6629 6664 36 1 0 108 94 -7 0.443 0.36 1.06 Intr + 10246 10458 213 0 0 0 97 172 0.361 8.21 1.07 Term + 11698 11859 162 0 0 73 45 79 0.530 0.04 1.08 PlyA + 13133 13138 6 1.05 2.04 PlyA - 13953 13948 6 1.05 2.03 Term - 14767 14692 76 0 1 101 37 12 0.064 -5.29 2.02 Intr - 18538 18407 132 0 0 149 92 208 0.889 27.06 2.01 Init - 26863 26688 176 1 2 74 68 115 0.964 6.92 2.00 Prom - 33824 33785 40 -2.46 3.03 PlyA - 34823 34818 6 1.05 3.02 Term - 41791 41656 136 0 1 41 43 117 0.463 -0.01 3.01 Init - 43852 43509 344 1 2 64 80 681 0.991 59.61 3.00 Prom - 53467 53428 40 -2.36 4.06 PlyA - 54410 54405 6 1.05 4.05 Term - 55210 55115 96 1 0 72 39 67 0.756 -1.83 4.04 Intr - 55498 55356 143 2 2 70 81 92 0.713 6.77 4.03 Intr - 55775 55664 112 2 1 89 80 56 0.719 4.85 4.02 Intr - 66402 66323 80 1 2 59 95 51 0.183 2.17 4.01 Init - 68736 68679 58 0 1 72 98 23 0.176 3.18 4.00 Prom - 73912 73873 40 -2.96 5.00 Prom + 88577 88616 40 -4.46 5.01 Sngl + 88631 89194 564 1 0 82 46 165 0.958 7.85 5.02 PlyA + 89696 89701 6 1.05 6.00 Prom + 91756 91795 40 -3.26 6.01 Init + 98532 98552 21 2 0 76 89 58 0.474 2.77 6.02 Term + 99887 100303 417 1 0 -5 38 389 0.645 19.88 6.03 PlyA + 101079 101084 6 1.05 7.00 Prom + 101546 101585 40 -5.26 7.01 Sngl + 102755 103138 384 1 0 63 49 202 0.350 7.99 7.02 PlyA + 103932 103937 6 1.05 8.03 PlyA - 110713 110708 6 1.05 8.02 Term - 113226 113210 17 1 2 100 45 22 0.203 -2.50 8.01 Init - 122388 122178 211 0 1 98 95 74 0.622 8.05 8.00 Prom - 123774 123735 40 -2.76 9.00 Prom + 123781 123820 40 -7.26 9.01 Init + 124963 125060 98 0 2 69 68 108 0.723 6.68 9.02 Term + 143681 144098 418 2 1 26 44 277 0.583 11.85 9.03 PlyA + 144723 144728 6 1.05 10.00 Prom + 144850 144889 40 -5.96 10.01 Sngl + 145721 146182 462 1 0 66 42 162 0.639 5.56 10.02 PlyA + 146186 146191 6 1.05 11.00 Prom + 148483 148522 40 -2.46 11.01 Init + 151705 152111 407 0 2 109 78 151 0.684 12.31 11.02 Intr + 177075 177157 83 0 2 72 101 49 0.514 3.88 11.03 Term + 178178 178242 65 0 2 75 42 60 0.525 -1.85 11.04 PlyA + 179909 179914 6 1.05 12.00 Prom + 182316 182355 40 -1.96 12.01 Init + 184189 184750 562 0 1 45 25 448 0.454 29.38 12.02 Term + 197787 197929 143 1 2 11 53 132 0.128 0.09 12.03 PlyA + 198253 198258 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_1|318_aa MVEVVVLVATIPGGAGGAGGARHLQQTLIFWVISPDFLKMTPTAKNFTKGGKHGTQTYGC IQGPLRIMCHVRIKHPPSPQGVFHGTAGETDFVEEVKMGENREGKSTKKVKCREQRLHAG PPWLAIPCVLGRLRHPHPGLPTGSALSCRGETAFLAGGGCRAPMPLPTYPNSSQKTEEPK MEGPPGSCGVSWVQACGLWGKQPAESTEMQTLYRCHAEIHLRKCLGIPRDCSESYGDPLP GSLPSPHPQVAVYIRVLKKRPHIHKDDTGLQPSHTRRVTCPNLLLAHVLQEACPAGKIAL NTGTPVAAPVSSVWHPGS >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_1|957_bp atggtggaggtggtggtgctagtggcgaccattcctggtggtgctggtggtgccggtggt gccaggcacctgcagcagaccctgatcttctgggtaatttcccctgatttccttaaaatg acaccgacagccaagaatttcaccaaagggggcaagcacgggacacaaacttatgggtgc atccagggccctttgaggataatgtgccatgtccgtataaagcatcccccatccccacag ggtgtgttccacggcacagcaggggagacagattttgtagaagaggtcaagatgggggaa aacagagaaggaaagtcaacaaagaaggtgaaatgcagggagcagagactacacgcaggc cccccgtggctggcaataccatgcgtgctgggccggctgcgccaccctcacccagggctt cccacgggctctgctctttcctgcagaggcgaaactgccttcctggcaggcggaggctgc cgagctcccatgccccttcccacatatcccaactcttctcagaaaactgaagaaccgaag atggagggacctcctgggtcctgtggggtctcctgggtccaggcctgcgggctctgggga aagcagccggcagaatcaacggagatgcaaactctgtaccgctgccatgctgaaattcac ctccgtaagtgtctgggcatcccacgggactgctctgaatcttacggggatcccctacca ggctctctgccctcacctcacccccaggttgctgtctacatccgtgtgcttaagaaacgc ccacacatacacaaggatgacactgggttacaaccatcccacacgaggagagtgacatgt cccaacttgcttctggcccatgttttacaagaagcttgtccggcaggcaaaatagccctc aacacaggaacacctgttgcagcacctgtcagtagcgtctggcaccctggcagctga >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_2|127_aa MDEEIRVHDERLAQKHTNRGIAGLTPEPQSLHFELVALTVTLKTELSLKGLVLFYSLRGS GKRLKKTRKYDIITTPAERVEMAPLNEEDDEDEDSTVFDIKYRAASFSLLSFWGQCQKQI SLQAAIS >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_2|384_bp atggatgaagaaattcgggttcatgacgagaggcttgctcagaagcacaccaacagggga attgcagggctgacccctgaacctcagtcgctgcactttgagttggtggctctgactgtc acactcaaaactgaactgagtctaaagggtttagtcctgttttattctcttcgtgggtcg ggaaagaggttaaagaagacacgcaagtatgatatcatcaccactccagcagagcgagtg gaaatggcgccactaaatgaagaggatgatgaagatgaggactccacagtattcgacatc aaatacagggctgccagcttctctctgctgtcattctggggacagtgtcaaaaacagatt tctctgcaagccgccatctcatga >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_3|159_aa MRAVPLPAPLLPLLLLALLAAPAARASRAESVSAPWPEPERESRPPPGPGPGNTTRFGSG AAGGSGSSSSNSSGDALVTRISILLRDLPTLKAAVIVAFAFTTLLIACLLLRVFSRGPLR LGSPVHLRVSMPDAGLPFSRGAIVIYGLPDCSVGVKVNA >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_3|480_bp atgcgcgccgtgccgctgcccgccccgctcctgccgctgctgctgctcgcgctcctggcc gctcccgccgcccgcgccagcagagccgagtccgtctccgcgccgtggcccgaacccgag cgcgagtcgcggccaccgcccggcccggggcccgggaacaccacccggtttgggtctggg gcggcgggcggcagcggcagctccagctccaacagcagtggcgacgccttggtgacccgc atttccatcctcctccgcgacctacccaccctcaaggcagccgtgatcgtggcgttcgcc tttaccaccctcctcatcgcctgcctgctgctgcgcgtcttcagcagagggcccttgagg ttgggaagtccagttcatcttcgtgtctccatgcccgatgcgggcctgccattcagcaga ggtgccattgtcatctacggacttcctgattgttcagttggagtgaaggttaatgcctag >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_4|162_aa MDGCYMSVLAAYFPGLIPKESLCKKISKMVRTRCGSLTLDFLVPMMRPELCLGIKTQIPA NVRSNSTVTERLTMPAVGIAVGTAFLGKDVCGLLSSALLVFASRSKATLMCMGKVLHCER NCGQMVHLSSKALLGDTSETFIEIFLFGNSSLFPWFDDTSQL >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_4|489_bp atggatggctgttacatgtcagtactggcagcttactttccaggtcttatccctaaagag tctctatgtaaaaagatcagcaagatggtccgcaccagatgtggctccttgaccttggac ttcttagttcccatgatgaggcccgagctctgcctgggaattaaaacacaaattcctgca aacgtgaggtctaactccactgtcacggagaggctgacgatgccagctgtgggaattgca gtggggacagcttttctggggaaagatgtctgtggcctgcttagctctgccctgctggtc ttcgcctctcgatccaaagcaactttgatgtgtatgggaaaagtactgcactgtgaaagg aactgtggacagatggtccacctgagctcaaaggctctcctaggggacacttcagaaact ttcatagaaatatttctctttggcaacagctcgctgtttccatggtttgacgatactagc cagctgtaa >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_5|187_aa MGKDFMSKTPKAMATKDKIDQWDLIKLKSFCTAKETTIRVNTQPTEWEKIFAIHPSDKGL ISRIYKELKQIYKKKSNNPVKKWAKDMNRHSSKEDIYAANRHTKKYSSSLDIREMQIKTT MRYHLTPVGMAIIKKSGNNRDWRGCGERGTLLYCWWDCKLVQPLWKTVWQFLKDLELEIP FDPAIPL >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_5|564_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgac caatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacacgcaacctacagaatgggagaaaatttttgcaatccacccatctgacaaagggcta atatccaggatctacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccgtc aaaaagtgggcaaaggatatgaacagacactcctcaaaagaagacatttatgcagccaat agacacacgaaaaaatactcatcatcactggacatcagagaaatgcaaatcaaaaccacc atgagataccatctcacaccagttggaatggcaatcattaaaaagtcaggaaacaacagg gactggagaggatgtggagaaagaggaacacttttatactgttggtgggactgtaaacta gtccaaccattgtggaagacagtgtggcaattcctcaaggatctagaactagaaatacca tttgacccagccatcccattatag >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_6|145_aa MKPRTFALSSSGGGRSGSGKAAQFREGSRRAAARRHPARAFPAARMPKRKVSSAEGAAKE ESKRRSARLSAKPPAKVEAKPKKAAAKDKSSDKKAQTKGKRGAKGKQAEVANQETKEDLP AENGETKTEESPASDEAGEKEAKSD >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_6|438_bp atgaagccgcggaccttcgcgctcagcagctcgggcggcgggaggagcggcagcggcaag gcagcccagtttcgcgaaggctctcggcgcgccgcggcccgcaggcacccggcacgcgcc ttccccgccgccaggatgcccaagaggaaggtcagctccgccgaaggcgccgccaaggaa gagtccaagaggagatcggcgcggttgtcagctaaacctcctgcaaaagtggaagcgaag ccgaaaaaggcagcagcgaaggataaatcttcagacaaaaaagcgcaaacaaaagggaaa aggggagcaaagggaaaacaggccgaagtggctaaccaagaaactaaagaagatttacct gcagaaaacggggaaacgaaaactgaggagagtccagcctctgatgaagcaggagagaaa gaagccaagtctgattaa >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_7|127_aa MAGVRARAPLPLALLLSLPAAPGGRDPSASRARFPQRLGRAPCFEVGLRKPPPPPLLSPP SFSSGSSRPLQRPRGPKDGAGRKVCAKLVKRLPGESGSCEDGQSAPAQPPRRRTGTRACP PRAPLWR >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_7|384_bp atggccggagtccgggcgcgagcgccgcttcccctcgccctcctcctctcccttcccgcg gcccctggaggccgagacccatccgcgtcccgcgcgcgctttcctcagaggctcggccgg gcgccctgcttcgaagtgggcctgaggaagcctccgcctccgccgctgctttctccccct tccttctccagcggctcctccaggccgctccagagaccccgaggacccaaggacggtgcc gggcggaaagtctgtgcgaagctggtgaaaaggttgcccggagagagcggaagctgtgag gatgggcaaagcgcccctgcccagccgccccgccgcagaaccggtacacgcgcctgcccg ccccgggcgcccctttggcgctga >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_8|75_aa MAGGKAGKDSGKAKAKAVSRSQRAGLQFPVGRIHRHLKTRTTSHGRVGATAAVHRAAIPG YLTAEVLELAGVGYG >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_8|228_bp atggccggaggcaaagctggaaaggacagtgggaaggccaaggctaaggcagtatctcgc tcacagagagctgggctacagtttcctgtgggccgcatccacagacacttgaagactcgc accacaagccatggaagggtgggtgccactgctgccgtgcacagggctgcgattccgggg tacctcactgcagaggtgctggagctggcaggagtgggctatgggtag >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_9|171_aa MENATIFSIPIDGLKDQQELDSKIKEANHGDTRQTGSGVDLQQNPTDLQLRVLTVRRKTN KQKGHPHQNPICTSPSSKTKVDKTTKTGRNQSRKAENSKNQSTSSPPKEHSSLPAMEQSW TENDFDELREEGFRRSVKTNFSELKEDVQTHRKEAKNLEKRLDEWLTESTV >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_9|516_bp atggaaaatgccaccatctttagtattcctatagatggcctcaaagatcagcaagagctg gacagcaaaattaaagaggccaatcatggagataccaggcaaacagggtctggagtggac ctccagcaaaatccaacagacctgcagctgagggtcctgactgttagaaggaaaactaac aaacagaaaggacatccacaccaaaaccccatctgtacatcaccatcatcaaagaccaaa gtagataaaaccacaaagacggggagaaaccagagcagaaaagctgaaaattctaaaaat cagagcacctcttctcctccaaaggaacacagctccttgccagcaatggaacaaagctgg acggagaatgactttgatgagttgagagaagaaggcttcagacgatcagtaaaaacaaac ttctccgagctaaaggaggatgttcaaacccatcgcaaagaagctaaaaaccttgaaaaa agattagacgaatggctaactgaatcaacagtgtag >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_10|153_aa MKAEIKMFFETNKNKNTTYQNLWDTFKVVCRGKFIALNAHKRKQERSKIDTLPSQLKELE KQEKTHSKASRRQEIAKIRAELKEIETQKALQKINESRSWFFEKINKIDRPLARLIKKKR EKNQIDEIKNDKGENTTDPPEIQTTIREYYKHL >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_10|462_bp atgaaggcagaaataaagatgttctttgaaaccaataagaataaaaacacaacataccag aatctctgggacacatttaaagtagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatcgaaaattgacaccctaccatcacagttaaaagaactagag aagcaagagaaaacacattcaaaagctagcagaaggcaagaaatagctaagatcagagca gaactgaaggaaatagagacacaaaaagcccttcaaaaaatcaatgaatccaggagctgg ttttttgaaaagatcaacaaaattgatagaccactagcaagactaataaagaagaagaga gagaagaatcaaatagatgaaataaaaaatgataaaggggagaacaccaccgatccccca gaaatacaaactaccatcagagaatactataaacacctctag >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_11|184_aa MEATTVLEKNLIQALLELQARGSAYTDPHLCDFLKSVFIDEEVKLIKKMGNHPTNLFRLS GPQADWTSVSLKGSTLSTSRSLLNPATPEGPLSKHQGFCLRLSLQPLGSFLTTLEPSPCL GPNRNKAFCNSKKTGSAGIVSGVGSFWWVRGLADFKNGAADLRDSLEETETQKLALEQLL REIV >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_11|555_bp atggaggccaccacagtcctggagaagaacctgatccaggcccttttggagcttcaggcc cggggttctgcctacacagacccccatctctgtgacttcctaaagagcgtcttcatagat gaggaagtgaaactcattaaaaagatgggcaatcacccgaccaacctcttcaggctgtct ggcccccaggccgactggaccagtgtctctttgaaaggctcaaccttaagcacatctagg agcctactgaacccagcaactccagaagggcccctctcaaagcatcagggcttctgcctg agactctccctccagccactgggcagctttttaaccaccctagagccctctccatgcctt ggaccaaatagaaataaagctttttgcaacagcaaaaaaacagggagtgctgggattgtg tctggagttggttccttctggtgggttcgtggtctcgctgacttcaagaacggggccgcg gacctccgcgacagtttagaagaaacggaaactcagaagcttgcacttgagcagctgcta agagaaattgtataa >gi568815583f:92611808_92812107|GENSCAN_predicted_peptide_12|234_aa MLLQHGASVQPESDLASPVHEAARRGHVECVDSLTAYRGKNDHNISHVGTSLYLACENQQ IACVKKLLESGADLNPGRGSPLHAVAFMKALMKDSPLHAVARTASEELACLLMDFGADTQ AKNAEGKCHVELVPPESPLIQLFLEREGPPSFDAVMPRNQKGFGIQQHHKITKVVLPEDL KWFLLHLCPAGNPAAQFRKAASVLTSRGPFLLSLRLLLRANKEDRKLFIVAFSS >gi568815583f:92611808_92812107|GENSCAN_predicted_CDS_12|705_bp atgcttctgcagcatggagccagcgttcaacctgagagtgatctggcatcccccgtccat gaagctgctaggagaggccacgtggagtgtgtcgactctcttacagcttataggggcaaa aatgaccataacatcagccacgtgggcacttcactgtatttggcttgtgaaaaccagcag atagcctgtgtcaagaagcttctggagtcaggagcagacctgaacccagggagaggttcc ccacttcatgcagtggccttcatgaaggccctcatgaaggattccccacttcatgcagtg gccaggacagccagtgaagagctggcctgcctgctcatggattttggagcagacacccag gccaagaatgctgaaggcaaatgtcatgtggagctggtgcctccagagagccctttgatc cagctcttcttggagagagaagggcccccttcttttgatgcagttatgcctagaaatcag aagggctttggaatccagcagcatcataagataaccaaagtcgtcctcccagaggatctg aaatggtttctcctacatctttgccctgcggggaatcccgcggcgcagttccgcaaagct gccagcgtgcttaccagtcggggcccgtttcttttgtctctgcgtttgctgctccgtgca aacaaggaggacagaaaactctttatcgtggcgttttcttcctga