GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:35:01 Sequence gi568815584f:60679978_60880592 : 200615 bp : 37.94% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 980 975 6 1.05 1.02 Term - 8900 8716 185 0 2 35 44 234 0.839 10.42 1.01 Init - 16311 16263 49 0 1 86 58 40 0.433 -0.04 1.00 Prom - 30830 30791 40 -6.35 2.05 PlyA - 31949 31944 6 1.05 2.04 Term - 34226 33430 797 0 2 100 43 573 0.946 46.45 2.03 Intr - 40468 39729 740 0 2 80 87 410 0.630 30.25 2.02 Intr - 42168 41776 393 2 0 -6 0 319 0.463 6.54 2.01 Init - 44034 43235 800 0 2 35 105 867 0.556 75.32 2.00 Prom - 44928 44889 40 -8.45 3.00 Prom + 50729 50768 40 -5.15 3.01 Sngl + 50783 51169 387 1 0 71 39 186 0.897 7.96 3.02 PlyA + 51848 51853 6 1.05 4.03 PlyA - 52604 52599 6 1.05 4.02 Term - 54998 54689 310 1 1 116 49 326 0.983 25.05 4.01 Init - 61898 61825 74 2 2 59 92 20 0.572 0.09 4.00 Prom - 75838 75799 40 -3.15 5.00 Prom + 76841 76880 40 -2.85 5.01 Init + 100001 100280 280 1 1 97 65 262 0.642 22.02 5.02 Term + 100473 100618 146 1 2 41 49 103 0.360 -1.21 5.03 PlyA + 102868 102873 6 1.05 6.00 Prom + 108908 108947 40 -7.05 6.01 Init + 111657 111724 68 2 2 48 97 55 0.976 3.00 6.02 Intr + 116240 116392 153 2 0 116 95 97 0.983 11.47 6.03 Intr + 118110 118183 74 0 2 70 84 69 0.986 2.83 6.04 Intr + 128348 128451 104 0 2 78 28 86 0.980 0.57 6.05 Intr + 132010 132150 141 0 0 56 84 180 0.997 14.03 6.06 Intr + 138745 138870 126 0 0 60 103 56 0.576 4.26 6.07 Intr + 143042 143173 132 1 0 74 90 35 0.461 2.22 6.08 Term + 149132 149278 147 1 0 114 49 18 0.332 -2.58 6.09 PlyA + 150468 150473 6 1.05 7.04 PlyA - 150787 150782 6 1.05 7.03 Term - 156319 155949 371 2 2 72 47 319 0.973 20.02 7.02 Intr - 159555 159476 80 2 2 73 105 54 0.461 3.88 7.01 Init - 172609 172521 89 1 2 62 91 58 0.294 3.66 7.00 Prom - 172699 172660 40 -4.85 8.04 PlyA - 172816 172811 6 1.05 8.03 Term - 174491 174112 380 0 2 21 41 332 0.553 15.77 8.02 Intr - 174763 174570 194 0 2 34 -20 188 0.876 0.81 8.01 Init - 175777 175719 59 1 2 107 86 75 0.989 10.03 8.00 Prom - 178082 178043 40 -3.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 199217 199079 139 2 1 71 31 141 0.971 7.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:60679978_60880592|GENSCAN_predicted_peptide_1|77_aa MGFLHVGQAGLELLTSGSWRFEQRIGQNVQQSKERMMQRKNKCRDLLQMKVRSTAWERPE QRPKGSRYRIFSGPNNG >gi568815584f:60679978_60880592|GENSCAN_predicted_CDS_1|234_bp atggggtttctccatgttggtcaggctggtctcgaactcctgacctcaggttcttggcgt tttgaacaaagaattggacaaaatgtacagcagagcaaggaaagaatgatgcagcgaaag aacaaatgcagagatttattgcaaatgaaagtacgctccacagcgtgggagcggccagag cagcggcccaagggctccagatacagaatcttctccggtccaaacaacggctag >gi568815584f:60679978_60880592|GENSCAN_predicted_peptide_2|909_aa MESASEGQEAHREVAGGAAVGLSPPAPAPFPLEPGDAATAAARVSGEEGAVAAAAAGAAA DQVQLHSELLGRHHHAAAAAAQTPLAFSPDHVACVCEALQQGGNLDRLARFLWSLPQSDL LRGNESLLKARALVAFHQGIYPELYSILESHSFESANHPLLQQLWYKARYTEAERARGRP LGAVDKYRLRRKFPLPRTIWDGEETVYCFKEKSRNALKELYKQNRYPSPAEKRHLAKITG LSLTQVSNWFKNRRQRDRNPSETQSKRRERIQKQLEGSSRGSKLGAGDPARLRQVKVITS AFQSRSQPALAASGPLWGSDGSGSGVPRPRLGGRLGALSGPRGNQNKRCTGMCVRLPQIC GEVQRRRFGYAEPPALRPLEAADPALGNQRNPQGRGRGESDGNPSTEDESSKGHEDLSPH PLSSSSDGITNLSLSSHMEPVYMQQIGNAKISLSSSGVLLNGSLVPASTSPVFLNGNSFI QGPSGVILNGLNVGNTQAVALNPPKMSSNIVSNGISMTDILGSTSQDVKEFKVLQSSANS ATTTSYSPSVPVSFPGLIPSTEVKREGIQTVASQDGGSVVTFTTPVQINQYGIVQIPNSG ANSQFLNGSIGFSPLQLPPVSVAASQGNNLIWYLNAPANVFISCCNISVSSSTSDGSTFT SESTTVQQGKVFLSSLAPSAVVYTVPNTGQTIGSVKQEGLERSLVFSQLMPVNQNAQVNA NLSSENISGSGLHPLASSLVNVSPTHNFSLSPSTLLNPTELNRDIADSQPMSAPVASKST VTSVSNTNYATLQNCSLITGQDLLSVPMTQAALGEIVPTAEDQVGHPSPAVHQDFVQEHR LVLQSVANMKENFLSNSESKATSSLMMLDSKSKYVLDGMVDTVCEDLETDKKELAKLQTV QLDEDMQDL >gi568815584f:60679978_60880592|GENSCAN_predicted_CDS_2|2730_bp atggaaagcgcctcggaagggcaggaggcgcaccgagaagtggcggggggcgcggcggta gggctgagccccccggctccagccccttttcccctggagccgggggacgccgcgaccgct gccgccagggtgagcggagaggaaggggcagtggcggcggcggcggccggagcggcggcg gatcaggtacaactccactcggaacttctgggcaggcaccaccacgccgccgccgccgcc gcgcagaccccgctggccttctcgcccgaccacgtcgcctgcgtgtgcgaggcactgcag caggggggcaacctggaccgcctggcccggttcctgtggtccctgccccagagcgacctg ctacgtggcaacgagagcctgctgaaggcgcgggcgctcgtggccttccaccagggcatc taccccgagctctacagcatcctcgagagccacagcttcgagtcggccaaccacccgctg ctgcagcagctctggtacaaggcgcgctacaccgaggccgagcgagcccgcggccggccg ctgggagccgtagacaagtaccggctgcgcaggaaattccccctgccccgcaccatctgg gacggcgaggagacggtgtattgtttcaaggagaagtcgcgcaacgcgctcaaggagctc tacaagcagaatcgctacccttcgcccgccgagaagcggcacctggccaagatcaccggc ctctccctcacccaggtcagcaactggttcaagaaccgccggcagcgcgacaggaacccc tccgagacccagtccaaaaggagagaaagaatccaaaagcagctcgaaggttcttctcgg ggaagcaaactgggagccggggatccagcccgcctgcgccaggtgaaggtgatcaccagc gcattccagagccggtctcagcccgcccttgccgcttctgggcccctgtgggggtccgac ggctcgggctccggcgttcctcgcccaaggctgggagggaggcttggtgccctatccggc cctcgcggtaaccaaaacaaaaggtgcaccgggatgtgcgtgcgccttccgcagatatgc ggagaggtccagagaaggcgctttggttacgccgagccacctgccctgcgcccactagag gccgcggatcccgcgctcggaaaccaacggaatccgcagggccggggcagaggtgagtca gatggcaaccccagcactgaagatgaatccagcaagggacatgaggatttatctcctcac ccactctccagttcatctgatggcatcaccaacctcagcctttccagtcatatggagcca gtatatatgcaacaaattggaaatgctaagatatcattaagctcttctggagttctgttg aatggaagcttggtacctgcaagtacttcacctgtcttccttaatggaaattcttttatt cagggacccagtggagttatccttaatggattaaatgtgggaaatacacaggcagtggca ttgaacccaccaaaaatgtcatcaaacattgtgagcaatggtatatccatgactgacata ctggggtctacttcccaggacgtgaaggaattcaaagtcctccagagttctgctaactca gcaaccaccacgtcctacagccccagtgtccctgtctcattcccaggcctgatacccagc actgaggtgaaaagagaaggcattcaaacagtggcttcccaagatggagggtctgtagtg acttttactacaccagtgcaaattaaccagtatggcattgtccagatccccaattccgga gcaaacagccagttccttaatgggagcattggattctctccactgcagctgccccctgtg tcagtggcagcttcacaaggtaacaatctcatttggtaccttaatgcaccagcaaatgtg ttcatcagctgctgtaatatctcagtaagctcaagcacttcagatggaagcacatttaca agtgagtctaccacagtccagcaaggaaaggttttcttgagctctcttgctcccagtgca gtggtatacacggttcctaatacaggccagactataggatctgtgaaacaggaaggcttg gaaaggagcctggtattttctcagttgatgcctgtcaatcagaatgcacaagtaaatgca aacctgtcttctgaaaacatctcggggagtggcctgcatccactggcctcctcattagtt aatgtatctccaactcacaatttttctctcagtccctctacactactaaatcccactgag ctaaaccgcgacattgccgatagccaaccaatgtctgcaccggtggcaagcaaatctact gtgacatctgtcagcaacactaactatgcaactcttcagaactgctcccttattactggt caagacctattgtcagtccctatgactcaggctgcccttggggaaatagttcctacagct gaagatcaggtaggtcacccctccccagcagtacatcaggattttgtccaagaacatcgt ttggttctgcaatcggtagctaacatgaaagagaatttcttatcaaattctgagagcaaa gcaacaagtagcttaatgatgctggactctaaatccaagtatgtcttagatggcatggtt gatactgtctgtgaagacctggaaacagacaaaaaagagcttgccaagctccagactgtc cagctggatgaagatatgcaagacttatga >gi568815584f:60679978_60880592|GENSCAN_predicted_peptide_3|128_aa MGKDFMTKTPKAMATKAKIDKWDLMKLKSFCTAKETTIRMNRQPTDWEKIFGIYSSDKGL ISRIYKELKQIYKKKVKQPHQKVDKGYEQTLFQRRDLCSQQTHETMLIITGHQRNANQNH NEIPPHTS >gi568815584f:60679978_60880592|GENSCAN_predicted_CDS_3|387_bp atgggcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagctaaaatagac aaatgggatctaatgaaactaaagagcttctgcacagcaaaagaaactaccatcagaatg aacaggcaacctacagactgggagaaaatttttggaatctactcatctgacaaagggcta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaagtcaaacaaccccat caaaaagtggacaaaggatatgaacagacacttttccaaagaagagatttatgcagccaa cagacacatgaaacaatgctcatcatcactggtcatcagagaaatgcaaatcaaaaccac aatgagataccacctcacaccagttag >gi568815584f:60679978_60880592|GENSCAN_predicted_peptide_4|127_aa MARPSEVESPRDRHKLVESKWMGARESTAAQLTECVRTHSPSASRRGSDIWWSYTEGNPD RPWRFPSYRLSQAPVSARATYKPPQTRPSRFLPTGAWQRRSLRERAPARGEALQHKRSGS QFPRSCI >gi568815584f:60679978_60880592|GENSCAN_predicted_CDS_4|384_bp atggcaagacccagtgaagtagagtctcccagagacagacacaaattggttgaaagtaaa tggatgggggccagggaatccactgccgcccaactcacagagtgtgtccgcacacattca ccatcagcttcaaggaggggttccgatatttggtggtcttacaccgagggcaaccctgat cgtccatggcggtttccctcctacagactctcgcaggcgcctgtttcagccagagccacc tacaagccccctcagacgcgaccaagcaggttcctaccaacaggcgcttggcagagacgg tcccttcgcgaaagagcaccggcaaggggcgaggcgctgcaacacaaacgttccggcagt cagttcccccggtcttgcatctag >gi568815584f:60679978_60880592|GENSCAN_predicted_peptide_5|141_aa MALQLSREQGITLRGSAEIVDEFFSFGINSILYQRGIYPSEIFTRVQKYGLTLLVTTDLE LIKYLNNVVEQLKDWLYKCSVQKLVVVISNIESDKDLVVPEKWEESGPQFITNSEEVCLC SFTTTIHKVNSMVAYKIPVND >gi568815584f:60679978_60880592|GENSCAN_predicted_CDS_5|426_bp atggcgctgcagctctcccgggagcagggaatcaccctgcgtgggagtgccgaaatcgtg gacgagttcttctcattcggcatcaacagcattttatatcagcgtggcatttatccatct gaaatctttactcgagtgcagaaatacggactcaccttgcttgtaactactgatcttgag ctcataaaatacctaaataatgtggtggaacaactaaaagattggttatacaagtgttca gttcagaaactggttgtagttatctcaaatattgaaagtgacaaagatttggttgtacct gaaaaatgggaagagtcgggaccacagtttattaccaattctgaggaagtctgcctttgt tcatttactactacaatccacaaagtaaatagcatggtggcctacaaaattcctgtcaat gactga >gi568815584f:60679978_60880592|GENSCAN_predicted_peptide_6|314_aa MIIVDKVNWIPWLIGLKSMYANNCESCVDLLFVRGAGNCPECGTPLRKSNFRVQLFEDPT VDKEVEIRKKVLKIYNKREEDFPSLREYNDFLEEVEEIVFNLTNNVDLDNTKKKMEIYQK ENKDVIQKNKLKLTREQEELEEALEVERQENEQRRLFIQKEEQLQQILKRKNKQAFLDEL ESSDLPVALLLAQHKDRSTQLEMQLEKPKPVKPVTFSTGIKMSVGITLCPAEEHTFNDER SFNLSSSWSKFNKPPSTLPFKVLDEERLVELKVPNTLITKLVPFAISPCSEAIQDHPAIL LAYKKTPFWRLQTF >gi568815584f:60679978_60880592|GENSCAN_predicted_CDS_6|945_bp atgattatcgtagacaaagttaactggatcccgtggctaattggcttgaaatccatgtat gcaaataactgtgaaagttgtgtagatttactgtttgtgagaggagctggaaactgccct gagtgtggtactccactcagaaagagcaacttcagggtacaactctttgaagatcccact gttgacaaggaggttgagatcaggaaaaaagtgctaaagatatacaataaaagggaagaa gattttcctagtctaagagaatacaatgatttcttggaagaagtggaagaaattgttttc aacttgaccaacaatgtggatttggacaacaccaaaaagaaaatggagatataccaaaag gaaaacaaagatgttattcagaaaaataaattaaagctgactcgagaacaggaagaactg gaagaagctttagaagtggaacgacaggaaaatgaacaaagaagattatttatacaaaaa gaagaacaactgcagcagattctaaaaaggaagaataagcaggcttttttagatgagctg gagagttctgatctccctgttgctctgcttttggctcagcataaagatagatctacccaa ttagaaatgcaacttgagaaacccaaacctgtaaaaccagtgacgttttccacaggcatc aaaatgagtgttgggattacactgtgcccggctgaggagcacacttttaatgatgaaaga agtttcaatctcagctcatcttggagcaaatttaataaacctccctctactctgcctttt aaagtattggatgaagagaggttggtggaactgaaagttcccaacacactaattacaaag ttggttccctttgcaataagcccctgttctgaggctatccaggaccacccagccatctta ttagcatacaaaaagacaccattttggagactacaaacgttttag >gi568815584f:60679978_60880592|GENSCAN_predicted_peptide_7|179_aa MATKAKTDKWDLTKLKSFCMAKETIIRMNSSFSPAIHLVQSFYPTSRKNEVCGQLEGSQL LTSKGTNWTENEFDELTEVGFTRWIITNSSELKEHDITQRKEAKNLEKRLEELLTRISSL KKNINDLMELKNTAQELREAYKSINNQIDQGEERISEIEDQFNETKCEDEIREKNNEKE >gi568815584f:60679978_60880592|GENSCAN_predicted_CDS_7|540_bp atggcaacaaaagccaaaactgacaaatgggatctaaccaagctaaagagcttctgcatg gcaaaagaaactatcatcagaatgaacagctctttcagtcccgccattcacctggtccag agtttttatcccacatctaggaagaatgaggtatgtggacaactggaaggatcacaactc ctcaccagcaagggaacaaactggacagagaatgagtttgatgaactgacagaagtaggc ttcacaaggtggataataacaaactcctctgagctaaaggagcatgatataacccaacgc aaggaagctaagaaccttgaaaaaaggttagaggaattgctaaccagaataagcagttta aagaagaacataaatgacctgatggagctgaaaaacacagcacaagaacttcgtgaggca tacaaaagtatcaataaccaaatcgatcaaggggaagaaaggatatcagagatagaagat caatttaatgaaacaaagtgtgaagacgagattagagaaaaaaataatgaaaaggaatga >gi568815584f:60679978_60880592|GENSCAN_predicted_peptide_8|210_aa MAEQEQLQSASLSETNAEGRGQQTRHTGELRLASGRCPSGTKLPEEGAGSNLCYSAGFTG DTQVNRVWSGPPANCSRPAEEGTDCHQLQRSKVDKSTTMRKNLRKNAENSKNQNASSSPN DYNSSQARAQNWMENEIDELTEVGFRSWEITNSSELKEHVLTQCKKVKNLDKKLQELLTR IPSLERNINDMMELKNTARELREAYTSINS >gi568815584f:60679978_60880592|GENSCAN_predicted_CDS_8|633_bp atggctgaacaggaacagctacagtctgcatctctcagtgagaccaacgcagaaggaaga ggtcaacagacacgtcatacaggagagctcaggctggcatcaggtcggtgcccctctggg acaaagcttccagaggaaggagcaggcagcaatctttgctattctgcgggcttcactggt gatacccaagtgaacagggtctggagtggacctccagcaaactgcagcagacctgccgaa gaagggactgactgtcatcagcttcaaagatcaaaggtagataaatccacgacgatgagg aaaaacctgcgtaaaaatgctgaaaattccaaaaaccagaatgcctcttcttctccaaat gattacaactcctctcaagcaagggcacaaaactggatggagaatgagattgatgaactg acagaagtaggcttcagaagctgggaaataacaaactcctctgagctaaaggagcacgtt ctaactcaatgcaagaaagttaaaaaccttgacaaaaagttacaagaactgctaactaga atacccagtttagagaggaatataaatgacatgatggagctgaaaaacacagcacgagaa cttcgtgaagcatacacaagtatcaatagctga