GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:39:12

Sequence gi568815591f:95385972_95637756 : 251785 bp : 38.45% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    416    579  164  1  2   70   72   100 0.842   5.85
 1.02 Term +    898   1276  379  1  1   31   52   223 0.734   6.28
 1.03 PlyA +   1424   1429    6                               1.05

 2.04 PlyA -   2679   2674    6                               1.05
 2.03 Term -   4238   4135  104  0  2  106   42    56 0.530   0.26
 2.02 Intr -   8743   8673   71  0  2   66   76   138 0.649   8.31
 2.01 Init -  10379  10306   74  2  2   98   95   161 0.999  16.42
 2.00 Prom -  13167  13128   40                              -6.35

 3.06 PlyA -  13648  13643    6                               1.05
 3.05 Term -  19517  19359  159  2  0   53   32    85 0.404  -3.64
 3.04 Intr -  21097  21016   82  0  1   76   93    65 0.577   4.52
 3.03 Intr -  24130  23930  201  0  0  114   16   129 0.682   5.78
 3.02 Intr -  25808  25682  127  0  1   97   75    64 0.984   4.82
 3.01 Init -  26446  26341  106  1  1   65  107    52 0.879   5.23
 3.00 Prom -  33066  33027   40                              -5.05

 4.03 PlyA -  34595  34590    6                               1.05
 4.02 Term -  41489  41159  331  1  1   95   33   138 0.841   2.14
 4.01 Init -  48980  48907   74  2  2  103   96   192 0.609  20.09
 4.00 Prom -  53049  53010   40                              -7.25

 5.00 Prom +  54254  54293   40                              -6.45
 5.01 Init +  64794  64867   74  2  2   65   81    31 0.124   0.69
 5.02 Intr +  74574  74814  241  0  1   17   75   211 0.312   9.33
 5.03 Intr +  74920  75321  402  0  0   55   57   200 0.007   7.30
 5.04 Intr +  85707  85785   79  2  1   68   64    66 0.110   0.41
 5.05 Intr +  87393  87520  128  1  2   48   45    88 0.741  -0.02
 5.06 Term +  87722  87838  117  1  0   64   46   153 0.687   6.26
 5.07 PlyA +  88006  88011    6                               1.05

 6.00 Prom +  92132  92171   40                              -6.85
 6.01 Init + 100001 100187  187  1  1   82   93   154 0.999  14.57
 6.02 Intr + 109787 110086  300  0  0  129   99   112 0.968  12.18
 6.03 Intr + 115374 115427   54  1  0   75   91    61 0.873   3.13
 6.04 Intr + 118367 118420   54  0  0   75   91    61 0.802   3.13
 6.05 Term + 118937 119013   77  0  2   35   47    93 0.381  -3.08
 6.06 PlyA + 119590 119595    6                               1.05

 7.00 Prom + 119607 119646   40                              -5.65
 7.01 Init + 127163 127202   40  1  1   87  116    64 0.890   9.50
 7.02 Intr + 141842 142332  491  0  2  107  100   729 0.124  67.80
 7.03 Intr + 150466 150579  114  0  0   97   84    97 0.807   9.92
 7.04 Term + 164692 164805  114  0  0   76   33    64 0.045  -2.71
 7.05 PlyA + 165028 165033    6                               1.05

 8.02 PlyA - 165649 165644    6                               1.05
 8.01 Sngl - 166606 166238  369  1  0   97   48   268 0.719  19.08
 8.00 Prom - 172562 172523   40                              -8.35

 9.00 Prom + 174755 174794   40                              -3.35
 9.01 Sngl + 181474 181869  396  0  0   69   35   209 0.735   9.70
 9.02 PlyA + 181979 181984    6                               1.05

10.12 PlyA - 183539 183534    6                               1.05
10.11 Term - 199810 199670  141  1  0  113   49   122 0.950   7.75
10.10 Intr - 201152 201039  114  2  0  101   63   109 0.998   9.42
10.09 Intr - 201557 201447  111  2  0   95   91    98 0.999  10.46
10.08 Intr - 201854 201756   99  2  0   77   86   111 0.989   9.19
10.07 Intr - 203301 203212   90  0  0  102   24    58 0.472   0.07
10.06 Intr - 203745 203669   77  1  2   78   86    10 0.975  -1.88
10.05 Intr - 206626 206540   87  2  0   79   95    72 0.983   6.02
10.04 Intr - 206973 206789  185  2  2   15   98   118 0.729   4.01
10.03 Intr - 207799 207728   72  0  0  110   63    75 0.710   4.80
10.02 Intr - 209193 209052  142  1  1   78  119    83 0.992   8.89
10.01 Init - 210322 210193  130  1  1   90   77   266 0.996  24.06
10.00 Prom - 212179 212140   40                              -2.25

11.02 PlyA - 212843 212838    6                               1.05
11.01 Sngl - 219723 219367  357  0  0   72   39   190 0.053   6.83
11.00 Prom - 234223 234184   40                              -3.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_1|180_aa
MDKFLDTYILPRLNQEKVESLNRPITGSEIEAIINSLPTNESPGPDGFTDKFYQRTNDKN
HMIISIDAEKAFDKIQQPFMLKTLNKLGIDRTYLKIIRAIYDKSTANIILNRQKLEAFHL
KTCTRQGCPLSPLLFNIVLEILARAIGQEKEIKGIQLGKEDGKLSLSADDMIVYLENPII

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_1|543_bp
atggataaattcctcgacacatacatcctcccaagattaaaccaggaaaaagttgaatct
ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcctaccaaccaat
gaaagtccaggaccagatggattcacagacaaattctaccagagaaccaatgacaaaaac
cacatgattatctcaatagatgcagaaaaggccttcgacaaaattcaacagcccttcatg
ctaaaaactctcaataaactaggtattgatagaacgtatctcaaaataataagagctatt
tatgacaaatccacagccaatatcatactgaataggcaaaaactggaagcattccatttg
aaaacctgcacaagacaaggatgccctctctcaccactcctattcaacatagtattggaa
attctggccagggcaattggacaagagaaagaaataaagggtattcagttaggaaaagag
gatggcaaattgtctctgtctgcagatgacatgattgtatatttagaaaatcccatcatc
tga

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_2|82_aa
MGKLVALVLLGVGLSLVGEMFLAFRERVNASREVEPVEPENCHLIEELESGSEDIDILPS
GLAFISSVSIFHAALMLTSIAV

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_2|249_bp
atggggaagctcgtggcgctggtcctgctgggggtcggcctgtccttagtcggggagatg
ttcctggcgtttagagaaagggtgaatgcctctcgagaagtggagccagtagaacctgaa
aactgccaccttattgaggaacttgaaagtggctctgaagatattgatatacttcctagt
gggctggcttttatctccagtgtgagtattttccatgctgctctcatgctcacatcaata
gctgtatag

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_3|224_aa
MDLKEEKPRARELRISRGFDLASFNPHGISTFIDNDDTVYLFVVNHPEFKNTVEIFKFEE
AENSLLHLKTVKHELLPSVNDITAVGPAHFYATNDHYFSDPFLKYLETYLNLHWANVVYY
SPNEVKVVAEGFDSANGINISPDDKYIYVADILAHEIHVLEKHTNMNLTQLKVLRIQNIL
SEKPTVTTVYANNGSVLQGSSVASVYDGKLLIGTLYHRALYCEL

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_3|675_bp
atggatctaaaagaagaaaaaccaagggcacgggaattaagaatcagtcgtgggtttgat
ttggcctcattcaatccacatggcatcagcactttcatagacaacgatgacacagtttat
ctctttgttgtaaaccacccagaattcaagaatacagtggaaatttttaaatttgaagaa
gcagaaaattctctgttgcatctgaaaacagtcaaacatgagcttcttccaagtgtgaat
gacatcacagctgttggaccggcacatttctatgccacaaatgaccactacttctctgat
cctttcttaaagtatttagaaacatacttgaacttacactgggcaaatgttgtttactac
agtccaaatgaagttaaagtggtagcagaaggatttgattcagcaaatgggatcaatatt
tcacctgatgataagtatatctatgttgctgacatattggctcatgaaattcatgttttg
gaaaaacacactaatatgaatttaactcagttgaaggttctccgcatccagaacattcta
tctgagaagcctacagtgactacagtttatgccaacaatgggtctgttctccaaggaagt
tctgtagcctcagtgtatgatgggaagctgctcataggcactttataccacagagccttg
tattgtgaactctaa

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_4|134_aa
MGRLVAVGLLGIALALLGERLLALRTQCLSPKPGSYDYTSIQRSLGKQIFALPTTTIETT
REKERGLGMALILSAARVCHPENRSPLTSRELIVLPGKKSGSFLLKKFGSLDGPGRIKAK
PSFSFIKKESQEKN

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_4|405_bp
atggggcggctggtggctgtgggcttgctggggatcgcgctggcgctcctgggcgagagg
cttctggcactcagaacacagtgtctttccccaaaacctgggtcatatgactacacctca
atacagcggagtctgggaaagcaaatatttgctttgcccaccactacaatagaaacgaca
agggagaaagaaagggggttgggcatggcgttaattttgtcagccgctagggtctgccac
ccagaaaacagaagtccactaacaagcagggaactgattgttctgccagggaaaaagtct
ggcagtttcctactgaagaaatttggaagtcttgatggtccaggaaggataaaggcaaaa
ccttcattttcttttatcaaaaaggaaagccaagaaaaaaattag

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_5|346_aa
MVLQSESSGEPVHLTSASQFFPQLRDLWSFEIDRDDLGYLVEEISKWQSIQEEADHKSLE
NLQLYNTVEKKNPFSGEKFKPAAESCTSNEEPNVNHQDNGENVSRPRDMVPCIPAAEAPA
MANRGQCAAQAIASEGASPKPWQLTCGGCPGRGVLQGQSPHGEHLLGQCKREMWGVETPH
RVPTGALPSGSVRRRPLSSRLHNDRSADSLHCVPGKATDTQCQPMKTARMEVVPCKATEE
CPYMNIRTLEDSSGGNSGVDCSWGKDVKDPRHREGKQLSQESSCLAVRATGLSRTITSSD
EFPAEPGWGIQERIEEAVPVFEAFAIIAGETEPTDGEQWSGTPGLI

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_5|1041_bp
atggttttacagtctgagtcttctggtgagcctgtgcacttaaccagcgcttctcagttt
ttcccccaactaagagatctgtggagctttgaaattgacagagatgatttagggtatctg
gtggaagaaatttctaagtggcaaagcattcaagaggaagcagatcataaaagtttggaa
aatttgcagctttacaatacagtagaaaagaaaaacccattttctggggagaaattcaag
ccagctgcagaaagttgcacaagtaatgaggagccaaatgttaatcaccaagacaatggg
gaaaatgtctccaggcctagagacatggtgccctgcatcccagctgctgaagctccagcc
atggctaataggggccaatgtgcagctcaggccattgcttcagagggtgcaagccccaag
ccttggcagcttacgtgtggaggatgtccaggcagaggtgtgctgcaggggcaaagtcct
catggagaacatctgctagggcagtgcaaaagggaaatgtggggggtggagacccctcac
agagtccccactggggcactgcctagtggatctgtgagaagaaggccactatcctccaga
ctccacaatgatagatctgctgacagcttgcactgtgtgcctggaaaagccacagacact
caatgccagcccatgaaaacagccaggatggaagttgtaccctgcaaagccacagaggaa
tgcccttacatgaacattaggactttagaggacagcagtggtggcaacagtggagttgac
tgcagctggggcaaagacgtgaaggacccaaggcacagagagggtaagcagttaagtcag
gagagcagctgcctggctgtgagagccacaggtttatccaggacaataacaagcagtgat
gagttcccagcggagcctggctggggaattcaggagaggattgaagaagcggttcctgtt
tttgaggcatttgcaatcattgctggagagacagaaccaacagatggggaacaatggtca
gggactccaggcctgatttaa

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_6|223_aa
MDGTTAPVTKSGAAKLVKRNFLEALKSNDFGKLKAILIQRQIDVDTVFEVEDENMVLASY
KQGYWLPSYKLKSSWATGLHLSVLFGHVECLLVLLDHNATINCRPNGKTPLHVACEMANV
DCVKILCDRGAKLNCYSLSGHTALHFCTTPSSILCAKQLVWRVTQVNHMLGNSLVNEVEH
VTQVNHMLGNSLVNEVEHVSQQLLQSRKACSTKNHTTLQETQP

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_6|672_bp
atggacggcaccactgcccctgtcactaaatctggagctgccaagttagttaagagaaat
ttccttgaggcgctaaagtccaatgacttcggaaaattgaaggctattttgatccaaagg
caaatagatgtggacactgtttttgaagtcgaagatgagaatatggttttggcatcttat
aaacaaggttactggttgcctagctataaattgaagtcttcctgggccacaggcctccat
ctctctgtcttgtttggccatgtggaatgtcttctggtgctactggaccacaatgctaca
atcaactgtagacccaatgggaaaacccctcttcacgtggcttgtgaaatggccaatgtg
gattgtgttaagatcctctgtgatcgtggggcaaagctcaattgctactccttaagtgga
cacacagctttgcacttttgtacaactccaagttccattctctgtgccaagcaattggtt
tggagagtgacacaagtcaaccacatgttaggaaattccctggtcaatgaagtggaacat
gtgacacaagtcaaccacatgttaggaaattccctggtcaatgaagtggaacatgtcagt
cagcagttactccagtcccgtaaagcatgttccacgaaaaatcataccacacttcaggaa
acacaaccttaa

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_7|252_aa
MVTKCGKPLEAGKGANVNMKTNNQDEETPLHTAAHFGLSELVAFYVEHGAIVDSVNAHME
TPLAIAAYWALRFKEQEYSTEHHLVCRMLLDYKAEVNARDDDFKSPLHKAAWNCDHVLMH
MMLEAGAEANLMDINGCAAIQYVLKVTSVRPAAQPEICYQLLLNHGAARIYPPQFHKVIQ
ACHSCPKAIEVVVNAYEHIRWNTKWRRAIPDDDLERLRPWIFEVEPKENSQQLCPFAQEE
SELLLFLISPLL

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_7|759_bp
atggtcacgaaatgtgggaagcctctagaagctggaaaaggggcgaatgtgaacatgaag
accaacaaccaagatgaggagacgcccttgcacacggctgcccacttcggcctttcggag
ctggtggccttctacgtggaacacggggccatagtggacagcgtgaatgcccacatggag
acccccctggccatcgccgcctactgggccctccgctttaaggagcaggagtacagcacg
gagcaccacctggtctgccgcatgctgcttgactacaaagccgaagtcaatgcccgagat
gacgactttaaatctcccctccacaaggcagcctggaactgtgaccacgtgctcatgcac
atgatgctggaagctggcgccgaagccaatctcatggatatcaacggctgtgctgccatc
cagtacgtgctgaaggtcacctccgtgcgccctgctgcccagcctgagatctgctaccag
ctcctgttgaaccatggggctgcccgaatataccctccacagttccataaggtgatacag
gcctgccattcttgtcctaaagcaattgaagttgtagtcaatgcctatgaacacatcaga
tggaacacaaagtggagaagagctatccccgatgatgacttggagagactcaggccgtgg
atctttgaggtagaacccaaggaaaatagtcaacaactttgtccatttgcccaggaagag
tctgagttgttgctgttcctcattagtcctctgctttag

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_8|122_aa
MPEPLPSMGSCVAGASPTSAVPCSTAPSPIDHPRAEESGHTMQDWQAAPPAAPVWDPLGE
ASWVPDSGGDLQNLCVDTVYLANLVGTWRTVVSSSGIVNAPISALSKQTTWLYQSAGWGG
AR

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_8|369_bp
atgcctgagcctctcccttccatgggctcctgtgtggcaggagcctccccgacaagcgcc
gtcccctgctccaccgcgcccagtcccatcgaccacccaagggctgaggagagcgggcac
acgatgcaggactggcaggcagctccacctgcagccccggtgtgggatccactgggtgaa
gccagctgggtacctgactctggtggggacttgcagaacctttgtgtggacactgtttat
ctagctaatctagtggggacgtggagaaccgttgtgtctagctcagggattgtaaacgca
ccaatcagcgccctgtcaaaacagaccacttggctctaccaatcagcaggatggggtggg
gccagataa

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_9|131_aa
MLFCKTNKEKKREESNRCNKNDKRDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLD
TYTLPRLNQEELESLNRPITGSEIVAIINSLPTKRVQDQMDSQPNSTRDTRRNWYHSSET
IPINRKRGNPP

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_9|396_bp
atgctgttttgcaagactaataaagaaaaaaagagagaagaatcaaatagatgcaataaa
aatgataaaagggatatcaccactgatcccacagaaatacaaactaccatcagagaatac
tataaacacctctacgcaaataaactagaaaatctagaagaaatggataaattcctcgac
acatacaccctcccaagactaaaccaggaagaacttgaatctctgaatagaccaataaca
ggttctgaaattgtggcaataatcaatagcttaccaacaaaaagagtccaggaccagatg
gattcacagccgaattctaccagagatacaaggaggaactggtaccattcctctgaaact
attccaatcaatagaaaaagagggaatcctccctaa

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_10|415_aa
MKAARFVLRSAGSLNGAGLVPREVEHFSRYSPSPLSMKQLLDFGSENACERTSFAFLRQE
LPVRLANILKEIDILPTQLVNTSSVQLVKSWYIQSLMDLVEFHEKSPDDQKALSDFVDTL
IKVRNRHHNVVPTMAQGIIEYKDACTVDPVTNQNLQYFLDRFYMNRISTRMLMNQHILIF
SDSQTGNPSHIGSIDPNCDVVAVVQGKFPDQPIHIVYVPSHLHHMLFELFKAGPTPNSAV
QVQNSCRWYGSIQSPECAKSTNAMRATVEHQENQPSLTPIEVIVVLGKEDLTIKISDRGG
GVPLRIIDRLFSYTYSTAPTPVMDNSRNAPLAGFGYGLPISRLYAKYFQGDLNLYSLSGY
GTDAIIYLKALSSESIEKLPVFNKSAFKHYQMSSEADDWCIPSREPKNLAKEVAM

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_10|1248_bp
atgaaggcggcccgcttcgtgctgcgcagcgctggctcgctcaacggcgccggcctggtg
ccccgagaggtggagcatttctcgcgctacagcccgtccccgctgtccatgaagcagcta
ctggactttggttcagaaaatgcatgtgaaagaacttcttttgcatttttgcgacaagaa
ttgcctgtgagactcgccaacattctgaaggaaattgatatcctcccgacccaattagta
aatacctcttcagtgcaattggttaaaagctggtatatacagagcctgatggatttggtg
gaattccatgagaaaagcccagatgaccagaaagcattatcagactttgtagatacactc
atcaaagttcgaaatagacaccataatgtagtccctacaatggcacaaggaatcatagag
tataaagatgcctgtacagttgacccagtcaccaatcaaaatcttcaatatttcttggat
cgattttacatgaaccgtatttctactcggatgctgatgaaccagcacattcttatattt
agtgactcacagacaggaaacccaagccacattggaagcattgatcctaactgtgatgtg
gtagcagtggtccaaggaaaatttccagaccaaccaattcacatcgtgtatgttccttct
cacctccatcatatgctctttgaactatttaaggctggtcctactcctaactcagcagtt
caggtccagaattcctgtaggtggtatggctccattcagagtccagaatgtgctaagagc
acaaatgcaatgcgggcaacagttgaacaccaggaaaatcagccttcccttacaccaata
gaggttattgttgtcttgggaaaagaagaccttaccattaagatttcagacagaggaggt
ggtgttcccctgagaattattgaccgcctctttagttatacatactccactgcaccaacg
cctgtgatggataattcccggaatgctcctttggctggttttggttacggcttgccaatt
tctcgtctgtatgcaaagtactttcaaggagatctgaatctctactctttatcaggatat
ggaacagatgctatcatctacttaaaggctttgtcttctgagtctatagaaaaacttcca
gtttttaacaagtcagccttcaaacattatcagatgagctctgaggctgatgactggtgt
atcccaagcagggaaccaaagaacctggcaaaagaagtggccatgtga

>gi568815591f:95385972_95637756|GENSCAN_predicted_peptide_11|118_aa
MGAAVRAACQSHAMCPHSSVLGRSMGPGAVEQGVALVRGPSAPSAAAGLGAKPLTARGWQ
HEPAAPSAGPTVPTSTQNSCWPMRVACSPGSRLCLSLHTSLQAEGAGSDLGQPREGLP

>gi568815591f:95385972_95637756|GENSCAN_predicted_CDS_11|357_bp
atgggggctgcagtcagagctgcctgccagtcccacgccatgtgcccacactcctcagtc
cttgggcggtcgatgggaccgggtgctgtggagcagggggtggcgcttgtcaggggaccc
agcgcaccttctgcagctgctggcctaggtgctaagcccctcactgcccggggctggcag
cacgagccggctgctccgagtgcggggcccaccgtgcccacgtccactcagaactcatgc
tggcccatgagagttgcgtgcagccccggttcccgcctgtgcctctccctccacacctcc
ctgcaagcagagggagctggctccgacctcggccagcccagagaggggctcccatag