GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:31:50 Sequence gi568815586f:10113130_10321849 : 208720 bp : 39.54% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 2585 2580 6 1.05 1.06 Term - 2900 2751 150 2 0 72 48 81 0.036 -0.57 1.05 Intr - 12319 12168 152 2 2 83 111 5 0.414 1.26 1.04 Intr - 14716 14618 99 2 0 93 92 67 0.539 6.76 1.03 Intr - 15370 15194 177 2 0 8 28 157 0.470 0.67 1.02 Intr - 17022 16851 172 0 1 125 71 122 0.998 12.79 1.01 Init - 25850 25773 78 2 0 24 91 75 0.351 2.51 1.00 Prom - 28550 28511 40 -6.45 2.00 Prom + 34244 34283 40 -2.55 2.01 Sngl + 40273 40779 507 0 0 53 48 265 0.883 14.79 2.02 PlyA + 41444 41449 6 1.05 3.07 PlyA - 43126 43121 6 1.05 3.06 Term - 46892 46751 142 1 1 84 38 117 0.833 2.72 3.05 Intr - 47333 47218 116 2 2 64 89 17 0.456 -2.27 3.04 Intr - 47796 47657 140 1 2 86 105 34 0.791 4.16 3.03 Intr - 53828 53565 264 0 0 -31 82 277 0.500 11.76 3.02 Intr - 56046 55945 102 1 0 110 101 -21 0.587 0.63 3.01 Init - 58948 58873 76 1 1 54 74 106 0.881 7.10 3.00 Prom - 61804 61765 40 -7.45 4.00 Prom + 65796 65835 40 -5.45 4.01 Init + 66446 66499 54 1 0 51 105 146 0.994 11.95 4.02 Intr + 69421 69464 44 0 2 74 98 28 0.972 -1.48 4.03 Intr + 73291 73460 170 1 2 83 84 171 0.999 14.87 4.04 Term + 76767 77011 245 1 2 112 45 255 0.983 18.68 4.05 PlyA + 78640 78645 6 1.05 5.00 Prom + 81994 82033 40 -4.55 5.01 Init + 100001 100090 90 1 0 81 86 197 0.958 19.34 5.02 Intr + 104934 105012 79 2 1 120 55 82 0.984 6.31 5.03 Intr + 107311 107570 260 2 2 66 56 187 0.718 9.56 5.04 Term + 108658 108723 66 0 0 121 48 81 0.734 4.36 5.05 PlyA + 109974 109979 6 1.05 6.00 Prom + 111329 111368 40 -8.35 6.01 Init + 111430 111532 103 0 1 60 68 109 0.421 4.57 6.02 Term + 112399 113102 704 2 2 47 49 302 0.396 14.90 6.03 PlyA + 114881 114886 6 1.05 7.00 Prom + 117549 117588 40 -2.35 7.01 Init + 121099 121280 182 0 2 75 -16 193 0.144 4.38 7.02 Intr + 124241 124421 181 2 1 92 56 103 0.167 6.45 7.03 Intr + 138813 139031 219 2 0 100 80 98 0.772 7.78 7.04 Intr + 164011 164034 24 0 0 117 90 20 0.013 2.40 7.05 Intr + 167523 167651 129 2 0 65 78 69 0.014 3.57 7.06 Intr + 188192 188240 49 1 1 62 97 38 0.001 -0.57 7.07 Intr + 194865 194955 91 1 1 90 91 34 0.021 2.03 7.08 Intr + 201544 201644 101 1 2 97 58 103 0.116 7.03 7.09 Term + 205855 205964 110 2 2 78 36 77 0.054 -0.81 7.10 PlyA + 206352 206357 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:10113130_10321849|GENSCAN_predicted_peptide_1|275_aa MSNKVTKVNAMEHRAAMGKIKRDDPQTVISGAERKELPNAISIQGLSRTMEYHPDLENLD EDGYTQLHFDSQSNTRIAVVSEKAISSKNQWDFRDPTNVKTCRCQCGTEIRSIQLSSQCA EITIPAIEEEGILTGGEAQQILGSCAASPPWRLIAVILGILCLVILVIAVVLGTMGVLSS PCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELLIWKYTLINRKHPP FSKRQKTSTLFQETENIHTFPRDRKGEIAASRTSV >gi568815586f:10113130_10321849|GENSCAN_predicted_CDS_1|828_bp atgtccaataaagtaacaaaagtgaatgcaatggaacacagagccgctatgggtaaaata aagagggatgacccacagacagtcatctcaggagcagaaagaaaagagctcccaaatgct atatctattcaggggctctcaagaacaatggaatatcatcctgatttagaaaatttggat gaagatggatatactcaattacacttcgactctcaaagcaataccaggatagctgttgtt tcagagaaagctatttcctctaagaatcagtgggatttcagggatcctactaatgtcaag acctgtagatgtcaatgtggaactgaaattaggagcatacagttatcaagccaatgtgcc gaaattacaattccagcaattgaagaagaaggtatccttactggtggagaagcacagcaa atcctaggatcgtgtgctgcatctcctccttggcgcctcattgctgtaattttgggaatc ctatgcttggtaatactggtgatagctgtggtcctgggtaccatgggggttctttccagc ccttgtcctcctaattggattatatatgagaagagctgttatctattcagcatgtcacta aattcctgggatggaagtaaaagacaatgctggcaactgggctctaatctcctaaagata gacagctcaaatgaattgcttatatggaaatatacgttaataaacagaaaacatccaccc ttttccaagagacagaaaacatccacacttttccaagagacagaaaacatccacactttt ccaagagacagaaaaggtgaaatagctgctagccgaacctcagtctga >gi568815586f:10113130_10321849|GENSCAN_predicted_peptide_2|168_aa MAHKSNGQRCGFLRFSELATTSCGVAQRGELTPPAPLSSLRRVLSSHYRHSRISFARLSG LGSSLRRTPMAGRPSAPAPNLGISLPRFSETRDSLPAGAQAMNLGPALRRCTPQPRALAP ASTSAILHPRFRHGLRRGLGTAPKRLGKNSVRHVGLQSTQDGQWSLHR >gi568815586f:10113130_10321849|GENSCAN_predicted_CDS_2|507_bp atggcgcataagagtaacggccagcgctgcgggtttttacgtttcagcgagctcgcaaca acgtcctgtggtgtggcccagcgcggggagctgaccccaccagctccgctctcaagcctt cggagagtcctctccagtcactatcgtcattcccgcatttcctttgctaggctgtccggg ctgggcagctccctcaggcggacacccatggctggcagaccttccgcgcctgcccctaat ctgggcatatcactccctcggttttctgagactcgggactccttgcccgctggagcgcag gccatgaatctgggtcccgcactgcggcgctgcacgccgcagcccagggctttggcccca gccagcacatccgccatcctgcaccccaggttccggcacgggctgcgacggggcctcgga actgctcccaagcggctgggaaagaactcagtcaggcatgtcggcctgcaaagcacccag gatgggcagtggagtctgcaccgttga >gi568815586f:10113130_10321849|GENSCAN_predicted_peptide_3|279_aa MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLVVTIMVLGMQL SQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLARKLNEKSKEQ MELHHQNLNLQETLKRVANCSGIGRRVAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLS LDAKLLKINSTADLDFIQQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAV SQTYPSGTCAYIQRGAVYAENCILAAFSICQKKANLRAQ >gi568815586f:10113130_10321849|GENSCAN_predicted_CDS_3|840_bp atgacttttgatgacctaaagatccagactgtgaaggaccagcctgatgagaagtcaaat ggaaaaaaagctaaaggtcttcagtttctttactctccatggtggtgcctggctgctgcg actctaggggtcctttgcctgggattagtagtgaccattatggtgctgggcatgcaatta tcccaggtgtctgacctcctaacacaagagcaagcaaacctaactcaccagaaaaagaaa ctggagggacagatctcagcccggcaacaagcagaagaagcttcacaggagtcagaaaac gaactcaaggaaatgatagaaacccttgctcggaagctgaatgagaaatccaaagagcaa atggaacttcaccaccagaatctgaatctccaagaaacactgaagagagtagcaaattgt tcaggtattgggagaagggtggctccttgtccgcaagactggatctggcatggagaaaac tgttacctattttcctcgggctcatttaactgggaaaagagccaagagaagtgcttgtct ttggatgccaagttgctgaaaattaatagcacagctgatctggacttcatccagcaagca atttcctattccagttttccattctggatggggctgtctcggaggaaccccagctaccca tggctctgggaggacggttctcctttgatgccccacttatttagagtccgaggcgctgtc tcccagacatacccttcaggtacctgtgcatatatacaacgaggagctgtttatgcggaa aactgcattttagctgccttcagtatatgtcagaagaaggcaaacctaagagcacagtga >gi568815586f:10113130_10321849|GENSCAN_predicted_peptide_4|170_aa MGVRVHVVAASALLYFILLSGTRCEENCGNPEQLLVVIGALLLLCGLTSLCFRCCCLSRQ QNGEDGGPPPCEVTVIAFDHDSTLQSTITSLQSVFGPAARRILAVAHSHSSLGQLPSSLD TLPGYEEALHMSRFTVAMCGQKAPDLPPVPEEKQLPPTEKESTRIVDSWN >gi568815586f:10113130_10321849|GENSCAN_predicted_CDS_4|513_bp atgggagtccgagttcatgtcgtggcggcctcagccctgctgtatttcatcctgctttct gggacgagatgtgaggaaaactgtggtaatcctgaacagttgctagtggtaattggcgcg ctgcttctcctgtgtggcctgacgtccctgtgcttccgctgctgctgtctgagccgccag caaaatggggaagatgggggcccaccaccctgtgaagtgaccgtcattgctttcgatcac gacagcactctccagagcactatcacatctctgcagtcggtgtttggccctgcagctcgg aggatcctggctgtggctcactcccacagctccctgggccagctgccctcctctttggac accctcccagggtatgaagaagctcttcacatgagtcgcttcacagtagccatgtgcggg cagaaagcacctgatctacccccagtacctgaagaaaagcagctgcctccaacagagaag gagtcgactcgaatagttgactcttggaactga >gi568815586f:10113130_10321849|GENSCAN_predicted_peptide_5|164_aa MKFQYKEDHPFEYRKKEGEKIRKKYPDRVPVIVEKAPKARVPDLDKRKYLVPSDLTVGQF YFLIRKRIHLRPEDALFFFVNNTIPPTSATMGQLYEVMVLVAQYWMPSSAVWHPLALVLD ALITHLRSGAEGVIYPDPLTYGSDNHEEDYFLYVAYSDESVYGK >gi568815586f:10113130_10321849|GENSCAN_predicted_CDS_5|495_bp atgaagttccagtacaaggaggaccatccctttgagtatcggaaaaaggaaggagaaaag atccggaagaaatatccggacagggtccccgtgattgtagagaaggctccaaaagccagg gtgcctgatctggacaagaggaagtacctagtgccctctgaccttactgttggccagttc tacttcttaatccggaagagaatccacctgagacctgaggacgccttattcttctttgtc aacaacaccatccctcccaccagtgctaccatgggccaactgtatgaggtaatggttctg gttgcacaatactggatgccgtccagtgcagtctggcatcctctagcccttgttctagat gcgttgataacacatctgagaagtggggcagaaggtgttatttatccggatcctcttaca tatggcagtgacaatcatgaggaagactattttctgtatgtggcctacagtgatgagagt gtctatgggaaatga >gi568815586f:10113130_10321849|GENSCAN_predicted_peptide_6|268_aa MKPRTLAVSVTALKVACLESVPSDVQMCSEFLPSDSGAQLASPSGSHTGAAGGAACQSRA VRSHSSALGLFVPSRSMGLGAVEQGVVLVGEARAAQVPMEWVGGSGMAGCRSRALPRGKA AKARREIERSAGGPALLGDPVHPPQPLARVLSPPLPRASRAGWLAAPSAGPAKSTPTRNS SWRASAPRSPGSARASPSTPPSKLREWAPALASPERGSHSAVGGLKGSSNATKVGAQAGE VPRASEGSEDCQHAVTSQQAGLDLALLY >gi568815586f:10113130_10321849|GENSCAN_predicted_CDS_6|807_bp atgaagccgcggaccctcgcggtgagtgttacagctcttaaggtggcgtgtctggagtct gtcccttctgatgttcagatgtgttcggagtttcttccttctgactcgggagcccagctg gcttcacccagtggatcccacaccggggctgcaggtggagctgcctgccagtcccgcgcc gtgcgctcgcattcctcagcccttgggttgtttgttccttcccggtcgatgggactgggc gccgtggagcagggggtggtgctcgtcggggaggctcgggccgcacaggtgcccatggag tgggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccccgcgggaaggca gctaaggcccggcgagaaatcgagcgcagcgccggtgggccagcactgctgggggaccca gtacaccctccgcagccactggcccgggtgctaagtcccccattgccccgggccagcagg gctggctggctggctgctccgagtgcggggcccgccaagtccacgcccacccggaactcc agctggcgcgcaagcgccccacgcagccccggttccgctcgtgcctctccctccacacct ccctccaagctgagggagtgggctccagccttggccagcccagaaaggggctcccacagt gcagtgggggggctgaagggctcctcaaatgccaccaaagtgggagcccaggcaggggag gtgccgagagcaagcgagggctctgaggactgccagcatgctgtcacttctcagcaggct ggattggatttggcacttctttactga >gi568815586f:10113130_10321849|GENSCAN_predicted_peptide_7|361_aa MALGISAPVALQGTAPLLAVLSGCSFPKHMLQTVNGSPFWGLENGGPLLRARLGSAPVET LELFSSLNKILHSYHSSVVKCDLILLGRWTKAWDPLSAGGGCHTGPLPLQVEGNHPTGSY RVPNRPQYRSVAWGLGTSGLVNYTFLLNSGETTYQFLRGNKDFLKNHIKLNYCFLLIEVD NLTLVFVIEKTLGQIFDIPKVELLFSYQCFPMVENRQKPEGEEDCVIQLSELSCTECSKK AWRMEVLHTNKTTNATQCGGPAQLQQFNAVLSEKVHIVPSLLRSWNIISHGRFPSFETFN TKNCIAYNPNGNALDESCEDKNRYIWVEKLFSKSSIAYIFNFVDQSLLKLLNLAMVVLKK T >gi568815586f:10113130_10321849|GENSCAN_predicted_CDS_7|1086_bp atggccttgggaatctctgcccctgtggctttgcagggtacagccccactcctggctgtg cttagtggctgcagctttcccaagcacatgttgcaaactgtcaatggatcaccattctgg ggtttggagaatggtggccctcttctcagagctcgactaggcagtgctccagtggagact ctagaactgttttcatcgctcaataaaattctccactcttaccattcttcagttgtcaag tgtgacctcattcttcttggacgctggacaaaagcttgggacccactaagtgcaggtgga ggctgtcacactggccctttgcccttgcaagtggagggcaaccaccccactggcagctac agggttcctaacaggccacagtaccggtctgtggcctggggcttggggacctctggtctg gtcaattacacatttctcctgaattctggagagacaacataccagttcctcagaggaaac aaagattttcttaaaaatcacatcaaattaaattactgctttttgcttattgaagtggat aatcttactcttgtttttgtcattgaaaagacactaggccagatatttgatattccaaag gtagagcttctcttctcctaccaatgctttccaatggttgaaaacagacagaagccagag ggtgaggaagactgtgtgatacagttgtcagagctcagctgcacagaatgcagcaaaaaa gcatggagaatggaggttctgcataccaacaaaaccaccaatgccacccagtgtggaggg cccgctcagcttcaacaattcaacgctgttctttctgaaaaagtacacatcgtgccttct ctacttcgctcttggaacataatttctcatggcagatttccatcatttgaaacttttaat acaaagaactgcatagcgtataatccaaatggaaatgctttagatgaatcctgtgaagat aaaaatcgttatatctgggttgaaaaacttttctcaaaaagctctatagcatatattttc aactttgtggaccagtctctgttgaaacttctcaaccttgccatggtggtgctaaagaag acataa