GENSCAN 1.0    Date run: 6-Nov-116    Time: 22:43:55

Sequence gi568815586r:10059883_10272077 : 212195 bp : 39.76% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4917   4971   55  2  1   75   95    77 0.461   6.86
 1.02 Term +   8227   8447  221  2  2  115   37    71 0.340   0.92
 1.03 PlyA +   8653   8658    6                               1.05

 2.07 PlyA -   9652   9647    6                               1.05
 2.06 Term -  11631  11451  181  2  1   85   37   140 0.348   4.70
 2.05 Intr -  13477  13411   67  2  1   86   67    53 0.717   0.04
 2.04 Intr -  21531  21355  177  1  0    2  107   136 0.193   5.87
 2.03 Intr -  29340  29242   99  1  0  117   75    72 0.466   7.96
 2.02 Intr -  39106  38926  181  1  1   41   80   213 0.509  14.32
 2.01 Init -  39459  39391   69  0  0   44   87    26 0.424  -0.80
 2.00 Prom -  40054  40015   40                              -9.75

 3.00 Prom +  45431  45470   40                              -6.15
 3.01 Sngl +  46882  47292  411  0  0   32   42   226 0.161   8.44
 3.02 PlyA +  47393  47398    6                               1.05

 4.00 Prom +  48639  48678   40                              -3.65
 4.01 Init +  48816  48895   80  2  2   83   47    53 0.790   1.28
 4.02 Term +  51809  52424  616  2  1   70   41   822 0.758  68.65
 4.03 PlyA +  53097  53102    6                               1.05

 5.07 PlyA -  55832  55827    6                               1.05
 5.06 Term -  56147  55998  150  2  0   72   48    81 0.055  -0.57
 5.05 Intr -  65566  65415  152  2  2   83  111     5 0.479   1.26
 5.04 Intr -  67963  67865   99  2  0   93   92    67 0.538   6.76
 5.03 Intr -  68617  68441  177  2  0    8   28   157 0.469   0.67
 5.02 Intr -  70269  70098  172  0  1  125   71   122 0.998  12.79
 5.01 Init -  79097  79020   78  2  0   24   91    75 0.351   2.51
 5.00 Prom -  81797  81758   40                              -6.45

 6.00 Prom +  87491  87530   40                              -2.55
 6.01 Sngl +  93520  94026  507  0  0   53   48   265 0.883  14.79
 6.02 PlyA +  94691  94696    6                               1.05

 7.07 PlyA -  96373  96368    6                               1.05
 7.06 Term - 100139  99998  142  1  1   84   38   117 0.833   2.72
 7.05 Intr - 100580 100465  116  2  2   64   89    17 0.456  -2.27
 7.04 Intr - 101043 100904  140  1  2   86  105    34 0.791   4.16
 7.03 Intr - 107075 106812  264  0  0  -31   82   277 0.500  11.76
 7.02 Intr - 109293 109192  102  1  0  110  101   -21 0.587   0.63
 7.01 Init - 112195 112120   76  1  1   54   74   106 0.881   7.10
 7.00 Prom - 115051 115012   40                              -7.45

 8.00 Prom + 119043 119082   40                              -5.45
 8.01 Init + 119693 119746   54  1  0   51  105   146 0.994  11.95
 8.02 Intr + 122668 122711   44  0  2   74   98    28 0.972  -1.48
 8.03 Intr + 126538 126707  170  1  2   83   84   171 0.999  14.87
 8.04 Term + 130014 130258  245  1  2  112   45   255 0.983  18.68
 8.05 PlyA + 131887 131892    6                               1.05

 9.00 Prom + 135241 135280   40                              -4.55
 9.01 Init + 153248 153337   90  1  0   81   86   197 0.958  19.34
 9.02 Intr + 158181 158259   79  2  1  120   55    82 0.984   6.31
 9.03 Intr + 160558 160817  260  2  2   66   56   187 0.718   9.56
 9.04 Term + 161905 161970   66  0  0  121   48    81 0.734   4.36
 9.05 PlyA + 163221 163226    6                               1.05

10.00 Prom + 164576 164615   40                              -8.35
10.01 Init + 164677 164779  103  0  1   60   68   109 0.421   4.57
10.02 Term + 165646 166349  704  2  2   47   49   302 0.396  14.90
10.03 PlyA + 168128 168133    6                               1.05

11.00 Prom + 170796 170835   40                              -2.35
11.01 Init + 174346 174527  182  0  2   75  -16   193 0.147   4.38
11.02 Intr + 177488 177668  181  2  1   92   56   103 0.172   6.45
11.03 Intr + 192060 192278  219  2  0  100   80    98 0.803   7.78
11.04 Term + 199001 199105  105  1  0   51   39   104 0.674  -0.77
11.05 PlyA + 200152 200157    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_1|91_aa
MDTADAGFGKMAPLLLLAWLMTCFLLINPESWFNILRKRKSLLCTGDLSSTRKGWAPRLT
VPAISSELHKQEPRKKKVSSSRKTSSYSAPL

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_1|276_bp
atggacacagcggacgctggctttggcaagatggctcctctccttctcctggcctggctc
atgacctgtttcctgttgataaatcctgaaagctggttcaatatcctcaggaaaagaaag
agcttattgtgtaccggagatctcagctctaccaggaaaggctgggctcctcgtttgact
gttcctgcaatttcttcagaattacataaacaagaacctagaaagaaaaaagtctcctca
tccagaaaaaccagttcatacagtgcccctctctga

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_2|257_aa
MCVLPSYYYSADSEGTTVLHRAILTVARRPRAIRPHFTLTAVGIQMQAKYSSTRDMLDDD
GDTTMSLHSQGSATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLLIGLAALGLLFFQY
YQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCRELYNKAGGLLRP
DSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFSKDCKELKRCVCERRAGM
VKPESLHVPPETLGEGD

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_2|774_bp
atgtgtgtccttccaagttattattattctgcagactctgaaggaaccacagtgctacac
cgtgctattctcacagtagcccggcggcccagggcaatccgaccacatttcactctcacc
gctgtaggaatccagatgcaggccaagtacagcagcacgagggacatgctggatgatgat
ggggacaccaccatgagcctgcattctcaaggctctgccacaactcggcatccagagccc
cggcgcacagagcacagggctccctcttcaacgtggcgaccagtggccctgaccctgctg
actttgtgcttggtgctgctgatagggctggcagccctggggcttttgttttttcagtac
taccagctctccaatactggtcaagacaccatttctcaaatggaagaaagattaggaaat
acgtcccaagagttgcaatctcttcaagtccagaatataaagcttgcaggaagtctgcag
catgtggctgaaaaactctgtcgtgagctgtataacaaagctggagggcttttgcgccct
gacagtggcaaggcctggctgtggatggatggaacccctttcacttctgaactgttccat
attataatagatgtcaccagcccaagaagcagagactgtgtggccatccttaatgggatg
atcttctcaaaggactgcaaagaattgaagcgttgtgtctgtgagagaagggcaggaatg
gtgaagccagagagcctccatgtcccccctgaaacattaggcgaaggtgactga

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_3|136_aa
MMISIDAEKAFNKIQQPFMLKTLNKLDIDGTYLKIIRAFYDKPTANIILNGQQQEAFPLK
TGTRQGCLLSPLLFNIVLEVLARTIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS
AENLLKLISNFRKVSG

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_3|411_bp
atgatgatctcaatagatgcagaaaaggccttcaacaaaattcaacagcccttcatgcta
aaaactctcaataaactagatattgatgggacgtatctcaaaataataagagctttttat
gacaaacccacagccaacatcatactgaatgggcaacaacaggaagcattccctttgaaa
actggcacaagacaaggatgccttctctcaccactcctattcaacatagtgttggaagtt
ctggccaggacaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaa
gtcaaattgtccctctttgcagatgacatgattgtatatttagaaaaccccatcgtctca
gccgaaaatctcctcaagctgataagcaacttcagaaaagtctcaggataa

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_4|231_aa
MDEAGNHHSEQTITRTENQTPHVLTRRRLAPTRTQAAPRASRRSDRSALERESKSKIAED
EQINASKNEEDAAKMLVGGLSWDTSKKDLKDYFPKFGEVIDCTIKMDPNIGRARGFGFIL
FKDATNVEKVLDQKEYRLDVRVIDPKKAMAMKKDPVKKIFAEGLNPEATEEKIREYFGEF
GEIEAIEIPVDPKLNKRQGFVFITFKKEPVKKVLEKKFHTISGSKREITVA

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_4|696_bp
atggatgaagctggaaaccatcattctgagcaaactatcacaaggacagaaaaccaaaca
ccacatgttctcactcgtagacgcctggcgccaacgagaacccaggccgccccgagggcg
agtcgtcggagcgaccgcagcgccctcgagcgggaatcaaaatcaaaaatcgccgaggac
gaacagatcaacgccagcaagaacgaggaggacgcagcaaaaatgctcgttggtggcctg
agctgggataccagcaaaaaagatttaaaagactatttccctaaatttggagaggtcatt
gactgtacaataaaaatggatcccaacattggacgggcaagagggtttgggtttatcctg
ttcaaagatgcaaccaatgtggagaaggtcctagaccagaaggagtacaggctggatgtc
cgtgtcattgaccctaaaaaggccatggctatgaagaaggacccggtgaagaaaatcttc
gccgagggtctgaatcctgaagccactgaggaaaagatcagggagtactttggcgagttt
ggggagatcgaggccattgaaattccagtggatccaaagttgaacaaaagacaaggtttt
gtgtttatcacctttaaaaaagaacctgtgaagaaagttctggagaaaaagttccatact
atcagtggaagtaagcgtgagatcacggtggcctag

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_5|275_aa
MSNKVTKVNAMEHRAAMGKIKRDDPQTVISGAERKELPNAISIQGLSRTMEYHPDLENLD
EDGYTQLHFDSQSNTRIAVVSEKAISSKNQWDFRDPTNVKTCRCQCGTEIRSIQLSSQCA
EITIPAIEEEGILTGGEAQQILGSCAASPPWRLIAVILGILCLVILVIAVVLGTMGVLSS
PCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELLIWKYTLINRKHPP
FSKRQKTSTLFQETENIHTFPRDRKGEIAASRTSV

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_5|828_bp
atgtccaataaagtaacaaaagtgaatgcaatggaacacagagccgctatgggtaaaata
aagagggatgacccacagacagtcatctcaggagcagaaagaaaagagctcccaaatgct
atatctattcaggggctctcaagaacaatggaatatcatcctgatttagaaaatttggat
gaagatggatatactcaattacacttcgactctcaaagcaataccaggatagctgttgtt
tcagagaaagctatttcctctaagaatcagtgggatttcagggatcctactaatgtcaag
acctgtagatgtcaatgtggaactgaaattaggagcatacagttatcaagccaatgtgcc
gaaattacaattccagcaattgaagaagaaggtatccttactggtggagaagcacagcaa
atcctaggatcgtgtgctgcatctcctccttggcgcctcattgctgtaattttgggaatc
ctatgcttggtaatactggtgatagctgtggtcctgggtaccatgggggttctttccagc
ccttgtcctcctaattggattatatatgagaagagctgttatctattcagcatgtcacta
aattcctgggatggaagtaaaagacaatgctggcaactgggctctaatctcctaaagata
gacagctcaaatgaattgcttatatggaaatatacgttaataaacagaaaacatccaccc
ttttccaagagacagaaaacatccacacttttccaagagacagaaaacatccacactttt
ccaagagacagaaaaggtgaaatagctgctagccgaacctcagtctga

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_6|168_aa
MAHKSNGQRCGFLRFSELATTSCGVAQRGELTPPAPLSSLRRVLSSHYRHSRISFARLSG
LGSSLRRTPMAGRPSAPAPNLGISLPRFSETRDSLPAGAQAMNLGPALRRCTPQPRALAP
ASTSAILHPRFRHGLRRGLGTAPKRLGKNSVRHVGLQSTQDGQWSLHR

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_6|507_bp
atggcgcataagagtaacggccagcgctgcgggtttttacgtttcagcgagctcgcaaca
acgtcctgtggtgtggcccagcgcggggagctgaccccaccagctccgctctcaagcctt
cggagagtcctctccagtcactatcgtcattcccgcatttcctttgctaggctgtccggg
ctgggcagctccctcaggcggacacccatggctggcagaccttccgcgcctgcccctaat
ctgggcatatcactccctcggttttctgagactcgggactccttgcccgctggagcgcag
gccatgaatctgggtcccgcactgcggcgctgcacgccgcagcccagggctttggcccca
gccagcacatccgccatcctgcaccccaggttccggcacgggctgcgacggggcctcgga
actgctcccaagcggctgggaaagaactcagtcaggcatgtcggcctgcaaagcacccag
gatgggcagtggagtctgcaccgttga

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_7|279_aa
MTFDDLKIQTVKDQPDEKSNGKKAKGLQFLYSPWWCLAAATLGVLCLGLVVTIMVLGMQL
SQVSDLLTQEQANLTHQKKKLEGQISARQQAEEASQESENELKEMIETLARKLNEKSKEQ
MELHHQNLNLQETLKRVANCSGIGRRVAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLS
LDAKLLKINSTADLDFIQQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAV
SQTYPSGTCAYIQRGAVYAENCILAAFSICQKKANLRAQ

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_7|840_bp
atgacttttgatgacctaaagatccagactgtgaaggaccagcctgatgagaagtcaaat
ggaaaaaaagctaaaggtcttcagtttctttactctccatggtggtgcctggctgctgcg
actctaggggtcctttgcctgggattagtagtgaccattatggtgctgggcatgcaatta
tcccaggtgtctgacctcctaacacaagagcaagcaaacctaactcaccagaaaaagaaa
ctggagggacagatctcagcccggcaacaagcagaagaagcttcacaggagtcagaaaac
gaactcaaggaaatgatagaaacccttgctcggaagctgaatgagaaatccaaagagcaa
atggaacttcaccaccagaatctgaatctccaagaaacactgaagagagtagcaaattgt
tcaggtattgggagaagggtggctccttgtccgcaagactggatctggcatggagaaaac
tgttacctattttcctcgggctcatttaactgggaaaagagccaagagaagtgcttgtct
ttggatgccaagttgctgaaaattaatagcacagctgatctggacttcatccagcaagca
atttcctattccagttttccattctggatggggctgtctcggaggaaccccagctaccca
tggctctgggaggacggttctcctttgatgccccacttatttagagtccgaggcgctgtc
tcccagacatacccttcaggtacctgtgcatatatacaacgaggagctgtttatgcggaa
aactgcattttagctgccttcagtatatgtcagaagaaggcaaacctaagagcacagtga

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_8|170_aa
MGVRVHVVAASALLYFILLSGTRCEENCGNPEQLLVVIGALLLLCGLTSLCFRCCCLSRQ
QNGEDGGPPPCEVTVIAFDHDSTLQSTITSLQSVFGPAARRILAVAHSHSSLGQLPSSLD
TLPGYEEALHMSRFTVAMCGQKAPDLPPVPEEKQLPPTEKESTRIVDSWN

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_8|513_bp
atgggagtccgagttcatgtcgtggcggcctcagccctgctgtatttcatcctgctttct
gggacgagatgtgaggaaaactgtggtaatcctgaacagttgctagtggtaattggcgcg
ctgcttctcctgtgtggcctgacgtccctgtgcttccgctgctgctgtctgagccgccag
caaaatggggaagatgggggcccaccaccctgtgaagtgaccgtcattgctttcgatcac
gacagcactctccagagcactatcacatctctgcagtcggtgtttggccctgcagctcgg
aggatcctggctgtggctcactcccacagctccctgggccagctgccctcctctttggac
accctcccagggtatgaagaagctcttcacatgagtcgcttcacagtagccatgtgcggg
cagaaagcacctgatctacccccagtacctgaagaaaagcagctgcctccaacagagaag
gagtcgactcgaatagttgactcttggaactga

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_9|164_aa
MKFQYKEDHPFEYRKKEGEKIRKKYPDRVPVIVEKAPKARVPDLDKRKYLVPSDLTVGQF
YFLIRKRIHLRPEDALFFFVNNTIPPTSATMGQLYEVMVLVAQYWMPSSAVWHPLALVLD
ALITHLRSGAEGVIYPDPLTYGSDNHEEDYFLYVAYSDESVYGK

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_9|495_bp
atgaagttccagtacaaggaggaccatccctttgagtatcggaaaaaggaaggagaaaag
atccggaagaaatatccggacagggtccccgtgattgtagagaaggctccaaaagccagg
gtgcctgatctggacaagaggaagtacctagtgccctctgaccttactgttggccagttc
tacttcttaatccggaagagaatccacctgagacctgaggacgccttattcttctttgtc
aacaacaccatccctcccaccagtgctaccatgggccaactgtatgaggtaatggttctg
gttgcacaatactggatgccgtccagtgcagtctggcatcctctagcccttgttctagat
gcgttgataacacatctgagaagtggggcagaaggtgttatttatccggatcctcttaca
tatggcagtgacaatcatgaggaagactattttctgtatgtggcctacagtgatgagagt
gtctatgggaaatga

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_10|268_aa
MKPRTLAVSVTALKVACLESVPSDVQMCSEFLPSDSGAQLASPSGSHTGAAGGAACQSRA
VRSHSSALGLFVPSRSMGLGAVEQGVVLVGEARAAQVPMEWVGGSGMAGCRSRALPRGKA
AKARREIERSAGGPALLGDPVHPPQPLARVLSPPLPRASRAGWLAAPSAGPAKSTPTRNS
SWRASAPRSPGSARASPSTPPSKLREWAPALASPERGSHSAVGGLKGSSNATKVGAQAGE
VPRASEGSEDCQHAVTSQQAGLDLALLY

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_10|807_bp
atgaagccgcggaccctcgcggtgagtgttacagctcttaaggtggcgtgtctggagtct
gtcccttctgatgttcagatgtgttcggagtttcttccttctgactcgggagcccagctg
gcttcacccagtggatcccacaccggggctgcaggtggagctgcctgccagtcccgcgcc
gtgcgctcgcattcctcagcccttgggttgtttgttccttcccggtcgatgggactgggc
gccgtggagcagggggtggtgctcgtcggggaggctcgggccgcacaggtgcccatggag
tgggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccccgcgggaaggca
gctaaggcccggcgagaaatcgagcgcagcgccggtgggccagcactgctgggggaccca
gtacaccctccgcagccactggcccgggtgctaagtcccccattgccccgggccagcagg
gctggctggctggctgctccgagtgcggggcccgccaagtccacgcccacccggaactcc
agctggcgcgcaagcgccccacgcagccccggttccgctcgtgcctctccctccacacct
ccctccaagctgagggagtgggctccagccttggccagcccagaaaggggctcccacagt
gcagtgggggggctgaagggctcctcaaatgccaccaaagtgggagcccaggcaggggag
gtgccgagagcaagcgagggctctgaggactgccagcatgctgtcacttctcagcaggct
ggattggatttggcacttctttactga

>gi568815586r:10059883_10272077|GENSCAN_predicted_peptide_11|228_aa
MALGISAPVALQGTAPLLAVLSGCSFPKHMLQTVNGSPFWGLENGGPLLRARLGSAPVET
LELFSSLNKILHSYHSSVVKCDLILLGRWTKAWDPLSAGGGCHTGPLPLQVEGNHPTGSY
RVPNRPQYRSVAWGLGTSGLVNYTFLLNSGETTYQFLRGNKDFLKNHIKLNYCFLLIEVD
NLTLVFVIEKTLGQIYTPNLGKRESVRPTKKPKKLRSFVAKGAMSRDK

>gi568815586r:10059883_10272077|GENSCAN_predicted_CDS_11|687_bp
atggccttgggaatctctgcccctgtggctttgcagggtacagccccactcctggctgtg
cttagtggctgcagctttcccaagcacatgttgcaaactgtcaatggatcaccattctgg
ggtttggagaatggtggccctcttctcagagctcgactaggcagtgctccagtggagact
ctagaactgttttcatcgctcaataaaattctccactcttaccattcttcagttgtcaag
tgtgacctcattcttcttggacgctggacaaaagcttgggacccactaagtgcaggtgga
ggctgtcacactggccctttgcccttgcaagtggagggcaaccaccccactggcagctac
agggttcctaacaggccacagtaccggtctgtggcctggggcttggggacctctggtctg
gtcaattacacatttctcctgaattctggagagacaacataccagttcctcagaggaaac
aaagattttcttaaaaatcacatcaaattaaattactgctttttgcttattgaagtggat
aatcttactcttgtttttgtcattgaaaagacactaggccagatatatacaccaaacctt
gggaaaagggaatctgtaagacctacgaagaagccaaagaaacttagatcatttgtagca
aaaggagcaatgagcagagataaatag