GENSCAN 1.0    Date run: 6-Nov-116    Time: 15:03:45

Sequence gi568815587r:86851145_87055085 : 203941 bp : 42.03% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6008   6081   74  1  2   81   86   109 0.894  10.59
 1.02 Intr +   6486   6765  280  0  1  -32   88   263 0.450  10.86
 1.03 Intr +   9629   9783  155  1  2  101   84   100 0.799   8.85
 1.04 Term +  14532  14661  130  0  1    8   47   140 0.229  -1.43
 1.05 PlyA +  16343  16348    6                               1.05

 2.03 PlyA -  17388  17383    6                               1.05
 2.02 Term -  26709  26607  103  2  1  136   34    63 0.471   2.47
 2.01 Init -  29122  28752  371  1  2   68   22   152 0.227   3.11
 2.00 Prom -  31543  31504   40                              -5.75

 3.00 Prom +  32382  32421   40                              -2.05
 3.01 Init +  34802  34857   56  1  2   79   92   112 0.999  11.51
 3.02 Intr +  35433  35518   86  0  2   67   80    81 0.063   3.74
 3.03 Intr +  54172  54294  123  2  0  101   34   108 0.229   6.34
 3.04 Intr +  68733  68894  162  1  0   54   65    83 0.019   1.63
 3.05 Term +  81202  81311  110  2  2   43   54   114 0.189   1.19
 3.06 PlyA +  83836  83841    6                               1.05

 4.05 PlyA -  84105  84100    6                               1.05
 4.04 Term -  87073  86939  135  1  0   98   48   159 0.848   9.94
 4.03 Intr -  97850  97719  132  2  0   88   54    57 0.339   2.22
 4.02 Intr - 101326 100064 1263  1  0   67   98   661 0.113  51.36
 4.01 Init - 103941 103657  285  0  0   93   80   473 0.471  42.12
 4.00 Prom - 110777 110738   40                              -4.75

 5.02 PlyA - 110862 110857    6                               1.05
 5.01 Sngl - 113374 112721  654  1  0   39   48   246 0.809  11.62
 5.00 Prom - 113535 113496   40                              -9.85

 6.02 PlyA - 113720 113715    6                               1.05
 6.01 Sngl - 114550 113849  702  1  0   49   37   310 0.801  17.96
 6.00 Prom - 114643 114604   40                              -5.95

 7.02 PlyA - 114812 114807    6                               1.05
 7.01 Sngl - 116046 115744  303  0  0   88   54   313 0.985  23.48
 7.00 Prom - 126070 126031   40                              -5.45

 8.00 Prom + 126655 126694   40                              -7.55
 8.01 Init + 131608 131624   17  0  2   63  111    -6 0.495  -0.91
 8.02 Intr + 131940 132158  219  0  0   69   75   202 0.546  13.50
 8.03 Intr + 161895 162060  166  0  1  119   42    56 0.294   3.14
 8.04 Intr + 163549 163593   45  0  0  122   77    39 0.300   3.89
 8.05 Term + 186857 187078  222  1  0   99   34   244 0.126  16.03
 8.06 PlyA + 188296 188301    6                               1.05

 9.00 Prom + 196460 196499   40                              -7.45
 9.01 Sngl + 197064 198080 1017  2  0   88   43   698 0.956  62.07
 9.02 PlyA + 198308 198313    6                               1.05

10.00 Prom + 198477 198516   40                              -9.55
10.01 Sngl + 199426 200400  975  0  0   70   42   301 0.949  20.21
10.02 PlyA + 200432 200437    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  35089  35221  133  1  1   88   43   111 0.865   3.18

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_1|212_aa
MGMTTAYITDATTPFLGDDDTAEVRWADGHKSVIGHQSQEHVFYTCKKDKERHLCQAARI
RDDSAMRLHVHNHLGNCGRGETNVSPGQVGEEEVHGGVEVGVRAASQDDEQVPQHRDQGG
EDNNTALSIAGNVRHPVIFFLISRGRENDITPNIVEVYSPPVILFLISVGWPIPGFGPTE
DSVDSAASSGPYSSPPNSDKVFSCSLETSRPD

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_1|639_bp
atggggatgaccacagcgtacatcactgatgccaccactccattcctgggagatgatgac
acagctgaagtcagatgggcagatggccacaaatcggtcataggccatcagagtcaggag
catgtcttctatacatgcaaaaaggacaaagaaagacatctgtgtcaggcagcccgcata
agagatgactctgctatgcgactgcatgtccacaatcatcttgggaactgtggccgaggt
gaaaccaatgtcagcccaggacaggttggagaggaagaagtacatgggggtgtggaagtg
ggggtcagagctgccagccaggatgatgaacaggttcctcagcatcgtgaccagggcggg
gaagacaataatactgccctcagtatcgcgggaaatgtacgtcaccctgtgatatttttc
ctaatatccagagggagagagaatgatattactcccaatatcgtagaagtgtacagcccc
cctgtgatattgttcttaatatccgtgggatggccaattcctggctttggtcccactgag
gactccgtggacagtgctgccagctcaggtccctactcctccccacctaattctgacaag
gtgttctcatgctccttggaaacatcaagaccagactaa

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_2|157_aa
MTLNEHAAFKHLFNKAHLTPPLIHLTLSGHSTCFREHRVGGKVTDQQDHKAEEFFLVQNK
MKSLPCLPLSTQTRQPSDFSILSPPFSPFYSTKPPLSSWPVLNELLGTPPRRGGGRAEGL
LTSQNEKVDPKIHMELQGTLNSQKSFENKQSWRVHTS

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_2|474_bp
atgactcttaatgagcatgctgccttcaagcatctgtttaacaaagcacatcttacaccg
cccttaatccatttaaccctgagtggacacagcacatgcttcagagagcacagggttggg
ggtaaggtcaccgatcaacaggatcacaaggcagaagaatttttcttagtacagaacaaa
atgaaaagtctcccgtgtctacctctttctacacagacacggcaaccatccgatttctca
atcctttccccgcctttctcccctttctattccacaaaaccgccattgtcatcatggccc
gttctcaatgagctgttgggtacacctcccagacggggtggtggccgggcagaggggctc
ctcacttcccaaaatgaaaaagttgatcctaaaattcatatggaattgcaagggaccctg
aatagccaaaagagttttgaaaacaaacaaagttggagagttcacacttcctga

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_3|178_aa
MASVGYLVVNDDDDDGDSRICLSEDPTCNNPVMKIKCNSPHKVDLQPGLWVLSMIKHVKA
QDSDKAPFKRMVLFLRHIIDTLALASSGAASLSMALTWSPPRFRKVHPPFVHQLLPSLNL
ANVAKIKLQVDCRRQREGLCFCGFEGYSKKGSRLSVPEAGAITISDTATQPPEEFAGG

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_3|537_bp
atggcaagtgtgggctatctggtagttaatgatgatgatgatgatggtgatagcagaatt
tgtctctcagaggacccaacctgcaacaatcctgttatgaagattaaatgtaatagtccc
cataaagtggacttgcagccagggctatgggttttaagcatgataaagcatgtgaaagcc
caggacagtgataaggcaccattcaaacgtatggtgctcttcctacggcacatcattgat
actcttgctctagcctcttcaggagcagcttcacttagcatggctctaacctggtctccc
cctagattcagaaaagtgcatccaccctttgttcaccagctgcttcccagcctcaacttg
gccaatgttgctaaaatcaaactacaagtagactgcagacgccagcgtgaaggcctgtgc
ttctgtgggtttgaaggatacagcaaaaaaggaagcagactgtcagttcctgaagcaggg
gccatcaccatcagtgacacggccacccagcctccagaagagtttgctgggggctga

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_4|604_aa
MAWRGAGPSVPGAPGGVGLSLGLLLQLLLLLGPARGFGDEEERRCDPIRISMCQNLGYNV
TKMPNLVGHELQTDAELQLTTFTPLIQYGCSSQLQFFLCSVYVPMCTEKINIPIGPCGGM
CLSVKRRCEPVLKEFGFAWPESLNCSKFPPQNDHNHMCMEGPGDEEVPLPHKTPIQPGEE
CHSVGTNSDQYIWVKRSLNCVLKCGYDAGLYSRSAKEFTDIWMAVWASLCFISTAFTVLT
FLIDSSRFSYPERPIIFLSMCYNIYSIAYIVRLTVGRERISCDFEEAAEPVLIQEGLKNT
GCAIIFLLMYFFGMASSIWWVILTLTWFLAAGLKWGHEAIEMHSSYFHIAAWAIPAVKTI
VILIMRLVDADELTGLCYVGNQNLDALTGFVVAPLFTYLVIGTLFIAAGLVALFKIRSNL
QKDGTKTDKLERLMVKIGVFSVLYTVPATCVIACYFYEISNWALFRYSADDSNMAVEMLK
IFMSLLVGITSGMWIWSAKTLHTWQKCSNRLVNSGKTVPITVHGPGARTQGQSLPLLTLC
SGVLGVVQGLWPRQGKEDQAMCLSFSRARAVLAPNAGSGARQMFTDILAAAICGTGLLHL
AAVF

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_4|1815_bp
atggcctggcggggcgcagggccgagcgtcccgggggcgcccgggggcgtcggtctcagt
ctggggttgctcctgcagttgctgctgctcctggggccggcgcggggcttcggggacgag
gaagagcggcgctgcgaccccatccgcatctccatgtgccagaacctcggctacaacgtg
accaagatgcccaacctggttgggcacgagctgcagacggacgccgagctgcagctgaca
actttcacaccgctcatccagtacggctgctccagccagctgcagttcttcctttgttct
gtttatgtgccaatgtgcacagagaagatcaacatccccattggcccatgcggcggcatg
tgtctttcagtcaagagacgctgtgaacccgtcctgaaggaatttggatttgcctggcca
gagagtctgaactgcagcaaattcccaccacagaacgaccacaaccacatgtgcatggaa
gggccaggtgatgaagaggtgcccttacctcacaaaacccccatccagcctggggaagag
tgtcactctgtgggaaccaattctgatcagtacatctgggtgaaaaggagcctgaactgt
gtgctcaagtgtggctatgatgctggcttatacagccgctcagccaaggagttcactgat
atctggatggctgtgtgggccagcctgtgtttcatctccactgccttcacagtactgacc
ttcctgatcgattcttctaggttttcctaccctgagcgccccatcatatttctcagtatg
tgctataatatttatagcattgcttatattgtcaggctgactgtaggccgggaaaggata
tcctgtgattttgaagaggcagcagaacctgttctcatccaagaaggacttaagaacaca
ggatgtgcaataattttcttgctgatgtacttttttggaatggccagctccatttggtgg
gttattctgacactcacttggtttttggcagcaggactcaaatggggtcatgaagccatt
gaaatgcacagctcttatttccacattgcagcctgggccatccccgcagtgaaaaccatt
gtcatcttgattatgagactggtggatgcagatgaactgactggcttgtgctatgttgga
aaccaaaatctcgatgccctcaccgggttcgtggtggctcccctctttacttatttggtc
attggaactttgttcattgctgcaggtttggtggccttgttcaaaattcggtcaaatctt
caaaaggatgggacaaagacagacaagttagaaagactgatggtcaagattggggtgttc
tcagtactgtacacagttcctgcaacgtgtgtgattgcctgttatttttatgaaatctcc
aactgggcactttttcggtattctgcagatgattccaacatggctgttgaaatgttgaaa
atttttatgtctttgttggtgggcatcacttcaggcatgtggatttggtctgccaaaact
cttcacacgtggcagaagtgttccaacagattggtgaattctggaaagacagttcccata
actgtccacggccctggagcacgcacccaggggcagagcctgcccttactcacgctctgc
tctggtgtcttgggagttgtgcagggactctggcccaggcaggggaaggaagaccaggcg
atgtgtttgagcttctcaagggcaagagctgttcttgcgcccaatgctggttcgggtgct
cgacaaatgttcactgacatcctggcagcagccatttgtggcactggattgttgcatttg
gctgcagtgttttga

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_5|217_aa
MNIIVKTLNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIHHINRTSDKNHM
IISIDAEKAFDKIQQPFMLKTLNKLGVDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKT
GTRQGCPLSSLLFNMVLELLARAVRQEKEIQDIQLGKQEVKLSLFADDMIVYLENPTVAA
QNLLKLTSNFSKVSGYKINMQKSQAFLYTKNTQTATS

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_5|654_bp
atgaatatcattgtgaaaaccctcaataaaatactggcaaaccgaatccagcagcacatc
aaaaagcttatccaccacgatcaagtgggcttcatccctgggatgcaaggctggttcaac
atatgcaaatcaataaatgtaatccatcacataaacagaaccagtgacaaaaaccacatg
attatctcaatagatgcagaaaaggccttcgacaaaattcaacagcctttcatgctaaaa
accctcaataaactaggtgttgatggaatgtatctcaaaataataagagctatttatgac
aaaccgacagccaatatcatactgaatgggcaaaaactagaagcattccctttgaaaacc
ggcacaagacaaggatgccctctctcatcactcctattcaacatggtattggaacttctg
gccagggcagtcaggcaagagaaagaaatacaggatattcaattaggaaaacaggaagtc
aaattgtctctgtttgcagatgacatgattgtatatttagaaaaccccactgtcgcagcc
caaaatctccttaagctgacaagcaacttcagcaaagtctcaggatacaaaatcaatatg
caaaaatcacaagcattcctatacaccaagaacacacaaacagccacatcatga

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_6|233_aa
MGDFNTPLSTSDRSMRQKINKDIQNLNSALDQVDLIDIYRTLHPKSTEYTFFSAPHDTYS
KIDYIIGSKTLLSECKRTEITTNCLSDHSAIKLELRIKKLTKNHTTTWKLNNLLLNNYWV
NNEMKAEIKMFFETNDNKDTMYQNLWDTFKAVCRGKFIALNAHKRKKERSKIDTLTSQFK
EQEKQEQTNSKASRRQEITRLRAELQEIETQKTLQKINESRSWFFEKINKIDY

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_6|702_bp
atgggagactttaacaccccactgtcaacatcagacagatcaatgagacagaaaattaac
aaggatattcagaacttgaactcagctctggaccaagtggacctaatagacatctacaga
actcttcaccccaaatcaacagaatatacattcttctcagcaccacatgacacttattcc
aaaattgactatataattggaagtaaaacactcctcagcgaatgtaaaagaacagaaatc
acaacaaactgtctctcagaccacagtgcaatcaaattagaactcaggattaagaaactc
actaaaaaccacacaactacatggaaactgaacaacctgctcctgaataactactgggta
aataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgataacaaagacaca
atgtaccagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagaaggaaagatctaaaatcgacaccctaacatcacaatttaaa
gaacaagagaagcaagagcaaacaaattcaaaagctagcagaaggcaagaaataactagg
ctcagagcagaactgcaggagatagagacacaaaaaacccttcaaaaaatcaatgaatcc
aggagctggttttttgaaaagatcaacaaaatagactactag

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_7|100_aa
MGRKKSRKAKNSKNQNASSTPKEHNSLPAREQNWTENEFDKLTEAGFRRSAITNFSELKE
HVLTHRKEAKNLEKRLDKWLIRISSEEKSLNDLMELKTTV

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_7|303_bp
atggggagaaaaaagagcagaaaggctaaaaattccaaaaaccaaaacgcctcttctact
ccaaaggaacataactccttgccagcaagggaacaaaactggacagagaatgagtttgac
aagttgacagaagcaggcttcagaaggtcggcaataacaaacttctctgagctaaaggag
catgttctaacccatcgcaaggaagctaaaaaccttgaaaaaaggttagacaaatggcta
attcgaatatccagtgaagagaagagcttaaatgacctgatggagctgaaaaccacagta
tga

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_8|222_aa
MCQAICGEWRSWSSCPAAQQRVDEQLRHTDEQPGLIQAAACNSLAGPIQDFLQKYHITAA
GWLDFANCLTPSCTGNFSRAASLQNSRGTIHIVPLRICPSQSITIWMMSPEVVQLSTEQI
FQDQVIVLFPSSKALGFGVLKPQEERFYKGSGSPPPVFSALFLVMAALSKSIPHNCYEIG
HTWHPSCRVSFLQITGGALEESLKIYAPLYLVRPVTRPAGRV

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_8|669_bp
atgtgccaggcaatttgtggggagtggcggtcctggagttcttgcccagctgcccagcag
agagtggacgagcagctgaggcacactgacgaacagcctgggttaatccaggctgcagct
tgcaactcccttgcaggacccatccaggatttccttcagaagtatcacatcacagcagct
ggctggcttgactttgccaactgcttgacccctagctgcacaggaaacttcagcagggca
gcttccttgcagaactccaggggtaccatccacattgtgcctctcagaatatgtccatca
caatctattacaatatggatgatgtcccctgaagttgtgcagcttagcactgaacagata
tttcaagaccaagtcatcgtattattcccttcttccaaggcacttggttttggagtgcta
aagccacaagaggagaggttctacaagggctcaggctctccccctcctgtcttctccgcg
ctgttcctcgtcatggcggccctcagcaagtccatccctcataactgctatgagatcggc
cacacttggcacccttcctgccgggtctccttcctgcagatcaccgggggcgccctggag
gagtccctgaagatctatgctcctctgtacttggtgagacccgtcacccgtcccgcaggg
cgagtttag

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_9|338_aa
MGKKQNRKTGNSKMQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLHLIGVPESDVENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPQRYSLRRATPRHIIVRFTKVEMKEKMLRAAREKGQV
TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK
QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_9|1017_bp
atggggaaaaaacagaacagaaaaactggaaactctaaaatgcagagcgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgattttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacatctg
attggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata
cagagaacgccacaaagatactccttgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcaggtt
accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag
caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg
ctaaacatggaaaggaacaaccggtaccagccactgcaaaatcatgccaaaatgtaa

>gi568815587r:86851145_87055085|GENSCAN_predicted_peptide_10|324_aa
MDTFLHTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA
NRIQQHIKKLIHHDQVGFIPGMQGWFNIHKSINVFQHINRAKDKNHMIISIDAEKTFDKI
QQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF
NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV
SGYKINVQKSQAFLYINNRQRAKS

>gi568815587r:86851145_87055085|GENSCAN_predicted_CDS_10|975_bp
atggatacattcctccacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagtttaccaaccaaa
aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagag
aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca
aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaatatacacaaatcaataaatgtattccagcatataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaaacctttgacaaaatt
caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaa
ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgtcctctctcaccgctcctattc
aacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtttatcta
gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacatcaacaacagacag
agagccaaatcataa