GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:39:45 Sequence gi568815588r:80173784_80389423 : 215640 bp : 44.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 339 372 34 2 1 90 89 36 0.421 2.09 1.02 Intr + 5444 5472 29 0 2 101 87 4 0.009 -0.57 1.03 Term + 18551 18652 102 1 0 72 44 120 0.803 4.28 1.04 PlyA + 19178 19183 6 1.05 2.00 Prom + 22046 22085 40 -2.56 2.01 Init + 31829 31979 151 1 1 88 35 127 0.114 6.07 2.02 Intr + 34426 34448 23 2 2 108 116 13 0.049 3.36 2.03 Intr + 59881 60000 120 0 0 129 113 97 0.987 16.99 2.04 Intr + 61315 61460 146 0 2 99 111 17 0.782 4.18 2.05 Term + 66130 66139 10 1 1 109 47 4 0.164 -3.93 2.06 PlyA + 68414 68419 6 -0.45 3.05 PlyA - 69142 69137 6 1.05 3.04 Term - 73837 73436 402 1 0 47 36 568 0.061 42.65 3.03 Intr - 78816 78610 207 0 0 -50 80 247 0.723 9.37 3.02 Intr - 79275 79021 255 0 0 -44 53 465 0.520 27.44 3.01 Init - 79600 79532 69 1 0 56 75 104 0.870 6.95 3.00 Prom - 80311 80272 40 -1.46 4.10 PlyA - 84588 84583 6 1.05 4.09 Term - 100100 99998 103 1 1 38 43 90 0.379 -2.75 4.08 Intr - 100870 100737 134 1 2 18 103 251 0.733 19.04 4.07 Intr - 101416 101234 183 1 0 125 115 216 0.999 27.98 4.06 Intr - 102811 102593 219 1 0 71 57 488 0.995 42.50 4.05 Intr - 106533 106390 144 0 0 78 99 160 0.999 16.58 4.04 Intr - 107030 106897 134 0 2 89 70 150 0.623 13.66 4.03 Intr - 110255 110133 123 0 0 116 72 218 0.989 23.56 4.02 Intr - 111806 111729 78 0 0 91 83 91 0.994 8.42 4.01 Init - 115640 115550 91 2 1 69 114 53 0.984 6.65 4.00 Prom - 125958 125919 40 -4.26 5.00 Prom + 126957 126996 40 -4.46 5.01 Sngl + 131286 131567 282 2 0 88 37 153 0.785 5.69 5.02 PlyA + 132169 132174 6 1.05 6.00 Prom + 132695 132734 40 -6.36 6.01 Sngl + 134143 135528 1386 0 0 42 32 458 0.887 31.69 6.02 PlyA + 135637 135642 6 -0.45 7.00 Prom + 135920 135959 40 -2.46 7.01 Init + 136357 136489 133 0 1 78 47 90 0.551 4.20 7.02 Term + 141908 141993 86 0 2 131 55 99 0.992 8.72 7.03 PlyA + 142403 142408 6 1.05 8.00 Prom + 151849 151888 40 -3.16 8.01 Init + 154520 154522 3 1 0 93 95 0 0.190 1.20 8.02 Intr + 177065 177271 207 1 0 73 34 83 0.108 0.67 8.03 Intr + 182814 182918 105 2 0 81 78 84 0.190 7.11 8.04 Intr + 188652 188807 156 2 0 95 85 72 0.214 7.81 8.05 Intr + 189168 189290 123 2 0 96 97 146 0.999 17.08 8.06 Term + 192905 193168 264 1 0 93 39 194 0.857 10.51 8.07 PlyA + 193342 193347 6 1.05 9.03 PlyA - 194436 194431 6 1.05 9.02 Term - 205216 205080 137 2 2 39 43 114 0.539 0.18 9.01 Init - 210385 210373 13 1 1 70 119 6 0.360 2.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 52902 52893 10 0 1 71 82 9 0.854 -1.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_1|54_aa MAHCSLDLLGSARLRSACHMITAKGHRTSEKVYTKKGRDQNRRKGTKETDNDEK >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_1|165_bp atggctcactgcagcctggacctcctgggctcagccaggttaagatctgcctgccacatg attacagcaaagggtcacaggacatctgagaaagtctacaccaagaaagggagagaccaa aacagaagaaaaggaaccaaagaaacagacaacgatgagaagtag >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_2|149_aa MRTGHGGPARSSSLPAAELLQEVSGSGGPSAAPNPCGAQCRAEPHSPSCRGLIETSYKVV KVLIVLTPQFLSRDKDQLTKELQQHVKSVTVSCKSPRKVLSHITRLHPPSKGQGENLTHS VDSIKATIWCQPVWETVEGQRRRVGNCQD >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_2|450_bp atgaggactgggcacggcgggcccgcgaggagcagcagccttccggctgcggagctgctc caggaagtctcgggctccggaggtccctctgccgcccctaacccctgcggcgcccagtgc cgcgcagagccgcattcgccctcctgcagggggctcatagaaacatcatacaaggtcgtg aaggtgttgattgtcctgaccccacagtttctgtcccgtgacaaggaccagctgaccaag gagctgcagcagcatgtaaagtcagtgacagtctcatgcaagtccccaaggaaggtcctg agtcacatcacacggctacatccaccttcaaagggtcaaggggagaacctgacacactcg gtggattccattaaggccaccatatggtgtcagcctgtgtgggagactgtggaggggcag aggaggagggtagggaattgccaggactga >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_3|310_aa MEIIEVTEEIHVKVVIEGPDATQEVDEVPEDQQLQEVDEVPEDLQLQGVDEVPEDQQLQE VNEVPEDQQLQKVDEVPEDHQLQEVDEVPEDHQLREVDEVPEDRQLQEELDKAPENNRVE EVVKFSGDSLVQEVAEFPEDSRVEVVEFPEDSPVEEFVEVPENLQMEGVFEFPDNTQCSA LRKNGFVVLKGWPCKIVEMSASKTGKHGHAKVHLVGIDIFTGKKYEDICPSTHNMDVPNI RRNDFQLIGIQDGYLSLLQDSGEVPEDLRLPEGDLGKETEQKYDCGEEILITVLSAMTEE AAVAIKAMAK >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_3|933_bp atggagattattgaagtaaccgaggagattcatgtgaaggtggttattgaggggccagac gccacccaggaggtggatgaggtcccagaggaccaacagctgcaggaggtggatgaggtc ccagaagacctacagctgcagggtgtggatgaagtcccagaggaccaacagctgcaggag gtgaatgaggtcccggaagaccaacagctgcaaaaggtggatgaggtcccggaggaccac cagctgcaggaggtggatgaggttccggaggaccaccagcttcgagaggtggatgaggtc ccggaggaccgacagctgcaggaggagctggataaggccccagagaacaatcgagtggag gaggtggttaagttttcaggggactctctagtgcaggaggtggctgagttcccagaggac agtcgagtggaggtggttgaattcccagaggactctccagtggaggagtttgttgaggtc ccagaaaaccttcagatggagggagtgtttgagttcccagacaacacccagtgctcagca ttacgtaagaatggctttgtggtgctcaaaggctggccatgtaagatcgtggagatgtct gcttcgaagactggcaagcacggccacgccaaggtccatctggttggtattgacatcttt actgggaagaaatatgaagatatctgcccgtcaactcataatatggatgtccccaacatc agaaggaatgacttccagctgattggcatccaggatgggtacctatcactgctccaggac agcggggaggtaccagaggaccttcgtctccctgagggagaccttggcaaggagactgag cagaagtacgactgtggagaagagatcctgatcacggtgctgtctgccatgacagaggag gcagctgttgcaatcaaggccatggcaaaataa >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_4|402_aa MNGPVDGLCDHSLSEGVFMFTSESVGEGHPDKICDQISDAVLDAHLKQDPNAKVACETVC KTGMVLLCGEITSMAMVDYQRVVRDTIKHIGYDDSAKGSLCFRLGFDFKTCNVLVALEQQ SPDIAQCVHLDRNEEDVGAGDQGLMFGYATDETEECMPLTIILAHKLNARMADLRRSGLL PWLRPDSKTQVTVQYMQDNGAVIPVRIHTIVISVQHNEDITLEEMRRALKEQVIRAVVPA KYLDEDTVYHLQPSGRFVIGGPQGDAGVTGRKIIVDTYGGWGAHGGGAFSGKDYTKVDRS AAYAARWVAKSLVKAGLCRRVLVQVSYAIGVAEPLSISIFTYGTSQKTERELLDVVHKNF DLRPGVIVRDLDLKKPIYQKTACYGHFGRSEFPWEVPRKLVF >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_4|1209_bp atgaatggaccggtggatggcttgtgtgaccactctctaagtgaaggagtcttcatgttc acatcggagtctgtgggagagggacacccggataagatctgtgaccagatcagtgatgca gtgctggatgcccatctcaagcaagaccccaatgccaaggtggcctgtgagacagtgtgc aagaccggcatggtgctgctgtgtggtgagatcacctcaatggccatggtggactaccag cgggtggtgagggacaccatcaagcacatcggctacgatgactcagccaagggctcactt tgcttccgcctaggctttgacttcaagacttgcaacgtgctggtggctttggagcagcaa tccccagatattgcccagtgcgtccatctggacagaaatgaggaggatgtgggggcagga gatcagggtttgatgttcggctatgctaccgacgagacagaggagtgcatgcccctcacc atcatccttgctcacaagctcaacgcccggatggcagacctcaggcgctccggcctcctc ccctggctgcggcctgactctaagactcaggtgacagttcagtacatgcaggacaatggc gcagtcatccctgtgcgcatccacaccatcgtcatctctgtgcagcacaacgaagacatc acgctggaggagatgcgcagggccctgaaggagcaagtcatcagggccgtggtgccggcc aagtacctggacgaagacaccgtctaccacctgcagcccagtgggcggtttgtcatcgga ggtccccagggggatgcgggtgtcactggccgtaagattattgtggacacctatggcggc tggggggctcatggtggtggggccttctctgggaaggactacaccaaggtagaccgctca gctgcatatgctgcccgctgggtggccaagtctctggtgaaagcagggctctgccggaga gtgcttgtccaggtttcctatgccattggtgtggccgagccgctgtccatttccatcttc acctacggaacctctcagaagacagagcgagagctgctggatgtggtgcataagaacttc gacctccggccgggcgtcattgtcagggatttggacttgaagaagcccatctaccagaag acagcatgctacggccatttcggaagaagcgagttcccatgggaggttcccaggaagctt gtattttag >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_5|93_aa MGRNQSRKAENSKKQSAPSSPKDRSSSPAMEQSRTENDFDELTEVGFRKLVINFFELKED VQTHRKEAKNFEKRLNEWLTKINSVEKTLNDLM >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_5|282_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaaacagagcgccccttcttct ccaaaggatcgcagctcctcgccagcaatggaacaaagcaggacagagaatgattttgat gagttgacagaagtaggcttcagaaagttggtaataaacttctttgagctaaaggaggat gttcaaacccatcgcaaggaagctaaaaactttgaaaaaagattaaatgaatggctaact aaaataaacagtgtagagaagaccttaaatgacctgatgtag >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_6|461_aa MIISIDAEKAFNKIQQPFIIKVFNKLGIDGTYLKVIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNITLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIAYLENPVVS AQNLLKLISNFSKVSAYKINVQKSQAFLDTNNRQTESQIMSELPLTIATKIIKYLGIQLT KDVKDLFKENYKPLFNEIKEDTNKWKNIPCSWTGKINIMKTAILPKVIYRFSAITIKLPM TFFTELEKTTLKFIWSQKRARIAKTILSQKNKAGGIMLPDFKLYYKATVTQTAWYWYQNR DIDQWNRTEASEITPHIYNPLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSRWIKDLNVRPKTIKALEENLGKTIQDIDMGKDFMTKTSKAMATKAKIDKWDLI KLKSFCTAKETIIRVNRHPTEWDKSFAIYSSDKRLISRIYK >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_6|1386_bp atgattatctcaatagatgcagaaaaggccttcaacaaaattcaacagcccttcataata aaagttttcaataaactaggtattgatgggacatatctcaaagtaataagagctatttat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacaaggatgccctctctcaccactcctattcaacataacgttggaagtt ctggccagggcaatcaggcaggagaaagaaataaagggtattcagttaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgcgtatttagaaaaccccgtcgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcagcatacaaaatcaat gtgcaaaaatcacaggcattcttagacaccaataacagacaaacagagagccaaatcatg agtgaactcccactcacaattgctacaaagataataaaatacctaggaatccaacttaca aaggatgtgaaggacctcttcaaggagaactacaaaccactgttcaatgaaataaaagag gacacaaacaaatggaagaacattccatgctcatggacaggaaaaatcaatatcatgaaa acggccatactgcccaaggtaatttatagattcagtgccatcaccatcaaactaccaatg acattcttcacagaattggaaaaaactactttaaagttcatatggagccaaaaaagagcc cgcattgccaagacaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactatactacaaggctacagtaacccaaacagcatggtactggtaccaaaacaga gatatagaccaatggaacagaacagaggcctcagaaataacaccacacatctacaaccct ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttacacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaagccctagaagaaaacctaggcaaaaccattcaggacatagacatgggcaaggacttc atgactaaaacatcaaaagcaatggcaacaaaagccaaaatagacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactatcatcagagtgaacaggcatcctaca gaatgggacaaaagttttgcaatctactcatctgacaaaaggctaatatccagaatctac aaataa >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_7|72_aa MEYYAAIKKDEFMSFVGTRMKLETIILSKLSQGQKNKHHTSSLIGGLLRSPSLPNEGCSL PQGAQSHQLPKG >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_7|219_bp atggaatactatgcagccataaagaaggatgagttcatgtcctttgtagggacacggatg aagctggaaaccatcattctgagcaaactatcacaaggacagaaaaacaaacaccatacg tcctcactcataggtgggctcctgcgcagcccgagcctccccaacgagggctgctccctg ccccagggcgcccagtcccatcaactgcccaagggctga >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_8|285_aa MLVLHDPSLQQFPYQYLQLPNHATPILLQLKGLPAPLLHSDIWAQVKRITSSHNQIKWHD YQYLIFNCYQVSPARRPRPNGQNRLCPPDPAKLGEQIPKTVRAFPAARMETNYLKRCFGN CLAQALAEVAKVRPSDPIEYLAHWLYHYRKTAKAKEENREKKIHLQEEYDSSLKEMEMTE MLKQEEYQIQQNCEKCHKELTSETVSTKKTIFMQEDTNPLEKEALKQEFLPGTSSLIPGM PQQVPPSESAGQIDQNFKMPQEINYKEAFQHEVAHEMPPGSKSPF >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_8|858_bp atgcttgtactccatgacccatcccttcaacaattcccttaccaatatcttcagctccct aatcatgctaccccaatcctgcttcaattaaaaggtctgcctgctccacttcttcactca gacatctgggcccaagtgaagagaatcacatcatcacataaccaaatcaaatggcacgat tatcagtacttgatcttcaactgctaccaagtgagtcctgctcgacgcccaaggcccaac ggtcagaaccgcctctgcccgccggacccagcgaagctaggggaacaaatcccaaaaaca gttcgggcctttccggctgccaggatggaaactaactacctgaagaggtgctttggaaat tgcctggcccaggcactggcagaggtggcgaaggttcggcccagtgacccaatagaatac ctggctcactggctttatcattacaggaaaacagcaaaagcaaaagaagagaatagggaa aagaagatccacctgcaggaggaatatgacagtagcctcaaggaaatggaaatgacagaa atgctgaaacaggaagagtatcagattcaacagaactgtgaaaagtgtcacaaggaactg acttctgaaactgtttccacgaagaagaccatattcatgcaggaggacacaaaccccctt gagaaggaggccttgaagcaggaattcctgccaggtacttccagtctgattccaggaatg cctcaacaggttcctccttcagagtctgctggccagattgaccagaacttcaaaatgcca caagaaataaattacaaggaggcttttcagcatgaagttgctcatgaaatgcctcctggc tccaaatctcctttttag >gi568815588r:80173784_80389423|GENSCAN_predicted_peptide_9|49_aa MAVSEIAIATPAFSNYHPDQSATINVKARPTTSKKIYNSEKIHMIVSIF >gi568815588r:80173784_80389423|GENSCAN_predicted_CDS_9|150_bp atggctgtgtctgaaattgccatagccaccccagccttcagcaactaccaccctgatcag tcagcaaccatcaatgttaaggcaagacccaccaccagcaaaaagatttacaattcggag aagattcacatgattgttagcattttttag