GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:01:07

Sequence gi568815595f:160000851_160202788 : 201938 bp : 43.73% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    408    530  123  2  0   29  111    76 0.205   4.78
 1.02 Intr +  14380  14423   44  0  2   67   71    47 0.013  -2.06
 1.03 Intr +  19796  19894   99  2  0  138  119     9 0.573   8.23
 1.04 Term +  50898  50964   67  0  1   83   49    89 0.022   1.91
 1.05 PlyA +  51004  51009    6                               1.05

 2.07 PlyA -  51606  51601    6                               1.05
 2.06 Term -  59655  59449  207  0  0  104   53    46 0.450   0.04
 2.05 Intr -  60051  59969   83  1  2   66   97    57 0.080   3.76
 2.04 Intr -  61020  60966   55  0  1   31   89    51 0.051  -1.95
 2.03 Intr -  69404  69288  117  2  0   41   73    56 0.103   0.06
 2.02 Intr -  73935  73758  178  2  1   80   44    63 0.130   1.02
 2.01 Init -  79508  79495   14  2  2  112   96    17 0.311   3.85
 2.00 Prom -  83709  83670   40                              -2.76

 3.00 Prom +  93414  93453   40                              -5.56
 3.01 Sngl + 100001 100441  441  1  0   85   44   464 0.994  37.95
 3.02 PlyA + 100628 100633    6                               1.05

 4.00 Prom + 100895 100934   40                              -6.36
 4.01 Sngl + 101480 101941  462  1  0   70   54   353 0.936  26.26
 4.02 PlyA + 102932 102937    6                               1.05

 5.00 Prom + 105743 105782   40                              -5.46
 5.01 Sngl + 115191 115427  237  2  0   70   39   172 0.914   5.59
 5.02 PlyA + 116385 116390    6                               1.05

 6.00 Prom + 120078 120117   40                              -4.46
 6.01 Init + 120882 121014  133  2  1   78   -9   124 0.670   2.00
 6.02 Intr + 125800 125883   84  2  0   84   80    59 0.183   4.49
 6.03 Intr + 131018 131070   53  0  2   34  110    51 0.015   0.53
 6.04 Intr + 135967 136090  124  0  1    7   77   104 0.018   1.36
 6.05 Intr + 136945 137049  105  2  0   88   57    39 0.549   0.99
 6.06 Intr + 140641 140668   28  2  1  104  103    16 0.212   1.87
 6.07 Intr + 154981 155074   94  1  1    4   46   132 0.061   0.57
 6.08 Intr + 155444 155553  110  1  2   97   77    76 0.860   6.48
 6.09 Term + 169953 170058  106  0  1   90   52   107 0.236   5.18
 6.10 PlyA + 171003 171008    6                               1.05

 7.00 Prom + 174060 174099   40                              -1.46
 7.01 Init + 190768 190840   73  0  1   73   99    43 0.719   3.14
 7.02 Term + 194995 195089   95  2  2  106   42    79 0.756   3.09
 7.03 PlyA + 195234 195239    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 135979 136090  112  0  1   92   77    84 0.936   8.07

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:160000851_160202788|GENSCAN_predicted_peptide_1|110_aa
VADVLETHSCTTASRNPNNDLICWFPSYPTRSKSEALQGSSFTIHVYPHFNWTYNREWAT
PGTLHILAFVSCLSPLECKLQRNCIRSTWDKTAAALMSVTEDVVINGIEK
>gi568815595f:160000851_160202788|GENSCAN_predicted_CDS_1|333_bp
gttgcagatgtcttagaaacacattcctgcaccacagcttcaagaaatccaaataatgac
ctgatctgctggtttccaagctaccctaccaggagcaagtcagaggccctgcagggctcc
tcgttcaccatccacgtctaccctcattttaactggacttacaatagggaatgggccact
cctggcaccctgcacattttagcatttgtttcctgtctctccccactagaatgcaagctg
cagaggaattgtatccgtagcacctgggacaagacagcagctgccctgatgtctgttact
gaagatgtagttatcaatgggattgagaaatga
>gi568815595f:160000851_160202788|GENSCAN_predicted_peptide_2|217_aa
MGKTRHSTPWPQGPVVTAPVCSAQETLPLEIHRTPRKALMKGGVKTVQLLWKVPVWKECG
VSTQEHTQMYTKACGKMLKTSLSSLPKTGSNQTPLTVEEQIKENWCVLGLIDFKNEAADP
RALGRSMGLRAVEQGAALVAGAQATQEPKGEGAGSSLGHPRKGLLQCSGRLKSSSSVARV
GAEAEEVPRVSEGCEGFEGCQHAVTSQHHCTPIWVTE
>gi568815595f:160000851_160202788|GENSCAN_predicted_CDS_2|654_bp
atgggcaagaccagacactcaacaccctggccacagggccctgtggtgactgcacccgtg
tgctcagcccaggagactcttcctctggagatccataggaccccaaggaaggcactgatg
aaaggaggtgtgaagacagtccagttactctggaaggtccctgtgtggaaggaatgtggg
gtcagcacacaggagcacacacagatgtacaccaaagcctgtggcaagatgctcaaaact
agtttgtcatcactgccaaaaacgggaagtaatcaaactcctttaacagtggaagaacag
atcaaggagaattggtgcgttcttggtctcattgacttcaagaatgaagctgcggaccca
cgtgcccttgggcggtctatgggactccgtgctgtggagcagggggcggcactcgtggcg
ggggctcaggccacgcaggagcccaagggtgagggagccggctccagcctcggccatccc
aggaaggggctcctacagtgcagcggcaggctgaagagctcctcaagtgtggccagagtg
ggcgccgaggccgaggaggtgccaagagtgagcgagggatgcgagggctttgagggctgc
cagcacgctgtcacctctcaacaccactgcactccaatctgggtgacagagtga
>gi568815595f:160000851_160202788|GENSCAN_predicted_peptide_3|146_aa
MGKKHKKHKSDKHLHEEYVEKPLKLVLKVGGNEVTELSMGSSRHDSSLFEDKNDHDKHKD
RKRKKRKKGEKQIPGEEKGRKRRRVKEDKKKRDRDRVENEAEKDLQCHAPVRLDLPPEKP
LTSSLAKQEVEQTPLQEALNQLMRQL
>gi568815595f:160000851_160202788|GENSCAN_predicted_CDS_3|441_bp
atgggcaagaagcacaagaagcacaagtcggacaaacacctccacgaggagtatgtagag
aagccattgaagctggtcctcaaagtaggagggaacgaagtcaccgaactctccatgggc
agctcgaggcacgactccagcctcttcgaagacaaaaacgatcatgacaaacacaaggac
agaaagcggaaaaagagaaagaaaggagagaagcagattccaggggaagaaaaggggaga
aaacggagaagagttaaggaggataaaaagaagcgagatcgagaccgggtggagaatgag
gcagaaaaagacctccagtgtcacgcccctgtgagattagacttgccccctgagaagcct
ctcacaagctctttagccaaacaagaagtagaacagacaccccttcaagaagctttgaat
caactgatgagacaattgtag
>gi568815595f:160000851_160202788|GENSCAN_predicted_peptide_4|153_aa
MEITEVEPPGRLESNTQDRLIALKAVTNFGVPVEVFDSEEAEIFQKKFGETTRLLRELQE
AQNERLSTRPPPNMICLLGPSYREMHLAEQVTNNLKELAQQVTPGDIVSMYGVRKAMGIS
IPSPVMENNFVDLTEDTEKPKKTDVAECGPGGS
>gi568815595f:160000851_160202788|GENSCAN_predicted_CDS_4|462_bp
atggagattacagaagtagagcccccagggcgtttggagtccaatactcaagacaggctc
atagcgctgaaagcagtaacaaattttggcgttccagttgaagtttttgactctgaagaa
gctgaaatattccagaagaaatttggtgagaccaccagattgctcagggaactccaggaa
gcccagaatgaacgtttgagcaccagaccccctcccaacatgatctgtctcttgggtccc
tcatacagagaaatgcatcttgctgaacaagtgaccaataaccttaaagaacttgcacag
caagtaactccaggtgatatcgtaagcatgtatggagttcgaaaagcaatgggtatttcc
attccttcccccgtcatggaaaacaactttgtggatttgacagaagacactgaaaaacct
aaaaagacggatgttgctgagtgtggacctggtggaagttga
>gi568815595f:160000851_160202788|GENSCAN_predicted_peptide_5|78_aa
MTSPNELNKLPWTNPGETEICDLSDTEFKISVLKNLKEIQDNTEKESRILSDKYKKQIEI
IKGNQAEILELRNADGTL
>gi568815595f:160000851_160202788|GENSCAN_predicted_CDS_5|237_bp
atgacttcaccaaacgaactaaataagctgccatggaccaatcctggagaaacagagata
tgtgacctttcagacacagaattcaaaatatctgtgttgaagaacctcaaagaaattcaa
gataacacagagaaggaatccagaattctatcagacaaatataagaaacagattgaaata
attaaagggaatcaagcagaaattctggagttgagaaatgcagatggcacactttag
>gi568815595f:160000851_160202788|GENSCAN_predicted_peptide_6|278_aa
MEGYAAIKKDEFMAFAGTWMKLETITLSKLTQEQKTKHYMFSLINSSLWETTAFEESANQ
LESEYNHPFSKMKLEMTVLVAIKSAKQHEQPCVAMVTENLGPAKIAFPNSPEVYIALSEW
KLQVQDKVFWEDSMGPKTLFVKIKKKGSKKSKNFKRTRIRSPRTDGESLLWDFVIIDVMK
KTTVGSGIEKKEIAISDDVVQEDFMEEKMHIFSKALISTPFETITEQLKVLSTLGREELQ
TKRSLNPTVNLSSLPTVPVPTQLKKSRENTQGFKELNG
>gi568815595f:160000851_160202788|GENSCAN_predicted_CDS_6|837_bp
atggaaggctatgcagccataaaaaaggatgagttcatggcctttgcagggacatggatg
aagctggaaaccatcactctcagcaaactaacacaagaacagaaaaccaaacactacatg
ttctcactcataaattcttcactgtgggagactacggcttttgaagagtcagccaaccag
cttgagagtgaatataatcacccattttctaaaatgaaacttgaaatgacagtgttagta
gccatcaagtcggcaaagcagcatgaacagccttgtgtagccatggtcacagagaacttg
gggcctgctaaaatagcctttcccaacagccctgaggtttatattgcccttagtgaatgg
aagttacaagtccaagataaggtcttctgggaagatagtatgggtcccaagaccttattt
gtgaagattaagaaaaaggggagcaagaagagcaaaaatttcaaaagaactagaataaga
agtccaagaactgatggagagagcctactgtgggactttgtgatcattgatgtgatgaag
aaaaccacagtgggatcagggatagagaaaaaggagattgctatttcagatgatgtggtc
caggaagacttcatggaggagaagatgcacattttcagcaaagctctcatctccacgccc
tttgagactatcacagaacagctaaaagtgctgagcacattagggagggaagagctgcag
acaaagagaagtttaaatcccacggtcaacctcagctctctgcctactgtgcctgtgccc
acacagctgaagaagtccagagaaaacacacaaggcttcaaggaattgaatgggtga
>gi568815595f:160000851_160202788|GENSCAN_predicted_peptide_7|55_aa
MGKGWGTAKGAAHTTSQLQIPRTAGTRVELEAIILSKLTQEEKTKYRMLSLINES
>gi568815595f:160000851_160202788|GENSCAN_predicted_CDS_7|168_bp
atggggaaggggtggggaacagcaaagggagcggcccacaccacctcccagctccagatc
cctagaactgcaggaacacgggtggagctggaggccattatccttagcaaactaactcag
gaagagaagaccaaatatcgcatgctctcacttataaatgagagctaa