GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:24:42 Sequence gi568815578f:33972075_34178546 : 206472 bp : 44.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 2678 2829 152 0 2 33 47 195 0.561 7.97 1.02 PlyA + 2877 2882 6 1.05 2.00 Prom + 2919 2958 40 -6.36 2.01 Init + 10414 10602 189 0 0 107 89 62 0.557 7.21 2.02 Intr + 20562 20688 127 2 1 68 76 65 0.566 3.55 2.03 Intr + 24730 24784 55 2 1 135 103 47 0.637 8.94 2.04 Term + 28023 28128 106 0 1 52 49 68 0.167 -2.82 2.05 PlyA + 28151 28156 6 1.05 3.00 Prom + 28988 29027 40 -2.56 3.01 Init + 31443 31482 40 2 1 93 79 33 0.348 3.25 3.02 Intr + 40954 41066 113 2 2 64 102 1 0.008 -0.80 3.03 Intr + 45363 45431 69 2 0 108 79 1 0.019 0.58 3.04 Intr + 87329 87392 64 1 1 81 92 51 0.434 3.09 3.05 Term + 89149 89165 17 2 2 130 43 22 0.403 0.30 3.06 PlyA + 91923 91928 6 1.05 4.00 Prom + 93785 93824 40 -1.36 4.01 Init + 99857 99868 12 1 0 73 96 5 0.565 0.01 4.02 Intr + 99992 100256 265 1 1 105 94 360 0.632 35.39 4.03 Intr + 101489 101561 73 0 1 82 75 8 0.998 -2.64 4.04 Intr + 101745 101792 48 0 0 113 99 183 0.999 19.70 4.05 Intr + 103800 103966 167 0 2 96 65 211 0.999 19.20 4.06 Intr + 104628 104741 114 1 0 83 76 190 0.997 17.72 4.07 Intr + 104954 105171 218 0 2 82 81 274 0.999 24.32 4.08 Intr + 107836 107998 163 0 1 110 49 120 0.962 9.85 4.09 Term + 109110 109165 56 1 2 76 44 45 0.284 -3.38 4.10 PlyA + 111089 111094 6 1.05 5.10 PlyA - 111102 111097 6 1.05 5.09 Term - 117831 117656 176 1 2 87 28 139 0.945 5.72 5.08 Intr - 118528 118443 86 0 2 64 89 10 0.811 -1.84 5.07 Intr - 121657 121601 57 0 0 87 116 14 0.670 2.10 5.06 Intr - 124731 124583 149 0 2 75 93 124 0.975 10.63 5.05 Intr - 125442 125342 101 1 2 78 94 48 0.982 4.23 5.04 Intr - 126559 126424 136 1 1 63 90 164 0.975 14.24 5.03 Intr - 131491 131388 104 2 2 100 62 111 0.604 9.59 5.02 Intr - 133471 133294 178 1 1 85 100 203 0.840 20.69 5.01 Init - 135232 135188 45 1 0 81 87 -2 0.329 -0.32 5.00 Prom - 136950 136911 40 -6.76 6.00 Prom + 139289 139328 40 -8.36 6.01 Sngl + 139530 140324 795 2 0 42 48 355 0.975 22.79 6.02 PlyA + 142080 142085 6 1.05 7.00 Prom + 148065 148104 40 -1.86 7.01 Init + 150396 150450 55 2 1 81 7 113 0.869 3.95 7.02 Intr + 150555 150987 433 1 1 -5 32 375 0.743 15.30 7.03 Term + 151012 151216 205 1 1 19 46 249 0.969 10.64 7.04 PlyA + 153699 153704 6 1.05 8.00 Prom + 162828 162867 40 -2.76 8.01 Init + 187887 187978 92 2 2 95 96 59 0.759 7.36 8.02 Term + 202167 202305 139 0 1 100 44 89 0.808 3.14 8.03 PlyA + 202378 202383 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:33972075_34178546|GENSCAN_predicted_peptide_1|50_aa XPSGGIPEEGIVIIGVDSSMYVLPPKTFQWHEMWVDLEYDDIDDPDPVWA >gi568815578f:33972075_34178546|GENSCAN_predicted_CDS_1|153_bp ngtccttcaggaggtattcctgaagaaggcattgttatcataggagttgacagctccatg tatgtattgccgccgaagaccttccagtggcatgagatgtgggtggacctggaatatgat gatattgatgatcctgatcctgtgtgggcctag >gi568815578f:33972075_34178546|GENSCAN_predicted_peptide_2|158_aa MATRDIIGNYETTLICMSYGDSYENYQVGSKPSLFICALGGPSMGGRGSGSSSHTAGFGP CGSGREVIKQQKDNYNCWYGKARTLNSNSAPFSGIPSVKQVLGKRSPENYRSTFCEFDYS RNLIWKGENDSRNVNLLVLQLPPGYCKEIVLRVQLTDP >gi568815578f:33972075_34178546|GENSCAN_predicted_CDS_2|477_bp atggccaccagggacatcattgggaactatgaaaccactttaatttgtatgagctatgga gattcttatgagaactatcaagttggtagcaagccctctctcttcatctgtgccttagga ggcccatcaatgggaggaaggggatctggaagttcctcccatacagcagggtttggtccc tgtgggtcaggcagggaggtcatcaaacaacagaaggacaattataactgctggtatggc aaagccaggactctgaattcaaactctgctccattctcaggaatcccgtctgttaagcag gttctggggaagcgatcccctgaaaactatcgttctactttctgtgaatttgactactct aggaacctcatctggaagggagaaaatgacagccgtaatgttaacctgcttgtcctacaa cttccccctgggtactgtaaggaaattgtgctcagggtgcagctcacagacccttag >gi568815578f:33972075_34178546|GENSCAN_predicted_peptide_3|100_aa MEQKTGGELFSGNIKYSINYVRYSTLYYKIGFVLVDFAHLKASVNVLRIFKTERELRLSP WSCSKTGGNSLKPVEDLQQLSILGVFLSMLTWHMSGCSMT >gi568815578f:33972075_34178546|GENSCAN_predicted_CDS_3|303_bp atggagcaaaagactggaggtgaactgttcagcgggaacataaagtattcaataaattat gtgagatattcaacactgtattataaaataggctttgtgttagttgattttgctcacctg aaggctagtgttaatgttctgcgcatattcaagactgaacgggagctcaggctgtctccc tggagctgttcaaagactggaggaaattctttaaaacctgtggaagacctgcagcagctg tccatcctgggagtattcctgtctatgcttacgtggcacatgtcaggatgtagcatgacg tag >gi568815578f:33972075_34178546|GENSCAN_predicted_peptide_4|371_aa MQEQVGTMSLKLQASNVTNKNDPKSINSRVFIGNLNTALVKKSDVETIFSKYGRVAGCSV HKGYAFVQYSNERHARAAVLGENGRVLAGQTLDINMAGEPKPDRPKGLKRAASAIYSGYI FDYDYYRDDFYDRLFDYRGRLSPVPVPRAVPVKRPRVTVPLVRRVKTNVPVKLFARSTAV TTSSAKIKLKSSELQAIKTELTQIKSNIDALLSRLEQIAAEQKANPDGKKKGDGGGAGGG GGGGGSGGGGSGGGGGGGSSRPPAPQENTTSEAGLPQGEARTRDDGDEEGLLTHSEEELP DRSNGHQQVKGIAAPGLKPGTQPWMPPPSGYQRKAGSRRLLPQRIPASAMSSAGYKYLPW SSELRDSYVAT >gi568815578f:33972075_34178546|GENSCAN_predicted_CDS_4|1116_bp atgcaggaacaggtgggcaccatgtccttgaagcttcaggcaagcaatgtaaccaacaag aatgaccccaagtccatcaactctcgagtcttcattggaaacctcaacacagctctggtg aagaaatcagatgtggagaccatcttctctaagtatggccgtgtggccggctgttctgtg cacaagggctatgcctttgttcagtactccaatgagcgccatgcccgggcagctgtgctg ggagagaatgggcgggtgctggccgggcagaccctggacatcaacatggctggagagcct aagcctgacagacccaaggggctaaagagagcagcatctgccatatacagtggctacatc tttgactatgattactaccgggacgacttctacgacaggctcttcgactaccggggccgt ctgtcgcccgtgccagtgcccagggcggtccctgtgaagcgaccccgggtcacagtccct ttggtccggcgtgtcaaaactaacgtacctgtcaagctctttgcccgctccacagctgtc accaccagctcagccaagatcaagttaaagagcagtgagctgcaggccatcaagacggag ctgacacagatcaagtccaatatcgatgccctgctgagccgcttggagcagatcgctgcg gagcaaaaggccaatccagatggcaagaagaagggtgatggaggtggcgccggcggcggc ggcggtggtggtggcagcggtggcggtggcagtggtggtggcggtggcggtggcagcagc cggccaccagccccccaagagaacacaacttctgaggcaggcctgccccagggggaagca cggacccgagacgacggcgatgaggaagggctcctgacacacagcgaggaagagctgcct gacaggagcaatggccaccagcaggtgaagggcatcgctgccccaggcctcaagccgggc acccaaccctggatgccaccccccagcgggtaccagaggaaagctggcagcaggcgcctc ctcccccaacgcatcccagccagtgccatgtcctctgcaggttacaagtacctaccctgg tccagtgaactgcgggattcatatgtggccacttag >gi568815578f:33972075_34178546|GENSCAN_predicted_peptide_5|343_aa MLNLSMVLIEYGERLMIFDPTMSKKKKKKKKPFMLDEEGDTQTEETQPSETKEVEPEPTE DKDLEADEEDTRKKDASDDLDDLNFFNQKKKKKKTKKIFDIDEAEEGVKDLKIESDVQEP TEPEDDLDIMLGNKKKKKKNVKFPDEDEILEKDEALEDEDNKKDDGISFSNQTGPAWAGS ERDYTYEELLNRVFNIMREKNPDMVAGEKRKFVMKPPQVVRVGTKKTSFVNFTDICKLLH RQPKHLLAFLLAELGTSGSIDGNNQLVIKGRFQQKQIENVLRRYIKEYVTCHTCRSPDTI LQKDTRLYFLQCETCHSRCSVASIKTGFQAVTGKRAQLRAKAN >gi568815578f:33972075_34178546|GENSCAN_predicted_CDS_5|1032_bp atgctgaatttaagcatggtattaatagaatatggagagagattgatgatttttgatcct actatgagcaagaagaaaaagaagaagaagaagccttttatgttagatgaggaaggggat acccaaacagaggaaacccagccttcagaaacaaaagaagtggagccagagccaactgag gacaaggatttggaagctgatgaagaggacactaggaaaaaagatgcttctgatgatcta gatgacttgaacttctttaatcaaaagaaaaagaagaaaaaaactaaaaagatatttgat attgatgaagctgaagaaggtgtaaaggatcttaagattgaaagtgatgttcaagaacca actgaaccagaggatgaccttgacattatgcttggcaataaaaagaagaaaaagaagaat gttaagttcccagatgaggatgaaatactagagaaagatgaagctctagaagatgaagac aacaaaaaagatgatggtatctcattcagtaatcagacaggccctgcttgggcaggctca gaaagagactacacatacgaggagctgctgaatcgagtgttcaacatcatgagggaaaag aatccagatatggttgctggggagaaaaggaaatttgtcatgaaacctccacaagtcgtc cgagtaggaaccaagaaaacttcttttgtcaactttacagatatctgtaaactattacat cgtcagcccaaacatctccttgcatttttgttggctgaattgggtacaagtggttctata gatggtaataaccaacttgtaatcaaaggaagattccaacagaaacagatagaaaatgtc ttgagaagatatatcaaggaatatgtcacttgtcacacatgccgatcaccggacacaatc ctgcagaaggacacacgactctatttcctacagtgcgaaacttgtcattctagatgttct gttgccagtatcaaaaccggcttccaggctgtcacgggcaagcgagcacagctccgtgcc aaagctaactaa >gi568815578f:33972075_34178546|GENSCAN_predicted_peptide_6|264_aa MLGRQTRAGYRPRQDSLESFNEGVIPSQERLLAAPLAPPAPSEASWGGSAQQKGMQGSPA PVSFGVINRAFEPLPGGRLNVCSPFRKRQPRQEVRLSAAALRKAGCGQACGSGVLAGSAA PGRLGAAASHGGGRLLGRRPVLPHWSGLGFSPQSLALTPGSDPHLVPRHGCGSSGLGTDG KSDGSAPGPGGSAAPADTSPTTALGSCISERKRHPALSSKRWKCACAELAKAQARRRANL ARFVWRELPGDAASPGMPGLVVHL >gi568815578f:33972075_34178546|GENSCAN_predicted_CDS_6|795_bp atgctaggaaggcaaacaagggcgggataccggccccgacaagatagcctagaaagtttt aacgagggcgttataccaagccaagagaggctcctcgcggcacccctggcacccccggca cccagtgaggcgagctggggcggcagtgcccagcagaaaggaatgcagggatcgccagcc ccagtctcgttcggggttattaaccgggcattcgagccccttccgggaggacgactcaac gtgtgctctccattccggaaacgccaaccacggcaggaggtccgcctcagcgctgccgcc ctgaggaaagcgggctgcggccaagcctgcggctccggagtcctcgccggctctgccgct ccgggccgcctaggcgcggctgcctcacatggcggcggccggctgctaggccgcaggccc gttctcccacactggagtgggttaggcttctcgcctcaatccttagcccttacgccgggc tcagatcctcacctcgtccccagacatggctgcggctcgagtgggctcggcacggacggg aagtcagacgggtcagccccaggccccggcggcagcgctgcccctgccgatacctctccc accaccgcactaggctcttgcatcagcgaaaggaaacgacaccccgccctctcctccaag cgttggaaatgcgcctgcgcagagctcgccaaggcgcaggcgcgcagaagagccaaccta gcgcgcttcgtgtggcgcgagttaccaggcgacgccgcgtcgccaggcatgccgggactt gtagtccatttataa >gi568815578f:33972075_34178546|GENSCAN_predicted_peptide_7|230_aa MGNRGGFRGGFGGGIRGQGHEDQVPGGCLFSLPIKESETIDFFLGSSLKEEGLKIMPVQK QTRAGQRTRFKAFVATGDYNGHVGLCVKCSKEVATAIRGSVILTKHSIVPVRRGYGGTRW TSPHRPLQGDRPLRLCAGAPHPCTQGHWHRLRTCAQEAPHDGCQGLHCHPGNFAKATFDT ISKTYSYLTLDLWKETIFTKSPYQEFTDHLVKTHTRVSMQRTQAPAVATA >gi568815578f:33972075_34178546|GENSCAN_predicted_CDS_7|693_bp atggggaaccgcggtggcttccgcggaggtttcggcggtggcatccggggccagggacat gaagatcaagtacctggaggatgtctcttctccctgcccatcaaggaatctgagaccatt gacttttttctggggtcctctctcaaggaagagggtttgaagattatgccggtgcagaag cagacccgcgcgggccagcgcaccaggttcaaggcgtttgttgccaccggggactacaat ggccacgtcggtctgtgcgttaagtgctccaaggaggtggctactgccatccgcgggtcc gtcatcctgaccaagcactccattgtccccgtgcgcagaggctacgggggaacaagatgg acaagcccacaccgtcccttgcaaggtgacaggccgctgcggctctgtgctggtgcacct catccctgcacccaggggcactggcatcgtctccgcacctgtgcccaagaagctcctcat gatggctgccaggggctgcactgccaccctggcaacttcgccaaggccacctttgacacc atctctaagacctacagctacctgaccctcgacctctggaaggagactatattcaccaag tctccctatcaggaattcactgaccacctcgtcaagacccacaccagagtctccatgcag aggacccaggctccagctgtggctacagcatag >gi568815578f:33972075_34178546|GENSCAN_predicted_peptide_8|76_aa MKNLSQARTIIVPHSPGNSNWSKEQAYDPESPPPEISADQNRSLKKLPEAGESSTRKEQK EQSSEYQPQQDKIHYV >gi568815578f:33972075_34178546|GENSCAN_predicted_CDS_8|231_bp atgaagaatttgtctcaggcccggacaatcatagttcctcattcccctggcaacagtaat tggtccaaggagcaggcatatgacccagaaagtccccctcctgaaattagtgctgatcag aacagaagtttgaagaaactacctgaggctggagaaagtagcaccagaaaagagcagaag gaacaatcctcagagtaccagccccaacaagataaaattcactatgtctga