GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:35:53 Sequence gi568815585f:21040610_21247339 : 206730 bp : 43.87% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 5417 5076 342 2 0 68 94 320 0.935 27.84 1.00 Prom - 10604 10565 40 -6.16 2.00 Prom + 13122 13161 40 -3.26 2.01 Init + 20210 20643 434 1 2 87 48 247 0.745 14.29 2.02 Term + 20743 20983 241 1 1 87 46 151 0.496 6.30 2.03 PlyA + 22861 22866 6 1.05 3.05 PlyA - 23297 23292 6 1.05 3.04 Term - 30507 30389 119 1 2 89 55 7 0.038 -3.80 3.03 Intr - 31249 31204 46 1 1 63 92 83 0.052 4.18 3.02 Intr - 35849 35778 72 2 0 126 95 -21 0.022 2.00 3.01 Init - 54105 53842 264 0 0 111 69 345 0.456 31.91 3.00 Prom - 54291 54252 40 -2.06 4.04 PlyA - 55043 55038 6 1.05 4.03 Term - 62693 62388 306 2 0 89 32 368 0.295 26.42 4.02 Intr - 80011 79843 169 0 1 49 66 111 0.266 4.95 4.01 Init - 86947 86928 20 1 2 74 105 2 0.195 0.15 4.00 Prom - 87990 87951 40 -3.76 5.00 Prom + 89268 89307 40 -2.26 5.01 Init + 100001 100072 72 1 0 90 67 108 0.659 9.97 5.02 Intr + 100277 100386 110 1 2 132 73 170 0.608 18.98 5.03 Intr + 135785 135897 113 2 2 100 24 115 0.260 6.32 5.04 Term + 136303 136616 314 2 2 99 36 427 0.381 33.76 5.05 PlyA + 136883 136888 6 1.05 6.00 Prom + 143914 143953 40 -3.26 6.01 Init + 178382 178491 110 1 2 46 91 108 0.701 6.59 6.02 Intr + 179830 179915 86 1 2 49 36 70 0.149 -2.64 6.03 Intr + 185370 185419 50 1 2 87 101 23 0.198 1.90 6.04 Intr + 185774 185880 107 1 2 36 98 81 0.430 2.91 6.05 Term + 191419 191875 457 1 1 59 37 215 0.435 8.10 6.06 PlyA + 192768 192773 6 1.05 7.00 Prom + 195579 195618 40 -0.76 7.01 Init + 205724 205880 157 1 1 80 55 99 0.035 5.87 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 106577 106733 157 2 1 91 32 138 0.996 5.91 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:21040610_21247339|GENSCAN_predicted_peptide_1|114_aa MRPKTFPATTYSGNSRQRLQEIREGLKQPSKSSVQGLPAGPNSDTSLDAKVLGSKDATRQ QQQMRATPKFGPYQKALREIRYSLLPFANESGTSAAAEVNRQMLQELVNAGCDQ >gi568815585f:21040610_21247339|GENSCAN_predicted_CDS_1|342_bp atgaggccaaagacttttcctgccacgacttattctggaaatagccggcagcgactgcaa gagattcgtgaggggttaaaacagccatccaagtcttcggttcaggggctacccgcagga ccaaacagtgacacttccctggatgccaaagtcctggggagcaaagatgccaccaggcag cagcagcagatgagagccaccccaaagttcggaccttatcagaaagccttgagggaaatc agatattccttgttgccttttgctaatgaatcgggcacctctgcagctgcagaagtgaac cggcaaatgctgcaggaactggtgaacgcaggatgcgaccag >gi568815585f:21040610_21247339|GENSCAN_predicted_peptide_2|224_aa MAPDPWDRCAAGPGVPRLSGAPGRASSEKPIPRRQRKQRRARRYGLSGLGHRPRPAPPLP RRDPGRPGRRTTKRRSPAAPARRRPPTSPAPLEPAPRAPYGLDYEPPGGGGSGRARTVGA GRRAPSGQTSSAAPPGALPGASAPQGAVYTLGGVLAGGSRASRAGRPGTGGWSCCWPRRD GGAPGGGGGVDGDCSIFPGRSRRGSKAQTQVGHSLHCWHSTGVT >gi568815585f:21040610_21247339|GENSCAN_predicted_CDS_2|675_bp atggccccggatccgtgggaccgctgcgccgccggaccgggggtcccgaggctgtcaggg gcgcccggccgggcgtctagtgaaaagccgatcccccggagacagcggaagcagaggcgc gcccggcgctacggcctttcgggtctaggacaccgcccccggcccgcgccgcccctgccc cgccgggatcccggccgccccggtcgccgcacaacaaagcggcgcagccccgctgccccc gcccgacgccggcctccaacttccccggctcctctggaaccggctccgcgggctccgtac ggcctggactacgagccgccgggcgggggcggctccgggcgggcgcggaccgtgggggca gggcgcagggctccgtccggacaaacttcctcggccgccccgccgggcgcccttcccgga gcctcggcaccacagggcgcggtctacaccctcgggggcgtcctggccggcggctcccgg gcctctcgcgccgggagaccgggcactggtggctggagctgctgctggccccgcagggac gggggagccccgggcggcggcggtggcgtggacggcgactgctccatcttcccggggcgc tcacgccgcggttccaaagcgcagacccaagtgggacattcgctacattgttggcattcc acgggcgtcacgtga >gi568815585f:21040610_21247339|GENSCAN_predicted_peptide_3|166_aa MAKEGIAAGGVMDINTALQEVLKTTLIHDGLARGIHEAAKPLDKGQAHLYVLASNCDETV YVKLVEAICAKHQINFIKVDDNKKVGEWPQPPPLRPCTVRRLQPPRALPPEDVCNHNHIY SSPFAKSGESPYHNLIMVFIIKPIYQAGKDNSFGFYLSALVDKAKY >gi568815585f:21040610_21247339|GENSCAN_predicted_CDS_3|501_bp atggccaaggaaggcattgctgctggaggtgtaatggacattaatactgctttacaagag gtgctgaagaccaccctcatccacgatggcctagcacgtggaattcacgaagctgccaaa cccttagacaagggccaagcccatctttatgtgcttgcatccaactgtgatgagactgtg tatgtcaagttggtggaagccatttgtgctaaacaccaaatcaacttcattaaggttgat gacaacaagaaagtaggggaatggcctcagccaccgcccctccgcccctgcactgtgcga cgactgcagcctccacgtgcccttcctcctgaagatgtttgtaaccacaaccacatctac agctccccctttgccaagtcaggtgaatcaccttatcataatctcatcatggttttcatt atcaaacccatttatcaagcaggcaaagataactcattcggtttttatttatcagctctg gttgataaagcaaaatactga >gi568815585f:21040610_21247339|GENSCAN_predicted_peptide_4|164_aa MGLAGRSAGIAGVSQSVRPSLTYKEFLEIEKINPNEKWATDTVYRTRNANGPGINERMLN LSHVQQYCITMTAKDCSIMIALSPCLQDARSDQRPVIPSSRPRVAFSMSVVDLDLKPYES IPYQYKLDGKIINYYSKTVRAKDNTMMSTRFKESEDCTLVLHKI >gi568815585f:21040610_21247339|GENSCAN_predicted_CDS_4|495_bp atggggcttgcaggcagaagtgctgggattgcaggtgtgagccaatctgtccggccctct ctgacttataaggagttcttagaaatagaaaagattaatccaaatgaaaaatgggcaacg gacacagtttatagaacaagaaatgcaaatggccctggaatcaatgaacggatgctcaac ctctctcatgtgcagcagtactgcatcaccatgactgccaaggactgctccatcatgatt gcactctctccgtgtctgcaggatgccagatctgatcaaaggcctgtcatcccttcatca aggcccagggttgctttctccatgtctgtggtggaccttgacctcaagccctacgagagc attccctatcagtataaactggatggcaagatcatcaactactattcaaagactgtacgt gccaaagacaataccatgatgtcgactcggttcaaggaaagtgaagattgcacattagtt ctccacaagatctaa >gi568815585f:21040610_21247339|GENSCAN_predicted_peptide_5|202_aa MAVESRVTQEEIKKEPEKPIDREKTCPLLLRVFTTNNGRHHRMDEFSRGNVPSSELQIYT CARCSRAVSQSSVLARDRSFPQKLRIGSMLSTAGKDSRRTMFLTALLWRGRIPGRQWIGK HRRPRFVSLRAKQNMIRRLEIEAENHYWLSMPYMTREQERGHAAVRRREAFEAIKAAATS KFPPHRFIADQLDHLNVTKKWS >gi568815585f:21040610_21247339|GENSCAN_predicted_CDS_5|609_bp atggcggtggagtcgcgcgttacccaggaggaaattaagaaggagccagagaaaccgatc gaccgcgagaagacatgcccactgttgctacgggtcttcaccaccaataacggccgccac caccgaatggacgagttctcccggggaaatgtaccgtccagcgagttgcagatctacact tgcgctcgctgcagccgggccgtctcgcagtccagcgtgctggccagagaccgcagcttc ccgcagaagctccggatagggtccatgctgagcacagcggggaaggactccaggcgcacc atgttcctgactgcgctcctctggcgcggccgcattcccggccgtcagtggatcgggaag caccggcggccgcggttcgtgtcgttgcgcgccaagcagaacatgatccgccgcctggag atcgaggcggagaaccattactggctgagcatgccctacatgacccgggagcaggagcgc ggccacgccgcggtgcgcaggagggaggccttcgaggccataaaggcggccgccacttcc aagttccccccgcatagattcattgcggaccagctcgaccatctcaatgtcaccaagaaa tggtcctaa >gi568815585f:21040610_21247339|GENSCAN_predicted_peptide_6|269_aa MLYKIIFEWATIYKINYKLSDELNDATEPELGTTLSCGSSENKEVVEKLSAKMPPTLVSQ EIFKEESLSDGMITMATVKVLQSYVCHMRNRNRLHFRVAEEMESEQQVHSRGNGGCKSKY GNKMMISNRSLIKECRKSTYGIEVSENKQVLTGSVLIMPLLKQKQNILVHQGALEKAELL LLRLYHSSADCDMETTGAPSTDGKIVISQTASKSHKALSKCCSECFIEQHLANLKAGDVT IGLIEGGVAECISQSKHKEQQHGKLKRGQ >gi568815585f:21040610_21247339|GENSCAN_predicted_CDS_6|810_bp atgctttacaagattatttttgaatgggctacaatatataaaatcaactacaagctttca gatgagctgaatgatgctacagaacctgaacttggaactaccttgtcttgtggctcgtct gaaaataaagaagttgttgaaaagctctcagcaaaaatgccaccaacacttgtatcacag gaaatcttcaaagaggaaagcttatcagatggaatgatcacaatggcgacggtgaaagtt ctacagtcctacgtgtgtcacatgagaaacaggaaccgcctacatttccgagttgccgag gagatggaaagtgaacagcaggtgcacagtaggggtaacggtggctgtaaaagcaaatat ggcaacaagatgatgatatccaacaggtctctaataaaggaatgcaggaaaagcacatac ggaatagaagtgagtgagaacaagcaagtgcttacaggttctgtgctgatcatgcctctg ctaaaacaaaaacaaaacattttggttcatcaaggtgcgctggagaaagctgagctgctg ctgctcagactctatcacagctccgctgactgtgacatggaaacaacaggagccccatct acagatggaaagatcgtaatttctcagaccgcatcaaagtctcacaaagccttgtctaaa tgctgttctgagtgtttcattgagcaacatttggcaaatctcaaggctggagatgtaaca ataggattaatagaaggaggagttgctgaatgcatttctcagtccaaacacaaggagcaa caacacggaaagttaaaacgtggccagtga >gi568815585f:21040610_21247339|GENSCAN_predicted_peptide_7|53_aa MGMDLNECSDPQPVSSQTVNTSVLDDTKEQLVAVTSVTVEHCAGAKGLSTGLX >gi568815585f:21040610_21247339|GENSCAN_predicted_CDS_7|159_bp atggggatggatctgaatgaatgttcagatccccagcctgtctcttctcaaacagtgaac acctctgtgctagatgacaccaaggaacagctggtagctgtgacctcagtaactgtggag cactgtgctggagccaaagggctctcaaccgggctggnn