GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:28:35

Sequence gi568815586r:7589881_7795728 : 205848 bp : 44.15% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   2629   2668   40                              -3.26
 1.01 Sngl +   5632   6054  423  0  0   58   49   229 0.974  12.30
 1.02 PlyA +   6239   6244    6                               1.05

 2.00 Prom +   9979  10018   40                              -2.46
 2.01 Init +  21960  22294  335  2  2   73   81   149 0.646   9.47
 2.02 Term +  38805  38985  181  0  1   10   49   225 0.242   7.88
 2.03 PlyA +  41526  41531    6                               1.05

 3.03 PlyA -  41874  41869    6                               1.05
 3.02 Term -  50066  49795  272  0  2   63   42   130 0.798   1.55
 3.01 Init -  50349  50175  175  0  1   79   97   140 0.760  11.51
 3.00 Prom -  57466  57427   40                              -4.26

 4.04 PlyA -  57601  57596    6                               1.05
 4.03 Term -  61389  61364   26  1  2  107   38    24 0.305  -2.31
 4.02 Intr -  62955  62558  398  2  2   89   98   185 0.750  13.82
 4.01 Init -  80169  80117   53  0  2  107   59    26 0.230   2.23
 4.00 Prom -  97853  97814   40                              -5.16

 5.03 PlyA -  97880  97875    6                               1.05
 5.02 Term - 100824  99998  827  1  2  137   45   535 0.947  47.33
 5.01 Init - 105848 105581  268  2  1   72  116   150 0.924  12.16
 5.00 Prom - 113900 113861   40                              -5.06

 6.00 Prom + 115656 115695   40                              -0.86
 6.01 Init + 121691 121772   82  1  1   60  115    61 0.835   7.13
 6.02 Term + 123403 123506  104  2  2   77   38    27 0.253  -4.96
 6.03 PlyA + 123615 123620    6                              -0.45

 7.05 PlyA - 124819 124814    6                               1.05
 7.04 Term - 125454 125310  145  2  1   51   54    96 0.332  -0.12
 7.03 Intr - 126795 126658  138  2  0   85  110    55 0.399   6.98
 7.02 Intr - 141032 140917  116  2  2  134   94    26 0.838   7.05
 7.01 Init - 147590 147549   42  2  0   59   91    56 0.810   3.43
 7.00 Prom - 148597 148558   40                              -5.26

 8.04 PlyA - 148637 148632    6                              -0.45
 8.03 Term - 149695 149535  161  2  2   60   47    90 0.891   0.20
 8.02 Intr - 151651 151541  111  2  0   63   82    87 0.480   5.95
 8.01 Init - 157468 157438   31  1  1   77   99    35 0.806   3.40
 8.00 Prom - 163622 163583   40                              -5.66

 9.00 Prom + 167583 167622   40                              -3.06
 9.01 Init + 180115 180435  321  0  0   56   80   172 0.437  10.54
 9.02 Intr + 197838 197963  126  2  0   37  109    49 0.011   2.78
 9.03 Intr + 203108 203332  225  1  0   68   96   107 0.692   7.68
 9.04 Intr + 204577 204663   87  0  0   99   92    61 0.874   7.77
 9.05 Term + 204799 205215  417  0  0  113   49   167 0.477  10.58
 9.06 PlyA + 205757 205762    6                              -3.64

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  14127  14193   67  2  1   88  111    21 0.816   2.88
S.002 Term +  17332  17456  125  2  2   42   49   103 0.804   0.35
S.003 Init + 197862 197963  102  2  0   70  109    78 0.846   8.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_1|140_aa
MELKNTARELHEAYTSFNSRIDQEEERISVIGDQLNEIKREDKIREKRMKRNEQNLQEIW
DYVKRPNLRLIGVPENDEENKINLENIFQHIIQENFPNLARQANIQIQEIQRTAPRYSSR
RATPRHIIVRFTKVEIRKKY

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_1|423_bp
atggagctgaaaaacacagcacgagaacttcatgaagcatacacaagtttcaatagccga
atcgatcaagaggaagaaagaatatcagtgattggagatcaacttaatgaaataaagaga
gaagacaagattagagaaaaaagaatgaaaaggaatgaacaaaacctccaagaaatatgg
gactatgtgaaaagaccaaatctacgtttgattggtgtacctgaaaatgacgaggagaat
aaaatcaacttggaaaacatttttcagcatattatacaggagaacttccccaacctggca
agacaggccaacattcaaattcaggaaatacagagaacagcaccaagatactcctcaaga
agagcaaccccaagacatataattgtcagattcaccaaggttgaaataaggaaaaaatat
taa

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_2|171_aa
MDKFLETYNLPRLNQEEIESLNRPIMSSKIESVIKSLPTRKSPGLYEFTAKFCQMYKKEL
VPLLQKVFKKIEEEKLFPNSFYEASIILIPKPGRDTQTKRKFLANILNEHISQCRAAAPA
AAAAAPAAAAAAAAPAAAAPAAAAAAAPLALLCGERRGHLQDGGRPFHKPE

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_2|516_bp
atggataaattcctggaaacatacaatctcccaagattgaaccaggaagagattgaatcc
ctgaacagaccaataatgagttccaaaattgaatcagtaataaaaagcctaccaaccaga
aaaagcccaggactatatgaattcacagccaaattctgccagatgtacaagaaagagctt
gtgccattgcttcagaaagtattcaagaaaattgaggaggagaaactttttcctaactca
ttctacgaagccagcatcatcctaataccaaaacctggcagagacacacaaacaaaaagg
aagtttctggccaatatcctcaatgaacatataagtcagtgccgggctgccgcccctgcc
gccgccgccgccgcccctgccgccgccgccgccgccgccgcccctgccgccgccgcccct
gccgccgccgccgccgccgctcctctagcgctcctctgtggagagcgccgcgggcacctg
caagacggggggcgccccttccacaaacccgagtag

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_3|148_aa
MRLHSSALGRSMGLGAVEQGAALVGEAPAAQEPTEAGEGSGMAGCSPKACPAGRQLRPGP
AGCSECGAAKPTPPRNSSWPASAARSPGSRSRVSLHTSLQAEGAGCGLRQPRKGLPQCSG
GLKGSSSAAKVGAQAEEAPRASESCEDC

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_3|447_bp
atgcgcttgcactcctcagcccttgggcggtcgatgggactgggcgccgtggagcagggg
gcggcgctcgtcggggaggctccggctgcacaggaacccacggaggcgggggaaggctca
ggcatggcgggctgcagtcccaaggcctgccccgcgggaaggcagctaaggcccgggccg
gccggctgctccgagtgcggggccgccaagcccacgcccccccggaactccagctggccc
gcaagcgccgcgcgcagccccggttcccgctcgcgcgtctccctccacacctccctgcaa
gctgagggagcgggctgcggccttcgccagcccagaaaggggctcccacagtgcagcggt
gggctgaagggctcctcaagtgccgccaaagtgggagcccaggcagaggaggcgccgaga
gcgagcgagagctgtgaggactgctag

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_4|158_aa
MGPLYSNLGNKSKILSPGRRIEPWEFDVFYDPRELRKEACLLYEIKWGMSRKIWRSSGKN
TTNHVEVNFIKKFTSERDFHPSMSCSITWFLSWSPCWECSQAIREFLSRHPGVTLVIYVA
RLFWHMDQQNRQGLRDLVNSGVTIQIMRASGGLMSSSN

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_4|477_bp
atggggccattgtactccaacctgggcaacaagagcaaaattttgtctccagggagaaga
atcgaaccctgggagtttgacgtcttctatgaccccagagaacttcgtaaagaggcctgt
ctgctctacgaaatcaagtggggcatgagccggaagatctggcgaagctcaggcaaaaac
accaccaatcacgtggaagttaattttataaaaaaatttacgtcagaaagagattttcac
ccatccatgagctgctccatcacctggttcttgtcctggagtccctgctgggaatgctcc
caggctattagagagtttctgagtcggcaccctggtgtgactctagtgatctacgtagct
cggcttttttggcacatggatcaacaaaatcggcaaggtctcagggaccttgttaacagt
ggagtaactattcagattatgagagcatcagggggtctaatgtccagcagcaactag

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_5|364_aa
MLRFLPDLAFSFLLILALGQAVQFQEYVFLQFLGLDKAPSPQKFQPVPYILKKIFQDREA
AATTGVSRDLCYVKELGVRGNVLRFLPDQGFFLYPKKISQASSCLQKLLYFNLSAIKERE
QLTLAQLGLDLGPNSYYNLGPELELALFLVQEPHVWGQTTPKPGKMFVLRSVPWPQGAVH
FNLLDVAKDWNDNPRKNFGLFLEILVKEDRDSGVNFQPEDTCARLRCSLHASLLVVTLNP
DQCHPSRKRRAAIPVPKLSCKNLCHRHQLFINFRDLGWHKWIIAPKGFMANYCHGECPFS
LTISLNSSNYAFMQALMHAVDPEIPQAVCIPTKLSPISMLYQDNNDNVILRHYEDMVVDE
CGCG

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_5|1095_bp
atgcttcgtttcttgccagatttggctttcagcttcctgttaattctggctttgggccag
gcagtccaatttcaagaatatgtctttctccaatttctgggcttagataaggcgccttca
ccccagaagttccaacctgtgccttatatcttgaagaaaattttccaggatcgcgaggca
gcagcgaccactggggtctcccgagacttatgctacgtaaaggagctgggcgtccgcggg
aatgtacttcgctttctcccagaccaaggtttctttctttacccaaagaaaatttcccaa
gcttcctcctgcctgcagaagctcctctactttaacctgtctgccatcaaagaaagggaa
cagttgacattggcccagctgggcctggacttggggcccaattcttactataacctggga
ccagagctggaactggctctgttcctggttcaggagcctcatgtgtggggccagaccacc
cctaagccaggtaaaatgtttgtgttgcggtcagtcccatggccacaaggtgctgttcac
ttcaacctgctggatgtagctaaggattggaatgacaacccccggaaaaatttcgggtta
ttcctggagatactggtcaaagaagatagagactcaggggtgaattttcagcctgaagac
acctgtgccagactaagatgctcccttcatgcttccctgctggtggtgactctcaaccct
gatcagtgccacccttctcggaaaaggagagcagccatccctgtccccaagctttcttgt
aagaacctctgccaccgtcaccagctattcattaacttccgggacctgggttggcacaag
tggatcattgcccccaaggggttcatggcaaattactgccatggagagtgtcccttctca
ctgaccatctctctcaacagctccaattatgctttcatgcaagccctgatgcatgccgtt
gacccagagatcccccaggctgtgtgtatccccaccaagctgtctcccatttccatgctc
taccaggacaataatgacaatgtcattctacgacattatgaagacatggtagtcgatgaa
tgtgggtgtgggtag

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_6|61_aa
MDPSQFNPTYIPGSPQMLTEENSRDDSAVSPASRRKNAGLYIRVGKKNERTPSLSLSLRL
P

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_6|186_bp
atggacccatcacagtttaatccaacctacatcccagggtctccacaaatgctcaccgaa
gaaaattcccgggacgattcagctgtgtcccctgcgtccagaagaaagaatgctggtcta
tatatccgagtgggaaagaaaaatgaacgtacccccagtctctccctttccttgcgtctg
ccttag

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_7|146_aa
MGADLVVINTREEQDFIIQNLKRNSSYFLGLSDPGGRRHWQWVDQTPYNENVTHSAEVFV
LLLYSATVKIFVTARESINLTPRVTPSVNYELWMIILSGHSSSISLRTAAPTDSRRSKAS
DRGETDSLLALIVKLLRFFINVSEEI

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_7|441_bp
atgggggctgatctggtggtgatcaacaccagggaagaacaggatttcatcattcagaat
ctgaaaagaaattcttcttattttctggggctgtcagatccagggggtcggcgacattgg
caatgggttgaccagacaccatacaatgaaaatgtcacacactctgctgaagtgtttgtc
ctcttactatacagtgctaccgtgaagatatttgtgacagcacgtgaaagtatcaaccta
acaccaagagtgacccctagtgtaaactatgaactctggatgataatcctgtcaggccac
tcatcttcgatttccctgaggactgctgctcctacagactctcgacggagtaaagcttcc
gatagaggggaaacagattcgctactagcgttgatagtcaagttactaaggttctttatc
aacgtctcggaggagatttga

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_8|100_aa
MVPEEEPQDRVPHNFMYSKTVKRLSKLREYQQYHPSLTCVMEGKDIEGIIGRLCPGSGTA
YSVVCRAGGRQVPPQKCLYGLNPLQEGNMDPKGIKQLFQE

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_8|303_bp
atggtgcctgaagaagagcctcaagaccgagtgcctcacaattttatgtatagcaaaact
gtcaagaggctgtccaagttacgagagtatcaacagtatcatccaagcctgacctgcgtc
atggaaggaaaggacatagaagggataattggtcggctgtgtccaggatcagggactgcc
tacagcgtcgtatgcagagccggagggagacaagttccaccccagaaatgcctctatggc
cttaatcctttgcaggaaggaaatatggacccaaagggcatcaaacaattatttcaagaa
tga

>gi568815586r:7589881_7795728|GENSCAN_predicted_peptide_9|391_aa
MPWDQDPEQSTGNYSEDEQNGKQKWREEGEAGRKREREKEEKNEKELQDEQENKRKRENE
KQKQYPEKRLVSKSLMHTLWAKFKLNRCPTIQESLSLSFEFDMTHKQTQWDRERGMGEFS
SGFYAKTPFCKEQSFWYLPFGELRASSASDSPDSSTSPKGKQPTSAEKSVAKKEDKVPVK
KQKTRTVFSSTQLCVLNDRFQRQKYLSLQQMQELSNILNLSYKQVKTWFQNQRMKSKRWQ
KNNWPKNSNGVTQKASAPTYPSLYSSYHQGCLVNPTGNLPMWSNQTWNNSTWSNQTQNIQ
SWSNHSWNTQTWCTQSWNNQAWNSPFYNCGEESLQSCMQFQPNSPASDLEAALEAAGEGL
NVIQQTTRYFSTPQTMDLFLNYSMNMQPEDV

>gi568815586r:7589881_7795728|GENSCAN_predicted_CDS_9|1176_bp
atgccttgggatcaagatccagaacaatcaactggaaattacagtgaagatgaacaaaat
ggaaagcagaaatggagagaagaaggagaagcaggcagaaagagagaacgagaaaaagaa
gaaaaaaacgaaaaggagctgcaagatgaacaggaaaacaaaaggaaaagggaaaatgag
aaacagaaacagtatcccgagaaaagattagtcagcaaatccctcatgcatactctctgg
gcaaagtttaagttaaacaggtgccccactatacaagagagtctatcactgtcatttgaa
tttgacatgacacataaacagacacaatgggacagggagcgggggatgggggaattcagc
tcaggcttttatgcaaagacccccttctgcaaagaacaaagcttctggtacctgcccttt
ggagagctgcgggcaagctcagcctcggacagccctgattcttccaccagtcccaaaggc
aaacaacccacttctgcagagaagagtgtcgcaaaaaaggaagacaaggtcccggtcaag
aaacagaagaccagaactgtgttctcttccacccagctgtgtgtactcaatgatagattt
cagagacagaaatacctcagcctccagcagatgcaagaactctccaacatcctgaacctc
agctacaaacaggtgaagacctggttccagaaccagagaatgaaatctaagaggtggcag
aaaaacaactggccgaagaatagcaatggtgtgacgcagaaggcctcagcacctacctac
cccagcctttactcttcctaccaccagggatgcctggtgaacccgactgggaaccttcca
atgtggagcaaccagacctggaacaattcaacctggagcaaccagacccagaacatccag
tcctggagcaaccactcctggaacactcagacctggtgcacccaatcctggaacaatcag
gcctggaacagtcccttctataactgtggagaggaatctctgcagtcctgcatgcagttc
cagccaaattctcctgccagtgacttggaggctgccttggaagctgctggggaaggcctt
aatgtaatacagcagaccactaggtattttagtactccacaaaccatggatttattccta
aactactccatgaacatgcaacctgaagacgtgtga