GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:19:13 Sequence gi568815592f:150834968_151035563 : 200596 bp : 43.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 5190 5929 740 1 2 63 45 480 0.524 34.83 1.02 PlyA + 6676 6681 6 1.05 2.08 PlyA - 7247 7242 6 1.05 2.07 Term - 14510 14422 89 0 2 44 49 112 0.714 0.72 2.06 Intr - 14716 14591 126 2 0 42 73 68 0.459 1.35 2.05 Intr - 17881 17814 68 0 2 125 2 70 0.581 0.65 2.04 Intr - 21438 21363 76 1 1 43 117 74 0.921 4.37 2.03 Intr - 21700 21652 49 1 1 58 101 53 0.848 1.85 2.02 Intr - 21986 21749 238 1 1 67 94 44 0.759 0.52 2.01 Init - 23339 23275 65 2 2 78 98 52 0.863 5.82 2.00 Prom - 24618 24579 40 -7.46 3.00 Prom + 25804 25843 40 -2.76 3.01 Init + 26637 26736 100 2 1 76 58 43 0.272 0.52 3.02 Intr + 30620 31082 463 0 1 81 109 380 0.699 31.61 3.03 Intr + 31111 31309 199 1 1 31 80 100 0.513 2.85 3.04 Intr + 42667 42717 51 0 0 56 78 81 0.802 3.00 3.05 Intr + 47795 47919 125 1 2 99 86 141 0.997 14.38 3.06 Intr + 50667 50767 101 0 2 93 91 72 0.835 7.75 3.07 Intr + 52878 53014 137 1 2 44 94 89 0.332 5.39 3.08 Intr + 70683 70794 112 2 1 89 99 74 0.356 8.55 3.09 Intr + 83610 83701 92 1 2 83 72 42 0.309 1.81 3.10 Intr + 90905 90942 38 1 2 107 59 13 0.829 -2.64 3.11 Intr + 91155 91328 174 0 0 101 86 186 0.919 18.75 3.12 Intr + 101837 101973 137 2 2 65 79 64 0.715 3.41 3.13 Intr + 103732 103778 47 2 2 85 94 42 0.707 2.63 3.14 Intr + 109519 109626 108 0 0 46 100 194 0.999 16.88 3.15 Intr + 110500 110574 75 0 0 89 106 40 0.980 5.61 3.16 Intr + 114064 114166 103 0 1 83 90 102 0.998 9.65 3.17 Intr + 121028 121104 77 0 2 89 96 100 0.999 10.13 3.18 Intr + 125308 125448 141 0 0 47 97 212 0.954 18.55 3.19 Intr + 130002 130070 69 2 0 97 100 57 0.989 7.18 3.20 Intr + 136980 137091 112 2 1 87 97 133 0.989 14.05 3.21 Intr + 167493 167525 33 1 0 80 91 34 0.044 1.09 3.22 Intr + 174852 174991 140 1 2 89 84 201 0.875 19.98 3.23 Intr + 178812 178853 42 2 0 110 94 18 0.889 3.04 3.24 Intr + 179913 180013 101 2 2 105 108 86 0.999 11.21 3.25 Intr + 180549 180726 178 0 1 52 110 137 0.996 12.22 3.26 Intr + 199526 199633 108 1 0 85 98 106 0.926 11.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:150834968_151035563|GENSCAN_predicted_peptide_1|246_aa XSVQRCSVVVSQPNKENWCQDHLYNSLGRKGISAKSQPYHRSQSSSSVLINKSMDSINYP SDVGKQQLLSLHRSSRCESHQDLLPDIADSHQQGTEKLSDLTLQDSQKVVVVNRNLPLNA QIATQNYFSNFKETDGDEDDYVEIKSEEDESELELSHNRRRKSDSKFVDADFSDNVCSGN TLHSLNSPRTPKKPVNSKLGLSPYLTPYNDSDKLNDYLWRGPSPNQQNIVQSLREKFQCL SSSSFA >gi568815592f:150834968_151035563|GENSCAN_predicted_CDS_1|741_bp ngctctgtgcagagatgcagcgtggtagtaagtcagcccaacaaagagaactggtgtcag gaccatctttacaactccttgggtcggaaagggatcagcgctaaatctcagccttatcac aggtcccagtcatcttcctccgtcttgatcaacaaatcaatggattccatcaactaccct agtgatgtgggaaagcagcagctgctgtctttacacagaagttcaaggtgtgagagtcac caggacttgctgccagatattgctgactcgcatcaacagggcactgaaaaactctcagat ctcacactccaagactcacagaaagttgtggtggtcaatagaaatttacccttaaatgcc caaattgcaacacagaattatttttccaatttcaaagagactgatggagatgaagatgac tatgtggaaatcaagtcagaagaagatgagtcggagttggagctatctcacaatcgtaga aggaaatctgactcaaagtttgtggatgctgacttttctgataatgtctgcagcggcaac acattgcattctttgaatagtccgcgcactccaaaaaagccggttaacagcaaacttggc ctttcaccatatctgacaccatataatgattctgacaaactgaatgactatctttggagg gggccatctcccaatcaacaaaatattgtccagtctctaagggaaaaatttcagtgtctc agttcaagcagctttgcttaa >gi568815592f:150834968_151035563|GENSCAN_predicted_peptide_2|236_aa MELKVTQSTQTEDVGRTAAVRRKDYLSHEWFAGLGEKLWFDAAFSKAPRECSLGLICDCL GWKAWPWKVQAWSIGVWVAYAPTMLCKICLIININQRMALQWVLGLTDFKNEAVDPRGVK LQIFVVSVTPHKGSASRVVHSSRGLMMDQEIIPLAYLTGFDYLSHGVKLQTFAVSVTVHK GGASGVVRSSRWVRGLAGFRSEAADLRRVKLQTFVVSVTAHKGSADPKSEQQQDLL >gi568815592f:150834968_151035563|GENSCAN_predicted_CDS_2|711_bp atggaacttaaagtgacccagagcacgcagacagaagatgtgggaagaacagcagcagtg agaaggaaggactacctgagccatgagtggtttgctgggcttggggagaagctctggttt gatgcagctttttccaaagctcccagggagtgcagtctgggcctgatctgtgattgttta ggctggaaagcatggccctggaaggtgcaagcctggtcgataggagtatgggtagcatac gcaccaacaatgctctgcaaaatctgtcttattattaatatcaaccagcgtatggcactg cagtgggttcttggtctcactgacttcaagaatgaagctgtggaccctcgcggagtgaag ctgcagatcttcgtggtgagtgttacacctcataaaggcagcgcatccagagttgttcat tcctcccggggccttatgatggaccaagagattattcctttggcatacctcacaggcttt gactacctgtcacatggagtgaagctgcagaccttcgcagtgagtgttacagttcataaa ggcggcgcatccggagttgttcgttcctcccggtgggttcgtggtctcgctggcttcagg agtgaagctgcagaccttcgcagagtgaagctgcagaccttcgtggtgagtgttacagct cataaaggcagtgcggacccaaagagtgagcagcagcaagatttattgtga >gi568815592f:150834968_151035563|GENSCAN_predicted_peptide_3|1021_aa MCSQKRAVSNQGLMDLGLCRLCLVNNVFAGSMPAYQTKVPKAVTSRPAARRPPPSARHGR RVQCPLGAPGTRRKRQVLPAAAAAAAACSPGTRPAALGSRSSVSPENQPSRAMGTRLPLV LRQLRRPPQPPGPPRRLRVPCRASSGGGGGGGGGREGLLGQRRPQDGQARSSCSPGGRTP AARDSIVRSPAAVAPTQGRSARDRRGERGAGLRVFGANSLGGPGGVSGNAGLGTAAGSAG GERGIQYPTGPAAQAGDDNLMQEINQNLAEEIIDEILKINEDTRVHGLALQISENLFSNK VLNALKPEKDVDGVTDINLGKLVRGDAHECFVSPVAKAVIELLEKSGVNLDGKKILVVGA HGSLEAALQCLFQRKGSMTMSIQWKTRQLQSKLHEADIVVLGSPKPEEIPLTWIQPGTTV LNCSHDFLSGKVGCGSPRIHFGGLIEEDDVILLAAALRIQVLYFHLDTILGSFDIEISRG QTPKAVDVLAKEIGLLADEIEIYGKSKAKVRLSVLERLKDQADGKYVLVAGITPTPLGEG KSTVTIGLVQALTAHLNVNSFACLRQPSQGPTFGVKGGAAGGGYAQVIPMEEFNLHLTGD IHAITAANNLLAAAIDTRILHENTQTDKALYNRLVPLVNGVREFSEIQLARLKKLGINKT DPSTLTEEEVSKFARLDIDPSTITWQRVLDTNDRFLRKITIGQGNTEKGHYRQAQFDIAV ASEIMAVLALTDSLADMKARLGRMVVASDKSGQPVTADDLGVTGALTVLMKDAIKPNLMQ TLEGTPVFVHAGPFANIAHGNSSVLADKIALKLVGEEGFVEWFFPIVCTLSVTEAGFGAD IGMEKFFNIKCRASGLVPNVVVLVATVRALKMHGGGPSVTAGVPLKKEYTEENIQLVADG CCNLQKQIQITQLFGVPVVVALNVFKTDTRAEIDLVCELAKRAGAFDAVPCYHWSVGGKG SVDLARAVREAASKRSRFQFLYDVQVPIVDKIRTIAQAVYGAKDIELSPEAQAKIDRYTQ Q >gi568815592f:150834968_151035563|GENSCAN_predicted_CDS_3|3063_bp atgtgctcgcagaaacgtgctgtatcaaatcaaggtttaatggatttagggctgtgcagg ctgtgccttgttaacaatgtgtttgcaggcagtatgcctgcttaccaaaccaaagtgccc aaagccgtgacatcccggccggcggctcgcaggcccccgccctccgcacgtcacggccgc cgggtgcagtgccccctaggggcccctgggacgaggaggaagcgccaggtccttcccgcc gccgccgccgccgccgccgcctgctcccctggcacgcgccccgccgccctcggcagccgc agctccgtgtcccctgagaaccagccgtcccgcgccatgggcacgcgtctgccgctcgtc ctgcgccagctccgccgcccgccccagcccccgggccctccgcgccgcctccgtgtgccc tgtcgcgctagcagcggcggcggcggaggcggcggcggtggccgggagggcctgcttgga cagcggcggccgcaggatggccaggcccggagcagctgcagccccggcggccgaacgccc gcggcgcgggactccatcgtcaggtctccagcggctgtggccccgacccaggggcggagc gcccgggaccgccgtggggagcggggcgcggggctgcgagtgtttggtgccaactcatta ggggggccgggcggtgtgtcgggaaacgcgggcttgggcactgcggccgggagcgctggc ggagaacgggggatccagtacccgaccgggcccgcagcgcaggcaggtgacgacaacttg atgcaggaaatcaaccagaatttggctgaggagattatagatgaaatcttaaagatcaat gaagataccagagtacatggccttgcccttcagatctctgagaacttgtttagcaacaaa gtcctcaatgccttgaaaccagaaaaagatgtggatggagtaacagacataaacctgggg aagctggtgcgaggggatgcccatgaatgttttgtttcacctgttgccaaagctgtaatt gaacttcttgaaaaatcaggtgtcaacctagatggaaagaagattttggtagtgggggcc catgggtctttggaagctgctctacaatgcctgttccagagaaaagggtccatgacaatg agcatccagtggaaaacacgccagcttcaaagcaagcttcacgaggctgacattgtggtc ctaggctcacctaagccagaagagattccccttacttggatacaaccaggaactactgtt ctcaactgctcccatgacttcctgtcagggaaggttgggtgtggctctccaagaatacat tttggtggactcattgaggaagatgatgtgattctccttgctgcagctctgcgaattcag gtgttatacttccacctggacacaattctaggatcttttgacattgagatttcaagagga caaactccaaaagctgtggatgtccttgccaaggagattggattgcttgcagatgaaatt gaaatctatggcaaaagcaaagccaaagtacgtttgtccgtgctagaaaggttaaaggat caagcagatggaaaatacgtcttagttgctgggatcacacccacccctcttggagaaggg aagagcacagtcaccatcgggcttgtgcaggctctgaccgcacacctgaatgtcaactcc tttgcctgcttgaggcagccttcccaaggaccgacgtttggagtgaaaggaggagccgcg ggtggtggatatgcccaggtcatccccatggaggagttcaaccttcacttgactggagac atccacgccatcaccgctgccaataacttgctggctgccgccatcgacacgaggattctt catgaaaacacgcaaacagataaggctctgtataatcggctggttcctttagtgaatggt gtcagagaattttcagaaattcagcttgctcggctaaaaaaactgggaataaataagact gatccgagcacactgacagaagaggaagtgagtaaatttgcccgtctcgacatcgaccca tctaccatcacgtggcagagagtattggatacaaatgaccgatttctacgaaaaataacc atcgggcagggaaacacagagaagggccattaccggcaggcgcagtttgacatcgcagtg gccagcgagatcatggcggtgctggccctgacggacagcctcgcagacatgaaggcacgg ctgggaaggatggtggtggccagtgacaaaagcgggcagcctgtgacagcagatgatttg ggggtgacaggtgctttgacagttttgatgaaagatgcaataaaaccaaacctgatgcag accctggaagggacacctgtgttcgtgcatgcgggcccttttgctaacattgctcacggc aactcttcagtgttggctgataaaattgccctgaaactggttggtgaagaaggatttgta gaatggtttttccccatcgtgtgcacattgagcgtgaccgaagctggctttggtgctgac atcggaatggagaaattcttcaacatcaagtgccgagcttccggcttggtgcccaacgtg gttgtgttagtggcaacggtgcgagctctgaagatgcatggaggcgggccaagtgtaacg gctggtgttcctcttaagaaagaatatacagaggagaacatccagctggtggcagacggc tgctgtaacctccagaagcaaattcagatcactcagctctttggggttcccgttgtggtg gctctgaatgtcttcaagaccgacacccgcgctgagattgacttggtgtgtgagcttgca aagcgggctggtgcctttgatgcagtcccctgctatcactggtccgttggtggaaaagga tcggtggacttggctcgggctgtgagagaggctgcgagtaaaagaagccgattccagttc ctgtatgatgttcaggttccaattgtggacaagataaggaccattgctcaggctgtctat ggagccaaagatattgaactctctcctgaggcacaagccaaaatagatcgttacactcaa cag