GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:39:33 Sequence gi568815585r:29414483_29636188 : 221706 bp : 44.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14361 14414 54 2 0 90 32 62 0.269 2.08 1.02 Intr + 25501 25567 67 0 1 85 111 70 0.398 7.48 1.03 Term + 35501 35631 131 0 2 32 47 121 0.153 0.74 1.04 PlyA + 36208 36213 6 1.05 2.00 Prom + 48176 48215 40 -3.86 2.01 Sngl + 49545 49808 264 2 0 22 37 195 0.540 3.16 2.02 PlyA + 52092 52097 6 1.05 3.04 PlyA - 52535 52530 6 1.05 3.03 Term - 53045 52929 117 2 0 75 38 54 0.315 -2.36 3.02 Intr - 57483 57346 138 0 0 87 105 71 0.831 9.36 3.01 Init - 58053 57985 69 0 0 65 59 89 0.930 4.75 3.00 Prom - 59151 59112 40 -3.26 4.00 Prom + 60804 60843 40 -3.86 4.01 Init + 65598 65882 285 2 0 94 82 354 0.300 32.48 4.02 Intr + 73418 73523 106 1 1 92 65 156 0.430 13.49 4.03 Intr + 78164 78237 74 0 2 89 92 70 0.744 6.63 4.04 Intr + 82720 82854 135 0 0 49 3 199 0.523 8.26 4.05 Intr + 83936 84055 120 1 0 88 105 107 0.971 13.09 4.06 Intr + 86615 86712 98 1 2 81 96 98 0.998 8.71 4.07 Intr + 91156 91337 182 1 2 105 23 106 0.476 5.31 4.08 Term + 93042 93223 182 1 2 76 49 141 0.133 6.77 4.09 PlyA + 94697 94702 6 1.05 5.16 PlyA - 94951 94946 6 1.05 5.15 Term - 100101 99998 104 1 2 101 48 211 0.982 16.84 5.14 Intr - 100682 100250 433 2 1 51 37 312 0.721 15.62 5.13 Intr - 100997 100912 86 0 2 45 63 26 0.794 -4.66 5.12 Intr - 101764 101656 109 1 1 81 77 170 0.966 15.06 5.11 Intr - 102828 102662 167 1 2 111 105 159 0.999 19.58 5.10 Intr - 103308 103091 218 2 2 69 82 149 0.635 10.55 5.09 Intr - 105067 104965 103 2 1 82 92 98 0.933 8.83 5.08 Intr - 107974 107835 140 0 2 77 105 57 0.987 6.41 5.07 Intr - 109006 108784 223 2 1 125 94 403 0.999 41.79 5.06 Intr - 109771 109650 122 0 2 81 91 186 0.968 18.34 5.05 Intr - 116230 116056 175 2 1 83 89 134 0.996 12.00 5.04 Intr - 118500 118342 159 1 0 99 103 161 0.999 18.66 5.03 Intr - 121720 121337 384 2 0 52 86 569 0.012 47.92 5.02 Intr - 125389 125274 116 0 2 44 93 67 0.013 2.89 5.01 Init - 135798 135722 77 0 2 101 90 31 0.360 3.46 5.00 Prom - 151148 151109 40 -5.46 6.00 Prom + 151953 151992 40 -5.46 6.01 Init + 155680 155774 95 0 2 29 22 164 0.065 1.75 6.02 Intr + 162830 162916 87 2 0 87 81 40 0.223 2.29 6.03 Intr + 175680 175752 73 0 1 104 59 59 0.084 4.01 6.04 Intr + 177734 177934 201 1 0 39 99 45 0.081 0.18 6.05 Intr + 190017 190109 93 2 0 106 85 9 0.697 2.56 6.06 Term + 190341 190403 63 2 0 120 39 75 0.811 3.69 6.07 PlyA + 194919 194924 6 1.05 7.02 PlyA - 195667 195662 6 1.05 7.01 Term - 221215 220998 218 2 2 49 48 167 0.962 6.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 121706 121337 370 2 1 101 86 555 0.987 53.76 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:29414483_29636188|GENSCAN_predicted_peptide_1|83_aa MGHCCCKPYNCLQCLDKTNESALVKEKELSIELANIRDEVGSLQPLSYDRKVEDHDAPRP RCSVTLGVTVITSIFQVAGWKKE >gi568815585r:29414483_29636188|GENSCAN_predicted_CDS_1|252_bp atgggccattgctgctgcaagccttataactgccttcagtgcctggacaagacgaatgaa agtgcccttgtgaaagaaaaagagctgtcaatcgaacttgcaaacatcagggatgaagtt ggctccttacagccactgtcctatgaccgcaaggtggaggaccatgatgccccccgacca cggtgcagtgtgacacttggtgtgacagtcatcacatctatattccaggtggcaggctgg aagaaggaatga >gi568815585r:29414483_29636188|GENSCAN_predicted_peptide_2|87_aa MLAGGDGVVEPFLQTLEVLAAGCGQDLKVVVLSKPFLQMWKVPARRLGTGVERTAVSRCL RAERDHSVTLRDVEDEKLKAEAWKQIQ >gi568815585r:29414483_29636188|GENSCAN_predicted_CDS_2|264_bp atgttggctggtggagatggcgtggtagagccattcctgcagactctggaggtcctggct gctggctgtggacaggatctgaaggttgtggtgttgagcaagccattcttgcagatgtgg aaggtgcctgcaaggaggctgggaacaggtgtggagaggacggctgtgagcaggtgcctg agagcagaaagggatcactctgtgactctaagagatgtggaggatgagaagctgaaggcg gaagcctggaagcaaattcagtaa >gi568815585r:29414483_29636188|GENSCAN_predicted_peptide_3|107_aa MPPGGKCLCPNPLGHRNIRKVVRVPAASLGPGDQKGAGAVKITGGICECFLGAKHLPHTV THSKPTRGPVNLGLQSFQISTAPVHIIHYMGAIQKARAVERPDNSSL >gi568815585r:29414483_29636188|GENSCAN_predicted_CDS_3|324_bp atgccccctggtggcaaatgtctttgtccaaacccgttaggtcacaggaacatccgcaag gtggtcagggtccctgcagcatctctcgggcctggtgaccaaaagggagctggcgctgtg aaaatcacaggtggcatctgcgagtgcttcctgggtgccaagcacctgcctcataccgtt acccacagtaagcctactaggggtcccgttaatttgggcttgcagagttttcagatctcc acggcaccggttcatataatacactacatgggggccatccagaaggcccgggcagtggaa aggccagataatagctccctgtga >gi568815585r:29414483_29636188|GENSCAN_predicted_peptide_4|393_aa MEAESFPCLLRPFFKERSCALRPAFHTAKCEKLQKEKEELERRFEDEVKRLGWQQQAELQ ELEERLQLQFEAEMARLQEEHGDQLLSIRCQHQEQVEDLTASHDAALLEMENNHTVAITI LQDDHDHKVQELMSTHELEKKELEENFEKLRLSLQWPQLDSLYHYLQDQVDTLTFQSQSL RDRARRFEEALRKNTEEQLEIALAPYQHLEEDMKSLKQVLEMKNQQIHEQEKKILELEKL AEKNIILEEKIQVLQQQNEDLKARIDQNTVVTRLERRFHVRMLQPLHLPQTDPPDTTCLF VAFGRCVGGLPPDRRPCPPHAGTRFWLDVRMDGGTHTTDTKRAPATQLLTVHSGFSHDCT AKFTRQALGFVPPPALWEQRFRTSWHSSSGALF >gi568815585r:29414483_29636188|GENSCAN_predicted_CDS_4|1182_bp atggaggctgagtccttcccatgcctcctacggccattttttaaagaaagatcttgtgct ttgcgcccagccttccatacagcaaagtgcgagaaactacaaaaggagaaggaggagctg gagaggcggttcgaggacgaggtgaagaggctgggctggcagcagcaggccgagctccag gagctggaggagcggctgcagctgcaattcgaggcggaaatggcgcgcctgcaggaggag cacggtgaccagctgctgagcatccggtgtcaacaccaggagcaggtggaagatctcacc gccagccatgatgctgctctcctagagatggaaaataaccacacagttgccatcacaatc ctgcaggatgaccacgaccacaaagtccaagaattgatgtccactcatgagcttgaaaag aaagaattggaagaaaattttgaaaaactgcggctgtcattgcagtggccccagctggac tccttgtatcattacttgcaggaccaggtggacacgctgaccttccagagccagtctctg cgggacagagcccgccgcttcgaagaggccttgaggaagaacacagaggagcagctggag attgcattggctccttatcagcacttggaagaagacatgaagagtctgaagcaggtatta gaaatgaagaatcagcaaatacacgagcaagaaaagaagattcttgagctggaaaagctg gcagaaaagaacattatcctagaagaaaagatccaggttctccaacagcagaacgaagac ctcaaagcaaggattgaccaaaacacagttgtcaccaggctggagcgacgtttccacgtt cggatgctgcagcctctgcacctgcctcagacagacccaccagataccacctgtctcttc gtggcatttgggagatgcgtgggcggcctgccccccgaccgccgcccctgcccaccacat gctgggacaaggttctggctggatgtcagaatggatgggggcactcacaccacagacacg aagagagcacctgccacgcagctgctgactgtgcatagcggcttctcacatgactgcacc gccaaattcacgcggcaggcactgggcttcgtccctcctcctgcactgtgggagcagcgg ttcagaaccagctggcactcatcatcaggtgcattgttttaa >gi568815585r:29414483_29636188|GENSCAN_predicted_peptide_5|871_aa MDALSRHWPCWLVGGSGVPSPTLGARLSALGFSAAVGCIFVPVSSKGIQFALSLDSKQYL RSNQALNSNMGCKVLLNIGQQMLRRKVVDCSREETRLSRCLNTFDLVALGVGSTLGAGVY VLAGAVARENAGPAIVISFLIAALASVLAGLCYGEFGARVPKTGSAYLYSYVTVGELWAF ITGWNLILSYIIGTSSVARAWSATFDELIGRPIGEFSRTHMTLNAPGVLAENPDIFAVII ILILTGLLTLGVKESAMVNKIFTCINVLVLGFIMVSGFVKGSVKNWQLTEEDFGNTSGRL CLNNDTKEGKPGVGGFMPFGFSGVLSGAATCFYAFVGFDCIATTGEEVKNPQKAIPVGIV ASLLICFIAYFGVSAALTLMMPYFCLDNNSPLPDAFKHVGWEGAKYAVAVGSLCALSASL LGSMFPMPRVIYAMAEDGLLFKFLANVNDRTKTPIIATLASGAVAAVMAFLFDLKDLVDL MSIGTLLAYSLVAACVLVLRYQPEQPNLVYQMASTSDELDPADQNELASTNDSQLGFLPE AEMFSLKTILSPKNMEPSKISGLIVNISTSLIAVLIITFCIVTVLGREALTKGALWAVFL LAGSALLCAVVTGVIWRQPESKTKLSFKVPFLPVLPILSIFVNVYLMMQLDQGTWVRFAV WMLIDGCCLISQGPRGVLFLQEILCGPLSAGSRPHKVAPAPGQVVRSIGPGPTLESRVQE QAVDSYVKSIPGAQCLLQSDAQLCGEMGSGTVQAESLETSSTCGKPHLHSAHSRYSRGLR GHQGAKLKDPMASGGPGPAALSAQETPQHKAVQRNVGLCQMSKQTVAFVWQLLQMWKGFI IYFGYGLWHSEEASLDADQARTPDGNLDQCK >gi568815585r:29414483_29636188|GENSCAN_predicted_CDS_5|2616_bp atggatgccctgagccgtcactggccctgctggctggtgggtggctctggagtgcccagt ccgactttaggagcgagattatcagccctgggcttctcagcagcagttggctgcatcttt gttccagtgtcttccaaaggaattcagtttgccctctccctagatagcaagcaatactta agaagtaaccaagctctgaacagcaacatggggtgcaaagtcctgctcaacattgggcag cagatgctgcggcggaaggtggtggactgtagccgggaggagacgcggctgtctcgctgc ctgaacacttttgatctggtggccctcggggtgggcagcacactgggtgctggtgtctac gtcctggctggagctgtggcccgtgagaatgcaggccctgccattgtcatctccttcctg atcgctgcgctggcctcagtgctggctggcctgtgctatggcgagtttggtgctcgggtc cccaagacgggctcagcttacctctacagctatgtcaccgttggagagctctgggccttc atcaccggctggaacttaatcctctcctacatcatcggtacttcaagcgtagcgagggcc tggagcgccaccttcgacgagctgataggcagacccatcggggagttctcacggacacac atgactctgaacgcccccggcgtgctggctgaaaaccccgacatattcgcagtgatcata attctcatcttgacaggacttttaactcttggtgtgaaagagtcggccatggtcaacaaa atattcacttgtattaacgtcctggtcctgggcttcataatggtgtcaggatttgtgaaa ggatcggttaaaaactggcagctcacggaggaggattttgggaacacatcaggccgtctc tgtttgaacaatgacacaaaagaagggaagcccggtgttggtggattcatgcccttcggg ttctctggtgtcctgtcgggggcagcgacttgcttctatgccttcgtgggctttgactgc atcgccaccacaggtgaagaggtgaagaacccacagaaggccatccccgtggggatcgtg gcgtccctcttgatctgcttcatcgcctactttggggtgtcggctgccctcacgctcatg atgccctacttctgcctggacaataacagccccctgcccgacgcctttaagcacgtgggc tgggaaggtgccaagtacgcagtggccgtgggctccctctgcgctctttccgccagtctt ctaggttccatgtttcccatgcctcgggttatctatgccatggctgaggatggactgcta tttaaattcttagccaacgtcaatgataggaccaaaacaccaataatcgccacattagcc tcgggtgccgttgctgctgtgatggccttcctctttgacctgaaggacttggtggacctc atgtccattggcactctcctggcttactcgttggtggctgcctgtgtgttggtcttacgg taccagccagagcagcctaacctggtataccagatggccagtacttccgacgagttagat ccagcagaccaaaatgaattggcaagcaccaatgattcccagctgggctttttaccagag gcagagatgttctctttgaaaaccatactctcacccaaaaacatggagccttccaaaatc tctgggctaattgtgaacatttcaaccagcctcatagctgttctcatcatcaccttctgc attgtgaccgtgcttggaagggaggctctcaccaaaggggcgctgtgggcagtctttctg ctcgcagggtctgccctcctctgtgccgtggtcacgggcgtcatctggaggcagcccgag agcaagaccaagctctcatttaaggttcccttcctgccagtgctccccatcctgagcatc ttcgtgaacgtctatctcatgatgcagctggaccagggcacctgggtccggtttgctgtg tggatgctgatagatggctgctgcctcatctcccagggccctcgaggggtgcttttcctc caggaaatactttgcggcccactgtctgctggcagccggccacacaaggtggccccagcc cctgggcaggtggtaagaagcataggtcctggacccaccctggagtcccgtgtgcaggaa caggctgtggactcctatgtgaagtccattcctggggctcagtgtctcctgcagtctgac gctcagttatgtggagaaatgggcagtgggactgtccaggctgagagcctggagacctcc agcacctgcgggaaaccccatctccatagtgcacattctcgatactctcgtggcctccgg gggcatcagggggctaagctgaaggacccgatggcctctggtggccctggcccagctgct ctgtctgctcaggaaacaccgcagcataaggccgtccagaggaacgtagggctttgccag atgtcgaagcagacagtggccttcgtgtggcagctgctccagatgtggaaaggcttcatc atctactttggctatggcctgtggcacagcgaggaggcgtccctggatgccgaccaagca aggactcctgacggcaacttggaccagtgcaagtga >gi568815585r:29414483_29636188|GENSCAN_predicted_peptide_6|203_aa MAALLCAVVPLLNLLCPELSMEEEATACHVPGSGLARARRQIQQAPTVGAKYTSQLQVPR WWVSPTTPVLYGDILSYRPKHGMSELSSVFTKCNTCHSPLFLTITRGKKPNESHWVWQEG QTTDQPESGPKPRNIFSEAQVPGMTKSRPVGQERAQAWHPTVDKIFRPLALGKKVTRVTS SEKPTQCENEDDDLDDNPLSLND >gi568815585r:29414483_29636188|GENSCAN_predicted_CDS_6|612_bp atggctgctctgctatgtgcagtggtccctctgctgaacctcctctgccccgaactcagc atggaggaggaggccacggcctgccacgtgcctgggtcagggttagccagagccaggcgc cagattcagcaggctcccacagttggagccaaatacacgagtcagctccaagtacctcgc tggtgggtgtcgcccaccacccctgtgctgtatggagacatcctatcctacagacccaag cacgggatgagtgagttatccagtgtgttcaccaaatgcaatacctgccattccccactg ttcctgacaataactagaggaaaaaagcccaatgaaagccactgggtgtggcaggagggg caaaccacagaccagccagaatctggtccaaagcccagaaacatcttttctgaagcccaa gttcctgggatgaccaaatctcgaccagtgggccaggaaagggctcaggcctggcatcca actgttgacaagattttcaggccattggcactaggtaagaaggtaaccagggtaacaagc tctgaaaagcctactcaatgtgaaaatgaggatgatgaccttgatgataatccactttca cttaatgactag >gi568815585r:29414483_29636188|GENSCAN_predicted_peptide_7|72_aa XREQNWVKNEFAKLTEVGFKKWIITNSPELKEHILTQCKEAKNLEKSLEELLTRITSLEK NVNDLMELKNTA >gi568815585r:29414483_29636188|GENSCAN_predicted_CDS_7|219_bp ncaagggaacaaaactgggtgaagaatgagtttgccaaactgacagaagtgggcttcaaa aagtggataataacaaactcccccgagctaaaggagcacattttaacccaatgcaaggaa gctaagaaccttgaaaaaagtttagaggaattgctaactagaataaccagtctagaaaag aatgtaaatgacctgatggagctgaaaaacacagcataa