GENSCAN 1.0    Date run: 4-Nov-116    Time: 08:44:38

Sequence gi568815578r:17924639_18157634 : 232996 bp : 44.90% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4334   4377   44  1  2   78   93    42 0.141   3.59
 1.02 Intr +  14581  14677   97  1  1   82   91    76 0.459   7.31
 1.03 Intr +  15402  15552  151  2  1   69  103    14 0.369   0.74
 1.04 Term +  18857  18984  128  0  2   72   42    73 0.371  -0.46
 1.05 PlyA +  19434  19439    6                               1.05

 2.12 PlyA -  19919  19914    6                              -0.45
 2.11 Term -  23007  22843  165  0  0   77   53   181 0.967  11.42
 2.10 Intr -  24338  24252   87  2  0   89   97   127 0.999  13.87
 2.09 Intr -  24465  24426   40  2  1   75   86     6 0.927  -2.67
 2.08 Intr -  25569  25494   76  1  1   92  106    55 0.987   6.27
 2.07 Intr -  25758  25653  106  0  1  109   33    68 0.984   3.19
 2.06 Intr -  26957  26862   96  2  0   87  115    48 0.994   7.61
 2.05 Intr -  28072  27949  124  0  1   72   67    55 0.993   2.39
 2.04 Intr -  29479  29358  122  1  2   69   94   221 0.998  20.09
 2.03 Intr -  30837  30727  111  0  0  113   68    51 0.985   6.18
 2.02 Intr -  32399  32295  105  2  0   19  115   110 0.963   7.21
 2.01 Init -  38206  38150   57  1  0   65  111   -37 0.303  -2.11
 2.00 Prom -  39789  39750   40                              -7.06

 3.00 Prom +  43423  43462   40                              -6.26
 3.01 Init +  43631  44302  672  1  0   73   63   236 0.608  14.79
 3.02 Intr +  45336  45492  157  2  1    8  100   124 0.596   5.18
 3.03 Intr +  51046  51265  220  2  1  131   89   107 0.739  12.46
 3.04 Intr +  63528  63660  133  0  1   69   71    91 0.972   6.15
 3.05 Term +  65301  65471  171  2  0   73   50   188 0.999  11.33
 3.06 PlyA +  66024  66029    6                               1.05

 4.00 Prom +  66055  66094   40                              -6.26
 4.01 Init +  87318  87571  254  2  2   67   21   509 0.727  38.91
 4.02 Term +  89310  89499  190  0  1   96   51    90 0.933   3.02
 4.03 PlyA +  90347  90352    6                               1.05

 5.06 PlyA -  93763  93758    6                               1.05
 5.05 Term - 100314  99998  317  1  2  101   55   490 0.616  42.10
 5.04 Intr - 117085 116896  190  1  1  114   99   325 0.491  35.36
 5.03 Intr - 118490 118398   93  2  0   62   66    48 0.235   0.16
 5.02 Intr - 132239 132019  221  0  2   68   79   411 0.152  36.22
 5.01 Init - 132996 132897  100  0  1  104   99   121 0.977  15.24
 5.00 Prom - 144465 144426   40                              -3.76

 6.00 Prom + 185714 185753   40                              -2.86
 6.01 Init + 201388 202514 1127  0  2   60   41   339 0.070  20.07
 6.02 Intr + 213179 213413  235  2  1   59   67   216 0.048  14.29
 6.03 Term + 217570 217719  150  0  0   88   38   119 0.324   4.81
 6.04 PlyA + 217838 217843    6                               1.05

 7.00 Prom + 217850 217889   40                             -13.15
 7.01 Init + 218023 218281  259  0  1   66   72   327 0.965  26.20
 7.02 Intr + 220595 220713  119  0  2  126   89    16 0.927   5.68
 7.03 Intr + 226183 226304  122  0  2  119   63    42 0.226   4.09

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  72836  72960  125  2  2   92   90    70 0.822   7.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:17924639_18157634|GENSCAN_predicted_peptide_1|139_aa
MRSLSTGSVAGVCQRQKADGARFPSKRKSPFTELSLGITTKKPHKEKQDSCGFPPDAVPP
AAKCLPAFPGLSTPHRSQSASKPIPPPQTTMNAPSHTGTSDWEQGWSMGGGEPHTSPMAC
ENASQKKTYQVHANNVQQH

>gi568815578r:17924639_18157634|GENSCAN_predicted_CDS_1|420_bp
atgaggagcctatccacgggatcagttgctggtgtttgccaaaggcagaaagcggatggt
gcaaggtttccttccaaacggaaatcgcccttcacagagctgagcctgggaattaccaca
aagaaacctcacaaagaaaagcaagatagctgtggattcccacctgatgctgttcctcct
gctgccaagtgcctgcctgcctttccaggtctcagcacaccacacaggtcgcagtcagca
tctaaacccatccccccaccccagaccactatgaatgcaccctcacatacaggtaccagt
gactgggaacagggctggagtatgggtggtggtgagccacatacttcaccaatggcttgt
gaaaatgcttctcaaaagaagacataccaggttcatgcaaacaacgttcagcagcactag

>gi568815578r:17924639_18157634|GENSCAN_predicted_peptide_2|362_aa
MPCSLQCSTVNCLCVGRLPLRSVSVDLNVDPSLQIDIPDALSERDKVKFTVHTKTTLPTF
QSPEFSVTRQHEDFVWLHDTLIETTDYAGLIIPPAPTKPDFDGPREKMQKLGEGEGSMTK
EEFAKMKQELEAEYLAVFKKTVSSHEVFLQRLSSHPVLSKDRNFHVFLEYDQDLSVRRKN
TKEMFGGFFKSVVKSADEVLFTGVKEVDDFFEQEKNFLINYYNRIKDSCVKADKMTRSHK
NVADDYIHTAACLHSLALEEPTVIKKYLLKVAELFEKLRKVEGRVSSDEDLKLTELLRYY
MLNIEAAKDLLYRRTKALIDYENSNKALDKARLKSKDVKLAEAHQQECCQKFEQLSESAK
EG

>gi568815578r:17924639_18157634|GENSCAN_predicted_CDS_2|1089_bp
atgccctgcagtctgcagtgttctactgtgaactgcttgtgtgttggcaggctaccgctg
agatctgtatctgtggacctgaatgttgatccctcgcttcagattgacatacctgatgcg
ctcagtgagagagacaaagtcaaatttacagtgcacacaaagaccacactgcccacgttt
cagagcccagagttttctgttacaaggcaacatgaagactttgtgtggctacatgacact
cttattgaaacaacagactatgctgggcttattattccacctgctcctacgaagcccgac
tttgatggtcctcgagagaagatgcagaaactgggagaaggtgaagggtctatgaccaaa
gaagaatttgccaagatgaaacaagaactggaagctgagtatctcgctgtgtttaagaag
actgtgtcctcccatgaagtctttcttcagcggctttcttctcaccctgttctcagtaaa
gatcgcaactttcatgttttcctggaatatgatcaggatctaagtgttaggcggaaaaat
actaaagagatgtttggtggcttcttcaaaagtgtggtgaaaagtgctgatgaagtcctt
tttactggagttaaggaggtagatgacttctttgagcaagagaagaacttccttattaac
tattacaataggatcaaagattcttgtgtgaaagctgacaaaatgaccagatctcataaa
aatgttgccgatgactatatccacaccgcagcctgcttacatagcctggctttagaagag
cccacagtcatcaaaaagtacctattgaaggttgctgagctatttgaaaaactaaggaaa
gtagagggtcgagtttcatcagatgaagatttgaagctaacagagctcctccgatactac
atgctcaacattgaagctgctaaggatctcttatacagacgcaccaaagccctcattgac
tatgagaactcaaacaaagctctggataaggcccggttaaagagcaaagacgtcaagttg
gctgaggcacaccagcaggagtgctgccagaaatttgaacaactttccgaatctgcaaaa
gaaggttga

>gi568815578r:17924639_18157634|GENSCAN_predicted_peptide_3|450_aa
MAASSHGLWTRKGKNAATPELAGPPAQESEAGRTSPCCGPPPAAAATREPRPWRRGTRAG
AAWLCEERRSWAAAAAAWAPLGGGHGPASAGLPARRRQEASGLRHHPSCPGSRRAGRHVL
PQSSLPVPAAVHLGAGQRRHVGPTLAPLHEEAANRRAATRTGSNGHSPSKTRLEKDRVSV
RGEFITQPESKMAASADVTRSRESRRARFGGSRASETPALPLGEKKVNPYEEVDQEKYSN
LVQSVLSSRGVAQTPGSVEEDALLCGPVSKHKLPNQDVFLQGKRFHEALESILSPQETLK
ERDENLLKSGYIESVQHILKDVSGVRALESAVQHETLNYIGLLDCVAEYQGKLCVIDWKT
SEKPKPFIQSTFDNPLQVVAYMGAMNHDTNYSFQVQCGLIVVAYKDGSPAHPHFMDAELC
SQYWTKWLLRLEEYTEKKKNQNIQKPEYSE

>gi568815578r:17924639_18157634|GENSCAN_predicted_CDS_3|1353_bp
atggcggcgagcagccacggcctatggacgcgcaagggaaagaatgcagcgaccccggag
ctcgcagggccgcccgcccaggagtctgaggctgggaggacctcaccttgctgcggtcct
cctcctgctgctgcagcaactcgggaaccgcggccatggcgacgcgggactcgagcaggg
gccgcctggctgtgcgaggaaagaagaagctgggccgccgccgccgccgcctgggcgcct
ctcgggggcggccacggccccgcctccgccggcctccctgcccgacggcggcaggaggcc
tccggactccgccaccatcccagctgccccgggagcaggcgagcagggcgccacgtgctc
ccccagagcagcctcccagtccccgctgccgtccatcttggagccgggcaaagacgccac
gtggggcctacccttgctccgctccacgaggaggccgccaaccgcagggccgcgacacgg
acgggaagcaacggacactctcccagcaagacgcgtctagagaaagaccgcgtttcggtg
cggggggaatttattactcagcccgagtccaagatggcagcgagcgctgacgtcaccaga
tctcgtgagagcagaagggcgcgatttggaggctcccgcgcttcggagacgccggccctt
ccgctcggagagaaaaaagtgaacccatatgaagaagtggaccaagaaaaatactctaat
ttagttcagtctgtcttgtcatccagaggcgtcgcccagaccccgggatcggtggaggaa
gatgctttgctctgtggacccgtgagcaagcataagctgccaaaccaagacgtcttttta
caagggaaacggttccacgaagccttggaaagcatactttcaccccaggaaaccttaaaa
gagagagatgaaaatctcctcaagtctggttacattgaaagtgtccagcatattctgaaa
gatgtcagtggagtgcgagctcttgaaagtgctgttcaacatgaaaccttaaactatata
ggtctgctggactgtgtggctgagtatcagggcaagctctgtgtgattgattggaagaca
tcagagaaaccaaagccttttattcaaagtacatttgacaacccactgcaagttgtggca
tacatgggtgccatgaaccatgataccaactacagctttcaggttcaatgtggcttaatt
gtggtggcctacaaagatggatcacctgcccacccacatttcatggatgcagagctctgt
tcccagtactggaccaagtggcttcttcgactagaagaatatacggaaaagaaaaagaac
cagaatattcagaaaccagaatattcagaatag

>gi568815578r:17924639_18157634|GENSCAN_predicted_peptide_4|147_aa
MSDAAVDTSSEITTKDLKKKEAVEEAENGRDTPANGKANEENGEQEADNEVDEEEEEGGE
EDEEEEEGDGEEEDGDEDEEAESARTFSFGTYHVLGAQTSPHREETIRRGPCGEELRPLA
NHQHELPGSRENGPSDEAQAPAFESFS

>gi568815578r:17924639_18157634|GENSCAN_predicted_CDS_4|444_bp
atgtcagacgcagccgtagacaccagctccgaaatcaccaccaaggacttaaagaagaag
gaagctgtggaggaagcggaaaatggaagagacacccctgctaatgggaaggctaatgag
gaaaatggggagcaggaagctgacaatgaagtagatgaagaagaggaagaaggtggggag
gaagacgaggaggaagaagaaggcgatggtgaggaagaggatggtgatgaagacgaggaa
gctgagtccgctaggaccttctcctttggaacctaccacgtccttggagcccaaaccagc
ccacacagagaggagaccataaggagaggtccatgtggagaggaattgaggcccctagcc
aaccaccagcatgaactaccaggttcacgagagaatgggccttcagatgaagcccaggcc
ccagcctttgagtctttcagctga

>gi568815578r:17924639_18157634|GENSCAN_predicted_peptide_5|306_aa
MPKVFLVKRRSLGVSVRSWDELPDEKRADTYIPVGLGRLLHDPPEDCRSDGGSSSGSGSS
SAGEPGGAESSSSPHAPESETPEPGDAEGPDGHLATKQRPVARSKIKVTVGALVSRKEWR
RKKAPFAPLQGDFYFIGQFTTGTCSDSVVHSCDLCGKGFRLQRMLNRHLKCHNQVKRHLC
TFCGKGFNDTFDLKRHVRTHTGIRPYKCNVCNKAFTQRCSLESHLKKIHGVQQQYAYKQR
RDKLYVCEDCGYTGPTQEDLYLHVNSAHPGSSFLKKTSKKLAALLQGKLTSAHQENTSLS
EEEERK

>gi568815578r:17924639_18157634|GENSCAN_predicted_CDS_5|921_bp
atgcccaaagtcttcctggtgaagaggaggagcctgggggtctcggtccgcagctgggat
gagctcccggatgagaaaagggcagacacctacatcccagtgggcctaggccgcctgctc
cacgacccccccgaggactgccgcagcgacggcggcagcagcagcggcagcggcagcagc
agcgcgggggagcctggaggagcagagagcagctcgtccccgcacgcccccgagagcgaa
acccccgagcccggcgacgccgagggccccgatggacacctggcgaccaagcagcgcccg
gtcgccagatcgaaaatcaaggtgactgttggtgctctagtcagcaggaaggagtggagg
aggaagaaggcaccgtttgctccccttcaaggagatttctatttcattggccagttcacc
acaggcacgtgcagcgactcggtggttcacagctgtgacctgtgtggcaagggcttccgt
ctgcagcgcatgctgaaccgtcacctcaagtgccacaaccaggtgaaaagacacctgtgc
accttctgcggcaagggcttcaacgacaccttcgacctgaagaggcacgtccgcacacac
acaggcattcgtccctacaaatgcaacgtctgcaataaagccttcacccagcgctgctct
ctggagtcccacctgaagaaaatccatggggtgcagcagcagtatgcctataagcagcgg
cgggacaagctctacgtctgcgaggattgcggctacacgggccccacccaggaggacctg
tacctgcacgtgaacagtgcccatccgggcagctcgtttctcaaaaagacatctaaaaaa
ctggcagcccttctgcagggcaagctgacatccgcacaccaggagaataccagcctgagt
gaggaggaggagaggaagtga

>gi568815578r:17924639_18157634|GENSCAN_predicted_peptide_6|503_aa
MSELPFTTASKRIKYLGIQLTMDVKDLFKENYKLLLSEIKEFTNKWNNIPCSWIGRLNIV
RMAILPQVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRTHIATSILSQKSKAGSITLP
DFKLYYKAAVTKTAWYWYQNRDIDQWNRIEPSEIIPHIYNHLIFDKPDKNKKWGKDFVFN
KWCWENWLAVCRKLKLDPFLTPYAKINSRWFKDLNVRLKTIKTLEENLGNTIQAIGMGKD
FMTKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFTIYPSDKGLISRI
YKELKQIYKKKSNNPIKKWAKDMNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTTMRYH
LTPVRMAIIKKSGNNSGERRTRRLGLCARGVEPGQYRRRCALCGGLCASGGREREAAAAS
VGMSRSSKVVLGLSVLLTAATVAGVHVKQQWDQQRLRDGVIRDIERQIRKKENIRLLGEQ
IILTEQLEAEREKMLLAKGSQKS

>gi568815578r:17924639_18157634|GENSCAN_predicted_CDS_6|1512_bp
atgagtgaactcccatttacaactgcttcaaagagaataaaatacctaggaatccagctt
acaatggatgtgaaggacctcttcaaggagaactacaaattactgctcagtgaaataaaa
gagttcacaaacaaatggaataacattccatgctcatggataggaagactcaatattgtg
agaatggccatactgccccaggtaatttataggttcaatgccatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
acccacattgccacatcaatcctaagccaaaagagcaaagctggaagcatcacactacct
gacttcaaactatactacaaggctgcagtaaccaaaacagcatggtactggtaccaaaac
agagatatagaccaatggaacagaatagagccctcggaaataataccacacatctacaac
catctgatctttgacaaacctgacaaaaacaagaaatggggaaaggatttcgtatttaat
aaatggtgctgggaaaactggctagccgtatgtagaaagctgaaactggatcccttcctt
acaccttatgcaaaaattaattcaagatggtttaaagacttaaatgttagacttaaaacc
ataaaaactctagaagaaaacctaggcaataccattcaggccataggcatgggcaaggac
ttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct
acagaatgggagaaaatttttacaatctacccatctgacaaagggttaatatccagaatc
tacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatcaaaaagtgggca
aaggatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaa
aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagatatcat
ctcacaccagttagaatggcaatcattaaaaagtcaggaaataacagcggcgagcggcgc
acgcgacggctggggctctgcgctcgaggggtcgagcctgggcagtacaggcggcggtgc
gcactctgcggcggcctctgcgcctcgggcgggcgggagagagaggccgcggccgccagc
gtggggatgtctaggagctcgaaggtggtgctgggcctctcggtgctgctgacggcggcc
acagtggccggcgtacatgtgaagcagcagtgggaccagcagaggcttcgtgacggagtt
atcagagacattgagaggcaaattcggaaaaaagaaaacattcgtcttttgggagaacag
attattttgactgagcaacttgaagcagaaagagagaagatgttattggcaaaaggatct
caaaaatcatga

>gi568815578r:17924639_18157634|GENSCAN_predicted_peptide_7|167_aa
MDSSIHLSSLISRHDDEATRTSTSEGLEEGEVEGETLLIVESEDQASVDLSHDQSGDSLN
SDEGDVSWMEEQLSYFCDKCQKWIPASQLREQLSYLKGDNFFRFTCSDCSADGKEQYERL
KLTWQQVVMLAMYNLSLEGSGRQGYFRWKEDICAFIEKHWTFLLGNS

>gi568815578r:17924639_18157634|GENSCAN_predicted_CDS_7|501_bp
atggatagtagcatccacctgagtagtctgatcagtcggcatgatgacgaagccacgaga
acatcgacctcagaaggactggaggaaggtgaagtggagggagagacgctcctgatcgtc
gaatccgaggatcaggcatcagtggacttatcgcacgaccagagtggggattccctcaac
agtgatgaaggagacgtgtcttggatggaggagcagctgtcctacttctgtgacaagtgc
caaaaatggataccagccagtcagctgagggaacagctcagttaccttaagggtgataat
ttttttaggtttacttgttcggattgctcagcagatggcaaggagcagtatgaaaggctg
aagctgacatggcagcaagtcgtcatgttggcaatgtacaacttgtctctggaaggaagt
ggacgtcaaggttatttcaggtggaaagaagatatctgtgcttttattgagaaacattgg
acttttttactagggaatagn