GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:28:25

Sequence gi568815593r:81909599_82110093 : 200495 bp : 38.89% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5864   5918   55  1  1   47  101    26 0.139   1.30
 1.02 Intr +  13548  13624   77  1  2   75   83    63 0.030   2.82
 1.03 Intr +  44338  44638  301  0  1   65   17   240 0.768   9.98
 1.04 Intr +  46660  46781  122  2  2   80   92    28 0.835   1.69
 1.05 Term +  46948  47769  822  0  0   51   43   259 0.836   9.80
 1.06 PlyA +  49575  49580    6                               1.05

 2.00 Prom +  69253  69292   40                              -4.55
 2.01 Init +  77973  78080  108  2  0   57  116    43 0.313   4.27
 2.02 Term +  87406  87783  378  0  0   23   51   217 0.128   5.30
 2.03 PlyA +  88041  88046    6                               1.05

 3.02 PlyA -  96472  96467    6                               1.05
 3.01 Sngl - 100495  99998  498  1  0   76   38   476 0.999  37.19
 3.00 Prom - 101281 101242   40                              -6.25

 4.02 PlyA - 101323 101318    6                               1.05
 4.01 Sngl - 105989 105441  549  2  0   59   54   233 0.920  12.86
 4.00 Prom - 107420 107381   40                              -5.45

 5.00 Prom + 108329 108368   40                              -9.75
 5.01 Init + 110364 110672  309  2  0   65   19   242 0.179  10.66
 5.02 Term + 118238 118339  102  1  0   89   47   100 0.392   3.30
 5.03 PlyA + 118419 118424    6                               1.05

 6.00 Prom + 119939 119978   40                              -9.05
 6.01 Init + 120526 120696  171  0  0   68   23   184 0.732   9.39
 6.02 Term + 122228 122350  123  1  0   93   43    73 0.649   0.70
 6.03 PlyA + 123597 123602    6                               1.05

 7.03 PlyA - 125074 125069    6                               1.05
 7.02 Term - 133690 133071  620  2  2   47   39   319 0.144  16.51
 7.01 Init - 138872 138704  169  2  1   21   86   122 0.302   5.04
 7.00 Prom - 141995 141956   40                              -4.75

 8.00 Prom + 149696 149735   40                              -2.55
 8.01 Init + 174268 174627  360  0  0   79   31   463 0.681  36.82
 8.02 Term + 174664 175308  645  0  0  -82   49   303 0.566   2.53
 8.03 PlyA + 175520 175525    6                               1.05

 9.02 PlyA - 176215 176210    6                               1.05
 9.01 Sngl - 191066 190806  261  2  0   40   49   189 0.752   5.20

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_1|458_aa
MNSSATIEKVERSVRKEDGPYSAVSCLGQFLVKLGTWMDARQLRRQRRHVLSVDPKPQRW
SRTREGRDASLIIHPGFRGVRPRRDACLGPSPLAASPAFLGEGQVLQPLLSVSLPLLRLS
GGQETPNPFSFTLSGKSRFSGEGATCFERIKACYHSPATAWPFKAYKLSLQFPHFTCPKT
RQDWQPLFTFTWTDPDTHQAQQLTWAVLPQDFKDSPHYFSQVLSQVSPSKAQISSPSVTY
LGIILHKNTRALPVDLVRLISQTPTPSTKQQLLSFLGIVRYFRLWIPGFTILTKPLYKLT
KGNLADPVDPKYFPHSSFRSLKTALETAPTLALPDSSQPFSLHTAKVQGCTVRILIQGPG
SRPVAFLSKQLDLTVLGWPPCLRAAAAAALILSEALKITNYAQLTLYISHNFQNLFSSSY
LMHILSAPWLLQLNSLFVKSHNYHCSWPGLQSGLPHYS

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_1|1377_bp
atgaatagtagtgcaacaatagagaaagtggagagatcagttagaaaggaagatggtccc
tacagtgcagtttcctgccttggacagttcctggtcaaactggggacttggatggatgcc
aggcagctgaggagacaaaggagacatgttttatccgtggacccaaaaccccagcgctgg
tcacggactcgggaaggcagggacgcctctctgattattcacccaggtttcagaggtgtc
agaccacgcagggatgcctgccttggtccttcacccttagcggcaagtcccgcttttctg
ggggaggggcaagtactccaaccccttctctccgtgtctctaccccttctccgcctttct
gggggacaagaaacccccaaccccttctccttcacccttagcggcaagtcccgcttttct
ggggaaggggcaacatgctttgaaaggattaaagcctgttaccactcgcctgctacagca
tggccttttaaagcctataaactctccttacaattcccccattttacctgtcctaaaacc
agacaagactggcagcctctcttcactttcacttggactgaccctgacacccatcaggct
cagcagcttacctgggctgtgctgccacaagatttcaaggacagccctcattacttcagc
caagttctttctcaagtatccccctccaaagctcaaatttcttctccatccgttacctac
ctcggcataattcttcataaaaacacacgtgctctccctgtcgatcttgtccgactgatc
tctcaaaccccaaccccttctacaaaacaacaactgctttccttcctaggcatagttaga
tactttcgcctttggatacctggtttcaccatcctaacaaaaccattatataaactcaca
aaaggaaacctagctgaccccgtagatcctaaatactttccccactcctctttccgttcc
ttgaagacagctttagagactgcccccactctagctctccctgattcatcccaacccttt
tcattacacacagccaaagtgcagggctgtacagtcagaattcttatacaaggaccagga
tcgcgtcctgtagcctttttgtccaaacaacttgaccttactgttttaggttggccacca
tgtctccgagcagcggctgctgctgccctaatactttcagaggccctcaaaatcacaaac
tatgctcaacttactctctatatttctcataacttccaaaatctattttcttcctcatac
ctgatgcatatactttctgctccctggctccttcagctgaactcactcttcgttaagtcc
cacaattaccattgttcctggcctggacttcaatccggcctcccacattattcctga

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_2|161_aa
MEEDEFIGEKTFQRYCAEFIKHSQQIGDSWEWRPSKHPSDTSAGAHTHMDASDPTPSPCA
TIVNIHMEAISPAPSSPQPHLTSMHPATLLLLLVRANEHRSHCHRPTKDFGWHHPLECCG
QWSGNTLAPPVQQVHNLVGPENKAEGPIPNPSELEHTAQEG

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_2|486_bp
atggaagaagatgagttcattggagaaaaaacattccaacgttattgtgcagaattcatt
aaacattcacaacagataggtgatagttgggaatggagaccatcaaagcatccctctgac
accagtgctggtgcacacacacacatggatgccagtgaccctaccccctcaccctgtgcc
accattgtgaacattcacatggaggccatcagccctgcaccctccagtccccagccacat
ctaacaagtatgcaccctgccacactgctgttgctactagtacgtgcaaatgaacacaga
tcccactgccaccgcccaacgaaggactttggttggcatcacccattggagtgttgtggc
cagtggtctgggaacaccttggctcctccagtgcagcaggttcataaccttgtggggcca
gagaacaaagctgagggcccaatacccaacccctcagaattagagcacacagctcaggag
ggctga

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_3|165_aa
MVNPTVFFDIAVDGEPLGRVSFELLADKVPKTAENFHALSTGEKGFGYKGSCFHRIIPGF
MCQGGDFTRHNGTSGKSIYGEKFEDENFILKHTGPGILSMANAGPNTNGSQFFICTAKTE
WLDGMHVICGKVKEGMNIVEVMECFGSRNGKTSKKITIADCGQLE

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_3|498_bp
atggtcaaccccaccgtgttcttcgacattgccgtcgatggcgagcccttgggccgcgtc
tcctttgagctgttagctgacaaggtcccaaagacagcagaaaattttcatgctctgagc
actggagagaaaggatttggttataagggttcctgctttcacagaattattccagggttt
atgtgtcagggtggtgacttcacacgccataatggcactagtggcaagtccatctatggg
gagaaatttgaagatgagaacttcatcctaaagcatacaggtcctggcatcttgtccatg
gcaaatgctggacccaacacaaatggctcccagtttttcatctgcactgccaagactgag
tggttggatggcatgcatgtgatctgtggcaaagtgaaagaaggcatgaatattgtggag
gtcatggagtgctttgggtccaggaatggcaagaccagcaagaagatcaccattgctgac
tgtggacaactcgaataa

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_4|182_aa
MEDQMNELKREEKFREKRIKRHEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD
IIQENFPNLARQANIQIQEIQRMPQRYSSRRATPQHIIVRFTKVEMKEKILRAARQKGRV
THKGKPIRLTVDLSAETLQARREWRPIFNILKEENFQPRITYPAKLSFISEGEINTLQTS
KC

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_4|549_bp
atggaagatcaaatgaatgaattgaagcgagaagagaagtttagagaaaaaagaataaaa
agacacgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
ataggtgtacctgaaagtgacggggagaatggaaccaagctggaaaacactcttcaggat
attatccaggagaacttccccaacctagcaaggcaggccaatattcaaattcaggaaata
cagagaatgccacaaagatactcctcgagaagagcaactccacaacacatcattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatattaagggcagccagacagaaaggtcgggtt
acccacaaagggaagcccatcagactaacagtggatctctcagcagaaactctacaagcc
aggagagagtggaggccaatattcaacattcttaaagaagagaatttccaacccagaatc
acatatccagccaaactaagcttcataagtgaaggagaaataaatactttacagacaagc
aaatgctga

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_5|136_aa
MVTAWIREVCQGLVSLQAPLKRPCSHPMGELAAVCLPRGEILTTRECHRSNSKQEEDCDC
IQLNQSMSLLPVFKPWAPRSSEGQEAFLSKSMLHFEKLIQAAEPKCLTGKEKLRVEDKVP
SKLHWHLTTKTQQEKI

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_5|411_bp
atggtgactgcctggattagggaagtgtgccaggggttggtctccctgcaggcgcctctg
aagcgcccatgcagtcaccccatgggggagttagcagctgtgtgtctgccacgtggagag
atattgacaaccagggagtgtcacagaagcaacagcaaacaagaagaggactgtgactgc
atccagttaaaccagtcgatgtccctccttccagtttttaagccatgggctcctagaagt
agtgaggggcaggaggcttttctgagtaagagcatgcttcattttgagaagcttattcaa
gctgctgagcctaaatgccttactggcaaggaaaagttacgagttgaagacaaggtgcca
agcaaactgcactggcacctcactacaaaaactcagcaggaaaagatttga

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_6|97_aa
MDATYSRASSRTVTWVFLGFIAVVCGKAEEYGLILKCSGRIFSLQQPIIYVAVLKNMNVV
SQKSCYLYLQIVLTILPVGNTVWKESELYFGLDLEQL

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_6|294_bp
atggatgctacatacagtagggcctcttcacggactgtgacttgggtgtttctgggcttc
atcgcagttgtttgtgggaaagctgaagaatatggtctgatcttaaaatgttctgggaga
atatttagtctgcagcagcccatcatctatgtagctgtgcttaagaatatgaatgtagtc
agtcagaagtcatgctacttgtacctccaaattgtgctaactattctacctgttggcaac
actgtatggaaagaatctgagctgtattttggcttagatttggagcaactatag

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_7|262_aa
MVKIPRNPTYKGCEGPLQGELQTTAQGNKRGYKQMEEHSMLMGRKNQYRENGHTAQAAPA
VAKRGHGTARATASEGANPKPWQLPCGIGPAGVQKARIEVWEPTPRFQRMYGNAYMSRQK
SAAGVEPSWRASTRAVQRENVGLEPLHRVPTGALSSGPVRKGPLSSIHQKGRSTKSLHHA
PGKAVGTQNQPGKAAMGAVSCRATEAELPKPMGAHPLHQHAQDMRHGVKGDYFGALRFND
CPAGFQTCMGTVAPLFWPMSPI

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_7|789_bp
atggtaaaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaa
ctacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacattccatg
ctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagctgctccagct
gtggctaaaaggggccatggtacagctagggccacggcttcagagggtgcaaatcccaag
ccttggcagcttccatgtggtattgggcctgcaggtgtgcagaaggcaagaattgaggtt
tgggaacctacacctagatttcagaggatgtatggaaatgcctatatgtccaggcagaag
tctgctgcaggggtggagccctcatggagagcctctactagggcagtgcagagggaaaat
gtggggttggagcccctacacagagtccccactggagcactgtcaagtggacctgtgaga
aaagggccactgtcctccatacatcagaaaggtagatccaccaagagcttgcaccatgca
cctggaaaagctgtaggcactcaaaaccagccggggaaagcagccatgggagctgtatcc
tgcagagccacagaggcggagctgcccaaacccatgggagcccaccccttgcatcagcat
gcccaggatatgagacatggagtcaaaggagattattttggagctttaagatttaatgac
tgccctgctgggtttcagacttgcatggggactgtagcccctttgttttggccaatgtct
cccatttag

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_8|334_aa
MQRNKSRKAENSKNQNTSSPPKECSSLPATEQSWMENDYDALREEGFRRSVITDFSELKE
DVRTHHKEAKNLEKRLDEWLTRINSVEKSLNDLMDLKTMARELRDACTSFSSQSDQVEER
QEEKFRVKRVKRNKQSLQEIWNYVKIPNLRLIGVPESDRENGTKLGNTLQDIIQENFPNL
ARQANIQIEEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVTHKGKPIRL
TADLSAETLQARREWGTVFNILKEKNFQPRISYPAKLSFISEGEIKSITDKQMLRDFVTT
RPALQEPLKEALNMERNNGTSHCQNMPNCKDHQC

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_8|1005_bp
atgcagagaaacaagagcagaaaagctgaaaattctaaaaatcagaacacctcttctcct
ccaaaggaatgcagctccttgccagcaacggaacaaagctggatggagaatgactatgac
gcgttgagagaagaaggcttcagacgatcggtaataacagacttctctgagctaaaagag
gatgttcgaacccatcacaaagaagctaaaaacctggaaaaaagattagacgaatggcta
actagaataaacagtgtagagaaatccttaaatgacctgatggacctgaaaaccatggca
cgagagctacgtgatgcatgcacaagcttcagtagccaatctgatcaagtggaagaaagg
caagaagagaagtttagagtaaaaagagtaaaaagaaacaaacaaagcctccaagaaata
tggaactatgtgaaaataccaaatctacgtctgattggtgtacctgaaagtgacagggag
aatggaaccaagttgggaaacactcttcaggatattatccaggagaacttccccaaccta
gcaaggcaggccaacattcaaattgaggaaatacaaagaacgccacaaagatactcatca
agaagagcaactccaagacacatcattgtcagattcaccaaagttgaaatgaaggaaaaa
atgttaagggcagccagagagaaaggtcgggttacccacaaagggaagcccatcagacta
acagcggatctctcggcagaaactctacaagccagaagagagtgggggacggtattcaac
attcttaaagaaaagaattttcaacccagaatttcatatccagccaaactaagcttcata
agtgaaggagaaataaaatccattacagacaagcaaatgttgagagattttgtcaccacc
aggcctgccttacaagagcccctgaaggaagcattaaacatggaaaggaacaacggtacc
agccactgccaaaacatgccaaattgtaaagaccatcaatgctag

>gi568815593r:81909599_82110093|GENSCAN_predicted_peptide_9|86_aa
MIKAKDRALLLPRSTLKISLLVGKKQMMNALGMLVQPPLNSQRSFSLSKNSGANYWAMGS
PAGGAQEATFPNYYHLERDWQYELTQ

>gi568815593r:81909599_82110093|GENSCAN_predicted_CDS_9|261_bp
atgatcaaggcaaaagacagagcccttcttttgcccagatctacactgaagatctctctc
ctagtgggcaagaagcaaatgatgaatgctctggggatgctagttcaaccacctctgaat
tcccaaagatctttttctctttctaagaattcaggggccaactactgggccatgggaagc
ccagcaggtggagctcaggaagccaccttccccaactactaccacctagagagagactgg
cagtatgagctgacacagtga