GENSCAN 1.0    Date run: 4-Nov-116    Time: 19:38:09

Sequence gi568815596r:206076183_206277247 : 201065 bp : 43.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8481   8606  126  2  0   84   74    10 0.376   0.08
 1.02 Term +   9212   9625  414  1  0   82   47   133 0.600   3.86
 1.03 PlyA +  10649  10654    6                               1.05

 2.16 PlyA -  10885  10880    6                               1.05
 2.15 Term -  40097  39988  110  0  2   82   36   100 0.723   2.87
 2.14 Intr -  50429  50357   73  2  1   89   91    58 0.910   5.18
 2.13 Intr -  50662  50528  135  1  0   29   87   119 0.977   6.66
 2.12 Intr -  51790  51678  113  2  2   86   64    54 0.471   2.90
 2.11 Intr -  54060  53906  155  2  2   72  115    20 0.957   2.82
 2.10 Intr -  56923  56763  161  1  2   84   58    51 0.911   0.49
 2.09 Intr -  62432  62303  130  1  1   74   47    87 0.949   3.90
 2.08 Intr -  65887  65759  129  0  0   51   73    57 0.550   0.31
 2.07 Intr -  66649  66504  146  1  2   84   72   105 0.618   7.58
 2.06 Intr -  67950  67836  115  2  1   77   53    45 0.895   0.35
 2.05 Intr -  68844  68710  135  2  0   50   76   121 0.986   6.78
 2.04 Intr -  70906  70721  186  0  0  109   65    85 0.983   7.10
 2.03 Intr -  71479  71349  131  1  2   46   93    99 0.985   5.69
 2.02 Intr -  73743  73636  108  0  0  108   75    67 0.966   7.88
 2.01 Init -  74037  73972   66  0  0   69   67    30 0.904   0.07
 2.00 Prom -  78388  78349   40                              -6.66

 3.00 Prom +  81442  81481   40                              -6.66
 3.01 Init +  82664  82716   53  1  2   53   49    73 0.715   0.53
 3.02 Intr +  83731  83877  147  1  0  106   72   231 0.884  22.65
 3.03 Intr +  84406  84528  123  1  0   89   98    84 0.998   9.20
 3.04 Intr +  85164  85290  127  0  1  -13   64   195 0.952   7.68
 3.05 Intr +  85856  85922   67  1  1   88   56    69 0.999   2.18
 3.06 Intr +  86307  86432  126  1  0   78   84    53 0.907   4.55
 3.07 Term +  86547  86701  155  1  2   66   38   187 0.894   9.58
 3.08 PlyA +  86727  86732    6                               1.05

 4.02 PlyA -  88165  88160    6                               1.05
 4.01 Sngl - 101065  99998 1068  1  0   59   38   494 0.917  38.65
 4.00 Prom - 104760 104721   40                              -5.06

 5.00 Prom + 108013 108052   40                              -4.66
 5.01 Init + 111177 111215   39  2  0   81   91    25 0.163   2.32
 5.02 Intr + 164261 164406  146  1  2  119   92    78 0.699  10.38
 5.03 Term + 177327 177411   85  0  1   48   36   124 0.196   0.33
 5.04 PlyA + 178459 178464    6                               1.05

 6.02 PlyA - 180241 180236    6                               1.05
 6.01 Sngl - 199016 198444  573  2  0   77   48   286 0.909  17.70

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:206076183_206277247|GENSCAN_predicted_peptide_1|179_aa
SPRCSLHLPLRRSCPEGGFWTQRNRDERGWVLKLKITPPVGGALTPLSRQPGPAAPRESL
RAGAGFGPAAPPSHPETPGDSRGRAEARGAGAGRGCRGAYRPEAYRRPPLPPHTPTWPAQ
AAPLAPLRLRLDPCPPAVAARGRPGGPQAPGLHHPGAGPTHPFILRPGPGVGRRIPGLT
>gi568815596r:206076183_206277247|GENSCAN_predicted_CDS_1|540_bp
tctcctcgctgctccctacatcttcctctccgccggagctgtccagaagggggattttgg
acccaaaggaatcgggatgaaagggggtgggtactcaagctgaaaataacacccccagtt
ggtggggccctgacgcccctgtcccggcagcccgggcctgccgccccgcgggaaagcctc
cgggccggcgcgggattcggcccagcagcgccgccgtcccacccggagaccccgggggac
tcgagggggcgcgcggaggcccggggtgcgggggcagggcgcggctgccgtggggcctac
cggcccgaggcctaccggcgcccccccctcccgccgcacacccccacctggcccgcgcag
gccgctccgctcgccccactccggctccggctggacccctgcccgcccgccgtcgccgcc
cgcgggcgccccggaggccctcaggcccccggcctccaccacccgggcgccggccccact
caccctttcattctccgcccggggcctggcgtgggccggcgaatccccggcctcacgtaa
>gi568815596r:206076183_206277247|GENSCAN_predicted_peptide_2|630_aa
MDVGSQVFSTVEGYFIQFLPHKACEKVGMQIPRFCYHERLSVAGNCRMCLVEIEKAPKDQ
SMMFGNDRSRFLEGKRAVEDKNIGPLVKTIMTRCIQCTRCIRFASEIAGVDDLGTTGRGN
DMQVGTYIEKMFMSELSGNIIDICPVGALTSKPYAFTARPWETRKTESIDVMDAVGSNIV
VSTRTGEVMRILPRMHEDINEEWISDKTRFAYDGLKRQRLTEPMVRNEKGLLTYTSWEDA
LSRVAGMLQSFQGKDVAAIAGGLVDAEALVALKDLLNRVDSDTLCTEEVFPTAGAGTDLR
SNYLLNTTIAGVEEADVVLLVGTNPRFEAPLFNARIRKSWLHNDLKVALIGSPVDLTYTY
DHLGDSPKILQDIASGSHPFSQVLKEAKKPMVVLGSSALQRNDGAAILAAVSSIAQKIRM
TSGVTGDWKVMNILHRIASQVAALDLGYKPGVEAIRKNPPKVLFLLGADGGCITRQDLPK
DCFIIYQGHHGDVGAPIADVILPGAAYTEKSATYVNTEGRAQQTKIAGMTLPYDTLDQVR
NRLEEVSPNLVRYDDIEGANYFQQANELSKLVNQQLLADPLVPPQLTIKDFYMTGEVTEI
NEALAENPGLVNKSRYEDGWLIKMTLSNPS
>gi568815596r:206076183_206277247|GENSCAN_predicted_CDS_2|1893_bp
atggatgtgggaagtcaagtcttcagcacagtggaaggatactttatccagtttcttcca
cataaggcttgtgagaaggttggcatgcagatccctcgattctgttatcatgaaaggttg
tctgttgctggaaactgcaggatgtgccttgttgaaattgagaaagcccctaaggaccag
tccatgatgtttggaaatgataggagccgatttttagaggggaagcgtgctgtggaagac
aagaacattgggccattggtaaagaccatcatgacaagatgtatacagtgtactcgctgc
atcaggtttgcaagtgagattgcaggagtagatgatttgggaacaacaggcagaggaaat
gatatgcaagttggcacatacattgaaaagatgttcatgtctgaactgtctgggaatatc
attgatatctgccctgtaggtgccctaacctctaagccctatgcctttactgcccggcct
tgggaaacaagaaagacagaatccattgatgtaatggatgcggttggaagtaatattgtg
gttagcacaagaactggagaagtgatgaggattttgccacgtatgcatgaggacatcaat
gaagagtggatctctgataaaaccagatttgcctatgatgggctaaaacgtcaaagactt
accgagccaatggtcagaaatgaaaaagggcttttaacctatacttcttgggaggatgcg
ctctctcgcgtagctggaatgttgcagagttttcaaggcaaagatgtggcagcaattgca
ggtggcttggtggatgctgaagccctggtagctctcaaagatttgcttaatagagtggac
tctgacaccttatgcactgaagaggtcttccccactgcaggagctggcacagatttgcgt
tccaattatcttcttaatactacaattgctggtgtggaagaggcagatgttgttcttctg
gttggtacaaacccacgttttgaggcaccactgtttaatgctagaattcgaaagagctgg
ctgcataatgacttaaaagtggcccttataggcagtccagtggacctcacttacacatat
gaccacctgggagactcccccaaaattcttcaagacattgcttcgggaagccatccattt
agccaggtcctaaaggaagctaaaaaaccaatggtggttttaggcagttctgcactccaa
agaaatgatggagcagcaattcttgcagctgtttctagcattgcacaaaagattcggatg
actagtggtgttactggtgattggaaagttatgaatatccttcataggattgcaagtcaa
gtagctgctttggaccttggctataagcctggggtggaagcaattcggaagaaccctccc
aaggtgctgtttctcctgggagcagatggaggttgtatcacacgacaggatttgccaaag
gattgtttcattatttatcaaggacatcatggtgatgttggggctcccatagctgatgtt
attctcccaggagctgcttacacagagaagtctgctacatatgtcaacactgagggtaga
gctcagcagactaagattgctggaatgactcttccatatgatactctggatcaagtaagg
aacagattggaagaagtctctcctaatcttgttcgatatgatgatattgaaggggctaat
tacttccagcaagcaaatgagctctcaaagctagtgaaccagcagcttcttgctgaccca
cttgttccacctcagctaactataaaagacttctacatgacaggagaagtaaccgaaatt
aatgaagctcttgcagaaaatccaggacttgtaaacaaatctcgttatgaagatggttgg
ctgatcaagatgacactgagtaacccttcataa
>gi568815596r:206076183_206277247|GENSCAN_predicted_peptide_3|265_aa
MLSNVVNGPNKQGQQTEAVGRPQFARSLSAAPQLSDTADTMGFGDLKSPAGLQVLNDYLA
DKSYIEGYVPSQADVAVFEAVSSPPPADLCHALRWYNHIKSYEKEKASLPGVKKALGKYG
PADVEDTTGSGATDSKDDDDIDLFGSDDEEESEEAKRLREERLAQYESKKAKKPALVAKS
SILLDVKPWDDETDMAKLEECVRSIQADGLVWGSSKLVPVGYGIKKLQIQCVVEDDKVGT
DMLEEQITAFEDYVQSMDVAAFNKI
>gi568815596r:206076183_206277247|GENSCAN_predicted_CDS_3|798_bp
atgctgtccaacgtggtcaatggaccaaacaaacaaggacaacaaacagaagccgtgggg
cgcccacaatttgcgcgctctctttctgctgctccccagctctcggatacagccgacacc
atgggtttcggagacctgaaaagccctgccggcctccaggtgctcaacgattacctggcg
gacaagagctacatcgaggggtatgtgccatcacaagcagatgtggcagtatttgaagcc
gtgtccagcccaccgcctgccgacttgtgtcatgccctacgttggtataatcacatcaag
tcttacgaaaaggaaaaggccagcctgccaggagtgaagaaagctttgggcaaatatggt
cctgccgatgtggaagacactacaggaagtggagctacagatagtaaagatgatgatgac
attgacctctttggatctgatgatgaggaggaaagtgaagaagcaaagaggctaagggaa
gaacgtcttgcacaatatgaatcaaagaaagccaaaaaacctgcacttgttgccaagtct
tccatcttactagatgtgaaaccttgggatgatgagacagatatggcgaaattagaggag
tgcgtcagaagcattcaagcagacggcttagtctggggctcatctaaactagttccagtg
ggatacggaattaagaaacttcaaatacagtgtgtagttgaagatgataaagttggaaca
gatatgctggaggagcagatcactgcttttgaggactatgtgcagtccatggatgtggct
gctttcaacaagatctaa
>gi568815596r:206076183_206277247|GENSCAN_predicted_peptide_4|355_aa
MEDLEETLFEEFENYSYDLDYYSLESDLEEKVQLGVVHWVSLVLYCLAFVLGIPGNAIVI
WFTGFKWKKTVTTLWFLNLAIADFIFLLFLPLYISYVAMNFHWPFGIWLCKANSFTAQLN
MFASVFFLTVISLDHYIHLIHPVLSHRHRTLKNSLIVIIFIWLLASLIGGPALYFRDTVE
FNNHTLCYNNFQKHDPDLTLIRHHVLTWVKFIIGYLFPLLTMSICYLCLIFKVKKRSILI
SSRHFWTILVVVVAFVVCWTPYHLFSIWELTIHHNSYSHHVMQAGIPLSTGLAFLNSCLN
PILYVLISKKFQARFRSSVAEILKYTLWEVSCSGTVSEQLRNSETKNLCLLETAQ
>gi568815596r:206076183_206277247|GENSCAN_predicted_CDS_4|1068_bp
atggaagatttggaggaaacattatttgaagaatttgaaaactattcctatgacctagac
tattactctctggagtctgatttggaggagaaagtccagctgggagttgttcactgggtc
tccctggtgttatattgtttggcttttgttctgggaattccaggaaatgccatcgtcatt
tggttcacggggttcaagtggaagaagacagtcaccactctgtggttcctcaatctagcc
attgcggatttcatttttcttctctttctgcccctgtacatctcctatgtggccatgaat
ttccactggccctttggcatctggctgtgcaaagccaattccttcactgcccagttgaac
atgtttgccagtgtttttttcctgacagtgatcagcctggaccactatatccacttgatc
catcctgtcttatctcatcggcatcgaaccctcaagaactctctgattgtcattatattc
atctggcttttggcttctctaattggcggtcctgccctgtacttccgggacactgtggag
ttcaataatcatactctttgctataacaattttcagaagcatgatcctgacctcactttg
atcaggcaccatgttctgacttgggtgaaatttatcattggctatctcttccctttgcta
acaatgagtatttgctacttgtgtctcatcttcaaggtgaagaagcgaagcatcctgatc
tccagtaggcatttctggacaattctggttgtggttgtggcctttgtggtttgctggact
ccttatcacctgtttagcatttgggagctcaccattcaccacaatagctattcccaccat
gtgatgcaggctggaatccccctctccactggtttggcattcctcaatagttgcttgaac
cccatcctttatgtcctaattagtaagaagttccaagctcgcttccggtcctcagttgct
gagatactcaagtacacactgtgggaagtcagctgttctggcacagtgagtgaacagctc
aggaactcagaaaccaagaatctgtgtctcctggaaacagctcaataa
>gi568815596r:206076183_206277247|GENSCAN_predicted_peptide_5|89_aa
MVERNFGAEHSLQDQFSLRRLFIDSCKQISGGNCSKAFQMCTGMDESEQKWTDVAQMEGS
AITLLHVIPPAALYTTWSRLRYRMQHSPR
>gi568815596r:206076183_206277247|GENSCAN_predicted_CDS_5|270_bp
atggttgagcgcaactttggggcagagcacagcttacaggatcaattttctctcaggagg
ctcttcattgactcatgcaaacagatttctggcgggaactgctccaaggccttccagatg
tgcacaggcatggatgaatcagagcagaagtggacagatgtagctcaaatggagggttct
gctatcactctattgcatgtaataccgccggcggcactctacacgacttggagccgcctg
cgctaccgtatgcagcactctccacggtag
>gi568815596r:206076183_206277247|GENSCAN_predicted_peptide_6|190_aa
MQPAPHLLPSPRHLDPGPEPPTPLTQQGQKGTRGGGKPRTCEAAAGAQRAVLASLGKKAP
RAPGEQAPPAGVAPAAPRPRRGTHLRGGCPDSFRAAVETEDRAGRSPFLFRFRGGARIPD
RPSATAPLPAEPEIGDFGPCGPEARSGAGARLSGRGPRVATPLTQASKATWRCNPDGGLT
SGKVRKPDPF
>gi568815596r:206076183_206277247|GENSCAN_predicted_CDS_6|573_bp
atgcagcctgcgccccacctccttccctcgccccgccacctcgatcccggcccggagccg
ccgacacctctcacgcagcaggggcagaaagggacgcggggtggggggaagccgaggacc
tgcgaggcggccgcgggagcccagcgagcggtcctggcgtccctcgggaagaaggcgccc
cgggcgcccggcgagcaggccccgcccgcgggcgtcgccccggccgcgccccgcccccgc
cgcggcactcacctccgcggaggctgcccggattccttccgggccgcggtggagaccgag
gaccgagccgggcgctctcccttcttattccggttccgcggcggcgcccgaatcccagac
cgtccgagcgccacggccccgctgcccgcggagccagaaataggcgacttcggcccctgc
ggtccggaggcgagaagcggcgcgggagcgcgtctctcaggacgcgggccccgagtcgcg
actccactcacgcaggcatccaaggcaacgtggcgttgcaaccctgatggaggtttgacg
tcagggaaggtgcgaaaaccagaccccttctga