GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:41:30 Sequence gi568815597r:27566384_27769314 : 202931 bp : 48.87% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 779 910 132 2 0 119 76 9 0.293 2.56 1.02 Intr + 1717 1786 70 1 1 4 119 67 0.427 0.58 1.03 Intr + 2368 2629 262 0 1 86 60 134 0.715 7.46 1.04 Intr + 6126 6272 147 1 0 48 61 88 0.286 2.31 1.05 Intr + 15813 15854 42 1 0 103 80 21 0.050 1.01 1.06 Term + 36825 37141 317 1 2 66 52 154 0.309 4.70 1.07 PlyA + 40333 40338 6 1.05 2.14 PlyA - 42234 42229 6 1.05 2.13 Term - 46754 46531 224 0 2 47 38 299 0.615 17.88 2.12 Intr - 46967 46836 132 0 0 132 41 200 0.999 20.32 2.11 Intr - 48200 48047 154 2 1 85 44 266 0.997 21.55 2.10 Intr - 48543 48467 77 1 2 88 93 47 0.999 4.43 2.09 Intr - 49230 49051 180 1 0 -78 49 477 0.552 27.04 2.08 Intr - 49461 49306 156 1 0 88 55 309 0.999 27.58 2.07 Intr - 50623 50474 150 2 0 39 68 287 0.990 21.93 2.06 Intr - 50913 50810 104 2 2 127 97 57 0.993 10.32 2.05 Intr - 55274 55176 99 1 0 93 89 96 0.974 9.43 2.04 Intr - 56761 56659 103 2 1 56 102 204 0.999 17.83 2.03 Intr - 57543 57308 236 2 2 51 109 303 0.590 26.03 2.02 Intr - 68790 68682 109 1 1 34 115 28 0.129 -0.56 2.01 Init - 71590 71584 7 1 1 63 116 0 0.206 1.39 2.00 Prom - 75942 75903 40 -1.16 3.00 Prom + 78613 78652 40 -6.16 3.01 Init + 78860 78954 95 1 2 95 71 58 0.360 3.60 3.02 Intr + 79765 79942 178 1 1 41 110 53 0.277 2.72 3.03 Intr + 82036 82196 161 0 2 21 80 97 0.532 0.99 3.04 Term + 82906 83248 343 1 1 75 55 302 0.267 19.58 3.05 PlyA + 83328 83333 6 1.05 4.03 PlyA - 83578 83573 6 1.05 4.02 Term - 92515 92410 106 0 1 114 46 46 0.492 0.88 4.01 Init - 93698 93571 128 2 2 96 34 164 0.388 9.43 4.00 Prom - 93996 93957 40 -6.06 5.08 PlyA - 94746 94741 6 1.05 5.07 Term - 100092 99998 95 1 2 121 42 163 0.994 12.99 5.06 Intr - 101992 101843 150 2 0 122 109 233 0.983 28.93 5.05 Intr - 102152 102075 78 0 0 121 90 127 0.999 15.72 5.04 Intr - 102963 102586 378 1 0 108 110 189 0.039 18.04 5.03 Intr - 119087 119056 32 1 2 135 60 10 0.087 0.67 5.02 Intr - 121035 120789 247 1 1 78 46 281 0.550 19.42 5.01 Init - 123290 123233 58 2 1 58 70 48 0.713 1.47 5.00 Prom - 128524 128485 40 -1.16 6.00 Prom + 131374 131413 40 -4.46 6.01 Init + 159698 159778 81 1 0 103 91 149 0.981 17.67 6.02 Intr + 161089 161153 65 0 2 99 65 89 0.996 5.22 6.03 Intr + 166220 166274 55 2 1 80 127 34 0.993 5.48 6.04 Intr + 167648 167800 153 1 0 91 94 121 0.995 13.27 6.05 Intr + 172802 172999 198 1 0 3 35 145 0.191 0.25 6.06 Intr + 173612 174028 417 1 0 33 5 263 0.069 6.72 6.07 Intr + 178271 178428 158 1 2 138 91 67 0.840 10.81 6.08 Intr + 188812 188947 136 1 1 28 86 155 0.957 9.87 6.09 Intr + 193143 193244 102 2 0 95 62 87 0.966 7.17 6.10 Term + 194112 194198 87 2 0 96 48 75 0.984 1.96 6.11 PlyA + 194710 194715 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:27566384_27769314|GENSCAN_predicted_peptide_1|323_aa XLAEDPACCCLLPLLLAGKQQDCEFRICNPNAEPRPTGLQLPRRRGTPNPVGEKGPIVQG GQPAEMQPVGACGGPGRAGALFVGGELGAETGDPPESAQPSLGLDAYKRRLVLSRDACAF SNIPTAALQGQLHPHTRTKPARVGRGGLAKAPEIGPTRPYPSQPHMHCSEALTTPLQIPP QQYMTTPIPNSATQLNIHKQLTNPDFQEQNSESHRELAGFEPSPAEPRAPVETKTDRTAG TAATGSSGPDRGTQAPPVRCTKEPGRPVPSGGLGLGRRRHSPECWAAGSVGPSQARGFGG RLSSGGGSDAGARLLRRGVVEEP >gi568815597r:27566384_27769314|GENSCAN_predicted_CDS_1|972_bp nntctagctgaagatcctgcatgctgctgcttgctgccactgctgcttgcagggaagcag caagattgtgaattcaggatctgtaatcctaacgcagagcccaggcctaccgggctccag ttgccaaggcgacgaggcacacccaaccctgtgggtgagaagggcccgatagtacaaggt ggacagcccgcagaaatgcagccggtgggagcctgcggcgggccgggccgggccggggcg ctcttcgtgggcggggagctgggcgcggaaacaggggatcccccagagagcgcccagccc agcctggggctggacgcgtacaaaaggcgacttgttctttctcgggatgcctgcgccttc tcaaacatacctacagccgcactccagggacagctacacccacacacccgcacaaagcct gccagggtggggcgcggggggctggcgaaagccccagaaatcggccctacacgtccgtac cctagccagccacacatgcactgttcagaggccctgacaacacccctacagataccccca caacagtacatgacaactccaattccaaacagtgccactcagctcaacatccataaacaa ctgacaaacccagactttcaggaacagaacagcgaatcccacagggagttagcaggcttc gagccgagcccagccgagccgcgggccccggtggaaacaaagaccgatcgcaccgcgggc accgcagccaccgggtccagcggaccggaccgggggacgcaggccccccccgtgcgctgc acaaaagagcctggccggccggtccccagcgggggtctggggctggggaggaggcgccac tcaccggagtgctgggctgccggctccgtgggtccgagccaggcgaggggttttggggga cgtctgtcttccggcggtggcagcgacgcgggcgcccggctgctgaggagaggagtcgtg gaggagccctga >gi568815597r:27566384_27769314|GENSCAN_predicted_peptide_2|576_aa MKGAAVRGGQHPSSPRTPALAAAGAPAGSPRGRAELGSREPGMGCVFCKKLEPVATAKED AGLEGDFRSYGAADHYGPDPTKARPASSFAHIPNYSNFSSQAINPGFLDSGTIRGVSGIG VTLFIALYDYEARTEDDLTFTKGEKFHILNNTEGDWWEARSLSSGKTGCIPSNYVAPVDS IQAEEWYFGKIGRKDAERQLLSPGNPQGAFLIRESETTKGAYSLSIRDWDQTRGDHVKHY KIRKLDMGGYYITTRVQFNSVQELVQHYMEVNDGLCNLLIAPCTIMKPQTLGLAKDAWEI SRSSITLERRLGTGCFGDVWLGTWNGSTKVAVKTLKPGTMSPKAFLEEAQVMKLLRHDKL VQLYAVVSEEPIYIVTEFMCHGSLLDFLKNPEGQDLRLPQLVDMAAQVAEGMAYMERMNY IHRDLRAANILVGERLACKIADFGLARLIKDDEYNPCQGSKFPIKWTAPEAALFGRFTIK SDVWSFGILLTELITKGRIPYPETHPSGMNKREVLEQVEQGYHMPCPPGCPASLYEAMEQ TWRLDPEERPTFEYLQSFLEDYFTSAEPQYQPGDQT >gi568815597r:27566384_27769314|GENSCAN_predicted_CDS_2|1731_bp atgaaaggggcggcggtcagaggcgggcagcaccccagttctccccgcacgccggcactc gcggctgctggagccccggctggctcaccccggggccgggcagaattgggctccagggaa cctggaatgggctgtgtgttctgcaagaaattggagccggtggccacggccaaggaggat gctggcctggaaggggacttcagaagctacggggcagcagaccactatgggcctgacccc actaaggcccggcctgcatcctcatttgcccacatccccaactacagcaacttctcctct caggccatcaaccctggcttccttgatagtggcaccatcaggggtgtgtcagggattggg gtgaccctgttcattgccctgtatgactatgaggctcgaactgaggatgacctcaccttc accaagggcgagaagttccacatcctgaacaatactgaaggtgactggtgggaggctcgg tctctcagctccggaaaaactggctgcattcccagcaactacgtggcccctgttgactca atccaagctgaagagtggtactttggaaagattgggagaaaggatgcagagaggcagctg ctttcaccaggcaacccccagggggcctttctcattcgggaaagcgagaccaccaaaggt gcctactccctgtccatccgggactgggatcagaccagaggcgatcatgtgaagcattac aagatccgcaaactggacatgggcggctactacatcaccacacgggttcagttcaactcg gtgcaggagctggtgcagcactacatggaggtgaatgacgggctgtgcaacctgctcatc gcgccctgcaccatcatgaagccgcagacgctgggcctggccaaggacgcctgggagatc agccgcagctccatcacgctggagcgccggctgggcaccggctgcttcggggatgtgtgg ctgggcacgtggaacggcagcactaaggtggcggtgaagacgctgaagccgggcaccatg tccccgaaggccttcctggaggaggcgcaggtcatgaagctgctgcggcacgacaagctg gtgcagctgtacgccgtggtgtcggaggagcccatctacatcgtgaccgagttcatgtgt cacggcagcttgctggattttctcaagaacccagagggccaggatttgaggctgccccaa ttggtggacatggcagcccaggtagctgagggcatggcctacatggaacgcatgaactac attcaccgcgacctgagggcagccaacatcctggttggggagcggctggcgtgcaagatc gcagactttggcttggcgcgtctcatcaaggacgatgagtacaacccctgccaaggttcc aagttccccatcaagtggacagccccagaagctgccctctttggcagattcaccatcaag tcagacgtgtggtcctttgggatcctgctcactgagctcatcaccaagggccgaatcccc tacccagagacccatccttcaggcatgaataaacgggaagtgttggaacaggtggagcag ggctaccacatgccgtgccctccaggctgcccagcatccctgtacgaggccatggaacag acctggcgtctggacccggaggagaggcctaccttcgagtacctgcagtccttcctggag gactacttcacctccgctgaaccacagtaccagcccggggatcagacatag >gi568815597r:27566384_27769314|GENSCAN_predicted_peptide_3|258_aa MGNLGQVRRLSLWDYLLGLTHPRGLTTSQPGRSGLSPPAPPQQSFCMCQNVTPGIMALGM SAVYFQVSGTKEQPVPGHPMQSILLELWGFQVHHCVPGNPRPDFMEHSKDLTLSLLDHSC HWHGRSHSSKEYLELHRENFLLILRSAFPTGLLRAWPRDGISQYLLVELKNNMFRFLVAG SAEGAAGAMSGHEGGKKKPPKQPKKQAKEMDEEDKAFKQKQKEEQKKLEELKAKAAGKGP LATGGIKKSGEKQAVPCA >gi568815597r:27566384_27769314|GENSCAN_predicted_CDS_3|777_bp atggggaacctgggccaagtacggcggctgagcctgtgggactacctcttggggctgacc caccccagggggctcacaacaagtcaacctggcaggtcaggactgagccctccagctccg ccacagcagtccttctgcatgtgccaaaatgtgacaccagggatcatggccttggggatg tctgctgtttatttccaagtcagtggcactaaggagcagccagtcccagggcacccaatg cagagcatcctgcttgagctttggggcttccaggtacaccactgcgtgccagggaacccc aggccagacttcatggaacacagcaaggacctgaccctcagcctcttggatcactcctgc cactggcatgggaggagccatagcagcaaggagtacctggagctgcacagggagaacttc ctcctcatcctcagatctgcctttcccactggactcttgagggcctggcccagagatggc atctctcagtatttgctggttgaattgaaaaataacatgtttaggtttctggtggcaggg tctgcggaaggggcggcaggtgccatgtccggccacgaaggtggcaagaagaagccaccg aaacagcccaagaagcaggccaaggagatggacgaggaagacaaggctttcaagcagaaa caaaaagaggagcagaagaaactcgaggagctaaaagcgaaggccgcggggaaggggccc ttggccacaggtggaattaagaaatctggcgaaaagcaagctgttccttgtgcctga >gi568815597r:27566384_27769314|GENSCAN_predicted_peptide_4|77_aa MGGGGGPAFAQTAVGRWGGGAVGRPRRLQGVGVSRGTFMKPASDTNLCWDHFFPEPSPHT FYLANSSSLIIMEEDGS >gi568815597r:27566384_27769314|GENSCAN_predicted_CDS_4|234_bp atgggcggagggggaggcccggcgttcgcgcagacggcggtggggcggtggggcggtggg gcggtggggcggccgcgccggcttcagggcgtcggggtctcccggggcacgttcatgaag ccggcgagtgacacaaacctctgctgggatcacttcttccctgagccatcccctcacacc ttttacctggctaactcctcatccctgatcatcatggaagaggatggatcctga >gi568815597r:27566384_27769314|GENSCAN_predicted_peptide_5|345_aa MKLKESAVSDLGAKPSSDFRVLLSDWPGDNTLFQLKFMVKQLEKPDKKAEKHSYANQAKV KKSLQQKNVECTRVYTKNTICKENSSVNWLYMASAWTQWPPRISEPLGGDPIGLVTEPAR GATMRQKAVSLFLCYLLLFTCSGVEAGENAGKDAGKGTGKGAGKDASKGAGKDAGKDAGK DAGKDAGKDAGKDAGKGAGKDAGKDAGKDAGKDAGKDAGKGAGKDAGKDAGKDVGKDAGK KKCSESSDSGSGFWKALTFMAVGGGLAVAGLPALGFTGAGIAANSVAASLMSWSAILNGG GVPAGGLVATLQSLGAGGSSVVIGNIGALMGYATHKYLDSEEDEE >gi568815597r:27566384_27769314|GENSCAN_predicted_CDS_5|1038_bp atgaaattgaaagagagtgctgtgtcagaccttggtgctaaaccaagttcagatttcagg gtgcttctgtcagactggccaggggacaataccctgttccagttgaaatttatggtgaag cagctggagaaaccagacaagaaggcagaaaagcactcctacgccaaccaggccaaagtg aagaagtcccttcagcagaaaaatgtagagtgcacccgtgtgtacaccaagaacaccatc tgcaaggagaacagcagcgtgaactggctctacatggcatccgcctggacacagtggccc ccgaggatttcagagcccctgggaggagatcccattggtctagtgacggagcccgcgcgc ggcgccaccatgcggcagaaggcggtatcgcttttcttgtgctacctgctgctcttcact tgcagtggggtggaggcaggtgagaatgcgggtaaggatgcaggtaaggggacaggtaag ggtgcaggtaaggatgcaagtaagggtgcaggtaaggatgcgggtaaggatgcaggtaaa gatgcgggtaaggatgcaggtaaggatgcgggtaaggatgcaggtaagggtgcaggtaag gatgcgggtaaggatgcaggtaaggatgcgggtaaggatgcaggtaaggatgcaggtaag ggtgcgggtaaggatgcgggtaaggatgcaggtaaggatgtgggtaaggatgcaggtaag aaaaagtgctcggagagctcggacagcggctccgggttctggaaggccctgaccttcatg gccgtcggaggaggactcgcagtcgccgggctgcccgcgctgggcttcaccggcgccggc atcgcggccaactcggtggctgcctcgctgatgagctggtctgcgatcctgaatgggggc ggcgtgcccgccggggggctagtggccacgctgcagagcctcggggctggtggcagcagc gtcgtcataggtaatattggtgccctgatgggctacgccacccacaagtatctcgatagt gaggaggatgaggagtag >gi568815597r:27566384_27769314|GENSCAN_predicted_peptide_6|483_aa MAALYACTKCHQRFPFEALSQGQQLCKECRIAHPVVKCTYCRTEYQQESKTNTICKKCAQ NVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQCAFDRKDDRKKDA GGKAGAPIVCMLNQGLKHVSEGCVFAYCQVEEKPYWKDPNNDFRKNLEVTTVPTLLKYGT PQKLQPGQFRGQVPPYSRLWVHLDVMDGHFVPNVTFGYPVVESFQKQLVQDLFFDIHVMV SKLEQWVKSIAIAEASQYTFHLEATKYSEALIEDIWENGMKVGLTIKPGTTVEYLAPWTN QIDTALVITVEPGFGRQKFMDDMVDGKLLCWLCTLSYKRVLQKTKEQRKHLSSSSRAGHQ EKEQYSRLSGGGHYNSFSPDLALDSPGTDHFVIIAQLKEEVATLKKMLHQKDQMILEKEK KITELKADFQYQESQMRAKMNQMEKTHKEVTEQLQAKNRELLKQAAALSKSKKSEKSGAI TSP >gi568815597r:27566384_27769314|GENSCAN_predicted_CDS_6|1452_bp atggcggcgctctacgcctgcaccaagtgccaccagcgcttccccttcgaggcgctgtct caggggcagcagctgtgcaaggaatgtcggattgcacaccctgttgtgaagtgcacctac tgcaggactgagtaccagcaggagagtaaaaccaatacaatatgcaagaaatgtgctcag aacgtgcagttgtatggaacgcccaaaccttgtcagtattgcaacataattgcagcattt attgggaataaatgccagcgctgcacaaattcagaaaagaagtatggaccaccctattct tgtgaacagtgcaagcagcagtgtgcatttgacaggaaagatgatagaaagaaggatgct ggaggaaaagctggtgcccctattgtatgcatgctgaaccaggggctgaagcatgttagt gaaggatgtgtgttcgcctactgccaagtagaagaaaagccttattggaaagatccaaat aatgacttcagaaaaaacttggaagtaaccacagtgcctacactacttaaatatggaaca cctcaaaaactgcaacctggccagtttaggggtcaagtgcctccgtattctagactctgg gtccacctggatgtcatggatgggcactttgttcccaacgtcacctttggttatcctgtg gtagaaagctttcaaaagcagctagtccaggaccttttctttgacatacatgtgatggtg tccaagctggaacagtgggtaaaatcaatagctatagcagaagccagtcagtacaccttt catcttgaggctactaagtactcagaggctttgattgaagacatttgggagaatgggatg aaggttggccttaccatcaaaccaggaactacagttgagtatttggcaccatggactaat caaatagatacggccttggttatcacagtggaacctgggtttggaaggcagaaattcatg gatgatatggtagatgggaaattgctgtgctggctgtgcacactttcatacaaacgggtc cttcagaagaccaaagagcagaggaaacacctgagtagctcttctcgtgctggccaccag gagaaggagcagtatagtcgcctgagtggtggtggccattataacagcttctccccagac ctggctctggactcaccaggcactgaccactttgtcatcattgcccaactgaaggaagaa gtggctaccctgaagaagatgttgcatcaaaaggatcaaatgattttagagaaagagaag aagattacagagttgaaggctgattttcagtaccaggaatcgcagatgagagccaaaatg aaccagatggagaaaacccacaaagaagtcacagaacaactgcaggccaaaaaccgagag ctcctgaagcaggcagctgctttgtccaagagcaagaagtcagagaagtcaggagctata acctctccatga