GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:19:26 Sequence gi568815597f:53954423_54196470 : 242048 bp : 45.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2742 2766 25 1 1 98 87 42 0.759 2.18 1.02 Intr + 3727 3858 132 1 0 84 52 120 0.987 7.76 1.03 Intr + 5934 6052 119 0 2 86 103 22 0.923 3.61 1.04 Intr + 7612 7700 89 2 2 55 116 7 0.918 -0.11 1.05 Intr + 7874 7987 114 1 0 82 78 79 0.988 6.94 1.06 Intr + 8722 8787 66 0 0 68 115 32 0.855 3.00 1.07 Intr + 11874 11958 85 2 1 48 28 96 0.728 -1.01 1.08 Term + 13243 13517 275 2 2 109 38 167 0.891 9.43 1.09 PlyA + 13723 13728 6 1.05 2.04 PlyA - 14032 14027 6 1.05 2.03 Term - 21042 20644 399 0 0 10 48 426 0.805 25.92 2.02 Intr - 21645 21484 162 0 0 38 40 156 0.685 5.97 2.01 Init - 21677 21672 6 2 0 89 83 0 0.797 0.47 2.00 Prom - 22128 22089 40 -9.26 3.00 Prom + 22447 22486 40 -0.76 3.01 Sngl + 28184 28387 204 1 0 98 53 232 0.970 13.95 3.02 PlyA + 28614 28619 6 1.05 4.10 PlyA - 29723 29718 6 1.05 4.09 Term - 50140 49962 179 2 2 53 37 154 0.790 4.65 4.08 Intr - 52152 52032 121 0 1 58 116 117 0.998 11.57 4.07 Intr - 52810 52725 86 2 2 51 84 72 0.039 2.64 4.06 Intr - 55988 55860 129 0 0 105 91 89 0.975 11.57 4.05 Intr - 57858 57721 138 1 0 73 78 117 0.852 9.64 4.04 Intr - 59942 59814 129 0 0 47 71 101 0.842 4.97 4.03 Intr - 63005 62954 52 2 1 105 98 54 0.973 6.58 4.02 Intr - 63770 63653 118 1 1 113 94 40 0.921 7.57 4.01 Init - 74338 74160 179 1 2 49 72 140 0.547 7.23 4.00 Prom - 76031 75992 40 -4.36 5.03 PlyA - 77616 77611 6 1.05 5.02 Term - 91364 91281 84 2 0 100 53 67 0.710 2.05 5.01 Init - 98766 98578 189 0 0 90 97 237 0.993 21.81 5.00 Prom - 108463 108424 40 -2.56 6.00 Prom + 113365 113404 40 -4.16 6.01 Init + 128315 128324 10 1 1 105 94 0 0.811 3.05 6.02 Intr + 134175 134368 194 1 2 113 90 151 0.993 17.01 6.03 Term + 141863 142051 189 1 0 86 43 174 0.879 10.15 6.04 PlyA + 145829 145834 6 1.05 7.06 PlyA - 147224 147219 6 1.05 7.05 Term - 157256 157146 111 2 0 67 42 148 0.897 6.66 7.04 Intr - 166574 166415 160 1 1 110 72 25 0.146 3.09 7.03 Intr - 167762 167674 89 2 2 81 81 17 0.149 -0.93 7.02 Intr - 168047 167880 168 2 0 65 64 128 0.932 8.24 7.01 Init - 176940 176773 168 0 0 71 38 65 0.237 -0.67 7.00 Prom - 177481 177442 40 -0.26 8.07 PlyA - 177752 177747 6 -1.75 8.06 Term - 178929 178546 384 0 0 23 42 552 0.597 38.99 8.05 Intr - 182386 182208 179 2 2 63 91 345 0.998 31.94 8.04 Intr - 185684 185331 354 0 0 91 60 638 0.981 56.56 8.03 Intr - 187011 186676 336 1 0 26 91 536 0.513 43.19 8.02 Intr - 190391 190044 348 0 0 99 42 502 0.957 42.03 8.01 Init - 198500 198422 79 2 1 66 102 187 0.998 17.12 8.00 Prom - 201518 201479 40 -7.66 9.12 PlyA - 204325 204320 6 1.05 9.11 Term - 205522 205421 102 1 0 84 49 34 0.009 -2.62 9.10 Intr - 207029 206922 108 2 0 114 48 42 0.025 3.28 9.09 Intr - 209171 209051 121 1 1 63 70 67 0.032 2.90 9.08 Intr - 220400 220246 155 2 2 111 49 75 0.489 4.77 9.07 Intr - 224930 224727 204 2 0 83 99 203 0.975 20.30 9.06 Intr - 229843 229739 105 1 0 129 82 38 0.887 7.71 9.05 Intr - 233317 233230 88 0 1 119 113 81 0.846 13.67 9.04 Intr - 236474 236326 149 2 2 111 38 205 0.638 16.83 9.03 Intr - 236989 236913 77 2 2 56 71 25 0.568 -3.17 9.02 Intr - 237446 237269 178 2 1 48 28 141 0.356 3.59 9.01 Intr - 241212 240997 216 0 0 51 94 152 0.310 10.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 52850 52725 126 2 0 74 84 89 0.803 7.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_1|301_aa XVILLMEERYLVISEKLEEIKSFRELTCLDLSCCKLGDEHELLEHLTNEALSSVTQLHLK DNCLSDAGVRKMTAPVRVMKRGLENLTLLDLSCNPEITDAGIGYLFSFRKLNCLDISGTG LKDIKTVKHKLQTHIGLVHSKVPLKEFDHSNCKTEGWADQLQAIMQDGRTIHLLDCSVTS PQIVLQWERVTAEAVKPRETSEPRAAAQRFYGKRSRAEAPLKCPLADTHMNSSEKLQFYK EKAPDCHGPVLKHEAISSQESKKSKKRPFEESETEQNNSSQPSKQKYVCLAVEDWDLLNS Y >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_1|906_bp nttgtgatcctgctgatggaagagaggtatctcgtgatttcagaaaagcttgaggagatt aagtctttccgggagctgacctgcctggatctttcctgttgcaagcttggagatgagcat gaacttctagaacatctcaccaatgaagccctgtctagtgtaactcagctccacctgaag gataattgtttatctgatgctggggtgcggaagatgacagcaccagttcgagtgatgaaa agaggccttgagaatctaacattattagacttatcatgtaaccctgagatcacagatgca ggcattggatacctcttttcttttaggaaactaaactgcttagatatctctgggacaggg ctcaaggacatcaaaaccgtcaagcacaagctccagacccacataggccttgttcactcc aaagtgcctttgaaggaatttgatcatagtaactgcaagacagagggctgggctgaccag ctccaagccataatgcaggatggtagaactatccacctgctggattgtagcgtcacctct ccacagatcgttctgcagtgggagcgtgtgactgcggaagctgtgaagccacgggagacc tcggagcctagagcagcagctcagcgcttctatgggaagcggtctcgagcagaagcccca ctgaagtgtcccctggcagacacccacatgaactcttccgagaaactccagttctataaa gagaaagccccagattgccatgggccagtgttgaaacacgaagctatctcaagccaggag tcaaagaagagcaagaagagaccttttgaggagtcagagacagaacagaataactcttca caaccttcaaagcagaaatatgtatgtcttgctgtggaagactgggacttgttaaattcc tattga >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_2|188_aa MEPGSGHRRCCQGEEGHDPKEREQLRKPFIGGLSFETTDDGLREHFEKWVTLTDCVRGCG DGSGNFIVEKTLEVVEVILAVVETLVEEEAMVVEVVAAEIVMEEVMVDIMDLEVMVATMA AVLVIVVEGAMVVVDQDVETKVVDMVAVVEDMMVTMKEEILAVVTMVVVGTIMILEIIMD NSNQIKDS >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_2|567_bp atggagcccggctccggccatcgccgttgctgccagggggaggagggccatgatccaaag gaacgagagcagttgagaaaaccgtttattggtggtctgagctttgaaactacagatgat ggtttaagagaacattttgagaaatgggtcacactcacagattgtgtgagaggttgtgga gatggatctggcaattttattgtggaaaaaactttggaggtggtggaggtaattttggct gtggtggaaactttggtggaggaggaggctatggtggtggaggtggtggcagcagagata gttatggaggaggtgatggtggatataatggatttggaggtgatggtggcaactatggcg gcggtcctggttatagtagtagagggggctatggtcgtggtggaccaggatgtggaaacc aaggtggtggatatggtggcggtggtggaggatatgatggttacaatgaaggaggaaatt ttggccgtggtaactatggtggtggtgggaactataatgattttggaaattataatggac aacagcaatcaaattaaggactcatga >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_3|67_aa MAPAACGGSFQAAAQGRGAEAEPNRIPELKQQELPGQHTGEEERHRDISGDNAQRENFSG VLLSMCV >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_3|204_bp atggctccagctgcctgcgggggaagtttccaggctgcagcacagggaaggggagcagag gcggagcccaacagaattcctgaattgaagcagcaggagctcccaggacagcatactgga gaggaggagcggcacagagacatctccggagataatgcgcagagagagaacttcagcgga gtgctgctcagcatgtgcgtgtga >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_4|376_aa MTKHRAHRRSCMHYGSTQEDFWEKVIAKLKPGEKGNAKHNIESSVPDSGDTVLKDLEARG LALVTMQGTRAELSAHSPSRAAGNRRHEQGLPPGESCPQGENGYTAAESKAHPGGEAGGG HLCCSRRGACLSASLLLLLATVAALIALVTILGLPSCTPGAQACITLTNRTGFLCHDQRS CIPASGVCDGVRTCTHGEDEDESLCRDVPQSLPHFLVAHCGDPASWIYSDQKCDGTNNCG DCSDELSPGTWMKLEAIILSKLTQEQKTKHRMFSLISCKQELREPSGILSSDPEPEESSF PDEEDFGLALEARRISEGQRKFRVGVGSAGRTQSGQPAHKPGAVRGLALGPVAAEDALDF PAVLAHWRCARFLAGP >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_4|1131_bp atgaccaagcacagagcacataggaggagctgtatgcattatgggagcacacaggaggac ttctgggagaaagtgatagccaagcttaaacctggagaaaaagggaacgccaaacacaat atagagagcagtgtaccagacagtggggacactgtgctcaaagacctagaggccagaggg ttggccttggtgacaatgcagggaacccgtgctgagctctctgcacacagccccagtcgg gcagcaggaaaccgaagacatgaacaaggtcttcccccaggtgagagctgtccccaggga gagaatggctacactgctgctgaatccaaagcccaccctggaggggaagcaggcggcggc cacctctgctgctcacgtcgcggggcctgcctctctgcctctctgctgctcctcctggca actgtggcggccctcatcgccttggtcaccattcttggactcccatcatgcaccccagga gcccaagcttgtataacactgacaaacaggacaggcttcttgtgccatgaccagaggagc tgcattccagccagtggggtctgtgatggcgttcgcacctgtacccacggcgaggacgag gatgagagcttgtgccgagatgtgccccagagcctcccccacttccttgtggcccactgt ggagacccggcctcctggatctactcagaccaaaaatgtgatggcactaacaactgcggg gactgttcagatgaactgagcccagggacatggatgaagctggaagccatcatcctcagc aaactaacacaggaacagaaaaccaaacaccgcatgttctcactcataagttgcaagcag gagctgagagagccctcgggcatccttagttctgaccctgaaccagaggagtcttccttc ccagacgaggaggactttgggctggctctggaggctagaagaatttccgaaggtcagcgc aagttccgggtgggcgtgggctcggcgggccgcactcagagcggccagccagcccacaag cccggggcagtgaggggcttagcacttgggccagtagctgcggaggatgcactggatttc ccagcagtgctggcccactggcgctgtgctcgatttctcgccgggccttag >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_5|90_aa MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHT YPKHVQKHIPNLMSNMLAILVARISCHSLN >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_5|273_bp atggcggcgccgaaggggagcctctgggtgaggacccaactggggctcccgccgctgctg ctgctgaccatggccttggccggaggttcggggaccgcttcggctgaagcatttgactcg gtcttgggtgatacggcgtcttgccaccgggcctgtcagttgacctaccccttgcacacc taccctaagcatgtacagaagcatattcccaatctgatgagcaatatgcttgccatcttg gttgccagaatcagctgccattcgctgaactga >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_6|130_aa MVTGHTVNKMRKHSDSEVASLAREVYTEWKTFTEKHSNRPSIEVRSDPKTESLRKNAQKL LSEALELKMDHLLVENIERETFHLCSRLINGPYRRTVRALVFTLKHRAEIRAQVKSGSLP VGTFVQTHKK >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_6|393_bp atggtgacaggtcacactgtgaacaagatgcgtaaacactcagattcagaagtggcttct cttgccagagaagtttacactgagtggaaaactttcactgaaaaacattcaaatagacct tctattgaagttagaagtgatcccaaaaccgagtcgttgaggaaaaatgctcagaaatta ctctcagaagccttggaattaaagatggatcacctactggttgaaaatattgaacgggaa acgtttcatctctgctcccgcctcattaatgggccgtaccggcggacggtgagagccctg gtcttcacattaaagcaccgagctgaaatccgggctcaggtgaagagcggctcgctgcca gtcggcacgtttgtacagacccacaaaaagtga >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_7|231_aa MLPTGIKLKKLWPNYNSLNNPTSIHWIPDFVSVAVAEAPGTQQHTRQMRFLPARRLFSFK AERDAGRGDPAGVRCPDRTCSSEQPELKFVSRVSVCPLLAFPGVPSYVAGAQSCEGHGRG SGKGEFKGKVEPQQSGAPDSPRSLIQLCPPTYHHLSSPLLSLSPHCHHQTTQLPPPAMPQ FRPDHDQATECGALRGCREGYMGELCDGYHHHIFIFIIIKCLGLTREQRGQ >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_7|696_bp atgctgcccactggcatcaagttaaaaaaactatggcccaattacaattcactcaacaac ccaacaagcatccactggattccagattttgtgtctgttgctgtggcagaggctccaggc acacagcagcacacacgccagatgagattcctacctgcaaggcgtttgtttagtttcaag gcagaaagagatgctgggaggggggaccccgctggtgtcagatgtccagaccggacctgc agctccgagcagcccgagctcaagtttgtgagccgcgtgtccgtgtgtcccctgctggca ttcccgggcgttccctcctatgtggctggggcacagagctgtgaggggcacggcaggggt agtgggaagggagagttcaaggggaaggtggagcctcagcaaagcggggcgcctgacagc ccaaggtctctcatccagctctgtccgcccacgtaccaccatctctccagtccactcctt tctctctcgccccactgccaccaccagaccacccagctcccgcctccagccatgccccaa tttcgtccagaccacgaccaagccacggagtgtggtgcactaaggggttgtcgtgagggt tacatgggagagctgtgtgatggctaccatcatcacatcttcatcttcatcatcatcaag tgtttaggcctcacccgggagcaacggggccagtaa >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_8|559_aa MLAEWGACLLLAVALLGPGLQAQAMEGVKCGGVLSAPSGNFSSPNFPRLYPYNTECSWLI VVAEGSSVLLTFHAFDLEYHDTCSFDFLEIYNGASPDKGNLLGRFCGKVPPPPFTSSWHV MSVIFHSDKHVASHGFSAGYQKDVCGGVLTGLSGVLTSPEYPNNYPNSMECHWVIRAAGP AHVKLVFVDFQVEGNEECTYDYVAVLGGPGPTRGHHYCGSTRPPTLVSLGHELQVVFKSD FNIGGRGFKAYYFSGECQEVYMAMRGNFSSPQYPSSYPNNIRCHWTIRLPPGYQVKVFFL DLDLEEPNSLTKTCDFDHLAAFDGASEEAPLLGNWCGHHLPPPVTSSHNQLLLLLHTDRS TTRRGFSVAYIGVVPMNVSCSRTDFQILISTQALAPLERTKVYLGSRSCAAQEVGGNLRI QARFDTCGTESQGNKFSPSSFRCDEYPVTPQRRNNTSVIVSVLYIDFSAAGREDIHEYEV RCEPRRKEASVHLLSGSHWLGPYAATAEHLQEAPPMDEAEALEGPVSMVAQDTSDIVFLG LCILAGILMVIAIVVLMLL >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_8|1680_bp atgctggcagagtggggggcttgcctgctgctggcagtggcactgctgggcccagggctc caggcccaagccatggaaggtgtcaaatgtgggggtgtgctctcagcaccttctggaaac ttctccagccccaacttccctagactgtacccctacaacacagagtgcagctggctgatc gtggtggccgagggatcctcggtgctgctcaccttccatgcctttgacctagagtaccac gacacctgcagcttcgactttctggagatctacaatggggcctcaccagacaagggcaac ctgctggggaggttctgcggcaaggtgcccccgccgcccttcacctcctcctggcatgtc atgtctgtcatcttccactcggacaagcatgtggccagccatggcttttctgcgggctac cagaaagatgtgtgtggcggcgtcctgactggcctgtcaggggtcctcaccagtcctgag tatcccaacaactacccgaacagcatggagtgccactgggtgatccgggccgctggccct gcccacgtcaagctggtgttcgtggacttccaggtggagggcaatgaagagtgcacctat gactacgtggctgtgcttggggggcctggccccacccgtgggcaccactactgtggcagc accaggccccccaccctcgtgtctctgggccacgaactgcaggtggtcttcaagtccgac ttcaacatcggaggccgtggcttcaaggcctactacttctcaggagaatgccaggaggta tacatggccatgcggggcaacttctccagcccacagtaccccagctcctaccccaacaac atccgctgccactggaccatccgcctgcccccgggctaccaggtcaaggtgttcttcctg gacctggacctggaggagcccaacagcctgaccaagacctgtgactttgaccatctggcg gccttcgatggggccagcgaggaggcacccctgctggggaattggtgtggacaccacctg ccaccacccgtgacctcaagccacaaccagcttctgcttctgctgcacacagaccgcagc accacccgcaggggcttctctgtggcctacatcggagtggtgcccatgaacgtgagctgc tcccgcacggacttccagatcctgatctccacgcaggcgctggccccgctggagcggacc aaggtctacctgggcagccggagctgtgccgcccaggaggtcggcggcaacctcaggatc caggcccgctttgatacctgcggcactgagtctcagggaaacaagttcagcccaagctcc ttcaggtgtgatgagtaccctgtgaccccacagagaagaaacaacacttcagtgattgtc agcgtgctgtacatcgacttctcagccgcggggcgggaggacatccatgagtacgaggtc cgctgtgagccacggcgcaaggaggcttctgtccacctgctgtctggctctcactggctg gggccctatgctgccactgcggagcaccttcaggaagcaccacccatggatgaggcggag gcactggagggcccagtgagcatggtggcccaggataccagtgacatcgtcttcctgggc ctttgcatcctggctggaatcctcatggttattgccatcgtggtcttgatgctgctttga >gi568815597f:53954423_54196470|GENSCAN_predicted_peptide_9|500_aa AAQAPLMMAEREEDDDTEEAWMQLRPTEPLPSQCCGSGCSPCVFDLYHRDLARWEAAQAS KDRSLLRGPESQSSVLSKAPGSCAREYPGTLADDEEVILDGGCGKGHSGDKALNAKPKNF CQLSDTVSTTMDIGLDILRSHQLINYANLDFSSFEEWSCPSKLNPETFVAFCIIAMDRLT KDTYRVRFALPGNSQLGLRPGQHLILRGIVDDLEIQRAYTPISPANAEGYFEVLIKCYQM GLMSRYVESWRVGDTAFWRGPFGDFFYKPNQYGELLLLAAGTGLAPMVPILQSITDNEND ETFVTLVGCFKTFESIYLKTFLQEQARFWNVRTFFVLSQESSSEQLPWSYQEKTHFGHLG QDLIKELVSCCRRKPFALVCGSAEFTKDIARFSAEFLRTLTRGVHRLLGAADAVKTNVKG LVLGLLPSRHSDPLPSDSPGGPIWWLRGPALQLDSPGFESWFCHLEDVSPDLCGQGGVRQ HLAITWAPPLDPGTSDRIQI >gi568815597f:53954423_54196470|GENSCAN_predicted_CDS_9|1503_bp gctgcccaagccccactgatgatggctgagagggaagaggacgacgacactgaggaagcc tggatgcagctacggcccacagaacccttgccttcccagtgctgcggcagtggctgctca ccctgtgtgtttgacctctatcaccgagatctggcaaggtgggaggcagcccaagccagc aaggacaggagcctgctgcgtgggccagagtcacagtccagtgttctttccaaagcacca ggcagttgtgccagagagtacccaggtactctggctgatgatgaagaggtcatcctggat ggtgggtgtgggaaggggcacagtggagacaaggccttgaatgccaagcccaagaatttc tgtcagctcagtgatactgtcagcactaccatggatattgggttagacattctgaggtcc catcaacttataaattatgcaaatctggattttagcagctttgaagaatggagctgcccc tccaagctgaacccagagaccttcgtggccttctgcatcattgccatggacaggctcact aaggacacctaccgtgtccggtttgctctacccgggaacagccagcttggcctgcggccc ggccagcacctcatcctacgagggatagtagatgacttagaaattcagagagcctatacg cccatcagccctgccaacgcagaaggatactttgaagtgttaattaagtgctaccagatg gggctgatgtcccggtatgttgagtcctggagagtaggagacacagctttctggcgagga cctttcggagatttcttctataaaccaaaccagtatggtgagctcctcttgctggctgcg ggcacgggcctggcccccatggtgcctatcctgcagagcatcacagacaatgagaatgac gagacttttgtcactctggtcggttgcttcaagacctttgagagcatctacctgaaaacc ttcctccaagagcaggcccgtttctggaatgtccgtaccttctttgtactcagccaggag agctcctcagagcagcttccctggagttaccaagagaaaacccactttggccacctgggc caggacctaattaaagagctggtcagctgctgtcggagaaagccattcgcactggtctgt ggctcggctgagttcaccaaagacatagccaggttttctgctgagttcctgaggaccctg actcgaggagtccaccgtctcctgggggctgcagatgctgtcaaaacaaatgtgaaaggt ctggtgctcggcctgctacccagtagacactcagaccctcttccatccgactcccctggc ggcccgatctggtggctcagaggaccagcattgcagctggacagtcccggatttgaatcc tggttctgccacttagaggatgtcagcccagacctctgtggacaaggaggtgttaggcag caccttgctatcacctgggcaccacccctggacccgggaaccagtgacaggatacagata tga