GENSCAN 1.0 Date run: 2-Feb-117 Time: 14:40:17 Sequence gi568815597r:53932153_54153188 : 221036 bp : 44.68% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 14297 14397 101 2 2 107 96 60 0.764 8.45 1.02 Intr + 19834 20241 408 2 0 85 61 387 0.456 29.94 1.03 Intr + 25012 25036 25 2 1 98 87 42 0.549 2.18 1.04 Intr + 25997 26128 132 2 0 84 52 120 0.989 7.76 1.05 Intr + 28204 28322 119 1 2 86 103 22 0.923 3.61 1.06 Intr + 29882 29970 89 0 2 55 116 7 0.918 -0.11 1.07 Intr + 30144 30257 114 2 0 82 78 79 0.988 6.94 1.08 Intr + 30992 31057 66 1 0 68 115 32 0.855 3.00 1.09 Intr + 34144 34228 85 0 1 48 28 96 0.728 -1.01 1.10 Term + 35513 35787 275 0 2 109 38 167 0.891 9.43 1.11 PlyA + 35993 35998 6 1.05 2.04 PlyA - 36302 36297 6 1.05 2.03 Term - 43312 42914 399 1 0 10 48 426 0.805 25.92 2.02 Intr - 43915 43754 162 1 0 38 40 156 0.685 5.97 2.01 Init - 43947 43942 6 0 0 89 83 0 0.797 0.47 2.00 Prom - 44398 44359 40 -9.26 3.00 Prom + 44717 44756 40 -0.76 3.01 Sngl + 50454 50657 204 2 0 98 53 232 0.970 13.95 3.02 PlyA + 50884 50889 6 1.05 4.10 PlyA - 51993 51988 6 1.05 4.09 Term - 72410 72232 179 0 2 53 37 154 0.790 4.65 4.08 Intr - 74422 74302 121 1 1 58 116 117 0.998 11.57 4.07 Intr - 75080 74995 86 0 2 51 84 72 0.039 2.64 4.06 Intr - 78258 78130 129 1 0 105 91 89 0.975 11.57 4.05 Intr - 80128 79991 138 2 0 73 78 117 0.852 9.64 4.04 Intr - 82212 82084 129 1 0 47 71 101 0.842 4.97 4.03 Intr - 85275 85224 52 0 1 105 98 54 0.973 6.58 4.02 Intr - 86040 85923 118 2 1 113 94 40 0.921 7.57 4.01 Init - 96608 96430 179 2 2 49 72 140 0.547 7.23 4.00 Prom - 98301 98262 40 -4.36 5.03 PlyA - 99886 99881 6 1.05 5.02 Term - 113634 113551 84 0 0 100 53 67 0.710 2.05 5.01 Init - 121036 120848 189 1 0 90 97 237 0.993 21.81 5.00 Prom - 130733 130694 40 -2.56 6.00 Prom + 135635 135674 40 -4.16 6.01 Init + 150585 150594 10 2 1 105 94 0 0.811 3.05 6.02 Intr + 156445 156638 194 2 2 113 90 151 0.993 17.01 6.03 Term + 164133 164321 189 2 0 86 43 174 0.879 10.15 6.04 PlyA + 168099 168104 6 1.05 7.06 PlyA - 169494 169489 6 1.05 7.05 Term - 179526 179416 111 0 0 67 42 148 0.897 6.66 7.04 Intr - 188844 188685 160 2 1 110 72 25 0.146 3.09 7.03 Intr - 190032 189944 89 0 2 81 81 17 0.149 -0.93 7.02 Intr - 190317 190150 168 0 0 65 64 128 0.932 8.24 7.01 Init - 199210 199043 168 1 0 71 38 65 0.237 -0.67 7.00 Prom - 199751 199712 40 -0.26 8.07 PlyA - 200022 200017 6 -1.75 8.06 Term - 201199 200816 384 1 0 23 42 552 0.597 38.99 8.05 Intr - 204656 204478 179 0 2 63 91 345 0.998 31.94 8.04 Intr - 207954 207601 354 1 0 91 60 638 0.981 56.56 8.03 Intr - 209281 208946 336 2 0 26 91 536 0.513 43.19 8.02 Intr - 212661 212314 348 1 0 99 42 502 0.957 42.03 8.01 Init - 220770 220692 79 0 1 66 102 187 0.998 17.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 75120 74995 126 0 0 74 84 89 0.803 7.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:53932153_54153188|GENSCAN_predicted_peptide_1|471_aa XFVPGVLSPSPSAGPREEGAVTEELPLAAPGRGTVATKAMSYYLSSENHLDPGPIYMREN GQLHMVNLALDGVRSSLQKPRPFRLFPKGFSVELCMNREDDTARKEKTDHFIFTYTREGN LRYSAKSLFSLVLGFISDNVDHIDSLIGFPEQIAEKLFSAAEARQKFTEPVVILLMEERY LVISEKLEEIKSFRELTCLDLSCCKLGDEHELLEHLTNEALSSVTQLHLKDNCLSDAGVR KMTAPVRVMKRGLENLTLLDLSCNPEITDAGIGYLFSFRKLNCLDISGTGLKDIKTVKHK LQTHIGLVHSKVPLKEFDHSNCKTEGWADQLQAIMQDGRTIHLLDCSVTSPQIVLQWERV TAEAVKPRETSEPRAAAQRFYGKRSRAEAPLKCPLADTHMNSSEKLQFYKEKAPDCHGPV LKHEAISSQESKKSKKRPFEESETEQNNSSQPSKQKYVCLAVEDWDLLNSY >gi568815597r:53932153_54153188|GENSCAN_predicted_CDS_1|1416_bp nnctttgtgcccggggtgctgagcccttccccgtcagccgggccgcgggaggagggagcc gtcaccgaggagctgccgctcgctgccccgggcaggggcacagttgcaaccaaggcaatg tcttactacctcagctcagaaaaccacctggacccagggcccatctacatgcgagaaaat gggcagctgcacatggtcaatctggctctggatggtgtcaggagtagcctgcagaagcca aggcctttcagactgttccccaaaggcttttctgtggagctttgcatgaacagggaagac gacactgcacggaaagagaagactgatcatttcatcttcacatacacccgagaggggaat cttcggtactccgccaaatccctcttcagccttgtcctgggtttcatctccgacaatgtg gatcacattgattcccttattggctttcctgagcagattgctgaaaagctgttctctgct gctgaagccagacagaaattcactgagccagttgtgatcctgctgatggaagagaggtat ctcgtgatttcagaaaagcttgaggagattaagtctttccgggagctgacctgcctggat ctttcctgttgcaagcttggagatgagcatgaacttctagaacatctcaccaatgaagcc ctgtctagtgtaactcagctccacctgaaggataattgtttatctgatgctggggtgcgg aagatgacagcaccagttcgagtgatgaaaagaggccttgagaatctaacattattagac ttatcatgtaaccctgagatcacagatgcaggcattggatacctcttttcttttaggaaa ctaaactgcttagatatctctgggacagggctcaaggacatcaaaaccgtcaagcacaag ctccagacccacataggccttgttcactccaaagtgcctttgaaggaatttgatcatagt aactgcaagacagagggctgggctgaccagctccaagccataatgcaggatggtagaact atccacctgctggattgtagcgtcacctctccacagatcgttctgcagtgggagcgtgtg actgcggaagctgtgaagccacgggagacctcggagcctagagcagcagctcagcgcttc tatgggaagcggtctcgagcagaagccccactgaagtgtcccctggcagacacccacatg aactcttccgagaaactccagttctataaagagaaagccccagattgccatgggccagtg ttgaaacacgaagctatctcaagccaggagtcaaagaagagcaagaagagaccttttgag gagtcagagacagaacagaataactcttcacaaccttcaaagcagaaatatgtatgtctt gctgtggaagactgggacttgttaaattcctattga >gi568815597r:53932153_54153188|GENSCAN_predicted_peptide_2|188_aa MEPGSGHRRCCQGEEGHDPKEREQLRKPFIGGLSFETTDDGLREHFEKWVTLTDCVRGCG DGSGNFIVEKTLEVVEVILAVVETLVEEEAMVVEVVAAEIVMEEVMVDIMDLEVMVATMA AVLVIVVEGAMVVVDQDVETKVVDMVAVVEDMMVTMKEEILAVVTMVVVGTIMILEIIMD NSNQIKDS >gi568815597r:53932153_54153188|GENSCAN_predicted_CDS_2|567_bp atggagcccggctccggccatcgccgttgctgccagggggaggagggccatgatccaaag gaacgagagcagttgagaaaaccgtttattggtggtctgagctttgaaactacagatgat ggtttaagagaacattttgagaaatgggtcacactcacagattgtgtgagaggttgtgga gatggatctggcaattttattgtggaaaaaactttggaggtggtggaggtaattttggct gtggtggaaactttggtggaggaggaggctatggtggtggaggtggtggcagcagagata gttatggaggaggtgatggtggatataatggatttggaggtgatggtggcaactatggcg gcggtcctggttatagtagtagagggggctatggtcgtggtggaccaggatgtggaaacc aaggtggtggatatggtggcggtggtggaggatatgatggttacaatgaaggaggaaatt ttggccgtggtaactatggtggtggtgggaactataatgattttggaaattataatggac aacagcaatcaaattaaggactcatga >gi568815597r:53932153_54153188|GENSCAN_predicted_peptide_3|67_aa MAPAACGGSFQAAAQGRGAEAEPNRIPELKQQELPGQHTGEEERHRDISGDNAQRENFSG VLLSMCV >gi568815597r:53932153_54153188|GENSCAN_predicted_CDS_3|204_bp atggctccagctgcctgcgggggaagtttccaggctgcagcacagggaaggggagcagag gcggagcccaacagaattcctgaattgaagcagcaggagctcccaggacagcatactgga gaggaggagcggcacagagacatctccggagataatgcgcagagagagaacttcagcgga gtgctgctcagcatgtgcgtgtga >gi568815597r:53932153_54153188|GENSCAN_predicted_peptide_4|376_aa MTKHRAHRRSCMHYGSTQEDFWEKVIAKLKPGEKGNAKHNIESSVPDSGDTVLKDLEARG LALVTMQGTRAELSAHSPSRAAGNRRHEQGLPPGESCPQGENGYTAAESKAHPGGEAGGG HLCCSRRGACLSASLLLLLATVAALIALVTILGLPSCTPGAQACITLTNRTGFLCHDQRS CIPASGVCDGVRTCTHGEDEDESLCRDVPQSLPHFLVAHCGDPASWIYSDQKCDGTNNCG DCSDELSPGTWMKLEAIILSKLTQEQKTKHRMFSLISCKQELREPSGILSSDPEPEESSF PDEEDFGLALEARRISEGQRKFRVGVGSAGRTQSGQPAHKPGAVRGLALGPVAAEDALDF PAVLAHWRCARFLAGP >gi568815597r:53932153_54153188|GENSCAN_predicted_CDS_4|1131_bp atgaccaagcacagagcacataggaggagctgtatgcattatgggagcacacaggaggac ttctgggagaaagtgatagccaagcttaaacctggagaaaaagggaacgccaaacacaat atagagagcagtgtaccagacagtggggacactgtgctcaaagacctagaggccagaggg ttggccttggtgacaatgcagggaacccgtgctgagctctctgcacacagccccagtcgg gcagcaggaaaccgaagacatgaacaaggtcttcccccaggtgagagctgtccccaggga gagaatggctacactgctgctgaatccaaagcccaccctggaggggaagcaggcggcggc cacctctgctgctcacgtcgcggggcctgcctctctgcctctctgctgctcctcctggca actgtggcggccctcatcgccttggtcaccattcttggactcccatcatgcaccccagga gcccaagcttgtataacactgacaaacaggacaggcttcttgtgccatgaccagaggagc tgcattccagccagtggggtctgtgatggcgttcgcacctgtacccacggcgaggacgag gatgagagcttgtgccgagatgtgccccagagcctcccccacttccttgtggcccactgt ggagacccggcctcctggatctactcagaccaaaaatgtgatggcactaacaactgcggg gactgttcagatgaactgagcccagggacatggatgaagctggaagccatcatcctcagc aaactaacacaggaacagaaaaccaaacaccgcatgttctcactcataagttgcaagcag gagctgagagagccctcgggcatccttagttctgaccctgaaccagaggagtcttccttc ccagacgaggaggactttgggctggctctggaggctagaagaatttccgaaggtcagcgc aagttccgggtgggcgtgggctcggcgggccgcactcagagcggccagccagcccacaag cccggggcagtgaggggcttagcacttgggccagtagctgcggaggatgcactggatttc ccagcagtgctggcccactggcgctgtgctcgatttctcgccgggccttag >gi568815597r:53932153_54153188|GENSCAN_predicted_peptide_5|90_aa MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHT YPKHVQKHIPNLMSNMLAILVARISCHSLN >gi568815597r:53932153_54153188|GENSCAN_predicted_CDS_5|273_bp atggcggcgccgaaggggagcctctgggtgaggacccaactggggctcccgccgctgctg ctgctgaccatggccttggccggaggttcggggaccgcttcggctgaagcatttgactcg gtcttgggtgatacggcgtcttgccaccgggcctgtcagttgacctaccccttgcacacc taccctaagcatgtacagaagcatattcccaatctgatgagcaatatgcttgccatcttg gttgccagaatcagctgccattcgctgaactga >gi568815597r:53932153_54153188|GENSCAN_predicted_peptide_6|130_aa MVTGHTVNKMRKHSDSEVASLAREVYTEWKTFTEKHSNRPSIEVRSDPKTESLRKNAQKL LSEALELKMDHLLVENIERETFHLCSRLINGPYRRTVRALVFTLKHRAEIRAQVKSGSLP VGTFVQTHKK >gi568815597r:53932153_54153188|GENSCAN_predicted_CDS_6|393_bp atggtgacaggtcacactgtgaacaagatgcgtaaacactcagattcagaagtggcttct cttgccagagaagtttacactgagtggaaaactttcactgaaaaacattcaaatagacct tctattgaagttagaagtgatcccaaaaccgagtcgttgaggaaaaatgctcagaaatta ctctcagaagccttggaattaaagatggatcacctactggttgaaaatattgaacgggaa acgtttcatctctgctcccgcctcattaatgggccgtaccggcggacggtgagagccctg gtcttcacattaaagcaccgagctgaaatccgggctcaggtgaagagcggctcgctgcca gtcggcacgtttgtacagacccacaaaaagtga >gi568815597r:53932153_54153188|GENSCAN_predicted_peptide_7|231_aa MLPTGIKLKKLWPNYNSLNNPTSIHWIPDFVSVAVAEAPGTQQHTRQMRFLPARRLFSFK AERDAGRGDPAGVRCPDRTCSSEQPELKFVSRVSVCPLLAFPGVPSYVAGAQSCEGHGRG SGKGEFKGKVEPQQSGAPDSPRSLIQLCPPTYHHLSSPLLSLSPHCHHQTTQLPPPAMPQ FRPDHDQATECGALRGCREGYMGELCDGYHHHIFIFIIIKCLGLTREQRGQ >gi568815597r:53932153_54153188|GENSCAN_predicted_CDS_7|696_bp atgctgcccactggcatcaagttaaaaaaactatggcccaattacaattcactcaacaac ccaacaagcatccactggattccagattttgtgtctgttgctgtggcagaggctccaggc acacagcagcacacacgccagatgagattcctacctgcaaggcgtttgtttagtttcaag gcagaaagagatgctgggaggggggaccccgctggtgtcagatgtccagaccggacctgc agctccgagcagcccgagctcaagtttgtgagccgcgtgtccgtgtgtcccctgctggca ttcccgggcgttccctcctatgtggctggggcacagagctgtgaggggcacggcaggggt agtgggaagggagagttcaaggggaaggtggagcctcagcaaagcggggcgcctgacagc ccaaggtctctcatccagctctgtccgcccacgtaccaccatctctccagtccactcctt tctctctcgccccactgccaccaccagaccacccagctcccgcctccagccatgccccaa tttcgtccagaccacgaccaagccacggagtgtggtgcactaaggggttgtcgtgagggt tacatgggagagctgtgtgatggctaccatcatcacatcttcatcttcatcatcatcaag tgtttaggcctcacccgggagcaacggggccagtaa >gi568815597r:53932153_54153188|GENSCAN_predicted_peptide_8|559_aa MLAEWGACLLLAVALLGPGLQAQAMEGVKCGGVLSAPSGNFSSPNFPRLYPYNTECSWLI VVAEGSSVLLTFHAFDLEYHDTCSFDFLEIYNGASPDKGNLLGRFCGKVPPPPFTSSWHV MSVIFHSDKHVASHGFSAGYQKDVCGGVLTGLSGVLTSPEYPNNYPNSMECHWVIRAAGP AHVKLVFVDFQVEGNEECTYDYVAVLGGPGPTRGHHYCGSTRPPTLVSLGHELQVVFKSD FNIGGRGFKAYYFSGECQEVYMAMRGNFSSPQYPSSYPNNIRCHWTIRLPPGYQVKVFFL DLDLEEPNSLTKTCDFDHLAAFDGASEEAPLLGNWCGHHLPPPVTSSHNQLLLLLHTDRS TTRRGFSVAYIGVVPMNVSCSRTDFQILISTQALAPLERTKVYLGSRSCAAQEVGGNLRI QARFDTCGTESQGNKFSPSSFRCDEYPVTPQRRNNTSVIVSVLYIDFSAAGREDIHEYEV RCEPRRKEASVHLLSGSHWLGPYAATAEHLQEAPPMDEAEALEGPVSMVAQDTSDIVFLG LCILAGILMVIAIVVLMLL >gi568815597r:53932153_54153188|GENSCAN_predicted_CDS_8|1680_bp atgctggcagagtggggggcttgcctgctgctggcagtggcactgctgggcccagggctc caggcccaagccatggaaggtgtcaaatgtgggggtgtgctctcagcaccttctggaaac ttctccagccccaacttccctagactgtacccctacaacacagagtgcagctggctgatc gtggtggccgagggatcctcggtgctgctcaccttccatgcctttgacctagagtaccac gacacctgcagcttcgactttctggagatctacaatggggcctcaccagacaagggcaac ctgctggggaggttctgcggcaaggtgcccccgccgcccttcacctcctcctggcatgtc atgtctgtcatcttccactcggacaagcatgtggccagccatggcttttctgcgggctac cagaaagatgtgtgtggcggcgtcctgactggcctgtcaggggtcctcaccagtcctgag tatcccaacaactacccgaacagcatggagtgccactgggtgatccgggccgctggccct gcccacgtcaagctggtgttcgtggacttccaggtggagggcaatgaagagtgcacctat gactacgtggctgtgcttggggggcctggccccacccgtgggcaccactactgtggcagc accaggccccccaccctcgtgtctctgggccacgaactgcaggtggtcttcaagtccgac ttcaacatcggaggccgtggcttcaaggcctactacttctcaggagaatgccaggaggta tacatggccatgcggggcaacttctccagcccacagtaccccagctcctaccccaacaac atccgctgccactggaccatccgcctgcccccgggctaccaggtcaaggtgttcttcctg gacctggacctggaggagcccaacagcctgaccaagacctgtgactttgaccatctggcg gccttcgatggggccagcgaggaggcacccctgctggggaattggtgtggacaccacctg ccaccacccgtgacctcaagccacaaccagcttctgcttctgctgcacacagaccgcagc accacccgcaggggcttctctgtggcctacatcggagtggtgcccatgaacgtgagctgc tcccgcacggacttccagatcctgatctccacgcaggcgctggccccgctggagcggacc aaggtctacctgggcagccggagctgtgccgcccaggaggtcggcggcaacctcaggatc caggcccgctttgatacctgcggcactgagtctcagggaaacaagttcagcccaagctcc ttcaggtgtgatgagtaccctgtgaccccacagagaagaaacaacacttcagtgattgtc agcgtgctgtacatcgacttctcagccgcggggcgggaggacatccatgagtacgaggtc cgctgtgagccacggcgcaaggaggcttctgtccacctgctgtctggctctcactggctg gggccctatgctgccactgcggagcaccttcaggaagcaccacccatggatgaggcggag gcactggagggcccagtgagcatggtggcccaggataccagtgacatcgtcttcctgggc ctttgcatcctggctggaatcctcatggttattgccatcgtggtcttgatgctgctttga