GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:08:41 Sequence gi568815586r:56634147_56852298 : 218152 bp : 45.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.32 Intr - 985 799 187 0 1 86 94 118 0.014 11.89 1.31 Intr - 4277 4181 97 0 1 94 50 111 0.052 6.97 1.30 Intr - 5161 4960 202 1 1 110 94 221 0.967 23.76 1.29 Intr - 6046 5834 213 1 0 50 109 206 0.999 17.81 1.28 Intr - 8434 8312 123 1 0 97 110 95 0.999 13.38 1.27 Intr - 8685 8527 159 0 0 63 94 77 0.983 5.98 1.26 Intr - 9441 9257 185 1 2 93 121 225 0.999 25.81 1.25 Intr - 9812 9691 122 1 2 77 76 112 0.974 9.04 1.24 Intr - 10809 10635 175 1 1 108 98 138 0.999 15.80 1.23 Intr - 11207 11025 183 0 0 62 106 177 0.988 16.66 1.22 Intr - 11870 11691 180 0 0 47 119 128 0.483 11.64 1.21 Intr - 30654 30630 25 0 1 106 106 35 0.870 4.70 1.20 Intr - 32120 32058 63 2 0 47 95 94 0.550 4.91 1.19 Intr - 36218 36129 90 2 0 43 107 72 0.556 4.79 1.18 Intr - 38663 38594 70 1 1 50 101 -13 0.505 -4.62 1.17 Intr - 38919 38806 114 2 0 113 111 106 0.805 14.96 1.16 Intr - 53441 53351 91 0 1 41 84 53 0.003 -0.75 1.15 Intr - 54236 53852 385 2 1 34 82 122 0.001 0.62 1.14 Intr - 78762 78640 123 0 0 116 115 141 0.995 20.38 1.13 Intr - 79044 78916 129 0 0 69 59 95 0.980 5.59 1.12 Intr - 79537 79391 147 1 0 110 113 126 0.995 17.73 1.11 Intr - 80293 80216 78 1 0 76 116 108 0.999 12.15 1.10 Intr - 80541 80456 86 1 2 51 94 79 0.992 4.34 1.09 Intr - 82549 81725 825 2 0 87 30 222 0.127 7.69 1.08 Intr - 90377 90306 72 0 0 97 96 49 0.844 5.98 1.07 Intr - 100099 100001 99 2 0 61 83 91 0.972 5.98 1.06 Intr - 104379 104288 92 2 2 70 94 125 0.995 10.94 1.05 Intr - 105217 105148 70 2 1 104 95 59 0.999 6.44 1.04 Intr - 107430 107289 142 0 1 87 79 101 0.998 9.03 1.03 Intr - 112886 112797 90 2 0 45 48 94 0.209 1.29 1.02 Intr - 117049 116892 158 2 2 83 52 130 0.937 8.63 1.01 Init - 118152 118050 103 0 1 57 49 167 0.750 10.10 1.00 Prom - 119745 119706 40 -4.06 2.00 Prom + 124361 124400 40 -1.96 2.01 Init + 139707 140019 313 2 1 51 64 383 0.995 29.69 2.02 Intr + 147828 148086 259 1 1 83 78 175 0.991 12.52 2.03 Intr + 150707 150870 164 2 2 74 79 118 0.967 9.12 2.04 Intr + 160882 161046 165 2 0 43 44 93 0.371 0.33 2.05 Term + 161608 162161 554 2 2 56 49 229 0.696 10.28 2.06 PlyA + 163299 163304 6 1.05 3.07 PlyA - 163637 163632 6 1.05 3.06 Term - 165550 165482 69 1 0 81 48 65 0.149 -0.26 3.05 Intr - 169905 169800 106 2 1 59 92 39 0.233 1.62 3.04 Intr - 170894 170787 108 1 0 90 89 19 0.262 1.60 3.03 Intr - 173935 173800 136 2 1 28 76 75 0.243 -0.07 3.02 Intr - 175289 175119 171 0 0 67 45 74 0.164 0.91 3.01 Init - 176736 176367 370 0 1 33 26 180 0.084 3.46 3.00 Prom - 177507 177468 40 -1.26 4.00 Prom + 195430 195469 40 -2.06 4.01 Sngl + 210747 211445 699 2 0 75 34 168 0.660 6.32 4.02 PlyA + 213129 213134 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 588 780 193 2 1 96 90 65 0.971 6.63 S.002 Term - 4277 4177 101 0 2 94 55 117 0.946 7.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:56634147_56852298|GENSCAN_predicted_peptide_1|1626_aa METFDPTELPELLKLYYRRLFPYSQYYRWLNYGGVIKNYFQHREFSFTLKDDIYIRYQSF NNQSDLEKEMQKMNPYKIDIGAVYSHRPNQHNTVKLGAFQAQEKELVFDIDMTDYDDNNI KNDKYGPWLEWEIMLQYCFPRLDINVSKGINHLLKSPFSVHPKTGRISVPIDLQKVDQFD PFTVPTISFICRELDAISTNEEEKEENEAESDVKHRTRDYKKTSLAPYVKVFEHFLENLD KSRKGELLKKSEMPGEATETVPATEQELPQPQAETEAPTPPAVTPPSPEKGPATPAPKGT PTSPPVTPSSLKDSPTSPASVTCKMGATVPQASKGLPAKKGPTALKEVLVAPAPESTPII TAPTRKGPQTKKSSATSPPICPDPSAKNGSKGPLSTVAPAPLLPVQKDSSKTAKGKDASH SPKGPLAPPESKASTPLTAAAFEKVLPKPESASVSAAPSPPVSLPLAPSPVPTLPPKQQF LPSSPGLVLESPSKPLAPADEDELLPLIPPEPISGGVPFQSVLVNMPTPKSAGIPVPTPS AKQPVTKNNKGSGTESDSDESVPELEEQDSTQATTQQAQLAAAAEIDEEPVSKAKQSRSE KKARKAMSKLGLRQVTGVTRVTIRKSKNILFVITKPDVYKSPASDTYIVFGEAKIEDLSQ QAQLAAAEKFKVQGEAVSNIQENTQTPTVQEESEEEEVDETGVEVKDIELVMSQANVSRA KAVRALKNNSNDIVNAIMISLESRAGAQKQGLRTRGGKARAFCCVTSRRPASPPLVCTHA RSFSVLAPLFLHFPLLPDRRSRSFRAVHSGARGRARRCRRRLREARRGRDRREKAESPPE RSRLPSSSRRQRGPPTSSPVPLPRSQCPESSAPVVGAARCLAELLGEPHADLAVSCRQPA SAKWYDRRDYVFIEFCVEDSKDVNVNFEKSKLTFSCLGGSDNFKHLNEIDLFHCIDPNLN WLSVDFNNWKDWEDDSDEDMSNFDRFSEMMNNMGGDEDVDLPEVDGADDDSQDSDDEREQ AAAVAAAFSLHPDYAMLGFVGRVAAAPASGALRRLTPSASLPPAQLLLRAAPTAVHPVRD YAAQTSPSPKAGAATGRIVAVIGAVVDVQFDEGLPPILNALEVQGRETRLVLEVAQHLGE STVRTIAMDGTEGLVRGQKVLDSGAPIKIPVGPETLGRIMNVIGEPIDERGPIKTKQFAP IHAEAPEFMEMSVEQEILVTGIKVVDLLAPYAKGGKIGLFGGAGVGKTVLIMELINNVAK AHGGYSVFAGVGERTREGNDLYHEMIESGVINLKDATSKVALVYGQMNEPPGARARVALT GLTVAEYFRDQEGQDVLLFIDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGTMQER ITTTKKGSITSVQAIYVPADDLTDPAPATTFAHLDATTVLSRAIAELGIYPAVDPLDSTS RIMDPNIVGSEHYDVARGVQKILQDYKSLQDIIAILGMDELSEEDKLTVSRARKIQRFLS QPFQVAEVFTGHMGKLVPLKETIKGFQQILAGEYDHLPEQAFYMVGPIEEAVAKADKLAE EHSSGSFATALLSGCPHLGSSCPFKHHLLPSAAAGPGLGEEAQPPPGSATCASAAAAAAC GPALLQ >gi568815586r:56634147_56852298|GENSCAN_predicted_CDS_1|4878_bp atggagacgtttgaccccaccgagctgcccgagctgcttaaactttattaccggaggctc tttccctactctcagtactatcgctggctcaactacggtggagtgataaagaattacttt caacaccgtgaattttcattcacattgaaagatgatatttacattcgctaccaatccttc aacaaccagagtgatctggaaaaggagatgcagaaaatgaatccatacaagattgatata ggcgcagtatattctcacagacccaatcaacacaatacagtgaagctgggagctttccag gctcaggaaaaagaactggtatttgacattgacatgacagactatgacgataataacatc aaaaatgacaaatatggaccctggctggagtgggagattatgctccagtactgttttcca cggctggatatcaatgtcagcaaaggaatcaatcatctactgaagagcccttttagtgtt catcctaaaacaggtcgcatatctgtgcctattgatttgcagaaagtggaccagtttgat ccatttactgttccgaccataagcttcatctgccgtgaattggatgccatttccactaat gaagaggaaaaagaggagaatgaagctgaatctgatgtcaaacatagaaccagagattat aagaagaccagtctagcaccttatgtgaaagtttttgaacattttcttgaaaatctggat aaatcccgaaaaggagaacttcttaagaagagtgaaatgcccggcgaagccacagaaacc gtccctgctacagagcaggagttgccgcagccccaggctgagacagaggcacctactcct ccagctgtgactcctccatcccccgaaaagggcccagcaactccagcccccaaagggact cccacttccccacctgtgactccttcctccctcaaagactcccctacttccccagcttct gtcacatgtaaaatgggggccactgttcctcaagcatctaaagggcttccagcaaagaaa ggccccacagctctgaaagaagtacttgttgccccagctccagaaagcacgccaatcatc acagctcccactcggaaaggtccacagaccaaaaagagttctgctacttcacctcctata tgcccagatccctcagctaagaatggttctaaaggacccctttccacagtggctccagcc cctctactccctgttcagaaagactcttcaaagacagcaaaaggcaaagatgcttctcat tccccaaagggccccttggctcctcctgagtctaaggcgtccacccctctaacagcagct gcctttgagaaggtccttcctaaacctgaatcagcatctgtctctgcagcaccctcccca ccagtctctctgcctcttgctccctccccagttcccactctgcctcctaaacagcaattt ctgccgtcctctcctgggctggtgttggaatcaccctctaaaccccttgcccctgctgat gaggatgagctgctgcctctgattcccccggaaccaatctctgggggagtgcctttccag tcggtcctcgtcaacatgcccacccctaaatctgctggaatccctgtcccaaccccctct gccaagcaacctgttacgaagaacaacaaggggtctggaacagaatctgacagtgatgaa tcagtaccagagcttgaagaacaggattccacccaggcaaccacacaacaagcccagctg gcggcagcagctgaaattgatgaagaaccagtcagtaaagcaaaacagagtcggagtgaa aagaaggcacggaaggctatgtccaaactgggtcttcggcaggttacaggagttactaga gtcactatccggaaatctaagaatatcctctttgtcatcacaaaaccagatgtctacaag agccctgcttcagatacttacatagtttttggggaagccaagatcgaagatttatcccag caagcacaactagcagctgctgagaaattcaaagttcaaggtgaagctgtctcaaacatt caagaaaacacacagactccaactgtacaagaggagagtgaagaggaagaggtcgatgaa acaggtgtagaagttaaggacattgaattggtcatgtcacaagcaaatgtgtcgagagca aaggcagtccgagccctgaagaacaacagtaatgatattgtaaatgcgattatgatatct ctagaaagccgcgccggagcccaaaaacaaggactgcgcacgcgcggcggcaaggcccgg gcattttgctgcgtcaccagccgccgcccggcctcaccacccctcgtttgcacgcacgca cgttcattctccgtcctcgcgccccttttcctacactttcctcttctccccgaccggagg agccgctctttccgcgcggtgcattctggggcccgaggtcgagcccgccgctgccgccgt cgcctgagggaagcgagaagaggccgcgaccggagagaaaaagcggagtcgccaccggag agaagtcgactccctagcagcagccgccgccagagaggcccgcccaccagttcgcccgtc cccctgccccgttcacaatgccctgagtcgagtgcacctgtggtgggtgccgccaggtgc ctggcggagctcctgggagagccccacgcggacctcgccgtgagctgcaggcagcctgct tctgcaaagtggtacgatcgaagggactatgtcttcattgaattttgtgttgaagacagt aaggatgttaatgtaaattttgaaaaatccaaacttacattcagttgtctcggaggaagt gataattttaagcatttaaatgaaattgatctttttcactgtattgatccaaatcttaat tggcttagtgtcgacttcaataattggaaagactgggaagatgattcagatgaagacatg tctaattttgatcgtttctctgagatgatgaacaacatgggtggtgatgaggatgtagat ttaccagaagtagatggagcagatgatgattcacaagacagtgatgatgaaagagagcag gcggctgcggttgctgcagccttcagtctccacccggactacgccatgttggggtttgtg ggtcgggtggccgctgctccggcctccggggccttgcggagactcaccccttcagcgtcg ctgcccccagctcagctcttactgcgggccgctccgacggcggtccatcctgtcagggac tatgcggcgcaaacatctccttcgccaaaagcaggcgccgccaccgggcgcatcgtggcg gtcattggcgcagtggtggacgtccagtttgatgagggactaccaccaattctaaatgcc ctggaagtgcaaggcagggagaccagactggttttggaggtggcccagcatttgggtgag agcacagtaaggactattgctatggatggtacagaaggcttggttagaggccagaaagta ctggattctggtgcaccaatcaaaattcctgttggtcctgagactttgggcagaatcatg aatgtcattggagaacctattgatgaaagaggtcccatcaaaaccaaacaatttgctccc attcatgctgaggctccagagttcatggaaatgagtgttgagcaggaaattctggtgact ggtatcaaggttgtcgatctgctagctccctatgccaagggtggcaaaattgggcttttt ggtggtgctggagttggcaagactgtactgatcatggagttaatcaacaatgtcgccaaa gcccatggtggttactctgtgtttgctggtgttggtgagaggacccgtgaaggcaatgat ttataccatgaaatgattgaatctggtgttatcaacttaaaagatgccacctctaaggta gcgctggtatatggtcaaatgaatgaaccacctggtgctcgtgcccgggtagctctgact gggctgactgtggctgaatacttcagagaccaagaaggtcaagatgtactgctatttatt gataacatctttcgcttcacccaggctggttcagaggtgtctgcattattgggccgaatc ccttctgctgtgggctatcagcctaccctggccactgacatgggtactatgcaggaaaga attaccactaccaagaagggatctatcacctctgtacaggctatctatgtgcctgctgat gacttgactgaccctgcccctgctactacgtttgcccatttggatgctaccactgtactg tcgcgtgccattgctgagctgggcatctatccagctgtggatcctctagactccacctct cgtatcatggatcccaacattgttggcagtgagcattacgatgttgcccgtggggtgcaa aagatcctgcaggactacaaatccctccaggatatcattgccatcctgggtatggatgaa ctttctgaggaagacaagttgaccgtgtcccgtgcacggaaaatacagcgtttcttgtct cagccattccaggttgctgaggtcttcacaggtcatatggggaagctggtacccctgaag gagaccatcaaaggattccagcagattttggcaggtgaatatgaccatctcccagaacag gccttctatatggtgggacccattgaagaagctgtggcaaaagctgataagctggctgaa gagcattcatctggatccttcgcgactgctctcctgagcggttgtcctcacctcggtagt tcctgcccctttaagcaccacctcctcccctccgccgccgccggcccagggctgggggag gaggcgcagccgccgcccggctcggccacctgcgcctctgccgccgccgccgccgcctgc ggcccggccctgctccag >gi568815586r:56634147_56852298|GENSCAN_predicted_peptide_2|484_aa MWLYLAAFVGLYYLLHWYRERQVVSHLQDKYVFITGCDSGFGNLLARQLDARGLRVLAAC LTEKGAEQLRGQTSDRLETVTLDVTKMESIAAATQWVKEHVGDRGLWGLVNNAGILTPIT LCEWLNTEDSMNMLKVNLIGVIQVTLSMLPLVRRARGRIVNVSSILGRVAFFVGGYCVSK YGVEAFSDILRREIQHFGVKISIVEPGYFRTGMTNMTQSLERMKQSWKEAPKHIKETYGQ QYFDARVKPQTFTVSVTALKVARLELFVPPGGLVVLLGSGVKLQIFEVSVTAHKSSMDLK NSGAQLASPSGSRTGAAGGAACQYCAVRSHSSALGWSMGLGAVEQGVALVGEAPAAQEPT EGVGGSGMAGCRSQALSHGKAAKAWREIERSADGPALLGDPVHPPQPLARVLSPPLPGAS RAAWLLRVRGPPSPRPPRTSAGLQVPHTAPVTARVSPSTPPCKLREWALALASPERGSHS AVGD >gi568815586r:56634147_56852298|GENSCAN_predicted_CDS_2|1455_bp atgtggctctacctggcggccttcgtgggcctgtactaccttctgcactggtaccgggag aggcaggtggtgagccacctccaagacaagtatgtctttatcacgggctgtgactcgggc tttgggaacctgctggccagacagctggatgcacgaggcttgagagtgctggctgcgtgt ctgacggagaagggggccgagcagctgaggggccagacgtctgacaggctggagacggtg accctggatgttaccaagatggagagcatcgctgcagctactcagtgggtgaaggagcat gtgggggacagaggactctggggactggtgaacaatgcaggcattcttacaccaattacc ttatgtgagtggctgaacactgaggactctatgaatatgctcaaagtgaacctcattggt gtgatccaggtgaccttgagcatgcttcctttggtgaggagagcacggggaagaattgtc aatgtctccagcattctgggaagagttgctttctttgtaggaggctactgtgtctccaag tatggagtggaagccttttcagatattctgaggcgtgagattcaacattttggggtgaaa atcagcatagttgaacctggctacttcagaacgggaatgacaaacatgacacagtcctta gagcgaatgaagcaaagttggaaagaagcccccaagcatattaaggagacctatggacag cagtattttgatgcccgagtgaagccgcagaccttcacagtgagtgttacagctcttaag gtagcgcgtctggagttgttcgttcctcccggtgggctcgtggtcttgctgggctcagga gtgaagctgcagatcttcgaggtgagtgttacagctcataaaagcagtatggacctaaag aactcaggagcccagctggcttcacctagtggatcccgcaccggggctgcaggtggagct gcctgccagtactgcgccgtgcgctcgcattcctcagcccttgggtggtcgatgggactg ggcgccgtggagcagggggtggcgctcgtcggggaggctccggccgcacaggagcccacg gagggggtgggaggctcaggcatggcgggctgcaggtcccaagccctgtcccatgggaag gcagccaaggcctggcgagaaatcgagcgcagcgccgatgggccggcactgctgggggac ccagtacaccctccacagccactggcccgggtgctaagtcccccattgcccggggccagc agggctgcctggctgctccgagtgcggggcccaccaagcccacgcccacccagaacttca gctggcctgcaagtgccgcacacagccccggttaccgctcgtgtctctccctccacacct ccctgcaagctgagggagtgggctctggccttggccagcccagaaaggggctcccacagt gcagtgggggactga >gi568815586r:56634147_56852298|GENSCAN_predicted_peptide_3|319_aa MSFSYSQNKNPIMLAAVRRRDWGSMLHGASGIWGQAGAPPLLSWWGRSSLDAATATQIMA ANLGLLLHGGGKSPGTPNAVDLSLPVLLGRPELGSSPDCLGTAATTHTTAADPDYPALLG AQEVPGISKLPATTIFPGASHGSCLRCTWSSLSLAESQCPCQHLKLPGPLKQLACLTVHT GRSQSQLPLQQMPGEYGPIKVQVPFSLQDLRQIKGDLGKFLDSSNRLGSACSCRLASPCC RYPLRSKSKVMVKPELCHSPASASRRSCLQCAWSSCSLTESQSPCQHLELPGPLRQLAPV YDVPFPVSMSSRRSTPTYE >gi568815586r:56634147_56852298|GENSCAN_predicted_CDS_3|960_bp atgtctttctcctactcccagaacaagaaccccatcatgctggctgcagtaaggaggcgc gactggggctctatgcttcatggagccagcgggatctggggacaagcaggagctccgcca cttctgagttggtggggcaggagctccctggatgcagccacagccacccaaatcatggct gcaaacctgggcctcctgctgcatggaggaggcaaaagccctggcaccccgaatgctgtg gacctgagcctccctgtgctcttagggaggccagaactaggcagcagccctgactgccta ggcacagctgcaaccacccacactacagctgcagacccagactaccctgcactcttgggg gcccaggaagttcctggcatctccaagcttccagccaccaccatattccctggcgccagc catggaagctgcttgcggtgcacctggtccagccttagcctagcagagagccagtgccca tgccagcacctgaagctgcctggcccactgaagcagctggcatgtctgactgtgcacact ggcaggtcccagtcacaactgcccctacaacagatgcctggtgaatatggccccattaag gtccaggttcccttttctctacaggacttaaggcaaattaagggggatcttggcaagttt ttagacagctctaacaggcttggaagtgcctgctcctgccgcctggcttctccctgctgt cggtacccactccgatctaagagcaaagtcatggtcaagcctgagctctgtcacagccca gccagtgccagccgtagaagctgcttgcagtgtgcctggtctagctgcagcctcacagag agccagagcccatgccagcacctggagctgcccggcccactgcggcagctggccccagtg tatgatgttcccttccccgtgtccatgagctctcgtcgttcaactcccacttatgaataa >gi568815586r:56634147_56852298|GENSCAN_predicted_peptide_4|232_aa MAILPKIIYRFNAIPIKLPLTFFIELEKTTLNFIWNQKRACIAKTILSKKNKAGGIMLPD FKLHYKATVTKTAWYWYQNREIDQWNRTEASEITPHIYNHLIFDKPDKNKQWGKDSLFNN YCWENSLAICRKLKLDPFLTPYTKINSRWIKALNVRPKTIKTLEANVGNTIQDIGMGKDF MAKTPKAVVTKAKIEKWDLIKLKNFCTAKETERTGNLQNGRKFLQSIHLTKG >gi568815586r:56634147_56852298|GENSCAN_predicted_CDS_4|699_bp atggccatactgcccaagataatttatagattcaatgctatccccatcaagctaccactg actttcttcatagaattggaaaaaactactttaaatttcatatggaaccaaaaaagagcc tgcatagccaagacaatcctaagcaaaaagaacaaagctggaggcatcatgctacctgac ttcaaactacactacaaggctacagtaaccaaaacagcatggtattggtaccaaaacaga gagatagaccaatggaacagaacagaggcctcagaaataacaccacacatctacaaccat ctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattcgctatttaataat tattgctgggaaaactcgctagccatatgtagaaagctgaaactggatcccttccttaca ccttacacaaaaattaactcaaggtggattaaagccttaaatgtaagacctaaaaccata aaaaccctagaagcaaacgtaggcaataccattcaggacataggcatgggcaaggacttc atggctaaaacaccaaaagcagtggtaacaaaagccaaaatagaaaaatgggatctaatt aaactaaagaacttctgcacagcaaaagaaacagagagaacaggcaacctacagaatggg agaaaatttttgcaatctatccatctgacaaagggctaa