GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:25:21 Sequence gi568815578r:36652791_36871194 : 218404 bp : 47.83% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 PlyA - 59 54 6 1.05 1.11 Term - 911 730 182 0 2 91 47 119 0.905 5.87 1.10 Intr - 2049 2011 39 1 0 85 82 50 0.656 2.30 1.09 Intr - 3621 3570 52 0 1 91 79 60 0.991 3.88 1.08 Intr - 3742 3707 36 1 0 124 110 9 0.913 5.16 1.07 Intr - 12307 12256 52 0 1 106 103 -4 0.968 1.81 1.06 Intr - 12511 12446 66 0 0 78 106 44 0.881 3.22 1.05 Intr - 13602 13499 104 0 2 81 15 133 0.820 4.27 1.04 Intr - 18607 18551 57 1 0 73 111 26 0.009 2.48 1.03 Intr - 28112 28026 87 2 0 81 106 48 0.478 6.07 1.02 Intr - 29788 29728 61 0 1 78 92 41 0.709 2.24 1.01 Init - 31669 31623 47 1 2 89 100 26 0.848 4.21 1.00 Prom - 38217 38178 40 -5.96 2.04 PlyA - 39073 39068 6 1.05 2.03 Term - 41339 41277 63 2 0 108 39 86 0.966 3.59 2.02 Intr - 54217 54182 36 1 0 110 110 36 0.276 6.46 2.01 Init - 68945 68889 57 2 0 60 98 49 0.699 4.41 2.00 Prom - 71637 71598 40 -3.56 3.10 PlyA - 71771 71766 6 1.05 3.09 Term - 84187 84117 71 2 2 116 55 13 0.009 -1.10 3.08 Intr - 93419 93255 165 0 0 54 109 181 0.009 16.73 3.07 Intr - 102060 101973 88 0 1 92 111 134 0.997 15.64 3.06 Intr - 103039 102892 148 0 1 83 103 98 0.995 11.04 3.05 Intr - 105371 105297 75 1 0 116 66 15 0.617 0.73 3.04 Intr - 109758 109671 88 1 1 56 86 36 0.420 -0.77 3.03 Intr - 118187 118083 105 0 0 53 75 67 0.350 2.09 3.02 Intr - 118683 118635 49 0 1 57 115 7 0.452 -1.45 3.01 Init - 121111 120872 240 1 0 45 96 361 0.977 28.47 3.00 Prom - 129056 129017 40 -6.96 4.23 PlyA - 129063 129058 6 -0.45 4.22 Term - 130591 130514 78 1 0 94 47 40 0.759 -1.74 4.21 Intr - 131568 131407 162 0 0 113 94 40 0.645 7.27 4.20 Intr - 131961 131845 117 0 0 102 23 50 0.287 0.56 4.19 Intr - 141809 140459 1351 1 1 138 92 1476 0.740 141.39 4.18 Intr - 144162 144092 71 0 2 120 34 104 0.804 6.18 4.17 Intr - 144783 144712 72 0 0 83 105 113 0.941 12.10 4.16 Intr - 150317 150066 252 2 0 115 94 481 0.960 49.03 4.15 Intr - 152112 151945 168 0 0 103 99 343 0.939 37.04 4.14 Intr - 153167 153063 105 2 0 48 78 140 0.984 9.41 4.13 Intr - 155953 155723 231 1 0 82 94 242 0.987 22.17 4.12 Intr - 157317 157192 126 0 0 121 96 249 0.740 29.88 4.11 Intr - 160054 159914 141 1 0 39 90 194 0.996 15.25 4.10 Intr - 163603 162336 1268 2 2 79 110 2162 0.366 204.80 4.09 Intr - 166817 166690 128 1 2 10 76 116 0.241 2.92 4.08 Intr - 171257 171163 95 2 2 116 56 35 0.407 1.86 4.07 Intr - 176422 176264 159 1 0 97 78 321 0.812 32.18 4.06 Intr - 182476 182429 48 1 0 107 42 41 0.438 0.28 4.05 Intr - 186651 186427 225 0 0 164 82 528 0.965 57.98 4.04 Intr - 194349 194319 31 2 1 83 103 5 0.447 -0.37 4.03 Intr - 197094 196942 153 2 0 86 46 98 0.404 4.59 4.02 Intr - 202455 202409 47 0 2 77 57 31 0.031 -3.89 4.01 Init - 210554 209868 687 2 0 110 81 858 0.702 80.38 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 37671 37916 246 2 0 62 43 163 0.810 4.28 S.002 Intr + 80507 80648 142 1 1 79 86 96 0.810 8.43 S.003 Init - 93465 93255 211 0 1 91 109 210 0.921 22.35 S.004 Term - 100107 99998 110 1 2 123 38 33 0.954 0.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:36652791_36871194|GENSCAN_predicted_peptide_1|260_aa MDELAEMLPPVLTHLSLKSIIGIGVGAGAYILSRFALNHPELVEGLVLINVDPCAKGWID WAASKLSGLTTNVVDIILAHHFGQEELQANLDLIQTYRMHIAQDINQDNLQLFLNSYNGR RDLEIERPILGQNDNKSKTLKCSTLLVVGDNSPAVEAVMADCGGLPQVVQPGKLTEAFKY FLQGMGYIPYVQLSHLSTESVPSASMTRLARSRTHSTSSSLGSGESPFSRSVTSNQSDGT QESCESPDVLDRHQTMEVSC >gi568815578r:36652791_36871194|GENSCAN_predicted_CDS_1|783_bp atggatgagctggctgaaatgctgcctcctgttcttacccacctaagcctgaaaagcatc attggaattggagttggagctggagcttacatcctcagcagatttgcactcaaccatcca gagcttgtggaaggccttgtgctcattaatgttgacccttgcgctaaaggctggattgac tgggcagcttccaaactctctggcctgacaaccaatgttgtggacattattttggctcat cactttgggcaggaagagttacaggccaacctggacctgatccaaacctacagaatgcat attgcccaagacatcaaccaagacaacctgcagctcttcttgaattcctacaatggacgc agagacctggagatcgaaagacccatactgggccaaaatgataacaaatcaaaaacatta aagtgttctactttactggtggtaggggacaattcgcctgcagttgaggctgtgatggcg gactgtgggggactgccccaggtagttcagcctgggaagctcaccgaggccttcaagtac tttttgcagggaatgggctacatcccgtatgtgcagctcagtcacctgagcaccgagtca gtaccatctgccagcatgactcggctcgcccgatcacgaacccactcaacctcgagtagc ctcggctctggagaaagtcccttcagccggtctgtcaccagcaatcagtcagatggaact caagaatcctgtgagtcccctgatgtcctggacagacaccagaccatggaggtgtcctgc taa >gi568815578r:36652791_36871194|GENSCAN_predicted_peptide_2|51_aa MDELQDVQLTEIKPLLNDKNGTRNFQDFDCQPTQQDNKDEDLYDDPLPLNE >gi568815578r:36652791_36871194|GENSCAN_predicted_CDS_2|156_bp atggatgaacttcaggatgttcagctcacagagatcaaaccacttctaaatgataagaat ggtacaagaaacttccaggactttgactgtcagcctactcaacaagataacaaggatgaa gacctttatgatgatccacttccacttaatgaatag >gi568815578r:36652791_36871194|GENSCAN_predicted_peptide_3|342_aa MRVGRRCTHVHRRRQAFCARVPEKLTAAHAPLVGGSFGGGDRDVEGGDRFTLIREYRLRV RKASRSDLGLWRATRGLVFQSLPPRMTSVTRSEIIDEQSASYQDRRQSWRRASMKETNRR KSLHPIHQGITASSLSEELKHFADGLETDGTLQKCFEDSNGFSLERQTWDQLLLHYQQEA KEILSRGSTEAKITEVKVEPMTYLGSSQNEVLNTKPDYQKILQNQSKVFDCMELVMDELQ GSVKQLQAFMDESTQCFQKVSVQLVPEVDGHSTHAPPWRPRRACADRSTSPAPLAAAAAA AAAAAAAAAAALTAGARASGKYEGCEWPLSRPEDEQGYFLLQ >gi568815578r:36652791_36871194|GENSCAN_predicted_CDS_3|1029_bp atgcgcgtcgggcggcggtgtacgcatgtgcatcgccggaggcaggcgttctgcgcgcgc gtgcccgagaaactgacggccgcgcatgcgcccttggttggcgggagtttcggaggcggt gaccgtgacgtagaaggtggagaccgcttcaccctgatcagggagtatcggctgcgggtg cgcaaggcgtccaggagtgacctggggctgtggagagcgacccgtggccttgtgtttcag agtttaccacctaggatgacttcagtgactagatcagagatcatagatgaacaatctgcc agttatcaagacaggaggcaatcctggcggcgagcaagtatgaaagaaacgaaccggcgg aagtcgctgcatcccattcaccagggcatcacagcatcttctctttctgaagaattgaaa cattttgcagacggactggaaactgatggaactctacaaaaatgttttgaagattcaaat gggttttctttagaacgtcagacttgggatcagctcttgcttcactaccagcaggaggct aaagagatattgtccagaggatcaactgaggccaaaattactgaggtcaaagtggaacct atgacatatcttgggtcttctcagaatgaagttcttaatacaaaacctgactaccagaaa atattacagaaccagagcaaagtctttgactgtatggagttggtgatggatgaactgcaa ggatcagtgaaacagctgcaggcctttatggatgaaagtacccagtgcttccagaaggtg tcagtacagctcgttcccgaggtggacgggcacagcactcatgcgccgccctggcgtcct cggcgcgcctgcgcggaccgcagcacctcccccgcccctctcgccgccgccgccgccgcc gccgccgccgccgccgccgccgctgctgctgcactgacggcgggtgcccgcgcctcagga aaatatgaggggtgtgaatggcccttatctaggcctgaggatgagcaggggtacttcctg ttacagtga >gi568815578r:36652791_36871194|GENSCAN_predicted_peptide_4|1904_aa MEAPAAEPPVRGCGPQPAPAPAPAPERKKSHRAPSPARPKDVAGWSLAKGRRGPGPGSAV ACSAAFSSRPDKKGRAVAPGARGAGVRVAGVRTGVRAKGRPRSGAGPRPPPPPPSLTDSS SEVSDCASEEARLLGLELALSSDAESAAGGPAGVRTGQPAQPAPSAQQPPRPPASPDEPS VAASSVGSSRLPLSASLAFSDLTEEMLDCGPSGLVRELEELRSENDYLKNGDVDVLAPHV AVRVPGSWCLVTFISSPTAHQLLQANSVIAMHLEPRAKVPVGPKSKLPPAVLSVQTNVFS TYLTGSDEIEELRAEMLEMRDVYMEEDVYQLQELRQQLDQASKTCRILQYRLRKAERRSL RAAQTGQVDGELIRGLEQDVKDLQNLGGAFAQHEEGKVSKDISMRLHKELEVVEKKRARL EEENEELRQRLIETELAKQVLQTELERPREDPVVSSETWKRLPTPVTASFPGTAFSSGIP TCNSDVYQVENHCAGYTLALIREDTFSPTNSLEFGQELCQELVPAHSAPQNAMAGTETPS TNLPGLCLSLWSQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGD LDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHG GGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELA KFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCD LASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSK AIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELN AFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPR IREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQER QLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQF KHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACT GDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKG HTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQE KLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHD KEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPG DPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVP VVHDKEGKIIIEPGFLFTTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAG PGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQTIRT VSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQ RRSVSKLDSSKDRSLWNLHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGST ESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASYHQP EGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNELCCPPFLTSPVSCL SHWLDFDVAHCSMWGSASHGLALVPAEHQGHASPASRLLHAPVTGVGRAGSQWSCGPVPL LDAAALTHEVRGALQGYLQSQISPFWNDDFIGMQIKSFWCISGS >gi568815578r:36652791_36871194|GENSCAN_predicted_CDS_4|5715_bp atggaggcgccggcggccgagcccccggtccggggctgcggtccccagcccgcgcctgca cccgcgcctgccccggagaggaaaaagagccaccgcgcgccgtcgccggcccggcccaaa gacgtggccggctggtcgctggctaagggccgccgcggcccaggcccgggctcagcagtc gcctgcagcgccgcgttctcctcacggccggacaagaaggggcgcgcagtggctcccggg gcgcgcggtgcgggtgtgcgggtggccggggtccgcaccggggtccgtgccaagggccgt ccgcgctcgggtgcggggccgcgaccgccaccgccgccccccagcctcacggatagcagc tcggaggtgtcggactgcgcgtcggaggaggcgcgcctgctgggcctggagctggcgctg agcagcgacgccgagtccgcggccgggggcccggcgggggtccgtacggggcagccggcc cagcccgcgccctccgcgcagcagcccccgcggccgcccgcctccccggacgagccgtcg gtggccgcgtcgtcggtgggcagcagccgcttgccgctcagcgcctcgcttgccttctcc gacctcaccgaggagatgctggactgcgggcccagcggcttggtgcgggagctggaggag ctgcgctcggagaacgactatctcaagaatggggatgtggatgtccttgccccgcacgtt gccgtgagggttccagggtcctggtgtttggtcactttcatctccagccccacagctcac cagctgctgcaggccaattccgtcatagccatgcatctggagccacgtgcaaaggttcca gtgggccccaaatctaaacttccccccgctgtcctcagtgtacaaacgaacgttttcagc acctacctcacagggtcggacgagattgaggagctgcgggccgagatgctggagatgcgg gacgtctatatggaggaggacgtgtatcagctgcaggagctgcgacagcagctggaccag gccagcaagacctgccgcatcctgcagtaccggctgcgcaaagccgagcgccgcagtctc cgtgccgcccagaccggccaggtggacggcgagcttatccgtggtctggagcaggatgtc aaggacctacagaacctcggtggggcctttgcccagcatgaggagggcaaggtctctaag gacatctccatgcggctgcataaggagctcgaggtggtggagaagaaacgggcgcggctg gaggaggagaacgaagagcttcgtcagcggctcatcgagactgagctggctaagcaggtg ctgcagacggagctggagcgaccgagagaggaccctgttgtgtcttcagaaacctggaag aggctgcccactcctgtgactgcttccttccctggcacagccttctcatccggaattcct acctgtaattctgatgtgtaccaggttgagaaccactgtgctggctacactctggccctc atcagagaagacacattttctcctaccaactctttggaatttggtcaagagctatgtcaa gaactggtcccagcacatagtgctccccaaaatgccatggctggtactgagactcccagc actaaccttcctgggttgtgtctgtctctgtggtcccaggaggacagtgcagacctgaag tgccagttgcactttgcaaaggaggagtcagccctcatgtgcaagaagctcactaagctt gccaaggagaatgacagcatgaaggaggagctgctgaagtaccgctcgctctatggggac ctggacagcgcgctgtcagccgaggagctggccgatgccccccactcgcgggagaccgag ctgaaggtgcacctgaagctggtggaggaggaagccaacctgctgagccgccgcatcgtg gagctggaggtggagaaccgaggcctgcgggctgagatggacgacatgaaggatcatgga ggtggctgtgggggtcctgaggcacgcctggccttctccgcgctgggtggcggagagtgc ggggagagcttggcagagctgcggcgacacctgcagtttgtcgaagaggaggccgagctg ctgcggcgctcctctgccgagctcgaggaccagaacaagctgctgctgaacgagctggcc aagttccgctcggagcacgagctggacgtggcgctgtcggaggacagttgttctgtgctc agcgaaccttcacaggaggagctggcggccgccaagctgcagatcggcgagctcagcggc aaggtcaagaagctgcagtacgagaaccgcgtgctcctctccaacctccagcgctgtgac ctcgcctcctgccagagtacgcggcccatgctggagacggacgccgaggccggggactct gcccagtgtgtgcctgctcccctgggcgagacacacgagtcccatgcggtccgactctgc agagccagggaggccgaggtgctgcctgggctgagagagcaggccgccctggtcagtaag gccatcgatgtcctggtggctgatgccaatggcttcacggctggcctccggctgtgtctg gacaacgagtgtgctgacttccggctgcatgaggcccccgacaacagcgagggccccagg gacaccaagctcatccatgccatcctggtgcgcctgagcgtgctgcagcaggagctgaat gccttcacgcggaaggcagatgcagtcctcgggtgctctgtcaaggaacagcaggagtcc ttctcatcactgccccccttgggctcccaggggctctctaaggagattcttctggcaaaa gaccttggctcagactttcagccacctgacttcagggacctgccggaatgggagcccagg atccgagaggctttccgcactggtgacttggactctaagcccgaccccagccggagcttc aggccttaccgagctgaagacaatgattcctatgcctctgagatcaaggagctgcagctg gtgctggctgaggcccacgacagcctccggggcttgcaagagcagctctcccaggagcgg cagctacgaaaggaggaggccgacaatttcaaccagaaaatggtccagctgaaggaggac cagcagagggcgctcctgaggcgggagtttgagctgcagagtctgagcctccagcggagg ctggagcagaaattctggagccaggagaagaacatgctggtgcaggagtcccagcaattc aagcacaacttcctgctgctcttcatgaagctcaggtggttcctcaagcgctggcggcag ggcaaggttttgcccagcgaaggggatgacttcctcgaggtgaacagcatgaaggagctg tacttgctgatggaggaagaggagataaacgctcagcattctgataacaaggcctgcacg ggggacagctggacccagaacacgcccaatgagtacatcaagacactggccgacatgaag gtgacgctgaaggagctgtgctggctgctccgggatgaacgccgtggtctgacggagctt cagcaacagtttgccaaggccaaggctacctgggagacagagcgggcagagctcaagggc catacctcccagatggagctgaagacagggaagggggccggggagcgggcagggcccgac tggaaggcagccctacagcgggagcgtgaggagcagcagcacctcctagctgagtcctac agcgctgtcatggagctgactcggcagctgcagatcagtgagcgcaactggagccaggaa aagctgcagctggtggagcggctgcagggtgagaagcagcaggtggagcagcaggtgaag gagctgcagaaccgcctaagccagctgcagaaggctgccgacccctgggtcctgaagcac tcggagctggagaagcaggacaacagctggaaggagacacgcagtgagaagatccacgac aaggaggctgtttccgaagttgagcttggaggaaatggtttaaagagaaccaaatctgtt tcttccatgtctgagtttgaaagtttgctcgactgttccccttaccttgctggcggagat gcccggggcaagaagctgcctaacaaccctgcctttggctttgtgagctccgagccaggg gatccagagaaagacaccaaggagaagcctgggctctcgtcgagggactgcaaccacctg ggtgccctggcctgccaggaccccccagggaggcagatgcagcgcagctacacggctcct gacaagacgggcatccgagtctactatagtcccccggtggcccggcgcctcggagtccct gtggttcatgacaaagagggcaagatcattatcgagcccggcttcctcttcaccacagcc aagcccaaagagtcggccgaggctgatgggctggctgagagctcctatggtcggtggctc tgcaacttctcacggcagcgcctggacggaggctcagcgggcagcccctcggcggccggg cctggcttcccagcggccctgcatgactttgagatgtcaggcaacatgagtgatgacatg aaggagatcaccaactgtgtgcgccaggccatgcgctccggctcactggagaggaaagtg aagagcacatccagccagacggtgggcctggccagtgtgggcacacagaccatccgcacg gtcagcgtgggcctgcagaccgacccaccccgcagcagcctccatggcaaggcctggtca ccccgcagctcttcgctcgtgtctgtgcgcagcaagcagatctcctcctccctggacaag gtccattcgcgcatcgagcggccctgctgctcccccaagtatggctcaccaaagctccag aggcggtctgtgtccaagctggacagcagcaaggaccgcagcctgtggaacctgcaccag ggcaagcagaacggctcggcctgggcccgctccaccaccacgcgggacagccctgtattg agaaacatcaacgatggactctccagcctcttcagtgtggtggagcactcagggagcacg gagtctgtctggaaactaggcatgtctgagacgcgggccaagcccgagcctcccaagtac ggcattgtgcaggaattcttccgtaatgtgtgtggccgggcaccgagccccacctcatca gcaggagaggagggcaccaagaagccagagcccctctccccagccagctaccatcagcca gagggtgtggccaggatcctgaacaagaaggcagccaagttgggcagcagtgaggaggtc agactcaccatgctcccccaggtggggaaggatggtgtcctccgggacggagatggagcc gtggtccttcccaatgagctctgctgcccgcccttcctcacgtcccctgtgagctgcctg agccattggttggatttcgatgtggctcattgcagcatgtggggcagcgcctcccatggc ctcgccttggtgccggctgaacaccagggtcatgccagtcccgccagccgcctcctccat gccccagtgactggtgtgggcagagcaggcagccagtggagctgtgggccagttccgctc ttggatgctgctgctctcacccatgaggtcaggggggccctccaaggttatctccagagc caaatcagccccttttggaatgatgacttcattggaatgcaaatcaagtcattttggtgc atcagtggctcttag