GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:47:16 Sequence gi568815577r:34692138_35148899 : 456762 bp : 43.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15143 15252 110 1 2 68 116 77 0.969 7.58 1.02 Intr + 15807 15932 126 0 0 106 80 137 0.999 14.49 1.03 Intr + 16563 16669 107 0 2 45 70 79 0.969 1.66 1.04 Intr + 17220 17401 182 1 2 93 22 273 0.805 20.79 1.05 Term + 24184 24345 162 0 0 113 48 114 0.998 7.84 1.06 PlyA + 26161 26166 6 1.05 2.00 Prom + 38007 38046 40 -4.86 2.01 Init + 39032 39083 52 1 1 65 77 76 0.067 5.52 2.02 Intr + 51152 51489 338 0 2 37 64 179 0.048 5.64 2.03 Intr + 62290 62468 179 0 2 76 76 125 0.723 8.82 2.04 Intr + 64340 64467 128 2 2 46 11 100 0.068 -1.58 2.05 Intr + 70868 71035 168 0 0 110 86 27 0.676 4.62 2.06 Term + 72937 73163 227 2 2 34 32 191 0.776 5.04 2.07 PlyA + 74505 74510 6 1.05 3.04 PlyA - 76327 76322 6 1.05 3.03 Term - 88725 88706 20 1 2 97 47 19 0.257 -2.82 3.02 Intr - 90207 90159 49 0 1 74 109 -4 0.423 -1.45 3.01 Init - 91093 90983 111 1 0 76 82 126 0.899 9.10 3.00 Prom - 93373 93334 40 -4.46 4.10 PlyA - 93394 93389 6 1.05 4.09 Term - 100473 99998 476 1 2 84 54 858 0.994 77.05 4.08 Intr - 107325 107164 162 1 0 96 82 83 0.978 8.45 4.07 Intr - 127201 127040 162 2 0 100 68 78 0.384 6.95 4.06 Intr - 138586 138536 51 2 0 62 87 59 0.684 2.08 4.05 Intr - 142464 142273 192 1 0 96 110 113 0.915 13.76 4.04 Intr - 167441 167337 105 0 0 107 100 88 0.829 12.09 4.03 Intr - 172504 172373 132 2 0 108 81 10 0.185 2.92 4.02 Intr - 188576 188420 157 2 1 65 94 175 0.950 15.38 4.01 Init - 194975 194706 270 2 0 46 79 427 0.644 34.57 4.00 Prom - 195245 195206 40 -11.04 5.00 Prom + 195338 195377 40 -7.16 5.01 Init + 196802 197087 286 1 1 57 37 305 0.914 17.82 5.02 Term + 197910 198235 326 1 2 143 49 150 0.977 11.53 5.03 PlyA + 200849 200854 6 1.05 6.00 Prom + 203074 203113 40 -4.06 6.01 Init + 233287 233439 153 0 0 82 94 89 0.703 8.88 6.02 Intr + 241099 241173 75 0 0 85 75 28 0.152 0.91 6.03 Intr + 289737 289823 87 2 0 80 96 -1 0.006 0.07 6.04 Intr + 293103 293203 101 2 2 48 58 56 0.002 -2.49 6.05 Intr + 299401 299530 130 1 1 17 21 175 0.012 4.40 6.06 Intr + 307241 307288 48 1 0 74 96 62 0.174 4.48 6.07 Intr + 307385 307418 34 1 1 107 62 22 0.103 -0.70 6.08 Intr + 314832 314972 141 1 0 69 58 64 0.584 1.82 6.09 Intr + 321281 321342 62 0 2 91 57 98 0.044 5.45 6.10 Intr + 329991 330118 128 2 2 -11 89 76 0.013 -2.72 6.11 Term + 334521 334848 328 0 1 67 54 182 0.392 6.88 6.12 PlyA + 335578 335583 6 1.05 7.03 PlyA - 339765 339760 6 1.05 7.02 Term - 343925 343876 50 0 2 134 55 24 0.572 1.17 7.01 Init - 344331 344271 61 0 1 63 84 21 0.555 0.61 7.00 Prom - 344421 344382 40 -1.86 8.08 PlyA - 344691 344686 6 1.05 8.07 Term - 357619 357478 142 0 1 76 44 155 0.655 7.30 8.06 Intr - 372896 372799 98 2 2 45 109 10 0.301 -2.39 8.05 Intr - 378278 378141 138 2 0 68 88 49 0.341 3.56 8.04 Intr - 380630 380455 176 0 2 60 95 23 0.146 -0.14 8.03 Intr - 382595 382494 102 0 0 69 99 11 0.166 0.45 8.02 Intr - 386767 386612 156 2 0 125 58 80 0.969 8.68 8.01 Init - 402401 402266 136 2 1 33 116 103 0.456 7.90 8.00 Prom - 407503 407464 40 -3.06 9.00 Prom + 423295 423334 40 -5.76 9.01 Init + 424556 424684 129 1 0 70 71 71 0.185 3.75 9.02 Intr + 430421 430459 39 1 0 99 98 5 0.045 1.02 9.03 Intr + 438747 438783 37 2 1 69 82 14 0.062 -3.26 9.04 Intr + 439032 439130 99 1 0 53 87 59 0.399 2.38 9.05 Intr + 444590 444712 123 0 0 66 100 65 0.627 6.06 9.06 Term + 446600 446781 182 0 2 113 41 15 0.307 -2.93 9.07 PlyA + 451969 451974 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 51169 51489 321 0 0 6 64 252 0.888 10.00 S.002 Init - 145640 145535 106 2 1 84 35 83 0.841 2.99 S.003 Init - 390278 390206 73 2 1 39 119 81 0.982 7.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_1|228_aa AGYDGESIGNCPFSQRLFMILWLKGVIFNVTTVDLKRKPADLQNLAPGTNPPFMTFDGEV KTDVNKIEEFLEEKLAPPRYPKLGTQHPESNSAGNDVFAKFSAFIKNTKKDANEIHEKNL LKALRKLDNYLNSPLPDEIDAYSTEDVTVSGRKFLDGDELTLADCNLLPKLHIIKIVAKK YRDFEFPSEMTGIWRYLNNAYARDEFTNTCPADQEIEHAYSDVAKRMK >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_1|687_bp gctggttatgatggtgagagtatcggaaattgcccgttttctcagcgtctctttatgatt ctctggctgaaaggcgttatatttaatgtgaccacagtggacctgaaaaggaaacccgca gacctgcagaacctggctcccggaacaaaccctcctttcatgacttttgatggtgaagtc aagacggatgtgaataagatcgaggagttcttagaggagaaattagctcccccgaggtat cccaagctggggacccaacatcccgaatctaattccgcaggaaatgacgtgtttgccaaa ttctcagcgtttataaaaaacacgaagaaggatgcaaatgagattcatgaaaagaacctg ctgaaggccctgaggaagctggataattacttaaatagccctctgcctgatgaaatagat gcctacagcaccgaggatgtcactgtttctggaaggaagtttctggatggggacgagctg acgctggctgactgcaacctcttacccaagctccatattattaagattgtggccaagaag tacagagattttgaatttccttctgaaatgactggcatctggagatacttgaataatgct tatgctagagatgagttcacaaatacgtgtccagctgatcaagagattgaacacgcatat tcagatgttgcaaaaagaatgaaatga >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_2|363_aa MEEEVEVDENPGTGFSIGVSTLQMFRRRISTCLEVFLVSASCLQYRGPQIHDTSPGAPQE SLAEDLTIPSKTKHCCPGSATVTSLQLSSGHNLQIFLAIQTFTIIRINACRTKEPDTWVQ TCPTIYSESLVQIFRGMRGLERAFIHTRLKLQQVPDAQNNQPFPPCGPHPRDNGNVFVAS WNEKSIFLLSLHPAASHKLWAVPNTAQGFGVDGTKDDNSVNRQHPRREKSQREKQLPKPQ HLVSTFSALFGFGQDACGPNMAVSTWLSVFYSSPSSSSSFFFFIFWIAEVEAIALALMEL TVGIRYRMNDVIKGKCQIMAQTVSLGDCLKDAEAAMGQQCFVEKMGIVLSLEGCCFWNKL KDN >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_2|1092_bp atggaggaggaagttgaagtggatgagaatccaggaacagggtttagcattggggtgagc acattgcaaatgttccgccggcgaatcagcacctgcctggaggtgttcttggtgtctgcg tcctgcctccagtacagaggacctcaaatccatgatacttcacctggtgctccccaggaa agcctggcagaggacctgacgattcccagcaagaccaagcactgctgtccagggtcagcc acggtcacttctctgcagctgtcatccgggcacaatctccaaatatttctagccattcag acgttcactataatcagaattaacgcatgtagaaccaaagagccagacacatgggttcaa acctgccccaccatttacagtgagagcttagttcagatcttcaggggtatgcgtggcctg gagagggccttcatccacacacgcttaaagcttcagcaagttcctgatgcccagaacaac caacccttcccgccctgcggcccccaccctcgggataatgggaatgttttcgtagcaagt tggaacgagaaatcaattttcctgctgagccttcatcctgctgcttcccataagctgtgg gcagttcccaacactgcacaaggatttggtgttgatggaaccaaggatgataattctgtc aacaggcagcacccacgaagggagaagtcccaacgagaaaagcagctgcctaaaccacag catctggtgtcaacattctctgccctctttggctttggccaagatgcctgtggtcccaac atggcagtcagcacatggctgtctgtgttttactcttctccttcctcctcctcctccttc tttttcttcatcttttggatcgcagaagtggaagccatagctcttgccctcatggagctt acagttggtatacgatatagaatgaatgatgtaataaaaggcaagtgccaaattatggcc cagacagtgagtcttggtgattgtctgaaggatgcagaggctgccatgggccagcagtgc tttgtggagaagatgggaattgtactaagccttgaaggttgctgcttctggaataagctc aaggacaactaa >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_3|59_aa MGCGASCIPFLCLALTQASPLHYSPAPDELSLHPRVNFQRKEPHLWWEELHKIGCQSLD >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_3|180_bp atgggctgtggtgcatcctgcatccccttcctgtgcctggcgctgactcaggcctcgcct ctgcattactcccctgcaccagatgagctcagcctgcaccctagagtcaatttccagaga aaagaaccacatctctggtgggaagaactgcataagatcggatgccagtccttggattag >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_4|568_aa MRIPVDASTSRRFTPPSTALSPGKMSEALPLGAPDAGAALAGKLRSGDRSMVEVLADHPG ELVRTDSPNFLCSVLPTHWRCNKTLPIAFKVVALGDVPDGTLVTVMAGNDENYSAELRNA TAAMKNQVARFNDLRFVGRSGRGHPVTPWWPPLEPRAADFQQHTQDQCWGHLCTDQSLCW PHQTKARKSFTLTITVFTNPPQVATYHRAIKITVDGPREPRRHRQKLDDQTKPGSLSFSE RLSELEQLRRTAMRVSPHHPAPTPNPRASLNHSTAFNPQPQSQMQDREIEIQRLKIAQYL KDSCGSCHVMTTLPGARALGLPQQSENHKFLKNRYEEERLFLLKDMDPLRNSIQWCYTRQ IQPSPPWSYDQSYQYLGSIASPSVHPATPISPGRASGMTTLSAELSSRLSTAPDLTAFSD PRQFPALPSISDPRMHYPGAFTYSPTPVTSGIGIGMSAMGSATRYHTYLPPPYPGSSQAQ GGPFQASSPSYHLYYGASAGSYQFSMVGGERSPPRILPPCTNASTGSALLNPSLPNQSDV VEAEGSHSNSPTNMAPSARLEEAVWRPY >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_4|1707_bp atgcgtatccccgtagatgccagcacgagccgccgcttcacgccgccttccaccgcgctg agcccaggcaagatgagcgaggcgttgccgctgggcgccccggacgccggcgctgccctg gccggcaagctgaggagcggcgaccgcagcatggtggaggtgctggccgaccacccgggc gagctggtgcgcaccgacagccccaacttcctctgctccgtgctgcctacgcactggcgc tgcaacaagaccctgcccatcgctttcaaggtggtggccctaggggatgttccagatggc actctggtcactgtgatggctggcaatgatgaaaactactcggctgagctgagaaatgct accgcagccatgaagaaccaggttgcaagatttaatgacctcaggtttgtcggtcgaagt ggaagaggccaccctgtgaccccgtggtggccccctctggaaccaagagctgccgacttc cagcagcacacacaggatcagtgctggggccatctgtgcactgaccaaagcctctgctgg cctcaccagaccaaggccaggaaaagcttcactctgaccatcactgtcttcacaaaccca ccgcaagtcgccacctaccacagagccatcaaaatcacagtggatgggccccgagaacct cgaagacatcggcagaaactagatgatcagaccaagcccgggagcttgtccttttccgag cggctcagtgaactggagcagctgcggcgcacagccatgagggtcagcccacaccaccca gcccccacgcccaaccctcgtgcctccctgaaccactccactgcctttaaccctcagcct cagagtcagatgcaggacagagaaatcgaaattcagaggttaaaaattgcccagtatctc aaagattcgtgtggctcttgccatgtcatgaccacactcccgggggctcgagctctgggc ttgcctcagcagtctgagaaccacaagtttctaaagaacaggtatgaggaggagagactt ttcttgctcaaagacatggatccactcagaaacagcattcaatggtgctatacaaggcag atccaaccatccccaccgtggtcctacgatcagtcctaccaatacctgggatccattgcc tctccttctgtgcacccagcaacgcccatttcacctggacgtgccagcggcatgacaacc ctctctgcagaactttccagtcgactctcaacggcacccgacctgacagcgttcagcgac ccgcgccagttccccgcgctgccctccatctccgacccccgcatgcactatccaggcgcc ttcacctactccccgacgccggtcacctcgggcatcggcatcggcatgtcggccatgggc tcggccacgcgctaccacacctacctgccgccgccctaccccggctcgtcgcaagcgcag ggaggcccgttccaagccagctcgccctcctaccacctgtactacggcgcctcggccggc tcctaccagttctccatggtgggcggcgagcgctcgccgccgcgcatcctgccgccctgc accaacgcctccaccggctccgcgctgctcaaccccagcctcccgaaccagagcgacgtg gtggaggccgagggcagccacagcaactcccccaccaacatggcgccctccgcgcgcctg gaggaggccgtgtggaggccctactga >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_5|203_aa MLARGPARVRTPFRSHTQACAAPRVATVRRPGKRKQAPPARGGPGRRAAPGEARPREPLQ AGAVAATRLPNRAAARGRPQILRGRPGPGLRFQGGGLGCCGRCRAGTVPGRCRGPGRRRG PGLTGVEAFPGRARTARSCRDAPRALADARGQQPPSLDAARSPRSSRNAPDGAGLCGPPR GLRVAREELAFEDKRQEAAPEPT >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_5|612_bp atgctggcccggggccccgcccgcgtgcggacccctttccgcagccacacgcaggcttgt gcggctccgcgagtggccacggtccggagacctggaaaaagaaagcaggccccgccggcc cgaggaggacccggccggcgcgccgcacccggagaggcccggccccgcgagccgctgcag gcaggcgcagtggccgccacgaggctcccgaaccgggctgcagcccgcggacggccccag atcctgcgcggccgcccagggccaggcctccgcttccagggcggggggctagggtgctgt ggccgctgccgcgcagggactgtccccgggcgttgccgcgggcccggacgcaggaggggg ccggggttgactggcgtggaggcctttcccgggcgggcccggactgcgcggagctgtcgg gacgcgccgcgggctctggcggacgccagggggcagcagccgccctccctggacgccgcg cgcagtccccggagctcccggaacgcccccgacggcgcggggctgtgcggcccgcctcgt ggccttcgggtcgcccgggaagaactagcgttcgaggataaaagacaggaagccgcccca gagcccacttga >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_6|428_aa MTGVLIRRGKPWEDTDTMGESLVISKAETEVMQLQAKECQGLLATIKNWEECPSAELLVH EYCEMRPCYMLQVAPQVIAISAARSPPQQISLLYMGLMLGQKPARFLAIGMTEQGVIGCQ QDLNGVTQSWVQAHRGRTRDYDDDDDDDDDDGDDDCYFCFTGGKAEAQSGDVTCLKSHGY KVVTKPRYIKVMEPPQLRNGPFGLQISGSGGTLGGFYSCTAWYWCHQPHVGKKHVAETAR KCNLHTRSQRLGKEKESLQTGRNGYQNNSNSEKLLRELRAGVDNSLEGADWRFRGLPAAP GKGMWLTFEQTKEGGSVCTRICAHPPGDLLVDPSLMAEQQGDPLEKAMGEVTERATQKAT ARNGGDASGPDRKGGGGAALVRSAPQRRNFCKSCLPARYQRRAGLWFSRSRNPALTAGLF FDKQDASI >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_6|1287_bp atgactggtgttctcataagaagaggaaagccatgggaagacacagacacaatgggagaa agccttgtgatatcaaaggcagagactgaagtgatgcagttacaagccaaggagtgtcaa ggattgctggcaaccatcaaaaactgggaagagtgcccttctgcagagctgttagtgcat gagtactgtgagatgcgtccttgttacatgctacaagtagcccctcaggtcattgccatc agtgcagcaaggagcccccctcaacaaatctctcttctttatatggggctgatgcttggc caaaagcctgcaaggtttcttgcaatagggatgacagagcagggagtaataggctgccag caggacctcaatggagtgacccagtcttgggttcaggcccatcggggcagaactagggat tacgatgatgatgatgatgatgatgatgatgatggtgatgatgactgttatttctgtttt acaggtggaaaagcagaggcacaaagcggtgatgtgacttgcctgaagtcacacggctac aaagtggtgacaaaaccaagatacatcaaggtcatggagcccccacagctgaggaatggt ccttttggcctgcagatctcagggtctggaggtaccttaggtggcttctacagctgcact gcttggtattggtgccaccagccccatgtgggcaagaagcatgtggctgagacagctcga aagtgtaacctgcacaccagatctcaaagactcgggaaagagaaagagtctctgcagaca ggaagaaatggctaccagaataacagcaactctgagaagttattacgagagctgagagcg ggtgtagacaatagtcttgaaggagctgattggcggttccgtggtttaccagcagcaccg ggcaaaggaatgtggcttacctttgaacagacgaaagagggtggaagcgtctgcaccagg atctgcgcgcatccgcccggggacttgttggtggatccatccctcatggcggagcagcaa ggggatcctttagaaaaagcaatgggcgaagtaactgaaagagcgacgcagaaagcaaca gccagaaacggcggggacgcgagcggcccagacaggaagggaggcggtggcgcagctctg gtgcgcagcgcgccgcagcgacggaacttctgcaaaagctgcctgcccgcgcgttatcag cggcgcgcaggcctgtggttttctcgctctcgcaaccctgctttaactgccggtttattt ttcgacaaacaggatgcctccatctga >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_7|36_aa MLKDAIINDFLPAKPAHSQPWIMASTVTFRPNLGSS >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_7|111_bp atgctgaaggatgcaatcataaatgactttctccccgcaaagcctgctcacagccagcca tggataatggcctccactgtcaccttcagacccaacttgggttcttcctga >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_8|315_aa MQIIDVWNCYLEIEKQELQMTPAIVELNHSLEEESKFMWVMHYTKGEWSHLPGPFRPGFL SLNYLDAPLELTDVILLYKELRSTQRPVYLCNPNSPECSSSQIPQHFVLFKKTGKCKVPF SQFAQIPKNEPDSPAFGSPQIPRQPGLSHSADSLMAPKSVKRKGLPLQGHLLISSPFPKQ DETVCRARKKDVVDIAMCQFQALRPAEAVHISAYFLCHFHVNVTVLASWRMRDHMECSCH PAINAPGLSFGNRAEKNNTTASSPGPESSFTYAMIDVTKDLTLPELMPSILNDGIHILSE QRLDASVELLLNKQQ >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_8|948_bp atgcaaatcatagatgtatggaactgctatctggagatagaaaaacaggagttgcagatg actcctgccatcgtggaactaaaccacagtctagaagaggagagcaagtttatgtgggtt atgcattacactaaaggtgagtggtcccatctgccaggcccctttaggcctggatttctc tccctcaactaccttgatgcccctctggaactgacagatgtcattctgctgtataaagaa ctaagaagcactcagagacctgtctacctttgtaacccaaacagccccgagtgctcaagc agtcagattccacaacattttgtactctttaagaagactggaaaatgcaaagtgccattt agccagtttgcacagattccaaaaaatgaaccagatagtcctgctttcggatcaccccag atcccccgccaacctggcctcagccattctgctgatagcctgatggcgcccaagtcggtc aagaggaaaggcctgcccctccaaggccaccttctaatctctagcccctttcctaagcaa gatgagacagtgtgcagggccaggaaaaaggatgtagttgatatagcaatgtgccagttc caagctttaagacctgcagaggctgtacacatttctgcttactttctctgtcacttccat gtgaacgtgactgtgctagcatcctggaggatgagagaccacatggagtgtagctgccac ccagcaattaacgcgcctgggctctcctttggaaaccgagcagagaagaacaataccaca gcgagttcgcctggacccgaaagcagtttcacttatgccatgatagacgttaccaaggac ttaactctcccggagctgatgcctagcattttaaatgatgggatccacatcctgtcggag cagcggcttgatgccagcgttgaattactattgaataagcagcaatga >gi568815577r:34692138_35148899|GENSCAN_predicted_peptide_9|202_aa MGARRSDLAKEEDMPVERSNPVSKPSYNLANSGNSQFWVSNLKETNLTHKDSHKLKFFSS SSSIGITTGALGNFNLQKSFGLCPPGLATGKPLHAPPNKETDTASLVSTGKLDIGRPSCT KTPSCERLGEDFALDTQGSLWEGARGAKKSKTKEALFPPEPHSSTRSEQLARSDSHLLHL AFPCVLTEGLSESLLFRLIIIQ >gi568815577r:34692138_35148899|GENSCAN_predicted_CDS_9|609_bp atgggggctaggagaagtgacttggctaaagaagaggacatgccagtagagagatctaac ccagtgtccaagccatcttataatcttgccaactctggaaacagccagttctgggtttcc aacctgaaggagactaacctaacacataaggactcacataaacttaagtttttctcttct tcttcttcaattggaatcaccactggagcattgggtaacttcaacttgcaaaagagcttt ggcctatgtccaccagggctggccactgggaagccactccatgctccacccaacaaggag acagacacagccagcctggtgagcacaggcaaactggacattggaagacccagctgcaca aagactccttcctgtgagaggcttggagaagactttgctctagacacacaagggagcctg tgggaaggtgccaggggagccaagaagagcaaaaccaaggaggcattgtttcctccagag cctcattcatcaactcgctctgaacagttagcacgctcagacagtcatcttctgcacctt gcctttccctgtgtcttgactgagggcttatctgagagccttttgttcaggctcataatt attcagtga