GENSCAN 1.0	Date run: 6-Nov-116	Time: 14:43:03

Sequence gi568815586r:111544202_111785792 : 241591 bp : 46.31% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3668   3760   93  0  0   91   94    47 0.750   5.54
 1.02 Term +  26888  26901   14  0  2  121   47     2 0.014  -2.24
 1.03 PlyA +  28416  28421    6                               1.05

 2.00 Prom +  31529  31568   40                              -3.06
 2.01 Sngl +  37066  37617  552  0  0   28   39   515 0.707  36.72
 2.02 PlyA +  37722  37727    6                               1.05

 3.04 PlyA -  43656  43651    6                               1.05
 3.03 Term -  48244  48206   39  1  0   79   49    49 0.471  -2.71
 3.02 Intr -  53792  53711   82  1  1  104   68    76 0.915   6.84
 3.01 Init -  54833  54583  251  2  2   86   94   523 0.601  49.64
 3.00 Prom -  59903  59864   40                              -5.96

 4.08 PlyA -  60830  60825    6                               1.05
 4.07 Term - 100361  99998  364  1  1   74   43   507 0.995  38.84
 4.06 Intr - 105841 105738  104  1  2  103   56    94 0.960   6.67
 4.05 Intr - 111454 111365   90  1  0   72  100    60 0.980   5.79
 4.04 Intr - 114644 114535  110  0  2   58   94    88 0.999   6.40
 4.03 Intr - 115144 115006  139  1  1   86   53   101 0.998   6.44
 4.02 Intr - 116474 116399   76  1  1   76  100    72 0.998   6.72
 4.01 Init - 121568 121438  131  2  2   76   94   149 0.993  13.92
 4.00 Prom - 123192 123153   40                              -5.56

 5.08 PlyA - 125402 125397    6                               1.05
 5.07 Term - 125858 125649  210  2  0    5   41   171 0.318   1.39
 5.06 Intr - 128573 128460  114  2  0   63   95    94 0.493   8.24
 5.05 Intr - 135080 134950  131  0  2   64   98     7 0.716  -0.29
 5.04 Intr - 139106 138945  162  0  0   66   85   106 0.966   8.05
 5.03 Intr - 141636 141510  127  0  1   92  100   128 0.943  14.65
 5.02 Intr - 148127 148061   67  1  1  105   64     3 0.476  -1.49
 5.01 Init - 150689 150682    8  2  2  103  101     0 0.595   3.30
 5.00 Prom - 152041 152002   40                              -3.36

 6.00 Prom + 152281 152320   40                              -6.16
 6.01 Init + 155436 155487   52  2  1   82   59    35 0.781   1.35
 6.02 Intr + 157961 158109  149  0  2   69  101    30 0.823   2.35
 6.03 Intr + 161537 161731  195  1  0  102   91    99 0.965  11.21
 6.04 Intr + 165325 165483  159  0  0   77   85   137 0.999  12.48
 6.05 Intr + 168297 168456  160  2  1   40   91   125 0.997   7.56
 6.06 Intr + 171620 171761  142  0  1   84  111    70 0.999   8.31
 6.07 Intr + 177470 177538   69  2  0  105   98    55 0.988   6.50
 6.08 Intr + 183761 183942  182  2  2  107  105   172 0.999  20.31
 6.09 Intr + 185605 185755  151  2  1   90   76   115 0.782   9.72
 6.10 Intr + 189722 189867  146  2  2   80   75   142 0.999  12.03
 6.11 Intr + 192630 192803  174  1  0  127   95    41 0.976   8.61
 6.12 Intr + 200442 200842  401  1  2   69   98   181 0.717  11.42
 6.13 Intr + 201943 202083  141  0  0   69   67    93 0.964   5.85
 6.14 Intr + 202848 202985  138  2  0  113   98   161 0.990  20.26
 6.15 Intr + 203094 203184   91  2  1   87   68    70 0.973   4.47
 6.16 Intr + 204116 204274  159  0  0  109   94   170 0.994  19.66
 6.17 Intr + 204972 205144  173  1  2   70   77   111 0.995   7.86
 6.18 Intr + 209571 209714  144  2  0  107   97   251 0.631  28.38
 6.19 Intr + 211467 211544   78  2  0  117   99    43 0.996   8.05
 6.20 Term + 212132 212272  141  1  0   92   43   298 0.943  23.63
 6.21 PlyA + 218618 218623    6                               1.05

 7.00 Prom + 222302 222341   40                              -2.96
 7.01 Init + 222782 222895  114  1  0   60   99   224 0.981  18.93
 7.02 Intr + 223155 223348  194  2  2  105   45     8 0.404  -3.51
 7.03 Intr + 229273 229361   89  1  2   78   76    47 0.528   2.11
 7.04 Intr + 230577 230683  107  1  2   68   67    64 0.388   2.23
 7.05 Intr + 237717 237821  105  2  0   98  100    75 0.912  10.11
 7.06 Intr + 238957 239097  141  0  0   81  101   214 0.999  22.55
 7.07 Intr + 241066 241145   80  0  2  119   72   154 0.763  15.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:111544202_111785792|GENSCAN_predicted_peptide_1|35_aa XLFVEKSFNNHGTYWLGTVVHAYNPSNLGGPGEET >gi568815586r:111544202_111785792|GENSCAN_predicted_CDS_1|108_bp nctttatttgtggagaaatcctttaacaaccatggaacttactggctgggcacagtggtt catgcctacaatcccagcaatttgggaggcccaggggaagaaacttaa >gi568815586r:111544202_111785792|GENSCAN_predicted_peptide_2|183_aa MSTKKPPEPEKSKLPGKGRGPLRTTPATRPQLVFPGHHEPQCPNLFTPANIGHPPNYEML KEEQEVAVLGAPHNPAPPMSTVIPIHSENSVPDHVVWSLFNTHFMNSCCLGFIALAYSVK SRDKKMVGDLIRAQAYAFTAKCLNIWALIVRIITTILLSIIPVLIFQVYQQIRRHYLGQE LCP >gi568815586r:111544202_111785792|GENSCAN_predicted_CDS_2|552_bp atgagtactaaaaaaccaccagagccagagaaatccaaactaccagggaaagggaggggc ccactgagaaccaccccagcaacccgaccacagctggtcttccctggacaccatgaacca caatgtccaaacctcttcactcctgccaacatcggccatccccctaactatgagatgctc aaggaggagcaagaggtggctgtgctgggggcaccccacaaccctgctcccccaatgtcc accgtgatccccatccacagcgagaactccgtgcccgaccatgtcgtctggtccctgttc aacacccacttcatgaactcctgctgcctgggcttcatagcattggcctactccgtgaag tctagggacaagaagatggttggcgacctgatcagggcccaggcctatgcctttaccgcc aagtgcctgaacatctgggccctgattgtgcgcatcattacgaccattctgctcagcatc atcccagtgttgatctttcaagtctatcaacagatcaggaggcattatctaggccaggag ctctgcccgtga >gi568815586r:111544202_111785792|GENSCAN_predicted_peptide_3|123_aa MSLKPQQQQQQQQQQQQQQQQQQQQQQQPPPAAANVRKPGGSGLLASPAAAPSPSSSSVS SSSATAPSSVVAATSGGGRPGLGRPLRRAGGGRSALLFEAAVEKAPVEHRWDIWVIYDVT VTI >gi568815586r:111544202_111785792|GENSCAN_predicted_CDS_3|372_bp atgtcgctgaagccccagcagcagcagcagcagcagcagcagcagcagcagcagcaacag cagcagcagcagcagcagcagcagccgccgcccgcggctgccaatgtccgcaagcccggc ggcagcggccttctagcgtcgcccgccgccgcgccttcgccgtcctcgtcctcggtctcc tcgtcctcggccacggctccctcctcggtggtcgcggcgacctccggcggcgggaggccc ggcctgggcagacctctccggcgcgcgggtggtggccgatccgcattgctgttcgaggcc gcagtggagaaggcgcctgtggaacatcggtgggatatctgggtcatctatgatgttact gttaccatctaa >gi568815586r:111544202_111785792|GENSCAN_predicted_peptide_4|337_aa 
MDLTELPKCTVCLERMDESVNGILTTLCNHSFHSQCLQRWDDTTCPVCRYCQTPEPVEEN KCFECGVQENLWICLICGHIGCGRYVSRHAYKHFEETQHTYAMQLTNHRVWDYAGDNYVH RLVASKTDGKIVQYECEGDTCQEEKIDALQLEYSYLLTSQLESQRIYWENKIVRIEKDTA EEINNMKTKFKETIEKCDNLEHKLNDLLKEKQSVERKCTQLNTKVAKLTNELKEEQEMNK CLRANQVLLQNKLKEEERVLKETCDQKDLQITEIQEQLRDVMFYLETQQKINHLPAETRQ EIQEGQINIAMASASSPASSGGSGKLPSRKGRSKRGK >gi568815586r:111544202_111785792|GENSCAN_predicted_CDS_4|1014_bp atggacctgactgaactccccaagtgcacggtgtgtctggagcgcatggacgagtctgtg aatggcatcctcacaacgttatgtaaccacagcttccacagccagtgtctacagcgctgg gacgataccacgtgtcctgtttgccggtactgtcaaacgcccgagccagtagaagaaaat aagtgttttgagtgtggtgttcaggaaaatctttggatttgtttaatatgcggccacata ggatgtggacggtatgtcagtcgacatgcttataagcactttgaggaaacgcagcacacg tatgccatgcagcttaccaaccatcgagtctgggactatgctggagataactatgttcat cgactggttgcaagtaaaacagatggaaaaatagtacagtatgaatgtgagggggatact tgccaggaagagaaaatagatgccttacagttagagtattcatatttactaacaagccag ctggaatctcagcgaatctactgggaaaacaagatagttcggatagagaaggacacagca gaggaaattaacaacatgaagaccaagtttaaagaaacaattgagaagtgtgataatcta gagcacaaactaaatgatctcctaaaagaaaagcagtctgtggaaagaaagtgcactcag ctaaacacaaaagtggccaaactcaccaacgagctcaaagaggagcaggaaatgaacaag tgtttgcgagccaaccaagtcctcctgcagaacaagctaaaagaggaggagagggtgctg aaggagacctgtgaccaaaaagatctgcagatcaccgagatccaggagcagctgcgtgac gtcatgttctacctggagacacagcagaagatcaaccatctgcctgccgagacccggcag gaaatccaggagggacagatcaacatcgccatggcctcggcctcgagccctgcctcttcg gggggcagtgggaagttgccctccaggaagggccgcagcaagaggggcaagtga >gi568815586r:111544202_111785792|GENSCAN_predicted_peptide_5|272_aa MPRPLSNSFFTLQSEYSLKIVNLGQPRLSPPGPAPASACPMSVSLVVIRLELAEHSPVPA GFGFSAAAGEMSDEEIKKTTLASAVACLEGKSPGEKVAIIHQHLGRREMTDVIIETMKSN PVPAAMTSHDLMKFVAPFNEVIEQMKIIRDSTPNQYMVLIKFRAQADADSFYMTCNGRQF NSIEDDVCQLVYVERAEVLKSEDEQQLLMKMKIVNQRKSPEAKMKMKNIGRDTPTSAGPN SFNKRKHGFFDNQKLWERNIKSYVGNVNDQDN >gi568815586r:111544202_111785792|GENSCAN_predicted_CDS_5|819_bp atgcccagacctctctccaattctttcttcaccctccagtcagagtattctttaaaaatt gtaaatctgggccagcctcgcctgagcccgccggggcccgcgccggccagcgcctgccct 
atgagtgtgtcactggttgttatccgattggagctcgcggaacactcgcctgtccccgcc ggcttcggcttcagcgccgcggccggggaaatgtctgatgaggagataaaaaagacgaca ctagcctcagctgtagcctgtttagaaggcaagtcaccaggagagaaagtagcgattatc catcagcatctcggccgtcgagaaatgacagatgtgatcattgagaccatgaagtccaac ccagtccctgctgcaatgaccagtcatgaccttatgaagtttgttgccccatttaacgaa gtaattgaacaaatgaaaattatcagagactctactcccaaccaatatatggtgctgata aagtttcgtgcacaggctgatgcggatagtttttatatgacatgcaatggccgccagttc aactcaatagaagatgacgtttgccagctagtgtatgtggaaagagctgaagtgctcaaa tctgaagatgagcagcagcttttaatgaagatgaagatagtgaaccagaggaaatctcca gaagcaaagatgaagatgaagaatattggaagggatacaccaacatcagctggaccaaac tccttcaataaaagaaagcatgggttttttgataaccagaagctatgggagcgaaatata aaatcttatgttggaaatgtcaatgaccaagacaattaa >gi568815586r:111544202_111785792|GENSCAN_predicted_peptide_6|1014_aa MVDVMAYGYDPSTLGGREWEVQNRIPSGTILKALMEGGENGPWMRFMRAEITAEGFLREF GRLCSEMLKTSVPVDSFFSLLTSERVAKQFPVMTEAITQIRAKGLQTAVLSNNFYLPNQK SFLPLDRKQFDVIVESCMEGICKPDPRIYKLCLEQLGLQPSESIFLDDLGTNLKEAARLG IHTIKVNDPETAVKELEALLGFTLRVGVPNTRPVKKTMEIPKDSLQKYLKDLLGIQTTGP LELLQFDHGQSNPTYYIRLANRDLVLRKKPPGTLLPSAHAIEREFRIMKALANAGVPVPN VLDLCEDSSVIGTPFYVMEYCPGLIYKDPSLPGLEPSHRRAIYTAMNTVLCKIHSVDLQA VGLEDYGKQGDYIPRQVRTWVKQYRASETSTIPAMERLIEWLPLHLPRQQRTTVVHGDFR LDNLVFHPEEPEVLAVLDWELSTLGDPLADVAYSCLAHYLPSSFPVLRGINDCDLTQLGI PAAEEYFRMYCLQMGLPPTENWNFYMAFSFFRVAAILQGVYKRSLTGQASSTYAEQTGKL TEFVSNLAWDFAVKEGFRVFKEMPFTNPLTRSYHTWARPQSQWCPTGSRSYSSVPEASPA HTSRGGLVISPESLSPPVRELYHRLKHFMEQRVYPAEPELQSHQASAARWSPSPLIEDLK EKAKAEGLWNLFLPLEADPEKKYGAGLTNVEYAHLCELMGTSLYAPEVCNCSAPDTGNME LLVRYGTEAQKARWLIPLLEGKARSCFAMTEPQVASSDATNIEASIREEDSFYVINGHKW WITGILDPRCQLCVFMGKTDPHAPRHRQQSVLLVPMDTPGIKIIRPLTVYGLEDAPGGHG EVRFEHVRVPKENMVLGPGRGFEIAQGRLGPGRIHHCMRLIGFSERALALMKARVKSRLA FGKPLVEQGTVLADIAQSRVEIEQARLLVLRAAHLMDLAGNKAAALDIAMIKMVAPSMAS RVIDRAIQAFGAAGLSSDYPLAQFFTWARALRFADGPDEVHRATVAKLELKHRI >gi568815586r:111544202_111785792|GENSCAN_predicted_CDS_6|3045_bp atggtggatgtgatggcttatggctacgatcccagcactttgggaggccgagaatgggag gtacagaatcgtatcccttctggaactatattaaaggccttgatggaaggtggtgaaaat 
gggccctggatgagatttatgagagcagaaataacagcagagggttttttacgagaattt gggagactttgctctgaaatgttaaagacctccgtgcctgtggactcatttttctctctg ttgaccagtgagcgagtggcaaagcagttcccagtgatgactgaggccataactcaaatt cgggcaaaaggtcttcagactgcagtcttgagcaataatttttatcttcccaaccagaaa agctttttgcccctggaccggaaacagtttgatgtgattgtggagtcctgcatggaaggg atctgtaagccagaccctaggatctacaagctgtgcttggagcagctcggcctgcagccc tctgagtccatctttcttgatgaccttggaacaaatctaaaagaagctgccagacttggt attcacaccattaaggttaatgacccagagactgcagtaaaggaattagaagctctcttg ggttttacattgagagtaggtgttccaaacactcggcctgtgaaaaagacgatggaaatt ccgaaagattccttgcagaagtacctcaaagacttactgggtatccagaccacaggccca ttggaactacttcagtttgatcacgggcagtcaaatccaacttactacatcaggctggct aatcgtgatctagttctgaggaagaagcccccagggacactccttccatctgcccatgcc atagagagggagttcaggattatgaaagcccttgcaaatgctggagtacctgtccctaac gttcttgatctctgtgaagattcaagtgtcattggcacccccttctatgtgatggagtac tgcccaggtctcatctacaaagacccttccctgccaggcttggagcccagccacagacga gccatatacactgccatgaacacagtcctgtgcaaaattcacagtgtggatctgcaggct gtgggacttgaagactatgggaagcaaggggactatattccacgccaggtacgaacctgg gttaagcagtatcgagcttccgaaactagcaccatcccagccatggagaggctgatcgaa tggctgcccctccatcttccccgtcagcagaggaccacagtggtgcacggggacttcagg ctcgacaacctggtgtttcatccagaagagccagaggtgcttgctgtccttgactgggaa ctttctaccttgggcgacccccttgctgatgtggcctacagctgcctggctcattacctg ccatccagttttcccgtgctgagaggtattaatgactgtgacttgacacagctgggaatc cctgctgcagaggagtatttcaggatgtactgtctccaaatggggctccctcccactgag aactggaacttctatatggctttttcctttttccgtgtggctgcaatcctacagggagtc tacaagcgatcactcacagggcaagcaagctccacatatgcggaacaaactggaaagctg accgaatttgtgtctaacctggcgtgggatttcgcagtcaaagaagggttccgggttttc aaagagatgcccttcacaaatccgttaacaaggtcctaccacacgtgggccaggccccag tcccagtggtgccccacaggcagcaggagttatagctccgttccagaagcttccccagct catacctcaaggggaggtctggttatctctccagagagcctctctccacctgtcagagag ctgtatcaccggctgaagcacttcatggagcaacgtgtgtaccctgcagagccagagctg cagagtcaccaggcctcagcagccaggtggagcccctccccactgatcgaagacctcaag gagaaagccaaagctgaaggactttggaaccttttcctacccttagaggctgatcccgag 
aaaaaatacggagcaggactgaccaatgtggaatatgcacatctgtgtgagctcatgggc acgtccctgtatgcccccgaggtatgtaactgctctgcgcctgacacgggcaacatggag ctgctggtgaggtatggcaccgaagcgcagaaggctcgctggctgattcctctgctggag gggaaagcccgctcctgttttgctatgaccgagccccaggttgcctcttcagatgccacc aacattgaggcttccatcagagaggaggacagcttctatgtcataaacggtcacaaatgg tggatcacaggcatcctggatcctcgttgccaactctgtgtgtttatgggaaaaacagac ccacatgcaccaagacaccggcagcagtctgtgctcttggttcccatggataccccaggg ataaaaatcatccggcctctgacggtgtatggactggaagatgcaccaggtggccatggt gaagtccgatttgagcacgtgcgtgtgcccaaagagaacatggtcctgggccctggccga ggctttgagatcgcccagggcagactgggccccggcaggatccatcactgcatgaggctg atcgggttctcagagagggccctggcactcatgaaggcccgcgtgaagtcccgcttggct tttgggaagcccctggtggagcagggcacagtgctggcggacatcgcgcagtcgcgcgtg gagattgagcaggcacggctgctggtgctgagagctgcccacctcatggacctggcagga aacaaggctgcagccttggatatagccatgattaaaatggtcgccccgtccatggcctcc cgagtgattgatcgtgcgattcaggcctttggagcagcaggcctgagcagcgactaccca ctggctcagttcttcacctgggcccgagccctgcgctttgccgacggccctgacgaggtg caccgggccacggtggccaagctagagctgaagcaccgcatttag >gi568815586r:111544202_111785792|GENSCAN_predicted_peptide_7|277_aa MLRAAARFGPRLGRRLLSAAATQAVPAPNQQPEVFCNQGQLSGFRSPHGPCFRVSAGSPP SPTQKTRPSPSLQELSCLFLSHPLYPLTHALIQGVQKEKPGKRPSTEDFYLHFTFHPTRN SISLVVLLSRLAGPVVLLWALTLPQLLSPSTTTITTKSLPGMENKIAKIFINNEWHDAVS RKTFPTVNPSTGEVICQVAEGDKEDVDKAVKAARAAFQLGSPWRRMDASHRGRLLNRLAD LIERDRTYLAALETLDNGKPYVISYLVDLDMVLKCLR >gi568815586r:111544202_111785792|GENSCAN_predicted_CDS_7|831_bp atgttgcgcgctgccgcccgcttcgggccccgcctgggccgccgcctcttgtcagccgcc gccacccaggccgtgcctgcccccaaccagcagcccgaggtcttctgcaaccagggccaa ctctcggggttccgttctccccatggtccttgctttcgggtctccgcagggtccccaccc tcacccactcagaagacgcgaccaagtccctctttgcaggaactttcctgtcttttcctt agtcatcccctttaccccctgactcatgccctcatccaaggagttcaaaaggaaaaacca ggaaagaggccctctactgaggacttctacctgcatttcacctttcatcctacccgcaac tctataagcctggtagtattgctgtcccgtcttgcgggtccagttgtcctgctctgggca ctgaccctgccccagctcctgagcccctcgaccaccacaatcaccactaagtctctgcct ggcatggagaacaagatagccaagattttcataaacaatgaatggcacgatgccgtcagc 
aggaaaacattccccaccgtcaatccgtccactggagaggtcatctgtcaggtagctgaa ggggacaaggaagatgtggacaaggcagtgaaggccgcccgggccgccttccagctgggc tcaccttggcgccgcatggacgcatcacacaggggccggctgctgaaccgcctggccgat ctgatcgagcgggaccggacctacctggcggccttggagaccctggacaatggcaagccc tatgtcatctcctacctggtggatttggacatggtcctcaaatgtctccgn