GENSCAN 1.0    Date run: 8-Nov-116    Time: 10:02:33

Sequence gi568815576f:29667399_29869716 : 202318 bp : 51.03% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    935   1048  114  1  0  105   99   -21 0.535   1.85
 1.02 Intr +   4428   4550  123  2  0  135   20   162 0.995  15.39
 1.03 Intr +   5871   6088  218  2  2   55   78   394 0.997  32.93
 1.04 Intr +   7438   7543  106  1  1  136  105   173 0.997  24.52
 1.05 Intr +  10798  10925  128  0  2   61   53   132 0.924   6.98
 1.06 Intr +  14041  14203  163  1  1   87   75   175 0.945  16.59
 1.07 Term +  16317  16466  150  2  0   96   48    16 0.408  -3.38
 1.08 PlyA +  16495  16500    6                               1.05

 2.04 PlyA -  17237  17232    6                               1.05
 2.03 Term -  20014  19877  138  1  0   37   49   121 0.548   1.37
 2.02 Intr -  23204  23030  175  1  1   66   31    67 0.399  -0.74
 2.01 Init -  23762  23599  164  2  2   67   63   165 0.768  11.07
 2.00 Prom -  26114  26075   40                              -1.81

 3.00 Prom +  32787  32826   40                              -4.81
 3.01 Init +  35491  35565   75  0  0   86   49   102 0.529   7.14
 3.02 Intr +  39628  39761  134  0  2   76   80    77 0.488   5.65
 3.03 Intr +  39833  39948  116  2  2  118   73   -15 0.340   0.59
 3.04 Intr +  42703  42836  134  2  2   39   99    54 0.198   2.47
 3.05 Intr +  44270  44406  137  1  2   61   76    25 0.304  -1.53
 3.06 Intr +  46661  46730   70  2  1  123   92   -20 0.414   1.58
 3.07 Intr +  47391  47572  182  2  2   46   67   115 0.505   4.38
 3.08 Intr +  52927  53188  262  1  1   88   72   296 0.070  26.03
 3.09 Intr +  53254  53733  480  0  0    8  -15   259 0.330   1.42
 3.10 Intr +  55343  55451  109  1  1   62   60    73 0.384   2.26
 3.11 Intr +  60077  60184  108  0  0   53   19   101 0.409   0.46
 3.12 Intr +  60264  60407  144  1  0   79   89   342 0.979  34.16
 3.13 Intr +  61232  61344  113  0  2   94   63   132 0.998  11.90
 3.14 Intr +  61657  61810  154  0  1   40   70   406 0.927  34.16
 3.15 Term +  62044  62171  128  2  2   30   42   314 0.909  19.75
 3.16 PlyA +  64409  64414    6                              -0.45

 4.06 PlyA -  64483  64478    6                              -0.45
 4.05 Term -  66627  66465  163  2  1   82   53    83 0.705   1.92
 4.04 Intr -  71043  70932  112  1  1  123   92    76 0.945  11.44
 4.03 Intr -  73332  73252   81  1  0   90   83   132 0.992  13.01
 4.02 Intr -  75082  75020   63  2  0  111  103    74 0.983  10.38
 4.01 Init -  81146  81020  127  2  1   99   90   335 0.984  35.08
 4.00 Prom -  82420  82381   40                              -1.51

 5.00 Prom +  90317  90356   40                               0.49
 5.01 Init +  91537  91585   49  0  1   99  103    28 0.983   6.46
 5.02 Intr +  94938  95009   72  1  0   10  105    80 0.472   1.77
 5.03 Term +  95053  95126   74  2  2   53   49    82 0.818  -0.84
 5.04 PlyA +  95335  95340    6                               1.05

 6.00 Prom +  96376  96415   40                              -4.51
 6.01 Init + 100001 100150  150  1  0  101   60   399 0.939  38.41
 6.02 Intr + 106105 106165   61  0  1  108   71    60 0.018   5.10
 6.03 Intr + 108365 108484  120  0  0   62   75    34 0.006   0.47
 6.04 Term + 117741 117805   65  1  2  115   46    52 0.136   2.04
 6.05 PlyA + 120372 120377    6                               1.05

 7.21 PlyA - 120690 120685    6                               1.05
 7.20 Term - 121786 121615  172  0  1   96   53   203 0.891  15.21
 7.19 Intr - 123150 123071   80  0  2  109   84    24 0.934   2.94
 7.18 Intr - 125137 125035  103  0  1   88   77   144 0.879  13.98
 7.17 Intr - 126092 125962  131  2  2  102   94   318 0.977  33.80
 7.16 Intr - 126278 126179  100  1  1   93   80   215 0.890  21.81
 7.15 Intr - 133712 133593  120  1  0   98  105   185 0.993  21.31
 7.14 Intr - 134810 134596  215  2  2   75   75   380 0.998  33.44
 7.13 Intr - 137432 137240  193  1  1  -12   87   215 0.121  11.42
 7.12 Intr - 139155 139087   69  2  0   98  105   121 0.999  13.59
 7.11 Intr - 139506 139399  108  2  0  102  115    56 0.969   9.60
 7.10 Intr - 140787 140713   75  2  0   65   61    68 0.645   0.92
 7.09 Intr - 146144 146032  113  2  2   46  110    50 0.902   2.68
 7.08 Intr - 147369 147259  111  0  0   77   95    42 0.956   4.88
 7.07 Intr - 148675 148608   68  2  2  125   81    49 0.997   7.02
 7.06 Intr - 155066 154937  130  2  1   84   75   151 0.958  14.17
 7.05 Intr - 157859 157689  171  2  0   99   80   234 0.999  24.35
 7.04 Intr - 158382 158224  159  0  0   52   92   155 0.980  13.00
 7.03 Intr - 160790 160720   71  0  2   38   48    21 0.024  -7.51
 7.02 Intr - 164944 164847   98  0  2  101   93   121 0.614  14.05
 7.01 Intr - 170874 170780   95  0  2   28  105   180 0.620  12.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 105369 105416   48  2  0   38  119    35 0.864   2.51
S.002 Term + 106105 106260  156  0  0  108   43   114 0.947   7.15
S.003 Intr - 137401 137240  162  1  0   53   87   184 0.862  15.49
S.004 Term - 138892 138772  121  0  1  103   42   157 0.985  10.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:29667399_29869716|GENSCAN_predicted_peptide_1|333_aa ILQLCIGNHDLFMRRRKADSLEVQQMKAQAREEKARKQMERQRLAREKQMREEAERTRDE LERRLLQMKEEATMANEALMRSEETADLLAEKAQITEEEAKLLAQKAAEAEQEMQRIKAT AIRTEEEKRLMEQKVLEAEVLALKMAEESERRAKEADQLKQDLQEAREAERRAKQKLLEI ATKPTYPPMNPIPAPLPPDIPSFNLIGDSLSFDFKDTDMKRLSMEIEKEKVEYMEKSKHL QEQLNELKTEIEALKLKERETALDILHNENSDRGGSSKHNTIKKAEQRCGQSELTLEQYT HGTLERSGHWLLSELTECLLLFSCNSSLFLQHA >gi568815576f:29667399_29869716|GENSCAN_predicted_CDS_1|1002_bp attctccagctatgtatcgggaaccatgatctatttatgaggagaaggaaagccgattct ttggaagttcagcagatgaaagcccaggccagggaggagaaggctagaaagcagatggag cggcagcgcctcgctcgagagaagcagatgagggaggaggctgaacgcacgagggatgag ttggagaggaggctgctgcagatgaaagaagaagcaacaatggccaacgaagcactgatg cggtctgaggagacagctgacctgttggctgaaaaggcccagatcaccgaggaggaggca aaacttctggcccagaaggccgcagaggctgagcaggaaatgcagcgcatcaaggccaca gcgattcgcacggaggaggagaagcgcctgatggagcagaaggtgctggaagccgaggtg ctggcactgaagatggctgaggagtcagagaggagggccaaagaggcagatcagctgaag caggacctgcaggaagcacgcgaggcggagcgaagagccaagcagaagctcctggagatt gccaccaagcccacgtacccgcccatgaacccaattccagcaccgttgcctcctgacata ccaagcttcaacctcattggtgacagcctgtctttcgacttcaaagatactgacatgaag cggctttccatggagatagagaaagaaaaagtggaatacatggaaaagagcaagcatctg caggagcagctcaatgaactcaagacagaaatcgaggccttgaaactgaaagagagggag acagctctggatattctgcacaatgagaactccgacaggggtggcagcagcaagcacaat accattaaaaaggctgagcagagatgtggtcagagtgaactcacattggaacagtacact cacggcaccctggagaggagcggtcactggttgctgagtgaattaaccgaatgtttgctg cttttctcctgcaacagttcattattcctccagcatgcctag >gi568815576f:29667399_29869716|GENSCAN_predicted_peptide_2|158_aa MSILKVIQEEEVSKFQQVMKQKHLRVYNQGLEDSGCTSLGLAKSGQLPLNPSEDKSMRLY
CYLTVHLHIPPLIHLLGPGFYFLPNIETQPRQGPFSPAHLSIKLPAVCDSQQAGSLHKLK MTLDFYVFLIQEKLLALEINVKAPLLPVPPSVETTAGP >gi568815576f:29667399_29869716|GENSCAN_predicted_CDS_2|477_bp atgagtatcttgaaggttatccaggaagaagaggtctccaaatttcagcaggtaatgaag cagaagcacctcagggtgtacaaccagggtctcgaggacagtggctgcaccagtctggga cttgccaagtctggtcaactgcctttgaatccttctgaagacaaatctatgaggctttat tgctatctgactgttcatcttcacatcccccctctgatccacttgttgggtcctggcttt tacttcctgcccaacatcgagactcagccccggcagggtcccttttcccctgcccacttg tccataaaactgccagctgtttgtgacagccagcaggctgggagcctgcacaaattgaag atgacattggacttttatgtcttcctgatccaagagaagctgctggcgcttgagataaat gtcaaggctcctcttcttcctgttcctccatcagtggaaaccacagctggaccttga >gi568815576f:29667399_29869716|GENSCAN_predicted_peptide_3|781_aa MGVTGPGYRRSGYGYSHGREPEWNQPFSRPQFPLCTSEQQFSKGSTPRSRLAPSQHRRDS MMDEDPAGGRSSAPSGQAQCQADGWATHWATLAEAASCTLDTAHQGQQDTQPQMHTSAPT AHIHPDLFVHISVTCPQTLRAHMGSLFLWGAGQAVPSSSSIPQLSTSILCVPQNQGRSSE QNPAPGKEGVCGVTRRRARLRVPTSQGVPHCGERFHSDAVSQGVPGKATKLTWQHLDILM TPLSRGGNQSGDGVSTKFKVILGLLITALPEPRVLLGEKEEGSRAPRPLQPPPGPPPAHE PRPQSLRRAGGRGASKMPFHPVTAALMYRGIYTVPNLLSEQRPVDIPEDELEGECPPGSP PRRPSYLCAQVGAPASSCAPRQDLSAPRGAAVGVAQAESGPAPARVVPSAAETCRAPQRG HGAAQSAVTVRQRGGLGRGSALSPQRRPGRSTRILIPSAAPARGSLLALTFRSCRFKNSI CSGSSRRGEGDCPALARCGAAPVPTCGRSPFPAALRGPPARRAGKAAREAGGLGETQALC REDAIRGGATCARKRELGSPGWNAADTCPEVPEYGTLVHPRAMLSPDLQALGWALKAMLR LQEIREAFKVFDRDGNGFISKQELGTAMRSLGYMPNEVELEVIIQRLDMDGDGQVDFEEF VTLLGPKLSTSGIPEKFHGTDFDTVFWKCDMQKLTVDELKRLLYDTFCEHLSMKDIENII MTEEESHLGTAEECPVDVETCSNQQIRQTCVRKSLICAFAIAFIISVMLIAANQVLRSGM K >gi568815576f:29667399_29869716|GENSCAN_predicted_CDS_3|2346_bp atgggagtgactggcccaggataccgcaggagtggctatggctacagccatggccgggag ccggagtggaaccagcccttctccaggcctcagtttcccttgtgcaccagcgagcagcag ttttctaaaggttccaccccgaggagccggctggcccccagccagcatcgcagggacagc atgatggatgaggatcccgcggggggcaggagctcggctccatctggtcaggcgcagtgc caggctgatggatgggccactcactgggccaccctcgccgaggctgcctcctgcactctg gatactgcacaccagggtcagcaggacacccagccacagatgcacacatctgcaccaaca 
gcccacatccacccagacctgttcgtgcacatcagtgtcacatgcccacagacactgaga gcccacatggggtccctgttcctctggggagcagggcaggcagtccctagctcttcatcc atcccacaactaagtacgagcatcctctgtgtaccccaaaaccagggacgcagcagcgag caaaacccagccccaggcaaggaaggagtctgcggtgtcactaggagaagggcaaggctc agggtgccaacctcacagggtgtcccccactgtggagagaggttccactctgatgcggta tcacagggtgtgccgggcaaggccacaaagctgacctggcagcacctggatattttaatg accccattgtccagaggaggaaatcaaagtggagacggtgtaagcaccaagttcaaggtc atcctgggactgttgatcacagccctacctgagccgcgggtcctgctgggagagaaggag gaggggagccgcgcgccccgcccgctccagccgcccccggggccgccaccggcccatgag ccccggcctcaaagtttgcggcgggcgggcgggcgcggagcctccaagatgccgttccac ccggtgacggcggcgttgatgtaccggggcatctacaccgtgcccaacctgctgtcggag cagcgcccggtggacatcccggaggacgagctggagggtgagtgtccgccgggatccccg ccccggcggccctcctacctgtgcgcccaggtgggcgccccagctagcagctgtgccccg cggcaagacctgtccgcaccccggggcgccgcggtgggggtcgctcaggcggagagcggc ccagcccctgcccgcgtggtccccagcgctgcggaaacttgccgggccccgcagcggggt cacggggccgcgcagtcggcggtgacggtgcggcaacgcggcggactggggcgggggtcc gcgctgagcccccagcgccggcccggccggagcacccgcatcctgatcccctccgcggcg cccgcccgcggctctctgctcgcattgacattccgctcgtgtcgctttaaaaattcaatc tgctcgggcagcagcagaaggggagagggcgactgccctgctcttgcccgctgcggggcc gcccccgtccccacctgcggccgtagccccttccctgcagccctgcggggacccccagcc cggcgcgccgggaaggcggcccgggaggcgggcggtctgggcgagacccaggccctctgc cgggaggacgccattcgcggaggagccacatgtgccaggaagagggagctgggcagcccg ggatggaatgctgcagacacctgcccagaggtcccagaatacggcacccttgtgcaccct cgggccatgctctcaccagatctgcaggctctgggttgggctctcaaggccatgctcagg ctgcaggagatccgagaggccttcaaggtgtttgaccgtgacggcaatggcttcatctcc aagcaggagctgggcacagccatgcgctcactgggttacatgcccaacgaggtggagctg gaggtcatcatccagcggctggacatggatggtgatggtcaagtggactttgaggagttt gtgacccttctgggacccaaactctccacctcagggatcccagagaagttccatggcacc gactttgatactgtcttctggaagtgcgacatgcagaagctgacggtggatgagctgaag cggctgctctacgacaccttctgcgagcacctgtccatgaaggacatagagaacatcatc atgacggaggaggagagccacctgggcacagccgaggagtgtcccgtggatgtggagacc tgctccaaccagcagatccgccagacttgcgtgcgcaagagtctcatctgcgccttcgcc 
atcgccttcatcatcagtgtcatgctcattgcggccaaccaggtgctgcgcagtggcatg aagtag >gi568815576f:29667399_29869716|GENSCAN_predicted_peptide_4|181_aa MGKRYFCDYCDRSFQDNLHNRKKHLNGLQHLKAKKVWYDMFRDAAAILLDEQNKRPCRKF LLTGQCDFGSNCRFSHMSERDLQELSIQVEEERRAREWLLDAPELPEGHLEDWLEKRAKR LSSAPSSRTAAGKMQESGSCPHAQDKQVTREATSRVVPDTHLNVPKALAHVSGALAVCQC V >gi568815576f:29667399_29869716|GENSCAN_predicted_CDS_4|546_bp atggggaagcgatacttctgtgactactgcgaccgctccttccaggacaacctccacaac cgcaagaagcacctgaacgggctgcagcacctcaaggccaagaaggtctggtacgacatg ttccgagatgcagctgccatcttgctggatgagcagaacaagcggccctgcaggaagttt ctactgacaggccagtgcgactttggctccaactgcagattttcccacatgtcagagcga gacctgcaggagctgagcatccaggtggaggaggagaggcgagccagggagtggctacta gatgctcctgagctccccgagggccatctggaggactggctggagaagagagccaagcgg ctgagctcagccccaagtagcaggaccgcggctgggaaaatgcaagagagtggctcctgt cctcacgcacaggacaaacaggtcaccagagaagctacgagcagagttgtgccagacacg cacctcaatgtccccaaggctttggctcacgtgtctggggccctggcagtgtgccagtgt gtgtga >gi568815576f:29667399_29869716|GENSCAN_predicted_peptide_5|64_aa MEGSRDLVHSDEPARGGDPNPQAPERYGSMACQELGRTAGATPHCSHYRLSFVSCQIIGS IRFS >gi568815576f:29667399_29869716|GENSCAN_predicted_CDS_5|195_bp atggagggttccagagacctggttcactcggatgaaccagccaggggtggggaccccaac ccccaggccccggaacggtacgggtccatggcctgtcaggagctgggccgcacagcagga gccactccccattgctcgcattaccgcctgagcttcgtctcctgtcagatcatcggcagc attagattctcatag >gi568815576f:29667399_29869716|GENSCAN_predicted_peptide_6|131_aa MAAATLTSKLYSLLFRRTSTFALTIIVGVMFFERAFDQGADAIYDHINEGPDVIQPELDY PESSLMEEDSGLPPQMPMRTCPSVALPNESCLPTPPPQSSLDGTAMQVGRAGLTAQSLIS ENKLQTLRAAL >gi568815576f:29667399_29869716|GENSCAN_predicted_CDS_6|396_bp atggcggccgcgacgttgacttcgaaattgtactccctgctgttccgcaggacctccacc ttcgccctcaccatcatcgtgggcgtcatgttcttcgagcgcgccttcgatcaaggcgcg gacgctatctacgaccacatcaacgaggggcctgatgtgattcagccagagctggattat ccagaatcttctctgatggaggaagattcaggactccctcctcagatgccaatgaggact tgtccctcggtggcactgccaaatgagagctgcctccccacccccccgccccagagctcc ctggatgggactgccatgcaggtgggaagagccgggctgaccgcccagtccctgatttct gagaacaagctgcaaacactgagagctgcgctttga 
>gi568815576f:29667399_29869716|GENSCAN_predicted_peptide_7|793_aa FDPGSAGAQGAVTVVGGGGGGGGTEPVVEPPRRVTQHNASSAPGPTPDHPQGPEDRKAED FTSAVLILECLEPWRKSACSGKPSLILQHPEQKADRYFVLYKPPPKDNIPALVEEYLERA TFVANDLDWLLALPHDKFWCQVIFDETLQKCLDSYLRYVPRKFDEGVASAPEVVDMQKRL HRSVFLTFLRMSTHKESKDHFISPSAFGEILYNNFLFDIPKILDLCVLFGKGNSPLLQKM IGNIFTQQPSYYSDLDETLPTILQVFSNILQHCGLQGDGANTTPQKLEERGRLTPSDMPL LELKDIVLYLCDTCTTLWAFLDIFPLACQTFQKHDFCYRLASFYEAAIPEMESAIKKRRL EDSKLLGDLWQRLSHSRKKLMEIFHIILNQICLLPILESSCDNIQGFIEEFLQIFSSLLQ EKRDETRTAYILQAVESAWEGVDRRKATDAKDPSVIEEPNGEPNGVTVTAEAVSQASSHP ENSEEEECMGAAAAVGPAMCGVELDSLISQVKDLLPDLGEGFILACLEYYHYDPEQVINN ILEERLAPTLSQLDRNLDREMKPDPTPLLTSRHNVFQNDEFDVFSRDSVDLSRVHKGKST RKEENTRSLLNDKRAVAAQRQRYEQYSVVVEEVPLQPGESLPYHSVYYEDEYDDTYDGNQ VGANDADSDDELISRRPFTIPQVLRTKVPREGQEEDDDDEEDDADEEAPKPDHFVQDPAV LREKAEARRMAFLAKKGYRHDSSTAVAGSPRGHGQSRETTQERRKKEANKATRANHNRRT MADRKRSKGMIPS >gi568815576f:29667399_29869716|GENSCAN_predicted_CDS_7|2382_bp tttgaccccggaagtgcgggcgctcagggagctgtcaccgtggtcggcggcggcggcggc ggcggcggcacagagccggtggtggagccgccgaggagggtcacgcagcacaatgccagc tctgcccctggaccaactccagatcacccacaaggacccgaagacaggaaagctgaggac ttcaccagcgctgtccttatcctcgaatgcttggagccttggaggaagtctgcttgctct gggaagccgtcccttatcctccagcaccccgagcagaaggcagaccggtattttgtgtta tacaaaccgccccctaaagacaacattcccgccctagtggaggagtacctggaacgcgcc accttcgtagccaatgacctcgactggctcctggccttgcctcacgataaattctggtgc caggtgatctttgacgagactctacagaagtgcctggactcctacctgcgctatgtcccc cgcaaattcgacgagggggtggcctcagcccctgaggttgttgacatgcagaagcgcctc catcgaagtgtttttctcaccttcctccgcatgtccactcacaaggaatccaaagatcac ttcatttccccttctgcgtttggagaaatcctctacaataacttcctctttgacattcca aagatcctggacctctgcgtgctctttggaaaaggcaactcaccactgctccagaagatg ataggaaacatctttacacagcagccaagttactacagtgacctggatgaaaccctgcct accatccttcaggtcttcagcaatatcctccagcactgtggtttgcaaggggacggggcc aataccacaccccagaagcttgaggagaggggccgattgacccccagtgacatgcctctc ctggaattaaaggacattgttctctacctttgtgatacctgcaccacactttgggccttt ctggatatcttccctttggcttgccagaccttccagaagcacgacttttgttacagacta 
gcttccttctacgaagcagcaattcccgaaatggagtctgcaattaagaagaggaggctt gaagatagcaagcttcttggtgacctgtggcagaggctctcccattccaggaagaagcta atggagattttccacatcatcctgaaccagatctgcctccttcccatcctagaaagcagc tgtgacaacattcagggcttcatcgaagagttccttcagatcttcagctccttgctgcag gagaagagggacgagacgcggactgcctacatcctccaggcagtcgagagtgcatgggaa ggggtggacagacggaaagccacagatgctaaagacccatcggtgattgaggagcctaat ggggagcctaacggggtcacggtgacagcagaggcagtcagtcaagcatcatcacatccg gagaactcggaggaagaggagtgcatgggagcagccgcggctgtgggccctgccatgtgt ggggtggaactggactctctcatctcccaagtgaaggacctgctgccagaccttggtgag ggcttcatcctggcctgcctggagtactaccactacgacccagagcaggtgatcaacaat atcctggaggagcggctggcccccaccctcagccagctggaccgcaacctagacagagaa atgaaaccagaccctacacccctgctgacgtctcgccacaacgtcttccagaatgacgag tttgatgtgttcagcagggactcagtagacctgagccgggtgcacaagggcaagagcacc aggaaggaggaaaacacgcggagtttgctgaacgacaagcgtgcagtggcggcacagcgg cagcgctacgagcagtacagcgtggtggtggaggaggtgccactgcagccaggcgagagc ctgccctaccacagtgtctactacgaggatgagtacgatgacacatacgatggcaaccag gtgggcgccaatgatgcagactctgatgacgagctcatcagccgcaggccattcaccatc cctcaggtgctgagaaccaaagtgcctagagaagggcaggaggaggatgacgacgatgag gaagacgatgctgacgaggaggctcccaagcccgaccattttgttcaggaccctgcagtg ctgagagagaaggcagaagccaggcgcatggcctttctcgccaagaaagggtaccggcat gacagctcaacagcagtggccggcagcccccgaggccatgggcagagccgcgagacaacc caggaacgcaggaagaaggaagccaacaaggcgacaagagccaaccacaaccggagaacc atggccgaccgcaagaggagcaaaggcatgatcccatcctga