GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:02:19 Sequence gi568815596r:9306837_9518697 : 211861 bp : 45.07% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3458 3474 17 1 2 76 110 12 0.305 1.02 1.02 Intr + 5531 5661 131 2 2 116 66 36 0.743 4.54 1.03 Intr + 11892 11983 92 1 2 40 73 54 0.537 -1.19 1.04 Intr + 13452 13501 50 2 2 104 82 52 0.819 3.68 1.05 Intr + 16285 16414 130 1 1 67 94 152 0.991 14.40 1.06 Intr + 20990 21075 86 1 2 73 108 76 0.972 6.72 1.07 Intr + 27902 27977 76 2 1 87 113 65 0.957 8.42 1.08 Intr + 28257 28343 87 2 0 62 78 70 0.903 3.57 1.09 Intr + 37696 37803 108 0 0 66 55 119 0.292 6.88 1.10 Intr + 43630 43731 102 0 0 53 94 37 0.470 1.17 1.11 Intr + 43972 44059 88 0 1 71 85 86 0.941 6.14 1.12 Intr + 49211 49259 49 0 1 76 101 37 0.935 1.54 1.13 Intr + 49343 49509 167 2 2 100 60 235 0.993 21.50 1.14 Intr + 51920 52053 134 0 2 87 51 83 0.900 4.86 1.15 Intr + 61589 61683 95 1 2 80 103 70 0.955 6.46 1.16 Intr + 67919 68108 190 2 1 41 66 243 0.823 16.99 1.17 Intr + 72028 72223 196 0 1 93 95 215 0.864 21.69 1.18 Intr + 73905 73972 68 1 2 62 105 105 0.998 8.22 1.19 Intr + 78409 78522 114 0 0 79 62 175 0.998 14.64 1.20 Intr + 81458 81710 253 1 1 100 86 187 0.998 16.71 1.21 Intr + 82927 83058 132 2 0 60 59 62 0.443 1.12 1.22 Intr + 86646 86811 166 1 1 149 86 68 0.911 11.72 1.23 Intr + 92465 92595 131 2 2 66 38 85 0.521 1.64 1.24 Intr + 93906 93994 89 1 2 95 92 19 0.987 2.69 1.25 Intr + 94438 94560 123 0 0 90 92 239 0.999 25.28 1.26 Term + 96417 96491 75 2 0 110 38 78 0.991 2.84 1.27 PlyA + 96816 96821 6 1.05 2.08 PlyA - 97883 97878 6 1.05 2.07 Term - 100069 99998 72 1 0 60 41 34 0.362 -6.19 2.06 Intr - 100762 100613 150 1 0 85 85 72 0.711 6.96 2.05 Intr - 105569 105433 137 0 2 113 98 96 0.999 13.39 2.04 Intr - 107420 107342 79 2 1 89 88 31 0.570 2.32 2.03 Intr - 113264 113154 111 2 0 100 66 51 0.474 4.68 2.02 Intr - 116985 116912 74 1 2 98 60 49 0.115 2.23 2.01 Init - 117848 117842 7 2 1 76 64 0 0.102 -2.53 2.00 Prom - 124016 123977 40 -5.66 3.00 Prom + 124519 124558 40 -7.26 3.01 Init + 125694 125852 159 2 0 85 75 111 0.769 9.32 3.02 Intr + 127035 127124 90 2 0 87 87 43 0.896 4.29 3.03 Intr + 129375 129525 151 2 1 48 57 30 0.711 -4.36 3.04 Intr + 133655 133830 176 0 2 54 113 153 0.740 14.06 3.05 Intr + 134982 135140 159 2 0 101 86 41 0.976 5.38 3.06 Intr + 136679 136825 147 1 0 88 37 50 0.537 0.33 3.07 Intr + 141362 141514 153 1 0 41 93 148 0.952 10.87 3.08 Intr + 146077 146185 109 0 1 99 100 1 0.996 2.26 3.09 Intr + 148823 148921 99 0 0 92 89 78 0.988 8.38 3.10 Intr + 150097 150191 95 2 2 76 63 39 0.810 -0.12 3.11 Intr + 152695 152782 88 0 1 94 116 29 0.855 5.84 3.12 Intr + 160871 160940 70 0 1 129 100 5 0.967 4.04 3.13 Intr + 164507 164603 97 2 1 104 79 84 0.998 9.11 3.14 Term + 166080 166181 102 2 0 87 53 178 0.999 12.48 3.15 PlyA + 166238 166243 6 1.05 4.00 Prom + 167171 167210 40 -11.92 4.01 Init + 167731 167811 81 0 0 73 96 153 0.911 13.59 4.02 Intr + 170003 170034 32 1 2 83 92 13 0.720 -1.87 4.03 Intr + 171386 171534 149 2 2 76 93 55 0.974 4.68 4.04 Intr + 174450 174611 162 1 0 16 92 161 0.943 9.25 4.05 Intr + 177596 177714 119 0 2 105 96 137 0.997 16.38 4.06 Term + 179307 179405 99 2 0 69 47 90 0.915 1.13 4.07 PlyA + 181602 181607 6 1.05 5.11 PlyA - 181671 181666 6 1.05 5.10 Term - 183682 183341 342 1 0 84 28 350 0.877 23.11 5.09 Intr - 184315 184265 51 1 0 116 84 8 0.750 2.30 5.08 Intr - 186150 186062 89 1 2 56 103 10 0.864 -1.01 5.07 Intr - 186989 186911 79 2 1 106 74 74 0.947 6.92 5.06 Intr - 187931 187801 131 0 2 89 82 73 0.996 7.21 5.05 Intr - 190412 190278 135 0 0 104 89 127 0.960 14.94 5.04 Intr - 195440 195337 104 1 2 100 79 64 0.920 6.52 5.03 Intr - 198529 198330 200 1 2 72 94 225 0.976 19.65 5.02 Intr - 203295 203143 153 0 0 27 95 105 0.782 5.37 5.01 Init - 211287 211267 21 0 0 101 110 10 0.643 4.19 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:9306837_9518697|GENSCAN_predicted_peptide_1|982_aa MQASLSCCLLLTPRLPRLAGPTPALRPPSMPFLLPSLPLPPPHDAPSFEASSDLPSWDHL SAKKVVTSGLSGRPVGTTKLDLKKPFDKAWKDYETKITKIEKEKKEHAKLHGMIRTEISG AEIAEEMEKERRFFQLQMCEYLLKVNEIKIKKGVDLLQNLIKYFHAQCNFFQDGLKAVES LKPSIETLSTDLHTIKQAQDEERRQLIQLRDILKSALQVEQKEDSQIRQSTAYSLHQPQG NKEHGTERNGSLYKKSDGYESELNVLGLAMQYLDLKAPTASSLKAGIACPLVQANRPPAK LNLLTCQVKTNPEEKKCFDLISHDRTYHFQAEDEQECQIWMSVLQNSKEEALNNAFKGDD NTGENNIVQELTKEIISEVQRMTGNDVCCDCGAPDPTWLSTNLGILTCIECSGIHRELGV HYSRMQSLTLDVLGTSELLLAKNIGNAGFNEIMECCLPAEDSVKPNPGSDMNARKDYITA KYIERRYARKKHADNAAKLHSLCEAVKTRDIFGLLQAYADGVDLTEKIPLANGHPRLPAG QSLVVQPVTHYALSLFLFSGSGNLDKQTGKGSTALHYCCLTDNAECLKLLLRGKASIEIA NESGETPLDIAKRLKHEHCEELLTQALSGRFNSHVHVEYEWRLLHEDLDESDDDMDEKLQ PSPNRREDRPISFYQLGSNQLQSNAVSLARDAANLAKEKQRAFMPSILQNETYGALLSGS PPPAQPAAPSTTSAPPLPPRNVGKAHIPDRVHPWLDEDVPEHRRQLQDEATQSHLLPVKP PGWLVGAQDPLTPTPPPPVAKTPSVMEALSQPSKPAPPGISQIRPPPLPPQPPSRLPQKK PAPGSRRSTGELKPLALVLSSGVDGSCVTPFADENADSPEPIYPGLPVDLSATEALGPLS NAMVLQPPAPMPRKSQATKLKPKRVKALYNCVADNPDELTFSEGDVIIVDGEEDQEWWIG HIDGDPGRKGAFPVSFVHFIAD >gi568815596r:9306837_9518697|GENSCAN_predicted_CDS_1|2949_bp atgcaggcaagcctcagctgctgccttctgctgactcctcgactgccacggctggcaggg cccacaccagcacttcgaccaccctctatgccattccttttgcccagtctacccctgccc ccacctcacgatgccccatcttttgaagcttcctctgatttgccatcttgggaccacctt tcagccaaaaaggtggtgacttctggcctttctggccgtcctgttggcacaacaaagttg gatctgaaaaagccttttgataaagcttggaaggactatgaaacaaaaataaccaagata gaaaaggagaaaaaggaacacgccaagctccatgggatgattcggactgaaataagcgga gcggaaattgccgaagagatggaaaaggagaggcgcttcttccagctacagatgtgcgag tatctgctgaaggtcaacgaaatcaagattaaaaagggagtagatttacttcagaatctg atcaaatactttcatgcccaatgcaatttttttcaggatggactcaaagccgtggaaagc ctcaaaccttccattgaaacgctgtctacggatcttcacacgatcaaacaggcccaggat gaagaaagaaggcagttgatacagcttcgagatattttgaaatccgcattgcaggttgaa cagaaagaggactcccaaattcgtcagagcacagcttatagcttacatcagcctcaggga aacaaggaacatgggaccgagcggaacggcagcctctacaagaagagtgacgggtacgaa tcagaactgaatgttttgggcttagccatgcaatatcttgatctgaaggcacccactgcc agcagcctcaaagctggcatcgcttgtcctcttgtccaggctaaccggcctcctgcaaag ctcaacctgctaacctgccaggtgaagaccaaccctgaggagaagaagtgctttgacctc atttcacatgacagaacttaccactttcaagctgaagatgaacaggaatgtcaaatatgg atgtctgtgctgcaaaatagcaaagaagaagctttaaacaatgcatttaagggggatgac aatactggagaaaataacatcgtccaagaactgacaaaggagatcatctcagaagtgcag aggatgacgggcaatgacgtctgctgtgactgtggggcgccagatcctacatggctttcc accaacctgggcatcctgacctgcatcgagtgttccggaatccaccgagagctgggggtt cattattccaggatgcagtccctgaccttagatgtactgggaacatctgagctgctgctc gccaagaatattgggaatgcaggctttaatgagatcatggaatgttgcctaccagctgag gactcagtcaaacccaacccaggcagcgacatgaatgcaagaaaggactacatcacagcc aagtacatcgagaggagatacgcaaggaagaagcacgcggataacgcggcgaagcttcac agtctttgcgaggccgtcaaaacgagagatatttttggattgctccaagcttatgctgat ggtgtggatcttacggaaaaaatcccactggccaacggacatcccaggctccctgccggg cagtccctggtcgtccagcctgtgacacactatgctctctctctgttcctgttctcgggc agtgggaacctggataaacagacagggaaaggcagcacagccctgcactactgctgcctg accgacaatgccgagtgcctcaagttgctcctgcgggggaaggcctccatcgagatagca aacgagtcaggagagactccgctggacattgccaagcgcctcaagcacgagcactgtgag gagctgctgacccaagccttatctggaagatttaattctcacgttcacgttgaatatgaa tggcgactactccacgaagacctggatgaaagtgatgacgacatggatgagaaattgcag cccagtcccaaccggcgggaagaccggcccatcagcttctaccagctgggctccaaccag cttcagtctaacgctgtatctttggccagagatgctgcaaaccttgccaaggagaagcag agggctttcatgcccagcatcttgcagaatgagacttacggagccctcctgagtggcagc ccacctcccgcccagcctgcagcccccagcaccaccagcgcccccccgcttcctccacgg aatgttggcaaagcccacatcccagaccgcgtccatccctggcttgatgaggacgtccct gagcacagaaggcagctacaagatgaagccacacagtcgcatctcctccctgtcaaaccc ccaggctggctggtcggggcccaggatcccctgacccccacgccgcccccacccgttgcc aagacgcccagcgtaatggaagccttgagccagccgagcaagcctgccccgcctgggatc tcacagatcaggcccccacctctgcccccacagccgcccagccgcctcccgcagaagaag cctgcgccggggtccaggagatccaccggggagctgaagccactggcactggtccttagc tctggagttgacggatcttgtgtcacaccttttgcagatgaaaatgctgactctccagaa cccatttatccagggctgccagtggatctctctgcaacggaagctctgggtcctctgtcc aatgctatggtcctgcagccccctgcacccatgcctaggaagtcgcaggcaaccaagttg aagcctaagcgggtgaaagcgctctataactgtgtggctgacaaccccgatgagctcacc ttctccgagggggatgtgatcatcgtggacggggaggaggaccaggagtggtggattggc cacattgatggagatcctggtcgcaaaggcgcattcccggtgtcatttgtgcactttatc gctgactga >gi568815596r:9306837_9518697|GENSCAN_predicted_peptide_2|209_aa MTGVGSAAGRSPQQESQTSRKCEGEAGVRSFSEKHHPGQNERKLSVQTAVCGTFEDSQIV FNSISVDSSLGGLSRSSTVASLDTDSTKSSGQSNNNSDTCAEFRIKYVGAIEKLKLSEGK GLEGPLDLINYIDVAQDVLHRHALYLIIRMVCYDDGLGAGKSLLALKTTDASNEEYSLWV YQCNSLEQAQAICKVLSTAFDSVLTSEKP >gi568815596r:9306837_9518697|GENSCAN_predicted_CDS_2|630_bp atgacaggggtcggatcagcagctggtcgctctcctcagcaggaatcgcagacatcccga aagtgtgagggcgaagcgggggtgcgtagtttctctgaaaagcatcacccaggtcagaat gaaaggaaactctctgtgcagaccgctgtatgtgggacctttgaagacagtcaaatagtg ttcaattctatatctgtggattctagccttgggggtctttcacgatccagcactgtggcc agcctcgacacagattccaccaaaagctcaggacaaagcaacaataattcagatacctgt gcagaatttcgaataaaatatgttggtgccattgagaaactgaaactctccgagggaaaa ggccttgaagggccattagacctgataaattatatagacgttgcccaggatgttttgcac aggcatgctctctacttaataatccggatggtgtgttacgatgacggtctgggggcggga aaaagcttactggctctgaagaccacagatgcaagcaatgaggaatacagcctgtgggtt tatcagtgcaacagcctggaacaagcacaagccatttgcaaggttttatccaccgctttt gactctgtattaacatctgagaaaccctga >gi568815596r:9306837_9518697|GENSCAN_predicted_peptide_3|564_aa MLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDF SRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPV FALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ NPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVE GTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGE QNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQR VSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLCYQLQKLTGDVEEL EIQEKPALKVFKNITVIQEPGMVVLEWLANPSNDMYADTVTTVILEVQSNPKIRKGAVQK VSKKLEMHVYSKRLEIMLQDIFGEDCVSVKDDSILSVTVDGKTANLNLETRTVECEEGSE DDESLREMVELAAQRLYEALTPVH >gi568815596r:9306837_9518697|GENSCAN_predicted_CDS_3|1695_bp atgctgtataccgagacagatttggaagaaagcatggacaaaattgaaactatcaacttt catgaagttaaggaagttgcgggaatcaagttttggtgttaccatgcaggtcacgtccta ggagccgccatgttcatgattgagatcgcaggcgtgaagcttttgtacactggtgatttc tcaagacaagaagataggcacttaatggcagctgaaattcctaatattaagcctgatatt cttatcattgaatctacttatgggacccatatccatgagaaacgtgaagagcgagaagca agattctgtaacactgtccacgatattgtaaacagaggaggcaggggtctcattcctgtc tttgctcttggaagggctcaggagctgctcttgattctagatgagtactggcagaatcac ccagaactacatgacattccaatatactatgcatcatctttggccaagaagtgtatggca gtgtaccagacatatgtaaatgccatgaatgacaaaatccgcaaacagatcaacatcaat aatccctttgttttcaaacacattagtaacctcaagagcatggatcattttgatgacatt ggtcccagtgttgtaatggcctccccaggcatgatgcaaagtggcttatccagagaatta tttgaaagctggtgtactgataagaggaatggtgtcattatagcgggatactgtgtagaa gggacacttgccaagcacatcatgtctgaacctgaagaaatcactactatgtctggacag aagttaccactgaaaatgtctgttgattacatttctttctcagctcacacggattaccag caaaccagtgaatttattcgtgctttgaaaccgcctcatgtgattttagtccatggagaa cagaatgaaatggccagattgaaagcagcactgattcgagaatatgaagataacgatgaa gttcacatagaggttcataatcctcggaatacagaagcagtgaccttaaacttcagagga gaaaaactagccaaggttatgggatttttagcagacaaaaaaccagaacaaggccagcgg gtctcaggaatacttgttaaaagaaactttaattatcacatactttctccttgcgacctg tccaattatactgacctggccatgagcacggtgaagcagacccaagccattccatatact ggtccctttaatttgctctgttaccagctgcagaaattgacaggtgatgtggaagaatta gaaattcaagaaaaacctgctctgaaagtgttcaaaaatattactgtaatacaagaacca ggcatggtggtattagaatggctggcaaacccttctaatgatatgtatgcagatacagta acaactgtgatattggaagttcagtcaaatcccaaaataagaaaaggtgcagtacagaag gtttctaaaaaattagaaatgcacgtttacagcaagaggttggagatcatgctccaggac atatttggagaagactgtgtaagtgtaaaggatgactctattcttagcgtcacagtggac gggaaaactgccaaccttaacttggagacacggactgtagaatgtgaagagggaagtgaa gacgatgaatccctccgagaaatggtggagctggctgcacagagactgtacgaggccctg acgccagttcactga >gi568815596r:9306837_9518697|GENSCAN_predicted_peptide_4|213_aa MALCEAAGCGSALLWPRLLLFGDSITQLLQATWMYTCRKCDVLNRGFSGYNTRWAKIILP RLIRKGNSLDIPVAVTIFFGANDSALKDENPKQHIPLEEYAANLKSMVQYLKSVDIPENR VILITPTPLCETAWEEQCIIQGCKLNRLNSVVGEYANACLQVAQDCGTDVLDLWTLMQDS QGKWSPVGKSYYHVDAESPVTFGGPESLSRSGP >gi568815596r:9306837_9518697|GENSCAN_predicted_CDS_4|642_bp atggcgctgtgcgaggccgcgggctgcgggagtgccctgctctggcctcgcttgttgctc ttcggggactccatcacccagttgctccaggccacctggatgtatacgtgcagaaaatgt gatgttctgaatcgtggattttcaggttacaataccaggtgggccaaaattatccttcca agattaatcaggaaaggaaacagtttggacatcccagtagcagttacaattttctttggg gccaatgacagtgcactaaaagatgagaatcccaagcagcacattcccctggaggagtac gctgcgaacctaaagagcatggtgcagtacctgaagtccgtggacatccctgagaatcga gtcattctcatcacgccgaccccactttgtgaaacagcctgggaagaacagtgcatcata caaggttgcaaactaaatcgcctgaactctgttgttggtgaatatgccaatgcgtgttta caagtggcccaagactgtgggactgacgtacttgacctgtggaccctgatgcaggacagc cagggaaaatggtcaccagtgggaaagtcatactaccacgtggatgccgagtcaccagtc acctttggaggccctgagagcctaagcagatctggaccctag >gi568815596r:9306837_9518697|GENSCAN_predicted_peptide_5|434_aa MEVFVQREADLVTTHELGHNFGAEHDPDGLAECAPNEDQGGKYVMYPIAVSGDHENNKMF SNCSKQSIYKTIESKAQECFQERSNKVCGNSRVDEGEECDPGIMYLNNDTCCNSDCTLKE GVQCSDRNSPCCKNCQFETAQKKCQEAINATCKGVSYCTGNSSECPPPGNAEDDTVCLDL GKCKDGKCIPFCEREQQLESCACNETDNSCKVCCRDLSGRCVPYVDAEQKNLFLRKGKPC TVGFCDMNGKCEKRVQDVIERFWDFIDQLSINTFGKFLADNIVGSVLVFSLIFWIPFSIL VHCVDKKLDKQYESLSLFHPSNVEMLSSMDSASVRIIKPFPAPQTPGRLQPAPVIPSAPA APKLDHQRMDTIQEDPSTDSHMDEDGFEKDPFPNSSTAAKSFEDLTDHPVTRSEKAASFK LQRQNRVDSKETEC >gi568815596r:9306837_9518697|GENSCAN_predicted_CDS_5|1305_bp atggaggtgtttgtccaaagggaagctgacctggttacaactcatgaattgggacataat tttggagcagaacatgatccggatggtctagcagaatgtgccccgaatgaggaccaggga gggaaatatgtcatgtatcccatagctgtgagtggcgatcacgagaacaataagatgttt tcaaactgcagtaaacaatcaatctataagaccattgaaagtaaggcccaggagtgtttt caagaacgcagcaataaagtttgtgggaactcgagggtggatgaaggagaagagtgtgat cctggcatcatgtatctgaacaacgacacctgctgcaacagcgactgcacgttgaaggaa ggtgtccagtgcagtgacaggaacagtccttgctgtaaaaactgtcagtttgagactgcc cagaagaagtgccaggaggcgattaatgctacttgcaaaggcgtgtcctactgcacaggt aatagcagtgagtgcccgcctccaggaaatgctgaagatgacactgtttgcttggatctt ggcaagtgtaaggatgggaaatgcatccctttctgcgagagggaacagcagctggagtcc tgtgcatgtaatgaaactgacaactcctgcaaggtgtgctgcagggacctttctggccgc tgtgtgccctatgtcgatgctgaacaaaagaacttatttttgaggaaaggaaagccctgt acagtaggattttgtgacatgaatggcaaatgtgagaaacgagtacaggatgtaattgaa cgattttgggatttcattgaccagctgagcatcaatacttttggaaagtttttagcagac aacatcgttgggtctgtcctggttttctccttgatattttggattcctttcagcattctt gtccattgtgtggataagaaattggataaacagtatgaatctctgtctctgtttcacccc agtaacgtcgaaatgctgagcagcatggattctgcatcggttcgcattatcaaacccttt cctgcgccccagactccaggccgcctgcagcctgcccctgtgatcccttcggcgccagca gctccaaaactggaccaccagagaatggacaccatccaggaagaccccagcacagactca catatggacgaggatgggtttgagaaggaccccttcccaaatagcagcacagctgccaag tcatttgaggatctcacggaccatccggtcaccagaagtgaaaaggctgcctcctttaaa ctgcagcgtcagaatcgtgttgacagcaaagaaacagagtgctaa