GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:51:55 Sequence gi568815587r:35160897_35417517 : 256621 bp : 41.47% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 432 675 244 2 1 45 87 185 0.972 12.04 1.02 Intr + 5716 5879 164 2 2 41 51 119 0.391 2.27 1.03 Intr + 9568 9652 85 0 1 102 24 82 0.560 1.67 1.04 Intr + 15706 15844 139 2 1 77 110 202 0.696 19.90 1.05 Intr + 19378 19511 134 1 2 121 77 82 0.960 9.77 1.06 Intr + 25936 26004 69 2 0 85 44 109 0.875 4.44 1.07 Intr + 28939 29169 231 2 0 91 72 233 0.978 18.82 1.08 Intr + 37225 37350 126 2 0 40 63 133 0.502 5.73 1.09 Intr + 39750 39964 215 1 2 38 61 174 0.849 7.21 1.10 Intr + 43616 43763 148 1 1 63 38 168 0.352 8.19 1.11 Intr + 50350 50553 204 2 0 80 91 135 0.989 11.25 1.12 Intr + 53956 54018 63 2 0 94 101 21 0.721 1.77 1.13 Intr + 58420 58491 72 2 0 78 109 35 0.715 3.06 1.14 Intr + 60758 60836 79 0 1 43 116 76 0.776 3.59 1.15 Term + 68233 68437 205 1 1 45 41 305 0.956 17.36 1.16 PlyA + 69103 69108 6 1.05 2.03 PlyA - 69863 69858 6 1.05 2.02 Term - 77546 77255 292 1 1 17 32 195 0.029 0.63 2.01 Init - 86856 86705 152 0 2 67 61 111 0.088 5.86 2.00 Prom - 98783 98744 40 -3.45 3.20 PlyA - 99972 99967 6 1.05 3.19 Term - 100069 99998 72 1 0 51 49 116 0.534 0.93 3.18 Intr - 104862 104631 232 2 1 138 83 162 0.967 17.55 3.17 Intr - 112106 111949 158 2 2 91 20 135 0.633 4.89 3.16 Intr - 120105 119971 135 0 0 92 87 125 0.675 12.64 3.15 Intr - 121362 121170 193 2 1 66 74 51 0.763 0.07 3.14 Intr - 126055 125861 195 0 0 122 100 257 0.887 27.91 3.13 Intr - 131624 131391 234 1 0 141 80 79 0.505 8.28 3.12 Intr - 140749 140623 127 2 1 61 103 76 0.835 5.22 3.11 Intr - 145346 145178 169 2 1 63 68 197 0.581 13.80 3.10 Intr - 151552 151302 251 2 2 90 91 246 0.819 21.33 3.09 Intr - 154279 154127 153 2 0 131 110 54 0.999 11.02 3.08 Intr - 156620 156481 140 1 2 110 74 158 0.848 15.79 3.07 Intr - 161783 161665 119 2 2 80 94 26 0.013 0.74 3.06 Intr - 182731 182449 283 0 1 57 68 148 0.074 6.20 3.05 Intr - 194657 194526 132 1 0 104 54 85 0.057 5.54 3.04 Intr - 198623 198501 123 1 0 33 111 95 0.122 5.08 3.03 Intr - 203156 202941 216 1 0 45 86 90 0.033 1.20 3.02 Intr - 208557 208507 51 2 0 81 99 59 0.450 3.40 3.01 Init - 210355 210264 92 1 2 64 56 73 0.176 1.71 3.00 Prom - 213225 213186 40 -7.45 4.00 Prom + 213947 213986 40 -6.05 4.01 Init + 214655 214866 212 1 2 62 84 162 0.970 11.51 4.02 Intr + 215849 216057 209 2 2 37 47 144 0.167 3.00 4.03 Intr + 220343 220464 122 0 2 53 87 61 0.130 1.79 4.04 Intr + 224599 224675 77 0 2 63 95 50 0.588 0.59 4.05 Intr + 230121 230219 99 0 0 125 48 65 0.707 4.51 4.06 Intr + 233688 233775 88 0 1 60 115 40 0.035 2.95 4.07 Intr + 235912 236211 300 0 0 16 86 121 0.014 0.71 4.08 Intr + 237067 237233 167 0 2 85 19 132 0.115 3.84 4.09 Intr + 243402 243577 176 0 2 55 49 181 0.430 9.56 4.10 Term + 254241 254446 206 1 2 87 54 131 0.236 6.15 4.11 PlyA + 254753 254758 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 157956 157859 98 0 2 63 74 61 0.807 2.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:35160897_35417517|GENSCAN_predicted_peptide_1|725_aa MHNIPQELKPGLGPKSFLDLSDPSHMVTAISQYNEGKSLQSPVTSNCEAMDGIDVRGSGE PCKRSEGLGVDTSQLHSAGQRSLTAMTGSRLHGKKAEKSGGPARRQDVGVPCISKCQVMN SSVESNPGHILEEILWPYSNERAKGLVAFLPREVSSPNPKVASNSVFHVEKNGRYSISRT EAADLCKAFNSTLPTMAQMEKALSIGFETCRYGFIEGHVVIPRIHPNSICAANNTGVYIL TSNTSQYDTYCFNASAPPEEDCTSVTDLPNAFDGPITITIVNRDGTRYVQKGEYRTNPED IYPSNPTDDDVSSGSSSERSSTSGGYIFYTFSTVHPIPDEDSPWITDSTDRIPATSTSSN TISAGWEPNEENEDERDRHLSFSGSGIDDDEDFISSTIICLFTRRIYKQHTVTKSLGFQV QRDTTDCMDGQNGAFGYPRWRAGVFKAVLPTAAASLTVLSGRSHVLNPKSRQLLVVQRKK QLPRRNSGLATDGMRDIAKHPKKTPIRQQGQLVMDGLTNRNDVTGGRRDPNHSEGSTTLL EGYTSHYPHTKESRTFIPVTSAKTGSFGVTAVTVGDSNSNVNRSLSGDQDTFHPSGGSHT THGSESDGHSHGSQEGGANTTSGPIRTPQIPEWLIILASLLALALILAVCIAVNSRRRCG QKKKLVINSGNGAVEDRKPSGLNGEASKSQEMVHLVNKESSETPDQFMTADETRNLQNVD MKIGV >gi568815587r:35160897_35417517|GENSCAN_predicted_CDS_1|2178_bp atgcataacatcccacaagagctcaagccaggacttggacccaagtctttcttggatctt tccgatccaagccatatggttactgcaatctcacaatacaatgagggtaaatccttgcaa tcaccagttaccagcaattgtgaagcaatggatggcatagatgtcagggggtcaggagag ccttgcaagcgttcagagggcttgggagtggacacatcacagttgcactctgctgggcag cgctcgttgactgccatgacaggctccaggctccacggaaaaaaggcagagaaatcaggg ggtccagccaggcgccaagacgttggggtgccttgcatttctaagtgccaagtgatgaat agctcagttgaaagcaacccagggcacattctggaggaaattctctggccctatagtaat gagcgtgctaaaggcctggtggcatttctaccacgggaagtctcctctccaaatccaaaa gtggcctcaaacagtgtattccacgtggagaaaaatggtcgctacagcatctctcggacg gaggccgctgacctctgcaaggctttcaatagcaccttgcccacaatggcccagatggag aaagctctgagcatcggatttgagacctgcaggtatgggttcatagaagggcacgtggtg attccccggatccaccccaactccatctgtgcagcaaacaacacaggggtgtacatcctc acatccaacacctcccagtatgacacatattgcttcaatgcttcagctccacctgaagaa gattgtacatcagtcacagacctgcccaatgcctttgatggaccaattaccataactatt gttaaccgtgatggcacccgctatgtccagaaaggagaatacagaacgaatcctgaagac atctaccccagcaaccctactgatgatgacgtgagcagcggctcctccagtgaaaggagc agcacttcaggaggttacatcttttacaccttttctactgtacaccccatcccagacgaa gacagtccctggatcaccgacagcacagacagaatccctgctaccagtacgtcttcaaat accatctcagcaggctgggagccaaatgaagaaaatgaagatgaaagagacagacacctc agtttttctggatcaggcattgatgatgatgaagattttatctccagcaccattatctgc ctcttcacccggaggatctacaaacagcacacagtgactaagtccctgggctttcaagtg cagagggatactacagactgcatggatgggcaaaatggcgcctttggttatcctcggtgg agggctggtgttttcaaagctgtccttcctactgctgcagcttctttgactgttttatct ggaagatctcatgttctgaaccccaagtccaggcaactcctagtagtacaacggaagaaa cagctacccagaaggaacagtggtttggcaacagatggcatgagggatatcgccaaacac ccaaagaagactcccattcgacaacagggacagctggtaatggatggtttaacaaatagg aatgatgtcacaggtggaagaagagacccaaatcattctgaaggctcaactactttactg gaaggttatacctctcattacccacacacgaaggaaagcaggaccttcatcccagtgacc tcagctaagactgggtcctttggagttactgcagttactgttggagattccaactctaat gtcaatcgttccttatcaggagaccaagacacattccaccccagtggggggtcccatacc actcatggatctgaatcagatggacactcacatgggagtcaagaaggtggagcaaacaca acctctggtcctataaggacaccccaaattccagaatggctgatcatcttggcatccctc ttggccttggctttgattcttgcagtttgcattgcagtcaacagtcgaagaaggtgtggg cagaagaaaaagctagtgatcaacagtggcaatggagctgtggaggacagaaagccaagt ggactcaacggagaggccagcaagtctcaggaaatggtgcatttggtgaacaaggagtcg tcagaaactccagaccagtttatgacagctgatgagacaaggaacctgcagaatgtggac atgaagattggggtgtaa >gi568815587r:35160897_35417517|GENSCAN_predicted_peptide_2|147_aa MEDCRTTKGNTCKERELCGLPPKALALVWAPPSRHNVINRMPVFCTSKNHSLECKDAHLH PLLSGFTLRIRERKSGGRTQFLRRQRACCKASTRGRIINRAEKANKMVENEVSAKDVNLD GKIFITLILGLTDEHIALFSVSFYTKV >gi568815587r:35160897_35417517|GENSCAN_predicted_CDS_2|444_bp atggaggattgtagaaccacaaagggcaatacctgcaaagagcgtgaactgtgtggcctg ccacctaaggcactggccctagtgtgggcacctccttccaggcacaatgtcataaacagg atgccagtgttctgtacgtcaaagaaccacagccttgagtgcaaggatgcccaccttcac cctcttctttctggttttaccttgaggattcgagaacggaagtcaggcggaaggactcag tttctacgaagacaaagggcatgttgtaaagcatccactagggggcgcattatcaacagg gcagagaaggccaacaagatggttgaaaacgaagtttctgcaaaggacgtgaaccttgat gggaaaattttcattactctcatccttggcttgacagatgaacacattgcacttttctca gtcagcttttacacaaaggtctaa >gi568815587r:35160897_35417517|GENSCAN_predicted_peptide_3|1024_aa MREAHKQEANETMGNKEMGLAFRVSTVGETNWGVGESLKASGKEPGACLCAWDGSNGKQC RVLGLRVILFLPVPLPPAKLLSLVEQEIPLSAGTDLACLVHLRVLVLCRCLNENQSVRKS DLGLLSQMTAVDRTVFDVFEVKTLARKTLKQLTAFSLQYSSWCIDHFLLFCADTQQNKAV SRHNPWMLFFPVNSSTLTSANHLKWLLTPEPEGIQGLVRKIVKVWRVIASEKYTLKKGEC VCTQESHAQWGLRLLTLRVSLTKEWNIHEGSWKKVKISQNCGATHFYTKYGCFGTVMVLG DLVSSSRAISGHLYLPSTSPDPEAWGQGGEVRCQQWKDSANNMPKQVEVRMHDSHLGSEE PKHRHLGLRLCDKLGKNLLLTLTVFGVILGAVCGGLLRLASPIHPDVVMLIAFPGDILMR MLKMLILPLIISSLITGLSGLDAKASGRLGTRAMVYYMSTTIIAAVLGVILVLAIHPGNP KLKKQLGPGKKNDEVSSLDAFLDLIRNLFPENLVQACFQQIQTVTKKVLVAPPPDEEANA TSAVVSLLNETVTEVPEETKMVIKKGLEFKDGMNVLGLIGFFIAFGIAMGKMGDQAKLMV DFFNILNEIVMKLVIMIMWYSPLGIACLICGKIIAIKDLEVVARQLGMYMVTVIIGLIIH GGIFLPLIYFVVTRKNPFSFFAGIFQAWITALGTASSAGTLPVTFRCLEENLGIDKRVTR FVLPVGATINMDGTALYEAVAAIFIAQMNGVVLDGGQIVTVRWIQGNWSALSLDTMDGEG KGVVGGSVDHTQGKCLCKDGPKPGNDGNRGKNSSNHRLSRGLNLPNPHSHPGKRRRGQYP QCRAGHHAPHSDSRGPANRGHQPAGGCGLAAEFQTISHFPVGGFRLRDEDEREYLGTCFS FSEPDPVTQCEVTRSSPSLIFGSWDRMRTSVNVVGDSFGAGIVYHLSKSELDTIDSQHRV HEDIEMTKTQSIYDDMKNHRESNSNQCVYAAHNSVIVDECKVTLAANGKSADCSVEEEPW KREK >gi568815587r:35160897_35417517|GENSCAN_predicted_CDS_3|3075_bp atgagggaagctcataaacaagaggcaaatgaaacaatgggaaacaaggaaatggggctt gcattcagagtttcaacagtgggtgagacaaactggggagtgggagagagcctcaaggca tctggtaaggaacctggagcttgcctctgtgcatgggatggcagcaatggaaaacagtgc agggtcctgggtctgagggtcatcctctttctgccggtgcccctgccacctgctaagctc ttgtctttggtggaacaggaaataccactcagtgcaggtacagacctagcctgccttgta cacctccgtgtcctggtcctttgtagatgcttgaatgagaaccaatcagtgaggaaaagt gaccttgggctcctgagccaaatgacagcagtcgataggacagtttttgatgtctttgag gtcaaaacactggctaggaaaactttaaagcagctgacagctttttctctgcagtatagc agctggtgcattgaccacttcctgctgttttgtgcagacacgcagcaaaacaaggcagtc agcaggcacaacccatggatgctttttttcccagtgaactccagtacacttacaagtgct aatcatctgaaatggctcttgacaccagagccagaaggaattcaaggactagtcaggaaa atagtgaaagtatggagagttattgcaagtgaaaagtatacactcaagaaaggggagtgt gtgtgtactcaagaaagtcatgcccagtggggtttgcggcttctaactttacgagtttct ttaaccaaggagtggaatattcatgaaggttcctggaaaaaggtgaagatttctcagaac tgtggtgccacccatttttacaccaaatatggttgtttcggaactgtcatggtgctgggt gacctagtgtcttccagtagggcaatctctggacatctttatctccccagtacctctcca gatcctgaagcctggggccagggaggagaggttagatgtcagcagtggaaggacagtgcc aacaatatgcccaagcaggtggaagtgcgaatgcacgacagtcatcttggctcagaggaa cccaagcaccggcacctgggcctgcgcctgtgtgacaagctggggaagaatctgctgctc accctgacggtgtttggtgtcatcctgggagcagtgtgtggagggcttcttcgcttggca tctcccatccaccctgatgtggttatgttaatagccttcccaggggatatactcatgagg atgctaaaaatgctcattctccctctaatcatctccagcttaatcacagggttgtcaggc ctggatgctaaggctagtggccgcttgggcacgagagccatggtgtattacatgtccacg accatcattgctgcagtactgggggtcattctggtcttggctatccatccaggcaatccc aagctcaagaagcagctggggcctgggaagaagaatgatgaagtgtccagcctggatgcc ttcctggaccttattcgaaatctcttccctgaaaaccttgtccaagcctgctttcaacag attcaaacagtgacgaagaaagtcctggttgcaccaccgccggacgaggaggccaacgca accagcgctgttgtctctctgttgaacgagactgtgactgaggtgccggaggagactaag atggttatcaagaagggcctggagttcaaggatgggatgaacgtcttaggtctgataggg tttttcattgcttttggcatcgctatggggaagatgggagatcaggccaagctgatggtg gatttcttcaacattttgaatgagattgtaatgaagttagtgatcatgatcatgtggtac tctcccctgggtatcgcctgcctgatctgtggaaagatcattgcaatcaaggacttagaa gtggttgctaggcaactggggatgtacatggtaacagtgatcataggcctcatcatccac gggggcatctttctccccttgatttactttgtagtgaccaggaaaaaccccttctccttt tttgctggcattttccaagcttggatcactgccctgggcaccgcttccagtgctggaact ttgcctgtcacctttcgttgcctggaagaaaatctggggattgataagcgtgtgactaga ttcgtccttcctgttggagcaaccattaacatggatggtacagccctttatgaagcggta gccgccatctttatagcccaaatgaatggtgttgtcctggatggaggacagattgtgact gtaagatggattcaaggaaattggagtgctctgagccttgacaccatggacggggagggg aagggtgtggttggggggtctgtagatcacacccagggcaagtgtctatgcaaggatggg cctaaacctgggaatgatggcaataggggaaaaaacagctcaaaccacaggctctccagg ggccttaatcttccaaaccctcacagccaccctggcaagcgtcggcgcggccagtatccc cagtgccgggctggtcaccatgctcctcattctgacagccgtgggcctgccaacagagga catcagcctgctggtggctgtggactggctgctgagttccagacgatttcacatttccct gttggaggatttagactgagagatgaagatgagagagagtaccttggtacttgtttttct ttcagtgaaccagacccagtgactcagtgtgaagtaactcggtcaagcccctctctcatt tttggttcctgggacaggatgagaacttcagtcaatgttgtgggtgactcttttggggct gggatagtctatcacctctccaagtctgagctggataccattgactcccagcatcgagtg catgaagatattgaaatgaccaagactcaatccatttatgatgacatgaagaaccacagg gaaagcaactctaatcaatgtgtctatgctgcacacaactctgtcatagtagatgaatgc aaggtaactctggcagccaatggaaagtcagccgactgcagtgttgaggaagaaccttgg aaacgtgagaaataa >gi568815587r:35160897_35417517|GENSCAN_predicted_peptide_4|551_aa MEPPCGSSIQALGPPVKVAISVGPREPCLKTPGPCCKVVTLVRATQVLADPGSGISRSLG LDWDEEGYRGGHLDLLQKKKQKQKPDYFHMLQDMNLSSPPLFDSYPQCRYSTNFLNPCTL MEAARKSLDHLEVEGHFVASMVSPRMGVAEAQPMRLFSGPLPRSIRKEALSIAEIIFSED AKNLQRGLGSSQMFFLRTPYSDMVWDLVATRKSSILSVASAIFLLDGDDHEQGQSTMVHR EFTPLYHLQFLLVSCKATIPPVKDGMEPQLISNFSKVSGYKINVQKSQAFLYTNNTQTES QIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRIN IVKMAILPKSLKLTKHIVRYDRRIAGPAASSTGRAKTWVSQHALTVVLRITRGLEHPESI HSDDKKNQAEHTQAPEPANLVRTPLMTPRGSQEDKDKDEDEDPGPAGEWQLLSTHFQPQP VVPAEASSLKDFSLSDTSFKAKAKGDTKHLIVLHGRGKEVAIDTDEQICLLVFNNMEFLP HKFHLDLEIIS >gi568815587r:35160897_35417517|GENSCAN_predicted_CDS_4|1656_bp atggaacccccgtgtggtagtagcatccaagctttgggccctccagtcaaggtggctatt tcagttgggccacgagaaccatgtctaaaaaccccaggtccttgctgcaaagtcgtcact ctggtcagagccacacaagtacttgcagacccgggcagtggcatcagcaggtcactgggc ctggactgggatgaggagggctacagaggaggacacctggatctgcttcagaaaaaaaag caaaaacaaaaaccagattacttccacatgcttcaagatatgaatctatcatctccccct ctctttgactcatatcctcaatgtcgctattcgacaaatttcctcaacccctgcacattg atggaagctgctaggaaaagcctggatcacttggaggttgaaggtcattttgtagccagc atggtcagcccaagaatgggtgtggctgaagcccagccaatgagactcttctctgggcct cttcctagaagtatcagaaaagaggcactttccattgctgaaatcattttctctgaggat gctaagaacctgcagaggggtctgggtagcagtcaaatgttctttctcagaacaccttac agcgacatggtttgggaccttgtggctacaagaaaatctagtattctatctgtggcctct gctatatttctgttggatggggatgatcacgagcagggtcagtcaaccatggtccacaga gaattcactccattgtaccaccttcagtttctcctagtcagctgcaaggccaccatccca ccagttaaggatgggatggaaccccagctgataagcaacttcagcaaagtctcaggatac aaaatcaatgtgcaaaaatcacaagcattcttatacaccaacaacacacaaacagagagc caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatc caacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaagaa ataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaat attgtgaaaatggccatactgcccaagtctttaaaactcacaaagcatattgtgagatat gacagaaggattgctggtccagcagccagcagcacaggcagggctaaaacctgggtttct caacatgctttgactgtggtcctcaggataacaaggggactggaacatcctgagtccatc cacagtgatgacaaaaaaaaccaggctgaacatacccaagctccggaaccagcaaatctt gttcgaaccccgctgatgactcccaggggaagccaagaggacaaagacaaggatgaggac gaggacccaggacccgctggtgaatggcaactgctgtcaactcactttcaacctcagcca gttgtcccagctgaagcatcatcacttaaggacttttccttgtctgacacatcctttaaa gccaaagccaaaggagacacaaaacacctaattgtgttgcatggaagaggaaaagaagtg gccattgatactgatgaacagatttgtcttttggtttttaataacatggagtttctgcct cacaagtttcatctggatttagaaattatttcctga