GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:32:17 Sequence gi568815597f:10350074_10551592 : 201519 bp : 46.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2558 2726 169 0 1 107 19 206 0.506 14.60 1.02 Intr + 10856 10970 115 2 1 91 78 149 0.999 14.65 1.03 Intr + 11619 11752 134 2 2 94 88 124 0.800 12.44 1.04 Intr + 13210 13271 62 1 2 76 115 4 0.920 0.28 1.05 Intr + 15027 15226 200 1 2 86 3 154 0.514 5.77 1.06 Intr + 15468 15575 108 2 0 83 70 196 0.589 17.78 1.07 Intr + 18394 18465 72 0 0 105 105 69 0.995 9.90 1.08 Intr + 21068 21189 122 1 2 77 100 34 0.982 2.79 1.09 Intr + 24243 24392 150 0 0 79 113 98 0.995 10.68 1.10 Intr + 24781 24973 193 1 1 85 73 156 0.959 13.29 1.11 Intr + 25182 25300 119 2 2 87 107 164 0.962 17.46 1.12 Term + 26472 26514 43 0 1 130 42 29 0.828 -0.87 1.13 PlyA + 26747 26752 6 -0.45 2.00 Prom + 30887 30926 40 -2.86 2.01 Init + 48901 49052 152 0 2 69 81 95 0.761 6.53 2.02 Intr + 49556 49631 76 2 1 114 91 190 0.995 21.42 2.03 Intr + 50320 50499 180 0 0 102 73 203 0.987 20.26 2.04 Intr + 52998 53063 66 2 0 77 98 79 0.987 6.90 2.05 Intr + 54088 54206 119 0 2 76 89 46 0.753 2.76 2.06 Intr + 57998 58037 40 2 1 123 0 59 0.695 -1.17 2.07 Intr + 61345 61479 135 0 0 107 94 247 0.993 27.96 2.08 Intr + 62989 63178 190 0 1 76 23 240 0.973 15.46 2.09 Intr + 66914 67044 131 0 2 75 81 160 0.977 14.41 2.10 Intr + 67195 67278 84 0 0 62 44 70 0.502 0.02 2.11 Intr + 67303 67436 134 0 2 45 115 127 0.991 10.54 2.12 Intr + 68753 68852 100 2 1 75 91 140 0.997 13.11 2.13 Intr + 69344 69466 123 1 0 90 115 59 0.999 9.58 2.14 Term + 69557 69676 120 1 0 117 43 115 0.999 8.37 2.15 PlyA + 70050 70055 6 1.05 3.00 Prom + 73045 73084 40 -3.16 3.01 Init + 76278 76385 108 2 0 82 35 205 0.908 12.83 3.02 Intr + 80430 80495 66 2 0 61 92 66 0.247 3.40 3.03 Intr + 83769 83892 124 2 1 73 48 154 0.911 10.06 3.04 Intr + 84584 84617 34 0 1 121 98 37 0.722 5.28 3.05 Term + 90159 90297 139 0 1 58 41 64 0.138 -3.86 3.06 PlyA + 91017 91022 6 1.05 4.00 Prom + 92162 92201 40 -5.16 4.01 Init + 100151 100249 99 1 0 65 105 209 0.987 18.57 4.02 Term + 103912 103944 33 0 0 82 55 19 0.273 -4.41 4.03 PlyA + 107830 107835 6 1.05 5.08 PlyA - 109872 109867 6 1.05 5.07 Term - 111629 111417 213 2 0 53 43 231 0.994 12.33 5.06 Intr - 113136 112985 152 1 2 118 64 143 0.999 14.78 5.05 Intr - 113547 113358 190 0 1 53 97 139 0.982 10.46 5.04 Intr - 117259 117117 143 2 2 63 103 138 0.995 12.87 5.03 Intr - 119265 119104 162 1 0 61 110 152 0.997 14.65 5.02 Intr - 121851 121792 60 1 0 69 92 71 0.949 4.31 5.01 Init - 122385 122205 181 0 1 69 76 203 0.905 16.55 5.00 Prom - 122986 122947 40 -5.66 6.00 Prom + 123278 123317 40 -6.36 6.01 Init + 124894 124929 36 0 0 91 110 30 0.407 5.55 6.02 Intr + 145201 145248 48 0 0 101 119 25 0.718 5.78 6.03 Intr + 145616 145648 33 1 0 83 115 3 0.477 0.92 6.04 Term + 145840 145851 12 0 0 129 41 17 0.579 -0.70 6.05 PlyA + 146927 146932 6 1.05 7.05 PlyA - 147025 147020 6 1.05 7.04 Term - 157803 157465 339 0 0 57 37 125 0.106 -1.16 7.03 Intr - 161029 160945 85 0 1 10 59 134 0.072 2.52 7.02 Intr - 176841 176609 233 0 2 6 100 145 0.028 4.07 7.01 Init - 183537 183445 93 0 0 88 103 30 0.485 4.88 7.00 Prom - 189537 189498 40 -3.26 8.04 PlyA - 190242 190237 6 1.05 8.03 Term - 191276 191209 68 0 2 123 45 41 0.054 1.40 8.02 Intr - 199504 199466 39 2 0 118 101 9 0.700 3.40 8.01 Intr - 201433 201296 138 2 0 2 100 136 0.817 6.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 97072 96968 105 1 0 31 56 125 0.807 3.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:10350074_10551592|GENSCAN_predicted_peptide_1|495_aa XRIRNKPEVDEAAVDAILSLNIISAKYLKSSHNSSRWDTQSSVKKSTLAGVNWYTVRTFY RFEAVWDSSLHNSLLLNRVTPYGEKIYMTLSAYLELDHCIQPAVITKDVCMVFYSRDAKI SPPRSLRSLFGSGYSKSPDSNRVTGIYELSLCKMSDTGSPGMQRRRRKILDTSVAYVRGE ENLAGWRPRGDSLILEHQWELEKLELLHEVSRGRVVQMQELSDKIAKISTTTFESAITPS ESSGYDSGDIESLVDREKELATKCLQLLTHTFNREFSQVHGSVSDCKLSDISPIGRDPSE SSFSSATLTPSSTCPSLVDSRSNSLDQKTPEANSRASSPCPEFEQFQIVPAVETPYLARA GKNEFLNLVPDIEEIRPSSVVSKKGYLHFKEPLYSNWAKHFVVVRRPYVFIYNSDKDPVE RGIINLSTAQVEYSEDQQAMVKTPNTFAVCTKHRGVLLQALNDKDMNDWLYAFNPLLAGT IRSKLSRRCPSQSKY >gi568815597f:10350074_10551592|GENSCAN_predicted_CDS_1|1488_bp ngtcgtattcggaataagcctgaggtggatgaagctgcagttgatgccatcctctcccta aatattatttctgccaagtacctgaagtcttcccacaactctagcaggtgggacacccag agcagtgtgaagaagtccacacttgcaggcgttaattggtacaccgttaggaccttctac cgctttgaggctgtgtgggatagctctctgcataactcccttcttctgaaccgagtgaca ccctatggagaaaagatctacatgaccttgtcggcctacctagagctggatcattgcatc cagccggctgtcatcaccaaggatgtgtgcatggtcttctactcccgagatgccaagatc tcaccaccacgctctctgcgtagcctctttggcagcggctactcaaagtcaccagattcg aatcgagtcactggcatttacgaactcagcttatgcaaaatgtcagacacaggtagtcca ggtatgcagagaaggagaagaaaaatcttagatacgtcagtggcatatgtgcggggagaa gagaacttagcaggctggcggccccgtggagacagcctcatccttgagcaccagtgggag ctggagaagctggagctcctacatgaggtatccaggggcagggttgttcagatgcaagaa ctctcggacaagattgccaaaatctcaaccactacctttgaaagcgccatcacacctagc gagagcagtggctatgattcaggagacatcgaaagcctggtggaccgagagaaagagctg gctaccaagtgcctgcaacttctcacccacactttcaacagagaattcagccaggtgcac ggcagcgtcagtgactgtaagttgtctgatatctctccaattggacgggatccctctgag tccagtttcagcagtgccaccctcactccctcctccacctgtccctctctggtagactct aggagcaactctctggatcagaagaccccagaagccaattcccgggcctctagtccctgc ccagaatttgaacagtttcagattgtcccagctgtggaaacaccatatttggcccgagca ggaaaaaacgaatttctcaatcttgttccagatattgaagaaattagaccaagctcagtg gtctctaagaaaggataccttcatttcaaggagcctctttacagtaactgggctaaacat tttgttgtcgtccgtcggccttatgtcttcatctataacagtgacaaagaccctgtggag cgtggaatcattaacctgtccacagcacaggtggagtacagtgaggaccagcaggccatg gtgaagacaccaaacacctttgctgtctgcacaaagcaccgtggggtccttttgcaggcc ctcaatgacaaagacatgaacgactggttgtatgccttcaacccacttctagctggcaca atacggtcaaagctttcccgcagatgcccgagccagtcgaaatactaa >gi568815597f:10350074_10551592|GENSCAN_predicted_peptide_2|549_aa MAPPPSIRLAGAEKPGVSGRSFWREPLRVFPSLVLRASPLFGSALSAAMAQADIALIGLA VMGQNLILNMNDHGFVVCAFNRTVSKVDDFLANEAKGTKVVGAQSLKEMVSKLKKPRRII LLVKAGQAVDDFIEKLVPLLDTGDIIIDGGNSEYRDTTRRCRDLKAKGILFVGSGVSGGE EGARYGPSLMPGGNKEAWPHIKTIFQGIAAKVGDEGAGHFVKMVHNGIEYGDMQLICEAY HLMKDVLGMAQDEMAQAFEDWNKTELDSFLIEITANILKFQDTDGKHLLPKIRDSAGQKG TGKWTAISALEYGVPVTLIGEAVFARCLSSLKDERIQASKKLKGPQKFQFDGDKKSFLED IRKNIGPSGISTADENKTGRHKAVTLLMAILALYASKIISYAQGFMLLRQAATEFGWTLN YGGIALMWRGGCIIRSVFLGKIKDAFDRNPELQNLLLDDFFKSAVENCQDSWRRAVSTGV QAGIPMPCFTTALSFYDGYRHEMLPASLIQAQRDYFGAHTYELLAKPGQFIHTNWTGHGG TVSSSSYNA >gi568815597f:10350074_10551592|GENSCAN_predicted_CDS_2|1650_bp atggctccacccccttccattcgattggccggcgccgaaaagccgggcgtgagcggccgc agtttctggagggagccgctgcgggtctttccctcactcgtcctccgcgcgtcgccgctc ttcggttctgctctgtccgccgccatggcccaagctgacatcgcgctgatcggattggcc gtcatgggccagaacttaattctgaacatgaatgaccacggctttgtggtctgtgctttt aataggactgtctccaaagttgatgatttcttggccaatgaggcaaagggaaccaaagtg gtgggtgcccagtccctgaaagagatggtctccaagctgaagaagccccggcggatcatc ctcctggtgaaggctgggcaagctgtggatgatttcatcgagaaattggtaccattgttg gatactggtgacatcatcattgacggaggaaattctgaatatagggacaccacaagacgg tgccgagacctcaaggccaagggaattttatttgtggggagcggagtcagtggtggagag gaaggggcccggtatggcccatcgctcatgccaggagggaacaaagaagcgtggccccac atcaagaccatcttccaaggcattgctgcaaaagtgggagatgagggagcaggccacttc gtgaagatggtgcacaacgggatagagtatggggacatgcagctgatctgtgaggcatac cacctgatgaaagacgtgctgggcatggcgcaggacgagatggcccaggcctttgaggat tggaataagacagagctagactcattcctgattgaaatcacagccaatattctcaagttc caagacaccgatggcaaacacctgctgccaaagatcagggacagcgcggggcagaagggc acagggaagtggaccgccatctccgccctggaatacggcgtacccgtcaccctcattgga gaagctgtctttgctcggtgcttatcatctctgaaggatgagagaattcaagctagcaaa aagctgaagggtccccagaagttccagtttgatggtgataagaaatcattcctggaggac attcggaagaatattggcccttctgggatctccactgctgatgagaataagactggtaga cataaggcggtcactctcctaatggcaatcctagcactctacgcttccaagatcatctct tacgctcaaggctttatgctgctaaggcaggcagccaccgagtttggctggactctcaat tatggtggcatcgccctgatgtggagagggggctgcatcattagaagtgtattcctagga aagataaaggatgcatttgatcgaaacccggaacttcagaacctcctactggacgacttc tttaagtcagctgttgaaaactgccaggactcctggcggcgggcagtcagcactggggtc caggctggcattcccatgccctgttttaccactgccctctccttctatgacgggtacaga catgagatgcttccagccagcctcatccaggctcagcgggattacttcggggctcacacc tatgaactcttggccaaaccagggcagtttatccacaccaactggacaggccatggtggc accgtgtcatcctcgtcatacaatgcctga >gi568815597f:10350074_10551592|GENSCAN_predicted_peptide_3|156_aa MRPHSPALGWSMGLGAVEQGATLIGEARATQEPTEWGRPAVMEEEAETEEQQRFSYQQRL KAAVHYTVGCLCEEVALDKEMQFSKQTIAAISELTFRQCENFAKDLEMFARMGYLEIPAS LKSDRELLCFIKTLNNTKHFGDLLLPFLQTCEKNHN >gi568815597f:10350074_10551592|GENSCAN_predicted_CDS_3|471_bp atgcgcccacactccccagcccttgggtggtcgatgggactgggcgccgtggagcagggg gccacgctcatcggggaggctcgggccacacaggagcccacggagtggggtcggcccgca gtgatggaggaggaggcggagaccgaggagcagcagcgattctcttaccaacagaggcta aaggcagcagttcactatactgtgggttgtctttgcgaggaagttgcattggacaaagag atgcagttcagcaaacagaccattgcggccatttcggagctgactttccgacagtgtgaa aattttgccaaagaccttgaaatgtttgcaaggatgggctacttagagatccctgctagt ttgaagtctgaccgtgaacttctgtgtttcatcaaaactttaaataataccaaacatttt ggtgacttgcttctgccctttctgcagacatgcgaaaagaaccacaattaa >gi568815597f:10350074_10551592|GENSCAN_predicted_peptide_4|43_aa MPLSPGLLLLLLSGATATAALPLEGGPTGRDSEDLAPEQLPNC >gi568815597f:10350074_10551592|GENSCAN_predicted_CDS_4|132_bp atgccattgtcccccggcctcctgctgctgctgctctccggggccacggccaccgctgcc ctgcccctggagggtggccccaccggccgagacagcgaggatttggcaccagaacagctc cctaactgctga >gi568815597f:10350074_10551592|GENSCAN_predicted_peptide_5|366_aa MEVTGDAGVPESGEIRTLKPCLLRRNYSREQHGVAASCLEDLRSKGWLGTPGEGVGEPGP GTYAGKRELLLTISGGPDKEPCDILAIDKSLTPVTLVLAEDGTIVDDDDYFLCLPSNTKF VALASNEKWAYNNSDGGTAWISQESFDVDETDSGAGLKWKNVARQLKEDLSSIILLSEED LQMLVDAPCSDLAQELRQSCATVQRLQHTLQQVLDQREEVRQSKQLLQLYLQALEKEGSL LSKQEESKAAFGEEVDAVDTGISRETSSDVALASHILTALREKQAPELSLSSQDLELVTK EDPKALAVALNWDIKKTETVQEACERELALRLQQTQSLHSLRSISASKASPPGDLQNPKR ARQDPT >gi568815597f:10350074_10551592|GENSCAN_predicted_CDS_5|1101_bp atggaggtgaccggggacgccggggtaccagaatctggcgagatccggactctaaagccg tgtctgctgcgccgcaactacagccgcgaacagcacggcgtggccgcctcctgcctcgaa gacctgaggagcaagggttggctcgggaccccgggcgagggtgtgggggagccagggccg ggaacctatgcaggaaagagggagctgctactgaccatttctggaggtccagacaaggaa ccctgtgacattctggccattgataagtccctgacaccagtcaccctggtcctggcagag gatggcaccatagtggatgatgacgattactttctgtgtctaccttccaatactaagttt gtggcattggctagtaatgagaaatgggcatacaacaattcagatggaggtacagcttgg atttcccaagagtcctttgatgtagatgaaacagacagcggggcagggttgaagtggaag aatgtggccaggcagctgaaagaagatctgtccagcatcatcctcctatcagaggaggac ctccagatgcttgttgacgctccctgctcagacctggctcaggaactacgtcagagttgt gccaccgtccagcggctgcagcacacactccaacaggtgcttgaccaaagagaggaagtg cgtcagtccaagcagctcctgcagctgtacctccaggctttggagaaagagggcagcctc ttgtcaaagcaggaagagtccaaagctgcctttggtgaggaggtggatgcagtagacacg ggtatcagcagagagacctcctcggacgttgcgctggcgagccacatccttactgcactg agggagaagcaggctccagagctgagcttatctagtcaggatttggagttggttaccaag gaagaccccaaagcactggctgttgccttgaactgggacataaagaagacggagactgtt caggaggcctgtgagcgggagctcgccctgcgcctgcagcagacgcagagcttgcattct ctccggagcatctcagcaagcaaggcctcaccacctggtgacctgcagaatcctaagcga gccagacaggatcccacatag >gi568815597f:10350074_10551592|GENSCAN_predicted_peptide_6|42_aa MASSEQAEQPSQPSSTPGSENVLPREPLGASASVVFKEQGLL >gi568815597f:10350074_10551592|GENSCAN_predicted_CDS_6|129_bp atggcgtcctcggagcaggcagagcagccgagccagccaagctctactccaggaagtgaa aatgtgctgcctcgagagccgctgggtgcgagcgcctcagtggtctttaaagaacagggc ctgctgtaa >gi568815597f:10350074_10551592|GENSCAN_predicted_peptide_7|249_aa MECSVISNSFKKNVGDIAKNSINERIQFKHQTHQKTEESQQSRSATNGMPPSPRSCYRMS IPNLKIQNPKCSKIQNFLSANMTVKGHAQRKCSLEHFGFGIFRFEMLNQQTSSSAACTFN DAYRCTTATEQDVENEEVNFAALTLTMRNLWMLIETQIKDLPEGAEKVEIKDPGYCIFFT YADSSDSLETCRHHFKSWSPEQIAQLSNLLPASKRQNNKRLLPPRSYQGHSTLLYRHHMK RSYYLLSIP >gi568815597f:10350074_10551592|GENSCAN_predicted_CDS_7|750_bp atggagtgctctgtcattagtaattcatttaagaagaacgtgggggatatagcaaagaac agcataaacgagcgaatccaatttaaacatcagacccaccagaaaacagaagaaagtcaa caatcaagatctgctactaacgggatgccaccaagtcctagatcctgctacaggatgagc atccctaatctgaaaatccaaaatccaaaatgctccaaaatccagaactttttgagtgcc aacatgacagtcaaaggtcatgcacaaaggaaatgctcactggagcatttcggatttggg attttcagatttgagatgctcaaccaacaaaccagcagctctgctgcctgcacatttaac gatgcctatcggtgcaccacagctacggaacaggatgtggagaacgaggaggtgaacttt gctgccttaactctgacaatgaggaatttgtggatgttgatagaaacacagataaaggat ttacctgagggggcagagaaagtagaaataaaagatcctggatattgtattttctttact tatgcggattccagtgactccctggagacctgcaggcaccactttaagagctggagtcca gagcaaattgctcaattgtccaaccttctccccgccagcaagagacagaacaacaagcgg ctgctgccccctagaagctaccaggggcatagcaccttgctatatagacatcacatgaaa agatcctattatctcctctccatcccataa >gi568815597f:10350074_10551592|GENSCAN_predicted_peptide_8|81_aa XSPSRIRAAGIKLPNAPASSALCNDIHANGDELGFMEVRLLRMDLKGLPKGEPLIGKKKC SKTKPVLSPLVALTPEKGFYR >gi568815597f:10350074_10551592|GENSCAN_predicted_CDS_8|246_bp naatctccaagccgaatccgtgctgctgggataaagctgcctaatgcaccagcctccagc gcgctgtgtaatgacatccacgctaatggagatgagctcggcttcatggaggttcgactc ctgagaatggatcttaaaggactgcctaaaggagagcctttgattgggaaaaaaaaatgt tccaaaaccaaacccgttttgtctcccctggtagctctcactcctgagaaaggcttttac cgctaa