GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:14:17

Sequence gi568815577f:38022332_38256399 : 234068 bp : 40.88% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4812   5056  245  2  2   54   72   154 0.632   7.65
 1.02 Intr +   5450   5481   32  2  2   90   82    19 0.628  -1.64
 1.03 Intr +   6386   6489  104  0  2   20   87   173 0.543   9.37
 1.04 Term +  25811  26077  267  1  0    6   43   317 0.125  13.51
 1.05 PlyA +  26743  26748    6                               1.05

 2.00 Prom +  29277  29316   40                              -9.15
 2.01 Init +  30047  30135   89  1  2    4   82   133 0.936   4.46
 2.02 Intr +  30977  31062   86  2  2  107   96    42 0.880   5.44
 2.03 Intr +  31478  31504   27  0  0   89   47    76 0.437   0.77
 2.04 Term +  31936  32168  233  2  2   57   45   141 0.617   2.35
 2.05 PlyA +  32190  32195    6                               1.05

 3.00 Prom +  36731  36770   40                              -8.25
 3.01 Init +  37197  37406  210  2  0   90   80   199 0.276  18.40
 3.02 Term +  42387  42500  114  2  0   -5   53   107 0.030  -4.51
 3.03 PlyA +  43118  43123    6                               1.05

 4.00 Prom +  44175  44214   40                              -4.75
 4.01 Init +  51337  51418   82  0  1   50  111    69 0.379   6.58
 4.02 Intr +  51518  51646  129  0  0   39   17   131 0.031   0.85
 4.03 Term +  67149  67309  161  1  2   22   37   199 0.015   5.32
 4.04 PlyA +  67451  67456    6                               1.05

 5.00 Prom +  68757  68796   40                              -3.65
 5.01 Init +  72201  72234   34  2  1   44   97    56 0.482   2.18
 5.02 Term +  76399  76469   71  2  2  138   42    63 0.533   3.82
 5.03 PlyA +  76917  76922    6                               1.05

 6.00 Prom +  81834  81873   40                              -5.05
 6.01 Init +  91071  91279  209  2  2   56   35   153 0.064   5.18
 6.02 Intr +  92505  92631  127  0  1   52  119    63 0.052   5.56
 6.03 Intr +  98734  98936  203  0  2    7   92   139 0.132   3.36
 6.04 Intr + 108011 108227  217  2  1   81   64   118 0.151   6.28
 6.05 Term + 118288 118428  141  0  0   82   48   159 0.353   8.25
 6.06 PlyA + 118687 118692    6                               1.05

 7.00 Prom + 134987 135026   40                              -5.75
 7.01 Init + 136697 136778   82  1  1   52   46    47 0.222  -1.92
 7.02 Intr + 138325 138575  251  2  2   89  113   197 0.800  18.53
 7.03 Term + 138703 138879  177  0  0   45   49   104 0.755  -1.10
 7.04 PlyA + 139882 139887    6                               1.05

 8.10 PlyA - 140297 140292    6                               1.05
 8.09 Term - 141456 141311  146  1  2  -68   38   296 0.788   6.09
 8.08 Intr - 143774 143623  152  1  2   16   59    65 0.140  -4.71
 8.07 Intr - 146543 146381  163  0  1   79   94    71 0.422   4.91
 8.06 Intr - 148515 148410  106  0  1   78   78    92 0.798   6.07
 8.05 Intr - 149704 149571  134  2  2   68   48    55 0.863  -1.06
 8.04 Intr - 149900 149772  129  0  0   22   53   111 0.560   0.75
 8.03 Intr - 150271 149928  344  0  2   -5   32   285 0.782   7.75
 8.02 Intr - 151471 151342  130  2  1   69   16   155 0.695   5.23
 8.01 Init - 152452 152389   64  1  1   56   78    59 0.564   3.06
 8.00 Prom - 155035 154996   40                              -4.75

 9.00 Prom + 156052 156091   40                              -7.75
 9.01 Init + 156121 156541  421  0  1   86   53   523 0.186  45.19
 9.02 Intr + 180148 180186   39  2  0  132   93    17 0.070   3.88
 9.03 Term + 180377 180501  125  0  2  103   47    73 0.073   2.27
 9.04 PlyA + 181753 181758    6                               1.05

10.04 PlyA - 181833 181828    6                               1.05
10.03 Term - 183322 182848  475  0  1   12   49   341 0.276  15.97
10.02 Intr - 184650 184419  232  1  1  104    9   169 0.946   6.51
10.01 Init - 185476 185299  178  1  1   71   97    39 0.855   2.57
10.00 Prom - 201844 201805   40                              -0.45

11.04 PlyA - 202322 202317    6                               1.05
11.03 Term - 205420 205091  330  1  0   27   32   245 0.356   6.47
11.02 Intr - 205974 205797  178  2  1  -16   53   165 0.518   1.60
11.01 Init - 206754 206684   71  0  2   64   28    80 0.770   0.17
11.00 Prom - 207205 207166   40                              -0.45

12.03 PlyA - 207705 207700    6                               1.05
12.02 Term - 216372 215896  477  0  0   45   42   313 0.160  16.25
12.01 Init - 233017 232964   54  1  0   65   75    42 0.076   1.93

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  11233  10970  264  1  0   66   42   224 0.825  10.55
S.002 Term +  25851  26077  227  1  2   57   43   259 0.869  14.26
S.003 Sngl +  93408  93899  492  2  0   -3   29   339 0.825  14.70

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_1|215_aa
MIRKKHGTSQLQALKGSALQRIIRHSWYISYLDHTVQPVLGQVAERVACPLYEERNLLLN
LLAELFSDLTSDSKVMEIFLASVPLPATPKHSAWMTRARLHLEEEEEEEEGEGGGGGGGG
GGGGGGKTLKLDTEAGVLCAENANAEIVVPSGNTSCPTADPCPDALLGNQLVKCWECCDA
SARIHLELLLIRIHHSWHEFTSDGSKMFLPVSNAS
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_1|648_bp
atgataaggaaaaagcatggaacttcacaactacaagctctgaaagggtctgctctacag
aggatcatcagacattcctggtacatatcctatttggatcacactgtgcaaccagtccta
ggtcaggtggcagaaagggttgcctgccctctttatgaggagaggaatctacttcttaac
ctgcttgctgaactcttcagtgacctcacctctgacagcaaagtcatggagatctttctg
gccagtgtgcctctccccgcaacacccaaacactcagcctggatgacaagagcgagactc
catctagaagaagaagaagaggaagaagaaggagaaggaggaggaggaggtggtggtgga
ggaggaggaggaggaggaaagacactgaaattagacactgaagctggtgtcctctgtgca
gagaatgctaatgctgagattgtggtccctagtggcaacaccagctgccccactgccgac
ccgtgccctgacgctcttcttggaaatcagcttgtcaaatgttgggagtgctgcgatgcg
tctgcacggatccatttagaactccttctcattcgcattcatcacagctggcatgaattc
acaagtgatggcagcaaaatgtttctgccagtttctaatgcctcttag
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_2|144_aa
MKQKAPLTLEEQTGTQEAFTAGSHGGLLSSVSSDLLEIRASPIAAERSNHFLVITACKDK
AATDGEVDTHTLPKNPAVLCESGRKSPLFVDLLSLGEWLVPTGSTRNLHDRKAWAQDKPF
HCHAAIMAVVFPYGLTWPLSRGRH
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_2|435_bp
atgaaacagaaagctcccctaaccctggaagagcaaacaggaacacaggaagcgttcaca
gctggctctcatggaggcctgctttccagtgtgtcctcggatctgctggaaatcagagcc
agtccaatagctgcagaaaggtccaatcatttccttgtaattactgcttgtaaggacaaa
gcggccacagatggagaagtagacacgcacacccttcccaaaaacccagcagttctctgt
gagtctggtagaaagtccccattatttgtggacctactcagccttggtgagtggctggtc
ccaacaggatccacaagaaacttgcatgaccgcaaggcctgggcacaagacaaacccttc
cactgccatgctgccattatggctgttgtttttccatatggtctcacatggcctctcagc
aggggacgacactga
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_3|107_aa
MMVEMEQEVKVVMEVGVGAVMVMVIVMVMVVVMVGGGDGGDIGDGVACDAVVVMIMVVVV
VVLMVMVVMEEQQVLKKDAGTRDPETQPGGLTASGKLCFGTRFGAYL
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_3|324_bp
atgatggtggaaatggagcaggaggtaaaagttgtgatggaggtaggggtgggggctgtg
atggtaatggtaatagtgatggtgatggtggtggtgatggttggtggtggtgatggtggt
gatattggtgatggtgttgcctgtgatgctgtggtagtgatgatcatggtggtggtggtg
gtggtgctaatggtgatggtggtgatggaggaacagcaagtcctgaagaaagatgcagga
acaagggacccagaaacccagccaggtgggcttacggcatctgggaaactgtgttttgga
acaagatttggggcctatttgtga
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_4|123_aa
MWAAERTSGRMTDRCHEFGNVEWMLVLETEVESGSRMGESEEMAFKDGSKNKAGVSHYLS
NLLQSGSNQNILKMKMEEGTTEECGWALEAENDARKEASKGMHPERLWKQIYSPNLQEGT
QPH
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_4|372_bp
atgtgggcagctgagaggactagtggaagaatgacagacaggtgccacgaatttggcaat
gttgaatggatgctggtccttgaaacagaagtggagagtggatctagaatgggtgagagt
gaagagatggcttttaaagatggatcaaagaacaaagctggggtgtcacattacctttca
aatttattacaaagcggtagtaaccaaaacattttgaagatgaagatggaggaagggacc
accgaggaatgtgggtgggctttagaagccgagaacgatgcccgcaaggaagcaagcaag
ggaatgcacccggaacgactttggaagcagatttattccccaaacctccaggaaggaaca
cagccccactaa
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_5|34_aa
MVQGRYTDLNKASPTHFLEAMVLVDYGDLLPPIK
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_5|105_bp
atggttcaagggagatacacggacctgaacaaagcctccccaacccatttcctggaagcc
atggtccttgttgactatggggatcttttgccccctataaagtag
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_6|298_aa
MLDPASHTSSILVVHMETSPLFTVFSLLQLQPLSGQHPSLETSDGPPLQLLPGGSEGPSG
KKFDRHCPIEPIITGRAHTVPTVSRKQLEDKTSIPMPKICHCCSVSGEHGVLLTLRWRLA
CRTFLTGCSWDQYLWRGEAEAGSGSGEVECNAGEAASLSGVNIRGSSSLVKMINDMDTHG
RGLNQRMSVSFSSVGLGHCSFSLGTHAGKRECGTPLAQFPGTWGPSKLTLSDSSDGWTVG
MTLVFLAGNKGQSGSLKNEPVAGCQGVMMVLRKSSRQLSRAAVSLLMPFSHWAVNEDL
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_6|897_bp
atgctggacccagccagtcacaccagcagcattttggtggtccatatggagacctctccc
ctgttcactgttttttctcttctccaactccaacctcttagtggacagcatccaagcctg
gagacatctgatggccccccactgcagctactccctggtggatcagaaggtcccagtgga
aagaagtttgaccgtcactgcccgatcgaaccaataatcacaggtagggcccacactgtg
cccacagtcagcaggaagcagttggaagataagacctccataccaatgccaaagatttgt
cattgttgttctgtcagcggggaacatggagtcctgctgactctgagatggagattagcg
tgcaggacatttcttacgggttgttcttgggatcaatacctgtggagaggagaagctgag
gcaggatcaggaagcggggaagtagagtgcaatgctggtgaagcggcatcactgtctggg
gtaaatatccggggttcatcatctctcgtcaagatgattaacgacatggacacacatggt
aggggtttgaaccagcgaatgtcagtttccttcagctctgttggactggggcactgctcc
tttagcttagggacacatgccgggaagagggaatgtgggactcctctggcccaatttcca
ggcacctggggccccagcaagcttaccttgtcagattcatcagatggttggactgtaggc
atgaccttggtgttccttgcaggaaacaaaggccagagtggttctctgaagaacgagcct
gttgctggctgtcagggtgtcatgatggtattgcgcaaatcctcgaggcagctttccagg
gctgctgtcagcctgctcatgccctttagccactgggcagtaaatgaggatctctag
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_7|169_aa
MHQQPSAKGKHRAAGLTWQRGTPRGMRGLRAAGNAVATRYLSVKNQQFHGAKRVQVGSGW
VGGSVCKTVSWSLSVALKEQRYRGTKDMALCAAGTVSTLSDLPRKSPLKHKMNLGLSSSS
LCSTFSDFQAIKPTGLYFSGQDPHLSAETEKQILPETAAWGSGPLDAFF
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_7|510_bp
atgcatcagcagccttcggcaaaggggaaacaccgtgcagcagggctgacttggcaaaga
ggcacccccagaggaatgcgagggcttagagctgcaggcaatgcagttgctacacggtat
ctcagtgtcaagaatcaacagttccatggagccaagagggtgcaggttggcagtggttgg
gtggggggatctgtttgcaagacggtgtcatggagcctttctgtggcattgaaggaacaa
agatacaggggcacaaaggacatggcactttgtgctgctggcactgtttctacattgagt
gacctccccaggaaatccccactgaaacacaagatgaaccttggactcagttcatcttcc
ctttgctctacattctctgatttccaagcaataaaaccaacagggctgtatttctctgga
caagaccctcacctatcagcagaaacagaaaagcaaatcctccccgaaacagcagcttgg
ggaagtggcccattggatgcctttttctag
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_8|455_aa
MIRAEEHYLLECKSQREKVVQRPPIVQISKRAGEGPLNSVLNGFSAYEQQYPEQQLKASP
IKGVGGTAHTNQLEDPHLTEELNQYENTASSSSCPMPSPCTLRSTTDLHTLVNSKTLKNP
NPKTPRGDKFEVSSRLIQRLCNEPSFSAATGCLGVLTCWLPIRQRTYYGYSHYWHLVVEV
HPVAPRQNVNSPEVEKLWTKPNLRACSGCVTFIQAKLIPVMAGVQMLPALHRCRPRSKLK
TLGRETQPPIGAGQSCLRPWPADLQGLINALVLVWGKAGAGEASRRSMETAPFSPRDTEM
WGARACYLWELISIEEWLAGPCELSSTDIASPVSLVSHVGCSRAAVGVTPKDQDPTGMRN
SAKVSVTIWTPTDLASKIDKECGSFLQSIAFGMQMKMNKQEPNFIMAEVGGRREEKDRDE
EEGEGAGAGVEGEVGGGEEKEEGGGKGGGRGGGEQ
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_8|1368_bp
atgatcagagcagaggaacattacctcctggagtgtaagagccagagagagaaggtggtt
caaagaccaccaatagtgcaaatatccaagagggctggtgagggtcccctgaatagtgtg
ctcaatggctttagtgcatatgagcagcaataccctgagcaacagctgaaagcttctccc
attaaaggagttggtggaacagctcacaccaaccagttagaagaccctcacctcacagag
gaactgaatcagtatgagaatacagcgtcttcatcttcttgtccaatgccttcaccctgc
actcttcgatcaaccactgatctccacactttggtcaactctaaaacccttaaaaaccct
aaccctaaaactcctcggggagacaaatttgaggtttcctcccgtcttattcagcggctc
tgcaatgaaccctctttctctgctgcaactggctgtctcggtgtactgacttgctggctg
cccatcaggcagcgaacctattacggttacagccattactggcatctagtggtggaggtg
catccagtagccccacgacaaaacgtcaatagtcctgaggttgaaaaactgtggactaaa
ccaaacctcagagcctgctcagggtgtgtgactttcattcaggccaaactcatccctgtg
atggcaggagtgcaaatgctccctgctctccatagatgccggccccgaagcaaactcaag
acactgggtcgagaaacacaacctcccataggggcaggacaaagctgccttcgcccctgg
cctgctgacctccaggggctgatcaatgctcttgttctggtttgggggaaggctggtgct
ggagaagcttctagaaggagcatggaaacagccccatttagccccagggatacagagatg
tggggagccagagcctgttacttgtgggaactgatcagcattgaggaatggctcgctggg
ccgtgtgagttgtcaagcactgacatcgcttcacctgtctccctggtcagccatgtaggc
tgtagtagagcagctgtgggggtgaccccaaaggaccaggaccccactggaatgaggaac
tctgcaaaggtcagtgtcaccatctggactccaactgacttggcaagtaaaatagacaaa
gaatgcgggtcttttctacaatcgattgcttttggcatgcagatgaaaatgaataagcag
gagccaaattttataatggctgaagtgggaggaagaagagaagaaaaagacagagatgaa
gaagaaggagaaggagcaggagcaggagtagaaggagaagttggaggaggagaagagaaa
gaggaaggaggaggaaaaggaggaggacgaggaggaggagaacaataa
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_9|194_aa
MGVKMLLVLQSETGRGRRRRGGRGGRRGRRRRRRREEKKKEDEEEEEAVAVAVEEEEGEE
EGEEGEEEGVEEEEEGRRGRKKKKEEERRRKKKKKEEGRRRKKKKNTVWWELYSRLHSWL
LQVKKMRLQEVVRLTWQDEGSKSGISSWFGCIAGMWLPEGQAVVIVISLLDLATQQVYQA
PDWYWGLSAQNPVM
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_9|585_bp
atgggagtgaagatgcttctggttcttcaaagtgagacaggaagaggaagaagaagaaga
ggaggaagaggaggaagaagaggaagaagaaggaggaggaggagggaagagaagaagaag
gaggacgaggaggaggaggaggcggtggcagtggcggtggaggaggaggagggggaggag
gagggggaggagggggaggaggagggggtggaggaggaagaagaaggaagaagaggaagg
aagaagaagaaagaagaagaaagaagaaggaagaagaagaagaaagaagaaggaagaaga
agaaagaagaagaagaacactgtttggtgggagctatattccaggctccatagctggtta
ctccaagtgaagaaaatgcggcttcaagaggttgtgaggcttacgtggcaggatgaggga
agtaaatcagggatttcttcttggtttggatgcattgctgggatgtggcttcctgagggc
caagctgtagtgattgttatctctcttctggatctagccacccagcaagtctaccaggct
ccggattggtactggggattgtctgcacagaatcctgtgatgtga
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_10|294_aa
MGRGGEEWREEEGRREEERRGGRERRGEKEEEGKGGERRGEKGRRGERRQKGKEGGGEED
RDSETNCSRQSPGLHEEALSLTLPGKRFSGIQAEKDLPLCRCMQTSEDDSVDWRLSPGPP
WEACTKDSRATQGKEPGTHLKRIKEGWRKTSEGNNSLNENTHLAESPEVLDQEIARWKQT
VQERKERKKSWKPLNAQIKQVMKDTMSQLKSMAENLLDTMQLGDFSRDHSDEDSKVQLKR
KAENEDCLPQPLPPNYQSEGHLKSVLGDGDLNVFCESQEKSRNSLTIVGKKINE
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_10|885_bp
atggggagaggaggggaggagtggagggaagaagaggggaggagggaagaggagaggagg
ggaggaagggagagaaggggagagaaggaggaggaggggaaaggaggggagaggagggga
gagaaggggagacgaggggaaaggaggcaaaaagggaaggagggaggaggagaggaagat
cgagattctgagactaattgctcccggcagagtccagggctgcacgaggaggctctgagc
ctgactctgccggggaagaggtttagtggaatccaggctgagaaggacctgcccctctgc
agatgcatgcagacttctgaagatgactcagtggattggagactgtccccagggccacct
tgggaggcatgtaccaaagatagcagggccacacaagggaaagagccagggacacatctt
aaaaggatcaaagaaggatggagaaaaacttcagaaggaaataacagtttgaatgaaaac
actcatttagcagaaagtccagaagtgctcgaccaagaaattgcaagatggaagcagaca
gtacaggaaaggaaggagagaaagaaaagctggaaacctttgaatgcacaaataaaacag
gtcatgaaagataccatgagccaattaaaatctatggctgaaaacctgctggacaccatg
cagttgggagatttctctagggaccacagtgacgaggactcgaaagtacagctaaagagg
aaagcagaaaatgaagactgcctgccccagcccctcccgcccaactatcagtcagaaggc
catctgaagagcgtcctgggtgatggtgacttaaatgtgttctgtgaaagtcaagaaaaa
tcaagaaatagcttgacaattgtgggaaaaaaaatcaatgaatga
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_11|192_aa
MWEWLTKSDETYTPPREKEWGAGKNKGKATGKTSSAGITRGDRNPPYSPIYPPLLRPAPE
ESNSNGNEPWALPQKEKSKPLPQTLAVTALLAQEANKLTLGQQLTIQLPHSVITLMNQRG
HHWLSNPGMTQYQGLLCKNPHIALETVNTLNPATLLLIELGTPLHDCVKMVDEVFLSRGD
LTDQPLRDPVVK
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_11|579_bp
atgtgggagtggctcacaaagtcggatgaaacctacaccccacccagagaaaaggaatgg
ggagcagggaaaaacaaagggaaagccacaggaaaaaccagttctgcaggaatcaccaga
ggagataggaatcctccctacagtccaatctacccgcctttactaaggccagcccccgag
gagtcaaattcaaatggtaacgaaccctgggccttgcctcaaaaagaaaaatcaaaacca
ctgccccagacactagctgtcactgccctactggcacaagaagctaacaaactgactcta
gggcagcagctgaccatccagttaccacattcagttataactctaatgaaccagagaggg
catcactggttatcaaatccaggaatgactcagtaccagggactcctatgcaaaaatccc
cacatagctttagaaacagtaaacacccttaacccagccaccttgctcctgattgaactg
ggaaccccactccatgactgtgtgaaaatggtagatgaagtattcttgagtaggggagac
cttacagaccaacccctcagggacccagttgttaagtag
>gi568815577f:38022332_38256399|GENSCAN_predicted_peptide_12|176_aa
MKEGKLRIKGSEYIPAAQRTFLLEFKTLQSPEPSANSVSAHNPLWLHSFTSHKDWMDKCV
CLVTAFSQHMHHVPVALPEMVRALSAQQQTLRQITICGDRQAKDTKVLVQCVHPVYIPNM
VLILADGDPSSFLSHWLPFLSTLRRQEDQATAYVCENQACSMLIMDPCELRKLLHP
>gi568815577f:38022332_38256399|GENSCAN_predicted_CDS_12|531_bp
atgaaagagggaaaactgagaattaaaggatctgagtatataccagctgctcagaggaca
ttcctcctagagtttaaaacattacagtctccagagcccagcgccaattccgtgtcagcc
cacaacccgctctggttgcacagcttcacgagtcacaaggactggatggacaagtgtgtg
tgcctagtgactgccttctcccagcacatgcaccatgtcccggtggcattgccggagatg
gtccgtgccctctcagcccagcagcagaccctcaggcagatcaccatctgtggagaccgc
caggccaaggacaccaaagttctggtgcagtgcgtccaccctgtctacattcctaacatg
gtgctgattctggctgatggggacccctcgagtttcctgtcccactggctgcctttcctg
agtaccctccgaaggcaggaagaccaggccactgcgtatgtgtgtgagaatcaagcctgc
tcaatgctcatcatggatccctgtgaattacgaaaactgctacatccgtga