GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:08:56 Sequence gi568815591f:26260973_26472569 : 211597 bp : 42.32% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15397 15444 48 0 0 88 83 53 0.039 5.80 1.02 Term + 30302 30370 69 1 0 31 48 164 0.588 3.66 1.03 PlyA + 32593 32598 6 1.05 2.00 Prom + 38789 38828 40 -3.55 2.01 Init + 45765 46136 372 2 0 81 81 71 0.222 2.73 2.02 Intr + 47512 47774 263 0 2 43 81 96 0.466 -0.24 2.03 Intr + 48507 48641 135 0 0 74 50 118 0.368 5.36 2.04 Intr + 69238 69346 109 1 1 39 73 90 0.002 1.97 2.05 Term + 76656 76823 168 2 0 78 54 57 0.025 -1.80 2.06 PlyA + 80333 80338 6 1.05 3.05 PlyA - 81555 81550 6 1.05 3.04 Term - 84640 84266 375 1 0 48 40 314 0.651 16.35 3.03 Intr - 85284 84987 298 2 1 114 27 267 0.861 19.15 3.02 Intr - 85355 85327 29 2 2 65 58 1 0.485 -9.20 3.01 Init - 85671 85630 42 0 0 84 92 53 0.374 5.79 3.00 Prom - 87498 87459 40 -4.15 4.00 Prom + 87995 88034 40 -6.45 4.01 Init + 99991 100089 99 0 0 34 103 60 0.213 2.41 4.02 Intr + 103563 103663 101 2 2 101 78 4 0.659 -1.31 4.03 Intr + 104075 104173 99 2 0 95 87 101 0.893 9.01 4.04 Intr + 110849 111061 213 2 0 73 63 180 0.899 10.91 4.05 Term + 111519 111600 82 0 1 121 48 40 0.951 -0.51 4.06 PlyA + 111884 111889 6 -0.45 5.08 PlyA - 111905 111900 6 1.05 5.07 Term - 112324 112295 30 1 0 69 39 33 0.017 -6.62 5.06 Intr - 115366 115115 252 1 0 70 94 246 0.723 20.11 5.05 Intr - 116404 116162 243 1 0 20 46 193 0.636 5.07 5.04 Intr - 121306 121201 106 0 1 41 51 27 0.054 -6.40 5.03 Intr - 122603 122403 201 1 0 70 62 152 0.089 8.28 5.02 Intr - 138334 138204 131 1 2 96 47 90 0.307 4.27 5.01 Init - 139483 139169 315 1 0 109 76 109 0.822 9.20 5.00 Prom - 143319 143280 40 -10.25 6.04 PlyA - 143515 143510 6 1.05 6.03 Term - 143762 143522 241 1 1 78 37 163 0.112 4.71 6.02 Intr - 152848 152705 144 0 0 46 97 101 0.614 5.28 6.01 Init - 153195 153122 74 0 2 72 49 51 0.438 0.19 6.00 Prom - 163676 163637 40 -3.45 7.03 PlyA - 164225 164220 6 1.05 7.02 Term - 165011 164766 246 2 0 76 43 159 0.741 5.01 7.01 Init - 168910 168809 102 1 0 75 58 83 0.551 4.29 7.00 Prom - 173592 173553 40 -6.05 8.00 Prom + 174713 174752 40 -5.05 8.01 Init + 174843 174960 118 2 1 57 46 86 0.694 1.71 8.02 Intr + 180625 180861 237 2 0 125 85 141 0.661 14.16 8.03 Intr + 186910 187046 137 2 2 14 92 119 0.390 4.27 8.04 Term + 200467 200634 168 0 0 -2 47 227 0.247 6.50 8.05 PlyA + 201486 201491 6 1.05 9.00 Prom + 204645 204684 40 -7.95 9.01 Init + 206040 206374 335 2 2 88 96 61 0.660 3.71 9.02 Intr + 207035 207110 76 2 1 79 100 56 0.960 4.50 9.03 Intr + 207293 207421 129 1 0 108 45 83 0.805 5.97 9.04 Intr + 207753 207873 121 2 1 45 68 99 0.366 2.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 145029 145466 438 2 0 62 43 197 0.849 8.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_1|38_aa MGMATVKKDKPELRKKVGLEWSLEDDQRQAVSNGKIHA >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_1|117_bp atggggatggccacagtcaagaaagacaaaccagagctcagaaaaaaggtgggcctggaa tggtccctggaggatgatcagcgacaagctgtatctaatgggaaaattcatgcttga >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_2|348_aa MRKAPQHFSYWGSWESLRMNSDCSWHSRSQAQASMLAHLIFTVDHEAGSYLLLQMRTLKH RDEVIWERAECEFTTRWSGVKCLTLLLPLTVLILLTLGGHSPLNCSSAVDLHDRGTVKER GLFPSSLFNEVCVKGPRGQGLEGFWKAEHVEVPGGWCTWGGMGAPYPYLVLCVSICVLCT ILYNKPVISFVSRSSKLIKPKEEVMGSPVYSQWPFIASDPNQVCRVNALHFSQYYLFSED KYYVFSGIRLLASPGFMRSEKVGKSVWRGRQSHRHEEQEAGEKQVLLGNEGEEHVAEFPS LLRLNGTHCMERPHFAHPLHFIDGHLGYFHLLAIVNNTAVDVDPQIAL >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_2|1047_bp atgaggaaggcacctcagcatttttcatattgggggtcctgggaaagtctgagaatgaac agtgactgcagctggcacagtcggagccaggcacaggcctccatgcttgctcatttgatc tttacagtagatcatgaagcaggcagttatctcctcttacagatgagaacactgaagcac agagatgaagtaatttgggaaagggcagaatgtgaattcacaaccaggtggtctggagtc aaatgcctcacactacttttaccactcactgtgctcatcctgctcactttaggggggcat tcccctctaaactgctcctcagcagtggatcttcatgacagaggcaccgtgaaggaaaga gggttatttccatcatccctgttcaatgaagtctgtgtaaaaggcccaagaggacagggt ttggagggcttctggaaggctgaacacgtggaggttcctggggggtggtgcacctgggga ggtatgggagctccatacccataccttgtcctgtgcgtctccatctgtgtcctttgtact atcctttataataaaccagtaataagttttgtgagccgctctagcaaattaatcaaaccc aaggaggaggtcatgggatccccagtttatagccagtggcccttcatagctagtgaccct aaccaggtctgccgagtgaatgcactgcacttttctcagtattacctcttttcggaagat aagtattacgtcttttcaggtatccggctgttggcttctcccgggttcatgaggtcagaa aaagtggggaagtcagtgtggaggggtcggcaaagccacaggcatgaggagcaggaagct ggagagaagcaggtcttgctaggaaatgagggggaggagcatgtggcagaatttccttcc cttttaaggttgaatggtacccattgtatggagagaccacattttgctcatccattacat ttcattgatggacacttaggttacttccacctcttggctattgtgaataatactgctgtg gatgtggacccacaaatagctctttga >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_3|247_aa MKAMQAPMVGVEEQSPPRPTVPDMTKVNISPHQGLGVWTPKSPRQLDDDRPSNRKASQSG WGWSLSELARKLEMPGSRSHVRGDAGWKRRAERSTTRGSPDAFTGNSNLLGSGPSCDTYI TPESLYTHVTGTIGNTGKQEHRLRFQGRVEVQTRDPTGPLEGASELHPSSSRKLSKRKGV PLPWSSQRTHGEGMGEQGIAVFKEKKQDLQGSGGATTTPTEPGEKRALWSRDRDQARSAG GQGSQRV >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_3|744_bp atgaaggctatgcaggcaccaatggtgggagtagaggagcagagccccccccgccccaca gtccctgacatgaccaaggtgaacatcagtccacaccaggggttaggagtctggacccca aagagtccaaggcagctggatgatgacagaccttcaaaccgcaaggccagccagtcaggc tggggctggagccttagtgaattggcgaggaagctggagatgccggggagcagaagccat gtgcggggggatgctggctggaaaagacgcgcagagagatccaccaccagaggcagccct gacgcttttaccggaaactcaaaccttctgggatcaggcccttcctgtgacacttatatt actccagagagtctctatactcatgtcactggcaccatagggaacacagggaagcaggaa cacaggctccgcttccagggccgtgtggaggtgcagacgagagaccccacagggcctctg gaaggggcaagcgagctgcacccgagcagcagcaggaagctttccaagaggaagggagtc cctctcccctggagcagtcagaggacccatggagagggaatgggagagcaggggatcgca gtgttcaaggaaaaaaagcaggacctccaggggagcggaggagcaacaacaacacccaca gagcctggtgaaaagcgtgccctatggtccagagatcgtgaccaggccagatcagctggg ggacagggttctcagagggtgtag >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_4|197_aa MPLQEFVSVWVRDPRIQKEDFWHSYIDYEICIHTNSMCFTMKTSCVRRRYREFVWLRQRL QSNALLVQLPELPSKNLFFNMNNRQHVDQRRQGLEDFLRKVLQNALLLSDSSLHLFLQSH LNSEDIEACVSGQTKYSVEEAIHKFALMNRRFPEEDEEGKKENDIDYDSESSSSGLGHSS DDSSSHGCKVNTAPQES >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_4|594_bp atgccattacaggaatttgtaagtgtctgggttcgagatcctaggattcagaaggaggac ttctggcattcttacattgactatgagatatgtattcatactaatagcatgtgttttaca atgaaaacatcctgtgtacgaagaagatatagagaattcgtgtggctgaggcagagactc caaagtaatgcgttgctggtacaactgccagaacttccatctaaaaacctgtttttcaac atgaacaatcgccagcacgtggatcagcgtcgccagggtctggaagatttcctcagaaaa gtcctacagaatgcacttttgctttcagatagcagccttcacctcttcttacagagccat ctgaattcagaagacattgaggcgtgtgtttctgggcagactaagtactctgtggaagaa gcaattcacaagtttgccttaatgaatagacgtttccctgaagaagatgaagaaggaaaa aaagaaaatgatatagattatgattcagaaagttcatcctctgggcttggacacagtagt gatgacagcagttcacatggatgtaaagtaaatacagctccgcaggaatcctga >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_5|425_aa MVGLNWGSSGSAIRTMSTPISVKLYKTLPLSPDHCHDRPKQLQSIVSMCIWGVFNTRIIG ELHHKSPVGAVAVCGINAHSGTSPRPDHLQRIVSMHTPGAPTPKPGCNCRTAFSRQPDRI AVCWVSAIIKNDHKYAFGKRNENSSRPKRKRADWSNQGKHVLRMAEPLLQSRSLGGPLEQ GLLTCFGELPEREIDFSLIGAIAIWGVFVTATVLSRLVPKLWLQKLEFEWGGAERARDCR VRFTLRATGRTGFCEYTSDQMMLLSTTDTYVGNRFFLLGSPNTTPLRGQLARPNQGRVLQ WAAAASALARQRPLLSSGGLAPDVSQAARFFRLPWVTRAVAMGYRTEAGWLATCSPVAAA SGAARAGRGGSARVRDAARGMRAGRGGAATNRRPRVQPRERHRQWGGGQSYKEGPEKPFV EADFG >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_5|1278_bp atggtgggacttaactggggcagttccggttctgccattaggaccatgagcactccaatc tcagtgaaattgtataagacactgccattatccccggatcattgccacgatagacccaaa caactccagtctatcgtgtcaatgtgtatttggggagtctttaacacacggatcattggc gagctgcaccataaaagccctgtaggagccgtggccgtgtgtggaatcaatgcacactcg gggacttccccacgcccggatcatttgcagcgcatcgtgtcaatgcatactccgggagcc cctactccgaagcctggctgcaactgccgaacagccttttctcggcagcctgacagaata gctgtgtgttgggtttctgcaatcataaaaaatgaccacaaatatgcattcggaaagagg aacgaaaactcctctaggcccaaacgaaagagggctgactggagcaatcagggaaagcac gtgttgaggatggcagagccactcctccagtccaggtccctgggcggccccctggagcag ggcttgcttacctgctttggagagttacctgaaagagaaattgatttctccctcattgga gccattgccatctggggtgtctttgttacagctacagtgttgtccagactagttcctaag ctatggctgcaaaagttggaattcgagtggggaggagccgagagggccagggactgccgt gtcagatttactctgagagctactgggagaacagggttctgcgaatacacgagcgaccaa atgatgctcttaagtacaaccgacacctacgtagggaacaggttctttctacttgggagc cctaacaccactccactacgcgggcagctcgccagacccaaccaagggcgcgttctgcag tgggcggctgcagcctccgcactggcccggcagcgcccccttctgtcctccggcggcctg gcacctgatgtgtcacaggccgcgcgctttttccggttgccatgggttaccagggcggtt gccatgggttaccggaccgaggccggctggctcgctacctgcagcccagtcgccgccgcc agcggagcggcccgggcgggacgcggcgggagcgcgcgtgtgcgggacgcagcgcggggg atgcgcgcgggccgcggaggcgccgcaaccaacaggcggccgagggtgcagccgcgggag cgccaccgccagtggggtggggggcaaagctataaagaaggcccagagaaaccatttgtt gaagctgattttggataa >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_6|152_aa MRFKEPLFYGPPIISPGQSDDSHRGEVGVTVPILQMRTLGLRGHKQQSQDKNPDPQTPAL TLFLLSPSIKPARLVYIPLSLLPGAPGTKNNMSAILNPRLYTEPGTWYTGSGQLSKQTTP TIKTTLNQAAFYNSVEAKAPAFKGLIQWRSCL >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_6|459_bp atgagattcaaggaacccttattttacggtcccccaatcatctctcctggacaaagtgat gattcccatcgtggtgaggttggtgttacggtccccattttacagatgaggacactgggg ctgaggggccataagcagcagagccaggataagaatccagatcctcaaacaccagctctg accctctttctgctctccccctccatcaagcctgcaagactggtttacatcccactgtct ctcctacccggagctcctggaaccaagaataacatgtcagccatcttgaatccccggctt tacacagagccaggcacctggtacacaggaagtggccaattaagcaaacaaacaacacca acaataaaaacaactctcaaccaggccgcattctacaattctgtggaggccaaagctcct gctttcaaggggcttattcagtggaggagttgcttataa >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_7|115_aa MYNMRINIVDPVEQNGGNSPKKQGRKGSPNKTTHSDGSFTKATQLPQESETKDSSSMSRE LEHVVERTVDLKPEDNLGLVMSFSSWGHWAKHMHALNSNSLSFYKMEGDSDVCPH >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_7|348_bp atgtacaatatgagaattaacattgtcgatcctgtagaacaaaatggtggcaactcacca aaaaagcagggaagaaaaggctcccccaacaaaacaacacatagtgatggatcttttaca aaagccacacaattaccccaggaaagtgaaacaaaggattcatcctcaatgagcagagag ttggagcatgtggtggaaagaaccgtggatttgaagccagaagacaacttaggcctggtt atgtcatttagcagctggggacactgggcaaagcatatgcatgctctgaacagcaactcc ttatctttctataaaatggagggagatagtgatgtctgccctcactaa >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_8|219_aa MEKATREELSPPANSHVSDFDCNLMRNLEPALEFLTDRNYSKVFPNSTKTVNGDHLNLES GCGQSCVRDLWDAPAPPVGSVAVVIPNAQAQAWAQSLLSRFMGQIPFLAAASQKPCLPVR LPEAILNDFPQFPLALVLRVLPLGLPQHMVLKQKPMGLEYAEEQLGQFLPVVTLLTSAGC RAFDLELNASPGQWEHSKLVPAKEASHRKAAAASPQRIP >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_8|660_bp atggaaaaggctacacgagaggaactgagccctcctgctaacagccatgtgagtgatttt gattgcaacctcatgagaaaccttgagccagctcttgaattcttgaccgacagaaactat tccaaagtcttccccaattctacaaaaacagtaaatggcgatcatctgaatttggaatct ggctgtggacagtcttgtgtgagggacctctgggacgcgcctgctcccccggtggggagt gtggcagttgtgataccaaatgcacaggctcaagcctgggcgcagtcactcctttccaga ttcatgggccaaatacctttcctggctgctgcctcacagaagccctgtcttcctgtcagg cttcccgaggccatcctgaatgacttcccgcagttcccccttgcccttgttctgagagtc ctgcccttgggcttgccccaacacatggtgctcaaacagaaaccaatgggactggagtat gcagaggagcagctgggccaattcttaccagttgtcaccctgctgacaagtgcagggtgc cgagcctttgacttagagctgaatgccagccctggacagtgggaacacagcaagctggtg cctgctaaggaggcttcccacagaaaagctgctgctgcttcacctcagcgtataccttga >gi568815591f:26260973_26472569|GENSCAN_predicted_peptide_9|221_aa MGSSSVDTIEEGCCKEEREMGQELKNAIKQRLFFFFKTREIAAGLYADGIDMTEEQTMKN VCFTGQGVGEEVLKQHPGGGEGEALVHRRGSAWIAVGDLLAESAGGRADRHRLDEREVKG SSPPSTGHLQIFDKLTKLFKHWMDLSGIGGAAREAGGVCISSVVVIGEPENLQLPEAGRE VCTDEAPPSVKNWAVTPVFMIAEIRGLSPCGPQRKPTSMPX >gi568815591f:26260973_26472569|GENSCAN_predicted_CDS_9|663_bp atggggtcttccagtgtagacaccattgaagaagggtgctgcaaagaggagagggaaatg gggcaggagttgaaaaatgcaattaagcaaagacttttttttttttttaagacgagagaa attgctgcaggtttgtatgctgatgggattgatatgacagaggagcaaacaatgaagaat gtatgcttcacaggtcagggggtaggggaagaagttctgaaacagcatcctgggggcggt gagggggaagccctggtgcacaggaggggctcggcctggatagcagtgggagatctgctg gcagagtcggcgggaggcagggcggacaggcacagattggatgagcgtgaggtcaagggc agctctcctcctagcactgggcatctacaaatatttgacaaattgaccaagctctttaaa cactggatggatttaagcggaatcggaggggctgcaagggaggccgggggtgtttgtatt tcctctgtggtagttatcggtgaaccagagaatcttcagcttcccgaagctggcagagag gtgtgcactgatgaagcacccccctcagtgaaaaactgggctgtgactccagtcttcatg atagcagagatacggggactgtcgccttgtggtccccagcgtaagcccaccagcatgcca cnn