GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:22:56 Sequence gi568815588f:58285548_58495071 : 209524 bp : 38.04% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2729 2856 128 1 2 82 103 99 0.505 9.36 1.02 Term + 7113 7179 67 0 1 61 50 39 0.063 -6.17 1.03 PlyA + 7264 7269 6 1.05 2.03 PlyA - 8141 8136 6 1.05 2.02 Term - 10918 10885 34 0 1 105 42 28 0.528 -4.02 2.01 Init - 12252 11372 881 0 2 42 39 423 0.420 26.79 2.00 Prom - 15633 15594 40 -5.35 3.04 PlyA - 15888 15883 6 1.05 3.03 Term - 20859 20709 151 2 1 93 42 51 0.323 -2.60 3.02 Intr - 21733 21639 95 1 2 78 89 54 0.424 2.34 3.01 Init - 24162 24115 48 0 0 98 46 77 0.638 5.50 3.00 Prom - 35688 35649 40 -3.15 4.04 PlyA - 35882 35877 6 1.05 4.03 Term - 41662 41154 509 2 2 59 47 258 0.454 12.38 4.02 Intr - 41922 41805 118 0 1 58 75 49 0.439 -0.28 4.01 Init - 42601 42518 84 1 0 31 53 112 0.548 2.87 4.00 Prom - 44713 44674 40 -4.85 5.00 Prom + 45504 45543 40 -9.85 5.01 Init + 49655 50293 639 1 0 71 42 275 0.016 14.73 5.02 Intr + 75791 75854 64 1 1 81 67 75 0.083 2.07 5.03 Intr + 82376 82469 94 0 1 54 106 47 0.080 1.20 5.04 Intr + 88056 88137 82 0 1 68 67 79 0.033 2.52 5.05 Intr + 99412 99598 187 0 1 54 60 154 0.009 7.54 5.06 Intr + 99924 100101 178 1 1 -22 96 194 0.013 7.26 5.07 Intr + 100673 100791 119 2 2 58 110 11 0.008 -0.51 5.08 Intr + 102643 102713 71 2 2 78 80 54 0.170 1.58 5.09 Intr + 103123 103272 150 0 0 82 87 54 0.215 4.14 5.10 Intr + 105218 105313 96 1 0 77 103 62 0.954 5.89 5.11 Intr + 108811 108867 57 0 0 89 81 49 0.836 2.46 5.12 Term + 109381 109527 147 0 0 54 42 187 0.924 7.62 5.13 PlyA + 110170 110175 6 1.05 6.03 PlyA - 110269 110264 6 1.05 6.02 Term - 126713 126484 230 0 2 47 49 182 0.698 6.01 6.01 Init - 128294 128240 55 2 1 68 98 20 0.420 2.51 6.00 Prom - 131966 131927 40 -5.35 7.03 PlyA - 132090 132085 6 1.05 7.02 Term - 133762 132923 840 1 0 75 39 200 0.858 5.74 7.01 Init - 135473 134982 492 2 0 66 54 256 0.635 15.20 7.00 Prom - 135897 135858 40 -6.15 8.04 PlyA - 136067 136062 6 1.05 8.03 Term - 137139 136846 294 0 0 -19 48 327 0.461 12.32 8.02 Intr - 140175 140141 35 1 2 77 76 22 0.383 -3.08 8.01 Init - 141217 140881 337 1 1 87 98 96 0.198 7.99 8.00 Prom - 143227 143188 40 -3.45 9.00 Prom + 151819 151858 40 -5.25 9.01 Init + 152728 152854 127 0 1 57 50 128 0.340 6.27 9.02 Intr + 162017 162253 237 0 0 46 62 170 0.003 6.86 9.03 Intr + 162666 162818 153 1 0 27 23 176 0.003 4.12 9.04 Term + 194717 194868 152 0 2 -2 42 249 0.339 8.39 9.05 PlyA + 195008 195013 6 1.05 10.00 Prom + 200817 200856 40 -5.65 10.01 Init + 208407 208580 174 2 0 48 70 179 0.572 11.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 49655 50302 648 1 0 71 37 294 0.962 16.67 S.002 Init - 65597 65516 82 2 1 64 115 20 0.938 3.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_1|64_aa IFNEDGSIKGANSENLVKGDIRFKLVMKNVLKSNVPEFWPKEWATRCPDICSDILLGVSV SGFG >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_1|195_bp atatttaatgaagatggatccattaaaggagccaatagtgaaaatttagttaaaggtgat attagatttaaattagtcatgaagaatgtcttaaaatctaatgttccagaattctggccc aaggaatgggctacaaggtgcccagatatttgttcagacattcttctgggtgtttctgtg agtggttttggatga >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_2|304_aa MIISIDAEKAFDKIQQPFMLKTLNKLGINGKYLKILRAIYEKPTANSILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLLADDMIVYLENPIVF AQNLLKLISNFSKVSGYKINVEKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKDLFKENYEALLNEIKEDTNKWKNIPSSWVGRINIVKMAILPKLIYRFNAILIKLPM TFFTELEKTTLKFIWNQKRAHIAKSILNQKNKAAGIMLPDFKLYYKATVTKTAWDSPNVL NNEQ >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_2|915_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgcta aaaactctcaataaattaggtattaatgggaagtatctcaaaatactaagagctatctat gagaagcccacagccaatagcatactgaatgggcaaaaactggaggcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaagaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgcttgcagatgacatgatagtatatctagaaaaccccattgtcttt gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtagaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctgttcaaggagaactacgaagcactgctcaatgaaataaaagag gatacaaacaaatggaagaacattccaagctcatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaagttaatttatagattcaatgccatcctcatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cacatcgccaagtcaatcctaaaccaaaagaacaaagctgcaggcatcatgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatgggattcccccaatgtgctc aataatgagcagtga >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_3|97_aa MAEGKEGADISHDTAGGAGVQCPSMTTSPQGKPQIVFQKMCRGTAMPKNLENIPVNAQVK SLAVILFSHTTFNPSANLAHSTFKIYPESDQLSSTPS >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_3|294_bp atggcggaaggcaaagagggagcagacatctcacacgacacagcgggaggtgctggagtc cagtgtccctccatgacaacttctccccagggtaaacctcaaatcgtctttcagaaaatg tgcagagggacagccatgcccaaaaatctggagaacatcccagtaaatgctcaggtcaaa agccttgcagtaatccttttctcacacactacatttaatccatcagcaaatctggcccat tctaccttcaaaatatatcctgaatctgaccaactttctagcactccatcctag >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_4|236_aa MHLNHPETTTPSSVENLSSMKLVPGAEKGCCFRHSIRPNKKGKKISKSPSPSICKVEILH PGCASGQRLTRHLPRSGRPGKERGRRRSRDPYDRHPQQCEAWASCHPAAGLQQPSSPTPG PAYGRGHRDLAAVRSSDLRSLCRIRSRQQELSAHGIQSGSRVTMGCGGGGGGRWSRERWS RERGADLEKSLERITGQVEEADEELPGCAADQGRSGGRVKATVGNRAIYEDVEGRS >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_4|711_bp atgcacttgaatcatcccgaaaccaccaccccatcatccgtggaaaatttgtcttccatg aaactggtccctggtgccgaaaagggatgctgtttccgtcattccatccgaccaaataag aaaggaaagaagatctcaaaatctcccagccccagcatctgtaaagtagaaattctccac cctggctgtgcctcaggacagcggctgactcgtcacctgccccgctccgggaggccaggg aaggagcgaggacgccgcaggagccgggatccatatgaccgtcacccacagcaatgcgaa gcctgggcctcctgtcaccccgcagcagggctgcagcaacccagttctcccacacccggc ccagcttacggaagaggccacagggatcttgcagccgtcagaagctcagatttaagaagt ctctgcaggatacggagcaggcagcaggagttgtcagcgcacggaatacagagtggttcc cgggtcacgatggggtgtggaggcgggggtggggggagatggtccagagaaagatggtcc agagaaagaggggcagatttggagaagtctttggagaggatcactggccaagtggaagag gcggatgaagagctccctgggtgtgctgctgaccaaggcagatctggaggcagagtcaaa gcaacagtcggcaatcgggcaatttatgaagacgttgaaggaaggagttga >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_5|627_aa MALKRIQKVSAGAGPGAAGQRPPAHADSASGPERHAGRARDGRVGRGAGRELKARAGPAG IGQVRWEQGQVPGGRRSLKAGGPRAFAPEPLPEPQGREPLARPDRPEICASRSRPGFALH GASFDPCHAVFSPGCFLSLSSFMFLLTRPPREGKEGGVGLQPFSAELSGWFFLSSAVGSC SVARMGWKAHLRPRSQEKAPAPSILVPLLLPLQELSDLQRDPPAHCSAGPVGDDFLLSIC SLLCDPNPDDPLVPDIAQIYKSDKEKFLAQDDSDGLRLEAFGGFSTHTLAPGQVCESPPL PLLFLNSELQPWLELRRSAGRAAAADRTSGSWMQDCLLRTALVTVTGGHRPTPGWVVIAG VVYCQEALRDWGRVTASSTGAMAFLRSMWGVLSALGRSGAELCTGCGSRLRSPFSFVYLP RWFSSVLASCPKKPVSSYLRFSKEQLPIFKAQNPDAKTTELIRRIAQRWRELPDSKKKIY QDAYRAEWQVYKEEISRFKEQLTPSQIMSLEKEIMDKHLKRKAMTKKKELTLLGKPKRPR SAYNVYVAERFQEAKGDSPQEKLKTVKENWKNLSDSEKELYIQHAKEDETRYHNEMKSWE EQMIEVGRKDLLRRTIKKQRKYGAEEC >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_5|1884_bp atggcgctgaagaggattcagaaagtgagtgccggggccgggcctggggctgcggggcag cggccccctgcccacgctgactcggccagtggtcccgagagacatgcggggagagcaagg gatggaagggtgggccggggtgcgggcagggagctgaaggctcgggccgggccagcgggc atcggacaggtgaggtgggagcagggtcaggtcccaggcggccggaggagcttgaaggct gggggcccgcgcgcgttcgcgccggagccgctcccggagccccagggtagagagcctctt gcgcgtccagaccgtcccgaaatctgtgcttcacggtcacggccgggttttgccctccat ggggctagttttgacccctgtcatgctgtgttttctccaggctgtttcttaagcctgtcc tcttttatgttcttgcttacccggcctccccgggagggcaaggaaggtggggtgggcctg cagcctttctccgcggagctgtcgggttggttctttctctcctctgcggttggctcttgc tcagttgcccgaatgggctggaaagcccatttgcggccaagaagtcaggagaaggctcct gccccttctattttggtgcctctgctgctgccacttcaggaattgagtgatctacagcgc gatccacctgctcactgttcagctggacctgtgggagatgactttttattgtccatatgt tctctactttgtgatcctaatccagatgaccccttagtaccagatattgcacaaatctat aaatcagacaaagaaaagttcttggctcaggatgactcagatggcttgaggctggaagca tttggaggtttctccactcatacgttggcaccaggacaggtttgcgaatccccgcctctc ccgttactatttctgaactccgagctccagccctggcttgaactgagacgctccgctggg cgcgcagcagccgccgatcggacctcggggtcctggatgcaggactgtctgttacgtaca gcccttgtgaccgtcacgggcggacaccggccaacgccgggttgggttgtgattgctgga gttgtgtattgccaggaggctctccgagattggggtcgggtcactgcctcatccaccgga gcgatggcgtttctccgaagcatgtggggcgtgctgagtgccctgggaaggtctggagca gagctgtgcaccggctgtggaagtcgactgcgctcccccttcagttttgtgtatttaccg aggtggttttcatctgtcttggcaagttgtccaaagaaacctgtaagttcttaccttcga ttttctaaagaacaactacccatatttaaagctcagaacccagatgcaaaaactacagaa ctaattagaagaattgcccagcgttggagggaacttcctgattcaaagaaaaaaatatat caagatgcttatagggcggagtggcaggtatataaagaagagataagcagatttaaagaa cagctaactccaagtcagattatgtctttggaaaaagaaatcatggacaaacatttaaaa aggaaagctatgacaaaaaaaaaagagttaacactgcttggaaaaccaaaaagacctcgt tcagcttataacgtttatgtagctgaaagattccaagaagctaagggtgattcaccgcag gaaaagctgaagactgtaaaggaaaactggaaaaatctgtctgactctgaaaaggaatta tatattcagcatgctaaagaggacgaaactcgttatcataatgaaatgaagtcttgggaa gaacaaatgattgaagttggacgaaaggatcttctacgtcgcacaataaagaaacaacga aaatatggtgctgaggagtgttaa >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_6|94_aa MVILWPIGTVAWDLWMEEGVFIQVQNDMYTIICLALEENLGNTTQDIGMGKDFMTKTPKA MATKAKIDKWDLIKLKSFCTAKETSIRVNRQSTE >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_6|285_bp atggtcattttgtggccaattggaactgtggcttgggatctttggatggaggaaggtgtg ttcatacaggtgcaaaatgacatgtatacgattatttgccttgccttagaagaaaaccta ggcaataccactcaggacataggcatgggcaaagacttcatgactaaaacaccaaaagca atggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcaca gcaaaagaaactagcatcagagtgaacaggcagtctacggaatag >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_7|443_aa MKAEIKMFFETNENKDTTYQNLWDTFKAVCRRKFIALNDHKRKQERSKMDTLTSQLKELE KQEQTHSKASRRQEITKIRAELMEIETQKTLQKINESRSWFSEKINKMERPLARLIKKKR EKNQIDAIKNDKGDITTDPTEIQTIIREYYKQLYANKLENLEEMNWKKTTLNFIWNQKRA RIAKTILSQKNKTGGITLSDFKLYYKATVTKTAWYWYQNRDIDQWNRIEPLEIIPHVYNH LIFDKPDTNKQWGKESLFIKWCCENWLAICRKLKLDPFPTLCTKINSRWIKDLNVRPKTI KTLEENLGNTIQDIGMGKDFMTKPPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPT EWEKIFTMYPSDEGLISRIYKELKQIYKKKIKQPHQKVGQGCEQTLLKRRHLCSQQTHEK MLIIIGHQRNANQNHNEIPSHTS >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_7|1332_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacacatttaaagcagtgtgtagaaggaaatttatagcactaaatgaccac aagagaaagcaggaaagatctaaaatggacaccctaacatcacaattaaaagaactagag aaacaagagcaaacacattcaaaagctagtagaaggcaagaaataactaagatcagagca gaactgatggagatagagacacaaaaaacccttcaaaaaatcaatgaatccaggagctgg ttttctgaaaagatcaacaaaatggagagaccactagcaagactaataaagaagaaaaga gagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccactgatcccaca gaaatacaaactatcatcagagaatactataaacaactctatgcaaataaactggaaaat ctagaagaaatgaattggaaaaaaaccactttaaatttcatatggaaccaaaaaagagct cgcatagccaagacaatcctaagccaaaagaacaaaactggaggcatcacgttatctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagaccaatggaacagaatagagccattggaaataataccacacgtctacaaccat ctgatctttgacaaacctgacacaaacaagcaatggggaaaggagtccctatttattaaa tggtgctgcgaaaactggctagccatatgtagaaagctgaaactggatcccttccctaca ctttgtacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgactaaaccaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagccaaagaaacaaccatcagagtgaacaggcaacctaca gaatgggagaaaatttttacaatgtacccatctgacgaagggctaatatccagaatctac aaagaacttaaacaaatttacaagaaaaaaatcaaacaaccccatcaaaaagtgggccaa ggatgtgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaaaa atgctcatcatcattggccatcagagaaatgcaaatcaaaaccacaatgagataccatct cacaccagttag >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_8|221_aa MGKMSPGHVRDFHGRPSHHRPRGPGGKSGFVGKVQGPHAVCSLGTWCLVSQPLHPWLKGA NVQLRLWLQKVEAPSLGSFHVVLSLWVHRSQELKFGNLCLDFRRFMETPGCPEFPCVVRE IREELKEDVRTHHKGDKNLEKRLDEWLTRINSVEKSLNDLMELKTMAQELHDACTSFSSQ FDQVEERVSVIEDQMNEMKQEEKFREKRVKRNKAAKKYGTM >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_8|666_bp atggggaaaatgtctccaggccatgtcagagactttcacggcagaccctcccatcacagg cccagaggcccaggaggaaaaagtggttttgtgggcaaggtccagggtccccatgctgtg tgcagcctagggacttggtgccttgtgtcccagccactccacccatggctgaaaggggcc aatgtacagctcaggctgtggcttcagaaggtggaagccccaagccttggcagcttccac gtggtgttgagcctatgggtgcacagaagtcaagaactgaagtttgggaacctttgccta gatttcagaagatttatggaaacacctggatgcccagaattcccatgtgttgtgcgagag atccgggaggagctaaaggaggatgttcgaacccatcacaaaggagataaaaaccttgaa aaaagattagatgaatggctaactagaataaacagtgtagagaagtccttaaatgacctg atggagctgaaaaccatggcacaagaactacatgatgcatgcacaagcttcagtagccaa tttgatcaagtggaagaaagggtgtcagtgattgaagatcaaatgaatgaaatgaagcaa gaagagaagtttagagaaaaaagagtaaaaagaaacaaagccgccaagaaatatgggact atgtga >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_9|222_aa MNVCKDSDDGEMHGAIEGIMEEYDQKRGQGRFLISEYQLSNSGVKLQTFTVSATAHKGSV DPKIYGKEQKNKASPAQQETPAACHCWLGQPAFISLSGPTRILLIGPFYRELIGLFHREL IGAELLAGRAARQSRAVRQHSSVLGRSMGPGAAEQGAALLGEAPAVQEPTAGAKSKREAR DDDSGGGGSDDDDDDEDDNDDGEETEGEEKEENPVGHLWWSC >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_9|669_bp atgaatgtgtgcaaagacagtgatgatggagaaatgcatggtgctattgaaggcattatg gaagaatatgaccaaaaaagaggacagggtcgctttcttatcagtgaatatcagctgagc aattcaggagtgaagctgcagaccttcacggtgagtgctacagctcataaaggtagtgtg gacccaaagatttacggcaaagagcaaaaaaacaaagcttccccagcgcagcaagagacc ccagcagcttgccactgttggctggggcagcctgcttttatttccttatctggccccacc cgcatcctgctgattggtccattttacagagagctgattggtctgtttcatagagagctg attggggccgagctgctggctggcagagctgcccgccagtcccgcgctgtgcgccagcac tcctcagtccttgggcggtcgatgggacccggcgctgcggagcagggggcggcactcctt ggggaggctcctgctgtgcaggagcccacagcgggggccaagtcaaagagggaagctaga gatgatgatagtggtggtggtggtagtgatgatgatgatgatgatgaagatgataatgat gatggagaggagacagagggggaagagaaggaggagaatcccgtagggcatctgtggtgg tcgtgttag >gi568815588f:58285548_58495071|GENSCAN_predicted_peptide_10|58_aa MQDETASADVEAAASDPEDPAKIIGECDYTKEIFNVDKTVFYWKYMPSMTFLAREEKL >gi568815588f:58285548_58495071|GENSCAN_predicted_CDS_10|174_bp atgcaagatgaaacagcaagtgctgatgtagaagctgcagccagtgatccagaagatcca gctaaaatcattggtgaatgtgactatactaaagagattttcaatgtagataaaacagtc ttctattggaagtatatgccatctatgactttcctagctagagaagagaagttg