GENSCAN 1.0 Date run: 10-Mar-117 Time: 10:43:25 Sequence gi568815595f:122154190_122385188 : 230999 bp : 40.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 1132 1707 576 0 0 10 42 212 0.513 4.72 1.02 PlyA + 1718 1723 6 1.05 2.05 PlyA - 3482 3477 6 1.05 2.04 Term - 15369 15223 147 0 0 105 40 82 0.335 2.02 2.03 Intr - 18328 18215 114 1 0 54 58 123 0.352 5.62 2.02 Intr - 30381 30220 162 0 0 89 42 135 0.470 8.25 2.01 Init - 47298 47209 90 0 0 83 59 53 0.161 2.44 2.00 Prom - 54345 54306 40 -5.65 3.00 Prom + 64871 64910 40 -2.15 3.01 Sngl + 69100 69303 204 0 0 61 47 173 0.762 5.44 3.02 PlyA + 69381 69386 6 1.05 4.00 Prom + 70910 70949 40 -4.75 4.01 Init + 73147 73231 85 0 1 88 48 80 0.373 5.16 4.02 Term + 73389 73555 167 1 2 17 49 230 0.848 9.10 4.03 PlyA + 73718 73723 6 1.05 5.04 PlyA - 74136 74131 6 1.05 5.03 Term - 76222 76013 210 1 0 23 44 223 0.786 7.71 5.02 Intr - 76590 76426 165 0 0 57 94 127 0.035 9.44 5.01 Init - 85033 84881 153 1 0 11 76 121 0.009 3.52 5.00 Prom - 94629 94590 40 -7.25 6.00 Prom + 99604 99643 40 -2.35 6.01 Init + 100001 100185 185 1 2 90 110 118 0.998 10.90 6.02 Intr + 102892 103198 307 1 1 123 78 215 0.993 19.53 6.03 Intr + 107339 108223 885 1 0 125 96 887 0.847 83.78 6.04 Intr + 121623 121853 231 2 0 109 92 297 0.978 29.25 6.05 Intr + 127924 128047 124 0 1 131 110 86 0.998 14.34 6.06 Term + 129498 131002 1505 1 2 63 38 1823 0.996 164.49 6.07 PlyA + 131183 131188 6 -0.45 7.00 Prom + 131769 131808 40 -5.95 7.01 Init + 132771 132819 49 2 1 93 61 117 0.558 10.66 7.02 Intr + 138738 138882 145 1 1 88 36 151 0.920 8.32 7.03 Intr + 142051 142219 169 1 1 66 24 128 0.033 3.23 7.04 Intr + 157461 157491 31 2 1 88 83 35 0.027 -0.21 7.05 Term + 161054 161322 269 0 2 10 54 240 0.102 7.47 7.06 PlyA + 161335 161340 6 1.05 8.00 Prom + 161427 161466 40 -9.35 8.01 Init + 163618 163805 188 0 2 55 8 211 0.477 8.38 8.02 Intr + 164141 164349 209 2 2 26 69 231 0.103 12.80 8.03 Intr + 171072 171169 98 1 2 66 113 75 0.020 6.61 8.04 Intr + 183358 183459 102 0 0 109 72 116 0.997 11.55 8.05 Term + 187250 187378 129 1 0 109 43 132 0.996 8.00 8.06 PlyA + 187454 187459 6 1.05 9.06 PlyA - 187509 187504 6 1.05 9.05 Term - 205730 205680 51 2 0 93 43 89 0.847 1.35 9.04 Intr - 208838 208779 60 2 0 116 70 37 0.639 2.71 9.03 Intr - 214133 213987 147 2 0 84 115 57 0.982 7.51 9.02 Intr - 217611 217486 126 0 0 17 84 97 0.662 2.16 9.01 Init - 230078 230022 57 2 0 76 93 53 0.116 5.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 77095 77037 59 1 2 62 115 70 0.878 7.93 S.002 Init + 181148 181174 27 1 0 77 115 -6 0.887 0.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_1|191_aa MKASTIDQAEQLDQAEERILEIEGRSFEITQSDENKGKRIKKNEQSLCDIWDTIKGPNIW IFCVLESKEKKTKELEKLFNKIIDENFASLARDLDIQIQEAQRSSNRYNSRRSSPQHIIV KLSKPKTKRRILKSAREKHLGIYKGILIRVTADFSAEILQSRREWNNISKVLKEREKNCG HEYYTQQSYPS >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_1|576_bp atgaaagcttcaacaatagatcaagcagaacaactagatcaagcagaagaaagaatctta gaaattgaaggtaggtcttttgaaataactcagtcagatgaaaataaaggaaaaagaata aaaaagaatgagcaaagcctatgtgacatatgggacaccataaagggaccaaatatttgg attttttgtgtcctagaaagcaaagaaaaaaaaacaaaagaattagaaaagctatttaat aaaataatagatgaaaactttgcaagtctagcaagagatttagatatccagatacaggag gctcagaggtcctcaaatagatacaattcaagaaggtcttctccacagcacattatagtc aaactgtcaaaaccaaagacaaaaagaagaattctaaaatcagcaagagaaaagcatctg ggcatttataaaggaatcctcatcagagtaacagcagatttctcagcagaaatcctacag tccaggagagaatggaataatatatccaaagtactgaaagaaagagaaaaaaactgcggg cacgaatattatacccagcaaagttatccttcatga >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_2|170_aa MTMAVLWNRKGGKVGKRLRNRMVAVSVWREGRFPPALLGFPGPTCAAARAPSHRGSSGPS AVGLHEDELWSDLRTARSPAPYAKKLSQPLQPSATTLLISQQPLTAGKTLHQQKENDSLK AQWLCTMKTLCTWRRESAAIMGLCIGTQYYPATVESNTGQNLAGAHGGSI >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_2|513_bp atgacaatggcagttttgtggaatagaaaagggggaaaggtggggaaaagattgagaaat cggatggttgctgtgtctgtgtggagagaggggcgctttcctcccgccctccttgggttc cccggtcctacctgcgccgcggccagggctccctcgcacagaggcagctccggcccctcg gccgtgggtctccacgaggatgagctctggtctgatctcaggactgcgcgcagccccgct ccctatgccaagaaattgtcacagccacttcaaccttcagcaacaaccttattgatcagt cagcagccattaacagcaggcaagactctccaccagcaaaaagaaaacgactcactgaag gctcagtggctgtgtaccatgaagactctgtgcacttggaggagggagagtgcagcaatt atgggactttgcattggaactcagtactacccagccacagtggaaagcaacacagggcag aacttagctggtgcccatggagggagcatttaa >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_3|67_aa MDRPLARLIKKKEKIQINTIGNDKSDITTNRTDIQKILSEYYQNLYAHKLENLEEMDKIL EIQASQD >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_3|204_bp atggataggccactagctagattaataaagaaaaaagagaagatccaaataaacacaatt ggaaatgacaaaagtgacattacaaccaaccgcacagacatacaaaaaatccttagcgaa tattatcaaaacctctatgcacacaaactagaaaacctagaagaaatggataaaatcctg gaaatccaagcctctcaagattga >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_4|83_aa MAGCRSQALPHGEAAKAREKSSTAAAGPAERTGSSLGQPRKGLPQRSSDLKGSSSTATVG AKAEEAPRVSEGCEGCQHAVTSQ >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_4|252_bp atggcgggctgcaggtcccaagccctgccccacggggaggcagctaaggcccgtgagaag tcgagcacagcagctgctggcccagctgagcgaacaggctctagccttggccagcccaga aaggggctcccacagcgcagcagtgacctgaagggctcctcaagcacggccacagtggga gccaaggccgaggaggcaccgagagtgagtgagggctgcgagggctgccagcacgctgtc acctctcagtag >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_5|175_aa MLPGLGLTLQGNGLPSGPWQVQKCCLGAYTWARDPRARLVLHTLVAELVPKLLPSLGFGV FSCKMTKWYHGLGGSQEAPSMQVEPQWKRLDLEVGSLPGSFPSPAQAVLEQKDLGSIFTP DVAAASARRRGCFVMRGAKGAFSLDLGSGPHREASSLPLLYWSMRVTPSNSCRQA >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_5|528_bp atgctgccaggcttgggactcacccttcaaggcaatgggctcccctctggcccatggcag gtccagaaatgctgtctaggagcctacacctgggctcgggacccaagagcccgcttggtg ctccacacccttgtggctgaactggtacctaagctgcttccttctctgggctttggtgtc ttctcttgtaagatgaccaagtggtatcacgggcttggtgggtcccaggaagccccttct atgcaggtggaacctcagtggaagagactggatttggaagtgggctctcttcctggttcc tttccaagtcctgctcaggctgtgctggagcagaaagatctaggctcgattttcaccccg gatgtggcggcagcgtcagcaaggaggagaggttgttttgtaatgcgtggcgctaaagga gcgttttccctggacttgggttctggccctcaccgcgaggcgtcatcgctgcctttgctt tactggtccatgcgagtgactccttcaaacagctgccggcaagcctag >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_6|1078_aa MAFYSCCWVLLALTWHTSAYGPDQRAQKKGDIILGGLFPIHFGVAAKDQDLKSRPESVEC IRYNFRGFRWLQAMIFAIEEINSSPALLPNLTLGYRIFDTCNTVSKALEATLSFVAQNKI DSLNLDEFCNCSEHIPSTIAVVGATGSGVSTAVANLLGLFYIPQVSYASSSRLLSNKNQF KSFLRTIPNDEHQATAMADIIEYFRWNWVGTIAADDDYGRPGIEKFREEAEERDICIDFS ELISQYSDEEEIQHVVEVIQNSTAKVIVVFSSGPDLEPLIKEIVRRNITGKIWLASEAWA SSSLIAMPQYFHVVGGTIGFALKAGQIPGFREFLKKVHPRKSVHNGFAKEFWEETFNCHL QEGAKGPLPVDTFLRGHEESGDRFSNSSTAFRPLCTGDENISSVETPYIDYTHLRISYNV YLAVYSIAHALQDIYTCLPGRGLFTNGSCADIKKVEAWQVLKHLRHLNFTNNMGEQVTFD ECGDLVGNYSIINWHLSPEDGSIVFKEVGYYNVYAKKGERLFINEEKILWSGFSREVPFS NCSRDCLAGTRKGIIEGEPTCCFECVECPDGEYSDETDASACNKCPDDFWSNENHTSCIA KEIEFLSWTEPFGIALTLFAVLGIFLTAFVLGVFIKFRNTPIVKATNRELSYLLLFSLLC CFSSSLFFIGEPQDWTCRLRQPAFGISFVLCISCILVKTNRVLLVFEAKIPTSFHRKWWG LNLQFLLVFLCTFMQIVICVIWLYTAPPSSYRNQELEDEIIFITCHEGSLMALGFLIGYT CLLAAICFFFAFKSRKLPENFNEAKFITFSMLIFFIVWISFIPAYASTYGKFVSAVEVIA ILAASFGLLACIFFNKIYIILFKPSRNTIEEVRCSTAAHAFKVAARATLRRSNVSRKRSS SLGGSTGSTPSSSISSKSNSEDPFPQPERQKQQQPLALTQQEQQQQPLTLPQQQRSQQQP RCKQKVIFGSGTVTFSLSFDEPQKNAMAHRNSTHQNSLEAQKSSDTLTRHEPLLPLQCGE TDLDLTVQETGLQGPVGGDQRPEVEDPEELSPALVVSSSQSFVISGGGSTVTENVVNS >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_6|3237_bp atggcattttatagctgctgctgggtcctcttggcactcacctggcacacctctgcctac gggccagaccagcgagcccaaaagaagggggacattatccttggggggctctttcctatt cattttggagtagcagctaaagatcaagatctcaaatcaaggccggagtctgtggaatgt atcaggtataatttccgtgggtttcgctggttacaggctatgatatttgccatagaggag ataaacagcagcccagcccttcttcccaacttgacgctgggatacaggatatttgacact tgcaacaccgtttctaaggccttggaagccaccctgagttttgttgctcaaaacaaaatt gattctttgaaccttgatgagttctgcaactgctcagagcacattccctctacgattgct gtggtgggagcaactggctcaggcgtctccacggcagtggcaaatctgctggggctcttc tacattccccaggtcagttatgcctcctccagcagactcctcagcaacaagaatcaattc aagtctttcctccgaaccatccccaatgatgagcaccaggccactgccatggcagacatc atcgagtatttccgctggaactgggtgggcacaattgcagctgatgacgactatgggcgg ccggggattgagaaattccgagaggaagctgaggaaagggatatctgcatcgacttcagt gaactcatctcccagtactctgatgaggaagagatccagcatgtggtagaggtgattcaa aattccacggccaaagtcatcgtggttttctccagtggcccagatcttgagcccctcatc aaggagattgtccggcgcaatatcacgggcaagatctggctggccagcgaggcctgggcc agctcctccctgatcgccatgcctcagtacttccacgtggttggcggcaccattggattc gctctgaaggctgggcagatcccaggcttccgggaattcctgaagaaggtccatcccagg aagtctgtccacaatggttttgccaaggagttttgggaagaaacatttaactgccacctc caagaaggtgcaaaaggacctttacctgtggacacctttctgagaggtcacgaagaaagt ggcgacaggtttagcaacagctcgacagccttccgacccctctgtacaggggatgagaac atcagcagtgtcgagaccccttacatagattacacgcatttacggatatcctacaatgtg tacttagcagtctactccattgcccacgccttgcaagatatatatacctgcttacctggg agagggctcttcaccaatggctcctgtgcagacatcaagaaagttgaggcgtggcaggtc ctgaagcacctacggcatctaaactttacaaacaatatgggggagcaggtgacctttgat gagtgtggtgacctggtggggaactattccatcatcaactggcacctctccccagaggat ggctccatcgtgtttaaggaagtcgggtattacaacgtctatgccaagaagggagaaaga ctcttcatcaacgaggagaaaatcctgtggagtgggttctccagggaggtgcccttctcc aactgcagccgagactgcctggcagggaccaggaaagggatcattgagggggagcccacc tgctgctttgagtgtgtggagtgtcctgatggggagtatagtgatgagacagatgccagt gcctgtaacaagtgcccagatgacttctggtccaatgagaaccacacctcctgcattgcc aaggagatcgagtttctgtcgtggacggagccctttgggatcgcactcaccctctttgcc gtgctgggcattttcctgacagcctttgtgctgggtgtgtttatcaagttccgcaacaca cccattgtcaaggccaccaaccgagagctctcctacctcctcctcttctccctgctctgc tgcttctccagctccctgttcttcatcggggagccccaggactggacgtgccgcctgcgc cagccggcctttggcatcagcttcgtgctctgcatctcatgcatcctggtgaaaaccaac cgtgtcctcctggtgtttgaggccaagatccccaccagcttccaccgcaagtggtggggg ctcaacctgcagttcctgctggttttcctctgcaccttcatgcagattgtcatctgtgtg atctggctctacaccgcgcccccgtcaagctaccgcaaccaggagctggaggatgagatc atcttcatcacgtgccacgagggctccctcatggccctgggcttcctgatcggctacacc tgcctgctggctgccatctgcttcttctttgccttcaagtcccggaagctgccggagaac ttcaatgaagccaagttcatcaccttcagcatgctcatcttcttcatcgtctggatctcc ttcattccagcctatgccagcacctatggcaagtttgtctctgccgtagaggtgattgcc atcctggcagccagctttggcttgctggcgtgcatcttcttcaacaagatctacatcatt ctcttcaagccatcccgcaacaccatcgaggaggtgcgttgcagcaccgcagctcacgct ttcaaggtggctgcccgggccacgctgcgccgcagcaacgtctcccgcaagcggtccagc agccttggaggctccacgggatccaccccctcctcctccatcagcagcaagagcaacagc gaagacccattcccacagcccgagaggcagaagcagcagcagccgctggccctaacccag caagagcagcagcagcagcccctgaccctcccacagcagcaacgatctcagcagcagccc agatgcaagcagaaggtcatctttggcagcggcacggtcaccttctcactgagctttgat gagcctcagaagaacgccatggcccacaggaattctacgcaccagaactccctggaggcc cagaaaagcagcgatacgctgacccgacacgagccattactcccgctgcagtgcggggaa acggacttagatctgaccgtccaggaaacaggtctgcaaggacctgtgggtggagaccag cggccagaggtggaggaccctgaagagttgtccccagcacttgtagtgtccagttcacag agctttgtcatcagtggtggaggcagcactgttacagaaaacgtagtgaattcataa >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_7|220_aa MWERDCEQKVVTQDDPTVVDPQDAVLEAELHLAFYYEGEPAGRGDGKTAGTCTAGEGATG ECGNNHFPPTVTTLLLRYESPLVLVGVRVKSDWQDPVAVVPTPIAMAPSSKICLTILPSA WTLHNAAEDKKHPSGHTHKQLDIKRNTPMVEDTSGWKLRGTHPRKSTPTGTDRFRQAIDW WNNAEFGWGGRRRAQLPGSPIAGENHLPTPAPFWPSHLPR >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_7|663_bp atgtgggagcgcgactgtgagcagaaggtggtgactcaggatgacccaactgtggtagat ccacaagatgctgtgctggaggctgaactacacctggcattttactatgagggagaacct gctggcagaggagatggaaagacagcgggaacttgcactgcaggggagggtgctacaggg gagtgtggcaacaatcattttccacccacagtcaccaccctgctccttcgctacgaatcc ccacttgtactggttggggtcagagttaagtctgattggcaagaccctgttgcagtggtc cctacacctattgccatggccccctccagtaaaatctgcctgaccatcttgcctagtgct tggacccttcacaatgcagctgaggacaaaaaacatcctagcgggcacacacacaagcag ctggacatcaagaggaacacaccaatggttgaggacacaagcggctggaaattgagagga acacacccgcgtaagagcacacccacaggcactgacagatttcggcaggccatcgactgg tggaacaatgcagagtttggctggggtggtcggagaagagcccagctgcccggaagcccg attgcaggggaaaaccaccttcccactccagcccccttctggccttcccatctacctcgc tga >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_8|241_aa MEEVDAAMNARPHKEDGRVVEPKRAVSREDSQRLGAHLTLKKIFVGGIKEDTEEHHLRDY FEDCGDGGYGGNKDGYNGFGSDSGYEGGSPSYSGGSRGYGSGGQDSGNQGSGYDGGGSHD SYNNGVGVSDFGASCPAKKQSAKMIPGGLSEAKPATPEIQEIVDKVKPQLEEKTNETYGK LEAVQYKTQVVAGTNYYIKVRAGDNKYMHLKVFKSLPGQNEDLVLTGYQVDKNKDDELTG F >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_8|726_bp atggaggaggtagatgcagccatgaatgcaaggccacacaaggaggatggaagagttgtg gaaccaaagagagctgtctcaagagaagattctcagagactaggtgcccacttaactctg aaaaagatatttgttggtggcattaaagaagacactgaagaacatcacctaagagattat tttgaagactgtggtgatggtggatatggtggcaataaggatggctataatggatttggt agtgatagtggttatgaaggaggcagccctagttactctggaggaagcagaggctacgga agtggtggacaggattctggaaaccagggcagtggctatgatgggggtggcagccatgac agctataacaatggagtaggcgtaagtgactttggtgcatcctgtccagcaaagaagcaa tcagccaaaatgatacctggaggcttatctgaggccaaacccgccactccagaaatccag gagattgttgataaggttaaaccacagcttgaagaaaaaacaaatgagacttacggaaaa ttggaagctgtgcagtataaaactcaagttgttgctggaacaaattactacattaaggta cgagcaggtgataataaatatatgcacttgaaagtattcaaaagtcttcccggacaaaat gaggacttggtacttactggataccaggttgacaaaaacaaggatgacgagctgacgggc ttttag >gi568815595f:122154190_122385188|GENSCAN_predicted_peptide_9|146_aa MATCYSADQRRTSPGTVALELLKVMRTIDDRIVHELNTTVPTASFAGKIDASQTCKQLYE SLMAAHASRDRVIKNCIAQTSAVVKNLREEREKNLDDLTLLKQLRKEQTKLKWMQSELNV EEVVNDRSWKVFNERCRIHFKPPKNE >gi568815595f:122154190_122385188|GENSCAN_predicted_CDS_9|441_bp atggccacttgctactccgccgaccagcgcagaacttcgccggggacggtggcgctggaa ttactcaaggtgatgaggacaattgatgacagaatagtacatgaattaaacactacggtt ccaacagcttcctttgcagggaaaattgatgccagccaaacctgtaaacaactttatgag tctttgatggcagctcatgccagtagagacagagtcataaaaaactgtatagcccagact tcagcagtagtaaaaaacctccgagaagagagagaaaagaatttggacgatttaacgtta ttaaaacaacttagaaaagagcagacaaagttgaaatggatgcagtcagaactgaatgtt gaagaagtggtaaatgacaggagctggaaggtgtttaatgaacgctgccgaattcacttc aagcctccaaagaatgaataa