GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:09:06 Sequence gi568815575f:89822086_90022808 : 200723 bp : 36.19% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2767 2868 102 0 0 81 76 45 0.283 2.89 1.02 Intr + 17235 17323 89 2 2 32 92 112 0.167 3.85 1.03 Intr + 51155 51534 380 2 2 44 81 203 0.025 8.88 1.04 Intr + 51645 51909 265 1 1 67 89 118 0.202 5.45 1.05 Term + 81041 81437 397 2 1 35 40 195 0.188 2.96 1.06 PlyA + 81465 81470 6 1.05 2.00 Prom + 94434 94473 40 -4.85 2.01 Sngl + 100001 100726 726 1 0 62 38 969 0.977 83.27 2.02 PlyA + 102109 102114 6 1.05 3.00 Prom + 102926 102965 40 -4.65 3.01 Init + 104160 104262 103 2 1 60 68 113 0.936 5.38 3.02 Intr + 104398 104589 192 2 0 19 90 215 0.931 13.34 3.03 Term + 104852 105024 173 0 2 41 49 109 0.956 -0.69 3.04 PlyA + 105073 105078 6 1.05 4.02 PlyA - 105347 105342 6 1.05 4.01 Sngl - 113455 112514 942 1 0 83 41 302 0.839 21.27 4.00 Prom - 113737 113698 40 -12.52 5.03 PlyA - 113975 113970 6 1.05 5.02 Term - 114562 114231 332 2 2 0 36 238 0.846 3.63 5.01 Init - 114859 114637 223 1 1 88 96 176 0.918 17.16 5.00 Prom - 119901 119862 40 -6.45 6.00 Prom + 119983 120022 40 -5.25 6.01 Sngl + 120784 121149 366 0 0 21 42 268 0.991 11.34 6.02 PlyA + 122103 122108 6 1.05 7.00 Prom + 125109 125148 40 -6.55 7.01 Init + 127973 128039 67 1 1 87 81 8 0.118 1.29 7.02 Intr + 141572 141644 73 0 1 84 66 43 0.134 -0.75 7.03 Intr + 142532 142599 68 2 2 65 101 79 0.355 4.53 7.04 Intr + 142708 142832 125 2 2 53 101 70 0.336 4.18 7.05 Term + 143020 143205 186 0 0 33 41 148 0.309 1.11 7.06 PlyA + 148125 148130 6 1.05 8.06 PlyA - 148237 148232 6 1.05 8.05 Term - 153054 152898 157 2 1 91 55 114 0.243 4.82 8.04 Intr - 154386 154335 52 1 1 88 97 15 0.054 -0.55 8.03 Intr - 163141 162981 161 0 2 99 90 36 0.039 3.61 8.02 Intr - 169577 169413 165 1 0 52 61 89 0.048 0.75 8.01 Init - 185865 185729 137 0 2 91 102 29 0.146 4.27 8.00 Prom - 187615 187576 40 -7.05 9.00 Prom + 189823 189862 40 -5.15 9.01 Sngl + 190452 191228 777 2 0 47 36 255 0.323 11.96 9.02 PlyA + 193599 193604 6 1.05 10.02 PlyA - 196081 196076 6 1.05 10.01 Term - 197994 197564 431 1 2 103 49 134 0.870 5.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 51321 51534 214 2 1 77 81 156 0.845 11.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_1|410_aa MGRESPLLKRRGKSEKDFVLWLACQLRHRRIEQQLASGNGSLQMDIGNLGASDRQIPDGT LGVGRNAQVKAGQQCCGPLTAETRSPQLEQQRLAAVGCVAHLHFPPTKVALGFINTLGGM HMCWASLLSLWPWSGKDLASSSGSCKGLVSDLWECALRGTQSHSCSVQMKVGQVHWRPKD DKAHLAGSSEGAAGVEAAVKGLSVTSGSGALKRMLNCGQRSDGDGMGVLGARRWQASFGR EQQSWKVTWCTVWLLHNTMDVVSIFGVHESTWSFSLVGCTIKLEIKIKKFTQNHTITWKL NDLLLKDFWVNNEIKVEIKFFETNENKDTTYQNLWDTAKAVLRGKFIALNALTKKLEKSQ FTNLTSQLKELKNQEQTILKASRRQEITKIRAELMEIETRKTILKNESKR >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_1|1233_bp atggggagagaatccccgcttctgaaaagaagaggaaaaagtgaaaaagactttgtcttg tggcttgcatgccagctcagacacagacgaatagagcagcagctggcatcagggaacggt agtctccaaatggatataggaaacctaggggctagtgatcggcagattcctgatgggacc ttgggagttgggagaaatgctcaggtgaaggcagggcagcagtgctgtgggcctttaact gcagaaaccaggtcccctcagctggagcaacagagactggcagctgtggggtgtgtggca cacttgcacttccctcctacaaaagtagctctgggttttattaatactcttgggggcatg cacatgtgctgggcctccttgctctctctctggccctggagtggcaaggacttagccagt agcagtggcagctgcaaggggcttgtcagcgacctctgggagtgtgctctcagaggaaca cagagccacagctgcagtgttcagatgaaggtggggcaggtgcactggaggcctaaagat gacaaggcccatttagcaggaagcagtgaaggggcagcaggggtggaggcagctgtaaag ggcttgtcagtgacctctgggagtggtgctctcaaaagaatgctgaactgtggccagcgt tcagatggagatggtatgggtgtgcttggggccagaaggtggcaagcctcatttggtagg gagcagcagagttggaaagtcacgtggtgcacagtctggctgctccacaataccatggat gtagtttctatctttggggtacatgaaagtacctggtctttctctttggtggggtgcaca atcaaattagaaatcaagattaagaaatttactcaaaaccatacaattacatggaaattg aatgacctgctcctgaaggacttttgggtgaataatgaaataaaggttgaaatcaagttc tttgaaactaatgaaaacaaagatacaacataccagaatctctgggacacagctaaggca gtgttaagaggaaaatttatagcactaaatgcccttaccaaaaagttagaaaaatctcaa tttaccaacctaacatcacaactaaaagaactaaagaaccaagagcaaaccattcttaaa gctagcagaagacaagaaataaccaaaatcagagctgaactgatggagattgagacacga aaaaccattttaaaaaatgaatccaagaggtga >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_2|241_aa MEAAADGPAETQSPVEKDSPAKTQSPAQDTSIMSRNNADTGRVLALPEHKKKRKGNLPAE SVKILRDWMYKHRFKAYPSEEEKQMLSEKTNLSLLQISNWFINARRRILPDMLQQRRNDP IIGHKTGKDAHATHLQSTEASVPAKSGPSGPDNVQSLPLWPLPKGQMSREKQPDPESAPS QKLTGIAQPKKKVKVSVTSPSSPELVSPEEHADFSSFLLLVDAAVQRAAELELEKKQEPN P >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_2|726_bp atggaggccgctgcggacggcccggctgagacccaaagcccggtggaaaaagacagcccg gcgaagacccaaagcccagcccaagacacctcaatcatgtcgagaaataacgcagataca ggcagagttcttgccttaccagagcacaagaagaagcgcaagggaaacttgccagccgag tccgttaagatcctccgcgactggatgtataagcatcggtttaaggcctacccttcagaa gaagagaagcaaatgctgtcagagaagaccaatttgtctttgttgcagatttctaactgg tttatcaatgctcgcagacgcattctcccggatatgcttcaacagcgtagaaacgacccc atcattggccacaaaacgggcaaagatgcccatgccacccacctgcagagcaccgaggcg tctgtgccggccaagtcagggcccagtggtccagacaatgtacaaagcctgcccctgtgg cccttgccaaagggccagatgtcaagagagaagcaaccagatccggagtcggcccctagc cagaagctcaccggaatagcccagccgaagaaaaaggtcaaggtttctgtcacatccccg tcttctccagaacttgtgtctccagaggagcacgccgacttcagcagcttcctgctgcta gtcgatgcagcagtacaaagggctgccgagctggagctagagaagaagcaagagcctaat ccatga >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_3|155_aa MKPWTLVVSVTALKVARLEFVPPDVRMCSEFLPSGFVNAPIDTVSSYSGGDLEHLCVDTR YLADLVGMWRTFASSSGIVNAPISALSKQTTQLYQSAGAEGAGSGLGQPRKGLPQCSGRL KGSSSAAKVGAQAEEALRASEGREDCQHAVTSQDE >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_3|468_bp atgaagccgtggaccctcgtggtgagtgttacagctcttaaggtggcgcgtctggagttt gttcctcctgatgttcggatgtgttcggaatttcttccttctgggtttgtgaatgcacca atcgacactgtatctagctactctggtggggacttggagcacctttgtgtggacactcgg tatctagctgatctggtggggatgtggagaacctttgcgtctagctctgggattgtaaac gcaccaatcagcgccctgtcaaaacagaccactcagctctaccaatcagcaggagctgag ggagccggctctggccttggccagcccagaaagggtctcccacagtgcagtggcaggctg aagggctcctcaagtgctgccaaagtgggagcccaggcagaggaggcgctgagagcgagc gagggccgtgaggactgccagcacgctgtcacctctcaggatgaatag >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_4|313_aa MQQEELTILNIYAPNTGGPRFIKQVLRDLQRDLDSHTITVGDFNTPLSILDRSTRQKINN DVQDLNLALDQVEPIDLYRTLHPKSTEYTFSSAPHCTYSKKDHIIGSKTLLSKCKRMEII TNSLSDHSAIKLELRIKKLIQNHTNTWKMNNLLLNDYCVNNKIKAEINKFFKTNENEDTT YQNLWDTFKAVFRGKFIALNAHNRKEERSKIDILTSKLKELEKQQQTNSKASRRHETTKI RAELKELETQITLQKINESRSCFLKKINKIDRLLARLIKEKREKNQIDAIKNDIRDITTD LTEICGYSHQRIL >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_4|942_bp atgcagcaagaagagctaactatcctaaatatatatgcacccaatacaggaggacccaga ttcataaagcaagttcttagagacctacaaagagacttagactctcacacaataacagtg ggagactttaacaccccactgtcaatattggacagatcaacaagacagaaaattaacaac gatgttcaggacttgaacttagctctagaccaagtggaaccaatagacctctacagaact ctccaccccaaatcaacagaatatacattctcttcagcacctcattgcacttattctaaa aaggaccacataattggaagtaaaacactcctcagcaaatgcaaaagaatggaaatcata acaaacagtctctcagaccacagtgcaatcaaattagaactcaggattaagaaactcatt caaaaccacacaaatacatggaaaatgaacaacctgctcctgaatgactactgtgtaaat aacaaaattaaggcagaaataaataagttcttcaaaaccaatgagaacgaagacacaacg taccagaatctctgggacacatttaaagcagtgtttagaggaaaatttatagcactaaat gcccacaacagaaaggaggaaagatctaaaattgacatcctaacatcaaaattaaaagaa ctagagaagcaacagcaaacaaattcaaaagctagcagaagacatgaaacaactaagatc agagcagaactgaaggagttagagacacaaataacccttcaaaaaatcaatgaatccagg agctgctttttgaaaaaaatcaacaaaatagatagactgctagccagactaataaaggag aaaagagagaagaatcaaatagatgcaataaaaaatgatataagggatatcaccactgat ctcacagaaatatgtgggtattcccatcagagaatactgtaa >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_5|184_aa MGKSQCKKAENSKNQNASPPPKDHNSSPAREQNWTENEFDELTEVGFRRWVITNNSELKE HVLSQCKEAKNLEKAQKLCEEYTSINSRVNQAEERISVIEGQLNEIKYEDKIREKRIKRN EQNLQEIWDYVKRTNLRLIGVPESDRENGTKLENTLQGIIRENFPNLAKQDNIQIQEIQR IPQR >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_5|555_bp atggggaaaagccagtgcaaaaaggctgaaaattccaaaaaccagaatgcctctcctcct ccaaaggatcacaactcatctccagcaagggaacaaaactggacagagaatgagtttgat gaactgacagaagtaggcttcagaaggtgggtaataacaaacaactccgagctaaaggag catgttctatcccaatgcaaggaagctaagaatcttgaaaaagcacaaaaactttgtgaa gaatacacaagtatcaacagccgagtcaatcaagcagaagaaaggatatcagtgattgaa ggtcaacttaatgaaataaagtatgaagacaagattagagaaaaaagaataaaaaggaat gaacaaaacctccaagaaatatgggactatgtgaaaagaacaaacctacgtttgattggt gtacctgaaagtgacagggagaatggaaccaagttggaaaacactcttcagggtattatc cgggagaacttccccaacctagcaaaacaggacaacattcaaattcaggaaatacagaga ataccacaaagataa >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_6|121_aa MSRGAEEQSGREWKRAVDSSRVSQKRRKEEASEHREFSRELPDPSTFQLPIHLAELHLQH SIKPCTHPLSPRVIGSFWDTGQVLRIQKAVTLALCPCDKAEGPLSLFTIKQSADGKTERP L >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_6|366_bp atgtcaagaggagcagaggaacagagcggcagagagtggaagagagcagtagacagcagc agggtgtcacagaaaagaaggaaggaagaagcgtctgaacatcgagagttcagcagagaa ctccccgacccctccaccttccagctccccatccatcttgctgagctccacctccaacat tcaataaaaccttgcactcatcctttgagcccacgtgtgattggatccttctgggacacg gggcaagtgctcaggatacagaaggctgtcacattggccctctgcccttgcgataaggca gagggtccattgagcttatttaccatcaaacaatctgcagatggcaaaactgaaagacct ttgtag >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_7|172_aa MEASAQLGRPHGAFNRGGRQRGIPRSESPVNKSKHASTQFAVESQLRYCVRNWWVLGLTD FKNEAADLRGVKLQTFAVSVTALKTARLELFVPPGGLVVSLALGVKLQTFASRVAYFDRA LIGAFTIPELDTKVLHLLIRLVRYRVSTHRFSKAPPEQLDTECRLVHSQTLS >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_7|519_bp atggaggcttctgctcaactgggcaggcctcacggagctttcaatcgtggtggaaggcaa aggggaattccaagaagtgaaagtcctgtaaataaaagcaagcatgcaagtacgcagttt gctgtagaaagtcaattgagatattgtgtccggaattggtgggttcttggtctcactgac tttaagaatgaagccgcggaccttcgcggagtgaagttgcagaccttcgcggtgagtgtt acagctcttaagacagcgcgtctggagttgtttgttcctcctggtgggctcgtggtctcg ctggctttaggagtgaagctgcagactttcgcgagccgagtggcctattttgacagggcg ctgattggtgcgtttacaatccctgagctagatacaaaggttctccacctcctcatcaga ttagttagatacagagtttccacacacaggttctccaaggccccaccagagcagctagat acagagtgtcgattggtgcactcacaaaccttgagctaa >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_8|223_aa MEWPALDWGKRHCIEPHFSGSHQRRLSVSRRHTSWEQKDEAFINLRNQEKAQEREIRDTR KRQKLSWLPCRLTNPRIPEKHLLTATNMASHGTLGKTALEEDVLEKGYTVAAAHFWLILT YCSESPVKQPKFSLPQSADCRKLQSNIGIGWNREGLEIFLHFENDMERFRRRKEFKGNTH AGDRLPNLAINWPQNWPQTKSLQHCDMFVMATMPTLKVVGLPE >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_8|672_bp atggagtggcctgccctagactggggaaagaggcactgtatcgagccccattttagtggt agccatcagagacgcttatctgtgtcccgcagacacactagctgggaacaaaaagatgaa gcctttataaacttaaggaatcaggagaaggctcaggagagagagataagagacacaaga aaaaggcagaagctctcatggctgccttgcaggctcacaaatcccagaatccctgagaag cacctgttaactgctacaaatatggcaagccatggcactttaggaaaaactgccctggaa gaggatgtcttagaaaagggctatactgttgcagctgcccacttttggttgatacttaca tattgctcagaatcacccgtaaagcagcccaagttttctcttcctcaatcagctgattgt agaaaactccaatcaaacataggtattggatggaatagagaaggcttggagatctttctg cattttgagaatgacatggagaggttcagaagaaggaaagaattcaaaggaaacacacat gctggagataggctcccaaatctggccataaactggccccaaaactggccacaaacaaaa tctctgcagcactgtgacatgtttgtgatggccacgatgcccacgctgaaggttgtgggt ttaccggaatga >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_9|258_aa MTVPPITGAGGLGGKNDFVGLGPGIPCYVQPQDMVACIPATQAVDKRGHGTGQSIDSEGA SPKPWHLPHDAGFAGTQKTKRFGNFHLDFRRCIQNLEYSGRSVLQGQSPHGEPLLWQCRR EMWSWSPHTESPVSYCLRRGQPSSSPPNDRSNDGLHSRCRKVSDTQCQPMRAARKGAVPC KATGVELPMTMGVHLLHHCDLDVRQEAKRDHSGALIFNDCPVGYQTGMGTRSPFVLADFS HLEWEHLYNACTPIVSWK >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_9|777_bp atgacagtccctcccatcacaggggctggaggcttaggaggaaaaaatgattttgtgggc ctaggtccagggatcccctgttatgtgcagcctcaggacatggtggcctgcatcccagcc actcaagctgtggataaaaggggtcatggtacaggtcagtccattgattcagagggtgca agccccaagccttggcaccttccacatgatgctggttttgcaggtacgcagaagacaaaa aggtttgggaacttccatttagatttcagaagatgtatacaaaatcttgaatattcaggc agaagtgtgctgcaggggcagagccctcacggagaacctctgctatggcagtgcagaagg gaaatgtggagttggagcccccacacagagtccccagtcagctactgtctgagaagaggg caaccgtcctccagccccccaaatgatagatccaatgacggcttgcacagtagatgcaga aaagtctcagacactcaatgccaaccaatgagagcagccaggaagggggctgtaccctgc aaagccacaggggtggagctgcccatgaccatgggagtccacctcttacatcattgtgac ctggatgtgagacaggaagctaaaagagatcattctggagctttaatatttaatgactgc cctgttggatatcagactggcatggggactcgtagcccctttgttttggcggatttctcc catttggaatgggagcatttatacaatgcctgtacccccattgtatcatggaaataa >gi568815575f:89822086_90022808|GENSCAN_predicted_peptide_10|143_aa XPTIISLCSHECLQQQYASCPEPMPKEMTKTPTPETVRHTIHKGTHRHPQAAWPNTQLSL APVSLELVLFQWRVPGELPNPCSSACSMTTSLSLILIFQRAHHTVCHSAVLTDTWLDNQP GPLLQAKPQHYITNFHITHEDYS >gi568815575f:89822086_90022808|GENSCAN_predicted_CDS_10|432_bp ngtcccaccatcatttcactgtgttcacatgagtgcctgcagcaacaatacgccagttgc ccagaacctatgcctaaagaaatgactaaaaccccaacacctgaaactgtgaggcacact atccataaaggaacacacaggcatcctcaggctgcctggcccaacacacagctgtctcta gcaccagtcagcctggagctggtcctattccagtggagagtgcctggggagctacccaac ccatgttcctctgcctgcagcatgacaaccagcctgtctctgatcctaatcttccagaga gcccaccacacagtctgccattctgctgtactcacagatacctggcttgacaatcaacca gggcccctgctgcaagcaaaaccacaacactacattacaaacttccacattacacatgaa gattatagctga