GENSCAN 1.0	Date run: 4-Nov-116	Time: 07:56:39

Sequence gi568815575f:48009980_48219631 : 209652 bp : 44.52% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  28706  28822  117  1  0   73   57    99 0.218   5.50
 1.02 Intr +  36248  36379  132  1  0   99   55    60 0.385   4.64
 1.03 Term +  38420  38593  174  1  0   38   47   138 0.964   2.46
 1.04 PlyA +  40555  40560    6                               1.05

 2.05 PlyA -  42478  42473    6                               1.05
 2.04 Term -  50224  48489 1736  2  2  126   38   672 0.918  53.81
 2.03 Intr -  50566  50471   96  2  0   74  111    12 0.775   2.08
 2.02 Intr -  50966  50840  127  2  1    7   82   173 0.970   8.85
 2.01 Init -  56907  56893   15  0  0   47  111    14 0.621  -0.19
 2.00 Prom -  75492  75453   40                              -3.16

 3.00 Prom +  90961  91000   40                              -1.46
 3.01 Init +  92389  92459   71  0  2   53   48    87 0.388   1.72
 3.02 Intr +  99949 100069  121  1  1  115  100   123 0.933  16.70
 3.03 Intr + 100504 100618  115  0  1   93   60   152 0.999  12.92
 3.04 Intr + 101281 101376   96  2  0   98   96   121 0.933  13.88
 3.05 Intr + 103214 103263   50  0  2   37  113    36 0.546  -0.60
 3.06 Intr + 107033 107168  136  1  1   46   99   130 0.329  10.04
 3.07 Intr + 109555 109651   97  2  1   90   32   112 0.729   4.87
 3.08 Term + 110121 110247  127  0  1   95   48    70 0.377   1.46
 3.09 PlyA + 110688 110693    6                               1.05

 4.00 Prom + 114950 114989   40                              -7.76
 4.01 Init + 120709 120853  145  0  1   86   37   334 0.989  28.38
 4.02 Intr + 121282 121440  159  2  0  115  115   211 0.997  26.46
 4.03 Intr + 122271 122349   79  1  1  127  119    84 0.811  14.01
 4.04 Intr + 147900 148035  136  0  1  117  100    71 0.971  11.77
 4.05 Term + 148469 148543   75  1  0  114   43   108 0.993   6.74
 4.06 PlyA + 149268 149273    6                               1.05

 5.03 PlyA - 151247 151242    6                               1.05
 5.02 Term - 153247 153141  107  2  2   70   48   115 0.911   4.27
 5.01 Init - 155849 155759   91  2  1  103  106    40 0.940   8.19
 5.00 Prom - 163163 163124   40                              -2.46

 6.09 PlyA - 163785 163780    6                               1.05
 6.08 Term - 177752 177652  101  0  2   82   35   103 0.107   2.69
 6.07 Intr - 180289 180154  136  1  1   36   99   114 0.965   7.44
 6.06 Intr - 182383 182253  131  2  2   47  113    -3 0.578  -1.49
 6.05 Intr - 184245 184150   96  1  0   94   96    74 0.986   8.78
 6.04 Intr - 184875 184761  115  0  1  108   60   178 0.999  17.02
 6.03 Intr - 185399 185311   89  0  2   73  100    83 0.796   7.69
 6.02 Intr - 188866 188698  169  1  1   84  110    87 0.873  10.02
 6.01 Init - 190231 190118  114  1  0   91   93    67 0.924   7.71
 6.00 Prom - 202374 202335   40                              -1.86

 7.00 Prom + 203881 203920   40                              -5.06
 7.01 Init + 204529 204661  133  0  1   65   99   135 0.057  12.60
 7.02 Intr + 206880 206976   97  1  1   85   32   104 0.087   3.57
 7.03 Term + 207451 207577  127  1  1   95   48    75 0.047   1.96
 7.04 PlyA + 208012 208017    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  61460  61547   88  1  1   90   92    39 0.837   5.37
S.002 Init + 144633 144778  146  2  2   79   41    92 0.827   3.19
S.003 Init - 171202 171161   42  1  0   95  116    27 0.823   6.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:48009980_48219631|GENSCAN_predicted_peptide_1|140_aa
MEEVMQRVKVMLEGTEKKEDTAKDSKKGKNRTLAKKSYRAEKDTKGPGSQHEYLGEDTGS
EEQPALHLSHSAISIQRNTNQDLHDYARSLKGEAICLLKLLLFLEPDKVEWKPANLSNII
LGDILGTSYDDELDQLLMTE

>gi568815575f:48009980_48219631|GENSCAN_predicted_CDS_1|423_bp
atggaggaagttatgcagagagtgaaggtgatgctggaggggacagagaaaaaagaggac
acagcaaaggacagtaaaaagggtaagaataggacgttggcaaagaaatcctacagggca
gaaaaagacacaaagggtccaggaagccagcatgaatacctgggtgaagacacaggttcc
gaggaacagccagcactccatctgagtcacagcgccattagtattcaaagaaacacaaat
caggatttacacgattatgccagatccctgaaaggtgaggccatatgtctgctgaaacta
cttctctttttggaacctgacaaggttgaatggaagcctgcaaacctgagtaacatcatt
ctgggagacattttgggtacatcatatgatgatgaactggaccaattgttgatgactgag
tga

>gi568815575f:48009980_48219631|GENSCAN_predicted_peptide_2|657_aa
MIESQEPVTFEDVAVDFTQEEWQQLNPAQKTLHRDVMLETYNHLVSVGCSGIKPDVIFKL
EHGKDPWIIESELSRWIYPDRVKGLESSQQIISGELLFQREILERAPKDNSLYSVLKIWH
IDNQMDRYQGNQDRVLRQVTVISRETLTDEMGSKYSAFGKMFNRCTDLAPLSQKFHKFDS
CENSLKSNSDLLNYNRSYARKNPTKRFRCGRPPKYNASCSVPEKEGFIHTGMEPYGDSQC
EKVLSHKQAHVQYKKFQAREKPNVCSMCGKAFIKKSQLIIHQRIHTGEKPYVCGDCRKAF
SEKSHLIVHQRIHTGEKPYECTKYGRAFSRKSPFTVHQRVHTGEKPYECFECPKAFSQKS
HLIIHQRVHTREKPFECSECRKAFCEMSHLFIHQITHTGKKPYECTECGKTFPRKTQLII
HQRTHTGEKPYKCGECGKTFCQQSHLIGHQRIHTGEKPYVCTDCGKAFSQKSHLTGHQRL
HTGEKPYMCTECGKSFSQKSPLIIHQRIHTGEKPYQCGECGKTFSQKSLLIIHLRVHTGE
KPYECTECGRAFSLKSHLILHQRGHTGEKPYECSECGKAFCGKSPLIIHQKTHPREKTPE
CAESGMTFFWKSQMITYQRRHTGEKPSRCSDCGKAFCQHVYFTGHQNPYRKDTLYIC

>gi568815575f:48009980_48219631|GENSCAN_predicted_CDS_2|1974_bp
atgattgagtcccaggaaccagtgacatttgaggatgtggctgtggacttcacgcaggaa
gagtggcagcagttgaatcctgctcagaagaccctgcatagggatgtgatgctggagacc
tataatcacctggtctccgtggggtgttcaggtataaaaccagatgtaatctttaagttg
gaacatggaaaggacccatggatcatagagagtgagttgtcaaggtggatctacccagac
agagtgaaaggccttgaatcttcccagcagatcatttctggagaacttttatttcaaagg
gagatactagaaagagccccaaaggataattcattgtactctgttttaaaaatctggcat
attgataatcagatggatagatatcaaggaaatcaagacagagttttgaggcaggtcaca
gtcatcagtcgtgaaacattgactgatgagatgggttccaagtacagtgcatttgggaaa
atgttcaatcggtgcacagaccttgctcctttaagtcaaaaattccataagtttgattca
tgtgaaaatagcttgaagtctaattcagacttactaaattataacaggagctatgcaaga
aagaaccccactaagagatttagatgtgggagaccacctaagtataatgcttcctgttct
gtgcctgagaaggaaggcttcattcatactggaatggagccctatggagatagtcaatgt
gaaaaagttctcagtcataagcaagcccatgttcagtataagaaatttcaagccagagag
aaacccaatgtttgtagtatgtgtgggaaagcctttatcaagaagtcacagctcattata
catcaaagaattcatactggagagaaaccatatgtatgtggagattgtaggaaagccttc
agtgagaaatcacacctcattgtgcatcagaggattcatactggggagaaaccctatgaa
tgtactaagtatggaagagcattctcccggaagtcacctttcactgttcatcagagagtc
catactggagagaaaccctatgagtgttttgagtgtccaaaagctttctcccagaagtca
catctaattatacatcagagagttcataccagagagaagccctttgaatgcagtgaatgc
aggaaagccttctgtgagatgtctcacctttttatacaccagataactcatactgggaag
aagccctatgaatgtactgaatgtgggaagaccttccctcggaaaacacagctcattata
catcagagaacgcatactggagagaagccctataagtgtggtgaatgtgggaaaactttc
tgccaacagtcccacctcataggacatcaaagaattcatacaggagaaaaaccttatgtg
tgtactgactgtgggaaggccttttcccagaagtcacacctcactggccatcaaagactt
catactggagagaaaccttatatgtgtactgaatgtggaaaatccttctctcagaaatca
cctcttatcatacaccagagaattcatacaggggagaaaccttatcagtgtggtgaatgt
ggcaaaaccttctcccagaaatcactcctcattattcatctgagagttcacacaggggag
aaaccttatgagtgtactgagtgtgggagggccttttccctgaagtcacatctcattcta
catcagagaggtcatactggagagaaaccctatgaatgtagtgaatgtggaaaggccttc
tgtggaaagtctccactcattatacatcagaaaactcatcctagggagaaaacccctgaa
tgtgctgagtctggaatgacttttttctggaaatcacagatgattacatatcagagaaga
cacactggggagaaaccctccagatgcagtgactgtgggaaggcattctgccagcatgta
tactttactgggcatcagaatccatataggaaagacaccttgtatatatgctga

>gi568815575f:48009980_48219631|GENSCAN_predicted_peptide_3|270_aa
MIGSFLKPSLEADAGAMLAQSAELTGKTQAVSLAGQSAPGAMNGDDAFAKRPRDDAKASE
KRSKAFDDIAKYFSKEEWEKMKFSEKISCVHMKRKYEAMTKLGFNVTLSLFMRNKRATDS
QRNDSDNDRNRGNEVERPQMTFGRLQRIIPKIMPEKPAEEGSDSKGVPEASGPQNDGKKL
CPPGKASSSEKIHERSGPKRGKHAWTHRLRERKQLVIYEEISDPEEDDNLRDTTHAHDEK
QNVVTFHERGHGCGPLVIRCIASESKCSQQ

>gi568815575f:48009980_48219631|GENSCAN_predicted_CDS_3|813_bp
atgattggaagcttcctgaagccctcactagaagcagatgctggtgctatgcttgcacag
tctgcagaactgactggaaagactcaggctgtttctcttgcaggtcagagtgctcctggt
gccatgaacggagacgacgcctttgcaaagagacccagggatgatgctaaagcatcagag
aagagaagcaaggccttcgatgatattgccaaatacttctctaaggaagagtgggaaaag
atgaaattctcggagaaaatcagctgtgtgcatatgaagagaaagtatgaggccatgact
aaactaggtttcaacgtcaccctctcacttttcatgcgtaataaacgggccacagactct
cagaggaatgattctgataatgaccgtaaccgtgggaatgaggttgaacgtcctcagatg
acttttggcaggctccagagaatcatcccgaagatcatgcccgagaagccagcagaggaa
ggaagtgattcgaagggagtgccagaagcatctggcccacagaacgatgggaaaaagctg
tgcccgccgggaaaagcaagtagctctgagaagattcacgagagatctggacccaaaagg
gggaaacatgcctggacccacagactgcgtgagagaaagcagctggtgatttatgaagag
atcagcgaccctgaggaagatgacaacctcagggatacgacacatgcccatgatgagaag
cagaacgtggtgacctttcacgaacgtgggcatggctgcggacccctcgtcatcaggtgt
atagcaagtgaaagcaagtgttcacaacagtga

>gi568815575f:48009980_48219631|GENSCAN_predicted_peptide_4|197_aa
MKAWGTVVVTLATLMVVTVDAKIYERCELAARLERAGLNGYKGYGVGDWLCMAHYESGFD
TAFVDHNPDGSSEYGIFQLNSAWWCDNGITPTKNLCHMDCHDLLNRHILDDIRCAKQIVS
SQNGLSAWTKGPAAGKTQAVSLAGQIAPSATNGDDAFARRPRVGSQIPENMQKVFDDIAK
YFSKKECEKTKAWEKII

>gi568815575f:48009980_48219631|GENSCAN_predicted_CDS_4|594_bp
atgaaggcctggggcactgtggtagtgaccttggccacgctgatggttgtcactgtggat
gccaagatctatgaacgctgcgagctggcggcaagactggagagagcagggctgaacggc
tacaagggctacggcgttggagactggctgtgcatggctcattatgagagtggctttgac
accgccttcgtggaccacaatcctgatggcagcagtgaatatggcattttccaactgaat
tctgcctggtggtgtgacaatggcattacacccaccaagaacctctgccacatggattgt
catgacctgctcaatcgccatattctggatgacatcaggtgtgccaagcagattgtgtcc
tcacagaatgggctttctgcctggaccaaaggtcctgcagctggaaagactcaggctgtt
tctcttgcaggtcagattgctcccagtgccacgaatggagacgacgcctttgcaaggaga
cccagggttggttctcaaataccagagaacatgcaaaaggtcttcgatgatattgccaaa
tatttctctaagaaagaatgcgaaaagacgaaagcctgggagaaaatcatctag

>gi568815575f:48009980_48219631|GENSCAN_predicted_peptide_5|65_aa
MPMFVKGHHILLLIMGMCHIPEAEARGEKERTATATPPFSNHHLDQPAAINTEARPSTSK
KSVTY

>gi568815575f:48009980_48219631|GENSCAN_predicted_CDS_5|198_bp
atgcccatgttcgtgaaaggtcaccacattctgcttctcatcatgggcatgtgtcatatc
cctgaggctgaggcaagaggagagaaggaaagaactgccacagccactccacccttcagc
aaccaccaccttgatcagccagcagccatcaacactgaggcaagaccctccaccagtaaa
aagagtgtgacttactga

>gi568815575f:48009980_48219631|GENSCAN_predicted_peptide_6|316_aa
MRYHYTLITIAKIKDIVTTPNGDKDAEKMDTSLSAAGKRHRHRVSAPRKGLSKEDYIRSD
IFMINSQPSTSSGQSYVYFISNEQNQAGSTRWDEGQSAPGAMNGDDAFVRRPRVGSQIPE
KMQKAFDDIAKYFSEKEWEKMKASEKIIYVYMKRKYEAMTKLGFKATLPPFMRNKRVADF
QGNDFDNDPNRGNQECPHATEVSLEFGNLYQQRKILMYSLSVEHPQMTFGRLQGIFPKIT
PEKPAEEGNDSKGVPEASGPQNNGKQLRPSGKLNTSEKVNKTSGPKRGKHAWTHRVRERK
QLVIYEEISDPQEDDE

>gi568815575f:48009980_48219631|GENSCAN_predicted_CDS_6|951_bp
atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca
aatggtgacaaggatgcagagaaaatggacacctcattaagtgctgctgggaagcgccac
cgacacagagtgtcagcccccagaaagggcctttccaaggaggactatatcaggtctgac
attttcatgatcaacagccagccatctaccagttctggccaatcctatgtgtatttcatc
agcaatgaacagaaccaagctgggagcacgagatgggatgagggacagagtgctcccggt
gccatgaacggagacgatgcctttgtacggagacctagggttggttctcaaataccagag
aagatgcaaaaggccttcgatgatattgccaaatacttctctgagaaagagtgggaaaag
atgaaagcctcggagaaaatcatctatgtgtatatgaagagaaagtatgaggccatgact
aaactaggtttcaaggccaccctcccacctttcatgcgtaataaacgggtcgcagacttc
caggggaatgattttgataatgaccctaaccgtgggaatcaggaatgccctcatgcaact
gaagtctctctagagtttggaaatctttaccaacaaagaaaaattctgatgtattctctt
tcagttgaacatcctcagatgactttcggcaggctccagggaatcttcccgaagatcacg
cccgagaagccagcagaggaaggaaatgattcgaagggagtgccagaagcatctggccca
cagaacaatgggaaacagctgcgcccctcaggaaaactaaatacctctgagaaggttaac
aagacatctggacccaaaagggggaaacatgcctggacccacagagtgcgtgagagaaag
caactggtgatttatgaagagatcagcgaccctcaggaagatgacgagtaa

>gi568815575f:48009980_48219631|GENSCAN_predicted_peptide_7|118_aa
MPEKPAEEGNDSKEVLEASGPQNDGKQLCPPGKASSSEKIHERSGTKRGKHAWIHRLRER
KQLVIYEEISDPEEDDNLGDMTHAHDEKQNVVTFHEHGHGCGPLIIRCIASESKCSQQ

>gi568815575f:48009980_48219631|GENSCAN_predicted_CDS_7|357_bp
atgcctgagaagccagcagaggaaggaaatgattcaaaggaagtgctagaagcatctggc
ccacagaacgatgggaaacagctgtgccccccgggaaaagcaagtagctctgagaagatt
cacgagagatctggaaccaaaagagggaaacatgcctggatccacagactgcgtgagaga
aagcaactggtgatttatgaagagatcagcgaccctgaggaagacgacaacctcggggat
atgacacatgcccatgatgagaagcagaatgtggtgacctttcacgaacatgggcatggc
tgcggacccctcatcatcaggtgcatagcaagtgaaagcaagtgttcacaacagtga