GENSCAN 1.0    Date run: 6-Nov-116    Time: 01:26:36

Sequence gi568815589f:112474081_112718913 : 244833 bp : 39.81% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    952    947    6                               1.05
 1.05 Term -  11083  10929  155  2  2   41   55   109 0.363   0.00
 1.04 Intr -  11687  11611   77  1  2   59   82    16 0.447  -3.66
 1.03 Intr -  12681  12310  372  2  0  -20   85   435 0.584  25.65
 1.02 Intr -  13399  13033  367  2  1   95   39   153 0.397   4.38
 1.01 Init -  21922  21901   22  1  1   87   89    33 0.429   1.45
 1.00 Prom -  34821  34782   40                              -5.65

 2.00 Prom +  36836  36875   40                              -3.55
 2.01 Init +  38439  38553  115  2  1   40  103    75 0.317   4.62
 2.02 Term +  46450  46556  107  2  2   49   32   130 0.465   1.09
 2.03 PlyA +  46618  46623    6                               1.05

 3.03 PlyA -  47513  47508    6                               1.05
 3.02 Term -  51918  51830   89  1  2  -50   36   331 0.875  10.94
 3.01 Init -  60619  60583   37  1  1   71   72    24 0.216  -0.67
 3.00 Prom -  69799  69760   40                              -5.45

 4.00 Prom +  77040  77079   40                              -5.25
 4.01 Sngl + 100001 101242 1242  1  0   77   41   797 0.968  69.47
 4.02 PlyA + 102950 102955    6                               1.05

 5.00 Prom + 109416 109455   40                              -4.95
 5.01 Init + 109703 109770   68  1  2   81   88    64 0.033   6.30
 5.02 Intr + 115579 115655   77  1  2   63   80    39 0.011  -1.16
 5.03 Term + 143791 144836 1046  2  2   87   47  1073 0.832  94.40
 5.04 PlyA + 145242 145247    6                               1.05

 6.00 Prom + 161016 161055   40                              -2.55
 6.01 Init + 170238 170259   22  2  1   62  100    -7 0.340  -2.07
 6.02 Intr + 171570 171742  173  1  2   96  116   111 0.753  13.44
 6.03 Term + 180413 180493   81  1  0   38   40   103 0.452  -2.79
 6.04 PlyA + 181884 181889    6                               1.05

 7.00 Prom + 182890 182929   40                              -6.05
 7.01 Sngl + 185087 185989  903  1  0   48   52   984 0.988  86.76
 7.02 PlyA + 188718 188723    6                               1.05

 8.06 PlyA - 189757 189752    6                               1.05
 8.05 Term - 189949 189788  162  1  0   10   38   200 0.276   4.15
 8.04 Intr - 195061 194932  130  0  1   56    3   127 0.046   0.78
 8.03 Intr - 205515 205377  139  1  1   22   61   103 0.057  -0.40
 8.02 Intr - 207578 207443  136  2  1   52   93   151 0.973  11.22
 8.01 Init - 208417 208283  135  1  0   67   92   106 0.945   9.09
 8.00 Prom - 209885 209846   40                              -8.55

 9.05 PlyA - 212259 212254    6                               1.05
 9.04 Term - 213553 213458   96  1  0   86   49    92 0.654   2.09
 9.03 Intr - 215537 215447   91  1  1   34   99    71 0.294   1.88
 9.02 Intr - 220153 220051  103  2  1  130   40    71 0.314   4.81
 9.01 Intr - 243952 243907   46  1  1   52  111   105 0.789   6.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 108510 108409  102  1  0   99   94    63 0.843   7.23

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_1|330_aa
MASLKFVNPRAGAGAPAPRTAPAAAAETRAAGRRGGPGQADRSRHRGSLRLAPPSPALPT
SGRPGPCPRSGLAPPRGDKRKVQPGHRAAADGEAESTPARPPPTQRGSPPGSRRRRTRPR
PARGLSPTCSRHRKRAVVGETHRKCVTTPPTARTELAVVAAPRRCRSSLLPHTRSHAAAG
ALDREVGAIHPPRARAQSLLPRQVWNPPSPLWSEEVYSTAPYYTSWARGPRCPPELCAPP
LKRKEQTEPTPSVRPYVVPTSWPHLICDRHTPVTCTLGVALFTIAKRLKQPKCSSTDEEM
NENVIYTYNGILFSLKKEGNLITGYNIDEP

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_1|993_bp
atggcctcactcaagtttgtcaacccaagagcaggagcgggggcgccagccccacgcact
gcgcctgcagccgccgctgagacccgagctgccgggcggcggggcggcccggggcaggcg
gaccggagccgccaccgggggtctctgcgcctcgcacccccaagcccggcgctgccgacg
agcggccgcccgggtccctgccctcgctccggcctcgctcccccgagaggagacaaaagg
aaggttcagccgggtcaccgcgcggccgctgacggggaagcagagtcgaccccggcgcgg
ccgccccccacacaaaggggctccccgcctgggtcgcggcgtcggcgcactcgtccccgc
cccgcgcgcgggctctcacccacctgcagccggcaccggaaacgcgccgtggtcggcgag
acccacaggaagtgtgtaacgacacctcctacggcacgcaccgagctcgcagtcgttgct
gccccgaggaggtgtagatcgagcctcctgccgcacacacgttcgcacgcggcggccggc
gcgctggaccgagaggttggagcaatccatcctccgcgcgcacgagctcagtccctactg
ccccgacaggtgtggaacccgccgtccccgctatggtccgaggaagtgtacagcactgct
ccctactacacgtcctgggcccgcggtccccgatgccccccggagctctgtgcacctcca
ctgaaacgcaaggagcagacagaacctacaccatccgtgaggccttacgtagtgcccaca
tcttggcctcacctcatctgtgacaggcataccccagtcacttgtacccttggagtagca
ctattcacaatagccaagaggttgaagcagcccaaatgttcatcaacagatgaagagatg
aatgaaaatgtgatatatacatataatggaatattattcagccttaaaaaggaaggaaat
cttatcacaggctacaacatagatgaaccttga

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_2|73_aa
MCETPTANFMFNNLRLNALNSDDSVSSIRSGNVTEHVAYRRVAEEAEALIEKEVKKQRNE
QGVETEKSMNFSQ

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_2|222_bp
atgtgtgaaacacctacagctaacttcatgtttaataatttaagactgaatgctctcaat
tcagatgactctgtctcatccattcgttcaggaaatgtcactgagcacgtagcatatagg
agagtggcagaagaagcagaagcactgattgaaaaggaagtgaagaaacagagaaatgaa
cagggtgttgaaactgagaaatcaatgaatttttcacagtaa

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_3|41_aa
MVVDAIIPVTQEEEEEEEEEEEEEEEEEEEEEEEEEEEERI

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_3|126_bp
atggtggtggatgccataatcccagttactcaagaggaagaagaagaagaagaagaagaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaaga
atataa

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_4|413_aa
MEDCLHTSSENLSKLVSWAHSHGTICSLIPNLKHLLSEGSHGNLTAMWGCSAGHAYHWPL
TATCRAGSQERVCFQDNRSFNSDSPSIIGVPSETQTSPVERYPGRPVKAKLDCNRTRDSC
DFSYCSEPSELDETVEEYEDENTLFDMVCESSVTDEDSDFEPQTQRPQSIARKRPGVVPS
SLHSSSQTQMVDECSNDVIIKKIKQEIPEDYYIVANAELTGGVDGPALSLTQMAKPKPQT
HAGPSCVGSAKLIPHVTSAISTELDPHGMSASPSVISRPIVQKTARVSLASPNRGPPGTH
GTNQQVAMQMPVSTSHPNKQISIPLSALQLPGQDEQVASEEFLSHLPSQVSSCEVALSPS
VNTEPEVSSSQQQPPVAPAITTEATAQCIPGMAHEATVSPSSTHARILRRQHF

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_4|1242_bp
atggaggattgtcttcatacctcatctgagaatctgtccaaattggtcagctgggcccat
agccatgggactatttgcagcctcattccaaacctgaaacacttgctttctgaaggttcc
catgggaacctgacagcaatgtggggctgtagtgctggccatgcttatcactggccacta
acagctacttgcagagctgggtcccaagagagggtctgtttccaggataacagaagtttt
aactctgatagtcccagtataatcggggtgccctctgagacacagactagccctgttgaa
aggtaccctgggagaccagtgaaagcaaagctagactgtaaccggaccagagactcttgt
gacttctcctactgtagtgagccctctgaactggatgaaactgttgaagaatatgaagat
gagaacaccctgtttgacatggtttgtgagtcttctgttacagatgaggatagtgacttt
gaaccccaaacccaaaggcctcaaagcattgctcgcaaaagacctggggtagtcccatct
tccctccattcaagctcccagacgcagatggttgacgaatgcagcaatgatgtcatcatc
aagaaaatcaaacaagaaatccccgaagattattacattgtggcaaatgcagaactgaca
ggaggagtagatggaccagccctgtccttgacacagatggcaaaacccaagcctcagact
cacgctggtccctcctgtgtagggtctgctaaactgattccccatgtcacatctgccatc
agcacggagctagacccacacggtatgtctgcatccccctctgtgatctccagaccaatt
gtccagaagactgctagggtatctctggcttcaccaaacagaggaccccctggtacacat
ggcaccaaccaacaggtggccatgcaaatgcctgtgagcacatcccatcctaacaaacag
atcagtatccccttgtctgccctgcagctgcctggacaggatgagcaagttgcctctgaa
gagttcctgtcccatctgcccagccaggtctcctcctgtgaggtagccctttctccctca
gttaacacagagccagaagtgagctccagtcagcagcagcccccagtcgctccagccata
accactgaggccacagcacagtgcataccaggtatggcacatgaggcgacagtgagtccc
tcatccacgcacgccagaattctgcgccgtcagcacttctaa

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_5|396_aa
MDWKETPQIHSSGFLRGGGNGNRPESRMGYVVKSRTEQVDWQGCLSRADQDERAAELSRE
QNEKTIRSTQTALRNFREFLISKYPSETREIYVIPCKELDAYLASFFVDARQKDGSEYEP
NSLANYQCGLERYLKEHRYGYSITRDKEFKRSQEALKQKQIELRCKGKGNKPHKSMKLTF
ADELILRKRGLLSRYNPEGLLNLVWLNNTKAFGHCTGFHGSTLKWGDIRLRVTETGLEYL
EWMGQDTGDLNAKTKRGGTDSRVYATQHAPQTCPVQDYKEYAQRRPPAMRYEDAPFYLSI
KPVVNLAALHWYNCQALGKNKLAKMVKTMCEKGNIPGRKTNFSVYQSCSTLSEAQSNQLV
LICNNLSQQAAQSVAGHSNNGNFIVSASYDSSSDTA

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_5|1191_bp
atggactggaaggagacaccccagatacacagtagtggttttcttcgaggagggggtaat
ggaaacaggcctgaaagtaggatgggatatgttgtgaaaagtaggacggaacaagttgat
tggcagggctgcctttccagagcagaccaggatgaaagagcagctgagctcagcagggag
cagaacgagaaaaccatccggagcacgcagaccgcgctccgcaatttccgtgagttcctc
atctccaagtatccttctgaaacaagagagatttatgtcatcccttgcaaggagttggat
gcctaccttgcctctttctttgttgatgccaggcagaaggatgggtccgaatacgaaccc
aacagcttggccaattaccagtgtgggctcgaaaggtacctgaaagaacacaggtatggc
tatagcatcaccagggataaggaattcaagcgttcccaagaggccctgaagcagaagcaa
attgaactccgctgtaaaggaaaaggaaataagccacacaagtccatgaagctcaccttt
gctgacgagctcatcctgcggaaaaggggactgctaagccgatataaccccgagggtttg
ctcaacctagtctggctcaacaacacaaaagcttttgggcattgcacaggcttccatgga
tctaccttaaaatggggtgatatccggctccgggtaacagagacgggtctcgagtacttg
gagtggatgggtcaggacactggagacttgaatgccaaaaccaagagaggggggacagac
tcccgtgtgtatgccacccagcacgccccacagacctgccctgtccaggactataaggag
tatgcccagcggcggcctcccgccatgcgctacgaggatgcccctttctacttatccatc
aagccagtcgtgaacctggcggctctgcattggtacaactgccaggcccttggcaagaac
aagctggccaagatggtgaagaccatgtgtgagaagggcaacatccctggcaggaaaacc
aacttcagtgtgtatcagagctgcagcaccttgtctgaggcccagagcaaccagctcgtg
ctgatctgtaacaatctgagccagcaggctgcccagtcagtggccggccactccaacaat
ggcaatttcatcgtctccgcctcctatgactcttcctcagacaccgcttga

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_6|91_aa
MLRGSRKAYSTKLNKFPVFNINDDLNDLCTSAVSPNTTKATRYALNVWRYWCMTNGLKDH
TDITKLCEKSTPHKEAEAKDENPDGLGVRSF

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_6|276_bp
atgctcagagggagcaggaaagcttattccactaagctcaacaaatttcctgtatttaat
attaatgatgacttgaatgatctgtgtaccagtgcagtaagcccaaatactaccaaagcc
acgcggtacgccttgaatgtgtggcgttattggtgcatgaccaacgggctcaaagaccac
acagacatcaccaagctatgtgaaaaaagcactccgcacaaagaagcagaggcgaaagat
gagaatcctgatggccttggagtccgctcgttctga

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_7|300_aa
MSSTQPLVRIRRKEGSGCVSVKCVTSVFVSFEIPAVKLNELLENFYVTVKKSDGSDFLAT
SLHAIRRGLDRILKNAGVGFSITSSTFSSSTKKLKEKLWVLSKAGMSGARSRNIVYFSLS
DEEEMWQAGCLGDDSPITLLSTVVKYNSQYLNMRTLQEHADLMYGDIELLKDPQNQPYFA
RTDSVKRESRSGSTRVCHGKIYHEHSRGHKQCPYCLLYKYMYIHRPPTQMEAKSPFYLTA
RKEATDMGSVWYEEQRMGLRSLRGIVPNLAKKVKLENCENFTFVSFTQVSRRLGSHSCCQ

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_7|903_bp
atgtcctcaacacagcccctggtgaggattagaaggaaggaggggtctggctgtgtgagt
gtcaagtgtgtaacctccgtgtttgtctcatttgagatccctgcagtgaagttgaacgag
ctgctcgagaacttttatgtcaccgtcaagaagagcgacggctcggacttcctggccacc
tcgctccatgctattcgccgaggcctggaccgcatcctgaagaatgcaggtgtcggcttt
tccatcaccagcagcaccttcagctcctccaccaagaaactcaaggagaagctgtgggtg
ctgagtaaggcaggcatgtcgggcgcgcgttctcgcaacatcgtctacttctccctttct
gacgaggaggagatgtggcaggcagggtgtctgggggatgacagccctatcactctcctg
tccactgtggtcaagtacaacagccagtacctgaacatgcggacgctgcaggagcatgcg
gatctgatgtatggtgacatcgagctgctcaaagacccccaaaaccagccctactttgcc
cggacggacagcgtcaagcgggagagtcggagcggctccaccagagtgtgtcacgggaag
atctaccatgagcattcccggggacacaaacagtgcccttactgcctcctctacaagtac
atgtacatccaccggccgcccacccaaatggaggccaagtcccccttctacctgactgcc
aggaaggaggccacagacatgggcagcgtgtggtatgaggagcagaggatggggctgcgc
tctcttcggggaattgtcccaaacttagccaagaaggtcaagctggaaaactgtgagaac
ttcacctttgtctcgttcactcaggtctcccggaggcttggctcccacagctgctgccag
tga

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_8|233_aa
MSRQITGNEANRRLDPFQSIWNSRVRQTSSLVHSGETLNICGHRQICIGHAPSPTAGVLQ
GKPTVPQQQEWALHAPNQSGSFLIATVIASELSMLPISRSSHPVVSGAHSSQASVSTDSL
KLSLENCQSSPLGQIQWDVCTGATIWGKKEKAGDGTITSFTLLTASYIVQNDVSSVKKLW
LSKVISAFAQVRSEVALSPVIFQVQQDASVSTRVLLCPDSPNDYVYENAECGI

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_8|702_bp
atgtctaggcagataactggaaacgaggctaaccggaggttagatccattccagagcatc
tggaattccagggtgcgccagacctctagcttggtacacagcggggagacattgaacatt
tgtgggcataggcagatttgtattggccacgctccttccccaacagccggtgtccttcag
gggaagccaacagtgccccaacaacaggaatgggccttacatgctccgaaccagtcaggg
tcattccttattgccacagtgattgcctcagagttgtctatgctgcccatctccagatcc
tctcatcccgttgtctctggagcccactccagtcaagcttctgtctccacagactcactg
aaattatccttggaaaactgccagtcatctccacttggccaaatccaatgggatgtatgc
acaggtgcgaccatctgggggaagaaggaaaaagctggagacggtacaataacatcattc
acgttactgaccgcttcctacatcgttcagaatgatgtttccagtgttaagaaattatgg
ctaagcaaagtgatatcggcatttgcccaagtgaggagtgaagtagctctgagtccagtt
atctttcaagtacagcaagatgcttccgtttctacaagagtattactttgtccagattct
cctaatgactatgtctatgaaaatgctgagtgtggcatataa

>gi568815589f:112474081_112718913|GENSCAN_predicted_peptide_9|111_aa
SSDCKERRSRCPRVPGFQNKNRVAILAELDKEKRKLLMQNQSSTNHPGASIALSRPSLNK
DFRDHAEQQHIAAQQKAALQHAHAHSSGYFITQDSAFGNLILPVLPRLDPE

>gi568815589f:112474081_112718913|GENSCAN_predicted_CDS_9|336_bp
agcagcgattgtaaggagaggcggtcccggtgtcctcgggtcccaggttttcaaaacaaa
aatagagttgcaatcttggcagaactggacaaagagaaaagaaaactacttatgcagaac
cagtcttcaacaaatcatcctggagctagcattgcactctcgagaccctctcttaataag
gacttccgggatcacgctgagcagcagcatattgcagcccaacagaaggcagctttgcag
catgctcatgcacattcatctggatacttcatcactcaagactctgcatttgggaacctt
attcttcctgttttacctcgccttgacccagaatga