GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:26:09 Sequence gi568815597r:31160075_31396751 : 236677 bp : 46.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 3513 3508 6 1.05 1.12 Term - 6139 6122 18 1 0 100 46 26 0.404 -2.08 1.11 Intr - 9568 9439 130 0 1 66 106 30 0.417 3.30 1.10 Intr - 21966 21786 181 1 1 105 44 181 0.170 14.33 1.09 Intr - 22516 22456 61 1 1 85 68 188 0.999 14.81 1.08 Intr - 23940 23743 198 0 0 105 92 237 0.995 25.35 1.07 Intr - 25253 25173 81 2 0 113 115 25 0.993 7.53 1.06 Intr - 28113 27976 138 0 0 118 68 323 0.988 33.96 1.05 Intr - 30003 29896 108 0 0 115 17 68 0.673 2.88 1.04 Intr - 48996 48875 122 1 2 30 78 87 0.012 2.11 1.03 Intr - 60941 60808 134 1 2 99 -8 124 0.058 4.19 1.02 Intr - 65832 65798 35 0 2 53 82 61 0.035 -0.98 1.01 Init - 79473 79420 54 0 0 101 107 107 0.950 13.80 1.00 Prom - 95998 95959 40 -2.16 2.08 PlyA - 96009 96004 6 1.05 2.07 Term - 109374 109253 122 1 2 101 44 48 0.543 0.34 2.06 Intr - 111425 111305 121 2 1 102 93 91 0.990 11.07 2.05 Intr - 121422 121300 123 0 0 87 95 205 0.999 21.88 2.04 Intr - 129345 129180 166 2 1 101 98 64 0.984 8.66 2.03 Intr - 131932 131839 94 2 1 77 109 101 0.975 10.12 2.02 Intr - 133274 133145 130 2 1 84 45 108 0.932 6.37 2.01 Init - 136677 136537 141 0 0 90 115 89 0.936 11.93 2.00 Prom - 158929 158890 40 -3.06 3.00 Prom + 162547 162586 40 -6.66 3.01 Init + 164686 164746 61 0 1 83 16 120 0.607 5.71 3.02 Intr + 177101 177201 101 0 2 106 88 37 0.827 5.33 3.03 Intr + 178883 178974 92 1 2 93 80 61 0.882 4.59 3.04 Intr + 186566 186666 101 2 2 62 97 29 0.879 0.95 3.05 Intr + 188755 188900 146 2 2 116 63 90 0.983 9.30 3.06 Term + 203958 204119 162 2 0 83 54 203 0.987 14.34 3.07 PlyA + 205407 205412 6 -1.95 4.07 PlyA - 205602 205597 6 1.05 4.06 Term - 205865 205812 54 2 0 123 43 35 0.195 0.06 4.05 Intr - 207420 207319 102 0 0 82 99 98 0.839 10.67 4.04 Intr - 208096 207962 135 1 0 48 27 156 0.982 6.26 4.03 Intr - 209483 209311 173 0 2 114 115 136 0.978 18.56 4.02 Intr - 212952 212868 85 0 1 35 97 147 0.822 9.69 4.01 Init - 213899 213777 123 2 0 106 39 94 0.772 6.49 4.00 Prom - 217541 217502 40 -6.26 5.00 Prom + 221749 221788 40 -2.96 5.01 Sngl + 226607 227608 1002 1 0 56 43 401 0.678 29.35 5.02 PlyA + 229998 230003 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 60941 60791 151 1 1 99 46 133 0.834 7.58 S.002 Init - 65820 65798 23 0 2 98 82 43 0.930 3.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:31160075_31396751|GENSCAN_predicted_peptide_1|419_aa MGKCSGRCTLVAFCCLQLCPTSMEVAVTEIRYYAVVILLYFHIKSLSLDWELLEAWKCPL SVDMECGPQKELSEFLAQWDGLPVKATEGIFPVKGELDGSQDVSYPIGSTPTPWELGLFV ANDKNQQDKEGEPALGMAVFRGLKDGSGHSLVAALERQIFDFLGYQWAPILANFLHIMAV ILGIFGTVQYRSRYLILYAAWLVLWVGWNAFIICFYLEVGQLSQDRDFIMTFNTSLHRSW WMENGPGCLVTPVLNSRLALEDHHVISVTGCLLDYPYIEALSSALQIFLALFGFVFACYV SKVFLEEEDSWWESCPQILPLQDPVSLSSPPPPIPNMRRPSSAVDFIGGFDSYGYQAPQK TSHLQLQPLYTNCPRYWATAVNRTDKALDFFLTAYLLDRININALVSLPNNPKKDNDNP >gi568815597r:31160075_31396751|GENSCAN_predicted_CDS_1|1260_bp atgggcaagtgcagcgggcgctgcacgctggtcgccttctgctgcctgcagctgtgtccc accagcatggaggtggcagtcaccgaaatcaggtactatgctgttgtaattctgctctat ttccacatcaaatccctctccctggactgggagctccttgaggcctggaagtgtccactg agtgtggacatggagtgtggccctcagaaggaactcagtgaattcttggcccagtgggat ggtcttcctgttaaagccacggagggcatcttccctgtgaagggagagcttgatggctcc caggatgtcagttaccccattggatccactcccaccccctgggagttaggactctttgtt gcaaatgacaaaaatcagcaggacaaggagggagaaccagctttaggcatggctgtattc agaggcctgaaagatggctcaggacattcccttgtggctgcgctggagcggcagatcttt gacttcctgggctaccagtgggctcccatcctagccaacttcctgcacatcatggcagtc atcctgggcatctttggcaccgtgcagtaccgctcccggtacctcatcctgtatgcagcc tggctggtgctctgggttggctggaatgcatttatcatctgcttctacttggaggttgga cagctgtcccaggaccgggacttcatcatgaccttcaacacatccctgcaccgctcctgg tggatggagaatgggccaggctgcctggtgacacctgttctgaactcccgcctggctctg gaggaccaccatgtcatctctgtcactggctgcctgcttgactacccctacattgaagcc ctcagcagcgccctgcagatcttcctggcactgttcggcttcgtgttcgcctgctacgtg agcaaagtgttcctggaggaggaggacagctggtgggagtcctgccctcagattcttccc cttcaggacccagtctccctctcctcgcccccgccgccgatccctaacatgcgccgcccc tcttccgcagttgacttcatcggcggctttgactcctacggataccaggcgccccagaag acgtcgcatttacagctgcagcctctgtacacgaactgtcctagatactgggctacagca gtgaacaggacagacaaggccctagatttctttcttacagcttacctactggacagaatc aacataaatgctcttgttagtcttcctaacaatcctaagaaagacaatgacaacccctga >gi568815597r:31160075_31396751|GENSCAN_predicted_peptide_2|298_aa MIEQQKRKGPELPLVPVKRQRHELLLGAGSGPGAGQQQATPGALLQAGPPRCSSLQAPIM LLSGHEGEVYCCKFHPNGSTLASAGFDRLILLWNVYGDCDNYATLKGHSGAVMELHYNTD GSMLFSASTDKTVAVWDSETGERVKRLKGHTSFVNSCYPARRGPQLVCTGSDDGTVKLWD IRKKAAIQTFQNTYQVLAVTFNDTSDQIISGGIDNDIKVWDLRQNKLTYTMRGHADSVTG LSLSSEGSYLLSNAMDNTASPSYWDLDVFSCLKKKRALTGSSMSSWTLPTEETASAID >gi568815597r:31160075_31396751|GENSCAN_predicted_CDS_2|897_bp atgatagaacagcagaagcgtaagggcccagagttgccgctggttccagtcaagcggcag cggcatgagttgctgttgggagcggggtctggcccaggagccgggcagcagcaggcgacg ccgggagccttgctgcaagcgggacctccaagatgttcctcccttcaagccccaatcatg ctgctctctggacatgaaggggaagtctactgctgcaagttccaccccaacggatccacc ttagcatctgcaggatttgaccgactgatattactgtggaatgtctatggtgactgtgat aactatgccacactgaagggacacagtggagcagtgatggaattgcattacaacacagat ggcagtatgcttttctcagcatccacagataaaaccgtggctgtgtgggatagtgaaaca ggtgagagggttaaaaggctaaagggacatacttcctttgtgaattcctgttatccagcc aggagaggccctcagcttgtctgcactggcagtgacgatggcacagttaagctttgggac atccggaagaaagcagccatccagacatttcagaacacgtaccaggtgttagctgtgacc ttcaatgacacaagtgatcagattatttctggtggaatagacaatgatatcaaggtctgg gacctgcgccagaacaagctaacctacaccatgagaggccatgcagattcagtgactggc ctgagtttaagttctgaaggctcttatcttttgtccaatgcaatggacaatacagctagc cccagttattgggatttagatgtgttttcctgcttgaaaaagaaaagagccttaacaggt agcagtatgtcaagttggactttgcccaccgaggaaacagcatctgccattgattga >gi568815597r:31160075_31396751|GENSCAN_predicted_peptide_3|220_aa MGLQVPLGTEQLGVMDNMIDGLVHRTHMSSCRVDKPSEIVDVGDKVWVKLIGREMKNDRI KVSLSMKVVNQGTGKDLDPNNVIIEQEERRRRSFQDYTGQKITLEAVLNTTCKKCGCKGH FAKDCFMQPGGTKYSLIPDEEEEKEEAKSAEFEKPDPTRNPSRKRKKEKKKKKHRDRKSS DSDSSDSESDTGKRARHTSKDSKAAKKKKKKKKHKKKHKE >gi568815597r:31160075_31396751|GENSCAN_predicted_CDS_3|663_bp atgggcctgcaggtgcccctgggcacagaacaactgggcgtcatggacaacatgattgat ggtctggtccatcgaactcatatgtcatcctgtcgggtggataagccctctgagatagta gatgttggagataaagtgtgggtgaagcttattggccgagagatgaaaaatgatagaata aaagtatccctctccatgaaggttgtcaatcaagggactgggaaagaccttgatcccaac aatgttatcattgagcaagaagagaggcggaggcgatccttccaggattacactgggcag aagatcacccttgaggctgtcttgaacactacctgcaagaagtgtggctgtaaaggccac tttgcaaaagattgtttcatgcaaccaggtgggactaaatactctctgatacctgatgag gaagaggaaaaggaagaggcaaagtcagcagagtttgagaagcctgaccctacaaggaat ccttctagaaaaagaaagaaggagaagaagaaaaagaaacatagagataggaagtcatct gactctgacagctcagactctgagagtgatacaggcaagagggcaaggcacacatcaaaa gacagcaaggcagcaaagaagaagaaaaagaagaagaagcacaagaagaagcacaaggag tga >gi568815597r:31160075_31396751|GENSCAN_predicted_peptide_4|223_aa MGLEELTELAEFLQEGSWERPELKLGAVRRNRKSPEHLQGRPSITMVDAFLGTWKLVDSK NFDDYMKSLGVGFATRQVASMTKPTTIIEKNGDILTLKTHSTFKNTEISFKLGVEFDETT ADDRKVKERELEELPNENQLMYVCWQHPELRNHFKGIQSQDFVVAALLLAKESIVTLDGG KLVHLQKWDGQETTLVRELIDGKLILTLTHGTAVCTRTYEKEA >gi568815597r:31160075_31396751|GENSCAN_predicted_CDS_4|672_bp atgggccttgaagagctgaccgaattggcagaatttctgcaggaggggagctgggaacga cctgagctaaagctcggagctgtgcgaagaaaccggaaaagcccagagcacttgcagggg cggcccagcatcactatggtggacgctttcctgggcacctggaagctagtggacagcaag aatttcgatgactacatgaagtcactcggtgtgggttttgctaccaggcaggtggccagc atgaccaagcctaccacaatcatcgaaaagaatggggacattctcaccctaaaaacacac agcaccttcaagaacacagagatcagctttaagttgggggtggagttcgatgagacaaca gcagatgacaggaaggtcaaggaaagagagctggaagagctgccaaatgagaaccagctg atgtatgtatgctggcagcacccagagctgaggaaccacttcaagggcatccagtcacag gactttgtggttgctgccctcttgttggctaaagagtccattgtgacactggatggaggg aaacttgttcacctgcagaaatgggacgggcaagagaccacacttgtgcgggagctaatt gatggaaaactcatcctgacactcacccacggcactgcagtttgcactcgcacttatgag aaagaggcatga >gi568815597r:31160075_31396751|GENSCAN_predicted_peptide_5|333_aa MHIPRHIHTDNFTIHTPRHIHTDDITHTPRPIHTDDLTIHNPRHIHTDDLTHTPRHNHTD DLTHTPRHIHTDDHTIDTPRHIYTDDHTIDTPRHIHTEDLTIHTQTHPHRRPHHPHPQTH PHRGPHHPHPDTSPRTTSPTPPDTSTQMTSPTPPGPSTQMTSPSTPPDTSTQTTSPTPPG TTTQTTSPTPPDTSTQMTTPSTPPDTSTQMTTPSTPPDTSTQMTTPSIPPDTSTQMTSPS TLRHIPTEDLTIHTQTHPHRPHYPHPDTSTQTTSPSTPRHIPTEDLTIHTQTHPHRGPHH PHPDTSPQRTSPSTPRHIHTDDLTIHTQTHPHG >gi568815597r:31160075_31396751|GENSCAN_predicted_CDS_5|1002_bp atgcacatacccaggcacatccacacagacaacttcaccatccacactcccagacacatc cacacagatgacatcacccacacccccaggcccatccacacagatgacctcaccatccac aacccaagacatatccacacagatgacctcacccacacccctaggcacaaccacacagac gacctcacccacacccccagacacatccacacagatgaccacaccatcgacacccccaga cacatctacacagatgaccacaccatcgacacccccagacacatccacacagaggacctc accatccacacccagacacatccacacagacgacctcaccatccacacccccagacacat ccacacagaggacctcaccatccacacccagacacatccccacggacgacctcacccaca cccccagacacttccacacagatgacatcacccacacccccaggcccatccacacagatg acctcaccatccacacccccagacacatccacacagacgacctcacccacacccccaggc acaaccacacagacgacctcacccacacccccagacacatccacacagatgaccacacca tcgacacccccagacacatccacacagatgaccacaccatcgacacccccagacacatcc acacagatgaccacaccatcgatacccccagacacatccacacagatgacctcaccatcc acactcagacatatccccacagaggacctcaccatccacacgcagacacatccccacaga cctcactatccacacccagacacatccacacagacgacctcaccatccacacccagacac atccccacagaggacctcaccatccacacccagacacatccccacagaggacctcaccat ccacacccagacacgtccccacagaggacctcaccatccacacccagacacatccacaca gacgacctcaccatccacacccagacacatccccacggatga