GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:31:50 Sequence gi568815594f:73640659_73842033 : 201375 bp : 37.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 Intr - 5827 5632 196 1 1 101 100 72 0.355 7.77 1.10 Intr - 9417 9317 101 1 2 75 65 45 0.372 -0.19 1.09 Intr - 12317 12132 186 0 0 33 36 172 0.020 5.24 1.08 Intr - 17869 17729 141 2 0 85 84 142 0.675 12.90 1.07 Intr - 18452 18295 158 1 2 19 41 82 0.036 -4.67 1.06 Intr - 25881 25767 115 1 1 78 50 117 0.205 5.49 1.05 Intr - 27835 27397 439 1 1 22 91 247 0.291 10.56 1.04 Intr - 28192 27983 210 1 0 41 55 127 0.392 2.89 1.03 Intr - 29514 29241 274 2 1 54 25 169 0.178 3.82 1.02 Intr - 29755 29593 163 2 1 81 28 119 0.193 3.31 1.01 Init - 41327 41279 49 2 1 68 111 38 0.059 5.26 1.00 Prom - 46811 46772 40 -4.25 2.00 Prom + 47592 47631 40 -0.95 2.01 Init + 52188 52237 50 2 2 25 99 64 0.057 1.67 2.02 Intr + 60863 60983 121 2 1 120 99 -4 0.080 3.48 2.03 Intr + 66878 66945 68 1 2 64 105 35 0.026 -0.42 2.04 Intr + 67351 67444 94 1 1 59 76 97 0.093 4.65 2.05 Intr + 90735 90839 105 2 0 101 -6 99 0.076 1.29 2.06 Term + 93845 93886 42 1 0 78 47 80 0.108 -0.92 2.07 PlyA + 93994 93999 6 1.05 3.00 Prom + 99874 99913 40 -3.75 3.01 Init + 100001 100064 64 1 1 90 121 120 0.999 15.08 3.02 Intr + 100884 101019 136 1 1 116 92 73 0.999 9.21 3.03 Intr + 101291 101374 84 2 0 93 107 74 0.974 7.82 3.04 Term + 104106 104184 79 0 1 106 54 45 0.858 -0.84 3.05 PlyA + 104398 104403 6 1.05 4.00 Prom + 108720 108759 40 -2.95 4.01 Init + 121405 121468 64 0 1 78 55 68 0.169 3.86 4.02 Term + 123296 123435 140 0 2 96 54 51 0.201 -0.36 4.03 PlyA + 123705 123710 6 1.05 5.03 PlyA - 124058 124053 6 1.05 5.02 Term - 125316 124868 449 1 2 58 48 246 0.122 11.99 5.01 Init - 130605 130368 238 0 1 62 99 103 0.212 7.02 5.00 Prom - 138376 138337 40 -5.65 6.00 Prom + 140965 141004 40 -5.45 6.01 Init + 141021 141104 84 2 0 82 116 -16 0.440 1.47 6.02 Term + 147553 147984 432 0 0 -328 54 962 0.573 45.51 6.03 PlyA + 148704 148709 6 1.05 7.00 Prom + 155664 155703 40 -2.35 7.01 Init + 159287 159477 191 1 2 64 88 60 0.288 2.06 7.02 Term + 162194 162311 118 2 1 92 48 119 0.379 5.33 7.03 PlyA + 162862 162867 6 1.05 8.00 Prom + 163586 163625 40 -2.55 8.01 Init + 167774 167836 63 1 0 62 116 56 0.859 7.00 8.02 Term + 184982 185044 63 1 0 87 36 69 0.021 -1.49 8.03 PlyA + 185846 185851 6 1.05 9.02 PlyA - 185980 185975 6 1.05 9.01 Sngl - 186405 186019 387 0 0 37 45 228 0.556 9.36 9.00 Prom - 194016 193977 40 -3.65 10.00 Prom + 194315 194354 40 -7.05 10.01 Init + 196093 196201 109 0 1 70 81 160 0.988 11.93 10.02 Intr + 196306 196438 133 2 1 94 78 134 0.830 11.78 10.03 Intr + 196545 196628 84 0 0 97 68 58 0.591 2.72 10.04 Term + 198359 198392 34 2 1 88 51 49 0.419 -2.72 10.05 PlyA + 199694 199699 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 16775 16561 215 0 2 34 37 149 0.845 0.81 S.002 Sngl + 59802 59987 186 2 0 46 40 205 0.861 6.33 S.003 Term - 133597 133387 211 0 1 27 36 193 0.869 3.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_1|678_aa MFKPESLKSGTAQRSFDSARVTDSPGVTGYLEAWFLFYKPGSLEASGSLEAWSTVAGGSL DFRPMKDLLSKAIAQAREYGDPKAWQFPVILQPPCRPAAQNQLQPADVAQQAADLAANSR SVLISCFSAVQKAVAAAGLVIAPEKIQTSSPYHYLGMQLEDKDRPWWNEPIKHSKINFEN RTLKQRDAATPHAQLNLAPFTLNFLNLARNQPFTVAEQHFTGSKFDPQKGMRMKRSQKQK AKTPQKPRRKFRLLTHKLHDLDIETKSRCVTYNTTRSTPPTWGQIKVLSHHTENLLKEKG IPKTTSNIILAVFMVVSAAVSIPPVGATQNYTHWAYVPFPPLIWSVSWMDSSVEVYTNNS AFMPVPNDDRFPAQPEEEVCRCLQGVRKQVQSQRQAMMVMAILVNKKGADVGGKPIRIAT LVYLPSKLGKRAPRGTKVDLLSSVHGYGVPPNITLNQRHTQETKELAHGRTSRQEKESPS WLGNADHGFDPDHQEDVEGDRPTDTYDRGRKEYAGTALTLSHGRVPECEGEQASAQAQSS AVKVQKWFVSSHRKEKQQTVVNTLFQSLVPEPATSGCSGLYIVQDRKKTLHFAHKLQVCW ALQGCGTETQLSQQPLLCTWERDGVAIVSLSIELSAALSQWKAERGCTQLMPARGANIPG RPSPEGNCHPSSQSLNSX >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_1|2034_bp atgtttaagcctgaaagtctgaaatctgggacagcacagagatcttttgactcggctaga gtcacggactcccctggagtcacaggataccttgaggcctggttccttttctacaaaccg ggttcccttgaagcctcaggctcccttgaagcctggtccacagttgccggtggcagcctg gatttcaggcccatgaaagacctgctcagcaaggctatagcacaggcaagagaatatggg gatcccaaagcctggcaatttcctgtaattttacagcctccctgccgtcctgcggcacaa aatcagctacagcccgctgatgttgctcagcaagcagctgatctcgcggcaaattctcgc tctgtgttaatatcttgtttttctgcagtacaaaaagcagttgcagcagccggtttagtt attgccccagaaaaaatccaaacttcttctccttatcactatttaggaatgcagctagaa gacaaggacaggccttggtggaacgagccaataaaacactcaaagatcaacttcgaaaac aggacactaaaacagagggatgctgccactcctcatgctcagttaaatctggctcctttt acactaaatttcttaaacctagcaagaaatcaaccttttacagtggcagaacaacatttc actggaagtaaatttgatccacaaaaaggaatgcggatgaaacgttcccagaaacaaaag gcaaagacccctcaaaaaccgaggaggaaatttcgcctcctgacacataaacttcatgac ctcgacattgagacaaaatctcgctgtgtgacctacaacaccactcgttcaactccacca acatggggtcaaataaaggtcttatcccatcacacagaaaatttattaaaagaaaaagga atcccaaaaacgaccagtaatataattctggctgtctttatggtagtcagtgcagcagta agtataccaccagttggggcaactcaaaattatacccactgggcatatgttccttttccc cctttaatttggtctgtctcctggatggactcctcagtagaagtttacactaataatagt gcattcatgccggtccctaatgatgataggtttccagctcaaccggaagaagaagtctgc agatgcctccaaggagtccggaaacaagtccaaagtcaacgacaagcaatgatggtgatg gcgatcctagttaataaaaagggggcagatgtgggtggcaagccaatcaggattgctaca cttgtctatttaccatcaaaattgggcaagagagcaccaagaggcactaaagtagatctt ctgagttcagtgcatgggtatggggtgccacctaacatcacgcttaatcagagacacaca caggagacaaaggaattggctcatggccgcaccagcaggcaagaaaaggagtcaccatca tggcttggtaatgctgatcacggatttgaccccgaccatcaggaggacgtagaaggagat aggcctactgatacatatgataggggcaggaaggaatatgctggcactgccctcactctc agccatggcagggtccctgaatgtgaaggcgagcaggccagcgcacaggcacagagttca gcagtgaaggtgcagaagtggtttgtgtcaagccatagaaaagagaagcagcaaacagtg gtcaacaccctattccaaagtctggttccagaaccagcaacatcaggatgctctggcctc tacatagttcaagaccgaaagaaaacactccattttgcccacaaactacaggtctgctgg gcattgcagggttgtgggactgagacacagttgtctcaacaaccacttctatgcacttgg gagagggatggggtagcgattgtgagcctttccattgaactcagtgctgccctgtcacag tggaaagcagaacggggctgtactcagcttatgcctgcacgtggagcaaacattccgggc aggccctcgccagaggggaattgccatcccagcagtcagagcttgaattccann >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_2|159_aa MSTVLSYSMGTGNIPTRYSSHAHGPINHMFTNHHLRDLPREVYLRPLTYTIPHNQLQHRS PVPGPGPVCDLLGTRMHSRRLSSMKLVPGAKTVGNRCFIALKISKVPAAKDLSECQIMSL IFQRHCNGSPHRVKAGILAMTRKAYLEPRRYRMVNSPVE >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_2|480_bp atgtctacagtattgagttattctatgggaaccggaaacattcctaccaggtacagttcc catgctcatggtccaataaaccacatgtttacaaatcaccatctcagagatcttcccagg gaggtctacctaagacccctaacttacaccataccccacaatcagctccagcacaggtcc ccagtccccgggcctggaccagtgtgtgacctattaggaaccaggatgcacagcaggaga ttgtcttccatgaaactggtccctggtgccaaaacggttgggaaccgctgctttatagca ctcaagataagcaaagtgcctgcagccaaggatctgtcagagtgtcagatcatgtcactg attttccaaagacactgcaatggttctccccacagagtgaaagctggaatccttgcgatg acccgcaaggcctatctagagcctagaagatatcggatggttaattcaccagtggaatga >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_3|120_aa MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPKFIKELRVIESGPH CANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKSPQQSPECDVPLPVSMCSPCSIPTYE >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_3|363_bp atgacttccaagctggccgtggctctcttggcagccttcctgatttctgcagctctgtgt gaaggtgcagttttgccaaggagtgctaaagaacttagatgtcagtgcataaagacatac tccaaacctttccaccccaaatttatcaaagaactgagagtgattgagagtggaccacac tgcgccaacacagaaattattgtaaagctttctgatggaagagagctctgtctggacccc aaggaaaactgggtgcagagggttgtggagaagtttttgaagagcccacaacagtcccca gagtgtgatgttccccttcctgtgtccatgtgttctccttgttcaattcccacctatgag tga >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_4|67_aa MVTLETQDVIRKEHGVGFKGTVSLSKWSSLMPALKWTAVSSTPAIQFSPHSLTHTLLQGV KSCWLSK >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_4|204_bp atggtaaccctagagacacaggatgtgattagaaaggaacatggagtaggatttaagggt actgtaagtctgagtaagtggagttcactcatgccagcactgaagtggacagctgtttcc agcactcctgcaatccagttctcgcctcattcactcacgcacacccttctgcaaggagtt aagagctgctggctgagtaaatga >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_5|228_aa MSFGSEEEYFPDPFTGLTTEVPHLLSQQLSTLVGGSTLSNEAGTGLHEPWNQLAALVPAG ANSTHLHLLCSTPFRRECTEVLVLEGGMLPPGDTTMISLNWKLRLSPGHLAFLLPLSQQA KKGVTLLAGVTDSDHQDEISLLLHNGGKEECTWNTGDPLGCLLVLPCPMIKVNGKLQRPN PDRTTNGPEPSGIKVSVTLPGKKPQPAEVLAEGKGNTEWVVEVVINTS >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_5|687_bp atgtcttttggtagtgaagaagaatatttccctgaccctttcacaggactcacaacagag gtgcctcatttactcagtcagcagctttcaactcttgtgggagggagcacgctatcaaat gaggcaggaactggattgcatgagccctggaaccagctggctgctttggtgccagcagga gcaaactccactcacttgcacctgctgtgttccacccctttcaggagggagtgcacagag gtcttagttctagagggaggaatgctgccaccaggagacacaactatgatttcattaaac tggaagttaagattgtcacctggccaccttgcgttcctcctacctctaagtcaacaggct aagaaaggagttacattgttggctggggtgactgactcagaccatcaagatgaaatcagc ctactactccacaatggaggtaaggaagagtgcacatggaatacaggagatcccttaggg tgtctcttagtattaccatgccctatgataaaggtcaatgggaaactacaacggcccaat ccagacaggactacaaatggcccagaaccttcaggaataaaggtttcggtcactctacca ggtaaaaaaccacaacctgctgaagtgcttgctgaaggcaaagggaatacagaatgggta gtagaagtagtcatcaataccagctaa >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_6|171_aa MGETAPMIQLPPPGPALDMWRLWGLQFKEKEEEEEKEEEEDEEKEEDEKEEDEEDEKKED EKEEEEKEDEEKEEEEKEEDEKEEDEEEEEKEKEEKKEEEKEEEEKEKEEEEEKEEEEEK EEEEKKKRQQSQSPFHTRISLFHSGSFTSNITKDFIEAKRKVELVTRVCML >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_6|516_bp atgggggaaacagctcccatgattcaattacccccacctggtcctgcacttgacatgtgg agattatggggattacaattcaaggagaaggaggaggaggaggaaaaggaggaggaggag gatgaggagaaggaggaggatgagaaagaggaggatgaggaggatgagaagaaggaggat gagaaggaggaggaagaaaaggaggatgaggagaaggaggaggaggagaaggaggaggat gagaaggaggaggatgaggaggaggaggagaaggagaaggaggagaaaaaggaggaggag aaggaggaggaggagaaggagaaggaggaggaagaggagaaggaggaggaagaggagaag gaggaggaggagaagaagaagagacaacagagtcagagtccctttcacactaggatttct ctctttcattctggaagttttacttctaatattacaaaggacttcattgaagccaagcgg aaagttgaacttgtgacaagagtgtgtatgctgtga >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_7|102_aa MPACELHQLAYLEYLVGVIGEELFLLEPMYKGWKLYLLLQMCNHKHKPIMITDNQGNMTP IGMSSFMSVEVELAEATDNPFSTAATAVPTLAATTLGKKQRV >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_7|309_bp atgcctgcctgtgaactccaccaactggcctatctagaatatctggtgggagtgattggt gaagagctgtttcttttggagccaatgtataaaggctggaagctgtatctacttcttcaa atgtgcaaccacaaacacaagcccataatgatcacagataatcagggaaacatgacacca ataggaatgagtagctttatgtctgttgaggtggagcttgcagaagcaactgacaacccc ttttccactgctgctacagcagtacccacccttgctgccacaacactggggaaaaaacaa agagtctga >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_8|41_aa MDTKKGAIDIRAFFKVEGGKKSLAKDSWRNRELAVLLTVTS >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_8|126_bp atggacacaaagaagggagcaatagacatcagagccttctttaaggtggagggtgggaag aagagtctagctaaagacagttggaggaacagggagttggcagtcctactcacggtaact tcatag >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_9|128_aa MKNYFLWMNKEWFLEMESTPGENAVMIVEMITKDTEYYIINFVDKATSGIERIDSNFGRG SAVDKMLPNNIMCYREIFCERKSQSMPQISLLTSFKKLPQPPQPSAANFLISEQPSTLRQ DSPPGKVL >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_9|387_bp atgaagaattacttcttatggatgaacaaagagtggtttcttgaaatggaatctactcct ggtgaaaatgctgtgatgattgttgaaatgataacaaaggatacggagtattacatcata aattttgttgataaagcaacatcaggaattgaaaggattgactccaattttggaagaggt tctgctgtggataaaatgctgccaaacaacataatgtgctatagagaaatcttttgtgaa aggaagagtcaatcaatgccacaaatctcattgctgacttcttttaagaaattgccacag ccacctcaaccatcagcagccaacttcctgatcagtgagcagccatcaacattgaggcaa gactctccaccaggaaaagtattatga >gi568815594f:73640659_73842033|GENSCAN_predicted_peptide_10|119_aa MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDRHEIKVAKHDE >gi568815594f:73640659_73842033|GENSCAN_predicted_CDS_10|360_bp atgagcctcccgtccagccgcgcggcccgtgtcccgggtccttcgggctccttgtgcgcg ctgctcgcgctgctgctcctgctgacgccgccggggcccctcgccagcgctggtcctgtc tctgctgtgctgacagagctgcgttgcacttgtttacgcgttacgctgagagtaaacccc aaaacgattggtaaactgcaggtgttccccgcaggcccgcagtgctccaaggtggaagtg gtagcctccctgaagaacgggaagcaagtttgtctggacccggaagccccttttctaaag aaagtcatccagaaaattttggacagacatgaaataaaagttgctaaacatgatgaatga