GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:13:18 Sequence gi568815580f:24327916_24550844 : 222929 bp : 41.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 600 721 122 2 2 82 100 34 0.728 2.37 1.02 Intr + 2421 2541 121 0 1 89 82 38 0.811 2.88 1.03 Intr + 7406 7598 193 1 1 38 103 173 0.931 11.94 1.04 Term + 9040 9128 89 2 2 109 44 65 0.907 1.04 1.05 PlyA + 9420 9425 6 1.05 2.00 Prom + 25681 25720 40 -3.65 2.01 Init + 25858 25937 80 0 2 83 75 37 0.463 2.48 2.02 Intr + 27691 27744 54 1 0 89 116 42 0.908 4.28 2.03 Intr + 28928 29007 80 2 2 38 88 81 0.033 1.38 2.04 Term + 29993 30198 206 0 2 35 39 220 0.035 8.35 2.05 PlyA + 31764 31769 6 1.05 3.00 Prom + 39064 39103 40 -3.95 3.01 Init + 47974 48136 163 0 1 60 44 181 0.680 8.86 3.02 Intr + 48579 48763 185 1 2 60 27 153 0.696 4.99 3.03 Intr + 49045 49199 155 0 2 35 58 115 0.031 1.15 3.04 Term + 70106 70289 184 2 1 52 39 216 0.928 9.13 3.05 PlyA + 71017 71022 6 1.05 4.00 Prom + 82813 82852 40 -5.65 4.01 Init + 98842 98877 36 0 0 83 94 34 0.561 3.57 4.02 Intr + 100004 100132 129 1 0 55 80 127 0.814 8.57 4.03 Intr + 102424 102469 46 0 1 57 92 8 0.053 -4.84 4.04 Intr + 112581 112703 123 1 0 66 75 136 0.984 9.74 4.05 Intr + 115134 115237 104 1 2 69 103 102 0.994 8.77 4.06 Intr + 117478 117551 74 0 2 68 109 30 0.968 0.39 4.07 Intr + 120178 120268 91 1 1 48 92 98 0.558 5.28 4.08 Intr + 121904 122038 135 1 0 18 95 88 0.781 2.34 4.09 Intr + 135448 135608 161 0 2 97 77 -18 0.012 -4.14 4.10 Intr + 136285 136423 139 1 1 76 44 127 0.865 6.65 4.11 Intr + 138822 138891 70 2 1 61 69 54 0.557 -1.36 4.12 Intr + 140873 141036 164 0 2 88 87 35 0.488 2.17 4.13 Intr + 148832 149096 265 1 1 87 39 174 0.588 8.56 4.14 Intr + 155065 155230 166 2 1 44 78 127 0.724 5.40 4.15 Intr + 157370 157475 106 2 1 33 66 39 0.114 -4.50 4.16 Term + 162735 163007 273 2 0 47 34 182 0.217 3.19 4.17 PlyA + 163149 163154 6 -0.45 5.09 PlyA - 163575 163570 6 1.05 5.08 Term - 163976 163817 160 1 1 109 43 120 0.926 6.03 5.07 Intr - 168017 167874 144 1 0 29 76 118 0.453 3.18 5.06 Intr - 171032 170907 126 1 0 27 51 119 0.239 0.97 5.05 Intr - 172141 172019 123 0 0 73 52 85 0.288 2.18 5.04 Intr - 175334 175170 165 1 0 59 77 65 0.159 0.65 5.03 Intr - 179431 179171 261 0 0 34 72 267 0.024 15.38 5.02 Intr - 180940 180468 473 1 2 12 82 382 0.032 20.95 5.01 Init - 181750 181604 147 1 0 105 84 31 0.768 4.55 5.00 Prom - 182033 181994 40 -6.05 6.00 Prom + 188290 188329 40 -7.85 6.01 Init + 189818 189876 59 1 2 61 94 65 0.110 5.25 6.02 Intr + 190750 190927 178 1 1 82 100 146 0.805 14.20 6.03 Term + 192888 192992 105 2 0 107 43 90 0.970 3.83 6.04 PlyA + 192999 193004 6 -1.75 7.05 PlyA - 193893 193888 6 1.05 7.04 Term - 194980 194448 533 2 2 -16 43 479 0.300 26.42 7.03 Intr - 195333 195243 91 0 1 63 75 62 0.161 1.05 7.02 Intr - 213365 213006 360 2 0 36 48 207 0.062 5.99 7.01 Intr - 216465 216303 163 2 1 98 68 125 0.974 10.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 28928 29014 87 2 0 38 103 77 0.803 2.37 S.002 Term + 29961 30198 238 0 1 59 39 252 0.855 12.06 S.003 Intr - 43693 43572 122 1 2 71 61 117 0.812 5.67 S.004 Init - 44593 44543 51 1 0 36 109 52 0.839 3.31 S.005 Init + 69388 69431 44 0 2 49 58 85 0.835 1.56 S.006 Init - 90469 90356 114 1 0 76 91 96 0.961 8.96 S.007 Term - 180940 180464 477 1 0 12 43 383 0.810 20.05 S.008 Term - 213365 213000 366 2 0 36 49 213 0.844 5.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:24327916_24550844|GENSCAN_predicted_peptide_1|174_aa GHRERLSGEAEFELALKARLGFTRRTRKTRALLAERIVCARIPFPKLLLIKILLDYNGIV INYHYYQDKKAQDKKTIMVMKTKELRHQEVMEFNQGHTIRVEELGFELKQASSRVHALIT TLPLKSYLQEKSNWQSDLCERMEYVGHLLITKGKDAFTVDHKVIRVNITNNETK >gi568815580f:24327916_24550844|GENSCAN_predicted_CDS_1|525_bp ggacacagggaaaggctctctggggaagcagagtttgagctagctcttaaagcaagacta gggtttaccagacggacaagaaagacaagagcacttctggcagagagaatagtatgtgca agaataccctttcccaaactcctgcttatcaaaatcttactagattataatggtattgtt attaactaccattattatcaagacaaaaaggctcaagataaaaagacaataatggtaatg aagacaaaggaactgaggcaccaagaggttatggaatttaaccaaggtcacacaatcaga gtggaggagctgggatttgaactcaagcaggctagttccagagtccatgctcttatcact acgcttcctctgaagtcatacttacaggaaaagagcaactggcaatccgatttgtgtgag agaatggaatatgtcggtcatttattaattacaaagggaaaagatgcctttacagtggac cacaaagtgatcagagttaacatcaccaataatgaaacaaagtga >gi568815580f:24327916_24550844|GENSCAN_predicted_peptide_2|139_aa MDEAGNHHTQQTIARTRNQTPHVLTHRTHPSTTQSAPSKDINSERWNVEDDTELPETQPN SDPNYSHSATSGFGAAYSTPKDNALIRMLEGTKGCLNIGVLASKDQQARIVTILAEVIEL DQQKKVGLYTLGMYVEEQV >gi568815580f:24327916_24550844|GENSCAN_predicted_CDS_2|420_bp atggatgaagctggaaaccatcatactcagcaaactatcgcaaggacgagaaaccaaaca ccgcatgttctcactcacaggacccatccaagtacaactcaatctgcaccatcaaaagat atcaacagtgaaagatggaatgtggaggatgacactgaacttccagaaactcaaccaaat agtgaccccaattatagccactctgccacctcaggttttggggcagcttattccacacct aaagataatgctctgattcgtatgctagaaggcaccaaaggctgcctgaacattggggtc cttgcatccaaggaccagcaggcaagaatagtcaccatcttagcagaagtaattgaactt gatcagcagaaaaaagtagggctgtacacattggggatgtacgtggaagaacaggtttag >gi568815580f:24327916_24550844|GENSCAN_predicted_peptide_3|228_aa MKPRTLAVSVTALKAARLELFVPPGGLVVSLASGVKLQTFAVSVTAHKSSVDPKNSGAQL ASPSGSRTGAAGGAACQSCTMRPHSSALGWWMGLGAVEQGAVLIEEAWAAQEPMEGRLSF HTSLQAEGAGSSLGQHRKGLPQCSDGLKGSSSAAKVGAQAEEARKRARFCQLLGTRKAFA LLPPHPEGRSGRPYPETTRNRPCEPVEDSRPQAKYKSWVNSLKSVIKQ >gi568815580f:24327916_24550844|GENSCAN_predicted_CDS_3|687_bp atgaagccgcggaccctcgcggtgagtgttacagctcttaaggcggcgcgtctggagttg tttgttcctcccggtgggctcgtggtctccctggcttcaggagtgaagctgcagaccttc gcagtgagtgttacagctcataaaagcagtgtggacccaaagaactcaggagcccagctg gcttcacccagtggatcccgcactggggctgcaggtggagctgcctgccagtcctgcacc atgcgcccgcactcctcagcccttgggtggtggatgggactgggcgctgtggagcagggg gcagtgctcatcgaggaggcttgggccgcacaggagcccatggaggggcgcctctccttc cacacctccctgcaagccgagggagccggctccagccttggccagcacagaaaggggctc ccacagtgcagcgatgggctgaagggctcctcaagtgccgccaaagtgggagcccaggca gaggaggcccgaaagcgagcgaggttttgccagctactagggacgcgtaaagccttcgcg ctcctcccaccacaccccgagggaagaagtggcaggccttacccggagactacaaggaac cgcccgtgtgagccagtggaagactcgcgtccccaggcaaaatataagtcctgggttaat agtctgaagtctgtcattaagcaatag >gi568815580f:24327916_24550844|GENSCAN_predicted_peptide_4|693_aa MAEGDAGSDQRQNEEIEAMAAIYGEEWCVIDDCAKIFCIRISDDIDDPKWTLCLQGKNVR IYQIALRKYICPDVKKKTEEEDVECEDDLILACQPESSLKALDFDISETRTEVEVEELPP IDHGIPITDRRSTFQAHLAPVVCPKQVKMVLSKLYENKKIASATHNIYAYRIYCEDKQTF LQDCEDDGETAAGGRLLHLMEILNVKNVMVVVSRWYGGILLGPDRFKHINNCARNILVEK NYTNSPLHPEVCIFNAGLTMTQSLGAFCNRIQVFWQCCSFSFAMLGNEPFVSAWKKAWTC SGAGDIPDQDSGQYWFLMRAVFLACRRLPSTCVLKRPFSECAQRERTNLVLMKKWEFLEV PDTFEVTQQSVISIPLYIPHTLFEWDFGKEICVFWLTTDYLLCTASVYNIVLISYDRYLS VSNAVSYRTQHTGVLKIVTLMVAVWVLAFLVNGPMILVSESWKDEGSECEPGFFSEWYIL AITSFLEFVIPVILVAYFNMNIYWSLWKRDHLRLGHPKGWGQLVLRLPHGVEGQPWRLQL VPRMGYIEVGGLLCTAAGEMSTHARSAKLLSTGSENDTLPVPSLASRSLCPSVLSLGSFP SCQSCLSDQMSQCDTEPERKSFLSMMQGTQHFDNPDGMWSSHGRNVSSGGLHNHCILQMG TGSAEASHPEGPRGGQGQVTTRATTQKRVAASG >gi568815580f:24327916_24550844|GENSCAN_predicted_CDS_4|2082_bp atggctgagggggacgcagggagcgaccagaggcagaatgaggaaattgaagcaatggca gccatttatggcgaggagtggtgtgtcattgatgactgtgccaaaatattttgtattaga attagcgacgatatagatgaccccaaatggacactttgcttgcagggcaagaacgtgcgg atttatcaaatagccttgaggaaatatatatgcccagatgtaaagaagaaaactgaagag gaagatgttgaatgtgaagatgatctcattttagcatgtcagccggaaagttcgcttaaa gcattggattttgatatcagtgaaactcggacagaagtagaagtagaagaattacctccg attgatcatggcattcctattacagaccgaagaagtacttttcaggcacacttggctcca gtggtttgtcccaaacaggtgaaaatggttctttccaaattgtatgagaataagaaaata gctagtgccacccacaacatctatgcctacagaatatattgtgaggataaacagaccttc ttacaggattgtgaggatgatggggaaacagcagctggtgggcgtcttcttcatctcatg gagattttgaatgtgaagaatgtcatggtggtagtatcacgctggtatggagggattctg ctaggaccagatcgctttaaacatatcaacaactgtgccagaaacatactagtggaaaag aactacacaaattcacctcttcatccagaggtctgcatatttaatgcagggctcaccatg acacagtccctgggagctttttgtaacaggattcaagttttctggcagtgctgttctttc tcctttgccatgctgggaaatgaaccttttgtctcagcctggaagaaggcgtggacgtgt tctggagctggggacatcccagatcaagattctggccaatattggttcctgatgagggct gtctttctggcttgcagacggctgccgtctacctgcgtcctcaagaggcctttctctgag tgcgcacagagagagagaactaatttggttctcatgaaaaaatgggaattcctggaagta cctgatacatttgaagtaactcaacaaagtgtgatctccattcctttgtacatccctcac acgctgttcgaatgggattttggaaaggaaatctgtgtattttggctcactactgactat ctgttatgtacagcatctgtatataacattgtcctcatcagctatgatcgatacctgtca gtctcaaatgctgtgtcttatagaactcaacatactggggtcttgaagattgttactctg atggtggccgtttgggtgctggccttcttagtgaatgggccaatgattctagtttcagag tcttggaaggatgaaggtagtgaatgtgaacctggatttttttcggaatggtacatcctt gccatcacatcattcttggaattcgtgatcccagtcatcttagtcgcttatttcaacatg aatatttattggagcctgtggaagcgtgatcatctcaggcttgggcatcccaagggatgg ggccagctggtgctcagactgccacatggggttgagggacagccgtggcggctgcagctg gtccctcggatggggtacattgaagtaggaggcttgttgtgcactgctgctggagagatg tcaactcacgctagaagtgctaaactgctgtcaacaggcagtgaaaatgacaccttgcca gttccaagcctagcctcaagaagcctgtgcccatctgttctatctcttggcagttttccc agctgccagagctgccttagtgaccagatgtcacaatgtgatacagagccagagaggaag tcatttttgtccatgatgcaaggaacccagcattttgacaacccagatggaatgtggagc tcccatggaagaaacgtgtcatctggaggcctgcataaccactgcattctccagatgggc actggaagcgctgaggcttcccacccagaaggtccacgtggtggacaagggcaagtgaca accagagccacgactcaaaagagggtggctgcttcaggctga >gi568815580f:24327916_24550844|GENSCAN_predicted_peptide_5|532_aa MGSKVVPGAGTSVCKCLGVKQLEVFLDYNLLSVLWNRSVVRQDSGRNCRTCYSQLSTVEH TLHRNCTCVDHEGEALISVQIFISPRRGTVERDLSEKNKGRAILLKSHRVHTQEMPFPCS KVGKGYLVSSGLFQHQAIHNEKPCRSAMYGDMFHTQQGHFKCIDYGEAFSPKDTPGQHQI IHTGEKPYVCTECEKTCTRSSNLIQHKDFTLEQALMCTAGVGNPTAEMSTLQGTRKFTTQ KGFMSGDNVREPLAVPLTLLSTRKFTLQKPYECIECGKVSAKECLVQHQKLTLNDIVGTL DGHSWNSKCTFQKNQSPTVPNTPSLRGAGWNDPSDSTQIHIAVFQASKSFFFKRLPCSIQ KCFPITSFPSGRSARFPLTQNKLLLGPHFWCPHSLAARPWWTRSMSHPEPLGEGSYLRSS LTHGFLTYEPGRAWQRWPPLLHEASFEVAQRGLEDPLSRMVPSHGCQVGASYQLGSESEI HVIYPRNPGSNWNIEARVGSMPSPFPLDVLIIHVIPACFLPWVIRVITDAVS >gi568815580f:24327916_24550844|GENSCAN_predicted_CDS_5|1599_bp atggggtcaaaggttgtccctggagcaggaaccagtgtctgcaaatgcttgggggtgaaa cagttggaagtgttcctagactacaatctcctttctgtcttgtggaacaggagtgtagtg aggcaggactcaggccggaactgcaggacatgctacagccagctgagcaccgtggaacac accctacacagaaactgtacatgtgtggaccatgagggagaagctttaatttcagtgcaa atcttcatcagccccagaaggggcacagtggagagagacctttctgaaaaaaacaagggc agggccatcttgttgaagagccacagagttcacactcaagagatgccatttccatgcagc aaggttggaaaaggctacctggtcagctctggcctcttccagcaccaagcaattcacaat gagaagccatgcaggagtgccatgtatggggacatgtttcatactcaacaaggacatttc aaatgcattgactatggagaagcattcagtcctaaagacactcctggccagcaccagata attcacactggagaaaagccttatgtgtgcactgaatgtgagaaaacctgcacaagaagt tccaatctcattcagcacaaagacttcactctggagcaagcccttatgtgtacagcaggt gtgggaaatcctacagcagaaatgtccaccttgcagggcaccagaaagttcacaacacag aaaggttttatgagtggggacaatgtgagagagcctttggctgtccctctaaccttgctc agcaccagaaagttcaccctgcaaaagccttatgaatgcatagaatgtgggaaggtttca gccaaggagtgccttgttcagcaccaaaagttgacactgaatgacattgtgggcactctg gatggccactcctggaactccaaatgcacatttcaaaaaaatcaatcacccaccgttcca aacacaccctcactcagaggagcaggatggaatgacccatctgattcaactcagattcat attgctgttttccaggccagcaaaagttttttcttcaaacgactgccgtgcagtatccaa aaatgctttccaattacatcattcccttctggaaggtctgcacgatttccactgactcag aataagctcctacttgggcctcacttctggtgtccccactccctggcagcaaggccttgg tggactcgatcgatgtcccacccagagccccttggcgagggctcttacctgcggtcttct ctaactcatggcttcttgacctatgagcctgggagggcttggcagagatggccacctctg ctccatgaagcatcatttgaggtggcccaacgaggactagaggatccactttcaagaatg gttccctcacatggctgccaagttggtgccagctatcagctgggatccgaatcagagatc catgttatttatccacgcaatcccggaagtaattggaatatagaagctcgagtgggttct atgccctctccatttcctctggacgtactaattatccatgttatacccgcttgcttctta ccatgggtaataagggttatcactgacgctgttagttaa >gi568815580f:24327916_24550844|GENSCAN_predicted_peptide_6|113_aa MQGLQPDQECRLEGNAAAPRCGDGLLLFHAAIATGTQHLFLKSVMQDFHSTACEQPHTQR PPAHTVLGPPPAQCTPAATERQPSWGISTKSVKSPRLPAGVGFKGLKDIPVAE >gi568815580f:24327916_24550844|GENSCAN_predicted_CDS_6|342_bp atgcagggcctgcagccagaccaggaatgtcgcctggagggcaatgcggcagcacccagg tgtggggatgggctgctgctgttccacgctgccatcgccaccgggacccaacatctcttc ctgaagtccgtgatgcaggattttcactccaccgcttgcgaacagccccacacccagcgc cctcccgcacacaccgtcttggggcctcctccagcgcagtgcactcctgctgccacggaa agacagccttcctggggcatcagcacaaagtctgtaaagtccccaaggcttcctgctggt gtgggcttcaaaggcttgaaggatatcccagtggcagagtaa >gi568815580f:24327916_24550844|GENSCAN_predicted_peptide_7|382_aa XSSGVTIMQIGQSKSAKNAETDLQMLPNSSVQQPSTVLIWSNGDIQWSIKLHLDEKKEGK TTEAASNYQSHPNVGQPMPFTDKIMKDWIENWPSKCRVKCKRESGVRQIFEGHQILDNAR NKAYHVTMTRTMTRPWDGPGVSRVSRGETLEPKRPAEESDDLSSPREHSGPGVTENDLPD ACAHNAKDFSERKSKKLKLDSWWLEDLRHVQVLASSHTSSSHQCYTASRNPRDDQQVHDG PNPHLGELTLEGIKQPFVAVEREEWKFDIPCDLYEMLTVIQVVIFCNTKRKVDWLMEKMR DAALTVSSMHRDTPQKGPQSVSKEFPSGTSTGLISTDVWAGRSDVPQVSVIINYDLPKKQ NCTHTEFGDQVDMAGRLWPLTL >gi568815580f:24327916_24550844|GENSCAN_predicted_CDS_7|1149_bp nnaagctctggggtgactataatgcaaattggacaaagcaagagtgctaaaaatgcagag acagaccttcaaatgctcccaaattctagcgtccagcagccgtccacagtacttatatgg agcaatggtgacatccagtggtcaattaagttacacctggatgagaaaaaagaggggaaa acaacagaagctgcttccaattatcaatcacatccaaatgtgggccaacccatgccattt accgacaaaataatgaaggattggatagagaactggccttcgaagtgcagagtgaaatgc aagagagaatccggtgtcaggcagatctttgaaggtcaccagattcttgataatgccaga aacaaggcctaccacgtgaccatgacgagaacgatgaccaggccctgggacggccctggt gtgagcagagtaagcagaggtgaaaccttggagcccaaaaggccggcagaagaaagtgat gatttaagctcaccaagagagcattcaggtcctggggtaactgagaatgatctgccagat gcatgtgctcataatgcaaaggatttctctgaaaggaaaagcaagaaactgaagctggac tcttggtggctggaagatttgcgacatgttcaggtacttgcctccagtcacacaagtagt tctcatcagtgttacactgcctcgcgaaatcctagagatgaccaacaagtacatgatgga cccaatccacatcttggtgaattgactctggaaggcatcaaacagccttttgtggcagtg gaaagagaagagtggaaattcgacatcccatgtgatctctacgaaatgctaactgttatt caggtggtcatcttctgtaacaccaaaaggaaagttgactggctgatggagaaaatgaga gatgctgctctcaccgtgtcctcgatgcacagagacacacctcagaaagggccacagtcc gtcagtaaggagttcccatccggcaccagcacagggctcatttccacagatgtctgggcc gggaggtcagatgtccctcaggtgtccgtcatcattaactatgacctgcccaaaaaacag aattgtactcacacagaatttggagatcaggtcgatatggctggaaggctgtggccatta actttgtaa