GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:01:25 Sequence gi568815580f:24360729_24577559 : 216831 bp : 41.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15161 15323 163 1 1 60 44 181 0.676 8.86 1.02 Intr + 15766 15950 185 2 2 60 27 153 0.694 4.99 1.03 Intr + 16232 16386 155 1 2 35 58 115 0.031 1.15 1.04 Term + 37293 37476 184 0 1 52 39 216 0.928 9.13 1.05 PlyA + 38204 38209 6 1.05 2.00 Prom + 50000 50039 40 -5.65 2.01 Init + 66029 66064 36 1 0 83 94 34 0.561 3.57 2.02 Intr + 67191 67319 129 2 0 55 80 127 0.814 8.57 2.03 Intr + 69611 69656 46 1 1 57 92 8 0.053 -4.84 2.04 Intr + 79768 79890 123 2 0 66 75 136 0.984 9.74 2.05 Intr + 82321 82424 104 2 2 69 103 102 0.994 8.77 2.06 Intr + 84665 84738 74 1 2 68 109 30 0.968 0.39 2.07 Intr + 87365 87455 91 2 1 48 92 98 0.558 5.28 2.08 Intr + 89091 89225 135 2 0 18 95 88 0.781 2.34 2.09 Intr + 102635 102795 161 1 2 97 77 -18 0.012 -4.14 2.10 Intr + 103472 103610 139 2 1 76 44 127 0.865 6.65 2.11 Intr + 106009 106078 70 0 1 61 69 54 0.557 -1.36 2.12 Intr + 108060 108223 164 1 2 88 87 35 0.488 2.17 2.13 Intr + 116019 116283 265 2 1 87 39 174 0.588 8.56 2.14 Intr + 122252 122417 166 0 1 44 78 127 0.724 5.40 2.15 Intr + 124557 124662 106 0 1 33 66 39 0.114 -4.50 2.16 Term + 129922 130194 273 0 0 47 34 182 0.217 3.19 2.17 PlyA + 130336 130341 6 -0.45 3.09 PlyA - 130762 130757 6 1.05 3.08 Term - 131163 131004 160 2 1 109 43 120 0.926 6.03 3.07 Intr - 135204 135061 144 2 0 29 76 118 0.453 3.18 3.06 Intr - 138219 138094 126 2 0 27 51 119 0.239 0.97 3.05 Intr - 139328 139206 123 1 0 73 52 85 0.288 2.18 3.04 Intr - 142521 142357 165 2 0 59 77 65 0.159 0.65 3.03 Intr - 146618 146358 261 1 0 34 72 267 0.024 15.38 3.02 Intr - 148127 147655 473 2 2 12 82 382 0.032 20.95 3.01 Init - 148937 148791 147 2 0 105 84 31 0.768 4.55 3.00 Prom - 149220 149181 40 -6.05 4.00 Prom + 155477 155516 40 -7.85 4.01 Init + 157005 157063 59 2 2 61 94 65 0.110 5.25 4.02 Intr + 157937 158114 178 2 1 82 100 146 0.805 14.20 4.03 Term + 160075 160179 105 0 0 107 43 90 0.970 3.83 4.04 PlyA + 160186 160191 6 -1.75 5.07 PlyA - 161080 161075 6 1.05 5.06 Term - 162167 161635 533 0 2 -16 43 479 0.300 26.42 5.05 Intr - 162520 162430 91 1 1 63 75 62 0.161 1.05 5.04 Intr - 180552 180193 360 0 0 36 48 207 0.062 5.99 5.03 Intr - 183652 183490 163 0 1 98 68 125 0.968 10.56 5.02 Intr - 205889 205812 78 1 0 63 94 47 0.168 0.55 5.01 Init - 209371 209052 320 1 2 46 68 172 0.719 7.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 36575 36618 44 1 2 49 58 85 0.835 1.56 S.002 Init - 57656 57543 114 2 0 76 91 96 0.961 8.96 S.003 Term - 148127 147651 477 2 0 12 43 383 0.810 20.05 S.004 Term - 180552 180187 366 0 0 36 49 213 0.842 5.82 S.005 Init - 183726 183713 14 0 2 75 91 12 0.914 -0.13 S.006 Term - 201938 201811 128 0 2 93 47 96 0.881 3.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:24360729_24577559|GENSCAN_predicted_peptide_1|228_aa MKPRTLAVSVTALKAARLELFVPPGGLVVSLASGVKLQTFAVSVTAHKSSVDPKNSGAQL ASPSGSRTGAAGGAACQSCTMRPHSSALGWWMGLGAVEQGAVLIEEAWAAQEPMEGRLSF HTSLQAEGAGSSLGQHRKGLPQCSDGLKGSSSAAKVGAQAEEARKRARFCQLLGTRKAFA LLPPHPEGRSGRPYPETTRNRPCEPVEDSRPQAKYKSWVNSLKSVIKQ >gi568815580f:24360729_24577559|GENSCAN_predicted_CDS_1|687_bp atgaagccgcggaccctcgcggtgagtgttacagctcttaaggcggcgcgtctggagttg tttgttcctcccggtgggctcgtggtctccctggcttcaggagtgaagctgcagaccttc gcagtgagtgttacagctcataaaagcagtgtggacccaaagaactcaggagcccagctg gcttcacccagtggatcccgcactggggctgcaggtggagctgcctgccagtcctgcacc atgcgcccgcactcctcagcccttgggtggtggatgggactgggcgctgtggagcagggg gcagtgctcatcgaggaggcttgggccgcacaggagcccatggaggggcgcctctccttc cacacctccctgcaagccgagggagccggctccagccttggccagcacagaaaggggctc ccacagtgcagcgatgggctgaagggctcctcaagtgccgccaaagtgggagcccaggca gaggaggcccgaaagcgagcgaggttttgccagctactagggacgcgtaaagccttcgcg ctcctcccaccacaccccgagggaagaagtggcaggccttacccggagactacaaggaac cgcccgtgtgagccagtggaagactcgcgtccccaggcaaaatataagtcctgggttaat agtctgaagtctgtcattaagcaatag >gi568815580f:24360729_24577559|GENSCAN_predicted_peptide_2|693_aa MAEGDAGSDQRQNEEIEAMAAIYGEEWCVIDDCAKIFCIRISDDIDDPKWTLCLQGKNVR IYQIALRKYICPDVKKKTEEEDVECEDDLILACQPESSLKALDFDISETRTEVEVEELPP IDHGIPITDRRSTFQAHLAPVVCPKQVKMVLSKLYENKKIASATHNIYAYRIYCEDKQTF LQDCEDDGETAAGGRLLHLMEILNVKNVMVVVSRWYGGILLGPDRFKHINNCARNILVEK NYTNSPLHPEVCIFNAGLTMTQSLGAFCNRIQVFWQCCSFSFAMLGNEPFVSAWKKAWTC SGAGDIPDQDSGQYWFLMRAVFLACRRLPSTCVLKRPFSECAQRERTNLVLMKKWEFLEV PDTFEVTQQSVISIPLYIPHTLFEWDFGKEICVFWLTTDYLLCTASVYNIVLISYDRYLS VSNAVSYRTQHTGVLKIVTLMVAVWVLAFLVNGPMILVSESWKDEGSECEPGFFSEWYIL AITSFLEFVIPVILVAYFNMNIYWSLWKRDHLRLGHPKGWGQLVLRLPHGVEGQPWRLQL VPRMGYIEVGGLLCTAAGEMSTHARSAKLLSTGSENDTLPVPSLASRSLCPSVLSLGSFP SCQSCLSDQMSQCDTEPERKSFLSMMQGTQHFDNPDGMWSSHGRNVSSGGLHNHCILQMG TGSAEASHPEGPRGGQGQVTTRATTQKRVAASG >gi568815580f:24360729_24577559|GENSCAN_predicted_CDS_2|2082_bp atggctgagggggacgcagggagcgaccagaggcagaatgaggaaattgaagcaatggca gccatttatggcgaggagtggtgtgtcattgatgactgtgccaaaatattttgtattaga attagcgacgatatagatgaccccaaatggacactttgcttgcagggcaagaacgtgcgg atttatcaaatagccttgaggaaatatatatgcccagatgtaaagaagaaaactgaagag gaagatgttgaatgtgaagatgatctcattttagcatgtcagccggaaagttcgcttaaa gcattggattttgatatcagtgaaactcggacagaagtagaagtagaagaattacctccg attgatcatggcattcctattacagaccgaagaagtacttttcaggcacacttggctcca gtggtttgtcccaaacaggtgaaaatggttctttccaaattgtatgagaataagaaaata gctagtgccacccacaacatctatgcctacagaatatattgtgaggataaacagaccttc ttacaggattgtgaggatgatggggaaacagcagctggtgggcgtcttcttcatctcatg gagattttgaatgtgaagaatgtcatggtggtagtatcacgctggtatggagggattctg ctaggaccagatcgctttaaacatatcaacaactgtgccagaaacatactagtggaaaag aactacacaaattcacctcttcatccagaggtctgcatatttaatgcagggctcaccatg acacagtccctgggagctttttgtaacaggattcaagttttctggcagtgctgttctttc tcctttgccatgctgggaaatgaaccttttgtctcagcctggaagaaggcgtggacgtgt tctggagctggggacatcccagatcaagattctggccaatattggttcctgatgagggct gtctttctggcttgcagacggctgccgtctacctgcgtcctcaagaggcctttctctgag tgcgcacagagagagagaactaatttggttctcatgaaaaaatgggaattcctggaagta cctgatacatttgaagtaactcaacaaagtgtgatctccattcctttgtacatccctcac acgctgttcgaatgggattttggaaaggaaatctgtgtattttggctcactactgactat ctgttatgtacagcatctgtatataacattgtcctcatcagctatgatcgatacctgtca gtctcaaatgctgtgtcttatagaactcaacatactggggtcttgaagattgttactctg atggtggccgtttgggtgctggccttcttagtgaatgggccaatgattctagtttcagag tcttggaaggatgaaggtagtgaatgtgaacctggatttttttcggaatggtacatcctt gccatcacatcattcttggaattcgtgatcccagtcatcttagtcgcttatttcaacatg aatatttattggagcctgtggaagcgtgatcatctcaggcttgggcatcccaagggatgg ggccagctggtgctcagactgccacatggggttgagggacagccgtggcggctgcagctg gtccctcggatggggtacattgaagtaggaggcttgttgtgcactgctgctggagagatg tcaactcacgctagaagtgctaaactgctgtcaacaggcagtgaaaatgacaccttgcca gttccaagcctagcctcaagaagcctgtgcccatctgttctatctcttggcagttttccc agctgccagagctgccttagtgaccagatgtcacaatgtgatacagagccagagaggaag tcatttttgtccatgatgcaaggaacccagcattttgacaacccagatggaatgtggagc tcccatggaagaaacgtgtcatctggaggcctgcataaccactgcattctccagatgggc actggaagcgctgaggcttcccacccagaaggtccacgtggtggacaagggcaagtgaca accagagccacgactcaaaagagggtggctgcttcaggctga >gi568815580f:24360729_24577559|GENSCAN_predicted_peptide_3|532_aa MGSKVVPGAGTSVCKCLGVKQLEVFLDYNLLSVLWNRSVVRQDSGRNCRTCYSQLSTVEH TLHRNCTCVDHEGEALISVQIFISPRRGTVERDLSEKNKGRAILLKSHRVHTQEMPFPCS KVGKGYLVSSGLFQHQAIHNEKPCRSAMYGDMFHTQQGHFKCIDYGEAFSPKDTPGQHQI IHTGEKPYVCTECEKTCTRSSNLIQHKDFTLEQALMCTAGVGNPTAEMSTLQGTRKFTTQ KGFMSGDNVREPLAVPLTLLSTRKFTLQKPYECIECGKVSAKECLVQHQKLTLNDIVGTL DGHSWNSKCTFQKNQSPTVPNTPSLRGAGWNDPSDSTQIHIAVFQASKSFFFKRLPCSIQ KCFPITSFPSGRSARFPLTQNKLLLGPHFWCPHSLAARPWWTRSMSHPEPLGEGSYLRSS LTHGFLTYEPGRAWQRWPPLLHEASFEVAQRGLEDPLSRMVPSHGCQVGASYQLGSESEI HVIYPRNPGSNWNIEARVGSMPSPFPLDVLIIHVIPACFLPWVIRVITDAVS >gi568815580f:24360729_24577559|GENSCAN_predicted_CDS_3|1599_bp atggggtcaaaggttgtccctggagcaggaaccagtgtctgcaaatgcttgggggtgaaa cagttggaagtgttcctagactacaatctcctttctgtcttgtggaacaggagtgtagtg aggcaggactcaggccggaactgcaggacatgctacagccagctgagcaccgtggaacac accctacacagaaactgtacatgtgtggaccatgagggagaagctttaatttcagtgcaa atcttcatcagccccagaaggggcacagtggagagagacctttctgaaaaaaacaagggc agggccatcttgttgaagagccacagagttcacactcaagagatgccatttccatgcagc aaggttggaaaaggctacctggtcagctctggcctcttccagcaccaagcaattcacaat gagaagccatgcaggagtgccatgtatggggacatgtttcatactcaacaaggacatttc aaatgcattgactatggagaagcattcagtcctaaagacactcctggccagcaccagata attcacactggagaaaagccttatgtgtgcactgaatgtgagaaaacctgcacaagaagt tccaatctcattcagcacaaagacttcactctggagcaagcccttatgtgtacagcaggt gtgggaaatcctacagcagaaatgtccaccttgcagggcaccagaaagttcacaacacag aaaggttttatgagtggggacaatgtgagagagcctttggctgtccctctaaccttgctc agcaccagaaagttcaccctgcaaaagccttatgaatgcatagaatgtgggaaggtttca gccaaggagtgccttgttcagcaccaaaagttgacactgaatgacattgtgggcactctg gatggccactcctggaactccaaatgcacatttcaaaaaaatcaatcacccaccgttcca aacacaccctcactcagaggagcaggatggaatgacccatctgattcaactcagattcat attgctgttttccaggccagcaaaagttttttcttcaaacgactgccgtgcagtatccaa aaatgctttccaattacatcattcccttctggaaggtctgcacgatttccactgactcag aataagctcctacttgggcctcacttctggtgtccccactccctggcagcaaggccttgg tggactcgatcgatgtcccacccagagccccttggcgagggctcttacctgcggtcttct ctaactcatggcttcttgacctatgagcctgggagggcttggcagagatggccacctctg ctccatgaagcatcatttgaggtggcccaacgaggactagaggatccactttcaagaatg gttccctcacatggctgccaagttggtgccagctatcagctgggatccgaatcagagatc catgttatttatccacgcaatcccggaagtaattggaatatagaagctcgagtgggttct atgccctctccatttcctctggacgtactaattatccatgttatacccgcttgcttctta ccatgggtaataagggttatcactgacgctgttagttaa >gi568815580f:24360729_24577559|GENSCAN_predicted_peptide_4|113_aa MQGLQPDQECRLEGNAAAPRCGDGLLLFHAAIATGTQHLFLKSVMQDFHSTACEQPHTQR PPAHTVLGPPPAQCTPAATERQPSWGISTKSVKSPRLPAGVGFKGLKDIPVAE >gi568815580f:24360729_24577559|GENSCAN_predicted_CDS_4|342_bp atgcagggcctgcagccagaccaggaatgtcgcctggagggcaatgcggcagcacccagg tgtggggatgggctgctgctgttccacgctgccatcgccaccgggacccaacatctcttc ctgaagtccgtgatgcaggattttcactccaccgcttgcgaacagccccacacccagcgc cctcccgcacacaccgtcttggggcctcctccagcgcagtgcactcctgctgccacggaa agacagccttcctggggcatcagcacaaagtctgtaaagtccccaaggcttcctgctggt gtgggcttcaaaggcttgaaggatatcccagtggcagagtaa >gi568815580f:24360729_24577559|GENSCAN_predicted_peptide_5|514_aa MLEGFLEVICSSPSSALLKAATQRGCQVTFLREHRCTGADQKLEAKPPACWNGGLSQTGN VASTCLLMIQPFLGSLTQRGSLSYRDNAAVLYPSQTLPKRAILSFNRRQRKHARLFAFDL HDVNRQPSPRKSRSSGVTIMQIGQSKSAKNAETDLQMLPNSSVQQPSTVLIWSNGDIQWS IKLHLDEKKEGKTTEAASNYQSHPNVGQPMPFTDKIMKDWIENWPSKCRVKCKRESGVRQ IFEGHQILDNARNKAYHVTMTRTMTRPWDGPGVSRVSRGETLEPKRPAEESDDLSSPREH SGPGVTENDLPDACAHNAKDFSERKSKKLKLDSWWLEDLRHVQVLASSHTSSSHQCYTAS RNPRDDQQVHDGPNPHLGELTLEGIKQPFVAVEREEWKFDIPCDLYEMLTVIQVVIFCNT KRKVDWLMEKMRDAALTVSSMHRDTPQKGPQSVSKEFPSGTSTGLISTDVWAGRSDVPQV SVIINYDLPKKQNCTHTEFGDQVDMAGRLWPLTL >gi568815580f:24360729_24577559|GENSCAN_predicted_CDS_5|1545_bp atgctggaagggttcttagaggttatctgctccagtccctcctctgctcttctgaaagct gccacccagagagggtgtcaagtgaccttcctacgagaacacagatgcacaggggcagat caaaaattggaggccaagcctcctgcgtgctggaatgggggcttgtcccagacaggaaat gtagcctctacatgcctgctgatgatacagccctttctgggctctctcactcagagggga agtttatcctacagggataatgcagcagtgttgtacccgagccagacactgccaaagagg gccattctttcattcaacaggagacagcgcaagcatgccaggctctttgcatttgacctt catgatgtgaacaggcagccctctcccaggaagtccagaagctctggggtgactataatg caaattggacaaagcaagagtgctaaaaatgcagagacagaccttcaaatgctcccaaat tctagcgtccagcagccgtccacagtacttatatggagcaatggtgacatccagtggtca attaagttacacctggatgagaaaaaagaggggaaaacaacagaagctgcttccaattat caatcacatccaaatgtgggccaacccatgccatttaccgacaaaataatgaaggattgg atagagaactggccttcgaagtgcagagtgaaatgcaagagagaatccggtgtcaggcag atctttgaaggtcaccagattcttgataatgccagaaacaaggcctaccacgtgaccatg acgagaacgatgaccaggccctgggacggccctggtgtgagcagagtaagcagaggtgaa accttggagcccaaaaggccggcagaagaaagtgatgatttaagctcaccaagagagcat tcaggtcctggggtaactgagaatgatctgccagatgcatgtgctcataatgcaaaggat ttctctgaaaggaaaagcaagaaactgaagctggactcttggtggctggaagatttgcga catgttcaggtacttgcctccagtcacacaagtagttctcatcagtgttacactgcctcg cgaaatcctagagatgaccaacaagtacatgatggacccaatccacatcttggtgaattg actctggaaggcatcaaacagccttttgtggcagtggaaagagaagagtggaaattcgac atcccatgtgatctctacgaaatgctaactgttattcaggtggtcatcttctgtaacacc aaaaggaaagttgactggctgatggagaaaatgagagatgctgctctcaccgtgtcctcg atgcacagagacacacctcagaaagggccacagtccgtcagtaaggagttcccatccggc accagcacagggctcatttccacagatgtctgggccgggaggtcagatgtccctcaggtg tccgtcatcattaactatgacctgcccaaaaaacagaattgtactcacacagaatttgga gatcaggtcgatatggctggaaggctgtggccattaactttgtaa