GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:17:00

Sequence gi568815586f:38216894_38421210 : 204317 bp : 36.18% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -     33     28    6                               1.05
 1.01 Sngl -  10869   9829 1041  0  0   42   42   353 0.539  22.97
 1.00 Prom -  10998  10959   40                             -10.15

 2.02 PlyA -  11046  11041    6                              -0.45
 2.01 Sngl -  11700  11179  522  0  0   44   42   284 0.901  15.20
 2.00 Prom -  19748  19709   40                              -4.05

 3.10 PlyA -  19766  19761    6                               1.05
 3.09 Term -  21018  20947   72  0  0  147   35    27 0.038   0.23
 3.08 Intr -  33158  33059  100  1  1   86  116    73 0.003   9.09
 3.07 Intr -  44905  44760  146  1  2   14   83   118 0.001   2.06
 3.06 Intr -  79591  79410  182  2  2   17   80   143 0.040   5.07
 3.05 Intr -  79962  79799  164  2  2   45   87    63 0.042   0.60
 3.04 Intr -  90150  90030  121  1  1  108   79     8 0.025   0.53
 3.03 Intr - 102049 101921  129  2  0   68  103    11 0.402   0.35
 3.02 Intr - 102398 102285  114  0  0  111   92    75 0.748   9.70
 3.01 Init - 106026 106002   25  0  1   59  115    40 0.721   3.54
 3.00 Prom - 121630 121591   40                              -4.55

 4.00 Prom + 121977 122016   40                              -6.45
 4.01 Init + 126582 126719  138  2  0   80   74   122 0.849  10.19
 4.02 Term + 128637 128780  144  2  0   20   48   165 0.947   2.63
 4.03 PlyA + 129709 129714    6                               1.05

 5.00 Prom + 129851 129890   40                              -7.85
 5.01 Init + 130057 130383  327  0  0   58   49   227 0.628  13.17
 5.02 Intr + 131936 132062  127  1  1   19   48   114 0.254  -0.17
 5.03 Term + 138158 138234   77  0  2   76   45   104 0.551   1.92
 5.04 PlyA + 138753 138758    6                               1.05

 6.03 PlyA - 139598 139593    6                               1.05
 6.02 Term - 156163 156054  110  2  2  123   48    44 0.603   1.59
 6.01 Init - 161314 161230   85  1  1   86  111    29 0.951   6.03
 6.00 Prom - 165216 165177   40                              -3.65

 7.03 PlyA - 165706 165701    6                               1.05
 7.02 Term - 168934 168528  407  2  2   29   42   239 0.924   7.76
 7.01 Init - 186426 186384   43  0  1   77   81    66 0.276   5.43
 7.00 Prom - 198134 198095   40                              -2.95

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  44407  44663  257  2  2   90   44   187 0.912   9.26

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:38216894_38421210|GENSCAN_predicted_peptide_1|346_aa
MITSIDAEKAFDKIQQPFMLKTRNKLGIDGTYLKIIRAIYDKPTANIILSEQKLEVFPLK
TGTRQGCPLSSLLFNVVLEVLSRAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIIS
AQNLLKLIGNFSKVSGYKINVRKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLK
RDVKDLFKENYKPLLNEIKEDTNKWKNIPFSWVGRINIVKMAILPKVIYRFNAIPIKLPM
TFFTELEKTTLKFIWNQKRAHIAKSILNQKNKAGGITLPDFKLYYKATVIKTAWYWCQNT
DIDQWNRTEPSEIMPHIYNHLIFDKPDKNKNGERIPYLINGAGKTG

>gi568815586f:38216894_38421210|GENSCAN_predicted_CDS_1|1041_bp
atgattacctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgcta
aaaactcgcaataaattaggtattgatgggacgtatctcaaaataataagagctatctat
gacaaacccacagccaatatcatactcagtgagcaaaagctggaagtattccctttgaaa
actggcacaagacagggatgccctctctcctcactcctatttaacgtagtgttggaagtt
ctgtccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa
gtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccatcatctca
gcccaaaatctccttaagctgataggcaacttcagtaaagtctcaggatacaaaatcaat
gtgcgaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaataaaatatctaggaatccaactaaaa
agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagag
gatacaaacaaatggaagaacattccattctcatgggtaggaagaatcaatattgtgaaa
atggccatactgcccaaggtaatttatagattcaatgccatccctatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cacattgccaagtcaatcctaaaccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactgtactacaaggctacagtaatcaaaacagcgtggtactggtgccaaaacaca
gatatagaccaatggaacagaacagagccctcagaaataatgccacatatctacaaccat
ctgatctttgacaaacctgacaaaaacaagaatggggaaaggattccctatttaataaat
ggtgctgggaaaactggctag

>gi568815586f:38216894_38421210|GENSCAN_predicted_peptide_2|173_aa
MVKGSIQQEELTILNIYAPNTQTHSKASRRQEITKIRAELKEIETPKTLQKINESRSWFF
EKINKTDRPLARLIKKKREKNQIDAIKNDKGDITTNPTEIQTTIIEYYKHLYANKLESLE
EMYKFLNTYIFPRLNKEEVESLNRPITGSEIEAIISRLPTKNSPGPVGFTAKF
>gi568815586f:38216894_38421210|GENSCAN_predicted_CDS_2|522_bp
atggtaaagggatcaattcaacaagaagagctaactatcctaaatatatatgcacccaat
acacaaacacattcaaaagctagcagaaggcaagaaataactaagatcagagcagaactg
aaggaaatagagacaccaaaaacccttcaaaaaatcaatgaatccaggagctggtttttt
gaaaagatcaacaaaactgatagaccactggcaagactaataaagaagaagagagagaag
aatcaaatagatgcaataaaaaatgataaaggggatatcaccaccaatcccacagaaata
caaactaccatcatagaatactataaacacctctatgcaaataaactagaaagtctagaa
gaaatgtataaattcctcaacacatacatcttcccaagactaaacaaggaagaagttgaa
tctctgaatagaccaataacaggctctgaaattgaggcaataattagtagattgccaacc
aaaaacagtccaggaccagttggattcacagccaaattctaa

>gi568815586f:38216894_38421210|GENSCAN_predicted_peptide_3|350_aa
MDDRGGHYGLIPHPTQPYPFIRTQLWYHLLCEDLLNATYSSPSQKSNEDMEPLRVNLPTV
TKLVKWQGENSNPGNRPPGPALLPHQKSSATSIHQVFELIQKFISRGVKVVIKSVSLELL
PFIFFNIAGRACQRSSGKPLPSQVQSPRRKNWFHGPVPGPPCSLKSRDLVTWIPAAPAIA
KRGQVWKRNVGSEPPQRVPAGTPPSGAVGRGPLSSGPQNGRSTDCLHHVPGKTRHSTPAC
ESSQENEVVGISVIDHSCKQEVLRLWVACTLVLSDSGSGSGSGSISIRESHSLGQKSGVL
VTNVCPGCDCAHVWGQLFGKQKDREEKARRENYDLSYIPCSRGTTEREKR

>gi568815586f:38216894_38421210|GENSCAN_predicted_CDS_3|1053_bp
atggatgatcgtggaggacattatggattaattcctcaccctacacagccctatccgttc
atcaggactcagctctggtaccatcttctctgtgaggacttactgaatgccacatactcc
tctccctcccagaaaagcaatgaagacatggagccactaagggttaacctgcccacagta
acaaaactggttaagtggcagggagagaattctaacccaggcaatcgacctccggggcct
gcactattaccacatcaaaagtcctcagcaacttcaattcaccaggtatttgaattaata
cagaagtttatatccagaggagttaaagttgttataaaatcagtctccttggaactgcta
ccatttatctttttcaatattgcaggcagggcatgtcagaggtcttcagggaagcccctc
ccatcacaggtccagagtccgaggaggaaaaactggtttcatgggccagtcccaggacca
ccctgctctctaaagtctagggacttggtgacctggattccagctgctccagcgattgct
aaaaggggccaagtgtggaagagaaatgtggggtcagagcccccacaaagagtccctgct
gggacaccacctagtggagctgtgggaagagggccactgtcctctggaccccagaatggt
agatccactgactgcttgcaccatgtacctggaaaaaccagacactcaacgccagcctgt
gaaagcagccaggagaatgaagtggttgggatttctgttattgatcacagctgtaagcag
gaggttctcaggctttgggtagcatgcacgttagtccttagtgacagtggcagtggcagt
ggcagtggcagcatcagcatcagagaaagccattccttaggccagaaatctggggttttg
gtgactaatgtctgccctggctgtgactgtgcccacgtctggggacaactatttggaaaa
cagaaagatagagaagaaaaggctagaagagaaaactatgacctttcttatattccatgc
agcagaggaacaacagagagagaaaagagataa

>gi568815586f:38216894_38421210|GENSCAN_predicted_peptide_4|93_aa
MRRLLSWNFSSGMTARTNDCRNPKRTHRPSEGSGLFLQNLGDTPNTLTSNGSNQEEIPEI
PEKEFRRLVMKLTKEAPEKGEAQCKEIQYQYKN

>gi568815586f:38216894_38421210|GENSCAN_predicted_CDS_4|282_bp
atgcggaggctcttatcgtggaattttagctccggaatgactgcaagaacaaatgactgc
aggaatcccaagaggacccacagaccctctgaaggaagcggactgttcctgcagaacctg
ggagacaccccaaatactctcaccagcaatggatcaaaccaagaagaaatccctgaaata
cctgaaaaagaattcaggaggttagttatgaagctaaccaaggaggcaccagagaaaggt
gaagcccaatgcaaggaaatccaataccaatacaagaattga

>gi568815586f:38216894_38421210|GENSCAN_predicted_peptide_5|176_aa
MDSQQNSTIERRSGKNPFDTIPQDKEGTFPNSFYEASITLIPKPANDVTKKENYGPISLM
KIDAKILKKILANQIQQHIKKRIHHDQVDFIPAMQGWLNILKSINVIHHGNANQNHNAVP
PSSCKNGHNRKIKKTVAVGVDGVIREHFYTAEKLKDGKDLRCCDPDVTDGEIESKR

>gi568815586f:38216894_38421210|GENSCAN_predicted_CDS_5|531_bp
atggattcacagcagaattctaccattgaaagaagaagtggcaaaaatccttttgacact
attccacaagataaagagggaaccttccctaattcattctatgaagccagcatcacccta
ataccaaaaccagcaaatgacgtaaccaaaaaagaaaactatggaccaatatccctgatg
aagatagatgctaaaatccttaagaaaatactagctaaccaaatccagcagcatatcaaa
aagagaatccaccatgatcaagtggatttcataccagcgatgcagggatggcttaacata
ctcaagtcaataaatgtaatacaccacgggaatgcaaatcaaaaccacaatgcagtacct
ccttcctcctgcaagaatggacataatcgaaaaatcaaaaaaacagtagctgttggtgtg
gatggggtgatcagggaacacttctacactgctgagaaactgaaagatggaaaagatctc
agatgctgtgaccccgatgtaacagatggagaaattgaatccaagagatga

>gi568815586f:38216894_38421210|GENSCAN_predicted_peptide_6|64_aa
MPTLTTLQHSNRSPSQSRQEKEIKGIKIGSCIESKIIIRGLLGDPGIRAEVAGTLIAGLN
GLLF

>gi568815586f:38216894_38421210|GENSCAN_predicted_CDS_6|195_bp
atgcccactctcaccactcttcaacatagtaacagaagtcctagccagagcagacaagag
aaagaaataaaaggcatcaaaatcggaagctgcatagaaagtaaaattataatcagagga
ttgcttggtgatcctggaatcagagcagaggttgctggaactttgattgcagggcttaat
ggcctcttgttttga

>gi568815586f:38216894_38421210|GENSCAN_predicted_peptide_7|149_aa
MEKSLEVPHKTGSTDHLQEQNSNLKRTHKPSEGSRLLLQDLGDISNTVSAPIAKVGKGDT
PAQTHTPTAELEGLLVGEIPDFTWSSVKLEIKAKQNTGWGGGGSGEALGSHWVPKQPNTA
WHHRDPSGGWPEEQGIKLHREKEFSSRTW

>gi568815586f:38216894_38421210|GENSCAN_predicted_CDS_7|450_bp
atggagaaaagtttggaagttcctcataaaaccggaagtacagatcacctgcaagaacaa
aacagcaatcttaagaggacccataaaccctctgaaggaagcagactgctcctccaggac
ctgggagacatctcaaatactgtgagtgccccaattgcaaaagtgggaaagggagacact
cctgcacaaacacacacccccactgcagaacttgaaggtctgcttgtaggagaaattccc
gactttacctggagctcagtcaagttagagatcaaagccaagcaaaatacagggtggggt
ggaggaggcagtggagaggccctaggatctcactgggtccccaagcagcccaatactgcc
tggcatcacagggatccctcaggagggtggccagaggagcaggggataaaactccacagg
gagaaggaattctctagccgaacttggtaa