GENSCAN 1.0	Date run: 6-Nov-116	Time: 08:33:44

Sequence gi568815583f:23465783_23667303 : 201521 bp : 40.66% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  24491  24592  102  1  0  125   66    82 0.842  10.17
 1.02 Intr +  37448  37533   86  1  2   55   68   138 0.004   6.20
 1.03 Intr +  39295  39328   34  1  1   91   86     5 0.002  -2.09
 1.04 Intr +  46035  46112   78  2  0   61   92    41 0.412   0.63
 1.05 Term +  53588  53800  213  1  0   83   49   184 0.803  10.25
 1.06 PlyA +  54754  54759    6                               1.05

 2.00 Prom +  54819  54858   40                              -9.15
 2.01 Init +  54867  55177  311  2  2   60   92   199 0.741  14.33
 2.02 Intr +  71362  71516  155  1  2   79  119    10 0.013   1.99
 2.03 Intr +  80511  80634  124  1  1   31  106    41 0.023  -1.08
 2.04 Term +  81174  81492  319  0  1   25   49   165 0.042  -0.33
 2.05 PlyA +  83464  83469    6                               1.05

 3.00 Prom +  86736  86775   40                              -4.55
 3.01 Sngl + 100001 101524 1524  1  0   97   39  1143 0.988 105.41
 3.02 PlyA + 102215 102220    6                               1.05

 4.00 Prom + 114997 115036   40                              -2.35
 4.01 Init + 124212 124341  130  2  1   44   58   142 0.226   7.16
 4.02 Intr + 131013 131128  116  1  2   44   52    52 0.318  -3.55
 4.03 Intr + 131467 131583  117  0  0   86   81    80 0.592   6.84
 4.04 Intr + 137128 137286  159  0  0   53   67    84 0.056   2.06
 4.05 Intr + 149784 149855   72  2  0  114   84    32 0.085   4.08
 4.06 Term + 161984 162049   66  1  0  120   47   132 0.574   9.26
 4.07 PlyA + 163221 163226    6                               1.05

 5.02 PlyA - 163347 163342    6                               1.05
 5.01 Sngl - 181636 178211 3426  1  0   56   38  2622 0.627 242.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  38099  38200  102  1  0   86   61    92 0.812   6.59
S.002 Intr +  39114  39204   91  2  1   45   76    79 0.831   1.05

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:23465783_23667303|GENSCAN_predicted_peptide_1|170_aa
MAPSLLYRGPQQSLPLRTSTATAKASSLTIEDPAGVTNDSPQRLGSLKIAADNSEVHSSN
KDRLGTRYGDCRDKLFEYLLLSEKLSAQLPCAGPCLLIVQVPFQVKPMKLTKSKEERKAG
RRRKKTQQIGDRRGGKVLSARLERRLQEAGEKLTSCLNVESKIQAAALQL

>gi568815583f:23465783_23667303|GENSCAN_predicted_CDS_1|513_bp
atggcccccagcctgctctacagaggaccgcagcagtctttaccactgaggacctcaaca
gccacagcaaaggcttcatctctcaccattgaggaccctgcaggtgtgaccaatgatagt
cctcagcgtctggggagcttgaagattgctgctgataacagtgaagtacacagcagcaac
aaagataggttaggaaccaggtacggggactgtagagacaagctatttgagtatcttctc
ttatctgaaaagctgtctgcacaactcccctgtgcaggaccctgcctgcttattgtgcag
gtaccttttcaggtgaagccgatgaagttaacgaaaagcaaagaagaaaggaaagctgga
agaagaaggaaaaagacacaacaaatcggggacaggaggggagggaaagtgttgtctgca
aggttggaaaggagacttcaggaagctggagaaaaactcaccagttgcctcaacgttgaa
tcaaagatccaggcagccgctcttcagctgtga

>gi568815583f:23465783_23667303|GENSCAN_predicted_peptide_2|302_aa
MNLKVVQKNLRGKVGPGGYQGATALARCTPLELAPCARCTPLELAPCARCTPLELAPCAR
CTPLELAPCARCTPLELAPCARCTPLELAPCARCRVYCFYFRKQTSLEVTILAYYHCQQR
NFKVKLERNHNNKDHKNGCSGQGTWSGPQERKQNAVTSLTRRNALENTVAVNSHPVNPWI
LVLAEALCAGKANMYLKLHMGSKCCKVTGGPELSIQSQLRPHGRPTAISQKRAVTCGGRQ
GFASKSYGYVLTFACRGLLQTPNSILICHGYFNYQWICWIIWSEQPSSLHSNLHLLQSLL
LL

>gi568815583f:23465783_23667303|GENSCAN_predicted_CDS_2|909_bp
atgaacttgaaggtagttcagaagaatcttagaggcaaggtaggacctggaggctaccag
ggagccacagctcttgcacgctgcacacctctagaattagcaccatgtgcacgctgcaca
cctctagaattagcaccatgtgcacgctgcacacctctagaattagcaccatgtgcacgc
tgcacacctctagaattagcaccatgtgcacgctgcacacctctagaattagcaccatgt
gcacgctgcacacctctagaattagcaccatgtgcacgctgcagagtttactgcttctac
tttcggaagcagacttccctagaggtaaccatcttggcttattaccactgtcaacaaaga
aacttcaaggtcaagttggaaagaaaccataacaataaagatcataaaaatgggtgcagc
ggacagggtacgtggtcaggaccacaagaaaggaaacagaatgctgtgacttccttgacc
agaagaaatgctctggaaaatactgtggcagttaatagtcatccagtaaatccatggata
ttagttttggcagaagcactgtgtgcaggaaaggcaaatatgtatctaaaattacacatg
ggcagcaaatgttgcaaggttactggtgggccagagttaagcattcagtctcagctaaga
ccccatggtaggccaacagccatttctcaaaagagagcagtaacctgtggaggaaggcag
ggctttgcttcaaaatcctatggatatgtgctgacattcgcgtgtagaggcctgctgcag
acaccaaacagcatccttatctgtcacggatacttcaattaccagtggatctgctggatc
atatggtctgagcagccaagcagcttgcacagcaacctgcatctgttgcagagccttctc
ttactatga

>gi568815583f:23465783_23667303|GENSCAN_predicted_peptide_3|507_aa
MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAPDSALPHAARGWAPFPV
APVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHD
LSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAPPAASSLSLPVIGSAAERGFF
EAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVASAPEAPLQSSETERKQMAVGSG
LRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQREEHMRACIEAHEKDMELS
FAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIRCIRRWRSARQFENRIVKS
CPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRYFAEGRGNCPFGDTCFYKH
EYPEGWGDEPPGPGGGSFSAYWHQLVEPVRMGEGNMLYKSIKKELVVLRLASLLFKRFLS
LRDELPFSEDQWDLLHYELEEYFNLIL

>gi568815583f:23465783_23667303|GENSCAN_predicted_CDS_3|1524_bp
atggaagagcctgcagctccctcagaagcccacgaggcagccggggcccaggcaggtgct
gaggcagcaagggagggtgtgtctgggccggaccttcccgtctgtgagccctccggggaa
tctgctgctccagattcagccctgccacatgcggcaaggggctgggcccccttccctgta
gctccagtccctgcccacctccgcagaggaggcctgaggcctgccccagcctcaggagga
ggagcctggcccagtccgttgccaagccgaagcagcggcatttggacaaagcagatcatc
tgcaggtattatatacatgggcagtgcaaggagggggagaactgtcgctattcgcacgac
ctttctggtcggaagatggccactgagggtggcgtttcgccgcctggggcctctgcaggt
ggaggccctagcacggctgcgcacatcgagcccccgactcaggaagtggcggaagccccc
ccggctgcatcctccctttccttgcctgtgattggctcggctgctgaaaggggtttcttt
gaagccgagagagacaatgcagaccgtggagctgctggaggagcaggtgtagaaagctgg
gcggatgccattgagtttgttccagggcagccctaccggggccgctgggttgcatctgcc
cccgaggctcctctacagagctcagagactgagaggaagcagatggctgtgggcagtggg
ttgcggttttgctattatgcttccaggggagtttgctttcgtggggagagctgtatgtac
ctccatggagacatatgcgacatgtgtgggctgcagaccttgcaccccatggatgctgcc
cagagggaagaacatatgagggcctgcattgaagcacacgagaaagatatggaactctcg
tttgctgtgcagcgtggtatggacaaggtgtgtggcatctgcatggaggttgtctatgag
aaggccaaccccaatgaccgccgctttggcattctttccaattgcaaccattccttctgt
attaggtgtatccgcaggtggagaagtgccagacagtttgagaacaggatcgtcaagtct
tgcccacagtgcagggtcacctctgaattggtcattcccagtgagttctgggtggaggag
gaggaagagaagcagaaacttattcagcaatacaaggaggcaatgagcaacaaggcctgc
aggtattttgcggaaggcaggggtaactgcccatttggagacacatgcttttacaagcat
gaataccctgagggctggggagatgagcctcctgggccaggtggtgggtcattcagcgca
tactggcatcaacttgtggagcctgtgcgaatgggagagggcaacatgctctataaaagc
attaagaaggagcttgtcgtgcttcggctggccagtctgttgtttaagcggtttctttca
ctgagagatgagttacccttctctgaggaccagtgggacttgcttcattatgagctggaa
gaatatttcaatttgattctgtag

>gi568815583f:23465783_23667303|GENSCAN_predicted_peptide_4|219_aa
MVASVLDLIASLGFGPSRFSDTPNLHGHKDAEPDLSMTEKPSFWEVKRPGFSPLQTAGKS
RGSTPPSQCSGGHYSERISGEMQNEDSTTIQRLSEIRQDSACSEFPYGLSHQPTSTFTST
PEACLGNALDLNKHFGFQALCSFPLASRTSVHVPDAPTLPIGTQTRAALFSMGPKPLIPS
VLLLPLVPPSSFFVDISSVNGMDSENQEESQVIKGSVPE

>gi568815583f:23465783_23667303|GENSCAN_predicted_CDS_4|660_bp
atggtcgcatccgttttggacctaattgcctctctgggctttggaccttccaggttttca
gatacgcccaatctccatggacataaggatgcagaaccagatctgtcaatgacagagaag
cctagtttctgggaagtgaagaggccaggcttctccccgctgcaaacagcaggaaagtcc
cgaggctccaccccgccctcccagtgctcaggtgggcattactcagagagaatcagtggg
gaaatgcaaaacgaagacagcactacaattcaaagactttctgagatcagacaagatagc
gcatgttcagagttcccgtatggccttagccatcagcccacatctacattcacatcaacc
ccagaagcctgcctgggtaatgccttggatctcaataagcattttggttttcaggccctt
tgctcattccctcttgcttctcgcacaagcgtgcatgtcccagatgcccccacccttccc
atcggcactcagacacgagctgccctcttctcaatgggacctaaacccctcatcccctcg
gtgcttctcttacctctggttcctccttcctccttttttgtagatatctcctctgtgaac
ggcatggattctgagaaccaggaagagagccaagtcataaaaggatcggtgcccgagtga

>gi568815583f:23465783_23667303|GENSCAN_predicted_peptide_5|1141_aa
MVHPPPPGAPMAQPPTPGVLMVHPSAPGAPMAHPPPPGTPMSHPPPPGTPMAHPPPPGTP
MAHPPPPGTPMVHPPPPGTPMAHPPPPGTPMAHPPPPGTPMAHPPPPGTPMAHPPPPGTP
MAQPPAPGVLMAQPLTPGVLMVQPAAPGAPMVQPPPAAMMTQPQPSGAPMAKPPGPGVLM
IHPPGARAPMTQPPASGAPMAQPAAPPAQPMAPPAQPMASWAPQAQPLILQIQSQVIRAP
PQVPQGPQAPPAQLATPPGWQATSPGWQATQQGWQATPLTWQTTQVTWQAPAVTWQVPPP
MRQGPPPIRPGPPPIRPGPPPVRQAPPLIRQAPPVIRQAPPVIRQAPPVIRQAPAVIRQA
PPVIRQAPPVIRQAPPVIRQAPPLIRQAPPPIRPAPQVLATQPPLWQALPPPPPLRQAPQ
ARLPAPQVQAAPQVPTAPPATQVPAAPPAGPQVPQPVLPAPLSAPLSAPQAVHCPSIIWQ
APKGQPPVPHEIPTSMEFQEVQQTQALAWQAQKAPTHIWQPLPAQEAQRQAPPLVQLEQP
FQGAPPSQKAVQIQLPPQQAQASGPQAEVPTLPLQPSWQAPPAVLQAQPGPPVAAANFPL
GSAKSLMTPSGECRASSIDRRGSSKERRTSSKERRAPSKDRMIFAATFCAPKAVSAARAH
LPAAWKNLPATPETFAPSSSVFPATSQFQPASLNAFKGPSAASETPKSLPYALQDPFACV
EALPAVPWVPQPNMNASKASQAVPTFLMATAAAPQATATTQEASKTSVEPPRRSGKATRK
KKHLEAQEDSRGHTLAFHDWQGPRPWENLNLSDWEVQSPIQVSGDWEHPNTPRGLSGWEG
PSTSRILSGWEGPSASWALSAWEGPSTSRALGLSESPGSSLPVVVSEVASVSPGSSATQD
NSKVEAQPLSPLDERANALVQFLLVKDQAKVPVQRSEMVKVILREYKDECLDIINRANNK
LECAFGYQLKEIDTKNHAYIIINKLGYHTGNLVASYLDRPKFGLLMVVLSLIFMKGNCVR
EDLIFNFLFKLGLDVRETNGLFGNTKKLITEVFVRQKYLEYRRIPYTEPAEYEFLWGPRA
FLETSKMLVLRFLAKLHKKDPQSWPFHYLEALAECEWEDTDEDEPDTGDSAHGPTSRPPP
R

>gi568815583f:23465783_23667303|GENSCAN_predicted_CDS_5|3426_bp
atggtgcatcctccacctccgggagccccgatggcccagcctccgaccccgggagtcctg
atggtgcatccttcagctcccggagctcccatggcccatcctcctcctccggggacccca
atgtcccaccctccccctccggggaccccaatggcccatcctcctcctccggggaccccg
atggcccatcctcctcctccggggaccccgatggtgcatcctcctcctccggggaccccg
atggctcatcctccccctccggggacaccgatggctcatcctccccctccggggacaccg
atggctcatcctccacctccggggacaccgatggctcatcctccccctccgggtacaccg
atggcccagcctccagctccgggagtcctgatggcccagcctctgactccgggagtcctg
atggtccagcctgctgctccgggagcaccgatggtccagccgcctccagcagccatgatg
acccagcctcagccttcaggagcaccgatggccaagcctccaggtccaggagtcctgatg
attcatcctccaggtgcgagagctccgatgacccagcctccagcttcaggagcaccgatg
gcacagccggcggccccacctgcacagccgatggccccacctgcacagccgatggcttct
tgggccccgcaggctcagcctctgatcctgcaaatccagtctcaagttataagggctcct
ccgcaggttccccagggcccgcaggcacccccagcgcagctagccacacccccgggctgg
caggcgacctcgccaggatggcaggccacgcagcaaggctggcaggccactcccctgact
tggcagaccacgcaggtcacctggcaggcaccagccgttacctggcaggtgccgccgccc
atgcgccaggggcccccgcccatccgccctggcccaccacccatccgccctggcccacca
ccggtgcgacaggccccaccgctgatccgccaggccccaccggtgatccgccaggcccca
cccgtgatccgccaggccccacccgtgatccgccaggcccccgctgtgatccgccaggcc
ccacctgtgatccgccaggccccacctgtgatccgccaggctccacctgtgatccgccag
gccccgccgctgatccgccaggcgccgccgcccatccgacctgccccacaggtcctggcc
acccagccaccgctctggcaggccctgccacccccacctccactgcggcaggccccgcag
gctaggctgccggccccgcaggtgcaggcggcgccgcaggtgcctacggccccacctgct
acgcaggtacccgcggcgccgcccgctggcccgcaggtgccccagcctgtgctgccggcc
ccgctgtctgccccactgtctgccccgcaggctgtgcactgcccttccatcatctggcag
gcccccaaaggtcagcccccggtgccacacgagattccaacgtcaatggaattccaggag
gtgcagcagacacaggcgctggcctggcaggcccagaaggcccccactcacatctggcag
cccctgcctgcccaggaggcccagaggcaggctccccccttggtccagctggagcagccc
tttcagggagccccgccctcccaaaaagccgtgcaaatccagctacccccccagcaggcc
caggcatcgggtccgcaagcggaggtgcccacactgccgctccagccttcctggcaggca
ccgcctgcagtcttgcaggcccagcccggacccccggtagcagcggcaaattttcccctg
ggctccgctaaatcattgatgactccatcaggagaatgcagggcctcttctatagaccgc
aggggctcctctaaagagcgcaggacctcctcgaaggagcgcagggccccttcaaaagac
cgcatgatctttgctgccaccttctgtgctcccaaggcagtgtcagctgcgcgagcacac
ctgccagctgcctggaaaaacctgcctgccacaccggagacctttgctccctcctcaagt
gtcttcccagctacctcccagtttcagcctgcctctctgaatgcctttaaaggcccctct
gctgcctcagagaccccaaagtcactgccatatgctctgcaggatccctttgcctgtgta
gaggccctgcctgcagttccatgggtcccacagcccaatatgaatgcctcaaaggcatcg
caggcagtgcccaccttcctgatggctacagcagctgccccccaggcaactgccaccact
caagaggcctccaagacctccgtcgagccgccacgccgctccggcaaggccacccggaag
aagaagcatctggaagcccaagaggacagccgtggccacacgctagcctttcatgactgg
cagggcccaaggccctgggagaatctaaatctgagtgactgggaggtccaaagccctatc
caggtctcgggtgactgggagcacccaaacaccccccgtggcctgagtggttgggagggc
cctagcacctccaggatcctgagtggctgggaagggcccagcgcatcctgggccctgagt
gcctgggagggcccgagcacctccagggccctgggtctctctgaaagcccagggagctct
ctgcccgtagttgtgtctgaggtcgcaagtgtctctccgggatccagtgccacccaggat
aattccaaggtggaggcacagcccttgtctcccttggatgagagggcaaatgcgttggtg
cagttcctcttagtcaaggaccaagccaaggtgcctgtccagcgctcggagatggtgaaa
gtcatcctccgagagtataaagatgagtgcttagatatcatcaaccgtgccaacaataag
ctggagtgtgcctttggttatcaattgaaagaaattgataccaaaaaccacgcctatatt
atcatcaacaagctgggctaccatacagggaatttggtggcatcctatttagacaggccc
aagtttggccttctgatggtggtcttgagcctcatctttatgaaaggcaactgtgtcagg
gaggatctgatctttaattttctgttcaagttagggttggatgtccgggagacaaacggt
ctctttggaaatactaagaagctcatcaccgaagtgtttgtcaggcagaagtacctagag
tacaggcgaatcccttacactgagcccgcagagtatgagttcctctggggccctcgagca
ttcctggaaaccagcaagatgcttgtcctgaggtttttggccaagctccataagaaagat
ccacagagctggccattccattaccttgaagcgctcgcagagtgtgagtgggaagacaca
gatgaggatgaacctgacaccggtgacagtgcccacggccccaccagcaggccccctccc
cgctaa