GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:19:33 Sequence gi568815586f:12617840_12818943 : 201104 bp : 42.62% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6583 6790 208 0 1 22 91 218 0.544 13.23 1.02 Intr + 8033 8055 23 0 2 71 37 17 0.069 -8.76 1.03 Intr + 17938 18135 198 0 0 22 95 233 0.869 16.03 1.04 Intr + 27866 27939 74 1 2 66 77 39 0.073 -2.11 1.05 Intr + 29433 29570 138 0 0 43 44 151 0.108 4.86 1.06 Intr + 40003 40125 123 1 0 -14 91 165 0.083 5.38 1.07 Intr + 42772 42869 98 1 2 66 92 66 0.033 3.53 1.08 Intr + 43918 44444 527 2 2 -60 66 642 0.042 38.93 1.09 Intr + 47496 47621 126 2 0 67 -2 146 0.023 3.46 1.10 Intr + 55710 55835 126 2 0 63 9 116 0.003 1.16 1.11 Intr + 70989 71075 87 2 0 46 63 102 0.033 2.75 1.12 Intr + 77047 77129 83 0 2 66 111 26 0.094 0.22 1.13 Intr + 78621 78859 239 0 2 -51 14 226 0.007 -2.36 1.14 Term + 96651 97018 368 1 2 117 42 199 0.948 11.98 1.15 PlyA + 97473 97478 6 1.05 2.00 Prom + 99494 99533 40 -9.55 2.01 Sngl + 100001 100540 540 1 0 53 37 538 0.555 40.63 2.02 PlyA + 100942 100947 6 1.05 3.04 PlyA - 102059 102054 6 1.05 3.03 Term - 106375 106150 226 0 1 8 55 213 0.148 5.27 3.02 Intr - 134005 133977 29 1 2 114 86 15 0.011 -0.10 3.01 Init - 141919 141842 78 1 0 82 44 91 0.361 5.21 3.00 Prom - 146867 146828 40 -5.35 4.00 Prom + 148678 148717 40 -4.45 4.01 Init + 167155 167274 120 0 0 51 33 163 0.412 7.34 4.02 Term + 168896 169813 918 1 0 39 54 1080 0.477 90.92 4.03 PlyA + 173226 173231 6 1.05 5.03 PlyA - 173603 173598 6 1.05 5.02 Term - 181616 181485 132 2 0 23 41 165 0.090 2.41 5.01 Init - 188133 187999 135 0 0 38 116 80 0.296 5.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 43983 44444 462 2 0 65 66 565 0.847 47.74 S.002 Init + 195529 195615 87 0 0 91 85 100 0.866 10.69 S.003 Intr + 196292 196385 94 1 1 29 100 87 0.812 2.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:12617840_12818943|GENSCAN_predicted_peptide_1|805_aa GVSEAGYFPDPFTGLTTGASYTQPTTLNSLQEGACEQMSEGTGVNEHGNRPAALVLAGVN FMQAQWQHPEFYSVWDKVVGGKVKKPGKRGRKPAKIDLKAKLERSRQSARECRARKKLRY QYLEELVSSRERAICALREELEMLAFAGSPEENQLCASLPNLMFSDIRLYRKHDWGGLRK LTVMMEDKVRAGILHGSIGTRDRGYHTLLNNRIFSRLDINVFDEDEKKGRIQIPLKGCIP VPSLLTDGLREPGARILIKIQPQKTCQQVPIYGVLSIADLASGYLEEAHQEVDDSVGSAF PRGGEEIVTMTVPVGAIEKEHRGHKACIKDPRCRNHFLGFFSGHLEAQRIDDGVEPVYAD GEENVDLDTWSEILKISHNLARCTTQRPPSSGELEQDERRAGNADEKVSTCHGDHKVVGG RLSPPTPMDDQTNQGIAEDRKQPQNPKEDAGCGHFPGFQHIVKLLYTHESVCPFPQPNHL EAEDPNAQKQDEGARGVPESRFPLLGLVLDGVVRERFSEEVIWQLKPQRQEVRHNIQAEE MMSAKTLVGHEFTMSSPAAGFSITGQHVPQGSGRQGHFPMHFTHTAQLPGYTLGCQNEMK SLSWLAAFLWGRGIPRRRSLRGPEGRTTTSSETKFFKKVKIEEVSQFITLEPAFCLAFPL FTLRKSPLAATQVTVLLPLRLRGGCEDYRNLGWLMGRFSALSRKTAVKMATKTSRRTGGF GETAHFRAVQCRAFRLLPGSTFNTVYLKIFPLAYSLAAAHFAEGWRSDLYAPCSFRDSIV FFFTRRRYYSRRLRSHYLLYRSPVP >gi568815586f:12617840_12818943|GENSCAN_predicted_CDS_1|2418_bp ggagtcagtgaagcaggatatttccctgaccctttcacgggactcacaacgggtgcctca tatactcagcccaccactctcaactccttgcaggagggagcatgtgagcaaatgagtgag ggaactggagtgaatgagcacgggaatcggccagctgctttggtgctagcaggagtgaac tttatgcaggcccaatggcagcatccagagttttacagtgtttgggacaaagtggttgga ggcaaagtaaagaagcccggtaaacgtggtcggaagccagccaaaattgacttgaaagca aaacttgagaggagccggcagagtgcaagagaatgccgagcccgaaaaaagctgagatat cagtatttggaagagttggtatccagtcgagaaagagctatatgtgccctcagagaggaa ctggaaatgctggcttttgctggctccccagaggagaaccaattgtgtgcatctcttccc aacttgatgttcagtgacatcaggctgtacaggaagcatgactggggaggcctcaggaaa cttacagtcatgatggaagacaaagtgagagcaggcatcttgcatggcagcattgggacc agggatcgggggtaccacacgcttttaaataaccggatcttcagcaggctggacattaac gtattcgatgaagatgagaagaaagggagaattcagattccactcaaaggatgcatccct gtcccttccctcctcaccgacggtttacgagaacctggagccagaatcctgattaaaatc caaccacagaagacttgccagcaagttccaatatatggcgtgctttccatagcagatctg gctagtggttacttggaggaagcccaccaagaagtggatgacagtgtaggcagtgccttc ccaagaggaggggaggaaatagttacaatgactgtcccagttggagccatagaaaaagag cacaggggtcacaaagcctgcatcaaagacccacgatgccgcaatcattttcttggcttt ttctctggacaccttgaagctcagaggatagacgatggtgtagaaccggtctatgcagat ggagaggagaacgtagatctggacacctggagtgagatattgaaaatatcgcacaacctt gcacgttgcactacccagcgtccaccttccagtggtgaactggagcaggacgaaaggcgt gctggcaacgctgatgagaaggtcagcacatgccatggagaccacaaagtagttggtggt agactgagtcctcctactcctatggatgaccaaacaaaccagggaattgccgaagataga aaacaaccacagaatcccaaagaagatgctggctgtggccacttccccgggtttcagcac atagtgaagctgctttacacccatgagtctgtctgcccatttccccagcccaaccacttg gaagcagaggatcccaatgctcagaaacaggacgagggagcccgaggagttcctgagtct agatttcccctgctgggtctggttttagatggagtggttagagaaaggttctctgaagag gttatatggcagctgaaaccccaacgacaggaagtccgccataacattcaggcagaagaa atgatgagtgcaaagaccctggtgggccacgagttcaccatgagttcaccagctgcaggc ttcagcatcacaggacagcatgttccccagggatcaggcagacaagggcattttcctatg cacttcacacatacagcgcagctgccaggatatactttaggctgccaaaatgaaatgaag tcactttcttggctagcggcatttctctggggccgaggaattccacgaagaagatctctg cgaggcccagaaggccgcacaactacttcttcagaaacaaaattttttaaaaaagtaaaa atagaggaagtcagccagtttatcaccttggaaccagcgttttgtttggcttttccgctt ttcactctacgaaaaagcccattggcggctacccaggttaccgtcctgttgccattgcgc ctgcgcggcggttgtgaagattacagaaatctgggatggcttatgggacgcttctcagcc ctaagtaggaaaacagcagtgaaaatggcaaccaaaacatcacgcaggactgggggtttt ggggaaacagctcactttagagcagtgcagtgtagagctttccgtcttttaccagggtcc acctttaacactgtttatctgaaaattttccccctggcttactcgcttgcagctgcccac tttgcagaaggatggcgctctgatctctacgctccctgttccttcagggactccatagta ttttttttcacgcgtcgtcgctactacagcagacgcctgcgttctcattatttgctgtac agatctccggtgccttga >gi568815586f:12617840_12818943|GENSCAN_predicted_peptide_2|179_aa MSNVRVSNGSPSLERMDARQAEHPKPSACRNLFGPVDHEELTRDLEKHCRDMEEASQRKW NFDFQNHKPLEGKYEWQEVEKGSLPEFYYRPPRPPKGACKVPAQESQDVSGSRPAAPLIG APANSEDTHLVDPKTDPSDSQTGLAEQCAGIRKRPATDGNDPFPTIECVWGPALPAGGC >gi568815586f:12617840_12818943|GENSCAN_predicted_CDS_2|540_bp atgtcaaacgtgcgagtgtctaacgggagccctagcctggagcggatggacgccaggcag gcggagcaccccaagccctcggcctgcaggaacctcttcggcccggtggaccacgaagag ttaacccgggacttggagaagcactgcagagacatggaagaggcgagccagcgcaagtgg aatttcgattttcagaatcacaaacccctagagggcaagtacgagtggcaagaggtggag aagggcagcttgcccgagttctactacagacccccgcggccccccaaaggtgcctgcaag gtgccggcgcaggagagccaggatgtcagcgggagccgcccggcggcgcctttaattggg gctccggctaactctgaggacacgcatttggtggacccaaagactgatccgtcggacagc cagacggggttagcggagcaatgcgcaggaataaggaagcgacctgcaaccgacggtaat gaccctttcccaaccatagaatgtgtttggggccccgctttgcctgctggagggtgttaa >gi568815586f:12617840_12818943|GENSCAN_predicted_peptide_3|110_aa MPIYKLVTPNRVTGYTDVKAARAADGRSMDYIGHTSSDIPTAGEAAGYSGGESVCGEEWE LSLLSQRRGEEEGGKFLRNRPARLRGGGAAARARELRATTATVRGAARYR >gi568815586f:12617840_12818943|GENSCAN_predicted_CDS_3|333_bp atgcccatttataaactagtcactcccaacagggtaactggatatactgacgtcaaggca gcccgagctgcagacgggagatcaatggattatattggacacacaagctccgatatcccc acggccggggaggcggccggttactcaggtggagagtccgtttgcggagaggagtgggag ctttcgctgctttctcagcgcagaggagaggaggagggaggaaagtttctgagaaaccgc ccagcccggctgcgcggcggaggcgcggccgcccgggcgcgggaactgcgcgcgacgacg gcgacagtgcggggggctgcacgttacagatga >gi568815586f:12617840_12818943|GENSCAN_predicted_peptide_4|345_aa MQPDPTDFSDACDLECALSETANVFSTAGDAQNCTQHFLWNRSLRTRLMIHWKQTRSPSV AVAAPLNSCQVPAGVRAAGRERRLARRLQADRVSMSPQGMERPAAREPHGPDALRRFQGL LLDRRGRLHGQVLRLREVARRLERLRRRSLVANVAGSSLSATGALAAIVGLSLSPVTLGT SLLVSAVGLGVATAGGAVTITSDLSLIFCNSRELRRVQEIAATCQDQMREILSCLEFFCR WQGCGDRQLLQCGRNASIALYNSVYFIVFFGSRGFLIPRRAEGDTKVSQAVLKAKIQKLA ESLESCTGALDELSEQLESRVQLCTKSSRGHDLKISADQRAGLFF >gi568815586f:12617840_12818943|GENSCAN_predicted_CDS_4|1038_bp atgcagcctgaccccactgatttttctgatgcatgtgacctggagtgtgccctttctgag accgccaatgtcttctccacagctggagatgcccaaaactgcacgcagcacttcctgtgg aacaggtctttgagaactcgtctcatgatccactggaaacagacccgttcccctagcgtg gcagtggctgctccgctgaactcgtgccaagttcccgctggcgtccgggcagcagggcgg gagcggcggctggcacggagactccaggctgaccgcgtgtctatgtccccgcagggaatg gagaggccggcggcccgggagccgcatgggcccgacgcgctgcggcgcttccagggactg ctgctggaccgccgaggccggctgcacggccaggtgctgcgcctgcgcgaggtggcccgg cgcctggagcgcctgcgcaggcgctccctcgtagccaacgtggccggcagctcgctgagc gcaacgggcgccctcgccgccatcgtggggctctcgctcagcccggtcaccctggggacc tcgctgctggtgtcggccgtggggctgggggtggccacagccggaggggccgtcaccatc acgtccgatctctcgctgatcttctgcaactcccgggagctgcggagggtgcaggagatc gcggccacctgccaggaccagatgcgagagatcctgagctgcctcgagtttttctgccgc tggcagggctgcggggaccgccagctgctgcagtgcgggaggaacgcctccatcgccctg tacaattctgtctacttcatcgtcttctttggctcacgtggcttcctcatccccaggcgg gcggagggggacaccaaggttagccaggccgtgctgaaggccaagattcagaaactggcc gagagcctggagtcctgcaccggggctctggacgaactcagcgagcagctggagtctcgg gttcagctctgcaccaagtccagtcgtggccacgacctcaagatctctgctgaccagcgt gcagggctgtttttctga >gi568815586f:12617840_12818943|GENSCAN_predicted_peptide_5|88_aa MIDLTWNVKARNVSGIPRFLASAGSKVAEPFTQLGNTGGGANMWKALKMEEGATSQGMQE PLEGGKGKKTRSPLSPLGKRIALPTAGF >gi568815586f:12617840_12818943|GENSCAN_predicted_CDS_5|267_bp atgattgatttaacatggaacgtgaaagccaggaatgtgtcggggattcccagatttttg gcttcagcaggtagcaaagtggcagaaccattcacacagctaggaaacactggaggaggg gcaaatatgtggaaggctttgaagatggaggaaggggccacaagccagggaatgcaggag cctctagaaggtggaaaaggcaagaaaacacgctctcccttatcgcctctaggaaaaagg atagccctgccgactgctggattttag