GENSCAN 1.0	Date run: 6-Nov-116	Time: 07:54:22

Sequence gi568815586r:12561204_12762448 : 201245 bp : 42.84% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    302    462  161  0  2  105   39   159 0.814   9.82
 1.02 PlyA +   1443   1448    6                               1.05

 2.10 PlyA -   2294   2289    6                               1.05
 2.09 Term -   7212   6998  215  1  2   37   42   258 0.892  12.51
 2.08 Intr -  21272  21197   76  2  1   51   75    33 0.298  -3.53
 2.07 Intr -  21360  21307   54  0  0   66   63    87 0.408   2.26
 2.06 Intr -  35440  35280  161  2  2  126   39    79 0.266   5.59
 2.05 Intr -  39047  38929  119  1  2  100   22    81 0.166   1.89
 2.04 Intr -  39372  39275   98  0  2  125   40    53 0.110   2.09
 2.03 Intr -  50838  50668  171  0  0   85   47   126 0.005   7.42
 2.02 Intr -  55644  55547   98  1  2   76   34   112 0.260   3.41
 2.01 Init -  55871  55826   46  2  1   49   58    47 0.563  -1.30
 2.00 Prom -  55980  55941   40                              -5.95

 3.00 Prom +  62157  62196   40                              -5.95
 3.01 Init +  62622  62669   48  2  0   43   75    47 0.399  -0.10
 3.02 Intr +  63219  63426  208  2  1   22   91   218 0.536  13.23
 3.03 Intr +  64669  64691   23  2  2   71   37    17 0.068  -8.76
 3.04 Intr +  74574  74771  198  2  0   22   95   233 0.869  16.03
 3.05 Intr +  84502  84575   74  0  2   66   77    39 0.073  -2.11
 3.06 Intr +  86069  86206  138  2  0   43   44   151 0.108   4.86
 3.07 Intr +  96639  96761  123  0  0  -14   91   165 0.083   5.38
 3.08 Intr +  99408  99505   98  0  2   66   92    66 0.033   3.53
 3.09 Intr + 100554 101080  527  1  2  -60   66   642 0.042  38.93
 3.10 Intr + 104132 104257  126  1  0   67   -2   146 0.023   3.46
 3.11 Intr + 112346 112471  126  1  0   63    9   116 0.003   1.16
 3.12 Intr + 127625 127711   87  1  0   46   63   102 0.033   2.75
 3.13 Intr + 133683 133765   83  2  2   66  111    26 0.094   0.22
 3.14 Intr + 135257 135495  239  2  2  -51   14   226 0.007  -2.36
 3.15 Term + 153287 153654  368  0  2  117   42   199 0.948  11.98
 3.16 PlyA + 154109 154114    6                               1.05

 4.00 Prom + 156130 156169   40                              -9.55
 4.01 Sngl + 156637 157176  540  0  0   53   37   538 0.555  40.63
 4.02 PlyA + 157578 157583    6                               1.05

 5.04 PlyA - 158695 158690    6                               1.05
 5.03 Term - 163011 162786  226  2  1    8   55   213 0.148   5.27
 5.02 Intr - 190641 190613   29  0  2  114   86    15 0.014  -0.10
 5.01 Init - 198555 198478   78  0  0   82   44    91 0.451   5.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100619 101080  462  1  0   65   66   565 0.847  47.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:12561204_12762448|GENSCAN_predicted_peptide_1|53_aa
XSVNKEVTRRRAPQRQDPSPCPKIEGFFGWLVEESSTVTTQVASLLTPKANWG

>gi568815586r:12561204_12762448|GENSCAN_predicted_CDS_1|162_bp
nctagtgtaaataaagaagtcactaggcgtcgggccccccaacgccaagacccgagcccc
tgtccgaaaattgagggctttttcggttggttggttgaagaaagcagcaccgtcaccacc
caggtggcctcgctgcttacaccaaaggcgaactggggataa

>gi568815586r:12561204_12762448|GENSCAN_predicted_peptide_2|345_aa
MAECGELAANSFITTGHSHHTSKPQSSSSLTLKKMGRESVTHTECEGKFASLMPESRGRG
TSSGMAERTLEAAPNASLRRRRLLLPLHDSLWVFTDALESRRDRRVRPKYVLVTFLPMAS
RGRDHNFSSWSPTPPPMLHEKSYLCSFARSPQGSSVPSQSAKMHGKAVLLFPQVPLGGEE
RVAPLVQVPTLEEREERQRGSEGKEETEKEGVKEREKERQRERGRDRDKKACCQQQQPSF
TGQGVSTAPKKTTQHSAGDPALIPLSPPNHKGHAEGERDCVQKQAYSHGIVRKLKVDKSR
KKLLVDQAEAHWPKTKEVCKLHDECLQAKKEEIIKMLSKEEETER

>gi568815586r:12561204_12762448|GENSCAN_predicted_CDS_2|1038_bp
atggcagagtgtggagaactcgcagcaaactcatttatcaccactgggcacagccaccat
acttccaaaccccaatcctcatcttcccttacactgaagaaaatgggacgagagagtgtg
actcatacagaatgtgagggaaaattcgcttccctgatgccagagtcacggggacgaggg
accagttcaggaatggcagagcggaccctggaggcggccccaaacgcctcccttcgccgc
cgccgcctcctcctccccctccatgacagtctctgggtgtttacagacgctctggagagc
cgccgagaccggagggtgaggcccaaatacgtgcttgtaaccttcctgcccatggcttcc
agaggacgagatcataacttctcttcatggtcgcccactcctccccctatgctgcatgag
aaaagctacctctgcagttttgccaggtccccgcaaggcagttcagtccccagccaaagt
gctaagatgcacggcaaggctgttctcctgtttccacaggtgcccctgggaggggaagag
agagtagctccactcgtgcaagtccctaccctagaggagagagaggagaggcagagaggc
agtgagggaaaggaagagacagaaaaagaaggagtcaaagagagagagaaagagaggcag
agagagagaggaagagacagagacaaaaaggcatgttgccagcagcagcagccttctttc
acaggacagggagtcagtacagcaccaaagaagaccactcagcacagtgccggagaccct
gccctgattcccctttctcctcccaaccacaaaggccatgcagaaggtgaaagggattgt
gttcaaaaacaagcgtattctcatggaattgtccgcaagctgaaggtagacaagtcccgc
aagaagctcctggtggaccaggctgaggcccactggcctaagactaaggaagtgtgcaaa
cttcatgacgagtgcctccaggccaagaaggaggagatcatcaagatgttgtccaaggag
gaagagaccgagagataa

>gi568815586r:12561204_12762448|GENSCAN_predicted_peptide_3|821_aa
MWMGRKSSDKCPYKTKGVSEAGYFPDPFTGLTTGASYTQPTTLNSLQEGACEQMSEGTGV
NEHGNRPAALVLAGVNFMQAQWQHPEFYSVWDKVVGGKVKKPGKRGRKPAKIDLKAKLER
SRQSARECRARKKLRYQYLEELVSSRERAICALREELEMLAFAGSPEENQLCASLPNLMF
SDIRLYRKHDWGGLRKLTVMMEDKVRAGILHGSIGTRDRGYHTLLNNRIFSRLDINVFDE
DEKKGRIQIPLKGCIPVPSLLTDGLREPGARILIKIQPQKTCQQVPIYGVLSIADLASGY
LEEAHQEVDDSVGSAFPRGGEEIVTMTVPVGAIEKEHRGHKACIKDPRCRNHFLGFFSGH
LEAQRIDDGVEPVYADGEENVDLDTWSEILKISHNLARCTTQRPPSSGELEQDERRAGNA
DEKVSTCHGDHKVVGGRLSPPTPMDDQTNQGIAEDRKQPQNPKEDAGCGHFPGFQHIVKL
LYTHESVCPFPQPNHLEAEDPNAQKQDEGARGVPESRFPLLGLVLDGVVRERFSEEVIWQ
LKPQRQEVRHNIQAEEMMSAKTLVGHEFTMSSPAAGFSITGQHVPQGSGRQGHFPMHFTH
TAQLPGYTLGCQNEMKSLSWLAAFLWGRGIPRRRSLRGPEGRTTTSSETKFFKKVKIEEV
SQFITLEPAFCLAFPLFTLRKSPLAATQVTVLLPLRLRGGCEDYRNLGWLMGRFSALSRK
TAVKMATKTSRRTGGFGETAHFRAVQCRAFRLLPGSTFNTVYLKIFPLAYSLAAAHFAEG
WRSDLYAPCSFRDSIVFFFTRRRYYSRRLRSHYLLYRSPVP

>gi568815586r:12561204_12762448|GENSCAN_predicted_CDS_3|2466_bp
atgtggatgggccgtaaatccagtgacaagtgtccttataagactaagggagtcagtgaa
gcaggatatttccctgaccctttcacgggactcacaacgggtgcctcatatactcagccc
accactctcaactccttgcaggagggagcatgtgagcaaatgagtgagggaactggagtg
aatgagcacgggaatcggccagctgctttggtgctagcaggagtgaactttatgcaggcc
caatggcagcatccagagttttacagtgtttgggacaaagtggttggaggcaaagtaaag
aagcccggtaaacgtggtcggaagccagccaaaattgacttgaaagcaaaacttgagagg
agccggcagagtgcaagagaatgccgagcccgaaaaaagctgagatatcagtatttggaa
gagttggtatccagtcgagaaagagctatatgtgccctcagagaggaactggaaatgctg
gcttttgctggctccccagaggagaaccaattgtgtgcatctcttcccaacttgatgttc
agtgacatcaggctgtacaggaagcatgactggggaggcctcaggaaacttacagtcatg
atggaagacaaagtgagagcaggcatcttgcatggcagcattgggaccagggatcggggg
taccacacgcttttaaataaccggatcttcagcaggctggacattaacgtattcgatgaa
gatgagaagaaagggagaattcagattccactcaaaggatgcatccctgtcccttccctc
ctcaccgacggtttacgagaacctggagccagaatcctgattaaaatccaaccacagaag
acttgccagcaagttccaatatatggcgtgctttccatagcagatctggctagtggttac
ttggaggaagcccaccaagaagtggatgacagtgtaggcagtgccttcccaagaggaggg
gaggaaatagttacaatgactgtcccagttggagccatagaaaaagagcacaggggtcac
aaagcctgcatcaaagacccacgatgccgcaatcattttcttggctttttctctggacac
cttgaagctcagaggatagacgatggtgtagaaccggtctatgcagatggagaggagaac
gtagatctggacacctggagtgagatattgaaaatatcgcacaaccttgcacgttgcact
acccagcgtccaccttccagtggtgaactggagcaggacgaaaggcgtgctggcaacgct
gatgagaaggtcagcacatgccatggagaccacaaagtagttggtggtagactgagtcct
cctactcctatggatgaccaaacaaaccagggaattgccgaagatagaaaacaaccacag
aatcccaaagaagatgctggctgtggccacttccccgggtttcagcacatagtgaagctg
ctttacacccatgagtctgtctgcccatttccccagcccaaccacttggaagcagaggat
cccaatgctcagaaacaggacgagggagcccgaggagttcctgagtctagatttcccctg
ctgggtctggttttagatggagtggttagagaaaggttctctgaagaggttatatggcag
ctgaaaccccaacgacaggaagtccgccataacattcaggcagaagaaatgatgagtgca
aagaccctggtgggccacgagttcaccatgagttcaccagctgcaggcttcagcatcaca
ggacagcatgttccccagggatcaggcagacaagggcattttcctatgcacttcacacat
acagcgcagctgccaggatatactttaggctgccaaaatgaaatgaagtcactttcttgg
ctagcggcatttctctggggccgaggaattccacgaagaagatctctgcgaggcccagaa
ggccgcacaactacttcttcagaaacaaaattttttaaaaaagtaaaaatagaggaagtc
agccagtttatcaccttggaaccagcgttttgtttggcttttccgcttttcactctacga
aaaagcccattggcggctacccaggttaccgtcctgttgccattgcgcctgcgcggcggt
tgtgaagattacagaaatctgggatggcttatgggacgcttctcagccctaagtaggaaa
acagcagtgaaaatggcaaccaaaacatcacgcaggactgggggttttggggaaacagct
cactttagagcagtgcagtgtagagctttccgtcttttaccagggtccacctttaacact
gtttatctgaaaattttccccctggcttactcgcttgcagctgcccactttgcagaagga
tggcgctctgatctctacgctccctgttccttcagggactccatagtattttttttcacg
cgtcgtcgctactacagcagacgcctgcgttctcattatttgctgtacagatctccggtg
ccttga

>gi568815586r:12561204_12762448|GENSCAN_predicted_peptide_4|179_aa
MSNVRVSNGSPSLERMDARQAEHPKPSACRNLFGPVDHEELTRDLEKHCRDMEEASQRKW
NFDFQNHKPLEGKYEWQEVEKGSLPEFYYRPPRPPKGACKVPAQESQDVSGSRPAAPLIG
APANSEDTHLVDPKTDPSDSQTGLAEQCAGIRKRPATDGNDPFPTIECVWGPALPAGGC

>gi568815586r:12561204_12762448|GENSCAN_predicted_CDS_4|540_bp
atgtcaaacgtgcgagtgtctaacgggagccctagcctggagcggatggacgccaggcag
gcggagcaccccaagccctcggcctgcaggaacctcttcggcccggtggaccacgaagag
ttaacccgggacttggagaagcactgcagagacatggaagaggcgagccagcgcaagtgg
aatttcgattttcagaatcacaaacccctagagggcaagtacgagtggcaagaggtggag
aagggcagcttgcccgagttctactacagacccccgcggccccccaaaggtgcctgcaag
gtgccggcgcaggagagccaggatgtcagcgggagccgcccggcggcgcctttaattggg
gctccggctaactctgaggacacgcatttggtggacccaaagactgatccgtcggacagc
cagacggggttagcggagcaatgcgcaggaataaggaagcgacctgcaaccgacggtaat
gaccctttcccaaccatagaatgtgtttggggccccgctttgcctgctggagggtgttaa

>gi568815586r:12561204_12762448|GENSCAN_predicted_peptide_5|110_aa
MPIYKLVTPNRVTGYTDVKAARAADGRSMDYIGHTSSDIPTAGEAAGYSGGESVCGEEWE
LSLLSQRRGEEEGGKFLRNRPARLRGGGAAARARELRATTATVRGAARYR

>gi568815586r:12561204_12762448|GENSCAN_predicted_CDS_5|333_bp
atgcccatttataaactagtcactcccaacagggtaactggatatactgacgtcaaggca
gcccgagctgcagacgggagatcaatggattatattggacacacaagctccgatatcccc
acggccggggaggcggccggttactcaggtggagagtccgtttgcggagaggagtgggag
ctttcgctgctttctcagcgcagaggagaggaggagggaggaaagtttctgagaaaccgc
ccagcccggctgcgcggcggaggcgcggccgcccgggcgcgggaactgcgcgcgacgacg
gcgacagtgcggggggctgcacgttacagatga