GENSCAN 1.0	Date run: 4-Nov-116	Time: 18:53:45

Sequence gi568815586f:12535774_12737715 : 201942 bp : 42.36% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5600   5744  145  1  1   91   67    26 0.052   1.13
 1.02 Term +  25732  25892  161  2  2  105   39   159 0.636   9.82
 1.03 PlyA +  26873  26878    6                               1.05

 2.10 PlyA -  27724  27719    6                               1.05
 2.09 Term -  32642  32428  215  0  2   37   42   258 0.898  12.51
 2.08 Intr -  46702  46627   76  1  1   51   75    33 0.298  -3.53
 2.07 Intr -  46790  46737   54  2  0   66   63    87 0.408   2.26
 2.06 Intr -  60870  60710  161  1  2  126   39    79 0.266   5.59
 2.05 Intr -  64477  64359  119  0  2  100   22    81 0.166   1.89
 2.04 Intr -  64802  64705   98  2  2  125   40    53 0.110   2.09
 2.03 Intr -  76268  76098  171  2  0   85   47   126 0.005   7.42
 2.02 Intr -  81074  80977   98  0  2   76   34   112 0.260   3.41
 2.01 Init -  81301  81256   46  1  1   49   58    47 0.563  -1.30
 2.00 Prom -  81410  81371   40                              -5.95

 3.00 Prom +  87587  87626   40                              -5.95
 3.01 Init +  88052  88099   48  1  0   43   75    47 0.399  -0.10
 3.02 Intr +  88649  88856  208  1  1   22   91   218 0.536  13.23
 3.03 Intr +  90099  90121   23  1  2   71   37    17 0.068  -8.76
 3.04 Intr + 100004 100201  198  1  0   22   95   233 0.869  16.03
 3.05 Intr + 109932 110005   74  2  2   66   77    39 0.073  -2.11
 3.06 Intr + 111499 111636  138  1  0   43   44   151 0.108   4.86
 3.07 Intr + 122069 122191  123  2  0  -14   91   165 0.083   5.38
 3.08 Intr + 124838 124935   98  2  2   66   92    66 0.033   3.53
 3.09 Intr + 125984 126510  527  0  2  -60   66   642 0.042  38.93
 3.10 Intr + 129562 129687  126  0  0   67   -2   146 0.023   3.46
 3.11 Intr + 137776 137901  126  0  0   63    9   116 0.003   1.16
 3.12 Intr + 153055 153141   87  0  0   46   63   102 0.033   2.75
 3.13 Intr + 159113 159195   83  1  2   66  111    26 0.094   0.22
 3.14 Intr + 160687 160925  239  1  2  -51   14   226 0.007  -2.36
 3.15 Term + 178717 179084  368  2  2  117   42   199 0.948  11.98
 3.16 PlyA + 179539 179544    6                               1.05

 4.00 Prom + 181560 181599   40                              -9.55
 4.01 Sngl + 182067 182606  540  2  0   53   37   538 0.555  40.63
 4.02 PlyA + 183008 183013    6                               1.05

 5.03 PlyA - 184125 184120    6                               1.05
 5.02 Term - 188481 188216  266  1  2  -63   55   275 0.401   3.79
 5.01 Init - 189530 189440   91  2  1   41   -2   184 0.611   3.40

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 126049 126510  462  0  0   65   66   565 0.847  47.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:12535774_12737715|GENSCAN_predicted_peptide_1|101_aa
MKNKRGTGLDKVTLTLFPPNSISHKALRRENKKLLLKLVYLIGRPSKVASVNKEVTRRRA
PQRQDPSPCPKIEGFFGWLVEESSTVTTQVASLLTPKANWG

>gi568815586f:12535774_12737715|GENSCAN_predicted_CDS_1|306_bp
atgaagaataaaagaggtacaggccttgacaaggttacactgacattatttccacccaac
agtatctcccacaaagcgttaaggagggaaaataagaaactcttattaaaattagtctat
cttattggaaggcccagtaaggtagctagtgtaaataaagaagtcactaggcgtcgggcc
ccccaacgccaagacccgagcccctgtccgaaaattgagggctttttcggttggttggtt
gaagaaagcagcaccgtcaccacccaggtggcctcgctgcttacaccaaaggcgaactgg
ggataa

>gi568815586f:12535774_12737715|GENSCAN_predicted_peptide_2|345_aa
MAECGELAANSFITTGHSHHTSKPQSSSSLTLKKMGRESVTHTECEGKFASLMPESRGRG
TSSGMAERTLEAAPNASLRRRRLLLPLHDSLWVFTDALESRRDRRVRPKYVLVTFLPMAS
RGRDHNFSSWSPTPPPMLHEKSYLCSFARSPQGSSVPSQSAKMHGKAVLLFPQVPLGGEE
RVAPLVQVPTLEEREERQRGSEGKEETEKEGVKEREKERQRERGRDRDKKACCQQQQPSF
TGQGVSTAPKKTTQHSAGDPALIPLSPPNHKGHAEGERDCVQKQAYSHGIVRKLKVDKSR
KKLLVDQAEAHWPKTKEVCKLHDECLQAKKEEIIKMLSKEEETER

>gi568815586f:12535774_12737715|GENSCAN_predicted_CDS_2|1038_bp
atggcagagtgtggagaactcgcagcaaactcatttatcaccactgggcacagccaccat
acttccaaaccccaatcctcatcttcccttacactgaagaaaatgggacgagagagtgtg
actcatacagaatgtgagggaaaattcgcttccctgatgccagagtcacggggacgaggg
accagttcaggaatggcagagcggaccctggaggcggccccaaacgcctcccttcgccgc
cgccgcctcctcctccccctccatgacagtctctgggtgtttacagacgctctggagagc
cgccgagaccggagggtgaggcccaaatacgtgcttgtaaccttcctgcccatggcttcc
agaggacgagatcataacttctcttcatggtcgcccactcctccccctatgctgcatgag
aaaagctacctctgcagttttgccaggtccccgcaaggcagttcagtccccagccaaagt
gctaagatgcacggcaaggctgttctcctgtttccacaggtgcccctgggaggggaagag
agagtagctccactcgtgcaagtccctaccctagaggagagagaggagaggcagagaggc
agtgagggaaaggaagagacagaaaaagaaggagtcaaagagagagagaaagagaggcag
agagagagaggaagagacagagacaaaaaggcatgttgccagcagcagcagccttctttc
acaggacagggagtcagtacagcaccaaagaagaccactcagcacagtgccggagaccct
gccctgattcccctttctcctcccaaccacaaaggccatgcagaaggtgaaagggattgt
gttcaaaaacaagcgtattctcatggaattgtccgcaagctgaaggtagacaagtcccgc
aagaagctcctggtggaccaggctgaggcccactggcctaagactaaggaagtgtgcaaa
cttcatgacgagtgcctccaggccaagaaggaggagatcatcaagatgttgtccaaggag
gaagagaccgagagataa

>gi568815586f:12535774_12737715|GENSCAN_predicted_peptide_3|821_aa
MWMGRKSSDKCPYKTKGVSEAGYFPDPFTGLTTGASYTQPTTLNSLQEGACEQMSEGTGV
NEHGNRPAALVLAGVNFMQAQWQHPEFYSVWDKVVGGKVKKPGKRGRKPAKIDLKAKLER
SRQSARECRARKKLRYQYLEELVSSRERAICALREELEMLAFAGSPEENQLCASLPNLMF
SDIRLYRKHDWGGLRKLTVMMEDKVRAGILHGSIGTRDRGYHTLLNNRIFSRLDINVFDE
DEKKGRIQIPLKGCIPVPSLLTDGLREPGARILIKIQPQKTCQQVPIYGVLSIADLASGY
LEEAHQEVDDSVGSAFPRGGEEIVTMTVPVGAIEKEHRGHKACIKDPRCRNHFLGFFSGH
LEAQRIDDGVEPVYADGEENVDLDTWSEILKISHNLARCTTQRPPSSGELEQDERRAGNA
DEKVSTCHGDHKVVGGRLSPPTPMDDQTNQGIAEDRKQPQNPKEDAGCGHFPGFQHIVKL
LYTHESVCPFPQPNHLEAEDPNAQKQDEGARGVPESRFPLLGLVLDGVVRERFSEEVIWQ
LKPQRQEVRHNIQAEEMMSAKTLVGHEFTMSSPAAGFSITGQHVPQGSGRQGHFPMHFTH
TAQLPGYTLGCQNEMKSLSWLAAFLWGRGIPRRRSLRGPEGRTTTSSETKFFKKVKIEEV
SQFITLEPAFCLAFPLFTLRKSPLAATQVTVLLPLRLRGGCEDYRNLGWLMGRFSALSRK
TAVKMATKTSRRTGGFGETAHFRAVQCRAFRLLPGSTFNTVYLKIFPLAYSLAAAHFAEG
WRSDLYAPCSFRDSIVFFFTRRRYYSRRLRSHYLLYRSPVP

>gi568815586f:12535774_12737715|GENSCAN_predicted_CDS_3|2466_bp
atgtggatgggccgtaaatccagtgacaagtgtccttataagactaagggagtcagtgaa
gcaggatatttccctgaccctttcacgggactcacaacgggtgcctcatatactcagccc
accactctcaactccttgcaggagggagcatgtgagcaaatgagtgagggaactggagtg
aatgagcacgggaatcggccagctgctttggtgctagcaggagtgaactttatgcaggcc
caatggcagcatccagagttttacagtgtttgggacaaagtggttggaggcaaagtaaag
aagcccggtaaacgtggtcggaagccagccaaaattgacttgaaagcaaaacttgagagg
agccggcagagtgcaagagaatgccgagcccgaaaaaagctgagatatcagtatttggaa
gagttggtatccagtcgagaaagagctatatgtgccctcagagaggaactggaaatgctg
gcttttgctggctccccagaggagaaccaattgtgtgcatctcttcccaacttgatgttc
agtgacatcaggctgtacaggaagcatgactggggaggcctcaggaaacttacagtcatg
atggaagacaaagtgagagcaggcatcttgcatggcagcattgggaccagggatcggggg
taccacacgcttttaaataaccggatcttcagcaggctggacattaacgtattcgatgaa
gatgagaagaaagggagaattcagattccactcaaaggatgcatccctgtcccttccctc
ctcaccgacggtttacgagaacctggagccagaatcctgattaaaatccaaccacagaag
acttgccagcaagttccaatatatggcgtgctttccatagcagatctggctagtggttac
ttggaggaagcccaccaagaagtggatgacagtgtaggcagtgccttcccaagaggaggg
gaggaaatagttacaatgactgtcccagttggagccatagaaaaagagcacaggggtcac
aaagcctgcatcaaagacccacgatgccgcaatcattttcttggctttttctctggacac
cttgaagctcagaggatagacgatggtgtagaaccggtctatgcagatggagaggagaac
gtagatctggacacctggagtgagatattgaaaatatcgcacaaccttgcacgttgcact
acccagcgtccaccttccagtggtgaactggagcaggacgaaaggcgtgctggcaacgct
gatgagaaggtcagcacatgccatggagaccacaaagtagttggtggtagactgagtcct
cctactcctatggatgaccaaacaaaccagggaattgccgaagatagaaaacaaccacag
aatcccaaagaagatgctggctgtggccacttccccgggtttcagcacatagtgaagctg
ctttacacccatgagtctgtctgcccatttccccagcccaaccacttggaagcagaggat
cccaatgctcagaaacaggacgagggagcccgaggagttcctgagtctagatttcccctg
ctgggtctggttttagatggagtggttagagaaaggttctctgaagaggttatatggcag
ctgaaaccccaacgacaggaagtccgccataacattcaggcagaagaaatgatgagtgca
aagaccctggtgggccacgagttcaccatgagttcaccagctgcaggcttcagcatcaca
ggacagcatgttccccagggatcaggcagacaagggcattttcctatgcacttcacacat
acagcgcagctgccaggatatactttaggctgccaaaatgaaatgaagtcactttcttgg
ctagcggcatttctctggggccgaggaattccacgaagaagatctctgcgaggcccagaa
ggccgcacaactacttcttcagaaacaaaattttttaaaaaagtaaaaatagaggaagtc
agccagtttatcaccttggaaccagcgttttgtttggcttttccgcttttcactctacga
aaaagcccattggcggctacccaggttaccgtcctgttgccattgcgcctgcgcggcggt
tgtgaagattacagaaatctgggatggcttatgggacgcttctcagccctaagtaggaaa
acagcagtgaaaatggcaaccaaaacatcacgcaggactgggggttttggggaaacagct
cactttagagcagtgcagtgtagagctttccgtcttttaccagggtccacctttaacact
gtttatctgaaaattttccccctggcttactcgcttgcagctgcccactttgcagaagga
tggcgctctgatctctacgctccctgttccttcagggactccatagtattttttttcacg
cgtcgtcgctactacagcagacgcctgcgttctcattatttgctgtacagatctccggtg
ccttga

>gi568815586f:12535774_12737715|GENSCAN_predicted_peptide_4|179_aa
MSNVRVSNGSPSLERMDARQAEHPKPSACRNLFGPVDHEELTRDLEKHCRDMEEASQRKW
NFDFQNHKPLEGKYEWQEVEKGSLPEFYYRPPRPPKGACKVPAQESQDVSGSRPAAPLIG
APANSEDTHLVDPKTDPSDSQTGLAEQCAGIRKRPATDGNDPFPTIECVWGPALPAGGC

>gi568815586f:12535774_12737715|GENSCAN_predicted_CDS_4|540_bp
atgtcaaacgtgcgagtgtctaacgggagccctagcctggagcggatggacgccaggcag
gcggagcaccccaagccctcggcctgcaggaacctcttcggcccggtggaccacgaagag
ttaacccgggacttggagaagcactgcagagacatggaagaggcgagccagcgcaagtgg
aatttcgattttcagaatcacaaacccctagagggcaagtacgagtggcaagaggtggag
aagggcagcttgcccgagttctactacagacccccgcggccccccaaaggtgcctgcaag
gtgccggcgcaggagagccaggatgtcagcgggagccgcccggcggcgcctttaattggg
gctccggctaactctgaggacacgcatttggtggacccaaagactgatccgtcggacagc
cagacggggttagcggagcaatgcgcaggaataaggaagcgacctgcaaccgacggtaat
gaccctttcccaaccatagaatgtgtttggggccccgctttgcctgctggagggtgttaa

>gi568815586f:12535774_12737715|GENSCAN_predicted_peptide_5|118_aa
MRVRGSGAAAGARGRRAGSAARRERHDGRRAPEGRRTREGSPLSSDIPTAGEAAGYSGGE
SVCGEEWELSLLSQRRGEEEGGKFLRNRPARLRGGGAAARARELRATTATVRGAARYR

>gi568815586f:12535774_12737715|GENSCAN_predicted_CDS_5|357_bp
atgcgcgtccgaggctccggggccgcggccggagcgcggggcaggcgcgcggggagcgca
gcccggcgcgagcgccatgatggtcgccgcgcgccggagggtcggcgaactcgggaaggc
tctccccttagctccgatatccccacggccggggaggcggccggttactcaggtggagag
tccgtttgcggagaggagtgggagctttcgctgctttctcagcgcagaggagaggaggag
ggaggaaagtttctgagaaaccgcccagcccggctgcgcggcggaggcgcggccgcccgg
gcgcgggaactgcgcgcgacgacggcgacagtgcggggggctgcacgttacagatga