GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:34:03

Sequence gi568815596r:212907363_213248645 : 341283 bp : 35.70% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3641   3700   60  1  0   57   94    74 0.421   6.20
 1.02 Term +   6186   6275   90  2  0   67   55    75 0.390  -1.16
 1.03 PlyA +   6622   6627    6                               1.05

 2.03 PlyA -   6672   6667    6                               1.05
 2.02 Term -   8848   8658  191  2  2   59   49   192 0.978   9.03
 2.01 Init -  14935  14782  154  1  1   83   98    51 0.743   5.79
 2.00 Prom -  18261  18222   40                              -5.65

 3.00 Prom +  23820  23859   40                              -3.95
 3.01 Init +  26790  26797    8  2  2   69   76     0 0.151  -2.53
 3.02 Intr +  29401  29625  225  1  0   77   47   157 0.398   6.68
 3.03 Intr +  33689  33851  163  2  1   41   61   111 0.341   2.76
 3.04 Term +  45429  45554  126  2  0  -27   43   259 0.243   7.20
 3.05 PlyA +  47043  47048    6                               1.05

 4.03 PlyA -  49624  49619    6                               1.05
 4.02 Term -  66910  66819   92  2  2   93   45   101 0.655   3.20
 4.01 Init -  85464  85338  127  0  1   71   72    80 0.651   5.07
 4.00 Prom -  88133  88094   40                              -4.25

 5.07 PlyA -  89565  89560    6                               1.05
 5.06 Term - 100722  99998  725  1  2  102   47   667 0.999  56.65
 5.05 Intr - 106572 106429  144  1  0   13   94   209 0.548  13.33
 5.04 Intr - 114768 114631  138  1  0   66   64   172 0.932  12.11
 5.03 Intr - 142518 142351  168  1  0  107  119    39 0.965   7.90
 5.02 Intr - 149737 149471  267  2  0   70  107   349 0.261  31.58
 5.01 Init - 177495 177450   46  0  1   50   81    59 0.302   2.30
 5.00 Prom - 180628 180589   40                              -3.65

 6.06 PlyA - 181462 181457    6                               1.05
 6.05 Term - 191547 191377  171  0  0   72   54   154 0.952   7.24
 6.04 Intr - 199827 199714  114  0  0   87   88    82 0.847   7.82
 6.03 Intr - 236908 236829   80  2  2   49   86    56 0.007  -0.15
 6.02 Intr - 245033 244953   81  0  0   80   50   126 0.020   6.69
 6.01 Init - 257014 256957   58  1  1   63  100     0 0.201   0.22
 6.00 Prom - 258259 258220   40                              -7.25

 7.00 Prom + 258798 258837   40                              -2.55
 7.01 Init + 261356 261450   95  1  2   58   77   106 0.357   6.40
 7.02 Term + 282371 282488  118  2  1   98   48    74 0.370   1.43
 7.03 PlyA + 282725 282730    6                               1.05

 8.00 Prom + 286288 286327   40                              -6.05
 8.01 Init + 290912 291245  334  1  1   83   37   195 0.497  11.40
 8.02 Term + 298895 299013  119  0  2   80   38   122 0.924   4.12
 8.03 PlyA + 299607 299612    6                               1.05

 9.02 PlyA - 300060 300055    6                               1.05
 9.01 Term - 307278 306991  288  0  0   77   43   294 0.974  18.19

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_1|49_aa
MTYRMTAQLHATERRGCPLKNRYKIHTQKKGNGADTQDWRKTGLQIEKG

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_1|150_bp
atgacttatcggatgacagcacaacttcatgccacagaaagaagaggatgccctttaaag
aatcgttataagatccacacccaaaaaaagggcaatggagctgacacgcaggactggaga
aaaacaggactccaaattgagaaggggtga

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_2|114_aa
MESLKSKLLTLENKKYNWEIRKQTNKENPKTRVKQYRKQTSDCVNDKDFMEGPIHTSVMM
EGGGSGNLGRVGITIVFSSREAGTQKQRELLGAFAQWKFAEITETALNVPKSWN

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_2|345_bp
atggaatctttgaaaagcaagcttctaactttagaaaacaagaaatataattgggaaatt
aggaaacaaacaaacaaagaaaaccccaagacaagagtaaagcaatacaggaaacagacc
tcagactgtgtaaatgacaaagacttcatggaaggacccatccatacatctgttatgatg
gaaggaggtggtagtggaaatctgggaagggttggtatcaccattgtgttcagcagcagg
gaagcaggaacccagaaacagagggagctcctgggagcttttgcccagtggaaatttgca
gaaatcacagaaacagctctgaatgtacccaagtcttggaattag

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_3|173_aa
MGRTSVIISGKAGRDREQARGDWQPPEEPGTAVVVLHVWWRKVTTFGKVRNAEQASYLIP
LLEIKMENSFYGLSAFSSQEGNGLPVAVTKRLMANKPSYLHEVLPRSIRLTPAFTVVLLK
FGKQAGISGKSQKQLAKKIGMEHTEKGEKNQKHVSLNDDNDDDDDDDDDDDDD

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_3|522_bp
atgggaagaacgtcagtcatcatcagcgggaaggcaggcagggacagggagcaggcacgg
ggggactggcagccacctgaagaaccgggcacagccgtggttgttctccatgtctggtgg
cggaaagtcacaacatttggaaaagtcagaaatgcagagcaagctagctacttaattcct
ttactggaaataaagatggaaaacagtttttatggcttgtcagctttcagcagccaagaa
ggaaatggattacctgtggctgtgactaagagacttatggccaacaagccaagttacctg
catgaggttctgccaagaagtattagactaactccagctttcacagtggttttactcaag
tttggaaaacaggcaggaatatctggaaaaagtcagaagcaactagcaaagaagataggg
atggagcacacagagaagggggagaaaaaccaaaaacatgtgtccttgaatgatgataat
gatgatgatgatgatgatgatgacgacgacgacgacgactag

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_4|72_aa
MERYRFQRYSKGGKYWTALCWMYVGSEEGGLIKDDILVSDWEVKAHNDGSVTERGPVTLE
DESSRKNLQWNP

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_4|219_bp
atggagagatacagattccaaagatattcaaaaggtggaaagtactggactgctctgtgc
tggatgtatgtcggtagtgaggaaggggggttaattaaggatgatatcctggtttctgac
tgggaagtgaaggcccacaatgatgggtcagtaactgagaggggccctgtgaccctagaa
gatgaatcctcaaggaagaatctccagtggaatccctga

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_5|495_aa
MRTSNAKGYYMSKESSNSVKLEMQSDEECDRKPLSREDEIRGHDEGSSLEEPLIESSEVA
DNRKVQELQGEGGIRLPNGKLKCDVCGMVCIGPNVLMVHKRSHTGERPFHCNQCGASFTQ
KGNLLRHIKLHSGEKPFKCPFCSYACRRRDALTGHLRTHSVGKPHKCNYCGRSYKQRSSL
EEHKERCHNYLQNVSMEAAGQVMSHHVPPMEDCKEQEPIMDNNISLVPFERPAVIEKLTG
NMGKRKSSTPQKFVGEKLMRFSYPDIHFDMNLTYEKEAELMQSHMMDQAINNAITYLGAE
ALHPLMQHPPSTIAEVAPVISSAYSQVYHPNRIERPISRETADSHENNMDGPISLIRPKS
RPQEREASPSNSCLDSTDSESSHDDHQSYQGHPALNPKRKQSPAYMKEDVKALDTTKAPK
GSLKDIYKVFNGEGEQIRAFKCEHCRVLFLDHVMYTIHMGCHGYRDPLECNICGYRSQDR
YEFSSHIVRGEHTFH

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_5|1488_bp
atgaggacatccaatgcaaagggatactacatgagcaaagagagttcaaattcagtaaag
ctagaaatgcagagtgatgaagagtgtgacaggaaacccctgagccgtgaagatgagatc
aggggccatgatgagggtagcagcctagaagaacccctaattgagagcagcgaggtggct
gacaacaggaaagtccaggagcttcaaggcgagggaggaatccggcttccgaatggtaaa
ctgaaatgtgacgtctgtggcatggtttgcattgggcccaatgtgcttatggtacataaa
aggagtcacactggtgaacgccccttccactgtaaccagtgtggagcttcttttactcag
aagggcaaccttctgagacacataaagttacactctggagagaagccgttcaaatgtcct
ttctgtagctacgcctgtagaagaagggacgccctcacaggacacctcaggacccattct
gtgggtaaacctcacaagtgcaactactgtggacgaagctacaagcagcgcagttcactg
gaggagcacaaggaacgctgccacaactatctccagaatgtcagcatggaggctgctggg
caggtcatgagtcaccatgtacctcctatggaagattgtaaggaacaagagcctattatg
gacaacaatatttctctggtgccttttgagagacctgctgtcatagagaagctcacgggg
aatatgggaaaacgtaaaagctccactccacaaaagtttgtgggggaaaagctcatgcga
ttcagctacccagatattcactttgatatgaacttaacatatgagaaggaggctgagctg
atgcagtctcatatgatggaccaagccatcaacaatgcaatcacctaccttggagctgag
gcccttcaccctctgatgcagcacccgccaagcacaatcgctgaagtggccccagttata
agctcagcttattctcaggtctatcatccaaataggatagaaagacccattagcagggaa
actgctgatagtcatgaaaacaacatggatggccccatctctctcatcagaccaaagagt
cgaccccaggaaagagaggcctctcccagcaatagctgcctggattccactgactcagaa
agcagccatgatgaccaccagtcctaccaaggacaccctgccttaaatcccaagaggaaa
caaagcccagcttacatgaaggaggatgtcaaagctttggatactaccaaggctcctaag
ggctctctgaaggacatctacaaggtcttcaatggagaaggagaacagattagggccttc
aagtgtgagcactgccgagtccttttcctagaccatgtcatgtacaccattcacatgggt
tgccatggctaccgggacccactggaatgcaacatctgtggctacagaagccaggaccgt
tatgagttttcatcacacattgttcgaggggagcacacattccactag

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_6|167_aa
MSYVTNTVFPLLSDYYVCRRPLLSRFRSRSMRCAFRNGAAELPAETTVKWINCTVNAIYQ
SDGQVACMESEEVPECFVMVFHCVSEGTLSVVSDSEQANLLIIDNGPVDKQALDNGETSA
CMCWILLVGSSSSRMTTEVTSAPWKAGKLELEQLSQSPDKQVLTGTE

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_6|504_bp
atgagctatgtaaccaacactgtgtttcctcttttatctgattactatgtttgcagaaga
cccctcttatcccgcttcaggtcccggtcaatgcggtgcgctttccgaaatggggccgca
gagttgccggcggagaccactgttaagtggatcaactgtacagtcaatgctatttatcag
tctgatggccaggtggcatgtatggaatcggaagaagtacctgagtgctttgtgatggtc
tttcactgtgtgtctgagggaaccctaagtgttgtttctgactctgagcaggctaatttg
ctcatcatagacaatggacctgttgacaaacaggcactggataatggagaaacaagtgcc
tgcatgtgttggattttgttagttggatctagctccagcagaatgacaactgaggtcacg
tcagcaccttggaaggccgggaagctggaactggagcagctgagccaatccccagacaaa
caggttttaaccgggacagagtga

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_7|70_aa
MAAIGYVILDLSMKPISVIEVTNTRNCESQERALRLKPFFLEEWGGEELVPFVAKRLKLS
RVTFAAIENQ

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_7|213_bp
atggcagcaattggatatgtgatcttggacctgagtatgaagccaatatcggtgatagag
gtaacaaacacacggaattgtgagtctcaagaaagggccttgaggcttaagcctttcttt
ctagaagaatggggaggggaggagctagtcccatttgttgctaagcgactaaagctttct
cgggtcacgtttgcagctattgagaatcagtga

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_8|150_aa
MEEIQCNQPIKNGFLISPKNDAVSSVDVCCWQVDHSAVAVVILILAEREAIRLSACTDST
MTIIIFALLWLRRESRSVDIHRKSHTVLCSGYALVCTGVRHKSSCFVPSVTAFCNKSMIQ
VTTSSRKMRDIGNRPGPIAEFGPNVQSSHV

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_8|453_bp
atggaggagatccaatgtaatcaacctatcaagaatggctttcttatctccccaaagaat
gatgctgtatcaagtgttgatgtctgctgctggcaggttgatcattcagcagtggcagta
gttatcctcatccttgctgagagggaggctatacggttgagtgcatgtacagactccacc
atgacaattatcatttttgcactactatggctaagaagagagagtcggtcagttgacatc
cacaggaaaagtcatactgttctttgtagtggatatgctctggtatgcacaggtgttaga
cacaaatcctcatgctttgtgcccagtgtgacagctttctgcaataagagtatgattcag
gtaaccactagttcaaggaagatgagagatattggaaacagacctggacccattgcagag
tttggacccaatgttcagtccagccatgtctag

>gi568815596r:212907363_213248645|GENSCAN_predicted_peptide_9|95_aa
RVSEAGSEVELLATPGALSEYARHLQDPLVSIDLSERWRSWVSSLVDFAAPRIPVLSSPS
FFKQISDPNWVWSNDRNWTGSRNGFDLGINWLGSS

>gi568815596r:212907363_213248645|GENSCAN_predicted_CDS_9|288_bp
agggtgtcagaagcgggatctgaagtggagcttctagcgaccccaggagccctgagtgaa
tatgcaaggcacctgcaggacccactggtgtccattgatctctcagagcgctggagatcg
tgggtaagctccctcgtggatttcgcagctccacggattcctgttttgagctctccgagt
ttctttaagcaaatttctgatccaaactgggtttggagtaacgacagaaactggactgga
tccaggaatggatttgatttgggaattaactggcttggatccagttag