GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:32:49 Sequence gi568815597r:9750721_9972064 : 221344 bp : 48.19% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 961 753 209 0 2 60 95 440 0.973 40.62 1.07 Intr - 4589 4394 196 0 1 81 97 242 0.815 22.87 1.06 Intr - 5790 5761 30 1 0 106 54 44 0.160 0.90 1.05 Intr - 22674 22552 123 1 0 56 91 218 0.875 19.46 1.04 Intr - 27247 27218 30 2 0 138 77 -2 0.554 1.80 1.03 Intr - 41924 41836 89 1 2 54 50 97 0.002 2.11 1.02 Intr - 42868 42769 100 2 1 75 71 58 0.001 1.97 1.01 Init - 73013 72923 91 2 1 64 99 316 0.299 28.95 1.00 Prom - 74895 74856 40 -6.66 2.03 PlyA - 75586 75581 6 1.05 2.02 Term - 82973 82921 53 0 2 62 54 69 0.392 -1.51 2.01 Init - 88462 88345 118 1 1 87 90 79 0.717 8.37 2.00 Prom - 92924 92885 40 -1.56 3.21 PlyA - 97576 97571 6 1.05 3.20 Term - 100056 99998 59 1 2 103 37 72 0.647 1.45 3.19 Intr - 101320 101219 102 2 0 88 64 61 0.523 3.85 3.18 Intr - 108050 107965 86 1 2 85 82 47 0.218 3.26 3.17 Intr - 108590 108485 106 0 1 34 99 39 0.217 -1.13 3.16 Intr - 112401 112228 174 1 0 54 99 34 0.122 1.01 3.15 Intr - 120557 120467 91 2 1 104 96 190 0.922 20.97 3.14 Intr - 121368 121249 120 0 0 105 92 114 0.985 14.19 3.13 Intr - 121672 121533 140 2 2 30 72 84 0.889 1.18 3.12 Intr - 122073 122004 70 0 1 65 75 95 0.654 4.65 3.11 Intr - 127269 127185 85 2 1 126 110 11 0.211 6.92 3.10 Intr - 133018 132985 34 2 1 104 92 25 0.221 1.78 3.09 Intr - 138977 138859 119 1 2 65 36 66 0.020 -0.79 3.08 Intr - 142344 142289 56 0 2 106 98 24 0.021 2.98 3.07 Intr - 159469 159375 95 2 2 82 115 59 0.504 7.68 3.06 Intr - 181252 181171 82 1 1 56 97 80 0.981 4.91 3.05 Intr - 184140 184042 99 0 0 103 83 67 0.990 8.01 3.04 Intr - 184907 184772 136 1 1 57 119 124 0.804 12.97 3.03 Intr - 192841 192629 213 0 0 60 96 181 0.830 13.93 3.02 Intr - 201749 201681 69 1 0 136 38 93 0.580 7.40 3.01 Intr - 210089 209994 96 1 0 115 87 22 0.655 3.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 35616 35635 20 2 2 72 100 51 0.823 4.21 S.002 Intr + 41634 41743 110 0 2 65 94 97 0.897 7.93 S.003 Term - 171638 171535 104 0 2 85 42 101 0.887 3.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:9750721_9972064|GENSCAN_predicted_peptide_1|290_aa MLRRPAPALAPAARLLLAGLLCGGGVWAARESFRMASIHEPSVRPLFSVGECIKLFHSKP VSARDTVAKGMVPDTRHPWLESVSLVINTLQDGCVGTEGQSEVVNKHKPWLEPTYHGIVT ENDNTVLLDPPLIALDKDAPLRFAESFEVTVTKEGEICGFKIHGQNVPFDAVVVDKSTGE GVIRSKEKLDCELQKDYSFTIQAYDCGKGPDGTNVKKSHKATVHIQVNDVNEYAPVFKEK SYKATVIEGKQYDSILRVEAVDADCSPQFSQICSYEIITPDVPFTVDKDX >gi568815597r:9750721_9972064|GENSCAN_predicted_CDS_1|870_bp atgctgcgccgccccgctcccgcgctggccccggccgcccggctgctgctggccgggctg ctgtgcggcggcggggtctgggccgcgcgagaaagctttcgaatggcctccattcatgaa ccttctgtgaggccattgttctctgttggggagtgcatcaaattgttccacagcaaacct gtcagcgccagggataccgtagccaagggcatggttcccgacacccgacacccgtggctg gagtctgtgtccctggtaattaataccctacaggatggctgtgttggaacagagggacag agcgaggttgttaacaagcacaagccctggctggagcccacctaccacggcatagtcaca gagaacgacaacaccgtgctcctcgaccccccactgatcgcgctggataaagatgcgcct ctgcgatttgcagagagttttgaggtgacagtcaccaaagaaggtgagatttgtggattt aaaattcacgggcagaatgtcccctttgatgcagtggtagtggataaatccactggtgag ggagtcattcgctccaaagagaaactggactgtgagctgcagaaagactattcattcacc atccaggcctatgattgtgggaagggacctgatggcaccaacgtgaaaaagtctcataaa gcaactgttcatattcaggtgaacgacgtgaatgagtacgcgcccgtgttcaaggagaag tcctacaaagccacggtcatcgaggggaagcagtacgacagcattttgagggtggaggcc gtggatgccgactgctcccctcagttcagccagatttgcagctacgaaatcatcactcca gacgtgccctttactgttgacaaagatgnn >gi568815597r:9750721_9972064|GENSCAN_predicted_peptide_2|56_aa MPKSSYYGSRSLIVTSRQTATRGPTPASVSHLADTACDRGSFKVKEKAPKSQRSLI >gi568815597r:9750721_9972064|GENSCAN_predicted_CDS_2|171_bp atgccgaagagcagttactatggcagccgatctctcatcgtgacttctagacagacagca acgcggggccccacccctgcttccgtgtcccacctggcagacactgcctgcgacagaggg agcttcaaggtcaaggaaaaagcaccaaaaagccagagaagcctcatctga >gi568815597r:9750721_9972064|GENSCAN_predicted_peptide_3|677_aa XCSGPSVTMIHFCLAHTAYRLHCWLLRTHQLRLFYCAVCSDGAIATGTEVDVFCIWTKSD QFLRGSLRGAYRALTVRRPECYRRSPAARTPASGTSRSHSHWFQGAGSACVQRKCSPGAV RQASALREELDTDEYEETKKETLEQLSEFNDSLKKIMSGNMTLVDELSGMQLAIQAAISQ AFKTPEVIRLFAKKQPGQLRTRLAELTADDEAFLSANAGAILSQFEKVSTDLAAARAEHA PEQAAARCAPDPLPLPARPGRAAQTGQGLVTHQEFLSRYRFPQASSLVVNISQISRYSVG AVMREEEMQADVQTQASRQPREGTDRLDNGDSTGAAPSYFCPATALPSQLSTCLPEVNFQ KQVRSLLHVDTLLAFPAACRSYPLTGRNNQQSRSSQFSWTQQLHAQRQQAGQPLQRLLEN RAAQFSLLGPAWESPEPGRGMNREGAPGKSPEEMYIQQKVRVLLMLRKMGSNLTASEEEF LRTYAGVVNSQLSQLPPHSIDQGRVALTPAAYSAPGCGSREGRFDPITPRHGFCCLELRT DARMASEKSLWDSTLPPSSPGAKLALGIGWEGLLDTLSWFRKLRLCQCPGPGFQGRYLPR YCDYVSEQNKVPDDMRCTFQWRKEERIPAVAQTTCQKDHGQPDWYFSGQSEDLAFYKGGA EDVVMAFSRSETEDRRQ >gi568815597r:9750721_9972064|GENSCAN_predicted_CDS_3|2034_bp nngtgctctgggcccagtgtcaccatgattcacttttgtttggcccacacggcctatcgc ttgcactgctggctgctgagaactcaccagctcaggctgttttactgtgcagtttgttca gatggagccattgccacgggcactgaagtggatgtcttctgcatttggacaaaaagcgac cagtttctcaggggctccttgcggggagcttaccgcgccctcactgtccgccggcccgag tgctaccggagatcaccagcggcccggacgccagcgagtggaacatctcggtcgcactct cattggttccagggggccggaagtgcgtgcgtccagcggaagtgctcccccggcgcggtc cgccaggccagtgccctcagagaggaacttgatacagatgaatatgaagaaaccaaaaag gaaactctggagcaactaagtgaatttaatgattcactaaagaaaattatgtctggaaat atgactttggtagatgaactaagtggaatgcagctggctattcaggcagctatcagccag gcctttaaaaccccagaggtcatcagattgtttgcaaagaaacaaccaggtcagcttcgg acaaggttagcagagctgactgcagatgatgaggccttcttgtcagcaaatgcaggtgct atactcagccagtttgagaaagtctctacagaccttgcggctgctcgggctgagcacgcc ccggaacaggccgccgcgcgctgcgcgccggacccgctgcccctgccggcccggccgggt cgggcggcccagactggacaaggccttgtgacccatcaggagtttttgtcccgataccgt ttcccacaagcttccagtctggtggtgaacatttctcagatatccaggtacagtgttggg gcggtgatgcgagaagaggaaatgcaggcggatgtgcagacacaggcctcgaggcagccc agagaagggaccgacagacttgacaacggtgacagcactggggcggcaccttcctacttc tgcccagccacagccctcccctcacagttgagcacctgtttgcctgaagttaatttccag aagcaggttaggtccctgctgcatgtggacacgctgctggccttcccagctgcctgcagg agctaccctctgacaggtagaaacaaccagcagtccaggtcatctcagttttcctggacg cagcagcttcatgcacagaggcagcaggcaggccagcccttgcagcgcctgctggagaac agagctgctcagttcagcctgctgggaccagcatgggagtccccagagccaggcaggggg atgaaccgcgagggagctcccgggaagagtccggaggagatgtacattcagcagaaggtc cgagtgctgctcatgctgcggaagatgggatcaaacctgacagccagcgaggaggagttc ctgcgcacctatgcaggggtggtcaacagccagctcagccagctgcctccgcactccatc gaccaggggagggtcgctttgaccccagctgcttactcagcacctggctgcggctccagg gagggtcgctttgaccccatcaccccgcgtcatgggttctgttgtttagaactgagaaca gatgcccgaatggcctcagagaagtccttgtgggactctacacttcccccctcaagccct ggggcaaagctggcattgggcataggatgggagggcctgctggataccctgtcctggttt agaaagctgcgcctttgccagtgtcctggccctggtttccagggcaggtacttgcccaga tactgtgactatgtcagtgaacaaaacaaagtccctgatgacatgagatgtacgttccag tggagaaaggaagagagaatacctgcagtagcacagacaacatgtcagaaagaccatggg cagccagactggtacttttcggggcaaagtgaagatctggccttttacaaagggggtgca gaggacgtggtgatggcgttttccaggtcggagacggaagaccggaggcagtag