GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:30:41 Sequence gi568815597r:207787741_208011080 : 223340 bp : 44.51% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 14461 14228 234 0 0 55 46 236 0.365 12.90 1.01 Init - 18323 18265 59 2 2 77 113 71 0.927 7.28 1.00 Prom - 29060 29021 40 -1.06 2.12 PlyA - 29533 29528 6 -0.45 2.11 Term - 30741 30638 104 1 2 144 48 36 0.939 3.64 2.10 Intr - 31589 31445 145 2 1 46 26 153 0.542 4.76 2.09 Intr - 32219 32123 97 1 1 114 34 49 0.846 2.11 2.08 Intr - 35646 35570 77 0 2 110 116 0 0.858 3.31 2.07 Intr - 36841 36721 121 0 1 78 72 63 0.857 4.20 2.06 Intr - 44402 44396 7 0 1 112 96 0 0.001 -4.01 2.05 Intr - 57478 57353 126 2 0 76 97 14 0.145 1.75 2.04 Intr - 62082 61966 117 1 0 98 69 60 0.361 5.54 2.03 Intr - 73048 72860 189 2 0 84 61 89 0.249 5.36 2.02 Intr - 78986 78914 73 2 1 131 84 6 0.101 3.48 2.01 Init - 81188 81105 84 2 0 90 27 165 0.349 9.42 2.00 Prom - 84681 84642 40 -5.66 3.08 PlyA - 84972 84967 6 1.05 3.07 Term - 101106 100849 258 0 0 86 43 170 0.452 7.85 3.06 Intr - 101473 101421 53 2 2 64 115 40 0.908 2.93 3.05 Intr - 101881 101725 157 1 1 109 110 51 0.898 8.98 3.04 Intr - 109833 109753 81 0 0 71 75 39 0.577 0.73 3.03 Intr - 111486 111233 254 1 2 81 98 31 0.401 0.55 3.02 Intr - 112263 112081 183 1 0 87 98 87 0.847 9.36 3.01 Init - 123340 123262 79 1 1 69 66 122 0.588 7.32 3.00 Prom - 142996 142957 40 -2.66 4.00 Prom + 160039 160078 40 -2.86 4.01 Sngl + 171548 171835 288 1 0 78 40 173 0.579 5.10 4.02 PlyA + 174178 174183 6 1.05 5.04 PlyA - 174358 174353 6 -0.45 5.03 Term - 175575 175218 358 2 1 72 42 194 0.565 7.18 5.02 Intr - 176027 175839 189 1 0 59 44 175 0.538 8.90 5.01 Init - 214118 214060 59 2 2 75 103 8 0.035 1.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 185554 185633 80 2 2 49 36 118 0.810 0.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:207787741_208011080|GENSCAN_predicted_peptide_1|98_aa MAIWAWLPAPPIWLEHTSVTSHHQHECVRRAFECGDCHILLDNNVLGVDCHGAGERAVHL EDHFVHIDTISLLLEDALEYSALIAGHPKSDLPPGLSS >gi568815597r:207787741_208011080|GENSCAN_predicted_CDS_1|294_bp atggccatctgggcttggcttccggctccccccatctggctggaacacacatcagtcacc agccatcaccagcatgaatgtgttcgtagggcctttgagtgtggcgattgtcatattctg ttggataacaatgtattgggtgtcgattgtcatggggcaggggagagggcagtacacctg gaggaccattttgtccacatcgacaccatcagtctgctcttagaggatgccctggagtat tcggcgttgattgcggggcacccgaaatcagacttgccacctggactgtcgagn >gi568815597r:207787741_208011080|GENSCAN_predicted_peptide_2|379_aa MPRAPAALRSRPTPAAARPWGAAGLAAGLGSWQLRRSWSRCRGKGMDVITGQVLAVFTSS LDSHHSDESAPGKVHGNSHDPDSSGHFFIFLTGALQFGTADHTLFLDAVSSLGFQVFMGN LGSEILRHLTILPSGSCFQAIAKAPVVQFPIESPCCSECLRGIHQTSGQAGEGEPETYPG TEPEGAFMNVLSNSSFGCRNSREKKVLVFGQKGKKQQKSAMTVILDLSVAQRVNGTVVRW TDRLVLQVVWVSASICLSVQVLCGRFTQDDGFFLVAEIDGLTDSVILVAFLVQEEGEQWL PASLTLLSPTGGELGTLKPLVTNSGAAEQSVTTASESSVDGNGDPGETVGLEFLTLLILF ASSWRLWVLGFSPYRSFAL >gi568815597r:207787741_208011080|GENSCAN_predicted_CDS_2|1140_bp atgcccagggccccggccgcgctccggtcgcgccccaccccggctgcagcgcggccttgg ggcgctgctggcctcgccgcggggctcggcagttggcagttgcgtaggagttggagtaga tgcagggggaagggcatggacgtcatcacagggcaggtccttgctgtgttcacatcctct cttgactcacaccactctgatgaaagtgctcctggcaaggtccatggtaactcccatgat cctgactctagtggtcacttcttcatcttcctcacaggagctcttcagtttggcaccgct gatcatacccttttccttgacgctgtttcttcacttggcttccaggtatttatggggaac cttggtagtgaaatacttcggcatctaacaatattgccttcaggaagttgtttccaagcc atcgctaaagctcctgtggtccagtttcccatagaatccccctgctgctcagagtgcctg agaggaatacaccagacctcagggcaggctggggagggtgaaccggagacttatcctggt acggagccagaaggagccttcatgaatgtcttgtcaaactcttcatttggctgcagaaat tccagggagaaaaaggtcctggtttttggccaaaaaggcaaaaaacaacagaaatcagct atgactgtaatcctggacttgtctgtggcacaaagagttaatgggacagtggtcaggtgg acagacagactggttctccaagttgtttgggtgtcagcgtccatctgtctgtccgtgcag gtcttatgtggaaggttcactcaagatgatggcttttttcttgtagctgaaattgatgga ttaactgactcagttatcctggttgccttcctcgtacaagaggagggagagcagtggctg cctgcttccctgacgctgctgagtcccacgggaggggagctgggcaccctgaagcccctc gttaccaattcgggggccgcggagcagagtgtgacgacagccagtgagtcgtcagtcgac ggcaatggggaccctggtgaaacagtgggccttgagttcttgacccttttaattttgttt gcgtcatcatggcggctgtgggttctaggcttttctccataccgcagctttgcgctatag >gi568815597r:207787741_208011080|GENSCAN_predicted_peptide_3|354_aa MLVRRGARAGPRMPRGWTALCLLSLLPSGFMSLDNNGTATPELPTQGTFSNVSTNVSYQE TTTPSTLGSTSLHPVSQHGNEATTNITETTVKFTSTSVITSVYGNTNSSVQSQTSVISTV FTTPANVSTPETTLKPSLSPGNVSDLSTTSTSLATSPTKPYTSSSPILSDIKAEIKCSGI REVKLTQGICLEQNKTSSCAEFKKDRGEGLARVLCGEEQADADAGAQVCSLLLAQSEVRP QCLLLVLANRTEISSKLQLMKKHQSDLKKLGILDFTEQDVASHQSYSQKTLIALVTSGAL LAVLGITGYFLMNRRSWSPTGERLVSSGGQGKGNEEDSGFLGSSVDVMEQEEKY >gi568815597r:207787741_208011080|GENSCAN_predicted_CDS_3|1065_bp atgctggtccgcaggggcgcgcgcgcagggcccaggatgccgcggggctggaccgcgctt tgcttgctgagtttgctgccttctgggttcatgagtcttgacaacaacggtactgctacc ccagagttacctacccagggaacattttcaaatgtttctacaaatgtatcctaccaagaa actacaacacctagtacccttggaagtaccagcctgcaccctgtgtctcaacatggcaat gaggccacaacaaacatcacagaaacgacagtcaaattcacatctacctctgtgataacc tcagtttatggaaacacaaactcttctgtccagtcacagacctctgtaatcagcacagtg ttcaccaccccagccaacgtttcaactccagagacaaccttgaagcctagcctgtcacct ggaaatgtttcagacctttcaaccactagcactagccttgcaacatctcccactaaaccc tatacatcatcttctcctatcctaagtgacatcaaggcagaaatcaaatgttcaggcatc agagaagtgaaattgactcagggcatctgcctggagcaaaataagacctccagctgtgcg gagtttaagaaggacaggggagagggcctggcccgagtgctgtgtggggaggagcaggct gatgctgatgctggggcccaggtatgctccctgctccttgcccagtctgaggtgaggcct cagtgtctactgctggtcttggccaacagaacagaaatttccagcaaactccaacttatg aaaaagcaccaatctgacctgaaaaagctggggatcctagatttcactgagcaagatgtt gcaagccaccagagctattcccaaaagaccctgattgcactggtcacctcgggagccctg ctggctgtcttgggcatcactggctatttcctgatgaatcgccgcagctggagccccaca ggagaaaggctggtcagttctgggggccagggtaaaggaaatgaggaagatagtgggttt ctggggagttcagtggatgtcatggagcaggaggagaaatactag >gi568815597r:207787741_208011080|GENSCAN_predicted_peptide_4|95_aa MIQNRPSALAPAPGASEPGLAVSAPSEPCGPRARARRDPLRQDPADPAAAGPSASRRGLT PTVHACNTRAVRRVRSWVCAWDSGLPALLVPHPLP >gi568815597r:207787741_208011080|GENSCAN_predicted_CDS_4|288_bp atgattcagaaccgcccgtcagccttggctccagcgcctggcgcgagcgaaccgggcttg gctgtctcggccccgtctgaaccctgcgggccgcgggccagggcgcgccgggatccgctg cgccaggaccctgcagacccggccgcggccgggccgagtgcctcccgccggggccttacg cccacggtccacgcttgcaacaccagggctgtacggcgggtccgcagctgggtctgcgcc tgggactcggggctccctgcgctcctcgtgccgcaccccttgccttaa >gi568815597r:207787741_208011080|GENSCAN_predicted_peptide_5|201_aa MGERATAQGHLRMQGSVKSSAPPLRGSGRAMAPCGHERAAAGRRNTTAPGFPSLRLRRPE ADPRRSLAKRPASASGCRGCSLHLSKVAFILYRQKFLYPEARSLGEGHWERRICRAPVTG LWTVFLLLEVSGSEQETIPELSQAPMRRKTEAAHCPGPWSLPVKSKPGHVGPLLGTLSWH PQLLEKVSAPKASRPASIVWP >gi568815597r:207787741_208011080|GENSCAN_predicted_CDS_5|606_bp atgggggagagagcaacagcccaaggtcacctcaggatgcaaggatcggtcaaatccagc gcacctccgctgcgtggaagcggccgtgccatggcgccttgcggccacgagagggcagcc gcgggccgcagaaacaccaccgcgcccggcttcccgagcctgcgcctgcgcagacccgag gccgacccgaggcggtcgctggccaagcgcccggccagcgcttccggctgccgaggctgc agccttcacctgagcaaggtcgccttcatcctttacagacagaagttcctctacccggag gcccggagcctgggagagggccactgggaacgtcgcatttgcagagccccagtaaccggg ctgtggaccgtcttcttactgcttgaagtttcaggctcagagcaggaaacgattccagag ctcagccaggcaccgatgagaagaaaaacagaagcagcacattgcccagggccatggagc ttgccagtgaagagcaaaccaggccacgtagggccgctgcttgggaccctgtcctggcac ccccagctcttggagaaagtcagcgctcccaaggcctccaggcccgccagcatcgtctgg ccctga