GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:37:30 Sequence gi568815587f:114300672_114507804 : 207133 bp : 39.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11330 11630 301 1 1 57 85 339 0.009 27.96 1.02 Intr + 15528 15713 186 1 0 78 24 165 0.027 7.84 1.03 Intr + 24846 24955 110 1 2 74 93 27 0.012 0.88 1.04 Intr + 27106 27241 136 0 1 56 102 69 0.064 4.32 1.05 Intr + 39522 39805 284 1 2 77 9 169 0.182 4.11 1.06 Intr + 46546 46686 141 0 0 66 60 87 0.178 3.33 1.07 Intr + 47172 47258 87 2 0 57 115 62 0.218 5.05 1.08 Term + 47702 48169 468 1 0 62 42 229 0.960 9.69 1.09 PlyA + 49100 49105 6 1.05 2.05 PlyA - 49724 49719 6 1.05 2.04 Term - 63496 63437 60 1 0 92 49 52 0.561 -1.47 2.03 Intr - 65522 65329 194 0 2 120 66 136 0.739 12.89 2.02 Intr - 70910 70833 78 0 0 45 72 105 0.469 3.20 2.01 Init - 76123 76039 85 1 1 19 71 98 0.284 2.23 2.00 Prom - 76640 76601 40 -4.55 3.00 Prom + 82062 82101 40 -5.75 3.01 Init + 98885 98972 88 1 1 77 98 47 0.001 5.45 3.02 Intr + 99346 99578 233 2 2 53 6 221 0.000 6.97 3.03 Intr + 99983 100366 384 1 0 -6 26 266 0.000 5.12 3.04 Intr + 101027 101189 163 1 1 88 93 76 0.770 6.73 3.05 Intr + 102157 102244 88 2 1 64 90 16 0.509 -2.49 3.06 Intr + 105035 105128 94 2 1 67 66 42 0.462 -1.05 3.07 Intr + 113285 113453 169 1 1 107 70 123 0.277 11.00 3.08 Intr + 115869 115971 103 1 1 13 105 94 0.182 1.91 3.09 Intr + 125020 125107 88 1 1 102 15 88 0.005 1.95 3.10 Intr + 138795 139004 210 2 0 19 105 302 0.933 23.19 3.11 Intr + 139985 140068 84 1 0 66 62 67 0.665 1.00 3.12 Intr + 143185 143262 78 0 0 97 87 92 0.985 8.83 3.13 Intr + 143870 143981 112 1 1 42 98 103 0.867 5.73 3.14 Intr + 145308 145416 109 1 1 61 110 83 0.996 6.22 3.15 Intr + 147155 147208 54 2 0 45 97 78 0.712 1.58 3.16 Intr + 149175 149295 121 0 1 77 84 148 0.932 12.88 3.17 Intr + 152561 152690 130 1 1 58 52 45 0.094 -2.75 3.18 Intr + 166186 166278 93 2 0 70 94 69 0.457 4.72 3.19 Term + 173847 173953 107 1 2 23 44 161 0.526 2.79 3.20 PlyA + 174026 174031 6 1.05 4.00 Prom + 182805 182844 40 -2.75 4.01 Init + 188356 188622 267 0 0 68 41 197 0.201 10.03 4.02 Intr + 196693 196761 69 0 0 60 115 49 0.325 3.26 4.03 Intr + 203674 203813 140 0 2 36 80 116 0.099 3.94 4.04 Term + 205827 206286 460 0 1 -4 48 213 0.062 1.68 4.05 PlyA + 206294 206299 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 19008 19173 166 0 1 125 38 93 0.810 4.41 S.002 Sngl - 99660 99289 372 0 0 59 41 370 0.997 25.27 S.003 Init + 100001 100366 366 1 0 83 26 264 0.818 16.85 S.004 Sngl + 205849 206286 438 0 0 65 48 203 0.895 10.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:114300672_114507804|GENSCAN_predicted_peptide_1|570_aa MTGVENNVCGFVFFRVKGPEKEEKLRQAVKQVLKCDVTQSQPLGAVPLPPADCVLSTLCL DAACPDLPTYCRALRNLGSLLKPGGFLVIMDALKSSYYMIGPTFWGNVQQSHFKQTFVMQ EAASLSVTSWGESGEKKAIKKVVNPFGVMQLSLRDSHHLAVKSLRALHMSPPSWIHQEGK RSKPLYGRKDFLFTDETSKEIRGASLECLWLEGDSRMSPVRAISILQSNLSDLSRNRKAN ISGEVHVNWKSATTKNVTQTGLNVKENLLIYKPEKPKDIRLQSRLDSATQGQQRPRRLFI SPLCVTQHVFSPSGHKMAASSFQRCDVLAHIQRDENVPQESPSAVWKLGDDRSCYFLLNC GVEGAPQLRFPREGSLQCRSARMGWRDWVEVVMPPMPIPSPPSTPTPIMSGRTVQTSGLI PSSSRGQQTGIPSPIFKYIMAPALANMSFPSKGVGLSGIIRKACEKNKVPQSQFRGCEKY LVTGCPRTPLMVTDLEDSCLGRKSKTEKAAPVSRRKLVSWPSMVKLIWGSVRVMACSGAC RRHPQSCCWTILLVASGPEDLLPQGQCAVQ >gi568815587f:114300672_114507804|GENSCAN_predicted_CDS_1|1713_bp atgactggagtggaaaacaatgtctgtgggtttgtgtttttcagagtcaagggtccagag aaggaggagaagttgagacaggcggtcaagcaggtgctgaagtgtgatgtgactcagagc cagccactgggggccgtccccttacccccggctgactgcgtgctcagcacactgtgtctg gatgccgcctgcccagacctccccacctactgcagggcgctcaggaacctcggcagccta ctgaagccagggggcttcctggtgatcatggatgcgctcaagagcagctactacatgatt ggtcctactttctggggaaatgttcaacagtctcactttaagcaaacctttgtgatgcag gaagctgcaagtctctctgtaacatcctggggagagagtggggagaagaaagctattaaa aaggtcgttaacccatttggtgtcatgcagctcagtcttcgtgattcccatcatctggct gtcaaaagccttcgggctttgcacatgtcacccccgtcctggatccaccaggaaggtaaa cggtcaaaaccattgtatgggaggaaggattttctgtttacagatgaaacaagtaaggag attaggggagccagcttagaatgtctgtggctggaaggagactctagaatgtctccagtg agagccatttccatacttcagagcaacttgtcagatctttccagaaacagaaaggccaac atctctggagaagtgcatgttaattggaaatctgcaactaccaaaaatgtcactcaaact ggcttgaacgttaaagaaaatttattgatttacaaacctgaaaagcctaaagatatcagg cttcagtcaagacttgattcagcaactcaagggcagcaaagaccccggaggcttttcatt tctcctctctgcgttacacaacatgtattttctccaagtggccacaagatggctgcaagt agctttcagcgctgcgatgtccttgctcatatccaaagagacgagaatgtccctcaagaa agcccaagtgctgtttggaagttgggcgacgatcgttcttgctacttcctgctgaactgc ggggtagaaggggctccgcagttgaggtttcctcgggaggggagtcttcagtgtcgtagt gcaagaatgggttggcgggattgggtggaggtggtaatgcctccaatgccgattcctagt cctcctagtaccccaaccccaataatgagtggcaggacagtacagacttcaggattaatt ccttcctccagtagagggcagcaaacgggtattccttctcctatattcaagtatataatg gcccctgctttagctaatatgtccttccctagtaaaggagtagggctctcaggcataata agaaaggcatgtgagaaaaataaggttccccagtcacaatttagagggtgtgagaaatac ctagtgactggctgtcctaggacccctttgatggtgacagacctggaggatagttgtctg ggacgaaagagtaaaactgagaaggccgcgccagtgtccaggaggaagttagtttcctgg ccctcaatggttaagcttatctggggctctgtgagggtgatggcatgctctggcgcctgc cgcaggcaccctcagtcctgttgttggaccatcttgttagtggcctctggcccagaggac cttctccctcaggggcagtgtgccgtccagtga >gi568815587f:114300672_114507804|GENSCAN_predicted_peptide_2|138_aa MTGELWTDDRTKGALKVSIAKQAVSLLISKCENQSRVLSTSPQTEKLEVEGILLGKSHCG EDRKEHNNACHQEEEAPDVNPLHGLQASGSPGKTQRIKSQALEGTRLYSHSEETRSGLIL PPSGNLRPTPFDLIRGRL >gi568815587f:114300672_114507804|GENSCAN_predicted_CDS_2|417_bp atgacaggagagctttggactgatgaccgcacaaagggtgctcttaaagtgagtattgcc aagcaagctgtcagtcttttgataagtaagtgtgagaaccagtcgagagtgttaagcacc agtccacagactgagaaattggaagtagaaggaatactgctgggaaagtcccactgtggg gaggataggaaagagcataacaatgcctgccaccaggaggaagaggccccagatgtcaac ccattgcatggtttgcaggcatcgggatcaccgggaaagactcaaagaatcaagtcacag gcactggagggaacaaggctttactcacatagcgaagagacaagatcaggcttaatactc cctccatcaggaaacttaagaccaactccatttgatctgatcagaggacgtctataa >gi568815587f:114300672_114507804|GENSCAN_predicted_peptide_3|835_aa MEDGKCQGKDGRINQCNKKEHFFFANRSKAHFLGDRVQRQGLGKVRIWLGGSGPGHLVPL GSGRHHPLDGSRPERLVAPPAEVLDESLQAWKRGSRLRRPSPTLTPAGGGDAEMGAAAAE ADRTLFVGNLETKVTEELLFELFHQVSGWVRPFAFRFPSRLGPGQRPPRFLFVAVRGPDG WLFGGERRVWVAKRSLGVAFLEDLGSFRPKVTTKGGGGILDQVSRGRDPGVLGNTAGPVI KVKIPKDKDGKPKQFAFVNFKHEVSVPYAMNLLNGIKLYGRPIKIQFRSGSSHAPQDVSL SYPQHHVGNSSPTSTSPSSRYERTMDNMTSSAQIIQRSFSSPENFQRQAVVVSSNLYTDQ LSHEYGRKPAGLREFSPCATYRFSVLCTGHSSCLCFAGLSAVSSELTIGLHISLAQNTEK FEEMCQQLEELLVGCCHMPTRHRSFTPLPPPLQTHEEYCQATANIHIRPKRLLRLRQRRL RDWGRGCWSRVMLGGSLGSRLLRGVGGSHGRFGARGVREGGAAMAAGESMAQRMVWVDLE MTGLDIEKDQIIEMACLITDSDLNILAEGPNLIIKQPDELLDSMSDWCKEHHGKSGLTKA VKESTITLQQAEYEFLSFVRQQTPPGLCPLAGNSVHEDKKFLDKYMPQFMKHLHYRIIDV STVKELCRRWYPEEYEFAPKKAASHRALDDISESIKELQFYRNNIFKKKIDEKKRKIIEN GENEKTRDFVDLRTDSSNYLPIRVMQWNNFIQALGKTKTTLYNALLKHSRGRWSFGSDDV APIAELSWRLKHVSSDPCQFLYQVRLTVIIGFSHSAVIDDFDESSSGGGMRTKPD >gi568815587f:114300672_114507804|GENSCAN_predicted_CDS_3|2508_bp atggaagatggtaaatgccaaggaaaagatggaaggataaatcagtgtaataaaaaggag cacttctttttcgccaacagaagtaaagcacacttcttaggagatcgggttcaacggcag ggattgggtaaggtgagaatctggcttggcggctccggccccggccatctggttcccttg ggctccggccgccaccatccactcgacggctctcggcccgaacgcttggtcgcaccgcct gccgaggtcctagatgaatcgcttcaggcctggaaacgaggaagccgtctccggagacca tcgccaacgctgacgcccgcgggagggggcgacgctgagatgggggcggcggcggcggaa gcggatcgcactctctttgtgggcaaccttgaaacgaaagtgaccgaggagctccttttc gagcttttccaccaggtaagcggctgggttcggccctttgcctttcgttttccgtctcgc ctagggcctggccagcggccaccccgttttcttttcgtagccgtcaggggacccgacggg tggctgtttgggggtgaaaggcgggtctgggttgcgaaacgctcgctgggtgtcgctttc ctggaagatcttggttcgtttaggccgaaagtgacgactaaaggtggtggagggatcctc gatcaggtttcccgtggtagagatccaggggtccttgggaacacagctgggccagtaata aaggtgaaaattccaaaagataaggatggtaaaccaaagcagtttgcgtttgtgaatttc aaacatgaagtgtctgttccttatgcaatgaatctacttaatggaatcaaactttatgga aggcctatcaaaattcaatttagatcaggaagtagtcatgccccacaagatgtcagtttg tcatatccccaacatcatgttggaaattcaagccctacctccacatctcctagcagcagg tacgaaaggactatggataacatgacttcatcagcacagataattcagagatctttctct tctccagaaaattttcagagacaagcagtggtagtttcttcaaacctgtatactgatcag ctgtctcatgaatacgggagaaaacctgcaggtctccgagagttctctccctgtgcaact tatcgcttttcagtgctgtgtaccggccattccagctgcctttgtttcgctggactctca gctgtgtcttctgaactcactatagggctgcacatatctttggcccagaacacagagaag tttgaggagatgtgtcagcagctggaggagctgctggttgggtgctgccacatgccaaca aggcacaggagcttcaccccactgccaccaccactacagacccatgaggagtactgccag gctaccgccaatattcacataaggcccaagcgactattgcgcctgcgccagcgccggctg cgagactggggccgtggctgctggtcccgggtgatgctaggcggctccctgggctccagg ctgttgcggggtgtaggtgggagtcacggacggttcggggcccgaggtgtccgcgaaggt ggcgcagccatggcggcaggggagagcatggctcagcggatggtctgggtggacctggag atgacaggattggacattgagaaggaccagattattgagatggcctgtctgataactgac tctgatctcaacattttggctgaaggtcctaacctgattataaaacaaccagatgagttg ctggacagcatgtcagattggtgtaaggagcatcacgggaagtctggccttaccaaggca gtgaaggagagtacaattacattgcagcaggcagagtatgaatttctgtcctttgtacga cagcagactcctccagggctctgtccacttgcaggaaattcagttcatgaagataagaag tttcttgacaaatacatgccccagttcatgaaacatcttcattatagaataattgatgtg agcactgttaaagaactgtgcagacgctggtatccagaagaatatgaatttgcaccaaag aaggctgcttctcatagggcacttgatgacattagtgaaagcatcaaagagcttcagttt taccgaaataacatcttcaagaaaaaaatagatgaaaagaagaggaaaattatagaaaat ggggaaaatgagaagaccagagattttgtggatctgaggacagactccagtaactaccta cctattagagtcatgcaatggaacaacttcatccaagctcttggaaagacaaagacaact ttgtacaatgccctgttgaaacactcaaggggaagatggtcatttggcagtgatgatgtg gccccgattgcagaattgagttggaggctgaagcatgtgagctctgatccttgccagttc ctgtatcaagtaagacttacagttatcattggatttagccattcagcagtgattgatgac tttgatgagagcagttctggtggagggatgcggacaaagcctgattga >gi568815587f:114300672_114507804|GENSCAN_predicted_peptide_4|311_aa MTTDPPEIQITIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIE AIINSLPTKKSPGPDGFTAEFYQRYKEELAFAFTHHSLTQSEQLPVLQAPFMRRHPDLCH PKPMPPPVQQHTQSPAGPPHTLTNCLASIAVVNAHQGSSKCKRTEIMKDRLLDHITIKLE IKTKKFTHNYTITWKLNNLLLNGFWVNNEIKAEIKKFFETNEDKDITYQNLWDTAKAVSR GKLITLNTHIKKLDRSQINNLTSQLKELENQKQTNPKASRRQEITKIRAELKEIETQKNP AKDQQIQELVF >gi568815587f:114300672_114507804|GENSCAN_predicted_CDS_4|936_bp atgaccactgatcctccagaaatacaaattaccatcagagaatactataaacacctctat gcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacaccctccca agactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaattgag gcaataattaatagcttaccaaccaaaaaaagtccgggaccagatggattcacagccgaa ttctaccagaggtacaaagaggagctggccttcgcatttactcaccactcactgactcag tcagagcaacttccagtcctgcaagctccatttatgagaaggcatccagacctgtgccac cccaagccaatgccacctccagtgcaacagcacacacagtctccagcagggcccccccac accctcaccaactgccttgcctctatcgctgtggtgaacgcccaccagggaagcagcaaa tgcaaaagaactgaaatcatgaaagacagactcttggaccacatcacaatcaaattagaa atcaagactaagaaattcacccacaactatacaattacatggaaattgaataacctgcta ctgaatggcttttgggtaaataatgaaattaaggcagaaatcaagaagttctttgaaact aatgaggacaaagatatcacataccagaatctctgggacacagctaaggcagtgtcaaga gggaaattaataacactaaacacccatatcaaaaagttagataggtctcaaattaacaac ctaacatcacaactaaaagaattagagaaccagaagcaaacaaatcccaaagctagcaga aggcaagaaataaccaaaatcagagctgaactgaaggagattgagacacaaaaaaaccct gcaaaagatcaacaaatccaggagctggttttttga