GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:57:02 Sequence gi568815576f:25121430_25331769 : 210340 bp : 46.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 387 382 6 1.05 1.02 Term - 5231 4882 350 0 2 84 49 183 0.946 8.65 1.01 Init - 14487 14436 52 0 1 65 103 33 0.519 3.83 1.00 Prom - 22393 22354 40 -4.46 2.00 Prom + 23771 23810 40 -2.46 2.01 Init + 24758 24844 87 1 0 78 62 31 0.080 0.46 2.02 Term + 39131 39178 48 1 0 107 48 86 0.667 3.80 2.03 PlyA + 41311 41316 6 1.05 3.00 Prom + 44933 44972 40 -3.16 3.01 Init + 48954 49139 186 2 0 56 91 56 0.435 -0.23 3.02 Intr + 49391 49509 119 1 2 90 75 154 0.918 13.56 3.03 Intr + 52811 53060 250 2 1 89 95 182 0.984 16.44 3.04 Intr + 55919 56093 175 1 1 71 67 149 0.250 10.61 3.05 Intr + 60216 60394 179 1 2 22 76 138 0.187 5.64 3.06 Intr + 63548 63690 143 1 2 82 78 94 0.912 6.95 3.07 Term + 70280 70403 124 2 1 95 41 121 0.804 5.96 3.08 PlyA + 70588 70593 6 1.05 4.00 Prom + 75510 75549 40 -5.26 4.01 Init + 79968 80042 75 2 0 98 79 59 0.960 7.11 4.02 Intr + 81245 81363 119 1 2 53 78 194 0.999 14.16 4.03 Intr + 82334 82466 133 2 1 115 68 104 0.997 11.75 4.04 Intr + 83791 83933 143 0 2 72 77 215 0.992 17.95 4.05 Term + 85618 85783 166 1 1 86 48 359 0.999 29.09 4.06 PlyA + 85899 85904 6 1.05 5.00 Prom + 94549 94588 40 -5.86 5.01 Init + 94909 94915 7 0 1 108 28 0 0.313 -2.98 5.02 Intr + 95370 95421 52 1 1 85 61 34 0.409 -1.63 5.03 Intr + 98134 98237 104 1 2 96 99 57 0.857 7.42 5.04 Intr + 99975 100054 80 1 2 129 98 63 0.947 10.67 5.05 Intr + 103489 103607 119 0 2 104 103 110 0.999 13.36 5.06 Intr + 106424 106556 133 2 1 101 77 157 0.963 16.55 5.07 Intr + 108007 108149 143 0 2 101 113 251 0.999 28.05 5.08 Term + 110175 110343 169 0 1 116 39 289 0.932 24.15 5.09 PlyA + 110418 110423 6 1.05 6.00 Prom + 116666 116705 40 -5.26 6.01 Init + 120078 120177 100 2 1 78 100 36 0.134 4.22 6.02 Intr + 141101 141195 95 0 2 59 74 70 0.916 2.38 6.03 Intr + 141441 141674 234 2 0 75 87 180 0.973 14.49 6.04 Intr + 142211 142262 52 1 1 127 110 16 0.814 6.18 6.05 Term + 156116 156210 95 0 2 77 49 56 0.013 -1.41 6.06 PlyA + 157632 157637 6 1.05 7.05 PlyA - 158979 158974 6 1.05 7.04 Term - 160862 160693 170 0 2 115 55 131 0.740 10.54 7.03 Intr - 161025 160909 117 1 0 76 25 81 0.608 1.04 7.02 Intr - 161861 161249 613 2 1 63 36 382 0.544 22.46 7.01 Init - 171506 171423 84 2 0 79 115 -1 0.628 2.52 7.00 Prom - 171710 171671 40 -4.16 8.00 Prom + 187874 187913 40 -3.76 8.01 Init + 195974 196000 27 1 0 82 90 39 0.239 3.10 8.02 Intr + 196844 196946 103 1 1 65 84 13 0.265 -1.65 8.03 Term + 198338 198657 320 0 2 129 39 356 0.914 29.84 8.04 PlyA + 202356 202361 6 1.05 9.00 Prom + 202374 202413 40 -7.36 9.01 Init + 205671 205713 43 2 1 69 101 -1 0.561 0.13 9.02 Intr + 206133 206309 177 1 0 43 98 187 0.927 15.09 9.03 Intr + 208658 208719 62 0 2 81 81 80 0.717 5.05 9.04 Intr + 210010 210117 108 0 0 73 86 13 0.251 0.08 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 57074 56907 168 2 0 78 82 94 0.872 7.94 S.002 Init - 89745 89659 87 0 0 82 101 63 0.861 7.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_1|133_aa MQSIKGLEAFSTVPWAHAVTKKEVNWKDLDVLTTKAQELLHEEQFHTNLMPLSYLFTWAP RAHLKTFMDNETQHVICQHGYLCTLFKCSTRGSRTSQVFPAKMSATWLPHPTLILPKAEG NYQEEKNRLLATN >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_1|402_bp atgcaaagcatcaaaggacttgaagcatttagcacagtgccctgggcacatgctgtaaca aaaaaagaagtgaactggaaggatctagatgttctgacaaccaaggctcaggaactcctc catgaagaacaattccacaccaacctcatgcccctcagctacctgtttacatgggcacct agggctcatttaaaaacatttatggataatgaaactcaacacgtcatctgccagcatggc tacctttgtacgctcttcaaatgcagcaccagaggctccagaacatcccaggtttttcct gccaagatgtctgcaacgtggctgccacatccaaccctcatccttcccaaagcagaaggc aattaccaagaagaaaagaacaggctcctagctacgaattaa >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_2|44_aa MATMYGLREVYRVAQSHTAGGQQNQDPTQACTNPRDDRKAFLPH >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_2|135_bp atggctaccatgtatggcctcagagaggtttatcgagttgcccaaagtcacacggctggt gggcagcagaaccaagacccaacccaggcttgcaccaaccccagagatgaccgcaaggcc ttcctgccccactga >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_3|391_aa MRLGNTVVVGTWASGCMELPPARSMSLSNTEDERSVPRLYCEEERGPLRWGPGDALQGST CQDQLKQCFSRQPTEPKDTDTLVHEAGSQYGTWTEQCQSGESLATESPDSSATSTRKQPP SSRLSSLSSQTEPTSAGDQYDCSRDQRSTSVDHSSTDLESTDGMEGPPPPDACPEKRVDD FSFIDQTSVLDSSALKTRVQLSKRSRRRAPISHSLRRSRFSESESRSPLEDETDNTWMFK DSTGPQYMAAADEFSALFFATEEKSPRKEESDEEETASKAERTPVSHPQRMPAFPGMDPA VLKAQLHKRPEVDSPGETPSWAPQPKSPKSPFQPGVLGSRVLPSSMDKDESIGAHGTCTI KVSVDVMSTRDSPSLIHKAKFQKALQNKAFS >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_3|1176_bp atgcggttgggaaacacggtggtggtaggaacctgggcaagcgggtgcatggagcttccc ccagccagatccatgagtctgagtaacactgaagatgagagaagtgtgcccaggctgtac tgtgaggaggagagagggccattgaggtgggggccaggcgatgccctgcagggctccaca tgccaggaccagctgaagcagtgtttctcccggcagcccactgaacccaaggacactgac accctcgtgcacgaagccggcagccagtatgggacgtggacagagcagtgccagagtggg gagagcttggccactgagtccccagatagcagtgccacatcgacaaggaaacagcccccc agcagccgtttgtcttctctgtcctcccaaacggagcccacctcggcaggggaccagtat gactgctccagggaccagcggagcaccagcgtggaccactccagcactgacctggaatcc accgatgggatggaggggccgcctccaccggacgcctgccctgaaaagagagtagatgac ttctccttcattgatcaaacctcagtcctcgactcaagtgccctcaagacccgggtgcag ctcagcaagagaagccgccgccgggcccccatctcccactccctccggcgcagccgattt agtgagtccgagagcagatcacctttggaggatgagactgacaacacgtggatgttcaaa gactcaacgggacctcaatacatggcagctgctgatgaattcagtgcccttttctttgca acagaggagaaatcacccaggaaggaggagtcggatgaggaggagacggcatccaaagct gagaggacccctgtcagccatcctcagaggatgcctgcgtttccaggcatggatccggca gtgctaaaggctcagctgcacaagaggccagaggtggacagtcctggcgagacccccagc tgggcaccccaacccaagagccccaagtcccccttccagcctggggtgctgggcagtcgc gtgctgccttccagcatggacaaggatgagagtataggagctcatggtacgtgcaccatc aaagtcagcgttgatgtgatgagcacaagagactcaccatcccttatccacaaagccaaa ttccagaaagccctgcaaaacaaagctttttcctaa >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_4|211_aa MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQ VESGPWLAFESRAFRGEQFVLEKGDYPRWDAWSNSRDSDSLLSLRPLNIDSPHHKLHLFE NPAFSGRKMEIVDDDVPSLWAHGFQDRVASVRAINGTWVGYEFPGYRGRQYVFERGEYRH WNEWDASQPQLQSVRRIRDQKWHKRGRFPSS >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_4|636_bp atggcggaacagcacggagcacccgaacaggctgcagctggcaagagccatggagacctt gggggcagctacaaggtgatcttgtacgaactagagaacttccaaggcaaacgctgcgag ctctcggccgagtgccccagcctgaccgacagcctgctggagaaggtgggctccatccaa gtggagtccgggccgtggctggcatttgagtccagggccttccgcggggagcagtttgtt ctggagaagggggattatcctcgctgggatgcctggtccaacagccgtgatagtgacagc cttctgtccctccggcctctgaatattgatagtccacatcacaagctgcatctgtttgag aacccagctttcagtggccgcaagatggagatagtggatgatgacgtgcccagcctgtgg gctcatggcttccaggaccgtgtggcgagtgtccgtgccatcaacgggacgtgggttggc tatgagttccccggctaccgtgggcgccagtacgtgtttgagcggggcgagtaccgccac tggaatgagtgggacgccagccagccgcagctgcagtctgtgcgccgcatccgtgaccag aagtggcacaagcggggccgcttccccagcagctga >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_5|268_aa MTAPSNLYSTSSLHEFAYPRSTAVTHLGTGTSGEGINATSRWPSFTALAGAGVTGHSCTG QSTMASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAG PWVGYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSSLRPIKVDSQEHKIILYENPNF TGKKMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDF GAPHPQVQSVRRIRDMQWHQRGAFHPSN >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_5|807_bp atgactgcccccagtaacctctattctacttccagtctccatgaatttgcctaccctagg agcacagcggtcactcatcttggcaccggcacttctggggaaggtataaatgccacctcc cgctggccgagcttcacggcactcgcaggggctggtgtcactggtcattcctgcacagga cagtccaccatggcctcagatcaccagacccaggcgggcaagccacagtccctcaacccc aagatcatcatctttgagcaggaaaactttcaaggccactcgcatgagctcaatgggccc tgccccaacctgaaggaaactggcgtggagaaggcaggttctgtcctagtgcaggctgga ccctgggtgggctatgaacaggccaactgcaagggcgagcagtttgtgtttgagaagggt gagtacccccgctgggactcatggaccagcagccgaaggacggactccctcagctccctg aggcccatcaaagtggacagccaagagcacaagatcatcctctatgaaaaccccaacttc accgggaagaagatggaaatcatagatgacgatgtacccagcttccacgcccatggctac caggagaaggtgtcatctgtgcgggtgcagagtggcacgtgggttggctaccagtacccc ggctaccgtgggctgcagtacctgctggagaagggagactacaaggacagcagcgacttt ggggcccctcacccccaggtgcagtccgtgcgccgtatccgcgacatgcagtggcaccaa cgtggtgccttccacccctccaactag >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_6|191_aa MRAQDGMPASESPGIWISTPPIFCYSNGIGFYQENSERLGNIAPFQQKAQKNQGTVASVL CCTWKAQYMGLCLGPQRSEGSKEQTMGSTQTSATHWSLSPLLQDPSTTRRREKVYDPPPC NSPMISMDTQRYTLQHPAKTWYLVLECGYESWNSGAILLPDTATIRFLNLFPTFPPFRFH KTAIVIMAHSQ >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_6|576_bp atgcgtgctcaggatgggatgcctgcctcagaatcccctggaatatggatttccactcca cccatcttctgctacagcaatggaattggtttctatcaagagaactccgagaggttggga aatattgctccatttcaacagaaggcccagaaaaatcaaggcaccgtggccagcgttctc tgctgcacctggaaggcccagtacatgggcttatgtctggggccccagaggagtgagggc agcaaagaacagacaatgggctccacgcagacctctgcaacacactggagcctgagccca ctgctgcaggacccatcaacaacaagaaggagggagaaggtgtatgacccccctccctgc aactctcccatgatctccatggacacccaaagatatacactgcagcacccagcaaagacg tggtacttggtgttggagtgtggatatgagtcatggaactctggggctatattgctgcca gacacggcaaccatccgatttctcaatcttttccccacctttcccccctttcgattccac aaaaccgccattgtcatcatggcccattctcaatga >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_7|327_aa MTCVNTKWLEIYQPYLSWPTGVLLHKCMGAALGTLERSLPESPPPAIRCPISETGTGRPA PHRQFRGVTRTQDSDDSMPDPPGRRRSLTAAWGRRRGDQRGPGQLVPGIGQDGPLAGAGG RSRLALLHGHQVTGICQHSLRVRHVGEFWRRRRLLGPGWDALPPPPPPPPAEPRPGGTGA APGQPLLGAPHQRPASAAGCSPPGSQCAFRSAASRAAPQLIPRGSAGSSRPDAHPEKPWA PCPLLKGHRGGLCYLDTVVLKRDSINQHPPGGNDEAPALVQATCEQRPAGRAPNLAPERT RKPAVRIQRRVKAGESCVSQAKQSKVG >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_7|984_bp atgacttgtgttaataccaaatggctggaaatataccaaccctacctgtcatggcctact ggggtcctgctacataaatgcatgggcgccgcgctgggcactttggagcgctctctcccg gagtcccccccaccggccatccgatgccccatttcagagactggaacggggaggccggcc ccgcatcgccagttccgcggtgtcacccgcacccaggacagcgacgactcaatgcccgac cctcccggacgcaggaggtcgctgacggccgcttggggacggcggcgtggggaccagcgc gggccggggcagctggtacctgggatcgggcaggacggtcctcttgctggcgcgggcggc cggagtcgccttgctcttctccatggccatcaggtaactggcatctgccagcacagcctc cgggtccgccacgttggcgagttttggcggcggcggcggttactcggacccggatgggac gcgctgccgccaccgccgccgccgccgccagctgagccccgacctggtgggactggggca gccccagggcagcccctgctcggcgcgccccaccagcgccctgcaagcgccgcaggctgt agcccgccggggtcgcagtgcgccttccgctccgccgcctcgcgcgccgccccccagctc attccgcgagggtctgcggggagctcgcggcccgacgctcatccagagaagccctgggcc ccgtgcccactactcaaggggcatcggggaggcctctgctacctggacacagtagttctc aaacgggacagcatcaatcagcatcctccaggaggtaatgatgaagcccctgctctggtc caagctacctgtgaacagcgcccagcgggccgggcccccaacctcgctcctgaacgtacc cgcaagcccgccgtgagaattcagcgaagagtaaaggctggagaaagttgcgtgtctcag gctaagcagagcaaagtggggtga >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_8|149_aa MGHGETDARRGSWTGPGCWPRGFQSKHNSVKHVFGSGTQLTVLGQPKATPSVTLFLPSSE ELQANKATLVCLMNDFYLGILTVTWKADGTPITQGVEMTTPSKQSNSKYMASSYLSLTPE QWRSRRSYSCQVMHEGSTAEKTVAPAECS >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_8|450_bp atgggccatggagaaacagatgccaggcgcggctcctggactggccccgggtgctggccc cgggggtttcaatccaagcataattcagtgaagcatgtgtttggcagcgggacccagctc accgttttaggtcagcccaaggccaccccctcggtcactctgttcctgccgtcctctgag gagctccaagccaacaaggccacactggtgtgtctcatgaatgacttctatctgggaatc ttgacggtgacctggaaggcagatggtacccccatcacccagggcgtggagatgaccacg ccctccaaacagagcaacagcaagtacatggccagcagctacctgagcctgacgcccgag cagtggaggtcccgcagaagctacagctgccaggtcatgcatgaagggagcactgcagag aagacggtggcccctgcagaatgttcatag >gi568815576f:25121430_25331769|GENSCAN_predicted_peptide_9|130_aa MGIVRHWGGCGAGHVSSDCGWPRGKAASSYESDIYEAVAAATSESTTVEPGKLDVGATEG QDLQHISNQKMPTASVLDLEDIALDIAEEVLSEMPPESCHSSNKKEEKLQHHHTLYINDL FTPSPQLYLK >gi568815576f:25121430_25331769|GENSCAN_predicted_CDS_9|390_bp atgggcattgtgaggcactgggggggctgtggggctggacatgtcagctccgactgtggc tggcccagggggaaggcagcatccagttatgaatctgatatttatgaggccgtggctgct gcaacatcagaatccactaccgtagagcctggcaagctggatgtgggagccacggagggc caagacctgcagcacatcagcaaccaaaagatgcccacagcttctgttctggacctggaa gatattgcgcttgatattgcggaggaggtgttatctgaaatgcctcccgagtcatgtcat tcttctaataaaaaggaagaaaaacttcaacaccaccatacactttacataaatgacctc tttactccctcaccacagctgtacctgaag