GENSCAN 1.0	Date run: 3-Nov-116	Time: 03:55:59

Sequence gi568815576f:25101397_25307209 : 205813 bp : 47.13% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -     98     93    6                               1.05
 1.03 Term -   1522   1453   70  0  1  149   47   114 0.708  10.81
 1.02 Intr -   3028   2994   35  1  2   66   16    45 0.353  -7.88
 1.01 Init -   3928   3875   54  1  0   64   94    30 0.531   0.48
 1.00 Prom -   4212   4173   40                              -1.56

 2.00 Prom +   5698   5737   40                              -4.96
 2.01 Init +  10782  11066  285  2  0   81   61   263 0.965  20.03
 2.02 Term +  17670  17747   78  2  0   89   49    43 0.224  -1.74
 2.03 PlyA +  18032  18037    6                               1.05

 3.03 PlyA -  19447  19442    6                               1.05
 3.02 Term -  25264  24915  350  2  2   84   49   183 0.947   8.65
 3.01 Init -  34520  34469   52  2  1   65  103    33 0.521   3.83
 3.00 Prom -  42426  42387   40                              -4.46

 4.00 Prom +  43804  43843   40                              -2.46
 4.01 Init +  44791  44877   87  0  0   78   62    31 0.080   0.46
 4.02 Term +  59164  59211   48  0  0  107   48    86 0.667   3.80
 4.03 PlyA +  61344  61349    6                               1.05

 5.00 Prom +  64966  65005   40                              -3.16
 5.01 Init +  68987  69172  186  1  0   56   91    56 0.435  -0.23
 5.02 Intr +  69424  69542  119  0  2   90   75   154 0.918  13.56
 5.03 Intr +  72844  73093  250  1  1   89   95   182 0.984  16.44
 5.04 Intr +  75952  76126  175  0  1   71   67   149 0.250  10.61
 5.05 Intr +  80249  80427  179  0  2   22   76   138 0.187   5.64
 5.06 Intr +  83581  83723  143  0  2   82   78    94 0.912   6.95
 5.07 Term +  90313  90436  124  1  1   95   41   121 0.804   5.96
 5.08 PlyA +  90621  90626    6                               1.05

 6.00 Prom +  95543  95582   40                              -5.26
 6.01 Init + 100001 100075   75  1  0   98   79    59 0.960   7.11
 6.02 Intr + 101278 101396  119  0  2   53   78   194 0.999  14.16
 6.03 Intr + 102367 102499  133  1  1  115   68   104 0.997  11.75
 6.04 Intr + 103824 103966  143  2  2   72   77   215 0.992  17.95
 6.05 Term + 105651 105816  166  0  1   86   48   359 0.999  29.09
 6.06 PlyA + 105932 105937    6                               1.05

 7.00 Prom + 114582 114621   40                              -5.86
 7.01 Init + 114942 114948    7  2  1  108   28     0 0.313  -2.98
 7.02 Intr + 115403 115454   52  0  1   85   61    34 0.409  -1.63
 7.03 Intr + 118167 118270  104  0  2   96   99    57 0.857   7.42
 7.04 Intr + 120008 120087   80  0  2  129   98    63 0.947  10.67
 7.05 Intr + 123522 123640  119  2  2  104  103   110 0.999  13.36
 7.06 Intr + 126457 126589  133  1  1  101   77   157 0.963  16.55
 7.07 Intr + 128040 128182  143  2  2  101  113   251 0.999  28.05
 7.08 Term + 130208 130376  169  2  1  116   39   289 0.932  24.15
 7.09 PlyA + 130451 130456    6                               1.05

 8.00 Prom + 136699 136738   40                              -5.26
 8.01 Init + 140111 140210  100  1  1   78  100    36 0.134   4.22
 8.02 Intr + 161134 161228   95  2  2   59   74    70 0.916   2.38
 8.03 Intr + 161474 161707  234  1  0   75   87   180 0.973  14.49
 8.04 Intr + 162244 162295   52  0  1  127  110    16 0.814   6.18
 8.05 Term + 176149 176243   95  2  2   77   49    56 0.013  -1.41
 8.06 PlyA + 177665 177670    6                               1.05

 9.05 PlyA - 179012 179007    6                               1.05
 9.04 Term - 180895 180726  170  2  2  115   55   131 0.740  10.54
 9.03 Intr - 181058 180942  117  0  0   76   25    81 0.608   1.04
 9.02 Intr - 181894 181282  613  1  1   63   36   382 0.544  22.46
 9.01 Init - 191539 191456   84  1  0   79  115    -1 0.632   2.52

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  77107  76940  168  1  0   78   82    94 0.872   7.94
S.002 Init - 109778 109692   87  2  0   82  101    63 0.861   7.64

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_1|52_aa
MGQAWWLMPVIPTLWEAQTTFGFFQISRRRWSNTVSQKAADGTGLPRKACIA
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_1|159_bp
atgggccaggcgtggtggcttatgcccgtaatcccaacactttgggaggctcagacaacc
ttcggctttttccagatctcccgcaggaggtggagcaacaccgtcagtcagaaggcagcg
gatggcacagggcttccgcgtaaggcctgcatcgcctga
>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_2|120_aa
MDYLGDKLTVAQTHMTQWMGTVRRSFQEALNLVTTTVGHERTSAESASRTPFKRTSSFRH
LASRSRESFRRFSARSQQRFSSLRKRHTDSEPPDIDVRSMRARALAVLVHLDIASTQNSG
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_2|363_bp
atggattacctgggagataaactcacagtggcccagacccacatgacccagtggatgggc
accgtgaggaggtccttccaggaagccctcaatttggtgaccaccacggtgggccacgaa
cggacgagtgcagagtcggccagccggacccccttcaagcgtacttcctccttccgacac
ctggcttcccggagtcgggaatccttccgccgcttctctgctcgcagccaacagagattt
tcttcactgaggaaaaggcatacagattcggaaccgccagatattgatgtaaggtccatg
agggccagggctttggccgtcctggttcaccttgatattgccagtacccagaacagtggc
tag
>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_3|133_aa
MQSIKGLEAFSTVPWAHAVTKKEVNWKDLDVLTTKAQELLHEEQFHTNLMPLSYLFTWAP
RAHLKTFMDNETQHVICQHGYLCTLFKCSTRGSRTSQVFPAKMSATWLPHPTLILPKAEG
NYQEEKNRLLATN
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_3|402_bp
atgcaaagcatcaaaggacttgaagcatttagcacagtgccctgggcacatgctgtaaca
aaaaaagaagtgaactggaaggatctagatgttctgacaaccaaggctcaggaactcctc
catgaagaacaattccacaccaacctcatgcccctcagctacctgtttacatgggcacct
agggctcatttaaaaacatttatggataatgaaactcaacacgtcatctgccagcatggc
tacctttgtacgctcttcaaatgcagcaccagaggctccagaacatcccaggtttttcct
gccaagatgtctgcaacgtggctgccacatccaaccctcatccttcccaaagcagaaggc
aattaccaagaagaaaagaacaggctcctagctacgaattaa
>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_4|44_aa
MATMYGLREVYRVAQSHTAGGQQNQDPTQACTNPRDDRKAFLPH
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_4|135_bp
atggctaccatgtatggcctcagagaggtttatcgagttgcccaaagtcacacggctggt
gggcagcagaaccaagacccaacccaggcttgcaccaaccccagagatgaccgcaaggcc
ttcctgccccactga
>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_5|391_aa
MRLGNTVVVGTWASGCMELPPARSMSLSNTEDERSVPRLYCEEERGPLRWGPGDALQGST
CQDQLKQCFSRQPTEPKDTDTLVHEAGSQYGTWTEQCQSGESLATESPDSSATSTRKQPP
SSRLSSLSSQTEPTSAGDQYDCSRDQRSTSVDHSSTDLESTDGMEGPPPPDACPEKRVDD
FSFIDQTSVLDSSALKTRVQLSKRSRRRAPISHSLRRSRFSESESRSPLEDETDNTWMFK
DSTGPQYMAAADEFSALFFATEEKSPRKEESDEEETASKAERTPVSHPQRMPAFPGMDPA
VLKAQLHKRPEVDSPGETPSWAPQPKSPKSPFQPGVLGSRVLPSSMDKDESIGAHGTCTI
KVSVDVMSTRDSPSLIHKAKFQKALQNKAFS
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_5|1176_bp
atgcggttgggaaacacggtggtggtaggaacctgggcaagcgggtgcatggagcttccc
ccagccagatccatgagtctgagtaacactgaagatgagagaagtgtgcccaggctgtac
tgtgaggaggagagagggccattgaggtgggggccaggcgatgccctgcagggctccaca
tgccaggaccagctgaagcagtgtttctcccggcagcccactgaacccaaggacactgac
accctcgtgcacgaagccggcagccagtatgggacgtggacagagcagtgccagagtggg
gagagcttggccactgagtccccagatagcagtgccacatcgacaaggaaacagcccccc
agcagccgtttgtcttctctgtcctcccaaacggagcccacctcggcaggggaccagtat
gactgctccagggaccagcggagcaccagcgtggaccactccagcactgacctggaatcc
accgatgggatggaggggccgcctccaccggacgcctgccctgaaaagagagtagatgac
ttctccttcattgatcaaacctcagtcctcgactcaagtgccctcaagacccgggtgcag
ctcagcaagagaagccgccgccgggcccccatctcccactccctccggcgcagccgattt
agtgagtccgagagcagatcacctttggaggatgagactgacaacacgtggatgttcaaa
gactcaacgggacctcaatacatggcagctgctgatgaattcagtgcccttttctttgca
acagaggagaaatcacccaggaaggaggagtcggatgaggaggagacggcatccaaagct
gagaggacccctgtcagccatcctcagaggatgcctgcgtttccaggcatggatccggca
gtgctaaaggctcagctgcacaagaggccagaggtggacagtcctggcgagacccccagc
tgggcaccccaacccaagagccccaagtcccccttccagcctggggtgctgggcagtcgc
gtgctgccttccagcatggacaaggatgagagtataggagctcatggtacgtgcaccatc
aaagtcagcgttgatgtgatgagcacaagagactcaccatcccttatccacaaagccaaa
ttccagaaagccctgcaaaacaaagctttttcctaa
>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_6|211_aa
MAEQHGAPEQAAAGKSHGDLGGSYKVILYELENFQGKRCELSAECPSLTDSLLEKVGSIQ
VESGPWLAFESRAFRGEQFVLEKGDYPRWDAWSNSRDSDSLLSLRPLNIDSPHHKLHLFE
NPAFSGRKMEIVDDDVPSLWAHGFQDRVASVRAINGTWVGYEFPGYRGRQYVFERGEYRH
WNEWDASQPQLQSVRRIRDQKWHKRGRFPSS
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_6|636_bp
atggcggaacagcacggagcacccgaacaggctgcagctggcaagagccatggagacctt
gggggcagctacaaggtgatcttgtacgaactagagaacttccaaggcaaacgctgcgag
ctctcggccgagtgccccagcctgaccgacagcctgctggagaaggtgggctccatccaa
gtggagtccgggccgtggctggcatttgagtccagggccttccgcggggagcagtttgtt
ctggagaagggggattatcctcgctgggatgcctggtccaacagccgtgatagtgacagc
cttctgtccctccggcctctgaatattgatagtccacatcacaagctgcatctgtttgag
aacccagctttcagtggccgcaagatggagatagtggatgatgacgtgcccagcctgtgg
gctcatggcttccaggaccgtgtggcgagtgtccgtgccatcaacgggacgtgggttggc
tatgagttccccggctaccgtgggcgccagtacgtgtttgagcggggcgagtaccgccac
tggaatgagtgggacgccagccagccgcagctgcagtctgtgcgccgcatccgtgaccag
aagtggcacaagcggggccgcttccccagcagctga
>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_7|268_aa
MTAPSNLYSTSSLHEFAYPRSTAVTHLGTGTSGEGINATSRWPSFTALAGAGVTGHSCTG
QSTMASDHQTQAGKPQSLNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAG
PWVGYEQANCKGEQFVFEKGEYPRWDSWTSSRRTDSLSSLRPIKVDSQEHKIILYENPNF
TGKKMEIIDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDF
GAPHPQVQSVRRIRDMQWHQRGAFHPSN
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_7|807_bp
atgactgcccccagtaacctctattctacttccagtctccatgaatttgcctaccctagg
agcacagcggtcactcatcttggcaccggcacttctggggaaggtataaatgccacctcc
cgctggccgagcttcacggcactcgcaggggctggtgtcactggtcattcctgcacagga
cagtccaccatggcctcagatcaccagacccaggcgggcaagccacagtccctcaacccc
aagatcatcatctttgagcaggaaaactttcaaggccactcgcatgagctcaatgggccc
tgccccaacctgaaggaaactggcgtggagaaggcaggttctgtcctagtgcaggctgga
ccctgggtgggctatgaacaggccaactgcaagggcgagcagtttgtgtttgagaagggt
gagtacccccgctgggactcatggaccagcagccgaaggacggactccctcagctccctg
aggcccatcaaagtggacagccaagagcacaagatcatcctctatgaaaaccccaacttc
accgggaagaagatggaaatcatagatgacgatgtacccagcttccacgcccatggctac
caggagaaggtgtcatctgtgcgggtgcagagtggcacgtgggttggctaccagtacccc
ggctaccgtgggctgcagtacctgctggagaagggagactacaaggacagcagcgacttt
ggggcccctcacccccaggtgcagtccgtgcgccgtatccgcgacatgcagtggcaccaa
cgtggtgccttccacccctccaactag
>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_8|191_aa
MRAQDGMPASESPGIWISTPPIFCYSNGIGFYQENSERLGNIAPFQQKAQKNQGTVASVL
CCTWKAQYMGLCLGPQRSEGSKEQTMGSTQTSATHWSLSPLLQDPSTTRRREKVYDPPPC
NSPMISMDTQRYTLQHPAKTWYLVLECGYESWNSGAILLPDTATIRFLNLFPTFPPFRFH
KTAIVIMAHSQ
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_8|576_bp
atgcgtgctcaggatgggatgcctgcctcagaatcccctggaatatggatttccactcca
cccatcttctgctacagcaatggaattggtttctatcaagagaactccgagaggttggga
aatattgctccatttcaacagaaggcccagaaaaatcaaggcaccgtggccagcgttctc
tgctgcacctggaaggcccagtacatgggcttatgtctggggccccagaggagtgagggc
agcaaagaacagacaatgggctccacgcagacctctgcaacacactggagcctgagccca
ctgctgcaggacccatcaacaacaagaaggagggagaaggtgtatgacccccctccctgc
aactctcccatgatctccatggacacccaaagatatacactgcagcacccagcaaagacg
tggtacttggtgttggagtgtggatatgagtcatggaactctggggctatattgctgcca
gacacggcaaccatccgatttctcaatcttttccccacctttcccccctttcgattccac
aaaaccgccattgtcatcatggcccattctcaatga
>gi568815576f:25101397_25307209|GENSCAN_predicted_peptide_9|327_aa
MTCVNTKWLEIYQPYLSWPTGVLLHKCMGAALGTLERSLPESPPPAIRCPISETGTGRPA
PHRQFRGVTRTQDSDDSMPDPPGRRRSLTAAWGRRRGDQRGPGQLVPGIGQDGPLAGAGG
RSRLALLHGHQVTGICQHSLRVRHVGEFWRRRRLLGPGWDALPPPPPPPPAEPRPGGTGA
APGQPLLGAPHQRPASAAGCSPPGSQCAFRSAASRAAPQLIPRGSAGSSRPDAHPEKPWA
PCPLLKGHRGGLCYLDTVVLKRDSINQHPPGGNDEAPALVQATCEQRPAGRAPNLAPERT
RKPAVRIQRRVKAGESCVSQAKQSKVG
>gi568815576f:25101397_25307209|GENSCAN_predicted_CDS_9|984_bp
atgacttgtgttaataccaaatggctggaaatataccaaccctacctgtcatggcctact
ggggtcctgctacataaatgcatgggcgccgcgctgggcactttggagcgctctctcccg
gagtcccccccaccggccatccgatgccccatttcagagactggaacggggaggccggcc
ccgcatcgccagttccgcggtgtcacccgcacccaggacagcgacgactcaatgcccgac
cctcccggacgcaggaggtcgctgacggccgcttggggacggcggcgtggggaccagcgc
gggccggggcagctggtacctgggatcgggcaggacggtcctcttgctggcgcgggcggc
cggagtcgccttgctcttctccatggccatcaggtaactggcatctgccagcacagcctc
cgggtccgccacgttggcgagttttggcggcggcggcggttactcggacccggatgggac
gcgctgccgccaccgccgccgccgccgccagctgagccccgacctggtgggactggggca
gccccagggcagcccctgctcggcgcgccccaccagcgccctgcaagcgccgcaggctgt
agcccgccggggtcgcagtgcgccttccgctccgccgcctcgcgcgccgccccccagctc
attccgcgagggtctgcggggagctcgcggcccgacgctcatccagagaagccctgggcc
ccgtgcccactactcaaggggcatcggggaggcctctgctacctggacacagtagttctc
aaacgggacagcatcaatcagcatcctccaggaggtaatgatgaagcccctgctctggtc
caagctacctgtgaacagcgcccagcgggccgggcccccaacctcgctcctgaacgtacc
cgcaagcccgccgtgagaattcagcgaagagtaaaggctggagaaagttgcgtgtctcag
gctaagcagagcaaagtggggtga