GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:33:18 Sequence gi568815591f:74797652_75005671 : 208020 bp : 48.11% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.14 Intr - 614 22 593 0 2 38 57 546 0.943 38.82 1.13 Intr - 4362 4319 44 2 2 108 94 -12 0.346 -0.72 1.12 Intr - 5830 5647 184 2 1 108 111 147 0.997 17.85 1.11 Intr - 7209 7126 84 1 0 72 111 43 0.965 4.79 1.10 Intr - 9360 9283 78 1 0 76 80 73 0.962 4.82 1.09 Intr - 11522 11457 66 0 0 64 96 73 0.866 4.58 1.08 Intr - 13301 13245 57 0 0 98 72 44 0.859 2.66 1.07 Intr - 21946 21903 44 0 2 128 68 45 0.971 4.38 1.06 Intr - 22372 22318 55 2 1 103 93 16 0.577 1.64 1.05 Intr - 24804 24776 29 2 2 87 106 -8 0.383 -1.34 1.04 Intr - 25156 24973 184 2 1 73 29 120 0.503 3.45 1.03 Intr - 35292 35154 139 0 1 91 43 156 0.894 11.44 1.02 Intr - 38732 38629 104 0 2 95 81 70 0.621 6.89 1.01 Init - 51941 51461 481 2 1 87 110 331 0.649 30.72 1.00 Prom - 67209 67170 40 -3.46 2.07 PlyA - 67281 67276 6 1.05 2.06 Term - 85390 85235 156 1 0 81 47 95 0.705 2.63 2.05 Intr - 85812 85698 115 2 1 84 56 66 0.691 3.45 2.04 Intr - 86184 86131 54 2 0 108 -1 118 0.669 2.99 2.03 Intr - 86572 86493 80 1 2 109 78 81 0.969 7.55 2.02 Intr - 90148 90032 117 1 0 79 98 114 0.951 12.16 2.01 Init - 90582 90553 30 0 0 59 47 68 0.933 -2.30 2.00 Prom - 92135 92096 40 -10.35 3.00 Prom + 92142 92181 40 -5.36 3.01 Sngl + 92506 93186 681 0 0 67 54 324 0.976 21.09 3.02 PlyA + 93976 93981 6 1.05 4.00 Prom + 94748 94787 40 -7.56 4.01 Init + 96138 96205 68 2 2 76 48 50 0.939 0.34 4.02 Intr + 96467 96606 140 2 2 50 111 96 0.952 8.21 4.03 Intr + 98495 98581 87 0 0 62 97 97 0.994 7.94 4.04 Intr + 98758 98860 103 2 1 42 44 41 0.985 -5.67 4.05 Intr + 100001 100184 184 2 1 81 87 156 0.924 14.59 4.06 Term + 105187 105243 57 0 0 25 48 95 0.278 -3.11 4.07 PlyA + 105480 105485 6 1.05 5.11 PlyA - 106676 106671 6 1.05 5.10 Term - 108120 108016 105 0 0 107 43 91 0.998 4.91 5.09 Intr - 109444 109201 244 0 1 80 90 271 0.969 24.00 5.08 Intr - 112197 112067 131 0 2 53 17 94 0.788 -1.71 5.07 Intr - 113244 113186 59 1 2 86 116 35 0.946 4.70 5.06 Intr - 114496 114278 219 2 0 42 54 265 0.578 16.77 5.05 Intr - 115675 115500 176 0 2 42 94 97 0.288 5.28 5.04 Intr - 116082 115941 142 1 1 76 31 40 0.335 -3.49 5.03 Intr - 118891 118811 81 2 0 37 89 69 0.324 1.51 5.02 Intr - 134402 134234 169 2 1 91 75 123 0.967 10.82 5.01 Init - 135215 135207 9 2 0 100 62 3 0.487 -0.66 5.00 Prom - 139409 139370 40 -7.36 6.00 Prom + 139537 139576 40 -3.16 6.01 Init + 155439 155523 85 2 1 79 77 47 0.773 3.68 6.02 Intr + 166847 167447 601 0 1 75 115 301 0.039 23.07 6.03 Intr + 169338 169365 28 0 1 120 60 -1 0.272 -1.58 6.04 Term + 174826 174921 96 0 0 75 42 103 0.258 2.37 6.05 PlyA + 175870 175875 6 1.05 7.03 PlyA - 176928 176923 6 -0.45 7.02 Term - 178948 178827 122 2 2 85 42 76 0.536 1.34 7.01 Init - 179119 178990 130 1 1 67 97 35 0.450 2.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 132941 132859 83 0 2 77 50 74 0.909 0.36 S.002 Term + 156342 156388 47 1 2 100 54 47 0.850 -0.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:74797652_75005671|GENSCAN_predicted_peptide_1|714_aa MAPKHKSSDAGNLDRPKRSRKVLPLSEKVKVLDLIRKDKKSYAEVAKIYGKNESSIREIV KKEKEIRASFAVSPPTAKVTATVRDKCLVKMEQALHLWVEEMNRKRVPIDSNMLRQKALS LYQDFCKGCSETDTKPFTASKGWLHRFRHRFSHHYKKKKKGIMAQVAVSTLPVEEESSSE TRMVVTFLVSALESMCKELAKSKAEVACIAVYETDVFVVGTERGCAFVNARTDFQKDFAK YCKALGTTVMVPVPYEKMLRDQSAVVVQGLPEGVAFQHPENYDLATLKWILENKAGISFI INRPFLGPESQLGGPGMVTDAERSIVSPSESCGPINVKTEPMEDSGSHPSSTSNEVIEME LPMEDSTPLVPSEEPNEDPEAEVKIEGNTNSSSVTNSAAGVEDLNIVQVTVPDNEKERLS SIEKIKQLREQVNDLFSRKFGEAIGVDFPVKVPYRKITFNPGCVVIDGMPPGVVFKAPGY LEISSMRRILEAAEFIKFTVIRPLPGLELSNGEYSTVGKRKIDQEGRVFQEKWERAYFFV EVQNIPTCLICKQSMSVSKEYNLRRHYQTNHSKHYDQYMERMRDEKLHELKKGLRKYLLG SSDTECPEQKQVFANPSPTQKSPVQPVEDLAGNLWEKLREKIRSFVAYSIAIDEITDINN TTQLAIFIRGVDENFDVSEELLDTVPMTGTKSGNEIFSRVEKSLKNFCIDWSKL >gi568815591f:74797652_75005671|GENSCAN_predicted_CDS_1|2142_bp atggccccaaagcacaagagtagtgatgctgggaatttggataggccaaagagaagccgt aaagtgcttcctctaagtgaaaaggtgaaagttctcgacttaatcaggaaagacaaaaaa tcctatgctgaggttgctaagatctacgggaagaatgaatcttccatccgtgaaattgtg aagaaggaaaaagaaattcgtgctagttttgctgtctcacctccaactgctaaagtgacg gccacagtgcgtgataagtgcttagttaagatggaacaggcactgcatttgtgggtggaa gagatgaacagaaaacgtgttcccattgacagcaacatgttgcgccagaaagctttgagc ctataccaagacttctgcaagggatgctctgaaactgacaccaagccatttactgcgagt aagggatggttacacagattcaggcatagattctcacatcattacaagaagaagaagaag gggatcatggcccaggtagcagtgtccaccctgcctgttgaagaagagtcctcctcagag accaggatggtggtgacattcctcgtgtctgccctcgaatccatgtgtaaagaactggcc aagtccaaggcagaagtggcctgcatcgcagtgtacgaaacagacgtgtttgtcgtcgga accgagagaggatgcgcttttgttaatgccaggacggattttcagaaagattttgcaaaa tactgtaaagccttagggacaacagtgatggtgcctgttccctatgagaagatgctgcga gaccagtcggctgtggtagtgcaggggcttccggaaggcgttgcctttcaacaccctgag aattacgaccttgcaaccctgaaatggattttggagaacaaagcagggatttcattcatc ataaatagacccttcctaggaccagagagtcagctgggtggccctgggatggtaacagat gcggagagatccatagtatcaccaagtgaaagctgcggccccatcaatgtgaaaactgaa cccatggaagattctggaagccacccttcttccacaagcaatgaagtaatagaaatggaa ttaccaatggaagattccactccgctggtcccttcagaagaaccaaatgaggaccctgaa gccgaggtgaaaatcgaaggaaacacaaattcatccagtgttacaaattctgcagcaggt gttgaagatcttaacatcgttcaagtgactgttccagataatgagaaggaaagattatca agcattgaaaagattaaacagctaagagaacaagttaatgacctctttagccgaaaattt ggtgaagcaattggcgtggatttccctgtgaaagttccctacaggaagatcacattcaac cctggctgtgtggtgattgatggcatgcccccgggggtggtattcaaggcccccggctat ctggaaatcagttccatgaggaggatcttggaggcagctgagtttatcaaattcacagtc atcaggccgcttccagggcttgagctcagtaatggtgagtattctacagtgggaaaacgc aagatagaccaggagggccgtgtgtttcaagaaaagtgggagagagcgtatttcttcgtg gaagtacagaatattccaacatgtctcatatgcaaacaaagcatgtctgtgtccaaagaa tataacctaagacgccactatcaaaccaatcacagcaagcattatgaccagtatatggaa agaatgcgtgacgagaagcttcacgagctgaaaaaagggctcaggaagtatctcttaggc tcatcagacaccgagtgtcccgagcaaaaacaagtgtttgcaaacccaagtccaacccag aaatcccccgtgcagcctgtagaggacctagctgggaacttatgggagaagttacgtgaa aaaatcaggtcttttgtggcatattctatcgcaatcgatgagatcacggatataaataat accacccagttggccatattcatccgtggtgtcgatgagaatttcgatgtgtccgaagaa cttctggacacggtgcccatgacgggtacaaaatctggcaacgagatcttttcgcgtgtt gagaaaagcctgaaaaacttctgtatcgactggtcgaaatta >gi568815591f:74797652_75005671|GENSCAN_predicted_peptide_2|183_aa MARTLGAELAVAKYPKKGSQAVHRHSRKQSEPPANDIFNAAKAAKSDMQDWMVSMIVDRE YSVAVEAVRLLILILKNMEGVLMDVDCESVYPIVLFYPECEIRTMGGREQRQSPGAQRTF FQLLLSFFVESKLHDHAAYLVDNLWDCAGTQLKDWEGLTSLLLEKDQSTCHMEPGPGTFH LLG >gi568815591f:74797652_75005671|GENSCAN_predicted_CDS_2|552_bp atggcacgaaccctgggggcagagctggcagtggcaaaatatccaaagaaagggtcccaa gcggtacatcgtcatagccggaaacagtcagagccaccagccaatgatattttcaatgct gcgaaagctgccaaaagtgacatgcaggactggatggtttccatgatcgtggacagagag tacagtgtggcagtggaggccgtcagattactgatacttatccttaagaacatggaaggg gtgctgatggacgtggactgtgagagcgtctaccccattgtacttttctaccctgagtgc gagataagaacgatgggtggaagagagcaacgccagagcccaggcgcccagaggactttc ttccagcttctgctgtccttctttgtggagagcaagctccacgaccacgctgcttactta gtagacaacctgtgggactgtgcagggactcagctgaaggactgggagggtctgacaagc ctgctgctggagaaggaccagagcacgtgccacatggagccagggccagggaccttccac ctcctagggtga >gi568815591f:74797652_75005671|GENSCAN_predicted_peptide_3|226_aa MPGWGRESVGTHSIRPRGTRASTRPPRVRSGRGAPRSYLQVASAKCWAAAPAVHVGEPVH AGGLHTERGADPVIGLYLVHRHGATKAREGGACQTPTVGNRQTPTVRNRQTPTLGNRQTP TLGIHARPRRRATTSLLTLLRAFGKKKRSPVCSDWSRLFDVTDSTFDRATRRKGETGSIF SAPPGKGGAQRRKQPMGAQEAGRLWEPWRALSQSPRRIRCCILGES >gi568815591f:74797652_75005671|GENSCAN_predicted_CDS_3|681_bp atgccggggtggggccgcgaatcggttgggacgcactctatccggcctaggggcacccgg gccagcacccggccgccgcgcgtgcgcagtgggcggggggccccgcgctcctacctgcaa gtggccagtgccaagtgctgggccgccgctcctgccgtgcatgttggggagccagtacat gcaggtgggctccacacggagaggggcgcagaccccgtgatagggctttacctggtacat cggcatggcgcaaccaaagcaagagagggtggcgcgtgccagacaccaacggtcggaaac cgccagacaccaacggtcagaaaccgccagacaccaacgctcggaaaccgccagacacca acgctcggaatacacgccagaccacgacggagggcgaccacctcccttctgaccctgctg cgggcgttcggaaaaaaaaaacgcagtccggtgtgctctgattggtccaggctctttgac gtcacggactcgacctttgacagagccactaggcgaaaaggagagacgggaagtattttt tccgccccgcccggaaagggtggagcacaacgtcgaaagcagccaatgggagcccaggag gcggggcgcctgtgggagccgtggagggcactttcccagtccccgaggcggatccggtgt tgcatccttggagagagctga >gi568815591f:74797652_75005671|GENSCAN_predicted_peptide_4|212_aa MLSVFKKEDTIIAKDFGNLRDTITEPAKAIKPIDRKSVHQICSGQVVLSLSTAVKKIVEN SLDAGATNIDLKLKDYGMDLIEVSGNGCGVEEENFKGLTLKHHTSKIQEFADLTRVETFG FRGEALSSLCALSDVTISTCHVSAKVGTRLVFDHDGKIIQKTPYPHPRGTTVSVKQLFST LPVRHKEFQRNIKKPVGEKYMRSVDTQACSCI >gi568815591f:74797652_75005671|GENSCAN_predicted_CDS_4|639_bp atgctttcagttttcaagaaagaagacaccattattgccaaagattttggtaatttgaga gatacaattacagaacctgctaaggccatcaaacctattgatcggaagtcagtccatcag atttgctctgggcaggtggtactgagtctaagcactgcggtgaagaagatagtagaaaac agtctggatgctggtgccactaatattgatctaaagcttaaggactatggaatggatctc attgaagtttcaggcaatggatgtggggtagaagaagaaaactttaaaggcttaactctg aaacatcacacatctaagattcaagagtttgccgacctaactcgggttgaaacttttggc tttcggggggaagctctgagctcactttgtgcactgagtgatgtcaccatttctacctgc cacgtatcggcgaaggttgggactcgactggtgtttgatcacgatgggaaaatcatccag aaaaccccctacccccaccccagagggaccacagtcagcgtgaagcagttattttctacg ctacctgtgcgccataaggaatttcaaaggaatattaagaagccagttggtgagaagtac atgcggtctgtggacacccaagcttgcagctgcatctga >gi568815591f:74797652_75005671|GENSCAN_predicted_peptide_5|444_aa MAQAQRLRQELQMLMTECLTWARNGSSKHPLSLAQKPEFLFHQLQSRDNAICPIAEQKAE GKFSSRYRNHDSAENRRDGSMTRPRTDWTLKRACTEKRRPQHASSFEWRGHSLCWDREQR DAETLKMLLDSGLSVQKKTKDRTETRFGEMGQILGKIMMSHQPQPQEEQSPQRSTSGYPL QEVVDDEVSGPSAPGVDPSPPRRSLGWKRKRECLDESDDEPEKELAPEPEETWVAETLCG LKMKAKRRRVSLVLPEYYEAFNRLLEDPVIKRLLAWDKDLRVSDKIPSEPTILGASPKTL PPASRICIRPSNTPPPRNFHMSTVTPMLSYLANDMEEDDEAPKQNIFYFLYEETRSHIPL LRELWFQLCRYMNPRARKNCSQIALFRKYRFHFFCSMRCRAWVSLEELEENTGPRGDVDF QQELYSNANGRHQEGGEEPFVQII >gi568815591f:74797652_75005671|GENSCAN_predicted_CDS_5|1335_bp atggctcaggctcagaggctcaggcaagagctacagatgctcatgaccgaatgtctcacc tgggcccggaatggcagcagcaagcaccctctcagcctagcccagaagccagagttccta tttcatcagttgcaaagcagagacaatgccatctgcccgatagcagagcaaaaggcagag gggaagttctcttcaaggtacagaaaccacgactccgcagagaaccgcagagatggcagc atgaccagaccacggacagactggactctgaaacgggcatgtacagagaagaggagaccc caacacgcttcaagctttgagtggagaggacacagcctctgctgggacagggaacagagg gatgcggagaccctgaagatgcttttggacagtggtctgagtgtccagaagaagaccaag gacagaacagagactaggtttggtgagatgggacagattttgggaaagatcatgatgagc catcaaccgcagccccaggaagagcagagcccccagcggagcacctcagggtaccccctc caggaggtggtggatgatgaagtgtcgggaccatcagcccctggggtagatcccagcccc ccacgtaggtcccttggctggaaaaggaagagggaatgtttggatgaatctgatgatgag ccagagaaggagctcgcccctgagcctgaggagacctgggtggcggagacgctgtgtggc ctcaagatgaaggcgaagcgacggcgagtgtcgctcgtgctccctgagtactacgaggcc ttcaacaggctgcttgaggatcctgtcattaaaagactcctggcctgggacaaagatctg agggtgtcggacaagatcccatcggagcccaccatcctgggagcatcaccgaaaaccctt cctccggcttctcggatttgcatccgaccttcgaatacccctccaccccgcaatttccac atgagcacagtcaccccaatgctgagctatctggccaatgacatggaggaggacgatgag gcccccaaacaaaacatcttctacttcctgtacgaggagacccgctctcatatacccttg ctccgtgagctttggttccagttatgccgttacatgaacccgagggccaggaagaactgc tctcagatagccttgttccggaagtatcggttccacttcttttgttccatgcgctgcagg gcttgggtttccctggaggagttggaagagaacaccggacccaggggagatgtggatttt cagcaggaactttattccaatgctaatggcagacatcaggaaggaggagaggaaccattt gtgcagatcatctag >gi568815591f:74797652_75005671|GENSCAN_predicted_peptide_6|269_aa MVPASAASEDRRKLPIIVEDEGGPTRSRVLRVPALLHFPCTGLPVAPAAQPPRQQQQQQH RGSPPPGGLQVPAGRARAAHAQRGPAFEWQVVCYSEHQLLPAPGAARRCSAARPLGCCSA GAASLAPRLPLATWRASSLLSGPARPPPPPAPGAELPGLRAGCRCRQGAEGAGAAARPPA DEPPRTGRGRTMELHILEHRLQVASVAKESIPLFTYGLIKLAFLSSKTRKQPVLSQAQKV HWIGKNYTGKYAVSMGPWKLPAAFLGRIC >gi568815591f:74797652_75005671|GENSCAN_predicted_CDS_6|810_bp atggtgccagcatctgctgctagtgaggaccgcaggaagcttcctatcatagtggaagat gaaggggggccgacaagatcacgtgtcctgagggtgcccgcgctcctgcatttcccgtgc accgggctgccggtagctccggccgcccagcccccgcggcagcaacagcagcaacagcac cgggggagcccccccccaggcggactacaagtcccggcaggccgcgcgcgggccgcgcat gcgcagcggggaccggcgtttgagtggcaagttgtttgttacagcgaacaccagctgctc cccgcgccgggcgccgcgcgccgctgctccgccgctcggcccctcggctgctgctccgcc ggcgctgcctccctcgccccgcggctcccccttgcaacttggcgggcctcctcccttttg tccggcccggcccggccgccgccgccccccgcgcccggcgccgagctcccgggtctccgg gccggctgtcggtgccggcagggcgcggagggggcgggggccgcggctcgtccccccgcg gatgagccgccgcggacggggcgcgggcggacgatggaactccacatcctggagcaccgg ctgcaagttgccagcgtcgccaaggagagtatcccgctgttcacctacggcctgatcaaa cttgccttcctgtcctccaagaccagaaagcagcctgtcctgtcccaggcccagaaagtc cactggattggcaaaaactacacaggcaagtatgcagtaagcatgggcccatggaaactt cctgctgccttcctgggccggatctgctga >gi568815591f:74797652_75005671|GENSCAN_predicted_peptide_7|83_aa MGDPRKPLTRHQGGFLEERMLKPTSRSVCVRFKDSQDFSGWTKGCCCYYCDDGEEEEEEE EEEEEETGSHYVVQAGVQWCNLG >gi568815591f:74797652_75005671|GENSCAN_predicted_CDS_7|252_bp atgggtgatcccaggaagccactcaccaggcaccagggtggcttcctggaggagaggatg ctgaagccgacctcaagaagtgtctgtgttaggtttaaggactcacaagacttctctggc tggactaaaggttgttgttgttattattgtgatgatggtgaggaggaggaggaggaggag gaggaggaggaggaggagacagggtctcattatgttgtccaggctggagtacagtggtgc aatctcggctga