GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:02:29

Sequence gi568815586f:67549134_67758710 : 209577 bp : 40.21% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   2862   2251  612  0  0   32   85   213 0.323  10.77
 1.00 Prom -   4278   4239   40                              -7.35

 2.06 PlyA -   4374   4369    6                               1.05
 2.05 Term -   5800   5304  497  2  2   -8   42   455 0.437  25.04
 2.04 Intr -   6269   5900  370  2  1   96   19   178 0.327   5.45
 2.03 Intr -   9407   9304  104  0  2   60   92   106 0.111   7.17
 2.02 Intr -  12257  12191   67  2  1  -51   98    96 0.005  -5.74
 2.01 Init -  27970  27863  108  1  0   99   98   117 0.910  14.07
 2.00 Prom -  36525  36486   40                              -3.95

 3.00 Prom +  41620  41659   40                              -8.15
 3.01 Init +  43048  43424  377  0  2   46   87   176 0.521   9.75
 3.02 Intr +  48477  48576  100  0  1   44   37   121 0.297   1.79
 3.03 Term +  58075  58239  165  0  0  101   43   126 0.823   6.33
 3.04 PlyA +  58714  58719    6                               1.05

 4.00 Prom +  61484  61523   40                              -6.85
 4.01 Init +  75476  75770  295  1  1   80    6   279 0.143  16.29
 4.02 Intr +  93347  93427   81  0  0   70   98    33 0.302   1.19
 4.03 Intr +  94084  94241  158  2  2   93    6    83 0.407  -0.59
 4.04 Intr +  94843  94876   34  0  1  102   81    54 0.328   2.98
 4.05 Intr +  98545  98612   68  2  2   83   39    57 0.352  -1.99
 4.06 Intr +  98940  99075  136  2  1   90   45   122 0.360   7.32
 4.07 Intr + 100012 100049   38  2  2   41  107    77 0.506   1.96
 4.08 Intr + 100208 100394  187  1  1   69   16   123 0.560   1.54
 4.09 Intr + 100664 100812  149  0  2   80   93   175 0.955  16.23
 4.10 Term + 107973 109580 1608  2  0   54   49  1702 0.532 151.96
 4.11 PlyA + 111680 111685    6                               1.05

 5.00 Prom + 117835 117874   40                              -6.05
 5.01 Init + 120641 120701   61  1  1   50   89   103 0.104   8.06
 5.02 Intr + 124148 124240   93  0  0   87   46    83 0.023   3.02
 5.03 Term + 132418 132584  167  2  2   77   54   129 0.879   5.50
 5.04 PlyA + 132731 132736    6                               1.05

 6.04 PlyA - 133628 133623    6                               1.05
 6.03 Term - 137468 137355  114  2  0   36   48   121 0.672   0.49
 6.02 Intr - 138070 137901  170  2  2   13   85   112 0.527   2.14
 6.01 Init - 146706 146637   70  0  1   87   78    51 0.569   5.26
 6.00 Prom - 149831 149792   40                              -6.45

 7.05 PlyA - 150067 150062    6                               1.05
 7.04 Term - 150934 150830  105  1  0   87   55    63 0.019   0.33
 7.03 Intr - 165592 165465  128  2  2   60   75   103 0.956   5.68
 7.02 Intr - 166384 166189  196  1  1   64   94   110 0.959   7.27
 7.01 Init - 170215 169892  324  1  0   97   44   234 0.855  17.36
 7.00 Prom - 176763 176724   40                              -3.45

 8.04 PlyA - 177531 177526    6                               1.05
 8.03 Term - 200577 200406  172  2  1   87   45   105 0.728   2.42
 8.02 Intr - 200838 200705  134  0  2  -82   59   203 0.751  -1.08
 8.01 Intr - 204635 204406  230  0  2   76   89   158 0.753  11.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  20134  20223   90  0  0   99   43    84 0.804   1.74
S.002 Init + 130444 130507   64  0  1   89   64    46 0.907   1.66
S.003 Term + 194967 195224  258  2  0   88   47   135 0.821   3.97

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:67549134_67758710|GENSCAN_predicted_peptide_1|204_aa
MNRTNDKNHMIISIDAEKAFSKIQHPFMLKSLNKLGIDGTYLKIIRAIYDKPTANITLNG
QKLEAFLLKTSTRQGHPLSPLLFNIVLEVLARAIRQEKEIKGIQSGKEEVELFLFADDMI
VCLENPIVSAPNLLKLISNFSSLRIQNQCEKSQAFLYTNNRQTESQIMSELLFTIATKRI
KYLAIQLTRDVKDLCKENYKPLLK

>gi568815586f:67549134_67758710|GENSCAN_predicted_CDS_1|612_bp
atgaacagaaccaatgacaaaaaccacatgattatctcaatagatgcagaaaaggccttc
agcaaaattcaacaccccttcatgctaaaatctctcaataaactaggtattgatggaacg
tatctcaaaataataagagctatttatgacaaacccacagccaatatcacactgaatggg
caaaaactggaagcatttcttttgaaaaccagcacaagacaaggacaccctctctcacca
ctcctattcaacatagtattggaagttctggccagggcaatcaggcaagagaaagaaata
aagggtattcaatcaggaaaagaggaagttgaattattcctgtttgcagatgacatgatt
gtatgtttagaaaaccccattgtctcagccccaaatctcctcaagctgattagcaacttc
agcagtctcaggatacaaaatcaatgtgaaaaatcacaagcattcctatacactaacaac
agacaaacagagagccaaatcatgagtgaactccttttcacaattgctacaaaaagaata
aaatacctagcaatacaacttacaagggatgtgaaggacctctgcaaggagaactacaaa
ccactgctcaag

>gi568815586f:67549134_67758710|GENSCAN_predicted_peptide_2|381_aa
MEIDEDTGKSSISGLVGTTAQQEWVQERMESEEVKKCTSVKLVEACKELVITAVAGVRAA
CFGESKGVAFGVNELQRCLLSLPTLRPMPGEDQGISEKKAAAPVRDLQIKPPSPWDRVPG
GRGGCGCSFSRLKRPCLMAKKRADLPAQRSSSAKGQTDSSSGSLAPMYPDWETPPSMGRQ
TPHTGELSLAPGRHPSGMKLPEEGTGSNLCCSAASAERNSININKRDIHSEALSEGHQHQ
RPKVDKSMKMGRNQRKNAENSKNQNASSPPKNHNSSRTREQTWMENEFDELTEVGFRRWV
ITNSSELKEHVITQCKEAKNLEERLEELLTRMTSLEKNINDLMELKNTAQELCEASTSIN
SRIDQAEERISEIEDQLNERK

>gi568815586f:67549134_67758710|GENSCAN_predicted_CDS_2|1146_bp
atggaaatcgatgaagacactggcaaaagcagtatcagtggattggtggggacaacagct
caacaggagtgggttcaagaaagaatggaaagtgaggaagtgaagaagtgcaccagtgtg
aagcttgttgaagcctgcaaagaactggtcatcacagcagtggctggagttagggctgct
tgctttggtgaaagcaaaggtgtagcatttggtgtcaatgagctacagaggtgccttctc
tctctccccacactgaggcccatgccaggtgaagaccagggcatctctgaaaaaaaggca
gcagcaccagtcagggacttacagataaaacccccatctccctgggacagagtacctggg
ggaaggggcggctgtgggtgcagcttcagcagacttaaacgtccctgcctgatggctaag
aagagagcagatctcccagcacagcgctcaagctctgctaagggtcagactgactcctca
agtggttccctggcccccatgtatcctgactgggagacacctcccagtatgggccgacag
acacctcatacaggagagctctcactagcacctggcaggcacccctctgggatgaagctt
ccagaggaaggaactggcagcaatctttgctgttctgcagcctctgctgaaaggaatagc
atcaacatcaacaaaagggacatccactcagaggccctatccgaaggtcaccaacatcaa
agaccaaaggtagataaatccatgaagatggggagaaaccagcgcaaaaatgctgaaaat
tccaaaaaccagaatgcctcttctcctccaaagaatcacaactcctcgcgaacaagggaa
caaacctggatggagaatgagtttgacgaattgacagaagtaggcttcagaagatgggta
ataacaaactcctctgagctaaaggagcatgttataactcaatgcaaggaagccaagaac
cttgaagaaaggttagaggaattgctgactagaatgaccagtttagagaagaacataaat
gacctgatggagctgaaaaacacagcacaagaactttgtgaagcatccacaagtatcaat
agccgaatcgatcaagcagaagaaaggatatcagagattgaagatcaacttaatgaaaga
aagtaa

>gi568815586f:67549134_67758710|GENSCAN_predicted_peptide_3|213_aa
MKEGRKRERERKEGREKRKKEREREKEGKERKKEKERPSKQGGREEERKEKREKEGRRER
KKEERKERKRERKEKEKKKGRKEGRKEGRKEGRKGKREKEGRGKKKRKKRKEGRQQQQQK
QISNEKTQQEGSHLQAMKRALTRNKACRHLDLGLLDSRTPKTEAPGKNSEALEDGRATDG
KMPCRTVCLFATAAITERHKLGGFNNKNPLPQF

>gi568815586f:67549134_67758710|GENSCAN_predicted_CDS_3|642_bp
atgaaagaaggaaggaagagagagagagaaagaaaggaaggaagggagaaaagaaagaaa
gaaagagagagagagaaagaaggaaaagaaagaaagaaagaaaaagaaagaccaagcaag
cagggagggagggaggaagaaaggaaggagaaaagagaaaaggaaggaaggagagagaga
aagaaagaagaaaggaaagaaagaaagagagaaagaaaagaaaaagagaaaaagaaagga
aggaaggaaggaaggaaggaaggaaggaaggaaggacggaaagggaaaagagaaaaggaa
ggaaggggaaaaaagaaaagaaagaaaaggaaggaaggaaggcaacaacaacaacaaaaa
cagatttcaaatgaaaagacacagcaagaaggcagccatctgcaagccatgaagagagcc
ctcaccagaaacaaagcatgccggcaccttgatcttggacttctagactccagaactcca
aagacagaggctccaggaaagaactctgaggctctagaagatggcagagctacagatgga
aagatgccctgcaggactgtgtgtttgtttgctacggctgccataacagagcgccacaaa
ctgggtggcttcaacaacaaaaatccattgcctcagttctaa

>gi568815586f:67549134_67758710|GENSCAN_predicted_peptide_4|917_aa
MEGVAFCPYWYRHFGYGFAFAECNASDKDTICGLTEYLIHYYVISNSIASDQEITLTAKE
VQQWARAHEIPWSYPPSCSSYLDRMVEWSFEDSSGIKLGLLPRDSHKLLTSFKSAEISPY
QKGLHSARSSMIPRTQESACSLLAAPRAAPWAYLLEADSTGTYFISCVNCVATSLRRLVV
LDLGDGDIEVEVAKPVTFEQRRTVCHARRGADRLASGHLVSVNRGGCVVEVGGQPPLSPL
LRVNELSGRPSRDLPLRGNLRPPLPPPTRPGCARPRRPLRGEPSARRGFCLPGARAGTVQ
ACRPALRCPGLGVAVAPCSAVQTHLPAAGRLQRRGGDSAVRQLQASPGLGAGATRSGVGT
GPPSPIALPPLRASNAAAAAHTIGGSKHTMNDHLHVGSHAHGQIQVQQLFEDNSNKRTVL
TTQPNGLTTVGKTGLPVVPERQLDSIHRRQGSSTSLKSMEGMGKVKATPMTPEQAMKQYM
QKLTAFEHHEIFSYPEIYFLGLNAKKRQGMTGGPNNGGYDDDQGSYVQVPHDHVAYRYEV
LKVIGKGSFGQVVKAYDHKVHQHVALKMVRNEKRFHRQAAEEIRILEHLRKQDKDNTMNV
IHMLENFTFRNHICMTFELLSMNLYELIKKNKFQGFSLPLVRKFAHSILQCLDALHKNRI
IHCDLKPENILLKQQGRSGIKVIDFGSSCYEHQRVYTYIQSRFYRAPEVILGARYGMPID
MWSLGCILAELLTGYPLLPGEDEGDQLACMIELLGMPSQKLLDASKRAKNFVSSKGYPRY
CTVTTLSDGSVVLNGGRSRRGKLRGPPESREWGNALKGCDDPLFLDFLKQCLEWDPAVRM
TPGQALRHPWLRRRLPKPPTGEKTSVKRITESTGAITSISKLPPPSSSASKLRTNLAQMT
DANGNIQQRTVLPKLVS

>gi568815586f:67549134_67758710|GENSCAN_predicted_CDS_4|2754_bp
atggaaggtgtagcattttgtccttactggtatagacactttggatatggatttgccttt
gctgagtgtaatgcttctgacaaagataccatctgtggacttacagaataccttatccac
tactatgttatttcaaacagcattgcttctgatcaagaaattaccctcacagcaaaagaa
gtgcagcaatgggcccgtgctcatgaaattccctggtcttacccaccatcctgcagcagc
tatcttgatagaatggtagaatggtctttcgaagattcatctggcatcaagctaggcctt
ctgcccagagattcccataagcttctcacttctttcaaatctgctgagatatcaccttat
cagaaaggccttcacagtgcaaggtccagcatgatccccaggacccaggaatcagcctgt
tctttgctggctgctccccgtgcagccccatgggcttatctcctagaggctgactccact
ggcacctactttataagctgtgtcaattgcgtggccactagcctgagaagactggtagtt
ctggaccttggtgatggagacatagaagttgaagtggcaaaacctgtgacctttgaacag
aggcgcactgtgtgccacgcacgacgaggcgcggatcgactggcgagcggtcacttggtg
tcagtaaaccggggagggtgtgtggtggaggtgggagggcagccccctctttcgcccctg
ctcagggtcaacgagctcagcggccgcccctcccgggacttgcctctccgaggaaacctt
cggccgccgctcccgccgcctacccgaccgggttgtgcgcggccccgaaggcccctccgc
ggggagccctcggcccgaagaggcttctgcctgccgggggcccgggcggggaccgtccag
gcctgccgccccgccctgcgctgcccggggcttggcgtggccgtggctccctgctcagct
gtccaaacccacctcccggctgctggccgtctgcaacgccgaggtggggacagcgccgtt
cgtcagcttcaggcttccccggggctcggtgcaggggccacccggagcggagtggggact
ggcccgccctcccccatcgccctgccgcctctccgggccagcaacgctgccgccgcagcc
cacacgattggcggcagtaagcacacaatgaatgatcacctgcatgtcggcagccacgct
cacggacagatccaggttcaacagttgtttgaggataacagtaacaagcggacagtgctc
acgacacaaccaaatgggcttacaacagtgggcaaaacgggcttgccagtggtgccagag
cggcagctggacagcattcatagacggcaggggagctccacctctctaaagtccatggaa
ggcatggggaaggtgaaagccacccccatgacacctgaacaagcaatgaagcaatacatg
caaaaactcacagccttcgaacaccatgagattttcagctaccctgaaatatatttcttg
ggtctaaatgctaagaagcgccagggcatgacaggtgggcccaacaatggtggctatgat
gatgaccagggatcatatgtgcaggtgccccacgatcacgtggcttacaggtatgaggtc
ctcaaggtcattgggaaggggagctttgggcaggtggtcaaggcctacgatcacaaagtc
caccagcacgtggccctaaagatggtgcggaatgagaagcgcttccaccggcaagcagcg
gaggagatccgaatcctggaacacctgcggaagcaggacaaggataacacaatgaatgtc
atccatatgctggagaatttcaccttccgcaaccacatctgcatgacgtttgagctgctg
agcatgaacctctatgagctcatcaagaagaataaattccagggcttcagtctgcctttg
gttcgcaagtttgcccactcgattctgcagtgcttggatgctttgcacaaaaacagaata
attcactgtgaccttaagcccgagaacattttgttaaagcagcagggtagaagcggtatt
aaagtaattgattttggctccagttgttacgagcatcagcgtgtctacacgtacatccag
tcgcgtttttaccgggctccagaagtgatccttggggccaggtatggcatgcccattgat
atgtggagcctgggctgcattttagcagagctcctgacgggttaccccctcttgcctggg
gaagatgaaggggaccagctggcctgtatgattgaactgttgggcatgccctcacagaaa
ctgctggatgcatccaaacgagccaaaaattttgtgagctccaagggttatccccgttac
tgcactgtcacgactctctcagatggctctgtggtcctaaacggaggccgttcccggagg
gggaaactgaggggcccaccggagagcagagagtgggggaacgcgctgaaggggtgtgat
gatccccttttccttgacttcttaaaacagtgtttagagtgggatcctgcagtgcgcatg
accccaggccaggctttgcggcacccctggctgaggaggcggttgccaaagcctcccacc
ggggagaaaacgtcagtgaaaaggataactgagagcaccggtgctatcacatctatatcc
aagttacctccaccttctagctcagcttccaaactgaggactaatttggcgcagatgaca
gatgccaatgggaatattcagcagaggacagtgttgccaaaacttgttagctga

>gi568815586f:67549134_67758710|GENSCAN_predicted_peptide_5|106_aa
MNVGRECGEMERKMMSKGGASTQCKLSVDLPFWGLEDSDPLLTAPLGSATVGLKQRLPVT
ATCVDKRELDKAELSALNEVTYLSCRYESIAKDYENLKKNEDKRAG

>gi568815586f:67549134_67758710|GENSCAN_predicted_CDS_5|321_bp
atgaacgtcggccgtgaatgtggggagatggagagaaagatgatgagcaaaggtggagca
agtacacagtgcaagctgtcagtggatctaccattctggggtctggaggacagtgaccct
cttctcacagctccactaggcagtgccacagtaggtctaaagcaaaggctgccagtgaca
gccacttgtgtagacaagagagagctggacaaggccgagctatctgcattgaatgaggtg
acttacctcagttgtaggtatgaatctatagccaaggattatgaaaacctgaagaaaaat
gaggacaaaagagcaggttga

>gi568815586f:67549134_67758710|GENSCAN_predicted_peptide_6|117_aa
MEALNTVAKEALTKETFEQRLEREKDNVNNPISAKEIGFVDRNLPTKKTPDSGSVENYIK
HFEKEIISILYKLFQKFEDKIRNKTRMLALKLLFSTVSEIVASAVIEEKEIKSIQIG

>gi568815586f:67549134_67758710|GENSCAN_predicted_CDS_6|354_bp
atggaagccttaaatacagtagccaaggaagccctcactaaggagacatttgagcaaaga
cttgaaagagaaaaagataatgtgaacaacccaatatctgcgaaagaaattggatttgta
gatagaaaccttcctacaaagaaaactccagactcaggttcagtggagaattatatcaaa
cattttgagaaggaaataatatcaattctgtataaactcttccaaaaatttgaagacaag
atcaggaacaagacaaggatgcttgctctcaaacttctattcagtactgtatcggagatt
gtggccagtgcagtcatagaagaaaaagaaataaagagcatccagattggataa

>gi568815586f:67549134_67758710|GENSCAN_predicted_peptide_7|250_aa
MPEPPTPSVGSCAAQASPMSNAPCSTAPSTINHPRAEECGRTARDWQAAPPAAPVRDPLG
EASWAPESGGDVENLYVQLRDCKYTNQHPVSSSGFVNAPMDTLYLATLRSTSVGPERTLC
IPGLQPLDSSNSRKRQPLIPDLEFTKGQCLQEFSSDLMIDALPIMSPRKSPMPRAEGLIG
KREGRKQKEEAPQYRDRGWGAPKPRKEVASVVDTSQWSYIYPFYQISFDIARQLFQVDQL
ADRRAESKMQ

>gi568815586f:67549134_67758710|GENSCAN_predicted_CDS_7|753_bp
atgcctgagcctcccaccccctccgtgggctcctgtgcggcccaagcctccccgatgagc
aacgccccctgctccacggcgcccagtaccatcaaccacccaagggctgaggagtgcggg
cgcacggcgcgggactggcaggcagctccacctgcagcccctgtgcgggatccactgggt
gaagccagctgggctcctgagtctggtggggacgtggagaacctttatgtccagctcagg
gattgtaaatacaccaatcagcaccctgtgtctagctcagggtttgtgaatgcaccaatg
gacactctgtatctagctactctgagaagcacttcagtgggaccagagagaacactgtgc
atcccaggattgcagccactggactccagcaactcgaggaagaggcagccattgatacca
gacctagaattcactaaaggtcaatgccttcaggaattttcatctgatttgatgattgat
gccctccccatcatgagccccaggaagtctcccatgccacgagcagagggtttaataggc
aagagggaagggagaaaacagaaggaagaagctccccagtacagagacagagggtggggg
gctccaaagccacgaaaggaggtggcaagtgtggtggacaccagccagtggagctatatt
tacccgttttatcagatatcttttgatattgcaaggcagttgtttcaggtagaccagctt
gcagacaggagagctgagagcaaaatgcagtga

>gi568815586f:67549134_67758710|GENSCAN_predicted_peptide_8|178_aa
XPSKGAIGKAQTESAFKSSLASHLLLLRASCGKREPLACGRGISYFSKPLGEHFSKAKGN
LPMKLLVTKPALSKTVVAVSELENEGSQQCAQSKSEDLRTKEDNGVNLSSRRKVQEPKGP
LVPSANWRVPTHIEGGYSPLSPLTYMPVSSANVLTPRAAQSFNQTPSHLALRNKFSAY

>gi568815586f:67549134_67758710|GENSCAN_predicted_CDS_8|537_bp
ntaccttcaaagggagcgataggtaaagcacaaacagaatcagctttcaaaagttccctg
gcttcccacctgctgctgctgcgggcttcctgtgggaaaagggaaccgctggcctgtggg
agaggaatcagctatttctcaaagccactgggagagcacttttcaaaggcaaaaggaaat
cttcccatgaagctgttagttaccaagcccgcactgagcaagacagtggtggctgtcagc
gagctggagaatgaaggaagccagcagtgtgctcagtccaagtctgaagaccttagaacc
aaggaagacaatggtgtaaatctcagttcaaggcgaaaggtccaagagcccaagggtcca
ctggtgccctcagccaactggagagtgcctactcacattgaaggtggatattctccactc
agcccactgacttacatgccagtctcctctgcaaacgtcctcacacctcgggcagcccaa
tcatttaaccaaactccaagtcacctggctttgaggaacaagttcagtgcctactga