GENSCAN 1.0    Date run: 5-Nov-116    Time: 23:53:25

Sequence gi568815595r:81390410_81861517 : 471108 bp : 35.92% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1201   1386  186  0  0  109   75   117 0.571  11.61
 1.02 Term +   1929   2078  150  2  0   49   43   126 0.484   1.13
 1.03 PlyA +   2432   2437    6                               1.05

 2.03 PlyA -   2736   2731    6                               1.05
 2.02 Term -   2894   2864   31  1  1  119   48     8 0.085  -3.75
 2.01 Init -   5654   5491  164  2  2   46   67   182 0.153  11.34
 2.00 Prom -  12654  12615   40                              -3.65

 3.05 PlyA -  13360  13355    6                               1.05
 3.04 Term -  27519  27314  206  1  2  102   53    86 0.490   3.05
 3.03 Intr -  28147  28074   74  0  2   49   77    56 0.419  -1.27
 3.02 Intr -  38175  38069  107  0  2   92   87   101 0.621   8.49
 3.01 Init -  51166  51053  114  1  0   90   35   128 0.796   7.96
 3.00 Prom -  57827  57788   40                              -5.05

 4.10 PlyA -  58866  58861    6                               1.05
 4.09 Term -  60850  60703  148  0  1   68   38   104 0.436  -0.21
 4.08 Intr -  61444  61324  121  2  1   90   82    61 0.295   4.33
 4.07 Intr -  61975  61844  132  2  0  110   84   -11 0.223   0.40
 4.06 Intr -  70413  70281  133  0  1   79   72    22 0.085  -1.00
 4.05 Intr -  85658  85546  113  0  2   39  103    94 0.126   5.18
 4.04 Intr -  87094  87011   84  2  0  108   93    13 0.144   2.67
 4.03 Intr - 108879 108697  183  1  0   73   97    56 0.091   3.84
 4.02 Intr - 120315 120282   34  0  1  105   80    37 0.040   1.48
 4.01 Init - 135381 135202  180  0  0   70   41   135 0.716   6.23
 4.00 Prom - 141907 141868   40                              -6.75

 5.09 PlyA - 142614 142609    6                               1.05
 5.08 Term - 144916 144764  153  1  0   76   42   115 0.950   2.64
 5.07 Intr - 146686 146502  185  2  2   82  115   100 0.972  10.69
 5.06 Intr - 187687 187516  172  1  1   91  106   204 0.859  21.09
 5.05 Intr - 190866 190756  111  0  0   81   99   116 0.992  11.66
 5.04 Intr - 195781 195683   99  1  0   95  110    74 0.864   9.69
 5.03 Intr - 200755 200628  128  2  2   73   88   103 0.966   8.28
 5.02 Intr - 203578 203499   80  0  2   -6   69   130 0.333  -0.02
 5.01 Init - 210981 210947   35  0  2   73   78    32 0.544   0.09
 5.00 Prom - 215874 215835   40                              -6.85

 6.00 Prom + 215896 215935   40                              -6.75
 6.01 Init + 220183 220288  106  0  1   59   39    98 0.777   2.43
 6.02 Term + 222426 222670  245  1  2   82   38   535 0.998  42.98
 6.03 PlyA + 223301 223306    6                               1.05

 7.10 PlyA - 223416 223411    6                               1.05
 7.09 Term - 241711 241600  112  0  1   69   55   108 0.919   2.65
 7.08 Intr - 252581 252372  210  1  0   58   96   158 0.968  10.71
 7.07 Intr - 256073 255983   91  0  1   45   98    95 0.898   4.33
 7.06 Intr - 258582 258447  136  0  1   62  111    34 0.861   2.32
 7.05 Intr - 259512 259387  126  0  0   90   78    70 0.857   6.16
 7.04 Intr - 280544 280429  116  0  2   97   75    45 0.002   3.35
 7.03 Intr - 305944 305834  111  2  0  116   68    37 0.208   3.93
 7.02 Intr - 315204 315035  170  2  2   93   83   156 0.906  14.27
 7.01 Init - 371108 370966  143  2  2   74   86   283 0.675  26.35
 7.00 Prom - 383023 382984   40                              -4.95

 8.02 PlyA - 385238 385233    6                               1.05
 8.01 Sngl - 390582 390133  450  0  0   66   48   244 0.289  14.17
 8.00 Prom - 406336 406297   40                              -2.75

 9.03 PlyA - 407101 407096    6                               1.05
 9.02 Term - 421028 420706  323  0  2   17   42   278 0.523  10.10
 9.01 Init - 421715 421511  205  2  1   97   44   136 0.740   7.16

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 167035 167205  171  0  0   60   41   188 0.816   5.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_1|111_aa
MAEGETHILHGSRQERKMRAERKGKPLIKASDLMRLIYYHEESMGQTVPVIQLSPTRSLQ
QQTRGIISVSSQVMPESIRDFLHDLDIKAPRQDDVKALRFQRRGHKPLDKM

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_1|336_bp
atggcggaaggcgaaactcacatcttacacggcagcaggcaagagagaaaaatgagagcc
gagcgaaaagggaaaccccttataaaagcatcagatctcatgagacttatttactaccat
gaggagagtatggggcaaaccgtccccgtgattcaattatctcccaccaggtccctccaa
caacagacacgaggaatcatttctgtcagttctcaggttatgcctgaatctatccgggat
tttcttcatgaccttgacattaaagctcccagacaggatgatgtcaaggccctaagattc
cagcgaagaggacacaaaccactggataagatgtag

>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_2|64_aa
MVSRRRLFCDWRSSDLWLLLAQCPGLLPPSVLKRFCQHKRPEHQTVPETLKVNQMFLGIS
RNCI

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_2|195_bp
atggtttccagacggagactcttctgtgactggcggtcatctgacctgtggctgcttctg
gcccagtgccctggtctgctcccaccatctgtcttgaaaaggttctgccagcataagcgg
cctgaacaccagactgtgcctgagactctgaaagtgaatcagatgtttttgggaatcagt
agaaactgtatttga

>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_3|166_aa
MVPASAPGEGLRKLTIMVQGEGEPASHMTRSSKREREEKLQGSIYIKGTSSTTTLTKNET
LQLRLSTCVENVELVIVLAARQTRCAVAAIIHALSRGGGTSYIWNRLKNSNKVEKIAQSL
CSSVFGDYVNHGYPRLCVHYTLFTAANDSNHFILTVPRSYDEESWL

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_3|501_bp
atggtgccagcatctgctcctggtgaaggcctcaggaagcttacaatcatggtgcaaggt
gaaggcgagccagccagtcacatgacgagaagcagcaagagagaaagggaggagaaactt
cagggctctatttacatcaaagggacatcaagtaccacaacgctcactaaaaatgaaacc
ctacagctcagactgtcaacttgtgtggagaatgtagagctggtgattgttttggcagca
agacagacaagatgtgcggttgctgccattattcatgctttgtcaagaggaggtgggacc
tcttatatctggaatcgacttaaaaacagtaataaggttgagaagatagctcaatctctg
tgctcatcagtgtttggggactacgtgaaccatggctacccgaggctgtgtgttcattat
acactattcacagccgctaatgatagcaaccattttatattaacagtaccaagatcttat
gatgaggagagctggttgtga
>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_4|375_aa
MDKFLDTYTLPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKDKL
VFGEQVVFGYMTRILQCLIQLLNCCEFLKSCTFKIVLDSDAAEYGGHQRLDHSTDFFSEA
FEHNGRPYSLLVILVLIAFLSFFSKSLGLSSFTFIRLCMPVHEANPYGNTQHADKISISI
SGDSGTRATNENAVYKEVKMFLFQPFALLIIVLNRDVRLKAFPFQKFKCGASPTSLVGIF
RKDEPYSKPTGPGLDLQSTPHILNPNLLKSCLNDSRSHHNFLQEDFLCQAMKQALKGRKF
FRLHYGAFKGNTILNEKMSLLKLLAPSWLFQIENVEEEEDLFLVGGTCCLRQSCKQYKDV
VAFRGDLEGDRTYLL

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_4|1128_bp
atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa
aaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaggacaagctg
gtttttggggaacaggtggtgtttggttacatgacacgtattttacaatgtctcattcaa
ctcctcaactgttgtgaattccttaagagctgtacattcaaaattgtgctagattcagat
gcagcggaatatggagggcatcagagactggaccacagcactgactttttttctgaggct
tttgaacataatgggcgtccctattctcttttggtaatccttgtgctcatcgctttcttg
tcattcttttccaagtcacttggactctccagcttcactttcattaggctctgtatgcca
gtgcatgaggctaatccatatggaaacacccagcatgctgataaaatttcaatctccatc
tcaggagattcgggaactagagctacaaatgaaaatgccgtgtacaaagaagtgaaaatg
tttctattccagccatttgctttattaatcattgtgctgaatagagatgttaggttgaaa
gcctttccatttcagaaatttaaatgtggtgcttcaccaacttctctagtaggaattttc
cggaaggatgaaccttattctaagccaactggcccaggccttgatcttcaaagcacacct
catattttaaacccaaacttactaaaatcctgtctaaatgatagtagaagtcaccataat
tttctgcaggaagacttcttgtgtcaggctatgaaacaagctcttaaaggaaggaagttt
ttcaggctacattatggtgcttttaaagggaatacaattcttaatgagaaaatgagtctc
ctgaagctcctggctccaagctggctcttccaaattgagaatgtggaggaggaagaggat
ctttttctagttggtgggacttgttgcttgagacaaagctgtaagcagtacaaagacgtg
gtagcttttagaggagacctggagggtgacagaacttacctgttatga

>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_5|320_aa
MLNVEYKIAVKRWWLEEYRFDGFRFDGVTSMLYHHHGVGQGFSGDYSEYFGLQVDEDALT
YLMLANHLVHTLCPDSITIAEDVSGMPALCSPISQGGGGFDYRLAMAIPDKWIQLLKEFK
DEDWNMGDIVYTLTNRRYLEKCIAYAESHDQALVGDKSLAFWLMDAEMYTNMSVLTPFTP
VIDRGIQLHKMIRLITHGLGGEGYLNFMGNEFGHPEWLDFPRKGNNESYHYARRQFHLTD
DDLLRYKFLNNFDRDMNRLEERYGWLAAPQAYVSEKHEGNKIIAFERAGLLFIFNFHPSK
SYTDYRVGTALPGKYPFCYQ

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_5|963_bp
atgctgaatgtggagtataaaattgcagttaagagatggtggttggaagaatatcgcttt
gatggatttcgttttgatggtgttacgtccatgctttatcatcaccatggagtgggtcaa
ggtttctcaggtgattacagtgaatatttcggactacaagtagatgaagatgccttgact
tacctcatgttggcaaatcatttggttcacacgctgtgtcccgattctataacaatagct
gaggatgtatcaggaatgccagctctgtgctctccaatttcccagggagggggtggtttt
gactatcgactagccatggcaattccagataagtggattcagctacttaaagagtttaaa
gatgaagactggaacatgggcgatatagtatacacgctcacaaacaggcgctaccttgaa
aagtgcattgcttatgcagagagccatgatcaggcattggttggggataagtcgctggca
ttttggttgatggatgccgaaatgtatacaaacatgagtgtcctgactccttttactcca
gttattgatcgtggaatacagcttcataaaatgattcgactcattacgcatgggcttggt
ggagaaggctatctcaatttcatgggtaatgaatttgggcatcctgaatggttagacttc
ccaagaaaaggaaataatgagagttaccattatgccaggcggcagtttcatttaactgac
gacgaccttcttcgctacaagttcctaaataattttgacagggatatgaatagattggaa
gaaagatatggttggcttgcagctccacaggcctacgtgagtgaaaaacatgaaggcaat
aagatcattgcttttgaaagagcaggtcttcttttcattttcaacttccatccaagcaag
agctacactgactaccgagttggaacagcattgccagggaaatatcctttttgttaccag
tga

>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_6|116_aa
MAQELQRYCHHNGWQKQKYHSSLKRQGLRRTPVTAGVDELGEFIKDNIWPKPLQYYLVPD
MDDEEGKGEKDDDDDDDDDAKEEGLEDTDEVGDKDEGEEDEDDEGEEREEDEGEDD

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_6|351_bp
atggcccaggaacttcagaggtactgtcatcacaatggctggcagaagcaaaaataccat
agctctctgaagcgccaaggcttgcggagaacacctgtgactgcaggtgtggatgagtta
ggagagttcatcaaagacaatatttggccaaaaccattacagtactacttggttcctgat
atggatgatgaagaaggaaaaggagaaaaagatgatgatgatgatgacgatgatgacgca
aaggaggaaggattagaagatactgatgaagtaggggataaggatgaaggtgaagaagat
gaagatgatgaaggggaggaaagagaggaggatgaaggagaagatgactaa

>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_7|404_aa
MAAPMTPAARPEDYEAALNAALADVPELARLLEIDPYLKPYAVDFQRRYKQFSQILKNIG
ENEGGIDKFSRGYESFGVHRCADGGLYCKEWAPGAEGVFLTGDFSSLSEYRFALWRGLQK
CGKSVKAVQHCFCSSKQLFHADGWNPFSYPYKKLDYGKWELYIPPKQNKSVLVPHGSKLK
VVITSKSGEILYRISPWAKYVVREGDNVNYDWIHWDPEHSYEFKHSRPKKPRSLRIYESH
VGISSHEGKVASYKHFTCNVLPRIKGLGYNCIQLMAIMEHAYYASFGYQITSFFAASSRY
GTPEELQELVDTAHSMGIIVLLDVVHSHASKNSADGLNMFDGTDSCYFHSGPRGTHDLWD
SRLFAYSRGMDEAGNRHSQQTNTGTENQTLHVLTDKSESNNENI

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_7|1215_bp
atggcggctccgatgactcccgcggctcggcccgaggactacgaggcggcgctcaatgcc
gccctggctgacgtgcccgaactggccagactcctggagatcgacccgtacttgaagccc
tacgccgtggacttccagcgcaggtataagcagtttagccaaattttgaagaacattgga
gaaaatgaaggtggtattgataagttttccagaggctatgaatcatttggcgtccacaga
tgtgctgatggtggtttatactgcaaagaatgggccccgggagcagaaggagtttttctt
actggagattttagtagcttatctgagtatcgttttgcattgtggagaggtttacagaaa
tgtggtaaatctgtcaaagctgtacagcattgtttttgctcatccaagcagttgtttcat
gcagatggttggaatccattttcgtacccatacaaaaaactggattatggaaaatgggag
ctgtatatcccaccaaagcagaataaatctgtactcgtgcctcatggatccaaattaaag
gtagttattactagtaaaagcggagagatcttgtatcgtatttcaccgtgggcaaagtat
gtggttcgtgaaggtgataatgtgaattatgattggatacactgggatccagaacactca
tatgagtttaagcattccagaccaaagaagccacggagtctaagaatttatgaatctcat
gtgggaatttcttcccatgaaggaaaagtagcttcttataaacattttacatgcaatgta
ctaccaagaatcaaaggccttggatacaactgcattcagttgatggcaatcatggagcat
gcttactatgccagctttggttaccaaatcacaagcttctttgcagcttccagccgttat
ggaacacctgaagagctacaagaactggtagacacagctcattccatgggtatcatagtc
ctcttagatgtggtacacagccatgcttcaaaaaattcagcagatggattgaatatgttt
gatgggacagattcctgttattttcattctggacctagagggactcatgatctttgggat
agcagattgtttgcctactccaggggcatggatgaagctggaaaccgtcattctcagcaa
actaacacaggaacagaaaaccaaacactgcatgttctcactgataaatcggagtcaaac
aatgagaacatatga

>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_8|149_aa
MLCAALDLMLCIPATLAMAKRGQSTAQAIASEDMSPQPWQLPHDYMCPTAMQKTRIEVWE
PPPTFQRMYGNAWMSRQRCAAGAEPSWRTFARAVQKGNVELESPHRVPTGALSSGAVRRG
HRPPDPRMVDTMTACTVCLEKPQTLNASP

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_8|450_bp
atgctgtgcgcagccttggatttgatgctctgcatcccagccactctagcaatggctaaa
aggggccaaagtacagctcaggccattgcttcagaggatatgagcccccagccttggcag
cttccacatgactacatgtgtcctacagctatgcagaagacaagaattgaggtttgggaa
cctccacctacatttcagaggatgtatggaaatgcctggatgtccaggcagaggtgtgct
gcaggggcagagccctcatggagaacctttgctagggcagtgcagaagggaaatgtggag
ttggagtccccacacagagtccccactggggcactctctagtggagctgtgagaagaggg
caccgtcctccagaccccagaatggtagatacaatgacagcttgcactgtgtgcctggaa
aagccacagacactcaatgccagcccatga

>gi568815595r:81390410_81861517|GENSCAN_predicted_peptide_9|175_aa
MPEPPTHSVSSCAAPASLTSATPCSTAPSPIDHPRAEECGCRAWDWQAAPPAAPVRDPLG
KASWAPESAKVCSFTPEANKTTNPPGGTTNSRRAALRAVTLTSRVRDFTPEPARPRTHQK
EETPNTSEHQKEQKSRRATLRAVTLTARVCSFILEVSETKNPPIPDTRGVGHIFA

>gi568815595r:81390410_81861517|GENSCAN_predicted_CDS_9|528_bp
atgcctgagcctcccacccactccgtgagctcctgtgcggccccagcctccctgacgagc
gccaccccctgctccacggcacccagccctatcgaccacccaagggctgaggagtgtggg
tgcagggcgtgggactggcaggcagctccacctgcagccccggtgcgggatccactgggt
aaagccagctgggctcctgagtctgcgaaggtctgcagcttcactcctgaagccaacaag
accacgaacccaccgggaggaactaccaattccagacgcgccgccttaagagctgtaaca
ctcacctccagggtccgcgacttcactcctgagccagcgagaccacgaacccaccagaag
gaagaaactccgaacacatccgaacatcagaaggaacaaaagtccagacgcgccacctta
agagctgtaacactcaccgcgagggtctgcagcttcattcttgaagtcagtgagaccaag
aacccaccaattccggacacacggggagtagggcacatctttgcttag