GENSCAN 1.0	Date run: 3-Nov-116	Time: 02:36:14

Sequence gi568815592f:65202522_65404954 : 202433 bp : 33.97% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4313   4388   76  1  1   59   92    39 0.306   2.70
 1.02 Term +  12813  13195  383  1  2   40   32   201 0.909   3.72
 1.03 PlyA +  13600  13605    6                               1.05

 2.04 PlyA -  14302  14297    6                               1.05
 2.03 Term -  19236  19016  221  1  2  120   44   120 0.826   7.02
 2.02 Intr -  19360  19343   18  2  0  111   90    30 0.602   1.16
 2.01 Init -  49331  49178  154  2  1   73   54    66 0.305   1.89
 2.00 Prom -  54233  54194   40                              -3.65

 3.00 Prom +  71258  71297   40                              -1.95
 3.01 Sngl + 100001 102418 2418  1  0   86   54  2274 0.988 215.96
 3.02 PlyA + 104034 104039    6                               1.05

 4.06 PlyA - 105425 105420    6                               1.05
 4.05 Term - 122558 122454  105  2  0   78   48   104 0.017   2.83
 4.04 Intr - 129896 129827   70  1  1   60   75    21 0.031  -3.73
 4.03 Intr - 132625 132459  167  1  2   81   68   141 0.488   9.24
 4.02 Intr - 141656 141517  140  0  2   85   94   115 0.946  11.06
 4.01 Init - 149044 149005   40  1  1   56   92     3 0.501  -2.10
 4.00 Prom - 152450 152411   40                              -2.95

 5.00 Prom + 165180 165219   40                              -3.45
 5.01 Sngl + 172903 173100  198  0  0   65   48   189 0.624   7.52
 5.02 PlyA + 173281 173286    6                               1.05

 6.00 Prom + 176041 176080   40                              -4.25
 6.01 Sngl + 176848 177507  660  0  0   51   48   289 0.979  17.12
 6.02 PlyA + 177865 177870    6                               1.05

 7.00 Prom + 178623 178662   40                              -4.25
 7.01 Sngl + 189028 190158 1131  0  0   69   39   231 0.740  12.62
 7.02 PlyA + 191205 191210    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl - 122660 122454  207  2  0   49   48   213 0.930   8.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:65202522_65404954|GENSCAN_predicted_peptide_1|152_aa
MKTLNKLAIEGKYIKIIRAICDKPTGVQASVEEGNTNTVEITRELELEEGPKDAIELLQS
HDKTCIDKEFFMKEQIKWFLKMECTSCEGAVNIVEITTNYLKSYKNLVHETMAGLKGIAL
NFERSSTVGKTLSNSNTCHREILCKRNCAPMW

>gi568815592f:65202522_65404954|GENSCAN_predicted_CDS_1|459_bp
atgaaaaccctcaacaaactagcaattgaaggaaaatacatcaaaataataagagccata
tgtgacaaacccacaggtgttcaggcttcggtggaagaaggaaatacaaatacggtagaa
ataacaagagaactagaattagaagaagggcctaaagatgcgattgaactgctgcaatct
catgataaaacttgtatagataaggagttctttatgaaggagcaaataaaatggtttctt
aagatggaatgtacttcttgtgaaggtgctgtgaacattgttgaaataacaacaaattat
ttaaaatcttacaaaaacttagttcatgaaaccatggcaggcttaaagggaattgcctta
aattttgaaagaagctccactgtgggtaaaacgctgtcaaacagcaacacatgccacaga
gaaatcttgtgtaaaaggaactgtgcaccgatgtggtaa

>gi568815592f:65202522_65404954|GENSCAN_predicted_peptide_2|130_aa
MGKESGILFRHWESPFPLFESEPEFSHHSLTPSVASSEALRSDYGYLRGKNGHPDARGAW
CKLSVDLPFLGLEDSGPFLTSPLGSAPVGTLCGGSNSTFPFHTDLAEVLHEGPAPAANFC
PDIQVFPYIL

>gi568815592f:65202522_65404954|GENSCAN_predicted_CDS_2|393_bp
atggggaaggaatctggtattctctttaggcattgggagtctcctttcccactctttgag
tcagaaccagagttctcccatcactctctaactccatctgtggccagttctgaagcgttg
cgttcagactatggatatttgaggggaaaaaatggtcaccctgatgcaagaggtgcatgg
tgcaagctgtcagtggatctaccattcttgggtctagaggacagtggcccttttctcaca
tctccactaggcagtgccccagtgggcactctgtgtgggggctccaactccacatttccc
ttccacactgacctagcagaggttctccatgagggccctgcccctgcagcaaacttctgc
ccagatatccaggtgtttccatacatcctttga

>gi568815592f:65202522_65404954|GENSCAN_predicted_peptide_3|805_aa
MPNQGEDCYFFFYSTCTKGDSCPFRHCEAALGNETVCTLWQEGRCFRRVCRFRHMEIDKK
RSEIPCYWENQPTGCQKLNCAFHHNRGRYVDGLFLPPSKSVLPTVPESPEEEVKASQLSV
QQNKLSVQSNPSPQLRSVMKVESSENVPSPKHPPVVINAADDDEDDDDQFSEEGDETKTP
TLQPTPEVHNGLRVTSVRKPAVNIKQGECLHFGIKTLEEIKSKKMKEKSEEQGEGSSGVS
SLLLHPEPVPGPEKENVRTVVRTVTLSTKQGEEPLVRLGLTETLGKRKFSTGGDSDPPLK
RSLAQRLGKKVEAPETNTDETPKKAQVSKSLKERLGMSADPNNEDATDKVNKVGEIHVKT
LEEMLLERASQKHGESQTKLKTEGPSKTDDSTSGARSSSTIRIKTFSEVLAEEEHRQQEA
ERQKSKKDTTCIKLKTDSEIKKTVVLPPIVASKGQSEEPAGKTKSMQEVHMKTVEEIKLE
KALRVQQSSESSTSSPSQHEATPGARLLLRITKRTWRKEEKKLQEGNEVDFLSRVRMEAT
EASVETTGVDITKIQVKRCEIMRETRMQKQQEREKSVLTPLQGDVASCNTQVAEKPVLTA
VPGITWHLTKQLPTKSSQKVEVETSGIADSLLNVKWSAQTLEKRGEAKPTVNVKQSVVKV
VSSPKLAPKRKAVEMHPAVTAAVKPLSSSSVLQEPPAKKAAVDAVVLLVSEDKSVTVPET
ENPRDSLVLPLTQSSSDSSPPEVSGPSSSQMSMKTRRLSSASTGKPPLSVEDDFEKLTWE
ISGGKLEAEIDLDPGKDEDDLPLEL

>gi568815592f:65202522_65404954|GENSCAN_predicted_CDS_3|2418_bp
atgcctaatcaaggagaagactgctatttttttttctattctacatgtaccaaaggtgac
agctgcccattccgtcactgtgaagctgcactaggcaatgaaactgtttgcacattatgg
caagaagggcgctgttttcgacgggtgtgcaggtttcggcacatggagattgataaaaaa
cgcagtgaaattccttgttattgggaaaatcagccaacaggatgtcaaaaattaaactgc
gctttccatcacaatagaggacgatatgttgatggccttttcctacctccgagcaaaagt
gtgttgcccactgtgcctgagtcaccagaagaggaagtgaaggctagccaactttcagtt
cagcagaacaaattgtctgtccagtccaatccttcccctcagctgcggagcgttatgaaa
gtagaaagttccgaaaatgttcctagccccaagcatccaccagttgtaattaatgctgca
gatgatgatgaagatgatgatgatcagttttctgaggaaggtgatgaaaccaaaacacct
accctgcaaccaactcctgaagttcacaatggattacgagtgacttctgtccggaaacct
gcagtcaatataaagcaaggtgaatgtttgcattttggaataaaaactcttgaggaaatt
aagtcaaagaaaatgaaggaaaaatctgaggagcaaggtgagggttcttcaggagtttcc
agtcttttactccaccctgagcctgttccaggtcctgaaaaagaaaatgtcaggactgtg
gtgaggacagtaactctctccaccaaacaaggagaagaacccttggttagattgggcctt
actgagacactggggaaacgaaaattttcgacaggcggtgacagtgatcctccattaaag
cgcagcctggcacagaggctagggaagaaagttgaagctccagaaactaacactgacgaa
acaccaaagaaagctcaagtttccaagtctcttaaggagcgattaggcatgtcagctgat
ccaaataatgaggacgcaacagataaagttaataaagttggtgagatccatgtgaagaca
ttagaagaaatgcttcttgaaagagccagtcagaaacatggggaatcgcaaactaaactc
aagacagaaggaccttcaaaaactgatgattctacttcaggagcaagaagctcctccact
atccgtatcaaaaccttctctgaggtcctggctgaagaagaacataggcagcaggaagca
gagagacaaaaaagcaaaaaggatacaacttgcatcaagctaaagactgatagtgaaatt
aaaaaaacagtagttttgccacccattgttgccagcaaaggacaatcagaggagcctgca
ggtaaaacaaagtccatgcaggaggtgcacatgaagacggtggaagaaattaaactggag
aaggcactgagggtgcagcagagctctgagagcagcaccagctccccgtctcaacatgag
gccactccaggggcaaggttgctactgcgaatcaccaaaagaacatggaggaaagaagag
aagaaacttcaggaaggaaatgaagttgattttctgagccgtgttagaatggaagctaca
gaggcttcagttgagaccacaggagttgacatcactaaaattcaagtcaagagatgtgag
atcatgagagagacgcgcatgcagaaacagcaggagagggaaaaatcagtcttgacacct
cttcagggagatgtagcctcttgcaatacccaagtggcagagaaaccagtgctcactgct
gtgccaggaatcacatggcacctgaccaagcagcttcccacaaagtcatcccagaaggtg
gaggtagaaacctcagggattgcagactcattattgaatgtgaaatggtcagcacagacc
ttggaaaaaaggggtgaagctaaacccacagtgaacgtgaagcaatctgtggttaaagtt
gtgtcatcccccaaattggccccaaaacgtaaggcagtggagatgcaccctgctgtcact
gccgctgtgaagccactcagctccagcagcgtcctacaggaacccccagccaaaaaggca
gctgtggatgctgttgtcctgcttgtctctgaggacaaatcagtcactgtgcctgaaaca
gaaaatcctagagacagtcttgtgctgcctctaacccagtcctcttcagattcctcaccc
ccggaggtgtctggcccttcctcatcccaaatgagcatgaaaactcgccgactcagctct
gcctcaacaggaaagcccccactctctgtggaggatgattttgagaaactaacatgggag
atttcaggaggcaaattggaagctgagattgacctggatcctgggaaagatgaagatgac
cttccgcttgagctatga

>gi568815592f:65202522_65404954|GENSCAN_predicted_peptide_4|173_aa
MAEVFLRENGRSNRSEGEKCQGVIDAYFFLAANCTEDATYVNDPEDNNSSCWFPHEGTKE
ICANGCSCLSEEDSQEYRYLCFLRWAGNMYLENTTDDQENECQHEAVCKDEINRPRRILN
TVIPHQIQQHIERFIQHDQGQLPGGVPVPMVLPLTEVIGACLPKALPPPANAD

>gi568815592f:65202522_65404954|GENSCAN_predicted_CDS_4|522_bp
atggcagaagtatttttgagggaaaatggaaggagtaatagatctgaaggcgaaaagtgc
caaggggttattgatgcctatttctttctggctgcaaactgcactgaagatgcaacctat
gtgaacgatcctgaagataataattcttcatgttggttcccacatgaaggcacaaaagag
atttgtgcaaatggatgcagttgtttgagtgaagaagacagtcaggaatatcggtatcta
tgttttctcagatgggctggcaacatgtatctggaaaatacaactgatgatcaagaaaat
gagtgtcaacatgaagctgtttgtaaagatgaaattaatagacccagaagaatcctcaac
acggtgataccacaccaaattcagcaacatatagaaaggtttatacagcatgaccagggc
cagctgcctgggggtgtacctgtgccaatggtgctgccactaactgaagttataggggca
tgtctgccgaaagctcttccaccacctgctaatgcagactga

>gi568815592f:65202522_65404954|GENSCAN_predicted_peptide_5|65_aa
MRKNQQNNAENSKNQNASSPPNDHDSSPARAQNWTENEFDELTEVGFRKWVITNSSELKE
AMQGS

>gi568815592f:65202522_65404954|GENSCAN_predicted_CDS_5|198_bp
atgaggaaaaaccagcaaaacaatgctgaaaattccaaaaaccagaatgcttcttctcct
ccaaatgatcacgactcctctccagcaagggcacaaaactggacagagaatgagtttgat
gaactgacagaagtaggcttcagaaagtgggtaataacaaactcctctgagctaaaggaa
gcaatgcaaggaagctaa

>gi568815592f:65202522_65404954|GENSCAN_predicted_peptide_6|219_aa
MIIPIDAEKAFDEIQHRFMLKTLNKLGVGGTYIKIIRAIYDKPIANIILNGQKLKAFPLK
TDTRQGCPLSPLLFNIMLEVLARAIKQEKEIKGIQIGREKVKLCLFAEYMIVYLENPIVS
VQQLLKLICNFSKVSGYNINVQKSQAFLYTINRQAERQIMSELPLRIATKRIKYLGIQFT
RDVKDLFKENYKPLLKEIRDDTNKWKKTFHAHGQEKSIL

>gi568815592f:65202522_65404954|GENSCAN_predicted_CDS_6|660_bp
atgattatcccaatagatgcagaaaaggccttcgacgaaattcaacaccgcttcatgcta
aaaactctcaataaactaggtgttggtggaacatatatcaaaataataagggctatttat
gacaaacccatagccaatatcatattgaatgggcaaaagctgaaagcattccctttgaaa
accgacacaagacaaggatgccctctctcaccactcctgttcaacataatgttggaagtt
ctggctagagcaatcaagcaagagaaagaaataaagggtattcaaataggaagagagaaa
gtcaagttgtgtctgtttgcagaatacatgattgtatatttagaaaaccccatcgtctca
gtccaacaactccttaagctgatatgcaacttcagcaaagtctcaggatacaatatcaat
gtgcaaaaatcacaagcattcctatacaccattaacagacaagcagagaggcaaattatg
agtgaactcccattgagaattgctacaaagagaataaaatacctaggaatacaatttaca
agagatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataagagat
gacacaaacaaatggaaaaaaacattccatgctcatggacaggaaaaatcaatattgtga

>gi568815592f:65202522_65404954|GENSCAN_predicted_peptide_7|376_aa
MGELPFTIASKRIKYLGFQLTRDVKDLFKENYKPPLKEIKEDTNKWKNIPCSWVGRINIM
KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLP
DFKLYYKAIVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPERNKQWGKDSLFN
KWCWGNWLAIWRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD
FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQATKWEKIFAIYSSDKGLISRI
YNELKQIYKKTTNNPIKKWEKDMKRHFSKEDICAGKRHMKKCSPSLAIREMQIKTTMRYH
LTPVRMAIIKKSGNNR

>gi568815592f:65202522_65404954|GENSCAN_predicted_CDS_7|1131_bp
atgggtgaactcccattcacaattgcttcaaagagaataaaatacctaggattccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaaccaccgctcaaggaaataaaa
gaggatacaaacaaatggaagaatattccatgctcatgggtaggaagaatcaatatcatg
aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca
atgactttctttacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacct
gacttcaaactatactacaaggctatagtaaccaaaacagcatggtactggtaccaaaac
agagatatagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaac
tatctgatctttgacaaacctgagagaaacaagcaatggggaaaggattccctatttaat
aaatggtgctggggaaactggctagccatatggagaaagctgaaactggatcccttcctt
acaccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaacc
ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaagca
acaaaatgggagaaaattttcgcaatctactcatctgacaaagggctaatatccagaatc
tacaatgaactcaaacaaatttacaagaaaacaacaaacaaccccatcaaaaagtgggag
aaggatatgaaaagacacttctcaaaagaagacatttgtgcaggcaaaagacacatgaaa
aaatgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccat
ctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaggtaa