GENSCAN 1.0    Date run: 5-Nov-116    Time: 17:31:31

Sequence gi568815586r:76758955_76966260 : 207306 bp : 38.50% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -   1912   1907    6                               1.05
 1.01 Sngl -   5914   5162  753  1  0   40   48   304 0.709  17.57
 1.00 Prom -  13238  13199   40                              -5.55

 2.00 Prom +  14926  14965   40                              -4.35
 2.01 Init +  22676  22729   54  1  0   87   56    37 0.250   1.74
 2.02 Intr +  38480  38583  104  1  2  101   70    53 0.245   2.85
 2.03 Intr +  46363  46485  123  1  0  131   55   171 0.842  16.88
 2.04 Intr +  50759  50903  145  2  1   83   98    53 0.830   5.16
 2.05 Intr +  56192  56256   65  1  2  104   77    44 0.965   1.50
 2.06 Intr +  56903  57065  163  2  1  117   93    42 0.989   6.66
 2.07 Intr +  63452  63577  126  1  0   50   92    94 0.925   5.96
 2.08 Term +  69871  70056  186  0  0   96   41    46 0.314  -2.79
 2.09 PlyA +  71091  71096    6                               1.05

 3.12 PlyA -  71168  71163    6                               1.05
 3.11 Term -  92222  91752  471  2  0   80   35   211 0.441   8.94
 3.10 Intr -  94652  94628   25  1  1   26  127     9 0.012  -4.29
 3.09 Intr - 100074 100002   73  1  1  130   63    88 0.175   8.05
 3.08 Intr - 100686 100593   94  0  1   91   93    48 0.934   4.22
 3.07 Intr - 101459 101330  130  1  1   46   98   130 0.948   9.58
 3.06 Intr - 104390 104222  169  0  1   86  105   256 0.994  25.18
 3.05 Intr - 107307 107195  113  2  2   81   92    45 0.619   3.30
 3.04 Intr - 111599 111539   61  0  1   14   98    72 0.062  -2.33
 3.03 Intr - 119497 119312  186  2  0   88   87    94 0.136   8.04
 3.02 Intr - 121224 120947  278  2  2   66   30   159 0.411   4.14
 3.01 Init - 121764 121712   53  0  2   76   36    57 0.257  -0.02
 3.00 Prom - 139244 139205   40                              -4.85

 4.05 PlyA - 140597 140592    6                               1.05
 4.04 Term - 160950 160699  252  0  0    8   36   280 0.793   9.45
 4.03 Intr - 191959 191848  112  0  1   94   58    46 0.125   1.66
 4.02 Intr - 203939 203860   80  2  2    0   75   138 0.151   1.13
 4.01 Intr - 204581 204225  357  2  0   39   42   237 0.162   8.73

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100074  99998   77  1  2  130   44    83 0.823   5.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:76758955_76966260|GENSCAN_predicted_peptide_1|250_aa
MLKKGRPTQGARGATHGITWKDITMPMKLSPHSGLKGSNRYRHSPWRRKKKSKPDALPPG
PKRLGARLGSKLRFKNHTRPLRLCSQTQRGDTAEPSTPPPLSSRHSQGLRPRARPGRARP
AISSPVIRSQPRSGLDSPPHPPQLRHVPLPHTRRPPPSASPGKKGWRAAGQGIHSARFLT
SGWRRGTQPASVSYSSGPSAILVLNPSSRCMLGESVSPSRGSGEAEGERGASASPAGGDA
GLLLLLLLPP

>gi568815586r:76758955_76966260|GENSCAN_predicted_CDS_1|753_bp
atgctcaaaaaaggacgcccgacacaaggtgcccgaggtgctacccatggcattacctgg
aaggacatcactatgcccatgaaactgtcccctcacagtggactaaaaggaagcaataga
taccgtcattcaccctggcgaaggaaaaagaaatcaaaaccagacgcgcttccacctggc
cccaagaggcttggagcaagactagggtctaagctgcggttcaaaaaccacaccaggccc
ctccgcctctgctctcaaacgcagcgcggcgacacagcagaaccatcaacacctccacct
ctgtcctccaggcacagccagggactgcgtcccagagcccgccccggtcgggcccgcccc
gctatctcctcccccgttatccgctcgcagccgcgctcaggcctggattcgcctccccac
cctccccaacttcggcacgtcccactaccccacacacgtcggccgccgccctcggcgagt
ccggggaagaaaggctggagggccgcagggcaaggaatccactcagcacggttcctcacc
tctgggtggagaaggggcacacagcccgcttcggtatcgtactcatccgggccgtccgcc
atcttggtgttaaatccctcctcccgctgcatgctgggagaaagcgtttcaccctcccgg
ggctcgggcgaggcggagggcgagcgcggggcgagcgcgagccccgccggaggcgacgcg
ggcctcctcctccttcttctcctgcccccgtga

>gi568815586r:76758955_76966260|GENSCAN_predicted_peptide_2|321_aa
MALPDVGPFTLDYSAFRTEIKPQSHYNHGYGEPLGRKTHIDDYSTWDIVKATQYGIYERC
RELVEAGYDVRQPDKENVTLLHWAAINNRIDLVKQGHLSMVVQLMKYGADPSLIDGEGCS
CIHLAAQFGHTSIVAYLIAKGQDVDMMDQNGMTPLMWAAYRTHSVDPTRLLLTFNVSVNL
GDKYHKNTALHWAVLAGNTTVISLLLEAGANVDAQNIKGESALDLAKQRKNVWMINHLQE
ARQAKGYDNPSFLRKLKADKIQGLHVKVCCRDILCNGEKNKHPHFKKWVKDTDTDRHFSK
DIQVAKREKMLDITNHQRNAN

>gi568815586r:76758955_76966260|GENSCAN_predicted_CDS_2|966_bp
atggccttgccagatgtgggccccttcaccttggactactcagccttcagaactgaaatc
aaaccccaaagccattataaccatggatatggtgaacctcttggacggaaaactcatatt
gatgattacagcacatgggacatagtcaaggctacacaatatggaatatatgaacgctgt
cgagaattggtggaagcaggttatgatgtacggcaaccggacaaagaaaatgttaccctc
ctccattgggctgccatcaataacagaatagatttagtcaaacaaggccatctatccatg
gttgtgcaactaatgaaatatggtgcagatccttcattaattgatggagaaggatgtagc
tgtattcatctggctgctcagttcggacatacctcaattgttgcttatctcatagcaaaa
ggacaggatgtagatatgatggatcagaatggaatgacgcctttaatgtgggcagcatat
agaacacatagtgtggatccaactagattgcttttaacattcaatgtttcagttaacctt
ggtgacaagtatcacaaaaacactgctctgcattgggcagtgctagcagggaataccaca
gtcattagccttcttctggaagctggagctaatgttgatgcccagaatatcaagggcgaa
tcagcgcttgatttggcaaaacagagaaaaaatgtgtggatgatcaaccacttacaagag
gcaaggcaagcaaaaggatatgacaatccgtccttccttagaaagctgaaagctgataag
attcaggggttacatgtgaaggtttgttgtagggatatattgtgtaatggagaaaagaac
aagcatcctcatttcaaaaaatgggtaaaggacacagacacagacagacatttctccaag
gacatacaagtggccaaacgtgaaaaaatgttggacatcactaatcatcagagaaatgca
aattaa

>gi568815586r:76758955_76966260|GENSCAN_predicted_peptide_3|550_aa
MAMEFRDNMSLTESKQLRGHLAMSRDFLIVTTSGAGCNWHLVGGGLLQCTGQPPTIKNYP
AQDVNGVEDEKLWTRGGNSIYGMRGFWSLTFLNSPLRKYQFSGIIASGRRGVEWGFYIPP
SPGYCSCRRTGLLASPLFGGVCPPSTEPLYVEGTAGNVNHLLLAFSRTFYDSDGVENTEM
WSPPSRDQKLIAQMPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDST
TVAIHDEEIYCKSCYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTS
KFAQKYGGAEKCSRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYC
KGCYAKNFGPKGFGYGQGAGALVHAQFFFPQNNTKWQCVLSMFTFVNKSVEKKTVKFTIS
PTHSMLLFKGCGFECEKDTDRHNFQAPLSNMLHRIRCRYTSWYPDPDIWSYSIVYCLVQS
TITGRKRPQHRNSKKSIIFLTHPWLKTAPQKENTSVSTNDLIHEFNIIFVLNIFFSKLKN
SGYLQEGTNS

>gi568815586r:76758955_76966260|GENSCAN_predicted_CDS_3|1653_bp
atggctatggaatttagggacaacatgagcttgacggaaagcaaacagctcagagggcat
ttggcaatgtccagagactttttgattgtcacaacgtcgggggcaggttgcaactggcat
ctagtgggtggaggcctcctacaatgcacaggacagccccctacaataaagaattatcca
gcccaagatgtcaatggtgtggaggatgagaaactctggaccagaggaggcaatagcatt
tatggcatgagggggttttggtcactgacttttttgaactccccactccgcaagtatcag
ttctccgggatcattgcttctggcaggagaggggtggaatggggcttctacatccccccg
agcccagggtactgctcctgccgccgtaccgggctcctggcatccccattgtttggtggt
gtctgcccacccagcaccgaacccctctatgttgaaggaacagcagggaatgtcaatcac
ttactcctagctttttccagaactttctacgattcagatggtgtggagaatacagagatg
tggtccccaccatcaagggaccagaaacttattgcacaaatgcctgtctggggaggtgga
aacaagtgtggggcctgtgggaggaccgtgtaccacgcagaagaggtgcagtgtgatggc
aggagcttccaccgctgctgctttctctgcatggtttgcaggaaaaatttagatagcaca
acagtggcaattcacgatgaagagatctactgcaaatcctgctacggaaagaagtatggg
ccaaaaggctacggttatggccagggcgctggcacgcttaacatggaccgtggcgagagg
ctgggcatcaaaccagagagtgttcagcctcacaggcctacaacaaatccaaacacttct
aaatttgctcagaaatatggaggtgctgagaagtgttccagatgtggggattctgtatat
gctgccgagaagataattggagctggaaagccctggcacaaaaactgtttccgatgtgca
aagtgtgggaagagtcttgaatcaacaactctgactgaaaaagaaggtgaaatctattgt
aaaggatgctatgcaaagaactttgggcccaagggatttggctatggccaaggagcaggg
gctcttgttcatgcccagtttttctttccacagaataacacgaaatggcagtgtgttctg
tccatgtttacttttgttaataaaagtgttgaaaaaaagactgtaaaattcaccattagc
cctacacatagcatgctcttgttcaaaggatgtggattcgagtgtgagaaagacacggac
agacacaattttcaggcaccactcagcaatatgcttcataggataagatgtcgctacacc
agctggtacccagatcctgatatttggtcatattctattgtatactgcctggtccagtcc
acgataacaggacgaaagaggccacagcatcgaaattcaaagaagtctataatatttctt
acacatccatggctaaaaacagcaccccaaaaagaaaacacgtcagtcagtacaaatgat
ttaatacatgaattcaatataattttcgtgttaaatatatttttttctaaacttaaaaac
tcaggttacttgcaagaaggaaccaattcataa

>gi568815586r:76758955_76966260|GENSCAN_predicted_peptide_4|266_aa
VGIILPDEQQMTCWLQTVRGESVFPFPGVFFHVEKSTTSVILLALQPNLITGVNTVTSLS
DKIGKNKRGILMSSSPFFLLVANLIARMSGRSCQKPHGTFHQPQRESLPFIQHCQWDKED
RLWLLTVHTRVPDADAKQHSLERMGREVIHSNFLLAMNICSDSVHQFNLSPNKVLSKPWA
IEQRLDAGPLCMQPVVVQQPIHKGLTTTSPAGSRHARQSLWEVQGACGDNRAGTWPKIDN
SNSDSSSRSSGAGEMAADTGERMGSV

>gi568815586r:76758955_76966260|GENSCAN_predicted_CDS_4|801_bp
gtgggtatcattctaccagatgagcaacaaatgacgtgctggctgcagactgttagaggg
gagagtgtctttccctttcctggcgttttctttcatgttgagaaaagcactaccagtgta
atactgctggctctccagcccaatctcatcacaggagtgaacacagtcacttctctatct
gataaaattgggaagaacaaacgaggtattctcatgtcatcttctccctttttcctcttg
gtagcaaacctgattgccaggatgagtggtaggagctgccaaaaacctcacgggaccttc
caccagccccagagagagagcctgcccttcatccagcactgccagtgggacaaagaggac
cgtctgtggctgctaactgttcacactcgggtccctgatgctgatgccaagcaacactca
ctcgagaggatgggcagggaagtaattcattcaaacttccttctggccatgaacatttgc
tctgattctgttcatcagtttaacctgtcacccaacaaggtcctgagcaaaccctgggcc
atagaacaaaggctagatgctggaccactttgcatgcaacctgttgtggtccagcagccc
atccacaaagggctcaccaccaccagcccggcgggcagcaggcatgcacggcagtccctc
tgggaagtccagggggcttgtggggacaaccgagctgggacctggcccaagattgacaat
agcaacagtgacagcagcagcagaagtagtggagcaggagaaatggcagcagatacagga
gagagaatgggctctgtataa