GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:52:45 Sequence gi568815586f:64355871_64601378 : 245508 bp : 41.54% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 738 752 15 2 0 72 111 14 0.665 2.39 1.02 Intr + 6378 6490 113 2 2 58 96 106 0.751 6.66 1.03 Intr + 7728 7815 88 0 1 30 44 83 0.119 -2.85 1.04 Term + 19672 20028 357 0 0 -30 40 294 0.693 6.33 1.05 PlyA + 20549 20554 6 1.05 2.03 PlyA - 21495 21490 6 1.05 2.02 Term - 24870 24853 18 0 0 88 50 45 0.103 -1.96 2.01 Init - 34695 34444 252 0 0 89 93 367 0.506 34.69 2.00 Prom - 35796 35757 40 -11.44 3.00 Prom + 36636 36675 40 -9.05 3.01 Init + 38485 38527 43 0 1 71 94 -13 0.204 -3.58 3.02 Term + 40868 41130 263 0 2 15 54 387 0.305 22.70 3.03 PlyA + 41161 41166 6 1.05 4.00 Prom + 43395 43434 40 -7.15 4.01 Init + 49065 49143 79 2 1 57 34 155 0.009 6.28 4.02 Intr + 54092 54225 134 0 2 79 110 94 0.024 10.14 4.03 Intr + 63006 63209 204 2 0 82 2 222 0.809 11.47 4.04 Intr + 64200 64384 185 2 2 67 97 86 0.877 5.06 4.05 Intr + 64483 64651 169 1 1 69 77 107 0.945 6.73 4.06 Intr + 65365 65601 237 0 0 38 111 143 0.958 8.49 4.07 Intr + 67312 67374 63 0 0 72 94 72 0.948 4.20 4.08 Intr + 68729 68853 125 1 2 31 92 108 0.999 3.96 4.09 Intr + 69168 69312 145 0 1 55 93 134 0.997 9.96 4.10 Intr + 69945 70039 95 2 2 39 108 34 0.780 -1.66 4.11 Intr + 72181 72250 70 1 1 113 95 21 0.782 3.57 4.12 Intr + 74179 74417 239 0 2 55 115 215 0.998 16.39 4.13 Intr + 75668 75953 286 2 1 48 81 243 0.996 16.02 4.14 Intr + 77544 77733 190 2 1 56 81 151 0.389 9.44 4.15 Intr + 78637 78753 117 2 0 72 78 66 0.900 3.52 4.16 Intr + 78924 79039 116 1 2 51 99 63 0.871 2.95 4.17 Intr + 83374 83445 72 0 0 48 110 57 0.741 2.58 4.18 Intr + 87343 87475 133 0 1 30 94 60 0.330 0.10 4.19 Intr + 96097 96317 221 2 2 -7 106 130 0.007 2.40 4.20 Intr + 104319 104459 141 2 0 75 107 73 0.984 7.53 4.21 Intr + 108464 108593 130 1 1 77 71 85 0.979 5.05 4.22 Intr + 111031 111212 182 2 2 27 111 220 0.942 16.87 4.23 Term + 113022 113216 195 2 0 95 45 158 0.983 8.63 4.24 PlyA + 113780 113785 6 1.05 5.00 Prom + 114793 114832 40 -3.55 5.01 Init + 118369 118520 152 0 2 27 95 90 0.437 3.16 5.02 Intr + 124142 124252 111 2 0 98 123 86 0.999 11.68 5.03 Intr + 125972 126151 180 2 0 55 102 95 0.968 5.66 5.04 Intr + 132617 132718 102 2 0 75 65 67 0.616 1.47 5.05 Intr + 134171 134249 79 2 1 86 89 47 0.733 3.23 5.06 Intr + 139613 139734 122 1 2 33 56 110 0.770 0.67 5.07 Intr + 141079 141180 102 1 0 82 113 68 0.985 7.07 5.08 Intr + 141293 141393 101 2 2 88 60 87 0.057 4.73 5.09 Intr + 151304 151459 156 0 0 76 98 55 0.075 4.36 5.10 Intr + 156166 156342 177 2 0 65 49 111 0.319 3.87 5.11 Term + 162472 162632 161 2 2 23 49 166 0.220 3.32 5.12 PlyA + 163894 163899 6 1.05 6.05 PlyA - 164492 164487 6 1.05 6.04 Term - 165163 165044 120 1 0 60 42 138 0.181 3.89 6.03 Intr - 166122 166032 91 2 1 36 81 57 0.077 -1.12 6.02 Intr - 177435 177180 256 1 1 7 84 177 0.176 4.78 6.01 Init - 177528 177462 67 0 1 103 91 62 0.604 9.29 6.00 Prom - 189693 189654 40 -4.45 7.00 Prom + 197410 197449 40 -5.25 7.01 Init + 200150 200242 93 1 0 46 100 77 0.285 5.13 7.02 Intr + 201785 201898 114 1 0 83 40 84 0.487 2.82 7.03 Term + 203106 203285 180 2 0 81 40 228 0.555 13.93 7.04 PlyA + 205473 205478 6 1.05 8.02 PlyA - 205519 205514 6 1.05 8.01 Term - 211972 211755 218 2 2 34 41 190 0.149 5.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 54166 54225 60 0 0 82 110 75 0.918 10.40 S.002 Term - 96159 96044 116 1 2 10 55 146 0.814 1.25 S.003 Init + 100001 100087 87 1 0 98 96 18 0.935 4.30 S.004 Intr + 141293 141389 97 2 1 88 92 70 0.925 6.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:64355871_64601378|GENSCAN_predicted_peptide_1|190_aa MVTEQYFVIFGLRLALEGTVTGKWSRYRTQERVHGSGTRKNSGTYYVLNAVPPIDDHGNS QSSHVQIFLRKKKFCKGDQSLEDEEHRGWPLEADNNKLRAIIKADPLTTTQKVLEELNIN HSMVIQHLKQIRNVKKLDKCMSHELSENLKNIVLKCHLLLFYATTNSFSDWIVMCDEKWI LYYNQRQPAQ >gi568815586f:64355871_64601378|GENSCAN_predicted_CDS_1|573_bp atggttacagaacagtattttgtaatatttggcctaagacttgctttggaaggaactgtt accggaaagtggtctcgatacagaacccaagagagggttcatggatcaggcacaagaaag aattcaggcacctactatgttctcaatgccgtgccacctatagatgatcatgggaatagc cagagtagtcacgtacaaatctttttgcggaaaaagaagttctgcaaaggagaccagagc cttgaagatgaggagcatagaggctggccattggaagctgacaataacaaattaagagca atcatcaaagctgatcctcttacaactacacaaaaagttttggaagaactcaacatcaac cattctatggtcattcagcatctgaagcaaattagaaatgtgaaaaagcttgataaatgc atgtctcatgagttgagcgaaaatttaaaaaatattgttctaaagtgtcatcttctctta ttttatgcaacaacgaacagtttttctgattggattgtgatgtgcgatgaaaagtggatt ttatactacaatcagcgacaaccagctcagtag >gi568815586f:64355871_64601378|GENSCAN_predicted_peptide_2|89_aa MASPLPSGFPARRNSRLDVFLRRHLPPEVYDAVRAYEPCIVVSNSENHILKYVVLSDRLV YLTENPPKSIRRVVALRDVVAIDLNTRDP >gi568815586f:64355871_64601378|GENSCAN_predicted_CDS_2|270_bp atggccagccccttgccgtccggcttccccgcgcgcaggaacagccgcctggatgtgttc ctgcggcggcatctgccgcccgaggtctacgacgcggtccgcgcctacgagccatgcatc gtggtgtccaactctgagaaccacatcctcaagtatgtggtgctaagcgaccggctcgtc tacctaaccgagaacccgcccaagtccatccggcgggtagtggctctgcgggacgtcgtg gccattgacctgaacactcgagatccttag >gi568815586f:64355871_64601378|GENSCAN_predicted_peptide_3|101_aa MVAHTCNPGTLGGPDPVWHGDQDGDAEDHCHPEKLSPLHPQVQLLREVPQELVHAPVPLL QDVQISDIVTLDKCQSLNKTVYFNVLKVTKVADIKKQFQKF >gi568815586f:64355871_64601378|GENSCAN_predicted_CDS_3|306_bp atggtggctcacacctgtaatcccggcactctgggaggcccagatcctgtatggcatggt gaccaagatggagatgcagaggaccactgtcatccagagaaactatctccactacatccg caagtacaattgcttcgagaagtaccacaagaacttgtccatgcacctgttcctctgctt caggatgtccagatcagcgacattgtcacgttggataagtgccagtccctgaacaagacg gtgtacttcaatgtgctcaaggtcaccaaggtcgcagacatcaagaagcaattccagaag ttctga >gi568815586f:64355871_64601378|GENSCAN_predicted_peptide_4|1175_aa MAVHPLAGRTGWGWVRAGAAAEERVPNVYKFFCGIRGHAYYNQKTTSITARMDEQALLGL NPNADSDFRQRMLNPQPEKTFIRNKAAQVFALLFVTEYLTKWPKFFFDILSVVDLNPRGV DLYLRILMAIDSELVDRDVEARRNTLIKDTMREQCIPNLVESWYQILQNYQFTNSEVTCQ CLEVVGAYVSWIDLSLIANDRFINMLLGHMSIEVLREEACDCLFEVVNKGMDPVDKMKLV ESLCQVLQSAGFFSIDQEEDVDFLARFSKLVNGMGQSLIVSWSKLIKNGDIKNAQEALQA IETKVALMLQLLIHEDDDISSNIIGFCYDYLHILKQAIMLAVMKKLTYDEEYNFENEGED EAMFVEYRKQLKLLLDRLAQVSPELLLASVRRVFSSTLQNWQTTRFMEVEVAIRLLYMLA EALPVSHGAHFSGDVSKASALQDMMRTMAFLDHRGLRHSSAKVRSRTAYLFSRFVKSLNK QMNPFIEDILNRIQDLLELSPPENGHQSLLSSDDQLFIYETAGVLIVNSEYPAERKQALM RNLLTPLMEKFKILLEKLMLAQDEERQASLADCLNHAVGFASRTSKAFSNKQTVKQCGCS EVYLDCLQTFLPALSCPLQKDILRSGVRTFLHRMIICLEEEVLPFIPSASEHMLKDCEAK DLQEFIPLINQITAKFKIQVSPFLQQMFMPLLHAIFEVLLRPAEENDQSAALEKQMLRRS YFAFLQTVTGSGMSEVIANQGAENVERVLVTVIQGAVEYPDPIAQKTCFIILSKLVELWG GKDGPVGFADFVYKHIVPACFLAPLKQTFDLADAQTVLGPECVQYLQQEYLPSLQVAPEI IQLPMCVIIHSYDLHQVPEHYYITLVWVLQKGKGPFIGGGSQTPKRDIPLITVAAEADSA VAAAAVVTTRRAIGVRTRTRTGAPAVGHVASGQGLRSRKCPESRGGRGSPPAVARRRPGW KTGDLFAIKVFNNISFLRPVDVQMREFEVLKKLNHKNIVKLFAIEEETTTRHKVLIMEFC PCGSLYTVLEEPSNAYGLPESEFLIVLRDVVGGMNHLRENGIVHRDIKPGNIMRVIGEDG QSVYKLTDFGAARELEDDEQFVSLYGTEEYLKRNSTTSPHTPRQFPPSLLAENLLHQSSL LHQSEPYCISLASSPLQPVLLRAQDAYGVLWLSEL >gi568815586f:64355871_64601378|GENSCAN_predicted_CDS_4|3528_bp atggcggtgcaccccctggctgggcggaccgggtgggggtgggtacgagccggggccgcc gccgaggagcgcgtcccgaatgtttataaattcttctgtgggatcagagggcacgcctat tacaaccagaaaactacaagtataacagcgaggatggatgaacaggctctattagggcta aatccaaatgctgattcagactttagacaaaggatgctgaatccccaaccagagaagacc tttatacgaaataaagccgcccaagtcttcgccttgctttttgttacagagtatctcact aagtggcccaagtttttttttgacattctctcagtagtggacctaaatccaaggggagta gatctctacctgcgaatcctcatggctattgattcagagttggtggatcgtgatgtggag gctcgtaggaatactctcataaaagataccatgagggaacagtgcattccaaatctggtg gaatcatggtaccaaatattacaaaattatcagtttactaattctgaagtgacgtgtcag tgccttgaagtagttggggcttatgtctcttggatagacttatcccttatagccaatgat aggtttataaatatgctgctaggtcatatgtcaatagaagttctacgggaagaagcatgt gactgtttatttgaagttgtaaataaaggaatggaccctgttgataaaatgaaactagtg gaatctttgtgtcaagtattacagtctgctgggtttttcagcattgaccaggaagaagat gttgacttcctggccagattttctaagttggtaaatggaatgggacagtcattgatagtt agttggagtaaattaattaagaatggggatattaagaatgctcaagaggcactacaagct attgaaacaaaagtggcactgatgttgcagctactaattcatgaggatgatgatatttct tctaatattattggattttgttacgattatcttcatattttgaaacaggcaatcatgttg gccgttatgaaaaaattgacttacgatgaagaatataactttgaaaatgagggtgaagat gaagccatgtttgtagaatatagaaaacaactgaagttactgttggacaggcttgctcaa gtttcaccagagttactactggcctctgttcgcagagtttttagttctacactgcagaat tggcagactacacggtttatggaagttgaagtagcaataagattgctgtatatgttggca gaagctcttccagtatctcatggtgctcacttctcaggtgatgtttcaaaagctagtgct ttgcaggatatgatgcgaactatggctttcttagatcacagaggtctgcggcattccagt gcaaaagttcggagcaggacggcttacctgttttctagatttgtcaaatctctcaataag caaatgaatcctttcattgaggatattttgaatagaatacaagatttattagagctttct ccacctgagaatggccaccagtccttactgagcagcgatgatcaactttttatttatgag acagctggagtgctgattgttaatagtgaatatccggcagaaaggaaacaagccttaatg aggaatctgttgactccactaatggagaagtttaaaattctgttagaaaagttgatgctg gcacaagatgaagaaaggcaagcctctctagcagactgtcttaaccatgctgttggattt gcaagtcgaaccagtaaagctttcagcaacaaacagactgtgaaacaatgtggctgttcc gaagtttatctggactgtttacagacattcttgccagccctcagttgtcccttacaaaag gatattctcagaagtggagtccgtactttccttcatcgaatgattatttgcctggaggaa gaagttcttccgttcattccatctgcttcagaacatatgctcaaagattgtgaagcaaaa gatctccaggagttcattcctcttatcaaccagattacggccaaattcaagatacaggta tccccgtttttacaacagatgttcatgcccctgcttcatgcaatttttgaagtgctgctc cggccagcagaagaaaatgaccagtctgctgctttagagaagcagatgttgcggaggagt tactttgctttcctgcaaacagtcacaggcagtgggatgagcgaagttatagcaaatcaa ggtgcagagaatgtagaaagagtgttggttactgttatccaaggagcagttgaatatcca gatccaattgcacagaaaacatgttttatcatcctctcaaagttggtagaactctgggga ggtaaagatggaccagtgggatttgctgattttgtttataagcacattgtccccgcatgt ttcctagcacctttaaaacaaacctttgacctggcagatgcacaaacagtattgggccca gaatgtgttcagtatcttcaacaagaatacctgccctccttgcaagtagctccagaaata attcagctccccatgtgtgttataattcactcttatgaccttcaccaagtccctgaacac tactacatcaccctggtctgggtattgcagaagggtaagggccccttcataggtggtggc agtcaaacaccaaagagagacatccccctcattaccgtggccgcggaagccgactcggca gttgccgccgcggctgtggtgactaccagacgggccataggcgtgcgcacgcgcacccgc accggcgcgccggccgtcggtcacgtggcctccggccagggcttgcgaagccggaagtgt cctgagtctcgaggaggccgcgggagcccgccggcggtggcgcggcggagacccggctgg aaaactggtgatttatttgctatcaaagtatttaataacataagcttccttcgtccagtg gatgttcaaatgagagaatttgaagtgttgaaaaaactcaatcacaaaaatattgtcaaa ttatttgctattgaagaggagacaacaacaagacataaagtacttattatggaattttgt ccatgtgggagtttatacactgttttagaagaaccttctaatgcctatggactaccagaa tctgaattcttaattgttttgcgagatgtggtgggtggaatgaatcatctacgagagaat ggtatagtgcaccgtgatatcaagccaggaaatatcatgcgtgttataggggaagatgga cagtctgtgtacaaactcacagattttggtgcagctagagaattagaagatgatgagcag tttgtttctctgtatggcacagaagaatatttgaaaagaaactcaaccaccagtcctcat actcctcggcagttccctccatctctgcttgctgagaacctactgcatcagtctagccta ctgcatcagtctgaaccctactgcatcagtctagcctcctctcccctacaacctgtgttg ttgagggcccaggatgcctatggggtcctctggctgtctgagttgtga >gi568815586f:64355871_64601378|GENSCAN_predicted_peptide_5|480_aa MYERAVLRKDHQKKYGATVDLWSIGVTFYHAATGSLPFRPFEGPRRNKEVMYKIITGKPS GAISGVQKAENGPIDWSGDMPVSCSLSRGLQVLLTPVLANILEADQEKCWGFDQFFAETS DILHRMVIHVFSLQQMTAHKIYIHSYNTELIKDDYNETVHKKTEVVITLDFCIRNIEKTV KVYEKLMKINLEAAELGEISDIHTKLLRLSSSQGTIETSLQDIDSRLSPGGSLADAWAHQ EGTHPKDRKQKLYYHATKAMTHFTDECVKKYEAFLNKSEEWIRKMLHLRKQLLSLTNQCF DIEEEVSKYQEYTNEVASHSGTLLVLAMAIRSDFHETFRRIARLFYWFPKSLTCSKNRKS KLLWSQGGGKKRKEGGCLLDSTPLVSSIVWLSQGHLCRSEALRERTIMEELAEEHSSMDA LVSCLLEDCIGSMVLTSAPVKARKLPVMVEGKEEDSVSHGKREQERQKEVPNSFKQPDLT >gi568815586f:64355871_64601378|GENSCAN_predicted_CDS_5|1443_bp atgtatgagagagcagtgctaagaaaagatcatcagaagaaatatggagcaacagttgat ctttggagcattggggtaacattttaccatgcagctactggatcactgccatttagaccc tttgaagggcctcgtaggaataaagaagtgatgtataaaataattacaggaaagccttct ggtgcaatatctggagtacagaaagcagaaaatggaccaattgactggagtggagacatg cctgtttcttgcagtctttctcggggtcttcaggttctacttacccctgttcttgcaaac atccttgaagcagatcaggaaaagtgttggggttttgaccagttttttgcagaaactagt gatatacttcaccgaatggtaattcatgttttttcgctacaacaaatgacagctcataag atttatattcatagctataatactgaattaattaaagatgattacaatgaaactgttcac aaaaagacagaagttgtgatcacattggatttctgtatcagaaacattgaaaaaactgtg aaagtatatgaaaagttgatgaagatcaacctggaagcggcagagttaggtgaaatttca gacatacacaccaaattgttgagactttccagttctcagggaacaatagaaaccagtctt caggatatcgacagcagattatctccaggtggatcactggcagacgcatgggcacatcaa gaaggcactcatccgaaagacagaaagcaaaaactgtattaccatgccacaaaagctatg acgcactttacagatgaatgtgttaaaaagtatgaggcatttttgaataagtcagaagaa tggataagaaagatgcttcatcttaggaaacagttattatcgctgactaatcagtgtttt gatattgaagaagaagtatcaaaatatcaagaatatactaatgaggtagcgagtcattct ggaacactcctggttctcgctatggccataagaagtgactttcatgagaccttcagaaga atagcccggctcttttactggttccccaaatcactgacctgctcaaagaatcggaaatcc aagctgctctggagccaaggtggggggaagaagagaaaggaggggggctgcttattagac agcaccccacttgtctccagcattgtatggctttctcaaggacacctatgtagatcagaa gccctgagagagagaaccatcatggaagaactggcagaggaacactctagcatggatgct ctggtgtcttgtctattggaggactgtataggaagcatggtgctgacatctgctcctgtg aaggccaggaagcttccagtcatggtggaaggcaaagaagaagacagtgtatcacacggc aagagggagcaagaaagacagaaggaagttccaaactcttttaaacaaccagatctcaca tga >gi568815586f:64355871_64601378|GENSCAN_predicted_peptide_6|177_aa MGTYSVQQPWWPQTIRSKGNLPVDFARELINRSSHLRARPCCAAEYRETADGNRLSRPGL ADLRDSEPTREQALGLSSLPGFMYFSKWSPACSAALLTGTVKCSHLCLLPSPIEMNQPSQ QKGCVPSIWEKGPQRVLKMTRQPFPAQCRKCGLVHSPSLSVDLFHQKATSGALRDAV >gi568815586f:64355871_64601378|GENSCAN_predicted_CDS_6|534_bp atggggacttattctgtccagcagccctggtggcctcagaccatccgaagcaagggtaac ctaccagttgattttgcccgagagcttataaatcgcagttcccatcttcgagcccggccc tgctgtgcagcagagtacagagagacagctgatggtaacaggctgagccggccaggcctg gctgacctgcgtgactcagagcccaccagggagcaggccctcggcctctcttcactccct ggcttcatgtatttttccaagtggtcaccagcatgctcagcagctctgctcacagggacc gtcaagtgttcacacctgtgtctactcccttctccaattgagatgaatcaaccaagccag cagaagggctgtgttcccagcatttgggagaagggaccccagagggttctgaagatgacc cgacaaccatttccagctcagtgccgtaaatgtggcttagtccattcaccttctctctct gtagatctctttcatcagaaggctacatctggagcccttcgggatgctgtttag >gi568815586f:64355871_64601378|GENSCAN_predicted_peptide_7|128_aa MSVMDKGEGAREREKPSHSNTELTPVKREGKIHLFFLRGTQHHRDISDLMHIPHPEDGTR YLHGVASELLPSGPALWLQPDLASHTGDCGCLAPDTLEPHIPMDINESSSATVTSAIGAD LRRRATTA >gi568815586f:64355871_64601378|GENSCAN_predicted_CDS_7|387_bp atgtctgtgatggataaaggggagggagccagagaaagagaaaagccttcccacagcaac acagagctgacacctgtgaagagagaggggaagatccatttattctttctacggggaaca cagcaccacagagacatctctgatttaatgcatattcctcatcctgaagatggcactcga tatttgcatggtgttgcctctgagctgctcccttcaggtccagccctctggctgcagcct gaccttgctagtcatactggtgattgtggctgcctggctcctgacactctggagcctcat attcccatggatattaatgagtccagctctgctactgtcacttctgccatcggcgctgac ctgcggaggagggccaccactgcttag >gi568815586f:64355871_64601378|GENSCAN_predicted_peptide_8|72_aa XGTDVDTPQPSSARSAEAALVCGATSPRPVAPSVRLVRLLSGNRTLAVAVPKCIIQGMST VVGNPKILVFML >gi568815586f:64355871_64601378|GENSCAN_predicted_CDS_8|219_bp ncagggacagatgtggacaccccacagcccagcagtgcacgctctgcagaggcagctctg gtgtgtggagccacatctcccaggccagtagctccatcagtgcgacttgtgagactctta tcagggaacaggacgttagcagttgcagttccaaagtgtattattcagggcatgtcaact gtcgtagggaatcccaaaatcctagtcttcatgctgtag