GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:04:43

Sequence gi568815592f:110858770_111067787 : 209018 bp : 43.20% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +     10    123  114  0  0   74   80   132 0.998  11.21
 1.02 Intr +    203    333  131  1  2   83   17   167 0.547   8.59
 1.03 Intr +  16327  16590  264  1  0   41   91   277 0.501  19.93
 1.04 Intr +  30088  30214  127  1  1   83  108   -21 0.315   0.08
 1.05 Intr +  31485  31587  103  2  1   39  108    55 0.452   2.35
 1.06 Intr +  33556  33674  119  2  2   32   60   109 0.530   2.68
 1.07 Intr +  33966  34058   93  2  0  117   76   111 0.999  12.96
 1.08 Intr +  34141  34296  156  0  0   82   91   108 0.999  10.71
 1.09 Intr +  46374  46532  159  2  0   77  100    44 0.193   4.68
 1.10 Term +  46617  46724  108  2  0   65   43    92 0.232   0.91
 1.11 PlyA +  46790  46795    6                               1.05

 2.00 Prom +  47966  48005   40                              -4.96
 2.01 Sngl +  49889  50551  663  1  0   60   42   224 0.973  11.18
 2.02 PlyA +  50815  50820    6                              -0.45

 3.00 Prom +  51187  51226   40                              -2.46
 3.01 Init +  58633  58946  314  0  2   72   74   182 0.375  12.00
 3.02 Term +  62490  62571   82  0  1    3   44   140 0.215  -1.73
 3.03 PlyA +  63074  63079    6                               1.05

 4.00 Prom +  64163  64202   40                              -5.16
 4.01 Sngl +  64797  65198  402  2  0   77   43   444 0.999  35.07
 4.02 PlyA +  68203  68208    6                               1.05

 5.00 Prom +  80799  80838   40                              -4.76
 5.01 Init + 100001 100060   60  1  0  115   60   120 0.651  13.15
 5.02 Intr + 100403 100483   81  1  0   74   97    30 0.701   2.33
 5.03 Intr + 101645 101708   64  1  1  113   97   100 0.999  11.69
 5.04 Intr + 101803 101847   45  2  0  103  115    26 0.969   5.18
 5.05 Intr + 103623 103736  114  1  0   92   78    41 0.936   3.92
 5.06 Intr + 108741 108986  246  1  0   68   18   207 0.071   9.13
 5.07 Intr + 123211 123280   70  2  1   -4   41   149 0.003  -0.76
 5.08 Intr + 126237 126369  133  0  1   62  103    88 0.826   8.35
 5.09 Intr + 138414 138495   82  2  1   61   97     0 0.013  -2.59
 5.10 Intr + 140942 141018   77  0  2   80  116    17 0.054   2.93
 5.11 Intr + 149269 149368  100  0  1  113  111    87 0.951  13.18
 5.12 Intr + 156985 157087  103  2  1   67   97    43 0.456   2.33
 5.13 Intr + 169556 169641   86  2  2   89   47    93 0.164   4.76
 5.14 Term + 172709 172731   23  0  2   68   45    14 0.055  -6.43
 5.15 PlyA + 172794 172799    6                               1.05

 6.03 PlyA - 173054 173049    6                               1.05
 6.02 Term - 188679 188099  581  1  2   76   39   688 0.672  57.35
 6.01 Init - 202708 202699   10  1  1   77   93     4 0.068   0.57
 6.00 Prom - 204451 204412   40                              -2.56

 7.02 PlyA - 204732 204727    6                              -0.45
 7.01 Sngl - 206231 205263  969  2  0   42   39   248 0.968  12.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 108741 109021  281  1  2   68   43   197 0.917   8.81
S.002 Init - 139587 139442  146  0  2   96   97    79 0.831   9.19

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:110858770_111067787|GENSCAN_predicted_peptide_1|457_aa
MTAYSTRRHLYDPKNHILPPMDNLTISLQMGTNKCASQMGYTQGAKQSGQVFGLGRQIHD
PKFCPEGTVADRAPSGAQARGSLTVMEAAHFFEGTEKLLEVWFSRQQPDANQGSGDLRTI
PRWVPGALADIRAWGLSPPPRHQPRVEPEFPQLSVGGKSAAWGRFGGLGSESSMFVSKRR
FILKTCGTTLLLKALVPLLKLARDYSGFDSIQSFFYSRKNFMKPSHQGYPHRNFQEEIEF
LNAIFPKSRVISQPDQTLEILMSELDPAVMDQFYMKDGVTAKDVTRESGIRDLIPGSVID
ATMFNPCGYSMNGMKSDGTYWTIHITPEPEFSYVSFETNLSQTSYDDLIRKVVEVFKPGK
FVTTLFVNQPPLVIPRQTGSGVDLQQTPTDLQLRVLTVRRNTNKQDIYTKTLSARHHHQR
PKERNSSPAMEQSWMENDFDELREEGFRRSVITTSPS

>gi568815592f:110858770_111067787|GENSCAN_predicted_CDS_1|1374_bp
atgaccgcgtacagcacaaggaggcatctctacgaccccaagaaccacatcctgcccccc
atggacaacttaaccatcagcctccagatgggtacaaacaagtgtgccagccagatgggc
tacacgcagggcgccaagcaaagcggccaggtctttggcctgggccggcagatacatgac
cccaagttctgcccagaaggcacagtggccgacagggctccctcgggcgcgcaggctcgg
gggagtctcacggtgatggaagctgcacattttttcgaagggaccgagaagctgctggag
gtttggttctcccggcagcagcccgacgcaaaccaaggatctggggatcttcgcactatc
ccaaggtgggtccccggggcgctcgctgacatccgggcctgggggctgtcgccgccgccg
aggcaccagccacgggtggagcccgagttccctcagctttcagttgggggcaagtctgcg
gcctggggtcgcttcggcggccttggaagtgagagtagcatgtttgtctccaagagacgt
ttcattttgaagacatgtggtaccaccctcttgctgaaagcactggttcccctgttgaag
cttgctagggattacagtgggtttgactcaattcaaagcttcttttattctcgtaagaat
ttcatgaagccttctcaccaagggtacccacaccggaatttccaggaagaaatagagttt
cttaatgcaattttcccaaagagtcgggtaatcagtcagccagatcaaaccttggaaatt
ctgatgagtgagcttgacccagcagttatggaccagttctacatgaaagatggtgttact
gcaaaggatgtcactcgtgagagtggaattcgtgacctgataccaggttctgtcattgat
gccacaatgttcaatccttgtgggtattcgatgaatggaatgaaatcggatggaacttat
tggactattcacatcactccagaaccagaattttcttatgttagctttgaaacaaactta
agtcagacctcctatgatgacctgatcaggaaagttgtagaagtcttcaagccaggaaaa
tttgtgaccaccttgtttgttaatcagcctccgctggtgatacccaggcaaacagggtct
ggagtggacctccagcaaactccaacagacttgcagctgagggtcctgactgttagaagg
aacactaacaaacaggacatctacaccaaaaccctatctgcacgtcaccatcatcaaaga
ccaaaggaacgcaactcctcaccagcaatggaacaaagctggatggagaatgactttgat
gagctgagagaagaaggcttcagacgatcggtaataacaacttctccgagctaa

>gi568815592f:110858770_111067787|GENSCAN_predicted_peptide_2|220_aa
MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV
KVAILPKVLYRFNAIPIKLPMTFFMELEKTTLKFIWNKKRACIAKRVLSQKNKAGGITLH
DFKLYYKATVTKTAWYSYQNRDIDQWNTTEPSEIIPHIYNHLIFDKPDKNKKWRKDSLFN
KWCWENWLAICRELKLDPFLTPYTKINSRQIKDLNVRPKP

>gi568815592f:110858770_111067787|GENSCAN_predicted_CDS_2|663_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtg
aaagtggccatactgcccaaggtactttatagattcaatgccatccccatcaagctacca
atgactttcttcatggaattggaaaaaactactttaaagttcatatggaacaaaaaaaga
gcctgcattgccaagagagtcctgagtcaaaagaacaaagctggaggcatcacgctacat
gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactcgtaccaaaac
agagatatagaccaatggaacacaacagagccctcagaaataataccacacatctacaac
catctgatctttgacaaacctgacaaaaacaagaaatggagaaaggattccctatttaat
aaatggtgctgggaaaactggctagccatatgtagagagctgaaactggatcccttcctt
acaccttacacaaaaattaattcaagacagattaaagacttaaatgttagaccaaaacca
taa

>gi568815592f:110858770_111067787|GENSCAN_predicted_peptide_3|131_aa
MGQVWALIHSTLEPFHTNDEEEGKYNEVAEEVTEQVCLPAKAKAAKEEEVHPYPSASSHY
FEEKEWPDPPDLSFLEDTGQKVVAPVTVRAAPRVTAFSSTQAGIQCYQLLNNIYRNQLQR
QKQNNWFGGEI

>gi568815592f:110858770_111067787|GENSCAN_predicted_CDS_3|396_bp
atgggacaagtgtgggctctgattcattccaccttggaaccttttcacactaatgatgag
gaggaaggaaagtataacgaagtagcagaagaggtgacagagcaggtttgtttgccagct
aaagctaaagcagcaaaagaggaagaggttcatccctacccttctgcatcctctcattat
tttgaagaaaaagagtggcctgaccctccagatctttcttttctggaggacactgggcaa
aaagtagttgccccagtgactgttcgagcagcacctcgagtgaccgctttcagttctact
caggcaggaatccaatgctatcagctgctgaacaacatctacagaaaccagctgcaaaga
cagaagcagaacaactggtttggtggagagatctga

>gi568815592f:110858770_111067787|GENSCAN_predicted_peptide_4|133_aa
MIPKGGRKGGHKGWARQYTSPEEIDTQLQSEKQKAREEEEQKEGGDGAAGDPKKEKKSLD
SDESEDEDDYQQRRKGVEGLIDIENPNRVAQTTKKVTQLDLDGPKELLRREREEIEKQKA
KERYMKMHLAGKR

>gi568815592f:110858770_111067787|GENSCAN_predicted_CDS_4|402_bp
atgatacctaaaggagggagaaagggaggccacaaaggctgggcgaggcaatatacaagc
cctgaggagatcgacacgcagctgcagtctgagaagcagaaggccagggaagaagaggag
caaaaagaaggtggagacggggctgcaggtgaccccaaaaaggagaagaaatctctagac
tcagatgagagtgaagatgaagatgactaccagcaaaggcgcaaaggtgttgaagggctc
attgacatcgagaatcccaaccgggtggcacagacaaccaaaaaggtcacacaactggat
ctggatgggccaaaggagcttttgaggagagaacgagaggagattgagaagcagaaggca
aaagagcgttacatgaaaatgcacttggctgggaagagatag

>gi568815592f:110858770_111067787|GENSCAN_predicted_peptide_5|427_aa
MAAAADERSPEDGEDEEEEVEQLVLVELSGIIDSDFLSKCENKCKVLGIDTERPILQVDS
CVFAGEYEDTLGTCVIFEENVEHADTEGNNKTVLKYKCHTMKKLSMTRTLLTEKKEGEEN
IGGVEWLQIKDNDFSYRPNMICNFLHENEDEEVVASAPDKSLELEEEEIQMNDSSNLSCE
QEKPMHLEIEDSGPLIDIPSETEDALYEAVPGSGVTRSFRFRLFRRTKPKTKRAKRFLEK
REPKLNENIKNAMLIKGGNANATVTKVLKDVEFFSKKSDCSLFMFGSHNKKRPNNLVIGR
MYDYHVLDMIELGIENFVSLKDIKNSKCPEGTKPMLIFAGDDFDVTEDYRRLKSLLIDFF
RGPTVSNIRLAGLEYVLHFTALNGKIYFRSYNYLRLKMEGFKMEEVKTTKPIAKIRTTKE
DTFPVER

>gi568815592f:110858770_111067787|GENSCAN_predicted_CDS_5|1284_bp
atggcggcggcggcggacgagcggagtccagaggacggagaagacgaggaagaggaggta
gagcagttggttctggtggaattatcaggaattattgattcagacttcctctcaaaatgt
gaaaataaatgcaaggttttgggcattgacactgagaggcccattctgcaagtggacagc
tgtgtctttgctggggagtatgaagacactctagggacctgtgttatatttgaagaaaat
gttgaacatgctgatacagaaggcaataataaaacagtgctaaaatataaatgccataca
atgaagaagctcagcatgacaagaactctcctgacagagaagaaggaaggagaagaaaac
ataggtggggtggaatggctgcaaataaaggataatgatttctcctatcgacccaacatg
atttgtaactttctacatgaaaatgaagacgaagaagtggtagcttcagccccagataaa
tctttggaattggaagaggaagagattcaaatgaacgacagttcaaacctgagttgtgaa
caggagaaaccaatgcacttggaaatagaagattctggtcctcttattgatataccttct
gagacagaagacgccctctacgaggccgtccccggaagtggcgttacgcgcagcttccgg
ttccgcctgttccggcgcacaaagcccaaaacgaaaagagccaagagattccttgagaag
agagaaccgaaactcaatgaaaatattaaaaatgccatgctgattaaagggggaaatgca
aatgcaacagtgacaaaagtacttaaagatgtggaattcttttcaaagaagtcagattgt
tctttattcatgtttggctcccataataagaagcggccaaataatctagtaataggtcgt
atgtatgactaccatgtgctggatatgattgaattaggtattgagaattttgtctctcta
aaagacattaagaacagtaaatgtcctgagggaacaaaacccatgctgatatttgctggc
gatgatttcgatgtaacagaagattatagaagactaaaaagtcttcttattgatttcttc
agaggccccacagtatcaaatatccgcctggctggattagagtatgttctgcacttcact
gcactgaatgggaagatttactttcgaagctataactacttaaggctaaaaatggaaggc
ttcaaaatggaagaagttaaaactactaaacccattgctaagatccgaacaacaaaggaa
gatactttcccggtagagcgttga

>gi568815592f:110858770_111067787|GENSCAN_predicted_peptide_6|196_aa
MWNYLSYEEKKYMMGDAPDYDRSQWLNEKFKLGLDFPNLPYLIDGAHKITQSKAILGCIA
YKHNLCGETEGEKIWEDILENQLVDNHVQLARLCYNPDFKKLKPEYLEALPAMLKLYSQF
LGKQLLFLGDKITLVDFIAYGILERNQVFEPKWLDAFPNLKDFISRFEGLEISAYMKSSC
FLLRPVFTKMAVWGNK

>gi568815592f:110858770_111067787|GENSCAN_predicted_CDS_6|591_bp
atgtggaactacttaagctatgaggaaaagaagtacatgatgggggacgctcctgactat
gacagaagccagtggctgaatgaaaaattcaagctgggcctggactttcccaatctgccc
tacttgattgatggggctcacaagatcacccagagcaaggccatcctgggctgcattgcc
tacaagcacaacctgtgtggggagacagaaggggagaagatttgggaagacattttggag
aaccagcttgtggacaaccacgtgcagctggccagactctgctacaacccagattttaag
aaactgaagccagaatacctggaggcactccctgcaatgctgaagctctactcacagttt
ctggggaagcagctattgtttcttggggacaagatcacacttgtggatttcatcgcgtat
ggcatccttgagagaaaccaagtatttgagcccaagtggttggacgccttcccaaacctg
aaggacttcatctcccgatttgagggcttggagatctctgcctacatgaaatccagctgc
ttcctcctgagacctgtgttcacaaagatggctgtctggggcaacaagtag

>gi568815592f:110858770_111067787|GENSCAN_predicted_peptide_7|322_aa
MIISIDAEKAFDKIQQPFMLKTLNEVGIDGTYLKIISAIYDKPTTNIILNGQKVEAFPLK
TDTRQGCPLSPLLFNIVLEVLAREIRQEKEIKGIQLGKEEIKLSPFADDMIVHLENPIVS
SQNLLKLISNFSKVSGYKINMQKSQTFLYTNNRQTESQIMSELPFTIATKRKKYLGIQLT
RDVKDLFKENYKPLLNEIKEDINKWKNIPCSWIGRNNVMKMAILPKVIYRFNAVPIKLAL
TFFTELEKKKTTLNFIWNQKRARIAKTILSKKNKAGGITLHDFKLYYKATVTKTAWYRYQ
NRYVDQWNRTEASEIKTISTAI

>gi568815592f:110858770_111067787|GENSCAN_predicted_CDS_7|969_bp
atgattatctcaatagatgcagaaaaggccttcgacaaaattcaacagcccttcatgcta
aaaactctcaatgaagtaggtattgatggaacatatctcaaaataataagtgctatttat
gacaaacccacaaccaatatcatattgaatgggcaaaaggtggaagcattccctttgaaa
accgacacaagacaaggatgccctctctcaccactcctattcaacatagtattggaagtt
ctggccagggaaatcaggcaggagaaagaaataaagggtattcagttaggaaaagaggaa
atcaaattgtccccgtttgcagatgacatgattgtacatttagaaaaccccatcgtctca
tcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat
atgcaaaaatcacaaacattcctatataccaataatagacaaacagagagccaaatcatg
agtgaactcccattcacaattgctacaaagagaaaaaaatacctaggaatacaacttaca
agggatgtgaaggacctcttcaaggagaactataaaccactgctcaacgaaataaaagag
gacataaacaaatggaagaacattccatgctcatggataggaagaaacaatgtcatgaaa
atggccatactgcccaaggtaatttatagattcaatgctgtccccatcaagctagcactg
actttctttacagaattggaaaaaaaaaaaactactttaaatttcatatggaaccaaaaa
agagcccgcattgcaaagacaatcctaagcaaaaagaacaaagctggaggcatcacacta
catgacttcaaactatactacaaagctacagtaaccaaaacagcatggtaccggtaccaa
aacagatatgtagaccaatggaacagaacagaggcctcagaaataaaaaccatatctaca
gccatctga