GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:35:47

Sequence gi568815582f:154398_265153 : 110756 bp : 51.78% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8764   8819   56  0  2   44  105   143 0.970  12.30
 1.02 Intr +  10085  10289  205  2  1   79   91   535 0.832  52.63
 1.03 Term +  10631  10759  129  1  0   77   53   193 0.999  13.19
 1.04 PlyA +  10839  10844    6                               1.05

 2.00 Prom +  11542  11581   40                              -5.21
 2.01 Init +  11601  11692   92  2  2   94   67   117 0.750  10.20
 2.02 Intr +  11871  12075  205  0  1   60   93   487 0.811  46.13
 2.03 Term +  12183  12311  129  2  0  106   52   234 0.991  20.09
 2.04 PlyA +  12347  12352    6                               1.05

 3.00 Prom +  16397  16436   40                              -7.20
 3.01 Init +  18516  18610   95  2  2  107   78   136 0.957  14.31
 3.02 Intr +  18728  18932  205  2  1  143  105   560 0.996  62.93
 3.03 Intr +  19075  19170   96  0  0  104   39   112 0.405   8.61
 3.04 Intr +  22302  22414  113  2  2   87   78   140 0.076  12.58
 3.05 Intr +  22532  22736  205  2  1  143  105   560 0.996  62.93
 3.06 Term +  22886  23014  129  1  0  114   43   154 0.996  11.99
 3.07 PlyA +  23104  23109    6                               1.05

 4.00 Prom +  24261  24300   40                              -7.20
 4.01 Init +  26090  26184   95  1  2   93   86   244 0.999  24.51
 4.02 Intr +  26269  26473  205  1  1  115   99   350 0.995  38.53
 4.03 Intr +  26583  26701  119  2  2   19   -8   160 0.034  -0.73
 4.04 Intr +  29972  30029   58  2  1  116   86    33 0.092   5.28
 4.05 Term +  31551  32897 1347  2  0   34   44   611 0.862  43.37
 4.06 PlyA +  34312  34317    6                               1.05

 5.11 PlyA -  34616  34611    6                               1.05
 5.10 Term -  34942  34801  142  0  1  106   46   107 0.848   6.01
 5.09 Intr -  35738  35571  168  1  0   59   32   156 0.013   6.68
 5.08 Intr -  36173  36144   30  1  0  132   92    -7 0.015   1.83
 5.07 Intr -  38618  38530   89  2  2   71   83    51 0.021   2.17
 5.06 Intr -  44841  44665  177  0  0  126   75   141 0.334  17.23
 5.05 Intr -  51750  51607  144  0  0  116   73   142 0.987  16.49
 5.04 Intr -  53791  53681  111  1  0   68   91   175 0.624  16.88
 5.03 Intr -  66350  66252   99  2  0  109   95   119 0.934  15.51
 5.02 Intr -  72939  72845   95  1  2   14   90    42 0.146  -2.82
 5.01 Init -  74942  74882   61  2  1   81  113   122 0.485  13.64
 5.00 Prom -  76678  76639   40                              -3.81

 6.02 PlyA -  78701  78696    6                               1.05
 6.01 Sngl -  80375  80028  348  2  0   74   55   162 0.757   5.95
 6.00 Prom -  80880  80841   40                              -4.01

 7.00 Prom +  89520  89559   40                              -3.71
 7.01 Init +  93530  93595   66  1  0   84   94    35 0.588   3.21
 7.02 Intr +  95152  95257  106  0  1   77   96    88 0.492   8.79
 7.03 Intr +  99991 100284  294  2  0  -41   93   191 0.018   4.23
 7.04 Intr + 105086 105202  117  0  0   67   86   148 0.996  13.44
 7.05 Intr + 105599 105763  165  0  0  104   96   144 0.803  17.25
 7.06 Intr + 106987 107117  131  2  2   65   73   231 0.750  20.22
 7.07 Intr + 107696 107828  133  1  1  117   88   111 0.905  14.72
 7.08 Intr + 108027 108156  130  1  1   63   94   178 0.999  16.16
 7.09 Intr + 108865 109005  141  1  0   97   77   215 0.993  21.28
 7.10 Intr + 109303 109378   76  1  1   81   79   119 0.999  10.31
 7.11 Intr + 109619 109774  156  1  0  104   76   105 0.978  11.62
 7.12 Intr + 110217 110319  103  2  1   71  113    94 0.991  10.45
 7.13 Intr + 110414 110614  201  0  0   82   42   194 0.585  13.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  22320  22414   95  2  2  107   78   136 0.911  14.31
S.002 Term +  26583  26711  129  2  0   19   43   220 0.936   9.09
S.003 Intr -  35738  35534  205  0  1   59   38   186 0.821   9.48
S.004 Sngl +  76737  77075  339  2  0   70   28   231 0.992  11.74
S.005 Init + 100017 100284  268  2  1   59   93   181 0.978  12.84

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:154398_265153|GENSCAN_predicted_peptide_1|129_aa
MWAKISTQADTIGTETLERLFLSHPQTKTYFPHFDLHPGSAQLRAHGSKVVAAVGDAVKS
IDDIGGALSKLSELHAYILRVDPVNFKLLSHCLLVTLAARFPADFTAEAHAAWAKFLSVV
SSVLTEKYR

>gi568815582f:154398_265153|GENSCAN_predicted_CDS_1|390_bp
atgtgggccaagatctccacgcaggccgacaccatcggcaccgagactctggagaggctc
ttcctcagccacccgcagaccaagacctacttcccgcacttcgacctgcacccggggtcc
gcgcagttgcgcgcgcacggctccaaggtggtggccgccgtgggcgacgcggtgaagagc
atcgacgacatcggcggcgccctgtccaagctgagcgagctgcacgcctacatcctgcgc
gtggacccggtcaacttcaagctcctgtcccactgcctgctggtcaccctggccgcgcgc
ttccccgccgacttcacggccgaggcccacgccgcctgggccaagttcctatcggtcgta
tcctctgtcctgaccgagaagtaccgctga

>gi568815582f:154398_265153|GENSCAN_predicted_peptide_2|141_aa
MLSAQERAQIAQVWDLIAGHEAQFGAELLLRLFTVYPSTKVYFPHLSACQDATQLLSHGQ
RMLAAVGAAVQHVDNLRAALSPLADLHALVLRVDPANFPLLIQCFHVVLASHLQDEFTVQ
MQAAWDKFLTGVAVVLTEKYR

>gi568815582f:154398_265153|GENSCAN_predicted_CDS_2|426_bp
atgctcagcgcccaggagcgcgcccaaatcgcgcaggtctgggacctgattgcgggccac
gaggcgcaattcggggcggagctgctgctcaggctcttcacggtgtaccccagcaccaag
gtctacttcccgcacctgagcgcctgccaggacgcgacgcagctgctgagccacgggcag
cgcatgctggcggctgtgggcgcggcggtgcagcacgtggacaacctgcgcgccgcgctg
agcccgctggcggacctgcacgcgctcgtgctgcgcgtggacccagccaactttccgctg
ctaatccagtgtttccacgtcgtgctggcctcccacctgcaggacgagttcaccgtgcaa
atgcaagcggcgtgggacaagttcctgactggtgtggccgtggtgctgaccgaaaaatac
cgctga

>gi568815582f:154398_265153|GENSCAN_predicted_peptide_3|280_aa
MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG
KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP
AVHASLDKFLASTQREPTMVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKT
YFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLL
SHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR

>gi568815582f:154398_265153|GENSCAN_predicted_CDS_3|843_bp
atggtgctgtctcctgccgacaagaccaacgtcaaggccgcctggggtaaggtcggcgcg
cacgctggcgagtatggtgcggaggccctggagaggatgttcctgtccttccccaccacc
aagacctacttcccgcacttcgacctgagccacggctctgcccaggttaagggccacggc
aagaaggtggccgacgcgctgaccaacgccgtggcgcacgtggacgacatgcccaacgcg
ctgtccgccctgagcgacctgcacgcgcacaagcttcgggtggacccggtcaacttcaag
ctcctaagccactgcctgctggtgaccctggccgcccacctccccgccgagttcacccct
gcggtgcacgcctccctggacaagttcctggcttctactcagagagaacccaccatggtg
ctgtctcctgccgacaagaccaacgtcaaggccgcctggggtaaggtcggcgcgcacgct
ggcgagtatggtgcggaggccctggagaggatgttcctgtccttccccaccaccaagacc
tacttcccgcacttcgacctgagccacggctctgcccaggttaagggccacggcaagaag
gtggccgacgcgctgaccaacgccgtggcgcacgtggacgacatgcccaacgcgctgtcc
gccctgagcgacctgcacgcgcacaagcttcgggtggacccggtcaacttcaagctccta
agccactgcctgctggtgaccctggccgcccacctccccgccgagttcacccctgcggtg
cacgcctccctggacaagttcctggcttctgtgagcaccgtgctgacctccaaataccgt
taa

>gi568815582f:154398_265153|GENSCAN_predicted_peptide_4|607_aa
MALSAEDRALVRALWKKLGSNVGVYTTEALERTFLAFPATKTYFSHLDLSPGSSQVRAHG
QKVADALSLAVERLDDLPHALSALSHLHACQLRVDPASFQLLGHCLLVTLARHYPGDFSP
ALQASLDKFLSHVISALVSETTLNPLRASVKAHALPKPKKTKRKAHVLNKYPESHLQEKV
QVRNTDTGGIRTRREQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGG
TATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRH
GGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGT
ATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHG
GNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTA
TRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGG
NSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNSDTGGTATRGEQRHGGNTNTGGTAT
RREYRHG

>gi568815582f:154398_265153|GENSCAN_predicted_CDS_4|1824_bp
atggcgctgtccgcggaggaccgggcgctggtgcgcgccctgtggaagaagctgggcagc
aacgtcggcgtctacacgacagaggccctggaaaggaccttcctggctttccccgccacg
aagacctacttctcccacctggacctgagccccggctcctcacaagtcagagcccacggc
cagaaggtggcggacgcgctgagcctcgccgtggagcgcctggacgacctaccccacgcg
ctgtccgcgctgagccacctgcacgcgtgccagctgcgagtggacccggccagcttccag
ctcctgggccactgcctgctggtaaccctcgcccggcactaccccggagacttcagcccc
gcgctgcaggcgtcgctggacaagttcctgagccacgttatctcggcgctggtttccgag
actacccttaacccactcagggcttcggtcaaggcccatgcgctccccaagcccaagaag
acgaagaggaaagcacatgtgctgaacaagtaccctgagtcccacctgcaggaaaaggtg
caggtaaggaatacggacacgggaggaatacggacacggagggaacagcgacacgggggg
aacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcgac
acggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacgggggga
acagcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcgaca
cggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaa
cagcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcgacac
ggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaac
agcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacg
gggggaacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaaca
gcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacgg
ggggaacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaacag
cgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacggg
gggaacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagc
gacacggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacgggg
ggaacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcg
acacggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacggggg
gaacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcga
cacggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacgggggg
aacagcgacacggggggaacagcgacacggggggaacagcgacacggggggaacagcgac
acggggggaacagcgacacggggggaacagcgacacggggggaacagcgacacgggggga
acagcgacacggggggaacagcgacacggggggaataccaacacgggaggaacagcaaca
cgcagggaatatcgacatgggtga

>gi568815582f:154398_265153|GENSCAN_predicted_peptide_5|371_aa
MSAQAQMRALLDQLMGTARDGDETRQRVKFTDDRVCKSHLLDCCPHDILAGTRMDLGECT
KIHDLALRADYEIASKERDLFFELDAMDHLESFIAECDRRTELAKKRLAETQEEISAEVS
AKAEKVHELNEEIGKLLAKAEQLGAEGNVDESQKILMEVEKVRAKKKEAEEEYRNSMPAS
SFQQQKLRVCEVCSAYLGLHDNDRRLADHFGGKLHLGFIQIREKLDQLRKTVAEKQEKRN
QDRLRRREEREREERLSRRSGSRTRDRRRSRSRDRRRRRSRSTSRERRKLSRSRSRDRHR
RHRSRSRSHSRGHRRASRDRSAKYKFSRERASREESWESGRSERGPPDWRLESSNGKMAS
RRSEEKEAGEI

>gi568815582f:154398_265153|GENSCAN_predicted_CDS_5|1116_bp
atgtccgcccaggcgcagatgcgggccctgctggaccagctcatgggcacggctcgggac
ggagacgaaaccagacagagggtcaagtttacagatgaccgtgtctgcaagagtcacctt
ctggactgctgcccccatgacatcctggctgggacgcgcatggatttaggagaatgtacc
aaaatccacgacttggccctccgagcagattatgagattgcaagtaaagaaagagacctg
ttttttgaattagatgcaatggatcacttggagtcctttattgctgaatgtgatcggaga
actgagctcgccaagaagcggctggcagaaacacaggaggaaatcagtgcggaagtttct
gcaaaggcagaaaaagtacatgagttaaatgaagaaataggaaaactccttgctaaagcc
gaacagctaggggctgaaggtaatgtggatgaatcccagaagattcttatggaagtggaa
aaagttcgtgcgaagaaaaaagaagctgaggaagaatacagaaattccatgcctgcatcc
agttttcagcagcaaaagctgcgtgtctgcgaggtctgttcagcctaccttggtctccat
gacaatgaccgtcgcctggcagaccacttcggtggcaagttacacttggggttcattcag
atccgagagaagcttgatcagttgaggaaaactgtcgctgaaaagcaggagaagagaaat
caggatcgcttgaggaggagagaggagagggaacgggaggagcgtctgagcaggaggtcg
ggatcaagaaccagagatcgcaggaggtcacgctcccgggatcggcgtcggaggcggtca
agatctacctcccgagagcgacggaaattgtcccggtcccggtcccgagatagacatcgg
cgccaccgcagccgttcccggagccacagccggggacatcgtcgggcttcccgggaccga
agtgcgaaatacaagttctccagagagcgggcatccagagaggagtcctgggagagcggg
cggagcgagcgagggcccccggactggaggcttgagagctccaacgggaagatggcttca
cggaggtcagaagagaaggaggccggcgagatctga

>gi568815582f:154398_265153|GENSCAN_predicted_peptide_6|115_aa
MRRLRPRPTAMLRLLGNGDLGRCLRREPGGVLFVAVAMDAERPRPSSFPSPLRPRSSPSS
FPSFSASSLPSSSSPDLVLFPPSFLTSTPDLRPRPRFRVLSPVFLLFDHEENCWH

>gi568815582f:154398_265153|GENSCAN_predicted_CDS_6|348_bp
atgcgcaggctccgaccccgccccaccgccatgcttcggttgctcggcaacggggatctt
gggcgctgcctccgccgggaacctggcggtgtcctgtttgtcgctgtcgcaatggatgct
gaacgtcctcgcccctcttcgttcccgtcccccctccgtcctcgctcctctccgtcctct
ttcccctccttctctgcgtcctcgctcccctcctcctcctctcccgaccttgtcctcttc
ccgccttctttcttgacgtcgacccccgatctgcgtcctcgcccccgcttccgggtgctc
tccccagtgtttctcctctttgaccacgaggagaattgctggcactga

>gi568815582f:154398_265153|GENSCAN_predicted_peptide_7|607_aa
MEALRGLTVWEPSQSGAQCWVQVHLGLRRHRFPVHSSRPDAPAGVHIEDVLRALPRPGVK
LWDVPVMLDHKDLEAEIHPLKNEERKSQENLGNPSKNEDNVKSAPPQSRLSRCRAAAFFL
SLFLCLFVVFVVSFVIPCPDRPASQRMWRIDYSAAVIYDFLAVDDINGDRIQDVLFLYKN
TNSSNNFSRSCVDEAAVSGANGSTLWERPVAQDVALVECAVPQPRGSEAPSACILVGRPS
SFIAVNLFTGETLWNHSSSFSGNASILSPLLQVPDVDGDGAPDLLVLTQEREEVSGHLYS
GSTGHQIGLRGSLGVDGESGFLLHVTRTGAHYILFPCASSLCGCSVKGLYEKVTGSGGPF
KSDPHWESMLNATTRRMLSHSSGAVRYLMHVPGNAGADVLLVGSEAFVLLDGQELTPRWT
PKAAHVLRKPIFGRYKPDTLAVAVENGTGTDRQILFLDLGTGAVLCSLALPSLPGGPLSA
SLPTADHRSAFFFWGLHELGSTSETETGEARHSLYMFHPTLPRVLLELANVSTHIVAFDA
VLFEPSRHAAYILLTGPADSEAPGLVSVIKHKVRDLVPSSRVVRLGEGGPDSDQAIRDRF
SRLRYQX

>gi568815582f:154398_265153|GENSCAN_predicted_CDS_7|1821_bp
atggaggcactgcggggcctgactgtctgggaaccctctcagagtggagcccagtgctgg
gttcaggtccacctgggcttgcgaaggcacagattccccgtccacagctcacgaccagat
gcaccagcaggagtccacatcgaggacgtcctccgggcactcccacgaccaggagttaaa
ctttgggatgtgcccgtgatgttggaccacaaggacttagaggccgaaatccaccccttg
aaaaatgaagaaagaaaatcgcaggaaaatctgggaaatccatcaaaaaatgaggataac
gtgaaaagcgcgcctccacagtcccggctctcccggtgccgagcggcggcgttttttctt
tcattgtttctctgcctttttgtggtgttcgtcgtctcattcgtcatcccgtgtccagac
cggccggcgtcacagcgaatgtggaggatagactacagtgccgctgttatctatgacttt
ctggctgtggatgatataaacggggacaggatccaagatgttctttttctttataaaaac
accaacagcagcaacaatttcagccgatcctgtgtggacgaagctgctgtgtcgggggcc
aacggcagcacgctctgggagagacctgtggcccaagacgtggccctcgtggagtgtgct
gtgccccagccaagaggcagtgaggcaccttctgcctgcatcctggtgggcagacccagt
tctttcattgcagtcaacttgttcacaggggaaaccctgtggaaccacagcagcagcttc
agcgggaatgcgtccatcctgagccctctgctgcaggtgcctgatgtggacggcgatggg
gccccagacctgctggttctcacccaggagcgggaggaggttagtggccacctctactcc
ggcagcaccgggcaccagattggcctcagaggcagccttggtgtggacggggaaagtggc
ttcctccttcacgtcaccaggacaggtgcccactacatcctctttccctgcgcaagctcc
ctctgcggctgctctgtgaagggtctctacgagaaggtgaccgggagcggcggcccgttc
aagagtgacccgcactgggagagcatgctcaatgccaccacccgcaggatgctttcccac
agctctggagcagtgcgctacctgatgcatgtcccagggaacgccggtgcagatgtgctt
cttgtgggctcagaggccttcgtgctgctggacgggcaggagctgacgcctcgctggaca
cccaaggcagcccatgtcctgagaaaacccatcttcggccgctacaaaccagacaccttg
gctgtagccgttgaaaacggaactggcaccgacagacagatcctgtttctggaccttggc
actggagccgtcctgtgtagcctagccctcccgagcctccctgggggtccactgtccgcc
agcctgccgaccgcagaccaccgctcagccttcttcttctggggcctccacgagctgggg
agcaccagcgagacggagaccggggaggcccggcacagcctgtacatgttccaccccacc
ctgccgcgcgtgctgctggagctggccaatgtctctacccacattgtcgcctttgacgcc
gtcctgtttgagccaagccgccacgccgcctacatccttctgacaggcccggcagactca
gaggcacccggcctggtctctgtgatcaagcacaaggtgcgggaccttgtcccaagcagc
agggtggtccgcctgggtgagggtgggccagacagtgaccaagccatcagggaccggttc
tcccggctgcggtaccagann