GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:43:54 Sequence gi568815588f:72174217_72375185 : 200969 bp : 48.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 1350 1345 6 1.05 1.07 Term - 15751 15632 120 1 0 86 42 42 0.228 -2.13 1.06 Intr - 22773 22595 179 1 2 106 115 171 0.999 21.24 1.05 Intr - 29308 29211 98 0 2 90 77 8 0.421 -0.45 1.04 Intr - 36615 36516 100 1 1 55 78 123 0.555 7.17 1.03 Intr - 39115 38971 145 1 1 34 84 152 0.043 9.26 1.02 Intr - 49808 49632 177 2 0 32 44 117 0.006 1.82 1.01 Init - 52415 52374 42 2 0 62 66 38 0.008 -2.58 1.00 Prom - 52903 52864 40 -4.76 2.00 Prom + 52954 52993 40 -5.16 2.01 Init + 53897 53945 49 1 1 67 70 -3 0.180 -3.09 2.02 Intr + 56150 56224 75 0 0 101 103 33 0.972 5.59 2.03 Term + 58785 58900 116 1 2 96 43 175 0.966 12.53 2.04 PlyA + 59508 59513 6 1.05 3.00 Prom + 63177 63216 40 -2.96 3.01 Init + 69524 69703 180 1 0 84 32 75 0.037 0.68 3.02 Intr + 86591 86725 135 1 0 157 47 13 0.060 4.86 3.03 Intr + 99941 100205 265 1 1 112 105 298 0.994 30.99 3.04 Term + 100479 100972 494 1 2 112 43 799 0.999 72.57 3.05 PlyA + 101799 101804 6 1.05 4.00 Prom + 104620 104659 40 -4.76 4.01 Init + 106467 106544 78 2 0 96 73 107 0.563 11.07 4.02 Intr + 111518 111572 55 1 1 48 103 31 0.353 -0.85 4.03 Intr + 117675 117743 69 1 0 101 94 60 0.707 7.05 4.04 Intr + 120171 120208 38 1 2 118 53 3 0.421 -2.22 4.05 Term + 121684 121974 291 0 0 115 47 156 0.891 9.54 4.06 PlyA + 122782 122787 6 -0.45 5.02 PlyA - 122877 122872 6 1.05 5.01 Sngl - 126655 126140 516 1 0 88 43 235 0.934 13.11 5.00 Prom - 129003 128964 40 -6.76 6.00 Prom + 133022 133061 40 -4.16 6.01 Init + 135059 135150 92 1 2 65 105 22 0.514 1.57 6.02 Intr + 135628 135842 215 1 2 81 63 43 0.215 -0.54 6.03 Intr + 145323 145375 53 1 2 98 47 42 0.126 -0.27 6.04 Intr + 147555 147577 23 2 2 138 98 -2 0.538 2.14 6.05 Intr + 147978 148078 101 0 2 110 83 26 0.557 4.05 6.06 Intr + 149709 149820 112 1 1 85 69 4 0.161 -2.36 6.07 Term + 151727 152237 511 2 1 6 33 338 0.481 13.95 6.08 PlyA + 152540 152545 6 1.05 7.00 Prom + 153345 153384 40 -7.86 7.01 Init + 153863 154010 148 1 1 94 45 114 0.892 6.41 7.02 Term + 154220 154368 149 0 2 91 45 131 0.994 7.16 7.03 PlyA + 156323 156328 6 -0.45 8.11 PlyA - 157624 157619 6 -0.45 8.10 Term - 159411 159238 174 0 0 83 41 93 0.468 1.86 8.09 Intr - 161323 161186 138 1 0 89 28 70 0.545 1.76 8.08 Intr - 161715 161618 98 1 2 79 46 89 0.934 3.53 8.07 Intr - 162480 162308 173 2 2 63 70 277 0.987 22.99 8.06 Intr - 164095 163986 110 1 2 73 84 88 0.914 5.98 8.05 Intr - 166652 166573 80 0 2 112 76 159 0.989 16.37 8.04 Intr - 166954 166769 186 2 0 114 94 288 0.997 31.66 8.03 Intr - 169295 169150 146 1 2 87 121 189 0.807 22.03 8.02 Intr - 170911 170734 178 2 1 69 71 164 0.989 11.78 8.01 Init - 180681 180549 133 0 1 89 101 299 0.492 31.60 8.00 Prom - 188715 188676 40 -4.86 9.02 PlyA - 188769 188764 6 1.05 9.01 Term - 194139 193979 161 1 2 99 40 111 0.941 5.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 39082 38971 112 1 1 56 84 115 0.902 8.27 S.002 Sngl + 41426 41671 246 1 0 53 48 197 0.829 7.28 S.003 Term + 86591 86658 68 0 2 157 49 29 0.811 4.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_1|286_aa MRSQLTAASTSQAQVKRAFRGGARSETLKPDPVTELPLTPPAEEDDDEAAISSLAEFLLH YRERKTAIGKFMEGIIGISFGESVMEVLRPQLIRIDGRNYRKNPVQEQTYQHEEDEEDFY QGSMECADEPCDAYEVEQTPQGFRSTLRAPSLLYKHIVGKRGDTRKKIEMETKTSISIPK PGQDGEIVITGQHRNGVISARTRIDVLLDTFRRKQPFTHFLAFFLNEVEVQEGFLRFQEE VLAKCSMLLDILVCTCLGLPDGCRQTERLYFLNLIVKTEIKAFGIK >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_1|861_bp atgcgatcacagctcactgcagcctccacctcccaggctcaggtgaaaagggctttccgt ggtggggcaaggtctgagacactgaaaccagatccagtgacagaacttccactgacccca ccagctgaggaggatgatgatgaagcagccatttcctctctagcagagttcttacttcac tacagagaaagaaaaacagcaattggcaagtttatggaaggcataattggaatatcattt ggagaaagtgtcatggaagttctgcgtccacagcttataagaattgatggccggaattac aggaagaatccagtccaagaacagacctatcaacatgaagaagatgaagaggacttctat caaggctccatggagtgtgctgatgagccctgtgatgcctacgaggtggagcagacccca caaggattccggtctactttgagggcccccagcttgctctataagcatatagttggaaag agaggggacactaggaagaaaatagaaatggagaccaaaacttctattagcattcctaaa cctggacaagacggggaaattgtaatcactggccagcatcgaaatggtgtaatttcagcc cgaacacggattgatgttcttttggacacttttcgaagaaagcagcccttcactcacttc cttgcctttttcctcaatgaagttgaggttcaggaaggattcctgagattccaggaggaa gtactggcgaagtgctccatgctcctggatatacttgtgtgcacttgtcttggcttacct gatggctgccgccagacagaaaggctgtactttcttaatcttatagtgaagacagaaata aaagcatttggaataaaatag >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_2|79_aa MDAVFNLKRFNLMLHPYGSERFLCESVFSYQVASTLKQVKHDQQVARMEKLAGLVEELEA DEWRFKPIEQLLGFTPSSG >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_2|240_bp atggatgctgtttttaatctcaaaaggttcaacttgatgttacatccttatggctctgag agattcctctgcgaatctgtttttagctatcaagtggcatccacgcttaaacaggtgaaa catgatcagcaagttgctcggatggaaaaactagctggtttggtagaagagctggaggct gacgagtggcggtttaagcccatcgagcagctgctgggattcaccccctcttcaggttga >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_3|357_aa MEAKRTWEWECRGDDKEPFLEKASLLLNMMMRDSAPHTKGKMPKKRSTHREQHVQSQELQ GSHQKGANPAPCPHPAAGAPNAELSISVAWLPSSRAVGGASFLPRRLLRSGTLSSSANAL ASVLTMPSLWDRFSSSSTSSSPSSLPRTPTPDRPPRSAWGSATREEGFDRSTSLESSDCE SLDSSNSGFGPEEDTAYLDGVSLPDFELLSDPEDEHLCANLMQLLQESLAQARLGSRRPA RLLMPSQLVSQVGKELLRLAYSEPCGLRGALLDVCVEQGKSCHSVGQLALDPSLVPTFQL TLVLRLDSRLWPKIQGLFSSANSPFLPGFSQSLTLSTGFRVIKKKLYSSEQLLIEEC >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_3|1074_bp atggaagccaagaggacttgggagtgggaatgccgaggagatgacaaggagcctttccta gagaaggcgagcctgctcttgaacatgatgatgagagacagcgcgccacacacaaaggga aagatgccaaagaagcgttccacgcacagggaacagcatgtgcaaagtcaggaactccag ggctctcaccagaaaggcgccaaccctgcgccctgcccccacccagccgccggggcccca aatgctgagctcagcatctctgtggcctggctgccctcctcccgggcagtgggaggagcc tccttcctcccgaggcggcttctacgctccggcactctgagttcatcagcaaacgccctg gcgtctgtcctcaccatgcctagcctttgggaccgcttctcgtcgtcgtccacctcctct tcgccctcgtccttgccccgaactcccaccccagatcggccgccgcgctcagcctggggg tcggcgacccgggaggaggggtttgaccgctccacgagcctggagagctcggactgcgag tccctggacagcagcaacagtggcttcgggccggaggaagacacggcttacctggatggg gtgtcgttgcccgacttcgagctgctcagtgaccctgaggatgaacacttgtgtgccaac ctgatgcagctgctgcaggagagcctggcccaggcgcggctgggctctcgacgccctgcg cgcctgctgatgcctagccagttggtaagccaggtgggcaaagaactactgcgcctggcc tacagcgagccgtgcggcctgcggggggcgctgctggacgtctgcgtggagcagggcaag agctgccacagcgtgggccagctggcactcgaccccagcctggtgcccaccttccagctg accctcgtgctgcgcctggactcacgactctggcccaagatccaggggctgtttagctcc gccaactctcccttcctccctggcttcagccagtccctgacgctgagcactggcttccga gtcatcaagaagaagctgtacagctcggaacagctgctcattgaggagtgttga >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_4|176_aa MGSFMETSGAADAACGYVELAALNVESHSKYLNRQVLLHPLLQGDLVNQQVGEGDLLLPE AVGAGARGHQGTDSRLSSYQPIGAGEAAGVGPVNRFANQHRVIPALPLVSVWSHDPNCPV TVKPGTCVTIPKDRASFYSTGTWDDVPLQLQQVLMEPKHFPRQQRQSTNKYKDSGK >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_4|531_bp atgggcagcttcatggaaacttctggggcagcagatgcagcctgtggctatgtggagctg gcggctctgaatgtagagtcccactccaaatacctcaacaggcaagtcctgctgcaccca ctcctacaaggtgacttggttaatcagcaggttggggagggggacctgctgctcccagag gccgtaggtgctggagcacgaggacaccagggaactgactcaaggctctcctcttaccag cccattggagcaggtgaggctgcaggtgtggggcctgtgaacagatttgccaatcaacac agggttattcctgccttgcccctggtatctgtgtggtcacatgaccccaactgtccagtt acagtgaaacccgggacatgtgtcacaattcccaaggacagagcctctttctactcaact ggaacttgggatgatgtgcccctgcagctgcagcaggtgctcatggagcccaaacacttc ccaagacagcaaaggcaaagtacaaacaagtacaaagactctgggaaatga >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_5|171_aa MAGCRSRALPRGEAAKARQEIERSGPALLGGPAVLEDRVHPPQPLARVLSPPLPEAAPNA GPAEPTPTGNSRWPTSTARRPGCHRRLSLHTSLQAEGAGSGLGQPRKGLPQCSGRLKGSS AAKVGAQAEEVPRASEGCEDCQHAVTSQNDSQCRASFSQRYTCGNNPCSYM >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_5|516_bp atggcgggatgcaggtcccgagccctgccccgcggggaggcagctaaggcccggcaagaa atcgagcgcagtgggccggcactgctgggtgggccggcagtgctggaggaccgagtacac cctccgcagccgctggcccgggtgctaagccccccattgcccgaggccgctcctaatgcg gggcccgccgagcccacgcccaccgggaactcgcgctggcccacaagcaccgcgcgcaga cccggttgccaccggcgcctctccctccacacctccctgcaagctgagggagccggctct ggccttggccagcccagaaaggggctcccacagtgcagcggcaggctgaagggctcaagt gccgccaaagtgggagcccaggcagaggaggtgccgagagcgagcgagggctgtgaggac tgccagcatgctgtcacctctcagaatgactctcagtgtcgggcctcattctctcaacgt tacacctgtgggaataatccatgcagttatatgtag >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_6|368_aa MRFGDRVPGNLIWGDSQVQMPLQACTVSARRPEHFAASRNTHGDGTATHTLPSTHSHVLF AAVKALLTHTFSPAPASTAVPCCLEPGPSSRTTHMFSVGFSQYQTGQRAAIKGVWDEPGG VGPGGMSQVWLKHTVGPQSTLLTREMSPGMREMTHEAGLGEAQGWGGGMVCVPGSSSDTA ELGDLGKSLLFSGARFPQWEVWRERREREPGLRAALAGQLEFRVGVGLAAPHSEQPAGPA APGNEGLSTRASGCGGCTGSPSSASPPALLSISHRVLAAFLRGRARDLQPAMPEPPTPSM GSCAARASPMSAAPCSTAPSPIDHPRAEECGPTARDWQAAPPAAPVHDPLSEASWAPESG GDVENLYV >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_6|1107_bp atgaggtttggggaccgagtcccaggaaatctcatatggggtgattcccaggtccagatg cccttgcaggcttgcactgtgtctgccaggagaccagaacattttgctgccagcagaaac acacatggtgatggcactgctacccacaccctgccgtcaacacactcccacgtgctgttt gccgctgtcaaagccctgctcacacacacattctcacccgctcctgccagcacagctgtt ccctgctgcctggaaccaggcccttcatcccgcacgacacacatgttctctgtgggtttt tctcaataccagacggggcaaagggcagccatcaaaggtgtctgggacgagcccggcggg gtggggccaggtggaatgtcacaggtgtggctcaagcacacagtgggtcctcagtccacc ctgctgaccagggagatgagtcctggcatgagggagatgactcatgaagccgggctgggc gaggcccagggttggggcggggggatggtctgtgtcccaggcagcagctccgacaccgct gagctgggcgaccttggcaagtcacttctcttttctggggctcgatttcctcagtgggag gtgtggagggagaggcgcgagcgggaacccgggctgcgcgcggcgcttgcgggccagcta gagttccgggtgggcgtgggcttggcggccccgcactcggagcagcccgccggccctgcc gccccggggaatgaggggcttagcacccgggccagcggctgcggagggtgtactgggtcc cccagcagtgccagcccaccggcgctgctctcgatttctcaccgggtcttagctgccttc ctgaggggcagggctcgggacctgcagcccgccatgcctgagcctcccaccccctccatg ggctcctgtgcggcccgagcctccccgatgagcgccgccccctgctccacagcgcccagt cccatcgaccacccaagggctgaggagtgcgggcccacagcgcgggactggcaggcagct ccacctgcagccccggtgcacgatccactgagtgaagccagctgggctcctgagtctggt ggggacgtggagaacctttatgtctag >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_7|98_aa MDTLAAHVAPAVVVAGLFDLWLEAKLGPTMPPSARRPWAGSQQMAEPARGRGDLGPFQQD QQDSASPRRQEKEKAPAQCPSHERLLPPATVQLDAHRG >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_7|297_bp atggacacacttgctgcccatgtggcccctgcagtggtggtagctggcttgtttgacttg tggctggaagcaaaattagggcccacgatgccaccttctgcacgcaggccatgggctggc agccagcagatggcagagcctgccagagggagaggagacctggggccttttcagcaggat cagcaggactctgcttccccccggaggcaggaaaaggaaaaagcccccgcccagtgcccg tcccatgagcgactgctgcctccagccaccgtccagctggatgcacaccgtggctga >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_8|471_aa MESNKDEAERCISIALKAIQSNQPDRALRFLEKAQRLYPTPRVRALIESLNQKPQTAGDQ PPPTDTTHATHRKAGGTDAPSANGEAGGESTKGYTAEQVAAVKRVKQCKDYYEILGVSRG ASDEDLKKAYRRLALKFHPDKNHAPGATEAFKAIGTAYAVLSNPEKRKQYDQFGDDKSQA ARHGHGHGDFHRGFEADISPEDLFNMFFGGGFPSSNVHVYSNGRMRYTYQQRQDRRDNQG DGGLGVFVQLMPILILILVSALSQLMVSSPPYSLSPRPSVGHIHRRVTDHLGVVYYVGDT FSEEYTGSSLKTVERNVEDDYIANLRNNCWKEKQQKEGLLYRARYFGDTDMYHRAQKMGT PSCSRLSEGCGGPQNQAEWELETVAEGHPGLLGPLQPSVSTALSPAQQTIVRALVPQGTA PSLPLVFPKSCTKDFRKFFLRVSQWCRDPRAASSTPEEKAVPVAASVFTRC >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_8|1416_bp atggaatccaacaaggatgaagctgagcgctgtatcagcatcgccctcaaggccatccag agcaaccagcccgaccgggcgctccgcttcctggagaaggcacagcggctgtatccgacg ccgcgagttcgcgccctgattgagtccctcaaccagaaaccacagactgccggtgaccaa cccccacccacagacacaacccatgccacccacaggaaagcaggtgggaccgatgccccc tcggccaacggtgaagctggaggagagagcaccaaaggctacactgcagaacaggttgca gctgtgaaaagggtcaagcaatgtaaagattactatgagatcctgggggtgagcagaggg gcctcggatgaggacctgaagaaggcctaccgcagactggccctcaaattccacccagac aagaaccacgcacctggtgccactgaagccttcaaagccattggcacagcatatgcggta ctcagcaacccggagaagaggaagcagtatgaccagttcggcgatgacaagagccaggcg gcccggcacggccatgggcatggggatttccaccgtggctttgaggccgacatctcccct gaagacctcttcaacatgttctttggcggcggcttcccttctagtaacgtccacgtctac agcaacggccgcatgcgctatacctaccagcaaaggcaggaccgcagggacaaccagggt gatggcgggctaggggtgtttgtgcagctgatgcctatcctcatcctgattctcgtgtca gctctcagccagctcatggtctccagtccaccctacagtctgagtccaagaccgtccgtg ggccacatccacaggcgagtcactgaccacctgggtgtcgtctactatgtgggagacact ttctccgaagagtacacaggctccagcctcaaaacagtcgagcggaatgtggaagatgat tatatcgccaacctccggaacaactgctggaaggagaagcagcagaaggaaggcttgctg taccgggcacgctactttggcgacacagatatgtaccacagagcacagaagatgggcacc cccagctgcagccgactgtcagaggggtgtgggggaccccagaaccaggcagagtgggaa cttgaaactgttgctgagggccacccggggcttcttggtccactccagccatcagtcagc acggccctttctccagcacagcagactattgtccgtgccctggtgccacagggcacagct ccatcattgccactggtttttcctaagtcatgcaccaaggacttcagaaagttcttcctc cgagtcagtcagtggtgccgggaccccagggcagcctcctcgactcctgaagaaaaggca gtgcccgtggctgcctctgtcttcactcgctgttga >gi568815588f:72174217_72375185|GENSCAN_predicted_peptide_9|53_aa XNGELSNKEFVSIMKQRLMRGLEKPKDMGFTRLMQAMWKCAQETAWDFALPKQ >gi568815588f:72174217_72375185|GENSCAN_predicted_CDS_9|162_bp ngcaatggcgaactgagcaataaggaatttgtttccatcatgaagcaacggctgatgaga ggcctggaaaagcccaaagacatgggtttcactcgcctcatgcaggccatgtggaaatgt gcacaggaaactgcctgggacttcgctttacccaaacagtaa