GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:12:05 Sequence gi568815590f:18122048_18322917 : 200870 bp : 39.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 138 133 6 1.05 1.07 Term - 7412 7157 256 1 1 111 44 124 0.375 4.37 1.06 Intr - 22575 22506 70 1 1 2 110 60 0.010 -3.08 1.05 Intr - 24107 24059 49 2 1 121 91 41 0.101 5.03 1.04 Intr - 51070 50928 143 2 2 88 18 104 0.101 2.55 1.03 Intr - 60182 60121 62 1 2 88 99 68 0.847 5.36 1.02 Intr - 61399 61225 175 2 1 79 59 105 0.893 4.78 1.01 Init - 62140 62038 103 1 1 76 90 9 0.782 0.35 1.00 Prom - 63287 63248 40 -5.85 2.00 Prom + 70471 70510 40 -4.45 2.01 Init + 72010 72129 120 0 0 28 46 96 0.412 -0.36 2.02 Term + 72856 73020 165 0 0 72 45 142 0.901 5.23 2.03 PlyA + 74025 74030 6 1.05 3.00 Prom + 74645 74684 40 -5.15 3.01 Init + 77626 77749 124 0 1 89 82 84 0.382 8.28 3.02 Term + 80659 80747 89 2 2 47 49 121 0.751 0.94 3.03 PlyA + 81203 81208 6 1.05 4.03 PlyA - 82503 82498 6 1.05 4.02 Term - 83354 83281 74 0 2 93 47 61 0.453 -0.41 4.01 Init - 84314 84155 160 2 1 84 109 144 0.967 16.13 4.00 Prom - 84618 84579 40 -8.65 5.00 Prom + 85746 85785 40 -7.15 5.01 Init + 87171 87241 71 2 2 73 48 105 0.991 5.57 5.02 Intr + 87734 87908 175 2 1 112 95 83 0.959 10.42 5.03 Term + 88590 88691 102 2 0 65 38 113 0.504 1.30 5.04 PlyA + 90043 90048 6 1.05 6.05 PlyA - 91046 91041 6 1.05 6.04 Term - 93966 93793 174 0 0 63 48 119 0.530 2.18 6.03 Intr - 95399 95247 153 2 0 71 45 92 0.388 2.55 6.02 Intr - 96224 96055 170 0 2 36 50 123 0.889 2.04 6.01 Init - 97016 96917 100 2 1 73 116 73 0.436 9.07 6.00 Prom - 99287 99248 40 -7.45 7.00 Prom + 99656 99695 40 -9.65 7.01 Sngl + 100001 100873 873 1 0 93 39 624 0.810 53.69 7.02 PlyA + 101083 101088 6 1.05 8.00 Prom + 103830 103869 40 -6.45 8.01 Init + 117375 117409 35 2 2 83 66 27 0.268 -0.61 8.02 Intr + 121436 121628 193 2 1 118 91 45 0.211 6.37 8.03 Intr + 139729 139868 140 0 2 28 42 192 0.112 6.94 8.04 Intr + 148416 148471 56 0 2 94 111 45 0.350 5.10 8.05 Term + 156066 156148 83 1 2 36 48 147 0.071 2.38 8.06 PlyA + 157096 157101 6 1.05 9.05 PlyA - 157183 157178 6 1.05 9.04 Term - 157925 157617 309 2 0 33 33 223 0.130 5.38 9.03 Intr - 162832 162721 112 0 1 47 81 71 0.121 1.76 9.02 Intr - 167957 167882 76 0 1 31 111 48 0.017 -0.95 9.01 Init - 175134 175035 100 0 1 78 36 110 0.188 5.27 9.00 Prom - 179976 179937 40 -6.95 10.03 PlyA - 180813 180808 6 1.05 10.02 Term - 190942 190854 89 2 2 120 53 80 0.700 4.54 10.01 Init - 195697 195595 103 1 1 34 77 130 0.534 6.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_1|285_aa MNFFNKTSKDVMASHGAQAETCHRGGATTESLHQDGTLDFGLTGTIEKELMYFICEDVSF GGPQPECCGLKWFECLSPSPTDVEICNANSIERNIKWNNYPCKKTPSKELKKSGHSQESP VREIYSSSSTVLCLTIKHHGRLAPEERSWAPHENGLRMIKCGPDVCTQQPGILAVQTERI PEATDPYPNLTAELLRKYMSRAFQAFKGDKELGSLSLHGHCRHSSTTEHLKLRYPATLAD SYIPSPDGLEENKKEFPVFPSKVSHSLPSYFPHGMVLDYLELRKG >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_1|858_bp atgaatttcttcaacaagacttcaaaggatgttatggcaagtcatggggcccaggcagag acttgtcacaggggtggggccaccacagagagtcttcatcaagatgggactctggacttt ggacttaccgggactattgagaaagaattaatgtattttatatgcgaggacgtgagtttt gggggaccacagccagaatgctgtggtttaaaatggtttgaatgtttgtccccttcccca actgatgttgaaatttgcaatgcaaacagtattgagaggaacatcaaatggaacaactat ccttgcaagaagacaccttcaaaagagctaaagaaatcaggtcacagccaggagagccca gtcagagaaatctacagcagcagtagcactgttctctgcctgacaataaaacaccatgga agactcgcccctgaagagaggtcctgggctccccatgagaacggactgaggatgataaaa tgtggacctgatgtctgcacacagcagcctggcatcctagccgttcagacagaaagaatt ccagaggcgactgatccctatccgaatctgactgctgagcttctaaggaaatatatgagc agggccttccaagcattcaaaggggacaaagagttgggttccctaagcctgcatggtcac tgcagacattcaagcactacagaacaccttaagctcaggtatcctgcaactcttgcagac tcctatatacctagccctgatggacttgaggaaaataaaaaagaattccctgtgttccca agtaaagtttctcactctcttccctcttattttccacatggaatggtgctggactatttg gagctgcggaaggggtga >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_2|94_aa MCSPLEPDPYWGMIRAEWNWDNRMASLLCVHWRWGSADPLLRSCMGVIVLVSVRWIPACS QMEQRSDPLLTNTSSKLLAVSAVVQKMVKSLFDE >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_2|285_bp atgtgtagccccctggagccagatccttactggggtatgattagggctgaatggaattgg gacaaccggatggccagcctgctttgtgttcactggaggtgggggagtgccgacccactg ctaaggagctgcatgggagtaattgtgctggtttctgttagatggatacctgcatgttct cagatggagcagagatcagaccctcttctcactaatacgtcatccaaattgctagcagtt tcagcagtggtccagaaaatggtcaaatctctgtttgatgagtaa >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_3|70_aa MTFECVMADKSLVTAAVLEVAHSIYNEWLLQLGATRFFVHLGMKRQTLAVSVTAYKGSED PKSEEQQDLL >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_3|213_bp atgacctttgagtgtgtgatggctgataagtcactggtgacggctgcagttttggaagta gctcatagcatttacaatgagtggttattacagctgggggctactcgtttctttgtgcat ttaggaatgaagcggcagaccctggcggtgagtgttacagcttacaaaggtagtgaggac ccaaagagtgaggagcagcaagatttattgtga >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_4|77_aa MTGILLEDLQREGTESEQREHTEGGLRGGNLGTLHGSAAYLVLSLCPSSSRGTGHREGTQ TCVVCQHPTPKPIPPPV >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_4|234_bp atgactggtatactcctagaagatcttcaaagggaaggcactgaaagtgaacagagggaa cacacagaaggtggactgagagggggaaacctgggaaccctacatgggtctgctgcatac ctggtgttgtccctgtgccccagcagctccaggggaacagggcacagagaaggcacacag acctgtgttgtctgccagcaccctacccccaagccaataccaccaccagtatga >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_5|115_aa MPRESLRIESRVHVAGRQRHVLEQLPVLLFLDLFQALTFHITHGTSSVRLPESRNQVKYK STVLPAGGMGIIGQYNSPPFFMDPVNPTGSKLVIIPAPLALADKKPLMILDVQED >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_5|348_bp atgccaagagaaagccttcgcatagagtcccgagttcacgtggcaggacgccagaggcat gttctagaacagttaccagttctgcttttcctggacctgttccaagctctcacgttccac atcacacatgggacatctagtgtcaggctcccagagagcaggaaccaggtgaaatataag agcacagtcctcccagccggtggcatggggataatcggacaatacaactctccacccttt tttatggacccagttaatcctacaggcagcaagctcgtaataattcctgctcctcttgca ctagctgataaaaagcccctaatgatcctggatgtccaggaagattga >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_6|198_aa MTYTKHIKDQDVVLAPGLPLGTAIIRHVEADKQELGPGDQEYRITKCPDNVPQFSTSSVE TSSCGREYEIETQKEIIEEKRKHWNKEQETAQHLPPRKIVYIYSASKGRKASEDKLQLCL PEVKYLGHIILVKGLSINPDNLLHENGASRYPILSPLNLQSLKSSSEKGIDLSPRPTSLT LANKSPKMIETRLVIFLN >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_6|597_bp atgacatataccaaacacatcaaagaccaggatgtagtgctagcacctggtcttcctcta ggtacagctataattcggcatgttgaggcagataaacaagaattggggccaggagaccaa gaataccgaattacaaaatgtcctgacaacgtaccacaatttagtactagtagtgttgaa acatcgtcttgtggccgggaatatgaaatagagacacagaaggaaataatagaggagaag agaaaacactggaataaagagcaggaaacggcacagcatcttcctcccaggaagatagtc tacatttactcagccagcaagggacgcaaagcgtctgaagacaaacttcagctgtgctta ccggaagttaagtatttggggcatattatcttggtcaaaggactgagtattaaccctgat aatctcttacatgaaaacggtgcttctcgatatcccatcctttcccctttaaatttgcag tccttaaaatcatcttcagagaaaggcatagacctgtctcccaggcccacatccttaact ttggcaaataaatctcctaaaatgattgagactcgtctcgtcattttcctcaattga >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_7|290_aa MDIEAYLERIGYKKSRNKLDLETLTDILQHQIRAVPFENLNIHCGDAMDLGLEAIFDQVV RRNRGGWCLQVNHLLYWALTTIGFETTMLGGYVYSTPAKKYSTGMIHLLLQVTIDGRNYI VDAGFGRSYQMWQPLELISGKDQPQVPCVFRLTEENGFWYLDQIRREQYIPNEEFLHSDL LEDSKYRKIYSFTLKPRTIEDFESMNTYLQTSPSSVFTSKSFCSLQTPDGVHCLVGFTLT HRRFNYKDNTDLIEFKTLSEEEIEKVLKNIFNISLQRKLVPKHGDRFFTI >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_7|873_bp atggacattgaagcatatcttgaaagaattggctataagaagtctaggaacaaattggac ttggaaacattaactgacattcttcaacaccagatccgagctgttccctttgagaacctt aacatccattgtggggatgccatggacttaggcttagaggccatttttgatcaagttgtg agaagaaatcggggtggatggtgtctccaggtcaatcatcttctgtactgggctctgacc actattggttttgagaccacgatgttgggagggtatgtttacagcactccagccaaaaaa tacagcactggcatgattcaccttctcctgcaggtgaccattgatggcaggaactacatt gtcgatgctgggtttggacgctcataccagatgtggcagcctctggagttaatttctggg aaggatcagcctcaggtgccttgtgtcttccgtttgacggaagagaatggattctggtat ctagaccaaatcagaagggaacagtacattccaaatgaagaatttcttcattctgatctc ctagaagacagcaaataccgaaaaatctactcctttactcttaagcctcgaacaattgaa gattttgagtctatgaatacatacctgcagacatctccatcatctgtgtttactagtaaa tcattttgttccttgcagaccccagatggggttcactgtttggtgggcttcaccctcacc cataggagattcaattataaggacaatacagatctaatagagttcaagactctgagtgag gaagaaatagaaaaagtgctgaaaaatatatttaatatttccttgcagagaaagcttgtg cccaaacatggtgatagattttttactatttag >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_8|168_aa MAKVTDFTKVNRRTHVIVQVFSMHTDKSGWECNRKTFSSRENNFSASLQLSSFALEVSLC FKKHLLYIQEVHLVLQENFAMGPRNKKLNEVFLSPYVLESLTYDEVGAASRSPREETGEK MISTFGKDETIMEFNKQYLSSDLRQHDIEKAPVIDGFYELPVLMDFTE >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_8|507_bp atggccaaagtgactgattttacgaaagtgaatagaaggacacatgtgattgtccaagtg ttctccatgcacacagataagtctgggtgggaatgtaaccgaaagactttttcaagtaga gagaacaacttttccgcatcccttcaactatctagctttgcattggaagtgtctctgtgc ttcaagaagcatcttctctacattcaggaagttcaccttgttctgcaggaaaactttgca atgggtccccgaaataagaaactgaatgaggttttcctctcgccttatgtccttgagagc ttaacttatgatgaagtaggagcagcctctcgttcaccacgtgaggaaacaggggagaaa atgatcagcacttttggtaaagatgagaccatcatggaattcaacaaacagtatctgtcc tcagatctcaggcagcatgacattgagaaagcccctgttatcgacggcttttatgagctt cctgtcctcatggatttcacagaatga >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_9|198_aa MNRHLSKEDIQAANTDEKLLIITNHQRNANQNRMMSMNYLSETIWPQGVEEADCGSQTSC QQSLRPESTKHDFMGGISNKSCNERALKDLMFHAGKVSAISYVIEMETSRRKQKKNNLRC LMKGLEQIFLSFDTVGNRLMGTAFQVLAKMIDTVNGHLANCIKVHVGQDTQNCFSLPRQL NKQSRLRSKVSGTRVQEL >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_9|597_bp atgaacaggcacctctcaaaagaagacatacaagcagccaacacagatgaaaaactgctc atcatcactaatcatcagagaaatgcaaatcaaaaccgcatgatgtcaatgaattatctg agtgaaacaatatggcctcaaggagtggaggaagccgactgtgggagtcagaccagctgc caacaatctttgaggcctgagtcaaccaagcatgatttcatgggcggtatttcaaataag agttgcaatgagagagccttgaaggacttgatgttccatgcgggaaaggtgtcagctata agctatgtgatagaaatggaaacttctagaagaaagcaaaagaagaataatctcagatgc ttaatgaaaggactggagcagatattcttgagttttgacactgtaggaaatagacttatg ggcacagcctttcaagtgttagctaaaatgattgacaccgtgaatggacacttggcaaac tgcattaaagtacatgtgggacaagatactcagaactgctttagtctcccaaggcagttg aataagcaaagtcgactgcgttctaaggtcagtggaacaagagtacaagaattgtag >gi568815590f:18122048_18322917|GENSCAN_predicted_peptide_10|63_aa MGSCWKTAVPNVGTALEHHPPFTPFGMGISAGENASSAQRRGDVNTSESEMTAVYRPREE PSE >gi568815590f:18122048_18322917|GENSCAN_predicted_CDS_10|192_bp atggggagctgttggaagacagctgttcccaatgtggggacagctcttgagcatcatcct ccgttcacccctttcggcatgggaatctctgctggagagaatgcctcaagtgcacaaaga agaggtgatgtgaacacaagtgagagtgagatgacagctgtctataggccaagagaagag ccctcagaatga