GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:43:19 Sequence gi568815586f:93278323_93499376 : 221054 bp : 41.96% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1513 1749 237 0 0 95 36 122 0.592 2.88 1.02 PlyA + 2579 2584 6 1.05 2.00 Prom + 5084 5123 40 -4.75 2.01 Init + 17886 18035 150 2 0 86 48 73 0.118 3.23 2.02 Intr + 35065 35276 212 0 2 92 30 116 0.647 2.99 2.03 Intr + 36310 36489 180 1 0 8 55 210 0.599 7.76 2.04 Intr + 83086 83239 154 1 1 64 90 35 0.012 0.45 2.05 Intr + 87698 87739 42 1 0 43 107 58 0.244 0.72 2.06 Intr + 92754 92909 156 2 0 62 67 104 0.522 4.99 2.07 Intr + 99805 100099 295 0 1 17 75 225 0.029 9.76 2.08 Intr + 100286 100371 86 0 2 68 96 29 0.033 0.32 2.09 Intr + 116194 116397 204 0 0 55 78 166 0.394 10.77 2.10 Intr + 120446 120533 88 1 1 38 111 65 0.517 2.42 2.11 Term + 120855 121057 203 1 2 98 44 85 0.893 1.67 2.12 PlyA + 121220 121225 6 -0.45 3.08 PlyA - 121383 121378 6 1.05 3.07 Term - 121463 121411 53 0 2 96 43 56 0.008 -1.49 3.06 Intr - 132552 132412 141 1 0 82 70 142 0.758 11.20 3.05 Intr - 132977 132731 247 2 1 98 115 242 0.999 24.01 3.04 Intr - 163481 163379 103 1 1 91 20 76 0.009 0.36 3.03 Intr - 164981 164923 59 2 2 86 96 25 0.010 -0.14 3.02 Intr - 170925 170845 81 0 0 127 59 32 0.294 3.12 3.01 Init - 175513 175508 6 1 0 72 131 9 0.702 4.03 3.00 Prom - 177173 177134 40 -7.15 4.04 PlyA - 177601 177596 6 1.05 4.03 Term - 189649 189228 422 2 2 3 49 223 0.225 4.37 4.02 Intr - 192451 192365 87 2 0 64 82 53 0.253 1.32 4.01 Init - 198395 198278 118 2 1 35 69 157 0.932 8.91 4.00 Prom - 207141 207102 40 -4.35 5.00 Prom + 215685 215724 40 -2.85 5.01 Sngl + 218729 219034 306 1 0 49 43 238 0.989 11.02 5.02 PlyA + 219111 219116 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 69055 68886 170 2 2 99 44 148 0.986 8.56 S.002 Term + 100527 100596 70 0 1 78 47 129 0.874 4.23 S.003 Term + 151274 151414 141 1 0 1 47 234 0.870 7.55 S.004 Term + 163826 164098 273 1 0 37 51 231 0.838 8.79 S.005 Term + 179608 179739 132 0 0 131 49 70 0.800 4.51 S.006 Term - 205846 205646 201 1 0 61 38 138 0.835 2.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:93278323_93499376|GENSCAN_predicted_peptide_1|78_aa ARVIAHVPSPNKASKPTTQTWVQRQTLPLTERSSVSLSGEWESSLTGAKTLALGMIKFKA FLGHSHNQSLQAFTCTTI >gi568815586f:93278323_93499376|GENSCAN_predicted_CDS_1|237_bp gcaagagtgattgcccatgttccatcccccaacaaagcatcaaaaccaaccacacagacc tgggttcaaaggcagactctaccacttactgagagatcatcagtgtccttatctggagaa tgggaatcatccctaacaggtgcaaagactctagccttgggcatgatcaaattcaaagcc ttcttgggacactctcataatcagtcactgcaggctttcacctgcacaaccatctag >gi568815586f:93278323_93499376|GENSCAN_predicted_peptide_2|589_aa MEPKFLATVRECCLRPTLSDWGGNMRVLTGDFPKESLTPSGLKDDSKWHTQSGLKKTSGL KKLPTLVLAIGNTVYKCTLLQLVLENLGTDASPQGNIRVGDQVKSQSYLQDLPYILALFS TCGRFQSLRSEPAPGDLYPESGRKANRRKPTAEAIALRMIRGNRPSLTFAHWKSQSQVSL GHFRPFVLLFQTAQNAKIKSRILSFGGMLEVDFISGAQTPLRSRVKSFKFLKSSSAIIAF DGDWILLSGSSLNPVCLRNLHSVSIPSPRVWGGTLSGMRVLGITIGKVEKDYSPTLEQRP VPPASERPPAPGLTGLAAPPAPTSAPGRSCARLALAPTPRRGGGGRAPTAAAGAAAAAAG ARLYDEVQAQPDADLRPRGLQEAGGVPVLPERAGGRDETKGARPAPVSPSGHSRRRLVPG KVPGRNRVITLQRFENVVYIRSASQAGSIQLWFTLQVLLVSSSRYPDQWIVPGGGMEPEE EPGGAAVREVYEEQNQDRKHRTYVYVLTVTEILEDWEDSVNIGRKREWFKVEDAIKVLQC HKPVHAEYLEKLKLGCSPANGNSTVPSLPDNNALFVTAAQTSGLPSSVR >gi568815586f:93278323_93499376|GENSCAN_predicted_CDS_2|1770_bp atggagccaaagttccttgccacagtgagagaatgctgccttcgccccaccctcagtgac tggggaggaaacatgagggtcctcacaggggattttccaaaagagagtttaaccccaagt ggtctgaaagatgattctaaatggcacacacaaagtggcctcaagaagacctctggcctc aagaagctccccactcttgtactggcaattggcaatacagtgtacaaatgtacactgctt caactggtcctagaaaacctgggaaccgatgcatccccacaaggaaatattagagttggg gaccaggtgaaatcacagtcataccttcaggaccttccatatatacttgccttattcagc acgtgtggcagattccagagcctgcggtctgaaccagcccctggtgacctttaccccgag agcggaagaaaggcaaacaggcggaagcccacagccgaggccatcgcgctgcgcatgatc cgtggcaaccgtccttccttgacctttgcccactggaaaagccaatcccaagtcagcctg gggcacttccggccatttgttttactctttcaaacagctcaaaatgccaaaattaaatca agaattttatcatttggaggcatgcttgaagtagattttatttctggtgcccaaacccct ctcaggtccagagtaaaaagcttcaaattcctgaagagttcctctgccatcattgccttt gatggtgactggattctgctatctggaagctctctgaacccagtgtgtttacggaatctt catagtgttagcattccttcccccagggtatggggcggaaccctctctggaatgagggtc ttaggaatcacaatcggaaaggtggagaaagattacagtcctaccttggagcagcgtccc gtcccgcccgcgtcggagcggccgccggccccgggactgaccggcctcgccgcacctccc gcaccgactagcgctcccgggcgctcctgcgcccgactcgccctcgcccccactccccgg cggggtggcggcggccgggcccccacggcggcggccggagcagcagcagcagcagcagga gcccgcctctatgatgaagttcaagcccaaccagacgcggacctacgaccgcgagggctt caagaagcgggcggcgtgcctgtgcttccggagcgagcaggaggacgagatgagacaaag ggggctcgcccagccccagtttctccatctgggcactcgagaaggcgcctggtccccggg aaggtcccgggccgcaatagagttataaccttgcagaggtttgagaatgttgtttatata cgaagtgcttctcaggctggaagtatacagttatggttcaccttacaggtgctgctggtg agtagcagccggtacccagaccagtggattgtcccaggaggaggaatggaacccgaggag gaacctggcggtgctgccgtgagggaagtttatgaggagcagaaccaagaccgaaagcac agaacatatgtttatgttctaacagtcactgaaatattagaagattgggaagattctgtt aatattggaaggaagagagagtggttcaaagtagaagatgctatcaaagttctccagtgt cataaacctgtacatgcagagtatctggaaaagctaaagctgggttgttccccagccaat ggaaattctacagtcccttcccttccggataataatgccttgtttgtaaccgctgcacag acctctgggttgccatctagtgtaagatag >gi568815586f:93278323_93499376|GENSCAN_predicted_peptide_3|229_aa MPIASDIIEKHGRNQSDLRQTGDGEGGRKLNAISQAISYAWHTLPYKVSWAGSAASRLLR PAARSGRPEAARRGVGRGGEGSRETQRLLAEPVPGIKAEPDESNARYFHVVIAGPQDSPF EGGTFKLELFLPEEYPMAAPKVRFMTKIYHPNVDKLGRICLDILKDKWSPALQIRTVLLS IQALLSAPNPDDPLANDVAEQWKTNEAQAIETDGFAEYANAVNEKTILG >gi568815586f:93278323_93499376|GENSCAN_predicted_CDS_3|690_bp atgccgatagcaagtgacataatagagaaacatgggagaaatcaatcagacttgagacag acaggtgatggtgaaggaggaagaaagctcaatgccatttcacaggccatttcctatgct tggcatactcttccctacaaggtcagctgggcaggcagcgcggcctctcgcctccttcgg cccgcggcccgctctgggaggcccgaggcggcgcggagaggggttggccgcggcggcgag ggaagtcgggaaacccagcgtttgctggcagaaccagttcctggcatcaaagccgaacca gatgagagcaacgcccgttattttcatgtggtcattgctggccctcaggattcccccttt gagggagggacttttaaacttgaactattccttccagaagaatacccaatggcagcccct aaagtacgtttcatgaccaaaatttatcatcctaatgtagacaagttgggaagaatatgt ttagatattttgaaagataagtggtccccagcactgcagatccgcacagttctgctatcg atccaggccttgttaagtgctcccaatccagatgatccattagcaaatgatgtagcggag cagtggaagaccaacgaagcccaagccatagaaacagatggatttgctgaatatgctaat gctgtgaatgagaaaacaattttggggtag >gi568815586f:93278323_93499376|GENSCAN_predicted_peptide_4|208_aa MVNIGKDGPHRKATKQIVQVQESLFSGDDVQGALKNEKITTQMQLEAIILSELMQKQPNT TCSNLQVGAQRKARPAGSQIIHPRKIISEIKSLSPEFNPKTKATEPLRVRGLRNRLPRAP ELFDLKESLEPQAASPPHGRPPGNKGGAIRNAQPHAGSSAGRTKAGDAGNLALPGTTGGR PLQLPTNTVGLPSFEEVLDRRSRGKRLP >gi568815586f:93278323_93499376|GENSCAN_predicted_CDS_4|627_bp atggtaaacattggcaaagatgggccacacagaaaggctaccaaacaaattgtgcaggtt caagaaagccttttcagtggagatgatgtccaaggtgctctgaagaatgagaaaataaca acacagatgcagctggaggccattatcctaagtgaattaatgcaaaaacagccaaatact acatgttctaacttacaagtaggagcccagagaaaagcacgccctgctgggagtcagatt attcaccccaggaagatcatttcagagatcaagtccctcagcccggaattcaaccccaaa acaaaagctacagagcctctgagggtcagaggtttgaggaacaggctaccaagagcacca gaactcttcgacctaaaggagtccctagagccacaagctgcatcacctccccacgggcgt ccgccagggaacaaagggggcgctataagaaacgcgcagccacatgccggctcttctgca ggaaggacaaaggctggggatgcaggcaacctcgccctgccgggaacgacaggcgggaga cctctacaactgccgaccaacactgtgggactgcccagtttcgaggaagtcctcgacagg aggagtagagggaagcggttaccttga >gi568815586f:93278323_93499376|GENSCAN_predicted_peptide_5|101_aa MNSETESVIKKQQPKSSGPDRSTAEFYQMCKEPVPILLILFPQQQQQKIEEEGLLPNSFY KNGIILIPKSDRDITEKQNYGPVSLINMDAKILNKIRANQI >gi568815586f:93278323_93499376|GENSCAN_predicted_CDS_5|306_bp atgaattctgaaactgaatcagtaataaaaaaacaacaaccaaaaagctctggaccagac agatccacagctgaattctaccagatgtgcaaagagccagtaccaatcctactgatacta ttcccccaacaacaacaacaaaaaatcgaggaggaaggactcctccccaactcattctac aaaaacggcatcatcctgatacccaaatctgacagagacataacagaaaaacaaaactat gggccagtatccctgataaacatggatgctaaaatcctcaacaaaatacgagcaaaccaa atctag