GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:23:12 Sequence gi568815582r:11154846_11355478 : 200633 bp : 51.19% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 661 538 124 1 1 88 65 79 0.231 6.26 1.02 Intr - 880 797 84 1 0 76 73 70 0.943 4.81 1.01 Init - 2872 2864 9 1 0 99 94 2 0.927 2.36 1.00 Prom - 4448 4409 40 -1.11 2.00 Prom + 6787 6826 40 -3.21 2.01 Init + 7416 7467 52 2 1 97 97 2 0.476 1.32 2.02 Intr + 11543 11707 165 0 0 93 72 129 0.719 12.25 2.03 Term + 23490 23845 356 1 2 68 47 350 0.001 23.91 2.04 PlyA + 24393 24398 6 -3.64 3.03 PlyA - 24480 24475 6 1.05 3.02 Term - 25618 25545 74 2 2 142 49 21 0.570 1.96 3.01 Init - 26811 26631 181 0 1 80 40 163 0.523 8.03 3.00 Prom - 33235 33196 40 -2.51 4.03 PlyA - 34921 34916 6 -0.45 4.02 Term - 38978 38637 342 2 0 15 54 436 0.977 27.77 4.01 Init - 39616 39548 69 1 0 51 90 48 0.484 2.30 4.00 Prom - 42202 42163 40 -2.51 5.00 Prom + 45562 45601 40 -3.91 5.01 Init + 49591 49599 9 0 0 68 80 20 0.530 -1.51 5.02 Intr + 49686 49786 101 2 2 27 91 127 0.757 6.31 5.03 Intr + 50383 50507 125 1 2 45 71 59 0.741 0.63 5.04 Intr + 51125 51168 44 0 2 108 80 94 0.558 9.05 5.05 Intr + 51810 51925 116 2 2 84 76 14 0.341 -0.45 5.06 Intr + 53090 53170 81 2 0 60 46 94 0.153 1.65 5.07 Intr + 56832 56958 127 0 1 84 78 121 0.628 11.89 5.08 Intr + 69256 69319 64 0 1 120 80 22 0.206 3.38 5.09 Intr + 73478 73589 112 0 1 59 71 53 0.509 0.64 5.10 Intr + 75003 75093 91 0 1 67 32 83 0.367 1.10 5.11 Intr + 75192 75356 165 2 0 77 94 86 0.779 8.77 5.12 Intr + 83553 83578 26 2 2 107 81 13 0.035 -0.19 5.13 Term + 88156 88633 478 1 1 65 43 349 0.238 22.80 5.14 PlyA + 93051 93056 6 1.05 6.05 PlyA - 95927 95922 6 1.05 6.04 Term - 100683 99998 686 1 2 137 47 1171 0.882 112.13 6.03 Intr - 101392 101234 159 2 0 -4 99 107 0.031 3.07 6.02 Intr - 114375 114018 358 0 1 45 80 248 0.338 15.09 6.01 Init - 115199 115116 84 2 0 98 80 6 0.758 1.78 6.00 Prom - 115884 115845 40 -6.40 7.02 PlyA - 116018 116013 6 1.05 7.01 Sngl - 118750 118439 312 1 0 74 46 536 0.994 44.25 7.00 Prom - 119567 119528 40 -5.01 8.02 PlyA - 120812 120807 6 1.05 8.01 Sngl - 121525 121217 309 1 0 88 47 166 0.698 8.73 8.00 Prom - 121669 121630 40 -4.21 9.05 PlyA - 121876 121871 6 1.05 9.04 Term - 122838 122698 141 0 0 108 37 58 0.434 0.94 9.03 Intr - 126190 126175 16 0 1 105 92 15 0.526 -0.37 9.02 Intr - 126456 126282 175 1 1 92 80 147 0.320 13.81 9.01 Init - 132338 132239 100 2 1 90 14 96 0.398 2.78 9.00 Prom - 134073 134034 40 -2.71 10.00 Prom + 144430 144469 40 -2.01 10.01 Init + 151017 151038 22 2 1 74 78 35 0.102 0.89 10.02 Intr + 155143 155261 119 2 2 128 49 14 0.302 2.19 10.03 Intr + 158041 158092 52 0 1 101 110 2 0.274 2.67 10.04 Intr + 161607 161624 18 1 0 117 93 12 0.344 1.66 10.05 Intr + 162120 162289 170 1 2 90 51 88 0.724 5.48 10.06 Term + 166489 166554 66 0 0 77 42 60 0.188 -1.47 10.07 PlyA + 166772 166777 6 -0.45 11.06 PlyA - 167659 167654 6 -0.45 11.05 Term - 168472 168411 62 2 2 121 32 20 0.291 -2.04 11.04 Intr - 173751 173626 126 1 0 65 111 48 0.775 5.96 11.03 Intr - 177024 176889 136 0 1 86 98 21 0.096 3.55 11.02 Intr - 186048 185939 110 1 2 65 100 42 0.048 3.60 11.01 Init - 187535 187529 7 2 1 72 75 7 0.335 -1.59 11.00 Prom - 188173 188134 40 -3.51 12.00 Prom + 188306 188345 40 -1.51 12.01 Init + 190627 190921 295 0 1 91 95 401 0.953 36.45 12.02 Intr + 195797 195915 119 0 2 88 60 65 0.259 4.39 12.03 Term + 199151 199198 48 1 0 112 43 64 0.465 1.89 12.04 PlyA + 199438 199443 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 23498 23845 348 1 0 75 47 350 0.985 25.97 S.002 Init + 93461 93519 59 1 2 90 94 66 0.815 8.13 S.003 Intr - 101405 101234 172 2 1 32 99 103 0.808 5.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_1|73_aa MGQIGARPDRWLMTFYDTSSTTRTEDTIWNYTCGTWLTGRLDTSLLLSAGEKPEEQQLGF RLVTFTLATGYGX >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_1|219_bp atgggccagattggggctaggccagaccgatggctgatgacgttctatgacacatcatct acaacaagaactgaagatactatctggaactatacctgtgggacatggctaaccggacgg ctggacacgagccttctgctatctgcaggggagaaacctgaagagcagcaacttggcttt aggcttgtcacttttactctggctacaggatacggtgnn >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_2|190_aa MGIVSPHCWSGLPGPALGFAVAQCINQHSSPSLSSQSPPSASGSPSGSGSTSHCDSGGTS SSSTPSTAQSPADAPMSPELPKPHLPDQLVIVNETEADSKPSKNVARSAAVETASLSPSL VPARQPTISLLCEDTADTLSVESLTLVPPVDPHSLRSLTGMPPLSTPAAACTEPVGEEAA CAEPVGTAED >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_2|573_bp atgggcatagtttccccccactgttggagtggcctgcctggccctgctcttggcttcgcc gtggcccagtgcataaaccagcacagctccccgtccctgtcctcacagtcgccaccctcc gccagcgggagccccagcggcagcgggagcaccagccactgcgactctggaggcaccagc tcgtcctccaccccctccacagcccagagtccagcagatgcccccatgagtccagaactg cctaagcctcaccttcctgaccagttggtaatcgtcaacgaaacggaagcagactctaag cccagcaagaacgtggccaggagcgcagccgtggagacagccagcctgtcccccagcctc gtccctgcccggcagcccaccatttccctgctctgcgaggacacggctgacacgctgagc gtcgaatcgctgacccttgtccccccagttgacccccacagcctccgcagcctcaccggc atgcccccgctgtccacgccggctgccgcctgcacagagcccgtgggcgaagaggctgca tgtgctgagcctgtgggcaccgctgaggactga >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_3|84_aa MVKAGPSCLGERLSWPHLTCWDRYLARPERDEHTSTTIFKAKVWEFLRPALLPDLANPAG GILQMATVVVVCPSFGFLPMWPPF >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_3|255_bp atggtgaaggctggcccttcctgcctgggggagcgcctcagctggccccatctgacctgc tgggacaggtaccttgcacgtccagaaagggacgagcacacctccacaacaatattcaaa gccaaggtttgggagttcctgcggccagccctgctccctgacttggcaaacccagccgga gggattctgcagatggccacggtggtcgtggtttgtccttcctttgggtttctgcccatg tggccccctttctga >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_4|136_aa MHEFTRPKKSPAKALKHKIANVKSEIPSPKRRGGRRRKKEGGGGEEEEEEEEEEEEEEQD EEEEGGKEGKDEEEEEEELEEEEKMIIMIMTDDEEKEEEEEDEEEGEERKRRKKKSYDAD GPGTMAYTCNPSTLRD >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_4|411_bp atgcatgagttcaccaggcccaagaaatcgcctgcaaaggctctgaaacacaaaatagca aatgtcaagagtgagatcccatctccaaaaagaagaggaggaagaagaaggaagaaggaa ggaggaggaggagaagaggaggaagaggaggaagaggaggaagaagaagaagagcaggat gaggaggaagagggaggcaaagaaggaaaagatgaggaggaggaagaggaggagttagag gaggaggaaaagatgatcatcatgataatgactgatgatgaagagaaagaagaagaggaa gaggatgaagaggaaggggaagagaggaagaggagaaagaagaaaagttatgatgcagat gggccaggcaccatggcttacacctgtaatcccagcactttgagagactga >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_5|512_aa MLMKNDQKTVTNYNVGNDDAEEEEQGPMEKDRVHTTSPNGWTSLFIPILQMQKLSFREAE ELTLGHTASKWHSQDPNPGDAAELLGAFMSDIQTCRSSSGKLLLAPRSQLSLPGQVPLLL PTVLSHPWLLPRTLSGGSLGHMEFLAAVKKEDFTRLALCPARTICWSRAEAASFERDLYP PLPYRCTTPIHHHTYDVIFSEVGKLRPREVSGMSEAAQRRVRDSLSRFTSGFLLLDPEDI LNDASGYFGLCEPCLPDCARINASVIPIFQLRRKSSRSELAPIGTAEGAVVRPDPGPDAR GPFLLASLQSLMCCLMATSYWESPGGLEQDQGTVKWTKAFLRRKQTPVTQDSGFSSFDLD YDFQRDYDRMYSYPARVPPPPPIARAVVPLKHQRVSGNTSQRGKSGFNSKSGQRGSSKSG KLKGDDLQAIKQELTQIKQKVDSLLEDPEKMEKKQSKQAVEMKNGKSEEKQSSSSRETHV KIESEGGADDSAEERDLLDDEDNEDWGMTSWS >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_5|1539_bp atgctcatgaaaaatgaccagaaaacagttacaaattacaatgttggcaacgatgatgct gaggaggaagagcaggggcccatggagaaggacagggtccataccaccagccctaacgga tggacatcgctattcattcccattctacagatgcagaaactgagcttcagggaagctgag gagctcaccctgggtcatacggccagcaagtggcacagtcaagatccgaacccaggggat gctgctgagctgcttggagccttcatgtcggacatccagacatgccgcagcagcagtggc aaacttctgcttgcccccagaagccagctgagtcttccaggtcaagtccctctcctgctc cccacagtcctgtcacacccctggctgctccccaggactctttctggggggagcctgggc cacatggagttcctggcagctgtgaagaaggaggacttcaccaggctggctctgtgccct gcaagaaccatctgctggagccgagcagaggctgcctcctttgaacgggacctgtaccct ccactcccctaccgctgcaccacaccaatccatcatcacacttatgatgtcatcttctca gaggtggggaagctgaggcccagagaggtgagtggcatgtccgaagcagcccagcgcagg gtgcgagattctttaagccggtttacatcaggattccttttacttgaccccgaagacatc cttaatgatgcctctggttattttggtctttgtgagccgtgtctgcctgactgtgccagg ataaatgccagtgttattcccatttttcagctgcgacgcaagagctcaagaagtgaactg gctcccattggcacggcagaaggagccgtggtaaggcctgatcctgggccagatgccaga gggcccttcctgttggcatctctgcaaagcctcatgtgttgtctgatggcaacatcgtac tgggagagccctggaggcctggagcaggaccaagggacagtcaagtggacaaaggccttt ctgaggaggaagcagactccagttacccaggactctgggttctcctcttttgacttggac tatgactttcaacgggattatgataggatgtacagttacccagcacgtgtacctcctcct cctcctattgctcgggctgtagtgcccttgaaacatcagcgtgtatcaggaaacacctca caaaggggcaaaagtggcttcaattctaagagtggacagcggggatcttccaagtctgga aagttgaaaggagatgaccttcaggccattaagcaggagttgacccagataaaacaaaaa gtggattctctcctggaagacccggaaaaaatggaaaagaaacagagcaaacaagcagta gagatgaagaatggtaagtcagaagagaagcagagcagcagctcacgtgagactcatgtg aagatagagtctgaaggtggtgcagatgactctgctgaggagagggacctactggatgat gaggataatgaagattgggggatgaccagctggagttga >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_6|428_aa MAYPRLDISANWDLNPALNFPTLDLAGELHSNSQPQSRTCTRHCQTFSQSCRQSHRGSRS QSSSQSPASHRNPTGAHSSSGHQSQSPNTSPPPKRHKKTMNSHHSPMRPTILHCRCPKNR KNLEGKLKKKKMAKRIQQVYKTKTRSSAGLKDWRRGGRRTERAAAVAAARLLAPEHAREP PRSAPEPPAVPPAASRAPPPAHPRTLWPTPPAGPFCRMVAHNQVAADNAVSTAAEPRRRP EPSSSSSSSPAAPARPRPCPAVPAPAPGDTHFRTFRSHADYRRITRASALLDACGFYWGP LSVHGAHERLRAEPVGTFLVRDSRQRNCFFALSVKMASGPTSIRVHFQAGRFHLDGSRES FDCLFELLEHYVAAPRRMLGAPLRQRRVRPLQELCRQRIVATVGRENLARIPLNPVLRDY LSSFPFQI >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_6|1287_bp atggcctacccaagactggacatcagtgcaaactgggatttgaacccggctctgaatttc ccgactttagatctggctggggagctccatagcaactctcagccccaaagccgcacctgc acccgccattgccaaaccttcagccagagttgcagacagagccatcgtggcagccggagc cagagctccagccagagcccggccagccaccgcaacccaactggagcccacagctcatcc ggccaccagagccagagtcccaacactagtccaccaccaaagcgccacaaaaagactatg aactcccaccactctcccatgcggcccaccatcctgcactgccgctgccccaagaacaga aagaacttggaaggcaagctgaaaaagaaaaaaatggccaagaggatccagcaggtgtac aaaaccaagacgcggagctcagccggtttaaaagactggcgcaggggcgggcgccgaaca gagcgagctgcggccgtggcagctgcacggctcctggccccggagcatgcgcgagagccg ccccggagcgccccggagccccccgccgtcccgcccgcggcgtcccgcgccccgccgcca gcgcacccccggacgctatggcccacccctccggctggccccttctgtaggatggtagca cacaaccaggtggcagccgacaatgcagtctccacagcagcagagccccgacggcggcca gaaccttcctcctcttcctcctcctcgcccgcggcccccgcgcgcccgcggccgtgcccc gcggtcccggccccggcccccggcgacacgcacttccgcacattccgttcgcacgccgat taccggcgcatcacgcgcgccagcgcgctcctggacgcctgcggattctactgggggccc ctgagcgtgcacggggcgcacgagcggctgcgcgccgagcccgtgggcaccttcctggtg cgcgacagccgccagcggaactgctttttcgcccttagcgtgaagatggcctcgggaccc acgagcatccgcgtgcactttcaggccggccgctttcacctggatggcagccgcgagagc ttcgactgcctcttcgagctgctggagcactacgtggcggcgccgcgccgcatgctgggg gccccgctgcgccagcgccgcgtgcggccgctgcaggagctgtgccgccagcgcatcgtg gccaccgtgggccgcgagaacctggctcgcatccccctcaaccccgtcctccgcgactac ctgagctccttccccttccagatttga >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_7|103_aa MGSRCAKLNTGQSPGHSPGHSTGHGRGHESSMKKLMACVSQDNFSLSSAGEEEEEEEEEG EEEEKEELPVQGKLLLLEPERQEEGHKDNAEAQQSPEPKRTPS >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_7|312_bp atgggttcccgctgtgccaagctcaacacaggccagagcccaggccacagcccaggccac agcacgggccatggccggggccacgaatcctccatgaaaaagctcatggcctgtgtgagt caggataacttctccttgtcatcagcgggcgaggaagaggaggaagaggaggaggagggg gaagaggaggagaaagaagagctgccggtgcagggcaagctgctgctgctggagcctgag cggcaggaggagggccacaaggacaacgccgaggcccagcagagccccgagcccaagcgg acaccctcctga >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_8|102_aa MVRYRVRSLSERSHEVYRQQLHGQEQGHHGQEEQGLSPEHVEVYERTHGQSHYRRRHCSR RRLHRIHRRQHRSCRRRKRRSCRHRRRHRRGLPAPPPCPACP >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_8|309_bp atggtccgataccgcgtgaggagcctgagcgaacgctcgcacgaggtgtacaggcagcag ttgcatgggcaagagcaaggacaccacggccaagaggagcaagggctgagcccggagcac gtcgaggtctacgagaggacccatggccagtctcactataggcgcagacactgctctcga aggaggctgcaccggatccacaggcggcagcatcgctcctgcagaaggcgcaaaagacgc tcctgcaggcaccggaggaggcatcgcagaggtctgcctgcgcccccgccttgccctgca tgtccctga >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_9|143_aa MIKDRNVTAQRSLKGDGILEQENIEEKLKQSEESWLAQPRWCPALSIQAKPILHHGQVQM LSQPEPEQILPPETKKSQTKEAELPDTEESHEVLPPQDPRDCTGPTQIIWDDLCIFKSLN LCICHIPTFSLPSMKGKLVLQFL >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_9|432_bp atgatcaaagacaggaatgtcacagcccagaggagcctaaagggagacgggatcctggaa caggaaaacattgaggaaaaactaaagcaatctgaggaaagttggctggctcagccaagg tggtgccctgctctgagcattcaggccaagcccatcctgcaccatggccaggtacagatg ctgtcgcagccagagccggagcagatattaccgccagagacaaagaagtcgcagacgaag gaggcggagctgccagacacggaggagagccatgaggtgctgccgccccaggaccctcgt gactgcactgggccaacccagatcatctgggatgatctctgcatcttcaagtccttaaat ttgtgtatctgccacattcccacattctccctgccctccatgaaagggaaactggttttg cagtttctgtga >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_10|148_aa MSERGKHAPWGSDCITRVCFCDQTSYKQNHVVYASLCLASSSEHDVQDLGTGGARHLMPC PRLAGPSISAETKLWQLERDGVGRVGYSKIGKQHRPQMPAEGERTSKCDEAPVCFAAITV SKSHNHTASGPQTSVGTSATCSTHLELQ >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_10|447_bp atgagtgaacgtggaaaacacgctccctggggatcagactgtatcaccagagtctgtttc tgtgaccaaacgtcctataaacagaatcacgtggtgtatgcttctctgtgcctggcttcc tcctctgaacatgatgtccaggatctgggcactggaggcgccagacatcttatgccctgt ccaaggctagcagggccatccatcagtgctgaaacgaaactttggcaactggagagggat ggagtggggcgggtggggtacagcaagatcgggaaacagcaccgtcctcaaatgcctgca gaaggtgagagaaccagcaagtgtgatgaagcacctgtgtgcttcgcagccattaccgtg tctaaatctcacaaccacacagccagtggtcctcaaacatcagtaggcaccagtgccacc tgcagcacccacctggagctccagtag >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_11|146_aa MASLKQPPVPASPSSSFPVRSKILGRRFPATCTSEDKGKRGPLSTARLDNEFPRSPTSGP KKGVQGNPHTPPLSTALANGPHAAGICEQRGLAAKGLEDGPSAKVSAPKAEGLGELEEAG FQEGWPGTPHMGFGVHPCSRFQILRS >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_11|441_bp atggctagcctgaagcaaccccctgtgccagcctccccgagctccagcttcccagtgaga tcaaagatccttggaaggcggtttcctgccacctgcaccagtgaggacaaaggtaagaga ggacctctgtctacagccagacttgataatgaattccccaggagtccaacctctggccct aaaaagggggttcaaggaaatcctcataccccacctctttcaacagcactggccaatggc ccccatgctgcagggatctgtgagcagagaggcctggctgcaaagggtctggaggacggc ccttctgctaaggtgtccgctcccaaggcagagggcttgggggagctggaggaggctggg ttccaggaggggtggccagggacacctcacatgggatttggggtccacccatgttctcgt ttccagatcctcagatcctaa >gi568815582r:11154846_11355478|GENSCAN_predicted_peptide_12|153_aa MAAAADSFSGGPAGVRLPRSPPLKVLAEQLRRDAEGGPGAWRLSRAAAGRGPLDLAAVWM QGRVVMADRGEARLRDPSGDFSVRGLERVPRGRPCLVPGKYVMVMGVVQACSPEPCLQAV KMTDLSDNPIHESMWELEGPIRDTTFHFVIMSP >gi568815582r:11154846_11355478|GENSCAN_predicted_CDS_12|462_bp atggcggcggctgcggactcgttctcaggcggccccgcgggggtgcggcttccgaggtcg ccgccactcaaggtgctggcggagcagctgcggcgcgacgcggagggcggcccgggcgcg tggcggctgtcacgggcggcggcgggccgcgggccgctggacctggcggccgtgtggatg cagggcagggtagtgatggcggaccgcggcgaggctcggctgagggacccgagcggggac ttctcggtccgcggcctggagcgggtgccgcgcgggcggccctgtctagtcccaggaaag tatgtgatggtgatgggagtggttcaggcctgcagccctgagccctgcctgcaggctgtg aagatgacagacctttctgataatcccatccatgaaagtatgtgggaactggagggtccc atccgggacaccacattccatttcgtcatcatgtctccctag