GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:45:26

Sequence gi568815582r:11180995_11381238 : 200244 bp : 50.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Intr +   1503   1644  142  0  1   88   57    53 0.516   2.56
1.02  Intr +   2521   2598   78  0  0  122   14    55 0.518   1.25
1.03  Term +   4763   4918  156  1  0   84   47   100 0.694   3.43
1.04  PlyA +   5736   5741    6                                1.05

2.03  PlyA -   6044   6039    6                                1.05
2.02  Term -  12829  12488  342  1  0   15   54   436 0.979  27.41
2.01  Init -  13467  13399   69  0  0   51   90    48 0.506   2.35
2.00  Prom -  16053  16014   40                               -5.26

3.00  Prom +  19413  19452   40                               -6.66
3.01  Init +  23442  23450    9  2  0   68   80    20 0.385  -1.44
3.02  Intr +  23537  23637  101  1  2   27   91   127 0.723   5.81
3.03  Intr +  24234  24358  125  0  2   45   71    59 0.704   0.13
3.04  Intr +  24976  25019   44  2  2  108   80    94 0.707   8.56
3.05  Intr +  25661  25776  116  1  2   84   76    14 0.368  -0.95
3.06  Intr +  26941  27021   81  1  0   60   46    94 0.251   1.15
3.07  Intr +  30683  30809  127  2  1   84   78   121 0.630  11.38
3.08  Intr +  43107  43170   64  2  1  120   80    22 0.309   2.89
3.09  Intr +  47329  47440  112  2  1   59   71    53 0.499   0.14
3.10  Intr +  48854  48944   91  2  1   67   32    83 0.345   0.60
3.11  Intr +  49043  49207  165  1  0   77   94    86 0.783   8.26
3.12  Intr +  57404  57429   26  1  2  107   81    13 0.212  -0.68
3.13  Term +  62007  62484  478  0  1   65   43   349 0.373  22.41
3.14  PlyA +  66902  66907    6                                1.05

4.05  PlyA -  69778  69773    6                                1.05
4.04  Term -  74534  73849  686  0  2  137   47  1171 0.927 111.70
4.03  Intr -  75243  75085  159  1  0   -4   99   107 0.080   2.56
4.02  Intr -  88226  87869  358  2  1   45   80   248 0.382  14.53
4.01  Init -  89050  88967   84  1  0   98   80     6 0.712   1.82
4.00  Prom -  89735  89696   40                               -9.16

5.12  PlyA -  89869  89864    6                                1.05
5.11  Term -  92570  92290  281  0  2   17   46   484 0.028  32.71
5.10  Intr -  95422  95106  317  0  2   89   30   193 0.085   9.31
5.09  Intr - 100307 100133  175  0  1   92   80   147 0.451  13.30
5.08  Intr - 111631 111507  125  0  2   50   20    93 0.003  -1.07
5.07  Intr - 126423 126370   54  2  0   86   76    34 0.016   0.09
5.06  Intr - 129636 129521  116  0  2   53   71   112 0.049   5.25
5.05  Intr - 129840 129731  110  1  2    8  111    59 0.010   0.20
5.04  Intr - 147602 147477  126  0  0   65  111    48 0.706   5.45
5.03  Intr - 150875 150740  136  2  1   86   98    21 0.166   3.04
5.02  Intr - 159899 159790  110  0  2   65  100    42 0.054   3.10
5.01  Init - 161386 161380    7  1  1   72   75     7 0.212  -1.52
5.00  Prom - 162024 161985   40                               -6.26

6.00  Prom + 162157 162196   40                               -4.26
6.01  Init + 164478 164772  295  2  1   91   95   401 0.988  36.45
6.02  Intr + 169648 169766  119  2  2   88   60    65 0.413   3.88
6.03  Term + 173002 173049   48  0  0  112   43    64 0.583   1.60
6.04  PlyA + 175525 175530    6                                1.05

7.05  PlyA - 177547 177542    6                               -1.75
7.04  Term - 177785 177585  201  2  0  102   42   120 0.671   6.19
7.03  Intr - 180840 180714  127  2  1   55   34   118 0.244   3.78
7.02  Intr - 191475 191218  258  2  0   56   96   101 0.348   4.28
7.01  Intr - 191673 191522  152  0  2   71   10   134 0.234   2.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  92601  92290  312  0  0   74   46   536 0.961  43.93

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582r:11180995_11381238|GENSCAN_predicted_peptide_1|125_aa
XWLPIQASGSASVDLDSQALPRRTSVASSVIVARLGRGKQEGSPPRTEAPGLLALLQPWV
YLPHLETSHLGSQAVSELRPSCCWYLSDERDLKQGGVGGSAGFGVADEVQVPAPMLPIPV
ALSTW
>gi568815582r:11180995_11381238|GENSCAN_predicted_CDS_1|378_bp
nngtggctgcccatccaagcctcaggctctgcatctgtggatttggacagccaagcccta
cctcgcaggacaagcgtagcgagcagcgtcattgttgccaggctgggccgagggaagcag
gaaggctccccaccgaggacagaggccccggggctgctggccctgctccagccctgggtc
tacctgccccacttagagaccagccacctgggctcccaggccgtatctgagctgagaccg
agctgctgctggtatctcagtgatgagcgagacctcaagcagggtggagtaggtggcagt
gcaggctttggagttgctgacgaggttcaagtccctgcaccaatgctccccatccctgtg
gccttgagtacatggtga
>gi568815582r:11180995_11381238|GENSCAN_predicted_peptide_2|136_aa
MHEFTRPKKSPAKALKHKIANVKSEIPSPKRRGGRRRKKEGGGGEEEEEEEEEEEEEEQD
EEEEGGKEGKDEEEEEEELEEEEKMIIMIMTDDEEKEEEEEDEEEGEERKRRKKKSYDAD
GPGTMAYTCNPSTLRD
>gi568815582r:11180995_11381238|GENSCAN_predicted_CDS_2|411_bp
atgcatgagttcaccaggcccaagaaatcgcctgcaaaggctctgaaacacaaaatagca
aatgtcaagagtgagatcccatctccaaaaagaagaggaggaagaagaaggaagaaggaa
ggaggaggaggagaagaggaggaagaggaggaagaggaggaagaagaagaagagcaggat
gaggaggaagagggaggcaaagaaggaaaagatgaggaggaggaagaggaggagttagag
gaggaggaaaagatgatcatcatgataatgactgatgatgaagagaaagaagaagaggaa
gaggatgaagaggaaggggaagagaggaagaggagaaagaagaaaagttatgatgcagat
gggccaggcaccatggcttacacctgtaatcccagcactttgagagactga
>gi568815582r:11180995_11381238|GENSCAN_predicted_peptide_3|512_aa
MLMKNDQKTVTNYNVGNDDAEEEEQGPMEKDRVHTTSPNGWTSLFIPILQMQKLSFREAE
ELTLGHTASKWHSQDPNPGDAAELLGAFMSDIQTCRSSSGKLLLAPRSQLSLPGQVPLLL
PTVLSHPWLLPRTLSGGSLGHMEFLAAVKKEDFTRLALCPARTICWSRAEAASFERDLYP
PLPYRCTTPIHHHTYDVIFSEVGKLRPREVSGMSEAAQRRVRDSLSRFTSGFLLLDPEDI
LNDASGYFGLCEPCLPDCARINASVIPIFQLRRKSSRSELAPIGTAEGAVVRPDPGPDAR
GPFLLASLQSLMCCLMATSYWESPGGLEQDQGTVKWTKAFLRRKQTPVTQDSGFSSFDLD
YDFQRDYDRMYSYPARVPPPPPIARAVVPLKHQRVSGNTSQRGKSGFNSKSGQRGSSKSG
KLKGDDLQAIKQELTQIKQKVDSLLEDPEKMEKKQSKQAVEMKNGKSEEKQSSSSRETHV
KIESEGGADDSAEERDLLDDEDNEDWGMTSWS
>gi568815582r:11180995_11381238|GENSCAN_predicted_CDS_3|1539_bp
atgctcatgaaaaatgaccagaaaacagttacaaattacaatgttggcaacgatgatgct
gaggaggaagagcaggggcccatggagaaggacagggtccataccaccagccctaacgga
tggacatcgctattcattcccattctacagatgcagaaactgagcttcagggaagctgag
gagctcaccctgggtcatacggccagcaagtggcacagtcaagatccgaacccaggggat
gctgctgagctgcttggagccttcatgtcggacatccagacatgccgcagcagcagtggc
aaacttctgcttgcccccagaagccagctgagtcttccaggtcaagtccctctcctgctc
cccacagtcctgtcacacccctggctgctccccaggactctttctggggggagcctgggc
cacatggagttcctggcagctgtgaagaaggaggacttcaccaggctggctctgtgccct
gcaagaaccatctgctggagccgagcagaggctgcctcctttgaacgggacctgtaccct
ccactcccctaccgctgcaccacaccaatccatcatcacacttatgatgtcatcttctca
gaggtggggaagctgaggcccagagaggtgagtggcatgtccgaagcagcccagcgcagg
gtgcgagattctttaagccggtttacatcaggattccttttacttgaccccgaagacatc
cttaatgatgcctctggttattttggtctttgtgagccgtgtctgcctgactgtgccagg
ataaatgccagtgttattcccatttttcagctgcgacgcaagagctcaagaagtgaactg
gctcccattggcacggcagaaggagccgtggtaaggcctgatcctgggccagatgccaga
gggcccttcctgttggcatctctgcaaagcctcatgtgttgtctgatggcaacatcgtac
tgggagagccctggaggcctggagcaggaccaagggacagtcaagtggacaaaggccttt
ctgaggaggaagcagactccagttacccaggactctgggttctcctcttttgacttggac
tatgactttcaacgggattatgataggatgtacagttacccagcacgtgtacctcctcct
cctcctattgctcgggctgtagtgcccttgaaacatcagcgtgtatcaggaaacacctca
caaaggggcaaaagtggcttcaattctaagagtggacagcggggatcttccaagtctgga
aagttgaaaggagatgaccttcaggccattaagcaggagttgacccagataaaacaaaaa
gtggattctctcctggaagacccggaaaaaatggaaaagaaacagagcaaacaagcagta
gagatgaagaatggtaagtcagaagagaagcagagcagcagctcacgtgagactcatgtg
aagatagagtctgaaggtggtgcagatgactctgctgaggagagggacctactggatgat
gaggataatgaagattgggggatgaccagctggagttga
>gi568815582r:11180995_11381238|GENSCAN_predicted_peptide_4|428_aa
MAYPRLDISANWDLNPALNFPTLDLAGELHSNSQPQSRTCTRHCQTFSQSCRQSHRGSRS
QSSSQSPASHRNPTGAHSSSGHQSQSPNTSPPPKRHKKTMNSHHSPMRPTILHCRCPKNR
KNLEGKLKKKKMAKRIQQVYKTKTRSSAGLKDWRRGGRRTERAAAVAAARLLAPEHAREP
PRSAPEPPAVPPAASRAPPPAHPRTLWPTPPAGPFCRMVAHNQVAADNAVSTAAEPRRRP
EPSSSSSSSPAAPARPRPCPAVPAPAPGDTHFRTFRSHADYRRITRASALLDACGFYWGP
LSVHGAHERLRAEPVGTFLVRDSRQRNCFFALSVKMASGPTSIRVHFQAGRFHLDGSRES
FDCLFELLEHYVAAPRRMLGAPLRQRRVRPLQELCRQRIVATVGRENLARIPLNPVLRDY
LSSFPFQI
>gi568815582r:11180995_11381238|GENSCAN_predicted_CDS_4|1287_bp
atggcctacccaagactggacatcagtgcaaactgggatttgaacccggctctgaatttc
ccgactttagatctggctggggagctccatagcaactctcagccccaaagccgcacctgc
acccgccattgccaaaccttcagccagagttgcagacagagccatcgtggcagccggagc
cagagctccagccagagcccggccagccaccgcaacccaactggagcccacagctcatcc
ggccaccagagccagagtcccaacactagtccaccaccaaagcgccacaaaaagactatg
aactcccaccactctcccatgcggcccaccatcctgcactgccgctgccccaagaacaga
aagaacttggaaggcaagctgaaaaagaaaaaaatggccaagaggatccagcaggtgtac
aaaaccaagacgcggagctcagccggtttaaaagactggcgcaggggcgggcgccgaaca
gagcgagctgcggccgtggcagctgcacggctcctggccccggagcatgcgcgagagccg
ccccggagcgccccggagccccccgccgtcccgcccgcggcgtcccgcgccccgccgcca
gcgcacccccggacgctatggcccacccctccggctggccccttctgtaggatggtagca
cacaaccaggtggcagccgacaatgcagtctccacagcagcagagccccgacggcggcca
gaaccttcctcctcttcctcctcctcgcccgcggcccccgcgcgcccgcggccgtgcccc
gcggtcccggccccggcccccggcgacacgcacttccgcacattccgttcgcacgccgat
taccggcgcatcacgcgcgccagcgcgctcctggacgcctgcggattctactgggggccc
ctgagcgtgcacggggcgcacgagcggctgcgcgccgagcccgtgggcaccttcctggtg
cgcgacagccgccagcggaactgctttttcgcccttagcgtgaagatggcctcgggaccc
acgagcatccgcgtgcactttcaggccggccgctttcacctggatggcagccgcgagagc
ttcgactgcctcttcgagctgctggagcactacgtggcggcgccgcgccgcatgctgggg
gccccgctgcgccagcgccgcgtgcggccgctgcaggagctgtgccgccagcgcatcgtg
gccaccgtgggccgcgagaacctggctcgcatccccctcaaccccgtcctccgcgactac
ctgagctccttccccttccagatttga
>gi568815582r:11180995_11381238|GENSCAN_predicted_peptide_5|518_aa
MASLKQPPVPASPSSSFPVRSKILGRRFPATCTSEDKGKRGPLSTARLDNEFPRSPTSGP
KKGVQGNPHTPPLSTALANGPHAAGICEQRGLAAKGLEDGPSAKVSAPKAEGLGELEEAG
FQEGWPGAAHTQHTHQKVQSENGKTKSGEDGEQLEPPLTAAVNDMSNRNVVVRSPKDTFK
HTHISAIHDSSKLGTAELVDISNHYSTFCFSESDFSGYLLSPVTATLPCPWVTSESTSDV
IDTAAAYPLPSALCPPGFLDPGWLAQPRWCPALSIQAKPILHHGQVQMLSQPEPEQILPP
ETKKSQTKEAELPDTEESHEAHCSLSPGATRSPNTMVRYRVRSLSERSHEVYRQQLHGQE
QGHHGQEEQGLSPEHVEVYERTHGQSHYRRRHCSRRRLHRIHRRQHRSCRRRKRRSCRHR
RRHRRGQSPGHSPGHSTGHGRGHESSMKKLMACVSQDNFSLSSAGEEEEEEEEEGEEEEK
EELPVQGKLLLLEPERQEEGHKDNAEAQQSPEPKRTPS
>gi568815582r:11180995_11381238|GENSCAN_predicted_CDS_5|1557_bp
atggctagcctgaagcaaccccctgtgccagcctccccgagctccagcttcccagtgaga
tcaaagatccttggaaggcggtttcctgccacctgcaccagtgaggacaaaggtaagaga
ggacctctgtctacagccagacttgataatgaattccccaggagtccaacctctggccct
aaaaagggggttcaaggaaatcctcataccccacctctttcaacagcactggccaatggc
ccccatgctgcagggatctgtgagcagagaggcctggctgcaaagggtctggaggacggc
ccttctgctaaggtgtccgctcccaaggcagagggcttgggggagctggaggaggctggg
ttccaggaggggtggccaggagctgcccacacacaacacacccaccagaaggtgcaaagt
gaaaatggcaaaaccaagagtggtgaggatggggagcaactggaacccccactcactgct
gctgtgaacgatatgtccaacagaaatgtggtggtacgttctccaaaagacacgttcaag
cacacacacatcagcgctattcatgacagctccaaactgggaacagcagagcttgttgac
atcagcaaccactattccactttctgtttctctgaatccgacttctccgggtacctcctg
tcgccagtaactgccacattgccatgtccatgggtgacctctgagtcgacctctgatgtg
attgacacagctgctgcttaccccctccccagtgccctttgtccaccgggcttcctggac
ccaggttggctggctcagccaaggtggtgccctgctctgagcattcaggccaagcccatc
ctgcaccatggccaggtacagatgctgtcgcagccagagccggagcagatattaccgcca
gagacaaagaagtcgcagacgaaggaggcggagctgccagacacggaggagagccatgag
gcccactgcagcctcagcccaggagccaccagatctcccaacaccatggtccgataccgc
gtgaggagcctgagcgaacgctcgcacgaggtgtacaggcagcagttgcatgggcaagag
caaggacaccacggccaagaggagcaagggctgagcccggagcacgtcgaggtctacgag
aggacccatggccagtctcactataggcgcagacactgctctcgaaggaggctgcaccgg
atccacaggcggcagcatcgctcctgcagaaggcgcaaaagacgctcctgcaggcaccgg
aggaggcatcgcagaggccagagcccaggccacagcccaggccacagcacgggccatggc
cggggccacgaatcctccatgaaaaagctcatggcctgtgtgagtcaggataacttctcc
ttgtcatcagcgggcgaggaagaggaggaagaggaggaggagggggaagaggaggagaaa
gaagagctgccggtgcagggcaagctgctgctgctggagcctgagcggcaggaggagggc
cacaaggacaacgccgaggcccagcagagccccgagcccaagcggacaccctcctga
>gi568815582r:11180995_11381238|GENSCAN_predicted_peptide_6|153_aa
MAAAADSFSGGPAGVRLPRSPPLKVLAEQLRRDAEGGPGAWRLSRAAAGRGPLDLAAVWM
QGRVVMADRGEARLRDPSGDFSVRGLERVPRGRPCLVPGKYVMVMGVVQACSPEPCLQAV
KMTDLSDNPIHESMWELEGPIRDTTFHFVIMSP
>gi568815582r:11180995_11381238|GENSCAN_predicted_CDS_6|462_bp
atggcggcggctgcggactcgttctcaggcggccccgcgggggtgcggcttccgaggtcg
ccgccactcaaggtgctggcggagcagctgcggcgcgacgcggagggcggcccgggcgcg
tggcggctgtcacgggcggcggcgggccgcgggccgctggacctggcggccgtgtggatg
cagggcagggtagtgatggcggaccgcggcgaggctcggctgagggacccgagcggggac
ttctcggtccgcggcctggagcgggtgccgcgcgggcggccctgtctagtcccaggaaag
tatgtgatggtgatgggagtggttcaggcctgcagccctgagccctgcctgcaggctgtg
aagatgacagacctttctgataatcccatccatgaaagtatgtgggaactggagggtccc
atccgggacaccacattccatttcgtcatcatgtctccctag
>gi568815582r:11180995_11381238|GENSCAN_predicted_peptide_7|245_aa
LPDNDKIHGWDRITSMECRETQNRTCCKRTEAGLIATGLKILPGDMKLSHWRQCGQQVPA
LQRKLHLAPVHAQEIYSLIQQPFGKPLLQNADCVVGDTKPGPQLLGTGFPCLIQGSFQAG
VSLKGSDGPKPLNQKPRTLVGLWPRQIICPWQHAFPGNSNEEEKVRTIDFRVSENNFVKK
LGACMDPLVLVLRAPGSPPEEAFTSSQPVPRAALTPPVAQLSLGQEAVDLWLPSLVDDET
HRRAF
>gi568815582r:11180995_11381238|GENSCAN_predicted_CDS_7|738_bp
ctccctgacaatgacaagattcatggctgggacagaataaccagcatggaatgccgggaa
actcagaaccgcacgtgctgcaaacgcacggaagcagggctcattgccactgggctcaaa
atacttccgggggacatgaaactctcccactggagacagtgcggccagcaagtaccagct
ctgcagagaaaactccacctggcccctgttcatgcccaagagatttattcacttattcaa
cagccatttgggaagccactgcttcagaacgctgactgcgtggtgggggacacgaagcca
ggtcctcagctgcttggtactggctttccctgcctgatccaggggtcttttcaggcagga
gtttccttgaagggaagtgatggccccaagcctcttaatcagaagcccaggacactggtg
gggctgtggccccgtcagatcatctgtccctggcaacacgcttttccagggaactccaat
gaggaagagaaagtgcggaccatcgacttccgtgtctctgaaaacaactttgtaaaaaag
cttggggcatgcatggaccccttggtcctggtactcagggctccagggtccccaccagag
gaagccttcaccagttcacaaccagtgccacgggcagctctgacccctcctgtggcccag
ctatctctgggccaggaagctgtagatctgtggctgccaagcctggttgatgacgagacc
caccggagagctttctaa