GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:32:55

Sequence gi568815590r:16010088_16277988 : 267901 bp : 36.41% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   5212   5069  144  1  0   38   30   147 0.019   3.46
 1.04 Intr -  13283  13016  268  1  1  109   86    79 0.928   6.41
 1.03 Intr -  15754  15604  151  2  1   65   42   134 0.952   4.80
 1.02 Intr -  18307  18144  164  0  2   44   87   123 0.998   6.50
 1.01 Init -  20931  19805 1127  0  2   60   53   343 0.730  21.70
 1.00 Prom -  21468  21429   40                             -10.65

 2.00 Prom +  23191  23230   40                             -14.71
 2.01 Sngl +  23444  23887  444  1  0   88   36   334 0.904  24.19
 2.02 PlyA +  24321  24326    6                               1.05

 3.05 PlyA -  25666  25661    6                               1.05
 3.04 Term -  31217  31029  189  2  0   59   36   141 0.751   2.47
 3.03 Intr -  32820  32707  114  0  0   70   86    55 0.711   3.22
 3.02 Intr -  38625  38379  247  2  1   32    9   217 0.066   4.74
 3.01 Init -  52992  52934   59  0  2   90   48    62 0.277   3.23
 3.00 Prom -  61991  61952   40                              -4.65

 4.00 Prom +  72055  72094   40                              -4.05
 4.01 Init +  79751  79851  101  1  2   89   80    57 0.757   4.78
 4.02 Intr +  80968  81139  172  1  1   72   52   154 0.908   9.22
 4.03 Intr + 104281 104393  113  0  2  131   66    52 0.017   5.56
 4.04 Term + 124813 125119  307  1  1   15   44   244 0.123   6.30
 4.05 PlyA + 126300 126305    6                               1.05

 5.10 PlyA - 127742 127737    6                               1.05
 5.09 Term - 130658 130585   74  0  2   86   40    53 0.045  -2.61
 5.08 Intr - 133524 133471   54  1  0   78   97    45 0.052   2.43
 5.07 Intr - 140224 140144   81  2  0   67   67   116 0.074   6.09
 5.06 Intr - 145057 144977   81  2  0   41   52   108 0.048   1.19
 5.05 Intr - 154164 153978  187  0  1   72   93   188 0.972  16.04
 5.04 Intr - 158783 158371  413  0  2   39   94   331 0.049  21.78
 5.03 Intr - 165213 165100  114  1  0   57   97   121 0.878   9.40
 5.02 Intr - 167905 167799  107  0  2   74   91    66 0.688   4.44
 5.01 Init - 168325 168177  149  1  2   83   37    55 0.465  -0.49
 5.00 Prom - 183761 183722   40                              -4.65

 6.00 Prom + 192412 192451   40                              -2.75
 6.01 Sngl + 195078 195467  390  2  0   81   54   448 0.823  36.67
 6.02 PlyA + 195491 195496    6                               1.05

 7.07 PlyA - 197556 197551    6                               1.05
 7.06 Term - 203185 203080  106  0  1   54   48   105 0.487   0.00
 7.05 Intr - 205102 205072   31  2  1  125   86    29 0.892   2.67
 7.04 Intr - 210532 210450   83  0  2   31   55    91 0.030  -1.54
 7.03 Intr - 212613 212177  437  0  2   23   87   248 0.109   9.65
 7.02 Intr - 239888 239772  117  2  0   58   69    71 0.092   1.94
 7.01 Init - 250350 250315   36  0  0   64  119    22 0.201   2.96
 7.00 Prom - 250429 250390   40                              -3.55

 8.00 Prom + 263312 263351   40                              -4.05
 8.01 Sngl + 265611 265898  288  2  0   71   44   157 0.234   4.94
 8.02 PlyA + 266751 266756    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 113885 113762  124  2  1   64   40    77 0.851   0.88
S.002 Init - 205566 205506   61  0  1   83   47    74 0.867   4.26

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:16010088_16277988|GENSCAN_predicted_peptide_1|618_aa
MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV
KMAILPKVIYTFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGIMLP
DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMLHIYNYLIFDKPEKNKPWGKDSLFN
KWCWENRLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKD
FMSKTPKAMARKAKIDKWDLIKLKSFCTAKDTTIRVNRQPTEWEKIFATYSSDKGLISRI
YNELKQIYKKETNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYH
LTPVRMAIIKKSGNNSPANASNNTGSFLQWAKFEYRRCQGKATELSVQYSQATKQASPSR
TQPFRGTVWADSAVGEIHTNLGPEVIIESRTLGLNSEFGSLTSDGKDGSAEEWGSKDSGM
ILFVFLHCCKRSEEAREFRSRLIIPGESEREGEKHNSLVSPYQELEKSCLESHPKDCCSH
IGYSPKPKQKTAKKLGQAPPTLIIIYPKLWVQQDVIGLLGNHREQLSLNVRGSNYVCMRF
KKHYKEKVAFEIRVGICQ

>gi568815590r:16010088_16277988|GENSCAN_predicted_CDS_1|1854_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acgagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg
aaaatggccatactgcccaaagtaatttatacattcaatgccatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacct
gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac
agagatatagaccaatggaacagaacagagccctcagaaataatgctgcatatctacaac
tatctgatctttgacaaacctgagaaaaacaagccatggggaaaggattccctatttaat
aaatggtgctgggaaaacaggctagccatatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaaacc
ataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaagaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagatactaccatcagagtgaacaggcaacct
acagaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatc
tacaatgaactcaaacaaatttacaagaaggaaacaaacaaccccatcaaaaagtgggcg
aaggatatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaa
aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccat
ctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagtccagccaatgcc
agcaacaatactggaagctttctccagtgggccaagtttgaatacaggaggtgccagggc
aaagctacagagctcagtgttcagtattcacaagccaccaagcaggcatccccatccaga
acccagcccttcagaggaactgtttgggcagactctgctgtgggggaaattcatacaaat
ttggggccagaagttatcatagaaagtagaactctgggattgaattctgaatttggatca
ctgacatcagatggcaaagatggcagtgctgaagagtggggtagcaaggattctggcatg
atattgtttgtcttccttcattgttgcaagaggtctgaagaagctcgtgaatttagatct
agactcattatacctggggaaagtgagagagagggggaaaagcacaactctttagtgtct
ccttatcaagagctagaaaaatcatgtttggaatcccatccaaaagactgctgctctcat
ataggttactcgcccaaaccaaaacaaaaaacagcaaagaaactagggcaagcaccacca
acacttattataatttatcccaagctgtgggtgcaacaagatgtaattggacttctagga
aatcacagagaacaactatctctgaacgtaagagggagcaactacgtctgcatgagattc
aagaaacactacaaagaaaaggtagcttttgaaatccgagtaggaatttgccaa

>gi568815590r:16010088_16277988|GENSCAN_predicted_peptide_2|147_aa
MGNKQSRKPRNSKNQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELKEEVR
TDGKETENFEKKVDEWITRITNAEKSLKDLMELKTMARGLRDKCRSLSNQCNQLEERVSA
MEDKMNEMKREEKFREKRIKRNEQSLQ

>gi568815590r:16010088_16277988|GENSCAN_predicted_CDS_2|444_bp
atggggaataaacagagcagaaaacccagaaactctaaaaatcagagtgcctctcctcct
ccaaaggaacgcagctcctcaccagcaacagaacaaagctggatggagaatgactttgac
gagttgagagaggaaggcttcagaagatcaaactactccgagctaaaggaggaagttcga
actgatggcaaagaaactgaaaactttgaaaaaaaagtagatgaatggataactagaata
accaatgcagagaagtccttaaaggacctgatggagctaaaaaccatggcacgaggacta
cgtgacaaatgcagaagcctcagtaaccaatgcaatcaactggaagaaagggtatcagca
atggaagacaaaatgaatgaaatgaagcgtgaagagaaatttagagaaaaaagaataaaa
agaaatgaacaaagcctccaataa

>gi568815590r:16010088_16277988|GENSCAN_predicted_peptide_3|202_aa
MNTHQKPTDAGTMLLQPAELNIRISSSAPLTLTVVTEECEYAYWKKVWSPMTALSLKNAL
GSSEIGRDPPKPTLPGGSNSDNRGNGRADGSAQHLLHVGKFLNIYETCAIYVSSPGERIL
NQRFGPHGAFILVREYDDIRFSHSGPPSTCPNFPGPSYYTARESAYALKLRALEIIQISQ
STKMQRNLGNPILPAIHQQPPQ

>gi568815590r:16010088_16277988|GENSCAN_predicted_CDS_3|609_bp
atgaatactcaccagaagccaacagatgctggaaccatgcttttacagcctgcagaactg
aatataagaatcagcagtagtgcccctttgacattgacagtggtgactgaagaatgtgag
tatgcctactggaaaaaggtgtggagccccatgacagctttgagcctaaagaatgcgctg
ggatccagtgaaattggcagggatcccccaaaacccacattacctggaggaagtaatagt
gacaacagagggaatggcagagcagatggcagtgcccagcatctattgcatgtgggaaaa
ttcctgaacatttatgaaacctgtgctatatatgtgtcaagccctggagaaagaattctg
aaccagagatttgggccccatggagcttttattctggtgagagagtatgatgatatacgt
ttctcacattctgggccaccaagcacctgccctaatttccctggtcccagttactacaca
gctagagagagtgcctatgccctgaagctcagagccctggaaattattcaaattagccag
tccacaaagatgcagagaaacctagggaaccccatactacctgccatacatcagcagccc
ccacagtag

>gi568815590r:16010088_16277988|GENSCAN_predicted_peptide_4|230_aa
MDITHVMNWVSEAIMFSDANALPETVASKKILYNFACEKIILDFPVWSLNLMTFVCQRGL
TQNEEGGGNATLEEEIAVIQLQVKKLPEAGRVWSFSTINSGEEAIVTPDLTSLISIHTGS
DIPGTNSWRQIQVQSKALPLFNSMEAEKGEETAGEKLEASRGWFIRFKERSHLHNIKVQD
DLASVDIEASENCPEDLVRIIDEGYYTTQQIFNVDETASVEKDAIYNFHT

>gi568815590r:16010088_16277988|GENSCAN_predicted_CDS_4|693_bp
atggatataactcatgtcatgaactgggtcagtgaagcaattatgttttctgatgccaat
gctctccctgaaactgtagcatcaaagaaaattctctacaattttgcatgtgagaagatt
atcctggactttcctgtttggtccctaaatctaatgacatttgtgtgtcagagaggtttg
acacagaatgaagaaggaggaggcaatgcgaccttggaggaggagattgcagtgatacaa
ctacaagtcaagaagctgcctgaagctggaagagtttggagtttcagcactataaactca
ggagaagaagcaatagttactccagatcttacttccctcatcagcatccatacaggctca
gatatacctggaacaaattcttggagacaaatccaagtccagagcaaggctctacctctc
ttcaattctatggaggctgagaaaggtgaggaaactgcaggagaaaagttggaagctagc
agaggttggttcataaggtttaaggaaagaagccatctccataacataaaagtacaagat
gatttagcaagtgtggatatagaagcttcagaaaattgtccagaagatctagttaggata
attgatgaaggttactacactacacaacagattttcaatgtagatgaaacagcttctgtt
gaaaaagatgccatctataactttcatacttag

>gi568815590r:16010088_16277988|GENSCAN_predicted_peptide_5|419_aa
MDEAGNHHSQQTIARTKTQTPHVLTHRWKLNNENTWSQEGERHTLGPVVGSMEQWDHFHN
QQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLVFAVLIPLIG
IVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSNMEKRIQHIL
DMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSLISLNTTLLD
LQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEIKGEVKVLNN
ITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIGPPGLKGDRG
AIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTWTPLMHGLEGCSNQSSVLCHPLT

>gi568815590r:16010088_16277988|GENSCAN_predicted_CDS_5|1260_bp
atggatgaagctggaaaccatcattctcagcaaactatcgcaaggacaaaaacccaaaca
ccgcatgttctcactcataggtggaaattgaacaatgagaacacatggtcacaggaaggg
gaacgtcacacactggggcctgttgtgggaagtatggagcagtgggatcactttcacaat
caacaggaggacactgatagctgctccgaatctgtgaaatttgatgctcgctcaatgaca
gctttgcttcctccgaatcctaaaaacagcccttcccttcaagagaaactgaagtccttc
aaagctgcactgattgccctttacctcctcgtgtttgcagttctcatccctctcattgga
atagtggcagctcaactcctgaagtgggaaacgaagaattgctcagttagttcaactaat
gcaaatgatataactcaaagtctcacgggaaaaggaaatgacagcgaagaggaaatgaga
tttcaagaagtctttatggaacacatgagcaacatggagaagagaatccagcatatttta
gacatggaagccaacctcatggacacagagcatttccaaaatttcagcatgacaactgat
caaagatttaatgacattcttctgcagctaagtaccttgttttcctcagtccagggacat
gggaatgcaatagatgaaatctccaagtccttaataagtttgaataccacattgcttgat
ttgcagctcaacatagaaaatctgaatggcaaaatccaagagaataccttcaaacaacaa
gaggaaatcagtaaattagaggagcgtgtttacaatgtatcagcagaaattatggctatg
aaagaagaacaagtgcatttggaacaggaaataaaaggagaagtgaaagtactgaataac
atcactaatgatctcagactgaaagattgggaacattctcagaccttgagaaatatcact
ttaattcaaggtcctcctggacccccgggtgaaaaaggagatcgaggtcccactggagaa
agtggtccacgaggatttccaggtccaataggtcctccgggtcttaaaggtgatcgggga
gcaattggctttcctggaagtcgaggactcccaggatatgccggaaggccaggaaattct
ggaccaaaaggccagaaaggggaaaaggggagtggaaacacattaacctggacacctctc
atgcacggactggagggctgctccaaccagtcctcagttctctgccacccattgacctag

>gi568815590r:16010088_16277988|GENSCAN_predicted_peptide_6|129_aa
MKKKQNRKTGNSKNQSTSPPPKERSSSPAMEQSWTENDFDELREEGFRQSNFCELKEEVR
THRKELKNHEKRSDEWLTRITNAEESLKDPMELKTTPRELRDECTSFSSQFDQLEEKVSV
IEDQMNEMK

>gi568815590r:16010088_16277988|GENSCAN_predicted_CDS_6|390_bp
atgaagaaaaaacagaacagaaaaactggaaattctaaaaatcagagcacctctcctcct
ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacagagaatgactttgac
gagttgagagaagaaggcttcagacaatcaaacttctgtgagctaaaggaggaagttcga
acccatcgcaaagaacttaaaaaccatgaaaaaagatcagatgaatggctaactagaata
accaatgcagaggagtccttaaaggacccgatggagctgaaaaccacaccacgagaacta
cgtgacgaatgcacaagcttcagtagccaattcgatcaactggaagaaaaggtttcagtg
attgaagatcaaatgaatgaaatgaagtga

>gi568815590r:16010088_16277988|GENSCAN_predicted_peptide_7|269_aa
MGKHVSQNYGRLTGFSPRPCRQLHEHHPHPGQAKAKSATSTTSHLPLALQPTGVRERSEE
NGINLENIFQDIIKNFHNLAVQVNIQIQEMWRTPVKYSMRRSSSRHIIIKFFRIKTEEKM
LRAAREKAQVTYERKPIRKGKIRSFSDKQMLREFVTTRAALQNRLKETLNIEGKNYYQPL
RKHTEVHRPVILQRNHRQNQEEMEILNSQKKAHSQMDSTLNSTIGSLMRTRDVQSSPGVE
TKSLHHRFDENWRFERQPPQKIMESLVNL

>gi568815590r:16010088_16277988|GENSCAN_predicted_CDS_7|810_bp
atgggtaagcatgtgagtcaaaactacggaagactgactgggttcagcccgaggccctgc
agacagctccacgagcatcaccctcatcctggtcaggctaaagctaaaagtgcaacttct
accacctctcatctccctctagccctgcagcctactggggtacgtgaaagaagtgaggag
aatggaatcaatttggaaaacatatttcaagatatcatcaagaacttccataacctagct
gtacaggtcaacattcaaattcaggaaatgtggagaaccccagtaaaatactccatgaga
agatcatcctcaagacacataatcatcaaattcttcaggatcaaaacggaagaaaaaatg
ttaagggcagccagagagaaagcccaggtcacctatgaaaggaagcccataagaaaagga
aaaataagatctttttcagacaagcaaatgctgagggaatttgtcaccaccagagctgcc
ttacaaaatcgcctgaaggaaacactaaatatagaagggaaaaattattaccaaccacta
cgaaaacacactgaagtacacagaccagtgatactacaaagaaaccacaggcagaaccag
gaagaaatggaaatcctgaacagccaaaaaaaggcccatagccagatggattccacgctg
aattctaccatagggagtctcatgcgcacgcgggatgttcaaagcagcccaggagtagaa
accaaaagtttacatcacaggtttgatgaaaattggcgatttgaaagacagcccccgcag
aagataatggaaagtttggtgaacctgtga

>gi568815590r:16010088_16277988|GENSCAN_predicted_peptide_8|95_aa
MLKIKKIGDEDFAFISKCKDPKVRTILLQGNGKETLSKMDHNIQDAVQVYCNFLFDPHLV
PGDGASEMAVIHVLTEKSKVMTGMNNGHTGLLPGL

>gi568815590r:16010088_16277988|GENSCAN_predicted_CDS_8|288_bp
atgctgaaaatcaagaaaattggagatgaggactttgcattcatcagtaaatgcaaagac
cccaaggttcgcaccattctcctccaggggaatggcaaagagactctctctaaaatggac
cacaacatccaggatgctgtgcaagtttattgcaactttctttttgacccgcatttggtg
ccaggggatggggcctctgagatggctgtgatccatgtcttgacagaaaaatccaaggtc
atgactggtatgaacaatggccatacagggctgttacccgggttgtag