GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:42:25 Sequence gi568815588r:56258421_56461236 : 202816 bp : 34.56% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 892 894 3 0 0 103 101 0 0.227 2.85 1.02 Intr + 59038 59206 169 0 1 96 94 87 0.810 8.70 1.03 Term + 59305 59435 131 2 2 57 49 108 0.757 1.16 1.04 PlyA + 60014 60019 6 1.05 2.10 PlyA - 60413 60408 6 1.05 2.09 Term - 65798 65678 121 1 1 96 42 113 0.488 4.47 2.08 Intr - 72019 71873 147 0 0 83 64 82 0.079 3.73 2.07 Intr - 100304 100116 189 1 0 115 80 71 0.075 6.78 2.06 Intr - 100527 100385 143 0 2 75 107 189 0.999 17.73 2.05 Intr - 101112 101056 57 0 0 83 96 104 0.998 8.86 2.04 Intr - 101433 101267 167 1 2 80 81 258 0.995 23.06 2.03 Intr - 101721 101598 124 0 1 66 78 115 0.995 7.54 2.02 Intr - 101963 101873 91 1 1 111 68 117 0.998 11.08 2.01 Init - 102939 102776 164 0 2 82 89 131 0.981 11.79 2.00 Prom - 119852 119813 40 -1.35 3.03 PlyA - 120177 120172 6 1.05 3.02 Term - 120372 120269 104 1 2 132 38 72 0.863 4.06 3.01 Init - 136681 136654 28 1 1 78 101 10 0.286 1.11 3.00 Prom - 136805 136766 40 -1.45 4.00 Prom + 136936 136975 40 -4.65 4.01 Init + 138356 138392 37 1 1 72 81 28 0.123 0.72 4.02 Intr + 155440 155555 116 2 2 60 53 101 0.165 3.05 4.03 Intr + 164383 164632 250 0 1 56 55 101 0.117 -0.31 4.04 Intr + 166088 166158 71 0 2 12 99 93 0.489 0.78 4.05 Term + 167394 167654 261 2 0 42 54 168 0.643 3.34 4.06 PlyA + 167985 167990 6 1.05 5.04 PlyA - 168453 168448 6 1.05 5.03 Term - 183280 183201 80 2 2 73 48 80 0.228 -0.55 5.02 Intr - 185671 184119 1553 0 2 2 60 315 0.144 8.11 5.01 Init - 186906 185984 923 0 2 69 -14 450 0.447 26.38 5.00 Prom - 187368 187329 40 -6.25 6.02 PlyA - 187538 187533 6 1.05 6.01 Sngl - 188410 187766 645 1 0 77 43 271 0.473 17.42 6.00 Prom - 191935 191896 40 -3.45 7.00 Prom + 195595 195634 40 -5.65 7.01 Init + 197109 197241 133 2 1 70 87 86 0.934 7.05 7.02 Term + 198766 198998 233 2 2 6 42 170 0.428 -0.15 7.03 PlyA + 199298 199303 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 93747 93851 105 2 0 66 37 104 0.803 0.53 S.002 Intr - 100304 100136 169 1 1 115 94 64 0.822 8.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:56258421_56461236|GENSCAN_predicted_peptide_1|100_aa MCLSTSFLLVVGQQFGTSQKVGVKRAVTRVSAHQATGVKKTAERHTTPFTKLWAAGTVFR HLCVPLFRMPGPKAEAGCSMPRPASWLNAEPQLAQDSGVA >gi568815588r:56258421_56461236|GENSCAN_predicted_CDS_1|303_bp atgtgtctgtccacctcattcctcttggttgtgggacaacaatttggaacctcccaaaag gtgggtgtgaaaagagctgtaacacgcgtttctgctcaccaagctacaggagtgaaaaaa actgctgaacgccacacaaccccattcaccaagctgtgggcagcaggaacagttttcaga cacctttgtgtccccctcttccggatgccagggcccaaggcagaagctggttgcagcatg cccagaccagcctcttggctgaacgcagagccacagctggcacaggattctggtgtagca tga >gi568815588r:56258421_56461236|GENSCAN_predicted_peptide_2|400_aa MVLDARPLTSAANRLPRREVDCDFKVILGIVGRQLNSAPGKMEAAETEAEAAALEVLAEV AGILEPVGLQEEAELPAKILVEFVVDSQKKDKLLCSQLQVADFLQNILAQEDTAKGLDPL ASEDTSRQKAIAAKEQWKELKATYREHVEAIKIGLTKALTQMEEAQRKRTQLREAFEQLQ AKKQMAMEKRRAVQNQWQLQQEKHLQHLAEVSAEVRERKTGTQQELDRVFQKLGNLKQQA EQERDKLQRYQTFLQLLYTLQGKLLFPEAEAEAENLPDDKPQQPTRPQEQSTGDTMGRDP GVSFKVNERWNRPRMPRSGGQNYVKRGALGARGTTVLLSSAASSLRSPYSGAALLCCPSP SCSARNGPAEENLACYHQQKPGVEECEVSPPNDDRPGCAA >gi568815588r:56258421_56461236|GENSCAN_predicted_CDS_2|1203_bp atggtccttgatgcccgcccactaacctctgcagccaatcgcctgccccggcgcgaagtc gactgtgacttcaaagtaatcttagggattgtgggaaggcagctgaactcggcgcctgga aagatggaggcagcggagacagaggcggaagctgcagccctagaggtcctggctgaggtg gcaggcatcttggaacctgtaggcctgcaggaggaggcagaactgccagccaagatcctg gttgagtttgtggtggactctcagaagaaagacaagctgctctgcagccagcttcaggta gcggatttcctgcagaacatcctggctcaggaggacactgctaagggtctcgaccccttg gcttctgaagacacgagccgacagaaggcaattgcagctaaggaacaatggaaagagctg aaggccacctacagggagcacgtagaggccatcaaaattggcctcaccaaggccctgact cagatggaggaagcccagaggaaacggacacaactccgggaagcctttgagcagctccag gccaagaaacaaatggccatggagaaacgcagagcagtccagaaccagtggcagctacaa caggagaagcatctgcagcatctggcggaggtttctgcagaggtgagggagcgtaagaca gggactcagcaggagcttgacagggtgtttcagaaacttggaaacctgaagcagcaggca gaacaggagcgggacaagctgcagaggtatcagaccttcctccagcttctgtataccctg cagggtaagctgttgttccctgaggctgaggctgaggcagagaatcttccagatgataaa ccccagcagccgactcgaccccaggagcagagtacaggagacaccatggggagagaccct ggtgtgtccttcaaggtaaatgagagatggaacaggcccagaatgccaagatctggggga cagaactatgtcaaaagaggagccttgggtgctagagggaccacggtactactgtccagt gctgcctcaagtctccgctccccatattctggtgcagcactcctttgctgccccagcccc agctgctcagcaaggaatggtcctgctgaggagaatctggcttgttatcatcagcaaaag ccaggtgtggaggaatgtgaggtctccccgccaaatgatgatagaccaggttgcgctgca tga >gi568815588r:56258421_56461236|GENSCAN_predicted_peptide_3|43_aa MEYYAAIKNGIITYSANYVIDNNKRILKAEKKADRLDTTGLEK >gi568815588r:56258421_56461236|GENSCAN_predicted_CDS_3|132_bp atggaatactatgcagccataaaaaatggtataattacatactctgcaaattatgtaata gacaacaataaacggattctgaaagctgaaaagaaggcagacaggctagacactacagga cttgagaaataa >gi568815588r:56258421_56461236|GENSCAN_predicted_peptide_4|244_aa MTITKSKKTTDAAISALGSTPSLVMRWPLQTHRGTALVVGDKILKNPLDYQACLIMGITL TELENLNAVEVTGFWVQQGPSCGAQLQRQGWLGQQSQSSSQNNLTFADLWSQLVEYGVPR SEIVRNPTKLLLNLSFNSADPIIFEVSEMFGAFGRPLQQWMSAVGPCLWDPMVLPYSPRP RSNWIDRMVESPIKNSAAAPARWQYLAELRQILQKAVCVLNQHSIYGAVFTIAEFMAPGT KECK >gi568815588r:56258421_56461236|GENSCAN_predicted_CDS_4|735_bp atgactattactaaaagtaaaaagacaacagatgctgccatatctgcattagggagcact ccaagcctagtaatgcgatggcccttgcagactcacagaggtactgccttggtagtcggg gataagatcttgaagaatcctctggattaccaggcatgccttataatgggaatcacgctc actgaattggaaaacctaaatgctgtggaagtaactggattctgggtacagcaggggcca agttgtggtgctcaactacaaaggcaaggttggcttggacagcagagtcaaagcagcagt caaaataatctgacttttgcagacctatggtctcagctagttgaatatggtgttcctaga agtgaaatagttaggaatcctactaaattactacttaatttatcatttaattcagcagat ccaatcatatttgaagtatcagagatgtttggagcatttggaaggcccctgcagcaatgg atgtcagcagtgggcccatgcttatgggatccaatggtcttaccatattctccacgaccc agaagcaattggattgatagaatggtggaatcgcctattaaaaactctgctgcagcacca gctaggtggcaatatcttgcagaactgcgacaaattctccagaaggctgtatgtgttctg aatcagcattcaatatacggtgctgtttttaccatagcagaatttatggctccaggaacc aaggagtgtaaatga >gi568815588r:56258421_56461236|GENSCAN_predicted_peptide_5|851_aa MKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAQKRKQERSKIDTLTSQLRELE KQEQTYSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRLLARLIKNKR EKNQIDAITNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLQRLNQEE VESLNRPITGSGIVAIINSSPTKKSPGPDGFTAEFYQSYKEELVPFLLKLFQSIEKEGIL PNSFYEASIILIPKPGRYTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVG FIPGMQGWAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISIFSKVS GYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLL NEIKDDTNKWKNIPCSWVGRINIVKMARPPMVIYRFNAFPIKLPMPFFTELEKTTLKFIW NQKRAHIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMP HIYNYLIFDKPDKNKQWGKESLFNKWCWENWLAIYRKLKLEPFVTPYTKINSRWIKDLHV RPKTRKTLEENLGNTIQDIGMGKDFTSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRV NRQPTKWEKIFTTYSSDKGLISRIYNELKQIYKIKTNNPIKKWAKDMNRHFSKEDIYAAK KHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKL VQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPKDYKSCCYKGTCTLPPIYIDNLEAREKN GNRRSMSPYGI >gi568815588r:56258421_56461236|GENSCAN_predicted_CDS_5|2556_bp atgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag aatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcactaaatgcccaa aagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaagagaattagaa aagcaagagcaaacatattcaaaagctagcagaaggcaagaaataactaaaatcagagca gaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctgg ttttttgaaaggatcaacaaaattgataggctgctagcaagactaataaagaataaaaga gagaagaatcaaatagatgcaataacaaatgataaaggggatatcaccaccgatcccaca gagatacaaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaat ctagaagaaatggataaattcctcgacacatacaccctccaaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggctctggaattgtggcaataatcaatagctca ccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccagagttacaag gaggagctagtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctc cctaactcattttatgaggccagcatcatcctgataccaaagccgggcagatacacaacc aaaaaagagaattttagaccaatatccttgatgaacatcgatgcaaaaatcctcaataaa atactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctgggcaatcaggcaggagaaggaaataaagggtattcaa ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatctagaa aaccccatcgtctcagctcaaaatctccttaagctgataagcatcttcagcaaagtttca ggatacaaaatcaatgtacaaaaatcgcaagcattcttatacaccaataacagacaaaca gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctg ggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctc aatgaaataaaagatgatacaaacaaatggaagaacattccatgctcatgggtaggaaga atcaatattgtgaaaatggccagaccgcccatggtaatttatagattcaatgccttcccc atcaagctaccaatgcctttcttcacagaattggaaaaaactactttaaagttcatatgg aaccaaaaaagagcccacattgccaagtcaatcctaagccaaaagaacaaagccggaggc atcacgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtac tggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataatgccg catatctacaactatctgatctttgacaaacctgacaaaaacaagcaatggggaaaggaa tcactatttaataaatggtgctgggaaaactggctagccatatatagaaagctgaaactg gaacccttcgttacaccttacacaaaaattaattcaagatggattaaagacttacatgtt agacctaaaaccagaaaaaccctagaagaaaacctaggcaataccattcaggacataggc atgggtaaggacttcacgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacaaaatgggagaaaatttttacaacctactcatctgacaaagggcta atatccagaatatacaatgaactcaaacaaatttacaagataaaaacaaacaaccccatc aaaaagtgggcgaaggacatgaacagacatttctcaaaggaagacatttatgcagccaaa aaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca atgagataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagg tgctggagaggatgtggagaaataggaacacttttacactgttggtgggactgtaaacta gttcaaccattgtggaagtcagtgtggcgattcctcagggatctagaactagaaatacca tttgacccagccatcccattactgggtatatacccaaaggactataaatcatgctgctat aaaggcacatgcacactacctcctatctatatagataatttggaagccagggagaagaat gggaaccgaagatctatgagcccttatggcatatga >gi568815588r:56258421_56461236|GENSCAN_predicted_peptide_6|214_aa MKEMKQEGKFREKRIKRNKQCLQEIWDYVKRPNLHLIGVPESDGENGTQLENTLQDIIQE NFPNLARQANIQIQEIQRMPQRYSSGRATPRYIIVRFTKVEMKEKMLRAVREKGWVTQKG KPIRLTADLSAETLQARREWGPMFNILKEKNFQRRISYPDKLSFISEGEIKYFTDKQMLR DYVTTRPALKELLKEALNMERNNRYQPLQKHAKL >gi568815588r:56258421_56461236|GENSCAN_predicted_CDS_6|645_bp atgaaggaaatgaagcaagaagggaagtttagagaaaaaagaataaaaagaaacaaacaa tgtctccaagaaatatgggactatgtgaaaagaccaaatctacatctgattggtgtacct gaaagtgacggggagaatggaacacagttggaaaacactctgcaggatattatccaggag aacttccccaatctagcaaggcaggccaacattcagattcaggaaatacagagaatgcca caaagatactcctcgggaagagcaactccaagatacataattgtcagattcaccaaagtt gaaatgaaggaaaaaatgttaagggcagtcagagagaaaggttgggttacccagaaaggg aagccaatcagactaacagcagatctctcagcagaaactctacaagccagaagagagtgg gggccaatgttcaacattcttaaagaaaagaattttcaacgaagaatttcatatccagac aaactaagcttcataagtgaaggagaaataaaatactttacagacaagcaaatgctgaga gactatgtcaccaccaggcctgccctaaaagagctcctgaaggaagcccttaacatggaa aggaacaacaggtaccagccactgcaaaaacatgccaaattgtaa >gi568815588r:56258421_56461236|GENSCAN_predicted_peptide_7|121_aa MKLKEAVGKGLKQNVESNYGILEKREPLSSSGRHSFVVMWKIRKAQPNTATTGTLASLLT VQGGTHLIVLQLCQCMPPYSTRKVTFNLLLPSLGPNDWPIWHPHPQENFTTASTNNHNIR R >gi568815588r:56258421_56461236|GENSCAN_predicted_CDS_7|366_bp atgaagcttaaggaggctgttggtaaaggtctgaagcaaaatgtagaaagtaactatgga attctagagaaaagggagcccttgtcaagtagtggaaggcattcatttgtggtgatgtgg aaaataagaaaagcccagcccaacactgccactactggcacccttgcaagcctccttact gtccaaggagggacccacctaatcgtattacaactgtgccagtgtatgccaccatacagc acaaggaaagtcacattcaacctgctgctgccatcactgggacccaatgactggcccatc tggcatcctcatccccaggaaaacttcaccacagcctccactaataaccataacataagg cgatga