GENSCAN 1.0    Date run: 6-Nov-116    Time: 02:09:27

Sequence gi568815593r:156929736_157157943 : 228208 bp : 42.12% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    242    237    6                               1.05
 1.05 Term -   8328   8189  140  1  2   47   34   139 0.579   1.54
 1.04 Intr -  19996  19916   81  2  0  124   75    81 0.815   9.09
 1.03 Intr -  22055  21777  279  0  0   50  101   299 0.999  23.93
 1.02 Intr -  25021  24680  342  2  0  110   99   141 0.649  11.88
 1.01 Init -  33463  33406   58  1  1   72   80    24 0.482   1.53
 1.00 Prom -  33530  33491   40                              -7.45

 2.00 Prom +  51054  51093   40                              -6.75
 2.01 Init +  55747  55892  146  0  2   81   58   116 0.757   7.54
 2.02 Intr +  64774  64813   40  1  1   93   84    22 0.009  -0.39
 2.03 Term +  77878  78156  279  0  0   71   41   250 0.813  13.06
 2.04 PlyA +  79704  79709    6                               1.05

 3.00 Prom +  83618  83657   40                              -3.65
 3.01 Init +  85439  85518   80  1  2   83   47    55 0.148   1.48
 3.02 Intr +  93981  94089  109  0  1  109   91    25 0.628   4.27
 3.03 Term +  95685  95840  156  2  0   65   48    77 0.553  -1.65
 3.04 PlyA +  96245  96250    6                               1.05

 4.11 PlyA -  98880  98875    6                               1.05
 4.10 Term - 100106  99998  109  1  1   76   47   119 0.913   3.60
 4.09 Intr - 103152 103119   34  1  1   82  121    16 0.935   0.66
 4.08 Intr - 107626 107512  115  1  1   88  106    84 0.976   9.30
 4.07 Intr - 107803 107704  100  0  1   36  106    39 0.149  -0.31
 4.06 Intr - 118353 118169  185  0  2   55  110    39 0.065   0.46
 4.05 Intr - 118849 118719  131  2  2   89   98    -9 0.059  -0.31
 4.04 Intr - 122919 122626  294  1  0   70   98   407 0.073  35.96
 4.03 Intr - 125798 125466  333  0  0  130  113   122 0.989  13.52
 4.02 Intr - 128220 128163   58  0  1  101  111     2 0.550   1.34
 4.01 Init - 130674 130564  111  0  0   58   61    93 0.455   3.86
 4.00 Prom - 154729 154690   40                              -3.65

 5.08 PlyA - 155102 155097    6                              -0.45
 5.07 Term - 157527 157367  161  1  2  102   36   228 0.969  16.12
 5.06 Intr - 165724 165517  208  1  1   88   -6   161 0.000   4.43
 5.05 Intr - 169166 169123   44  0  2  102   98    -9 0.242  -1.46
 5.04 Intr - 177227 176892  336  0  0   97  109   373 0.752  34.87
 5.03 Intr - 203662 203618   45  2  0   63   87    73 0.036   2.16
 5.02 Intr - 208054 207971   84  2  0   98   56    92 0.103   5.87
 5.01 Init - 209696 209072  625  2  1   81   40   511 0.521  41.15
 5.00 Prom - 213387 213348   40                              -5.15

 6.03 PlyA - 213981 213976    6                               1.05
 6.02 Term - 223091 222979  113  0  2   36   48   144 0.893   2.94
 6.01 Init - 226456 226444   13  1  1   86  100    -2 0.332   1.32

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 122914 122631  284  2  2   10   46   566 0.906  39.20
S.002 Intr - 165724 165571  154  1  1   88  101   221 0.999  22.12
S.003 Init - 179248 179191   58  1  1   35  108    77 0.976   4.10
S.004 Sngl - 212987 212724  264  2  0   91   38   177 0.843   8.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:156929736_157157943|GENSCAN_predicted_peptide_1|299_aa
MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSHNSNSMCWGKDQCP
YSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGWF
NDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTTG
TPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSSV
ESTSADTVLLTSKGHGKVVGGIAILSASSAEGTEKGRGMKETTFVLTNAIVRKKEIGCT

>gi568815593r:156929736_157157943|GENSCAN_predicted_CDS_1|900_bp
atgtccaaagaacctctcattctctggctgatgattgagttttggtggctttacctgaca
ccagtcacttcagagactgttgtgacggaggttttgggtcaccgggtgactttgccctgt
ctgtactcatcctggtctcacaacagcaacagcatgtgctgggggaaagaccagtgcccc
tactccggttgcaaggaggcgctcatccgcactgatggaatgagggtgacctcaagaaag
tcagcaaaatatagacttcaggggactatcccgagaggtgatgtctccttgaccatctta
aaccccagtgaaagtgacagcggtgtgtactgctgccgcatagaagtgcctggctggttc
aacgatgtaaagataaacgtgcgcctgaatctacagagagcctcaacaaccacgcacaga
acagcaaccaccaccacacgcagaacaacaacaacaagccccaccaccacccgacaaatg
acaacaaccccagctgcacttccaacaacagtcgtgaccacacccgatctcacaaccgga
acaccactccagatgacaaccattgccgtcttcacaacagcaaacacgtgcctttcacta
accccaagcacccttccggaggaagccacaggtcttctgactcccgagccttctaaggaa
gggcccatcctcactgcagaatcagaaactgtcctccccagtgattcctggagtagtgtt
gagtctacttctgctgacactgtcctgctgacatccaaagggcatggaaaagttgtagga
ggaattgccattctcagtgcaagttcagcagaaggaacagagaaaggcagaggaatgaag
gaaaccacatttgtactcaccaatgccattgttaggaagaaagaaattggatgcacttga

>gi568815593r:156929736_157157943|GENSCAN_predicted_peptide_2|154_aa
MKKNQSKKAENSQNQNGSSPKDHNSSPAREQNWTVNEFDELTEEVSEDRLYKTNPTPHIM
PSGCRLLVVPPCWNLDSGSLIPTTPLGSAQLGTLYEGSNPTFHLGIVTVEALAGKGSTPQ
QASAWAPRLSHSTSETQVEAAKPSSLLHSVHLQA

>gi568815593r:156929736_157157943|GENSCAN_predicted_CDS_2|465_bp
atgaagaaaaaccagagcaaaaaggctgaaaattcccaaaaccagaatggctcttctcca
aaggatcacaactcctctccagcaagggaacaaaactggacggtgaatgaatttgatgaa
ttgacagaggaggtttcagaagatagattgtataaaacaaatcctacacctcatatcatg
ccatctggttgccggctgctggtggtgccaccatgctggaatctggacagtggcagtctt
attccaacaactccactaggcagtgcccaactggggactctgtatgagggctccaacccc
acatttcaccttggtattgtgacagtagaggctcttgcagggaagggctccaccccgcag
caggcttctgcctgggcacccaggctttcccattcaacctctgaaacccaggtagaagct
gccaagccttcttcactcttgcattctgtacacctacaggcctaa

>gi568815593r:156929736_157157943|GENSCAN_predicted_peptide_3|114_aa
MDEAGNHHSQQTITRTENQTLDVLTHREHYEPVTNIHKLDHENQSVNKLPLHFLGLPIFW
RLQKQPLTNQLKLHHFLPPNKHPLPQWFSTVPAGTLSVVTTGGCDWHLVGGGQG

>gi568815593r:156929736_157157943|GENSCAN_predicted_CDS_3|345_bp
atggatgaagctggaaaccatcattctcagcaaactatcacaagaacagaaaaccaaaca
ctggatgttctcactcatagagagcactacgaacctgtaacaaacatccacaaactagac
catgaaaaccagagtgtaaataagcttcccctacattttttgggtcttcctatattttgg
agactccagaagcagccgctgaccaaccagctaaagctacaccacttcctaccccccaac
aaacaccctctacctcagtggttctcaactgtgcccgcagggacattgtcagttgtcacc
actggaggatgtgactggcatctagtgggtggaggccagggatga

>gi568815593r:156929736_157157943|GENSCAN_predicted_peptide_4|489_aa
MENTATEKALGLRQVGKKALVAGAESAGERVVEEIYGADPIMHPQVVILSLILHLADSVA
GSVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHVTYRKDTRYKL
LGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPPKVTTTPIVTTVP
TVTTVRTSTTVPTTTTVPMTTVPTTTVPTTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTT
TSVPVTTTVSTFVPPMPLPRQNHEPVLGLQASATVPSCFVICKKDVIPRSCRMLVRIKWD
ERFGTWLQGGNSQTGFSVSSHTTSTTIINSDEDFCDRMCGVFSPYIKQWTPAVSLIQFLH
CLPGERVISHSCPRFSPYGYKLRSFGLTLMFQKSPSTNTGKCGWQLFLEHSLLTANTTKG
IYAGVCISVLVLLALLGVIIAKKYFFKKEVQQLSVSFSSLQIKALQNAVEKEVQAEDNIY
IENSLYATD

>gi568815593r:156929736_157157943|GENSCAN_predicted_CDS_4|1470_bp
atggagaacacagccacggaaaaggccttagggttgaggcaagttggaaagaaagctcta
gtagctggggctgagtcagcaggggagagagtggtagaagaaatctatggggctgatccc
ataatgcatcctcaagtggtcatcttaagcctcatcctacatctggcagattctgtagct
ggttctgtaaaggttggtggagaggcaggtccatctgtcacactaccctgccactacagt
ggagctgtcacatccatgtgctggaatagaggctcatgttctctattcacatgccaaaat
ggcattgtctggaccaatggaacccacgtcacctatcggaaggacacacgctataagcta
ttgggggacctttcaagaagggatgtctctttgaccatagaaaatacagctgtgtctgac
agtggcgtatattgttgccgtgttgagcaccgtgggtggttcaatgacatgaaaatcacc
gtatcattggagattgtgccacccaaggtcacgactactccaattgtcacaactgttcca
accgtcacgactgttcgaacgagcaccactgttccaacgacaacgactgttccaatgacg
actgttccaacgacaactgttccaacaacaatgagcattccaacgacaacgactgttctg
acgacaatgactgtttcaacgacaacgagcgttccaacgacaacgagcattccaacaaca
acaagtgttccagtgacaacaactgtctctacctttgttcctccaatgcctttgcccagg
cagaaccatgaaccagtgctgggattacaggcatcagccaccgtgcccagctgttttgtc
atttgtaaaaaggatgtaataccaagatcttgcagaatgcttgtgaggatcaaatgggat
gaaagatttgggacatggttacaggggggaaactcccaaactgggttttctgtctcttct
cacaccacgtcaacaacaatcatcaactcagatgaagacttctgtgaccgaatgtgtgga
gttttttccccatacatcaagcagtggacaccggctgtgtctttaattcagttcttacac
tgtctacccggagagagggtcatatcccacagttgtcctagattttcaccgtatggttat
aaactgagatcatttggtcttacccttatgttccagaagtccccaagcaccaatacaggt
aaatgtggctggcaactgttcctagaacatagtctactgacggccaataccactaaagga
atctatgctggagtctgtatttctgtcttggtgcttcttgctcttttgggtgtcatcatt
gccaaaaagtatttcttcaaaaaggaggttcaacaactaagtgtttcatttagcagcctt
caaattaaagctttgcaaaatgcagttgaaaaggaagtccaagcagaagacaatatctac
attgagaatagtctttatgccacggactaa

>gi568815593r:156929736_157157943|GENSCAN_predicted_peptide_5|500_aa
MGEPQQVSALPPPPMQYIKEYTDENIQEGLAPKPPPPIKDSYMMFGNQFQCDDLIIRPLE
SQGIERLHPMQFDHKKELRKLNMSILINFLDLLDILIRSPGSIKREEKLEDLKLLFVHVH
HLINEYRPHQARETLRVMMEVQKRQRLETAERFQKHLERVIEMIQNCLASLPDDLPHSEA
GMRVKTEPMDADDSNNCTGQNEHQRENSAAGWTVVSRISLEATEDGKDCQSGSLNDQQSC
EEPSKSDGIYGWSSEVEYRAEVGQNAYLPCFYTPAAPGNLVPVCWGKGACPVFECGNVVL
RTDERDVNYWTSRYWLNGDFRKGDVSLTIENVTLADSGIYCCRIQIPGIMNDEKFNLKLV
IKPAETQTLGSLPDINLTQISTLANELRDSRLANDLRDSGATIRIGIYIGAGICAGLALA
LIFGALIFKCKCFCFSLPLITNSSVRTRLANAVAEGIRSEENIYTIEENVYEVEEPNEYY
CYVSSRQQPSQPLGCRFAMP

>gi568815593r:156929736_157157943|GENSCAN_predicted_CDS_5|1503_bp
atgggtgaaccacagcaagtgagtgcacttccaccacctccaatgcaatatatcaaggaa
tatacggatgaaaatattcaagaaggcttagctcccaagcctccccctccaataaaagac
agttacatgatgtttggcaatcagttccaatgtgatgatcttatcatccgccctttggaa
agtcagggcatcgaacggcttcatcctatgcagtttgatcacaagaaagaactgagaaaa
cttaatatgtctatccttattaatttcttggaccttttagatattttaataaggagccct
gggagtataaaacgagaagagaaactagaagatcttaagctgctttttgtacacgtgcat
catcttataaatgaataccgaccccaccaagcaagagagaccttgagagtcatgatggag
gtccagaaacgtcaacggcttgaaacagctgagagatttcaaaagcacctggaacgagta
attgaaatgattcagaattgcttggcttctttgcctgatgatttgcctcattcagaagca
ggaatgagagtaaaaactgaaccaatggatgctgatgatagcaacaattgtactggacag
aatgaacatcaaagagaaaattcagctgctggttggacggtcgtatccaggataagtttg
gaagccaccgaagatggcaaagattgtcagtctggttctttgaatgaccaacaatcctgt
gaagaaccttctaaatctgatggtatttatggctggtcctcagaagtggaatacagagcg
gaggtcggtcagaatgcctatctgccctgcttctacaccccagccgccccagggaacctc
gtgcccgtctgctggggcaaaggagcctgtcctgtgtttgaatgtggcaacgtggtgctc
aggactgatgaaagggatgtgaattattggacatccagatactggctaaatggggatttc
cgcaaaggagatgtgtccctgaccatagagaatgtgactctagcagacagtgggatctac
tgctgccggatccaaatcccaggcataatgaatgatgaaaaatttaacctgaagttggtc
atcaaaccagcagagacacagacactggggagcctccctgatataaatctaacacaaata
tccacattggccaatgagttacgggactctagattggccaatgacttacgggactctgga
gcaaccatcagaataggcatctacatcggagcagggatctgtgctgggctggctctggct
cttatcttcggcgctttaattttcaaatgtaagtgtttttgtttctctctccctctgata
acaaattcttcagtgagaaccagattggcaaatgcagtagcagagggaattcgctcagaa
gaaaacatctataccattgaagagaacgtatatgaagtggaggagcccaatgagtattat
tgctatgtcagcagcaggcagcaaccctcacaacctttgggttgtcgctttgcaatgcca
tag

>gi568815593r:156929736_157157943|GENSCAN_predicted_peptide_6|41_aa
MENRMYNTKSEDQCELWTSGTNDASMSAHQNQMNHSGAGCR

>gi568815593r:156929736_157157943|GENSCAN_predicted_CDS_6|126_bp
atggaaaacagaatgtataacaccaagagtgaagaccagtgtgaactatggacttcaggt
actaatgatgcgtcaatgtcagctcatcaaaatcaaatgaaccactctggtgcaggatgt
cgatga