GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:15:41 Sequence gi568815596f:171985678_172188254 : 202577 bp : 42.87% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14293 14355 63 0 0 125 58 71 0.147 9.01 1.02 Intr + 38719 38907 189 0 0 47 91 102 0.208 5.26 1.03 Intr + 70608 70658 51 2 0 90 57 56 0.071 0.89 1.04 Intr + 71788 71861 74 0 2 74 67 56 0.129 -0.61 1.05 Intr + 76794 76956 163 0 1 45 93 83 0.252 3.56 1.06 Intr + 79927 80075 149 0 2 63 66 174 0.563 10.81 1.07 Intr + 80587 80629 43 1 1 69 79 42 0.903 -1.28 1.08 Intr + 85230 85393 164 2 2 62 115 90 0.963 6.95 1.09 Intr + 89794 89941 148 1 1 97 99 99 0.982 11.22 1.10 Intr + 95451 95570 120 2 0 64 64 168 0.300 11.77 1.11 Term + 95627 95764 138 1 0 44 48 154 0.189 3.98 1.12 PlyA + 96734 96739 6 -1.95 2.00 Prom + 96761 96800 40 -4.75 2.01 Init + 100016 100427 412 1 1 92 59 352 0.215 29.42 2.02 Intr + 100977 101176 200 1 2 91 82 231 0.995 20.95 2.03 Intr + 102326 102576 251 1 2 137 45 122 0.635 8.11 2.04 Intr + 102619 102725 107 1 2 83 41 61 0.665 -0.16 2.05 Intr + 104492 104660 169 0 1 68 42 168 0.922 8.28 2.06 Term + 104963 105365 403 2 1 69 43 176 0.365 4.84 2.07 PlyA + 106032 106037 6 -0.45 3.14 PlyA - 106241 106236 6 1.05 3.13 Term - 109818 109285 534 0 0 6 54 223 0.155 3.96 3.12 Intr - 110548 110300 249 1 0 8 -1 226 0.265 2.41 3.11 Intr - 110992 110931 62 2 2 88 78 65 0.356 3.03 3.10 Intr - 113106 112931 176 2 2 54 64 77 0.725 0.56 3.09 Intr - 113734 113632 103 2 1 67 64 80 0.763 1.81 3.08 Intr - 114753 114570 184 0 1 107 88 99 0.990 10.24 3.07 Intr - 115267 114890 378 1 0 87 105 231 0.987 18.94 3.06 Intr - 115969 115785 185 2 2 102 75 244 0.702 23.09 3.05 Intr - 116862 116462 401 2 2 66 72 533 0.323 42.72 3.04 Intr - 120367 119976 392 1 2 69 80 275 0.227 17.40 3.03 Intr - 121001 120818 184 1 1 41 44 251 0.645 14.87 3.02 Intr - 124563 124413 151 1 1 74 35 169 0.563 8.50 3.01 Init - 127175 127157 19 2 1 57 101 4 0.526 -1.09 3.00 Prom - 132119 132080 40 -6.35 4.00 Prom + 132213 132252 40 -5.35 4.01 Init + 134537 134674 138 1 0 100 87 64 0.402 7.69 4.02 Intr + 135912 136066 155 2 2 35 68 143 0.648 4.95 4.03 Intr + 148790 148808 19 2 1 65 111 24 0.050 -1.90 4.04 Term + 154254 154502 249 2 0 -1 48 290 0.111 10.82 4.05 PlyA + 156848 156853 6 1.05 5.03 PlyA - 156891 156886 6 1.05 5.02 Term - 160685 160419 267 2 0 49 49 183 0.448 5.01 5.01 Init - 167533 167504 30 1 0 78 58 29 0.036 -1.29 5.00 Prom - 167736 167697 40 -2.45 6.00 Prom + 168028 168067 40 -8.15 6.01 Init + 175970 176260 291 1 0 81 77 127 0.040 7.37 6.02 Intr + 181797 181910 114 2 0 50 105 52 0.298 2.82 6.03 Term + 182213 182644 432 1 0 45 48 223 0.391 8.31 6.04 PlyA + 185537 185542 6 1.05 7.02 PlyA - 186647 186642 6 1.05 7.01 Term - 193850 193620 231 2 0 62 45 164 0.834 4.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:171985678_172188254|GENSCAN_predicted_peptide_1|433_aa MAAPSGVHLLVRRGKRVEESPLPAERPLPSMEAQGVFTNKNNSEKPKPQSPTNDLSRQIK IGLQNPNHEKRHDFLHVIFTNGKQGNVFEVLSTELGTSKLLTVQLIVLAATSPKRVFGQR RQLGLRRKEKEISGEVNCIALAISVNFSMTAQQPFCLQVITLPDHRAEWRQEHRLFGGDL VDMTTEEIDALVHREIISHNAYPSPLGYGGFPKSVCTSVNNVLCHGIPDSRPLQDGDIIN IDVTVYYNGYHGDTSETFLVGNVDECGKKLVEVARRCRDEAIAACRAGAPFSVIGNTISM PVKSCRHTPTPHATVRVTSTFFQIRGVKFATTGHLDCLCEFGLKPYNQPSKASPEVKFEA LPPPTPHARTHARPFAALGIRAWAPLAQTGPSRIGTRACTLQEPQTSAVLETERVPERGD TGAFESKGLLGRD >gi568815596f:171985678_172188254|GENSCAN_predicted_CDS_1|1302_bp atggcggcgcccagtggcgtccacctgctcgtccgcagaggtaagcgcgtggaggagagc cccttacctgctgaaagaccattaccaagcatggaggcccagggggtctttacaaacaag aataattctgagaaacccaaaccacaaagcccaaccaatgacctctccaggcagataaaa attggcctccagaaccctaaccatgaaaaaagacatgacttcctacatgtaatcttcacc aatggcaagcagggtaatgtctttgaagtactcagcaccgagcttggcaccagtaagctc ttgactgtccaactaatagttctggcagccacctccccaaagcgtgtgtttggtcagcga aggcagcttggcttaagaagaaaagaaaaagaaatctctggtgaggtgaattgtattgct ttagccatttcagtcaatttcagcatgacagcccaacagcccttctgcctgcaagttata accttgccagatcacagagcagagtggagacaagaacacaggctctttgggggagacctg gttgacatgacaactgaagagatagatgctcttgttcatcgggaaatcatcagtcataat gcctatccctcacctctaggctatggaggttttccaaaatctgtttgtacctctgtaaac aacgtgctctgtcatggtattcctgacagtcgacctcttcaggatggagatattatcaac attgatgtcacagtctattacaatggctaccatggagacacctctgaaacatttttggtg ggcaatgtggacgaatgtggtaaaaagttagtggaggttgccaggaggtgtagagatgaa gcaattgcagcttgcagagcaggggctcccttctctgtaattggaaacacaatcagcatg ccagtcaagagctgcagacatactccaactccccatgccacagttagggttaccagtact ttctttcaaattagaggagtgaaatttgctactactggtcacttggattgtctctgtgaa tttggacttaaaccctacaatcagccctccaaggccagtccagaggtgaagtttgaggcc ctccccccacccaccccacacgcacgcacgcacgctagaccgtttgctgcactaggaatt cgagcttgggccccactcgcccagactggcccttctcgcatcgggacccgcgcttgcacg ctgcaggagccgcaaacgtcagctgttctggaaaccgagagggtcccagagagaggagat acgggcgcatttgagagcaagggcctacttggccgggactga >gi568815596f:171985678_172188254|GENSCAN_predicted_peptide_2|513_aa MPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSASSFSR PLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGTCACQGEGEEEVQGREGKKE RGRRGEGEREKEKRGERGADSEKSTVVEGGEVRFNGKGKKIRKPRTIYSSLQLQALNRRF QQTQYLALPERAELAASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSA GSPPVPPGWNPNSSSGKGSGGNAGSYIPSYTSWYPSAHQEAMQQPQLMSLPPPGPSIPSG KEGPRGKKEQWRRDALHLLGAPRGLPAFACPEALDSVAEKTQPDAKQGPGKTRLSDCPSI NQCAPGDRAQSQAATLGAPQSRSRAEAVPESCGPRQAAKRCSYFSRTRLPALLPAGAGPP PLGGLFRRAVGSRARKAVICVVTSTHFSIGAGERSDAPGSSDALWEGEEEVVGLLGVGGG GVMEAGKVVGGGGMMPGTSKPLPGLRLGLLRST >gi568815596f:171985678_172188254|GENSCAN_predicted_CDS_2|1542_bp atgccagaaagtctcaacagccccgtgtcgggcaaggcggtgtttatggagtttgggccg cccaaccagcaaatgtctccttctcccatgtcccacgggcactactccatgcactgttta cactcggcgggccattcgcagcccgacggcgcctacagctcagcctcgtccttctcccga ccgctgggctacccctacgtcaactcggtcagcagccacgcatccagcccctacatcagt tcggtgcagtcctacccgggcagcgccagcctcgcccagagccgcctggaggacccaggt acgtgcgcttgccagggagagggagaggaggaggtacaagggagagagggaaagaaggag cgggggagaagaggagagggagagagagagaaagagaagagaggagagcgaggggcggac tcggagaagagcacggtggtggaaggcggtgaagtgcgcttcaatggcaagggaaaaaag atccgtaaacccaggacgatttattccagtttgcagttgcaggctttgaaccggaggttc cagcaaactcagtacctagctctgccggagagggcggagctcgcggcctctttgggactc acacagactcaggtcaagatctggttccaaaacaagcgatccaagttcaagaagctgatg aagcagggtggggcggctctggagggtagtgcgttggccaacggtcgggccctgtctgct ggctccccacccgtgccgcccggctggaaccctaactcttcatccgggaagggctcagga ggaaacgcgggctcctatatccccagctacacatcgtggtacccttcagcgcaccaagaa gctatgcagcaaccccaacttatgtccctcccgcctccaggtccatccatcccgtccgga aaagaaggacccagagggaagaaggaacagtggaggcgggacgccctccatctcctcgga gccccgcgaggcctccctgcttttgcatgcccggaggcgctggattccgtcgcggaaaag acgcagccagacgccaagcaggggcccggaaagacacgtctgtcagactgcccctctatc aatcagtgcgctccaggggacagggcacagtcccaggcagccaccctgggagctccacaa tctcgcagccgagccgaggcagtcccggagagctgcgggcccagacaggctgccaagcgc tgttcctacttctcccgcactcgccttccagcgctgctgcccgcgggagctggccctcca cctttggggggtcttttcaggcgcgcggtggggagcagagcccggaaggcagtgatctgt gttgtgacaagcacccatttttcaattggtgctggtgaaagatcagatgcgccaggctct tccgacgccctttgggaaggggaggaggaagtggtggggctgctgggggtggggggaggc ggtgtgatggaagcagggaaagtagtcgggggtgggggaatgatgccggggacctccaag ccccttccaggcctgagattgggacttctccgcagcacctag >gi568815596f:171985678_172188254|GENSCAN_predicted_peptide_3|1005_aa MKIPLGVPDGFTMPLDHPLPPRRPTPKSPGLAWSTGSRKQRLPSPWLPPGQGLLRHETNP KGIRKRRFLELLLRGRKHKGTETGIASWTSGPETPVSGTLDETVRSRSSSACSPPIPRGL ALGSYEGRGRSAASAWDKRSKAEESLSDGIRTRKHRAETSGSPYLPPPFSMFLRSKRMPK SEKSENREGERKCPGAEEVAGRPQAPVFSQTRFKPPLHSSQITKETGVTLTVDKTVNSLC HCFPDFRPKMTGVFDSLVADMHSTQIAASSTYHQHQQPPSGGGAGPGGNSSSSSSLHKPQ ESPTLPVSTATDSSYYTNQQHPAGGGGGGGSPYAHMGSYQYQASGLNNVPYSAKSSYDLG YTAAYTSYAPYGTSSSPANNEPEKEDLEPEIRIVNGKPKKVRKPRTIYSSFQLAALQRRF QKTQYLALPERAELAASLGLTQTQVKIWFQNRRSKFKKMWKSGEIPSEQHPGASASPPCA SPPVSAPASWDFGVPQRMAGGGGPGSGGSGAGSSGSSPSSAASAFLGNYPWYHQTSGSAS HLQATAPLLHPTQTPQPHHHHHHHGGGGAPAATCSGPSQALLTPGATKRAPLSGEPDSSL AKASLKGRISDLSPGSRLFRAPPRTCLPSPPGSGQSRAQSSSRCPGREASRPRATFTSFR GCDTARRTHNDTHSNTVTSHTHTPSHSYNTDTRSHAHSHSYPYTLIDKLTYAHSCILTHR LIYTGLSQLRQNAGTELQGQLPAAQFSGRRRGAEEAGAKRSSEVVPSAQENNVREQEAPS PLKKSFLRAVKLECNSPTPGVPRTRSKVKSPLLSLHGTATTSGFNLFREKKVQAGLGGAA RRLHAKNADSLQTDLSSLGLSGSPTSPPGPERTSCSVPAVCGLRGSAPKEALLGRVPSRV GSRARRWGPRVGCRVGEGAEPAHEAGRGCWAEDYNFGAGEARAKGLDLEHARVTPGDRLG PARSKRGRLPTLPLLRPDCGPGVSDTREVKSSHPEQWGLLAGEPD >gi568815596f:171985678_172188254|GENSCAN_predicted_CDS_3|3018_bp atgaagattcccctaggcgtgcctgacggcttcacaatgcccctggaccacccactgccg cctcggaggcccacgcccaaatcccctggcctggcctggtccactggatcccggaaacag cgtctgccttcaccttggctgccgccaggacaaggcctgcttagacacgagacaaacccc aaagggattcgaaagcgcaggttcctagagctactacttcggggacgaaaacacaaaggt accgaaactggcattgcctcttggaccagcggcccggaaactcccgtctccgggaccctc gacgagactgtgaggagtaggagcagcagcgcatgcagccctcccatcccccgagggttg gcgttaggctcctacgaggggagaggtcgcagcgcggcgtcagcctgggacaagcgatcc aaagcagaagaaagtctcagcgatgggattcgaacacgcaagcacagagcagaaacttca ggctccccttatttacccccaccgttctctatgttcctgcggagcaaaagaatgccgaag tcagagaaatccgaaaacagagaaggggaaagaaagtgccctggggcagaagaggtagct ggccgccctcaagcaccagtgttctcccaaactcgttttaagcctccccttcactcctct cagatcactaaggagacgggcgtcactctgactgtagacaaaactgtcaattccctatgc cactgcttccctgacttcagaccaaagatgactggagtctttgacagtctagtggctgat atgcactcgacccagatcgccgcctccagcacgtaccaccagcaccagcagcccccgagc ggcggcggcgccggcccgggtggcaacagcagcagcagcagcagcctccacaagccccag gagtcgcccacccttccggtgtccaccgccaccgacagcagctactacaccaaccagcag cacccggcgggcggcggcggcggcgggggctcgccctacgcgcacatgggttcctaccag taccaagccagcggcctcaacaacgtcccttactccgccaagagcagctatgacctgggc tacaccgccgcctacacctcctacgctccctatggaaccagttcgtccccagccaacaac gagcctgagaaggaggaccttgagcctgaaattcggatagtgaacgggaagccaaagaaa gtccggaaaccccgcaccatctactccagtttccagctggcggctcttcagcggcgtttc caaaagactcaatacttggccttgccggagcgagccgagctggcggcctctctgggcctc acccagactcaggtcaaaatctggttccagaaccgccggtccaagttcaagaagatgtgg aaaagtggtgagatcccctcggagcagcaccctggggccagcgcttctccaccttgtgct tcgccgccagtctcagcgccggcctcctgggactttggtgtgccgcagcggatggcgggc ggcggtggtccgggcagtggcggcagcggcgccggcagctcgggctccagcccgagcagc gcggcctcggcttttctgggcaactacccctggtaccaccagacctcgggatccgcctca cacctgcaggccacggcgccgctgctgcaccccactcagaccccgcagccgcatcaccac caccaccatcacggcggcgggggcgccccggctgcgacctgcagtggcccgtctcaggcc ctgctcactcccggggccaccaaacgggcccctctctcgggggaaccggacagcagcttg gcaaaggcctccctaaaaggccgcatttctgacctgagccccgggtctcggctgtttcga gccccgcctcggacttgccttccctcccctccggggtccggtcagtcacgtgctcagagc tcaagccgctgcccaggccgggaagccagccggccgagagctacgtttacatctttccga ggctgcgacaccgcgaggcgcacacacaatgacacacactcgaacaccgtcacatcacac actcatactccatcacactcttacaacactgacacacgaagtcatgcacactcgcattca tacccttacacactgatagacaaactcacatatgcacactcatgtatacttactcacaga cttatatacactgggctctcccaactcaggcagaacgcgggcactgagctccagggccaa ctgcctgccgcccagttcagtggtagaagacgaggggcagaagaggcgggtgcaaagaga agctctgaagtggtcccctccgcccaagagaacaatgtccgggagcaagaagccccgtcc cctctcaaaaagtcgttcttacgggcagtaaaactcgagtgcaactcccccactcctggc gtcccccgaactcgctcaaaagtcaaaagcccgctcctttccctccatggaacagcgacc accagtggctttaacctgtttagggaaaagaaagtgcaggcaggtcttggaggagcagct cggcgcctccacgccaagaacgcagacagcctccagacagacctttcttccctcggcctc tccggctcccccactagtcctcccgggccagagcgcacaagctgcagtgtcccggcagtt tgcgggctgcgtggcagcgctccgaaagaggcccttttggggagggtccccagtcgagtc gggtcccgggctaggagatgggggccgagggtcgggtgccgagtgggagaaggtgcagaa ccggcccacgaggcgggtcgggggtgctgggcagaggattacaacttcggggcaggagaa gcgagggcgaaaggactggacttggagcacgctcgggtgaccccgggggacagactgggg cccgcacggagcaaacgtggcaggctgcccacgctgccgcttctccgtcccgattgtggt ccaggagtctcggacacccgggaggtgaaatcatcgcacccggaacaatggggacttctg gccggagaaccggactga >gi568815596f:171985678_172188254|GENSCAN_predicted_peptide_4|186_aa MTNTAKAKSLFAMIKEKAGPDYDVEFTVCSGWFKQFQNCYPLHNVKAISATEEDTEKTLM QIWEDYNIYDCIKKNLAWAWGDVTKECMDGIWKKRLKRGSTARLSMKAERDEEAAEEMLE ASRGWFMSFKERSCLHNIKVQGEVASADVEATASYSEDLAKTIDEGDCMKQKLFHVDETA LEEDAI >gi568815596f:171985678_172188254|GENSCAN_predicted_CDS_4|561_bp atgaccaacacagccaaagcaaaaagtttgtttgcaatgataaaagaaaaggctggacct gactacgatgttgaatttactgtttgctctgggtggtttaaacaattccagaattgttat ccattacataatgtgaaagctatttctgcaactgaagaagacactgagaaaacactgatg caaatctgggaggattacaacatctatgactgcattaagaagaatcttgcttgggcttgg ggtgatgtcaccaaggagtgtatggatggcatctggaagaagagactcaagaggggaagt acagcacgcttgtctatgaaggctgagagagatgaggaagctgcagaagaaatgttggaa gccagcagaggttggttcatgagctttaaggaaagaagctgtctccataacataaaagtg caaggtgaagtggcaagtgctgatgtagaagctacagcaagttattcagaagatctagct aagaccatcgatgaaggtgactgcatgaaacaaaagcttttccatgtagatgaaacagcc ttggaagaagatgccatctag >gi568815596f:171985678_172188254|GENSCAN_predicted_peptide_5|98_aa MEKDEKFKMLNLSRREVNTMQDEVVSSWKPEGAQAESIPATDKNELQTPRPGPWHPGAPS RAMEEAAPSKAFPCLETPVKSKAWAVEEGSCEQQAQKT >gi568815596f:171985678_172188254|GENSCAN_predicted_CDS_5|297_bp atggaaaaagatgaaaagtttaagatgctgaacctgagtcgacgagaggtcaacactatg caggatgaggtggtcagctcatggaagcccgagggagcccaggcagaaagcatcccagcc acggacaaaaatgagctccagacacccaggcctgggccatggcacccaggcgcaccttcc agggctatggaggaggctgccccatcaaaagcatttccatgtctggaaacacctgtgaaa tctaaggcgtgggcagtggaagagggcagctgtgagcagcaggcacagaagacctga >gi568815596f:171985678_172188254|GENSCAN_predicted_peptide_6|278_aa MHSQAGWLLQWDRQLQVPAQVPAHCEAAAGLRAPQAAFIAGTEEHSGALKLGDSRNCRAP KGESQPWLGELPGLSHPTEGCSASLLLSSFLLAPHNVQRDWQHLPRPRDLWNFELEKDDL GYLAEAISKQQSIQEPIDLVPCVPAAPAMAERGQCRAHAVASEGGSPKPWQLPCGVESVG TQKSRIEVWEPPPRFQKMYGNAWLSRQKFAAGVRPSWRTSARAVWKGNVGLEPPHRVPTG ALPIGAEEVHHPPEPRMVDPQTACTVHLKKLQTLNASP >gi568815596f:171985678_172188254|GENSCAN_predicted_CDS_6|837_bp atgcacagtcaggcaggctggctgctgcagtgggacaggcagctccaggtgccggcacag gtgccagctcactgtgaggctgcagctggactaagggcaccacaagcagctttcatagct ggcactgaggaacacagtggtgccttgaagcttggagactccaggaactgcagggcccca aaaggggagtcacagccctggcttggggagctcccaggtctgtcccaccccactgagggc tgcagtgcttctctccttctctcttctttcctccttgctccccacaacgtgcaaagagac tggcagcatttgccccgccctagagatttgtggaactttgaacttgagaaagatgattta gggtatctggcagaagccatttctaagcagcaaagcattcaagagcctatagacttggtg ccctgtgtcccagctgctccagccatggctgaaaggggccaatgtagagctcatgctgtg gcttcagagggtggaagccccaagccttggcagcttccatgtggtgttgagtctgtgggt acacagaagtcaagaattgaggtttgggaacctccacctagatttcagaagatgtatgga aatgcctggctgtccaggcaaaagtttgctgcaggggtgaggccctcatggagaacctct gctagggcagtgtggaagggaaatgtggggttggaacctccacacagagtccctactggg gcactgcctattggagctgaagaggtccaccatcctccagaacccagaatggtagatcca cagacagcttgcaccgtgcacctgaaaaagctgcagacactcaatgctagcccgtga >gi568815596f:171985678_172188254|GENSCAN_predicted_peptide_7|76_aa GGLRVGQSQDQTLALTQPWGNVAPSGKSLMAASAASTINGQCLAQLRATGPLVMPGLLQR GEKKKEECIVKCLKVL >gi568815596f:171985678_172188254|GENSCAN_predicted_CDS_7|231_bp ggagggctgagggtggggcagtcacaggatcagactctggctctaactcagccttgggga aatgttgctccttctgggaaatccctcatggctgcatctgctgcatctaccatcaatggg caatgcctggcccaactcagagccacaggtcctctggtcatgccaggtttgctgcagaga ggagagaagaaaaaggaggaatgcattgtaaagtgccttaaagttctctga