GENSCAN 1.0	Date run: 3-Nov-116	Time: 23:24:06

Sequence gi568815596f:158903845_159104319 : 200475 bp : 41.86% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3663   3731   69  2  0   91  105    73 0.920  10.40
 1.02 Intr +   9516   9677  162  2  0   67  -10   156 0.158   2.95
 1.03 Intr +  17959  18051   93  0  0   67   89    34 0.000   0.64
 1.04 Term +  28119  28235  117  2  0   -1   41   161 0.000   0.06
 1.05 PlyA +  29828  29833    6                               1.05

 2.04 PlyA -  30544  30539    6                               1.05
 2.03 Term -  31548  31364  185  1  2   74   44   153 0.128   6.22
 2.02 Intr -  40461  40308  154  0  1   50   70   143 0.643   7.42
 2.01 Init -  59942  59886   57  2  0   62   68   106 0.905   7.36
 2.00 Prom -  60003  59964   40                              -6.65

 3.00 Prom +  60253  60292   40                             -10.55
 3.01 Init +  61485  61545   61  2  1   88   63    23 0.880   1.42
 3.02 Intr +  64059  64215  157  1  1   94   97   152 0.945  14.85
 3.03 Intr +  65612  65922  311  2  2   19   75   236 0.151  10.43
 3.04 Intr +  84949  85138  190  2  1   72   86    66 0.073   2.52
 3.05 Term +  91668  91734   67  0  1   71   44   110 0.330   1.33
 3.06 PlyA +  92578  92583    6                               1.05

 4.00 Prom +  95544  95583   40                              -4.05
 4.01 Sngl +  98127  98498  372  2  0   31   47   295 0.735  15.58
 4.02 PlyA +  98682  98687    6                               1.05

 5.00 Prom +  99016  99055   40                              -7.65
 5.01 Sngl + 100176 100478  303  2  0   77   42   382 0.808  28.08
 5.02 PlyA + 100857 100862    6                               1.05

 6.00 Prom + 107220 107259   40                              -5.55
 6.01 Init + 112324 112545  222  0  0   98  100    96 0.154  10.40
 6.02 Intr + 129890 130074  185  1  2   65   85    81 0.013   3.16
 6.03 Term + 131066 131210  145  2  1   56   55   127 0.470   2.60
 6.04 PlyA + 131249 131254    6                              -0.45

 7.02 PlyA - 131655 131650    6                               1.05
 7.01 Sngl - 133636 133208  429  1  0   66   42   236 0.902  12.83
 7.00 Prom - 135377 135338   40                              -9.95

 8.02 PlyA - 135402 135397    6                               1.05
 8.01 Sngl - 136765 136310  456  1  0   58   44   261 0.203  14.64
 8.00 Prom - 139799 139760   40                              -6.35

 9.00 Prom + 143869 143908   40                              -4.55
 9.01 Init + 145876 146010  135  0  0   71   87    52 0.058   3.59
 9.02 Intr + 162052 162216  165  0  0  125   14    91 0.254   4.64
 9.03 Intr + 170293 170376   84  0  0  128   93    10 0.042   4.60
 9.04 Intr + 183851 183910   60  1  0   73   83    49 0.085   0.91
 9.05 Intr + 185907 186102  196  2  1   28   33   150 0.063   1.57
 9.06 Term + 186895 187013  119  2  2   52   41   141 0.323   3.52
 9.07 PlyA + 187556 187561    6                               1.05

10.02 PlyA - 187574 187569    6                               1.05
10.01 Term - 190858 190646  213  1  0   31   55   187 0.079   5.95

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_1|146_aa
MEKSRPPETYFGRITGRFADGLETWTQIGTYAVICLASQAFGLRLELHHQLSLVTSLLTG
DLGASQLLYSHEPMPYNDPMTTATAYSACSERGCMSKQAQGLATAHSQNDGCGWRFMSPL
RAMAREELAKERCKIRFTAAGEESTK
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_1|441_bp
atggagaaaagcaggccaccagaaacatattttggacggataactggaagatttgctgat
ggactggagacttggactcagattggaacttacgctgtcatctgtctggcttctcaggcc
tttggactcagactggaactccaccatcagctctccttggtcaccagcttattgactgga
gatcttggggcttctcagcttctctattcgcatgaaccaatgccttataacgatcccatg
accactgcaactgcgtactcagcctgcagcgagagggggtgcatgagcaagcaagcacag
ggcctggccacagcacacagtcagaatgatggttgtggctggcggtttatgtcccctcta
agagccatggccagagaggaacttgcaaaggaaagatgcaagatcaggtttacagctgct
ggtgaagagtccactaaataa
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_2|131_aa
MRNVDIVDDEENQLDKIEEPCSLAALESVTTSGKISCGIGKGPGDTIKANPVRGPFRATY
LAAKWLWAWQEESSKMTVPLILSSKPRVVLSEKGGEDGYARADIAPILALAGLAGTGTVG
QENRELEIPCF
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_2|396_bp
atgaggaatgttgatattgtggatgatgaagaaaatcaacttgataaaattgaagagcct
tgtagcttggctgcccttgagtcagttaccacctctggcaaaatcagctgtggaataggg
aagggaccaggagataccatcaaagcaaatcctgtccgaggaccattcagggctacctac
ctagctgccaagtggctgtgggcgtggcaagaggagtcaagtaaaatgaccgtgcctctc
attctgtcttccaaacctagggtggtcctctctgagaaaggaggagaagatgggtatgca
agagcagatatagctcccatccttgcccttgctggactggcaggtactgggacagtgggg
caggagaacagagaacttgaaattccctgcttttga
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_3|261_aa
MVLASVYGHGSLHTLGLWCREARRWPSVNVAQTARASPERSQEAPDPRPQGRPGNHRPQN
VPHSFWEIKRRSRGGALRRAVIVKIVSPLVLGCRVVIFGGRYQGTGFLEDFGVRARSAVG
VRGSRLWRGGFPERLAGKEALGIVWCVCRQKGPWTGVLKTYWTYCVKLREEMWRWPGEAL
YLPPGLMAGRLDPCHLLRCCPPHPLLAHSHYRLPNRHFLGRLYLTKIGHFVDPLGVLVQS
TLPEGGTLGNFKLRLPMAVIA
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_3|786_bp
atggtgttggcatccgtgtatggccatggatcgttacacacgctgggactgtggtgtagg
gaagcacgacgctggccctctgtgaatgtagcgcagacagcacgggccagcccggagcgg
tcgcaggaggctccagacccgcggccccagggacgcccgggaaatcaccggccccagaac
gtcccccattccttttgggaaataaaaagacgcagccggggtggggcgctgagaagagcg
gtaattgtcaagattgtgagcccccttgttcttggatgcagagtggtcattttcggggga
cgatatcagggcactggcttcctggaggactttggggtgcgggcgcgcagcgctgtgggt
gtgcggggctcccggctttggcgcggcggctttccggaacgcctggctgggaaggaagct
ctaggcattgtgtggtgtgtttgcagacagaaaggtccctggaccggtgttctcaaaact
tactggacttattgtgtaaagctccgggaggaaatgtggcgttggccaggtgaggctctg
tacctacctccaggtctcatggcaggtcgactggacccctgccacctcctgcggtgctgc
cctcctcacccactccttgcccactctcactatcggcttcctaataggcatttcctgggg
cgactgtatctgactaaaattggtcattttgtagatcctttaggggtcctggtccaaagc
acattacccgagggcgggactcttggaaacttcaaactgaggctccccatggctgtcatt
gcttaa
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_4|123_aa
MRSRMDRTILQCPDWACAVMELAKSPSDTNLFSERGWRLWRSGRAQRGVCQCDPKKVLSA
EDSSQTGTCGRRGWRYGQCGHEFSLGASSHLPPECPFFLLACPRGPRLPIEPEVSKLIVN
LGA
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_4|372_bp
atgagaagtcggatggacaggaccatcctccagtgtccagactgggcatgtgctgtcatg
gaactggcaaaaagtccaagtgacaccaacttgttctcagaaaggggctggaggttatgg
agaagtgggagagcccagcgtggggtgtgccagtgtgatccaaagaaagtcttgtcagca
gaggattcttcccagactgggacctgcggccggaggggctggcggtatggacagtgtgga
catgagttcagcctgggtgcctcttctcatcttcctcctgaatgccctttcttcctgctg
gcctgcccgaggggacccaggctgcccattgagcctgaagtcagcaagctgatcgtgaat
ttaggtgcatga
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_5|100_aa
MIKDDGTVIHFNNPKVQASLSANTFAITGHAEAKPITEMLPGILSQLGADSLTSLRKLAE
QFPRQVLDSKAPKPEDIDEEDDDVPDLVENFDEASKNEAN
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_5|303_bp
atgattaaagatgatgggacagttattcatttcaacaatcccaaagtccaagcttccctt
tctgctaatacctttgcaattactggtcatgcagaagccaaaccaatcacagaaatgctt
cctggaatattaagtcagcttggtgctgacagtttaacaagccttaggaagttagctgaa
cagttcccacggcaagtcttggacagtaaagcaccaaaaccagaagacattgatgaggaa
gatgatgatgttccagatcttgtagaaaattttgatgaggcatcaaagaatgaagctaac
taa
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_6|183_aa
MAEGKKEQVPSYMDCSRQRENEEDAKAETPNKTIRSHETYSLPQEQYGGTTPMIQLSPTE
SLPQHVGIMGVQFKASPQVLSYPASVTSENIMMLMKSRSHSTCMCASWFYTKESSVLQAL
WLLVVILECVVLCQTCLSGPHIVVKAVFNAAEPFTAGGFGPSHGIDGDMLWSMNTYSSLC
LVV
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_6|552_bp
atggcggaaggtaagaaggaacaagtcccatcttatatggattgcagcaggcaaagagaa
aatgaggaagatgcaaaagcagaaacccctaataaaacaatcagatctcatgagacttat
tcactaccacaagaacagtatgggggaacgacccccatgattcaattatctcccactgag
tctctcccacaacacgtgggaattatgggagtacaattcaaggcttcaccccaggttctt
tcttatccagcctctgtcacatctgaaaacatcatgatgctcatgaaatcaaggtcacat
tcaacctgcatgtgcgccagctggttttataccaaggaaagctctgttcttcaggcactg
tggctcttagtagttatccttgaatgcgttgtgctttgtcaaacttgcctctctgggcct
cacattgtagtcaaagcagtgttcaatgctgcagaaccctttacagctggaggatttgga
ccgtcacatggaatcgatggagacatgctttggtccatgaacacatacagtagcctttgt
cttgtggtctga
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_7|142_aa
MKALEENLGNTIPDIGTGKDFMTKTILATKAKIDKWDLIKLKSFCTAKETTIRVNWQPTE
WEKIFAIYPSDKELIPRIYKELKQIFKKKSNNPIKKWAKEMNRHFSKEDIYAANRHMKKC
SSSTGHQRNANQNHNEIPSHTS
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_7|429_bp
atgaaagccctagaagaaaacctaggcaataccattccggacataggcacgggcaaagac
ttcatgactaaaacaattttggcaacaaaagccaaaattgacaaatgggatctaattaaa
ctaaagagcttctgcacagcaaaagaaactaccatcagagtgaactggcaacctacagaa
tgggagaaaatttttgcaatctacccatctgacaaagagctaatacccagaatctacaaa
gaacttaagcaaattttcaagaaaaaatcaaacaaccccatcaaaaagtgggcaaaggaa
atgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaaaaatgc
tcatcatccactggtcatcagagaaatgcaaatcaaaaccacaatgagataccatctcac
accagttag
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_8|151_aa
MELKIMARELHDACTSFSSQFDQVEERESVIEDQMNEMKREEKFRERRVKRNEQSLQEIW
DYVKRPNLHLIGIPESDGDNVTKLKNTLQDIIQENFPNLARHANIQIQEIQRTPQRYSLR
RATPRLKNKGMEEDLPSKWKAKKSRGCNPSL
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_8|456_bp
atggagctgaaaatcatggcacgagaactacatgatgcatgcacaagcttcagtagccaa
tttgatcaagtggaagaaagggaatcagtgattgaagatcaaatgaatgaaatgaagcga
gaagagaagtttagagaaagaagagtaaaaagaaacgaacaaagcctccaagaaatatgg
gactatgtgaaaagaccaaatctacatttgattggtatacctgaaagtgatggggacaat
gtaaccaagctgaaaaacactcttcaggatattatccaggagaacttccccaacctagca
aggcacgccaacattcaaattcaggaaatacagagaacaccacaaagatactccttgaga
agagcaacccccaggctcaaaaataaagggatggaggaagatctaccaagcaaatggaaa
gcaaaaaaaagcaggggttgcaatcctagtctctga
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_9|252_aa
MRKRRGFESRELKRMVLYQQEPGPRLVSGFWKWGEEDGFSFGHVEKQVLKMLKAVLKKSR
EGGKGGKKEAGMCARMRSSAMDSGQQAAPHPRPDGFVALGGQASYTSSAMKIPLFLSYVL
VTTVQWVKAFGVGREGTRVVGEIVFTKKDCGRLSTAGIQPSEQGTGTLLPVMPVSTVVSC
KQACLRAVPTHMSLRAHALWNRASFHPLAKLGLSRMTGASATVLQKQQAQMLQELINVQY
PRRGEETRVLDD
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_9|759_bp
atgagaaagaggaggggctttgagtctcgtgaactaaaaaggatggtgctttatcagcag
gaacctggacccagactagtgagtgggttttggaagtggggagaggaagatgggtttagt
tttggacatgttgagaaacaagtgttgaaaatgttaaaggctgtgctgaagaagagccga
gagggaggaaagggaggcaagaaggaagcaggtatgtgtgcaagaatgagaagctcagcc
atggacagcgggcagcaagctgctcctcaccccaggccggatggatttgttgctttggga
gggcaggctagttatacctcatcagccatgaaaataccattgttcttgtcctatgttctg
gtgaccacagtgcagtgggtaaaggctttcggtgttggacgggaggggacaagggttgtg
ggtgaaattgtgttcaccaaaaaggactgtggcaggctcagcactgctggcatccaacct
tcagagcagggcactggcactcttcttcctgttatgcccgtgtccactgtagtcagctgc
aaacaagcttgcctccgagcagtgcctacacacatgtcacttcgagcacatgctttgtgg
aatagagcttcatttcacccacttgccaagttaggccttagcagaatgaccggagccagc
gccactgtgctgcagaagcagcaggcccaaatgctgcaagagttaataaatgtccagtac
ccacggagaggagaggaaactagggtgcttgatgactaa
>gi568815596f:158903845_159104319|GENSCAN_predicted_peptide_10|70_aa
NHRGQSYSTVHESGIFEKPPRGFRPGHEFSQLLSLQLSGEETDLKAELGCTMMPARSGAG
CPLTRQTCKS
>gi568815596f:158903845_159104319|GENSCAN_predicted_CDS_10|213_bp
aaccacaggggccagtcttatagcactgtgcatgagtctggaatctttgagaaaccacct
cgaggctttcggccaggtcacgagttttcccagctgctttcccttcaactgtctggcgag
gaaacagatcttaaggcagagctaggctgcaccatgatgccagcgcgcagtggagccgga
tgccctctgacccgacaaacttgtaaaagctga